CN113817770B - Construction method and application of CD73 gene humanized non-human animal - Google Patents
Construction method and application of CD73 gene humanized non-human animal Download PDFInfo
- Publication number
- CN113817770B CN113817770B CN202110933098.6A CN202110933098A CN113817770B CN 113817770 B CN113817770 B CN 113817770B CN 202110933098 A CN202110933098 A CN 202110933098A CN 113817770 B CN113817770 B CN 113817770B
- Authority
- CN
- China
- Prior art keywords
- gene
- human
- seq
- humanized
- sequence
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 101000678236 Homo sapiens 5'-nucleotidase Proteins 0.000 title claims abstract description 355
- 238000010276 construction Methods 0.000 title claims abstract description 35
- 125000003729 nucleotide group Chemical group 0.000 claims description 148
- 239000002773 nucleotide Substances 0.000 claims description 145
- 102000045309 human NT5E Human genes 0.000 claims description 58
- 238000000034 method Methods 0.000 claims description 54
- 230000008685 targeting Effects 0.000 claims description 51
- 239000013598 vector Substances 0.000 claims description 39
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 24
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 15
- 102100039498 Cytotoxic T-lymphocyte protein 4 Human genes 0.000 claims description 11
- 101710089372 Programmed cell death protein 1 Proteins 0.000 claims description 10
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 9
- 238000001727 in vivo Methods 0.000 claims description 9
- 108020004999 messenger RNA Proteins 0.000 claims description 8
- 108010021064 CTLA-4 Antigen Proteins 0.000 claims description 7
- 229940045513 CTLA4 antagonist Drugs 0.000 claims description 7
- 108010074708 B7-H1 Antigen Proteins 0.000 claims description 6
- 102100024216 Programmed cell death 1 ligand 1 Human genes 0.000 claims description 6
- 238000012239 gene modification Methods 0.000 claims description 6
- 229920001184 polypeptide Polymers 0.000 claims description 6
- 102000004196 processed proteins & peptides Human genes 0.000 claims description 6
- 102100029822 B- and T-lymphocyte attenuator Human genes 0.000 claims description 5
- 102100027207 CD27 antigen Human genes 0.000 claims description 5
- 101150013553 CD40 gene Proteins 0.000 claims description 5
- 102100032937 CD40 ligand Human genes 0.000 claims description 5
- 102100034458 Hepatitis A virus cellular receptor 2 Human genes 0.000 claims description 5
- 101710083479 Hepatitis A virus cellular receptor 2 homolog Proteins 0.000 claims description 5
- 101000864344 Homo sapiens B- and T-lymphocyte attenuator Proteins 0.000 claims description 5
- 101000914511 Homo sapiens CD27 antigen Proteins 0.000 claims description 5
- 101000868215 Homo sapiens CD40 ligand Proteins 0.000 claims description 5
- 101000868279 Homo sapiens Leukocyte surface antigen CD47 Proteins 0.000 claims description 5
- 101000831007 Homo sapiens T-cell immunoreceptor with Ig and ITIM domains Proteins 0.000 claims description 5
- 101000914514 Homo sapiens T-cell-specific surface glycoprotein CD28 Proteins 0.000 claims description 5
- 101000801234 Homo sapiens Tumor necrosis factor receptor superfamily member 18 Proteins 0.000 claims description 5
- 101000851370 Homo sapiens Tumor necrosis factor receptor superfamily member 9 Proteins 0.000 claims description 5
- 101000863873 Homo sapiens Tyrosine-protein phosphatase non-receptor type substrate 1 Proteins 0.000 claims description 5
- 102000017578 LAG3 Human genes 0.000 claims description 5
- 101150030213 Lag3 gene Proteins 0.000 claims description 5
- 102100032913 Leukocyte surface antigen CD47 Human genes 0.000 claims description 5
- 229940126547 T-cell immunoglobulin mucin-3 Drugs 0.000 claims description 5
- 102100024834 T-cell immunoreceptor with Ig and ITIM domains Human genes 0.000 claims description 5
- 102100027213 T-cell-specific surface glycoprotein CD28 Human genes 0.000 claims description 5
- 101710165473 Tumor necrosis factor receptor superfamily member 4 Proteins 0.000 claims description 5
- 102100022153 Tumor necrosis factor receptor superfamily member 4 Human genes 0.000 claims description 5
- 102100040245 Tumor necrosis factor receptor superfamily member 5 Human genes 0.000 claims description 5
- 102100036856 Tumor necrosis factor receptor superfamily member 9 Human genes 0.000 claims description 5
- 102100029948 Tyrosine-protein phosphatase non-receptor type substrate 1 Human genes 0.000 claims description 5
- 230000005017 genetic modification Effects 0.000 claims description 5
- 235000013617 genetically modified food Nutrition 0.000 claims description 5
- 150000007523 nucleic acids Chemical group 0.000 claims description 4
- 102100023990 60S ribosomal protein L17 Human genes 0.000 claims description 3
- 108091036066 Three prime untranslated region Proteins 0.000 claims description 3
- 102100022464 5'-nucleotidase Human genes 0.000 abstract description 111
- 241000699670 Mus sp. Species 0.000 abstract description 94
- 239000003814 drug Substances 0.000 abstract description 37
- 241000283984 Rodentia Species 0.000 abstract description 25
- 238000010171 animal model Methods 0.000 abstract description 9
- 108020004414 DNA Proteins 0.000 description 82
- 241000699666 Mus <mouse, genus> Species 0.000 description 64
- 108090000623 proteins and genes Proteins 0.000 description 62
- 210000004027 cell Anatomy 0.000 description 42
- 206010028980 Neoplasm Diseases 0.000 description 41
- 108700024394 Exon Proteins 0.000 description 36
- 238000011577 humanized mouse model Methods 0.000 description 31
- 108091027544 Subgenomic mRNA Proteins 0.000 description 29
- 238000001514 detection method Methods 0.000 description 29
- 230000005030 transcription termination Effects 0.000 description 28
- 229940079593 drug Drugs 0.000 description 24
- 241001465754 Metazoa Species 0.000 description 22
- 210000001744 T-lymphocyte Anatomy 0.000 description 22
- OIRDTQYFTABQOQ-KQYNXXCUSA-N adenosine Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O OIRDTQYFTABQOQ-KQYNXXCUSA-N 0.000 description 22
- 241000700159 Rattus Species 0.000 description 21
- 230000001105 regulatory effect Effects 0.000 description 19
- 238000003780 insertion Methods 0.000 description 18
- 230000037431 insertion Effects 0.000 description 18
- 230000014509 gene expression Effects 0.000 description 17
- 108091092195 Intron Proteins 0.000 description 16
- 239000003550 marker Substances 0.000 description 15
- 239000000523 sample Substances 0.000 description 15
- 238000012163 sequencing technique Methods 0.000 description 15
- 241001529936 Murinae Species 0.000 description 14
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 14
- 230000000694 effects Effects 0.000 description 14
- 239000012634 fragment Substances 0.000 description 14
- 238000012216 screening Methods 0.000 description 14
- 238000012217 deletion Methods 0.000 description 13
- 230000037430 deletion Effects 0.000 description 13
- 210000000265 leukocyte Anatomy 0.000 description 13
- 238000005215 recombination Methods 0.000 description 13
- 230000006798 recombination Effects 0.000 description 13
- 238000011160 research Methods 0.000 description 13
- 201000010099 disease Diseases 0.000 description 12
- 238000006467 substitution reaction Methods 0.000 description 12
- 210000001519 tissue Anatomy 0.000 description 12
- 239000002126 C01EB10 - Adenosine Substances 0.000 description 11
- 238000011740 C57BL/6 mouse Methods 0.000 description 11
- 108020004705 Codon Proteins 0.000 description 11
- 206010061218 Inflammation Diseases 0.000 description 11
- 229960005305 adenosine Drugs 0.000 description 11
- 230000004054 inflammatory process Effects 0.000 description 11
- 238000011282 treatment Methods 0.000 description 11
- 108091081024 Start codon Proteins 0.000 description 10
- 102000004169 proteins and genes Human genes 0.000 description 10
- 238000011144 upstream manufacturing Methods 0.000 description 9
- 238000012795 verification Methods 0.000 description 9
- 102000053602 DNA Human genes 0.000 description 8
- 238000002105 Southern blotting Methods 0.000 description 8
- 238000010362 genome editing Methods 0.000 description 8
- 210000005259 peripheral blood Anatomy 0.000 description 8
- 239000011886 peripheral blood Substances 0.000 description 8
- 230000001225 therapeutic effect Effects 0.000 description 8
- 108091026890 Coding region Proteins 0.000 description 7
- 210000001671 embryonic stem cell Anatomy 0.000 description 7
- 238000002474 experimental method Methods 0.000 description 7
- 210000004881 tumor cell Anatomy 0.000 description 7
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 7
- 238000011816 wild-type C57Bl6 mouse Methods 0.000 description 7
- 108020005345 3' Untranslated Regions Proteins 0.000 description 6
- 210000001266 CD8-positive T-lymphocyte Anatomy 0.000 description 6
- 241000282693 Cercopithecidae Species 0.000 description 6
- 241000283973 Oryctolagus cuniculus Species 0.000 description 6
- 210000004369 blood Anatomy 0.000 description 6
- 239000008280 blood Substances 0.000 description 6
- 238000013373 clone screening Methods 0.000 description 6
- DDRJAANPRJIHGJ-UHFFFAOYSA-N creatinine Chemical compound CN1CC(=O)NC1=N DDRJAANPRJIHGJ-UHFFFAOYSA-N 0.000 description 6
- 238000011161 development Methods 0.000 description 6
- 230000018109 developmental process Effects 0.000 description 6
- 235000013601 eggs Nutrition 0.000 description 6
- 238000001976 enzyme digestion Methods 0.000 description 6
- 210000003743 erythrocyte Anatomy 0.000 description 6
- 238000011156 evaluation Methods 0.000 description 6
- 238000004519 manufacturing process Methods 0.000 description 6
- 230000013011 mating Effects 0.000 description 6
- 230000007246 mechanism Effects 0.000 description 6
- 210000000952 spleen Anatomy 0.000 description 6
- 108700026220 vif Genes Proteins 0.000 description 6
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 5
- 102000004190 Enzymes Human genes 0.000 description 5
- 108090000790 Enzymes Proteins 0.000 description 5
- -1 Glucosyl Phosphatidylinositol Chemical class 0.000 description 5
- 150000001413 amino acids Chemical class 0.000 description 5
- 238000005516 engineering process Methods 0.000 description 5
- 230000006870 function Effects 0.000 description 5
- 210000001165 lymph node Anatomy 0.000 description 5
- 210000001616 monocyte Anatomy 0.000 description 5
- 210000003101 oviduct Anatomy 0.000 description 5
- 239000013612 plasmid Substances 0.000 description 5
- 230000008569 process Effects 0.000 description 5
- 108091033409 CRISPR Proteins 0.000 description 4
- 101000889276 Homo sapiens Cytotoxic T-lymphocyte protein 4 Proteins 0.000 description 4
- 206010021143 Hypoxia Diseases 0.000 description 4
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 4
- 108091034117 Oligonucleotide Proteins 0.000 description 4
- 238000010459 TALEN Methods 0.000 description 4
- 108010017070 Zinc Finger Nucleases Proteins 0.000 description 4
- UDMBCSSLTHHNCD-KQYNXXCUSA-N adenosine 5'-monophosphate Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](COP(O)(O)=O)[C@@H](O)[C@H]1O UDMBCSSLTHHNCD-KQYNXXCUSA-N 0.000 description 4
- 210000000625 blastula Anatomy 0.000 description 4
- 238000004113 cell culture Methods 0.000 description 4
- 210000000349 chromosome Anatomy 0.000 description 4
- 238000007796 conventional method Methods 0.000 description 4
- 238000010586 diagram Methods 0.000 description 4
- MHMNJMPURVTYEJ-UHFFFAOYSA-N fluorescein-5-isothiocyanate Chemical compound O1C(=O)C2=CC(N=C=S)=CC=C2C21C1=CC=C(O)C=C1OC1=CC(O)=CC=C21 MHMNJMPURVTYEJ-UHFFFAOYSA-N 0.000 description 4
- 238000010363 gene targeting Methods 0.000 description 4
- 210000003714 granulocyte Anatomy 0.000 description 4
- 230000007954 hypoxia Effects 0.000 description 4
- 230000005934 immune activation Effects 0.000 description 4
- 238000009169 immunotherapy Methods 0.000 description 4
- 108010034529 leucyl-lysine Proteins 0.000 description 4
- 210000002540 macrophage Anatomy 0.000 description 4
- 230000004048 modification Effects 0.000 description 4
- 238000012986 modification Methods 0.000 description 4
- 230000002829 reductive effect Effects 0.000 description 4
- 210000003289 regulatory T cell Anatomy 0.000 description 4
- 210000001082 somatic cell Anatomy 0.000 description 4
- 102100024222 B-lymphocyte antigen CD19 Human genes 0.000 description 3
- UDMBCSSLTHHNCD-UHFFFAOYSA-N Coenzym Q(11) Natural products C1=NC=2C(N)=NC=NC=2N1C1OC(COP(O)(O)=O)C(O)C1O UDMBCSSLTHHNCD-UHFFFAOYSA-N 0.000 description 3
- 101100193633 Danio rerio rag2 gene Proteins 0.000 description 3
- 102000016607 Diphtheria Toxin Human genes 0.000 description 3
- 108010053187 Diphtheria Toxin Proteins 0.000 description 3
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 3
- 102000001554 Hemoglobins Human genes 0.000 description 3
- 108010054147 Hemoglobins Proteins 0.000 description 3
- 101000980825 Homo sapiens B-lymphocyte antigen CD19 Proteins 0.000 description 3
- 108010025815 Kanamycin Kinase Proteins 0.000 description 3
- 241000124008 Mammalia Species 0.000 description 3
- 241000699660 Mus musculus Species 0.000 description 3
- 101100193635 Mus musculus Rag2 gene Proteins 0.000 description 3
- 238000011579 SCID mouse model Methods 0.000 description 3
- 206010039491 Sarcoma Diseases 0.000 description 3
- 241000282887 Suidae Species 0.000 description 3
- 108091093126 WHP Posttrascriptional Response Element Proteins 0.000 description 3
- LNQVTSROQXJCDD-UHFFFAOYSA-N adenosine monophosphate Natural products C1=NC=2C(N)=NC=NC=2N1C1OC(CO)C(OP(O)(O)=O)C1O LNQVTSROQXJCDD-UHFFFAOYSA-N 0.000 description 3
- 125000000539 amino acid group Chemical group 0.000 description 3
- 210000004102 animal cell Anatomy 0.000 description 3
- 108010038633 aspartylglutamate Proteins 0.000 description 3
- 238000003556 assay Methods 0.000 description 3
- 210000003719 b-lymphocyte Anatomy 0.000 description 3
- 230000015572 biosynthetic process Effects 0.000 description 3
- 230000037396 body weight Effects 0.000 description 3
- 210000004899 c-terminal region Anatomy 0.000 description 3
- 201000011510 cancer Diseases 0.000 description 3
- 238000003776 cleavage reaction Methods 0.000 description 3
- 239000002299 complementary DNA Substances 0.000 description 3
- 229940109239 creatinine Drugs 0.000 description 3
- 230000000857 drug effect Effects 0.000 description 3
- 238000003205 genotyping method Methods 0.000 description 3
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Natural products NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 3
- 108010050848 glycylleucine Proteins 0.000 description 3
- 239000001963 growth medium Substances 0.000 description 3
- 210000000987 immune system Anatomy 0.000 description 3
- 238000000338 in vitro Methods 0.000 description 3
- 208000032839 leukemia Diseases 0.000 description 3
- 210000001161 mammalian embryo Anatomy 0.000 description 3
- 238000000520 microinjection Methods 0.000 description 3
- 210000000822 natural killer cell Anatomy 0.000 description 3
- 238000011580 nude mouse model Methods 0.000 description 3
- 210000000056 organ Anatomy 0.000 description 3
- 230000003234 polygenic effect Effects 0.000 description 3
- 230000002441 reversible effect Effects 0.000 description 3
- 230000007017 scission Effects 0.000 description 3
- 208000024891 symptom Diseases 0.000 description 3
- 238000012360 testing method Methods 0.000 description 3
- 238000002560 therapeutic procedure Methods 0.000 description 3
- 230000035897 transcription Effects 0.000 description 3
- 238000013518 transcription Methods 0.000 description 3
- 230000007704 transition Effects 0.000 description 3
- 230000004614 tumor growth Effects 0.000 description 3
- LKDHUGLXOHYINY-XUXIUFHCSA-N Arg-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N LKDHUGLXOHYINY-XUXIUFHCSA-N 0.000 description 2
- GKWFMNNNYZHJHV-SRVKXCTJSA-N Asp-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC(O)=O GKWFMNNNYZHJHV-SRVKXCTJSA-N 0.000 description 2
- 208000023275 Autoimmune disease Diseases 0.000 description 2
- 206010006187 Breast cancer Diseases 0.000 description 2
- 208000026310 Breast neoplasm Diseases 0.000 description 2
- 238000010453 CRISPR/Cas method Methods 0.000 description 2
- 206010009944 Colon cancer Diseases 0.000 description 2
- 241000699800 Cricetinae Species 0.000 description 2
- 206010059866 Drug resistance Diseases 0.000 description 2
- 108010042407 Endonucleases Proteins 0.000 description 2
- 102000004533 Endonucleases Human genes 0.000 description 2
- 240000006890 Erythroxylum coca Species 0.000 description 2
- 208000000461 Esophageal Neoplasms Diseases 0.000 description 2
- 206010015548 Euthanasia Diseases 0.000 description 2
- 241000699694 Gerbillinae Species 0.000 description 2
- LHIPZASLKPYDPI-AVGNSLFASA-N Glu-Phe-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O LHIPZASLKPYDPI-AVGNSLFASA-N 0.000 description 2
- SYWCGQOIIARSIX-SRVKXCTJSA-N Glu-Pro-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O SYWCGQOIIARSIX-SRVKXCTJSA-N 0.000 description 2
- XRTDOIOIBMAXCT-NKWVEPMBSA-N Gly-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)CN)C(=O)O XRTDOIOIBMAXCT-NKWVEPMBSA-N 0.000 description 2
- UFPXDFOYHVEIPI-BYPYZUCNSA-N Gly-Gly-Asp Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O UFPXDFOYHVEIPI-BYPYZUCNSA-N 0.000 description 2
- XMPXVJIDADUOQB-RCOVLWMOSA-N Gly-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C([O-])=O)NC(=O)CNC(=O)C[NH3+] XMPXVJIDADUOQB-RCOVLWMOSA-N 0.000 description 2
- LPCKHUXOGVNZRS-YUMQZZPRSA-N Gly-His-Ser Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O LPCKHUXOGVNZRS-YUMQZZPRSA-N 0.000 description 2
- YYXJFBMCOUSYSF-RYUDHWBXSA-N Gly-Phe-Gln Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O YYXJFBMCOUSYSF-RYUDHWBXSA-N 0.000 description 2
- 241000282412 Homo Species 0.000 description 2
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 2
- FJUKMPUELVROGK-IHRRRGAJSA-N Leu-Arg-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N FJUKMPUELVROGK-IHRRRGAJSA-N 0.000 description 2
- FOEHRHOBWFQSNW-KATARQTJSA-N Leu-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(C)C)N)O FOEHRHOBWFQSNW-KATARQTJSA-N 0.000 description 2
- CQGSYZCULZMEDE-UHFFFAOYSA-N Leu-Gln-Pro Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)N1CCCC1C(O)=O CQGSYZCULZMEDE-UHFFFAOYSA-N 0.000 description 2
- 206010025323 Lymphomas Diseases 0.000 description 2
- 101100290374 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) mcd-4 gene Proteins 0.000 description 2
- 206010030155 Oesophageal carcinoma Diseases 0.000 description 2
- 206010033128 Ovarian cancer Diseases 0.000 description 2
- 206010061535 Ovarian neoplasm Diseases 0.000 description 2
- 238000002944 PCR assay Methods 0.000 description 2
- HBGFEEQFVBWYJQ-KBPBESRZSA-N Phe-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 HBGFEEQFVBWYJQ-KBPBESRZSA-N 0.000 description 2
- HRNQLKCLPVKZNE-CIUDSAMLSA-N Ser-Ala-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O HRNQLKCLPVKZNE-CIUDSAMLSA-N 0.000 description 2
- GVMUJUPXFQFBBZ-GUBZILKMSA-N Ser-Lys-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GVMUJUPXFQFBBZ-GUBZILKMSA-N 0.000 description 2
- 241000121210 Sigmodontinae Species 0.000 description 2
- YXONONCLMLHWJX-SZMVWBNQSA-N Trp-Glu-Leu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O)=CNC2=C1 YXONONCLMLHWJX-SZMVWBNQSA-N 0.000 description 2
- 102100033728 Tumor necrosis factor receptor superfamily member 18 Human genes 0.000 description 2
- PMHLLBKTDHQMCY-ULQDDVLXSA-N Tyr-Lys-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PMHLLBKTDHQMCY-ULQDDVLXSA-N 0.000 description 2
- XSQUKJJJFZCRTK-UHFFFAOYSA-N Urea Chemical compound NC(N)=O XSQUKJJJFZCRTK-UHFFFAOYSA-N 0.000 description 2
- COYSIHFOCOMGCF-UHFFFAOYSA-N Val-Arg-Gly Natural products CC(C)C(N)C(=O)NC(C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-UHFFFAOYSA-N 0.000 description 2
- VVIZITNVZUAEMI-DLOVCJGASA-N Val-Val-Gln Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(N)=O VVIZITNVZUAEMI-DLOVCJGASA-N 0.000 description 2
- AEFJNECXZCODJM-UWVGGRQHSA-N Val-Val-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)NCC([O-])=O AEFJNECXZCODJM-UWVGGRQHSA-N 0.000 description 2
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 2
- 108010045350 alanyl-tyrosyl-alanine Proteins 0.000 description 2
- 108010047495 alanylglycine Proteins 0.000 description 2
- 108010070944 alanylhistidine Proteins 0.000 description 2
- 230000000259 anti-tumor effect Effects 0.000 description 2
- 108010077245 asparaginyl-proline Proteins 0.000 description 2
- 230000008827 biological function Effects 0.000 description 2
- 230000033228 biological regulation Effects 0.000 description 2
- 238000004820 blood count Methods 0.000 description 2
- 239000004202 carbamide Substances 0.000 description 2
- 230000008859 change Effects 0.000 description 2
- HVYWMOMLDIMFJA-DPAQBDIFSA-N cholesterol Chemical compound C1C=C2C[C@@H](O)CC[C@]2(C)[C@@H]2[C@@H]1[C@@H]1CC[C@H]([C@H](C)CCCC(C)C)[C@@]1(C)CC2 HVYWMOMLDIMFJA-DPAQBDIFSA-N 0.000 description 2
- 210000001728 clone cell Anatomy 0.000 description 2
- 208000029742 colonic neoplasm Diseases 0.000 description 2
- 210000004748 cultured cell Anatomy 0.000 description 2
- 238000012258 culturing Methods 0.000 description 2
- 238000005520 cutting process Methods 0.000 description 2
- 210000000805 cytoplasm Anatomy 0.000 description 2
- 230000001419 dependent effect Effects 0.000 description 2
- 108010009297 diglycyl-histidine Proteins 0.000 description 2
- 208000035475 disorder Diseases 0.000 description 2
- 238000009826 distribution Methods 0.000 description 2
- 238000009509 drug development Methods 0.000 description 2
- 238000004520 electroporation Methods 0.000 description 2
- 201000004101 esophageal cancer Diseases 0.000 description 2
- 239000013604 expression vector Substances 0.000 description 2
- 238000000684 flow cytometry Methods 0.000 description 2
- 230000002068 genetic effect Effects 0.000 description 2
- 239000008103 glucose Substances 0.000 description 2
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 2
- 108010072405 glycyl-aspartyl-glycine Proteins 0.000 description 2
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 2
- 108010089804 glycyl-threonine Proteins 0.000 description 2
- 238000005534 hematocrit Methods 0.000 description 2
- 108010018006 histidylserine Proteins 0.000 description 2
- 230000001900 immune effect Effects 0.000 description 2
- 230000001939 inductive effect Effects 0.000 description 2
- 230000002757 inflammatory effect Effects 0.000 description 2
- 230000005764 inhibitory process Effects 0.000 description 2
- 230000010354 integration Effects 0.000 description 2
- 230000016507 interphase Effects 0.000 description 2
- 108010060857 isoleucyl-valyl-tyrosine Proteins 0.000 description 2
- 108010000761 leucylarginine Proteins 0.000 description 2
- 108010057821 leucylproline Proteins 0.000 description 2
- 210000004185 liver Anatomy 0.000 description 2
- 210000004698 lymphocyte Anatomy 0.000 description 2
- 238000001823 molecular biology technique Methods 0.000 description 2
- 238000010369 molecular cloning Methods 0.000 description 2
- 239000000178 monomer Substances 0.000 description 2
- 239000002547 new drug Substances 0.000 description 2
- 208000002154 non-small cell lung carcinoma Diseases 0.000 description 2
- 108010070409 phenylalanyl-glycyl-glycine Proteins 0.000 description 2
- 239000013641 positive control Substances 0.000 description 2
- 238000002360 preparation method Methods 0.000 description 2
- 238000010839 reverse transcription Methods 0.000 description 2
- 238000003757 reverse transcription PCR Methods 0.000 description 2
- 239000000243 solution Substances 0.000 description 2
- 238000003786 synthesis reaction Methods 0.000 description 2
- 108010061238 threonyl-glycine Proteins 0.000 description 2
- 108010072986 threonyl-seryl-lysine Proteins 0.000 description 2
- 238000012546 transfer Methods 0.000 description 2
- 230000009261 transgenic effect Effects 0.000 description 2
- UFTFJSFQGQCHQW-UHFFFAOYSA-N triformin Chemical compound O=COCC(OC=O)COC=O UFTFJSFQGQCHQW-UHFFFAOYSA-N 0.000 description 2
- 208000029729 tumor suppressor gene on chromosome 11 Diseases 0.000 description 2
- 230000005760 tumorsuppression Effects 0.000 description 2
- AXFMEGAFCUULFV-BLFANLJRSA-N (2s)-2-[[(2s)-1-[(2s,3r)-2-amino-3-methylpentanoyl]pyrrolidine-2-carbonyl]amino]pentanedioic acid Chemical compound CC[C@@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O AXFMEGAFCUULFV-BLFANLJRSA-N 0.000 description 1
- NHBKXEKEPDILRR-UHFFFAOYSA-N 2,3-bis(butanoylsulfanyl)propyl butanoate Chemical compound CCCC(=O)OCC(SC(=O)CCC)CSC(=O)CCC NHBKXEKEPDILRR-UHFFFAOYSA-N 0.000 description 1
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 description 1
- 102000004008 5'-Nucleotidase Human genes 0.000 description 1
- 208000031261 Acute myeloid leukaemia Diseases 0.000 description 1
- 102000009346 Adenosine receptors Human genes 0.000 description 1
- 108050000203 Adenosine receptors Proteins 0.000 description 1
- SBGXWWCLHIOABR-UHFFFAOYSA-N Ala Ala Gly Ala Chemical compound CC(N)C(=O)NC(C)C(=O)NCC(=O)NC(C)C(O)=O SBGXWWCLHIOABR-UHFFFAOYSA-N 0.000 description 1
- JBGSZRYCXBPWGX-BQBZGAKWSA-N Ala-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N JBGSZRYCXBPWGX-BQBZGAKWSA-N 0.000 description 1
- GWFSQQNGMPGBEF-GHCJXIJMSA-N Ala-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N GWFSQQNGMPGBEF-GHCJXIJMSA-N 0.000 description 1
- VBRDBGCROKWTPV-XHNCKOQMSA-N Ala-Glu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N VBRDBGCROKWTPV-XHNCKOQMSA-N 0.000 description 1
- BLIMFWGRQKRCGT-YUMQZZPRSA-N Ala-Gly-Lys Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN BLIMFWGRQKRCGT-YUMQZZPRSA-N 0.000 description 1
- SUMYEVXWCAYLLJ-GUBZILKMSA-N Ala-Leu-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O SUMYEVXWCAYLLJ-GUBZILKMSA-N 0.000 description 1
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 1
- BLTRAARCJYVJKV-QEJZJMRPSA-N Ala-Lys-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](Cc1ccccc1)C(O)=O BLTRAARCJYVJKV-QEJZJMRPSA-N 0.000 description 1
- IHRGVZXPTIQNIP-NAKRPEOUSA-N Ala-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](C)N IHRGVZXPTIQNIP-NAKRPEOUSA-N 0.000 description 1
- CNQAFFMNJIQYGX-DRZSPHRISA-N Ala-Phe-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 CNQAFFMNJIQYGX-DRZSPHRISA-N 0.000 description 1
- DCVYRWFAMZFSDA-ZLUOBGJFSA-N Ala-Ser-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DCVYRWFAMZFSDA-ZLUOBGJFSA-N 0.000 description 1
- YYAVDNKUWLAFCV-ACZMJKKPSA-N Ala-Ser-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O YYAVDNKUWLAFCV-ACZMJKKPSA-N 0.000 description 1
- AENHOIXXHKNIQL-AUTRQRHGSA-N Ala-Tyr-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H]([NH3+])C)CC1=CC=C(O)C=C1 AENHOIXXHKNIQL-AUTRQRHGSA-N 0.000 description 1
- YJHKTAMKPGFJCT-NRPADANISA-N Ala-Val-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O YJHKTAMKPGFJCT-NRPADANISA-N 0.000 description 1
- 102100036475 Alanine aminotransferase 1 Human genes 0.000 description 1
- 108010082126 Alanine transaminase Proteins 0.000 description 1
- 102000009027 Albumins Human genes 0.000 description 1
- 108010088751 Albumins Proteins 0.000 description 1
- 201000003076 Angiosarcoma Diseases 0.000 description 1
- 208000031873 Animal Disease Models Diseases 0.000 description 1
- USNSOPDIZILSJP-FXQIFTODSA-N Arg-Asn-Asn Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O USNSOPDIZILSJP-FXQIFTODSA-N 0.000 description 1
- ITVINTQUZMQWJR-QXEWZRGKSA-N Arg-Asn-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O ITVINTQUZMQWJR-QXEWZRGKSA-N 0.000 description 1
- NYZGVTGOMPHSJW-CIUDSAMLSA-N Arg-Glu-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N)CN=C(N)N NYZGVTGOMPHSJW-CIUDSAMLSA-N 0.000 description 1
- NKNILFJYKKHBKE-WPRPVWTQSA-N Arg-Gly-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O NKNILFJYKKHBKE-WPRPVWTQSA-N 0.000 description 1
- OTZMRMHZCMZOJZ-SRVKXCTJSA-N Arg-Leu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O OTZMRMHZCMZOJZ-SRVKXCTJSA-N 0.000 description 1
- GRRXPUAICOGISM-RWMBFGLXSA-N Arg-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O GRRXPUAICOGISM-RWMBFGLXSA-N 0.000 description 1
- PAPSMOYMQDWIOR-AVGNSLFASA-N Arg-Lys-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PAPSMOYMQDWIOR-AVGNSLFASA-N 0.000 description 1
- INXWADWANGLMPJ-JYJNAYRXSA-N Arg-Phe-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCNC(N)=N)C(O)=O)CC1=CC=CC=C1 INXWADWANGLMPJ-JYJNAYRXSA-N 0.000 description 1
- ICRHGPYYXMWHIE-LPEHRKFASA-N Arg-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ICRHGPYYXMWHIE-LPEHRKFASA-N 0.000 description 1
- YNSUUAOAFCVINY-OSUNSFLBSA-N Arg-Thr-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YNSUUAOAFCVINY-OSUNSFLBSA-N 0.000 description 1
- AOJYORNRFWWEIV-IHRRRGAJSA-N Arg-Tyr-Asp Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CC=C(O)C=C1 AOJYORNRFWWEIV-IHRRRGAJSA-N 0.000 description 1
- XYOVHPDDWCEUDY-CIUDSAMLSA-N Asn-Ala-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O XYOVHPDDWCEUDY-CIUDSAMLSA-N 0.000 description 1
- XWGJDUSDTRPQRK-ZLUOBGJFSA-N Asn-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O XWGJDUSDTRPQRK-ZLUOBGJFSA-N 0.000 description 1
- ACRYGQFHAQHDSF-ZLUOBGJFSA-N Asn-Asn-Asn Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ACRYGQFHAQHDSF-ZLUOBGJFSA-N 0.000 description 1
- PCKRJVZAQZWNKM-WHFBIAKZSA-N Asn-Asn-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O PCKRJVZAQZWNKM-WHFBIAKZSA-N 0.000 description 1
- IHUJUZBUOFTIOB-QEJZJMRPSA-N Asn-Gln-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N IHUJUZBUOFTIOB-QEJZJMRPSA-N 0.000 description 1
- OOWSBIOUKIUWLO-RCOVLWMOSA-N Asn-Gly-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O OOWSBIOUKIUWLO-RCOVLWMOSA-N 0.000 description 1
- QEQVUHQQYDZUEN-GUBZILKMSA-N Asn-His-Glu Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N QEQVUHQQYDZUEN-GUBZILKMSA-N 0.000 description 1
- ACKNRKFVYUVWAC-ZPFDUUQYSA-N Asn-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N ACKNRKFVYUVWAC-ZPFDUUQYSA-N 0.000 description 1
- PNHQRQTVBRDIEF-CIUDSAMLSA-N Asn-Leu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(=O)N)N PNHQRQTVBRDIEF-CIUDSAMLSA-N 0.000 description 1
- NLRJGXZWTKXRHP-DCAQKATOSA-N Asn-Leu-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NLRJGXZWTKXRHP-DCAQKATOSA-N 0.000 description 1
- BZWRLDPIWKOVKB-ZPFDUUQYSA-N Asn-Leu-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BZWRLDPIWKOVKB-ZPFDUUQYSA-N 0.000 description 1
- JWKDQOORUCYUIW-ZPFDUUQYSA-N Asn-Lys-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JWKDQOORUCYUIW-ZPFDUUQYSA-N 0.000 description 1
- MYVBTYXSWILFCG-BQBZGAKWSA-N Asn-Met-Gly Chemical compound CSCC[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC(=O)N)N MYVBTYXSWILFCG-BQBZGAKWSA-N 0.000 description 1
- GKKUBLFXKRDMFC-BQBZGAKWSA-N Asn-Pro-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O GKKUBLFXKRDMFC-BQBZGAKWSA-N 0.000 description 1
- NJSNXIOKBHPFMB-GMOBBJLQSA-N Asn-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(=O)N)N NJSNXIOKBHPFMB-GMOBBJLQSA-N 0.000 description 1
- SNYCNNPOFYBCEK-ZLUOBGJFSA-N Asn-Ser-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O SNYCNNPOFYBCEK-ZLUOBGJFSA-N 0.000 description 1
- PIABYSIYPGLLDQ-XVSYOHENSA-N Asn-Thr-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PIABYSIYPGLLDQ-XVSYOHENSA-N 0.000 description 1
- DPWDPEVGACCWTC-SRVKXCTJSA-N Asn-Tyr-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O DPWDPEVGACCWTC-SRVKXCTJSA-N 0.000 description 1
- JNCRAQVYJZGIOW-QSFUFRPTSA-N Asn-Val-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JNCRAQVYJZGIOW-QSFUFRPTSA-N 0.000 description 1
- CBHVAFXKOYAHOY-NHCYSSNCSA-N Asn-Val-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O CBHVAFXKOYAHOY-NHCYSSNCSA-N 0.000 description 1
- HPNDBHLITCHRSO-WHFBIAKZSA-N Asp-Ala-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)NCC(O)=O HPNDBHLITCHRSO-WHFBIAKZSA-N 0.000 description 1
- CXBOKJPLEYUPGB-FXQIFTODSA-N Asp-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)O)N CXBOKJPLEYUPGB-FXQIFTODSA-N 0.000 description 1
- BKXPJCBEHWFSTF-ACZMJKKPSA-N Asp-Gln-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O BKXPJCBEHWFSTF-ACZMJKKPSA-N 0.000 description 1
- UFAQGGZUXVLONR-AVGNSLFASA-N Asp-Gln-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N)O UFAQGGZUXVLONR-AVGNSLFASA-N 0.000 description 1
- IJHUZMGJRGNXIW-CIUDSAMLSA-N Asp-Glu-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IJHUZMGJRGNXIW-CIUDSAMLSA-N 0.000 description 1
- VILLWIDTHYPSLC-PEFMBERDSA-N Asp-Glu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VILLWIDTHYPSLC-PEFMBERDSA-N 0.000 description 1
- PDECQIHABNQRHN-GUBZILKMSA-N Asp-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(O)=O PDECQIHABNQRHN-GUBZILKMSA-N 0.000 description 1
- KHBLRHKVXICFMY-GUBZILKMSA-N Asp-Glu-Lys Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O KHBLRHKVXICFMY-GUBZILKMSA-N 0.000 description 1
- YDJVIBMKAMQPPP-LAEOZQHASA-N Asp-Glu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O YDJVIBMKAMQPPP-LAEOZQHASA-N 0.000 description 1
- YNCHFVRXEQFPBY-BQBZGAKWSA-N Asp-Gly-Arg Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N YNCHFVRXEQFPBY-BQBZGAKWSA-N 0.000 description 1
- SNDBKTFJWVEVPO-WHFBIAKZSA-N Asp-Gly-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SNDBKTFJWVEVPO-WHFBIAKZSA-N 0.000 description 1
- LBOVBQONZJRWPV-YUMQZZPRSA-N Asp-Lys-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O LBOVBQONZJRWPV-YUMQZZPRSA-N 0.000 description 1
- FAUPLTGRUBTXNU-FXQIFTODSA-N Asp-Pro-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O FAUPLTGRUBTXNU-FXQIFTODSA-N 0.000 description 1
- KGHLGJAXYSVNJP-WHFBIAKZSA-N Asp-Ser-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O KGHLGJAXYSVNJP-WHFBIAKZSA-N 0.000 description 1
- YIDFBWRHIYOYAA-LKXGYXEUSA-N Asp-Ser-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O YIDFBWRHIYOYAA-LKXGYXEUSA-N 0.000 description 1
- XQFLFQWOBXPMHW-NHCYSSNCSA-N Asp-Val-His Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O XQFLFQWOBXPMHW-NHCYSSNCSA-N 0.000 description 1
- GZYDPEJSZYZWEF-MXAVVETBSA-N Asp-Val-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O GZYDPEJSZYZWEF-MXAVVETBSA-N 0.000 description 1
- 108010003415 Aspartate Aminotransferases Proteins 0.000 description 1
- 102000004625 Aspartate Aminotransferases Human genes 0.000 description 1
- 206010003571 Astrocytoma Diseases 0.000 description 1
- 208000036170 B-Cell Marginal Zone Lymphoma Diseases 0.000 description 1
- 208000010839 B-cell chronic lymphocytic leukemia Diseases 0.000 description 1
- 208000003950 B-cell lymphoma Diseases 0.000 description 1
- 208000032791 BCR-ABL1 positive chronic myelogenous leukemia Diseases 0.000 description 1
- 235000017166 Bambusa arundinacea Nutrition 0.000 description 1
- 235000017491 Bambusa tulda Nutrition 0.000 description 1
- 206010004593 Bile duct cancer Diseases 0.000 description 1
- 206010005003 Bladder cancer Diseases 0.000 description 1
- 206010005949 Bone cancer Diseases 0.000 description 1
- 208000018084 Bone neoplasm Diseases 0.000 description 1
- 208000003174 Brain Neoplasms Diseases 0.000 description 1
- 238000011752 CBA/J (JAX™ mouse strain) Methods 0.000 description 1
- 238000010354 CRISPR gene editing Methods 0.000 description 1
- 241000282461 Canis lupus Species 0.000 description 1
- 206010008342 Cervix carcinoma Diseases 0.000 description 1
- 208000005243 Chondrosarcoma Diseases 0.000 description 1
- 208000010833 Chronic myeloid leukaemia Diseases 0.000 description 1
- 206010009900 Colitis ulcerative Diseases 0.000 description 1
- JTNKVWLMDHIUOG-IHRRRGAJSA-N Cys-Arg-Phe Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JTNKVWLMDHIUOG-IHRRRGAJSA-N 0.000 description 1
- XGIAHEUULGOZHH-GUBZILKMSA-N Cys-Arg-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CS)N XGIAHEUULGOZHH-GUBZILKMSA-N 0.000 description 1
- VNLYIYOYUNGURO-ZLUOBGJFSA-N Cys-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N VNLYIYOYUNGURO-ZLUOBGJFSA-N 0.000 description 1
- SBDVXRYCOIEYNV-YUMQZZPRSA-N Cys-His-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CS)N SBDVXRYCOIEYNV-YUMQZZPRSA-N 0.000 description 1
- 102000004127 Cytokines Human genes 0.000 description 1
- 108090000695 Cytokines Proteins 0.000 description 1
- 238000011765 DBA/2 mouse Methods 0.000 description 1
- 201000004624 Dermatitis Diseases 0.000 description 1
- 206010014733 Endometrial cancer Diseases 0.000 description 1
- 206010014759 Endometrial neoplasm Diseases 0.000 description 1
- 241000283073 Equus caballus Species 0.000 description 1
- 208000006168 Ewing Sarcoma Diseases 0.000 description 1
- 102100027280 Fanconi anemia group A protein Human genes 0.000 description 1
- 201000008808 Fibrosarcoma Diseases 0.000 description 1
- 241000287828 Gallus gallus Species 0.000 description 1
- KZKBJEUWNMQTLV-XDTLVQLUSA-N Gln-Ala-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KZKBJEUWNMQTLV-XDTLVQLUSA-N 0.000 description 1
- CRRFJBGUGNNOCS-PEFMBERDSA-N Gln-Asp-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CRRFJBGUGNNOCS-PEFMBERDSA-N 0.000 description 1
- PNENQZWRFMUZOM-DCAQKATOSA-N Gln-Glu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O PNENQZWRFMUZOM-DCAQKATOSA-N 0.000 description 1
- SMLDOQHTOAAFJQ-WDSKDSINSA-N Gln-Gly-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SMLDOQHTOAAFJQ-WDSKDSINSA-N 0.000 description 1
- JXFLPKSDLDEOQK-JHEQGTHGSA-N Gln-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCC(N)=O JXFLPKSDLDEOQK-JHEQGTHGSA-N 0.000 description 1
- TWTWUBHEWQPMQW-ZPFDUUQYSA-N Gln-Ile-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TWTWUBHEWQPMQW-ZPFDUUQYSA-N 0.000 description 1
- QKWBEMCLYTYBNI-GVXVVHGQSA-N Gln-Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(N)=O QKWBEMCLYTYBNI-GVXVVHGQSA-N 0.000 description 1
- HMIXCETWRYDVMO-GUBZILKMSA-N Gln-Pro-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O HMIXCETWRYDVMO-GUBZILKMSA-N 0.000 description 1
- XIYWAJQIWLXXAF-XKBZYTNZSA-N Gln-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O XIYWAJQIWLXXAF-XKBZYTNZSA-N 0.000 description 1
- ARYKRXHBIPLULY-XKBZYTNZSA-N Gln-Thr-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ARYKRXHBIPLULY-XKBZYTNZSA-N 0.000 description 1
- VYOILACOFPPNQH-UMNHJUIQSA-N Gln-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N VYOILACOFPPNQH-UMNHJUIQSA-N 0.000 description 1
- KLJMRPIBBLTDGE-ACZMJKKPSA-N Glu-Cys-Asn Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(O)=O KLJMRPIBBLTDGE-ACZMJKKPSA-N 0.000 description 1
- VFZIDQZAEBORGY-GLLZPBPUSA-N Glu-Gln-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VFZIDQZAEBORGY-GLLZPBPUSA-N 0.000 description 1
- PXXGVUVQWQGGIG-YUMQZZPRSA-N Glu-Gly-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N PXXGVUVQWQGGIG-YUMQZZPRSA-N 0.000 description 1
- LRPXYSGPOBVBEH-IUCAKERBSA-N Glu-Gly-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O LRPXYSGPOBVBEH-IUCAKERBSA-N 0.000 description 1
- WVTIBGWZUMJBFY-GUBZILKMSA-N Glu-His-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O WVTIBGWZUMJBFY-GUBZILKMSA-N 0.000 description 1
- ZSWGJYOZWBHROQ-RWRJDSDZSA-N Glu-Ile-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZSWGJYOZWBHROQ-RWRJDSDZSA-N 0.000 description 1
- ATVYZJGOZLVXDK-IUCAKERBSA-N Glu-Leu-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O ATVYZJGOZLVXDK-IUCAKERBSA-N 0.000 description 1
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 1
- ZWMYUDZLXAQHCK-CIUDSAMLSA-N Glu-Met-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O ZWMYUDZLXAQHCK-CIUDSAMLSA-N 0.000 description 1
- XEKAJTCACGEBOK-KKUMJFAQSA-N Glu-Met-Phe Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N XEKAJTCACGEBOK-KKUMJFAQSA-N 0.000 description 1
- UMZHHILWZBFPGL-LOKLDPHHSA-N Glu-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O UMZHHILWZBFPGL-LOKLDPHHSA-N 0.000 description 1
- HJTSRYLPAYGEEC-SIUGBPQLSA-N Glu-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCC(=O)O)N HJTSRYLPAYGEEC-SIUGBPQLSA-N 0.000 description 1
- KIEICAOUSNYOLM-NRPADANISA-N Glu-Val-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O KIEICAOUSNYOLM-NRPADANISA-N 0.000 description 1
- RMWAOBGCZZSJHE-UMNHJUIQSA-N Glu-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N RMWAOBGCZZSJHE-UMNHJUIQSA-N 0.000 description 1
- JRDYDYXZKFNNRQ-XPUUQOCRSA-N Gly-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN JRDYDYXZKFNNRQ-XPUUQOCRSA-N 0.000 description 1
- DUYYPIRFTLOAJQ-YUMQZZPRSA-N Gly-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN DUYYPIRFTLOAJQ-YUMQZZPRSA-N 0.000 description 1
- GRIRDMVMJJDZKV-RCOVLWMOSA-N Gly-Asn-Val Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O GRIRDMVMJJDZKV-RCOVLWMOSA-N 0.000 description 1
- XEJTYSCIXKYSHR-WDSKDSINSA-N Gly-Asp-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN XEJTYSCIXKYSHR-WDSKDSINSA-N 0.000 description 1
- QSTLUOIOYLYLLF-WDSKDSINSA-N Gly-Asp-Glu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QSTLUOIOYLYLLF-WDSKDSINSA-N 0.000 description 1
- GNPVTZJUUBPZKW-WDSKDSINSA-N Gly-Gln-Ser Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O GNPVTZJUUBPZKW-WDSKDSINSA-N 0.000 description 1
- LHRXAHLCRMQBGJ-RYUDHWBXSA-N Gly-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)CN LHRXAHLCRMQBGJ-RYUDHWBXSA-N 0.000 description 1
- JSNNHGHYGYMVCK-XVKPBYJWSA-N Gly-Glu-Val Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O JSNNHGHYGYMVCK-XVKPBYJWSA-N 0.000 description 1
- PDAWDNVHMUKWJR-ZETCQYMHSA-N Gly-Gly-His Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC1=CNC=N1 PDAWDNVHMUKWJR-ZETCQYMHSA-N 0.000 description 1
- UQJNXZSSGQIPIQ-FBCQKBJTSA-N Gly-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)CN UQJNXZSSGQIPIQ-FBCQKBJTSA-N 0.000 description 1
- OLPPXYMMIARYAL-QMMMGPOBSA-N Gly-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)CN OLPPXYMMIARYAL-QMMMGPOBSA-N 0.000 description 1
- ULZCYBYDTUMHNF-IUCAKERBSA-N Gly-Leu-Glu Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ULZCYBYDTUMHNF-IUCAKERBSA-N 0.000 description 1
- FXGRXIATVXUAHO-WEDXCCLWSA-N Gly-Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN FXGRXIATVXUAHO-WEDXCCLWSA-N 0.000 description 1
- CVFOYJJOZYYEPE-KBPBESRZSA-N Gly-Lys-Tyr Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CVFOYJJOZYYEPE-KBPBESRZSA-N 0.000 description 1
- HFPVRZWORNJRRC-UWVGGRQHSA-N Gly-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN HFPVRZWORNJRRC-UWVGGRQHSA-N 0.000 description 1
- MKIAPEZXQDILRR-YUMQZZPRSA-N Gly-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)CN MKIAPEZXQDILRR-YUMQZZPRSA-N 0.000 description 1
- HUFUVTYGPOUCBN-MBLNEYKQSA-N Gly-Thr-Ile Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HUFUVTYGPOUCBN-MBLNEYKQSA-N 0.000 description 1
- GBYYQVBXFVDJPJ-WLTAIBSBSA-N Gly-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)CN)O GBYYQVBXFVDJPJ-WLTAIBSBSA-N 0.000 description 1
- DNVDEMWIYLVIQU-RCOVLWMOSA-N Gly-Val-Asp Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O DNVDEMWIYLVIQU-RCOVLWMOSA-N 0.000 description 1
- SYOJVRNQCXYEOV-XVKPBYJWSA-N Gly-Val-Glu Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SYOJVRNQCXYEOV-XVKPBYJWSA-N 0.000 description 1
- RVKIPWVMZANZLI-UHFFFAOYSA-N H-Lys-Trp-OH Natural products C1=CC=C2C(CC(NC(=O)C(N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-UHFFFAOYSA-N 0.000 description 1
- 208000001258 Hemangiosarcoma Diseases 0.000 description 1
- 208000035186 Hemolytic Autoimmune Anemia Diseases 0.000 description 1
- 208000027761 Hepatic autoimmune disease Diseases 0.000 description 1
- MWWOPNQSBXEUHO-ULQDDVLXSA-N His-Arg-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CN=CN1 MWWOPNQSBXEUHO-ULQDDVLXSA-N 0.000 description 1
- NNBWMLHQXBTIIT-HVTMNAMFSA-N His-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N NNBWMLHQXBTIIT-HVTMNAMFSA-N 0.000 description 1
- AJTBOTWDSRSUDV-ULQDDVLXSA-N His-Phe-Met Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O AJTBOTWDSRSUDV-ULQDDVLXSA-N 0.000 description 1
- JGFWUKYIQAEYAH-DCAQKATOSA-N His-Ser-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O JGFWUKYIQAEYAH-DCAQKATOSA-N 0.000 description 1
- FBVHRDXSCYELMI-PBCZWWQYSA-N His-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N)O FBVHRDXSCYELMI-PBCZWWQYSA-N 0.000 description 1
- XHQYFGPIRUHQIB-PBCZWWQYSA-N His-Thr-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CN=CN1 XHQYFGPIRUHQIB-PBCZWWQYSA-N 0.000 description 1
- XGBVLRJLHUVCNK-DCAQKATOSA-N His-Val-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O XGBVLRJLHUVCNK-DCAQKATOSA-N 0.000 description 1
- DRKZDEFADVYTLU-AVGNSLFASA-N His-Val-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O DRKZDEFADVYTLU-AVGNSLFASA-N 0.000 description 1
- 102100032742 Histone-lysine N-methyltransferase SETD2 Human genes 0.000 description 1
- 208000017604 Hodgkin disease Diseases 0.000 description 1
- 208000021519 Hodgkin lymphoma Diseases 0.000 description 1
- 208000010747 Hodgkins lymphoma Diseases 0.000 description 1
- 101000914673 Homo sapiens Fanconi anemia group A protein Proteins 0.000 description 1
- 101000654725 Homo sapiens Histone-lysine N-methyltransferase SETD2 Proteins 0.000 description 1
- 102000003839 Human Proteins Human genes 0.000 description 1
- 108090000144 Human Proteins Proteins 0.000 description 1
- 206010020751 Hypersensitivity Diseases 0.000 description 1
- 206010020850 Hyperthyroidism Diseases 0.000 description 1
- YKRYHWJRQUSTKG-KBIXCLLPSA-N Ile-Ala-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N YKRYHWJRQUSTKG-KBIXCLLPSA-N 0.000 description 1
- VAXBXNPRXPHGHG-BJDJZHNGSA-N Ile-Ala-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)O)N VAXBXNPRXPHGHG-BJDJZHNGSA-N 0.000 description 1
- HLYBGMZJVDHJEO-CYDGBPFRSA-N Ile-Arg-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N HLYBGMZJVDHJEO-CYDGBPFRSA-N 0.000 description 1
- NULSANWBUWLTKN-NAKRPEOUSA-N Ile-Arg-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N NULSANWBUWLTKN-NAKRPEOUSA-N 0.000 description 1
- QYZYJFXHXYUZMZ-UGYAYLCHSA-N Ile-Asn-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N QYZYJFXHXYUZMZ-UGYAYLCHSA-N 0.000 description 1
- FJWYJQRCVNGEAQ-ZPFDUUQYSA-N Ile-Asn-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N FJWYJQRCVNGEAQ-ZPFDUUQYSA-N 0.000 description 1
- QIHJTGSVGIPHIW-QSFUFRPTSA-N Ile-Asn-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N QIHJTGSVGIPHIW-QSFUFRPTSA-N 0.000 description 1
- FUOYNOXRWPJPAN-QEWYBTABSA-N Ile-Glu-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N FUOYNOXRWPJPAN-QEWYBTABSA-N 0.000 description 1
- APDIECQNNDGFPD-PYJNHQTQSA-N Ile-His-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N APDIECQNNDGFPD-PYJNHQTQSA-N 0.000 description 1
- WIZPFZKOFZXDQG-HTFCKZLJSA-N Ile-Ile-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O WIZPFZKOFZXDQG-HTFCKZLJSA-N 0.000 description 1
- TWYOYAKMLHWMOJ-ZPFDUUQYSA-N Ile-Leu-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O TWYOYAKMLHWMOJ-ZPFDUUQYSA-N 0.000 description 1
- FZWVCYCYWCLQDH-NHCYSSNCSA-N Ile-Leu-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N FZWVCYCYWCLQDH-NHCYSSNCSA-N 0.000 description 1
- DBXXASNNDTXOLU-MXAVVETBSA-N Ile-Leu-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N DBXXASNNDTXOLU-MXAVVETBSA-N 0.000 description 1
- HPCFRQWLTRDGHT-AJNGGQMLSA-N Ile-Leu-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O HPCFRQWLTRDGHT-AJNGGQMLSA-N 0.000 description 1
- GVKKVHNRTUFCCE-BJDJZHNGSA-N Ile-Leu-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)O)N GVKKVHNRTUFCCE-BJDJZHNGSA-N 0.000 description 1
- RMNMUUCYTMLWNA-ZPFDUUQYSA-N Ile-Lys-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N RMNMUUCYTMLWNA-ZPFDUUQYSA-N 0.000 description 1
- IDMNOFVUXYYZPF-DKIMLUQUSA-N Ile-Lys-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N IDMNOFVUXYYZPF-DKIMLUQUSA-N 0.000 description 1
- ZLFNNVATRMCAKN-ZKWXMUAHSA-N Ile-Ser-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZLFNNVATRMCAKN-ZKWXMUAHSA-N 0.000 description 1
- VGSPNSSCMOHRRR-BJDJZHNGSA-N Ile-Ser-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N VGSPNSSCMOHRRR-BJDJZHNGSA-N 0.000 description 1
- PXKACEXYLPBMAD-JBDRJPRFSA-N Ile-Ser-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PXKACEXYLPBMAD-JBDRJPRFSA-N 0.000 description 1
- ZFWISYLMLXFBSX-KKPKCPPISA-N Ile-Trp-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC3=CC=CC=C3)C(=O)O)N ZFWISYLMLXFBSX-KKPKCPPISA-N 0.000 description 1
- NSPNUMNLZNOPAQ-SJWGOKEGSA-N Ile-Tyr-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N NSPNUMNLZNOPAQ-SJWGOKEGSA-N 0.000 description 1
- KXUKTDGKLAOCQK-LSJOCFKGSA-N Ile-Val-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O KXUKTDGKLAOCQK-LSJOCFKGSA-N 0.000 description 1
- APQYGMBHIVXFML-OSUNSFLBSA-N Ile-Val-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N APQYGMBHIVXFML-OSUNSFLBSA-N 0.000 description 1
- QSXSHZIRKTUXNG-STECZYCISA-N Ile-Val-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QSXSHZIRKTUXNG-STECZYCISA-N 0.000 description 1
- 208000008839 Kidney Neoplasms Diseases 0.000 description 1
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 1
- 102220470475 L-seryl-tRNA(Sec) kinase_C57L_mutation Human genes 0.000 description 1
- 208000031671 Large B-Cell Diffuse Lymphoma Diseases 0.000 description 1
- 208000018142 Leiomyosarcoma Diseases 0.000 description 1
- 241000880493 Leptailurus serval Species 0.000 description 1
- CZCSUZMIRKFFFA-CIUDSAMLSA-N Leu-Ala-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O CZCSUZMIRKFFFA-CIUDSAMLSA-N 0.000 description 1
- TWQIYNGNYNJUFM-NHCYSSNCSA-N Leu-Asn-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TWQIYNGNYNJUFM-NHCYSSNCSA-N 0.000 description 1
- BPANDPNDMJHFEV-CIUDSAMLSA-N Leu-Asp-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O BPANDPNDMJHFEV-CIUDSAMLSA-N 0.000 description 1
- ZURHXHNAEJJRNU-CIUDSAMLSA-N Leu-Asp-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZURHXHNAEJJRNU-CIUDSAMLSA-N 0.000 description 1
- ULXYQAJWJGLCNR-YUMQZZPRSA-N Leu-Asp-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O ULXYQAJWJGLCNR-YUMQZZPRSA-N 0.000 description 1
- QCSFMCFHVGTLFF-NHCYSSNCSA-N Leu-Asp-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O QCSFMCFHVGTLFF-NHCYSSNCSA-N 0.000 description 1
- WCTCIIAGNMFYAO-DCAQKATOSA-N Leu-Cys-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O WCTCIIAGNMFYAO-DCAQKATOSA-N 0.000 description 1
- QDSKNVXKLPQNOJ-GVXVVHGQSA-N Leu-Gln-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O QDSKNVXKLPQNOJ-GVXVVHGQSA-N 0.000 description 1
- HVJVUYQWFYMGJS-GVXVVHGQSA-N Leu-Glu-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVJVUYQWFYMGJS-GVXVVHGQSA-N 0.000 description 1
- VBZOAGIPCULURB-QWRGUYRKSA-N Leu-Gly-His Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N VBZOAGIPCULURB-QWRGUYRKSA-N 0.000 description 1
- UCDHVOALNXENLC-KBPBESRZSA-N Leu-Gly-Tyr Chemical compound CC(C)C[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 UCDHVOALNXENLC-KBPBESRZSA-N 0.000 description 1
- DBSLVQBXKVKDKJ-BJDJZHNGSA-N Leu-Ile-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O DBSLVQBXKVKDKJ-BJDJZHNGSA-N 0.000 description 1
- KOSWSHVQIVTVQF-ZPFDUUQYSA-N Leu-Ile-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O KOSWSHVQIVTVQF-ZPFDUUQYSA-N 0.000 description 1
- ORWTWZXGDBYVCP-BJDJZHNGSA-N Leu-Ile-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC(C)C ORWTWZXGDBYVCP-BJDJZHNGSA-N 0.000 description 1
- IAJFFZORSWOZPQ-SRVKXCTJSA-N Leu-Leu-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IAJFFZORSWOZPQ-SRVKXCTJSA-N 0.000 description 1
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 1
- HVHRPWQEQHIQJF-AVGNSLFASA-N Leu-Lys-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HVHRPWQEQHIQJF-AVGNSLFASA-N 0.000 description 1
- LVTJJOJKDCVZGP-QWRGUYRKSA-N Leu-Lys-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O LVTJJOJKDCVZGP-QWRGUYRKSA-N 0.000 description 1
- KPYAOIVPJKPIOU-KKUMJFAQSA-N Leu-Lys-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O KPYAOIVPJKPIOU-KKUMJFAQSA-N 0.000 description 1
- VVQJGYPTIYOFBR-IHRRRGAJSA-N Leu-Lys-Met Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)O)N VVQJGYPTIYOFBR-IHRRRGAJSA-N 0.000 description 1
- OVZLLFONXILPDZ-VOAKCMCISA-N Leu-Lys-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OVZLLFONXILPDZ-VOAKCMCISA-N 0.000 description 1
- LZHJZLHSRGWBBE-IHRRRGAJSA-N Leu-Lys-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LZHJZLHSRGWBBE-IHRRRGAJSA-N 0.000 description 1
- DRWMRVFCKKXHCH-BZSNNMDCSA-N Leu-Phe-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=CC=C1 DRWMRVFCKKXHCH-BZSNNMDCSA-N 0.000 description 1
- YWKNKRAKOCLOLH-OEAJRASXSA-N Leu-Phe-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=CC=C1 YWKNKRAKOCLOLH-OEAJRASXSA-N 0.000 description 1
- RRVCZCNFXIFGRA-DCAQKATOSA-N Leu-Pro-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O RRVCZCNFXIFGRA-DCAQKATOSA-N 0.000 description 1
- XWEVVRRSIOBJOO-SRVKXCTJSA-N Leu-Pro-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O XWEVVRRSIOBJOO-SRVKXCTJSA-N 0.000 description 1
- XXXXOVFBXRERQL-ULQDDVLXSA-N Leu-Pro-Phe Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XXXXOVFBXRERQL-ULQDDVLXSA-N 0.000 description 1
- CHJKEDSZNSONPS-DCAQKATOSA-N Leu-Pro-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O CHJKEDSZNSONPS-DCAQKATOSA-N 0.000 description 1
- JDBQSGMJBMPNFT-AVGNSLFASA-N Leu-Pro-Val Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O JDBQSGMJBMPNFT-AVGNSLFASA-N 0.000 description 1
- IDGZVZJLYFTXSL-DCAQKATOSA-N Leu-Ser-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IDGZVZJLYFTXSL-DCAQKATOSA-N 0.000 description 1
- KZZCOWMDDXDKSS-CIUDSAMLSA-N Leu-Ser-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KZZCOWMDDXDKSS-CIUDSAMLSA-N 0.000 description 1
- ONHCDMBHPQIPAI-YTQUADARSA-N Leu-Trp-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N3CCC[C@@H]3C(=O)O)N ONHCDMBHPQIPAI-YTQUADARSA-N 0.000 description 1
- JGKHAFUAPZCCDU-BZSNNMDCSA-N Leu-Tyr-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=C(O)C=C1 JGKHAFUAPZCCDU-BZSNNMDCSA-N 0.000 description 1
- BGGTYDNTOYRTTR-MEYUZBJRSA-N Leu-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC(C)C)N)O BGGTYDNTOYRTTR-MEYUZBJRSA-N 0.000 description 1
- MVJRBCJCRYGCKV-GVXVVHGQSA-N Leu-Val-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MVJRBCJCRYGCKV-GVXVVHGQSA-N 0.000 description 1
- 206010058467 Lung neoplasm malignant Diseases 0.000 description 1
- 208000031422 Lymphocytic Chronic B-Cell Leukemia Diseases 0.000 description 1
- MPGHETGWWWUHPY-CIUDSAMLSA-N Lys-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN MPGHETGWWWUHPY-CIUDSAMLSA-N 0.000 description 1
- WSXTWLJHTLRFLW-SRVKXCTJSA-N Lys-Ala-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O WSXTWLJHTLRFLW-SRVKXCTJSA-N 0.000 description 1
- YIBOAHAOAWACDK-QEJZJMRPSA-N Lys-Ala-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 YIBOAHAOAWACDK-QEJZJMRPSA-N 0.000 description 1
- SVJRVFPSHPGWFF-DCAQKATOSA-N Lys-Cys-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SVJRVFPSHPGWFF-DCAQKATOSA-N 0.000 description 1
- SSYOBDBNBQBSQE-SRVKXCTJSA-N Lys-Cys-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O SSYOBDBNBQBSQE-SRVKXCTJSA-N 0.000 description 1
- PBIPLDMFHAICIP-DCAQKATOSA-N Lys-Glu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PBIPLDMFHAICIP-DCAQKATOSA-N 0.000 description 1
- ITWQLSZTLBKWJM-YUMQZZPRSA-N Lys-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCCCN ITWQLSZTLBKWJM-YUMQZZPRSA-N 0.000 description 1
- FHIAJWBDZVHLAH-YUMQZZPRSA-N Lys-Gly-Ser Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FHIAJWBDZVHLAH-YUMQZZPRSA-N 0.000 description 1
- DAOSYIZXRCOKII-SRVKXCTJSA-N Lys-His-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O DAOSYIZXRCOKII-SRVKXCTJSA-N 0.000 description 1
- MUXNCRWTWBMNHX-SRVKXCTJSA-N Lys-Leu-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O MUXNCRWTWBMNHX-SRVKXCTJSA-N 0.000 description 1
- AEIIJFBQVGYVEV-YESZJQIVSA-N Lys-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CCCCN)N)C(=O)O AEIIJFBQVGYVEV-YESZJQIVSA-N 0.000 description 1
- OHXUUQDOBQKSNB-AVGNSLFASA-N Lys-Val-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O OHXUUQDOBQKSNB-AVGNSLFASA-N 0.000 description 1
- XABXVVSWUVCZST-GVXVVHGQSA-N Lys-Val-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN XABXVVSWUVCZST-GVXVVHGQSA-N 0.000 description 1
- DRRXXZBXDMLGFC-IHRRRGAJSA-N Lys-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN DRRXXZBXDMLGFC-IHRRRGAJSA-N 0.000 description 1
- HMZPYMSEAALNAE-ULQDDVLXSA-N Lys-Val-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O HMZPYMSEAALNAE-ULQDDVLXSA-N 0.000 description 1
- IKXQOBUBZSOWDY-AVGNSLFASA-N Lys-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N IKXQOBUBZSOWDY-AVGNSLFASA-N 0.000 description 1
- 102000043136 MAP kinase family Human genes 0.000 description 1
- 108091054455 MAP kinase family Proteins 0.000 description 1
- 208000025205 Mantle-Cell Lymphoma Diseases 0.000 description 1
- QEVRUYFHWJJUHZ-DCAQKATOSA-N Met-Ala-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(C)C QEVRUYFHWJJUHZ-DCAQKATOSA-N 0.000 description 1
- WGBMNLCRYKSWAR-DCAQKATOSA-N Met-Asp-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN WGBMNLCRYKSWAR-DCAQKATOSA-N 0.000 description 1
- GTRWUQSSISWRTL-NAKRPEOUSA-N Met-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCSC)N GTRWUQSSISWRTL-NAKRPEOUSA-N 0.000 description 1
- WWWGMQHQSAUXBU-BQBZGAKWSA-N Met-Gly-Asn Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(N)=O WWWGMQHQSAUXBU-BQBZGAKWSA-N 0.000 description 1
- UZWMJZSOXGOVIN-LURJTMIESA-N Met-Gly-Gly Chemical compound CSCC[C@H](N)C(=O)NCC(=O)NCC(O)=O UZWMJZSOXGOVIN-LURJTMIESA-N 0.000 description 1
- RRIHXWPHQSXHAQ-XUXIUFHCSA-N Met-Ile-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCCN)C(O)=O RRIHXWPHQSXHAQ-XUXIUFHCSA-N 0.000 description 1
- WTHGNAAQXISJHP-AVGNSLFASA-N Met-Lys-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O WTHGNAAQXISJHP-AVGNSLFASA-N 0.000 description 1
- ZWBCVBHKXHPCEI-BVSLBCMMSA-N Met-Phe-Trp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)N ZWBCVBHKXHPCEI-BVSLBCMMSA-N 0.000 description 1
- 206010027476 Metastases Diseases 0.000 description 1
- 208000034578 Multiple myelomas Diseases 0.000 description 1
- 241000699729 Muridae Species 0.000 description 1
- 101100120552 Mus musculus Foxp3 gene Proteins 0.000 description 1
- 241000699669 Mus saxicola Species 0.000 description 1
- 201000003793 Myelodysplastic syndrome Diseases 0.000 description 1
- 208000033761 Myelogenous Chronic BCR-ABL Positive Leukemia Diseases 0.000 description 1
- 208000033776 Myeloid Acute Leukemia Diseases 0.000 description 1
- 208000009525 Myocarditis Diseases 0.000 description 1
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 1
- AUEJLPRZGVVDNU-UHFFFAOYSA-N N-L-tyrosyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CC1=CC=C(O)C=C1 AUEJLPRZGVVDNU-UHFFFAOYSA-N 0.000 description 1
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 1
- 108010087066 N2-tryptophyllysine Proteins 0.000 description 1
- 208000001894 Nasopharyngeal Neoplasms Diseases 0.000 description 1
- 206010061306 Nasopharyngeal cancer Diseases 0.000 description 1
- 208000012902 Nervous system disease Diseases 0.000 description 1
- 208000025966 Neurological disease Diseases 0.000 description 1
- 208000015914 Non-Hodgkin lymphomas Diseases 0.000 description 1
- 101150101087 Nt5e gene Proteins 0.000 description 1
- 238000012408 PCR amplification Methods 0.000 description 1
- 206010061902 Pancreatic neoplasm Diseases 0.000 description 1
- WMGVYPPIMZPWPN-SRVKXCTJSA-N Phe-Asp-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N WMGVYPPIMZPWPN-SRVKXCTJSA-N 0.000 description 1
- RIYZXJVARWJLKS-KKUMJFAQSA-N Phe-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 RIYZXJVARWJLKS-KKUMJFAQSA-N 0.000 description 1
- MPFGIYLYWUCSJG-AVGNSLFASA-N Phe-Glu-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 MPFGIYLYWUCSJG-AVGNSLFASA-N 0.000 description 1
- XXAOSEUPEMQJOF-KKUMJFAQSA-N Phe-Glu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 XXAOSEUPEMQJOF-KKUMJFAQSA-N 0.000 description 1
- NAXPHWZXEXNDIW-JTQLQIEISA-N Phe-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 NAXPHWZXEXNDIW-JTQLQIEISA-N 0.000 description 1
- KXUZHWXENMYOHC-QEJZJMRPSA-N Phe-Leu-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O KXUZHWXENMYOHC-QEJZJMRPSA-N 0.000 description 1
- RSPUIENXSJYZQO-JYJNAYRXSA-N Phe-Leu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 RSPUIENXSJYZQO-JYJNAYRXSA-N 0.000 description 1
- YCCUXNNKXDGMAM-KKUMJFAQSA-N Phe-Leu-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YCCUXNNKXDGMAM-KKUMJFAQSA-N 0.000 description 1
- KNYPNEYICHHLQL-ACRUOGEOSA-N Phe-Leu-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 KNYPNEYICHHLQL-ACRUOGEOSA-N 0.000 description 1
- YOFKMVUAZGPFCF-IHRRRGAJSA-N Phe-Met-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(O)=O YOFKMVUAZGPFCF-IHRRRGAJSA-N 0.000 description 1
- IAOZOFPONWDXNT-IXOXFDKPSA-N Phe-Ser-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IAOZOFPONWDXNT-IXOXFDKPSA-N 0.000 description 1
- PTDAGKJHZBGDKD-OEAJRASXSA-N Phe-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N)O PTDAGKJHZBGDKD-OEAJRASXSA-N 0.000 description 1
- MSSXKZBDKZAHCX-UNQGMJICSA-N Phe-Thr-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O MSSXKZBDKZAHCX-UNQGMJICSA-N 0.000 description 1
- 244000082204 Phyllostachys viridis Species 0.000 description 1
- 235000015334 Phyllostachys viridis Nutrition 0.000 description 1
- 206010035226 Plasma cell myeloma Diseases 0.000 description 1
- AHXPYZRZRMQOAU-QXEWZRGKSA-N Pro-Asn-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1)C(O)=O AHXPYZRZRMQOAU-QXEWZRGKSA-N 0.000 description 1
- VJLJGKQAOQJXJG-CIUDSAMLSA-N Pro-Asp-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VJLJGKQAOQJXJG-CIUDSAMLSA-N 0.000 description 1
- MGDFPGCFVJFITQ-CIUDSAMLSA-N Pro-Glu-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MGDFPGCFVJFITQ-CIUDSAMLSA-N 0.000 description 1
- VPEVBAUSTBWQHN-NHCYSSNCSA-N Pro-Glu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O VPEVBAUSTBWQHN-NHCYSSNCSA-N 0.000 description 1
- ULIWFCCJIOEHMU-BQBZGAKWSA-N Pro-Gly-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 ULIWFCCJIOEHMU-BQBZGAKWSA-N 0.000 description 1
- AFXCXDQNRXTSBD-FJXKBIBVSA-N Pro-Gly-Thr Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O AFXCXDQNRXTSBD-FJXKBIBVSA-N 0.000 description 1
- AQGUSRZKDZYGGV-GMOBBJLQSA-N Pro-Ile-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O AQGUSRZKDZYGGV-GMOBBJLQSA-N 0.000 description 1
- VZKBJNBZMZHKRC-XUXIUFHCSA-N Pro-Ile-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O VZKBJNBZMZHKRC-XUXIUFHCSA-N 0.000 description 1
- RYJRPPUATSKNAY-STECZYCISA-N Pro-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@@H]2CCCN2 RYJRPPUATSKNAY-STECZYCISA-N 0.000 description 1
- FMLRRBDLBJLJIK-DCAQKATOSA-N Pro-Leu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FMLRRBDLBJLJIK-DCAQKATOSA-N 0.000 description 1
- XYSXOCIWCPFOCG-IHRRRGAJSA-N Pro-Leu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XYSXOCIWCPFOCG-IHRRRGAJSA-N 0.000 description 1
- MHBSUKYVBZVQRW-HJWJTTGWSA-N Pro-Phe-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MHBSUKYVBZVQRW-HJWJTTGWSA-N 0.000 description 1
- FDMKYQQYJKYCLV-GUBZILKMSA-N Pro-Pro-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 FDMKYQQYJKYCLV-GUBZILKMSA-N 0.000 description 1
- UGDMQJSXSSZUKL-IHRRRGAJSA-N Pro-Ser-Tyr Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O UGDMQJSXSSZUKL-IHRRRGAJSA-N 0.000 description 1
- FHJQROWZEJFZPO-SRVKXCTJSA-N Pro-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FHJQROWZEJFZPO-SRVKXCTJSA-N 0.000 description 1
- 238000012356 Product development Methods 0.000 description 1
- 206010060862 Prostate cancer Diseases 0.000 description 1
- 208000000236 Prostatic Neoplasms Diseases 0.000 description 1
- 108010076504 Protein Sorting Signals Proteins 0.000 description 1
- 108020004511 Recombinant DNA Proteins 0.000 description 1
- 208000015634 Rectal Neoplasms Diseases 0.000 description 1
- 206010038389 Renal cancer Diseases 0.000 description 1
- 206010039710 Scleroderma Diseases 0.000 description 1
- ZUGXSSFMTXKHJS-ZLUOBGJFSA-N Ser-Ala-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O ZUGXSSFMTXKHJS-ZLUOBGJFSA-N 0.000 description 1
- MWMKFWJYRRGXOR-ZLUOBGJFSA-N Ser-Ala-Asn Chemical compound N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)O)CC(N)=O)C)CO MWMKFWJYRRGXOR-ZLUOBGJFSA-N 0.000 description 1
- JJKSSJVYOVRJMZ-FXQIFTODSA-N Ser-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N)CN=C(N)N JJKSSJVYOVRJMZ-FXQIFTODSA-N 0.000 description 1
- WDXYVIIVDIDOSX-DCAQKATOSA-N Ser-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N WDXYVIIVDIDOSX-DCAQKATOSA-N 0.000 description 1
- UGJRQLURDVGULT-LKXGYXEUSA-N Ser-Asn-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UGJRQLURDVGULT-LKXGYXEUSA-N 0.000 description 1
- MESDJCNHLZBMEP-ZLUOBGJFSA-N Ser-Asp-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MESDJCNHLZBMEP-ZLUOBGJFSA-N 0.000 description 1
- HJEBZBMOTCQYDN-ACZMJKKPSA-N Ser-Glu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HJEBZBMOTCQYDN-ACZMJKKPSA-N 0.000 description 1
- MUARUIBTKQJKFY-WHFBIAKZSA-N Ser-Gly-Asp Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MUARUIBTKQJKFY-WHFBIAKZSA-N 0.000 description 1
- GZFAWAQTEYDKII-YUMQZZPRSA-N Ser-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO GZFAWAQTEYDKII-YUMQZZPRSA-N 0.000 description 1
- JFWDJFULOLKQFY-QWRGUYRKSA-N Ser-Gly-Phe Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JFWDJFULOLKQFY-QWRGUYRKSA-N 0.000 description 1
- WEQAYODCJHZSJZ-KKUMJFAQSA-N Ser-His-Tyr Chemical compound C([C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CN=CN1 WEQAYODCJHZSJZ-KKUMJFAQSA-N 0.000 description 1
- JLPMFVAIQHCBDC-CIUDSAMLSA-N Ser-Lys-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N JLPMFVAIQHCBDC-CIUDSAMLSA-N 0.000 description 1
- OCWWJBZQXGYQCA-DCAQKATOSA-N Ser-Lys-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O OCWWJBZQXGYQCA-DCAQKATOSA-N 0.000 description 1
- AXVNLRQLPLSIPQ-FXQIFTODSA-N Ser-Met-Cys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N AXVNLRQLPLSIPQ-FXQIFTODSA-N 0.000 description 1
- MQUZANJDFOQOBX-SRVKXCTJSA-N Ser-Phe-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O MQUZANJDFOQOBX-SRVKXCTJSA-N 0.000 description 1
- FZXOPYUEQGDGMS-ACZMJKKPSA-N Ser-Ser-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O FZXOPYUEQGDGMS-ACZMJKKPSA-N 0.000 description 1
- JCLAFVNDBJMLBC-JBDRJPRFSA-N Ser-Ser-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JCLAFVNDBJMLBC-JBDRJPRFSA-N 0.000 description 1
- SZRNDHWMVSFPSP-XKBZYTNZSA-N Ser-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N)O SZRNDHWMVSFPSP-XKBZYTNZSA-N 0.000 description 1
- PURRNJBBXDDWLX-ZDLURKLDSA-N Ser-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CO)N)O PURRNJBBXDDWLX-ZDLURKLDSA-N 0.000 description 1
- NADLKBTYNKUJEP-KATARQTJSA-N Ser-Thr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NADLKBTYNKUJEP-KATARQTJSA-N 0.000 description 1
- ZKOKTQPHFMRSJP-YJRXYDGGSA-N Ser-Thr-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZKOKTQPHFMRSJP-YJRXYDGGSA-N 0.000 description 1
- FHXGMDRKJHKLKW-QWRGUYRKSA-N Ser-Tyr-Gly Chemical compound OC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 FHXGMDRKJHKLKW-QWRGUYRKSA-N 0.000 description 1
- JZRYFUGREMECBH-XPUUQOCRSA-N Ser-Val-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O JZRYFUGREMECBH-XPUUQOCRSA-N 0.000 description 1
- 102000007562 Serum Albumin Human genes 0.000 description 1
- 108010071390 Serum Albumin Proteins 0.000 description 1
- 108020004688 Small Nuclear RNA Proteins 0.000 description 1
- 102000039471 Small Nuclear RNA Human genes 0.000 description 1
- 108020004459 Small interfering RNA Proteins 0.000 description 1
- 208000021712 Soft tissue sarcoma Diseases 0.000 description 1
- 208000027520 Somatoform disease Diseases 0.000 description 1
- 208000005718 Stomach Neoplasms Diseases 0.000 description 1
- 206010042971 T-cell lymphoma Diseases 0.000 description 1
- 208000027585 T-cell non-Hodgkin lymphoma Diseases 0.000 description 1
- 208000024313 Testicular Neoplasms Diseases 0.000 description 1
- 206010057644 Testis cancer Diseases 0.000 description 1
- YLXAMFZYJTZXFH-OLHMAJIHSA-N Thr-Asn-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O YLXAMFZYJTZXFH-OLHMAJIHSA-N 0.000 description 1
- SKHPKKYKDYULDH-HJGDQZAQSA-N Thr-Asn-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O SKHPKKYKDYULDH-HJGDQZAQSA-N 0.000 description 1
- XFTYVCHLARBHBQ-FOHZUACHSA-N Thr-Gly-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O XFTYVCHLARBHBQ-FOHZUACHSA-N 0.000 description 1
- ADPHPKGWVDHWML-PPCPHDFISA-N Thr-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N ADPHPKGWVDHWML-PPCPHDFISA-N 0.000 description 1
- AHOLTQCAVBSUDP-PPCPHDFISA-N Thr-Ile-Lys Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)[C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O AHOLTQCAVBSUDP-PPCPHDFISA-N 0.000 description 1
- GXUWHVZYDAHFSV-FLBSBUHZSA-N Thr-Ile-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GXUWHVZYDAHFSV-FLBSBUHZSA-N 0.000 description 1
- XUGYQLFEJYZOKQ-NGTWOADLSA-N Thr-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N XUGYQLFEJYZOKQ-NGTWOADLSA-N 0.000 description 1
- IMDMLDSVUSMAEJ-HJGDQZAQSA-N Thr-Leu-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IMDMLDSVUSMAEJ-HJGDQZAQSA-N 0.000 description 1
- WYLAVUAWOUVUCA-XVSYOHENSA-N Thr-Phe-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O WYLAVUAWOUVUCA-XVSYOHENSA-N 0.000 description 1
- OLFOOYQTTQSSRK-UNQGMJICSA-N Thr-Pro-Phe Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OLFOOYQTTQSSRK-UNQGMJICSA-N 0.000 description 1
- IQPWNQRRAJHOKV-KATARQTJSA-N Thr-Ser-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN IQPWNQRRAJHOKV-KATARQTJSA-N 0.000 description 1
- XGUAUKUYQHBUNY-SWRJLBSHSA-N Thr-Trp-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(O)=O XGUAUKUYQHBUNY-SWRJLBSHSA-N 0.000 description 1
- 208000024770 Thyroid neoplasm Diseases 0.000 description 1
- 108020004566 Transfer RNA Proteins 0.000 description 1
- 108700019146 Transgenes Proteins 0.000 description 1
- NXAPHBHZCMQORW-FDARSICLSA-N Trp-Arg-Ile Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NXAPHBHZCMQORW-FDARSICLSA-N 0.000 description 1
- IBBBOLAPFHRDHW-BPUTZDHNSA-N Trp-Asn-Arg Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N IBBBOLAPFHRDHW-BPUTZDHNSA-N 0.000 description 1
- XKGZEDNYGPNJAR-XIRDDKMYSA-N Trp-Asn-His Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)N XKGZEDNYGPNJAR-XIRDDKMYSA-N 0.000 description 1
- OENGVSDBQHHGBU-QEJZJMRPSA-N Trp-Glu-Asn Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OENGVSDBQHHGBU-QEJZJMRPSA-N 0.000 description 1
- XGFOXYJQBRTJPO-PJODQICGSA-N Trp-Pro-Ala Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O XGFOXYJQBRTJPO-PJODQICGSA-N 0.000 description 1
- NLMXVDDEQFKQQU-CFMVVWHZSA-N Tyr-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NLMXVDDEQFKQQU-CFMVVWHZSA-N 0.000 description 1
- NRFTYDWKWGJLAR-MELADBBJSA-N Tyr-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O NRFTYDWKWGJLAR-MELADBBJSA-N 0.000 description 1
- RYSNTWVRSLCAJZ-RYUDHWBXSA-N Tyr-Gln-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 RYSNTWVRSLCAJZ-RYUDHWBXSA-N 0.000 description 1
- JWGXUKHIKXZWNG-RYUDHWBXSA-N Tyr-Gly-Gln Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O JWGXUKHIKXZWNG-RYUDHWBXSA-N 0.000 description 1
- DWAMXBFJNZIHMC-KBPBESRZSA-N Tyr-Leu-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O DWAMXBFJNZIHMC-KBPBESRZSA-N 0.000 description 1
- PRONOHBTMLNXCZ-BZSNNMDCSA-N Tyr-Leu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 PRONOHBTMLNXCZ-BZSNNMDCSA-N 0.000 description 1
- CDBXVDXSLPLFMD-BPNCWPANSA-N Tyr-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=C(O)C=C1 CDBXVDXSLPLFMD-BPNCWPANSA-N 0.000 description 1
- RCMWNNJFKNDKQR-UFYCRDLUSA-N Tyr-Pro-Phe Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 RCMWNNJFKNDKQR-UFYCRDLUSA-N 0.000 description 1
- 201000006704 Ulcerative Colitis Diseases 0.000 description 1
- 208000007097 Urinary Bladder Neoplasms Diseases 0.000 description 1
- 208000006105 Uterine Cervical Neoplasms Diseases 0.000 description 1
- DDRBQONWVBDQOY-GUBZILKMSA-N Val-Ala-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O DDRBQONWVBDQOY-GUBZILKMSA-N 0.000 description 1
- REJBPZVUHYNMEN-LSJOCFKGSA-N Val-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C(C)C)N REJBPZVUHYNMEN-LSJOCFKGSA-N 0.000 description 1
- QPZMOUMNTGTEFR-ZKWXMUAHSA-N Val-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N QPZMOUMNTGTEFR-ZKWXMUAHSA-N 0.000 description 1
- UDNYEPLJTRDMEJ-RCOVLWMOSA-N Val-Asn-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N UDNYEPLJTRDMEJ-RCOVLWMOSA-N 0.000 description 1
- OGNMURQZFMHFFD-NHCYSSNCSA-N Val-Asn-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N OGNMURQZFMHFFD-NHCYSSNCSA-N 0.000 description 1
- BMGOFDMKDVVGJG-NHCYSSNCSA-N Val-Asp-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BMGOFDMKDVVGJG-NHCYSSNCSA-N 0.000 description 1
- HURRXSNHCCSJHA-AUTRQRHGSA-N Val-Gln-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N HURRXSNHCCSJHA-AUTRQRHGSA-N 0.000 description 1
- VFOHXOLPLACADK-GVXVVHGQSA-N Val-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C(C)C)N VFOHXOLPLACADK-GVXVVHGQSA-N 0.000 description 1
- PIFJAFRUVWZRKR-QMMMGPOBSA-N Val-Gly-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O PIFJAFRUVWZRKR-QMMMGPOBSA-N 0.000 description 1
- PMDOQZFYGWZSTK-LSJOCFKGSA-N Val-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C PMDOQZFYGWZSTK-LSJOCFKGSA-N 0.000 description 1
- OPGWZDIYEYJVRX-AVGNSLFASA-N Val-His-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N OPGWZDIYEYJVRX-AVGNSLFASA-N 0.000 description 1
- HQYVQDRYODWONX-DCAQKATOSA-N Val-His-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CO)C(=O)O)N HQYVQDRYODWONX-DCAQKATOSA-N 0.000 description 1
- ZHQWPWQNVRCXAX-XQQFMLRXSA-N Val-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZHQWPWQNVRCXAX-XQQFMLRXSA-N 0.000 description 1
- YLRAFVVWZRSZQC-DZKIICNBSA-N Val-Phe-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N YLRAFVVWZRSZQC-DZKIICNBSA-N 0.000 description 1
- XBJKAZATRJBDCU-GUBZILKMSA-N Val-Pro-Ala Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O XBJKAZATRJBDCU-GUBZILKMSA-N 0.000 description 1
- SSYBNWFXCFNRFN-GUBZILKMSA-N Val-Pro-Ser Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O SSYBNWFXCFNRFN-GUBZILKMSA-N 0.000 description 1
- CEKSLIVSNNGOKH-KZVJFYERSA-N Val-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](C(C)C)N)O CEKSLIVSNNGOKH-KZVJFYERSA-N 0.000 description 1
- LCHZBEUVGAVMKS-RHYQMDGZSA-N Val-Thr-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)[C@@H](C)O)C(O)=O LCHZBEUVGAVMKS-RHYQMDGZSA-N 0.000 description 1
- VTIAEOKFUJJBTC-YDHLFZDLSA-N Val-Tyr-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N VTIAEOKFUJJBTC-YDHLFZDLSA-N 0.000 description 1
- JXWGBRRVTRAZQA-ULQDDVLXSA-N Val-Tyr-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N JXWGBRRVTRAZQA-ULQDDVLXSA-N 0.000 description 1
- RLVTVHSDKHBFQP-ULQDDVLXSA-N Val-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=C(O)C=C1 RLVTVHSDKHBFQP-ULQDDVLXSA-N 0.000 description 1
- AOILQMZPNLUXCM-AVGNSLFASA-N Val-Val-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN AOILQMZPNLUXCM-AVGNSLFASA-N 0.000 description 1
- LLJLBRRXKZTTRD-GUBZILKMSA-N Val-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N LLJLBRRXKZTTRD-GUBZILKMSA-N 0.000 description 1
- 208000033559 Waldenström macroglobulinemia Diseases 0.000 description 1
- HCHKCACWOHOZIP-UHFFFAOYSA-N Zinc Chemical compound [Zn] HCHKCACWOHOZIP-UHFFFAOYSA-N 0.000 description 1
- 230000009471 action Effects 0.000 description 1
- 230000003213 activating effect Effects 0.000 description 1
- 230000004913 activation Effects 0.000 description 1
- 230000001154 acute effect Effects 0.000 description 1
- 208000037844 advanced solid tumor Diseases 0.000 description 1
- 108010005233 alanylglutamic acid Proteins 0.000 description 1
- 108010044940 alanylglutamine Proteins 0.000 description 1
- 208000026935 allergic disease Diseases 0.000 description 1
- 230000007815 allergy Effects 0.000 description 1
- 238000000137 annealing Methods 0.000 description 1
- 229940125644 antibody drug Drugs 0.000 description 1
- 238000009175 antibody therapy Methods 0.000 description 1
- 102000025171 antigen binding proteins Human genes 0.000 description 1
- 108091000831 antigen binding proteins Proteins 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- 108010008355 arginyl-glutamine Proteins 0.000 description 1
- 108010091092 arginyl-glycyl-proline Proteins 0.000 description 1
- 108010062796 arginyllysine Proteins 0.000 description 1
- 210000002565 arteriole Anatomy 0.000 description 1
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 1
- 108010093581 aspartyl-proline Proteins 0.000 description 1
- 108010092854 aspartyllysine Proteins 0.000 description 1
- 208000006673 asthma Diseases 0.000 description 1
- 230000001363 autoimmune Effects 0.000 description 1
- 201000000448 autoimmune hemolytic anemia Diseases 0.000 description 1
- 201000003710 autoimmune thrombocytopenic purpura Diseases 0.000 description 1
- 239000011425 bamboo Substances 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 208000026900 bile duct neoplasm Diseases 0.000 description 1
- 102000023732 binding proteins Human genes 0.000 description 1
- 108091008324 binding proteins Proteins 0.000 description 1
- 238000010876 biochemical test Methods 0.000 description 1
- 210000005068 bladder tissue Anatomy 0.000 description 1
- 230000000903 blocking effect Effects 0.000 description 1
- 210000000601 blood cell Anatomy 0.000 description 1
- 238000009534 blood test Methods 0.000 description 1
- 210000000170 cell membrane Anatomy 0.000 description 1
- 201000010881 cervical cancer Diseases 0.000 description 1
- 238000002512 chemotherapy Methods 0.000 description 1
- 235000013330 chicken meat Nutrition 0.000 description 1
- 208000006990 cholangiocarcinoma Diseases 0.000 description 1
- 208000032852 chronic lymphocytic leukemia Diseases 0.000 description 1
- 230000009194 climbing Effects 0.000 description 1
- 238000010367 cloning Methods 0.000 description 1
- 229910017052 cobalt Inorganic materials 0.000 description 1
- 239000010941 cobalt Substances 0.000 description 1
- GUTLYIVDDKVIGB-UHFFFAOYSA-N cobalt atom Chemical compound [Co] GUTLYIVDDKVIGB-UHFFFAOYSA-N 0.000 description 1
- 230000000295 complement effect Effects 0.000 description 1
- 150000001875 compounds Chemical class 0.000 description 1
- 238000012790 confirmation Methods 0.000 description 1
- 230000001276 controlling effect Effects 0.000 description 1
- 230000000139 costimulatory effect Effects 0.000 description 1
- 230000002950 deficient Effects 0.000 description 1
- 239000005547 deoxyribonucleotide Substances 0.000 description 1
- 125000002637 deoxyribonucleotide group Chemical group 0.000 description 1
- 230000030609 dephosphorylation Effects 0.000 description 1
- 238000006209 dephosphorylation reaction Methods 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 206010012601 diabetes mellitus Diseases 0.000 description 1
- 238000003745 diagnosis Methods 0.000 description 1
- 230000004069 differentiation Effects 0.000 description 1
- 206010012818 diffuse large B-cell lymphoma Diseases 0.000 description 1
- 238000007877 drug screening Methods 0.000 description 1
- 238000002651 drug therapy Methods 0.000 description 1
- 230000009977 dual effect Effects 0.000 description 1
- 230000002526 effect on cardiovascular system Effects 0.000 description 1
- 230000008030 elimination Effects 0.000 description 1
- 238000003379 elimination reaction Methods 0.000 description 1
- 210000002889 endothelial cell Anatomy 0.000 description 1
- 230000017188 evasion or tolerance of host immune response Effects 0.000 description 1
- 230000035558 fertility Effects 0.000 description 1
- 230000004720 fertilization Effects 0.000 description 1
- 201000003444 follicular lymphoma Diseases 0.000 description 1
- 235000013305 food Nutrition 0.000 description 1
- 235000012631 food intake Nutrition 0.000 description 1
- 206010017758 gastric cancer Diseases 0.000 description 1
- 238000012224 gene deletion Methods 0.000 description 1
- 238000003209 gene knockout Methods 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- 208000005017 glioblastoma Diseases 0.000 description 1
- 230000001434 glomerular Effects 0.000 description 1
- 108010049041 glutamylalanine Proteins 0.000 description 1
- 108010027668 glycyl-alanyl-valine Proteins 0.000 description 1
- 108010078326 glycyl-glycyl-valine Proteins 0.000 description 1
- 108010066198 glycyl-leucyl-phenylalanine Proteins 0.000 description 1
- 108010050475 glycyl-leucyl-tyrosine Proteins 0.000 description 1
- 108010087823 glycyltyrosine Proteins 0.000 description 1
- 108010037850 glycylvaline Proteins 0.000 description 1
- 230000036449 good health Effects 0.000 description 1
- 201000010536 head and neck cancer Diseases 0.000 description 1
- 208000014829 head and neck neoplasm Diseases 0.000 description 1
- 230000003862 health status Effects 0.000 description 1
- 208000006454 hepatitis Diseases 0.000 description 1
- 231100000283 hepatitis Toxicity 0.000 description 1
- 108010085325 histidylproline Proteins 0.000 description 1
- 230000013632 homeostatic process Effects 0.000 description 1
- 238000002744 homologous recombination Methods 0.000 description 1
- 230000006801 homologous recombination Effects 0.000 description 1
- 210000005260 human cell Anatomy 0.000 description 1
- 238000009396 hybridization Methods 0.000 description 1
- 230000007062 hydrolysis Effects 0.000 description 1
- 238000006460 hydrolysis reaction Methods 0.000 description 1
- 210000001822 immobilized cell Anatomy 0.000 description 1
- 230000000984 immunochemical effect Effects 0.000 description 1
- 238000013394 immunophenotyping Methods 0.000 description 1
- 230000001976 improved effect Effects 0.000 description 1
- 230000006698 induction Effects 0.000 description 1
- 230000002401 inhibitory effect Effects 0.000 description 1
- 238000002347 injection Methods 0.000 description 1
- 239000007924 injection Substances 0.000 description 1
- 238000011081 inoculation Methods 0.000 description 1
- 238000007689 inspection Methods 0.000 description 1
- 108010027338 isoleucylcysteine Proteins 0.000 description 1
- 201000010982 kidney cancer Diseases 0.000 description 1
- 230000003907 kidney function Effects 0.000 description 1
- 238000011813 knockout mouse model Methods 0.000 description 1
- 108010083708 leucyl-aspartyl-valine Proteins 0.000 description 1
- 239000003446 ligand Substances 0.000 description 1
- 230000000670 limiting effect Effects 0.000 description 1
- 206010024627 liposarcoma Diseases 0.000 description 1
- 238000010859 live-cell imaging Methods 0.000 description 1
- 201000007270 liver cancer Diseases 0.000 description 1
- 208000014018 liver neoplasm Diseases 0.000 description 1
- 201000005202 lung cancer Diseases 0.000 description 1
- 208000020816 lung neoplasm Diseases 0.000 description 1
- 210000003563 lymphoid tissue Anatomy 0.000 description 1
- 108010089256 lysyl-aspartyl-glutamyl-leucine Proteins 0.000 description 1
- 108010064235 lysylglycine Proteins 0.000 description 1
- 108010038320 lysylphenylalanine Proteins 0.000 description 1
- 108010017391 lysylvaline Proteins 0.000 description 1
- 208000015486 malignant pancreatic neoplasm Diseases 0.000 description 1
- 210000004962 mammalian cell Anatomy 0.000 description 1
- 201000007924 marginal zone B-cell lymphoma Diseases 0.000 description 1
- 208000021937 marginal zone lymphoma Diseases 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 230000001404 mediated effect Effects 0.000 description 1
- 201000001441 melanoma Diseases 0.000 description 1
- 230000004066 metabolic change Effects 0.000 description 1
- 239000002207 metabolite Substances 0.000 description 1
- 229910021645 metal ion Inorganic materials 0.000 description 1
- 230000009401 metastasis Effects 0.000 description 1
- 108091070501 miRNA Proteins 0.000 description 1
- 239000002679 microRNA Substances 0.000 description 1
- 230000002906 microbiologic effect Effects 0.000 description 1
- 239000000203 mixture Substances 0.000 description 1
- 238000010172 mouse model Methods 0.000 description 1
- 239000013642 negative control Substances 0.000 description 1
- 230000012177 negative regulation of immune response Effects 0.000 description 1
- 201000008383 nephritis Diseases 0.000 description 1
- 230000000324 neuroprotective effect Effects 0.000 description 1
- 210000000440 neutrophil Anatomy 0.000 description 1
- 238000007899 nucleic acid hybridization Methods 0.000 description 1
- 210000004940 nucleus Anatomy 0.000 description 1
- 238000002515 oligonucleotide synthesis Methods 0.000 description 1
- 201000008968 osteosarcoma Diseases 0.000 description 1
- 208000027753 pain disease Diseases 0.000 description 1
- 201000002528 pancreatic cancer Diseases 0.000 description 1
- 208000008443 pancreatic carcinoma Diseases 0.000 description 1
- 230000036961 partial effect Effects 0.000 description 1
- 230000008506 pathogenesis Effects 0.000 description 1
- 230000001575 pathological effect Effects 0.000 description 1
- 230000007170 pathology Effects 0.000 description 1
- 230000037361 pathway Effects 0.000 description 1
- 230000010412 perfusion Effects 0.000 description 1
- 230000035699 permeability Effects 0.000 description 1
- 230000000144 pharmacologic effect Effects 0.000 description 1
- 108010018625 phenylalanylarginine Proteins 0.000 description 1
- 108010012581 phenylalanylglutamate Proteins 0.000 description 1
- 108010073101 phenylalanylleucine Proteins 0.000 description 1
- 230000035479 physiological effects, processes and functions Effects 0.000 description 1
- 208000031223 plasma cell leukemia Diseases 0.000 description 1
- 230000010118 platelet activation Effects 0.000 description 1
- 238000004393 prognosis Methods 0.000 description 1
- 230000035755 proliferation Effects 0.000 description 1
- 108010015796 prolylisoleucine Proteins 0.000 description 1
- 108010053725 prolylvaline Proteins 0.000 description 1
- 230000001737 promoting effect Effects 0.000 description 1
- 108010043671 prostatic acid phosphatase Proteins 0.000 description 1
- 108700019620 rat Foxp3 Proteins 0.000 description 1
- 206010038038 rectal cancer Diseases 0.000 description 1
- 201000001275 rectum cancer Diseases 0.000 description 1
- 230000000306 recurrent effect Effects 0.000 description 1
- 239000005871 repellent Substances 0.000 description 1
- 230000002940 repellent Effects 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 108091008146 restriction endonucleases Proteins 0.000 description 1
- 201000009410 rhabdomyosarcoma Diseases 0.000 description 1
- 206010039073 rheumatoid arthritis Diseases 0.000 description 1
- 125000002652 ribonucleotide group Chemical group 0.000 description 1
- 239000011435 rock Substances 0.000 description 1
- 230000035945 sensitivity Effects 0.000 description 1
- 125000003607 serino group Chemical group [H]N([H])[C@]([H])(C(=O)[*])C(O[H])([H])[H] 0.000 description 1
- 210000002966 serum Anatomy 0.000 description 1
- 108010048818 seryl-histidine Proteins 0.000 description 1
- 108010026333 seryl-proline Proteins 0.000 description 1
- 108010071207 serylmethionine Proteins 0.000 description 1
- 230000011664 signaling Effects 0.000 description 1
- 230000000392 somatic effect Effects 0.000 description 1
- 241000894007 species Species 0.000 description 1
- 210000004989 spleen cell Anatomy 0.000 description 1
- 108010005652 splenotritin Proteins 0.000 description 1
- 238000010186 staining Methods 0.000 description 1
- 201000011549 stomach cancer Diseases 0.000 description 1
- 239000000126 substance Substances 0.000 description 1
- 206010042863 synovial sarcoma Diseases 0.000 description 1
- 201000000596 systemic lupus erythematosus Diseases 0.000 description 1
- 201000003120 testicular cancer Diseases 0.000 description 1
- 201000002510 thyroid cancer Diseases 0.000 description 1
- 238000001890 transfection Methods 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
- 238000013519 translation Methods 0.000 description 1
- 238000011269 treatment regimen Methods 0.000 description 1
- 230000005748 tumor development Effects 0.000 description 1
- 230000005751 tumor progression Effects 0.000 description 1
- 108010035534 tyrosyl-leucyl-alanine Proteins 0.000 description 1
- 108010051110 tyrosyl-lysine Proteins 0.000 description 1
- 108010020532 tyrosyl-proline Proteins 0.000 description 1
- 230000003827 upregulation Effects 0.000 description 1
- 201000005112 urinary bladder cancer Diseases 0.000 description 1
- 108010073969 valyllysine Proteins 0.000 description 1
- 108010009962 valyltyrosine Proteins 0.000 description 1
- 230000002792 vascular Effects 0.000 description 1
- 230000008728 vascular permeability Effects 0.000 description 1
- 230000006442 vascular tone Effects 0.000 description 1
- 229910052725 zinc Inorganic materials 0.000 description 1
- 239000011701 zinc Substances 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/85—Vectors or expression systems specially adapted for eukaryotic hosts for animal cells
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01K—ANIMAL HUSBANDRY; AVICULTURE; APICULTURE; PISCICULTURE; FISHING; REARING OR BREEDING ANIMALS, NOT OTHERWISE PROVIDED FOR; NEW BREEDS OF ANIMALS
- A01K67/00—Rearing or breeding animals, not otherwise provided for; New or modified breeds of animals
- A01K67/027—New or modified breeds of vertebrates
- A01K67/0275—Genetically modified vertebrates, e.g. transgenic
- A01K67/0278—Knock-in vertebrates, e.g. humanised vertebrates
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K49/00—Preparations for testing in vivo
- A61K49/0004—Screening or testing of compounds for diagnosis of disorders, assessment of conditions, e.g. renal clearance, gastric emptying, testing for diabetes, allergy, rheuma, pancreas functions
- A61K49/0008—Screening agents using (non-human) animal models or transgenic animal models or chimeric hosts, e.g. Alzheimer disease animal model, transgenic model for heart failure
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
- C07K14/705—Receptors; Cell surface antigens; Cell surface determinants
- C07K14/70596—Molecules with a "CD"-designation not provided for elsewhere
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K16/00—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies
- C07K16/18—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies against material from animals or humans
- C07K16/28—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies against material from animals or humans against receptors, cell surface antigens or cell surface determinants
- C07K16/2896—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies against material from animals or humans against receptors, cell surface antigens or cell surface determinants against molecules with a "CD"-designation, not provided for elsewhere
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K16/00—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies
- C07K16/40—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies against enzymes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N5/00—Undifferentiated human, animal or plant cells, e.g. cell lines; Tissues; Cultivation or maintenance thereof; Culture media therefor
- C12N5/06—Animal cells or tissues; Human cells or tissues
- C12N5/0602—Vertebrate cells
- C12N5/0603—Embryonic cells ; Embryoid bodies
- C12N5/0606—Pluripotent embryonic cells, e.g. embryonic stem cells [ES]
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/16—Hydrolases (3) acting on ester bonds (3.1)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y301/00—Hydrolases acting on ester bonds (3.1)
- C12Y301/03—Phosphoric monoester hydrolases (3.1.3)
- C12Y301/03005—5'-Nucleotidase (3.1.3.5)
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N33/00—Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
- G01N33/48—Biological material, e.g. blood, urine; Haemocytometers
- G01N33/50—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
- G01N33/5005—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing involving human or animal cells
- G01N33/5008—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing involving human or animal cells for testing or evaluating the effect of chemical or biological compounds, e.g. drugs, cosmetics
- G01N33/5011—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing involving human or animal cells for testing or evaluating the effect of chemical or biological compounds, e.g. drugs, cosmetics for testing antineoplastic activity
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01K—ANIMAL HUSBANDRY; AVICULTURE; APICULTURE; PISCICULTURE; FISHING; REARING OR BREEDING ANIMALS, NOT OTHERWISE PROVIDED FOR; NEW BREEDS OF ANIMALS
- A01K2207/00—Modified animals
- A01K2207/15—Humanized animals
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01K—ANIMAL HUSBANDRY; AVICULTURE; APICULTURE; PISCICULTURE; FISHING; REARING OR BREEDING ANIMALS, NOT OTHERWISE PROVIDED FOR; NEW BREEDS OF ANIMALS
- A01K2227/00—Animals characterised by species
- A01K2227/10—Mammal
- A01K2227/105—Murine
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01K—ANIMAL HUSBANDRY; AVICULTURE; APICULTURE; PISCICULTURE; FISHING; REARING OR BREEDING ANIMALS, NOT OTHERWISE PROVIDED FOR; NEW BREEDS OF ANIMALS
- A01K2267/00—Animals characterised by purpose
- A01K2267/03—Animal model, e.g. for test or diseases
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01K—ANIMAL HUSBANDRY; AVICULTURE; APICULTURE; PISCICULTURE; FISHING; REARING OR BREEDING ANIMALS, NOT OTHERWISE PROVIDED FOR; NEW BREEDS OF ANIMALS
- A01K2267/00—Animals characterised by purpose
- A01K2267/03—Animal model, e.g. for test or diseases
- A01K2267/0331—Animal model for proliferative diseases
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2503/00—Use of cells in diagnostics
- C12N2503/02—Drug screening
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2510/00—Genetically modified cells
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2800/00—Nucleic acids vectors
- C12N2800/10—Plasmid DNA
- C12N2800/106—Plasmid DNA for vertebrates
- C12N2800/107—Plasmid DNA for vertebrates for mammalian
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N2333/00—Assays involving biological materials from specific organisms or of a specific nature
- G01N2333/435—Assays involving biological materials from specific organisms or of a specific nature from animals; from humans
- G01N2333/705—Assays involving receptors, cell surface antigens or cell surface determinants
- G01N2333/70596—Molecules with a "CD"-designation not provided for elsewhere in G01N2333/705
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N2500/00—Screening for compounds of potential therapeutic value
- G01N2500/10—Screening for compounds of potential therapeutic value involving cells
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Engineering & Computer Science (AREA)
- Organic Chemistry (AREA)
- Genetics & Genomics (AREA)
- Zoology (AREA)
- General Health & Medical Sciences (AREA)
- Biomedical Technology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biochemistry (AREA)
- Immunology (AREA)
- Wood Science & Technology (AREA)
- Molecular Biology (AREA)
- Biotechnology (AREA)
- Medicinal Chemistry (AREA)
- General Engineering & Computer Science (AREA)
- Microbiology (AREA)
- Biophysics (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Toxicology (AREA)
- Cell Biology (AREA)
- Urology & Nephrology (AREA)
- Gynecology & Obstetrics (AREA)
- Physics & Mathematics (AREA)
- Gastroenterology & Hepatology (AREA)
- Reproductive Health (AREA)
- Pathology (AREA)
- Hematology (AREA)
- Developmental Biology & Embryology (AREA)
- Environmental Sciences (AREA)
- Animal Behavior & Ethology (AREA)
- Veterinary Medicine (AREA)
- Tropical Medicine & Parasitology (AREA)
- General Physics & Mathematics (AREA)
- Analytical Chemistry (AREA)
- Animal Husbandry (AREA)
- Biodiversity & Conservation Biology (AREA)
- Food Science & Technology (AREA)
- Diabetes (AREA)
Abstract
The invention relates to humanized genetically modified non-human animals, in particular to genetically modified rodents, especially genetically modified mice, in particular to a construction method of an animal model for expressing humanized CD73 protein and application thereof in the field of biological medicine.
Description
Technical Field
The invention belongs to the fields of animal genetic engineering and genetic modification, and in particular relates to a CD73 gene humanized non-human animal, a construction method thereof and application thereof in the field of biological medicine.
Background
Immunotherapy attacks and kills cancer cells by activating the immune system, an important area of tumor research in recent years. Some drugs related to tumor immunotherapy have been used for treating cancer, and drugs are marketed and applied to various indications, such as monoclonal antibodies targeting T cell costimulatory molecules CTLA-4, PD-1 and their ligands, to achieve definite therapeutic effects, but the average response rate of patients is low. Considering that tumor occurrence is a result of the combined action of multiple links and factors, a single immunotherapy strategy has limited therapeutic effect, and a large number of patients can develop drug resistance to treatment, it is a trend of tumor treatment to combine two or more immunotherapy approaches in clinic. The development of more drugs that can be used to enhance the ability of the immune system to recognize and kill tumors has been one of the hot spots in immunological research.
CD73 is an extracellular-5' -nucleotidase encoded by the NT5E gene (also known as CD73 gene), and has a protein molecular weight of 70KD, and is widely expressed on the cell surface of human endothelial cells, lymphocytes such as Tregs (suppressor T cells), etc., and is anchored on the cell surface by Glucosyl Phosphatidylinositol (GPI) to form homodimers, but can also be cleaved and dissociated from the circulatory system. The ATP of the upstream CD73 converts Adenosine Monophosphate (AMP) with immune activation, the generated AMP is converted into adenosine by the CD73, the adenosine can bind to a downstream adenosine receptor, and then a series of signal paths related to immune activation such as LCK, MAPK, PKC are inhibited through adenosine signals, so that the immune activation-related signal paths are inhibited, and the immune activation-related signal paths play a role in immune inhibition and are an autoprotection mechanism of a human body.
The Tumor Microenvironment (TME) is composed of various cells, cytoplasm, enzymes, cytokines, metabolites and the like, has the characteristics of obvious hypoxia, low pH and high pressure, and has huge difference with normal tissues. CD73 is a typical Hypoxia Inducible Factor (HIF) target gene, and the tumor hypoxia environment induces HIF-1 up-regulation, which in turn results in a significant increase in CD73 expression in the tumor microenvironment, further catalyzing the production of adenosine, thereby suppressing the immune system and promoting tumor development and progression. The research shows that the expression of CD73 in various tumors is increased, and the prognosis of a non-small cell lung cancer patient with high expression of CD73 is poor, which indicates that tumor cells possibly use a CD73 mechanism to generate immune escape. Preclinical studies have shown that targeting CD73 can produce good antitumor effects (Stagg J, et al anti-CD73 antibody therapy inhibits breast tumor growth and metastasis.proc Natl Acad Sci U S A (2010) 107:1547-52.). In addition, treatment regimens such as PD-1/PD-L1 mab, chemotherapy, etc. may cause elevated tumor microenvironment CD73 expression, which in turn may be accompanied by inhibition of immune response by adenosine signaling (i.e., the appearance of acquired drug resistance), and the combination of CD73 blocking therapy with other immune molecule modulators (e.g., CTLA-4 antibodies, PD-1 antibodies) may help to achieve better therapeutic efficacy, is an attractive option (Beavis PA, et al Oncoimmunology.2015;4 (11): e 1046675.).
Because the target mechanism is clear and the preclinical research result is obvious, more than ten enterprises are developing CD73 antibody medicines worldwide at present, and a plurality of projects enter the early clinical stage, for example, the CD73 inhibition type monoclonal antibody BMS-986179 of BMS is in the clinical research of stage I, and the indication is advanced solid tumor; CD 73-inhibitory monoclonal antibody, olecalumab, developed by AZ/MedImmune, has entered phase II clinical studies on recurrent ovarian cancer. In addition, there are some small molecular compounds aimed at the target, such as a biotechnology innovation company named Arcus Biosciences, which is thrown in the field of 1.07 billions of dollars of Google 2017, 11-month repellents, and one of the main products of the company is a small molecular drug AB680 targeting CD73, so that early clinical research and clinical approval are completed at present. Considering that no drug aiming at the target is marketed at present, the existing drug can not meet clinical requirements, and more companies are expected to be put into drug development of the target in the future along with deep research.
The experimental animal disease model is an indispensable research tool for researching the etiology and pathogenesis of human disease occurrence, developing control technology and developing medicines. The existing animal models related to CD73 are mainly gene knockout mice, and are mainly used for researching CD73, adenosine biological functions (genotype, function and regulation) and related disease mechanisms. For example, hayo Castrop et al (2004), in order to investigate whether adenosine formation is dependent on extracellular nucleotide hydrolysis, prepared CD73 deleted mice by disrupting exons 2 and 3 of the CD73 gene, which were normal in behaviour and fertility compared to wild type mice, but the glomerular arteriole contractile capacity was significantly reduced upon Henle perfusion, concluding that production of adenosine on the glomeruli was largely dependent on CD 73-mediated dephosphorylation of AMP; patrycja Koszalka et al (2004) prepared CD73 knockout mice lacking exon 2 in order to investigate the effect of extracellular adenosine formed in vascular homeostasis, confirming that extracellular adenosine formed by CD73 can regulate coronary vascular tone and inhibit platelet activation. Linda F.Thompson et al (2004) prepared mice with exon 3 deleted from the CD73 gene in order to investigate whether CD73 was functionally regulating vascular permeability during basal conditions or hypoxia, confirming that CD73 is a key regulator of cardiovascular permeability in vivo. (Hayo cage, et al J Clin invest.2004Sep;114 (5): 634-42; koszalka P, et al circulation research.2004;95:814-821;Cornelia Blume,et al.PLoS One.2012;7 (5): e 37100.) (Thompson LF, et al JEXp Med 2004:1395-1405) although more research was carried out using these mice later, for example, cornelia Blume et al (2012) found Patrycja Koszalka et al to have reduced renal function in mice with CD73 deletion and autoimmune inflammation without induction. In summary, the existing model is mainly used for researching CD73 and adenosine biological functions (genotype, function and regulation) and related disease mechanisms.
CD73 functions by forming homodimers, each monomer comprising a binding site for an N-terminal domain and a C-terminal domain, the two domains being linked by a short helix, the two monomers being linked by a non-covalent bond and anchored to the plasma membrane after being linked to the GPI by a C-terminal serine residue. In the mature form, a 26 amino acid signal peptide in the N-terminal domain catalyzes the divalent zinc, cobalt, etc. metal ion in coordination, while the C-terminal domain is primarily the binding site of AMP. In the development of CD73 target-related drugs, due to the species differences between humans and mice, the accuracy of binding antibodies to target proteins sometimes reaches a single amino acid change in the protein (Joseph Conlon J, et al immunol 2013May 15;190 (10): 5216-5225), and therefore, ordinary mice cannot be used to screen and evaluate the efficacy of targeted human CD73 drugs. The in vivo model widely used for researching the drug effect of the antibody drug at present is a humanized tumor allograft mouse model, but the targeting and the specificity of the model in the research of specific targets are not strong, so the accuracy of the result of researching the drug effect by the model is not high.
A genetically humanized animal model, i.e., a normal or mutant gene animal model that more closely approximates the physiological or disease characteristics of a human, can be created by substituting a homologous gene of the animal genome with a normal or mutant gene of human origin using gene editing techniques. The humanized animal has important application value, such as the humanized animal model transplanted by cells or tissues can be improved and promoted by gene humanized, and more importantly, the humanized protein can be expressed or partially expressed in the animal body due to the insertion of human gene fragments, can be used as a target spot of a medicament capable of only recognizing the amino acid sequence of the human protein, and provides possibility for screening anti-human antibodies and other medicaments at animal level. However, due to the differences in physiology and pathology between animals and humans, coupled with the complexity of genes (i.e., genetic factors), it remains the greatest challenge to construct "efficient" humanized animal models for new drug development (Scheer N, snaith M, wolf CR, seibler J. Generation and utility of genetically humanized mouse models, drug Discov Today;18 (23-24): 1200-11, 2013).
Disclosure of Invention
In a first aspect of the invention, a method of constructing a non-human animal humanized with a CD73 gene is provided, wherein the non-human animal expresses human or humanized CD73 protein in vivo.
Preferably, the non-human animal has reduced or absent expression of endogenous CD73 protein.
Preferably, the humanized CD73 protein comprises all or a portion of a human CD73 protein. It is further preferred that all or part of the amino acid sequence encoded by exons 1 to 9 of the human CD73 gene is comprised.
In one embodiment of the invention, the amino acid sequence of the humanized CD73 protein comprises one of the following groups:
a) SEQ ID NO:4 or a part or all of the amino acid sequence shown in 4;
b) And SEQ ID NO:4 is at least 60%, 65%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or at least 99%;
c) And SEQ ID NO:4 of no more than 10, 9, 8, 7, 6, 5, 4, 3, 2 or no more than 1 amino acid; or alternatively, the first and second heat exchangers may be,
d) And SEQ ID NO:4, comprising substitution, deletion and/or insertion of one or more amino acid residues.
Preferably, the non-human animal genome comprises a human or humanized CD73 gene.
Preferably, the non-human animal genome comprises all or part of exons 1 to 9 of the human CD73 gene. Further preferred, one or a combination of two or more exons from exon 1 to exon 9 of the human CD73 gene are included.
In one embodiment of the present invention, exon 1, exon 2, exon 3, exon 4, exon 5, exon 6, exon 7, exon 8 and exon 9 of the human CD73 gene are included.
Preferably, all or part of exon 1, all or part of exons 2 to 8 and all or part of exons 9, preferably also introns 1-2 and/or introns 8-9, wherein part of exon 1 comprises at least a nucleotide sequence of 100bp, for example at least a nucleotide sequence of 100, 150, 200, 250, 300, 310, 330, 335, 336, 337, 338, 339, 340, 350, 370, 390, 396bp, further preferably 339 or 396 bp; the portion of exon 1 comprises from the start codon of exon 1 to the last nucleotide of exon 1, and the portion of exon 9 comprises at least a 50bp nucleotide sequence, for example at least a 50, 100, 150, 160, 161, 162, 163, 164, 165, 170, 190, 200, 500, 700, 900, 1000, 1500, 1700, 1900, 1944bp nucleotide sequence, further preferred a 164 or 1944bp nucleotide sequence; the portion of exon 9 comprises a stop codon from the first nucleotide of exon 9.
In a specific embodiment of the invention, all or part of the CDs sequence of the human CD73 gene or all or part of the full length nucleotide sequence of human CD73 is comprised.
In one embodiment of the invention, the non-human animal comprises SEQ ID NO:9 or 17.
Preferably, the non-human animal comprises all or part of the nucleotide sequence encoding human CD73 protein. Further preferred comprises a sequence encoding SEQ ID NO: 4.
In one embodiment of the invention, the humanized CD73 gene comprises one of the following groups:
a) SEQ ID NO:9 or SEQ ID NO:17 or a portion of the nucleotide sequence set forth in seq id no;
b) And SEQ ID NO:9 or SEQ ID NO:17 is at least 60%, 65%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or at least 99%;
c) And SEQ ID NO:9 or SEQ ID NO:17 of no more than 10, 9, 8, 7, 6, 5, 4, 3, 2 or no more than 1 nucleotide; or alternatively, the first and second heat exchangers may be,
d) And SEQ ID NO:9 or SEQ ID NO:17, a nucleotide sequence comprising one or more nucleotide residues that are substituted, deleted and/or inserted.
Preferably, the humanized CD73 gene further comprises a nucleotide sequence encoding a 2A peptide, preferably a nucleotide sequence encoding P2A or T2A. Further preferred comprises SEQ ID NO:10, and a nucleotide sequence shown in seq id no.
Preferably, the humanized CD73 gene further comprises a nucleotide sequence encoding SEQ ID NO: 11.
Preferably, the humanized CD73 gene further comprises a transcription termination sequence. Further preferred is one or a combination of two or more of 3' UTR, polyA, WPRE, STOP or lox 2. Still further preferred are those comprising a STOP sequence as set forth in SEQ ID NO:20 and/or the 3' utr sequence as set forth in SEQ ID NO: 22.
Preferably, the humanized CD73 gene further comprises a Neo cassette.
Preferably, the humanized CD73 gene further comprises endogenous regulatory sequences. More preferably, an endogenous promoter nucleotide sequence.
In one embodiment of the invention, the humanized CD73 gene comprises one of the following groups in order from 5 'to 3':
a) Encoding SEQ ID NO:11, a nucleotide sequence encoding a 2A peptide, a CDs sequence of the human CD73 gene, and a transcription termination sequence;
b) Encoding SEQ ID NO:11, a nucleotide sequence encoding a 2A peptide, a CDs sequence of the human CD73 gene, a transcription termination sequence, and a Neo cassette;
C) A CDs sequence of the human CD73 gene and/or a portion of the full-length nucleotide sequence of human CD73 and a transcription termination sequence;
d) A portion of the CDs sequence of the human CD73 gene and/or the full-length nucleotide sequence of human CD73, a transcription termination sequence, and a Neo cassette;
e) Endogenous regulatory sequences encoding SEQ ID NO:11, a nucleotide sequence encoding a 2A peptide, a CDs sequence of the human CD73 gene, and a transcription termination sequence;
f) Endogenous regulatory sequences encoding SEQ ID NO:11, a nucleotide sequence encoding a 2A peptide, a CDs sequence of the human CD73 gene, a transcription termination sequence, and a Neo cassette;
g) Endogenous regulatory sequences, a portion of the CDs sequence of the human CD73 gene and/or the full-length nucleotide sequence of human CD73, and transcription termination sequences;
h) Endogenous regulatory sequences, a portion of the CDs sequence of the human CD73 gene and/or the full-length nucleotide sequence of human CD73, transcription termination sequences, and Neo cassettes; or alternatively, the first and second heat exchangers may be,
i) A nucleotide sequence encoding a 2A peptide and a portion of the full length nucleotide sequence of human CD 73.
Preferably, the humanized CD73 gene comprises SEQ ID NO:23.
in one embodiment of the invention, the humanized CD73 gene comprises one of the following groups:
(i) Comprising SEQ ID NO: 12. 13 or 18, or a portion of a nucleotide sequence set forth in seq id no;
(ii) Comprising a sequence identical to SEQ ID NO: 12. 13 or 18 is at least 60%, 65%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or at least 99%;
(iii) Comprising a sequence identical to SEQ ID NO: 12. 13 or 18, no more than 10, 9, 8, 7, 6, 5, 4, 3, 2, or no more than 1 nucleotide; or alternatively, the first and second heat exchangers may be,
(iv) Comprising a sequence identical to SEQ ID NO: 12. 13 or 18, including substitutions, deletions and/or insertions of one or more nucleotides.
In one embodiment of the invention, the mRNA transcribed from the humanized CD73 gene comprises one of the following groups:
(i) Comprising SEQ ID NO:19 or a portion of the nucleotide sequence set forth in seq id no;
(ii) Comprising a sequence identical to SEQ ID NO:19 is at least 60%, 65%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or at least 99%;
(iii) Comprising a sequence identical to SEQ ID NO:19 does not differ by more than 10, 9, 8, 7, 6, 5, 4, 3, 2 or not more than 1 nucleotide; or alternatively, the first and second heat exchangers may be,
(iv) Comprising a sequence identical to SEQ ID NO:19, comprising substitutions, deletions and/or insertions of one or more nucleotides.
Preferably, the non-human animal genome further comprises additional genetic modifications, and more preferably, the additional genes comprise at least one of PD-1, PD-L1, CTLA-4, LAG-3, BTLA, CD27, CD28, CD47, CD137, CD154, OX40, SIRPalpha, TIGIT, TIM-3, CD40, and GITR genes.
Preferably, the construction method comprises introducing a nucleotide sequence comprising the human or humanized CD73 gene into the non-human animal CD73 locus. It is further preferred that all or part of exons 1 to 9 comprising the human CD73 gene is used to introduce the non-human animal CD73 locus.
In one embodiment of the present invention, the human CD73 gene is introduced into the CD73 locus with exon 1, exon 2, exon 3, exon 4, exon 5, exon 6, exon 7, exon 8 and exon 9 contained in the human CD73 gene.
Preferably, the non-human animal CD73 locus is introduced with a nucleic acid sequence comprising all or part of exon 1, all or part of exons 2 to 8 and all or part of exon 9, preferably further comprising introns 1-2 and/or introns 8-9, wherein part of exon 1 comprises at least a nucleotide sequence of 100bp, e.g. at least a nucleotide sequence of 100, 150, 200, 250, 300, 310, 330, 335, 336, 337, 338, 339, 340, 350, 370, 390, 396bp, further preferably a nucleotide sequence of 339 or 396 bp; the portion of exon 1 comprises from the start codon of exon 1 to the last nucleotide of exon 1, and the portion of exon 9 comprises at least a 50bp nucleotide sequence, for example at least a 50, 100, 150, 160, 161, 162, 163, 164, 165, 170, 190, 200, 500, 700, 900, 1000, 1500, 1700, 1900, 1944bp nucleotide sequence, further preferred a 164 or 1944bp nucleotide sequence; the portion of exon 9 comprises a stop codon from the first nucleotide of exon 9.
In one embodiment of the invention, the construction method comprises introducing a non-human animal CD73 locus with all or part of the CDs sequence comprising the human CD73 gene or all or part of the full length nucleotide sequence of human CD 73.
Preferably, the construction method comprises introducing a cDNA sequence comprising the human CD73 gene into the non-human animal CD73 locus.
In one embodiment of the invention, the construction method comprises using a nucleic acid sequence comprising SEQ ID NO: 9. 12, 13, 17 or 18 into the non-human animal CD73 locus.
Preferably, the construction method comprises introducing a non-human animal CD73 locus with a nucleic acid sequence comprising all or part of a nucleotide sequence encoding a human CD73 protein. Further preferred is a polypeptide comprising a sequence encoding SEQ ID NO:4 into the non-human animal CD73 locus.
Preferably, the nucleotide sequence of the human or humanized CD73 gene is operably linked to a regulatory element endogenous to the non-human animal. Further preferred are linked to a non-human animal promoter.
Preferably, the site of introduction is located after the endogenous regulatory element of the CD73 gene. Preferably, the endogenous regulatory element is an endogenous promoter.
Preferably, the site of introduction is located on exons 1 to 9 of the non-human animal CD73 gene. Further preferred are the full length nucleotide sequences located at the start codon to the stop codon of the CD73 gene or the non-human animal CD73 gene.
Preferably, the site of introduction is located on exons 2 to 9 of the non-human animal CD73 gene. Further preferred are located on exon 2 and/or intron 2-3.
Preferably, the introduction described in this application includes, but is not limited to, insertion, substitution or transgene.
Preferably, the non-human animal is homozygous or heterozygous.
Preferably, the humanized CD73 gene is contained on at least one chromosome in the genome of the non-human animal.
Preferably, at least one cell in said non-human animal expresses human or humanized CD73 protein.
Preferably, the construction of the non-human animal is performed using gene editing techniques including gene targeting techniques using embryonic stem cells, regular clustered interval short palindromic repeat (CRISPR/Cas 9) techniques, zinc Finger Nuclease (ZFN) techniques, transcription activator-like effector nuclease (TALEN) techniques, homing endonucleases (megabase megaribozymes) or other molecular biology techniques.
Preferably, the construction of the non-human animal is performed using a targeting vector, wherein the targeting vector comprises a portion of the human CD73 gene. It is further preferred that all or part of exons 1 to 9 of the human CD73 gene are contained. Even more preferably, the gene comprises one or a combination of two or more exons from exon 1 to exon 9 of the human CD73 gene.
In one embodiment of the present invention, exon 1, exon 2, exon 3, exon 4, exon 5, exon 6, exon 7, exon 8 and exon 9 of the human CD73 gene are included.
Preferably comprises all or part of exon 1, all or part of exons 2 to 8 and all or part of exons 9, more preferably also comprises introns 1-2 and/or introns 8-9, wherein part of exon 1 comprises at least a nucleotide sequence of 100bp, for example at least a nucleotide sequence of 100, 150, 200, 250, 300, 310, 330, 335, 336, 337, 338, 339, 340, 350, 370, 390, 396bp, further preferably comprises a nucleotide sequence of 339 or 396 bp; the portion of exon 1 comprises from the start codon of exon 1 to the last nucleotide of exon 1, and the portion of exon 9 comprises at least a 50bp nucleotide sequence, for example at least a 50, 100, 150, 160, 161, 162, 163, 164, 165, 170, 190, 200, 500, 700, 900, 1000, 1500, 1700, 1900, 1944bp nucleotide sequence, further preferred a 164 or 1944bp nucleotide sequence; the portion of exon 9 comprises a stop codon from the first nucleotide of exon 9.
In a specific embodiment of the invention, all or part of the CDs sequence of the human CD73 gene or all or part of the full length nucleotide sequence of human CD73 is comprised.
In a specific embodiment of the invention, a polypeptide encoding SEQ ID NO: 4.
In a specific embodiment of the invention, the polypeptide comprises SEQ ID NO: 9. 12, 13, 17 or 18.
Preferably, the targeting vector further comprises a 5 'arm (5' homology arm) and/or a 3 'arm (3' homology arm). The 5' arm is selected from 100-10000 nucleotides in length of non-human animal CD73 genomic DNA. Preferably, the 5' arm has at least 90% homology to NCBI accession number NC_ 000075.6. Further preferred, the 5' arm sequence hybridizes to SEQ ID NO: 5. 7 or 15 has at least 90% homology. The 3' arm is selected from 100-10000 nucleotides in length of non-human animal CD73 genomic DNA. Preferably, the 3' arm has at least 90% homology to NCBI accession number NC_ 000075.6. Further preferred, the 3' arm sequence hybridizes to SEQ ID NO: 6. 8 or 16 has at least 90% homology.
Preferably, the 5 'arm is a DNA fragment homologous to the 5' end of the transition region to be altered.
Preferably, the 3 'arm is a DNA fragment homologous to the 3' end of the transition region to be altered.
Preferably, the transition region to be altered is located on exons 1 to 9 of the non-human animal CD73 gene.
In one embodiment of the invention, it is located on exon 2 and/or intron 2-3 of the non-human animal CD73 gene.
In one embodiment of the invention, the non-human animal CD73 gene is located from the start codon to the stop codon.
In one embodiment of the present invention, the construction method comprises introducing the targeting vector into a non-human animal cell, culturing the cell (preferably an embryonic stem cell), transplanting the cultured cell into a female non-human animal oviduct, allowing the female non-human animal to develop, and identifying and screening the non-human animal to obtain the CD73 gene humanization.
Preferably, to increase recombination efficiency, construction of non-human animals can also be performed using sgrnas targeting the CD73 gene together with the targeting vector described above. Wherein the sgRNA targets the non-human animal CD73 gene while the sequence of the sgRNA is on the target sequence on the CD73 gene to be altered.
Preferably, the target site of the sgRNA is located on the sequence from exon 1 to exon 9 of the CD73 gene. Further preferred, the target site of the sgRNA is located on exon 2, intron 2-3 and/or exon 3 of the CD73 gene.
In a specific embodiment of the invention, the target site of the sgRNA is located on exon 2 and introns 2-3 of the CD73 gene.
Preferably, the target site sequence of the sgRNA is shown as SEQ ID NO: 26-41.
In one embodiment of the present invention, the construction method comprises introducing the targeting vector, the sgRNA targeting CD73 gene and Cas9 into a non-human animal cell, culturing the cell (preferably an embryonic stem cell), transplanting the cultured cell into a female non-human animal oviduct, allowing the female non-human animal to develop, and identifying and screening the non-human animal to obtain the CD73 gene humanized.
Preferably, the non-human animal is selected from any non-human animal that can be genetically engineered to be humanized, such as rodents, pigs, chickens, rabbits, monkeys, etc.
Preferably, the non-human animal is a non-human mammal. Further preferably, the non-human mammal is a rodent. Still more preferably, the rodent is a rat or mouse.
Preferably, the non-human animal is an immunodeficient non-human mammal. Further preferred, the immunodeficient non-human mammal is an immunodeficient rodent, an immunodeficient pig, an immunodeficient rabbit or an immunodeficient monkey. Still more preferably, the immunodeficient rodent is an immunodeficient rodent Defective mice or rats. Most preferably, the immunodeficient mouse is NOD-Prkdc scid IL-2rγ null Mouse, NOD-Rag 1 -/- -IL2rg -/- (NRG) mice, rag 2 -/- -IL2rg -/- (RG) mice, NOD/SCID mice or nude mice.
In a second aspect of the invention there is provided a non-human animal humanized with a CD73 gene, the non-human animal genome comprising a human or humanized CD73 gene.
Preferably, the non-human animal genome comprises all or part of exons 1 to 9 of the human CD73 gene. Further preferred, one or a combination of two or more exons from exon 1 to exon 9 of the human CD73 gene are included.
In one embodiment of the present invention, exon 1, exon 2, exon 3, exon 4, exon 5, exon 6, exon 7, exon 8 and exon 9 of the human CD73 gene are included.
Preferably, all or part of exon 1, all or part of exons 2 to 8 and all or part of exon 9 are comprised, preferably also introns 1-2 and/or introns 8-9, wherein part of exon 1 comprises from the start codon of exon 1 to the last nucleotide of exon 1 and part of exon 9 comprises from the first nucleotide of exon 9 to the stop codon.
In a specific embodiment of the invention, all or part of the CDs sequence of the human CD73 gene or all or part of the full length nucleotide sequence of human CD73 is comprised.
In one embodiment of the invention, the non-human animal comprises SEQ ID NO:9 or 17.
Preferably, the non-human animal comprises all or part of the nucleotide sequence encoding human CD73 protein. Further preferred comprises a sequence encoding SEQ ID NO: 4.
In one embodiment of the invention, the humanized CD73 gene comprises one of the following groups:
a) SEQ ID NO:9 or SEQ ID NO:17 or a portion of the nucleotide sequence set forth in seq id no;
b) And SEQ ID NO:9 or SEQ ID NO:17 is at least 60%, 65%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or at least 99%;
c) And SEQ ID NO:9 or SEQ ID NO:17 of no more than 10, 9, 8, 7, 6, 5, 4, 3, 2 or no more than 1 nucleotide; or alternatively, the first and second heat exchangers may be,
d) And SEQ ID NO:9 or SEQ ID NO:17, a nucleotide sequence comprising one or more nucleotide residues that are substituted, deleted and/or inserted.
Preferably, the humanized CD73 gene further comprises a 2A peptide, preferably P2A or T2A. Further preferred comprises SEQ ID NO:10, and a nucleotide sequence shown in seq id no.
Preferably, the humanized CD73 gene further comprises a nucleotide sequence encoding SEQ ID NO: 11.
Preferably, the humanized CD73 gene further comprises a transcription termination sequence. Further preferred is one or a combination of two or more of 3' UTR, polyA, WPRE or lox 2. Still further preferred comprises SEQ ID NO:20 and/or SEQ ID NO: 22.
Preferably, the humanized CD73 gene further comprises a Neo cassette.
Preferably, the humanized CD73 gene further comprises endogenous regulatory sequences. More preferably, an endogenous promoter nucleotide sequence.
In one embodiment of the invention, the humanized CD73 gene comprises one of the following groups in order from 5 'to 3':
a) Encoding SEQ ID NO:11, a nucleotide sequence encoding a 2A peptide, a CDs sequence of the human CD73 gene, and a transcription termination sequence;
b) Encoding SEQ ID NO:11, a nucleotide sequence encoding a 2A peptide, a CDs sequence of the human CD73 gene, a transcription termination sequence, and a Neo cassette;
C) A CDs sequence of the human CD73 gene and/or a portion of the full-length nucleotide sequence of human CD73 and a transcription termination sequence;
d) A portion of the CDs sequence of the human CD73 gene and/or the full-length nucleotide sequence of human CD73, a transcription termination sequence, and a Neo cassette;
e) Endogenous regulatory sequences encoding SEQ ID NO:11, a nucleotide sequence encoding a 2A peptide, a CDs sequence of the human CD73 gene, and a transcription termination sequence;
f) Endogenous regulatory sequences encoding SEQ ID NO:11, a nucleotide sequence encoding a 2A peptide, a CDs sequence of the human CD73 gene, a transcription termination sequence, and a Neo cassette;
g) Endogenous regulatory sequences, a portion of the CDs sequence of the human CD73 gene and/or the full-length nucleotide sequence of human CD73, and transcription termination sequences;
h) Endogenous regulatory sequences, a portion of the CDs sequence of the human CD73 gene and/or the full-length nucleotide sequence of human CD73, transcription termination sequences, and Neo cassettes; or alternatively, the first and second heat exchangers may be,
i) A nucleotide sequence encoding a 2A peptide and a portion of the full length nucleotide sequence of human CD 73.
Preferably, the humanized CD73 gene comprises SEQ ID NO:23.
in one embodiment of the invention, the humanized CD73 gene comprises one of the following groups:
(i) Comprising SEQ ID NO: 12. 13 or 18, or a portion of a nucleotide sequence set forth in seq id no;
(ii) Comprising a sequence identical to SEQ ID NO: 12. 13 or 18 is at least 60%, 65%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or at least 99%;
(iii) Comprising a sequence identical to SEQ ID NO: 12. 13 or 18, no more than 10, 9, 8, 7, 6, 5, 4, 3, 2, or no more than 1 nucleotide; or alternatively, the first and second heat exchangers may be,
(iv) Comprising a sequence identical to SEQ ID NO: 12. 13 or 18, including substitutions, deletions and/or insertions of one or more nucleotides.
In one embodiment of the invention, the mRNA transcribed from the humanized CD73 gene comprises one of the following groups:
(i) Comprising SEQ ID NO:19 or a portion of the nucleotide sequence set forth in seq id no;
(ii) Comprising a sequence identical to SEQ ID NO:19 is at least 60%, 65%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or at least 99%;
(iii) Comprising a sequence identical to SEQ ID NO:19 does not differ by more than 10, 9, 8, 7, 6, 5, 4, 3, 2 or not more than 1 nucleotide; or alternatively, the first and second heat exchangers may be,
(iv) Comprising a sequence identical to SEQ ID NO:19, comprising substitutions, deletions and/or insertions of one or more nucleotides.
Preferably, the non-human animal expresses human or humanized CD73 protein in vivo.
Preferably, the non-human animal has reduced or absent expression of endogenous CD73 protein.
Preferably, the humanized CD73 protein comprises all or a portion of a human CD73 protein. It is further preferred that all or part of the amino acid sequence encoded by exons 1 to 9 of the human CD73 gene is comprised.
In one embodiment of the invention, the amino acid sequence of the humanized CD73 protein comprises one of the following groups:
a) SEQ ID NO:4 or a part or all of the amino acid sequence shown in 4;
b) And SEQ ID NO:4 is at least 60%, 65%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or at least 99%;
c) And SEQ ID NO:4 of no more than 10, 9, 8, 7, 6, 5, 4, 3, 2 or no more than 1 amino acid; or alternatively, the first and second heat exchangers may be,
d) And SEQ ID NO:4, comprising substitution, deletion and/or insertion of one or more amino acid residues.
Preferably, the non-human animal genome further comprises additional genetic modifications, and more preferably, the additional genes comprise at least one of PD-1, PD-L1, CTLA-4, LAG-3, BTLA, CD27, CD28, CD47, CD137, CD154, OX40, SIRPalpha, TIGIT, TIM-3, CD40, and GITR genes.
Preferably, the non-human animal is selected from any non-human animal that can be genetically edited to produce a humanized gene, such as rodents, pigs, rabbits, monkeys, etc.
Preferably, the non-human animal is a non-human mammal. Further preferably, the non-human mammal is a rodent. Still more preferably, the rodent is a rat or mouse.
Preferably, the non-human animal is an immunodeficient non-human mammal. Further preferred, the immunodeficient non-human mammal is an immunodeficient rodent, an immunodeficient pig, an immunodeficient rabbit or an immunodeficient monkey. Still more preferably, the immunodeficient rodent is an immunodeficient mouse or rat. Most preferably, the immunodeficient mouse is NOD-Prkdc scid IL-2rγ null Mouse, NOD-Rag 1 -/- -IL2rg -/- (NRG) mice, rag 2 -/- -IL2rg -/- (RG) mice, NOD/SCID mice or nude mice.
In a third aspect of the invention, there is provided a targeting vector comprising a portion of the human CD73 gene.
Preferably, the targeting vector comprises all or part of exons 1 to 9 of the human CD73 gene. Further preferred are combinations comprising one or more than two exons from exon 1 to exon 9 of the human CD73 gene.
In one embodiment of the present invention, exon 1, exon 2, exon 3, exon 4, exon 5, exon 6, exon 7, exon 8 and exon 9 of the human CD73 gene are included.
Preferably comprises all or part of exon 1, all or part of exons 2 to 8 and all or part of exons 9, more preferably also comprises introns 1-2 and/or introns 8-9, wherein part of exon 1 comprises at least a nucleotide sequence of 100bp, for example at least a nucleotide sequence of 100, 150, 200, 250, 300, 310, 330, 335, 336, 337, 338, 339, 340, 350, 370, 390, 396bp, further preferably comprises a nucleotide sequence of 339 or 396 bp; the portion of exon 1 comprises from the start codon of exon 1 to the last nucleotide of exon 1, and the portion of exon 9 comprises at least a 50bp nucleotide sequence, for example at least a 50, 100, 150, 160, 161, 162, 163, 164, 165, 170, 190, 200, 500, 700, 900, 1000, 1500, 1700, 1900, 1944bp nucleotide sequence, further preferred a 164 or 1944bp nucleotide sequence; the portion of exon 9 comprises a stop codon from the first nucleotide of exon 9.
In a specific embodiment of the invention, all or part of the CDs sequence of the human CD73 gene or all or part of the full length nucleotide sequence of human CD73 is comprised.
In a specific embodiment of the invention, a polypeptide encoding SEQ ID NO: 4.
In a specific embodiment of the invention, the polypeptide comprises SEQ ID NO: 9. 12, 13, 17 or 18.
Preferably, the targeting vector further comprises a 5 'arm (5' homology arm) and/or a 3 'arm (3' homology arm). The 5' arm is selected from 100-10000 nucleotides in length of non-human animal CD73 genomic DNA. Preferably, the 5' arm has at least 90% homology to NCBI accession number NC_ 000075.6. Further preferred, the 5' arm sequence hybridizes to SEQ ID NO: 5. 7 or 15 has at least 90% homology. The 3' arm is selected from 100-10000 nucleotides in length of non-human animal CD73 genomic DNA. Preferably, the 3' arm has at least 90% homology to NCBI accession number NC_ 000075.6. Further preferred, the 3' arm sequence hybridizes to SEQ ID NO: 6. 8 or 16 has at least 90% homology.
Preferably, the targeting vector further comprises a marker gene. Further preferably, the marker gene is a gene encoding a negative selection marker. Still more preferably, the gene encoding the negative selection marker is the diphtheria toxin A subunit encoding gene (DTA).
In one embodiment of the invention, the targeting vector further comprises a resistance gene selected from positive clones. Further preferably, the resistance gene screened by the positive clone is neomycin phosphotransferase coding sequence Neo.
In one embodiment of the present invention, the targeting vector further comprises a specific recombination system. Further preferably, the specific recombination system is a Frt recombination site (conventional LoxP recombination systems may also be selected). The specific recombination system is provided with two Frt recombination sites which are respectively connected with two sides of the resistance gene.
Preferably, the non-human animal is selected from any non-human animal that can be genetically edited to produce a humanized gene, such as rodents, pigs, rabbits, monkeys, etc.
Preferably, the non-human animal is a non-human mammal. Further preferably, the non-human mammal is a rodent. Still more preferably, the rodent is a rat or mouse.
Preferably, the non-human animal is an immunodeficient non-human mammal. Further preferred, the immunodeficient non-human mammal is an immunodeficient rodent, an immunodeficient pig, an immunodeficient rabbit or an immunodeficient monkey. Still more preferably, the immunodeficient rodent is an immunodeficient mouse or rat. Most preferably, the immunodeficient mouse is NOD-Prkdc scid IL-2rγ null Mouse, NOD-Rag 1 -/- -IL2rg -/- (NRG) mice, rag 2 -/- -IL2rg -/- (RG) mice, NOD/SCID mice or nude mice.
In a fourth aspect of the invention there is provided a sgRNA which targets the non-human animal CD73 gene, with the sequence of the sgRNA being on the target sequence on the CD73 gene to be altered.
Preferably, the target site of the sgRNA is located on the sequence from exon 1 to exon 9 of the CD73 gene. Further preferred, the target site of the sgRNA is located on exon 2, intron 2-3 and/or exon 3 of the CD73 gene.
In a specific embodiment of the invention, the target site of the sgRNA is located on exon 2 and introns 2-3 of the CD73 gene.
Preferably, the target site sequence of the sgRNA is shown as SEQ ID NO: 26-41.
In a fifth aspect of the invention there is provided a DNA molecule encoding the sgRNA described above. Preferably, the DNA molecule double strand is respectively an upstream sequence and a downstream sequence of the sgRNA or a forward and reverse oligonucleotide sequence after adding an enzyme cutting site.
In one embodiment of the present invention, the DNA molecule double strand is SEQ ID NO:32 and SEQ ID NO:42.
in one embodiment of the present invention, the DNA molecule double strand is SEQ ID NO:39 and SEQ ID NO:45.
In one embodiment of the present invention, the DNA molecule double strand is SEQ ID NO:43 and SEQ ID NO:44.
in one embodiment of the present invention, the DNA molecule double strand is SEQ ID NO:46 and SEQ ID NO:47.
in a sixth aspect of the invention there is provided an sgRNA vector comprising the above-described sgRNA.
In a seventh aspect of the invention, there is provided a cell comprising the targeting vector described above, the sgRNA described above, the DNA molecule described above or the sgRNA vector described above.
In an eighth aspect of the invention, there is provided the use of the targeting vector, the sgRNA, the DNA molecule, the sgRNA vector or the cell described above for CD73 gene modification. Preferably comprises the use in knockout, insertion or replacement of the CD73 gene.
In a ninth aspect of the invention, there is provided a humanized CD73 gene, said humanized CD73 gene comprising a portion of a human CD73 gene.
Preferably, the humanized CD73 gene comprises all or part of exons 1 to 9 of the human CD73 gene. Further preferred, one or a combination of two or more exons from exon 1 to exon 9 of the human CD73 gene are included.
In one embodiment of the present invention, exon 1, exon 2, exon 3, exon 4, exon 5, exon 6, exon 7, exon 8 and exon 9 of the human CD73 gene are included.
Preferably, all or part of exon 1, all or part of exons 2 to 8 and all or part of exons 9, more preferably also introns 1-2 and/or introns 8-9, wherein part of exon 1 comprises at least a nucleotide sequence of 100bp, for example at least a nucleotide sequence of 100, 150, 200, 250, 300, 310, 330, 335, 336, 337, 338, 339, 340, 350, 370, 390, 396bp, further preferably a nucleotide sequence of 339 or 396 bp; the portion of exon 1 comprises from the start codon of exon 1 to the last nucleotide of exon 1, and the portion of exon 9 comprises at least a 50bp nucleotide sequence, for example at least a 50, 100, 150, 160, 161, 162, 163, 164, 165, 170, 190, 200, 500, 700, 900, 1000, 1500, 1700, 1900, 1944bp nucleotide sequence, further preferred a 164 or 1944bp nucleotide sequence; the portion of exon 9 comprises a stop codon from the first nucleotide of exon 9.
In a specific embodiment of the invention, all or part of the CDs sequence of the human CD73 gene or all or part of the full length nucleotide sequence of human CD73 is comprised.
Preferably, the humanized CD73 gene comprises all or a portion of a nucleotide sequence encoding a human CD73 protein. Further preferred comprises a sequence encoding SEQ ID NO: 4.
In one embodiment of the invention, the humanized CD73 gene comprises one of the following groups:
a) SEQ ID NO:9 or SEQ ID NO:17 or a portion of the nucleotide sequence set forth in seq id no;
b) And SEQ ID NO:9 or SEQ ID NO:17 is at least 60%, 65%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or at least 99%;
c) And SEQ ID NO:9 or SEQ ID NO:17 of no more than 10, 9, 8, 7, 6, 5, 4, 3, 2 or no more than 1 nucleotide; or alternatively, the first and second heat exchangers may be,
d) And SEQ ID NO:9 or SEQ ID NO:17, a nucleotide sequence comprising one or more nucleotide residues that are substituted, deleted and/or inserted.
Preferably, the humanized CD73 gene further comprises a nucleotide sequence encoding a 2A peptide, preferably a nucleotide sequence encoding P2A or T2A. Further preferred comprises SEQ ID NO:10, and a nucleotide sequence shown in seq id no.
Preferably, the humanized CD73 gene further comprises a nucleotide sequence encoding SEQ ID NO: 11.
Preferably, the humanized CD73 gene further comprises a transcription termination sequence, preferably one or a combination of two or more of 3' utr, polyA, WPRE, STOP or lox 2. Further preferred are those comprising a STOP sequence as set forth in SEQ ID NO:20 and/or 3' UTR sequences as set forth in SEQ ID NO: 22.
Preferably, the humanized CD73 gene further comprises a Neo cassette.
Preferably, the humanized CD73 gene further comprises endogenous regulatory sequences. Further preferred are endogenous promoter nucleotide sequences.
In one embodiment of the invention, the humanized CD73 gene comprises one of the following groups in order from 5 'to 3':
a) Encoding SEQ ID NO:11, a nucleotide sequence encoding a 2A peptide, a CDs sequence of the human CD73 gene, and a transcription termination sequence;
b) Encoding SEQ ID NO:11, a nucleotide sequence encoding a 2A peptide, a CDs sequence of the human CD73 gene, a transcription termination sequence, and a Neo cassette;
c) A CDs sequence of the human CD73 gene and/or a portion of the full-length nucleotide sequence of human CD73 and a transcription termination sequence;
d) A portion of the CDs sequence of the human CD73 gene and/or the full-length nucleotide sequence of human CD73, a transcription termination sequence, and a Neo cassette;
E) Endogenous regulatory sequences encoding SEQ ID NO:11, a 2A peptide, a CDs sequence and a transcription termination sequence of the human CD73 gene;
f) Endogenous regulatory sequences encoding SEQ ID NO:11, a 2A peptide, a CDs sequence of the human CD73 gene, a transcription termination sequence, and a Neo cassette;
g) Endogenous regulatory sequences, a portion of the CDs sequence of the human CD73 gene and/or the full-length nucleotide sequence of human CD73, and transcription termination sequences;
h) Endogenous regulatory sequences, a portion of the CDs sequence of the human CD73 gene and/or the full-length nucleotide sequence of human CD73, transcription termination sequences, and Neo cassettes; or alternatively, the first and second heat exchangers may be,
i) A nucleotide sequence encoding a 2A peptide and a portion of the full length nucleotide sequence of human CD 73.
Preferably, the humanized CD73 gene comprises SEQ ID NO:23.
in one embodiment of the invention, the humanized CD73 gene comprises one of the following groups:
(i) Comprising SEQ ID NO: 12. 13 or 18, or a portion of a nucleotide sequence set forth in seq id no;
(ii) Comprising a sequence identical to SEQ ID NO: 12. 13 or 18 is at least 60%, 65%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or at least 99%;
(iii) Comprising a sequence identical to SEQ ID NO: 12. 13 or 18, no more than 10, 9, 8, 7, 6, 5, 4, 3, 2, or no more than 1 nucleotide; or alternatively, the first and second heat exchangers may be,
(iv) Comprising a sequence identical to SEQ ID NO: 12. 13 or 18, including substitutions, deletions and/or insertions of one or more nucleotides.
In one embodiment of the invention, the mRNA transcribed from the humanized CD73 gene comprises one of the following groups:
(i) Comprising SEQ ID NO:19 or a portion of the nucleotide sequence set forth in seq id no;
(ii) Comprising a sequence identical to SEQ ID NO:19 is at least 60%, 65%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or at least 99%;
(iii) Comprising a sequence identical to SEQ ID NO:19 does not differ by more than 10, 9, 8, 7, 6, 5, 4, 3, 2 or not more than 1 nucleotide; or alternatively, the first and second heat exchangers may be,
(iv) Comprising a sequence identical to SEQ ID NO:19, comprising substitutions, deletions and/or insertions of one or more nucleotides.
In a tenth aspect of the invention, there is provided a humanized CD73 protein, said humanized CD73 protein comprising a portion of a human CD73 protein.
Preferably, the humanized CD73 protein comprises all or a portion of the amino acid sequence encoded by exons 1 to 9 of a human CD73 gene. Further preferably, the humanized CD73 protein is encoded by the humanized CD73 gene described above.
In one embodiment of the invention, the amino acid sequence of the humanized CD73 protein comprises one of the following groups:
a) SEQ ID NO:4 or a part or all of the amino acid sequence shown in 4;
b) And SEQ ID NO:4 is at least 60%, 65%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or at least 99%;
c) And SEQ ID NO:4 of no more than 10, 9, 8, 7, 6, 5, 4, 3, 2 or no more than 1 amino acid; or alternatively, the first and second heat exchangers may be,
d) And SEQ ID NO:4, comprising substitution, deletion and/or insertion of one or more amino acid residues.
In an eleventh aspect of the present invention, there is provided a polygenic modified non-human animal, wherein the non-human animal is the non-human animal described above or the non-human animal obtained by the above construction method, and wherein the genome of the non-human animal comprises a modification of at least one of the genes PD-1, PD-L1, CTLA-4, LAG-3, BTLA, CD27, CD28, CD47, CD137, CD154, OX40, SIRPalpha, TIGIT, TIM-3, CD40 and GITR.
In a twelfth aspect of the present invention, there is provided a method of constructing a polygenically modified non-human animal comprising the steps of:
providing the non-human animal or the non-human animal obtained by the construction method;
and (II) mating the non-human animal provided in the step (I) with other non-human animals modified by genes, performing in vitro fertilization or directly performing gene editing, and screening to obtain the non-human animal modified by multiple genes.
Preferably, the other genetically modified non-human animal comprises a humanized non-human animal of at least one of the genes PD-1, PD-L1, CTLA-4, LAG-3, BTLA, CD27, CD28, CD47, CD137, CD154, OX40, SIRPalpha, TIGIT, TIM-3, CD40 and GITR.
Preferably, each of the plurality of genes humanized in the genome of the polygenously modified non-human animal may be homozygous or heterozygous.
In a thirteenth aspect of the invention, there is provided a non-human animal or progeny thereof obtained by the above construction method.
In a fourteenth aspect of the present invention, there is provided a tumor-bearing or inflammation model of an animal, said tumor-bearing or inflammation model being derived from a non-human animal as described above or a non-human animal obtained by the above construction method.
In a fifteenth aspect of the present invention, there is provided a method of constructing a tumor-bearing or inflammation model of an animal, the method comprising the step of obtaining a CD73 gene humanized non-human animal or a polygenic modified non-human animal using the above construction method.
In a sixteenth aspect, the present invention provides a use of the non-human animal described above or a non-human animal obtained by the above construction method in the preparation of a tumor-bearing or inflammatory animal model.
In a seventeenth aspect of the invention there is provided a cell or cell line or primary cell culture derived from a non-human animal as described above or a non-human animal obtained by a method of construction as described above, or a tumor-bearing or inflammation model as described above. Preferably, the cell or cell line or primary cell culture is incapable of developing into an animal subject.
In an eighteenth aspect of the invention, there is provided a tissue or organ or culture thereof derived from the non-human animal described above or the non-human animal obtained by the above construction method, or the tumor-bearing or inflammation model described above. Preferably, the tissue or organ or culture thereof is incapable of developing into an animal subject.
In a nineteenth aspect of the present invention, there is provided a tumor tissue after tumor-bearing, said tumor tissue being derived from the above-described non-human animal or the above-described non-human animal obtained by the construction method, or the above-described tumor-bearing or inflammation model. Preferably, the tumor tissue after tumor-bearing cannot develop into an animal individual.
In a twentieth aspect of the present invention, there is provided a CD73 gene humanized cell comprising a human or humanized CD73 gene. Preferably comprising the humanized CD73 gene described above.
Preferably, the cells express human or humanized CD73 protein. Further preferred are the above-mentioned humanized CD73 proteins of the present invention. Preferably, the cells are unable to develop into an individual animal.
In a twenty-first aspect of the present invention, there is provided a CD73 gene-deleted cell which lacks all or part of exons 1 to 9 of CD73 gene. Preferably, all or part of exon 2 is deleted.
In a specific embodiment of the invention, the deletion encodes the sequence of SEQ ID NO:2 from nucleotide sequence 180 to 189.
In another embodiment of the invention, the nucleotide sequence of the CD73 gene is deleted starting from the start codon and ending at the stop codon.
Preferably, the cells are unable to develop into an individual animal.
In a twenty-second aspect of the present invention, there is provided a non-human animal having a deletion of all or part of exons 1 to 9 of a CD73 gene. Preferably, all or part of exon 2 is deleted.
In a specific embodiment of the invention, the deletion encodes the sequence of SEQ ID NO:2 from nucleotide sequence 180 to 189.
In another embodiment of the invention, the nucleotide sequence of the CD73 gene is deleted starting from the start codon and ending at the stop codon.
In a twenty-third aspect of the present invention, there is provided a method for constructing a non-human animal or a cell with a CD73 gene deletion, wherein the construction method uses the sgRNA.
In a twenty-fourth aspect of the present invention, there is provided a construct comprising the humanized CD73 gene described above.
Preferably, the construct expresses a human or humanized CD73 protein, which humanized CD73 protein is a humanized CD73 protein as described above.
In a twenty-fifth aspect of the invention, there is provided a cell comprising the construct described above.
Preferably, the cells are unable to develop into an individual animal.
In a twenty-sixth aspect of the invention, there is provided a tissue comprising the above-described cells.
Preferably, the tissue is unable to develop into an individual animal.
In a twenty-seventh aspect of the present invention there is provided use of a non-human animal derived from the above described non-human animal, the above described construction method, the above described humanized CD73 gene, the above described humanized CD73 protein, the above described tumor-bearing or inflammation model, the above described cell or cell line or primary cell culture, the above described tissue or organ or culture thereof, the above described tumor-bearing tissue, the above described cell, the above described construct, the above described cell or the above described tissue in the product development of an immune process involving human cells, in the manufacture of antibodies, or as a model system for pharmacological, immunological, microbiological, medical research; or in the production and use of animal experimental disease models for the development of new diagnostic and/or therapeutic strategies; or in screening, verifying, evaluating or researching CD73 channel function, human CD73 channel signal mechanism, human targeting antibody, human targeting medicine, medicine effect, immune related disease medicine and antitumor medicine, screening and evaluating human medicine and medicine effect research.
Preferably, the use is not a method of treatment and/or diagnosis of a disease.
In a twenty-eighth aspect of the present invention, there is provided a method of screening for a human CD 73-specific modulator, the method comprising administering the modulator to an individual; wherein the individual is selected from the non-human animals described above, the non-human animals obtained by the above construction method, or the tumor-bearing or inflammation model described above.
Preferably, the screening method further comprises implanting tumor cells into the individual.
Preferably, the modulator is selected from CAR-T, a drug; preferably, the drug is an antibody.
Preferably, the screening method further comprises detecting tumor suppression.
Preferably, the regulator is monoclonal antibody or bispecific antibody or the combination of two or more drugs.
Preferably, the detection comprises determining the size and/or proliferation rate of the tumour cells.
Preferably, the method of detection comprises vernier caliper measurement, flow cytometry detection and/or animal live imaging detection.
Preferably, the detecting comprises assessing an individual's weight, fat mass, activation pathway, neuroprotective activity, or metabolic change, including a change in food consumption or water consumption.
Preferably, the tumor cells are derived from a human or non-human animal.
Preferably, the screening method is not a therapeutic method. The method is used for screening or evaluating medicines, detecting and comparing the medicine effects of candidate medicines to determine which candidate medicines can be taken as medicines and which can not be taken as medicines, or comparing the medicine effect sensitivity degree of different medicines, namely that the treatment effect is not necessarily the same, but is only one possibility.
In a twenty-ninth aspect of the present invention, there is provided an evaluation method of an intervention program, the evaluation method comprising implanting tumor cells into an individual, applying the intervention program to the individual implanted with the tumor cells, and detecting and evaluating a tumor suppression effect of the individual after applying the intervention program; wherein the individual is selected from the group consisting of the non-human animal described above, the non-human animal obtained by the above construction method, the non-human animal described above or its progeny, or the tumor-bearing or inflammation model described above.
Preferably, the intervention regimen is selected from the group consisting of CAR-T, drug therapy. Further preferably, the drug is an antigen binding protein. The antibody binding protein is an antibody.
Preferably, the tumor cells are derived from a human or non-human animal.
Preferably, the method of evaluation of the intervention regimen is not a therapeutic method. The evaluation method detects and evaluates the effect of the intervention program to determine whether the intervention program has a therapeutic effect, i.e. the therapeutic effect is not necessarily, but is just one possibility.
In a thirty-first aspect of the present invention there is provided a non-human animal derived from the above, a non-human animal obtained by the above construction method, a non-human animal or progeny thereof, a tumor-bearing or inflammation model as described above for use in the manufacture of a human CD 73-specific modulator.
In a thirty-first aspect of the present invention, there is provided the use of a non-human animal derived from the above, a non-human animal obtained by the above construction method, a non-human animal or progeny thereof, a tumor-bearing or inflammatory model as described above, in the manufacture of a medicament for the treatment of a tumor or autoimmune disease.
The humanized non-human animal of the CD73 gene can normally express human or humanized CD73 protein in vivo, can be used for drug screening, drug effect evaluation, autoimmune diseases and tumor treatment aiming at a target site of a human CD73 channel, and can accelerate the development process of new drugs and save time and cost.
The "immune-related diseases" described herein include, but are not limited to, allergy, asthma, dermatitis, myocarditis, nephritis, hepatitis, systemic lupus erythematosus, rheumatoid arthritis, scleroderma, hyperthyroidism, primary thrombocytopenic purpura, autoimmune hemolytic anemia, ulcerative colitis, autoimmune liver disease, diabetes, pain or neurological disorders, and the like.
The "tumor" as described herein includes, but is not limited to, lymphoma, brain cancer, non-small cell lung cancer, cervical cancer, esophageal cancer, leukemia, ovarian cancer, nasopharyngeal cancer, breast cancer, endometrial cancer, colon cancer, rectal cancer, gastric cancer, bladder cancer, lung cancer, bronchial cancer, bone cancer, prostate cancer, pancreatic cancer, liver and bile duct cancer, esophageal cancer, renal cancer, thyroid cancer, head and neck cancer, testicular cancer, glioblastoma, astrocytoma, melanoma, myelodysplastic syndrome, and sarcomas. Wherein the leukemia is selected from acute lymphoblastic (lymphoblastic) leukemia, acute myelogenous leukemia, chronic lymphocytic leukemia, multiple myeloma, plasma cell leukemia, and chronic myelogenous leukemia; the lymphoma is selected from hodgkin's lymphoma and non-hodgkin's lymphoma, including B-cell lymphoma, diffuse large B-cell lymphoma, follicular lymphoma, mantle cell lymphoma, marginal zone B-cell lymphoma, T-cell lymphoma, and waldenstrom's macroglobulinemia; the sarcoma is selected from osteosarcoma, ewing sarcoma, leiomyosarcoma, synovial sarcoma, soft tissue sarcoma, angiosarcoma, liposarcoma, fibrosarcoma, rhabdomyosarcoma, and chondrosarcoma.
The invention relates to all or part of the whole, the whole is the whole, the part is the part of the whole or the whole individual.
The "humanized CD73 protein" described herein includes a portion derived from a human CD73 protein and a portion not derived from a human CD73 protein. Wherein the "humanized CD73 protein" comprises a sequence of 5-574 amino acids which are identical to the amino acid sequence of human CD73 protein, either continuously or at intervals.
The "humanized CD73 gene" described in the present invention includes a portion derived from a human CD73 gene and a portion derived from a non-human CD73 gene. Wherein the "humanized CD73 gene" comprises 20-46000bp nucleotide sequences which are continuous or spaced and are identical to the nucleotide sequence of the human CD73 gene, preferably 20-1725, 20-45928bp, more preferably 20, 50, 100, 200, 300, 400, 500, 600, 700, 800, 900, 1000, 1100, 1200, 1300, 1400, 1500, 1600, 1700, 1725, 1800, 1900, 2000, 2500, 3000, 5000, 10000, 20000, 30000, 40000, 45000 or 45928bp nucleotide sequences which are continuous or spaced and are identical to the nucleotide sequence of the human CD73 gene.
The "exon 1 to exon 9" or "all of the exons 1 to 9" described in the present invention includes the nucleotide sequences of the exons 1 and their introns, i.e., the exons 1, 1 to 2, 2 to 3, 3 to 4, 4 to 5, 5 to 6, 6 to 7, 7 to 8, 8 to 9 and 9 exons.
The "intron 1-2" as used herein means an intron between exon 1 and exon 2.
The "locus" as used herein refers broadly to the location of a gene on a chromosome, and in a narrow sense to a DNA fragment on a gene, either a gene or a portion of a gene. For example, the "CD73 locus" means a DNA fragment of an optional stretch on exons 1 to 9 of the CD73 gene. In one embodiment of the present invention, the introduced CD73 locus may be a DNA fragment of an optional stretch on exons 1 to 9 of the CD73 gene.
The "nucleotide sequence" as used herein includes natural or modified ribonucleotide sequences and deoxyribonucleotide sequences. Preferably DNA, cDNA, pre-mRNA, mRNA, rRNA, hnRNA, miRNAs, scRNA, snRNA, siRNA, sgRNA, tRNA.
The term "treatment" as used herein means slowing, interrupting, arresting, controlling, stopping, alleviating, or reversing the progression or severity of a sign, symptom, disorder, condition, or disease, but does not necessarily refer to the complete elimination of all disease-related signs, symptoms, conditions, or disorders. The term "treatment" or the like refers to a therapeutic intervention that ameliorates signs, symptoms, etc. of a disease or pathological state after the disease has begun to develop.
"homology" as used herein means that a person skilled in the art, while maintaining a structure or function similar to a known sequence, can adjust the sequence according to actual working requirements, using sequences having (including but not limited to) 1%,2%,3%,4%,5%,6%,7%,8%,9%,10%,11%,12%,13%,14%,15%,16%,17%,18%,19%,20%,21%,22%,23%,24%,25%,26%,27%,28%,29%,30%,31%,32%,33%,34%,35%,36%,37%,38%,39%,40%,41%,42%,43%,44%,45%,46%,47%,48%,49%,50%,51%,52%,53%,54%,55%,56%,57%,58%,59%,60%,70%,80%,81%,82%,83%,84%,85%,86%,87%,88%,89%,90%,91%,92%,93%,94%,95%,96%,97%,98%, 99.99.3%, 99.99.7%, 99.99.99%, 99.9%, as compared with sequences obtained in the prior art.
One skilled in the art can determine and compare sequence elements or degrees of identity to distinguish additional mouse and human sequences.
In one aspect, the non-human animal is a mammal. In one aspect, the non-human animal is a small mammal, such as a murine. In one embodiment, the genetically humanized non-human animal is a rodent. In one embodiment, the rodent is selected from a mouse, a rat, and a hamster. In one embodiment, the rodent is selected from a murine family. In one embodiment, the genetically modified animal is from the family of the hamsidae (e.g., hamster-like), hamsidae (e.g., hamster, new world rats and mice, voles), murine superfamily (true mice and rats, gerbils, spiny rats, coronary rats), equine island murine (mountain climbing mice, rock mice, tailed rats, motor gas rats and mice), spiny murine (e.g., spiny sleeping rats) and mole murine (e.g., mole rats, bamboo rats and zokors). In a particular embodiment, the genetically modified rodent is selected from the group consisting of a true mouse or rat (murine superfamily), a gerbil, a spiny mouse, and a coronary rat. In one embodiment, the genetically modified mouse is from a member of the murine family. In one embodiment, the animal is a rodent. In a particular embodiment, the rodent is selected from a mouse and a rat. In one embodiment, the non-human animal is a mouse.
In a particular embodiment, the non-human animal is a rodent, which is a mouse selected from the group consisting of BALB/C, A/He, A/J, A/WySN, AKR, AKR/A, AKR/J, AKR/N, TA1, TA2, RF, SWR, C3H, C BR, SJL, C57L, DBA/2, KM, NIH, ICR, CFW, FACA, C BL/A, C57BL/An, C57BL/GrFa, C57BL/KaLwN, C57BL/6J, C BL/6ByJ, C57BL/6NJ, C57BL/10ScSn, C57BL/10Cr, and C57BL/Ola of the C57BL, C58, CBA/Br, CBA/Ca, CBA/J, CBA/st, CBA/H strain.
The practice of the present invention will employ, unless otherwise indicated, conventional techniques of cell biology, cell culture, molecular biology, transgenic biology, microbiology, recombinant DNA and immunology. These techniques are explained in detail in the following documents. For example: molecular Cloning A Laboratory Manual,2ndEd., by Sambrook, fritschand Maniatis (Cold Spring Harbor Laboratory Press:1989); DNA Cloning, volumes I and II (D.N.Glcovered., 1985); oligonucleotide Synthesis (m.j. Gaited., 1984); mullisetal, u.s.pat.no.4, 683, 195; nucleic Acid Hybridization (B.D.Hames & S.J.Higginseds.1984); transcription And Translation (B.D.Hames & S.J.Higginseds.1984); culture Of Animal Cells (R.I.Freshney, alanR.Liss, inc., 1987); immobilized Cells And Enzymes (IRL Press, 1986); perbal, A Practical Guide To Molecular Cloning (1984); the services, methods In ENZYMOLOGY (j. Abelson and m. Simon, eds. Inch, academic Press, inc., new York), special, vols.154and 155 (wuetal. Eds.) and vol.185, "Gene Expression Technology" (d. Goeddel, ed.); gene Transfer Vectors For Mammalian Cells (j.h.miller and M.P.Caloseds.,1987,Cold Spring Harbor Laboratory); immunochemical Methods In Cell And Molecular Biology (Mayer and Walker, eds., academic Press, london, 1987); handbook Of Experimental Immunology, volumes V (d.m. weir and c.c. blackwell, eds., 1986); and Manipulating the Mouse Embryo, (Cold Spring Harbor Laboratory Press, cold Spring Harbor, n.y., 1986).
All patents and publications mentioned in this specification are incorporated herein by reference in their entirety. It will be appreciated by those skilled in the art that certain changes may be made thereto without departing from the spirit or scope of the invention. The following examples further illustrate the invention in detail and are not to be construed as limiting the scope of the invention or the particular methods described herein.
Drawings
Embodiments of the present invention are described in detail below with reference to the attached drawing figures, wherein:
fig. 1: schematic comparison of mouse and human CD73 genes (not to scale).
Fig. 2: CD73 Gene humanized mice CD73 gene schematic diagram one (not to scale).
Fig. 3: targeting strategy used to prepare CD73 gene humanized mice is shown schematically (not to scale).
Fig. 4: southern Blot results, wherein ZL001 to ZL018 are mouse numbers, M is Marker, WT is wild type control, H 2 O is water control.
Fig. 5: rat tail PCR of F1 generation to identify somatic genotype, wherein 1F 1001-1F 1011 are rat tail numbers, WT is wild type control, M is Marker, H 2 O is water control.
Fig. 6: schematic (not to scale) of the mouse FRT recombination process for CD73 gene humanization;
fig. 7: targeting strategies employed to prepare CD73 gene humanized mice are shown schematically (not to scale).
Fig. 8: sgRNA activity assay, wherein Con is the negative control and PC is the positive control.
Fig. 9: genotyping results of somatic cells of F0-generation mice, wherein F0-01 to F0-14 are the mouse numbers, WT is wild-type control, M is Marker, H 2 O is water control.
Fig. 10: genotyping results of F1-generation mouse somatic cells, wherein 2F1-01 to 2F1-11 are the mouse numbers, PC is the positive control, WT is the wild-type control, M is Marker, H 2 O is water control.
Fig. 11: southern blot detection results, wherein 2F1-01 to 2F1-11 are mouse numbers and WT is wild type control.
Fig. 12: CD73 gene humanized mice CD73 gene schematic two (not to scale).
Fig. 13: targeting strategies employed to prepare CD73 gene humanized mice are shown schematically (not to scale).
Fig. 14: genotyping results of somatic cells of F1-generation mice, wherein 3F1-01 to 3F1-11 are the number of mice, M is Marker, H 2 O is water control;
fig. 15: RT-PCR detection results are shown in a schematic way, wherein +/- + is wild type control, H/H is CD73 gene humanized homozygous mouse, H 2 O is water control;
fig. 16: flow detection result of leukocyte subpopulation ratio in spleen;
fig. 17: flow-through detection of T cell subpopulations in spleen;
Fig. 18: flow-through detection of leukocyte subpopulations in lymph nodes;
fig. 19: flow detection results of T cell subpopulations in lymph nodes;
fig. 20: flow detection results of the white blood cell subpopulation ratio in the peripheral blood;
fig. 21: flow detection results of T cell subpopulations in peripheral blood;
fig. 22: blood routine detection results;
fig. 23: and (5) blood biochemical detection results.
Detailed Description
The invention will be further described with reference to specific embodiments, and advantages and features of the invention will become apparent from the description. These examples are merely exemplary and do not limit the scope of the invention in any way. It will be understood by those skilled in the art that various changes and substitutions of details and forms of the technical solution of the present invention may be made without departing from the spirit and scope of the present invention, but these changes and substitutions fall within the scope of the present invention.
In each of the following examples, the devices and materials were obtained from several companies as indicated below:
c57BL/6 mice and Flp tool mice were purchased from the national rodent laboratory animal seed center of China food and drug inspection institute;
AIO kit source Beijing Bai Osai Gene biotechnology Co., ltd., product number BCG-DX-004;
APC anti-mouse CD73 anti-body (mCD 73 APC) was purchased from Biolegend under the accession number 127209;
PerCP/Cy55 Anti-mouse TCR Beta Chain Antibody (mTcRβPerCP) was purchased from Biolegend under the accession number 109228;
PE anti-human CD73 (Ecto-5' -nucleotidase) anti-body (hCD 73 PE) was purchased from Biolegend under the accession number 344003;
FITC anti-mouse CD19 anti-body (hCD 19 FITC) was purchased from Biolegend under the accession number 115506;
PE anti-human CD73 anti-body (hCD 73 PE) was purchased from Biolegend under the accession number 344003;
PE/Cy TM 7anti-mouse/rat Foxp3 anti-body (mFoxp 3 PE) from eBioscience under the trade designation 25-5773-82;
Brilliant Violet 510 TM anti-mouse CD4 anti-body (mCD 4) was purchased from Biolegend under the accession number 100559;
APC anti-mouse CD73 anti-body (mCD 73 APC) was purchased from Biolegend under the accession number 127209.
The present invention contemplates engineering a non-human animal (e.g., a mouse) to contain all or part of the gene sequence encoding human CD73 protein in the non-human animal so that it expresses human or humanized CD73 protein in vivo. Non-human mammals expressing human CD73 protein can be obtained by various gene editing systems and methods of preparation, including but not limited to gene targeting techniques using embryonic stem cells (embryonic stem cell, ES), zinc Finger Nuclease (ZFN) techniques, transcription activator-like effector nuclease (TALEN) techniques, homing endonucleases (megabase megaribozymes), regular clustered interval short palindromic repeat (Clustered Regularly Interspaced Short Palindromic Repeats, CRISPR) techniques, or other molecular biology techniques. The embodiment of the invention mainly takes the gene homologous recombination technology of ES cells as an example to describe how the humanized mice can be obtained by constructing a specific targeting mouse CD73 gene targeting vector to realize sequence replacement.
EXAMPLE 1CD73 Gene humanized mice
The comparison of the mouse CD73 Gene (NCBI Gene ID:23959,Primary source:MGI:99782,UniProt ID:Q61503, based on transcript NM-011851.4 (SEQ ID NO: 1) and its encoded protein NP-035981.1 (SEQ ID NO: 2)) and the human CD73 Gene (NCBI Gene ID:4907,Primary source:HGNC:8021,UniProt ID:P21589, based on transcript NM-002526.4 (SEQ ID NO: 3) and its encoded protein NP-002517.1 (SEQ ID NO: 4)) is shown in FIG. 1.
Whereas human or mouse CD73 has multiple subtypes or transcripts, the methods described herein are applicable to other subtypes or transcripts.
For the purposes of the present invention, a gene sequence encoding a human CD73 protein may be introduced at an endogenous mouse CD73 locus such that the mouse expresses the human or humanized CD73 protein. The mouse cells are modified by gene editing techniques, for example, by inserting a sequence encoding the human CD73 protein into the endogenous mouse CD73 locus exon 2. In order to allow more stable and better expression of human CD73 protein in mice, the mouse promoter was maintained and P2A was inserted before the human CD73 coding sequence (as shown in SEQ ID NO: 10). The schematic diagram of the finally obtained modified humanized mouse CD73 gene is shown in figure 2, the mouse regulates and controls the expression of a humanized CD73 sequence through an endogenous promoter, and the CD73 protein expressed in vivo is human CD73 protein; and further designed a targeting strategy as shown in figure 3. Because the inserted recombination sequences have transcription termination sequences such as stop codons and the like, and partial sequences of the coding region of the mice are deleted, the mouse coding region on the humanized mouse CD73 gene cannot be transcribed and translated normally.
In the schematic of the targeting strategy shown in FIG. 3, the targeting vector is shown containing sequences of the upstream homology arm and downstream homology arm of mouse CD73, and the A-1 fragment (SEQ ID NO: 13) comprising P2A, human CD73 sequence, mouse 3' UTR and STOP sequence. Wherein the upstream homology arm sequence (5 'homology arm, SEQ ID NO: 7) is identical to the 88348101-88352454 nucleotide sequence of NCBI accession NC_000075.6, and the downstream homology arm sequence (3' homology arm, SEQ ID NO: 8) is identical to the 88354132-88358698 nucleotide sequence of NCBI accession NC_ 000075.6; the human CD73 sequence (SEQ ID NO: 9) is identical to nucleotide sequence 58-1782 of NCBI accession No. NM-002526.4; STOP sequence is shown in SEQ ID NO:20, the sequence of the 3' UTR of the mouse is shown as SEQ ID NO: shown at 22. The A-1 fragment also comprises a resistance gene for positive clone screening, namely neomycin phosphotransferase coding sequence Neo, and two site-specific recombination system Frt recombination sites which are arranged in the same direction are arranged on two sides of the resistance gene to form a Neo box (neocassette). Wherein, P2A, human CD73 sequence, mouse 3' UTR and STOP sequence and Neo box sequence are arranged in sequence from 5' to 3' on A-1 fragment, and sequence containing human CD73 gene is directly connected with upstream homology arm sequence; the ligation downstream of the Neo cassette and the murine locus was designed to be 5' -GCGGAACCCTTCGAAGTTCCTATTCTCTAGAAAGTATAGGAA CTTCcc agagcccaaggcactcattattagcactgggtgagctccatg-3' (SEQ ID NO: 23), wherein the sequence "CTTCThe "second" C "is the last nucleotide of the Neo cassette, sequence"ccagaThe "first" c "of the" is the first nucleotide of the murine sequence.
In addition, a coding gene (coding gene for diphtheria toxin A subunit (DTA)) with a negative selection marker was also constructed downstream of the targeting vector 3' homology arm.
Targeting vector construction can be performed by conventional methods, such as enzyme digestion ligation, and the like. The constructed targeting vector can be subjected to preliminary verification through enzyme digestion and then sent to a sequencing company for sequencing verification. The sequencing verification correct targeting vector is transfected into embryonic stem cells of a C57BL/6 mouse through electroporation, the obtained cells are screened by utilizing a positive clone screening marker gene, the integration condition of exogenous genes is detected and confirmed by utilizing PCR and Southern Blot technology, correct positive clone cells are screened, clones which are positive through PCR identification are subjected to Southern Blot detection, the PCR detection result is shown in fig. 4, 18 clones are positive clones, and ZL004, ZL007, ZL009, ZL010, ZL011, ZL012, ZL013, ZL014, ZL016 and ZL018 are positive clones and are not inserted randomly through sequencing.
The PCR assay included the following primers:
Mut F1:5’-GAGTGGTATGTGCAGCTCAGGGAAG-3’(SEQ ID NO:24),
Mut R1:5’-GCTCCAAGAAACGAAAAGGCCGAGA-3’(SEQ ID NO:25);
the positive clone selected was introduced into the isolated blastula (white mouse) according to the technique known in the art, and the resulting chimeric blastula was transferred to a culture medium for a short time and then transplanted into the oviduct of a recipient master mouse (white mouse) to produce F0-generation chimeric mice (black-white interphase). And backcrossing the F0 generation chimeric mice and wild mice to obtain F1 generation mice, detecting by using Mut F1 primers and Mut R1 primers, wherein the detection results are shown in figure 5, and obtaining positive F1 mice by 11F 1 generation mice which are all positive mice, and then mating the F1 generation heterozygous mice with each other to obtain the F2 generation homozygous mice. The positive mice and Flp tool mice can also be mated to remove the positive clone screening marker gene (the process is schematically shown in figure 6), and then the humanized homozygous mice (V1) of the CD73 gene expressing the human CD73 protein can be obtained through the mating.
Other gene editing methods can also be used to obtain humanized mice, such as gene editing using a CRISPR/Cas system, designing a targeting strategy as shown in fig. 7, in which fig. 7 shows a targeting vector comprising homologous arm sequences upstream and downstream of the mouse CD73 gene, and an a-2 fragment comprising the human CD73 nucleotide sequence, wherein the upstream homologous arm sequence (5 'homology arm, SEQ ID NO: 5) is identical to NCBI accession nc_000075.6 at 88350685-88352454, and the downstream homologous arm sequence (3' homology arm, SEQ ID NO: 6) is identical to NCBI accession nc_000075.6 at 88353811-88354943; wherein the nucleotide sequence of the A-2 fragment is shown as SEQ ID NO. 12, and the nucleotide sequence of the human CD73 is shown as SEQ ID NO. 9, which is identical to the nucleotide sequence of the A-1 fragment human CD73 in FIG. 3. The protein sequences of the modified humanized mice are SEQ ID NO. 4 respectively.
Targeting vector construction can be performed by conventional methods, such as enzyme digestion ligation, and the like. After the constructed targeting vector is subjected to primary verification through enzyme digestion, the targeting vector is sent to a sequencing company for sequencing verification. The targeting vector with correct sequencing verification was used for subsequent experiments.
The target sequence determines the targeting specificity of the sgrnas and the efficiency of inducing Cas9 cleavage of the gene of interest. Therefore, efficient and specific target sequence selection and design are a prerequisite for construction of sgRNA expression vectors. The sgRNA sequence that recognizes the target site is designed and synthesized. The target sites are located on exon 2 and introns 2-3 of the CD73 gene, and the target site sequence of each sgRNA on CD73 is as follows:
SgRNA1 target site sequence (SEQ ID NO: 26): 5'-AGCGCAAACATTAAGGCACGGGG-3';
SgRNA2 target site sequence (SEQ ID NO: 27): 5'-TTAATGTTTGCGCTCAGAATTGG-3';
SgRNA3 target site sequence (SEQ ID NO: 28): 5'-GAGCGCAAACATTAAGGCACGGG-3';
SgRNA4 target site sequence (SEQ ID NO: 29): 5'-AATTCTGAGCGCAAACATTAAGG-3';
SgRNA5 target site sequence (SEQ ID NO: 30): 5'-CCTCTAGCACATCAGATATCTGG-3';
SgRNA6 target site sequence (SEQ ID NO: 31): 5'-TGAGCGCAAACATTAAGGCACGG-3';
SgRNA7 target site sequence (SEQ ID NO: 32): 5'-GATTGTTGGATATACTTCAAAGG-3';
SgRNA8 target site sequence (SEQ ID NO:): 5'-TTGGAAATTTAACGTTTCTGAGG-3';
SgRNA9 target site sequence (SEQ ID NO: 34): 5'-CATCACTTGCGGGGAAGTCCTGG-3';
SgRNA10 target site sequence (SEQ ID NO: 35): 5'-TGCTGAGTCCATCACTTGCGGGG-3';
SgRNA11 target site sequence (SEQ ID NO: 36): 5'-CAGGACTTCCCCGCAAGTGATGG-3';
SgRNA12 target site sequence (SEQ ID NO: 37): 5'-GTCTGTAACCCCGATAAGCTTGG-3';
SgRNA13 target site sequence (SEQ ID NO: 38): 5'-TTGCATCTGCGGAGTTAGCATGG-3';
SgRNA14 target site sequence (SEQ ID NO: 39): 5'-GCCATCGACCATTCGTGCACAGG-3';
SgRNA15 target site sequence (SEQ ID NO: 40): 5'-CCCTGTGCACGAATGGTCGATGG-3';
SgRNA16 target site sequence (SEQ ID NO: 41): 5'-AGTGGAGCCCTGTGCACGAATGG-3';
the activity of a plurality of sgrnas was detected using UCA kit, and it was found from the results that the sgrnas have different activities, and the detection results are shown in table 1 and fig. 8.
Table 1: sgRNA relative Activity assay results
Although the activity of the sgrnas 2, 4 and 8 is relatively low, which may be caused by the specificity of the target site sequence, according to our experiments, the values of the sgrnas 2, 4 and 8 are still significantly higher than those of the control group, and the sgrnas 2, 4 and 8 can still be judged to be active, the activity meets the requirement of the gene targeting experiment, and the sgrnas 7 and 14 are selected for subsequent experiments. Forward oligonucleotides and reverse oligonucleotides were obtained by adding cleavage sites to the 5' end and the complementary strand, respectively (see Table 2), and after annealing, the annealed products were ligated to pT7-sgRNA plasmids (plasmids were linearized with BbsI) to obtain expression vectors pT7-CD73-7 and pT7-CD73-14, respectively.
TABLE 2 forward and reverse oligonucleotide sequences of sgRNA7 and sgRNA14
pT7-sgRNA vector A fragment DNA (SEQ ID NO: 21) containing the T7 promoter and sgRNA scaffold was synthesized by plasmid synthesis company and ligated to a backbone vector (source Takara, cat. No. 3299) by cleavage (EcoRI and BamHI) in sequence, and the results were verified by sequencing by a professional sequencing company, indicating that the plasmid of interest was obtained.
Wild-type C57BL/6 mouse fertilized eggs are taken, and the in vitro transcription products of pT7-CD73-7 and pT7-CD73-14 plasmids (transcribed by using an Ambion in vitro transcription kit according to the method of the specification), targeting vectors and Ca s9 mRNA are premixed and injected into cytoplasm or nucleus of the mouse fertilized eggs by using a microinjection instrument. Microinjection of fertilized eggs was performed according to the method of the "mouse embryo handling laboratory Manual (third edition)" (andela, nagel, chemical industry Press, 2006), the fertilized eggs after injection were transferred into a culture medium for short-term culture, then transplanted into oviducts of recipient mice for development, and the obtained mice (F0 generation) were subjected to hybridization and selfing to expand population numbers and establish stable CD73 humanized mice.
The genotype of somatic cells of F0 mice (the primer sequences are the same) can be identified by the same PCR method as described in this example, and the identification results of some F0 mice are shown in FIG. 9, in which 5 mice numbered F0-01, F0-02, F0-04, F0-11 and F0-13 are positive mice, and the 8 mice are further verified to be positive mice by sequencing without random insertion.
WT-F:5’-TTTGATAACGGTGTGGAAGGACTGAT-3’(SEQ ID NO:48)
WT-R:5’-AGACAGCGATGCCCTTCAGAGGATA-3’(SEQ ID NO:49)
WT-F:5’-TTTGATAACGGTGTGGAAGGACTGAT-3’(SEQ ID NO:48)
Mut-R:5’-TACACGGTGAACCAGATAGTGCCC-3’(SEQ ID NO:50)
GT-F:5’-GTGGGCGGAATCCATGTGGTGTATG-3’(SEQ ID NO:51)
GT-R:5’-GCCCCAGGGCGATGATCTTATTCAC-3’(SEQ ID NO:52)
CD73 humanized mice identified as positive for F0 were mated with C57BL/6 wild type mice to give F1 mice. The same PCR method can be used to genotype F1 mice, and the results are shown in FIG. 10, which shows that 11 mice numbered 2F1-01 to 2F1-11 are positive mice. Southern blot was performed on these 11 mice identified as positive by F1 PCR to confirm the presence of random insertions. Cutting rat tail to extract genome DNA, digesting the genome with BglII enzyme or NcoI enzyme, transferring film and hybridizing. The probes 5'probe and 3' probe are respectively positioned on the left side of the 5 'homology arm and the 3' homology arm, and the lengths of specific probes and target fragments are shown in Table 3.
TABLE 3 lengths of specific probes and fragments of interest
Restriction enzyme | Probe with a probe tip | Wild fragment size | Recombinant sequence fragment size |
BglII | 5’Probe | 3.1kb | 5.2kb |
NcoI | lox2 Stop Probe(3’) | -- | 4.6kb |
The probe synthesis primers were as follows:
5’Probe-F(SEQ ID NO:53):5’-AGAGACCCTTTAGACCTGGATCAGC-3’
5’Probe-R(SEQ ID NO:54):5’-ACCTAGAAAACTTTGGTGGGACATGA-3’
lox2 Stop Probe(3’)-F(SEQ ID NO:55):5’-AACTGATGAATGGGAGCAGTGGTGG-3’
lox2 Stop Probe(3’)-R(SEQ ID NO:56):5’-GCAGACACTCTATGCCTGTGTGGAG-3’
the Southern blot detection results are shown in FIG. 11. The results of the 5' probe and STOP probe were combined and further verified by sequencing that none of the 11 mice numbered 2F1-01 to 2F1-11 had random insertions, confirming that these 11 mice were positive heterozygous mice and that random insertions did not exist. This shows that the CD73 humanized genetically engineered mice (V1) can be constructed with stable passage and no random insertion by using the method.
RT-PCR can be used to detect expression of humanized CD73mRNA in a CD73 gene humanized mouse. 1 wild C57BL/6 mice and humanized CD73 homozygote mice of 7 weeks old are respectively selected, bladder tissues are taken after neck removal and euthanasia, total RNA of cells is extracted, reverse transcription is carried out to cDNA by using a reverse transcription kit, PCR amplification is carried out, and primer sequences are shown in Table 4.
TABLE 4 RT PCR detection primer sequences and target fragment lengths
The detection results showed (see FIG. 15) that murine CD73mRNA expression was detectable in wild-type C57BL/6 mouse cells and humanized CD73mRNA expression was not detected; humanized CD73mRNA expression was detected in humanized CD73 homozygous mouse cells, and murine CD73mRNA expression was not detected.
In order to obtain the CD73 humanized mouse, the human CD73 genome sequence can be replaced to the mouse locus, the mouse CD73 gene can be modified through a BAC transgenic method, the schematic diagram of the finally obtained humanized mouse CD73 gene is shown in FIG. 12, and a targeting strategy diagram shown in FIG. 13 is designed. The targeting vector is shown to contain sequences of the upstream and downstream homology arms of mouse CD73, and the A-3 fragment comprising the sequence of the human CD73 gene. Wherein the upstream homology arm sequence (5 'homology arm, SEQ ID NO: 15) is identical to the 88323954-88327685 nucleotide sequence of NCBI accession NC_000075.6, and the downstream homology arm sequence (3' homology arm, SEQ ID NO: 16) is identical to the 88372532-88377482 nucleotide sequence of NCBI accession NC_ 000075.6; the human CD73 sequence (SEQ ID NO: 17) is identical to nucleotide sequence 85450140-85496067 of NCBI accession NC_ 000006.12; the A-3 fragment (shown as SEQ ID NO: 18) also comprises a resistance gene for positive clone screening, namely neomycin phosphotransferase coding sequence Neo, and two site-specific recombination system LOXP recombination sites which are arranged in the same direction are arranged on two sides of the resistance gene.
In addition, a coding gene (coding gene for diphtheria toxin A subunit (DTA)) with a negative selection marker was also constructed downstream of the targeting vector 3' homology arm.
Targeting vector construction can be performed by conventional methods, such as enzyme digestion ligation, and the like. The constructed targeting vector can be subjected to preliminary verification through enzyme digestion and then sent to a sequencing company for sequencing verification. And (3) carrying out electroporation transfection of a targeting vector with correct sequencing verification into embryonic stem cells of a C57BL/6 mouse, screening the obtained cells by utilizing a positive clone screening marker gene, detecting and confirming the integration condition of exogenous genes by utilizing PCR and Southern Blot technology, screening out correct positive clone cells, identifying positive clones by PCR, carrying out Southern Blot detection, displaying the PCR detection result as positive clones, carrying out sequencing confirmation, and selecting positive clones without random insertion for the next step.
The PCR assay included the following primers:
L-GT-F:5’-AAGATGTTTCTTGTCCCGGGTTTCG-3’(SEQ ID NO:57)
L-GT-R:5’-GCAGTAAATGAGAGCTGCGTGAGGG-3’(SEQ ID NO:14)
the positive clone selected was introduced into the isolated blastula (white mouse) according to the technique known in the art, and the resulting chimeric blastula was transferred to a culture medium for a short time and then transplanted into the oviduct of a recipient master mouse (white mouse) to produce F0-generation chimeric mice (black-white interphase). The F0 generation chimeric mice were back crossed with wild type mice to obtain F1 generation mice, which were tested using L-GT-F and L-GT-R primers, the test results are shown in FIG. 14, and the results show that 11 mice were all positive mice. And then mating the F1 generation heterozygote mice with each other to obtain the F2 generation homozygous mice. The positive mice and Cre tool mice can be mated to remove the positive clone screening marker genes, and then the humanized homozygote mice (V2) of the CD73 gene expressing the human CD73 protein can be obtained through the mutual mating. The coding region and the protein sequence of the mRNA sequence of the humanized mouse after transformation are SEQ ID NO. 19 and SEQ ID NO. 4 respectively.
EXAMPLE 2 expression of humanized proteins in CD73 Gene humanized mice
The expression of the humanized CD73 protein in the mice can be confirmed by a conventional detection method, and in this example, protein expression detection is performed on the two humanized CD73 gene mice obtained in example 1 by flow cytometry.
1 mouse of wild type C57BL/6 mice of 8 weeks old, and humanized homozygous mice of V1 version and V2 version of CD73 gene were selected, spleen cells were taken after euthanasia, and anti-mouse TCR beta Antibody PerCP/Cy5.5 anti-mouse TCR beta chain anti-body (mTcRbeta PerCP), anti-mouse CD19 Antibody FITC anti-mouse CD19 anti-body (hCD 19 FITC), anti-human CD73 Antibody PE anti-human CD73 anti-body (hCD 73 PE), anti-mouse Foxp3 Antibody PE/CyTM7 anti-mouse/rate Foxp3 anti-body (mFoxp 3 PE), anti-mouse CD4 Antibody Brilliant Violet 510, respectively TM anti-mouse CD4 anti-body (mCD 4) and anti-mouse CD73 Antibody APC anti-mouse CD73 (mCD 73 APC) identification staining followed by flow detection, the results are shown below:
TABLE 5 stream test results
The above results indicate that murine CD73 protein was detected in C57BL/6 mice and that human CD73 protein was not detected; human CD73 protein was detected in V1 and V2 versions of CD73 gene humanized homozygous mice, and no murine CD73 protein was detected.
Further flow assays were performed for the spleen, lymph node and white blood cells and T cell immunophenotyping in peripheral blood of wild type C57BL/6 mice and V1 version CD73 humanized homozygous mice. The results of the leukocyte subtype and the T-cell subtype detection in spleen are shown in FIG. 16 and FIG. 17, respectively, and it can be seen from the figures that the leukocyte subtypes such as B cells, T cells, NK cells, CD4+ T cells, CD8+ T cells, granulocytes (Granulocyte), DC cells, macrophages (Macrophage) and monocytes (Monocyte) in spleen samples of mice homozygous for the CD73 gene humanized are substantially identical to those of C57BL/6 wild-type mice (FIG. 16), and the T-cell subtype percentages such as CD4+ T cells, CD8+ T cells and Tregs cells are identical to those of C57BL/6 wild-type mice (FIG. 17).
The results of the detection of the leukocyte subtypes in lymph nodes and T cell subtypes are shown in FIG. 18 and FIG. 19, respectively, and it can be seen from the figures that the leukocyte subtypes such as B cells, T cells, NK cells, CD4+ T cells, CD8+ T cells and the like in the lymph node samples of the humanized homozygous CD73 gene mice are consistent with those of the wild type C57BL/6 mice (FIG. 18), and the T cell subtype percentages such as CD4+ T cells, CD8+ T cells and Tregs cells are substantially consistent with those of the wild type C57BL/6 mice (FIG. 19).
The results of the detection of leukocyte subtypes and T cell subtypes in peripheral blood are shown in FIG. 20 and FIG. 21, respectively, and it can be seen from the figures that the percentages of the leukocyte subtypes such as B cells, T cells, NK cells, CD4+ T cells, CD8+ T cells, granulocytes (Granulocyte), DC cells, macrophages (Macrophage) and monocytes (Monocyte) in the peripheral blood samples of the CD73 humanized homozygous mice are substantially identical to those of the C57BL/6 wild-type mice (FIG. 20), and the percentages of the T cell subtypes such as CD4+ T cells, CD8+ T cells and Tregs cells are substantially identical to those of the C57BL/6 wild-type mice (FIG. 21).
The above results indicate that the humanized modification of the CD73 gene did not affect the differentiation, development and distribution of leukocytes and T cells in mice, in spleen, lymphoid tissues and peripheral blood.
Further, wild-type C57BL/6 mice and CD73 humanized homozygous mice were collected, and subjected to blood routine and blood biochemical tests, the blood routine results are shown in FIG. 22, and the results of white blood cell count (WBC), red blood cell count (RBC), hemoglobin concentration (HB), hematocrit (HCT), mean red blood cell volume (MCV), mean red blood cell hemoglobin content (MCH), mean red blood cell hemoglobin concentration (MCHC), red blood cell distribution width (RDW), platelet count (PLT), mean Platelet Volume (MPV), neutrophil count (NE), lymphocyte count (LY) and monocyte count (MO) in the peripheral blood samples of the CD73 humanized homozygous mice are substantially identical to those of the C57BL/6 wild-type mice; the results of biochemical blood tests are shown in FIG. 23, and the results of glutamic pyruvic transaminase (ALT), glutamic oxaloacetic transaminase (AST), serum Total Cholesterol (TC), creatinine (CR), blood Glucose (GLU), triglyceride (TRIG), UREA (UREA) and serum Albumin (ALB) in the peripheral blood sample of the CD73 gene humanized homozygous mice are substantially identical to those of the C57BL/6 wild type mice. The above results indicate that humanized engineering of the CD73 gene does not affect the composition and morphology of blood cells, nor does it alter AST and ALT levels and liver health status.
EXAMPLE 3 Dual humanized or polygenic humanized mice
Humanized mice using the CD73 gene prepared as described above can also be prepared into a multi-humanized mouse model comprising a humanized CD73 gene. For example, fertilized egg cells from other genetically modified mice are selected for use in microinjection and embryo transfer processes, e.g., gene editing is performed on fertilized egg cells from PD-1 humanized mice to obtain a PD-1 and CD73 double-gene humanized mouse model. The CD73 humanized mouse homozygote or heterozygote can be mated with other gene modified homozygote or heterozygote, the offspring is screened, according to Mendelian genetic rule, the CD73 humanized mouse and other gene modified double-gene or polygene modified heterozygote can be obtained at a certain probability, and the heterozygote is mated with each other, so that the double-gene or polygene modified homozygote can be obtained.
Taking a double humanized CTLA4/CD73 mouse as an example, since the CTLA4 and CD73 genes of the mouse are respectively positioned on chromosome 1 and chromosome 9, the double humanized CTLA4/CD73 gene mouse can be obtained by selecting the CTLA4 humanized mouse and the CD73 humanized mouse for mating and screening positive offspring mice.
EXAMPLE 4 evaluation of drug efficacy
The CD73 gene humanized mice prepared by the method can be used for evaluating the efficacy of a regulator targeting human CD 73. Taking CD73 humanized mouse homozygote, subcutaneously inoculating mouse colon cancer cell MC38-hCD73 (MC 38 cell expressing human CD 73), and growing until tumor volume reaches about 100mm 3 The tumor volume fraction was then used as a control or treatment group (n=5), which was intraperitoneally injected with 10mg/kg of anti-human CD73 antibody MEDI9447, and an equal volume of hIgG was injected into the control group at a dosing frequency of 2 doses per week for 6 doses, and the tumor volume of individual mice after inoculation reached 3000mm 3 An end of life test was performed. Tumor volumes were measured periodically and the body weight of the mice was weighed. Overall, the animals in the treatment and control groups were in good health status during the experiment. At the end of the experiment, the body weight of each group of animals increased, and the body weights of the treated group and the control group of mice were not significantly different during the whole experimental period, indicating that the animals were well tolerated by MEDI 9447. However, from the tumor-measuring results, at the end of the experiment, the tumor volume of the mice in the treated group was significantly smaller than that of the control group, and it was found that the tumor growth in the mice was significantly inhibited after treatment with the human CD73 antibody MEDI 9447. The above results demonstrate that the use of anti-human CD73 antibodies has a significant inhibitory effect on tumor growth in CD73 humanized mice, thereby demonstrating that humanized CD73 animal models can be used to evaluate the effectiveness of CD 73-targeting drugs in vivo, as well as evaluate the therapeutic effect of CD73 targeting.
The preferred embodiments of the present invention have been described in detail above, but the present invention is not limited to the specific details of the above embodiments, and various simple modifications can be made to the technical solution of the present invention within the scope of the technical concept of the present invention, and all the simple modifications belong to the protection scope of the present invention.
In addition, the specific features described in the above embodiments may be combined in any suitable manner, and in order to avoid unnecessary repetition, various possible combinations are not described further.
Moreover, any combination of the various embodiments of the invention can be made without departing from the spirit of the invention, which should also be considered as disclosed herein.
Sequence listing
<110> Baioer chart (Beijing) pharmaceutical technology Co., ltd
Construction method and application of <120> CD73 gene humanized non-human animal
<130> 1
<150> CN202010807504.X
<151> 2020-08-12
<160> 63
<170> SIPOSequenceListing 1.0
<210> 1
<211> 3580
<212> DNA/RNA
<213> Mouse (Mouse)
<400> 1
agtttagtag aggccccggt ggccggtgct cactgggctc agggccagtc cacccgctca 60
acgcgctcta gccagccatg cgtcccgcgg ccgctaaggt acccaagtgg ctgcttctcg 120
cactgagcgc tctactacca cagtggcccg cagccagtgc ctgggagctc acgatcctgc 180
acacaaacga cgtgcacagc cggctagagc agaccagcga tgactccacc aagtgcctca 240
acgccagcct gtgtgtgggc ggcgtggccc ggctctttac caaggtgcag cagatccgca 300
aggaagaacc caacgtgctg tttttggatg ccggagacca gtaccagggc accatctggt 360
tcaccgttta caaaggcctt gaagtggcac acttcatgaa catcctgggc tacgatgcta 420
tggcactggg aaatcatgaa tttgataacg gtgtggaagg actgattgat cccctcctca 480
gaaacgttaa atttccaatt ctgagcgcaa acattaaggc acgggggcct ctagcacatc 540
agatatctgg actttttctg ccatctaaag ttctctctgt tggcggtgag gttgtgggga 600
ttgttggata tacttcaaag gaaacccctt tcctctcaaa tccagggaca aatttagtct 660
ttgaagatga aatctctgca ttgcagcctg aagtagataa actaaagacg ctaaatgtga 720
ataagatcat cgccctgggg cactctggtt ttgagatgga caaacttatc gctcagaaag 780
ttcgaggtgt ggacatcgtg gtgggaggac actccaacac ctttctctac acaggaaatc 840
caccttccaa agaagtgcct gcggggaagt acccattcat agtcaccgca gatgatggac 900
ggcaggtgcc tgtggtccag gcctatgcct ttggcaaata cctgggctac ctgaaggttg 960
agtttgatga taaaggcaat gttatcactt cctatggaaa tcccattctt ctcaacagca 1020
gcattcctga agatgcgacc atcaaagcag acattaacca atggaggata aaattagata 1080
attattctac ccaggaactc gggagaacga tcgtctacct ggatggctcc actcagacgt 1140
gccgcttcag ggaatgcaac atgggaaacc tgatctgtga tgccatgatt aacaacaacc 1200
tcagacaccc agatgaaatg ttttggaacc acgtgtccat gtgcattgta aacggaggtg 1260
gcatccggtc ccccattgat gagaagaaca atggtaccat cacctgggag aacctggctg 1320
ctgtgctgcc ctttggaggg acatttgacc tcgtccaatt aaaagggtcc accctgaaga 1380
aggcttttga gcacagcgtg catcgctatg gccagtccac cggagagttc ctgcaagtgg 1440
gtggaatcca tgtggtgtac gatattaacc gaaagccctg gaacagagtg gtccaattag 1500
aagttctctg caccaagtgt cgagtgccca tctatgagcc tcttgaaatg gataaagtgt 1560
ataaagtgac cctcccaagc tatctggcca acggtggaga tggattccag atgataaaag 1620
atgaattact aaagcatgac tctggtgatc aagatatcag cgtggtttct gaatacatct 1680
caaaaatgaa agtagtttac ccagccgttg aagggcggat caagttctct gcagcaagtc 1740
attaccaggg aagctttcct ttagtaattc tttcattttg ggcaatgatc cttattttgt 1800
accaataaca gggagtctcc ttgtccttga tgtcaaacta catttttctt ccagtgatat 1860
tcatatctgc ctctggagac ctggctttgt aacagcactc atcatcctca aggttcctag 1920
cagatgttct tcacaaggaa gagactgtaa catcatttgt tggggccagc aactcagtga 1980
gcagatagaa agtcacagtg aaccaacagg gtccttctgg cagggagtgg gtaaggggaa 2040
acaactagat gtagcttgca tagccacata acacatctgg ttaccatttc cctttctatt 2100
catttctaat ccatcaaaca attgatgttt acatacaact tcatcatcgc cagctctggt 2160
ggcacatgcc tgtggtcacg gcacttggca gggaggagag gatggctgca agttctaggc 2220
cagcctgaca tatgtagagt ttcaggccag tcagctagat atcaagactc acacacacaa 2280
acaaacatta taatttacaa gtagattttt gtagacaagt cttatgataa gctaagcaga 2340
aagggtcgac ttgttcaagg tcagacatct ccaaataata catctaagat ctgtacctga 2400
gtctttttac ctcaagtcca acctctcgcc tactgtcaag tctcctctct tcctgcggtt 2460
ggtctcagac aacaaatctg tttctgcctt ccatagtacc ctttcttttg ggctcttgtt 2520
gtctctcagg tttgagagag tagctactgg acaggattct ttcctacaac accatgtgcc 2580
tttggtgagt catagacaaa ctgtatacag cagataatag attagtccag ggactgcaaa 2640
aggcagtcag ggacaggcag gaaagggaaa ggagaatatg accaggactt acagaagaga 2700
gtagatgggt tccataaaca ctagttaagg agcagaacat ggctccaaag cacaacactg 2760
gttcttatta actcctgtgc gactccaaaa gccctgcctc tctccttcct tcttcctcac 2820
atggaggtgc catgagagaa cttcctaaaa taaactgtaa ttctcaacct gcacctgtcc 2880
tgtccaagat gccaacacgg ctctcattca agtcacacat gtacaatcca atattctgga 2940
aggcaagagt gaatttaaga agtggtatac ataaatgttt gaagtcactg agactcaaat 3000
cccacacaac cactgtaagg catactcagg tcaagacatg agaagaccag caggacagtg 3060
agccctatgg gaggcccaag agatacccag actgatgaaa tgatctgtgc ccacagaaca 3120
ccagaccaag acagtcctgt gaccaagtga gcatagtcag cactgtacag tagttaagaa 3180
tccttaacta taaaccagca gctagatgtc tggacaggga acatgttggt ttcctagtgt 3240
ttacgaatat taaggactct tgacccaaac cgtttaaaat tccaatcctt ttatgaagtt 3300
tgttgcagtg aaacaatgcc ttcagttctt tttgaatgtg tagattagtt ataaacagaa 3360
atgtcagtaa gactgtaaaa aaaaatgaag aaaggtgcat atgaggtata tttttatgct 3420
tggcgagtaa ataaggagaa atcttatagc ataattttta aagaacattt gtataatttt 3480
tctaaatatg tgtatatgta tattttatgc agcagtattg agaaatgacc ttggactatc 3540
ttgtaattgt aaattccaaa aataaagctg aagatatagt 3580
<210> 2
<211> 576
<212> PRT
<213> Mouse (Mouse)
<400> 2
Met Arg Pro Ala Ala Ala Lys Val Pro Lys Trp Leu Leu Leu Ala Leu
1 5 10 15
Ser Ala Leu Leu Pro Gln Trp Pro Ala Ala Ser Ala Trp Glu Leu Thr
20 25 30
Ile Leu His Thr Asn Asp Val His Ser Arg Leu Glu Gln Thr Ser Asp
35 40 45
Asp Ser Thr Lys Cys Leu Asn Ala Ser Leu Cys Val Gly Gly Val Ala
50 55 60
Arg Leu Phe Thr Lys Val Gln Gln Ile Arg Lys Glu Glu Pro Asn Val
65 70 75 80
Leu Phe Leu Asp Ala Gly Asp Gln Tyr Gln Gly Thr Ile Trp Phe Thr
85 90 95
Val Tyr Lys Gly Leu Glu Val Ala His Phe Met Asn Ile Leu Gly Tyr
100 105 110
Asp Ala Met Ala Leu Gly Asn His Glu Phe Asp Asn Gly Val Glu Gly
115 120 125
Leu Ile Asp Pro Leu Leu Arg Asn Val Lys Phe Pro Ile Leu Ser Ala
130 135 140
Asn Ile Lys Ala Arg Gly Pro Leu Ala His Gln Ile Ser Gly Leu Phe
145 150 155 160
Leu Pro Ser Lys Val Leu Ser Val Gly Gly Glu Val Val Gly Ile Val
165 170 175
Gly Tyr Thr Ser Lys Glu Thr Pro Phe Leu Ser Asn Pro Gly Thr Asn
180 185 190
Leu Val Phe Glu Asp Glu Ile Ser Ala Leu Gln Pro Glu Val Asp Lys
195 200 205
Leu Lys Thr Leu Asn Val Asn Lys Ile Ile Ala Leu Gly His Ser Gly
210 215 220
Phe Glu Met Asp Lys Leu Ile Ala Gln Lys Val Arg Gly Val Asp Ile
225 230 235 240
Val Val Gly Gly His Ser Asn Thr Phe Leu Tyr Thr Gly Asn Pro Pro
245 250 255
Ser Lys Glu Val Pro Ala Gly Lys Tyr Pro Phe Ile Val Thr Ala Asp
260 265 270
Asp Gly Arg Gln Val Pro Val Val Gln Ala Tyr Ala Phe Gly Lys Tyr
275 280 285
Leu Gly Tyr Leu Lys Val Glu Phe Asp Asp Lys Gly Asn Val Ile Thr
290 295 300
Ser Tyr Gly Asn Pro Ile Leu Leu Asn Ser Ser Ile Pro Glu Asp Ala
305 310 315 320
Thr Ile Lys Ala Asp Ile Asn Gln Trp Arg Ile Lys Leu Asp Asn Tyr
325 330 335
Ser Thr Gln Glu Leu Gly Arg Thr Ile Val Tyr Leu Asp Gly Ser Thr
340 345 350
Gln Thr Cys Arg Phe Arg Glu Cys Asn Met Gly Asn Leu Ile Cys Asp
355 360 365
Ala Met Ile Asn Asn Asn Leu Arg His Pro Asp Glu Met Phe Trp Asn
370 375 380
His Val Ser Met Cys Ile Val Asn Gly Gly Gly Ile Arg Ser Pro Ile
385 390 395 400
Asp Glu Lys Asn Asn Gly Thr Ile Thr Trp Glu Asn Leu Ala Ala Val
405 410 415
Leu Pro Phe Gly Gly Thr Phe Asp Leu Val Gln Leu Lys Gly Ser Thr
420 425 430
Leu Lys Lys Ala Phe Glu His Ser Val His Arg Tyr Gly Gln Ser Thr
435 440 445
Gly Glu Phe Leu Gln Val Gly Gly Ile His Val Val Tyr Asp Ile Asn
450 455 460
Arg Lys Pro Trp Asn Arg Val Val Gln Leu Glu Val Leu Cys Thr Lys
465 470 475 480
Cys Arg Val Pro Ile Tyr Glu Pro Leu Glu Met Asp Lys Val Tyr Lys
485 490 495
Val Thr Leu Pro Ser Tyr Leu Ala Asn Gly Gly Asp Gly Phe Gln Met
500 505 510
Ile Lys Asp Glu Leu Leu Lys His Asp Ser Gly Asp Gln Asp Ile Ser
515 520 525
Val Val Ser Glu Tyr Ile Ser Lys Met Lys Val Val Tyr Pro Ala Val
530 535 540
Glu Gly Arg Ile Lys Phe Ser Ala Ala Ser His Tyr Gln Gly Ser Phe
545 550 555 560
Pro Leu Val Ile Leu Ser Phe Trp Ala Met Ile Leu Ile Leu Tyr Gln
565 570 575
<210> 3
<211> 3562
<212> DNA/RNA
<213> person (human)
<400> 3
actcgccggc actcgcccgg ctcgcccgct ttcgcaccca gttcacgcgc cacagctatg 60
tgtccccgag ccgcgcgggc gcccgcgacg ctactcctcg ccctgggcgc ggtgctgtgg 120
cctgcggctg gcgcctggga gcttacgatt ttgcacacca acgacgtgca cagccggctg 180
gagcagacca gcgaggactc cagcaagtgc gtcaacgcca gccgctgcat gggtggcgtg 240
gctcggctct tcaccaaggt tcagcagatc cgccgcgccg aacccaacgt gctgctgctg 300
gacgccggcg accagtacca gggcactatc tggttcaccg tgtacaaggg cgccgaggtg 360
gcgcacttca tgaacgccct gcgctacgat gccatggcac tgggaaatca tgaatttgat 420
aatggtgtgg aaggactgat cgagccactc ctcaaagagg ccaaatttcc aattctgagt 480
gcaaacatta aagcaaaggg gccactagca tctcaaatat caggacttta tttgccatat 540
aaagttcttc ctgttggtga tgaagttgtg ggaatcgttg gatacacttc caaagaaacc 600
ccttttctct caaatccagg gacaaattta gtgtttgaag atgaaatcac tgcattacaa 660
cctgaagtag ataagttaaa aactctaaat gtgaacaaaa ttattgcact gggacattcg 720
ggttttgaaa tggataaact catcgctcag aaagtgaggg gtgtggacgt cgtggtggga 780
ggacactcca acacatttct ttacacaggc aatccacctt ccaaagaggt gcctgctggg 840
aagtacccat tcatagtcac ttctgatgat gggcggaagg ttcctgtagt ccaggcctat 900
gcttttggca aatacctagg ctatctgaag atcgagtttg atgaaagagg aaacgtcatc 960
tcttcccatg gaaatcccat tcttctaaac agcagcattc ctgaagatcc aagcataaaa 1020
gcagacatta acaaatggag gataaaattg gataattatt ctacccagga attagggaaa 1080
acaattgtct atctggatgg ctcctctcaa tcatgccgct ttagagaatg caacatgggc 1140
aacctgattt gtgatgcaat gattaacaac aacctgagac acacggatga aatgttctgg 1200
aaccacgtat ccatgtgcat tttaaatgga ggtggtatcc ggtcgcccat tgatgaacgc 1260
aacaatggca caattacctg ggagaacctg gctgctgtat tgccctttgg aggcacattt 1320
gacctagtcc agttaaaagg ttccaccctg aagaaggcct ttgagcatag cgtgcaccgc 1380
tacggccagt ccactggaga gttcctgcag gtgggcggaa tccatgtggt gtatgatctt 1440
tcccgaaaac ctggagacag agtagtcaaa ttagatgttc tttgcaccaa gtgtcgagtg 1500
cccagttatg accctctcaa aatggacgag gtatataagg tgatcctccc aaacttcctg 1560
gccaatggtg gagatgggtt ccagatgata aaagatgaat tattaagaca tgactctggt 1620
gaccaagata tcaacgtggt ttctacatat atctccaaaa tgaaagtaat ttatccagca 1680
gttgaaggtc ggatcaagtt ttccacagga agtcactgcc atggaagctt ttctttaata 1740
tttctttcac tttgggcagt gatctttgtt ttataccaat agccaaaaat tctccttgcc 1800
tttaatgtgt gaaactgcat tttttcaagt gagattcaaa tctgcctttt aggacctggc 1860
tttgtgacag caaaaaccat ctttacaggc tcctagaagc tgaaggttag agcattataa 1920
aatgaagaga cagacatgat tactcagggt cagcaaccta gtgagttaga aaaaaaatta 1980
acatagggcc ctataaggag aaagccaact atgttaagtt tacgtgtcca aattttaatg 2040
aaattttact aacaatttta aaccatattt ttcttcttca tatccatttc taatccatca 2100
aacagcttat gtttacataa aattttatca ttcacaagga agttttaagc acactgtctc 2160
atttgatatc cacaacttat ttttggtagg aaagagagat gtttttccca cctgtcagat 2220
gaaaaaactg aagctcaaaa agggttgact tgaccataca gctaatgctg acagatccaa 2280
gacctagacc taggtctttt gaactcaagt ccagcattct caactatatc aagttactgt 2340
tcagaatact taatatctcc tctcttcata attatcaata gccccaagct catggatgac 2400
aaatctctgc tttatttctt gtctctattt tttcacttta tagctcctgt tataatagca 2460
agtttaatgg tataaacaca ggataccatc ctctcttgca acacccatgt gcctttgatg 2520
agtcaggtag caagctgtag tagataatga gaaaggccag aggctgcaaa agacagtcaa 2580
aggacacgag agaaaggaag gggaagaaca ggactccagg actgttttat attatagaaa 2640
agcaagagct aaagagcatt tacacatgtt aaacagatac ttgttaagca tagtgcctga 2700
cacacggcat tagctgttat tttatgagat tccatcagct ctgcctctgt cctctttctt 2760
ctaacatgaa ggtatcatga gaagagaacc ttctaacata agctgtaatt ctaaacctgc 2820
acttgtccct ctccagcaag aggctagcac tgaattcatt ctactcatac tacacaccca 2880
gttatggaat gtccagagtt ctcgaagaaa ataaatgact ttaggaagag gtatacattt 2940
tttaagtcgc tctgcctcca aatctgaaca gtcactgtaa atcattctta agcccagata 3000
tgagaacttc tgctggaaag tgggaccctc tgagtgggtg gtcagaaaat acccatgctg 3060
atgaaatgac ctatgcccaa agaacaaata cttaacgtgg gagtggaacc acatgagcct 3120
gctcagctct gcataagtaa ttcaagaaat gggaggcttc accttaaaaa cagtgtgcaa 3180
atggcagcta gaggttttga taggaagtat gtttgtttct tagtgtttac aaatattaag 3240
tactcttgat acaaaatata cttttaaact tcataacctt tttataaaag ttgttgcagc 3300
aaaataatag cctcggttct atgcatatat ggattagcta taaaaaatgt caataagatt 3360
gtacaaggaa aattagagaa agtcacattt agggtttatt ttttacactt ggccagtaaa 3420
atagggtaaa tcctattaga attttttaaa gaactttttt taagtttcct aaatctgtgt 3480
gtgtattgtg aagtggtata agaaatgact ttgaaccact ttgcaattgt agattcccaa 3540
caataaaatt gaagataagc tc 3562
<210> 4
<211> 574
<212> PRT
<213> person (human)
<400> 4
Met Cys Pro Arg Ala Ala Arg Ala Pro Ala Thr Leu Leu Leu Ala Leu
1 5 10 15
Gly Ala Val Leu Trp Pro Ala Ala Gly Ala Trp Glu Leu Thr Ile Leu
20 25 30
His Thr Asn Asp Val His Ser Arg Leu Glu Gln Thr Ser Glu Asp Ser
35 40 45
Ser Lys Cys Val Asn Ala Ser Arg Cys Met Gly Gly Val Ala Arg Leu
50 55 60
Phe Thr Lys Val Gln Gln Ile Arg Arg Ala Glu Pro Asn Val Leu Leu
65 70 75 80
Leu Asp Ala Gly Asp Gln Tyr Gln Gly Thr Ile Trp Phe Thr Val Tyr
85 90 95
Lys Gly Ala Glu Val Ala His Phe Met Asn Ala Leu Arg Tyr Asp Ala
100 105 110
Met Ala Leu Gly Asn His Glu Phe Asp Asn Gly Val Glu Gly Leu Ile
115 120 125
Glu Pro Leu Leu Lys Glu Ala Lys Phe Pro Ile Leu Ser Ala Asn Ile
130 135 140
Lys Ala Lys Gly Pro Leu Ala Ser Gln Ile Ser Gly Leu Tyr Leu Pro
145 150 155 160
Tyr Lys Val Leu Pro Val Gly Asp Glu Val Val Gly Ile Val Gly Tyr
165 170 175
Thr Ser Lys Glu Thr Pro Phe Leu Ser Asn Pro Gly Thr Asn Leu Val
180 185 190
Phe Glu Asp Glu Ile Thr Ala Leu Gln Pro Glu Val Asp Lys Leu Lys
195 200 205
Thr Leu Asn Val Asn Lys Ile Ile Ala Leu Gly His Ser Gly Phe Glu
210 215 220
Met Asp Lys Leu Ile Ala Gln Lys Val Arg Gly Val Asp Val Val Val
225 230 235 240
Gly Gly His Ser Asn Thr Phe Leu Tyr Thr Gly Asn Pro Pro Ser Lys
245 250 255
Glu Val Pro Ala Gly Lys Tyr Pro Phe Ile Val Thr Ser Asp Asp Gly
260 265 270
Arg Lys Val Pro Val Val Gln Ala Tyr Ala Phe Gly Lys Tyr Leu Gly
275 280 285
Tyr Leu Lys Ile Glu Phe Asp Glu Arg Gly Asn Val Ile Ser Ser His
290 295 300
Gly Asn Pro Ile Leu Leu Asn Ser Ser Ile Pro Glu Asp Pro Ser Ile
305 310 315 320
Lys Ala Asp Ile Asn Lys Trp Arg Ile Lys Leu Asp Asn Tyr Ser Thr
325 330 335
Gln Glu Leu Gly Lys Thr Ile Val Tyr Leu Asp Gly Ser Ser Gln Ser
340 345 350
Cys Arg Phe Arg Glu Cys Asn Met Gly Asn Leu Ile Cys Asp Ala Met
355 360 365
Ile Asn Asn Asn Leu Arg His Thr Asp Glu Met Phe Trp Asn His Val
370 375 380
Ser Met Cys Ile Leu Asn Gly Gly Gly Ile Arg Ser Pro Ile Asp Glu
385 390 395 400
Arg Asn Asn Gly Thr Ile Thr Trp Glu Asn Leu Ala Ala Val Leu Pro
405 410 415
Phe Gly Gly Thr Phe Asp Leu Val Gln Leu Lys Gly Ser Thr Leu Lys
420 425 430
Lys Ala Phe Glu His Ser Val His Arg Tyr Gly Gln Ser Thr Gly Glu
435 440 445
Phe Leu Gln Val Gly Gly Ile His Val Val Tyr Asp Leu Ser Arg Lys
450 455 460
Pro Gly Asp Arg Val Val Lys Leu Asp Val Leu Cys Thr Lys Cys Arg
465 470 475 480
Val Pro Ser Tyr Asp Pro Leu Lys Met Asp Glu Val Tyr Lys Val Ile
485 490 495
Leu Pro Asn Phe Leu Ala Asn Gly Gly Asp Gly Phe Gln Met Ile Lys
500 505 510
Asp Glu Leu Leu Arg His Asp Ser Gly Asp Gln Asp Ile Asn Val Val
515 520 525
Ser Thr Tyr Ile Ser Lys Met Lys Val Ile Tyr Pro Ala Val Glu Gly
530 535 540
Arg Ile Lys Phe Ser Thr Gly Ser His Cys His Gly Ser Phe Ser Leu
545 550 555 560
Ile Phe Leu Ser Leu Trp Ala Val Ile Phe Val Leu Tyr Gln
565 570
<210> 5
<211> 1770
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 5
cagatgagga aagtgaagga gactgaggtt gcactgataa cctcagtgga tttaggtacc 60
gaggaagtaa aatgacctta gcagccaaac caagtgggcg actgcccaga gctggggtta 120
tgcaggaggc cgcacgagga agggacagac agggctttct gaagaaatct gctgtgactc 180
tgagggtggg aatatcatac agatggtact caacgccacc cctgctctac ctgaagtcct 240
gtgcacagag taaatccaga acctgctgag gtatcttgtt cacctcacca gcaagtcctt 300
ccttagtcta agctgctgtg gatggggctg tctgagagct tccttttaaa gtcttcgact 360
tggagaacga caaaagaatt ctaggacaag gcaggtgtac agtgtacact ttacaagtgt 420
aaagcagggc tgtagtagat tagtttatga ggaagaaagt tccattaaaa tgtaagcatg 480
ggagagaaca agtgcttagg aggagggtac acactcttgg gttcgggctg ctcaaaataa 540
gagatgtcct tgtcattgca acagccacac atactggttt tggtggcttc tgaagctgca 600
gctccaaagg tagctaagga aacttttccc ttttttggta aatgagatca tagggtggtt 660
cttatgtgca ctggctagga ttacagctgg cacggtaaaa ctcgagatca caggagctat 720
aggggaacat taagctatct ttactgtaaa caaagtttta gttttagttg ggagacttcc 780
agaaaatacc taggaaagga taaaagcaag ccttaagcac agttcattgc tattttaaca 840
tttctatttg ataacaaaaa ttttcctata taaaaggcct attgtccctc tgtctgcaac 900
cttcacagcc aactttgttc attgggctga aattcttgat tctcagctgg cagaaggctt 960
cctccagacg ccaaagcaag tgacttcttt ctctttccac gtacctgtaa tgttctcatg 1020
gctttctatc ctgctcgggg tacaggggtg atcaatgccc agtctaacag agacttctgt 1080
caggcctgat tcgcctagat gaattggaga agtccttgag tatagcagtt gtgaggactt 1140
gagatgtgac ccaagggaag caatgaaagg gggttctggg ggtgtctgca ttctggtctg 1200
ggcctgacct gtgggctaac tcccgaaaac aatgaagatg agctgggtag gccctggtaa 1260
aggtggtctg tgagccatct ggctggaggg ggctgggaca gggagtccca aatgcacaca 1320
ttcacagcag cttcactgga tgaggcgccc agggcaacaa tgctgtacct acctgtgatc 1380
tacgtgtctc tgggctcagc ccaaggggcc cctttggagg taaggagaac actagaacat 1440
gtatgtggtc aggtgtcata gaggccagcc cattttggtg ccactgggct aacatgatat 1500
actttaaaat atgcttttat tggaaaattc cagatactta attctttttt tttctttttt 1560
cttttctgtt ttacttaggc actgggaaat catgaatttg ataacggtgt ggaaggactg 1620
attgatcccc tcctcagaaa cgttaaattt ccaattctga gcgcaaacat taaggcacgg 1680
gggcctctag cacatcagat atctggactt tttctgccat ctaaagttct ctctgttggc 1740
ggtgaggttg tggggattgt tggatatact 1770
<210> 6
<211> 1133
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 6
cacgaatggt cgatggctgt gtctccctta gtctgctgtt tggagaaact ctttttacac 60
tgggctttgc agacttccct cttgaaacct ctttatgaga cttttgcatt tgggacactt 120
tcattttagt gccggttctt agaaactaac ctgagagcat ctaggtcaac tggaggactg 180
ccttgcaata gagtgctggg tgagacttca ggacgccatg tcggtggtcc cctcgtgacc 240
actgaagact tgctaagttt gggctccagg cttggaggcc ttgtttatgt ctcagcatag 300
gattcccagg gcctatgcta cccagagccc aaggcactca ttattagcac tgggtgagct 360
ccatgaggtt cgcgaccctt cctagggcag gccatgttta gatgaggcct ctatgatgca 420
atgcagcaat cagttctact ccaatggtgc cacaccctgc ctgcaagtaa gcagttgtca 480
tctaccatgc taaaattagt ttccaagggc cagcaagatg gctctggggt aatgaagggg 540
actttccacc tcacctgata cctggagttc aaaccctgga acctacatgg taggaggaga 600
gatctgaaat ccctaagtat aaataaataa aaaaaacaaa taaataaaat gtaaaaacct 660
ccttttgaaa ttggctttca aaacctacat gttgacattt taggtctgaa gacattttag 720
gtttccaggc aaagcaatga cactggcagc cgttgtaaag accatgcctt tgtgctcaga 780
catgtttttc atggttagtg tgaaggccta aatgtaaaag gagctggaaa caaaccatac 840
ctgaagtctc tttgtagtat gagattcagt ttgtgcaaat atatcatcta tggcccagtg 900
agatggttca acagaaaatg atgcttgctg catcgcctga tgtcctgagt ttgatcccca 960
ggacccacat ggtggacaaa gagaaccaat tcttgaaaat tgtcctctga cctccacgca 1020
tgtattgtag tatgtatgtg caaacaaaaa taaatgttac tgtgatggtt tgtacatgct 1080
tggcccaggg aatggcacca ttagaaggtg tgaccttgtt ggagtaggtg tgc 1133
<210> 7
<211> 4354
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 7
cctttagtcc taataataag gtggcagaaa cagttggatc tctgtgttca aggctagcct 60
agtctgcata gagagttcct ggccggctaa ggtgcatagt aagatcctat tacaaaaata 120
aatgaatgag taaataaaag taaactacta ccgtagttgt gctggttagt tcttgccaac 180
ttaacacaaa cgagaatcac ctgagaagag gttcctcacg tgagcaggct ggactgtgcg 240
cacacatgtg ggacattttt ctgactggtg attgatgtga gagagcccag cccactgtag 300
gcggtgtcac ccatgggcag gtgggcctgg gtgacataag acaaacaagg cagtaaacag 360
cactcctctg tgtcctctcc atcagatttt ggtctggggt ttctgccctg actttgctca 420
gtgatacact gtcaactgga agaagcagca aatcctttcc tctccatgtt gtgctgtggt 480
cattgtgttt attacaataa atcaaaaaag agaaaaagac aagaaaacaa agtaaagcag 540
gctatgcttt tgccaagaaa acagccctcc caaaatgcac ttcacccaaa tactgccctc 600
tcccttctgt gtggtaattg cgacaatgcc tccctccttg cgcatggtat ttcagattct 660
aagggatggg gtaatgtgtc atacttttcc ttcctcaacg ctagtgttac caggcacgca 720
tgtggtcgct tcctcaacgg ctgactagct accctgagag ggcagaaaga aggccacagg 780
agtagggaag gcttatgcca gtgcttctgg cttggttgac tcaaattgtc cttctaggaa 840
gaacaatgtc cccagaactg ccattttggg gaaagggagc ctttggatct gacacttgag 900
gccacaggcc cagatgacaa gtctcagaga tagtttttcg ggtctggaaa gctagtggaa 960
aatggacaca tgacaactgt cacacagacc aaaagaaccc cacagatgtt tgaagtttgg 1020
agctgaagat gagggtgact aaagctattt ggtcctctat caagagcatt ggtgtccatg 1080
aacaggaatg ccctccccaa gggatacaaa atcaagatag tcacagcctg tactcaagtc 1140
agggtgggac ggaacaagat attgaagaag caacaaacaa aaacttgaac gttcaccata 1200
ccctaaggat agaacagaac tgatgagttg gttagtcttc cgtggaaaac tgtggatcca 1260
tgtgttgctg ttgtgcacgt ctgttcccag cattctagaa gctaatgcaa gactcagttc 1320
gaggccatcc tgagctacac agtgagaccc tgtctcaaaa aaaccccaaa acaagaaaca 1380
tccaccaaat acaaaactct gtacttaaga ttattacagg gctggagaga tggctcagtg 1440
gttaagagca ctgactgctc ttccagaggt cctgagttca attcccagca accacatggt 1500
ggctcataat catctgtggt gagatctgat gccctcttct ggagtgtctg aagatagcta 1560
caatgtactc ttataaataa aataaaaaaa tcttttttta aaaaattatt atataacaaa 1620
gggtaattta ttttaccata ggaagaaaac ttgatgtttt aagactagaa gaaagaaaaa 1680
caaatgtctg agactgagag accctttaga cctggatcag ctcttagaga agttaactag 1740
ctaggaggag gggagactgg gtatagcatc agaggtgggg aaaggtgtga gttcagcaac 1800
catgaacttt ccaggggaga cagggtggga ctgtggatag ctgcgtgctt cctagagagt 1860
tcactcagcc ctttgagtga caggtgctca gtcagcgcca gtgtgccaaa tgaggaaaca 1920
aggtgaacag gacaaaaagg ccaacataac ttgagagcac tcagtttagc ccggagacag 1980
acaacaaatg accccacaat atagttctgg aatgttctgg tgttacatga ctggtggtca 2040
tgtcccacca aagttttcta ggtttttgtc ttcttattaa aacgttctaa taaaggaact 2100
cagggataat aaaggaacaa caaggaaata gcattcagat ttctccctaa aagttcactt 2160
acgggacagt ttgccctatt caacatgtgc cccaaactag ctcaggccag attgcctccc 2220
taccctcata cctagacata actggagtat gactgaatag aaattgtttg atggttgttc 2280
aggagagaag gaacttaact ccattgtctg gagagagtgg tatgtgcagc tcagggaagg 2340
tggggtccta ggttgtcaag tgcattgttt ggagggtggt gtgtgcacat agtatgtgca 2400
gtaccaggct gccaagtgca gtgttagaag aggtgtatgt gcattgcaca agccaggttc 2460
attactaaac tatttgttgt gcttacttac ctgcctcact gggaagagga aacgaggtgg 2520
aggagaatgt accatggccc ttgttagcat caaggaccta gagccagtgc tgccaggcct 2580
ctctcagatg aggaaagtga aggagactga ggttgcactg ataacctcag tggatttagg 2640
taccgaggaa gtaaaatgac cttagcagcc aaaccaagtg ggcgactgcc cagagctggg 2700
gttatgcagg aggccgcacg aggaagggac agacagggct ttctgaagaa atctgctgtg 2760
actctgaggg tgggaatatc atacagatgg tactcaacgc cacccctgct ctacctgaag 2820
tcctgtgcac agagtaaatc cagaacctgc tgaggtatct tgttcacctc accagcaagt 2880
ccttccttag tctaagctgc tgtggatggg gctgtctgag agcttccttt taaagtcttc 2940
gacttggaga acgacaaaag aattctagga caaggcaggt gtacagtgta cactttacaa 3000
gtgtaaagca gggctgtagt agattagttt atgaggaaga aagttccatt aaaatgtaag 3060
catgggagag aacaagtgct taggaggagg gtacacactc ttgggttcgg gctgctcaaa 3120
ataagagatg tccttgtcat tgcaacagcc acacatactg gttttggtgg cttctgaagc 3180
tgcagctcca aaggtagcta aggaaacttt tccctttttt ggtaaatgag atcatagggt 3240
ggttcttatg tgcactggct aggattacag ctggcacggt aaaactcgag atcacaggag 3300
ctatagggga acattaagct atctttactg taaacaaagt tttagtttta gttgggagac 3360
ttccagaaaa tacctaggaa aggataaaag caagccttaa gcacagttca ttgctatttt 3420
aacatttcta tttgataaca aaaattttcc tatataaaag gcctattgtc cctctgtctg 3480
caaccttcac agccaacttt gttcattggg ctgaaattct tgattctcag ctggcagaag 3540
gcttcctcca gacgccaaag caagtgactt ctttctcttt ccacgtacct gtaatgttct 3600
catggctttc tatcctgctc ggggtacagg ggtgatcaat gcccagtcta acagagactt 3660
ctgtcaggcc tgattcgcct agatgaattg gagaagtcct tgagtatagc agttgtgagg 3720
acttgagatg tgacccaagg gaagcaatga aagggggttc tgggggtgtc tgcattctgg 3780
tctgggcctg acctgtgggc taactcccga aaacaatgaa gatgagctgg gtaggccctg 3840
gtaaaggtgg tctgtgagcc atctggctgg agggggctgg gacagggagt cccaaatgca 3900
cacattcaca gcagcttcac tggatgaggc gcccagggca acaatgctgt acctacctgt 3960
gatctacgtg tctctgggct cagcccaagg ggcccctttg gaggtaagga gaacactaga 4020
acatgtatgt ggtcaggtgt catagaggcc agcccatttt ggtgccactg ggctaacatg 4080
atatacttta aaatatgctt ttattggaaa attccagata cttaattctt tttttttctt 4140
ttttcttttc tgttttactt aggcactggg aaatcatgaa tttgataacg gtgtggaagg 4200
actgattgat cccctcctca gaaacgttaa atttccaatt ctgagcgcaa acattaaggc 4260
acgggggcct ctagcacatc agatatctgg actttttctg ccatctaaag ttctctctgt 4320
tggcggtgag gttgtgggga ttgttggata tact 4354
<210> 8
<211> 4567
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 8
ccagagccca aggcactcat tattagcact gggtgagctc catgaggttc gcgacccttc 60
ctagggcagg ccatgtttag atgaggcctc tatgatgcaa tgcagcaatc agttctactc 120
caatggtgcc acaccctgcc tgcaagtaag cagttgtcat ctaccatgct aaaattagtt 180
tccaagggcc agcaagatgg ctctggggta atgaagggga ctttccacct cacctgatac 240
ctggagttca aaccctggaa cctacatggt aggaggagag atctgaaatc cctaagtata 300
aataaataaa aaaaacaaat aaataaaatg taaaaacctc cttttgaaat tggctttcaa 360
aacctacatg ttgacatttt aggtctgaag acattttagg tttccaggca aagcaatgac 420
actggcagcc gttgtaaaga ccatgccttt gtgctcagac atgtttttca tggttagtgt 480
gaaggcctaa atgtaaaagg agctggaaac aaaccatacc tgaagtctct ttgtagtatg 540
agattcagtt tgtgcaaata tatcatctat ggcccagtga gatggttcaa cagaaaatga 600
tgcttgctgc atcgcctgat gtcctgagtt tgatccccag gacccacatg gtggacaaag 660
agaaccaatt cttgaaaatt gtcctctgac ctccacgcat gtattgtagt atgtatgtgc 720
aaacaaaaat aaatgttact gtgatggttt gtacatgctt ggcccaggga atggcaccat 780
tagaaggtgt gaccttgttg gagtaggtgt gccattgtgg atgtgggttt aagaccctca 840
ccctagctac ctgaaagtca gttttctgct agcagccttc agaagaagat gtagaactct 900
cagctctgcc tgcatcatgc atgcctagat gctgccaagt tcccaccttg atgataatgg 960
cctgaacctc tgaacctctg aacctgtagc cagccctaat taaatgttgt cctttataag 1020
acttgccatg gtcatagtat ctgtgcacag cagtaaaacc ctaagacagt tacatatata 1080
tatatattgg ttatatatgt gtatgtatat atacatatat atatgtatat gtgtatgtat 1140
atatacatat atatatatgt atatgtatat gtatatgtat atgtatatgt atatgtatat 1200
gtatatgtat atgtatatgt atatgtatag tttgtatgta ccatctacag ccttaaccca 1260
atgcaatatt aggtgaatgc aaccaagaac attgatttaa atttttcatg accactcaag 1320
cttttgtaca aaatgtattc atacattttg ctgttagtaa gtgaagagag ctttaacaag 1380
agattgtatt agtgtgttgg tatataaata gtacaatgaa attcttttaa tatccatttc 1440
attaatattc tttagggaca aatttagtct ttgaagatga aatctctgca ttgcagcctg 1500
aagtagataa actaaagacg ctaaatgtga ataagatcat cgccctgggg cactctggtt 1560
ttgagatgga caaacttatc gctcagaaag ttcgaggtgt ggacatcgtg gtgggaggac 1620
actccaacac ctttctctac acaggtaatt gttttgatgc gttctgcaag ctaggatgtt 1680
cattcagtgt gcagtatatg atgtattagc tgtaatatgc aggattatgt gttatacttt 1740
gcattagatg aggacaggta atcctgctgg gagacgttcc aagaattcat tcatgtcatg 1800
attaaaagtc tacactacaa gcacatagca cacataatga aactaaaaaa attattgatc 1860
tccagacaaa ggtcagccaa cggtagaatt tttaaaagat gaaatttttt tttagatatt 1920
ttcagtggtt cttggacctg aatccagacc tttctgtatt ttaaacacct atattcccag 1980
cttttggttt tctaagacag agtttcacta tctagccctg attggcccca ttctcccacc 2040
tctcaggaga atggtaggac tataagcata cacctctgta tctggtttgg aaagtttcat 2100
catcatcatc attttcatag aacttcttag aatttttaaa tgctgaatat atattacatt 2160
tggaatcagg aagtgagata cagtacaaat tatagaaaga aaagaaacac gatattgagt 2220
caaagtcctc attcctaact cacttgaatg accttgaaca ggcaagtcac ctgaccctct 2280
gggcactttg gctttctaat aagaagtctg tcaccgtttt gaagattaag agtctgggaa 2340
cagttgactt cactggtttg gccttgggtg aatgtccccc tggctcttca tcatatcaca 2400
acatgggaga gaatcaggaa gggacatgta cagggagccc aggtgtgtgt ggtggccctc 2460
ctgttacaac cactccctca catagaacta actgtggtct caaaagaact actctcctct 2520
gagggtgcta gctccagtga ccttgtcacc tcacattaga cactccttga aggtttcacc 2580
gtccctcagt gtcaccacag tgaggactaa taagctgcta acacatgaac acacactaga 2640
caaaccacat ccaagacaca acagtttcag gtgagtcttc atacttgcct ggtcttatta 2700
caattctaca gcaatcctgt ggatatgacc tattcaaact ctgggggctt tgtagctctt 2760
ccccaaggga gttttagaag gctacaacac acagcagaaa ccagttgatg ctcctgccac 2820
ttcacccttt atgttggctt gagccacagt tagcatgaca ttttacaaga ggatctaaca 2880
ggtatacgtg tacagacttg tgcactctca gagatatgta taaagtaaac cgaaaatccc 2940
caaagcaccc agactatcct gcttgtattg ttgttgttct tcagtttctt ttcttgttat 3000
tttgtttctt tttaccttga gacaggatct tatattgttt gtttcatttt tatggagaca 3060
gagtctcact ttgctttgta gtcctggctg acaacgttat atgttgactt aaccttgact 3120
gtaaggtccc ccttctactg gtatgacagg aagcactaat gtcacaagaa taccatcaca 3180
cacacacaca cacacacaca cacacacaca gagccacccc ttactccaga tgatgcaatc 3240
acccccccct ttgtcacttt gatgtttctt taccatctgt gccactttgt ggcatcattt 3300
gggcacccac agtctgctgc accgtgtggc tactgattac ggcaccctgc tagcccaaga 3360
agctggaggg aatggtgatt atccttttgc tcttacaact attgccccag ccagcaagtc 3420
ctgcaacatg agagatctga gggtgagctg tagcttgctc atctcctgaa tccagtctct 3480
taaacgctga cccgagtcta agtcatttgt ttcacatttc ataatagatt accaagatta 3540
aacttgctcc catgaaaatg tgttttggct ctcttttgag gactaaatat agcaattaaa 3600
ggacatgata tgaaatgaga cctactcatg tatttaaaca aatttttgcc agccttaaaa 3660
aaaaaatttt ttttcagaca ggctgacaag aaaactcagc agataaaggt acttgctgta 3720
ctagcctaga gacctgagtc tgaacccata taagggcaga aggagagaaa ctaaagccca 3780
aaagttagtt gtcctctgac ctcatgtgta ccacagcata tcctcatatt tattctgtct 3840
ctctctctct cacttactca ctctctctct ctctctcact tactcactct ctctttctct 3900
ctcacataca cacacaaaca cacacacacc acacacactg aaaaaaactt aagcaatctt 3960
ttactaaacc attttatgaa attaccaaga agcaacaaga gagagagaga gagagagaga 4020
gagagagaga gagagagaga gagagagaga gaagaaagga aaggaaagga agaaaagaga 4080
agagaagaga agagaagaga agaaaagaaa agaaaagaaa agaaaagaaa agaaaagaaa 4140
agaaaagaaa agaaattacc aagaagctaa attaagtctg ttaaacatta catagtaaat 4200
gtagtaaaaa gaacacagct tgtggaagca tacatccttt tgtgttttaa gtccagatct 4260
tattgtctga tctggaattt actctgtagc ccaggctaga ctcaaatcat ggcagtcttt 4320
ctatgtaagc ctctgtagag ctgggaatag tggcatgagc catcatgctc agctagaaat 4380
ttaaagttta agaaatgtct cagcatgtga catgcacaca gcccaaagga aaacgtccct 4440
gaagcacctg aatagcacct gatagactct ggtcttttat ttagtctttt gtgtgtatta 4500
ctgcccacta aattagttcc cttcttgtct tgtttgttgt ttatttgtaa cccacattct 4560
aaaaacc 4567
<210> 9
<211> 1725
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 9
atgtgtcccc gagccgcgcg ggcgcccgcg acgctactcc tcgccctggg cgcggtgctg 60
tggcctgcgg ctggcgcctg ggagcttacg attttgcaca ccaacgacgt gcacagccgg 120
ctggagcaga ccagcgagga ctccagcaag tgcgtcaacg ccagccgctg catgggtggc 180
gtggctcggc tcttcaccaa ggttcagcag atccgccgcg ccgaacccaa cgtgctgctg 240
ctggacgccg gcgaccagta ccagggcact atctggttca ccgtgtacaa gggcgccgag 300
gtggcgcact tcatgaacgc cctgcgctac gatgccatgg cactgggaaa tcatgaattt 360
gataatggtg tggaaggact gatcgagcca ctcctcaaag aggccaaatt tccaattctg 420
agtgcaaaca ttaaagcaaa ggggccacta gcatctcaaa tatcaggact ttatttgcca 480
tataaagttc ttcctgttgg tgatgaagtt gtgggaatcg ttggatacac ttccaaagaa 540
accccttttc tctcaaatcc agggacaaat ttagtgtttg aagatgaaat cactgcatta 600
caacctgaag tagataagtt aaaaactcta aatgtgaaca aaattattgc actgggacat 660
tcgggttttg aaatggataa actcatcgct cagaaagtga ggggtgtgga cgtcgtggtg 720
ggaggacact ccaacacatt tctttacaca ggcaatccac cttccaaaga ggtgcctgct 780
gggaagtacc cattcatagt cacttctgat gatgggcgga aggttcctgt agtccaggcc 840
tatgcttttg gcaaatacct aggctatctg aagatcgagt ttgatgaaag aggaaacgtc 900
atctcttccc atggaaatcc cattcttcta aacagcagca ttcctgaaga tccaagcata 960
aaagcagaca ttaacaaatg gaggataaaa ttggataatt attctaccca ggaattaggg 1020
aaaacaattg tctatctgga tggctcctct caatcatgcc gctttagaga atgcaacatg 1080
ggcaacctga tttgtgatgc aatgattaac aacaacctga gacacacgga tgaaatgttc 1140
tggaaccacg tatccatgtg cattttaaat ggaggtggta tccggtcgcc cattgatgaa 1200
cgcaacaatg gcacaattac ctgggagaac ctggctgctg tattgccctt tggaggcaca 1260
tttgacctag tccagttaaa aggttccacc ctgaagaagg cctttgagca tagcgtgcac 1320
cgctacggcc agtccactgg agagttcctg caggtgggcg gaatccatgt ggtgtatgat 1380
ctttcccgaa aacctggaga cagagtagtc aaattagatg ttctttgcac caagtgtcga 1440
gtgcccagtt atgaccctct caaaatggac gaggtatata aggtgatcct cccaaacttc 1500
ctggccaatg gtggagatgg gttccagatg ataaaagatg aattattaag acatgactct 1560
ggtgaccaag atatcaacgt ggtttctaca tatatctcca aaatgaaagt aatttatcca 1620
gcagttgaag gtcggatcaa gttttccaca ggaagtcact gccatggaag cttttcttta 1680
atatttcttt cactttgggc agtgatcttt gttttatacc aatag 1725
<210> 10
<211> 57
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 10
gctactaact tcagcctgct gaagcaggct ggagacgtgg aggagaaccc tggacct 57
<210> 11
<211> 9
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 11
ggaagcgga 9
<210> 12
<211> 5025
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 12
ggaagcggag ctactaactt cagcctgctg aagcaggctg gagacgtgga ggagaaccct 60
ggacctatgt gtccccgagc cgcgcgggcg cccgcgacgc tactcctcgc cctgggcgcg 120
gtgctgtggc ctgcggctgg cgcctgggag cttacgattt tgcacaccaa cgacgtgcac 180
agccggctgg agcagaccag cgaggactcc agcaagtgcg tcaacgccag ccgctgcatg 240
ggtggcgtgg ctcggctctt caccaaggtt cagcagatcc gccgcgccga acccaacgtg 300
ctgctgctgg acgccggcga ccagtaccag ggcactatct ggttcaccgt gtacaagggc 360
gccgaggtgg cgcacttcat gaacgccctg cgctacgatg ccatggcact gggaaatcat 420
gaatttgata atggtgtgga aggactgatc gagccactcc tcaaagaggc caaatttcca 480
attctgagtg caaacattaa agcaaagggg ccactagcat ctcaaatatc aggactttat 540
ttgccatata aagttcttcc tgttggtgat gaagttgtgg gaatcgttgg atacacttcc 600
aaagaaaccc cttttctctc aaatccaggg acaaatttag tgtttgaaga tgaaatcact 660
gcattacaac ctgaagtaga taagttaaaa actctaaatg tgaacaaaat tattgcactg 720
ggacattcgg gttttgaaat ggataaactc atcgctcaga aagtgagggg tgtggacgtc 780
gtggtgggag gacactccaa cacatttctt tacacaggca atccaccttc caaagaggtg 840
cctgctggga agtacccatt catagtcact tctgatgatg ggcggaaggt tcctgtagtc 900
caggcctatg cttttggcaa atacctaggc tatctgaaga tcgagtttga tgaaagagga 960
aacgtcatct cttcccatgg aaatcccatt cttctaaaca gcagcattcc tgaagatcca 1020
agcataaaag cagacattaa caaatggagg ataaaattgg ataattattc tacccaggaa 1080
ttagggaaaa caattgtcta tctggatggc tcctctcaat catgccgctt tagagaatgc 1140
aacatgggca acctgatttg tgatgcaatg attaacaaca acctgagaca cacggatgaa 1200
atgttctgga accacgtatc catgtgcatt ttaaatggag gtggtatccg gtcgcccatt 1260
gatgaacgca acaatggcac aattacctgg gagaacctgg ctgctgtatt gccctttgga 1320
ggcacatttg acctagtcca gttaaaaggt tccaccctga agaaggcctt tgagcatagc 1380
gtgcaccgct acggccagtc cactggagag ttcctgcagg tgggcggaat ccatgtggtg 1440
tatgatcttt cccgaaaacc tggagacaga gtagtcaaat tagatgttct ttgcaccaag 1500
tgtcgagtgc ccagttatga ccctctcaaa atggacgagg tatataaggt gatcctccca 1560
aacttcctgg ccaatggtgg agatgggttc cagatgataa aagatgaatt attaagacat 1620
gactctggtg accaagatat caacgtggtt tctacatata tctccaaaat gaaagtaatt 1680
tatccagcag ttgaaggtcg gatcaagttt tccacaggaa gtcactgcca tggaagcttt 1740
tctttaatat ttctttcact ttgggcagtg atctttgttt tataccaata gggcgcgccc 1800
agggagtctc cttgtccttg atgtcaaact acatttttct tccagtgata ttcatatctg 1860
cctctggaga cctggctttg taacagcact catcatcctc aaggttccta gcagatgttc 1920
ttcacaagga agagactgta acatcatttg ttggggccag caactcagtg agcagataga 1980
aagtcacagt gaaccaacag ggtccttctg gcagggagtg ggtaagggga aacaactaga 2040
tgtagcttgc atagccacat aacacatctg gttaccattt ccctttctat tcatttctaa 2100
tccatcaaac aattgatgtt tacatacaac ttcatcatcg ccagctctgg tggcacatgc 2160
ctgtggtcac ggcacttggc agggaggaga ggatggctgc aagttctagg ccagcctgac 2220
atatgtagag tttcaggcca gtcagctaga tatcaagact cacacacaca aacaaacatt 2280
ataatttaca agtagatttt tgtagacaag tcttatgata agctaagcag aaagggtcga 2340
cttgttcaag gtcagacatc tccaaataat acatctaaga tctgtacctg agtcttttta 2400
cctcaagtcc aacctctcgc ctactgtcaa gtctcctctc ttcctgcggt tggtctcaga 2460
caacaaatct gtttctgcct tccatagtac cctttctttt gggctcttgt tgtctctcag 2520
gtttgagaga gtagctactg gacaggattc tttcctacaa caccatgtgc ctttggtgag 2580
tcatagacaa actgtataca gcagataata gattagtcca gggactgcaa aaggcagtca 2640
gggacaggca ggaaagggaa aggagaatat gaccaggact tacagaagag agtagatggg 2700
ttccataaac actagttaag gagcagaaca tggctccaaa gcacaacact ggttcttatt 2760
aactcctgtg cgactccaaa agccctgcct ctctccttcc ttcttcctca catggaggtg 2820
ccatgagaga acttcctaaa ataaactgta attctcaacc tgcacctgtc ctgtccaaga 2880
tgccaacacg gctctcattc aagtcacaca tgtacaatcc aatattctgg aaggcaagag 2940
tgaatttaag aagtggtata cataaatgtt tgaagtcact gagactcaaa tcccacacaa 3000
ccactgtaag gcatactcag gtcaagacat gagaagacca gcaggacagt gagccctatg 3060
ggaggcccaa gagataccca gactgatgaa atgatctgtg cccacagaac accagaccaa 3120
gacagtcctg tgaccaagtg agcatagtca gcactgtaca gtagttaaga atccttaact 3180
ataaaccagc agctagatgt ctggacaggg aacatgttgg tttcctagtg tttacgaata 3240
ttaaggactc ttgacccaaa ccgtttaaaa ttccaatcct tttatgaagt ttgttgcagt 3300
gaaacaatgc cttcagttct ttttgaatgt gtagattagt tataaacaga aatgtcagta 3360
agactgtaaa aaaaaatgaa gaaaggtgca tatgaggtat atttttatgc ttggcgagta 3420
aataaggaga aatcttatag cataattttt aaagaacatt tgtataattt ttctaaatat 3480
gtgtatatgt atattttatg cagcagtatt gagaaatgac cttggactat cttgtaattg 3540
taaattccaa aaataaagct gaagatatag tcctttgtct ttggtcttac ttcttctctt 3600
aacacttgta ccctcaggga tgcagcaaag acaatgttca cttcaaaact gagacgaaca 3660
atgctctaga attaagggtt ccggatcctc ggggacacca aatatggcga tctcggcctt 3720
ttcgtttctt ggagctggga catgtttgcc atcgatccat ctaccaccag aacggccgtt 3780
agatctgctg ccaccgttgt ttccaccgaa gaaaccaccg ttgccgtaac caccacgacg 3840
gttgttgcta aagaagctgc caccgccacg gccaccgttg tagccgccgt tgttgttatt 3900
gtagttgctc atgttatttc tggcacttct tggttttcct cttaagtgag gaggaacata 3960
accattctcg ttgttgtcgt tgatgcttaa attttgcact tgttcgctca gttcagccat 4020
aatatgaaat gcttttcttg ttgttcttac ggaataccac ttgccaccta tcaccacaac 4080
taactttttc ccgttcctcc atctctttta tatttttttt ctcgagggat ctttgtgaag 4140
gaaccttact tctgtggtgt gacataattg gacaaactac ctacagagat ttaaagctct 4200
aaggtaaata taaaattttt aagtgtataa tgtgttaaac tactgattct aattgtttgt 4260
gtattttaga ttccaaccta tggaactgat gaatgggagc agtggtggaa tgcctttaat 4320
gaggaaaacc tgttttgctc agaagaaatg ccatctagtg atgatgaggc tactgctgac 4380
tctcaacatt ctactcctcc aaaaaagaag agaaaggtag aagaccccaa ggactttcct 4440
tcagaattgc taagtttttt gagtcatgct gtgtttagta atagaactct tgcttgcttt 4500
gctatttaca ccacaaagga aaaagctgca ctgctataca agaaaattat ggaaaaatat 4560
tctgtaacct ttataagtag gcataacagt tataatcata acatactgtt ttttcttact 4620
ccacacaggc atagagtgtc tgctattaat aactatgctc aaaaattgtg tacctttagc 4680
tttttaattt gtaaaggggt taataaggaa tatttgatgt atagtgcctt gactagagat 4740
cataatcagc cataccacat ttgtagaggt tttacttgct ttaaaaaacc tcccacacct 4800
ccccctgaac ctgaaacata aaatgaatgc aattgttgtt gttaacttgt ttattgcagc 4860
ttataatggt tacaaataaa gcaatagcat cacaaatttc acaaataaag catttttttc 4920
actgcattct agttgtggtt tgtccaaact catcaatgta tcttatcatg tctggatctg 4980
acatggtaag taagcttggg ctgcaggtcg agggacctaa cgcgt 5025
<210> 13
<211> 7174
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 13
ggaagcggag ctactaactt cagcctgctg aagcaggctg gagacgtgga ggagaaccct 60
ggacctatgt gtccccgagc cgcgcgggcg cccgcgacgc tactcctcgc cctgggcgcg 120
gtgctgtggc ctgcggctgg cgcctgggag cttacgattt tgcacaccaa cgacgtgcac 180
agccggctgg agcagaccag cgaggactcc agcaagtgcg tcaacgccag ccgctgcatg 240
ggtggcgtgg ctcggctctt caccaaggtt cagcagatcc gccgcgccga acccaacgtg 300
ctgctgctgg acgccggcga ccagtaccag ggcactatct ggttcaccgt gtacaagggc 360
gccgaggtgg cgcacttcat gaacgccctg cgctacgatg ccatggcact gggaaatcat 420
gaatttgata atggtgtgga aggactgatc gagccactcc tcaaagaggc caaatttcca 480
attctgagtg caaacattaa agcaaagggg ccactagcat ctcaaatatc aggactttat 540
ttgccatata aagttcttcc tgttggtgat gaagttgtgg gaatcgttgg atacacttcc 600
aaagaaaccc cttttctctc aaatccaggg acaaatttag tgtttgaaga tgaaatcact 660
gcattacaac ctgaagtaga taagttaaaa actctaaatg tgaacaaaat tattgcactg 720
ggacattcgg gttttgaaat ggataaactc atcgctcaga aagtgagggg tgtggacgtc 780
gtggtgggag gacactccaa cacatttctt tacacaggca atccaccttc caaagaggtg 840
cctgctggga agtacccatt catagtcact tctgatgatg ggcggaaggt tcctgtagtc 900
caggcctatg cttttggcaa atacctaggc tatctgaaga tcgagtttga tgaaagagga 960
aacgtcatct cttcccatgg aaatcccatt cttctaaaca gcagcattcc tgaagatcca 1020
agcataaaag cagacattaa caaatggagg ataaaattgg ataattattc tacccaggaa 1080
ttagggaaaa caattgtcta tctggatggc tcctctcaat catgccgctt tagagaatgc 1140
aacatgggca acctgatttg tgatgcaatg attaacaaca acctgagaca cacggatgaa 1200
atgttctgga accacgtatc catgtgcatt ttaaatggag gtggtatccg gtcgcccatt 1260
gatgaacgca acaatggcac aattacctgg gagaacctgg ctgctgtatt gccctttgga 1320
ggcacatttg acctagtcca gttaaaaggt tccaccctga agaaggcctt tgagcatagc 1380
gtgcaccgct acggccagtc cactggagag ttcctgcagg tgggcggaat ccatgtggtg 1440
tatgatcttt cccgaaaacc tggagacaga gtagtcaaat tagatgttct ttgcaccaag 1500
tgtcgagtgc ccagttatga ccctctcaaa atggacgagg tatataaggt gatcctccca 1560
aacttcctgg ccaatggtgg agatgggttc cagatgataa aagatgaatt attaagacat 1620
gactctggtg accaagatat caacgtggtt tctacatata tctccaaaat gaaagtaatt 1680
tatccagcag ttgaaggtcg gatcaagttt tccacaggaa gtcactgcca tggaagcttt 1740
tctttaatat ttctttcact ttgggcagtg atctttgttt tataccaata gggcgcgccc 1800
agggagtctc cttgtccttg atgtcaaact acatttttct tccagtgata ttcatatctg 1860
cctctggaga cctggctttg taacagcact catcatcctc aaggttccta gcagatgttc 1920
ttcacaagga agagactgta acatcatttg ttggggccag caactcagtg agcagataga 1980
aagtcacagt gaaccaacag ggtccttctg gcagggagtg ggtaagggga aacaactaga 2040
tgtagcttgc atagccacat aacacatctg gttaccattt ccctttctat tcatttctaa 2100
tccatcaaac aattgatgtt tacatacaac ttcatcatcg ccagctctgg tggcacatgc 2160
ctgtggtcac ggcacttggc agggaggaga ggatggctgc aagttctagg ccagcctgac 2220
atatgtagag tttcaggcca gtcagctaga tatcaagact cacacacaca aacaaacatt 2280
ataatttaca agtagatttt tgtagacaag tcttatgata agctaagcag aaagggtcga 2340
cttgttcaag gtcagacatc tccaaataat acatctaaga tctgtacctg agtcttttta 2400
cctcaagtcc aacctctcgc ctactgtcaa gtctcctctc ttcctgcggt tggtctcaga 2460
caacaaatct gtttctgcct tccatagtac cctttctttt gggctcttgt tgtctctcag 2520
gtttgagaga gtagctactg gacaggattc tttcctacaa caccatgtgc ctttggtgag 2580
tcatagacaa actgtataca gcagataata gattagtcca gggactgcaa aaggcagtca 2640
gggacaggca ggaaagggaa aggagaatat gaccaggact tacagaagag agtagatggg 2700
ttccataaac actagttaag gagcagaaca tggctccaaa gcacaacact ggttcttatt 2760
aactcctgtg cgactccaaa agccctgcct ctctccttcc ttcttcctca catggaggtg 2820
ccatgagaga acttcctaaa ataaactgta attctcaacc tgcacctgtc ctgtccaaga 2880
tgccaacacg gctctcattc aagtcacaca tgtacaatcc aatattctgg aaggcaagag 2940
tgaatttaag aagtggtata cataaatgtt tgaagtcact gagactcaaa tcccacacaa 3000
ccactgtaag gcatactcag gtcaagacat gagaagacca gcaggacagt gagccctatg 3060
ggaggcccaa gagataccca gactgatgaa atgatctgtg cccacagaac accagaccaa 3120
gacagtcctg tgaccaagtg agcatagtca gcactgtaca gtagttaaga atccttaact 3180
ataaaccagc agctagatgt ctggacaggg aacatgttgg tttcctagtg tttacgaata 3240
ttaaggactc ttgacccaaa ccgtttaaaa ttccaatcct tttatgaagt ttgttgcagt 3300
gaaacaatgc cttcagttct ttttgaatgt gtagattagt tataaacaga aatgtcagta 3360
agactgtaaa aaaaaatgaa gaaaggtgca tatgaggtat atttttatgc ttggcgagta 3420
aataaggaga aatcttatag cataattttt aaagaacatt tgtataattt ttctaaatat 3480
gtgtatatgt atattttatg cagcagtatt gagaaatgac cttggactat cttgtaattg 3540
taaattccaa aaataaagct gaagatatag tcctttgtct ttggtcttac ttcttctctt 3600
aacacttgta ccctcaggga tgcagcaaag acaatgttca cttcaaaact gagacgaaca 3660
atgctctaga attaagggtt ccggatcctc ggggacacca aatatggcga tctcggcctt 3720
ttcgtttctt ggagctggga catgtttgcc atcgatccat ctaccaccag aacggccgtt 3780
agatctgctg ccaccgttgt ttccaccgaa gaaaccaccg ttgccgtaac caccacgacg 3840
gttgttgcta aagaagctgc caccgccacg gccaccgttg tagccgccgt tgttgttatt 3900
gtagttgctc atgttatttc tggcacttct tggttttcct cttaagtgag gaggaacata 3960
accattctcg ttgttgtcgt tgatgcttaa attttgcact tgttcgctca gttcagccat 4020
aatatgaaat gcttttcttg ttgttcttac ggaataccac ttgccaccta tcaccacaac 4080
taactttttc ccgttcctcc atctctttta tatttttttt ctcgagggat ctttgtgaag 4140
gaaccttact tctgtggtgt gacataattg gacaaactac ctacagagat ttaaagctct 4200
aaggtaaata taaaattttt aagtgtataa tgtgttaaac tactgattct aattgtttgt 4260
gtattttaga ttccaaccta tggaactgat gaatgggagc agtggtggaa tgcctttaat 4320
gaggaaaacc tgttttgctc agaagaaatg ccatctagtg atgatgaggc tactgctgac 4380
tctcaacatt ctactcctcc aaaaaagaag agaaaggtag aagaccccaa ggactttcct 4440
tcagaattgc taagtttttt gagtcatgct gtgtttagta atagaactct tgcttgcttt 4500
gctatttaca ccacaaagga aaaagctgca ctgctataca agaaaattat ggaaaaatat 4560
tctgtaacct ttataagtag gcataacagt tataatcata acatactgtt ttttcttact 4620
ccacacaggc atagagtgtc tgctattaat aactatgctc aaaaattgtg tacctttagc 4680
tttttaattt gtaaaggggt taataaggaa tatttgatgt atagtgcctt gactagagat 4740
cataatcagc cataccacat ttgtagaggt tttacttgct ttaaaaaacc tcccacacct 4800
ccccctgaac ctgaaacata aaatgaatgc aattgttgtt gttaacttgt ttattgcagc 4860
ttataatggt tacaaataaa gcaatagcat cacaaatttc acaaataaag catttttttc 4920
actgcattct agttgtggtt tgtccaaact catcaatgta tcttatcatg tctggatctg 4980
acatggtaag taagcttggg ctgcaggtcg agggacctaa cgcgtcacga atggtcgatg 5040
gctgtgtctc ccttagtctg ctgtttggag aaactctttt tacactgggc tttgcagact 5100
tccctcttga aacctcttta tgagactttt gcatttggga cactttcatt ttagtgccgg 5160
ttcttagaaa ctaacctgag agcatctagg tcaactggag gactgccttg caatagagtg 5220
ctgggtgaga cttcaggacg ccatgtcggt ggtcccctcg tgaccactga agacttgcta 5280
agtttgggct ccaggcttgg aggccttgtt tatgtctcag cataggattc ccagggccta 5340
tgctacgaag ttcctattct ctagaaagta taggaacttc aggtctgaag aggagtttac 5400
gtccagccaa gctagcttgg ctgcaggtcg tcgaaattct accgggtagg ggaggcgctt 5460
ttcccaaggc agtctggagc atgcgcttta gcagccccgc tgggcacttg gcgctacaca 5520
agtggcctct ggcctcgcac acattccaca tccaccggta ggcgccaacc ggctccgttc 5580
tttggtggcc ccttcgcgcc accttctact cctcccctag tcaggaagtt cccccccgcc 5640
ccgcagctcg cgtcgtgcag gacgtgacaa atggaagtag cacgtctcac tagtctcgtg 5700
cagatggaca gcaccgctga gcaatggaag cgggtaggcc tttggggcag cggccaatag 5760
cagctttgct ccttcgcttt ctgggctcag aggctgggaa ggggtgggtc cgggggcggg 5820
ctcaggggcg ggctcagggg cggggcgggc gcccgaaggt cctccggagg cccggcattc 5880
tgcacgcttc aaaagcgcac gtctgccgcg ctgttctcct cttcctcatc tccgggcctt 5940
tcgacctgca gcctgttgac aattaatcat cggcatagta tatcggcata gtataatacg 6000
acaaggtgag gaactaaacc atgggatcgg ccattgaaca agatggattg cacgcaggtt 6060
ctccggccgc ttgggtggag aggctattcg gctatgactg ggcacaacag acaatcggct 6120
gctctgatgc cgccgtgttc cggctgtcag cgcaggggcg cccggttctt tttgtcaaga 6180
ccgacctgtc cggtgccctg aatgaactgc aggacgaggc agcgcggcta tcgtggctgg 6240
ccacgacggg cgttccttgc gcagctgtgc tcgacgttgt cactgaagcg ggaagggact 6300
ggctgctatt gggcgaagtg ccggggcagg atctcctgtc atctcacctt gctcctgccg 6360
agaaagtatc catcatggct gatgcaatgc ggcggctgca tacgcttgat ccggctacct 6420
gcccattcga ccaccaagcg aaacatcgca tcgagcgagc acgtactcgg atggaagccg 6480
gtcttgtcga tcaggatgat ctggacgaag agcatcaggg gctcgcgcca gccgaactgt 6540
tcgccaggct caaggcgcgc atgcccgacg gcgatgatct cgtcgtgacc catggcgatg 6600
cctgcttgcc gaatatcatg gtggaaaatg gccgcttttc tggattcatc gactgtggcc 6660
ggctgggtgt ggcggaccgc tatcaggaca tagcgttggc tacccgtgat attgctgaag 6720
agcttggcgg cgaatgggct gaccgcttcc tcgtgcttta cggtatcgcc gctcccgatt 6780
cgcagcgcat cgccttctat cgccttcttg acgagttctt ctgaggggat caattctcta 6840
gagctcgctg atcagcctcg actgtgcctt ctagttgcca gccatctgtt gtttgcccct 6900
cccccgtgcc ttccttgacc ctggaaggtg ccactcccac tgtcctttcc taataaaatg 6960
aggaaattgc atcgcattgt ctgagtaggt gtcattctat tctggggggt ggggtggggc 7020
aggacagcaa gggggaggat tgggaagaca atagcaggca tgctggggat gcggtgggct 7080
ctatggcttc tgaggcggaa agaaccagct ggggctcgac tagagcttgc ggaacccttc 7140
gaagttccta ttctctagaa agtataggaa cttc 7174
<210> 14
<211> 25
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 14
gcagtaaatg agagctgcgt gaggg 25
<210> 15
<211> 3732
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 15
ggcagtgttg gggtaggtta tggagagaga agaaaaatgt ctggaagccc gggagcctgg 60
catggctgtc ctctgagagg ccctactgat tgagcatctg actgagacag aatcagatgc 120
ttacacccaa ccattgggct gaagttgggg atccctatga ttgaattagg ggaaggattg 180
aagaagctga aggggagggc aaccctatag gaagaccaac agtttcaact aaaccagacc 240
agaccccaag gagctcccag agactgagcc accaacaagg cagcatacac cagcaggtca 300
aagatccacc ctggcacata tgtagcagag gtctacctag tctggcctca gtgggagaag 360
atttgcttga tccttgagag acttgagacc ccaggaaagg aggaggccgg gtgcagtgca 420
ggggaagcac tctttcagag tcaaggagga agagggggaa tgggatgagg aactgtggga 480
ggggggacag agggagggca atgcctggaa tgtaaataag taaaataatt aaaaaaaaga 540
aaaggaaaag aaaaaaaaat gtctggaagc catatatgag atgtaaagac tattgaaaat 600
ttatgagttg tggctgaaca aagcaggtgg acagcatcct accgcaaggg gccagataga 660
aagccacccc agagttccag cagactctgt gggatacaaa ggtccagtgg taaccactgg 720
accaaggcac acactctcag tccggtgaca gtgagccagc cttatcttca tagcgttttg 780
ccccaccctg cttactttag gaacaagagt tttataagtt ggactactca ggcagtgaca 840
tcactagctc agcatgccaa gggatcactg acttagtata catgcaggaa gtaggttctt 900
ttctagataa cactagtttg ccccaaaccc agtgtcttga aagggagtca gttgacctcc 960
agatagccct tatcaaaatg ttcttccttc attgtttcct tgagtgggta ggagaacggg 1020
gaatgttgat gctccacttc cctgtgtgat cacacaccac tgaacattca ataatatctt 1080
caaattatgt atttgcatct tgtgaattcc tacagacttg aggatgtagc tcggttggta 1140
gagtgattgt ctggcataca ggaagcccag ggttcaatcc ccactactgt gtgaactgca 1200
cagtgacaca tgcttttaac cccaacacta gaaagacaga cacaagagga aagagctcca 1260
ggccatagat gacagtacaa tgagctggag ggcatcttga actgcacgta accttgtcaa 1320
agaagaagaa aaagatttct acagttcaat taaagcagat gcaaatgaca gggacagaag 1380
gacaacaaaa tggcttcatg ggtcaaggta cttacaagcc tcgtgacctg agtgtcatcg 1440
ctagaactta cattgtgaga aaaatctgcc ctgaaaattg tcttctgatc tttacacatg 1500
tgttatatgt gcacactgca cacatacaca cgagcgaaat taataaataa atgcattttt 1560
ttcaattggg gctttccccc cctccctttt ttgagacagg gtttctctgt gtagccctgg 1620
ctatcctgtt actcattctg tagaccaggc taccctcaaa ctcagagatt tgcttgcctt 1680
tgcctcacaa gcactgggat taaaggtatg tgccatcacc accaaactaa taaaaaaaaa 1740
tttaaaggaa atgtctacaa gaaaggacac aaatggtcca taagtttcta tttgtgtttc 1800
ccagttggtt tgacatgctg accatctttc aaaccaactg ggaaacacac atagaaacta 1860
cagtgagata tcacatgtgc cataaggtga ctatcattta agacaaaaaa aaaatagagt 1920
tggtgaagat gtacagagac ttttcgttgc atgtgagaac aagcaacata gcaggtcctt 1980
acttcacctc ccaactaaac tatttcatca gccgtgaacc ctggtctgat tctggtttca 2040
taccttatga ataagctgct atgaatgtga gcattcaaat acacacctag tgtgaagtgt 2100
tggcttttga aagaaaggaa atgctgacac actgagtctt gaaggcattg caccaagtaa 2160
aacaaattcc gtcagaaaag aacaactatg aggtccttag aatgatcaaa tctatacaaa 2220
ctggtaatgc aatgttgact gccagaggct ggaaaggaag gaaatggaga ggtggttttg 2280
gttagttggt agtttagttt gtttgtgttg ttgttgttgc tgctgctgtt gttgaggcag 2340
gcttcactaa ctcactacat gaccagactg ttttcctgcc gggactccca agaactgtaa 2400
ttataggtga ttgccaccgt gttgtgtggg agtcactgtt taatgagttt acaatgcaat 2460
cttggaaaga taaaaatgtt ctgggaaatg aatgatggca aatatctgta aacttaggac 2520
agtacgcggt aatgcacatc ctgtatatat gatgtatatt ttattgcaat tgaaaagcaa 2580
cttggttgtg ggtgagatag ggcctcacta tgtaccccat actttcaact ctcaactcag 2640
tctcctgagt gctaggatta caggtatgcc ccaatgtagt gtgtgtgtgt gtgtgtgtgt 2700
gtgtgtgtgt gtgtgtgtaa caatgcagac ctggtgcatt ctggaattgt gtgggcatta 2760
gataacttct ataatgacca gaacacacgg gaacctcacc aatactcttt cctttctcag 2820
aactactagc gtgttgacta tacttcattc aaacacaggg gttaatccat tagacaggga 2880
atttgtgctg tggcgattta tatttaatga tttgttcata acaaatatgg atccccacta 2940
ttactgctaa aagctttaca caaacacaca cacacacaca cacacacaca aacacacaca 3000
cacacaaaca cacacacacc acaaaaggta agggaggagc gccccagagt agctgtcagg 3060
tttacccgct aggatccgcc ttagctttca gagctggccc tatgccaccg ctgagggtcc 3120
caaagattgc tttgctcctg agattccagc tcgaatacgt ggagcggttg cgtcctcaaa 3180
cctaagtagc gtctcggccg gtgtgtcccg gtcccaccga agcaaccttc cagcgcacct 3240
ctcaggcggg ctctggaccc ctctcactcc cacgtttgtc ttcctttcct tccccccccc 3300
accccccccg ccccagcccc ttctttttcg gcgaccgagc caccgagcaa cacaagccgc 3360
tgtacactct tctggagaaa tcggtgtaat agagagtgag gatccggaac gctgcccctg 3420
cagttgtcac cgccgcccgc ccggggactg cggcacccag cctctcggac ctgctcgctc 3480
accaggcgaa caacctcctc tctgttccct gcgggctgcg tgtgctgggc cagaggctcc 3540
cgggtgcctc gggtgccagg gtcgcatcgg gtgactcggg gagtgtgtct gcaagcggga 3600
gggcctgggg ctggaagagg cgggtctgtc cccgccccgc cctcctctac tccgcagttt 3660
agtagaggcc ccggtggccg gtgctcactg ggctcagggc cagtccaccc gctcaacgcg 3720
ctctagccag cc 3732
<210> 16
<211> 4951
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 16
aagtggcatc caaattttct tgctcaacct aaaaccacac ttaacatttt tattttagct 60
aacaaaatgt ggattattta tggtgtataa catggtgttt taagatgtgt accaacactg 120
tggaatggct gtgtcaagct aattagcaat gccttctctg tgcttgctct gaatttctca 180
gtggctatat ttctacaaga gtccttttcc ctgttcctcc tttgtatttc ctctctggta 240
ttccctttgt cttcttgata tacctctcta ttgtgattta gaaaactaaa cgtgtgatac 300
tgggctgaag agccagctca atggttaaga gcacctgacg cttttgcaga ggacccaggt 360
tcaaatccca gctcacaacc atatgggatt tagggccatg ttcttgcctt tgagcaccat 420
gcatgcaagt ggtacacaga caaatatgca gtcaaaacac acacgagcat aaaaaaatta 480
ccattatgat agtaagctga ggttggctta cacaatcacc tatgtatagc agcagtcagc 540
aggagacata cactgtggtg taccaacact tctttcactc tggcagttaa ctatacaaaa 600
actgagagat caccacaggc cacattcctg cctccaaatt cctgtccact ctgtctcagc 660
agcattttcc cataacacat cttctgtctg ccacattctt tgtgcctggt aatgctgcga 720
tcggacagga agctcaggga tgagtaagac atagctctgt gcccacagct tagtaaagag 780
agaaatccaa gcttctctct gtctaggagg cattcacact cttactttgg taggttctat 840
gcctcttctg gtccatctac tttaatacaa aagctatttt acaagactct agccacagaa 900
ggaaatatct aacctactga ctatagtgcc actcatcttg attgaagtta ttgatccatg 960
gataagaatt gttacccaaa ccaagacagt aagaagcctg cagtgacctt tctaatgtat 1020
atgtaaatgt aagacaagct tttccctgcc tcctgtgcag tgattccata gttgtcttat 1080
ctgtctcccc tacataaagg cacttgaaaa acagtagtca acaggtctga agctgagaca 1140
aaaatgacgt ggaaacccca tatgattcaa gagctgtaaa gctcatacta aagtcagctc 1200
tattcccacc cctctgtggc tggattacat aaggatgcac tttgtattct ctcaaaacaa 1260
gactgccagc tctgtcttct gcaaccgaga acccaaacca gctccaatta caagagaaca 1320
tccaaaagcg tgcctgaaca cacacatagg tgggcttaag aggtaggagt caaacctcaa 1380
cattctagat gttaagtaaa acctctcaaa cccctcaggg aaaagtacac agtttccttc 1440
tcatactact aaaatgtagc agaaaagcta gaaataacta aataggtcac acagtgcttg 1500
gagcttggag ctcattccct tgcagattta atttcaatta agtctctcat tttaccagac 1560
ttctgtgata ccctttacat cttgggctgg agagttcaca cagctgttaa aagtgatggc 1620
tgctcttgca gaggaccagg gttcaatttc cactcatgtg gtacctcaca agcatctgta 1680
actccagccc caggagatcc aatgccctct ccaacctcct caggcaccag acatacacat 1740
agtgtacata cacataaaat aaatcttaaa aatcatacta attaaatctt ttcaagtatg 1800
gatactagca ataatatatg taaataaata tatatgtggg tacagtgtaa tgtgtagaaa 1860
agagaacagg aaaggctaag tattagaggc caagaatata gattgaatat aggcaaaaga 1920
aatgtgttat ggaagtggat ataaagctaa atgtttttct aagtctaact ttgtcaaaag 1980
aatttcattt ctggtgctag ataagtacat ggaaatatat ttgatactga aagtataaga 2040
tccgatgtaa agcaccatgg ctgccgtccc agtatctatc agtgtgtgtc tttttttttt 2100
tacaacattt tttataataa agtgtattta aagaactttt aaaatgaact cttttgaaat 2160
attatggcac atgcatatta attccagtac tcatgagttc aaggacagcc ttgggctatg 2220
cagcaagttc tagtctatca taaaactctg tctcaaaacc caaaacatga accatgcaaa 2280
taaagtacct tggcaacatc cttttcaatg gtgagctctt gggccttgcc tccatgtcca 2340
caacgatgca catttctgca tgctgcactg tgccatggca cgcatgttga ggtcagagga 2400
caagccggag gggtctgctc tcttcttccg ccatgtggat tccagggatc aaactcaggt 2460
ccctagctta gcagcaaagt gattttcctt acagagaaat cttctcaggc tctcctttac 2520
ttttcttacg gtcatcaatt ctgttgcagt atggtcccac ccaaatgact cttttaatct 2580
aagtttcctc ctgaaaggct ctgcctccaa atatagtcac tgtgagattt atagttttaa 2640
cataggaatt tggaaaagac ataattcaga ccgtaacaat gagtgaggcc tacttataga 2700
aaatgtgata tataatatat atataaatgt tatatgtgta catatgtata acatatatac 2760
acattagtca tacaaagtta aataagtcag tcatattata aaaggaaaaa gaataattcc 2820
tatatctgca agatgttttg tatatagcat cacattgcat ttccatggta actgtgtggc 2880
cttgtcttgt aggcgatgat gttaggccag aaaactatca aggtggaagt ggatattagc 2940
ccagaaactt ggaataccta aaatacaatt tgcaaaacac ataaaattca agaagaatga 3000
agaccaaagt gtggacactt tgtccctcct tagaatgggg aacaaaatac ccatggaagg 3060
agttacagag acaaagtttg gagctgagac gaaaggatgg accatctaga gactgctaca 3120
tctggggatc catcccataa tcagcctcca aatgctgaca ccattgcata cgccagcaag 3180
attttgctga aaggacccag atatagctgt ctcttgtgag gctatgctgg tgcctggcaa 3240
acacagaagt ggatgctcac agtcagctat tggatggaac acagggcccc caatggagga 3300
gctagagaaa gtacccaaga aactaaaggg gtctgcaacc ctataggtgg aacaacaata 3360
tgaactaacc tgtacccccg gatctcgtgt ctctagctgc atatgaatca gaagatggcc 3420
tagtcggtca tcattgggaa gaggcgcccc ttggtcttgc aaactttata tgcctcagta 3480
caggggaata ccagggccaa gaagtgggag tgagtgggta ggggagttgg gggagggagg 3540
atatggggga cttttgggat agcatttgaa atgtaaatga agaaaatacc taattaaaaa 3600
aaaaaagaag aagaagaaaa tgtgttgtgt atgtatatac atgaaccagc atggctagac 3660
acacagaaga gagactttga accttggccc acttgcttct tcacactaaa tacaattgca 3720
tttgacgcaa aagaatacaa agactgtggg tcttacttgt ttgctttcat tcaaaaacag 3780
caacacaggg ctgcagaatg gttcagtaga caatgtgctt gccatgcaag aatgatgacc 3840
aaagtttgaa tttccagtgt ctacataaaa gtcagaaggt ttgtgggcat gcctataatc 3900
ccagtactcc ggaggcagag actggagagg gcatgagcag caaactaaaa gctccagagg 3960
cagcgagaga gcctgcctca gtaaacaaag tagacagtca tagagaaaga cacccagtac 4020
caacctcagg tcgccagaca aacgtgcaca cgtgggcaag agcatccaca cacatgtact 4080
caaacacaca caaacacatg cacactgcac atacagagag ggagggagag agaaggagag 4140
agagggagag agactcaaaa agccctagca ataagtggca tatgttttag taaaatcaag 4200
acagtaaata caaaaggccg taatacctaa accacttttt attgctattt gttagtcttt 4260
attaaaacaa aaaatatatt ccaaacacta aagtacatga tatttataaa aatatacaaa 4320
gtgaaaagta catttctatt attctggtat gttattctat accaagtgtt tacatccagg 4380
atgtcatgga ggtagcttcc ttttgtacct aaagtaaaaa taaatccatt taataggaga 4440
caatagcaca aggttcattt ttaattccta aaagctctac tatagtaata ctctatgtga 4500
atgtgcgtgt gtgcacacac acacacacac acacacacaa acacacacac acacagtaat 4560
ccacaccctt cctggcacaa acagcaatta atagcaatgt tttgtctctg acctaataaa 4620
agcctcagaa ctaaaagctt tctggtattc attctaaagc aatttaacat aaatattaac 4680
atttccaccc tcctttaaaa atacttttaa tactactaaa atttgaagtt tccaatcaat 4740
agatgaacag ccaaaatctg gtttccctac tctatagaag tgagttttca ttacaatgcc 4800
acaaccttga acagttcctg gttttgtttt tcatctaact aaatggatta gtcaaaaagt 4860
aaaggagcta gagagatggt tcagcagtta acagctcctg ctgtgcaatt atgagaacaa 4920
gaaatctgat tccaccaccc atgtgacaag c 4951
<210> 17
<211> 45928
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 17
atgtgtcccc gagccgcgcg ggcgcccgcg acgctactcc tcgccctggg cgcggtgctg 60
tggcctgcgg ctggcgcctg ggagcttacg attttgcaca ccaacgacgt gcacagccgg 120
ctggagcaga ccagcgagga ctccagcaag tgcgtcaacg ccagccgctg catgggtggc 180
gtggctcggc tcttcaccaa ggttcagcag atccgccgcg ccgaacccaa cgtgctgctg 240
ctggacgccg gcgaccagta ccagggcact atctggttca ccgtgtacaa gggcgccgag 300
gtggcgcact tcatgaacgc cctgcgctac gatgccatgg taagacccga gcccgcgccc 360
gggatagtag tcccggactg agagaggagc cgggctggaa aagcagcgga tggcagagtg 420
tggcaagcct aggtccaggg cgcggagaga tgtggggata aagtgagact ccggccagtg 480
tgccagctgg atgcatagaa gtcccttgga cgattcgtct taaacggacg ttattgcgcc 540
cccactatga gatgggagta tttagaggag caggacccag tcctgcctgc gaggggctta 600
aggtggggtg cgtagaagca ctgacagtta tccagagcta gggactgaat cggatccctc 660
acgcagctct catttactgc tagatctttc aaagaatgct ttatctcaat ctagatcttt 720
cgaaaaatgc tttatctcaa tcttattgaa agggaaaccg cgtcgaagaa gctgaataaa 780
ttaacaggca ccatccccgc aaaaagggag acgtgataag aatggacacc agatctcttt 840
gaccccggag cttgagagta attaagctca cagctttggg gtagattctc tgttcaatta 900
ccttctatgt gaccttgagc aagttattta ctctttcttg cctcagttct atcctgtata 960
aaaataagga ttataattgt acatagtgca catagtgtgg tctctatatt attaccagac 1020
acatagtgtg gtgtctatat atgttgttat taccagacac tataatatag ggttaagaaa 1080
aggaaagaaa gaaagaaaga gaaagaaaga aagaaacttc agagagagcc aaaatgtgga 1140
aagacttggg tttcaaagga tttcaataaa gtaggtttgg cggcagagat ttaggaagag 1200
tccaacttaa gagataatgg gctcttaaaa catgtgcaaa agtttagttc aaagtcccaa 1260
atgttaggtt aaaggtaatg ccttttctat tgcactaaat tctagaaagg gagcagaggt 1320
ggttgcacag taataaataa gtacaaccaa gatgtatata tggtggttaa aaaaaaaata 1380
aaaccaccac ccaaatttaa ccctgtgtaa aagccttgag taaagggatt cagcaagact 1440
gtggagaaac ttgggagaag aaaaaagagc tttggaagag gaagaactga gaaatctcca 1500
gagaaacgtc cagagaaatc tgtgaagtcc acaagtgtgg aagggaaagg gaagtgtgat 1560
taggctggga tacagaaggc taacagtaca ctttatatga gaagagctta gaggtcagga 1620
gagcacaatg ctgaggggtt tgtttggttt tattttcccc tctggaaact tggagaaagg 1680
aggacaggaa aagatgataa gagggcacca ttgtttggtt ttaaaggtcc ctccccaacc 1740
ccccacaaca agaaagctca ttttaatgag atgagagaag gggagaaagc tccctccgtt 1800
ctgatgcttt tgagcttatc agggacaccc tgcagcagaa attccagctg atccttcagc 1860
cttcagagtt gatgaggtgg ggctcacaca catctggatt tgaagaagaa tcgtgggttt 1920
caagaactct gtggtctctt aagcattggt attccacagc tacacttccc tgatactttg 1980
cactcagtaa tgacaaaata aatatttgta gaattgagtt gttgctaccc tgtgtctcca 2040
cttcctccca gaagggtaga ataaacttac atgtgtttgt tttattttgc atataaacca 2100
cgactttgta aactttactt tttgtgacac tcttttctcc acaattgtgt gaaccctttt 2160
tagtggaaaa ataagaatga agaattccta gttttcccta ataatatttc cctcttgtaa 2220
gtctagttga gctcctggtg ttaatcatgg tatcaaatgt ttatgacagg acatctaggt 2280
gaggcctgag taaactgctg taagttacag cctatctctt ctgtgtttgt tcagaatcct 2340
ttttgttgga tgaaattact ttctttcaaa ttttgctcaa gaaaattgaa ttgggttctg 2400
aggaaaaata ggctacttac gtaatgttgg cagggggtaa agattgggag cggatcacat 2460
ggcaaggtta gatggctgtg tcctcaggac ttactctgtg cagtttgggg gtgtgtcaaa 2520
catgatttga cgtgcataaa atcccagttc tgtagtgcta aagcatgtgc taaagctctg 2580
ctgtactcca gtattgcaca gatgtttcac tgtcaattgt ttatattata ctaaccctca 2640
gagaatttat tgctgttgtg gtataaacgt tcctccactg taaccactca ccgcatattg 2700
attcctagga cactggggat tcaaatcagg gtggaaaaca gttgtggagg ggcatctcca 2760
gtttcctgga gactggacct agttcaaagg agccacataa aaggactgaa gcatatggag 2820
gacaaagagg aatttcctag aatccagcta aaacacagga gtcccctcga gcaggaactt 2880
ccaaggaggt gtacactttc tagacctact ggagtaatcc tgattgctgt agaggaccca 2940
gggcaatcct ggagaccccc tggaaccact actaggcaga aaactgggaa gaggatggca 3000
aaactctcca gttagtgtag ctgtctcagt ggaaggactg ttgtcccctg aggcacgagg 3060
aagacagcat ggaagcactg acagtatgcg cttggcctgg ttactttctc tgagcttcat 3120
gttccttagc tgtaaaacag gagtgataac ccccatcttg cagaggtttt gtgaggaata 3180
catgagataa tccctgctgg tgctcactga aagtgagttt catgtgccct gggtggggag 3240
gaggggaatt gtgaggaagg gaaggtttgg gttgtgcatc agattatctg attctcacag 3300
cctggcacag tctaccacct gcctgccctc taccatctta cctaagtacc attgcctaaa 3360
ctatttttgc tatagttact cattgagcag acttatgaaa tctccccaca gcacaaaact 3420
gaaggtatcc acctccagag gtgagattct caccaagaca ggctgtgggg aaggtagctt 3480
tgagcaggtc tctagggtcc cctagaaagg tgccttttct ctgcttggct gctgcttctg 3540
aggccctggc tgctgagccc acttcctgcc cataaggctg cagccttagc cctagccaat 3600
tgctcctaaa tgggggaggc agcagccagg gtgggaacca tggcccccgc ctttaactca 3660
cccaggggac aggttagtca gatagatggt cctttgttcg ccaccctcca cctcccacat 3720
agacaacatt ctaatgtgtt tgatggactg catatagttt tgtttctaca tgcactttca 3780
aaacctttgt gtgcttatat tttataatgg catacccagt cctgttattt agccctttct 3840
gtttcttact tttgtcctga acttattgtt tttatgattt gtccatgttg ccctagatgc 3900
ccttatacct atacatcact gtgaattgtg ctctgtgtat tttacccatc tccacgtatt 3960
ttacccatct gctccctggc tgatgggttt ggagatggcg tctaacaccc tgcctccaca 4020
aataatgctg ctccagatat cttcctggat ggtctcacca gatgatttct acccaggagc 4080
agaattgctg gccacagggc atactttgag tactacaaga ctggtttcct caatgactga 4140
catcctccat tctcttccac ctactcccat ccccaaacag gacatgtaaa tttctatatt 4200
ttacctatat atcatggcag tatccagctt tattgtaggt gtaaaataca cattgctgct 4260
ttatcatgta cttttctaat aactagtgag tttggacatt ttttgatatg tttgttaact 4320
ttttaagctt cctcttctct aattgcttat tcatattctt tggtattttt taaattgggt 4380
tagctttata ttttttgact ttcaggagtt ataggttttg atctgtgatc aattttagac 4440
attgcaaata tattatgatg atattaggtc tattaatttt gtccatggta tctttcattg 4500
aacagaaatt cttcatttct tgtaattgaa ttaatctttt tttccttgtg gcttgtgctt 4560
ttgaaagttt gcctaatagt ttgctaattt aaagcttatc acctttaaca ctcctcactg 4620
gaaattcaga gatcaccacg gttctctctg ccttttaaga gataaatcta caatctgtgg 4680
ctatgcaatt cacttctcag catcaccagt tcctcatcta taaaatagaa gtttgaacca 4740
gcttacctct gaggtcccat acatatttga acaactggag ccctacagtg aaatgttttg 4800
attcagtgaa gggacaggag gatggggcaa aggtgtagag gcaggaatct acacaatgga 4860
ccaggttgga ccagagtgag gtagcaggaa tcatggcaga agaggttgtg taaactctaa 4920
aggccacctg agaagtttga gttatgttgc catgggcagc catgtggttt ctgcgtagaa 4980
gaccactgtg gtataataaa aaataaatat ttggtccttg tccccagtac ctgtcacaga 5040
gttcctaaaa ccctagaaat ttcctaagta atagaaatgt cctttgtgat tcataataaa 5100
ccccatgata atacctgggt ttatgctcct gcagacttag ggtgagatcc ctagatagct 5160
tcaagatgga accagtcacc agcaagtcca agagattagg ggatcaaaga gttggaactt 5220
tcggtaccac ccaccaaagt ccaggaaagg agggtgggac tggagattca gctctacaaa 5280
aactcttgaa caagatgtga ggagcttttt agttgctgag catttggagg ttcctggagg 5340
gtgcacaccc aggtagggca tgaaagctcc gcattgcccc catatcttgg cctatccatt 5400
cttcacttgg ctgttcatct atatcctttg tatagccttt ataataaacc aatgagtgag 5460
ttctgtaagc tgtcctagca agttaattga agccaaggag gaagtaatgg aaacccctga 5520
tttatagcca ttcagccaga agcacaacct caggcttgtg atcagcctga agtcagggca 5580
gtcttgtagg attgagccct taacctatgg gatctgacaa tgtcttgaga tagtatcaga 5640
attgaattca attataggac actcagtttg tgttttctgg agtactgttt ggtgtctggg 5700
ggaaatccca cacacatctg gtcacagaag tattctgtgt tcagtgtgac agtagaaaaa 5760
gcagtttgtg tgtatgtgtg tgtttgtgct tttgtatgct aatgagatga tctatggctg 5820
ggggccctta gatagcttcg ggaagggggc tggtcgccag aaagactaat ttatgactag 5880
agggttagaa ctttcagccc tacctctccc acttctagag ggtagatggg ctacagattg 5940
agttcggtct ccaacagcca gtgaattaat caatcatgcc ttcacgataa accttcatga 6000
agacccctaa acaaccagat tcagagagct tccaagttgg tgaacatatt gaggtgctgg 6060
gaaggtggtg cacatggagt gggcatggaa gctttgccca tccactccca tgccttgccc 6120
tgtgcatctc ttcacttgct tgttccctag tcgtatgctt tataataagc cagtaaagtg 6180
ctttccaagt tctgtgactc gttctagcca attatcaatc ccaaggatag ggtcatggga 6240
acctccaatt tataactagt tggttgggaa tacaggtggc ccaggactct cgactggtgt 6300
ctgaagtggg ggcagtcttg tgcatttgag ccttctcact tgtgggatct gatgctaagt 6360
ccaggtagag aatgtcagaa tgcatctgaa ttgttggaca ccaagttggt gtccacacag 6420
aattggggaa ttggttgatg tgaggaaaaa ccctcccagc tacataggga aatcacggtt 6480
ttggaagatg agcagaaaag ggtgggaggg agaagacaag agagaagagg gtgaggaaag 6540
aggccaggca gcagtccaag tgtgagatga taacacctga actctggtaa ttgaggagac 6600
caaatccacg ggaaccagct gcaaaaggca ttctgtctgg taggtatagg gctaagggag 6660
taaacgtggg ggtgagactg ggatcaagtc cccttcctac cacttagaat ctgtggactg 6720
ggagcaggtt atttaatctc atgaaacctc catttccaca tgtgtgaggt atgtgattgt 6780
tgtctccacc cctagggtgc tgtggggttt acaagagata gtagtgcaca taatgtgctt 6840
agcacagggc cagcccctag taagtgccgg ctgtacttgc cttctctttg caggggtggc 6900
tcaggagagg gtgtaatcaa aagatgatta tgagggttca agcaccaggc aatgggagag 6960
tcagcaggtt ttaatgcttc cacacagagg ctggattctt ctacttccgc ttaccgctga 7020
cagctgaggc ctgctacctc aagccctgaa atgcagtctg aagacttttc ctggacttac 7080
tgggcaccat tggactgtac agggctgtgg cagacaggag cagtggaaag agatttggat 7140
gggacgtcag aacacccaga ttttacttct tggctcttcc acttactagc tgtgtgaatt 7200
tggcagtggt ttccttatct ataaaatggg ggaattggag agggtggtct actgcatgtt 7260
acttctcagc acctcagctc ctccccaccc ctctctcttg cttctaatct gcctacatca 7320
cttggtttgt ctctggacaa gtctaagttt gcatgacaac tctctaattt ctccttcaca 7380
gttatttatt tattttttgg cagctatttc cttgtggaga gatagcgtag gtctttcagg 7440
tgcttccatt tggccactgt ttgcaggtga agaggtttcc aggatgctcg cgggatgtta 7500
ctctggtagt aacagaaatg atggcctggg gaatgggaag agtatgagcc cagttttgag 7560
cctgagtctt tggggctcat ttattcattc aatctgcaaa tcttcattca gcattactac 7620
atgccaggct ctctgctaag tgctgaggat agcgatcagg acctgctagg gtgaaaatta 7680
ctcttcagaa gatagcagag tggataatgg ccattcgagt tcctcttaac tacacagagg 7740
gactgggtaa gatctataaa ggagagaggt taaacagcaa agcaaaagtg actgaaattc 7800
taacacccat gcatggggat tcttgagcta aagatctggt ctgtagaaga agacacatca 7860
gaccctcata tttagcatct aagccgaaat cagcagtaag catgtttctt ctcccataaa 7920
aacttgtccc taagaagaga gacaagtgct tacaaggact cttatgttaa aggaaactgt 7980
cacaattact ttatcttcag tcatgaagcc tttcttgagg aaagaaaaat tagcaatact 8040
agtttgggaa gcttgaaaga gagattctgt ttgcaatgac acttttaaaa aggcaacaac 8100
aaccaaaaaa gtaattattc ctaaagctca ggtgtcagaa gttgtgaatg tctagggggt 8160
tgtggatccc caacctcaga gacaggaaca gcaccactcc catcctgggg ctgtggcctc 8220
aggcctctgc ttcctctttt tcagtctaaa ttccagagca gagaggtgct ggatgaccag 8280
cttcaggcag gggagggagg aagtaaaggt agtttctcag ctcatcttgt cctcgtacta 8340
ttaatagcat ctgttgtttt cttaactgaa aattggagag aaccaaacaa atcaacatca 8400
tattggcggc tttctgcaca tcactcacaa atacatctga atgctctgta ctctccagtg 8460
tttattttcc tcagccacat aatcggaatt tgaacattct tttgacactt cttttccctc 8520
acttgaataa ttagttctat atataatcca aatattagct ggagccctct agcccctcag 8580
agatatagtt gaaccttcct ggacctcaat tagttgggca gttaattctg aatgttgatg 8640
tgcctatttt gcacgtatgg agagtttgta cgtcccccct ccccaaacaa gtagaatgct 8700
gtctatttgg tacttgctca ttcatacatt ctttcaacaa ccaaacccag ttgcagctgg 8760
gttgaacccc agggtgtttc ctgctattca gtgtgatgcc acattcacct gctgtggcca 8820
acattccttg gcccaggact gcagggatgg aagccaaccc aacatcagaa gtgggcaaca 8880
atgccagaaa cctatcaacc aaaacaaatg taatttgtaa tcctccctgg atgatgttga 8940
gtgattggca actgagcccc gtcatatagt aaatcaggct gtctctcttt aaaaaagaga 9000
gacagagaga gagagagtct cccaatgttg cctaggctgg tctcaaactc ctgggctcaa 9060
gcaatcatcc cttcttgcct cccaaaatgc tgggattaca ggcatgagcc accatgccca 9120
gcctaggctg tctctttagc tcttcaatta tagggcttcc tgcaacctca gcccttaacg 9180
aatgagctag cgtgtaagtg actagctaaa taaatgtgct gcaaacctct gacaaatggc 9240
tttttgacat tattaaaatt tatttatgtg gaacaaaaaa attcttttct tctgcatcat 9300
gcatcctaca ctgcttttaa agaaactctg ctatcctttg ttttaccgtc taacatttct 9360
ttgtttccct tctgacatgt ttcagcctct tgtcatcact tggaacattt ttactgcctc 9420
atccctctag tttccatttc ctagcctagc ccagcccagt atttttgaaa cccaaatacc 9480
tggaattttg cctgatgttt aaagccaaaa ccaaatgaaa atgtgaagaa ccaaaggaga 9540
ttttattctt ctgtgactca aaatcctgtt gttattgtta aatgaaccct gtgggataca 9600
gtctggtatt tttaacacct tctgggggta tttgtcaata tttacatttg atgtggatag 9660
cttataaaag tccagtggct ttgggtaaac cctgttcact aagaataaga cccagaaggt 9720
catatgaatc aacgatcaaa gaatgaagat attctgaatc caagtttctt gttttgatta 9780
ttaaaaatac cttgcaatta gttaggcatg agagaaagct tgtgtataaa gtttggcttg 9840
taattcacat gaattctcta tacaggccaa aaactacctt ccctttggga atatacaatg 9900
accttatttg gaaattactt caggtaatta tgaagtaata atatctctat tcatatgcta 9960
aaaagaattc aaaataacat tcaattatca tctcttagaa actaaattaa acgggtactc 10020
acccaagttg aaatgcaaac aaatgcatga tatcattatg tgaccctccc ctagccctta 10080
ccctgtccat ggcagatgct aagaaaaact tgtccctaag aagagagaag tcaattgatt 10140
tagtcagcct ttcccactgt gtttggactg agcctcagaa ccctccttat ccccatactg 10200
caaggagtta ctgtcaatca actagagatg gcgctcaaga taaaattatt ttccatcctg 10260
tactaatata ataataagca tggaatctat ttattggttt tttcacaaca ataaaatata 10320
tctttagcaa gtaacaaatg gtagaggtaa tacttttaga taaagaggga agaatccttc 10380
tgacttttct ctttatcaca ttttacccac acacaatgtg gatggaatct gcaactgaat 10440
ttagcagtca cactgaaact ctgctgcatg accacactgt cctgaggcca tgtgccaact 10500
tctatgtttt tttttcccaa agctcttcca aggaagagct gccacttctc catttactct 10560
tttcacctgt aaatgaatgc tgccgttggt attgagaaaa tgggtgaatg aaagaaatca 10620
tgagaacact cctggggtga gagcaagagg aatgaaaaca cgtggtgtga gagcaggtca 10680
gagatggtgg ttaacacagg aaccaggatg gaggcctcag acttgaccct ggcctgcagg 10740
acgttggcct gcattgtctg caggaggaaa cacagaggtc agtggatatc ctgcggtctt 10800
ccattaaggc ctcttactat tctggaacat tttattttta tagagaaatc tcccacttcc 10860
tgatctcccc agttagaatt gattgatcaa agcagtccct ctccctgtta ttaaataatt 10920
aaataatatg cattagttga tccctataac cacttaaaat gtatctaaca caggagaaat 10980
ctttggtaaa tatttactga atgaatgaac aaataaataa ttggagggag ggagaaagag 11040
acagagttaa atcctgaggc tggagttttt ggtccttcag ccccttatcc cattgcccct 11100
ggcagcccca tccatgtgac cttgaattct gtccaagtct catcactttc tatgtgtgtt 11160
gtaatggaaa aaaggttaca aaacacagtc tttctggcct ggagtaccaa aatagtaaag 11220
acttctgttt tgaaagcctc tgacaaatga acacaacagc cacagtgata actttagcag 11280
gaaattgaac cacaaaaagc aagcgagcca caaagccctg aaggctaaaa atgcctcaca 11340
ggccaagagg cagttaggtt gcacctgaga cagaggctgc tttcttaact gggaaatggg 11400
tgtctccaga tatataaagc tgtgaggaga tcaccatggc ccagctctga cgagccactg 11460
gtacagcatt gttggttagt ttgcctcaac ctgcagcaga gtttttattc caagagttaa 11520
cagttagtta ggctttttcc aaaggttgag atgacacccc aactcccaca cccatttccg 11580
tcctgcccca gcttgaaacc aggcaaagtg agttaagaca ttgggagatg ggaggggaca 11640
ggaggggtag aggcctggca cccctctctc ttctgtttca agagacactt tgtggagttg 11700
tctagagaga ctgggggtaa ctgctccttc tgcccccaat ctccccttag ctccccactc 11760
ctccctatcc tgccaggcca accaccagca gagaagggcg cttagctcca gtgacgctgt 11820
gagtgaggta cttgtgagga tgcatatggt tcctgttgga aatgcacgga ggagcagtga 11880
aacaccagaa gccccatcct ctctcctgcc cgtgccctca tgcccagcca gctgaatatt 11940
cctgtggctt ccactcctgc actgtcccct cagtgtcacc tccccaccac attcccacag 12000
ccccacttgc tgtgtcacct ctcccctgag ggagaggtgt cttcctgcca gtctcttcac 12060
caccaggctc acacaactcc catccattca ctgcaggctg ccagactgag ctttccaagc 12120
acccccggac tgcctccagc cctactttct cttcaaggct tcccacttcc ctcaaaatga 12180
aagccctgac tccttggaca ccatacagac cctccctgat ctcccagcga tgctcttagc 12240
agcatcattt ttttttctgc cagattttca ggtgccctca gttccaggag caccaaatca 12300
cctggatgcc atgagctgct tatgacacag ttcaaggtgt gacatctttg aggtctctcc 12360
ataacacctc tactgcagac gtctgtctct ggtgtatcag ctgtaccttg cacacatgtc 12420
tctaaccatc tgtccactgt gtctgtctgt tcacacatca gattgaccct atgttctctg 12480
tgacaccagg tgagctctgg caccagtgcc cagcagatga ttactgaatg attaataaaa 12540
ctactcactc cacctgctgc ttttaaatgg gctccatttt ggtatgattt ggtgcgtgaa 12600
ccattaaaac ttttcagaag tcaggatttg aaacaaactt ggtaaaaagt agttaagcaa 12660
tgtcttcgtt gtaaagtgag aaaccaagtt gcttggaaac aggtgaaaag cttagccagc 12720
agcttcacaa ttcacagcct ggcactccag tagccctctc tggacaattg cccaggtttt 12780
cttgttttca tttctgtgtt ttaatttaca agttaccttt atatttctac taatacctta 12840
ctggcacttg tttgtgaatg aaacaatttt tttgagagtt tgttaattgt tctgtgtgag 12900
tgtggaaagt ttctttcata taagatcatc aattcttttt ttaaaaatag tgttttaggt 12960
tttctcctac ataaagaaag tggtgggggt gaggagctgg gggtggacca gatgctcaca 13020
ttgtatttct atgattttgc tgttggagat actgaagatc attttgctgg gctttgctct 13080
cttaaattac agaaaagtgc atttggtgaa gtttagtgtt tttagtaaac atatacaaga 13140
tgtcataggg atagaattag tgaaggaata gggttcgtga atgaatgagc tcaatcttag 13200
taatgaaatg tttggaaagc aacagaacaa gagagaaaga gcatcagaac atgatttagg 13260
taccccatac atttatacaa ataaaaaaaa gatctgattt agggtcatag aaatcacata 13320
cctaaactga tggttaaaca ataagtggca agtaacagtt tagctacaaa aaatgtacag 13380
gaacacataa ggaaagttac atgggaactt tcagagagga ccaaaaaagg tggaataaga 13440
ttatctgctc tgttatgatc ccctgtattt ctctgctcta tgttcttctc ttgggcagtc 13500
ttttcctgag atgttttcat gttattaatc aaaaataata gaaacgaata aactagatca 13560
atgcatttac aagatttttt gagtgcctgc agtatgcctc ggtgtggtgg gaacttggtg 13620
tactgaggtg agtagcatga tcctgaagag cttaggaatc actgagttct cgtaagtgca 13680
atagaagaaa aactggtaga aggaagtcaa atgaaaatac aagtttctaa agaaaataag 13740
tatttacaaa ggaatttgtc tgcattaatt tctgtggaaa agttagctat gtatagaaga 13800
aagaaacttg ttagtaattc tttcattcat tcatttattc attcaacagt gaatccctgc 13860
catggtagta ctaactactt gcactgtggt agtgcaaggg atacatttgc tgtgcaagta 13920
gttaggaatt tttttttttt tttttttttt ttttagtgct ttgtgtatgc taggctttat 13980
gcaaatatac agccaagcac agaggttaag aactttggaa tcaaaccgaa gtggttctag 14040
tcccagctcc cccttacaaa ttgtgtgatc ttcaacagtt catttaactt ctctagatct 14100
cagattttct catcagtaaa atgagtgcag taagagtacc accctcacag catgaaatca 14160
gattttctgt actctttact aactcacttt ccatgacgct ggcatgtggc aagtgctgac 14220
tcaatactgg ctcatttcct taactgcttt tattttattc attcactgcc attccagagg 14280
aaactgcggt tctgagatgt gccctgggtc actgctgagg agcggacgga gcctgggagg 14340
gagctacccc tgccttgttt cctcacactt tatggggtgg gtgtcaggaa tgaggctgga 14400
ggggaaggca aaaccagatc cctaagagcc ccctggaccc tgcttagggt gtttacttct 14460
atcataaagg ctgtgcttct gtagagacct aaggcagaga gtgccaaggt caggtttgca 14520
tttgagaaaa gtttttctgg cagcaatata aagaatagtt tagcaaaaga aatgagaata 14580
gctggaaacc cagttaggag gctgtggtag tccaggagca cactaaggca gcaacaatag 14640
gtggatggaa actggagcat ttaggtaagg gagagggatg ccagaggacg tgatggtgag 14700
ggagagggaa gagggtggga ctccaccgca ccagggcttc tggctttatt gactgggtat 14760
atggtggcat cctgggccaa gacagtgagt cacagggaag aggctttgag cctggtaagc 14820
ttgggacact tttgagacat tgggaggaaa tgacacatgg cagttgacat acagatctag 14880
acgtcaggag aaaagtcagg gctggagatt ttcggtttac agttgaaggt atgagtagga 14940
atgagatcat ttactcagtc catcccacca cattgagcac catgaataag gggctcctct 15000
ccccatgggg caagagaggt gaaaagacag tcaaagtcac tactttcagg aggattaggc 15060
tcaagttagg atggggaggt ggagaaggca tcaaacaaat aaatgaaaac tataacttca 15120
gttattgcca agtgctagga atataatgtt gctgttaaac tgcaaggtgg ctaacaagtt 15180
ggatagtctt tcttgaagaa ctgtagaaac attcattaaa aacaaaataa ttttatagtt 15240
aagcaagaaa gtgattgtac tataacaagg gagacttatt tcaccatagg aaaaaaatca 15300
taacattcta agaatagaag gaaaaaataa caattaccca agcctgcaac aacctgtaag 15360
accagatcag cttttaagag gaaagttaat tagataacag gaagaggtaa aaatgagatc 15420
atcagagatg ggggaaaagg caaatttaga aattctgaat ttaatagggg agacaagtat 15480
gtaggaatat aaacacattg tgctttctct gcacaaacac attcacatga ggatgtatac 15540
acaacccttg caggcacagg tactcagtga atgcctgcgt gccaaggagg gggacagcac 15600
tgagcaggag aggtggcagg gcctggtggc ccaggagcac tcatatagtc aggagactga 15660
tataaaagaa agatcctgct aacatgccat tacagagtat ggaatgagca aggaaggaag 15720
agggtgaggc tgtaagagag tgcacaggcc ctcatttaaa tggacgggtc accagccagt 15780
acttcatgcc tgttcctaga tgacccacaa agtgaaacga aactggatgt ggtatttagc 15840
aagtctgaga tttgctggtt taagtactaa gttagcaaac cagccttggc agccagagca 15900
ggtagggtag ggggcccaag gagatgggtg tatccaggag gccaatggag gaaggagagg 15960
aaggttttct gggaagccag ctgacatgtc atgctttgtc agtggggcct ctgcatggat 16020
tccctccact cccccccagc ccccaactca ctacggggag ctgcaaaatg acagacaatg 16080
ggtggttttg tacccagcag ccactcctca accagtgagg aatgggagag agataagaag 16140
gtgggaaaca aggattgctt gtcaccatcc ctgctcctcc tgaagccctg cgctgggagc 16200
aggtacagaa cccgctgaag gtcacctaat catttcacca tggagttctt ccccagatgt 16260
gaaactgctg ggactggagc aaagggtgcc cagagctgac tggaggagga agccagagcc 16320
acactggaac atatggagat gctagctcct gcctgctatt gaggaacttc attttcccag 16380
aagaattaga ggaccacagg agtgcttggg aagaagcaat tgtggcaacc taggtgaatg 16440
atgacctagg aggtgaaatg aggaaggagg cctttggtct ctgccaacca tacatttcag 16500
gttgcctttt gctaaagtcc tgctgatgtt aatcattctc tccagcagat cgcggcttga 16560
atcatggggt ctgaaatgtg ggtttatgag agaaaaacaa catggaaaaa ttgggcagag 16620
gctggtaaat gtgaacattc aaacctgccc ccaatctgag atggtgggac agtggacccc 16680
tggccagctc tccaggtgtt gcgtatgcac attggcaaac cacctttatt aggtggtgtg 16740
acacctgggt agcaattctc aagcaatgtg atgatctgaa tgtccctggg cctgacccag 16800
gtgaggagga cacaaaatca tgcaatatgt ctcatagagc ttagtcagtt ttgagaaata 16860
ctggactaat gttatgtttt taaagcacct aattcttttt ctcttttctg ttttatctag 16920
gcactgggaa atcatgaatt tgataatggt gtggaaggac tgatcgagcc actcctcaaa 16980
gaggccaaat ttccaattct gagtgcaaac attaaagcaa aggggccact agcatctcaa 17040
atatcaggac tttatttgcc atataaagtt cttcctgttg gtgatgaagt tgtgggaatc 17100
gttggataca cttccaaaga aacccctttt ctctcaaatc caggtatttt ctacttttat 17160
agcactcaat gcttgaaaat agatgcccta aatcacagct tggcattata tttatggact 17220
gtagataaaa gtaggaacaa ctgggtaaat agtgcactag aataacactg ataaaagcaa 17280
tctggtaaat tctatgctgt tgttccaagg tacttcttaa agagatgtcc tcttctgctc 17340
agtgaacttc tttgttgaca aaaacagtat ctttgtctaa ttcagtgctt tcaaaatggg 17400
aagaggagac aatgtatagc cagtgctaaa cagtacagtt tcatttacta gagaatgaga 17460
gcctcaaaac tcactgccct tctcctgtgg agccatgagc acagagccag agttttacct 17520
caatttacct tattgctcct tttctaggat gattattaca tgtccagcta actcaccttt 17580
ctcgcttttc attattcatc tcgttttaat gacattcttg tatctgactt tctgggaaat 17640
atatatagtg gtcttatgat atatatggat atatataagt gtatgtatat acatttatag 17700
acatatatac acatatgtat atataatata tataatctta agacacacac atacatatat 17760
gacacatata catacataca tataagacac acacacatat ataatcttaa gacccttcaa 17820
gaaactattt tgaggcactt ttataagaaa actttttgga atattttggc aatgagaaca 17880
aaaatgacat tttctttaaa ttatccaaat taaatgtgtt tccatatttg aagaaaaaaa 17940
gggaaaatag gagccagggt tatctagaaa catgagtttt cattaagaac taactgataa 18000
aggacatata attgacaatt caaaaatata tatttactgt tgttgttgtt ataaaataaa 18060
taagtccagg ccattggaaa caacaacaga aaagtcagct gtgtttgctg agagcagcaa 18120
gcgggtcctg cagctggcat agaggcagag ctggaggcag ggtggagcgg ctggggctgg 18180
agtggtcagt gagaagagtg agagaacttg ctaaagaacc ccccaacccc accctggggt 18240
caggtttact tgctgaagaa cctccttggt ccccttgaga gctgtgtaaa acatccctgg 18300
ggcacaggac attccagaag tgaggtcaga gtcacaagct aggatgcagg ctccatgagg 18360
acagaaacct ctgctgtctc actattgccc aatgactgtg cacagtgcct ggcatatgat 18420
aagtcctcag gaagcattgc tgggatgaat acatgagagc aggaaggtat aggggccaca 18480
gtcacagaac agcagatagt tccatttcct ggagcatggg ttctgactag gaaaagctag 18540
aaagttggat cagggctaga ttgtagggct gagaagttca ggtgtgattt ggaggaaagc 18600
attgtctctt ctttaaaagg cacccataag gaggtaggag gtgcaggtct tggtggaaag 18660
gcacagcagc tccagaaatc aaactgcccc agctgggact ctgaccagcc ctttaccagg 18720
ctgtaacaga gaggggatca aaagtgacac tgctgtttcc agcagctgct ggaagagagg 18780
gctgcatttt gcagggccca gtgtgggggg aaagggatgg ggtggaatag aatgtttcca 18840
ggagcactgc ttcttccaag actacaggcc cagcggcctg gtgtgggttc ttcctacagt 18900
tctttgaccc taatcaagcc cagttctccc agtcctagtg gcctgggcac ccacagggtt 18960
ataacctgta tttctagcac atttaatcta atttatttta gctagatcat taaatacctg 19020
aaatacaact ctcctcatga catcttgaaa tataagccat gctttgtata tatgggatgc 19080
ctggtcctgt gaggtttctg atataataga ggaatacatt cctttgtcct cagaggcatt 19140
tctctgaatt ttttttcctt cttggcagcc ataccttaag caaaagatgg tagcatactc 19200
tcgacattta gtcattcccc tgggtacctg tctgtgtgcc aggcactgtt ttaggcacca 19260
gggatactga ccaaaaaaaa aaaagttgcc cactggagct tacatattca gtgggggaat 19320
caataaatga gtaaatggtt tggtatgtca gaaggtgcta aggctgtgga gaagaattaa 19380
gcaggagggg aatggcaagt gcagcagggg ggaaggtata ccttgttgtt tcaaagagat 19440
gctcaaggaa aggctcacat agaaggcaac atagcactga gggtcaacgg ggagccctgg 19500
gcttcttggg gcaaccaaca ggaagcagat tgtccttgat gttctcataa tagacaacag 19560
tggcaaaact acaactcttg atttttcaac attgcctatg agttacagac agccttgcct 19620
gtccaacaca tctgtgagct gctaattatt aggttgtcat tctggggcct ccctggctat 19680
gatgtcagcc atgaattatc ttccatggta atttgcactt gtggagttgg ctaatgttaa 19740
ggattcagga gttcccctgc tgaagcacac agacattcct aaacctcaca gtggaacaag 19800
ggatgcattt ctaataatct tgttactctt tggattagta cttttcaaac tggagtttgc 19860
agactcctgc attttgaaag cttttaatga aatgtcattc tcccaccctc cttcccttgt 19920
ttagtgttag ttcttagaaa tcagtgtaag aacgtctgga tctactggag gactccctag 19980
tcacagagta ccagacatga cttcagggca gcatgtctgt atttgcagtc atcttcatgc 20040
cagtgatagc cataaaacct caaagcttcc tcaaatgtgg ctaacagagc cagttggagg 20100
aaccagttgc cttgctgcac acagagaacc tgggttgcac agtgagagcc atgcttgact 20160
actgataaca aaaggctatc gtttgttacc actatgtgca gtaataagca cccagggaag 20220
aatgtacaca atcccagtgc cagcaacatt tagatgaggc cctcatgatt aacacagagc 20280
tcaataatgc tctaatgctg gcattattta tatatttata tattttttga ggcagagtct 20340
tgctcttgtt gcccaggctg gagtgcagtg gtatggtctc ggctcattgc agcctccgcc 20400
tggcaggttc aagacattct cctgcctcag ccttccaagt acctggcatg tgcctccacg 20460
cctgaataat ttttgtattt ttagtagaga cagggtttcg ccattttgcc tgggctggtc 20520
tcaaactcct ggcatcaagt gatctgcctg ctgcagcctc ccaaagtact gagattacaa 20580
gcgtgagcca ctgtgcctgg cctgctggca ttaaagagaa catgacaaag ttatcaactg 20640
ccatattaaa ataggcattc aaggactact tgctgacagt tttaagtttg aacactgatt 20700
atcccaagtt tccaacgaac gtcttggagt tgctaatact cttttgtgta tcatgcttac 20760
gtgcttaggc attttctgca ttggtattgt ggaagttgag atgtacaaat atataaaaca 20820
atatctgaag cctcctcaca gtttgcaacc aagttcaagc aaaacaatat ccacaacctt 20880
aaactaatac aacataaagt gaatgtaatg aggatattga tttgaatttt tatttacttc 20940
ttaggggttt tgtatttaat gaataaatat gttttggttt tactgactct tgagctttag 21000
taaattaagg tgtttaacct ttgcatgtta atatgtatat taagtaatat aataaaaatt 21060
gttaataaat atccatttta tttattttgt tccttaggga caaatttagt gtttgaagat 21120
gaaatcactg cattacaacc tgaagtagat aagttaaaaa ctctaaatgt gaacaaaatt 21180
attgcactgg gacattcggg ttttgaaatg gataaactca tcgctcagaa agtgaggggt 21240
gtggacgtcg tggtgggagg acactccaac acatttcttt acacaggtaa ttgtttcaaa 21300
aggattgcat gggccaggat gtccagataa gcactgtgtc tcttttgcct ttgtaactgt 21360
tattactctt tttactgcta tttaatatgt aatgtatatt atatgatcta taatatatat 21420
gtaatataca ttaaatggga acatgtgcaa atcttacaaa cattactaag aatttattga 21480
tttcatgatt ataagtcaat tccatatatt tatgtatata ataaatatat aagacataaa 21540
atatgtataa aacatatata taccatacat atgaaaaaca aaaagactat tgagccccaa 21600
ccaagtcatt tatgatagat tcttatcaaa aagcaacatg tcatataaaa actttaagca 21660
taaacaaaat aatgtaaata aggaccttta taactcaagg tggatttgat ttgtaggttg 21720
cacaggtagc agaggctgga cctcgagtgt gagtcctaac tctgctgcag actaggaata 21780
taagtatcct ggaagccaca tgaccctggg tcctcaggag agcatgcact cttcctaccc 21840
actacctgac tacccccacc actctctttt tttctctctc aaattggtgt taagtagttt 21900
tcatcagaat aactctttta tgatttttcc actttaaacc tataatgtcc caaaactttt 21960
gaatctggtt ccacctttcc cacttccccc accttgcact gcatagatct ttgaagtatt 22020
cccaaggaaa ggaagaagag gctggtaaac agttcttcta tcagcattcg caaagaaggc 22080
tttggcgtgg gaacagccct ccccttgaag aggctcctgc ttctaccatc cttatgtggg 22140
ctgtcaaaga ccatgctatt ttgataaaag cgaaactcaa aataccccaa acaataagta 22200
ggaaataccc aaaaacatta aaagggatta tcactaagaa agtaattttt attttctttc 22260
tagtactttg gagtattttc aaataatgaa tgcatgttac ttttgcaatc ggaaagcaaa 22320
atataaatta taaaaaggaa agaaacacaa tattgggtca aaagtcccca ttcctaattc 22380
acttgaatga ctttgagcag gtcacttagc acctctgaat ctgctagctt agcttcaaaa 22440
taggggatga tcatgctcaa agttccacgt tactgtgaaa attatatgag atgtaaaaat 22500
atgtgaaagc agtttataaa ctataaatag ctatattctg gtatgcagcg tcatggaggt 22560
tgcctgctta taaaaatatg tcagtcaaaa tcataactca caaactaaag ttggccttca 22620
cttaagaaaa taccagtatg aatgtgtgct aattaaattc cttcacagat attttacttc 22680
ttggtaaaca caggctaagg aataatttca aactgagtta aactgacagg ttgtgaaatc 22740
aatttatggg ttatgaccag tagtagggct tttttttttg acagataaga ctaaaataga 22800
ttaggcactt acattactca taagaatgat aaatattgtt ccatgggaaa agatgtatat 22860
aagatgtcaa gaacagcccc aggcacatag cagacaatcc ataaatgtga tgaggatgac 22920
gactagcact ggacattgtt agcatggata acattgcaat caaagagttt caacaactcc 22980
atgagggtta aatgaactaa gagagccaag acgaggaaat aattggagca ggtcactggg 23040
aagcactggg taggggtgaa gactcaacaa agtccttctg actctaccac taatggtggg 23100
gtcctttcaa tgcctgatta ccctcatctg tggaatgaag gtgataacaa cagtacctat 23160
tataaggggt cactgtgatt taatatgtgt aatgaatgta taggtaaatg gccttagcac 23220
agggcctagt atgttagctg ttcttatttt agttatgtgc tggctcactg gataagacct 23280
gtgaaaaccc aaaccttagc ttttcactga ggcagtgtca gcacccttgg acacacagtg 23340
ggagctagat atgtctcctc ccccttcttc ctttagaata atggtttcaa aaagtttaaa 23400
ctatgacccg tagtaagaag aacattttag acagggagcc aaaacacaca aacatatatg 23460
tgtgtgtgcg ttcacatacg aacatgcaaa agcacttgta tgtgcaaaca catacaagtg 23520
aaagaaaact ttcacaaagc acatggaatg caccccaact tttctgtttt actgttttat 23580
tcacttaatg ccagctacca ctcattgaat tgattttctt gtactgtagt taatccatgt 23640
ttctaaaaac tcagctttaa atgggacgtg accaaaataa tgcatgaaca ctcaaaggag 23700
tttcatccct tcattcagat gtgaacattc ctgtgttccc cttgacttgt ctttacacca 23760
tcttaatcaa tgctgtctta cagcaacaat tagtcattca tagcctgtgg tttgtgtcca 23820
tcagtgagtg tgccctgata gacctgaaaa gattgggaga ccatgcccag cttcccatta 23880
tttctgtagc ctgtcctttg ttcccataac agtacagtca tgccttagta tctgtggggg 23940
attggttcta gaactccctg tggataccaa aatccatgga tgctcaagtc ccttatataa 24000
atggtgtagc atttgcatat gacctgtgca catcctcgca tatattttaa gtcatctcta 24060
gattacttat aatacttgat acaatgtaaa tgatatgtaa atagttgtta tactgtattg 24120
tttagggagc aaagacaaga aaaaatatat gttcagtaca gataaaacca tcctttttta 24180
gaaaataact ttttgttcat ggctggctga atccaacaat gcaaaaccca cagttagaga 24240
cagccaactg tatgtgcctt tgctaagtag ataaatttag ttaatgatga aggactggat 24300
gaaagaaaga tttgaggata aaaattagag tttgcttatc tgtaaatcct gaagtaaata 24360
attacagtct ctgagaaaac tgagataact aatacttagt catttacttc acatttcatg 24420
attgcttacc aagattaaac ttgttctgat gaaaatcttc acttttcata ctcatcgttt 24480
tgatgagtga acttaacaat taaaggtggc tgggcacagc agctcatgcc tataatccca 24540
gaattttggg aggctgaggc agaagaattg cttgagccca ggagttcgag accggcctgg 24600
gcaagatagg gagaccccca tatctacaaa aaaaattttt aaaaattagc caagtgtagg 24660
ggcatatgcc tgtagtccca cctactcagg aggctgaggt gggagaattg cttgaatcgg 24720
ggaggttgag gatgcagtga gctatgatca ctccactgaa ctccagtctg agtgacagag 24780
tgaaaaccct gtctcaacaa cagcaacaac agcaaaaaca ttaaaggtaa ttccaggaca 24840
caatagggat gaggcatatt gcctgtttaa ataaactttc ctggccttaa gaaaacaact 24900
ctgctgaata attttatgga attagaatga cattaaaata atgaatctga taaatatgat 24960
tagttaataa aaatacagtt aaaataatac atcccttgga attttaaatc ttactgagaa 25020
ccatttctgc acgcacagta tccaatatac aaacacacat ggaagaaaac ctttccaaag 25080
ctcgtggatt gcaccctcat attttctgtt ctgttatttg agtttttaaa tgctggttac 25140
cacccattat attaatttta ttgtattgta gcttgctcat atttttttaa aactcagctt 25200
caaataaggc atgactaaaa cagtgcatga atggatccct tccaaatgtt gaaagttatt 25260
ccagtttctt tcctttccaa gtaccatgca tatgcagcta accctagcat tggctgacca 25320
gtctaacatc agagtgaaga gaagggaact tccctctaca aggcttacaa aatgataaag 25380
ttacaaagat aagagctaaa atgcaaacgc tttctctaaa gaagttaaat cctagagtca 25440
cgaactggtc catgttgctt cctttagaga aaaaagtagt aaaatcagtg caggttgaaa 25500
cctgcagcag gtacaacagg ggactctatt ctctgtgctg tccttgaagg gactccaaag 25560
atgctgtttc tctgtttgcc tcccagtatt ttaagtgagt tttggattct gcaaatcaat 25620
ctggctagct atggccttgg gtgaatataa gcaccagttt ctatagccgc aagttaaagg 25680
gatttgaatt gttaattctg tcatccctcc cattctaaca tcctgttacc tagaaaatca 25740
tccatacata gagcctgaaa gcttactttt caggatttgg ggaaataaaa aaaaatttgg 25800
tagaatacct catactttca ttaggctact gatatgaact taattattcc tctaactcct 25860
ttgaggctca gaccttctgt cccctgtctt atcactctca tctaatcacc tccaagcaat 25920
gtcccacgtc actgagaatt tagtatccaa tttcctatct tctgctccac tccagctcct 25980
gccttcacct ggcagagtgt ccatgtgggt agttgatcca ataccccaag ttcagagtcc 26040
cttgatctcc tcaatccagt gactgacacc tccactccat ttcagcaact tcacttctgg 26100
ggccacaggc tgagacttga ttatctccag aatttctcca tgtccaaaac tgaaatggga 26160
tagttccctt gaccccttcg tgggactcat gaaggggttg tcttgtttac tcatcctgca 26220
gctctcaact cctcatggga gggagagcac ataggtgagt gggtgcagag ggtgggacaa 26280
gtgcttctgg gtgctggcag gagtagaact ctgtgcggcc ccgcagcagc atctaggtga 26340
gtatctgtga cccctggagc cgcagagggc atgtgttaca gtgtgctctt ttagctttgc 26400
tgtccatgga cagcttaagt gttttaacag ctcagtgtga cagccctctg tatcctgaac 26460
tcttgttcag caaccaggaa ggatcagctt gcacgaacaa attgaagatg gtgaatgtgg 26520
agaactttat tgctgatgaa aatggctctc agtgggatgg ggagctggaa aggggatgga 26580
gtgggaaggt gtctccccct ggagttaggc catccccagc tgaactcttc tccgaggtcc 26640
taccgtcaag ccatccctct gaagtcaagc tgcttttcac cagtgtcaag cttcttctct 26700
tctctccttc tctgccattc cactgccagt ggatcctggg gtttttatgg ggacaggatg 26760
gggggcaggg cgggccaggg tgattttgga aaaggcaaca ttcaggcagc aaaatgggga 26820
tataaagttc tcactttggg ccacgggtcc aggcttgagg gtggagccat tgccagggac 26880
cccacccttt tctacctagt atttccctgc ctccactctc caactctctt atcacaggcc 26940
tcctatccat aaatctttct cactccctta ttttttttag agctcacctt cagacttatg 27000
aaagctcaag tcttttgaat tcctctttcc ctcccagtcc agcaggcttc acttctctcc 27060
ttgctagcct ggaagtgtcc attacaccct tgactgtgtc gcggctgaca catggaaggc 27120
aggtgtctgg tattgttaaa tgaacgttaa caagtgcaac ttctttaaaa agtaagattt 27180
ctttatttga ttgatcagtt ttttttttaa ttaagtatct aatttccatt gcttctggat 27240
ttaggcacac atattgaata taaaattcaa aacttttcat gtatctaaag aaataatttt 27300
aatgaccaca gaattttgct cttaaaacaa atggagcttg caatataatt ataactaaga 27360
cagactacaa gcacctgctg tgttttcttc atgctgccat caggctggtg catagtctga 27420
tgggatggat gcatccctgc tgtaatagga ataagtaaag agggtgggag actgttgttg 27480
gtgtttacaa tcattattca tctctccatt aattttcatc cttgcagttg agataatctt 27540
gccttttaga acgagaaaac aatcttatta cttacaacat tcacaagaca gggaaacact 27600
cctcaacaaa gaaccatttc ctttgggcat ttgataagaa ctgctgacat ctgtgtttat 27660
taagtgatgt gggagaatca gccaaggggc agaggtgtgg gtggtagctt caaaggtaat 27720
ttcctgactg agtgtggttg ggcatgttgg caatcccagt gtttgcaatt ccagtgcttt 27780
gggaggctga ggcaggagca tcacttgagc ccaggagttt gaggctgcag tgagctatga 27840
ttacgccact gcactccagc cagtgtgaca gagagagacc ttgtctcaaa aaaaaaaaaa 27900
aaaaggattt tgctttcctg gtctgtttga atgtggtaag cacaagaact tatttgtact 27960
ggtggtcacc cacttgggtc ctttcagtaa acttctggcc atggcccagc ctgagcacct 28020
gcttccttta cagagcaaag ccagagcctt ctggtttgta atttttgcag ccacataaca 28080
ctgatccatt ttagctgtct catgaaactg ctttccaaga tgaagtgaga ctatccgaaa 28140
ccaaaatcta aaaagaatga ctacagaacg aatggcagtc caaggtcatt tggcagggta 28200
gaattataag gaaaggtgat ggccagctac tgtgaagtct taagcccact cataggtacg 28260
aatctcttac caggactgaa gtttgtgttt ccttaacttt gctgcttgat ggtcttctgc 28320
aaaggatctc atttctccag ctttcactgg gaagtggcat ggggttttcg aaagtgcttt 28380
aaactctaag aactggtttc ctctcaggct tagccaccac tagctgtctg actttggaca 28440
agttactcaa ccactctgag catcagcttt ctttttttaa aatggtgaca gtgatatggc 28500
taccttagag agttgtggag attaagtgag ataatatatg aacaaaatat ataaatatgt 28560
ctatattaga catatatgat ctgttaaata tatttatata tatatataga aagaatatta 28620
gagaactgac atatttatct tagacctact atgttctagg gcccagttct gctcatgtat 28680
tatctctcta tatatatagt attgagagag agaaagagag agagagctag agaaatagaa 28740
accagtagga tatatcctat ctgaagaact ctaatacagg tatttcactg aacaaaatgg 28800
cttctcctta aaatgtattc caacaccaat ttcccctcct tgcttctcat acatattaat 28860
tcacagaaat tatattggtc ataaggcggt aggactatga ataatttttt tcttctgcat 28920
ttgccaaata ttctgtaatg ctagttttat ttttataaca aatttttaaa gtccctgtaa 28980
accctataaa attaactact aatcctaaac ttgaactgta atacatattt ttgttaaata 29040
aaaaaaaaga tatcctaaat ttgagaatct aagctaaatt cacaaaatga ctaaatgtat 29100
cttcttactc ttaaacgata agaaaccagg aatgcagtgg tttaacccaa acagcctagg 29160
tattcttcac tcatttctct tgtcctcagt atatgtttcc cccaggccaa gaaacattta 29220
tggctaggaa ttcatgactg ctcatctgta agacactctc caagtactct gtcatgattt 29280
ggaaccagag ctttggcctt ccattccaga tgacctccag gttctaagct ttatctgaag 29340
tactccacac tacattgcac accagcatga tgggaaggaa tgactcaagg gaactgggca 29400
ttcagaccac tcaccaacta cacacagcct ttcacacagc agtagacaac ctttgtagga 29460
agacctctgg ccaaatattt atatatcagg agtatatcaa tgtatggtca cagttacctc 29520
ctttttaatc tcaagaaata aagtcattac ttctgaaaag aacaccagta atggaactac 29580
acgggcatgc caagatgaaa attagtctct ctctctctct ccaccccatt cttttggcca 29640
ctgcatttta aacaggccat agtgatagaa ccagaaggta gctgtgtttc gattattttt 29700
gttttcttat ttttaaacat tgctttatag aagcctgttc tataatgtaa gaagtgaata 29760
ggaatcctag tatctggaat gatgctgagc catggagatg gccagattcc atgtgaagag 29820
aagaggctgg cagaaagcta agtatttgct cgctttccct accctgtatt catggagcag 29880
ccattacaaa agatgacgca ccagaccatg ggcctggtct tgctaggtgg tggagatcca 29940
aaatgctggg cactcactgg agctccgtga agatctgttg aatgagtgac taaatgtgtg 30000
gtgaactgtc cctacctaca gggttccctg ctttgcccag cctatcaggc ttgggtacaa 30060
gcagatagcc tggattggat ttcatgaggg ctttgtgccc actctgtata ctaccggaag 30120
atgcataatc tctttagact tcctgggtca tcactgaaag gccagaggca agaatttgag 30180
cagtaaattg aagcatgtgc agagtgacat agtctgggca tagataacag tacctgaccc 30240
ctcaaattcc ctaaagttga catgacagct tgtgcatcat ggtttggtac agagatgact 30300
gaagagacag tggttgctcc tctggttgct ggacaccttt tgaatgggct gtgatgactg 30360
gagagcgtgc tctgtcctgg tcacctgaag gtgctcctag actagcagaa gcagagacag 30420
cagccaggcc tgactggaga gctggaggga ctgggtgcag gggacttcta ggccagcagt 30480
ttctgtgctg tgctcggagg ggctctggtg gcaggaacct agtcctgggt tccagctcta 30540
ccagggtgca gcctgaggaa ctcagctttc ctatttattg gcttgccttg ccacaaagga 30600
tttcttgtgt gttaaagata ctaatagctt aaacaaacaa acaaacaaac aaacaaacta 30660
ttgatttaaa ccactaatca aatctttggt tttacaaatt tagaaaataa aattaggccc 30720
cagggactaa atgtctctcc tatggtccag aaaccagact gaaactgacc tagtagcaaa 30780
acccattctc ctgtatccca actcccagtt caggttcctc ttcattagct cccacccagc 30840
tgcatagttg aggacactga attgagggga cctggggtaa agggctggca ctcaggcatc 30900
gctggcttct acatgattcc accacatgga aagcatcata atcatgtatg atccctagaa 30960
aagtgtttgc taaactcctt aagttgtcat tcaacaaaca tttatctagc acctaccaca 31020
tgctaggcaa attctagaga cacagctgcc cctgccctca tggaactaaa tgtcttgaaa 31080
gagatactgg ccctgatcag attttttaaa aatacataat tatgcaaata aatcatatta 31140
tatataatta caaagcatgg tatgaaagga aaagtacaag atgctgtgaa agcgtataaa 31200
aggaggactt ggccaagtct agaggatcat cagtgagtgt tccctgatgg actaacatta 31260
gggccaagat cagaaagatg tccaggagtt aacatggtca ataatcactg ttaatcttta 31320
ttaagcattt cccttgagtc gagtcaggct tcacttgctg cattaacttc tttaatctcc 31380
acaataactc tgtaaggtag gtgcttttgc tttttttcca ctttacagat gaataaacaa 31440
gaagtaaaaa gatgtaacta attcacatgt agccctctgg cttcagaagc tctttactcc 31500
ataaggttcc tgggcacgtg ggtgtggtct aggaatggga agaggctaag aatgctgaag 31560
ggcagactga atgttccaga tgaagctggc gtgataagaa agggccacat catgtgggtc 31620
ttataggtta tgatattcat gttaaaactg ttaatgtttt tgaaatatct aaaaccagga 31680
agtgcatgat catatttatg ttttagaaag atcactcagg ctgctgtgtg gagactggag 31740
tagaaagaag catagaagag acctacagag actaggaagc tgtcacagta acctaggcac 31800
aagatagtag tggtcaagac tagcttgctt gaggaagtgt agagaagtag acagataaaa 31860
ggagcatcca aggagtgtac acttctaagt aatttaaaag aaagagtcaa atcccttaga 31920
agcaatcatt tgaaataacc taattaccaa tcctttttat ccagtcctct tatgttgtcc 31980
caattcagga attcttaaac cagttcttgt gttaaaatca tagcatggac tgccttacaa 32040
cattctatac ttcagtccat tgaaaaccac atcaggccag gtgcgtggct catgcctata 32100
agcccagcac ttttggaggc caaggcgaga ggactgcttg agcccaggaa ttggagacca 32160
ctgaactcta gcctgggcaa cagaatgaga ccctgttgaa agagagaaaa ggaagaaaga 32220
aagaaaaaag aggaaggaag gagggaaaaa aggaaaaaag ggagggaggg aaggtggaag 32280
aaagaaaaaa aaagaaagag aaagaaaagg ctgcttcctg gcacttctat taccatgtga 32340
cacgagaagg gcaattgtcc agacccacac ttaggttcac cacagtcctt ggttcccagg 32400
acagagtcct ttgaggttcc ttcaagcacc cacaccatgc tggaaaatgc aggctttata 32460
aactccagat gtcagtccac aactccacag ccccattggg tttggttcct ggatgaattc 32520
tcggctagat gactggaata gagttctggt gaacatcttc cttcctgcca agatgacaag 32580
tcaggtttct aaagacagat gtgtagctct ttgtggaaaa aatattctgg gcacgaacat 32640
cttcggctga acggcatgga atggtgtgga aaatgtgaag ccctggactg gtttctgtgt 32700
cagtggtttc ctgggactgc cgagttgaag gggagcagaa agggctcagc tgtggaggag 32760
gcagagatta gccaaactag accagctatc gccccagctg catggatagc atcttccttc 32820
ttgttcctga ttaagctcat ttaatatgcc aatgacaaaa ggatcatgaa ggtctttcca 32880
aatgaatgaa tgttttctgg agctgagatt gctggctcct atttgcatag tagtgcatac 32940
tgaggcatct gaccggatta ctggattcct aacttttaca ttctttttaa gctcagatat 33000
ttctacggaa ggcagaaaga taaaggccaa aaaaagatca ggcttgccta gctcttgggt 33060
ttccaggaaa ctgacagact cctgatatct gcaactagtg ctttctctcc ctgggcgggg 33120
cgttctccgc caccttgtgg acatttgggg aactgcaccg ccccagcatc tatgatcaga 33180
ggaactgaca gaggctgcca taactacctg gtcccctgaa ataaaaacca ggcaccaggt 33240
ttgcccaaag gaggaccaac acatagcaga atgaggctga gtaggatgtt aaccttccat 33300
gctggtcttc cattatcacg ttatctgcca ctgtcacaga caacatctgc atccgctaca 33360
gatggagatg gaacaaaggc tttcactgga tctcaagttg gatgcttctg taaaagtaaa 33420
gttccaagac ctttttcttt tttttcccca gtgacagcaa gaaaattgga tgttactatt 33480
ggcaacaact atttcctgct tgtgaagcgt tggtatttgc agtgcttctg ccctgtggtt 33540
attgagctga aaggggggct gctcttgaca taggagcatg gcaggattgg attcttcata 33600
acatgctacc cagcttgtct tatgtgaaaa gagaagccca gaaagagctt gggacacaaa 33660
gatgcaaatc ctgggtttgc atttgatttg ttaagtgtcc tttcacaacc agcttgcaga 33720
ctttagggga aaaaaaatac atggtaggct tgtgggtagt ggagaaaaca taagctccag 33780
tacctcacag cctgggtttt cagtctgcct cctcccaacc ctcatcacat accagctgtg 33840
tgaccgtggg tactctgtct atgacaatca attttcttgt ctgcagagta cagataatag 33900
aggacctgcc ccatgggaag cttggaggac taaatgagat taagcccagg gcctggctca 33960
tggccgattc tcgtaacagg tccatgtgtg taatcaacat gtgctgaagt tctctggacc 34020
ctctgtaaat atctaatagg aacgactagc catgggtgtg atgagggaga tgctagaaac 34080
agtaatgggg tcaaatgaat ctgtcccata gtctctaatt tgaaacagct gaactgtgcc 34140
attgtctcct tggaggtaag atataagcca cagagtgcct gaagcaggtc tacagttaag 34200
gaacttaaat gtgctaagtg attaaaggct gggctgccta ctccctgtat ggcgacactc 34260
tccaccctcc tgcatactct ccactactcc aggcagtggt gggcctccta gcctctgggt 34320
gatgagcagc tcccacatct cccctttgag acagaggcct gcagaacctg accagtgtga 34380
gggctggctg gggtcaggag tgtgcctgct ggttggccct caaggtgaat gtggctcttc 34440
taaaagacac ctgaagggat gcagaattta gcagcaaacg ccaagaacag ctctttgtca 34500
tctctgcatc ttctactaaa gattttaagc tccgaagtat atgtctgggg tttctagaga 34560
ttgctattcc cttataaggg aaaaaataca cccttattgt atatttattg agcacctaca 34620
atgtgccagg caccccagct agggagttta gatatttaat cctctcagct gccctgtaaa 34680
atagggattg tttaaaatgg atttcgccca ccttataggc aaggaaactg aactacagta 34740
tggtaaagtg gcttggctaa tccacctggc tagtgagaat ggagccacag tgaacctgga 34800
ggatactctt cacggcgttc catctttgtg tgccccacag cacaagcaca atgcccgggt 34860
cttaggaaga gcttggtaaa tgactgctga gttgagaatg gtaggattca tgtaccaaaa 34920
gggcatttgg aaatgggcaa atgcccagat agggctctga aagactagct atgtagagca 34980
acagatggca aatcatcaat ttaaaactct gtcatccagc cagtagcccg ctggcctaca 35040
gccagccatg tatgtacaag ggctgacttg atcagctgtg gctcttttat ttcaggcaat 35100
ccaccttcca aagaggtgcc tgctgggaag tacccattca tagtcacttc tgatgatggg 35160
cggaaggttc ctgtagtcca ggcctatgct tttggcaaat acctaggcta tctgaagatc 35220
gagtttgatg aaagaggaaa cgtcatctct tcccatggaa atcccattct tctaaacagc 35280
agcattcctg aaggtaagtg aagttcaggg gaatgttcca ccaatctaaa atttagatgg 35340
ctggatattt tgctccttcc catttttttc ctttaatgtt tcaggaaaag actataatac 35400
tgttgaagaa tttagtttct tccttgagtt tgccctcttc atggatagat ttaaaacaca 35460
cccaaatgcc aaaagcccca cttgcttctt caagtcttct cagcatcaga ttccatacat 35520
cactgcttcc tctgcctttt aaaatcttga tcctgacatg ccaaagccac tctcagcttg 35580
tgtttacagg agagataact gaactaaagg ggttctaatt gggcagagcg gggctcaagt 35640
cctggccccc caactagagg cagtgtgcct taggcaagtt gcccaatctc atgggcctca 35700
gtttctttat caaatgaggg agtaccttat ttatatgtct gagtctattg cgggatctgg 35760
ccagcagccc gcaatgcaat ggggctctct ctttgttcct aggcagattg gcaggttgag 35820
aaataataga cacacacaag atagtgaaag ctgggtccag gggggtcact gccttctggt 35880
cctgcggtgc caacaatgca ctggatatac cagcatttat tattaagttt agtgagggca 35940
ggggtacgtt agtgagggat ttagggtcat ttgattatga ggtgagatgg tcacatgggg 36000
atgaagtaat tctttaacat aacatttgta tgtagaagta cagtacattt gtatgtagaa 36060
gtacagtatg agataagaat ttacaatata gtgtgtgcgt cagtaatttc taacagagcc 36120
ttagaacaga aacacagtct ttccataacc tatgattagc aagatattaa tcagcagtaa 36180
caattgcaac aaaaactggt tacaaataat ccatggaaac aggacatgaa gctagataac 36240
cagttagacc agaaattctc agaggggagt acgccttaac cctaaagagg cctagaagag 36300
cggcggcaag atgagcgtgt ttatagccct atcttatcca tatggacagg ccccccccat 36360
gcgtccgttt ataggctccc cacaagggtc acattccatt cccagagcta tgaacatctg 36420
cttttctggg ataggaatct tggtgatggg aaacctccct gactgcacgt ccatttgtag 36480
gctctctgca gggggaagca catcacgtgc tgttggctca ttgtggcagt ccaacctggc 36540
attgtcttta cacaatcctg catgcaattt tgtatttaca ataatcagga gcatttcatc 36600
ttttattccg tagcaatagt ttcagggcgt ctccctacat gagtctcaga gtaaaaatac 36660
atttttatat cattgatttt catatgtgta cagaacacac tcctgtgctt gtctctcact 36720
ttcccttttc tgttgactat gatcatgtca ggaagatgag gtgcacagca cccaggtgag 36780
cctgtagtca caacacagcc ctctcagtag ccagatggga aaggggccca ggaccacact 36840
gacagcgata gcccagctgt acctcagcag gccttttata actgcccctg ggcctttcag 36900
agctcctacc gccaccacta atgctgtccc aggtgggcca atgggtagct catacagatg 36960
tgatgagctg gtagatgttc agaatattct taaattaaga tgcattccag gtcttattac 37020
caggaaacaa aatagaagtt cctaaggaaa tttcatggtc aaaaccaaaa tgcacacaaa 37080
tatggtaaac aaatatgttc tcacagcaag taagatagat gatgtttcat tccttttcca 37140
gaatttagcc cagtgtgaga tttaattgta gggtaccttc ttttctttct tctagatcca 37200
agcataaaag cagacattaa caaatggagg ataaaattgg ataattattc tacccaggaa 37260
ttagggaaaa caattgtcta tctggatggc tcctctcaat catgccgctt tagagaatgc 37320
aacatgggca acctgatttg tgatgcaatg gtaagtcatc agcaggagtg gacatatgct 37380
agggaggaag gaaaggaaga gggaagagga aggaaggatg cgagaaggga tagatcgata 37440
gctacagaat gagggatttc aaaacatttt tagccagggt ggtggcatgt gcctatagtc 37500
ccagctactc gggaggctta ggtgggaaga tggcttgagc ccaggaggct gcagtgagct 37560
ataattgcac cagtgcactc cagcctgggc aaaagagcga gactctgtct ctaaaattaa 37620
ataaatatga aaaataaata aaacattagt gcctaaacaa aagcctgatg tagcttcata 37680
ctcagtacct tcttaaggtg ctggaaggag atcaggtaac caccaacata ctgctttaca 37740
agaactaaaa cctttatttt aaggtgttct tagatgtctg aattttggat gttgctttcc 37800
ttttgtaaga tgatgcaaat gcacaaactc tgctaccctg gagatatcaa cataatagat 37860
aaaagcatat ctgtttggcc agactgtttt tcactcatct gttcacttga ctccagagaa 37920
ccctggcctt gtgtgacagg aacctctttc acaggcacag aaagacaaca cacagtttta 37980
atggtaataa caagtcctat ggctgagcaa gaccaggtat gaagaacaag cccgtctact 38040
tgtcaccagg cggaatggcc atggtagctc tctgggggcc ttactgagcc aaggtctcct 38100
atgtcggact actacttcca tgggctcaca gcctgcaccc agggaagtca tttctgtcag 38160
tgcttggcat gcattgccca gtgcccagga tacaactgca ctcgataaat gtttgccgtg 38220
taaatgacaa caatcatgct catgccaaga ccagactcat gatgtctcct cagcatcatg 38280
cagtcaatgc cagagaagta gttaaggaga agctggtata taaatgttaa gagaagtctg 38340
gagggaaaca acccaggctg ttggcagtgc ttatccactc tgttaaagag gctaggacaa 38400
aggagagctt ttgcttttta tttatttatt tatttaatta attaatttat ttatttattg 38460
agactgagtc ttgctctgtc gcccaggctg gagtgcagtg gcatgatctc agctcactgc 38520
aacctccgtc ccttgggttc aagcgattct cctgcttcag cctcccaagt agctgggatt 38580
agaggtgtcc actaccacac ccagctaatt tttgtatctt tagtagagac ggggtttcac 38640
catgttggct agactggtct cgaactcctg accttaggtg atccacctgc cacagcctcc 38700
caaagtgctg ggatttacag gtgtgagcca ctatgcctgg cctttttttt tttttttttt 38760
cggctttttg tctcctaaat gagaatatat tcattttatt tcttataact aaacatatat 38820
tttgtaaaat aaaagtaaga atggatgtga ttctccccag cttcaaaagg ctgtctctgt 38880
ctttccaaaa ttgccaactt gggaatcagc tgcagacata gacgtggagg ccaggttctc 38940
tttggcatga ggtctggtgt tgagcaggct ctagggccaa gccctcagaa tcaggcagag 39000
ggcatggcct gccctggaga agcctgtgca ggagtgactc tgcaggccac cttgcatggg 39060
aggcccagtc ccgacaccag ggctacaacc gagagaccat tttgctaagg ccacctggcc 39120
tgacagtatg ggagttttag gagggggaaa taagagtttg acatggactg gtaacagtat 39180
atttttactt ctgggagcat aaaattagtt ctaattatag ttgagagatc cctgggaaca 39240
cagcagcagg tgaaaagcag ctttatttcc tcactagagc taaccagatt cctggagctg 39300
atcctaagga agaagagcca gagtaactag tgtaaatctg tgctactgtt gcagattaac 39360
aacaacctga gacacacgga tgaaatgttc tggaaccacg tatccatgtg cattttaaat 39420
ggaggtggta tccggtcgcc cattgatgaa cgcaacaatg gtatgctccc aggcccagct 39480
cctcagtgtg tcatgttctc tctctgatct ccttttctgt tatggttctt gccctgaaca 39540
cacccatgtg cagcctcagt tcatctctgc tgccagccat gtaccagaat gtcagtcctt 39600
taaattaggg ttagccatta ttaaagtcta aaacctagcc tgaaggacac acacaagctt 39660
gtgtgttctt atatgtattc acacacagaa gctgcaggta aggttagtgg aactcaaact 39720
tggtctgaat ttctgactta caaagctata ctagttgttt ctgatttatg caaaataaca 39780
acagtcacca ccttggcatc tgaaaagaga caaaatcacc aatcttctgt cctgaaaaag 39840
acataccact ggattaaaat gttaccttag tatatttaca gatatccaag attatatcat 39900
atggcaaaac aaatcaacta cacaaatact caagacactc ttgtagccct ccctgattta 39960
ctgtgaaaac caatcagtat cagtataagg ccccaaacgc atcccattgt cagacaacag 40020
ggacaggttg caggggcaca ttgtttgtct ggaggaacgg aacaatgcaa aacaaactac 40080
tggagatcca gccccaagct agcagatcta agactgagca tggtgaactt ctgaggcaca 40140
gccatttgtg agctcccgcc atcttgccca gtggtggagt gcagggccag aggcctccta 40200
aactcattcc cttcctcagt gtcaggaacc acttagacat agctaatgat gctctataag 40260
cttcccatgt ccccctcatt tcttccccca gcgctgaggg aaactggtca tggtgtaagc 40320
attttctcaa gctattttcc ttcttgcctc atctgtgact accctcaggc acaattacct 40380
gggagaacct ggctgctgta ttgccctttg gaggcacatt tgacctagtc cagttaaaag 40440
gttccaccct gaagaaggcc tttgagcata gcgtgcaccg ctacggccag tccactggag 40500
agttcctgca ggtgggcggt aagtcaccca tcctgtaggg ctggcccatc caaagtgaca 40560
tggcatttcc tgctggttgg ctcagcttcc ccttcaccaa aattcctgtg gtcaaactgg 40620
tttggatttt tcccgaccca acaccaatac cttacccttt aatggctgtg gactctcaga 40680
gccactaaga ttaggttggg gtcaaagccc caaagcccca aagccctggc aataagcagt 40740
atttgccaag tctgatttgg acatcagatg ggccctctaa gcaggtcacc tgctttgagg 40800
tctagtttct aataaaatct tatttaagca ttcaattatt ttttttttaa atcctctcct 40860
ctgttctact catcgttgcc tggaaccatg ccaagattta aaggaatgct gtccaatctg 40920
tattaaccag gatgattcct ggaaggagca gcctactgca cagacatacc tgaacacctg 40980
gttctagtcc cactgtgacc tggggcaagt tctcaacccc tctgggcaca gggagaaaac 41040
ctttcctgtt tatctcatgg actgcagtga aagttgaatg gaaataaatg cttcctaaat 41100
tgccaagggc ttctgaaaca cttttgttaa taattccaca gtccagtgaa ttaatccttt 41160
aaaagcaatc ttattttacc tggaaatcat aatctggcct cattttttta accttaggtt 41220
gtggtcccgc ttgacagata gaattgcttt cctgagtttt cctctgcaga acacattgct 41280
gttgagactg aagggctaca gaaggcagtg gggccctgtc ctaatgccta tggggcaacc 41340
caaacatcct ccattttccc agaagggagt tgagggaaaa gcagttttct gatgctctag 41400
caaagcactg catagctcta tgccttgtgg aagtcaccaa atcccaaatg attaccattt 41460
aatggtcttc aaaattgtaa aatgctctct gcctctgatc tgtttctcac tttctaaggc 41520
cattgaatca atttagtaga ccaattcagc attttaaggg aaaaacagaa aaagatgtgg 41580
ttttgtccat gtagagtcac cctttgccta ttctggtcca gagctggttt tccttccagt 41640
ggttctcata tgtcctgtaa ggactcaagg ataaagggtg gctgcaatgt cacaattctc 41700
ttgacaaaat ttcactcagg tttaaaccaa aggaaaaacc acagccccat gccccaattg 41760
tagagtttgg ccaaaattcc ttaggaaaca gaaatctccc tttggatctg gtgaaaacag 41820
attcatttct tttctaggaa tccatgtggt gtatgatctt tcccgaaaac ctggagacag 41880
agtagtcaaa ttagatgttc tttgcaccaa gtgtcgagtg cccagttatg accctctcaa 41940
aatggacgag gtatataagg tgatcctccc aaacttcctg gccaatggtg gagatgggtt 42000
ccagatgata aaagatgaat tattaagaca tgactctggt aagcatgact gtctcttcct 42060
ttctctaaag aacaacaaaa ttgggccaag aaggggagct actagtggtg ccaggagggc 42120
aaagtgactc cctgtattgc tgactccctc ctgctttgct tgttggaaag cagcagcaca 42180
gcacagcaga gaggaatatg taccctgtca atccccgagc acacacatgc cttcatccaa 42240
ccccagagac aaaacatttg taaatcaaaa atgttttaaa attcttttaa ctacatctaa 42300
gcagctattt aaattgtgaa atataagtga gttttgcatt aacagaaata tcacacctct 42360
ggaattttag gatatggaaa accatcacat taaatcattt aaattttctt tcttgaagaa 42420
gtttctgtgt gaatcagtca ggtagaagtg gatggggtga gcgccgccct ttcagcacag 42480
tgggttggtg gtacaaatca taggaataac catacttgat gtcagttggt acttctggca 42540
gcattgaggg acattatatg ccagcctcac ttttagaaat tctgttgaga gtcacattat 42600
gtgcatacag accactgaca taggacacac ttccccttcc tcttcctagg gcagagacca 42660
gcatgcttac aagacacaca agcagcttct tggcatctct gggtagttcc accaggatct 42720
tcctcagtct tatcatgctg gacctgtcat tttagctact tattccaagg agcgtggagg 42780
gcaggaccag accaaaggag gcagcccttc ctgggaggaa aggtgctggc ttcttctgtc 42840
ccttccgtga cagaagatgg aagggcaggc gtaaaagccc catgagacaa agaggctgat 42900
agccatcagc tgtccaccac tcaagcctag actgaaggat gttagagaga ggagctcatg 42960
cagaaggcta aggaagccct ggacctaagc gttctcaaca ctggttatac attagaatca 43020
cttcaggagc attttaaatt aaataccaat ggctcggccg cagctcttaa gagattctaa 43080
gttaactgga ttggggctaa actctttttc agtgattcta gtgtacaccc agaggtgaaa 43140
accactgaag agagaagggt tcttaagtac tggagtgata tttttaaaat aaaattctgt 43200
tgggttccca acagaacatg ggttcccaac aaacatgggt ttgttgtagc tataggcaat 43260
tagcatttat cctctcctct ctttggtaaa ggaaatcccc caagtcccat catcaattac 43320
atgaaaataa ttattgggca ggcaaacaaa aggaagccag ttttatctct caactaattg 43380
ctcttgaccc agttccctaa tgcccttgtt ctggagtggt tcaaaggcta tgatgtcaat 43440
gacctctttc tagagtcatc acagagctga ggcaactcta cttcccccaa ttacccaata 43500
ttgaacaaca gacacatttt acatctggag catgaaaaag caatcaaatt ctttgtactt 43560
atgatcacta gcgttcttgt cttctgtgac actttacccc tgcttatgaa attgcttccc 43620
ttttataatt acaaaggact accttactgt tgattgataa ttaccttttc tgtagttaaa 43680
ataatgtata tgtttttcta ggtgaccaag atatcaacgt ggtttctaca tatatctcca 43740
aaatgaaagt aatttatcca gcagttgaag gtcggatcaa gttttccaca ggaagtcact 43800
gccatggaag cttttcttta atatttcttt cactttgggc agtgatcttt gttttatacc 43860
aatagccaaa aattctcctt gcctttaatg tgtgaaactg cattttttca agtgagattc 43920
aaatctgcct tttaggacct ggctttgtga cagcaaaaac catctttaca ggctcctaga 43980
agctgaaggt tagagcatta taaaatgaag agacagacat gattactcag ggtcagcaac 44040
ctagtgagtt agaaaaaaaa ttaacatagg gccctataag gagaaagcca actatgttaa 44100
gtttacgtgt ccaaatttta atgaaatttt actaacaatt ttaaaccata tttttcttct 44160
tcatatccat ttctaatcca tcaaacagct tatgtttaca taaaatttta tcattcacaa 44220
ggaagtttta agcacactgt ctcatttgat atccacaact tatttttggt aggaaagaga 44280
gatgtttttc ccacctgtca gatgaaaaaa ctgaagctca aaaagggttg acttgaccat 44340
acagctaatg ctgacagatc caagacctag acctaggtct tttgaactca agtccagcat 44400
tctcaactat atcaagttac tgttcagaat acttaatatc tcctctcttc ataattatca 44460
atagccccaa gctcatggat gacaaatctc tgctttattt cttgtctcta ttttttcact 44520
ttatagctcc tgttataata gcaagtttaa tggtataaac acaggatacc atcctctctt 44580
gcaacaccca tgtgcctttg atgagtcagg tagcaagctg tagtagataa tgagaaaggc 44640
cagaggctgc aaaagacagt caaaggacac gagagaaagg aaggggaaga acaggactcc 44700
aggactgttt tatattatag aaaagcaaga gctaaagagc atttacacat gttaaacaga 44760
tacttgttaa gcatagtgcc tgacacacgg cattagctgt tattttatga gattccatca 44820
gctctgcctc tgtcctcttt cttctaacat gaaggtatca tgagaagaga accttctaac 44880
ataagctgta attctaaacc tgcacttgtc cctctccagc aagaggctag cactgaattc 44940
attctactca tactacacac ccagttatgg aatgtccaga gttctcgaag aaaataaatg 45000
actttaggaa gaggtataca ttttttaagt cgctctgcct ccaaatctga acagtcactg 45060
taaatcattc ttaagcccag atatgagaac ttctgctgga aagtgggacc ctctgagtgg 45120
gtggtcagaa aatacccatg ctgatgaaat gacctatgcc caaagaacaa atacttaacg 45180
tgggagtgga accacatgag cctgctcagc tctgcataag taattcaaga aatgggaggc 45240
ttcaccttaa aaacagtgtg caaatggcag ctagaggttt tgataggaag tatgtttgtt 45300
tcttagtgtt tacaaatatt aagtactctt gatacaaaat atacttttaa acttcataac 45360
ctttttataa aagttgttgc agcaaaataa tagcctcggt tctatgcata tatggattag 45420
ctataaaaaa tgtcaataag attgtacaag gaaaattaga gaaagtcaca tttagggttt 45480
attttttaca cttggccagt aaaatagggt aaatcctatt agaatttttt aaagaacttt 45540
ttttaagttt cctaaatctg tgtgtgtatt gtgaagtggt ataagaaatg actttgaacc 45600
actttgcaat tgtagattcc caacaataaa attgaagata agctctttgg tctttgatag 45660
ttatttcttc tctcattcaa acacctgcca actaaagaca atatgactga aggaaaactc 45720
tgacagattt tcactggtat catttctctt gatcataagc cacttaccat ctgataaccc 45780
tccatagccc ctcacacccc taaaatgaga agaaaaatgc atccaatttt taagtaggag 45840
agaaaagtga gtggtatttt aaggaagact actaaatttg gtggctttga tatacagaat 45900
ctgatataaa aatggttttg aggccagt 45928
<210> 18
<211> 48382
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 18
atgtgtcccc gagccgcgcg ggcgcccgcg acgctactcc tcgccctggg cgcggtgctg 60
tggcctgcgg ctggcgcctg ggagcttacg attttgcaca ccaacgacgt gcacagccgg 120
ctggagcaga ccagcgagga ctccagcaag tgcgtcaacg ccagccgctg catgggtggc 180
gtggctcggc tcttcaccaa ggttcagcag atccgccgcg ccgaacccaa cgtgctgctg 240
ctggacgccg gcgaccagta ccagggcact atctggttca ccgtgtacaa gggcgccgag 300
gtggcgcact tcatgaacgc cctgcgctac gatgccatgg taagacccga gcccgcgccc 360
gggatagtag tcccggactg agagaggagc cgggctggaa aagcagcgga tggcagagtg 420
tggcaagcct aggtccaggg cgcggagaga tgtggggata aagtgagact ccggccagtg 480
tgccagctgg atgcatagaa gtcccttgga cgattcgtct taaacggacg ttattgcgcc 540
cccactatga gatgggagta tttagaggag caggacccag tcctgcctgc gaggggctta 600
aggtggggtg cgtagaagca ctgacagtta tccagagcta gggactgaat cggatccctc 660
acgcagctct catttactgc tagatctttc aaagaatgct ttatctcaat ctagatcttt 720
cgaaaaatgc tttatctcaa tcttattgaa agggaaaccg cgtcgaagaa gctgaataaa 780
ttaacaggca ccatccccgc aaaaagggag acgtgataag aatggacacc agatctcttt 840
gaccccggag cttgagagta attaagctca cagctttggg gtagattctc tgttcaatta 900
ccttctatgt gaccttgagc aagttattta ctctttcttg cctcagttct atcctgtata 960
aaaataagga ttataattgt acatagtgca catagtgtgg tctctatatt attaccagac 1020
acatagtgtg gtgtctatat atgttgttat taccagacac tataatatag ggttaagaaa 1080
aggaaagaaa gaaagaaaga gaaagaaaga aagaaacttc agagagagcc aaaatgtgga 1140
aagacttggg tttcaaagga tttcaataaa gtaggtttgg cggcagagat ttaggaagag 1200
tccaacttaa gagataatgg gctcttaaaa catgtgcaaa agtttagttc aaagtcccaa 1260
atgttaggtt aaaggtaatg ccttttctat tgcactaaat tctagaaagg gagcagaggt 1320
ggttgcacag taataaataa gtacaaccaa gatgtatata tggtggttaa aaaaaaaata 1380
aaaccaccac ccaaatttaa ccctgtgtaa aagccttgag taaagggatt cagcaagact 1440
gtggagaaac ttgggagaag aaaaaagagc tttggaagag gaagaactga gaaatctcca 1500
gagaaacgtc cagagaaatc tgtgaagtcc acaagtgtgg aagggaaagg gaagtgtgat 1560
taggctggga tacagaaggc taacagtaca ctttatatga gaagagctta gaggtcagga 1620
gagcacaatg ctgaggggtt tgtttggttt tattttcccc tctggaaact tggagaaagg 1680
aggacaggaa aagatgataa gagggcacca ttgtttggtt ttaaaggtcc ctccccaacc 1740
ccccacaaca agaaagctca ttttaatgag atgagagaag gggagaaagc tccctccgtt 1800
ctgatgcttt tgagcttatc agggacaccc tgcagcagaa attccagctg atccttcagc 1860
cttcagagtt gatgaggtgg ggctcacaca catctggatt tgaagaagaa tcgtgggttt 1920
caagaactct gtggtctctt aagcattggt attccacagc tacacttccc tgatactttg 1980
cactcagtaa tgacaaaata aatatttgta gaattgagtt gttgctaccc tgtgtctcca 2040
cttcctccca gaagggtaga ataaacttac atgtgtttgt tttattttgc atataaacca 2100
cgactttgta aactttactt tttgtgacac tcttttctcc acaattgtgt gaaccctttt 2160
tagtggaaaa ataagaatga agaattccta gttttcccta ataatatttc cctcttgtaa 2220
gtctagttga gctcctggtg ttaatcatgg tatcaaatgt ttatgacagg acatctaggt 2280
gaggcctgag taaactgctg taagttacag cctatctctt ctgtgtttgt tcagaatcct 2340
ttttgttgga tgaaattact ttctttcaaa ttttgctcaa gaaaattgaa ttgggttctg 2400
aggaaaaata ggctacttac gtaatgttgg cagggggtaa agattgggag cggatcacat 2460
ggcaaggtta gatggctgtg tcctcaggac ttactctgtg cagtttgggg gtgtgtcaaa 2520
catgatttga cgtgcataaa atcccagttc tgtagtgcta aagcatgtgc taaagctctg 2580
ctgtactcca gtattgcaca gatgtttcac tgtcaattgt ttatattata ctaaccctca 2640
gagaatttat tgctgttgtg gtataaacgt tcctccactg taaccactca ccgcatattg 2700
attcctagga cactggggat tcaaatcagg gtggaaaaca gttgtggagg ggcatctcca 2760
gtttcctgga gactggacct agttcaaagg agccacataa aaggactgaa gcatatggag 2820
gacaaagagg aatttcctag aatccagcta aaacacagga gtcccctcga gcaggaactt 2880
ccaaggaggt gtacactttc tagacctact ggagtaatcc tgattgctgt agaggaccca 2940
gggcaatcct ggagaccccc tggaaccact actaggcaga aaactgggaa gaggatggca 3000
aaactctcca gttagtgtag ctgtctcagt ggaaggactg ttgtcccctg aggcacgagg 3060
aagacagcat ggaagcactg acagtatgcg cttggcctgg ttactttctc tgagcttcat 3120
gttccttagc tgtaaaacag gagtgataac ccccatcttg cagaggtttt gtgaggaata 3180
catgagataa tccctgctgg tgctcactga aagtgagttt catgtgccct gggtggggag 3240
gaggggaatt gtgaggaagg gaaggtttgg gttgtgcatc agattatctg attctcacag 3300
cctggcacag tctaccacct gcctgccctc taccatctta cctaagtacc attgcctaaa 3360
ctatttttgc tatagttact cattgagcag acttatgaaa tctccccaca gcacaaaact 3420
gaaggtatcc acctccagag gtgagattct caccaagaca ggctgtgggg aaggtagctt 3480
tgagcaggtc tctagggtcc cctagaaagg tgccttttct ctgcttggct gctgcttctg 3540
aggccctggc tgctgagccc acttcctgcc cataaggctg cagccttagc cctagccaat 3600
tgctcctaaa tgggggaggc agcagccagg gtgggaacca tggcccccgc ctttaactca 3660
cccaggggac aggttagtca gatagatggt cctttgttcg ccaccctcca cctcccacat 3720
agacaacatt ctaatgtgtt tgatggactg catatagttt tgtttctaca tgcactttca 3780
aaacctttgt gtgcttatat tttataatgg catacccagt cctgttattt agccctttct 3840
gtttcttact tttgtcctga acttattgtt tttatgattt gtccatgttg ccctagatgc 3900
ccttatacct atacatcact gtgaattgtg ctctgtgtat tttacccatc tccacgtatt 3960
ttacccatct gctccctggc tgatgggttt ggagatggcg tctaacaccc tgcctccaca 4020
aataatgctg ctccagatat cttcctggat ggtctcacca gatgatttct acccaggagc 4080
agaattgctg gccacagggc atactttgag tactacaaga ctggtttcct caatgactga 4140
catcctccat tctcttccac ctactcccat ccccaaacag gacatgtaaa tttctatatt 4200
ttacctatat atcatggcag tatccagctt tattgtaggt gtaaaataca cattgctgct 4260
ttatcatgta cttttctaat aactagtgag tttggacatt ttttgatatg tttgttaact 4320
ttttaagctt cctcttctct aattgcttat tcatattctt tggtattttt taaattgggt 4380
tagctttata ttttttgact ttcaggagtt ataggttttg atctgtgatc aattttagac 4440
attgcaaata tattatgatg atattaggtc tattaatttt gtccatggta tctttcattg 4500
aacagaaatt cttcatttct tgtaattgaa ttaatctttt tttccttgtg gcttgtgctt 4560
ttgaaagttt gcctaatagt ttgctaattt aaagcttatc acctttaaca ctcctcactg 4620
gaaattcaga gatcaccacg gttctctctg ccttttaaga gataaatcta caatctgtgg 4680
ctatgcaatt cacttctcag catcaccagt tcctcatcta taaaatagaa gtttgaacca 4740
gcttacctct gaggtcccat acatatttga acaactggag ccctacagtg aaatgttttg 4800
attcagtgaa gggacaggag gatggggcaa aggtgtagag gcaggaatct acacaatgga 4860
ccaggttgga ccagagtgag gtagcaggaa tcatggcaga agaggttgtg taaactctaa 4920
aggccacctg agaagtttga gttatgttgc catgggcagc catgtggttt ctgcgtagaa 4980
gaccactgtg gtataataaa aaataaatat ttggtccttg tccccagtac ctgtcacaga 5040
gttcctaaaa ccctagaaat ttcctaagta atagaaatgt cctttgtgat tcataataaa 5100
ccccatgata atacctgggt ttatgctcct gcagacttag ggtgagatcc ctagatagct 5160
tcaagatgga accagtcacc agcaagtcca agagattagg ggatcaaaga gttggaactt 5220
tcggtaccac ccaccaaagt ccaggaaagg agggtgggac tggagattca gctctacaaa 5280
aactcttgaa caagatgtga ggagcttttt agttgctgag catttggagg ttcctggagg 5340
gtgcacaccc aggtagggca tgaaagctcc gcattgcccc catatcttgg cctatccatt 5400
cttcacttgg ctgttcatct atatcctttg tatagccttt ataataaacc aatgagtgag 5460
ttctgtaagc tgtcctagca agttaattga agccaaggag gaagtaatgg aaacccctga 5520
tttatagcca ttcagccaga agcacaacct caggcttgtg atcagcctga agtcagggca 5580
gtcttgtagg attgagccct taacctatgg gatctgacaa tgtcttgaga tagtatcaga 5640
attgaattca attataggac actcagtttg tgttttctgg agtactgttt ggtgtctggg 5700
ggaaatccca cacacatctg gtcacagaag tattctgtgt tcagtgtgac agtagaaaaa 5760
gcagtttgtg tgtatgtgtg tgtttgtgct tttgtatgct aatgagatga tctatggctg 5820
ggggccctta gatagcttcg ggaagggggc tggtcgccag aaagactaat ttatgactag 5880
agggttagaa ctttcagccc tacctctccc acttctagag ggtagatggg ctacagattg 5940
agttcggtct ccaacagcca gtgaattaat caatcatgcc ttcacgataa accttcatga 6000
agacccctaa acaaccagat tcagagagct tccaagttgg tgaacatatt gaggtgctgg 6060
gaaggtggtg cacatggagt gggcatggaa gctttgccca tccactccca tgccttgccc 6120
tgtgcatctc ttcacttgct tgttccctag tcgtatgctt tataataagc cagtaaagtg 6180
ctttccaagt tctgtgactc gttctagcca attatcaatc ccaaggatag ggtcatggga 6240
acctccaatt tataactagt tggttgggaa tacaggtggc ccaggactct cgactggtgt 6300
ctgaagtggg ggcagtcttg tgcatttgag ccttctcact tgtgggatct gatgctaagt 6360
ccaggtagag aatgtcagaa tgcatctgaa ttgttggaca ccaagttggt gtccacacag 6420
aattggggaa ttggttgatg tgaggaaaaa ccctcccagc tacataggga aatcacggtt 6480
ttggaagatg agcagaaaag ggtgggaggg agaagacaag agagaagagg gtgaggaaag 6540
aggccaggca gcagtccaag tgtgagatga taacacctga actctggtaa ttgaggagac 6600
caaatccacg ggaaccagct gcaaaaggca ttctgtctgg taggtatagg gctaagggag 6660
taaacgtggg ggtgagactg ggatcaagtc cccttcctac cacttagaat ctgtggactg 6720
ggagcaggtt atttaatctc atgaaacctc catttccaca tgtgtgaggt atgtgattgt 6780
tgtctccacc cctagggtgc tgtggggttt acaagagata gtagtgcaca taatgtgctt 6840
agcacagggc cagcccctag taagtgccgg ctgtacttgc cttctctttg caggggtggc 6900
tcaggagagg gtgtaatcaa aagatgatta tgagggttca agcaccaggc aatgggagag 6960
tcagcaggtt ttaatgcttc cacacagagg ctggattctt ctacttccgc ttaccgctga 7020
cagctgaggc ctgctacctc aagccctgaa atgcagtctg aagacttttc ctggacttac 7080
tgggcaccat tggactgtac agggctgtgg cagacaggag cagtggaaag agatttggat 7140
gggacgtcag aacacccaga ttttacttct tggctcttcc acttactagc tgtgtgaatt 7200
tggcagtggt ttccttatct ataaaatggg ggaattggag agggtggtct actgcatgtt 7260
acttctcagc acctcagctc ctccccaccc ctctctcttg cttctaatct gcctacatca 7320
cttggtttgt ctctggacaa gtctaagttt gcatgacaac tctctaattt ctccttcaca 7380
gttatttatt tattttttgg cagctatttc cttgtggaga gatagcgtag gtctttcagg 7440
tgcttccatt tggccactgt ttgcaggtga agaggtttcc aggatgctcg cgggatgtta 7500
ctctggtagt aacagaaatg atggcctggg gaatgggaag agtatgagcc cagttttgag 7560
cctgagtctt tggggctcat ttattcattc aatctgcaaa tcttcattca gcattactac 7620
atgccaggct ctctgctaag tgctgaggat agcgatcagg acctgctagg gtgaaaatta 7680
ctcttcagaa gatagcagag tggataatgg ccattcgagt tcctcttaac tacacagagg 7740
gactgggtaa gatctataaa ggagagaggt taaacagcaa agcaaaagtg actgaaattc 7800
taacacccat gcatggggat tcttgagcta aagatctggt ctgtagaaga agacacatca 7860
gaccctcata tttagcatct aagccgaaat cagcagtaag catgtttctt ctcccataaa 7920
aacttgtccc taagaagaga gacaagtgct tacaaggact cttatgttaa aggaaactgt 7980
cacaattact ttatcttcag tcatgaagcc tttcttgagg aaagaaaaat tagcaatact 8040
agtttgggaa gcttgaaaga gagattctgt ttgcaatgac acttttaaaa aggcaacaac 8100
aaccaaaaaa gtaattattc ctaaagctca ggtgtcagaa gttgtgaatg tctagggggt 8160
tgtggatccc caacctcaga gacaggaaca gcaccactcc catcctgggg ctgtggcctc 8220
aggcctctgc ttcctctttt tcagtctaaa ttccagagca gagaggtgct ggatgaccag 8280
cttcaggcag gggagggagg aagtaaaggt agtttctcag ctcatcttgt cctcgtacta 8340
ttaatagcat ctgttgtttt cttaactgaa aattggagag aaccaaacaa atcaacatca 8400
tattggcggc tttctgcaca tcactcacaa atacatctga atgctctgta ctctccagtg 8460
tttattttcc tcagccacat aatcggaatt tgaacattct tttgacactt cttttccctc 8520
acttgaataa ttagttctat atataatcca aatattagct ggagccctct agcccctcag 8580
agatatagtt gaaccttcct ggacctcaat tagttgggca gttaattctg aatgttgatg 8640
tgcctatttt gcacgtatgg agagtttgta cgtcccccct ccccaaacaa gtagaatgct 8700
gtctatttgg tacttgctca ttcatacatt ctttcaacaa ccaaacccag ttgcagctgg 8760
gttgaacccc agggtgtttc ctgctattca gtgtgatgcc acattcacct gctgtggcca 8820
acattccttg gcccaggact gcagggatgg aagccaaccc aacatcagaa gtgggcaaca 8880
atgccagaaa cctatcaacc aaaacaaatg taatttgtaa tcctccctgg atgatgttga 8940
gtgattggca actgagcccc gtcatatagt aaatcaggct gtctctcttt aaaaaagaga 9000
gacagagaga gagagagtct cccaatgttg cctaggctgg tctcaaactc ctgggctcaa 9060
gcaatcatcc cttcttgcct cccaaaatgc tgggattaca ggcatgagcc accatgccca 9120
gcctaggctg tctctttagc tcttcaatta tagggcttcc tgcaacctca gcccttaacg 9180
aatgagctag cgtgtaagtg actagctaaa taaatgtgct gcaaacctct gacaaatggc 9240
tttttgacat tattaaaatt tatttatgtg gaacaaaaaa attcttttct tctgcatcat 9300
gcatcctaca ctgcttttaa agaaactctg ctatcctttg ttttaccgtc taacatttct 9360
ttgtttccct tctgacatgt ttcagcctct tgtcatcact tggaacattt ttactgcctc 9420
atccctctag tttccatttc ctagcctagc ccagcccagt atttttgaaa cccaaatacc 9480
tggaattttg cctgatgttt aaagccaaaa ccaaatgaaa atgtgaagaa ccaaaggaga 9540
ttttattctt ctgtgactca aaatcctgtt gttattgtta aatgaaccct gtgggataca 9600
gtctggtatt tttaacacct tctgggggta tttgtcaata tttacatttg atgtggatag 9660
cttataaaag tccagtggct ttgggtaaac cctgttcact aagaataaga cccagaaggt 9720
catatgaatc aacgatcaaa gaatgaagat attctgaatc caagtttctt gttttgatta 9780
ttaaaaatac cttgcaatta gttaggcatg agagaaagct tgtgtataaa gtttggcttg 9840
taattcacat gaattctcta tacaggccaa aaactacctt ccctttggga atatacaatg 9900
accttatttg gaaattactt caggtaatta tgaagtaata atatctctat tcatatgcta 9960
aaaagaattc aaaataacat tcaattatca tctcttagaa actaaattaa acgggtactc 10020
acccaagttg aaatgcaaac aaatgcatga tatcattatg tgaccctccc ctagccctta 10080
ccctgtccat ggcagatgct aagaaaaact tgtccctaag aagagagaag tcaattgatt 10140
tagtcagcct ttcccactgt gtttggactg agcctcagaa ccctccttat ccccatactg 10200
caaggagtta ctgtcaatca actagagatg gcgctcaaga taaaattatt ttccatcctg 10260
tactaatata ataataagca tggaatctat ttattggttt tttcacaaca ataaaatata 10320
tctttagcaa gtaacaaatg gtagaggtaa tacttttaga taaagaggga agaatccttc 10380
tgacttttct ctttatcaca ttttacccac acacaatgtg gatggaatct gcaactgaat 10440
ttagcagtca cactgaaact ctgctgcatg accacactgt cctgaggcca tgtgccaact 10500
tctatgtttt tttttcccaa agctcttcca aggaagagct gccacttctc catttactct 10560
tttcacctgt aaatgaatgc tgccgttggt attgagaaaa tgggtgaatg aaagaaatca 10620
tgagaacact cctggggtga gagcaagagg aatgaaaaca cgtggtgtga gagcaggtca 10680
gagatggtgg ttaacacagg aaccaggatg gaggcctcag acttgaccct ggcctgcagg 10740
acgttggcct gcattgtctg caggaggaaa cacagaggtc agtggatatc ctgcggtctt 10800
ccattaaggc ctcttactat tctggaacat tttattttta tagagaaatc tcccacttcc 10860
tgatctcccc agttagaatt gattgatcaa agcagtccct ctccctgtta ttaaataatt 10920
aaataatatg cattagttga tccctataac cacttaaaat gtatctaaca caggagaaat 10980
ctttggtaaa tatttactga atgaatgaac aaataaataa ttggagggag ggagaaagag 11040
acagagttaa atcctgaggc tggagttttt ggtccttcag ccccttatcc cattgcccct 11100
ggcagcccca tccatgtgac cttgaattct gtccaagtct catcactttc tatgtgtgtt 11160
gtaatggaaa aaaggttaca aaacacagtc tttctggcct ggagtaccaa aatagtaaag 11220
acttctgttt tgaaagcctc tgacaaatga acacaacagc cacagtgata actttagcag 11280
gaaattgaac cacaaaaagc aagcgagcca caaagccctg aaggctaaaa atgcctcaca 11340
ggccaagagg cagttaggtt gcacctgaga cagaggctgc tttcttaact gggaaatggg 11400
tgtctccaga tatataaagc tgtgaggaga tcaccatggc ccagctctga cgagccactg 11460
gtacagcatt gttggttagt ttgcctcaac ctgcagcaga gtttttattc caagagttaa 11520
cagttagtta ggctttttcc aaaggttgag atgacacccc aactcccaca cccatttccg 11580
tcctgcccca gcttgaaacc aggcaaagtg agttaagaca ttgggagatg ggaggggaca 11640
ggaggggtag aggcctggca cccctctctc ttctgtttca agagacactt tgtggagttg 11700
tctagagaga ctgggggtaa ctgctccttc tgcccccaat ctccccttag ctccccactc 11760
ctccctatcc tgccaggcca accaccagca gagaagggcg cttagctcca gtgacgctgt 11820
gagtgaggta cttgtgagga tgcatatggt tcctgttgga aatgcacgga ggagcagtga 11880
aacaccagaa gccccatcct ctctcctgcc cgtgccctca tgcccagcca gctgaatatt 11940
cctgtggctt ccactcctgc actgtcccct cagtgtcacc tccccaccac attcccacag 12000
ccccacttgc tgtgtcacct ctcccctgag ggagaggtgt cttcctgcca gtctcttcac 12060
caccaggctc acacaactcc catccattca ctgcaggctg ccagactgag ctttccaagc 12120
acccccggac tgcctccagc cctactttct cttcaaggct tcccacttcc ctcaaaatga 12180
aagccctgac tccttggaca ccatacagac cctccctgat ctcccagcga tgctcttagc 12240
agcatcattt ttttttctgc cagattttca ggtgccctca gttccaggag caccaaatca 12300
cctggatgcc atgagctgct tatgacacag ttcaaggtgt gacatctttg aggtctctcc 12360
ataacacctc tactgcagac gtctgtctct ggtgtatcag ctgtaccttg cacacatgtc 12420
tctaaccatc tgtccactgt gtctgtctgt tcacacatca gattgaccct atgttctctg 12480
tgacaccagg tgagctctgg caccagtgcc cagcagatga ttactgaatg attaataaaa 12540
ctactcactc cacctgctgc ttttaaatgg gctccatttt ggtatgattt ggtgcgtgaa 12600
ccattaaaac ttttcagaag tcaggatttg aaacaaactt ggtaaaaagt agttaagcaa 12660
tgtcttcgtt gtaaagtgag aaaccaagtt gcttggaaac aggtgaaaag cttagccagc 12720
agcttcacaa ttcacagcct ggcactccag tagccctctc tggacaattg cccaggtttt 12780
cttgttttca tttctgtgtt ttaatttaca agttaccttt atatttctac taatacctta 12840
ctggcacttg tttgtgaatg aaacaatttt tttgagagtt tgttaattgt tctgtgtgag 12900
tgtggaaagt ttctttcata taagatcatc aattcttttt ttaaaaatag tgttttaggt 12960
tttctcctac ataaagaaag tggtgggggt gaggagctgg gggtggacca gatgctcaca 13020
ttgtatttct atgattttgc tgttggagat actgaagatc attttgctgg gctttgctct 13080
cttaaattac agaaaagtgc atttggtgaa gtttagtgtt tttagtaaac atatacaaga 13140
tgtcataggg atagaattag tgaaggaata gggttcgtga atgaatgagc tcaatcttag 13200
taatgaaatg tttggaaagc aacagaacaa gagagaaaga gcatcagaac atgatttagg 13260
taccccatac atttatacaa ataaaaaaaa gatctgattt agggtcatag aaatcacata 13320
cctaaactga tggttaaaca ataagtggca agtaacagtt tagctacaaa aaatgtacag 13380
gaacacataa ggaaagttac atgggaactt tcagagagga ccaaaaaagg tggaataaga 13440
ttatctgctc tgttatgatc ccctgtattt ctctgctcta tgttcttctc ttgggcagtc 13500
ttttcctgag atgttttcat gttattaatc aaaaataata gaaacgaata aactagatca 13560
atgcatttac aagatttttt gagtgcctgc agtatgcctc ggtgtggtgg gaacttggtg 13620
tactgaggtg agtagcatga tcctgaagag cttaggaatc actgagttct cgtaagtgca 13680
atagaagaaa aactggtaga aggaagtcaa atgaaaatac aagtttctaa agaaaataag 13740
tatttacaaa ggaatttgtc tgcattaatt tctgtggaaa agttagctat gtatagaaga 13800
aagaaacttg ttagtaattc tttcattcat tcatttattc attcaacagt gaatccctgc 13860
catggtagta ctaactactt gcactgtggt agtgcaaggg atacatttgc tgtgcaagta 13920
gttaggaatt tttttttttt tttttttttt ttttagtgct ttgtgtatgc taggctttat 13980
gcaaatatac agccaagcac agaggttaag aactttggaa tcaaaccgaa gtggttctag 14040
tcccagctcc cccttacaaa ttgtgtgatc ttcaacagtt catttaactt ctctagatct 14100
cagattttct catcagtaaa atgagtgcag taagagtacc accctcacag catgaaatca 14160
gattttctgt actctttact aactcacttt ccatgacgct ggcatgtggc aagtgctgac 14220
tcaatactgg ctcatttcct taactgcttt tattttattc attcactgcc attccagagg 14280
aaactgcggt tctgagatgt gccctgggtc actgctgagg agcggacgga gcctgggagg 14340
gagctacccc tgccttgttt cctcacactt tatggggtgg gtgtcaggaa tgaggctgga 14400
ggggaaggca aaaccagatc cctaagagcc ccctggaccc tgcttagggt gtttacttct 14460
atcataaagg ctgtgcttct gtagagacct aaggcagaga gtgccaaggt caggtttgca 14520
tttgagaaaa gtttttctgg cagcaatata aagaatagtt tagcaaaaga aatgagaata 14580
gctggaaacc cagttaggag gctgtggtag tccaggagca cactaaggca gcaacaatag 14640
gtggatggaa actggagcat ttaggtaagg gagagggatg ccagaggacg tgatggtgag 14700
ggagagggaa gagggtggga ctccaccgca ccagggcttc tggctttatt gactgggtat 14760
atggtggcat cctgggccaa gacagtgagt cacagggaag aggctttgag cctggtaagc 14820
ttgggacact tttgagacat tgggaggaaa tgacacatgg cagttgacat acagatctag 14880
acgtcaggag aaaagtcagg gctggagatt ttcggtttac agttgaaggt atgagtagga 14940
atgagatcat ttactcagtc catcccacca cattgagcac catgaataag gggctcctct 15000
ccccatgggg caagagaggt gaaaagacag tcaaagtcac tactttcagg aggattaggc 15060
tcaagttagg atggggaggt ggagaaggca tcaaacaaat aaatgaaaac tataacttca 15120
gttattgcca agtgctagga atataatgtt gctgttaaac tgcaaggtgg ctaacaagtt 15180
ggatagtctt tcttgaagaa ctgtagaaac attcattaaa aacaaaataa ttttatagtt 15240
aagcaagaaa gtgattgtac tataacaagg gagacttatt tcaccatagg aaaaaaatca 15300
taacattcta agaatagaag gaaaaaataa caattaccca agcctgcaac aacctgtaag 15360
accagatcag cttttaagag gaaagttaat tagataacag gaagaggtaa aaatgagatc 15420
atcagagatg ggggaaaagg caaatttaga aattctgaat ttaatagggg agacaagtat 15480
gtaggaatat aaacacattg tgctttctct gcacaaacac attcacatga ggatgtatac 15540
acaacccttg caggcacagg tactcagtga atgcctgcgt gccaaggagg gggacagcac 15600
tgagcaggag aggtggcagg gcctggtggc ccaggagcac tcatatagtc aggagactga 15660
tataaaagaa agatcctgct aacatgccat tacagagtat ggaatgagca aggaaggaag 15720
agggtgaggc tgtaagagag tgcacaggcc ctcatttaaa tggacgggtc accagccagt 15780
acttcatgcc tgttcctaga tgacccacaa agtgaaacga aactggatgt ggtatttagc 15840
aagtctgaga tttgctggtt taagtactaa gttagcaaac cagccttggc agccagagca 15900
ggtagggtag ggggcccaag gagatgggtg tatccaggag gccaatggag gaaggagagg 15960
aaggttttct gggaagccag ctgacatgtc atgctttgtc agtggggcct ctgcatggat 16020
tccctccact cccccccagc ccccaactca ctacggggag ctgcaaaatg acagacaatg 16080
ggtggttttg tacccagcag ccactcctca accagtgagg aatgggagag agataagaag 16140
gtgggaaaca aggattgctt gtcaccatcc ctgctcctcc tgaagccctg cgctgggagc 16200
aggtacagaa cccgctgaag gtcacctaat catttcacca tggagttctt ccccagatgt 16260
gaaactgctg ggactggagc aaagggtgcc cagagctgac tggaggagga agccagagcc 16320
acactggaac atatggagat gctagctcct gcctgctatt gaggaacttc attttcccag 16380
aagaattaga ggaccacagg agtgcttggg aagaagcaat tgtggcaacc taggtgaatg 16440
atgacctagg aggtgaaatg aggaaggagg cctttggtct ctgccaacca tacatttcag 16500
gttgcctttt gctaaagtcc tgctgatgtt aatcattctc tccagcagat cgcggcttga 16560
atcatggggt ctgaaatgtg ggtttatgag agaaaaacaa catggaaaaa ttgggcagag 16620
gctggtaaat gtgaacattc aaacctgccc ccaatctgag atggtgggac agtggacccc 16680
tggccagctc tccaggtgtt gcgtatgcac attggcaaac cacctttatt aggtggtgtg 16740
acacctgggt agcaattctc aagcaatgtg atgatctgaa tgtccctggg cctgacccag 16800
gtgaggagga cacaaaatca tgcaatatgt ctcatagagc ttagtcagtt ttgagaaata 16860
ctggactaat gttatgtttt taaagcacct aattcttttt ctcttttctg ttttatctag 16920
gcactgggaa atcatgaatt tgataatggt gtggaaggac tgatcgagcc actcctcaaa 16980
gaggccaaat ttccaattct gagtgcaaac attaaagcaa aggggccact agcatctcaa 17040
atatcaggac tttatttgcc atataaagtt cttcctgttg gtgatgaagt tgtgggaatc 17100
gttggataca cttccaaaga aacccctttt ctctcaaatc caggtatttt ctacttttat 17160
agcactcaat gcttgaaaat agatgcccta aatcacagct tggcattata tttatggact 17220
gtagataaaa gtaggaacaa ctgggtaaat agtgcactag aataacactg ataaaagcaa 17280
tctggtaaat tctatgctgt tgttccaagg tacttcttaa agagatgtcc tcttctgctc 17340
agtgaacttc tttgttgaca aaaacagtat ctttgtctaa ttcagtgctt tcaaaatggg 17400
aagaggagac aatgtatagc cagtgctaaa cagtacagtt tcatttacta gagaatgaga 17460
gcctcaaaac tcactgccct tctcctgtgg agccatgagc acagagccag agttttacct 17520
caatttacct tattgctcct tttctaggat gattattaca tgtccagcta actcaccttt 17580
ctcgcttttc attattcatc tcgttttaat gacattcttg tatctgactt tctgggaaat 17640
atatatagtg gtcttatgat atatatggat atatataagt gtatgtatat acatttatag 17700
acatatatac acatatgtat atataatata tataatctta agacacacac atacatatat 17760
gacacatata catacataca tataagacac acacacatat ataatcttaa gacccttcaa 17820
gaaactattt tgaggcactt ttataagaaa actttttgga atattttggc aatgagaaca 17880
aaaatgacat tttctttaaa ttatccaaat taaatgtgtt tccatatttg aagaaaaaaa 17940
gggaaaatag gagccagggt tatctagaaa catgagtttt cattaagaac taactgataa 18000
aggacatata attgacaatt caaaaatata tatttactgt tgttgttgtt ataaaataaa 18060
taagtccagg ccattggaaa caacaacaga aaagtcagct gtgtttgctg agagcagcaa 18120
gcgggtcctg cagctggcat agaggcagag ctggaggcag ggtggagcgg ctggggctgg 18180
agtggtcagt gagaagagtg agagaacttg ctaaagaacc ccccaacccc accctggggt 18240
caggtttact tgctgaagaa cctccttggt ccccttgaga gctgtgtaaa acatccctgg 18300
ggcacaggac attccagaag tgaggtcaga gtcacaagct aggatgcagg ctccatgagg 18360
acagaaacct ctgctgtctc actattgccc aatgactgtg cacagtgcct ggcatatgat 18420
aagtcctcag gaagcattgc tgggatgaat acatgagagc aggaaggtat aggggccaca 18480
gtcacagaac agcagatagt tccatttcct ggagcatggg ttctgactag gaaaagctag 18540
aaagttggat cagggctaga ttgtagggct gagaagttca ggtgtgattt ggaggaaagc 18600
attgtctctt ctttaaaagg cacccataag gaggtaggag gtgcaggtct tggtggaaag 18660
gcacagcagc tccagaaatc aaactgcccc agctgggact ctgaccagcc ctttaccagg 18720
ctgtaacaga gaggggatca aaagtgacac tgctgtttcc agcagctgct ggaagagagg 18780
gctgcatttt gcagggccca gtgtgggggg aaagggatgg ggtggaatag aatgtttcca 18840
ggagcactgc ttcttccaag actacaggcc cagcggcctg gtgtgggttc ttcctacagt 18900
tctttgaccc taatcaagcc cagttctccc agtcctagtg gcctgggcac ccacagggtt 18960
ataacctgta tttctagcac atttaatcta atttatttta gctagatcat taaatacctg 19020
aaatacaact ctcctcatga catcttgaaa tataagccat gctttgtata tatgggatgc 19080
ctggtcctgt gaggtttctg atataataga ggaatacatt cctttgtcct cagaggcatt 19140
tctctgaatt ttttttcctt cttggcagcc ataccttaag caaaagatgg tagcatactc 19200
tcgacattta gtcattcccc tgggtacctg tctgtgtgcc aggcactgtt ttaggcacca 19260
gggatactga ccaaaaaaaa aaaagttgcc cactggagct tacatattca gtgggggaat 19320
caataaatga gtaaatggtt tggtatgtca gaaggtgcta aggctgtgga gaagaattaa 19380
gcaggagggg aatggcaagt gcagcagggg ggaaggtata ccttgttgtt tcaaagagat 19440
gctcaaggaa aggctcacat agaaggcaac atagcactga gggtcaacgg ggagccctgg 19500
gcttcttggg gcaaccaaca ggaagcagat tgtccttgat gttctcataa tagacaacag 19560
tggcaaaact acaactcttg atttttcaac attgcctatg agttacagac agccttgcct 19620
gtccaacaca tctgtgagct gctaattatt aggttgtcat tctggggcct ccctggctat 19680
gatgtcagcc atgaattatc ttccatggta atttgcactt gtggagttgg ctaatgttaa 19740
ggattcagga gttcccctgc tgaagcacac agacattcct aaacctcaca gtggaacaag 19800
ggatgcattt ctaataatct tgttactctt tggattagta cttttcaaac tggagtttgc 19860
agactcctgc attttgaaag cttttaatga aatgtcattc tcccaccctc cttcccttgt 19920
ttagtgttag ttcttagaaa tcagtgtaag aacgtctgga tctactggag gactccctag 19980
tcacagagta ccagacatga cttcagggca gcatgtctgt atttgcagtc atcttcatgc 20040
cagtgatagc cataaaacct caaagcttcc tcaaatgtgg ctaacagagc cagttggagg 20100
aaccagttgc cttgctgcac acagagaacc tgggttgcac agtgagagcc atgcttgact 20160
actgataaca aaaggctatc gtttgttacc actatgtgca gtaataagca cccagggaag 20220
aatgtacaca atcccagtgc cagcaacatt tagatgaggc cctcatgatt aacacagagc 20280
tcaataatgc tctaatgctg gcattattta tatatttata tattttttga ggcagagtct 20340
tgctcttgtt gcccaggctg gagtgcagtg gtatggtctc ggctcattgc agcctccgcc 20400
tggcaggttc aagacattct cctgcctcag ccttccaagt acctggcatg tgcctccacg 20460
cctgaataat ttttgtattt ttagtagaga cagggtttcg ccattttgcc tgggctggtc 20520
tcaaactcct ggcatcaagt gatctgcctg ctgcagcctc ccaaagtact gagattacaa 20580
gcgtgagcca ctgtgcctgg cctgctggca ttaaagagaa catgacaaag ttatcaactg 20640
ccatattaaa ataggcattc aaggactact tgctgacagt tttaagtttg aacactgatt 20700
atcccaagtt tccaacgaac gtcttggagt tgctaatact cttttgtgta tcatgcttac 20760
gtgcttaggc attttctgca ttggtattgt ggaagttgag atgtacaaat atataaaaca 20820
atatctgaag cctcctcaca gtttgcaacc aagttcaagc aaaacaatat ccacaacctt 20880
aaactaatac aacataaagt gaatgtaatg aggatattga tttgaatttt tatttacttc 20940
ttaggggttt tgtatttaat gaataaatat gttttggttt tactgactct tgagctttag 21000
taaattaagg tgtttaacct ttgcatgtta atatgtatat taagtaatat aataaaaatt 21060
gttaataaat atccatttta tttattttgt tccttaggga caaatttagt gtttgaagat 21120
gaaatcactg cattacaacc tgaagtagat aagttaaaaa ctctaaatgt gaacaaaatt 21180
attgcactgg gacattcggg ttttgaaatg gataaactca tcgctcagaa agtgaggggt 21240
gtggacgtcg tggtgggagg acactccaac acatttcttt acacaggtaa ttgtttcaaa 21300
aggattgcat gggccaggat gtccagataa gcactgtgtc tcttttgcct ttgtaactgt 21360
tattactctt tttactgcta tttaatatgt aatgtatatt atatgatcta taatatatat 21420
gtaatataca ttaaatggga acatgtgcaa atcttacaaa cattactaag aatttattga 21480
tttcatgatt ataagtcaat tccatatatt tatgtatata ataaatatat aagacataaa 21540
atatgtataa aacatatata taccatacat atgaaaaaca aaaagactat tgagccccaa 21600
ccaagtcatt tatgatagat tcttatcaaa aagcaacatg tcatataaaa actttaagca 21660
taaacaaaat aatgtaaata aggaccttta taactcaagg tggatttgat ttgtaggttg 21720
cacaggtagc agaggctgga cctcgagtgt gagtcctaac tctgctgcag actaggaata 21780
taagtatcct ggaagccaca tgaccctggg tcctcaggag agcatgcact cttcctaccc 21840
actacctgac tacccccacc actctctttt tttctctctc aaattggtgt taagtagttt 21900
tcatcagaat aactctttta tgatttttcc actttaaacc tataatgtcc caaaactttt 21960
gaatctggtt ccacctttcc cacttccccc accttgcact gcatagatct ttgaagtatt 22020
cccaaggaaa ggaagaagag gctggtaaac agttcttcta tcagcattcg caaagaaggc 22080
tttggcgtgg gaacagccct ccccttgaag aggctcctgc ttctaccatc cttatgtggg 22140
ctgtcaaaga ccatgctatt ttgataaaag cgaaactcaa aataccccaa acaataagta 22200
ggaaataccc aaaaacatta aaagggatta tcactaagaa agtaattttt attttctttc 22260
tagtactttg gagtattttc aaataatgaa tgcatgttac ttttgcaatc ggaaagcaaa 22320
atataaatta taaaaaggaa agaaacacaa tattgggtca aaagtcccca ttcctaattc 22380
acttgaatga ctttgagcag gtcacttagc acctctgaat ctgctagctt agcttcaaaa 22440
taggggatga tcatgctcaa agttccacgt tactgtgaaa attatatgag atgtaaaaat 22500
atgtgaaagc agtttataaa ctataaatag ctatattctg gtatgcagcg tcatggaggt 22560
tgcctgctta taaaaatatg tcagtcaaaa tcataactca caaactaaag ttggccttca 22620
cttaagaaaa taccagtatg aatgtgtgct aattaaattc cttcacagat attttacttc 22680
ttggtaaaca caggctaagg aataatttca aactgagtta aactgacagg ttgtgaaatc 22740
aatttatggg ttatgaccag tagtagggct tttttttttg acagataaga ctaaaataga 22800
ttaggcactt acattactca taagaatgat aaatattgtt ccatgggaaa agatgtatat 22860
aagatgtcaa gaacagcccc aggcacatag cagacaatcc ataaatgtga tgaggatgac 22920
gactagcact ggacattgtt agcatggata acattgcaat caaagagttt caacaactcc 22980
atgagggtta aatgaactaa gagagccaag acgaggaaat aattggagca ggtcactggg 23040
aagcactggg taggggtgaa gactcaacaa agtccttctg actctaccac taatggtggg 23100
gtcctttcaa tgcctgatta ccctcatctg tggaatgaag gtgataacaa cagtacctat 23160
tataaggggt cactgtgatt taatatgtgt aatgaatgta taggtaaatg gccttagcac 23220
agggcctagt atgttagctg ttcttatttt agttatgtgc tggctcactg gataagacct 23280
gtgaaaaccc aaaccttagc ttttcactga ggcagtgtca gcacccttgg acacacagtg 23340
ggagctagat atgtctcctc ccccttcttc ctttagaata atggtttcaa aaagtttaaa 23400
ctatgacccg tagtaagaag aacattttag acagggagcc aaaacacaca aacatatatg 23460
tgtgtgtgcg ttcacatacg aacatgcaaa agcacttgta tgtgcaaaca catacaagtg 23520
aaagaaaact ttcacaaagc acatggaatg caccccaact tttctgtttt actgttttat 23580
tcacttaatg ccagctacca ctcattgaat tgattttctt gtactgtagt taatccatgt 23640
ttctaaaaac tcagctttaa atgggacgtg accaaaataa tgcatgaaca ctcaaaggag 23700
tttcatccct tcattcagat gtgaacattc ctgtgttccc cttgacttgt ctttacacca 23760
tcttaatcaa tgctgtctta cagcaacaat tagtcattca tagcctgtgg tttgtgtcca 23820
tcagtgagtg tgccctgata gacctgaaaa gattgggaga ccatgcccag cttcccatta 23880
tttctgtagc ctgtcctttg ttcccataac agtacagtca tgccttagta tctgtggggg 23940
attggttcta gaactccctg tggataccaa aatccatgga tgctcaagtc ccttatataa 24000
atggtgtagc atttgcatat gacctgtgca catcctcgca tatattttaa gtcatctcta 24060
gattacttat aatacttgat acaatgtaaa tgatatgtaa atagttgtta tactgtattg 24120
tttagggagc aaagacaaga aaaaatatat gttcagtaca gataaaacca tcctttttta 24180
gaaaataact ttttgttcat ggctggctga atccaacaat gcaaaaccca cagttagaga 24240
cagccaactg tatgtgcctt tgctaagtag ataaatttag ttaatgatga aggactggat 24300
gaaagaaaga tttgaggata aaaattagag tttgcttatc tgtaaatcct gaagtaaata 24360
attacagtct ctgagaaaac tgagataact aatacttagt catttacttc acatttcatg 24420
attgcttacc aagattaaac ttgttctgat gaaaatcttc acttttcata ctcatcgttt 24480
tgatgagtga acttaacaat taaaggtggc tgggcacagc agctcatgcc tataatccca 24540
gaattttggg aggctgaggc agaagaattg cttgagccca ggagttcgag accggcctgg 24600
gcaagatagg gagaccccca tatctacaaa aaaaattttt aaaaattagc caagtgtagg 24660
ggcatatgcc tgtagtccca cctactcagg aggctgaggt gggagaattg cttgaatcgg 24720
ggaggttgag gatgcagtga gctatgatca ctccactgaa ctccagtctg agtgacagag 24780
tgaaaaccct gtctcaacaa cagcaacaac agcaaaaaca ttaaaggtaa ttccaggaca 24840
caatagggat gaggcatatt gcctgtttaa ataaactttc ctggccttaa gaaaacaact 24900
ctgctgaata attttatgga attagaatga cattaaaata atgaatctga taaatatgat 24960
tagttaataa aaatacagtt aaaataatac atcccttgga attttaaatc ttactgagaa 25020
ccatttctgc acgcacagta tccaatatac aaacacacat ggaagaaaac ctttccaaag 25080
ctcgtggatt gcaccctcat attttctgtt ctgttatttg agtttttaaa tgctggttac 25140
cacccattat attaatttta ttgtattgta gcttgctcat atttttttaa aactcagctt 25200
caaataaggc atgactaaaa cagtgcatga atggatccct tccaaatgtt gaaagttatt 25260
ccagtttctt tcctttccaa gtaccatgca tatgcagcta accctagcat tggctgacca 25320
gtctaacatc agagtgaaga gaagggaact tccctctaca aggcttacaa aatgataaag 25380
ttacaaagat aagagctaaa atgcaaacgc tttctctaaa gaagttaaat cctagagtca 25440
cgaactggtc catgttgctt cctttagaga aaaaagtagt aaaatcagtg caggttgaaa 25500
cctgcagcag gtacaacagg ggactctatt ctctgtgctg tccttgaagg gactccaaag 25560
atgctgtttc tctgtttgcc tcccagtatt ttaagtgagt tttggattct gcaaatcaat 25620
ctggctagct atggccttgg gtgaatataa gcaccagttt ctatagccgc aagttaaagg 25680
gatttgaatt gttaattctg tcatccctcc cattctaaca tcctgttacc tagaaaatca 25740
tccatacata gagcctgaaa gcttactttt caggatttgg ggaaataaaa aaaaatttgg 25800
tagaatacct catactttca ttaggctact gatatgaact taattattcc tctaactcct 25860
ttgaggctca gaccttctgt cccctgtctt atcactctca tctaatcacc tccaagcaat 25920
gtcccacgtc actgagaatt tagtatccaa tttcctatct tctgctccac tccagctcct 25980
gccttcacct ggcagagtgt ccatgtgggt agttgatcca ataccccaag ttcagagtcc 26040
cttgatctcc tcaatccagt gactgacacc tccactccat ttcagcaact tcacttctgg 26100
ggccacaggc tgagacttga ttatctccag aatttctcca tgtccaaaac tgaaatggga 26160
tagttccctt gaccccttcg tgggactcat gaaggggttg tcttgtttac tcatcctgca 26220
gctctcaact cctcatggga gggagagcac ataggtgagt gggtgcagag ggtgggacaa 26280
gtgcttctgg gtgctggcag gagtagaact ctgtgcggcc ccgcagcagc atctaggtga 26340
gtatctgtga cccctggagc cgcagagggc atgtgttaca gtgtgctctt ttagctttgc 26400
tgtccatgga cagcttaagt gttttaacag ctcagtgtga cagccctctg tatcctgaac 26460
tcttgttcag caaccaggaa ggatcagctt gcacgaacaa attgaagatg gtgaatgtgg 26520
agaactttat tgctgatgaa aatggctctc agtgggatgg ggagctggaa aggggatgga 26580
gtgggaaggt gtctccccct ggagttaggc catccccagc tgaactcttc tccgaggtcc 26640
taccgtcaag ccatccctct gaagtcaagc tgcttttcac cagtgtcaag cttcttctct 26700
tctctccttc tctgccattc cactgccagt ggatcctggg gtttttatgg ggacaggatg 26760
gggggcaggg cgggccaggg tgattttgga aaaggcaaca ttcaggcagc aaaatgggga 26820
tataaagttc tcactttggg ccacgggtcc aggcttgagg gtggagccat tgccagggac 26880
cccacccttt tctacctagt atttccctgc ctccactctc caactctctt atcacaggcc 26940
tcctatccat aaatctttct cactccctta ttttttttag agctcacctt cagacttatg 27000
aaagctcaag tcttttgaat tcctctttcc ctcccagtcc agcaggcttc acttctctcc 27060
ttgctagcct ggaagtgtcc attacaccct tgactgtgtc gcggctgaca catggaaggc 27120
aggtgtctgg tattgttaaa tgaacgttaa caagtgcaac ttctttaaaa agtaagattt 27180
ctttatttga ttgatcagtt ttttttttaa ttaagtatct aatttccatt gcttctggat 27240
ttaggcacac atattgaata taaaattcaa aacttttcat gtatctaaag aaataatttt 27300
aatgaccaca gaattttgct cttaaaacaa atggagcttg caatataatt ataactaaga 27360
cagactacaa gcacctgctg tgttttcttc atgctgccat caggctggtg catagtctga 27420
tgggatggat gcatccctgc tgtaatagga ataagtaaag agggtgggag actgttgttg 27480
gtgtttacaa tcattattca tctctccatt aattttcatc cttgcagttg agataatctt 27540
gccttttaga acgagaaaac aatcttatta cttacaacat tcacaagaca gggaaacact 27600
cctcaacaaa gaaccatttc ctttgggcat ttgataagaa ctgctgacat ctgtgtttat 27660
taagtgatgt gggagaatca gccaaggggc agaggtgtgg gtggtagctt caaaggtaat 27720
ttcctgactg agtgtggttg ggcatgttgg caatcccagt gtttgcaatt ccagtgcttt 27780
gggaggctga ggcaggagca tcacttgagc ccaggagttt gaggctgcag tgagctatga 27840
ttacgccact gcactccagc cagtgtgaca gagagagacc ttgtctcaaa aaaaaaaaaa 27900
aaaaggattt tgctttcctg gtctgtttga atgtggtaag cacaagaact tatttgtact 27960
ggtggtcacc cacttgggtc ctttcagtaa acttctggcc atggcccagc ctgagcacct 28020
gcttccttta cagagcaaag ccagagcctt ctggtttgta atttttgcag ccacataaca 28080
ctgatccatt ttagctgtct catgaaactg ctttccaaga tgaagtgaga ctatccgaaa 28140
ccaaaatcta aaaagaatga ctacagaacg aatggcagtc caaggtcatt tggcagggta 28200
gaattataag gaaaggtgat ggccagctac tgtgaagtct taagcccact cataggtacg 28260
aatctcttac caggactgaa gtttgtgttt ccttaacttt gctgcttgat ggtcttctgc 28320
aaaggatctc atttctccag ctttcactgg gaagtggcat ggggttttcg aaagtgcttt 28380
aaactctaag aactggtttc ctctcaggct tagccaccac tagctgtctg actttggaca 28440
agttactcaa ccactctgag catcagcttt ctttttttaa aatggtgaca gtgatatggc 28500
taccttagag agttgtggag attaagtgag ataatatatg aacaaaatat ataaatatgt 28560
ctatattaga catatatgat ctgttaaata tatttatata tatatataga aagaatatta 28620
gagaactgac atatttatct tagacctact atgttctagg gcccagttct gctcatgtat 28680
tatctctcta tatatatagt attgagagag agaaagagag agagagctag agaaatagaa 28740
accagtagga tatatcctat ctgaagaact ctaatacagg tatttcactg aacaaaatgg 28800
cttctcctta aaatgtattc caacaccaat ttcccctcct tgcttctcat acatattaat 28860
tcacagaaat tatattggtc ataaggcggt aggactatga ataatttttt tcttctgcat 28920
ttgccaaata ttctgtaatg ctagttttat ttttataaca aatttttaaa gtccctgtaa 28980
accctataaa attaactact aatcctaaac ttgaactgta atacatattt ttgttaaata 29040
aaaaaaaaga tatcctaaat ttgagaatct aagctaaatt cacaaaatga ctaaatgtat 29100
cttcttactc ttaaacgata agaaaccagg aatgcagtgg tttaacccaa acagcctagg 29160
tattcttcac tcatttctct tgtcctcagt atatgtttcc cccaggccaa gaaacattta 29220
tggctaggaa ttcatgactg ctcatctgta agacactctc caagtactct gtcatgattt 29280
ggaaccagag ctttggcctt ccattccaga tgacctccag gttctaagct ttatctgaag 29340
tactccacac tacattgcac accagcatga tgggaaggaa tgactcaagg gaactgggca 29400
ttcagaccac tcaccaacta cacacagcct ttcacacagc agtagacaac ctttgtagga 29460
agacctctgg ccaaatattt atatatcagg agtatatcaa tgtatggtca cagttacctc 29520
ctttttaatc tcaagaaata aagtcattac ttctgaaaag aacaccagta atggaactac 29580
acgggcatgc caagatgaaa attagtctct ctctctctct ccaccccatt cttttggcca 29640
ctgcatttta aacaggccat agtgatagaa ccagaaggta gctgtgtttc gattattttt 29700
gttttcttat ttttaaacat tgctttatag aagcctgttc tataatgtaa gaagtgaata 29760
ggaatcctag tatctggaat gatgctgagc catggagatg gccagattcc atgtgaagag 29820
aagaggctgg cagaaagcta agtatttgct cgctttccct accctgtatt catggagcag 29880
ccattacaaa agatgacgca ccagaccatg ggcctggtct tgctaggtgg tggagatcca 29940
aaatgctggg cactcactgg agctccgtga agatctgttg aatgagtgac taaatgtgtg 30000
gtgaactgtc cctacctaca gggttccctg ctttgcccag cctatcaggc ttgggtacaa 30060
gcagatagcc tggattggat ttcatgaggg ctttgtgccc actctgtata ctaccggaag 30120
atgcataatc tctttagact tcctgggtca tcactgaaag gccagaggca agaatttgag 30180
cagtaaattg aagcatgtgc agagtgacat agtctgggca tagataacag tacctgaccc 30240
ctcaaattcc ctaaagttga catgacagct tgtgcatcat ggtttggtac agagatgact 30300
gaagagacag tggttgctcc tctggttgct ggacaccttt tgaatgggct gtgatgactg 30360
gagagcgtgc tctgtcctgg tcacctgaag gtgctcctag actagcagaa gcagagacag 30420
cagccaggcc tgactggaga gctggaggga ctgggtgcag gggacttcta ggccagcagt 30480
ttctgtgctg tgctcggagg ggctctggtg gcaggaacct agtcctgggt tccagctcta 30540
ccagggtgca gcctgaggaa ctcagctttc ctatttattg gcttgccttg ccacaaagga 30600
tttcttgtgt gttaaagata ctaatagctt aaacaaacaa acaaacaaac aaacaaacta 30660
ttgatttaaa ccactaatca aatctttggt tttacaaatt tagaaaataa aattaggccc 30720
cagggactaa atgtctctcc tatggtccag aaaccagact gaaactgacc tagtagcaaa 30780
acccattctc ctgtatccca actcccagtt caggttcctc ttcattagct cccacccagc 30840
tgcatagttg aggacactga attgagggga cctggggtaa agggctggca ctcaggcatc 30900
gctggcttct acatgattcc accacatgga aagcatcata atcatgtatg atccctagaa 30960
aagtgtttgc taaactcctt aagttgtcat tcaacaaaca tttatctagc acctaccaca 31020
tgctaggcaa attctagaga cacagctgcc cctgccctca tggaactaaa tgtcttgaaa 31080
gagatactgg ccctgatcag attttttaaa aatacataat tatgcaaata aatcatatta 31140
tatataatta caaagcatgg tatgaaagga aaagtacaag atgctgtgaa agcgtataaa 31200
aggaggactt ggccaagtct agaggatcat cagtgagtgt tccctgatgg actaacatta 31260
gggccaagat cagaaagatg tccaggagtt aacatggtca ataatcactg ttaatcttta 31320
ttaagcattt cccttgagtc gagtcaggct tcacttgctg cattaacttc tttaatctcc 31380
acaataactc tgtaaggtag gtgcttttgc tttttttcca ctttacagat gaataaacaa 31440
gaagtaaaaa gatgtaacta attcacatgt agccctctgg cttcagaagc tctttactcc 31500
ataaggttcc tgggcacgtg ggtgtggtct aggaatggga agaggctaag aatgctgaag 31560
ggcagactga atgttccaga tgaagctggc gtgataagaa agggccacat catgtgggtc 31620
ttataggtta tgatattcat gttaaaactg ttaatgtttt tgaaatatct aaaaccagga 31680
agtgcatgat catatttatg ttttagaaag atcactcagg ctgctgtgtg gagactggag 31740
tagaaagaag catagaagag acctacagag actaggaagc tgtcacagta acctaggcac 31800
aagatagtag tggtcaagac tagcttgctt gaggaagtgt agagaagtag acagataaaa 31860
ggagcatcca aggagtgtac acttctaagt aatttaaaag aaagagtcaa atcccttaga 31920
agcaatcatt tgaaataacc taattaccaa tcctttttat ccagtcctct tatgttgtcc 31980
caattcagga attcttaaac cagttcttgt gttaaaatca tagcatggac tgccttacaa 32040
cattctatac ttcagtccat tgaaaaccac atcaggccag gtgcgtggct catgcctata 32100
agcccagcac ttttggaggc caaggcgaga ggactgcttg agcccaggaa ttggagacca 32160
ctgaactcta gcctgggcaa cagaatgaga ccctgttgaa agagagaaaa ggaagaaaga 32220
aagaaaaaag aggaaggaag gagggaaaaa aggaaaaaag ggagggaggg aaggtggaag 32280
aaagaaaaaa aaagaaagag aaagaaaagg ctgcttcctg gcacttctat taccatgtga 32340
cacgagaagg gcaattgtcc agacccacac ttaggttcac cacagtcctt ggttcccagg 32400
acagagtcct ttgaggttcc ttcaagcacc cacaccatgc tggaaaatgc aggctttata 32460
aactccagat gtcagtccac aactccacag ccccattggg tttggttcct ggatgaattc 32520
tcggctagat gactggaata gagttctggt gaacatcttc cttcctgcca agatgacaag 32580
tcaggtttct aaagacagat gtgtagctct ttgtggaaaa aatattctgg gcacgaacat 32640
cttcggctga acggcatgga atggtgtgga aaatgtgaag ccctggactg gtttctgtgt 32700
cagtggtttc ctgggactgc cgagttgaag gggagcagaa agggctcagc tgtggaggag 32760
gcagagatta gccaaactag accagctatc gccccagctg catggatagc atcttccttc 32820
ttgttcctga ttaagctcat ttaatatgcc aatgacaaaa ggatcatgaa ggtctttcca 32880
aatgaatgaa tgttttctgg agctgagatt gctggctcct atttgcatag tagtgcatac 32940
tgaggcatct gaccggatta ctggattcct aacttttaca ttctttttaa gctcagatat 33000
ttctacggaa ggcagaaaga taaaggccaa aaaaagatca ggcttgccta gctcttgggt 33060
ttccaggaaa ctgacagact cctgatatct gcaactagtg ctttctctcc ctgggcgggg 33120
cgttctccgc caccttgtgg acatttgggg aactgcaccg ccccagcatc tatgatcaga 33180
ggaactgaca gaggctgcca taactacctg gtcccctgaa ataaaaacca ggcaccaggt 33240
ttgcccaaag gaggaccaac acatagcaga atgaggctga gtaggatgtt aaccttccat 33300
gctggtcttc cattatcacg ttatctgcca ctgtcacaga caacatctgc atccgctaca 33360
gatggagatg gaacaaaggc tttcactgga tctcaagttg gatgcttctg taaaagtaaa 33420
gttccaagac ctttttcttt tttttcccca gtgacagcaa gaaaattgga tgttactatt 33480
ggcaacaact atttcctgct tgtgaagcgt tggtatttgc agtgcttctg ccctgtggtt 33540
attgagctga aaggggggct gctcttgaca taggagcatg gcaggattgg attcttcata 33600
acatgctacc cagcttgtct tatgtgaaaa gagaagccca gaaagagctt gggacacaaa 33660
gatgcaaatc ctgggtttgc atttgatttg ttaagtgtcc tttcacaacc agcttgcaga 33720
ctttagggga aaaaaaatac atggtaggct tgtgggtagt ggagaaaaca taagctccag 33780
tacctcacag cctgggtttt cagtctgcct cctcccaacc ctcatcacat accagctgtg 33840
tgaccgtggg tactctgtct atgacaatca attttcttgt ctgcagagta cagataatag 33900
aggacctgcc ccatgggaag cttggaggac taaatgagat taagcccagg gcctggctca 33960
tggccgattc tcgtaacagg tccatgtgtg taatcaacat gtgctgaagt tctctggacc 34020
ctctgtaaat atctaatagg aacgactagc catgggtgtg atgagggaga tgctagaaac 34080
agtaatgggg tcaaatgaat ctgtcccata gtctctaatt tgaaacagct gaactgtgcc 34140
attgtctcct tggaggtaag atataagcca cagagtgcct gaagcaggtc tacagttaag 34200
gaacttaaat gtgctaagtg attaaaggct gggctgccta ctccctgtat ggcgacactc 34260
tccaccctcc tgcatactct ccactactcc aggcagtggt gggcctccta gcctctgggt 34320
gatgagcagc tcccacatct cccctttgag acagaggcct gcagaacctg accagtgtga 34380
gggctggctg gggtcaggag tgtgcctgct ggttggccct caaggtgaat gtggctcttc 34440
taaaagacac ctgaagggat gcagaattta gcagcaaacg ccaagaacag ctctttgtca 34500
tctctgcatc ttctactaaa gattttaagc tccgaagtat atgtctgggg tttctagaga 34560
ttgctattcc cttataaggg aaaaaataca cccttattgt atatttattg agcacctaca 34620
atgtgccagg caccccagct agggagttta gatatttaat cctctcagct gccctgtaaa 34680
atagggattg tttaaaatgg atttcgccca ccttataggc aaggaaactg aactacagta 34740
tggtaaagtg gcttggctaa tccacctggc tagtgagaat ggagccacag tgaacctgga 34800
ggatactctt cacggcgttc catctttgtg tgccccacag cacaagcaca atgcccgggt 34860
cttaggaaga gcttggtaaa tgactgctga gttgagaatg gtaggattca tgtaccaaaa 34920
gggcatttgg aaatgggcaa atgcccagat agggctctga aagactagct atgtagagca 34980
acagatggca aatcatcaat ttaaaactct gtcatccagc cagtagcccg ctggcctaca 35040
gccagccatg tatgtacaag ggctgacttg atcagctgtg gctcttttat ttcaggcaat 35100
ccaccttcca aagaggtgcc tgctgggaag tacccattca tagtcacttc tgatgatggg 35160
cggaaggttc ctgtagtcca ggcctatgct tttggcaaat acctaggcta tctgaagatc 35220
gagtttgatg aaagaggaaa cgtcatctct tcccatggaa atcccattct tctaaacagc 35280
agcattcctg aaggtaagtg aagttcaggg gaatgttcca ccaatctaaa atttagatgg 35340
ctggatattt tgctccttcc catttttttc ctttaatgtt tcaggaaaag actataatac 35400
tgttgaagaa tttagtttct tccttgagtt tgccctcttc atggatagat ttaaaacaca 35460
cccaaatgcc aaaagcccca cttgcttctt caagtcttct cagcatcaga ttccatacat 35520
cactgcttcc tctgcctttt aaaatcttga tcctgacatg ccaaagccac tctcagcttg 35580
tgtttacagg agagataact gaactaaagg ggttctaatt gggcagagcg gggctcaagt 35640
cctggccccc caactagagg cagtgtgcct taggcaagtt gcccaatctc atgggcctca 35700
gtttctttat caaatgaggg agtaccttat ttatatgtct gagtctattg cgggatctgg 35760
ccagcagccc gcaatgcaat ggggctctct ctttgttcct aggcagattg gcaggttgag 35820
aaataataga cacacacaag atagtgaaag ctgggtccag gggggtcact gccttctggt 35880
cctgcggtgc caacaatgca ctggatatac cagcatttat tattaagttt agtgagggca 35940
ggggtacgtt agtgagggat ttagggtcat ttgattatga ggtgagatgg tcacatgggg 36000
atgaagtaat tctttaacat aacatttgta tgtagaagta cagtacattt gtatgtagaa 36060
gtacagtatg agataagaat ttacaatata gtgtgtgcgt cagtaatttc taacagagcc 36120
ttagaacaga aacacagtct ttccataacc tatgattagc aagatattaa tcagcagtaa 36180
caattgcaac aaaaactggt tacaaataat ccatggaaac aggacatgaa gctagataac 36240
cagttagacc agaaattctc agaggggagt acgccttaac cctaaagagg cctagaagag 36300
cggcggcaag atgagcgtgt ttatagccct atcttatcca tatggacagg ccccccccat 36360
gcgtccgttt ataggctccc cacaagggtc acattccatt cccagagcta tgaacatctg 36420
cttttctggg ataggaatct tggtgatggg aaacctccct gactgcacgt ccatttgtag 36480
gctctctgca gggggaagca catcacgtgc tgttggctca ttgtggcagt ccaacctggc 36540
attgtcttta cacaatcctg catgcaattt tgtatttaca ataatcagga gcatttcatc 36600
ttttattccg tagcaatagt ttcagggcgt ctccctacat gagtctcaga gtaaaaatac 36660
atttttatat cattgatttt catatgtgta cagaacacac tcctgtgctt gtctctcact 36720
ttcccttttc tgttgactat gatcatgtca ggaagatgag gtgcacagca cccaggtgag 36780
cctgtagtca caacacagcc ctctcagtag ccagatggga aaggggccca ggaccacact 36840
gacagcgata gcccagctgt acctcagcag gccttttata actgcccctg ggcctttcag 36900
agctcctacc gccaccacta atgctgtccc aggtgggcca atgggtagct catacagatg 36960
tgatgagctg gtagatgttc agaatattct taaattaaga tgcattccag gtcttattac 37020
caggaaacaa aatagaagtt cctaaggaaa tttcatggtc aaaaccaaaa tgcacacaaa 37080
tatggtaaac aaatatgttc tcacagcaag taagatagat gatgtttcat tccttttcca 37140
gaatttagcc cagtgtgaga tttaattgta gggtaccttc ttttctttct tctagatcca 37200
agcataaaag cagacattaa caaatggagg ataaaattgg ataattattc tacccaggaa 37260
ttagggaaaa caattgtcta tctggatggc tcctctcaat catgccgctt tagagaatgc 37320
aacatgggca acctgatttg tgatgcaatg gtaagtcatc agcaggagtg gacatatgct 37380
agggaggaag gaaaggaaga gggaagagga aggaaggatg cgagaaggga tagatcgata 37440
gctacagaat gagggatttc aaaacatttt tagccagggt ggtggcatgt gcctatagtc 37500
ccagctactc gggaggctta ggtgggaaga tggcttgagc ccaggaggct gcagtgagct 37560
ataattgcac cagtgcactc cagcctgggc aaaagagcga gactctgtct ctaaaattaa 37620
ataaatatga aaaataaata aaacattagt gcctaaacaa aagcctgatg tagcttcata 37680
ctcagtacct tcttaaggtg ctggaaggag atcaggtaac caccaacata ctgctttaca 37740
agaactaaaa cctttatttt aaggtgttct tagatgtctg aattttggat gttgctttcc 37800
ttttgtaaga tgatgcaaat gcacaaactc tgctaccctg gagatatcaa cataatagat 37860
aaaagcatat ctgtttggcc agactgtttt tcactcatct gttcacttga ctccagagaa 37920
ccctggcctt gtgtgacagg aacctctttc acaggcacag aaagacaaca cacagtttta 37980
atggtaataa caagtcctat ggctgagcaa gaccaggtat gaagaacaag cccgtctact 38040
tgtcaccagg cggaatggcc atggtagctc tctgggggcc ttactgagcc aaggtctcct 38100
atgtcggact actacttcca tgggctcaca gcctgcaccc agggaagtca tttctgtcag 38160
tgcttggcat gcattgccca gtgcccagga tacaactgca ctcgataaat gtttgccgtg 38220
taaatgacaa caatcatgct catgccaaga ccagactcat gatgtctcct cagcatcatg 38280
cagtcaatgc cagagaagta gttaaggaga agctggtata taaatgttaa gagaagtctg 38340
gagggaaaca acccaggctg ttggcagtgc ttatccactc tgttaaagag gctaggacaa 38400
aggagagctt ttgcttttta tttatttatt tatttaatta attaatttat ttatttattg 38460
agactgagtc ttgctctgtc gcccaggctg gagtgcagtg gcatgatctc agctcactgc 38520
aacctccgtc ccttgggttc aagcgattct cctgcttcag cctcccaagt agctgggatt 38580
agaggtgtcc actaccacac ccagctaatt tttgtatctt tagtagagac ggggtttcac 38640
catgttggct agactggtct cgaactcctg accttaggtg atccacctgc cacagcctcc 38700
caaagtgctg ggatttacag gtgtgagcca ctatgcctgg cctttttttt tttttttttt 38760
cggctttttg tctcctaaat gagaatatat tcattttatt tcttataact aaacatatat 38820
tttgtaaaat aaaagtaaga atggatgtga ttctccccag cttcaaaagg ctgtctctgt 38880
ctttccaaaa ttgccaactt gggaatcagc tgcagacata gacgtggagg ccaggttctc 38940
tttggcatga ggtctggtgt tgagcaggct ctagggccaa gccctcagaa tcaggcagag 39000
ggcatggcct gccctggaga agcctgtgca ggagtgactc tgcaggccac cttgcatggg 39060
aggcccagtc ccgacaccag ggctacaacc gagagaccat tttgctaagg ccacctggcc 39120
tgacagtatg ggagttttag gagggggaaa taagagtttg acatggactg gtaacagtat 39180
atttttactt ctgggagcat aaaattagtt ctaattatag ttgagagatc cctgggaaca 39240
cagcagcagg tgaaaagcag ctttatttcc tcactagagc taaccagatt cctggagctg 39300
atcctaagga agaagagcca gagtaactag tgtaaatctg tgctactgtt gcagattaac 39360
aacaacctga gacacacgga tgaaatgttc tggaaccacg tatccatgtg cattttaaat 39420
ggaggtggta tccggtcgcc cattgatgaa cgcaacaatg gtatgctccc aggcccagct 39480
cctcagtgtg tcatgttctc tctctgatct ccttttctgt tatggttctt gccctgaaca 39540
cacccatgtg cagcctcagt tcatctctgc tgccagccat gtaccagaat gtcagtcctt 39600
taaattaggg ttagccatta ttaaagtcta aaacctagcc tgaaggacac acacaagctt 39660
gtgtgttctt atatgtattc acacacagaa gctgcaggta aggttagtgg aactcaaact 39720
tggtctgaat ttctgactta caaagctata ctagttgttt ctgatttatg caaaataaca 39780
acagtcacca ccttggcatc tgaaaagaga caaaatcacc aatcttctgt cctgaaaaag 39840
acataccact ggattaaaat gttaccttag tatatttaca gatatccaag attatatcat 39900
atggcaaaac aaatcaacta cacaaatact caagacactc ttgtagccct ccctgattta 39960
ctgtgaaaac caatcagtat cagtataagg ccccaaacgc atcccattgt cagacaacag 40020
ggacaggttg caggggcaca ttgtttgtct ggaggaacgg aacaatgcaa aacaaactac 40080
tggagatcca gccccaagct agcagatcta agactgagca tggtgaactt ctgaggcaca 40140
gccatttgtg agctcccgcc atcttgccca gtggtggagt gcagggccag aggcctccta 40200
aactcattcc cttcctcagt gtcaggaacc acttagacat agctaatgat gctctataag 40260
cttcccatgt ccccctcatt tcttccccca gcgctgaggg aaactggtca tggtgtaagc 40320
attttctcaa gctattttcc ttcttgcctc atctgtgact accctcaggc acaattacct 40380
gggagaacct ggctgctgta ttgccctttg gaggcacatt tgacctagtc cagttaaaag 40440
gttccaccct gaagaaggcc tttgagcata gcgtgcaccg ctacggccag tccactggag 40500
agttcctgca ggtgggcggt aagtcaccca tcctgtaggg ctggcccatc caaagtgaca 40560
tggcatttcc tgctggttgg ctcagcttcc ccttcaccaa aattcctgtg gtcaaactgg 40620
tttggatttt tcccgaccca acaccaatac cttacccttt aatggctgtg gactctcaga 40680
gccactaaga ttaggttggg gtcaaagccc caaagcccca aagccctggc aataagcagt 40740
atttgccaag tctgatttgg acatcagatg ggccctctaa gcaggtcacc tgctttgagg 40800
tctagtttct aataaaatct tatttaagca ttcaattatt ttttttttaa atcctctcct 40860
ctgttctact catcgttgcc tggaaccatg ccaagattta aaggaatgct gtccaatctg 40920
tattaaccag gatgattcct ggaaggagca gcctactgca cagacatacc tgaacacctg 40980
gttctagtcc cactgtgacc tggggcaagt tctcaacccc tctgggcaca gggagaaaac 41040
ctttcctgtt tatctcatgg actgcagtga aagttgaatg gaaataaatg cttcctaaat 41100
tgccaagggc ttctgaaaca cttttgttaa taattccaca gtccagtgaa ttaatccttt 41160
aaaagcaatc ttattttacc tggaaatcat aatctggcct cattttttta accttaggtt 41220
gtggtcccgc ttgacagata gaattgcttt cctgagtttt cctctgcaga acacattgct 41280
gttgagactg aagggctaca gaaggcagtg gggccctgtc ctaatgccta tggggcaacc 41340
caaacatcct ccattttccc agaagggagt tgagggaaaa gcagttttct gatgctctag 41400
caaagcactg catagctcta tgccttgtgg aagtcaccaa atcccaaatg attaccattt 41460
aatggtcttc aaaattgtaa aatgctctct gcctctgatc tgtttctcac tttctaaggc 41520
cattgaatca atttagtaga ccaattcagc attttaaggg aaaaacagaa aaagatgtgg 41580
ttttgtccat gtagagtcac cctttgccta ttctggtcca gagctggttt tccttccagt 41640
ggttctcata tgtcctgtaa ggactcaagg ataaagggtg gctgcaatgt cacaattctc 41700
ttgacaaaat ttcactcagg tttaaaccaa aggaaaaacc acagccccat gccccaattg 41760
tagagtttgg ccaaaattcc ttaggaaaca gaaatctccc tttggatctg gtgaaaacag 41820
attcatttct tttctaggaa tccatgtggt gtatgatctt tcccgaaaac ctggagacag 41880
agtagtcaaa ttagatgttc tttgcaccaa gtgtcgagtg cccagttatg accctctcaa 41940
aatggacgag gtatataagg tgatcctccc aaacttcctg gccaatggtg gagatgggtt 42000
ccagatgata aaagatgaat tattaagaca tgactctggt aagcatgact gtctcttcct 42060
ttctctaaag aacaacaaaa ttgggccaag aaggggagct actagtggtg ccaggagggc 42120
aaagtgactc cctgtattgc tgactccctc ctgctttgct tgttggaaag cagcagcaca 42180
gcacagcaga gaggaatatg taccctgtca atccccgagc acacacatgc cttcatccaa 42240
ccccagagac aaaacatttg taaatcaaaa atgttttaaa attcttttaa ctacatctaa 42300
gcagctattt aaattgtgaa atataagtga gttttgcatt aacagaaata tcacacctct 42360
ggaattttag gatatggaaa accatcacat taaatcattt aaattttctt tcttgaagaa 42420
gtttctgtgt gaatcagtca ggtagaagtg gatggggtga gcgccgccct ttcagcacag 42480
tgggttggtg gtacaaatca taggaataac catacttgat gtcagttggt acttctggca 42540
gcattgaggg acattatatg ccagcctcac ttttagaaat tctgttgaga gtcacattat 42600
gtgcatacag accactgaca taggacacac ttccccttcc tcttcctagg gcagagacca 42660
gcatgcttac aagacacaca agcagcttct tggcatctct gggtagttcc accaggatct 42720
tcctcagtct tatcatgctg gacctgtcat tttagctact tattccaagg agcgtggagg 42780
gcaggaccag accaaaggag gcagcccttc ctgggaggaa aggtgctggc ttcttctgtc 42840
ccttccgtga cagaagatgg aagggcaggc gtaaaagccc catgagacaa agaggctgat 42900
agccatcagc tgtccaccac tcaagcctag actgaaggat gttagagaga ggagctcatg 42960
cagaaggcta aggaagccct ggacctaagc gttctcaaca ctggttatac attagaatca 43020
cttcaggagc attttaaatt aaataccaat ggctcggccg cagctcttaa gagattctaa 43080
gttaactgga ttggggctaa actctttttc agtgattcta gtgtacaccc agaggtgaaa 43140
accactgaag agagaagggt tcttaagtac tggagtgata tttttaaaat aaaattctgt 43200
tgggttccca acagaacatg ggttcccaac aaacatgggt ttgttgtagc tataggcaat 43260
tagcatttat cctctcctct ctttggtaaa ggaaatcccc caagtcccat catcaattac 43320
atgaaaataa ttattgggca ggcaaacaaa aggaagccag ttttatctct caactaattg 43380
ctcttgaccc agttccctaa tgcccttgtt ctggagtggt tcaaaggcta tgatgtcaat 43440
gacctctttc tagagtcatc acagagctga ggcaactcta cttcccccaa ttacccaata 43500
ttgaacaaca gacacatttt acatctggag catgaaaaag caatcaaatt ctttgtactt 43560
atgatcacta gcgttcttgt cttctgtgac actttacccc tgcttatgaa attgcttccc 43620
ttttataatt acaaaggact accttactgt tgattgataa ttaccttttc tgtagttaaa 43680
ataatgtata tgtttttcta ggtgaccaag atatcaacgt ggtttctaca tatatctcca 43740
aaatgaaagt aatttatcca gcagttgaag gtcggatcaa gttttccaca ggaagtcact 43800
gccatggaag cttttcttta atatttcttt cactttgggc agtgatcttt gttttatacc 43860
aatagccaaa aattctcctt gcctttaatg tgtgaaactg cattttttca agtgagattc 43920
aaatctgcct tttaggacct ggctttgtga cagcaaaaac catctttaca ggctcctaga 43980
agctgaaggt tagagcatta taaaatgaag agacagacat gattactcag ggtcagcaac 44040
ctagtgagtt agaaaaaaaa ttaacatagg gccctataag gagaaagcca actatgttaa 44100
gtttacgtgt ccaaatttta atgaaatttt actaacaatt ttaaaccata tttttcttct 44160
tcatatccat ttctaatcca tcaaacagct tatgtttaca taaaatttta tcattcacaa 44220
ggaagtttta agcacactgt ctcatttgat atccacaact tatttttggt aggaaagaga 44280
gatgtttttc ccacctgtca gatgaaaaaa ctgaagctca aaaagggttg acttgaccat 44340
acagctaatg ctgacagatc caagacctag acctaggtct tttgaactca agtccagcat 44400
tctcaactat atcaagttac tgttcagaat acttaatatc tcctctcttc ataattatca 44460
atagccccaa gctcatggat gacaaatctc tgctttattt cttgtctcta ttttttcact 44520
ttatagctcc tgttataata gcaagtttaa tggtataaac acaggatacc atcctctctt 44580
gcaacaccca tgtgcctttg atgagtcagg tagcaagctg tagtagataa tgagaaaggc 44640
cagaggctgc aaaagacagt caaaggacac gagagaaagg aaggggaaga acaggactcc 44700
aggactgttt tatattatag aaaagcaaga gctaaagagc atttacacat gttaaacaga 44760
tacttgttaa gcatagtgcc tgacacacgg cattagctgt tattttatga gattccatca 44820
gctctgcctc tgtcctcttt cttctaacat gaaggtatca tgagaagaga accttctaac 44880
ataagctgta attctaaacc tgcacttgtc cctctccagc aagaggctag cactgaattc 44940
attctactca tactacacac ccagttatgg aatgtccaga gttctcgaag aaaataaatg 45000
actttaggaa gaggtataca ttttttaagt cgctctgcct ccaaatctga acagtcactg 45060
taaatcattc ttaagcccag atatgagaac ttctgctgga aagtgggacc ctctgagtgg 45120
gtggtcagaa aatacccatg ctgatgaaat gacctatgcc caaagaacaa atacttaacg 45180
tgggagtgga accacatgag cctgctcagc tctgcataag taattcaaga aatgggaggc 45240
ttcaccttaa aaacagtgtg caaatggcag ctagaggttt tgataggaag tatgtttgtt 45300
tcttagtgtt tacaaatatt aagtactctt gatacaaaat atacttttaa acttcataac 45360
ctttttataa aagttgttgc agcaaaataa tagcctcggt tctatgcata tatggattag 45420
ctataaaaaa tgtcaataag attgtacaag gaaaattaga gaaagtcaca tttagggttt 45480
attttttaca cttggccagt aaaatagggt aaatcctatt agaatttttt aaagaacttt 45540
ttttaagttt cctaaatctg tgtgtgtatt gtgaagtggt ataagaaatg actttgaacc 45600
actttgcaat tgtagattcc caacaataaa attgaagata agctctttgg tctttgatag 45660
ttatttcttc tctcattcaa acacctgcca actaaagaca atatgactga aggaaaactc 45720
tgacagattt tcactggtat catttctctt gatcataagc cacttaccat ctgataaccc 45780
tccatagccc ctcacacccc taaaatgaga agaaaaatgc atccaatttt taagtaggag 45840
agaaaagtga gtggtatttt aaggaagact actaaatttg gtggctttga tatacagaat 45900
ctgatataaa aatggttttg aggccagtat aacttcgtat aatgtatgct atacgaagtt 45960
ataggtctga agaggagttt acgtccagcc aagctagcgg ctccggtgcc cgtcagtggg 46020
cagagcgcac atcgcccaca gtccccgaga agttgggggg aggggtcggc aattgaaccg 46080
gtgcctagag aaggtggcgc ggggtaaact gggaaagtga tgtcgtgtac tggctccgcc 46140
tttttcccga gggtggggga gaaccgtata taagtgcagt agtcgccgtg aacgttcttt 46200
ttcgcaacgg gtttgccgcc agaacacagg taagtgccgt gtgtggttcc cgcgggcctg 46260
gcctctttac gggttatggc ccttgcgtgc cttgaattac ttccacctgg ctccagtacg 46320
tgattcttga tcccgagctg gagccagggg cgggccttgc gctttaggag ccccttcgcc 46380
tcgtgcttga gttgaggcct ggcctgggcg ctggggccgc cgcgtgcgaa tctggtggca 46440
ccttcgcgcc tgtctcgctg ctttcgataa gtctctagcc atttaaaatt tttgatgacc 46500
tgctgcgacg ctttttttct ggcaagatag tcttgtaaat gcgggccagg atctgcacac 46560
tggtatttcg gtttttgggg ccgcgggcgg cgacggggcc cgtgcgtccc agcgcacatg 46620
ttcggcgagg cggggcctgc gagcgcggcc accgagaatc ggacgggggt agtctcaagc 46680
tggccggcct gctctggtgc ctggcctcgc gccgccgtgt atcgccccgc cctgggcggc 46740
aaggctggcc cggtcggcac cagttgcgtg agcggaaaga tggccgcttc ccggccctgc 46800
tccagggggc tcaaaatgga ggacgcggcg ctcgggagag cgggcgggtg agtcacccac 46860
acaaaggaaa ggggcctttc cgtcctcagc cgtcgcttca tgtgactcca cggagtaccg 46920
ggcgccgtcc aggcacctcg attagttctg gagcttttgg agtacgtcgt ctttaggttg 46980
gggggagggg ttttatgcga tggagtttcc ccacactgag tgggtggaga ctgaagttag 47040
gccagcttgg cacttgatgt aattctcctt ggaatttgcc ctttttgagt ttggatcttg 47100
gttcattctc aagcctcaga cagtggttca aagttttttt cttccatttc aggtgtcgtg 47160
agcggccgga agcttcacga ggccgccacc cccggggttg acaattaatc atcggcatag 47220
tatatcggca tagtataata cgacaaggtg aggaactaaa ccatgggatc ggccattgaa 47280
caagatggat tgcacgcagg ttctccggcc gcttgggtgg agaggctatt cggctatgac 47340
tgggcacaac agacaatcgg ctgctctgat gccgccgtgt tccggctgtc agcgcagggg 47400
cgcccggttc tttttgtcaa gaccgacctg tccggtgccc tgaatgaact gcaggacgag 47460
gcagcgcggc tatcgtggct ggccacgacg ggcgttcctt gcgcagctgt gctcgacgtt 47520
gtcactgaag cgggaaggga ctggctgcta ttgggcgaag tgccggggca ggatctcctg 47580
tcatctcacc ttgctcctgc cgagaaagta tccatcatgg ctgatgcaat gcggcggctg 47640
catacgcttg atccggctac ctgcccattc gaccaccaag cgaaacatcg catcgagcga 47700
gcacgtactc ggatggaagc cggtcttgtc gatcaggatg atctggacga agagcatcag 47760
gggctcgcgc cagccgaact gttcgccagg ctcaaggcgc gcatgcccga cggcgatgat 47820
ctcgtcgtga cccatggcga tgcctgcttg ccgaatatca tggtggaaaa tggccgcttt 47880
tctggattca tcgactgtgg ccggctgggt gtggcggacc gctatcagga catagcgttg 47940
gctacccgtg atattgctga agagcttggc ggcgaatggg ctgaccgctt cctcgtgctt 48000
tacggtatcg ccgctcccga ttcgcagcgc atcgccttct atcgccttct tgacgagttc 48060
ttctgagggg atcaattctc tagagctcgc tgatcagcct cgactgtgcc ttctagttgc 48120
cagccatctg ttgtttgccc ctcccccgtg ccttccttga ccctggaagg tgccactccc 48180
actgtccttt cctaataaaa tgaggaaatt gcatcgcatt gtctgagtag gtgtcattct 48240
attctggggg gtggggtggg gcaggacagc aagggggagg attgggaaga caatagcagg 48300
catgctgggg atgcggtggg ctctatggct tctgaggcgg aaagaaccag ctggggctcg 48360
actagagctt gcggaaccct tc 48382
<210> 19
<211> 3582
<212> DNA/RNA
<213> Artificial sequence (Artificial Sequence)
<400> 19
agtttagtag aggccccggt ggccggtgct cactgggctc agggccagtc cacccgctca 60
acgcgctcta gccagccatg tgtccccgag ccgcgcgggc gcccgcgacg ctactcctcg 120
ccctgggcgc ggtgctgtgg cctgcggctg gcgcctggga gcttacgatt ttgcacacca 180
acgacgtgca cagccggctg gagcagacca gcgaggactc cagcaagtgc gtcaacgcca 240
gccgctgcat gggtggcgtg gctcggctct tcaccaaggt tcagcagatc cgccgcgccg 300
aacccaacgt gctgctgctg gacgccggcg accagtacca gggcactatc tggttcaccg 360
tgtacaaggg cgccgaggtg gcgcacttca tgaacgccct gcgctacgat gccatggcac 420
tgggaaatca tgaatttgat aatggtgtgg aaggactgat cgagccactc ctcaaagagg 480
ccaaatttcc aattctgagt gcaaacatta aagcaaaggg gccactagca tctcaaatat 540
caggacttta tttgccatat aaagttcttc ctgttggtga tgaagttgtg ggaatcgttg 600
gatacacttc caaagaaacc ccttttctct caaatccagg gacaaattta gtgtttgaag 660
atgaaatcac tgcattacaa cctgaagtag ataagttaaa aactctaaat gtgaacaaaa 720
ttattgcact gggacattcg ggttttgaaa tggataaact catcgctcag aaagtgaggg 780
gtgtggacgt cgtggtggga ggacactcca acacatttct ttacacaggc aatccacctt 840
ccaaagaggt gcctgctggg aagtacccat tcatagtcac ttctgatgat gggcggaagg 900
ttcctgtagt ccaggcctat gcttttggca aatacctagg ctatctgaag atcgagtttg 960
atgaaagagg aaacgtcatc tcttcccatg gaaatcccat tcttctaaac agcagcattc 1020
ctgaagatcc aagcataaaa gcagacatta acaaatggag gataaaattg gataattatt 1080
ctacccagga attagggaaa acaattgtct atctggatgg ctcctctcaa tcatgccgct 1140
ttagagaatg caacatgggc aacctgattt gtgatgcaat gattaacaac aacctgagac 1200
acacggatga aatgttctgg aaccacgtat ccatgtgcat tttaaatgga ggtggtatcc 1260
ggtcgcccat tgatgaacgc aacaatggca caattacctg ggagaacctg gctgctgtat 1320
tgccctttgg aggcacattt gacctagtcc agttaaaagg ttccaccctg aagaaggcct 1380
ttgagcatag cgtgcaccgc tacggccagt ccactggaga gttcctgcag gtgggcggaa 1440
tccatgtggt gtatgatctt tcccgaaaac ctggagacag agtagtcaaa ttagatgttc 1500
tttgcaccaa gtgtcgagtg cccagttatg accctctcaa aatggacgag gtatataagg 1560
tgatcctccc aaacttcctg gccaatggtg gagatgggtt ccagatgata aaagatgaat 1620
tattaagaca tgactctggt gaccaagata tcaacgtggt ttctacatat atctccaaaa 1680
tgaaagtaat ttatccagca gttgaaggtc ggatcaagtt ttccacagga agtcactgcc 1740
atggaagctt ttctttaata tttctttcac tttgggcagt gatctttgtt ttataccaat 1800
agccaaaaat tctccttgcc tttaatgtgt gaaactgcat tttttcaagt gagattcaaa 1860
tctgcctttt aggacctggc tttgtgacag caaaaaccat ctttacaggc tcctagaagc 1920
tgaaggttag agcattataa aatgaagaga cagacatgat tactcagggt cagcaaccta 1980
gtgagttaga aaaaaaatta acatagggcc ctataaggag aaagccaact atgttaagtt 2040
tacgtgtcca aattttaatg aaattttact aacaatttta aaccatattt ttcttcttca 2100
tatccatttc taatccatca aacagcttat gtttacataa aattttatca ttcacaagga 2160
agttttaagc acactgtctc atttgatatc cacaacttat ttttggtagg aaagagagat 2220
gtttttccca cctgtcagat gaaaaaactg aagctcaaaa agggttgact tgaccataca 2280
gctaatgctg acagatccaa gacctagacc taggtctttt gaactcaagt ccagcattct 2340
caactatatc aagttactgt tcagaatact taatatctcc tctcttcata attatcaata 2400
gccccaagct catggatgac aaatctctgc tttatttctt gtctctattt tttcacttta 2460
tagctcctgt tataatagca agtttaatgg tataaacaca ggataccatc ctctcttgca 2520
acacccatgt gcctttgatg agtcaggtag caagctgtag tagataatga gaaaggccag 2580
aggctgcaaa agacagtcaa aggacacgag agaaaggaag gggaagaaca ggactccagg 2640
actgttttat attatagaaa agcaagagct aaagagcatt tacacatgtt aaacagatac 2700
ttgttaagca tagtgcctga cacacggcat tagctgttat tttatgagat tccatcagct 2760
ctgcctctgt cctctttctt ctaacatgaa ggtatcatga gaagagaacc ttctaacata 2820
agctgtaatt ctaaacctgc acttgtccct ctccagcaag aggctagcac tgaattcatt 2880
ctactcatac tacacaccca gttatggaat gtccagagtt ctcgaagaaa ataaatgact 2940
ttaggaagag gtatacattt tttaagtcgc tctgcctcca aatctgaaca gtcactgtaa 3000
atcattctta agcccagata tgagaacttc tgctggaaag tgggaccctc tgagtgggtg 3060
gtcagaaaat acccatgctg atgaaatgac ctatgcccaa agaacaaata cttaacgtgg 3120
gagtggaacc acatgagcct gctcagctct gcataagtaa ttcaagaaat gggaggcttc 3180
accttaaaaa cagtgtgcaa atggcagcta gaggttttga taggaagtat gtttgtttct 3240
tagtgtttac aaatattaag tactcttgat acaaaatata cttttaaact tcataacctt 3300
tttataaaag ttgttgcagc aaaataatag cctcggttct atgcatatat ggattagcta 3360
taaaaaatgt caataagatt gtacaaggaa aattagagaa agtcacattt agggtttatt 3420
ttttacactt ggccagtaaa atagggtaaa tcctattaga attttttaaa gaactttttt 3480
taagtttcct aaatctgtgt gtgtattgtg aagtggtata agaaatgact ttgaaccact 3540
ttgcaattgt agattcccaa caataaaatt gaagataagc tc 3582
<210> 20
<211> 1349
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 20
attaagggtt ccggatcctc ggggacacca aatatggcga tctcggcctt ttcgtttctt 60
ggagctggga catgtttgcc atcgatccat ctaccaccag aacggccgtt agatctgctg 120
ccaccgttgt ttccaccgaa gaaaccaccg ttgccgtaac caccacgacg gttgttgcta 180
aagaagctgc caccgccacg gccaccgttg tagccgccgt tgttgttatt gtagttgctc 240
atgttatttc tggcacttct tggttttcct cttaagtgag gaggaacata accattctcg 300
ttgttgtcgt tgatgcttaa attttgcact tgttcgctca gttcagccat aatatgaaat 360
gcttttcttg ttgttcttac ggaataccac ttgccaccta tcaccacaac taactttttc 420
ccgttcctcc atctctttta tatttttttt ctcgagggat ctttgtgaag gaaccttact 480
tctgtggtgt gacataattg gacaaactac ctacagagat ttaaagctct aaggtaaata 540
taaaattttt aagtgtataa tgtgttaaac tactgattct aattgtttgt gtattttaga 600
ttccaaccta tggaactgat gaatgggagc agtggtggaa tgcctttaat gaggaaaacc 660
tgttttgctc agaagaaatg ccatctagtg atgatgaggc tactgctgac tctcaacatt 720
ctactcctcc aaaaaagaag agaaaggtag aagaccccaa ggactttcct tcagaattgc 780
taagtttttt gagtcatgct gtgtttagta atagaactct tgcttgcttt gctatttaca 840
ccacaaagga aaaagctgca ctgctataca agaaaattat ggaaaaatat tctgtaacct 900
ttataagtag gcataacagt tataatcata acatactgtt ttttcttact ccacacaggc 960
atagagtgtc tgctattaat aactatgctc aaaaattgtg tacctttagc tttttaattt 1020
gtaaaggggt taataaggaa tatttgatgt atagtgcctt gactagagat cataatcagc 1080
cataccacat ttgtagaggt tttacttgct ttaaaaaacc tcccacacct ccccctgaac 1140
ctgaaacata aaatgaatgc aattgttgtt gttaacttgt ttattgcagc ttataatggt 1200
tacaaataaa gcaatagcat cacaaatttc acaaataaag catttttttc actgcattct 1260
agttgtggtt tgtccaaact catcaatgta tcttatcatg tctggatctg acatggtaag 1320
taagcttggg ctgcaggtcg agggaccta 1349
<210> 21
<211> 132
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 21
gaattctaat acgactcact atagggggtc ttcgagaaga cctgttttag agctagaaat 60
agcaagttaa aataaggcta gtccgttatc aacttgaaaa agtggcaccg agtcggtgct 120
tttaaaggat cc 132
<210> 22
<211> 1865
<212> DNA/RNA
<213> Artificial sequence (Artificial Sequence)
<400> 22
cagggagtct ccttgtcctt gatgtcaaac tacatttttc ttccagtgat attcatatct 60
gcctctggag acctggcttt gtaacagcac tcatcatcct caaggttcct agcagatgtt 120
cttcacaagg aagagactgt aacatcattt gttggggcca gcaactcagt gagcagatag 180
aaagtcacag tgaaccaaca gggtccttct ggcagggagt gggtaagggg aaacaactag 240
atgtagcttg catagccaca taacacatct ggttaccatt tccctttcta ttcatttcta 300
atccatcaaa caattgatgt ttacatacaa cttcatcatc gccagctctg gtggcacatg 360
cctgtggtca cggcacttgg cagggaggag aggatggctg caagttctag gccagcctga 420
catatgtaga gtttcaggcc agtcagctag atatcaagac tcacacacac aaacaaacat 480
tataatttac aagtagattt ttgtagacaa gtcttatgat aagctaagca gaaagggtcg 540
acttgttcaa ggtcagacat ctccaaataa tacatctaag atctgtacct gagtcttttt 600
acctcaagtc caacctctcg cctactgtca agtctcctct cttcctgcgg ttggtctcag 660
acaacaaatc tgtttctgcc ttccatagta ccctttcttt tgggctcttg ttgtctctca 720
ggtttgagag agtagctact ggacaggatt ctttcctaca acaccatgtg cctttggtga 780
gtcatagaca aactgtatac agcagataat agattagtcc agggactgca aaaggcagtc 840
agggacaggc aggaaaggga aaggagaata tgaccaggac ttacagaaga gagtagatgg 900
gttccataaa cactagttaa ggagcagaac atggctccaa agcacaacac tggttcttat 960
taactcctgt gcgactccaa aagccctgcc tctctccttc cttcttcctc acatggaggt 1020
gccatgagag aacttcctaa aataaactgt aattctcaac ctgcacctgt cctgtccaag 1080
atgccaacac ggctctcatt caagtcacac atgtacaatc caatattctg gaaggcaaga 1140
gtgaatttaa gaagtggtat acataaatgt ttgaagtcac tgagactcaa atcccacaca 1200
accactgtaa ggcatactca ggtcaagaca tgagaagacc agcaggacag tgagccctat 1260
gggaggccca agagataccc agactgatga aatgatctgt gcccacagaa caccagacca 1320
agacagtcct gtgaccaagt gagcatagtc agcactgtac agtagttaag aatccttaac 1380
tataaaccag cagctagatg tctggacagg gaacatgttg gtttcctagt gtttacgaat 1440
attaaggact cttgacccaa accgtttaaa attccaatcc ttttatgaag tttgttgcag 1500
tgaaacaatg ccttcagttc tttttgaatg tgtagattag ttataaacag aaatgtcagt 1560
aagactgtaa aaaaaaatga agaaaggtgc atatgaggta tatttttatg cttggcgagt 1620
aaataaggag aaatcttata gcataatttt taaagaacat ttgtataatt tttctaaata 1680
tgtgtatatg tatattttat gcagcagtat tgagaaatga ccttggacta tcttgtaatt 1740
gtaaattcca aaaataaagc tgaagatata gtcctttgtc tttggtctta cttcttctct 1800
taacacttgt accctcaggg atgcagcaaa gacaatgttc acttcaaaac tgagacgaac 1860
aatgc 1865
<210> 23
<211> 90
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 23
gcggaaccct tcgaagttcc tattctctag aaagtatagg aacttcccag agcccaaggc 60
actcattatt agcactgggt gagctccatg 90
<210> 24
<211> 25
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 24
gagtggtatg tgcagctcag ggaag 25
<210> 25
<211> 25
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 25
gctccaagaa acgaaaaggc cgaga 25
<210> 26
<211> 23
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 26
agcgcaaaca ttaaggcacg ggg 23
<210> 27
<211> 23
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 27
ttaatgtttg cgctcagaat tgg 23
<210> 28
<211> 23
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 28
gagcgcaaac attaaggcac ggg 23
<210> 29
<211> 23
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 29
aattctgagc gcaaacatta agg 23
<210> 30
<211> 23
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 30
cctctagcac atcagatatc tgg 23
<210> 31
<211> 23
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 31
tgagcgcaaa cattaaggca cgg 23
<210> 32
<211> 23
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 32
gattgttgga tatacttcaa agg 23
<210> 33
<211> 23
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 33
ttggaaattt aacgtttctg agg 23
<210> 34
<211> 23
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 34
catcacttgc ggggaagtcc tgg 23
<210> 35
<211> 23
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 35
tgctgagtcc atcacttgcg ggg 23
<210> 36
<211> 23
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 36
caggacttcc ccgcaagtga tgg 23
<210> 37
<211> 23
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 37
gtctgtaacc ccgataagct tgg 23
<210> 38
<211> 23
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 38
ttgcatctgc ggagttagca tgg 23
<210> 39
<211> 23
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 39
gccatcgacc attcgtgcac agg 23
<210> 40
<211> 23
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 40
ccctgtgcac gaatggtcga tgg 23
<210> 41
<211> 23
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 41
agtggagccc tgtgcacgaa tgg 23
<210> 42
<211> 23
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 42
cctttgaagt atatccaaca atc 23
<210> 43
<211> 27
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 43
tagggattgt tggatatact tcaaagg 27
<210> 44
<211> 27
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 44
aaaccctttg aagtatatcc aacaatc 27
<210> 45
<211> 23
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 45
cctgtgcacg aatggtcgat ggc 23
<210> 46
<211> 27
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 46
tagggccatc gaccattcgt gcacagg 27
<210> 47
<211> 27
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 47
aaaccctgtg cacgaatggt cgatggc 27
<210> 48
<211> 26
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 48
tttgataacg gtgtggaagg actgat 26
<210> 49
<211> 25
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 49
agacagcgat gcccttcaga ggata 25
<210> 50
<211> 24
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 50
tacacggtga accagatagt gccc 24
<210> 51
<211> 25
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 51
gtgggcggaa tccatgtggt gtatg 25
<210> 52
<211> 25
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 52
gccccagggc gatgatctta ttcac 25
<210> 53
<211> 25
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 53
agagaccctt tagacctgga tcagc 25
<210> 54
<211> 26
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 54
acctagaaaa ctttggtggg acatga 26
<210> 55
<211> 25
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 55
aactgatgaa tgggagcagt ggtgg 25
<210> 56
<211> 25
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 56
gcagacactc tatgcctgtg tggag 25
<210> 57
<211> 25
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 57
aagatgtttc ttgtcccggg tttcg 25
<210> 58
<211> 25
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 58
ggactgattg atcccctcct cagaa 25
<210> 59
<211> 22
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 59
gcacctgccg tccatcatct gc 22
<210> 60
<211> 20
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 60
acgtgctgtt tttggatgcc 20
<210> 61
<211> 20
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 61
tggtgtgcaa aatcgtaagc 20
<210> 62
<211> 22
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 62
tcaccatctt ccaggagcga ga 22
<210> 63
<211> 21
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 63
gaaggccatg ccagtgagct t 21
Claims (6)
1. A method for constructing a non-human animal humanized with a CD73 gene, wherein the genome of the non-human animal comprises the humanized CD73 gene, the non-human animal expresses human CD73 protein in vivo, the humanized CD73 gene comprises a part of the human CD73 gene, and the part of the human CD73 gene is SEQ ID NO:9 or SEQ ID NO:17, said construction method comprising the steps of introducing into a nucleic acid sequence of SEQ ID NO:17 to exon 9 and 3' utr of the non-human animal CD73 gene, or alternatively, the amino acid sequence of SEQ ID NO:9 is inserted at exon 2 and intron 2-3 of the non-human animal CD73 gene.
2. The method of claim 1, wherein the amino acid sequence of the human CD73 protein is SEQ ID NO:4, and a polypeptide having the amino acid sequence shown in 4.
3. The method of claim 1, wherein the humanized CD73 gene comprises one of the group consisting of:
(i) Comprising SEQ ID NO: 12. 13 or 18;
(ii) The transcribed mRNA is SEQ ID NO:19, and a nucleotide sequence shown in seq id no.
4. The method of claim 1, wherein the method of constructing comprises constructing using a targeting vector comprising a portion of the human CD73 gene, the targeting vector further comprising a 5' arm and a 3' arm, the 5' arm sequence being SEQ ID NO: 5. 7 or 15; the 3' arm is SEQ ID NO: 6. 8 or 16.
5. The method of claim 1-4, wherein the non-human animal genome further comprises additional genetic modifications, wherein the additional genetic modifications are at least one of PD-1, PD-L1, CTLA-4, LAG-3, BTLA, CD27, CD28, CD47, CD137, CD154, OX40, SIRPalpha, TIGIT, TIM-3, CD40 and GITR genes.
6. The method according to claim 5, wherein the non-human animal is a rat or a mouse.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010807504 | 2020-08-12 | ||
CN202010807504X | 2020-08-12 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN113817770A CN113817770A (en) | 2021-12-21 |
CN113817770B true CN113817770B (en) | 2024-04-05 |
Family
ID=78922920
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202110933098.6A Active CN113817770B (en) | 2020-08-12 | 2021-08-12 | Construction method and application of CD73 gene humanized non-human animal |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN113817770B (en) |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108588126A (en) * | 2017-03-31 | 2018-09-28 | 北京百奥赛图基因生物技术有限公司 | The preparation method and application of the humanization modified animal model of CD47 genes |
CN111197058A (en) * | 2018-11-20 | 2020-05-26 | 北京百奥赛图基因生物技术有限公司 | Preparation method and application of humanized CD73 gene animal model |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
AU2019231791B2 (en) * | 2018-03-09 | 2022-08-11 | Agenus Inc. | Anti-CD73 antibodies and methods of use thereof |
-
2021
- 2021-08-12 CN CN202110933098.6A patent/CN113817770B/en active Active
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108588126A (en) * | 2017-03-31 | 2018-09-28 | 北京百奥赛图基因生物技术有限公司 | The preparation method and application of the humanization modified animal model of CD47 genes |
CN111197058A (en) * | 2018-11-20 | 2020-05-26 | 北京百奥赛图基因生物技术有限公司 | Preparation method and application of humanized CD73 gene animal model |
WO2020103882A1 (en) * | 2018-11-20 | 2020-05-28 | Beijing Biocytogen Co., Ltd. | Genetically modified non-human animal with human or chimeric cd73 |
Also Published As
Publication number | Publication date |
---|---|
CN113817770A (en) | 2021-12-21 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
AU2022201329B2 (en) | Genetically modified cells, tissues, and organs for treating disease | |
CN111837036B (en) | Genetically modified non-human animals with human or chimeric genes | |
KR20230110373A (en) | Genetically modified cells, tissues, and organs for treating disease | |
CN111057721B (en) | Preparation method and application of humanized IL-4 and/or IL-4R alpha modified animal model | |
KR20150027752A (en) | Humanized il-7 rodents | |
CN112779285B (en) | Construction method and application of humanized IL-10 and IL-10RA gene modified animal | |
CN111793646B (en) | Construction method and application of non-human animal subjected to IL1R1 gene humanization transformation | |
KR20220002609A (en) | Modification of Mammalian Cells Using Artificial Micro-RNAs and Compositions of These Products to Alter Properties of Mammalian Cells | |
CN113429472B (en) | CD94 and NKG2A gene humanized non-human animal and its preparation method and use | |
KR20220119038A (en) | Methods for Activation and Expansion of Tumor Infiltrating Lymphocytes | |
CN114164177A (en) | Humanized transgenic animal | |
CN114277055A (en) | Non-human animal humanized by IL1B and IL1A genes and construction method and application thereof | |
CN113429486A (en) | Construction method and application of genetically modified non-human animal | |
RU2744831C2 (en) | Non-human animal exhibiting diminished upper and lower motor neuron function and sensory perception | |
CN114134152B (en) | GLP1R gene humanized non-human animal and construction method and application thereof | |
CN113817770B (en) | Construction method and application of CD73 gene humanized non-human animal | |
CN112501204B (en) | IL21R gene humanized non-human animal and construction method and application thereof | |
CN112553252B (en) | Construction method and application of TNFR2 gene humanized non-human animal | |
CN113831403A (en) | Construction method and application of humanized non-human animal of STING gene | |
CN114853871B (en) | Humanized non-human animal of CSF1 and/or CSF1R gene, construction method and application thereof | |
CN115010800B (en) | Construction method and application of PVRIG gene humanized non-human animal | |
CN112501202B (en) | CXCR4 gene humanized non-human animal and construction method and application thereof | |
CN114621971A (en) | Genetically modified non-human animal, and construction method and application thereof | |
CN115043929A (en) | Construction method and application of VSIG4 gene humanized non-human animal | |
CN115260303A (en) | Construction method and application of CD70 gene humanized non-human animal |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |