CN112501203A - Il17rb基因人源化的非人动物的构建方法及应用 - Google Patents
Il17rb基因人源化的非人动物的构建方法及应用 Download PDFInfo
- Publication number
- CN112501203A CN112501203A CN202110157090.5A CN202110157090A CN112501203A CN 112501203 A CN112501203 A CN 112501203A CN 202110157090 A CN202110157090 A CN 202110157090A CN 112501203 A CN112501203 A CN 112501203A
- Authority
- CN
- China
- Prior art keywords
- il17rb
- gene
- humanized
- seq
- human animal
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 101100071931 Homo sapiens IL17RB gene Proteins 0.000 title claims abstract description 69
- 238000010276 construction Methods 0.000 title claims abstract description 27
- 101001019600 Homo sapiens Interleukin-17 receptor B Proteins 0.000 claims abstract description 140
- 102100035014 Interleukin-17 receptor B Human genes 0.000 claims abstract description 84
- 239000002773 nucleotide Substances 0.000 claims abstract description 60
- 125000003729 nucleotide group Chemical group 0.000 claims abstract description 60
- 230000008685 targeting Effects 0.000 claims abstract description 32
- 239000013598 vector Substances 0.000 claims abstract description 31
- 102000054811 human IL17RB Human genes 0.000 claims abstract description 30
- 241001465754 Metazoa Species 0.000 claims abstract description 21
- 239000003814 drug Substances 0.000 claims abstract description 13
- 238000012216 screening Methods 0.000 claims abstract description 10
- 108090000623 proteins and genes Proteins 0.000 claims description 59
- 238000000034 method Methods 0.000 claims description 35
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 claims description 19
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 18
- 201000010099 disease Diseases 0.000 claims description 17
- 101000998146 Homo sapiens Interleukin-17A Proteins 0.000 claims description 12
- 102100033461 Interleukin-17A Human genes 0.000 claims description 12
- 238000011161 development Methods 0.000 claims description 10
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 claims description 9
- 238000001514 detection method Methods 0.000 claims description 9
- 150000007523 nucleic acids Chemical group 0.000 claims description 8
- 108020004999 messenger RNA Proteins 0.000 claims description 7
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 6
- 230000014509 gene expression Effects 0.000 claims description 6
- 230000000857 drug effect Effects 0.000 claims description 5
- 238000011160 research Methods 0.000 claims description 5
- 238000012795 verification Methods 0.000 claims description 5
- 238000011156 evaluation Methods 0.000 claims description 4
- 238000004519 manufacturing process Methods 0.000 claims description 4
- 230000008569 process Effects 0.000 claims description 4
- 230000001225 therapeutic effect Effects 0.000 claims description 4
- 230000000694 effects Effects 0.000 claims description 3
- 238000001727 in vivo Methods 0.000 claims description 3
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 3
- 230000000259 anti-tumor effect Effects 0.000 claims description 2
- 210000005260 human cell Anatomy 0.000 claims description 2
- 230000001900 immune effect Effects 0.000 claims description 2
- 230000002906 microbiologic effect Effects 0.000 claims description 2
- 230000000144 pharmacologic effect Effects 0.000 claims description 2
- 150000003839 salts Chemical class 0.000 claims 1
- 238000010171 animal model Methods 0.000 abstract description 20
- 206010028980 Neoplasm Diseases 0.000 abstract description 12
- 229940079593 drug Drugs 0.000 abstract description 6
- 238000012827 research and development Methods 0.000 abstract description 3
- 208000026278 immune system disease Diseases 0.000 abstract description 2
- 230000007246 mechanism Effects 0.000 abstract description 2
- 239000002547 new drug Substances 0.000 abstract description 2
- 230000006801 homologous recombination Effects 0.000 abstract 1
- 238000002744 homologous recombination Methods 0.000 abstract 1
- 108020004414 DNA Proteins 0.000 description 40
- 241000699666 Mus <mouse, genus> Species 0.000 description 30
- 241000699670 Mus sp. Species 0.000 description 26
- 210000004027 cell Anatomy 0.000 description 19
- 108700024394 Exon Proteins 0.000 description 18
- 150000001413 amino acids Chemical class 0.000 description 15
- 238000005516 engineering process Methods 0.000 description 12
- 238000003780 insertion Methods 0.000 description 12
- 230000037431 insertion Effects 0.000 description 12
- 238000005215 recombination Methods 0.000 description 12
- 230000006798 recombination Effects 0.000 description 12
- 241000700159 Rattus Species 0.000 description 11
- 239000012634 fragment Substances 0.000 description 11
- 241000283984 Rodentia Species 0.000 description 9
- 239000003550 marker Substances 0.000 description 9
- 102000004169 proteins and genes Human genes 0.000 description 9
- 101001019603 Mus musculus Interleukin-17 receptor B Proteins 0.000 description 8
- 239000000523 sample Substances 0.000 description 8
- 102000015696 Interleukins Human genes 0.000 description 7
- 108010063738 Interleukins Proteins 0.000 description 7
- 238000011577 humanized mouse model Methods 0.000 description 7
- 108010027338 isoleucylcysteine Proteins 0.000 description 7
- 238000006467 substitution reaction Methods 0.000 description 7
- RVKIPWVMZANZLI-UHFFFAOYSA-N H-Lys-Trp-OH Natural products C1=CC=C2C(CC(NC(=O)C(N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-UHFFFAOYSA-N 0.000 description 6
- 238000012239 gene modification Methods 0.000 description 6
- 210000001519 tissue Anatomy 0.000 description 6
- 230000007704 transition Effects 0.000 description 6
- 238000011740 C57BL/6 mouse Methods 0.000 description 5
- 101001019598 Homo sapiens Interleukin-17 receptor A Proteins 0.000 description 5
- 102100035018 Interleukin-17 receptor A Human genes 0.000 description 5
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 5
- 208000005718 Stomach Neoplasms Diseases 0.000 description 5
- 108091027544 Subgenomic mRNA Proteins 0.000 description 5
- 108010060199 cysteinylproline Proteins 0.000 description 5
- 210000001671 embryonic stem cell Anatomy 0.000 description 5
- 206010017758 gastric cancer Diseases 0.000 description 5
- 230000001105 regulatory effect Effects 0.000 description 5
- 201000011549 stomach cancer Diseases 0.000 description 5
- 108010061238 threonyl-glycine Proteins 0.000 description 5
- KNENKKKUYGEZIO-FXQIFTODSA-N Asn-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N KNENKKKUYGEZIO-FXQIFTODSA-N 0.000 description 4
- 206010006187 Breast cancer Diseases 0.000 description 4
- 208000026310 Breast neoplasm Diseases 0.000 description 4
- 241001529936 Murinae Species 0.000 description 4
- 108010057466 NF-kappa B Proteins 0.000 description 4
- 102000003945 NF-kappa B Human genes 0.000 description 4
- 206010061902 Pancreatic neoplasm Diseases 0.000 description 4
- GZBKRJVCRMZAST-XKBZYTNZSA-N Ser-Glu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZBKRJVCRMZAST-XKBZYTNZSA-N 0.000 description 4
- 238000002105 Southern blotting Methods 0.000 description 4
- 238000010362 genome editing Methods 0.000 description 4
- 108010054155 lysyllysine Proteins 0.000 description 4
- 208000015486 malignant pancreatic neoplasm Diseases 0.000 description 4
- 201000002528 pancreatic cancer Diseases 0.000 description 4
- 208000008443 pancreatic carcinoma Diseases 0.000 description 4
- 108010051242 phenylalanylserine Proteins 0.000 description 4
- 238000012163 sequencing technique Methods 0.000 description 4
- 108010026333 seryl-proline Proteins 0.000 description 4
- 230000019491 signal transduction Effects 0.000 description 4
- 210000001541 thymus gland Anatomy 0.000 description 4
- XAGIMRPOEJSYER-CIUDSAMLSA-N Ala-Cys-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N XAGIMRPOEJSYER-CIUDSAMLSA-N 0.000 description 3
- IVKWMMGFLAMMKJ-XVYDVKMFSA-N Ala-His-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N IVKWMMGFLAMMKJ-XVYDVKMFSA-N 0.000 description 3
- CJQAEJMHBAOQHA-DLOVCJGASA-N Ala-Phe-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CJQAEJMHBAOQHA-DLOVCJGASA-N 0.000 description 3
- ATABBWFGOHKROJ-GUBZILKMSA-N Arg-Pro-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O ATABBWFGOHKROJ-GUBZILKMSA-N 0.000 description 3
- XYOVHPDDWCEUDY-CIUDSAMLSA-N Asn-Ala-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O XYOVHPDDWCEUDY-CIUDSAMLSA-N 0.000 description 3
- DNYRZPOWBTYFAF-IHRRRGAJSA-N Asn-Arg-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)N)N)O DNYRZPOWBTYFAF-IHRRRGAJSA-N 0.000 description 3
- XEGZSHSPQNDNRH-JRQIVUDYSA-N Asn-Tyr-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XEGZSHSPQNDNRH-JRQIVUDYSA-N 0.000 description 3
- WJHYGGVCWREQMO-GHCJXIJMSA-N Asp-Cys-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WJHYGGVCWREQMO-GHCJXIJMSA-N 0.000 description 3
- GEEXORWTBTUOHC-FXQIFTODSA-N Cys-Arg-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N)CN=C(N)N GEEXORWTBTUOHC-FXQIFTODSA-N 0.000 description 3
- OETOANMAHTWESF-KKUMJFAQSA-N Cys-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CS)N OETOANMAHTWESF-KKUMJFAQSA-N 0.000 description 3
- DTCCMDYODDPHBG-ACZMJKKPSA-N Gln-Ala-Cys Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(O)=O DTCCMDYODDPHBG-ACZMJKKPSA-N 0.000 description 3
- XEYMBRRKIFYQMF-GUBZILKMSA-N Gln-Asp-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O XEYMBRRKIFYQMF-GUBZILKMSA-N 0.000 description 3
- ALUBSZXSNSPDQV-WDSKDSINSA-N Gln-Cys-Gly Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O ALUBSZXSNSPDQV-WDSKDSINSA-N 0.000 description 3
- ATRHMOJQJWPVBQ-DRZSPHRISA-N Glu-Ala-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ATRHMOJQJWPVBQ-DRZSPHRISA-N 0.000 description 3
- AIGROOHQXCACHL-WDSKDSINSA-N Glu-Gly-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O AIGROOHQXCACHL-WDSKDSINSA-N 0.000 description 3
- QMOSCLNJVKSHHU-YUMQZZPRSA-N Glu-Met-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O QMOSCLNJVKSHHU-YUMQZZPRSA-N 0.000 description 3
- FGGKGJHCVMYGCD-UKJIMTQDSA-N Glu-Val-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FGGKGJHCVMYGCD-UKJIMTQDSA-N 0.000 description 3
- IXKRSKPKSLXIHN-YUMQZZPRSA-N Gly-Cys-Leu Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O IXKRSKPKSLXIHN-YUMQZZPRSA-N 0.000 description 3
- QITBQGJOXQYMOA-ZETCQYMHSA-N Gly-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)CN QITBQGJOXQYMOA-ZETCQYMHSA-N 0.000 description 3
- IEGFSKKANYKBDU-QWHCGFSZSA-N Gly-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)CN)C(=O)O IEGFSKKANYKBDU-QWHCGFSZSA-N 0.000 description 3
- HAOUOFNNJJLVNS-BQBZGAKWSA-N Gly-Pro-Ser Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O HAOUOFNNJJLVNS-BQBZGAKWSA-N 0.000 description 3
- WNGHUXFWEWTKAO-YUMQZZPRSA-N Gly-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN WNGHUXFWEWTKAO-YUMQZZPRSA-N 0.000 description 3
- CUVBTVWFVIIDOC-YEPSODPASA-N Gly-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)CN CUVBTVWFVIIDOC-YEPSODPASA-N 0.000 description 3
- 101000998151 Homo sapiens Interleukin-17F Proteins 0.000 description 3
- XHBYEMIUENPZLY-GMOBBJLQSA-N Ile-Pro-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O XHBYEMIUENPZLY-GMOBBJLQSA-N 0.000 description 3
- MLSUZXHSNRBDCI-CYDGBPFRSA-N Ile-Pro-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)O)N MLSUZXHSNRBDCI-CYDGBPFRSA-N 0.000 description 3
- GVEODXUBBFDBPW-MGHWNKPDSA-N Ile-Tyr-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 GVEODXUBBFDBPW-MGHWNKPDSA-N 0.000 description 3
- 102100033454 Interleukin-17F Human genes 0.000 description 3
- HQUXQAMSWFIRET-AVGNSLFASA-N Leu-Glu-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HQUXQAMSWFIRET-AVGNSLFASA-N 0.000 description 3
- ZRHDPZAAWLXXIR-SRVKXCTJSA-N Leu-Lys-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O ZRHDPZAAWLXXIR-SRVKXCTJSA-N 0.000 description 3
- YESNGRDJQWDYLH-KKUMJFAQSA-N Leu-Phe-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)O)N YESNGRDJQWDYLH-KKUMJFAQSA-N 0.000 description 3
- KLSUAWUZBMAZCL-RHYQMDGZSA-N Leu-Thr-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(O)=O KLSUAWUZBMAZCL-RHYQMDGZSA-N 0.000 description 3
- NQCJGQHHYZNUDK-DCAQKATOSA-N Lys-Arg-Ser Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CCCN=C(N)N NQCJGQHHYZNUDK-DCAQKATOSA-N 0.000 description 3
- QYOXSYXPHUHOJR-GUBZILKMSA-N Lys-Asn-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QYOXSYXPHUHOJR-GUBZILKMSA-N 0.000 description 3
- SLQJJFAVWSZLBL-BJDJZHNGSA-N Lys-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN SLQJJFAVWSZLBL-BJDJZHNGSA-N 0.000 description 3
- PYFNONMJYNJENN-AVGNSLFASA-N Lys-Lys-Gln Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PYFNONMJYNJENN-AVGNSLFASA-N 0.000 description 3
- AOFZWWDTTJLHOU-ULQDDVLXSA-N Met-Lys-Tyr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 AOFZWWDTTJLHOU-ULQDDVLXSA-N 0.000 description 3
- 241000699729 Muridae Species 0.000 description 3
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 3
- 108010087066 N2-tryptophyllysine Proteins 0.000 description 3
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 3
- WWPAHTZOWURIMR-ULQDDVLXSA-N Phe-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=CC=C1 WWPAHTZOWURIMR-ULQDDVLXSA-N 0.000 description 3
- GNRMAQSIROFNMI-IXOXFDKPSA-N Phe-Thr-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O GNRMAQSIROFNMI-IXOXFDKPSA-N 0.000 description 3
- IHCXPSYCHXFXKT-DCAQKATOSA-N Pro-Arg-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O IHCXPSYCHXFXKT-DCAQKATOSA-N 0.000 description 3
- FXGIMYRVJJEIIM-UWVGGRQHSA-N Pro-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FXGIMYRVJJEIIM-UWVGGRQHSA-N 0.000 description 3
- JXVXYRZQIUPYSA-NHCYSSNCSA-N Pro-Val-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JXVXYRZQIUPYSA-NHCYSSNCSA-N 0.000 description 3
- 206010039491 Sarcoma Diseases 0.000 description 3
- BRGQQXQKPUCUJQ-KBIXCLLPSA-N Ser-Glu-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BRGQQXQKPUCUJQ-KBIXCLLPSA-N 0.000 description 3
- PCMZJFMUYWIERL-ZKWXMUAHSA-N Ser-Val-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PCMZJFMUYWIERL-ZKWXMUAHSA-N 0.000 description 3
- SGZVZUCRAVSPKQ-FXQIFTODSA-N Ser-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N SGZVZUCRAVSPKQ-FXQIFTODSA-N 0.000 description 3
- UUSQVWOVUYMLJA-PPCPHDFISA-N Thr-Lys-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UUSQVWOVUYMLJA-PPCPHDFISA-N 0.000 description 3
- IEZVHOULSUULHD-XGEHTFHBSA-N Thr-Ser-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O IEZVHOULSUULHD-XGEHTFHBSA-N 0.000 description 3
- WACMTVIJWRNVSO-CWRNSKLLSA-N Trp-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)O WACMTVIJWRNVSO-CWRNSKLLSA-N 0.000 description 3
- YXSSXUIBUJGHJY-SFJXLCSZSA-N Trp-Thr-Phe Chemical compound C([C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)[C@H](O)C)C(O)=O)C1=CC=CC=C1 YXSSXUIBUJGHJY-SFJXLCSZSA-N 0.000 description 3
- KWKJGBHDYJOVCR-SRVKXCTJSA-N Tyr-Ser-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N)O KWKJGBHDYJOVCR-SRVKXCTJSA-N 0.000 description 3
- JOQSQZFKFYJKKJ-GUBZILKMSA-N Val-Arg-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N JOQSQZFKFYJKKJ-GUBZILKMSA-N 0.000 description 3
- QGFPYRPIUXBYGR-YDHLFZDLSA-N Val-Asn-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N QGFPYRPIUXBYGR-YDHLFZDLSA-N 0.000 description 3
- BMOFUVHDBROBSE-DCAQKATOSA-N Val-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N BMOFUVHDBROBSE-DCAQKATOSA-N 0.000 description 3
- 108010005233 alanylglutamic acid Proteins 0.000 description 3
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 3
- 208000006673 asthma Diseases 0.000 description 3
- -1 at least 200 Chemical class 0.000 description 3
- 210000002459 blastocyst Anatomy 0.000 description 3
- 210000000349 chromosome Anatomy 0.000 description 3
- 238000010586 diagram Methods 0.000 description 3
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 3
- 208000032839 leukemia Diseases 0.000 description 3
- 108010045397 lysyl-tyrosyl-lysine Proteins 0.000 description 3
- 238000012986 modification Methods 0.000 description 3
- 230000004048 modification Effects 0.000 description 3
- 108020004707 nucleic acids Proteins 0.000 description 3
- 102000039446 nucleic acids Human genes 0.000 description 3
- 210000000056 organ Anatomy 0.000 description 3
- 230000037361 pathway Effects 0.000 description 3
- 238000002360 preparation method Methods 0.000 description 3
- 108010079317 prolyl-tyrosine Proteins 0.000 description 3
- 108010053725 prolylvaline Proteins 0.000 description 3
- 238000003757 reverse transcription PCR Methods 0.000 description 3
- 208000024891 symptom Diseases 0.000 description 3
- 230000009466 transformation Effects 0.000 description 3
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 2
- UQYCFWDXGAGNGW-UHFFFAOYSA-N 2-[[2-[[2-[(2-amino-3-methylpentanoyl)amino]-3-methylpentanoyl]amino]acetyl]amino]-3-phenylpropanoic acid Chemical compound CCC(C)C(N)C(=O)NC(C(C)CC)C(=O)NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 UQYCFWDXGAGNGW-UHFFFAOYSA-N 0.000 description 2
- 241000699725 Acomys Species 0.000 description 2
- KRHRBKYBJXMYBB-WHFBIAKZSA-N Ala-Cys-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O KRHRBKYBJXMYBB-WHFBIAKZSA-N 0.000 description 2
- WNHNMKOFKCHKKD-BFHQHQDPSA-N Ala-Thr-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O WNHNMKOFKCHKKD-BFHQHQDPSA-N 0.000 description 2
- RKQRHMKFNBYOTN-IHRRRGAJSA-N Arg-His-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N RKQRHMKFNBYOTN-IHRRRGAJSA-N 0.000 description 2
- ULBHWNVWSCJLCO-NHCYSSNCSA-N Arg-Val-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N ULBHWNVWSCJLCO-NHCYSSNCSA-N 0.000 description 2
- APHUDFFMXFYRKP-CIUDSAMLSA-N Asn-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N APHUDFFMXFYRKP-CIUDSAMLSA-N 0.000 description 2
- IBLAOXSULLECQZ-IUKAMOBKSA-N Asn-Ile-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC(N)=O IBLAOXSULLECQZ-IUKAMOBKSA-N 0.000 description 2
- ZVUMKOMKQCANOM-AVGNSLFASA-N Asn-Phe-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZVUMKOMKQCANOM-AVGNSLFASA-N 0.000 description 2
- TVIZQBFURPLQDV-DJFWLOJKSA-N Asp-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC(=O)O)N TVIZQBFURPLQDV-DJFWLOJKSA-N 0.000 description 2
- CJUKAWUWBZCTDQ-SRVKXCTJSA-N Asp-Leu-Lys Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O CJUKAWUWBZCTDQ-SRVKXCTJSA-N 0.000 description 2
- CZIVKMOEXPILDK-SRVKXCTJSA-N Asp-Tyr-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O CZIVKMOEXPILDK-SRVKXCTJSA-N 0.000 description 2
- 208000023275 Autoimmune disease Diseases 0.000 description 2
- 108010074708 B7-H1 Antigen Proteins 0.000 description 2
- 102000008096 B7-H1 Antigen Human genes 0.000 description 2
- 102100021943 C-C motif chemokine 2 Human genes 0.000 description 2
- 101710155857 C-C motif chemokine 2 Proteins 0.000 description 2
- 108091033409 CRISPR Proteins 0.000 description 2
- 108091026890 Coding region Proteins 0.000 description 2
- 241000699800 Cricetinae Species 0.000 description 2
- JTNKVWLMDHIUOG-IHRRRGAJSA-N Cys-Arg-Phe Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JTNKVWLMDHIUOG-IHRRRGAJSA-N 0.000 description 2
- YZFCGHIBLBDZDA-ZLUOBGJFSA-N Cys-Asp-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O YZFCGHIBLBDZDA-ZLUOBGJFSA-N 0.000 description 2
- XTHUKRLJRUVVBF-WHFBIAKZSA-N Cys-Gly-Ser Chemical compound SC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O XTHUKRLJRUVVBF-WHFBIAKZSA-N 0.000 description 2
- YQEHNIKPAOPBNH-DCAQKATOSA-N Cys-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CS)N YQEHNIKPAOPBNH-DCAQKATOSA-N 0.000 description 2
- FNXOZWPPOJRBRE-XGEHTFHBSA-N Cys-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CS)N)O FNXOZWPPOJRBRE-XGEHTFHBSA-N 0.000 description 2
- 102000004127 Cytokines Human genes 0.000 description 2
- 108090000695 Cytokines Proteins 0.000 description 2
- 102000016607 Diphtheria Toxin Human genes 0.000 description 2
- 108010053187 Diphtheria Toxin Proteins 0.000 description 2
- 102000004190 Enzymes Human genes 0.000 description 2
- 108090000790 Enzymes Proteins 0.000 description 2
- 208000000461 Esophageal Neoplasms Diseases 0.000 description 2
- 241000699694 Gerbillinae Species 0.000 description 2
- MGJMFSBEMSNYJL-AVGNSLFASA-N Gln-Asn-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MGJMFSBEMSNYJL-AVGNSLFASA-N 0.000 description 2
- IKFZXRLDMYWNBU-YUMQZZPRSA-N Gln-Gly-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N IKFZXRLDMYWNBU-YUMQZZPRSA-N 0.000 description 2
- JRHPEMVLTRADLJ-AVGNSLFASA-N Gln-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JRHPEMVLTRADLJ-AVGNSLFASA-N 0.000 description 2
- OKQLXOYFUPVEHI-CIUDSAMLSA-N Gln-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N OKQLXOYFUPVEHI-CIUDSAMLSA-N 0.000 description 2
- YRHZWVKUFWCEPW-GLLZPBPUSA-N Gln-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O YRHZWVKUFWCEPW-GLLZPBPUSA-N 0.000 description 2
- ININBLZFFVOQIO-JHEQGTHGSA-N Gln-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N)O ININBLZFFVOQIO-JHEQGTHGSA-N 0.000 description 2
- RDDSZZJOKDVPAE-ACZMJKKPSA-N Glu-Asn-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RDDSZZJOKDVPAE-ACZMJKKPSA-N 0.000 description 2
- DSPQRJXOIXHOHK-WDSKDSINSA-N Glu-Asp-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O DSPQRJXOIXHOHK-WDSKDSINSA-N 0.000 description 2
- VMKCPNBBPGGQBJ-GUBZILKMSA-N Glu-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N VMKCPNBBPGGQBJ-GUBZILKMSA-N 0.000 description 2
- LCNXZQROPKFGQK-WHFBIAKZSA-N Gly-Asp-Ser Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O LCNXZQROPKFGQK-WHFBIAKZSA-N 0.000 description 2
- QGZSAHIZRQHCEQ-QWRGUYRKSA-N Gly-Asp-Tyr Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QGZSAHIZRQHCEQ-QWRGUYRKSA-N 0.000 description 2
- NTBOEZICHOSJEE-YUMQZZPRSA-N Gly-Lys-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NTBOEZICHOSJEE-YUMQZZPRSA-N 0.000 description 2
- IRJWAYCXIYUHQE-WHFBIAKZSA-N Gly-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)CN IRJWAYCXIYUHQE-WHFBIAKZSA-N 0.000 description 2
- KYMUEAZVLPRVAE-GUBZILKMSA-N His-Asn-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O KYMUEAZVLPRVAE-GUBZILKMSA-N 0.000 description 2
- BPOHQCZZSFBSON-KKUMJFAQSA-N His-Leu-His Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)Cc1cnc[nH]1)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O BPOHQCZZSFBSON-KKUMJFAQSA-N 0.000 description 2
- ILUVWFTXAUYOBW-CUJWVEQBSA-N His-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC1=CN=CN1)N)O ILUVWFTXAUYOBW-CUJWVEQBSA-N 0.000 description 2
- VXZZUXWAOMWWJH-QTKMDUPCSA-N His-Thr-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O VXZZUXWAOMWWJH-QTKMDUPCSA-N 0.000 description 2
- 241000282412 Homo Species 0.000 description 2
- 101001019602 Homo sapiens Interleukin-17 receptor C Proteins 0.000 description 2
- 101000998181 Homo sapiens Interleukin-17B Proteins 0.000 description 2
- 101000853002 Homo sapiens Interleukin-25 Proteins 0.000 description 2
- 101000950669 Homo sapiens Mitogen-activated protein kinase 9 Proteins 0.000 description 2
- 102000003839 Human Proteins Human genes 0.000 description 2
- 108090000144 Human Proteins Proteins 0.000 description 2
- 102100035012 Interleukin-17 receptor C Human genes 0.000 description 2
- 102100033101 Interleukin-17B Human genes 0.000 description 2
- 102100036680 Interleukin-25 Human genes 0.000 description 2
- 108010025815 Kanamycin Kinase Proteins 0.000 description 2
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 2
- AUBMZAMQCOYSIC-MNXVOIDGSA-N Leu-Ile-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O AUBMZAMQCOYSIC-MNXVOIDGSA-N 0.000 description 2
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 2
- 206010025323 Lymphomas Diseases 0.000 description 2
- WBSCNDJQPKSPII-KKUMJFAQSA-N Lys-Lys-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O WBSCNDJQPKSPII-KKUMJFAQSA-N 0.000 description 2
- MIMXMVDLMDMOJD-BZSNNMDCSA-N Lys-Tyr-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O MIMXMVDLMDMOJD-BZSNNMDCSA-N 0.000 description 2
- DRRXXZBXDMLGFC-IHRRRGAJSA-N Lys-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN DRRXXZBXDMLGFC-IHRRRGAJSA-N 0.000 description 2
- 102000043136 MAP kinase family Human genes 0.000 description 2
- 108091054455 MAP kinase family Proteins 0.000 description 2
- 241000124008 Mammalia Species 0.000 description 2
- 102100037809 Mitogen-activated protein kinase 9 Human genes 0.000 description 2
- 108010079364 N-glycylalanine Proteins 0.000 description 2
- 206010030155 Oesophageal carcinoma Diseases 0.000 description 2
- MJQFZGOIVBDIMZ-WHOFXGATSA-N Phe-Ile-Gly Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)O MJQFZGOIVBDIMZ-WHOFXGATSA-N 0.000 description 2
- SSWJYJHXQOYTSP-SRVKXCTJSA-N Pro-His-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(O)=O SSWJYJHXQOYTSP-SRVKXCTJSA-N 0.000 description 2
- CLJLVCYFABNTHP-DCAQKATOSA-N Pro-Leu-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O CLJLVCYFABNTHP-DCAQKATOSA-N 0.000 description 2
- BJCXXMGGPHRSHV-GUBZILKMSA-N Pro-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 BJCXXMGGPHRSHV-GUBZILKMSA-N 0.000 description 2
- CXGLFEOYCJFKPR-RCWTZXSCSA-N Pro-Thr-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O CXGLFEOYCJFKPR-RCWTZXSCSA-N 0.000 description 2
- YDTUEBLEAVANFH-RCWTZXSCSA-N Pro-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 YDTUEBLEAVANFH-RCWTZXSCSA-N 0.000 description 2
- 101710089372 Programmed cell death protein 1 Proteins 0.000 description 2
- 102000001708 Protein Isoforms Human genes 0.000 description 2
- 108010029485 Protein Isoforms Proteins 0.000 description 2
- 108010076504 Protein Sorting Signals Proteins 0.000 description 2
- KJMOINFQVCCSDX-XKBZYTNZSA-N Ser-Gln-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KJMOINFQVCCSDX-XKBZYTNZSA-N 0.000 description 2
- HVKMTOIAYDOJPL-NRPADANISA-N Ser-Gln-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVKMTOIAYDOJPL-NRPADANISA-N 0.000 description 2
- PTWIYDNFWPXQSD-GARJFASQSA-N Ser-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CO)N)C(=O)O PTWIYDNFWPXQSD-GARJFASQSA-N 0.000 description 2
- FKYWFUYPVKLJLP-DCAQKATOSA-N Ser-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO FKYWFUYPVKLJLP-DCAQKATOSA-N 0.000 description 2
- PCJLFYBAQZQOFE-KATARQTJSA-N Ser-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N)O PCJLFYBAQZQOFE-KATARQTJSA-N 0.000 description 2
- UBTNVMGPMYDYIU-HJPIBITLSA-N Ser-Tyr-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UBTNVMGPMYDYIU-HJPIBITLSA-N 0.000 description 2
- LGIMRDKGABDMBN-DCAQKATOSA-N Ser-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N LGIMRDKGABDMBN-DCAQKATOSA-N 0.000 description 2
- HSWXBJCBYSWBPT-GUBZILKMSA-N Ser-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CO)C(C)C)C(O)=O HSWXBJCBYSWBPT-GUBZILKMSA-N 0.000 description 2
- 241000121210 Sigmodontinae Species 0.000 description 2
- CAGTXGDOIFXLPC-KZVJFYERSA-N Thr-Arg-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CCCN=C(N)N CAGTXGDOIFXLPC-KZVJFYERSA-N 0.000 description 2
- LYGKYFKSZTUXGZ-ZDLURKLDSA-N Thr-Cys-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)NCC(O)=O LYGKYFKSZTUXGZ-ZDLURKLDSA-N 0.000 description 2
- WKGAAMOJPMBBMC-IXOXFDKPSA-N Thr-Ser-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WKGAAMOJPMBBMC-IXOXFDKPSA-N 0.000 description 2
- COYHRQWNJDJCNA-NUJDXYNKSA-N Thr-Thr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O COYHRQWNJDJCNA-NUJDXYNKSA-N 0.000 description 2
- ZOCJFNXUVSGBQI-HSHDSVGOSA-N Thr-Trp-Arg Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N)O ZOCJFNXUVSGBQI-HSHDSVGOSA-N 0.000 description 2
- QGVBFDIREUUSHX-IFFSRLJSSA-N Thr-Val-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O QGVBFDIREUUSHX-IFFSRLJSSA-N 0.000 description 2
- KPMIQCXJDVKWKO-IFFSRLJSSA-N Thr-Val-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O KPMIQCXJDVKWKO-IFFSRLJSSA-N 0.000 description 2
- BPGDJSUFQKWUBK-KJEVXHAQSA-N Thr-Val-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 BPGDJSUFQKWUBK-KJEVXHAQSA-N 0.000 description 2
- UKWSFUSPGPBJGU-VFAJRCTISA-N Trp-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O UKWSFUSPGPBJGU-VFAJRCTISA-N 0.000 description 2
- WURLIFOWSMBUAR-SLFFLAALSA-N Tyr-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC3=CC=C(C=C3)O)N)C(=O)O WURLIFOWSMBUAR-SLFFLAALSA-N 0.000 description 2
- QIVPZSWBBHRNBA-JYJNAYRXSA-N Val-Pro-Phe Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccccc1)C(O)=O QIVPZSWBBHRNBA-JYJNAYRXSA-N 0.000 description 2
- PMKQKNBISAOSRI-XHSDSOJGSA-N Val-Tyr-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N PMKQKNBISAOSRI-XHSDSOJGSA-N 0.000 description 2
- YKZVPMUGEJXEOR-JYJNAYRXSA-N Val-Val-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N YKZVPMUGEJXEOR-JYJNAYRXSA-N 0.000 description 2
- 125000000539 amino acid group Chemical group 0.000 description 2
- 108010008355 arginyl-glutamine Proteins 0.000 description 2
- 201000011510 cancer Diseases 0.000 description 2
- 238000013373 clone screening Methods 0.000 description 2
- 238000013377 clone selection method Methods 0.000 description 2
- 239000002299 complementary DNA Substances 0.000 description 2
- 238000007796 conventional method Methods 0.000 description 2
- 230000001086 cytosolic effect Effects 0.000 description 2
- 238000012217 deletion Methods 0.000 description 2
- 230000037430 deletion Effects 0.000 description 2
- 208000035475 disorder Diseases 0.000 description 2
- 238000001976 enzyme digestion Methods 0.000 description 2
- 201000004101 esophageal cancer Diseases 0.000 description 2
- 230000006870 function Effects 0.000 description 2
- 238000010363 gene targeting Methods 0.000 description 2
- 238000010353 genetic engineering Methods 0.000 description 2
- 108010042598 glutamyl-aspartyl-glycine Proteins 0.000 description 2
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 2
- 108010082286 glycyl-seryl-alanine Proteins 0.000 description 2
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 2
- 108010064235 lysylglycine Proteins 0.000 description 2
- 238000010369 molecular cloning Methods 0.000 description 2
- 210000003101 oviduct Anatomy 0.000 description 2
- 108010012581 phenylalanylglutamate Proteins 0.000 description 2
- 108700042769 prolyl-leucyl-glycine Proteins 0.000 description 2
- 102000005962 receptors Human genes 0.000 description 2
- 108020003175 receptors Proteins 0.000 description 2
- 238000012360 testing method Methods 0.000 description 2
- 210000004881 tumor cell Anatomy 0.000 description 2
- 238000011144 upstream manufacturing Methods 0.000 description 2
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 2
- NHBKXEKEPDILRR-UHFFFAOYSA-N 2,3-bis(butanoylsulfanyl)propyl butanoate Chemical compound CCCC(=O)OCC(SC(=O)CCC)CSC(=O)CCC NHBKXEKEPDILRR-UHFFFAOYSA-N 0.000 description 1
- 208000031261 Acute myeloid leukaemia Diseases 0.000 description 1
- YLTKNGYYPIWKHZ-ACZMJKKPSA-N Ala-Ala-Glu Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O YLTKNGYYPIWKHZ-ACZMJKKPSA-N 0.000 description 1
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 1
- WQLDNOCHHRISMS-NAKRPEOUSA-N Ala-Pro-Ile Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WQLDNOCHHRISMS-NAKRPEOUSA-N 0.000 description 1
- 201000003076 Angiosarcoma Diseases 0.000 description 1
- 208000031873 Animal Disease Models Diseases 0.000 description 1
- UXJCMQFPDWCHKX-DCAQKATOSA-N Arg-Arg-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O UXJCMQFPDWCHKX-DCAQKATOSA-N 0.000 description 1
- RKRSYHCNPFGMTA-CIUDSAMLSA-N Arg-Glu-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O RKRSYHCNPFGMTA-CIUDSAMLSA-N 0.000 description 1
- OHYQKYUTLIPFOX-ZPFDUUQYSA-N Arg-Glu-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OHYQKYUTLIPFOX-ZPFDUUQYSA-N 0.000 description 1
- OISWSORSLQOGFV-AVGNSLFASA-N Arg-Met-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CCCN=C(N)N OISWSORSLQOGFV-AVGNSLFASA-N 0.000 description 1
- UYXXMIZGHYKYAT-NHCYSSNCSA-N Asn-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC(=O)N)N UYXXMIZGHYKYAT-NHCYSSNCSA-N 0.000 description 1
- ORJQQZIXTOYGGH-SRVKXCTJSA-N Asn-Lys-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ORJQQZIXTOYGGH-SRVKXCTJSA-N 0.000 description 1
- NCXTYSVDWLAQGZ-ZKWXMUAHSA-N Asn-Ser-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O NCXTYSVDWLAQGZ-ZKWXMUAHSA-N 0.000 description 1
- CASGONAXMZPHCK-FXQIFTODSA-N Asp-Asn-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N)CN=C(N)N CASGONAXMZPHCK-FXQIFTODSA-N 0.000 description 1
- VZNOVQKGJQJOCS-SRVKXCTJSA-N Asp-Asp-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VZNOVQKGJQJOCS-SRVKXCTJSA-N 0.000 description 1
- BIVYLQMZPHDUIH-WHFBIAKZSA-N Asp-Gly-Cys Chemical compound C([C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N)C(=O)O BIVYLQMZPHDUIH-WHFBIAKZSA-N 0.000 description 1
- SPWXXPFDTMYTRI-IUKAMOBKSA-N Asp-Ile-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SPWXXPFDTMYTRI-IUKAMOBKSA-N 0.000 description 1
- LTCKTLYKRMCFOC-KKUMJFAQSA-N Asp-Phe-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O LTCKTLYKRMCFOC-KKUMJFAQSA-N 0.000 description 1
- RPUYTJJZXQBWDT-SRVKXCTJSA-N Asp-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N RPUYTJJZXQBWDT-SRVKXCTJSA-N 0.000 description 1
- FIAKNCXQFFKSSI-ZLUOBGJFSA-N Asp-Ser-Cys Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(O)=O FIAKNCXQFFKSSI-ZLUOBGJFSA-N 0.000 description 1
- NAAAPCLFJPURAM-HJGDQZAQSA-N Asp-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O NAAAPCLFJPURAM-HJGDQZAQSA-N 0.000 description 1
- 206010003571 Astrocytoma Diseases 0.000 description 1
- 208000036170 B-Cell Marginal Zone Lymphoma Diseases 0.000 description 1
- 208000010839 B-cell chronic lymphocytic leukemia Diseases 0.000 description 1
- 208000003950 B-cell lymphoma Diseases 0.000 description 1
- 208000032791 BCR-ABL1 positive chronic myelogenous leukemia Diseases 0.000 description 1
- 241000894006 Bacteria Species 0.000 description 1
- 235000017166 Bambusa arundinacea Nutrition 0.000 description 1
- 235000017491 Bambusa tulda Nutrition 0.000 description 1
- 206010004593 Bile duct cancer Diseases 0.000 description 1
- 206010005003 Bladder cancer Diseases 0.000 description 1
- 206010005949 Bone cancer Diseases 0.000 description 1
- 208000018084 Bone neoplasm Diseases 0.000 description 1
- 208000003174 Brain Neoplasms Diseases 0.000 description 1
- 101710155856 C-C motif chemokine 3 Proteins 0.000 description 1
- 238000011752 CBA/J (JAX™ mouse strain) Methods 0.000 description 1
- 238000010354 CRISPR gene editing Methods 0.000 description 1
- 241000700193 Calomyscus Species 0.000 description 1
- 241000282693 Cercopithecidae Species 0.000 description 1
- 206010008342 Cervix carcinoma Diseases 0.000 description 1
- 102000000013 Chemokine CCL3 Human genes 0.000 description 1
- 208000005243 Chondrosarcoma Diseases 0.000 description 1
- 208000010833 Chronic myeloid leukaemia Diseases 0.000 description 1
- 206010009900 Colitis ulcerative Diseases 0.000 description 1
- 206010009944 Colon cancer Diseases 0.000 description 1
- VZKXOWRNJDEGLZ-WHFBIAKZSA-N Cys-Asp-Gly Chemical compound SC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O VZKXOWRNJDEGLZ-WHFBIAKZSA-N 0.000 description 1
- GGRDJANMZPGMNS-CIUDSAMLSA-N Cys-Ser-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O GGRDJANMZPGMNS-CIUDSAMLSA-N 0.000 description 1
- JLZCAZJGWNRXCI-XKBZYTNZSA-N Cys-Thr-Glu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O JLZCAZJGWNRXCI-XKBZYTNZSA-N 0.000 description 1
- HPZAJRPYUIHDIN-BZSNNMDCSA-N Cys-Tyr-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CS)N HPZAJRPYUIHDIN-BZSNNMDCSA-N 0.000 description 1
- LPBUBIHAVKXUOT-FXQIFTODSA-N Cys-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N LPBUBIHAVKXUOT-FXQIFTODSA-N 0.000 description 1
- 238000011765 DBA/2 mouse Methods 0.000 description 1
- 201000004624 Dermatitis Diseases 0.000 description 1
- 206010014733 Endometrial cancer Diseases 0.000 description 1
- 206010014759 Endometrial neoplasm Diseases 0.000 description 1
- 108010042407 Endonucleases Proteins 0.000 description 1
- 102000004533 Endonucleases Human genes 0.000 description 1
- 206010015548 Euthanasia Diseases 0.000 description 1
- 208000006168 Ewing Sarcoma Diseases 0.000 description 1
- 102100027280 Fanconi anemia group A protein Human genes 0.000 description 1
- 201000008808 Fibrosarcoma Diseases 0.000 description 1
- YNNXQZDEOCYJJL-CIUDSAMLSA-N Gln-Arg-Asp Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)CN=C(N)N YNNXQZDEOCYJJL-CIUDSAMLSA-N 0.000 description 1
- LLVXTGUTDYMJLY-GUBZILKMSA-N Gln-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N LLVXTGUTDYMJLY-GUBZILKMSA-N 0.000 description 1
- UFNSPPFJOHNXRE-AUTRQRHGSA-N Gln-Gln-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O UFNSPPFJOHNXRE-AUTRQRHGSA-N 0.000 description 1
- LGWNISYVKDNJRP-FXQIFTODSA-N Gln-Ser-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O LGWNISYVKDNJRP-FXQIFTODSA-N 0.000 description 1
- BBFCMGBMYIAGRS-AUTRQRHGSA-N Gln-Val-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O BBFCMGBMYIAGRS-AUTRQRHGSA-N 0.000 description 1
- JRCUFCXYZLPSDZ-ACZMJKKPSA-N Glu-Asp-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O JRCUFCXYZLPSDZ-ACZMJKKPSA-N 0.000 description 1
- FBEJIDRSQCGFJI-GUBZILKMSA-N Glu-Leu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FBEJIDRSQCGFJI-GUBZILKMSA-N 0.000 description 1
- QNJNPKSWAHPYGI-JYJNAYRXSA-N Glu-Phe-Leu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=CC=C1 QNJNPKSWAHPYGI-JYJNAYRXSA-N 0.000 description 1
- FGSGPLRPQCZBSQ-AVGNSLFASA-N Glu-Phe-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O FGSGPLRPQCZBSQ-AVGNSLFASA-N 0.000 description 1
- ZSIDREAPEPAPKL-XIRDDKMYSA-N Glu-Trp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCC(=O)O)N ZSIDREAPEPAPKL-XIRDDKMYSA-N 0.000 description 1
- CCQOOWAONKGYKQ-BYPYZUCNSA-N Gly-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)CN CCQOOWAONKGYKQ-BYPYZUCNSA-N 0.000 description 1
- UPADCCSMVOQAGF-LBPRGKRZSA-N Gly-Gly-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)CNC(=O)CN)C(O)=O)=CNC2=C1 UPADCCSMVOQAGF-LBPRGKRZSA-N 0.000 description 1
- BXICSAQLIHFDDL-YUMQZZPRSA-N Gly-Lys-Asn Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O BXICSAQLIHFDDL-YUMQZZPRSA-N 0.000 description 1
- GGAPHLIUUTVYMX-QWRGUYRKSA-N Gly-Phe-Ser Chemical compound OC[C@@H](C([O-])=O)NC(=O)[C@@H](NC(=O)C[NH3+])CC1=CC=CC=C1 GGAPHLIUUTVYMX-QWRGUYRKSA-N 0.000 description 1
- ABPRMMYHROQBLY-NKWVEPMBSA-N Gly-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)CN)C(=O)O ABPRMMYHROQBLY-NKWVEPMBSA-N 0.000 description 1
- 208000001258 Hemangiosarcoma Diseases 0.000 description 1
- 102100031573 Hematopoietic progenitor cell antigen CD34 Human genes 0.000 description 1
- 208000035186 Hemolytic Autoimmune Anemia Diseases 0.000 description 1
- 208000027761 Hepatic autoimmune disease Diseases 0.000 description 1
- ZZLWLWSUIBSMNP-CIUDSAMLSA-N His-Asp-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZZLWLWSUIBSMNP-CIUDSAMLSA-N 0.000 description 1
- IMCHNUANCIGUKS-SRVKXCTJSA-N His-Glu-Arg Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IMCHNUANCIGUKS-SRVKXCTJSA-N 0.000 description 1
- XVZJRZQIHJMUBG-TUBUOCAGSA-N His-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CC1=CN=CN1)N XVZJRZQIHJMUBG-TUBUOCAGSA-N 0.000 description 1
- PUFNQIPSRXVLQJ-IHRRRGAJSA-N His-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N PUFNQIPSRXVLQJ-IHRRRGAJSA-N 0.000 description 1
- 208000017604 Hodgkin disease Diseases 0.000 description 1
- 208000021519 Hodgkin lymphoma Diseases 0.000 description 1
- 208000010747 Hodgkins lymphoma Diseases 0.000 description 1
- 101000914673 Homo sapiens Fanconi anemia group A protein Proteins 0.000 description 1
- 101000777663 Homo sapiens Hematopoietic progenitor cell antigen CD34 Proteins 0.000 description 1
- 101001019588 Homo sapiens Interleukin-17 receptor D Proteins 0.000 description 1
- 101000998178 Homo sapiens Interleukin-17C Proteins 0.000 description 1
- 101000998176 Homo sapiens Interleukin-17D Proteins 0.000 description 1
- 101000617830 Homo sapiens Sterol O-acyltransferase 1 Proteins 0.000 description 1
- 206010020751 Hypersensitivity Diseases 0.000 description 1
- 206010020850 Hyperthyroidism Diseases 0.000 description 1
- 206010021245 Idiopathic thrombocytopenic purpura Diseases 0.000 description 1
- GAZGFPOZOLEYAJ-YTFOTSKYSA-N Ile-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N GAZGFPOZOLEYAJ-YTFOTSKYSA-N 0.000 description 1
- GVNNAHIRSDRIII-AJNGGQMLSA-N Ile-Lys-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N GVNNAHIRSDRIII-AJNGGQMLSA-N 0.000 description 1
- RQJUKVXWAKJDBW-SVSWQMSJSA-N Ile-Ser-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N RQJUKVXWAKJDBW-SVSWQMSJSA-N 0.000 description 1
- 206010061218 Inflammation Diseases 0.000 description 1
- 102000004554 Interleukin-17 Receptors Human genes 0.000 description 1
- 108010017525 Interleukin-17 Receptors Proteins 0.000 description 1
- 101710186071 Interleukin-17 receptor B Proteins 0.000 description 1
- 102100035015 Interleukin-17 receptor D Human genes 0.000 description 1
- 102100033105 Interleukin-17C Human genes 0.000 description 1
- 102100033096 Interleukin-17D Human genes 0.000 description 1
- 108091092195 Intron Proteins 0.000 description 1
- 241000581650 Ivesia Species 0.000 description 1
- 208000008839 Kidney Neoplasms Diseases 0.000 description 1
- 102220470475 L-seryl-tRNA(Sec) kinase_C57L_mutation Human genes 0.000 description 1
- 208000031671 Large B-Cell Diffuse Lymphoma Diseases 0.000 description 1
- 208000018142 Leiomyosarcoma Diseases 0.000 description 1
- REPPKAMYTOJTFC-DCAQKATOSA-N Leu-Arg-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O REPPKAMYTOJTFC-DCAQKATOSA-N 0.000 description 1
- VWHGTYCRDRBSFI-ZETCQYMHSA-N Leu-Gly-Gly Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)NCC(O)=O VWHGTYCRDRBSFI-ZETCQYMHSA-N 0.000 description 1
- HRTRLSRYZZKPCO-BJDJZHNGSA-N Leu-Ile-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HRTRLSRYZZKPCO-BJDJZHNGSA-N 0.000 description 1
- YQFZRHYZLARWDY-IHRRRGAJSA-N Leu-Val-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN YQFZRHYZLARWDY-IHRRRGAJSA-N 0.000 description 1
- 206010058467 Lung neoplasm malignant Diseases 0.000 description 1
- 208000031422 Lymphocytic Chronic B-Cell Leukemia Diseases 0.000 description 1
- KNKHAVVBVXKOGX-JXUBOQSCSA-N Lys-Ala-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KNKHAVVBVXKOGX-JXUBOQSCSA-N 0.000 description 1
- JMNRXRPBHFGXQX-GUBZILKMSA-N Lys-Ser-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JMNRXRPBHFGXQX-GUBZILKMSA-N 0.000 description 1
- 208000025205 Mantle-Cell Lymphoma Diseases 0.000 description 1
- NKDSBBBPGIVWEI-RCWTZXSCSA-N Met-Arg-Thr Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NKDSBBBPGIVWEI-RCWTZXSCSA-N 0.000 description 1
- SODXFJOPSCXOHE-IHRRRGAJSA-N Met-Leu-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O SODXFJOPSCXOHE-IHRRRGAJSA-N 0.000 description 1
- UZBQXELAFPCGRV-SZMVWBNQSA-N Met-Trp-Arg Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UZBQXELAFPCGRV-SZMVWBNQSA-N 0.000 description 1
- FSTWDRPCQQUJIT-NHCYSSNCSA-N Met-Val-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCSC)N FSTWDRPCQQUJIT-NHCYSSNCSA-N 0.000 description 1
- 101710151805 Mitochondrial intermediate peptidase 1 Proteins 0.000 description 1
- 102100024193 Mitogen-activated protein kinase 1 Human genes 0.000 description 1
- 208000034578 Multiple myelomas Diseases 0.000 description 1
- 101000854961 Mus musculus WD repeat and HMG-box DNA-binding protein 1 Proteins 0.000 description 1
- 241000699669 Mus saxicola Species 0.000 description 1
- 201000003793 Myelodysplastic syndrome Diseases 0.000 description 1
- 208000033761 Myelogenous Chronic BCR-ABL Positive Leukemia Diseases 0.000 description 1
- 208000033776 Myeloid Acute Leukemia Diseases 0.000 description 1
- 208000009525 Myocarditis Diseases 0.000 description 1
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 1
- 208000001894 Nasopharyngeal Neoplasms Diseases 0.000 description 1
- 206010061306 Nasopharyngeal cancer Diseases 0.000 description 1
- 208000012902 Nervous system disease Diseases 0.000 description 1
- 208000015914 Non-Hodgkin lymphomas Diseases 0.000 description 1
- 241000283973 Oryctolagus cuniculus Species 0.000 description 1
- 206010033128 Ovarian cancer Diseases 0.000 description 1
- 206010061535 Ovarian neoplasm Diseases 0.000 description 1
- 238000002944 PCR assay Methods 0.000 description 1
- 244000082204 Phyllostachys viridis Species 0.000 description 1
- 235000015334 Phyllostachys viridis Nutrition 0.000 description 1
- 206010035226 Plasma cell myeloma Diseases 0.000 description 1
- KTFZQPLSPLWLKN-KKUMJFAQSA-N Pro-Gln-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KTFZQPLSPLWLKN-KKUMJFAQSA-N 0.000 description 1
- ULIWFCCJIOEHMU-BQBZGAKWSA-N Pro-Gly-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 ULIWFCCJIOEHMU-BQBZGAKWSA-N 0.000 description 1
- BRJGUPWVFXKBQI-XUXIUFHCSA-N Pro-Leu-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BRJGUPWVFXKBQI-XUXIUFHCSA-N 0.000 description 1
- FHZJRBVMLGOHBX-GUBZILKMSA-N Pro-Pro-Asp Chemical compound OC(=O)C[C@H](NC(=O)[C@@H]1CCCN1C(=O)[C@@H]1CCCN1)C(O)=O FHZJRBVMLGOHBX-GUBZILKMSA-N 0.000 description 1
- DWPXHLIBFQLKLK-CYDGBPFRSA-N Pro-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 DWPXHLIBFQLKLK-CYDGBPFRSA-N 0.000 description 1
- LNICFEXCAHIJOR-DCAQKATOSA-N Pro-Ser-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LNICFEXCAHIJOR-DCAQKATOSA-N 0.000 description 1
- CHYAYDLYYIJCKY-OSUNSFLBSA-N Pro-Thr-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CHYAYDLYYIJCKY-OSUNSFLBSA-N 0.000 description 1
- FDMCIBSQRKFSTJ-RHYQMDGZSA-N Pro-Thr-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O FDMCIBSQRKFSTJ-RHYQMDGZSA-N 0.000 description 1
- 102100040678 Programmed cell death protein 1 Human genes 0.000 description 1
- 206010060862 Prostate cancer Diseases 0.000 description 1
- 208000000236 Prostatic Neoplasms Diseases 0.000 description 1
- 238000010802 RNA extraction kit Methods 0.000 description 1
- 108020004511 Recombinant DNA Proteins 0.000 description 1
- 208000015634 Rectal Neoplasms Diseases 0.000 description 1
- 206010038389 Renal cancer Diseases 0.000 description 1
- 206010039085 Rhinitis allergic Diseases 0.000 description 1
- 206010039710 Scleroderma Diseases 0.000 description 1
- BTKUIVBNGBFTTP-WHFBIAKZSA-N Ser-Ala-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)NCC(O)=O BTKUIVBNGBFTTP-WHFBIAKZSA-N 0.000 description 1
- OLIJLNWFEQEFDM-SRVKXCTJSA-N Ser-Asp-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OLIJLNWFEQEFDM-SRVKXCTJSA-N 0.000 description 1
- RWDVVSKYZBNDCO-MELADBBJSA-N Ser-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CO)N)C(=O)O RWDVVSKYZBNDCO-MELADBBJSA-N 0.000 description 1
- VLMIUSLQONKLDV-HEIBUPTGSA-N Ser-Thr-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VLMIUSLQONKLDV-HEIBUPTGSA-N 0.000 description 1
- HAYADTTXNZFUDM-IHRRRGAJSA-N Ser-Tyr-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O HAYADTTXNZFUDM-IHRRRGAJSA-N 0.000 description 1
- PMTWIUBUQRGCSB-FXQIFTODSA-N Ser-Val-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O PMTWIUBUQRGCSB-FXQIFTODSA-N 0.000 description 1
- 108020004688 Small Nuclear RNA Proteins 0.000 description 1
- 102000039471 Small Nuclear RNA Human genes 0.000 description 1
- 108020004459 Small interfering RNA Proteins 0.000 description 1
- 208000021712 Soft tissue sarcoma Diseases 0.000 description 1
- 208000027520 Somatoform disease Diseases 0.000 description 1
- 102100021993 Sterol O-acyltransferase 1 Human genes 0.000 description 1
- 101000697584 Streptomyces lavendulae Streptothricin acetyltransferase Proteins 0.000 description 1
- 241000282898 Sus scrofa Species 0.000 description 1
- 208000018359 Systemic autoimmune disease Diseases 0.000 description 1
- 206010042971 T-cell lymphoma Diseases 0.000 description 1
- 208000027585 T-cell non-Hodgkin lymphoma Diseases 0.000 description 1
- 238000010459 TALEN Methods 0.000 description 1
- 208000024313 Testicular Neoplasms Diseases 0.000 description 1
- 206010057644 Testis cancer Diseases 0.000 description 1
- 210000000068 Th17 cell Anatomy 0.000 description 1
- KRPKYGOFYUNIGM-XVSYOHENSA-N Thr-Asp-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O KRPKYGOFYUNIGM-XVSYOHENSA-N 0.000 description 1
- LAFLAXHTDVNVEL-WDCWCFNPSA-N Thr-Gln-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O LAFLAXHTDVNVEL-WDCWCFNPSA-N 0.000 description 1
- UDQBCBUXAQIZAK-GLLZPBPUSA-N Thr-Glu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UDQBCBUXAQIZAK-GLLZPBPUSA-N 0.000 description 1
- MEJHFIOYJHTWMK-VOAKCMCISA-N Thr-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)[C@@H](C)O MEJHFIOYJHTWMK-VOAKCMCISA-N 0.000 description 1
- VRUFCJZQDACGLH-UVOCVTCTSA-N Thr-Leu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VRUFCJZQDACGLH-UVOCVTCTSA-N 0.000 description 1
- IJVNLNRVDUTWDD-MEYUZBJRSA-N Thr-Leu-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IJVNLNRVDUTWDD-MEYUZBJRSA-N 0.000 description 1
- AAZOYLQUEQRUMZ-GSSVUCPTSA-N Thr-Thr-Asn Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(N)=O AAZOYLQUEQRUMZ-GSSVUCPTSA-N 0.000 description 1
- BBPCSGKKPJUYRB-UVOCVTCTSA-N Thr-Thr-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O BBPCSGKKPJUYRB-UVOCVTCTSA-N 0.000 description 1
- 208000031981 Thrombocytopenic Idiopathic Purpura Diseases 0.000 description 1
- 208000024770 Thyroid neoplasm Diseases 0.000 description 1
- 108020004566 Transfer RNA Proteins 0.000 description 1
- PTAWAMWPRFTACW-SZMVWBNQSA-N Trp-Gln-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N PTAWAMWPRFTACW-SZMVWBNQSA-N 0.000 description 1
- CXPJPTFWKXNDKV-NUTKFTJISA-N Trp-Leu-Ala Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 CXPJPTFWKXNDKV-NUTKFTJISA-N 0.000 description 1
- YKCXQOBTISTQJD-BZSNNMDCSA-N Tyr-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N YKCXQOBTISTQJD-BZSNNMDCSA-N 0.000 description 1
- 201000006704 Ulcerative Colitis Diseases 0.000 description 1
- 208000007097 Urinary Bladder Neoplasms Diseases 0.000 description 1
- 208000006105 Uterine Cervical Neoplasms Diseases 0.000 description 1
- ASQFIHTXXMFENG-XPUUQOCRSA-N Val-Ala-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O ASQFIHTXXMFENG-XPUUQOCRSA-N 0.000 description 1
- IWZYXFRGWKEKBJ-GVXVVHGQSA-N Val-Gln-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N IWZYXFRGWKEKBJ-GVXVVHGQSA-N 0.000 description 1
- LYERIXUFCYVFFX-GVXVVHGQSA-N Val-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LYERIXUFCYVFFX-GVXVVHGQSA-N 0.000 description 1
- VVIZITNVZUAEMI-DLOVCJGASA-N Val-Val-Gln Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(N)=O VVIZITNVZUAEMI-DLOVCJGASA-N 0.000 description 1
- 208000033559 Waldenström macroglobulinemia Diseases 0.000 description 1
- 108010017070 Zinc Finger Nucleases Proteins 0.000 description 1
- 230000009471 action Effects 0.000 description 1
- 230000003213 activating effect Effects 0.000 description 1
- 230000001154 acute effect Effects 0.000 description 1
- 230000033289 adaptive immune response Effects 0.000 description 1
- 208000026935 allergic disease Diseases 0.000 description 1
- 201000010105 allergic rhinitis Diseases 0.000 description 1
- 230000007815 allergy Effects 0.000 description 1
- 210000004102 animal cell Anatomy 0.000 description 1
- 230000006909 anti-apoptosis Effects 0.000 description 1
- 108010068380 arginylarginine Proteins 0.000 description 1
- 108010077245 asparaginyl-proline Proteins 0.000 description 1
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 1
- 201000000448 autoimmune hemolytic anemia Diseases 0.000 description 1
- 201000003710 autoimmune thrombocytopenic purpura Diseases 0.000 description 1
- 239000011425 bamboo Substances 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 208000026900 bile duct neoplasm Diseases 0.000 description 1
- 210000004899 c-terminal region Anatomy 0.000 description 1
- 238000010805 cDNA synthesis kit Methods 0.000 description 1
- 238000004113 cell culture Methods 0.000 description 1
- 239000006285 cell suspension Substances 0.000 description 1
- 230000001413 cellular effect Effects 0.000 description 1
- 201000010881 cervical cancer Diseases 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- 208000006990 cholangiocarcinoma Diseases 0.000 description 1
- 208000032852 chronic lymphocytic leukemia Diseases 0.000 description 1
- 210000001728 clone cell Anatomy 0.000 description 1
- 238000010367 cloning Methods 0.000 description 1
- 208000029742 colonic neoplasm Diseases 0.000 description 1
- 230000001276 controlling effect Effects 0.000 description 1
- 210000004748 cultured cell Anatomy 0.000 description 1
- 238000012258 culturing Methods 0.000 description 1
- 108010004073 cysteinylcysteine Proteins 0.000 description 1
- 239000005547 deoxyribonucleotide Substances 0.000 description 1
- 125000002637 deoxyribonucleotide group Chemical group 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 206010012601 diabetes mellitus Diseases 0.000 description 1
- 238000003745 diagnosis Methods 0.000 description 1
- 206010012818 diffuse large B-cell lymphoma Diseases 0.000 description 1
- 230000007783 downstream signaling Effects 0.000 description 1
- 238000009509 drug development Methods 0.000 description 1
- 238000003255 drug test Methods 0.000 description 1
- 238000004520 electroporation Methods 0.000 description 1
- 230000008030 elimination Effects 0.000 description 1
- 238000003379 elimination reaction Methods 0.000 description 1
- 210000002889 endothelial cell Anatomy 0.000 description 1
- 210000003979 eosinophil Anatomy 0.000 description 1
- 210000002919 epithelial cell Anatomy 0.000 description 1
- 201000003444 follicular lymphoma Diseases 0.000 description 1
- 235000013305 food Nutrition 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 230000005017 genetic modification Effects 0.000 description 1
- 230000007614 genetic variation Effects 0.000 description 1
- 235000013617 genetically modified food Nutrition 0.000 description 1
- 208000005017 glioblastoma Diseases 0.000 description 1
- 108010049041 glutamylalanine Proteins 0.000 description 1
- 108010027668 glycyl-alanyl-valine Proteins 0.000 description 1
- 201000010536 head and neck cancer Diseases 0.000 description 1
- 208000014829 head and neck neoplasm Diseases 0.000 description 1
- 208000006454 hepatitis Diseases 0.000 description 1
- 231100000283 hepatitis Toxicity 0.000 description 1
- 210000001822 immobilized cell Anatomy 0.000 description 1
- 230000028993 immune response Effects 0.000 description 1
- 230000000984 immunochemical effect Effects 0.000 description 1
- 238000000338 in vitro Methods 0.000 description 1
- 208000027866 inflammatory disease Diseases 0.000 description 1
- 230000002757 inflammatory effect Effects 0.000 description 1
- 230000004054 inflammatory process Effects 0.000 description 1
- 230000028709 inflammatory response Effects 0.000 description 1
- 230000015788 innate immune response Effects 0.000 description 1
- 230000010354 integration Effects 0.000 description 1
- 230000000968 intestinal effect Effects 0.000 description 1
- 108010031424 isoleucyl-prolyl-proline Proteins 0.000 description 1
- 201000010982 kidney cancer Diseases 0.000 description 1
- 108010051673 leucyl-glycyl-phenylalanine Proteins 0.000 description 1
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 1
- 239000003446 ligand Substances 0.000 description 1
- 206010024627 liposarcoma Diseases 0.000 description 1
- 210000004185 liver Anatomy 0.000 description 1
- 201000007270 liver cancer Diseases 0.000 description 1
- 208000014018 liver neoplasm Diseases 0.000 description 1
- 201000005202 lung cancer Diseases 0.000 description 1
- 208000020816 lung neoplasm Diseases 0.000 description 1
- 210000004698 lymphocyte Anatomy 0.000 description 1
- 230000000527 lymphocytic effect Effects 0.000 description 1
- 108010003700 lysyl aspartic acid Proteins 0.000 description 1
- 230000012976 mRNA stabilization Effects 0.000 description 1
- 210000002540 macrophage Anatomy 0.000 description 1
- 210000001161 mammalian embryo Anatomy 0.000 description 1
- 201000007924 marginal zone B-cell lymphoma Diseases 0.000 description 1
- 208000021937 marginal zone lymphoma Diseases 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 230000013011 mating Effects 0.000 description 1
- 230000035800 maturation Effects 0.000 description 1
- 230000001404 mediated effect Effects 0.000 description 1
- 201000001441 melanoma Diseases 0.000 description 1
- 230000002503 metabolic effect Effects 0.000 description 1
- 108091070501 miRNA Proteins 0.000 description 1
- 239000002679 microRNA Substances 0.000 description 1
- 238000000520 microinjection Methods 0.000 description 1
- 239000000203 mixture Substances 0.000 description 1
- 210000001616 monocyte Anatomy 0.000 description 1
- 238000010172 mouse model Methods 0.000 description 1
- 201000008383 nephritis Diseases 0.000 description 1
- 210000002569 neuron Anatomy 0.000 description 1
- 230000003472 neutralizing effect Effects 0.000 description 1
- 208000002154 non-small cell lung carcinoma Diseases 0.000 description 1
- 238000007899 nucleic acid hybridization Methods 0.000 description 1
- 238000002515 oligonucleotide synthesis Methods 0.000 description 1
- 201000008968 osteosarcoma Diseases 0.000 description 1
- 102000002574 p38 Mitogen-Activated Protein Kinases Human genes 0.000 description 1
- 108010068338 p38 Mitogen-Activated Protein Kinases Proteins 0.000 description 1
- 208000027753 pain disease Diseases 0.000 description 1
- 230000008506 pathogenesis Effects 0.000 description 1
- 230000001575 pathological effect Effects 0.000 description 1
- 230000007170 pathology Effects 0.000 description 1
- 230000003285 pharmacodynamic effect Effects 0.000 description 1
- 230000035479 physiological effects, processes and functions Effects 0.000 description 1
- 208000031223 plasma cell leukemia Diseases 0.000 description 1
- 230000003234 polygenic effect Effects 0.000 description 1
- 108091033319 polynucleotide Proteins 0.000 description 1
- 102000040430 polynucleotide Human genes 0.000 description 1
- 239000002157 polynucleotide Substances 0.000 description 1
- 239000013641 positive control Substances 0.000 description 1
- 230000002265 prevention Effects 0.000 description 1
- 108010015796 prolylisoleucine Proteins 0.000 description 1
- 230000001737 promoting effect Effects 0.000 description 1
- 230000001681 protective effect Effects 0.000 description 1
- 206010038038 rectal cancer Diseases 0.000 description 1
- 201000001275 rectum cancer Diseases 0.000 description 1
- 238000010839 reverse transcription Methods 0.000 description 1
- 201000009410 rhabdomyosarcoma Diseases 0.000 description 1
- 206010039073 rheumatoid arthritis Diseases 0.000 description 1
- 125000002652 ribonucleotide group Chemical group 0.000 description 1
- 239000011435 rock Substances 0.000 description 1
- 230000000392 somatic effect Effects 0.000 description 1
- 206010042863 synovial sarcoma Diseases 0.000 description 1
- 201000000596 systemic lupus erythematosus Diseases 0.000 description 1
- 238000010809 targeting technique Methods 0.000 description 1
- 201000003120 testicular cancer Diseases 0.000 description 1
- 201000002510 thyroid cancer Diseases 0.000 description 1
- 238000001890 transfection Methods 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 230000009261 transgenic effect Effects 0.000 description 1
- 238000002054 transplantation Methods 0.000 description 1
- 108010080629 tryptophan-leucine Proteins 0.000 description 1
- 208000029729 tumor suppressor gene on chromosome 11 Diseases 0.000 description 1
- 201000005112 urinary bladder cancer Diseases 0.000 description 1
- 238000012800 visualization Methods 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
- C07K14/705—Receptors; Cell surface antigens; Cell surface determinants
- C07K14/715—Receptors; Cell surface antigens; Cell surface determinants for cytokines; for lymphokines; for interferons
- C07K14/7155—Receptors; Cell surface antigens; Cell surface determinants for cytokines; for lymphokines; for interferons for interleukins [IL]
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01K—ANIMAL HUSBANDRY; AVICULTURE; APICULTURE; PISCICULTURE; FISHING; REARING OR BREEDING ANIMALS, NOT OTHERWISE PROVIDED FOR; NEW BREEDS OF ANIMALS
- A01K67/00—Rearing or breeding animals, not otherwise provided for; New or modified breeds of animals
- A01K67/027—New or modified breeds of vertebrates
- A01K67/0275—Genetically modified vertebrates, e.g. transgenic
- A01K67/0278—Knock-in vertebrates, e.g. humanised vertebrates
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/85—Vectors or expression systems specially adapted for eukaryotic hosts for animal cells
- C12N15/8509—Vectors or expression systems specially adapted for eukaryotic hosts for animal cells for producing genetically modified animals, e.g. transgenic
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01K—ANIMAL HUSBANDRY; AVICULTURE; APICULTURE; PISCICULTURE; FISHING; REARING OR BREEDING ANIMALS, NOT OTHERWISE PROVIDED FOR; NEW BREEDS OF ANIMALS
- A01K2217/00—Genetically modified animals
- A01K2217/07—Animals genetically altered by homologous recombination
- A01K2217/072—Animals genetically altered by homologous recombination maintaining or altering function, i.e. knock in
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01K—ANIMAL HUSBANDRY; AVICULTURE; APICULTURE; PISCICULTURE; FISHING; REARING OR BREEDING ANIMALS, NOT OTHERWISE PROVIDED FOR; NEW BREEDS OF ANIMALS
- A01K2227/00—Animals characterised by species
- A01K2227/10—Mammal
- A01K2227/105—Murine
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01K—ANIMAL HUSBANDRY; AVICULTURE; APICULTURE; PISCICULTURE; FISHING; REARING OR BREEDING ANIMALS, NOT OTHERWISE PROVIDED FOR; NEW BREEDS OF ANIMALS
- A01K2267/00—Animals characterised by purpose
- A01K2267/03—Animal model, e.g. for test or diseases
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Genetics & Genomics (AREA)
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Engineering & Computer Science (AREA)
- Zoology (AREA)
- Biotechnology (AREA)
- General Health & Medical Sciences (AREA)
- General Engineering & Computer Science (AREA)
- Molecular Biology (AREA)
- Environmental Sciences (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biophysics (AREA)
- Wood Science & Technology (AREA)
- Biomedical Technology (AREA)
- Veterinary Medicine (AREA)
- Biochemistry (AREA)
- Animal Behavior & Ethology (AREA)
- Immunology (AREA)
- Microbiology (AREA)
- Animal Husbandry (AREA)
- Biodiversity & Conservation Biology (AREA)
- Plant Pathology (AREA)
- Cell Biology (AREA)
- Physics & Mathematics (AREA)
- Toxicology (AREA)
- Gastroenterology & Hepatology (AREA)
- Medicinal Chemistry (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Medicines That Contain Protein Lipid Enzymes And Other Medicines (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
Abstract
本发明提供了一种IL17RB基因人源化的非人动物的构建方法,利用同源重组的方式将编码人IL17RB蛋白的核苷酸序列导入非人动物基因组中,该动物体内能正常表达人源化IL17RB蛋白,可以作为人IL17RB信号机理研究、肿瘤及免疫性疾病药物筛选的动物模型,对免疫靶点的新药研发具有重要的应用价值。本发明还提供了一种人源化IL17RB蛋白、一种人源化IL17RB基因、一种IL17RB基因的靶向载体,以及上述构建方法获得的非人动物及其在生物医药领域的应用。
Description
技术领域
本发明属于动物基因工程和基因遗传修饰领域,具体地说,涉及一种IL17RB基因改造非人动物模型的构建方法及其在生物医药领域的应用。
背景技术
白细胞介素(Interleukin,IL)17细胞因子(简称IL17)家族是一类特征性细胞因子,主要由活化的Th17细胞分泌。IL17家族目前已经发现了6个成员,分别是IL17A、IL17B、IL17C、IL17D、IL17E和IL17F。IL17受体(简称IL17R)也形成了一个独特的家族,目前发现了5个同源亚基,分别是IL17RA、IL17RB、IL17RC、IL17RD、IL17RE。IL17通过与受体结合,启动其下游信号通路(包括MAP激酶途径、NF-kB途径、mRNA稳定信号途径、ERK信号途径以及JAK/STAT信号途径等),刺激多种细胞产生炎症介质,已成为免疫和炎性疾病的关键参与因子,并可导致器官特异性或全身性自身免疫疾病。
IL17RB在机体许多器官组织表面均有表达,通过与IL17B和IL17E结合,激活Jun激酶(JNK)、p38丝裂原激活蛋白激酶(p38MAPK)和核因子-κB(NF-κB)、嗜酸细胞产物如单核细胞趋化蛋白-1(MCP-1)、巨噬细胞炎性蛋白-1α(MIP-1α)、IL6和IL8的表达,参与先天和适应性免疫应答。近年来的研究发现,IL17RB与肿瘤(例如乳腺癌、胃癌、胰腺癌等)和自身免疫性疾病(例如哮喘、过敏性鼻炎等)等密切相关。例如,IL-17RB可通过NF-kB通路介导抗凋亡,促进乳腺肿瘤的发生;胃癌组织IL17RB蛋白表达水平高于癌旁组织,对胃癌的发生发展具有一定作用;利用IL17RB中和抗体能够有效抑制胰腺癌的生长;IL17RB在肠道炎症反应中表达水平上调;IL17RB遗传性变异可能在Th2介导的哮喘免疫反应中发挥着重要的作用。
实验动物疾病模型对于研究人类疾病发生的病因、发病机制、开发防治技术和开发药物是不可缺少的研究工具。但由于动物与人类的生理结构和代谢系统本身的差异,传统的动物模型并不能很好的反映人体的真实状况,在动物体内建立更接近人类的生理特征的疾病模型是生物医药行业的迫切需求。随着基因工程技术的不断发展和成熟,用人类基因替代或置换动物的同源性基因已经实现,通过这种方式开发人源化实验动物模型是动物模型未来的发展方向。其中基因人源化动物模型,即利用基因编辑技术,用人源正常或突变基因替换动物基因组的同源基因,可建立更接近人类生理或疾病特征的正常或突变基因动物模型。基因人源化动物不但本身具有重要应用价值,如通过基因人源化可改进和提升细胞或组织移植人源化动物模型,更重要的是,由于人类基因片段的插入,动物体内可表达或部分表达人源蛋白,可作为仅能识别人蛋白序列的药物的靶点,为在动物水平进行抗人抗体及其它药物的筛选提供了可能。但是,并不是任何基因或者替换任何基因的任何区域,都可以成功实现人源化动物模型的制备。在该领域,最重要的也是最具挑战性的是根据实际需求,对特定非人动物基因的特定序列,选择相应人源基因的特定区域进行插入或替换,从而获得可以表达人源化蛋白且具有抗体筛选等功能的人源化动物模型。
因此,由于动物与人类在生理学及病理学方面存在差异,加上IL17RB基因的复杂性,如何能构建出“有效”的IL17RB人源化动物模型用于新药研发仍是最大的挑战。
发明内容
鉴于IL17/IL17R各配体及受体复杂的作用机制,以及在肿瘤和自身免疫性疾病治疗领域的巨大应用价值,为进一步探索其相关生物学特性,提高临床前期药效试验的有效性,提高研发成功率,使临床前期的试验更有效并使研发失败最小化,本领域急需开发更多IL17/IL17R相关信号通路的非人动物模型。此外,本方法得到的非人动物还可与其它基因人源化非人动物交配得到多基因人源化动物模型,用于筛选和评估针对该信号通路的人用药及联合用药的药效研究。本发明在学术和临床研究中具有广阔的应用前景。具体的,
本发明的第一方面,提供了一种IL17RB基因人源化的非人动物的构建方法,所述的非人动物的基因组中包含人IL17RB基因的全部或部分。优选的,包含人IL17RB基因的1号至11号外显子的全部或部分。进一步优选的,包含人IL17RB基因的1号至11号外显子中的任一个或两个以上的组合。更进一步优选的,包含人IL17RB基因的2号至10号外显子。
在本发明的一个具体实施方式中,所述的非人动物的基因组中包含人IL17RB基因的2号外显子的部分、3号至9号外显子的全部和10号外显子的部分,优选还包含2-3号内含子和/或9-10号内含子。其中,2号外显子的部分至少包含从2号外显子3’-5’的长度为18bp、19bp、20bp、21bp、22bp、23bp、24bp或25bp的核苷酸序列,10号外显子的部分至少包含从10号外显子5’-3’的长度为10bp、11、12、13、14、15bp、20bp、30bp或40bp的核苷酸序列。
优选的,所述的非人动物包含人源化IL17RB基因。所述的人源化IL17RB基因包含人IL17RB基因的全部或部分。优选包含人IL17RB基因的2号外显子的部分、3号至9号外显子的全部和10号外显子的部分,优选还包含2-3号内含子和/或9-10号内含子。
优选的,所述的构建方法包括用包含人IL17RB基因的1号至11号外显子的全部或部分插入或替换至非人动物IL17RB基因座。进一步优选的,包括用包含人IL17RB基因的1号至11号外显子中的任一个或两个以上的组合插入或替换至非人动物IL17RB基因座。更进一步优选的,包括用包含人IL17RB基因的2号至10号外显子插入或替换至非人动物IL17RB基因座。
在本发明的一个具体实施方式中,所述的构建方法包括用包含人IL17RB基因的2号外显子的部分、3号至9号外显子的全部和10号外显子的部分,优选还包含2-3号内含子和/或9-10号内含子,插入或替换至非人动物IL17RB基因座。
在本发明的一个具体实施方式中,所述的构建方法包括用包含人源化IL17RB基因的核苷酸序列插入或替换至非人动物IL17RB基因座。
本发明的第二方面,提供了一种IL17RB基因人源化的非人动物的构建方法,所述的非人动物体内表达人或人源化IL17RB蛋白。
所述的人源化IL17RB蛋白包含人IL17RB蛋白的全部或部分。优选的,所述的人源化IL17RB蛋白包含人IL17RB蛋白胞外区的全部或部分。进一步优选的,所述的人源化IL17RB蛋白还包含人IL17RB蛋白的信号肽、跨膜区和/或胞质区。
优选的,所述的人源化IL17RB蛋白包含人IL17RB蛋白胞外区至少200个氨基酸,例如至少200、210、220、230、240、250、260、261、262、263、264、265、266、267、268、269、270、280、285个氨基酸,优选的,所述的人源化IL17RB蛋白包含人IL17RB蛋白胞外区266个氨基酸。
在本发明的一个具体实施方式中,所述的人源化IL17RB蛋白包含人IL17RB蛋白胞外区的从N端第1-10(例如1、2、3、4、5、6、7、8、9、10)个氨基酸开始至C端1-15(例如1、2、3、4、5、6、7、8、9、10、11、12、13、14、15)个氨基酸为止。
优选的,所述的构建方法包括用包含编码人IL17RB蛋白的胞外区的核苷酸序列插入或替换至非人动物IL17RB基因座。进一步优选的,所述的构建方法包括用包含编码人IL17RB蛋白胞外区的从N端第1-10(例如1、2、3、4、5、6、7、8、9、10)个氨基酸开始至C端1-15(例如1、2、3、4、5、6、7、8、9、10、11、12、13、14、15)个氨基酸的核苷酸序列插入或替换至非人动物IL17RB基因座。
优选的,所述的构建方法包括用包含编码人源化IL17RB蛋白的核苷酸序列插入或替换至非人动物IL17RB基因座。
本发明的第三方面,提供了一种IL17RB基因人源化的非人动物的构建方法,所述的构建方法包括用包含编码SEQ ID NO:2第22-287位的核苷酸序列替换至非人动物IL17RB基因座。
优选的,所述的构建方法包括用包含SEQ ID NO:5所示核苷酸序列替换至非人动物IL17RB基因座。
优选的,所述的插入或替换的位点为内源调控元件之后。
优选的,所述的插入首先破坏非人动物内源IL17RB基因的编码框或者所述的插入步骤即可实现内源IL17RB基因编码框不表达的目的。
优选的,所述的替换为相应位置的替换。
优选的,所述的替换至非人动物IL17RB基因座为替换非人动物IL17RB基因中编码SEQ ID NO:1第22-284位的核苷酸序列,或者替换非人动物与NCBI登录号为NC_000080.7的第29720048至29728828位所示序列相同的核苷酸序列。
所述的非人动物为小鼠或大鼠。
优选的,所述的非人动物体内表达人或人源化IL17RB蛋白。优选的,所述的人源化IL17RB蛋白包含SEQ ID NO:2第22-287位的氨基酸序列。
优选的,所述的非人动物内源IL17RB蛋白的表达降低或缺失。
在本发明的一个具体实施方式中,所述的人源化IL17RB蛋白包含SEQ ID NO:10所示的氨基酸序列,或者,包含与SEQ ID NO:10具有60%、65%、70%、75%、80%、85%、90%、91%、92%、93%、94%、95%、96%、97%、98%或至少99%同一性的氨基酸序列,或者,包含与SEQ ID NO:10所示氨基酸序列差异不超过10、9、8、7、6、5、4、3、2或不超过1个氨基酸,或者,包含具有SEQ ID NO:10所示的包括取代、缺失和/或插入一个或多个氨基酸残基的氨基酸序列。
优选的,所述的非人动物的基因组中包含人源化IL17RB基因。所述的人源化IL17RB基因包含SEQ ID NO:5所示的核苷酸序列。
在本发明的一个具体实施方式中,所述的人源化IL17RB基因转录的mRNA包含SEQID NO:9所示的核苷酸序列,或者,包含与SEQ ID NO:9所示的核苷酸序列具有60%、65%、70%、75%、80%、85%、90%、91%、92%、93%、94%、95%、96%、97%、98%或至少99%同一性的核苷酸序列,或者,包含与SEQ ID NO:9所示核苷酸序列差异不超过10、9、8、7、6、5、4、3、2或不超过1个核苷酸,或者,包含具有SEQ ID NO:9所示的核苷酸序列所示的,包括取代、缺失和/或插入一个或多个核苷酸的核苷酸序列。
优选的,所述的人IL17RB基因、人源化IL17RB基因、人源化IL17RB蛋白或者编码人IL17RB蛋白的核苷酸序列通过IL17RB内源性调控元件调控。
本发明使用基因编辑技术进行IL17RB基因人源化的非人动物的构建,所述基因编辑技术包括利用胚胎干细胞的基因打靶技术、CRISPR/Cas9技术、锌指核酸酶技术、转录激活子样效应因子核酸酶技术、归巢核酸内切酶或其他分子生物学技术。
在本发明的一个具体实施方式中,使用靶向载体进行非人动物的构建。
所述的靶向载体包含人IL17RB基因的全部或部分。优选的,包含人IL17RB基因的1号至11号外显子的全部或部分。进一步优选的,包含人IL17RB基因的1号至11号外显子中的任一个或两个以上的组合。更进一步优选的,包含人IL17RB基因的2号至10号外显子。再进一步优选的,所述的非人动物的基因组中包含人IL17RB基因的2号外显子的部分、3号至9号外显子的全部和10号外显子的部分,优选还包含2-3号内含子和/或9-10号内含子。其中,2号外显子的部分至少包含从2号外显子3’-5’的长度为18bp、19bp、20bp、21bp、22bp、23bp、24bp或25bp的核苷酸序列,10号外显子的部分至少包含从10号外显子5’-3’的长度为10bp、11、12、13、14、15bp、20bp、30bp或40bp的核苷酸序列。
在本发明的一个具体实施方式中,所述的靶向载体包含编码SEQ ID NO:2第22-287位的核苷酸序列或包含SEQ ID NO:5所示核苷酸序列。
所述的靶向载体还包含5’臂和/或3’臂。
其中,所述的5’臂为与待改变的转换区5’端同源的DNA片段。优选的,其选自与NCBI登录号为NC_000080.7至少具有90%同源性的核苷酸。进一步优选的,其长度为3000-4500bp。
在本发明的一个具体实施方式中,所述的5’臂的核苷酸序列如SEQ ID NO:3所示。
所述的3’臂为与待改变的转换区3’端同源的DNA片段。优选的,其选自与NCBI登录号为NC_000080.7至少具有90%同源性的核苷酸。进一步优选的,其长度为4000-5000bp。
在本发明的一个具体实施方式中,所述的3’臂的核苷酸序列如SEQ ID NO:4所示。
优选的,所述的待改变的转换区位于非人动物IL17RB基因的2号至10号外显子上。
在本发明的一个具体实施方式中,所述的构建方法包括将上述靶向载体导入非人动物细胞中,培养该细胞(优选为胚胎干细胞),然后将培养后的细胞移植至雌性非人动物输卵管内,允许其发育,鉴定筛选获得IL17RB基因人源化的非人动物。
优选的,为提高重组效率,还可以使用靶向IL17RB基因的sgRNA与上述靶向载体一起进行非人动物的构建。其中,所述的sgRNA靶向非人动物IL17RB基因,同时所述sgRNA的序列在待改变的IL17RB基因上的靶序列上。优选的,所述的sgRNA的靶位点位于IL17RB基因的2号和/或10号外显子上。
本发明的第四方面,提供了一种上述的构建方法获得的IL17RB基因人源化的非人动物。
本发明的第五方面,提供了一种IL17RB基因缺失的非人动物的构建方法,所述的构建方法包括敲除内源IL17RB基因的1号至11号外显子的全部或部分。优选敲除1号至11号外显子中的任一个或两个以上的组合。进一步优选包含2号至10号外显子。
在本发明的一个具体实施方式中,包含2号外显子的部分、3号至9号外显子的全部和10号外显子的部分,优选还包含2-3号内含子和/或9-10号内含子。
本发明的第六方面,提供了一种上述构建方法获得的IL17RB基因缺失的非人动物。
本发明的第七方面,提供了一种IL17RB基因的靶向载体,所述的靶向载体包含人IL17RB基因的全部或部分。优选的,包含人IL17RB基因的1号至11号外显子的全部或部分。进一步优选的,包含人IL17RB基因的1号至11号外显子中的任一个或两个以上的组合。更进一步优选的,包含人IL17RB基因的2号至10号外显子。再进一步优选的,所述的非人动物的基因组中包含人IL17RB基因的2号外显子的部分、3号至9号外显子的全部和10号外显子的部分,优选还包含2-3号内含子和/或9-10号内含子。其中,2号外显子的部分至少包含从2号外显子3’-5’的长度为18bp、19bp、20bp、21bp、22bp、23bp、24bp或25bp的核苷酸序列,10号外显子的部分至少包含从10号外显子5’-3’的长度为10bp、11、12、13、14、15bp、20bp、30bp或40bp的核苷酸序列。
在本发明的一个具体实施方式中,所述的靶向载体包含编码SEQ ID NO:2第22-287位的核苷酸序列或包含SEQ ID NO:5所示核苷酸序列。
所述的靶向载体还包含5’臂和/或3’臂。
其中,所述的5’臂为与待改变的转换区5’端同源的DNA片段。优选的,其选自与NCBI登录号为NC_000080.7至少具有90%同源性的核苷酸。进一步优选的,其长度为3000-4500bp。
在本发明的一个具体实施方式中,所述的5’臂的核苷酸序列如SEQ ID NO:3所示。
所述的3’臂为与待改变的转换区3’端同源的DNA片段。优选的,其选自与NCBI登录号为NC_000080.7至少具有90%同源性的核苷酸。进一步优选的,其长度为4000-5000bp。
在本发明的一个具体实施方式中,所述的3’臂的核苷酸序列如SEQ ID NO:4所示。
优选的,所述的待改变的转换区位于非人动物IL17RB基因的2号至10号外显子上。
优选的,所述的靶向载体还包含可选择的基因标记。
优选的,所述标记基因为负筛选标记的编码基因。进一步优选的,所述负筛选标记的编码基因为白喉毒素A亚基的编码基因(DTA)。
优选的,所述靶向载体还包括阳性克隆筛选的抗性基因。进一步优选的,所述阳性克隆筛选的抗性基因为新霉素磷酸转移酶编码序列Neo。
优选的,所述靶向载体还包括特异性重组系统。进一步优选的,所述特异性重组系统为Frt重组位点(也可选择常规的LoxP重组系统)。所述的特异性重组系统为2个,分别装在抗性基因的两侧。
本发明的第八方面,提供了一种包含上述靶向载体的细胞。
本发明的第九方面,提供了一种上述靶向载体或者包含靶向载体的细胞在IL17RB基因修饰中的应用。
本发明的第十方面,提供了一种人源化IL17RB蛋白,所述的人源化IL17RB蛋白包含人IL17RB蛋白的全部或部分。
优选的,所述的人源化IL17RB蛋白包含人IL17RB蛋白胞外区的全部或部分。进一步优选的,所述的人源化IL17RB蛋白还包含人IL17RB蛋白的信号肽、跨膜区和/或胞质区。
优选的,所述的人源化IL17RB蛋白包含人IL17RB蛋白胞外区至少200个氨基酸,例如至少200、210、220、230、240、250、260、261、262、263、264、265、266、267、268、269、270、280、285个氨基酸,优选的,所述的人源化IL17RB蛋白包含人IL17RB蛋白胞外区266个氨基酸。
在本发明的一个具体实施方式中,所述的人源化IL17RB蛋白包含人IL17RB蛋白胞外区的从N端第1-10(例如1、2、3、4、5、6、7、8、9、10)个氨基酸开始至C端1-15(例如1、2、3、4、5、6、7、8、9、10、11、12、13、14、15)个氨基酸为止。
优选的,所述的人源化IL17RB蛋白包含人IL17RB基因2号至10号外显子编码的氨基酸序列。进一步优选的,包含人IL17RB基因2号外显子的部分、3号至9号外显子的全部和10号外显子的部分编码的氨基酸序列。
优选的,所述的人源化IL17RB蛋白包含SEQ ID NO:2第22-287位氨基酸序列。
在本发明的一个具体实施方式中,所述的人源化IL17RB蛋白的氨基酸序列如SEQID NO:10所示,或者,与SEQ ID NO:10具有60%、65%、70%、75%、80%、85%、90%、91%、92%、93%、94%、95%、96%、97%、98%或至少99%同一性的氨基酸序列,或者,与SEQ ID NO:10所示氨基酸序列差异不超过10、9、8、7、6、5、4、3、2或不超过1个氨基酸,或者,具有SEQ ID NO:10所示的包括取代、缺失和/或插入一个或多个氨基酸残基的氨基酸序列。
本发明的第十一方面,提供了一种编码上述人源化IL17RB蛋白的人源化IL17RB基因。优选的,所述的人源化IL17RB基因包含人IL17RB基因的全部或部分。进一步优选包含人IL17RB基因的1号至11号外显子中的任一个或两个以上的组合。更进一步优选的,所述的人源化IL17RB基因包含人IL17RB的2号外显子的部分、3号至9号外显子的全部和10号外显子的部分。
优选的,所述的人源化IL17RB基因包含SEQ ID NO:5所示的核苷酸序列。
在本发明的一个具体实施方式中,所述的人源化IL17RB基因转录的mRNA包含SEQID NO:9所示的核苷酸序列,或者,包含与SEQ ID NO:9所示的核苷酸序列具有60%、65%、70%、75%、80%、85%、90%、91%、92%、93%、94%、95%、96%、97%、98%或至少99%同一性的核苷酸序列,或者,包含与SEQ ID NO:9所示核苷酸序列差异不超过10、9、8、7、6、5、4、3、2或不超过1个核苷酸,或者,包含具有SEQ ID NO:9所示。
本发明的第十二方面,提供了一种表达上述人源化IL17RB蛋白的构建体。优选的,所述的构建体包含上述人源化IL17RB基因。
本发明的第十三方面,提供了一种包含上述构建体的细胞。
本发明的第十四方面,提供了一种包含上述细胞的组织。
本发明的第十五方面,提供了一种多基因修饰的非人动物的构建方法,所述的构建方法包括:
(a)应用上述的构建方法制备获得非人动物;
(b)将步骤(a)制备获得的非人动物与除IL17RB外的其他基因修饰的动物交配、体外授精或直接进行基因编辑,并进行筛选,得到多基因人源化修饰的非人动物。
优选的,所述多基因人源化修饰的非人动物为双基因人源化非人动物、三基因人源化非人动物、四基因人源化非人动物、五基因人源化非人动物、六基因人源化非人动物、七基因人源化非人动物、八基因人源化非人动物或九基因人源化非人动物。
优选的,所述的除IL17RB外的其他基因修饰的动物选自基因PD-1、PD-L1、IL17A、IL17F、IL17RA或IL17RC等修饰的动物中的一种或两种以上的组合。
本发明的第十六方面,提供了一种荷瘤动物模型,所述的动物模型的制备方法包括通过上述的构建方法制备动物的步骤。
优选的,所述的荷瘤动物模型的制备方法还包括在上述方法制备的非人动物或其后代植入肿瘤细胞的步骤。
本发明的第十七方面,提供了一种IL17RB基因修饰的细胞、组织或器官,所述的细胞、组织或器官采用上述IL17RB基因人源化的非人动物的构建方法或多基因修饰的非人动物的构建方法获得,或者,所述的细胞、组织或器官来源于上述构建的IL17RB基因人源化的非人动物或者上述构建的多基因修饰的非人动物,或者上述的荷瘤动物模型。
本发明的第十八方面,提供了上述的构建方法获得的IL17RB基因人源化的非人动物、上述的人源化IL17RB蛋白或者上述的人源化IL17RB基因的应用,所述的应用为非疾病诊断、非疾病治疗目的,所述的应用包括:
A)涉及人类细胞的与IL17RB相关的免疫过程的产品开发中的应用;
B)作为药理学、免疫学、微生物学和医学研究的与IL17RB相关的模型系统中的应用;
C)涉及生产和利用动物实验疾病模型用于与IL17RB相关的病原学研究和/或用于开发诊断策略和/或用于开发治疗策略中的应用;
D)在体内研究人IL17RB信号通路调节剂的筛选、药效检测、评估疗效、验证或评价中的应用;或者,
E)研究IL17RB基因功能,研究针对人IL17RB靶位点的药物、药效,研究与IL17RB相关的免疫相关疾病药物以及抗肿瘤药物方面的应用。
本发明的第十九方面,提供了一种上述的构建方法获得的IL17RB基因人源化的非人动物、上述构建方法获得的IL17RB基因缺失的非人动物、上述构建方法获得的多基因修饰的非人动物、上述的荷瘤动物模型在制备治疗或预防肿瘤、免疫相关疾病或炎症的药物中的应用。
本发明所述的“肿瘤”包括但不限于淋巴瘤、脑癌、非小细胞肺癌、宫颈癌、食道癌、白血病、卵巢癌、鼻咽癌、乳腺癌、子宫内膜癌、结肠癌、直肠癌、胃癌、膀胱癌、肺癌、支气管癌、骨癌、前列腺癌、胰腺癌、肝和胆管癌、食管癌、肾癌、甲状腺癌、头颈部癌、睾丸癌、胶质母细胞瘤、星形细胞瘤、黑色素瘤、骨髓增生异常综合征、以及肉瘤。其中,所述的白血病选自急性淋巴细胞性(成淋巴细胞性)白血病、急性骨髓性白血病、髓性白血病、慢性淋巴细胞性白血病、多发性骨髓瘤、浆细胞白血病、以及慢性骨髓性白血病;所述淋巴瘤选自霍奇金淋巴瘤和非霍奇金淋巴瘤,包括B细胞淋巴瘤、弥漫性大B细胞淋巴瘤、滤泡性淋巴瘤、套细胞淋巴瘤、边缘区B细胞淋巴瘤、T细胞淋巴瘤、和瓦尔登斯特伦巨球蛋白血症;所述肉瘤选自骨肉瘤、尤文肉瘤、平滑肌肉瘤、滑膜肉瘤、软组织肉瘤、血管肉瘤、脂肪肉瘤、纤维肉瘤、横纹肌肉瘤、以及软骨肉瘤。
在本发明的一个具体实施方式中,所述的肿瘤选自乳腺癌、胃癌或胰腺癌。
本发明所述的“免疫相关疾病”包括但不限于过敏、哮喘、皮炎、心肌炎、肾炎、肝炎、系统性红斑狼疮、类风湿性关节炎、硬皮病、甲状腺功能亢进、原发性血小板减少性紫癜、自身免疫性溶血性贫血、溃疡性结肠炎、自身免疫性肝病、糖尿病、疼痛或神经障碍等。
本发明所述的“细胞”可以为任何来源于动物或人的细胞,包括但不限于淋巴细胞、单核细胞、巨噬细胞、内皮细胞、上皮细胞、CD34+胸腺细胞、神经元或者肿瘤细胞。
本发明所述的“人源化IL17RB蛋白”,包含来源于人IL17RB蛋白的部分和非人动物IL17RB蛋白的部分。
本发明所述的“人源化IL17RB基因”包含来源于人IL17RB基因的部分和非人动物IL17RB基因的部分。
本发明所述的“包含”或“包括”为开放式写法,当在本申请中用于描述蛋白质或核酸的序列时,所述蛋白质或核酸可以是由所述序列组成,或者在所述蛋白质或核酸的一端或两端可以具有额外的氨基酸或核苷酸,但仍然具有本发明所述的活性。
本发明所述的“xx号至xx号外显子”或者“xx号外显子至xx号外显子”包含外显子及其期间的内含子。例如“3号至9号外显子”包含3号外显子、3-4号内含子、4号外显子、4-5号内含子、5号外显子、5-6号内含子、6号外显子、6-7号内含子、7号外显子、7-8号内含子、8号外显子、8-9号内含子和9号外显子的核苷酸序列。
本发明所述的“x-xx号内含子”表示x号外显子至xx号外显子之间的内含子。例如“2-3号内含子”表示2号外显子至3号外显子之间的内含子。
本发明所述的“基因座”广义上讲代表基因在染色体上所占的位置,狭义上讲代表某一基因上的一段DNA片段,即可以是一个基因也可以是一个基因的一部分。例如所述的“IL17RB基因座”表示IL17RB基因上的任选一段的DNA片段。在本发明的一个具体实施方式中,被替换的IL17RB基因座可以是IL17RB基因2号外显子至10号外显子上的任选一段的DNA片段。
本发明所述的“核苷酸序列”包含天然的或经过修饰的核糖核苷酸序列、脱氧核糖核苷酸序列。优选为DNA、cDNA、pre-mRNA、mRNA、rRNA、hnRNA、miRNAs、scRNA、snRNA、siRNA、sgRNA、tRNA。
本发明所述“治疗(treating)”(或“治疗(treat)”或“治疗(treatment)”)表示减缓、中断、阻止、控制、停止、减轻、或逆转一种体征、症状、失调、病症、或疾病的进展或严重性,但不一定涉及所有疾病相关体征、症状、病症、或失调的完全消除。术语“治疗(treating)”等是指在疾病已开始发展后改善疾病或病理状态的体征、症状等等的治疗干预。
本发明所述“同源性”,是指在使用氨基酸序列或核苷酸序列的方面,本领域技术人员在保证与已知序列相似结构或功能的前提下,可以根据实际工作需要对序列进行调整,使使用序列与现有技术获得的序列相比,具有(包括但不限于)1%,2%,3%,4%,5%,6%,7%,8%,9%,10%,11%,12%,13%,14%,15%,16%,17%,18%,19%,20%,21%,22%,23%,24%,25%,26%,27%,28%,29%,30%,31%,32%,33%,34%,35%,36%,37%,38%,39%,40%,41%,42%,43%,44%,45%,46%,47%,48%,49%,50%,51%,52%,53%,54%,55%,56%,57%,58%,59%,60%,70%,80%,81%,82%,83%,84%,85%,86%,87%,88%,89%,90%,91%,92%,93%,94%,95%,96%,97%,98%,99%,99.1%,99.2%,99.3%,99.4%,99.5%,99.6%,99.7%,99.8%,99.9%的同一性。
本领域的技术人员能够确定并比较序列元件或同一性程度,以区分另外的小鼠和人序列。
在一个方面,所述非人动物是哺乳动物,优选啮齿类动物、猪、兔子、猴子等任何可以进行基因编辑制备基因人源化的非人哺乳动物。在一个方面,所述非人动物是小型哺乳动物,例如跳鼠科。在一个实施方式中,所述基因人源化的非人动物是啮齿动物。在一个实施方式中,所述啮齿动物选自小鼠、大鼠和仓鼠。在一个实施方式中,所述啮齿动物选自鼠家族。在一个实施方式中,所述基因修饰的动物来自丽仓鼠科(例如小鼠样仓鼠)、仓鼠科(例如仓鼠、新世界大鼠和小鼠、田鼠)、鼠总科(真小鼠和大鼠、沙鼠、刺毛鼠、冠毛大鼠)、马岛鼠科(登山小鼠、岩小鼠、有尾大鼠、马达加斯加大鼠和小鼠)、刺睡鼠科(例如多刺睡鼠)和鼹形鼠科(例如摩尔大鼠、竹大鼠和鼢鼠)家族。在一个特定实施方式中,所述基因修饰的啮齿动物选自真小鼠或大鼠(鼠总科)、沙鼠、刺毛鼠和冠毛大鼠。在一个实施方式中,所述基因修饰的小鼠来自鼠科家族成员。在一个实施方式中,所述动物是啮齿动物。在一个特定实施方式中,所述啮齿动物选自小鼠和大鼠。在一个实施方式中,所述非人动物是小鼠。
在一个特定实施方式中,所述非人动物是啮齿动物,其为选自BALB/c、A、A/He、A/J、A/WySN、AKR、AKR/A、AKR/J、AKR/N、TA1、TA2、RF、SWR、C3H、C57BR、SJL、C57L、DBA/2、KM、NIH、ICR、CFW、FACA、C57BL/A、C57BL/An、C57BL/GrFa、C57BL/KaLwN、C57BL/6、C57BL/6J、C57BL/6ByJ、C57BL/6NJ、C57BL/10、 C57BL/10ScSn、C57BL/10Cr和C57BL/Ola的C57BL、C58、CBA/Br、CBA/Ca、CBA/J、CBA/st、CBA/H品系的小鼠。
除非特别说明,本发明的实践将采取细胞生物学、细胞培养、分子生物学、转基因生物学、微生物学、重组DNA和免疫学的传统技术。这些技术在以下文献中进行了详细的解释。例如:Molecular Cloning A Laboratory Manual,2ndEd.,ed. By Sambrook,FritschandManiatis(Cold Spring Harbor Laboratory Press:1989);DNA Cloning,Volumes I and II (D.N.Glovered.,1985);Oligonucleotide Synthesis (M.J.Gaited.,1984);Mullisetal. U.S. Pat.No.4,683,195;Nucleic Acid Hybridization(B.D.Hames& S.J.Higginseds.1984);Transcription And Translation (B.D.Hames&S.J.Higginseds.1984);Culture Of Animal Cells (R.I.Freshney,AlanR.Liss,Inc.,1987);Immobilized Cells And Enzymes (IRL Press,1986);B.Perbal,A PracticalGuide To Molecular Cloning(1984);the series,Methods In ENZYMOLOGY (J.Abelsonand M.Simon,eds.-in-chief,Academic Press,Inc.,New York),specifically,Vols.154 and 155 (Wuetal.eds.) and Vol.185,″Gene Expression Technology″(D.Goeddel,ed.);Gene Transfer Vectors For Mammalian Cells (J.H.Miller andM.P.Caloseds.,1987,Cold Spring Harbor Laboratory);Immunochemical Methods InCell And Molecular Biology (Mayer and Walker,eds.,Academic Press,London,1987);Handbook Of Experimental Immunology,Volumes V (D.M.Weir andC.C.Blackwell,eds.,1986);and Manipulating the Mouse Embryo,(Cold SpringHarbor Laboratory Press,Cold Spring Harbor,N.Y.,1986)。
以上只是概括了本发明的一些方面,不是也不应该认为是在任何方面限制本发明。
本说明书提到的所有专利和出版物都是通过参考文献作为整体而引入本发明的。本领域的技术人员应认识到,对本发明可作某些改变并不偏离本发明的构思或范围。
下面的实施例进一步详细说明本发明,不能认为是限制本发明或本发明所说明的具体方法的范围。
附图说明
以下,结合附图来详细说明本发明的实施例,其中:
图1:小鼠IL17RB基因和人IL17RB基因座对比示意图(非按比例)。
图2:小鼠IL17RB基因人源化改造示意图(非按比例)。
图3:IL17RB基因打靶策略及靶向载体设计示意图(非按比例)。
图4:IL17RB重组后细胞Southern blot结果,其中WT为野生型对照。
图5:IL17RB基因人源化小鼠FRT重组过程示意图(非按比例)。
图6:IL17RB基因人源化小鼠F1代鼠尾PCR鉴定结果,其中,WT为野生型,H2O为水对照,PC为阳性对照。
图7:C57BL/6野生型小鼠(WT)和IL17RB基因人源化纯合子小鼠(IL17RB)胸腺组织中IL17RB mRNA检测结果,其中M为Marker,H2O为水对照,WT为野生型对照。
具体实施方式
下面结合具体实施例来进一步描述本发明,本发明的优点和特点将会随着描述而更为清楚。但这些实施例仅是范例性的,并不对本发明的范围构成任何限制。本领域技术人员应该理解的是,在不偏离本发明的精神和范围下可以对本发明技术方案的细节和形式进行修改或替换,但这些修改和替换均落入本发明的保护范围内。
在下述每一实施例中,设备和材料是从以下所指出的几家公司获得:
AseI、NdeI、BamHI酶购自NEB,货号分别为R0526M、R0111S、R0136M;
C57BL/6小鼠、Flp工具鼠购自中国食品药品检定研究院国家啮齿类实验动物种子中心;
PrimeScript™ 1st strand cDNA Synthesis Kit购自TAKARA,货号6110A;
TRIzol™ Reagent购自Invitrogen,货号15596018;
Zombie NIR™ Fixable Viability Kit购自Biolegend,货号B245952;
RNAprep pure培养细胞/细菌总RNA提取试剂盒购自天根,货号DP430;
Attune Nxt Acoustic Focusing Cytometer购自Thermo Fisher,型号AttuneNxt;
Heraeus™ Fresco™ 21 Microcentrifuge购自Thermo Fisher,型号Fresco 21;
BioTek Epoch Microplate Reader购自BioTeK,型号EPOCH。
实施例1 IL17RB基因人源化小鼠
小鼠IL17RB基因(NCBI Gene ID:50905,Primary source:MGI:1355292,UniProt:Q9JIP3,位于14号染色体NC_000080.7的第29718125至29730853位,基于转录本NM_019583.3及其编码蛋白NP_062529.2(SEQ ID NO:1))和人IL17RB基因(NCBI Gene ID:55540,Primary source:HGNC:18015,UniProt ID:Q9NRM6,位于3号染色体NC_000003.12的第53846550至53867388位,基于转录本NM_018725.4及其编码蛋白NP_061195.2(SEQ IDNO:2))对比示意图如图1所示。
为了达到本发明的目的,可在小鼠内源IL17RB基因座引入编码人IL17RB蛋白的核苷酸序列,使得该小鼠表达人或人源化IL17RB蛋白。具体来说,用基因编辑技术,在小鼠IL17RB基因调节元件的控制下,用包含人IL17RB基因的2号外显子部分序列至10号外显子部分序列替换小鼠2号外显子的部分序列至10号外显子的部分序列,得到人源化IL17RB基因座示意图如图2所示,实现对小鼠IL17RB基因的人源化改造。
设计如图3所示的打靶策略,图中显示了靶向载体上含有小鼠IL17RB基因上游和下游的同源臂序列,以及包含人IL17RB序列的A片段。其中,上游同源臂序列(5’同源臂,SEQID NO:3)与NCBI登录号为NC_000080.7的第29728829至29732752位核苷酸序列相同,下游同源臂序列(3’同源臂,SEQ ID NO:4)与NCBI登录号为NC_000080.7的第29715472至29719566位核苷酸序列相同。A片段上人IL17RB的核苷酸序列(SEQ ID NO:5)与NCBI登录号为NC_000003.12的第53848667至53860143位核苷酸序列相同;人IL17RB序列下游与小鼠的连接设计为5’-aggtgaataagctttgttttttccagacaaaagcaagccgggaggctggctgcctctcttcctggtgctgctggtggctgtgtgg-3’(SEQ ID NO:6),其中序列“agccg”的最后一个“g”是人的最后一个核苷酸,序列“ggagg”的第一个“g”是小鼠序列的第一个核苷酸。
靶向载体上还包括用于阳性克隆筛选的抗性基因,即新霉素磷酸转移酶编码序列Neo,并在抗性基因的两侧装上两个同向排列的位点特异性重组系统Frt重组位点,组成Neo盒(Neo cassette)。其中Neo盒5’端与小鼠基因的连接设计为5’-tttctacatagtcttgctgtggtgcacagggtgtggctgaacttcataggGATATCGAATTCCGAAGTTCCTATTCTCTAGAAAGTATAGGAACTTCAGG-3’(SEQ ID NO:7),其中序列“atagg”的最后一个“g”是小鼠的最后一个核苷酸,序列“GATAT”的“G”是Neo盒的第一个核苷酸;Neo盒3’端与小鼠基因的连接设计为5’-TCTCTAGAAAGTATAGGAACTTCATCAGTCAGGTACATAATGGTGGATCCggtagaagaaccctaccacctgcagggagaaaatggtacaaatcacagtt-3’(SEQ ID NO:8),其中序列“GATCC”的最后一个“C”是Neo盒的最后一个核苷酸,序列“ggtag”的第一个“g”是小鼠的第一个核苷酸。此外,还在靶向载体3’同源臂下游构建了具有负筛选标记的编码基因(白喉毒素A亚基的编码基因(DTA))。改造后的人源化小鼠IL17RB的mRNA序列如SEQ ID NO:9所示,表达的蛋白序列如SEQ ID NO:10所示。
鉴于人IL17RB具有多种亚型或转录本,本文所述方法可应用于其它亚型或转录本。
靶向载体构建可采用常规方法进行,如酶切连接等。构建好的靶向载体通过酶切进行初步验证后,再送测序公司进行测序验证。将测序验证正确的靶向载体电穿孔转染入C57BL/6小鼠的胚胎干细胞中,利用阳性克隆筛选标记基因对得到的细胞进行筛选,并利用PCR和Southern Blot技术进行检测确认外源基因的整合情况,筛选出正确的阳性克隆细胞,经PCR鉴定为阳性的克隆再进行Southern Blot(分别用NdeI或BamHI或AseI消化细胞DNA并使用3个探针进行杂交,探针及目的片段长度如表1所示)检测,结果如图4所示,检测结果表明12个经PCR验证为阳性的克隆,经测序进一步验证发现除均为阳性克隆且无随机插入,具体编号为1-C07、1-C10、1-G08、2-B10、2-D04、2-D09、2-F02、2-F07、2-H06、3-A08、3-D07、3-E04。
表1:具体探针及目的片段长度
其中,PCR测定包括下述引物:
F1:5’-GGCCTTGCTTGCATATTGTTCCAC-3’(SEQ ID NO:11),
R1:5’-GTCTGGTCTTCGATGACCTATTGCC-3’(SEQ ID NO:12);
F2:5’-GCTCGACTAGAGCTTGCGGA-3’(SEQ ID NO:13),
R2:5’-TAGTTAGGAGCTAAAGCGGTCAGGC-3’(SEQ ID NO:14);
Southern Blot检测包括如下探针引物:
5’探针(5’Probe):
5’Probe-F:5’-TGAACCCCTGGACATGAAGGTT-3’(SEQ ID NO:15),
5’Probe-R:5’-AGGCCAGTGATGACAGGCTTA-3’(SEQ ID NO:16);
3’探针(3’Probe):
3’Probe-F:5’-GCTACATGACACGTTCATCAGGATG-3’(SEQ ID NO:17),
3’Probe-R:5’-TGTAAATCCTGGCTTGGTTGCTG-3’(SEQ ID NO:18);
Neo探针(Neo Probe):
Neo Probe-F:5’-GGATCGGCCATTGAACAAGAT-3’(SEQ ID NO:19),
Neo Probe-R:5’-CAGAAGAACTCGTCAAGAAGGC-3’(SEQ ID NO:20)。
将筛选出的正确阳性克隆细胞(黑色鼠)按照本领域已知的技术导入已分离好的囊胚中(白色鼠),得到的嵌合囊胚转移至培养液中短暂培养后移植至受体母鼠(白色鼠)的输卵管,可生产F0代嵌合体鼠(黑白相间)。将F0代嵌合鼠与野生型鼠回交获得F1代鼠,再将F1代杂合小鼠互相交配即可获得F2代纯合子鼠。还可将阳性鼠与Flp工具鼠交配去除阳性克隆筛选标记基因(该过程示意图见图5)后,再通过互相交配即可得到IL17RB基因人源化纯合子小鼠。可通过PCR鉴定子代小鼠体细胞的基因型(引物如表2所示),示例性的F1代小鼠(已去除Neo标记基因)的鉴定结果见图6,其中,编号为F1-01、F1-02、F1-03、F1-04、F1-05的5只小鼠均为阳性杂合小鼠。
表2:引物名称及具体序列
这表明使用本方法能构建出可稳定传代且无随机插入的IL17RB基因人源化小鼠。
可通过常规检测方法确认阳性小鼠体内人源化IL17RB mRNA的表达情况,例如RT-PCR等。具体来说,分别取6周龄雌性C57BL/6野生型小鼠和本实施例制得的IL17RB基因人源化纯合子小鼠各1只,脱颈安乐死后取胸腺组织,研磨制备细胞悬液,按照TRIzol试剂盒说明书抽提细胞RNA,反转录成cDNA后进行RT-PCR检测,检测结果如图7所示。从图7中可以看出,在C57BL/6野生型小鼠胸腺中检测到鼠IL17RBmRNA(图7A),没有检测到人源化IL17RBmRNA(图7B);在IL17RB基因人源化纯合子小鼠胸腺组织中仅检测到人源化IL17RBmRNA(图7B),没有检测到鼠IL17RBmRNA(图7A)。
RT-PCR引物序列包括:
mIL17RB-F:5’-TCGTCAAGACAAGTGTGGCA-3’(SEQ ID NO:28)
mIL17RB-R:5’-CGACAGACGGTGTGATGGAA-3’(SEQ ID NO:29)
hIL17RB-F:5’-ACACAGTCTATTTCATTGGGGC-3’(SEQ ID NO:30)
hIL17RB-R:5’-CAGGCACTGTCACAAAGGGT-3’(SEQ ID NO:31)
GAPDH-F:5’-TCACCATCTTCCAGGAGCGAGA-3’(SEQ ID NO:32)
GAPDH-R:5’-GAAGGCCATGCCAGTGAGCTT-3’(SEQ ID NO:33)
实施例2 双重人源化或多重双人源化小鼠的制备
利用本方法或制得的IL17RB小鼠还可以制备双人源化或多人源化小鼠模型。如,前述实施例1中,囊胚显微注射使用的胚胎干细胞可选择来源于含有PD-1、PD-L1、IL17A、IL17F、IL17RA、IL17RC等其它基因修饰的小鼠,或者,也可在人源化IL17RB小鼠的基础上,利用分离小鼠ES胚胎干细胞和基因重组打靶技术,获得IL17RB与其它基因修饰的双基因或多基因修饰的小鼠模型。也可将本方法得到的IL17RB小鼠纯合子或杂合子与其它基因修饰的纯合或杂合小鼠交配,对其后代进行筛选,根据孟德尔遗传规律,可有一定机率得到人源化IL17RB与其它基因修饰的双基因或多基因修饰的杂合小鼠,再将杂合子相互交配可以得到双基因或多基因修饰的纯合子,利用这些双基因或多基因修饰的小鼠可以进行靶向人IL17RB和其它基因调节剂的体内药效验证等。
以上详细描述了本发明的优选实施方式,但是,本发明并不限于上述实施方式中的具体细节,在本发明的技术构思范围内,可以对本发明的技术方案进行多种简单变型,这些简单变型均属于本发明的保护范围。
另外需要说明的是,在上述具体实施方式中所描述的各个具体技术特征,在不矛盾的情况下,可以通过任何合适的方式进行组合,为了避免不必要的重复,本发明对各种可能的组合方式不再另行说明。
此外,本发明的各种不同的实施方式之间也可以进行任意组合,只要其不违背本发明的思想,其同样应当视为本发明所公开的内容。
序列表
<110> 百奥赛图(北京)医药科技股份有限公司
<120> IL17RB基因人源化的非人动物的构建方法及应用
<130> 1
<160> 33
<170> SIPOSequenceListing 1.0
<210> 1
<211> 499
<212> PRT
<213> 小鼠(Mouse)
<400> 1
Met Leu Leu Val Leu Leu Ile Leu Ala Ala Ser Cys Arg Ser Ala Leu
1 5 10 15
Pro Arg Glu Pro Thr Ile Gln Cys Gly Ser Glu Thr Gly Pro Ser Pro
20 25 30
Glu Trp Met Val Gln His Thr Leu Thr Pro Gly Asp Leu Arg Asp Leu
35 40 45
Gln Val Glu Leu Val Lys Thr Ser Val Ala Ala Glu Glu Phe Ser Ile
50 55 60
Leu Met Asn Ile Ser Trp Ile Leu Arg Ala Asp Ala Ser Ile Arg Leu
65 70 75 80
Leu Lys Ala Thr Lys Ile Cys Val Ser Gly Lys Asn Asn Met Asn Ser
85 90 95
Tyr Ser Cys Val Arg Cys Asn Tyr Thr Glu Ala Phe Gln Ser Gln Thr
100 105 110
Arg Pro Ser Gly Gly Lys Trp Thr Phe Ser Tyr Val Gly Phe Pro Val
115 120 125
Glu Leu Ser Thr Leu Tyr Leu Ile Ser Ala His Asn Ile Pro Asn Ala
130 135 140
Asn Met Asn Glu Asp Ser Pro Ser Leu Ser Val Asn Phe Thr Ser Pro
145 150 155 160
Gly Cys Leu Asn His Val Met Lys Tyr Lys Lys Gln Cys Thr Glu Ala
165 170 175
Gly Ser Leu Trp Asp Pro Asp Ile Thr Ala Cys Lys Lys Asn Glu Lys
180 185 190
Met Val Glu Val Asn Phe Thr Thr Asn Pro Leu Gly Asn Arg Tyr Thr
195 200 205
Ile Leu Ile Gln Arg Asp Thr Thr Leu Gly Phe Ser Arg Val Leu Glu
210 215 220
Asn Lys Leu Met Arg Thr Ser Val Ala Ile Pro Val Thr Glu Glu Ser
225 230 235 240
Glu Gly Ala Val Val Gln Leu Thr Pro Tyr Leu His Thr Cys Gly Asn
245 250 255
Asp Cys Ile Arg Arg Glu Gly Thr Val Val Leu Cys Ser Glu Thr Ser
260 265 270
Ala Pro Ile Pro Pro Asp Asp Asn Arg Arg Met Leu Gly Gly Trp Leu
275 280 285
Pro Leu Phe Leu Val Leu Leu Val Ala Val Trp Val Leu Ala Ala Gly
290 295 300
Ile Tyr Leu Thr Trp Arg Gln Gly Arg Ser Thr Lys Thr Ser Phe Pro
305 310 315 320
Ile Ser Thr Met Leu Leu Pro Leu Ile Lys Val Leu Val Val Tyr Pro
325 330 335
Ser Glu Ile Cys Phe His His Thr Val Cys Arg Phe Thr Asp Phe Leu
340 345 350
Gln Asn Tyr Cys Arg Ser Glu Val Ile Leu Glu Lys Trp Gln Lys Lys
355 360 365
Lys Ile Ala Glu Met Gly Pro Val Gln Trp Leu Thr Thr Gln Lys Gln
370 375 380
Ala Ala Asp Lys Val Val Phe Leu Leu Pro Ser Asp Val Pro Thr Leu
385 390 395 400
Cys Asp Ser Ala Cys Gly His Asn Glu Gly Ser Ala Arg Glu Asn Ser
405 410 415
Gln Asp Leu Phe Pro Leu Ala Phe Asn Leu Phe Cys Ser Asp Phe Ser
420 425 430
Ser Gln Thr His Leu His Lys Tyr Leu Val Val Tyr Leu Gly Gly Ala
435 440 445
Asp Leu Lys Gly Asp Tyr Asn Ala Leu Ser Val Cys Pro Gln Tyr His
450 455 460
Leu Met Lys Asp Ala Thr Ala Phe His Thr Glu Leu Leu Lys Ala Thr
465 470 475 480
Gln Ser Met Ser Val Lys Lys Arg Ser Gln Ala Cys His Asp Ser Cys
485 490 495
Ser Pro Leu
<210> 2
<211> 502
<212> PRT
<213> 人(human)
<400> 2
Met Ser Leu Val Leu Leu Ser Leu Ala Ala Leu Cys Arg Ser Ala Val
1 5 10 15
Pro Arg Glu Pro Thr Val Gln Cys Gly Ser Glu Thr Gly Pro Ser Pro
20 25 30
Glu Trp Met Leu Gln His Asp Leu Ile Pro Gly Asp Leu Arg Asp Leu
35 40 45
Arg Val Glu Pro Val Thr Thr Ser Val Ala Thr Gly Asp Tyr Ser Ile
50 55 60
Leu Met Asn Val Ser Trp Val Leu Arg Ala Asp Ala Ser Ile Arg Leu
65 70 75 80
Leu Lys Ala Thr Lys Ile Cys Val Thr Gly Lys Ser Asn Phe Gln Ser
85 90 95
Tyr Ser Cys Val Arg Cys Asn Tyr Thr Glu Ala Phe Gln Thr Gln Thr
100 105 110
Arg Pro Ser Gly Gly Lys Trp Thr Phe Ser Tyr Ile Gly Phe Pro Val
115 120 125
Glu Leu Asn Thr Val Tyr Phe Ile Gly Ala His Asn Ile Pro Asn Ala
130 135 140
Asn Met Asn Glu Asp Gly Pro Ser Met Ser Val Asn Phe Thr Ser Pro
145 150 155 160
Gly Cys Leu Asp His Ile Met Lys Tyr Lys Lys Lys Cys Val Lys Ala
165 170 175
Gly Ser Leu Trp Asp Pro Asn Ile Thr Ala Cys Lys Lys Asn Glu Glu
180 185 190
Thr Val Glu Val Asn Phe Thr Thr Thr Pro Leu Gly Asn Arg Tyr Met
195 200 205
Ala Leu Ile Gln His Ser Thr Ile Ile Gly Phe Ser Gln Val Phe Glu
210 215 220
Pro His Gln Lys Lys Gln Thr Arg Ala Ser Val Val Ile Pro Val Thr
225 230 235 240
Gly Asp Ser Glu Gly Ala Thr Val Gln Leu Thr Pro Tyr Phe Pro Thr
245 250 255
Cys Gly Ser Asp Cys Ile Arg His Lys Gly Thr Val Val Leu Cys Pro
260 265 270
Gln Thr Gly Val Pro Phe Pro Leu Asp Asn Asn Lys Ser Lys Pro Gly
275 280 285
Gly Trp Leu Pro Leu Leu Leu Leu Ser Leu Leu Val Ala Thr Trp Val
290 295 300
Leu Val Ala Gly Ile Tyr Leu Met Trp Arg His Glu Arg Ile Lys Lys
305 310 315 320
Thr Ser Phe Ser Thr Thr Thr Leu Leu Pro Pro Ile Lys Val Leu Val
325 330 335
Val Tyr Pro Ser Glu Ile Cys Phe His His Thr Ile Cys Tyr Phe Thr
340 345 350
Glu Phe Leu Gln Asn His Cys Arg Ser Glu Val Ile Leu Glu Lys Trp
355 360 365
Gln Lys Lys Lys Ile Ala Glu Met Gly Pro Val Gln Trp Leu Ala Thr
370 375 380
Gln Lys Lys Ala Ala Asp Lys Val Val Phe Leu Leu Ser Asn Asp Val
385 390 395 400
Asn Ser Val Cys Asp Gly Thr Cys Gly Lys Ser Glu Gly Ser Pro Ser
405 410 415
Glu Asn Ser Gln Asp Leu Phe Pro Leu Ala Phe Asn Leu Phe Cys Ser
420 425 430
Asp Leu Arg Ser Gln Ile His Leu His Lys Tyr Val Val Val Tyr Phe
435 440 445
Arg Glu Ile Asp Thr Lys Asp Asp Tyr Asn Ala Leu Ser Val Cys Pro
450 455 460
Lys Tyr His Leu Met Lys Asp Ala Thr Ala Phe Cys Ala Glu Leu Leu
465 470 475 480
His Val Lys Gln Gln Val Ser Ala Gly Lys Arg Ser Gln Ala Cys His
485 490 495
Asp Gly Cys Cys Ser Leu
500
<210> 3
<211> 3924
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 3
tggcatcctt gcctgagtat tctcccatat cttccacccc tttctacccc cttcacccaa 60
ctcttcctct ctcctttgcc accagattct tctccgggat aagaaaccca actagctgcc 120
ctgagccatt tctcaaaaca agccaatcaa gtacccaatt ccttatgttt gtcccacctc 180
cccttgccaa ggtcaggaga ggtcgctcca gtttctgtgt gcatgtgaca tcaagaactt 240
tcctgcccgc ctttctgtct caggggccaa gaaatggctg acttgtccct ggacctgacc 300
tctctatccc tgctgtccca aagccccttg ccttttgcct tgagctgact acagcaggct 360
gtgttctgcc agagcagccc agggactggc actcagctac tcaccgagta aacacgtgtt 420
ggcaatcggc tgtgacttgc tcagctgaga gggagtgctc tctacttgct ctgcacagcc 480
ccatgggctg tttttcaaaa ctggaaactg attgttatct aaacatctgc ctggctcctc 540
agccccatag ggcaaagttc tcagatagaa gctggaaaac aaggccgtct ccattttgtc 600
ttcccaccag ggcaggtact ctggccttgc atgttcctgt cacagtgctc agctccatgc 660
caagagcatg cacctttgca ccctggtagc aggcaagggt ttaccggtca gaaaggctgc 720
tatgaaagct tattgttccc aggaaccact gagcctgaac ggctcgctga ggttgagtgg 780
gagcccagga tggcccagtt ccaggcgagg gtggggctac aacctccctt ctgtcaatct 840
ccttggtcag gcaccagaac aagtttctca ggaaactatt ccttgcaatc caacagaggc 900
tagggacgga gccttctcca ggtgaatcaa aaccagaatc ttcacctgaa gtctgtctca 960
ctgatgaatt tagtctgtat cccagtcggg cttcagcaaa tacagccact tgactccacc 1020
caacctctta gcagggagaa gcaggacttg tgcctgaagc tctctccaat tccactgtct 1080
acgtgtctgg tgagcccagc tctgctaacg atataggaag ggctgaagca gcagaattcc 1140
aggctctgta gccctttctc taaccccaaa tgatactggg ctatccccct ctcaagccat 1200
ttcatctcaa aaggatggtc cctggcccac cttcccaaaa ttaaggtccg agaggtggtg 1260
gagagtcggg cagatctcac tcaggcagaa gcagcactgc caaatgtcag aaccttagct 1320
cggaacccac agtgctgaga caccagcttc gggaccccag atgtaaccgc caggcctcat 1380
tctataggag tacacttaac cagaaagcct gagaggccag gggtggagat tcctatcact 1440
gttaccttgt aactgttaag ttcctgtcgc tagttctctc aaaacccaga aggtttcagt 1500
ggcttgagcc cctccgcttc aggtcctcac ccagcctagg acctgaaaag ctagctactc 1560
acccgggaca actccgcacc tgggccagga cgcgaaggac aggagccagg agagccgaga 1620
aggcagtggc cgctgatcct cgcccgctgc tctttaactg gcgtcccggg agtgccgcga 1680
gctgctccgt ctggctccag gcgcgggtgc ggtcgcccca cctggcctgc ggcacacacc 1740
tgtgcgcggg gctcgacacg ccccgcgcct gctggagctt cagtcctccc gggcctgagt 1800
cattcgcacc ccgcctaccc ccgcgagccc cgagggcgac aaggcggcca ctttactgtc 1860
ttgggaccag gccagcgccg gcttgggcag ggcggtactg tgggaggcgg agcccggaga 1920
actgcggcgg gcgcctggat aagaggaccc tggacctctg gccccagctc cgcgtggtgg 1980
tgggcggtgg ccagtggccg ggccatgttg ctagtgttgc tgatcttggc tgcatcgtgc 2040
aggagcgccc tgcctcgaga gccggtaagc atccccctcc attgtgggca ccctttcggt 2100
tcccatccgt ttagctgaac cctgacaagc gacaagcgcc tgaaaacctg ccagctcaac 2160
tgtcaaggaa ttgtccatga accatcctta gaaaggggcc agtttgcacc cctgagctaa 2220
ccgcgatggg tcttgaactc acaccaccca tctattgcca tcctggaagg tatcagttgt 2280
cagactgctt cccaggggaa tgaggtttag aactgtcaca gatgccacct gggcatttga 2340
gtcgtagtat tctcctcaaa aagaaaaagg gtggtactca actcacacag gtatgccttg 2400
tcctgggtta atgatgacaa acaagacagt actgcttaca tctgtgctgc ccacctcttg 2460
ggtcaacaga cagcctgcat cagtttgctt gcttgctttt cctttctgtc tagggccatg 2520
ttaactcctg aaggcagctt tcctttcttc tccagtcccc atgaatgaaa gaatctttct 2580
aggagggaaa agcgcatcct caacaaggca ttcagatgca agccctgctc tcttcacaca 2640
ttggaaggca tccttggtga ctgtggaaag tcacagctta tttatggtgt cctgaagccc 2700
ttgccgtctc tgtgcaagag aaaaaaactg tcctgggggc agctgtgaag aacattgaat 2760
gaactctgca aaatgtagag ccctcacctg gcttttccta cacaccttgt cttgctccca 2820
ttcaagtaga acttggttcc tagtaggtgg ctcatttgct taataagaga atgtcacaaa 2880
gcaaaggaca gtggatgaca tcatccaaac cccatttgct ccatcaaaac ctctgctttc 2940
tgccttcacc aaggttttgg aaatgagaga gctgagatct taagagtcct ttgggtttgt 3000
cacattgtgt cttgccactg aagattactg acaactaaaa taaatgtttt aggctcaata 3060
tactatttcc aaactagtac tcttcccttt taatggaaac gtcaaagggt ttagtaactg 3120
gcctttcgta gacagtaggg gggttttact tttttgaaag ctgttttaga aagatgaggt 3180
cactgtcccc aacggatttc taagtagctc agctgagttc tggatcattc agaagtatga 3240
tatgagcttg tgtgtgtgtg tgtgtgtgtg tcctgcgggc tgcagtcatt gctccccgct 3300
ttcttctcct ctttgcatag ccaatcttta atttgaagaa attaataaac attacatggt 3360
cctgaaaacc aacagttaag cattagatga tacaagtggc ctcaccgtgg ccctcagtaa 3420
tccgaggcag ggaaagaaag catgcatgag gcctacctgg tcttcaaggg ggtgagcagc 3480
tgccagaaaa ccaggtgtac caatggctaa gccagggccc atcacagcta cagtggggca 3540
gctcaattcc cctcaagttg gaaagtgcaa tcaaactagc tcttttcttt taaaacattt 3600
aaagaattgt atttttaatt tgtgtgtgtg tgtgtgtgtg tgccctcaga aaccacgact 3660
atgatttctg gagctagagt gacaggcaat tgctgtgagt tgtccagtat gggagctgga 3720
gcctaaatgc aggttttctg caagaacaat atgctgtctt aaccactgat tcatctctcc 3780
agctcccggc tcatcttttc gttgaaaact tgtcacttgt atatgcatct aaggaacaat 3840
attggtgtgg tgcctaattt ggtaaagtga ggagtgtggt ttactggctt caacatgaca 3900
tttggtttct ctttcctgca gact 3924
<210> 4
<211> 4095
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 4
ggtagaagaa ccctaccacc tgcagggaga aaatggtaca aatcacagtt gacccaactc 60
ctaaaatgat aagtagaggt cacatctgtg tcacagtaca gaagtttcag cagcttgctt 120
taaacacagt gacctcaaac ctcagcagct gctgagttta tctgaaaaga acagggcttg 180
actatgaagg ctgatttgtt gataaggact taaaaataca tcttttcaag tgcttttaag 240
attcaagcat cagtgtttca ggtattcaaa aggactatca aggacctttt ctcctgtcgg 300
gaacaaagct ctgaccctga tctggctgac agtctggtgg ggagccttta gtagttttag 360
actactgtaa aggcctgcta tctgtcacac agttaacaaa gtccccccct tctcctccag 420
gaaggagcac gaagacgtcc tttcctattt ccaccatgct cctgcccctc attaaggtcc 480
tggtggttta tccttctgag atatgtttcc atcacaccgt ctgtcgcttc actgactttc 540
ttcaaaacta ctgcagaagt gaggtcatcc ttgaaaaatg gcagaaaaag aaaatcgccg 600
agatggggcc ggtacagtgg ctgaccactc agaagcaagc ggcagataaa gtggtcttcc 660
ttcttcccag tgacgtcccg accctttgtg acagtgcctg tggccacaat gagggcagcg 720
ccagggagaa ctctcaggat ctgttccctc ttgcctttaa cctcttttgt agtgatttca 780
gcagccagac gcatctgcac aaatacctgg tggtctatct tgggggagca gacctcaaag 840
gcgactataa tgccctgagt gtctgccccc aatatcatct catgaaggac gccacagctt 900
tccacacaga acttctcaag gctacgcaga gcatgtcagt gaagaaacgc tcacaagcct 960
gccatgatag ctgttcaccc ttgtagtcca cccgggggaa tagagactct gaagccttcc 1020
tactctccct tccagtgaca aatgctgtgt gacgactctg aaatgtgtgg gagaggctgt 1080
gtggaggtag tgctatgtac aaacttgctt taaaactgga gtttgcaaag tcaacctgag 1140
catacacgcc tgaggctagt cattggctgg atttatgaag acaacacagt tacagacaat 1200
aatgagtggg acctacattt gggatatacc caaagctggg taatgattat cactgagaac 1260
cacgcactct ggccatgagg taatacggca cttccctgtc aggctgtctg tcaggttggg 1320
tctgtcttgc actgcccatg ctctatgctg cacgtagacc gttttgtaac attttaatct 1380
gttaatgaat aatccgtttg ggaggctctc actaatgtgt agcttcctaa gagaagaagc 1440
ctattacaca cacaagcaga cactctgcct gaccagatga tccagtttat gtgtaaccac 1500
tgtactgttt gctgttggga cagtgctttc ccttgccaaa cttttggaca ggtaaccttt 1560
accttttatc ttagaagggc taggaactga acccagggta ttatgcacac tagacaaatg 1620
ctctgacact gaattacacc tcagtctccc acgggtgact ctctaagtga atatgcaagc 1680
atatcaccag cagggcggtg gtggcacacg cctttaatcc cagcactcgg gaggcagagg 1740
caggcggatt tctgagttca aggtcagctt ggtctacaga gtgagttcca ggacagccag 1800
ggctacacag agaaaccctg tcttggaaaa accaaaaaaa ctaaccaatc aaccaaagcc 1860
ccacatcacc agctggatgt ggtggcacgt ggcacatacc tttaatccca gcactcaagt 1920
ggaagaggca ggaggacctc tgggtgtgtg aggccagcca gagacagagt gaggctctgt 1980
cccaaacaaa cacaagttgc tacctataaa tccagtgtct acttttgtcc agttacctgt 2040
ccactattga agttcgcaga gttcttggtt attacataac atttcctatt caagtcaaaa 2100
ctaattcttc aagtgagatc aactctcttt aagtaatata tctttttcta taactaattg 2160
cgtaaataaa aacccaccaa gaactggaga ggtggctccg aggcttagag cgctggctgc 2220
tcttccagag gacttggttc ggtttccagc aaccacatag ctgtttatag ccaccttaac 2280
tccagtgtca aagggatctg ataaccctct ctggcttccc aggccactct ccatgcattt 2340
tacaaagaca ttttatatac agcaggactt tgcttgcatg tatgtctgtg taccacatgt 2400
gcttggtgcc tgaggtcagg cgagggtgcc ggatgccctg gaactggaat taggaacaga 2460
tgtgagccac catgtggact ctgggaatcc aatcccagtc tcctgcaaga acaaatgctc 2520
ttaactgatt cgctatctag ctatacccca atgtaaataa atcttaattt aaatctaaaa 2580
taaaccccca accagaacaa gcaggaacag tatacagcct tgctaggtct tgaatcttct 2640
agctccttac gttctcaaag cagctttttc ttaatgtaga aactacagca atgtatatta 2700
tcataagtat ttgtctagta tggcttcata acgcacgatc aatctatttt gaaatgggac 2760
cctgtttcag gtagttgact gctttgatat ctgtacctga aggtctggca tgttggacac 2820
ccctttaatc ctagggctcc gaggcagaag caggcaaatc tcttgagttc gaggccagcc 2880
tggtctacac agtaagttcc aggacagcca gggctacata actgagaaac cttgtttaaa 2940
aaacaaaaaa caaaacaaaa gcaagtttgt aattgatccg gtaagtagca aacactttta 3000
aaaactatgt atttttaatt atgtcttagg ggtgtgtgtg tgtatctgca atgggtatgt 3060
gcatgtgagt atccactcac aagagtatct atccactcag aagtatgact cacaagaggc 3120
caaaggcctc agatgcctta gggctggagc tacagatggt tgggagcggc ctgacctgcg 3180
ctgggacctg aaccccagat cttccagaag agcagcgagt ggccttcctc actgagctgc 3240
ctctccagtt ccctggggca cacattttaa gttgcagtgc actggaagga agcaaacaga 3300
gccgtgggtt ctggggaaag actctgtttc aaaaaaaaaa aaggtcatgt tcgagcattt 3360
gtacatcatg tacaaggtcc tggtttcaat ccacagcaat acaaaaagtg tgctaaagag 3420
ccaatgtaaa agggattcat tcactctagc acaagtctta caaataaaat aaaaaccctg 3480
cagccaactt catcagcaaa tactctgata aggtggaggc agtgcaggcg tggcacacga 3540
gcgtcacatc cagtgtatga tgtcagaggc agtgcaggcg tggcaaggct cggctatttg 3600
ggaagacagt tgccagaagc ttgtaaaagt gctgctaaca gatggatata gcttgtgcca 3660
aaacttggac cacaaaccaa cttcctccac atcaacacct accactgtaa gtgaccttca 3720
tgtgcggaat ggtcacaaca ggcgcactgt gcttcgggga gcgcagggat gagtgtcaac 3780
tgtccaatca gcccagaaac ataaaagcat atcctataca tttgatccaa aacataaaaa 3840
gcaggtaatt ttcaaatgag gaagtacatc tccaaatagt atgccccgct tgctcgaagc 3900
tgctttagct ggttactctt acagcgccag cagggctgtg cacccgagac tgctacagtc 3960
catggtgatc ccgcggctca tctgacccac tgttacctgc ctcccagtcc tccccaggac 4020
aagcctggtg acagacacgg taggagcaga tgcctaacag ccagctaaac aattctgctc 4080
tacctaccct gcctg 4095
<210> 5
<211> 11477
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 5
gttcaatgtg gctctgaaac tggtaggtgc attagagaat gggtatttgg ggctttacat 60
tgaaagcaca gttggacttt taggaagcct aatataggca ataggtcatc gaagaccaga 120
cacagggtga cccattgctt tattggctga tgcagatgaa gtctctgtct gctcgaagca 180
gaactctggg gctctgtggg aatgaggccg tgtttcagac aggctagcct cttggccagt 240
caagatggga cttgccatac aaggtggaaa gtgatgcctg ccactctgca gagggagatc 300
aaggccgtgg aaggaaggtc acttagagtt gggaccgaga agcaaatgtt tcataggaga 360
gcgggtgttt gagatggtgg catttgaaga atgggtaggc ttctgcccaa aaaatgtggg 420
taggaagagt gagctggagg agagatggca ccagcaaagg ttaggaggtg ggaaagtggg 480
ggctgtgtgg gccagctggc tggaatccag ggccaaagcc tgaggaaggc aggcctgact 540
ctaataaaca tgtcaaatac actaaatata tacatgtgta catacacagg tacctggagt 600
ggtcccacgt gctctgtagg tgaaagccag gaggttgaga gtacttctgt tagatcaaaa 660
ttatctgggc tataaatcag ccagcgctaa gaaacatttg gccttagcag gaagaactga 720
ggaattgggt caggcgtggt ggcttatacc tataatccaa gcactttggg aggctgaggt 780
gggaggatca tttgaggcca ggagtttgag atcatcttag gcaacagagt gagaccccat 840
ctctacaaaa aacatttttt taattaaaaa aaaaagaagt gaggaattgt gaatggggga 900
aggtttggtg tgagtcactg agaggagatg gcatggcatc agactgggct gggaaagaca 960
gtgtcatgga agtcttgctt tttcccaggg ccatctccag agtggatgct acaacatgat 1020
ctaatccccg gagacttgag ggacctccga gtagaacctg ttacaactag tgttgcaaca 1080
ggggactatt caattttgat gaatgtaagc tgggtactcc gggcagatgg taagtttgca 1140
tccatcagag ataaggccca aagtgtctac taaatcagaa atgtgcagac tatatgaacc 1200
attaattccc cttctacgca taaccaacag aaatacatgc atacacgcac caaaagccac 1260
atactagggt gatcgcagta gcactactta caacaggcaa cattagaaag cactcaaatg 1320
cccaacagta gaaaggaatc accagataaa ttccgtcaca gcaacaagaa tgaatgatct 1380
gtaactacgt acaacaatat ggatgaatgt cacaaacata aggcagattc aaaagagttt 1440
gtacatttat atctgtgtgc atgcctgcag gcatacatgc catatgatcc catttacaga 1500
aagtgccaag caaacaaaac caatctagtg tttagaagtc aggatagtag ccaggagcgg 1560
tggctaacgc ctgtaatccc agcactttgg gaggccgagg tgggtggatc acttgaggcc 1620
aggagttcga gaccagcctg gccaacatgg caaaacccca tctctactaa gaatacaaaa 1680
attagccaga tgtggtgttg cacacctgta atcccagcta ctcgggaggc tgaggcacga 1740
gaatcgcttg aacccaggag gcggagcttg cagtgagctg tgatcgcacc actgcactcc 1800
agcagcctgg gcaacagagt gaaactccgt ctcaaaaaaa aaaaaaaaag caaaagacca 1860
ggcgtggtgg cgcatgcctg taatcccagc actttgggag gccaaggcgg gtagattact 1920
tgaggtcagg agttcaagac cagcctggcc aacatggtga aactccatct ctcctaaaaa 1980
tacaaaaaat tagccaggca tggtggcaca tgcctgtaat cccagctact tgggaggctg 2040
aggcaggaga ataacttgaa cccaggaggt ggaggttgca gtgagttgag actgtgccat 2100
tgcactcctg cctgggtgac agagagactg tctcaaaata aaataaaata aaataaaata 2160
aaagtcaggg tagtggttcc tgctggggta ggaataactg taaggaagta tgaggagcac 2220
acctgaggga gtttcttaac ctgcatactg gttgcatgag tgtcccgatg catcaagtgg 2280
tacacttatg tgtgtacttt tctgtgtatg tacttcatgg aagaagataa ggtaaggcga 2340
gagatgattt tccccacccc acttcagttt agggaaagca ctggcttaga tagattactt 2400
ttccacttcc cagatgacgt cacgaagctt ttggcatttc cccaagattg tgctgcaagg 2460
caccatatga gttctttcag tgagctactc ctgggatcct aatgcccatc cttttggttt 2520
ccatctgcta aattcagatc ggaaaaaagt ctacctggtc aagaatccat gagtgccagg 2580
cacagagaga ggagtcacct acagtctcca cccaggagga ttcagagggg taggggaggc 2640
agacacaggc acaatgtcag gggagtccca gaagggagct gtgcttcatc tgagtgggat 2700
agggcgagag gagctggaaa gggtctctgg tgaggttgct ttaggtgaga tgtggccata 2760
agggtttggg cccttacctg gcacagagtg agctctttgt cagctgtggt tatgggctgt 2820
ggtgcatgag tgagttgggc tttaaaagag tgaaacttga agaggtagat aatgggagga 2880
aaggaccttc agggagaaga atcagcatgg ataaaggtgc aatcggaaca ggtgtggttt 2940
ggtgatgaca gtcttggagg gtctttgtgg gtctccctga ggccagcctc ttcctcttaa 3000
aagagaccca gaatagtgca gagctatagc cacctttgca ggcatcctat gaaattcaag 3060
ctgacctatg aggaggaggg agcttgggct ttggaatctg aattcgtatc tgctacttgc 3120
tgatgagttt tctaagttta agcagagctc cctgaaactt cctaagcctc acgaacccca 3180
tctgtaaact ggagacagtg gtgcctacct ttcagtacta ttgtgaatat gaaccgagac 3240
ataaaataaa gcatgctagt aaagcatcta gcatcaagtc agccacacag caagtgctaa 3300
ataaaccaat gtcctcttgc ctatctcggc agccagcatc cgcttgttga aggccaccaa 3360
gatttgtgtg acgggcaaaa gcaacttcca gtcctacagc tgtgtgaggt gcaattacac 3420
agaggccttc cagactcaga ccagaccctc tggtggtaaa gtaagcactt ttttgttttt 3480
tgttttgttt tttgagatag catcttgttc tgtcacccag gctggagtgc agtggtgcga 3540
tcatggctca ctgcgacctc aatcttctgg gctcaagcga tcgttctact tcagcctccc 3600
aagcagctgg gactacaggc atgagccacc agacctggct aatttttgta gtttttgtag 3660
aggggggttt caccgtgttg ctggtcttga attccagtgc tcaagcgatc tgcctgcctc 3720
agcttcccaa agtgctggga ttacaggtat gagccactgt atccgggcat agacactttt 3780
atgtatttgg ctaattgttg ctgaaagcta tgccctttgt ttggggagca tggatgatgt 3840
gctgctcaca cgggctggta aatagctatg actcagagct tagagcaaga tcccttggtt 3900
caaatcccat tttcggctac ttctctgttt tgtgcctttt gtaagtcagc atctctgggc 3960
ctcagttttt ccctaccagc caatggaaga gaattggact taaattatat cctggcttta 4020
aaactttata cttctctgat tctgtaaatt gtttagtgcc cattccctaa gtagacattg 4080
gttggagacc taagttttca cagagtgagt aatgaaatac aaactaaaag ggcaacacac 4140
taaggtatac tgtttctgta ctgcagtgtt ttgggaattg agagttcctt gctttgcctt 4200
tcagtggaca ttttcctaca tcggcttccc tgtagagctg aacacagtct atttcattgg 4260
ggcccataat attcctaatg caaatatgaa tgaagatggc ccttccatgt ctgtgaattt 4320
cacctcacca ggtaaacttc ctcatttgtt tattattctt tgtcttgctg ggatgcctgc 4380
tttgcgatat gcacagagag agcccaggga accctggata aggcttttgg ccttgctaaa 4440
tctcagttcc cttatctaaa gcatggacac agtagtaacc atactttacc cacagagcag 4500
agagaataaa gtcaggtaaa atgtaggaaa aagtctgttc ctgtatgcca ggcactgtgc 4560
tgaatacttg atatgcatca tctcacttaa gcatctcagc aaccccaagg ttgggatagc 4620
tgcattttac agatgaagaa actgagaggt taagtcactt tccctgggtt atacacacat 4680
aaaagtgaag ctgacattca cactttggtc cttctagagc aatgcacctg agccactctt 4740
tatatcactg atgcaaaatg tttgttcgtt cctatctcag ctttctagag tgggctgcat 4800
atggctgggc ctctgagaat ctacagactc cagttggctt gtccccaagc tgtccatgag 4860
aggcagggtg gtcctgagtt ggatgctggg cccttgcccc agctttgtga cacaggacca 4920
gttctctggt accacagctg caatgtcaca cacagggctg tgatgaggat gagacaggat 4980
ggtgtgtgtg aagggtctat ctgagccgag tcacccatcc cataccacca tcctgtcatc 5040
tgtcacaggc aaggcatcaa agccaaacgg taatttccaa ttcactgctg gtcttggaga 5100
ttgtggtcca gggtgttttt ggtgctgcac catttactac acgtggtttc caactttctt 5160
gatgagagca aacctcattt tgttttcaaa taagcttttt tattttggaa ttcttttatt 5220
ttattattta tttatttatt tattttttag acggggtttt gctcttgttg ccctggctgg 5280
agtgcagtgg caccatctca gctcactgca acctccgcct cttggattta agtgattctc 5340
ctgcctcagc ctcccaagaa gctaggatta cagtcatgag ccaccacacc cagctaattt 5400
ttgtattttt agtagagacg gggtttcact atgatggcta ggctagtctc aaactcctaa 5460
cctcgggtga tctgcctacc ttggcctccc aaagtgctgg gattacagga gtgagccacc 5520
gtacccagcc ttattttgga ataattttag atttacagaa aagttgcaaa gatagtacag 5580
tttccatata cctctcaccc agcttctcct attaacgtct tgtattacca tgatacattt 5640
gtcaaaacta agaaaccaac attagtacgt gtctgttaac taaactccag actttatttg 5700
gacttcacct gctttttcat gaatgtcctc tttctgttcc agttgcattt agtccctgtg 5760
tgtctctcct ctggtctgta acagtttctt agtctttgtt tttcagaacg ttgacagtct 5820
tgagttctga ccaggcatcc tgtagaatgt cccccagtgt gggtttgtct gatgtttgtc 5880
tcatgattag acagggaatt cataccacag aggtgccggg cccttcttat cccatcaggg 5940
gtccatggca cccacatgac accaggctgg gggtcaccat caacactaga ttaatgtggg 6000
gactgccaga cttctccatg taaatctact gtttttccca actctcgttt ttggaaacaa 6060
gtcactaatt ctaccccacc ctggggtgag gataggggag tgttaggctt catgtccttg 6120
aaggggcagc atctacagat attaccaaat gtccttttca aatgatggat ggttttcgag 6180
tcctctctta gggatcaaag atactgtttt gataaaggct gtatttaaat tagactctgg 6240
ttatcacgta atgggcagat gttctctgtt tagcagtaaa atgttagagc cccacaaaaa 6300
attagccggg cgtggtggcg ggcgcctgta gtcccagcta ctcaggagcc tgaggcagga 6360
gaatggcgtg aacctgggtg gcagagctag cagtgagctg agattgcgcc actgcactcc 6420
agcctaggcg acagagcaag actccggctc aaaaaaaaaa aaaaaaaaaa aaaaattaga 6480
gccccatttg aatttgccat cttacatctg gaatgttttg gcaatttgtt acaggtgtgg 6540
tatgcgtatg tgtgtgtgca cttggcaaag ggggtggggc agagatacct ttttctaaaa 6600
tgtaaaaact ttcattcaaa tttccaggct gcctagacca cataatgaaa tataaaaaaa 6660
agtgtgtcaa ggccggtaag taaatacggc atttgctttt attatttgaa gaaacttgta 6720
acttgaggca gcatagctct cttcatcttt gcacagaaga actgagccct aggggagagt 6780
tgaatatcca agcacccccg atggtggtgg cagaagcacc tatggctcct gatctccagc 6840
cagtactttg cccactgagt gcagcagcct gtccagaaag gggtagaccc tcagtacatt 6900
tatatagtgg agcaacagtc acaaagaacc acccccatcc acttggttaa caacatttta 6960
gcccaaatgg ctcagcagtg ccaatcaaag gctcttcctg attgatggga gtcctacgag 7020
cccttccacc tgctcagatg ctagcaccac ccacctgtgg ccatggggat cctaagaacc 7080
ttctgtttgg attagcaaac attcgaggtg ggagtgtgag ttcgctaaag gactggccca 7140
caattacgct ctcaatttgt gtcctcaaag aggatcactt ttcacaggct agtgaatttt 7200
accccttgaa ttttgtagta atccccttta ggattattag tccctagtgc ccagtttcca 7260
agggtttgtg taaatttctg agtaatatag tgttggaatg ctaatgaggc tagtggtatt 7320
tggatcagcc cagaactctt aggcaccctt cactgcacaa ggcatccctg gtaccctttc 7380
agacagccgt ccatgctgtc cttgctagaa ggcagatttg gacgctgagc ttcccaccaa 7440
gggtacagag aacactgcag gtgtgaggag gtcagagggc cttgaagtca gtcctgaacc 7500
tgaactccta aagacagttc tgagccaggg cctcaaatgg tctcctcggt ggagaagata 7560
atttctgctg ttatacattg ttgttcctgg aaatcagtcc agtccttatt tgcaagtcac 7620
atgaatgaaa cgactgactt tttctgccag agcagaccta tcagggtata tcctctaagc 7680
tacgagccat ttccttcttt ctttgcatct agtgaagctg gcagaatggt ctgatggatg 7740
aagctcctgg aacacaaaat attctagtgg ttgtagtcct gattgtgagt tcacttgctc 7800
tataatctta gacatgtttt ccaacctgta gctctttctc cccatcctgg gaagtagaaa 7860
atactgaccc tatcccttaa ctcaaaagag aggttgaaaa cagcagtgag tagcagaacc 7920
aatccgtgcc tcccaccata ctcagcccag ggaagctgtt tttagcaaag cagcgatgta 7980
cggatcccca aactgtcttg ggcagcaagt actttgagac tttggaaaca atgataaaaa 8040
gttcactatg tagcagcgtc cctatctctt ggacattgtt cctgagaact tgggtttgtt 8100
tccataaagc cctgtgggga acttaaatga gggaagagtt agcaagttca tctacatggt 8160
tctctctcaa ctcacaggaa gcctgtggga tccgaacatc actgcttgta agaagaatga 8220
ggagacagta gaagtgaact tcacaaccac tcccctggga aacagataca tggctcttat 8280
ccaacacagc actatcatcg ggttttctca ggtgtttgag gtactttttc tcttcgtccc 8340
cttcacctcg tcctccagct ttccttagtg tgttgaagga aaatggggac agtgttgtcc 8400
aaacgtgctg ggctactttg atgaattcaa cagataaaaa caagtccctg tgggcatcct 8460
gagaggtgtc actgtgaacc tcaaaccata gaaagaagga agtctccgaa gcaggaaatg 8520
agcccttctt tacacatggt ctagtgaaag gagtcctttt gggaagcaaa gctgctttgc 8580
tggaagcctc gaaatctgct cactcctgtg cctcataatt tttggggggc ggaggttgac 8640
acagggtctc actctgtcat ccaggctgga atgcagtggc acaatcacag cttagtgcag 8700
cctccacctc ccgggctcaa gtgatcctcc cacctcagcc tccccagata gctgggatta 8760
caggcatgtg ccaccatgcc gagctaattt tttcatcttt agtagagacg gtttcgccat 8820
gttgcccagg ctggtctcaa aatactgggc tcaagccatc tgcctgcctt ggcctcccaa 8880
agtgttggga ttacaggggt gagccagtgc acctggcacc tcataattct tgactctctc 8940
tctcttaagc cacaccagaa gaaacaaacg cgagcttcag tggtgattcc agtgactggg 9000
gatagtgaag gtgctacggt gcaggtaaag ttcagtgagc tgctctgggg agggaaggga 9060
catagaagac tgttccatca ttcattgctt ttaaggatga gttctctctt gtcaaatgca 9120
cttctgccag cagacaccag ttaagtggcg ttcatggggg ctctttcgct gcagcctcca 9180
ccgtgctgag gtcaggaggc cgacgtggca gttgtggtcc cttttgcttg tattaatggc 9240
tgctgacctt ccaaagcact ttttattttc attttctgtc acagacactc agggatagca 9300
gtaccatttt acttccgcaa gcctttaact gcaagatgaa gctgcaaagg gtttgaaatg 9360
ggaaggtttg agttccaggc agcgtatgaa ctctggagag gggctgccag tcctctctgg 9420
gccgcagcgg acccagctgg aacacaggaa gttggagcag taggtgctcc ttcacctctc 9480
agtatgtctc tttcaactct agtttttgag gtggggacac aggaggtcca gtgggacaca 9540
gccactcccc aaagagtaag gagcttccat gcttcattcc ctggcataaa aagtgctcaa 9600
acacaccaga gggggcaggc accagccagg gtatgatggc tactaccctt ttctggagaa 9660
ccatagactt cccttactac agggacttgc atgtcctaaa gcactggctg aaggaagcca 9720
agaggatcac tgctgctcct tttttctaga ggaaatgttt gtctacgtgg taagatatga 9780
cctagccctt ttaggtaagc gaactggtat gttagtaacg tgtacaaagt ttaggttcag 9840
accccgggag tcttgggcac gtgggtctcg ggtcactggt tttgacttta gggctttgtt 9900
acagatgtgt gaccaagggg aaaatgtgca tgacaacact agaggtatgg gcgaagccag 9960
aaagaaggga agttttggct gaagtaggag tcttggtgag attttgctct gatgcatggt 10020
gtgaactttc tgagcctctt gtttttcctc agctgactcc atattttcct acttgtggca 10080
gcgactgcat ccgacataaa ggaacagttg tgctctgccc acaaacaggc gtccctttcc 10140
ctctggataa cagtaagtgc ccagtaactt caaccagatg atcaaagtgg ctcacacaca 10200
gtcactgccc cccactcagt atgtggaagg gttgtgtgta tgtgggcagt gcaaggggtc 10260
gctgcctgtg tacactgaac tggggtgcag agaaagccaa cagtgctgtc ccagagaacc 10320
tagaatctga gtaagaacag gctttatttg taaaaccact cgtgactctt tacaaagcag 10380
gatacacaga agggaaaaaa atacacagtg caaaatggat gttctgagtg ccacaaggat 10440
ctgctgaaaa aagccaaaga tgtaagatgg ctgggtatat atgagaatga atatttcact 10500
atattctgat tcaattacca gtctcagtgg cccaggatga gcttttggtg tggtcacatg 10560
gccaacattt ggataacaaa tgaggaataa tggtaccgcc tcactagtgc ctgagaacag 10620
catgttctgg aaaatgtctc tggagttaga gatgtgttag ctttttcatt acagatggag 10680
aaatacaatg tttacacaac agtccagggg tggggtcaaa agttggaagg tgtcattaga 10740
cgcagccaaa taaagtgaag acaacccagg tgactggcag ccctgacttg tgcgtgggcg 10800
aagccttaca gattcctggg cactctgtgc ctgagcttac ctgatgttct tgtgaggcgg 10860
gtggcactat cctccatgta tgtcagtcta acaagacggc ctgtaaaaat gtcatctata 10920
tgtgctatgt atgtaagcag ttgtacccag aataacatta attccttaaa gaaccaaaaa 10980
aactggacag aaaccctagt tcctactgtg aaaatgccgg tgcatatcag gacaaagaag 11040
ccaaagagat tttaggactg ttcttaactt cctgggtccc atatcccttt tgagaatcta 11100
atgcaagacc tgcacactct agggaaaaag atatgattgg cagactctct aaagagcttc 11160
tgactgatgg ttttaaaggg atctaaatct ctacaaagtg gctgggcgcg gtgacacatg 11220
cctgtaatcc cagctactca ggaggctgag gcagcattgc ttgaacccgg gagacggagg 11280
ctgcagtgaa ccgagattgc gccactgcac tccagcctag gcaacaaagc gagactctgt 11340
ctcaaaacta aaaaataata aaaaataaat aaacctctat aaagtatacc aagtcttagt 11400
ttttaaatta agagataagt gtggatttgt tttccaaagg tgaataagct ttgttttttc 11460
cagacaaaag caagccg 11477
<210> 6
<211> 85
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 6
aggtgaataa gctttgtttt ttccagacaa aagcaagccg ggaggctggc tgcctctctt 60
cctggtgctg ctggtggctg tgtgg 85
<210> 7
<211> 100
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 7
tttctacata gtcttgctgt ggtgcacagg gtgtggctga acttcatagg gatatcgaat 60
tccgaagttc ctattctcta gaaagtatag gaacttcagg 100
<210> 8
<211> 100
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 8
tctctagaaa gtataggaac ttcatcagtc aggtacataa tggtggatcc ggtagaagaa 60
ccctaccacc tgcagggaga aaatggtaca aatcacagtt 100
<210> 9
<211> 2070
<212> DNA/RNA
<213> 人工序列(Artificial Sequence)
<400> 9
gtgggaggcg gagcccggag aactgcggcg ggcgcctgga taagaggacc ctggacctct 60
ggccccagct ccgcgtggtg gtgggcggtg gccagtggcc gggccatgtt gctagtgttg 120
ctgatcttgg ctgcatcgtg caggagcgcc ctgcctcgag agccgactgt tcaatgtggc 180
tctgaaactg ggccatctcc agagtggatg ctacaacatg atctaatccc cggagacttg 240
agggacctcc gagtagaacc tgttacaact agtgttgcaa caggggacta ttcaattttg 300
atgaatgtaa gctgggtact ccgggcagat gccagcatcc gcttgttgaa ggccaccaag 360
atttgtgtga cgggcaaaag caacttccag tcctacagct gtgtgaggtg caattacaca 420
gaggccttcc agactcagac cagaccctct ggtggtaaat ggacattttc ctacatcggc 480
ttccctgtag agctgaacac agtctatttc attggggccc ataatattcc taatgcaaat 540
atgaatgaag atggcccttc catgtctgtg aatttcacct caccaggctg cctagaccac 600
ataatgaaat ataaaaaaaa gtgtgtcaag gccggaagcc tgtgggatcc gaacatcact 660
gcttgtaaga agaatgagga gacagtagaa gtgaacttca caaccactcc cctgggaaac 720
agatacatgg ctcttatcca acacagcact atcatcgggt tttctcaggt gtttgagcca 780
caccagaaga aacaaacgcg agcttcagtg gtgattccag tgactgggga tagtgaaggt 840
gctacggtgc agctgactcc atattttcct acttgtggca gcgactgcat ccgacataaa 900
ggaacagttg tgctctgccc acaaacaggc gtccctttcc ctctggataa caacaaaagc 960
aagccgggag gctggctgcc tctcttcctg gtgctgctgg tggctgtgtg ggtgctggca 1020
gctgggatct acctaacttg gaggcaagga aggagcacga agacgtcctt tcctatttcc 1080
accatgctcc tgcccctcat taaggtcctg gtggtttatc cttctgagat atgtttccat 1140
cacaccgtct gtcgcttcac tgactttctt caaaactact gcagaagtga ggtcatcctt 1200
gaaaaatggc agaaaaagaa aatcgccgag atggggccgg tacagtggct gaccactcag 1260
aagcaagcgg cagataaagt ggtcttcctt cttcccagtg acgtcccgac cctttgtgac 1320
agtgcctgtg gccacaatga gggcagcgcc agggagaact ctcaggatct gttccctctt 1380
gcctttaacc tcttttgtag tgatttcagc agccagacgc atctgcacaa atacctggtg 1440
gtctatcttg ggggagcaga cctcaaaggc gactataatg ccctgagtgt ctgcccccaa 1500
tatcatctca tgaaggacgc cacagctttc cacacagaac ttctcaaggc tacgcagagc 1560
atgtcagtga agaaacgctc acaagcctgc catgatagct gttcaccctt gtagtccacc 1620
cgggggaata gagactctga agccttccta ctctcccttc cagtgacaaa tgctgtgtga 1680
cgactctgaa atgtgtggga gaggctgtgt ggaggtagtg ctatgtacaa acttgcttta 1740
aaactggagt ttgcaaagtc aacctgagca tacacgcctg aggctagtca ttggctggat 1800
ttatgaagac aacacagtta cagacaataa tgagtgggac ctacatttgg gatataccca 1860
aagctgggta atgattatca ctgagaacca cgcactctgg ccatgaggta atacggcact 1920
tccctgtcag gctgtctgtc aggttgggtc tgtcttgcac tgcccatgct ctatgctgca 1980
cgtagaccgt tttgtaacat tttaatctgt taatgaataa tccgtttggg aggctctcac 2040
taatgtgtag cttcctaaga gaagaagcct 2070
<210> 10
<211> 502
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 10
Met Leu Leu Val Leu Leu Ile Leu Ala Ala Ser Cys Arg Ser Ala Leu
1 5 10 15
Pro Arg Glu Pro Thr Val Gln Cys Gly Ser Glu Thr Gly Pro Ser Pro
20 25 30
Glu Trp Met Leu Gln His Asp Leu Ile Pro Gly Asp Leu Arg Asp Leu
35 40 45
Arg Val Glu Pro Val Thr Thr Ser Val Ala Thr Gly Asp Tyr Ser Ile
50 55 60
Leu Met Asn Val Ser Trp Val Leu Arg Ala Asp Ala Ser Ile Arg Leu
65 70 75 80
Leu Lys Ala Thr Lys Ile Cys Val Thr Gly Lys Ser Asn Phe Gln Ser
85 90 95
Tyr Ser Cys Val Arg Cys Asn Tyr Thr Glu Ala Phe Gln Thr Gln Thr
100 105 110
Arg Pro Ser Gly Gly Lys Trp Thr Phe Ser Tyr Ile Gly Phe Pro Val
115 120 125
Glu Leu Asn Thr Val Tyr Phe Ile Gly Ala His Asn Ile Pro Asn Ala
130 135 140
Asn Met Asn Glu Asp Gly Pro Ser Met Ser Val Asn Phe Thr Ser Pro
145 150 155 160
Gly Cys Leu Asp His Ile Met Lys Tyr Lys Lys Lys Cys Val Lys Ala
165 170 175
Gly Ser Leu Trp Asp Pro Asn Ile Thr Ala Cys Lys Lys Asn Glu Glu
180 185 190
Thr Val Glu Val Asn Phe Thr Thr Thr Pro Leu Gly Asn Arg Tyr Met
195 200 205
Ala Leu Ile Gln His Ser Thr Ile Ile Gly Phe Ser Gln Val Phe Glu
210 215 220
Pro His Gln Lys Lys Gln Thr Arg Ala Ser Val Val Ile Pro Val Thr
225 230 235 240
Gly Asp Ser Glu Gly Ala Thr Val Gln Leu Thr Pro Tyr Phe Pro Thr
245 250 255
Cys Gly Ser Asp Cys Ile Arg His Lys Gly Thr Val Val Leu Cys Pro
260 265 270
Gln Thr Gly Val Pro Phe Pro Leu Asp Asn Asn Lys Ser Lys Pro Gly
275 280 285
Gly Trp Leu Pro Leu Phe Leu Val Leu Leu Val Ala Val Trp Val Leu
290 295 300
Ala Ala Gly Ile Tyr Leu Thr Trp Arg Gln Gly Arg Ser Thr Lys Thr
305 310 315 320
Ser Phe Pro Ile Ser Thr Met Leu Leu Pro Leu Ile Lys Val Leu Val
325 330 335
Val Tyr Pro Ser Glu Ile Cys Phe His His Thr Val Cys Arg Phe Thr
340 345 350
Asp Phe Leu Gln Asn Tyr Cys Arg Ser Glu Val Ile Leu Glu Lys Trp
355 360 365
Gln Lys Lys Lys Ile Ala Glu Met Gly Pro Val Gln Trp Leu Thr Thr
370 375 380
Gln Lys Gln Ala Ala Asp Lys Val Val Phe Leu Leu Pro Ser Asp Val
385 390 395 400
Pro Thr Leu Cys Asp Ser Ala Cys Gly His Asn Glu Gly Ser Ala Arg
405 410 415
Glu Asn Ser Gln Asp Leu Phe Pro Leu Ala Phe Asn Leu Phe Cys Ser
420 425 430
Asp Phe Ser Ser Gln Thr His Leu His Lys Tyr Leu Val Val Tyr Leu
435 440 445
Gly Gly Ala Asp Leu Lys Gly Asp Tyr Asn Ala Leu Ser Val Cys Pro
450 455 460
Gln Tyr His Leu Met Lys Asp Ala Thr Ala Phe His Thr Glu Leu Leu
465 470 475 480
Lys Ala Thr Gln Ser Met Ser Val Lys Lys Arg Ser Gln Ala Cys His
485 490 495
Asp Ser Cys Ser Pro Leu
500
<210> 11
<211> 24
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 11
ggccttgctt gcatattgtt ccac 24
<210> 12
<211> 25
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 12
gtctggtctt cgatgaccta ttgcc 25
<210> 13
<211> 20
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 13
gctcgactag agcttgcgga 20
<210> 14
<211> 25
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 14
tagttaggag ctaaagcggt caggc 25
<210> 15
<211> 22
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 15
tgaacccctg gacatgaagg tt 22
<210> 16
<211> 21
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 16
aggccagtga tgacaggctt a 21
<210> 17
<211> 25
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 17
gctacatgac acgttcatca ggatg 25
<210> 18
<211> 23
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 18
tgtaaatcct ggcttggttg ctg 23
<210> 19
<211> 21
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 19
ggatcggcca ttgaacaaga t 21
<210> 20
<211> 22
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 20
cagaagaact cgtcaagaag gc 22
<210> 21
<211> 24
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 21
ggacagtcac ttgctacaca cgtt 24
<210> 22
<211> 23
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 22
aagtgaccac cacccttacc ttg 23
<210> 23
<211> 22
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 23
gctgaggcag cattgcttga ac 22
<210> 24
<211> 25
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 24
gggagttatt cctatgtcat ggcca 25
<210> 25
<211> 25
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 25
aaatcagcct tcatagtcaa gccct 25
<210> 26
<211> 25
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 26
gacaagcgtt agtaggcaca tatac 25
<210> 27
<211> 24
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 27
gctccaattt cccacaacat tagt 24
<210> 28
<211> 20
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 28
tcgtcaagac aagtgtggca 20
<210> 29
<211> 20
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 29
cgacagacgg tgtgatggaa 20
<210> 30
<211> 22
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 30
acacagtcta tttcattggg gc 22
<210> 31
<211> 20
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 31
caggcactgt cacaaagggt 20
<210> 32
<211> 22
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 32
tcaccatctt ccaggagcga ga 22
<210> 33
<211> 21
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 33
gaaggccatg ccagtgagct t 21
Claims (10)
1.一种IL17RB基因人源化的非人动物的构建方法,其特征在于,所述的构建方法包括用包含编码SEQ ID NO:2第22-287位的核苷酸序列替换至非人动物IL17RB基因座。
2.根据权利要求1所述的构建方法,其特征在于,所述的构建方法包括用包含SEQ IDNO:5所示核苷酸序列替换至非人动物IL17RB基因座。
3.根据权利要求1或2所述的构建方法,其特征在于,所述的替换至非人动物IL17RB基因座为替换非人动物IL17RB基因中编码SEQ ID NO:1第22-284位的核苷酸序列,或者替换非人动物与NCBI登录号为NC_000080.7的第29720048至29728828位所示序列相同的核苷酸序列,所述的非人动物为小鼠或大鼠。
4.根据权利要求1或2所述的构建方法,其特征在于,所述的非人动物体内表达人源化IL17RB蛋白,同时内源IL17RB蛋白的表达降低或缺失,所述的人源化IL17RB蛋白包含SEQID NO:10所示的氨基酸序列。
5.根据权利要求1或2所述的构建方法,其特征在于,所述的非人动物的基因组中包含人源化IL17RB基因,所述的人源化IL17RB基因转录的mRNA包含SEQ ID NO:9所示的核苷酸序列。
6.根据权利要求1或2所述的构建方法,其特征在于,使用靶向载体进行非人动物的构建,其中,所述的靶向载体包含编码SEQ ID NO:2第22-287位的核苷酸序列或包含SEQ IDNO:5所示核苷酸序列,所述的靶向载体还包含5’臂和/或3’臂,所述的5’臂的核苷酸序列如SEQ ID NO:3所示,所述的3’臂的核苷酸序列如SEQ ID NO:4所示。
7.一种IL17RB基因的靶向载体,其特征在于,所述的靶向载体包含编码SEQ ID NO:2第22-287位的核苷酸序列或包含SEQ ID NO:5所示核苷酸序列,所述的靶向载体还包含5’臂和/或3’臂,所述的5’臂的核苷酸序列如SEQ ID NO:3所示,所述的3’臂的核苷酸序列如SEQID NO:4所示。
8.一种人源化IL17RB蛋白,其特征在于,所述的人源化IL17RB蛋白的氨基酸序列如SEQID NO:10所示。
9.一种编码权利要求8所述人源化IL17RB蛋白的人源化IL17RB基因,其特征在于,所述的人源化IL17RB基因包含SEQ ID NO:5所示的核苷酸序列。
10.权利要求1-6任一所述的构建方法获得的IL17RB基因人源化的非人动物、权利要求8所述的人源化IL17RB蛋白或者权利要求9所述的人源化IL17RB基因的应用,其特征在于,所述的应用为非疾病诊断、非疾病治疗目的,所述的应用包括:
A)涉及人类细胞的与IL17RB相关的免疫过程的产品开发中的应用;
B)作为药理学、免疫学、微生物学和医学研究的与IL17RB相关的模型系统中的应用;
C)涉及生产和利用动物实验疾病模型用于与IL17RB相关的病原学研究和/或用于开发诊断策略和/或用于开发治疗策略中的应用;
D)在体内研究人IL17RB信号通路调节剂的筛选、药效检测、评估疗效、验证或评价中的应用;或者,
E)研究IL17RB基因功能,研究针对人IL17RB靶位点的药物、药效,研究与IL17RB相关的免疫相关疾病药物以及抗肿瘤药物方面的应用。
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110157090.5A CN112501203B (zh) | 2021-02-05 | 2021-02-05 | Il17rb基因人源化的非人动物的构建方法及应用 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110157090.5A CN112501203B (zh) | 2021-02-05 | 2021-02-05 | Il17rb基因人源化的非人动物的构建方法及应用 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN112501203A true CN112501203A (zh) | 2021-03-16 |
CN112501203B CN112501203B (zh) | 2021-06-04 |
Family
ID=74952591
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202110157090.5A Active CN112501203B (zh) | 2021-02-05 | 2021-02-05 | Il17rb基因人源化的非人动物的构建方法及应用 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN112501203B (zh) |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102858985A (zh) * | 2009-07-24 | 2013-01-02 | 西格马-奥尔德里奇有限责任公司 | 基因组编辑方法 |
CN111057721A (zh) * | 2018-10-12 | 2020-04-24 | 北京百奥赛图基因生物技术有限公司 | 人源化IL-4和/或IL-4Rα改造动物模型的制备方法及应用 |
CN111218425A (zh) * | 2019-01-17 | 2020-06-02 | 北京百奥赛图基因生物技术有限公司 | 人源化转基因动物 |
CN111304246A (zh) * | 2019-12-17 | 2020-06-19 | 百奥赛图江苏基因生物技术有限公司 | 一种人源化细胞因子动物模型、制备方法及应用 |
CN111793646A (zh) * | 2020-09-08 | 2020-10-20 | 北京百奥赛图基因生物技术有限公司 | Il1r1基因人源化改造的非人动物的构建方法及其应用 |
CN111808882A (zh) * | 2020-09-01 | 2020-10-23 | 北京百奥赛图基因生物技术有限公司 | Il7r基因人源化改造的动物模型的构建方法及其应用 |
-
2021
- 2021-02-05 CN CN202110157090.5A patent/CN112501203B/zh active Active
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102858985A (zh) * | 2009-07-24 | 2013-01-02 | 西格马-奥尔德里奇有限责任公司 | 基因组编辑方法 |
CN111057721A (zh) * | 2018-10-12 | 2020-04-24 | 北京百奥赛图基因生物技术有限公司 | 人源化IL-4和/或IL-4Rα改造动物模型的制备方法及应用 |
CN111218425A (zh) * | 2019-01-17 | 2020-06-02 | 北京百奥赛图基因生物技术有限公司 | 人源化转基因动物 |
CN111304246A (zh) * | 2019-12-17 | 2020-06-19 | 百奥赛图江苏基因生物技术有限公司 | 一种人源化细胞因子动物模型、制备方法及应用 |
CN111808882A (zh) * | 2020-09-01 | 2020-10-23 | 北京百奥赛图基因生物技术有限公司 | Il7r基因人源化改造的动物模型的构建方法及其应用 |
CN111793646A (zh) * | 2020-09-08 | 2020-10-20 | 北京百奥赛图基因生物技术有限公司 | Il1r1基因人源化改造的非人动物的构建方法及其应用 |
Non-Patent Citations (4)
Title |
---|
CHURCH,D.M.等: "Reference Sequence: NC_000080.7", 《NCBI》 * |
FLAMAR AL等: "Reference Sequence: NP_062529.2", 《NCBI》 * |
MUZNY,D.M.等: "Reference Sequence: NC_000003.12", 《NCBI》 * |
SONG Y等: "Reference Sequence: NP_061195.2", 《NCBI》 * |
Also Published As
Publication number | Publication date |
---|---|
CN112501203B (zh) | 2021-06-04 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN111057721B (zh) | 人源化IL-4和/或IL-4Rα改造动物模型的制备方法及应用 | |
CN111793646B (zh) | Il1r1基因人源化改造的非人动物的构建方法及其应用 | |
CN113388640B (zh) | Ccr4基因人源化的非人动物及其构建方法和应用 | |
CN112779285B (zh) | 人源化il-10和il-10ra基因改造动物的构建方法和应用 | |
CN112430621B (zh) | Il2ra基因人源化的非人动物的构建方法及应用 | |
CN111218425B (zh) | 人源化转基因动物 | |
CN111304246A (zh) | 一种人源化细胞因子动物模型、制备方法及应用 | |
CN114277055A (zh) | Il1b和il1a基因人源化的非人动物及其构建方法和应用 | |
CN113429486A (zh) | 基因修饰非人动物的构建方法及应用 | |
CN113046390B (zh) | Csf1r基因人源化的非人动物及其构建方法和应用 | |
CN112458117B (zh) | Cd155基因人源化的非人动物的构建方法及应用 | |
CN112300265B (zh) | Il33基因人源化的非人动物的构建方法和应用 | |
CN113774086A (zh) | A2ar基因人源化非人动物及其构建方法和应用 | |
CN112481303B (zh) | Il15ra基因人源化非人动物及其构建方法和应用 | |
CN112501204B (zh) | Il21r基因人源化非人动物及其构建方法和应用 | |
CN112501205B (zh) | Ceacam1基因人源化非人动物的构建方法及应用 | |
CN112553213B (zh) | Cx3cr1基因人源化的非人动物及其构建方法和应用 | |
CN113831403A (zh) | Sting基因人源化非人动物的构建方法及应用 | |
CN112501203B (zh) | Il17rb基因人源化的非人动物的构建方法及应用 | |
CN112553252B (zh) | Tnfr2基因人源化的非人动物的构建方法和应用 | |
CN112048522A (zh) | Tmem173基因人源化改造的动物模型的构建方法及其应用 | |
CN114134152A (zh) | Glp1r基因人源化的非人动物及其构建方法和应用 | |
CN113234139A (zh) | Tnfsf9基因人源化的非人动物及其构建方法和应用 | |
CN113264996A (zh) | 一种人源化的非人动物及其制备方法和应用 | |
CN112501202B (zh) | Cxcr4基因人源化的非人动物及其构建方法和应用 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |