CN106701709B - Atm基因突变体及其应用 - Google Patents
Atm基因突变体及其应用 Download PDFInfo
- Publication number
- CN106701709B CN106701709B CN201611130281.8A CN201611130281A CN106701709B CN 106701709 B CN106701709 B CN 106701709B CN 201611130281 A CN201611130281 A CN 201611130281A CN 106701709 B CN106701709 B CN 106701709B
- Authority
- CN
- China
- Prior art keywords
- leu
- ser
- glu
- lys
- ile
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 101150065175 Atm gene Proteins 0.000 title claims abstract description 77
- 150000007523 nucleic acids Chemical class 0.000 claims abstract description 121
- 206010003594 Ataxia telangiectasia Diseases 0.000 claims abstract description 120
- 108020004707 nucleic acids Proteins 0.000 claims abstract description 86
- 102000039446 nucleic acids Human genes 0.000 claims abstract description 86
- 230000035772 mutation Effects 0.000 claims abstract description 76
- 239000012472 biological sample Substances 0.000 claims abstract description 75
- 238000012216 screening Methods 0.000 claims abstract description 34
- 229920001184 polypeptide Polymers 0.000 claims abstract description 13
- 102000004196 processed proteins & peptides Human genes 0.000 claims abstract description 13
- 108090000765 processed proteins & peptides Proteins 0.000 claims abstract description 13
- 239000003153 chemical reaction reagent Substances 0.000 claims description 14
- 239000002773 nucleotide Substances 0.000 claims description 10
- 125000003729 nucleotide group Chemical group 0.000 claims description 10
- 208000026350 Inborn Genetic disease Diseases 0.000 claims description 7
- 208000016361 genetic disease Diseases 0.000 claims description 7
- 108020004711 Nucleic Acid Probes Proteins 0.000 claims description 5
- 239000002853 nucleic acid probe Substances 0.000 claims description 5
- 102220095519 rs876660869 Human genes 0.000 claims description 5
- 230000001131 transforming effect Effects 0.000 claims description 5
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 claims description 3
- 238000000034 method Methods 0.000 abstract description 34
- 239000000523 sample Substances 0.000 description 57
- 238000012163 sequencing technique Methods 0.000 description 47
- 108020004414 DNA Proteins 0.000 description 45
- 108091028043 Nucleic acid sequence Proteins 0.000 description 36
- 108090000623 proteins and genes Proteins 0.000 description 35
- 108010004586 Ataxia Telangiectasia Mutated Proteins Proteins 0.000 description 28
- 102000000872 ATM Human genes 0.000 description 24
- 210000004027 cell Anatomy 0.000 description 24
- 239000002299 complementary DNA Substances 0.000 description 20
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 15
- 238000012217 deletion Methods 0.000 description 15
- 230000037430 deletion Effects 0.000 description 15
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 14
- 241000282414 Homo sapiens Species 0.000 description 12
- 108010034529 leucyl-lysine Proteins 0.000 description 12
- 102000004169 proteins and genes Human genes 0.000 description 12
- 238000006243 chemical reaction Methods 0.000 description 11
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 9
- 241000880493 Leptailurus serval Species 0.000 description 9
- 210000004369 blood Anatomy 0.000 description 9
- 239000008280 blood Substances 0.000 description 9
- 201000010099 disease Diseases 0.000 description 9
- 239000000047 product Substances 0.000 description 9
- 206010064571 Gene mutation Diseases 0.000 description 8
- 108010009298 lysylglutamic acid Proteins 0.000 description 8
- 238000012408 PCR amplification Methods 0.000 description 7
- 108010004073 cysteinylcysteine Proteins 0.000 description 7
- 238000000605 extraction Methods 0.000 description 7
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 6
- 230000000295 complement effect Effects 0.000 description 6
- 238000001514 detection method Methods 0.000 description 6
- 239000013612 plasmid Substances 0.000 description 6
- UJGRZQYSNYTCAX-SRVKXCTJSA-N Asp-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UJGRZQYSNYTCAX-SRVKXCTJSA-N 0.000 description 5
- LZEUDRYSAZAJIO-AUTRQRHGSA-N Glu-Val-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LZEUDRYSAZAJIO-AUTRQRHGSA-N 0.000 description 5
- CVFOYJJOZYYEPE-KBPBESRZSA-N Gly-Lys-Tyr Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CVFOYJJOZYYEPE-KBPBESRZSA-N 0.000 description 5
- TWROVBNEHJSXDG-IHRRRGAJSA-N His-Leu-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O TWROVBNEHJSXDG-IHRRRGAJSA-N 0.000 description 5
- YVKSMSDXKMSIRX-GUBZILKMSA-N Leu-Glu-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O YVKSMSDXKMSIRX-GUBZILKMSA-N 0.000 description 5
- YQFZRHYZLARWDY-IHRRRGAJSA-N Leu-Val-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN YQFZRHYZLARWDY-IHRRRGAJSA-N 0.000 description 5
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 5
- YKUGPVXSDOOANW-KKUMJFAQSA-N Phe-Leu-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YKUGPVXSDOOANW-KKUMJFAQSA-N 0.000 description 5
- 239000013614 RNA sample Substances 0.000 description 5
- AHOLTQCAVBSUDP-PPCPHDFISA-N Thr-Ile-Lys Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)[C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O AHOLTQCAVBSUDP-PPCPHDFISA-N 0.000 description 5
- 108010005233 alanylglutamic acid Proteins 0.000 description 5
- 150000001413 amino acids Chemical group 0.000 description 5
- 230000003321 amplification Effects 0.000 description 5
- 108010077245 asparaginyl-proline Proteins 0.000 description 5
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 5
- 238000010276 construction Methods 0.000 description 5
- 239000000706 filtrate Substances 0.000 description 5
- 239000012634 fragment Substances 0.000 description 5
- 108010056582 methionylglutamic acid Proteins 0.000 description 5
- 238000003199 nucleic acid amplification method Methods 0.000 description 5
- MEFILNJXAVSUTO-JXUBOQSCSA-N Ala-Leu-Thr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MEFILNJXAVSUTO-JXUBOQSCSA-N 0.000 description 4
- IGULQRCJLQQPSM-DCAQKATOSA-N Arg-Cys-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O IGULQRCJLQQPSM-DCAQKATOSA-N 0.000 description 4
- VNFWDYWTSHFRRG-SRVKXCTJSA-N Arg-Gln-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O VNFWDYWTSHFRRG-SRVKXCTJSA-N 0.000 description 4
- ABLJDBFJPUWQQB-DCAQKATOSA-N Cys-Leu-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CS)N ABLJDBFJPUWQQB-DCAQKATOSA-N 0.000 description 4
- XZKJEOMFLDVXJG-KATARQTJSA-N Cys-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CS)N)O XZKJEOMFLDVXJG-KATARQTJSA-N 0.000 description 4
- 238000007400 DNA extraction Methods 0.000 description 4
- 108700024394 Exon Proteins 0.000 description 4
- QCMVGXDELYMZET-GLLZPBPUSA-N Glu-Thr-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QCMVGXDELYMZET-GLLZPBPUSA-N 0.000 description 4
- RSFGIMMPWAXNML-MNXVOIDGSA-N Leu-Gln-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RSFGIMMPWAXNML-MNXVOIDGSA-N 0.000 description 4
- HYIFFZAQXPUEAU-QWRGUYRKSA-N Leu-Gly-Leu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C HYIFFZAQXPUEAU-QWRGUYRKSA-N 0.000 description 4
- AKVBOOKXVAMKSS-GUBZILKMSA-N Leu-Ser-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O AKVBOOKXVAMKSS-GUBZILKMSA-N 0.000 description 4
- ARNIBBOXIAWUOP-MGHWNKPDSA-N Leu-Tyr-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ARNIBBOXIAWUOP-MGHWNKPDSA-N 0.000 description 4
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 4
- 108020004485 Nonsense Codon Proteins 0.000 description 4
- UNBFGVQVQGXXCK-KKUMJFAQSA-N Phe-Ser-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O UNBFGVQVQGXXCK-KKUMJFAQSA-N 0.000 description 4
- YQQKYAZABFEYAF-FXQIFTODSA-N Ser-Glu-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O YQQKYAZABFEYAF-FXQIFTODSA-N 0.000 description 4
- 108010044940 alanylglutamine Proteins 0.000 description 4
- 108010013835 arginine glutamate Proteins 0.000 description 4
- 108010062796 arginyllysine Proteins 0.000 description 4
- 108010038633 aspartylglutamate Proteins 0.000 description 4
- 108010069495 cysteinyltyrosine Proteins 0.000 description 4
- 238000013461 design Methods 0.000 description 4
- 238000005516 engineering process Methods 0.000 description 4
- 230000002068 genetic effect Effects 0.000 description 4
- 108010050848 glycylleucine Proteins 0.000 description 4
- 108010037850 glycylvaline Proteins 0.000 description 4
- 108010025306 histidylleucine Proteins 0.000 description 4
- 238000003780 insertion Methods 0.000 description 4
- 230000037431 insertion Effects 0.000 description 4
- 108010057821 leucylproline Proteins 0.000 description 4
- 108010064235 lysylglycine Proteins 0.000 description 4
- 230000037434 nonsense mutation Effects 0.000 description 4
- 230000001717 pathogenic effect Effects 0.000 description 4
- 238000010839 reverse transcription Methods 0.000 description 4
- 238000007480 sanger sequencing Methods 0.000 description 4
- 238000012795 verification Methods 0.000 description 4
- QQACQIHVWCVBBR-GVARAGBVSA-N Ala-Ile-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QQACQIHVWCVBBR-GVARAGBVSA-N 0.000 description 3
- WQKAQKZRDIZYNV-VZFHVOOUSA-N Ala-Ser-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WQKAQKZRDIZYNV-VZFHVOOUSA-N 0.000 description 3
- LSMDIAAALJJLRO-XQXXSGGOSA-N Ala-Thr-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O LSMDIAAALJJLRO-XQXXSGGOSA-N 0.000 description 3
- OBFTYSPXDRROQO-SRVKXCTJSA-N Arg-Gln-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCCN=C(N)N OBFTYSPXDRROQO-SRVKXCTJSA-N 0.000 description 3
- CRCCTGPNZUCAHE-DCAQKATOSA-N Arg-His-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CN=CN1 CRCCTGPNZUCAHE-DCAQKATOSA-N 0.000 description 3
- BTJVOUQWFXABOI-IHRRRGAJSA-N Arg-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCNC(N)=N BTJVOUQWFXABOI-IHRRRGAJSA-N 0.000 description 3
- KMFPQTITXUKJOV-DCAQKATOSA-N Arg-Ser-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O KMFPQTITXUKJOV-DCAQKATOSA-N 0.000 description 3
- FTMRPIVPSDVGCC-GUBZILKMSA-N Arg-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N FTMRPIVPSDVGCC-GUBZILKMSA-N 0.000 description 3
- GMRGSBAMMMVDGG-GUBZILKMSA-N Asn-Arg-Arg Chemical compound C(C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)CN=C(N)N GMRGSBAMMMVDGG-GUBZILKMSA-N 0.000 description 3
- BEHQTVDBCLSCBY-CFMVVWHZSA-N Asn-Tyr-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BEHQTVDBCLSCBY-CFMVVWHZSA-N 0.000 description 3
- PQKSVQSMTHPRIB-ZKWXMUAHSA-N Asn-Val-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O PQKSVQSMTHPRIB-ZKWXMUAHSA-N 0.000 description 3
- RGKKALNPOYURGE-ZKWXMUAHSA-N Asp-Ala-Val Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O RGKKALNPOYURGE-ZKWXMUAHSA-N 0.000 description 3
- AHWRSSLYSGLBGD-CIUDSAMLSA-N Asp-Pro-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O AHWRSSLYSGLBGD-CIUDSAMLSA-N 0.000 description 3
- 102000002804 Ataxia Telangiectasia Mutated Proteins Human genes 0.000 description 3
- 108091026890 Coding region Proteins 0.000 description 3
- VIRYODQIWJNWNU-NRPADANISA-N Cys-Glu-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CS)N VIRYODQIWJNWNU-NRPADANISA-N 0.000 description 3
- KVCJEMHFLGVINV-ZLUOBGJFSA-N Cys-Ser-Asn Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(N)=O KVCJEMHFLGVINV-ZLUOBGJFSA-N 0.000 description 3
- 208000031448 Genomic Instability Diseases 0.000 description 3
- SHERTACNJPYHAR-ACZMJKKPSA-N Gln-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O SHERTACNJPYHAR-ACZMJKKPSA-N 0.000 description 3
- UICOTGULOUGGLC-NUMRIWBASA-N Gln-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N)O UICOTGULOUGGLC-NUMRIWBASA-N 0.000 description 3
- DQLVHRFFBQOWFL-JYJNAYRXSA-N Gln-Lys-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)N)N)O DQLVHRFFBQOWFL-JYJNAYRXSA-N 0.000 description 3
- JILRMFFFCHUUTJ-ACZMJKKPSA-N Gln-Ser-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O JILRMFFFCHUUTJ-ACZMJKKPSA-N 0.000 description 3
- QXDXIXFSFHUYAX-MNXVOIDGSA-N Glu-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O QXDXIXFSFHUYAX-MNXVOIDGSA-N 0.000 description 3
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 3
- FVGOGEGGQLNZGH-DZKIICNBSA-N Glu-Val-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 FVGOGEGGQLNZGH-DZKIICNBSA-N 0.000 description 3
- RLFSBAPJTYKSLG-WHFBIAKZSA-N Gly-Ala-Asp Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O RLFSBAPJTYKSLG-WHFBIAKZSA-N 0.000 description 3
- UHPAZODVFFYEEL-QWRGUYRKSA-N Gly-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN UHPAZODVFFYEEL-QWRGUYRKSA-N 0.000 description 3
- LYSMQLXUCAKELQ-DCAQKATOSA-N His-Asp-Arg Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N LYSMQLXUCAKELQ-DCAQKATOSA-N 0.000 description 3
- VFBZWZXKCVBTJR-SRVKXCTJSA-N His-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N VFBZWZXKCVBTJR-SRVKXCTJSA-N 0.000 description 3
- RWIKBYVJQAJYDP-BJDJZHNGSA-N Ile-Ala-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RWIKBYVJQAJYDP-BJDJZHNGSA-N 0.000 description 3
- KBHYLOIVRVBBEB-JBDRJPRFSA-N Ile-Cys-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CS)C(=O)O)N KBHYLOIVRVBBEB-JBDRJPRFSA-N 0.000 description 3
- ZDNORQNHCJUVOV-KBIXCLLPSA-N Ile-Gln-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O ZDNORQNHCJUVOV-KBIXCLLPSA-N 0.000 description 3
- LKACSKJPTFSBHR-MNXVOIDGSA-N Ile-Gln-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N LKACSKJPTFSBHR-MNXVOIDGSA-N 0.000 description 3
- LPFBXFILACZHIB-LAEOZQHASA-N Ile-Gly-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)O)C(=O)O)N LPFBXFILACZHIB-LAEOZQHASA-N 0.000 description 3
- KLBVGHCGHUNHEA-BJDJZHNGSA-N Ile-Leu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)O)N KLBVGHCGHUNHEA-BJDJZHNGSA-N 0.000 description 3
- HPCFRQWLTRDGHT-AJNGGQMLSA-N Ile-Leu-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O HPCFRQWLTRDGHT-AJNGGQMLSA-N 0.000 description 3
- TVYWVSJGSHQWMT-AJNGGQMLSA-N Ile-Leu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N TVYWVSJGSHQWMT-AJNGGQMLSA-N 0.000 description 3
- PMMMQRVUMVURGJ-XUXIUFHCSA-N Ile-Leu-Pro Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O PMMMQRVUMVURGJ-XUXIUFHCSA-N 0.000 description 3
- PXKACEXYLPBMAD-JBDRJPRFSA-N Ile-Ser-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PXKACEXYLPBMAD-JBDRJPRFSA-N 0.000 description 3
- 108010065920 Insulin Lispro Proteins 0.000 description 3
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 3
- USTCFDAQCLDPBD-XIRDDKMYSA-N Leu-Asn-Trp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N USTCFDAQCLDPBD-XIRDDKMYSA-N 0.000 description 3
- WCTCIIAGNMFYAO-DCAQKATOSA-N Leu-Cys-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O WCTCIIAGNMFYAO-DCAQKATOSA-N 0.000 description 3
- KAFOIVJDVSZUMD-UHFFFAOYSA-N Leu-Gln-Gln Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)NC(CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-UHFFFAOYSA-N 0.000 description 3
- WIDZHJTYKYBLSR-DCAQKATOSA-N Leu-Glu-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WIDZHJTYKYBLSR-DCAQKATOSA-N 0.000 description 3
- QVFGXCVIXXBFHO-AVGNSLFASA-N Leu-Glu-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O QVFGXCVIXXBFHO-AVGNSLFASA-N 0.000 description 3
- FEHQLKKBVJHSEC-SZMVWBNQSA-N Leu-Glu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 FEHQLKKBVJHSEC-SZMVWBNQSA-N 0.000 description 3
- AVEGDIAXTDVBJS-XUXIUFHCSA-N Leu-Ile-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AVEGDIAXTDVBJS-XUXIUFHCSA-N 0.000 description 3
- LXKNSJLSGPNHSK-KKUMJFAQSA-N Leu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N LXKNSJLSGPNHSK-KKUMJFAQSA-N 0.000 description 3
- ZRHDPZAAWLXXIR-SRVKXCTJSA-N Leu-Lys-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O ZRHDPZAAWLXXIR-SRVKXCTJSA-N 0.000 description 3
- RZXLZBIUTDQHJQ-SRVKXCTJSA-N Leu-Lys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O RZXLZBIUTDQHJQ-SRVKXCTJSA-N 0.000 description 3
- PTRKPHUGYULXPU-KKUMJFAQSA-N Leu-Phe-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O PTRKPHUGYULXPU-KKUMJFAQSA-N 0.000 description 3
- KWLWZYMNUZJKMZ-IHRRRGAJSA-N Leu-Pro-Leu Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O KWLWZYMNUZJKMZ-IHRRRGAJSA-N 0.000 description 3
- AMSSKPUHBUQBOQ-SRVKXCTJSA-N Leu-Ser-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N AMSSKPUHBUQBOQ-SRVKXCTJSA-N 0.000 description 3
- NQCJGQHHYZNUDK-DCAQKATOSA-N Lys-Arg-Ser Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CCCN=C(N)N NQCJGQHHYZNUDK-DCAQKATOSA-N 0.000 description 3
- NRQRKMYZONPCTM-CIUDSAMLSA-N Lys-Asp-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O NRQRKMYZONPCTM-CIUDSAMLSA-N 0.000 description 3
- ZAWOJFFMBANLGE-CIUDSAMLSA-N Lys-Cys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCCCN)N ZAWOJFFMBANLGE-CIUDSAMLSA-N 0.000 description 3
- MLLKLNYPZRDIQG-GUBZILKMSA-N Lys-Cys-Gln Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N MLLKLNYPZRDIQG-GUBZILKMSA-N 0.000 description 3
- DFXQCCBKGUNYGG-GUBZILKMSA-N Lys-Gln-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCCCN DFXQCCBKGUNYGG-GUBZILKMSA-N 0.000 description 3
- DRCILAJNUJKAHC-SRVKXCTJSA-N Lys-Glu-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O DRCILAJNUJKAHC-SRVKXCTJSA-N 0.000 description 3
- WKUXWMWQTOYTFI-SRVKXCTJSA-N Lys-Met-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N WKUXWMWQTOYTFI-SRVKXCTJSA-N 0.000 description 3
- AZOFEHCPMBRNFD-BZSNNMDCSA-N Lys-Phe-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CC=CC=C1 AZOFEHCPMBRNFD-BZSNNMDCSA-N 0.000 description 3
- RPWTZTBIFGENIA-VOAKCMCISA-N Lys-Thr-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RPWTZTBIFGENIA-VOAKCMCISA-N 0.000 description 3
- MCNGIXXCMJAURZ-VEVYYDQMSA-N Met-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCSC)N)O MCNGIXXCMJAURZ-VEVYYDQMSA-N 0.000 description 3
- KMSMNUFBNCHMII-IHRRRGAJSA-N Met-Leu-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN KMSMNUFBNCHMII-IHRRRGAJSA-N 0.000 description 3
- HLZORBMOISUNIV-DCAQKATOSA-N Met-Ser-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C HLZORBMOISUNIV-DCAQKATOSA-N 0.000 description 3
- GNUCSNWOCQFMMC-UFYCRDLUSA-N Phe-Arg-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 GNUCSNWOCQFMMC-UFYCRDLUSA-N 0.000 description 3
- WIVCOAKLPICYGY-KKUMJFAQSA-N Phe-Asp-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N WIVCOAKLPICYGY-KKUMJFAQSA-N 0.000 description 3
- FMMIYCMOVGXZIP-AVGNSLFASA-N Phe-Glu-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O FMMIYCMOVGXZIP-AVGNSLFASA-N 0.000 description 3
- VCYJKOLZYPYGJV-AVGNSLFASA-N Pro-Arg-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O VCYJKOLZYPYGJV-AVGNSLFASA-N 0.000 description 3
- 238000002123 RNA extraction Methods 0.000 description 3
- VAUMZJHYZQXZBQ-WHFBIAKZSA-N Ser-Asn-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O VAUMZJHYZQXZBQ-WHFBIAKZSA-N 0.000 description 3
- QYSFWUIXDFJUDW-DCAQKATOSA-N Ser-Leu-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QYSFWUIXDFJUDW-DCAQKATOSA-N 0.000 description 3
- BYCVMHKULKRVPV-GUBZILKMSA-N Ser-Lys-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O BYCVMHKULKRVPV-GUBZILKMSA-N 0.000 description 3
- FLMYSKVSDVHLEW-SVSWQMSJSA-N Ser-Thr-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FLMYSKVSDVHLEW-SVSWQMSJSA-N 0.000 description 3
- GLQFKOVWXPPFTP-VEVYYDQMSA-N Thr-Arg-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O GLQFKOVWXPPFTP-VEVYYDQMSA-N 0.000 description 3
- VPRHDRKAPYZMHL-SZMVWBNQSA-N Trp-Leu-Glu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O)=CNC2=C1 VPRHDRKAPYZMHL-SZMVWBNQSA-N 0.000 description 3
- TWAVEIJGFCBWCG-JYJNAYRXSA-N Tyr-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N TWAVEIJGFCBWCG-JYJNAYRXSA-N 0.000 description 3
- PMHLLBKTDHQMCY-ULQDDVLXSA-N Tyr-Lys-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PMHLLBKTDHQMCY-ULQDDVLXSA-N 0.000 description 3
- FASACHWGQBNSRO-ZEWNOJEFSA-N Tyr-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CC2=CC=C(C=C2)O)N FASACHWGQBNSRO-ZEWNOJEFSA-N 0.000 description 3
- GBESYURLQOYWLU-LAEOZQHASA-N Val-Glu-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N GBESYURLQOYWLU-LAEOZQHASA-N 0.000 description 3
- FEXILLGKGGTLRI-NHCYSSNCSA-N Val-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N FEXILLGKGGTLRI-NHCYSSNCSA-N 0.000 description 3
- BTWMICVCQLKKNR-DCAQKATOSA-N Val-Leu-Ser Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C([O-])=O BTWMICVCQLKKNR-DCAQKATOSA-N 0.000 description 3
- IJGPOONOTBNTFS-GVXVVHGQSA-N Val-Lys-Glu Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O IJGPOONOTBNTFS-GVXVVHGQSA-N 0.000 description 3
- 108010092854 aspartyllysine Proteins 0.000 description 3
- 108010068265 aspartyltyrosine Proteins 0.000 description 3
- 230000015572 biosynthetic process Effects 0.000 description 3
- 230000012820 cell cycle checkpoint Effects 0.000 description 3
- 238000003745 diagnosis Methods 0.000 description 3
- 231100000221 frame shift mutation induction Toxicity 0.000 description 3
- 230000037433 frameshift Effects 0.000 description 3
- 108010078144 glutaminyl-glycine Proteins 0.000 description 3
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 3
- 108010073628 glutamyl-valyl-phenylalanine Proteins 0.000 description 3
- 108010090037 glycyl-alanyl-isoleucine Proteins 0.000 description 3
- 108010089804 glycyl-threonine Proteins 0.000 description 3
- 238000012165 high-throughput sequencing Methods 0.000 description 3
- 108010092114 histidylphenylalanine Proteins 0.000 description 3
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 3
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 3
- 108010003700 lysyl aspartic acid Proteins 0.000 description 3
- 108010057952 lysyl-phenylalanyl-lysine Proteins 0.000 description 3
- 108010054155 lysyllysine Proteins 0.000 description 3
- 239000013610 patient sample Substances 0.000 description 3
- 210000005259 peripheral blood Anatomy 0.000 description 3
- 239000011886 peripheral blood Substances 0.000 description 3
- 230000002093 peripheral effect Effects 0.000 description 3
- 108010072637 phenylalanyl-arginyl-phenylalanine Proteins 0.000 description 3
- 102000054765 polymorphisms of proteins Human genes 0.000 description 3
- 238000002360 preparation method Methods 0.000 description 3
- 108010015796 prolylisoleucine Proteins 0.000 description 3
- 238000011160 research Methods 0.000 description 3
- 230000035945 sensitivity Effects 0.000 description 3
- 239000007787 solid Substances 0.000 description 3
- 108010015666 tryptophyl-leucyl-glutamic acid Proteins 0.000 description 3
- 108010073969 valyllysine Proteins 0.000 description 3
- 238000007482 whole exome sequencing Methods 0.000 description 3
- XEXJJJRVTFGWIC-FXQIFTODSA-N Ala-Asn-Arg Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N XEXJJJRVTFGWIC-FXQIFTODSA-N 0.000 description 2
- GSCLWXDNIMNIJE-ZLUOBGJFSA-N Ala-Asp-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O GSCLWXDNIMNIJE-ZLUOBGJFSA-N 0.000 description 2
- RXTBLQVXNIECFP-FXQIFTODSA-N Ala-Gln-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O RXTBLQVXNIECFP-FXQIFTODSA-N 0.000 description 2
- NJPMYXWVWQWCSR-ACZMJKKPSA-N Ala-Glu-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O NJPMYXWVWQWCSR-ACZMJKKPSA-N 0.000 description 2
- BEMGNWZECGIJOI-WDSKDSINSA-N Ala-Gly-Glu Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O BEMGNWZECGIJOI-WDSKDSINSA-N 0.000 description 2
- FOHXUHGZZKETFI-JBDRJPRFSA-N Ala-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C)N FOHXUHGZZKETFI-JBDRJPRFSA-N 0.000 description 2
- NINQYGGNRIBFSC-CIUDSAMLSA-N Ala-Lys-Ser Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CO)C(O)=O NINQYGGNRIBFSC-CIUDSAMLSA-N 0.000 description 2
- OINVDEKBKBCPLX-JXUBOQSCSA-N Ala-Lys-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OINVDEKBKBCPLX-JXUBOQSCSA-N 0.000 description 2
- SGFBVLBKDSXGAP-GKCIPKSASA-N Ala-Phe-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)N SGFBVLBKDSXGAP-GKCIPKSASA-N 0.000 description 2
- IHMCQESUJVZTKW-UBHSHLNASA-N Ala-Phe-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 IHMCQESUJVZTKW-UBHSHLNASA-N 0.000 description 2
- VQAVBBCZFQAAED-FXQIFTODSA-N Ala-Pro-Asn Chemical compound C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)N)C(=O)O)N VQAVBBCZFQAAED-FXQIFTODSA-N 0.000 description 2
- PEEYDECOOVQKRZ-DLOVCJGASA-N Ala-Ser-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PEEYDECOOVQKRZ-DLOVCJGASA-N 0.000 description 2
- ZJLORAAXDAJLDC-CQDKDKBSSA-N Ala-Tyr-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O ZJLORAAXDAJLDC-CQDKDKBSSA-N 0.000 description 2
- XAXMJQUMRJAFCH-CQDKDKBSSA-N Ala-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 XAXMJQUMRJAFCH-CQDKDKBSSA-N 0.000 description 2
- XSLGWYYNOSUMRM-ZKWXMUAHSA-N Ala-Val-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XSLGWYYNOSUMRM-ZKWXMUAHSA-N 0.000 description 2
- BOKLLPVAQDSLHC-FXQIFTODSA-N Ala-Val-Cys Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(=O)O)N BOKLLPVAQDSLHC-FXQIFTODSA-N 0.000 description 2
- XCIGOVDXZULBBV-DCAQKATOSA-N Ala-Val-Lys Chemical compound CC(C)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CCCCN)C(O)=O XCIGOVDXZULBBV-DCAQKATOSA-N 0.000 description 2
- OMSKGWFGWCQFBD-KZVJFYERSA-N Ala-Val-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OMSKGWFGWCQFBD-KZVJFYERSA-N 0.000 description 2
- QPOARHANPULOTM-GMOBBJLQSA-N Arg-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N QPOARHANPULOTM-GMOBBJLQSA-N 0.000 description 2
- RWCLSUOSKWTXLA-FXQIFTODSA-N Arg-Asp-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O RWCLSUOSKWTXLA-FXQIFTODSA-N 0.000 description 2
- UABKKMXFBRQKKI-BJDJZHNGSA-N Arg-Cys-Ser-Arg Chemical compound N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O UABKKMXFBRQKKI-BJDJZHNGSA-N 0.000 description 2
- VDBKFYYIBLXEIF-GUBZILKMSA-N Arg-Gln-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VDBKFYYIBLXEIF-GUBZILKMSA-N 0.000 description 2
- DJAIOAKQIOGULM-DCAQKATOSA-N Arg-Glu-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O DJAIOAKQIOGULM-DCAQKATOSA-N 0.000 description 2
- UFBURHXMKFQVLM-CIUDSAMLSA-N Arg-Glu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O UFBURHXMKFQVLM-CIUDSAMLSA-N 0.000 description 2
- RFXXUWGNVRJTNQ-QXEWZRGKSA-N Arg-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCCN=C(N)N)N RFXXUWGNVRJTNQ-QXEWZRGKSA-N 0.000 description 2
- ZATRYQNPUHGXCU-DTWKUNHWSA-N Arg-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ZATRYQNPUHGXCU-DTWKUNHWSA-N 0.000 description 2
- UAOSDDXCTBIPCA-QXEWZRGKSA-N Arg-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UAOSDDXCTBIPCA-QXEWZRGKSA-N 0.000 description 2
- PZBSKYJGKNNYNK-ULQDDVLXSA-N Arg-Leu-Tyr Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCCN=C(N)N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O PZBSKYJGKNNYNK-ULQDDVLXSA-N 0.000 description 2
- AUIJUTGLPVHIRT-FXQIFTODSA-N Arg-Ser-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N)CN=C(N)N AUIJUTGLPVHIRT-FXQIFTODSA-N 0.000 description 2
- JPAWCMXVNZPJLO-IHRRRGAJSA-N Arg-Ser-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JPAWCMXVNZPJLO-IHRRRGAJSA-N 0.000 description 2
- OQPAZKMGCWPERI-GUBZILKMSA-N Arg-Ser-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O OQPAZKMGCWPERI-GUBZILKMSA-N 0.000 description 2
- POOCJCRBHHMAOS-FXQIFTODSA-N Asn-Arg-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O POOCJCRBHHMAOS-FXQIFTODSA-N 0.000 description 2
- LUVODTFFSXVOAG-ACZMJKKPSA-N Asn-Cys-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)N)N LUVODTFFSXVOAG-ACZMJKKPSA-N 0.000 description 2
- XWFPGQVLOVGSLU-CIUDSAMLSA-N Asn-Gln-Arg Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N XWFPGQVLOVGSLU-CIUDSAMLSA-N 0.000 description 2
- UPALZCBCKAMGIY-PEFMBERDSA-N Asn-Gln-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UPALZCBCKAMGIY-PEFMBERDSA-N 0.000 description 2
- XVAPVJNJGLWGCS-ACZMJKKPSA-N Asn-Glu-Asn Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N XVAPVJNJGLWGCS-ACZMJKKPSA-N 0.000 description 2
- MSBDSTRUMZFSEU-PEFMBERDSA-N Asn-Glu-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MSBDSTRUMZFSEU-PEFMBERDSA-N 0.000 description 2
- HYQYLOSCICEYTR-YUMQZZPRSA-N Asn-Gly-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O HYQYLOSCICEYTR-YUMQZZPRSA-N 0.000 description 2
- XVBDDUPJVQXDSI-PEFMBERDSA-N Asn-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N XVBDDUPJVQXDSI-PEFMBERDSA-N 0.000 description 2
- BXUHCIXDSWRSBS-CIUDSAMLSA-N Asn-Leu-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BXUHCIXDSWRSBS-CIUDSAMLSA-N 0.000 description 2
- HDHZCEDPLTVHFZ-GUBZILKMSA-N Asn-Leu-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O HDHZCEDPLTVHFZ-GUBZILKMSA-N 0.000 description 2
- YVXRYLVELQYAEQ-SRVKXCTJSA-N Asn-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N YVXRYLVELQYAEQ-SRVKXCTJSA-N 0.000 description 2
- UBGGJTMETLEXJD-DCAQKATOSA-N Asn-Leu-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O UBGGJTMETLEXJD-DCAQKATOSA-N 0.000 description 2
- JLNFZLNDHONLND-GARJFASQSA-N Asn-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N JLNFZLNDHONLND-GARJFASQSA-N 0.000 description 2
- TZFQICWZWFNIKU-KKUMJFAQSA-N Asn-Leu-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 TZFQICWZWFNIKU-KKUMJFAQSA-N 0.000 description 2
- FTSAJSADJCMDHH-CIUDSAMLSA-N Asn-Lys-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N FTSAJSADJCMDHH-CIUDSAMLSA-N 0.000 description 2
- AYOAHKWVQLNPDM-HJGDQZAQSA-N Asn-Lys-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AYOAHKWVQLNPDM-HJGDQZAQSA-N 0.000 description 2
- MDDXKBHIMYYJLW-FXQIFTODSA-N Asn-Met-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N MDDXKBHIMYYJLW-FXQIFTODSA-N 0.000 description 2
- NNDSLVWAQAUPPP-GUBZILKMSA-N Asn-Met-Met Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)N)N NNDSLVWAQAUPPP-GUBZILKMSA-N 0.000 description 2
- FTNRWCPWDWRPAV-BZSNNMDCSA-N Asn-Phe-Phe Chemical compound C([C@H](NC(=O)[C@H](CC(N)=O)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 FTNRWCPWDWRPAV-BZSNNMDCSA-N 0.000 description 2
- ZJIFRAPZHAGLGR-MELADBBJSA-N Asn-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC(=O)N)N)C(=O)O ZJIFRAPZHAGLGR-MELADBBJSA-N 0.000 description 2
- GZXOUBTUAUAVHD-ACZMJKKPSA-N Asn-Ser-Glu Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O GZXOUBTUAUAVHD-ACZMJKKPSA-N 0.000 description 2
- XEGZSHSPQNDNRH-JRQIVUDYSA-N Asn-Tyr-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XEGZSHSPQNDNRH-JRQIVUDYSA-N 0.000 description 2
- KDFQZBWWPYQBEN-ZLUOBGJFSA-N Asp-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N KDFQZBWWPYQBEN-ZLUOBGJFSA-N 0.000 description 2
- NJIKKGUVGUBICV-ZLUOBGJFSA-N Asp-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O NJIKKGUVGUBICV-ZLUOBGJFSA-N 0.000 description 2
- WEDGJJRCJNHYSF-SRVKXCTJSA-N Asp-Cys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)O)N WEDGJJRCJNHYSF-SRVKXCTJSA-N 0.000 description 2
- WSXDIZFNQYTUJB-SRVKXCTJSA-N Asp-His-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O WSXDIZFNQYTUJB-SRVKXCTJSA-N 0.000 description 2
- YRBGRUOSJROZEI-NHCYSSNCSA-N Asp-His-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(O)=O YRBGRUOSJROZEI-NHCYSSNCSA-N 0.000 description 2
- LBFYTUPYYZENIR-GHCJXIJMSA-N Asp-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N LBFYTUPYYZENIR-GHCJXIJMSA-N 0.000 description 2
- HOBNTSHITVVNBN-ZPFDUUQYSA-N Asp-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N HOBNTSHITVVNBN-ZPFDUUQYSA-N 0.000 description 2
- SCQIQCWLOMOEFP-DCAQKATOSA-N Asp-Leu-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O SCQIQCWLOMOEFP-DCAQKATOSA-N 0.000 description 2
- NONWUQAWAANERO-BZSNNMDCSA-N Asp-Phe-Tyr Chemical compound C([C@H](NC(=O)[C@H](CC(O)=O)N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 NONWUQAWAANERO-BZSNNMDCSA-N 0.000 description 2
- QSFHZPQUAAQHAQ-CIUDSAMLSA-N Asp-Ser-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O QSFHZPQUAAQHAQ-CIUDSAMLSA-N 0.000 description 2
- MGSVBZIBCCKGCY-ZLUOBGJFSA-N Asp-Ser-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MGSVBZIBCCKGCY-ZLUOBGJFSA-N 0.000 description 2
- NAAAPCLFJPURAM-HJGDQZAQSA-N Asp-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O NAAAPCLFJPURAM-HJGDQZAQSA-N 0.000 description 2
- 208000037051 Chromosomal Instability Diseases 0.000 description 2
- 208000031404 Chromosome Aberrations Diseases 0.000 description 2
- 208000037088 Chromosome Breakage Diseases 0.000 description 2
- TVYMKYUSZSVOAG-ZLUOBGJFSA-N Cys-Ala-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O TVYMKYUSZSVOAG-ZLUOBGJFSA-N 0.000 description 2
- QLCPDGRAEJSYQM-LPEHRKFASA-N Cys-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CS)N)C(=O)O QLCPDGRAEJSYQM-LPEHRKFASA-N 0.000 description 2
- XABFFGOGKOORCG-CIUDSAMLSA-N Cys-Asp-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O XABFFGOGKOORCG-CIUDSAMLSA-N 0.000 description 2
- IIGHQOPGMGKDMT-SRVKXCTJSA-N Cys-Asp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N IIGHQOPGMGKDMT-SRVKXCTJSA-N 0.000 description 2
- MGAWEOHYNIMOQJ-ACZMJKKPSA-N Cys-Gln-Asp Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N MGAWEOHYNIMOQJ-ACZMJKKPSA-N 0.000 description 2
- CFQVGYWKSLKWFX-KBIXCLLPSA-N Cys-Glu-Ile Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CFQVGYWKSLKWFX-KBIXCLLPSA-N 0.000 description 2
- UPURLDIGQGTUPJ-ZKWXMUAHSA-N Cys-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CS)N UPURLDIGQGTUPJ-ZKWXMUAHSA-N 0.000 description 2
- VTJLJQGUMBWHBP-GUBZILKMSA-N Cys-His-Gln Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CS)N VTJLJQGUMBWHBP-GUBZILKMSA-N 0.000 description 2
- HAYVLBZZBDCKRA-SRVKXCTJSA-N Cys-His-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CS)N HAYVLBZZBDCKRA-SRVKXCTJSA-N 0.000 description 2
- NIXHTNJAGGFBAW-CIUDSAMLSA-N Cys-Lys-Ser Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N NIXHTNJAGGFBAW-CIUDSAMLSA-N 0.000 description 2
- ZGERHCJBLPQPGV-ACZMJKKPSA-N Cys-Ser-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CS)N ZGERHCJBLPQPGV-ACZMJKKPSA-N 0.000 description 2
- ABLQPNMKLMFDQU-BIIVOSGPSA-N Cys-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CS)N)C(=O)O ABLQPNMKLMFDQU-BIIVOSGPSA-N 0.000 description 2
- JIZRUFJGHPIYPS-SRVKXCTJSA-N Cys-Tyr-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CS)N)O JIZRUFJGHPIYPS-SRVKXCTJSA-N 0.000 description 2
- 238000001712 DNA sequencing Methods 0.000 description 2
- 102000004190 Enzymes Human genes 0.000 description 2
- 108090000790 Enzymes Proteins 0.000 description 2
- 241000588724 Escherichia coli Species 0.000 description 2
- 108091092584 GDNA Proteins 0.000 description 2
- RGRMOYQUIJVQQD-SRVKXCTJSA-N Gln-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)N)N RGRMOYQUIJVQQD-SRVKXCTJSA-N 0.000 description 2
- RRYLMJWPWBJFPZ-ACZMJKKPSA-N Gln-Asn-Asp Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N RRYLMJWPWBJFPZ-ACZMJKKPSA-N 0.000 description 2
- KWLMLNHADZIJIS-CIUDSAMLSA-N Gln-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N KWLMLNHADZIJIS-CIUDSAMLSA-N 0.000 description 2
- UZMWDBOHAOSCCH-ACZMJKKPSA-N Gln-Cys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CCC(N)=O UZMWDBOHAOSCCH-ACZMJKKPSA-N 0.000 description 2
- PKVWNYGXMNWJSI-CIUDSAMLSA-N Gln-Gln-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O PKVWNYGXMNWJSI-CIUDSAMLSA-N 0.000 description 2
- CGVWDTRDPLOMHZ-FXQIFTODSA-N Gln-Glu-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O CGVWDTRDPLOMHZ-FXQIFTODSA-N 0.000 description 2
- MAGNEQBFSBREJL-DCAQKATOSA-N Gln-Glu-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N MAGNEQBFSBREJL-DCAQKATOSA-N 0.000 description 2
- CLPQUWHBWXFJOX-BQBZGAKWSA-N Gln-Gly-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O CLPQUWHBWXFJOX-BQBZGAKWSA-N 0.000 description 2
- SMLDOQHTOAAFJQ-WDSKDSINSA-N Gln-Gly-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SMLDOQHTOAAFJQ-WDSKDSINSA-N 0.000 description 2
- HHQCBFGKQDMWSP-GUBZILKMSA-N Gln-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HHQCBFGKQDMWSP-GUBZILKMSA-N 0.000 description 2
- TWIAMTNJOMRDAK-GUBZILKMSA-N Gln-Lys-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O TWIAMTNJOMRDAK-GUBZILKMSA-N 0.000 description 2
- LURQDGKYBFWWJA-MNXVOIDGSA-N Gln-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)N)N LURQDGKYBFWWJA-MNXVOIDGSA-N 0.000 description 2
- ZEEPYMXTJWIMSN-GUBZILKMSA-N Gln-Lys-Ser Chemical compound NCCCC[C@@H](C(=O)N[C@@H](CO)C(O)=O)NC(=O)[C@@H](N)CCC(N)=O ZEEPYMXTJWIMSN-GUBZILKMSA-N 0.000 description 2
- OTQSTOXRUBVWAP-NRPADANISA-N Gln-Ser-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O OTQSTOXRUBVWAP-NRPADANISA-N 0.000 description 2
- PAOHIZNRJNIXQY-XQXXSGGOSA-N Gln-Thr-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O PAOHIZNRJNIXQY-XQXXSGGOSA-N 0.000 description 2
- ZOXBSICWUDAOHX-GUBZILKMSA-N Glu-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCC(O)=O ZOXBSICWUDAOHX-GUBZILKMSA-N 0.000 description 2
- SBYVDRJAXWSXQL-AVGNSLFASA-N Glu-Asn-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SBYVDRJAXWSXQL-AVGNSLFASA-N 0.000 description 2
- VAZZOGXDUQSVQF-NUMRIWBASA-N Glu-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N)O VAZZOGXDUQSVQF-NUMRIWBASA-N 0.000 description 2
- JPHYJQHPILOKHC-ACZMJKKPSA-N Glu-Asp-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O JPHYJQHPILOKHC-ACZMJKKPSA-N 0.000 description 2
- SBCYJMOOHUDWDA-NUMRIWBASA-N Glu-Asp-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SBCYJMOOHUDWDA-NUMRIWBASA-N 0.000 description 2
- WATXSTJXNBOHKD-LAEOZQHASA-N Glu-Asp-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O WATXSTJXNBOHKD-LAEOZQHASA-N 0.000 description 2
- OXEMJGCAJFFREE-FXQIFTODSA-N Glu-Gln-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O OXEMJGCAJFFREE-FXQIFTODSA-N 0.000 description 2
- BUZMZDDKFCSKOT-CIUDSAMLSA-N Glu-Glu-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BUZMZDDKFCSKOT-CIUDSAMLSA-N 0.000 description 2
- MUSGDMDGNGXULI-DCAQKATOSA-N Glu-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O MUSGDMDGNGXULI-DCAQKATOSA-N 0.000 description 2
- YLJHCWNDBKKOEB-IHRRRGAJSA-N Glu-Glu-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O YLJHCWNDBKKOEB-IHRRRGAJSA-N 0.000 description 2
- BUAKRRKDHSSIKK-IHRRRGAJSA-N Glu-Glu-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 BUAKRRKDHSSIKK-IHRRRGAJSA-N 0.000 description 2
- MTAOBYXRYJZRGQ-WDSKDSINSA-N Glu-Gly-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MTAOBYXRYJZRGQ-WDSKDSINSA-N 0.000 description 2
- VOORMNJKNBGYGK-YUMQZZPRSA-N Glu-Gly-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)O)N VOORMNJKNBGYGK-YUMQZZPRSA-N 0.000 description 2
- XOFYVODYSNKPDK-AVGNSLFASA-N Glu-His-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCC(=O)O)N XOFYVODYSNKPDK-AVGNSLFASA-N 0.000 description 2
- ZHNHJYYFCGUZNQ-KBIXCLLPSA-N Glu-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O ZHNHJYYFCGUZNQ-KBIXCLLPSA-N 0.000 description 2
- ZSWGJYOZWBHROQ-RWRJDSDZSA-N Glu-Ile-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZSWGJYOZWBHROQ-RWRJDSDZSA-N 0.000 description 2
- INGJLBQKTRJLFO-UKJIMTQDSA-N Glu-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O INGJLBQKTRJLFO-UKJIMTQDSA-N 0.000 description 2
- ATVYZJGOZLVXDK-IUCAKERBSA-N Glu-Leu-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O ATVYZJGOZLVXDK-IUCAKERBSA-N 0.000 description 2
- WNRZUESNGGDCJX-JYJNAYRXSA-N Glu-Leu-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WNRZUESNGGDCJX-JYJNAYRXSA-N 0.000 description 2
- YKBUCXNNBYZYAY-MNXVOIDGSA-N Glu-Lys-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YKBUCXNNBYZYAY-MNXVOIDGSA-N 0.000 description 2
- SUIAHERNFYRBDZ-GVXVVHGQSA-N Glu-Lys-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O SUIAHERNFYRBDZ-GVXVVHGQSA-N 0.000 description 2
- YUXIEONARHPUTK-JBACZVJFSA-N Glu-Phe-Trp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)NC(=O)[C@H](CCC(=O)O)N YUXIEONARHPUTK-JBACZVJFSA-N 0.000 description 2
- CQAHWYDHKUWYIX-YUMQZZPRSA-N Glu-Pro-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O CQAHWYDHKUWYIX-YUMQZZPRSA-N 0.000 description 2
- PAZQYODKOZHXGA-SRVKXCTJSA-N Glu-Pro-His Chemical compound N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O PAZQYODKOZHXGA-SRVKXCTJSA-N 0.000 description 2
- DAHLWSFUXOHMIA-FXQIFTODSA-N Glu-Ser-Gln Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O DAHLWSFUXOHMIA-FXQIFTODSA-N 0.000 description 2
- GUOWMVFLAJNPDY-CIUDSAMLSA-N Glu-Ser-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O GUOWMVFLAJNPDY-CIUDSAMLSA-N 0.000 description 2
- UCZXXMREFIETQW-AVGNSLFASA-N Glu-Tyr-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O UCZXXMREFIETQW-AVGNSLFASA-N 0.000 description 2
- KIEICAOUSNYOLM-NRPADANISA-N Glu-Val-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O KIEICAOUSNYOLM-NRPADANISA-N 0.000 description 2
- FGGKGJHCVMYGCD-UKJIMTQDSA-N Glu-Val-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FGGKGJHCVMYGCD-UKJIMTQDSA-N 0.000 description 2
- VIPDPMHGICREIS-GVXVVHGQSA-N Glu-Val-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VIPDPMHGICREIS-GVXVVHGQSA-N 0.000 description 2
- ZYRXTRTUCAVNBQ-GVXVVHGQSA-N Glu-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZYRXTRTUCAVNBQ-GVXVVHGQSA-N 0.000 description 2
- YMUFWNJHVPQNQD-ZKWXMUAHSA-N Gly-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN YMUFWNJHVPQNQD-ZKWXMUAHSA-N 0.000 description 2
- LERGJIVJIIODPZ-ZANVPECISA-N Gly-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)CN)C)C(O)=O)=CNC2=C1 LERGJIVJIIODPZ-ZANVPECISA-N 0.000 description 2
- QIZJOTQTCAGKPU-KWQFWETISA-N Gly-Ala-Tyr Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 QIZJOTQTCAGKPU-KWQFWETISA-N 0.000 description 2
- LCNXZQROPKFGQK-WHFBIAKZSA-N Gly-Asp-Ser Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O LCNXZQROPKFGQK-WHFBIAKZSA-N 0.000 description 2
- GNPVTZJUUBPZKW-WDSKDSINSA-N Gly-Gln-Ser Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O GNPVTZJUUBPZKW-WDSKDSINSA-N 0.000 description 2
- DHDOADIPGZTAHT-YUMQZZPRSA-N Gly-Glu-Arg Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DHDOADIPGZTAHT-YUMQZZPRSA-N 0.000 description 2
- JSNNHGHYGYMVCK-XVKPBYJWSA-N Gly-Glu-Val Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O JSNNHGHYGYMVCK-XVKPBYJWSA-N 0.000 description 2
- IUZGUFAJDBHQQV-YUMQZZPRSA-N Gly-Leu-Asn Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IUZGUFAJDBHQQV-YUMQZZPRSA-N 0.000 description 2
- CCBIBMKQNXHNIN-ZETCQYMHSA-N Gly-Leu-Gly Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O CCBIBMKQNXHNIN-ZETCQYMHSA-N 0.000 description 2
- GGAPHLIUUTVYMX-QWRGUYRKSA-N Gly-Phe-Ser Chemical compound OC[C@@H](C([O-])=O)NC(=O)[C@@H](NC(=O)C[NH3+])CC1=CC=CC=C1 GGAPHLIUUTVYMX-QWRGUYRKSA-N 0.000 description 2
- IRJWAYCXIYUHQE-WHFBIAKZSA-N Gly-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)CN IRJWAYCXIYUHQE-WHFBIAKZSA-N 0.000 description 2
- YOBGUCWZPXJHTN-BQBZGAKWSA-N Gly-Ser-Arg Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YOBGUCWZPXJHTN-BQBZGAKWSA-N 0.000 description 2
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 2
- RVKIPWVMZANZLI-UHFFFAOYSA-N H-Lys-Trp-OH Natural products C1=CC=C2C(CC(NC(=O)C(N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-UHFFFAOYSA-N 0.000 description 2
- SYMSVYVUSPSAAO-IHRRRGAJSA-N His-Arg-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O SYMSVYVUSPSAAO-IHRRRGAJSA-N 0.000 description 2
- MPXGJGBXCRQQJE-MXAVVETBSA-N His-Ile-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O MPXGJGBXCRQQJE-MXAVVETBSA-N 0.000 description 2
- BPOHQCZZSFBSON-KKUMJFAQSA-N His-Leu-His Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)Cc1cnc[nH]1)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O BPOHQCZZSFBSON-KKUMJFAQSA-N 0.000 description 2
- LVWIJITYHRZHBO-IXOXFDKPSA-N His-Leu-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LVWIJITYHRZHBO-IXOXFDKPSA-N 0.000 description 2
- MTHDIEPOBSRDIV-ULQDDVLXSA-N His-Met-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CN=CN1 MTHDIEPOBSRDIV-ULQDDVLXSA-N 0.000 description 2
- RLAOTFTXBFQJDV-KKUMJFAQSA-N His-Phe-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CN=CN1 RLAOTFTXBFQJDV-KKUMJFAQSA-N 0.000 description 2
- UWNUQPZUSRFIIN-JUKXBJQTSA-N His-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CN=CN2)N UWNUQPZUSRFIIN-JUKXBJQTSA-N 0.000 description 2
- QLBXWYXMLHAREM-PYJNHQTQSA-N His-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CN=CN1)N QLBXWYXMLHAREM-PYJNHQTQSA-N 0.000 description 2
- FFYYUUWROYYKFY-IHRRRGAJSA-N His-Val-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O FFYYUUWROYYKFY-IHRRRGAJSA-N 0.000 description 2
- DRKZDEFADVYTLU-AVGNSLFASA-N His-Val-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O DRKZDEFADVYTLU-AVGNSLFASA-N 0.000 description 2
- NULSANWBUWLTKN-NAKRPEOUSA-N Ile-Arg-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N NULSANWBUWLTKN-NAKRPEOUSA-N 0.000 description 2
- LDRALPZEVHVXEK-KBIXCLLPSA-N Ile-Cys-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N LDRALPZEVHVXEK-KBIXCLLPSA-N 0.000 description 2
- LPXHYGGZJOCAFR-MNXVOIDGSA-N Ile-Glu-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N LPXHYGGZJOCAFR-MNXVOIDGSA-N 0.000 description 2
- AMSYMDIIIRJRKZ-HJPIBITLSA-N Ile-His-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N AMSYMDIIIRJRKZ-HJPIBITLSA-N 0.000 description 2
- SJLVSMMIFYTSGY-GRLWGSQLSA-N Ile-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SJLVSMMIFYTSGY-GRLWGSQLSA-N 0.000 description 2
- TWYOYAKMLHWMOJ-ZPFDUUQYSA-N Ile-Leu-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O TWYOYAKMLHWMOJ-ZPFDUUQYSA-N 0.000 description 2
- PWUMCBLVWPCKNO-MGHWNKPDSA-N Ile-Leu-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PWUMCBLVWPCKNO-MGHWNKPDSA-N 0.000 description 2
- GVNNAHIRSDRIII-AJNGGQMLSA-N Ile-Lys-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N GVNNAHIRSDRIII-AJNGGQMLSA-N 0.000 description 2
- AKOYRLRUFBZOSP-BJDJZHNGSA-N Ile-Lys-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)O)N AKOYRLRUFBZOSP-BJDJZHNGSA-N 0.000 description 2
- IALVDKNUFSTICJ-GMOBBJLQSA-N Ile-Met-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)O)C(=O)O)N IALVDKNUFSTICJ-GMOBBJLQSA-N 0.000 description 2
- XLXPYSDGMXTTNQ-DKIMLUQUSA-N Ile-Phe-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CC(C)C)C(O)=O XLXPYSDGMXTTNQ-DKIMLUQUSA-N 0.000 description 2
- XLXPYSDGMXTTNQ-UHFFFAOYSA-N Ile-Phe-Leu Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(CC(C)C)C(O)=O)CC1=CC=CC=C1 XLXPYSDGMXTTNQ-UHFFFAOYSA-N 0.000 description 2
- JODPUDMBQBIWCK-GHCJXIJMSA-N Ile-Ser-Asn Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O JODPUDMBQBIWCK-GHCJXIJMSA-N 0.000 description 2
- VGSPNSSCMOHRRR-BJDJZHNGSA-N Ile-Ser-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N VGSPNSSCMOHRRR-BJDJZHNGSA-N 0.000 description 2
- NAFIFZNBSPWYOO-RWRJDSDZSA-N Ile-Thr-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N NAFIFZNBSPWYOO-RWRJDSDZSA-N 0.000 description 2
- WXLYNEHOGRYNFU-URLPEUOOSA-N Ile-Thr-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N WXLYNEHOGRYNFU-URLPEUOOSA-N 0.000 description 2
- CZOAJJGXTGUYOJ-SPOWBLRKSA-N Ile-Trp-Cys Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)[C@@H](C)CC)C(=O)N[C@@H](CS)C(O)=O)=CNC2=C1 CZOAJJGXTGUYOJ-SPOWBLRKSA-N 0.000 description 2
- JSLIXOUMAOUGBN-JUKXBJQTSA-N Ile-Tyr-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N JSLIXOUMAOUGBN-JUKXBJQTSA-N 0.000 description 2
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 2
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 2
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 2
- REPPKAMYTOJTFC-DCAQKATOSA-N Leu-Arg-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O REPPKAMYTOJTFC-DCAQKATOSA-N 0.000 description 2
- JKGHDYGZRDWHGA-SRVKXCTJSA-N Leu-Asn-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JKGHDYGZRDWHGA-SRVKXCTJSA-N 0.000 description 2
- OGCQGUIWMSBHRZ-CIUDSAMLSA-N Leu-Asn-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O OGCQGUIWMSBHRZ-CIUDSAMLSA-N 0.000 description 2
- MMEDVBWCMGRKKC-GARJFASQSA-N Leu-Asp-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N MMEDVBWCMGRKKC-GARJFASQSA-N 0.000 description 2
- VQPPIMUZCZCOIL-GUBZILKMSA-N Leu-Gln-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O VQPPIMUZCZCOIL-GUBZILKMSA-N 0.000 description 2
- CIVKXGPFXDIQBV-WDCWCFNPSA-N Leu-Gln-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CIVKXGPFXDIQBV-WDCWCFNPSA-N 0.000 description 2
- HPBCTWSUJOGJSH-MNXVOIDGSA-N Leu-Glu-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HPBCTWSUJOGJSH-MNXVOIDGSA-N 0.000 description 2
- JRJLGNFWYFSJHB-HOCLYGCPSA-N Leu-Gly-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JRJLGNFWYFSJHB-HOCLYGCPSA-N 0.000 description 2
- KXODZBLFVFSLAI-AVGNSLFASA-N Leu-His-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CN=CN1 KXODZBLFVFSLAI-AVGNSLFASA-N 0.000 description 2
- OYQUOLRTJHWVSQ-SRVKXCTJSA-N Leu-His-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O OYQUOLRTJHWVSQ-SRVKXCTJSA-N 0.000 description 2
- OMHLATXVNQSALM-FQUUOJAGSA-N Leu-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(C)C)N OMHLATXVNQSALM-FQUUOJAGSA-N 0.000 description 2
- IAJFFZORSWOZPQ-SRVKXCTJSA-N Leu-Leu-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IAJFFZORSWOZPQ-SRVKXCTJSA-N 0.000 description 2
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 2
- UBZGNBKMIJHOHL-BZSNNMDCSA-N Leu-Leu-Phe Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 UBZGNBKMIJHOHL-BZSNNMDCSA-N 0.000 description 2
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 2
- IEWBEPKLKUXQBU-VOAKCMCISA-N Leu-Leu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IEWBEPKLKUXQBU-VOAKCMCISA-N 0.000 description 2
- HVHRPWQEQHIQJF-AVGNSLFASA-N Leu-Lys-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HVHRPWQEQHIQJF-AVGNSLFASA-N 0.000 description 2
- BGZCJDGBBUUBHA-KKUMJFAQSA-N Leu-Lys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O BGZCJDGBBUUBHA-KKUMJFAQSA-N 0.000 description 2
- RTIRBWJPYJYTLO-MELADBBJSA-N Leu-Lys-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N RTIRBWJPYJYTLO-MELADBBJSA-N 0.000 description 2
- AUNMOHYWTAPQLA-XUXIUFHCSA-N Leu-Met-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AUNMOHYWTAPQLA-XUXIUFHCSA-N 0.000 description 2
- DDVHDMSBLRAKNV-IHRRRGAJSA-N Leu-Met-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O DDVHDMSBLRAKNV-IHRRRGAJSA-N 0.000 description 2
- JVTYXRRFZCEPPK-RHYQMDGZSA-N Leu-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC(C)C)N)O JVTYXRRFZCEPPK-RHYQMDGZSA-N 0.000 description 2
- KQFZKDITNUEVFJ-JYJNAYRXSA-N Leu-Phe-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CC=CC=C1 KQFZKDITNUEVFJ-JYJNAYRXSA-N 0.000 description 2
- PJWOOBTYQNNRBF-BZSNNMDCSA-N Leu-Phe-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)O)N PJWOOBTYQNNRBF-BZSNNMDCSA-N 0.000 description 2
- FYPWFNKQVVEELI-ULQDDVLXSA-N Leu-Phe-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 FYPWFNKQVVEELI-ULQDDVLXSA-N 0.000 description 2
- XWEVVRRSIOBJOO-SRVKXCTJSA-N Leu-Pro-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O XWEVVRRSIOBJOO-SRVKXCTJSA-N 0.000 description 2
- UCBPDSYUVAAHCD-UWVGGRQHSA-N Leu-Pro-Gly Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UCBPDSYUVAAHCD-UWVGGRQHSA-N 0.000 description 2
- QONKWXNJRRNTBV-AVGNSLFASA-N Leu-Pro-Met Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)O)N QONKWXNJRRNTBV-AVGNSLFASA-N 0.000 description 2
- KZZCOWMDDXDKSS-CIUDSAMLSA-N Leu-Ser-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KZZCOWMDDXDKSS-CIUDSAMLSA-N 0.000 description 2
- JIHDFWWRYHSAQB-GUBZILKMSA-N Leu-Ser-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JIHDFWWRYHSAQB-GUBZILKMSA-N 0.000 description 2
- BRTVHXHCUSXYRI-CIUDSAMLSA-N Leu-Ser-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O BRTVHXHCUSXYRI-CIUDSAMLSA-N 0.000 description 2
- LJBVRCDPWOJOEK-PPCPHDFISA-N Leu-Thr-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LJBVRCDPWOJOEK-PPCPHDFISA-N 0.000 description 2
- DAYQSYGBCUKVKT-VOAKCMCISA-N Leu-Thr-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O DAYQSYGBCUKVKT-VOAKCMCISA-N 0.000 description 2
- URHJPNHRQMQGOZ-RHYQMDGZSA-N Leu-Thr-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O URHJPNHRQMQGOZ-RHYQMDGZSA-N 0.000 description 2
- FBNPMTNBFFAMMH-UHFFFAOYSA-N Leu-Val-Arg Natural products CC(C)CC(N)C(=O)NC(C(C)C)C(=O)NC(C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-UHFFFAOYSA-N 0.000 description 2
- XZNJZXJZBMBGGS-NHCYSSNCSA-N Leu-Val-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XZNJZXJZBMBGGS-NHCYSSNCSA-N 0.000 description 2
- AIMGJYMCTAABEN-GVXVVHGQSA-N Leu-Val-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIMGJYMCTAABEN-GVXVVHGQSA-N 0.000 description 2
- AAKRWBIIGKPOKQ-ONGXEEELSA-N Leu-Val-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AAKRWBIIGKPOKQ-ONGXEEELSA-N 0.000 description 2
- VKVDRTGWLVZJOM-DCAQKATOSA-N Leu-Val-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O VKVDRTGWLVZJOM-DCAQKATOSA-N 0.000 description 2
- FZIJIFCXUCZHOL-CIUDSAMLSA-N Lys-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN FZIJIFCXUCZHOL-CIUDSAMLSA-N 0.000 description 2
- JCFYLFOCALSNLQ-GUBZILKMSA-N Lys-Ala-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JCFYLFOCALSNLQ-GUBZILKMSA-N 0.000 description 2
- NFLFJGGKOHYZJF-BJDJZHNGSA-N Lys-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN NFLFJGGKOHYZJF-BJDJZHNGSA-N 0.000 description 2
- KNKHAVVBVXKOGX-JXUBOQSCSA-N Lys-Ala-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KNKHAVVBVXKOGX-JXUBOQSCSA-N 0.000 description 2
- SJNZALDHDUYDBU-IHRRRGAJSA-N Lys-Arg-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(O)=O SJNZALDHDUYDBU-IHRRRGAJSA-N 0.000 description 2
- WLCYCADOWRMSAJ-CIUDSAMLSA-N Lys-Asn-Cys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(O)=O WLCYCADOWRMSAJ-CIUDSAMLSA-N 0.000 description 2
- FACUGMGEFUEBTI-SRVKXCTJSA-N Lys-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCCCN FACUGMGEFUEBTI-SRVKXCTJSA-N 0.000 description 2
- HKCCVDWHHTVVPN-CIUDSAMLSA-N Lys-Asp-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O HKCCVDWHHTVVPN-CIUDSAMLSA-N 0.000 description 2
- IBQMEXQYZMVIFU-SRVKXCTJSA-N Lys-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCCN)N IBQMEXQYZMVIFU-SRVKXCTJSA-N 0.000 description 2
- QIJVAFLRMVBHMU-KKUMJFAQSA-N Lys-Asp-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QIJVAFLRMVBHMU-KKUMJFAQSA-N 0.000 description 2
- YVMQJGWLHRWMDF-MNXVOIDGSA-N Lys-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N YVMQJGWLHRWMDF-MNXVOIDGSA-N 0.000 description 2
- VEGLGAOVLFODGC-GUBZILKMSA-N Lys-Glu-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O VEGLGAOVLFODGC-GUBZILKMSA-N 0.000 description 2
- ULUQBUKAPDUKOC-GVXVVHGQSA-N Lys-Glu-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O ULUQBUKAPDUKOC-GVXVVHGQSA-N 0.000 description 2
- DKTNGXVSCZULPO-YUMQZZPRSA-N Lys-Gly-Cys Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CS)C(O)=O DKTNGXVSCZULPO-YUMQZZPRSA-N 0.000 description 2
- KNKJPYAZQUFLQK-IHRRRGAJSA-N Lys-His-Arg Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCCCN)N KNKJPYAZQUFLQK-IHRRRGAJSA-N 0.000 description 2
- GTAXSKOXPIISBW-AVGNSLFASA-N Lys-His-Gln Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N GTAXSKOXPIISBW-AVGNSLFASA-N 0.000 description 2
- SLQJJFAVWSZLBL-BJDJZHNGSA-N Lys-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN SLQJJFAVWSZLBL-BJDJZHNGSA-N 0.000 description 2
- QBEPTBMRQALPEV-MNXVOIDGSA-N Lys-Ile-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN QBEPTBMRQALPEV-MNXVOIDGSA-N 0.000 description 2
- ZXFRGTAIIZHNHG-AJNGGQMLSA-N Lys-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N ZXFRGTAIIZHNHG-AJNGGQMLSA-N 0.000 description 2
- ONPDTSFZAIWMDI-AVGNSLFASA-N Lys-Leu-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O ONPDTSFZAIWMDI-AVGNSLFASA-N 0.000 description 2
- XIZQPFCRXLUNMK-BZSNNMDCSA-N Lys-Leu-Phe Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCCCN)N XIZQPFCRXLUNMK-BZSNNMDCSA-N 0.000 description 2
- MGKFCQFVPKOWOL-CIUDSAMLSA-N Lys-Ser-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N MGKFCQFVPKOWOL-CIUDSAMLSA-N 0.000 description 2
- JOSAKOKSPXROGQ-BJDJZHNGSA-N Lys-Ser-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JOSAKOKSPXROGQ-BJDJZHNGSA-N 0.000 description 2
- IOQWIOPSKJOEKI-SRVKXCTJSA-N Lys-Ser-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IOQWIOPSKJOEKI-SRVKXCTJSA-N 0.000 description 2
- CNXOBMMOYZPPGS-NUTKFTJISA-N Lys-Trp-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(O)=O CNXOBMMOYZPPGS-NUTKFTJISA-N 0.000 description 2
- IMDJSVBFQKDDEQ-MGHWNKPDSA-N Lys-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCCCN)N IMDJSVBFQKDDEQ-MGHWNKPDSA-N 0.000 description 2
- SQRLLZAQNOQCEG-KKUMJFAQSA-N Lys-Tyr-Ser Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=C(O)C=C1 SQRLLZAQNOQCEG-KKUMJFAQSA-N 0.000 description 2
- USPJSTBDIGJPFK-PMVMPFDFSA-N Lys-Tyr-Trp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O USPJSTBDIGJPFK-PMVMPFDFSA-N 0.000 description 2
- DRRXXZBXDMLGFC-IHRRRGAJSA-N Lys-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN DRRXXZBXDMLGFC-IHRRRGAJSA-N 0.000 description 2
- VHGIWFGJIHTASW-FXQIFTODSA-N Met-Ala-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O VHGIWFGJIHTASW-FXQIFTODSA-N 0.000 description 2
- YNOVBMBQSQTLFM-DCAQKATOSA-N Met-Asn-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O YNOVBMBQSQTLFM-DCAQKATOSA-N 0.000 description 2
- XOMXAVJBLRROMC-IHRRRGAJSA-N Met-Asp-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XOMXAVJBLRROMC-IHRRRGAJSA-N 0.000 description 2
- RCMDUFDXDYTXOK-CIUDSAMLSA-N Met-Gln-Cys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CS)C(O)=O RCMDUFDXDYTXOK-CIUDSAMLSA-N 0.000 description 2
- OOSPRDCGTLQLBP-NHCYSSNCSA-N Met-Glu-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OOSPRDCGTLQLBP-NHCYSSNCSA-N 0.000 description 2
- SLQDSYZHHOKQSR-QXEWZRGKSA-N Met-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCSC SLQDSYZHHOKQSR-QXEWZRGKSA-N 0.000 description 2
- OSZTUONKUMCWEP-XUXIUFHCSA-N Met-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCSC OSZTUONKUMCWEP-XUXIUFHCSA-N 0.000 description 2
- DJJBHQHOZLUBCN-WDSOQIARSA-N Met-Lys-Trp Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O DJJBHQHOZLUBCN-WDSOQIARSA-N 0.000 description 2
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 2
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 2
- 108010087066 N2-tryptophyllysine Proteins 0.000 description 2
- 102000038030 PI3Ks Human genes 0.000 description 2
- 108091007960 PI3Ks Proteins 0.000 description 2
- BBDSZDHUCPSYAC-QEJZJMRPSA-N Phe-Ala-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BBDSZDHUCPSYAC-QEJZJMRPSA-N 0.000 description 2
- NOFBJKKOPKJDCO-KKXDTOCCSA-N Phe-Ala-Tyr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O NOFBJKKOPKJDCO-KKXDTOCCSA-N 0.000 description 2
- LGBVMDMZZFYSFW-HJWJTTGWSA-N Phe-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CC=CC=C1)N LGBVMDMZZFYSFW-HJWJTTGWSA-N 0.000 description 2
- AGYXCMYVTBYGCT-ULQDDVLXSA-N Phe-Arg-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O AGYXCMYVTBYGCT-ULQDDVLXSA-N 0.000 description 2
- HCTXJGRYAACKOB-SRVKXCTJSA-N Phe-Asn-Asp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HCTXJGRYAACKOB-SRVKXCTJSA-N 0.000 description 2
- RIYZXJVARWJLKS-KKUMJFAQSA-N Phe-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 RIYZXJVARWJLKS-KKUMJFAQSA-N 0.000 description 2
- WYPVCIACUMJRIB-JYJNAYRXSA-N Phe-Gln-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N WYPVCIACUMJRIB-JYJNAYRXSA-N 0.000 description 2
- WFHRXJOZEXUKLV-IRXDYDNUSA-N Phe-Gly-Tyr Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 WFHRXJOZEXUKLV-IRXDYDNUSA-N 0.000 description 2
- CWFGECHCRMGPPT-MXAVVETBSA-N Phe-Ile-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O CWFGECHCRMGPPT-MXAVVETBSA-N 0.000 description 2
- TXKWKTWYTIAZSV-KKUMJFAQSA-N Phe-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N TXKWKTWYTIAZSV-KKUMJFAQSA-N 0.000 description 2
- CMHTUJQZQXFNTQ-OEAJRASXSA-N Phe-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CC=CC=C1)N)O CMHTUJQZQXFNTQ-OEAJRASXSA-N 0.000 description 2
- GPLWGAYGROGDEN-BZSNNMDCSA-N Phe-Phe-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O GPLWGAYGROGDEN-BZSNNMDCSA-N 0.000 description 2
- CKJACGQPCPMWIT-UFYCRDLUSA-N Phe-Pro-Phe Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 CKJACGQPCPMWIT-UFYCRDLUSA-N 0.000 description 2
- NJJBATPLUQHRBM-IHRRRGAJSA-N Phe-Pro-Ser Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)N)C(=O)N[C@@H](CO)C(=O)O NJJBATPLUQHRBM-IHRRRGAJSA-N 0.000 description 2
- MVIJMIZJPHQGEN-IHRRRGAJSA-N Phe-Ser-Val Chemical compound CC(C)[C@@H](C([O-])=O)NC(=O)[C@H](CO)NC(=O)[C@@H]([NH3+])CC1=CC=CC=C1 MVIJMIZJPHQGEN-IHRRRGAJSA-N 0.000 description 2
- DBALDZKOTNSBFM-FXQIFTODSA-N Pro-Ala-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DBALDZKOTNSBFM-FXQIFTODSA-N 0.000 description 2
- QCARZLHECSFOGG-CIUDSAMLSA-N Pro-Glu-Cys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O QCARZLHECSFOGG-CIUDSAMLSA-N 0.000 description 2
- UEHYFUCOGHWASA-HJGDQZAQSA-N Pro-Glu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 UEHYFUCOGHWASA-HJGDQZAQSA-N 0.000 description 2
- ABSSTGUCBCDKMU-UWVGGRQHSA-N Pro-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H]1CCCN1 ABSSTGUCBCDKMU-UWVGGRQHSA-N 0.000 description 2
- VWHJZETTZDAGOM-XUXIUFHCSA-N Pro-Lys-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VWHJZETTZDAGOM-XUXIUFHCSA-N 0.000 description 2
- AWQGDZBKQTYNMN-IHRRRGAJSA-N Pro-Phe-Asp Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N[C@@H](CC(=O)O)C(=O)O AWQGDZBKQTYNMN-IHRRRGAJSA-N 0.000 description 2
- ZVEQWRWMRFIVSD-HRCADAONSA-N Pro-Phe-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N3CCC[C@@H]3C(=O)O ZVEQWRWMRFIVSD-HRCADAONSA-N 0.000 description 2
- SVXXJYJCRNKDDE-AVGNSLFASA-N Pro-Pro-His Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H]1N(CCC1)C(=O)[C@H]1NCCC1)C1=CN=CN1 SVXXJYJCRNKDDE-AVGNSLFASA-N 0.000 description 2
- DWPXHLIBFQLKLK-CYDGBPFRSA-N Pro-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 DWPXHLIBFQLKLK-CYDGBPFRSA-N 0.000 description 2
- CZCCVJUUWBMISW-FXQIFTODSA-N Pro-Ser-Cys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O CZCCVJUUWBMISW-FXQIFTODSA-N 0.000 description 2
- FDMCIBSQRKFSTJ-RHYQMDGZSA-N Pro-Thr-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O FDMCIBSQRKFSTJ-RHYQMDGZSA-N 0.000 description 2
- AJJDPGVVNPUZCR-RHYQMDGZSA-N Pro-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1)O AJJDPGVVNPUZCR-RHYQMDGZSA-N 0.000 description 2
- VEUACYMXJKXALX-IHRRRGAJSA-N Pro-Tyr-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O VEUACYMXJKXALX-IHRRRGAJSA-N 0.000 description 2
- ZAUHSLVPDLNTRZ-QXEWZRGKSA-N Pro-Val-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O ZAUHSLVPDLNTRZ-QXEWZRGKSA-N 0.000 description 2
- 108091028664 Ribonucleotide Proteins 0.000 description 2
- WDXYVIIVDIDOSX-DCAQKATOSA-N Ser-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N WDXYVIIVDIDOSX-DCAQKATOSA-N 0.000 description 2
- UCXDHBORXLVBNC-ZLUOBGJFSA-N Ser-Asn-Cys Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(O)=O UCXDHBORXLVBNC-ZLUOBGJFSA-N 0.000 description 2
- HEQPKICPPDOSIN-SRVKXCTJSA-N Ser-Asp-Tyr Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HEQPKICPPDOSIN-SRVKXCTJSA-N 0.000 description 2
- RFBKULCUBJAQFT-BIIVOSGPSA-N Ser-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CO)N)C(=O)O RFBKULCUBJAQFT-BIIVOSGPSA-N 0.000 description 2
- RNMRYWZYFHHOEV-CIUDSAMLSA-N Ser-Gln-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RNMRYWZYFHHOEV-CIUDSAMLSA-N 0.000 description 2
- FMDHKPRACUXATF-ACZMJKKPSA-N Ser-Gln-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O FMDHKPRACUXATF-ACZMJKKPSA-N 0.000 description 2
- SQBLRDDJTUJDMV-ACZMJKKPSA-N Ser-Glu-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O SQBLRDDJTUJDMV-ACZMJKKPSA-N 0.000 description 2
- LALNXSXEYFUUDD-GUBZILKMSA-N Ser-Glu-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LALNXSXEYFUUDD-GUBZILKMSA-N 0.000 description 2
- GZBKRJVCRMZAST-XKBZYTNZSA-N Ser-Glu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZBKRJVCRMZAST-XKBZYTNZSA-N 0.000 description 2
- MUARUIBTKQJKFY-WHFBIAKZSA-N Ser-Gly-Asp Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MUARUIBTKQJKFY-WHFBIAKZSA-N 0.000 description 2
- UAJAYRMZGNQILN-BQBZGAKWSA-N Ser-Gly-Met Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O UAJAYRMZGNQILN-BQBZGAKWSA-N 0.000 description 2
- UGHCUDLCCVVIJR-VGDYDELISA-N Ser-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CO)N UGHCUDLCCVVIJR-VGDYDELISA-N 0.000 description 2
- CAOYHZOWXFFAIR-CIUDSAMLSA-N Ser-His-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O CAOYHZOWXFFAIR-CIUDSAMLSA-N 0.000 description 2
- SFTZTYBXIXLRGQ-JBDRJPRFSA-N Ser-Ile-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O SFTZTYBXIXLRGQ-JBDRJPRFSA-N 0.000 description 2
- ZOPISOXXPQNOCO-SVSWQMSJSA-N Ser-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CO)N ZOPISOXXPQNOCO-SVSWQMSJSA-N 0.000 description 2
- NLOAIFSWUUFQFR-CIUDSAMLSA-N Ser-Leu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O NLOAIFSWUUFQFR-CIUDSAMLSA-N 0.000 description 2
- IAORETPTUDBBGV-CIUDSAMLSA-N Ser-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N IAORETPTUDBBGV-CIUDSAMLSA-N 0.000 description 2
- IUXGJEIKJBYKOO-SRVKXCTJSA-N Ser-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N IUXGJEIKJBYKOO-SRVKXCTJSA-N 0.000 description 2
- VMLONWHIORGALA-SRVKXCTJSA-N Ser-Leu-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CO VMLONWHIORGALA-SRVKXCTJSA-N 0.000 description 2
- UBRMZSHOOIVJPW-SRVKXCTJSA-N Ser-Leu-Lys Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O UBRMZSHOOIVJPW-SRVKXCTJSA-N 0.000 description 2
- KCGIREHVWRXNDH-GARJFASQSA-N Ser-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N KCGIREHVWRXNDH-GARJFASQSA-N 0.000 description 2
- GVMUJUPXFQFBBZ-GUBZILKMSA-N Ser-Lys-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GVMUJUPXFQFBBZ-GUBZILKMSA-N 0.000 description 2
- MQUZANJDFOQOBX-SRVKXCTJSA-N Ser-Phe-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O MQUZANJDFOQOBX-SRVKXCTJSA-N 0.000 description 2
- JLKWJWPDXPKKHI-FXQIFTODSA-N Ser-Pro-Asn Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CO)N)C(=O)N[C@@H](CC(=O)N)C(=O)O JLKWJWPDXPKKHI-FXQIFTODSA-N 0.000 description 2
- PJIQEIFXZPCWOJ-FXQIFTODSA-N Ser-Pro-Asp Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O PJIQEIFXZPCWOJ-FXQIFTODSA-N 0.000 description 2
- FKYWFUYPVKLJLP-DCAQKATOSA-N Ser-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO FKYWFUYPVKLJLP-DCAQKATOSA-N 0.000 description 2
- HHJFMHQYEAAOBM-ZLUOBGJFSA-N Ser-Ser-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O HHJFMHQYEAAOBM-ZLUOBGJFSA-N 0.000 description 2
- FZXOPYUEQGDGMS-ACZMJKKPSA-N Ser-Ser-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O FZXOPYUEQGDGMS-ACZMJKKPSA-N 0.000 description 2
- SRSPTFBENMJHMR-WHFBIAKZSA-N Ser-Ser-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SRSPTFBENMJHMR-WHFBIAKZSA-N 0.000 description 2
- JCLAFVNDBJMLBC-JBDRJPRFSA-N Ser-Ser-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JCLAFVNDBJMLBC-JBDRJPRFSA-N 0.000 description 2
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 2
- PCJLFYBAQZQOFE-KATARQTJSA-N Ser-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N)O PCJLFYBAQZQOFE-KATARQTJSA-N 0.000 description 2
- VEVYMLNYMULSMS-AVGNSLFASA-N Ser-Tyr-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O VEVYMLNYMULSMS-AVGNSLFASA-N 0.000 description 2
- OQSQCUWQOIHECT-YJRXYDGGSA-N Ser-Tyr-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OQSQCUWQOIHECT-YJRXYDGGSA-N 0.000 description 2
- IAOHCSQDQDWRQU-GUBZILKMSA-N Ser-Val-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IAOHCSQDQDWRQU-GUBZILKMSA-N 0.000 description 2
- ANOQEBQWIAYIMV-AEJSXWLSSA-N Ser-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N ANOQEBQWIAYIMV-AEJSXWLSSA-N 0.000 description 2
- JGUWRQWULDWNCM-FXQIFTODSA-N Ser-Val-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O JGUWRQWULDWNCM-FXQIFTODSA-N 0.000 description 2
- 206010043189 Telangiectasia Diseases 0.000 description 2
- XSLXHSYIVPGEER-KZVJFYERSA-N Thr-Ala-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O XSLXHSYIVPGEER-KZVJFYERSA-N 0.000 description 2
- VIBXMCZWVUOZLA-OLHMAJIHSA-N Thr-Asn-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O VIBXMCZWVUOZLA-OLHMAJIHSA-N 0.000 description 2
- TZKPNGDGUVREEB-FOHZUACHSA-N Thr-Asn-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O TZKPNGDGUVREEB-FOHZUACHSA-N 0.000 description 2
- JEDIEMIJYSRUBB-FOHZUACHSA-N Thr-Asp-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O JEDIEMIJYSRUBB-FOHZUACHSA-N 0.000 description 2
- NLSNVZAREYQMGR-HJGDQZAQSA-N Thr-Asp-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NLSNVZAREYQMGR-HJGDQZAQSA-N 0.000 description 2
- RKDFEMGVMMYYNG-WDCWCFNPSA-N Thr-Gln-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O RKDFEMGVMMYYNG-WDCWCFNPSA-N 0.000 description 2
- XXNLGZRRSKPSGF-HTUGSXCWSA-N Thr-Gln-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O XXNLGZRRSKPSGF-HTUGSXCWSA-N 0.000 description 2
- BNGDYRRHRGOPHX-IFFSRLJSSA-N Thr-Glu-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O BNGDYRRHRGOPHX-IFFSRLJSSA-N 0.000 description 2
- VTVVYQOXJCZVEB-WDCWCFNPSA-N Thr-Leu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O VTVVYQOXJCZVEB-WDCWCFNPSA-N 0.000 description 2
- FLPZMPOZGYPBEN-PPCPHDFISA-N Thr-Leu-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FLPZMPOZGYPBEN-PPCPHDFISA-N 0.000 description 2
- KZSYAEWQMJEGRZ-RHYQMDGZSA-N Thr-Leu-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O KZSYAEWQMJEGRZ-RHYQMDGZSA-N 0.000 description 2
- SPVHQURZJCUDQC-VOAKCMCISA-N Thr-Lys-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O SPVHQURZJCUDQC-VOAKCMCISA-N 0.000 description 2
- MGJLBZFUXUGMML-VOAKCMCISA-N Thr-Lys-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MGJLBZFUXUGMML-VOAKCMCISA-N 0.000 description 2
- YGZWVPBHYABGLT-KJEVXHAQSA-N Thr-Pro-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 YGZWVPBHYABGLT-KJEVXHAQSA-N 0.000 description 2
- NQQMWWVVGIXUOX-SVSWQMSJSA-N Thr-Ser-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NQQMWWVVGIXUOX-SVSWQMSJSA-N 0.000 description 2
- NDZYTIMDOZMECO-SHGPDSBTSA-N Thr-Thr-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O NDZYTIMDOZMECO-SHGPDSBTSA-N 0.000 description 2
- MFMGPEKYBXFIRF-SUSMZKCASA-N Thr-Thr-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MFMGPEKYBXFIRF-SUSMZKCASA-N 0.000 description 2
- OMRWDMWXRWTQIU-YJRXYDGGSA-N Thr-Tyr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CS)C(=O)O)N)O OMRWDMWXRWTQIU-YJRXYDGGSA-N 0.000 description 2
- PWONLXBUSVIZPH-RHYQMDGZSA-N Thr-Val-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O PWONLXBUSVIZPH-RHYQMDGZSA-N 0.000 description 2
- QNXZCKMXHPULME-ZNSHCXBVSA-N Thr-Val-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O QNXZCKMXHPULME-ZNSHCXBVSA-N 0.000 description 2
- PNKDNKGMEHJTJQ-BPUTZDHNSA-N Trp-Arg-Asn Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N PNKDNKGMEHJTJQ-BPUTZDHNSA-N 0.000 description 2
- TWJDQTTXXZDJKV-BPUTZDHNSA-N Trp-Arg-Ser Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O TWJDQTTXXZDJKV-BPUTZDHNSA-N 0.000 description 2
- IUFQHOCOKQIOMC-XIRDDKMYSA-N Trp-Asn-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N IUFQHOCOKQIOMC-XIRDDKMYSA-N 0.000 description 2
- ULHASJWZGUEUNN-XIRDDKMYSA-N Trp-Lys-Ser Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O ULHASJWZGUEUNN-XIRDDKMYSA-N 0.000 description 2
- JTMZSIRTZKLBOA-NWLDYVSISA-N Trp-Thr-Gln Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O JTMZSIRTZKLBOA-NWLDYVSISA-N 0.000 description 2
- HVHJYXDXRIWELT-RYUDHWBXSA-N Tyr-Glu-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O HVHJYXDXRIWELT-RYUDHWBXSA-N 0.000 description 2
- UBKKNELWDCBNCF-STQMWFEESA-N Tyr-Met-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 UBKKNELWDCBNCF-STQMWFEESA-N 0.000 description 2
- WTTRJMAZPDHPGS-KKXDTOCCSA-N Tyr-Phe-Ala Chemical compound C[C@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](N)Cc1ccc(O)cc1)C(O)=O WTTRJMAZPDHPGS-KKXDTOCCSA-N 0.000 description 2
- CDBXVDXSLPLFMD-BPNCWPANSA-N Tyr-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=C(O)C=C1 CDBXVDXSLPLFMD-BPNCWPANSA-N 0.000 description 2
- XJPXTYLVMUZGNW-IHRRRGAJSA-N Tyr-Pro-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O XJPXTYLVMUZGNW-IHRRRGAJSA-N 0.000 description 2
- UMSZZGTXGKHTFJ-SRVKXCTJSA-N Tyr-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 UMSZZGTXGKHTFJ-SRVKXCTJSA-N 0.000 description 2
- KKHRWGYHBZORMQ-NHCYSSNCSA-N Val-Arg-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKHRWGYHBZORMQ-NHCYSSNCSA-N 0.000 description 2
- XQVRMLRMTAGSFJ-QXEWZRGKSA-N Val-Asp-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N XQVRMLRMTAGSFJ-QXEWZRGKSA-N 0.000 description 2
- VXCAZHCVDBQMTP-NRPADANISA-N Val-Cys-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VXCAZHCVDBQMTP-NRPADANISA-N 0.000 description 2
- VVZDBPBZHLQPPB-XVKPBYJWSA-N Val-Glu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VVZDBPBZHLQPPB-XVKPBYJWSA-N 0.000 description 2
- ROLGIBMFNMZANA-GVXVVHGQSA-N Val-Glu-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N ROLGIBMFNMZANA-GVXVVHGQSA-N 0.000 description 2
- SDSCOOZQQGUQFC-GVXVVHGQSA-N Val-His-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N SDSCOOZQQGUQFC-GVXVVHGQSA-N 0.000 description 2
- CPGJELLYDQEDRK-NAKRPEOUSA-N Val-Ile-Ala Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](C)C(O)=O CPGJELLYDQEDRK-NAKRPEOUSA-N 0.000 description 2
- UKEVLVBHRKWECS-LSJOCFKGSA-N Val-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](C(C)C)N UKEVLVBHRKWECS-LSJOCFKGSA-N 0.000 description 2
- BZWUSZGQOILYEU-STECZYCISA-N Val-Ile-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 BZWUSZGQOILYEU-STECZYCISA-N 0.000 description 2
- OTJMMKPMLUNTQT-AVGNSLFASA-N Val-Leu-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N OTJMMKPMLUNTQT-AVGNSLFASA-N 0.000 description 2
- UMPVMAYCLYMYGA-ONGXEEELSA-N Val-Leu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O UMPVMAYCLYMYGA-ONGXEEELSA-N 0.000 description 2
- JVGHIFMSFBZDHH-WPRPVWTQSA-N Val-Met-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)NCC(=O)O)N JVGHIFMSFBZDHH-WPRPVWTQSA-N 0.000 description 2
- VNGKMNPAENRGDC-JYJNAYRXSA-N Val-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=CC=C1 VNGKMNPAENRGDC-JYJNAYRXSA-N 0.000 description 2
- NZGOVKLVQNOEKP-YDHLFZDLSA-N Val-Phe-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N NZGOVKLVQNOEKP-YDHLFZDLSA-N 0.000 description 2
- MHHAWNPHDLCPLF-ULQDDVLXSA-N Val-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=CC=C1 MHHAWNPHDLCPLF-ULQDDVLXSA-N 0.000 description 2
- SJRUJQFQVLMZFW-WPRPVWTQSA-N Val-Pro-Gly Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O SJRUJQFQVLMZFW-WPRPVWTQSA-N 0.000 description 2
- NSUUANXHLKKHQB-BZSNNMDCSA-N Val-Pro-Trp Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CNC2=CC=CC=C12 NSUUANXHLKKHQB-BZSNNMDCSA-N 0.000 description 2
- VHIZXDZMTDVFGX-DCAQKATOSA-N Val-Ser-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N VHIZXDZMTDVFGX-DCAQKATOSA-N 0.000 description 2
- RLVTVHSDKHBFQP-ULQDDVLXSA-N Val-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=C(O)C=C1 RLVTVHSDKHBFQP-ULQDDVLXSA-N 0.000 description 2
- PGBMPFKFKXYROZ-UFYCRDLUSA-N Val-Tyr-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)N PGBMPFKFKXYROZ-UFYCRDLUSA-N 0.000 description 2
- 230000002159 abnormal effect Effects 0.000 description 2
- 108010070944 alanylhistidine Proteins 0.000 description 2
- 238000004458 analytical method Methods 0.000 description 2
- 108010091092 arginyl-glycyl-proline Proteins 0.000 description 2
- 108010060035 arginylproline Proteins 0.000 description 2
- 208000021024 autosomal recessive inheritance Diseases 0.000 description 2
- 231100000005 chromosome aberration Toxicity 0.000 description 2
- 210000000795 conjunctiva Anatomy 0.000 description 2
- 108010016616 cysteinylglycine Proteins 0.000 description 2
- 238000012350 deep sequencing Methods 0.000 description 2
- 239000005547 deoxyribonucleotide Substances 0.000 description 2
- 125000002637 deoxyribonucleotide group Chemical group 0.000 description 2
- 238000010586 diagram Methods 0.000 description 2
- 239000012154 double-distilled water Substances 0.000 description 2
- 230000000694 effects Effects 0.000 description 2
- 239000012149 elution buffer Substances 0.000 description 2
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 2
- 108010008237 glutamyl-valyl-glycine Proteins 0.000 description 2
- 108010049041 glutamylalanine Proteins 0.000 description 2
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 2
- 108010082286 glycyl-seryl-alanine Proteins 0.000 description 2
- 108010081551 glycylphenylalanine Proteins 0.000 description 2
- 108010028295 histidylhistidine Proteins 0.000 description 2
- 238000011835 investigation Methods 0.000 description 2
- 108010030617 leucyl-phenylalanyl-valine Proteins 0.000 description 2
- 108010000761 leucylarginine Proteins 0.000 description 2
- 108010091871 leucylmethionine Proteins 0.000 description 2
- 239000007788 liquid Substances 0.000 description 2
- 239000006166 lysate Substances 0.000 description 2
- 108010025153 lysyl-alanyl-alanine Proteins 0.000 description 2
- 210000004962 mammalian cell Anatomy 0.000 description 2
- 239000000463 material Substances 0.000 description 2
- 108020004999 messenger RNA Proteins 0.000 description 2
- 239000000203 mixture Substances 0.000 description 2
- 108010018625 phenylalanylarginine Proteins 0.000 description 2
- 108010073101 phenylalanylleucine Proteins 0.000 description 2
- 108010051242 phenylalanylserine Proteins 0.000 description 2
- 108010083476 phenylalanyltryptophan Proteins 0.000 description 2
- 229920000642 polymer Polymers 0.000 description 2
- 239000002336 ribonucleotide Substances 0.000 description 2
- 125000002652 ribonucleotide group Chemical group 0.000 description 2
- 108010071207 serylmethionine Proteins 0.000 description 2
- 230000037436 splice-site mutation Effects 0.000 description 2
- 230000002269 spontaneous effect Effects 0.000 description 2
- 238000003786 synthesis reaction Methods 0.000 description 2
- 208000009056 telangiectasis Diseases 0.000 description 2
- 108010031491 threonyl-lysyl-glutamic acid Proteins 0.000 description 2
- 108010080629 tryptophan-leucine Proteins 0.000 description 2
- 108010020532 tyrosyl-proline Proteins 0.000 description 2
- 108010015385 valyl-prolyl-proline Proteins 0.000 description 2
- 108010009962 valyltyrosine Proteins 0.000 description 2
- 239000011534 wash buffer Substances 0.000 description 2
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 2
- 101150028074 2 gene Proteins 0.000 description 1
- 101150058750 ALB gene Proteins 0.000 description 1
- HHGYNJRJIINWAK-FXQIFTODSA-N Ala-Ala-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N HHGYNJRJIINWAK-FXQIFTODSA-N 0.000 description 1
- SDMAQFGBPOJFOM-GUBZILKMSA-N Ala-Arg-Arg Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SDMAQFGBPOJFOM-GUBZILKMSA-N 0.000 description 1
- WXERCAHAIKMTKX-ZLUOBGJFSA-N Ala-Asp-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O WXERCAHAIKMTKX-ZLUOBGJFSA-N 0.000 description 1
- MVBWLRJESQOQTM-ACZMJKKPSA-N Ala-Gln-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O MVBWLRJESQOQTM-ACZMJKKPSA-N 0.000 description 1
- YIGLXQRFQVWFEY-NRPADANISA-N Ala-Gln-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O YIGLXQRFQVWFEY-NRPADANISA-N 0.000 description 1
- NHLAEBFGWPXFGI-WHFBIAKZSA-N Ala-Gly-Asn Chemical compound C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N NHLAEBFGWPXFGI-WHFBIAKZSA-N 0.000 description 1
- LMFXXZPPZDCPTA-ZKWXMUAHSA-N Ala-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N LMFXXZPPZDCPTA-ZKWXMUAHSA-N 0.000 description 1
- GSHKMNKPMLXSQW-KBIXCLLPSA-N Ala-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C)N GSHKMNKPMLXSQW-KBIXCLLPSA-N 0.000 description 1
- SUMYEVXWCAYLLJ-GUBZILKMSA-N Ala-Leu-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O SUMYEVXWCAYLLJ-GUBZILKMSA-N 0.000 description 1
- AWZKCUCQJNTBAD-SRVKXCTJSA-N Ala-Leu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN AWZKCUCQJNTBAD-SRVKXCTJSA-N 0.000 description 1
- SOBIAADAMRHGKH-CIUDSAMLSA-N Ala-Leu-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SOBIAADAMRHGKH-CIUDSAMLSA-N 0.000 description 1
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 1
- MDNAVFBZPROEHO-DCAQKATOSA-N Ala-Lys-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MDNAVFBZPROEHO-DCAQKATOSA-N 0.000 description 1
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 1
- CNQAFFMNJIQYGX-DRZSPHRISA-N Ala-Phe-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 CNQAFFMNJIQYGX-DRZSPHRISA-N 0.000 description 1
- ZBLQIYPCUWZSRZ-QEJZJMRPSA-N Ala-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 ZBLQIYPCUWZSRZ-QEJZJMRPSA-N 0.000 description 1
- HCBKAOZYACJUEF-XQXXSGGOSA-N Ala-Thr-Gln Chemical compound N[C@@H](C)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCC(N)=O)C(=O)O HCBKAOZYACJUEF-XQXXSGGOSA-N 0.000 description 1
- SAHQGRZIQVEJPF-JXUBOQSCSA-N Ala-Thr-Lys Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCCN SAHQGRZIQVEJPF-JXUBOQSCSA-N 0.000 description 1
- KTXKIYXZQFWJKB-VZFHVOOUSA-N Ala-Thr-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O KTXKIYXZQFWJKB-VZFHVOOUSA-N 0.000 description 1
- GCTANJIJJROSLH-GVARAGBVSA-N Ala-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C)N GCTANJIJJROSLH-GVARAGBVSA-N 0.000 description 1
- VBFJESQBIWCWRL-DCAQKATOSA-N Arg-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCNC(N)=N VBFJESQBIWCWRL-DCAQKATOSA-N 0.000 description 1
- GIVATXIGCXFQQA-FXQIFTODSA-N Arg-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N GIVATXIGCXFQQA-FXQIFTODSA-N 0.000 description 1
- MUXONAMCEUBVGA-DCAQKATOSA-N Arg-Arg-Gln Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(N)=O)C(O)=O MUXONAMCEUBVGA-DCAQKATOSA-N 0.000 description 1
- NONSEUUPKITYQT-BQBZGAKWSA-N Arg-Asn-Gly Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N)CN=C(N)N NONSEUUPKITYQT-BQBZGAKWSA-N 0.000 description 1
- KMSHNDWHPWXPEC-BQBZGAKWSA-N Arg-Asp-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KMSHNDWHPWXPEC-BQBZGAKWSA-N 0.000 description 1
- AHPWQERCDZTTNB-FXQIFTODSA-N Arg-Cys-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CS)C(=O)O)N)CN=C(N)N AHPWQERCDZTTNB-FXQIFTODSA-N 0.000 description 1
- QAXCZGMLVICQKS-SRVKXCTJSA-N Arg-Glu-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N QAXCZGMLVICQKS-SRVKXCTJSA-N 0.000 description 1
- NKBQZKVMKJJDLX-SRVKXCTJSA-N Arg-Glu-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NKBQZKVMKJJDLX-SRVKXCTJSA-N 0.000 description 1
- NXDXECQFKHXHAM-HJGDQZAQSA-N Arg-Glu-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NXDXECQFKHXHAM-HJGDQZAQSA-N 0.000 description 1
- LLUGJARLJCGLAR-CYDGBPFRSA-N Arg-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N LLUGJARLJCGLAR-CYDGBPFRSA-N 0.000 description 1
- LVMUGODRNHFGRA-AVGNSLFASA-N Arg-Leu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O LVMUGODRNHFGRA-AVGNSLFASA-N 0.000 description 1
- FSNVAJOPUDVQAR-AVGNSLFASA-N Arg-Lys-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FSNVAJOPUDVQAR-AVGNSLFASA-N 0.000 description 1
- AOHKLEBWKMKITA-IHRRRGAJSA-N Arg-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N AOHKLEBWKMKITA-IHRRRGAJSA-N 0.000 description 1
- LRPZJPMQGKGHSG-XGEHTFHBSA-N Arg-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N)O LRPZJPMQGKGHSG-XGEHTFHBSA-N 0.000 description 1
- MOGMYRUNTKYZFB-UNQGMJICSA-N Arg-Thr-Phe Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MOGMYRUNTKYZFB-UNQGMJICSA-N 0.000 description 1
- XRNXPIGJPQHCPC-RCWTZXSCSA-N Arg-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCNC(N)=N)[C@@H](C)O)C(O)=O XRNXPIGJPQHCPC-RCWTZXSCSA-N 0.000 description 1
- UVTGNSWSRSCPLP-UHFFFAOYSA-N Arg-Tyr Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O UVTGNSWSRSCPLP-UHFFFAOYSA-N 0.000 description 1
- LJUOLNXOWSWGKF-ACZMJKKPSA-N Asn-Asn-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N LJUOLNXOWSWGKF-ACZMJKKPSA-N 0.000 description 1
- DAPLJWATMAXPPZ-CIUDSAMLSA-N Asn-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(N)=O DAPLJWATMAXPPZ-CIUDSAMLSA-N 0.000 description 1
- JZRLLSOWDYUKOK-SRVKXCTJSA-N Asn-Asp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N JZRLLSOWDYUKOK-SRVKXCTJSA-N 0.000 description 1
- UBKOVSLDWIHYSY-ACZMJKKPSA-N Asn-Glu-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O UBKOVSLDWIHYSY-ACZMJKKPSA-N 0.000 description 1
- XLHLPYFMXGOASD-CIUDSAMLSA-N Asn-His-Asp Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N XLHLPYFMXGOASD-CIUDSAMLSA-N 0.000 description 1
- SUEIIIFUBHDCCS-PBCZWWQYSA-N Asn-His-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SUEIIIFUBHDCCS-PBCZWWQYSA-N 0.000 description 1
- GQRDIVQPSMPQME-ZPFDUUQYSA-N Asn-Ile-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O GQRDIVQPSMPQME-ZPFDUUQYSA-N 0.000 description 1
- DJIMLSXHXKWADV-CIUDSAMLSA-N Asn-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(N)=O DJIMLSXHXKWADV-CIUDSAMLSA-N 0.000 description 1
- HPBNLFLSSQDFQW-WHFBIAKZSA-N Asn-Ser-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O HPBNLFLSSQDFQW-WHFBIAKZSA-N 0.000 description 1
- JBDLMLZNDRLDIX-HJGDQZAQSA-N Asn-Thr-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O JBDLMLZNDRLDIX-HJGDQZAQSA-N 0.000 description 1
- NECWUSYTYSIFNC-DLOVCJGASA-N Asp-Ala-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 NECWUSYTYSIFNC-DLOVCJGASA-N 0.000 description 1
- WSOKZUVWBXVJHX-CIUDSAMLSA-N Asp-Arg-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O WSOKZUVWBXVJHX-CIUDSAMLSA-N 0.000 description 1
- RYKWOUUZJFSJOH-FXQIFTODSA-N Asp-Gln-Glu Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N RYKWOUUZJFSJOH-FXQIFTODSA-N 0.000 description 1
- CSEJMKNZDCJYGJ-XHNCKOQMSA-N Asp-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N)C(=O)O CSEJMKNZDCJYGJ-XHNCKOQMSA-N 0.000 description 1
- PDECQIHABNQRHN-GUBZILKMSA-N Asp-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(O)=O PDECQIHABNQRHN-GUBZILKMSA-N 0.000 description 1
- OVPHVTCDVYYTHN-AVGNSLFASA-N Asp-Glu-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OVPHVTCDVYYTHN-AVGNSLFASA-N 0.000 description 1
- DTNUIAJCPRMNBT-WHFBIAKZSA-N Asp-Gly-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O DTNUIAJCPRMNBT-WHFBIAKZSA-N 0.000 description 1
- CMCIMCAQIULNDJ-CIUDSAMLSA-N Asp-His-Cys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N CMCIMCAQIULNDJ-CIUDSAMLSA-N 0.000 description 1
- UBPMOJLRVMGTOQ-GARJFASQSA-N Asp-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC(=O)O)N)C(=O)O UBPMOJLRVMGTOQ-GARJFASQSA-N 0.000 description 1
- LDLZOAJRXXBVGF-GMOBBJLQSA-N Asp-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)O)N LDLZOAJRXXBVGF-GMOBBJLQSA-N 0.000 description 1
- VSMYBNPOHYAXSD-GUBZILKMSA-N Asp-Lys-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O VSMYBNPOHYAXSD-GUBZILKMSA-N 0.000 description 1
- UAXIKORUDGGIGA-DCAQKATOSA-N Asp-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)O)N)C(=O)N[C@@H](CCCCN)C(=O)O UAXIKORUDGGIGA-DCAQKATOSA-N 0.000 description 1
- QOCFFCUFZGDHTP-NUMRIWBASA-N Asp-Thr-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O QOCFFCUFZGDHTP-NUMRIWBASA-N 0.000 description 1
- LEYKQPDPZJIRTA-AQZXSJQPSA-N Asp-Trp-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LEYKQPDPZJIRTA-AQZXSJQPSA-N 0.000 description 1
- 206010008025 Cerebellar ataxia Diseases 0.000 description 1
- 206010008027 Cerebellar atrophy Diseases 0.000 description 1
- 108020004705 Codon Proteins 0.000 description 1
- QADHATDBZXHRCA-ACZMJKKPSA-N Cys-Gln-Asn Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CS)N QADHATDBZXHRCA-ACZMJKKPSA-N 0.000 description 1
- WZZGXXNRSZIQFC-VGDYDELISA-N Cys-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CS)N WZZGXXNRSZIQFC-VGDYDELISA-N 0.000 description 1
- BLGNLNRBABWDST-CIUDSAMLSA-N Cys-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N BLGNLNRBABWDST-CIUDSAMLSA-N 0.000 description 1
- WVLZTXGTNGHPBO-SRVKXCTJSA-N Cys-Leu-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O WVLZTXGTNGHPBO-SRVKXCTJSA-N 0.000 description 1
- OZHXXYOHPLLLMI-CIUDSAMLSA-N Cys-Lys-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OZHXXYOHPLLLMI-CIUDSAMLSA-N 0.000 description 1
- MBRWOKXNHTUJMB-CIUDSAMLSA-N Cys-Pro-Glu Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O MBRWOKXNHTUJMB-CIUDSAMLSA-N 0.000 description 1
- GGRDJANMZPGMNS-CIUDSAMLSA-N Cys-Ser-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O GGRDJANMZPGMNS-CIUDSAMLSA-N 0.000 description 1
- ALNKNYKSZPSLBD-ZDLURKLDSA-N Cys-Thr-Gly Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O ALNKNYKSZPSLBD-ZDLURKLDSA-N 0.000 description 1
- AZDQAZRURQMSQD-XPUUQOCRSA-N Cys-Val-Gly Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AZDQAZRURQMSQD-XPUUQOCRSA-N 0.000 description 1
- YQEHNIKPAOPBNH-DCAQKATOSA-N Cys-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CS)N YQEHNIKPAOPBNH-DCAQKATOSA-N 0.000 description 1
- 230000005778 DNA damage Effects 0.000 description 1
- 231100000277 DNA damage Toxicity 0.000 description 1
- 230000005971 DNA damage repair Effects 0.000 description 1
- 230000004543 DNA replication Effects 0.000 description 1
- 206010011878 Deafness Diseases 0.000 description 1
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 1
- 208000034951 Genetic Translocation Diseases 0.000 description 1
- YJIUYQKQBBQYHZ-ACZMJKKPSA-N Gln-Ala-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YJIUYQKQBBQYHZ-ACZMJKKPSA-N 0.000 description 1
- LKUWAWGNJYJODH-KBIXCLLPSA-N Gln-Ala-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LKUWAWGNJYJODH-KBIXCLLPSA-N 0.000 description 1
- DLOHWQXXGMEZDW-CIUDSAMLSA-N Gln-Arg-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O DLOHWQXXGMEZDW-CIUDSAMLSA-N 0.000 description 1
- JFOKLAPFYCTNHW-SRVKXCTJSA-N Gln-Arg-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)N)N JFOKLAPFYCTNHW-SRVKXCTJSA-N 0.000 description 1
- DTMLKCYOQKZXKZ-HJGDQZAQSA-N Gln-Arg-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DTMLKCYOQKZXKZ-HJGDQZAQSA-N 0.000 description 1
- QYTKAVBFRUGYAU-ACZMJKKPSA-N Gln-Asp-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O QYTKAVBFRUGYAU-ACZMJKKPSA-N 0.000 description 1
- XEYMBRRKIFYQMF-GUBZILKMSA-N Gln-Asp-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O XEYMBRRKIFYQMF-GUBZILKMSA-N 0.000 description 1
- CXFUMJQFZVCETK-FXQIFTODSA-N Gln-Cys-Gln Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(O)=O CXFUMJQFZVCETK-FXQIFTODSA-N 0.000 description 1
- NKCZYEDZTKOFBG-GUBZILKMSA-N Gln-Gln-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NKCZYEDZTKOFBG-GUBZILKMSA-N 0.000 description 1
- DDNIZQDYXDENIT-FXQIFTODSA-N Gln-Glu-Cys Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N DDNIZQDYXDENIT-FXQIFTODSA-N 0.000 description 1
- VGTDBGYFVWOQTI-RYUDHWBXSA-N Gln-Gly-Phe Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VGTDBGYFVWOQTI-RYUDHWBXSA-N 0.000 description 1
- QBLMTCRYYTVUQY-GUBZILKMSA-N Gln-Leu-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O QBLMTCRYYTVUQY-GUBZILKMSA-N 0.000 description 1
- XFAUJGNLHIGXET-AVGNSLFASA-N Gln-Leu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XFAUJGNLHIGXET-AVGNSLFASA-N 0.000 description 1
- ZBKUIQNCRIYVGH-SDDRHHMPSA-N Gln-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N ZBKUIQNCRIYVGH-SDDRHHMPSA-N 0.000 description 1
- YPMDZWPZFOZYFG-GUBZILKMSA-N Gln-Leu-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YPMDZWPZFOZYFG-GUBZILKMSA-N 0.000 description 1
- KSKFIECUYMYWNS-AVGNSLFASA-N Gln-Lys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)N)N KSKFIECUYMYWNS-AVGNSLFASA-N 0.000 description 1
- JRHPEMVLTRADLJ-AVGNSLFASA-N Gln-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JRHPEMVLTRADLJ-AVGNSLFASA-N 0.000 description 1
- DOMHVQBSRJNNKD-ZPFDUUQYSA-N Gln-Met-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DOMHVQBSRJNNKD-ZPFDUUQYSA-N 0.000 description 1
- RWCBJYUPAUTWJD-NHCYSSNCSA-N Gln-Met-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O RWCBJYUPAUTWJD-NHCYSSNCSA-N 0.000 description 1
- KUBFPYIMAGXGBT-ACZMJKKPSA-N Gln-Ser-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KUBFPYIMAGXGBT-ACZMJKKPSA-N 0.000 description 1
- LPIKVBWNNVFHCQ-GUBZILKMSA-N Gln-Ser-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LPIKVBWNNVFHCQ-GUBZILKMSA-N 0.000 description 1
- ARYKRXHBIPLULY-XKBZYTNZSA-N Gln-Thr-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ARYKRXHBIPLULY-XKBZYTNZSA-N 0.000 description 1
- ICRKQMRFXYDYMK-LAEOZQHASA-N Gln-Val-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O ICRKQMRFXYDYMK-LAEOZQHASA-N 0.000 description 1
- NCWOMXABNYEPLY-NRPADANISA-N Glu-Ala-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O NCWOMXABNYEPLY-NRPADANISA-N 0.000 description 1
- AVZHGSCDKIQZPQ-CIUDSAMLSA-N Glu-Arg-Ala Chemical compound C[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O AVZHGSCDKIQZPQ-CIUDSAMLSA-N 0.000 description 1
- LXAUHIRMWXQRKI-XHNCKOQMSA-N Glu-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N)C(=O)O LXAUHIRMWXQRKI-XHNCKOQMSA-N 0.000 description 1
- QPRZKNOOOBWXSU-CIUDSAMLSA-N Glu-Asp-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N QPRZKNOOOBWXSU-CIUDSAMLSA-N 0.000 description 1
- XKPOCESCRTVRPL-KBIXCLLPSA-N Glu-Cys-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XKPOCESCRTVRPL-KBIXCLLPSA-N 0.000 description 1
- QJCKNLPMTPXXEM-AUTRQRHGSA-N Glu-Glu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O QJCKNLPMTPXXEM-AUTRQRHGSA-N 0.000 description 1
- LRPXYSGPOBVBEH-IUCAKERBSA-N Glu-Gly-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O LRPXYSGPOBVBEH-IUCAKERBSA-N 0.000 description 1
- RAUDKMVXNOWDLS-WDSKDSINSA-N Glu-Gly-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O RAUDKMVXNOWDLS-WDSKDSINSA-N 0.000 description 1
- HPJLZFTUUJKWAJ-JHEQGTHGSA-N Glu-Gly-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HPJLZFTUUJKWAJ-JHEQGTHGSA-N 0.000 description 1
- ZJFNRQHUIHKZJF-GUBZILKMSA-N Glu-His-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O ZJFNRQHUIHKZJF-GUBZILKMSA-N 0.000 description 1
- BIHMNDPWRUROFZ-JYJNAYRXSA-N Glu-His-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BIHMNDPWRUROFZ-JYJNAYRXSA-N 0.000 description 1
- PJBVXVBTTFZPHJ-GUBZILKMSA-N Glu-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)O)N PJBVXVBTTFZPHJ-GUBZILKMSA-N 0.000 description 1
- FBEJIDRSQCGFJI-GUBZILKMSA-N Glu-Leu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FBEJIDRSQCGFJI-GUBZILKMSA-N 0.000 description 1
- OQXDUSZKISQQSS-GUBZILKMSA-N Glu-Lys-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OQXDUSZKISQQSS-GUBZILKMSA-N 0.000 description 1
- BCYGDJXHAGZNPQ-DCAQKATOSA-N Glu-Lys-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O BCYGDJXHAGZNPQ-DCAQKATOSA-N 0.000 description 1
- QDMVXRNLOPTPIE-WDCWCFNPSA-N Glu-Lys-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QDMVXRNLOPTPIE-WDCWCFNPSA-N 0.000 description 1
- AQNYKMCFCCZEEL-JYJNAYRXSA-N Glu-Lys-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 AQNYKMCFCCZEEL-JYJNAYRXSA-N 0.000 description 1
- KJBGAZSLZAQDPV-KKUMJFAQSA-N Glu-Phe-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N KJBGAZSLZAQDPV-KKUMJFAQSA-N 0.000 description 1
- UDEPRBFQTWGLCW-CIUDSAMLSA-N Glu-Pro-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O UDEPRBFQTWGLCW-CIUDSAMLSA-N 0.000 description 1
- JYXKPJVDCAWMDG-ZPFDUUQYSA-N Glu-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)O)N JYXKPJVDCAWMDG-ZPFDUUQYSA-N 0.000 description 1
- VNCNWQPIQYAMAK-ACZMJKKPSA-N Glu-Ser-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O VNCNWQPIQYAMAK-ACZMJKKPSA-N 0.000 description 1
- DMYACXMQUABZIQ-NRPADANISA-N Glu-Ser-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O DMYACXMQUABZIQ-NRPADANISA-N 0.000 description 1
- HZISRJBYZAODRV-XQXXSGGOSA-N Glu-Thr-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O HZISRJBYZAODRV-XQXXSGGOSA-N 0.000 description 1
- LWYUQLZOIORFFJ-XKBZYTNZSA-N Glu-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O LWYUQLZOIORFFJ-XKBZYTNZSA-N 0.000 description 1
- GPSHCSTUYOQPAI-JHEQGTHGSA-N Glu-Thr-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O GPSHCSTUYOQPAI-JHEQGTHGSA-N 0.000 description 1
- ZNOHKCPYDAYYDA-BPUTZDHNSA-N Glu-Trp-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZNOHKCPYDAYYDA-BPUTZDHNSA-N 0.000 description 1
- QRWPTXLWHHTOCO-DZKIICNBSA-N Glu-Val-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QRWPTXLWHHTOCO-DZKIICNBSA-N 0.000 description 1
- IWAXHBCACVWNHT-BQBZGAKWSA-N Gly-Asp-Arg Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IWAXHBCACVWNHT-BQBZGAKWSA-N 0.000 description 1
- CEXINUGNTZFNRY-BYPYZUCNSA-N Gly-Cys-Gly Chemical compound [NH3+]CC(=O)N[C@@H](CS)C(=O)NCC([O-])=O CEXINUGNTZFNRY-BYPYZUCNSA-N 0.000 description 1
- ZQIMMEYPEXIYBB-IUCAKERBSA-N Gly-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN ZQIMMEYPEXIYBB-IUCAKERBSA-N 0.000 description 1
- XPJBQTCXPJNIFE-ZETCQYMHSA-N Gly-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)CN XPJBQTCXPJNIFE-ZETCQYMHSA-N 0.000 description 1
- QITBQGJOXQYMOA-ZETCQYMHSA-N Gly-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)CN QITBQGJOXQYMOA-ZETCQYMHSA-N 0.000 description 1
- DGKBSGNCMCLDSL-BYULHYEWSA-N Gly-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN DGKBSGNCMCLDSL-BYULHYEWSA-N 0.000 description 1
- YTSVAIMKVLZUDU-YUMQZZPRSA-N Gly-Leu-Asp Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YTSVAIMKVLZUDU-YUMQZZPRSA-N 0.000 description 1
- HHRODZSXDXMUHS-LURJTMIESA-N Gly-Met-Gly Chemical compound CSCC[C@H](NC(=O)C[NH3+])C(=O)NCC([O-])=O HHRODZSXDXMUHS-LURJTMIESA-N 0.000 description 1
- DHNXGWVNLFPOMQ-KBPBESRZSA-N Gly-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)CN DHNXGWVNLFPOMQ-KBPBESRZSA-N 0.000 description 1
- LBDXVCBAJJNJNN-WHFBIAKZSA-N Gly-Ser-Cys Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(O)=O LBDXVCBAJJNJNN-WHFBIAKZSA-N 0.000 description 1
- KOYUSMBPJOVSOO-XEGUGMAKSA-N Gly-Tyr-Ile Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KOYUSMBPJOVSOO-XEGUGMAKSA-N 0.000 description 1
- YDIDLLVFCYSXNY-RCOVLWMOSA-N Gly-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN YDIDLLVFCYSXNY-RCOVLWMOSA-N 0.000 description 1
- SYOJVRNQCXYEOV-XVKPBYJWSA-N Gly-Val-Glu Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SYOJVRNQCXYEOV-XVKPBYJWSA-N 0.000 description 1
- SBVMXEZQJVUARN-XPUUQOCRSA-N Gly-Val-Ser Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O SBVMXEZQJVUARN-XPUUQOCRSA-N 0.000 description 1
- HIAHVKLTHNOENC-HGNGGELXSA-N His-Glu-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O HIAHVKLTHNOENC-HGNGGELXSA-N 0.000 description 1
- KNNSUUOHFVVJOP-GUBZILKMSA-N His-Glu-Ser Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N KNNSUUOHFVVJOP-GUBZILKMSA-N 0.000 description 1
- CNHSMSFYVARZLI-YJRXYDGGSA-N His-His-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CNHSMSFYVARZLI-YJRXYDGGSA-N 0.000 description 1
- GJMHMDKCJPQJOI-IHRRRGAJSA-N His-Lys-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CN=CN1 GJMHMDKCJPQJOI-IHRRRGAJSA-N 0.000 description 1
- QEYUCKCWTMIERU-SRVKXCTJSA-N His-Lys-Asp Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N QEYUCKCWTMIERU-SRVKXCTJSA-N 0.000 description 1
- HYWZHNUGAYVEEW-KKUMJFAQSA-N His-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N HYWZHNUGAYVEEW-KKUMJFAQSA-N 0.000 description 1
- JSQIXEHORHLQEE-MEYUZBJRSA-N His-Phe-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JSQIXEHORHLQEE-MEYUZBJRSA-N 0.000 description 1
- UOYGZBIPZYKGSH-SRVKXCTJSA-N His-Ser-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N UOYGZBIPZYKGSH-SRVKXCTJSA-N 0.000 description 1
- IAYPZSHNZQHQNO-KKUMJFAQSA-N His-Ser-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC2=CN=CN2)N IAYPZSHNZQHQNO-KKUMJFAQSA-N 0.000 description 1
- WSXNWASHQNSMRX-GVXVVHGQSA-N His-Val-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N WSXNWASHQNSMRX-GVXVVHGQSA-N 0.000 description 1
- SCHZQZPYHBWYEQ-PEFMBERDSA-N Ile-Asn-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SCHZQZPYHBWYEQ-PEFMBERDSA-N 0.000 description 1
- IIXDMJNYALIKGP-DJFWLOJKSA-N Ile-Asn-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N IIXDMJNYALIKGP-DJFWLOJKSA-N 0.000 description 1
- XENGULNPUDGALZ-ZPFDUUQYSA-N Ile-Asn-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(C)C)C(=O)O)N XENGULNPUDGALZ-ZPFDUUQYSA-N 0.000 description 1
- UDLAWRKOVFDKFL-PEFMBERDSA-N Ile-Asp-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N UDLAWRKOVFDKFL-PEFMBERDSA-N 0.000 description 1
- HGNUKGZQASSBKQ-PCBIJLKTSA-N Ile-Asp-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N HGNUKGZQASSBKQ-PCBIJLKTSA-N 0.000 description 1
- VQUCKIAECLVLAD-SVSWQMSJSA-N Ile-Cys-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N VQUCKIAECLVLAD-SVSWQMSJSA-N 0.000 description 1
- YBJWJQQBWRARLT-KBIXCLLPSA-N Ile-Gln-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O YBJWJQQBWRARLT-KBIXCLLPSA-N 0.000 description 1
- BEWFWZRGBDVXRP-PEFMBERDSA-N Ile-Glu-Asn Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O BEWFWZRGBDVXRP-PEFMBERDSA-N 0.000 description 1
- UQXADIGYEYBJEI-DJFWLOJKSA-N Ile-His-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N UQXADIGYEYBJEI-DJFWLOJKSA-N 0.000 description 1
- PWDSHAAAFXISLE-SXTJYALSSA-N Ile-Ile-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O PWDSHAAAFXISLE-SXTJYALSSA-N 0.000 description 1
- AFERFBZLVUFWRA-HTFCKZLJSA-N Ile-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CS)C(=O)O)N AFERFBZLVUFWRA-HTFCKZLJSA-N 0.000 description 1
- HUORUFRRJHELPD-MNXVOIDGSA-N Ile-Leu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N HUORUFRRJHELPD-MNXVOIDGSA-N 0.000 description 1
- FCWFBHMAJZGWRY-XUXIUFHCSA-N Ile-Leu-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)O)N FCWFBHMAJZGWRY-XUXIUFHCSA-N 0.000 description 1
- PNTWNAXGBOZMBO-MNXVOIDGSA-N Ile-Lys-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PNTWNAXGBOZMBO-MNXVOIDGSA-N 0.000 description 1
- CEPIAEUVRKGPGP-DSYPUSFNSA-N Ile-Lys-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)[C@@H](C)CC)C(O)=O)=CNC2=C1 CEPIAEUVRKGPGP-DSYPUSFNSA-N 0.000 description 1
- UDBPXJNOEWDBDF-XUXIUFHCSA-N Ile-Lys-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)O)N UDBPXJNOEWDBDF-XUXIUFHCSA-N 0.000 description 1
- IMRKCLXPYOIHIF-ZPFDUUQYSA-N Ile-Met-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N IMRKCLXPYOIHIF-ZPFDUUQYSA-N 0.000 description 1
- OTSVBELRDMSPKY-PCBIJLKTSA-N Ile-Phe-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OTSVBELRDMSPKY-PCBIJLKTSA-N 0.000 description 1
- VOCZPDONPURUHV-QEWYBTABSA-N Ile-Phe-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VOCZPDONPURUHV-QEWYBTABSA-N 0.000 description 1
- IITVUURPOYGCTD-NAKRPEOUSA-N Ile-Pro-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IITVUURPOYGCTD-NAKRPEOUSA-N 0.000 description 1
- JHNJNTMTZHEDLJ-NAKRPEOUSA-N Ile-Ser-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O JHNJNTMTZHEDLJ-NAKRPEOUSA-N 0.000 description 1
- PELCGFMHLZXWBQ-BJDJZHNGSA-N Ile-Ser-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)O)N PELCGFMHLZXWBQ-BJDJZHNGSA-N 0.000 description 1
- QQVXERGIFIRCGW-NAKRPEOUSA-N Ile-Ser-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)O)N QQVXERGIFIRCGW-NAKRPEOUSA-N 0.000 description 1
- YBKKLDBBPFIXBQ-MBLNEYKQSA-N Ile-Thr-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(=O)O)N YBKKLDBBPFIXBQ-MBLNEYKQSA-N 0.000 description 1
- QGXQHJQPAPMACW-PPCPHDFISA-N Ile-Thr-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)O)N QGXQHJQPAPMACW-PPCPHDFISA-N 0.000 description 1
- XVUAQNRNFMVWBR-BLMTYFJBSA-N Ile-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N XVUAQNRNFMVWBR-BLMTYFJBSA-N 0.000 description 1
- HQLSBZFLOUHQJK-STECZYCISA-N Ile-Tyr-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N HQLSBZFLOUHQJK-STECZYCISA-N 0.000 description 1
- BCISUQVFDGYZBO-QSFUFRPTSA-N Ile-Val-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O BCISUQVFDGYZBO-QSFUFRPTSA-N 0.000 description 1
- 206010061598 Immunodeficiency Diseases 0.000 description 1
- 208000029462 Immunodeficiency disease Diseases 0.000 description 1
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 1
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 1
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 1
- CZCSUZMIRKFFFA-CIUDSAMLSA-N Leu-Ala-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O CZCSUZMIRKFFFA-CIUDSAMLSA-N 0.000 description 1
- WNGVUZWBXZKQES-YUMQZZPRSA-N Leu-Ala-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O WNGVUZWBXZKQES-YUMQZZPRSA-N 0.000 description 1
- WXHFZJFZWNCDNB-KKUMJFAQSA-N Leu-Asn-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WXHFZJFZWNCDNB-KKUMJFAQSA-N 0.000 description 1
- DLFAACQHIRSQGG-CIUDSAMLSA-N Leu-Asp-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O DLFAACQHIRSQGG-CIUDSAMLSA-N 0.000 description 1
- RRSLQOLASISYTB-CIUDSAMLSA-N Leu-Cys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(O)=O RRSLQOLASISYTB-CIUDSAMLSA-N 0.000 description 1
- DLCXCECTCPKKCD-GUBZILKMSA-N Leu-Gln-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O DLCXCECTCPKKCD-GUBZILKMSA-N 0.000 description 1
- YSKSXVKQLLBVEX-SZMVWBNQSA-N Leu-Gln-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 YSKSXVKQLLBVEX-SZMVWBNQSA-N 0.000 description 1
- HFBCHNRFRYLZNV-GUBZILKMSA-N Leu-Glu-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HFBCHNRFRYLZNV-GUBZILKMSA-N 0.000 description 1
- LAGPXKYZCCTSGQ-JYJNAYRXSA-N Leu-Glu-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LAGPXKYZCCTSGQ-JYJNAYRXSA-N 0.000 description 1
- WQWSMEOYXJTFRU-GUBZILKMSA-N Leu-Glu-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O WQWSMEOYXJTFRU-GUBZILKMSA-N 0.000 description 1
- HVJVUYQWFYMGJS-GVXVVHGQSA-N Leu-Glu-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVJVUYQWFYMGJS-GVXVVHGQSA-N 0.000 description 1
- KGCLIYGPQXUNLO-IUCAKERBSA-N Leu-Gly-Glu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O KGCLIYGPQXUNLO-IUCAKERBSA-N 0.000 description 1
- WRLPVDVHNWSSCL-MELADBBJSA-N Leu-His-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N2CCC[C@@H]2C(=O)O)N WRLPVDVHNWSSCL-MELADBBJSA-N 0.000 description 1
- HMDDEJADNKQTBR-BZSNNMDCSA-N Leu-His-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O HMDDEJADNKQTBR-BZSNNMDCSA-N 0.000 description 1
- AUBMZAMQCOYSIC-MNXVOIDGSA-N Leu-Ile-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O AUBMZAMQCOYSIC-MNXVOIDGSA-N 0.000 description 1
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 1
- WXUOJXIGOPMDJM-SRVKXCTJSA-N Leu-Lys-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O WXUOJXIGOPMDJM-SRVKXCTJSA-N 0.000 description 1
- AIRUUHAOKGVJAD-JYJNAYRXSA-N Leu-Phe-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIRUUHAOKGVJAD-JYJNAYRXSA-N 0.000 description 1
- SYRTUBLKWNDSDK-DKIMLUQUSA-N Leu-Phe-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SYRTUBLKWNDSDK-DKIMLUQUSA-N 0.000 description 1
- YRRCOJOXAJNSAX-IHRRRGAJSA-N Leu-Pro-Lys Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)O)N YRRCOJOXAJNSAX-IHRRRGAJSA-N 0.000 description 1
- IDGZVZJLYFTXSL-DCAQKATOSA-N Leu-Ser-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IDGZVZJLYFTXSL-DCAQKATOSA-N 0.000 description 1
- IZPVWNSAVUQBGP-CIUDSAMLSA-N Leu-Ser-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IZPVWNSAVUQBGP-CIUDSAMLSA-N 0.000 description 1
- SVBJIZVVYJYGLA-DCAQKATOSA-N Leu-Ser-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O SVBJIZVVYJYGLA-DCAQKATOSA-N 0.000 description 1
- HGLKOTPFWOMPOB-MEYUZBJRSA-N Leu-Thr-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HGLKOTPFWOMPOB-MEYUZBJRSA-N 0.000 description 1
- WUHBLPVELFTPQK-KKUMJFAQSA-N Leu-Tyr-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O WUHBLPVELFTPQK-KKUMJFAQSA-N 0.000 description 1
- JGKHAFUAPZCCDU-BZSNNMDCSA-N Leu-Tyr-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=C(O)C=C1 JGKHAFUAPZCCDU-BZSNNMDCSA-N 0.000 description 1
- QESXLSQLQHHTIX-RHYQMDGZSA-N Leu-Val-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QESXLSQLQHHTIX-RHYQMDGZSA-N 0.000 description 1
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 1
- ALSRJRIWBNENFY-DCAQKATOSA-N Lys-Arg-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O ALSRJRIWBNENFY-DCAQKATOSA-N 0.000 description 1
- QUCDKEKDPYISNX-HJGDQZAQSA-N Lys-Asn-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QUCDKEKDPYISNX-HJGDQZAQSA-N 0.000 description 1
- WGCKDDHUFPQSMZ-ZPFDUUQYSA-N Lys-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCCCN WGCKDDHUFPQSMZ-ZPFDUUQYSA-N 0.000 description 1
- IWWMPCPLFXFBAF-SRVKXCTJSA-N Lys-Asp-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O IWWMPCPLFXFBAF-SRVKXCTJSA-N 0.000 description 1
- PHHYNOUOUWYQRO-XIRDDKMYSA-N Lys-Asp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCCN)N PHHYNOUOUWYQRO-XIRDDKMYSA-N 0.000 description 1
- OJDFAABAHBPVTH-MNXVOIDGSA-N Lys-Ile-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O OJDFAABAHBPVTH-MNXVOIDGSA-N 0.000 description 1
- OIQSIMFSVLLWBX-VOAKCMCISA-N Lys-Leu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OIQSIMFSVLLWBX-VOAKCMCISA-N 0.000 description 1
- UQRZFMQQXXJTTF-AVGNSLFASA-N Lys-Lys-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O UQRZFMQQXXJTTF-AVGNSLFASA-N 0.000 description 1
- HVAUKHLDSDDROB-KKUMJFAQSA-N Lys-Lys-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O HVAUKHLDSDDROB-KKUMJFAQSA-N 0.000 description 1
- KJIXWRWPOCKYLD-IHRRRGAJSA-N Lys-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N KJIXWRWPOCKYLD-IHRRRGAJSA-N 0.000 description 1
- YDDDRTIPNTWGIG-SRVKXCTJSA-N Lys-Lys-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O YDDDRTIPNTWGIG-SRVKXCTJSA-N 0.000 description 1
- MTBLFIQZECOEBY-IHRRRGAJSA-N Lys-Met-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(O)=O MTBLFIQZECOEBY-IHRRRGAJSA-N 0.000 description 1
- INMBONMDMGPADT-AVGNSLFASA-N Lys-Met-Met Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCCCN)N INMBONMDMGPADT-AVGNSLFASA-N 0.000 description 1
- LNMKRJJLEFASGA-BZSNNMDCSA-N Lys-Phe-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O LNMKRJJLEFASGA-BZSNNMDCSA-N 0.000 description 1
- CRIODIGWCUPXKU-AVGNSLFASA-N Lys-Pro-Met Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(O)=O CRIODIGWCUPXKU-AVGNSLFASA-N 0.000 description 1
- GIKFNMZSGYAPEJ-HJGDQZAQSA-N Lys-Thr-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O GIKFNMZSGYAPEJ-HJGDQZAQSA-N 0.000 description 1
- JHNOXVASMSXSNB-WEDXCCLWSA-N Lys-Thr-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O JHNOXVASMSXSNB-WEDXCCLWSA-N 0.000 description 1
- XABXVVSWUVCZST-GVXVVHGQSA-N Lys-Val-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN XABXVVSWUVCZST-GVXVVHGQSA-N 0.000 description 1
- 241000124008 Mammalia Species 0.000 description 1
- QGQGAIBGTUJRBR-NAKRPEOUSA-N Met-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCSC QGQGAIBGTUJRBR-NAKRPEOUSA-N 0.000 description 1
- QEVRUYFHWJJUHZ-DCAQKATOSA-N Met-Ala-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(C)C QEVRUYFHWJJUHZ-DCAQKATOSA-N 0.000 description 1
- BLIPQDLSCFGUFA-GUBZILKMSA-N Met-Arg-Asn Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O BLIPQDLSCFGUFA-GUBZILKMSA-N 0.000 description 1
- IHITVQKJXQQGLJ-LPEHRKFASA-N Met-Asn-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N IHITVQKJXQQGLJ-LPEHRKFASA-N 0.000 description 1
- ZMYHJISLFYTQGK-FXQIFTODSA-N Met-Asp-Asn Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZMYHJISLFYTQGK-FXQIFTODSA-N 0.000 description 1
- FVKRQMQQFGBXHV-QXEWZRGKSA-N Met-Asp-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O FVKRQMQQFGBXHV-QXEWZRGKSA-N 0.000 description 1
- SDTSLIMYROCDNS-FXQIFTODSA-N Met-Cys-Asn Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(O)=O SDTSLIMYROCDNS-FXQIFTODSA-N 0.000 description 1
- IZLCDZDNZFEDHB-DCAQKATOSA-N Met-Cys-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N IZLCDZDNZFEDHB-DCAQKATOSA-N 0.000 description 1
- IECZNARPMKQGJC-XIRDDKMYSA-N Met-Gln-Trp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N IECZNARPMKQGJC-XIRDDKMYSA-N 0.000 description 1
- VZBXCMCHIHEPBL-SRVKXCTJSA-N Met-Glu-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN VZBXCMCHIHEPBL-SRVKXCTJSA-N 0.000 description 1
- SXWQMBGNFXAGAT-FJXKBIBVSA-N Met-Gly-Thr Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O SXWQMBGNFXAGAT-FJXKBIBVSA-N 0.000 description 1
- ZIIMORLEZLVRIP-SRVKXCTJSA-N Met-Leu-Gln Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZIIMORLEZLVRIP-SRVKXCTJSA-N 0.000 description 1
- XDGFFEZAZHRZFR-RHYQMDGZSA-N Met-Leu-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XDGFFEZAZHRZFR-RHYQMDGZSA-N 0.000 description 1
- HOZNVKDCKZPRER-XUXIUFHCSA-N Met-Lys-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HOZNVKDCKZPRER-XUXIUFHCSA-N 0.000 description 1
- CRVSHEPROQHVQT-AVGNSLFASA-N Met-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)O)N CRVSHEPROQHVQT-AVGNSLFASA-N 0.000 description 1
- KLGIQJRMFHIGCQ-ZFWWWQNUSA-N Met-Trp-Gly Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CCSC)C(=O)NCC(O)=O)=CNC2=C1 KLGIQJRMFHIGCQ-ZFWWWQNUSA-N 0.000 description 1
- UXJHNUBJSQQIOC-SZMVWBNQSA-N Met-Trp-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C(C)C)C(O)=O UXJHNUBJSQQIOC-SZMVWBNQSA-N 0.000 description 1
- 108010021466 Mutant Proteins Proteins 0.000 description 1
- 102000008300 Mutant Proteins Human genes 0.000 description 1
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 1
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 1
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 1
- 108020002230 Pancreatic Ribonuclease Proteins 0.000 description 1
- 102000005891 Pancreatic ribonuclease Human genes 0.000 description 1
- 108091005804 Peptidases Proteins 0.000 description 1
- YMORXCKTSSGYIG-IHRRRGAJSA-N Phe-Arg-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N YMORXCKTSSGYIG-IHRRRGAJSA-N 0.000 description 1
- FSPGBMWPNMRWDB-AVGNSLFASA-N Phe-Cys-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N FSPGBMWPNMRWDB-AVGNSLFASA-N 0.000 description 1
- IDUCUXTUHHIQIP-SOUVJXGZSA-N Phe-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O IDUCUXTUHHIQIP-SOUVJXGZSA-N 0.000 description 1
- FIRWJEJVFFGXSH-RYUDHWBXSA-N Phe-Glu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 FIRWJEJVFFGXSH-RYUDHWBXSA-N 0.000 description 1
- KDYPMIZMXDECSU-JYJNAYRXSA-N Phe-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 KDYPMIZMXDECSU-JYJNAYRXSA-N 0.000 description 1
- YCCUXNNKXDGMAM-KKUMJFAQSA-N Phe-Leu-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YCCUXNNKXDGMAM-KKUMJFAQSA-N 0.000 description 1
- INHMISZWLJZQGH-ULQDDVLXSA-N Phe-Leu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 INHMISZWLJZQGH-ULQDDVLXSA-N 0.000 description 1
- DNAXXTQSTKOHFO-QEJZJMRPSA-N Phe-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 DNAXXTQSTKOHFO-QEJZJMRPSA-N 0.000 description 1
- MMJJFXWMCMJMQA-STQMWFEESA-N Phe-Pro-Gly Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)NCC(O)=O)C1=CC=CC=C1 MMJJFXWMCMJMQA-STQMWFEESA-N 0.000 description 1
- IAOZOFPONWDXNT-IXOXFDKPSA-N Phe-Ser-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IAOZOFPONWDXNT-IXOXFDKPSA-N 0.000 description 1
- LTAWNJXSRUCFAN-UNQGMJICSA-N Phe-Thr-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LTAWNJXSRUCFAN-UNQGMJICSA-N 0.000 description 1
- GNRMAQSIROFNMI-IXOXFDKPSA-N Phe-Thr-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O GNRMAQSIROFNMI-IXOXFDKPSA-N 0.000 description 1
- BPIFSOUEUYDJRM-DCPHZVHLSA-N Phe-Trp-Ala Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](C)C(O)=O)C1=CC=CC=C1 BPIFSOUEUYDJRM-DCPHZVHLSA-N 0.000 description 1
- AGTHXWTYCLLYMC-FHWLQOOXSA-N Phe-Tyr-Glu Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CCC(O)=O)C(O)=O)C1=CC=CC=C1 AGTHXWTYCLLYMC-FHWLQOOXSA-N 0.000 description 1
- 108090000430 Phosphatidylinositol 3-kinases Proteins 0.000 description 1
- 108091000080 Phosphotransferase Proteins 0.000 description 1
- OOLOTUZJUBOMAX-GUBZILKMSA-N Pro-Ala-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O OOLOTUZJUBOMAX-GUBZILKMSA-N 0.000 description 1
- DRIJZWBRGMJCDD-DCAQKATOSA-N Pro-Gln-Met Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O DRIJZWBRGMJCDD-DCAQKATOSA-N 0.000 description 1
- MGDFPGCFVJFITQ-CIUDSAMLSA-N Pro-Glu-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MGDFPGCFVJFITQ-CIUDSAMLSA-N 0.000 description 1
- VPEVBAUSTBWQHN-NHCYSSNCSA-N Pro-Glu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O VPEVBAUSTBWQHN-NHCYSSNCSA-N 0.000 description 1
- KLSOMAFWRISSNI-OSUNSFLBSA-N Pro-Ile-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 KLSOMAFWRISSNI-OSUNSFLBSA-N 0.000 description 1
- HATVCTYBNCNMAA-AVGNSLFASA-N Pro-Leu-Met Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O HATVCTYBNCNMAA-AVGNSLFASA-N 0.000 description 1
- VVAWNPIOYXAMAL-KJEVXHAQSA-N Pro-Thr-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VVAWNPIOYXAMAL-KJEVXHAQSA-N 0.000 description 1
- YHUBAXGAAYULJY-ULQDDVLXSA-N Pro-Tyr-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O YHUBAXGAAYULJY-ULQDDVLXSA-N 0.000 description 1
- 239000004365 Protease Substances 0.000 description 1
- 238000010802 RNA extraction kit Methods 0.000 description 1
- 108700005075 Regulator Genes Proteins 0.000 description 1
- 206010057190 Respiratory tract infections Diseases 0.000 description 1
- 102100037486 Reverse transcriptase/ribonuclease H Human genes 0.000 description 1
- 206010039509 Scab Diseases 0.000 description 1
- IDQFQFVEWMWRQQ-DLOVCJGASA-N Ser-Ala-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O IDQFQFVEWMWRQQ-DLOVCJGASA-N 0.000 description 1
- NLQUOHDCLSFABG-GUBZILKMSA-N Ser-Arg-Arg Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NLQUOHDCLSFABG-GUBZILKMSA-N 0.000 description 1
- KYKKKSWGEPFUMR-NAKRPEOUSA-N Ser-Arg-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KYKKKSWGEPFUMR-NAKRPEOUSA-N 0.000 description 1
- BNFVPSRLHHPQKS-WHFBIAKZSA-N Ser-Asp-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O BNFVPSRLHHPQKS-WHFBIAKZSA-N 0.000 description 1
- OLIJLNWFEQEFDM-SRVKXCTJSA-N Ser-Asp-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OLIJLNWFEQEFDM-SRVKXCTJSA-N 0.000 description 1
- ZOHGLPQGEHSLPD-FXQIFTODSA-N Ser-Gln-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZOHGLPQGEHSLPD-FXQIFTODSA-N 0.000 description 1
- OJPHFSOMBZKQKQ-GUBZILKMSA-N Ser-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CO OJPHFSOMBZKQKQ-GUBZILKMSA-N 0.000 description 1
- QKQDTEYDEIJPNK-GUBZILKMSA-N Ser-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CO QKQDTEYDEIJPNK-GUBZILKMSA-N 0.000 description 1
- VQBCMLMPEWPUTB-ACZMJKKPSA-N Ser-Glu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O VQBCMLMPEWPUTB-ACZMJKKPSA-N 0.000 description 1
- WOUIMBGNEUWXQG-VKHMYHEASA-N Ser-Gly Chemical compound OC[C@H](N)C(=O)NCC(O)=O WOUIMBGNEUWXQG-VKHMYHEASA-N 0.000 description 1
- MIJWOJAXARLEHA-WDSKDSINSA-N Ser-Gly-Glu Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O MIJWOJAXARLEHA-WDSKDSINSA-N 0.000 description 1
- YMTLKLXDFCSCNX-BYPYZUCNSA-N Ser-Gly-Gly Chemical compound OC[C@H](N)C(=O)NCC(=O)NCC(O)=O YMTLKLXDFCSCNX-BYPYZUCNSA-N 0.000 description 1
- SFTZWNJFZYOLBD-ZDLURKLDSA-N Ser-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO SFTZWNJFZYOLBD-ZDLURKLDSA-N 0.000 description 1
- XXXAXOWMBOKTRN-XPUUQOCRSA-N Ser-Gly-Val Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXXAXOWMBOKTRN-XPUUQOCRSA-N 0.000 description 1
- IFPBAGJBHSNYPR-ZKWXMUAHSA-N Ser-Ile-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O IFPBAGJBHSNYPR-ZKWXMUAHSA-N 0.000 description 1
- DOSZISJPMCYEHT-NAKRPEOUSA-N Ser-Ile-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O DOSZISJPMCYEHT-NAKRPEOUSA-N 0.000 description 1
- FUMGHWDRRFCKEP-CIUDSAMLSA-N Ser-Leu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O FUMGHWDRRFCKEP-CIUDSAMLSA-N 0.000 description 1
- JWOBLHJRDADHLN-KKUMJFAQSA-N Ser-Leu-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JWOBLHJRDADHLN-KKUMJFAQSA-N 0.000 description 1
- GDUZTEQRAOXYJS-SRVKXCTJSA-N Ser-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GDUZTEQRAOXYJS-SRVKXCTJSA-N 0.000 description 1
- FZEUTKVQGMVGHW-AVGNSLFASA-N Ser-Phe-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O FZEUTKVQGMVGHW-AVGNSLFASA-N 0.000 description 1
- BUYHXYIUQUBEQP-AVGNSLFASA-N Ser-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CO)N BUYHXYIUQUBEQP-AVGNSLFASA-N 0.000 description 1
- GYDFRTRSSXOZCR-ACZMJKKPSA-N Ser-Ser-Glu Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O GYDFRTRSSXOZCR-ACZMJKKPSA-N 0.000 description 1
- BMKNXTJLHFIAAH-CIUDSAMLSA-N Ser-Ser-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O BMKNXTJLHFIAAH-CIUDSAMLSA-N 0.000 description 1
- 206010067868 Skin mass Diseases 0.000 description 1
- LHUBVKCLOVALIA-HJGDQZAQSA-N Thr-Arg-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O LHUBVKCLOVALIA-HJGDQZAQSA-N 0.000 description 1
- VFEHSAJCWWHDBH-RHYQMDGZSA-N Thr-Arg-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O VFEHSAJCWWHDBH-RHYQMDGZSA-N 0.000 description 1
- IRKWVRSEQFTGGV-VEVYYDQMSA-N Thr-Asn-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IRKWVRSEQFTGGV-VEVYYDQMSA-N 0.000 description 1
- QWMPARMKIDVBLV-VZFHVOOUSA-N Thr-Cys-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O QWMPARMKIDVBLV-VZFHVOOUSA-N 0.000 description 1
- VGYBYGQXZJDZJU-XQXXSGGOSA-N Thr-Glu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VGYBYGQXZJDZJU-XQXXSGGOSA-N 0.000 description 1
- VOHWDZNIESHTFW-XKBZYTNZSA-N Thr-Glu-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N)O VOHWDZNIESHTFW-XKBZYTNZSA-N 0.000 description 1
- XOTBWOCSLMBGMF-SUSMZKCASA-N Thr-Glu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOTBWOCSLMBGMF-SUSMZKCASA-N 0.000 description 1
- AQAMPXBRJJWPNI-JHEQGTHGSA-N Thr-Gly-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AQAMPXBRJJWPNI-JHEQGTHGSA-N 0.000 description 1
- NQVDGKYAUHTCME-QTKMDUPCSA-N Thr-His-Arg Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N)O NQVDGKYAUHTCME-QTKMDUPCSA-N 0.000 description 1
- YUPVPKZBKCLFLT-QTKMDUPCSA-N Thr-His-Val Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N)O YUPVPKZBKCLFLT-QTKMDUPCSA-N 0.000 description 1
- WPAKPLPGQNUXGN-OSUNSFLBSA-N Thr-Ile-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WPAKPLPGQNUXGN-OSUNSFLBSA-N 0.000 description 1
- IMDMLDSVUSMAEJ-HJGDQZAQSA-N Thr-Leu-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IMDMLDSVUSMAEJ-HJGDQZAQSA-N 0.000 description 1
- MEJHFIOYJHTWMK-VOAKCMCISA-N Thr-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)[C@@H](C)O MEJHFIOYJHTWMK-VOAKCMCISA-N 0.000 description 1
- BDGBHYCAZJPLHX-HJGDQZAQSA-N Thr-Lys-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O BDGBHYCAZJPLHX-HJGDQZAQSA-N 0.000 description 1
- HPQHHRLWSAMMKG-KATARQTJSA-N Thr-Lys-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)O)N)O HPQHHRLWSAMMKG-KATARQTJSA-N 0.000 description 1
- DXPURPNJDFCKKO-RHYQMDGZSA-N Thr-Lys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O DXPURPNJDFCKKO-RHYQMDGZSA-N 0.000 description 1
- MCDVZTRGHNXTGK-HJGDQZAQSA-N Thr-Met-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O MCDVZTRGHNXTGK-HJGDQZAQSA-N 0.000 description 1
- HSQXHRIRJSFDOH-URLPEUOOSA-N Thr-Phe-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HSQXHRIRJSFDOH-URLPEUOOSA-N 0.000 description 1
- WTMPKZWHRCMMMT-KZVJFYERSA-N Thr-Pro-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WTMPKZWHRCMMMT-KZVJFYERSA-N 0.000 description 1
- XKWABWFMQXMUMT-HJGDQZAQSA-N Thr-Pro-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O XKWABWFMQXMUMT-HJGDQZAQSA-N 0.000 description 1
- HUPLKEHTTQBXSC-YJRXYDGGSA-N Thr-Ser-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HUPLKEHTTQBXSC-YJRXYDGGSA-N 0.000 description 1
- IEZVHOULSUULHD-XGEHTFHBSA-N Thr-Ser-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O IEZVHOULSUULHD-XGEHTFHBSA-N 0.000 description 1
- YRJOLUDFVAUXLI-GSSVUCPTSA-N Thr-Thr-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O YRJOLUDFVAUXLI-GSSVUCPTSA-N 0.000 description 1
- NHQVWACSJZJCGJ-FLBSBUHZSA-N Thr-Thr-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NHQVWACSJZJCGJ-FLBSBUHZSA-N 0.000 description 1
- REJRKTOJTCPDPO-IRIUXVKKSA-N Thr-Tyr-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O REJRKTOJTCPDPO-IRIUXVKKSA-N 0.000 description 1
- JAWUQFCGNVEDRN-MEYUZBJRSA-N Thr-Tyr-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N)O JAWUQFCGNVEDRN-MEYUZBJRSA-N 0.000 description 1
- BKVICMPZWRNWOC-RHYQMDGZSA-N Thr-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O BKVICMPZWRNWOC-RHYQMDGZSA-N 0.000 description 1
- VYVBSMCZNHOZGD-RCWTZXSCSA-N Thr-Val-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O VYVBSMCZNHOZGD-RCWTZXSCSA-N 0.000 description 1
- WCTYCXZYBNKEIV-SXNHZJKMSA-N Trp-Glu-Ile Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O)=CNC2=C1 WCTYCXZYBNKEIV-SXNHZJKMSA-N 0.000 description 1
- RRXPAFGTFQIEMD-IVJVFBROSA-N Trp-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N RRXPAFGTFQIEMD-IVJVFBROSA-N 0.000 description 1
- CXPJPTFWKXNDKV-NUTKFTJISA-N Trp-Leu-Ala Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 CXPJPTFWKXNDKV-NUTKFTJISA-N 0.000 description 1
- OGZRZMJASKKMJZ-XIRDDKMYSA-N Trp-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N OGZRZMJASKKMJZ-XIRDDKMYSA-N 0.000 description 1
- NLLARHRWSFNEMH-NUTKFTJISA-N Trp-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N NLLARHRWSFNEMH-NUTKFTJISA-N 0.000 description 1
- WKQNLTQSCYXKQK-VFAJRCTISA-N Trp-Lys-Thr Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WKQNLTQSCYXKQK-VFAJRCTISA-N 0.000 description 1
- NIHNMOSRSAYZIT-BPNCWPANSA-N Tyr-Ala-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NIHNMOSRSAYZIT-BPNCWPANSA-N 0.000 description 1
- CRWOSTCODDFEKZ-HRCADAONSA-N Tyr-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O CRWOSTCODDFEKZ-HRCADAONSA-N 0.000 description 1
- SCCKSNREWHMKOJ-SRVKXCTJSA-N Tyr-Asn-Ser Chemical compound N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O SCCKSNREWHMKOJ-SRVKXCTJSA-N 0.000 description 1
- GAYLGYUVTDMLKC-UWJYBYFXSA-N Tyr-Asp-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 GAYLGYUVTDMLKC-UWJYBYFXSA-N 0.000 description 1
- JWHOIHCOHMZSAR-QWRGUYRKSA-N Tyr-Asp-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JWHOIHCOHMZSAR-QWRGUYRKSA-N 0.000 description 1
- YRBHLWWGSSQICE-IHRRRGAJSA-N Tyr-Asp-Met Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O YRBHLWWGSSQICE-IHRRRGAJSA-N 0.000 description 1
- ARPONUQDNWLXOZ-KKUMJFAQSA-N Tyr-Gln-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ARPONUQDNWLXOZ-KKUMJFAQSA-N 0.000 description 1
- LOOCQRRBKZTPKO-AVGNSLFASA-N Tyr-Glu-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 LOOCQRRBKZTPKO-AVGNSLFASA-N 0.000 description 1
- CDHQEOXPWBDFPL-QWRGUYRKSA-N Tyr-Gly-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CDHQEOXPWBDFPL-QWRGUYRKSA-N 0.000 description 1
- NXRGXTBPMOGFID-CFMVVWHZSA-N Tyr-Ile-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O NXRGXTBPMOGFID-CFMVVWHZSA-N 0.000 description 1
- MVFQLSPDMMFCMW-KKUMJFAQSA-N Tyr-Leu-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O MVFQLSPDMMFCMW-KKUMJFAQSA-N 0.000 description 1
- PRONOHBTMLNXCZ-BZSNNMDCSA-N Tyr-Leu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 PRONOHBTMLNXCZ-BZSNNMDCSA-N 0.000 description 1
- GZOCMHSZGGJBCX-ULQDDVLXSA-N Tyr-Lys-Met Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O GZOCMHSZGGJBCX-ULQDDVLXSA-N 0.000 description 1
- NKMFRGPKTIEXSK-ULQDDVLXSA-N Tyr-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N NKMFRGPKTIEXSK-ULQDDVLXSA-N 0.000 description 1
- UPODKYBYUBTWSV-BZSNNMDCSA-N Tyr-Phe-Cys Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CS)C(O)=O)C1=CC=C(O)C=C1 UPODKYBYUBTWSV-BZSNNMDCSA-N 0.000 description 1
- XGZBEGGGAUQBMB-KJEVXHAQSA-N Tyr-Pro-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC2=CC=C(C=C2)O)N)O XGZBEGGGAUQBMB-KJEVXHAQSA-N 0.000 description 1
- MQGGXGKQSVEQHR-KKUMJFAQSA-N Tyr-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 MQGGXGKQSVEQHR-KKUMJFAQSA-N 0.000 description 1
- CLEGSEJVGBYZBJ-MEYUZBJRSA-N Tyr-Thr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CLEGSEJVGBYZBJ-MEYUZBJRSA-N 0.000 description 1
- JHDZONWZTCKTJR-KJEVXHAQSA-N Tyr-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JHDZONWZTCKTJR-KJEVXHAQSA-N 0.000 description 1
- VMRFIKXKOFNMHW-GUBZILKMSA-N Val-Arg-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N VMRFIKXKOFNMHW-GUBZILKMSA-N 0.000 description 1
- UDNYEPLJTRDMEJ-RCOVLWMOSA-N Val-Asn-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N UDNYEPLJTRDMEJ-RCOVLWMOSA-N 0.000 description 1
- XGJLNBNZNMVJRS-NRPADANISA-N Val-Glu-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O XGJLNBNZNMVJRS-NRPADANISA-N 0.000 description 1
- CVIXTAITYJQMPE-LAEOZQHASA-N Val-Glu-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CVIXTAITYJQMPE-LAEOZQHASA-N 0.000 description 1
- SZTTYWIUCGSURQ-AUTRQRHGSA-N Val-Glu-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SZTTYWIUCGSURQ-AUTRQRHGSA-N 0.000 description 1
- UEHRGZCNLSWGHK-DLOVCJGASA-N Val-Glu-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UEHRGZCNLSWGHK-DLOVCJGASA-N 0.000 description 1
- PIFJAFRUVWZRKR-QMMMGPOBSA-N Val-Gly-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O PIFJAFRUVWZRKR-QMMMGPOBSA-N 0.000 description 1
- YTPLVNUZZOBFFC-SCZZXKLOSA-N Val-Gly-Pro Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N1CCC[C@@H]1C(O)=O YTPLVNUZZOBFFC-SCZZXKLOSA-N 0.000 description 1
- DIOSYUIWOQCXNR-ONGXEEELSA-N Val-Lys-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O DIOSYUIWOQCXNR-ONGXEEELSA-N 0.000 description 1
- YTNGABPUXFEOGU-SRVKXCTJSA-N Val-Pro-Arg Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O YTNGABPUXFEOGU-SRVKXCTJSA-N 0.000 description 1
- NHXZRXLFOBFMDM-AVGNSLFASA-N Val-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C NHXZRXLFOBFMDM-AVGNSLFASA-N 0.000 description 1
- BGXVHVMJZCSOCA-AVGNSLFASA-N Val-Pro-Lys Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)O)N BGXVHVMJZCSOCA-AVGNSLFASA-N 0.000 description 1
- JQTYTBPCSOAZHI-FXQIFTODSA-N Val-Ser-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N JQTYTBPCSOAZHI-FXQIFTODSA-N 0.000 description 1
- VIKZGAUAKQZDOF-NRPADANISA-N Val-Ser-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O VIKZGAUAKQZDOF-NRPADANISA-N 0.000 description 1
- NZYNRRGJJVSSTJ-GUBZILKMSA-N Val-Ser-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O NZYNRRGJJVSSTJ-GUBZILKMSA-N 0.000 description 1
- VTIAEOKFUJJBTC-YDHLFZDLSA-N Val-Tyr-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N VTIAEOKFUJJBTC-YDHLFZDLSA-N 0.000 description 1
- JSOXWWFKRJKTMT-WOPDTQHZSA-N Val-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N JSOXWWFKRJKTMT-WOPDTQHZSA-N 0.000 description 1
- 241000700605 Viruses Species 0.000 description 1
- 230000004913 activation Effects 0.000 description 1
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 1
- 108010011559 alanylphenylalanine Proteins 0.000 description 1
- 230000004075 alteration Effects 0.000 description 1
- 125000000539 amino acid group Chemical group 0.000 description 1
- 230000006907 apoptotic process Effects 0.000 description 1
- 108010008355 arginyl-glutamine Proteins 0.000 description 1
- 108010059459 arginyl-threonyl-phenylalanine Proteins 0.000 description 1
- 108010068380 arginylarginine Proteins 0.000 description 1
- 210000004507 artificial chromosome Anatomy 0.000 description 1
- 108010010430 asparagine-proline-alanine Proteins 0.000 description 1
- 108010047857 aspartylglycine Proteins 0.000 description 1
- 230000031018 biological processes and functions Effects 0.000 description 1
- 230000033228 biological regulation Effects 0.000 description 1
- 210000005252 bulbus oculi Anatomy 0.000 description 1
- 210000004899 c-terminal region Anatomy 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 201000011510 cancer Diseases 0.000 description 1
- 239000003795 chemical substances by application Substances 0.000 description 1
- 230000002759 chromosomal effect Effects 0.000 description 1
- 230000005886 chromosome breakage Effects 0.000 description 1
- 238000003759 clinical diagnosis Methods 0.000 description 1
- 238000010367 cloning Methods 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 230000006378 damage Effects 0.000 description 1
- 238000007405 data analysis Methods 0.000 description 1
- 231100000895 deafness Toxicity 0.000 description 1
- 108010054813 diprotin B Proteins 0.000 description 1
- 230000005782 double-strand break Effects 0.000 description 1
- 238000001035 drying Methods 0.000 description 1
- 210000002310 elbow joint Anatomy 0.000 description 1
- 230000001815 facial effect Effects 0.000 description 1
- 230000001605 fetal effect Effects 0.000 description 1
- 230000004927 fusion Effects 0.000 description 1
- 108010042598 glutamyl-aspartyl-glycine Proteins 0.000 description 1
- 108010037389 glutamyl-cysteinyl-lysine Proteins 0.000 description 1
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Natural products NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 1
- 108010026364 glycyl-glycyl-leucine Proteins 0.000 description 1
- 108010010147 glycylglutamine Proteins 0.000 description 1
- 108010015792 glycyllysine Proteins 0.000 description 1
- 210000003128 head Anatomy 0.000 description 1
- 208000016354 hearing loss disease Diseases 0.000 description 1
- 230000006801 homologous recombination Effects 0.000 description 1
- 238000002744 homologous recombination Methods 0.000 description 1
- 230000007813 immunodeficiency Effects 0.000 description 1
- 238000000338 in vitro Methods 0.000 description 1
- 238000001727 in vivo Methods 0.000 description 1
- 230000006698 induction Effects 0.000 description 1
- 230000005865 ionizing radiation Effects 0.000 description 1
- 230000002147 killing effect Effects 0.000 description 1
- 210000000265 leukocyte Anatomy 0.000 description 1
- 108010017391 lysylvaline Proteins 0.000 description 1
- 230000008774 maternal effect Effects 0.000 description 1
- 230000001404 mediated effect Effects 0.000 description 1
- 108010085203 methionylmethionine Proteins 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 230000024121 nodulation Effects 0.000 description 1
- 230000007119 pathological manifestation Effects 0.000 description 1
- 238000002205 phenol-chloroform extraction Methods 0.000 description 1
- 108010064486 phenylalanyl-leucyl-valine Proteins 0.000 description 1
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 1
- 230000026731 phosphorylation Effects 0.000 description 1
- 238000006366 phosphorylation reaction Methods 0.000 description 1
- 102000020233 phosphotransferase Human genes 0.000 description 1
- 230000019612 pigmentation Effects 0.000 description 1
- 238000003793 prenatal diagnosis Methods 0.000 description 1
- 230000000750 progressive effect Effects 0.000 description 1
- 108010090894 prolylleucine Proteins 0.000 description 1
- 108010053725 prolylvaline Proteins 0.000 description 1
- 238000000746 purification Methods 0.000 description 1
- 239000012264 purified product Substances 0.000 description 1
- 230000005855 radiation Effects 0.000 description 1
- 238000003753 real-time PCR Methods 0.000 description 1
- 230000001105 regulatory effect Effects 0.000 description 1
- 230000008439 repair process Effects 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 230000028617 response to DNA damage stimulus Effects 0.000 description 1
- 238000005070 sampling Methods 0.000 description 1
- 238000005204 segregation Methods 0.000 description 1
- 238000010008 shearing Methods 0.000 description 1
- 230000019491 signal transduction Effects 0.000 description 1
- 206010040872 skin infection Diseases 0.000 description 1
- 238000001228 spectrum Methods 0.000 description 1
- 239000000126 substance Substances 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 239000000758 substrate Substances 0.000 description 1
- 208000024891 symptom Diseases 0.000 description 1
- 108091035539 telomere Proteins 0.000 description 1
- 210000003411 telomere Anatomy 0.000 description 1
- 102000055501 telomere Human genes 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- 108010061238 threonyl-glycine Proteins 0.000 description 1
- 108010071097 threonyl-lysyl-proline Proteins 0.000 description 1
- 210000001519 tissue Anatomy 0.000 description 1
- 238000013519 translation Methods 0.000 description 1
- 108700004896 tripeptide FEG Proteins 0.000 description 1
- 108010051110 tyrosyl-lysine Proteins 0.000 description 1
- 238000011144 upstream manufacturing Methods 0.000 description 1
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 1
Images
Landscapes
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
- Peptides Or Proteins (AREA)
Abstract
本发明公开了ATM基因突变体及其应用,具体涉及分离的编码ATM基因突变体的核酸、分离的多肽、筛选易患共济失调毛细血管扩张症的生物样品的方法、筛选易患共济失调毛细血管扩张症的生物样品的系统以及用于筛选易患共济失调毛细血管扩张症的生物样品的试剂盒。其中,分离的编码ATM基因突变体的核酸,与SEQ ID NO:1相比,具有选自下列的至少一种突变:c.4777‑2A>T、c.3078G>T、c.7983_7985delTGT和c.477_481delATCTC。通过检测该突变体在生物样品中是否存在,可以有效地检测生物样品是否易患共济失调毛细血管扩张症。
Description
技术领域
本发明涉及技术领域。具体而言,本发明涉及ATM基因突变体及其应用。
背景技术
共济失调毛细血管扩张症(ataxia-telangiectasia,AT)是一种罕见的常染色体隐性遗传(autosomal recessive inherited,AR)病,又叫Louis-Bar综合征,于1941年由Louis-Bar首先报道。AT发病率约1/4万~1/10万,多于儿童期起病,临床主要表现为进行性发展的小脑性共济失调,眼球结膜和面部皮肤的毛细血管扩张,反复发作的呼吸道感染;同时还具有对射线的杀伤作用异常敏感,染色体不稳定性,易患恶性肿瘤,免疫缺陷和抗射线的DNA复制合成等特征。
Savitsky和Uziel等率先克隆AT的疾病基因(ATM),并将其定位于11q22.3。ATM是AT唯一的致病基因,全长约150kb,而编码序列只有12kb,共有66个外显子,编码一个有3056个氨基酸残基,分子质量为350000的蛋白质。ATM基因产物ATM编码蛋白作为直接感受DNA双链断裂(DSBs)损伤信号的上游分子,以其激酶活性,催化多种重要功能底物蛋白的磷酸化,使受损的DNA停止于细胞周期检测点并对其进行修复,对维持染色体的稳定性起重要作用。
ATM基因的克隆为AT的基因诊断和症状前诊断奠定了基础,并且很大程度上解决了该病临床诊断困难的问题。ATM蛋白在胎儿和成人组织内均有广泛表达,其功能域包括磷酯酰肌醇3-激酶(phosphoinositide-3kinase,PI3K)、Rad3、亮氨酸拉链(leucine zipper,LZ)、局部的连接区域(focaladhesion target,FAT)、局部的连接区域C端(FATC)等。PI3K蛋白家族成员和具有Rad3结构域的蛋白家族成员参与了激活细胞周期关卡,控制端粒长度和应答DNA损伤等重要的生物学过程。
AT最显著的病理学表现是基因组的不稳定性,主要反应在高频率的染色体断裂,自发染色体末端融合和染色体内同源重组及染色体易位,以及体内、外染色体自发畸变率和辐射诱发染色体畸变率的异常增加。作为DNA损伤信号传导通路中最早的传感和中枢调控基因,ATM编码的ATM蛋白在电离辐射的诱导下被激活,并磷酸化其下游蛋白,通过调控细胞周期关卡、DNA损伤修复、细胞凋亡等维持基因组稳定性。AT细胞由于ATM基因突变,缺失了ATM蛋白,丧失了对下游基因的调控作用,使受损的DNA的合成受抑,阻碍DNA的修复过程,从而导致染色体的断裂和基因组不稳定。
ATM基因的突变具有极高的异质性,突变方式和突变检测都极为复杂。到目前为止,国外已经报道了400余种ATM基因突变,绝大多数AT患者为复合性杂合突变,突变形式有错义突变、无义突变、剪切位点突变、小片段插入突变、小片段缺失突变、大片段缺失和同义突变等,突变位点遍布基因全长,无突变热点。
因而,目前对共济失调毛细血管扩张症的致病基因ATM突变基因的研究仍有待深入。
发明内容
本发明旨在至少在一定程度上解决上述技术问题之一。为此,本发明的一个目的在于提出一种能够有效筛选易患共济失调毛细血管扩张症的生物样品的方法。
本发明是基于发明人的下列工作完成的:发明人通过高通量外显子组测序联合候选基因突变验证的方法确定了共济失调毛细血管扩张症新的致病基因突变位点(ATM基因的c.5697C>A、c.4777-2A>T、c.3078G>T、c.7983_7985delTGT、c.477_481delATCTC和c.8851_8987del突变)。
根据本发明的第一方面,本发明提出了一种分离的编码ATM基因突变体的核酸。根据本发明的实施例,所述核酸与SEQ ID NO:1相比,具有选自下列的至少一种突变:c.5697C>A、c.4777-2A>T、c.3078G>T、c.7983_7985delTGT、c.477_481delATCTC和c.8851_8987del(exon 65del)。即相对于野生型ATM基因,本发明的ATM基因具有c.5697C>A、c.4777-2A>T、c.3078G>T、c.7983_7985delTGT、c.477_481delATCTC和c.8851_8987del(exon 65del)的至少一种。根据本发明的实施例,发明人确定了ATM基因的新突变体,该突变体与共济失调毛细血管扩张症的发病密切相关,从而通过检测该新突变体在生物样品中是否存在,可以有效地检测生物样品是否易患共济失调毛细血管扩张症。
根据本发明的第二个方面,本发明提出了一种分离的多肽,该多肽与SEQ ID NO:2相比,具有选自下列的至少一种突变:p.Cys1899X、p.Glu2950fs、p.Trp1026Cys、p.Val2662del、p.Ile159fs和p.2951_2995del,通过检测生物样品中是否表达该多肽,可以有效地检测生物样品是否易患共济失调毛细血管扩张症。根据本发明的实施例,该多肽是由上述核酸编码的。
根据本发明的地三个方面,本发明提出了一种筛选易患共济失调毛细血管扩张症的生物样品的方法,该方法包括以下步骤:从所述生物样品提取核酸样本;确定所述核酸样本的核酸序列;所述核酸样本的核酸序列与SEQ ID NO:1相比,具有选自c.5697C>A、c.4777-2A>T、c.3078G>T、c.7983_7985delTGT、c.477_481delATCTC和c.8851_8987del(exon 65del)的至少一种突变是所述生物样品易患共济失调毛细血管扩张症的指示,任选地,所述生物样品为选自人体血液,任选地,所述核酸样本为全基因组DNA,任选地,所述共济失调毛细血管扩张症是常染色体隐性遗传疾病。通过根据本发明实施例的筛选易感共济失调毛细血管扩张症的生物样品的方法,可以有效地筛选易感共济失调毛细血管扩张症的生物样品。
根据本发明的第四方面,本发明提出了一种筛选易患共济失调毛细血管扩张症的生物样品的系统,该系统包括:核酸提取装置,所述核酸提取装置用于从所述生物样品提取核酸样本;核酸序列确定装置,所述核酸序列确定装置与所述核酸提取装置相连,用于对所述核酸样本进行分析,以便确定所述核酸样本的核酸序列;判断装置,所述判断装置与所述核酸序列确定装置相连,以便基于所述核酸样本的核酸序列与SEQ ID NO:1相比,是否具有c.5697C>A、c.4777-2A>T、c.3078G>T、c.7983_7985delTGT、c.477_481delATCTC和c.8851_8987del(exon 65del)的至少一种突变,判断所述生物样品是否易患共济失调毛细血管扩张症;任选地,所述共济失调毛细血管扩张症是常染色体隐性遗传疾病。利用该系统,能够有效地实施前述筛选易感共济失调毛细血管扩张症的生物样品的方法,从而可以有效地筛选易感共济失调毛细血管扩张症的生物样品。
根据本发明的第五方面,本发明提出了一种用于筛选易患共济失调毛细血管扩张症的生物样品的试剂盒,该试剂盒含有:适于检测ATM基因突变体的试剂,其中与SEQ IDNO:1相比,所述ATM基因突变体具有选自下列的至少一种突变:c.5697C>A、c.4777-2A>T、c.3078G>T、c.7983_7985delTGT、c.477_481delATCTC和c.8851_8987del(exon 65del),任选地,所述试剂为核酸探针或引物,任选地,所述核酸探针或引物具有如SEQ ID NO:7-16所示的核苷酸序列,任选地,所述是常染色体隐性遗传疾病。
根据本发明的第六方面,本发明还提出了一种构建体。根据本发明的实施例,该构建体包含前面所述的分离的编码ATM基因突变体的核酸。需要说明的是,“构建体包含前面所述的分离的编码ATM基因突变体的核酸”表示,本发明的构建体包含与SEQ ID NO:1相比具有c.5697C>A、c.4777-2A>T、c.3078G>T、c.7983_7985delTGT、c.477_481delATCTC和c.8851_8987del(exon 65del)至少一种突变。由此,利用本发明的构建体转化受体细胞获得的重组细胞,能够有效地用作共济失调毛细血管扩张症相关研究的模型。
根据本发明的第七方面,本发明还提出了一种重组细胞。根据本发明的实施例,该重组细胞是通过前面所述的构建体转化受体细胞而获得的。根据本发明的一些实施例,本发明的重组细胞,能够有效地用作共济失调毛细血管扩张症相关研究的模型。
对于本发明中所述核酸,本领域技术人员应当理解,实际包括互补双链的任意一条,或者两条。为了方便,在本发明的实施例中,虽然多数情况下只给出了一条链,但实际上也公开了与之互补的另一条链。例如,提及SEQ ID NO:1,实际包括其互补序列。本领域技术人员还可以理解,利用一条链可以检测另一条链,反之亦然。
本发明的附加方面和优点将在下面的描述中部分给出,部分将从下面的描述中变得明显,或通过本发明的实践了解到。
附图说明
本发明的上述和/或附加的方面和优点从结合下面附图对实施例的描述中将变得明显和容易理解,其中:
图1:显示了根据本发明一个实施例的筛选易患共济失调毛细血管扩张症的生物样品的系统及其组成部分的示意图,其中,
A为根据本发明实施例的筛选易患共济失调毛细血管扩张症患者的生物样品的系统的示意图,
B为根据本发明实施例的核酸提取装置的示意图,
C为根据本发明实施例的核酸序列确定装置的示意图。
图2:显示了根据本发明一个实施例的共济失调毛细血管扩张症患者的家系图谱。
图3:显示了根据本发明一个实施例的AT家系1的ATM基因突变体的Sanger法测序结果。
图4:显示了根据本发明一个实施例的AT家系2的ATM基因突变体的Sanger法测序结果。
图5:显示了根据本发明一个实施例的AT家系3的ATM基因突变体的Sanger法测序结果。
图6:显示了根据本发明一个实施例的AT家系1的ATM基因突变体的QPCR的结果图。
图7:显示了根据本发明一个实施例的AT家系2的ATM基因突变体的Sanger法测序结果。
具体实施方式
下面详细描述本发明的实施例,所述实施例的示例在附图中示出,其中自始至终相同或类似的标号表示相同或类似的元件或具有相同或类似功能的元件。下面通过参考附图描述的实施例是示例性的,旨在用于解释本发明,而不能理解为对本发明的限制。
ATM基因突变体
根据本发明的第一方面,本发明提出了一种分离的编码ATM基因突变体的核酸。根据本发明的实施例,所述核酸与SEQ ID NO:1相比,具有选自下列的至少一种突变:c.5697C>A、c.4777-2A>T、c.3078G>T、c.7983_7985delTGT、c.477_481delATCTC和c.8851_8987del(exon 65del)。即相对于野生型ATM基因,本发明的ATM基因具有c.5697C>A、c.4777-2A>T、c.3078G>T、c.7983_7985delTGT、c.477_481delATCTC和c.8851_8987del(exon 65del)突变的至少一种。根据本发明的实施例,发明人确定了ATM基因的新突变体,该突变体与共济失调毛细血管扩张症的发病密切相关,从而通过检测该新突变体在生物样品中是否存在,可以有效地检测生物样品是否易患共济失调毛细血管扩张症。
在本文中所使用的表达方式“编码ATM突变体的核酸”,是指与编码ATM突变体的基因相对应的核酸物质,即核酸的类型不受特别限制,可以是任何包含与ATM突变体的编码基因相对应的脱氧核糖核苷酸和/或核糖核苷酸的聚合物,包括但不限于DNA、RNA或cDNA。根据本发明的一个具体示例,前面所述的编码ATM突变体的核酸为DNA。根据本发明的实施例,发明人确定了ATM基因的新突变体,这些新突变体与共济失调毛细血管扩张症的发病密切相关,从而通过检测该新突变体在生物样品中是否存在,可以有效地检测生物样品是否易患共济失调毛细血管扩张症,也可以通过检测这些突变体在生物体中是否存在,可以有效地预测生物体是否易患共济失调毛细血管扩张症。
该编码ATM突变体的核酸,是本申请的发明人通过高通量外显子组测序联合候选基因突变验证的方法确定的共济失调毛细血管扩张症的致病基因上的新突变。该突变位点在现有技术中并未被提到。
其中,野生型ATM基因的cDNA具有如下所示的核苷酸序列:
ATGAGTCTAGTACTTAATGATCTGCTTATCTGCTGCCGTCAACTAGAACATGATAGAGCTACAGAACGAAAGAAAGAAGTTGAGAAATTTAAGCGCCTGATTCGAGATCCTGAAACAATTAAACATCTAGATCGGCATTCAGATTCCAAACAAGGAAAATATTTGAATTGGGATGCTGTTTTTAGATTTTTACAGAAATATATTCAGAAAGAAACAGAATGTCTGAGAATAGCAAAACCAAATGTATCAGCCTCAACACAAGCCTCCAGGCAGAAAAAGATGCAGGAAATCAGTAGTTTGGTCAAATACTTCATCAAATGTGCAAACAGAAGAGCACCTAGGCTAAAATGTCAAGAACTCTTAAATTATATCATGGATACAGTGAAAGATTCATCTAATGGTGCTATTTACGGAGCTGATTGTAGCAACATACTACTCAAAGACATTCTTTCTGTGAGAAAATACTGGTGTGAAATATCTCAGCAACAGTGGTTAGAATTGTTCTCTGTGTACTTCAGGCTCTATCTGAAACCTTCACAAGATGTTCATAGAGTTTTAGTGGCTAGAATAATTCATGCTGTTACCAAAGGATGCTGTTCTCAGACTGACGGATTAAATTCCAAATTTTTGGACTTTTTTTCCAAGGCTATTCAGTGTGCGAGACAAGAAAAGAGCTCTTCAGGTCTAAATCATATCTTAGCAGCTCTTACTATCTTCCTCAAGACTTTGGCTGTCAACTTTCGAATTCGAGTGTGTGAATTAGGAGATGAAATTCTTCCCACTTTGCTTTATATTTGGACTCAACATAGGCTTAATGATTCTTTAAAAGAAGTCATTATTGAATTATTTCAACTGCAAATTTATATCCATCATCCGAAAGGAGCCAAAACCCAAGAAAAAGGTGCTTATGAATCAACAAAATGGAGAAGTATTTTATACAACTTATATGATCTGCTAGTGAATGAGATAAGTCATATAGGAAGTAGAGGAAAGTATTCTTCAGGATTTCGTAATATTGCCGTCAAAGAAAATTTGATTGAATTGATGGCAGATATCTGTCACCAGGTTTTTAATGAAGATACCAGATCCTTGGAGATTTCTCAATCTTACACTACTACACAAAGAGAATCTAGTGATTACAGTGTCCCTTGCAAAAGGAAGAAAATAGAACTAGGCTGGGAAGTAATAAAAGATCACCTTCAGAAGTCACAGAATGATTTTGATCTTGTGCCTTGGCTACAGATTGCAACCCAATTAATATCAAAGTATCCTGCAAGTTTACCTAACTGTGAGCTGTCTCCATTACTGATGATACTATCTCAGCTTCTACCCCAACAGCGACATGGGGAACGTACACCATATGTGTTACGATGCCTTACGGAAGTTGCATTGTGTCAAGACAAGAGGTCAAACCTAGAAAGCTCACAAAAGTCAGATTTATTAAAACTCTGGAATAAAATTTGGTGTATTACCTTTCGTGGTATAAGTTCTGAGCAAATACAAGCTGAAAACTTTGGCTTACTTGGAGCCATAATTCAGGGTAGTTTAGTTGAGGTTGACAGAGAATTCTGGAAGTTATTTACTGGGTCAGCCTGCAGACCTTCATGTCCTGCAGTATGCTGTTTGACTTTGGCACTGACCACCAGTATAGTTCCAGGAACGGTAAAAATGGGAATAGAGCAAAATATGTGTGAAGTAAATAGAAGCTTTTCTTTAAAGGAATCAATAATGAAATGGCTCTTATTCTATCAGTTAGAGGGTGACTTAGAAAATAGCACAGAAGTGCCTCCAATTCTTCACAGTAATTTTCCTCATCTTGTACTGGAGAAAATTCTTGTGAGTCTCACTATGAAAAACTGTAAAGCTGCAATGAATTTTTTCCAAAGCGTGCCAGAATGTGAACACCACCAAAAAGATAAAGAAGAACTTTCATTCTCAGAAGTAGAAGAACTATTTCTTCAGACAACTTTTGACAAGATGGACTTTTTAACCATTGTGAGAGAATGTGGTATAGAAAAGCACCAGTCCAGTATTGGCTTCTCTGTCCACCAGAATCTCAAGGAATCACTGGATCGCTGTCTTCTGGGATTATCAGAACAGCTTCTGAATAATTACTCATCTGAGATTACAAATTCAGAAACTCTTGTCCGGTGTTCACGTCTTTTGGTGGGTGTCCTTGGCTGCTACTGTTACATGGGTGTAATAGCTGAAGAGGAAGCATATAAGTCAGAATTATTCCAGAAAGCCAAGTCTCTAATGCAATGTGCAGGAGAAAGTATCACTCTGTTTAAAAATAAGACAAATGAGGAATTCAGAATTGGTTCCTTGAGAAATATGATGCAGCTATGTACACGTTGCTTGAGCAACTGTACCAAGAAGAGTCCAAATAAGATTGCATCTGGCTTTTTCCTGCGATTGTTAACATCAAAGCTAATGAATGACATTGCAGATATTTGTAAAAGTTTAGCATCCTTCATCAAAAAGCCATTTGACCGTGGAGAAGTAGAATCAATGGAAGATGATACTAATGGAAATCTAATGGAGGTGGAGGATCAGTCATCCATGAATCTATTTAACGATTACCCTGATAGTAGTGTTAGTGATGCAAACGAACCTGGAGAGAGCCAAAGTACCATAGGTGCCATTAATCCTTTAGCTGAAGAATATCTGTCAAAGCAAGATCTACTTTTCTTAGACATGCTCAAGTTCTTGTGTTTGTGTGTAACTACTGCTCAGACCAATACTGTGTCCTTTAGGGCAGCTGATATTCGGAGGAAATTGTTAATGTTAATTGATTCTAGCACGCTAGAACCTACCAAATCCCTCCACCTGCATATGTATCTAATGCTTTTAAAGGAGCTTCCTGGAGAAGAGTACCCCTTGCCAATGGAAGATGTTCTTGAACTTCTGAAACCACTATCCAATGTGTGTTCTTTGTATCGTCGTGACCAAGATGTTTGTAAAACTATTTTAAACCATGTCCTTCATGTAGTGAAAAACCTAGGTCAAAGCAATATGGACTCTGAGAACACAAGGGATGCTCAAGGACAGTTTCTTACAGTAATTGGAGCATTTTGGCATCTAACAAAGGAGAGGAAATATATATTCTCTGTAAGAATGGCCCTAGTAAATTGCCTTAAAACTTTGCTTGAGGCTGATCCTTATTCAAAATGGGCCATTCTTAATGTAATGGGAAAAGACTTTCCTGTAAATGAAGTATTTACACAATTTCTTGCTGACAATCATCACCAAGTTCGCATGTTGGCTGCAGAGTCAATCAATAGATTGTTCCAGGACACGAAGGGAGATTCTTCCAGGTTACTGAAAGCACTTCCTTTGAAGCTTCAGCAAACAGCTTTTGAAAATGCATACTTGAAAGCTCAGGAAGGAATGAGAGAAATGTCCCATAGTGCTGAGAACCCTGAAACTTTGGATGAAATTTATAATAGAAAATCTGTTTTACTGACGTTGATAGCTGTGGTTTTATCCTGTAGCCCTATCTGCGAAAAACAGGCTTTGTTTGCCCTGTGTAAATCTGTGAAAGAGAATGGATTAGAACCTCACCTTGTGAAAAAGGTTTTAGAGAAAGTTTCTGAAACTTTTGGATATAGACGTTTAGAAGACTTTATGGCATCTCATTTAGATTATCTGGTTTTGGAATGGCTAAATCTTCAAGATACTGAATACAACTTATCTTCTTTTCCTTTTATTTTATTAAACTACACAAATATTGAGGATTTCTATAGATCTTGTTATAAGGTTTTGATTCCACATCTGGTGATTAGAAGTCATTTTGATGAGGTGAAGTCCATTGCTAATCAGATTCAAGAGGACTGGAAAAGTCTTCTAACAGACTGCTTTCCAAAGATTCTTGTAAATATTCTTCCTTATTTTGCCTATGAGGGTACCAGAGACAGTGGGATGGCACAGCAAAGAGAGACTGCTACCAAGGTCTATGATATGCTTAAAAGTGAAAACTTATTGGGAAAACAGATTGATCACTTATTCATTAGTAATTTACCAGAGATTGTGGTGGAGTTATTGATGACGTTACATGAGCCAGCAAATTCTAGTGCCAGTCAGAGCACTGACCTCTGTGACTTTTCAGGGGATTTGGATCCTGCTCCTAATCCACCTCATTTTCCATCGCATGTGATTAAAGCAACATTTGCCTATATCAGCAATTGTCATAAAACCAAGTTAAAAAGCATTTTAGAAATTCTTTCCAAAAGCCCTGATTCCTATCAGAAAATTCTTCTTGCCATATGTGAGCAAGCAGCTGAAACAAATAATGTTTATAAGAAGCACAGAATTCTTAAAATATATCACCTGTTTGTTAGTTTATTACTGAAAGATATAAAAAGTGGCTTAGGAGGAGCTTGGGCCTTTGTTCTTCGAGACGTTATTTATACTTTGATTCACTATATCAACCAAAGGCCTTCTTGTATCATGGATGTGTCATTACGTAGCTTCTCCCTTTGTTGTGACTTATTAAGTCAGGTTTGCCAGACAGCCGTGACTTACTGTAAGGATGCTCTAGAAAACCATCTTCATGTTATTGTTGGTACACTTATACCCCTTGTGTATGAGCAGGTGGAGGTTCAGAAACAGGTATTGGACTTGTTGAAATACTTAGTGATAGATAACAAGGATAATGAAAACCTCTATATCACGATTAAGCTTTTAGATCCTTTTCCTGACCATGTTGTTTTTAAGGATTTGCGTATTACTCAGCAAAAAATCAAATACAGTAGAGGACCCTTTTCACTCTTGGAGGAAATTAACCATTTTCTCTCAGTAAGTGTTTATGATGCACTTCCATTGACAAGACTTGAAGGACTAAAGGATCTTCGAAGACAACTGGAACTACATAAAGATCAGATGGTGGACATTATGAGAGCTTCTCAGGATAATCCGCAAGATGGGATTATGGTGAAACTAGTTGTCAATTTGTTGCAGTTATCCAAGATGGCAATAAACCACACTGGTGAAAAAGAAGTTCTAGAGGCTGTTGGAAGCTGCTTGGGAGAAGTGGGTCCTATAGATTTCTCTACCATAGCTATACAACATAGTAAAGATGCATCTTATACCAAGGCCCTTAAGTTATTTGAAGATAAAGAACTTCAGTGGACCTTCATAATGCTGACCTACCTGAATAACACACTGGTAGAAGATTGTGTCAAAGTTCGATCAGCAGCTGTTACCTGTTTGAAAAACATTTTAGCCACAAAGACTGGACATAGTTTCTGGGAGATTTATAAGATGACAACAGATCCAATGCTGGCCTATCTACAGCCTTTTAGAACATCAAGAAAAAAGTTTTTAGAAGTACCCAGATTTGACAAAGAAAACCCTTTTGAAGGCCTGGATGATATAAATCTGTGGATTCCTCTAAGTGAAAATCATGACATTTGGATAAAGACACTGACTTGTGCTTTTTTGGACAGTGGAGGCACAAAATGTGAAATTCTTCAATTATTAAAGCCAATGTGTGAAGTGAAAACTGACTTTTGTCAGACTGTACTTCCATACTTGATTCATGATATTTTACTCCAAGATACAAATGAATCATGGAGAAATCTGCTTTCTACACATGTTCAGGGATTTTTCACCAGCTGTCTTCGACACTTCTCGCAAACGAGCCGATCCACAACCCCTGCAAACTTGGATTCAGAGTCAGAGCACTTTTTCCGATGCTGTTTGGATAAAAAATCACAAAGAACAATGCTTGCTGTTGTGGACTACATGAGAAGACAAAAGAGACCTTCTTCAGGAACAATTTTTAATGATGCTTTCTGGCTGGATTTAAATTATCTAGAAGTTGCCAAGGTAGCTCAGTCTTGTGCTGCTCACTTTACAGCTTTACTCTATGCAGAAATCTATGCAGATAAGAAAAGTATGGATGATCAAGAGAAAAGAAGTCTTGCATTTGAAGAAGGAAGCCAGAATACAACTATTTCTAGCTTGAGTGAAAAAAGTAAAGAAGAAACTGGAATAAGTTTACAGGATCTTCTCTTAGAAATCTACAGAAGTATAGGGGAGCCAGATAGTTTGTATGGCTGTGGTGGAGGGAAGATGTTACAACCCATTACTAGACTACGAACATATGAACACGAAGCAATGTGGGGCAAAGCCCTAGTAACATATGACCTCGAAACAGCAATCCCCTCATCAACACGCCAGGCAGGAATCATTCAGGCCTTGCAGAATTTGGGACTCTGCCATATTCTTTCCGTCTATTTAAAAGGATTGGATTATGAAAATAAAGACTGGTGTCCTGAACTAGAAGAACTTCATTACCAAGCAGCATGGAGGAATATGCAGTGGGACCATTGCACTTCCGTCAGCAAAGAAGTAGAAGGAACCAGTTACCATGAATCATTGTACAATGCTCTACAATCTCTAAGAGACAGAGAATTCTCTACATTTTATGAAAGTCTCAAATATGCCAGAGTAAAAGAAGTGGAAGAGATGTGTAAGCGCAGCCTTGAGTCTGTGTATTCGCTCTATCCCACACTTAGCAGGTTGCAGGCCATTGGAGAGCTGGAAAGCATTGGGGAGCTTTTCTCAAGATCAGTCACACATAGACAACTCTCTGAAGTATATATTAAGTGGCAGAAACACTCCCAGCTTCTCAAGGACAGTGATTTTAGTTTTCAGGAGCCTATCATGGCTCTACGCACAGTCATTTTGGAGATCCTGATGGAAAAGGAAATGGACAACTCACAAAGAGAATGTATTAAGGACATTCTCACCAAACACCTTGTAGAACTCTCTATACTGGCCAGAACTTTCAAGAACACTCAGCTCCCTGAAAGGGCAATATTTCAAATTAAACAGTACAATTCAGTTAGCTGTGGAGTCTCTGAGTGGCAGCTGGAAGAAGCACAAGTATTCTGGGCAAAAAAGGAGCAGAGTCTTGCCCTGAGTATTCTCAAGCAAATGATCAAGAAGTTGGATGCCAGCTGTGCAGCGAACAATCCCAGCCTAAAACTTACATACACAGAATGTCTGAGGGTTTGTGGCAACTGGTTAGCAGAAACGTGCTTAGAAAATCCTGCGGTCATCATGCAGACCTATCTAGAAAAGGCAGTAGAAGTTGCTGGAAATTATGATGGAGAAAGTAGTGATGAGCTAAGAAATGGAAAAATGAAGGCATTTCTCTCATTAGCCCGGTTTTCAGATACTCAATACCAAAGAATTGAAAACTACATGAAATCATCGGAATTTGAAAACAAGCAAGCTCTCCTGAAAAGAGCCAAAGAGGAAGTAGGTCTCCTTAGGGAACATAAAATTCAGACAAACAGATACACAGTAAAGGTTCAGCGAGAGCTGGAGTTGGATGAATTAGCCCTGCGTGCACTGAAAGAGGATCGTAAACGCTTCTTATGTAAAGCAGTTGAAAATTATATCAACTGCTTATTAAGTGGAGAAGAACATGATATGTGGGTATTCCGACTTTGTTCCCTCTGGCTTGAAAATTCTGGAGTTTCTGAAGTCAATGGCATGATGAAGAGAGACGGAATGAAGATTCCAACATATAAATTTTTGCCTCTTATGTACCAATTGGCTGCTAGAATGGGGACCAAGATGATGGGAGGCCTAGGATTTCATGAAGTCCTCAATAATCTAATCTCTAGAATTTCAATGGATCACCCCCATCACACTTTGTTTATTATACTGGCCTTAGCAAATGCAAACAGAGATGAATTTCTGACTAAACCAGAGGTAGCCAGAAGAAGCAGAATAACTAAAAATGTGCCTAAACAAAGCTCTCAGCTTGATGAGGATCGAACAGAGGCTGCAAATAGAATAATATGTACTATCAGAAGTAGGAGACCTCAGATGGTCAGAAGTGTTGAGGCACTTTGTGATGCTTATATTATATTAGCAAACTTAGATGCCACTCAGTGGAAGACTCAGAGAAAAGGCATAAATATTCCAGCAGACCAGCCAATTACTAAACTTAAGAATTTAGAAGATGTTGTTGTCCCTACTATGGAAATTAAGGTGGACCACACAGGAGAATATGGAAATCTGGTGACTATACAGTCATTTAAAGCAGAATTTCGCTTAGCAGGAGGTGTAAATTTACCAAAAATAATAGATTGTGTAGGTTCCGATGGCAAGGAGAGGAGACAGCTTGTTAAGGGCCGTGATGACCTGAGACAAGATGCTGTCATGCAACAGGTCTTCCAGATGTGTAATACATTACTGCAGAGAAACACGGAAACTAGGAAGAGGAAATTAACTATCTGTACTTATAAGGTGGTTCCCCTCTCTCAGCGAAGTGGTGTTCTTGAATGGTGCACAGGAACTGTCCCCATTGGTGAATTTCTTGTTAACAATGAAGATGGTGCTCATAAAAGATACAGGCCAAATGATTTCAGTGCCTTTCAGTGCCAAAAGAAAATGATGGAGGTGCAAAAAAAGTCTTTTGAAGAGAAATATGAAGTCTTCATGGATGTTTGCCAAAATTTTCAACCAGTTTTCCGTTACTTCTGCATGGAAAAATTCTTGGATCCAGCTATTTGGTTTGAGAAGCGATTGGCTTATACGCGCAGTGTAGCTACTTCTTCTATTGTTGGTTACATACTTGGACTTGGTGATAGACATGTACAGAATATCTTGATAAATGAGCAGTCAGCAGAACTTGTACATATAGATCTAGGTGTTGCTTTTGAACAGGGCAAAATCCTTCCTACTCCTGAGACAGTTCCTTTTAGACTCACCAGAGATATTGTGGATGGCATGGGCATTACGGGTGTTGAAGGTGTCTTCAGAAGATGCTGTGAGAAAACCATGGAAGTGATGAGAAACTCTCAGGAAACTCTGTTAACCATTGTAGAGGTCCTTCTATATGATCCACTCTTTGACTGGACCATGAATCCTTTGAAAGCTTTGTATTTACAGCAGAGGCCGGAAGATGAAACTGAGCTTCACCCTACTCTGAATGCAGATGACCAAGAATGCAAACGAAATCTCAGTGATATTGACCAGAGTTTCAACAAAGTAGCTGAACGTGTCTTAATGAGACTACAAGAGAAACTGAAAGGAGTGGAAGAAGGCACTGTGCTCAGTGTTGGTGGACAAGTGAATTTGCTCATACAGCAGGCCATAGACCCCAAAAATCTCAGCCGACTTTTCCCAGGATGGAAAGCTTGGGTGTGA(SEQ ID NO:1),
其编码的蛋白质具有如下所示的氨基酸序列:
MSLVLNDLLICCRQLEHDRATERKKEVEKFKRLIRDPETIKHLDRHSDSKQGKYLNWDAVFRFLQKYIQKETECLRIAKPNVSASTQASRQKKMQEISSLVKYFIKCANRRAPRLKCQELLNYIMDTVKDSSNGAIYGADCSNILLKDILSVRKYWCEISQQQWLELFSVYFRLYLKPSQDVHRVLVARIIHAVTKGCCSQTDGLNSKFLDFFSKAIQCARQEKSSSGLNHILAALTIFLKTLAVNFRIRVCELGDEILPTLLYIWTQHRLNDSLKEVIIELFQLQIYIHHPKGAKTQEKGAYESTKWRSILYNLYDLLVNEISHIGSRGKYSSGFRNIAVKENLIELMADICHQVFNEDTRSLEISQSYTTTQRESSDYSVPCKRKKIELGWEVIKDHLQKSQNDFDLVPWLQIATQLISKYPASLPNCELSPLLMILSQLLPQQRHGERTPYVLRCLTEVALCQDKRSNLESSQKSDLLKLWNKIWCITFRGISSEQIQAENFGLLGAIIQGSLVEVDREFWKLFTGSACRPSCPAVCCLTLALTTSIVPGTVKMGIEQNMCEVNRSFSLKESIMKWLLFYQLEGDLENSTEVPPILHSNFPHLVLEKILVSLTMKNCKAAMNFFQSVPECEHHQKDKEELSFSEVEELFLQTTFDKMDFLTIVRECGIEKHQSSIGFSVHQNLKESLDRCLLGLSEQLLNNYSSEITNSETLVRCSRLLVGVLGCYCYMGVIAEEEAYKSELFQKAKSLMQCAGESITLFKNKTNEEFRIGSLRNMMQLCTRCLSNCTKKSPNKIASGFFLRLLTSKLMNDIADICKSLASFIKKPFDRGEVESMEDDTNGNLMEVEDQSSMNLFNDYPDSSVSDANEPGESQSTIGAINPLAEEYLSKQDLLFLDMLKFLCLCVTTAQTNTVSFRAADIRRKLLMLIDSSTLEPTKSLHLHMYLMLLKELPGEEYPLPMEDVLELLKPLSNVCSLYRRDQDVCKTILNHVLHVVKNLGQSNMDSENTRDAQGQFLTVIGAFWHLTKERKYIFSVRMALVNCLKTLLEADPYSKWAILNVMGKDFPVNEVFTQFLADNHHQVRMLAAESINRLFQDTKGDSSRLLKALPLKLQQTAFENAYLKAQEGMREMSHSAENPETLDEIYNRKSVLLTLIAVVLSCSPICEKQALFALCKSVKENGLEPHLVKKVLEKVSETFGYRRLEDFMASHLDYLVLEWLNLQDTEYNLSSFPFILLNYTNIEDFYRSCYKVLIPHLVIRSHFDEVKSIANQIQEDWKSLLTDCFPKILVNILPYFAYEGTRDSGMAQQRETATKVYDMLKSENLLGKQIDHLFISNLPEIVVELLMTLHEPANSSASQSTDLCDFSGDLDPAPNPPHFPSHVIKATFAYISNCHKTKLKSILEILSKSPDSYQKILLAICEQAAETNNVYKKHRILKIYHLFVSLLLKDIKSGLGGAWAFVLRDVIYTLIHYINQRPSCIMDVSLRSFSLCCDLLSQVCQTAVTYCKDALENHLHVIVGTLIPLVYEQVEVQKQVLDLLKYLVIDNKDNENLYITIKLLDPFPDHVVFKDLRITQQKIKYSRGPFSLLEEINHFLSVSVYDALPLTRLEGLKDLRRQLELHKDQMVDIMRASQDNPQDGIMVKLVVNLLQLSKMAINHTGEKEVLEAVGSCLGEVGPIDFSTIAIQHSKDASYTKALKLFEDKELQWTFIMLTYLNNTLVEDCVKVRSAAVTCLKNILATKTGHSFWEIYKMTTDPMLAYLQPFRTSRKKFLEVPRFDKENPFEGLDDINLWIPLSENHDIWIKTLTCAFLDSGGTKCEILQLLKPMCEVKTDFCQTVLPYLIHDILLQDTNESWRNLLSTHVQGFFTSCLRHFSQTSRSTTPANLDSESEHFFRCCLDKKSQRTMLAVVDYMRRQKRPSSGTIFNDAFWLDLNYLEVAKVAQSCAAHFTALLYAEIYADKKSMDDQEKRSLAFEEGSQNTTISSLSEKSKEETGISLQDLLLEIYRSIGEPDSLYGCGGGKMLQPITRLRTYEHEAMWGKALVTYDLETAIPSSTRQAGIIQALQNLGLCHILSVYLKGLDYENKDWCPELEELHYQAAWRNMQWDHCTSVSKEVEGTSYHESLYNALQSLRDREFSTFYESLKYARVKEVEEMCKRSLESVYSLYPTLSRLQAIGELESIGELFSRSVTHRQLSEVYIKWQKHSQLLKDSDFSFQEPIMALRTVILEILMEKEMDNSQRECIKDILTKHLVELSILARTFKNTQLPERAIFQIKQYNSVSCGVSEWQLEEAQVFWAKKEQSLALSILKQMIKKLDASCAANNPSLKLTYTECLRVCGNWLAETCLENPAVIMQTYLEKAVEVAGNYDGESSDELRNGKMKAFLSLARFSDTQYQRIENYMKSSEFENKQALLKRAKEEVGLLREHKIQTNRYTVKVQRELELDELALRALKEDRKRFLCKAVENYINCLLSGEEHDMWVFRLCSLWLENSGVSEVNGMMKRDGMKIPTYKFLPLMYQLAARMGTKMMGGLGFHEVLNNLISRISMDHPHHTLFIILALANANRDEFLTKPEVARRSRITKNVPKQSSQLDEDRTEAANRIICTIRSRRPQMVRSVEALCDAYIILANLDATQWKTQRKGINIPADQPITKLKNLEDVVVPTMEIKVDHTGEYGNLVTIQSFKAEFRLAGGVNLPKIIDCVGSDGKERRQLVKGRDDLRQDAVMQQVFQMCNTLLQRNTETRKRKLTICTYKVVPLSQRSGVLEWCTGTVPIGEFLVNNEDGAHKRYRPNDFSAFQCQKKMMEVQKKSFEEKYEVFMDVCQNFQPVFRYFCMEKFLDPAIWFEKRLAYTRSVATSSIVGYILGLGDRHVQNILINEQSAELVHIDLGVAFEQGKILPTPETVPFRLTRDIVDGMGITGVEGVFRRCCEKTMEVMRNSQETLLTIVEVLLYDPLFDWTMNPLKALYLQQRPEDETELHPTLNADDQECKRNLSDIDQSFNKVAERVLMRLQEKLKGVEEGTVLSVGGQVNLLIQQAIDPKNLSRLFPGWKAWV(SEQ ID NO:2)。
与野生型ATM基因的cDNA相比,c.477_481del ATCTC突变基因的cDNA具有如下所示的核苷酸序列:
ATGAGTCTAGTACTTAATGATCTGCTTATCTGCTGCCGTCAACTAGAACATGATAGAGCTACAGAACGAAAGAAAGAAGTTGAGAAATTTAAGCGCCTGATTCGAGATCCTGAAACAATTAAACATCTAGATCGGCATTCAGATTCCAAACAAGGAAAATATTTGAATTGGGATGCTGTTTTTAGATTTTTACAGAAATATATTCAGAAAGAAACAGAATGTCTGAGAATAGCAAAACCAAATGTATCAGCCTCAACACAAGCCTCCAGGCAGAAAAAGATGCAGGAAATCAGTAGTTTGGTCAAATACTTCATCAAATGTGCAAACAGAAGAGCACCTAGGCTAAAATGTCAAGAACTCTTAAATTATATCATGGATACAGTGAAAGATTCATCTAATGGTGCTATTTACGGAGCTGATTGTAGCAACATACTACTCAAAGACATTCTTTCTGTGAGAAAATACTGGTGTGAAATAGCAACAGTGGTTAGAATTGTTCTCTGTGTACTTCAGGCTCTATCTGAAACCTTCACAAGATGTTCA(SEQ ID NO:3),
上述c.477_481del ATCTC突变基因编码的氨基酸序列如下:
MSLVLNDLLICCRQLEHDRATERKKEVEKFKRLIRDPETIKHLDRHSDSKQGKYLNWDAVFRFLQKYIQKETECLRIAKPNVSASTQASRQKKMQEISSLVKYFIKCANRRAPRLKCQELLNYIMDTVKDSSNGAIYGADCSNILLKDILSVRKYWCEIATVVRIVLCVLQALSETFTRCS(SEQ ID NO:4)。
与野生型ATM基因的cDNA相比,c.4777-2A>T的突变基因的cDNA具有如下所示的核苷酸序列:
ATGAGTCTAGTACTTAATGATCTGCTTATCTGCTGCCGTCAACTAGAACATGATAGAGCTACAGAACGAAAGAAAGAAGTTGAGAAATTTAAGCGCCTGATTCGAGATCCTGAAACAATTAAACATCTAGATCGGCATTCAGATTCCAAACAAGGAAAATATTTGAATTGGGATGCTGTTTTTAGATTTTTACAGAAATATATTCAGAAAGAAACAGAATGTCTGAGAATAGCAAAACCAAATGTATCAGCCTCAACACAAGCCTCCAGGCAGAAAAAGATGCAGGAAATCAGTAGTTTGGTCAAATACTTCATCAAATGTGCAAACAGAAGAGCACCTAGGCTAAAATGTCAAGAACTCTTAAATTATATCATGGATACAGTGAAAGATTCATCTAATGGTGCTATTTACGGAGCTGATTGTAGCAACATACTACTCAAAGACATTCTTTCTGTGAGAAAATACTGGTGTGAAATATCTCAGCAACAGTGGTTAGAATTGTTCTCTGTGTACTTCAGGCTCTATCTGAAACCTTCACAAGATGTTCATAGAGTTTTAGTGGCTAGAATAATTCATGCTGTTACCAAAGGATGCTGTTCTCAGACTGACGGATTAAATTCCAAATTTTTGGACTTTTTTTCCAAGGCTATTCAGTGTGCGAGACAAGAAAAGAGCTCTTCAGGTCTAAATCATATCTTAGCAGCTCTTACTATCTTCCTCAAGACTTTGGCTGTCAACTTTCGAATTCGAGTGTGTGAATTAGGAGATGAAATTCTTCCCACTTTGCTTTATATTTGGACTCAACATAGGCTTAATGATTCTTTAAAAGAAGTCATTATTGAATTATTTCAACTGCAAATTTATATCCATCATCCGAAAGGAGCCAAAACCCAAGAAAAAGGTGCTTATGAATCAACAAAATGGAGAAGTATTTTATACAACTTATATGATCTGCTAGTGAATGAGATAAGTCATATAGGAAGTAGAGGAAAGTATTCTTCAGGATTTCGTAATATTGCCGTCAAAGAAAATTTGATTGAATTGATGGCAGATATCTGTCACCAGGTTTTTAATGAAGATACCAGATCCTTGGAGATTTCTCAATCTTACACTACTACACAAAGAGAATCTAGTGATTACAGTGTCCCTTGCAAAAGGAAGAAAATAGAACTAGGCTGGGAAGTAATAAAAGATCACCTTCAGAAGTCACAGAATGATTTTGATCTTGTGCCTTGGCTACAGATTGCAACCCAATTAATATCAAAGTATCCTGCAAGTTTACCTAACTGTGAGCTGTCTCCATTACTGATGATACTATCTCAGCTTCTACCCCAACAGCGACATGGGGAACGTACACCATATGTGTTACGATGCCTTACGGAAGTTGCATTGTGTCAAGACAAGAGGTCAAACCTAGAAAGCTCACAAAAGTCAGATTTATTAAAACTCTGGAATAAAATTTGGTGTATTACCTTTCGTGGTATAAGTTCTGAGCAAATACAAGCTGAAAACTTTGGCTTACTTGGAGCCATAATTCAGGGTAGTTTAGTTGAGGTTGACAGAGAATTCTGGAAGTTATTTACTGGGTCAGCCTGCAGACCTTCATGTCCTGCAGTATGCTGTTTGACTTTGGCACTGACCACCAGTATAGTTCCAGGAACGGTAAAAATGGGAATAGAGCAAAATATGTGTGAAGTAAATAGAAGCTTTTCTTTAAAGGAATCAATAATGAAATGGCTCTTATTCTATCAGTTAGAGGGTGACTTAGAAAATAGCACAGAAGTGCCTCCAATTCTTCACAGTAATTTTCCTCATCTTGTACTGGAGAAAATTCTTGTGAGTCTCACTATGAAAAACTGTAAAGCTGCAATGAATTTTTTCCAAAGCGTGCCAGAATGTGAACACCACCAAAAAGATAAAGAAGAACTTTCATTCTCAGAAGTAGAAGAACTATTTCTTCAGACAACTTTTGACAAGATGGACTTTTTAACCATTGTGAGAGAATGTGGTATAGAAAAGCACCAGTCCAGTATTGGCTTCTCTGTCCACCAGAATCTCAAGGAATCACTGGATCGCTGTCTTCTGGGATTATCAGAACAGCTTCTGAATAATTACTCATCTGAGATTACAAATTCAGAAACTCTTGTCCGGTGTTCACGTCTTTTGGTGGGTGTCCTTGGCTGCTACTGTTACATGGGTGTAATAGCTGAAGAGGAAGCATATAAGTCAGAATTATTCCAGAAAGCCAAGTCTCTAATGCAATGTGCAGGAGAAAGTATCACTCTGTTTAAAAATAAGACAAATGAGGAATTCAGAATTGGTTCCTTGAGAAATATGATGCAGCTATGTACACGTTGCTTGAGCAACTGTACCAAGAAGAGTCCAAATAAGATTGCATCTGGCTTTTTCCTGCGATTGTTAACATCAAAGCTAATGAATGACATTGCAGATATTTGTAAAAGTTTAGCATCCTTCATCAAAAAGCCATTTGACCGTGGAGAAGTAGAATCAATGGAAGATGATACTAATGGAAATCTAATGGAGGTGGAGGATCAGTCATCCATGAATCTATTTAACGATTACCCTGATAGTAGTGTTAGTGATGCAAACGAACCTGGAGAGAGCCAAAGTACCATAGGTGCCATTAATCCTTTAGCTGAAGAATATCTGTCAAAGCAAGATCTACTTTTCTTAGACATGCTCAAGTTCTTGTGTTTGTGTGTAACTACTGCTCAGACCAATACTGTGTCCTTTAGGGCAGCTGATATTCGGAGGAAATTGTTAATGTTAATTGATTCTAGCACGCTAGAACCTACCAAATCCCTCCACCTGCATATGTATCTAATGCTTTTAAAGGAGCTTCCTGGAGAAGAGTACCCCTTGCCAATGGAAGATGTTCTTGAACTTCTGAAACCACTATCCAATGTGTGTTCTTTGTATCGTCGTGACCAAGATGTTTGTAAAACTATTTTAAACCATGTCCTTCATGTAGTGAAAAACCTAGGTCAAAGCAATATGGACTCTGAGAACACAAGGGATGCTCAAGGACAGTTTCTTACAGTAATTGGAGCATTTTGGCATCTAACAAAGGAGAGGAAATATATATTCTCTGTAAGAATGGCCCTAGTAAATTGCCTTAAAACTTTGCTTGAGGCTGATCCTTATTCAAAATGGGCCATTCTTAATGTAATGGGAAAAGACTTTCCTGTAAATGAAGTATTTACACAATTTCTTGCTGACAATCATCACCAAGTTCGCATGTTGGCTGCAGAGTCAATCAATAGATTGTTCCAGGACACGAAGGGAGATTCTTCCAGGTTACTGAAAGCACTTCCTTTGAAGCTTCAGCAAACAGCTTTTGAAAATGCATACTTGAAAGCTCAGGAAGGAATGAGAGAAATGTCCCATAGTGCTGAGAACCCTGAAACTTTGGATGAAATTTATAATAGAAAATCTGTTTTACTGACGTTGATAGCTGTGGTTTTATCCTGTAGCCCTATCTGCGAAAAACAGGCTTTGTTTGCCCTGTGTAAATCTGTGAAAGAGAATGGATTAGAACCTCACCTTGTGAAAAAGGTTTTAGAGAAAGTTTCTGAAACTTTTGGATATAGACGTTTAGAAGACTTTATGGCATCTCATTTAGATTATCTGGTTTTGGAATGGCTAAATCTTCAAGATACTGAATACAACTTATCTTCTTTTCCTTTTATTTTATTAAACTACACAAATATTGAGGATTTCTATAGATCTTGTTATAAGGTTTTGATTCCACATCTGGTGATTAGAAGTCATTTTGATGAGGTGAAGTCCATTGCTAATCAGATTCAAGAGGACTGGAAAAGTCTTCTAACAGACTGCTTTCCAAAGATTCTTGTAAATATTCTTCCTTATTTTGCCTATGAGGGTACCAGAGACAGTGGGATGGCACAGCAAAGAGAGACTGCTACCAAGGTCTATGATATGCTTAAAAGTGAAAACTTATTGGGAAAACAGATTGATCACTTATTCATTAGTAATTTACCAGAGATTGTGGTGGAGTTATTGATGACGTTACATGAGCCAGCAAATTCTAGTGCCAGTCAGAGCACTGACCTCTGTGACTTTTCAGGGGATTTGGATCCTGCTCCTAATCCACCTCATTTTCCATCGCATGTGATTAAAGCAACATTTGCCTATATCAGCAATTGTCATAAAACCAAGTTAAAAAGCATTTTAGAAATTCTTTCCAAAAGCCCTGATTCCTATCAGAAAATTCTTCTTGCCATATGTGAGCAAGCAGCTGAAACAAATAATGTTTATAAGAAGCACAGAATTCTTAAAATATATCACCTGTTTGTTAGTTTATTACTGAAAGATATAAAAAGTGGCTTAGGAGGAGCTTGGGCCTTTGTTCTTCGAGACGTTATTTATACTTTGATTCACTATATCAACCAAAGGCCTTCTTGTATCATGGATGTGTCATTACGTAGCTTCTCCCTTTGTTGTGACTTATTAAGTCAGGTTTGCCAGACAGCCGTGACTTACTGTAAGGATGCTCTAGAAAACCATCTTCATGTTATTGTTGGTACACTTATACCCCTTGTGTATGAGCAGGTGGAGGTTCAGAAACAGGTATTGGACTTGTTGAAATACTTAGTGATAGATAACAAGGATAATGAAAACCTCTATATCACGATTAAGCTTTTAGATCCTTTTCCTGACCATGTTGTTTTTAAGGATTTGCGTATTACTCAGCAAAAAATCAAATACAGTAGAGGACCCTTTTCACTCTTGGAGTTCTCAGGA(SEQ ID NO:5),
上述c.4777-2A>T的突变基因编码的氨基酸序列如下:
MSLVLNDLLICCRQLEHDRATERKKEVEKFKRLIRDPETIKHLDRHSDSKQGKYLNWDAVFRFLQKYIQKETECLRIAKPNVSASTQASRQKKMQEISSLVKYFIKCANRRAPRLKCQELLNYIMDTVKDSSNGAIYGADCSNILLKDILSVRKYWCEISQQQWLELFSVYFRLYLKPSQDVHRVLVARIIHAVTKGCCSQTDGLNSKFLDFFSKAIQCARQEKSSSGLNHILAALTIFLKTLAVNFRIRVCELGDEILPTLLYIWTQHRLNDSLKEVIIELFQLQIYIHHPKGAKTQEKGAYESTKWRSILYNLYDLLVNEISHIGSRGKYSSGFRNIAVKENLIELMADICHQVFNEDTRSLEISQSYTTTQRESSDYSVPCKRKKIELGWEVIKDHLQKSQNDFDLVPWLQIATQLISKYPASLPNCELSPLLMILSQLLPQQRHGERTPYVLRCLTEVALCQDKRSNLESSQKSDLLKLWNKIWCITFRGISSEQIQAENFGLLGAIIQGSLVEVDREFWKLFTGSACRPSCPAVCCLTLALTTSIVPGTVKMGIEQNMCEVNRSFSLKESIMKWLLFYQLEGDLENSTEVPPILHSNFPHLVLEKILVSLTMKNCKAAMNFFQSVPECEHHQKDKEELSFSEVEELFLQTTFDKMDFLTIVRECGIEKHQSSIGFSVHQNLKESLDRCLLGLSEQLLNNYSSEITNSETLVRCSRLLVGVLGCYCYMGVIAEEEAYKSELFQKAKSLMQCAGESITLFKNKTNEEFRIGSLRNMMQLCTRCLSNCTKKSPNKIASGFFLRLLTSKLMNDIADICKSLASFIKKPFDRGEVESMEDDTNGNLMEVEDQSSMNLFNDYPDSSVSDANEPGESQSTIGAINPLAEEYLSKQDLLFLDMLKFLCLCVTTAQTNTVSFRAADIRRKLLMLIDSSTLEPTKSLHLHMYLMLLKELPGEEYPLPMEDVLELLKPLSNVCSLYRRDQDVCKTILNHVLHVVKNLGQSNMDSENTRDAQGQFLTVIGAFWHLTKERKYIFSVRMALVNCLKTLLEADPYSKWAILNVMGKDFPVNEVFTQFLADNHHQVRMLAAESINRLFQDTKGDSSRLLKALPLKLQQTAFENAYLKAQEGMREMSHSAENPETLDEIYNRKSVLLTLIAVVLSCSPICEKQALFALCKSVKENGLEPHLVKKVLEKVSETFGYRRLEDFMASHLDYLVLEWLNLQDTEYNLSSFPFILLNYTNIEDFYRSCYKVLIPHLVIRSHFDEVKSIANQIQEDWKSLLTDCFPKILVNILPYFAYEGTRDSGMAQQRETATKVYDMLKSENLLGKQIDHLFISNLPEIVVELLMTLHEPANSSASQSTDLCDFSGDLDPAPNPPHFPSHVIKATFAYISNCHKTKLKSILEILSKSPDSYQKILLAICEQAAETNNVYKKHRILKIYHLFVSLLLKDIKSGLGGAWAFVLRDVIYTLIHYINQRPSCIMDVSLRSFSLCCDLLSQVCQTAVTYCKDALENHLHVIVGTLIPLVYEQVEVQKQVLDLLKYLVIDNKDNENLYITIKLLDPFPDHVVFKDLRITQQKIKYSRGPFSLLEFSG(SEQ ID NO:6)。
发明人发现的ATM基因突变体与SEQ ID NO:1相比,本发明的ATM基因具有一个c.5697C>A、c.4777-2A>T、c.3078G>T、c.7983_7985delTGT、c.477_481delATCTC、exon65del突变的至少一种。
具体地,ATM基因突变体的c.5697C>A突变相对于野生型ATM基因的cDNA的第5697位的C突变为A,由此,导致其编码的蛋白发生p.Cys1899X无义突变。c.4777-2A>T突变后的基因序列为SEQ ID NO:5,p.Glu2950fs突变蛋白的序列为SEQ ID NO:6。c.3078G>T突变相对于野生型ATM基因的cDNA的第3078位的G突变为T,由此,导致其编码的蛋白发生了p.Trp1026Cys错义突变。c.7983_7985delTGT突变相对于野生型ATM基因cDNA的第7983_7985位的TGT缺失了,因而导致其编码的蛋白发生了p.Val2662del的缺失突变。c.477_481delATCTC突变相对于野生型ATM基因的cDNA的第477_481位的ATCTC缺失了,因而发生了移码突变p.Ile159fs,其突变后的基因以及蛋序列为SEQ ID NO:3和SEQ ID NO:4。c.8851_8987del(exon 65del)突变相对于野生型ATM基因的cDNA的第8851-8987位的基因序列:
GTCCTTCTATATGATCCACTCTTTGACTGGACCATGAATCCTTTGAAAGCTTTGTATTTACAGCAGAGGCCGGAAGATGAAACTGAGCTTCACCCTACTCTGAATGCAGATGACCAAGAATGCAAACGAAATCTCAG缺失了,在缺失序列后即是终止密码子TGA,突变的cDNA包含8853个碱基;由此导致编码的蛋白在第2951-2995位的VLLYDPLFDWTMNPLKALYLQQRPEDETELHPTLNADDQECKRNL缺失,并随即翻译终止,突变蛋白包含2950个氨基酸,即p.2951_2995del。
因此根据本发明的具体实施例,共济失调毛细血管扩张症是常染色体隐性遗传病,具有上述至少一种突变为共济失调毛细血管扩张症疾病的携带者或者患者。
根据本发明的一个实施例,通过检测是否具有c.5697C>A、c.4777-2A>T、c.3078G>T、c.7983_7985delTGT、c.477_481delATCTC、c.8851_8987del的至少一种突变,可以准确有效地预测生物体是否患常染色体隐性遗传型共济失调毛细血管扩张症。
根据本发明的第二个方面,本发明提出了一种分离的多肽。根据本发明的实施例,与SEQ ID NO:2相比,该多肽具有选自下列的至少一种突变:p.Cys1899X、p.Glu2950fs、p.Trp1026Cys、p.Val2662del、p.Ile159fs、p.2951_2995del。根据本发明的具体实施例,该多肽是由前述分离的编码ATM基因突变体的核酸编码的。通过检测生物样品中是否表达该多肽,可以有效地检测生物样品是否易感共济失调毛细血管扩张症,也可以通过检测这些多肽在生物体中是否存在,可以有效地预测生物体是否易感共济失调毛细血管扩张症。
筛选易患共济失调毛细血管扩张症的生物样品的方法
根据本发明的第三个方面,本发明提出了一种筛选易患共济失调毛细血管扩张症的生物样品的方法。根据本发明的实施例,该筛选易患共济失调毛细血管扩张症的生物样品的方法可以包括以下步骤:
首先,从所述生物样品提取核酸样本。根据本发明的实施例,生物样品的类型并不受特别限制,只要从该生物样品中能够提取到反映生物样品ATM是否存在突变的核酸样本即可。根据本发明的实施例,生物样品可以为选自人体血液。由此,可以方便地进行取样和检测,从而能够进一步提高筛选易患共济失调毛细血管扩张症的生物样品的效率。根据本发明的实施例,这里所使用的术语“核酸样本”应做广义理解,其可以是任何能够反映生物样品中ATM是否存在突变的样本,例如可以是从生物样品中直接提取的全基因组DNA,也可以是该全基因组中包含ATM编码序列的一部分,可以是从生物样品中提取的总RNA,也可以是从生物样品中提取的mRNA。根据本发明的一个实施例,所述核酸样本为全基因组DNA。由此,可以扩大生物样品的来源范围,并且可以同时对生物样品的多种信息进行确定,从而能够提高筛选易患共济失调毛细血管扩张症的生物样品的效率。另外,根据本发明的实施例,针对采用RNA作为核酸样本,从生物样品提取核酸样本可以进一步包括:从生物样品提取RNA样本,优选RNA样本为mRNA;以及基于所得到的RNA样本,通过反转录反应,获得cDNA样本,所得到的cDNA样本构成核酸样本。由此,可以进一步提高利用RNA作为核酸样本筛选易患共济失调毛细血管扩张症的生物样品的效率。
接下来,在得到核酸样本之后,可以对核酸样本进行分析,从而能够确定所得到核酸样本的核酸序列。根据本发明的实施例,确定所得到核酸样本的核酸序列的方法和设备并不受特别限制。根据本发明的具体实施例,可以通过测序方法,确定核酸样本的核酸序列。根据本发明的实施例,可以用于进行测序的方法和设备并不受特别限制。根据本发明的实施例,可以采用第二代测序技术,也可以采用第三代以及第四代或者更先进的测序技术。根据本发明的具体示例,可以利用选自Hiseq2000、SOLiD、454和单分子测序装置的至少一种对核酸序列进行测序。由此,结合最新的测序技术,针对单个位点可以达到较高的测序深度,检测灵敏度和准确性大大提高,因而能够利用这些测序装置的高通量、深度测序的特点,进一步提高对核酸样本进行检测分析的效率。从而,能够提高后续对测序数据进行分析时的精确性和准确度。由此,根据本发明的实施例,确定核酸样本的核酸序列可以进一步包括:首先,针对所得到的核酸样本,构建核酸测序文库;以及对所得到的核酸测序文库进行测序,以便获得由多个测序数据构成的测序结果。根据本发明的一些实施例,可以采用选自Hiseq2000、SOLiD、454和单分子测序装置的至少一种对所得到的核酸测序文库进行测序。另外,根据本发明的实施例,可以对核酸样本进行筛选,富集ATM外显子,该筛选富集可以在构建测序文库之前,构建测序文库过程中,或者构建测序文库之后进行。根据本发明的一个实施例,针对核酸样本,构建核酸测序文库进一步包括:利用ATM基因外显子特异性引物,对核酸样本进行PCR扩增;以及针对所得到的扩增产物,构建核酸测序文库。由此,可以通过PCR扩增,富集ATM基因外显子,从而能够进一步提高筛选易感共济失调毛细血管扩张症的生物样品的效率。根据本发明的实施例,ATM基因外显子特异性引物的序列不受特别限制,根据本发明的优选实施例,这些ATM基因外显子特异性引物具有如下表所示的核苷酸序列,即SEQ ID NO:7-16所示的核苷酸序列。发明人惊奇地发现,通过采用这些引物,可以在PCR反应体系中显著有效地完成对ATM外显子尤其是c.5697C>A、c.4777-2A>T、c.3078G>T、c.7983_7985delTGT、c.477_481delATCTC和c.8851_8987del(exon 65del)突变所在的外显子序列的扩增。需要说明的是,这些SEQ ID NO:7-16所示的核苷酸序列是本发明的发明人在付出了艰苦的劳动后,意外获得的。
根据本发明的具体实施例,上述的共济失调毛细血管扩张症是常染色体隐性遗传疾病。
关于针对核酸样本,构建测序文库的方法和流程,本领域技术人员可以根据不同的测序技术进行适当选择,关于流程的细节,可以参见测序仪器的厂商例如Illumina公司所提供的规程,例如参见Illumina公司Multiplexing Sample Preparation Guide(Part#1005361;Feb2010)或Paired-End SamplePrep Guide(Part#1005063;Feb 2010),通过参照将其并入本文。根据本发明的实施例,从生物样品提取核酸样本的方法和设备,也不受特别限制,可以采用商品化的核酸提取试剂盒进行。
需要说明的是,在这里所使用的术语“核酸序列”应作广义理解,其可以是在对核酸样本进行测序得到的测序数据进行组装后,得到的完整的核酸序列信息,也可以是直接采用通过对核酸样本进行测序所得到的测序数据(reads)作为核酸序列,只要这些核酸序列中含有对应ATM基因的编码序列即可。
最后,在确定核酸样本的核酸序列之后,将所得到的核酸样本的核酸序列与SEQID NO:1的序列相比对。如果在所得到的核酸序列中具有c.5697C>A、c.4777-2A>T、c.3078G>T、c.7983_7985delTGT、c.477_481delATCTC和c.8851_8987del(exon 65del)突变的至少一种,则指示生物样品易患共济失调毛细血管扩张症。由此,通过根据本发明实施例的筛选易患共济失调毛细血管扩张症的生物样品的方法,可以有效地筛选易患共济失调毛细血管扩张症的生物样品。根据本发明的实施例,对核酸序列与SEQ ID NO:1进行比对的方法和设备并不受特别限制,可以采用任意常规的软件进行操作,根据本发明的具体实例,可以采用SOAP软件进行比对。
需要说明的是,根据本发明实施例的“筛选易患共济失调毛细血管扩张症的生物样品的方法”的用途不受特别限制,例如可以用作非诊断目的的筛选方法。
筛选易患共济失调毛细血管扩张症的生物样品的系统和试剂盒
根据本发明的第四个方面,本发明提出了一种筛选易患共济失调毛细血管扩张症的生物样品的系统。
参考图1,根据本发明的实施例,该筛选易患共济失调毛细血管扩张症的生物样品的系统1000包括核酸提取装置100、核酸序列确定装置200以及判断装置300。
根据本发明的实施例,核酸提取装置100用于从生物样品提取核酸样本。如前所述,根据本发明的实施例,核酸样本的类型并不受特别限制,对于采用RNA作为核酸样本,则核酸提取装置进一步包括RNA提取单元101和反转录单元102,其中,提取单元101用于从生物样品提取RNA样本,反转录单元102与RNA提取单元101相连,用于对RNA样本进行反转录反应,以便获得cDNA样本,所得到的cDNA样本构成核酸样本。
根据本发明的实施例,核酸序列确定装置200与核酸提取装置100相连,用于对核酸样本进行分析,以便确定核酸样本的核酸序列。如前所示,可以采用测序的方法确定核酸样本的核酸序列。由此,根据本发明的一个实施例,所述核酸序列确定装置200可以进一步包括:文库构建单元201以及测序单元202。文库构建单元201用于针对核酸样本,构建核酸测序文库;测序单元202与文库构建单元201相连,用于对核酸测序文库进行测序,以便获得由多个测序数据构成的测序结果。如前所述,可以通过PCR扩增,富集ATM外显子,进一步提高筛选易患共济失调毛细血管扩张症的生物样品的效率。由此,文库构建单元201可以进一步包括PCR扩增模块(图中未示出),在该PCR扩增模块中设置有ATM外显子特异性引物,以便利用ATM外显子特异性引物,对所述核酸样本进行PCR扩增,根据本发明的具体实施例,ATM基因外显子特异性引物具有如SEQ ID NO:7-16所示的核苷酸序列。根据本发明的实施例,测序单元202可以包括选自HISEQ2000、SOLiD、454和单分子测序装置的至少一种。由此,结合最新的测序技术,针对单个位点可以达到较高的测序深度,检测灵敏度和准确性大大提高,因而能够利用这些测序装置的高通量、深度测序的特点,进一步提高对核酸样本进行检测分析的效率。从而,提高后续对测序数据进行分析时的精确性和准确度。
根据本发明的实施例,判断装置300与核酸序列确定装置200相连,适于将核酸样本的核酸序列进行比对,以便基于核酸样本的核酸序列与SEQ ID NO:1的区别判断生物样品是否易患共济失调毛细血管扩张症。具体地,基于核酸样本的核酸序列与SEQ ID NO:1相比,是否具有c.5697C>A、c.4777-2A>T、c.3078G>T、c.7983_7985delTGT、c.477_481delATCTC和c.8851_8987del(exon 65del)的至少一种突变,判断生物样品是否易患共济失调毛细血管扩张症。如前所述,根据本发明的一个实施例,核酸样本的核酸序列与SEQID NO:1相比,具有c.5697C>A、c.4777-2A>T、c.3078G>T、c.7983_7985delTGT、c.477_481delATCTC和c.8851_8987del(exon 65del)的至少一种突变,是生物样品易患共济失调毛细血管扩张症的指示。如前所述,根据本发明的实施例,对核酸序列与SEQ ID NO:1进行比对的设备并不受特别限制,可以采用任意常规的软件进行操作,根据本发明的具体实例,可以采用SOAP软件进行比对。
由此,利用该系统,能够有效地实施前述筛选易患共济失调毛细血管扩张症的生物样品的方法,从而可以有效地筛选易患共济失调毛细血管扩张症的生物样品。
根据本发明的第五方面,本发明提出了一种用于筛选易患共济失调毛细血管扩张症的生物样品的试剂盒。根据本发明的实施例,该用于筛选易患共济失调毛细血管扩张症的生物样品的试剂盒包括:适于检测ATM基因突变体的试剂,其中与SEQ ID NO:1相比,该ATM基因突变体具有c.5697C>A、c.4777-2A>T、c.3078G>T、c.7983_7985delTGT、c.477_481delATCTC和c.8851_8987del(exon 65del)的至少一种突变。利用根据本发明的实施例的试剂盒,能够有效地筛选易患共济失调毛细血管扩张症的生物样品。在本文中,所使用的术语“适于检测ATM基因突变体的试剂”应做广义理解,即可以是检测ATM编码基因的试剂,也可以是检测ATM突变体多肽的试剂,例如可以采用识别特异性位点的抗体。根据本发明的一个实施例,所述试剂为核酸探针,由此,可以高效地筛选易患共济失调毛细血管扩张症的生物样品。根据本发明的具体实施例,上述共济失调毛细血管扩张症为常染色体隐性遗传疾病。
需要说明的是,在本文前面筛选易患共济失调毛细血管扩张症的生物样品的方法部分中所描述的特征和优点,同样适用于筛选易患共济失调毛细血管扩张症的生物样品的系统或者试剂盒,在此不再赘述。
构建体及重组细胞
根据本发明的第六方面,本发明还提出了一种构建体。根据本发明的实施例,该构建体包含前面所述的分离的编码ATM基因突变体的核酸。需要说明的是,“构建体包含前面所述的分离的编码ATM基因突变体的核酸”表示,本发明的构建体包含与SEQ ID NO:1相比具有c.5697C>A、c.4777-2A>T、c.3078G>T、c.7983_7985delTGT、c.477_481delATCTC和c.8851_8987del(exon 65del)至少一种突变。由此,利用本发明的构建体转化受体细胞获得的重组细胞,能够有效地用作共济失调毛细血管扩张症相关研究的模型。其中,所述受体细胞的种类不受特别限制,例如可以为大肠杆菌细胞、哺乳动物细胞,优选该受体细胞来源于哺乳动物。
在本发明中所使用的术语“构建体”是指这样的一种遗传载体,其包含特定核酸序列,并且能够将目的核酸序列转入宿主细胞中,以获得重组细胞。根据本发明的实施例,构建体的形式不受特别限制。根据本发明的实施例,其可以为质粒、噬菌体、人工染色体、粘粒(Cosmid)、病毒的至少一种,优选质粒。质粒作为遗传载体,具有操作简单,可以携带较大片段的性质,便于操作和处理。质粒的形式也不受特别限制,既可以是环形质粒,也可以是线性质粒,即可以是单链的,也可以是双链的。本领域技术人员可以根据需要进行选择。在本发明中所使用的术语“核酸”可以是任何包含脱氧核糖核苷酸或者核糖核苷酸的聚合物,包括但不限于经过修饰的或者未经修饰的DNA、RNA,其长度不受任何特别限制。对于用于构建重组细胞的构建体,优选所述核酸为DNA,因为DNA相对于RNA而言,其更稳定,并且易于操作。
根据本发明的第七方面,本发明还提出了一种重组细胞。根据本发明的实施例,该重组细胞是通过前面所述的构建体转化受体细胞而获得的。从而,本发明的重组细胞能够有效表达构建体所携带的ATM基因突变体。根据本发明的一些实施例,本发明的重组细胞,能够有效地用作共济失调毛细血管扩张症相关研究的模型。根据本发明的实施例,受体细胞的种类不受特别限制,例如可以为大肠杆菌细胞、哺乳动物细胞,优选所述受体细胞来源于非人哺乳动物。
对于本发明中所述核酸,本领域技术人员应当理解,实际包括互补双链的任意一条,或者两条。为了方便,在本发明的实施例中,虽然多数情况下只给出了一条链,但实际上也公开了与之互补的另一条链。例如,提及SEQ ID NO:1,实际包括其互补序列。本领域技术人员还可以理解,利用一条链可以检测另一条链,反之亦然。
下面参考具体实施例,对本发明进行说明,需要说明的是,这些实施例仅仅是说明性的,而不能理解为对本发明的限制。
若未特别指明,实施例中所采用的技术手段为本领域技术人员所熟知的常规手段,可以参照《分子克隆实验指南》第三版或者相关产品进行,所采用的试剂和产品也均为可商业获得的。未详细描述的各种过程和方法是本领域中公知的常规方法,所用试剂的来源、商品名以及有必要列出其组成成分者,均在首次出现时标明,其后所用相同试剂如无特殊说明,均与首次标明的内容相同。
实施例1确定共济失调毛细血管扩张症的致病基因
1、样本收集
发明人收集到了三个两代的共济失调毛细血管扩张症(本文中也可以称将共济失调毛细血管扩张症简称为AT)的患者家系,以及随即收集的该家系外的500名正常人的基因。图2显示了AT家系1、AT家系2以及AT家系3的家系图谱。如图2所示,其中,□表示正常男性,○表示正常女性,■表示男性患者,●表示女性患者。由图2可知,每个家系共有3名成员,其中第二代子女为AT患者,第一代的父母为正常成员。
三个家系中的三名患者均表现为儿童期出现行走不稳症状,同时伴有眼球结膜处血丝,面部和关节多处色素沉着及皮肤结节形成。体检发现体态偏瘦,多处明显色素沉着,尤其在耳后和双肘关节等处可见结痂形成;明显的眼球结膜毛细血管扩张和面颊部毛细血管扩张,并且对头部进行核磁共振检查(MRI)发现小脑萎缩。
1、DNA提取
采用OMEGA Blood DNA Midi Kit全血DNA提取试剂盒从外周血样品中提取DNA,提取步骤如下:
1)取2ml全血样本,加入150ul OB Protease,2.1ml Buffer BL和20ul RNase A,最大速度漩涡1分钟,彻底混匀。
2)65摄氏度水浴15-20分钟,并在水浴过程中漩涡5次。
3)加入2.2ml无水乙醇,最大速度漩涡30秒,彻底混匀。
4)将3.5ml裂解液移入带过滤柱的15ml离心管,4000转离心5分钟,取出过滤柱,倒掉过滤液体,放回过滤柱。
5)将第3步剩余裂解液加入带过滤柱的15ml离心管,4000转离心5分钟,取出过滤柱,倒掉过滤液体,放回过滤柱。
6)加入3ml HB Buffer,洗涤过滤柱,4000转离心5分钟,取出过滤柱,倒掉过滤液体,放回过滤柱。
7)加入3ml DNA Wash Buffer,4000转离心5分钟,取出过滤柱,倒掉过滤液体,放回过滤柱。
8)再次加入3ml DNA Wash Buffer,4000转离心5分钟,取出过滤柱,倒掉过滤液体,放回过滤柱。
9)4000转离心15分钟,甩干过滤柱。
10)将过滤柱移至新的15ml离心管,加入500ul 70摄氏度的Elution Buffer,室温静置5分钟,4000转离心5分钟,收集含有DNA的过滤液。
11)再次将过滤柱移至新的15ml离心管,加入500ul 70摄氏度的Elution Buffer,室温静置5分钟,4000转离心5分钟,收集含有DNA的过滤液。
2、外显子捕获与测序
我们挑选了三名患者进行外显子组测序及数据分析,具体做法如下。
随后,发明人用Agilent SureSelect Human All Exon Kit结合Solexa高通量测序技术对病例AT家系1-II:1、AT家系2-II:1、AT家系3-II:1的外显子组序列进行了测序,具体如下:
1)将基因组DNA随机打断成150-200bp左右的片段,随后在片段两端分别连接上接头制备杂交文库(参见http://www.illumina.com/提供的Illumina/Solexa标准建库说明书)。
2)文库经纯化后经过ligation-mediated PCR(LM-PCR)的线性扩增与SureSelectBiotiny lated RNA Library(BAITS)进行杂交富集,再经过LM-PCR的线性扩增后进行上机测序。测序平台为Illumina Hiseq 2000,读取长度为90bp,每个样本的平均测序深度最少为80×。
3)测序后获得的原始数据由Illumina basecalling Software 1.7进行处理,经过过滤去污染、使用SOAPaligner 2.20比对参考基因组,获得比对到基因组上的Uniquemapped reads。靶区域的基因型由SOAPsnp。
3、结论
在病例AT家系1-II:1中均发现有60711个单核苷酸多态性(SNPs)和3916处的插入/缺失。在病例AT家系2-II:1中发现了62518个单核苷酸多态性(SNPs)和4118处的插入/缺失,病例AT家系3-II:1中发现了58283个单核苷酸多态性(SNPs)和3687处的插入/缺失,将结果与dbSNP数据库(http://hgdownload.cse.ucsc.edu/goldenPath/hg19/database/ snp132.txt.gz),HapMap数据库(ftp://ftp.ncbi.nlm.nih.gov/hapmap),千人基因组数据库(ftp://ftp.1000genomes.ebi.ac.uk/vol1/ftp),炎黄数据库(http:// yh.genomics.org.cn/)等公共数据库进行比较,过滤掉所有已知变异。利用SIFT软件进行SNP功能预测.
分别发现患者AT家系1-II:1在ATM基因存在一个复合杂合的突变c.5697C>A(p.Cys1899X,无义突变)、exon 65del,其中患者父亲携带杂合突变c.5697C>A,患者母亲携带杂合缺失exon 65del。
AT家系2-II:1在ATM基因存在一个复合杂合的突变c.477_481del ATCTC(移码突变)、c.4777-2A>T(剪切位点突变),其中患者父亲携带杂合缺失c.477_481del ATCTC,患者母亲携带杂合突变c.4777-2A>T。
AT家系3-II:1在ATM基因存在一个复合杂合的突变c.3078G>T(p.Trp1026Cys,错义突变)、c.7983_7985delTGT,其中患者父亲携带杂合突变c.3078G>T,患者母亲携带杂合缺失c.7983_7985delTGT。
研究表明,通过家族中疾病共分离和正常人SNP筛查,这三个复合杂合位点突变可引起常染色体隐性遗传性的AT疾病,因此可能是致病位点。
实施例2 Sanger法测序验证共济失调毛细血管扩张症的致病基因
采集3例AT患者及其父母外周血,利用常规酚-氯仿法抽提外周血白细胞中的基因组DNA,利用分光光度计测量DNA的浓度及纯度,所得的每个标本基因组DNA的OD260/OD280均位于1.7-2.0之间,浓度不少于200ng/ul,总量不少于30μg。
分别对3名患者(即图2中的患者II:1)、6名家系内正常人(即该3例患者的父母,均未发病)和500名家系外正常人(即与表中所有患者无血缘关系的对照)基因进行检测,针对ATM基因所有外显子序列设计引物,通过PCR扩增,产物纯化,测序的方法获得ATM有关序列,根据序列测定结果属于突变型还是野生型,验证ATM与AT疾病之间相关性。具体方法步骤如下:
1、DNA提取:
分别采取3名家系内患者、6名家系内正常人和500名家系外正常人外周静脉血按照上述方法提取基因组DNA,分光光度计测DNA含量。
2、引物设计及PCR反应
引物设计参考人类基因组序列数据库hg19/build37.1,具体见下。
引物序列:
b)反应体系:10μL
试剂 | 体积 |
2×GC BufferⅠ | 5.0μl |
dNTPs(10mM) | 0.4μl |
LA Taq酶(5U/μl) | 0.1μl |
Primers(100ng/μl) | 0.2μl/个 |
gDNA模板(50ng/μl) | 1.0μl |
ddH2O Add to | 10.0μl |
c)反应条件:
3)将步骤2中获得的获自3例AT患者、6名家系内正常人和500名家系外正常人PCR扩增产物直接进行DNA测序。
表1 AT家系中的患者情况
在患者家族成员中对ATM基因突变位点所在编码序列及侧翼序列进行突变排查,分别发现患者AT家系1-II:1在ATM基因存在一个复合杂合的突变c.5697C>A(p.Cys1899X,无义突变)、exon 65del,其中患者父亲携带杂合突变c.5697C>A,患者母亲携带杂合缺失exon 65del。
AT家系2-II:1在ATM基因存在一个复合杂合的突变c.477_481del ATCTC(移码突变)、c.4777-2A>T(剪切位点突变),其中患者父亲携带杂合缺失c.477_481del ATCTC,患者母亲携带杂合突变c.4777-2A>T。
AT家系3-II:1在ATM基因存在一个复合杂合的突变c.3078G>T(p.Trp1026Cys,错义突变)、c.7983_7985delTGT,其中患者父亲携带杂合突变c.3078G>T,患者母亲携带杂合缺失c.7983_7985delTGT。(见表2及图3-5)。
同时,我们在500个家族外无关正常对照中排查未检测到的突变位点。
表2 AT家系成员的基因突变情况
本发明的ATM基因中的是否具有c.5697C>A、c.4777-2A>T、c.3078G>T、c.7983_7985delTGT、c.477_481delATCTC和c.8851_8987del(exon 65del)突变的至少一种,可判断此家族中未发病的人群患病的可能性,同时可用于家族后代患病几率的评估与诊断,为患者及家庭提供遗传咨询及产前诊断避免携带该突变的下一代出生。
研究证实ATM基因突变可以引起共济失调毛细血管扩张症。我们的研究表明:ATM基因中的c.5697C>A、c.4777-2A>T、c.3078G>T、c.7983_7985delTGT、c.477_481delATCTC和c.8851_8987del(exon 65del)突变是引起常染色体隐性遗传性AT疾病的突变。
本发明公开了六种已知耳聋基因(ATM)的新突变,证实了其为常染色体隐性遗传性的分子病因,在500例中国正常人群中进行该位点的筛查,均为阴性。本研究丰富了ATM基因突变谱,为开展遗传性AT疾病的分子诊断提供了基因学依据。
实施例3检测试剂盒
制备检测试剂盒,其中含有检测突变的c.5679C>A、c.477_481delATCTC、c.4777-2A>T、c.3078G>T、c.7983_7985delTGT的引物对见下表:
按照实施例1所述的方法提取待测者DNA,以所提取的DNA为模板与上述引物进行PCR反应,按照本领域常规方法对PCR产物纯化,将纯化的产物测序。观察测序所得到的序列是否有c.5679C>A、c.477_481delATCTC、c.4777-2A>T、c.3078G>T、c.7983_7985delTGT突变。
实施例4
分别对AT家系1内1个患者样本和1个患者的母亲(非患者)样本进行QPCR。具体方法步骤如下:
1、DNA提取:
采集患者和其母亲的外周静脉血用QIAamp DNA BloodMiNi Kit(Qiagen,Hilden,Germany)方法提取基因组DNA。
2、引物设计及PCR反应
1)QPCR引物:
2)QPCR实验仪器及条件:ABI 7500荧光定量PCR仪
3)实验用量及试剂:模板为1ul每个做三个平行样。
3、最低循环数值(Ct)比较法计算ATM基因的65号外显子的相对拷贝数应用v vCt法,与内对照ALB基因比较可以得出ATM基因的65号外显子的相对拷贝数(图6)。
在患者家族成员中对ATM基因突变位点所在编码序列及侧翼序列进行突变排查,分别发现患者AT家系1-II:1在ATM基因存在exon 65del,其中患者父亲无该突变,患者母亲携带杂合缺失exon 65del。
实施例5
c.4777-2A>T的验证
分别对AT家系2内1个患者样本和2个非患者样本进行PCR。具体方法步骤如下:
1、RNA提取以CDNA制备:采集患者和其父母亲的外周静脉血,使用QIAGEN公司生产的RNA提取试剂盒提取总RNA,并反转录为cDNA。
2、引物设计及PCR反应
a)AT家系2剪切位点变异位点验证引物
b)反应体系:10μL
试剂 | 体积 |
2×GC BufferⅠ | 5.0μl |
dNTPs(10mM) | 0.4μl |
LA Taq酶(5U/μl) | 0.1μl |
Primers(100ng/μl) | 0.2μl/个 |
gDNA模板(50ng/μl) | 1.0μl |
ddH2O Add to | 10.0μl |
c)反应条件:
3、将步骤1所得的cDNA进行PCR扩增然后直接进行DNA测序,发现患者及母亲的cDNA序列发生了杂合的改变,患者父亲未有改变(图7),由此认为该突变会造成cDNA序列的改变从而影响氨基酸的改变。
在本说明书的描述中,参考术语“一个实施例”、“一些实施例”、“示例”、“具体示例”、或“一些示例”等的描述意指结合该实施例或示例描述的具体特征、结构、材料或者特点包含于本发明的至少一个实施例或示例中。在本说明书中,对上述术语的示意性表述不一定指的是相同的实施例或示例。而且,描述的具体特征、结构、材料或者特点可以在任何的一个或多个实施例或示例中以合适的方式结合。
尽管上面已经示出和描述了本发明的实施例,可以理解的是,上述实施例是示例性的,不能理解为对本发明的限制,本领域的普通技术人员在不脱离本发明的原理和宗旨的情况下在本发明的范围内可以对上述实施例进行变化、修改、替换和变型。
SEQUENCE LISTING
<110> 中南大学湘雅医院
深圳华大基因科技有限公司
<120> ATM基因突变体及其应用
<130> PIDC131299-1
<160> 22
<170> PatentIn version 3.3
<210> 1
<211> 9171
<212> DNA
<213> Homo sapiens
<400> 1
atgagtctag tacttaatga tctgcttatc tgctgccgtc aactagaaca tgatagagct 60
acagaacgaa agaaagaagt tgagaaattt aagcgcctga ttcgagatcc tgaaacaatt 120
aaacatctag atcggcattc agattccaaa caaggaaaat atttgaattg ggatgctgtt 180
tttagatttt tacagaaata tattcagaaa gaaacagaat gtctgagaat agcaaaacca 240
aatgtatcag cctcaacaca agcctccagg cagaaaaaga tgcaggaaat cagtagtttg 300
gtcaaatact tcatcaaatg tgcaaacaga agagcaccta ggctaaaatg tcaagaactc 360
ttaaattata tcatggatac agtgaaagat tcatctaatg gtgctattta cggagctgat 420
tgtagcaaca tactactcaa agacattctt tctgtgagaa aatactggtg tgaaatatct 480
cagcaacagt ggttagaatt gttctctgtg tacttcaggc tctatctgaa accttcacaa 540
gatgttcata gagttttagt ggctagaata attcatgctg ttaccaaagg atgctgttct 600
cagactgacg gattaaattc caaatttttg gacttttttt ccaaggctat tcagtgtgcg 660
agacaagaaa agagctcttc aggtctaaat catatcttag cagctcttac tatcttcctc 720
aagactttgg ctgtcaactt tcgaattcga gtgtgtgaat taggagatga aattcttccc 780
actttgcttt atatttggac tcaacatagg cttaatgatt ctttaaaaga agtcattatt 840
gaattatttc aactgcaaat ttatatccat catccgaaag gagccaaaac ccaagaaaaa 900
ggtgcttatg aatcaacaaa atggagaagt attttataca acttatatga tctgctagtg 960
aatgagataa gtcatatagg aagtagagga aagtattctt caggatttcg taatattgcc 1020
gtcaaagaaa atttgattga attgatggca gatatctgtc accaggtttt taatgaagat 1080
accagatcct tggagatttc tcaatcttac actactacac aaagagaatc tagtgattac 1140
agtgtccctt gcaaaaggaa gaaaatagaa ctaggctggg aagtaataaa agatcacctt 1200
cagaagtcac agaatgattt tgatcttgtg ccttggctac agattgcaac ccaattaata 1260
tcaaagtatc ctgcaagttt acctaactgt gagctgtctc cattactgat gatactatct 1320
cagcttctac cccaacagcg acatggggaa cgtacaccat atgtgttacg atgccttacg 1380
gaagttgcat tgtgtcaaga caagaggtca aacctagaaa gctcacaaaa gtcagattta 1440
ttaaaactct ggaataaaat ttggtgtatt acctttcgtg gtataagttc tgagcaaata 1500
caagctgaaa actttggctt acttggagcc ataattcagg gtagtttagt tgaggttgac 1560
agagaattct ggaagttatt tactgggtca gcctgcagac cttcatgtcc tgcagtatgc 1620
tgtttgactt tggcactgac caccagtata gttccaggaa cggtaaaaat gggaatagag 1680
caaaatatgt gtgaagtaaa tagaagcttt tctttaaagg aatcaataat gaaatggctc 1740
ttattctatc agttagaggg tgacttagaa aatagcacag aagtgcctcc aattcttcac 1800
agtaattttc ctcatcttgt actggagaaa attcttgtga gtctcactat gaaaaactgt 1860
aaagctgcaa tgaatttttt ccaaagcgtg ccagaatgtg aacaccacca aaaagataaa 1920
gaagaacttt cattctcaga agtagaagaa ctatttcttc agacaacttt tgacaagatg 1980
gactttttaa ccattgtgag agaatgtggt atagaaaagc accagtccag tattggcttc 2040
tctgtccacc agaatctcaa ggaatcactg gatcgctgtc ttctgggatt atcagaacag 2100
cttctgaata attactcatc tgagattaca aattcagaaa ctcttgtccg gtgttcacgt 2160
cttttggtgg gtgtccttgg ctgctactgt tacatgggtg taatagctga agaggaagca 2220
tataagtcag aattattcca gaaagccaag tctctaatgc aatgtgcagg agaaagtatc 2280
actctgttta aaaataagac aaatgaggaa ttcagaattg gttccttgag aaatatgatg 2340
cagctatgta cacgttgctt gagcaactgt accaagaaga gtccaaataa gattgcatct 2400
ggctttttcc tgcgattgtt aacatcaaag ctaatgaatg acattgcaga tatttgtaaa 2460
agtttagcat ccttcatcaa aaagccattt gaccgtggag aagtagaatc aatggaagat 2520
gatactaatg gaaatctaat ggaggtggag gatcagtcat ccatgaatct atttaacgat 2580
taccctgata gtagtgttag tgatgcaaac gaacctggag agagccaaag taccataggt 2640
gccattaatc ctttagctga agaatatctg tcaaagcaag atctactttt cttagacatg 2700
ctcaagttct tgtgtttgtg tgtaactact gctcagacca atactgtgtc ctttagggca 2760
gctgatattc ggaggaaatt gttaatgtta attgattcta gcacgctaga acctaccaaa 2820
tccctccacc tgcatatgta tctaatgctt ttaaaggagc ttcctggaga agagtacccc 2880
ttgccaatgg aagatgttct tgaacttctg aaaccactat ccaatgtgtg ttctttgtat 2940
cgtcgtgacc aagatgtttg taaaactatt ttaaaccatg tccttcatgt agtgaaaaac 3000
ctaggtcaaa gcaatatgga ctctgagaac acaagggatg ctcaaggaca gtttcttaca 3060
gtaattggag cattttggca tctaacaaag gagaggaaat atatattctc tgtaagaatg 3120
gccctagtaa attgccttaa aactttgctt gaggctgatc cttattcaaa atgggccatt 3180
cttaatgtaa tgggaaaaga ctttcctgta aatgaagtat ttacacaatt tcttgctgac 3240
aatcatcacc aagttcgcat gttggctgca gagtcaatca atagattgtt ccaggacacg 3300
aagggagatt cttccaggtt actgaaagca cttcctttga agcttcagca aacagctttt 3360
gaaaatgcat acttgaaagc tcaggaagga atgagagaaa tgtcccatag tgctgagaac 3420
cctgaaactt tggatgaaat ttataataga aaatctgttt tactgacgtt gatagctgtg 3480
gttttatcct gtagccctat ctgcgaaaaa caggctttgt ttgccctgtg taaatctgtg 3540
aaagagaatg gattagaacc tcaccttgtg aaaaaggttt tagagaaagt ttctgaaact 3600
tttggatata gacgtttaga agactttatg gcatctcatt tagattatct ggttttggaa 3660
tggctaaatc ttcaagatac tgaatacaac ttatcttctt ttccttttat tttattaaac 3720
tacacaaata ttgaggattt ctatagatct tgttataagg ttttgattcc acatctggtg 3780
attagaagtc attttgatga ggtgaagtcc attgctaatc agattcaaga ggactggaaa 3840
agtcttctaa cagactgctt tccaaagatt cttgtaaata ttcttcctta ttttgcctat 3900
gagggtacca gagacagtgg gatggcacag caaagagaga ctgctaccaa ggtctatgat 3960
atgcttaaaa gtgaaaactt attgggaaaa cagattgatc acttattcat tagtaattta 4020
ccagagattg tggtggagtt attgatgacg ttacatgagc cagcaaattc tagtgccagt 4080
cagagcactg acctctgtga cttttcaggg gatttggatc ctgctcctaa tccacctcat 4140
tttccatcgc atgtgattaa agcaacattt gcctatatca gcaattgtca taaaaccaag 4200
ttaaaaagca ttttagaaat tctttccaaa agccctgatt cctatcagaa aattcttctt 4260
gccatatgtg agcaagcagc tgaaacaaat aatgtttata agaagcacag aattcttaaa 4320
atatatcacc tgtttgttag tttattactg aaagatataa aaagtggctt aggaggagct 4380
tgggcctttg ttcttcgaga cgttatttat actttgattc actatatcaa ccaaaggcct 4440
tcttgtatca tggatgtgtc attacgtagc ttctcccttt gttgtgactt attaagtcag 4500
gtttgccaga cagccgtgac ttactgtaag gatgctctag aaaaccatct tcatgttatt 4560
gttggtacac ttatacccct tgtgtatgag caggtggagg ttcagaaaca ggtattggac 4620
ttgttgaaat acttagtgat agataacaag gataatgaaa acctctatat cacgattaag 4680
cttttagatc cttttcctga ccatgttgtt tttaaggatt tgcgtattac tcagcaaaaa 4740
atcaaataca gtagaggacc cttttcactc ttggaggaaa ttaaccattt tctctcagta 4800
agtgtttatg atgcacttcc attgacaaga cttgaaggac taaaggatct tcgaagacaa 4860
ctggaactac ataaagatca gatggtggac attatgagag cttctcagga taatccgcaa 4920
gatgggatta tggtgaaact agttgtcaat ttgttgcagt tatccaagat ggcaataaac 4980
cacactggtg aaaaagaagt tctagaggct gttggaagct gcttgggaga agtgggtcct 5040
atagatttct ctaccatagc tatacaacat agtaaagatg catcttatac caaggccctt 5100
aagttatttg aagataaaga acttcagtgg accttcataa tgctgaccta cctgaataac 5160
acactggtag aagattgtgt caaagttcga tcagcagctg ttacctgttt gaaaaacatt 5220
ttagccacaa agactggaca tagtttctgg gagatttata agatgacaac agatccaatg 5280
ctggcctatc tacagccttt tagaacatca agaaaaaagt ttttagaagt acccagattt 5340
gacaaagaaa acccttttga aggcctggat gatataaatc tgtggattcc tctaagtgaa 5400
aatcatgaca tttggataaa gacactgact tgtgcttttt tggacagtgg aggcacaaaa 5460
tgtgaaattc ttcaattatt aaagccaatg tgtgaagtga aaactgactt ttgtcagact 5520
gtacttccat acttgattca tgatatttta ctccaagata caaatgaatc atggagaaat 5580
ctgctttcta cacatgttca gggatttttc accagctgtc ttcgacactt ctcgcaaacg 5640
agccgatcca caacccctgc aaacttggat tcagagtcag agcacttttt ccgatgctgt 5700
ttggataaaa aatcacaaag aacaatgctt gctgttgtgg actacatgag aagacaaaag 5760
agaccttctt caggaacaat ttttaatgat gctttctggc tggatttaaa ttatctagaa 5820
gttgccaagg tagctcagtc ttgtgctgct cactttacag ctttactcta tgcagaaatc 5880
tatgcagata agaaaagtat ggatgatcaa gagaaaagaa gtcttgcatt tgaagaagga 5940
agccagaata caactatttc tagcttgagt gaaaaaagta aagaagaaac tggaataagt 6000
ttacaggatc ttctcttaga aatctacaga agtatagggg agccagatag tttgtatggc 6060
tgtggtggag ggaagatgtt acaacccatt actagactac gaacatatga acacgaagca 6120
atgtggggca aagccctagt aacatatgac ctcgaaacag caatcccctc atcaacacgc 6180
caggcaggaa tcattcaggc cttgcagaat ttgggactct gccatattct ttccgtctat 6240
ttaaaaggat tggattatga aaataaagac tggtgtcctg aactagaaga acttcattac 6300
caagcagcat ggaggaatat gcagtgggac cattgcactt ccgtcagcaa agaagtagaa 6360
ggaaccagtt accatgaatc attgtacaat gctctacaat ctctaagaga cagagaattc 6420
tctacatttt atgaaagtct caaatatgcc agagtaaaag aagtggaaga gatgtgtaag 6480
cgcagccttg agtctgtgta ttcgctctat cccacactta gcaggttgca ggccattgga 6540
gagctggaaa gcattgggga gcttttctca agatcagtca cacatagaca actctctgaa 6600
gtatatatta agtggcagaa acactcccag cttctcaagg acagtgattt tagttttcag 6660
gagcctatca tggctctacg cacagtcatt ttggagatcc tgatggaaaa ggaaatggac 6720
aactcacaaa gagaatgtat taaggacatt ctcaccaaac accttgtaga actctctata 6780
ctggccagaa ctttcaagaa cactcagctc cctgaaaggg caatatttca aattaaacag 6840
tacaattcag ttagctgtgg agtctctgag tggcagctgg aagaagcaca agtattctgg 6900
gcaaaaaagg agcagagtct tgccctgagt attctcaagc aaatgatcaa gaagttggat 6960
gccagctgtg cagcgaacaa tcccagccta aaacttacat acacagaatg tctgagggtt 7020
tgtggcaact ggttagcaga aacgtgctta gaaaatcctg cggtcatcat gcagacctat 7080
ctagaaaagg cagtagaagt tgctggaaat tatgatggag aaagtagtga tgagctaaga 7140
aatggaaaaa tgaaggcatt tctctcatta gcccggtttt cagatactca ataccaaaga 7200
attgaaaact acatgaaatc atcggaattt gaaaacaagc aagctctcct gaaaagagcc 7260
aaagaggaag taggtctcct tagggaacat aaaattcaga caaacagata cacagtaaag 7320
gttcagcgag agctggagtt ggatgaatta gccctgcgtg cactgaaaga ggatcgtaaa 7380
cgcttcttat gtaaagcagt tgaaaattat atcaactgct tattaagtgg agaagaacat 7440
gatatgtggg tattccgact ttgttccctc tggcttgaaa attctggagt ttctgaagtc 7500
aatggcatga tgaagagaga cggaatgaag attccaacat ataaattttt gcctcttatg 7560
taccaattgg ctgctagaat ggggaccaag atgatgggag gcctaggatt tcatgaagtc 7620
ctcaataatc taatctctag aatttcaatg gatcaccccc atcacacttt gtttattata 7680
ctggccttag caaatgcaaa cagagatgaa tttctgacta aaccagaggt agccagaaga 7740
agcagaataa ctaaaaatgt gcctaaacaa agctctcagc ttgatgagga tcgaacagag 7800
gctgcaaata gaataatatg tactatcaga agtaggagac ctcagatggt cagaagtgtt 7860
gaggcacttt gtgatgctta tattatatta gcaaacttag atgccactca gtggaagact 7920
cagagaaaag gcataaatat tccagcagac cagccaatta ctaaacttaa gaatttagaa 7980
gatgttgttg tccctactat ggaaattaag gtggaccaca caggagaata tggaaatctg 8040
gtgactatac agtcatttaa agcagaattt cgcttagcag gaggtgtaaa tttaccaaaa 8100
ataatagatt gtgtaggttc cgatggcaag gagaggagac agcttgttaa gggccgtgat 8160
gacctgagac aagatgctgt catgcaacag gtcttccaga tgtgtaatac attactgcag 8220
agaaacacgg aaactaggaa gaggaaatta actatctgta cttataaggt ggttcccctc 8280
tctcagcgaa gtggtgttct tgaatggtgc acaggaactg tccccattgg tgaatttctt 8340
gttaacaatg aagatggtgc tcataaaaga tacaggccaa atgatttcag tgcctttcag 8400
tgccaaaaga aaatgatgga ggtgcaaaaa aagtcttttg aagagaaata tgaagtcttc 8460
atggatgttt gccaaaattt tcaaccagtt ttccgttact tctgcatgga aaaattcttg 8520
gatccagcta tttggtttga gaagcgattg gcttatacgc gcagtgtagc tacttcttct 8580
attgttggtt acatacttgg acttggtgat agacatgtac agaatatctt gataaatgag 8640
cagtcagcag aacttgtaca tatagatcta ggtgttgctt ttgaacaggg caaaatcctt 8700
cctactcctg agacagttcc ttttagactc accagagata ttgtggatgg catgggcatt 8760
acgggtgttg aaggtgtctt cagaagatgc tgtgagaaaa ccatggaagt gatgagaaac 8820
tctcaggaaa ctctgttaac cattgtagag gtccttctat atgatccact ctttgactgg 8880
accatgaatc ctttgaaagc tttgtattta cagcagaggc cggaagatga aactgagctt 8940
caccctactc tgaatgcaga tgaccaagaa tgcaaacgaa atctcagtga tattgaccag 9000
agtttcaaca aagtagctga acgtgtctta atgagactac aagagaaact gaaaggagtg 9060
gaagaaggca ctgtgctcag tgttggtgga caagtgaatt tgctcataca gcaggccata 9120
gaccccaaaa atctcagccg acttttccca ggatggaaag cttgggtgtg a 9171
<210> 2
<211> 3056
<212> PRT
<213> Homo sapiens
<400> 2
Met Ser Leu Val Leu Asn Asp Leu Leu Ile Cys Cys Arg Gln Leu Glu
1 5 10 15
His Asp Arg Ala Thr Glu Arg Lys Lys Glu Val Glu Lys Phe Lys Arg
20 25 30
Leu Ile Arg Asp Pro Glu Thr Ile Lys His Leu Asp Arg His Ser Asp
35 40 45
Ser Lys Gln Gly Lys Tyr Leu Asn Trp Asp Ala Val Phe Arg Phe Leu
50 55 60
Gln Lys Tyr Ile Gln Lys Glu Thr Glu Cys Leu Arg Ile Ala Lys Pro
65 70 75 80
Asn Val Ser Ala Ser Thr Gln Ala Ser Arg Gln Lys Lys Met Gln Glu
85 90 95
Ile Ser Ser Leu Val Lys Tyr Phe Ile Lys Cys Ala Asn Arg Arg Ala
100 105 110
Pro Arg Leu Lys Cys Gln Glu Leu Leu Asn Tyr Ile Met Asp Thr Val
115 120 125
Lys Asp Ser Ser Asn Gly Ala Ile Tyr Gly Ala Asp Cys Ser Asn Ile
130 135 140
Leu Leu Lys Asp Ile Leu Ser Val Arg Lys Tyr Trp Cys Glu Ile Ser
145 150 155 160
Gln Gln Gln Trp Leu Glu Leu Phe Ser Val Tyr Phe Arg Leu Tyr Leu
165 170 175
Lys Pro Ser Gln Asp Val His Arg Val Leu Val Ala Arg Ile Ile His
180 185 190
Ala Val Thr Lys Gly Cys Cys Ser Gln Thr Asp Gly Leu Asn Ser Lys
195 200 205
Phe Leu Asp Phe Phe Ser Lys Ala Ile Gln Cys Ala Arg Gln Glu Lys
210 215 220
Ser Ser Ser Gly Leu Asn His Ile Leu Ala Ala Leu Thr Ile Phe Leu
225 230 235 240
Lys Thr Leu Ala Val Asn Phe Arg Ile Arg Val Cys Glu Leu Gly Asp
245 250 255
Glu Ile Leu Pro Thr Leu Leu Tyr Ile Trp Thr Gln His Arg Leu Asn
260 265 270
Asp Ser Leu Lys Glu Val Ile Ile Glu Leu Phe Gln Leu Gln Ile Tyr
275 280 285
Ile His His Pro Lys Gly Ala Lys Thr Gln Glu Lys Gly Ala Tyr Glu
290 295 300
Ser Thr Lys Trp Arg Ser Ile Leu Tyr Asn Leu Tyr Asp Leu Leu Val
305 310 315 320
Asn Glu Ile Ser His Ile Gly Ser Arg Gly Lys Tyr Ser Ser Gly Phe
325 330 335
Arg Asn Ile Ala Val Lys Glu Asn Leu Ile Glu Leu Met Ala Asp Ile
340 345 350
Cys His Gln Val Phe Asn Glu Asp Thr Arg Ser Leu Glu Ile Ser Gln
355 360 365
Ser Tyr Thr Thr Thr Gln Arg Glu Ser Ser Asp Tyr Ser Val Pro Cys
370 375 380
Lys Arg Lys Lys Ile Glu Leu Gly Trp Glu Val Ile Lys Asp His Leu
385 390 395 400
Gln Lys Ser Gln Asn Asp Phe Asp Leu Val Pro Trp Leu Gln Ile Ala
405 410 415
Thr Gln Leu Ile Ser Lys Tyr Pro Ala Ser Leu Pro Asn Cys Glu Leu
420 425 430
Ser Pro Leu Leu Met Ile Leu Ser Gln Leu Leu Pro Gln Gln Arg His
435 440 445
Gly Glu Arg Thr Pro Tyr Val Leu Arg Cys Leu Thr Glu Val Ala Leu
450 455 460
Cys Gln Asp Lys Arg Ser Asn Leu Glu Ser Ser Gln Lys Ser Asp Leu
465 470 475 480
Leu Lys Leu Trp Asn Lys Ile Trp Cys Ile Thr Phe Arg Gly Ile Ser
485 490 495
Ser Glu Gln Ile Gln Ala Glu Asn Phe Gly Leu Leu Gly Ala Ile Ile
500 505 510
Gln Gly Ser Leu Val Glu Val Asp Arg Glu Phe Trp Lys Leu Phe Thr
515 520 525
Gly Ser Ala Cys Arg Pro Ser Cys Pro Ala Val Cys Cys Leu Thr Leu
530 535 540
Ala Leu Thr Thr Ser Ile Val Pro Gly Thr Val Lys Met Gly Ile Glu
545 550 555 560
Gln Asn Met Cys Glu Val Asn Arg Ser Phe Ser Leu Lys Glu Ser Ile
565 570 575
Met Lys Trp Leu Leu Phe Tyr Gln Leu Glu Gly Asp Leu Glu Asn Ser
580 585 590
Thr Glu Val Pro Pro Ile Leu His Ser Asn Phe Pro His Leu Val Leu
595 600 605
Glu Lys Ile Leu Val Ser Leu Thr Met Lys Asn Cys Lys Ala Ala Met
610 615 620
Asn Phe Phe Gln Ser Val Pro Glu Cys Glu His His Gln Lys Asp Lys
625 630 635 640
Glu Glu Leu Ser Phe Ser Glu Val Glu Glu Leu Phe Leu Gln Thr Thr
645 650 655
Phe Asp Lys Met Asp Phe Leu Thr Ile Val Arg Glu Cys Gly Ile Glu
660 665 670
Lys His Gln Ser Ser Ile Gly Phe Ser Val His Gln Asn Leu Lys Glu
675 680 685
Ser Leu Asp Arg Cys Leu Leu Gly Leu Ser Glu Gln Leu Leu Asn Asn
690 695 700
Tyr Ser Ser Glu Ile Thr Asn Ser Glu Thr Leu Val Arg Cys Ser Arg
705 710 715 720
Leu Leu Val Gly Val Leu Gly Cys Tyr Cys Tyr Met Gly Val Ile Ala
725 730 735
Glu Glu Glu Ala Tyr Lys Ser Glu Leu Phe Gln Lys Ala Lys Ser Leu
740 745 750
Met Gln Cys Ala Gly Glu Ser Ile Thr Leu Phe Lys Asn Lys Thr Asn
755 760 765
Glu Glu Phe Arg Ile Gly Ser Leu Arg Asn Met Met Gln Leu Cys Thr
770 775 780
Arg Cys Leu Ser Asn Cys Thr Lys Lys Ser Pro Asn Lys Ile Ala Ser
785 790 795 800
Gly Phe Phe Leu Arg Leu Leu Thr Ser Lys Leu Met Asn Asp Ile Ala
805 810 815
Asp Ile Cys Lys Ser Leu Ala Ser Phe Ile Lys Lys Pro Phe Asp Arg
820 825 830
Gly Glu Val Glu Ser Met Glu Asp Asp Thr Asn Gly Asn Leu Met Glu
835 840 845
Val Glu Asp Gln Ser Ser Met Asn Leu Phe Asn Asp Tyr Pro Asp Ser
850 855 860
Ser Val Ser Asp Ala Asn Glu Pro Gly Glu Ser Gln Ser Thr Ile Gly
865 870 875 880
Ala Ile Asn Pro Leu Ala Glu Glu Tyr Leu Ser Lys Gln Asp Leu Leu
885 890 895
Phe Leu Asp Met Leu Lys Phe Leu Cys Leu Cys Val Thr Thr Ala Gln
900 905 910
Thr Asn Thr Val Ser Phe Arg Ala Ala Asp Ile Arg Arg Lys Leu Leu
915 920 925
Met Leu Ile Asp Ser Ser Thr Leu Glu Pro Thr Lys Ser Leu His Leu
930 935 940
His Met Tyr Leu Met Leu Leu Lys Glu Leu Pro Gly Glu Glu Tyr Pro
945 950 955 960
Leu Pro Met Glu Asp Val Leu Glu Leu Leu Lys Pro Leu Ser Asn Val
965 970 975
Cys Ser Leu Tyr Arg Arg Asp Gln Asp Val Cys Lys Thr Ile Leu Asn
980 985 990
His Val Leu His Val Val Lys Asn Leu Gly Gln Ser Asn Met Asp Ser
995 1000 1005
Glu Asn Thr Arg Asp Ala Gln Gly Gln Phe Leu Thr Val Ile Gly
1010 1015 1020
Ala Phe Trp His Leu Thr Lys Glu Arg Lys Tyr Ile Phe Ser Val
1025 1030 1035
Arg Met Ala Leu Val Asn Cys Leu Lys Thr Leu Leu Glu Ala Asp
1040 1045 1050
Pro Tyr Ser Lys Trp Ala Ile Leu Asn Val Met Gly Lys Asp Phe
1055 1060 1065
Pro Val Asn Glu Val Phe Thr Gln Phe Leu Ala Asp Asn His His
1070 1075 1080
Gln Val Arg Met Leu Ala Ala Glu Ser Ile Asn Arg Leu Phe Gln
1085 1090 1095
Asp Thr Lys Gly Asp Ser Ser Arg Leu Leu Lys Ala Leu Pro Leu
1100 1105 1110
Lys Leu Gln Gln Thr Ala Phe Glu Asn Ala Tyr Leu Lys Ala Gln
1115 1120 1125
Glu Gly Met Arg Glu Met Ser His Ser Ala Glu Asn Pro Glu Thr
1130 1135 1140
Leu Asp Glu Ile Tyr Asn Arg Lys Ser Val Leu Leu Thr Leu Ile
1145 1150 1155
Ala Val Val Leu Ser Cys Ser Pro Ile Cys Glu Lys Gln Ala Leu
1160 1165 1170
Phe Ala Leu Cys Lys Ser Val Lys Glu Asn Gly Leu Glu Pro His
1175 1180 1185
Leu Val Lys Lys Val Leu Glu Lys Val Ser Glu Thr Phe Gly Tyr
1190 1195 1200
Arg Arg Leu Glu Asp Phe Met Ala Ser His Leu Asp Tyr Leu Val
1205 1210 1215
Leu Glu Trp Leu Asn Leu Gln Asp Thr Glu Tyr Asn Leu Ser Ser
1220 1225 1230
Phe Pro Phe Ile Leu Leu Asn Tyr Thr Asn Ile Glu Asp Phe Tyr
1235 1240 1245
Arg Ser Cys Tyr Lys Val Leu Ile Pro His Leu Val Ile Arg Ser
1250 1255 1260
His Phe Asp Glu Val Lys Ser Ile Ala Asn Gln Ile Gln Glu Asp
1265 1270 1275
Trp Lys Ser Leu Leu Thr Asp Cys Phe Pro Lys Ile Leu Val Asn
1280 1285 1290
Ile Leu Pro Tyr Phe Ala Tyr Glu Gly Thr Arg Asp Ser Gly Met
1295 1300 1305
Ala Gln Gln Arg Glu Thr Ala Thr Lys Val Tyr Asp Met Leu Lys
1310 1315 1320
Ser Glu Asn Leu Leu Gly Lys Gln Ile Asp His Leu Phe Ile Ser
1325 1330 1335
Asn Leu Pro Glu Ile Val Val Glu Leu Leu Met Thr Leu His Glu
1340 1345 1350
Pro Ala Asn Ser Ser Ala Ser Gln Ser Thr Asp Leu Cys Asp Phe
1355 1360 1365
Ser Gly Asp Leu Asp Pro Ala Pro Asn Pro Pro His Phe Pro Ser
1370 1375 1380
His Val Ile Lys Ala Thr Phe Ala Tyr Ile Ser Asn Cys His Lys
1385 1390 1395
Thr Lys Leu Lys Ser Ile Leu Glu Ile Leu Ser Lys Ser Pro Asp
1400 1405 1410
Ser Tyr Gln Lys Ile Leu Leu Ala Ile Cys Glu Gln Ala Ala Glu
1415 1420 1425
Thr Asn Asn Val Tyr Lys Lys His Arg Ile Leu Lys Ile Tyr His
1430 1435 1440
Leu Phe Val Ser Leu Leu Leu Lys Asp Ile Lys Ser Gly Leu Gly
1445 1450 1455
Gly Ala Trp Ala Phe Val Leu Arg Asp Val Ile Tyr Thr Leu Ile
1460 1465 1470
His Tyr Ile Asn Gln Arg Pro Ser Cys Ile Met Asp Val Ser Leu
1475 1480 1485
Arg Ser Phe Ser Leu Cys Cys Asp Leu Leu Ser Gln Val Cys Gln
1490 1495 1500
Thr Ala Val Thr Tyr Cys Lys Asp Ala Leu Glu Asn His Leu His
1505 1510 1515
Val Ile Val Gly Thr Leu Ile Pro Leu Val Tyr Glu Gln Val Glu
1520 1525 1530
Val Gln Lys Gln Val Leu Asp Leu Leu Lys Tyr Leu Val Ile Asp
1535 1540 1545
Asn Lys Asp Asn Glu Asn Leu Tyr Ile Thr Ile Lys Leu Leu Asp
1550 1555 1560
Pro Phe Pro Asp His Val Val Phe Lys Asp Leu Arg Ile Thr Gln
1565 1570 1575
Gln Lys Ile Lys Tyr Ser Arg Gly Pro Phe Ser Leu Leu Glu Glu
1580 1585 1590
Ile Asn His Phe Leu Ser Val Ser Val Tyr Asp Ala Leu Pro Leu
1595 1600 1605
Thr Arg Leu Glu Gly Leu Lys Asp Leu Arg Arg Gln Leu Glu Leu
1610 1615 1620
His Lys Asp Gln Met Val Asp Ile Met Arg Ala Ser Gln Asp Asn
1625 1630 1635
Pro Gln Asp Gly Ile Met Val Lys Leu Val Val Asn Leu Leu Gln
1640 1645 1650
Leu Ser Lys Met Ala Ile Asn His Thr Gly Glu Lys Glu Val Leu
1655 1660 1665
Glu Ala Val Gly Ser Cys Leu Gly Glu Val Gly Pro Ile Asp Phe
1670 1675 1680
Ser Thr Ile Ala Ile Gln His Ser Lys Asp Ala Ser Tyr Thr Lys
1685 1690 1695
Ala Leu Lys Leu Phe Glu Asp Lys Glu Leu Gln Trp Thr Phe Ile
1700 1705 1710
Met Leu Thr Tyr Leu Asn Asn Thr Leu Val Glu Asp Cys Val Lys
1715 1720 1725
Val Arg Ser Ala Ala Val Thr Cys Leu Lys Asn Ile Leu Ala Thr
1730 1735 1740
Lys Thr Gly His Ser Phe Trp Glu Ile Tyr Lys Met Thr Thr Asp
1745 1750 1755
Pro Met Leu Ala Tyr Leu Gln Pro Phe Arg Thr Ser Arg Lys Lys
1760 1765 1770
Phe Leu Glu Val Pro Arg Phe Asp Lys Glu Asn Pro Phe Glu Gly
1775 1780 1785
Leu Asp Asp Ile Asn Leu Trp Ile Pro Leu Ser Glu Asn His Asp
1790 1795 1800
Ile Trp Ile Lys Thr Leu Thr Cys Ala Phe Leu Asp Ser Gly Gly
1805 1810 1815
Thr Lys Cys Glu Ile Leu Gln Leu Leu Lys Pro Met Cys Glu Val
1820 1825 1830
Lys Thr Asp Phe Cys Gln Thr Val Leu Pro Tyr Leu Ile His Asp
1835 1840 1845
Ile Leu Leu Gln Asp Thr Asn Glu Ser Trp Arg Asn Leu Leu Ser
1850 1855 1860
Thr His Val Gln Gly Phe Phe Thr Ser Cys Leu Arg His Phe Ser
1865 1870 1875
Gln Thr Ser Arg Ser Thr Thr Pro Ala Asn Leu Asp Ser Glu Ser
1880 1885 1890
Glu His Phe Phe Arg Cys Cys Leu Asp Lys Lys Ser Gln Arg Thr
1895 1900 1905
Met Leu Ala Val Val Asp Tyr Met Arg Arg Gln Lys Arg Pro Ser
1910 1915 1920
Ser Gly Thr Ile Phe Asn Asp Ala Phe Trp Leu Asp Leu Asn Tyr
1925 1930 1935
Leu Glu Val Ala Lys Val Ala Gln Ser Cys Ala Ala His Phe Thr
1940 1945 1950
Ala Leu Leu Tyr Ala Glu Ile Tyr Ala Asp Lys Lys Ser Met Asp
1955 1960 1965
Asp Gln Glu Lys Arg Ser Leu Ala Phe Glu Glu Gly Ser Gln Asn
1970 1975 1980
Thr Thr Ile Ser Ser Leu Ser Glu Lys Ser Lys Glu Glu Thr Gly
1985 1990 1995
Ile Ser Leu Gln Asp Leu Leu Leu Glu Ile Tyr Arg Ser Ile Gly
2000 2005 2010
Glu Pro Asp Ser Leu Tyr Gly Cys Gly Gly Gly Lys Met Leu Gln
2015 2020 2025
Pro Ile Thr Arg Leu Arg Thr Tyr Glu His Glu Ala Met Trp Gly
2030 2035 2040
Lys Ala Leu Val Thr Tyr Asp Leu Glu Thr Ala Ile Pro Ser Ser
2045 2050 2055
Thr Arg Gln Ala Gly Ile Ile Gln Ala Leu Gln Asn Leu Gly Leu
2060 2065 2070
Cys His Ile Leu Ser Val Tyr Leu Lys Gly Leu Asp Tyr Glu Asn
2075 2080 2085
Lys Asp Trp Cys Pro Glu Leu Glu Glu Leu His Tyr Gln Ala Ala
2090 2095 2100
Trp Arg Asn Met Gln Trp Asp His Cys Thr Ser Val Ser Lys Glu
2105 2110 2115
Val Glu Gly Thr Ser Tyr His Glu Ser Leu Tyr Asn Ala Leu Gln
2120 2125 2130
Ser Leu Arg Asp Arg Glu Phe Ser Thr Phe Tyr Glu Ser Leu Lys
2135 2140 2145
Tyr Ala Arg Val Lys Glu Val Glu Glu Met Cys Lys Arg Ser Leu
2150 2155 2160
Glu Ser Val Tyr Ser Leu Tyr Pro Thr Leu Ser Arg Leu Gln Ala
2165 2170 2175
Ile Gly Glu Leu Glu Ser Ile Gly Glu Leu Phe Ser Arg Ser Val
2180 2185 2190
Thr His Arg Gln Leu Ser Glu Val Tyr Ile Lys Trp Gln Lys His
2195 2200 2205
Ser Gln Leu Leu Lys Asp Ser Asp Phe Ser Phe Gln Glu Pro Ile
2210 2215 2220
Met Ala Leu Arg Thr Val Ile Leu Glu Ile Leu Met Glu Lys Glu
2225 2230 2235
Met Asp Asn Ser Gln Arg Glu Cys Ile Lys Asp Ile Leu Thr Lys
2240 2245 2250
His Leu Val Glu Leu Ser Ile Leu Ala Arg Thr Phe Lys Asn Thr
2255 2260 2265
Gln Leu Pro Glu Arg Ala Ile Phe Gln Ile Lys Gln Tyr Asn Ser
2270 2275 2280
Val Ser Cys Gly Val Ser Glu Trp Gln Leu Glu Glu Ala Gln Val
2285 2290 2295
Phe Trp Ala Lys Lys Glu Gln Ser Leu Ala Leu Ser Ile Leu Lys
2300 2305 2310
Gln Met Ile Lys Lys Leu Asp Ala Ser Cys Ala Ala Asn Asn Pro
2315 2320 2325
Ser Leu Lys Leu Thr Tyr Thr Glu Cys Leu Arg Val Cys Gly Asn
2330 2335 2340
Trp Leu Ala Glu Thr Cys Leu Glu Asn Pro Ala Val Ile Met Gln
2345 2350 2355
Thr Tyr Leu Glu Lys Ala Val Glu Val Ala Gly Asn Tyr Asp Gly
2360 2365 2370
Glu Ser Ser Asp Glu Leu Arg Asn Gly Lys Met Lys Ala Phe Leu
2375 2380 2385
Ser Leu Ala Arg Phe Ser Asp Thr Gln Tyr Gln Arg Ile Glu Asn
2390 2395 2400
Tyr Met Lys Ser Ser Glu Phe Glu Asn Lys Gln Ala Leu Leu Lys
2405 2410 2415
Arg Ala Lys Glu Glu Val Gly Leu Leu Arg Glu His Lys Ile Gln
2420 2425 2430
Thr Asn Arg Tyr Thr Val Lys Val Gln Arg Glu Leu Glu Leu Asp
2435 2440 2445
Glu Leu Ala Leu Arg Ala Leu Lys Glu Asp Arg Lys Arg Phe Leu
2450 2455 2460
Cys Lys Ala Val Glu Asn Tyr Ile Asn Cys Leu Leu Ser Gly Glu
2465 2470 2475
Glu His Asp Met Trp Val Phe Arg Leu Cys Ser Leu Trp Leu Glu
2480 2485 2490
Asn Ser Gly Val Ser Glu Val Asn Gly Met Met Lys Arg Asp Gly
2495 2500 2505
Met Lys Ile Pro Thr Tyr Lys Phe Leu Pro Leu Met Tyr Gln Leu
2510 2515 2520
Ala Ala Arg Met Gly Thr Lys Met Met Gly Gly Leu Gly Phe His
2525 2530 2535
Glu Val Leu Asn Asn Leu Ile Ser Arg Ile Ser Met Asp His Pro
2540 2545 2550
His His Thr Leu Phe Ile Ile Leu Ala Leu Ala Asn Ala Asn Arg
2555 2560 2565
Asp Glu Phe Leu Thr Lys Pro Glu Val Ala Arg Arg Ser Arg Ile
2570 2575 2580
Thr Lys Asn Val Pro Lys Gln Ser Ser Gln Leu Asp Glu Asp Arg
2585 2590 2595
Thr Glu Ala Ala Asn Arg Ile Ile Cys Thr Ile Arg Ser Arg Arg
2600 2605 2610
Pro Gln Met Val Arg Ser Val Glu Ala Leu Cys Asp Ala Tyr Ile
2615 2620 2625
Ile Leu Ala Asn Leu Asp Ala Thr Gln Trp Lys Thr Gln Arg Lys
2630 2635 2640
Gly Ile Asn Ile Pro Ala Asp Gln Pro Ile Thr Lys Leu Lys Asn
2645 2650 2655
Leu Glu Asp Val Val Val Pro Thr Met Glu Ile Lys Val Asp His
2660 2665 2670
Thr Gly Glu Tyr Gly Asn Leu Val Thr Ile Gln Ser Phe Lys Ala
2675 2680 2685
Glu Phe Arg Leu Ala Gly Gly Val Asn Leu Pro Lys Ile Ile Asp
2690 2695 2700
Cys Val Gly Ser Asp Gly Lys Glu Arg Arg Gln Leu Val Lys Gly
2705 2710 2715
Arg Asp Asp Leu Arg Gln Asp Ala Val Met Gln Gln Val Phe Gln
2720 2725 2730
Met Cys Asn Thr Leu Leu Gln Arg Asn Thr Glu Thr Arg Lys Arg
2735 2740 2745
Lys Leu Thr Ile Cys Thr Tyr Lys Val Val Pro Leu Ser Gln Arg
2750 2755 2760
Ser Gly Val Leu Glu Trp Cys Thr Gly Thr Val Pro Ile Gly Glu
2765 2770 2775
Phe Leu Val Asn Asn Glu Asp Gly Ala His Lys Arg Tyr Arg Pro
2780 2785 2790
Asn Asp Phe Ser Ala Phe Gln Cys Gln Lys Lys Met Met Glu Val
2795 2800 2805
Gln Lys Lys Ser Phe Glu Glu Lys Tyr Glu Val Phe Met Asp Val
2810 2815 2820
Cys Gln Asn Phe Gln Pro Val Phe Arg Tyr Phe Cys Met Glu Lys
2825 2830 2835
Phe Leu Asp Pro Ala Ile Trp Phe Glu Lys Arg Leu Ala Tyr Thr
2840 2845 2850
Arg Ser Val Ala Thr Ser Ser Ile Val Gly Tyr Ile Leu Gly Leu
2855 2860 2865
Gly Asp Arg His Val Gln Asn Ile Leu Ile Asn Glu Gln Ser Ala
2870 2875 2880
Glu Leu Val His Ile Asp Leu Gly Val Ala Phe Glu Gln Gly Lys
2885 2890 2895
Ile Leu Pro Thr Pro Glu Thr Val Pro Phe Arg Leu Thr Arg Asp
2900 2905 2910
Ile Val Asp Gly Met Gly Ile Thr Gly Val Glu Gly Val Phe Arg
2915 2920 2925
Arg Cys Cys Glu Lys Thr Met Glu Val Met Arg Asn Ser Gln Glu
2930 2935 2940
Thr Leu Leu Thr Ile Val Glu Val Leu Leu Tyr Asp Pro Leu Phe
2945 2950 2955
Asp Trp Thr Met Asn Pro Leu Lys Ala Leu Tyr Leu Gln Gln Arg
2960 2965 2970
Pro Glu Asp Glu Thr Glu Leu His Pro Thr Leu Asn Ala Asp Asp
2975 2980 2985
Gln Glu Cys Lys Arg Asn Leu Ser Asp Ile Asp Gln Ser Phe Asn
2990 2995 3000
Lys Val Ala Glu Arg Val Leu Met Arg Leu Gln Glu Lys Leu Lys
3005 3010 3015
Gly Val Glu Glu Gly Thr Val Leu Ser Val Gly Gly Gln Val Asn
3020 3025 3030
Leu Leu Ile Gln Gln Ala Ile Asp Pro Lys Asn Leu Ser Arg Leu
3035 3040 3045
Phe Pro Gly Trp Lys Ala Trp Val
3050 3055
<210> 3
<211> 543
<212> DNA
<213> Homo sapiens
<400> 3
atgagtctag tacttaatga tctgcttatc tgctgccgtc aactagaaca tgatagagct 60
acagaacgaa agaaagaagt tgagaaattt aagcgcctga ttcgagatcc tgaaacaatt 120
aaacatctag atcggcattc agattccaaa caaggaaaat atttgaattg ggatgctgtt 180
tttagatttt tacagaaata tattcagaaa gaaacagaat gtctgagaat agcaaaacca 240
aatgtatcag cctcaacaca agcctccagg cagaaaaaga tgcaggaaat cagtagtttg 300
gtcaaatact tcatcaaatg tgcaaacaga agagcaccta ggctaaaatg tcaagaactc 360
ttaaattata tcatggatac agtgaaagat tcatctaatg gtgctattta cggagctgat 420
tgtagcaaca tactactcaa agacattctt tctgtgagaa aatactggtg tgaaatagca 480
acagtggtta gaattgttct ctgtgtactt caggctctat ctgaaacctt cacaagatgt 540
tca 543
<210> 4
<211> 181
<212> PRT
<213> Homo sapiens
<400> 4
Met Ser Leu Val Leu Asn Asp Leu Leu Ile Cys Cys Arg Gln Leu Glu
1 5 10 15
His Asp Arg Ala Thr Glu Arg Lys Lys Glu Val Glu Lys Phe Lys Arg
20 25 30
Leu Ile Arg Asp Pro Glu Thr Ile Lys His Leu Asp Arg His Ser Asp
35 40 45
Ser Lys Gln Gly Lys Tyr Leu Asn Trp Asp Ala Val Phe Arg Phe Leu
50 55 60
Gln Lys Tyr Ile Gln Lys Glu Thr Glu Cys Leu Arg Ile Ala Lys Pro
65 70 75 80
Asn Val Ser Ala Ser Thr Gln Ala Ser Arg Gln Lys Lys Met Gln Glu
85 90 95
Ile Ser Ser Leu Val Lys Tyr Phe Ile Lys Cys Ala Asn Arg Arg Ala
100 105 110
Pro Arg Leu Lys Cys Gln Glu Leu Leu Asn Tyr Ile Met Asp Thr Val
115 120 125
Lys Asp Ser Ser Asn Gly Ala Ile Tyr Gly Ala Asp Cys Ser Asn Ile
130 135 140
Leu Leu Lys Asp Ile Leu Ser Val Arg Lys Tyr Trp Cys Glu Ile Ala
145 150 155 160
Thr Val Val Arg Ile Val Leu Cys Val Leu Gln Ala Leu Ser Glu Thr
165 170 175
Phe Thr Arg Cys Ser
180
<210> 5
<211> 4785
<212> DNA
<213> Homo sapiens
<400> 5
atgagtctag tacttaatga tctgcttatc tgctgccgtc aactagaaca tgatagagct 60
acagaacgaa agaaagaagt tgagaaattt aagcgcctga ttcgagatcc tgaaacaatt 120
aaacatctag atcggcattc agattccaaa caaggaaaat atttgaattg ggatgctgtt 180
tttagatttt tacagaaata tattcagaaa gaaacagaat gtctgagaat agcaaaacca 240
aatgtatcag cctcaacaca agcctccagg cagaaaaaga tgcaggaaat cagtagtttg 300
gtcaaatact tcatcaaatg tgcaaacaga agagcaccta ggctaaaatg tcaagaactc 360
ttaaattata tcatggatac agtgaaagat tcatctaatg gtgctattta cggagctgat 420
tgtagcaaca tactactcaa agacattctt tctgtgagaa aatactggtg tgaaatatct 480
cagcaacagt ggttagaatt gttctctgtg tacttcaggc tctatctgaa accttcacaa 540
gatgttcata gagttttagt ggctagaata attcatgctg ttaccaaagg atgctgttct 600
cagactgacg gattaaattc caaatttttg gacttttttt ccaaggctat tcagtgtgcg 660
agacaagaaa agagctcttc aggtctaaat catatcttag cagctcttac tatcttcctc 720
aagactttgg ctgtcaactt tcgaattcga gtgtgtgaat taggagatga aattcttccc 780
actttgcttt atatttggac tcaacatagg cttaatgatt ctttaaaaga agtcattatt 840
gaattatttc aactgcaaat ttatatccat catccgaaag gagccaaaac ccaagaaaaa 900
ggtgcttatg aatcaacaaa atggagaagt attttataca acttatatga tctgctagtg 960
aatgagataa gtcatatagg aagtagagga aagtattctt caggatttcg taatattgcc 1020
gtcaaagaaa atttgattga attgatggca gatatctgtc accaggtttt taatgaagat 1080
accagatcct tggagatttc tcaatcttac actactacac aaagagaatc tagtgattac 1140
agtgtccctt gcaaaaggaa gaaaatagaa ctaggctggg aagtaataaa agatcacctt 1200
cagaagtcac agaatgattt tgatcttgtg ccttggctac agattgcaac ccaattaata 1260
tcaaagtatc ctgcaagttt acctaactgt gagctgtctc cattactgat gatactatct 1320
cagcttctac cccaacagcg acatggggaa cgtacaccat atgtgttacg atgccttacg 1380
gaagttgcat tgtgtcaaga caagaggtca aacctagaaa gctcacaaaa gtcagattta 1440
ttaaaactct ggaataaaat ttggtgtatt acctttcgtg gtataagttc tgagcaaata 1500
caagctgaaa actttggctt acttggagcc ataattcagg gtagtttagt tgaggttgac 1560
agagaattct ggaagttatt tactgggtca gcctgcagac cttcatgtcc tgcagtatgc 1620
tgtttgactt tggcactgac caccagtata gttccaggaa cggtaaaaat gggaatagag 1680
caaaatatgt gtgaagtaaa tagaagcttt tctttaaagg aatcaataat gaaatggctc 1740
ttattctatc agttagaggg tgacttagaa aatagcacag aagtgcctcc aattcttcac 1800
agtaattttc ctcatcttgt actggagaaa attcttgtga gtctcactat gaaaaactgt 1860
aaagctgcaa tgaatttttt ccaaagcgtg ccagaatgtg aacaccacca aaaagataaa 1920
gaagaacttt cattctcaga agtagaagaa ctatttcttc agacaacttt tgacaagatg 1980
gactttttaa ccattgtgag agaatgtggt atagaaaagc accagtccag tattggcttc 2040
tctgtccacc agaatctcaa ggaatcactg gatcgctgtc ttctgggatt atcagaacag 2100
cttctgaata attactcatc tgagattaca aattcagaaa ctcttgtccg gtgttcacgt 2160
cttttggtgg gtgtccttgg ctgctactgt tacatgggtg taatagctga agaggaagca 2220
tataagtcag aattattcca gaaagccaag tctctaatgc aatgtgcagg agaaagtatc 2280
actctgttta aaaataagac aaatgaggaa ttcagaattg gttccttgag aaatatgatg 2340
cagctatgta cacgttgctt gagcaactgt accaagaaga gtccaaataa gattgcatct 2400
ggctttttcc tgcgattgtt aacatcaaag ctaatgaatg acattgcaga tatttgtaaa 2460
agtttagcat ccttcatcaa aaagccattt gaccgtggag aagtagaatc aatggaagat 2520
gatactaatg gaaatctaat ggaggtggag gatcagtcat ccatgaatct atttaacgat 2580
taccctgata gtagtgttag tgatgcaaac gaacctggag agagccaaag taccataggt 2640
gccattaatc ctttagctga agaatatctg tcaaagcaag atctactttt cttagacatg 2700
ctcaagttct tgtgtttgtg tgtaactact gctcagacca atactgtgtc ctttagggca 2760
gctgatattc ggaggaaatt gttaatgtta attgattcta gcacgctaga acctaccaaa 2820
tccctccacc tgcatatgta tctaatgctt ttaaaggagc ttcctggaga agagtacccc 2880
ttgccaatgg aagatgttct tgaacttctg aaaccactat ccaatgtgtg ttctttgtat 2940
cgtcgtgacc aagatgtttg taaaactatt ttaaaccatg tccttcatgt agtgaaaaac 3000
ctaggtcaaa gcaatatgga ctctgagaac acaagggatg ctcaaggaca gtttcttaca 3060
gtaattggag cattttggca tctaacaaag gagaggaaat atatattctc tgtaagaatg 3120
gccctagtaa attgccttaa aactttgctt gaggctgatc cttattcaaa atgggccatt 3180
cttaatgtaa tgggaaaaga ctttcctgta aatgaagtat ttacacaatt tcttgctgac 3240
aatcatcacc aagttcgcat gttggctgca gagtcaatca atagattgtt ccaggacacg 3300
aagggagatt cttccaggtt actgaaagca cttcctttga agcttcagca aacagctttt 3360
gaaaatgcat acttgaaagc tcaggaagga atgagagaaa tgtcccatag tgctgagaac 3420
cctgaaactt tggatgaaat ttataataga aaatctgttt tactgacgtt gatagctgtg 3480
gttttatcct gtagccctat ctgcgaaaaa caggctttgt ttgccctgtg taaatctgtg 3540
aaagagaatg gattagaacc tcaccttgtg aaaaaggttt tagagaaagt ttctgaaact 3600
tttggatata gacgtttaga agactttatg gcatctcatt tagattatct ggttttggaa 3660
tggctaaatc ttcaagatac tgaatacaac ttatcttctt ttccttttat tttattaaac 3720
tacacaaata ttgaggattt ctatagatct tgttataagg ttttgattcc acatctggtg 3780
attagaagtc attttgatga ggtgaagtcc attgctaatc agattcaaga ggactggaaa 3840
agtcttctaa cagactgctt tccaaagatt cttgtaaata ttcttcctta ttttgcctat 3900
gagggtacca gagacagtgg gatggcacag caaagagaga ctgctaccaa ggtctatgat 3960
atgcttaaaa gtgaaaactt attgggaaaa cagattgatc acttattcat tagtaattta 4020
ccagagattg tggtggagtt attgatgacg ttacatgagc cagcaaattc tagtgccagt 4080
cagagcactg acctctgtga cttttcaggg gatttggatc ctgctcctaa tccacctcat 4140
tttccatcgc atgtgattaa agcaacattt gcctatatca gcaattgtca taaaaccaag 4200
ttaaaaagca ttttagaaat tctttccaaa agccctgatt cctatcagaa aattcttctt 4260
gccatatgtg agcaagcagc tgaaacaaat aatgtttata agaagcacag aattcttaaa 4320
atatatcacc tgtttgttag tttattactg aaagatataa aaagtggctt aggaggagct 4380
tgggcctttg ttcttcgaga cgttatttat actttgattc actatatcaa ccaaaggcct 4440
tcttgtatca tggatgtgtc attacgtagc ttctcccttt gttgtgactt attaagtcag 4500
gtttgccaga cagccgtgac ttactgtaag gatgctctag aaaaccatct tcatgttatt 4560
gttggtacac ttatacccct tgtgtatgag caggtggagg ttcagaaaca ggtattggac 4620
ttgttgaaat acttagtgat agataacaag gataatgaaa acctctatat cacgattaag 4680
cttttagatc cttttcctga ccatgttgtt tttaaggatt tgcgtattac tcagcaaaaa 4740
atcaaataca gtagaggacc cttttcactc ttggagttct cagga 4785
<210> 6
<211> 1595
<212> PRT
<213> Homo sapiens
<400> 6
Met Ser Leu Val Leu Asn Asp Leu Leu Ile Cys Cys Arg Gln Leu Glu
1 5 10 15
His Asp Arg Ala Thr Glu Arg Lys Lys Glu Val Glu Lys Phe Lys Arg
20 25 30
Leu Ile Arg Asp Pro Glu Thr Ile Lys His Leu Asp Arg His Ser Asp
35 40 45
Ser Lys Gln Gly Lys Tyr Leu Asn Trp Asp Ala Val Phe Arg Phe Leu
50 55 60
Gln Lys Tyr Ile Gln Lys Glu Thr Glu Cys Leu Arg Ile Ala Lys Pro
65 70 75 80
Asn Val Ser Ala Ser Thr Gln Ala Ser Arg Gln Lys Lys Met Gln Glu
85 90 95
Ile Ser Ser Leu Val Lys Tyr Phe Ile Lys Cys Ala Asn Arg Arg Ala
100 105 110
Pro Arg Leu Lys Cys Gln Glu Leu Leu Asn Tyr Ile Met Asp Thr Val
115 120 125
Lys Asp Ser Ser Asn Gly Ala Ile Tyr Gly Ala Asp Cys Ser Asn Ile
130 135 140
Leu Leu Lys Asp Ile Leu Ser Val Arg Lys Tyr Trp Cys Glu Ile Ser
145 150 155 160
Gln Gln Gln Trp Leu Glu Leu Phe Ser Val Tyr Phe Arg Leu Tyr Leu
165 170 175
Lys Pro Ser Gln Asp Val His Arg Val Leu Val Ala Arg Ile Ile His
180 185 190
Ala Val Thr Lys Gly Cys Cys Ser Gln Thr Asp Gly Leu Asn Ser Lys
195 200 205
Phe Leu Asp Phe Phe Ser Lys Ala Ile Gln Cys Ala Arg Gln Glu Lys
210 215 220
Ser Ser Ser Gly Leu Asn His Ile Leu Ala Ala Leu Thr Ile Phe Leu
225 230 235 240
Lys Thr Leu Ala Val Asn Phe Arg Ile Arg Val Cys Glu Leu Gly Asp
245 250 255
Glu Ile Leu Pro Thr Leu Leu Tyr Ile Trp Thr Gln His Arg Leu Asn
260 265 270
Asp Ser Leu Lys Glu Val Ile Ile Glu Leu Phe Gln Leu Gln Ile Tyr
275 280 285
Ile His His Pro Lys Gly Ala Lys Thr Gln Glu Lys Gly Ala Tyr Glu
290 295 300
Ser Thr Lys Trp Arg Ser Ile Leu Tyr Asn Leu Tyr Asp Leu Leu Val
305 310 315 320
Asn Glu Ile Ser His Ile Gly Ser Arg Gly Lys Tyr Ser Ser Gly Phe
325 330 335
Arg Asn Ile Ala Val Lys Glu Asn Leu Ile Glu Leu Met Ala Asp Ile
340 345 350
Cys His Gln Val Phe Asn Glu Asp Thr Arg Ser Leu Glu Ile Ser Gln
355 360 365
Ser Tyr Thr Thr Thr Gln Arg Glu Ser Ser Asp Tyr Ser Val Pro Cys
370 375 380
Lys Arg Lys Lys Ile Glu Leu Gly Trp Glu Val Ile Lys Asp His Leu
385 390 395 400
Gln Lys Ser Gln Asn Asp Phe Asp Leu Val Pro Trp Leu Gln Ile Ala
405 410 415
Thr Gln Leu Ile Ser Lys Tyr Pro Ala Ser Leu Pro Asn Cys Glu Leu
420 425 430
Ser Pro Leu Leu Met Ile Leu Ser Gln Leu Leu Pro Gln Gln Arg His
435 440 445
Gly Glu Arg Thr Pro Tyr Val Leu Arg Cys Leu Thr Glu Val Ala Leu
450 455 460
Cys Gln Asp Lys Arg Ser Asn Leu Glu Ser Ser Gln Lys Ser Asp Leu
465 470 475 480
Leu Lys Leu Trp Asn Lys Ile Trp Cys Ile Thr Phe Arg Gly Ile Ser
485 490 495
Ser Glu Gln Ile Gln Ala Glu Asn Phe Gly Leu Leu Gly Ala Ile Ile
500 505 510
Gln Gly Ser Leu Val Glu Val Asp Arg Glu Phe Trp Lys Leu Phe Thr
515 520 525
Gly Ser Ala Cys Arg Pro Ser Cys Pro Ala Val Cys Cys Leu Thr Leu
530 535 540
Ala Leu Thr Thr Ser Ile Val Pro Gly Thr Val Lys Met Gly Ile Glu
545 550 555 560
Gln Asn Met Cys Glu Val Asn Arg Ser Phe Ser Leu Lys Glu Ser Ile
565 570 575
Met Lys Trp Leu Leu Phe Tyr Gln Leu Glu Gly Asp Leu Glu Asn Ser
580 585 590
Thr Glu Val Pro Pro Ile Leu His Ser Asn Phe Pro His Leu Val Leu
595 600 605
Glu Lys Ile Leu Val Ser Leu Thr Met Lys Asn Cys Lys Ala Ala Met
610 615 620
Asn Phe Phe Gln Ser Val Pro Glu Cys Glu His His Gln Lys Asp Lys
625 630 635 640
Glu Glu Leu Ser Phe Ser Glu Val Glu Glu Leu Phe Leu Gln Thr Thr
645 650 655
Phe Asp Lys Met Asp Phe Leu Thr Ile Val Arg Glu Cys Gly Ile Glu
660 665 670
Lys His Gln Ser Ser Ile Gly Phe Ser Val His Gln Asn Leu Lys Glu
675 680 685
Ser Leu Asp Arg Cys Leu Leu Gly Leu Ser Glu Gln Leu Leu Asn Asn
690 695 700
Tyr Ser Ser Glu Ile Thr Asn Ser Glu Thr Leu Val Arg Cys Ser Arg
705 710 715 720
Leu Leu Val Gly Val Leu Gly Cys Tyr Cys Tyr Met Gly Val Ile Ala
725 730 735
Glu Glu Glu Ala Tyr Lys Ser Glu Leu Phe Gln Lys Ala Lys Ser Leu
740 745 750
Met Gln Cys Ala Gly Glu Ser Ile Thr Leu Phe Lys Asn Lys Thr Asn
755 760 765
Glu Glu Phe Arg Ile Gly Ser Leu Arg Asn Met Met Gln Leu Cys Thr
770 775 780
Arg Cys Leu Ser Asn Cys Thr Lys Lys Ser Pro Asn Lys Ile Ala Ser
785 790 795 800
Gly Phe Phe Leu Arg Leu Leu Thr Ser Lys Leu Met Asn Asp Ile Ala
805 810 815
Asp Ile Cys Lys Ser Leu Ala Ser Phe Ile Lys Lys Pro Phe Asp Arg
820 825 830
Gly Glu Val Glu Ser Met Glu Asp Asp Thr Asn Gly Asn Leu Met Glu
835 840 845
Val Glu Asp Gln Ser Ser Met Asn Leu Phe Asn Asp Tyr Pro Asp Ser
850 855 860
Ser Val Ser Asp Ala Asn Glu Pro Gly Glu Ser Gln Ser Thr Ile Gly
865 870 875 880
Ala Ile Asn Pro Leu Ala Glu Glu Tyr Leu Ser Lys Gln Asp Leu Leu
885 890 895
Phe Leu Asp Met Leu Lys Phe Leu Cys Leu Cys Val Thr Thr Ala Gln
900 905 910
Thr Asn Thr Val Ser Phe Arg Ala Ala Asp Ile Arg Arg Lys Leu Leu
915 920 925
Met Leu Ile Asp Ser Ser Thr Leu Glu Pro Thr Lys Ser Leu His Leu
930 935 940
His Met Tyr Leu Met Leu Leu Lys Glu Leu Pro Gly Glu Glu Tyr Pro
945 950 955 960
Leu Pro Met Glu Asp Val Leu Glu Leu Leu Lys Pro Leu Ser Asn Val
965 970 975
Cys Ser Leu Tyr Arg Arg Asp Gln Asp Val Cys Lys Thr Ile Leu Asn
980 985 990
His Val Leu His Val Val Lys Asn Leu Gly Gln Ser Asn Met Asp Ser
995 1000 1005
Glu Asn Thr Arg Asp Ala Gln Gly Gln Phe Leu Thr Val Ile Gly
1010 1015 1020
Ala Phe Trp His Leu Thr Lys Glu Arg Lys Tyr Ile Phe Ser Val
1025 1030 1035
Arg Met Ala Leu Val Asn Cys Leu Lys Thr Leu Leu Glu Ala Asp
1040 1045 1050
Pro Tyr Ser Lys Trp Ala Ile Leu Asn Val Met Gly Lys Asp Phe
1055 1060 1065
Pro Val Asn Glu Val Phe Thr Gln Phe Leu Ala Asp Asn His His
1070 1075 1080
Gln Val Arg Met Leu Ala Ala Glu Ser Ile Asn Arg Leu Phe Gln
1085 1090 1095
Asp Thr Lys Gly Asp Ser Ser Arg Leu Leu Lys Ala Leu Pro Leu
1100 1105 1110
Lys Leu Gln Gln Thr Ala Phe Glu Asn Ala Tyr Leu Lys Ala Gln
1115 1120 1125
Glu Gly Met Arg Glu Met Ser His Ser Ala Glu Asn Pro Glu Thr
1130 1135 1140
Leu Asp Glu Ile Tyr Asn Arg Lys Ser Val Leu Leu Thr Leu Ile
1145 1150 1155
Ala Val Val Leu Ser Cys Ser Pro Ile Cys Glu Lys Gln Ala Leu
1160 1165 1170
Phe Ala Leu Cys Lys Ser Val Lys Glu Asn Gly Leu Glu Pro His
1175 1180 1185
Leu Val Lys Lys Val Leu Glu Lys Val Ser Glu Thr Phe Gly Tyr
1190 1195 1200
Arg Arg Leu Glu Asp Phe Met Ala Ser His Leu Asp Tyr Leu Val
1205 1210 1215
Leu Glu Trp Leu Asn Leu Gln Asp Thr Glu Tyr Asn Leu Ser Ser
1220 1225 1230
Phe Pro Phe Ile Leu Leu Asn Tyr Thr Asn Ile Glu Asp Phe Tyr
1235 1240 1245
Arg Ser Cys Tyr Lys Val Leu Ile Pro His Leu Val Ile Arg Ser
1250 1255 1260
His Phe Asp Glu Val Lys Ser Ile Ala Asn Gln Ile Gln Glu Asp
1265 1270 1275
Trp Lys Ser Leu Leu Thr Asp Cys Phe Pro Lys Ile Leu Val Asn
1280 1285 1290
Ile Leu Pro Tyr Phe Ala Tyr Glu Gly Thr Arg Asp Ser Gly Met
1295 1300 1305
Ala Gln Gln Arg Glu Thr Ala Thr Lys Val Tyr Asp Met Leu Lys
1310 1315 1320
Ser Glu Asn Leu Leu Gly Lys Gln Ile Asp His Leu Phe Ile Ser
1325 1330 1335
Asn Leu Pro Glu Ile Val Val Glu Leu Leu Met Thr Leu His Glu
1340 1345 1350
Pro Ala Asn Ser Ser Ala Ser Gln Ser Thr Asp Leu Cys Asp Phe
1355 1360 1365
Ser Gly Asp Leu Asp Pro Ala Pro Asn Pro Pro His Phe Pro Ser
1370 1375 1380
His Val Ile Lys Ala Thr Phe Ala Tyr Ile Ser Asn Cys His Lys
1385 1390 1395
Thr Lys Leu Lys Ser Ile Leu Glu Ile Leu Ser Lys Ser Pro Asp
1400 1405 1410
Ser Tyr Gln Lys Ile Leu Leu Ala Ile Cys Glu Gln Ala Ala Glu
1415 1420 1425
Thr Asn Asn Val Tyr Lys Lys His Arg Ile Leu Lys Ile Tyr His
1430 1435 1440
Leu Phe Val Ser Leu Leu Leu Lys Asp Ile Lys Ser Gly Leu Gly
1445 1450 1455
Gly Ala Trp Ala Phe Val Leu Arg Asp Val Ile Tyr Thr Leu Ile
1460 1465 1470
His Tyr Ile Asn Gln Arg Pro Ser Cys Ile Met Asp Val Ser Leu
1475 1480 1485
Arg Ser Phe Ser Leu Cys Cys Asp Leu Leu Ser Gln Val Cys Gln
1490 1495 1500
Thr Ala Val Thr Tyr Cys Lys Asp Ala Leu Glu Asn His Leu His
1505 1510 1515
Val Ile Val Gly Thr Leu Ile Pro Leu Val Tyr Glu Gln Val Glu
1520 1525 1530
Val Gln Lys Gln Val Leu Asp Leu Leu Lys Tyr Leu Val Ile Asp
1535 1540 1545
Asn Lys Asp Asn Glu Asn Leu Tyr Ile Thr Ile Lys Leu Leu Asp
1550 1555 1560
Pro Phe Pro Asp His Val Val Phe Lys Asp Leu Arg Ile Thr Gln
1565 1570 1575
Gln Lys Ile Lys Tyr Ser Arg Gly Pro Phe Ser Leu Leu Glu Phe
1580 1585 1590
Ser Gly
1595
<210> 7
<211> 20
<212> DNA
<213> Artificial
<220>
<223> 引物
<400> 7
gtactgccca ccagaacctt 20
<210> 8
<211> 23
<212> DNA
<213> Artificial
<220>
<223> 引物
<400> 8
ccaaactgct atcctgaata tgg 23
<210> 9
<211> 20
<212> DNA
<213> Artificial
<220>
<223> 引物
<400> 9
attttgcttg gggccataat 20
<210> 10
<211> 21
<212> DNA
<213> Artificial
<220>
<223> 引物
<400> 10
cagacagagt gctttctttg g 21
<210> 11
<211> 20
<212> DNA
<213> Artificial
<220>
<223> 引物
<400> 11
agggtttgca gtggaagaaa 20
<210> 12
<211> 21
<212> DNA
<213> Artificial
<220>
<223> 引物
<400> 12
tgacaatgaa accaagagca a 21
<210> 13
<211> 20
<212> DNA
<213> Artificial
<220>
<223> 引物
<400> 13
tggcaaggtg agtatgttgg 20
<210> 14
<211> 20
<212> DNA
<213> Artificial
<220>
<223> 引物
<400> 14
acatgcgaac ttggtgatga 20
<210> 15
<211> 20
<212> DNA
<213> Artificial
<220>
<223> 引物
<400> 15
tgcttgacct tcaatgctgt 20
<210> 16
<211> 20
<212> DNA
<213> Artificial
<220>
<223> 引物
<400> 16
ggcaatggaa aagaaagcag 20
<210> 17
<211> 25
<212> DNA
<213> Artificial
<220>
<223> QPCR引物
<400> 17
tgatccactc tttgactgga ccatg 25
<210> 18
<211> 25
<212> DNA
<213> Artificial
<220>
<223> QPCR引物
<400> 18
gcattcagag tagggtgaag ctcag 25
<210> 19
<211> 24
<212> DNA
<213> Artificial
<220>
<223> QPCR引物
<400> 19
gccagtatgg tgaaaccctg tctc 24
<210> 20
<211> 23
<212> DNA
<213> Artificial
<220>
<223> QPCR引物
<400> 20
agattctcct gcctcagcct ctc 23
<210> 21
<211> 20
<212> DNA
<213> Artificial
<220>
<223> 引物
<400> 21
ccccttgtgt atgagcaggt 20
<210> 22
<211> 20
<212> DNA
<213> Artificial
<220>
<223> 引物
<400> 22
acttctccca agcagcttcc 20
Claims (7)
1.一种分离的编码ATM基因突变体的核酸,其特征在于,与SEQ ID NO:1相比,所述核酸具有选自下列的至少一种突变:c.4777-2A>T、c.3078G>T、c.7983_7985delTGT和c.477_481delATCTC。
2.一种分离的多肽,其特征在于,与SEQ ID NO:2相比,所述分离的多肽具有选自下列的至少一种突变:p.Glu2950fs、p.Trp1026Cys、p.Val2662del和p.Ile159fs。
3.根据权利要求2所述的分离的多肽,所述多肽是由权利要求1所述的核酸编码的。
4.一种用于筛选易患共济失调毛细血管扩张症的生物样品的试剂盒,其特征在于,含有:
适于检测ATM基因突变体的试剂,其中与SEQ ID NO:1相比,所述ATM基因突变体具有选自下列的至少一种突变:c.4777-2A>T、c.3078G>T、c.7983_7985delTGT和c.477_481delATCTC,
所述试剂为核酸探针或引物,
所述核酸探针或引物具有如SEQ ID NO:7-16所示的核苷酸序列。
5.根据权利要求4所述的试剂盒,其特征在于,所述共济失调毛细血管扩张症为常染色体隐性遗传疾病。
6.一种构建体,其特征在于,包含权利要求1所述的分离的编码ATM基因突变体的核酸。
7.一种重组细胞,其特征在于,所述重组细胞是通过权利要求6所述的构建体转化受体细胞而获得的。
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201611130281.8A CN106701709B (zh) | 2013-05-20 | 2013-05-20 | Atm基因突变体及其应用 |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201611130281.8A CN106701709B (zh) | 2013-05-20 | 2013-05-20 | Atm基因突变体及其应用 |
CN201310188517.3A CN104178487B (zh) | 2013-05-20 | 2013-05-20 | Atm基因突变体及其应用 |
Related Parent Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201310188517.3A Division CN104178487B (zh) | 2013-05-20 | 2013-05-20 | Atm基因突变体及其应用 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN106701709A CN106701709A (zh) | 2017-05-24 |
CN106701709B true CN106701709B (zh) | 2020-02-07 |
Family
ID=51959835
Family Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201310188517.3A Active CN104178487B (zh) | 2013-05-20 | 2013-05-20 | Atm基因突变体及其应用 |
CN201611130281.8A Active CN106701709B (zh) | 2013-05-20 | 2013-05-20 | Atm基因突变体及其应用 |
Family Applications Before (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201310188517.3A Active CN104178487B (zh) | 2013-05-20 | 2013-05-20 | Atm基因突变体及其应用 |
Country Status (1)
Country | Link |
---|---|
CN (2) | CN104178487B (zh) |
Families Citing this family (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN105779460A (zh) * | 2014-12-22 | 2016-07-20 | 深圳华大基因研究院 | 分离的编码acd突变体的核酸及其应用 |
US10619208B2 (en) * | 2015-01-19 | 2020-04-14 | Erydel S.P.A. | Method of evaluating the response of ataxia telangiectasia patients to glucocorticoids treatment |
CN105985963B (zh) * | 2015-05-21 | 2017-09-22 | 深圳华大基因研究院 | Crx基因突变体及其应用 |
CN110272912B (zh) * | 2019-07-16 | 2021-04-06 | 北京林业大学 | 蛋白激酶atm基因序列、应用及调控植物叶片衰老进程的方法 |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102154509A (zh) * | 2010-12-28 | 2011-08-17 | 深圳华大基因科技有限公司 | 用于检测clcn1基因突变的方法、试剂盒及特异性引物 |
Family Cites Families (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1314381A (zh) * | 2000-03-22 | 2001-09-26 | 上海博德基因开发有限公司 | 一种新的多肽——人运动性共济失调-毛细管扩张atm蛋白20和编码这种多肽的多核苷税 |
CN102206701B (zh) * | 2010-09-19 | 2015-01-21 | 深圳华大基因科技有限公司 | 遗传性疾病相关基因的鉴定方法 |
CN103031365B (zh) * | 2011-09-30 | 2014-11-26 | 益善生物技术股份有限公司 | 一种atm基因突变检测特异性引物和液相芯片 |
-
2013
- 2013-05-20 CN CN201310188517.3A patent/CN104178487B/zh active Active
- 2013-05-20 CN CN201611130281.8A patent/CN106701709B/zh active Active
Patent Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102154509A (zh) * | 2010-12-28 | 2011-08-17 | 深圳华大基因科技有限公司 | 用于检测clcn1基因突变的方法、试剂盒及特异性引物 |
Non-Patent Citations (3)
Title |
---|
Using next-generation sequencing as a genetic diagnostic tool in rare autosomal recessive neurologic Mendelian disorders;Zhao Chen,et al;《Neurobiology of Aging》;20130530;2442.e11-e17 * |
中国人共济失调毛细血管扩张症ATM基因突变研究;江泓等;《中华医学遗传学杂志》;20050430;第22卷(第2期);121-124页 * |
中国汉族人ATM基因的单核苷酸多态与点突变;汤洪伟等;《中华医学遗传学杂志》;20041231;第21卷(第6期);579-582页 * |
Also Published As
Publication number | Publication date |
---|---|
CN104178487B (zh) | 2017-09-19 |
CN106701709A (zh) | 2017-05-24 |
CN104178487A (zh) | 2014-12-03 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Wu et al. | Delineation of the common critical region in Williams syndrome and clinical correlation of growth, heart defects, ethnicity, and parental origin | |
EP3090068B1 (en) | Method and kit for non-invasively detecting egfr gene mutations | |
Hitzemann et al. | A strategy for the integration of QTL, gene expression, and sequence analyses | |
KR101693387B1 (ko) | 비정상적인 미토콘드리아 dna, 그것에 연관된 융합 트랜스크립트 및 혼성화 프로브 | |
CN110541025B (zh) | 杜氏肌营养不良基因缺陷的检测方法、引物组合物及试剂盒 | |
JPH09512702A (ja) | Dnaミスマッチ修復遺伝子に関する組成物および方法 | |
CN101679971A (zh) | 青光眼恶化风险的判定方法 | |
CN106701709B (zh) | Atm基因突变体及其应用 | |
CN109055518A (zh) | 用于常见遗传病检测的探针、基因芯片及制备方法和应用 | |
AU2016351311B9 (en) | SCAP gene mutant and the application thereof | |
CN104745592B (zh) | Cyp4v2基因突变体及其应用 | |
CN113355332A (zh) | Heg1基因突变体及其应用 | |
KR101985864B1 (ko) | 유방암 및 난소암 등 암 진단용 조성물 및 이의 용도 | |
CN109943569B (zh) | 分离的编码ifnlr1突变体的核酸及其应用 | |
CN104745591B (zh) | Cyp4v2基因突变体及其应用 | |
Aten et al. | Split hand‐foot malformation, tetralogy of Fallot, mental retardation and a 1 Mb 19p deletion—evidence for further heterogeneity? | |
CN114875148A (zh) | 一种家族性多发性脂肪瘤检测试剂盒及引物组的应用 | |
CN107475420A (zh) | 无头精子症致病新基因及其应用 | |
CN110878346B (zh) | 基因突变体及其应用 | |
US20020137079A1 (en) | Isolated polynucleotide associated with type II diabetes mellitus and methods of use thereof | |
JP5897704B2 (ja) | ブラキスパイナ突然変異の検出 | |
CN105802974B (zh) | Bcs1l基因突变体及其应用 | |
JP2005511095A (ja) | D−アミノ酸オキシダーゼのバイアレリック・マーカー及びその用途 | |
CN113151287B (zh) | 双肾缺如/双肾发育不良致病新基因及其应用 | |
NL2030347B1 (en) | Primer set for detecting mutations of human serpinf1 gene and kit thereof |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |