CA2362300A1 - Human lipid-associated proteins - Google Patents
Human lipid-associated proteins Download PDFInfo
- Publication number
- CA2362300A1 CA2362300A1 CA002362300A CA2362300A CA2362300A1 CA 2362300 A1 CA2362300 A1 CA 2362300A1 CA 002362300 A CA002362300 A CA 002362300A CA 2362300 A CA2362300 A CA 2362300A CA 2362300 A1 CA2362300 A1 CA 2362300A1
- Authority
- CA
- Canada
- Prior art keywords
- seq
- leu
- ser
- glu
- lys
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Links
- 108090000623 proteins and genes Proteins 0.000 title abstract description 115
- 102000004169 proteins and genes Human genes 0.000 title abstract description 70
- 241000282414 Homo sapiens Species 0.000 title abstract description 53
- 150000002632 lipids Chemical class 0.000 title abstract description 26
- 238000000034 method Methods 0.000 claims abstract description 159
- 102000040430 polynucleotide Human genes 0.000 claims abstract description 151
- 108091033319 polynucleotide Proteins 0.000 claims abstract description 151
- 239000002157 polynucleotide Substances 0.000 claims abstract description 151
- 230000014509 gene expression Effects 0.000 claims abstract description 77
- 239000005557 antagonist Substances 0.000 claims abstract description 17
- 239000000556 agonist Substances 0.000 claims abstract description 15
- 239000012634 fragment Substances 0.000 claims description 113
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 99
- 102000004196 processed proteins & peptides Human genes 0.000 claims description 82
- 229920001184 polypeptide Polymers 0.000 claims description 75
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 73
- 239000000523 sample Substances 0.000 claims description 67
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 claims description 65
- 238000009396 hybridization Methods 0.000 claims description 56
- 239000002773 nucleotide Substances 0.000 claims description 56
- 125000003729 nucleotide group Chemical group 0.000 claims description 56
- 230000000295 complement effect Effects 0.000 claims description 35
- 230000000694 effects Effects 0.000 claims description 33
- 150000001875 compounds Chemical class 0.000 claims description 32
- 201000010099 disease Diseases 0.000 claims description 26
- 239000008194 pharmaceutical composition Substances 0.000 claims description 25
- 238000004519 manufacturing process Methods 0.000 claims description 19
- 238000011282 treatment Methods 0.000 claims description 18
- 238000012216 screening Methods 0.000 claims description 15
- 230000002163 immunogen Effects 0.000 claims description 14
- 230000003247 decreasing effect Effects 0.000 claims description 9
- 239000000546 pharmaceutical excipient Substances 0.000 claims description 8
- 230000009261 transgenic effect Effects 0.000 claims description 5
- 238000012258 culturing Methods 0.000 claims description 2
- 230000002018 overexpression Effects 0.000 claims description 2
- 239000013604 expression vector Substances 0.000 abstract description 18
- 210000004027 cell Anatomy 0.000 description 88
- 150000007523 nucleic acids Chemical class 0.000 description 72
- 108020004414 DNA Proteins 0.000 description 62
- 235000018102 proteins Nutrition 0.000 description 62
- 108091028043 Nucleic acid sequence Proteins 0.000 description 50
- HVYWMOMLDIMFJA-DPAQBDIFSA-N cholesterol Chemical compound C1C=C2C[C@@H](O)CC[C@]2(C)[C@@H]2[C@@H]1[C@@H]1CC[C@H]([C@H](C)CCCC(C)C)[C@@]1(C)CC2 HVYWMOMLDIMFJA-DPAQBDIFSA-N 0.000 description 47
- 239000002299 complementary DNA Substances 0.000 description 45
- 239000013598 vector Substances 0.000 description 43
- 208000035475 disorder Diseases 0.000 description 39
- 239000013615 primer Substances 0.000 description 38
- 102000039446 nucleic acids Human genes 0.000 description 35
- 108020004707 nucleic acids Proteins 0.000 description 35
- 108091034117 Oligonucleotide Proteins 0.000 description 31
- 210000001519 tissue Anatomy 0.000 description 31
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 30
- 235000001014 amino acid Nutrition 0.000 description 29
- 230000027455 binding Effects 0.000 description 28
- 239000000203 mixture Substances 0.000 description 28
- 229940024606 amino acid Drugs 0.000 description 25
- 150000001413 amino acids Chemical class 0.000 description 25
- 230000015572 biosynthetic process Effects 0.000 description 24
- 238000004458 analytical method Methods 0.000 description 23
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 22
- 239000013612 plasmid Substances 0.000 description 22
- 238000000746 purification Methods 0.000 description 21
- 235000012000 cholesterol Nutrition 0.000 description 19
- 238000005516 engineering process Methods 0.000 description 19
- 238000002869 basic local alignment search tool Methods 0.000 description 18
- 239000012528 membrane Substances 0.000 description 18
- 238000012163 sequencing technique Methods 0.000 description 18
- 238000003556 assay Methods 0.000 description 17
- 238000006467 substitution reaction Methods 0.000 description 17
- 230000002068 genetic effect Effects 0.000 description 16
- 239000000758 substrate Substances 0.000 description 16
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 14
- 239000002585 base Substances 0.000 description 14
- 238000013518 transcription Methods 0.000 description 14
- 230000035897 transcription Effects 0.000 description 14
- -1 GMZ ganglioside Chemical class 0.000 description 13
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 13
- 238000006243 chemical reaction Methods 0.000 description 13
- 239000003814 drug Substances 0.000 description 13
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 13
- 230000001105 regulatory effect Effects 0.000 description 13
- 239000000243 solution Substances 0.000 description 13
- 238000003786 synthesis reaction Methods 0.000 description 13
- 238000002493 microarray Methods 0.000 description 12
- 238000002360 preparation method Methods 0.000 description 12
- 230000001225 therapeutic effect Effects 0.000 description 12
- 230000003321 amplification Effects 0.000 description 11
- 108020004999 messenger RNA Proteins 0.000 description 11
- 238000003199 nucleic acid amplification method Methods 0.000 description 11
- 230000014616 translation Effects 0.000 description 11
- 230000003612 virological effect Effects 0.000 description 11
- 102000004190 Enzymes Human genes 0.000 description 10
- 230000000692 anti-sense effect Effects 0.000 description 10
- 229940088598 enzyme Drugs 0.000 description 10
- 230000006870 function Effects 0.000 description 10
- 108020001507 fusion proteins Proteins 0.000 description 10
- 102000037865 fusion proteins Human genes 0.000 description 10
- 239000003550 marker Substances 0.000 description 10
- 238000013519 translation Methods 0.000 description 10
- 108090000790 Enzymes Proteins 0.000 description 9
- 241000880493 Leptailurus serval Species 0.000 description 9
- 241001465754 Metazoa Species 0.000 description 9
- 108010038807 Oligopeptides Proteins 0.000 description 9
- 102000015636 Oligopeptides Human genes 0.000 description 9
- 108091093037 Peptide nucleic acid Proteins 0.000 description 9
- 125000000539 amino acid group Chemical group 0.000 description 9
- 230000001580 bacterial effect Effects 0.000 description 9
- 239000003153 chemical reaction reagent Substances 0.000 description 9
- 239000003795 chemical substances by application Substances 0.000 description 9
- 238000000338 in vitro Methods 0.000 description 9
- 230000004048 modification Effects 0.000 description 9
- 238000012986 modification Methods 0.000 description 9
- 230000008569 process Effects 0.000 description 9
- 241000894007 species Species 0.000 description 9
- 201000001320 Atherosclerosis Diseases 0.000 description 8
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 8
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 8
- 241000283973 Oryctolagus cuniculus Species 0.000 description 8
- 230000002759 chromosomal effect Effects 0.000 description 8
- 210000000349 chromosome Anatomy 0.000 description 8
- 230000000875 corresponding effect Effects 0.000 description 8
- 108010049041 glutamylalanine Proteins 0.000 description 8
- 239000005090 green fluorescent protein Substances 0.000 description 8
- 108010034529 leucyl-lysine Proteins 0.000 description 8
- 208000024172 Cardiovascular disease Diseases 0.000 description 7
- 108090000994 Catalytic RNA Proteins 0.000 description 7
- 102000053642 Catalytic RNA Human genes 0.000 description 7
- 108020004635 Complementary DNA Proteins 0.000 description 7
- 241000196324 Embryophyta Species 0.000 description 7
- 208000018522 Gastrointestinal disease Diseases 0.000 description 7
- 102000005720 Glutathione transferase Human genes 0.000 description 7
- 108010070675 Glutathione transferase Proteins 0.000 description 7
- 208000019693 Lung disease Diseases 0.000 description 7
- 208000012902 Nervous system disease Diseases 0.000 description 7
- 238000007792 addition Methods 0.000 description 7
- 238000003776 cleavage reaction Methods 0.000 description 7
- 238000003745 diagnosis Methods 0.000 description 7
- 230000002526 effect on cardiovascular system Effects 0.000 description 7
- 108010092114 histidylphenylalanine Proteins 0.000 description 7
- 108010085325 histidylproline Proteins 0.000 description 7
- 239000003112 inhibitor Substances 0.000 description 7
- 239000007788 liquid Substances 0.000 description 7
- 108010009298 lysylglutamic acid Proteins 0.000 description 7
- 238000013507 mapping Methods 0.000 description 7
- 239000002245 particle Substances 0.000 description 7
- 239000002987 primer (paints) Substances 0.000 description 7
- 238000011160 research Methods 0.000 description 7
- 108091092562 ribozyme Proteins 0.000 description 7
- 230000007017 scission Effects 0.000 description 7
- 108010026333 seryl-proline Proteins 0.000 description 7
- 239000000126 substance Substances 0.000 description 7
- 235000000346 sugar Nutrition 0.000 description 7
- 108010061238 threonyl-glycine Proteins 0.000 description 7
- 108091026890 Coding region Proteins 0.000 description 6
- 206010028980 Neoplasm Diseases 0.000 description 6
- ISWSIDIOOBJBQZ-UHFFFAOYSA-N Phenol Chemical compound OC1=CC=CC=C1 ISWSIDIOOBJBQZ-UHFFFAOYSA-N 0.000 description 6
- DBMJMQXJHONAFJ-UHFFFAOYSA-M Sodium laurylsulphate Chemical compound [Na+].CCCCCCCCCCCCOS([O-])(=O)=O DBMJMQXJHONAFJ-UHFFFAOYSA-M 0.000 description 6
- 108010047495 alanylglycine Proteins 0.000 description 6
- 239000000427 antigen Substances 0.000 description 6
- 108091007433 antigens Proteins 0.000 description 6
- 102000036639 antigens Human genes 0.000 description 6
- 230000004071 biological effect Effects 0.000 description 6
- 238000010367 cloning Methods 0.000 description 6
- 238000012217 deletion Methods 0.000 description 6
- 230000037430 deletion Effects 0.000 description 6
- 238000001514 detection method Methods 0.000 description 6
- 201000006549 dyspepsia Diseases 0.000 description 6
- 238000009472 formulation Methods 0.000 description 6
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 6
- 238000003018 immunoassay Methods 0.000 description 6
- 230000001939 inductive effect Effects 0.000 description 6
- 208000015181 infectious disease Diseases 0.000 description 6
- 230000003993 interaction Effects 0.000 description 6
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 6
- 108010051673 leucyl-glycyl-phenylalanine Proteins 0.000 description 6
- 230000037356 lipid metabolism Effects 0.000 description 6
- 210000004185 liver Anatomy 0.000 description 6
- 230000035772 mutation Effects 0.000 description 6
- 230000000926 neurological effect Effects 0.000 description 6
- 238000012545 processing Methods 0.000 description 6
- 239000000725 suspension Substances 0.000 description 6
- RWQNBRDOKXIBIV-UHFFFAOYSA-N thymine Chemical compound CC1=CNC(=O)NC1=O RWQNBRDOKXIBIV-UHFFFAOYSA-N 0.000 description 6
- 241000894006 Bacteria Species 0.000 description 5
- 108010010234 HDL Lipoproteins Proteins 0.000 description 5
- 102000015779 HDL Lipoproteins Human genes 0.000 description 5
- 206010019708 Hepatic steatosis Diseases 0.000 description 5
- 241000282412 Homo Species 0.000 description 5
- ZDXPYRJPNDTMRX-VKHMYHEASA-N L-glutamine Chemical compound OC(=O)[C@@H](N)CCC(N)=O ZDXPYRJPNDTMRX-VKHMYHEASA-N 0.000 description 5
- XOWMDXHFSBCAKQ-SRVKXCTJSA-N Leu-Ser-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C XOWMDXHFSBCAKQ-SRVKXCTJSA-N 0.000 description 5
- 102100024640 Low-density lipoprotein receptor Human genes 0.000 description 5
- 241000124008 Mammalia Species 0.000 description 5
- 208000025966 Neurological disease Diseases 0.000 description 5
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 5
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 5
- 108010087924 alanylproline Proteins 0.000 description 5
- 230000000890 antigenic effect Effects 0.000 description 5
- 108010062796 arginyllysine Proteins 0.000 description 5
- 210000001124 body fluid Anatomy 0.000 description 5
- 238000004113 cell culture Methods 0.000 description 5
- 230000001413 cellular effect Effects 0.000 description 5
- 229940079593 drug Drugs 0.000 description 5
- 239000003937 drug carrier Substances 0.000 description 5
- 239000003623 enhancer Substances 0.000 description 5
- 239000000284 extract Substances 0.000 description 5
- 238000000684 flow cytometry Methods 0.000 description 5
- 239000000499 gel Substances 0.000 description 5
- 108010010147 glycylglutamine Proteins 0.000 description 5
- 108010050848 glycylleucine Proteins 0.000 description 5
- 108010015792 glycyllysine Proteins 0.000 description 5
- 108010028295 histidylhistidine Proteins 0.000 description 5
- 108010025306 histidylleucine Proteins 0.000 description 5
- 230000028993 immune response Effects 0.000 description 5
- 238000001727 in vivo Methods 0.000 description 5
- 238000003780 insertion Methods 0.000 description 5
- 230000037431 insertion Effects 0.000 description 5
- 108010056582 methionylglutamic acid Proteins 0.000 description 5
- 229920000642 polymer Polymers 0.000 description 5
- 230000002829 reductive effect Effects 0.000 description 5
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 5
- 231100000240 steatosis hepatitis Toxicity 0.000 description 5
- 238000012360 testing method Methods 0.000 description 5
- 230000009466 transformation Effects 0.000 description 5
- 241000701447 unidentified baculovirus Species 0.000 description 5
- 208000030507 AIDS Diseases 0.000 description 4
- 206010002383 Angina Pectoris Diseases 0.000 description 4
- UZSQXCMNUPKLCC-FJXKBIBVSA-N Arg-Thr-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UZSQXCMNUPKLCC-FJXKBIBVSA-N 0.000 description 4
- HEDRZPFGACZZDS-UHFFFAOYSA-N Chloroform Chemical compound ClC(Cl)Cl HEDRZPFGACZZDS-UHFFFAOYSA-N 0.000 description 4
- 206010008635 Cholestasis Diseases 0.000 description 4
- 108020004705 Codon Proteins 0.000 description 4
- 102000053602 DNA Human genes 0.000 description 4
- 238000001712 DNA sequencing Methods 0.000 description 4
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 4
- 108091060211 Expressed sequence tag Proteins 0.000 description 4
- 206010016654 Fibrosis Diseases 0.000 description 4
- 201000011240 Frontotemporal dementia Diseases 0.000 description 4
- IRXNJYPKBVERCW-DCAQKATOSA-N Glu-Leu-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IRXNJYPKBVERCW-DCAQKATOSA-N 0.000 description 4
- ULZCYBYDTUMHNF-IUCAKERBSA-N Gly-Leu-Glu Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ULZCYBYDTUMHNF-IUCAKERBSA-N 0.000 description 4
- 101710154606 Hemagglutinin Proteins 0.000 description 4
- 102100026122 High affinity immunoglobulin gamma Fc receptor I Human genes 0.000 description 4
- 101000913074 Homo sapiens High affinity immunoglobulin gamma Fc receptor I Proteins 0.000 description 4
- 208000035150 Hypercholesterolemia Diseases 0.000 description 4
- 208000031226 Hyperlipidaemia Diseases 0.000 description 4
- KXUKTDGKLAOCQK-LSJOCFKGSA-N Ile-Val-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O KXUKTDGKLAOCQK-LSJOCFKGSA-N 0.000 description 4
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 4
- IBMVEYRWAWIOTN-UHFFFAOYSA-N L-Leucyl-L-Arginyl-L-Proline Natural products CC(C)CC(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O IBMVEYRWAWIOTN-UHFFFAOYSA-N 0.000 description 4
- LHSGPCFBGJHPCY-UHFFFAOYSA-N L-leucine-L-tyrosine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LHSGPCFBGJHPCY-UHFFFAOYSA-N 0.000 description 4
- 108010001831 LDL receptors Proteins 0.000 description 4
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 4
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 4
- UWKNTTJNVSYXPC-CIUDSAMLSA-N Lys-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN UWKNTTJNVSYXPC-CIUDSAMLSA-N 0.000 description 4
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 4
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 4
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 4
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 4
- 208000014060 Niemann-Pick disease Diseases 0.000 description 4
- 101710093908 Outer capsid protein VP4 Proteins 0.000 description 4
- 101710135467 Outer capsid protein sigma-1 Proteins 0.000 description 4
- ZYFVNVRFVHJEIU-UHFFFAOYSA-N PicoGreen Chemical compound CN(C)CCCN(CCCN(C)C)C1=CC(=CC2=[N+](C3=CC=CC=C3S2)C)C2=CC=CC=C2N1C1=CC=CC=C1 ZYFVNVRFVHJEIU-UHFFFAOYSA-N 0.000 description 4
- 101710176177 Protein A56 Proteins 0.000 description 4
- 206010037394 Pulmonary haemorrhage Diseases 0.000 description 4
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 4
- 102000009822 Sterol Regulatory Element Binding Proteins Human genes 0.000 description 4
- 108010020396 Sterol Regulatory Element Binding Proteins Proteins 0.000 description 4
- ISAKRJDGNUQOIC-UHFFFAOYSA-N Uracil Chemical group O=C1C=CNC(=O)N1 ISAKRJDGNUQOIC-UHFFFAOYSA-N 0.000 description 4
- 241000700605 Viruses Species 0.000 description 4
- 239000004480 active ingredient Substances 0.000 description 4
- 239000002671 adjuvant Substances 0.000 description 4
- 108010005233 alanylglutamic acid Proteins 0.000 description 4
- 238000000137 annealing Methods 0.000 description 4
- 238000013459 approach Methods 0.000 description 4
- 108010013835 arginine glutamate Proteins 0.000 description 4
- 108010001271 arginyl-glutamyl-arginine Proteins 0.000 description 4
- 108010077245 asparaginyl-proline Proteins 0.000 description 4
- 108010047857 aspartylglycine Proteins 0.000 description 4
- 230000003115 biocidal effect Effects 0.000 description 4
- 230000033228 biological regulation Effects 0.000 description 4
- 239000010839 body fluid Substances 0.000 description 4
- 239000000872 buffer Substances 0.000 description 4
- 239000002775 capsule Substances 0.000 description 4
- 150000001720 carbohydrates Chemical class 0.000 description 4
- 235000014633 carbohydrates Nutrition 0.000 description 4
- 210000003169 central nervous system Anatomy 0.000 description 4
- 230000008859 change Effects 0.000 description 4
- 230000007870 cholestasis Effects 0.000 description 4
- 231100000359 cholestasis Toxicity 0.000 description 4
- 230000007423 decrease Effects 0.000 description 4
- 238000011161 development Methods 0.000 description 4
- 239000008298 dragée Substances 0.000 description 4
- 206010014665 endocarditis Diseases 0.000 description 4
- 230000002255 enzymatic effect Effects 0.000 description 4
- RWSXRVCMGQZWBV-WDSKDSINSA-N glutathione Chemical compound OC(=O)[C@@H](N)CCC(=O)N[C@@H](CS)C(=O)NCC(O)=O RWSXRVCMGQZWBV-WDSKDSINSA-N 0.000 description 4
- PEDCQBHIVMGVHV-UHFFFAOYSA-N glycerol group Chemical group OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 4
- 230000013595 glycosylation Effects 0.000 description 4
- 238000006206 glycosylation reaction Methods 0.000 description 4
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Chemical compound NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 4
- 108010037850 glycylvaline Proteins 0.000 description 4
- UYTPUPDQBNUYGX-UHFFFAOYSA-N guanine Chemical compound O=C1NC(N)=NC2=C1N=CN2 UYTPUPDQBNUYGX-UHFFFAOYSA-N 0.000 description 4
- 208000019622 heart disease Diseases 0.000 description 4
- 239000000185 hemagglutinin Substances 0.000 description 4
- 108010018006 histidylserine Proteins 0.000 description 4
- 210000000688 human artificial chromosome Anatomy 0.000 description 4
- 208000006575 hypertriglyceridemia Diseases 0.000 description 4
- 238000002347 injection Methods 0.000 description 4
- 239000007924 injection Substances 0.000 description 4
- 108010045069 keyhole-limpet hemocyanin Proteins 0.000 description 4
- 238000002372 labelling Methods 0.000 description 4
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 4
- 239000002502 liposome Substances 0.000 description 4
- 210000004072 lung Anatomy 0.000 description 4
- 239000006166 lysate Substances 0.000 description 4
- 239000000463 material Substances 0.000 description 4
- 230000001404 mediated effect Effects 0.000 description 4
- 210000000653 nervous system Anatomy 0.000 description 4
- 230000009871 nonspecific binding Effects 0.000 description 4
- 230000036961 partial effect Effects 0.000 description 4
- 230000002265 prevention Effects 0.000 description 4
- 230000005855 radiation Effects 0.000 description 4
- 238000003127 radioimmunoassay Methods 0.000 description 4
- 230000006798 recombination Effects 0.000 description 4
- 239000011347 resin Substances 0.000 description 4
- 229920005989 resin Polymers 0.000 description 4
- 230000004044 response Effects 0.000 description 4
- 108091008146 restriction endonucleases Proteins 0.000 description 4
- 150000003839 salts Chemical class 0.000 description 4
- 208000012672 seasonal affective disease Diseases 0.000 description 4
- 239000007787 solid Substances 0.000 description 4
- 239000003381 stabilizer Substances 0.000 description 4
- 238000002560 therapeutic procedure Methods 0.000 description 4
- 201000005060 thrombophlebitis Diseases 0.000 description 4
- 238000012546 transfer Methods 0.000 description 4
- 238000002054 transplantation Methods 0.000 description 4
- 150000003626 triacylglycerols Chemical class 0.000 description 4
- 108010073969 valyllysine Proteins 0.000 description 4
- 238000005406 washing Methods 0.000 description 4
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropansäure Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 description 3
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N Ala-Ala-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 3
- WKOBSJOZRJJVRZ-FXQIFTODSA-N Ala-Glu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WKOBSJOZRJJVRZ-FXQIFTODSA-N 0.000 description 3
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 3
- AWZKCUCQJNTBAD-SRVKXCTJSA-N Ala-Leu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN AWZKCUCQJNTBAD-SRVKXCTJSA-N 0.000 description 3
- RTZCUEHYUQZIDE-WHFBIAKZSA-N Ala-Ser-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RTZCUEHYUQZIDE-WHFBIAKZSA-N 0.000 description 3
- 208000024827 Alzheimer disease Diseases 0.000 description 3
- LVMUGODRNHFGRA-AVGNSLFASA-N Arg-Leu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O LVMUGODRNHFGRA-AVGNSLFASA-N 0.000 description 3
- NGTYEHIRESTSRX-UWVGGRQHSA-N Arg-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N NGTYEHIRESTSRX-UWVGGRQHSA-N 0.000 description 3
- SNDBKTFJWVEVPO-WHFBIAKZSA-N Asp-Gly-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SNDBKTFJWVEVPO-WHFBIAKZSA-N 0.000 description 3
- 241000972773 Aulopiformes Species 0.000 description 3
- 102100022548 Beta-hexosaminidase subunit alpha Human genes 0.000 description 3
- FBPFZTCFMRRESA-FSIIMWSLSA-N D-Glucitol Natural products OC[C@H](O)[C@H](O)[C@@H](O)[C@H](O)CO FBPFZTCFMRRESA-FSIIMWSLSA-N 0.000 description 3
- FBPFZTCFMRRESA-JGWLITMVSA-N D-glucitol Chemical compound OC[C@H](O)[C@@H](O)[C@H](O)[C@H](O)CO FBPFZTCFMRRESA-JGWLITMVSA-N 0.000 description 3
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 3
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 3
- 238000002965 ELISA Methods 0.000 description 3
- 108010042407 Endonucleases Proteins 0.000 description 3
- 102000004533 Endonucleases Human genes 0.000 description 3
- 244000148064 Enicostema verticillatum Species 0.000 description 3
- 241000588724 Escherichia coli Species 0.000 description 3
- 208000004930 Fatty Liver Diseases 0.000 description 3
- 108010010803 Gelatin Proteins 0.000 description 3
- QKWBEMCLYTYBNI-GVXVVHGQSA-N Gln-Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(N)=O QKWBEMCLYTYBNI-GVXVVHGQSA-N 0.000 description 3
- KBKGRMNVKPSQIF-XDTLVQLUSA-N Glu-Ala-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KBKGRMNVKPSQIF-XDTLVQLUSA-N 0.000 description 3
- HTTSBEBKVNEDFE-AUTRQRHGSA-N Glu-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N HTTSBEBKVNEDFE-AUTRQRHGSA-N 0.000 description 3
- HRBYTAIBKPNZKQ-AVGNSLFASA-N Glu-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(O)=O HRBYTAIBKPNZKQ-AVGNSLFASA-N 0.000 description 3
- WIKMTDVSCUJIPJ-CIUDSAMLSA-N Glu-Ser-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N WIKMTDVSCUJIPJ-CIUDSAMLSA-N 0.000 description 3
- LHRXAHLCRMQBGJ-RYUDHWBXSA-N Gly-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)CN LHRXAHLCRMQBGJ-RYUDHWBXSA-N 0.000 description 3
- VNNRLUNBJSWZPF-ZKWXMUAHSA-N Gly-Ser-Ile Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VNNRLUNBJSWZPF-ZKWXMUAHSA-N 0.000 description 3
- 229930186217 Glycolipid Natural products 0.000 description 3
- 241000238631 Hexapoda Species 0.000 description 3
- 108060003951 Immunoglobulin Proteins 0.000 description 3
- 102000001706 Immunoglobulin Fab Fragments Human genes 0.000 description 3
- 108010054477 Immunoglobulin Fab Fragments Proteins 0.000 description 3
- 108010065920 Insulin Lispro Proteins 0.000 description 3
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 3
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 3
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 3
- 108010007622 LDL Lipoproteins Proteins 0.000 description 3
- 102000007330 LDL Lipoproteins Human genes 0.000 description 3
- WNGVUZWBXZKQES-YUMQZZPRSA-N Leu-Ala-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O WNGVUZWBXZKQES-YUMQZZPRSA-N 0.000 description 3
- YOZCKMXHBYKOMQ-IHRRRGAJSA-N Leu-Arg-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOZCKMXHBYKOMQ-IHRRRGAJSA-N 0.000 description 3
- WGNOPSQMIQERPK-UHFFFAOYSA-N Leu-Asn-Pro Natural products CC(C)CC(N)C(=O)NC(CC(=O)N)C(=O)N1CCCC1C(=O)O WGNOPSQMIQERPK-UHFFFAOYSA-N 0.000 description 3
- VPKIQULSKFVCSM-SRVKXCTJSA-N Leu-Gln-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VPKIQULSKFVCSM-SRVKXCTJSA-N 0.000 description 3
- CQGSYZCULZMEDE-UHFFFAOYSA-N Leu-Gln-Pro Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)N1CCCC1C(O)=O CQGSYZCULZMEDE-UHFFFAOYSA-N 0.000 description 3
- WIDZHJTYKYBLSR-DCAQKATOSA-N Leu-Glu-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WIDZHJTYKYBLSR-DCAQKATOSA-N 0.000 description 3
- KGCLIYGPQXUNLO-IUCAKERBSA-N Leu-Gly-Glu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O KGCLIYGPQXUNLO-IUCAKERBSA-N 0.000 description 3
- RIJCHEVHFWMDKD-SRVKXCTJSA-N Lys-Lys-Asn Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O RIJCHEVHFWMDKD-SRVKXCTJSA-N 0.000 description 3
- HVAUKHLDSDDROB-KKUMJFAQSA-N Lys-Lys-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O HVAUKHLDSDDROB-KKUMJFAQSA-N 0.000 description 3
- 239000004472 Lysine Substances 0.000 description 3
- 101710175625 Maltose/maltodextrin-binding periplasmic protein Proteins 0.000 description 3
- 241000699666 Mus <mouse, genus> Species 0.000 description 3
- 208000008589 Obesity Diseases 0.000 description 3
- 238000012408 PCR amplification Methods 0.000 description 3
- 108010002747 Pfu DNA polymerase Proteins 0.000 description 3
- DWPXHLIBFQLKLK-CYDGBPFRSA-N Pro-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 DWPXHLIBFQLKLK-CYDGBPFRSA-N 0.000 description 3
- 241000700159 Rattus Species 0.000 description 3
- 108020004511 Recombinant DNA Proteins 0.000 description 3
- 229920002684 Sepharose Polymers 0.000 description 3
- IUXGJEIKJBYKOO-SRVKXCTJSA-N Ser-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N IUXGJEIKJBYKOO-SRVKXCTJSA-N 0.000 description 3
- FPCGZYMRFFIYIH-CIUDSAMLSA-N Ser-Lys-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O FPCGZYMRFFIYIH-CIUDSAMLSA-N 0.000 description 3
- BEBVVQPDSHHWQL-NRPADANISA-N Ser-Val-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O BEBVVQPDSHHWQL-NRPADANISA-N 0.000 description 3
- JZRYFUGREMECBH-XPUUQOCRSA-N Ser-Val-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O JZRYFUGREMECBH-XPUUQOCRSA-N 0.000 description 3
- LGIMRDKGABDMBN-DCAQKATOSA-N Ser-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N LGIMRDKGABDMBN-DCAQKATOSA-N 0.000 description 3
- 108091081024 Start codon Proteins 0.000 description 3
- 229930182558 Sterol Natural products 0.000 description 3
- 208000022292 Tay-Sachs disease Diseases 0.000 description 3
- 102100036407 Thioredoxin Human genes 0.000 description 3
- MEJHFIOYJHTWMK-VOAKCMCISA-N Thr-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)[C@@H](C)O MEJHFIOYJHTWMK-VOAKCMCISA-N 0.000 description 3
- ZMYCLHFLHRVOEA-HEIBUPTGSA-N Thr-Thr-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ZMYCLHFLHRVOEA-HEIBUPTGSA-N 0.000 description 3
- REJRKTOJTCPDPO-IRIUXVKKSA-N Thr-Tyr-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O REJRKTOJTCPDPO-IRIUXVKKSA-N 0.000 description 3
- 241000723873 Tobacco mosaic virus Species 0.000 description 3
- VCAWFLIWYNMHQP-UKJIMTQDSA-N Val-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N VCAWFLIWYNMHQP-UKJIMTQDSA-N 0.000 description 3
- CPGJELLYDQEDRK-NAKRPEOUSA-N Val-Ile-Ala Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](C)C(O)=O CPGJELLYDQEDRK-NAKRPEOUSA-N 0.000 description 3
- SDUBQHUJJWQTEU-XUXIUFHCSA-N Val-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C(C)C)N SDUBQHUJJWQTEU-XUXIUFHCSA-N 0.000 description 3
- BTWMICVCQLKKNR-DCAQKATOSA-N Val-Leu-Ser Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C([O-])=O BTWMICVCQLKKNR-DCAQKATOSA-N 0.000 description 3
- DFQZDQPLWBSFEJ-LSJOCFKGSA-N Val-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N DFQZDQPLWBSFEJ-LSJOCFKGSA-N 0.000 description 3
- 238000009825 accumulation Methods 0.000 description 3
- 238000010171 animal model Methods 0.000 description 3
- 239000007864 aqueous solution Substances 0.000 description 3
- 108010008355 arginyl-glutamine Proteins 0.000 description 3
- 108010029539 arginyl-prolyl-proline Proteins 0.000 description 3
- 210000001106 artificial yeast chromosome Anatomy 0.000 description 3
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 3
- 108010038633 aspartylglutamate Proteins 0.000 description 3
- 108010068265 aspartyltyrosine Proteins 0.000 description 3
- 239000011324 bead Substances 0.000 description 3
- 230000000903 blocking effect Effects 0.000 description 3
- 201000011510 cancer Diseases 0.000 description 3
- 210000000170 cell membrane Anatomy 0.000 description 3
- 230000007882 cirrhosis Effects 0.000 description 3
- 208000019425 cirrhosis of liver Diseases 0.000 description 3
- 238000000576 coating method Methods 0.000 description 3
- 239000003184 complementary RNA Substances 0.000 description 3
- 230000009918 complex formation Effects 0.000 description 3
- 238000010276 construction Methods 0.000 description 3
- 230000008878 coupling Effects 0.000 description 3
- 238000010168 coupling process Methods 0.000 description 3
- 238000005859 coupling reaction Methods 0.000 description 3
- 108010016616 cysteinylglycine Proteins 0.000 description 3
- 108010060199 cysteinylproline Proteins 0.000 description 3
- 230000007547 defect Effects 0.000 description 3
- 206010012601 diabetes mellitus Diseases 0.000 description 3
- 238000007877 drug screening Methods 0.000 description 3
- 230000002124 endocrine Effects 0.000 description 3
- 206010015037 epilepsy Diseases 0.000 description 3
- 238000002474 experimental method Methods 0.000 description 3
- 208000010706 fatty liver disease Diseases 0.000 description 3
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 3
- 239000008273 gelatin Substances 0.000 description 3
- 229920000159 gelatin Polymers 0.000 description 3
- 235000019322 gelatine Nutrition 0.000 description 3
- 235000011852 gelatine desserts Nutrition 0.000 description 3
- 238000010353 genetic engineering Methods 0.000 description 3
- 108010078144 glutaminyl-glycine Proteins 0.000 description 3
- 108010057083 glutamyl-aspartyl-leucine Proteins 0.000 description 3
- 150000002327 glycerophospholipids Chemical class 0.000 description 3
- 229910052739 hydrogen Inorganic materials 0.000 description 3
- 230000001900 immune effect Effects 0.000 description 3
- 102000018358 immunoglobulin Human genes 0.000 description 3
- 229940027941 immunoglobulin g Drugs 0.000 description 3
- 230000005764 inhibitory process Effects 0.000 description 3
- 230000003834 intracellular effect Effects 0.000 description 3
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 3
- 108010000761 leucylarginine Proteins 0.000 description 3
- 108010057821 leucylproline Proteins 0.000 description 3
- 108010012058 leucyltyrosine Proteins 0.000 description 3
- 108010064235 lysylglycine Proteins 0.000 description 3
- 108010054155 lysyllysine Proteins 0.000 description 3
- 108010017391 lysylvaline Proteins 0.000 description 3
- 210000004962 mammalian cell Anatomy 0.000 description 3
- 239000011159 matrix material Substances 0.000 description 3
- 230000007246 mechanism Effects 0.000 description 3
- 239000003068 molecular probe Substances 0.000 description 3
- 230000001537 neural effect Effects 0.000 description 3
- 230000003472 neutralizing effect Effects 0.000 description 3
- 235000020824 obesity Nutrition 0.000 description 3
- 210000000056 organ Anatomy 0.000 description 3
- 102000044160 oxysterol binding protein Human genes 0.000 description 3
- 108010040421 oxysterol binding protein Proteins 0.000 description 3
- 150000003904 phospholipids Chemical class 0.000 description 3
- 102000054765 polymorphisms of proteins Human genes 0.000 description 3
- 238000001556 precipitation Methods 0.000 description 3
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 3
- 108010077112 prolyl-proline Proteins 0.000 description 3
- 108010015796 prolylisoleucine Proteins 0.000 description 3
- 108010090894 prolylleucine Proteins 0.000 description 3
- 230000018883 protein targeting Effects 0.000 description 3
- 239000011541 reaction mixture Substances 0.000 description 3
- 238000005215 recombination Methods 0.000 description 3
- 238000011084 recovery Methods 0.000 description 3
- 230000010076 replication Effects 0.000 description 3
- 235000019515 salmon Nutrition 0.000 description 3
- 230000035945 sensitivity Effects 0.000 description 3
- 108010048818 seryl-histidine Proteins 0.000 description 3
- 150000003384 small molecules Chemical class 0.000 description 3
- 239000007790 solid phase Substances 0.000 description 3
- 239000002904 solvent Substances 0.000 description 3
- 239000000600 sorbitol Substances 0.000 description 3
- 230000009870 specific binding Effects 0.000 description 3
- 150000003408 sphingolipids Chemical class 0.000 description 3
- 238000010561 standard procedure Methods 0.000 description 3
- 150000003432 sterols Chemical class 0.000 description 3
- 235000003702 sterols Nutrition 0.000 description 3
- 238000003860 storage Methods 0.000 description 3
- 150000008163 sugars Chemical class 0.000 description 3
- 208000011580 syndromic disease Diseases 0.000 description 3
- 229940124597 therapeutic agent Drugs 0.000 description 3
- 108060008226 thioredoxin Proteins 0.000 description 3
- 229940113082 thymine Drugs 0.000 description 3
- 230000002103 transcriptional effect Effects 0.000 description 3
- 238000001890 transfection Methods 0.000 description 3
- 230000032258 transport Effects 0.000 description 3
- 241000701161 unidentified adenovirus Species 0.000 description 3
- 238000011144 upstream manufacturing Methods 0.000 description 3
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 3
- YBJHBAHKTGYVGT-ZKWXMUAHSA-N (+)-Biotin Chemical compound N1C(=O)N[C@@H]2[C@H](CCCCC(=O)O)SC[C@@H]21 YBJHBAHKTGYVGT-ZKWXMUAHSA-N 0.000 description 2
- XVZCXCTYGHPNEM-IHRRRGAJSA-N (2s)-1-[(2s)-2-[[(2s)-2-amino-4-methylpentanoyl]amino]-4-methylpentanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O XVZCXCTYGHPNEM-IHRRRGAJSA-N 0.000 description 2
- CABVTRNMFUVUDM-VRHQGPGLSA-N (3S)-3-hydroxy-3-methylglutaryl-CoA Chemical compound O[C@@H]1[C@H](OP(O)(O)=O)[C@@H](COP(O)(=O)OP(O)(=O)OCC(C)(C)[C@@H](O)C(=O)NCCC(=O)NCCSC(=O)C[C@@](O)(CC(O)=O)C)O[C@H]1N1C2=NC=NC(N)=C2N=C1 CABVTRNMFUVUDM-VRHQGPGLSA-N 0.000 description 2
- KJTLQQUUPVSXIM-ZCFIWIBFSA-M (R)-mevalonate Chemical compound OCC[C@](O)(C)CC([O-])=O KJTLQQUUPVSXIM-ZCFIWIBFSA-M 0.000 description 2
- TZCPCKNHXULUIY-RGULYWFUSA-N 1,2-distearoyl-sn-glycero-3-phosphoserine Chemical compound CCCCCCCCCCCCCCCCCC(=O)OC[C@H](COP(O)(=O)OC[C@H](N)C(O)=O)OC(=O)CCCCCCCCCCCCCCCCC TZCPCKNHXULUIY-RGULYWFUSA-N 0.000 description 2
- ASJSAQIRZKANQN-CRCLSJGQSA-N 2-deoxy-D-ribose Chemical compound OC[C@@H](O)[C@@H](O)CC=O ASJSAQIRZKANQN-CRCLSJGQSA-N 0.000 description 2
- 102100024643 ATP-binding cassette sub-family D member 1 Human genes 0.000 description 2
- 208000003200 Adenoma Diseases 0.000 description 2
- 208000002016 Adenosine monophosphate deaminase deficiency Diseases 0.000 description 2
- 208000002485 Adiposis dolorosa Diseases 0.000 description 2
- 201000011452 Adrenoleukodystrophy Diseases 0.000 description 2
- 229920001817 Agar Polymers 0.000 description 2
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 2
- LWUWMHIOBPTZBA-DCAQKATOSA-N Ala-Arg-Lys Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O LWUWMHIOBPTZBA-DCAQKATOSA-N 0.000 description 2
- ZODMADSIQZZBSQ-FXQIFTODSA-N Ala-Gln-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZODMADSIQZZBSQ-FXQIFTODSA-N 0.000 description 2
- GGNHBHYDMUDXQB-KBIXCLLPSA-N Ala-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)N GGNHBHYDMUDXQB-KBIXCLLPSA-N 0.000 description 2
- VBRDBGCROKWTPV-XHNCKOQMSA-N Ala-Glu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N VBRDBGCROKWTPV-XHNCKOQMSA-N 0.000 description 2
- HHRAXZAYZFFRAM-CIUDSAMLSA-N Ala-Leu-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O HHRAXZAYZFFRAM-CIUDSAMLSA-N 0.000 description 2
- CCDFBRZVTDDJNM-GUBZILKMSA-N Ala-Leu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CCDFBRZVTDDJNM-GUBZILKMSA-N 0.000 description 2
- MFMDKJIPHSWSBM-GUBZILKMSA-N Ala-Lys-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFMDKJIPHSWSBM-GUBZILKMSA-N 0.000 description 2
- 108010011667 Ala-Phe-Ala Proteins 0.000 description 2
- WEZNQZHACPSMEF-QEJZJMRPSA-N Ala-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 WEZNQZHACPSMEF-QEJZJMRPSA-N 0.000 description 2
- YYAVDNKUWLAFCV-ACZMJKKPSA-N Ala-Ser-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O YYAVDNKUWLAFCV-ACZMJKKPSA-N 0.000 description 2
- KUFVXLQLDHJVOG-SHGPDSBTSA-N Ala-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C)N)O KUFVXLQLDHJVOG-SHGPDSBTSA-N 0.000 description 2
- REWSWYIDQIELBE-FXQIFTODSA-N Ala-Val-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O REWSWYIDQIELBE-FXQIFTODSA-N 0.000 description 2
- OMSKGWFGWCQFBD-KZVJFYERSA-N Ala-Val-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OMSKGWFGWCQFBD-KZVJFYERSA-N 0.000 description 2
- GUBGYTABKSRVRQ-XLOQQCSPSA-N Alpha-Lactose Chemical compound O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@H]1O[C@@H]1[C@@H](CO)O[C@H](O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-XLOQQCSPSA-N 0.000 description 2
- 206010001881 Alveolar proteinosis Diseases 0.000 description 2
- 208000000044 Amnesia Diseases 0.000 description 2
- 208000031091 Amnestic disease Diseases 0.000 description 2
- 206010002329 Aneurysm Diseases 0.000 description 2
- 208000019901 Anxiety disease Diseases 0.000 description 2
- 208000003017 Aortic Valve Stenosis Diseases 0.000 description 2
- 108010071619 Apolipoproteins Proteins 0.000 description 2
- 102000007592 Apolipoproteins Human genes 0.000 description 2
- OTCJMMRQBVDQRK-DCAQKATOSA-N Arg-Asp-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O OTCJMMRQBVDQRK-DCAQKATOSA-N 0.000 description 2
- JTWOBPNAVBESFW-FXQIFTODSA-N Arg-Cys-Asp Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)CN=C(N)N JTWOBPNAVBESFW-FXQIFTODSA-N 0.000 description 2
- HPKSHFSEXICTLI-CIUDSAMLSA-N Arg-Glu-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O HPKSHFSEXICTLI-CIUDSAMLSA-N 0.000 description 2
- UBCPNBUIQNMDNH-NAKRPEOUSA-N Arg-Ile-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O UBCPNBUIQNMDNH-NAKRPEOUSA-N 0.000 description 2
- OTZMRMHZCMZOJZ-SRVKXCTJSA-N Arg-Leu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O OTZMRMHZCMZOJZ-SRVKXCTJSA-N 0.000 description 2
- VUGWHBXPMAHEGZ-SRVKXCTJSA-N Arg-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCN=C(N)N VUGWHBXPMAHEGZ-SRVKXCTJSA-N 0.000 description 2
- ICRHGPYYXMWHIE-LPEHRKFASA-N Arg-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ICRHGPYYXMWHIE-LPEHRKFASA-N 0.000 description 2
- CPTXATAOUQJQRO-GUBZILKMSA-N Arg-Val-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O CPTXATAOUQJQRO-GUBZILKMSA-N 0.000 description 2
- 206010003226 Arteriovenous fistula Diseases 0.000 description 2
- IARGXWMWRFOQPG-GCJQMDKQSA-N Asn-Ala-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IARGXWMWRFOQPG-GCJQMDKQSA-N 0.000 description 2
- XVAPVJNJGLWGCS-ACZMJKKPSA-N Asn-Glu-Asn Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N XVAPVJNJGLWGCS-ACZMJKKPSA-N 0.000 description 2
- WIDVAWAQBRAKTI-YUMQZZPRSA-N Asn-Leu-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O WIDVAWAQBRAKTI-YUMQZZPRSA-N 0.000 description 2
- DJIMLSXHXKWADV-CIUDSAMLSA-N Asn-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(N)=O DJIMLSXHXKWADV-CIUDSAMLSA-N 0.000 description 2
- FBODFHMLALOPHP-GUBZILKMSA-N Asn-Lys-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O FBODFHMLALOPHP-GUBZILKMSA-N 0.000 description 2
- ORJQQZIXTOYGGH-SRVKXCTJSA-N Asn-Lys-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ORJQQZIXTOYGGH-SRVKXCTJSA-N 0.000 description 2
- ZYPWIUFLYMQZBS-SRVKXCTJSA-N Asn-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N ZYPWIUFLYMQZBS-SRVKXCTJSA-N 0.000 description 2
- BYLSYQASFJJBCL-DCAQKATOSA-N Asn-Pro-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O BYLSYQASFJJBCL-DCAQKATOSA-N 0.000 description 2
- IDUUACUJKUXKKD-VEVYYDQMSA-N Asn-Pro-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O IDUUACUJKUXKKD-VEVYYDQMSA-N 0.000 description 2
- SNYCNNPOFYBCEK-ZLUOBGJFSA-N Asn-Ser-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O SNYCNNPOFYBCEK-ZLUOBGJFSA-N 0.000 description 2
- ZUFPUBYQYWCMDB-NUMRIWBASA-N Asn-Thr-Glu Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZUFPUBYQYWCMDB-NUMRIWBASA-N 0.000 description 2
- QXHVOUSPVAWEMX-ZLUOBGJFSA-N Asp-Asp-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O QXHVOUSPVAWEMX-ZLUOBGJFSA-N 0.000 description 2
- NURJSGZGBVJFAD-ZLUOBGJFSA-N Asp-Cys-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O)N)C(=O)O NURJSGZGBVJFAD-ZLUOBGJFSA-N 0.000 description 2
- GHODABZPVZMWCE-FXQIFTODSA-N Asp-Glu-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O GHODABZPVZMWCE-FXQIFTODSA-N 0.000 description 2
- LTXGDRFJRZSZAV-CIUDSAMLSA-N Asp-Glu-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)N LTXGDRFJRZSZAV-CIUDSAMLSA-N 0.000 description 2
- GBSUGIXJAAKZOW-GMOBBJLQSA-N Asp-Ile-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O GBSUGIXJAAKZOW-GMOBBJLQSA-N 0.000 description 2
- OEDJQRXNDRUGEU-SRVKXCTJSA-N Asp-Leu-His Chemical compound N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O OEDJQRXNDRUGEU-SRVKXCTJSA-N 0.000 description 2
- NVFSJIXJZCDICF-SRVKXCTJSA-N Asp-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N NVFSJIXJZCDICF-SRVKXCTJSA-N 0.000 description 2
- JXGJJQJHXHXJQF-CIUDSAMLSA-N Asp-Met-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O JXGJJQJHXHXJQF-CIUDSAMLSA-N 0.000 description 2
- PCJOFZYFFMBZKC-PCBIJLKTSA-N Asp-Phe-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PCJOFZYFFMBZKC-PCBIJLKTSA-N 0.000 description 2
- LTCKTLYKRMCFOC-KKUMJFAQSA-N Asp-Phe-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O LTCKTLYKRMCFOC-KKUMJFAQSA-N 0.000 description 2
- RPUYTJJZXQBWDT-SRVKXCTJSA-N Asp-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N RPUYTJJZXQBWDT-SRVKXCTJSA-N 0.000 description 2
- CUQDCPXNZPDYFQ-ZLUOBGJFSA-N Asp-Ser-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O CUQDCPXNZPDYFQ-ZLUOBGJFSA-N 0.000 description 2
- SQIARYGNVQWOSB-BZSNNMDCSA-N Asp-Tyr-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SQIARYGNVQWOSB-BZSNNMDCSA-N 0.000 description 2
- 206010003598 Atelectasis Diseases 0.000 description 2
- 208000012219 Autonomic Nervous System disease Diseases 0.000 description 2
- 201000001178 Bacterial Pneumonia Diseases 0.000 description 2
- 206010004552 Bicuspid aortic valve Diseases 0.000 description 2
- 208000008439 Biliary Liver Cirrhosis Diseases 0.000 description 2
- 208000015163 Biliary Tract disease Diseases 0.000 description 2
- 208000033222 Biliary cirrhosis primary Diseases 0.000 description 2
- 241000283690 Bos taurus Species 0.000 description 2
- 208000004020 Brain Abscess Diseases 0.000 description 2
- 206010006458 Bronchitis chronic Diseases 0.000 description 2
- 208000007257 Budd-Chiari syndrome Diseases 0.000 description 2
- 102000000584 Calmodulin Human genes 0.000 description 2
- 108010041952 Calmodulin Proteins 0.000 description 2
- 241000282472 Canis lupus familiaris Species 0.000 description 2
- 241000283707 Capra Species 0.000 description 2
- 201000009030 Carcinoma Diseases 0.000 description 2
- 208000017897 Carcinoma of esophagus Diseases 0.000 description 2
- 206010007559 Cardiac failure congestive Diseases 0.000 description 2
- 208000031229 Cardiomyopathies Diseases 0.000 description 2
- 206010058892 Carnitine deficiency Diseases 0.000 description 2
- 206010050215 Carnitine palmitoyltransferase deficiency Diseases 0.000 description 2
- 206010008609 Cholangitis sclerosing Diseases 0.000 description 2
- 208000006545 Chronic Obstructive Pulmonary Disease Diseases 0.000 description 2
- 108700010070 Codon Usage Proteins 0.000 description 2
- 206010009900 Colitis ulcerative Diseases 0.000 description 2
- 206010009944 Colon cancer Diseases 0.000 description 2
- 108020004394 Complementary RNA Proteins 0.000 description 2
- 208000002330 Congenital Heart Defects Diseases 0.000 description 2
- 206010010356 Congenital anomaly Diseases 0.000 description 2
- 108091035707 Consensus sequence Proteins 0.000 description 2
- 206010010774 Constipation Diseases 0.000 description 2
- 208000011990 Corticobasal Degeneration Diseases 0.000 description 2
- 208000019736 Cranial nerve disease Diseases 0.000 description 2
- 208000020406 Creutzfeldt Jacob disease Diseases 0.000 description 2
- 208000003407 Creutzfeldt-Jakob Syndrome Diseases 0.000 description 2
- 208000010859 Creutzfeldt-Jakob disease Diseases 0.000 description 2
- 208000011231 Crohn disease Diseases 0.000 description 2
- HNNGTYHNYDOSKV-FXQIFTODSA-N Cys-Cys-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CS)N HNNGTYHNYDOSKV-FXQIFTODSA-N 0.000 description 2
- DZLQXIFVQFTFJY-BYPYZUCNSA-N Cys-Gly-Gly Chemical compound SC[C@H](N)C(=O)NCC(=O)NCC(O)=O DZLQXIFVQFTFJY-BYPYZUCNSA-N 0.000 description 2
- LBOLGUYQEPZSKM-YUMQZZPRSA-N Cys-Gly-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CS)N LBOLGUYQEPZSKM-YUMQZZPRSA-N 0.000 description 2
- FBPFZTCFMRRESA-KVTDHHQDSA-N D-Mannitol Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)[C@H](O)CO FBPFZTCFMRRESA-KVTDHHQDSA-N 0.000 description 2
- HMFHBZSHGGEWLO-SOOFDHNKSA-N D-ribofuranose Chemical compound OC[C@H]1OC(O)[C@H](O)[C@@H]1O HMFHBZSHGGEWLO-SOOFDHNKSA-N 0.000 description 2
- KJTLQQUUPVSXIM-UHFFFAOYSA-N DL-mevalonic acid Natural products OCCC(O)(C)CC(O)=O KJTLQQUUPVSXIM-UHFFFAOYSA-N 0.000 description 2
- 208000019505 Deglutition disease Diseases 0.000 description 2
- 206010012289 Dementia Diseases 0.000 description 2
- 208000016192 Demyelinating disease Diseases 0.000 description 2
- 208000012239 Developmental disease Diseases 0.000 description 2
- 229920002307 Dextran Polymers 0.000 description 2
- 208000032131 Diabetic Neuropathies Diseases 0.000 description 2
- 206010012735 Diarrhoea Diseases 0.000 description 2
- RTZKZFJDLAIYFH-UHFFFAOYSA-N Diethyl ether Chemical compound CCOCC RTZKZFJDLAIYFH-UHFFFAOYSA-N 0.000 description 2
- 208000005872 Diffuse Esophageal Spasm Diseases 0.000 description 2
- 208000035762 Disorder of lipid metabolism Diseases 0.000 description 2
- 206010014561 Emphysema Diseases 0.000 description 2
- 206010062608 Endocarditis noninfective Diseases 0.000 description 2
- 241000701867 Enterobacteria phage T7 Species 0.000 description 2
- 206010058838 Enterocolitis infectious Diseases 0.000 description 2
- 208000007217 Esophageal Stenosis Diseases 0.000 description 2
- 206010061846 Extradural abscess Diseases 0.000 description 2
- 208000024720 Fabry Disease Diseases 0.000 description 2
- 206010061857 Fat necrosis Diseases 0.000 description 2
- ZHNUHDYFZUAESO-UHFFFAOYSA-N Formamide Chemical compound NC=O ZHNUHDYFZUAESO-UHFFFAOYSA-N 0.000 description 2
- 208000009796 Gangliosidoses Diseases 0.000 description 2
- 208000007882 Gastritis Diseases 0.000 description 2
- 208000005577 Gastroenteritis Diseases 0.000 description 2
- 208000012671 Gastrointestinal haemorrhages Diseases 0.000 description 2
- 208000015872 Gaucher disease Diseases 0.000 description 2
- 208000003736 Gerstmann-Straussler-Scheinker Disease Diseases 0.000 description 2
- 206010072075 Gerstmann-Straussler-Scheinker syndrome Diseases 0.000 description 2
- INKFLNZBTSNFON-CIUDSAMLSA-N Gln-Ala-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O INKFLNZBTSNFON-CIUDSAMLSA-N 0.000 description 2
- PRBLYKYHAJEABA-SRVKXCTJSA-N Gln-Arg-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O PRBLYKYHAJEABA-SRVKXCTJSA-N 0.000 description 2
- KZEUVLLVULIPNX-GUBZILKMSA-N Gln-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N KZEUVLLVULIPNX-GUBZILKMSA-N 0.000 description 2
- HWEINOMSWQSJDC-SRVKXCTJSA-N Gln-Leu-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O HWEINOMSWQSJDC-SRVKXCTJSA-N 0.000 description 2
- YPMDZWPZFOZYFG-GUBZILKMSA-N Gln-Leu-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YPMDZWPZFOZYFG-GUBZILKMSA-N 0.000 description 2
- HSHCEAUPUPJPTE-JYJNAYRXSA-N Gln-Leu-Tyr Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HSHCEAUPUPJPTE-JYJNAYRXSA-N 0.000 description 2
- DITJVHONFRJKJW-BPUTZDHNSA-N Gln-Trp-Glu Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N DITJVHONFRJKJW-BPUTZDHNSA-N 0.000 description 2
- SDSMVVSHLAAOJL-UKJIMTQDSA-N Gln-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCC(=O)N)N SDSMVVSHLAAOJL-UKJIMTQDSA-N 0.000 description 2
- 206010018374 Glomerulonephritis minimal lesion Diseases 0.000 description 2
- JPHYJQHPILOKHC-ACZMJKKPSA-N Glu-Asp-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O JPHYJQHPILOKHC-ACZMJKKPSA-N 0.000 description 2
- XXCDTYBVGMPIOA-FXQIFTODSA-N Glu-Asp-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XXCDTYBVGMPIOA-FXQIFTODSA-N 0.000 description 2
- JVSBYEDSSRZQGV-GUBZILKMSA-N Glu-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O JVSBYEDSSRZQGV-GUBZILKMSA-N 0.000 description 2
- XHUCVVHRLNPZSZ-CIUDSAMLSA-N Glu-Gln-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XHUCVVHRLNPZSZ-CIUDSAMLSA-N 0.000 description 2
- LVCHEMOPBORRLB-DCAQKATOSA-N Glu-Gln-Lys Chemical compound NCCCC[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O LVCHEMOPBORRLB-DCAQKATOSA-N 0.000 description 2
- CGOHAEBMDSEKFB-FXQIFTODSA-N Glu-Glu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O CGOHAEBMDSEKFB-FXQIFTODSA-N 0.000 description 2
- DVLZZEPUNFEUBW-AVGNSLFASA-N Glu-His-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N DVLZZEPUNFEUBW-AVGNSLFASA-N 0.000 description 2
- VGUYMZGLJUJRBV-YVNDNENWSA-N Glu-Ile-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O VGUYMZGLJUJRBV-YVNDNENWSA-N 0.000 description 2
- VSRCAOIHMGCIJK-SRVKXCTJSA-N Glu-Leu-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O VSRCAOIHMGCIJK-SRVKXCTJSA-N 0.000 description 2
- NWOUBJNMZDDGDT-AVGNSLFASA-N Glu-Leu-His Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 NWOUBJNMZDDGDT-AVGNSLFASA-N 0.000 description 2
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 2
- UGSVSNXPJJDJKL-SDDRHHMPSA-N Glu-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N UGSVSNXPJJDJKL-SDDRHHMPSA-N 0.000 description 2
- BCYGDJXHAGZNPQ-DCAQKATOSA-N Glu-Lys-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O BCYGDJXHAGZNPQ-DCAQKATOSA-N 0.000 description 2
- ILWHFUZZCFYSKT-AVGNSLFASA-N Glu-Lys-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ILWHFUZZCFYSKT-AVGNSLFASA-N 0.000 description 2
- SYAYROHMAIHWFB-KBIXCLLPSA-N Glu-Ser-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SYAYROHMAIHWFB-KBIXCLLPSA-N 0.000 description 2
- IDEODOAVGCMUQV-GUBZILKMSA-N Glu-Ser-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IDEODOAVGCMUQV-GUBZILKMSA-N 0.000 description 2
- BXSZPACYCMNKLS-AVGNSLFASA-N Glu-Ser-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BXSZPACYCMNKLS-AVGNSLFASA-N 0.000 description 2
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 2
- 108010024636 Glutathione Proteins 0.000 description 2
- GDOZQTNZPCUARW-YFKPBYRVSA-N Gly-Gly-Glu Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O GDOZQTNZPCUARW-YFKPBYRVSA-N 0.000 description 2
- UQJNXZSSGQIPIQ-FBCQKBJTSA-N Gly-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)CN UQJNXZSSGQIPIQ-FBCQKBJTSA-N 0.000 description 2
- HKSNHPVETYYJBK-LAEOZQHASA-N Gly-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)CN HKSNHPVETYYJBK-LAEOZQHASA-N 0.000 description 2
- HMHRTKOWRUPPNU-RCOVLWMOSA-N Gly-Ile-Gly Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O HMHRTKOWRUPPNU-RCOVLWMOSA-N 0.000 description 2
- ITZOBNKQDZEOCE-NHCYSSNCSA-N Gly-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)CN ITZOBNKQDZEOCE-NHCYSSNCSA-N 0.000 description 2
- IUZGUFAJDBHQQV-YUMQZZPRSA-N Gly-Leu-Asn Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IUZGUFAJDBHQQV-YUMQZZPRSA-N 0.000 description 2
- FXLVSYVJDPCIHH-STQMWFEESA-N Gly-Phe-Arg Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FXLVSYVJDPCIHH-STQMWFEESA-N 0.000 description 2
- CUVBTVWFVIIDOC-YEPSODPASA-N Gly-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)CN CUVBTVWFVIIDOC-YEPSODPASA-N 0.000 description 2
- JZNWSCPGTDBMEW-UHFFFAOYSA-N Glycerophosphorylethanolamin Natural products NCCOP(O)(=O)OCC(O)CO JZNWSCPGTDBMEW-UHFFFAOYSA-N 0.000 description 2
- ZWZWYGMENQVNFU-UHFFFAOYSA-N Glycerophosphorylserin Natural products OC(=O)C(N)COP(O)(=O)OCC(O)CO ZWZWYGMENQVNFU-UHFFFAOYSA-N 0.000 description 2
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 2
- 208000024869 Goodpasture syndrome Diseases 0.000 description 2
- 108010043121 Green Fluorescent Proteins Proteins 0.000 description 2
- 102000004144 Green Fluorescent Proteins Human genes 0.000 description 2
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 2
- 206010019280 Heart failures Diseases 0.000 description 2
- 208000018565 Hemochromatosis Diseases 0.000 description 2
- 206010019713 Hepatic vein thrombosis Diseases 0.000 description 2
- 208000002972 Hepatolenticular Degeneration Diseases 0.000 description 2
- IDQNVIWPPWAFSY-AVGNSLFASA-N His-His-Gln Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(O)=O IDQNVIWPPWAFSY-AVGNSLFASA-N 0.000 description 2
- MPXGJGBXCRQQJE-MXAVVETBSA-N His-Ile-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O MPXGJGBXCRQQJE-MXAVVETBSA-N 0.000 description 2
- 208000023105 Huntington disease Diseases 0.000 description 2
- 102000004286 Hydroxymethylglutaryl CoA Reductases Human genes 0.000 description 2
- 108090000895 Hydroxymethylglutaryl CoA Reductases Proteins 0.000 description 2
- 208000000563 Hyperlipoproteinemia Type II Diseases 0.000 description 2
- 206010020772 Hypertension Diseases 0.000 description 2
- 206010020961 Hypocholesterolaemia Diseases 0.000 description 2
- 108010046315 IDL Lipoproteins Proteins 0.000 description 2
- 201000009794 Idiopathic Pulmonary Fibrosis Diseases 0.000 description 2
- 208000009995 Idiopathic pulmonary hemosiderosis Diseases 0.000 description 2
- TZCGZYWNIDZZMR-UHFFFAOYSA-N Ile-Arg-Ala Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(C)C(O)=O)CCCN=C(N)N TZCGZYWNIDZZMR-UHFFFAOYSA-N 0.000 description 2
- IIXDMJNYALIKGP-DJFWLOJKSA-N Ile-Asn-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N IIXDMJNYALIKGP-DJFWLOJKSA-N 0.000 description 2
- UMYZBHKAVTXWIW-GMOBBJLQSA-N Ile-Asp-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UMYZBHKAVTXWIW-GMOBBJLQSA-N 0.000 description 2
- LGMUPVWZEYYUMU-YVNDNENWSA-N Ile-Glu-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N LGMUPVWZEYYUMU-YVNDNENWSA-N 0.000 description 2
- PFPUFNLHBXKPHY-HTFCKZLJSA-N Ile-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(=O)O)N PFPUFNLHBXKPHY-HTFCKZLJSA-N 0.000 description 2
- GVKKVHNRTUFCCE-BJDJZHNGSA-N Ile-Leu-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)O)N GVKKVHNRTUFCCE-BJDJZHNGSA-N 0.000 description 2
- PHRWFSFCNJPWRO-PPCPHDFISA-N Ile-Leu-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N PHRWFSFCNJPWRO-PPCPHDFISA-N 0.000 description 2
- USXAYNCLFSUSBA-MGHWNKPDSA-N Ile-Phe-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N USXAYNCLFSUSBA-MGHWNKPDSA-N 0.000 description 2
- KBDIBHQICWDGDL-PPCPHDFISA-N Ile-Thr-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N KBDIBHQICWDGDL-PPCPHDFISA-N 0.000 description 2
- ZYVTXBXHIKGZMD-QSFUFRPTSA-N Ile-Val-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ZYVTXBXHIKGZMD-QSFUFRPTSA-N 0.000 description 2
- YWCJXQKATPNPOE-UKJIMTQDSA-N Ile-Val-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N YWCJXQKATPNPOE-UKJIMTQDSA-N 0.000 description 2
- 206010021518 Impaired gastric emptying Diseases 0.000 description 2
- 206010061218 Inflammation Diseases 0.000 description 2
- 208000029523 Interstitial Lung disease Diseases 0.000 description 2
- KFZMGEQAYNKOFK-UHFFFAOYSA-N Isopropanol Chemical compound CC(C)O KFZMGEQAYNKOFK-UHFFFAOYSA-N 0.000 description 2
- 206010023126 Jaundice Diseases 0.000 description 2
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 description 2
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 2
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 2
- AYFVYJQAPQTCCC-GBXIJSLDSA-N L-threonine Chemical compound C[C@@H](O)[C@H](N)C(O)=O AYFVYJQAPQTCCC-GBXIJSLDSA-N 0.000 description 2
- GUBGYTABKSRVRQ-QKKXKWKRSA-N Lactose Natural products OC[C@H]1O[C@@H](O[C@H]2[C@H](O)[C@@H](O)C(O)O[C@@H]2CO)[C@H](O)[C@@H](O)[C@H]1O GUBGYTABKSRVRQ-QKKXKWKRSA-N 0.000 description 2
- 206010062062 Large intestinal obstruction Diseases 0.000 description 2
- 208000003465 Lecithin Cholesterol Acyltransferase Deficiency Diseases 0.000 description 2
- CZCSUZMIRKFFFA-CIUDSAMLSA-N Leu-Ala-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O CZCSUZMIRKFFFA-CIUDSAMLSA-N 0.000 description 2
- KKXDHFKZWKLYGB-GUBZILKMSA-N Leu-Asn-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKXDHFKZWKLYGB-GUBZILKMSA-N 0.000 description 2
- ZTLGVASZOIKNIX-DCAQKATOSA-N Leu-Gln-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZTLGVASZOIKNIX-DCAQKATOSA-N 0.000 description 2
- RVVBWTWPNFDYBE-SRVKXCTJSA-N Leu-Glu-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RVVBWTWPNFDYBE-SRVKXCTJSA-N 0.000 description 2
- HYIFFZAQXPUEAU-QWRGUYRKSA-N Leu-Gly-Leu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C HYIFFZAQXPUEAU-QWRGUYRKSA-N 0.000 description 2
- KXODZBLFVFSLAI-AVGNSLFASA-N Leu-His-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CN=CN1 KXODZBLFVFSLAI-AVGNSLFASA-N 0.000 description 2
- HRTRLSRYZZKPCO-BJDJZHNGSA-N Leu-Ile-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HRTRLSRYZZKPCO-BJDJZHNGSA-N 0.000 description 2
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 2
- KYIIALJHAOIAHF-KKUMJFAQSA-N Leu-Leu-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 KYIIALJHAOIAHF-KKUMJFAQSA-N 0.000 description 2
- LXKNSJLSGPNHSK-KKUMJFAQSA-N Leu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N LXKNSJLSGPNHSK-KKUMJFAQSA-N 0.000 description 2
- QNTJIDXQHWUBKC-BZSNNMDCSA-N Leu-Lys-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QNTJIDXQHWUBKC-BZSNNMDCSA-N 0.000 description 2
- MVVSHHJKJRZVNY-ACRUOGEOSA-N Leu-Phe-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MVVSHHJKJRZVNY-ACRUOGEOSA-N 0.000 description 2
- WMIOEVKKYIMVKI-DCAQKATOSA-N Leu-Pro-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WMIOEVKKYIMVKI-DCAQKATOSA-N 0.000 description 2
- HGUUMQWGYCVPKG-DCAQKATOSA-N Leu-Pro-Cys Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)O)N HGUUMQWGYCVPKG-DCAQKATOSA-N 0.000 description 2
- IDGZVZJLYFTXSL-DCAQKATOSA-N Leu-Ser-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IDGZVZJLYFTXSL-DCAQKATOSA-N 0.000 description 2
- RNYLNYTYMXACRI-VFAJRCTISA-N Leu-Thr-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O RNYLNYTYMXACRI-VFAJRCTISA-N 0.000 description 2
- OZTZJMUZVAVJGY-BZSNNMDCSA-N Leu-Tyr-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N OZTZJMUZVAVJGY-BZSNNMDCSA-N 0.000 description 2
- 208000024369 Libman-Sacks endocarditis Diseases 0.000 description 2
- 206010049287 Lipodystrophy acquired Diseases 0.000 description 2
- 208000004883 Lipoid Nephrosis Diseases 0.000 description 2
- 206010024612 Lipoma Diseases 0.000 description 2
- 102000004895 Lipoproteins Human genes 0.000 description 2
- 108090001030 Lipoproteins Proteins 0.000 description 2
- NQCJGQHHYZNUDK-DCAQKATOSA-N Lys-Arg-Ser Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CCCN=C(N)N NQCJGQHHYZNUDK-DCAQKATOSA-N 0.000 description 2
- 108010062166 Lys-Asn-Asp Proteins 0.000 description 2
- BYPMOIFBQPEWOH-CIUDSAMLSA-N Lys-Asn-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N BYPMOIFBQPEWOH-CIUDSAMLSA-N 0.000 description 2
- MYZMQWHPDAYKIE-SRVKXCTJSA-N Lys-Leu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O MYZMQWHPDAYKIE-SRVKXCTJSA-N 0.000 description 2
- OVAOHZIOUBEQCJ-IHRRRGAJSA-N Lys-Leu-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OVAOHZIOUBEQCJ-IHRRRGAJSA-N 0.000 description 2
- YPLVCBKEPJPBDQ-MELADBBJSA-N Lys-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N YPLVCBKEPJPBDQ-MELADBBJSA-N 0.000 description 2
- PFZWARWVRNTPBR-IHPCNDPISA-N Lys-Leu-Trp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCCN)N PFZWARWVRNTPBR-IHPCNDPISA-N 0.000 description 2
- PYFNONMJYNJENN-AVGNSLFASA-N Lys-Lys-Gln Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PYFNONMJYNJENN-AVGNSLFASA-N 0.000 description 2
- YDDDRTIPNTWGIG-SRVKXCTJSA-N Lys-Lys-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O YDDDRTIPNTWGIG-SRVKXCTJSA-N 0.000 description 2
- LECIJRIRMVOFMH-ULQDDVLXSA-N Lys-Pro-Phe Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 LECIJRIRMVOFMH-ULQDDVLXSA-N 0.000 description 2
- JHNOXVASMSXSNB-WEDXCCLWSA-N Lys-Thr-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O JHNOXVASMSXSNB-WEDXCCLWSA-N 0.000 description 2
- UGCIQUYEJIEHKX-GVXVVHGQSA-N Lys-Val-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O UGCIQUYEJIEHKX-GVXVVHGQSA-N 0.000 description 2
- 206010026712 Mallory-Weiss syndrome Diseases 0.000 description 2
- 229930195725 Mannitol Natural products 0.000 description 2
- 108010049137 Member 1 Subfamily D ATP Binding Cassette Transporter Proteins 0.000 description 2
- 206010027202 Meningitis bacterial Diseases 0.000 description 2
- 206010027260 Meningitis viral Diseases 0.000 description 2
- 208000036626 Mental retardation Diseases 0.000 description 2
- HUKLXYYPZWPXCC-KZVJFYERSA-N Met-Ala-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HUKLXYYPZWPXCC-KZVJFYERSA-N 0.000 description 2
- TZLYIHDABYBOCJ-FXQIFTODSA-N Met-Asp-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O TZLYIHDABYBOCJ-FXQIFTODSA-N 0.000 description 2
- 206010068836 Metabolic myopathy Diseases 0.000 description 2
- 201000011442 Metachromatic leukodystrophy Diseases 0.000 description 2
- 208000019022 Mood disease Diseases 0.000 description 2
- 241000699670 Mus sp. Species 0.000 description 2
- 208000021642 Muscular disease Diseases 0.000 description 2
- 101710135898 Myc proto-oncogene protein Proteins 0.000 description 2
- 102100038895 Myc proto-oncogene protein Human genes 0.000 description 2
- 201000008235 Mycoplasma pneumoniae pneumonia Diseases 0.000 description 2
- 208000003926 Myelitis Diseases 0.000 description 2
- 208000009525 Myocarditis Diseases 0.000 description 2
- 201000009623 Myopathy Diseases 0.000 description 2
- 206010028643 Myopathy endocrine Diseases 0.000 description 2
- 208000023137 Myotoxicity Diseases 0.000 description 2
- 108010079364 N-glycylalanine Proteins 0.000 description 2
- 206010028813 Nausea Diseases 0.000 description 2
- 206010028851 Necrosis Diseases 0.000 description 2
- 208000009905 Neurofibromatoses Diseases 0.000 description 2
- 208000010577 Niemann-Pick disease type C Diseases 0.000 description 2
- 208000026516 Norum disease Diseases 0.000 description 2
- 208000011623 Obstructive Lung disease Diseases 0.000 description 2
- 206010030113 Oedema Diseases 0.000 description 2
- 206010030155 Oesophageal carcinoma Diseases 0.000 description 2
- 206010030184 Oesophageal spasm Diseases 0.000 description 2
- 206010030194 Oesophageal stenosis Diseases 0.000 description 2
- 108700026244 Open Reading Frames Proteins 0.000 description 2
- 206010033645 Pancreatitis Diseases 0.000 description 2
- 208000027099 Paranoid disease Diseases 0.000 description 2
- 208000018737 Parkinson disease Diseases 0.000 description 2
- 201000004602 Peliosis Hepatis Diseases 0.000 description 2
- 208000008469 Peptic Ulcer Diseases 0.000 description 2
- DPUOLKQSMYLRDR-UBHSHLNASA-N Phe-Arg-Ala Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 DPUOLKQSMYLRDR-UBHSHLNASA-N 0.000 description 2
- AYPMIIKUMNADSU-IHRRRGAJSA-N Phe-Arg-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O AYPMIIKUMNADSU-IHRRRGAJSA-N 0.000 description 2
- MECSIDWUTYRHRJ-KKUMJFAQSA-N Phe-Asn-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O MECSIDWUTYRHRJ-KKUMJFAQSA-N 0.000 description 2
- XMPUYNHKEPFERE-IHRRRGAJSA-N Phe-Asp-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 XMPUYNHKEPFERE-IHRRRGAJSA-N 0.000 description 2
- APJPXSFJBMMOLW-KBPBESRZSA-N Phe-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 APJPXSFJBMMOLW-KBPBESRZSA-N 0.000 description 2
- HBGFEEQFVBWYJQ-KBPBESRZSA-N Phe-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 HBGFEEQFVBWYJQ-KBPBESRZSA-N 0.000 description 2
- YKUGPVXSDOOANW-KKUMJFAQSA-N Phe-Leu-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YKUGPVXSDOOANW-KKUMJFAQSA-N 0.000 description 2
- RSPUIENXSJYZQO-JYJNAYRXSA-N Phe-Leu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 RSPUIENXSJYZQO-JYJNAYRXSA-N 0.000 description 2
- YTILBRIUASDGBL-BZSNNMDCSA-N Phe-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 YTILBRIUASDGBL-BZSNNMDCSA-N 0.000 description 2
- BNRFQGLWLQESBG-YESZJQIVSA-N Phe-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O BNRFQGLWLQESBG-YESZJQIVSA-N 0.000 description 2
- KAJLHCWRWDSROH-BZSNNMDCSA-N Phe-Phe-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=CC=C1 KAJLHCWRWDSROH-BZSNNMDCSA-N 0.000 description 2
- ZVRJWDUPIDMHDN-ULQDDVLXSA-N Phe-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=CC=C1 ZVRJWDUPIDMHDN-ULQDDVLXSA-N 0.000 description 2
- JXQVYPWVGUOIDV-MXAVVETBSA-N Phe-Ser-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JXQVYPWVGUOIDV-MXAVVETBSA-N 0.000 description 2
- IPFXYNKCXYGSSV-KKUMJFAQSA-N Phe-Ser-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N IPFXYNKCXYGSSV-KKUMJFAQSA-N 0.000 description 2
- GMWNQSGWWGKTSF-LFSVMHDDSA-N Phe-Thr-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O GMWNQSGWWGKTSF-LFSVMHDDSA-N 0.000 description 2
- MSSXKZBDKZAHCX-UNQGMJICSA-N Phe-Thr-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O MSSXKZBDKZAHCX-UNQGMJICSA-N 0.000 description 2
- 108091023022 Phosphatidylserine decarboxylase Proteins 0.000 description 2
- 206010063985 Phytosterolaemia Diseases 0.000 description 2
- 208000000609 Pick Disease of the Brain Diseases 0.000 description 2
- 206010035610 Pleural Neoplasms Diseases 0.000 description 2
- 208000002151 Pleural effusion Diseases 0.000 description 2
- 206010035724 Pneumonia mycoplasmal Diseases 0.000 description 2
- 206010035737 Pneumonia viral Diseases 0.000 description 2
- 239000002202 Polyethylene glycol Substances 0.000 description 2
- 101710182846 Polyhedrin Proteins 0.000 description 2
- 206010058989 Portal vein occlusion Diseases 0.000 description 2
- 201000009454 Portal vein thrombosis Diseases 0.000 description 2
- 206010036376 Postherpetic Neuralgia Diseases 0.000 description 2
- 208000012654 Primary biliary cholangitis Diseases 0.000 description 2
- 102000029797 Prion Human genes 0.000 description 2
- 108091000054 Prion Proteins 0.000 description 2
- 208000024777 Prion disease Diseases 0.000 description 2
- AMBLXEMWFARNNQ-DCAQKATOSA-N Pro-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@@H]1CCCN1 AMBLXEMWFARNNQ-DCAQKATOSA-N 0.000 description 2
- GDXZRWYXJSGWIV-GMOBBJLQSA-N Pro-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 GDXZRWYXJSGWIV-GMOBBJLQSA-N 0.000 description 2
- HXOLCSYHGRNXJJ-IHRRRGAJSA-N Pro-Asp-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HXOLCSYHGRNXJJ-IHRRRGAJSA-N 0.000 description 2
- VPEVBAUSTBWQHN-NHCYSSNCSA-N Pro-Glu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O VPEVBAUSTBWQHN-NHCYSSNCSA-N 0.000 description 2
- DXTOOBDIIAJZBJ-BQBZGAKWSA-N Pro-Gly-Ser Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CO)C(O)=O DXTOOBDIIAJZBJ-BQBZGAKWSA-N 0.000 description 2
- VZKBJNBZMZHKRC-XUXIUFHCSA-N Pro-Ile-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O VZKBJNBZMZHKRC-XUXIUFHCSA-N 0.000 description 2
- FKYKZHOKDOPHSA-DCAQKATOSA-N Pro-Leu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FKYKZHOKDOPHSA-DCAQKATOSA-N 0.000 description 2
- LEIKGVHQTKHOLM-IUCAKERBSA-N Pro-Pro-Gly Chemical compound OC(=O)CNC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 LEIKGVHQTKHOLM-IUCAKERBSA-N 0.000 description 2
- RCYUBVHMVUHEBM-RCWTZXSCSA-N Pro-Pro-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O RCYUBVHMVUHEBM-RCWTZXSCSA-N 0.000 description 2
- DGDCSVGVWWAJRS-AVGNSLFASA-N Pro-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@@H]2CCCN2 DGDCSVGVWWAJRS-AVGNSLFASA-N 0.000 description 2
- 206010036774 Proctitis Diseases 0.000 description 2
- 206010036783 Proctitis ulcerative Diseases 0.000 description 2
- RJKFOVLPORLFTN-LEKSSAKUSA-N Progesterone Chemical compound C1CC2=CC(=O)CC[C@]2(C)[C@@H]2[C@@H]1[C@@H]1CC[C@H](C(=O)C)[C@@]1(C)CC2 RJKFOVLPORLFTN-LEKSSAKUSA-N 0.000 description 2
- 208000012287 Prolapse Diseases 0.000 description 2
- 208000007123 Pulmonary Atelectasis Diseases 0.000 description 2
- 208000010378 Pulmonary Embolism Diseases 0.000 description 2
- 206010037368 Pulmonary congestion Diseases 0.000 description 2
- 208000006193 Pulmonary infarction Diseases 0.000 description 2
- 206010037423 Pulmonary oedema Diseases 0.000 description 2
- 108020004518 RNA Probes Proteins 0.000 description 2
- 239000003391 RNA probe Substances 0.000 description 2
- 206010037779 Radiculopathy Diseases 0.000 description 2
- 208000003782 Raynaud disease Diseases 0.000 description 2
- 208000012322 Raynaud phenomenon Diseases 0.000 description 2
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 2
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 2
- 208000030934 Restrictive pulmonary disease Diseases 0.000 description 2
- 208000007014 Retinitis pigmentosa Diseases 0.000 description 2
- 201000007981 Reye syndrome Diseases 0.000 description 2
- PYMYPHUHKUWMLA-LMVFSUKVSA-N Ribose Natural products OC[C@@H](O)[C@@H](O)[C@@H](O)C=O PYMYPHUHKUWMLA-LMVFSUKVSA-N 0.000 description 2
- 241000714474 Rous sarcoma virus Species 0.000 description 2
- 208000021811 Sandhoff disease Diseases 0.000 description 2
- 238000012300 Sequence Analysis Methods 0.000 description 2
- MMGJPDWSIOAGTH-ACZMJKKPSA-N Ser-Ala-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MMGJPDWSIOAGTH-ACZMJKKPSA-N 0.000 description 2
- IDQFQFVEWMWRQQ-DLOVCJGASA-N Ser-Ala-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O IDQFQFVEWMWRQQ-DLOVCJGASA-N 0.000 description 2
- WDXYVIIVDIDOSX-DCAQKATOSA-N Ser-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N WDXYVIIVDIDOSX-DCAQKATOSA-N 0.000 description 2
- QVOGDCQNGLBNCR-FXQIFTODSA-N Ser-Arg-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O QVOGDCQNGLBNCR-FXQIFTODSA-N 0.000 description 2
- UOLGINIHBRIECN-FXQIFTODSA-N Ser-Glu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UOLGINIHBRIECN-FXQIFTODSA-N 0.000 description 2
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 2
- IOVBCLGAJJXOHK-SRVKXCTJSA-N Ser-His-His Chemical compound C([C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 IOVBCLGAJJXOHK-SRVKXCTJSA-N 0.000 description 2
- QYSFWUIXDFJUDW-DCAQKATOSA-N Ser-Leu-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QYSFWUIXDFJUDW-DCAQKATOSA-N 0.000 description 2
- KCNSGAMPBPYUAI-CIUDSAMLSA-N Ser-Leu-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O KCNSGAMPBPYUAI-CIUDSAMLSA-N 0.000 description 2
- XNCUYZKGQOCOQH-YUMQZZPRSA-N Ser-Leu-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O XNCUYZKGQOCOQH-YUMQZZPRSA-N 0.000 description 2
- YUJLIIRMIAGMCQ-CIUDSAMLSA-N Ser-Leu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YUJLIIRMIAGMCQ-CIUDSAMLSA-N 0.000 description 2
- OWCVUSJMEBGMOK-YUMQZZPRSA-N Ser-Lys-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O OWCVUSJMEBGMOK-YUMQZZPRSA-N 0.000 description 2
- SRKMDKACHDVPMD-SRVKXCTJSA-N Ser-Lys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CO)N SRKMDKACHDVPMD-SRVKXCTJSA-N 0.000 description 2
- LRWBCWGEUCKDTN-BJDJZHNGSA-N Ser-Lys-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LRWBCWGEUCKDTN-BJDJZHNGSA-N 0.000 description 2
- XUDRHBPSPAPDJP-SRVKXCTJSA-N Ser-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CO XUDRHBPSPAPDJP-SRVKXCTJSA-N 0.000 description 2
- CRJZZXMAADSBBQ-SRVKXCTJSA-N Ser-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CO CRJZZXMAADSBBQ-SRVKXCTJSA-N 0.000 description 2
- AZWNCEBQZXELEZ-FXQIFTODSA-N Ser-Pro-Ser Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O AZWNCEBQZXELEZ-FXQIFTODSA-N 0.000 description 2
- OZPDGESCTGGNAD-CIUDSAMLSA-N Ser-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CO OZPDGESCTGGNAD-CIUDSAMLSA-N 0.000 description 2
- CUXJENOFJXOSOZ-BIIVOSGPSA-N Ser-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CO)N)C(=O)O CUXJENOFJXOSOZ-BIIVOSGPSA-N 0.000 description 2
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 2
- PYTKULIABVRXSC-BWBBJGPYSA-N Ser-Ser-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PYTKULIABVRXSC-BWBBJGPYSA-N 0.000 description 2
- XJDMUQCLVSCRSJ-VZFHVOOUSA-N Ser-Thr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O XJDMUQCLVSCRSJ-VZFHVOOUSA-N 0.000 description 2
- MFQMZDPAZRZAPV-NAKRPEOUSA-N Ser-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CO)N MFQMZDPAZRZAPV-NAKRPEOUSA-N 0.000 description 2
- ANOQEBQWIAYIMV-AEJSXWLSSA-N Ser-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N ANOQEBQWIAYIMV-AEJSXWLSSA-N 0.000 description 2
- 206010049416 Short-bowel syndrome Diseases 0.000 description 2
- 208000002227 Sitosterolemia Diseases 0.000 description 2
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 2
- 102000011971 Sphingomyelin Phosphodiesterase Human genes 0.000 description 2
- 108010061312 Sphingomyelin Phosphodiesterase Proteins 0.000 description 2
- 208000029033 Spinal Cord disease Diseases 0.000 description 2
- 208000010112 Spinocerebellar Degenerations Diseases 0.000 description 2
- 229920002472 Starch Polymers 0.000 description 2
- 102100036325 Sterol 26-hydroxylase, mitochondrial Human genes 0.000 description 2
- 208000006011 Stroke Diseases 0.000 description 2
- 206010042265 Sturge-Weber Syndrome Diseases 0.000 description 2
- 201000000002 Subdural Empyema Diseases 0.000 description 2
- CZMRCDWAGMRECN-UGDNZRGBSA-N Sucrose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-UGDNZRGBSA-N 0.000 description 2
- 229930006000 Sucrose Natural products 0.000 description 2
- 208000001163 Tangier disease Diseases 0.000 description 2
- 108010006785 Taq Polymerase Proteins 0.000 description 2
- 206010043118 Tardive Dyskinesia Diseases 0.000 description 2
- MUMGGOZAMZWBJJ-DYKIIFRCSA-N Testostosterone Chemical compound O=C1CC[C@]2(C)[C@H]3CC[C@](C)([C@H](CC4)O)[C@@H]4[C@@H]3CCC2=C1 MUMGGOZAMZWBJJ-DYKIIFRCSA-N 0.000 description 2
- UKBSDLHIKIXJKH-HJGDQZAQSA-N Thr-Arg-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O UKBSDLHIKIXJKH-HJGDQZAQSA-N 0.000 description 2
- OJRNZRROAIAHDL-LKXGYXEUSA-N Thr-Asn-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O OJRNZRROAIAHDL-LKXGYXEUSA-N 0.000 description 2
- VUVCRYXYUUPGSB-GLLZPBPUSA-N Thr-Gln-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N)O VUVCRYXYUUPGSB-GLLZPBPUSA-N 0.000 description 2
- UDQBCBUXAQIZAK-GLLZPBPUSA-N Thr-Glu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UDQBCBUXAQIZAK-GLLZPBPUSA-N 0.000 description 2
- BNGDYRRHRGOPHX-IFFSRLJSSA-N Thr-Glu-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O BNGDYRRHRGOPHX-IFFSRLJSSA-N 0.000 description 2
- UBDDORVPVLEECX-FJXKBIBVSA-N Thr-Gly-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O UBDDORVPVLEECX-FJXKBIBVSA-N 0.000 description 2
- AHERARIZBPOMNU-KATARQTJSA-N Thr-Ser-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O AHERARIZBPOMNU-KATARQTJSA-N 0.000 description 2
- ZEJBJDHSQPOVJV-UAXMHLISSA-N Thr-Trp-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZEJBJDHSQPOVJV-UAXMHLISSA-N 0.000 description 2
- LXXCHJKHJYRMIY-FQPOAREZSA-N Thr-Tyr-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O LXXCHJKHJYRMIY-FQPOAREZSA-N 0.000 description 2
- GWEVSGVZZGPLCZ-UHFFFAOYSA-N Titan oxide Chemical compound O=[Ti]=O GWEVSGVZZGPLCZ-UHFFFAOYSA-N 0.000 description 2
- 208000000323 Tourette Syndrome Diseases 0.000 description 2
- 208000016620 Tourette disease Diseases 0.000 description 2
- 108091023040 Transcription factor Proteins 0.000 description 2
- 102000040945 Transcription factor Human genes 0.000 description 2
- 101710150448 Transcriptional regulator Myc Proteins 0.000 description 2
- 241000209140 Triticum Species 0.000 description 2
- 235000021307 Triticum Nutrition 0.000 description 2
- NXAPHBHZCMQORW-FDARSICLSA-N Trp-Arg-Ile Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NXAPHBHZCMQORW-FDARSICLSA-N 0.000 description 2
- WLBZWXXGSOLJBA-HOCLYGCPSA-N Trp-Gly-Lys Chemical compound C1=CC=C2C(C[C@H](N)C(=O)NCC(=O)N[C@@H](CCCCN)C(O)=O)=CNC2=C1 WLBZWXXGSOLJBA-HOCLYGCPSA-N 0.000 description 2
- AIISTODACBDQLW-WDSOQIARSA-N Trp-Leu-Arg Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 AIISTODACBDQLW-WDSOQIARSA-N 0.000 description 2
- QUIXRGCMQOXUSV-SZMVWBNQSA-N Trp-Pro-Pro Chemical compound O=C([C@@H]1CCCN1C(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)N1CCC[C@H]1C(O)=O QUIXRGCMQOXUSV-SZMVWBNQSA-N 0.000 description 2
- 208000026911 Tuberous sclerosis complex Diseases 0.000 description 2
- 208000007930 Type C Niemann-Pick Disease Diseases 0.000 description 2
- 206010045261 Type IIa hyperlipidaemia Diseases 0.000 description 2
- DLZKEQQWXODGGZ-KWQFWETISA-N Tyr-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 DLZKEQQWXODGGZ-KWQFWETISA-N 0.000 description 2
- HKIUVWMZYFBIHG-KKUMJFAQSA-N Tyr-Arg-Gln Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O HKIUVWMZYFBIHG-KKUMJFAQSA-N 0.000 description 2
- HTHCZRWCFXMENJ-KKUMJFAQSA-N Tyr-Arg-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HTHCZRWCFXMENJ-KKUMJFAQSA-N 0.000 description 2
- IIJWXEUNETVJPV-IHRRRGAJSA-N Tyr-Arg-Ser Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N)O IIJWXEUNETVJPV-IHRRRGAJSA-N 0.000 description 2
- MNWINJDPGBNOED-ULQDDVLXSA-N Tyr-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=C(O)C=C1 MNWINJDPGBNOED-ULQDDVLXSA-N 0.000 description 2
- MQGGXGKQSVEQHR-KKUMJFAQSA-N Tyr-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 MQGGXGKQSVEQHR-KKUMJFAQSA-N 0.000 description 2
- 201000006704 Ulcerative Colitis Diseases 0.000 description 2
- XSQUKJJJFZCRTK-UHFFFAOYSA-N Urea Chemical compound NC(N)=O XSQUKJJJFZCRTK-UHFFFAOYSA-N 0.000 description 2
- DRTQHJPVMGBUCF-XVFCMESISA-N Uridine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-XVFCMESISA-N 0.000 description 2
- 108010062497 VLDL Lipoproteins Proteins 0.000 description 2
- 241000700618 Vaccinia virus Species 0.000 description 2
- AZSHAZJLOZQYAY-FXQIFTODSA-N Val-Ala-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O AZSHAZJLOZQYAY-FXQIFTODSA-N 0.000 description 2
- OGNMURQZFMHFFD-NHCYSSNCSA-N Val-Asn-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N OGNMURQZFMHFFD-NHCYSSNCSA-N 0.000 description 2
- HURRXSNHCCSJHA-AUTRQRHGSA-N Val-Gln-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N HURRXSNHCCSJHA-AUTRQRHGSA-N 0.000 description 2
- VFOHXOLPLACADK-GVXVVHGQSA-N Val-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C(C)C)N VFOHXOLPLACADK-GVXVVHGQSA-N 0.000 description 2
- UEHRGZCNLSWGHK-DLOVCJGASA-N Val-Glu-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UEHRGZCNLSWGHK-DLOVCJGASA-N 0.000 description 2
- WFENBJPLZMPVAX-XVKPBYJWSA-N Val-Gly-Glu Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O WFENBJPLZMPVAX-XVKPBYJWSA-N 0.000 description 2
- HGJRMXOWUWVUOA-GVXVVHGQSA-N Val-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N HGJRMXOWUWVUOA-GVXVVHGQSA-N 0.000 description 2
- GVJUTBOZZBTBIG-AVGNSLFASA-N Val-Lys-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N GVJUTBOZZBTBIG-AVGNSLFASA-N 0.000 description 2
- UZFNHAXYMICTBU-DZKIICNBSA-N Val-Phe-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N UZFNHAXYMICTBU-DZKIICNBSA-N 0.000 description 2
- VIKZGAUAKQZDOF-NRPADANISA-N Val-Ser-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O VIKZGAUAKQZDOF-NRPADANISA-N 0.000 description 2
- GBIUHAYJGWVNLN-UHFFFAOYSA-N Val-Ser-Pro Natural products CC(C)C(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O GBIUHAYJGWVNLN-UHFFFAOYSA-N 0.000 description 2
- PFMSJVIPEZMKSC-DZKIICNBSA-N Val-Tyr-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N PFMSJVIPEZMKSC-DZKIICNBSA-N 0.000 description 2
- 206010046996 Varicose vein Diseases 0.000 description 2
- 206010047115 Vasculitis Diseases 0.000 description 2
- 208000012346 Venoocclusive disease Diseases 0.000 description 2
- 206010047249 Venous thrombosis Diseases 0.000 description 2
- 206010047700 Vomiting Diseases 0.000 description 2
- 208000027207 Whipple disease Diseases 0.000 description 2
- 208000018839 Wilson disease Diseases 0.000 description 2
- 230000003187 abdominal effect Effects 0.000 description 2
- 208000004622 abetalipoproteinemia Diseases 0.000 description 2
- 230000002159 abnormal effect Effects 0.000 description 2
- DPXJVFZANSGRMM-UHFFFAOYSA-N acetic acid;2,3,4,5,6-pentahydroxyhexanal;sodium Chemical compound [Na].CC(O)=O.OCC(O)C(O)C(O)C(O)C=O DPXJVFZANSGRMM-UHFFFAOYSA-N 0.000 description 2
- 230000001154 acute effect Effects 0.000 description 2
- 208000005652 acute fatty liver of pregnancy Diseases 0.000 description 2
- 239000008272 agar Substances 0.000 description 2
- 235000010419 agar Nutrition 0.000 description 2
- 239000011543 agarose gel Substances 0.000 description 2
- HMFHBZSHGGEWLO-UHFFFAOYSA-N alpha-D-Furanose-Ribose Natural products OCC1OC(O)C(O)C1O HMFHBZSHGGEWLO-UHFFFAOYSA-N 0.000 description 2
- 230000004075 alteration Effects 0.000 description 2
- 230000006986 amnesia Effects 0.000 description 2
- 206010002026 amyotrophic lateral sclerosis Diseases 0.000 description 2
- 238000002399 angioplasty Methods 0.000 description 2
- 208000022531 anorexia Diseases 0.000 description 2
- 230000036506 anxiety Effects 0.000 description 2
- 206010002906 aortic stenosis Diseases 0.000 description 2
- 108010038850 arginyl-isoleucyl-tyrosine Proteins 0.000 description 2
- 108010060035 arginylproline Proteins 0.000 description 2
- 208000011775 arteriosclerosis disease Diseases 0.000 description 2
- 210000004436 artificial bacterial chromosome Anatomy 0.000 description 2
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 2
- 208000006673 asthma Diseases 0.000 description 2
- 201000009904 bacterial meningitis Diseases 0.000 description 2
- 230000004888 barrier function Effects 0.000 description 2
- 208000018300 basal ganglia disease Diseases 0.000 description 2
- 208000021654 bicuspid aortic valve disease Diseases 0.000 description 2
- 208000027119 bilirubin metabolic disease Diseases 0.000 description 2
- 230000008827 biological function Effects 0.000 description 2
- 230000008236 biological pathway Effects 0.000 description 2
- 210000004369 blood Anatomy 0.000 description 2
- 239000008280 blood Substances 0.000 description 2
- 210000004204 blood vessel Anatomy 0.000 description 2
- 201000009267 bronchiectasis Diseases 0.000 description 2
- 206010006451 bronchitis Diseases 0.000 description 2
- 239000007975 buffered saline Substances 0.000 description 2
- 230000002308 calcification Effects 0.000 description 2
- 238000005251 capillar electrophoresis Methods 0.000 description 2
- 239000001768 carboxy methyl cellulose Substances 0.000 description 2
- 208000005761 carcinoid heart disease Diseases 0.000 description 2
- 230000000747 cardiac effect Effects 0.000 description 2
- 239000000969 carrier Substances 0.000 description 2
- 239000003054 catalyst Substances 0.000 description 2
- 206010007776 catatonia Diseases 0.000 description 2
- 239000013592 cell lysate Substances 0.000 description 2
- 230000006037 cell lysis Effects 0.000 description 2
- 230000004663 cell proliferation Effects 0.000 description 2
- 208000015114 central nervous system disease Diseases 0.000 description 2
- 206010008129 cerebral palsy Diseases 0.000 description 2
- 208000001088 cerebrotendinous xanthomatosis Diseases 0.000 description 2
- 208000026106 cerebrovascular disease Diseases 0.000 description 2
- 239000013522 chelant Substances 0.000 description 2
- 238000007385 chemical modification Methods 0.000 description 2
- 201000001352 cholecystitis Diseases 0.000 description 2
- 201000001883 cholelithiasis Diseases 0.000 description 2
- 208000007451 chronic bronchitis Diseases 0.000 description 2
- 238000012875 competitive assay Methods 0.000 description 2
- 238000004590 computer program Methods 0.000 description 2
- 208000028831 congenital heart disease Diseases 0.000 description 2
- 208000019227 congenital lipoid adrenal hyperplasia due to STAR deficency Diseases 0.000 description 2
- 208000029078 coronary artery disease Diseases 0.000 description 2
- 210000004351 coronary vessel Anatomy 0.000 description 2
- 230000002596 correlated effect Effects 0.000 description 2
- 201000009805 cryptogenic organizing pneumonia Diseases 0.000 description 2
- 206010061428 decreased appetite Diseases 0.000 description 2
- 230000007812 deficiency Effects 0.000 description 2
- 230000003412 degenerative effect Effects 0.000 description 2
- 201000001981 dermatomyositis Diseases 0.000 description 2
- 238000013461 design Methods 0.000 description 2
- 230000001496 desquamative effect Effects 0.000 description 2
- 239000003599 detergent Substances 0.000 description 2
- 235000005911 diet Nutrition 0.000 description 2
- 230000037213 diet Effects 0.000 description 2
- 230000029087 digestion Effects 0.000 description 2
- 208000010643 digestive system disease Diseases 0.000 description 2
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 2
- 108010054813 diprotin B Proteins 0.000 description 2
- 208000016097 disease of metabolism Diseases 0.000 description 2
- 238000002224 dissection Methods 0.000 description 2
- VZFRNCSOCOPNDB-UHFFFAOYSA-N domoic acid Natural products OC(=O)C(C)C=CC=C(C)C1CNC(C(O)=O)C1CC(O)=O VZFRNCSOCOPNDB-UHFFFAOYSA-N 0.000 description 2
- 208000010118 dystonia Diseases 0.000 description 2
- 208000002296 eclampsia Diseases 0.000 description 2
- 238000004520 electroporation Methods 0.000 description 2
- 230000002616 endonucleolytic effect Effects 0.000 description 2
- 208000037902 enteropathy Diseases 0.000 description 2
- 201000000165 epidural abscess Diseases 0.000 description 2
- 201000005619 esophageal carcinoma Diseases 0.000 description 2
- 210000003527 eukaryotic cell Anatomy 0.000 description 2
- 208000021045 exocrine pancreatic carcinoma Diseases 0.000 description 2
- 201000001155 extrinsic allergic alveolitis Diseases 0.000 description 2
- 201000001386 familial hypercholesterolemia Diseases 0.000 description 2
- 201000005577 familial hyperlipidemia Diseases 0.000 description 2
- 201000006061 fatal familial insomnia Diseases 0.000 description 2
- 239000010685 fatty oil Substances 0.000 description 2
- 239000000945 filler Substances 0.000 description 2
- 239000012530 fluid Substances 0.000 description 2
- 238000001943 fluorescence-activated cell sorting Methods 0.000 description 2
- 239000007850 fluorescent dye Substances 0.000 description 2
- 238000002509 fluorescent in situ hybridization Methods 0.000 description 2
- 201000006440 gangliosidosis Diseases 0.000 description 2
- 206010017758 gastric cancer Diseases 0.000 description 2
- 208000010749 gastric carcinoma Diseases 0.000 description 2
- 230000002496 gastric effect Effects 0.000 description 2
- 208000018685 gastrointestinal system disease Diseases 0.000 description 2
- 210000001035 gastrointestinal tract Anatomy 0.000 description 2
- 208000001288 gastroparesis Diseases 0.000 description 2
- 102000054767 gene variant Human genes 0.000 description 2
- 239000011521 glass Substances 0.000 description 2
- 108010085059 glutamyl-arginyl-proline Proteins 0.000 description 2
- 229960003180 glutathione Drugs 0.000 description 2
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 2
- 230000036541 health Effects 0.000 description 2
- 208000018578 heart valve disease Diseases 0.000 description 2
- 208000024798 heartburn Diseases 0.000 description 2
- 230000002440 hepatic effect Effects 0.000 description 2
- 208000007386 hepatic encephalopathy Diseases 0.000 description 2
- 206010019680 hepatic infarction Diseases 0.000 description 2
- 208000006454 hepatitis Diseases 0.000 description 2
- 231100000283 hepatitis Toxicity 0.000 description 2
- 206010073071 hepatocellular carcinoma Diseases 0.000 description 2
- 201000011200 hepatorenal syndrome Diseases 0.000 description 2
- 229940088597 hormone Drugs 0.000 description 2
- 239000005556 hormone Substances 0.000 description 2
- 210000003917 human chromosome Anatomy 0.000 description 2
- 210000004408 hybridoma Anatomy 0.000 description 2
- JYGXADMDTFJGBT-VWUMJDOOSA-N hydrocortisone Chemical compound O=C1CC[C@]2(C)[C@H]3[C@@H](O)C[C@](C)([C@@](CC4)(O)C(=O)CO)[C@@H]4[C@@H]3CCC2=C1 JYGXADMDTFJGBT-VWUMJDOOSA-N 0.000 description 2
- 239000001257 hydrogen Substances 0.000 description 2
- 230000002209 hydrophobic effect Effects 0.000 description 2
- 208000036796 hyperbilirubinemia Diseases 0.000 description 2
- 208000020346 hyperlipoproteinemia Diseases 0.000 description 2
- 206010020718 hyperplasia Diseases 0.000 description 2
- 208000022098 hypersensitivity pneumonitis Diseases 0.000 description 2
- 208000015210 hypertensive heart disease Diseases 0.000 description 2
- 208000029498 hypoalphalipoproteinemia Diseases 0.000 description 2
- 208000003532 hypothyroidism Diseases 0.000 description 2
- 230000002989 hypothyroidism Effects 0.000 description 2
- 230000036737 immune function Effects 0.000 description 2
- 230000005847 immunogenicity Effects 0.000 description 2
- 230000006698 induction Effects 0.000 description 2
- 208000027139 infectious colitis Diseases 0.000 description 2
- 201000007119 infective endocarditis Diseases 0.000 description 2
- 230000002757 inflammatory effect Effects 0.000 description 2
- 230000004054 inflammatory process Effects 0.000 description 2
- 208000030603 inherited susceptibility to asthma Diseases 0.000 description 2
- 208000014674 injury Diseases 0.000 description 2
- 230000010354 integration Effects 0.000 description 2
- 230000002452 interceptive effect Effects 0.000 description 2
- 239000000543 intermediate Substances 0.000 description 2
- 208000036971 interstitial lung disease 2 Diseases 0.000 description 2
- 208000028774 intestinal disease Diseases 0.000 description 2
- 208000003243 intestinal obstruction Diseases 0.000 description 2
- 238000007917 intracranial administration Methods 0.000 description 2
- 201000002161 intrahepatic cholestasis of pregnancy Diseases 0.000 description 2
- 208000002551 irritable bowel syndrome Diseases 0.000 description 2
- 230000000302 ischemic effect Effects 0.000 description 2
- 108010027338 isoleucylcysteine Proteins 0.000 description 2
- 108010078274 isoleucylvaline Proteins 0.000 description 2
- FZWBNHMXJMCXLU-BLAUPYHCSA-N isomaltotriose Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@@H]1OC[C@@H]1[C@@H](O)[C@H](O)[C@@H](O)[C@@H](OC[C@@H](O)[C@@H](O)[C@H](O)[C@@H](O)C=O)O1 FZWBNHMXJMCXLU-BLAUPYHCSA-N 0.000 description 2
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 2
- 208000017169 kidney disease Diseases 0.000 description 2
- 206010023497 kuru Diseases 0.000 description 2
- 239000008101 lactose Substances 0.000 description 2
- 230000003902 lesion Effects 0.000 description 2
- 230000013190 lipid storage Effects 0.000 description 2
- 208000006132 lipodystrophy Diseases 0.000 description 2
- 208000000680 lipomatosis Diseases 0.000 description 2
- 208000019423 liver disease Diseases 0.000 description 2
- 201000003453 lung abscess Diseases 0.000 description 2
- 208000037841 lung tumor Diseases 0.000 description 2
- 210000003712 lysosome Anatomy 0.000 description 2
- 230000001868 lysosomic effect Effects 0.000 description 2
- 108010003700 lysyl aspartic acid Proteins 0.000 description 2
- 108010038320 lysylphenylalanine Proteins 0.000 description 2
- HQKMJHAJHXVSDF-UHFFFAOYSA-L magnesium stearate Chemical compound [Mg+2].CCCCCCCCCCCCCCCCCC([O-])=O.CCCCCCCCCCCCCCCCCC([O-])=O HQKMJHAJHXVSDF-UHFFFAOYSA-L 0.000 description 2
- 230000005291 magnetic effect Effects 0.000 description 2
- 239000000594 mannitol Substances 0.000 description 2
- 235000010355 mannitol Nutrition 0.000 description 2
- 201000007261 marantic endocarditis Diseases 0.000 description 2
- 238000005259 measurement Methods 0.000 description 2
- 230000002503 metabolic effect Effects 0.000 description 2
- 210000003470 mitochondria Anatomy 0.000 description 2
- 238000010369 molecular cloning Methods 0.000 description 2
- 238000000329 molecular dynamics simulation Methods 0.000 description 2
- 230000036651 mood Effects 0.000 description 2
- 208000005264 motor neuron disease Diseases 0.000 description 2
- 201000006417 multiple sclerosis Diseases 0.000 description 2
- 201000000585 muscular atrophy Diseases 0.000 description 2
- 201000006938 muscular dystrophy Diseases 0.000 description 2
- 206010028417 myasthenia gravis Diseases 0.000 description 2
- 208000010125 myocardial infarction Diseases 0.000 description 2
- 208000031225 myocardial ischemia Diseases 0.000 description 2
- 230000008693 nausea Effects 0.000 description 2
- 230000017074 necrotic cell death Effects 0.000 description 2
- 208000018389 neoplasm of cerebral hemisphere Diseases 0.000 description 2
- 230000001613 neoplastic effect Effects 0.000 description 2
- 201000004931 neurofibromatosis Diseases 0.000 description 2
- 208000018360 neuromuscular disease Diseases 0.000 description 2
- QJGQUHMNIGDVPM-UHFFFAOYSA-N nitrogen group Chemical group [N] QJGQUHMNIGDVPM-UHFFFAOYSA-N 0.000 description 2
- 208000016135 nonbacterial thrombotic endocarditis Diseases 0.000 description 2
- 239000003960 organic solvent Substances 0.000 description 2
- 208000008443 pancreatic carcinoma Diseases 0.000 description 2
- 206010033675 panniculitis Diseases 0.000 description 2
- 208000000689 peptic esophagitis Diseases 0.000 description 2
- 208000011906 peptic ulcer disease Diseases 0.000 description 2
- 238000010647 peptide synthesis reaction Methods 0.000 description 2
- 208000008494 pericarditis Diseases 0.000 description 2
- 208000029308 periodic paralysis Diseases 0.000 description 2
- 208000027232 peripheral nervous system disease Diseases 0.000 description 2
- 239000008177 pharmaceutical agent Substances 0.000 description 2
- 239000000825 pharmaceutical preparation Substances 0.000 description 2
- 108010024607 phenylalanylalanine Proteins 0.000 description 2
- 108010012581 phenylalanylglutamate Proteins 0.000 description 2
- 108010051242 phenylalanylserine Proteins 0.000 description 2
- 108010083476 phenylalanyltryptophan Proteins 0.000 description 2
- 150000008104 phosphatidylethanolamines Chemical class 0.000 description 2
- 102000005875 phosphatidylserine decarboxylase Human genes 0.000 description 2
- 230000026731 phosphorylation Effects 0.000 description 2
- 238000006366 phosphorylation reaction Methods 0.000 description 2
- 201000003437 pleural cancer Diseases 0.000 description 2
- 206010035653 pneumoconiosis Diseases 0.000 description 2
- 201000003144 pneumothorax Diseases 0.000 description 2
- 238000002264 polyacrylamide gel electrophoresis Methods 0.000 description 2
- 229920001223 polyethylene glycol Polymers 0.000 description 2
- 208000005987 polymyositis Diseases 0.000 description 2
- 235000013855 polyvinylpyrrolidone Nutrition 0.000 description 2
- 239000001267 polyvinylpyrrolidone Substances 0.000 description 2
- 229920000036 polyvinylpyrrolidone Polymers 0.000 description 2
- 230000001323 posttranslational effect Effects 0.000 description 2
- 201000011461 pre-eclampsia Diseases 0.000 description 2
- 239000002243 precursor Substances 0.000 description 2
- 230000002028 premature Effects 0.000 description 2
- 208000032666 primary 1 hypoalphalipoproteinemia Diseases 0.000 description 2
- 201000000742 primary sclerosing cholangitis Diseases 0.000 description 2
- 230000000750 progressive effect Effects 0.000 description 2
- 201000002212 progressive supranuclear palsy Diseases 0.000 description 2
- 108010053725 prolylvaline Proteins 0.000 description 2
- 230000000644 propagated effect Effects 0.000 description 2
- 230000004952 protein activity Effects 0.000 description 2
- 208000020016 psychiatric disease Diseases 0.000 description 2
- 201000003489 pulmonary alveolar proteinosis Diseases 0.000 description 2
- 208000005333 pulmonary edema Diseases 0.000 description 2
- 230000002685 pulmonary effect Effects 0.000 description 2
- 201000009732 pulmonary eosinophilia Diseases 0.000 description 2
- 201000003456 pulmonary hemosiderosis Diseases 0.000 description 2
- 208000002815 pulmonary hypertension Diseases 0.000 description 2
- 230000007575 pulmonary infarction Effects 0.000 description 2
- 208000008128 pulmonary tuberculosis Diseases 0.000 description 2
- 208000024981 pyrosis Diseases 0.000 description 2
- 238000011002 quantification Methods 0.000 description 2
- 206010061928 radiculitis Diseases 0.000 description 2
- 230000003362 replicative effect Effects 0.000 description 2
- 230000000717 retained effect Effects 0.000 description 2
- 201000003068 rheumatic fever Diseases 0.000 description 2
- 208000004124 rheumatic heart disease Diseases 0.000 description 2
- 210000003705 ribosome Anatomy 0.000 description 2
- 201000000306 sarcoidosis Diseases 0.000 description 2
- 201000000980 schizophrenia Diseases 0.000 description 2
- 208000010157 sclerosing cholangitis Diseases 0.000 description 2
- 230000028327 secretion Effects 0.000 description 2
- 238000000926 separation method Methods 0.000 description 2
- 238000002864 sequence alignment Methods 0.000 description 2
- 230000035939 shock Effects 0.000 description 2
- 230000019491 signal transduction Effects 0.000 description 2
- 235000019812 sodium carboxymethyl cellulose Nutrition 0.000 description 2
- 229920001027 sodium carboxymethylcellulose Polymers 0.000 description 2
- 239000011780 sodium chloride Substances 0.000 description 2
- 235000019698 starch Nutrition 0.000 description 2
- 230000007863 steatosis Effects 0.000 description 2
- 201000000498 stomach carcinoma Diseases 0.000 description 2
- 239000005720 sucrose Substances 0.000 description 2
- 238000001356 surgical procedure Methods 0.000 description 2
- 208000024891 symptom Diseases 0.000 description 2
- 108060008004 synaptotagmin Proteins 0.000 description 2
- 102000003137 synaptotagmin Human genes 0.000 description 2
- 201000000596 systemic lupus erythematosus Diseases 0.000 description 2
- 208000016505 systemic primary carnitine deficiency disease Diseases 0.000 description 2
- 239000000454 talc Substances 0.000 description 2
- 229910052623 talc Inorganic materials 0.000 description 2
- 230000008685 targeting Effects 0.000 description 2
- 230000002537 thrombolytic effect Effects 0.000 description 2
- 230000000699 topical effect Effects 0.000 description 2
- 230000008733 trauma Effects 0.000 description 2
- 239000002753 trypsin inhibitor Substances 0.000 description 2
- 208000009999 tuberous sclerosis Diseases 0.000 description 2
- 229940035893 uracil Drugs 0.000 description 2
- 108010009962 valyltyrosine Proteins 0.000 description 2
- 208000027185 varicose disease Diseases 0.000 description 2
- 230000002792 vascular Effects 0.000 description 2
- 201000011531 vascular cancer Diseases 0.000 description 2
- 208000019553 vascular disease Diseases 0.000 description 2
- 206010055031 vascular neoplasm Diseases 0.000 description 2
- 201000010044 viral meningitis Diseases 0.000 description 2
- 208000009421 viral pneumonia Diseases 0.000 description 2
- KYRUKRFVOACELK-UHFFFAOYSA-N (2,5-dioxopyrrolidin-1-yl) 3-(4-hydroxyphenyl)propanoate Chemical compound C1=CC(O)=CC=C1CCC(=O)ON1C(=O)CCC1=O KYRUKRFVOACELK-UHFFFAOYSA-N 0.000 description 1
- LNAZSHAWQACDHT-XIYTZBAFSA-N (2r,3r,4s,5r,6s)-4,5-dimethoxy-2-(methoxymethyl)-3-[(2s,3r,4s,5r,6r)-3,4,5-trimethoxy-6-(methoxymethyl)oxan-2-yl]oxy-6-[(2r,3r,4s,5r,6r)-4,5,6-trimethoxy-2-(methoxymethyl)oxan-3-yl]oxyoxane Chemical compound CO[C@@H]1[C@@H](OC)[C@H](OC)[C@@H](COC)O[C@H]1O[C@H]1[C@H](OC)[C@@H](OC)[C@H](O[C@H]2[C@@H]([C@@H](OC)[C@H](OC)O[C@@H]2COC)OC)O[C@@H]1COC LNAZSHAWQACDHT-XIYTZBAFSA-N 0.000 description 1
- COEXAQSTZUWMRI-STQMWFEESA-N (2s)-1-[2-[[(2s)-2-amino-3-(4-hydroxyphenyl)propanoyl]amino]acetyl]pyrrolidine-2-carboxylic acid Chemical compound C([C@H](N)C(=O)NCC(=O)N1[C@@H](CCC1)C(O)=O)C1=CC=C(O)C=C1 COEXAQSTZUWMRI-STQMWFEESA-N 0.000 description 1
- ASWBNKHCZGQVJV-UHFFFAOYSA-N (3-hexadecanoyloxy-2-hydroxypropyl) 2-(trimethylazaniumyl)ethyl phosphate Chemical compound CCCCCCCCCCCCCCCC(=O)OCC(O)COP([O-])(=O)OCC[N+](C)(C)C ASWBNKHCZGQVJV-UHFFFAOYSA-N 0.000 description 1
- 102000040650 (ribonucleotides)n+m Human genes 0.000 description 1
- UHDGCWIWMRVCDJ-UHFFFAOYSA-N 1-beta-D-Xylofuranosyl-NH-Cytosine Natural products O=C1N=C(N)C=CN1C1C(O)C(O)C(CO)O1 UHDGCWIWMRVCDJ-UHFFFAOYSA-N 0.000 description 1
- IXPNQXFRVYWDDI-UHFFFAOYSA-N 1-methyl-2,4-dioxo-1,3-diazinane-5-carboximidamide Chemical compound CN1CC(C(N)=N)C(=O)NC1=O IXPNQXFRVYWDDI-UHFFFAOYSA-N 0.000 description 1
- OWEGMIWEEQEYGQ-UHFFFAOYSA-N 100676-05-9 Natural products OC1C(O)C(O)C(CO)OC1OCC1C(O)C(O)C(O)C(OC2C(OC(O)C(O)C2O)CO)O1 OWEGMIWEEQEYGQ-UHFFFAOYSA-N 0.000 description 1
- UFBJCMHMOXMLKC-UHFFFAOYSA-N 2,4-dinitrophenol Chemical compound OC1=CC=C([N+]([O-])=O)C=C1[N+]([O-])=O UFBJCMHMOXMLKC-UHFFFAOYSA-N 0.000 description 1
- AMBKWKJGMIHTJR-UHFFFAOYSA-N 2-[2-[2-[(2-azaniumyl-3-methylbutanoyl)amino]propanoylamino]propanoylamino]-3-phenylpropanoate Chemical compound CC(C)C(N)C(=O)NC(C)C(=O)NC(C)C(=O)NC(C(O)=O)CC1=CC=CC=C1 AMBKWKJGMIHTJR-UHFFFAOYSA-N 0.000 description 1
- PFCLMNDDPTZJHQ-XLPZGREQSA-N 2-amino-7-[(2r,4s,5r)-4-hydroxy-5-(hydroxymethyl)oxolan-2-yl]-1h-pyrrolo[2,3-d]pyrimidin-4-one Chemical compound C1=CC=2C(=O)NC(N)=NC=2N1[C@H]1C[C@H](O)[C@@H](CO)O1 PFCLMNDDPTZJHQ-XLPZGREQSA-N 0.000 description 1
- HZLCGUXUOFWCCN-UHFFFAOYSA-N 2-hydroxynonadecane-1,2,3-tricarboxylic acid Chemical compound CCCCCCCCCCCCCCCCC(C(O)=O)C(O)(C(O)=O)CC(O)=O HZLCGUXUOFWCCN-UHFFFAOYSA-N 0.000 description 1
- WOVKYSAHUYNSMH-RRKCRQDMSA-N 5-bromodeoxyuridine Chemical compound C1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)NC(=O)C(Br)=C1 WOVKYSAHUYNSMH-RRKCRQDMSA-N 0.000 description 1
- LRSASMSXMSNRBT-UHFFFAOYSA-N 5-methylcytosine Chemical compound CC1=CNC(=O)N=C1N LRSASMSXMSNRBT-UHFFFAOYSA-N 0.000 description 1
- ZKHQWZAMYRWXGA-KQYNXXCUSA-J ATP(4-) Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](COP([O-])(=O)OP([O-])(=O)OP([O-])([O-])=O)[C@@H](O)[C@H]1O ZKHQWZAMYRWXGA-KQYNXXCUSA-J 0.000 description 1
- 244000215068 Acacia senegal Species 0.000 description 1
- 229930024421 Adenine Natural products 0.000 description 1
- GFFGJBXGBJISGV-UHFFFAOYSA-N Adenine Chemical compound NC1=NC=NC2=C1N=CN2 GFFGJBXGBJISGV-UHFFFAOYSA-N 0.000 description 1
- 108010024223 Adenine phosphoribosyltransferase Proteins 0.000 description 1
- ZKHQWZAMYRWXGA-UHFFFAOYSA-N Adenosine triphosphate Natural products C1=NC=2C(N)=NC=NC=2N1C1OC(COP(O)(=O)OP(O)(=O)OP(O)(O)=O)C(O)C1O ZKHQWZAMYRWXGA-UHFFFAOYSA-N 0.000 description 1
- 229920000936 Agarose Polymers 0.000 description 1
- HHGYNJRJIINWAK-FXQIFTODSA-N Ala-Ala-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N HHGYNJRJIINWAK-FXQIFTODSA-N 0.000 description 1
- DKJPOZOEBONHFS-ZLUOBGJFSA-N Ala-Ala-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O DKJPOZOEBONHFS-ZLUOBGJFSA-N 0.000 description 1
- YLTKNGYYPIWKHZ-ACZMJKKPSA-N Ala-Ala-Glu Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O YLTKNGYYPIWKHZ-ACZMJKKPSA-N 0.000 description 1
- FJVAQLJNTSUQPY-CIUDSAMLSA-N Ala-Ala-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN FJVAQLJNTSUQPY-CIUDSAMLSA-N 0.000 description 1
- ODWSTKXGQGYHSH-FXQIFTODSA-N Ala-Arg-Ala Chemical compound C[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O ODWSTKXGQGYHSH-FXQIFTODSA-N 0.000 description 1
- JBGSZRYCXBPWGX-BQBZGAKWSA-N Ala-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N JBGSZRYCXBPWGX-BQBZGAKWSA-N 0.000 description 1
- TTXMOJWKNRJWQJ-FXQIFTODSA-N Ala-Arg-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N TTXMOJWKNRJWQJ-FXQIFTODSA-N 0.000 description 1
- DWINFPQUSSHSFS-UVBJJODRSA-N Ala-Arg-Trp Chemical compound N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C12)C(=O)O DWINFPQUSSHSFS-UVBJJODRSA-N 0.000 description 1
- NHCPCLJZRSIDHS-ZLUOBGJFSA-N Ala-Asp-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O NHCPCLJZRSIDHS-ZLUOBGJFSA-N 0.000 description 1
- LZRNYBIJOSKKRJ-XVYDVKMFSA-N Ala-Asp-His Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N LZRNYBIJOSKKRJ-XVYDVKMFSA-N 0.000 description 1
- NKJBKNVQHBZUIX-ACZMJKKPSA-N Ala-Gln-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NKJBKNVQHBZUIX-ACZMJKKPSA-N 0.000 description 1
- RXTBLQVXNIECFP-FXQIFTODSA-N Ala-Gln-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O RXTBLQVXNIECFP-FXQIFTODSA-N 0.000 description 1
- CRWFEKLFPVRPBV-CIUDSAMLSA-N Ala-Gln-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O CRWFEKLFPVRPBV-CIUDSAMLSA-N 0.000 description 1
- SFNFGFDRYJKZKN-XQXXSGGOSA-N Ala-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C)N)O SFNFGFDRYJKZKN-XQXXSGGOSA-N 0.000 description 1
- YIGLXQRFQVWFEY-NRPADANISA-N Ala-Gln-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O YIGLXQRFQVWFEY-NRPADANISA-N 0.000 description 1
- FUSPCLTUKXQREV-ACZMJKKPSA-N Ala-Glu-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O FUSPCLTUKXQREV-ACZMJKKPSA-N 0.000 description 1
- NWVVKQZOVSTDBQ-CIUDSAMLSA-N Ala-Glu-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NWVVKQZOVSTDBQ-CIUDSAMLSA-N 0.000 description 1
- BGNLUHXLSAQYRQ-FXQIFTODSA-N Ala-Glu-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O BGNLUHXLSAQYRQ-FXQIFTODSA-N 0.000 description 1
- HXNNRBHASOSVPG-GUBZILKMSA-N Ala-Glu-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HXNNRBHASOSVPG-GUBZILKMSA-N 0.000 description 1
- FBHOPGDGELNWRH-DRZSPHRISA-N Ala-Glu-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O FBHOPGDGELNWRH-DRZSPHRISA-N 0.000 description 1
- PUBLUECXJRHTBK-ACZMJKKPSA-N Ala-Glu-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O PUBLUECXJRHTBK-ACZMJKKPSA-N 0.000 description 1
- XYTNPQNAZREREP-XQXXSGGOSA-N Ala-Glu-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XYTNPQNAZREREP-XQXXSGGOSA-N 0.000 description 1
- OMMDTNGURYRDAC-NRPADANISA-N Ala-Glu-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OMMDTNGURYRDAC-NRPADANISA-N 0.000 description 1
- CXISPYVYMQWFLE-VKHMYHEASA-N Ala-Gly Chemical compound C[C@H]([NH3+])C(=O)NCC([O-])=O CXISPYVYMQWFLE-VKHMYHEASA-N 0.000 description 1
- MPLOSMWGDNJSEV-WHFBIAKZSA-N Ala-Gly-Asp Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MPLOSMWGDNJSEV-WHFBIAKZSA-N 0.000 description 1
- OBVSBEYOMDWLRJ-BFHQHQDPSA-N Ala-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N OBVSBEYOMDWLRJ-BFHQHQDPSA-N 0.000 description 1
- LTSBJNNXPBBNDT-HGNGGELXSA-N Ala-His-Gln Chemical compound N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(=O)O LTSBJNNXPBBNDT-HGNGGELXSA-N 0.000 description 1
- NYDBKUNVSALYPX-NAKRPEOUSA-N Ala-Ile-Arg Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NYDBKUNVSALYPX-NAKRPEOUSA-N 0.000 description 1
- HQJKCXHQNUCKMY-GHCJXIJMSA-N Ala-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C)N HQJKCXHQNUCKMY-GHCJXIJMSA-N 0.000 description 1
- CKLDHDOIYBVUNP-KBIXCLLPSA-N Ala-Ile-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O CKLDHDOIYBVUNP-KBIXCLLPSA-N 0.000 description 1
- TZDNWXDLYFIFPT-BJDJZHNGSA-N Ala-Ile-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O TZDNWXDLYFIFPT-BJDJZHNGSA-N 0.000 description 1
- QCTFKEJEIMPOLW-JURCDPSOSA-N Ala-Ile-Phe Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QCTFKEJEIMPOLW-JURCDPSOSA-N 0.000 description 1
- OKIKVSXTXVVFDV-MMWGEVLESA-N Ala-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N OKIKVSXTXVVFDV-MMWGEVLESA-N 0.000 description 1
- YHKANGMVQWRMAP-DCAQKATOSA-N Ala-Leu-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YHKANGMVQWRMAP-DCAQKATOSA-N 0.000 description 1
- LBYMZCVBOKYZNS-CIUDSAMLSA-N Ala-Leu-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O LBYMZCVBOKYZNS-CIUDSAMLSA-N 0.000 description 1
- NOGFDULFCFXBHB-CIUDSAMLSA-N Ala-Leu-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)O)N NOGFDULFCFXBHB-CIUDSAMLSA-N 0.000 description 1
- OPZJWMJPCNNZNT-DCAQKATOSA-N Ala-Leu-Met Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)O)N OPZJWMJPCNNZNT-DCAQKATOSA-N 0.000 description 1
- MEFILNJXAVSUTO-JXUBOQSCSA-N Ala-Leu-Thr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MEFILNJXAVSUTO-JXUBOQSCSA-N 0.000 description 1
- QUIGLPSHIFPEOV-CIUDSAMLSA-N Ala-Lys-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O QUIGLPSHIFPEOV-CIUDSAMLSA-N 0.000 description 1
- CHFFHQUVXHEGBY-GARJFASQSA-N Ala-Lys-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N CHFFHQUVXHEGBY-GARJFASQSA-N 0.000 description 1
- KQESEZXHYOUIIM-CQDKDKBSSA-N Ala-Lys-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KQESEZXHYOUIIM-CQDKDKBSSA-N 0.000 description 1
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 1
- DWYROCSXOOMOEU-CIUDSAMLSA-N Ala-Met-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N DWYROCSXOOMOEU-CIUDSAMLSA-N 0.000 description 1
- XSTZMVAYYCJTNR-DCAQKATOSA-N Ala-Met-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O XSTZMVAYYCJTNR-DCAQKATOSA-N 0.000 description 1
- GKAZXNDATBWNBI-DCAQKATOSA-N Ala-Met-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)O)N GKAZXNDATBWNBI-DCAQKATOSA-N 0.000 description 1
- DRARURMRLANNLS-GUBZILKMSA-N Ala-Met-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O DRARURMRLANNLS-GUBZILKMSA-N 0.000 description 1
- XRUJOVRWNMBAAA-NHCYSSNCSA-N Ala-Phe-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 XRUJOVRWNMBAAA-NHCYSSNCSA-N 0.000 description 1
- CJQAEJMHBAOQHA-DLOVCJGASA-N Ala-Phe-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CJQAEJMHBAOQHA-DLOVCJGASA-N 0.000 description 1
- IHMCQESUJVZTKW-UBHSHLNASA-N Ala-Phe-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 IHMCQESUJVZTKW-UBHSHLNASA-N 0.000 description 1
- IPZQNYYAYVRKKK-FXQIFTODSA-N Ala-Pro-Ala Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IPZQNYYAYVRKKK-FXQIFTODSA-N 0.000 description 1
- BTRULDJUUVGRNE-DCAQKATOSA-N Ala-Pro-Lys Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O BTRULDJUUVGRNE-DCAQKATOSA-N 0.000 description 1
- CQJHFKKGZXKZBC-BPNCWPANSA-N Ala-Pro-Tyr Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 CQJHFKKGZXKZBC-BPNCWPANSA-N 0.000 description 1
- YHBDGLZYNIARKJ-GUBZILKMSA-N Ala-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N YHBDGLZYNIARKJ-GUBZILKMSA-N 0.000 description 1
- MSWSRLGNLKHDEI-ACZMJKKPSA-N Ala-Ser-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O MSWSRLGNLKHDEI-ACZMJKKPSA-N 0.000 description 1
- HOVPGJUNRLMIOZ-CIUDSAMLSA-N Ala-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N HOVPGJUNRLMIOZ-CIUDSAMLSA-N 0.000 description 1
- MMLHRUJLOUSRJX-CIUDSAMLSA-N Ala-Ser-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN MMLHRUJLOUSRJX-CIUDSAMLSA-N 0.000 description 1
- PEEYDECOOVQKRZ-DLOVCJGASA-N Ala-Ser-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PEEYDECOOVQKRZ-DLOVCJGASA-N 0.000 description 1
- NCQMBSJGJMYKCK-ZLUOBGJFSA-N Ala-Ser-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O NCQMBSJGJMYKCK-ZLUOBGJFSA-N 0.000 description 1
- VRTOMXFZHGWHIJ-KZVJFYERSA-N Ala-Thr-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VRTOMXFZHGWHIJ-KZVJFYERSA-N 0.000 description 1
- HCBKAOZYACJUEF-XQXXSGGOSA-N Ala-Thr-Gln Chemical compound N[C@@H](C)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCC(N)=O)C(=O)O HCBKAOZYACJUEF-XQXXSGGOSA-N 0.000 description 1
- KTXKIYXZQFWJKB-VZFHVOOUSA-N Ala-Thr-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O KTXKIYXZQFWJKB-VZFHVOOUSA-N 0.000 description 1
- LFFOJBOTZUWINF-ZANVPECISA-N Ala-Trp-Gly Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C)C(=O)NCC(O)=O)=CNC2=C1 LFFOJBOTZUWINF-ZANVPECISA-N 0.000 description 1
- PHQXWZGXKAFWAZ-ZLIFDBKOSA-N Ala-Trp-Lys Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O)=CNC2=C1 PHQXWZGXKAFWAZ-ZLIFDBKOSA-N 0.000 description 1
- PGNNQOJOEGFAOR-KWQFWETISA-N Ala-Tyr-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 PGNNQOJOEGFAOR-KWQFWETISA-N 0.000 description 1
- XAXMJQUMRJAFCH-CQDKDKBSSA-N Ala-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 XAXMJQUMRJAFCH-CQDKDKBSSA-N 0.000 description 1
- YJHKTAMKPGFJCT-NRPADANISA-N Ala-Val-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O YJHKTAMKPGFJCT-NRPADANISA-N 0.000 description 1
- CLOMBHBBUKAUBP-LSJOCFKGSA-N Ala-Val-His Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N CLOMBHBBUKAUBP-LSJOCFKGSA-N 0.000 description 1
- LYILPUNCKACNGF-NAKRPEOUSA-N Ala-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C)N LYILPUNCKACNGF-NAKRPEOUSA-N 0.000 description 1
- 108010025188 Alcohol oxidase Proteins 0.000 description 1
- 208000007848 Alcoholism Diseases 0.000 description 1
- 102000002260 Alkaline Phosphatase Human genes 0.000 description 1
- 108020004774 Alkaline Phosphatase Proteins 0.000 description 1
- 108090000672 Annexin A5 Proteins 0.000 description 1
- 102000004121 Annexin A5 Human genes 0.000 description 1
- 108020000948 Antisense Oligonucleotides Proteins 0.000 description 1
- 102100029470 Apolipoprotein E Human genes 0.000 description 1
- 101710095339 Apolipoprotein E Proteins 0.000 description 1
- 102000018757 Apolipoprotein L1 Human genes 0.000 description 1
- 108010052469 Apolipoprotein L1 Proteins 0.000 description 1
- DFCIPNHFKOQAME-FXQIFTODSA-N Arg-Ala-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DFCIPNHFKOQAME-FXQIFTODSA-N 0.000 description 1
- GXCSUJQOECMKPV-CIUDSAMLSA-N Arg-Ala-Gln Chemical compound C[C@H](NC(=O)[C@@H](N)CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O GXCSUJQOECMKPV-CIUDSAMLSA-N 0.000 description 1
- VBFJESQBIWCWRL-DCAQKATOSA-N Arg-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCNC(N)=N VBFJESQBIWCWRL-DCAQKATOSA-N 0.000 description 1
- OTOXOKCIIQLMFH-KZVJFYERSA-N Arg-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N OTOXOKCIIQLMFH-KZVJFYERSA-N 0.000 description 1
- MUXONAMCEUBVGA-DCAQKATOSA-N Arg-Arg-Gln Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(N)=O)C(O)=O MUXONAMCEUBVGA-DCAQKATOSA-N 0.000 description 1
- UXJCMQFPDWCHKX-DCAQKATOSA-N Arg-Arg-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O UXJCMQFPDWCHKX-DCAQKATOSA-N 0.000 description 1
- JGDGLDNAQJJGJI-AVGNSLFASA-N Arg-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCN=C(N)N)N JGDGLDNAQJJGJI-AVGNSLFASA-N 0.000 description 1
- HJVGMOYJDDXLMI-AVGNSLFASA-N Arg-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CCCNC(N)=N HJVGMOYJDDXLMI-AVGNSLFASA-N 0.000 description 1
- YUIGJDNAGKJLDO-JYJNAYRXSA-N Arg-Arg-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YUIGJDNAGKJLDO-JYJNAYRXSA-N 0.000 description 1
- DPXDVGDLWJYZBH-GUBZILKMSA-N Arg-Asn-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O DPXDVGDLWJYZBH-GUBZILKMSA-N 0.000 description 1
- WESHVRNMNFMVBE-FXQIFTODSA-N Arg-Asn-Asp Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)CN=C(N)N WESHVRNMNFMVBE-FXQIFTODSA-N 0.000 description 1
- CPSHGRGUPZBMOK-CIUDSAMLSA-N Arg-Asn-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O CPSHGRGUPZBMOK-CIUDSAMLSA-N 0.000 description 1
- BVBKBQRPOJFCQM-DCAQKATOSA-N Arg-Asn-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BVBKBQRPOJFCQM-DCAQKATOSA-N 0.000 description 1
- OTUQSEPIIVBYEM-IHRRRGAJSA-N Arg-Asn-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OTUQSEPIIVBYEM-IHRRRGAJSA-N 0.000 description 1
- OZNSCVPYWZRQPY-CIUDSAMLSA-N Arg-Asp-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O OZNSCVPYWZRQPY-CIUDSAMLSA-N 0.000 description 1
- KMSHNDWHPWXPEC-BQBZGAKWSA-N Arg-Asp-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KMSHNDWHPWXPEC-BQBZGAKWSA-N 0.000 description 1
- JSHVMZANPXCDTL-GMOBBJLQSA-N Arg-Asp-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JSHVMZANPXCDTL-GMOBBJLQSA-N 0.000 description 1
- FBLMOFHNVQBKRR-IHRRRGAJSA-N Arg-Asp-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FBLMOFHNVQBKRR-IHRRRGAJSA-N 0.000 description 1
- JVMKBJNSRZWDBO-FXQIFTODSA-N Arg-Cys-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O JVMKBJNSRZWDBO-FXQIFTODSA-N 0.000 description 1
- QAODJPUKWNNNRP-DCAQKATOSA-N Arg-Glu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QAODJPUKWNNNRP-DCAQKATOSA-N 0.000 description 1
- RKRSYHCNPFGMTA-CIUDSAMLSA-N Arg-Glu-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O RKRSYHCNPFGMTA-CIUDSAMLSA-N 0.000 description 1
- MZRBYBIQTIKERR-GUBZILKMSA-N Arg-Glu-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MZRBYBIQTIKERR-GUBZILKMSA-N 0.000 description 1
- PBSOQGZLPFVXPU-YUMQZZPRSA-N Arg-Glu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PBSOQGZLPFVXPU-YUMQZZPRSA-N 0.000 description 1
- NKBQZKVMKJJDLX-SRVKXCTJSA-N Arg-Glu-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NKBQZKVMKJJDLX-SRVKXCTJSA-N 0.000 description 1
- OGUPCHKBOKJFMA-SRVKXCTJSA-N Arg-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N OGUPCHKBOKJFMA-SRVKXCTJSA-N 0.000 description 1
- PPPXVIBMLFWNSK-BQBZGAKWSA-N Arg-Gly-Cys Chemical compound C(C[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N)CN=C(N)N PPPXVIBMLFWNSK-BQBZGAKWSA-N 0.000 description 1
- PHHRSPBBQUFULD-UWVGGRQHSA-N Arg-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCCN=C(N)N)N PHHRSPBBQUFULD-UWVGGRQHSA-N 0.000 description 1
- WVNFNPGXYADPPO-BQBZGAKWSA-N Arg-Gly-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O WVNFNPGXYADPPO-BQBZGAKWSA-N 0.000 description 1
- KRQSPVKUISQQFS-FJXKBIBVSA-N Arg-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCN=C(N)N KRQSPVKUISQQFS-FJXKBIBVSA-N 0.000 description 1
- YBIAYFFIVAZXPK-AVGNSLFASA-N Arg-His-Arg Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O YBIAYFFIVAZXPK-AVGNSLFASA-N 0.000 description 1
- YQGZIRIYGHNSQO-ZPFDUUQYSA-N Arg-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YQGZIRIYGHNSQO-ZPFDUUQYSA-N 0.000 description 1
- AGVNTAUPLWIQEN-ZPFDUUQYSA-N Arg-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N AGVNTAUPLWIQEN-ZPFDUUQYSA-N 0.000 description 1
- UAOSDDXCTBIPCA-QXEWZRGKSA-N Arg-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UAOSDDXCTBIPCA-QXEWZRGKSA-N 0.000 description 1
- GXXWTNKNFFKTJB-NAKRPEOUSA-N Arg-Ile-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O GXXWTNKNFFKTJB-NAKRPEOUSA-N 0.000 description 1
- HJDNZFIYILEIKR-OSUNSFLBSA-N Arg-Ile-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HJDNZFIYILEIKR-OSUNSFLBSA-N 0.000 description 1
- FNXCAFKDGBROCU-STECZYCISA-N Arg-Ile-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FNXCAFKDGBROCU-STECZYCISA-N 0.000 description 1
- UHFUZWSZQKMDSX-DCAQKATOSA-N Arg-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UHFUZWSZQKMDSX-DCAQKATOSA-N 0.000 description 1
- DNUKXVMPARLPFN-XUXIUFHCSA-N Arg-Leu-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DNUKXVMPARLPFN-XUXIUFHCSA-N 0.000 description 1
- NOZYDJOPOGKUSR-AVGNSLFASA-N Arg-Leu-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O NOZYDJOPOGKUSR-AVGNSLFASA-N 0.000 description 1
- NMRHDSAOIURTNT-RWMBFGLXSA-N Arg-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N NMRHDSAOIURTNT-RWMBFGLXSA-N 0.000 description 1
- COXMUHNBYCVVRG-DCAQKATOSA-N Arg-Leu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O COXMUHNBYCVVRG-DCAQKATOSA-N 0.000 description 1
- JEOCWTUOMKEEMF-RHYQMDGZSA-N Arg-Leu-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JEOCWTUOMKEEMF-RHYQMDGZSA-N 0.000 description 1
- BNYNOWJESJJIOI-XUXIUFHCSA-N Arg-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCN=C(N)N)N BNYNOWJESJJIOI-XUXIUFHCSA-N 0.000 description 1
- BTJVOUQWFXABOI-IHRRRGAJSA-N Arg-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCNC(N)=N BTJVOUQWFXABOI-IHRRRGAJSA-N 0.000 description 1
- JOADBFCFJGNIKF-GUBZILKMSA-N Arg-Met-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O JOADBFCFJGNIKF-GUBZILKMSA-N 0.000 description 1
- MNBHKGYCLBUIBC-UFYCRDLUSA-N Arg-Phe-Phe Chemical compound C([C@H](NC(=O)[C@H](CCCNC(N)=N)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 MNBHKGYCLBUIBC-UFYCRDLUSA-N 0.000 description 1
- PRLPSDIHSRITSF-UNQGMJICSA-N Arg-Phe-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PRLPSDIHSRITSF-UNQGMJICSA-N 0.000 description 1
- SLQQPJBDBVPVQV-JYJNAYRXSA-N Arg-Phe-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O SLQQPJBDBVPVQV-JYJNAYRXSA-N 0.000 description 1
- BSYKSCBTTQKOJG-GUBZILKMSA-N Arg-Pro-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O BSYKSCBTTQKOJG-GUBZILKMSA-N 0.000 description 1
- STHNZYKCJHWULY-AVGNSLFASA-N Arg-Pro-His Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCCN=C(N)N)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O STHNZYKCJHWULY-AVGNSLFASA-N 0.000 description 1
- NGYHSXDNNOFHNE-AVGNSLFASA-N Arg-Pro-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O NGYHSXDNNOFHNE-AVGNSLFASA-N 0.000 description 1
- YFHATWYGAAXQCF-JYJNAYRXSA-N Arg-Pro-Phe Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 YFHATWYGAAXQCF-JYJNAYRXSA-N 0.000 description 1
- YCYXHLZRUSJITQ-SRVKXCTJSA-N Arg-Pro-Pro Chemical compound NC(=N)NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 YCYXHLZRUSJITQ-SRVKXCTJSA-N 0.000 description 1
- FRBAHXABMQXSJQ-FXQIFTODSA-N Arg-Ser-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O FRBAHXABMQXSJQ-FXQIFTODSA-N 0.000 description 1
- FBXMCPLCVYUWBO-BPUTZDHNSA-N Arg-Ser-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N FBXMCPLCVYUWBO-BPUTZDHNSA-N 0.000 description 1
- LYJXHXGPWDTLKW-HJGDQZAQSA-N Arg-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O LYJXHXGPWDTLKW-HJGDQZAQSA-N 0.000 description 1
- YNSUUAOAFCVINY-OSUNSFLBSA-N Arg-Thr-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YNSUUAOAFCVINY-OSUNSFLBSA-N 0.000 description 1
- JKRPBTQDPJSQIT-RCWTZXSCSA-N Arg-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O JKRPBTQDPJSQIT-RCWTZXSCSA-N 0.000 description 1
- MOGMYRUNTKYZFB-UNQGMJICSA-N Arg-Thr-Phe Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MOGMYRUNTKYZFB-UNQGMJICSA-N 0.000 description 1
- INOIAEUXVVNJKA-XGEHTFHBSA-N Arg-Thr-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O INOIAEUXVVNJKA-XGEHTFHBSA-N 0.000 description 1
- POZKLUIXMHIULG-FDARSICLSA-N Arg-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCCN=C(N)N)N POZKLUIXMHIULG-FDARSICLSA-N 0.000 description 1
- LOVIQNMIPQVIGT-BVSLBCMMSA-N Arg-Trp-Phe Chemical compound C([C@H](NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)NC(=O)[C@H](CCCN=C(N)N)N)C(O)=O)C1=CC=CC=C1 LOVIQNMIPQVIGT-BVSLBCMMSA-N 0.000 description 1
- UVTGNSWSRSCPLP-UHFFFAOYSA-N Arg-Tyr Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O UVTGNSWSRSCPLP-UHFFFAOYSA-N 0.000 description 1
- BWMMKQPATDUYKB-IHRRRGAJSA-N Arg-Tyr-Asn Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=C(O)C=C1 BWMMKQPATDUYKB-IHRRRGAJSA-N 0.000 description 1
- QLSRIZIDQXDQHK-RCWTZXSCSA-N Arg-Val-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QLSRIZIDQXDQHK-RCWTZXSCSA-N 0.000 description 1
- 239000004475 Arginine Substances 0.000 description 1
- 200000000007 Arterial disease Diseases 0.000 description 1
- PFOYSEIHFVKHNF-FXQIFTODSA-N Asn-Ala-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PFOYSEIHFVKHNF-FXQIFTODSA-N 0.000 description 1
- NXVGBGZQQFDUTM-XVYDVKMFSA-N Asn-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)N)N NXVGBGZQQFDUTM-XVYDVKMFSA-N 0.000 description 1
- AKEBUSZTMQLNIX-UWJYBYFXSA-N Asn-Ala-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N AKEBUSZTMQLNIX-UWJYBYFXSA-N 0.000 description 1
- GMRGSBAMMMVDGG-GUBZILKMSA-N Asn-Arg-Arg Chemical compound C(C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)CN=C(N)N GMRGSBAMMMVDGG-GUBZILKMSA-N 0.000 description 1
- MFFOYNGMOYFPBD-DCAQKATOSA-N Asn-Arg-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O MFFOYNGMOYFPBD-DCAQKATOSA-N 0.000 description 1
- GOVUDFOGXOONFT-VEVYYDQMSA-N Asn-Arg-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GOVUDFOGXOONFT-VEVYYDQMSA-N 0.000 description 1
- LJUOLNXOWSWGKF-ACZMJKKPSA-N Asn-Asn-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N LJUOLNXOWSWGKF-ACZMJKKPSA-N 0.000 description 1
- BHQQRVARKXWXPP-ACZMJKKPSA-N Asn-Asp-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N BHQQRVARKXWXPP-ACZMJKKPSA-N 0.000 description 1
- JZRLLSOWDYUKOK-SRVKXCTJSA-N Asn-Asp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N JZRLLSOWDYUKOK-SRVKXCTJSA-N 0.000 description 1
- WPOLSNAQGVHROR-GUBZILKMSA-N Asn-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N WPOLSNAQGVHROR-GUBZILKMSA-N 0.000 description 1
- HCAUEJAQCXVQQM-ACZMJKKPSA-N Asn-Glu-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HCAUEJAQCXVQQM-ACZMJKKPSA-N 0.000 description 1
- BZMWJLLUAKSIMH-FXQIFTODSA-N Asn-Glu-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BZMWJLLUAKSIMH-FXQIFTODSA-N 0.000 description 1
- MSBDSTRUMZFSEU-PEFMBERDSA-N Asn-Glu-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MSBDSTRUMZFSEU-PEFMBERDSA-N 0.000 description 1
- JREOBWLIZLXRIS-GUBZILKMSA-N Asn-Glu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JREOBWLIZLXRIS-GUBZILKMSA-N 0.000 description 1
- BKDDABUWNKGZCK-XHNCKOQMSA-N Asn-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N)C(=O)O BKDDABUWNKGZCK-XHNCKOQMSA-N 0.000 description 1
- CTQIOCMSIJATNX-WHFBIAKZSA-N Asn-Gly-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O CTQIOCMSIJATNX-WHFBIAKZSA-N 0.000 description 1
- DXVMJJNAOVECBA-WHFBIAKZSA-N Asn-Gly-Asn Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O DXVMJJNAOVECBA-WHFBIAKZSA-N 0.000 description 1
- IICZCLFBILYRCU-WHFBIAKZSA-N Asn-Gly-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O IICZCLFBILYRCU-WHFBIAKZSA-N 0.000 description 1
- OPEPUCYIGFEGSW-WDSKDSINSA-N Asn-Gly-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OPEPUCYIGFEGSW-WDSKDSINSA-N 0.000 description 1
- PBSQFBAJKPLRJY-BYULHYEWSA-N Asn-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N PBSQFBAJKPLRJY-BYULHYEWSA-N 0.000 description 1
- RAQMSGVCGSJKCL-FOHZUACHSA-N Asn-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(N)=O RAQMSGVCGSJKCL-FOHZUACHSA-N 0.000 description 1
- QEQVUHQQYDZUEN-GUBZILKMSA-N Asn-His-Glu Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N QEQVUHQQYDZUEN-GUBZILKMSA-N 0.000 description 1
- GQRDIVQPSMPQME-ZPFDUUQYSA-N Asn-Ile-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O GQRDIVQPSMPQME-ZPFDUUQYSA-N 0.000 description 1
- SEKBHZJLARBNPB-GHCJXIJMSA-N Asn-Ile-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O SEKBHZJLARBNPB-GHCJXIJMSA-N 0.000 description 1
- HDHZCEDPLTVHFZ-GUBZILKMSA-N Asn-Leu-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O HDHZCEDPLTVHFZ-GUBZILKMSA-N 0.000 description 1
- MYCSPQIARXTUTP-SRVKXCTJSA-N Asn-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)N)N MYCSPQIARXTUTP-SRVKXCTJSA-N 0.000 description 1
- UBGGJTMETLEXJD-DCAQKATOSA-N Asn-Leu-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O UBGGJTMETLEXJD-DCAQKATOSA-N 0.000 description 1
- COWITDLVHMZSIW-CIUDSAMLSA-N Asn-Lys-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O COWITDLVHMZSIW-CIUDSAMLSA-N 0.000 description 1
- PLTGTJAZQRGMPP-FXQIFTODSA-N Asn-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(N)=O PLTGTJAZQRGMPP-FXQIFTODSA-N 0.000 description 1
- OSZBYGVKAFZWKC-FXQIFTODSA-N Asn-Pro-Cys Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CS)C(O)=O OSZBYGVKAFZWKC-FXQIFTODSA-N 0.000 description 1
- VLDRQOHCMKCXLY-SRVKXCTJSA-N Asn-Ser-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O VLDRQOHCMKCXLY-SRVKXCTJSA-N 0.000 description 1
- WLVLIYYBPPONRJ-GCJQMDKQSA-N Asn-Thr-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O WLVLIYYBPPONRJ-GCJQMDKQSA-N 0.000 description 1
- JXMREEPBRANWBY-VEVYYDQMSA-N Asn-Thr-Arg Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JXMREEPBRANWBY-VEVYYDQMSA-N 0.000 description 1
- MLJZMGIXXMTEPO-UBHSHLNASA-N Asn-Trp-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(O)=O MLJZMGIXXMTEPO-UBHSHLNASA-N 0.000 description 1
- XEGZSHSPQNDNRH-JRQIVUDYSA-N Asn-Tyr-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XEGZSHSPQNDNRH-JRQIVUDYSA-N 0.000 description 1
- CBWCQCANJSGUOH-ZKWXMUAHSA-N Asn-Val-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O CBWCQCANJSGUOH-ZKWXMUAHSA-N 0.000 description 1
- CBHVAFXKOYAHOY-NHCYSSNCSA-N Asn-Val-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O CBHVAFXKOYAHOY-NHCYSSNCSA-N 0.000 description 1
- WQAOZCVOOYUWKG-LSJOCFKGSA-N Asn-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CC(=O)N)N WQAOZCVOOYUWKG-LSJOCFKGSA-N 0.000 description 1
- CXBOKJPLEYUPGB-FXQIFTODSA-N Asp-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)O)N CXBOKJPLEYUPGB-FXQIFTODSA-N 0.000 description 1
- XPGVTUBABLRGHY-BIIVOSGPSA-N Asp-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N XPGVTUBABLRGHY-BIIVOSGPSA-N 0.000 description 1
- NJIKKGUVGUBICV-ZLUOBGJFSA-N Asp-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O NJIKKGUVGUBICV-ZLUOBGJFSA-N 0.000 description 1
- OERMIMJQPQUIPK-FXQIFTODSA-N Asp-Arg-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O OERMIMJQPQUIPK-FXQIFTODSA-N 0.000 description 1
- ZLGKHJHFYSRUBH-FXQIFTODSA-N Asp-Arg-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLGKHJHFYSRUBH-FXQIFTODSA-N 0.000 description 1
- SDHFVYLZFBDSQT-DCAQKATOSA-N Asp-Arg-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)O)N SDHFVYLZFBDSQT-DCAQKATOSA-N 0.000 description 1
- WKGJGVGTEZGFSW-FXQIFTODSA-N Asp-Asn-Met Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O WKGJGVGTEZGFSW-FXQIFTODSA-N 0.000 description 1
- XACXDSRQIXRMNS-OLHMAJIHSA-N Asp-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N)O XACXDSRQIXRMNS-OLHMAJIHSA-N 0.000 description 1
- SBHUBSDEZQFJHJ-CIUDSAMLSA-N Asp-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O SBHUBSDEZQFJHJ-CIUDSAMLSA-N 0.000 description 1
- FMWHSNJMHUNLAG-FXQIFTODSA-N Asp-Cys-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FMWHSNJMHUNLAG-FXQIFTODSA-N 0.000 description 1
- KVPHTGVUMJGMCX-BIIVOSGPSA-N Asp-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CC(=O)O)N)C(=O)O KVPHTGVUMJGMCX-BIIVOSGPSA-N 0.000 description 1
- NYQHSUGFEWDWPD-ACZMJKKPSA-N Asp-Gln-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N NYQHSUGFEWDWPD-ACZMJKKPSA-N 0.000 description 1
- RSMIHCFQDCVVBR-CIUDSAMLSA-N Asp-Gln-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCNC(N)=N RSMIHCFQDCVVBR-CIUDSAMLSA-N 0.000 description 1
- LJRPYAZQQWHEEV-FXQIFTODSA-N Asp-Gln-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O LJRPYAZQQWHEEV-FXQIFTODSA-N 0.000 description 1
- RYKWOUUZJFSJOH-FXQIFTODSA-N Asp-Gln-Glu Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N RYKWOUUZJFSJOH-FXQIFTODSA-N 0.000 description 1
- PDECQIHABNQRHN-GUBZILKMSA-N Asp-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(O)=O PDECQIHABNQRHN-GUBZILKMSA-N 0.000 description 1
- OVPHVTCDVYYTHN-AVGNSLFASA-N Asp-Glu-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OVPHVTCDVYYTHN-AVGNSLFASA-N 0.000 description 1
- RRKCPMGSRIDLNC-AVGNSLFASA-N Asp-Glu-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RRKCPMGSRIDLNC-AVGNSLFASA-N 0.000 description 1
- YNCHFVRXEQFPBY-BQBZGAKWSA-N Asp-Gly-Arg Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N YNCHFVRXEQFPBY-BQBZGAKWSA-N 0.000 description 1
- WBDWQKRLTVCDSY-WHFBIAKZSA-N Asp-Gly-Asp Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O WBDWQKRLTVCDSY-WHFBIAKZSA-N 0.000 description 1
- WSXDIZFNQYTUJB-SRVKXCTJSA-N Asp-His-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O WSXDIZFNQYTUJB-SRVKXCTJSA-N 0.000 description 1
- ICZWAZVKLACMKR-CIUDSAMLSA-N Asp-His-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CN=CN1 ICZWAZVKLACMKR-CIUDSAMLSA-N 0.000 description 1
- KTTCQQNRRLCIBC-GHCJXIJMSA-N Asp-Ile-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O KTTCQQNRRLCIBC-GHCJXIJMSA-N 0.000 description 1
- SPWXXPFDTMYTRI-IUKAMOBKSA-N Asp-Ile-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SPWXXPFDTMYTRI-IUKAMOBKSA-N 0.000 description 1
- CLUMZOKVGUWUFD-CIUDSAMLSA-N Asp-Leu-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O CLUMZOKVGUWUFD-CIUDSAMLSA-N 0.000 description 1
- AYFVRYXNDHBECD-YUMQZZPRSA-N Asp-Leu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O AYFVRYXNDHBECD-YUMQZZPRSA-N 0.000 description 1
- UJGRZQYSNYTCAX-SRVKXCTJSA-N Asp-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UJGRZQYSNYTCAX-SRVKXCTJSA-N 0.000 description 1
- CJUKAWUWBZCTDQ-SRVKXCTJSA-N Asp-Leu-Lys Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O CJUKAWUWBZCTDQ-SRVKXCTJSA-N 0.000 description 1
- IVPNEDNYYYFAGI-GARJFASQSA-N Asp-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N IVPNEDNYYYFAGI-GARJFASQSA-N 0.000 description 1
- MYOHQBFRJQFIDZ-KKUMJFAQSA-N Asp-Leu-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MYOHQBFRJQFIDZ-KKUMJFAQSA-N 0.000 description 1
- YVHGKXAOSVBGJV-CIUDSAMLSA-N Asp-Lys-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N YVHGKXAOSVBGJV-CIUDSAMLSA-N 0.000 description 1
- LBOVBQONZJRWPV-YUMQZZPRSA-N Asp-Lys-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O LBOVBQONZJRWPV-YUMQZZPRSA-N 0.000 description 1
- GKWFMNNNYZHJHV-SRVKXCTJSA-N Asp-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC(O)=O GKWFMNNNYZHJHV-SRVKXCTJSA-N 0.000 description 1
- NZWDWXSWUQCNMG-GARJFASQSA-N Asp-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N)C(=O)O NZWDWXSWUQCNMG-GARJFASQSA-N 0.000 description 1
- HXVILZUZXFLVEN-DCAQKATOSA-N Asp-Met-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O HXVILZUZXFLVEN-DCAQKATOSA-N 0.000 description 1
- IOXWDLNHXZOXQP-FXQIFTODSA-N Asp-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N IOXWDLNHXZOXQP-FXQIFTODSA-N 0.000 description 1
- UCHSVZYJKJLPHF-BZSNNMDCSA-N Asp-Phe-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O UCHSVZYJKJLPHF-BZSNNMDCSA-N 0.000 description 1
- KPSHWSWFPUDEGF-FXQIFTODSA-N Asp-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(O)=O KPSHWSWFPUDEGF-FXQIFTODSA-N 0.000 description 1
- MVRGBQGZSDJBSM-GMOBBJLQSA-N Asp-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(=O)O)N MVRGBQGZSDJBSM-GMOBBJLQSA-N 0.000 description 1
- XUVTWGPERWIERB-IHRRRGAJSA-N Asp-Pro-Phe Chemical compound N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccccc1)C(O)=O XUVTWGPERWIERB-IHRRRGAJSA-N 0.000 description 1
- DINOVZWPTMGSRF-QXEWZRGKSA-N Asp-Pro-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O DINOVZWPTMGSRF-QXEWZRGKSA-N 0.000 description 1
- ZBYLEBZCVKLPCY-FXQIFTODSA-N Asp-Ser-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ZBYLEBZCVKLPCY-FXQIFTODSA-N 0.000 description 1
- VNXQRBXEQXLERQ-CIUDSAMLSA-N Asp-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N VNXQRBXEQXLERQ-CIUDSAMLSA-N 0.000 description 1
- MGSVBZIBCCKGCY-ZLUOBGJFSA-N Asp-Ser-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MGSVBZIBCCKGCY-ZLUOBGJFSA-N 0.000 description 1
- MJJIHRWNWSQTOI-VEVYYDQMSA-N Asp-Thr-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O MJJIHRWNWSQTOI-VEVYYDQMSA-N 0.000 description 1
- GWWSUMLEWKQHLR-NUMRIWBASA-N Asp-Thr-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O GWWSUMLEWKQHLR-NUMRIWBASA-N 0.000 description 1
- GXHDGYOXPNQCKM-XVSYOHENSA-N Asp-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O GXHDGYOXPNQCKM-XVSYOHENSA-N 0.000 description 1
- GCACQYDBDHRVGE-LKXGYXEUSA-N Asp-Thr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC(O)=O GCACQYDBDHRVGE-LKXGYXEUSA-N 0.000 description 1
- YUELDQUPTAYEGM-XIRDDKMYSA-N Asp-Trp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC(=O)O)N YUELDQUPTAYEGM-XIRDDKMYSA-N 0.000 description 1
- LEYKQPDPZJIRTA-AQZXSJQPSA-N Asp-Trp-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LEYKQPDPZJIRTA-AQZXSJQPSA-N 0.000 description 1
- USENATHVGFXRNO-SRVKXCTJSA-N Asp-Tyr-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CC=C(O)C=C1 USENATHVGFXRNO-SRVKXCTJSA-N 0.000 description 1
- GFYOIYJJMSHLSN-QXEWZRGKSA-N Asp-Val-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O GFYOIYJJMSHLSN-QXEWZRGKSA-N 0.000 description 1
- WAEDSQFVZJUHLI-BYULHYEWSA-N Asp-Val-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O WAEDSQFVZJUHLI-BYULHYEWSA-N 0.000 description 1
- GIKOVDMXBAFXDF-NHCYSSNCSA-N Asp-Val-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GIKOVDMXBAFXDF-NHCYSSNCSA-N 0.000 description 1
- QPDUWAUSSWGJSB-NGZCFLSTSA-N Asp-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N QPDUWAUSSWGJSB-NGZCFLSTSA-N 0.000 description 1
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 description 1
- 241000416162 Astragalus gummifer Species 0.000 description 1
- 206010003591 Ataxia Diseases 0.000 description 1
- 206010003594 Ataxia telangiectasia Diseases 0.000 description 1
- 208000037260 Atherosclerotic Plaque Diseases 0.000 description 1
- 241000201370 Autographa californica nucleopolyhedrovirus Species 0.000 description 1
- 108090001008 Avidin Proteins 0.000 description 1
- WOVKYSAHUYNSMH-UHFFFAOYSA-N BROMODEOXYURIDINE Natural products C1C(O)C(CO)OC1N1C(=O)NC(=O)C(Br)=C1 WOVKYSAHUYNSMH-UHFFFAOYSA-N 0.000 description 1
- OGBVRMYSNSKIEF-UHFFFAOYSA-N Benzylphosphonic acid Chemical class OP(O)(=O)CC1=CC=CC=C1 OGBVRMYSNSKIEF-UHFFFAOYSA-N 0.000 description 1
- 101000800130 Bos taurus Thyroglobulin Proteins 0.000 description 1
- 108091003079 Bovine Serum Albumin Proteins 0.000 description 1
- 102000002110 C2 domains Human genes 0.000 description 1
- 108050009459 C2 domains Proteins 0.000 description 1
- 102000005701 Calcium-Binding Proteins Human genes 0.000 description 1
- 108010045403 Calcium-Binding Proteins Proteins 0.000 description 1
- 241000173351 Camvirus Species 0.000 description 1
- 102000014914 Carrier Proteins Human genes 0.000 description 1
- 108010078791 Carrier Proteins Proteins 0.000 description 1
- 241000701489 Cauliflower mosaic virus Species 0.000 description 1
- 241000282693 Cercopithecidae Species 0.000 description 1
- 108010012236 Chemokines Proteins 0.000 description 1
- 102000019034 Chemokines Human genes 0.000 description 1
- 239000005496 Chlorsulfuron Substances 0.000 description 1
- 206010008631 Cholera Diseases 0.000 description 1
- 108010004942 Chylomicron Remnants Proteins 0.000 description 1
- 108010004103 Chylomicrons Proteins 0.000 description 1
- 108010035532 Collagen Proteins 0.000 description 1
- 102000008186 Collagen Human genes 0.000 description 1
- 208000035473 Communicable disease Diseases 0.000 description 1
- 241000186216 Corynebacterium Species 0.000 description 1
- 241000192700 Cyanobacteria Species 0.000 description 1
- TVYMKYUSZSVOAG-ZLUOBGJFSA-N Cys-Ala-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O TVYMKYUSZSVOAG-ZLUOBGJFSA-N 0.000 description 1
- PRVVCRZLTJNPCS-FXQIFTODSA-N Cys-Arg-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CS)N)CN=C(N)N PRVVCRZLTJNPCS-FXQIFTODSA-N 0.000 description 1
- GMXSSZUVDNPRMA-FXQIFTODSA-N Cys-Arg-Asp Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O GMXSSZUVDNPRMA-FXQIFTODSA-N 0.000 description 1
- BYALSSDCQYHKMY-XGEHTFHBSA-N Cys-Arg-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CS)N)O BYALSSDCQYHKMY-XGEHTFHBSA-N 0.000 description 1
- SFUUYRSAJPWTGO-SRVKXCTJSA-N Cys-Asn-Phe Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SFUUYRSAJPWTGO-SRVKXCTJSA-N 0.000 description 1
- GSNRZJNHMVMOFV-ACZMJKKPSA-N Cys-Asp-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N GSNRZJNHMVMOFV-ACZMJKKPSA-N 0.000 description 1
- YZFCGHIBLBDZDA-ZLUOBGJFSA-N Cys-Asp-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O YZFCGHIBLBDZDA-ZLUOBGJFSA-N 0.000 description 1
- VIRYODQIWJNWNU-NRPADANISA-N Cys-Glu-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CS)N VIRYODQIWJNWNU-NRPADANISA-N 0.000 description 1
- SBDVXRYCOIEYNV-YUMQZZPRSA-N Cys-His-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CS)N SBDVXRYCOIEYNV-YUMQZZPRSA-N 0.000 description 1
- ZMWOJVAXTOUHAP-ZKWXMUAHSA-N Cys-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CS)N ZMWOJVAXTOUHAP-ZKWXMUAHSA-N 0.000 description 1
- PRHGYQOSEHLDRW-VGDYDELISA-N Cys-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CS)N PRHGYQOSEHLDRW-VGDYDELISA-N 0.000 description 1
- ODDOYXKAHLKKQY-MMWGEVLESA-N Cys-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CS)N ODDOYXKAHLKKQY-MMWGEVLESA-N 0.000 description 1
- MRVSLWQRNWEROS-SVSWQMSJSA-N Cys-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CS)N MRVSLWQRNWEROS-SVSWQMSJSA-N 0.000 description 1
- IZUNQDRIAOLWCN-YUMQZZPRSA-N Cys-Leu-Gly Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CS)N IZUNQDRIAOLWCN-YUMQZZPRSA-N 0.000 description 1
- WVLZTXGTNGHPBO-SRVKXCTJSA-N Cys-Leu-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O WVLZTXGTNGHPBO-SRVKXCTJSA-N 0.000 description 1
- HBHMVBGGHDMPBF-GARJFASQSA-N Cys-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CS)N HBHMVBGGHDMPBF-GARJFASQSA-N 0.000 description 1
- IDFVDSBJNMPBSX-SRVKXCTJSA-N Cys-Lys-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O IDFVDSBJNMPBSX-SRVKXCTJSA-N 0.000 description 1
- KGIHMGPYGXBYJJ-SRVKXCTJSA-N Cys-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CS KGIHMGPYGXBYJJ-SRVKXCTJSA-N 0.000 description 1
- RESAHOSBQHMOKH-KKUMJFAQSA-N Cys-Phe-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CS)N RESAHOSBQHMOKH-KKUMJFAQSA-N 0.000 description 1
- KJJASVYBTKRYSN-FXQIFTODSA-N Cys-Pro-Asp Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CS)N)C(=O)N[C@@H](CC(=O)O)C(=O)O KJJASVYBTKRYSN-FXQIFTODSA-N 0.000 description 1
- KSMSFCBQBQPFAD-GUBZILKMSA-N Cys-Pro-Pro Chemical compound SC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 KSMSFCBQBQPFAD-GUBZILKMSA-N 0.000 description 1
- BCFXQBXXDSEHRS-FXQIFTODSA-N Cys-Ser-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O BCFXQBXXDSEHRS-FXQIFTODSA-N 0.000 description 1
- YNJBLTDKTMKEET-ZLUOBGJFSA-N Cys-Ser-Ser Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O YNJBLTDKTMKEET-ZLUOBGJFSA-N 0.000 description 1
- MWVDDZUTWXFYHL-XKBZYTNZSA-N Cys-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CS)N)O MWVDDZUTWXFYHL-XKBZYTNZSA-N 0.000 description 1
- JLZCAZJGWNRXCI-XKBZYTNZSA-N Cys-Thr-Glu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O JLZCAZJGWNRXCI-XKBZYTNZSA-N 0.000 description 1
- SAEVTQWAYDPXMU-KATARQTJSA-N Cys-Thr-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O SAEVTQWAYDPXMU-KATARQTJSA-N 0.000 description 1
- JIZRUFJGHPIYPS-SRVKXCTJSA-N Cys-Tyr-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CS)N)O JIZRUFJGHPIYPS-SRVKXCTJSA-N 0.000 description 1
- NGOIQDYZMIKCOK-NAKRPEOUSA-N Cys-Val-Ile Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NGOIQDYZMIKCOK-NAKRPEOUSA-N 0.000 description 1
- KZZYVYWSXMFYEC-DCAQKATOSA-N Cys-Val-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KZZYVYWSXMFYEC-DCAQKATOSA-N 0.000 description 1
- QQAYIVHVRFJICE-AEJSXWLSSA-N Cys-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CS)N QQAYIVHVRFJICE-AEJSXWLSSA-N 0.000 description 1
- ATFSDBMHRCDLBV-BPUTZDHNSA-N Cys-Val-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CS)N ATFSDBMHRCDLBV-BPUTZDHNSA-N 0.000 description 1
- UHDGCWIWMRVCDJ-PSQAKQOGSA-N Cytidine Natural products O=C1N=C(N)C=CN1[C@@H]1[C@@H](O)[C@@H](O)[C@H](CO)O1 UHDGCWIWMRVCDJ-PSQAKQOGSA-N 0.000 description 1
- 108090000695 Cytokines Proteins 0.000 description 1
- 102000004127 Cytokines Human genes 0.000 description 1
- 241000701022 Cytomegalovirus Species 0.000 description 1
- IGXWBGJHJZYPQS-SSDOTTSWSA-N D-Luciferin Chemical compound OC(=O)[C@H]1CSC(C=2SC3=CC=C(O)C=C3N=2)=N1 IGXWBGJHJZYPQS-SSDOTTSWSA-N 0.000 description 1
- 101150074155 DHFR gene Proteins 0.000 description 1
- 108010017826 DNA Polymerase I Proteins 0.000 description 1
- 102000004594 DNA Polymerase I Human genes 0.000 description 1
- 239000003155 DNA primer Substances 0.000 description 1
- 230000006820 DNA synthesis Effects 0.000 description 1
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 1
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 1
- CYCGRDQQIOGCKX-UHFFFAOYSA-N Dehydro-luciferin Natural products OC(=O)C1=CSC(C=2SC3=CC(O)=CC=C3N=2)=N1 CYCGRDQQIOGCKX-UHFFFAOYSA-N 0.000 description 1
- 108010054576 Deoxyribonuclease EcoRI Proteins 0.000 description 1
- 108010053770 Deoxyribonucleases Proteins 0.000 description 1
- 102000016911 Deoxyribonucleases Human genes 0.000 description 1
- IAZDPXIOMUYVGZ-UHFFFAOYSA-N Dimethylsulphoxide Chemical compound CS(C)=O IAZDPXIOMUYVGZ-UHFFFAOYSA-N 0.000 description 1
- LVGKNOAMLMIIKO-UHFFFAOYSA-N Elaidinsaeure-aethylester Natural products CCCCCCCCC=CCCCCCCCC(=O)OCC LVGKNOAMLMIIKO-UHFFFAOYSA-N 0.000 description 1
- 102000010911 Enzyme Precursors Human genes 0.000 description 1
- 108010062466 Enzyme Precursors Proteins 0.000 description 1
- 241000283086 Equidae Species 0.000 description 1
- 241000283073 Equus caballus Species 0.000 description 1
- 241000206602 Eukaryota Species 0.000 description 1
- 108091029865 Exogenous DNA Proteins 0.000 description 1
- 108700024394 Exon Proteins 0.000 description 1
- 108060002716 Exonuclease Proteins 0.000 description 1
- 108050001049 Extracellular proteins Proteins 0.000 description 1
- 241000282326 Felis catus Species 0.000 description 1
- BJGNCJDXODQBOB-UHFFFAOYSA-N Fivefly Luciferin Natural products OC(=O)C1CSC(C=2SC3=CC(O)=CC=C3N=2)=N1 BJGNCJDXODQBOB-UHFFFAOYSA-N 0.000 description 1
- 241000233866 Fungi Species 0.000 description 1
- 102000003688 G-Protein-Coupled Receptors Human genes 0.000 description 1
- 108090000045 G-Protein-Coupled Receptors Proteins 0.000 description 1
- RZSLYUUFFVHFRQ-FXQIFTODSA-N Gln-Ala-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O RZSLYUUFFVHFRQ-FXQIFTODSA-N 0.000 description 1
- SHERTACNJPYHAR-ACZMJKKPSA-N Gln-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O SHERTACNJPYHAR-ACZMJKKPSA-N 0.000 description 1
- RGRMOYQUIJVQQD-SRVKXCTJSA-N Gln-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)N)N RGRMOYQUIJVQQD-SRVKXCTJSA-N 0.000 description 1
- CKNUKHBRCSMKMO-XHNCKOQMSA-N Gln-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N)C(=O)O CKNUKHBRCSMKMO-XHNCKOQMSA-N 0.000 description 1
- CRRFJBGUGNNOCS-PEFMBERDSA-N Gln-Asp-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CRRFJBGUGNNOCS-PEFMBERDSA-N 0.000 description 1
- IXFVOPOHSRKJNG-LAEOZQHASA-N Gln-Asp-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IXFVOPOHSRKJNG-LAEOZQHASA-N 0.000 description 1
- ZDJZEGYVKANKED-NRPADANISA-N Gln-Cys-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O ZDJZEGYVKANKED-NRPADANISA-N 0.000 description 1
- PKVWNYGXMNWJSI-CIUDSAMLSA-N Gln-Gln-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O PKVWNYGXMNWJSI-CIUDSAMLSA-N 0.000 description 1
- UFNSPPFJOHNXRE-AUTRQRHGSA-N Gln-Gln-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O UFNSPPFJOHNXRE-AUTRQRHGSA-N 0.000 description 1
- KCJJFESQRXGTGC-BQBZGAKWSA-N Gln-Glu-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O KCJJFESQRXGTGC-BQBZGAKWSA-N 0.000 description 1
- PNENQZWRFMUZOM-DCAQKATOSA-N Gln-Glu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O PNENQZWRFMUZOM-DCAQKATOSA-N 0.000 description 1
- VOLVNCMGXWDDQY-LPEHRKFASA-N Gln-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N)C(=O)O VOLVNCMGXWDDQY-LPEHRKFASA-N 0.000 description 1
- DQPOBSRQNWOBNA-GUBZILKMSA-N Gln-His-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(O)=O DQPOBSRQNWOBNA-GUBZILKMSA-N 0.000 description 1
- LKVCNGLNTAPMSZ-JYJNAYRXSA-N Gln-His-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CCC(=O)N)N LKVCNGLNTAPMSZ-JYJNAYRXSA-N 0.000 description 1
- OOLCSQQPSLIETN-JYJNAYRXSA-N Gln-His-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CCC(=O)N)N)O OOLCSQQPSLIETN-JYJNAYRXSA-N 0.000 description 1
- XWIBVSAEUCAAKF-GVXVVHGQSA-N Gln-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)N)N XWIBVSAEUCAAKF-GVXVVHGQSA-N 0.000 description 1
- KHGGWBRVRPHFMH-PEFMBERDSA-N Gln-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N KHGGWBRVRPHFMH-PEFMBERDSA-N 0.000 description 1
- RGAOLBZBLOJUTP-GRLWGSQLSA-N Gln-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CCC(=O)N)N RGAOLBZBLOJUTP-GRLWGSQLSA-N 0.000 description 1
- MTCXQQINVAFZKW-MNXVOIDGSA-N Gln-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MTCXQQINVAFZKW-MNXVOIDGSA-N 0.000 description 1
- JKGHMESJHRTHIC-SIUGBPQLSA-N Gln-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JKGHMESJHRTHIC-SIUGBPQLSA-N 0.000 description 1
- HYPVLWGNBIYTNA-GUBZILKMSA-N Gln-Leu-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HYPVLWGNBIYTNA-GUBZILKMSA-N 0.000 description 1
- LGIKBBLQVSWUGK-DCAQKATOSA-N Gln-Leu-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O LGIKBBLQVSWUGK-DCAQKATOSA-N 0.000 description 1
- IULKWYSYZSURJK-AVGNSLFASA-N Gln-Leu-Lys Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O IULKWYSYZSURJK-AVGNSLFASA-N 0.000 description 1
- IOFDDSNZJDIGPB-GVXVVHGQSA-N Gln-Leu-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IOFDDSNZJDIGPB-GVXVVHGQSA-N 0.000 description 1
- UWKPRVKWEKEMSY-DCAQKATOSA-N Gln-Lys-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O UWKPRVKWEKEMSY-DCAQKATOSA-N 0.000 description 1
- KSKFIECUYMYWNS-AVGNSLFASA-N Gln-Lys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)N)N KSKFIECUYMYWNS-AVGNSLFASA-N 0.000 description 1
- ZEEPYMXTJWIMSN-GUBZILKMSA-N Gln-Lys-Ser Chemical compound NCCCC[C@@H](C(=O)N[C@@H](CO)C(O)=O)NC(=O)[C@@H](N)CCC(N)=O ZEEPYMXTJWIMSN-GUBZILKMSA-N 0.000 description 1
- CELXWPDNIGWCJN-WDCWCFNPSA-N Gln-Lys-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CELXWPDNIGWCJN-WDCWCFNPSA-N 0.000 description 1
- DQLVHRFFBQOWFL-JYJNAYRXSA-N Gln-Lys-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)N)N)O DQLVHRFFBQOWFL-JYJNAYRXSA-N 0.000 description 1
- FALJZCPMTGJOHX-SRVKXCTJSA-N Gln-Met-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O FALJZCPMTGJOHX-SRVKXCTJSA-N 0.000 description 1
- SFAFZYYMAWOCIC-KKUMJFAQSA-N Gln-Phe-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N SFAFZYYMAWOCIC-KKUMJFAQSA-N 0.000 description 1
- AQPZYBSRDRZBAG-AVGNSLFASA-N Gln-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N AQPZYBSRDRZBAG-AVGNSLFASA-N 0.000 description 1
- HHRAEXBUNGTOGZ-IHRRRGAJSA-N Gln-Phe-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O HHRAEXBUNGTOGZ-IHRRRGAJSA-N 0.000 description 1
- XZUUUKNKNWVPHQ-JYJNAYRXSA-N Gln-Phe-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O XZUUUKNKNWVPHQ-JYJNAYRXSA-N 0.000 description 1
- XUMFMAVDHQDATI-DCAQKATOSA-N Gln-Pro-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O XUMFMAVDHQDATI-DCAQKATOSA-N 0.000 description 1
- UWMDGPFFTKDUIY-HJGDQZAQSA-N Gln-Pro-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O UWMDGPFFTKDUIY-HJGDQZAQSA-N 0.000 description 1
- NYCVMJGIJYQWDO-CIUDSAMLSA-N Gln-Ser-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NYCVMJGIJYQWDO-CIUDSAMLSA-N 0.000 description 1
- OSCLNNWLKKIQJM-WDSKDSINSA-N Gln-Ser-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O OSCLNNWLKKIQJM-WDSKDSINSA-N 0.000 description 1
- SYZZMPFLOLSMHL-XHNCKOQMSA-N Gln-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N)C(=O)O SYZZMPFLOLSMHL-XHNCKOQMSA-N 0.000 description 1
- BYKZWDGMJLNFJY-XKBZYTNZSA-N Gln-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N)O BYKZWDGMJLNFJY-XKBZYTNZSA-N 0.000 description 1
- DUGYCMAIAKAQPB-GLLZPBPUSA-N Gln-Thr-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O DUGYCMAIAKAQPB-GLLZPBPUSA-N 0.000 description 1
- NHMRJKKAVMENKJ-WDCWCFNPSA-N Gln-Thr-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NHMRJKKAVMENKJ-WDCWCFNPSA-N 0.000 description 1
- CTJRFALAOYAJBX-NWLDYVSISA-N Gln-Trp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCC(=O)N)N)O CTJRFALAOYAJBX-NWLDYVSISA-N 0.000 description 1
- BJVBMSTUUWGZKX-JYJNAYRXSA-N Gln-Tyr-His Chemical compound N[C@@H](CCC(N)=O)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O BJVBMSTUUWGZKX-JYJNAYRXSA-N 0.000 description 1
- UGEZSPWLJABDAR-KKUMJFAQSA-N Gln-Tyr-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCC(=O)N)N UGEZSPWLJABDAR-KKUMJFAQSA-N 0.000 description 1
- QXQDADBVIBLBHN-FHWLQOOXSA-N Gln-Tyr-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QXQDADBVIBLBHN-FHWLQOOXSA-N 0.000 description 1
- BBFCMGBMYIAGRS-AUTRQRHGSA-N Gln-Val-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O BBFCMGBMYIAGRS-AUTRQRHGSA-N 0.000 description 1
- VEYGCDYMOXHJLS-GVXVVHGQSA-N Gln-Val-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VEYGCDYMOXHJLS-GVXVVHGQSA-N 0.000 description 1
- MKRDNSWGJWTBKZ-GVXVVHGQSA-N Gln-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MKRDNSWGJWTBKZ-GVXVVHGQSA-N 0.000 description 1
- SZXSSXUNOALWCH-ACZMJKKPSA-N Glu-Ala-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O SZXSSXUNOALWCH-ACZMJKKPSA-N 0.000 description 1
- UTKUTMJSWKKHEM-WDSKDSINSA-N Glu-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O UTKUTMJSWKKHEM-WDSKDSINSA-N 0.000 description 1
- ITYRYNUZHPNCIK-GUBZILKMSA-N Glu-Ala-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O ITYRYNUZHPNCIK-GUBZILKMSA-N 0.000 description 1
- JJKKWYQVHRUSDG-GUBZILKMSA-N Glu-Ala-Lys Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O JJKKWYQVHRUSDG-GUBZILKMSA-N 0.000 description 1
- MXOODARRORARSU-ACZMJKKPSA-N Glu-Ala-Ser Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N MXOODARRORARSU-ACZMJKKPSA-N 0.000 description 1
- WOMUDRVDJMHTCV-DCAQKATOSA-N Glu-Arg-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WOMUDRVDJMHTCV-DCAQKATOSA-N 0.000 description 1
- CVPXINNKRTZBMO-CIUDSAMLSA-N Glu-Arg-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)CN=C(N)N CVPXINNKRTZBMO-CIUDSAMLSA-N 0.000 description 1
- KEBACWCLVOXFNC-DCAQKATOSA-N Glu-Arg-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O KEBACWCLVOXFNC-DCAQKATOSA-N 0.000 description 1
- VPKBCVUDBNINAH-GARJFASQSA-N Glu-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)O)N)C(=O)O VPKBCVUDBNINAH-GARJFASQSA-N 0.000 description 1
- WOSRKEJQESVHGA-CIUDSAMLSA-N Glu-Arg-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O WOSRKEJQESVHGA-CIUDSAMLSA-N 0.000 description 1
- DYFJZDDQPNIPAB-NHCYSSNCSA-N Glu-Arg-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O DYFJZDDQPNIPAB-NHCYSSNCSA-N 0.000 description 1
- AKJRHDMTEJXTPV-ACZMJKKPSA-N Glu-Asn-Ala Chemical compound C[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O AKJRHDMTEJXTPV-ACZMJKKPSA-N 0.000 description 1
- SVZIKUHLRKVZIF-GUBZILKMSA-N Glu-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N SVZIKUHLRKVZIF-GUBZILKMSA-N 0.000 description 1
- AFODTOLGSZQDSL-PEFMBERDSA-N Glu-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N AFODTOLGSZQDSL-PEFMBERDSA-N 0.000 description 1
- LXAUHIRMWXQRKI-XHNCKOQMSA-N Glu-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N)C(=O)O LXAUHIRMWXQRKI-XHNCKOQMSA-N 0.000 description 1
- VAZZOGXDUQSVQF-NUMRIWBASA-N Glu-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N)O VAZZOGXDUQSVQF-NUMRIWBASA-N 0.000 description 1
- PCBBLFVHTYNQGG-LAEOZQHASA-N Glu-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N PCBBLFVHTYNQGG-LAEOZQHASA-N 0.000 description 1
- JRCUFCXYZLPSDZ-ACZMJKKPSA-N Glu-Asp-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O JRCUFCXYZLPSDZ-ACZMJKKPSA-N 0.000 description 1
- SAEBUDRWKUXLOM-ACZMJKKPSA-N Glu-Cys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CCC(O)=O SAEBUDRWKUXLOM-ACZMJKKPSA-N 0.000 description 1
- FLQAKQOBSPFGKG-CIUDSAMLSA-N Glu-Cys-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FLQAKQOBSPFGKG-CIUDSAMLSA-N 0.000 description 1
- RQNYYRHRKSVKAB-GUBZILKMSA-N Glu-Cys-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O RQNYYRHRKSVKAB-GUBZILKMSA-N 0.000 description 1
- GFLQTABMFBXRIY-GUBZILKMSA-N Glu-Gln-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GFLQTABMFBXRIY-GUBZILKMSA-N 0.000 description 1
- UMIRPYLZFKOEOH-YVNDNENWSA-N Glu-Gln-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UMIRPYLZFKOEOH-YVNDNENWSA-N 0.000 description 1
- PVBBEKPHARMPHX-DCAQKATOSA-N Glu-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCC(O)=O PVBBEKPHARMPHX-DCAQKATOSA-N 0.000 description 1
- HUFCEIHAFNVSNR-IHRRRGAJSA-N Glu-Gln-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HUFCEIHAFNVSNR-IHRRRGAJSA-N 0.000 description 1
- NKLRYVLERDYDBI-FXQIFTODSA-N Glu-Glu-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NKLRYVLERDYDBI-FXQIFTODSA-N 0.000 description 1
- NUSWUSKZRCGFEX-FXQIFTODSA-N Glu-Glu-Cys Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(O)=O NUSWUSKZRCGFEX-FXQIFTODSA-N 0.000 description 1
- BUZMZDDKFCSKOT-CIUDSAMLSA-N Glu-Glu-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BUZMZDDKFCSKOT-CIUDSAMLSA-N 0.000 description 1
- APHGWLWMOXGZRL-DCAQKATOSA-N Glu-Glu-His Chemical compound N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O APHGWLWMOXGZRL-DCAQKATOSA-N 0.000 description 1
- AIGROOHQXCACHL-WDSKDSINSA-N Glu-Gly-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O AIGROOHQXCACHL-WDSKDSINSA-N 0.000 description 1
- UHVIQGKBMXEVGN-WDSKDSINSA-N Glu-Gly-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O UHVIQGKBMXEVGN-WDSKDSINSA-N 0.000 description 1
- MTAOBYXRYJZRGQ-WDSKDSINSA-N Glu-Gly-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MTAOBYXRYJZRGQ-WDSKDSINSA-N 0.000 description 1
- OGNJZUXUTPQVBR-BQBZGAKWSA-N Glu-Gly-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OGNJZUXUTPQVBR-BQBZGAKWSA-N 0.000 description 1
- CUXJIASLBRJOFV-LAEOZQHASA-N Glu-Gly-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CUXJIASLBRJOFV-LAEOZQHASA-N 0.000 description 1
- KRGZZKWSBGPLKL-IUCAKERBSA-N Glu-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)O)N KRGZZKWSBGPLKL-IUCAKERBSA-N 0.000 description 1
- DRLVXRQFROIYTD-GUBZILKMSA-N Glu-His-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N DRLVXRQFROIYTD-GUBZILKMSA-N 0.000 description 1
- COSBSYQVPSODFX-GUBZILKMSA-N Glu-His-Cys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N COSBSYQVPSODFX-GUBZILKMSA-N 0.000 description 1
- ZMVCLTGPGWJAEE-JYJNAYRXSA-N Glu-His-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CCC(=O)O)N)O ZMVCLTGPGWJAEE-JYJNAYRXSA-N 0.000 description 1
- ZPASCJBSSCRWMC-GVXVVHGQSA-N Glu-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N ZPASCJBSSCRWMC-GVXVVHGQSA-N 0.000 description 1
- QXDXIXFSFHUYAX-MNXVOIDGSA-N Glu-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O QXDXIXFSFHUYAX-MNXVOIDGSA-N 0.000 description 1
- XTZDZAXYPDISRR-MNXVOIDGSA-N Glu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N XTZDZAXYPDISRR-MNXVOIDGSA-N 0.000 description 1
- ZHNHJYYFCGUZNQ-KBIXCLLPSA-N Glu-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O ZHNHJYYFCGUZNQ-KBIXCLLPSA-N 0.000 description 1
- WNRZUESNGGDCJX-JYJNAYRXSA-N Glu-Leu-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WNRZUESNGGDCJX-JYJNAYRXSA-N 0.000 description 1
- FBEJIDRSQCGFJI-GUBZILKMSA-N Glu-Leu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FBEJIDRSQCGFJI-GUBZILKMSA-N 0.000 description 1
- NJCALAAIGREHDR-WDCWCFNPSA-N Glu-Leu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NJCALAAIGREHDR-WDCWCFNPSA-N 0.000 description 1
- JJSVALISDCNFCU-SZMVWBNQSA-N Glu-Leu-Trp Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O JJSVALISDCNFCU-SZMVWBNQSA-N 0.000 description 1
- SJJHXJDSNQJMMW-SRVKXCTJSA-N Glu-Lys-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O SJJHXJDSNQJMMW-SRVKXCTJSA-N 0.000 description 1
- SWRVAQHFBRZVNX-GUBZILKMSA-N Glu-Lys-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O SWRVAQHFBRZVNX-GUBZILKMSA-N 0.000 description 1
- CUPSDFQZTVVTSK-GUBZILKMSA-N Glu-Lys-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(O)=O CUPSDFQZTVVTSK-GUBZILKMSA-N 0.000 description 1
- UJMNFCAHLYKWOZ-DCAQKATOSA-N Glu-Lys-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O UJMNFCAHLYKWOZ-DCAQKATOSA-N 0.000 description 1
- FMBWLLMUPXTXFC-SDDRHHMPSA-N Glu-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)O)N)C(=O)O FMBWLLMUPXTXFC-SDDRHHMPSA-N 0.000 description 1
- RBXSZQRSEGYDFG-GUBZILKMSA-N Glu-Lys-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O RBXSZQRSEGYDFG-GUBZILKMSA-N 0.000 description 1
- QDMVXRNLOPTPIE-WDCWCFNPSA-N Glu-Lys-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QDMVXRNLOPTPIE-WDCWCFNPSA-N 0.000 description 1
- JHSRJMUJOGLIHK-GUBZILKMSA-N Glu-Met-Glu Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)O)N JHSRJMUJOGLIHK-GUBZILKMSA-N 0.000 description 1
- CBEUFCJRFNZMCU-SRVKXCTJSA-N Glu-Met-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O CBEUFCJRFNZMCU-SRVKXCTJSA-N 0.000 description 1
- UMHRCVCZUPBBQW-GARJFASQSA-N Glu-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N UMHRCVCZUPBBQW-GARJFASQSA-N 0.000 description 1
- GMAGZGCAYLQBKF-NHCYSSNCSA-N Glu-Met-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O GMAGZGCAYLQBKF-NHCYSSNCSA-N 0.000 description 1
- PMSMKNYRZCKVMC-DRZSPHRISA-N Glu-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CCC(=O)O)N PMSMKNYRZCKVMC-DRZSPHRISA-N 0.000 description 1
- YTRBQAQSUDSIQE-FHWLQOOXSA-N Glu-Phe-Phe Chemical compound C([C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 YTRBQAQSUDSIQE-FHWLQOOXSA-N 0.000 description 1
- FGSGPLRPQCZBSQ-AVGNSLFASA-N Glu-Phe-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O FGSGPLRPQCZBSQ-AVGNSLFASA-N 0.000 description 1
- ITVBKCZZLJUUHI-HTUGSXCWSA-N Glu-Phe-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ITVBKCZZLJUUHI-HTUGSXCWSA-N 0.000 description 1
- JYXKPJVDCAWMDG-ZPFDUUQYSA-N Glu-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)O)N JYXKPJVDCAWMDG-ZPFDUUQYSA-N 0.000 description 1
- DMYACXMQUABZIQ-NRPADANISA-N Glu-Ser-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O DMYACXMQUABZIQ-NRPADANISA-N 0.000 description 1
- JVYNYWXHZWVJEF-NUMRIWBASA-N Glu-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O JVYNYWXHZWVJEF-NUMRIWBASA-N 0.000 description 1
- LWYUQLZOIORFFJ-XKBZYTNZSA-N Glu-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O LWYUQLZOIORFFJ-XKBZYTNZSA-N 0.000 description 1
- GPSHCSTUYOQPAI-JHEQGTHGSA-N Glu-Thr-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O GPSHCSTUYOQPAI-JHEQGTHGSA-N 0.000 description 1
- UMZHHILWZBFPGL-LOKLDPHHSA-N Glu-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O UMZHHILWZBFPGL-LOKLDPHHSA-N 0.000 description 1
- MXJYXYDREQWUMS-XKBZYTNZSA-N Glu-Thr-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O MXJYXYDREQWUMS-XKBZYTNZSA-N 0.000 description 1
- DLISPGXMKZTWQG-IFFSRLJSSA-N Glu-Thr-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O DLISPGXMKZTWQG-IFFSRLJSSA-N 0.000 description 1
- ZNOHKCPYDAYYDA-BPUTZDHNSA-N Glu-Trp-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZNOHKCPYDAYYDA-BPUTZDHNSA-N 0.000 description 1
- JLCYOCDGIUZMKQ-JBACZVJFSA-N Glu-Trp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CCC(=O)O)N JLCYOCDGIUZMKQ-JBACZVJFSA-N 0.000 description 1
- QEJKKJNDDDPSMU-KKUMJFAQSA-N Glu-Tyr-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCSC)C(O)=O QEJKKJNDDDPSMU-KKUMJFAQSA-N 0.000 description 1
- KIEICAOUSNYOLM-NRPADANISA-N Glu-Val-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O KIEICAOUSNYOLM-NRPADANISA-N 0.000 description 1
- ZALGPUWUVHOGAE-GVXVVHGQSA-N Glu-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZALGPUWUVHOGAE-GVXVVHGQSA-N 0.000 description 1
- QXUPRMQJDWJDFR-NRPADANISA-N Glu-Val-Ser Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QXUPRMQJDWJDFR-NRPADANISA-N 0.000 description 1
- WGYHAAXZWPEBDQ-IFFSRLJSSA-N Glu-Val-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WGYHAAXZWPEBDQ-IFFSRLJSSA-N 0.000 description 1
- SOYWRINXUSUWEQ-DLOVCJGASA-N Glu-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O SOYWRINXUSUWEQ-DLOVCJGASA-N 0.000 description 1
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 1
- 102000053187 Glucuronidase Human genes 0.000 description 1
- 108010060309 Glucuronidase Proteins 0.000 description 1
- PYTZFYUXZZHOAD-WHFBIAKZSA-N Gly-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)CN PYTZFYUXZZHOAD-WHFBIAKZSA-N 0.000 description 1
- PUUYVMYCMIWHFE-BQBZGAKWSA-N Gly-Ala-Arg Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PUUYVMYCMIWHFE-BQBZGAKWSA-N 0.000 description 1
- FKJQNJCQTKUBCD-XPUUQOCRSA-N Gly-Ala-His Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O FKJQNJCQTKUBCD-XPUUQOCRSA-N 0.000 description 1
- JBRBACJPBZNFMF-YUMQZZPRSA-N Gly-Ala-Lys Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN JBRBACJPBZNFMF-YUMQZZPRSA-N 0.000 description 1
- LERGJIVJIIODPZ-ZANVPECISA-N Gly-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)CN)C)C(O)=O)=CNC2=C1 LERGJIVJIIODPZ-ZANVPECISA-N 0.000 description 1
- OCQUNKSFDYDXBG-QXEWZRGKSA-N Gly-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N OCQUNKSFDYDXBG-QXEWZRGKSA-N 0.000 description 1
- XUORRGAFUQIMLC-STQMWFEESA-N Gly-Arg-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)CN)O XUORRGAFUQIMLC-STQMWFEESA-N 0.000 description 1
- WKJKBELXHCTHIJ-WPRPVWTQSA-N Gly-Arg-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N WKJKBELXHCTHIJ-WPRPVWTQSA-N 0.000 description 1
- CIMULJZTTOBOPN-WHFBIAKZSA-N Gly-Asn-Asn Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CIMULJZTTOBOPN-WHFBIAKZSA-N 0.000 description 1
- NZAFOTBEULLEQB-WDSKDSINSA-N Gly-Asn-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN NZAFOTBEULLEQB-WDSKDSINSA-N 0.000 description 1
- XRTDOIOIBMAXCT-NKWVEPMBSA-N Gly-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)CN)C(=O)O XRTDOIOIBMAXCT-NKWVEPMBSA-N 0.000 description 1
- GRIRDMVMJJDZKV-RCOVLWMOSA-N Gly-Asn-Val Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O GRIRDMVMJJDZKV-RCOVLWMOSA-N 0.000 description 1
- KQDMENMTYNBWMR-WHFBIAKZSA-N Gly-Asp-Ala Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O KQDMENMTYNBWMR-WHFBIAKZSA-N 0.000 description 1
- LLXVQPKEQQCISF-YUMQZZPRSA-N Gly-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN LLXVQPKEQQCISF-YUMQZZPRSA-N 0.000 description 1
- JPWIMMUNWUKOAD-STQMWFEESA-N Gly-Asp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN JPWIMMUNWUKOAD-STQMWFEESA-N 0.000 description 1
- NMROINAYXCACKF-WHFBIAKZSA-N Gly-Cys-Cys Chemical compound NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CS)C(O)=O NMROINAYXCACKF-WHFBIAKZSA-N 0.000 description 1
- CEXINUGNTZFNRY-BYPYZUCNSA-N Gly-Cys-Gly Chemical compound [NH3+]CC(=O)N[C@@H](CS)C(=O)NCC([O-])=O CEXINUGNTZFNRY-BYPYZUCNSA-N 0.000 description 1
- LXXANCRPFBSSKS-IUCAKERBSA-N Gly-Gln-Leu Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LXXANCRPFBSSKS-IUCAKERBSA-N 0.000 description 1
- GNPVTZJUUBPZKW-WDSKDSINSA-N Gly-Gln-Ser Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O GNPVTZJUUBPZKW-WDSKDSINSA-N 0.000 description 1
- QPDUVFSVVAOUHE-XVKPBYJWSA-N Gly-Gln-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)CN)C(O)=O QPDUVFSVVAOUHE-XVKPBYJWSA-N 0.000 description 1
- FIQQRCFQXGLOSZ-WDSKDSINSA-N Gly-Glu-Asp Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O FIQQRCFQXGLOSZ-WDSKDSINSA-N 0.000 description 1
- STVHDEHTKFXBJQ-LAEOZQHASA-N Gly-Glu-Ile Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O STVHDEHTKFXBJQ-LAEOZQHASA-N 0.000 description 1
- ZQIMMEYPEXIYBB-IUCAKERBSA-N Gly-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN ZQIMMEYPEXIYBB-IUCAKERBSA-N 0.000 description 1
- BEQGFMIBZFNROK-JGVFFNPUSA-N Gly-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)CN)C(=O)O BEQGFMIBZFNROK-JGVFFNPUSA-N 0.000 description 1
- YWAQATDNEKZFFK-BYPYZUCNSA-N Gly-Gly-Ser Chemical compound NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O YWAQATDNEKZFFK-BYPYZUCNSA-N 0.000 description 1
- VAXIVIPMCTYSHI-YUMQZZPRSA-N Gly-His-Asp Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)CN VAXIVIPMCTYSHI-YUMQZZPRSA-N 0.000 description 1
- HHSOPSCKAZKQHQ-PEXQALLHSA-N Gly-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)CN HHSOPSCKAZKQHQ-PEXQALLHSA-N 0.000 description 1
- YFGONBOFGGWKKY-VHSXEESVSA-N Gly-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)CN)C(=O)O YFGONBOFGGWKKY-VHSXEESVSA-N 0.000 description 1
- LPCKHUXOGVNZRS-YUMQZZPRSA-N Gly-His-Ser Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O LPCKHUXOGVNZRS-YUMQZZPRSA-N 0.000 description 1
- SIYTVHWNKGIGMD-HOTGVXAUSA-N Gly-His-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC3=CN=CN3)NC(=O)CN SIYTVHWNKGIGMD-HOTGVXAUSA-N 0.000 description 1
- UESJMAMHDLEHGM-NHCYSSNCSA-N Gly-Ile-Leu Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O UESJMAMHDLEHGM-NHCYSSNCSA-N 0.000 description 1
- CCBIBMKQNXHNIN-ZETCQYMHSA-N Gly-Leu-Gly Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O CCBIBMKQNXHNIN-ZETCQYMHSA-N 0.000 description 1
- LIXWIUAORXJNBH-QWRGUYRKSA-N Gly-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)CN LIXWIUAORXJNBH-QWRGUYRKSA-N 0.000 description 1
- NNCSJUBVFBDDLC-YUMQZZPRSA-N Gly-Leu-Ser Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O NNCSJUBVFBDDLC-YUMQZZPRSA-N 0.000 description 1
- MIIVFRCYJABHTQ-ONGXEEELSA-N Gly-Leu-Val Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O MIIVFRCYJABHTQ-ONGXEEELSA-N 0.000 description 1
- VBOBNHSVQKKTOT-YUMQZZPRSA-N Gly-Lys-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O VBOBNHSVQKKTOT-YUMQZZPRSA-N 0.000 description 1
- PTIIBFKSLCYQBO-NHCYSSNCSA-N Gly-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)CN PTIIBFKSLCYQBO-NHCYSSNCSA-N 0.000 description 1
- VEPBEGNDJYANCF-QWRGUYRKSA-N Gly-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN VEPBEGNDJYANCF-QWRGUYRKSA-N 0.000 description 1
- MHZXESQPPXOING-KBPBESRZSA-N Gly-Lys-Phe Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MHZXESQPPXOING-KBPBESRZSA-N 0.000 description 1
- NTBOEZICHOSJEE-YUMQZZPRSA-N Gly-Lys-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NTBOEZICHOSJEE-YUMQZZPRSA-N 0.000 description 1
- FXGRXIATVXUAHO-WEDXCCLWSA-N Gly-Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN FXGRXIATVXUAHO-WEDXCCLWSA-N 0.000 description 1
- YKJUITHASJAGHO-HOTGVXAUSA-N Gly-Lys-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)CN YKJUITHASJAGHO-HOTGVXAUSA-N 0.000 description 1
- DBJYVKDPGIFXFO-BQBZGAKWSA-N Gly-Met-Ala Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O DBJYVKDPGIFXFO-BQBZGAKWSA-N 0.000 description 1
- SJLKKOZFHSJJAW-YUMQZZPRSA-N Gly-Met-Glu Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)CN SJLKKOZFHSJJAW-YUMQZZPRSA-N 0.000 description 1
- GAFKBWKVXNERFA-QWRGUYRKSA-N Gly-Phe-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 GAFKBWKVXNERFA-QWRGUYRKSA-N 0.000 description 1
- IGOYNRWLWHWAQO-JTQLQIEISA-N Gly-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 IGOYNRWLWHWAQO-JTQLQIEISA-N 0.000 description 1
- JPVGHHQGKPQYIL-KBPBESRZSA-N Gly-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 JPVGHHQGKPQYIL-KBPBESRZSA-N 0.000 description 1
- GGLIDLCEPDHEJO-BQBZGAKWSA-N Gly-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)CN GGLIDLCEPDHEJO-BQBZGAKWSA-N 0.000 description 1
- NSVOVKWEKGEOQB-LURJTMIESA-N Gly-Pro-Gly Chemical compound NCC(=O)N1CCC[C@H]1C(=O)NCC(O)=O NSVOVKWEKGEOQB-LURJTMIESA-N 0.000 description 1
- LBDXVCBAJJNJNN-WHFBIAKZSA-N Gly-Ser-Cys Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(O)=O LBDXVCBAJJNJNN-WHFBIAKZSA-N 0.000 description 1
- MKIAPEZXQDILRR-YUMQZZPRSA-N Gly-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)CN MKIAPEZXQDILRR-YUMQZZPRSA-N 0.000 description 1
- JSLVAHYTAJJEQH-QWRGUYRKSA-N Gly-Ser-Phe Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JSLVAHYTAJJEQH-QWRGUYRKSA-N 0.000 description 1
- HUFUVTYGPOUCBN-MBLNEYKQSA-N Gly-Thr-Ile Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HUFUVTYGPOUCBN-MBLNEYKQSA-N 0.000 description 1
- BXDLTKLPPKBVEL-FJXKBIBVSA-N Gly-Thr-Met Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O BXDLTKLPPKBVEL-FJXKBIBVSA-N 0.000 description 1
- MYXNLWDWWOTERK-BHNWBGBOSA-N Gly-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN)O MYXNLWDWWOTERK-BHNWBGBOSA-N 0.000 description 1
- FFALDIDGPLUDKV-ZDLURKLDSA-N Gly-Thr-Ser Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O FFALDIDGPLUDKV-ZDLURKLDSA-N 0.000 description 1
- TVTZEOHWHUVYCG-KYNKHSRBSA-N Gly-Thr-Thr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O TVTZEOHWHUVYCG-KYNKHSRBSA-N 0.000 description 1
- RJVZMGQMJOQIAX-GJZGRUSLSA-N Gly-Trp-Met Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCSC)C(O)=O RJVZMGQMJOQIAX-GJZGRUSLSA-N 0.000 description 1
- YJDALMUYJIENAG-QWRGUYRKSA-N Gly-Tyr-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN)O YJDALMUYJIENAG-QWRGUYRKSA-N 0.000 description 1
- KOYUSMBPJOVSOO-XEGUGMAKSA-N Gly-Tyr-Ile Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KOYUSMBPJOVSOO-XEGUGMAKSA-N 0.000 description 1
- RIYIFUFFFBIOEU-KBPBESRZSA-N Gly-Tyr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 RIYIFUFFFBIOEU-KBPBESRZSA-N 0.000 description 1
- SYOJVRNQCXYEOV-XVKPBYJWSA-N Gly-Val-Glu Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SYOJVRNQCXYEOV-XVKPBYJWSA-N 0.000 description 1
- RYAOJUMWLWUGNW-QMMMGPOBSA-N Gly-Val-Gly Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O RYAOJUMWLWUGNW-QMMMGPOBSA-N 0.000 description 1
- ZVXMEWXHFBYJPI-LSJOCFKGSA-N Gly-Val-Ile Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZVXMEWXHFBYJPI-LSJOCFKGSA-N 0.000 description 1
- MUGLKCQHTUFLGF-WPRPVWTQSA-N Gly-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)CN MUGLKCQHTUFLGF-WPRPVWTQSA-N 0.000 description 1
- SBVMXEZQJVUARN-XPUUQOCRSA-N Gly-Val-Ser Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O SBVMXEZQJVUARN-XPUUQOCRSA-N 0.000 description 1
- 239000004471 Glycine Substances 0.000 description 1
- 208000034308 Grand mal convulsion Diseases 0.000 description 1
- 229920000084 Gum arabic Polymers 0.000 description 1
- RVKIPWVMZANZLI-UHFFFAOYSA-N H-Lys-Trp-OH Natural products C1=CC=C2C(CC(NC(=O)C(N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-UHFFFAOYSA-N 0.000 description 1
- AFPFGFUGETYOSY-HGNGGELXSA-N His-Ala-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AFPFGFUGETYOSY-HGNGGELXSA-N 0.000 description 1
- TVQGUFGDVODUIF-LSJOCFKGSA-N His-Arg-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CN=CN1)N TVQGUFGDVODUIF-LSJOCFKGSA-N 0.000 description 1
- JHVCZQFWRLHUQR-DCAQKATOSA-N His-Arg-Cys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N JHVCZQFWRLHUQR-DCAQKATOSA-N 0.000 description 1
- ZPVJJPAIUZLSNE-DCAQKATOSA-N His-Arg-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O ZPVJJPAIUZLSNE-DCAQKATOSA-N 0.000 description 1
- WJUYPBBCSSLVJE-CIUDSAMLSA-N His-Asn-Cys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N WJUYPBBCSSLVJE-CIUDSAMLSA-N 0.000 description 1
- KYMUEAZVLPRVAE-GUBZILKMSA-N His-Asn-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O KYMUEAZVLPRVAE-GUBZILKMSA-N 0.000 description 1
- UZZXGLOJRZKYEL-DJFWLOJKSA-N His-Asn-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UZZXGLOJRZKYEL-DJFWLOJKSA-N 0.000 description 1
- MDBYBTWRMOAJAY-NHCYSSNCSA-N His-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N MDBYBTWRMOAJAY-NHCYSSNCSA-N 0.000 description 1
- WZOGEMJIZBNFBK-CIUDSAMLSA-N His-Asp-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O WZOGEMJIZBNFBK-CIUDSAMLSA-N 0.000 description 1
- UOAVQQRILDGZEN-SRVKXCTJSA-N His-Asp-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O UOAVQQRILDGZEN-SRVKXCTJSA-N 0.000 description 1
- BDHUXUFYNUOUIT-SRVKXCTJSA-N His-Asp-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BDHUXUFYNUOUIT-SRVKXCTJSA-N 0.000 description 1
- WGVPDSNCHDEDBP-KKUMJFAQSA-N His-Asp-Phe Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WGVPDSNCHDEDBP-KKUMJFAQSA-N 0.000 description 1
- SWSVTNGMKBDTBM-DCAQKATOSA-N His-Gln-Glu Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SWSVTNGMKBDTBM-DCAQKATOSA-N 0.000 description 1
- HVCRQRQPIIRNLY-IUCAKERBSA-N His-Gln-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)NCC(=O)O)N HVCRQRQPIIRNLY-IUCAKERBSA-N 0.000 description 1
- HIAHVKLTHNOENC-HGNGGELXSA-N His-Glu-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O HIAHVKLTHNOENC-HGNGGELXSA-N 0.000 description 1
- SDTPKSOWFXBACN-GUBZILKMSA-N His-Glu-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O SDTPKSOWFXBACN-GUBZILKMSA-N 0.000 description 1
- JCOSMKPAOYDKRO-AVGNSLFASA-N His-Glu-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N JCOSMKPAOYDKRO-AVGNSLFASA-N 0.000 description 1
- WEIYKCOEVBUJQC-JYJNAYRXSA-N His-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC2=CN=CN2)N WEIYKCOEVBUJQC-JYJNAYRXSA-N 0.000 description 1
- STWGDDDFLUFCCA-GVXVVHGQSA-N His-Glu-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O STWGDDDFLUFCCA-GVXVVHGQSA-N 0.000 description 1
- YADRBUZBKHHDAO-XPUUQOCRSA-N His-Gly-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](C)C(O)=O YADRBUZBKHHDAO-XPUUQOCRSA-N 0.000 description 1
- JBSLJUPMTYLLFH-MELADBBJSA-N His-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC3=CN=CN3)N)C(=O)O JBSLJUPMTYLLFH-MELADBBJSA-N 0.000 description 1
- BZKDJRSZWLPJNI-SRVKXCTJSA-N His-His-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O BZKDJRSZWLPJNI-SRVKXCTJSA-N 0.000 description 1
- LYDKQVYYCMYNMC-SRVKXCTJSA-N His-Lys-Cys Chemical compound NCCCC[C@@H](C(=O)N[C@@H](CS)C(O)=O)NC(=O)[C@@H](N)CC1=CN=CN1 LYDKQVYYCMYNMC-SRVKXCTJSA-N 0.000 description 1
- DEOQGJUXUQGUJN-KKUMJFAQSA-N His-Lys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N DEOQGJUXUQGUJN-KKUMJFAQSA-N 0.000 description 1
- DPQIPEAHIYMUEJ-IHRRRGAJSA-N His-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CN=CN1)N DPQIPEAHIYMUEJ-IHRRRGAJSA-N 0.000 description 1
- IGBBXBFSLKRHJB-BZSNNMDCSA-N His-Lys-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CN=CN1 IGBBXBFSLKRHJB-BZSNNMDCSA-N 0.000 description 1
- BKOVCRUIXDIWFV-IXOXFDKPSA-N His-Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CN=CN1 BKOVCRUIXDIWFV-IXOXFDKPSA-N 0.000 description 1
- TVMNTHXFRSXZGR-IHRRRGAJSA-N His-Lys-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O TVMNTHXFRSXZGR-IHRRRGAJSA-N 0.000 description 1
- KYFGGRHWLFZXPU-KKUMJFAQSA-N His-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N KYFGGRHWLFZXPU-KKUMJFAQSA-N 0.000 description 1
- YAEKRYQASVCDLK-JYJNAYRXSA-N His-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N YAEKRYQASVCDLK-JYJNAYRXSA-N 0.000 description 1
- SVVULKPWDBIPCO-BZSNNMDCSA-N His-Phe-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O SVVULKPWDBIPCO-BZSNNMDCSA-N 0.000 description 1
- SOYCWSKCUVDLMC-AVGNSLFASA-N His-Pro-Arg Chemical compound N[C@@H](Cc1cnc[nH]1)C(=O)N2CCC[C@H]2C(=O)N[C@@H](CCCNC(=N)N)C(=O)O SOYCWSKCUVDLMC-AVGNSLFASA-N 0.000 description 1
- VCBWXASUBZIFLQ-IHRRRGAJSA-N His-Pro-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O VCBWXASUBZIFLQ-IHRRRGAJSA-N 0.000 description 1
- FHKZHRMERJUXRJ-DCAQKATOSA-N His-Ser-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 FHKZHRMERJUXRJ-DCAQKATOSA-N 0.000 description 1
- JMSONHOUHFDOJH-GUBZILKMSA-N His-Ser-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 JMSONHOUHFDOJH-GUBZILKMSA-N 0.000 description 1
- PZAJPILZRFPYJJ-SRVKXCTJSA-N His-Ser-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O PZAJPILZRFPYJJ-SRVKXCTJSA-N 0.000 description 1
- GIRSNERMXCMDBO-GARJFASQSA-N His-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CN=CN2)N)C(=O)O GIRSNERMXCMDBO-GARJFASQSA-N 0.000 description 1
- FFKJUTZARGRVTH-KKUMJFAQSA-N His-Ser-Tyr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O FFKJUTZARGRVTH-KKUMJFAQSA-N 0.000 description 1
- FBVHRDXSCYELMI-PBCZWWQYSA-N His-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N)O FBVHRDXSCYELMI-PBCZWWQYSA-N 0.000 description 1
- XHQYFGPIRUHQIB-PBCZWWQYSA-N His-Thr-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CN=CN1 XHQYFGPIRUHQIB-PBCZWWQYSA-N 0.000 description 1
- IXQGOKWTQPCIQM-YJRXYDGGSA-N His-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)O IXQGOKWTQPCIQM-YJRXYDGGSA-N 0.000 description 1
- DEMIXZCKUXVEBO-BWAGICSOSA-N His-Thr-Tyr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)O DEMIXZCKUXVEBO-BWAGICSOSA-N 0.000 description 1
- UIRUVUUGUYCMBY-KCTSRDHCSA-N His-Trp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC3=CN=CN3)N UIRUVUUGUYCMBY-KCTSRDHCSA-N 0.000 description 1
- FRDFAWHTPDKRHG-ULQDDVLXSA-N His-Tyr-Arg Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)C1=CN=CN1 FRDFAWHTPDKRHG-ULQDDVLXSA-N 0.000 description 1
- GBMSSORHVHAYLU-QTKMDUPCSA-N His-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CN=CN1)N)O GBMSSORHVHAYLU-QTKMDUPCSA-N 0.000 description 1
- UFHFLCQGNIYNRP-UHFFFAOYSA-N Hydrogen Chemical compound [H][H] UFHFLCQGNIYNRP-UHFFFAOYSA-N 0.000 description 1
- XQFRJNBWHJMXHO-RRKCRQDMSA-N IDUR Chemical compound C1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)NC(=O)C(I)=C1 XQFRJNBWHJMXHO-RRKCRQDMSA-N 0.000 description 1
- RWIKBYVJQAJYDP-BJDJZHNGSA-N Ile-Ala-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RWIKBYVJQAJYDP-BJDJZHNGSA-N 0.000 description 1
- MKWSZEHGHSLNPF-NAKRPEOUSA-N Ile-Ala-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O)N MKWSZEHGHSLNPF-NAKRPEOUSA-N 0.000 description 1
- FVEWRQXNISSYFO-ZPFDUUQYSA-N Ile-Arg-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N FVEWRQXNISSYFO-ZPFDUUQYSA-N 0.000 description 1
- QADCTXFNLZBZAB-GHCJXIJMSA-N Ile-Asn-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C)C(=O)O)N QADCTXFNLZBZAB-GHCJXIJMSA-N 0.000 description 1
- PJLLMGWWINYQPB-PEFMBERDSA-N Ile-Asn-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PJLLMGWWINYQPB-PEFMBERDSA-N 0.000 description 1
- QIHJTGSVGIPHIW-QSFUFRPTSA-N Ile-Asn-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N QIHJTGSVGIPHIW-QSFUFRPTSA-N 0.000 description 1
- NBJAAWYRLGCJOF-UGYAYLCHSA-N Ile-Asp-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N NBJAAWYRLGCJOF-UGYAYLCHSA-N 0.000 description 1
- UDLAWRKOVFDKFL-PEFMBERDSA-N Ile-Asp-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N UDLAWRKOVFDKFL-PEFMBERDSA-N 0.000 description 1
- RGSOCXHDOPQREB-ZPFDUUQYSA-N Ile-Asp-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N RGSOCXHDOPQREB-ZPFDUUQYSA-N 0.000 description 1
- NPROWIBAWYMPAZ-GUDRVLHUSA-N Ile-Asp-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N NPROWIBAWYMPAZ-GUDRVLHUSA-N 0.000 description 1
- OVPYIUNCVSOVNF-KQXIARHKSA-N Ile-Gln-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N OVPYIUNCVSOVNF-KQXIARHKSA-N 0.000 description 1
- OVPYIUNCVSOVNF-ZPFDUUQYSA-N Ile-Gln-Pro Natural products CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O OVPYIUNCVSOVNF-ZPFDUUQYSA-N 0.000 description 1
- XLCZWMJPVGRWHJ-KQXIARHKSA-N Ile-Glu-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N XLCZWMJPVGRWHJ-KQXIARHKSA-N 0.000 description 1
- LPFBXFILACZHIB-LAEOZQHASA-N Ile-Gly-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)O)C(=O)O)N LPFBXFILACZHIB-LAEOZQHASA-N 0.000 description 1
- URWXDJAEEGBADB-TUBUOCAGSA-N Ile-His-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N URWXDJAEEGBADB-TUBUOCAGSA-N 0.000 description 1
- YNMQUIVKEFRCPH-QSFUFRPTSA-N Ile-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)O)N YNMQUIVKEFRCPH-QSFUFRPTSA-N 0.000 description 1
- TWYOYAKMLHWMOJ-ZPFDUUQYSA-N Ile-Leu-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O TWYOYAKMLHWMOJ-ZPFDUUQYSA-N 0.000 description 1
- NUKXXNFEUZGPRO-BJDJZHNGSA-N Ile-Leu-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)O)N NUKXXNFEUZGPRO-BJDJZHNGSA-N 0.000 description 1
- HUORUFRRJHELPD-MNXVOIDGSA-N Ile-Leu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N HUORUFRRJHELPD-MNXVOIDGSA-N 0.000 description 1
- FZWVCYCYWCLQDH-NHCYSSNCSA-N Ile-Leu-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N FZWVCYCYWCLQDH-NHCYSSNCSA-N 0.000 description 1
- TVYWVSJGSHQWMT-AJNGGQMLSA-N Ile-Leu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N TVYWVSJGSHQWMT-AJNGGQMLSA-N 0.000 description 1
- PMMMQRVUMVURGJ-XUXIUFHCSA-N Ile-Leu-Pro Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O PMMMQRVUMVURGJ-XUXIUFHCSA-N 0.000 description 1
- RMNMUUCYTMLWNA-ZPFDUUQYSA-N Ile-Lys-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N RMNMUUCYTMLWNA-ZPFDUUQYSA-N 0.000 description 1
- ADDYYRVQQZFIMW-MNXVOIDGSA-N Ile-Lys-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ADDYYRVQQZFIMW-MNXVOIDGSA-N 0.000 description 1
- FFAUOCITXBMRBT-YTFOTSKYSA-N Ile-Lys-Ile Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FFAUOCITXBMRBT-YTFOTSKYSA-N 0.000 description 1
- YSGBJIQXTIVBHZ-AJNGGQMLSA-N Ile-Lys-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O YSGBJIQXTIVBHZ-AJNGGQMLSA-N 0.000 description 1
- GLYJPWIRLBAIJH-FQUUOJAGSA-N Ile-Lys-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N GLYJPWIRLBAIJH-FQUUOJAGSA-N 0.000 description 1
- GLYJPWIRLBAIJH-UHFFFAOYSA-N Ile-Lys-Pro Natural products CCC(C)C(N)C(=O)NC(CCCCN)C(=O)N1CCCC1C(O)=O GLYJPWIRLBAIJH-UHFFFAOYSA-N 0.000 description 1
- NPAYJTAXWXJKLO-NAKRPEOUSA-N Ile-Met-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)O)N NPAYJTAXWXJKLO-NAKRPEOUSA-N 0.000 description 1
- IIWQTXMUALXGOV-PCBIJLKTSA-N Ile-Phe-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N IIWQTXMUALXGOV-PCBIJLKTSA-N 0.000 description 1
- UAELWXJFLZBKQS-WHOFXGATSA-N Ile-Phe-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)NCC(O)=O UAELWXJFLZBKQS-WHOFXGATSA-N 0.000 description 1
- XLXPYSDGMXTTNQ-UHFFFAOYSA-N Ile-Phe-Leu Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(CC(C)C)C(O)=O)CC1=CC=CC=C1 XLXPYSDGMXTTNQ-UHFFFAOYSA-N 0.000 description 1
- KCTIFOCXAIUQQK-QXEWZRGKSA-N Ile-Pro-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O KCTIFOCXAIUQQK-QXEWZRGKSA-N 0.000 description 1
- FQYQMFCIJNWDQZ-CYDGBPFRSA-N Ile-Pro-Pro Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 FQYQMFCIJNWDQZ-CYDGBPFRSA-N 0.000 description 1
- YKZAMJXNJUWFIK-JBDRJPRFSA-N Ile-Ser-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(=O)O)N YKZAMJXNJUWFIK-JBDRJPRFSA-N 0.000 description 1
- ZLFNNVATRMCAKN-ZKWXMUAHSA-N Ile-Ser-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZLFNNVATRMCAKN-ZKWXMUAHSA-N 0.000 description 1
- VGSPNSSCMOHRRR-BJDJZHNGSA-N Ile-Ser-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N VGSPNSSCMOHRRR-BJDJZHNGSA-N 0.000 description 1
- PXKACEXYLPBMAD-JBDRJPRFSA-N Ile-Ser-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PXKACEXYLPBMAD-JBDRJPRFSA-N 0.000 description 1
- HXIDVIFHRYRXLZ-NAKRPEOUSA-N Ile-Ser-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)O)N HXIDVIFHRYRXLZ-NAKRPEOUSA-N 0.000 description 1
- CNMOKANDJMLAIF-CIQUZCHMSA-N Ile-Thr-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O CNMOKANDJMLAIF-CIQUZCHMSA-N 0.000 description 1
- SAEWJTCJQVZQNZ-IUKAMOBKSA-N Ile-Thr-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N SAEWJTCJQVZQNZ-IUKAMOBKSA-N 0.000 description 1
- JJQQGCMKLOEGAV-OSUNSFLBSA-N Ile-Thr-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)O)N JJQQGCMKLOEGAV-OSUNSFLBSA-N 0.000 description 1
- WXLYNEHOGRYNFU-URLPEUOOSA-N Ile-Thr-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N WXLYNEHOGRYNFU-URLPEUOOSA-N 0.000 description 1
- RTSQPLLOYSGMKM-DSYPUSFNSA-N Ile-Trp-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(C)C)C(=O)O)N RTSQPLLOYSGMKM-DSYPUSFNSA-N 0.000 description 1
- PRTZQMBYUZFSFA-XEGUGMAKSA-N Ile-Tyr-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)NCC(=O)O)N PRTZQMBYUZFSFA-XEGUGMAKSA-N 0.000 description 1
- AUIYHFRUOOKTGX-UKJIMTQDSA-N Ile-Val-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N AUIYHFRUOOKTGX-UKJIMTQDSA-N 0.000 description 1
- 108010021625 Immunoglobulin Fragments Proteins 0.000 description 1
- 102000008394 Immunoglobulin Fragments Human genes 0.000 description 1
- 102000017727 Immunoglobulin Variable Region Human genes 0.000 description 1
- 108010067060 Immunoglobulin Variable Region Proteins 0.000 description 1
- 208000026350 Inborn Genetic disease Diseases 0.000 description 1
- 108020005350 Initiator Codon Proteins 0.000 description 1
- 229930010555 Inosine Natural products 0.000 description 1
- UGQMRVRMYYASKQ-KQYNXXCUSA-N Inosine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C2=NC=NC(O)=C2N=C1 UGQMRVRMYYASKQ-KQYNXXCUSA-N 0.000 description 1
- 108091092195 Intron Proteins 0.000 description 1
- 241000235058 Komagataella pastoris Species 0.000 description 1
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 1
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 1
- ODKSFYDXXFIFQN-BYPYZUCNSA-N L-arginine Chemical compound OC(=O)[C@@H](N)CCCN=C(N)N ODKSFYDXXFIFQN-BYPYZUCNSA-N 0.000 description 1
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 1
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 1
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 1
- FBOZXECLQNJBKD-ZDUSSCGKSA-N L-methotrexate Chemical compound C=1N=C2N=C(N)N=C(N)C2=NC=1CN(C)C1=CC=C(C(=O)N[C@@H](CCC(O)=O)C(O)=O)C=C1 FBOZXECLQNJBKD-ZDUSSCGKSA-N 0.000 description 1
- 125000000510 L-tryptophano group Chemical group [H]C1=C([H])C([H])=C2N([H])C([H])=C(C([H])([H])[C@@]([H])(C(O[H])=O)N([H])[*])C2=C1[H] 0.000 description 1
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 1
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 description 1
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 1
- 108091026898 Leader sequence (mRNA) Proteins 0.000 description 1
- KVRKAGGMEWNURO-CIUDSAMLSA-N Leu-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(C)C)N KVRKAGGMEWNURO-CIUDSAMLSA-N 0.000 description 1
- KWTVLKBOQATPHJ-SRVKXCTJSA-N Leu-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N KWTVLKBOQATPHJ-SRVKXCTJSA-N 0.000 description 1
- WSGXUIQTEZDVHJ-GARJFASQSA-N Leu-Ala-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(O)=O WSGXUIQTEZDVHJ-GARJFASQSA-N 0.000 description 1
- BQSLGJHIAGOZCD-CIUDSAMLSA-N Leu-Ala-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O BQSLGJHIAGOZCD-CIUDSAMLSA-N 0.000 description 1
- XBBKIIGCUMBKCO-JXUBOQSCSA-N Leu-Ala-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XBBKIIGCUMBKCO-JXUBOQSCSA-N 0.000 description 1
- HBJZFCIVFIBNSV-DCAQKATOSA-N Leu-Arg-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O HBJZFCIVFIBNSV-DCAQKATOSA-N 0.000 description 1
- KSZCCRIGNVSHFH-UWVGGRQHSA-N Leu-Arg-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O KSZCCRIGNVSHFH-UWVGGRQHSA-N 0.000 description 1
- FJUKMPUELVROGK-IHRRRGAJSA-N Leu-Arg-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N FJUKMPUELVROGK-IHRRRGAJSA-N 0.000 description 1
- VKOAHIRLIUESLU-ULQDDVLXSA-N Leu-Arg-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O VKOAHIRLIUESLU-ULQDDVLXSA-N 0.000 description 1
- IBMVEYRWAWIOTN-RWMBFGLXSA-N Leu-Arg-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(O)=O IBMVEYRWAWIOTN-RWMBFGLXSA-N 0.000 description 1
- UCOCBWDBHCUPQP-DCAQKATOSA-N Leu-Arg-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O UCOCBWDBHCUPQP-DCAQKATOSA-N 0.000 description 1
- DBVWMYGBVFCRBE-CIUDSAMLSA-N Leu-Asn-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O DBVWMYGBVFCRBE-CIUDSAMLSA-N 0.000 description 1
- BAJIJEGGUYXZGC-CIUDSAMLSA-N Leu-Asn-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N BAJIJEGGUYXZGC-CIUDSAMLSA-N 0.000 description 1
- VIWUBXKCYJGNCL-SRVKXCTJSA-N Leu-Asn-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 VIWUBXKCYJGNCL-SRVKXCTJSA-N 0.000 description 1
- WGNOPSQMIQERPK-GARJFASQSA-N Leu-Asn-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N WGNOPSQMIQERPK-GARJFASQSA-N 0.000 description 1
- WXHFZJFZWNCDNB-KKUMJFAQSA-N Leu-Asn-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WXHFZJFZWNCDNB-KKUMJFAQSA-N 0.000 description 1
- YKNBJXOJTURHCU-DCAQKATOSA-N Leu-Asp-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YKNBJXOJTURHCU-DCAQKATOSA-N 0.000 description 1
- DLFAACQHIRSQGG-CIUDSAMLSA-N Leu-Asp-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O DLFAACQHIRSQGG-CIUDSAMLSA-N 0.000 description 1
- PJYSOYLLTJKZHC-GUBZILKMSA-N Leu-Asp-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(N)=O PJYSOYLLTJKZHC-GUBZILKMSA-N 0.000 description 1
- ILJREDZFPHTUIE-GUBZILKMSA-N Leu-Asp-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ILJREDZFPHTUIE-GUBZILKMSA-N 0.000 description 1
- ZDSNOSQHMJBRQN-SRVKXCTJSA-N Leu-Asp-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N ZDSNOSQHMJBRQN-SRVKXCTJSA-N 0.000 description 1
- ZYLJULGXQDNXDK-GUBZILKMSA-N Leu-Gln-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ZYLJULGXQDNXDK-GUBZILKMSA-N 0.000 description 1
- RSFGIMMPWAXNML-MNXVOIDGSA-N Leu-Gln-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RSFGIMMPWAXNML-MNXVOIDGSA-N 0.000 description 1
- GPICTNQYKHHHTH-GUBZILKMSA-N Leu-Gln-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O GPICTNQYKHHHTH-GUBZILKMSA-N 0.000 description 1
- CIVKXGPFXDIQBV-WDCWCFNPSA-N Leu-Gln-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CIVKXGPFXDIQBV-WDCWCFNPSA-N 0.000 description 1
- YVKSMSDXKMSIRX-GUBZILKMSA-N Leu-Glu-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O YVKSMSDXKMSIRX-GUBZILKMSA-N 0.000 description 1
- HPBCTWSUJOGJSH-MNXVOIDGSA-N Leu-Glu-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HPBCTWSUJOGJSH-MNXVOIDGSA-N 0.000 description 1
- OGUUKPXUTHOIAV-SDDRHHMPSA-N Leu-Glu-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N OGUUKPXUTHOIAV-SDDRHHMPSA-N 0.000 description 1
- WQWSMEOYXJTFRU-GUBZILKMSA-N Leu-Glu-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O WQWSMEOYXJTFRU-GUBZILKMSA-N 0.000 description 1
- QJUWBDPGGYVRHY-YUMQZZPRSA-N Leu-Gly-Cys Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N QJUWBDPGGYVRHY-YUMQZZPRSA-N 0.000 description 1
- FIYMBBHGYNQFOP-IUCAKERBSA-N Leu-Gly-Gln Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N FIYMBBHGYNQFOP-IUCAKERBSA-N 0.000 description 1
- VGPCJSXPPOQPBK-YUMQZZPRSA-N Leu-Gly-Ser Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O VGPCJSXPPOQPBK-YUMQZZPRSA-N 0.000 description 1
- CFZZDVMBRYFFNU-QWRGUYRKSA-N Leu-His-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)NCC(O)=O CFZZDVMBRYFFNU-QWRGUYRKSA-N 0.000 description 1
- AOFYPTOHESIBFZ-KKUMJFAQSA-N Leu-His-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O AOFYPTOHESIBFZ-KKUMJFAQSA-N 0.000 description 1
- AVEGDIAXTDVBJS-XUXIUFHCSA-N Leu-Ile-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AVEGDIAXTDVBJS-XUXIUFHCSA-N 0.000 description 1
- USLNHQZCDQJBOV-ZPFDUUQYSA-N Leu-Ile-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O USLNHQZCDQJBOV-ZPFDUUQYSA-N 0.000 description 1
- HGFGEMSVBMCFKK-MNXVOIDGSA-N Leu-Ile-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O HGFGEMSVBMCFKK-MNXVOIDGSA-N 0.000 description 1
- JFSGIJSCJFQGSZ-MXAVVETBSA-N Leu-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(C)C)N JFSGIJSCJFQGSZ-MXAVVETBSA-N 0.000 description 1
- FAELBUXXFQLUAX-AJNGGQMLSA-N Leu-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(C)C FAELBUXXFQLUAX-AJNGGQMLSA-N 0.000 description 1
- UBZGNBKMIJHOHL-BZSNNMDCSA-N Leu-Leu-Phe Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 UBZGNBKMIJHOHL-BZSNNMDCSA-N 0.000 description 1
- IEWBEPKLKUXQBU-VOAKCMCISA-N Leu-Leu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IEWBEPKLKUXQBU-VOAKCMCISA-N 0.000 description 1
- FOBUGKUBUJOWAD-IHPCNDPISA-N Leu-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 FOBUGKUBUJOWAD-IHPCNDPISA-N 0.000 description 1
- ZRHDPZAAWLXXIR-SRVKXCTJSA-N Leu-Lys-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O ZRHDPZAAWLXXIR-SRVKXCTJSA-N 0.000 description 1
- ZGUMORRUBUCXEH-AVGNSLFASA-N Leu-Lys-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZGUMORRUBUCXEH-AVGNSLFASA-N 0.000 description 1
- HVHRPWQEQHIQJF-AVGNSLFASA-N Leu-Lys-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HVHRPWQEQHIQJF-AVGNSLFASA-N 0.000 description 1
- LVTJJOJKDCVZGP-QWRGUYRKSA-N Leu-Lys-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O LVTJJOJKDCVZGP-QWRGUYRKSA-N 0.000 description 1
- BGZCJDGBBUUBHA-KKUMJFAQSA-N Leu-Lys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O BGZCJDGBBUUBHA-KKUMJFAQSA-N 0.000 description 1
- RTIRBWJPYJYTLO-MELADBBJSA-N Leu-Lys-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N RTIRBWJPYJYTLO-MELADBBJSA-N 0.000 description 1
- VCHVSKNMTXWIIP-SRVKXCTJSA-N Leu-Lys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O VCHVSKNMTXWIIP-SRVKXCTJSA-N 0.000 description 1
- LZHJZLHSRGWBBE-IHRRRGAJSA-N Leu-Lys-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LZHJZLHSRGWBBE-IHRRRGAJSA-N 0.000 description 1
- PKKMDPNFGULLNQ-AVGNSLFASA-N Leu-Met-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O PKKMDPNFGULLNQ-AVGNSLFASA-N 0.000 description 1
- KXCMQWMNYQOAKA-SRVKXCTJSA-N Leu-Met-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N KXCMQWMNYQOAKA-SRVKXCTJSA-N 0.000 description 1
- BJWKOATWNQJPSK-SRVKXCTJSA-N Leu-Met-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N BJWKOATWNQJPSK-SRVKXCTJSA-N 0.000 description 1
- GCXGCIYIHXSKAY-ULQDDVLXSA-N Leu-Phe-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GCXGCIYIHXSKAY-ULQDDVLXSA-N 0.000 description 1
- YESNGRDJQWDYLH-KKUMJFAQSA-N Leu-Phe-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)O)N YESNGRDJQWDYLH-KKUMJFAQSA-N 0.000 description 1
- INCJJHQRZGQLFC-KBPBESRZSA-N Leu-Phe-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O INCJJHQRZGQLFC-KBPBESRZSA-N 0.000 description 1
- KTOIECMYZZGVSI-BZSNNMDCSA-N Leu-Phe-His Chemical compound C([C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CC=CC=C1 KTOIECMYZZGVSI-BZSNNMDCSA-N 0.000 description 1
- QMKFDEUJGYNFMC-AVGNSLFASA-N Leu-Pro-Arg Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QMKFDEUJGYNFMC-AVGNSLFASA-N 0.000 description 1
- MUCIDQMDOYQYBR-IHRRRGAJSA-N Leu-Pro-His Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N MUCIDQMDOYQYBR-IHRRRGAJSA-N 0.000 description 1
- QONKWXNJRRNTBV-AVGNSLFASA-N Leu-Pro-Met Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)O)N QONKWXNJRRNTBV-AVGNSLFASA-N 0.000 description 1
- DPURXCQCHSQPAN-AVGNSLFASA-N Leu-Pro-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DPURXCQCHSQPAN-AVGNSLFASA-N 0.000 description 1
- JDBQSGMJBMPNFT-AVGNSLFASA-N Leu-Pro-Val Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O JDBQSGMJBMPNFT-AVGNSLFASA-N 0.000 description 1
- KZZCOWMDDXDKSS-CIUDSAMLSA-N Leu-Ser-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KZZCOWMDDXDKSS-CIUDSAMLSA-N 0.000 description 1
- KIZIOFNVSOSKJI-CIUDSAMLSA-N Leu-Ser-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N KIZIOFNVSOSKJI-CIUDSAMLSA-N 0.000 description 1
- AKVBOOKXVAMKSS-GUBZILKMSA-N Leu-Ser-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O AKVBOOKXVAMKSS-GUBZILKMSA-N 0.000 description 1
- JIHDFWWRYHSAQB-GUBZILKMSA-N Leu-Ser-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JIHDFWWRYHSAQB-GUBZILKMSA-N 0.000 description 1
- AMSSKPUHBUQBOQ-SRVKXCTJSA-N Leu-Ser-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N AMSSKPUHBUQBOQ-SRVKXCTJSA-N 0.000 description 1
- IWMJFLJQHIDZQW-KKUMJFAQSA-N Leu-Ser-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IWMJFLJQHIDZQW-KKUMJFAQSA-N 0.000 description 1
- SBANPBVRHYIMRR-GARJFASQSA-N Leu-Ser-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N SBANPBVRHYIMRR-GARJFASQSA-N 0.000 description 1
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 1
- ZJZNLRVCZWUONM-JXUBOQSCSA-N Leu-Thr-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O ZJZNLRVCZWUONM-JXUBOQSCSA-N 0.000 description 1
- LFSQWRSVPNKJGP-WDCWCFNPSA-N Leu-Thr-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(O)=O LFSQWRSVPNKJGP-WDCWCFNPSA-N 0.000 description 1
- HOMFINRJHIIZNJ-HOCLYGCPSA-N Leu-Trp-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)NCC(O)=O HOMFINRJHIIZNJ-HOCLYGCPSA-N 0.000 description 1
- UCRJTSIIAYHOHE-ULQDDVLXSA-N Leu-Tyr-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UCRJTSIIAYHOHE-ULQDDVLXSA-N 0.000 description 1
- VJGQRELPQWNURN-JYJNAYRXSA-N Leu-Tyr-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O VJGQRELPQWNURN-JYJNAYRXSA-N 0.000 description 1
- WFCKERTZVCQXKH-KBPBESRZSA-N Leu-Tyr-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O WFCKERTZVCQXKH-KBPBESRZSA-N 0.000 description 1
- BTEMNFBEAAOGBR-BZSNNMDCSA-N Leu-Tyr-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BTEMNFBEAAOGBR-BZSNNMDCSA-N 0.000 description 1
- FBNPMTNBFFAMMH-AVGNSLFASA-N Leu-Val-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-AVGNSLFASA-N 0.000 description 1
- XZNJZXJZBMBGGS-NHCYSSNCSA-N Leu-Val-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XZNJZXJZBMBGGS-NHCYSSNCSA-N 0.000 description 1
- MVJRBCJCRYGCKV-GVXVVHGQSA-N Leu-Val-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MVJRBCJCRYGCKV-GVXVVHGQSA-N 0.000 description 1
- AIMGJYMCTAABEN-GVXVVHGQSA-N Leu-Val-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIMGJYMCTAABEN-GVXVVHGQSA-N 0.000 description 1
- XOEDPXDZJHBQIX-ULQDDVLXSA-N Leu-Val-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XOEDPXDZJHBQIX-ULQDDVLXSA-N 0.000 description 1
- FDBTVENULFNTAL-XQQFMLRXSA-N Leu-Val-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N FDBTVENULFNTAL-XQQFMLRXSA-N 0.000 description 1
- QESXLSQLQHHTIX-RHYQMDGZSA-N Leu-Val-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QESXLSQLQHHTIX-RHYQMDGZSA-N 0.000 description 1
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 1
- 102000003960 Ligases Human genes 0.000 description 1
- 108090000364 Ligases Proteins 0.000 description 1
- 108060001084 Luciferase Proteins 0.000 description 1
- 239000005089 Luciferase Substances 0.000 description 1
- DDWFXDSYGUXRAY-UHFFFAOYSA-N Luciferin Natural products CCc1c(C)c(CC2NC(=O)C(=C2C=C)C)[nH]c1Cc3[nH]c4C(=C5/NC(CC(=O)O)C(C)C5CC(=O)O)CC(=O)c4c3C DDWFXDSYGUXRAY-UHFFFAOYSA-N 0.000 description 1
- PNPYKQFJGRFYJE-GUBZILKMSA-N Lys-Ala-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNPYKQFJGRFYJE-GUBZILKMSA-N 0.000 description 1
- NFLFJGGKOHYZJF-BJDJZHNGSA-N Lys-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN NFLFJGGKOHYZJF-BJDJZHNGSA-N 0.000 description 1
- VHXMZJGOKIMETG-CQDKDKBSSA-N Lys-Ala-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCCCN)N VHXMZJGOKIMETG-CQDKDKBSSA-N 0.000 description 1
- YNNPKXBBRZVIRX-IHRRRGAJSA-N Lys-Arg-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O YNNPKXBBRZVIRX-IHRRRGAJSA-N 0.000 description 1
- GGAPIOORBXHMNY-ULQDDVLXSA-N Lys-Arg-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCCN)N)O GGAPIOORBXHMNY-ULQDDVLXSA-N 0.000 description 1
- DNEJSAIMVANNPA-DCAQKATOSA-N Lys-Asn-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O DNEJSAIMVANNPA-DCAQKATOSA-N 0.000 description 1
- ABHIXYDMILIUKV-CIUDSAMLSA-N Lys-Asn-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ABHIXYDMILIUKV-CIUDSAMLSA-N 0.000 description 1
- QYOXSYXPHUHOJR-GUBZILKMSA-N Lys-Asn-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QYOXSYXPHUHOJR-GUBZILKMSA-N 0.000 description 1
- QUCDKEKDPYISNX-HJGDQZAQSA-N Lys-Asn-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QUCDKEKDPYISNX-HJGDQZAQSA-N 0.000 description 1
- OVIVOCSURJYCTM-GUBZILKMSA-N Lys-Asp-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O OVIVOCSURJYCTM-GUBZILKMSA-N 0.000 description 1
- AAORVPFVUIHEAB-YUMQZZPRSA-N Lys-Asp-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O AAORVPFVUIHEAB-YUMQZZPRSA-N 0.000 description 1
- IBQMEXQYZMVIFU-SRVKXCTJSA-N Lys-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCCN)N IBQMEXQYZMVIFU-SRVKXCTJSA-N 0.000 description 1
- IWWMPCPLFXFBAF-SRVKXCTJSA-N Lys-Asp-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O IWWMPCPLFXFBAF-SRVKXCTJSA-N 0.000 description 1
- OPTCSTACHGNULU-DCAQKATOSA-N Lys-Cys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CCCCN OPTCSTACHGNULU-DCAQKATOSA-N 0.000 description 1
- DFXQCCBKGUNYGG-GUBZILKMSA-N Lys-Gln-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCCCN DFXQCCBKGUNYGG-GUBZILKMSA-N 0.000 description 1
- MRWXLRGAFDOILG-DCAQKATOSA-N Lys-Gln-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MRWXLRGAFDOILG-DCAQKATOSA-N 0.000 description 1
- NNCDAORZCMPZPX-GUBZILKMSA-N Lys-Gln-Ser Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N NNCDAORZCMPZPX-GUBZILKMSA-N 0.000 description 1
- NDORZBUHCOJQDO-GVXVVHGQSA-N Lys-Gln-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O NDORZBUHCOJQDO-GVXVVHGQSA-N 0.000 description 1
- GJJQCBVRWDGLMQ-GUBZILKMSA-N Lys-Glu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O GJJQCBVRWDGLMQ-GUBZILKMSA-N 0.000 description 1
- CRNNMTHBMRFQNG-GUBZILKMSA-N Lys-Glu-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N CRNNMTHBMRFQNG-GUBZILKMSA-N 0.000 description 1
- GCMWRRQAKQXDED-IUCAKERBSA-N Lys-Glu-Gly Chemical compound [NH3+]CCCC[C@H]([NH3+])C(=O)N[C@@H](CCC([O-])=O)C(=O)NCC([O-])=O GCMWRRQAKQXDED-IUCAKERBSA-N 0.000 description 1
- DCRWPTBMWMGADO-AVGNSLFASA-N Lys-Glu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DCRWPTBMWMGADO-AVGNSLFASA-N 0.000 description 1
- VEGLGAOVLFODGC-GUBZILKMSA-N Lys-Glu-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O VEGLGAOVLFODGC-GUBZILKMSA-N 0.000 description 1
- GQZMPWBZQALKJO-UWVGGRQHSA-N Lys-Gly-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O GQZMPWBZQALKJO-UWVGGRQHSA-N 0.000 description 1
- QZONCCHVHCOBSK-YUMQZZPRSA-N Lys-Gly-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O QZONCCHVHCOBSK-YUMQZZPRSA-N 0.000 description 1
- XNKDCYABMBBEKN-IUCAKERBSA-N Lys-Gly-Gln Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O XNKDCYABMBBEKN-IUCAKERBSA-N 0.000 description 1
- CANPXOLVTMKURR-WEDXCCLWSA-N Lys-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN CANPXOLVTMKURR-WEDXCCLWSA-N 0.000 description 1
- NNKLKUUGESXCBS-KBPBESRZSA-N Lys-Gly-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O NNKLKUUGESXCBS-KBPBESRZSA-N 0.000 description 1
- HAUUXTXKJNVIFY-ONGXEEELSA-N Lys-Gly-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAUUXTXKJNVIFY-ONGXEEELSA-N 0.000 description 1
- KNKJPYAZQUFLQK-IHRRRGAJSA-N Lys-His-Arg Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCCCN)N KNKJPYAZQUFLQK-IHRRRGAJSA-N 0.000 description 1
- SQJSXOQXJYAVRV-SRVKXCTJSA-N Lys-His-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N SQJSXOQXJYAVRV-SRVKXCTJSA-N 0.000 description 1
- GTAXSKOXPIISBW-AVGNSLFASA-N Lys-His-Gln Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N GTAXSKOXPIISBW-AVGNSLFASA-N 0.000 description 1
- VLMNBMFYRMGEMB-QWRGUYRKSA-N Lys-His-Gly Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CNC=N1 VLMNBMFYRMGEMB-QWRGUYRKSA-N 0.000 description 1
- SPCHLZUWJTYZFC-IHRRRGAJSA-N Lys-His-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(O)=O SPCHLZUWJTYZFC-IHRRRGAJSA-N 0.000 description 1
- MXMDJEJWERYPMO-XUXIUFHCSA-N Lys-Ile-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MXMDJEJWERYPMO-XUXIUFHCSA-N 0.000 description 1
- OJDFAABAHBPVTH-MNXVOIDGSA-N Lys-Ile-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O OJDFAABAHBPVTH-MNXVOIDGSA-N 0.000 description 1
- YWJQHDDBFAXNIR-MXAVVETBSA-N Lys-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCCN)N YWJQHDDBFAXNIR-MXAVVETBSA-N 0.000 description 1
- ZXFRGTAIIZHNHG-AJNGGQMLSA-N Lys-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N ZXFRGTAIIZHNHG-AJNGGQMLSA-N 0.000 description 1
- GFWLIJDQILOEPP-HSCHXYMDSA-N Lys-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCCN)N GFWLIJDQILOEPP-HSCHXYMDSA-N 0.000 description 1
- MUXNCRWTWBMNHX-SRVKXCTJSA-N Lys-Leu-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O MUXNCRWTWBMNHX-SRVKXCTJSA-N 0.000 description 1
- PINHPJWGVBKQII-SRVKXCTJSA-N Lys-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCCN)N PINHPJWGVBKQII-SRVKXCTJSA-N 0.000 description 1
- AIRZWUMAHCDDHR-KKUMJFAQSA-N Lys-Leu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O AIRZWUMAHCDDHR-KKUMJFAQSA-N 0.000 description 1
- XIZQPFCRXLUNMK-BZSNNMDCSA-N Lys-Leu-Phe Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCCCN)N XIZQPFCRXLUNMK-BZSNNMDCSA-N 0.000 description 1
- AHFOKDZWPPGJAZ-SRVKXCTJSA-N Lys-Lys-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)O)N AHFOKDZWPPGJAZ-SRVKXCTJSA-N 0.000 description 1
- YUAXTFMFMOIMAM-QWRGUYRKSA-N Lys-Lys-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O YUAXTFMFMOIMAM-QWRGUYRKSA-N 0.000 description 1
- JQSIGLHQNSZZRL-KKUMJFAQSA-N Lys-Lys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N JQSIGLHQNSZZRL-KKUMJFAQSA-N 0.000 description 1
- WBSCNDJQPKSPII-KKUMJFAQSA-N Lys-Lys-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O WBSCNDJQPKSPII-KKUMJFAQSA-N 0.000 description 1
- KJIXWRWPOCKYLD-IHRRRGAJSA-N Lys-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N KJIXWRWPOCKYLD-IHRRRGAJSA-N 0.000 description 1
- BEGQVWUZFXLNHZ-IHPCNDPISA-N Lys-Lys-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN)C(O)=O)=CNC2=C1 BEGQVWUZFXLNHZ-IHPCNDPISA-N 0.000 description 1
- BXPHMHQHYHILBB-BZSNNMDCSA-N Lys-Lys-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BXPHMHQHYHILBB-BZSNNMDCSA-N 0.000 description 1
- INMBONMDMGPADT-AVGNSLFASA-N Lys-Met-Met Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCCCN)N INMBONMDMGPADT-AVGNSLFASA-N 0.000 description 1
- JYVCOTWSRGFABJ-DCAQKATOSA-N Lys-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCCN)N JYVCOTWSRGFABJ-DCAQKATOSA-N 0.000 description 1
- ZZHPLPSLBVBWOA-WDSOQIARSA-N Lys-Met-Trp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCCN)N ZZHPLPSLBVBWOA-WDSOQIARSA-N 0.000 description 1
- KFSALEZVQJYHCE-AVGNSLFASA-N Lys-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCCCN)N KFSALEZVQJYHCE-AVGNSLFASA-N 0.000 description 1
- PIXVFCBYEGPZPA-JYJNAYRXSA-N Lys-Phe-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N PIXVFCBYEGPZPA-JYJNAYRXSA-N 0.000 description 1
- LUTDBHBIHHREDC-IHRRRGAJSA-N Lys-Pro-Lys Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O LUTDBHBIHHREDC-IHRRRGAJSA-N 0.000 description 1
- MGKFCQFVPKOWOL-CIUDSAMLSA-N Lys-Ser-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N MGKFCQFVPKOWOL-CIUDSAMLSA-N 0.000 description 1
- JMNRXRPBHFGXQX-GUBZILKMSA-N Lys-Ser-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JMNRXRPBHFGXQX-GUBZILKMSA-N 0.000 description 1
- IOQWIOPSKJOEKI-SRVKXCTJSA-N Lys-Ser-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IOQWIOPSKJOEKI-SRVKXCTJSA-N 0.000 description 1
- MIFFFXHMAHFACR-KATARQTJSA-N Lys-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CCCCN MIFFFXHMAHFACR-KATARQTJSA-N 0.000 description 1
- ZNAPAUSAUBHENO-IHPCNDPISA-N Lys-Trp-His Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)NC(=O)[C@H](CCCCN)N ZNAPAUSAUBHENO-IHPCNDPISA-N 0.000 description 1
- BIWVMACFGZFIEB-VFAJRCTISA-N Lys-Trp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCCCN)N)O BIWVMACFGZFIEB-VFAJRCTISA-N 0.000 description 1
- PELXPRPDQRFBGQ-KKUMJFAQSA-N Lys-Tyr-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N)O PELXPRPDQRFBGQ-KKUMJFAQSA-N 0.000 description 1
- XATKLFSXFINPSB-JYJNAYRXSA-N Lys-Tyr-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O XATKLFSXFINPSB-JYJNAYRXSA-N 0.000 description 1
- MDDUIRLQCYVRDO-NHCYSSNCSA-N Lys-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN MDDUIRLQCYVRDO-NHCYSSNCSA-N 0.000 description 1
- RPWQJSBMXJSCPD-XUXIUFHCSA-N Lys-Val-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCCN)C(C)C)C(O)=O RPWQJSBMXJSCPD-XUXIUFHCSA-N 0.000 description 1
- DRRXXZBXDMLGFC-IHRRRGAJSA-N Lys-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN DRRXXZBXDMLGFC-IHRRRGAJSA-N 0.000 description 1
- OZVXDDFYCQOPFD-XQQFMLRXSA-N Lys-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N OZVXDDFYCQOPFD-XQQFMLRXSA-N 0.000 description 1
- GUBGYTABKSRVRQ-PICCSMPSSA-N Maltose Natural products O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@@H]1O[C@@H]1[C@@H](CO)OC(O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-PICCSMPSSA-N 0.000 description 1
- DTICLBJHRYSJLH-GUBZILKMSA-N Met-Ala-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O DTICLBJHRYSJLH-GUBZILKMSA-N 0.000 description 1
- BLIPQDLSCFGUFA-GUBZILKMSA-N Met-Arg-Asn Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O BLIPQDLSCFGUFA-GUBZILKMSA-N 0.000 description 1
- IIPHCNKHEZYSNE-DCAQKATOSA-N Met-Arg-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O IIPHCNKHEZYSNE-DCAQKATOSA-N 0.000 description 1
- ACYHZNZHIZWLQF-BQBZGAKWSA-N Met-Asn-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O ACYHZNZHIZWLQF-BQBZGAKWSA-N 0.000 description 1
- IHITVQKJXQQGLJ-LPEHRKFASA-N Met-Asn-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N IHITVQKJXQQGLJ-LPEHRKFASA-N 0.000 description 1
- OSOLWRWQADPDIQ-DCAQKATOSA-N Met-Asp-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O OSOLWRWQADPDIQ-DCAQKATOSA-N 0.000 description 1
- DNDVVILEHVMWIS-LPEHRKFASA-N Met-Asp-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N DNDVVILEHVMWIS-LPEHRKFASA-N 0.000 description 1
- NCVJJAJVWILAGI-SRVKXCTJSA-N Met-Gln-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N NCVJJAJVWILAGI-SRVKXCTJSA-N 0.000 description 1
- RZJOHSFAEZBWLK-CIUDSAMLSA-N Met-Gln-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N RZJOHSFAEZBWLK-CIUDSAMLSA-N 0.000 description 1
- UYAKZHGIPRCGPF-CIUDSAMLSA-N Met-Glu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCSC)N UYAKZHGIPRCGPF-CIUDSAMLSA-N 0.000 description 1
- KQBJYJXPZBNEIK-DCAQKATOSA-N Met-Glu-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCNC(N)=N KQBJYJXPZBNEIK-DCAQKATOSA-N 0.000 description 1
- AETNZPKUUYYYEK-CIUDSAMLSA-N Met-Glu-Asn Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O AETNZPKUUYYYEK-CIUDSAMLSA-N 0.000 description 1
- UKUMISIRZAVYOG-CIUDSAMLSA-N Met-Glu-Cys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(O)=O UKUMISIRZAVYOG-CIUDSAMLSA-N 0.000 description 1
- GPAHWYRSHCKICP-GUBZILKMSA-N Met-Glu-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O GPAHWYRSHCKICP-GUBZILKMSA-N 0.000 description 1
- CHQWUYSNAOABIP-ZPFDUUQYSA-N Met-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCSC)N CHQWUYSNAOABIP-ZPFDUUQYSA-N 0.000 description 1
- SJDQOYTYNGZZJX-SRVKXCTJSA-N Met-Glu-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O SJDQOYTYNGZZJX-SRVKXCTJSA-N 0.000 description 1
- VZBXCMCHIHEPBL-SRVKXCTJSA-N Met-Glu-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN VZBXCMCHIHEPBL-SRVKXCTJSA-N 0.000 description 1
- HLQWFLJOJRFXHO-CIUDSAMLSA-N Met-Glu-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O HLQWFLJOJRFXHO-CIUDSAMLSA-N 0.000 description 1
- MVBZBRKNZVJEKK-DTWKUNHWSA-N Met-Gly-Pro Chemical compound CSCC[C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N MVBZBRKNZVJEKK-DTWKUNHWSA-N 0.000 description 1
- NHDMNXBBSGVYGP-PYJNHQTQSA-N Met-His-Ile Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)CC)C(O)=O)CC1=CN=CN1 NHDMNXBBSGVYGP-PYJNHQTQSA-N 0.000 description 1
- RRIHXWPHQSXHAQ-XUXIUFHCSA-N Met-Ile-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCCN)C(O)=O RRIHXWPHQSXHAQ-XUXIUFHCSA-N 0.000 description 1
- AFVOKRHYSSFPHC-STECZYCISA-N Met-Ile-Tyr Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 AFVOKRHYSSFPHC-STECZYCISA-N 0.000 description 1
- JYPITOUIQVSCKM-IHRRRGAJSA-N Met-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCSC)N JYPITOUIQVSCKM-IHRRRGAJSA-N 0.000 description 1
- BEZJTLKUMFMITF-AVGNSLFASA-N Met-Lys-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CCCNC(N)=N BEZJTLKUMFMITF-AVGNSLFASA-N 0.000 description 1
- UFOWQBYMUILSRK-IHRRRGAJSA-N Met-Lys-His Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 UFOWQBYMUILSRK-IHRRRGAJSA-N 0.000 description 1
- HAQLBBVZAGMESV-IHRRRGAJSA-N Met-Lys-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O HAQLBBVZAGMESV-IHRRRGAJSA-N 0.000 description 1
- VAGCEUUEMMXFEX-GUBZILKMSA-N Met-Met-Asn Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(O)=O VAGCEUUEMMXFEX-GUBZILKMSA-N 0.000 description 1
- JKXVPNCSAMWUEJ-GUBZILKMSA-N Met-Met-Asp Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O JKXVPNCSAMWUEJ-GUBZILKMSA-N 0.000 description 1
- NTYQUVLERIHPMU-HRCADAONSA-N Met-Phe-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N NTYQUVLERIHPMU-HRCADAONSA-N 0.000 description 1
- JQHYVIKEFYETEW-IHRRRGAJSA-N Met-Phe-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=CC=C1 JQHYVIKEFYETEW-IHRRRGAJSA-N 0.000 description 1
- HUURTRNKPBHHKZ-JYJNAYRXSA-N Met-Phe-Val Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 HUURTRNKPBHHKZ-JYJNAYRXSA-N 0.000 description 1
- BJPQKNHZHUCQNQ-SRVKXCTJSA-N Met-Pro-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCSC)N BJPQKNHZHUCQNQ-SRVKXCTJSA-N 0.000 description 1
- WRXOPYNEKGZWAZ-FXQIFTODSA-N Met-Ser-Cys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(O)=O WRXOPYNEKGZWAZ-FXQIFTODSA-N 0.000 description 1
- RMLLCGYYVZKKRT-CIUDSAMLSA-N Met-Ser-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O RMLLCGYYVZKKRT-CIUDSAMLSA-N 0.000 description 1
- LXCSZPUQKMTXNW-BQBZGAKWSA-N Met-Ser-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O LXCSZPUQKMTXNW-BQBZGAKWSA-N 0.000 description 1
- DBMLDOWSVHMQQN-XGEHTFHBSA-N Met-Ser-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DBMLDOWSVHMQQN-XGEHTFHBSA-N 0.000 description 1
- RMLWDZINJUDMEB-IHRRRGAJSA-N Met-Tyr-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N RMLWDZINJUDMEB-IHRRRGAJSA-N 0.000 description 1
- YJNDFEWPGLNLNH-IHRRRGAJSA-N Met-Tyr-Cys Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CS)C(O)=O)CC1=CC=C(O)C=C1 YJNDFEWPGLNLNH-IHRRRGAJSA-N 0.000 description 1
- 101100261636 Methanothermobacter marburgensis (strain ATCC BAA-927 / DSM 2133 / JCM 14651 / NBRC 100331 / OCM 82 / Marburg) trpB2 gene Proteins 0.000 description 1
- PCZOHLXUXFIOCF-UHFFFAOYSA-N Monacolin X Natural products C12C(OC(=O)C(C)CC)CC(C)C=C2C=CC(C)C1CCC1CC(O)CC(=O)O1 PCZOHLXUXFIOCF-UHFFFAOYSA-N 0.000 description 1
- 241001529936 Murinae Species 0.000 description 1
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 1
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 1
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 1
- AUEJLPRZGVVDNU-UHFFFAOYSA-N N-L-tyrosyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CC1=CC=C(O)C=C1 AUEJLPRZGVVDNU-UHFFFAOYSA-N 0.000 description 1
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 1
- 108010066427 N-valyltryptophan Proteins 0.000 description 1
- 108010087066 N2-tryptophyllysine Proteins 0.000 description 1
- 229930193140 Neomycin Natural products 0.000 description 1
- 206010060860 Neurological symptom Diseases 0.000 description 1
- 238000000636 Northern blotting Methods 0.000 description 1
- 108091093105 Nuclear DNA Proteins 0.000 description 1
- 101710163270 Nuclease Proteins 0.000 description 1
- 108091005461 Nucleic proteins Chemical group 0.000 description 1
- 239000004677 Nylon Substances 0.000 description 1
- 108020005187 Oligonucleotide Probes Proteins 0.000 description 1
- 240000007594 Oryza sativa Species 0.000 description 1
- 235000007164 Oryza sativa Nutrition 0.000 description 1
- 238000009004 PCR Kit Methods 0.000 description 1
- 102000057297 Pepsin A Human genes 0.000 description 1
- 108090000284 Pepsin A Proteins 0.000 description 1
- 108010067902 Peptide Library Proteins 0.000 description 1
- CYZBFPYMSJGBRL-DRZSPHRISA-N Phe-Ala-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CYZBFPYMSJGBRL-DRZSPHRISA-N 0.000 description 1
- XWBJLKDCHJVKAK-KKUMJFAQSA-N Phe-Arg-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N XWBJLKDCHJVKAK-KKUMJFAQSA-N 0.000 description 1
- MPGJIHFJCXTVEX-KKUMJFAQSA-N Phe-Arg-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O MPGJIHFJCXTVEX-KKUMJFAQSA-N 0.000 description 1
- BRDYYVQTEJVRQT-HRCADAONSA-N Phe-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O BRDYYVQTEJVRQT-HRCADAONSA-N 0.000 description 1
- ZWJKVFAYPLPCQB-UNQGMJICSA-N Phe-Arg-Thr Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)Cc1ccccc1)C(O)=O ZWJKVFAYPLPCQB-UNQGMJICSA-N 0.000 description 1
- KIAWKQJTSGRCSA-AVGNSLFASA-N Phe-Asn-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KIAWKQJTSGRCSA-AVGNSLFASA-N 0.000 description 1
- AWAYOWOUGVZXOB-BZSNNMDCSA-N Phe-Asn-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 AWAYOWOUGVZXOB-BZSNNMDCSA-N 0.000 description 1
- CSYVXYQDIVCQNU-QWRGUYRKSA-N Phe-Asp-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O CSYVXYQDIVCQNU-QWRGUYRKSA-N 0.000 description 1
- OJUMUUXGSXUZJZ-SRVKXCTJSA-N Phe-Asp-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O OJUMUUXGSXUZJZ-SRVKXCTJSA-N 0.000 description 1
- UEHNWRNADDPYNK-DLOVCJGASA-N Phe-Cys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC1=CC=CC=C1)N UEHNWRNADDPYNK-DLOVCJGASA-N 0.000 description 1
- LXUJDHOKVUYHRC-KKUMJFAQSA-N Phe-Cys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC1=CC=CC=C1)N LXUJDHOKVUYHRC-KKUMJFAQSA-N 0.000 description 1
- PSBJZLMFFTULDX-IXOXFDKPSA-N Phe-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC1=CC=CC=C1)N)O PSBJZLMFFTULDX-IXOXFDKPSA-N 0.000 description 1
- VLZGUAUYZGQKPM-DRZSPHRISA-N Phe-Gln-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O VLZGUAUYZGQKPM-DRZSPHRISA-N 0.000 description 1
- OPEVYHFJXLCCRT-AVGNSLFASA-N Phe-Gln-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O OPEVYHFJXLCCRT-AVGNSLFASA-N 0.000 description 1
- MGBRZXXGQBAULP-DRZSPHRISA-N Phe-Glu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 MGBRZXXGQBAULP-DRZSPHRISA-N 0.000 description 1
- KYYMILWEGJYPQZ-IHRRRGAJSA-N Phe-Glu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 KYYMILWEGJYPQZ-IHRRRGAJSA-N 0.000 description 1
- PSKRILMFHNIUAO-JYJNAYRXSA-N Phe-Glu-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N PSKRILMFHNIUAO-JYJNAYRXSA-N 0.000 description 1
- JJHVFCUWLSKADD-ONGXEEELSA-N Phe-Gly-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](C)C(O)=O JJHVFCUWLSKADD-ONGXEEELSA-N 0.000 description 1
- RFEXGCASCQGGHZ-STQMWFEESA-N Phe-Gly-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O RFEXGCASCQGGHZ-STQMWFEESA-N 0.000 description 1
- JEBWZLWTRPZQRX-QWRGUYRKSA-N Phe-Gly-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O JEBWZLWTRPZQRX-QWRGUYRKSA-N 0.000 description 1
- YYKZDTVQHTUKDW-RYUDHWBXSA-N Phe-Gly-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N YYKZDTVQHTUKDW-RYUDHWBXSA-N 0.000 description 1
- ZLGQEBCCANLYRA-RYUDHWBXSA-N Phe-Gly-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O ZLGQEBCCANLYRA-RYUDHWBXSA-N 0.000 description 1
- OVJMCXAPGFDGMG-HKUYNNGSSA-N Phe-Gly-Trp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O OVJMCXAPGFDGMG-HKUYNNGSSA-N 0.000 description 1
- ZKSLXIGKRJMALF-MGHWNKPDSA-N Phe-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CC=CC=C2)N ZKSLXIGKRJMALF-MGHWNKPDSA-N 0.000 description 1
- MYQCCQSMKNCNKY-KKUMJFAQSA-N Phe-His-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CO)C(=O)O)N MYQCCQSMKNCNKY-KKUMJFAQSA-N 0.000 description 1
- FXPZZKBHNOMLGA-HJWJTTGWSA-N Phe-Ile-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N FXPZZKBHNOMLGA-HJWJTTGWSA-N 0.000 description 1
- GXDPQJUBLBZKDY-IAVJCBSLSA-N Phe-Ile-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O GXDPQJUBLBZKDY-IAVJCBSLSA-N 0.000 description 1
- DVOCGBNHAUHKHJ-DKIMLUQUSA-N Phe-Ile-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O DVOCGBNHAUHKHJ-DKIMLUQUSA-N 0.000 description 1
- WEMYTDDMDBLPMI-DKIMLUQUSA-N Phe-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N WEMYTDDMDBLPMI-DKIMLUQUSA-N 0.000 description 1
- CWFGECHCRMGPPT-MXAVVETBSA-N Phe-Ile-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O CWFGECHCRMGPPT-MXAVVETBSA-N 0.000 description 1
- KDYPMIZMXDECSU-JYJNAYRXSA-N Phe-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 KDYPMIZMXDECSU-JYJNAYRXSA-N 0.000 description 1
- SMFGCTXUBWEPKM-KBPBESRZSA-N Phe-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 SMFGCTXUBWEPKM-KBPBESRZSA-N 0.000 description 1
- YCCUXNNKXDGMAM-KKUMJFAQSA-N Phe-Leu-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YCCUXNNKXDGMAM-KKUMJFAQSA-N 0.000 description 1
- RMKGXGPQIPLTFC-KKUMJFAQSA-N Phe-Lys-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O RMKGXGPQIPLTFC-KKUMJFAQSA-N 0.000 description 1
- MJAYDXWQQUOURZ-JYJNAYRXSA-N Phe-Lys-Gln Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O MJAYDXWQQUOURZ-JYJNAYRXSA-N 0.000 description 1
- WLYPRKLMRIYGPP-JYJNAYRXSA-N Phe-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 WLYPRKLMRIYGPP-JYJNAYRXSA-N 0.000 description 1
- SCKXGHWQPPURGT-KKUMJFAQSA-N Phe-Lys-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O SCKXGHWQPPURGT-KKUMJFAQSA-N 0.000 description 1
- OXKJSGGTHFMGDT-UFYCRDLUSA-N Phe-Phe-Arg Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O)C1=CC=CC=C1 OXKJSGGTHFMGDT-UFYCRDLUSA-N 0.000 description 1
- IWZRODDWOSIXPZ-IRXDYDNUSA-N Phe-Phe-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)NCC(O)=O)C1=CC=CC=C1 IWZRODDWOSIXPZ-IRXDYDNUSA-N 0.000 description 1
- WKLMCMXFMQEKCX-SLFFLAALSA-N Phe-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC3=CC=CC=C3)N)C(=O)O WKLMCMXFMQEKCX-SLFFLAALSA-N 0.000 description 1
- GPLWGAYGROGDEN-BZSNNMDCSA-N Phe-Phe-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O GPLWGAYGROGDEN-BZSNNMDCSA-N 0.000 description 1
- GRVMHFCZUIYNKQ-UFYCRDLUSA-N Phe-Phe-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O GRVMHFCZUIYNKQ-UFYCRDLUSA-N 0.000 description 1
- CZQZSMJXFGGBHM-KKUMJFAQSA-N Phe-Pro-Gln Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O CZQZSMJXFGGBHM-KKUMJFAQSA-N 0.000 description 1
- MMJJFXWMCMJMQA-STQMWFEESA-N Phe-Pro-Gly Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)NCC(O)=O)C1=CC=CC=C1 MMJJFXWMCMJMQA-STQMWFEESA-N 0.000 description 1
- ZJPGOXWRFNKIQL-JYJNAYRXSA-N Phe-Pro-Pro Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N1[C@@H](CCC1)C(O)=O)C1=CC=CC=C1 ZJPGOXWRFNKIQL-JYJNAYRXSA-N 0.000 description 1
- BSJCSHIAMSGQGN-BVSLBCMMSA-N Phe-Pro-Trp Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)N)C(=O)N[C@@H](CC3=CNC4=CC=CC=C43)C(=O)O BSJCSHIAMSGQGN-BVSLBCMMSA-N 0.000 description 1
- ZLAKUZDMKVKFAI-JYJNAYRXSA-N Phe-Pro-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O ZLAKUZDMKVKFAI-JYJNAYRXSA-N 0.000 description 1
- XDMMOISUAHXXFD-SRVKXCTJSA-N Phe-Ser-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O XDMMOISUAHXXFD-SRVKXCTJSA-N 0.000 description 1
- JHSRGEODDALISP-XVSYOHENSA-N Phe-Thr-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O JHSRGEODDALISP-XVSYOHENSA-N 0.000 description 1
- CXMSESHALPOLRE-MEYUZBJRSA-N Phe-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N)O CXMSESHALPOLRE-MEYUZBJRSA-N 0.000 description 1
- GNRMAQSIROFNMI-IXOXFDKPSA-N Phe-Thr-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O GNRMAQSIROFNMI-IXOXFDKPSA-N 0.000 description 1
- SJRQWEDYTKYHHL-SLFFLAALSA-N Phe-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC3=CC=CC=C3)N)C(=O)O SJRQWEDYTKYHHL-SLFFLAALSA-N 0.000 description 1
- MWQXFDIQXIXPMS-UNQGMJICSA-N Phe-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CC=CC=C1)N)O MWQXFDIQXIXPMS-UNQGMJICSA-N 0.000 description 1
- 102000004861 Phosphoric Diester Hydrolases Human genes 0.000 description 1
- 108090001050 Phosphoric Diester Hydrolases Proteins 0.000 description 1
- 101100124346 Photorhabdus laumondii subsp. laumondii (strain DSM 15139 / CIP 105565 / TT01) hisCD gene Proteins 0.000 description 1
- 108010021757 Polynucleotide 5'-Hydroxyl-Kinase Proteins 0.000 description 1
- 102000008422 Polynucleotide 5'-hydroxyl-kinase Human genes 0.000 description 1
- 241000288906 Primates Species 0.000 description 1
- VXCHGLYSIOOZIS-GUBZILKMSA-N Pro-Ala-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 VXCHGLYSIOOZIS-GUBZILKMSA-N 0.000 description 1
- IFMDQWDAJUMMJC-DCAQKATOSA-N Pro-Ala-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O IFMDQWDAJUMMJC-DCAQKATOSA-N 0.000 description 1
- DRVIASBABBMZTF-GUBZILKMSA-N Pro-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@@H]1CCCN1 DRVIASBABBMZTF-GUBZILKMSA-N 0.000 description 1
- QBFONMUYNSNKIX-AVGNSLFASA-N Pro-Arg-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O QBFONMUYNSNKIX-AVGNSLFASA-N 0.000 description 1
- XZGWNSIRZIUHHP-SRVKXCTJSA-N Pro-Arg-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H]1CCCN1 XZGWNSIRZIUHHP-SRVKXCTJSA-N 0.000 description 1
- ICTZKEXYDDZZFP-SRVKXCTJSA-N Pro-Arg-Pro Chemical compound N([C@@H](CCCN=C(N)N)C(=O)N1[C@@H](CCC1)C(O)=O)C(=O)[C@@H]1CCCN1 ICTZKEXYDDZZFP-SRVKXCTJSA-N 0.000 description 1
- KQCCDMFIALWGTL-GUBZILKMSA-N Pro-Asn-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1 KQCCDMFIALWGTL-GUBZILKMSA-N 0.000 description 1
- VOHFZDSRPZLXLH-IHRRRGAJSA-N Pro-Asn-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O VOHFZDSRPZLXLH-IHRRRGAJSA-N 0.000 description 1
- TXPUNZXZDVJUJQ-LPEHRKFASA-N Pro-Asn-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N2CCC[C@@H]2C(=O)O TXPUNZXZDVJUJQ-LPEHRKFASA-N 0.000 description 1
- FUVBEZJCRMHWEM-FXQIFTODSA-N Pro-Asn-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O FUVBEZJCRMHWEM-FXQIFTODSA-N 0.000 description 1
- SGCZFWSQERRKBD-BQBZGAKWSA-N Pro-Asp-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 SGCZFWSQERRKBD-BQBZGAKWSA-N 0.000 description 1
- ZCXQTRXYZOSGJR-FXQIFTODSA-N Pro-Asp-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZCXQTRXYZOSGJR-FXQIFTODSA-N 0.000 description 1
- LCWXSALTPTZKNM-CIUDSAMLSA-N Pro-Cys-Glu Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)O)C(=O)O LCWXSALTPTZKNM-CIUDSAMLSA-N 0.000 description 1
- XJROSHJRQTXWAE-XGEHTFHBSA-N Pro-Cys-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XJROSHJRQTXWAE-XGEHTFHBSA-N 0.000 description 1
- SKICPQLTOXGWGO-GARJFASQSA-N Pro-Gln-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)N)C(=O)N2CCC[C@@H]2C(=O)O SKICPQLTOXGWGO-GARJFASQSA-N 0.000 description 1
- LHALYDBUDCWMDY-CIUDSAMLSA-N Pro-Glu-Ala Chemical compound C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1)C(O)=O LHALYDBUDCWMDY-CIUDSAMLSA-N 0.000 description 1
- MGDFPGCFVJFITQ-CIUDSAMLSA-N Pro-Glu-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MGDFPGCFVJFITQ-CIUDSAMLSA-N 0.000 description 1
- VDGTVWFMRXVQCT-GUBZILKMSA-N Pro-Glu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 VDGTVWFMRXVQCT-GUBZILKMSA-N 0.000 description 1
- NXEYSLRNNPWCRN-SRVKXCTJSA-N Pro-Glu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXEYSLRNNPWCRN-SRVKXCTJSA-N 0.000 description 1
- LXVLKXPFIDDHJG-CIUDSAMLSA-N Pro-Glu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O LXVLKXPFIDDHJG-CIUDSAMLSA-N 0.000 description 1
- ULIWFCCJIOEHMU-BQBZGAKWSA-N Pro-Gly-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 ULIWFCCJIOEHMU-BQBZGAKWSA-N 0.000 description 1
- FEVDNIBDCRKMER-IUCAKERBSA-N Pro-Gly-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)CNC(=O)[C@@H]1CCCN1 FEVDNIBDCRKMER-IUCAKERBSA-N 0.000 description 1
- BFXZQMWKTYWGCF-PYJNHQTQSA-N Pro-His-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BFXZQMWKTYWGCF-PYJNHQTQSA-N 0.000 description 1
- XQHGISDMVBTGAL-ULQDDVLXSA-N Pro-His-Phe Chemical compound C([C@@H](C(=O)[O-])NC(=O)[C@H](CC=1NC=NC=1)NC(=O)[C@H]1[NH2+]CCC1)C1=CC=CC=C1 XQHGISDMVBTGAL-ULQDDVLXSA-N 0.000 description 1
- IBGCFJDLCYTKPW-NAKRPEOUSA-N Pro-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 IBGCFJDLCYTKPW-NAKRPEOUSA-N 0.000 description 1
- BWCZJGJKOFUUCN-ZPFDUUQYSA-N Pro-Ile-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O BWCZJGJKOFUUCN-ZPFDUUQYSA-N 0.000 description 1
- LXLFEIHKWGHJJB-XUXIUFHCSA-N Pro-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 LXLFEIHKWGHJJB-XUXIUFHCSA-N 0.000 description 1
- KLSOMAFWRISSNI-OSUNSFLBSA-N Pro-Ile-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 KLSOMAFWRISSNI-OSUNSFLBSA-N 0.000 description 1
- FMLRRBDLBJLJIK-DCAQKATOSA-N Pro-Leu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FMLRRBDLBJLJIK-DCAQKATOSA-N 0.000 description 1
- YXHYJEPDKSYPSQ-AVGNSLFASA-N Pro-Leu-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 YXHYJEPDKSYPSQ-AVGNSLFASA-N 0.000 description 1
- FXGIMYRVJJEIIM-UWVGGRQHSA-N Pro-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FXGIMYRVJJEIIM-UWVGGRQHSA-N 0.000 description 1
- SRBFGSGDNNQABI-FHWLQOOXSA-N Pro-Leu-Trp Chemical compound N([C@@H](CC(C)C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C(=O)[C@@H]1CCCN1 SRBFGSGDNNQABI-FHWLQOOXSA-N 0.000 description 1
- OFGUOWQVEGTVNU-DCAQKATOSA-N Pro-Lys-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OFGUOWQVEGTVNU-DCAQKATOSA-N 0.000 description 1
- ZLXKLMHAMDENIO-DCAQKATOSA-N Pro-Lys-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLXKLMHAMDENIO-DCAQKATOSA-N 0.000 description 1
- SMFQZMGHCODUPQ-ULQDDVLXSA-N Pro-Lys-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SMFQZMGHCODUPQ-ULQDDVLXSA-N 0.000 description 1
- ULWBBFKQBDNGOY-RWMBFGLXSA-N Pro-Lys-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCCN)C(=O)N2CCC[C@@H]2C(=O)O ULWBBFKQBDNGOY-RWMBFGLXSA-N 0.000 description 1
- WFIVLLFYUZZWOD-RHYQMDGZSA-N Pro-Lys-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WFIVLLFYUZZWOD-RHYQMDGZSA-N 0.000 description 1
- APIAILHCTSBGLU-JYJNAYRXSA-N Pro-Met-Phe Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@@H]2CCCN2 APIAILHCTSBGLU-JYJNAYRXSA-N 0.000 description 1
- ANESFYPBAJPYNJ-SDDRHHMPSA-N Pro-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 ANESFYPBAJPYNJ-SDDRHHMPSA-N 0.000 description 1
- GNADVDLLGVSXLS-ULQDDVLXSA-N Pro-Phe-His Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC=N1)C(O)=O GNADVDLLGVSXLS-ULQDDVLXSA-N 0.000 description 1
- BUEIYHBJHCDAMI-UFYCRDLUSA-N Pro-Phe-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BUEIYHBJHCDAMI-UFYCRDLUSA-N 0.000 description 1
- KDBHVPXBQADZKY-GUBZILKMSA-N Pro-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 KDBHVPXBQADZKY-GUBZILKMSA-N 0.000 description 1
- GFHOSBYCLACKEK-GUBZILKMSA-N Pro-Pro-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O GFHOSBYCLACKEK-GUBZILKMSA-N 0.000 description 1
- CGSOWZUPLOKYOR-AVGNSLFASA-N Pro-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 CGSOWZUPLOKYOR-AVGNSLFASA-N 0.000 description 1
- SBVPYBFMIGDIDX-SRVKXCTJSA-N Pro-Pro-Pro Chemical compound OC(=O)[C@@H]1CCCN1C(=O)[C@H]1N(C(=O)[C@H]2NCCC2)CCC1 SBVPYBFMIGDIDX-SRVKXCTJSA-N 0.000 description 1
- KBUAPZAZPWNYSW-SRVKXCTJSA-N Pro-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 KBUAPZAZPWNYSW-SRVKXCTJSA-N 0.000 description 1
- LNICFEXCAHIJOR-DCAQKATOSA-N Pro-Ser-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LNICFEXCAHIJOR-DCAQKATOSA-N 0.000 description 1
- SNGZLPOXVRTNMB-LPEHRKFASA-N Pro-Ser-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N2CCC[C@@H]2C(=O)O SNGZLPOXVRTNMB-LPEHRKFASA-N 0.000 description 1
- MKGIILKDUGDRRO-FXQIFTODSA-N Pro-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 MKGIILKDUGDRRO-FXQIFTODSA-N 0.000 description 1
- KIDXAAQVMNLJFQ-KZVJFYERSA-N Pro-Thr-Ala Chemical compound C[C@@H](O)[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](C)C(O)=O KIDXAAQVMNLJFQ-KZVJFYERSA-N 0.000 description 1
- IURWWZYKYPEANQ-HJGDQZAQSA-N Pro-Thr-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O IURWWZYKYPEANQ-HJGDQZAQSA-N 0.000 description 1
- GBUNEGKQPSAMNK-QTKMDUPCSA-N Pro-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@@H]2CCCN2)O GBUNEGKQPSAMNK-QTKMDUPCSA-N 0.000 description 1
- GZNYIXWOIUFLGO-ZJDVBMNYSA-N Pro-Thr-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZNYIXWOIUFLGO-ZJDVBMNYSA-N 0.000 description 1
- QKWYXRPICJEQAJ-KJEVXHAQSA-N Pro-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@@H]2CCCN2)O QKWYXRPICJEQAJ-KJEVXHAQSA-N 0.000 description 1
- STGVYUTZKGPRCI-GUBZILKMSA-N Pro-Val-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 STGVYUTZKGPRCI-GUBZILKMSA-N 0.000 description 1
- OQSGBXGNAFQGGS-CYDGBPFRSA-N Pro-Val-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OQSGBXGNAFQGGS-CYDGBPFRSA-N 0.000 description 1
- KHRLUIPIMIQFGT-AVGNSLFASA-N Pro-Val-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KHRLUIPIMIQFGT-AVGNSLFASA-N 0.000 description 1
- ZMLRZBWCXPQADC-TUAOUCFPSA-N Pro-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 ZMLRZBWCXPQADC-TUAOUCFPSA-N 0.000 description 1
- YDTUEBLEAVANFH-RCWTZXSCSA-N Pro-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 YDTUEBLEAVANFH-RCWTZXSCSA-N 0.000 description 1
- 108010029485 Protein Isoforms Proteins 0.000 description 1
- 102000001708 Protein Isoforms Human genes 0.000 description 1
- 108010076504 Protein Sorting Signals Proteins 0.000 description 1
- LCTONWCANYUPML-UHFFFAOYSA-M Pyruvate Chemical compound CC(=O)C([O-])=O LCTONWCANYUPML-UHFFFAOYSA-M 0.000 description 1
- 108010003201 RGH 0205 Proteins 0.000 description 1
- 108091034057 RNA (poly(A)) Proteins 0.000 description 1
- 238000010802 RNA extraction kit Methods 0.000 description 1
- 108700005075 Regulator Genes Proteins 0.000 description 1
- 102000006382 Ribonucleases Human genes 0.000 description 1
- 108010083644 Ribonucleases Proteins 0.000 description 1
- 108091028664 Ribonucleotide Proteins 0.000 description 1
- 108010003581 Ribulose-bisphosphate carboxylase Proteins 0.000 description 1
- 241000283984 Rodentia Species 0.000 description 1
- 241000242677 Schistosoma japonicum Species 0.000 description 1
- 108091081021 Sense strand Proteins 0.000 description 1
- 239000012506 Sephacryl® Substances 0.000 description 1
- 229920005654 Sephadex Polymers 0.000 description 1
- 239000012507 Sephadex™ Substances 0.000 description 1
- BTKUIVBNGBFTTP-WHFBIAKZSA-N Ser-Ala-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)NCC(O)=O BTKUIVBNGBFTTP-WHFBIAKZSA-N 0.000 description 1
- DWUIECHTAMYEFL-XVYDVKMFSA-N Ser-Ala-His Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 DWUIECHTAMYEFL-XVYDVKMFSA-N 0.000 description 1
- WTWGOQRNRFHFQD-JBDRJPRFSA-N Ser-Ala-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WTWGOQRNRFHFQD-JBDRJPRFSA-N 0.000 description 1
- BRKHVZNDAOMAHX-BIIVOSGPSA-N Ser-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N BRKHVZNDAOMAHX-BIIVOSGPSA-N 0.000 description 1
- YQHZVYJAGWMHES-ZLUOBGJFSA-N Ser-Ala-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YQHZVYJAGWMHES-ZLUOBGJFSA-N 0.000 description 1
- HBZBPFLJNDXRAY-FXQIFTODSA-N Ser-Ala-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O HBZBPFLJNDXRAY-FXQIFTODSA-N 0.000 description 1
- QEDMOZUJTGEIBF-FXQIFTODSA-N Ser-Arg-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O QEDMOZUJTGEIBF-FXQIFTODSA-N 0.000 description 1
- VQBLHWSPVYYZTB-DCAQKATOSA-N Ser-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CO)N VQBLHWSPVYYZTB-DCAQKATOSA-N 0.000 description 1
- OBXVZEAMXFSGPU-FXQIFTODSA-N Ser-Asn-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N)CN=C(N)N OBXVZEAMXFSGPU-FXQIFTODSA-N 0.000 description 1
- VAUMZJHYZQXZBQ-WHFBIAKZSA-N Ser-Asn-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O VAUMZJHYZQXZBQ-WHFBIAKZSA-N 0.000 description 1
- OHKLFYXEOGGGCK-ZLUOBGJFSA-N Ser-Asp-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OHKLFYXEOGGGCK-ZLUOBGJFSA-N 0.000 description 1
- MESDJCNHLZBMEP-ZLUOBGJFSA-N Ser-Asp-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MESDJCNHLZBMEP-ZLUOBGJFSA-N 0.000 description 1
- BYIROAKULFFTEK-CIUDSAMLSA-N Ser-Asp-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CO BYIROAKULFFTEK-CIUDSAMLSA-N 0.000 description 1
- DBIDZNUXSLXVRG-FXQIFTODSA-N Ser-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N DBIDZNUXSLXVRG-FXQIFTODSA-N 0.000 description 1
- GHPQVUYZQQGEDA-BIIVOSGPSA-N Ser-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N)C(=O)O GHPQVUYZQQGEDA-BIIVOSGPSA-N 0.000 description 1
- SWSRFJZZMNLMLY-ZKWXMUAHSA-N Ser-Asp-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O SWSRFJZZMNLMLY-ZKWXMUAHSA-N 0.000 description 1
- TUYBIWUZWJUZDD-ACZMJKKPSA-N Ser-Cys-Gln Chemical compound OC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CCC(N)=O TUYBIWUZWJUZDD-ACZMJKKPSA-N 0.000 description 1
- XSYJDGIDKRNWFX-SRVKXCTJSA-N Ser-Cys-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O XSYJDGIDKRNWFX-SRVKXCTJSA-N 0.000 description 1
- CDVFZMOFNJPUDD-ACZMJKKPSA-N Ser-Gln-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CDVFZMOFNJPUDD-ACZMJKKPSA-N 0.000 description 1
- IXUGADGDCQDLSA-FXQIFTODSA-N Ser-Gln-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N IXUGADGDCQDLSA-FXQIFTODSA-N 0.000 description 1
- ZOHGLPQGEHSLPD-FXQIFTODSA-N Ser-Gln-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZOHGLPQGEHSLPD-FXQIFTODSA-N 0.000 description 1
- XWCYBVBLJRWOFR-WDSKDSINSA-N Ser-Gln-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O XWCYBVBLJRWOFR-WDSKDSINSA-N 0.000 description 1
- YPUSXTWURJANKF-KBIXCLLPSA-N Ser-Gln-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YPUSXTWURJANKF-KBIXCLLPSA-N 0.000 description 1
- FMDHKPRACUXATF-ACZMJKKPSA-N Ser-Gln-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O FMDHKPRACUXATF-ACZMJKKPSA-N 0.000 description 1
- KJMOINFQVCCSDX-XKBZYTNZSA-N Ser-Gln-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KJMOINFQVCCSDX-XKBZYTNZSA-N 0.000 description 1
- HVKMTOIAYDOJPL-NRPADANISA-N Ser-Gln-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVKMTOIAYDOJPL-NRPADANISA-N 0.000 description 1
- SMIDBHKWSYUBRZ-ACZMJKKPSA-N Ser-Glu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O SMIDBHKWSYUBRZ-ACZMJKKPSA-N 0.000 description 1
- PVDTYLHUWAEYGY-CIUDSAMLSA-N Ser-Glu-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PVDTYLHUWAEYGY-CIUDSAMLSA-N 0.000 description 1
- SQBLRDDJTUJDMV-ACZMJKKPSA-N Ser-Glu-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O SQBLRDDJTUJDMV-ACZMJKKPSA-N 0.000 description 1
- YQQKYAZABFEYAF-FXQIFTODSA-N Ser-Glu-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O YQQKYAZABFEYAF-FXQIFTODSA-N 0.000 description 1
- BRGQQXQKPUCUJQ-KBIXCLLPSA-N Ser-Glu-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BRGQQXQKPUCUJQ-KBIXCLLPSA-N 0.000 description 1
- VQBCMLMPEWPUTB-ACZMJKKPSA-N Ser-Glu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O VQBCMLMPEWPUTB-ACZMJKKPSA-N 0.000 description 1
- OHKFXGKHSJKKAL-NRPADANISA-N Ser-Glu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OHKFXGKHSJKKAL-NRPADANISA-N 0.000 description 1
- BPMRXBZYPGYPJN-WHFBIAKZSA-N Ser-Gly-Asn Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O BPMRXBZYPGYPJN-WHFBIAKZSA-N 0.000 description 1
- SNVIOQXAHVORQM-WDSKDSINSA-N Ser-Gly-Gln Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O SNVIOQXAHVORQM-WDSKDSINSA-N 0.000 description 1
- YMTLKLXDFCSCNX-BYPYZUCNSA-N Ser-Gly-Gly Chemical compound OC[C@H](N)C(=O)NCC(=O)NCC(O)=O YMTLKLXDFCSCNX-BYPYZUCNSA-N 0.000 description 1
- IOVHBRCQOGWAQH-ZKWXMUAHSA-N Ser-Gly-Ile Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O IOVHBRCQOGWAQH-ZKWXMUAHSA-N 0.000 description 1
- GZFAWAQTEYDKII-YUMQZZPRSA-N Ser-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO GZFAWAQTEYDKII-YUMQZZPRSA-N 0.000 description 1
- WSTIOCFMWXNOCX-YUMQZZPRSA-N Ser-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N WSTIOCFMWXNOCX-YUMQZZPRSA-N 0.000 description 1
- XXXAXOWMBOKTRN-XPUUQOCRSA-N Ser-Gly-Val Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXXAXOWMBOKTRN-XPUUQOCRSA-N 0.000 description 1
- FYUIFUJFNCLUIX-XVYDVKMFSA-N Ser-His-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O FYUIFUJFNCLUIX-XVYDVKMFSA-N 0.000 description 1
- QGAHMVHBORDHDC-YUMQZZPRSA-N Ser-His-Gly Chemical compound OC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CN=CN1 QGAHMVHBORDHDC-YUMQZZPRSA-N 0.000 description 1
- IFPBAGJBHSNYPR-ZKWXMUAHSA-N Ser-Ile-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O IFPBAGJBHSNYPR-ZKWXMUAHSA-N 0.000 description 1
- HBTCFCHYALPXME-HTFCKZLJSA-N Ser-Ile-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HBTCFCHYALPXME-HTFCKZLJSA-N 0.000 description 1
- FUMGHWDRRFCKEP-CIUDSAMLSA-N Ser-Leu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O FUMGHWDRRFCKEP-CIUDSAMLSA-N 0.000 description 1
- GJFYFGOEWLDQGW-GUBZILKMSA-N Ser-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GJFYFGOEWLDQGW-GUBZILKMSA-N 0.000 description 1
- ZIFYDQAFEMIZII-GUBZILKMSA-N Ser-Leu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZIFYDQAFEMIZII-GUBZILKMSA-N 0.000 description 1
- VMLONWHIORGALA-SRVKXCTJSA-N Ser-Leu-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CO VMLONWHIORGALA-SRVKXCTJSA-N 0.000 description 1
- UBRMZSHOOIVJPW-SRVKXCTJSA-N Ser-Leu-Lys Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O UBRMZSHOOIVJPW-SRVKXCTJSA-N 0.000 description 1
- XXNYYSXNXCJYKX-DCAQKATOSA-N Ser-Leu-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O XXNYYSXNXCJYKX-DCAQKATOSA-N 0.000 description 1
- KCGIREHVWRXNDH-GARJFASQSA-N Ser-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N KCGIREHVWRXNDH-GARJFASQSA-N 0.000 description 1
- BYCVMHKULKRVPV-GUBZILKMSA-N Ser-Lys-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O BYCVMHKULKRVPV-GUBZILKMSA-N 0.000 description 1
- UGGWCAFQPKANMW-FXQIFTODSA-N Ser-Met-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O UGGWCAFQPKANMW-FXQIFTODSA-N 0.000 description 1
- NQZFFLBPNDLTPO-DLOVCJGASA-N Ser-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CO)N NQZFFLBPNDLTPO-DLOVCJGASA-N 0.000 description 1
- UPLYXVPQLJVWMM-KKUMJFAQSA-N Ser-Phe-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O UPLYXVPQLJVWMM-KKUMJFAQSA-N 0.000 description 1
- QMCDMHWAKMUGJE-IHRRRGAJSA-N Ser-Phe-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O QMCDMHWAKMUGJE-IHRRRGAJSA-N 0.000 description 1
- ADJDNJCSPNFFPI-FXQIFTODSA-N Ser-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO ADJDNJCSPNFFPI-FXQIFTODSA-N 0.000 description 1
- QPPYAWVLAVXISR-DCAQKATOSA-N Ser-Pro-His Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CO)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O QPPYAWVLAVXISR-DCAQKATOSA-N 0.000 description 1
- NMZXJDSKEGFDLJ-DCAQKATOSA-N Ser-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CO)N)C(=O)N[C@@H](CCCCN)C(=O)O NMZXJDSKEGFDLJ-DCAQKATOSA-N 0.000 description 1
- QUGRFWPMPVIAPW-IHRRRGAJSA-N Ser-Pro-Phe Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QUGRFWPMPVIAPW-IHRRRGAJSA-N 0.000 description 1
- FLONGDPORFIVQW-XGEHTFHBSA-N Ser-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO FLONGDPORFIVQW-XGEHTFHBSA-N 0.000 description 1
- HHJFMHQYEAAOBM-ZLUOBGJFSA-N Ser-Ser-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O HHJFMHQYEAAOBM-ZLUOBGJFSA-N 0.000 description 1
- SRSPTFBENMJHMR-WHFBIAKZSA-N Ser-Ser-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SRSPTFBENMJHMR-WHFBIAKZSA-N 0.000 description 1
- KKKVOZNCLALMPV-XKBZYTNZSA-N Ser-Thr-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O KKKVOZNCLALMPV-XKBZYTNZSA-N 0.000 description 1
- PURRNJBBXDDWLX-ZDLURKLDSA-N Ser-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CO)N)O PURRNJBBXDDWLX-ZDLURKLDSA-N 0.000 description 1
- QNBVFKZSSRYNFX-CUJWVEQBSA-N Ser-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N)O QNBVFKZSSRYNFX-CUJWVEQBSA-N 0.000 description 1
- FLMYSKVSDVHLEW-SVSWQMSJSA-N Ser-Thr-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FLMYSKVSDVHLEW-SVSWQMSJSA-N 0.000 description 1
- PCJLFYBAQZQOFE-KATARQTJSA-N Ser-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N)O PCJLFYBAQZQOFE-KATARQTJSA-N 0.000 description 1
- ATEQEHCGZKBEMU-GQGQLFGLSA-N Ser-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CO)N ATEQEHCGZKBEMU-GQGQLFGLSA-N 0.000 description 1
- HAUVENOGHPECML-BPUTZDHNSA-N Ser-Trp-Val Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)CO)=CNC2=C1 HAUVENOGHPECML-BPUTZDHNSA-N 0.000 description 1
- PIQRHJQWEPWFJG-UWJYBYFXSA-N Ser-Tyr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O PIQRHJQWEPWFJG-UWJYBYFXSA-N 0.000 description 1
- PQEQXWRVHQAAKS-SRVKXCTJSA-N Ser-Tyr-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CO)N)CC1=CC=C(O)C=C1 PQEQXWRVHQAAKS-SRVKXCTJSA-N 0.000 description 1
- VVKVHAOOUGNDPJ-SRVKXCTJSA-N Ser-Tyr-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O VVKVHAOOUGNDPJ-SRVKXCTJSA-N 0.000 description 1
- PCMZJFMUYWIERL-ZKWXMUAHSA-N Ser-Val-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PCMZJFMUYWIERL-ZKWXMUAHSA-N 0.000 description 1
- JGUWRQWULDWNCM-FXQIFTODSA-N Ser-Val-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O JGUWRQWULDWNCM-FXQIFTODSA-N 0.000 description 1
- SIEBDTCABMZCLF-XGEHTFHBSA-N Ser-Val-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SIEBDTCABMZCLF-XGEHTFHBSA-N 0.000 description 1
- LSHUNRICNSEEAN-BPUTZDHNSA-N Ser-Val-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CO)N LSHUNRICNSEEAN-BPUTZDHNSA-N 0.000 description 1
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 1
- 244000000231 Sesamum indicum Species 0.000 description 1
- 241000700584 Simplexvirus Species 0.000 description 1
- 108020004682 Single-Stranded DNA Proteins 0.000 description 1
- VMHLLURERBWHNL-UHFFFAOYSA-M Sodium acetate Chemical compound [Na+].CC([O-])=O VMHLLURERBWHNL-UHFFFAOYSA-M 0.000 description 1
- 229920002125 Sokalan® Polymers 0.000 description 1
- 244000061456 Solanum tuberosum Species 0.000 description 1
- 235000002595 Solanum tuberosum Nutrition 0.000 description 1
- 238000002105 Southern blotting Methods 0.000 description 1
- 241000256251 Spodoptera frugiperda Species 0.000 description 1
- 241000282887 Suidae Species 0.000 description 1
- ZMZDMBWJUHKJPS-UHFFFAOYSA-M Thiocyanate anion Chemical compound [S-]C#N ZMZDMBWJUHKJPS-UHFFFAOYSA-M 0.000 description 1
- RYYWUUFWQRZTIU-UHFFFAOYSA-N Thiophosphoric acid Chemical class OP(O)(S)=O RYYWUUFWQRZTIU-UHFFFAOYSA-N 0.000 description 1
- IGROJMCBGRFRGI-YTLHQDLWSA-N Thr-Ala-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O IGROJMCBGRFRGI-YTLHQDLWSA-N 0.000 description 1
- DWYAUVCQDTZIJI-VZFHVOOUSA-N Thr-Ala-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O DWYAUVCQDTZIJI-VZFHVOOUSA-N 0.000 description 1
- CAGTXGDOIFXLPC-KZVJFYERSA-N Thr-Arg-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CCCN=C(N)N CAGTXGDOIFXLPC-KZVJFYERSA-N 0.000 description 1
- GLQFKOVWXPPFTP-VEVYYDQMSA-N Thr-Arg-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O GLQFKOVWXPPFTP-VEVYYDQMSA-N 0.000 description 1
- VFEHSAJCWWHDBH-RHYQMDGZSA-N Thr-Arg-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O VFEHSAJCWWHDBH-RHYQMDGZSA-N 0.000 description 1
- VOGXLRKCWFLJBY-HSHDSVGOSA-N Thr-Arg-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O VOGXLRKCWFLJBY-HSHDSVGOSA-N 0.000 description 1
- VASYSJHSMSBTDU-LKXGYXEUSA-N Thr-Asn-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N)O VASYSJHSMSBTDU-LKXGYXEUSA-N 0.000 description 1
- QGXCWPNQVCYJEL-NUMRIWBASA-N Thr-Asn-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QGXCWPNQVCYJEL-NUMRIWBASA-N 0.000 description 1
- JBHMLZSKIXMVFS-XVSYOHENSA-N Thr-Asn-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JBHMLZSKIXMVFS-XVSYOHENSA-N 0.000 description 1
- PZVGOVRNGKEFCB-KKHAAJSZSA-N Thr-Asn-Val Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N)O PZVGOVRNGKEFCB-KKHAAJSZSA-N 0.000 description 1
- NLJKZUGAIIRWJN-LKXGYXEUSA-N Thr-Asp-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N)O NLJKZUGAIIRWJN-LKXGYXEUSA-N 0.000 description 1
- GCXFWAZRHBRYEM-NUMRIWBASA-N Thr-Gln-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O GCXFWAZRHBRYEM-NUMRIWBASA-N 0.000 description 1
- GARULAKWZGFIKC-RWRJDSDZSA-N Thr-Gln-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O GARULAKWZGFIKC-RWRJDSDZSA-N 0.000 description 1
- XXNLGZRRSKPSGF-HTUGSXCWSA-N Thr-Gln-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O XXNLGZRRSKPSGF-HTUGSXCWSA-N 0.000 description 1
- KGKWKSSSQGGYAU-SUSMZKCASA-N Thr-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KGKWKSSSQGGYAU-SUSMZKCASA-N 0.000 description 1
- VOHWDZNIESHTFW-XKBZYTNZSA-N Thr-Glu-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N)O VOHWDZNIESHTFW-XKBZYTNZSA-N 0.000 description 1
- JMGJDTNUMAZNLX-RWRJDSDZSA-N Thr-Glu-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JMGJDTNUMAZNLX-RWRJDSDZSA-N 0.000 description 1
- HJOSVGCWOTYJFG-WDCWCFNPSA-N Thr-Glu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O HJOSVGCWOTYJFG-WDCWCFNPSA-N 0.000 description 1
- SLUWOCTZVGMURC-BFHQHQDPSA-N Thr-Gly-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O SLUWOCTZVGMURC-BFHQHQDPSA-N 0.000 description 1
- VYEHBMMAJFVTOI-JHEQGTHGSA-N Thr-Gly-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O VYEHBMMAJFVTOI-JHEQGTHGSA-N 0.000 description 1
- IMULJHHGAUZZFE-MBLNEYKQSA-N Thr-Gly-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O IMULJHHGAUZZFE-MBLNEYKQSA-N 0.000 description 1
- DDDLIMCZFKOERC-SVSWQMSJSA-N Thr-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N DDDLIMCZFKOERC-SVSWQMSJSA-N 0.000 description 1
- URPSJRMWHQTARR-MBLNEYKQSA-N Thr-Ile-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O URPSJRMWHQTARR-MBLNEYKQSA-N 0.000 description 1
- AHOLTQCAVBSUDP-PPCPHDFISA-N Thr-Ile-Lys Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)[C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O AHOLTQCAVBSUDP-PPCPHDFISA-N 0.000 description 1
- UYTYTDMCDBPDSC-URLPEUOOSA-N Thr-Ile-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N UYTYTDMCDBPDSC-URLPEUOOSA-N 0.000 description 1
- FQPDRTDDEZXCEC-SVSWQMSJSA-N Thr-Ile-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O FQPDRTDDEZXCEC-SVSWQMSJSA-N 0.000 description 1
- AMXMBCAXAZUCFA-RHYQMDGZSA-N Thr-Leu-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AMXMBCAXAZUCFA-RHYQMDGZSA-N 0.000 description 1
- IMDMLDSVUSMAEJ-HJGDQZAQSA-N Thr-Leu-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IMDMLDSVUSMAEJ-HJGDQZAQSA-N 0.000 description 1
- RRRRCRYTLZVCEN-HJGDQZAQSA-N Thr-Leu-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O RRRRCRYTLZVCEN-HJGDQZAQSA-N 0.000 description 1
- VTVVYQOXJCZVEB-WDCWCFNPSA-N Thr-Leu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O VTVVYQOXJCZVEB-WDCWCFNPSA-N 0.000 description 1
- VRUFCJZQDACGLH-UVOCVTCTSA-N Thr-Leu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VRUFCJZQDACGLH-UVOCVTCTSA-N 0.000 description 1
- IJVNLNRVDUTWDD-MEYUZBJRSA-N Thr-Leu-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IJVNLNRVDUTWDD-MEYUZBJRSA-N 0.000 description 1
- KZSYAEWQMJEGRZ-RHYQMDGZSA-N Thr-Leu-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O KZSYAEWQMJEGRZ-RHYQMDGZSA-N 0.000 description 1
- KKPOGALELPLJTL-MEYUZBJRSA-N Thr-Lys-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KKPOGALELPLJTL-MEYUZBJRSA-N 0.000 description 1
- QHUWWSQZTFLXPQ-FJXKBIBVSA-N Thr-Met-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O QHUWWSQZTFLXPQ-FJXKBIBVSA-N 0.000 description 1
- SIEZEMFJLYRUMK-YTWAJWBKSA-N Thr-Met-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N)O SIEZEMFJLYRUMK-YTWAJWBKSA-N 0.000 description 1
- WYLAVUAWOUVUCA-XVSYOHENSA-N Thr-Phe-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O WYLAVUAWOUVUCA-XVSYOHENSA-N 0.000 description 1
- BIBYEFRASCNLAA-CDMKHQONSA-N Thr-Phe-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 BIBYEFRASCNLAA-CDMKHQONSA-N 0.000 description 1
- GYUUYCIXELGTJS-MEYUZBJRSA-N Thr-Phe-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O GYUUYCIXELGTJS-MEYUZBJRSA-N 0.000 description 1
- WTMPKZWHRCMMMT-KZVJFYERSA-N Thr-Pro-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WTMPKZWHRCMMMT-KZVJFYERSA-N 0.000 description 1
- MXDOAJQRJBMGMO-FJXKBIBVSA-N Thr-Pro-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O MXDOAJQRJBMGMO-FJXKBIBVSA-N 0.000 description 1
- DEGCBBCMYWNJNA-RHYQMDGZSA-N Thr-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O DEGCBBCMYWNJNA-RHYQMDGZSA-N 0.000 description 1
- GVMXJJAJLIEASL-ZJDVBMNYSA-N Thr-Pro-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O GVMXJJAJLIEASL-ZJDVBMNYSA-N 0.000 description 1
- STUAPCLEDMKXKL-LKXGYXEUSA-N Thr-Ser-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O STUAPCLEDMKXKL-LKXGYXEUSA-N 0.000 description 1
- WKGAAMOJPMBBMC-IXOXFDKPSA-N Thr-Ser-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WKGAAMOJPMBBMC-IXOXFDKPSA-N 0.000 description 1
- WPSKTVVMQCXPRO-BWBBJGPYSA-N Thr-Ser-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WPSKTVVMQCXPRO-BWBBJGPYSA-N 0.000 description 1
- IEZVHOULSUULHD-XGEHTFHBSA-N Thr-Ser-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O IEZVHOULSUULHD-XGEHTFHBSA-N 0.000 description 1
- VBMOVTMNHWPZJR-SUSMZKCASA-N Thr-Thr-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VBMOVTMNHWPZJR-SUSMZKCASA-N 0.000 description 1
- VGNLMPBYWWNQFS-ZEILLAHLSA-N Thr-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O VGNLMPBYWWNQFS-ZEILLAHLSA-N 0.000 description 1
- BBPCSGKKPJUYRB-UVOCVTCTSA-N Thr-Thr-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O BBPCSGKKPJUYRB-UVOCVTCTSA-N 0.000 description 1
- COYHRQWNJDJCNA-NUJDXYNKSA-N Thr-Thr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O COYHRQWNJDJCNA-NUJDXYNKSA-N 0.000 description 1
- GRIUMVXCJDKVPI-IZPVPAKOSA-N Thr-Thr-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O GRIUMVXCJDKVPI-IZPVPAKOSA-N 0.000 description 1
- LECUEEHKUFYOOV-ZJDVBMNYSA-N Thr-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H](N)[C@@H](C)O LECUEEHKUFYOOV-ZJDVBMNYSA-N 0.000 description 1
- SOUPNXUJAJENFU-SWRJLBSHSA-N Thr-Trp-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O SOUPNXUJAJENFU-SWRJLBSHSA-N 0.000 description 1
- KAJRRNHOVMZYBL-IRIUXVKKSA-N Thr-Tyr-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O KAJRRNHOVMZYBL-IRIUXVKKSA-N 0.000 description 1
- CYCGARJWIQWPQM-YJRXYDGGSA-N Thr-Tyr-Ser Chemical compound C[C@@H](O)[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CO)C([O-])=O)CC1=CC=C(O)C=C1 CYCGARJWIQWPQM-YJRXYDGGSA-N 0.000 description 1
- OGOYMQWIWHGTGH-KZVJFYERSA-N Thr-Val-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O OGOYMQWIWHGTGH-KZVJFYERSA-N 0.000 description 1
- SPIFGZFZMVLPHN-UNQGMJICSA-N Thr-Val-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SPIFGZFZMVLPHN-UNQGMJICSA-N 0.000 description 1
- MNYNCKZAEIAONY-XGEHTFHBSA-N Thr-Val-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O MNYNCKZAEIAONY-XGEHTFHBSA-N 0.000 description 1
- 108091036066 Three prime untranslated region Proteins 0.000 description 1
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 description 1
- 239000004473 Threonine Substances 0.000 description 1
- 208000007536 Thrombosis Diseases 0.000 description 1
- 102000006601 Thymidine Kinase Human genes 0.000 description 1
- 108020004440 Thymidine kinase Proteins 0.000 description 1
- 102000009843 Thyroglobulin Human genes 0.000 description 1
- 229920001615 Tragacanth Polymers 0.000 description 1
- 108700009124 Transcription Initiation Site Proteins 0.000 description 1
- AVYVKJMBNLPWRX-WFBYXXMGSA-N Trp-Ala-Ser Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 AVYVKJMBNLPWRX-WFBYXXMGSA-N 0.000 description 1
- HYNAKPYFEYJMAS-XIRDDKMYSA-N Trp-Arg-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HYNAKPYFEYJMAS-XIRDDKMYSA-N 0.000 description 1
- TWJDQTTXXZDJKV-BPUTZDHNSA-N Trp-Arg-Ser Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O TWJDQTTXXZDJKV-BPUTZDHNSA-N 0.000 description 1
- ADBFWLXCCKIXBQ-XIRDDKMYSA-N Trp-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N ADBFWLXCCKIXBQ-XIRDDKMYSA-N 0.000 description 1
- IUFQHOCOKQIOMC-XIRDDKMYSA-N Trp-Asn-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N IUFQHOCOKQIOMC-XIRDDKMYSA-N 0.000 description 1
- HJTYJQVRIQXMHM-XIRDDKMYSA-N Trp-Asp-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N HJTYJQVRIQXMHM-XIRDDKMYSA-N 0.000 description 1
- DNUJCLUFRGGSDJ-YLVFBTJISA-N Trp-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC1=CNC2=CC=CC=C21)N DNUJCLUFRGGSDJ-YLVFBTJISA-N 0.000 description 1
- RXEQOXHCHQJMSO-IHPCNDPISA-N Trp-His-Leu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O RXEQOXHCHQJMSO-IHPCNDPISA-N 0.000 description 1
- WNZRNOGHEONFMS-PXDAIIFMSA-N Trp-Ile-Tyr Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O WNZRNOGHEONFMS-PXDAIIFMSA-N 0.000 description 1
- MEZCXKYMMQJRDE-PMVMPFDFSA-N Trp-Leu-Tyr Chemical compound C([C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)CC(C)C)C(O)=O)C1=CC=C(O)C=C1 MEZCXKYMMQJRDE-PMVMPFDFSA-N 0.000 description 1
- WKQNLTQSCYXKQK-VFAJRCTISA-N Trp-Lys-Thr Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WKQNLTQSCYXKQK-VFAJRCTISA-N 0.000 description 1
- UMIACFRBELJMGT-GQGQLFGLSA-N Trp-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N UMIACFRBELJMGT-GQGQLFGLSA-N 0.000 description 1
- GNCPKOZDOCQRAF-BPUTZDHNSA-N Trp-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N GNCPKOZDOCQRAF-BPUTZDHNSA-N 0.000 description 1
- GSCPHMSPGQSZJT-JYBASQMISA-N Trp-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O GSCPHMSPGQSZJT-JYBASQMISA-N 0.000 description 1
- SWSUXOKZKQRADK-FDARSICLSA-N Trp-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N SWSUXOKZKQRADK-FDARSICLSA-N 0.000 description 1
- BABINGWMZBWXIX-BPUTZDHNSA-N Trp-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N BABINGWMZBWXIX-BPUTZDHNSA-N 0.000 description 1
- IELISNUVHBKYBX-XDTLVQLUSA-N Tyr-Ala-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 IELISNUVHBKYBX-XDTLVQLUSA-N 0.000 description 1
- LGEYOIQBBIPHQN-UWJYBYFXSA-N Tyr-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 LGEYOIQBBIPHQN-UWJYBYFXSA-N 0.000 description 1
- GFHYISDTIWZUSU-QWRGUYRKSA-N Tyr-Asn-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GFHYISDTIWZUSU-QWRGUYRKSA-N 0.000 description 1
- XMNDQSYABVWZRK-BZSNNMDCSA-N Tyr-Asn-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O XMNDQSYABVWZRK-BZSNNMDCSA-N 0.000 description 1
- BARBHMSSVWPKPZ-IHRRRGAJSA-N Tyr-Asp-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O BARBHMSSVWPKPZ-IHRRRGAJSA-N 0.000 description 1
- YGKVNUAKYPGORG-AVGNSLFASA-N Tyr-Asp-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YGKVNUAKYPGORG-AVGNSLFASA-N 0.000 description 1
- VFJIWSJKZJTQII-SRVKXCTJSA-N Tyr-Asp-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O VFJIWSJKZJTQII-SRVKXCTJSA-N 0.000 description 1
- FGJWNBBFAUHBEP-IHPCNDPISA-N Tyr-Asp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N FGJWNBBFAUHBEP-IHPCNDPISA-N 0.000 description 1
- XBWKCYFGRXKWGO-SRVKXCTJSA-N Tyr-Cys-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(O)=O XBWKCYFGRXKWGO-SRVKXCTJSA-N 0.000 description 1
- XKDOQXAXKFQWQJ-SRVKXCTJSA-N Tyr-Cys-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O XKDOQXAXKFQWQJ-SRVKXCTJSA-N 0.000 description 1
- FQNUWOHNGJWNLM-QWRGUYRKSA-N Tyr-Cys-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CS)C(=O)NCC(O)=O FQNUWOHNGJWNLM-QWRGUYRKSA-N 0.000 description 1
- FFCRCJZJARTYCG-KKUMJFAQSA-N Tyr-Cys-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N)O FFCRCJZJARTYCG-KKUMJFAQSA-N 0.000 description 1
- UXUFNBVCPAWACG-SIUGBPQLSA-N Tyr-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N UXUFNBVCPAWACG-SIUGBPQLSA-N 0.000 description 1
- TWAVEIJGFCBWCG-JYJNAYRXSA-N Tyr-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N TWAVEIJGFCBWCG-JYJNAYRXSA-N 0.000 description 1
- WZQZUVWEPMGIMM-JYJNAYRXSA-N Tyr-Gln-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O WZQZUVWEPMGIMM-JYJNAYRXSA-N 0.000 description 1
- HKYTWJOWZTWBQB-AVGNSLFASA-N Tyr-Glu-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 HKYTWJOWZTWBQB-AVGNSLFASA-N 0.000 description 1
- SLCSPPCQWUHPPO-JYJNAYRXSA-N Tyr-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 SLCSPPCQWUHPPO-JYJNAYRXSA-N 0.000 description 1
- CDHQEOXPWBDFPL-QWRGUYRKSA-N Tyr-Gly-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CDHQEOXPWBDFPL-QWRGUYRKSA-N 0.000 description 1
- HIINQLBHPIQYHN-JTQLQIEISA-N Tyr-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 HIINQLBHPIQYHN-JTQLQIEISA-N 0.000 description 1
- QAYSODICXVZUIA-WLTAIBSBSA-N Tyr-Gly-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O QAYSODICXVZUIA-WLTAIBSBSA-N 0.000 description 1
- OHNXAUCZVWGTLL-KKUMJFAQSA-N Tyr-His-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CS)C(=O)O)N)O OHNXAUCZVWGTLL-KKUMJFAQSA-N 0.000 description 1
- OSXNCKRGMSHWSQ-ACRUOGEOSA-N Tyr-His-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OSXNCKRGMSHWSQ-ACRUOGEOSA-N 0.000 description 1
- PJWCWGXAVIVXQC-STECZYCISA-N Tyr-Ile-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 PJWCWGXAVIVXQC-STECZYCISA-N 0.000 description 1
- WSFXJLFSJSXGMQ-MGHWNKPDSA-N Tyr-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N WSFXJLFSJSXGMQ-MGHWNKPDSA-N 0.000 description 1
- LQGDFDYGDQEMGA-PXDAIIFMSA-N Tyr-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N LQGDFDYGDQEMGA-PXDAIIFMSA-N 0.000 description 1
- GULIUBBXCYPDJU-CQDKDKBSSA-N Tyr-Leu-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CC1=CC=C(O)C=C1 GULIUBBXCYPDJU-CQDKDKBSSA-N 0.000 description 1
- BSCBBPKDVOZICB-KKUMJFAQSA-N Tyr-Leu-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BSCBBPKDVOZICB-KKUMJFAQSA-N 0.000 description 1
- AVIQBBOOTZENLH-KKUMJFAQSA-N Tyr-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N AVIQBBOOTZENLH-KKUMJFAQSA-N 0.000 description 1
- KSCVLGXNQXKUAR-JYJNAYRXSA-N Tyr-Leu-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O KSCVLGXNQXKUAR-JYJNAYRXSA-N 0.000 description 1
- QHLIUFUEUDFAOT-MGHWNKPDSA-N Tyr-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QHLIUFUEUDFAOT-MGHWNKPDSA-N 0.000 description 1
- ZOBLBMGJKVJVEV-BZSNNMDCSA-N Tyr-Lys-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N)O ZOBLBMGJKVJVEV-BZSNNMDCSA-N 0.000 description 1
- BBSPTGPYIPGTKH-JYJNAYRXSA-N Tyr-Met-Arg Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N BBSPTGPYIPGTKH-JYJNAYRXSA-N 0.000 description 1
- XDGPTBVOSHKDFT-KKUMJFAQSA-N Tyr-Met-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O XDGPTBVOSHKDFT-KKUMJFAQSA-N 0.000 description 1
- KHUVIWRRFMPVHD-JYJNAYRXSA-N Tyr-Met-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O KHUVIWRRFMPVHD-JYJNAYRXSA-N 0.000 description 1
- VXFXIBCCVLJCJT-JYJNAYRXSA-N Tyr-Pro-Pro Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N1CCC[C@H]1C(O)=O VXFXIBCCVLJCJT-JYJNAYRXSA-N 0.000 description 1
- KWKJGBHDYJOVCR-SRVKXCTJSA-N Tyr-Ser-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N)O KWKJGBHDYJOVCR-SRVKXCTJSA-N 0.000 description 1
- HRHYJNLMIJWGLF-BZSNNMDCSA-N Tyr-Ser-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 HRHYJNLMIJWGLF-BZSNNMDCSA-N 0.000 description 1
- UMSZZGTXGKHTFJ-SRVKXCTJSA-N Tyr-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 UMSZZGTXGKHTFJ-SRVKXCTJSA-N 0.000 description 1
- MDXLPNRXCFOBTL-BZSNNMDCSA-N Tyr-Ser-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MDXLPNRXCFOBTL-BZSNNMDCSA-N 0.000 description 1
- TYFLVOUZHQUBGM-IHRRRGAJSA-N Tyr-Ser-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 TYFLVOUZHQUBGM-IHRRRGAJSA-N 0.000 description 1
- VSYROIRKNBCULO-BWAGICSOSA-N Tyr-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)O VSYROIRKNBCULO-BWAGICSOSA-N 0.000 description 1
- CLEGSEJVGBYZBJ-MEYUZBJRSA-N Tyr-Thr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CLEGSEJVGBYZBJ-MEYUZBJRSA-N 0.000 description 1
- AOIZTZRWMSPPAY-KAOXEZKKSA-N Tyr-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)O AOIZTZRWMSPPAY-KAOXEZKKSA-N 0.000 description 1
- NAHUCETZGZZSEX-IHPCNDPISA-N Tyr-Trp-Asp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N NAHUCETZGZZSEX-IHPCNDPISA-N 0.000 description 1
- GPLTZEMVOCZVAV-UFYCRDLUSA-N Tyr-Tyr-Arg Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)C1=CC=C(O)C=C1 GPLTZEMVOCZVAV-UFYCRDLUSA-N 0.000 description 1
- OJCISMMNNUNNJA-BZSNNMDCSA-N Tyr-Tyr-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=C(O)C=C1 OJCISMMNNUNNJA-BZSNNMDCSA-N 0.000 description 1
- JQOMHZMWQHXALX-FHWLQOOXSA-N Tyr-Tyr-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O JQOMHZMWQHXALX-FHWLQOOXSA-N 0.000 description 1
- AFWXOGHZEKARFH-ACRUOGEOSA-N Tyr-Tyr-His Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CC=C(O)C=C1 AFWXOGHZEKARFH-ACRUOGEOSA-N 0.000 description 1
- KLOZTPOXVVRVAQ-DZKIICNBSA-N Tyr-Val-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 KLOZTPOXVVRVAQ-DZKIICNBSA-N 0.000 description 1
- DDRBQONWVBDQOY-GUBZILKMSA-N Val-Ala-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O DDRBQONWVBDQOY-GUBZILKMSA-N 0.000 description 1
- FZSPNKUFROZBSG-ZKWXMUAHSA-N Val-Ala-Asp Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O FZSPNKUFROZBSG-ZKWXMUAHSA-N 0.000 description 1
- IZFVRRYRMQFVGX-NRPADANISA-N Val-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N IZFVRRYRMQFVGX-NRPADANISA-N 0.000 description 1
- JFAWZADYPRMRCO-UBHSHLNASA-N Val-Ala-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JFAWZADYPRMRCO-UBHSHLNASA-N 0.000 description 1
- ZLFHAAGHGQBQQN-AEJSXWLSSA-N Val-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZLFHAAGHGQBQQN-AEJSXWLSSA-N 0.000 description 1
- ZLFHAAGHGQBQQN-GUBZILKMSA-N Val-Ala-Pro Natural products CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O ZLFHAAGHGQBQQN-GUBZILKMSA-N 0.000 description 1
- PAPWZOJOLKZEFR-AVGNSLFASA-N Val-Arg-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N PAPWZOJOLKZEFR-AVGNSLFASA-N 0.000 description 1
- VMRFIKXKOFNMHW-GUBZILKMSA-N Val-Arg-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N VMRFIKXKOFNMHW-GUBZILKMSA-N 0.000 description 1
- NWDOPHYLSORNEX-QXEWZRGKSA-N Val-Asn-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCSC)C(=O)O)N NWDOPHYLSORNEX-QXEWZRGKSA-N 0.000 description 1
- CGGVNFJRZJUVAE-BYULHYEWSA-N Val-Asp-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CGGVNFJRZJUVAE-BYULHYEWSA-N 0.000 description 1
- TZVUSFMQWPWHON-NHCYSSNCSA-N Val-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N TZVUSFMQWPWHON-NHCYSSNCSA-N 0.000 description 1
- BMGOFDMKDVVGJG-NHCYSSNCSA-N Val-Asp-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BMGOFDMKDVVGJG-NHCYSSNCSA-N 0.000 description 1
- HIZMLPKDJAXDRG-FXQIFTODSA-N Val-Cys-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O)N HIZMLPKDJAXDRG-FXQIFTODSA-N 0.000 description 1
- JXGWQYWDUOWQHA-DZKIICNBSA-N Val-Gln-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N JXGWQYWDUOWQHA-DZKIICNBSA-N 0.000 description 1
- AHHJARQXFFGOKF-NRPADANISA-N Val-Glu-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N AHHJARQXFFGOKF-NRPADANISA-N 0.000 description 1
- SZTTYWIUCGSURQ-AUTRQRHGSA-N Val-Glu-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SZTTYWIUCGSURQ-AUTRQRHGSA-N 0.000 description 1
- YDPFWRVQHFWBKI-GVXVVHGQSA-N Val-Glu-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N YDPFWRVQHFWBKI-GVXVVHGQSA-N 0.000 description 1
- ROLGIBMFNMZANA-GVXVVHGQSA-N Val-Glu-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N ROLGIBMFNMZANA-GVXVVHGQSA-N 0.000 description 1
- XWYUBUYQMOUFRQ-IFFSRLJSSA-N Val-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N)O XWYUBUYQMOUFRQ-IFFSRLJSSA-N 0.000 description 1
- CELJCNRXKZPTCX-XPUUQOCRSA-N Val-Gly-Ala Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O CELJCNRXKZPTCX-XPUUQOCRSA-N 0.000 description 1
- URIRWLJVWHYLET-ONGXEEELSA-N Val-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C URIRWLJVWHYLET-ONGXEEELSA-N 0.000 description 1
- MDYSKHBSPXUOPV-JSGCOSHPSA-N Val-Gly-Phe Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N MDYSKHBSPXUOPV-JSGCOSHPSA-N 0.000 description 1
- YTPLVNUZZOBFFC-SCZZXKLOSA-N Val-Gly-Pro Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N1CCC[C@@H]1C(O)=O YTPLVNUZZOBFFC-SCZZXKLOSA-N 0.000 description 1
- XXROXFHCMVXETG-UWVGGRQHSA-N Val-Gly-Val Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXROXFHCMVXETG-UWVGGRQHSA-N 0.000 description 1
- APEBUJBRGCMMHP-HJWJTTGWSA-N Val-Ile-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 APEBUJBRGCMMHP-HJWJTTGWSA-N 0.000 description 1
- FEXILLGKGGTLRI-NHCYSSNCSA-N Val-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N FEXILLGKGGTLRI-NHCYSSNCSA-N 0.000 description 1
- AGXGCFSECFQMKB-NHCYSSNCSA-N Val-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N AGXGCFSECFQMKB-NHCYSSNCSA-N 0.000 description 1
- LYERIXUFCYVFFX-GVXVVHGQSA-N Val-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LYERIXUFCYVFFX-GVXVVHGQSA-N 0.000 description 1
- XTDDIVQWDXMRJL-IHRRRGAJSA-N Val-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C(C)C)N XTDDIVQWDXMRJL-IHRRRGAJSA-N 0.000 description 1
- DAVNYIUELQBTAP-XUXIUFHCSA-N Val-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N DAVNYIUELQBTAP-XUXIUFHCSA-N 0.000 description 1
- AEMPCGRFEZTWIF-IHRRRGAJSA-N Val-Leu-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O AEMPCGRFEZTWIF-IHRRRGAJSA-N 0.000 description 1
- ZZGPVSZDZQRJQY-ULQDDVLXSA-N Val-Leu-Phe Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](Cc1ccccc1)C(O)=O ZZGPVSZDZQRJQY-ULQDDVLXSA-N 0.000 description 1
- ZHQWPWQNVRCXAX-XQQFMLRXSA-N Val-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZHQWPWQNVRCXAX-XQQFMLRXSA-N 0.000 description 1
- SYSWVVCYSXBVJG-RHYQMDGZSA-N Val-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N)O SYSWVVCYSXBVJG-RHYQMDGZSA-N 0.000 description 1
- XXWBHOWRARMUOC-NHCYSSNCSA-N Val-Lys-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)N)C(=O)O)N XXWBHOWRARMUOC-NHCYSSNCSA-N 0.000 description 1
- XPKCFQZDQGVJCX-RHYQMDGZSA-N Val-Lys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N)O XPKCFQZDQGVJCX-RHYQMDGZSA-N 0.000 description 1
- OJPRSVJGNCAKQX-SRVKXCTJSA-N Val-Met-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N OJPRSVJGNCAKQX-SRVKXCTJSA-N 0.000 description 1
- MHHAWNPHDLCPLF-ULQDDVLXSA-N Val-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=CC=C1 MHHAWNPHDLCPLF-ULQDDVLXSA-N 0.000 description 1
- XBJKAZATRJBDCU-GUBZILKMSA-N Val-Pro-Ala Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O XBJKAZATRJBDCU-GUBZILKMSA-N 0.000 description 1
- USLVEJAHTBLSIL-CYDGBPFRSA-N Val-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C USLVEJAHTBLSIL-CYDGBPFRSA-N 0.000 description 1
- NHXZRXLFOBFMDM-AVGNSLFASA-N Val-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C NHXZRXLFOBFMDM-AVGNSLFASA-N 0.000 description 1
- WANVRBAZGSICCP-SRVKXCTJSA-N Val-Pro-Met Chemical compound CSCC[C@H](NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C)C(O)=O WANVRBAZGSICCP-SRVKXCTJSA-N 0.000 description 1
- QIVPZSWBBHRNBA-JYJNAYRXSA-N Val-Pro-Phe Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccccc1)C(O)=O QIVPZSWBBHRNBA-JYJNAYRXSA-N 0.000 description 1
- DEGUERSKQBRZMZ-FXQIFTODSA-N Val-Ser-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DEGUERSKQBRZMZ-FXQIFTODSA-N 0.000 description 1
- AJNUKMZFHXUBMK-GUBZILKMSA-N Val-Ser-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N AJNUKMZFHXUBMK-GUBZILKMSA-N 0.000 description 1
- KRAHMIJVUPUOTQ-DCAQKATOSA-N Val-Ser-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N KRAHMIJVUPUOTQ-DCAQKATOSA-N 0.000 description 1
- PZTZYZUTCPZWJH-FXQIFTODSA-N Val-Ser-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PZTZYZUTCPZWJH-FXQIFTODSA-N 0.000 description 1
- HWNYVQMOLCYHEA-IHRRRGAJSA-N Val-Ser-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N HWNYVQMOLCYHEA-IHRRRGAJSA-N 0.000 description 1
- CEKSLIVSNNGOKH-KZVJFYERSA-N Val-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](C(C)C)N)O CEKSLIVSNNGOKH-KZVJFYERSA-N 0.000 description 1
- DLRZGNXCXUGIDG-KKHAAJSZSA-N Val-Thr-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O DLRZGNXCXUGIDG-KKHAAJSZSA-N 0.000 description 1
- LCHZBEUVGAVMKS-RHYQMDGZSA-N Val-Thr-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)[C@@H](C)O)C(O)=O LCHZBEUVGAVMKS-RHYQMDGZSA-N 0.000 description 1
- PDDJTOSAVNRJRH-UNQGMJICSA-N Val-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](C(C)C)N)O PDDJTOSAVNRJRH-UNQGMJICSA-N 0.000 description 1
- OFTXTCGQJXTNQS-XGEHTFHBSA-N Val-Thr-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](C(C)C)N)O OFTXTCGQJXTNQS-XGEHTFHBSA-N 0.000 description 1
- JAIZPWVHPQRYOU-ZJDVBMNYSA-N Val-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O JAIZPWVHPQRYOU-ZJDVBMNYSA-N 0.000 description 1
- NGXQOQNXSGOYOI-BQFCYCMXSA-N Val-Trp-Gln Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O)=CNC2=C1 NGXQOQNXSGOYOI-BQFCYCMXSA-N 0.000 description 1
- LZRWTJSPTJSWDN-FKBYEOEOSA-N Val-Trp-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC3=CC=CC=C3)C(=O)O)N LZRWTJSPTJSWDN-FKBYEOEOSA-N 0.000 description 1
- OEVFFOBAXHBXKM-HSHDSVGOSA-N Val-Trp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](C(C)C)N)O OEVFFOBAXHBXKM-HSHDSVGOSA-N 0.000 description 1
- PMKQKNBISAOSRI-XHSDSOJGSA-N Val-Tyr-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N PMKQKNBISAOSRI-XHSDSOJGSA-N 0.000 description 1
- SSKKGOWRPNIVDW-AVGNSLFASA-N Val-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N SSKKGOWRPNIVDW-AVGNSLFASA-N 0.000 description 1
- AOILQMZPNLUXCM-AVGNSLFASA-N Val-Val-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN AOILQMZPNLUXCM-AVGNSLFASA-N 0.000 description 1
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Natural products CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 description 1
- 241000251539 Vertebrata <Metazoa> Species 0.000 description 1
- 208000036142 Viral infection Diseases 0.000 description 1
- IXKSXJFAGXLQOQ-XISFHERQSA-N WHWLQLKPGQPMY Chemical compound C([C@@H](C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)C1=CNC=N1 IXKSXJFAGXLQOQ-XISFHERQSA-N 0.000 description 1
- 240000008042 Zea mays Species 0.000 description 1
- 235000005824 Zea mays ssp. parviglumis Nutrition 0.000 description 1
- 235000002017 Zea mays subsp mays Nutrition 0.000 description 1
- 238000002835 absorbance Methods 0.000 description 1
- 235000010489 acacia gum Nutrition 0.000 description 1
- 239000000205 acacia gum Substances 0.000 description 1
- 235000011054 acetic acid Nutrition 0.000 description 1
- 230000021736 acetylation Effects 0.000 description 1
- 238000006640 acetylation reaction Methods 0.000 description 1
- 108020002494 acetyltransferase Proteins 0.000 description 1
- 102000005421 acetyltransferase Human genes 0.000 description 1
- 239000002253 acid Substances 0.000 description 1
- 150000007513 acids Chemical class 0.000 description 1
- 230000009471 action Effects 0.000 description 1
- 239000013543 active substance Substances 0.000 description 1
- 125000002252 acyl group Chemical group 0.000 description 1
- 230000010933 acylation Effects 0.000 description 1
- 238000005917 acylation reaction Methods 0.000 description 1
- 229960000643 adenine Drugs 0.000 description 1
- 230000002411 adverse Effects 0.000 description 1
- 229940040563 agaric acid Drugs 0.000 description 1
- 238000000246 agarose gel electrophoresis Methods 0.000 description 1
- 230000002776 aggregation Effects 0.000 description 1
- 238000004220 aggregation Methods 0.000 description 1
- 238000011256 aggressive treatment Methods 0.000 description 1
- 235000004279 alanine Nutrition 0.000 description 1
- 108010008685 alanyl-glutamyl-aspartic acid Proteins 0.000 description 1
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 1
- 108010069020 alanyl-prolyl-glycine Proteins 0.000 description 1
- 108010045023 alanyl-prolyl-tyrosine Proteins 0.000 description 1
- 108010044940 alanylglutamine Proteins 0.000 description 1
- 108010070944 alanylhistidine Proteins 0.000 description 1
- 201000007930 alcohol dependence Diseases 0.000 description 1
- 150000001298 alcohols Chemical class 0.000 description 1
- 235000010443 alginic acid Nutrition 0.000 description 1
- 239000000783 alginic acid Substances 0.000 description 1
- 229920000615 alginic acid Polymers 0.000 description 1
- 229960001126 alginic acid Drugs 0.000 description 1
- 150000004781 alginic acids Chemical class 0.000 description 1
- 125000000217 alkyl group Chemical group 0.000 description 1
- 108010050025 alpha-glutamyltryptophan Proteins 0.000 description 1
- WNROFYMDJYEPJX-UHFFFAOYSA-K aluminium hydroxide Chemical compound [OH-].[OH-].[OH-].[Al+3] WNROFYMDJYEPJX-UHFFFAOYSA-K 0.000 description 1
- 125000003277 amino group Chemical group 0.000 description 1
- 229940126575 aminoglycoside Drugs 0.000 description 1
- 210000004102 animal cell Anatomy 0.000 description 1
- 239000004410 anthocyanin Substances 0.000 description 1
- 229930002877 anthocyanin Natural products 0.000 description 1
- 235000010208 anthocyanin Nutrition 0.000 description 1
- 150000004636 anthocyanins Chemical class 0.000 description 1
- 239000003242 anti bacterial agent Substances 0.000 description 1
- 230000000340 anti-metabolite Effects 0.000 description 1
- 230000002788 anti-peptide Effects 0.000 description 1
- 229940100197 antimetabolite Drugs 0.000 description 1
- 239000002256 antimetabolite Substances 0.000 description 1
- 239000000074 antisense oligonucleotide Substances 0.000 description 1
- 238000012230 antisense oligonucleotides Methods 0.000 description 1
- 230000001640 apoptogenic effect Effects 0.000 description 1
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 1
- 108010091092 arginyl-glycyl-proline Proteins 0.000 description 1
- 108010068380 arginylarginine Proteins 0.000 description 1
- 210000004507 artificial chromosome Anatomy 0.000 description 1
- 235000009582 asparagine Nutrition 0.000 description 1
- 229960001230 asparagine Drugs 0.000 description 1
- 108010010430 asparagine-proline-alanine Proteins 0.000 description 1
- 235000003704 aspartic acid Nutrition 0.000 description 1
- 238000000376 autoradiography Methods 0.000 description 1
- 210000003719 b-lymphocyte Anatomy 0.000 description 1
- 244000052616 bacterial pathogen Species 0.000 description 1
- 108010058966 bacteriophage T7 induced DNA polymerase Proteins 0.000 description 1
- WQZGKKKJIJFFOK-VFUOTHLCSA-N beta-D-glucose Chemical compound OC[C@H]1O[C@@H](O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-VFUOTHLCSA-N 0.000 description 1
- DRTQHJPVMGBUCF-PSQAKQOGSA-N beta-L-uridine Natural products O[C@H]1[C@@H](O)[C@H](CO)O[C@@H]1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-PSQAKQOGSA-N 0.000 description 1
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 description 1
- GUBGYTABKSRVRQ-QUYVBRFLSA-N beta-maltose Chemical compound OC[C@H]1O[C@H](O[C@H]2[C@H](O)[C@@H](O)[C@H](O)O[C@@H]2CO)[C@H](O)[C@@H](O)[C@@H]1O GUBGYTABKSRVRQ-QUYVBRFLSA-N 0.000 description 1
- 239000003833 bile salt Substances 0.000 description 1
- 229940093761 bile salts Drugs 0.000 description 1
- 239000011230 binding agent Substances 0.000 description 1
- 238000004166 bioassay Methods 0.000 description 1
- 229960002685 biotin Drugs 0.000 description 1
- 235000020958 biotin Nutrition 0.000 description 1
- 239000011616 biotin Substances 0.000 description 1
- 229940098773 bovine serum albumin Drugs 0.000 description 1
- 210000000481 breast Anatomy 0.000 description 1
- 238000009395 breeding Methods 0.000 description 1
- 230000001488 breeding effect Effects 0.000 description 1
- 229950004398 broxuridine Drugs 0.000 description 1
- 239000011575 calcium Substances 0.000 description 1
- 239000004202 carbamide Substances 0.000 description 1
- 230000021523 carboxylation Effects 0.000 description 1
- 238000006473 carboxylation reaction Methods 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 230000003197 catalytic effect Effects 0.000 description 1
- 230000030833 cell death Effects 0.000 description 1
- 230000005754 cellular signaling Effects 0.000 description 1
- 239000001913 cellulose Substances 0.000 description 1
- 229920002678 cellulose Polymers 0.000 description 1
- 210000001175 cerebrospinal fluid Anatomy 0.000 description 1
- 238000012412 chemical coupling Methods 0.000 description 1
- 239000005081 chemiluminescent agent Substances 0.000 description 1
- VJYIFXVZLXQVHO-UHFFFAOYSA-N chlorsulfuron Chemical compound COC1=NC(C)=NC(NC(=O)NS(=O)(=O)C=2C(=CC=CC=2)Cl)=N1 VJYIFXVZLXQVHO-UHFFFAOYSA-N 0.000 description 1
- 238000004587 chromatography analysis Methods 0.000 description 1
- 238000003200 chromosome mapping Methods 0.000 description 1
- 239000013599 cloning vector Substances 0.000 description 1
- 239000011248 coating agent Substances 0.000 description 1
- 229920001436 collagen Polymers 0.000 description 1
- 210000001072 colon Anatomy 0.000 description 1
- 238000004440 column chromatography Methods 0.000 description 1
- 238000002648 combination therapy Methods 0.000 description 1
- 230000009137 competitive binding Effects 0.000 description 1
- 230000002860 competitive effect Effects 0.000 description 1
- 230000021615 conjugation Effects 0.000 description 1
- 235000005822 corn Nutrition 0.000 description 1
- 238000004132 cross linking Methods 0.000 description 1
- 238000009402 cross-breeding Methods 0.000 description 1
- ATDGTVJJHBUTRL-UHFFFAOYSA-N cyanogen bromide Chemical compound BrC#N ATDGTVJJHBUTRL-UHFFFAOYSA-N 0.000 description 1
- 108010004073 cysteinylcysteine Proteins 0.000 description 1
- UHDGCWIWMRVCDJ-ZAKLUEHWSA-N cytidine Chemical compound O=C1N=C(N)C=CN1[C@H]1[C@H](O)[C@@H](O)[C@H](CO)O1 UHDGCWIWMRVCDJ-ZAKLUEHWSA-N 0.000 description 1
- 230000007123 defense Effects 0.000 description 1
- 230000007850 degeneration Effects 0.000 description 1
- 238000002716 delivery method Methods 0.000 description 1
- 239000003398 denaturant Substances 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 239000008121 dextrose Substances 0.000 description 1
- 235000014113 dietary fatty acids Nutrition 0.000 description 1
- 108010009297 diglycyl-histidine Proteins 0.000 description 1
- 238000010790 dilution Methods 0.000 description 1
- 239000012895 dilution Substances 0.000 description 1
- 239000000539 dimer Substances 0.000 description 1
- 238000006471 dimerization reaction Methods 0.000 description 1
- 229960001760 dimethyl sulfoxide Drugs 0.000 description 1
- FRKBLBQTSTUKOV-UHFFFAOYSA-N diphosphatidyl glycerol Natural products OP(O)(=O)OCC(OP(O)(O)=O)COP(O)(O)=O FRKBLBQTSTUKOV-UHFFFAOYSA-N 0.000 description 1
- 108010054812 diprotin A Proteins 0.000 description 1
- 208000037765 diseases and disorders Diseases 0.000 description 1
- 238000010494 dissociation reaction Methods 0.000 description 1
- 230000005593 dissociations Effects 0.000 description 1
- 239000012153 distilled water Substances 0.000 description 1
- 238000009826 distribution Methods 0.000 description 1
- 239000002552 dosage form Substances 0.000 description 1
- 230000003828 downregulation Effects 0.000 description 1
- 239000000890 drug combination Substances 0.000 description 1
- 238000007878 drug screening assay Methods 0.000 description 1
- 238000001035 drying Methods 0.000 description 1
- 239000000975 dye Substances 0.000 description 1
- 230000005584 early death Effects 0.000 description 1
- 208000025688 early-onset autosomal dominant Alzheimer disease Diseases 0.000 description 1
- 238000001962 electrophoresis Methods 0.000 description 1
- 230000001804 emulsifying effect Effects 0.000 description 1
- 239000000839 emulsion Substances 0.000 description 1
- 230000012202 endocytosis Effects 0.000 description 1
- 210000004696 endometrium Anatomy 0.000 description 1
- 210000002472 endoplasmic reticulum Anatomy 0.000 description 1
- 210000002889 endothelial cell Anatomy 0.000 description 1
- 230000003511 endothelial effect Effects 0.000 description 1
- 238000001976 enzyme digestion Methods 0.000 description 1
- 230000032050 esterification Effects 0.000 description 1
- 238000005886 esterification reaction Methods 0.000 description 1
- 229940011871 estrogen Drugs 0.000 description 1
- 239000000262 estrogen Substances 0.000 description 1
- LVGKNOAMLMIIKO-QXMHVHEDSA-N ethyl oleate Chemical compound CCCCCCCC\C=C/CCCCCCCC(=O)OCC LVGKNOAMLMIIKO-QXMHVHEDSA-N 0.000 description 1
- 229940093471 ethyl oleate Drugs 0.000 description 1
- 238000001704 evaporation Methods 0.000 description 1
- 230000008020 evaporation Effects 0.000 description 1
- 230000007717 exclusion Effects 0.000 description 1
- 102000013165 exonuclease Human genes 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 125000004030 farnesyl group Chemical group [H]C([*])([H])C([H])=C(C([H])([H])[H])C([H])([H])C([H])([H])C([H])=C(C([H])([H])[H])C([H])([H])C([H])([H])C([H])=C(C([H])([H])[H])C([H])([H])[H] 0.000 description 1
- 229930195729 fatty acid Natural products 0.000 description 1
- 239000000194 fatty acid Substances 0.000 description 1
- 150000004665 fatty acids Chemical group 0.000 description 1
- 230000004720 fertilization Effects 0.000 description 1
- 239000000835 fiber Substances 0.000 description 1
- 230000004761 fibrosis Effects 0.000 description 1
- 230000009969 flowable effect Effects 0.000 description 1
- GNBHRKFJIUUOQI-UHFFFAOYSA-N fluorescein Chemical compound O1C(=O)C2=CC=CC=C2C21C1=CC=C(O)C=C1OC1=CC(O)=CC=C21 GNBHRKFJIUUOQI-UHFFFAOYSA-N 0.000 description 1
- 230000037406 food intake Effects 0.000 description 1
- 238000013467 fragmentation Methods 0.000 description 1
- 238000006062 fragmentation reaction Methods 0.000 description 1
- 239000012458 free base Substances 0.000 description 1
- 238000004108 freeze drying Methods 0.000 description 1
- 239000000446 fuel Substances 0.000 description 1
- 238000002825 functional assay Methods 0.000 description 1
- 108010006664 gamma-glutamyl-glycyl-glycine Proteins 0.000 description 1
- 238000012239 gene modification Methods 0.000 description 1
- 208000016361 genetic disease Diseases 0.000 description 1
- 230000005017 genetic modification Effects 0.000 description 1
- 235000013617 genetically modified food Nutrition 0.000 description 1
- 125000002686 geranylgeranyl group Chemical group [H]C([*])([H])/C([H])=C(C([H])([H])[H])/C([H])([H])C([H])([H])/C([H])=C(C([H])([H])[H])/C([H])([H])C([H])([H])/C([H])=C(C([H])([H])[H])/C([H])([H])C([H])([H])C([H])=C(C([H])([H])[H])C([H])([H])[H] 0.000 description 1
- 235000013922 glutamic acid Nutrition 0.000 description 1
- 239000004220 glutamic acid Substances 0.000 description 1
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 1
- 108010080575 glutamyl-aspartyl-alanine Proteins 0.000 description 1
- 108010042598 glutamyl-aspartyl-glycine Proteins 0.000 description 1
- 108010040856 glutamyl-cysteinyl-alanine Proteins 0.000 description 1
- 108010079547 glutamylmethionine Proteins 0.000 description 1
- JYPCXBJRLBHWME-UHFFFAOYSA-N glycyl-L-prolyl-L-arginine Natural products NCC(=O)N1CCCC1C(=O)NC(CCCN=C(N)N)C(O)=O JYPCXBJRLBHWME-UHFFFAOYSA-N 0.000 description 1
- 108010062266 glycyl-glycyl-argininal Proteins 0.000 description 1
- 108010026364 glycyl-glycyl-leucine Proteins 0.000 description 1
- 108010010096 glycyl-glycyl-tyrosine Proteins 0.000 description 1
- 108010023364 glycyl-histidyl-arginine Proteins 0.000 description 1
- 108010028188 glycyl-histidyl-serine Proteins 0.000 description 1
- 108010050475 glycyl-leucyl-tyrosine Proteins 0.000 description 1
- 108010077435 glycyl-phenylalanyl-glycine Proteins 0.000 description 1
- 108010079413 glycyl-prolyl-glutamic acid Proteins 0.000 description 1
- 108010082286 glycyl-seryl-alanine Proteins 0.000 description 1
- 108010089804 glycyl-threonine Proteins 0.000 description 1
- 108010008671 glycyl-tryptophyl-methionine Proteins 0.000 description 1
- 108010077515 glycylproline Proteins 0.000 description 1
- 108010084389 glycyltryptophan Proteins 0.000 description 1
- 108010087823 glycyltyrosine Proteins 0.000 description 1
- 239000008187 granular material Substances 0.000 description 1
- 238000000227 grinding Methods 0.000 description 1
- 230000012010 growth Effects 0.000 description 1
- YQOKLYTXVFAUCW-UHFFFAOYSA-N guanidine;isothiocyanic acid Chemical compound N=C=S.NC(N)=N YQOKLYTXVFAUCW-UHFFFAOYSA-N 0.000 description 1
- ZJYYHGLJYGJLLN-UHFFFAOYSA-N guanidinium thiocyanate Chemical compound SC#N.NC(N)=N ZJYYHGLJYGJLLN-UHFFFAOYSA-N 0.000 description 1
- 230000003394 haemopoietic effect Effects 0.000 description 1
- 210000003958 hematopoietic stem cell Anatomy 0.000 description 1
- 210000003494 hepatocyte Anatomy 0.000 description 1
- 230000002363 herbicidal effect Effects 0.000 description 1
- 239000004009 herbicide Substances 0.000 description 1
- 238000013537 high throughput screening Methods 0.000 description 1
- 101150113423 hisD gene Proteins 0.000 description 1
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 1
- 125000000487 histidyl group Chemical group [H]N([H])C(C(=O)O*)C([H])([H])C1=C([H])N([H])C([H])=N1 0.000 description 1
- 108010036413 histidylglycine Proteins 0.000 description 1
- 238000002744 homologous recombination Methods 0.000 description 1
- 230000006801 homologous recombination Effects 0.000 description 1
- 210000005260 human cell Anatomy 0.000 description 1
- 229940098197 human immunoglobulin g Drugs 0.000 description 1
- 125000001183 hydrocarbyl group Chemical group 0.000 description 1
- 229960000890 hydrocortisone Drugs 0.000 description 1
- 125000002887 hydroxy group Chemical group [H]O* 0.000 description 1
- 239000001866 hydroxypropyl methyl cellulose Substances 0.000 description 1
- 235000010979 hydroxypropyl methyl cellulose Nutrition 0.000 description 1
- 229920003088 hydroxypropyl methyl cellulose Polymers 0.000 description 1
- UFVKGYZPFZQRLF-UHFFFAOYSA-N hydroxypropyl methyl cellulose Chemical compound OC1C(O)C(OC)OC(CO)C1OC1C(O)C(O)C(OC2C(C(O)C(OC3C(C(O)C(O)C(CO)O3)O)C(CO)O2)O)C(CO)O1 UFVKGYZPFZQRLF-UHFFFAOYSA-N 0.000 description 1
- 238000003384 imaging method Methods 0.000 description 1
- 208000026278 immune system disease Diseases 0.000 description 1
- 230000003053 immunization Effects 0.000 description 1
- 230000000984 immunochemical effect Effects 0.000 description 1
- 238000007901 in situ hybridization Methods 0.000 description 1
- 238000011534 incubation Methods 0.000 description 1
- 230000002458 infectious effect Effects 0.000 description 1
- 230000001524 infective effect Effects 0.000 description 1
- 230000000977 initiatory effect Effects 0.000 description 1
- 229910052500 inorganic mineral Inorganic materials 0.000 description 1
- 229960003786 inosine Drugs 0.000 description 1
- 210000000936 intestine Anatomy 0.000 description 1
- 238000001361 intraarterial administration Methods 0.000 description 1
- 102000027411 intracellular receptors Human genes 0.000 description 1
- 108091008582 intracellular receptors Proteins 0.000 description 1
- 238000007918 intramuscular administration Methods 0.000 description 1
- 238000007912 intraperitoneal administration Methods 0.000 description 1
- 238000007913 intrathecal administration Methods 0.000 description 1
- 238000001990 intravenous administration Methods 0.000 description 1
- 238000007914 intraventricular administration Methods 0.000 description 1
- 238000007852 inverse PCR Methods 0.000 description 1
- 238000011835 investigation Methods 0.000 description 1
- 238000002955 isolation Methods 0.000 description 1
- 229960000310 isoleucine Drugs 0.000 description 1
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 description 1
- 108010031424 isoleucyl-prolyl-proline Proteins 0.000 description 1
- QRXWMOHMRWLFEY-UHFFFAOYSA-N isoniazide Chemical compound NNC(=O)C1=CC=NC=C1 QRXWMOHMRWLFEY-UHFFFAOYSA-N 0.000 description 1
- 108010053037 kyotorphin Proteins 0.000 description 1
- 238000011005 laboratory method Methods 0.000 description 1
- 101150066555 lacZ gene Proteins 0.000 description 1
- 239000004922 lacquer Substances 0.000 description 1
- 235000014655 lactic acid Nutrition 0.000 description 1
- 239000004816 latex Substances 0.000 description 1
- 229920000126 latex Polymers 0.000 description 1
- 231100000518 lethal Toxicity 0.000 description 1
- 230000001665 lethal effect Effects 0.000 description 1
- 108010083708 leucyl-aspartyl-valine Proteins 0.000 description 1
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 1
- 239000003446 ligand Substances 0.000 description 1
- 230000029226 lipidation Effects 0.000 description 1
- 238000001638 lipofection Methods 0.000 description 1
- 238000011068 loading method Methods 0.000 description 1
- 150000004668 long chain fatty acids Chemical class 0.000 description 1
- 230000007774 longterm Effects 0.000 description 1
- PCZOHLXUXFIOCF-BXMDZJJMSA-N lovastatin Chemical compound C([C@H]1[C@@H](C)C=CC2=C[C@H](C)C[C@@H]([C@H]12)OC(=O)[C@@H](C)CC)C[C@@H]1C[C@@H](O)CC(=O)O1 PCZOHLXUXFIOCF-BXMDZJJMSA-N 0.000 description 1
- 229960004844 lovastatin Drugs 0.000 description 1
- QLJODMDSTUBWDW-UHFFFAOYSA-N lovastatin hydroxy acid Natural products C1=CC(C)C(CCC(O)CC(O)CC(O)=O)C2C(OC(=O)C(C)CC)CC(C)C=C21 QLJODMDSTUBWDW-UHFFFAOYSA-N 0.000 description 1
- 239000000314 lubricant Substances 0.000 description 1
- 210000004698 lymphocyte Anatomy 0.000 description 1
- 239000008176 lyophilized powder Substances 0.000 description 1
- 125000003588 lysine group Chemical group [H]N([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])(N([H])[H])C(*)=O 0.000 description 1
- 108010025153 lysyl-alanyl-alanine Proteins 0.000 description 1
- 108010044348 lysyl-glutamyl-aspartic acid Proteins 0.000 description 1
- 108010076718 lysyl-glutamyl-tryptophan Proteins 0.000 description 1
- 239000011777 magnesium Substances 0.000 description 1
- 235000019359 magnesium stearate Nutrition 0.000 description 1
- 239000006249 magnetic particle Substances 0.000 description 1
- 238000012423 maintenance Methods 0.000 description 1
- 230000031852 maintenance of location in cell Effects 0.000 description 1
- 235000011090 malic acid Nutrition 0.000 description 1
- 230000000873 masking effect Effects 0.000 description 1
- 238000002844 melting Methods 0.000 description 1
- 230000008018 melting Effects 0.000 description 1
- 210000004379 membrane Anatomy 0.000 description 1
- 230000008172 membrane trafficking Effects 0.000 description 1
- 239000002207 metabolite Substances 0.000 description 1
- 108700023046 methionyl-leucyl-phenylalanine Proteins 0.000 description 1
- 108010005942 methionylglycine Proteins 0.000 description 1
- 108010085203 methionylmethionine Proteins 0.000 description 1
- 108010068488 methionylphenylalanine Proteins 0.000 description 1
- 229960000485 methotrexate Drugs 0.000 description 1
- 229920000609 methyl cellulose Polymers 0.000 description 1
- 239000001923 methylcellulose Substances 0.000 description 1
- 235000010981 methylcellulose Nutrition 0.000 description 1
- YACKEPLHDIMKIO-UHFFFAOYSA-N methylphosphonic acid Chemical class CP(O)(O)=O YACKEPLHDIMKIO-UHFFFAOYSA-N 0.000 description 1
- 244000005700 microbiome Species 0.000 description 1
- 238000000520 microinjection Methods 0.000 description 1
- 239000011859 microparticle Substances 0.000 description 1
- 235000013336 milk Nutrition 0.000 description 1
- 239000008267 milk Substances 0.000 description 1
- 210000004080 milk Anatomy 0.000 description 1
- 239000011707 mineral Substances 0.000 description 1
- 230000034839 mitotic sister chromatid segregation Effects 0.000 description 1
- 238000002156 mixing Methods 0.000 description 1
- 238000001823 molecular biology technique Methods 0.000 description 1
- 238000010172 mouse model Methods 0.000 description 1
- 238000002887 multiple sequence alignment Methods 0.000 description 1
- 230000000869 mutational effect Effects 0.000 description 1
- 229960004927 neomycin Drugs 0.000 description 1
- 210000005170 neoplastic cell Anatomy 0.000 description 1
- 230000004770 neurodegeneration Effects 0.000 description 1
- 230000007935 neutral effect Effects 0.000 description 1
- 239000012454 non-polar solvent Substances 0.000 description 1
- 231100000956 nontoxicity Toxicity 0.000 description 1
- 238000007899 nucleic acid hybridization Methods 0.000 description 1
- 210000004940 nucleus Anatomy 0.000 description 1
- 229920001778 nylon Polymers 0.000 description 1
- 239000003921 oil Substances 0.000 description 1
- 235000019198 oils Nutrition 0.000 description 1
- 239000002751 oligonucleotide probe Substances 0.000 description 1
- 210000003463 organelle Anatomy 0.000 description 1
- 239000003791 organic solvent mixture Substances 0.000 description 1
- 210000001672 ovary Anatomy 0.000 description 1
- 230000003647 oxidation Effects 0.000 description 1
- 238000007254 oxidation reaction Methods 0.000 description 1
- 210000000496 pancreas Anatomy 0.000 description 1
- 230000005298 paramagnetic effect Effects 0.000 description 1
- 238000007911 parenteral administration Methods 0.000 description 1
- 235000016236 parenteral nutrition Nutrition 0.000 description 1
- 244000052769 pathogen Species 0.000 description 1
- 230000001717 pathogenic effect Effects 0.000 description 1
- 230000037361 pathway Effects 0.000 description 1
- 239000013610 patient sample Substances 0.000 description 1
- 230000006320 pegylation Effects 0.000 description 1
- 229940111202 pepsin Drugs 0.000 description 1
- HTYIXCKSEQQCJO-UHFFFAOYSA-N phenaglycodol Chemical compound CC(C)(O)C(C)(O)C1=CC=C(Cl)C=C1 HTYIXCKSEQQCJO-UHFFFAOYSA-N 0.000 description 1
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 description 1
- 108010072637 phenylalanyl-arginyl-phenylalanine Proteins 0.000 description 1
- 108010064486 phenylalanyl-leucyl-valine Proteins 0.000 description 1
- 108010084525 phenylalanyl-phenylalanyl-glycine Proteins 0.000 description 1
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 1
- BQVCCPGCDUSGOE-UHFFFAOYSA-N phenylarsine oxide Chemical compound O=[As]C1=CC=CC=C1 BQVCCPGCDUSGOE-UHFFFAOYSA-N 0.000 description 1
- WTJKGGKOPKCXLL-RRHRGVEJSA-N phosphatidylcholine Chemical compound CCCCCCCCCCCCCCCC(=O)OC[C@H](COP([O-])(=O)OCC[N+](C)(C)C)OC(=O)CCCCCCCC=CCCCCCCCC WTJKGGKOPKCXLL-RRHRGVEJSA-N 0.000 description 1
- 150000003905 phosphatidylinositols Chemical class 0.000 description 1
- 102000036213 phospholipid binding proteins Human genes 0.000 description 1
- 108091011000 phospholipid binding proteins Proteins 0.000 description 1
- 150000008300 phosphoramidites Chemical class 0.000 description 1
- 239000000049 pigment Substances 0.000 description 1
- 239000006187 pill Substances 0.000 description 1
- 108010025488 pinealon Proteins 0.000 description 1
- 229920001983 poloxamer Polymers 0.000 description 1
- 229920000447 polyanionic polymer Polymers 0.000 description 1
- 238000003752 polymerase chain reaction Methods 0.000 description 1
- 229920005862 polyol Polymers 0.000 description 1
- 150000003077 polyols Chemical class 0.000 description 1
- 239000011148 porous material Substances 0.000 description 1
- 230000003389 potentiating effect Effects 0.000 description 1
- 238000002953 preparative HPLC Methods 0.000 description 1
- 239000000186 progesterone Substances 0.000 description 1
- 229960003387 progesterone Drugs 0.000 description 1
- QKWLAUAUUGXSSE-UHFFFAOYSA-L prohexadione-calcium Chemical compound [Ca+2].CCC(=O)C1=C([O-])CC(C([O-])=O)CC1=O QKWLAUAUUGXSSE-UHFFFAOYSA-L 0.000 description 1
- 210000001236 prokaryotic cell Anatomy 0.000 description 1
- 230000002035 prolonged effect Effects 0.000 description 1
- 108010025826 prolyl-leucyl-arginine Proteins 0.000 description 1
- 108700042769 prolyl-leucyl-glycine Proteins 0.000 description 1
- 108010093296 prolyl-prolyl-alanine Proteins 0.000 description 1
- 108010031719 prolyl-serine Proteins 0.000 description 1
- 108010079317 prolyl-tyrosine Proteins 0.000 description 1
- 108010004914 prolylarginine Proteins 0.000 description 1
- 108010029020 prolylglycine Proteins 0.000 description 1
- 230000001915 proofreading effect Effects 0.000 description 1
- XJMOSONTPMZWPB-UHFFFAOYSA-M propidium iodide Chemical compound [I-].[I-].C12=CC(N)=CC=C2C2=CC=C(N)C=C2[N+](CCC[N+](C)(CC)CC)=C1C1=CC=CC=C1 XJMOSONTPMZWPB-UHFFFAOYSA-M 0.000 description 1
- 210000002307 prostate Anatomy 0.000 description 1
- 230000001681 protective effect Effects 0.000 description 1
- 235000004252 protein component Nutrition 0.000 description 1
- 230000012846 protein folding Effects 0.000 description 1
- 238000001243 protein synthesis Methods 0.000 description 1
- 230000006337 proteolytic cleavage Effects 0.000 description 1
- 238000004445 quantitative analysis Methods 0.000 description 1
- 230000002285 radioactive effect Effects 0.000 description 1
- 238000000163 radioactive labelling Methods 0.000 description 1
- 102000016914 ras Proteins Human genes 0.000 description 1
- 238000009790 rate-determining step (RDS) Methods 0.000 description 1
- 239000011535 reaction buffer Substances 0.000 description 1
- 230000009257 reactivity Effects 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 230000013812 regulation of steroid biosynthetic process Effects 0.000 description 1
- 125000006853 reporter group Chemical group 0.000 description 1
- 230000001850 reproductive effect Effects 0.000 description 1
- 239000011369 resultant mixture Substances 0.000 description 1
- 210000001995 reticulocyte Anatomy 0.000 description 1
- 238000010839 reverse transcription Methods 0.000 description 1
- 239000002336 ribonucleotide Substances 0.000 description 1
- 125000002652 ribonucleotide group Chemical group 0.000 description 1
- 235000009566 rice Nutrition 0.000 description 1
- 238000013391 scatchard analysis Methods 0.000 description 1
- 238000010187 selection method Methods 0.000 description 1
- 239000006152 selective media Substances 0.000 description 1
- 230000000405 serological effect Effects 0.000 description 1
- 108010071207 serylmethionine Proteins 0.000 description 1
- 239000008159 sesame oil Substances 0.000 description 1
- 235000011803 sesame oil Nutrition 0.000 description 1
- 230000011664 signaling Effects 0.000 description 1
- 238000002741 site-directed mutagenesis Methods 0.000 description 1
- 239000002002 slurry Substances 0.000 description 1
- 239000001632 sodium acetate Substances 0.000 description 1
- 235000017281 sodium acetate Nutrition 0.000 description 1
- 235000010413 sodium alginate Nutrition 0.000 description 1
- 239000000661 sodium alginate Substances 0.000 description 1
- 229940005550 sodium alginate Drugs 0.000 description 1
- 239000001509 sodium citrate Substances 0.000 description 1
- NLJMYIDDQXHKNR-UHFFFAOYSA-K sodium citrate Chemical compound O.O.[Na+].[Na+].[Na+].[O-]C(=O)CC(O)(CC([O-])=O)C([O-])=O NLJMYIDDQXHKNR-UHFFFAOYSA-K 0.000 description 1
- 239000007901 soft capsule Substances 0.000 description 1
- 239000012439 solid excipient Substances 0.000 description 1
- 230000007928 solubilization Effects 0.000 description 1
- 238000005063 solubilization Methods 0.000 description 1
- 108010005652 splenotritin Proteins 0.000 description 1
- 230000000087 stabilizing effect Effects 0.000 description 1
- 230000010473 stable expression Effects 0.000 description 1
- 238000010186 staining Methods 0.000 description 1
- 239000008107 starch Substances 0.000 description 1
- 210000000130 stem cell Anatomy 0.000 description 1
- 238000007920 subcutaneous administration Methods 0.000 description 1
- 235000011044 succinic acid Nutrition 0.000 description 1
- 150000003444 succinic acids Chemical class 0.000 description 1
- 125000000446 sulfanediyl group Chemical group *S* 0.000 description 1
- FRGKKTITADJNOE-UHFFFAOYSA-N sulfanyloxyethane Chemical compound CCOS FRGKKTITADJNOE-UHFFFAOYSA-N 0.000 description 1
- 230000002194 synthesizing effect Effects 0.000 description 1
- 239000006188 syrup Substances 0.000 description 1
- 235000020357 syrup Nutrition 0.000 description 1
- 230000009885 systemic effect Effects 0.000 description 1
- 239000003826 tablet Substances 0.000 description 1
- 235000012222 talc Nutrition 0.000 description 1
- 235000002906 tartaric acid Nutrition 0.000 description 1
- 229960003604 testosterone Drugs 0.000 description 1
- 231100001274 therapeutic index Toxicity 0.000 description 1
- 238000005382 thermal cycling Methods 0.000 description 1
- RYYWUUFWQRZTIU-UHFFFAOYSA-K thiophosphate Chemical compound [O-]P([O-])([O-])=S RYYWUUFWQRZTIU-UHFFFAOYSA-K 0.000 description 1
- 229940094937 thioredoxin Drugs 0.000 description 1
- 108010031491 threonyl-lysyl-glutamic acid Proteins 0.000 description 1
- 108010071097 threonyl-lysyl-proline Proteins 0.000 description 1
- 108010072986 threonyl-seryl-lysine Proteins 0.000 description 1
- 229960002175 thyroglobulin Drugs 0.000 description 1
- 238000012090 tissue culture technique Methods 0.000 description 1
- 239000010936 titanium Substances 0.000 description 1
- 239000004408 titanium dioxide Substances 0.000 description 1
- 231100000331 toxic Toxicity 0.000 description 1
- 230000002588 toxic effect Effects 0.000 description 1
- 231100000419 toxicity Toxicity 0.000 description 1
- 230000001988 toxicity Effects 0.000 description 1
- 235000010487 tragacanth Nutrition 0.000 description 1
- 239000000196 tragacanth Substances 0.000 description 1
- 229940116362 tragacanth Drugs 0.000 description 1
- 230000001131 transforming effect Effects 0.000 description 1
- 230000001052 transient effect Effects 0.000 description 1
- 230000010474 transient expression Effects 0.000 description 1
- 230000005945 translocation Effects 0.000 description 1
- 230000017105 transposition Effects 0.000 description 1
- 238000011269 treatment regimen Methods 0.000 description 1
- 101150081616 trpB gene Proteins 0.000 description 1
- 101150111232 trpB-1 gene Proteins 0.000 description 1
- 108010015666 tryptophyl-leucyl-glutamic acid Proteins 0.000 description 1
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 1
- 108010005834 tyrosyl-alanyl-glycine Proteins 0.000 description 1
- 108010017949 tyrosyl-glycyl-glycine Proteins 0.000 description 1
- 108010051110 tyrosyl-lysine Proteins 0.000 description 1
- 108010020532 tyrosyl-proline Proteins 0.000 description 1
- 108010078580 tyrosylleucine Proteins 0.000 description 1
- 108010003137 tyrosyltyrosine Proteins 0.000 description 1
- 241001529453 unidentified herpesvirus Species 0.000 description 1
- 241000701366 unidentified nuclear polyhedrosis viruses Species 0.000 description 1
- 241001515965 unidentified phage Species 0.000 description 1
- 241001430294 unidentified retrovirus Species 0.000 description 1
- DRTQHJPVMGBUCF-UHFFFAOYSA-N uracil arabinoside Natural products OC1C(O)C(CO)OC1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-UHFFFAOYSA-N 0.000 description 1
- 229940045145 uridine Drugs 0.000 description 1
- 210000003934 vacuole Anatomy 0.000 description 1
- 239000004474 valine Substances 0.000 description 1
- 108010015385 valyl-prolyl-proline Proteins 0.000 description 1
- 239000003981 vehicle Substances 0.000 description 1
- 230000028973 vesicle-mediated transport Effects 0.000 description 1
- 230000009385 viral infection Effects 0.000 description 1
- 239000013603 viral vector Substances 0.000 description 1
- 235000012431 wafers Nutrition 0.000 description 1
- 108010000998 wheylin-2 peptide Proteins 0.000 description 1
- QAOHCFGKCWTBGC-QHOAOGIMSA-N wybutosine Chemical compound C1=NC=2C(=O)N3C(CC[C@H](NC(=O)OC)C(=O)OC)=C(C)N=C3N(C)C=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O QAOHCFGKCWTBGC-QHOAOGIMSA-N 0.000 description 1
- QAOHCFGKCWTBGC-UHFFFAOYSA-N wybutosine Natural products C1=NC=2C(=O)N3C(CCC(NC(=O)OC)C(=O)OC)=C(C)N=C3N(C)C=2N1C1OC(CO)C(O)C1O QAOHCFGKCWTBGC-UHFFFAOYSA-N 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
- C07K14/46—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates
- C07K14/47—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates from mammals
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P1/00—Drugs for disorders of the alimentary tract or the digestive system
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P25/00—Drugs for disorders of the nervous system
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P3/00—Drugs for disorders of the metabolism
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P3/00—Drugs for disorders of the metabolism
- A61P3/06—Antihyperlipidemics
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P43/00—Drugs for specific purposes, not provided for in groups A61P1/00-A61P41/00
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P9/00—Drugs for disorders of the cardiovascular system
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01K—ANIMAL HUSBANDRY; AVICULTURE; APICULTURE; PISCICULTURE; FISHING; REARING OR BREEDING ANIMALS, NOT OTHERWISE PROVIDED FOR; NEW BREEDS OF ANIMALS
- A01K2217/00—Genetically modified animals
- A01K2217/05—Animals comprising random inserted nucleic acids (transgenic)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2799/00—Uses of viruses
- C12N2799/02—Uses of viruses as vector
- C12N2799/021—Uses of viruses as vector for the expression of a heterologous nucleic acid
Landscapes
- Health & Medical Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Organic Chemistry (AREA)
- General Health & Medical Sciences (AREA)
- Medicinal Chemistry (AREA)
- Pharmacology & Pharmacy (AREA)
- Chemical Kinetics & Catalysis (AREA)
- General Chemical & Material Sciences (AREA)
- Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Animal Behavior & Ethology (AREA)
- Engineering & Computer Science (AREA)
- Public Health (AREA)
- Veterinary Medicine (AREA)
- Obesity (AREA)
- Hematology (AREA)
- Diabetes (AREA)
- Cardiology (AREA)
- Biophysics (AREA)
- Biomedical Technology (AREA)
- Neurology (AREA)
- Neurosurgery (AREA)
- Toxicology (AREA)
- Zoology (AREA)
- Gastroenterology & Hepatology (AREA)
- Biochemistry (AREA)
- Heart & Thoracic Surgery (AREA)
- Genetics & Genomics (AREA)
- Molecular Biology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Peptides Or Proteins (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
- Medicines That Contain Protein Lipid Enzymes And Other Medicines (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Investigating Or Analysing Biological Materials (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
- Medicines Containing Antibodies Or Antigens For Use As Internal Diagnostic Agents (AREA)
Abstract
The invention provides human lipid-associated proteins (LIPAP) and polynucleotides which identify and encode LIPAP. The invention also provides expression vectors, host cells, antibodies, agonists, and antagonists. The invention also provides methods for diagnosing, treating, or preventing disorders associated with expression of LIPAP.
Description
HUMAN LIPID-ASSOCIATED PROTEINS
TECHNICAL FIELD
This invention relates to nucleic acid and amino acid sequences of human lipid-associated proteins and to the use of these sequences in the diagnosis, treatment, and prevention of cardiovascular, neurological, and gastrointestinal disorders, and disorders of lipid metabolism.
BACKGROUND OF THE INVENTION
Lipids are water-insoluble, oily, or greasy substances that are soluble in nonpolar solvents such as chloroform or ether. Neutral lipids (triacylglycerols) serve as major fuels and energy stores. Polar lipids, such as phospholipids, sphingolipids, glycolipids, and cholesterol, are key structural components of cell membranes. Lipids and proteins are associated in a variety of ways.
Glycolipids form vesicles that carry proteins within cells and cell membranes. Interactions between lipids and proteins function in targeting proteins and glycolipids involved in a variety of processes, such as cell signaling and cell proliferation, to specific membrane and intracellular locations. Proteins are associated with the biosynthesis, transport, and uptake of lipids. In addition, key proteins involved in signal transduction and protein targeting have lipid-derived groups added to them post-translationally (Stryer, L. (1995) Biochemistry, W.H. Freeman and Co., New York NY, pp. 264-267, 934).
A major class of phospholipids are the phosphoglycerides, which are composed of a glycerol backbone, two fatty acid chains, and a phosphorylated alcohol. Principal phosphoglycerides are phosphatidylcholine, phosphatidylethanolamine, phosphatidylserine, phosphatidylinositol, and diphosphatidylglycerol. Many enzymes involved in phosphoglyceride synthesis are associated with membranes (Meyers, R.A. (1995) Molecular Biolo~~and Biotechnolo~y, VCH
Publishers Inc., New York NY, pp. 494-501; Stryer, supra, pp. 264-267). The enzyme phosphatidylserine decarboxylase catalyzes the conversion of phosphatidylserine to phosphatidylethanolamine, using a pyruvate cofactor.
The two forms of yeast phosphatidylserine decarboxylase are localized to the inner mitochondria) membrane and to the Golgi/vacuole membrane, respectively. The mammalian enzyme, also localized to the inner mitochondria) membrane, is made as a proenzyme and subsequently cleaved to alpha and beta subunits (Voelker, D.R. (1997) Biochim. Biophys. Acta 1348:236-244).
Cholesterol, composed of four fused hydrocarbon rings with an alcohol at one end, moderates the fluidity of membranes in which it is incorporated. In addition, cholesterol is used in the synthesis of such hormones as cortisol, progesterone, estrogen, and testosterone. Bile salts derived from cholesterol facilitate the digestion of lipids. Cholesterol in the skin forms a barrier that prevents excess water evaporation from the body. Farnesyl and geranylgeranyl groups, which are derived from cholesterol biosynthesis intermediates, are post-translationally added to signal transduction proteins such as ras and protein-targeting proteins such as rab. These modifications are important for the activities of these proteins (Guyton, A.C. Textbook of Medical Physiology-(1991) W.B. Saunders Company, Philadelphia PA, pp.760-763; Stryer, supra, pp. 279-280, 691-702, 934).
Mammals obtain cholesterol derived from both de novo biosynthesis and the diet. The liver is the major site of cholesterol biosynthesis in mammals. Biosynthesis is accomplished via a series of enzymatic steps known as the mevalonate pathway. The rate-limiting step is the conversion of hydroxymethylglutaryl-Coenzyme A (HMG-CoA) to mevalonate by HMG-CoA reductase.
The drug lovastatin, a potent inhibitor of HMG-CoA reductase, is given to patients to reduce their serum cholesterol levels. Cholesterol derived from de novo biosynthesis or from the diet is transported in the body fluids in the form of lipoprotein particles. These particles also transport triacylglycerols. The particles consist of a core of hydrophobic lipids surrounded by a shell of polar lipids and apolipoproteins. The protein components serve in the solubilization of hydrophobic lipids and also contain cell-targeting signals. Lipoproteins include chylomicrons, chylomicron remnants, very-low-density lipoproteins (VLDL), intermediate-density lipoproteins (IDL), low-density lipoproteins (LDL), and high-density lipoproteins (HDL) (Meyers, supra; Stryer, supra, pp. 691-702). There is a strong inverse correlation between the levels of plasma HDL and risk of premature coronary heart disease.
ApoL is an HDL apolipoprotein expressed in the pancreas (Duchateau, P.N. et al. (1997) J. Biol. Chem.
272:25576-25582).
Most cells outside the liver and intestine take up cholesterol from the blood rather than synthesize it themselves. Cell surface LDL receptors bind LDL particles which are then internalized by endocytosis (Meyers, supra). Absence of the LDL receptor, the cause of the disease familial hypercholesterolemia, leads to increased plasma cholesterol levels and ultimately to atherosclerosis (Stryer, supra, pp. 691-702).
Proteins involved in cholesterol uptake and biosynthesis are tightly regulated in response to cellular cholesterol levels. The sterol regulatory element binding protein (SREBP) is a sterol-responsive transcription factor. Under normal cholesterol conditions, SREBP
resides in the endoplasmic reticulum membrane. When cholesterol levels are low, a regulated cleavage of SREBP
occurs which releases the extracellular domain of the protein. This cleaved domain is then transported to the nucleus where it activates the transcription of the LDL receptor gene, and genes encoding enzymes of cholesterol synthesis, by binding the sterol regulatory element (SRE) upstream of the genes (Yang, J. et al. (1995) J. Biol. Chem. 270:12152-12161). Regulation of cholesterol uptake and biosynthesis also occurs via the oxysterol-binding protein (OSBP). Oxysterols are oxidation products formed during the catabolism of cholesterol, and are involved in regulation of steroid biosynthesis.
OSBP is a high-affinity intracellular receptor for a variety of oxysterols that down-regulate cholesterol synthesis and stimulate cholesterol esterification (Lagace, T.A. et al. (1997) Biochem. J. 326:205-213).
TECHNICAL FIELD
This invention relates to nucleic acid and amino acid sequences of human lipid-associated proteins and to the use of these sequences in the diagnosis, treatment, and prevention of cardiovascular, neurological, and gastrointestinal disorders, and disorders of lipid metabolism.
BACKGROUND OF THE INVENTION
Lipids are water-insoluble, oily, or greasy substances that are soluble in nonpolar solvents such as chloroform or ether. Neutral lipids (triacylglycerols) serve as major fuels and energy stores. Polar lipids, such as phospholipids, sphingolipids, glycolipids, and cholesterol, are key structural components of cell membranes. Lipids and proteins are associated in a variety of ways.
Glycolipids form vesicles that carry proteins within cells and cell membranes. Interactions between lipids and proteins function in targeting proteins and glycolipids involved in a variety of processes, such as cell signaling and cell proliferation, to specific membrane and intracellular locations. Proteins are associated with the biosynthesis, transport, and uptake of lipids. In addition, key proteins involved in signal transduction and protein targeting have lipid-derived groups added to them post-translationally (Stryer, L. (1995) Biochemistry, W.H. Freeman and Co., New York NY, pp. 264-267, 934).
A major class of phospholipids are the phosphoglycerides, which are composed of a glycerol backbone, two fatty acid chains, and a phosphorylated alcohol. Principal phosphoglycerides are phosphatidylcholine, phosphatidylethanolamine, phosphatidylserine, phosphatidylinositol, and diphosphatidylglycerol. Many enzymes involved in phosphoglyceride synthesis are associated with membranes (Meyers, R.A. (1995) Molecular Biolo~~and Biotechnolo~y, VCH
Publishers Inc., New York NY, pp. 494-501; Stryer, supra, pp. 264-267). The enzyme phosphatidylserine decarboxylase catalyzes the conversion of phosphatidylserine to phosphatidylethanolamine, using a pyruvate cofactor.
The two forms of yeast phosphatidylserine decarboxylase are localized to the inner mitochondria) membrane and to the Golgi/vacuole membrane, respectively. The mammalian enzyme, also localized to the inner mitochondria) membrane, is made as a proenzyme and subsequently cleaved to alpha and beta subunits (Voelker, D.R. (1997) Biochim. Biophys. Acta 1348:236-244).
Cholesterol, composed of four fused hydrocarbon rings with an alcohol at one end, moderates the fluidity of membranes in which it is incorporated. In addition, cholesterol is used in the synthesis of such hormones as cortisol, progesterone, estrogen, and testosterone. Bile salts derived from cholesterol facilitate the digestion of lipids. Cholesterol in the skin forms a barrier that prevents excess water evaporation from the body. Farnesyl and geranylgeranyl groups, which are derived from cholesterol biosynthesis intermediates, are post-translationally added to signal transduction proteins such as ras and protein-targeting proteins such as rab. These modifications are important for the activities of these proteins (Guyton, A.C. Textbook of Medical Physiology-(1991) W.B. Saunders Company, Philadelphia PA, pp.760-763; Stryer, supra, pp. 279-280, 691-702, 934).
Mammals obtain cholesterol derived from both de novo biosynthesis and the diet. The liver is the major site of cholesterol biosynthesis in mammals. Biosynthesis is accomplished via a series of enzymatic steps known as the mevalonate pathway. The rate-limiting step is the conversion of hydroxymethylglutaryl-Coenzyme A (HMG-CoA) to mevalonate by HMG-CoA reductase.
The drug lovastatin, a potent inhibitor of HMG-CoA reductase, is given to patients to reduce their serum cholesterol levels. Cholesterol derived from de novo biosynthesis or from the diet is transported in the body fluids in the form of lipoprotein particles. These particles also transport triacylglycerols. The particles consist of a core of hydrophobic lipids surrounded by a shell of polar lipids and apolipoproteins. The protein components serve in the solubilization of hydrophobic lipids and also contain cell-targeting signals. Lipoproteins include chylomicrons, chylomicron remnants, very-low-density lipoproteins (VLDL), intermediate-density lipoproteins (IDL), low-density lipoproteins (LDL), and high-density lipoproteins (HDL) (Meyers, supra; Stryer, supra, pp. 691-702). There is a strong inverse correlation between the levels of plasma HDL and risk of premature coronary heart disease.
ApoL is an HDL apolipoprotein expressed in the pancreas (Duchateau, P.N. et al. (1997) J. Biol. Chem.
272:25576-25582).
Most cells outside the liver and intestine take up cholesterol from the blood rather than synthesize it themselves. Cell surface LDL receptors bind LDL particles which are then internalized by endocytosis (Meyers, supra). Absence of the LDL receptor, the cause of the disease familial hypercholesterolemia, leads to increased plasma cholesterol levels and ultimately to atherosclerosis (Stryer, supra, pp. 691-702).
Proteins involved in cholesterol uptake and biosynthesis are tightly regulated in response to cellular cholesterol levels. The sterol regulatory element binding protein (SREBP) is a sterol-responsive transcription factor. Under normal cholesterol conditions, SREBP
resides in the endoplasmic reticulum membrane. When cholesterol levels are low, a regulated cleavage of SREBP
occurs which releases the extracellular domain of the protein. This cleaved domain is then transported to the nucleus where it activates the transcription of the LDL receptor gene, and genes encoding enzymes of cholesterol synthesis, by binding the sterol regulatory element (SRE) upstream of the genes (Yang, J. et al. (1995) J. Biol. Chem. 270:12152-12161). Regulation of cholesterol uptake and biosynthesis also occurs via the oxysterol-binding protein (OSBP). Oxysterols are oxidation products formed during the catabolism of cholesterol, and are involved in regulation of steroid biosynthesis.
OSBP is a high-affinity intracellular receptor for a variety of oxysterols that down-regulate cholesterol synthesis and stimulate cholesterol esterification (Lagace, T.A. et al. (1997) Biochem. J. 326:205-213).
The copines are phospholipid-binding proteins believed to function in membrane trafficking.
Copines promote lipid vesicle aggregation. They contain a C2 domain associated with membrane activity and an annexin-type domain that mediates interactions between integral and extracellular proteins and is associated with calcium binding and regulation (Creutz, C.E.
(1998) J. Biol. Chem.
273:1393-1402). Other C2-containing proteins include the synaptotagmins, a family of proteins involved in vesicular trafficking. Synaptotagmin concentrations in cerebrospinal fluid have been found to be reduced in early-onset Alzheimer's disease (Gottfries, C.G. et al.
(1998) J. Neural Transm.
105:773-786).
Lipids and their associated proteins have roles in human diseases and disorders. Increased synthesis of long-chain fatty acids occurs in neoplasms including those of the breast, prostate, ovary, colon and endometrium. There is a strong inverse correlation between the levels of plasma HDL and risk of premature coronary heart disease. Absence of the LDL receptor, the cause of familial hypercholesterolemia, leads to increased plasma cholesterol levels and ultimately to atherosclerosis (Stryer, supra, pp. 691-702). The arterial disease atherosclerosis is characterized by the formation of fatty lesions on the inside of the arterial wall. These lesions promote the loss of arterial flexibility and the formation of blood clots (Guyton, supra). Oxysterols are present in human atherosclerotic plaques and believed to play an active role in plaque development (Brown, A.J. (1999) Atherosclerosis 142:1-28). Steatosis, or fatty liver, is characterized by the accumulation of triglycerides in the liver and may occur in association with a variety of conditions including alcoholism, diabetes, obesity, and prolonged parenteral nutrition. Steatosis may lead to fibrosis and cirrhosis of the liver. In Tay-Sachs disease, the GMZ ganglioside (a sphingolipid) accumulates in lysosomes of the central nervous system due to a lack of the enzyme N-acetylhexosaminidase. Patients suffer nervous system degeneration leading to early death (Fauci, A.S. et al. (1998) Harrison's Principles of Internal Medicine McGraw-Hill, New York NY
p. 2171). The Niemann-Pick diseases are caused by defects in lipid metabolism.
Niemann-Pick diseases types A and B are caused by accumulation of sphingomyelin (a sphingolipid) and other lipids in the central nervous system due to a defect in the enzyme sphingomyelinase, leading to neurodegeneration and lung disease. Niemann-Pick disease type C results from a defect in cholesterol transport, leading to the accumulation of sphingomyelin and cholesterol in lysosomes and a secondary reduction in sphingomyelinase activity. Neurological symptoms such as grand mal seizures, ataxia, and loss of previously learned speech, manifest 1-2 years after birth. A mutation in the NPC protein, which contains a putative cholesterol-sensing domain, was found in a mouse model of Niemann-Pick disease type C (Fauci, supra,, p. 2175; Loftus, S.K. et al. (1997) Science 277:232-235).
The discovery of new human lipid-associated proteins and the polynucleotides encoding them satisfies a need in the art by providing new compositions which are useful in the diagnosis, prevention, and treatment of cardiovascular, neurological, and gastrointestinal disorders, and disorders of lipid metabolism.
SUMMARY OF THE INVENTION
The invention features purified polypeptides, human lipid-associated proteins, referred to collectively as "LIPAP" and individually as "LIPAP-I," "LIPAP-2," "LIPAP-3,"
"LIPAP-4," "LIPAP-5," "LIPAP-6," "LIPAP-7," "LIPAP-8," "LIPAP-9," "LIPAP-10," "LIPAP-11," and "LIPAP-12." In one aspect, the invention provides an isolated polypeptide comprising a) an amino acid sequence selected from the group consisting of SEQ ID NO:1-12, b) a naturally occurring amino acid sequence having at least 90% sequence identity to an amino acid sequence selected from the group consisting of SEQ ID NO:1-12, c) a biologically active fragment of an amino acid sequence selected from the group consisting of SEQ ID NO:1-12, or d) an immunogenic fragment of an amino acid sequence selected from the group consisting of SEQ ID NO:1-12. In one alternative, the invention provides an isolated polypeptide comprising the amino acid sequence of SEQ ID NO:1-12.
The invention further provides an isolated polynucleotide encoding a polypeptide comprising a) an amino acid sequence selected from the group consisting of SEQ ID NO:1-12, b) a naturally occurring amino acid sequence having at least 90% sequence identity to an amino acid sequence selected from the group consisting of SEQ ID NO:I-12, c) a biologically active fragment of an amino acid sequence selected from the group consisting of SEQ ID NO:1-12, or d) an immunogenic fragment of an amino acid sequence selected from the group consisting of SEQ ID NO:1-12. In one alternative, the polynucleotide is selected from the group consisting of SEQ ID N0:13-24.
Additionally, the invention provides a recombinant polynucleotide comprising a promoter sequence operably linked to a polynucleotide encoding a polypeptide comprising a) an amino acid sequence selected from the group consisting of SEQ ID NO: I-12, b) a naturally occurring amino acid sequence having at least 90% sequence identity to an amino acid sequence selected from the group consisting of SEQ ID NO:I-12, c) a biologically active fragment of an amino acid sequence selected from the group consisting of SEQ ID NO:1-12, or d) an immunogenic fragment of an amino acid sequence selected from the group consisting of SEQ ID NO:I-12. In one alternative, the invention provides a cell transformed with the recombinant polynucleotide. In another alternative, the invention provides a transgenic organism comprising the recombinant polynucleotide.
The invention also provides a method for producing a polypeptide comprising a) an amino acid sequence selected from the group consisting of SEQ ID NO:1-12, b) a naturally occurnng amino acid sequence having at least 90% sequence identity to an amino acid sequence selected from the group consisting of SEQ ID NO:1-12, c) a biologically active fragment of an amino acid sequence selected from the group consisting of SEQ ID NO:1-12, or d) an immunogenic fragment of an amino acid sequence selected from the group consisting of SEQ ID NO:I-12. The method comprises a) culturing a cell under conditions suitable for expression of the polypeptide, wherein said cell is transformed with a recombinant polynucleotide comprising a promoter sequence operably linked to a polynucleotide encoding the polypeptide, and b) recovering the polypeptide so expressed.
Additionally, the invention provides an isolated antibody which specifically binds to a polypeptide comprising a) an amino acid sequence selected from the group consisting of SEQ ID NO:1-12, b) a naturally occurring amino acid sequence having at least 90% sequence identity to an amino acid sequence selected from the group consisting of SEQ ID NO:1-12, c) a biologically active fragment of an amino acid sequence selected from the group consisting of SEQ ID NO:1-12, or d) an immunogenic fragment of an amino acid sequence selected from the group consisting of SEQ ID NO:1-12.
The invention further provides an isolated polynucleotide comprising a) a polynucleotide sequence selected from the group consisting of SEQ ID N0:13-24, b) a naturally occurring polynucleotide sequence having at least 90% sequence identity to a polynucleotide sequence selected from the group consisting of SEQ ID N0:13-24, c) a polynucleotide sequence complementary to a), or d) a polynucleotide sequence complementary to b). In one alternative, the polynucleotide comprises at least 60 contiguous nucleotides.
Additionally, the invention provides a method for detecting a target polynucleotide in a sample, said target polynucleotide having a sequence of a polynucleotide comprising a) a polynucleotide sequence selected from the group consisting of SEQ ID N0:13-24, b) a naturally occurring polynucleotide sequence having at least 90% sequence identity to a polynucleotide sequence selected from the group consisting of SEQ ID N0:13-24, c) a polynucleotide sequence complementary to a), or d) a polynucleotide sequence complementary to b). The method comprises a) hybridizing the sample with a probe comprising at least 16 contiguous nucleotides comprising a sequence complementary to said target polynucleotide in the sample, and which probe specifically hybridizes to said target polynucleotide, under conditions whereby a hybridization complex is formed between said probe and said target polynucleotide, and b) detecting the presence or absence of said hybridization complex, and optionally, if present, the amount thereof. In one alternative, the probe comprises at least 30 contiguous nucleotides. In another alternative, the probe comprises at least 60 contiguous nucleotides.
The invention further provides a pharmaceutical composition comprising an effective amount of a polypeptide comprising a) an amino acid sequence selected from the group consisting of SEQ ID
NO:l-12, b) a naturally occurring amino acid sequence having at least 90%
sequence identity to an amino acid sequence selected from the group consisting of SEQ ID NO:1-12, c) a biologically active fragment of an amino acid sequence selected from the group consisting of SEQ
ID NO:1-12, or d) an immunogenic fragment of an amino acid sequence selected from the group consisting of SEQ ID NO:1-12, and a pharmaceutically acceptable excipient. The invention additionally provides a method of treating a disease or condition associated with decreased expression of functional LIPAP, comprising administering to a patient in need of such treatment the pharmaceutical composition.
The invention also provides a method for screening a compound for effectiveness as an agonist of a polypeptide comprising a) an amino acid sequence selected from the group consisting of SEQ ID
NO:1-12, b) a naturally occurring amino acid sequence having at least 90%
sequence identity to an amino acid sequence selected from the group consisting of SEQ ID NO:1-12, c) a biologically active fragment of an amino acid sequence selected from the group consisting of SEQ
ID NO:1-12, or d) an immunogenic fragment of an amino acid sequence selected from the group consisting of SEQ ID NO:1-12. The method comprises a) exposing a sample comprising the polypeptide to a compound, and b) detecting agonist activity in the sample. In one alternative, the invention provides a pharmaceutical composition comprising an agonist compound identified by the method and a pharmaceutically acceptable excipient. In another alternative, the invention provides a method of treating a disease or condition associated with decreased expression of functional LIPAP, comprising administering to a patient in need of such treatment the pharmaceutical composition.
Additionally, the invention provides a method for screening a compound for effectiveness as an antagonist of a polypeptide comprising a) an amino acid sequence selected from the group consisting of SEQ ID NO: l-12, b) a naturally occurring amino acid sequence having at least 90% sequence identity to an amino acid sequence selected from the group consisting of SEQ ID NO:1-12, c) a biologically active fragment of an amino acid sequence selected from the group consisting of SEQ ID NO:1-12, or d) an immunogenic fragment of an amino acid sequence selected from the group consisting of SEQ ID
NO:1-12. The method comprises a) exposing a sample comprising the polypeptide to a compound, and b) detecting antagonist activity in the sample. In one alternative, the invention provides a pharmaceutical composition comprising an antagonist compound identified by the method and a pharmaceutically acceptable excipient. In another alternative, the invention provides a method of treating a disease or condition associated with overexpression of functional LIPAP, comprising administering to a patient in need of such treatment the pharmaceutical composition.
The invention further provides a method for screening a compound for effectiveness in altering expression of a target polynucleotide, wherein said target polynucleotide comprises a sequence selected from the group consisting of SEQ ID N0:13-24, the method comprising a) exposing a sample comprising the target polynucleotide to a compound, and b) detecting altered expression of the target polynucleotide.
BRIEF DESCRIPTION OF THE TABLES
Table 1 shows polypeptide and nucleotide sequence identification numbers (SEQ
ID NOs), clone identification numbers (clone IDs), cDNA libraries, and cDNA fragments used to assemble full-WO 00/49043 PCT/iJS00/04160 length sequences encoding LIPAP.
Table 2 shows features of each polypeptide sequence, including potential motifs, homologous sequences, and methods, algorithms, and searchable databases used for analysis of LIPAP.
Table 3 shows selected fragments of each nucleic acid sequence; the tissue-specific expression patterns of each nucleic acid sequence as determined by northern analysis;
diseases, disorders, or conditions associated with these tissues; and the vector into which each cDNA
was cloned.
Table 4 describes the tissues used to construct the cDNA libraries from which cDNA clones encoding LIPAP were isolated.
Table 5 shows the tools, programs, and algorithms used to analyze LIPAP, along with applicable descriptions, references, and threshold parameters.
DESCRIPTION OF THE INVENTION
Before the present proteins, nucleotide sequences, and methods are described, it is understood that this invention is not limited to the particular machines, materials and methods described, as these may vary. It is also to be understood that the terminology used herein is for the purpose of describing particular embodiments only, and is not intended to limit the scope of the present invention which will be limited only by the appended claims.
It must be noted that as used herein and in the appended claims, the singular forms "a," "an,"
and "the" include plural reference unless the context clearly dictates otherwise. Thus, for example, a reference to "a host cell" includes a plurality of such host cells, and a reference to "an antibody" is a reference to one or more antibodies and equivalents thereof known to those skilled in the art, and so forth.
Unless defined otherwise, all technical and scientific terms used herein have the same meanings as commonly understood by one of ordinary skill in the art to which this invention belongs.
Although any machines, materials, and methods similar or equivalent to those described herein can be used to practice or test the present invention, the preferred machines, materials and methods are now described. All publications mentioned herein are cited for the purpose of describing and disclosing the cell lines, protocols, reagents and vectors which are reported in the publications and which might be used in connection with the invention. Nothing herein is to be construed as an admission that the invention is not entitled to antedate such disclosure by virtue of prior invention.
DEFINITIONS
"LIPAP" refers to the amino acid sequences of substantially purified LIPAP
obtained from any species, particularly a mammalian species, including bovine, ovine, porcine, murine, equine, and human, and from any source, whether natural, synthetic, semi-synthetic, or recombinant.
The term "agonist" refers to a molecule which intensifies or mimics the biological activity of WO 00/49043 CA 02362300 2001-07-31 pCT~S00/04160 LIPAP. Agonists may include proteins, nucleic acids, carbohydrates, small molecules, or any other compound or composition which modulates the activity of LIPAP either by directly interacting with LIPAP or by acting on components of the biological pathway in which LIPAP
participates.
An "allelic variant" is an alternative form of the gene encoding LIPAP.
Allelic variants may result from at least one mutation in the nucleic acid sequence and may result in altered mRNAs or in polypeptides whose structure or function may or may not be altered. A gene may have none, one, or many allelic variants of its naturally occurring form. Common mutational changes which give rise to allelic variants are generally ascribed to natural deletions, additions, or substitutions of nucleotides.
Each of these types of changes may occur alone, or in combination with the others, one or more times in a given sequence.
"Altered" nucleic acid sequences encoding LIPAP include those sequences with deletions, insertions, or substitutions of different nucleotides, resulting in a polypeptide the same as LIPAP or a polypeptide with at least one functional characteristic of LIPAP. Included within this definition are polymorphisms which may or may not be readily detectable using a particular oligonucleotide probe of the polynucleotide encoding LIPAP, and improper or unexpected hybridization to allelic variants, with a locus other than the normal chromosomal locus for the polynucleotide sequence encoding LIPAP.
The encoded protein may also be "altered," and may contain deletions, insertions, or substitutions of amino acid residues which produce a silent change and result in a functionally equivalent LIPAP.
Deliberate amino acid substitutions may be made on the basis of similarity in polarity, charge, solubility, hydrophobicity, hydrophilicity, and/or the amphipathic nature of the residues, as long as the biological or immunological activity of LIPAP is retained. For example, negatively charged amino acids may include aspartic acid and glutamic acid, and positively charged amino acids may include lysine and arginine. Amino acids with uncharged polar side chains having similar hydrophilicity values may include: asparagine and glutamine; and serine and threonine. Amino acids with uncharged side chains having similar hydrophilicity values may include: leucine, isoleucine, and valine; glycine and alanine; and phenylalanine and tyrosine.
The terms "amino acid" and "amino acid sequence" refer to an oligopeptide, peptide, polypeptide, or protein sequence, or a fragment of any of these, and to naturally occurring or synthetic molecules. Where "amino acid sequence" is recited to refer to an amino acid sequence of a naturally occurring protein molecule, "amino acid sequence" and like terms are not meant to limit the amino acid sequence to the complete native amino acid sequence associated with the recited protein molecule.
"Amplification" relates to the production of additional copies of a nucleic acid sequence.
Amplification is generally carried out using polymerise chain reaction (PCR) technologies well known in the art.
The term "antagonist" refers to a molecule which inhibits or attenuates the biological activity of LIPAP. Antagonists may include proteins such as antibodies, nucleic acids, carbohydrates, small molecules, or any other compound or composition which modulates the activity of LIPAP either by directly interacting with LIPAP or by acting on components of the biological pathway in which LIPAP
participates.
The term "antibody" refers to intact immunoglobulin molecules as well as to fragments thereof, such as Fab, F(ab')2, and Fv fragments, which are capable of binding an epitopic determinant.
Antibodies that bind LIPAP polypeptides can be prepared using intact polypeptides or using fragments containing small peptides of interest as the immunizing antigen. The polypeptide or oligopeptide used to immunize an animal (e.g., a mouse, a rat, or a rabbit) can be derived from the translation of RNA, or synthesized chemically, and can be conjugated to a carrier protein if desired.
Commonly used carriers that are chemically coupled to peptides include bovine serum albumin, thyroglobulin, and keyhole limpet hemocyanin (KLH). The coupled peptide is then used to immunize the animal.
The term "antigenic determinant" refers to that region of a molecule (i.e., an epitope) that makes contact with a particular antibody. When a protein or a fragment of a protein is used to immunize a host animal, numerous regions of the protein may induce the production of antibodies which bind specifically to antigenic determinants (particular regions or three-dimensional structures on the protein). An antigenic determinant may compete with the intact antigen (i.e., the immunogen used to elicit the immune response) for binding to an antibody.
The term "antisense" refers to any composition capable of base-pairing with the "sense" strand of a specific nucleic acid sequence. Antisense compositions may include DNA;
RNA; peptide nucleic acid (PNA); oligonucleotides having modified backbone linkages such as phosphorothioates, methylphosphonates, or benzylphosphonates; oligonucleotides having modified sugar groups such as 2'-methoxyethyl sugars or 2'-methoxyethoxy sugars; or oligonucleotides having modified bases such as 5-methyl cytosine, 2'-deoxyuracil, or 7-deaza-2'-deoxyguanosine. Antisense molecules may be produced by any method including chemical synthesis or transcription. Once introduced into a cell, the complementary antisense molecule base-pairs with a naturally occurring nucleic acid sequence produced by the cell to form duplexes which block either transcription or translation. The designation "negative" or "minus" can refer to the antisense strand, and the designation "positive" or "plus" can refer to the sense strand of a reference DNA molecule.
The term "biologically active" refers to a protein having structural, regulatory, or biochemical functions of a naturally occurring molecule. Likewise, "immunologically active" refers to the capability of the natural, recombinant, or synthetic LIPAP, or of any oligopeptide thereof, to induce a specific immune response in appropriate animals or cells and to bind with specific antibodies.
The terms "complementary" and "complementarity" refer to the natural binding of polynucleotides by base pairing. For example, the sequence "5' A-G-T 3"' bonds to the complementary sequence "3' T-C-A 5'." Complementarity between two single-stranded molecules may be "partial,"
such that only some of the nucleic acids bind, or it may be "complete," such that total complementarity exists between the single stranded molecules. The degree of complementarity between nucleic acid strands has significant effects on the efficiency and strength of the hybridization between the nucleic acid strands. This is of particular importance in amplification reactions, which depend upon binding between nucleic acid strands, and in the design and use of peptide nucleic acid (PNA) molecules.
A "composition comprising a given polynucleotide sequence" and a "composition comprising a given amino acid sequence" refer broadly to any composition containing the given polynucleotide or amino acid sequence. The composition may comprise a dry formulation or an aqueous solution.
Compositions comprising polynucleotide sequences encoding LIPAP or fragments of LIPAP may be employed as hybridization probes. The probes may be stored in freeze-dried form and may be associated with a stabilizing agent such as a carbohydrate. In hybridizations, the probe may be deployed in an aqueous solution containing salts (e.g., NaCI), detergents (e.g., sodium dodecyl sulfate;
SDS), and other components (e.g., Denhardt's solution, dry milk, salmon sperm DNA, etc.).
"Consensus sequence" refers to a nucleic acid sequence which has been resequenced to resolve uncalled bases, extended using the XL-PCR kit (Perkin-Elmer, Norwalk CT) in the 5' and/or the 3' direction, and resequenced, or which has been assembled from the overlapping sequences of one or more Incyte Clones and, in some cases, one or more public domain ESTs, using a computer program for fragment assembly, such as the GELVIEW fragment assembly system (GCG, Madison WI). Some sequences have been both extended and assembled to produce the consensus sequence.
"Conservative amino acid substitutions" are those substitutions that, when made, least interfere with the properties of the original protein, i.e., the structure and especially the function of the protein is conserved and not significantly changed by such substitutions. The table below shows amino acids which may be substituted for an original amino acid in a protein and which are regarded as conservative amino acid substitutions.
Original Residue Conservative Substitution Ala Gly, Ser Arg His, Lys Asn Asp, Gln, His Asp Asn, Glu Cys Ala, Ser Gln Asn, Glu, His Glu Asp, Gln, His Gly Ala His Asn, Arg, Gln, Glu Ile Leu, Val Leu Ile, Val Lys Arg, Gln, Glu Met Leu, Ile Phe His, Met, Leu, Trp, Tyr Ser Cys, Thr Thr Ser, Val Trp Phe, Tyr Tyr His, Phe, Trp Val Ile, Leu, Thr Conservative amino acid substitutions generally maintain (a) the structure of the polypeptide backbone in the area of the substitution, for example, as a beta sheet or alpha helical conformation, (b) the charge or hydrophobicity of the molecule at the site of the substitution, and/or (c) the bulk of the side chain.
A "deletion" refers to a change in the amino acid or nucleotide sequence that results in the absence of one or more amino acid residues or nucleotides.
The term "derivative" refers to the chemical modification of a polypeptide sequence, or a polynucleotide sequence. Chemical modifications of a polynucleotide sequence can include, for example, replacement of hydrogen by an alkyl, acyl, hydroxyl, or amino group.
A derivative polynucleotide encodes a polypeptide which retains at least one biological or immunological function of the natural molecule. A derivative polypeptide is one modified by glycosylation, pegylation, or any similar process that retains at least one biological or immunological function of the polypeptide from which it was derived.
A "fragment" is a unique portion of LIPAP or the polynucleotide encoding LIPAP
which is identical in sequence to but shorter in length than the parent sequence. A
fragment may comprise up to the entire length of the defined sequence, minus one nucleotide/amino acid residue. For example, a fragment may comprise from 5 to 1000 contiguous nucleotides or amino acid residues. A fragment used as a probe, primer, antigen, therapeutic molecule, or for other purposes, may be at least 5, 10, 15, 20, 25, 30, 40, 50, 60, 75, 100, 150, 250 or at least 500 contiguous nucleotides or amino acid residues in length. Fragments may be preferentially selected from certain regions of a molecule. For example, a polypeptide fragment may comprise a certain length of contiguous amino acids selected from the first 250 or 500 amino acids (or first 25% or 50% of a polypeptide) as shown in a certain defined sequence.
Clearly these lengths are exemplary, and any length that is supported by the specification, including the Sequence Listing, tables, and figures, may be encompassed by the present embodiments.
A fragment of SEQ ID N0:13-24 comprises a region of unique polynucleotide sequence that specifically identifies SEQ ID N0:13-24, for example, as distinct from any other sequence in the same genome. A fragment of SEQ ID N0:13-24 is useful, for example, in hybridization and amplification technologies and in analogous methods that distinguish SEQ ID N0:13-24 from related polynucleotide sequences. The precise length of a fragment of SEQ ID N0:13-24 and the region of SEQ ID N0:13-24 to which the fragment corresponds are routinely determinable by one of ordinary skill in the art based on the intended purpose for the fragment.
A fragment of SEQ ID NO:1-12 is encoded by a fragment of SEQ ID N0:13-24. A
fragment of SEQ ID NO:1-12 comprises a region of unique amino acid sequence that specifically identifies SEQ
ID NO:1-12. For example, a fragment of SEQ ID NO:1-12 is useful as an immunogenic peptide for the development of antibodies that specifically recognize SEQ ID NO:1-12. The precise length of a fragment of SEQ ID NO:1-12 and the region of SEQ ID NO:1-12 to which the fragment corresponds are routinely determinable by one of ordinary skill in the art based on the intended purpose for the fragment.
The term "similarity" refers to a degree of complementarity. There may be partial similarity or complete similarity. The word "identity" may substitute for the word "similarity." A partially complementary sequence that at least partially inhibits an identical sequence from hybridizing to a target nucleic acid is referred to as "substantially similar." The inhibition of hybridization of the completely complementary sequence to the target sequence may be examined using a hybridization assay (Southern or northern blot, solution hybridization, and the like) under conditions of reduced stringency. A substantially similar sequence or hybridization probe will compete for and inhibit the binding of a completely similar (identical) sequence to the target sequence under conditions of reduced stringency. This is not to say that conditions of reduced stringency are such that non-specific binding is permitted, as reduced stringency conditions require that the binding of two sequences to one another be a specific (i.e., a selective) interaction. The absence of non-specific binding may be tested by the use of a second target sequence which lacks even a partial degree of complementarity (e.g., less than about 30% similarity or identity). In the absence of non-specific binding, the substantially similar sequence or probe will not hybridize to the second non-complementary target sequence.
The phrases "percent identity" and "% identity," as applied to polynucleotide sequences, refer to the percentage of residue matches between at least two polynucleotide sequences aligned using a standardized algorithm. Such an algorithm may insert, in a standardized and reproducible way, gaps in the sequences being compared in order to optimize alignment between two sequences, and therefore achieve a more meaningful comparison of the two sequences.
Percent identity between polynucleotide sequences may be determined using the default parameters of the CLUSTAL V algorithm as incorporated into the MEGALIGN
version 3.12e sequence alignment program. This program is part of the LASERGENE software package, a suite of molecular biological analysis programs (DNASTAR, Madison WI). CLUSTAL V is described in Higgins, D.G.
and P.M. Sharp (1989) CABIOS 5:151-153 and in Higgins, D.G. et al. (1992) CABIOS 8:189-191. For pairwise alignments of polynucleotide sequences, the default parameters are set as follows: Ktuple=2, gap penalty=5, window=4, and "diagonals saved"=4. The "weighted" residue weight table is selected as the default. Percent identity is reported by CLUSTAL V as the "percent similarity" between aligned polynucleotide sequence pairs.
Alternatively, a suite of commonly used and freely available sequence comparison algorithms is provided by the National Center for Biotechnology Information (NCBI) Basic Local Alignment Search Tool (BLAST) (Altschul, S.F. et al. (1990) J. Mol. Biol. 215:403-410), which is available from several sources, including the NCBI, Bethesda, MD, and on the Internet at http://www.ncbi.nlm.nih.gov/BLAST/. The BLAST software suite includes various sequence analysis programs including "blastn," that is used to align a known polynucleotide sequence with other polynucleotide sequences from a variety of databases. Also available is a tool called "BLAST 2 Sequences" that is used for direct pairwise comparison of two nucleotide sequences. "BLAST 2 Sequences" can be accessed and used interactively at http://www.ncbi.nlm.nih.gov/gorf/bl2.html. The "BLAST 2 Sequences" tool can be used for both blastn and blastp (discussed below). BLAST
programs are commonly used with gap and other parameters set to default settings. For example, to compare two nucleotide sequences, one may use blastn with the "BLAST 2 Sequences" tool Version 2Ø9 (May-07-1999) set at default parameters. Such default parameters may be, for example:
Matrix: BLOSUM62 Reward for match: 1 Penalty for mismatch: -2 Open Gap: 5 and Extension Gap: 2 penalties Gap x drop-off: 50 Expect: l0 Word Size: Il Filter: on Percent identity may be measured over the length of an entire defined sequence, for example, as defined by a particular SEQ ID number, or may be measured over a shorter length, for example, over the length of a fragment taken from a larger, defined sequence, for instance, a fragment of at least 20, at least 30, at least 40, at least 50, at least 70, at least 100, or at least 200 contiguous nucleotides. Such lengths are exemplary only, and it is understood that any fragment length supported by the sequences shown herein, in the tables, figures, or Sequence Listing, may be used to describe a length over which percentage identity may be measured.
Nucleic acid sequences that do not show a high degree of identity may nevertheless encode similar amino acid sequences due to the degeneracy of the genetic code. It is understood that changes in a nucleic acid sequence can be made using this degeneracy to produce multiple nucleic acid sequences that all encode substantially the same protein.
The phrases "percent identity" and "% identity," as applied to polypeptide sequences, refer to the percentage of residue matches between at least two polypeptide sequences aligned using a standardized algorithm. Methods of polypeptide sequence alignment are well-known. Some alignment methods take into account conservative amino acid substitutions. Such conservative substitutions, explained in more detail above, generally preserve the hydrophobicity and acidity at the site of substitution, thus preserving the structure (and therefore function) of the polypeptide.
Percent identity between polypeptide sequences may be determined using the default parameters of the CLUSTAL V algorithm as incorporated into the MEGALIGN
version 3.12e sequence alignment program (described and referenced above). For pairwise alignments of polypeptide sequences using CLUSTAL V, the default parameters are set as follows:
Ktuple=I, gap penalty=3, window=5, and "diagonals saved"=5. The PAM250 matrix is selected as the default residue weight table. As with polynucleotide alignments, the percent identity is reported by CLUSTAL V as the "percent similarity" between aligned polypeptide sequence pairs.
Alternatively the NCBI BLAST software suite may be used. For example, for a pairwise comparison of two polypeptide sequences, one may use the "BLAST 2 Sequences"
tool Version 2Ø9 (May-07-1999) with blastp set at default parameters. Such default parameters may be, for example:
Matrix: BLOSUM62 Open Gap: 11 and Extension Gap: 1 penalties Gap x drop-off: 50 Expect: 10 Word Size: 3 Filter: on Percent identity may be measured over the length of an entire defined polypeptide sequence, for example, as defined by a particular SEQ ID number, or may be measured over a shorter length, for example, over the length of a fragment taken from a larger, defined polypeptide sequence, for instance, a fragment of at least 15, at least 20, at least 30, at least 40, at least 50, at least 70 or at least I50 contiguous residues. Such lengths are exemplary only, and it is understood that any fragment length supported by the sequences shown herein, in the tables, figures or Sequence Listing, may be used to describe a length over which percentage identity may be measured.
"Human artificial chromosomes" (HACs) are linear microchromosomes which may contain DNA sequences of about 6 kb to 10 Mb in size, and which contain all of the elements required for stable mitotic chromosome segregation and maintenance.
The term "humanized antibody" refers to antibody molecules in which the amino acid sequence in the non-antigen binding regions has been altered so that the antibody more closely resembles a human antibody, and still retains its original binding ability.
"Hybridization" refers to the process by which a polynucleotide strand anneals with a complementary strand through base pairing under defined hybridization conditions. Specific WO 00/49043 PCT/iJS00/04160 hybridization is an indication that two nucleic acid sequences share a high degree of identity. Specific hybridization complexes form under permissive annealing conditions and remain hybridized after the "washing" step(s). The washing steps) is particularly important in determining the stringency of the hybridization process, with more stringent conditions allowing less non-specific binding, i.e., binding between pairs of nucleic acid strands that are not perfectly matched.
Permissive conditions for annealing of nucleic acid sequences are routinely determinable by one of ordinary skill in the art and may be consistent among hybridization experiments, whereas wash conditions may be varied among experiments to achieve the desired stringency, and therefore hybridization specificity. Permissive annealing conditions occur, for example, at 68°C in the presence of about 6 x SSC, about I % (w/v) SDS, and about 100 pg/ml denatured salmon sperm DNA.
Generally, stringency of hybridization is expressed, in part, with reference to the temperature under which the wash step is carried out. Generally, such wash temperatures are selected to be about 5°C to 20°C lower than the thermal melting point (Tm) for the specific sequence at a defined ionic strength and pH. The Tm is the temperature (under defined ionic strength and pH) at which 50% of the target sequence hybridizes to a perfectly matched probe. An equation for calculating Tm and conditions for nucleic acid hybridization are well known and can be found in Sambrook et al., 1989, Molecular CloningYA Laboratory Manual, 2°d ed., vol. 1-3, Cold Spring Harbor Press, Plainview NY; specifically see volume 2, chapter 9.
High stringency conditions for hybridization between polynucleotides of the present invention include wash conditions of 68°C in the presence of about 0.2 x SSC and about 0.1 % SDS, for I hour.
Alternatively, temperatures of about 65°C, 60°C, 55°C, or 42°C may be used. SSC concentration may be varied from about 0. I to 2 x SSC, with SDS being present at about 0.1 %.
Typically, blocking reagents are used to block non-specific hybridization. Such blocking reagents include, for instance, denatured salmon sperm DNA at about 100-200 pg/ml. Organic solvent, such as formamide at a concentration of about 35-50% v/v, may also be used under particular circumstances, such as for RNA:DNA hybridizations. Useful variations on these wash conditions will be readily apparent to those of ordinary skill in the art. Hybridization, particularly under high stringency conditions, may be suggestive of evolutionary similarity between the nucleotides. Such similarity is strongly indicative of a similar role for the nucleotides and their encoded polypeptides.
The term "hybridization complex" refers to a complex formed between two nucleic acid sequences by virtue of the formation of hydrogen bonds between complementary bases. A
hybridization complex may be formed in solution (e.g., Cot or Rot analysis) or formed between one nucleic acid sequence present in solution and another nucleic acid sequence immobilized on a solid support (e.g., paper, membranes, filters, chips, pins or glass slides, or any other appropriate substrate to which cells or their nucleic acids have been fixed).
The words "insertion" and "addition" refer to changes in an amino acid or nucleotide sequence resulting in the addition of one or more amino acid residues or nucleotides, respectively.
"Immune response" can refer to conditions associated with inflammation, trauma, immune disorders, or infectious or genetic disease, etc. These conditions can be characterized by expression of various factors, e.g., cytokines, chemokines, and other signaling molecules, which may affect cellular and systemic defense systems.
The term "microarray" refers to an arrangement of distinct polynucleotides on a substrate.
The terms "element" and "array element" in a microarray context, refer to hybridizable polynucleotides arranged on the surface of a substrate.
The term "modulate" refers to a change in the activity of LIPAP. For example, modulation may cause an increase or a decrease in protein activity, binding characteristics, or any other biological, functional, or immunological properties of LIPAP.
The phrases "nucleic acid" and "nucleic acid sequence" refer to a nucleotide, oligonucleotide, polynucleotide, or any fragment thereof. These phrases also refer to DNA or RNA of genomic or synthetic origin which may be single-stranded' or double-stranded and may represent the sense or the antisense strand, to peptide nucleic acid (PNA), or to any DNA-like or RNA-like material.
"Operably linked" refers to the situation in which a first nucleic acid sequence is placed in a functional relationship with the second nucleic acid sequence. For instance, a promoter is operably linked to a coding sequence if the promoter affects the transcription or expression of the coding sequence. Generally, operably linked DNA sequences may be in close proximity or contiguous and, where necessary to join two protein coding regions, in the same reading frame.
"Peptide nucleic acid" (PNA) refers to an antisense molecule or anti-gene agent which comprises an oligonucleotide of at least about 5 nucleotides in length linked to a peptide backbone of amino acid residues ending in lysine. The terminal lysine confers solubility to the composition. PNAs preferentially bind complementary single stranded DNA or RNA and stop transcript elongation, and may be pegylated to extend their lifespan in the cell.
"Probe" refers to nucleic acid sequences encoding LIPAP, their complements, or fragments thereof, which are used to detect identical, allelic or related nucleic acid sequences. Probes are isolated oligonucleotides or polynucleotides attached to a detectable label or reporter molecule. Typical labels include radioactive isotopes, ligands, chemiluminescent agents, and enzymes.
"Primers" are short nucleic acids, usually DNA oligonucleotides, which may be annealed to a target polynucleotide by complementary base-pairing. The primer may then be extended along the target DNA strand by a DNA
polymerase enzyme. Primer pairs can be used for amplification (and identification) of a nucleic acid sequence, e.g., by the polymerase chain reaction (PCR).
Probes and primers as used in the present invention typically comprise at least IS contiguous nucleotides of a known sequence. In order to enhance specificity, longer probes and primers may also be employed, such as probes and primers that comprise at least 20, 25, 30, 40, 50, 60, 70, 80, 90, 100, or at least 150 consecutive nucleotides of the disclosed nucleic acid sequences. Probes and primers may be considerably longer than these examples, and it is understood that any length supported by the specification, including the tables, figures, and Sequence Listing, may be used.
Methods for preparing and using probes and primers are described in the references, for example Sambrook et al., 1989, Molecular Cloning: A Laboratory Manual, 2"d ed., vol. 1-3, Cold Spring Harbor Press, Plainview NY; Ausubel et a1.,1987, Current Protocols in Molecular Biolo~y, Greene Publ. Assoc. & Wiley-Intersciences, New York NY; Innis et al., 1990, PCR Protocols, A Guide to Methods and Applications, Academic Press, San Diego CA. PCR primer pairs can be derived from a known sequence, for example, by using computer programs intended for that purpose such as Primer (Version 0.5, 1991, Whitehead Institute for Biomedical Research, Cambridge MA).
Oligonucleotides for use as primers are selected using software known in the art for such purpose. For example, OLIGO 4.06 software is useful for the selection of PCR
primer pairs of up to 100 nucleotides each, and for the analysis of oligonucleotides and larger polynucleotides of up to 5,000 nucleotides from an input polynucleotide sequence of up to 32 kilobases.
Similar primer selection programs have incorporated additional features for expanded capabilities. For example, the PrimOU
primer selection program (available to the public from the Genome Center at University of Texas South West Medical Center, Dallas TX) is capable of choosing specific primers from megabase sequences and is thus useful for designing primers on a genome-wide scope. The Primer3 primer selection program (available to the public from the Whitehead Institute/MIT Center for Genome Research, Cambridge MA) allows the user to input a "mispriming library," in which sequences to avoid as primer binding sites are user-specified. Primer3 is useful, in particular, for the selection of oligonucleotides for microarrays. (The source code for the latter two primer selection programs may also be obtained from their respective sources and modified to meet the user's specific needs.) The PrimeGen program (available to the public from the UK Human Genome Mapping Project Resource Centre, Cambridge UK) designs primers based on multiple sequence alignments, thereby allowing selection of primers that hybridize to either the most conserved or least conserved regions of aligned nucleic acid sequences.
Hence, this program is useful for identification of both unique and conserved oligonucleotides and polynucleotide fragments. The oligonucleotides and polynucleotide fragments identified by any of the above selection methods are useful in hybridization technologies, for example, as PCR or sequencing primers, microarray elements, or specific probes to identify fully or partially complementary polynucleotides in a sample of nucleic acids. Methods of oligonucleotide selection are not limited to those described above.
A "recombinant nucleic acid" is a sequence that is not naturally occurring or has a sequence that is made by an artificial combination of two or more otherwise separated segments of sequence.
This artificial combination is often accomplished by chemical synthesis or, more commonly, by the artificial manipulation of isolated segments of nucleic acids, e.g., by genetic engineering techniques such as those described in Sambrook, supra. The term recombinant includes nucleic acids that have been altered solely by addition, substitution, or deletion of a portion of the nucleic acid. Frequently, a recombinant nucleic acid may include a nucleic acid sequence operably linked to a promoter sequence.
Such a recombinant nucleic acid may be part of a vector that is used, for example, to transform a cell.
Alternatively, such recombinant nucleic acids may be part of a viral vector, e.g., based on a vaccinia virus, that could be use to vaccinate a mammal wherein the recombinant nucleic acid is expressed, inducing a protective immunological response in the mammal.
An "RNA equivalent," in reference to a DNA sequence, is composed of the same linear sequence of nucleotides as the reference DNA sequence with the exception that all occurrences of the nitrogenous base thymine are replaced with uracil, and the sugar backbone is composed of ribose instead of deoxyribose.
The term "sample" is used in its broadest sense. A sample suspected of containing nucleic acids encoding LIPAP, or fragments thereof, or LIPAP itself, may comprise a bodily fluid; an extract from a cell, chromosome, organelle, or membrane isolated from a cell; a cell;
genomic DNA, RNA, or cDNA, in solution or bound to a substrate; a tissue; a tissue print; etc.
The terms "specific binding" and "specifically binding" refer to that interaction between a protein or peptide and an agonist, an antibody, an antagonist, a small molecule, or any natural or synthetic binding composition. The interaction is dependent upon the presence of a particular structure of the protein, e.g., the antigenic determinant or epitope, recognized by the binding molecule. For example, if an antibody is specific for epitope "A," the presence of a polypeptide containing the epitope A, or the presence of free unlabeled A, in a reaction containing free labeled A and the antibody will reduce the amount of labeled A that binds to the antibody.
The term "substantially purified" refers to nucleic acid or amino acid sequences that are removed from their natural environment and are isolated or separated, and are at least 60% free, preferably at least 75% free, and most preferably at least 90% free from other components with which they are naturally associated.
A "substitution" refers to the replacement of one or more amino acids or nucleotides by different amino acids or nucleotides, respectively.
"Substrate" refers to any suitable rigid or semi-rigid support including membranes, filters, chips, slides, wafers, fibers, magnetic or nonmagnetic beads, gels, tubing, plates, polymers, microparticles and capillaries. The substrate can have a variety of surface forms, such as wells, trenches, pins, channels and pores, to which polynucleotides or polypeptides are bound.
"Transformation" describes a process by which exogenous DNA enters and changes a recipient cell. Transformation may occur under natural or artificial conditions according to various methods well known in the art, and may rely on any known method for the insertion of foreign nucleic acid sequences into a prokaryotic or eukaryotic host cell. The method for transformation is selected based on the type of host cell being transformed and may include, but is not limited to, viral infection, electroporation, heat shock, lipofection, and particle bombardment. The term "transformed"
cells includes stably transformed cells in which the inserted DNA is capable of replication either as an autonomously replicating plasmid or as part of the host chromosome, as well as transiently transformed cells which express the inserted DNA or RNA for limited periods of time.
A "transgenic organism," as used herein, is any organism, including but not limited to animals and plants, in which one or more of the cells of the organism contains heterologous nucleic acid introduced by way of human intervention, such as by transgenic techniques well known in the art. The nucleic acid is introduced into the cell, directly or indirectly by introduction into a precursor of the cell, by way of deliberate genetic manipulation, such as by microinjection or by infection with a recombinant virus. The term genetic manipulation does not include classical cross-breeding, or in vitro fertilization, but rather is directed to the introduction of a recombinant DNA
molecule. The transgenic organisms contemplated in accordance with the present invention include bacteria, cyanobacteria, fungi, and plants and animals. The isolated DNA of the present invention can be introduced into the host by methods known in the art, for example infection, transfection, transformation or transconjugation. Techniques for transferring the DNA of the present invention into such organisms are widely known and provided in references such as Sambrook et al. (1989), supra.
A "variant" of a particular nucleic acid sequence is defined as a nucleic acid sequence having at least 40% sequence identity to the particular nucleic acid sequence over a certain length of one of the nucleic acid sequences using blastn with the "BLAST 2 Sequences" tool Version 2Ø9 (May-07-1999) set at default parameters. Such a pair of nucleic acids may show, for example, at least 50%, at least 60%, at least 70%, at least 80%, at least 85%, at least 90%, at least 95% or at least 98% or greater sequence identity over a certain defined length. A variant may be described as, for example, an "allelic" (as defined above), "splice," "species," or "polymorphic" variant. A
splice variant may have significant identity to a reference molecule, but will generally have a greater or lesser number of polynucleotides due to alternate splicing of exons during mRNA processing. The corresponding polypeptide may possess additional functional domains or lack domains that are present in the reference molecule. Species variants are polynucleotide sequences that vary from one species to another. The resulting polypeptides generally will have significant amino acid identity relative to each other. A
polymorphic variant is a variation in the polynucleotide sequence of a particular gene between individuals of a given species. Polymorphic variants also may encompass "single nucleotide polymorphisms" (SNPs) in which the polynucleotide sequence varies by one nucleotide base. The presence of SNPs may be indicative of, for example, a certain population, a disease state, or a propensity for a disease state.
A "variant" of a particular polypeptide sequence is defined as a polypeptide sequence having at least 40% sequence identity to the particular polypeptide sequence over a certain length of one of the polypeptide sequences using blastp with the "BLAST 2 Sequences" tool Version 2Ø9 (May-07-1999) set at default parameters. Such a pair of polypeptides may show, for example, at least 50%, at least 60%, at least 70%, at least 80%, at least 90%, at least 95%, or at least 98%
or greater sequence identity over a certain defined length of one of the polypeptides.
THE INVENTION
The invention is based on the discovery of new human lipid-associated proteins (LIPAP), the polynucleotides encoding LIPAP, and the use of these compositions for the diagnosis, treatment, or prevention of cardiovascular, neurological, and gastrointestinal disorders, and disorders of lipid metabolism.
Table 1 lists the Incyte clones used to assemble full length nucleotide sequences encoding LIPAP. Columns I and 2 show the sequence identification numbers (SEQ ID NOs) of the polypeptide and nucleotide sequences, respectively. Column 3 shows the clone IDs of the Incyte clones in which nucleic acids encoding each LIPAP were identified, and column 4 shows the cDNA
libraries from which these clones were isolated. Column 5 shows Incyte clones and their corresponding cDNA
libraries. Clones for which cDNA libraries are not indicated were derived from pooled cDNA libraries.
The Incyte clones in column 5 were used to assemble the consensus nucleotide sequence of each LIPAP
and are useful as fragments in hybridization technologies.
The columns of Table 2 show various properties of each of the polypeptides of the invention:
column I references the SEQ ID NO; column 2 shows the number of amino acid residues in each polypeptide; column 3 shows potential phosphorylation sites; column 4 shows potential glycosylation sites; column 5 shows the amino acid residues comprising signature sequences and motifs; column 6 shows homologous sequences as identified by BLAST analysis and the identity of each polypeptide;
and column 7 shows analytical methods and in some cases, searchable databases to which the analytical methods were applied. The methods of column 7 were used to characterize each polypeptide through sequence homology and protein motifs.
The columns of Table 3 show the tissue-specificity and diseases, disorders, or conditions associated with nucleotide sequences encoding LIPAP. The first column of Table 3 lists the nucleotide SEQ ID NOs. Column 2 lists fragments of the nucleotide sequences of column 1.
These fragments are useful, for example, in hybridization or amplification technologies to identify SEQ ID N0:13-24 and to distinguish between SEQ ID N0:13-24 and related polynucleotide sequences. The polypeptides encoded by these fragments are useful, for example, as immunogenic peptides.
Column 3 lists tissue categories which express LIPAP as a fraction of total tissues expressing LIPAP. Column 4 lists diseases, disorders, or conditions associated with those tissues expressing LIPAP as a fraction of total tissues expressing LIPAP. Column 5 lists the vectors used to subclone each cDNA library.
The columns of Table 4 show descriptions of the tissues used to construct the cDNA libraries from which cDNA clones encoding LIPAP were isolated. Column I references the nucleotide SEQ ID
NOs, column 2 shows the cDNA libraries from which these clones were isolated, and column 3 shows the tissue origins and other descriptive information relevant to the cDNA
libraries in column 2.
SEQ ID N0:21 maps to chromosome 11 within the interval from 92.5 to 96.3 centiMorgans.
This interval also contains a gene encoding a G-protein coupled receptor associated with epilepsy.
The invention also encompasses LIPAP variants. A preferred LIPAP variant is one which has at least about 80%, or alternatively at least about 90%, or even at least about 95% amino acid sequence identity to the LIPAP amino acid sequence, and which contains at least one functional or structural characteristic of LIPAP.
The invention also encompasses polynucleotides which encode LIPAP. In a particular embodiment, the invention encompasses a polynucleotide sequence comprising a sequence selected from the group consisting of SEQ ID N0:13-24, which encodes LIPAP. The polynucleotide sequences of SEQ ID N0:13-24, as presented in the Sequence Listing, embrace the equivalent RNA sequences, wherein occurrences of the nitrogenous base thymine are replaced with uracil, and the sugar backbone is composed of ribose instead of deoxyribose.
The invention also encompasses a variant of a polynucleotide sequence encoding LIPAP. In particular, such a variant polynucleotide sequence will have at least about 80%, or alternatively at least about 90%, or even at least about 95% polynucleotide sequence identity to the polynucleotide sequence encoding LIPAP. A particular aspect of the invention encompasses a variant of a polynucleotide sequence comprising a sequence selected from the group consisting of SEQ ID
N0:13-24 which has at least about 80%, or alternatively at least about 90%, or even at least about 95% polynucleotide sequence identity to a nucleic acid sequence selected from the group consisting of SEQ ID N0:13-24.
Any one of the polynucleotide variants described above can encode an amino acid sequence which contains at least one functional or structural characteristic of LIPAP.
It will be appreciated by those skilled in the art that as a result of the degeneracy of the genetic code, a multitude of polynucleotide sequences encoding LIPAP, some bearing minimal similarity to the polynucleotide sequences of any known and naturally occurring gene, may be produced. Thus, the invention contemplates each and every possible variation of polynucleotide sequence that could be made by selecting combinations based on possible codon choices. These combinations are made in accordance with the standard triplet genetic code as applied to the polynucleotide sequence of naturally occurring LIPAP, and all such variations are to be considered as being specifically disclosed.
Although nucleotide sequences which encode LIPAP and its variants are generally capable of hybridizing to the nucleotide sequence of the naturally occurring LIPAP under appropriately selected conditions of stringency, it may be advantageous to produce nucleotide sequences encoding LIPAP or its derivatives possessing a substantially different codon usage, e.g., inclusion of non-naturally occurring codons. Codons may be selected to increase the rate at which expression of the peptide occurs in a particular prokaryotic or eukaryotic host in accordance with the frequency with which particular codons are utilized by the host. Other reasons for substantially altering the nucleotide sequence encoding LIPAP and its derivatives without altering the encoded amino acid sequences include the production of RNA transcripts having more desirable properties, such as a greater half-life, than transcripts produced from the naturally occurring sequence.
The invention also encompasses production of DNA sequences which encode LIPAP
and LIPAP derivatives, or fragments thereof, entirely by synthetic chemistry.
After production, the synthetic sequence may be inserted into any of the many available expression vectors and cell systems using reagents well known in the art. Moreover, synthetic chemistry may be used to introduce mutations into a sequence encoding LIPAP or any fragment thereof.
Also encompassed by the invention are polynucleotide sequences that are capable of hybridizing to the claimed polynucleotide sequences, and, in particular, to those shown in SEQ ID
N0:13-24 and fragments thereof under various conditions of stringency. (See, e.g., Wahl, G.M. and S.L. Berger (1987) Methods Enzymol. 152:399-407; Kimmel, A.R. (1987) Methods Enzymol. 152:507-511.) Hybridization conditions, including annealing and wash conditions, are described in "Definitions."
Methods for DNA sequencing are well known in the art and may be used to practice any of the embodiments of the invention. The methods may employ such enzymes as the Klenow fragment of DNA polymerise I, SEQUENASE (US Biochemical, Cleveland OH), Taq polymerise (Perkin-Elmer), thermostable T7 polymerise (Amersham Pharmacia Biotech, Piscataway NJ), or combinations of polymerises and proofreading exonucleases such as those found in the ELONGASE
amplification system (Life Technologies, Gaithersburg MD). Preferably, sequence preparation is automated with machines such as the MICROLAB 2200 liquid transfer system (Hamilton, Reno NV), PTC200 thermal cycler (MJ Research, Watertown MA) and ABI CATALYST 800 thermal cycler (Perkin-Elmer).
Sequencing is then carried out using either the ABI 373 or 377 DNA sequencing system (Perkin-Elmer), the MEGABACE 1000 DNA sequencing system (Molecular Dynamics, Sunnyvale CA), or other systems known in the art. The resulting sequences are analyzed using a variety of algorithms which are well known in the art. (See, e.g., Ausubel, F.M. (1997) Short Protocols in Molecular BioloQV, John Wiley & Sons, New York NY, unit 7.7; Meyers, R.A. (1995) Molecular Biolo~y and CA 02362300 2001-07-31 PCT~jS00/04160 Biotechnolo~y, Wiley VCH, New York NY, pp. 856-853.) The nucleic acid sequences encoding LIPAP may be extended utilizing a partial nucleotide sequence and employing various PCR-based methods known in the art to detect upstream sequences, such as promoters and regulatory elements. For example, one method which may be employed, restriction-site PCR, uses universal and nested primers to amplify unknown sequence from genomic DNA within a cloning vector. (See, e.g., Sarkar, G. (1993) PCR Methods Applic.
2:318-322.) Another method, inverse PCR, uses primers that extend in divergent directions to amplify unknown sequence from a circularized template. The template is derived from restriction fragments comprising a known genomic locus and surrounding sequences. (See, e.g., Triglia, T. et al. (1988) Nucleic Acids Res.
16:8186.) A third method, capture PCR, involves PCR amplification of DNA
fragments adjacent to known sequences in human and yeast artificial chromosome DNA. (See, e.g., Lagerstrom, M. et al.
(1991) PCR Methods Applic. 1:11 I-119.) In this method, multiple restriction enzyme digestions and ligations may be used to insert an engineered double-stranded sequence into a region of unknown sequence before performing PCR. Other methods which may be used to retrieve unknown sequences are known in the art. (See, e.g., Parker, J.D. et al. (1991) Nucleic Acids Res. 19:3055-3060).
Additionally, one may use PCR, nested primers, and PROMOTERFINDER libraries (Clontech, Palo Alto CA) to walk genomic DNA. This procedure avoids the need to screen libraries and is useful in finding intron/exon junctions. For all PCR-based methods, primers may be designed using commercially available software, such as OLIGO 4.06 Primer Analysis software (National Biosciences, Plymouth MN) or another appropriate program, to be about 22 to 30 nucleotides in length, to have a GC content of about 50% or more, and to anneal to the template at temperatures of about 68°C to 72°C.
When screening for full-length cDNAs, it is preferable to use libraries that have been size-selected to include larger cDNAs. In addition, random-primed libraries, which often include sequences containing the 5' regions of genes, are preferable for_situations in which an oligo d(T) library does not yield a full-length cDNA. Genomic libraries may be useful for extension of sequence into 5' non-transcribed regulatory regions.
Capillary electrophoresis systems which are commercially available may be used to analyze the size or confirm the nucleotide sequence of sequencing or PCR products. In particular, capillary sequencing may employ flowable polymers for electrophoretic separation, four different nucleotide-specific, laser-stimulated fluorescent dyes, and a charge coupled device camera for detection of the emitted wavelengths. Output/light intensity may be converted to electrical signal using appropriate software (e.g., GENOTYPER and SEQUENCE NAVIGATOR, Perkin-Elmer), and the entire process from loading of samples to computer analysis and electronic data display may be computer controlled.
Capillary electrophoresis is especially preferable for sequencing small DNA
fragments which may be present in limited amounts in a particular sample.
CA 02362300 2001-07-31 pCT~S00/04160 In another embodiment of the invention, polynucleotide sequences or fragments thereof which encode LIPAP may be cloned in recombinant DNA molecules that direct expression of LIPAP, or fragments or functional equivalents thereof, in appropriate host cells. Due to the inherent degeneracy of the genetic code, other DNA sequences which encode substantially the same or a functionally equivalent amino acid sequence may be produced and used to express LIPAP.
The nucleotide sequences of the present invention can be engineered using methods generally known in the art in order to alter LIPAP-encoding sequences for a variety of purposes including, but not limited to, modification of the cloning, processing, and/or expression of the gene product. DNA
shuffling by random fragmentation and PCR reassembly of gene fragments and synthetic oligonucleotides may be used to engineer the nucleotide sequences. For example, oligonucleotide-mediated site-directed mutagenesis may be used to introduce mutations that create new restriction sites, alter glycosylation patterns, change codon preference, produce splice variants, and so forth.
The nucleotides of the present invention may be subjected to DNA shuffling techniques such as MOLECULARBREEDING (Maxygen Inc., Santa Clara CA; described in U.S. Patent Number 5,837,458; Chang, C.-C. et al. (1999) Nat. Biotechnol. 17:793-797; Christians, F.C. et al. (1999) Nat.
Biotechnol. 17:259-264; and Crameri, A. et al. (1996) Nat. Biotechnol. 14:315-319) to alter or improve the biological properties of LIPAP, such as its biological or enzymatic activity or its ability to bind to other molecules or compounds. DNA shuffling is a process by which a library of gene variants is produced using PCR-mediated recombination of gene fragments. The library is then subjected to selection or screening procedures that identify those gene variants with the desired properties. These preferred variants may then be pooled and further subjected to recursive rounds of DNA shuffling and selection/screening. Thus, genetic diversity is created through "artificial"
breeding and rapid molecular evolution. For example, fragments of a single gene containing random point mutations may be recombined, screened, and then reshuffled until the desired properties are optimized. Alternatively, fragments of a given gene may be recombined with fragments of homologous genes in the same gene family, either from the same or different species, thereby maximizing the genetic diversity of multiple naturally occurring genes in a directed and controllable manner.
In another embodiment, sequences encoding LIPAP may be synthesized, in whole or in part, using chemical methods well known in the art. (See, e.g., Caruthers, M.H. et al. ( 1980) Nucleic Acids Symp. Ser. 7:215-223; and Horn, T. et al. (1980) Nucleic Acids Symp. Ser.
7:225-232.) Alternatively, LIPAP itself or a fragment thereof may be synthesized using chemical methods.
For example, peptide synthesis can be performed using various solid-phase techniques. (See, e.g., Roberge, J.Y. et al. (1995) Science 269:202-204.) Automated synthesis may be achieved using the ABI 431A
peptide synthesizer (Perkin-Elmer). Additionally, the amino acid sequence of LIPAP, or any part thereof, may be altered during direct synthesis and/or combined with sequences from other proteins, or any part thereof, to CA 02362300 2001-07-31 PCT~jS00/04160 produce a variant polypeptide.
The peptide may be substantially purified by preparative high performance liquid chromatography. (See, e.g., Chiez, R.M. and F.Z. Regnier ( 1990) Methods Enzymol. 182:392-421.) The composition of the synthetic peptides may be confirmed by amino acid analysis or by sequencing.
(See, e.g., Creighton, T. (1984) Proteins, Structures and Molecular Properties, WH Freeman, New York NY.) In order to express a biologically active LIPAP, the nucleotide sequences encoding LIPAP or derivatives thereof may be inserted into an appropriate expression vector, i.e., a vector which contains the necessary elements for transcriptional and translational control of the inserted coding sequence in a suitable host. These elements include regulatory sequences, such as enhancers, constitutive and inducible promoters, and 5' and 3' untranslated regions in the vector and in polynucleotide sequences encoding LIPAP. Such elements may vary in their strength and specificity.
Specific initiation signals may also be used to achieve more efficient translation of sequences encoding LIPAP. Such signals include the ATG initiation codon and adjacent sequences, e.g. the Kozak sequence. In cases where sequences encoding LIPAP and its initiation codon and upstream regulatory sequences are inserted into the appropriate expression vector, no additional transcriptional or translational control signals may be needed. However, in cases where only coding sequence, or a fragment thereof, is inserted, exogenous translational control signals including an in-frame ATG initiation codon should be provided by the vector. Exogenous translational elements and initiation codons may be of various origins, both natural and synthetic. The efficiency of expression may be enhanced by the inclusion of enhancers appropriate for the particular host cell system used. (See, e.g., Scharf, D. et al. ( 1994) Results Probl. Cell Differ.
20:125-162.) Methods which are well known to those skilled in the art may be used to construct expression vectors containing sequences encoding LIPAP and appropriate transcriptional and translational control elements. These methods include in vitro recombinant DNA techniques, synthetic techniques, and in vivo genetic recombination. (See, e.g., Sambrook, J. et al. (1989) Molecular Cloning, A Laboratory Manual, Cold Spring Harbor Press, Plainview NY, ch. 4, 8, and 16-17; Ausubel, F.M. et al. (1995) Current Protocols in Molecular Biolo~y, John Wiley & Sons, New York NY, ch. 9, 13, and 16.) A variety of expression vector/host systems may be utilized to contain and express sequences encoding LIPAP. These include, but are not limited to, microorganisms such as bacteria transformed with recombinant bacteriophage, plasmid, or cosmid DNA expression vectors;
yeast transformed with yeast expression vectors; insect cell systems infected with viral expression vectors (e.g., baculovirus);
plant cell systems transformed with viral expression vectors (e.g., cauliflower mosaic virus, CaMV, or tobacco mosaic virus, TMV) or with bacterial expression vectors (e.g., Ti or pBR322 plasmids); or animal cell systems. The invention is not limited by the host cell employed.
In bacterial systems, a number of cloning and expression vectors may be selected depending upon the use intended for polynucleotide sequences encoding LIPAP. For example, routine cloning, subcloning, and propagation of polynucleotide sequences encoding LIPAP can be achieved using a multifunctional E. coli vector such as PBLUESCRIPT (Stratagene, La Jolla CA) or PSPORT1 plasmid (Life Technologies). Ligation of sequences encoding LIPAP into the vector's multiple cloning site disrupts the lacZ gene, allowing a colorimetric screening procedure for identification of transformed bacteria containing recombinant molecules. In addition, these vectors may be useful for in vitro transcription, dideoxy sequencing, single strand rescue with helper phage, and creation of nested deletions in the cloned sequence. (See, e.g., Van Heeke, G. and S.M. Schuster (1989) J. Biol. Chem.
264:5503-5509.) When large quantities of LIPAP are needed, e.g. for the production of antibodies, vectors which direct high level expression of LIPAP may be used. For example, vectors containing the strong, inducible TS or T7 bacteriophage promoter may be used.
Yeast expression systems may be used for production of LIPAP. A number of vectors containing constitutive or inducible promoters, such as alpha factor, alcohol oxidase, and PGH
promoters, may be used in the yeast Saccharomyces cerevisiae or Pichia pastoris. In addition, such vectors direct either the secretion or intracellular retention of expressed proteins and enable integration of foreign sequences into the host genome for stable propagation. (See, e.g., Ausubel, 1995, supra;
Bitter, G.A. et al. ( 1987) Methods Enzymol. 153:516-544; and Scorer, C.A. et al. ( 1994) Bio/Technology 12:181-184.) Plant systems may also be used for expression of LIPAP. Transcription of sequences encoding LIPAP may be driven viral promoters, e.g., the 35S and 19S promoters of CaMV
used alone or in combination with the omega leader sequence from TMV (Takamatsu, N. (1987) EMBO
J. 6:307-311).
Alternatively, plant promoters such as the small subunit of RUBISCO or heat shock promoters may be used. (See, e.g., Coruzzi, G. et al. (1984) EMBO J. 3:1671-1680; Brogue, R. et al. (1984) Science 224:838-843; and Winter, J. et al. (1991) Results Probl. Cell Differ. 17:85-105.) These constructs can be introduced into plant cells by direct DNA transformation or pathogen-mediated transfection. (See, e.g., The McGraw Hill Yearbook of Science and Technolo~y (1992) McGraw Hill, New York NY, pp.
191-196.) In mammalian cells, a number of viral-based expression systems may be utilized. In cases where an adenovirus is used as an expression vector, sequences encoding LIPAP
may be ligated into an adenovirus transcription/translation complex consisting of the late promoter and tripartite leader sequence. Insertion in a non-essential E1 or E3 region of the viral genome may be used to obtain infective virus which expresses LIPAP in host cells. (See, e.g., Logan, J. and T. Shenk (1984) Proc.
Natl. Acad. Sci. USA 81:3655-3659.) In addition, transcription enhancers, such as the Rous sarcoma virus (RSV) enhancer, may be used to increase expression in mammalian host cells. SV40 or EBV-based vectors may also be used for high-level protein expression.
Human artificial chromosomes (HACs) may also be employed to deliver larger fragments of DNA than can be contained in and expressed from a plasmid. HACs of about 6 kb to 10 Mb are constructed and delivered via conventional delivery methods (liposomes, polycationic amino polymers, or vesicles) for therapeutic purposes. (See, e.g., Harrington, J.J. et al. ( 1997) Nat. Genet. 15:345-355.) For long term production of recombinant proteins in mammalian systems, stable expression of LIPAP in cell lines is preferred. For example, sequences encoding LIPAP can be transformed into cell lines using expression vectors which may contain viral origins of replication and/or endogenous expression elements and a selectable marker gene on the same or on a separate vector. Following the introduction of the vector, cells may be allowed to grow for about 1 to 2 days in enriched media before being switched to selective media. The purpose of the selectable marker is to confer resistance to a selective agent, and its presence allows growth and recovery of cells which successfully express the introduced sequences. Resistant clones of stably transformed cells may be propagated using tissue culture techniques appropriate to the cell type.
Any number of selection systems may be used to recover transformed cell lines.
These include, but are not limited to, the herpes simplex virus thymidine kinase and adenine phosphoribosyltransferase genes, for use in tk- and apr= cells, respectively. (See, e.g., Wigler, M. et al. ( 1977) Cell 11:223-232;
Lowy, I. et al. (1980) Cell 22:817-823.) Also, antimetabolite, antibiotic, or herbicide resistance can be used as the basis for selection. For example, dhfr confers resistance to methotrexate; neo confers resistance to the aminoglycosides neomycin and G-418; and als and pat confer resistance to chlorsulfuron and phosphinotricin acetyltransferase, respectively. (See, e.g., Wigler, M. et al. (1980) Proc. Natl. Acad. Sci. USA 77:3567-3570; Colbere-Garapin, F. et al. (1981) J.
Mol. Biol. 150:1-14.) Additional selectable genes have been described, e.g., trpB and hisD, which alter cellular requirements for metabolites. (See, e.g., Hartman, S.C. and R.C. Mulligan (1988) Proc.
Natl. Acad. Sci. USA
85:8047-8051.) Visible markers, e.g., anthocyanins, green fluorescent proteins (GFP; Clontech),13 glucuronidase and its substrate a-glucuronide, or luciferase and its substrate luciferin may be used.
These markers can be used not only to identify transformants, but also to quantify the amount of transient or stable protein expression attributable to a specific vector system. (See, e.g., Rhodes, C.A.
(1995) Methods Mol. Biol. 55:121-131.) Although the presence/absence of marker gene expression suggests that the gene of interest is also present, the presence and expression of the gene may need to be confirmed. For example, if the sequence encoding LIPAP is inserted within a marker gene sequence, transformed cells containing sequences encoding LIPAP can be identified by the absence of marker gene function. Alternatively, a marker gene can be placed in tandem with a sequence encoding LIPAP under the control of a single promoter. Expression of the marker gene in response to induction or selection usually indicates expression of the tandem gene as well.
In general, host cells that contain the nucleic acid sequence encoding LIPAP
and that express LIPAP may be identified by a variety of procedures known to those of skill in the art. These procedures include, but are not limited to, DNA-DNA or DNA-RNA hybridizations, PCR
amplification, and protein bioassay or immunoassay techniques which include membrane, solution, or chip based technologies for the detection and/or quantification of nucleic acid or protein sequences.
Immunological methods for detecting and measuring the expression of LIPAP
using either specific polyclonal or monoclonal antibodies are known in the art. Examples of such techniques include enzyme-linked immunosorbent assays (ELISAs), radioimmunoassays (RIAs), and fluorescence activated cell sorting (FAGS). A two-site, monoclonal-based immunoassay utilizing monoclonal antibodies reactive to two non-interfering epitopes on LIPAP is preferred, but a competitive binding assay may be employed. These and other assays are well known in the art. (See, e.g., Hampton, R. et al. (1990) Serological Methods, a Laboratory Manual, APS Press, St. Paul MN, Sect. IV; Coligan, J.E.
et al. (1997) Current Protocols in Immunolo~y, Greene Pub. Associates and Wiley-Interscience, New York NY; and Pound, J.D. ( 1998) Immunochemical Protocols, Humana Press, Totowa NJ.) A wide variety of labels and conjugation techniques are known by those skilled in the art and may be used in various nucleic acid and amino acid assays. Means for producing labeled hybridization or PCR probes for detecting sequences related to polynucleotides encoding LIPAP include oligolabeling, nick translation, end-labeling, or PCR amplification using a labeled nucleotide.
Alternatively, the sequences encoding LIPAP, or any fragments thereof, may be cloned into a vector for the production of an mRNA probe. Such vectors are known in the art, are commercially available, and may be used to synthesize RNA probes in vitro by addition of an appropriate RNA polymerase such as T7, T3, or SP6 and labeled nucleotides. These procedures may be conducted using a variety of commercially available kits, such as those provided by Amersham Pharmacia Biotech, Promega (Madison WI), and US Biochemical. Suitable reporter molecules or labels which may be used for ease of detection include radionuclides, enzymes, fluorescent, chemiluminescent, or chromogenic agents, as well as substrates, cofactors, inhibitors, magnetic particles, and the like.
Host cells transformed with nucleotide sequences encoding LIPAP may be cultured under conditions suitable for the expression and recovery of the protein from cell culture. The protein produced by a transformed cell may be secreted or retained intracellularly depending on the sequence and/or the vector used. As will be understood by those of skill in the art, expression vectors containing polynucleotides which encode LIPAP may be designed to contain signal sequences which direct secretion of LIPAP through a prokaryotic or eukaryotic cell membrane.
In addition, a host cell strain may be chosen for its ability to modulate expression of the inserted sequences or to process the expressed protein in the desired fashion.
Such modifications of the polypeptide include, but are not limited to, acetylation, carboxylation, glycosylation, phosphorylation, lipidation, and acylation. Post-translational processing which cleaves a "prepro" or "pro" form of the protein may also be used to specify protein targeting, folding, and/or activity. Different host cells which have specific cellular machinery and characteristic mechanisms for post-translational activities (e.g., CHO, HeLa, MDCK, HEK293, and WI38) are available from the American Type Culture Collection (ATCC, Manassas VA) and may be chosen to ensure the correct modification and processing of the foreign protein.
In another embodiment of the invention, natural, modified, or recombinant nucleic acid sequences encoding LIPAP may be ligated to a heterologous sequence resulting in translation of a fusion protein in any of the aforementioned host systems. For example, a chimeric LIPAP protein containing a heterologous moiety that can be recognized by a commercially available antibody may facilitate the screening of peptide libraries for inhibitors of LIPAP
activity. Heterologous protein and peptide moieties may also facilitate purification of fusion proteins using commercially available affinity matrices. Such moieties include, but are not limited to, glutathione S-transferase (GST), maltose binding protein (MBP), thioredoxin (Trx), calmodulin binding peptide (CBP), 6-His, FLAG, c-myc, and hemagglutinin (HA). GST, MBP, Trx, CBP, and 6-His enable purification of their cognate fusion proteins on immobilized glutathione, maltose, phenylarsine oxide, calmodulin, and metal-chelate resins, respectively. FLAG, c-myc, and hemagglutinin (HA) enable immunoaffinity purification of fusion proteins using commercially available monoclonal and polyclonal antibodies that specifically recognize these epitope tags. A fusion protein may also be engineered to contain a proteolytic cleavage site located between the LIPAP encoding sequence and the heterologous protein sequence, so that LIPAP may be cleaved away from the heterologous moiety following purification.
Methods for fusion protein expression and purification are discussed in Ausubel (1995, supra, ch.
10). A variety of commercially available kits may also be used to facilitate expression and purification of fusion proteins.
In a further embodiment of the invention, synthesis of radiolabeled LIPAP may be achieved in vitro using the TNT rabbit reticulocyte lysate or wheat germ extract system (Promega). These systems couple transcription and translation of protein-coding sequences operably associated with the T7, T3, or SP6 promoters. Translation takes place in the presence of a radiolabeled amino acid precursor, for example, 35S-methionine.
Fragments of LIPAP may be produced not only by recombinant means, but also by direct peptide synthesis using solid-phase techniques. (See, e.g., Creighton, supra, pp. 55-60.) Protein synthesis may be performed by manual techniques or by automation. Automated synthesis may be achieved, for example, using the ABI 431A peptide synthesizer (Perkin-Elmer).
Various fragments of LIPAP may be synthesized separately and then combined to produce the full length molecule.
THERAPEUTICS
Chemical and structural similarity, e.g., in the context of sequences and motifs, exists between regions of LIPAP and human lipid-associated proteins. In addition, the expression of LIPAP is closely associated with cardiovascular and gastrointestinal tissues, and tissues of the nervous system.
Therefore, LIPAP appears to play a role in cardiovascular, neurological, and gastrointestinal disorders, and disorders of lipid metabolism. In the treatment of disorders associated with increased LIPAP
expression or activity, it is desirable to decrease the expression or activity of LIPAP. In the treatment of disorders associated with decreased LIPAP expression or activity, it is desirable to increase the expression or activity of LIPAP.
Therefore, in one embodiment, LIPAP or a fragment or derivative thereof may be administered to a subject to treat or prevent a disorder associated with decreased expression or activity of LIPAP.
Examples of such disorders include, but are not limited to, a cardiovascular disorder including blood vessel disorders such as arteriovenous fistula, atherosclerosis, hypertension, vasculitis, Raynaud's disease, aneurysms, arterial dissections, varicose veins, thrombophlebitis and phlebothrombosis, vascular tumors, and complications of thrombolysis, balloon angioplasty, vascular replacement, and coronary artery bypass graft surgery; heart disorders such as congestive heart failure, ischemic heart disease, angina pectoris, myocardial infarction, hypertensive heart disease, degenerative valvular heart disease, calcific aortic valve stenosis, congenitally bicuspid aortic valve, mitral annular calcification, mural valve prolapse, rheumatic fever and rheumatic heart disease, infective endocarditis, nonbacterial thrombotic endocarditis, endocarditis of systemic lupus erythematosus, carcinoid heart disease, cardiomyopathy, myocarditis, pericarditis, neoplastic heart disease, congenital heart disease, and complications of cardiac transplantation; and lung disorders such as congenital lung anomalies, atelectasis, pulmonary congestion and edema, pulmonary embolism, pulmonary hemorrhage, pulmonary infarction, pulmonary hypertension, vascular sclerosis, obstructive pulmonary disease, restrictive pulmonary disease, chronic obstructive pulmonary disease, emphysema, chronic bronchitis, bronchial asthma, bronchiectasis, bacterial pneumonia, viral and mycoplasmal pneumonia, lung abscess, pulmonary tuberculosis, diffuse interstitial diseases, pneumoconioses, sarcoidosis, idiopathic pulmonary fibrosis, desquamative interstitial pneumonitis, hypersensitivity pneumonitis, pulmonary eosinophilia bronchiolitis obliterans-organizing pneumonia, diffuse pulmonary hemorrhage syndromes, Goodpasture's syndromes, idiopathic pulmonary hemosiderosis, pulmonary involvement in collagen-vascular disorders, pulmonary alveolar proteinosis, lung tumors, inflammatory and noninflammatory pleural effusions, pneumothorax, pleural tumors, drug-induced lung disease, radiation-induced lung disease, and complications of lung transplantation; a neurological disorder such as epilepsy, ischemic cerebrovascular disease, stroke, cerebral neoplasms, Alzheimer's disease, Pick's disease, Huntington's disease, dementia, Parkinson's disease and other extrapyramidal disorders, CA 02362300 2001-07-31 pCT~S00/04160 amyotrophic lateral sclerosis and other motor neuron disorders, progressive neural muscular atrophy, retinitis pigmentosa, hereditary ataxias, multiple sclerosis and other demyelinating diseases, bacterial and viral meningitis, brain abscess, subdural empyema, epidural abscess, suppurative intracranial thrombophlebitis, myelitis and radiculitis, viral central nervous system disease; prion diseases including kuru, Creutzfeldt-Jakob disease, and Gerstmann-Straussler-Scheinker syndrome;
fatal familial insomnia, nutritional and metabolic diseases of the nervous system, neurofibromatosis, tuberous sclerosis, cerebelloretinal hemangioblastomatosis, encephalotrigeminal syndrome, mental retardation and other developmental disorders of the central nervous system, cerebral palsy, neuroskeletal disorders, autonomic nervous system disorders, cranial nerve disorders, spinal cord diseases, muscular dystrophy and other neuromuscular disorders, peripheral nervous system disorders, dermatomyositis and polymyositis; inherited, metabolic, endocrine, and toxic myopathies;
myasthenia gravis, periodic paralysis; mental disorders including mood, anxiety, and schizophrenic disorders; seasonal affective disorder (SAD); akathesia, amnesia, catatonia, diabetic neuropathy, tardive dyskinesia, dystonias, paranoid psychoses, postherpetic neuralgia, Tourette's disorder, progressive supranuclear palsy, corticobasal degeneration, and familial frontotemporal dementia; a gastrointestinal disorder such as dysphagia, peptic esophagitis, esophageal spasm, esophageal stricture, esophageal carcinoma, dyspepsia, indigestion, gastritis, gastric carcinoma, anorexia, nausea, emesis, gastroparesis, antral or pyloric edema, abdominal angina, pyrosis, gastroenteritis, intestinal obstruction, infections of the intestinal tract, peptic ulcer, cholelithiasis, cholecystitis, cholestasis, pancreatitis, pancreatic carcinoma, biliary tract disease, hepatitis, hyperbilirubinemia, cirrhosis, passive congestion of the liver, hepatoma, infectious colitis, ulcerative colitis, ulcerative proctitis, Crohn's disease, Whipple's disease, Mallory-Weiss syndrome, colonic carcinoma, colonic obstruction, irritable bowel syndrome, short bowel syndrome, diarrhea, constipation, gastrointestinal hemorrhage, acquired immunodeficiency syndrome (AIDS) enteropathy, jaundice, hepatic encephalopathy, hepatorenal syndrome, hepatic steatosis, hemochromatosis, Wilson's disease, alpha,-antitrypsin deficiency, Reye's syndrome, primary sclerosing cholangitis, liver infarction, portal vein obstruction and thrombosis, centrilobular necrosis, peliosis hepatis, hepatic vein thrombosis, veno-occlusive disease, preeclampsia, eclampsia, acute fatty liver of pregnancy, intrahepatic cholestasis of pregnancy, and hepatic tumors including nodular hyperplasias, adenomas, and carcinomas; and a disorder of lipid metabolism such as fatty liver, cholestasis, primary biliary cirrhosis, carnitine deficiency, carnitine palmitoyltransferase deficiency, myoadenylate deaminase deficiency, hypertriglyceridemia, lipid storage disorders such Fabry's disease, Gaucher's disease, Niemann-Pick's disease, metachromatic leukodystrophy, adrenoleukodystrophy, GMZ gangliosidosis, and ceroid lipofuscinosis, abetalipoproteinemia, Tangier disease, hyperlipoproteinemia, diabetes mellitus, lipodystrophy, lipomatoses, acute panniculitis, disseminated fat necrosis, adiposis dolorosa, lipoid adrenal hyperplasia, minimal change disease, lipomas, atherosclerosis, hypercholesterolemia, hypercholesterolemia with hypertriglyceridemia, primary hypoalphalipoproteinemia, hypothyroidism, renal disease, liver disease, lecithin:cholesterol acyltransferase deficiency, cerebrotendinous xanthomatosis, sitosterolemia, hypocholesterolemia, Tay-Sachs disease, Sandhoff's disease, hyperlipidemia, hyperlipemia, lipid myopathies, and obesity.
In another embodiment, a vector capable of expressing LIPAP or a fragment or derivative thereof may be administered to a subject to treat or prevent a disorder associated with decreased expression or activity of LIPAP including, but not limited to, those described above.
In a further embodiment, a pharmaceutical composition comprising a substantially purified LIPAP in conjunction with a suitable pharmaceutical carrier may be administered to a subject to treat or prevent a disorder associated with decreased expression or activity of LIPAP including, but not limited to, those provided above.
In still another embodiment, an agonist which modulates the activity of LIPAP
may be administered to a subject to treat or prevent a disorder associated with decreased expression or activity of LIPAP including, but not limited to, those listed above.
In a further embodiment, an antagonist of LIPAP may be administered to a subject to treat or prevent a disorder associated with increased expression or activity of LIPAP.
Examples of such disorders include, but are not limited to, those cardiovascular, neurological, and gastrointestinal disorders, and disorders of lipid metabolism, described above. In one aspect, an antibody which specifically binds LIPAP may be used directly as an antagonist or indirectly as a targeting or delivery mechanism for bringing a pharmaceutical agent to cells or tissues which express LIPAP.
In an additional embodiment, a vector expressing the complement of the polynucleotide encoding LIPAP may be administered to a subject to treat or prevent a disorder associated with increased expression or activity of LIPAP including, but not limited to, those described above.
In other embodiments, any of the proteins, antagonists, antibodies, agonists, complementary sequences, or vectors of the invention may be administered in combination with other appropriate therapeutic agents. Selection of the appropriate agents for use in combination therapy may be made by one of ordinary skill in the art, according to conventional pharmaceutical principles. The combination of therapeutic agents may act synergistically to effect the treatment or prevention of the various disorders described above. Using this approach, one may be able to achieve therapeutic efficacy with lower dosages of each agent, thus reducing the potential for adverse side effects.
An antagonist of LIPAP may be produced using methods which are generally known in the art.
In particular, purified LIPAP may be used to produce antibodies or to screen libraries of pharmaceutical agents to identify those which specifically bind LIPAP. Antibodies to LIPAP
may also be generated using methods that are well known in the art. Such antibodies may include, but are not limited to, polyclonal, monoclonal, chimeric, and single chain antibodies, Fab fragments, and fragments produced by a Fab expression library. Neutralizing antibodies (i.e., those which inhibit dimer formation) are generally preferred for therapeutic use.
For the production of antibodies, various hosts including goats, rabbits, rats, mice, humans, and others may be immunized by injection with LIPAP or with any fragment or oligopeptide thereof which has immunogenic properties. Depending on the host species, various adjuvants may be used to increase immunological response. Such adjuvants include, but are not limited to, Freund's, mineral gels such as aluminum hydroxide, and surface active substances such as lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions, KLH, and dinitrophenol. Among adjuvants used in humans, BCG (bacilli Calmette-Guerin) and Corynebacterium parvum are especially preferable.
It is preferred that the oligopeptides, peptides, or fragments used to induce antibodies to LIPAP
have an amino acid sequence consisting of at least about 5 amino acids, and generally will consist of at least about 10 amino acids. It is also preferable that these oligopeptides, peptides, or fragments are identical to a portion of the amino acid sequence of the natural protein and contain the entire amino acid sequence of a small, naturally occurring molecule. Short stretches of LIPAP amino acids may be fused with those of another protein, such as KLH, and antibodies to the chimeric molecule may be produced.
Monoclonal antibodies to LIPAP may be prepared using any technique which provides for the production of antibody molecules by continuous cell lines in culture. These include, but are not limited to, the hybridoma technique, the human B-cell hybridoma technique, and the EBV-hybridoma technique. (See, e.g., Kohler, G. et al. (1975) Nature 256:495-497; Kozbor, D.
et al. (1985) J.
Immunol. Methods 81:31-42; Cote, R.J. et al. (1983) Proc. Natl. Acad. Sci. USA
80:2026-2030; and Cole, S.P. et al. (1984) Mol. Cell Biol. 62:109-120.) In addition, techniques developed for the production of "chimeric antibodies,"
such as the splicing of mouse antibody genes to human antibody genes to obtain a molecule with appropriate antigen specificity and biological activity, can be used. (See, e.g., Morrison, S.L. et al. ( 1984) Proc.
Natl. Acad. Sci. USA 81:6851-6855; Neuberger, M.S. et al. (1984) Nature 312:604-608; and Takeda, S.
et al. (1985) Nature 314:452-454.) Alternatively, techniques described for the production of single chain antibodies may be adapted, using methods known in the art, to produce LIPAP-specific single chain antibodies. Antibodies with related specificity, but of distinct idiotypic composition, may be generated by chain shuffling from random combinatorial immunoglobulin libraries. (See, e.g., Burton, D.R. ( 1991 ) Proc. Natl. Acad. Sci. USA 88:10134-10137.) Antibodies may also be produced by inducing in vivo production in the lymphocyte population or by screening immunoglobulin libraries or panels of highly specific binding reagents as disclosed in the literature. (See, e.g., Orlandi, R. et al. (1989) Proc. Natl. Acad. Sci.
USA 86:3833-3837; Winter, G.
et al. ( I 991 ) Nature 349:293-299.) Antibody fragments which contain specific binding sites for LIl'AP may also be generated. For example, such fragments include, but are not limited to, F(ab~z fragments produced by pepsin digestion of the antibody molecule and Fab fragments generated by reducing the disulfide bridges of the F(ab~2 fragments. Alternatively, Fab expression libraries may be constructed to allow rapid and easy identification of monoclonal Fab fragments with the desired specificity.
(See, e.g., Huse, W.D. et al. (1989) Science 246:1275-1281.) Various immunoassays may be used for screening to identify antibodies having the desired specificity. Numerous protocols for competitive binding or immunoradiometric assays using either polyclonal or monoclonal antibodies with established specificities are well known in the art. Such immunoassays typically involve the measurement of complex formation between LIPAP and its specific antibody. A two-site, monoclonal-based immunoassay utilizing monoclonal antibodies reactive to two non-interfering LIPAP epitopes is generally used, but a competitive binding assay may also be employed (Pound, supra).
Various methods such as Scatchard analysis in conjunction with radioimmunoassay techniques may be used to assess the affinity of antibodies for LIPAP. Affinity is expressed as an association constant, Ka, which is defined as the molar concentration of LIPAP-antibody complex divided by the molar concentrations of free antigen and free antibody under equilibrium conditions. The Ka determined for a preparation of polyclonal antibodies, which are heterogeneous in their affinities for multiple LIPAP epitopes, represents the average affinity, or avidity, of the antibodies for LIPAP. The K~ determined for a preparation of monoclonal antibodies, which are monospecific for a particular LIPAP epitope, represents a true measure of affinity. High-affinity antibody preparations with K
ranging from about 109 to 10''- L/mole are preferred for use in immunoassays in which the LIPAP-antibody complex must withstand rigorous manipulations. Low-affinity antibody preparations with Ka ranging from about 106 to 10' L/mole are preferred for use in immunopurification and similar procedures which ultimately require dissociation of LIPAP, preferably in active form, from the antibody (Catty, D. (1988) Antibodies, Volume I: A Practical Approach, IRL
Press, Washington, DC;
Liddell, J.E. and Cryer, A. (1991) A Practical Guide to Monoclonal Antibodies, John Wiley & Sons, New York NY).
The titer and avidity of polyclonal antibody preparations may be further evaluated to determine the quality and suitability of such preparations for certain downstream applications. For example, a polyclonal antibody preparation containing at least 1-2 mg specific antibody/ml, preferably 5-10 mg specific antibody/ml, is generally employed in procedures requiring precipitation of LIPAP-antibody complexes. Procedures for evaluating antibody specificity, titer, and avidity, and guidelines for antibody quality and usage in various applications, are generally available.
(See, e.g., Catty, supra, and Coligan et al. supra.) In another embodiment of the invention, the polynucleotides encoding LIPAP, or any fragment or complement thereof, may be used for therapeutic purposes. In one aspect, the complement of the polynucleotide encoding LIPAP may be used in situations in which it would~be desirable to block the transcription of the mRNA. In particular, cells may be transformed with sequences complementary to polynucleotides encoding LIPAP. Thus, complementary molecules or fragments may be used to modulate LIPAP activity, or to achieve regulation of gene function. Such technology is now well known in the art, and sense or antisense oligonucleotides or larger fragments can be designed from various locations along the coding or control regions of sequences encoding LIPAP.
Expression vectors derived from retroviruses, adenoviruses, or herpes or vaccinia viruses, or from various bacterial plasmids, may be used for delivery of nucleotide sequences to the targeted organ, tissue, or cell population. Methods which are well known to those skilled in the art can be used to construct vectors to express nucleic acid sequences complementary to the polynucleotides encoding LIPAP. (See, e.g., Sambrook, sera; Ausubel, 1995, supra.) Genes encoding LIPAP can be turned off by transforming a cell or tissue with expression vectors which express high levels of a polynucleotide, or fragment thereof, encoding LIPAP. Such constructs may be used to introduce untranslatable sense or antisense sequences into a cell. Even in the absence of integration into the DNA, such vectors may continue to transcribe RNA molecules until they are disabled by endogenous nucleases. Transient expression may last for a month or more with a non-replicating vector, and may last even longer if appropriate replication elements are part of the vector system.
As mentioned above, modifications of gene expression can be obtained by designing complementary sequences or antisense molecules (DNA, RNA, or PNA) to the control, 5', or regulatory regions of the gene encoding LIPAP. Oligonucleotides derived from the transcription initiation site, e.g., between about positions -10 and +10 from the start site, may be employed. Similarly, inhibition can be achieved using triple helix base-pairing methodology. Triple helix pairing is useful because it causes inhibition of the ability of the double helix to open sufficiently for the binding of polymerases, transcription factors, or regulatory molecules. Recent therapeutic advances using triplex DNA have been described in the literature. (See, e.g., Gee, J.E. et al. (1994) in Huber, B.E. and B.I. Carr, Molecular and Immunolo_ ig c Approaches, Futura Publishing, Mt. Kisco NY, pp.
163-177.) A
complementary sequence or antisense molecule may also be designed to block translation of mRNA by preventing the transcript from binding to ribosomes.
Ribozymes, enzymatic RNA molecules, may also be used to catalyze the specific cleavage of RNA. The mechanism of ribozyme action involves sequence-specific hybridization of the ribozyme molecule to complementary target RNA, followed by endonucleolytic cleavage.
For example, engineered hammerhead motif ribozyme molecules may specifically and efficiently catalyze endonucleolytic cleavage of sequences encoding LIPAP.
Specific ribozyme cleavage sites within any potential RNA target are initially identified by scanning the target molecule for ribozyme cleavage sites, including the following sequences: GUA, GUU, and GUC. Once identified, short RNA sequences of between 15 and 20 ribonucleotides, corresponding to the region of the target gene containing the cleavage site, may be evaluated for secondary structural features which may render the oligonucleotide inoperable.
The suitability of candidate targets may also be evaluated by testing accessibility to hybridization with complementary oligonucleotides using ribonuclease protection assays.
Complementary ribonucleic acid molecules and ribozymes of the invention may be prepared by any method known in the art for the synthesis of nucleic acid molecules. These include techniques for chemically synthesizing oligonucleotides such as solid phase phosphoramidite chemical synthesis.
Alternatively, RNA molecules may be generated by in vitro and in vivo transcription of DNA
sequences encoding LIPAP. Such DNA sequences may be incorporated into a wide variety of vectors with suitable RNA polymerase promoters such as T7 or SP6. Alternatively, these cDNA constructs that synthesize complementary RNA, constitutively or inducibly, can be introduced into cell lines, cells, or tissues.
RNA molecules may be modified to increase intracellular stability and half-life. Possible modifications include, but are not limited to, the addition of flanking sequences at the 5' and/or 3' ends of the molecule, or the use of phosphorothioate or 2' O-methyl rather than phosphodiesterase linkages within the backbone of the molecule. This concept is inherent in the production of PNAs and can be extended in all of these molecules by the inclusion of nontraditional bases such as inosine, queosine, and wybutosine, as well as acetyl-, methyl-, thio-, and similarly modified forms of adenine, cytidine, guanine, thymine, and uridine which are not as easily recognized by endogenous endonucleases.
Many methods for introducing vectors into cells or tissues are available and equally suitable for use in vivo, in vitro, and ex vivo. For ex vivo therapy, vectors may be introduced into stem cells taken from the patient and clonally propagated for autologous transplant back into that same patient.
Delivery by transfection, by liposome injections, or by polycationic amino polymers may be achieved using methods which are well known in the art. (See, e.g., Goldman, C.K. et al. ( 1997) Nat.
Biotechnol. 15:462-466.) Any of the therapeutic methods described above may be applied to any subject in need of such therapy, including, for example, mammals such as humans, dogs, cats, cows, horses, rabbits, and monkeys.
An additional embodiment of the invention relates to the administration of a pharmaceutical or sterile composition, in conjunction with a pharmaceutically acceptable carrier, for any of the therapeutic effects discussed above. Such pharmaceutical compositions may consist of LIPAP, antibodies to LIPAP, and mimetics, agonists, antagonists, or inhibitors of LIPAP. The compositions may be administered alone or in combination with at least one other agent, such as a stabilizing compound, which may be administered in any sterile, biocompatible pharmaceutical carrier including, but not limited to, saline, buffered saline, dextrose, and water. The compositions may be administered to a patient alone, or in combination with other agents, drugs, or hormones.
The pharmaceutical compositions utilized in this invention may be administered by any number of routes including, but not limited to, oral, intravenous, intramuscular, intra-arterial, intramedullary, intrathecal, intraventricular, transdermal, subcutaneous, intraperitoneal, intranasal, enteral, topical, sublingual, or rectal means.
In addition to the active ingredients, these pharmaceutical compositions may contain suitable pharmaceutically-acceptable carriers comprising excipients and auxiliaries which facilitate processing of the active compounds into preparations which can be used pharmaceutically.
Further details on techniques for formulation and administration may be found in the latest edition of Remin tg on's Pharmaceutical Sciences (Maack Publishing, Easton PA).
Pharmaceutical compositions for oral administration can be formulated using pharmaceutically acceptable carriers well known in the art in dosages suitable for oral administration. Such carriers enable the pharmaceutical compositions to be formulated as tablets, pills, dragees, capsules, liquids, gels, syrups, slurries, suspensions, and the like, for ingestion by the patient.
Pharmaceutical preparations for oral use can be obtained through combining active compounds with solid excipient and processing the resultant mixture of granules (optionally, after grinding) to obtain tablets or dragee cores. Suitable auxiliaries can be added, if desired.
Suitable excipients include carbohydrate or protein fillers, such as sugars, including lactose, sucrose, mannitol, and sorbitol; starch from corn, wheat, rice, potato, or other plants; cellulose, such as methyl cellulose, hydroxypropylmethyl-cellulose, or sodium carboxymethylcellulose; gums, including arabic and tragacanth; and proteins, such as gelatin and collagen. If desired, disintegrating or solubilizing agents may be added, such as the cross-linked polyvinyl pyrrolidone, agar, and alginic acid or a salt thereof, such as sodium alginate.
Dragee cores may be used in conjunction with suitable coatings, such as concentrated sugar solutions, which may also contain gum arabic, talc, polyvinylpyrrolidone, carbopol gel, polyethylene glycol, and/or titanium dioxide, lacquer solutions, and suitable organic solvents or solvent mixtures.
Dyestuffs or pigments may be added to the tablets or dragee coatings for product identification or to characterize the quantity of active compound, i.e., dosage.
Pharmaceutical preparations which can be used orally include push-fit capsules made of gelatin, as well as soft, sealed capsules made of gelatin and a coating, such as glycerol or sorbitol.
Push-fit capsules can contain active ingredients mixed with fillers or binders, such as lactose or CA 02362300 2001-07-31 pC~/US00/04160 starches, lubricants, such as talc or magnesium stearate, and, optionally, stabilizers. In soft capsules, the active compounds may be dissolved or suspended in suitable liquids, such as fatty oils, liquid, or liquid polyethylene glycol with or without stabilizers.
Pharmaceutical formulations suitable for parenteral administration may be formulated in aqueous solutions, preferably in physiologically compatible buffers such as Hanks' solution, Ringer's solution, or physiologically buffered saline. Aqueous injection suspensions may contain substances which increase the viscosity of the suspension, such as sodium carboxymethyl cellulose, sorbitol, or dextran. Additionally, suspensions of the active compounds may be prepared as appropriate oily injection suspensions. Suitable lipophilic solvents or vehicles include fatty oils, such as sesame oil, or synthetic fatty acid esters, such as ethyl oleate, triglycerides, or liposomes. Non-lipid polycationic amino polymers may also be used for delivery. Optionally, the suspension may also contain suitable stabilizers or agents to increase the solubility of the compounds and allow for the preparation of highly concentrated solutions.
For topical or nasal administration, penetrants appropriate to the particular barrier to be permeated are used in the formulation. Such penetrants are generally known in the art.
The pharmaceutical compositions of the present invention may be manufactured in a manner that is known in the art, e.g., by means of conventional mixing, dissolving, granulating, dragee-making, levigating, emulsifying, encapsulating, entrapping, or lyophilizing processes.
The pharmaceutical composition may be provided as a salt and can be formed with many acids, including but not limited to, hydrochloric, sulfuric, acetic, lactic, tartaric, malic, and succinic acids.
Salts tend to be more soluble in aqueous or other protonic solvents than are the corresponding free base forms. In other cases, the preparation may be a lyophilized powder which may contain any or all of the following: 1 mM to 50 mM histidine, 0.1 % to 2% sucrose, and 2% to 7%
mannitol, at a pH range of 4.5 to 5.5, that is combined with buffer prior to use.
After pharmaceutical compositions have been prepared, they can be placed in an appropriate container and labeled for treatment of an indicated condition. For administration of LIPAP, such labeling would include amount, frequency, and method of administration.
Pharmaceutical compositions suitable for use in the invention include compositions wherein the active ingredients are contained in an effective amount to achieve the intended purpose. The determination of an effective dose is well within the capability of those skilled in the art.
For any compound, the therapeutically effective dose can be estimated initially either in cell culture assays, e.g., of neoplastic cells, or in animal models such as mice, rats, rabbits, dogs, or pigs.
An animal model may also be used to determine the appropriate concentration range and route of administration. Such information can then be used to determine useful doses and routes for administration in humans.
WO 00/49043 CA 02362300 2001-07-31 pCT~JS00/04160 A therapeutically effective dose refers to that amount of active ingredient, for example LIPAP
or fragments thereof, antibodies of LIl'AP, and agonists, antagonists or inhibitors of LIPAP, which ameliorates the symptoms or condition. Therapeutic efficacy and toxicity may be determined by standard pharmaceutical procedures in cell cultures or with experimental animals, such as by calculating the EDSO (the dose therapeutically effective in 50% of the population) or LDSO (the dose lethal to 50% of the population) statistics. The dose ratio of toxic to therapeutic effects is the therapeutic index, which can be expressed as the LDSO/EDSO ratio.
Pharmaceutical compositions which exhibit large therapeutic indices are preferred. The data obtained from cell culture assays and animal studies are used to formulate a range of dosage for human use. The dosage contained in such compositions is preferably within a range of circulating concentrations that includes the EDSO with little or no toxicity. The dosage varies within this range depending upon the dosage form employed, the sensitivity of the patient, and the route of administration.
The exact dosage will be determined by the practitioner, in light of factors related to the subject requiring treatment. Dosage and administration are adjusted to provide sufficient levels of the active moiety or to maintain the desired effect. Factors which may be taken into account include the severity of the disease state, the general health of the subject, the age, weight, and gender of the subject, time and frequency of administration, drug combination(s), reaction sensitivities, and response to therapy.
Long-acting pharmaceutical compositions may be administered every 3 to 4 days, every week, or biweekly depending on the half-life and clearance rate of the particular formulation.
Normal dosage amounts may vary from about 0.1 ~g to 100,000 fig, up to a total dose of about 1 gram, depending upon the route of administration. Guidance as to particular dosages and methods of delivery is provided in the literature and generally available to practitioners in the art. Those skilled in the art will employ different formulations for nucleotides than for proteins or their inhibitors.
Similarly, delivery of polynucleotides or polypeptides will be specific to particular cells, conditions, locations, etc.
DIAGNOSTICS
In another embodiment, antibodies which specifically bind LIPAP may be used for the diagnosis of disorders characterized by expression of LIPAP, or in assays to monitor patients being treated with LIPAP or agonists, antagonists, or inhibitors of LIPAP.
Antibodies useful for diagnostic purposes may be prepared in the same manner as described above for therapeutics. Diagnostic assays for LIPAP include methods which utilize the antibody and a label to detect LIPAP in human body fluids or in extracts of cells or tissues. The antibodies may be used with or without modification, and may be labeled by covalent or non-covalent attachment of a reporter molecule.
A wide variety of reporter molecules, several of which are described above, are known in the art and may be used.
A variety of protocols for measuring LIPAP, including ELISAs, RIAs, and FACS, are known in CA 02362300 2001-07-31 PCT~jS00/04160 the art and provide a basis for diagnosing altered or abnormal levels of LIPAP
expression. Normal or standard values for LIPAP expression are established by combining body fluids or cell extracts taken from normal mammalian subjects, for example, human subjects, with antibody to LIPAP under conditions suitable for complex formation. The amount of standard complex formation may be quantitated by various methods, such as photometric means. Quantities of LIPAP
expressed in subject, control, and disease samples from biopsied tissues are compared with the standard values. Deviation between standard and subject values establishes the parameters for diagnosing disease.
In another embodiment of the invention, the polynucleotides encoding LIPAP may be used for diagnostic purposes. The polynucleotides which may be used include oligonucleotide sequences, complementary RNA and DNA molecules, and PNAs. The polynucleotides may be used to detect and quantify gene expression in biopsied tissues in which expression of LIPAP may be correlated with disease. The diagnostic assay may be used to determine absence, presence, and excess expression of LIPAP, and to monitor regulation of LIPAP levels during therapeutic intervention.
In one aspect, hybridization with PCR probes which are capable of detecting polynucleotide sequences, including genomic sequences, encoding LIPAP or closely related molecules may be used to identify nucleic acid sequences which encode LIPAP. The specificity of the probe, whether it is made from a highly specific region, e.g., the 5'regulatory region, or from a less specific region, e.g., a conserved motif, and the stringency of the hybridization or amplification will determine whether the probe identifies only naturally occurring sequences encoding LIPAP, allelic variants, or related sequences.
Probes may also be used for the detection of related sequences, and may have at least 50%
sequence identity to any of the LIPAP encoding sequences. The hybridization probes of the subject invention may be DNA or RNA and may be derived from the sequence of SEQ ID
N0:13-24 or from genomic sequences including promoters, enhancers, and introns of the LIPAP
gene.
Means for producing specific hybridization probes for DNAs encoding LIPAP
include the cloning of polynucleotide sequences encoding LIPAP or LIl'AP derivatives into vectors for the production of mRNA probes. Such vectors are known in the art, are commercially available, and may be used to synthesize RNA probes in vitro by means of the addition of the appropriate RNA
polymerases and the appropriate labeled nucleotides. Hybridization probes may be labeled by a variety of reporter groups, for example, by radionuclides such as 32P or 35S, or by enzymatic labels, such as alkaline phosphatase coupled to the probe via avidin/biotin coupling systems, and the like.
Polynucleotide sequences encoding LIPAP may be used for the diagnosis of disorders associated with expression of LIPAP. Examples of such disorders include, but are not limited to, a cardiovascular disorder including blood vessel disorders such as arteriovenous fistula, atherosclerosis, hypertension, vasculitis, Raynaud's disease, aneurysms, arterial dissections, varicose veins, CA 02362300 2001-07-31 pC'j'/US00/04160 thrombophlebitis and phlebothrombosis, vascular tumors, and complications of thrombolysis, balloon angioplasty, vascular replacement, and coronary artery bypass graft surgery;
heart disorders such as congestive heart failure, ischemic heart disease, angina pectoris, myocardial infarction, hypertensive heart disease, degenerative valvular heart disease, calcific aortic valve stenosis, congenitally bicuspid aortic valve, mural annular calcification, mural valve prolapse, rheumatic fever and rheumatic heart disease, infective endocarditis, nonbacterial thrombotic endocarditis, endocarditis of systemic lupus erythematosus, carcinoid heart disease, cardiomyopathy, myocarditis, pericarditis, neoplastic heart disease, congenital heart disease, and complications of cardiac transplantation; and lung disorders such as congenital lung anomalies, atelectasis, pulmonary congestion and edema, pulmonary embolism, pulmonary hemorrhage, pulmonary infarction, pulmonary hypertension, vascular sclerosis, obstructive pulmonary disease, restrictive pulmonary disease, chronic obstructive pulmonary disease, emphysema, chronic bronchitis, bronchial asthma, bronchiectasis, bacterial pneumonia, viral and mycoplasmal pneumonia, lung abscess, pulmonary tuberculosis, diffuse interstitial diseases, pneumoconioses, sarcoidosis, idiopathic pulmonary fibrosis, desquamative interstitial pneumonitis, hypersensitivity pneumonitis, pulmonary eosinophilia bronchiolitis obliterans-organizing pneumonia, diffuse pulmonary hemorrhage syndromes, Goodpasture's syndromes, idiopathic pulmonary hemosiderosis, pulmonary involvement in collagen-vascular disorders, pulmonary alveolar proteinosis, lung tumors, inflammatory and noninflammatory pleural effusions, pneumothorax, pleural tumors, drug-induced lung disease, radiation-induced lung disease, and complications of lung transplantation; a neurological disorder such as epilepsy, ischemic cerebrovascular disease, stroke, cerebral neoplasms, Alzheimer's disease, Pick's disease, Huntington's disease, dementia, Parkinson's disease and other extrapyramidal disorders, amyotrophic lateral sclerosis and other motor neuron disorders, progressive neural muscular atrophy, retinitis pigmentosa, hereditary ataxias, multiple sclerosis and other demyelinating diseases, bacterial and viral meningitis, brain abscess, subdural empyema, epidural abscess, suppurative intracranial thrombophlebitis, myelitis and radiculitis, viral central nervous system disease; prion diseases including kuru, Creutzfeldt-Jakob disease, and Gerstmann-Straussler-Scheinker syndrome;
fatal familial insomnia, nutritional and metabolic diseases of the nervous system, neurofibromatosis, tuberous sclerosis, cerebelloretinal hemangioblastomatosis, encephalotrigeminal syndrome, mental retardation and other developmental disorders of the central nervous system, cerebral palsy, neuroskeletal disorders, autonomic nervous system disorders, cranial nerve disorders, spinal cord diseases, muscular dystrophy and other neuromuscular disorders, peripheral nervous system disorders, dermatomyositis and polymyositis; inherited, metabolic, endocrine, and toxic myopathies;
myasthenia gravis, periodic paralysis; mental disorders including mood, anxiety, and schizophrenic disorders; seasonal affective disorder (SAD); akathesia, amnesia, catatonia, diabetic neuropathy, tardive dyskinesia, dystonias, paranoid psychoses, postherpetic neuralgia, Tourette's disorder, progressive supranuclear palsy, CA 02362300 2001-07-31 pCT~S00/04160 corticobasal degeneration, and familial frontotemporal dementia; a gastrointestinal disorder such as dysphagia, peptic esophagitis, esophageal spasm, esophageal stricture, esophageal carcinoma, dyspepsia, indigestion, gastritis, gastric carcinoma, anorexia, nausea, emesis, gastroparesis, antral or pyloric edema, abdominal angina, pyrosis, gastroenteritis, intestinal obstruction, infections of the intestinal tract, peptic ulcer, cholelithiasis, cholecystitis, cholestasis, pancreatitis, pancreatic carcinoma, biliary tract disease, hepatitis, hyperbilirubinemia, cirrhosis, passive congestion of the liver, hepatoma, infectious colitis, ulcerative colitis, ulcerative proctitis, Crohn's disease, Whipple's disease, Mallory-Weiss syndrome, colonic carcinoma, colonic obstruction, irritable bowel syndrome, short bowel syndrome, diarrhea, constipation, gastrointestinal hemorrhage, acquired immunodeficiency syndrome (AIDS) enteropathy, jaundice, hepatic encephalopathy, hepatorenal syndrome, hepatic steatosis, hemochromatosis, Wilson's disease, alpha,-antitrypsin deficiency, Reye's syndrome, primary sclerosing cholangitis, liver infarction, portal vein obstruction and thrombosis, centrilobular necrosis, peliosis hepatis, hepatic vein thrombosis, veno-occlusive disease, preeclampsia, eclampsia, acute fatty liver of pregnancy, intrahepatic cholestasis of pregnancy, and hepatic tumors including nodular hyperplasias, adenomas, and carcinomas; and a disorder of lipid metabolism such as fatty liver, cholestasis, primary biliary cirrhosis, carnitine deficiency, carnitine palmitoyltransferase deficiency, myoadenylate deaminase deficiency, hypertriglyceridemia, lipid storage disorders such Fabry's disease, Gaucher's disease, Niemann-Pick's disease, metachromatic leukodystrophy, adrenoleukodystrophy, GMZ gangliosidosis, and ceroid lipofuscinosis, abetalipoproteinemia, Tangier disease, hyperlipoproteinemia, diabetes mellitus, lipodystrophy, lipomatoses, acute panniculitis, disseminated fat necrosis, adiposis dolorosa, lipoid adrenal hyperplasia, minimal change disease, lipomas, atherosclerosis, hypercholesterolemia, hypercholesterolemia with hypertriglyceridemia, primary hypoalphalipoproteinemia, hypothyroidism, renal disease, liver disease, lecithin:cholesterol acyltransferase deficiency, cerebrotendinous xanthomatosis, sitosterolemia, hypocholesterolemia, Tay-Sachs disease, Sandhoff's disease, hyperlipidemia, hyperlipemia, lipid myopathies, and obesity. The polynucleotide sequences encoding LIPAP may be used in Southern or northern analysis, dot blot, or other membrane-based technologies; in PCR technologies; in dipstick, pin, and multiformat ELISA-like assays; and in microarrays utilizing fluids or tissues from patients to detect altered LIPAP expression.
Such qualitative or quantitative methods are well known in the art.
In a particular aspect, the nucleotide sequences encoding LIPAP may be useful in assays that detect the presence of associated disorders, particularly those mentioned above. The nucleotide sequences encoding LIPAP may be labeled by standard methods and added to a fluid or tissue sample from a patient under conditions suitable for the formation of hybridization complexes. After a suitable incubation period, the sample is washed and the signal is quantified and compared with a standard value. If the amount of signal in the patient sample is significantly altered in comparison to a control WO 00/49043 CA 02362300 2001-07-31 pCT/ITS00/04160 sample then the presence of altered levels of nucleotide sequences encoding LIPAP in the sample indicates the presence of the associated disorder. Such assays may also be used to evaluate the efficacy of a particular therapeutic treatment regimen in animal studies, in clinical trials, or to monitor the treatment of an individual patient.
In order to provide a basis for the diagnosis of a disorder associated with expression of LIPAP, a normal or standard profile for expression is established. This may be accomplished by combining body fluids or cell extracts taken from normal subjects, either animal or human, with a sequence, or a fragment thereof, encoding LIPAP, under conditions suitable for hybridization or amplification.
Standard hybridization may be quantified by comparing the values obtained from normal subjects with values from an experiment in which a known amount of a substantially purified polynucleotide is used.
Standard values obtained in this manner may be compared with values obtained from samples from patients who are symptomatic for a disorder. Deviation from standard values is used to establish the presence of a disorder.
Once the presence of a disorder is established and a treatment protocol is initiated, hybridization assays may be repeated on a regular basis to determine if the level of expression in the patient begins to approximate that which is observed in the normal subject.
The results obtained from successive assays may be used to show the efficacy of treatment over a period ranging from several days to months.
With respect to cancer, the presence of an abnormal amount of transcript (either under- or overexpressed) in biopsied tissue from an individual may indicate a predisposition for the development of the disease, or may provide a means for detecting the disease prior to the appearance of actual clinical symptoms. A more definitive diagnosis of this type may allow health professionals to employ preventative measures or aggressive treatment earlier thereby preventing the development or further progression of the cancer.
Additional diagnostic uses for oligonucleotides designed from the sequences encoding LIPAP
may involve the use of PCR. These oligomers may be chemically synthesized, generated enzymatically, or produced in vitro. Oligomers will preferably contain a fragment of a polynucleotide encoding LIPAP, or a fragment of a polynucleotide complementary to the polynucleotide encoding LIPAP, and will be employed under optimized conditions for identification of a specific gene or condition. Oligomers may also be employed under less stringent conditions for detection or quantification of closely related DNA or RNA sequences.
Methods which may also be used to quantify the expression of LIPAP include radiolabeling or biotinylating nucleotides, coamplification of a control nucleic acid, and interpolating results from standard curves. (See, e.g., Melby, P.C. et al. (1993) J. Immunol. Methods 159:235-244; Duplaa, C. et al. (1993) Anal. Biochem. 212:229-236.) The speed of quantitation of multiple samples may be accelerated by running the assay in a high-throughput format where the oligomer of interest is presented in various dilutions and a spectrophotometric or colorimetric response gives rapid quantitation.
In further embodiments, oligonucleotides or longer fragments derived from any of the polynucleotide sequences described herein may be used as targets in a microarray. The microarray can be used to monitor the expression level of large numbers of genes simultaneously and to identify genetic variants, mutations, and polymorphisms. This information may be used to determine gene function, to understand the genetic basis of a disorder, to diagnose a disorder, and to develop and monitor the activities of therapeutic agents.
Microarrays may be prepared, used, and analyzed using methods known in the art. (See, e.g., Brennan, T.M. et al. (1995) U.S. Patent No. 5,474,796; Schena, M. et al.
(1996) Proc. Natl. Acad. Sci.
USA 93:10614-10619; Baldeschweiler et al. (1995) PCT application W095/251116;
Shalom D. et al.
(1995) PCT application W095/35505; Heller, R.A. et al. (1997) Proc. Natl.
Acad. Sci. USA 94:2150-2155; and Heller, M.J. et al. ( 1997) U.S. Patent No. 5,605,662.) In another embodiment of the invention, nucleic acid sequences encoding LIPAP
may be used to generate hybridization probes useful in mapping the naturally occurring genomic sequence. The sequences may be mapped to a particular chromosome, to a specific region of a chromosome, or to artificial chromosome constructions, e.g., human artificial chromosomes (HACs), yeast artificial chromosomes (YACs), bacterial artificial chromosomes (BACs), bacterial P1 constructions, or single chromosome cDNA libraries. (See, e.g., Harrington, J.J. et al. (1997) Nat.
Genet. 15:345-355; Price, C.M. (1993) Blood Rev. 7:127-134; and Trask, B.J. (1991) Trends Genet. 7:149-154.) Fluorescent in situ hybridization (FISH) may be correlated with other physical chromosome mapping techniques and genetic map data. (See, e.g., Heinz-Ulrich, et al.
(1995) in Meyers, supra, pp.
965-968.) Examples of genetic map data can be found in various scientific journals or at the Online Mendelian Inheritance in Man (OMIM) World Wide Web site. Correlation between the location of the gene encoding LIPAP on a physical chromosomal map and a specific disorder, or a predisposition to a specific disorder, may help define the region of DNA associated with that disorder. The nucleotide sequences of the invention may be used to detect differences in gene sequences among normal, carrier, and affected individuals.
In situ hybridization of chromosomal preparations and physical mapping techniques, such as linkage analysis using established chromosomal markers, may be used for extending genetic maps.
Often the placement of a gene on the chromosome of another mammalian species, such as mouse, may reveal associated markers even if the number or arm of a particular human chromosome is not known.
New sequences can be assigned to chromosomal arms by physical mapping. This provides valuable information to investigators searching for disease genes using positional cloning or other gene discovery techniques. Once the disease or syndrome has been crudely localized by genetic linkage to a particular genomic region, e.g., ataxia-telangiectasia to l 1q22-23, any sequences mapping to that area may represent associated or regulatory genes for further investigation. (See, e.g., Gatti, R.A. et al.
(1988) Nature 336:577-580.) The nucleotide sequence of the subject invention may also be used to detect differences in the chromosomal location due to translocation, inversion, etc., among normal, carrier, or affected individuals.
In another embodiment of the invention, LIPAP, its catalytic or immunogenic fragments, or oligopeptides thereof can be used for screening libraries of compounds in any of a variety of drug screening techniques. The fragment employed in such screening may be free in solution, affixed to a solid support, borne on a cell surface, or located intracellularly. The formation of binding complexes between LIPAP and the agent being tested may be measured.
Another technique for drug screening provides for high throughput screening of compounds having suitable binding affinity to the protein of interest. (See, e.g., Geysen, et al. ( 1984) PCT
application W084/03564.) In this method, large numbers of different small test compounds are synthesized on a solid substrate. The test compounds are reacted with LIPAP, or fragments thereof, and washed. Bound LIPAP is then detected by methods well known in the art.
Purified LIPAP can also be coated directly onto plates for use in the aforementioned drug screening techniques.
Alternatively, non-neutralizing antibodies can be used to capture the peptide and immobilize it on a solid support.
In another embodiment, one may use competitive drug screening assays in which neutralizing antibodies capable of binding LIPAP specifically compete with a test compound for binding LIPAP. In this manner, antibodies can be used to detect the presence of any peptide which shares one or more antigenic determinants with LIPAP.
In additional embodiments, the nucleotide sequences which encode LIPAP may be used in any molecular biology techniques that have yet to be developed, provided the new techniques rely on properties of nucleotide sequences that are currently known, including, but not limited to, such properties as the triplet genetic code and specific base pair interactions.
Without further elaboration, it is believed that one skilled in the art can, using the preceding description, utilize the present invention to its fullest extent. The following preferred specific embodiments are, therefore, to be construed as merely illustrative, and not limitative of the remainder of the disclosure in any way whatsoever.
The disclosures of all patents, applications, and publications mentioned above and below, in particular U.S. Ser. No. 60/120,703 and U.S. Ser. No. 60/142,762, are hereby expressly incorporated by reference.
WO 00/49043 CA 02362300 2001-07-31 pCT~S00/04160 EXAMPLES
I. Construction of cDNA Libraries RNA was purchased from Clontech or isolated from tissues described in Table 4.
Some tissues were homogenized and lysed in guanidinium isothiocyanate, while others were homogenized and lysed in phenol or in a suitable mixture of denaturants, such as TRIZOL (Life Technologies), a monophasic solution of phenol and guanidine isothiocyanate. The resulting lysates were centrifuged over CsCI
cushions or extracted with chloroform. RNA was precipitated from the lysates with either isopropanol or sodium acetate and ethanol, or by other routine methods.
Phenol extraction and precipitation of RNA were repeated as necessary to increase RNA purity.
In some cases, RNA was treated with DNase. For most libraries, poly(A+) RNA
was isolated using oligo d(T)-coupled paramagnetic particles (Promega), OLIGOTEX latex particles (QIAGEN, Chatsworth CA), or an OLIGOTEX mRNA purification kit (QIAGEN). Alternatively, RNA was isolated directly from tissue lysates using other RNA isolation kits, e.g., the POLY(A)PURE mRNA
purification kit (Ambion, Austin TX).
In some cases, Stratagene was provided with RNA and constructed the corresponding cDNA
libraries. Otherwise, cDNA was synthesized and cDNA libraries were constructed with the UNIZAP
vector system (Stratagene) or SUPERSCRIPT plasmid system (Life Technologies), using the recommended procedures or similar methods known in the art. (See, e.g., Ausubel, 1997, supra, units 5.1-6.6.) Reverse transcription was initiated using oligo d(T) or random primers. Synthetic oligonucleotide adapters were ligated to double stranded cDNA, and the cDNA
was digested with the appropriate restriction enzyme or enzymes. For most libraries, the cDNA was size-selected (300-1000 bp) using SEPHACRYL S 1000, SEPHAROSE CL2B, or SEPHAROSE CL4B column chromatography (Amersham Pharmacia Biotech) or preparative agarose gel electrophoresis. cDNAs were ligated into compatible restriction enzyme sites of the polylinker of a suitable plasmid, e.g., PBLUESCRIPT
plasmid (Stratagene), PSPORT1 plasmid (Life Technologies), or pINCY (Incyte Pharmaceuticals, Palo Alto CA). Recombinant plasmids were transformed into competent E. coli cells including XLl-Blue, XL1-BIueMRF, or SOLR from Stratagene or DHSa, DH10B, or ElectroMAX DH10B from Life Technologies.
II. Isolation of cDNA Clones Plasmids were recovered from host cells by in vivo excision using the UNIZAP
vector system (Stratagene) or by cell lysis. Plasmids were purified using at least one of the following: a Magic or WIZARD Minipreps DNA purification system (Promega); an AGTC Miniprep purification kit (Edge Biosystems, Gaithersburg MD); and QIAWELL 8 Plasmid, QIAWELL 8 Plus Plasmid, Ultra Plasmid purification systems or the R.E.A.L. PREP 96 plasmid purification kit from QIAGEN.
Following precipitation, plasmids were resuspended in 0.1 ml of distilled water and stored, with or WO 00/49043 CA 02362300 2001-07-31 pCT/US00/04160 without lyophilization, at 4°C.
Alternatively, plasmid DNA was amplified from host cell lysates using direct link PCR in a high-throughput format (Rao, V.B. (1994) Anal. Biochem. 216:1-14). Host cell lysis and thermal cycling steps were carried out in a single reaction mixture. Samples were processed and stored in 384-well plates, and the concentration of amplified plasmid DNA was quantified fluorometrically using PICOGREEN dye (Molecular Probes, Eugene OR) and a FLUOROSKAN II fluorescence scanner (Labsystems Oy, Helsinki, Finland).
III. Sequencing and Analysis cDNA sequencing reactions were processed using standard methods or high-throughput instrumentation such as the ABI CATALYST 800 (Perkin-Elmer) thermal cycler or the PTC-200 thermal cycler (MJ Research) in conjunction with the HYDRA microdispenser (Robbins Scientific) or the MICROLAB 2200 (Hamilton) liquid transfer system. cDNA sequencing reactions were prepared using reagents provided by Amersham Pharmacia Biotech or supplied in ABI
sequencing kits such as the ABI PRISM BIGDYE Terminator cycle sequencing ready reaction kit (Perkin-Elmer).
Electrophoretic separation of cDNA sequencing reactions and detection of labeled polynucleotides were carried out using the MEGABACE 1000 DNA sequencing system (Molecular Dynamics); the ABI PRISM 373 or 377 sequencing system (Perkin-Elmer) in conjunction with standard ABI protocols and base calling software; or other sequence analysis systems known in the art. Reading frames within the cDNA sequences were identified using standard methods (reviewed in Ausubel, 1997, supra, unit 7.7). Some of the cDNA sequences were selected for extension using the techniques disclosed in Example VI.
The polynucleotide sequences derived from cDNA sequencing were assembled and analyzed using a combination of software programs which utilize algorithms well known to those skilled in the art. Table 5 summarizes the tools, programs, and algorithms used and provides applicable descriptions, references, and threshold parameters. The first column of Table 5 shows the tools, programs, and algorithms used, the second column provides brief descriptions thereof, the third column presents appropriate references, all of which are incorporated by reference herein in their entirety, and the fourth column presents, where applicable, the scores, probability values, and other parameters used to evaluate the strength of a match between two sequences (the higher the score, the greater the homology between two sequences). Sequences were analyzed using MACDNASIS PRO software (Hitachi Software Engineering, South San Francisco CA) and LASERGENE software (DNASTAR).
Polynucleotide and polypeptide sequence alignments were generated using the default parameters specified by the clustal algorithm as incorporated into the MEGALIGN multisequence alignment program (DNASTAR), which also calculates the percent identity between aligned sequences.
The polynucleotide sequences were validated by removing vector, linker, and polyA sequences and by masking ambiguous bases, using algorithms and programs based on BLAST, dynamic programing, and dinucleotide nearest neighbor analysis. The sequences were then queried against a selection of public databases such as the GenBank primate, rodent, mammalian, vertebrate, and eukaryote databases, and BLOCKS, PRINTS, DOMO, PRODOM, and PFAM to acquire annotation using programs based on BLAST, FASTA, and BLIMPS. The sequences were assembled into full length polynucleotide sequences using programs based on Phred, Phrap, and Consed, and were screened for open reading frames using programs based on GeneMark, BLAST, and FASTA. The full length polynucleotide sequences were translated to derive the corresponding full length amino acid sequences, and these full length sequences were subsequently analyzed by querying against databases such as the GenBank databases (described above), SwissProt, BLOCKS, PRINTS, DOMO, PRODOM, Prosite, and Hidden Markov Model (HMM)-based protein family databases such as PFAM. HMM is a probabilistic approach which analyzes consensus primary structures of gene families. (See, e.g., Eddy, S.R. (1996) Curr. Opin. Struct. Biol. 6:361-365.) The programs described above for the assembly and analysis of full length polynucleotide and amino acid sequences were also used to identify polynucleotide sequence fragments from SEQ ID
N0:13-24. Fragments from about 20 to about 4000 nucleotides which are useful in hybridization and amplification technologies were described in The Invention section above.
IV. Northern Analysis Northern analysis is a laboratory technique used to detect the presence of a transcript of a gene and involves the hybridization of a labeled nucleotide sequence to a membrane on which RNAs from a particular cell type or tissue have been bound. (See, e.g., Sambrook, supra, ch. 7; Ausubel, 1995, supra, ch. 4 and 16.) Analogous computer techniques applying BLAST were used to search for identical or related molecules in nucleotide databases such as GenBank or LIFESEQ (Incyte Pharmaceuticals). This analysis is much faster than multiple membrane-based hybridizations. In addition, the sensitivity of the computer search can be modified to determine whether any particular match is categorized as exact or similar. The basis of the search is the product score, which is defined as:
% sequence identity x % maximum BLAST score The product score takes into account both the degree of similarity between two sequences and the length of the sequence match. For example, with a product score of 40, the match will be exact within a 1 % to 2% error, and, with a product score of 70, the match will be exact.
Similar molecules are usually identified by selecting those which show product scores between 15 and 40, although lower scores may identify related molecules.
The results of northern analyses are reported as a percentage distribution of libraries in which WO 00/49043 PCT/iJS00/04160 the transcript encoding LIPAP occurred. Analysis involved the categorization of cDNA libraries by organ/tissue and disease. The organ/tissue categories included cardiovascular, dermatologic, developmental, endocrine, gastrointestinal, hematopoietic/immune, musculoskeletal, nervous, reproductive, and urologic. The disease/condition categories included cancer, inflammation, trauma, cell proliferation, neurological, and pooled. For each category, the number of libraries expressing the sequence of interest was counted and divided by the total number of libraries across all categories.
Percentage values of tissue-specific and disease- or condition-specific expression are reported in Table V. Chromosomal Mapping of LIPAP Encoding Polynucleotides The cDNA sequences which were used to assemble SEQ ID N0:19-24 were compared with sequences from the Incyte LIFESEQ database and public domain databases using BLAST and other implementations of the Smith-Waterman algorithm. Sequences from these databases that matched SEQ ID N0:19-24 were assembled into clusters of contiguous and overlapping sequences using assembly algorithms such as Phrap (Table 5). Radiation hybrid and genetic mapping data available from public resources such as the Stanford Human Genome Center (SHGC), Whitehead Institute for Genome Research (WIGR), and Genethon were used to determine if any of the clustered sequences had been previously mapped. Inclusion of a mapped sequence in a cluster resulted in the assignment of all sequences of that cluster, including its particular SEQ ID NO:, to that map location.
The genetic map location of SEQ ID N0:21 is described in The Invention as a range, or interval, of human chromosome 11. The map position of an interval, in centiMorgans, is measured relative to the terminus of the chromosome's p-arm. (The centiMorgan (cM) is a unit of measurement based on recombination frequencies between chromosomal markers. On average, 1 cM is roughly equivalent to 1 megabase (Mb) of DNA in humans, although this can vary widely due to hot and cold spots of recombination.) The cM distances are based on genetic markers mapped by Genethon which provide boundaries for radiation hybrid markers whose sequences were included in each of the clusters.
Human genome maps and other resources available to the public, such as the NCBI "GeneMap'99"
World Wide Web site (http:l/www.ncbi.nlm.nih.gov/genemap/), can be employed to determine if previously identified disease genes map within or in proximity to the intervals indicated above.
VI. Extension of LIPAP Encoding Polynucleotides The full length nucleic acid sequences of SEQ ID N0:13-24 were produced by extension of an appropriate fragment of the full length molecule using oligonucleotide primers designed from this fragment. One primer was synthesized to initiate 5' extension of the known fragment, and the other primer, to initiate 3' extension of the known fragment. The initial primers were designed using OLIGO
Copines promote lipid vesicle aggregation. They contain a C2 domain associated with membrane activity and an annexin-type domain that mediates interactions between integral and extracellular proteins and is associated with calcium binding and regulation (Creutz, C.E.
(1998) J. Biol. Chem.
273:1393-1402). Other C2-containing proteins include the synaptotagmins, a family of proteins involved in vesicular trafficking. Synaptotagmin concentrations in cerebrospinal fluid have been found to be reduced in early-onset Alzheimer's disease (Gottfries, C.G. et al.
(1998) J. Neural Transm.
105:773-786).
Lipids and their associated proteins have roles in human diseases and disorders. Increased synthesis of long-chain fatty acids occurs in neoplasms including those of the breast, prostate, ovary, colon and endometrium. There is a strong inverse correlation between the levels of plasma HDL and risk of premature coronary heart disease. Absence of the LDL receptor, the cause of familial hypercholesterolemia, leads to increased plasma cholesterol levels and ultimately to atherosclerosis (Stryer, supra, pp. 691-702). The arterial disease atherosclerosis is characterized by the formation of fatty lesions on the inside of the arterial wall. These lesions promote the loss of arterial flexibility and the formation of blood clots (Guyton, supra). Oxysterols are present in human atherosclerotic plaques and believed to play an active role in plaque development (Brown, A.J. (1999) Atherosclerosis 142:1-28). Steatosis, or fatty liver, is characterized by the accumulation of triglycerides in the liver and may occur in association with a variety of conditions including alcoholism, diabetes, obesity, and prolonged parenteral nutrition. Steatosis may lead to fibrosis and cirrhosis of the liver. In Tay-Sachs disease, the GMZ ganglioside (a sphingolipid) accumulates in lysosomes of the central nervous system due to a lack of the enzyme N-acetylhexosaminidase. Patients suffer nervous system degeneration leading to early death (Fauci, A.S. et al. (1998) Harrison's Principles of Internal Medicine McGraw-Hill, New York NY
p. 2171). The Niemann-Pick diseases are caused by defects in lipid metabolism.
Niemann-Pick diseases types A and B are caused by accumulation of sphingomyelin (a sphingolipid) and other lipids in the central nervous system due to a defect in the enzyme sphingomyelinase, leading to neurodegeneration and lung disease. Niemann-Pick disease type C results from a defect in cholesterol transport, leading to the accumulation of sphingomyelin and cholesterol in lysosomes and a secondary reduction in sphingomyelinase activity. Neurological symptoms such as grand mal seizures, ataxia, and loss of previously learned speech, manifest 1-2 years after birth. A mutation in the NPC protein, which contains a putative cholesterol-sensing domain, was found in a mouse model of Niemann-Pick disease type C (Fauci, supra,, p. 2175; Loftus, S.K. et al. (1997) Science 277:232-235).
The discovery of new human lipid-associated proteins and the polynucleotides encoding them satisfies a need in the art by providing new compositions which are useful in the diagnosis, prevention, and treatment of cardiovascular, neurological, and gastrointestinal disorders, and disorders of lipid metabolism.
SUMMARY OF THE INVENTION
The invention features purified polypeptides, human lipid-associated proteins, referred to collectively as "LIPAP" and individually as "LIPAP-I," "LIPAP-2," "LIPAP-3,"
"LIPAP-4," "LIPAP-5," "LIPAP-6," "LIPAP-7," "LIPAP-8," "LIPAP-9," "LIPAP-10," "LIPAP-11," and "LIPAP-12." In one aspect, the invention provides an isolated polypeptide comprising a) an amino acid sequence selected from the group consisting of SEQ ID NO:1-12, b) a naturally occurring amino acid sequence having at least 90% sequence identity to an amino acid sequence selected from the group consisting of SEQ ID NO:1-12, c) a biologically active fragment of an amino acid sequence selected from the group consisting of SEQ ID NO:1-12, or d) an immunogenic fragment of an amino acid sequence selected from the group consisting of SEQ ID NO:1-12. In one alternative, the invention provides an isolated polypeptide comprising the amino acid sequence of SEQ ID NO:1-12.
The invention further provides an isolated polynucleotide encoding a polypeptide comprising a) an amino acid sequence selected from the group consisting of SEQ ID NO:1-12, b) a naturally occurring amino acid sequence having at least 90% sequence identity to an amino acid sequence selected from the group consisting of SEQ ID NO:I-12, c) a biologically active fragment of an amino acid sequence selected from the group consisting of SEQ ID NO:1-12, or d) an immunogenic fragment of an amino acid sequence selected from the group consisting of SEQ ID NO:1-12. In one alternative, the polynucleotide is selected from the group consisting of SEQ ID N0:13-24.
Additionally, the invention provides a recombinant polynucleotide comprising a promoter sequence operably linked to a polynucleotide encoding a polypeptide comprising a) an amino acid sequence selected from the group consisting of SEQ ID NO: I-12, b) a naturally occurring amino acid sequence having at least 90% sequence identity to an amino acid sequence selected from the group consisting of SEQ ID NO:I-12, c) a biologically active fragment of an amino acid sequence selected from the group consisting of SEQ ID NO:1-12, or d) an immunogenic fragment of an amino acid sequence selected from the group consisting of SEQ ID NO:I-12. In one alternative, the invention provides a cell transformed with the recombinant polynucleotide. In another alternative, the invention provides a transgenic organism comprising the recombinant polynucleotide.
The invention also provides a method for producing a polypeptide comprising a) an amino acid sequence selected from the group consisting of SEQ ID NO:1-12, b) a naturally occurnng amino acid sequence having at least 90% sequence identity to an amino acid sequence selected from the group consisting of SEQ ID NO:1-12, c) a biologically active fragment of an amino acid sequence selected from the group consisting of SEQ ID NO:1-12, or d) an immunogenic fragment of an amino acid sequence selected from the group consisting of SEQ ID NO:I-12. The method comprises a) culturing a cell under conditions suitable for expression of the polypeptide, wherein said cell is transformed with a recombinant polynucleotide comprising a promoter sequence operably linked to a polynucleotide encoding the polypeptide, and b) recovering the polypeptide so expressed.
Additionally, the invention provides an isolated antibody which specifically binds to a polypeptide comprising a) an amino acid sequence selected from the group consisting of SEQ ID NO:1-12, b) a naturally occurring amino acid sequence having at least 90% sequence identity to an amino acid sequence selected from the group consisting of SEQ ID NO:1-12, c) a biologically active fragment of an amino acid sequence selected from the group consisting of SEQ ID NO:1-12, or d) an immunogenic fragment of an amino acid sequence selected from the group consisting of SEQ ID NO:1-12.
The invention further provides an isolated polynucleotide comprising a) a polynucleotide sequence selected from the group consisting of SEQ ID N0:13-24, b) a naturally occurring polynucleotide sequence having at least 90% sequence identity to a polynucleotide sequence selected from the group consisting of SEQ ID N0:13-24, c) a polynucleotide sequence complementary to a), or d) a polynucleotide sequence complementary to b). In one alternative, the polynucleotide comprises at least 60 contiguous nucleotides.
Additionally, the invention provides a method for detecting a target polynucleotide in a sample, said target polynucleotide having a sequence of a polynucleotide comprising a) a polynucleotide sequence selected from the group consisting of SEQ ID N0:13-24, b) a naturally occurring polynucleotide sequence having at least 90% sequence identity to a polynucleotide sequence selected from the group consisting of SEQ ID N0:13-24, c) a polynucleotide sequence complementary to a), or d) a polynucleotide sequence complementary to b). The method comprises a) hybridizing the sample with a probe comprising at least 16 contiguous nucleotides comprising a sequence complementary to said target polynucleotide in the sample, and which probe specifically hybridizes to said target polynucleotide, under conditions whereby a hybridization complex is formed between said probe and said target polynucleotide, and b) detecting the presence or absence of said hybridization complex, and optionally, if present, the amount thereof. In one alternative, the probe comprises at least 30 contiguous nucleotides. In another alternative, the probe comprises at least 60 contiguous nucleotides.
The invention further provides a pharmaceutical composition comprising an effective amount of a polypeptide comprising a) an amino acid sequence selected from the group consisting of SEQ ID
NO:l-12, b) a naturally occurring amino acid sequence having at least 90%
sequence identity to an amino acid sequence selected from the group consisting of SEQ ID NO:1-12, c) a biologically active fragment of an amino acid sequence selected from the group consisting of SEQ
ID NO:1-12, or d) an immunogenic fragment of an amino acid sequence selected from the group consisting of SEQ ID NO:1-12, and a pharmaceutically acceptable excipient. The invention additionally provides a method of treating a disease or condition associated with decreased expression of functional LIPAP, comprising administering to a patient in need of such treatment the pharmaceutical composition.
The invention also provides a method for screening a compound for effectiveness as an agonist of a polypeptide comprising a) an amino acid sequence selected from the group consisting of SEQ ID
NO:1-12, b) a naturally occurring amino acid sequence having at least 90%
sequence identity to an amino acid sequence selected from the group consisting of SEQ ID NO:1-12, c) a biologically active fragment of an amino acid sequence selected from the group consisting of SEQ
ID NO:1-12, or d) an immunogenic fragment of an amino acid sequence selected from the group consisting of SEQ ID NO:1-12. The method comprises a) exposing a sample comprising the polypeptide to a compound, and b) detecting agonist activity in the sample. In one alternative, the invention provides a pharmaceutical composition comprising an agonist compound identified by the method and a pharmaceutically acceptable excipient. In another alternative, the invention provides a method of treating a disease or condition associated with decreased expression of functional LIPAP, comprising administering to a patient in need of such treatment the pharmaceutical composition.
Additionally, the invention provides a method for screening a compound for effectiveness as an antagonist of a polypeptide comprising a) an amino acid sequence selected from the group consisting of SEQ ID NO: l-12, b) a naturally occurring amino acid sequence having at least 90% sequence identity to an amino acid sequence selected from the group consisting of SEQ ID NO:1-12, c) a biologically active fragment of an amino acid sequence selected from the group consisting of SEQ ID NO:1-12, or d) an immunogenic fragment of an amino acid sequence selected from the group consisting of SEQ ID
NO:1-12. The method comprises a) exposing a sample comprising the polypeptide to a compound, and b) detecting antagonist activity in the sample. In one alternative, the invention provides a pharmaceutical composition comprising an antagonist compound identified by the method and a pharmaceutically acceptable excipient. In another alternative, the invention provides a method of treating a disease or condition associated with overexpression of functional LIPAP, comprising administering to a patient in need of such treatment the pharmaceutical composition.
The invention further provides a method for screening a compound for effectiveness in altering expression of a target polynucleotide, wherein said target polynucleotide comprises a sequence selected from the group consisting of SEQ ID N0:13-24, the method comprising a) exposing a sample comprising the target polynucleotide to a compound, and b) detecting altered expression of the target polynucleotide.
BRIEF DESCRIPTION OF THE TABLES
Table 1 shows polypeptide and nucleotide sequence identification numbers (SEQ
ID NOs), clone identification numbers (clone IDs), cDNA libraries, and cDNA fragments used to assemble full-WO 00/49043 PCT/iJS00/04160 length sequences encoding LIPAP.
Table 2 shows features of each polypeptide sequence, including potential motifs, homologous sequences, and methods, algorithms, and searchable databases used for analysis of LIPAP.
Table 3 shows selected fragments of each nucleic acid sequence; the tissue-specific expression patterns of each nucleic acid sequence as determined by northern analysis;
diseases, disorders, or conditions associated with these tissues; and the vector into which each cDNA
was cloned.
Table 4 describes the tissues used to construct the cDNA libraries from which cDNA clones encoding LIPAP were isolated.
Table 5 shows the tools, programs, and algorithms used to analyze LIPAP, along with applicable descriptions, references, and threshold parameters.
DESCRIPTION OF THE INVENTION
Before the present proteins, nucleotide sequences, and methods are described, it is understood that this invention is not limited to the particular machines, materials and methods described, as these may vary. It is also to be understood that the terminology used herein is for the purpose of describing particular embodiments only, and is not intended to limit the scope of the present invention which will be limited only by the appended claims.
It must be noted that as used herein and in the appended claims, the singular forms "a," "an,"
and "the" include plural reference unless the context clearly dictates otherwise. Thus, for example, a reference to "a host cell" includes a plurality of such host cells, and a reference to "an antibody" is a reference to one or more antibodies and equivalents thereof known to those skilled in the art, and so forth.
Unless defined otherwise, all technical and scientific terms used herein have the same meanings as commonly understood by one of ordinary skill in the art to which this invention belongs.
Although any machines, materials, and methods similar or equivalent to those described herein can be used to practice or test the present invention, the preferred machines, materials and methods are now described. All publications mentioned herein are cited for the purpose of describing and disclosing the cell lines, protocols, reagents and vectors which are reported in the publications and which might be used in connection with the invention. Nothing herein is to be construed as an admission that the invention is not entitled to antedate such disclosure by virtue of prior invention.
DEFINITIONS
"LIPAP" refers to the amino acid sequences of substantially purified LIPAP
obtained from any species, particularly a mammalian species, including bovine, ovine, porcine, murine, equine, and human, and from any source, whether natural, synthetic, semi-synthetic, or recombinant.
The term "agonist" refers to a molecule which intensifies or mimics the biological activity of WO 00/49043 CA 02362300 2001-07-31 pCT~S00/04160 LIPAP. Agonists may include proteins, nucleic acids, carbohydrates, small molecules, or any other compound or composition which modulates the activity of LIPAP either by directly interacting with LIPAP or by acting on components of the biological pathway in which LIPAP
participates.
An "allelic variant" is an alternative form of the gene encoding LIPAP.
Allelic variants may result from at least one mutation in the nucleic acid sequence and may result in altered mRNAs or in polypeptides whose structure or function may or may not be altered. A gene may have none, one, or many allelic variants of its naturally occurring form. Common mutational changes which give rise to allelic variants are generally ascribed to natural deletions, additions, or substitutions of nucleotides.
Each of these types of changes may occur alone, or in combination with the others, one or more times in a given sequence.
"Altered" nucleic acid sequences encoding LIPAP include those sequences with deletions, insertions, or substitutions of different nucleotides, resulting in a polypeptide the same as LIPAP or a polypeptide with at least one functional characteristic of LIPAP. Included within this definition are polymorphisms which may or may not be readily detectable using a particular oligonucleotide probe of the polynucleotide encoding LIPAP, and improper or unexpected hybridization to allelic variants, with a locus other than the normal chromosomal locus for the polynucleotide sequence encoding LIPAP.
The encoded protein may also be "altered," and may contain deletions, insertions, or substitutions of amino acid residues which produce a silent change and result in a functionally equivalent LIPAP.
Deliberate amino acid substitutions may be made on the basis of similarity in polarity, charge, solubility, hydrophobicity, hydrophilicity, and/or the amphipathic nature of the residues, as long as the biological or immunological activity of LIPAP is retained. For example, negatively charged amino acids may include aspartic acid and glutamic acid, and positively charged amino acids may include lysine and arginine. Amino acids with uncharged polar side chains having similar hydrophilicity values may include: asparagine and glutamine; and serine and threonine. Amino acids with uncharged side chains having similar hydrophilicity values may include: leucine, isoleucine, and valine; glycine and alanine; and phenylalanine and tyrosine.
The terms "amino acid" and "amino acid sequence" refer to an oligopeptide, peptide, polypeptide, or protein sequence, or a fragment of any of these, and to naturally occurring or synthetic molecules. Where "amino acid sequence" is recited to refer to an amino acid sequence of a naturally occurring protein molecule, "amino acid sequence" and like terms are not meant to limit the amino acid sequence to the complete native amino acid sequence associated with the recited protein molecule.
"Amplification" relates to the production of additional copies of a nucleic acid sequence.
Amplification is generally carried out using polymerise chain reaction (PCR) technologies well known in the art.
The term "antagonist" refers to a molecule which inhibits or attenuates the biological activity of LIPAP. Antagonists may include proteins such as antibodies, nucleic acids, carbohydrates, small molecules, or any other compound or composition which modulates the activity of LIPAP either by directly interacting with LIPAP or by acting on components of the biological pathway in which LIPAP
participates.
The term "antibody" refers to intact immunoglobulin molecules as well as to fragments thereof, such as Fab, F(ab')2, and Fv fragments, which are capable of binding an epitopic determinant.
Antibodies that bind LIPAP polypeptides can be prepared using intact polypeptides or using fragments containing small peptides of interest as the immunizing antigen. The polypeptide or oligopeptide used to immunize an animal (e.g., a mouse, a rat, or a rabbit) can be derived from the translation of RNA, or synthesized chemically, and can be conjugated to a carrier protein if desired.
Commonly used carriers that are chemically coupled to peptides include bovine serum albumin, thyroglobulin, and keyhole limpet hemocyanin (KLH). The coupled peptide is then used to immunize the animal.
The term "antigenic determinant" refers to that region of a molecule (i.e., an epitope) that makes contact with a particular antibody. When a protein or a fragment of a protein is used to immunize a host animal, numerous regions of the protein may induce the production of antibodies which bind specifically to antigenic determinants (particular regions or three-dimensional structures on the protein). An antigenic determinant may compete with the intact antigen (i.e., the immunogen used to elicit the immune response) for binding to an antibody.
The term "antisense" refers to any composition capable of base-pairing with the "sense" strand of a specific nucleic acid sequence. Antisense compositions may include DNA;
RNA; peptide nucleic acid (PNA); oligonucleotides having modified backbone linkages such as phosphorothioates, methylphosphonates, or benzylphosphonates; oligonucleotides having modified sugar groups such as 2'-methoxyethyl sugars or 2'-methoxyethoxy sugars; or oligonucleotides having modified bases such as 5-methyl cytosine, 2'-deoxyuracil, or 7-deaza-2'-deoxyguanosine. Antisense molecules may be produced by any method including chemical synthesis or transcription. Once introduced into a cell, the complementary antisense molecule base-pairs with a naturally occurring nucleic acid sequence produced by the cell to form duplexes which block either transcription or translation. The designation "negative" or "minus" can refer to the antisense strand, and the designation "positive" or "plus" can refer to the sense strand of a reference DNA molecule.
The term "biologically active" refers to a protein having structural, regulatory, or biochemical functions of a naturally occurring molecule. Likewise, "immunologically active" refers to the capability of the natural, recombinant, or synthetic LIPAP, or of any oligopeptide thereof, to induce a specific immune response in appropriate animals or cells and to bind with specific antibodies.
The terms "complementary" and "complementarity" refer to the natural binding of polynucleotides by base pairing. For example, the sequence "5' A-G-T 3"' bonds to the complementary sequence "3' T-C-A 5'." Complementarity between two single-stranded molecules may be "partial,"
such that only some of the nucleic acids bind, or it may be "complete," such that total complementarity exists between the single stranded molecules. The degree of complementarity between nucleic acid strands has significant effects on the efficiency and strength of the hybridization between the nucleic acid strands. This is of particular importance in amplification reactions, which depend upon binding between nucleic acid strands, and in the design and use of peptide nucleic acid (PNA) molecules.
A "composition comprising a given polynucleotide sequence" and a "composition comprising a given amino acid sequence" refer broadly to any composition containing the given polynucleotide or amino acid sequence. The composition may comprise a dry formulation or an aqueous solution.
Compositions comprising polynucleotide sequences encoding LIPAP or fragments of LIPAP may be employed as hybridization probes. The probes may be stored in freeze-dried form and may be associated with a stabilizing agent such as a carbohydrate. In hybridizations, the probe may be deployed in an aqueous solution containing salts (e.g., NaCI), detergents (e.g., sodium dodecyl sulfate;
SDS), and other components (e.g., Denhardt's solution, dry milk, salmon sperm DNA, etc.).
"Consensus sequence" refers to a nucleic acid sequence which has been resequenced to resolve uncalled bases, extended using the XL-PCR kit (Perkin-Elmer, Norwalk CT) in the 5' and/or the 3' direction, and resequenced, or which has been assembled from the overlapping sequences of one or more Incyte Clones and, in some cases, one or more public domain ESTs, using a computer program for fragment assembly, such as the GELVIEW fragment assembly system (GCG, Madison WI). Some sequences have been both extended and assembled to produce the consensus sequence.
"Conservative amino acid substitutions" are those substitutions that, when made, least interfere with the properties of the original protein, i.e., the structure and especially the function of the protein is conserved and not significantly changed by such substitutions. The table below shows amino acids which may be substituted for an original amino acid in a protein and which are regarded as conservative amino acid substitutions.
Original Residue Conservative Substitution Ala Gly, Ser Arg His, Lys Asn Asp, Gln, His Asp Asn, Glu Cys Ala, Ser Gln Asn, Glu, His Glu Asp, Gln, His Gly Ala His Asn, Arg, Gln, Glu Ile Leu, Val Leu Ile, Val Lys Arg, Gln, Glu Met Leu, Ile Phe His, Met, Leu, Trp, Tyr Ser Cys, Thr Thr Ser, Val Trp Phe, Tyr Tyr His, Phe, Trp Val Ile, Leu, Thr Conservative amino acid substitutions generally maintain (a) the structure of the polypeptide backbone in the area of the substitution, for example, as a beta sheet or alpha helical conformation, (b) the charge or hydrophobicity of the molecule at the site of the substitution, and/or (c) the bulk of the side chain.
A "deletion" refers to a change in the amino acid or nucleotide sequence that results in the absence of one or more amino acid residues or nucleotides.
The term "derivative" refers to the chemical modification of a polypeptide sequence, or a polynucleotide sequence. Chemical modifications of a polynucleotide sequence can include, for example, replacement of hydrogen by an alkyl, acyl, hydroxyl, or amino group.
A derivative polynucleotide encodes a polypeptide which retains at least one biological or immunological function of the natural molecule. A derivative polypeptide is one modified by glycosylation, pegylation, or any similar process that retains at least one biological or immunological function of the polypeptide from which it was derived.
A "fragment" is a unique portion of LIPAP or the polynucleotide encoding LIPAP
which is identical in sequence to but shorter in length than the parent sequence. A
fragment may comprise up to the entire length of the defined sequence, minus one nucleotide/amino acid residue. For example, a fragment may comprise from 5 to 1000 contiguous nucleotides or amino acid residues. A fragment used as a probe, primer, antigen, therapeutic molecule, or for other purposes, may be at least 5, 10, 15, 20, 25, 30, 40, 50, 60, 75, 100, 150, 250 or at least 500 contiguous nucleotides or amino acid residues in length. Fragments may be preferentially selected from certain regions of a molecule. For example, a polypeptide fragment may comprise a certain length of contiguous amino acids selected from the first 250 or 500 amino acids (or first 25% or 50% of a polypeptide) as shown in a certain defined sequence.
Clearly these lengths are exemplary, and any length that is supported by the specification, including the Sequence Listing, tables, and figures, may be encompassed by the present embodiments.
A fragment of SEQ ID N0:13-24 comprises a region of unique polynucleotide sequence that specifically identifies SEQ ID N0:13-24, for example, as distinct from any other sequence in the same genome. A fragment of SEQ ID N0:13-24 is useful, for example, in hybridization and amplification technologies and in analogous methods that distinguish SEQ ID N0:13-24 from related polynucleotide sequences. The precise length of a fragment of SEQ ID N0:13-24 and the region of SEQ ID N0:13-24 to which the fragment corresponds are routinely determinable by one of ordinary skill in the art based on the intended purpose for the fragment.
A fragment of SEQ ID NO:1-12 is encoded by a fragment of SEQ ID N0:13-24. A
fragment of SEQ ID NO:1-12 comprises a region of unique amino acid sequence that specifically identifies SEQ
ID NO:1-12. For example, a fragment of SEQ ID NO:1-12 is useful as an immunogenic peptide for the development of antibodies that specifically recognize SEQ ID NO:1-12. The precise length of a fragment of SEQ ID NO:1-12 and the region of SEQ ID NO:1-12 to which the fragment corresponds are routinely determinable by one of ordinary skill in the art based on the intended purpose for the fragment.
The term "similarity" refers to a degree of complementarity. There may be partial similarity or complete similarity. The word "identity" may substitute for the word "similarity." A partially complementary sequence that at least partially inhibits an identical sequence from hybridizing to a target nucleic acid is referred to as "substantially similar." The inhibition of hybridization of the completely complementary sequence to the target sequence may be examined using a hybridization assay (Southern or northern blot, solution hybridization, and the like) under conditions of reduced stringency. A substantially similar sequence or hybridization probe will compete for and inhibit the binding of a completely similar (identical) sequence to the target sequence under conditions of reduced stringency. This is not to say that conditions of reduced stringency are such that non-specific binding is permitted, as reduced stringency conditions require that the binding of two sequences to one another be a specific (i.e., a selective) interaction. The absence of non-specific binding may be tested by the use of a second target sequence which lacks even a partial degree of complementarity (e.g., less than about 30% similarity or identity). In the absence of non-specific binding, the substantially similar sequence or probe will not hybridize to the second non-complementary target sequence.
The phrases "percent identity" and "% identity," as applied to polynucleotide sequences, refer to the percentage of residue matches between at least two polynucleotide sequences aligned using a standardized algorithm. Such an algorithm may insert, in a standardized and reproducible way, gaps in the sequences being compared in order to optimize alignment between two sequences, and therefore achieve a more meaningful comparison of the two sequences.
Percent identity between polynucleotide sequences may be determined using the default parameters of the CLUSTAL V algorithm as incorporated into the MEGALIGN
version 3.12e sequence alignment program. This program is part of the LASERGENE software package, a suite of molecular biological analysis programs (DNASTAR, Madison WI). CLUSTAL V is described in Higgins, D.G.
and P.M. Sharp (1989) CABIOS 5:151-153 and in Higgins, D.G. et al. (1992) CABIOS 8:189-191. For pairwise alignments of polynucleotide sequences, the default parameters are set as follows: Ktuple=2, gap penalty=5, window=4, and "diagonals saved"=4. The "weighted" residue weight table is selected as the default. Percent identity is reported by CLUSTAL V as the "percent similarity" between aligned polynucleotide sequence pairs.
Alternatively, a suite of commonly used and freely available sequence comparison algorithms is provided by the National Center for Biotechnology Information (NCBI) Basic Local Alignment Search Tool (BLAST) (Altschul, S.F. et al. (1990) J. Mol. Biol. 215:403-410), which is available from several sources, including the NCBI, Bethesda, MD, and on the Internet at http://www.ncbi.nlm.nih.gov/BLAST/. The BLAST software suite includes various sequence analysis programs including "blastn," that is used to align a known polynucleotide sequence with other polynucleotide sequences from a variety of databases. Also available is a tool called "BLAST 2 Sequences" that is used for direct pairwise comparison of two nucleotide sequences. "BLAST 2 Sequences" can be accessed and used interactively at http://www.ncbi.nlm.nih.gov/gorf/bl2.html. The "BLAST 2 Sequences" tool can be used for both blastn and blastp (discussed below). BLAST
programs are commonly used with gap and other parameters set to default settings. For example, to compare two nucleotide sequences, one may use blastn with the "BLAST 2 Sequences" tool Version 2Ø9 (May-07-1999) set at default parameters. Such default parameters may be, for example:
Matrix: BLOSUM62 Reward for match: 1 Penalty for mismatch: -2 Open Gap: 5 and Extension Gap: 2 penalties Gap x drop-off: 50 Expect: l0 Word Size: Il Filter: on Percent identity may be measured over the length of an entire defined sequence, for example, as defined by a particular SEQ ID number, or may be measured over a shorter length, for example, over the length of a fragment taken from a larger, defined sequence, for instance, a fragment of at least 20, at least 30, at least 40, at least 50, at least 70, at least 100, or at least 200 contiguous nucleotides. Such lengths are exemplary only, and it is understood that any fragment length supported by the sequences shown herein, in the tables, figures, or Sequence Listing, may be used to describe a length over which percentage identity may be measured.
Nucleic acid sequences that do not show a high degree of identity may nevertheless encode similar amino acid sequences due to the degeneracy of the genetic code. It is understood that changes in a nucleic acid sequence can be made using this degeneracy to produce multiple nucleic acid sequences that all encode substantially the same protein.
The phrases "percent identity" and "% identity," as applied to polypeptide sequences, refer to the percentage of residue matches between at least two polypeptide sequences aligned using a standardized algorithm. Methods of polypeptide sequence alignment are well-known. Some alignment methods take into account conservative amino acid substitutions. Such conservative substitutions, explained in more detail above, generally preserve the hydrophobicity and acidity at the site of substitution, thus preserving the structure (and therefore function) of the polypeptide.
Percent identity between polypeptide sequences may be determined using the default parameters of the CLUSTAL V algorithm as incorporated into the MEGALIGN
version 3.12e sequence alignment program (described and referenced above). For pairwise alignments of polypeptide sequences using CLUSTAL V, the default parameters are set as follows:
Ktuple=I, gap penalty=3, window=5, and "diagonals saved"=5. The PAM250 matrix is selected as the default residue weight table. As with polynucleotide alignments, the percent identity is reported by CLUSTAL V as the "percent similarity" between aligned polypeptide sequence pairs.
Alternatively the NCBI BLAST software suite may be used. For example, for a pairwise comparison of two polypeptide sequences, one may use the "BLAST 2 Sequences"
tool Version 2Ø9 (May-07-1999) with blastp set at default parameters. Such default parameters may be, for example:
Matrix: BLOSUM62 Open Gap: 11 and Extension Gap: 1 penalties Gap x drop-off: 50 Expect: 10 Word Size: 3 Filter: on Percent identity may be measured over the length of an entire defined polypeptide sequence, for example, as defined by a particular SEQ ID number, or may be measured over a shorter length, for example, over the length of a fragment taken from a larger, defined polypeptide sequence, for instance, a fragment of at least 15, at least 20, at least 30, at least 40, at least 50, at least 70 or at least I50 contiguous residues. Such lengths are exemplary only, and it is understood that any fragment length supported by the sequences shown herein, in the tables, figures or Sequence Listing, may be used to describe a length over which percentage identity may be measured.
"Human artificial chromosomes" (HACs) are linear microchromosomes which may contain DNA sequences of about 6 kb to 10 Mb in size, and which contain all of the elements required for stable mitotic chromosome segregation and maintenance.
The term "humanized antibody" refers to antibody molecules in which the amino acid sequence in the non-antigen binding regions has been altered so that the antibody more closely resembles a human antibody, and still retains its original binding ability.
"Hybridization" refers to the process by which a polynucleotide strand anneals with a complementary strand through base pairing under defined hybridization conditions. Specific WO 00/49043 PCT/iJS00/04160 hybridization is an indication that two nucleic acid sequences share a high degree of identity. Specific hybridization complexes form under permissive annealing conditions and remain hybridized after the "washing" step(s). The washing steps) is particularly important in determining the stringency of the hybridization process, with more stringent conditions allowing less non-specific binding, i.e., binding between pairs of nucleic acid strands that are not perfectly matched.
Permissive conditions for annealing of nucleic acid sequences are routinely determinable by one of ordinary skill in the art and may be consistent among hybridization experiments, whereas wash conditions may be varied among experiments to achieve the desired stringency, and therefore hybridization specificity. Permissive annealing conditions occur, for example, at 68°C in the presence of about 6 x SSC, about I % (w/v) SDS, and about 100 pg/ml denatured salmon sperm DNA.
Generally, stringency of hybridization is expressed, in part, with reference to the temperature under which the wash step is carried out. Generally, such wash temperatures are selected to be about 5°C to 20°C lower than the thermal melting point (Tm) for the specific sequence at a defined ionic strength and pH. The Tm is the temperature (under defined ionic strength and pH) at which 50% of the target sequence hybridizes to a perfectly matched probe. An equation for calculating Tm and conditions for nucleic acid hybridization are well known and can be found in Sambrook et al., 1989, Molecular CloningYA Laboratory Manual, 2°d ed., vol. 1-3, Cold Spring Harbor Press, Plainview NY; specifically see volume 2, chapter 9.
High stringency conditions for hybridization between polynucleotides of the present invention include wash conditions of 68°C in the presence of about 0.2 x SSC and about 0.1 % SDS, for I hour.
Alternatively, temperatures of about 65°C, 60°C, 55°C, or 42°C may be used. SSC concentration may be varied from about 0. I to 2 x SSC, with SDS being present at about 0.1 %.
Typically, blocking reagents are used to block non-specific hybridization. Such blocking reagents include, for instance, denatured salmon sperm DNA at about 100-200 pg/ml. Organic solvent, such as formamide at a concentration of about 35-50% v/v, may also be used under particular circumstances, such as for RNA:DNA hybridizations. Useful variations on these wash conditions will be readily apparent to those of ordinary skill in the art. Hybridization, particularly under high stringency conditions, may be suggestive of evolutionary similarity between the nucleotides. Such similarity is strongly indicative of a similar role for the nucleotides and their encoded polypeptides.
The term "hybridization complex" refers to a complex formed between two nucleic acid sequences by virtue of the formation of hydrogen bonds between complementary bases. A
hybridization complex may be formed in solution (e.g., Cot or Rot analysis) or formed between one nucleic acid sequence present in solution and another nucleic acid sequence immobilized on a solid support (e.g., paper, membranes, filters, chips, pins or glass slides, or any other appropriate substrate to which cells or their nucleic acids have been fixed).
The words "insertion" and "addition" refer to changes in an amino acid or nucleotide sequence resulting in the addition of one or more amino acid residues or nucleotides, respectively.
"Immune response" can refer to conditions associated with inflammation, trauma, immune disorders, or infectious or genetic disease, etc. These conditions can be characterized by expression of various factors, e.g., cytokines, chemokines, and other signaling molecules, which may affect cellular and systemic defense systems.
The term "microarray" refers to an arrangement of distinct polynucleotides on a substrate.
The terms "element" and "array element" in a microarray context, refer to hybridizable polynucleotides arranged on the surface of a substrate.
The term "modulate" refers to a change in the activity of LIPAP. For example, modulation may cause an increase or a decrease in protein activity, binding characteristics, or any other biological, functional, or immunological properties of LIPAP.
The phrases "nucleic acid" and "nucleic acid sequence" refer to a nucleotide, oligonucleotide, polynucleotide, or any fragment thereof. These phrases also refer to DNA or RNA of genomic or synthetic origin which may be single-stranded' or double-stranded and may represent the sense or the antisense strand, to peptide nucleic acid (PNA), or to any DNA-like or RNA-like material.
"Operably linked" refers to the situation in which a first nucleic acid sequence is placed in a functional relationship with the second nucleic acid sequence. For instance, a promoter is operably linked to a coding sequence if the promoter affects the transcription or expression of the coding sequence. Generally, operably linked DNA sequences may be in close proximity or contiguous and, where necessary to join two protein coding regions, in the same reading frame.
"Peptide nucleic acid" (PNA) refers to an antisense molecule or anti-gene agent which comprises an oligonucleotide of at least about 5 nucleotides in length linked to a peptide backbone of amino acid residues ending in lysine. The terminal lysine confers solubility to the composition. PNAs preferentially bind complementary single stranded DNA or RNA and stop transcript elongation, and may be pegylated to extend their lifespan in the cell.
"Probe" refers to nucleic acid sequences encoding LIPAP, their complements, or fragments thereof, which are used to detect identical, allelic or related nucleic acid sequences. Probes are isolated oligonucleotides or polynucleotides attached to a detectable label or reporter molecule. Typical labels include radioactive isotopes, ligands, chemiluminescent agents, and enzymes.
"Primers" are short nucleic acids, usually DNA oligonucleotides, which may be annealed to a target polynucleotide by complementary base-pairing. The primer may then be extended along the target DNA strand by a DNA
polymerase enzyme. Primer pairs can be used for amplification (and identification) of a nucleic acid sequence, e.g., by the polymerase chain reaction (PCR).
Probes and primers as used in the present invention typically comprise at least IS contiguous nucleotides of a known sequence. In order to enhance specificity, longer probes and primers may also be employed, such as probes and primers that comprise at least 20, 25, 30, 40, 50, 60, 70, 80, 90, 100, or at least 150 consecutive nucleotides of the disclosed nucleic acid sequences. Probes and primers may be considerably longer than these examples, and it is understood that any length supported by the specification, including the tables, figures, and Sequence Listing, may be used.
Methods for preparing and using probes and primers are described in the references, for example Sambrook et al., 1989, Molecular Cloning: A Laboratory Manual, 2"d ed., vol. 1-3, Cold Spring Harbor Press, Plainview NY; Ausubel et a1.,1987, Current Protocols in Molecular Biolo~y, Greene Publ. Assoc. & Wiley-Intersciences, New York NY; Innis et al., 1990, PCR Protocols, A Guide to Methods and Applications, Academic Press, San Diego CA. PCR primer pairs can be derived from a known sequence, for example, by using computer programs intended for that purpose such as Primer (Version 0.5, 1991, Whitehead Institute for Biomedical Research, Cambridge MA).
Oligonucleotides for use as primers are selected using software known in the art for such purpose. For example, OLIGO 4.06 software is useful for the selection of PCR
primer pairs of up to 100 nucleotides each, and for the analysis of oligonucleotides and larger polynucleotides of up to 5,000 nucleotides from an input polynucleotide sequence of up to 32 kilobases.
Similar primer selection programs have incorporated additional features for expanded capabilities. For example, the PrimOU
primer selection program (available to the public from the Genome Center at University of Texas South West Medical Center, Dallas TX) is capable of choosing specific primers from megabase sequences and is thus useful for designing primers on a genome-wide scope. The Primer3 primer selection program (available to the public from the Whitehead Institute/MIT Center for Genome Research, Cambridge MA) allows the user to input a "mispriming library," in which sequences to avoid as primer binding sites are user-specified. Primer3 is useful, in particular, for the selection of oligonucleotides for microarrays. (The source code for the latter two primer selection programs may also be obtained from their respective sources and modified to meet the user's specific needs.) The PrimeGen program (available to the public from the UK Human Genome Mapping Project Resource Centre, Cambridge UK) designs primers based on multiple sequence alignments, thereby allowing selection of primers that hybridize to either the most conserved or least conserved regions of aligned nucleic acid sequences.
Hence, this program is useful for identification of both unique and conserved oligonucleotides and polynucleotide fragments. The oligonucleotides and polynucleotide fragments identified by any of the above selection methods are useful in hybridization technologies, for example, as PCR or sequencing primers, microarray elements, or specific probes to identify fully or partially complementary polynucleotides in a sample of nucleic acids. Methods of oligonucleotide selection are not limited to those described above.
A "recombinant nucleic acid" is a sequence that is not naturally occurring or has a sequence that is made by an artificial combination of two or more otherwise separated segments of sequence.
This artificial combination is often accomplished by chemical synthesis or, more commonly, by the artificial manipulation of isolated segments of nucleic acids, e.g., by genetic engineering techniques such as those described in Sambrook, supra. The term recombinant includes nucleic acids that have been altered solely by addition, substitution, or deletion of a portion of the nucleic acid. Frequently, a recombinant nucleic acid may include a nucleic acid sequence operably linked to a promoter sequence.
Such a recombinant nucleic acid may be part of a vector that is used, for example, to transform a cell.
Alternatively, such recombinant nucleic acids may be part of a viral vector, e.g., based on a vaccinia virus, that could be use to vaccinate a mammal wherein the recombinant nucleic acid is expressed, inducing a protective immunological response in the mammal.
An "RNA equivalent," in reference to a DNA sequence, is composed of the same linear sequence of nucleotides as the reference DNA sequence with the exception that all occurrences of the nitrogenous base thymine are replaced with uracil, and the sugar backbone is composed of ribose instead of deoxyribose.
The term "sample" is used in its broadest sense. A sample suspected of containing nucleic acids encoding LIPAP, or fragments thereof, or LIPAP itself, may comprise a bodily fluid; an extract from a cell, chromosome, organelle, or membrane isolated from a cell; a cell;
genomic DNA, RNA, or cDNA, in solution or bound to a substrate; a tissue; a tissue print; etc.
The terms "specific binding" and "specifically binding" refer to that interaction between a protein or peptide and an agonist, an antibody, an antagonist, a small molecule, or any natural or synthetic binding composition. The interaction is dependent upon the presence of a particular structure of the protein, e.g., the antigenic determinant or epitope, recognized by the binding molecule. For example, if an antibody is specific for epitope "A," the presence of a polypeptide containing the epitope A, or the presence of free unlabeled A, in a reaction containing free labeled A and the antibody will reduce the amount of labeled A that binds to the antibody.
The term "substantially purified" refers to nucleic acid or amino acid sequences that are removed from their natural environment and are isolated or separated, and are at least 60% free, preferably at least 75% free, and most preferably at least 90% free from other components with which they are naturally associated.
A "substitution" refers to the replacement of one or more amino acids or nucleotides by different amino acids or nucleotides, respectively.
"Substrate" refers to any suitable rigid or semi-rigid support including membranes, filters, chips, slides, wafers, fibers, magnetic or nonmagnetic beads, gels, tubing, plates, polymers, microparticles and capillaries. The substrate can have a variety of surface forms, such as wells, trenches, pins, channels and pores, to which polynucleotides or polypeptides are bound.
"Transformation" describes a process by which exogenous DNA enters and changes a recipient cell. Transformation may occur under natural or artificial conditions according to various methods well known in the art, and may rely on any known method for the insertion of foreign nucleic acid sequences into a prokaryotic or eukaryotic host cell. The method for transformation is selected based on the type of host cell being transformed and may include, but is not limited to, viral infection, electroporation, heat shock, lipofection, and particle bombardment. The term "transformed"
cells includes stably transformed cells in which the inserted DNA is capable of replication either as an autonomously replicating plasmid or as part of the host chromosome, as well as transiently transformed cells which express the inserted DNA or RNA for limited periods of time.
A "transgenic organism," as used herein, is any organism, including but not limited to animals and plants, in which one or more of the cells of the organism contains heterologous nucleic acid introduced by way of human intervention, such as by transgenic techniques well known in the art. The nucleic acid is introduced into the cell, directly or indirectly by introduction into a precursor of the cell, by way of deliberate genetic manipulation, such as by microinjection or by infection with a recombinant virus. The term genetic manipulation does not include classical cross-breeding, or in vitro fertilization, but rather is directed to the introduction of a recombinant DNA
molecule. The transgenic organisms contemplated in accordance with the present invention include bacteria, cyanobacteria, fungi, and plants and animals. The isolated DNA of the present invention can be introduced into the host by methods known in the art, for example infection, transfection, transformation or transconjugation. Techniques for transferring the DNA of the present invention into such organisms are widely known and provided in references such as Sambrook et al. (1989), supra.
A "variant" of a particular nucleic acid sequence is defined as a nucleic acid sequence having at least 40% sequence identity to the particular nucleic acid sequence over a certain length of one of the nucleic acid sequences using blastn with the "BLAST 2 Sequences" tool Version 2Ø9 (May-07-1999) set at default parameters. Such a pair of nucleic acids may show, for example, at least 50%, at least 60%, at least 70%, at least 80%, at least 85%, at least 90%, at least 95% or at least 98% or greater sequence identity over a certain defined length. A variant may be described as, for example, an "allelic" (as defined above), "splice," "species," or "polymorphic" variant. A
splice variant may have significant identity to a reference molecule, but will generally have a greater or lesser number of polynucleotides due to alternate splicing of exons during mRNA processing. The corresponding polypeptide may possess additional functional domains or lack domains that are present in the reference molecule. Species variants are polynucleotide sequences that vary from one species to another. The resulting polypeptides generally will have significant amino acid identity relative to each other. A
polymorphic variant is a variation in the polynucleotide sequence of a particular gene between individuals of a given species. Polymorphic variants also may encompass "single nucleotide polymorphisms" (SNPs) in which the polynucleotide sequence varies by one nucleotide base. The presence of SNPs may be indicative of, for example, a certain population, a disease state, or a propensity for a disease state.
A "variant" of a particular polypeptide sequence is defined as a polypeptide sequence having at least 40% sequence identity to the particular polypeptide sequence over a certain length of one of the polypeptide sequences using blastp with the "BLAST 2 Sequences" tool Version 2Ø9 (May-07-1999) set at default parameters. Such a pair of polypeptides may show, for example, at least 50%, at least 60%, at least 70%, at least 80%, at least 90%, at least 95%, or at least 98%
or greater sequence identity over a certain defined length of one of the polypeptides.
THE INVENTION
The invention is based on the discovery of new human lipid-associated proteins (LIPAP), the polynucleotides encoding LIPAP, and the use of these compositions for the diagnosis, treatment, or prevention of cardiovascular, neurological, and gastrointestinal disorders, and disorders of lipid metabolism.
Table 1 lists the Incyte clones used to assemble full length nucleotide sequences encoding LIPAP. Columns I and 2 show the sequence identification numbers (SEQ ID NOs) of the polypeptide and nucleotide sequences, respectively. Column 3 shows the clone IDs of the Incyte clones in which nucleic acids encoding each LIPAP were identified, and column 4 shows the cDNA
libraries from which these clones were isolated. Column 5 shows Incyte clones and their corresponding cDNA
libraries. Clones for which cDNA libraries are not indicated were derived from pooled cDNA libraries.
The Incyte clones in column 5 were used to assemble the consensus nucleotide sequence of each LIPAP
and are useful as fragments in hybridization technologies.
The columns of Table 2 show various properties of each of the polypeptides of the invention:
column I references the SEQ ID NO; column 2 shows the number of amino acid residues in each polypeptide; column 3 shows potential phosphorylation sites; column 4 shows potential glycosylation sites; column 5 shows the amino acid residues comprising signature sequences and motifs; column 6 shows homologous sequences as identified by BLAST analysis and the identity of each polypeptide;
and column 7 shows analytical methods and in some cases, searchable databases to which the analytical methods were applied. The methods of column 7 were used to characterize each polypeptide through sequence homology and protein motifs.
The columns of Table 3 show the tissue-specificity and diseases, disorders, or conditions associated with nucleotide sequences encoding LIPAP. The first column of Table 3 lists the nucleotide SEQ ID NOs. Column 2 lists fragments of the nucleotide sequences of column 1.
These fragments are useful, for example, in hybridization or amplification technologies to identify SEQ ID N0:13-24 and to distinguish between SEQ ID N0:13-24 and related polynucleotide sequences. The polypeptides encoded by these fragments are useful, for example, as immunogenic peptides.
Column 3 lists tissue categories which express LIPAP as a fraction of total tissues expressing LIPAP. Column 4 lists diseases, disorders, or conditions associated with those tissues expressing LIPAP as a fraction of total tissues expressing LIPAP. Column 5 lists the vectors used to subclone each cDNA library.
The columns of Table 4 show descriptions of the tissues used to construct the cDNA libraries from which cDNA clones encoding LIPAP were isolated. Column I references the nucleotide SEQ ID
NOs, column 2 shows the cDNA libraries from which these clones were isolated, and column 3 shows the tissue origins and other descriptive information relevant to the cDNA
libraries in column 2.
SEQ ID N0:21 maps to chromosome 11 within the interval from 92.5 to 96.3 centiMorgans.
This interval also contains a gene encoding a G-protein coupled receptor associated with epilepsy.
The invention also encompasses LIPAP variants. A preferred LIPAP variant is one which has at least about 80%, or alternatively at least about 90%, or even at least about 95% amino acid sequence identity to the LIPAP amino acid sequence, and which contains at least one functional or structural characteristic of LIPAP.
The invention also encompasses polynucleotides which encode LIPAP. In a particular embodiment, the invention encompasses a polynucleotide sequence comprising a sequence selected from the group consisting of SEQ ID N0:13-24, which encodes LIPAP. The polynucleotide sequences of SEQ ID N0:13-24, as presented in the Sequence Listing, embrace the equivalent RNA sequences, wherein occurrences of the nitrogenous base thymine are replaced with uracil, and the sugar backbone is composed of ribose instead of deoxyribose.
The invention also encompasses a variant of a polynucleotide sequence encoding LIPAP. In particular, such a variant polynucleotide sequence will have at least about 80%, or alternatively at least about 90%, or even at least about 95% polynucleotide sequence identity to the polynucleotide sequence encoding LIPAP. A particular aspect of the invention encompasses a variant of a polynucleotide sequence comprising a sequence selected from the group consisting of SEQ ID
N0:13-24 which has at least about 80%, or alternatively at least about 90%, or even at least about 95% polynucleotide sequence identity to a nucleic acid sequence selected from the group consisting of SEQ ID N0:13-24.
Any one of the polynucleotide variants described above can encode an amino acid sequence which contains at least one functional or structural characteristic of LIPAP.
It will be appreciated by those skilled in the art that as a result of the degeneracy of the genetic code, a multitude of polynucleotide sequences encoding LIPAP, some bearing minimal similarity to the polynucleotide sequences of any known and naturally occurring gene, may be produced. Thus, the invention contemplates each and every possible variation of polynucleotide sequence that could be made by selecting combinations based on possible codon choices. These combinations are made in accordance with the standard triplet genetic code as applied to the polynucleotide sequence of naturally occurring LIPAP, and all such variations are to be considered as being specifically disclosed.
Although nucleotide sequences which encode LIPAP and its variants are generally capable of hybridizing to the nucleotide sequence of the naturally occurring LIPAP under appropriately selected conditions of stringency, it may be advantageous to produce nucleotide sequences encoding LIPAP or its derivatives possessing a substantially different codon usage, e.g., inclusion of non-naturally occurring codons. Codons may be selected to increase the rate at which expression of the peptide occurs in a particular prokaryotic or eukaryotic host in accordance with the frequency with which particular codons are utilized by the host. Other reasons for substantially altering the nucleotide sequence encoding LIPAP and its derivatives without altering the encoded amino acid sequences include the production of RNA transcripts having more desirable properties, such as a greater half-life, than transcripts produced from the naturally occurring sequence.
The invention also encompasses production of DNA sequences which encode LIPAP
and LIPAP derivatives, or fragments thereof, entirely by synthetic chemistry.
After production, the synthetic sequence may be inserted into any of the many available expression vectors and cell systems using reagents well known in the art. Moreover, synthetic chemistry may be used to introduce mutations into a sequence encoding LIPAP or any fragment thereof.
Also encompassed by the invention are polynucleotide sequences that are capable of hybridizing to the claimed polynucleotide sequences, and, in particular, to those shown in SEQ ID
N0:13-24 and fragments thereof under various conditions of stringency. (See, e.g., Wahl, G.M. and S.L. Berger (1987) Methods Enzymol. 152:399-407; Kimmel, A.R. (1987) Methods Enzymol. 152:507-511.) Hybridization conditions, including annealing and wash conditions, are described in "Definitions."
Methods for DNA sequencing are well known in the art and may be used to practice any of the embodiments of the invention. The methods may employ such enzymes as the Klenow fragment of DNA polymerise I, SEQUENASE (US Biochemical, Cleveland OH), Taq polymerise (Perkin-Elmer), thermostable T7 polymerise (Amersham Pharmacia Biotech, Piscataway NJ), or combinations of polymerises and proofreading exonucleases such as those found in the ELONGASE
amplification system (Life Technologies, Gaithersburg MD). Preferably, sequence preparation is automated with machines such as the MICROLAB 2200 liquid transfer system (Hamilton, Reno NV), PTC200 thermal cycler (MJ Research, Watertown MA) and ABI CATALYST 800 thermal cycler (Perkin-Elmer).
Sequencing is then carried out using either the ABI 373 or 377 DNA sequencing system (Perkin-Elmer), the MEGABACE 1000 DNA sequencing system (Molecular Dynamics, Sunnyvale CA), or other systems known in the art. The resulting sequences are analyzed using a variety of algorithms which are well known in the art. (See, e.g., Ausubel, F.M. (1997) Short Protocols in Molecular BioloQV, John Wiley & Sons, New York NY, unit 7.7; Meyers, R.A. (1995) Molecular Biolo~y and CA 02362300 2001-07-31 PCT~jS00/04160 Biotechnolo~y, Wiley VCH, New York NY, pp. 856-853.) The nucleic acid sequences encoding LIPAP may be extended utilizing a partial nucleotide sequence and employing various PCR-based methods known in the art to detect upstream sequences, such as promoters and regulatory elements. For example, one method which may be employed, restriction-site PCR, uses universal and nested primers to amplify unknown sequence from genomic DNA within a cloning vector. (See, e.g., Sarkar, G. (1993) PCR Methods Applic.
2:318-322.) Another method, inverse PCR, uses primers that extend in divergent directions to amplify unknown sequence from a circularized template. The template is derived from restriction fragments comprising a known genomic locus and surrounding sequences. (See, e.g., Triglia, T. et al. (1988) Nucleic Acids Res.
16:8186.) A third method, capture PCR, involves PCR amplification of DNA
fragments adjacent to known sequences in human and yeast artificial chromosome DNA. (See, e.g., Lagerstrom, M. et al.
(1991) PCR Methods Applic. 1:11 I-119.) In this method, multiple restriction enzyme digestions and ligations may be used to insert an engineered double-stranded sequence into a region of unknown sequence before performing PCR. Other methods which may be used to retrieve unknown sequences are known in the art. (See, e.g., Parker, J.D. et al. (1991) Nucleic Acids Res. 19:3055-3060).
Additionally, one may use PCR, nested primers, and PROMOTERFINDER libraries (Clontech, Palo Alto CA) to walk genomic DNA. This procedure avoids the need to screen libraries and is useful in finding intron/exon junctions. For all PCR-based methods, primers may be designed using commercially available software, such as OLIGO 4.06 Primer Analysis software (National Biosciences, Plymouth MN) or another appropriate program, to be about 22 to 30 nucleotides in length, to have a GC content of about 50% or more, and to anneal to the template at temperatures of about 68°C to 72°C.
When screening for full-length cDNAs, it is preferable to use libraries that have been size-selected to include larger cDNAs. In addition, random-primed libraries, which often include sequences containing the 5' regions of genes, are preferable for_situations in which an oligo d(T) library does not yield a full-length cDNA. Genomic libraries may be useful for extension of sequence into 5' non-transcribed regulatory regions.
Capillary electrophoresis systems which are commercially available may be used to analyze the size or confirm the nucleotide sequence of sequencing or PCR products. In particular, capillary sequencing may employ flowable polymers for electrophoretic separation, four different nucleotide-specific, laser-stimulated fluorescent dyes, and a charge coupled device camera for detection of the emitted wavelengths. Output/light intensity may be converted to electrical signal using appropriate software (e.g., GENOTYPER and SEQUENCE NAVIGATOR, Perkin-Elmer), and the entire process from loading of samples to computer analysis and electronic data display may be computer controlled.
Capillary electrophoresis is especially preferable for sequencing small DNA
fragments which may be present in limited amounts in a particular sample.
CA 02362300 2001-07-31 pCT~S00/04160 In another embodiment of the invention, polynucleotide sequences or fragments thereof which encode LIPAP may be cloned in recombinant DNA molecules that direct expression of LIPAP, or fragments or functional equivalents thereof, in appropriate host cells. Due to the inherent degeneracy of the genetic code, other DNA sequences which encode substantially the same or a functionally equivalent amino acid sequence may be produced and used to express LIPAP.
The nucleotide sequences of the present invention can be engineered using methods generally known in the art in order to alter LIPAP-encoding sequences for a variety of purposes including, but not limited to, modification of the cloning, processing, and/or expression of the gene product. DNA
shuffling by random fragmentation and PCR reassembly of gene fragments and synthetic oligonucleotides may be used to engineer the nucleotide sequences. For example, oligonucleotide-mediated site-directed mutagenesis may be used to introduce mutations that create new restriction sites, alter glycosylation patterns, change codon preference, produce splice variants, and so forth.
The nucleotides of the present invention may be subjected to DNA shuffling techniques such as MOLECULARBREEDING (Maxygen Inc., Santa Clara CA; described in U.S. Patent Number 5,837,458; Chang, C.-C. et al. (1999) Nat. Biotechnol. 17:793-797; Christians, F.C. et al. (1999) Nat.
Biotechnol. 17:259-264; and Crameri, A. et al. (1996) Nat. Biotechnol. 14:315-319) to alter or improve the biological properties of LIPAP, such as its biological or enzymatic activity or its ability to bind to other molecules or compounds. DNA shuffling is a process by which a library of gene variants is produced using PCR-mediated recombination of gene fragments. The library is then subjected to selection or screening procedures that identify those gene variants with the desired properties. These preferred variants may then be pooled and further subjected to recursive rounds of DNA shuffling and selection/screening. Thus, genetic diversity is created through "artificial"
breeding and rapid molecular evolution. For example, fragments of a single gene containing random point mutations may be recombined, screened, and then reshuffled until the desired properties are optimized. Alternatively, fragments of a given gene may be recombined with fragments of homologous genes in the same gene family, either from the same or different species, thereby maximizing the genetic diversity of multiple naturally occurring genes in a directed and controllable manner.
In another embodiment, sequences encoding LIPAP may be synthesized, in whole or in part, using chemical methods well known in the art. (See, e.g., Caruthers, M.H. et al. ( 1980) Nucleic Acids Symp. Ser. 7:215-223; and Horn, T. et al. (1980) Nucleic Acids Symp. Ser.
7:225-232.) Alternatively, LIPAP itself or a fragment thereof may be synthesized using chemical methods.
For example, peptide synthesis can be performed using various solid-phase techniques. (See, e.g., Roberge, J.Y. et al. (1995) Science 269:202-204.) Automated synthesis may be achieved using the ABI 431A
peptide synthesizer (Perkin-Elmer). Additionally, the amino acid sequence of LIPAP, or any part thereof, may be altered during direct synthesis and/or combined with sequences from other proteins, or any part thereof, to CA 02362300 2001-07-31 PCT~jS00/04160 produce a variant polypeptide.
The peptide may be substantially purified by preparative high performance liquid chromatography. (See, e.g., Chiez, R.M. and F.Z. Regnier ( 1990) Methods Enzymol. 182:392-421.) The composition of the synthetic peptides may be confirmed by amino acid analysis or by sequencing.
(See, e.g., Creighton, T. (1984) Proteins, Structures and Molecular Properties, WH Freeman, New York NY.) In order to express a biologically active LIPAP, the nucleotide sequences encoding LIPAP or derivatives thereof may be inserted into an appropriate expression vector, i.e., a vector which contains the necessary elements for transcriptional and translational control of the inserted coding sequence in a suitable host. These elements include regulatory sequences, such as enhancers, constitutive and inducible promoters, and 5' and 3' untranslated regions in the vector and in polynucleotide sequences encoding LIPAP. Such elements may vary in their strength and specificity.
Specific initiation signals may also be used to achieve more efficient translation of sequences encoding LIPAP. Such signals include the ATG initiation codon and adjacent sequences, e.g. the Kozak sequence. In cases where sequences encoding LIPAP and its initiation codon and upstream regulatory sequences are inserted into the appropriate expression vector, no additional transcriptional or translational control signals may be needed. However, in cases where only coding sequence, or a fragment thereof, is inserted, exogenous translational control signals including an in-frame ATG initiation codon should be provided by the vector. Exogenous translational elements and initiation codons may be of various origins, both natural and synthetic. The efficiency of expression may be enhanced by the inclusion of enhancers appropriate for the particular host cell system used. (See, e.g., Scharf, D. et al. ( 1994) Results Probl. Cell Differ.
20:125-162.) Methods which are well known to those skilled in the art may be used to construct expression vectors containing sequences encoding LIPAP and appropriate transcriptional and translational control elements. These methods include in vitro recombinant DNA techniques, synthetic techniques, and in vivo genetic recombination. (See, e.g., Sambrook, J. et al. (1989) Molecular Cloning, A Laboratory Manual, Cold Spring Harbor Press, Plainview NY, ch. 4, 8, and 16-17; Ausubel, F.M. et al. (1995) Current Protocols in Molecular Biolo~y, John Wiley & Sons, New York NY, ch. 9, 13, and 16.) A variety of expression vector/host systems may be utilized to contain and express sequences encoding LIPAP. These include, but are not limited to, microorganisms such as bacteria transformed with recombinant bacteriophage, plasmid, or cosmid DNA expression vectors;
yeast transformed with yeast expression vectors; insect cell systems infected with viral expression vectors (e.g., baculovirus);
plant cell systems transformed with viral expression vectors (e.g., cauliflower mosaic virus, CaMV, or tobacco mosaic virus, TMV) or with bacterial expression vectors (e.g., Ti or pBR322 plasmids); or animal cell systems. The invention is not limited by the host cell employed.
In bacterial systems, a number of cloning and expression vectors may be selected depending upon the use intended for polynucleotide sequences encoding LIPAP. For example, routine cloning, subcloning, and propagation of polynucleotide sequences encoding LIPAP can be achieved using a multifunctional E. coli vector such as PBLUESCRIPT (Stratagene, La Jolla CA) or PSPORT1 plasmid (Life Technologies). Ligation of sequences encoding LIPAP into the vector's multiple cloning site disrupts the lacZ gene, allowing a colorimetric screening procedure for identification of transformed bacteria containing recombinant molecules. In addition, these vectors may be useful for in vitro transcription, dideoxy sequencing, single strand rescue with helper phage, and creation of nested deletions in the cloned sequence. (See, e.g., Van Heeke, G. and S.M. Schuster (1989) J. Biol. Chem.
264:5503-5509.) When large quantities of LIPAP are needed, e.g. for the production of antibodies, vectors which direct high level expression of LIPAP may be used. For example, vectors containing the strong, inducible TS or T7 bacteriophage promoter may be used.
Yeast expression systems may be used for production of LIPAP. A number of vectors containing constitutive or inducible promoters, such as alpha factor, alcohol oxidase, and PGH
promoters, may be used in the yeast Saccharomyces cerevisiae or Pichia pastoris. In addition, such vectors direct either the secretion or intracellular retention of expressed proteins and enable integration of foreign sequences into the host genome for stable propagation. (See, e.g., Ausubel, 1995, supra;
Bitter, G.A. et al. ( 1987) Methods Enzymol. 153:516-544; and Scorer, C.A. et al. ( 1994) Bio/Technology 12:181-184.) Plant systems may also be used for expression of LIPAP. Transcription of sequences encoding LIPAP may be driven viral promoters, e.g., the 35S and 19S promoters of CaMV
used alone or in combination with the omega leader sequence from TMV (Takamatsu, N. (1987) EMBO
J. 6:307-311).
Alternatively, plant promoters such as the small subunit of RUBISCO or heat shock promoters may be used. (See, e.g., Coruzzi, G. et al. (1984) EMBO J. 3:1671-1680; Brogue, R. et al. (1984) Science 224:838-843; and Winter, J. et al. (1991) Results Probl. Cell Differ. 17:85-105.) These constructs can be introduced into plant cells by direct DNA transformation or pathogen-mediated transfection. (See, e.g., The McGraw Hill Yearbook of Science and Technolo~y (1992) McGraw Hill, New York NY, pp.
191-196.) In mammalian cells, a number of viral-based expression systems may be utilized. In cases where an adenovirus is used as an expression vector, sequences encoding LIPAP
may be ligated into an adenovirus transcription/translation complex consisting of the late promoter and tripartite leader sequence. Insertion in a non-essential E1 or E3 region of the viral genome may be used to obtain infective virus which expresses LIPAP in host cells. (See, e.g., Logan, J. and T. Shenk (1984) Proc.
Natl. Acad. Sci. USA 81:3655-3659.) In addition, transcription enhancers, such as the Rous sarcoma virus (RSV) enhancer, may be used to increase expression in mammalian host cells. SV40 or EBV-based vectors may also be used for high-level protein expression.
Human artificial chromosomes (HACs) may also be employed to deliver larger fragments of DNA than can be contained in and expressed from a plasmid. HACs of about 6 kb to 10 Mb are constructed and delivered via conventional delivery methods (liposomes, polycationic amino polymers, or vesicles) for therapeutic purposes. (See, e.g., Harrington, J.J. et al. ( 1997) Nat. Genet. 15:345-355.) For long term production of recombinant proteins in mammalian systems, stable expression of LIPAP in cell lines is preferred. For example, sequences encoding LIPAP can be transformed into cell lines using expression vectors which may contain viral origins of replication and/or endogenous expression elements and a selectable marker gene on the same or on a separate vector. Following the introduction of the vector, cells may be allowed to grow for about 1 to 2 days in enriched media before being switched to selective media. The purpose of the selectable marker is to confer resistance to a selective agent, and its presence allows growth and recovery of cells which successfully express the introduced sequences. Resistant clones of stably transformed cells may be propagated using tissue culture techniques appropriate to the cell type.
Any number of selection systems may be used to recover transformed cell lines.
These include, but are not limited to, the herpes simplex virus thymidine kinase and adenine phosphoribosyltransferase genes, for use in tk- and apr= cells, respectively. (See, e.g., Wigler, M. et al. ( 1977) Cell 11:223-232;
Lowy, I. et al. (1980) Cell 22:817-823.) Also, antimetabolite, antibiotic, or herbicide resistance can be used as the basis for selection. For example, dhfr confers resistance to methotrexate; neo confers resistance to the aminoglycosides neomycin and G-418; and als and pat confer resistance to chlorsulfuron and phosphinotricin acetyltransferase, respectively. (See, e.g., Wigler, M. et al. (1980) Proc. Natl. Acad. Sci. USA 77:3567-3570; Colbere-Garapin, F. et al. (1981) J.
Mol. Biol. 150:1-14.) Additional selectable genes have been described, e.g., trpB and hisD, which alter cellular requirements for metabolites. (See, e.g., Hartman, S.C. and R.C. Mulligan (1988) Proc.
Natl. Acad. Sci. USA
85:8047-8051.) Visible markers, e.g., anthocyanins, green fluorescent proteins (GFP; Clontech),13 glucuronidase and its substrate a-glucuronide, or luciferase and its substrate luciferin may be used.
These markers can be used not only to identify transformants, but also to quantify the amount of transient or stable protein expression attributable to a specific vector system. (See, e.g., Rhodes, C.A.
(1995) Methods Mol. Biol. 55:121-131.) Although the presence/absence of marker gene expression suggests that the gene of interest is also present, the presence and expression of the gene may need to be confirmed. For example, if the sequence encoding LIPAP is inserted within a marker gene sequence, transformed cells containing sequences encoding LIPAP can be identified by the absence of marker gene function. Alternatively, a marker gene can be placed in tandem with a sequence encoding LIPAP under the control of a single promoter. Expression of the marker gene in response to induction or selection usually indicates expression of the tandem gene as well.
In general, host cells that contain the nucleic acid sequence encoding LIPAP
and that express LIPAP may be identified by a variety of procedures known to those of skill in the art. These procedures include, but are not limited to, DNA-DNA or DNA-RNA hybridizations, PCR
amplification, and protein bioassay or immunoassay techniques which include membrane, solution, or chip based technologies for the detection and/or quantification of nucleic acid or protein sequences.
Immunological methods for detecting and measuring the expression of LIPAP
using either specific polyclonal or monoclonal antibodies are known in the art. Examples of such techniques include enzyme-linked immunosorbent assays (ELISAs), radioimmunoassays (RIAs), and fluorescence activated cell sorting (FAGS). A two-site, monoclonal-based immunoassay utilizing monoclonal antibodies reactive to two non-interfering epitopes on LIPAP is preferred, but a competitive binding assay may be employed. These and other assays are well known in the art. (See, e.g., Hampton, R. et al. (1990) Serological Methods, a Laboratory Manual, APS Press, St. Paul MN, Sect. IV; Coligan, J.E.
et al. (1997) Current Protocols in Immunolo~y, Greene Pub. Associates and Wiley-Interscience, New York NY; and Pound, J.D. ( 1998) Immunochemical Protocols, Humana Press, Totowa NJ.) A wide variety of labels and conjugation techniques are known by those skilled in the art and may be used in various nucleic acid and amino acid assays. Means for producing labeled hybridization or PCR probes for detecting sequences related to polynucleotides encoding LIPAP include oligolabeling, nick translation, end-labeling, or PCR amplification using a labeled nucleotide.
Alternatively, the sequences encoding LIPAP, or any fragments thereof, may be cloned into a vector for the production of an mRNA probe. Such vectors are known in the art, are commercially available, and may be used to synthesize RNA probes in vitro by addition of an appropriate RNA polymerase such as T7, T3, or SP6 and labeled nucleotides. These procedures may be conducted using a variety of commercially available kits, such as those provided by Amersham Pharmacia Biotech, Promega (Madison WI), and US Biochemical. Suitable reporter molecules or labels which may be used for ease of detection include radionuclides, enzymes, fluorescent, chemiluminescent, or chromogenic agents, as well as substrates, cofactors, inhibitors, magnetic particles, and the like.
Host cells transformed with nucleotide sequences encoding LIPAP may be cultured under conditions suitable for the expression and recovery of the protein from cell culture. The protein produced by a transformed cell may be secreted or retained intracellularly depending on the sequence and/or the vector used. As will be understood by those of skill in the art, expression vectors containing polynucleotides which encode LIPAP may be designed to contain signal sequences which direct secretion of LIPAP through a prokaryotic or eukaryotic cell membrane.
In addition, a host cell strain may be chosen for its ability to modulate expression of the inserted sequences or to process the expressed protein in the desired fashion.
Such modifications of the polypeptide include, but are not limited to, acetylation, carboxylation, glycosylation, phosphorylation, lipidation, and acylation. Post-translational processing which cleaves a "prepro" or "pro" form of the protein may also be used to specify protein targeting, folding, and/or activity. Different host cells which have specific cellular machinery and characteristic mechanisms for post-translational activities (e.g., CHO, HeLa, MDCK, HEK293, and WI38) are available from the American Type Culture Collection (ATCC, Manassas VA) and may be chosen to ensure the correct modification and processing of the foreign protein.
In another embodiment of the invention, natural, modified, or recombinant nucleic acid sequences encoding LIPAP may be ligated to a heterologous sequence resulting in translation of a fusion protein in any of the aforementioned host systems. For example, a chimeric LIPAP protein containing a heterologous moiety that can be recognized by a commercially available antibody may facilitate the screening of peptide libraries for inhibitors of LIPAP
activity. Heterologous protein and peptide moieties may also facilitate purification of fusion proteins using commercially available affinity matrices. Such moieties include, but are not limited to, glutathione S-transferase (GST), maltose binding protein (MBP), thioredoxin (Trx), calmodulin binding peptide (CBP), 6-His, FLAG, c-myc, and hemagglutinin (HA). GST, MBP, Trx, CBP, and 6-His enable purification of their cognate fusion proteins on immobilized glutathione, maltose, phenylarsine oxide, calmodulin, and metal-chelate resins, respectively. FLAG, c-myc, and hemagglutinin (HA) enable immunoaffinity purification of fusion proteins using commercially available monoclonal and polyclonal antibodies that specifically recognize these epitope tags. A fusion protein may also be engineered to contain a proteolytic cleavage site located between the LIPAP encoding sequence and the heterologous protein sequence, so that LIPAP may be cleaved away from the heterologous moiety following purification.
Methods for fusion protein expression and purification are discussed in Ausubel (1995, supra, ch.
10). A variety of commercially available kits may also be used to facilitate expression and purification of fusion proteins.
In a further embodiment of the invention, synthesis of radiolabeled LIPAP may be achieved in vitro using the TNT rabbit reticulocyte lysate or wheat germ extract system (Promega). These systems couple transcription and translation of protein-coding sequences operably associated with the T7, T3, or SP6 promoters. Translation takes place in the presence of a radiolabeled amino acid precursor, for example, 35S-methionine.
Fragments of LIPAP may be produced not only by recombinant means, but also by direct peptide synthesis using solid-phase techniques. (See, e.g., Creighton, supra, pp. 55-60.) Protein synthesis may be performed by manual techniques or by automation. Automated synthesis may be achieved, for example, using the ABI 431A peptide synthesizer (Perkin-Elmer).
Various fragments of LIPAP may be synthesized separately and then combined to produce the full length molecule.
THERAPEUTICS
Chemical and structural similarity, e.g., in the context of sequences and motifs, exists between regions of LIPAP and human lipid-associated proteins. In addition, the expression of LIPAP is closely associated with cardiovascular and gastrointestinal tissues, and tissues of the nervous system.
Therefore, LIPAP appears to play a role in cardiovascular, neurological, and gastrointestinal disorders, and disorders of lipid metabolism. In the treatment of disorders associated with increased LIPAP
expression or activity, it is desirable to decrease the expression or activity of LIPAP. In the treatment of disorders associated with decreased LIPAP expression or activity, it is desirable to increase the expression or activity of LIPAP.
Therefore, in one embodiment, LIPAP or a fragment or derivative thereof may be administered to a subject to treat or prevent a disorder associated with decreased expression or activity of LIPAP.
Examples of such disorders include, but are not limited to, a cardiovascular disorder including blood vessel disorders such as arteriovenous fistula, atherosclerosis, hypertension, vasculitis, Raynaud's disease, aneurysms, arterial dissections, varicose veins, thrombophlebitis and phlebothrombosis, vascular tumors, and complications of thrombolysis, balloon angioplasty, vascular replacement, and coronary artery bypass graft surgery; heart disorders such as congestive heart failure, ischemic heart disease, angina pectoris, myocardial infarction, hypertensive heart disease, degenerative valvular heart disease, calcific aortic valve stenosis, congenitally bicuspid aortic valve, mitral annular calcification, mural valve prolapse, rheumatic fever and rheumatic heart disease, infective endocarditis, nonbacterial thrombotic endocarditis, endocarditis of systemic lupus erythematosus, carcinoid heart disease, cardiomyopathy, myocarditis, pericarditis, neoplastic heart disease, congenital heart disease, and complications of cardiac transplantation; and lung disorders such as congenital lung anomalies, atelectasis, pulmonary congestion and edema, pulmonary embolism, pulmonary hemorrhage, pulmonary infarction, pulmonary hypertension, vascular sclerosis, obstructive pulmonary disease, restrictive pulmonary disease, chronic obstructive pulmonary disease, emphysema, chronic bronchitis, bronchial asthma, bronchiectasis, bacterial pneumonia, viral and mycoplasmal pneumonia, lung abscess, pulmonary tuberculosis, diffuse interstitial diseases, pneumoconioses, sarcoidosis, idiopathic pulmonary fibrosis, desquamative interstitial pneumonitis, hypersensitivity pneumonitis, pulmonary eosinophilia bronchiolitis obliterans-organizing pneumonia, diffuse pulmonary hemorrhage syndromes, Goodpasture's syndromes, idiopathic pulmonary hemosiderosis, pulmonary involvement in collagen-vascular disorders, pulmonary alveolar proteinosis, lung tumors, inflammatory and noninflammatory pleural effusions, pneumothorax, pleural tumors, drug-induced lung disease, radiation-induced lung disease, and complications of lung transplantation; a neurological disorder such as epilepsy, ischemic cerebrovascular disease, stroke, cerebral neoplasms, Alzheimer's disease, Pick's disease, Huntington's disease, dementia, Parkinson's disease and other extrapyramidal disorders, CA 02362300 2001-07-31 pCT~S00/04160 amyotrophic lateral sclerosis and other motor neuron disorders, progressive neural muscular atrophy, retinitis pigmentosa, hereditary ataxias, multiple sclerosis and other demyelinating diseases, bacterial and viral meningitis, brain abscess, subdural empyema, epidural abscess, suppurative intracranial thrombophlebitis, myelitis and radiculitis, viral central nervous system disease; prion diseases including kuru, Creutzfeldt-Jakob disease, and Gerstmann-Straussler-Scheinker syndrome;
fatal familial insomnia, nutritional and metabolic diseases of the nervous system, neurofibromatosis, tuberous sclerosis, cerebelloretinal hemangioblastomatosis, encephalotrigeminal syndrome, mental retardation and other developmental disorders of the central nervous system, cerebral palsy, neuroskeletal disorders, autonomic nervous system disorders, cranial nerve disorders, spinal cord diseases, muscular dystrophy and other neuromuscular disorders, peripheral nervous system disorders, dermatomyositis and polymyositis; inherited, metabolic, endocrine, and toxic myopathies;
myasthenia gravis, periodic paralysis; mental disorders including mood, anxiety, and schizophrenic disorders; seasonal affective disorder (SAD); akathesia, amnesia, catatonia, diabetic neuropathy, tardive dyskinesia, dystonias, paranoid psychoses, postherpetic neuralgia, Tourette's disorder, progressive supranuclear palsy, corticobasal degeneration, and familial frontotemporal dementia; a gastrointestinal disorder such as dysphagia, peptic esophagitis, esophageal spasm, esophageal stricture, esophageal carcinoma, dyspepsia, indigestion, gastritis, gastric carcinoma, anorexia, nausea, emesis, gastroparesis, antral or pyloric edema, abdominal angina, pyrosis, gastroenteritis, intestinal obstruction, infections of the intestinal tract, peptic ulcer, cholelithiasis, cholecystitis, cholestasis, pancreatitis, pancreatic carcinoma, biliary tract disease, hepatitis, hyperbilirubinemia, cirrhosis, passive congestion of the liver, hepatoma, infectious colitis, ulcerative colitis, ulcerative proctitis, Crohn's disease, Whipple's disease, Mallory-Weiss syndrome, colonic carcinoma, colonic obstruction, irritable bowel syndrome, short bowel syndrome, diarrhea, constipation, gastrointestinal hemorrhage, acquired immunodeficiency syndrome (AIDS) enteropathy, jaundice, hepatic encephalopathy, hepatorenal syndrome, hepatic steatosis, hemochromatosis, Wilson's disease, alpha,-antitrypsin deficiency, Reye's syndrome, primary sclerosing cholangitis, liver infarction, portal vein obstruction and thrombosis, centrilobular necrosis, peliosis hepatis, hepatic vein thrombosis, veno-occlusive disease, preeclampsia, eclampsia, acute fatty liver of pregnancy, intrahepatic cholestasis of pregnancy, and hepatic tumors including nodular hyperplasias, adenomas, and carcinomas; and a disorder of lipid metabolism such as fatty liver, cholestasis, primary biliary cirrhosis, carnitine deficiency, carnitine palmitoyltransferase deficiency, myoadenylate deaminase deficiency, hypertriglyceridemia, lipid storage disorders such Fabry's disease, Gaucher's disease, Niemann-Pick's disease, metachromatic leukodystrophy, adrenoleukodystrophy, GMZ gangliosidosis, and ceroid lipofuscinosis, abetalipoproteinemia, Tangier disease, hyperlipoproteinemia, diabetes mellitus, lipodystrophy, lipomatoses, acute panniculitis, disseminated fat necrosis, adiposis dolorosa, lipoid adrenal hyperplasia, minimal change disease, lipomas, atherosclerosis, hypercholesterolemia, hypercholesterolemia with hypertriglyceridemia, primary hypoalphalipoproteinemia, hypothyroidism, renal disease, liver disease, lecithin:cholesterol acyltransferase deficiency, cerebrotendinous xanthomatosis, sitosterolemia, hypocholesterolemia, Tay-Sachs disease, Sandhoff's disease, hyperlipidemia, hyperlipemia, lipid myopathies, and obesity.
In another embodiment, a vector capable of expressing LIPAP or a fragment or derivative thereof may be administered to a subject to treat or prevent a disorder associated with decreased expression or activity of LIPAP including, but not limited to, those described above.
In a further embodiment, a pharmaceutical composition comprising a substantially purified LIPAP in conjunction with a suitable pharmaceutical carrier may be administered to a subject to treat or prevent a disorder associated with decreased expression or activity of LIPAP including, but not limited to, those provided above.
In still another embodiment, an agonist which modulates the activity of LIPAP
may be administered to a subject to treat or prevent a disorder associated with decreased expression or activity of LIPAP including, but not limited to, those listed above.
In a further embodiment, an antagonist of LIPAP may be administered to a subject to treat or prevent a disorder associated with increased expression or activity of LIPAP.
Examples of such disorders include, but are not limited to, those cardiovascular, neurological, and gastrointestinal disorders, and disorders of lipid metabolism, described above. In one aspect, an antibody which specifically binds LIPAP may be used directly as an antagonist or indirectly as a targeting or delivery mechanism for bringing a pharmaceutical agent to cells or tissues which express LIPAP.
In an additional embodiment, a vector expressing the complement of the polynucleotide encoding LIPAP may be administered to a subject to treat or prevent a disorder associated with increased expression or activity of LIPAP including, but not limited to, those described above.
In other embodiments, any of the proteins, antagonists, antibodies, agonists, complementary sequences, or vectors of the invention may be administered in combination with other appropriate therapeutic agents. Selection of the appropriate agents for use in combination therapy may be made by one of ordinary skill in the art, according to conventional pharmaceutical principles. The combination of therapeutic agents may act synergistically to effect the treatment or prevention of the various disorders described above. Using this approach, one may be able to achieve therapeutic efficacy with lower dosages of each agent, thus reducing the potential for adverse side effects.
An antagonist of LIPAP may be produced using methods which are generally known in the art.
In particular, purified LIPAP may be used to produce antibodies or to screen libraries of pharmaceutical agents to identify those which specifically bind LIPAP. Antibodies to LIPAP
may also be generated using methods that are well known in the art. Such antibodies may include, but are not limited to, polyclonal, monoclonal, chimeric, and single chain antibodies, Fab fragments, and fragments produced by a Fab expression library. Neutralizing antibodies (i.e., those which inhibit dimer formation) are generally preferred for therapeutic use.
For the production of antibodies, various hosts including goats, rabbits, rats, mice, humans, and others may be immunized by injection with LIPAP or with any fragment or oligopeptide thereof which has immunogenic properties. Depending on the host species, various adjuvants may be used to increase immunological response. Such adjuvants include, but are not limited to, Freund's, mineral gels such as aluminum hydroxide, and surface active substances such as lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions, KLH, and dinitrophenol. Among adjuvants used in humans, BCG (bacilli Calmette-Guerin) and Corynebacterium parvum are especially preferable.
It is preferred that the oligopeptides, peptides, or fragments used to induce antibodies to LIPAP
have an amino acid sequence consisting of at least about 5 amino acids, and generally will consist of at least about 10 amino acids. It is also preferable that these oligopeptides, peptides, or fragments are identical to a portion of the amino acid sequence of the natural protein and contain the entire amino acid sequence of a small, naturally occurring molecule. Short stretches of LIPAP amino acids may be fused with those of another protein, such as KLH, and antibodies to the chimeric molecule may be produced.
Monoclonal antibodies to LIPAP may be prepared using any technique which provides for the production of antibody molecules by continuous cell lines in culture. These include, but are not limited to, the hybridoma technique, the human B-cell hybridoma technique, and the EBV-hybridoma technique. (See, e.g., Kohler, G. et al. (1975) Nature 256:495-497; Kozbor, D.
et al. (1985) J.
Immunol. Methods 81:31-42; Cote, R.J. et al. (1983) Proc. Natl. Acad. Sci. USA
80:2026-2030; and Cole, S.P. et al. (1984) Mol. Cell Biol. 62:109-120.) In addition, techniques developed for the production of "chimeric antibodies,"
such as the splicing of mouse antibody genes to human antibody genes to obtain a molecule with appropriate antigen specificity and biological activity, can be used. (See, e.g., Morrison, S.L. et al. ( 1984) Proc.
Natl. Acad. Sci. USA 81:6851-6855; Neuberger, M.S. et al. (1984) Nature 312:604-608; and Takeda, S.
et al. (1985) Nature 314:452-454.) Alternatively, techniques described for the production of single chain antibodies may be adapted, using methods known in the art, to produce LIPAP-specific single chain antibodies. Antibodies with related specificity, but of distinct idiotypic composition, may be generated by chain shuffling from random combinatorial immunoglobulin libraries. (See, e.g., Burton, D.R. ( 1991 ) Proc. Natl. Acad. Sci. USA 88:10134-10137.) Antibodies may also be produced by inducing in vivo production in the lymphocyte population or by screening immunoglobulin libraries or panels of highly specific binding reagents as disclosed in the literature. (See, e.g., Orlandi, R. et al. (1989) Proc. Natl. Acad. Sci.
USA 86:3833-3837; Winter, G.
et al. ( I 991 ) Nature 349:293-299.) Antibody fragments which contain specific binding sites for LIl'AP may also be generated. For example, such fragments include, but are not limited to, F(ab~z fragments produced by pepsin digestion of the antibody molecule and Fab fragments generated by reducing the disulfide bridges of the F(ab~2 fragments. Alternatively, Fab expression libraries may be constructed to allow rapid and easy identification of monoclonal Fab fragments with the desired specificity.
(See, e.g., Huse, W.D. et al. (1989) Science 246:1275-1281.) Various immunoassays may be used for screening to identify antibodies having the desired specificity. Numerous protocols for competitive binding or immunoradiometric assays using either polyclonal or monoclonal antibodies with established specificities are well known in the art. Such immunoassays typically involve the measurement of complex formation between LIPAP and its specific antibody. A two-site, monoclonal-based immunoassay utilizing monoclonal antibodies reactive to two non-interfering LIPAP epitopes is generally used, but a competitive binding assay may also be employed (Pound, supra).
Various methods such as Scatchard analysis in conjunction with radioimmunoassay techniques may be used to assess the affinity of antibodies for LIPAP. Affinity is expressed as an association constant, Ka, which is defined as the molar concentration of LIPAP-antibody complex divided by the molar concentrations of free antigen and free antibody under equilibrium conditions. The Ka determined for a preparation of polyclonal antibodies, which are heterogeneous in their affinities for multiple LIPAP epitopes, represents the average affinity, or avidity, of the antibodies for LIPAP. The K~ determined for a preparation of monoclonal antibodies, which are monospecific for a particular LIPAP epitope, represents a true measure of affinity. High-affinity antibody preparations with K
ranging from about 109 to 10''- L/mole are preferred for use in immunoassays in which the LIPAP-antibody complex must withstand rigorous manipulations. Low-affinity antibody preparations with Ka ranging from about 106 to 10' L/mole are preferred for use in immunopurification and similar procedures which ultimately require dissociation of LIPAP, preferably in active form, from the antibody (Catty, D. (1988) Antibodies, Volume I: A Practical Approach, IRL
Press, Washington, DC;
Liddell, J.E. and Cryer, A. (1991) A Practical Guide to Monoclonal Antibodies, John Wiley & Sons, New York NY).
The titer and avidity of polyclonal antibody preparations may be further evaluated to determine the quality and suitability of such preparations for certain downstream applications. For example, a polyclonal antibody preparation containing at least 1-2 mg specific antibody/ml, preferably 5-10 mg specific antibody/ml, is generally employed in procedures requiring precipitation of LIPAP-antibody complexes. Procedures for evaluating antibody specificity, titer, and avidity, and guidelines for antibody quality and usage in various applications, are generally available.
(See, e.g., Catty, supra, and Coligan et al. supra.) In another embodiment of the invention, the polynucleotides encoding LIPAP, or any fragment or complement thereof, may be used for therapeutic purposes. In one aspect, the complement of the polynucleotide encoding LIPAP may be used in situations in which it would~be desirable to block the transcription of the mRNA. In particular, cells may be transformed with sequences complementary to polynucleotides encoding LIPAP. Thus, complementary molecules or fragments may be used to modulate LIPAP activity, or to achieve regulation of gene function. Such technology is now well known in the art, and sense or antisense oligonucleotides or larger fragments can be designed from various locations along the coding or control regions of sequences encoding LIPAP.
Expression vectors derived from retroviruses, adenoviruses, or herpes or vaccinia viruses, or from various bacterial plasmids, may be used for delivery of nucleotide sequences to the targeted organ, tissue, or cell population. Methods which are well known to those skilled in the art can be used to construct vectors to express nucleic acid sequences complementary to the polynucleotides encoding LIPAP. (See, e.g., Sambrook, sera; Ausubel, 1995, supra.) Genes encoding LIPAP can be turned off by transforming a cell or tissue with expression vectors which express high levels of a polynucleotide, or fragment thereof, encoding LIPAP. Such constructs may be used to introduce untranslatable sense or antisense sequences into a cell. Even in the absence of integration into the DNA, such vectors may continue to transcribe RNA molecules until they are disabled by endogenous nucleases. Transient expression may last for a month or more with a non-replicating vector, and may last even longer if appropriate replication elements are part of the vector system.
As mentioned above, modifications of gene expression can be obtained by designing complementary sequences or antisense molecules (DNA, RNA, or PNA) to the control, 5', or regulatory regions of the gene encoding LIPAP. Oligonucleotides derived from the transcription initiation site, e.g., between about positions -10 and +10 from the start site, may be employed. Similarly, inhibition can be achieved using triple helix base-pairing methodology. Triple helix pairing is useful because it causes inhibition of the ability of the double helix to open sufficiently for the binding of polymerases, transcription factors, or regulatory molecules. Recent therapeutic advances using triplex DNA have been described in the literature. (See, e.g., Gee, J.E. et al. (1994) in Huber, B.E. and B.I. Carr, Molecular and Immunolo_ ig c Approaches, Futura Publishing, Mt. Kisco NY, pp.
163-177.) A
complementary sequence or antisense molecule may also be designed to block translation of mRNA by preventing the transcript from binding to ribosomes.
Ribozymes, enzymatic RNA molecules, may also be used to catalyze the specific cleavage of RNA. The mechanism of ribozyme action involves sequence-specific hybridization of the ribozyme molecule to complementary target RNA, followed by endonucleolytic cleavage.
For example, engineered hammerhead motif ribozyme molecules may specifically and efficiently catalyze endonucleolytic cleavage of sequences encoding LIPAP.
Specific ribozyme cleavage sites within any potential RNA target are initially identified by scanning the target molecule for ribozyme cleavage sites, including the following sequences: GUA, GUU, and GUC. Once identified, short RNA sequences of between 15 and 20 ribonucleotides, corresponding to the region of the target gene containing the cleavage site, may be evaluated for secondary structural features which may render the oligonucleotide inoperable.
The suitability of candidate targets may also be evaluated by testing accessibility to hybridization with complementary oligonucleotides using ribonuclease protection assays.
Complementary ribonucleic acid molecules and ribozymes of the invention may be prepared by any method known in the art for the synthesis of nucleic acid molecules. These include techniques for chemically synthesizing oligonucleotides such as solid phase phosphoramidite chemical synthesis.
Alternatively, RNA molecules may be generated by in vitro and in vivo transcription of DNA
sequences encoding LIPAP. Such DNA sequences may be incorporated into a wide variety of vectors with suitable RNA polymerase promoters such as T7 or SP6. Alternatively, these cDNA constructs that synthesize complementary RNA, constitutively or inducibly, can be introduced into cell lines, cells, or tissues.
RNA molecules may be modified to increase intracellular stability and half-life. Possible modifications include, but are not limited to, the addition of flanking sequences at the 5' and/or 3' ends of the molecule, or the use of phosphorothioate or 2' O-methyl rather than phosphodiesterase linkages within the backbone of the molecule. This concept is inherent in the production of PNAs and can be extended in all of these molecules by the inclusion of nontraditional bases such as inosine, queosine, and wybutosine, as well as acetyl-, methyl-, thio-, and similarly modified forms of adenine, cytidine, guanine, thymine, and uridine which are not as easily recognized by endogenous endonucleases.
Many methods for introducing vectors into cells or tissues are available and equally suitable for use in vivo, in vitro, and ex vivo. For ex vivo therapy, vectors may be introduced into stem cells taken from the patient and clonally propagated for autologous transplant back into that same patient.
Delivery by transfection, by liposome injections, or by polycationic amino polymers may be achieved using methods which are well known in the art. (See, e.g., Goldman, C.K. et al. ( 1997) Nat.
Biotechnol. 15:462-466.) Any of the therapeutic methods described above may be applied to any subject in need of such therapy, including, for example, mammals such as humans, dogs, cats, cows, horses, rabbits, and monkeys.
An additional embodiment of the invention relates to the administration of a pharmaceutical or sterile composition, in conjunction with a pharmaceutically acceptable carrier, for any of the therapeutic effects discussed above. Such pharmaceutical compositions may consist of LIPAP, antibodies to LIPAP, and mimetics, agonists, antagonists, or inhibitors of LIPAP. The compositions may be administered alone or in combination with at least one other agent, such as a stabilizing compound, which may be administered in any sterile, biocompatible pharmaceutical carrier including, but not limited to, saline, buffered saline, dextrose, and water. The compositions may be administered to a patient alone, or in combination with other agents, drugs, or hormones.
The pharmaceutical compositions utilized in this invention may be administered by any number of routes including, but not limited to, oral, intravenous, intramuscular, intra-arterial, intramedullary, intrathecal, intraventricular, transdermal, subcutaneous, intraperitoneal, intranasal, enteral, topical, sublingual, or rectal means.
In addition to the active ingredients, these pharmaceutical compositions may contain suitable pharmaceutically-acceptable carriers comprising excipients and auxiliaries which facilitate processing of the active compounds into preparations which can be used pharmaceutically.
Further details on techniques for formulation and administration may be found in the latest edition of Remin tg on's Pharmaceutical Sciences (Maack Publishing, Easton PA).
Pharmaceutical compositions for oral administration can be formulated using pharmaceutically acceptable carriers well known in the art in dosages suitable for oral administration. Such carriers enable the pharmaceutical compositions to be formulated as tablets, pills, dragees, capsules, liquids, gels, syrups, slurries, suspensions, and the like, for ingestion by the patient.
Pharmaceutical preparations for oral use can be obtained through combining active compounds with solid excipient and processing the resultant mixture of granules (optionally, after grinding) to obtain tablets or dragee cores. Suitable auxiliaries can be added, if desired.
Suitable excipients include carbohydrate or protein fillers, such as sugars, including lactose, sucrose, mannitol, and sorbitol; starch from corn, wheat, rice, potato, or other plants; cellulose, such as methyl cellulose, hydroxypropylmethyl-cellulose, or sodium carboxymethylcellulose; gums, including arabic and tragacanth; and proteins, such as gelatin and collagen. If desired, disintegrating or solubilizing agents may be added, such as the cross-linked polyvinyl pyrrolidone, agar, and alginic acid or a salt thereof, such as sodium alginate.
Dragee cores may be used in conjunction with suitable coatings, such as concentrated sugar solutions, which may also contain gum arabic, talc, polyvinylpyrrolidone, carbopol gel, polyethylene glycol, and/or titanium dioxide, lacquer solutions, and suitable organic solvents or solvent mixtures.
Dyestuffs or pigments may be added to the tablets or dragee coatings for product identification or to characterize the quantity of active compound, i.e., dosage.
Pharmaceutical preparations which can be used orally include push-fit capsules made of gelatin, as well as soft, sealed capsules made of gelatin and a coating, such as glycerol or sorbitol.
Push-fit capsules can contain active ingredients mixed with fillers or binders, such as lactose or CA 02362300 2001-07-31 pC~/US00/04160 starches, lubricants, such as talc or magnesium stearate, and, optionally, stabilizers. In soft capsules, the active compounds may be dissolved or suspended in suitable liquids, such as fatty oils, liquid, or liquid polyethylene glycol with or without stabilizers.
Pharmaceutical formulations suitable for parenteral administration may be formulated in aqueous solutions, preferably in physiologically compatible buffers such as Hanks' solution, Ringer's solution, or physiologically buffered saline. Aqueous injection suspensions may contain substances which increase the viscosity of the suspension, such as sodium carboxymethyl cellulose, sorbitol, or dextran. Additionally, suspensions of the active compounds may be prepared as appropriate oily injection suspensions. Suitable lipophilic solvents or vehicles include fatty oils, such as sesame oil, or synthetic fatty acid esters, such as ethyl oleate, triglycerides, or liposomes. Non-lipid polycationic amino polymers may also be used for delivery. Optionally, the suspension may also contain suitable stabilizers or agents to increase the solubility of the compounds and allow for the preparation of highly concentrated solutions.
For topical or nasal administration, penetrants appropriate to the particular barrier to be permeated are used in the formulation. Such penetrants are generally known in the art.
The pharmaceutical compositions of the present invention may be manufactured in a manner that is known in the art, e.g., by means of conventional mixing, dissolving, granulating, dragee-making, levigating, emulsifying, encapsulating, entrapping, or lyophilizing processes.
The pharmaceutical composition may be provided as a salt and can be formed with many acids, including but not limited to, hydrochloric, sulfuric, acetic, lactic, tartaric, malic, and succinic acids.
Salts tend to be more soluble in aqueous or other protonic solvents than are the corresponding free base forms. In other cases, the preparation may be a lyophilized powder which may contain any or all of the following: 1 mM to 50 mM histidine, 0.1 % to 2% sucrose, and 2% to 7%
mannitol, at a pH range of 4.5 to 5.5, that is combined with buffer prior to use.
After pharmaceutical compositions have been prepared, they can be placed in an appropriate container and labeled for treatment of an indicated condition. For administration of LIPAP, such labeling would include amount, frequency, and method of administration.
Pharmaceutical compositions suitable for use in the invention include compositions wherein the active ingredients are contained in an effective amount to achieve the intended purpose. The determination of an effective dose is well within the capability of those skilled in the art.
For any compound, the therapeutically effective dose can be estimated initially either in cell culture assays, e.g., of neoplastic cells, or in animal models such as mice, rats, rabbits, dogs, or pigs.
An animal model may also be used to determine the appropriate concentration range and route of administration. Such information can then be used to determine useful doses and routes for administration in humans.
WO 00/49043 CA 02362300 2001-07-31 pCT~JS00/04160 A therapeutically effective dose refers to that amount of active ingredient, for example LIPAP
or fragments thereof, antibodies of LIl'AP, and agonists, antagonists or inhibitors of LIPAP, which ameliorates the symptoms or condition. Therapeutic efficacy and toxicity may be determined by standard pharmaceutical procedures in cell cultures or with experimental animals, such as by calculating the EDSO (the dose therapeutically effective in 50% of the population) or LDSO (the dose lethal to 50% of the population) statistics. The dose ratio of toxic to therapeutic effects is the therapeutic index, which can be expressed as the LDSO/EDSO ratio.
Pharmaceutical compositions which exhibit large therapeutic indices are preferred. The data obtained from cell culture assays and animal studies are used to formulate a range of dosage for human use. The dosage contained in such compositions is preferably within a range of circulating concentrations that includes the EDSO with little or no toxicity. The dosage varies within this range depending upon the dosage form employed, the sensitivity of the patient, and the route of administration.
The exact dosage will be determined by the practitioner, in light of factors related to the subject requiring treatment. Dosage and administration are adjusted to provide sufficient levels of the active moiety or to maintain the desired effect. Factors which may be taken into account include the severity of the disease state, the general health of the subject, the age, weight, and gender of the subject, time and frequency of administration, drug combination(s), reaction sensitivities, and response to therapy.
Long-acting pharmaceutical compositions may be administered every 3 to 4 days, every week, or biweekly depending on the half-life and clearance rate of the particular formulation.
Normal dosage amounts may vary from about 0.1 ~g to 100,000 fig, up to a total dose of about 1 gram, depending upon the route of administration. Guidance as to particular dosages and methods of delivery is provided in the literature and generally available to practitioners in the art. Those skilled in the art will employ different formulations for nucleotides than for proteins or their inhibitors.
Similarly, delivery of polynucleotides or polypeptides will be specific to particular cells, conditions, locations, etc.
DIAGNOSTICS
In another embodiment, antibodies which specifically bind LIPAP may be used for the diagnosis of disorders characterized by expression of LIPAP, or in assays to monitor patients being treated with LIPAP or agonists, antagonists, or inhibitors of LIPAP.
Antibodies useful for diagnostic purposes may be prepared in the same manner as described above for therapeutics. Diagnostic assays for LIPAP include methods which utilize the antibody and a label to detect LIPAP in human body fluids or in extracts of cells or tissues. The antibodies may be used with or without modification, and may be labeled by covalent or non-covalent attachment of a reporter molecule.
A wide variety of reporter molecules, several of which are described above, are known in the art and may be used.
A variety of protocols for measuring LIPAP, including ELISAs, RIAs, and FACS, are known in CA 02362300 2001-07-31 PCT~jS00/04160 the art and provide a basis for diagnosing altered or abnormal levels of LIPAP
expression. Normal or standard values for LIPAP expression are established by combining body fluids or cell extracts taken from normal mammalian subjects, for example, human subjects, with antibody to LIPAP under conditions suitable for complex formation. The amount of standard complex formation may be quantitated by various methods, such as photometric means. Quantities of LIPAP
expressed in subject, control, and disease samples from biopsied tissues are compared with the standard values. Deviation between standard and subject values establishes the parameters for diagnosing disease.
In another embodiment of the invention, the polynucleotides encoding LIPAP may be used for diagnostic purposes. The polynucleotides which may be used include oligonucleotide sequences, complementary RNA and DNA molecules, and PNAs. The polynucleotides may be used to detect and quantify gene expression in biopsied tissues in which expression of LIPAP may be correlated with disease. The diagnostic assay may be used to determine absence, presence, and excess expression of LIPAP, and to monitor regulation of LIPAP levels during therapeutic intervention.
In one aspect, hybridization with PCR probes which are capable of detecting polynucleotide sequences, including genomic sequences, encoding LIPAP or closely related molecules may be used to identify nucleic acid sequences which encode LIPAP. The specificity of the probe, whether it is made from a highly specific region, e.g., the 5'regulatory region, or from a less specific region, e.g., a conserved motif, and the stringency of the hybridization or amplification will determine whether the probe identifies only naturally occurring sequences encoding LIPAP, allelic variants, or related sequences.
Probes may also be used for the detection of related sequences, and may have at least 50%
sequence identity to any of the LIPAP encoding sequences. The hybridization probes of the subject invention may be DNA or RNA and may be derived from the sequence of SEQ ID
N0:13-24 or from genomic sequences including promoters, enhancers, and introns of the LIPAP
gene.
Means for producing specific hybridization probes for DNAs encoding LIPAP
include the cloning of polynucleotide sequences encoding LIPAP or LIl'AP derivatives into vectors for the production of mRNA probes. Such vectors are known in the art, are commercially available, and may be used to synthesize RNA probes in vitro by means of the addition of the appropriate RNA
polymerases and the appropriate labeled nucleotides. Hybridization probes may be labeled by a variety of reporter groups, for example, by radionuclides such as 32P or 35S, or by enzymatic labels, such as alkaline phosphatase coupled to the probe via avidin/biotin coupling systems, and the like.
Polynucleotide sequences encoding LIPAP may be used for the diagnosis of disorders associated with expression of LIPAP. Examples of such disorders include, but are not limited to, a cardiovascular disorder including blood vessel disorders such as arteriovenous fistula, atherosclerosis, hypertension, vasculitis, Raynaud's disease, aneurysms, arterial dissections, varicose veins, CA 02362300 2001-07-31 pC'j'/US00/04160 thrombophlebitis and phlebothrombosis, vascular tumors, and complications of thrombolysis, balloon angioplasty, vascular replacement, and coronary artery bypass graft surgery;
heart disorders such as congestive heart failure, ischemic heart disease, angina pectoris, myocardial infarction, hypertensive heart disease, degenerative valvular heart disease, calcific aortic valve stenosis, congenitally bicuspid aortic valve, mural annular calcification, mural valve prolapse, rheumatic fever and rheumatic heart disease, infective endocarditis, nonbacterial thrombotic endocarditis, endocarditis of systemic lupus erythematosus, carcinoid heart disease, cardiomyopathy, myocarditis, pericarditis, neoplastic heart disease, congenital heart disease, and complications of cardiac transplantation; and lung disorders such as congenital lung anomalies, atelectasis, pulmonary congestion and edema, pulmonary embolism, pulmonary hemorrhage, pulmonary infarction, pulmonary hypertension, vascular sclerosis, obstructive pulmonary disease, restrictive pulmonary disease, chronic obstructive pulmonary disease, emphysema, chronic bronchitis, bronchial asthma, bronchiectasis, bacterial pneumonia, viral and mycoplasmal pneumonia, lung abscess, pulmonary tuberculosis, diffuse interstitial diseases, pneumoconioses, sarcoidosis, idiopathic pulmonary fibrosis, desquamative interstitial pneumonitis, hypersensitivity pneumonitis, pulmonary eosinophilia bronchiolitis obliterans-organizing pneumonia, diffuse pulmonary hemorrhage syndromes, Goodpasture's syndromes, idiopathic pulmonary hemosiderosis, pulmonary involvement in collagen-vascular disorders, pulmonary alveolar proteinosis, lung tumors, inflammatory and noninflammatory pleural effusions, pneumothorax, pleural tumors, drug-induced lung disease, radiation-induced lung disease, and complications of lung transplantation; a neurological disorder such as epilepsy, ischemic cerebrovascular disease, stroke, cerebral neoplasms, Alzheimer's disease, Pick's disease, Huntington's disease, dementia, Parkinson's disease and other extrapyramidal disorders, amyotrophic lateral sclerosis and other motor neuron disorders, progressive neural muscular atrophy, retinitis pigmentosa, hereditary ataxias, multiple sclerosis and other demyelinating diseases, bacterial and viral meningitis, brain abscess, subdural empyema, epidural abscess, suppurative intracranial thrombophlebitis, myelitis and radiculitis, viral central nervous system disease; prion diseases including kuru, Creutzfeldt-Jakob disease, and Gerstmann-Straussler-Scheinker syndrome;
fatal familial insomnia, nutritional and metabolic diseases of the nervous system, neurofibromatosis, tuberous sclerosis, cerebelloretinal hemangioblastomatosis, encephalotrigeminal syndrome, mental retardation and other developmental disorders of the central nervous system, cerebral palsy, neuroskeletal disorders, autonomic nervous system disorders, cranial nerve disorders, spinal cord diseases, muscular dystrophy and other neuromuscular disorders, peripheral nervous system disorders, dermatomyositis and polymyositis; inherited, metabolic, endocrine, and toxic myopathies;
myasthenia gravis, periodic paralysis; mental disorders including mood, anxiety, and schizophrenic disorders; seasonal affective disorder (SAD); akathesia, amnesia, catatonia, diabetic neuropathy, tardive dyskinesia, dystonias, paranoid psychoses, postherpetic neuralgia, Tourette's disorder, progressive supranuclear palsy, CA 02362300 2001-07-31 pCT~S00/04160 corticobasal degeneration, and familial frontotemporal dementia; a gastrointestinal disorder such as dysphagia, peptic esophagitis, esophageal spasm, esophageal stricture, esophageal carcinoma, dyspepsia, indigestion, gastritis, gastric carcinoma, anorexia, nausea, emesis, gastroparesis, antral or pyloric edema, abdominal angina, pyrosis, gastroenteritis, intestinal obstruction, infections of the intestinal tract, peptic ulcer, cholelithiasis, cholecystitis, cholestasis, pancreatitis, pancreatic carcinoma, biliary tract disease, hepatitis, hyperbilirubinemia, cirrhosis, passive congestion of the liver, hepatoma, infectious colitis, ulcerative colitis, ulcerative proctitis, Crohn's disease, Whipple's disease, Mallory-Weiss syndrome, colonic carcinoma, colonic obstruction, irritable bowel syndrome, short bowel syndrome, diarrhea, constipation, gastrointestinal hemorrhage, acquired immunodeficiency syndrome (AIDS) enteropathy, jaundice, hepatic encephalopathy, hepatorenal syndrome, hepatic steatosis, hemochromatosis, Wilson's disease, alpha,-antitrypsin deficiency, Reye's syndrome, primary sclerosing cholangitis, liver infarction, portal vein obstruction and thrombosis, centrilobular necrosis, peliosis hepatis, hepatic vein thrombosis, veno-occlusive disease, preeclampsia, eclampsia, acute fatty liver of pregnancy, intrahepatic cholestasis of pregnancy, and hepatic tumors including nodular hyperplasias, adenomas, and carcinomas; and a disorder of lipid metabolism such as fatty liver, cholestasis, primary biliary cirrhosis, carnitine deficiency, carnitine palmitoyltransferase deficiency, myoadenylate deaminase deficiency, hypertriglyceridemia, lipid storage disorders such Fabry's disease, Gaucher's disease, Niemann-Pick's disease, metachromatic leukodystrophy, adrenoleukodystrophy, GMZ gangliosidosis, and ceroid lipofuscinosis, abetalipoproteinemia, Tangier disease, hyperlipoproteinemia, diabetes mellitus, lipodystrophy, lipomatoses, acute panniculitis, disseminated fat necrosis, adiposis dolorosa, lipoid adrenal hyperplasia, minimal change disease, lipomas, atherosclerosis, hypercholesterolemia, hypercholesterolemia with hypertriglyceridemia, primary hypoalphalipoproteinemia, hypothyroidism, renal disease, liver disease, lecithin:cholesterol acyltransferase deficiency, cerebrotendinous xanthomatosis, sitosterolemia, hypocholesterolemia, Tay-Sachs disease, Sandhoff's disease, hyperlipidemia, hyperlipemia, lipid myopathies, and obesity. The polynucleotide sequences encoding LIPAP may be used in Southern or northern analysis, dot blot, or other membrane-based technologies; in PCR technologies; in dipstick, pin, and multiformat ELISA-like assays; and in microarrays utilizing fluids or tissues from patients to detect altered LIPAP expression.
Such qualitative or quantitative methods are well known in the art.
In a particular aspect, the nucleotide sequences encoding LIPAP may be useful in assays that detect the presence of associated disorders, particularly those mentioned above. The nucleotide sequences encoding LIPAP may be labeled by standard methods and added to a fluid or tissue sample from a patient under conditions suitable for the formation of hybridization complexes. After a suitable incubation period, the sample is washed and the signal is quantified and compared with a standard value. If the amount of signal in the patient sample is significantly altered in comparison to a control WO 00/49043 CA 02362300 2001-07-31 pCT/ITS00/04160 sample then the presence of altered levels of nucleotide sequences encoding LIPAP in the sample indicates the presence of the associated disorder. Such assays may also be used to evaluate the efficacy of a particular therapeutic treatment regimen in animal studies, in clinical trials, or to monitor the treatment of an individual patient.
In order to provide a basis for the diagnosis of a disorder associated with expression of LIPAP, a normal or standard profile for expression is established. This may be accomplished by combining body fluids or cell extracts taken from normal subjects, either animal or human, with a sequence, or a fragment thereof, encoding LIPAP, under conditions suitable for hybridization or amplification.
Standard hybridization may be quantified by comparing the values obtained from normal subjects with values from an experiment in which a known amount of a substantially purified polynucleotide is used.
Standard values obtained in this manner may be compared with values obtained from samples from patients who are symptomatic for a disorder. Deviation from standard values is used to establish the presence of a disorder.
Once the presence of a disorder is established and a treatment protocol is initiated, hybridization assays may be repeated on a regular basis to determine if the level of expression in the patient begins to approximate that which is observed in the normal subject.
The results obtained from successive assays may be used to show the efficacy of treatment over a period ranging from several days to months.
With respect to cancer, the presence of an abnormal amount of transcript (either under- or overexpressed) in biopsied tissue from an individual may indicate a predisposition for the development of the disease, or may provide a means for detecting the disease prior to the appearance of actual clinical symptoms. A more definitive diagnosis of this type may allow health professionals to employ preventative measures or aggressive treatment earlier thereby preventing the development or further progression of the cancer.
Additional diagnostic uses for oligonucleotides designed from the sequences encoding LIPAP
may involve the use of PCR. These oligomers may be chemically synthesized, generated enzymatically, or produced in vitro. Oligomers will preferably contain a fragment of a polynucleotide encoding LIPAP, or a fragment of a polynucleotide complementary to the polynucleotide encoding LIPAP, and will be employed under optimized conditions for identification of a specific gene or condition. Oligomers may also be employed under less stringent conditions for detection or quantification of closely related DNA or RNA sequences.
Methods which may also be used to quantify the expression of LIPAP include radiolabeling or biotinylating nucleotides, coamplification of a control nucleic acid, and interpolating results from standard curves. (See, e.g., Melby, P.C. et al. (1993) J. Immunol. Methods 159:235-244; Duplaa, C. et al. (1993) Anal. Biochem. 212:229-236.) The speed of quantitation of multiple samples may be accelerated by running the assay in a high-throughput format where the oligomer of interest is presented in various dilutions and a spectrophotometric or colorimetric response gives rapid quantitation.
In further embodiments, oligonucleotides or longer fragments derived from any of the polynucleotide sequences described herein may be used as targets in a microarray. The microarray can be used to monitor the expression level of large numbers of genes simultaneously and to identify genetic variants, mutations, and polymorphisms. This information may be used to determine gene function, to understand the genetic basis of a disorder, to diagnose a disorder, and to develop and monitor the activities of therapeutic agents.
Microarrays may be prepared, used, and analyzed using methods known in the art. (See, e.g., Brennan, T.M. et al. (1995) U.S. Patent No. 5,474,796; Schena, M. et al.
(1996) Proc. Natl. Acad. Sci.
USA 93:10614-10619; Baldeschweiler et al. (1995) PCT application W095/251116;
Shalom D. et al.
(1995) PCT application W095/35505; Heller, R.A. et al. (1997) Proc. Natl.
Acad. Sci. USA 94:2150-2155; and Heller, M.J. et al. ( 1997) U.S. Patent No. 5,605,662.) In another embodiment of the invention, nucleic acid sequences encoding LIPAP
may be used to generate hybridization probes useful in mapping the naturally occurring genomic sequence. The sequences may be mapped to a particular chromosome, to a specific region of a chromosome, or to artificial chromosome constructions, e.g., human artificial chromosomes (HACs), yeast artificial chromosomes (YACs), bacterial artificial chromosomes (BACs), bacterial P1 constructions, or single chromosome cDNA libraries. (See, e.g., Harrington, J.J. et al. (1997) Nat.
Genet. 15:345-355; Price, C.M. (1993) Blood Rev. 7:127-134; and Trask, B.J. (1991) Trends Genet. 7:149-154.) Fluorescent in situ hybridization (FISH) may be correlated with other physical chromosome mapping techniques and genetic map data. (See, e.g., Heinz-Ulrich, et al.
(1995) in Meyers, supra, pp.
965-968.) Examples of genetic map data can be found in various scientific journals or at the Online Mendelian Inheritance in Man (OMIM) World Wide Web site. Correlation between the location of the gene encoding LIPAP on a physical chromosomal map and a specific disorder, or a predisposition to a specific disorder, may help define the region of DNA associated with that disorder. The nucleotide sequences of the invention may be used to detect differences in gene sequences among normal, carrier, and affected individuals.
In situ hybridization of chromosomal preparations and physical mapping techniques, such as linkage analysis using established chromosomal markers, may be used for extending genetic maps.
Often the placement of a gene on the chromosome of another mammalian species, such as mouse, may reveal associated markers even if the number or arm of a particular human chromosome is not known.
New sequences can be assigned to chromosomal arms by physical mapping. This provides valuable information to investigators searching for disease genes using positional cloning or other gene discovery techniques. Once the disease or syndrome has been crudely localized by genetic linkage to a particular genomic region, e.g., ataxia-telangiectasia to l 1q22-23, any sequences mapping to that area may represent associated or regulatory genes for further investigation. (See, e.g., Gatti, R.A. et al.
(1988) Nature 336:577-580.) The nucleotide sequence of the subject invention may also be used to detect differences in the chromosomal location due to translocation, inversion, etc., among normal, carrier, or affected individuals.
In another embodiment of the invention, LIPAP, its catalytic or immunogenic fragments, or oligopeptides thereof can be used for screening libraries of compounds in any of a variety of drug screening techniques. The fragment employed in such screening may be free in solution, affixed to a solid support, borne on a cell surface, or located intracellularly. The formation of binding complexes between LIPAP and the agent being tested may be measured.
Another technique for drug screening provides for high throughput screening of compounds having suitable binding affinity to the protein of interest. (See, e.g., Geysen, et al. ( 1984) PCT
application W084/03564.) In this method, large numbers of different small test compounds are synthesized on a solid substrate. The test compounds are reacted with LIPAP, or fragments thereof, and washed. Bound LIPAP is then detected by methods well known in the art.
Purified LIPAP can also be coated directly onto plates for use in the aforementioned drug screening techniques.
Alternatively, non-neutralizing antibodies can be used to capture the peptide and immobilize it on a solid support.
In another embodiment, one may use competitive drug screening assays in which neutralizing antibodies capable of binding LIPAP specifically compete with a test compound for binding LIPAP. In this manner, antibodies can be used to detect the presence of any peptide which shares one or more antigenic determinants with LIPAP.
In additional embodiments, the nucleotide sequences which encode LIPAP may be used in any molecular biology techniques that have yet to be developed, provided the new techniques rely on properties of nucleotide sequences that are currently known, including, but not limited to, such properties as the triplet genetic code and specific base pair interactions.
Without further elaboration, it is believed that one skilled in the art can, using the preceding description, utilize the present invention to its fullest extent. The following preferred specific embodiments are, therefore, to be construed as merely illustrative, and not limitative of the remainder of the disclosure in any way whatsoever.
The disclosures of all patents, applications, and publications mentioned above and below, in particular U.S. Ser. No. 60/120,703 and U.S. Ser. No. 60/142,762, are hereby expressly incorporated by reference.
WO 00/49043 CA 02362300 2001-07-31 pCT~S00/04160 EXAMPLES
I. Construction of cDNA Libraries RNA was purchased from Clontech or isolated from tissues described in Table 4.
Some tissues were homogenized and lysed in guanidinium isothiocyanate, while others were homogenized and lysed in phenol or in a suitable mixture of denaturants, such as TRIZOL (Life Technologies), a monophasic solution of phenol and guanidine isothiocyanate. The resulting lysates were centrifuged over CsCI
cushions or extracted with chloroform. RNA was precipitated from the lysates with either isopropanol or sodium acetate and ethanol, or by other routine methods.
Phenol extraction and precipitation of RNA were repeated as necessary to increase RNA purity.
In some cases, RNA was treated with DNase. For most libraries, poly(A+) RNA
was isolated using oligo d(T)-coupled paramagnetic particles (Promega), OLIGOTEX latex particles (QIAGEN, Chatsworth CA), or an OLIGOTEX mRNA purification kit (QIAGEN). Alternatively, RNA was isolated directly from tissue lysates using other RNA isolation kits, e.g., the POLY(A)PURE mRNA
purification kit (Ambion, Austin TX).
In some cases, Stratagene was provided with RNA and constructed the corresponding cDNA
libraries. Otherwise, cDNA was synthesized and cDNA libraries were constructed with the UNIZAP
vector system (Stratagene) or SUPERSCRIPT plasmid system (Life Technologies), using the recommended procedures or similar methods known in the art. (See, e.g., Ausubel, 1997, supra, units 5.1-6.6.) Reverse transcription was initiated using oligo d(T) or random primers. Synthetic oligonucleotide adapters were ligated to double stranded cDNA, and the cDNA
was digested with the appropriate restriction enzyme or enzymes. For most libraries, the cDNA was size-selected (300-1000 bp) using SEPHACRYL S 1000, SEPHAROSE CL2B, or SEPHAROSE CL4B column chromatography (Amersham Pharmacia Biotech) or preparative agarose gel electrophoresis. cDNAs were ligated into compatible restriction enzyme sites of the polylinker of a suitable plasmid, e.g., PBLUESCRIPT
plasmid (Stratagene), PSPORT1 plasmid (Life Technologies), or pINCY (Incyte Pharmaceuticals, Palo Alto CA). Recombinant plasmids were transformed into competent E. coli cells including XLl-Blue, XL1-BIueMRF, or SOLR from Stratagene or DHSa, DH10B, or ElectroMAX DH10B from Life Technologies.
II. Isolation of cDNA Clones Plasmids were recovered from host cells by in vivo excision using the UNIZAP
vector system (Stratagene) or by cell lysis. Plasmids were purified using at least one of the following: a Magic or WIZARD Minipreps DNA purification system (Promega); an AGTC Miniprep purification kit (Edge Biosystems, Gaithersburg MD); and QIAWELL 8 Plasmid, QIAWELL 8 Plus Plasmid, Ultra Plasmid purification systems or the R.E.A.L. PREP 96 plasmid purification kit from QIAGEN.
Following precipitation, plasmids were resuspended in 0.1 ml of distilled water and stored, with or WO 00/49043 CA 02362300 2001-07-31 pCT/US00/04160 without lyophilization, at 4°C.
Alternatively, plasmid DNA was amplified from host cell lysates using direct link PCR in a high-throughput format (Rao, V.B. (1994) Anal. Biochem. 216:1-14). Host cell lysis and thermal cycling steps were carried out in a single reaction mixture. Samples were processed and stored in 384-well plates, and the concentration of amplified plasmid DNA was quantified fluorometrically using PICOGREEN dye (Molecular Probes, Eugene OR) and a FLUOROSKAN II fluorescence scanner (Labsystems Oy, Helsinki, Finland).
III. Sequencing and Analysis cDNA sequencing reactions were processed using standard methods or high-throughput instrumentation such as the ABI CATALYST 800 (Perkin-Elmer) thermal cycler or the PTC-200 thermal cycler (MJ Research) in conjunction with the HYDRA microdispenser (Robbins Scientific) or the MICROLAB 2200 (Hamilton) liquid transfer system. cDNA sequencing reactions were prepared using reagents provided by Amersham Pharmacia Biotech or supplied in ABI
sequencing kits such as the ABI PRISM BIGDYE Terminator cycle sequencing ready reaction kit (Perkin-Elmer).
Electrophoretic separation of cDNA sequencing reactions and detection of labeled polynucleotides were carried out using the MEGABACE 1000 DNA sequencing system (Molecular Dynamics); the ABI PRISM 373 or 377 sequencing system (Perkin-Elmer) in conjunction with standard ABI protocols and base calling software; or other sequence analysis systems known in the art. Reading frames within the cDNA sequences were identified using standard methods (reviewed in Ausubel, 1997, supra, unit 7.7). Some of the cDNA sequences were selected for extension using the techniques disclosed in Example VI.
The polynucleotide sequences derived from cDNA sequencing were assembled and analyzed using a combination of software programs which utilize algorithms well known to those skilled in the art. Table 5 summarizes the tools, programs, and algorithms used and provides applicable descriptions, references, and threshold parameters. The first column of Table 5 shows the tools, programs, and algorithms used, the second column provides brief descriptions thereof, the third column presents appropriate references, all of which are incorporated by reference herein in their entirety, and the fourth column presents, where applicable, the scores, probability values, and other parameters used to evaluate the strength of a match between two sequences (the higher the score, the greater the homology between two sequences). Sequences were analyzed using MACDNASIS PRO software (Hitachi Software Engineering, South San Francisco CA) and LASERGENE software (DNASTAR).
Polynucleotide and polypeptide sequence alignments were generated using the default parameters specified by the clustal algorithm as incorporated into the MEGALIGN multisequence alignment program (DNASTAR), which also calculates the percent identity between aligned sequences.
The polynucleotide sequences were validated by removing vector, linker, and polyA sequences and by masking ambiguous bases, using algorithms and programs based on BLAST, dynamic programing, and dinucleotide nearest neighbor analysis. The sequences were then queried against a selection of public databases such as the GenBank primate, rodent, mammalian, vertebrate, and eukaryote databases, and BLOCKS, PRINTS, DOMO, PRODOM, and PFAM to acquire annotation using programs based on BLAST, FASTA, and BLIMPS. The sequences were assembled into full length polynucleotide sequences using programs based on Phred, Phrap, and Consed, and were screened for open reading frames using programs based on GeneMark, BLAST, and FASTA. The full length polynucleotide sequences were translated to derive the corresponding full length amino acid sequences, and these full length sequences were subsequently analyzed by querying against databases such as the GenBank databases (described above), SwissProt, BLOCKS, PRINTS, DOMO, PRODOM, Prosite, and Hidden Markov Model (HMM)-based protein family databases such as PFAM. HMM is a probabilistic approach which analyzes consensus primary structures of gene families. (See, e.g., Eddy, S.R. (1996) Curr. Opin. Struct. Biol. 6:361-365.) The programs described above for the assembly and analysis of full length polynucleotide and amino acid sequences were also used to identify polynucleotide sequence fragments from SEQ ID
N0:13-24. Fragments from about 20 to about 4000 nucleotides which are useful in hybridization and amplification technologies were described in The Invention section above.
IV. Northern Analysis Northern analysis is a laboratory technique used to detect the presence of a transcript of a gene and involves the hybridization of a labeled nucleotide sequence to a membrane on which RNAs from a particular cell type or tissue have been bound. (See, e.g., Sambrook, supra, ch. 7; Ausubel, 1995, supra, ch. 4 and 16.) Analogous computer techniques applying BLAST were used to search for identical or related molecules in nucleotide databases such as GenBank or LIFESEQ (Incyte Pharmaceuticals). This analysis is much faster than multiple membrane-based hybridizations. In addition, the sensitivity of the computer search can be modified to determine whether any particular match is categorized as exact or similar. The basis of the search is the product score, which is defined as:
% sequence identity x % maximum BLAST score The product score takes into account both the degree of similarity between two sequences and the length of the sequence match. For example, with a product score of 40, the match will be exact within a 1 % to 2% error, and, with a product score of 70, the match will be exact.
Similar molecules are usually identified by selecting those which show product scores between 15 and 40, although lower scores may identify related molecules.
The results of northern analyses are reported as a percentage distribution of libraries in which WO 00/49043 PCT/iJS00/04160 the transcript encoding LIPAP occurred. Analysis involved the categorization of cDNA libraries by organ/tissue and disease. The organ/tissue categories included cardiovascular, dermatologic, developmental, endocrine, gastrointestinal, hematopoietic/immune, musculoskeletal, nervous, reproductive, and urologic. The disease/condition categories included cancer, inflammation, trauma, cell proliferation, neurological, and pooled. For each category, the number of libraries expressing the sequence of interest was counted and divided by the total number of libraries across all categories.
Percentage values of tissue-specific and disease- or condition-specific expression are reported in Table V. Chromosomal Mapping of LIPAP Encoding Polynucleotides The cDNA sequences which were used to assemble SEQ ID N0:19-24 were compared with sequences from the Incyte LIFESEQ database and public domain databases using BLAST and other implementations of the Smith-Waterman algorithm. Sequences from these databases that matched SEQ ID N0:19-24 were assembled into clusters of contiguous and overlapping sequences using assembly algorithms such as Phrap (Table 5). Radiation hybrid and genetic mapping data available from public resources such as the Stanford Human Genome Center (SHGC), Whitehead Institute for Genome Research (WIGR), and Genethon were used to determine if any of the clustered sequences had been previously mapped. Inclusion of a mapped sequence in a cluster resulted in the assignment of all sequences of that cluster, including its particular SEQ ID NO:, to that map location.
The genetic map location of SEQ ID N0:21 is described in The Invention as a range, or interval, of human chromosome 11. The map position of an interval, in centiMorgans, is measured relative to the terminus of the chromosome's p-arm. (The centiMorgan (cM) is a unit of measurement based on recombination frequencies between chromosomal markers. On average, 1 cM is roughly equivalent to 1 megabase (Mb) of DNA in humans, although this can vary widely due to hot and cold spots of recombination.) The cM distances are based on genetic markers mapped by Genethon which provide boundaries for radiation hybrid markers whose sequences were included in each of the clusters.
Human genome maps and other resources available to the public, such as the NCBI "GeneMap'99"
World Wide Web site (http:l/www.ncbi.nlm.nih.gov/genemap/), can be employed to determine if previously identified disease genes map within or in proximity to the intervals indicated above.
VI. Extension of LIPAP Encoding Polynucleotides The full length nucleic acid sequences of SEQ ID N0:13-24 were produced by extension of an appropriate fragment of the full length molecule using oligonucleotide primers designed from this fragment. One primer was synthesized to initiate 5' extension of the known fragment, and the other primer, to initiate 3' extension of the known fragment. The initial primers were designed using OLIGO
4.06 software (National Biosciences), or another appropriate program, to be about 22 to 30 nucleotides in length, to have a GC content of about 50% or more, and to anneal to the target sequence at temperatures of about 68°C to about 72°C. Any stretch of nucleotides which would result in hairpin structures and primer-primer dimerizations was avoided.
Selected human cDNA libraries were used to extend the sequence. If more than one extension was necessary or desired, additional or nested sets of primers were designed.
High fidelity amplification was obtained by PCR using methods well known in the art. PCR
was performed in 96-well plates using the PTC-200 thermal cycler (MJ Research, Inc.). The reaction mix contained DNA template, 200 nmol of each primer, reaction buffer containing Mg'+, (NH4)~S04, and (3-mercaptoethanol, Taq DNA polymerase (Amersham Pharmacia Biotech), ELONGASE enzyme (Life Technologies), and Pfu DNA polymerase (Stratagene), with the following parameters for primer pair PCI A and PCI B: Step 1: 94°C, 3 min; Step 2: 94°C, 15 sec;
Step 3: 60°C, 1 min; Step 4: 68°C, 2 min; Step 5: Steps 2, 3, and 4 repeated 20 times; Step 6: 68°C, 5 min;
Step 7: storage at 4°C. In the alternative, the parameters for primer pair T7 and SK+ were as follows: Step 1: 94°C, 3 min; Step 2:
94°C, 15 sec; Step 3: 57°C, 1 min; Step 4: 68°C, 2 min;
Step 5: Steps 2, 3, and 4 repeated 20 times;
Step 6: 68°C, 5 min; Step 7: storage at 4°C.
The concentration of DNA in each well was determined by dispensing 100 pl PICOGREEN
quantitation reagent (0.25% (v/v) PICOGREEN; Molecular Probes, Eugene OR) dissolved in 1X TE
and 0.5 pl of undiluted PCR product into each well of an opaque fluorimeter plate (Corning Costar, Acton MA), allowing the DNA to bind to the reagent. The plate was scanned in a Fluoroskan II
(Labsystems Oy, Helsinki, Finland) to measure the fluorescence of the sample and to quantify the concentration of DNA. A 5 ~1 to 10 ~1 aliquot of the reaction mixture was analyzed by electrophoresis on a 1 % agarose mini-gel to determine which reactions were successful in extending the sequence.
The extended nucleotides were desalted and concentrated, transferred to 384-well plates, digested with CviJI cholera virus endonuclease (Molecular Biology Research, Madison WI), and sonicated or sheared prior to relegation into pUC 18 vector (Amersham Pharmacia Biotech). For shotgun sequencing, the digested nucleotides were separated on low concentration (0.6 to 0.8%) agarose gels, fragments were excised, and agar digested with Agar ACE
(Promega). Extended clones were relegated using T4 ligase (New England Biolabs, Beverly MA) into pUC 18 vector (Amersham Pharmacia Biotech), treated with Pfu DNA polymerase (Stratagene) to fill-in restriction site overhangs, and transfected into competent E. coli cells. Transformed cells were selected on antibiotic-containing media, individual colonies were picked and cultured overnight at 37°C
in 384-well plates in LB/2x carb liquid media.
The cells were lysed, and DNA was amplified by PCR using Taq DNA polymerase (Amersham Pharmacia Biotech) and Pfu DNA polymerase (Stratagene) with the following parameters:
Step I: 94°C, 3 min; Step 2: 94°C, 15 sec; Step 3: 60°C, I min; Step 4: 72°C, 2 min; Step 5: steps 2, 3, and 4 repeated 29 times; Step 6: 72°C, 5 min; Step 7: storage at 4°C. DNA was quantified by PICOGREEN reagent (Molecular Probes) as described above. Samples with low DNA
recoveries were reamplified using the same conditions as described above. Samples were diluted with 20%
dimethysulfoxide (1:2, v/v), and sequenced using DYENAMIC energy transfer sequencing primers and the DYENAMIC DIRECT kit (Amersham Pharmacia Biotech) or the ABI PRISM BIGDYE
Terminator cycle sequencing ready reaction kit (Perkin-Elmer).
In like manner, the nucleotide sequences of SEQ ID N0:13-24 are used to obtain 5' regulatory sequences using the procedure above, oligonucleotides designed for such extension, and an appropriate genomic library.
VII. Labeling and Use of Individual Hybridization Probes Hybridization probes derived from SEQ ID N0:13-24 are employed to screen cDNAs, genomic DNAs, or mRNAs. Although the labeling of oligonucleotides, consisting of about 20 base pairs, is specifically described, essentially the same procedure is used with larger nucleotide fragments.
Oligonucleotides are designed using state-of-the-art software such as OLIGO
4.06 software (National Biosciences) and labeled by combining 50 pmol of each oligomer, 250 ,uCi of [y-3zP] adenosine triphosphate (Amersham Pharmacia Biotech), and T4 polynucleotide kinase (DuPont NEN, Boston MA). The labeled oligonucleotides are substantially purified using a SEPHADEX
G-25 superfine size exclusion dextran bead column (Amersham Pharmacia Biotech). An aliquot containing 10' counts per minute of the labeled probe is used in a typical membrane-based hybridization analysis of human genomic DNA digested with one of the following endonucleases: Ase I, Bgl II, Eco RI, Pst I, Xba I, or Pvu II (DuPont NEN).
The DNA from each digest is fractionated on a 0.7% agarose gel and transferred to nylon membranes (Nytran Plus, Schleicher & Schuell, Durham NH). Hybridization is carried out for 16 hours at 40°C. To remove nonspecific signals, blots are sequentially washed at room temperature under conditions of up to, for example, 0.1 x saline sodium citrate and 0.5% sodium dodecyl sulfate.
Hybridization patterns are visualized using autoradiography or an alternative imaging means and compared.
VIII. Microarrays A chemical coupling procedure and an ink jet device can be used to synthesize array elements on the surface of a substrate. (See, e.g., Baldeschweiler, supra.) An array analogous to a dot or slot blot may also be used to arrange and link elements to the surface of a substrate using thermal, UV, chemical, or mechanical bonding procedures. A typical array may be produced by hand or using available methods and machines and contain any appropriate number of elements.
After hybridization, nonhybridized probes are removed and a scanner used to determine the levels and patterns of fluorescence. The degree of complementarity and the relative abundance of each probe which hybridizes to an element on the microarray may be assessed through analysis of the scanned images.
Full-length cDNAs, Expressed Sequence Tags (ESTs), or fragments thereof may comprise the elements of the microarray. Fragments suitable for hybridization can be selected using software well known in the art such as LASERGENE software (DNASTAR). Full-length cDNAs, ESTs, or fragments thereof corresponding to one of the nucleotide sequences of the present invention, or selected at random from a cDNA library relevant to the present invention, are arranged on an appropriate substrate, e.g., a glass slide. The cDNA is fixed to the slide using, e.g., UV
cross-linking followed by thermal and chemical treatments and subsequent drying. (See, e.g., Schena, M.
et al. (1995) Science 270:467-470; Shalom D. et al. ( 1996) Genome Res. 6:639-645.) Fluorescent probes are prepared and used for hybridization to the elements on the substrate. The substrate is analyzed by procedures described above.
IX. Complementary Polynucleotides Sequences complementary to the LIPAP-encoding sequences, or any parts thereof, are used to detect, decrease, or inhibit expression of naturally occurring LIPAP. Although use of oligonucleotides comprising from about 15 to 30 base pairs is described, essentially the same procedure is used with smaller or with larger sequence fragments. Appropriate oligonucleotides are designed using OLIGO
4.06 software (National Biosciences) and the coding sequence of LIPAP. To inhibit transcription, a complementary oligonucleotide is designed from the most unique 5' sequence and used to prevent promoter binding to the coding sequence. To inhibit translation, a complementary oligonucleotide is designed to prevent ribosomal binding to the LIPAP-encoding transcript.
X. Expression of LIPAP
Expression and purification of LIPAP is achieved using bacterial or virus-based expression systems. For expression of LIPAP in bacteria, cDNA is subcloned into an appropriate vector containing an antibiotic resistance gene and an inducible promoter that directs high levels of cDNA
transcription. Examples of such promoters include, but are not limited to, the trp-lac (tac) hybrid promoter and the TS or T7 bacteriophage promoter in conjunction with the lac operator regulatory element. Recombinant vectors are transformed into suitable bacterial hosts, e.g., BL21 (DE3).
Antibiotic resistant bacteria express LIPAP upon induction with isopropyl beta-D-thiogalactopyranoside (IPTG). Expression of LIPAP in eukaryotic cells is achieved by infecting insect or mammalian cell lines with recombinant Auto rg-aphica californica nuclear polyhedrosis virus (AcMNPV), commonly known as baculovirus. The nonessential polyhedrin gene of baculovirus is replaced with cDNA encoding LIPAP by either homologous recombination or bacterial-mediated transposition involving transfer plasmid intermediates. Viral infectivity is maintained and the strong polyhedrin promoter drives high levels of cDNA transcription. Recombinant baculovirus is used to infect Spodoptera frugiperda (Sf9) insect cells in most cases, or human hepatocytes, in some cases.
Infection of the latter requires additional genetic modifications to baculovirus. (See Engelhard, E.K. et WO 00/49043 CA 02362300 2001-07-31 pCT~JS00/04160 al. (1994) Proc. Natl. Acad. Sci. USA 91:3224-3227; Sandig, V. et al. (1996) Hum. Gene Ther. 7:1937-1945.) In most expression systems, LIPAP is synthesized as a fusion protein with, e.g., glutathione S-transferase (GST) or a peptide epitope tag, such as FLAG or 6-His, permitting rapid, single-step, affinity-based purification of recombinant fusion protein from crude cell lysates. GST, a 26-kilodalton enzyme from Schistosoma japonicum, enables the purification of fusion proteins on immobilized glutathione under conditions that maintain protein activity and antigenicity (Amersham Pharmacia Biotech). Following purification, the GST moiety can be proteolytically cleaved from LIPAP at specifically engineered sites. FLAG, an 8-amino acid peptide, enables immunoaffinity purification using commercially available monoclonal and polyclonal anti-FLAG antibodies (Eastman Kodak). 6-His, a stretch of six consecutive histidine residues, enables purification on metal-chelate resins (QIAGEN). Methods for protein expression and purification are discussed in Ausubel (1995, supra, ch.
10 and 16). Purified LIPAP obtained by these methods can be used directly in the following activity assay.
IS XI. Demonstration of LIPAP Activity Selected candidate lipid molecules, such as C4 sterols, oxysterol, apolipoprotein E, and phospholipids, are arrayed in the wells of a multi-well plate. LIPAP, or biologically active fragments thereof, are labeled with'Z5I Bolton-Hunter reagent. (See, e.g., Bolton A.E.
and W.M. Hunter (1973) Biochem. J. 133:529-539.) The selected candidate lipid molecules are incubated with the labeled LIPAP and washed. Any wells with labeled LIPAP complex are assayed. Data obtained using different concentrations of LIPAP are used to calculate values for the number, affinity, and association of LIPAP with the candidate molecules. Significant binding of LIPAP to the candidate lipid molecules is indicative of LIPAP activity.
XII. Functional Assays LIPAP function is assessed by expressing the sequences encoding LIPAP at physiologically elevated levels in mammalian cell culture systems. cDNA is subcloned into a mammalian expression vector containing a strong promoter that drives high levels of cDNA
expression. Vectors of choice include pCMV SPORT (Life Technologies) and pCR3.1 (Invitrogen, Carlsbad CA), both of which contain the cytomegalovirus promoter. 5-10 ~g of recombinant vector are transiently transfected into a human cell line, for example, an endothelial or hematopoietic cell line, using either liposome formulations or electroporation. I-2 ,ug of an additional plasmid containing sequences encoding a marker protein are co-transfected. Expression of a marker protein provides a means to distinguish transfected cells from nontransfected cells and is a reliable predictor of cDNA expression from the recombinant vector. Marker proteins of choice include, e.g., Green Fluorescent Protein (GFP;
Clontech), CD64, or a CD64-GFP fusion protein. Flow cytometry (FCM), an automated, laser optics-based technique, is used to identify transfected cells expressing GFP or CD64-GFP and to evaluate the apoptotic state of the cells and other cellular properties. FCM detects and quantifies the uptake of fluorescent molecules that diagnose events preceding or coincident with cell death. These events include changes in nuclear DNA content as measured by staining of DNA with propidium iodide;
changes in cell size and granularity as measured by forward light scatter and 90 degree side light scatter; down-regulation of DNA synthesis as measured by decrease in bromodeoxyuridine uptake;
alterations in expression of cell surface and intracellular proteins as measured by reactivity with specific antibodies; and alterations in plasma membrane composition as measured by the binding of fluorescein-conjugated Annexin V protein to the cell surface. Methods in flow cytometry are discussed in Ormerod, M.G. ( 1994) Flow Cytometry, Oxford, New York NY.
The influence of LIPAP on gene expression can be assessed using highly purified populations of cells transfected with sequences encoding LIPAP and either CD64 or CD64-GFP. CD64 and CD64-GFP are expressed on the surface of transfected cells and bind to conserved regions of human immunoglobulin G (IgG). Transfected cells are efficiently separated from nontransfected cells using magnetic beads coated with either human IgG or antibody against CD64 (DYNAL, Lake Success NY).
mRNA can be purified from the cells using methods well known by those of skill in the art. Expression of mRNA encoding LIPAP and other genes of interest can be analyzed by northern analysis or microarray techniques.
XIII. Production of LIPAP Specific Antibodies LIPAP substantially purified using polyacrylamide gel electrophoresis (PAGE;
see, e.g., Harrington, M.G. (1990) Methods Enzymol. 182:488-495), or other purification techniques, is used to immunize rabbits and to produce antibodies using standard protocols.
Alternatively, the LIPAP amino acid sequence is analyzed using LASERGENE
software (DNASTAR) to determine regions of high immunogenicity, and a corresponding oligopeptide is synthesized and used to raise antibodies by means known to those of skill in the art. Methods for selection of appropriate epitopes, such as those near the C-terminus or in hydrophilic regions are well described in the art. (See, e.g., Ausubel, 1995, supra, ch. 11.) Typically, oligopeptides of about 15 residues in length are synthesized using an ABI 431A
peptide synthesizer (Perkin-Elmer) using fmoc-chemistry and coupled to KL,H
(Sigma-Aldrich, St.
Louis MO) by reaction with N-maleimidobenzoyl-N-hydroxysuccinimide ester (MBS) to increase immunogenicity. (See, e.g., Ausubel, 1995, supra.) Rabbits are immunized with the oligopeptide-KL,H
complex in complete Freund's adjuvant. Resulting antisera are tested for antipeptide and anti-LIPAP
activity by, for example, binding the peptide or LIPAP to a substrate, blocking with 1 % BSA, reacting with rabbit antisera, washing, and reacting with radio-iodinated goat anti-rabbit IgG.
WO 00/49043 CA 02362300 2001-07-31 pCT~S00/04160 XIV. Purification of Naturally Occurring LIPAP Using Specific Antibodies Naturally occurring or recombinant LIPAP is substantially purified by immunoaffinity chromatography using antibodies specific for LIPAP. An immunoaffinity column is constructed by covalently coupling anti-LIPAP antibody to an activated chromatographic resin, such as CNBr-activated SEPHAROSE (Amersham Pharmacia Biotech). After the coupling, the resin is blocked and washed according to the manufacturer's instructions.
Media containing LIPAP are passed over the immunoaffinity column, and the column is washed under conditions that allow the preferential absorbance of LIl'AP
(e.g., high ionic strength buffers in the presence of detergent). The column is eluted under conditions that disrupt antibody/LIPAP binding (e.g., a buffer of pH 2 to pH 3, or a high concentration of a chaotrope, such as urea or thiocyanate ion), and LIPAP is collected.
Various modifications and variations of the described methods and systems of the invention will be apparent to those skilled in the art without departing from the scope and spirit of the invention.
Although the invention has been described in connection with certain embodiments, it should be understood that the invention as claimed should not be unduly limited to such specific embodiments.
Indeed, various modifications of the described modes for carrying out the invention which are obvious to those skilled in molecular biology or related fields are intended to be within the scope of the following claims.
WO 00/49043 CA 02362300 2001-07-31 pCT~S00/04160 ~, .; - -- o ~r . --M En O ~. ri O
N O O E-~ H O E-~
o H z ~ off o r-~ o w r~ r~ o o ~ m ~ ~ ~ N
- - H q o U - o cx t7 a a7 rx ~n ~o ~ ~n ---E-I W H ~ H W
W O W O ~O Gc, H O ~O
O O rI ~-M N
--wr-~wHx .o~, -- xr-~x ~ ~ r.~
m o mn ~ E-r ,-i o C~,x o ,~ ~r o y ~ N x M M ~ ~ mn oo U o0 E-m, ~ m [~
~ n o o Cc, M N U1 N
x m cx o l0 ow 0~
H
tf1 L!1 l0 L~ N l0 01 - O
Op M H H ~O
W 01 ~
O Lfl N L~ I~ IIW l~ lf1.-. LC1 x w-7 al D t~
N Pa l0 OW -i rl O rl N lI1M N
N v tf1 Ul N "
'~' CO N M tI1 H O
~
v0 v-i - ,--~ . [-~ -- - v0 - ,--~
l0 -,.. ..- - - ., ~ .
H ,.. x - N - N ~ - --. L~ -~ ~ N
l0 N L~ e-i N ~ ~O
H
!-. ~ O w-i -~ rl O M O
O (~ O rl O
O rl rl O
,~ -, yo o ,~ o E-~ o cn E-~
E-~ E-~ (-a E-mo z ~
E-m o~~ Nz~~ off o oz z ' o H~ ~ E ~ z~ ,-oz a x~v~ ~z~ zN~~
H
z~ zo r~ aoa oHrx rx ~H ~ w H
v wo aU -m z~~ w oa xw o r~
-x oa -(> H ~ W ~ E-~N
w' '~ '-' Y~ " '-' '.
v N
~ ~, x ~, ,- x -- ~ -- ~
~, m (a v ri ~- H H v H l0 H M l0 O O r-I N O O O
l0 xr-~r.~r~rlx ~xo Mxr~ HN ~oxx Nr C~m o ~r ,-, x ,~ ,~ a~ C7.~.o u, o N ~ ~r o ~ o u, M co r-~
o pyo M x x d~ N d~ ~
fx E-I ~ ~0 z ~
M CO H E-~
~ W -i 01 rl rl Ov N 10 O
O L(1 d~ M 01 ~] rl W
cY A.i f.Y] Ca N
o~ M m M t~ o, o 00 ~r ~
o~ ~ ,~ u, ~o 0 ~ z u, v1 r.~
H H M h V~ Lll N O O V~
W H rl H CO ~ N O
M ~D
,~
H M ~O ~ 1f1 ~ 01 rl C1~O M
x ,~ ~ [~ z M W
M H - (Y]
l0 01 H ~ 00 ~O N C~ H N
N ~ M V~ N 'J '-' Lf1H
'-' H
~
e-I Ov N O N H
Ca -,--~ -10 -CO -ri -l0 -10 -r-I
- -- ~ _ ,.- - - . ~.- - ,.. ,..
x w ~ ~ ., ~., ~ x ~ ~o x -~ ~ N ~ ~ .-1 00 ~ M N
V~ N H 00 l0 c-1 N 1p L~ d' L!1 v-1 c-i r1 rl N O M O r-I
O O (~ O rl tf1 O O Lf1 M O M \O
M
o E-mno E-, o 0 o E-moo H H
E-~ ~o H N N E-~o ~r C~ o W O H O E-~ H E-~ ~ E-~O ~
M N L~ O O ~ ~ ~ rC
O r.~
Z H a ~I H E-~ En ~ z fl.~ ~ U E~ c o t~ W u N ~z U n C7 ~ '~ 7 m m o '''~ H N o z M
w~ zo wx ~ ~ ~ ~ ~
a x a Ll O U W C4 H W G: Gq W
P~l - E-~ E-i CL - 0.~- -- - -~7 ~C1 u7 cn _. ~q .r ..
v ~- -- v ~ ~.
,~ ~ , N c0 Ll N N CL Lfl ,~,~W O N .-I rl l0 r-1 l0 l0 c-I lD
C .-1 O t0 rl O O~ O
rl N GL rl -I c-1 CLn ri x Ci.i R; [~ x C~ (~ Ct.,CO
H W IW H
x x O E~ N Cc~ Cz~ N Cid~D l0 ~ ~ O N pG O O M
N lI1 01 O
O L(1 W -i Lf1 V~ N d' N ~ d~
C1 l0 LC1 l0 C~ O E-~
O [-a O
Ll1 Ov M ~ H ~ ~ (~ l0 rl O E-~ 01 ~ e-i ~ M O W
H O O M
~
u-1 ~ ~ rl N l0 rl H V~ rl N U7 N Lf1 ~l l0 ~ (Y., L~ l0 FC U1 ~
~-i N L~ CO I~ l~ CO CO O rl l0 0: ~ d~ O M Ca d~ d' H L~
l0 Op ~ ~ N M M M O CO
01 (Y~ [~ M W U N (Y~ FI,' (.Ta O (Y~
O
r-i O1 rl tf1 O N rl l0 .-i rl e-1 " N M .-I -~ N U~
-~ U7 v V~
.--~ M ri N N ~D M t~
o H H H H z H H
0 o o ~ o z z r.~ W PO U f~ CO U
tI1 M O ~D U'1l0 H
L~ 01 CO 111 d~ ~ ul H lfl M 01 M H l0 00 O N ~ l0 CO d' d~ N
r-1 H oW f1 l0 l~ N O O
H
U ~ N ~r c~ ~ M o o~
v-1 N N N tIlH H
'~
O
~~
z M ~ L!1 l0 l~ CO 01 O
H W -i ~-I u-I rl ~-ic-i N
r1 U
Ot ~
W
z ~n v..
~o ~.~
z p, Ca N H N M d~ Ill l0 l~ CO
H
ri I
w O
WO 00/49043 CA 02362300 2001-07-31 pCT~S00/04160 t~ ~ ,~
o N ~ o E, E-~ E-m n O
O - O O
H H U
N R i ~'r ~
U ~ ~ O frl U1 ~--- N z -- --N N
~ o a ~ o .oxN goo HxN
E-~ a~ C~ Cx, N 0.n O~ V~
O O
N 00 t~ M H O
~ Z, z O
tf1 N ~ ~ CO 00 00 r7 l0 c-1 Lh [~
Lf1 M O
U1 z O 00 _ ~ M 00 Lfl IYu H l0 O r-1 ~ O 01 M
/Tj ~ L~ [~
CO N U Lt1 I~
N N N V~
V v H O H M
O
. ~o E-, . ~o ~
F, -, W ~-, . ~ _ W E., M f-~ ~ l0 ~
H N e-I
G-~
l0 z N M O CO
O ~.,' [~ O
M
O E-1 a O O E-1 O
M ,'N L~ C/~
Nom o~ox Hmn Ho ~N
"
z~~ ~~ H~
z N G1 U
N oN
o xa a~
~
Pa U .-. H -- Cq ~ U ~ r ~
N . H ~.
. N
(a ~O l0 l0 10 H O O
lp ~ E-~ E E-~ ,-i ~
E-~ E-~ Ea H
Ei w r~~o ~o~ wMo xo a O H M L~
~7 I~ 00 O N M
M O z Lf1 U7 ~
r-I Lll O IIl N
N W H M
(Y., M
Lf1 ~ z ~ Lll O
H 01 U7 l~
~
Ltl H ~ ~ N 00 N N '-' M
"
r-I .~ N
_ H _ .~x~ ~x~ _~w~ _ ~ t~ N H -. ~
M N d~ H
H N
H
M O O N ~-~-I rl M O H O
O ~
O
o E-~ E-~ o H o M o t~ W
E-~ H E, H O W w G~7 E'~
d~ W O O
H Ln W
oz~o wow oz~w oz ~ ~ H
H
U
H V~ a N N H
N O U
x ~ ~ a ~
- - w _C c n -, m -- x -- oo ~ .r ~ .-~ ~ ..
v l0 t--i ~ M
~O H O ~O H
N lp H
H H
~owHx wHw HNHx Hx xHoo ~o~ xMo~ xo ~~z~ ~zM ~,~z ~~o o M o, M a ~ N
v o N m t~
o d~
d~ H O I~ H
M l0 M Lf1 l0 Lll H N e-i O 01 H
(Y., (3a a., M
H M M
l0 M z 01 Lf1 H ~ L(1 L
~ f~
l0 M
l0 H ~ O e-I LC1 N N '-~ M
'~ M
V~
N H H v-i H O O O
H H H
t~ O O W O
z z w z ,R E~ a H U
u7 O
a rx w z M r-I ~ O
dl M N O 00 M O do [~
O V~ 00 L!1 Lfl H
r-i 10 O1 M M
U I~ t~ M t N N M M
'~
O
-~
z O ~-I N M V~
Ca H
r1 U
O~
~
W
z~
~s o -~
z O H N
r-i r-I r-i .H
W
O
W
WO 00/49043 CA 02362300 2001-07-31 pCT~jS00/04160 u7 fx U
-~ pa W
U
f~
~ U7 U7 V7 C/1 rl U7 U7 O
(~
~ H w xH a,H w~ xH x H
1.W U) H U ~ U7 H w U U7 U U7 (S ~7 ~ r.>r H ~ H FC H o ~ O r.>r O r->r Ca O FC
~z a owaa as o s~waa a a oa ~a,cor~rim ~wa,mm m cn a~
bl M ~,' ?f LT
1.', a -~, r1 ~-1 .T", 1J
N
O -~ t11 S-I O t5~ v w ~ ''C N t~ ~ '~ U -rl ~-I
1.1 ri .j", Ul t~ -rl >_.,'U
L!1 N
f~ N -r-I r-i ri 'CS ri ,x O
U ~ ,~ M 7., ~ t"., ~ U U1 (O U
-rf O I t,'Jl'>'f Z71 rl I r-I
~ r-I Ul ~
w ~ r1 -- -ri ~ .R N -, r-I C1~
CO (IS
'L3 c0 w fJ, O a Ln ~ I co O I I
u1 O d~
1J O S-1 (a 1J ~"" w-i ~-I ~'., O ~'., O S=', ~ ~-1 .rr C~ N ri ~', r-1 r.,' ri M ~ G~ Ul ~ ,Q ri C7.~
e--I rl -rl 1J 4l ~ SV O 4l 4J c~ 17 (O (a N H N In 'L3 H U7 .U U7 ~I ~ 1.~ ~ N ~ N
~ rd aJ N N
ov H O O O U N ~ O N O N U7 N .-n f~
N
C~ ~-1 ~'.. 11 r-1 S-1 i31 ~-I -r-I
b1 ~ ri ~1 4) -- o ~, w ~ v~ a~ a, -- o z Ts -- t~ tn --N M
U W fsa S~ ~ I
I
~
.p.cW
o .,.
tT' .~ ~ N
M
N ~
4) ,~ ~ (a .. ~ ~
N
~.1 ~ N .~ ~ ~ I
I~
M
r-I -r-I .,..I ~ H
C~ c"I r-I
M
-4~ ~, oq o ~xz ~
_ c z S-I ~i H ~-1 N
o >~ w ~ a o ~, _ -N
[~ M
-~ N d' z ~, z z .~
v >~
~ ~ O~ N O
~
N Z
-r1 o Z z z ~n w ~' ,~ ~ u, , 0 H .-1 H N N L~ N t z z zzz z z 111 ~ c0 N
t11 c0 rl \O N O
~., l~ M 00 H rl d~ Ov M 00 e-1 CO M
r-i d~ M N
y -I cn ~ u1 cn amo ~r Lt1 co H t~
Ul r-i d~ ,-I u1 ~r ~ In O M M N N M Ll1 ~O C~ rl V~ M
CO N ~O
-~ ~ M M H U7 00 N U7 U7 U7 N N H
U7 H ~ U7 U7 00 ~
r-I I~ V~ U1 t11 M H V7 fl V7 1~ N ow a ~ '~ CO U7 rl lp I~ ~ ~ U1 M
H OO L~ 01 CIA CO
~
ri H U1 00 ~ N Ul lD M e-1 w1 [~ O ~
r-i O ~ O
v Ul C~ M M lIl N M Lll l0 O O V~
M I~ O~ N l0 l0 M
~ ~ O rl U7 N M Ul H Cll U7 N II1 ~ Ll1 U7 V7 CO U7 U7 II1 H N
N ~ t~ H N a, u~ H u1 cn O ~
ri ~ d~ U7 00 U7 U7 d~ l0 l ~ O t!1 M 00 O H 01 u~
O H 00 l0 ~ N l0 00 OWf1 h d~ 111 ~ O C~ N O N l~ lD
L(l W M 01 M N N CO 111 N M ~ L!1 ~ rl N
U7 r-I 10 v-i N ~O N M O
O ~ H cn ~o m u~ v1 cn ,-I H H
u~ ~n u1 v~ u~ u~ v1 ,~ ~
?~ ~
M u1 H cn u~ H
cn H v-W o0 O c0 O~ M ~O rl N N
t~ O 00 ~ t11 N ~O
c0 CO N O\ CO Lh CO M r1 CO L~ O ~
IIl M M l0 00 ~ CO N N l0 l0 1D
oW rl M r-i LIl ~' N M M tf1 I~ V~ N
-I M N l0 L~ rl tll ~ d~
N lO N
H H H ~n ~n H ~n H H v1 H ~n ~n H cn ~n u1 u1 cn H H ~n ~n H ~
N
o a, o, ~0 0 ri M CO O tI1 N O
U
~
~ M ~ V~ l~ N Ll1 ~
a q e-I N M ~ t17 0 Z
CA 02362300 2001-07-31 PCT~js00/04160 ~o ~ x~ ~ x '~ s~ cn N L1 to U O ~d U ~d O ~ O
W O m ~ O Ll W ~ O O Pa G1 W ~ f~
u1 ~ G: ~ FC a O >~ a ~ ~ O ~ r~ O
-''i m ~ C W ~ Cue, m CG N f~ Pa O N rx N f.~ rx C7 I U ~ ~ I f.~ C7 0.. I G~ C7 Pa C~ ~ Pa o (a V1 I U7 U7 I I (~, U1 I U1 I I U7 I U7 I I U7 I I I
~ f~HP.~ Cl.~H~ N0.~H f~H~~H C>~HH C~HoGH
1J (a H U7 ~'"r H U7 W Pr ~i U1 H U1 W fir' U1 H V7 V7 H U7 W U7 N L1 H r-C H H FC ~ H FC H rC H ~ H FC FC H r.C FC
oaa oa~~Haa oa as oaa oa~a ~mm ~cox~noar~ ~r~ cam ~r~r~ ~r~xa~
~"., r-I M W Gi M
'1~ ~ F.i 11 J-1 l0 ~r-I fri ~ '~ ~.i H N ~ri 00 N 'rl ~ri 1-1 U ~ rl i.i ,fZ ~ L G~ O
bt O N ~ I tJ~ O '~ wl Sa ~
w ~-i -- Sa V~ ~-i -- ~ N U1 I~ ~H f.~
-H O GL S-I M O f~ U1 -rl V~ H O tn 1J S-1 N O O LI1 ~.1 .(", U1 U7 ~ ~'., b1 N
..f"., ~ U1 ~, J-~ V~ ~ -H fx Ol O Lll N ø, ~-, r'-I
1J (a 'H ~ M 1.~ ~ rl ~-I 11 01 .f-', U7 -rl 01 U7 '~ .-i ~ CO U1 L I ~, (a 01 -rl O '~ I~
H -H o U~ ~,o a ~~N p,,~ ~~H
x ua~tT xs.~ a~~ts o~~.~~
U O r.>; ~.I --- O ~ U U1 U7 ~- U 'r .R '-' u1 ~ .. ~ ~ H ~
.H ~ ~ w t~ U1 d' ~ U
U ~ ~ 1.~ ~ ~ .. W .~ 4J ~
1.~ ~ crs ~ O U W ~ ~ ~° ~ rtJ
N v w Sa -~-~ ~ 'LJ ~ x ~ -N u7 'Lf E to N ~ ~ U -~ ~ , f~ 'x .~ w f(f -ri ~ 1~ U ~., Ra 't O ,Il Z37 ~ b1 N ~" M ~ ',~ N
n ~-I s..~ u~ U M ~ -H -H -~I ~ 'Li ' I -H d' ~ t~ .,~ °° tn ~
~°
i..~ ~ ~ ~ ~° N ~ v d' N °° .H '1 ,H cn .-r -~ M ~ '~ ~ o N
S~-. 'L~ N C~ O O 1.1 ~ M U7 ~1 ~ O ~ 2J x y~ Cx~ ~ ~ U~
~.r N ~ a! tv H M ~y O U N a ~ O H ~1 t"., I ~ I O " I O I
S-I I ~ ~ ~ ~-I I ~ I ~ ~ I N -~ ~' _~ .. ~o ~-I ~ ~° a ~
~°
~n ~ O °° ~S I c~ R~ -. d' .,~ ~ .~ ~ N l..i N ~"~ ,q w d' f~ w ~° R, m ~°
V ''~ ~ ~n 't~ d' ~ '-I ,~C o ~ d' U '~ I ~ ~ ~n ~..~ ~ ~ I -~ '~ ,.. ~ '-I
rti o ~~ >' Ls ~ o f.~ ~ ~ ~ '-7 3 U C~ ~ o W GL ~ q ~ ~ ~ G ~C
~, -H x -H s~ -~ -~I a~ o v -- ~ H o N o H .c~ o v~ w .H m a a ~I ~ o ~, ~ ~ U ~a ~n x N
a o a, a, c~ ,-, .~' m o ~~ z ~, M M z t/1 -r-I M r-i M N
~, o ~ ~ z z z a°, j, z ~ M ~r, N z N rl M M
r z zz z z zz tf1 rl V~ N l~ M
O N pp 01 Lfl N O\ Ov 00 H O
~~ N r-i Ov N M N N N N CO rl M V~
r1 JJ H N ~ CO U7 U7 V1 H U1 O U1 U7 U1 ~H
(~
-r-I r-I U1 ~ H l0 00 N Lll 01 U7 Ov CO l0 lyr N ~ '-~ O O~ O r-i O~ ~ V~ N tf1 ~ ~ N l0 O M N rl N N N N rl d' M
,~ N cn H H H ~n cn ~r H u~ cn 1J ~"., Cl~
r-I H r-I L!1 d~ 01 rl 00 01 OWE 00 ~ ,H p Q1 01 N 01 CO N M ~' N 01 N
N ~ Lf1 N N C~ rl N H M N vH N N
pn ~ o, u~ u1 u1 H cn u1 ~ H ~n ~n H
tIl M H H N L(1 M (~ (~ M M M rl O 01 c-I l0 O
l0 dW -I O~ 00 O 01 l0 M M O Lll CO rl Lll rl 01 r1 N CO Iw-I N rl M N M N H H N H N N ~
~n H cn H cn H ~n ~n ?~ H H u1 ~ H H cn ~n v~
N
,~ N N L~ L
pp M N
~ ~ N N
~ Gl z C~ c0 O~ O .-1 x a~
~
a ~
~ wH
xw W
q oa ~
~m M
O (a ~-I
O\
.ri Q) 01 1J -H C~
U Y.~ M
O ~ b1 w 1~ U v +~ ~ ~L ~
rtS H H
v .~ ,~ v .H ~ ~
H .F., 0..
O
. H Ca ~-I
U
O
(d U
U
H Ul w rtJ rtf .rt ~-I ~ a~ ~
'Ci N .,1 H
o ~
~
~
~
tn .H t .~ .H
O Q' ~ a -m +~
~
v ~ .H
.u o cn a ~
~
H N
z M
~i c-i ~-1 1!
~ O H
-H
~"., Uj O , d, H
O
CL
ri N
~ ~
.r-I 01 U .'1 N
N
.. N
H v-i z H En P~ p, U U U U U U
z U
W H W H H H
v a u. a ~ t~ a oa ca w w O O r-I M N M M
H
O M tI1 M ~ H
L(1 ~I1 Lfl l0 l0 U1 O
~ o 0 0 0 0 0 H
"
O .( ~"., ~"., ~'., T.,' ~"., , O H CO h Lfl M
O tm O O M O O M O V~
' M N
[~ .,.~ .,~ .,..I .,..I.,.~ .,..I
O . . . . .
O r v >~ v v ~ v v f~ N ~
S~ ~
w O w w O w w O w O
O O
ri -r-I -ri ri -r-1 r-I
-r-I r-I -r-I -r-1 -r-I -r-I
-~ r~ r~ r~ l~ r~ r~ l~ r~
GT-~l~ 1~ 1.1 l~
H r1 r-A r-I .--1 r-i r-i r-i r~ H ri r-I
~ w ~ ..~ .-~ .H w ~ w w w w C >~ N N J~ v v J~ N ~
~ ~
U H U U H U U H U H
H H
M
c~
o ~
N N
U1 ~ ' ~
O
H M rl M CO ~ O O 01 O CO ~ tI1 h r-1 v v h ~ ~ ~
~-Ici c-f rl Ov c1 N M c-i w cr OO
C~ ~ ' ~ ~ N .-. . QJ v .
O ~' ' . ~ ~ ~
N
O o O L(1 to O .('., ~., rl o o r-i o d~ rl O
dW
-~I
W Ll1 ~ ~,-~ p~ ~ ~ ,~ ~ ~
~.,~ ~ ~ . ~ L!1 .
O N O N N O N ~ N O ~ H
- ,--i. . ,--I . ~ O
~ ,~ .r ~ ~
v O f~ O O .-. , , , ~. I)f O (a H H (a (a fa O ~ O
~.
tn 'r v ~- v0 v ~ ~ '-' w ~
U ~ o~ G ~-I ~ ~..I -~-I ~ c0 M
U1 -'i r-i h -1 -.-i -.-i U -r-I
(a f~ -r-i (~ c-I U (0 (L3 ~
v ~ v v ,~ v ~ -~, -.~
,-~ N ~ r-I ~ v ~-, ~
a ,-, ,-, v .-~
H > m > > u~ > ~n w w ~n w ~ ~ m ~ ~ >
m . ~
-~ -~ -.-I -~ v v -~I N ~
v o o v v U o -~
U v U o U
JJ l~ .~-J 1J 1~ -rl -rl JJ -- ~- 1J 1J UI l~
U7 1J .u ~-- -r 1~
U .T".,U U ~ U ~ O U O ~
rti ~ fa ~ ~ U
(~
O ~
- O
'~ ' '~ ~ 'r~ O O 'Cf O O
O ~ O O O ~ 'L3 O ~ O
O ~-I O O O O W -~ 1~
rl O S-I ?-I O r-I S-a ~-I ~ O O O
.n S-1 ~.1 y.1 ~-1 1.1 !a (~
1J ,' ,' 1.1 ~.d 1J
''~ 1 1J ,7 '~ '~ ~1 T1 '(~
Gz OW f.~ G~ ~n ~ ~ ~
cn -I ~-I ~n f.~ f.~
~-I cn ~n ~ s-~ ~
s..l ~
N t~ v v N N N v N N t0 c(f N N to tt~ ~ v v rti v ~
(x (x (x z cx C7 x x C7 C7 z C7 U c~ U !x U C~ U z z U
h JJ h O N O~ Lf1 O
~, h h O r-W ~ N
O
~"I~ N ao av ~ ri v rl U I I I I ~ I
I I
L71M N ~-i Lf1 L!1 ~
H M O M h ~ h (~ ~
f-1~ N h 00 ~ O
Lf1 C W -I e-i Ca H
M V~ L!1 l0 h 00 w ~, O ~ U U U U U
o z z z z z U ~ H H H H H
'S~'' l0 ~ ~.. M ~. \D ~
O QO ~ N
H I~ ~' CO N O d~ L~ V~ LC1 M W t~
M N M M CO N N lfl N O~ N rl rl N O N N .-~I
O
O O O O O O
Ei v O O --- O v O O
v O ~-O
U ~ S~ >~ >~ S~ f~ G
O
O O O O O O
~ - - ~ - - -~ - ~ ~
0 f.a ~ S-I w S-I t~ ~r S-I
w S-~ w ~
-- E-a -- -- r0 -- -- -- r~
r~ ~ E-~ En N E-~
E-~ r~ H
M \ 00 [~ ~-1 O rl ~.I
S-I ~-I \ ~O ~-I \
\ ~I \ \
rl ~ O N ~ 00 0 M Q) N N S~ >~ 4l 7~ 4) .~'.
t0 M O Ln ~ w N tn N w w w O O w O
O w O
O 11 O O r-1 O O rl r-I rl J~ O 1.J r-1 L
11 rl 1J
~ ~ ~ O O
~
~i ~i ~-1 S-1 .S S
f~ f.~ ~ i1. ~1 f~
~ ~ N N N
U ~ U U ~ U U U .-a .H .H ~I ~ ~ ~
r-I ~ ~
~ w ~ ~ .-i ~ ~ .-a rl r-i w ~ w ~ w w ~ w r~ s~ ~ d N ~ rtS t~ N
v v !~ s~ N s~
W ~ !~
U H U U U U U U U
U U H H U H
H U H
U
M
U
a ~r N t11 -r-I'~ c-i JJ
O O Ll1 lO ~D O
u~
Ei ~ ~ a wr -, -.
M
N O ~-i V~ ~ N r-I
S-I rl w _ y -i ~ 00 M 4J
O O O rl l0 w-1 rl N >:., j~" O O r-1 O V~
W n '~ ~ eo ~p . . yn W ~ " '.' ~ ~.
O N ~ N N N O O O ~
-i -I ~ r-i i V O r1 r r r O (a O ~ O ~- ~. ~ H
H ~ O v f~ O
(a (a U1 v l~ "' N '~ O o0 ~-i U \ Lfl Lll '-' ~ ~.1 ~..I
)_', Cue. 7-', \ ..
rl U C~ o O f~ c-1 M (a N -rl ri r-1 (a U
ri N 1J N M Ql N M N ri rl r1 ~ r-I r-I rl rl JJ 1J (a 1.1 N
,' U1 'J ,7 r~ ~ ~ JJ
J-~ U7 ~ 1J U7 -~ v -~ o -~-I o o U
v o o -rl o U U v -~
~ N >~ ~
1J 1J 1.W---- 1~ -- ~ -- U1 -rl .U 1~ tJ~ U1 -r-I
~ J..~ ~ J.-~ 1J
U f~ U U U to ~ ~ O
O !~ f~ ~ ~ U
~ -r-I ~ UI ~ U7 U7 U7 ' ~2, U7 ~ ~ G~ ~ C2, U1 -r-I ri .~ ~
'~ O ' ~ '0 ~ ~ ~ O
O ~ ~ '~ O O O O ''d O O O
O S-I O O O O O O -~
t~ O O a ~ w 1~
O S-I ~ Y~ O
~-I ~-I ~ ~ ~ ~ ~ ''~
~ cd 5 S-I TS ''C rtS
' ~ ~ N 1-~ S-v N ~ v v N ~ N c0 N td v 4W 0 N td N N rd N N
t~
~xc~xz xzc~ z~C7 cxzULl zUC~ zUxo4 m 1.! l11 N l0 c0 M
.$-'.,N ~O N a~ t~
M N M O~ V~ N
I I I I I I
~J1 r-I CO 01 Ul d~ O~
H 00 e-1 [~ Ln M H
(Ij v N N N ~ N
S-I
~
C~
Ca H
01 O rl N M
Qt e-i N N N N N
z W
WO 00/49043 CA 02362300 2001-07-31 pCT/jjs00/04160 v >~ w b1 ~ N ~
w j ~ ,~ -I ''L~ ~ ~., o -rl r-I Lll o rl ?d .-I
M N ~Y o O 11 (~ 1~ - ~-I fa ~1 ,~ ~n w 0 0 w w ~ ~ v ~ 0.
r6 ~ O ~ x ~
N -~I v ,~ ~ o N '~
X ~o w v tff v 'L7 ~ w U U O O ~I v v ,~ O ~ v o~ to Q y.~ ~ v rl U1 U rl fa ~-1 w 3 ~.I ~ ~.I ,S'.., .i.~ ov 1J U7 v O -~ m ~ O ~n N ~ ~ w p, ,~
v -~I C2, w N ?r N v rl cn o ,~ -~ N w cn Wo c0 -3 ~
rl ~ >~ w ~ w w v U f~ 'L3 ''C3 ~ N ~ ~ m N ''C
u1 U7 JJ v f~ ~-1 r-I ?W v t~ f(i X S.~ ~-i 'C~
'~
u1 u1 ~I W ~ ~ N t.m-i 1.i ~ ~, N U
N
~, -~I ~ '~ O ~-i O .~ U v U7 w U .-I U7 (~
w w b~ ~ N v ~ ~ S-i 1~ ~ u1 1>
to N (6 .R >~ +~ O
U
~ -~I N ~ ?,'L3O w .~ ~I S~ r~ w U ~Q O v S~
'a~ ~ -~I U ~ +~ U ~ U w -.~ w 3 c~ -~I
U ~ w N
-r-IO ~-I U7 U7 v O ~ S3W U7 1~ O
O ~-i U7 ,.C, O ~ U7 r-i W 6 ~ )~ N ,~ >~ FC '~
+~ rti U O -rl 'LS
'~ y r-i .~". b1 ' ~
U7 ~"., ~ U U ~ O ~ N '~
N w ~ wi ,-~I -ri N rl U -~-I ~
N U U7 Si r~ O
'L3 N .t~ rl 1~ O '~ U1 ~ ~ '~ O >~
O U ,~
ca U '-I N t4 ?, .~'., U N U1 b1 U
U1 O ~-I O I
(fS
.-i ,~ rt3 N N ~ ~ U 0 ~ O P~l ra 1~ Si ''W c0 rd ~
~I
zs >~ w v U ~ b~ w 3 ~ ~ b~
~ ~ o -~I ~ ~
w o v o ~ ~w m C~>,~ ~ ~ 3 r~ ~
~ s~ 'd ~2 m ~ -~ .-~ ?W -rd r-I -.~ w ~
r-I ~ >~ -~ +~ v ~.I S-m~ - u1 r~ R~ ~ f~ ~ ra ~.I
~
v rl ~ ~I v ''~ ~3 ~ ~ ~ ~ w m ~
rtt cn o .~ v ~ o v f.~ v w ~ m ~ v ~I v ~ v cz, sa cn ~
m ~ v ~I
.(~',(~ U U1 1.1 U1 ~ ~ U7 . 1.1 !n (a (a ~., N rl J..1 N
(d N
~
w b~ -rl (0 U rtS N ~ 'a-W-~ N 'Lj U ~-I ~ N d N O
~
O
r~ ~ N ~ v ~ U7 '~ ~ ~., w ~ w ri rl -~ ~ a, rl O cn ~
f~
N ~ (a ~ tl7 U7 S~ O 1~ S-i r-I ''Cj ~ ri c~ O -r-I rt ~ O ~ N ra v O
U 1~ N 10 > >~ r0 .~ ~r rl 'L3 ''O
w >~ t~ 2S 7.1 'LS ~ 1~ ~ ~ r-I
b1-~ ~ f~ ~ v ~ ~.t o~
~
v ''~ ~d >r -~ ~ -~ ''O U1 ,~ cd ~ w w ~ ~--I~ ~ ~ ~ S-~ ?i U1 -~ ~
rd 4l !-I v .-I ,~., O FC
0 0 o o o rt o ~ rl ~ ~I ~ $ ~n bW ,~ v v u1 -~ t6 0 ~ w - rt w ~
~
U w w ~I w w N w w ~ c~ c~ U O ~I rl .
~ ~I ~
~
~ t6 ~ N v N ~ ~ N w cn ~ ~
>~ ~ ~ cn w zS . 23 ~ ~ ~ ~-I ~I cn rd o .~ z3 is cn Q, ~ 3 ~n ~ -~ -~
v ~0 p, N v ~, N N rl v (d v U7 ~d ~-i i-I
U ~ rl ~ O rl U
~, J-J 1.J .~! t~ -r-11.~ ~, T3 ~"~ ,~ 'a-~
N I ri ''~ 1~ U1 U 'L~ U7 ~'., ?d ~ b1 -ca W o rt1 r0 ,~ (~ v o ?~ ,~ o v -~I
1~ ~ ~-I (~ ~ ~ ~ o v .y -i rl ~-i rl U ~ 'a-~ ~ O ~-I 'L3 ~ V~ ~ t'r ~ O v ~ N m -r-i v O O >~ O O (a O O U ~ f~ ~-I ,~ N
O ~ '~ rl w ''C
O
m tn u7 u1 w tn ~ '~ 1md ,~ -~ ~
~-I ~ ~-I ?~ ~-I ra N ~d .u -.i -rl -~ w ~ rl ~ N U -~I w O
U ,Q N u7 'L3 ~ .-i ~ U
O
w ~ rl ~ ?~ -.-i +y v w ~ ~ FC
U R~ ~ 4l a o v ~ ~
o FC r.>rr.>r FC ~n ~ ~ t G) o ~ -~ 3 -~ ~ 3 rl ~ o ca ~ U ~a U o o ~
~ ~ ~I o x ca o N E
v U ~ ~
~ ~
' S-a Ci ~ ~ ~ U ri U ,~ E O ~
w U rl u7 f~ '~ ,~ ~ 'C3 U1 O v w ~
231 lT b~ b1 -rl tT r6 ~ v O ~-I i~
~ O o ~ v W 0 a a S-I
>~ f~ ~ >~ v ~ U rl ~r ~I w O Sa v O~ ~ w ~ ~ ~ O tT z p, 2f to -.-i-.-i -~-i-~-I ~ -~ v 'Cf 1~ S-I N
cn N O ~-I ~ v U in v cn ~ ?W ~ m u~ >J
v ~I u~ r~ z3 v N 3 U
m ~I v rl ~
~
U ~ 'CS ~ ~ ~ ~ ~ b1 !~ v N U ~C
U '~-I~ U -.n ~ i-t rl -i v -~--IO O v U rtf v ~ u~ O
~ U1 ~, N
~ .rs ~a ~d w ~ I ,-I v ~ o v v ~C it ~ ~ o >~ ~
,.~
, ~ a ~ ~' ~ v ro ~
v ~ ~ ~' ~ ~ v ~
u - ~ v ai ,~
. ~ ~ ,~ , I
?, .
~
U U ~ U U (~ U v t~ ~ u1 w ~ ~
b1 O v ,~ p, U7 f~ ~ U7 O
v ~ O ~ ~ -~ ~ ~, N v O w U1 ~I
O ~ w w O ~ N ~ t ~
!-i ?-I ~-I S-a ~.I I G.~ ~ ~ O ,R
,~ '~ v U1 CJ1.~ ~-I ov ..
~ -r-I v .Ll -~ (a ~ w ,il w w c~ w In >~ w fly v ~
H O ~ ~ w ~ ~I o, ~
U cn U7 U1 U1 U7 U U7 N W (a '~ C~, W-i (~ O ''~ 'C~
~'., ~-I
>~ ~ I >~ !~ ~ ~ ~ > ~ ~ ~ >~ v .~
w w ~I v 3 r~ >~
O'~ ONc~U O~ Oc(i(1,(aO(~N~~~v'd vvv--I~
U
U U 7-I U U U U 3 O .-i N C~ O ,~
~ W ~ ~ ~ 3 ~ ~-I
ri w ~ N O !~ ~ O ~ U7 --i v E-~
m O ~-I ~ ~ O
U7 U1 IJl U1 '~ U7 O .~ F~r r-I 'L~
,L',.~'., ~".,~ O U7 O ,.>~ U1 c~ N
r-I U rl ~ to t>~ r0 ~ t6 S-I -~ to >~ w U wl w O U tn >~ U1 w '., ~
3 3 ~ 3 3 0 3 w v ~ ~ E -~ ~n U z3 ~ u~ -~ ~ ~ v ~
N O v I v LT ~ C7 ~ ~ (~ O v ~,' ~ p:
?W ~r.i".y.,w?, ~.I ?~'~ ~ U7 O .!~, ~
.4.1 t~ N ?~ w w r- N
~ S-~ ~I S-W ~ N (a w ~ O ,~ ~
O .1J >~ If '~ ~ ~-I N v 'L3 O N .U
r6 (fi rtS ~ N cti ~ ~ w -~-I O rtf ~I tfS U ~ ~ N ~~I v ~ rl ~ O
~-I ~-I ~I ?-Wr ~ O wl ~ U7 ri S-I
~-I ~ r-a wl x U .aJ w ~-I O
(d v S-a N
,~ ,~ .L2 .~ ~ v ~-I -~ r-i ,CZ
O I 'L3 N ~ v U1 U7 ca >~
, ~ -I ,Il 7-I v ~ w -~I
~ O N O N
a CZw -~ -~ w -r-I ~ , N S-W a N ~ ~I E'i ~ w ~y f v ~l M r rJl U7 ~ w -r-I , C~
~ ~"., ~ S-1 1J
N U ri -~-, U1 ,L,' ri M rl N N
O O O rl O O
o s z ~ H
~ w ~ a ~n a r~
a w w w ~ w ~ w v ~o oz v r-1 M ~ LC1 l0 L~ 00 Ca U ~ ~ r-i c-W -i I--i >~
d ~w ~
tn O
I ,L.,' w ca v r, 1J U1 M ''Lj~ t~ -~ ~ O
~ ~ -O E ~ ~ !~ O O U7 h ~o ~ ,~ O
~ ~
~-1 ~1 O (~ ~-1 ~ (If v ~1 (l1 1~ ~-I
~ r-I ~ O .-.
w w w ~ ~
o ~ ~
w w -r . >~ ~ o~
Selected human cDNA libraries were used to extend the sequence. If more than one extension was necessary or desired, additional or nested sets of primers were designed.
High fidelity amplification was obtained by PCR using methods well known in the art. PCR
was performed in 96-well plates using the PTC-200 thermal cycler (MJ Research, Inc.). The reaction mix contained DNA template, 200 nmol of each primer, reaction buffer containing Mg'+, (NH4)~S04, and (3-mercaptoethanol, Taq DNA polymerase (Amersham Pharmacia Biotech), ELONGASE enzyme (Life Technologies), and Pfu DNA polymerase (Stratagene), with the following parameters for primer pair PCI A and PCI B: Step 1: 94°C, 3 min; Step 2: 94°C, 15 sec;
Step 3: 60°C, 1 min; Step 4: 68°C, 2 min; Step 5: Steps 2, 3, and 4 repeated 20 times; Step 6: 68°C, 5 min;
Step 7: storage at 4°C. In the alternative, the parameters for primer pair T7 and SK+ were as follows: Step 1: 94°C, 3 min; Step 2:
94°C, 15 sec; Step 3: 57°C, 1 min; Step 4: 68°C, 2 min;
Step 5: Steps 2, 3, and 4 repeated 20 times;
Step 6: 68°C, 5 min; Step 7: storage at 4°C.
The concentration of DNA in each well was determined by dispensing 100 pl PICOGREEN
quantitation reagent (0.25% (v/v) PICOGREEN; Molecular Probes, Eugene OR) dissolved in 1X TE
and 0.5 pl of undiluted PCR product into each well of an opaque fluorimeter plate (Corning Costar, Acton MA), allowing the DNA to bind to the reagent. The plate was scanned in a Fluoroskan II
(Labsystems Oy, Helsinki, Finland) to measure the fluorescence of the sample and to quantify the concentration of DNA. A 5 ~1 to 10 ~1 aliquot of the reaction mixture was analyzed by electrophoresis on a 1 % agarose mini-gel to determine which reactions were successful in extending the sequence.
The extended nucleotides were desalted and concentrated, transferred to 384-well plates, digested with CviJI cholera virus endonuclease (Molecular Biology Research, Madison WI), and sonicated or sheared prior to relegation into pUC 18 vector (Amersham Pharmacia Biotech). For shotgun sequencing, the digested nucleotides were separated on low concentration (0.6 to 0.8%) agarose gels, fragments were excised, and agar digested with Agar ACE
(Promega). Extended clones were relegated using T4 ligase (New England Biolabs, Beverly MA) into pUC 18 vector (Amersham Pharmacia Biotech), treated with Pfu DNA polymerase (Stratagene) to fill-in restriction site overhangs, and transfected into competent E. coli cells. Transformed cells were selected on antibiotic-containing media, individual colonies were picked and cultured overnight at 37°C
in 384-well plates in LB/2x carb liquid media.
The cells were lysed, and DNA was amplified by PCR using Taq DNA polymerase (Amersham Pharmacia Biotech) and Pfu DNA polymerase (Stratagene) with the following parameters:
Step I: 94°C, 3 min; Step 2: 94°C, 15 sec; Step 3: 60°C, I min; Step 4: 72°C, 2 min; Step 5: steps 2, 3, and 4 repeated 29 times; Step 6: 72°C, 5 min; Step 7: storage at 4°C. DNA was quantified by PICOGREEN reagent (Molecular Probes) as described above. Samples with low DNA
recoveries were reamplified using the same conditions as described above. Samples were diluted with 20%
dimethysulfoxide (1:2, v/v), and sequenced using DYENAMIC energy transfer sequencing primers and the DYENAMIC DIRECT kit (Amersham Pharmacia Biotech) or the ABI PRISM BIGDYE
Terminator cycle sequencing ready reaction kit (Perkin-Elmer).
In like manner, the nucleotide sequences of SEQ ID N0:13-24 are used to obtain 5' regulatory sequences using the procedure above, oligonucleotides designed for such extension, and an appropriate genomic library.
VII. Labeling and Use of Individual Hybridization Probes Hybridization probes derived from SEQ ID N0:13-24 are employed to screen cDNAs, genomic DNAs, or mRNAs. Although the labeling of oligonucleotides, consisting of about 20 base pairs, is specifically described, essentially the same procedure is used with larger nucleotide fragments.
Oligonucleotides are designed using state-of-the-art software such as OLIGO
4.06 software (National Biosciences) and labeled by combining 50 pmol of each oligomer, 250 ,uCi of [y-3zP] adenosine triphosphate (Amersham Pharmacia Biotech), and T4 polynucleotide kinase (DuPont NEN, Boston MA). The labeled oligonucleotides are substantially purified using a SEPHADEX
G-25 superfine size exclusion dextran bead column (Amersham Pharmacia Biotech). An aliquot containing 10' counts per minute of the labeled probe is used in a typical membrane-based hybridization analysis of human genomic DNA digested with one of the following endonucleases: Ase I, Bgl II, Eco RI, Pst I, Xba I, or Pvu II (DuPont NEN).
The DNA from each digest is fractionated on a 0.7% agarose gel and transferred to nylon membranes (Nytran Plus, Schleicher & Schuell, Durham NH). Hybridization is carried out for 16 hours at 40°C. To remove nonspecific signals, blots are sequentially washed at room temperature under conditions of up to, for example, 0.1 x saline sodium citrate and 0.5% sodium dodecyl sulfate.
Hybridization patterns are visualized using autoradiography or an alternative imaging means and compared.
VIII. Microarrays A chemical coupling procedure and an ink jet device can be used to synthesize array elements on the surface of a substrate. (See, e.g., Baldeschweiler, supra.) An array analogous to a dot or slot blot may also be used to arrange and link elements to the surface of a substrate using thermal, UV, chemical, or mechanical bonding procedures. A typical array may be produced by hand or using available methods and machines and contain any appropriate number of elements.
After hybridization, nonhybridized probes are removed and a scanner used to determine the levels and patterns of fluorescence. The degree of complementarity and the relative abundance of each probe which hybridizes to an element on the microarray may be assessed through analysis of the scanned images.
Full-length cDNAs, Expressed Sequence Tags (ESTs), or fragments thereof may comprise the elements of the microarray. Fragments suitable for hybridization can be selected using software well known in the art such as LASERGENE software (DNASTAR). Full-length cDNAs, ESTs, or fragments thereof corresponding to one of the nucleotide sequences of the present invention, or selected at random from a cDNA library relevant to the present invention, are arranged on an appropriate substrate, e.g., a glass slide. The cDNA is fixed to the slide using, e.g., UV
cross-linking followed by thermal and chemical treatments and subsequent drying. (See, e.g., Schena, M.
et al. (1995) Science 270:467-470; Shalom D. et al. ( 1996) Genome Res. 6:639-645.) Fluorescent probes are prepared and used for hybridization to the elements on the substrate. The substrate is analyzed by procedures described above.
IX. Complementary Polynucleotides Sequences complementary to the LIPAP-encoding sequences, or any parts thereof, are used to detect, decrease, or inhibit expression of naturally occurring LIPAP. Although use of oligonucleotides comprising from about 15 to 30 base pairs is described, essentially the same procedure is used with smaller or with larger sequence fragments. Appropriate oligonucleotides are designed using OLIGO
4.06 software (National Biosciences) and the coding sequence of LIPAP. To inhibit transcription, a complementary oligonucleotide is designed from the most unique 5' sequence and used to prevent promoter binding to the coding sequence. To inhibit translation, a complementary oligonucleotide is designed to prevent ribosomal binding to the LIPAP-encoding transcript.
X. Expression of LIPAP
Expression and purification of LIPAP is achieved using bacterial or virus-based expression systems. For expression of LIPAP in bacteria, cDNA is subcloned into an appropriate vector containing an antibiotic resistance gene and an inducible promoter that directs high levels of cDNA
transcription. Examples of such promoters include, but are not limited to, the trp-lac (tac) hybrid promoter and the TS or T7 bacteriophage promoter in conjunction with the lac operator regulatory element. Recombinant vectors are transformed into suitable bacterial hosts, e.g., BL21 (DE3).
Antibiotic resistant bacteria express LIPAP upon induction with isopropyl beta-D-thiogalactopyranoside (IPTG). Expression of LIPAP in eukaryotic cells is achieved by infecting insect or mammalian cell lines with recombinant Auto rg-aphica californica nuclear polyhedrosis virus (AcMNPV), commonly known as baculovirus. The nonessential polyhedrin gene of baculovirus is replaced with cDNA encoding LIPAP by either homologous recombination or bacterial-mediated transposition involving transfer plasmid intermediates. Viral infectivity is maintained and the strong polyhedrin promoter drives high levels of cDNA transcription. Recombinant baculovirus is used to infect Spodoptera frugiperda (Sf9) insect cells in most cases, or human hepatocytes, in some cases.
Infection of the latter requires additional genetic modifications to baculovirus. (See Engelhard, E.K. et WO 00/49043 CA 02362300 2001-07-31 pCT~JS00/04160 al. (1994) Proc. Natl. Acad. Sci. USA 91:3224-3227; Sandig, V. et al. (1996) Hum. Gene Ther. 7:1937-1945.) In most expression systems, LIPAP is synthesized as a fusion protein with, e.g., glutathione S-transferase (GST) or a peptide epitope tag, such as FLAG or 6-His, permitting rapid, single-step, affinity-based purification of recombinant fusion protein from crude cell lysates. GST, a 26-kilodalton enzyme from Schistosoma japonicum, enables the purification of fusion proteins on immobilized glutathione under conditions that maintain protein activity and antigenicity (Amersham Pharmacia Biotech). Following purification, the GST moiety can be proteolytically cleaved from LIPAP at specifically engineered sites. FLAG, an 8-amino acid peptide, enables immunoaffinity purification using commercially available monoclonal and polyclonal anti-FLAG antibodies (Eastman Kodak). 6-His, a stretch of six consecutive histidine residues, enables purification on metal-chelate resins (QIAGEN). Methods for protein expression and purification are discussed in Ausubel (1995, supra, ch.
10 and 16). Purified LIPAP obtained by these methods can be used directly in the following activity assay.
IS XI. Demonstration of LIPAP Activity Selected candidate lipid molecules, such as C4 sterols, oxysterol, apolipoprotein E, and phospholipids, are arrayed in the wells of a multi-well plate. LIPAP, or biologically active fragments thereof, are labeled with'Z5I Bolton-Hunter reagent. (See, e.g., Bolton A.E.
and W.M. Hunter (1973) Biochem. J. 133:529-539.) The selected candidate lipid molecules are incubated with the labeled LIPAP and washed. Any wells with labeled LIPAP complex are assayed. Data obtained using different concentrations of LIPAP are used to calculate values for the number, affinity, and association of LIPAP with the candidate molecules. Significant binding of LIPAP to the candidate lipid molecules is indicative of LIPAP activity.
XII. Functional Assays LIPAP function is assessed by expressing the sequences encoding LIPAP at physiologically elevated levels in mammalian cell culture systems. cDNA is subcloned into a mammalian expression vector containing a strong promoter that drives high levels of cDNA
expression. Vectors of choice include pCMV SPORT (Life Technologies) and pCR3.1 (Invitrogen, Carlsbad CA), both of which contain the cytomegalovirus promoter. 5-10 ~g of recombinant vector are transiently transfected into a human cell line, for example, an endothelial or hematopoietic cell line, using either liposome formulations or electroporation. I-2 ,ug of an additional plasmid containing sequences encoding a marker protein are co-transfected. Expression of a marker protein provides a means to distinguish transfected cells from nontransfected cells and is a reliable predictor of cDNA expression from the recombinant vector. Marker proteins of choice include, e.g., Green Fluorescent Protein (GFP;
Clontech), CD64, or a CD64-GFP fusion protein. Flow cytometry (FCM), an automated, laser optics-based technique, is used to identify transfected cells expressing GFP or CD64-GFP and to evaluate the apoptotic state of the cells and other cellular properties. FCM detects and quantifies the uptake of fluorescent molecules that diagnose events preceding or coincident with cell death. These events include changes in nuclear DNA content as measured by staining of DNA with propidium iodide;
changes in cell size and granularity as measured by forward light scatter and 90 degree side light scatter; down-regulation of DNA synthesis as measured by decrease in bromodeoxyuridine uptake;
alterations in expression of cell surface and intracellular proteins as measured by reactivity with specific antibodies; and alterations in plasma membrane composition as measured by the binding of fluorescein-conjugated Annexin V protein to the cell surface. Methods in flow cytometry are discussed in Ormerod, M.G. ( 1994) Flow Cytometry, Oxford, New York NY.
The influence of LIPAP on gene expression can be assessed using highly purified populations of cells transfected with sequences encoding LIPAP and either CD64 or CD64-GFP. CD64 and CD64-GFP are expressed on the surface of transfected cells and bind to conserved regions of human immunoglobulin G (IgG). Transfected cells are efficiently separated from nontransfected cells using magnetic beads coated with either human IgG or antibody against CD64 (DYNAL, Lake Success NY).
mRNA can be purified from the cells using methods well known by those of skill in the art. Expression of mRNA encoding LIPAP and other genes of interest can be analyzed by northern analysis or microarray techniques.
XIII. Production of LIPAP Specific Antibodies LIPAP substantially purified using polyacrylamide gel electrophoresis (PAGE;
see, e.g., Harrington, M.G. (1990) Methods Enzymol. 182:488-495), or other purification techniques, is used to immunize rabbits and to produce antibodies using standard protocols.
Alternatively, the LIPAP amino acid sequence is analyzed using LASERGENE
software (DNASTAR) to determine regions of high immunogenicity, and a corresponding oligopeptide is synthesized and used to raise antibodies by means known to those of skill in the art. Methods for selection of appropriate epitopes, such as those near the C-terminus or in hydrophilic regions are well described in the art. (See, e.g., Ausubel, 1995, supra, ch. 11.) Typically, oligopeptides of about 15 residues in length are synthesized using an ABI 431A
peptide synthesizer (Perkin-Elmer) using fmoc-chemistry and coupled to KL,H
(Sigma-Aldrich, St.
Louis MO) by reaction with N-maleimidobenzoyl-N-hydroxysuccinimide ester (MBS) to increase immunogenicity. (See, e.g., Ausubel, 1995, supra.) Rabbits are immunized with the oligopeptide-KL,H
complex in complete Freund's adjuvant. Resulting antisera are tested for antipeptide and anti-LIPAP
activity by, for example, binding the peptide or LIPAP to a substrate, blocking with 1 % BSA, reacting with rabbit antisera, washing, and reacting with radio-iodinated goat anti-rabbit IgG.
WO 00/49043 CA 02362300 2001-07-31 pCT~S00/04160 XIV. Purification of Naturally Occurring LIPAP Using Specific Antibodies Naturally occurring or recombinant LIPAP is substantially purified by immunoaffinity chromatography using antibodies specific for LIPAP. An immunoaffinity column is constructed by covalently coupling anti-LIPAP antibody to an activated chromatographic resin, such as CNBr-activated SEPHAROSE (Amersham Pharmacia Biotech). After the coupling, the resin is blocked and washed according to the manufacturer's instructions.
Media containing LIPAP are passed over the immunoaffinity column, and the column is washed under conditions that allow the preferential absorbance of LIl'AP
(e.g., high ionic strength buffers in the presence of detergent). The column is eluted under conditions that disrupt antibody/LIPAP binding (e.g., a buffer of pH 2 to pH 3, or a high concentration of a chaotrope, such as urea or thiocyanate ion), and LIPAP is collected.
Various modifications and variations of the described methods and systems of the invention will be apparent to those skilled in the art without departing from the scope and spirit of the invention.
Although the invention has been described in connection with certain embodiments, it should be understood that the invention as claimed should not be unduly limited to such specific embodiments.
Indeed, various modifications of the described modes for carrying out the invention which are obvious to those skilled in molecular biology or related fields are intended to be within the scope of the following claims.
WO 00/49043 CA 02362300 2001-07-31 pCT~S00/04160 ~, .; - -- o ~r . --M En O ~. ri O
N O O E-~ H O E-~
o H z ~ off o r-~ o w r~ r~ o o ~ m ~ ~ ~ N
- - H q o U - o cx t7 a a7 rx ~n ~o ~ ~n ---E-I W H ~ H W
W O W O ~O Gc, H O ~O
O O rI ~-M N
--wr-~wHx .o~, -- xr-~x ~ ~ r.~
m o mn ~ E-r ,-i o C~,x o ,~ ~r o y ~ N x M M ~ ~ mn oo U o0 E-m, ~ m [~
~ n o o Cc, M N U1 N
x m cx o l0 ow 0~
H
tf1 L!1 l0 L~ N l0 01 - O
Op M H H ~O
W 01 ~
O Lfl N L~ I~ IIW l~ lf1.-. LC1 x w-7 al D t~
N Pa l0 OW -i rl O rl N lI1M N
N v tf1 Ul N "
'~' CO N M tI1 H O
~
v0 v-i - ,--~ . [-~ -- - v0 - ,--~
l0 -,.. ..- - - ., ~ .
H ,.. x - N - N ~ - --. L~ -~ ~ N
l0 N L~ e-i N ~ ~O
H
!-. ~ O w-i -~ rl O M O
O (~ O rl O
O rl rl O
,~ -, yo o ,~ o E-~ o cn E-~
E-~ E-~ (-a E-mo z ~
E-m o~~ Nz~~ off o oz z ' o H~ ~ E ~ z~ ,-oz a x~v~ ~z~ zN~~
H
z~ zo r~ aoa oHrx rx ~H ~ w H
v wo aU -m z~~ w oa xw o r~
-x oa -(> H ~ W ~ E-~N
w' '~ '-' Y~ " '-' '.
v N
~ ~, x ~, ,- x -- ~ -- ~
~, m (a v ri ~- H H v H l0 H M l0 O O r-I N O O O
l0 xr-~r.~r~rlx ~xo Mxr~ HN ~oxx Nr C~m o ~r ,-, x ,~ ,~ a~ C7.~.o u, o N ~ ~r o ~ o u, M co r-~
o pyo M x x d~ N d~ ~
fx E-I ~ ~0 z ~
M CO H E-~
~ W -i 01 rl rl Ov N 10 O
O L(1 d~ M 01 ~] rl W
cY A.i f.Y] Ca N
o~ M m M t~ o, o 00 ~r ~
o~ ~ ,~ u, ~o 0 ~ z u, v1 r.~
H H M h V~ Lll N O O V~
W H rl H CO ~ N O
M ~D
,~
H M ~O ~ 1f1 ~ 01 rl C1~O M
x ,~ ~ [~ z M W
M H - (Y]
l0 01 H ~ 00 ~O N C~ H N
N ~ M V~ N 'J '-' Lf1H
'-' H
~
e-I Ov N O N H
Ca -,--~ -10 -CO -ri -l0 -10 -r-I
- -- ~ _ ,.- - - . ~.- - ,.. ,..
x w ~ ~ ., ~., ~ x ~ ~o x -~ ~ N ~ ~ .-1 00 ~ M N
V~ N H 00 l0 c-1 N 1p L~ d' L!1 v-1 c-i r1 rl N O M O r-I
O O (~ O rl tf1 O O Lf1 M O M \O
M
o E-mno E-, o 0 o E-moo H H
E-~ ~o H N N E-~o ~r C~ o W O H O E-~ H E-~ ~ E-~O ~
M N L~ O O ~ ~ ~ rC
O r.~
Z H a ~I H E-~ En ~ z fl.~ ~ U E~ c o t~ W u N ~z U n C7 ~ '~ 7 m m o '''~ H N o z M
w~ zo wx ~ ~ ~ ~ ~
a x a Ll O U W C4 H W G: Gq W
P~l - E-~ E-i CL - 0.~- -- - -~7 ~C1 u7 cn _. ~q .r ..
v ~- -- v ~ ~.
,~ ~ , N c0 Ll N N CL Lfl ,~,~W O N .-I rl l0 r-1 l0 l0 c-I lD
C .-1 O t0 rl O O~ O
rl N GL rl -I c-1 CLn ri x Ci.i R; [~ x C~ (~ Ct.,CO
H W IW H
x x O E~ N Cc~ Cz~ N Cid~D l0 ~ ~ O N pG O O M
N lI1 01 O
O L(1 W -i Lf1 V~ N d' N ~ d~
C1 l0 LC1 l0 C~ O E-~
O [-a O
Ll1 Ov M ~ H ~ ~ (~ l0 rl O E-~ 01 ~ e-i ~ M O W
H O O M
~
u-1 ~ ~ rl N l0 rl H V~ rl N U7 N Lf1 ~l l0 ~ (Y., L~ l0 FC U1 ~
~-i N L~ CO I~ l~ CO CO O rl l0 0: ~ d~ O M Ca d~ d' H L~
l0 Op ~ ~ N M M M O CO
01 (Y~ [~ M W U N (Y~ FI,' (.Ta O (Y~
O
r-i O1 rl tf1 O N rl l0 .-i rl e-1 " N M .-I -~ N U~
-~ U7 v V~
.--~ M ri N N ~D M t~
o H H H H z H H
0 o o ~ o z z r.~ W PO U f~ CO U
tI1 M O ~D U'1l0 H
L~ 01 CO 111 d~ ~ ul H lfl M 01 M H l0 00 O N ~ l0 CO d' d~ N
r-1 H oW f1 l0 l~ N O O
H
U ~ N ~r c~ ~ M o o~
v-1 N N N tIlH H
'~
O
~~
z M ~ L!1 l0 l~ CO 01 O
H W -i ~-I u-I rl ~-ic-i N
r1 U
Ot ~
W
z ~n v..
~o ~.~
z p, Ca N H N M d~ Ill l0 l~ CO
H
ri I
w O
WO 00/49043 CA 02362300 2001-07-31 pCT~S00/04160 t~ ~ ,~
o N ~ o E, E-~ E-m n O
O - O O
H H U
N R i ~'r ~
U ~ ~ O frl U1 ~--- N z -- --N N
~ o a ~ o .oxN goo HxN
E-~ a~ C~ Cx, N 0.n O~ V~
O O
N 00 t~ M H O
~ Z, z O
tf1 N ~ ~ CO 00 00 r7 l0 c-1 Lh [~
Lf1 M O
U1 z O 00 _ ~ M 00 Lfl IYu H l0 O r-1 ~ O 01 M
/Tj ~ L~ [~
CO N U Lt1 I~
N N N V~
V v H O H M
O
. ~o E-, . ~o ~
F, -, W ~-, . ~ _ W E., M f-~ ~ l0 ~
H N e-I
G-~
l0 z N M O CO
O ~.,' [~ O
M
O E-1 a O O E-1 O
M ,'N L~ C/~
Nom o~ox Hmn Ho ~N
"
z~~ ~~ H~
z N G1 U
N oN
o xa a~
~
Pa U .-. H -- Cq ~ U ~ r ~
N . H ~.
. N
(a ~O l0 l0 10 H O O
lp ~ E-~ E E-~ ,-i ~
E-~ E-~ Ea H
Ei w r~~o ~o~ wMo xo a O H M L~
~7 I~ 00 O N M
M O z Lf1 U7 ~
r-I Lll O IIl N
N W H M
(Y., M
Lf1 ~ z ~ Lll O
H 01 U7 l~
~
Ltl H ~ ~ N 00 N N '-' M
"
r-I .~ N
_ H _ .~x~ ~x~ _~w~ _ ~ t~ N H -. ~
M N d~ H
H N
H
M O O N ~-~-I rl M O H O
O ~
O
o E-~ E-~ o H o M o t~ W
E-~ H E, H O W w G~7 E'~
d~ W O O
H Ln W
oz~o wow oz~w oz ~ ~ H
H
U
H V~ a N N H
N O U
x ~ ~ a ~
- - w _C c n -, m -- x -- oo ~ .r ~ .-~ ~ ..
v l0 t--i ~ M
~O H O ~O H
N lp H
H H
~owHx wHw HNHx Hx xHoo ~o~ xMo~ xo ~~z~ ~zM ~,~z ~~o o M o, M a ~ N
v o N m t~
o d~
d~ H O I~ H
M l0 M Lf1 l0 Lll H N e-i O 01 H
(Y., (3a a., M
H M M
l0 M z 01 Lf1 H ~ L(1 L
~ f~
l0 M
l0 H ~ O e-I LC1 N N '-~ M
'~ M
V~
N H H v-i H O O O
H H H
t~ O O W O
z z w z ,R E~ a H U
u7 O
a rx w z M r-I ~ O
dl M N O 00 M O do [~
O V~ 00 L!1 Lfl H
r-i 10 O1 M M
U I~ t~ M t N N M M
'~
O
-~
z O ~-I N M V~
Ca H
r1 U
O~
~
W
z~
~s o -~
z O H N
r-i r-I r-i .H
W
O
W
WO 00/49043 CA 02362300 2001-07-31 pCT~jS00/04160 u7 fx U
-~ pa W
U
f~
~ U7 U7 V7 C/1 rl U7 U7 O
(~
~ H w xH a,H w~ xH x H
1.W U) H U ~ U7 H w U U7 U U7 (S ~7 ~ r.>r H ~ H FC H o ~ O r.>r O r->r Ca O FC
~z a owaa as o s~waa a a oa ~a,cor~rim ~wa,mm m cn a~
bl M ~,' ?f LT
1.', a -~, r1 ~-1 .T", 1J
N
O -~ t11 S-I O t5~ v w ~ ''C N t~ ~ '~ U -rl ~-I
1.1 ri .j", Ul t~ -rl >_.,'U
L!1 N
f~ N -r-I r-i ri 'CS ri ,x O
U ~ ,~ M 7., ~ t"., ~ U U1 (O U
-rf O I t,'Jl'>'f Z71 rl I r-I
~ r-I Ul ~
w ~ r1 -- -ri ~ .R N -, r-I C1~
CO (IS
'L3 c0 w fJ, O a Ln ~ I co O I I
u1 O d~
1J O S-1 (a 1J ~"" w-i ~-I ~'., O ~'., O S=', ~ ~-1 .rr C~ N ri ~', r-1 r.,' ri M ~ G~ Ul ~ ,Q ri C7.~
e--I rl -rl 1J 4l ~ SV O 4l 4J c~ 17 (O (a N H N In 'L3 H U7 .U U7 ~I ~ 1.~ ~ N ~ N
~ rd aJ N N
ov H O O O U N ~ O N O N U7 N .-n f~
N
C~ ~-1 ~'.. 11 r-1 S-1 i31 ~-I -r-I
b1 ~ ri ~1 4) -- o ~, w ~ v~ a~ a, -- o z Ts -- t~ tn --N M
U W fsa S~ ~ I
I
~
.p.cW
o .,.
tT' .~ ~ N
M
N ~
4) ,~ ~ (a .. ~ ~
N
~.1 ~ N .~ ~ ~ I
I~
M
r-I -r-I .,..I ~ H
C~ c"I r-I
M
-4~ ~, oq o ~xz ~
_ c z S-I ~i H ~-1 N
o >~ w ~ a o ~, _ -N
[~ M
-~ N d' z ~, z z .~
v >~
~ ~ O~ N O
~
N Z
-r1 o Z z z ~n w ~' ,~ ~ u, , 0 H .-1 H N N L~ N t z z zzz z z 111 ~ c0 N
t11 c0 rl \O N O
~., l~ M 00 H rl d~ Ov M 00 e-1 CO M
r-i d~ M N
y -I cn ~ u1 cn amo ~r Lt1 co H t~
Ul r-i d~ ,-I u1 ~r ~ In O M M N N M Ll1 ~O C~ rl V~ M
CO N ~O
-~ ~ M M H U7 00 N U7 U7 U7 N N H
U7 H ~ U7 U7 00 ~
r-I I~ V~ U1 t11 M H V7 fl V7 1~ N ow a ~ '~ CO U7 rl lp I~ ~ ~ U1 M
H OO L~ 01 CIA CO
~
ri H U1 00 ~ N Ul lD M e-1 w1 [~ O ~
r-i O ~ O
v Ul C~ M M lIl N M Lll l0 O O V~
M I~ O~ N l0 l0 M
~ ~ O rl U7 N M Ul H Cll U7 N II1 ~ Ll1 U7 V7 CO U7 U7 II1 H N
N ~ t~ H N a, u~ H u1 cn O ~
ri ~ d~ U7 00 U7 U7 d~ l0 l ~ O t!1 M 00 O H 01 u~
O H 00 l0 ~ N l0 00 OWf1 h d~ 111 ~ O C~ N O N l~ lD
L(l W M 01 M N N CO 111 N M ~ L!1 ~ rl N
U7 r-I 10 v-i N ~O N M O
O ~ H cn ~o m u~ v1 cn ,-I H H
u~ ~n u1 v~ u~ u~ v1 ,~ ~
?~ ~
M u1 H cn u~ H
cn H v-W o0 O c0 O~ M ~O rl N N
t~ O 00 ~ t11 N ~O
c0 CO N O\ CO Lh CO M r1 CO L~ O ~
IIl M M l0 00 ~ CO N N l0 l0 1D
oW rl M r-i LIl ~' N M M tf1 I~ V~ N
-I M N l0 L~ rl tll ~ d~
N lO N
H H H ~n ~n H ~n H H v1 H ~n ~n H cn ~n u1 u1 cn H H ~n ~n H ~
N
o a, o, ~0 0 ri M CO O tI1 N O
U
~
~ M ~ V~ l~ N Ll1 ~
a q e-I N M ~ t17 0 Z
CA 02362300 2001-07-31 PCT~js00/04160 ~o ~ x~ ~ x '~ s~ cn N L1 to U O ~d U ~d O ~ O
W O m ~ O Ll W ~ O O Pa G1 W ~ f~
u1 ~ G: ~ FC a O >~ a ~ ~ O ~ r~ O
-''i m ~ C W ~ Cue, m CG N f~ Pa O N rx N f.~ rx C7 I U ~ ~ I f.~ C7 0.. I G~ C7 Pa C~ ~ Pa o (a V1 I U7 U7 I I (~, U1 I U1 I I U7 I U7 I I U7 I I I
~ f~HP.~ Cl.~H~ N0.~H f~H~~H C>~HH C~HoGH
1J (a H U7 ~'"r H U7 W Pr ~i U1 H U1 W fir' U1 H V7 V7 H U7 W U7 N L1 H r-C H H FC ~ H FC H rC H ~ H FC FC H r.C FC
oaa oa~~Haa oa as oaa oa~a ~mm ~cox~noar~ ~r~ cam ~r~r~ ~r~xa~
~"., r-I M W Gi M
'1~ ~ F.i 11 J-1 l0 ~r-I fri ~ '~ ~.i H N ~ri 00 N 'rl ~ri 1-1 U ~ rl i.i ,fZ ~ L G~ O
bt O N ~ I tJ~ O '~ wl Sa ~
w ~-i -- Sa V~ ~-i -- ~ N U1 I~ ~H f.~
-H O GL S-I M O f~ U1 -rl V~ H O tn 1J S-1 N O O LI1 ~.1 .(", U1 U7 ~ ~'., b1 N
..f"., ~ U1 ~, J-~ V~ ~ -H fx Ol O Lll N ø, ~-, r'-I
1J (a 'H ~ M 1.~ ~ rl ~-I 11 01 .f-', U7 -rl 01 U7 '~ .-i ~ CO U1 L I ~, (a 01 -rl O '~ I~
H -H o U~ ~,o a ~~N p,,~ ~~H
x ua~tT xs.~ a~~ts o~~.~~
U O r.>; ~.I --- O ~ U U1 U7 ~- U 'r .R '-' u1 ~ .. ~ ~ H ~
.H ~ ~ w t~ U1 d' ~ U
U ~ ~ 1.~ ~ ~ .. W .~ 4J ~
1.~ ~ crs ~ O U W ~ ~ ~° ~ rtJ
N v w Sa -~-~ ~ 'LJ ~ x ~ -N u7 'Lf E to N ~ ~ U -~ ~ , f~ 'x .~ w f(f -ri ~ 1~ U ~., Ra 't O ,Il Z37 ~ b1 N ~" M ~ ',~ N
n ~-I s..~ u~ U M ~ -H -H -~I ~ 'Li ' I -H d' ~ t~ .,~ °° tn ~
~°
i..~ ~ ~ ~ ~° N ~ v d' N °° .H '1 ,H cn .-r -~ M ~ '~ ~ o N
S~-. 'L~ N C~ O O 1.1 ~ M U7 ~1 ~ O ~ 2J x y~ Cx~ ~ ~ U~
~.r N ~ a! tv H M ~y O U N a ~ O H ~1 t"., I ~ I O " I O I
S-I I ~ ~ ~ ~-I I ~ I ~ ~ I N -~ ~' _~ .. ~o ~-I ~ ~° a ~
~°
~n ~ O °° ~S I c~ R~ -. d' .,~ ~ .~ ~ N l..i N ~"~ ,q w d' f~ w ~° R, m ~°
V ''~ ~ ~n 't~ d' ~ '-I ,~C o ~ d' U '~ I ~ ~ ~n ~..~ ~ ~ I -~ '~ ,.. ~ '-I
rti o ~~ >' Ls ~ o f.~ ~ ~ ~ '-7 3 U C~ ~ o W GL ~ q ~ ~ ~ G ~C
~, -H x -H s~ -~ -~I a~ o v -- ~ H o N o H .c~ o v~ w .H m a a ~I ~ o ~, ~ ~ U ~a ~n x N
a o a, a, c~ ,-, .~' m o ~~ z ~, M M z t/1 -r-I M r-i M N
~, o ~ ~ z z z a°, j, z ~ M ~r, N z N rl M M
r z zz z z zz tf1 rl V~ N l~ M
O N pp 01 Lfl N O\ Ov 00 H O
~~ N r-i Ov N M N N N N CO rl M V~
r1 JJ H N ~ CO U7 U7 V1 H U1 O U1 U7 U1 ~H
(~
-r-I r-I U1 ~ H l0 00 N Lll 01 U7 Ov CO l0 lyr N ~ '-~ O O~ O r-i O~ ~ V~ N tf1 ~ ~ N l0 O M N rl N N N N rl d' M
,~ N cn H H H ~n cn ~r H u~ cn 1J ~"., Cl~
r-I H r-I L!1 d~ 01 rl 00 01 OWE 00 ~ ,H p Q1 01 N 01 CO N M ~' N 01 N
N ~ Lf1 N N C~ rl N H M N vH N N
pn ~ o, u~ u1 u1 H cn u1 ~ H ~n ~n H
tIl M H H N L(1 M (~ (~ M M M rl O 01 c-I l0 O
l0 dW -I O~ 00 O 01 l0 M M O Lll CO rl Lll rl 01 r1 N CO Iw-I N rl M N M N H H N H N N ~
~n H cn H cn H ~n ~n ?~ H H u1 ~ H H cn ~n v~
N
,~ N N L~ L
pp M N
~ ~ N N
~ Gl z C~ c0 O~ O .-1 x a~
~
a ~
~ wH
xw W
q oa ~
~m M
O (a ~-I
O\
.ri Q) 01 1J -H C~
U Y.~ M
O ~ b1 w 1~ U v +~ ~ ~L ~
rtS H H
v .~ ,~ v .H ~ ~
H .F., 0..
O
. H Ca ~-I
U
O
(d U
U
H Ul w rtJ rtf .rt ~-I ~ a~ ~
'Ci N .,1 H
o ~
~
~
~
tn .H t .~ .H
O Q' ~ a -m +~
~
v ~ .H
.u o cn a ~
~
H N
z M
~i c-i ~-1 1!
~ O H
-H
~"., Uj O , d, H
O
CL
ri N
~ ~
.r-I 01 U .'1 N
N
.. N
H v-i z H En P~ p, U U U U U U
z U
W H W H H H
v a u. a ~ t~ a oa ca w w O O r-I M N M M
H
O M tI1 M ~ H
L(1 ~I1 Lfl l0 l0 U1 O
~ o 0 0 0 0 0 H
"
O .( ~"., ~"., ~'., T.,' ~"., , O H CO h Lfl M
O tm O O M O O M O V~
' M N
[~ .,.~ .,~ .,..I .,..I.,.~ .,..I
O . . . . .
O r v >~ v v ~ v v f~ N ~
S~ ~
w O w w O w w O w O
O O
ri -r-I -ri ri -r-1 r-I
-r-I r-I -r-I -r-1 -r-I -r-I
-~ r~ r~ r~ l~ r~ r~ l~ r~
GT-~l~ 1~ 1.1 l~
H r1 r-A r-I .--1 r-i r-i r-i r~ H ri r-I
~ w ~ ..~ .-~ .H w ~ w w w w C >~ N N J~ v v J~ N ~
~ ~
U H U U H U U H U H
H H
M
c~
o ~
N N
U1 ~ ' ~
O
H M rl M CO ~ O O 01 O CO ~ tI1 h r-1 v v h ~ ~ ~
~-Ici c-f rl Ov c1 N M c-i w cr OO
C~ ~ ' ~ ~ N .-. . QJ v .
O ~' ' . ~ ~ ~
N
O o O L(1 to O .('., ~., rl o o r-i o d~ rl O
dW
-~I
W Ll1 ~ ~,-~ p~ ~ ~ ,~ ~ ~
~.,~ ~ ~ . ~ L!1 .
O N O N N O N ~ N O ~ H
- ,--i. . ,--I . ~ O
~ ,~ .r ~ ~
v O f~ O O .-. , , , ~. I)f O (a H H (a (a fa O ~ O
~.
tn 'r v ~- v0 v ~ ~ '-' w ~
U ~ o~ G ~-I ~ ~..I -~-I ~ c0 M
U1 -'i r-i h -1 -.-i -.-i U -r-I
(a f~ -r-i (~ c-I U (0 (L3 ~
v ~ v v ,~ v ~ -~, -.~
,-~ N ~ r-I ~ v ~-, ~
a ,-, ,-, v .-~
H > m > > u~ > ~n w w ~n w ~ ~ m ~ ~ >
m . ~
-~ -~ -.-I -~ v v -~I N ~
v o o v v U o -~
U v U o U
JJ l~ .~-J 1J 1~ -rl -rl JJ -- ~- 1J 1J UI l~
U7 1J .u ~-- -r 1~
U .T".,U U ~ U ~ O U O ~
rti ~ fa ~ ~ U
(~
O ~
- O
'~ ' '~ ~ 'r~ O O 'Cf O O
O ~ O O O ~ 'L3 O ~ O
O ~-I O O O O W -~ 1~
rl O S-I ?-I O r-I S-a ~-I ~ O O O
.n S-1 ~.1 y.1 ~-1 1.1 !a (~
1J ,' ,' 1.1 ~.d 1J
''~ 1 1J ,7 '~ '~ ~1 T1 '(~
Gz OW f.~ G~ ~n ~ ~ ~
cn -I ~-I ~n f.~ f.~
~-I cn ~n ~ s-~ ~
s..l ~
N t~ v v N N N v N N t0 c(f N N to tt~ ~ v v rti v ~
(x (x (x z cx C7 x x C7 C7 z C7 U c~ U !x U C~ U z z U
h JJ h O N O~ Lf1 O
~, h h O r-W ~ N
O
~"I~ N ao av ~ ri v rl U I I I I ~ I
I I
L71M N ~-i Lf1 L!1 ~
H M O M h ~ h (~ ~
f-1~ N h 00 ~ O
Lf1 C W -I e-i Ca H
M V~ L!1 l0 h 00 w ~, O ~ U U U U U
o z z z z z U ~ H H H H H
'S~'' l0 ~ ~.. M ~. \D ~
O QO ~ N
H I~ ~' CO N O d~ L~ V~ LC1 M W t~
M N M M CO N N lfl N O~ N rl rl N O N N .-~I
O
O O O O O O
Ei v O O --- O v O O
v O ~-O
U ~ S~ >~ >~ S~ f~ G
O
O O O O O O
~ - - ~ - - -~ - ~ ~
0 f.a ~ S-I w S-I t~ ~r S-I
w S-~ w ~
-- E-a -- -- r0 -- -- -- r~
r~ ~ E-~ En N E-~
E-~ r~ H
M \ 00 [~ ~-1 O rl ~.I
S-I ~-I \ ~O ~-I \
\ ~I \ \
rl ~ O N ~ 00 0 M Q) N N S~ >~ 4l 7~ 4) .~'.
t0 M O Ln ~ w N tn N w w w O O w O
O w O
O 11 O O r-1 O O rl r-I rl J~ O 1.J r-1 L
11 rl 1J
~ ~ ~ O O
~
~i ~i ~-1 S-1 .S S
f~ f.~ ~ i1. ~1 f~
~ ~ N N N
U ~ U U ~ U U U .-a .H .H ~I ~ ~ ~
r-I ~ ~
~ w ~ ~ .-i ~ ~ .-a rl r-i w ~ w ~ w w ~ w r~ s~ ~ d N ~ rtS t~ N
v v !~ s~ N s~
W ~ !~
U H U U U U U U U
U U H H U H
H U H
U
M
U
a ~r N t11 -r-I'~ c-i JJ
O O Ll1 lO ~D O
u~
Ei ~ ~ a wr -, -.
M
N O ~-i V~ ~ N r-I
S-I rl w _ y -i ~ 00 M 4J
O O O rl l0 w-1 rl N >:., j~" O O r-1 O V~
W n '~ ~ eo ~p . . yn W ~ " '.' ~ ~.
O N ~ N N N O O O ~
-i -I ~ r-i i V O r1 r r r O (a O ~ O ~- ~. ~ H
H ~ O v f~ O
(a (a U1 v l~ "' N '~ O o0 ~-i U \ Lfl Lll '-' ~ ~.1 ~..I
)_', Cue. 7-', \ ..
rl U C~ o O f~ c-1 M (a N -rl ri r-1 (a U
ri N 1J N M Ql N M N ri rl r1 ~ r-I r-I rl rl JJ 1J (a 1.1 N
,' U1 'J ,7 r~ ~ ~ JJ
J-~ U7 ~ 1J U7 -~ v -~ o -~-I o o U
v o o -rl o U U v -~
~ N >~ ~
1J 1J 1.W---- 1~ -- ~ -- U1 -rl .U 1~ tJ~ U1 -r-I
~ J..~ ~ J.-~ 1J
U f~ U U U to ~ ~ O
O !~ f~ ~ ~ U
~ -r-I ~ UI ~ U7 U7 U7 ' ~2, U7 ~ ~ G~ ~ C2, U1 -r-I ri .~ ~
'~ O ' ~ '0 ~ ~ ~ O
O ~ ~ '~ O O O O ''d O O O
O S-I O O O O O O -~
t~ O O a ~ w 1~
O S-I ~ Y~ O
~-I ~-I ~ ~ ~ ~ ~ ''~
~ cd 5 S-I TS ''C rtS
' ~ ~ N 1-~ S-v N ~ v v N ~ N c0 N td v 4W 0 N td N N rd N N
t~
~xc~xz xzc~ z~C7 cxzULl zUC~ zUxo4 m 1.! l11 N l0 c0 M
.$-'.,N ~O N a~ t~
M N M O~ V~ N
I I I I I I
~J1 r-I CO 01 Ul d~ O~
H 00 e-1 [~ Ln M H
(Ij v N N N ~ N
S-I
~
C~
Ca H
01 O rl N M
Qt e-i N N N N N
z W
WO 00/49043 CA 02362300 2001-07-31 pCT/jjs00/04160 v >~ w b1 ~ N ~
w j ~ ,~ -I ''L~ ~ ~., o -rl r-I Lll o rl ?d .-I
M N ~Y o O 11 (~ 1~ - ~-I fa ~1 ,~ ~n w 0 0 w w ~ ~ v ~ 0.
r6 ~ O ~ x ~
N -~I v ,~ ~ o N '~
X ~o w v tff v 'L7 ~ w U U O O ~I v v ,~ O ~ v o~ to Q y.~ ~ v rl U1 U rl fa ~-1 w 3 ~.I ~ ~.I ,S'.., .i.~ ov 1J U7 v O -~ m ~ O ~n N ~ ~ w p, ,~
v -~I C2, w N ?r N v rl cn o ,~ -~ N w cn Wo c0 -3 ~
rl ~ >~ w ~ w w v U f~ 'L3 ''C3 ~ N ~ ~ m N ''C
u1 U7 JJ v f~ ~-1 r-I ?W v t~ f(i X S.~ ~-i 'C~
'~
u1 u1 ~I W ~ ~ N t.m-i 1.i ~ ~, N U
N
~, -~I ~ '~ O ~-i O .~ U v U7 w U .-I U7 (~
w w b~ ~ N v ~ ~ S-i 1~ ~ u1 1>
to N (6 .R >~ +~ O
U
~ -~I N ~ ?,'L3O w .~ ~I S~ r~ w U ~Q O v S~
'a~ ~ -~I U ~ +~ U ~ U w -.~ w 3 c~ -~I
U ~ w N
-r-IO ~-I U7 U7 v O ~ S3W U7 1~ O
O ~-i U7 ,.C, O ~ U7 r-i W 6 ~ )~ N ,~ >~ FC '~
+~ rti U O -rl 'LS
'~ y r-i .~". b1 ' ~
U7 ~"., ~ U U ~ O ~ N '~
N w ~ wi ,-~I -ri N rl U -~-I ~
N U U7 Si r~ O
'L3 N .t~ rl 1~ O '~ U1 ~ ~ '~ O >~
O U ,~
ca U '-I N t4 ?, .~'., U N U1 b1 U
U1 O ~-I O I
(fS
.-i ,~ rt3 N N ~ ~ U 0 ~ O P~l ra 1~ Si ''W c0 rd ~
~I
zs >~ w v U ~ b~ w 3 ~ ~ b~
~ ~ o -~I ~ ~
w o v o ~ ~w m C~>,~ ~ ~ 3 r~ ~
~ s~ 'd ~2 m ~ -~ .-~ ?W -rd r-I -.~ w ~
r-I ~ >~ -~ +~ v ~.I S-m~ - u1 r~ R~ ~ f~ ~ ra ~.I
~
v rl ~ ~I v ''~ ~3 ~ ~ ~ ~ w m ~
rtt cn o .~ v ~ o v f.~ v w ~ m ~ v ~I v ~ v cz, sa cn ~
m ~ v ~I
.(~',(~ U U1 1.1 U1 ~ ~ U7 . 1.1 !n (a (a ~., N rl J..1 N
(d N
~
w b~ -rl (0 U rtS N ~ 'a-W-~ N 'Lj U ~-I ~ N d N O
~
O
r~ ~ N ~ v ~ U7 '~ ~ ~., w ~ w ri rl -~ ~ a, rl O cn ~
f~
N ~ (a ~ tl7 U7 S~ O 1~ S-i r-I ''Cj ~ ri c~ O -r-I rt ~ O ~ N ra v O
U 1~ N 10 > >~ r0 .~ ~r rl 'L3 ''O
w >~ t~ 2S 7.1 'LS ~ 1~ ~ ~ r-I
b1-~ ~ f~ ~ v ~ ~.t o~
~
v ''~ ~d >r -~ ~ -~ ''O U1 ,~ cd ~ w w ~ ~--I~ ~ ~ ~ S-~ ?i U1 -~ ~
rd 4l !-I v .-I ,~., O FC
0 0 o o o rt o ~ rl ~ ~I ~ $ ~n bW ,~ v v u1 -~ t6 0 ~ w - rt w ~
~
U w w ~I w w N w w ~ c~ c~ U O ~I rl .
~ ~I ~
~
~ t6 ~ N v N ~ ~ N w cn ~ ~
>~ ~ ~ cn w zS . 23 ~ ~ ~ ~-I ~I cn rd o .~ z3 is cn Q, ~ 3 ~n ~ -~ -~
v ~0 p, N v ~, N N rl v (d v U7 ~d ~-i i-I
U ~ rl ~ O rl U
~, J-J 1.J .~! t~ -r-11.~ ~, T3 ~"~ ,~ 'a-~
N I ri ''~ 1~ U1 U 'L~ U7 ~'., ?d ~ b1 -ca W o rt1 r0 ,~ (~ v o ?~ ,~ o v -~I
1~ ~ ~-I (~ ~ ~ ~ o v .y -i rl ~-i rl U ~ 'a-~ ~ O ~-I 'L3 ~ V~ ~ t'r ~ O v ~ N m -r-i v O O >~ O O (a O O U ~ f~ ~-I ,~ N
O ~ '~ rl w ''C
O
m tn u7 u1 w tn ~ '~ 1md ,~ -~ ~
~-I ~ ~-I ?~ ~-I ra N ~d .u -.i -rl -~ w ~ rl ~ N U -~I w O
U ,Q N u7 'L3 ~ .-i ~ U
O
w ~ rl ~ ?~ -.-i +y v w ~ ~ FC
U R~ ~ 4l a o v ~ ~
o FC r.>rr.>r FC ~n ~ ~ t G) o ~ -~ 3 -~ ~ 3 rl ~ o ca ~ U ~a U o o ~
~ ~ ~I o x ca o N E
v U ~ ~
~ ~
' S-a Ci ~ ~ ~ U ri U ,~ E O ~
w U rl u7 f~ '~ ,~ ~ 'C3 U1 O v w ~
231 lT b~ b1 -rl tT r6 ~ v O ~-I i~
~ O o ~ v W 0 a a S-I
>~ f~ ~ >~ v ~ U rl ~r ~I w O Sa v O~ ~ w ~ ~ ~ O tT z p, 2f to -.-i-.-i -~-i-~-I ~ -~ v 'Cf 1~ S-I N
cn N O ~-I ~ v U in v cn ~ ?W ~ m u~ >J
v ~I u~ r~ z3 v N 3 U
m ~I v rl ~
~
U ~ 'CS ~ ~ ~ ~ ~ b1 !~ v N U ~C
U '~-I~ U -.n ~ i-t rl -i v -~--IO O v U rtf v ~ u~ O
~ U1 ~, N
~ .rs ~a ~d w ~ I ,-I v ~ o v v ~C it ~ ~ o >~ ~
,.~
, ~ a ~ ~' ~ v ro ~
v ~ ~ ~' ~ ~ v ~
u - ~ v ai ,~
. ~ ~ ,~ , I
?, .
~
U U ~ U U (~ U v t~ ~ u1 w ~ ~
b1 O v ,~ p, U7 f~ ~ U7 O
v ~ O ~ ~ -~ ~ ~, N v O w U1 ~I
O ~ w w O ~ N ~ t ~
!-i ?-I ~-I S-a ~.I I G.~ ~ ~ O ,R
,~ '~ v U1 CJ1.~ ~-I ov ..
~ -r-I v .Ll -~ (a ~ w ,il w w c~ w In >~ w fly v ~
H O ~ ~ w ~ ~I o, ~
U cn U7 U1 U1 U7 U U7 N W (a '~ C~, W-i (~ O ''~ 'C~
~'., ~-I
>~ ~ I >~ !~ ~ ~ ~ > ~ ~ ~ >~ v .~
w w ~I v 3 r~ >~
O'~ ONc~U O~ Oc(i(1,(aO(~N~~~v'd vvv--I~
U
U U 7-I U U U U 3 O .-i N C~ O ,~
~ W ~ ~ ~ 3 ~ ~-I
ri w ~ N O !~ ~ O ~ U7 --i v E-~
m O ~-I ~ ~ O
U7 U1 IJl U1 '~ U7 O .~ F~r r-I 'L~
,L',.~'., ~".,~ O U7 O ,.>~ U1 c~ N
r-I U rl ~ to t>~ r0 ~ t6 S-I -~ to >~ w U wl w O U tn >~ U1 w '., ~
3 3 ~ 3 3 0 3 w v ~ ~ E -~ ~n U z3 ~ u~ -~ ~ ~ v ~
N O v I v LT ~ C7 ~ ~ (~ O v ~,' ~ p:
?W ~r.i".y.,w?, ~.I ?~'~ ~ U7 O .!~, ~
.4.1 t~ N ?~ w w r- N
~ S-~ ~I S-W ~ N (a w ~ O ,~ ~
O .1J >~ If '~ ~ ~-I N v 'L3 O N .U
r6 (fi rtS ~ N cti ~ ~ w -~-I O rtf ~I tfS U ~ ~ N ~~I v ~ rl ~ O
~-I ~-I ~I ?-Wr ~ O wl ~ U7 ri S-I
~-I ~ r-a wl x U .aJ w ~-I O
(d v S-a N
,~ ,~ .L2 .~ ~ v ~-I -~ r-i ,CZ
O I 'L3 N ~ v U1 U7 ca >~
, ~ -I ,Il 7-I v ~ w -~I
~ O N O N
a CZw -~ -~ w -r-I ~ , N S-W a N ~ ~I E'i ~ w ~y f v ~l M r rJl U7 ~ w -r-I , C~
~ ~"., ~ S-1 1J
N U ri -~-, U1 ,L,' ri M rl N N
O O O rl O O
o s z ~ H
~ w ~ a ~n a r~
a w w w ~ w ~ w v ~o oz v r-1 M ~ LC1 l0 L~ 00 Ca U ~ ~ r-i c-W -i I--i >~
d ~w ~
tn O
I ,L.,' w ca v r, 1J U1 M ''Lj~ t~ -~ ~ O
~ ~ -O E ~ ~ !~ O O U7 h ~o ~ ,~ O
~ ~
~-1 ~1 O (~ ~-1 ~ (If v ~1 (l1 1~ ~-I
~ r-I ~ O .-.
w w w ~ ~
o ~ ~
w w -r . >~ ~ o~
6 ~ ~ ~ m -?~
>~ v ?~~ 'LS O c~ -H r~ ~ ~I 'C
o s~ rtf r~ v owo v ?, H ~ ~ N ~ S~ U cn 3 SC ~ v Z3 ~ 7 ~ ~n a v > ,-i o b~ ~ v o ~ o w N ~ r~ a v ~ ~ -H
.H -~ to -- m O O ~ 1~ ~ 1~ O ,C'.,~ f~ '~ N JJ ~-1 1J
O .j-'., U7 v O (~ 1J I
~ ~ rl O U ~ +~ ''~ is 'i3 ~ ~ U7 rd U ~ ~ N f0 w rl U ~
N O to ~ N v rtS N ~ v ~ N O rtS ~
r0 c~ ~ .~ cti o~
~
~-I ~ ~ N ''C ~ Pa ~ O WD N ~, ~-I N
>~ -H rl . U7 ~-I c0 l ~ U r~ f-I 2f O ~-I U ~, ,~ ~ w ,1Z
b~ S~ TS U1 H 1-~
O
v rtS ~ N N N v ~ v-I ~ ~ ,~ E-~ r~
~ -~ N X O ~ 1mo , ~ W 'LS N C2, ~ ~, v v ~I ~ ~ !~ N ~-t O ~ u~ TJ ~ r.~ ~ v (!7 .-I ~ Q~ cl~ ?-Wtf'd .L.l ~-I r-I
c0 c0 11 ~H v O ~-i f.2~ U1 ~ .~
~
m ?~ ?, ~n ~n ~n U v s0, U
~ it ~ ~ 3 ~n ,~7 3 ?~ ~I
v v H ~, i.~ U7 r-i N ~ !~ -~-i i.~ O
2f U7 (ff -~--I 3 !~ O i-I
U N
1..~ ~ H H 1Wi ~ O O ~-i U1 t(S U
(d ~ to ~ TS M O ~ ~I O ~0 (d O O +~ C2,-~ U~ ~ U Sa ~ U1 -rl ~,'C ~ N O O ~ v ~
1-1 1J .~., JJ O U7 N ~-'I ~ ,~ ~ rn ~ U ''~ rl .1.J ~-I S~ (if in U t~ ~ o ~n 7, H ~, ~n ,H o -.~ o m z3 ~ ~-I ~ w o 3 v (a N ~ df (~ S-a J.~ dI td N S".r ~ ~.1 ''d ca JJ .S".~ v N ''CS U7 PG
.
v 1~ ~ U U v O (~ v 3 U w U w-I ~., >~
~-1 Ul v v S-1 O ~ 00 r-I 'T~ ~1 M (~ (a ~i F. 11 v w N
~
.~ ca -~ U ,-, ,~ m ~ ~n ~. v ~ ~ ~ v -~
o ~ s~ ro o ~
ra ~
~ v >~ o ~ >~ -~I rl m ~I ~ v v ,~ ~
E o ~ ~
;~ o 'd '~ O 'Ci 'L3 O -rl (d U ~-I U 1-~
N U O ,~ ,~ N E~ ~t .R U7 Ul U v ,~
v .H ~ v v -~ C~ ~ ~-I u1 O df ~r rl ~ m ~ ~ rl v N
~
U1 f~S U1 U7 tJ (a ,~ ~ '1-W r-I
~ JJ -H r-I ?~ VJ ~'., r-I v H
v (~ U' f~ U b1 (d (a U r-I .$"'.,-rl ~ r-1 O N
U fir (d ~ v W 1~ ,S'., tJ1 ~ (a v -H N N s~ N ~ ~ -~ rl ~ U u1 rl v -H -H -~ w T5 .i-~ O --u1 U7 'L~ Ul U1 '~ U7 (~ ,~., O N -rl (a N w rl -r-I .~., v ,f' v .H l0 J-1 -rl ''~
-H ra ~ ri -ri ~ ~ r-I 1~ -.-i ~
~ S-i ~ v N ?~ r-I ~ E1 ~ ~ O ~
U 'L3 v '~ ~-I 'Ci 'L3 ~ .sr1r-I O 1.~ ~ r.>r -~ -H O ~ Cm ~ U O '~ ~
v td 'd u7 ''~ ~r O U N O ffS .~', ~ ~ ~ ~ O ~H
~
~ ~ r~ ~ ~2 ~ ~ ~ ~ ~ U7 la w ~ .r ~, v W I ~ w U ~ N ,~
~
O to ~ O Id O I~ O O U1 -~ ~ -H ~ ~
N ,~ ?a N v U 1.~
U
U ~-I ~ ~ ~ I ~I S-I ~.I ~ N 'L~ O b1 U1 U ~ v c!7 v U7 ~ U7 u~ v 1~ .~'., w v -~I w c~ w v w w ~ ~.,-~I O ~ N
. ~ ~ N ~I rtS O ~ H ~ ~-I
-H ~ N N ~..y ~--tU ~-I ~ ~ -~ U
cd ~ O f-i N
'~ (a ~ 'C3 TS ~ 'd 'C3 U7 v ~1 u1 U7 X O lyl7 fp rl CL ~ U N v ~r v r-I ~ v f~ v rH v v ~ ~ ?~ ~ ~ o U
~I -~ O O .~ ~
~., ~
1J -ri JJ .!-~ 1.) l~ .)r., ~-1 ,L,' ~ ''('~ .-t H T3 p., ~]y U N ?i 1-~
Ul 'CS ~.I O N
Id ,~ ''CjrtS rtS cd ~ (6 ~.I 'Lf ~ v v ~ v .~ a N
n ~ r-I -~ ~ ~ .-i ~ ~ .~ >~ N v '~ I-I
'd v 1-~ U >~ r-i ,~ ~ ~ ~ O
tl1 O (0 U7 O tff O N O O 1y, O t~ 1mH O
~ ~ r(S -- rti t~ U O ''d O H
a U7 v r-I U7 U7 1-~ tl7 U7 O S.I -ri ~ U
Sa 1-I Sa t~ cd O w rl U7 H
U ..r., -~ ~1 ~.I -H H b1 H -H O rtf 4-~ rtS
U v O a7 ra ~ I rl ~ 'C r~
.-a O
-!~. >~. J.-~ ~ ~r .~".,,~ ~ ~., U ~ S-i l ' 'i~ U 1.~ ~ ~ ~ N !~.
l i U1 O
~ U
a ~ -O
S ~
~
w r~ r.>r ~ r.C c r. ~ -H f~ -~ , -r ~ U f r ~ H S v 7 v ~ O U ct ~ r ~ ~ f~1 ~ ~
~
-I -.I rt ~ .
, i- I H ~ 3 -I
~ ~ H -I
r~
~ o a c~ ~ ?
''C t~ L31 - is ~
~-I >~ v 'C3 1~ ~ ~.I U ~ f~ O I
~ U O b1 C ~ 1.~ i b1 v rtS O 'f~
~ S -H O S N ~ O U M O v i ~ S~
f~ v -r-I .r T T > .4.1 Zf u~ ~ ~
~ -H T > ~ v c( ~ E E !H rt ~ ~ w Sa ~ ~I
O ~
W
~ ~-I ~ -.-i .-i .i -~ S-a N O u~ ~ ~
U1 5~ ~ ~ v S-I O O U7 N
~
v m r~ ~ cn ~n ~ m m ~ ~ w U ~ ~ ~I
-~I v ~ v ~ ~ ,~ 3 ~
o ,~
~ TS ~ ~ ~ ~ ra o 3 ~ is w .C > U '~ ~ ?~
~ r~
v ~ v -H -H ~ ?~ v rd o >~ ~
tn ' u7 a~
~ w v ~ ZS 23 w ~ ~ N ~ m v ~ S~ o ~ v ~ o N -~I ~ v >~ rl v ~ !~ rtS v it rl v ~ ~ '~ 1.~ o w N o i-~ v 1~ ,-I v 1.~ o ..
v .-I ~
w ~
1.~
~ ~ m v a ~ ~ ~ a a >~ t~ r~ -~I >
''t~ S~ r~ u1 ~-I H a, v .~
U r0 U7 U ~ U ~ U U O ~ ~ N 1-~ 't3 -H ~ ~ v ,~ ~ . wl N i.Wl C~
~ -~ -H ~ ~r ~ -~-i ~ ~ ~ O ~I U v ~ ~I
~ r-i w 1~ O U7 H
>~ v ?~~ 'LS O c~ -H r~ ~ ~I 'C
o s~ rtf r~ v owo v ?, H ~ ~ N ~ S~ U cn 3 SC ~ v Z3 ~ 7 ~ ~n a v > ,-i o b~ ~ v o ~ o w N ~ r~ a v ~ ~ -H
.H -~ to -- m O O ~ 1~ ~ 1~ O ,C'.,~ f~ '~ N JJ ~-1 1J
O .j-'., U7 v O (~ 1J I
~ ~ rl O U ~ +~ ''~ is 'i3 ~ ~ U7 rd U ~ ~ N f0 w rl U ~
N O to ~ N v rtS N ~ v ~ N O rtS ~
r0 c~ ~ .~ cti o~
~
~-I ~ ~ N ''C ~ Pa ~ O WD N ~, ~-I N
>~ -H rl . U7 ~-I c0 l ~ U r~ f-I 2f O ~-I U ~, ,~ ~ w ,1Z
b~ S~ TS U1 H 1-~
O
v rtS ~ N N N v ~ v-I ~ ~ ,~ E-~ r~
~ -~ N X O ~ 1mo , ~ W 'LS N C2, ~ ~, v v ~I ~ ~ !~ N ~-t O ~ u~ TJ ~ r.~ ~ v (!7 .-I ~ Q~ cl~ ?-Wtf'd .L.l ~-I r-I
c0 c0 11 ~H v O ~-i f.2~ U1 ~ .~
~
m ?~ ?, ~n ~n ~n U v s0, U
~ it ~ ~ 3 ~n ,~7 3 ?~ ~I
v v H ~, i.~ U7 r-i N ~ !~ -~-i i.~ O
2f U7 (ff -~--I 3 !~ O i-I
U N
1..~ ~ H H 1Wi ~ O O ~-i U1 t(S U
(d ~ to ~ TS M O ~ ~I O ~0 (d O O +~ C2,-~ U~ ~ U Sa ~ U1 -rl ~,'C ~ N O O ~ v ~
1-1 1J .~., JJ O U7 N ~-'I ~ ,~ ~ rn ~ U ''~ rl .1.J ~-I S~ (if in U t~ ~ o ~n 7, H ~, ~n ,H o -.~ o m z3 ~ ~-I ~ w o 3 v (a N ~ df (~ S-a J.~ dI td N S".r ~ ~.1 ''d ca JJ .S".~ v N ''CS U7 PG
.
v 1~ ~ U U v O (~ v 3 U w U w-I ~., >~
~-1 Ul v v S-1 O ~ 00 r-I 'T~ ~1 M (~ (a ~i F. 11 v w N
~
.~ ca -~ U ,-, ,~ m ~ ~n ~. v ~ ~ ~ v -~
o ~ s~ ro o ~
ra ~
~ v >~ o ~ >~ -~I rl m ~I ~ v v ,~ ~
E o ~ ~
;~ o 'd '~ O 'Ci 'L3 O -rl (d U ~-I U 1-~
N U O ,~ ,~ N E~ ~t .R U7 Ul U v ,~
v .H ~ v v -~ C~ ~ ~-I u1 O df ~r rl ~ m ~ ~ rl v N
~
U1 f~S U1 U7 tJ (a ,~ ~ '1-W r-I
~ JJ -H r-I ?~ VJ ~'., r-I v H
v (~ U' f~ U b1 (d (a U r-I .$"'.,-rl ~ r-1 O N
U fir (d ~ v W 1~ ,S'., tJ1 ~ (a v -H N N s~ N ~ ~ -~ rl ~ U u1 rl v -H -H -~ w T5 .i-~ O --u1 U7 'L~ Ul U1 '~ U7 (~ ,~., O N -rl (a N w rl -r-I .~., v ,f' v .H l0 J-1 -rl ''~
-H ra ~ ri -ri ~ ~ r-I 1~ -.-i ~
~ S-i ~ v N ?~ r-I ~ E1 ~ ~ O ~
U 'L3 v '~ ~-I 'Ci 'L3 ~ .sr1r-I O 1.~ ~ r.>r -~ -H O ~ Cm ~ U O '~ ~
v td 'd u7 ''~ ~r O U N O ffS .~', ~ ~ ~ ~ O ~H
~
~ ~ r~ ~ ~2 ~ ~ ~ ~ ~ U7 la w ~ .r ~, v W I ~ w U ~ N ,~
~
O to ~ O Id O I~ O O U1 -~ ~ -H ~ ~
N ,~ ?a N v U 1.~
U
U ~-I ~ ~ ~ I ~I S-I ~.I ~ N 'L~ O b1 U1 U ~ v c!7 v U7 ~ U7 u~ v 1~ .~'., w v -~I w c~ w v w w ~ ~.,-~I O ~ N
. ~ ~ N ~I rtS O ~ H ~ ~-I
-H ~ N N ~..y ~--tU ~-I ~ ~ -~ U
cd ~ O f-i N
'~ (a ~ 'C3 TS ~ 'd 'C3 U7 v ~1 u1 U7 X O lyl7 fp rl CL ~ U N v ~r v r-I ~ v f~ v rH v v ~ ~ ?~ ~ ~ o U
~I -~ O O .~ ~
~., ~
1J -ri JJ .!-~ 1.) l~ .)r., ~-1 ,L,' ~ ''('~ .-t H T3 p., ~]y U N ?i 1-~
Ul 'CS ~.I O N
Id ,~ ''CjrtS rtS cd ~ (6 ~.I 'Lf ~ v v ~ v .~ a N
n ~ r-I -~ ~ ~ .-i ~ ~ .~ >~ N v '~ I-I
'd v 1-~ U >~ r-i ,~ ~ ~ ~ O
tl1 O (0 U7 O tff O N O O 1y, O t~ 1mH O
~ ~ r(S -- rti t~ U O ''d O H
a U7 v r-I U7 U7 1-~ tl7 U7 O S.I -ri ~ U
Sa 1-I Sa t~ cd O w rl U7 H
U ..r., -~ ~1 ~.I -H H b1 H -H O rtf 4-~ rtS
U v O a7 ra ~ I rl ~ 'C r~
.-a O
-!~. >~. J.-~ ~ ~r .~".,,~ ~ ~., U ~ S-i l ' 'i~ U 1.~ ~ ~ ~ N !~.
l i U1 O
~ U
a ~ -O
S ~
~
w r~ r.>r ~ r.C c r. ~ -H f~ -~ , -r ~ U f r ~ H S v 7 v ~ O U ct ~ r ~ ~ f~1 ~ ~
~
-I -.I rt ~ .
, i- I H ~ 3 -I
~ ~ H -I
r~
~ o a c~ ~ ?
''C t~ L31 - is ~
~-I >~ v 'C3 1~ ~ ~.I U ~ f~ O I
~ U O b1 C ~ 1.~ i b1 v rtS O 'f~
~ S -H O S N ~ O U M O v i ~ S~
f~ v -r-I .r T T > .4.1 Zf u~ ~ ~
~ -H T > ~ v c( ~ E E !H rt ~ ~ w Sa ~ ~I
O ~
W
~ ~-I ~ -.-i .-i .i -~ S-a N O u~ ~ ~
U1 5~ ~ ~ v S-I O O U7 N
~
v m r~ ~ cn ~n ~ m m ~ ~ w U ~ ~ ~I
-~I v ~ v ~ ~ ,~ 3 ~
o ,~
~ TS ~ ~ ~ ~ ra o 3 ~ is w .C > U '~ ~ ?~
~ r~
v ~ v -H -H ~ ?~ v rd o >~ ~
tn ' u7 a~
~ w v ~ ZS 23 w ~ ~ N ~ m v ~ S~ o ~ v ~ o N -~I ~ v >~ rl v ~ !~ rtS v it rl v ~ ~ '~ 1.~ o w N o i-~ v 1~ ,-I v 1.~ o ..
v .-I ~
w ~
1.~
~ ~ m v a ~ ~ ~ a a >~ t~ r~ -~I >
''t~ S~ r~ u1 ~-I H a, v .~
U r0 U7 U ~ U ~ U U O ~ ~ N 1-~ 't3 -H ~ ~ v ,~ ~ . wl N i.Wl C~
~ -~ -H ~ ~r ~ -~-i ~ ~ ~ O ~I U v ~ ~I
~ r-i w 1~ O U7 H
7.IU7~r6U ~~~U ~U1.-i(a~-If~~1-INO~~~c~~uN~,'CJ
~ rU w ~ r~ ~ rd ~ +~ U o ~ ~ o v ~
~ o ~ rtf 3 ~ ~ m v 'H
(l7 U S-I U1 U1 U U1 U7 ~ O ~ 1J ~ jl -rl -rl O 'd -.-1 c~ O N
U
~ ~ o ~ ~n ~ ~ ~ ~ b w 3 ,~ v u~
u~ ~I m o >~ U rx ~ o o it ~a o ca o ~a o o v +~ >~ ~ ~ ~
~, o ~, ca ,~ ~ o ~ v ~a v U U U U U U U U ~ U1 rtS U7 I
~ ~ N !~ U 3 3 ~r ~ U7 .G
O O f~ ~ .H O ~ ~ U v O U7 a O O
~ b1 'LS
U1 '~ ~ tl1 U1 'LS U1 U1 (a >~ 1.J ~ S-a 1-~ O (0 !~ f~S . H H ~ U -r-I
ri O >~ rl t~ rl ' (IS ~ ~ rtf f~ S-I O O O ~H O
-H U1 U W U tl~ ~ U
r-i ~ H
3 0 ~ 3 3 0 3 3 y ~ ~ ~ w v '~ i v ~
v N ~
~I ~ U 'a~ ~ ,~ , ~ ~
~ -I
~t ~r S.W 'r ~y ~-I ?W ?i ~ ~ ~r ~ U ~ .-i ~ (d ~ J~ ~ N U
v N
w ~I rtf S~ ~ ~ i~ ~ u1 vo U u7 ~ v ~ U ?~ O rti O w c~ ~ ''C3 -~
N >~ ~ ~
t~ v U (~ f~ v N (d N v v ra ~I ~
O ~ ~ W U f-a I 7-I N -~ v d U~
~
~1 ~r O ~ ~ ~ ~r ~.I ~-1 U7 O W >~, S.-i ~', H ~ w-1 ~.1 v U ,~, 1J ~r ~-1 1.l O v ~-1 .C~ I t!7 ,Q ,~ I ,f~ ~ w ~ ,~ O ,Q U1 v (a '~ td ~ C1,.4 1~ ,~ U
'' 1-I 'C~ ~
~ ~ H ~ ~ 7~
~ W ~
H V H N -H N ~ w , ~ w N ~ v ~ ~ ~
U1 O a M !~ a ~
a 111 (a ~ a -ri a ~ H ~C U7 U rl yU
to w O ~ , .~., ?i U7 ~ . r ?, C2, (d U
M (~ N r-I rl w-i ~y O N r-i O O O
H H H H H
rd O O O O W O
E-f Ei a H U
w v~ a v~ o a rx o c~ a~
c~ U r~ z v ro -r., ..
~o oz v r-1 o\ O r-i N M
(~
U rl N N N N N
H
f~
OI
?~
W
H~
O
G~
00 ~ n :a o ~ ~ c W ~ J '~~ 'n =' r, v ro ~ .r a ~ ~ f = ~ a ~ r, a a -_p ~ ~ v~ _ i O y j i: rJ' ~ W per., O :~ J ~ II ~ ~ ~ a L ~ ~ J W. .' Z > ~ ;J ..,~.y. .D .~ i ~ t~D
O ._>'r ~ ~ ;b 'r.; ;'~ O 'J_. _~ op ~J' Ga. ~ 'J
~"O >~ yN II ~~ ~~,._ :~ '~.a V ~ ~ W W W c,UC ~ ~ ~ II O O > O p y ~ O ~ '~ ~ x ~ O w ~_ on y V G. ~ ~ y W ~ O ~ ~ O 'O y o ~ N y ~ ~ ~ y c ~ " o> o ~ ~ '1' ~ ~, ._~ y ~ v. J -o ~ p o t~
C. ~ ~~~L > Lz1<Ca~.:~s~,.~ ~j~ ~n v~~.
v~ E
a ~x ~ .
.. ~3~a :C ~ '0 U V
~ ~ ~ ~ x a a o N '-' . ,-CO ~ a ~ a ~ ' x . r .a cn vi ~ vi ' ~ N ~ y ~ p ~ p O a O W O
Cue. ~ ~ a~ can ~ ~ ~ ~ ~ Z ~ ~ ~ ~ ~ vp '~J~ '~rJ O~ Uj tn ,~ N . :~ C "'' ~n y ~ ~, 00 m ~ 07 a ~ M ~. ~ .f' = r~ ~ x ~ ~ C C:
yv ~ '~ ~ ~ N p N S vp a '~ E-.n. .c. G~ ~a. :j ~ y ~ ~ uj ~ ~ ~ N ~ cn ~ v~ a n. n. a.
aQ as aQ :~ax . ~b~ b~ o~~ '~.- U
ccd a m .
U Ct, o 'v r~ ~ ~ :c ~ a 3 - , y E ~ 8 ,r>~' ~ ,..~''" ~ ~ '-' 3 -o y !i: ~ vy ~ Q a a' r- v W U W U W U ~ ~~, a G 'U° ~ E- ~ o a ~ ~ ~ Q o z y .. ' ~ o . o a ~t ~ . o ~ . U
~.~~o y~ ~o "~~ ~b~~a. yyyp'~ cn,~
tn 2 0. r~., a, u, o. ti, Q N z n. z " v~ a s x z .- x~ N ..
U C
v~ ~ y ~ ~ U ~ ~ J
a' <n p, C J ~ 'O ~ '.~ . G. c~O c~ ~' ~ vj b ~ G .~ ~ ~ ~ O y ~ vy, G" ~~ ~ y <=C 'C U ~ U ~ J ~ ~i:, ~ ~ O .~ ~ J
V U ~ ~ ~ .U '~J~ ~ b ~ ~ V ~ 'Op '7 j y C 'C ~ ~ j ;C J r :C p ~ i.U., y ~ CL
a. ~ ~ E. '"" ~ C E- ~'.' E" 4 U ~ 3 ~ v '~-, ~ y u-, ,~ y ~ v oyn ~ ~a ~ V ~ ~ n, c J U .Wn ~ ~ G ~ ~ ~ ~ ~ ~ J ~ ~ ~ C ' y'U '~~ ~, y~ Jb ~~ ._=~ ~~"r0v .C y o a o .,~. ~ ~ '° .a '~ b -o O
r° '° ~ ~ n. c ~, ~ > ~ a ~
'~ i Li. '~ ~ ~ -a ~.~c~'~ ~ ~ N'~ ~ ~.x O p U ~ "a r y .~ 'J ~ J ~ j .SC rJ a OA ~..t"', n, v° rz. ' a, ~ '~ a. c a'» 'E ~. ~ m ' Wo ~
a ~ a ~ a aw ~.~ aw ~:~ a ~ ~ : a;
w U Q
U
caz. n. a a ~ g _.QQQoWC ua.
SEQUENCE LISTING
<110> INCYTE PHARMACEUTICALS, INC.
TANG, Y. Tom HILLMAN, Jennifer L.
YUE, Henry AZIMZAI, Yalda BAUGHN, Mariah R.
TRAM, Bao <120> HUMAN LIPID-ASSOCIATED PROTEINS
<130> PF-0676 PCT
<140> To Be Assigned <141> Herewith <150> 60/120,703; 60/142,762 <151> 1999-02-19; 1999-07-08 <160> 24 <170> PERL Program <210> 1 <211> 331 <212> PRT
<213> Homo Sapiens <220>
<221> misc_feature <223> Incyte ID No: 161190CD1 <400> 1 Met Asp Ser Glu Lys Lys Arg Phe Thr Glu Glu Ala Thr Lys Tyr Phe Arg Glu Arg Val Ser Pro Val His Leu Gln Ile Leu Leu Thr Asn Asn Glu Ala Trp Lys Arg Phe Val Thr Ala Ala Glu Leu Pro Arg Asp Glu Ala Asp Ala Leu Tyr Glu Ala Leu Lys Lys Leu Arg Thr Tyr Ala Ala Ile Glu Asp Glu Tyr Val Gln Gln Lys Asp Glu Gln Phe Arg Glu Trp Phe Leu Lys Glu Phe Pro Gln Val Lys Arg Lys Ile Gln Glu Ser Ile Glu Lys Leu Arg Ala Leu Ala Asn Gly Ile Glu Glu Val His Arg Gly Cys Thr Ile Ser Asn Val Val Ser Ser Ser Thr Gly Ala Ala Ser Gly Ile Met Ser Leu Ala Gly Leu Val Leu Ala Pro Phe Thr Ala Gly Thr Ser Leu Ala Leu Thr Ala Ala Gly Val Gly Leu Gly Ala Ala Ser Ala Val Thr Gly Ile Thr Thr Ser Ile Val Glu His Ser Tyr Thr Ser Ser Ala Glu Ala Glu Ala Ser Arg Leu Thr Ala Thr Ser Ile Asp Arg Leu Lys Val Phe Lys Glu Val Met Arg Asp Ile Thr Pro Asn Leu Leu Ser Leu Leu Asn Asn Tyr Tyr Glu Ala Thr Gln Thr Ile Gly Ser Glu Ile Arg Ala Ile Arg Gln Ala Arg Ala Arg Ala Arg Leu Pro Val Thr Thr Trp Arg Ile Ser Ala Gly Ser Gly Gly Gln Ala Glu Arg Thr Ile Ala Gly Thr Thr Arg Ala Val Ser Arg Gly Ala Arg Ile Leu Ser Ala Thr Thr Ser Gly Ile Phe Leu Ala Leu Asp Val Val Asn Leu Val Tyr Glu Ser Lys His Leu His Glu Gly Ala Lys Ser Ala Ser Ala Glu Glu Leu Arg Arg Gln Ala Gln Glu Leu Glu Glu Asn Leu Met Glu Leu Thr Gln Ile Tyr Gln Arg Leu Asn Pro Cys His Thr His <210> 2 <211> 480 <212> PRT
<213> Homo sapiens <220>
<221> misc_feature <223> Incyte ID No: 1292575CD1 <400> 2 Met Asn Gly Glu Glu Glu Phe Phe Asp Ala Val Thr Gly Phe Asp Ser Asp Asn Ser Ser Gly Glu Phe Ser Glu Ala Asn Gln Lys Val Thr Gly Met Ile Asp Leu Asp Thr Ser Lys Asn Asn Arg Ile Gly Lys Thr Gly Glu Arg Pro Ser Gln Glu Asn Gly Ile Gln Lys His Arg Thr Ser Leu Pro Ala Pro Met Phe Ser Arg Ser Asp Phe Ser Val Trp Thr Ile Leu Lys Lys Cys Val Gly Leu Glu Leu Ser Lys Ile Thr Met Pro Ile Ala Phe Asn Glu Pro Leu Ser Phe Leu Gln Arg Ile Thr Glu Tyr Met Glu His Val Tyr Leu Ile His Arg Ala Ser Cys Gln Pro Gln Pro Leu Glu Arg Met Gln Ser Val Ala Ala Phe Ala Val Ser Ala Val Ala Ser Gln Trp Glu Arg Thr Gly Lys Pro Phe Asn Pro Leu Leu Gly Glu Thr Tyr Glu Leu Ile Arg Glu Asp Leu Gly Phe Arg Phe Ile Ser Glu Gln Val Ser His His Pro Pro Ile Ser Ala Phe His Ser Glu Gly Leu Asn His Asp Phe Leu Phe His Gly Ser Ile Tyr Pro Lys Leu Lys Phe Trp Gly Lys Ser Val Glu Ala Glu Pro Arg Gly Thr Ile Thr Leu Glu Leu Leu Lys His Asn Glu Ala Tyr Thr Trp Thr Asn Pro Thr Cys Cys Val His Asn Val Ile Ile Gly Lys Leu Trp Ile Glu Gln Tyr Gly Thr Val Glu Ile Leu Asn His Arg Thr Gly His Lys Cys Val Leu His Phe Lys Pro Cys Gly Leu Phe Gly Lys Glu Leu His Lys Val Glu Gly His Ile Gln Asp Lys Asn Lys Lys Lys Leu Phe Met Ile Tyr Gly Lys Trp Thr Glu Cys Leu Trp Gly Ile Asp Pro Val Ser Tyr Glu Ser Phe Lys Lys Gln Glu Arg Arg Gly Asp His Leu Arg Lys Ala WO 00/49043 CA 02362300 2001-07-31 pCT~S00/04160 Lys Leu Asp Glu Asp Ser Gly Lys Ala Asp Ser Asp Val Ala Asp Asp Val Pro Val Ala Gln Glu Thr Val Gln Val Ile Pro Gly Ser Lys Leu Leu Trp Arg Ile Asn Thr Arg Pro Pro Asn Ser Ala Gln Met Tyr Asn Phe Thr Ser Phe Thr Val Ser Leu Asn Glu Leu Glu Thr Gly Met Glu Lys Thr Leu Pro Pro Thr Asp Cys Arg Leu Arg Pro Asp Ile Arg Gly Met Glu Asn Gly Asn Met Asp Leu Ala Ser Gln Glu Lys Glu Arg Leu Glu Glu Lys Gln Arg Glu Ala Arg Arg Glu Arg Ala Lys Glu Glu Ala Glu Trp Gln Thr Arg Trp Phe Tyr Pro Gly Asn Asn Pro Tyr Thr Gly Thr Pro Asp Trp Leu Tyr Ala Gly Asp Tyr Phe Glu Arg Asn Phe Ser Asp Cys Pro Asp Ile Tyr <210> 3 <211> 409 <212> PRT
<213> Homo Sapiens <220>
<221> misc_feature <223> Incyte ID No: 2454393CD1 <400> 3 Met Ala Thr Ser Val Gly His Arg Cys Leu Gly Leu Leu His Gly Val Ala Pro Trp Arg Ser Ser Leu His Pro Cys Glu Ile Thr Ala Leu Ser Gln Ser Leu Gln Pro Leu Arg Lys Leu Pro Phe Arg Ala Phe Arg Thr Asp Ala Arg Lys Ile His Thr Ala Pro Ala Arg Thr Met Phe Leu Leu Arg Pro Leu Pro Ile Leu Leu Val Thr Gly Gly Gly Tyr Ala Gly Tyr Arg Gln Tyr Glu Lys Tyr Arg Glu Arg Glu Leu Glu Lys Leu Gly Leu Glu Ile Pro Pro Lys Leu Ala Gly His Trp Glu Val Ala Leu Tyr Lys Ser Val Pro Thr Arg Leu Leu Ser Arg Ala Trp Gly Arg Leu Asn Gln Val Glu Leu Pro His Trp Leu Arg Arg Pro Val Tyr Ser Leu Tyr Ile Trp Thr Phe Gly Val Asn Met Lys Glu Ala Ala Val Glu Asp Leu His His Tyr Arg Asn Leu Ser Glu Phe Phe Arg Arg Lys Leu Lys Pro Gln Ala Arg Pro Val Cys Gly Leu His Ser Val Ile Ser Pro Ser Asp Gly Arg Ile Leu Asn Phe Gly Gln Val Lys Asn Cys Glu Val Glu Gln Val Lys Gly Val Thr Tyr Ser Leu Glu Ser Phe Leu Gly Pro Arg Met Cys Thr Glu Asp Leu Pro Phe Pro Pro Ala Ala Ser Cys Asp Ser Phe Lys Asn Gln Leu Val Thr Arg Glu Gly Asn Glu Leu Tyr His Cys Val Ile Tyr Leu Ala Pro Gly Asp Tyr His Cys Phe His Ser Pro Thr WO 00/49043 CA 02362300 2001-07-31 pCT~S00/04160 Asp Trp Thr Val Ser His Arg Arg His Phe Pro Gly Ser Leu Met Ser Val Asn Pro Gly Met Ala Arg Trp Ile Lys Glu Leu Phe Cys His Asn Glu Arg Val Val Leu Thr Gly Asp Trp Lys His Gly Phe Phe Ser Leu Thr Ala Val Gly Ala Thr Asn Val Gly Ser ile Arg Ile Tyr Phe Asp Arg Asp Leu His Thr Asn Ser Pro Arg His Ser Lys Gly Ser Tyr Asn Asp Phe Ser Phe Val Thr His Thr Asn Arg Glu Gly Val Pro Met Arg Lys Gly Glu His Leu Gly Glu Phe Asn Leu Gly Ser Thr Ile Val Leu Ile Phe Glu Ala Pro Lys Asp Phe Asn Phe Gln Leu Lys Thr Gly Gln Lys Ile Arg Phe Gly Glu Ala Leu Gly Ser Leu <210> 4 <211> 759 <212> PRT
<213> Homo Sapiens <220>
<221> misc_feature <223> Incyte ID No: 2766980CD1 <400> 4 Met Glu Ser Ser Pro Phe Asn Arg Arg Gln Trp Thr Ser Leu Ser Leu Arg Val Thr Ala Lys Glu Leu Ser Leu Val Asn Lys Asn Lys Ser Ser Ala Ile Val Glu Ile Phe Ser Lys Tyr Gln Lys Ala Ala Glu Glu Thr Asn Met Glu Lys Lys Arg Ser Asn Thr Glu Asn Leu Ser Gln His Phe Arg Lys Gly Thr Leu Thr Val Leu Lys Lys Lys Trp Glu Asn Pro Gly Leu Gly Ala Glu Ser His Thr Asp Ser Leu Arg Asn Ser Ser Thr Glu Ile Arg His Arg Ala Asp His Pro Pro Ala Glu Val Thr Ser His Ala Ala Ser Gly Ala Lys Ala Asp Gln Glu Glu Gln Ile His Pro Arg Ser Arg Leu Arg Ser Pro Pro Glu Ala Leu Val Gln Gly Arg Tyr Pro His Ile Lys Asp Gly Glu Asp Leu Lys Asp His Ser Thr Glu Ser Lys Lys Met Glu Asn Cys Leu Gly Glu Ser Arg His Glu Val Glu Lys Ser Glu Ile Ser Glu Asn Thr Asp Ala Ser Gly Lys Ile G1u Lys Tyr Asn Val Pro Leu Asn Arg Leu Lys Met Met Phe Glu Lys Gly Glu Pro Thr Gln Thr Lys Ile Leu Arg Ala Gln Ser Arg Ser Ala Ser Gly Arg Lys Ile Ser Glu Asn Ser Tyr Ser Leu Asp Asp Leu Glu Ile Gly Pro Gly Gln Leu Ser Ser Ser Thr Phe Asp Ser Glu Lys Asn Glu Ser Arg Arg Asn Leu Glu Leu Pro Arg Leu Ser Glu Thr Ser Ile Lys Asp Arg Met Ala Lys Tyr Gln Ala Ala Val Ser Lys Gln Ser Ser Ser Thr Asn Tyr Thr Asn Glu Leu Lys Ala Ser Gly Gly Glu Ile Lys Ile His Lys Met Glu Gln Lys Glu Asn Val Pro Pro Gly Pro Glu Val Cys Ile Thr His Gln Glu Gly Glu Lys Ile Ser Ala Asn Glu Asn Ser Leu Ala Val Arg Ser Thr Pro Ala Glu Asp Asp Ser Arg Asp Ser Gln Val Lys Ser Glu Val Gln Gln Pro Val His Pro Lys Pro Leu Ser Pro Asp Ser Arg Ala Ser Ser Leu Ser Glu Ser Ser Pro Pro Lys Ala Met Lys Lys Phe Gln Ala Pro Ala Arg Glu Thr Cys Val Glu Cys Gln Lys Thr Val Tyr Pro Met Glu Arg Leu Leu Ala Asn Gln Gln Val Phe His Ile Ser Cys Phe Arg Cys Ser Tyr Cys Asn Asn Lys Leu Ser Leu Gly Thr Tyr Ala Ser Leu His Gly Arg Ile Tyr Cys Lys Pro His Phe Asn Gln Leu Phe Lys Ser Lys Gly Asn Tyr Asp Glu Gly Phe Gly His Arg Pro His Lys Asp Leu Trp Ala Ser Lys Asn Glu Asn Glu Glu Ile Leu Glu Arg Pro Ala Gln Leu Ala Asn Ala Arg Glu Thr Pro His Ser Pro Gly Val Glu Asp Ala Pro Ile Ala Lys Val Gly Val Leu Ala Ala Ser Met Glu Ala Lys Ala Ser Ser Gln Gln Glu Lys Glu Asp Lys Pro Ala Glu Thr Lys Lys Leu Arg Ile Ala Trp Pro Pro Pro Thr Glu Leu Gly Ser Ser Gly Ser Ala Leu Glu Glu Gly Ile Lys Met Ser Lys Pro Lys Trp Pro Pro Glu Asp Glu Ile Ser Lys Pro Glu Val Pro Glu Asp Val Asp Leu Asp Leu Lys Lys Leu Arg Arg Ser Ser Ser Leu Lys Glu Arg Ser Arg Pro Phe Thr Val Ala Ala Ser Phe Gln Ser Thr Ser Val Lys Ser Pro Lys Thr Val Ser Pro Pro Ile Arg Lys Gly Trp Ser Met Ser Glu Gln Ser Glu Glu Ser Val Gly Gly Arg Val Ala Glu Arg Lys Gln Val Glu Asn Ala Lys Ala Ser Lys Lys Asn Gly Asn Val Gly Lys Thr Thr Trp Gln Asn Lys Glu Ser Lys Gly Glu Thr Gly Lys Arg Ser Lys Glu Gly His Ser Leu Glu Met Glu Asn Glu Asn Leu Val Glu Asn Gly Ala Asp Ser Asp Glu Asp Asp Asn Ser Phe Leu Lys Gln Gln Ser Pro Gln Glu Pro Lys Ser Leu Asn Trp Ser Ser Phe Val Asp Asn Thr Phe Ala Glu Glu Phe Thr Thr Gln Asn Gln Lys Ser Gln Asp Val Glu Leu Trp Glu Gly Glu Val Val Lys Glu Leu Ser Val Glu Glu Gln Ile Lys Arg Asn Arg Tyr Tyr Asp Glu Asp Glu Asp Glu Glu <210> 5 CA 02362300 2001-07-31 PCZ'/US00/04160 <211> 226 <212> PRT
<213> Homo Sapiens <220>
<221> misc_feature <223> Incyte ID No: 2768356CD1 <400> 5 Met Glu Ile Val Pro Ile Gly Thr Thr His Val Thr Leu Pro Val Phe Gly Asp His Phe Glu Trp Asn Lys Val Thr Ser Cys Ile His Asn Ile Leu Ser Gly Gln Arg Trp Ile Glu His Tyr Gly Glu Ile Val Ile Lys Asn Leu His Asp Asp Ser Cys Tyr Cys Lys Val Asn Phe Ile Lys Ala Lys Tyr Trp Ser Thr Asn Ala His Glu Ile Glu Gly Thr Val Phe Asp Arg Ser Gly Lys Ala Val His Arg Leu Phe Gly Lys Trp His Glu Ser Ile Tyr Cys Gly Gly Gly Ser Ser Ser Ala Cys Val Trp Arg Ala Asn Pro Met Pro Lys Gly Tyr Glu Gln Tyr Tyr Ser Phe Thr Gln Phe Ala Leu Glu Leu Asn Glu Met Asp Pro Ser Ser Lys Ser Leu Leu Pro Pro Thr Asp Thr Arg Phe Arg Pro Asp Gln Arg Phe Leu Glu Glu Gly Asn Leu Glu Glu Ala Glu Ile Gln Lys Gln Arg Ile Glu Gln Leu Gln Arg Glu Arg Arg Arg Val Leu Glu Glu Asn His Val Glu His Gln Pro Arg Phe Phe Arg Lys Ser Asp Asp Asp Ser Trp Val Ser Asn Gly Thr Tyr Leu Glu Leu Arg Lys Asp Leu Gly Phe Ser Lys Leu Asp His Pro Val Leu Trp <210> 6 <211> 500 <212> PRT
<213> Homo Sapiens <220>
<221> misc_feature <223> Incyte ID No: 5324145CD1 <400> 6 Met Tyr Cys Pro Glu Ser Ala Val Ile Leu Leu Ser Thr Thr Val Leu Glu Asn Val Leu Gln Pro Phe His Phe Arg Ala Gly Thr Met Ser Lys Leu Pro Lys Phe Glu Ile Glu Leu Pro Ala Ala Pro Lys Ser Thr Lys Pro Ser Leu Ser Glu Arg Asp Ile Ala Met Ala Thr Ile Tyr Gly Gln Leu Tyr Val Leu Phe Leu Arg His His Ser Arg Thr Ser Asn Ser Thr Gly Ala Glu Val Val Leu Tyr His Leu Pro Arg Glu Gly Ala Cys Lys Lys Met His Ile Leu Lys Leu Asn Arg Thr Gly Lys Phe Ala Leu Asn Val Val Asp Asn Leu Val Val Val WO 00/49043 CA 02362300 2001-07-31 pCT~S00/04160 His His Gln Asp Thr Glu Thr Ser Val Ile Phe Asp Ile Lys Leu Arg Gly Glu Phe Asp Gly Ser Val Thr Phe His His Pro Val Leu Pro Ala Arg Ser Ile Gln Pro Tyr Gln Ile Pro Ile Thr Gly Pro Ala Ala Val Thr Ser Gln Ser Pro Val Pro Cys Lys Leu Tyr Ser Ser Ser Trp Ile Val Phe Gln Pro Asp Ile Ile Ile Ser Ala Ser Gln Gly Tyr Leu Trp Asn Leu Gln Val Lys Leu Glu Pro Ile Val Asn Leu Leu Pro Asp Lys Gly Arg Leu Met Asp Phe Leu Leu Gln Arg Lys Glu Cys Lys Met Val Ile Leu Ser Val Cys Ser Gln Met Leu Ser Glu Ser Asp Arg Ala Ser Leu Pro Val Ile Ala Thr Val Phe Asp Lys Leu Asn His Glu Tyr Lys Lys Tyr Leu Asp Ala Glu Gln Ser Tyr Ala Met Ala Val Glu Ala Gly Gln Ser Arg Ser Ser Pro Leu Leu Lys Arg Pro Val Arg Thr Gln Ala Val Leu Asp Gln Ser Asp Val Tyr Thr His Val Leu Ser Ala Phe Val Glu Lys Lys Glu Met Pro His Lys Phe Val Ile Ala Val Leu Met Glu Tyr Ile Arg Ser Leu Asn Gln Phe Gln Ile Ala Val Gln His Tyr Leu His Glu Leu Val Ile Lys Thr Leu Val Gln His Asn Leu Phe Tyr Met Leu His Gln Phe Leu Gln Tyr His Val Leu Ser Asp Ser Lys Pro Leu Ala Cys Leu Leu Leu Ser Leu Glu Ser Phe Tyr Pro Pro Ala His Gln Leu Ser Leu Asp Met Leu Lys Arg Leu Ser Thr Ala Asn Asp Glu Ile Val Glu Val Leu Leu Ser Lys His Gln Val Leu Ala Ala Leu Arg Phe Ile Arg Gly Ile Gly Gly His Asp Asn Ile Ser Ala Arg Lys Phe Leu Asp Ala Ala Lys Gln Thr Glu Asp Asn Met Leu Phe Tyr Thr Ile Phe Arg Phe Phe Glu Gln Arg Asn Gln Arg Leu Arg Gly Ser Pro Asn Phe Thr Pro Gly Glu His Cys Glu Glu His Val Ala Phe Phe Lys Gln Ile Phe Gly Asp Gln Ala Leu Met Arg Pro Thr Thr Phe <210> 7 <211> 272 <212> PRT
<213> Homo Sapiens <220>
<221> misc_feature <223> Incyte ID No: 1004646CD1 <400> 7 Met Ser Cys His Asn Cys Ser Asp Pro Gln Val Leu Cys Ser Ser Gly Gln Leu Phe Leu Gln Pro Leu Trp Asp His Leu Arg Ser Trp Glu Ala Leu Leu Gln Ser Pro Phe Phe Pro Val Ile Phe Ser Ile Thr Thr Tyr Val Gly Phe Cys Leu Pro Phe Val Val Leu Asp Ile Leu Cys Ser Trp Val Pro Ala Leu Arg Arg Tyr Lys Ile His Pro Asp Phe Ser Pro Ser Ala Gln Gln Leu Leu Pro Cys Leu Gly Gln Thr Leu Tyr Gln His Val Met Phe Val Phe Pro Val Thr Leu Leu His Trp Ala Arg Ser Pro Ala Leu Leu Pro His Glu Ala Pro Glu Leu Leu Leu Leu Leu His His Ile Leu Phe Cys Leu Leu Leu Phe Asp Met Glu Phe Phe Val Trp His Leu Leu His His Lys Val Pro Trp Leu Tyr Arg Thr Phe His Lys Val His His Gln Asn Ser Ser Ser Phe Ala Leu Ala Thr Gln Tyr Met Ser Val Trp Glu Leu Phe Ser Leu Gly Phe Phe Asp Met Met Asn Val Thr Leu Leu Gly Cys His Pro Leu Thr Thr Leu Thr Phe His Val Val Asn Ile Trp Leu Ser Val Glu Asp His Ser Gly Tyr Asn Phe Pro Trp Ser Thr His Arg Leu Val Pro Phe Gly Trp Tyr Gly Gly Val Val His His Asp Leu His His Ser His Phe Asn Cys Asn Phe Ala Pro Tyr Phe Thr His Trp Asp Lys Ile Leu Gly Thr Leu Arg Thr Ala Ser Val Pro Ala Arg <210> 8 <211> 282 <212> PRT
<213> Homo sapiens <220>
<221> misc_feature <223> Incyte ID No: 1802851CD1 <400> 8 Met Ser Gly Gly Trp Met Ala Gln Val Gly Ala Trp Arg Thr Gly Ala Leu Gly Leu Ala Leu Leu Leu Leu Leu Gly Leu Gly Leu Gly Leu Glu Ala Ala Ala Ser Pro Leu Ser Thr Pro Thr Ser Ala Gln Ala Ala Gly Pro Ser Ser Gly Ser Cys Pro Pro Thr Lys Phe Gln Cys Arg Thr Ser Gly Leu Cys Val Pro Leu Thr Trp Arg Cys Asp Arg Asp Leu Asp Cys Ser Asp Gly Ser Asp Glu Glu Glu Cys Arg Ile Glu Pro Cys Thr Gln Lys Gly Gln Cys Pro Pro Pro Pro Gly Leu Pro Cys Pro Cys Thr Gly Val Ser Asp Cys Ser Gly Gly Thr Asp Lys Lys Leu Arg Asn Cys Ser Arg Leu Ala Cys Leu Ala Gly Glu Leu Arg Cys Thr Leu Ser Asp Asp Cys Ile Pro Leu Thr Trp Arg Cys Asp Gly His Pro Asp Cys Pro Asp Ser Ser Asp Glu Leu Gly Cys Gly Thr Asn Glu Ile Leu Pro Glu Gly Asp Ala Thr Thr Met Gly Pro Pro Val Thr Leu G1u Ser Val Thr Ser Leu Arg Asn Ala Thr Thr Met Gly Pro Pro Val Thr Leu Glu Ser Val Pro Ser Val Gly Asn Ala Thr Ser Ser Ser Ala Gly Asp Gln Ser Gly Ser Pro Thr Ala Tyr Gly Val Ile Ala Ala Ala Ala Val Leu Ser Ala Ser Leu Val Thr Ala Thr Leu Leu Leu Leu Ser Trp Leu Arg Ala Gln Glu Arg Leu Arg Pro Leu Gly Leu Leu Val Ala Met Lys Glu Ser Leu Leu Leu Ser Glu Gln Lys Thr Ser Leu Pro <210> 9 <211> 437 <212> PRT
<213> Homo Sapiens <220>
<221> misc_feature <223> Incyte ID No: 2764333CD1 <400> 9 Met Ser Glu Glu Lys Asp Cys Gly Gly Gly Asp Ala Leu Ser Asn Gly Ile Lys Lys His Arg Thr Ser Leu Pro Ser Pro Met Phe Ser Arg Asn Asp Phe Ser Ile Trp Ser Ile Leu Arg Lys Cys Ile Gly Met Glu Leu Ser Lys Ile Thr Met Pro Val Ile Phe Asn Glu Pro Leu Ser Phe Leu Gln Arg Leu Thr Glu Tyr Met Glu His Thr Tyr Leu Ile His Lys Ala Ser Ser Leu Ser Asp Pro Val Glu Arg Met Gln Cys Val Ala Ala Phe Ala Val Ser Ala Val Ala Ser Gln Trp Glu Arg Thr Gly Lys Pro Phe Asn Pro Leu Leu Gly Glu Thr Tyr Glu Leu Val Arg Asp Asp Leu Gly Phe Arg Leu Ile Ser Glu Gln Val Ser His His Pro Pro Ile Ser Ala Phe His Ala Glu Gly Leu Asn Asn Asp Phe Ile Phe His Gly Ser Ile Tyr Pro Lys Leu Lys Phe Trp Gly Lys Ser Val Glu Ala Glu Pro Lys Gly Thr Ile Thr Leu Glu Leu Leu Glu His Asn Glu Ala Tyr Thr Trp Thr Asn Pro Thr Cys Cys Val His Asn Ile Ile Val Gly Lys Leu Trp Ile Glu Gln Tyr Gly Asn Val Glu Ile Ile Asn His Lys Thr Gly Asp Lys Cys Val Leu Asn Phe Lys Pro Cys Gly Leu Phe Gly Lys Glu Leu His Lys Val Glu Gly Tyr Ile Gln Asp Lys Ser Lys Lys Lys Leu Cys Ala Leu Tyr Gly Lys Trp Thr Glu Cys Leu Tyr Ser Val Asp Pro Ala Thr Phe Asp Ala Tyr Lys Lys Asn Asp Lys Lys Asn Thr Glu Glu Lys Lys Asn Ser Lys Gln Met Ser Thr Ser Glu Glu Leu Asp Glu Met Pro Val Pro Asp Ser Glu Ser Val Phe Ile Ile Pro Gly Ser Val Leu Leu Trp Arg Ile Ala Pro Arg Pro Pro Asn Ser Ala Gln Met Tyr Asn Phe Thr Ser Phe Ala Met Val Leu Asn Glu Val Asp Lys Asp Met Glu Ser Val Ile Pro Lys Thr Asp Cys Arg Leu Arg Pro Asp Ile Arg Ala Met Glu Asn Gly Glu Ile Asp Gln Ala Ser Glu Glu Lys Lys Arg Leu Glu Glu Lys G1n Arg Ala Ala Arg Lys Asn Arg Ser Lys Ser Glu Glu Asp Trp Lys Thr Arg Trp Phe His Gln Gly Pro Asn Pro Tyr Asn Gly Ala Gln Asp Trp Ile Tyr Ser Gly Ser Tyr Trp Asp Arg Asn Tyr Phe Asn Leu Pro Asp Ile Tyr <210> 10 <211> 427 <212> PRT
<213> Homo sapiens <220>
<221> misc_teature <223> Incyte ID No: 2798021CD1 <400> 10 Met Arg Gln Ala Ala Ala Asp Ala Lys Pro Glu Ser Leu Met Lys Arg Leu Glu Glu Glu Ile Lys Phe Asn Leu Tyr Met Val Thr Glu Lys Phe Pro Lys Glu Leu Glu Asn Lys Lys Lys Glu Leu His Phe Leu Gln Lys Val Val Ser Glu Pro Ala Met Gly His Ser Asp Leu Leu Glu Leu Glu Ser Lys Ile Asn Glu Ile Asn Thr Glu Ile Asn Gln Leu Ile Glu Lys Lys Met Met Arg Asn Glu Pro Ile Glu Gly Lys Leu Ser Leu Tyr Arg Gln Gln Ala Ser Ile Ile Ser Arg Lys Lys Glu Ala Lys Ala Glu Glu Leu Gln Glu Ala Lys Glu Lys Leu Ala Ser Leu Glu Arg Glu Ala Ser Val Lys Arg Asn Gln Thr Arg Glu Phe Asp Gly Thr Glu Val Leu Lys Gly Asp Glu Phe Lys Arg Tyr Val Asn Lys Leu Arg Ser Lys Ser Thr Val Phe Lys Lys Lys His Gln Ile Ile Ala Glu Leu Lys Ala Glu Phe Gly Leu Leu Gln Arg Thr Glu Glu Leu Leu Lys Gln Arg His Glu Asn Ile Gln Gln Gln Leu Gln Thr Met Glu Glu Lys Lys Gly Ile Ser Gly Tyr Ser Tyr Thr Gln Glu Glu Leu Glu Arg Val Ser Ala Leu Lys Ser Glu Val Asp Glu Met Lys Gly Arg Thr Leu Asp Asp Met Ser Glu Met Val Lys Lys Leu Tyr Ser Leu Val Ser Glu Lys Lys Ser Ala Leu Ala Ser Val Ile Lys Glu Leu Arg Gln Leu Arg Gln Lys Tyr Gln Glu Leu Thr Gln Glu Cys Asp Glu Lys Lys Ser Gln Tyr Asp Ser Cys Ala Ala Gly Leu Glu Ser Asn Arg Ser Lys Leu Glu Gln Glu Val Arg Arg Leu Arg Glu Glu Cys Leu Gln Glu Glu Ser Arg Tyr His Tyr Thr Asn Cys Met Ile Lys Asn Leu Glu Val Gln Leu Arg Arg Ala Thr Asp Glu Met Lys Ala Tyr Ile Ser Ser Asp Gln Gln Glu Lys Arg Lys Ala Ile Arg Glu Gln Tyr Thr Lys Asn Thr Ala Glu Gln Glu Asn Leu Gly Lys Lys Leu Arg Glu Lys Gln Lys Val Ile Arg Glu Ser His Gly Pro Asn Met Lys Gln Ala Lys Met Trp Arg Asp Leu Glu Gln Leu Met Glu Cys Lys Lys Gln Cys Phe Leu Lys Gln Gln Ser Gln Thr Ser Ile Gly Gln Val Ile Gln Glu Gly Gly Glu Asp Arg Leu Ile Leu <210> 11 <211> 564 <212> PRT
<213> Homo sapiens <220>
<221> misc_feature <223> Incyte ID No: 3335404CD1 <400> 11 Met Asp Ser Arg Tyr Asn Ser Thr Ala Gly Ile Gly Asp Leu Asn Gln Leu Ser Ala Ala Ile Pro Ala Thr Arg Val Glu Val Ser Val Ser Cys Arg Asn Leu Leu Asp Arg Asp Thr Phe Ser Lys Ser Asp Pro Ile Cys Val Leu Tyr Val Gln Gly Val Gly Asn Lys Glu Trp Arg Glu Phe Gly Arg Thr Glu Val Ile Asp Asn Thr Leu Asn Pro Asp Phe Val Arg Lys Phe Ile Leu Asp Tyr Phe Phe Glu Glu Arg Glu Asn Leu Arg Phe Asp Leu Tyr Asp Val Asp Ser Lys Ser Pro Asn Leu Ser Lys His Asp Phe Leu Gly Gln Val Phe Cys Thr Leu Gly Glu Ile Val Gly Ser Gln Gly Ser Arg Leu Glu Lys Pro Ile Val Gly Ile Pro Gly Lys Lys Cys Gly Thr Ile Ile Leu Thr Ala Glu Glu Leu Asn Cys Cys Arg Asp Ala Val Leu Met Gln Phe Cys Ala Asn Lys Leu Asp Lys Lys Asp Phe Phe Gly Lys Ser Asp Pro Phe Leu Val Phe Tyr Arg Ser Asn Glu Asp Gly Ser Phe Thr Ile Cys His Lys Thr Glu Val Val Lys Asn Thr Leu Asn Pro Val Trp Gln Ala Phe Lys Ile Ser Val Arg Ala Leu Cys Asn Gly Asp Tyr Asp Arg Thr Ile Lys Val Glu Val Tyr Asp Trp Asp Arg Asp Gly Ser His Asp Phe Ile Gly Glu Phe Thr Thr Ser Tyr Arg Glu Leu Ser Arg Gly Gln Ser Gln Phe Asn Val Tyr Glu Val Val Asn Pro CA 02362300 2001-07-31 pCT~S00/04160 Lys Lys Lys Gly Lys Lys Lys Lys Tyr Thr Asn Ser Gly Thr Val Thr Leu Leu Ser Phe Leu Val Glu Thr Glu Val Ser Phe Leu Asp Tyr Ile Lys Gly Gly Thr Gln Ile Asn Phe Thr Val Ala Ile Asp Phe Thr Ala Ser Asn Gly Asn Pro Ala Gln Pro Thr Ser Leu His Tyr Met Asn Pro Tyr Gln Leu Asn Ala Tyr Gly Met Ala Leu Lys Ala Val Gly Glu Ile Val Gln Asp Tyr Asp Ser Asp Lys Met Phe Pro Ala Leu Gly Phe Gly Ala Lys Leu Pro Pro Asp Gly Arg Ile Ser His Glu Phe Ala Leu Asn Gly Asn Pro Gln Asn Pro Tyr Cys Asp Gly Ile Glu Gly Val Met Glu Ala Tyr Tyr Arg Ser Leu Lys Ser Val Gln Leu Tyr Gly Pro Thr Asn Phe Ala Pro Val Ile Asn His Val Ala Arg Tyr Ala Ser Ser Val Lys Asp Gly Ser Gln Tyr Phe Val Leu Leu Ile Val Thr Asp Gly Val Ile Ser Asp Met Ala Gln Thr Lys Glu Ser Ile Val Asn Ala Ser Lys Leu Pro Met Ser Ile Ile Ile Val Gly Val Gly Pro Ala Glu Phe Asp Ala Met Val Glu Leu Asp Gly Asp Asp Val Arg Val Ser Ser Arg Gly Lys Tyr Ala Glu Arg Asp Ile Val Gln Phe Val Pro Phe Arg Asp Tyr Ile Asp Arg Ser Gly Asn His Ile Leu Ser Met Ala Arg Leu Ala Lys Asp Val Leu Ala Glu Ile Pro Glu Gln Phe Leu Ser Tyr Met Arg Ala Arg Gly Ile Lys Pro Ser Pro Ala Pro Pro Pro Tyr Thr Pro Pro Thr His Val Leu Gln Thr Gln Ile <210> 12 <211> 297 <212> PRT
<213> Homo Sapiens <220>
<221> misc_feature <223> Incyte ID No: 3735780CD1 <400> 12 Met Met Asp Ser Glu Ala His Glu Lys Arg Pro Pro Ile Leu Thr Ser Ser Lys Gln Asp Ile Ser Pro His Ile Thr Asn Val Gly Glu Met Lys His Tyr Leu Cys Gly Cys Cys Ala Ala Phe Asn Asn Val Ala Ile Thr Phe Pro Ile Gln Lys Val Leu Phe Arg Gln Gln Leu Tyr Gly Ile Lys Thr Arg Asp Ala Ile Leu Gln Leu Arg Arg Asp Gly Phe Arg Asn Leu Tyr Arg Gly Ile Leu Pro Pro Leu Met Gln Lys Thr Thr Thr Leu Ala Leu Met Phe Gly Leu Tyr Glu Asp Leu Ser Cys Leu Leu His Lys His Val Ser Ala Pro Glu Phe Ala Thr WO 00/49043 CA 02362300 2001-07-31 pCT~JS00/04160 Ser Gly Val Ala Ala Val Leu Ala Gly Thr Thr Glu Ala Ile Phe Thr Pro Leu Glu Arg Val Gln Thr Leu Leu Gln Asp His Lys His His Asp Lys Phe Thr Asn Thr Tyr Gln Ala Phe Lys Ala Leu Lys Cys His Gly Ile Gly Glu Tyr Tyr Arg Gly Leu Val Pro Ile Leu Phe Arg Asn Gly Leu Ser Asn Val Leu Phe Phe Gly Leu Arg Gly Pro Ile Lys Glu His Leu Pro Thr Ala Thr Thr His Ser Ala His Leu Val Asn Asp Phe Ile Cys Gly Gly Leu Leu Gly Ala Met Leu Gly Phe Leu Phe Phe Pro Ile Asn Val Val Lys Thr Arg Ile Gln Ser Gln Ile Gly Gly Glu Phe Gln Ser Phe Pro Lys Val Phe Gln Lys Ile Trp Leu Glu Arg Asp Arg Lys Leu Ile Asn Leu Phe Arg Gly Ala His Leu Asn Tyr His Arg Ser Leu Ile Ser Trp Gly Ile Ile Asn Ala Thr Tyr Glu Phe Leu Leu Lys Val Ile <210> 13 <211> 2174 <212> DNA
<213> Homo sapiens <220>
<221> misc_feature <223> Incyte ID No: 161190CB1 <400> 13 ggatccacac agctcagaac agctggatct tgctcacact ctttcaagag aagcttcctt 60 ggacaaaagg accctgcctt ggtgtgagag tgagggcaga gggagctgga gcaagtagaa 120 tttctctaaa taccagctgg ctggggccca ggagattaaa aaacaccggg ctagggttaa 180 gaaaaaaaac gaacccttcc agtcaggtca gtgactggag agctccaagg aaagtctctc 240 agtgacctgg ctgctggcac catggactca gaaaagaaac gctttactga agaggccacc 300 aaatacttcc gggagagagt cagcccagtg catctgcaaa tcctgctgac taacaatgaa 360 gcctggaaga gattcgtgac tgcggctgaa ttgcccaggg atgaggcaga tgctctctac 420 gaagctctga agaagcttag aacatatgca gctattgagg acgaatatgt gcagcagaaa 480 gatgagcagt ttagggaatg gtttttgaaa gagtttcccc aagtcaagag gaagatccag 540 gagtccatag aaaagcttcg tgcccttgca aatggtattg aagaggtcca cagaggctgc 600 accatctcca atgtggtgtc cagctccact ggcgctgcct ctggcatcat gtcccttgct 660 ggtcttgttt tggcaccatt tacagcaggg acgagtctgg cccttactgc agctggggta 720 gggctgggag cagcgtctgc tgtgactggg atcaccacca gcatcgtgga gcactcatac 780 acatcatcag cagaagctga agccagcagg ctgactgcaa ccagcattga ccgattgaag 840 gtatttaagg aagttatgcg tgacatcaca cccaacttac tttcccttct taataattat 900 tacgaagcca cacaaaccat tgggagtgaa atccgtgcca tcaggcaagc cagagccagg 960 gcccgactcc ctgtgaccac ctggcgaatc tcagctggaa gtggaggtca agcagagaga 1020 acgattgcag gcaccacccg ggcagtgagc agaggagccc ggatcctgag tgcgaccact 1080 tcaggcatct tccttgcact ggatgtggtc aaccttgtat acgagtcaaa gcacttgcat 1140 gagggggcaa agtctgcatc tgctgaggag ctgaggcggc aggctcagga gctggaggag 1200 aatctaatgg agctcactca gatctatcag cgtctgaatc catgccatac ccactgaccc 1260 cagaccagtg cagccagcag gggaggtgag ccatacacag gccacgacaa aatgcaggca 1320 ttttattagg gggataaaga gggcaaggta aagtttatgg agctgagtgt tagtgacttt 1380 ggcatttctg tagctgagca cagcagggga ggggttaatg cagatggcaa gtgcaccaag 1440 gagaaggcag gaatgctgga gcctggaata agggaagaga ggggactgga gagtgtgggg 1500 aataggaaga agaaatttcc tttagactaa cgaatatatt ggggggagga atagagggga 1560 ggtgtgcagg aaccagcaat gagaaggcca ggaaaagaaa gagctgaaaa tgcagaaagc 1620 cgaagagtta gaacttttgg atacagcaga agaaacagcg gctccactac cgacctgccc 1680 ccggttcgat gtccttccaa gaatgaagtc tttccctggt gatggtcccc tgccctgtct 1740 ttccagcatc cactctgtct tgtcctcctg gaagtgtatc tcagtcagcc agtggcttct 1800 tgatgatggc ggtggaggtg gtggttgtag tgtgatggat cccctttagg ttatttaggg 1860 gtatatgtcc cctgcttgaa ccctgaaggc caggtaatga gccatggcca ttgtccccag 1920 WO 00/49043 CA 02362300 2001-07-31 pCT~S00/04160 ctgaggacca ggtgtctcta aaaacccaaa catcctggag agtatgcgag aacctaccaa 1980 gaaaaacagt ctcattactc atatacagca ggcaaagaga cagaaaatta actgaaaagc 2040 agtttagaga ctgggggagg ccggatctct agagccatcc tgctgagtgc cctgtgtgta 2100 agtcctaata aactcaccta ctcaccaaaa aaaaaaacga aaaactaaag aacaggagaa 2160 aaaaagggga gggc 2174 <210> 14 <211> 2620 <212> DNA
<213> Homo Sapiens <220>
<221> misc_feature <223> Incyte ID No: 1292575CB1 <400> 14 agtgagctcg gccggcaacc gagggacccg cgtccagatc ttcagtgtct attggatttt 60 tccaagagaa agtttgtaaa attccttaca ctgtagatgt ggatcagata cgatgattca 120 gtagaagagc acatgtcagg ggcagtggag gctggctgct gaaggatgaa cggagaggaa 180 gaattctttg atgccgtcac aggctttgat tctgataact cttctgggga attttcagag 240 gcaaatcaga aagtcacggg aatgattgac ttagacacca gcaaaaataa taggattggg 300 aaaactgggg agaggccctc tcaagagaac ggaattcaga aacacaggac atcgctgccg 360 gctcccatgt tcagcagaag cgacttcagc gtgtggacca tcctgaagaa gtgtgttggc 420 ctggagctgt ccaagatcac gatgccaatc gccttcaacg agcctctgag cttcttgcag 480 cggatcacgg agtacatgga gcacgtgtac ctcatccaca gggcctcctg ccagccccag 540 cccctggaga ggatgcagtc tgtggctgct tttgctgttt cggctgtggc ttcccagtgg 600 gagaggaccg gcaaaccatt taatccactc ttgggagaaa cgtatgaatt aatcagggaa 660 gatttaggat tcagatttat atcggaacag gtcagtcacc acccccccat cagtgcgttc 720 cactcggaag gtctcaacca tgacttcctg ttccatggct ccatctaccc caagctcaag 780 ttctggggca aaagcgtgga ggcggagccc cgaggcacca tcaccctgga gctgctcaaa 840 cataatgaag cctacacctg gaccaacccc acctgctgcg tccacaacgt catcatcggg 900 aagctgtgga tagagcagta tgggacagtg gagattttaa accacagaac tggacataag 960 tgtgtgcttc actttaaacc gtgtggatta tttggaaaag aacttcacaa ggtggaagga 1020 cacattcaag acaaaaacaa aaagaagctc tttatgatct atggcaaatg gacggaatgt 1080 ttgtggggca tagatcctgt ttcgtatgaa tccttcaaga agcaggagag gagaggtgac 1140 cacctgagaa aggccaagct ggatgaagac tccgggaagg ctgacagcga cgtggctgac 1200 gacgtgcctg tggcccagga gaccgtgcag gtcattcctg gcagcaagct gctctggagg 1260 atcaacaccc ggccccccaa ctctgcccag atgtataatt tcaccagttt cactgtgagc 1320 ctcaacgagc tggagacagg catggagaag accctgccac ccacggactg ccgcctgcgc 1380 cctgacatcc gcggcatgga gaatggcaac atggatctgg ccagccagga gaaggagcgg 1440 ctggaggaga agcagagaga agcacggagg gagcgggcca aggaggaggc agagtggcag 1500 acgaggtggt tctacccagg caataacccc tacactggga cccccgactg gttgtatgca 1560 ggggattact ttgagcggaa tttctccgac tgcccagata tctactgagg gcctggaggg 1620 gcctggggcc cgggaccgga ggctgacgag gctggacttc ctcgagtggc cactgtgagc 1680 ctcgtcacag cagaaaccaa cttttctaac gactgagttc gcggagatag catcatccct 1740 gatcaaggat gtaattctaa ttaactgttg attgccaaac atttcactct gctgtgccgt 1800 ctcttcataa agcttcactt gggatcatcg tcttcattaa ggtttcaaca gggaaattct 1860 tcacggcgcc cttttatgtg gcagaaatca gctggggctt gtttagcttc cagcacactc 1920 tcagtcatag catgtgtagc taaaggaagt aatgggaagg ggttcatgtt ctctttataa 1980 tgcagtggca aaaggttctg aaagcctttt aaactcgaac cagtggggga aagatggatc 2040 ttgaagctaa tcctgcagag agttttatag aggccaggga ttgccttcta aattatgata 2100 aaacagaagt gaagagtttc agagcatcag attgagtgaa aagttgtcag attctgtatt 2160 ttttaacaat cttcaataat gtaaagatta cttttaaaat atttaagtta aaactacttg 2220 aatagtattt tgctgaagag caagatatgc attaatcacc ggttttatac tgtccaaaat 2280 gaagcatccc cgtgacaaac cagagtgggc agaagcatcg agagcgtgac aggaaatccc 2340 aagactgctt ccgcctcaga ggcgtcccgg ctgcgattcg ctgccctgtt gtcagtgagg 2400 cctggctgtc accgcacacc gcgtccgtgt ctccaggggg ttcctttctt ctcacacgtc 2460 gcgtgtaccc atagcactct tgtgtttctg tttttcccag tatgcatgtt taaaatagaa 2520 gtgacaagaa tcacatccgg ttgtgtcctg tgggagggtc agaggcagaa tctacttaca 2580 gtggtgtaat taaagttatt taaccaaaaa aaaaaaaaaa 2620 <210> 15 <211> 2426 <212> DNA
<213> Homo Sapiens WO 00/49043 cA 02362300 2001-07-31 pCT/US00/04160 <220>
<221> misc_feature <223> Incyte ID No: 2454393CB1 <400> 15 gaaacactag acatgatacg aattcgagct cgtacccttc gaggcacgct gagaaggagc 60 agacaagatg gcgacgtccg tggggcaccg atgtctggga ttactgcacg gggtcgcgcc 120 gtggcggagc agcctccatc cctgtgagat cactgccctg agccaatccc tacagccctt 180 acggaagctg ccttttagag cctttcgcac agatgccaga aaaatccaca ctgcccctgc 240 ccgaaccatg ttcctgctgc gtcccctgcc cattctgttg gtgacaggcg gcgggtatgc 300 agggtaccgg cagtatgaga agtacaggga gcgagagctg gagaagctgg gattggagat 360 tccacccaaa cttgctggtc actgggaggt ggctttgtac aagtcagtgc caacgcgctt 420 gctgtcacgg gcctggggtc gcctcaatca ggtggagctg ccacactggc tgcgcaggcc 480 cgtctacagc ctgtacatct ggacgtttgg ggtgaacatg aaagaggccg ctgtggagga 540 cctgcatcac taccgcaacc tcagcgagtt cttccggcgc aagctgaagc cgcaggcccg 600 gcctgtctgt ggcctgcaca gcgtgattag cccatcggat ggaaggatcc tcaactttgg 660 gcaggtgaag aactgtgagg tggagcaggt aaagggggtc acctactccc tggagtcgtt 720 cctgggcccg cgtatgtgca cagaggacct gcccttccca ccagccgcgt cgtgtgactc 780 cttcaagaac cagctggtca cccgggaagg gaatgagctc tatcactgtg tcatctacct 840 ggcccctggg gactaccact gcttccactc ccccaccgac tggactgtgt cccaccggcg 900 ccacttccca ggctccctga tgtcagtgaa ccctggcatg gctcgctgga tcaaagagct 960 cttctgccat aacgagcggg tggtcctgac gggggactgg aaacatggct tcttctcact 1020 gacagctgtg ggggccacca acgtgggctc cattcgcatc tactttgacc gggacctgca 1080 cacaaacagc ccaaggcaca gcaagggctc ctacaatgac ttcagcttcg tgacgcacac 1140 caatagagag ggcgtcccca tgcgtaaggg cgagcacctg ggcgagttca acctgggctc 1200 caccatcgtg ctcatcttcg aggcccccaa ggacttcaat ttccagctga aaacaggaca 1260 gaaaatccgc tttggggaag ccctgggctc gctctagagt ctctttcctg attatggctg 1320 ctaagggatc ttttccaaac agagtgaggg tcttttcaag agggaggccc atgaggccat 1380 ccaggtaagg gcctgcctca gcgtggttgg gagtctgacc aggtaggact tgaatgattc 1440 ggctaccacc tgttccagag gtgcagacaa gaggtggcga gagcccccat catgcccctc 1500 aacctatccc gttccttctg cctacaaata aaaagtgcag gctggaatga tctcagtcac 1560 atttggatct ttttaaacac tgtatagacg gaagagcctg cattcctgac cgaaccttca 1620 gttggtctcg gttgtcgttt tttcttgctg ctcctccccc catcacctga gctgttttct 1680 gttggcccct tttgtttttt ggccttaacg ctcctgctgc acagggtgag gtacctcctt 1740 ggcacagact gtggatgcct ctcccccagc agagccacac agccttcgtg acaactgctt 1800 tccgttccca cattcacctc atcctgctct ttagaaaaag cagtctttgt gcttgtggct 1860 gaacgcatca ccctggactc tgctagtgtc ttctgaggac actgatgaca ctgattaatg 1920 atacagacct ttgcaggacc tgatgagtga cccttctgga gctggccagg tcctctgcag 1980 caggcaagac caatcaatca ctgaacctgc ctcatggcac cagagtgaac agggcaggca 2040 ggtagtaggc ccagctgggg aaatgggaga gttcctgtcc ccctccacat atccctacat 2100 gaaatatggg aaagttgctg ctattgattc agggtctgtc ttggaggcag aggacccttg 2160 gtggatagtt ggtcagtgcc tggaaaacct gtcccagttt atcaggaacg caggcctggg 2220 gagcccccag tggcggggac agggccagat ttcatgttga ccctggggat gctgtgaatt 2280 tctcctgcag gagagacatc attgaatttt ttcaactgta tcagtagcac agtatttttg 2340 tatgaaaagt gggagacttc tgaacagtaa ttcatttaat tgcaaagcat tttgaaataa 2400 aaaaaatcaa acttaaaaaa aaaaaa 2426 <210> 16 <211> 3705 <212> DNA
<213> Homo sapiens <220>
<221> misc_feature <223> Incyte ID No: 2766980CB1 <400> 16 ggccgcagga gcagtaggtg ttagcagctt ggtcgcgaca ggtgcgctag gtagagcgcc 60 gggacctgtg acagggctgg tagcagcgca gaggaaaggc ggcttttagc caggtatttc 120 agtgtctgta gacaagatgg aatcatctcc atttaataga cggcaatgga cctcactatc 180 attgagggta acagccaaag aactttctct tgtcaacaag aacaagtcat cggctattgt 240 ggaaatattc tccaagtacc agaaagcagc tgaagaaaca aacatggaga agaagagaag 300 taacaccgaa aatctctccc agcactttag aaaggggacc ctgactgtgt taaagaagaa 360 gtgggagaac ccagggctgg gagcagagtc tcacacagac tctctacgga acagcagcac 420 tgagattagg cacagagcag accatcctcc tgctgaagtg acaagccacg ctgcttctgg 480 agccaaagct gaccaagaag aacaaatcca ccccagatct agactcaggt cacctcctga 540 agccctcgtt cagggtcgat atccccacat caaggacggt gaggatctta aagaccactc 600 aacagaaagt aaaaaaatgg aaaattgtct aggagaatcc aggcatgaag tagaaaaatc 660 agaaatcagt gaaaacacag atgcttcggg caaaatagag aaatataatg ttccgctgaa 720 caggcttaag atgatgtttg agaaaggtga accaactcaa actaagattc tccgggccca 780 aagccgaagt gcaagtggaa ggaagatctc tgaaaacagc tattctctag atgacctgga 840 aataggccca ggtcagttgt catcttctac atttgactcg gagaaaaatg agagtagacg 900 aaatctggaa cttccacgcc tctcagaaac ctctataaag gatcgaatgg ccaagtacca 960 ggcagctgtg tccaaacaaa gcagctcaac caactataca aatgagctga aagccagtgg 1020 tggcgaaatc aaaattcata aaatggagca aaaggagaat gtgcccccag gtcctgaggt 1080 ctgcatcacc catcaggaag gggaaaagat ttctgcaaat gagaatagcc tggcagtccg 1140 ttccacccct gccgaagatg actcccgtga ctcccaggtt aagagtgagg ttcaacagcc 1200 tgtccatccc aagccactaa gtccagattc cagagcctcc agtctttctg aaagttctcc 1260 tcccaaagca atgaagaagt ttcaggcacc tgcaagagag acctgcgtgg aatgtcagaa 1320 gacagtctat ccaatggagc gtctcttggc caaccagcag gtgtttcaca tcagctgctt 1380 ccgttgctcc tattgcaaca acaaactcag tctaggaaca tatgcatctt tacatggaag 1440 aatctattgt aagcctcact tcaatcaact ctttaaatct aagggcaact atgatgaagg 1500 ctttgggcac agaccacaca aggatctatg ggcaagcaaa aatgaaaacg aagagatttt 1560 ggagagacca gcccagcttg caaatgcaag ggagacccct cacagcccag gggtagaaga 1620 tgcccctatt gctaaggtgg gtgtcctggc tgcaagtatg gaagccaagg cctcctctca 1680 gcaggagaag gaagacaagc cagctgaaac caagaagctg aggatcgcct ggccaccccc 1740 cactgaactt ggaagttcag gaagtgcctt ggaggaaggg atcaaaatgt caaagcccaa 1800 atggcctcct gaagacgaaa tcagcaagcc cgaagttcct gaggatgtcg atctagatct 1860 gaagaagcta agacgatctt cttcactgaa ggaaagaagc cgcccattca ctgtagcagc 1920 ttcatttcaa agcacctctg tcaagagccc aaaaactgtg tccccaccta tcaggaaagg 1980 ctggagcatg tcagagcaga gtgaagagtc tgtgggtgga agagttgcag aaaggaaaca 2040 agtggaaaat gccaaggctt ctaagaagaa tgggaatgtg ggaaaaacaa cctggcaaaa 2100 caaagaatct aaaggagaga cagggaagag aagtaaggaa ggtcatagtt tggagatgga 2160 gaatgagaat cttgtagaaa atggtgcaga ctccgatgaa gatgataaca gcttcctcaa 2220 acaacaatct ccacaagaac ccaagtctct gaattggtcg agttttgtag acaacacctt 2280 tgctgaagaa ttcactactc agaatcagaa atcccaggat gtggaactct gggagggaga 2340 agtggtcaaa gagctctctg tggaagaaca gataaagaga aatcggtatt atgatgagga 2400 tgaggatgaa gagtgacaaa ttgcaatgat gctgggcctt aaattcatgt tagtgttagc 2460 gagccactgc cctttgtcaa aatgtgatgc acataagcag gtatcccagc atgaaatgta 2520 atttacttgg aagtaacttt ggaaaagaat tccttcttaa aatcaaaaac aaaacaaaaa 2580 aacacaaaaa acacattcta aatactagag ataactttac ttaaattctt cattttagca 2640 gtgatgatat gcataagtgc tgtaaggctt gtaactgggg aaatattcca cctgataata 2700 gcccagattc tactgtattc ccaaaaggca atattaaggt agatagatga ttagtagtat 2760 attgttacac actattttgg aattagagaa catacagaag gaatttaggg gcttaaacat 2820 tacgactgaa tgcactttag tataaagggc acagtttgta tatttttaaa tgaataccaa 2880 tttaattttt tagtatttac ctgttaagag attatttagt ctttaaattt tttaggttaa 2940 ttttcttgct gtgatatata tgaggaattt actactttat gtcctgctct ctaaactaca 3000 tcctgaactc gacgtcctga ggtataatac aacagagcac tttttgaggc aattgaaaaa 3060 ccaacctaca ctcttcggtg cttagagaga tctgctgtct cccaaataag cttttgtatc 3120 tgccagtgaa tttactgtac tccaaatgat tgctttcttt tctggtgata tctgtgcttc 3180 tcataattac tgaaagctgc aatattttag taataccttc gggatcactg tcccccatct 3240 tccgtgttag agcaaagtga agagtttaaa ggaggaagaa gaaagaactg tcttacacca 3300 cttgagctca gacctctaaa ccctgtattt cccttatgat gtcccctttt tgagacacta 3360 atttttaaat acttactagc tctgaaatat attgattttt atcacagtat tctcagggtg 3420 aaattaaacc aactataggc ctttttcttg ggatgatttt ctagtcttaa ggtttgggga 3480 cattataaac ttgagtacat ttgttgtaca cagttgatat tccaaattgt atggatggga 3540 gggagaggtg tcttaagctg taggcttttc tttgtactgc atttatagag atttagcttt 3600 aatatttttt agagatgtaa aacattctgc tttcttagtc ttacctagtc tgaaacattt 3660 ttattcaata aagattttaa ttaaaatttg aaaaaaaaaa aaaaa 3705 <210> 17 <211> 1273 <212> DNA
<213> Homo sapiens <n>
<220>
<221> unsure <222> 1237 <223> a or g or c or t, unknown, or other <220>
<221> misc_feature <223> Incyte ID No: 2768356CB1 <400> 17 gccaccatcc gcctatctct gcgtgtcatg ctgagtctag aaattttgtt ttctggcaag 60 atgtgagatg gaaaaacaaa ttctggggca aatccatgga aattgttcca attggcacaa 120 cccatgtgac tctgccagtt tttggggatc attttgagtg gaacaaagtg acctcttgca 180 tccataacat cttaagcggg cagaggtgga ttgagcacta tggagagatt gtcatcaaga 240 acctgcatga tgattcctgc tactgcaaag tgaattttat aaaggcaaaa tactggagca 300 ctaatgccca tgagattgaa ggcacagtgt ttgacaggag tggaaaagcg gttcatcggc 360 tgtttgggaa atggcatgaa agcatctact gtggcggcgg ctcctcttct gcctgtgtat 420 ggagagcaaa tcctatgccg aaaggctacg agcaatacta tagcttcaca cagtttgcgc 480 tggaattaaa tgaaatggat ccatcatcaa agtctttatt gccacctact gacactcgat 540 ttaggccaga ccagaggttt ctagaggaag ggaacttaga agaagctgaa atacaaaagc 600 agaggattga acaactgcag agagaaaggc ggcgggtctt agaagaaaat catgtggagc 660 accagcctcg gtttttcagg aaatccgacg atgactcttg ggtgagcaac ggcacctatt 720 tggaacttag aaaagatctt ggtttttcca aactggacca tcctgtctta tggtgaaaaa 780 gtaaagaaga aagataacat tagtgtattt ctcctgtgct tgccttctga agtggcacaa 840 acctgtgttt atatatttaa aagatactct aggatgatca cttgtgctta gcttagcatt 900 gtaactcttt aagtctatat tttcctcagt gcgtttcttt acaatttcaa atgttaccct 960 gattgtttat atgaatgtag aacaccttga catttctttt tatatataaa ctatttaata 1020 aaaatgaaag attgaatgtt catgtgtggg ttaaaaaaag aagctttaac actaattttc 1080 caaaggttag ggaagattcc aattaaattt atgccttata aaattatgtt gtagaaaaaa 1140 tatcaacctc tcccaggtgc attaagaaat aagaattccc agggttactc acccatgcgt 1200 aagctaccaa gtttaatttg gtagctgaaa tatcttnttg cctcagacag ctcttgaatt 1260 gctcatacag aca 1273 <210> 18 <211> 1733 <212> DNA
<213> Homo sapiens <220>
<221> misc_feature <223> Incyte ID No: 5324145CB1 <400> 18 gccaacattc taggattctg ctggactagt tcaactgaaa ttgtcttcat aacagatcaa 60 ggaatcgaat tttaccaggt attaccagag aaacggagtc tgaaactctt gaagagccac 120 aatctcaatg tgaattggta catgtactgc cccgagagcg ccgtgatctt gctgtctacc 180 acggtcctgg agaatgtcct gcagcctttt cactttaggg ctggcactat gtcgaagctg 240 cccaaatttg agattgaatt accagctgcg cctaagtcaa ctaaacccag cctttccgaa 300 agagacatcg caatggctac catatacggg cagctgtatg ttctcttctt gaggcatcat 360 tctcggacct ccaacagcac aggagcggag gtggtcctct atcatctacc acgagaaggt 420 gcctgtaaaa agatgcacat attgaagtta aataggacgg gaaagtttgc cctgaacgtg 480 gtggacaacc tggtagtcgt gcatcatcag gatacagaga catcggtaat attcgatatc 540 aagttacggg gagagtttga cggctccgtt accttccacc accccgtgct tcctgctcga 600 tcgatccagc cctatcagat ccccatcaca ggtcctgctg ccgtgaccag ccagtctcct 660 gttccatgta aactctattc ttcatcttgg attgtctttc aacctgacat cattatcagc 720 gcaagccaag gttacctctg gaacctccaa gtgaaacttg agcccatagt aaatctctta 780 ccagacaaag gaagactcat ggactttctc ctccagagaa aggaatgcaa gatggtcatc 840 ctgtctgtct gttcacagat gttaagtgag tcagacagag catcgctgcc cgtgatagcc 900 actgtttttg ataaactcaa ccatgagtat aaaaagtacc tggatgccga gcagagttat 960 gcgatggcgg tggaagcagg gcagagccga agcagcccgc tcctcaagag gccggtgcgg 1020 acccaggcgg tgctggacca gtcagatgtg tacacccatg tcctgtcagc ctttgtggaa 1080 aagaaggaga tgcctcataa atttgtgata gccgtgctga tggaatacat tcgttctctt 1140 aaccagtttc agattgcagt acagcattac ctacatgaac ttgttatcaa aacccttgtc 1200 cagcacaacc tcttttatat gctgcatcag ttcctgcagt accacgtcct cagcgactcc 1260 aaacctttgg cttgtctgct gttatcccta gagagtttct atcctcctgc tcatcagcta 1320 tctctggaca tgctgaagcg actttcaaca gcaaatgatg aaatagtaga agttctcctt 1380 tccaaacacc aagtgttagc tgccttaagg tttatccggg gcattggtgg ccatgacaac 1440 atttctgcac gaaaattttt agatgctgca aagcagactg aagacaacat gcttttctat 1500 acaatattcc gcttttttga acagcgaaac cagcgtttgc gagggagccc caatttcaca 1560 ccaggggaac actgtgaaga acatgttgct tttttcaaac agatttttgg agaccaagct 1620 ctaatgaggc ctacaacatt ctgaaatcac ttgctgtttt tttatataaa aatgtgtaca 1680 aagttaattt attgcattaa taaagctctt taaactataa aaaaaaaaaa aaa 1733 <210> 19 <211> 1370 <212> DNA
<213> Homo Sapiens <220>
<221> misc_feature <223> Incyte ID No: 1004646CB1 <400> 19 gcacagcctc gcaatgagct gccacaactg ctccgacccc caggtccttt gcagctccgg 60 gcagctgttc ctgcagcccc tctgggacca cctgaggagc tgggaggccc tcctacagtc 120 gcccttcttc ccggtcatct tctccatcac cacatacgtg ggcttttgcc tgcccttcgt 180 ggtcctggat atcctgtgct cctgggtgcc cgccctgcgg cgctacaaga tccaccctga 240 cttctcgcca tccgcgcagc agctgctacc ttgcctgggg cagaccctct accagcatgt 300 gatgtttgtg ttccccgtga cgctgctgca ttgggcccgc agcccggccc tcctgcccca 360 cgaagctccc gagctgctcc tgctgctgca ccacatcctg ttctgcctgc tactcttcga 420 catggagttc ttcgtgtggc acctgctgca ccacaaggtg ccctggctgt accgcacctt 480 ccacaaggtg caccaccaga actcgtcctc gttcgcgctg gcaacgcagt atatgagcgt 540 ctgggaactg ttttctttgg gcttcttcga catgatgaac gtcacactgc tcgggtgcca 600 cccgctcacc accctgacct tccacgtggt caacatctgg ctttccgtgg aggaccactc 660 cggctacaac ttcccttggt ccactcacag actggtgccc ttcgggtggt acgggggtgt 720 ggtgcaccac gacctgcatc actctcactt taactgcaac ttcgctccgt actttacaca 780 ctgggacaaa atactgggaa cgctgcggac tgcatctgtc ccagcgcggt gatgtggctg 840 cggtgggtgc ccctaagact cgggactgct gtgcctttca cacttgaatg aagagaaaca 900 cctgagctat atattttttt aaagcaacta acttattgct ttatgtttat ctatgaaaac 960 catagataaa atctgatgca tttttgtaat ctgacaaagt aatttacata ctgtttgtgt 1020 atcaatacaa ttttgtgttc ttggtattct tagtctagct cacctcaata gccttgaatc 1080 ctgcatatga attagacatt catcactggc atatttagaa tatctctaaa aggacttgtt 1140 tgtagaataa ggaattttct atgtttcaaa gtgttctaaa acctggctaa aagaatgtat 1200 ttttgtggat ggtgttgact tctgactcta aaagcaatca aacatgtttc tgctggacag 1260 tgaccaagaa ttatagtacc ttcttatatt tttttataga actgtatatt tattttgaaa 1320 gaaatgttat tcgtgcttta aaaaggaaaa aaaaccatga atcaaataaa 1370 <210> 20 <211> 1264 <212> DNA
<213> Homo Sapiens <220>
<221> misc_feature <223> Incyte ID No: 1802851CB1 <400> 20 ctcgcgtgcg gtgcgcaggg ataagagagc ggtctggaca gcgcgtggcc ggcgccgctg 60 tggggacagc atgagcggcg gttggatggc gcaggttgga gcgtggcgaa caggggctct 120 gggcctggcg ctgctgctgc tgctcggcct cggactaggc ctggaggccg ccgcgagccc 180 gctttccacc ccgacctctg cccaggccgc aggccccagc tcaggctcgt gcccacccac 240 caagttccag tgccgcacca gtggcttatg cgtgcccctc acctggcgct gcgacaggga 300 cttggactgc agcgatggca gcgatgagga ggagtgcagg attgagccat gtacccagaa 360 agggcaatgc ccaccgcccc ctggcctccc ctgcccctgc accggcgtca gtgactgctc 420 tgggggaact gacaagaaac tgcgcaactg cagccgcctg gcctgcctag caggcgagct 480 ccgttgcacg ctgagcgatg actgcattcc actcacgtgg cgctgcgacg gccacccaga 540 ctgtcccgac tccagcgacg agctcggctg tggaaccaat gagatcctcc cggaagggga 600 tgccacaacc atggggcccc ctgtgaccct ggagagtgtc acctctctca ggaatgccac 660 aaccatgggg ccccctgtga ccctggagag tgtcccctct gtcgggaatg ccacatcctc 720 ctctgccgga gaccagtctg gaagcccaac tgcctatggg gttattgcag ctgctgcggt 780 gctcagtgca agcctggtca ccgccaccct cctccttttg tcctggctcc gagcccagga 840 gcgcctccgc ccactggggt tactggtggc catgaaggag tccctgctgc tgtcagaaca 900 gaagacctcg ctgccctgag gacaagcact tgccaccacc gtcactcagc cctgggcgta 960 gccggacagg aggagagcag tgatgcggat gggtacccgg gcacaccagc cctcagagac 1020 ctgagctctt ctggccacgt ggaacctcga acccgagctc ctgcagaagt ggccctggag 1080 attgagggtc cctggacact ccctatggag atccggggag ctaggatggg gaacctgcca 1140 cagccagaac tgaggggctg gccccaggca gctcccaggg ggtagaacgg ccctgtgctt 1200 aagacactcc tgctgccccg tctgagggtg gcgattaaag ttgcttcaca tcctcaaaaa 1260 aaaa 1264 <210> 21 <211> 2566 <212> DNA
<213> Homo Sapiens <220>
<221> misc_feature <223> Incyte ID No: 2764333CB1 <400> 21 cccacgcgtc cgacgagttc tatgatgcgc tgtcagattc cgagtccgaa aggtccctga 60 gtagattgga agcagtgaca gcacgctcct ttgaagagga aggagagcat ttgggcagta 120 gaaaacacag aatgtccgaa gaaaaagact gtggtggcgg agatgctctc tccaatggca 180 tcaagaaaca cagaacaagt ttgccttctc ctatgttttc cagaaatgac ttcagtatct 240 ggagcatcct cagaaaatgt attggaatgg aactatccaa gatcacgatg ccagttatat 300 ttaatgagcc tctgagcttc ctacagcgcc taactgaata catggagcat acttacctca 360 tccacaaggc cagttcactc tctgatcctg tggaaaggat gcagtgtgta gctgcgtttg 420 ctgtatctgc tgttgcttct cagtgggaac ggactggaaa acctttcaac ccactgctgg 480 gagagactta tgaattagtg cgagatgacc ttggatttag actcatctcc gaacaggtca 540 gccatcaccc accaatcagt gcatttcatg ctgaaggatt aaacaatgac ttcatctttc 600 atggctctat ctatcccaaa ctgaaattct gggggaagag tgtagaagca gaacccaaag 660 gaaccatcac cttggagctc cttgaacaca atgaggcata tacatggaca aatcccacct 720 gctgtgtgca taatatcatt gtgggtaaac tgtggatcga acagtatggc aatgtggaaa 780 ttataaacca caagactggg gacaaatgtg tgttgaattt taagccatgt ggcctttttg 840 gtaaggaatt acacaaagtt gaaggctaca ttcaagataa aagcaaaaag aagctctgtg 900 ccctctatgg gaagtggact gaatgtttat acagtgttga ccctgccacg tttgacgctt 960 acaaaaaaaa tgataagaaa aatacagaag agaagaagaa cagcaaacag atgagcacct 1020 ctgaggagtt ggatgaaatg ccagtgccgg attctgaaag tgtattcatt atccctggaa 1080 gcgttcttct atggcgaata gccccacggc ctccaaattc tgcccagatg tataatttta 1140 ctagttttgc aatggttttg aatgaagtag acaaagacat ggagagtgtg attcccaaga 1200 cagactgcag gttacggcct gacatcagag ccatggaaaa tggagagata gatcaagcta 1260 gtgaagaaaa aaaacgactt gaggaaaaac aaagagcagc ccgcaaaaac aggtccaagt 1320 cagaagagga ctggaagacg aggtggttcc atcaaggtcc taatccctac aatggagcac 1380 aggactggat ttactctggc agctactggg acagaaatta cttcaatttg cctgacattt 1440 attaaaatgc atacaagtca gggtgtttgg ctaatctaca aataagtctt aaacctatgt 1500 ttttaaattt ttttcccttg gtttctactt atcttttaaa aaaaaaaatg aaaaaacact 1560 catgagataa ctgcatttca cccaacaaaa gcagggtata aggcgatatt ggtgatgaaa 1620 gtcttaggaa aaatgcataa ttttgctata aaatgtactt atttggaata ctattttata 1680 tagaggtaag agaacactgc tggggaatat gctttttatg gttgctgttg ccatatttac 1740 tgaaggttta tacctaaatg taactttagc tttatggaac tatatagtaa tcccaaatca 1800 agttattttg aatattttta tgctgtcatg cttgaatgtt ttagatgtaa cctttgacat 1860 atttagaact ctcctcctat acaatgttta ttctcagata tagaggttat gtcattttat 1920 aaagacttca ttgataagat ggcttttatt catactaatc ctcccaatgt taccccttcc 1980 atcttccaag aagaaaaaaa atgcctgaat attcagaata gatatttctg atttgaaaat 2040 tctaaagaat taaactggaa aagtatttca tttacttagt gctctgaatt tacttttaca 2100 gttttctgca gtcagtatca ttaaaatggt taagtttaca tttgaactga aaatatgtat 2160 aaaatctagc aattcacaaa aatgccctag aaatatagat tttaatcacc attacataat 2220 gacaaacctt gttaaatgct tccacttcca gtggcaaatg ccactaggga aagtaagttg 2280 cactcatgta agtatcaaac tatataaaag gaggccttgt gcatttcaag tttgcaaagt 2340 acctgtgtac ttaaaatatg tgtggagacc tactgtacag tagttttgcc cctttaattg 2400 gggcacattc atcttaaatc ttatagtatt tatccaccca aaccccagac tgagatactg 2460 ctcccagggg cctaggtagc tgccagtccg tgattttaat tgctgtcttg aagttaacaa 2520 gtgttataat gaaataatct acctgatgct aaataaaggc tttaga 2566 <210> 22 <211> 1848 <212> DNA
<213> Homo Sapiens <220>
<221> misc_feature <223> Incyte ID No: 2798021CB1 <400> 22 aaccagctga aaagcatgcg ccaagctgca gcagatgcaa agcctgaaag tttaatgaag 60 aggctagagg aggagataaa atttaattta tatatggtaa ctgaaaaatt tcctaaagaa 120 ttagaaaata agaaaaagga attacatttt ttacaaaaag tagtttcaga gccagctatg 180 ggccattctg atcttcttga acttgaatct aaaataaatg aaataaacac agaaattaac 240 cagttgattg aaaagaaaat gatgagaaat gagcccattg aaggcaaact ctcactgtat 300 aggcaacagg catctatcat ttcccgtaaa aaagaagcca aagctgagga acttcaggag 360 gccaaggaga agttagccag cctagagaga gaagcatcag taaagagaaa tcagacccgt 420 gaatttgatg gtactgaagt tttaaaggga gatgagttca aacgatatgt caataaactt 480 cgaagcaaga gtacagtttt caaaaagaag catcagataa tagctgaact taaagctgaa 540 ttcggtcttt tgcagaggac tgaagaactt cttaagcaac gtcatgaaaa tattcaacaa 600 caactgcaaa ctatggagga gaaaaagggt atatctggat atagttacac ccaagaagag 660 ctagaaagag tatctgcact gaagagtgaa gttgatgaaa tgaaaggacg aacattggat 720 gatatgtctg aaatggtgaa aaaactgtat tcattggtat ctgaaaagaa gtcagctctt 780 gcctcagtta taaaagagct acgacagttg cgtcaaaaat atcaagaact gacccaggag 840 tgtgatgaaa agaaatccca gtatgatagc tgtgcagcag gcctcgaaag caatcggtcc 900 aaattagaac aggaagttag aagactccgt gaagaatgtc ttcaagaaga aagtagatac 960 cattatacaa attgtatgat taagaaccta gaagttcaac ttcgtcgtgc tactgatgag 1020 atgaaggcat atatctcttc tgatcaacaa gaaaaaagaa aggcaattag ggaacagtat 1080 accaaaaata ctgctgaaca agaaaacctt ggaaagaaac ttcgggaaaa acaaaaagtt 1140 atacgagaaa gtcatggtcc aaatatgaaa caagcaaaaa tgtggcgtga tttggaacaa 1200 ttaatggaat gtaagaaaca gtgctttctg aaacaacaaa gccaaacttc cattggtcag 1260 gtaattcagg agggtgggga ggaccggcta atactgtgaa ttcttgtgtc atcgtttggg 1320 gttttacttg ataccactag ctataagcct aatctcataa tgtatttctt ttttgaaact 1380 gatttgtata gcattttgtt ttcagaagag ccattcttta ttaagttttc atagaaaata 1440 atgttaaggt agatttagtt tgaatgtttt ttcatatgaa aaagaggctt ttattctttt 1500 ccatagttta gacatcactg gcgtcttctg agttttatga gacaggaaac taagtttact 1560 atctgtaaat gtaaacatat gtccattaag aaacatgtag ttttttttta gaatgtaata 1620 acccagtggc ttactgtttt tcttaatctc ttttaaaaaa actttagaag aatcttttag 1680 gaactaatat ctcttgttct gaagaaacat ttatctgacg ttcagcagtt cctacagttt 1740 tacttcagtt tatttttctt ctgtaaaatg caagaaaatt taatattttg actaacatgt 1800 cttttctgtt tgtatcattt aaaggcatat aaacttgtcg agtattaa 1848 <210> 23 <211> 2518 <212> DNA
<213> Homo sapiens <220>
<221> misc_feature <223> Incyte ID No: 3335404CB1 <400> 23 cgcagagccg gcgagggctt gcggtcctgg acggtagggg tctgcgtctg gtgccacctt 60 ctccatcagg ctgcttgctg ggtccaccaa gcctgacccc tccagccagg aggggatcac 120 ggagtgtgtc ccccgcccgg gctgctgcct gccggaaggg ggctgggaaa ccgggatccc 180 cgcgcgccac ttgcccggca gcgtcagttt cacccaggac tggctagcgg ttccctcgtg 240 gctctgcgcg gaggtccgcc tcctcccttc cctcccccat cccgcgcgct gcctccgcct 300 ccttcttctg cctccaccct caacccccat ccccgcctga cgggagctag ccctcagtcc 360 gcccgagctg tggttgtggg cgccggacaa gtccaaggcg cctcctccca atatggacag 420 ccgctacaac agcactgcgg gcatcgggga cttgaaccag ctgagcgctg ccatcccggc 480 cacgcgggtg gaggtgtccg tgtcctgcag aaatcttctt gacagagaca cattttctaa 540 atctgatcca atttgtgtct tatatgtaca aggagttgga aataaagaat ggagagagtt 600 tggaagaact gaagtaattg ataatacttt aaatcctgat tttgtaagaa agtttattct 660 ggactacttt tttgaagaaa gagagaatct tcgttttgac ttgtatgatg ttgattcaaa 720 gagccccaac ttatccaaac atgactttct gggacaagtg ttttgtacat tgggagagat 780 cgttggttca cagggaagtc gcctggaaaa accaatagta ggaattccag ggaagaaatg 840 tggtacaatc atacttacag cagaggaatt aaactgttgc agggatgccg ttttgatgca 900 attttgtgcg aacaaattgg acaagaagga cttctttgga aaatcagatc ctttccttgt 960 attttatcga agtaatgaag atggcagttt tacaatttgt cacaagacag aagttgtcaa 1020 aaacactcta aatccagtat ggcaagcatt caagatctca gtcagagcat tatgtaatgg 1080 agactatgac agaacaatca aagtagaggt gtatgactgg gaccgagatg gaagtcatga 1140 tttcattgga gaatttacaa caagctatag ggaactttct agagggcagt cacaattcaa 1200 cgtatatgag gtggtgaatc ccaaaaagaa aggaaaaaag aaaaaatata ctaattctgg 1260 aacagtaact ttactctctt tcttggtaga aacagaagtt tcattccttg actacattaa 1320 gggagggacg caaatcaatt tcacagtggc tattgatttt acagcatcaa acgggaaccc 1380 tgctcagccc acttccctcc actacatgaa tccttaccaa ctgaatgcct atggtatggc 1440 actaaaagca gtgggagaaa ttgttcaaga ttatgacagt gataaaatgt ttccagctct 1500 aggatttggt gcaaaactgc ctccagatgg aaggatatct cacgaatttg ctttgaatgg 1560 gaatcctcaa aacccctact gtgatggcat tgagggggtc atggaggctt attacaggag 1620 tctgaaatct gtacaactat atgggcccac caactttgct cctgtaatta atcatgtagc 1680 aagatatgct tcttctgtaa aggatggctc ccagtatttt gtgcttctga ttgttacaga 1740 tggtgttatc tcagatatgg cccagactaa ggagtccata gttaatgcct caaaacttcc 1800 aatgtcaata attatagtag gtgttggacc agcagaattt gatgcaatgg tcgaattgga 1860 tggagatgat gtaagagtct cctctagagg aaaatatgct gaaagagaca ttgtgcagtt 1920 tgtgccattc agggattata ttgacagaag tggaaaccac atactgagca tggctagatt 1980 ggctaaagat gtcctagctg agatccctga gcagtttctc tcctatatga gagcccgagg 2040 aatcaagcca tcacctgcgc ctcccccata caccccacct acacatgtgt tacagactca 2100 aatatgactg tgctctgaaa tgctaatgtc aactacaaat caaaagtgct gagttaatgc 2160 tttgtgcctg gtgctctgta atgaaccagg caatgagata gttttctcag tttggtttca 2220 gcagttaatg tgctttcttg gatccaaatt taaatatctt cctaaaccaa aactgtaaat 2280 atggttgttg catgagcaac agaaaaaatt gtttaaatgc ttgaagcaaa gtatggatgt 2340 cttctctaaa tctttctttc tttttttttt tttttttttt ttaacagaaa cagctaattt 2400 ccaatgtatt gttgggaaaa agcacaaact gtgtttttaa ctcaaatatt gtcttcgact 2460 gctatgtgta taggaaagca gtctctgtat caatgtttac atgttactac tttttaaa 2518 <210> 24 <211> 1120 <212> DNA
<213> Homo Sapiens <220>
<221> misc_feature <223> Incyte ID No: 3735780CB1 <400> 24 gaaatccagt tatcaaaatt gactcaagaa gagagaacct aacagaacaa taacaatgga 60 agaaattggg aacattatca caaagctatc atcctgccaa actccaggct cagatgtcac 120 aggttaaaaa aagtccttca tgaaaaagaa agatcttaag cagcatgatg gattcagaag 180 ctcatgaaaa gaggccacca atactaacat cttcaaaaca agatatatca cctcatatta 240 caaatgttgg tgagatgaag cattacttgt gtggctgctg tgcagccttc aacaatgtcg 300 caatcacatt tcccattcag aaggtcctct ttcgacaaca gctgtatggc atcaaaaccc 360 gggatgcaat acttcagttg agaagggatg gatttcgaaa tttgtatcgt ggaatccttc 420 ccccattgat gcagaagaca actacgcttg cacttatgtt tggtctgtat gaggatttat 480 cctgccttct ccacaagcat gtcagtgctc cagagtttgc aaccagtggc gtggcggcag 540 tgcttgcagg gacaacagaa gcaattttca ctccactgga aagagttcag acattgcttc 600 aagaccacaa acatcatgac aaatttacca acacttacca ggctttcaag gcactgaaat 660 gtcatggaat tggagagtat tatcgaggct tggtgcccat tcttttccgg aatggactca 720 gcaatgtctt gtttttcggc cttcgaggtc ccattaagga gcatctgcct accgcaacga 780 ctcacagtgc tcatctggtc aatgatttta tctgtggagg tctattgggt gccatgttgg 840 gattcttgtt ttttccaatt aatgttgtaa aaactcgcat acagtctcag attggtgggg 900 aatttcagtc tttccccaag gttttccaaa aaatctggct ggaacgggac agaaaactga 960 taaatctttt cagaggtgcc catctgaatt accatcggtc cctcatctct tggggcataa 1020 tcaatgcaac ttatgagttc ttgttaaagg ttatatgaaa aaaccatcag ttaagtgcca 1080 tttatcaact gaatagacct tctaagaaga aaaaaaaaaa 1120
~ rU w ~ r~ ~ rd ~ +~ U o ~ ~ o v ~
~ o ~ rtf 3 ~ ~ m v 'H
(l7 U S-I U1 U1 U U1 U7 ~ O ~ 1J ~ jl -rl -rl O 'd -.-1 c~ O N
U
~ ~ o ~ ~n ~ ~ ~ ~ b w 3 ,~ v u~
u~ ~I m o >~ U rx ~ o o it ~a o ca o ~a o o v +~ >~ ~ ~ ~
~, o ~, ca ,~ ~ o ~ v ~a v U U U U U U U U ~ U1 rtS U7 I
~ ~ N !~ U 3 3 ~r ~ U7 .G
O O f~ ~ .H O ~ ~ U v O U7 a O O
~ b1 'LS
U1 '~ ~ tl1 U1 'LS U1 U1 (a >~ 1.J ~ S-a 1-~ O (0 !~ f~S . H H ~ U -r-I
ri O >~ rl t~ rl ' (IS ~ ~ rtf f~ S-I O O O ~H O
-H U1 U W U tl~ ~ U
r-i ~ H
3 0 ~ 3 3 0 3 3 y ~ ~ ~ w v '~ i v ~
v N ~
~I ~ U 'a~ ~ ,~ , ~ ~
~ -I
~t ~r S.W 'r ~y ~-I ?W ?i ~ ~ ~r ~ U ~ .-i ~ (d ~ J~ ~ N U
v N
w ~I rtf S~ ~ ~ i~ ~ u1 vo U u7 ~ v ~ U ?~ O rti O w c~ ~ ''C3 -~
N >~ ~ ~
t~ v U (~ f~ v N (d N v v ra ~I ~
O ~ ~ W U f-a I 7-I N -~ v d U~
~
~1 ~r O ~ ~ ~ ~r ~.I ~-1 U7 O W >~, S.-i ~', H ~ w-1 ~.1 v U ,~, 1J ~r ~-1 1.l O v ~-1 .C~ I t!7 ,Q ,~ I ,f~ ~ w ~ ,~ O ,Q U1 v (a '~ td ~ C1,.4 1~ ,~ U
'' 1-I 'C~ ~
~ ~ H ~ ~ 7~
~ W ~
H V H N -H N ~ w , ~ w N ~ v ~ ~ ~
U1 O a M !~ a ~
a 111 (a ~ a -ri a ~ H ~C U7 U rl yU
to w O ~ , .~., ?i U7 ~ . r ?, C2, (d U
M (~ N r-I rl w-i ~y O N r-i O O O
H H H H H
rd O O O O W O
E-f Ei a H U
w v~ a v~ o a rx o c~ a~
c~ U r~ z v ro -r., ..
~o oz v r-1 o\ O r-i N M
(~
U rl N N N N N
H
f~
OI
?~
W
H~
O
G~
00 ~ n :a o ~ ~ c W ~ J '~~ 'n =' r, v ro ~ .r a ~ ~ f = ~ a ~ r, a a -_p ~ ~ v~ _ i O y j i: rJ' ~ W per., O :~ J ~ II ~ ~ ~ a L ~ ~ J W. .' Z > ~ ;J ..,~.y. .D .~ i ~ t~D
O ._>'r ~ ~ ;b 'r.; ;'~ O 'J_. _~ op ~J' Ga. ~ 'J
~"O >~ yN II ~~ ~~,._ :~ '~.a V ~ ~ W W W c,UC ~ ~ ~ II O O > O p y ~ O ~ '~ ~ x ~ O w ~_ on y V G. ~ ~ y W ~ O ~ ~ O 'O y o ~ N y ~ ~ ~ y c ~ " o> o ~ ~ '1' ~ ~, ._~ y ~ v. J -o ~ p o t~
C. ~ ~~~L > Lz1<Ca~.:~s~,.~ ~j~ ~n v~~.
v~ E
a ~x ~ .
.. ~3~a :C ~ '0 U V
~ ~ ~ ~ x a a o N '-' . ,-CO ~ a ~ a ~ ' x . r .a cn vi ~ vi ' ~ N ~ y ~ p ~ p O a O W O
Cue. ~ ~ a~ can ~ ~ ~ ~ ~ Z ~ ~ ~ ~ ~ vp '~J~ '~rJ O~ Uj tn ,~ N . :~ C "'' ~n y ~ ~, 00 m ~ 07 a ~ M ~. ~ .f' = r~ ~ x ~ ~ C C:
yv ~ '~ ~ ~ N p N S vp a '~ E-.n. .c. G~ ~a. :j ~ y ~ ~ uj ~ ~ ~ N ~ cn ~ v~ a n. n. a.
aQ as aQ :~ax . ~b~ b~ o~~ '~.- U
ccd a m .
U Ct, o 'v r~ ~ ~ :c ~ a 3 - , y E ~ 8 ,r>~' ~ ,..~''" ~ ~ '-' 3 -o y !i: ~ vy ~ Q a a' r- v W U W U W U ~ ~~, a G 'U° ~ E- ~ o a ~ ~ ~ Q o z y .. ' ~ o . o a ~t ~ . o ~ . U
~.~~o y~ ~o "~~ ~b~~a. yyyp'~ cn,~
tn 2 0. r~., a, u, o. ti, Q N z n. z " v~ a s x z .- x~ N ..
U C
v~ ~ y ~ ~ U ~ ~ J
a' <n p, C J ~ 'O ~ '.~ . G. c~O c~ ~' ~ vj b ~ G .~ ~ ~ ~ O y ~ vy, G" ~~ ~ y <=C 'C U ~ U ~ J ~ ~i:, ~ ~ O .~ ~ J
V U ~ ~ ~ .U '~J~ ~ b ~ ~ V ~ 'Op '7 j y C 'C ~ ~ j ;C J r :C p ~ i.U., y ~ CL
a. ~ ~ E. '"" ~ C E- ~'.' E" 4 U ~ 3 ~ v '~-, ~ y u-, ,~ y ~ v oyn ~ ~a ~ V ~ ~ n, c J U .Wn ~ ~ G ~ ~ ~ ~ ~ ~ J ~ ~ ~ C ' y'U '~~ ~, y~ Jb ~~ ._=~ ~~"r0v .C y o a o .,~. ~ ~ '° .a '~ b -o O
r° '° ~ ~ n. c ~, ~ > ~ a ~
'~ i Li. '~ ~ ~ -a ~.~c~'~ ~ ~ N'~ ~ ~.x O p U ~ "a r y .~ 'J ~ J ~ j .SC rJ a OA ~..t"', n, v° rz. ' a, ~ '~ a. c a'» 'E ~. ~ m ' Wo ~
a ~ a ~ a aw ~.~ aw ~:~ a ~ ~ : a;
w U Q
U
caz. n. a a ~ g _.QQQoWC ua.
SEQUENCE LISTING
<110> INCYTE PHARMACEUTICALS, INC.
TANG, Y. Tom HILLMAN, Jennifer L.
YUE, Henry AZIMZAI, Yalda BAUGHN, Mariah R.
TRAM, Bao <120> HUMAN LIPID-ASSOCIATED PROTEINS
<130> PF-0676 PCT
<140> To Be Assigned <141> Herewith <150> 60/120,703; 60/142,762 <151> 1999-02-19; 1999-07-08 <160> 24 <170> PERL Program <210> 1 <211> 331 <212> PRT
<213> Homo Sapiens <220>
<221> misc_feature <223> Incyte ID No: 161190CD1 <400> 1 Met Asp Ser Glu Lys Lys Arg Phe Thr Glu Glu Ala Thr Lys Tyr Phe Arg Glu Arg Val Ser Pro Val His Leu Gln Ile Leu Leu Thr Asn Asn Glu Ala Trp Lys Arg Phe Val Thr Ala Ala Glu Leu Pro Arg Asp Glu Ala Asp Ala Leu Tyr Glu Ala Leu Lys Lys Leu Arg Thr Tyr Ala Ala Ile Glu Asp Glu Tyr Val Gln Gln Lys Asp Glu Gln Phe Arg Glu Trp Phe Leu Lys Glu Phe Pro Gln Val Lys Arg Lys Ile Gln Glu Ser Ile Glu Lys Leu Arg Ala Leu Ala Asn Gly Ile Glu Glu Val His Arg Gly Cys Thr Ile Ser Asn Val Val Ser Ser Ser Thr Gly Ala Ala Ser Gly Ile Met Ser Leu Ala Gly Leu Val Leu Ala Pro Phe Thr Ala Gly Thr Ser Leu Ala Leu Thr Ala Ala Gly Val Gly Leu Gly Ala Ala Ser Ala Val Thr Gly Ile Thr Thr Ser Ile Val Glu His Ser Tyr Thr Ser Ser Ala Glu Ala Glu Ala Ser Arg Leu Thr Ala Thr Ser Ile Asp Arg Leu Lys Val Phe Lys Glu Val Met Arg Asp Ile Thr Pro Asn Leu Leu Ser Leu Leu Asn Asn Tyr Tyr Glu Ala Thr Gln Thr Ile Gly Ser Glu Ile Arg Ala Ile Arg Gln Ala Arg Ala Arg Ala Arg Leu Pro Val Thr Thr Trp Arg Ile Ser Ala Gly Ser Gly Gly Gln Ala Glu Arg Thr Ile Ala Gly Thr Thr Arg Ala Val Ser Arg Gly Ala Arg Ile Leu Ser Ala Thr Thr Ser Gly Ile Phe Leu Ala Leu Asp Val Val Asn Leu Val Tyr Glu Ser Lys His Leu His Glu Gly Ala Lys Ser Ala Ser Ala Glu Glu Leu Arg Arg Gln Ala Gln Glu Leu Glu Glu Asn Leu Met Glu Leu Thr Gln Ile Tyr Gln Arg Leu Asn Pro Cys His Thr His <210> 2 <211> 480 <212> PRT
<213> Homo sapiens <220>
<221> misc_feature <223> Incyte ID No: 1292575CD1 <400> 2 Met Asn Gly Glu Glu Glu Phe Phe Asp Ala Val Thr Gly Phe Asp Ser Asp Asn Ser Ser Gly Glu Phe Ser Glu Ala Asn Gln Lys Val Thr Gly Met Ile Asp Leu Asp Thr Ser Lys Asn Asn Arg Ile Gly Lys Thr Gly Glu Arg Pro Ser Gln Glu Asn Gly Ile Gln Lys His Arg Thr Ser Leu Pro Ala Pro Met Phe Ser Arg Ser Asp Phe Ser Val Trp Thr Ile Leu Lys Lys Cys Val Gly Leu Glu Leu Ser Lys Ile Thr Met Pro Ile Ala Phe Asn Glu Pro Leu Ser Phe Leu Gln Arg Ile Thr Glu Tyr Met Glu His Val Tyr Leu Ile His Arg Ala Ser Cys Gln Pro Gln Pro Leu Glu Arg Met Gln Ser Val Ala Ala Phe Ala Val Ser Ala Val Ala Ser Gln Trp Glu Arg Thr Gly Lys Pro Phe Asn Pro Leu Leu Gly Glu Thr Tyr Glu Leu Ile Arg Glu Asp Leu Gly Phe Arg Phe Ile Ser Glu Gln Val Ser His His Pro Pro Ile Ser Ala Phe His Ser Glu Gly Leu Asn His Asp Phe Leu Phe His Gly Ser Ile Tyr Pro Lys Leu Lys Phe Trp Gly Lys Ser Val Glu Ala Glu Pro Arg Gly Thr Ile Thr Leu Glu Leu Leu Lys His Asn Glu Ala Tyr Thr Trp Thr Asn Pro Thr Cys Cys Val His Asn Val Ile Ile Gly Lys Leu Trp Ile Glu Gln Tyr Gly Thr Val Glu Ile Leu Asn His Arg Thr Gly His Lys Cys Val Leu His Phe Lys Pro Cys Gly Leu Phe Gly Lys Glu Leu His Lys Val Glu Gly His Ile Gln Asp Lys Asn Lys Lys Lys Leu Phe Met Ile Tyr Gly Lys Trp Thr Glu Cys Leu Trp Gly Ile Asp Pro Val Ser Tyr Glu Ser Phe Lys Lys Gln Glu Arg Arg Gly Asp His Leu Arg Lys Ala WO 00/49043 CA 02362300 2001-07-31 pCT~S00/04160 Lys Leu Asp Glu Asp Ser Gly Lys Ala Asp Ser Asp Val Ala Asp Asp Val Pro Val Ala Gln Glu Thr Val Gln Val Ile Pro Gly Ser Lys Leu Leu Trp Arg Ile Asn Thr Arg Pro Pro Asn Ser Ala Gln Met Tyr Asn Phe Thr Ser Phe Thr Val Ser Leu Asn Glu Leu Glu Thr Gly Met Glu Lys Thr Leu Pro Pro Thr Asp Cys Arg Leu Arg Pro Asp Ile Arg Gly Met Glu Asn Gly Asn Met Asp Leu Ala Ser Gln Glu Lys Glu Arg Leu Glu Glu Lys Gln Arg Glu Ala Arg Arg Glu Arg Ala Lys Glu Glu Ala Glu Trp Gln Thr Arg Trp Phe Tyr Pro Gly Asn Asn Pro Tyr Thr Gly Thr Pro Asp Trp Leu Tyr Ala Gly Asp Tyr Phe Glu Arg Asn Phe Ser Asp Cys Pro Asp Ile Tyr <210> 3 <211> 409 <212> PRT
<213> Homo Sapiens <220>
<221> misc_feature <223> Incyte ID No: 2454393CD1 <400> 3 Met Ala Thr Ser Val Gly His Arg Cys Leu Gly Leu Leu His Gly Val Ala Pro Trp Arg Ser Ser Leu His Pro Cys Glu Ile Thr Ala Leu Ser Gln Ser Leu Gln Pro Leu Arg Lys Leu Pro Phe Arg Ala Phe Arg Thr Asp Ala Arg Lys Ile His Thr Ala Pro Ala Arg Thr Met Phe Leu Leu Arg Pro Leu Pro Ile Leu Leu Val Thr Gly Gly Gly Tyr Ala Gly Tyr Arg Gln Tyr Glu Lys Tyr Arg Glu Arg Glu Leu Glu Lys Leu Gly Leu Glu Ile Pro Pro Lys Leu Ala Gly His Trp Glu Val Ala Leu Tyr Lys Ser Val Pro Thr Arg Leu Leu Ser Arg Ala Trp Gly Arg Leu Asn Gln Val Glu Leu Pro His Trp Leu Arg Arg Pro Val Tyr Ser Leu Tyr Ile Trp Thr Phe Gly Val Asn Met Lys Glu Ala Ala Val Glu Asp Leu His His Tyr Arg Asn Leu Ser Glu Phe Phe Arg Arg Lys Leu Lys Pro Gln Ala Arg Pro Val Cys Gly Leu His Ser Val Ile Ser Pro Ser Asp Gly Arg Ile Leu Asn Phe Gly Gln Val Lys Asn Cys Glu Val Glu Gln Val Lys Gly Val Thr Tyr Ser Leu Glu Ser Phe Leu Gly Pro Arg Met Cys Thr Glu Asp Leu Pro Phe Pro Pro Ala Ala Ser Cys Asp Ser Phe Lys Asn Gln Leu Val Thr Arg Glu Gly Asn Glu Leu Tyr His Cys Val Ile Tyr Leu Ala Pro Gly Asp Tyr His Cys Phe His Ser Pro Thr WO 00/49043 CA 02362300 2001-07-31 pCT~S00/04160 Asp Trp Thr Val Ser His Arg Arg His Phe Pro Gly Ser Leu Met Ser Val Asn Pro Gly Met Ala Arg Trp Ile Lys Glu Leu Phe Cys His Asn Glu Arg Val Val Leu Thr Gly Asp Trp Lys His Gly Phe Phe Ser Leu Thr Ala Val Gly Ala Thr Asn Val Gly Ser ile Arg Ile Tyr Phe Asp Arg Asp Leu His Thr Asn Ser Pro Arg His Ser Lys Gly Ser Tyr Asn Asp Phe Ser Phe Val Thr His Thr Asn Arg Glu Gly Val Pro Met Arg Lys Gly Glu His Leu Gly Glu Phe Asn Leu Gly Ser Thr Ile Val Leu Ile Phe Glu Ala Pro Lys Asp Phe Asn Phe Gln Leu Lys Thr Gly Gln Lys Ile Arg Phe Gly Glu Ala Leu Gly Ser Leu <210> 4 <211> 759 <212> PRT
<213> Homo Sapiens <220>
<221> misc_feature <223> Incyte ID No: 2766980CD1 <400> 4 Met Glu Ser Ser Pro Phe Asn Arg Arg Gln Trp Thr Ser Leu Ser Leu Arg Val Thr Ala Lys Glu Leu Ser Leu Val Asn Lys Asn Lys Ser Ser Ala Ile Val Glu Ile Phe Ser Lys Tyr Gln Lys Ala Ala Glu Glu Thr Asn Met Glu Lys Lys Arg Ser Asn Thr Glu Asn Leu Ser Gln His Phe Arg Lys Gly Thr Leu Thr Val Leu Lys Lys Lys Trp Glu Asn Pro Gly Leu Gly Ala Glu Ser His Thr Asp Ser Leu Arg Asn Ser Ser Thr Glu Ile Arg His Arg Ala Asp His Pro Pro Ala Glu Val Thr Ser His Ala Ala Ser Gly Ala Lys Ala Asp Gln Glu Glu Gln Ile His Pro Arg Ser Arg Leu Arg Ser Pro Pro Glu Ala Leu Val Gln Gly Arg Tyr Pro His Ile Lys Asp Gly Glu Asp Leu Lys Asp His Ser Thr Glu Ser Lys Lys Met Glu Asn Cys Leu Gly Glu Ser Arg His Glu Val Glu Lys Ser Glu Ile Ser Glu Asn Thr Asp Ala Ser Gly Lys Ile G1u Lys Tyr Asn Val Pro Leu Asn Arg Leu Lys Met Met Phe Glu Lys Gly Glu Pro Thr Gln Thr Lys Ile Leu Arg Ala Gln Ser Arg Ser Ala Ser Gly Arg Lys Ile Ser Glu Asn Ser Tyr Ser Leu Asp Asp Leu Glu Ile Gly Pro Gly Gln Leu Ser Ser Ser Thr Phe Asp Ser Glu Lys Asn Glu Ser Arg Arg Asn Leu Glu Leu Pro Arg Leu Ser Glu Thr Ser Ile Lys Asp Arg Met Ala Lys Tyr Gln Ala Ala Val Ser Lys Gln Ser Ser Ser Thr Asn Tyr Thr Asn Glu Leu Lys Ala Ser Gly Gly Glu Ile Lys Ile His Lys Met Glu Gln Lys Glu Asn Val Pro Pro Gly Pro Glu Val Cys Ile Thr His Gln Glu Gly Glu Lys Ile Ser Ala Asn Glu Asn Ser Leu Ala Val Arg Ser Thr Pro Ala Glu Asp Asp Ser Arg Asp Ser Gln Val Lys Ser Glu Val Gln Gln Pro Val His Pro Lys Pro Leu Ser Pro Asp Ser Arg Ala Ser Ser Leu Ser Glu Ser Ser Pro Pro Lys Ala Met Lys Lys Phe Gln Ala Pro Ala Arg Glu Thr Cys Val Glu Cys Gln Lys Thr Val Tyr Pro Met Glu Arg Leu Leu Ala Asn Gln Gln Val Phe His Ile Ser Cys Phe Arg Cys Ser Tyr Cys Asn Asn Lys Leu Ser Leu Gly Thr Tyr Ala Ser Leu His Gly Arg Ile Tyr Cys Lys Pro His Phe Asn Gln Leu Phe Lys Ser Lys Gly Asn Tyr Asp Glu Gly Phe Gly His Arg Pro His Lys Asp Leu Trp Ala Ser Lys Asn Glu Asn Glu Glu Ile Leu Glu Arg Pro Ala Gln Leu Ala Asn Ala Arg Glu Thr Pro His Ser Pro Gly Val Glu Asp Ala Pro Ile Ala Lys Val Gly Val Leu Ala Ala Ser Met Glu Ala Lys Ala Ser Ser Gln Gln Glu Lys Glu Asp Lys Pro Ala Glu Thr Lys Lys Leu Arg Ile Ala Trp Pro Pro Pro Thr Glu Leu Gly Ser Ser Gly Ser Ala Leu Glu Glu Gly Ile Lys Met Ser Lys Pro Lys Trp Pro Pro Glu Asp Glu Ile Ser Lys Pro Glu Val Pro Glu Asp Val Asp Leu Asp Leu Lys Lys Leu Arg Arg Ser Ser Ser Leu Lys Glu Arg Ser Arg Pro Phe Thr Val Ala Ala Ser Phe Gln Ser Thr Ser Val Lys Ser Pro Lys Thr Val Ser Pro Pro Ile Arg Lys Gly Trp Ser Met Ser Glu Gln Ser Glu Glu Ser Val Gly Gly Arg Val Ala Glu Arg Lys Gln Val Glu Asn Ala Lys Ala Ser Lys Lys Asn Gly Asn Val Gly Lys Thr Thr Trp Gln Asn Lys Glu Ser Lys Gly Glu Thr Gly Lys Arg Ser Lys Glu Gly His Ser Leu Glu Met Glu Asn Glu Asn Leu Val Glu Asn Gly Ala Asp Ser Asp Glu Asp Asp Asn Ser Phe Leu Lys Gln Gln Ser Pro Gln Glu Pro Lys Ser Leu Asn Trp Ser Ser Phe Val Asp Asn Thr Phe Ala Glu Glu Phe Thr Thr Gln Asn Gln Lys Ser Gln Asp Val Glu Leu Trp Glu Gly Glu Val Val Lys Glu Leu Ser Val Glu Glu Gln Ile Lys Arg Asn Arg Tyr Tyr Asp Glu Asp Glu Asp Glu Glu <210> 5 CA 02362300 2001-07-31 PCZ'/US00/04160 <211> 226 <212> PRT
<213> Homo Sapiens <220>
<221> misc_feature <223> Incyte ID No: 2768356CD1 <400> 5 Met Glu Ile Val Pro Ile Gly Thr Thr His Val Thr Leu Pro Val Phe Gly Asp His Phe Glu Trp Asn Lys Val Thr Ser Cys Ile His Asn Ile Leu Ser Gly Gln Arg Trp Ile Glu His Tyr Gly Glu Ile Val Ile Lys Asn Leu His Asp Asp Ser Cys Tyr Cys Lys Val Asn Phe Ile Lys Ala Lys Tyr Trp Ser Thr Asn Ala His Glu Ile Glu Gly Thr Val Phe Asp Arg Ser Gly Lys Ala Val His Arg Leu Phe Gly Lys Trp His Glu Ser Ile Tyr Cys Gly Gly Gly Ser Ser Ser Ala Cys Val Trp Arg Ala Asn Pro Met Pro Lys Gly Tyr Glu Gln Tyr Tyr Ser Phe Thr Gln Phe Ala Leu Glu Leu Asn Glu Met Asp Pro Ser Ser Lys Ser Leu Leu Pro Pro Thr Asp Thr Arg Phe Arg Pro Asp Gln Arg Phe Leu Glu Glu Gly Asn Leu Glu Glu Ala Glu Ile Gln Lys Gln Arg Ile Glu Gln Leu Gln Arg Glu Arg Arg Arg Val Leu Glu Glu Asn His Val Glu His Gln Pro Arg Phe Phe Arg Lys Ser Asp Asp Asp Ser Trp Val Ser Asn Gly Thr Tyr Leu Glu Leu Arg Lys Asp Leu Gly Phe Ser Lys Leu Asp His Pro Val Leu Trp <210> 6 <211> 500 <212> PRT
<213> Homo Sapiens <220>
<221> misc_feature <223> Incyte ID No: 5324145CD1 <400> 6 Met Tyr Cys Pro Glu Ser Ala Val Ile Leu Leu Ser Thr Thr Val Leu Glu Asn Val Leu Gln Pro Phe His Phe Arg Ala Gly Thr Met Ser Lys Leu Pro Lys Phe Glu Ile Glu Leu Pro Ala Ala Pro Lys Ser Thr Lys Pro Ser Leu Ser Glu Arg Asp Ile Ala Met Ala Thr Ile Tyr Gly Gln Leu Tyr Val Leu Phe Leu Arg His His Ser Arg Thr Ser Asn Ser Thr Gly Ala Glu Val Val Leu Tyr His Leu Pro Arg Glu Gly Ala Cys Lys Lys Met His Ile Leu Lys Leu Asn Arg Thr Gly Lys Phe Ala Leu Asn Val Val Asp Asn Leu Val Val Val WO 00/49043 CA 02362300 2001-07-31 pCT~S00/04160 His His Gln Asp Thr Glu Thr Ser Val Ile Phe Asp Ile Lys Leu Arg Gly Glu Phe Asp Gly Ser Val Thr Phe His His Pro Val Leu Pro Ala Arg Ser Ile Gln Pro Tyr Gln Ile Pro Ile Thr Gly Pro Ala Ala Val Thr Ser Gln Ser Pro Val Pro Cys Lys Leu Tyr Ser Ser Ser Trp Ile Val Phe Gln Pro Asp Ile Ile Ile Ser Ala Ser Gln Gly Tyr Leu Trp Asn Leu Gln Val Lys Leu Glu Pro Ile Val Asn Leu Leu Pro Asp Lys Gly Arg Leu Met Asp Phe Leu Leu Gln Arg Lys Glu Cys Lys Met Val Ile Leu Ser Val Cys Ser Gln Met Leu Ser Glu Ser Asp Arg Ala Ser Leu Pro Val Ile Ala Thr Val Phe Asp Lys Leu Asn His Glu Tyr Lys Lys Tyr Leu Asp Ala Glu Gln Ser Tyr Ala Met Ala Val Glu Ala Gly Gln Ser Arg Ser Ser Pro Leu Leu Lys Arg Pro Val Arg Thr Gln Ala Val Leu Asp Gln Ser Asp Val Tyr Thr His Val Leu Ser Ala Phe Val Glu Lys Lys Glu Met Pro His Lys Phe Val Ile Ala Val Leu Met Glu Tyr Ile Arg Ser Leu Asn Gln Phe Gln Ile Ala Val Gln His Tyr Leu His Glu Leu Val Ile Lys Thr Leu Val Gln His Asn Leu Phe Tyr Met Leu His Gln Phe Leu Gln Tyr His Val Leu Ser Asp Ser Lys Pro Leu Ala Cys Leu Leu Leu Ser Leu Glu Ser Phe Tyr Pro Pro Ala His Gln Leu Ser Leu Asp Met Leu Lys Arg Leu Ser Thr Ala Asn Asp Glu Ile Val Glu Val Leu Leu Ser Lys His Gln Val Leu Ala Ala Leu Arg Phe Ile Arg Gly Ile Gly Gly His Asp Asn Ile Ser Ala Arg Lys Phe Leu Asp Ala Ala Lys Gln Thr Glu Asp Asn Met Leu Phe Tyr Thr Ile Phe Arg Phe Phe Glu Gln Arg Asn Gln Arg Leu Arg Gly Ser Pro Asn Phe Thr Pro Gly Glu His Cys Glu Glu His Val Ala Phe Phe Lys Gln Ile Phe Gly Asp Gln Ala Leu Met Arg Pro Thr Thr Phe <210> 7 <211> 272 <212> PRT
<213> Homo Sapiens <220>
<221> misc_feature <223> Incyte ID No: 1004646CD1 <400> 7 Met Ser Cys His Asn Cys Ser Asp Pro Gln Val Leu Cys Ser Ser Gly Gln Leu Phe Leu Gln Pro Leu Trp Asp His Leu Arg Ser Trp Glu Ala Leu Leu Gln Ser Pro Phe Phe Pro Val Ile Phe Ser Ile Thr Thr Tyr Val Gly Phe Cys Leu Pro Phe Val Val Leu Asp Ile Leu Cys Ser Trp Val Pro Ala Leu Arg Arg Tyr Lys Ile His Pro Asp Phe Ser Pro Ser Ala Gln Gln Leu Leu Pro Cys Leu Gly Gln Thr Leu Tyr Gln His Val Met Phe Val Phe Pro Val Thr Leu Leu His Trp Ala Arg Ser Pro Ala Leu Leu Pro His Glu Ala Pro Glu Leu Leu Leu Leu Leu His His Ile Leu Phe Cys Leu Leu Leu Phe Asp Met Glu Phe Phe Val Trp His Leu Leu His His Lys Val Pro Trp Leu Tyr Arg Thr Phe His Lys Val His His Gln Asn Ser Ser Ser Phe Ala Leu Ala Thr Gln Tyr Met Ser Val Trp Glu Leu Phe Ser Leu Gly Phe Phe Asp Met Met Asn Val Thr Leu Leu Gly Cys His Pro Leu Thr Thr Leu Thr Phe His Val Val Asn Ile Trp Leu Ser Val Glu Asp His Ser Gly Tyr Asn Phe Pro Trp Ser Thr His Arg Leu Val Pro Phe Gly Trp Tyr Gly Gly Val Val His His Asp Leu His His Ser His Phe Asn Cys Asn Phe Ala Pro Tyr Phe Thr His Trp Asp Lys Ile Leu Gly Thr Leu Arg Thr Ala Ser Val Pro Ala Arg <210> 8 <211> 282 <212> PRT
<213> Homo sapiens <220>
<221> misc_feature <223> Incyte ID No: 1802851CD1 <400> 8 Met Ser Gly Gly Trp Met Ala Gln Val Gly Ala Trp Arg Thr Gly Ala Leu Gly Leu Ala Leu Leu Leu Leu Leu Gly Leu Gly Leu Gly Leu Glu Ala Ala Ala Ser Pro Leu Ser Thr Pro Thr Ser Ala Gln Ala Ala Gly Pro Ser Ser Gly Ser Cys Pro Pro Thr Lys Phe Gln Cys Arg Thr Ser Gly Leu Cys Val Pro Leu Thr Trp Arg Cys Asp Arg Asp Leu Asp Cys Ser Asp Gly Ser Asp Glu Glu Glu Cys Arg Ile Glu Pro Cys Thr Gln Lys Gly Gln Cys Pro Pro Pro Pro Gly Leu Pro Cys Pro Cys Thr Gly Val Ser Asp Cys Ser Gly Gly Thr Asp Lys Lys Leu Arg Asn Cys Ser Arg Leu Ala Cys Leu Ala Gly Glu Leu Arg Cys Thr Leu Ser Asp Asp Cys Ile Pro Leu Thr Trp Arg Cys Asp Gly His Pro Asp Cys Pro Asp Ser Ser Asp Glu Leu Gly Cys Gly Thr Asn Glu Ile Leu Pro Glu Gly Asp Ala Thr Thr Met Gly Pro Pro Val Thr Leu G1u Ser Val Thr Ser Leu Arg Asn Ala Thr Thr Met Gly Pro Pro Val Thr Leu Glu Ser Val Pro Ser Val Gly Asn Ala Thr Ser Ser Ser Ala Gly Asp Gln Ser Gly Ser Pro Thr Ala Tyr Gly Val Ile Ala Ala Ala Ala Val Leu Ser Ala Ser Leu Val Thr Ala Thr Leu Leu Leu Leu Ser Trp Leu Arg Ala Gln Glu Arg Leu Arg Pro Leu Gly Leu Leu Val Ala Met Lys Glu Ser Leu Leu Leu Ser Glu Gln Lys Thr Ser Leu Pro <210> 9 <211> 437 <212> PRT
<213> Homo Sapiens <220>
<221> misc_feature <223> Incyte ID No: 2764333CD1 <400> 9 Met Ser Glu Glu Lys Asp Cys Gly Gly Gly Asp Ala Leu Ser Asn Gly Ile Lys Lys His Arg Thr Ser Leu Pro Ser Pro Met Phe Ser Arg Asn Asp Phe Ser Ile Trp Ser Ile Leu Arg Lys Cys Ile Gly Met Glu Leu Ser Lys Ile Thr Met Pro Val Ile Phe Asn Glu Pro Leu Ser Phe Leu Gln Arg Leu Thr Glu Tyr Met Glu His Thr Tyr Leu Ile His Lys Ala Ser Ser Leu Ser Asp Pro Val Glu Arg Met Gln Cys Val Ala Ala Phe Ala Val Ser Ala Val Ala Ser Gln Trp Glu Arg Thr Gly Lys Pro Phe Asn Pro Leu Leu Gly Glu Thr Tyr Glu Leu Val Arg Asp Asp Leu Gly Phe Arg Leu Ile Ser Glu Gln Val Ser His His Pro Pro Ile Ser Ala Phe His Ala Glu Gly Leu Asn Asn Asp Phe Ile Phe His Gly Ser Ile Tyr Pro Lys Leu Lys Phe Trp Gly Lys Ser Val Glu Ala Glu Pro Lys Gly Thr Ile Thr Leu Glu Leu Leu Glu His Asn Glu Ala Tyr Thr Trp Thr Asn Pro Thr Cys Cys Val His Asn Ile Ile Val Gly Lys Leu Trp Ile Glu Gln Tyr Gly Asn Val Glu Ile Ile Asn His Lys Thr Gly Asp Lys Cys Val Leu Asn Phe Lys Pro Cys Gly Leu Phe Gly Lys Glu Leu His Lys Val Glu Gly Tyr Ile Gln Asp Lys Ser Lys Lys Lys Leu Cys Ala Leu Tyr Gly Lys Trp Thr Glu Cys Leu Tyr Ser Val Asp Pro Ala Thr Phe Asp Ala Tyr Lys Lys Asn Asp Lys Lys Asn Thr Glu Glu Lys Lys Asn Ser Lys Gln Met Ser Thr Ser Glu Glu Leu Asp Glu Met Pro Val Pro Asp Ser Glu Ser Val Phe Ile Ile Pro Gly Ser Val Leu Leu Trp Arg Ile Ala Pro Arg Pro Pro Asn Ser Ala Gln Met Tyr Asn Phe Thr Ser Phe Ala Met Val Leu Asn Glu Val Asp Lys Asp Met Glu Ser Val Ile Pro Lys Thr Asp Cys Arg Leu Arg Pro Asp Ile Arg Ala Met Glu Asn Gly Glu Ile Asp Gln Ala Ser Glu Glu Lys Lys Arg Leu Glu Glu Lys G1n Arg Ala Ala Arg Lys Asn Arg Ser Lys Ser Glu Glu Asp Trp Lys Thr Arg Trp Phe His Gln Gly Pro Asn Pro Tyr Asn Gly Ala Gln Asp Trp Ile Tyr Ser Gly Ser Tyr Trp Asp Arg Asn Tyr Phe Asn Leu Pro Asp Ile Tyr <210> 10 <211> 427 <212> PRT
<213> Homo sapiens <220>
<221> misc_teature <223> Incyte ID No: 2798021CD1 <400> 10 Met Arg Gln Ala Ala Ala Asp Ala Lys Pro Glu Ser Leu Met Lys Arg Leu Glu Glu Glu Ile Lys Phe Asn Leu Tyr Met Val Thr Glu Lys Phe Pro Lys Glu Leu Glu Asn Lys Lys Lys Glu Leu His Phe Leu Gln Lys Val Val Ser Glu Pro Ala Met Gly His Ser Asp Leu Leu Glu Leu Glu Ser Lys Ile Asn Glu Ile Asn Thr Glu Ile Asn Gln Leu Ile Glu Lys Lys Met Met Arg Asn Glu Pro Ile Glu Gly Lys Leu Ser Leu Tyr Arg Gln Gln Ala Ser Ile Ile Ser Arg Lys Lys Glu Ala Lys Ala Glu Glu Leu Gln Glu Ala Lys Glu Lys Leu Ala Ser Leu Glu Arg Glu Ala Ser Val Lys Arg Asn Gln Thr Arg Glu Phe Asp Gly Thr Glu Val Leu Lys Gly Asp Glu Phe Lys Arg Tyr Val Asn Lys Leu Arg Ser Lys Ser Thr Val Phe Lys Lys Lys His Gln Ile Ile Ala Glu Leu Lys Ala Glu Phe Gly Leu Leu Gln Arg Thr Glu Glu Leu Leu Lys Gln Arg His Glu Asn Ile Gln Gln Gln Leu Gln Thr Met Glu Glu Lys Lys Gly Ile Ser Gly Tyr Ser Tyr Thr Gln Glu Glu Leu Glu Arg Val Ser Ala Leu Lys Ser Glu Val Asp Glu Met Lys Gly Arg Thr Leu Asp Asp Met Ser Glu Met Val Lys Lys Leu Tyr Ser Leu Val Ser Glu Lys Lys Ser Ala Leu Ala Ser Val Ile Lys Glu Leu Arg Gln Leu Arg Gln Lys Tyr Gln Glu Leu Thr Gln Glu Cys Asp Glu Lys Lys Ser Gln Tyr Asp Ser Cys Ala Ala Gly Leu Glu Ser Asn Arg Ser Lys Leu Glu Gln Glu Val Arg Arg Leu Arg Glu Glu Cys Leu Gln Glu Glu Ser Arg Tyr His Tyr Thr Asn Cys Met Ile Lys Asn Leu Glu Val Gln Leu Arg Arg Ala Thr Asp Glu Met Lys Ala Tyr Ile Ser Ser Asp Gln Gln Glu Lys Arg Lys Ala Ile Arg Glu Gln Tyr Thr Lys Asn Thr Ala Glu Gln Glu Asn Leu Gly Lys Lys Leu Arg Glu Lys Gln Lys Val Ile Arg Glu Ser His Gly Pro Asn Met Lys Gln Ala Lys Met Trp Arg Asp Leu Glu Gln Leu Met Glu Cys Lys Lys Gln Cys Phe Leu Lys Gln Gln Ser Gln Thr Ser Ile Gly Gln Val Ile Gln Glu Gly Gly Glu Asp Arg Leu Ile Leu <210> 11 <211> 564 <212> PRT
<213> Homo sapiens <220>
<221> misc_feature <223> Incyte ID No: 3335404CD1 <400> 11 Met Asp Ser Arg Tyr Asn Ser Thr Ala Gly Ile Gly Asp Leu Asn Gln Leu Ser Ala Ala Ile Pro Ala Thr Arg Val Glu Val Ser Val Ser Cys Arg Asn Leu Leu Asp Arg Asp Thr Phe Ser Lys Ser Asp Pro Ile Cys Val Leu Tyr Val Gln Gly Val Gly Asn Lys Glu Trp Arg Glu Phe Gly Arg Thr Glu Val Ile Asp Asn Thr Leu Asn Pro Asp Phe Val Arg Lys Phe Ile Leu Asp Tyr Phe Phe Glu Glu Arg Glu Asn Leu Arg Phe Asp Leu Tyr Asp Val Asp Ser Lys Ser Pro Asn Leu Ser Lys His Asp Phe Leu Gly Gln Val Phe Cys Thr Leu Gly Glu Ile Val Gly Ser Gln Gly Ser Arg Leu Glu Lys Pro Ile Val Gly Ile Pro Gly Lys Lys Cys Gly Thr Ile Ile Leu Thr Ala Glu Glu Leu Asn Cys Cys Arg Asp Ala Val Leu Met Gln Phe Cys Ala Asn Lys Leu Asp Lys Lys Asp Phe Phe Gly Lys Ser Asp Pro Phe Leu Val Phe Tyr Arg Ser Asn Glu Asp Gly Ser Phe Thr Ile Cys His Lys Thr Glu Val Val Lys Asn Thr Leu Asn Pro Val Trp Gln Ala Phe Lys Ile Ser Val Arg Ala Leu Cys Asn Gly Asp Tyr Asp Arg Thr Ile Lys Val Glu Val Tyr Asp Trp Asp Arg Asp Gly Ser His Asp Phe Ile Gly Glu Phe Thr Thr Ser Tyr Arg Glu Leu Ser Arg Gly Gln Ser Gln Phe Asn Val Tyr Glu Val Val Asn Pro CA 02362300 2001-07-31 pCT~S00/04160 Lys Lys Lys Gly Lys Lys Lys Lys Tyr Thr Asn Ser Gly Thr Val Thr Leu Leu Ser Phe Leu Val Glu Thr Glu Val Ser Phe Leu Asp Tyr Ile Lys Gly Gly Thr Gln Ile Asn Phe Thr Val Ala Ile Asp Phe Thr Ala Ser Asn Gly Asn Pro Ala Gln Pro Thr Ser Leu His Tyr Met Asn Pro Tyr Gln Leu Asn Ala Tyr Gly Met Ala Leu Lys Ala Val Gly Glu Ile Val Gln Asp Tyr Asp Ser Asp Lys Met Phe Pro Ala Leu Gly Phe Gly Ala Lys Leu Pro Pro Asp Gly Arg Ile Ser His Glu Phe Ala Leu Asn Gly Asn Pro Gln Asn Pro Tyr Cys Asp Gly Ile Glu Gly Val Met Glu Ala Tyr Tyr Arg Ser Leu Lys Ser Val Gln Leu Tyr Gly Pro Thr Asn Phe Ala Pro Val Ile Asn His Val Ala Arg Tyr Ala Ser Ser Val Lys Asp Gly Ser Gln Tyr Phe Val Leu Leu Ile Val Thr Asp Gly Val Ile Ser Asp Met Ala Gln Thr Lys Glu Ser Ile Val Asn Ala Ser Lys Leu Pro Met Ser Ile Ile Ile Val Gly Val Gly Pro Ala Glu Phe Asp Ala Met Val Glu Leu Asp Gly Asp Asp Val Arg Val Ser Ser Arg Gly Lys Tyr Ala Glu Arg Asp Ile Val Gln Phe Val Pro Phe Arg Asp Tyr Ile Asp Arg Ser Gly Asn His Ile Leu Ser Met Ala Arg Leu Ala Lys Asp Val Leu Ala Glu Ile Pro Glu Gln Phe Leu Ser Tyr Met Arg Ala Arg Gly Ile Lys Pro Ser Pro Ala Pro Pro Pro Tyr Thr Pro Pro Thr His Val Leu Gln Thr Gln Ile <210> 12 <211> 297 <212> PRT
<213> Homo Sapiens <220>
<221> misc_feature <223> Incyte ID No: 3735780CD1 <400> 12 Met Met Asp Ser Glu Ala His Glu Lys Arg Pro Pro Ile Leu Thr Ser Ser Lys Gln Asp Ile Ser Pro His Ile Thr Asn Val Gly Glu Met Lys His Tyr Leu Cys Gly Cys Cys Ala Ala Phe Asn Asn Val Ala Ile Thr Phe Pro Ile Gln Lys Val Leu Phe Arg Gln Gln Leu Tyr Gly Ile Lys Thr Arg Asp Ala Ile Leu Gln Leu Arg Arg Asp Gly Phe Arg Asn Leu Tyr Arg Gly Ile Leu Pro Pro Leu Met Gln Lys Thr Thr Thr Leu Ala Leu Met Phe Gly Leu Tyr Glu Asp Leu Ser Cys Leu Leu His Lys His Val Ser Ala Pro Glu Phe Ala Thr WO 00/49043 CA 02362300 2001-07-31 pCT~JS00/04160 Ser Gly Val Ala Ala Val Leu Ala Gly Thr Thr Glu Ala Ile Phe Thr Pro Leu Glu Arg Val Gln Thr Leu Leu Gln Asp His Lys His His Asp Lys Phe Thr Asn Thr Tyr Gln Ala Phe Lys Ala Leu Lys Cys His Gly Ile Gly Glu Tyr Tyr Arg Gly Leu Val Pro Ile Leu Phe Arg Asn Gly Leu Ser Asn Val Leu Phe Phe Gly Leu Arg Gly Pro Ile Lys Glu His Leu Pro Thr Ala Thr Thr His Ser Ala His Leu Val Asn Asp Phe Ile Cys Gly Gly Leu Leu Gly Ala Met Leu Gly Phe Leu Phe Phe Pro Ile Asn Val Val Lys Thr Arg Ile Gln Ser Gln Ile Gly Gly Glu Phe Gln Ser Phe Pro Lys Val Phe Gln Lys Ile Trp Leu Glu Arg Asp Arg Lys Leu Ile Asn Leu Phe Arg Gly Ala His Leu Asn Tyr His Arg Ser Leu Ile Ser Trp Gly Ile Ile Asn Ala Thr Tyr Glu Phe Leu Leu Lys Val Ile <210> 13 <211> 2174 <212> DNA
<213> Homo sapiens <220>
<221> misc_feature <223> Incyte ID No: 161190CB1 <400> 13 ggatccacac agctcagaac agctggatct tgctcacact ctttcaagag aagcttcctt 60 ggacaaaagg accctgcctt ggtgtgagag tgagggcaga gggagctgga gcaagtagaa 120 tttctctaaa taccagctgg ctggggccca ggagattaaa aaacaccggg ctagggttaa 180 gaaaaaaaac gaacccttcc agtcaggtca gtgactggag agctccaagg aaagtctctc 240 agtgacctgg ctgctggcac catggactca gaaaagaaac gctttactga agaggccacc 300 aaatacttcc gggagagagt cagcccagtg catctgcaaa tcctgctgac taacaatgaa 360 gcctggaaga gattcgtgac tgcggctgaa ttgcccaggg atgaggcaga tgctctctac 420 gaagctctga agaagcttag aacatatgca gctattgagg acgaatatgt gcagcagaaa 480 gatgagcagt ttagggaatg gtttttgaaa gagtttcccc aagtcaagag gaagatccag 540 gagtccatag aaaagcttcg tgcccttgca aatggtattg aagaggtcca cagaggctgc 600 accatctcca atgtggtgtc cagctccact ggcgctgcct ctggcatcat gtcccttgct 660 ggtcttgttt tggcaccatt tacagcaggg acgagtctgg cccttactgc agctggggta 720 gggctgggag cagcgtctgc tgtgactggg atcaccacca gcatcgtgga gcactcatac 780 acatcatcag cagaagctga agccagcagg ctgactgcaa ccagcattga ccgattgaag 840 gtatttaagg aagttatgcg tgacatcaca cccaacttac tttcccttct taataattat 900 tacgaagcca cacaaaccat tgggagtgaa atccgtgcca tcaggcaagc cagagccagg 960 gcccgactcc ctgtgaccac ctggcgaatc tcagctggaa gtggaggtca agcagagaga 1020 acgattgcag gcaccacccg ggcagtgagc agaggagccc ggatcctgag tgcgaccact 1080 tcaggcatct tccttgcact ggatgtggtc aaccttgtat acgagtcaaa gcacttgcat 1140 gagggggcaa agtctgcatc tgctgaggag ctgaggcggc aggctcagga gctggaggag 1200 aatctaatgg agctcactca gatctatcag cgtctgaatc catgccatac ccactgaccc 1260 cagaccagtg cagccagcag gggaggtgag ccatacacag gccacgacaa aatgcaggca 1320 ttttattagg gggataaaga gggcaaggta aagtttatgg agctgagtgt tagtgacttt 1380 ggcatttctg tagctgagca cagcagggga ggggttaatg cagatggcaa gtgcaccaag 1440 gagaaggcag gaatgctgga gcctggaata agggaagaga ggggactgga gagtgtgggg 1500 aataggaaga agaaatttcc tttagactaa cgaatatatt ggggggagga atagagggga 1560 ggtgtgcagg aaccagcaat gagaaggcca ggaaaagaaa gagctgaaaa tgcagaaagc 1620 cgaagagtta gaacttttgg atacagcaga agaaacagcg gctccactac cgacctgccc 1680 ccggttcgat gtccttccaa gaatgaagtc tttccctggt gatggtcccc tgccctgtct 1740 ttccagcatc cactctgtct tgtcctcctg gaagtgtatc tcagtcagcc agtggcttct 1800 tgatgatggc ggtggaggtg gtggttgtag tgtgatggat cccctttagg ttatttaggg 1860 gtatatgtcc cctgcttgaa ccctgaaggc caggtaatga gccatggcca ttgtccccag 1920 WO 00/49043 CA 02362300 2001-07-31 pCT~S00/04160 ctgaggacca ggtgtctcta aaaacccaaa catcctggag agtatgcgag aacctaccaa 1980 gaaaaacagt ctcattactc atatacagca ggcaaagaga cagaaaatta actgaaaagc 2040 agtttagaga ctgggggagg ccggatctct agagccatcc tgctgagtgc cctgtgtgta 2100 agtcctaata aactcaccta ctcaccaaaa aaaaaaacga aaaactaaag aacaggagaa 2160 aaaaagggga gggc 2174 <210> 14 <211> 2620 <212> DNA
<213> Homo Sapiens <220>
<221> misc_feature <223> Incyte ID No: 1292575CB1 <400> 14 agtgagctcg gccggcaacc gagggacccg cgtccagatc ttcagtgtct attggatttt 60 tccaagagaa agtttgtaaa attccttaca ctgtagatgt ggatcagata cgatgattca 120 gtagaagagc acatgtcagg ggcagtggag gctggctgct gaaggatgaa cggagaggaa 180 gaattctttg atgccgtcac aggctttgat tctgataact cttctgggga attttcagag 240 gcaaatcaga aagtcacggg aatgattgac ttagacacca gcaaaaataa taggattggg 300 aaaactgggg agaggccctc tcaagagaac ggaattcaga aacacaggac atcgctgccg 360 gctcccatgt tcagcagaag cgacttcagc gtgtggacca tcctgaagaa gtgtgttggc 420 ctggagctgt ccaagatcac gatgccaatc gccttcaacg agcctctgag cttcttgcag 480 cggatcacgg agtacatgga gcacgtgtac ctcatccaca gggcctcctg ccagccccag 540 cccctggaga ggatgcagtc tgtggctgct tttgctgttt cggctgtggc ttcccagtgg 600 gagaggaccg gcaaaccatt taatccactc ttgggagaaa cgtatgaatt aatcagggaa 660 gatttaggat tcagatttat atcggaacag gtcagtcacc acccccccat cagtgcgttc 720 cactcggaag gtctcaacca tgacttcctg ttccatggct ccatctaccc caagctcaag 780 ttctggggca aaagcgtgga ggcggagccc cgaggcacca tcaccctgga gctgctcaaa 840 cataatgaag cctacacctg gaccaacccc acctgctgcg tccacaacgt catcatcggg 900 aagctgtgga tagagcagta tgggacagtg gagattttaa accacagaac tggacataag 960 tgtgtgcttc actttaaacc gtgtggatta tttggaaaag aacttcacaa ggtggaagga 1020 cacattcaag acaaaaacaa aaagaagctc tttatgatct atggcaaatg gacggaatgt 1080 ttgtggggca tagatcctgt ttcgtatgaa tccttcaaga agcaggagag gagaggtgac 1140 cacctgagaa aggccaagct ggatgaagac tccgggaagg ctgacagcga cgtggctgac 1200 gacgtgcctg tggcccagga gaccgtgcag gtcattcctg gcagcaagct gctctggagg 1260 atcaacaccc ggccccccaa ctctgcccag atgtataatt tcaccagttt cactgtgagc 1320 ctcaacgagc tggagacagg catggagaag accctgccac ccacggactg ccgcctgcgc 1380 cctgacatcc gcggcatgga gaatggcaac atggatctgg ccagccagga gaaggagcgg 1440 ctggaggaga agcagagaga agcacggagg gagcgggcca aggaggaggc agagtggcag 1500 acgaggtggt tctacccagg caataacccc tacactggga cccccgactg gttgtatgca 1560 ggggattact ttgagcggaa tttctccgac tgcccagata tctactgagg gcctggaggg 1620 gcctggggcc cgggaccgga ggctgacgag gctggacttc ctcgagtggc cactgtgagc 1680 ctcgtcacag cagaaaccaa cttttctaac gactgagttc gcggagatag catcatccct 1740 gatcaaggat gtaattctaa ttaactgttg attgccaaac atttcactct gctgtgccgt 1800 ctcttcataa agcttcactt gggatcatcg tcttcattaa ggtttcaaca gggaaattct 1860 tcacggcgcc cttttatgtg gcagaaatca gctggggctt gtttagcttc cagcacactc 1920 tcagtcatag catgtgtagc taaaggaagt aatgggaagg ggttcatgtt ctctttataa 1980 tgcagtggca aaaggttctg aaagcctttt aaactcgaac cagtggggga aagatggatc 2040 ttgaagctaa tcctgcagag agttttatag aggccaggga ttgccttcta aattatgata 2100 aaacagaagt gaagagtttc agagcatcag attgagtgaa aagttgtcag attctgtatt 2160 ttttaacaat cttcaataat gtaaagatta cttttaaaat atttaagtta aaactacttg 2220 aatagtattt tgctgaagag caagatatgc attaatcacc ggttttatac tgtccaaaat 2280 gaagcatccc cgtgacaaac cagagtgggc agaagcatcg agagcgtgac aggaaatccc 2340 aagactgctt ccgcctcaga ggcgtcccgg ctgcgattcg ctgccctgtt gtcagtgagg 2400 cctggctgtc accgcacacc gcgtccgtgt ctccaggggg ttcctttctt ctcacacgtc 2460 gcgtgtaccc atagcactct tgtgtttctg tttttcccag tatgcatgtt taaaatagaa 2520 gtgacaagaa tcacatccgg ttgtgtcctg tgggagggtc agaggcagaa tctacttaca 2580 gtggtgtaat taaagttatt taaccaaaaa aaaaaaaaaa 2620 <210> 15 <211> 2426 <212> DNA
<213> Homo Sapiens WO 00/49043 cA 02362300 2001-07-31 pCT/US00/04160 <220>
<221> misc_feature <223> Incyte ID No: 2454393CB1 <400> 15 gaaacactag acatgatacg aattcgagct cgtacccttc gaggcacgct gagaaggagc 60 agacaagatg gcgacgtccg tggggcaccg atgtctggga ttactgcacg gggtcgcgcc 120 gtggcggagc agcctccatc cctgtgagat cactgccctg agccaatccc tacagccctt 180 acggaagctg ccttttagag cctttcgcac agatgccaga aaaatccaca ctgcccctgc 240 ccgaaccatg ttcctgctgc gtcccctgcc cattctgttg gtgacaggcg gcgggtatgc 300 agggtaccgg cagtatgaga agtacaggga gcgagagctg gagaagctgg gattggagat 360 tccacccaaa cttgctggtc actgggaggt ggctttgtac aagtcagtgc caacgcgctt 420 gctgtcacgg gcctggggtc gcctcaatca ggtggagctg ccacactggc tgcgcaggcc 480 cgtctacagc ctgtacatct ggacgtttgg ggtgaacatg aaagaggccg ctgtggagga 540 cctgcatcac taccgcaacc tcagcgagtt cttccggcgc aagctgaagc cgcaggcccg 600 gcctgtctgt ggcctgcaca gcgtgattag cccatcggat ggaaggatcc tcaactttgg 660 gcaggtgaag aactgtgagg tggagcaggt aaagggggtc acctactccc tggagtcgtt 720 cctgggcccg cgtatgtgca cagaggacct gcccttccca ccagccgcgt cgtgtgactc 780 cttcaagaac cagctggtca cccgggaagg gaatgagctc tatcactgtg tcatctacct 840 ggcccctggg gactaccact gcttccactc ccccaccgac tggactgtgt cccaccggcg 900 ccacttccca ggctccctga tgtcagtgaa ccctggcatg gctcgctgga tcaaagagct 960 cttctgccat aacgagcggg tggtcctgac gggggactgg aaacatggct tcttctcact 1020 gacagctgtg ggggccacca acgtgggctc cattcgcatc tactttgacc gggacctgca 1080 cacaaacagc ccaaggcaca gcaagggctc ctacaatgac ttcagcttcg tgacgcacac 1140 caatagagag ggcgtcccca tgcgtaaggg cgagcacctg ggcgagttca acctgggctc 1200 caccatcgtg ctcatcttcg aggcccccaa ggacttcaat ttccagctga aaacaggaca 1260 gaaaatccgc tttggggaag ccctgggctc gctctagagt ctctttcctg attatggctg 1320 ctaagggatc ttttccaaac agagtgaggg tcttttcaag agggaggccc atgaggccat 1380 ccaggtaagg gcctgcctca gcgtggttgg gagtctgacc aggtaggact tgaatgattc 1440 ggctaccacc tgttccagag gtgcagacaa gaggtggcga gagcccccat catgcccctc 1500 aacctatccc gttccttctg cctacaaata aaaagtgcag gctggaatga tctcagtcac 1560 atttggatct ttttaaacac tgtatagacg gaagagcctg cattcctgac cgaaccttca 1620 gttggtctcg gttgtcgttt tttcttgctg ctcctccccc catcacctga gctgttttct 1680 gttggcccct tttgtttttt ggccttaacg ctcctgctgc acagggtgag gtacctcctt 1740 ggcacagact gtggatgcct ctcccccagc agagccacac agccttcgtg acaactgctt 1800 tccgttccca cattcacctc atcctgctct ttagaaaaag cagtctttgt gcttgtggct 1860 gaacgcatca ccctggactc tgctagtgtc ttctgaggac actgatgaca ctgattaatg 1920 atacagacct ttgcaggacc tgatgagtga cccttctgga gctggccagg tcctctgcag 1980 caggcaagac caatcaatca ctgaacctgc ctcatggcac cagagtgaac agggcaggca 2040 ggtagtaggc ccagctgggg aaatgggaga gttcctgtcc ccctccacat atccctacat 2100 gaaatatggg aaagttgctg ctattgattc agggtctgtc ttggaggcag aggacccttg 2160 gtggatagtt ggtcagtgcc tggaaaacct gtcccagttt atcaggaacg caggcctggg 2220 gagcccccag tggcggggac agggccagat ttcatgttga ccctggggat gctgtgaatt 2280 tctcctgcag gagagacatc attgaatttt ttcaactgta tcagtagcac agtatttttg 2340 tatgaaaagt gggagacttc tgaacagtaa ttcatttaat tgcaaagcat tttgaaataa 2400 aaaaaatcaa acttaaaaaa aaaaaa 2426 <210> 16 <211> 3705 <212> DNA
<213> Homo sapiens <220>
<221> misc_feature <223> Incyte ID No: 2766980CB1 <400> 16 ggccgcagga gcagtaggtg ttagcagctt ggtcgcgaca ggtgcgctag gtagagcgcc 60 gggacctgtg acagggctgg tagcagcgca gaggaaaggc ggcttttagc caggtatttc 120 agtgtctgta gacaagatgg aatcatctcc atttaataga cggcaatgga cctcactatc 180 attgagggta acagccaaag aactttctct tgtcaacaag aacaagtcat cggctattgt 240 ggaaatattc tccaagtacc agaaagcagc tgaagaaaca aacatggaga agaagagaag 300 taacaccgaa aatctctccc agcactttag aaaggggacc ctgactgtgt taaagaagaa 360 gtgggagaac ccagggctgg gagcagagtc tcacacagac tctctacgga acagcagcac 420 tgagattagg cacagagcag accatcctcc tgctgaagtg acaagccacg ctgcttctgg 480 agccaaagct gaccaagaag aacaaatcca ccccagatct agactcaggt cacctcctga 540 agccctcgtt cagggtcgat atccccacat caaggacggt gaggatctta aagaccactc 600 aacagaaagt aaaaaaatgg aaaattgtct aggagaatcc aggcatgaag tagaaaaatc 660 agaaatcagt gaaaacacag atgcttcggg caaaatagag aaatataatg ttccgctgaa 720 caggcttaag atgatgtttg agaaaggtga accaactcaa actaagattc tccgggccca 780 aagccgaagt gcaagtggaa ggaagatctc tgaaaacagc tattctctag atgacctgga 840 aataggccca ggtcagttgt catcttctac atttgactcg gagaaaaatg agagtagacg 900 aaatctggaa cttccacgcc tctcagaaac ctctataaag gatcgaatgg ccaagtacca 960 ggcagctgtg tccaaacaaa gcagctcaac caactataca aatgagctga aagccagtgg 1020 tggcgaaatc aaaattcata aaatggagca aaaggagaat gtgcccccag gtcctgaggt 1080 ctgcatcacc catcaggaag gggaaaagat ttctgcaaat gagaatagcc tggcagtccg 1140 ttccacccct gccgaagatg actcccgtga ctcccaggtt aagagtgagg ttcaacagcc 1200 tgtccatccc aagccactaa gtccagattc cagagcctcc agtctttctg aaagttctcc 1260 tcccaaagca atgaagaagt ttcaggcacc tgcaagagag acctgcgtgg aatgtcagaa 1320 gacagtctat ccaatggagc gtctcttggc caaccagcag gtgtttcaca tcagctgctt 1380 ccgttgctcc tattgcaaca acaaactcag tctaggaaca tatgcatctt tacatggaag 1440 aatctattgt aagcctcact tcaatcaact ctttaaatct aagggcaact atgatgaagg 1500 ctttgggcac agaccacaca aggatctatg ggcaagcaaa aatgaaaacg aagagatttt 1560 ggagagacca gcccagcttg caaatgcaag ggagacccct cacagcccag gggtagaaga 1620 tgcccctatt gctaaggtgg gtgtcctggc tgcaagtatg gaagccaagg cctcctctca 1680 gcaggagaag gaagacaagc cagctgaaac caagaagctg aggatcgcct ggccaccccc 1740 cactgaactt ggaagttcag gaagtgcctt ggaggaaggg atcaaaatgt caaagcccaa 1800 atggcctcct gaagacgaaa tcagcaagcc cgaagttcct gaggatgtcg atctagatct 1860 gaagaagcta agacgatctt cttcactgaa ggaaagaagc cgcccattca ctgtagcagc 1920 ttcatttcaa agcacctctg tcaagagccc aaaaactgtg tccccaccta tcaggaaagg 1980 ctggagcatg tcagagcaga gtgaagagtc tgtgggtgga agagttgcag aaaggaaaca 2040 agtggaaaat gccaaggctt ctaagaagaa tgggaatgtg ggaaaaacaa cctggcaaaa 2100 caaagaatct aaaggagaga cagggaagag aagtaaggaa ggtcatagtt tggagatgga 2160 gaatgagaat cttgtagaaa atggtgcaga ctccgatgaa gatgataaca gcttcctcaa 2220 acaacaatct ccacaagaac ccaagtctct gaattggtcg agttttgtag acaacacctt 2280 tgctgaagaa ttcactactc agaatcagaa atcccaggat gtggaactct gggagggaga 2340 agtggtcaaa gagctctctg tggaagaaca gataaagaga aatcggtatt atgatgagga 2400 tgaggatgaa gagtgacaaa ttgcaatgat gctgggcctt aaattcatgt tagtgttagc 2460 gagccactgc cctttgtcaa aatgtgatgc acataagcag gtatcccagc atgaaatgta 2520 atttacttgg aagtaacttt ggaaaagaat tccttcttaa aatcaaaaac aaaacaaaaa 2580 aacacaaaaa acacattcta aatactagag ataactttac ttaaattctt cattttagca 2640 gtgatgatat gcataagtgc tgtaaggctt gtaactgggg aaatattcca cctgataata 2700 gcccagattc tactgtattc ccaaaaggca atattaaggt agatagatga ttagtagtat 2760 attgttacac actattttgg aattagagaa catacagaag gaatttaggg gcttaaacat 2820 tacgactgaa tgcactttag tataaagggc acagtttgta tatttttaaa tgaataccaa 2880 tttaattttt tagtatttac ctgttaagag attatttagt ctttaaattt tttaggttaa 2940 ttttcttgct gtgatatata tgaggaattt actactttat gtcctgctct ctaaactaca 3000 tcctgaactc gacgtcctga ggtataatac aacagagcac tttttgaggc aattgaaaaa 3060 ccaacctaca ctcttcggtg cttagagaga tctgctgtct cccaaataag cttttgtatc 3120 tgccagtgaa tttactgtac tccaaatgat tgctttcttt tctggtgata tctgtgcttc 3180 tcataattac tgaaagctgc aatattttag taataccttc gggatcactg tcccccatct 3240 tccgtgttag agcaaagtga agagtttaaa ggaggaagaa gaaagaactg tcttacacca 3300 cttgagctca gacctctaaa ccctgtattt cccttatgat gtcccctttt tgagacacta 3360 atttttaaat acttactagc tctgaaatat attgattttt atcacagtat tctcagggtg 3420 aaattaaacc aactataggc ctttttcttg ggatgatttt ctagtcttaa ggtttgggga 3480 cattataaac ttgagtacat ttgttgtaca cagttgatat tccaaattgt atggatggga 3540 gggagaggtg tcttaagctg taggcttttc tttgtactgc atttatagag atttagcttt 3600 aatatttttt agagatgtaa aacattctgc tttcttagtc ttacctagtc tgaaacattt 3660 ttattcaata aagattttaa ttaaaatttg aaaaaaaaaa aaaaa 3705 <210> 17 <211> 1273 <212> DNA
<213> Homo sapiens <n>
<220>
<221> unsure <222> 1237 <223> a or g or c or t, unknown, or other <220>
<221> misc_feature <223> Incyte ID No: 2768356CB1 <400> 17 gccaccatcc gcctatctct gcgtgtcatg ctgagtctag aaattttgtt ttctggcaag 60 atgtgagatg gaaaaacaaa ttctggggca aatccatgga aattgttcca attggcacaa 120 cccatgtgac tctgccagtt tttggggatc attttgagtg gaacaaagtg acctcttgca 180 tccataacat cttaagcggg cagaggtgga ttgagcacta tggagagatt gtcatcaaga 240 acctgcatga tgattcctgc tactgcaaag tgaattttat aaaggcaaaa tactggagca 300 ctaatgccca tgagattgaa ggcacagtgt ttgacaggag tggaaaagcg gttcatcggc 360 tgtttgggaa atggcatgaa agcatctact gtggcggcgg ctcctcttct gcctgtgtat 420 ggagagcaaa tcctatgccg aaaggctacg agcaatacta tagcttcaca cagtttgcgc 480 tggaattaaa tgaaatggat ccatcatcaa agtctttatt gccacctact gacactcgat 540 ttaggccaga ccagaggttt ctagaggaag ggaacttaga agaagctgaa atacaaaagc 600 agaggattga acaactgcag agagaaaggc ggcgggtctt agaagaaaat catgtggagc 660 accagcctcg gtttttcagg aaatccgacg atgactcttg ggtgagcaac ggcacctatt 720 tggaacttag aaaagatctt ggtttttcca aactggacca tcctgtctta tggtgaaaaa 780 gtaaagaaga aagataacat tagtgtattt ctcctgtgct tgccttctga agtggcacaa 840 acctgtgttt atatatttaa aagatactct aggatgatca cttgtgctta gcttagcatt 900 gtaactcttt aagtctatat tttcctcagt gcgtttcttt acaatttcaa atgttaccct 960 gattgtttat atgaatgtag aacaccttga catttctttt tatatataaa ctatttaata 1020 aaaatgaaag attgaatgtt catgtgtggg ttaaaaaaag aagctttaac actaattttc 1080 caaaggttag ggaagattcc aattaaattt atgccttata aaattatgtt gtagaaaaaa 1140 tatcaacctc tcccaggtgc attaagaaat aagaattccc agggttactc acccatgcgt 1200 aagctaccaa gtttaatttg gtagctgaaa tatcttnttg cctcagacag ctcttgaatt 1260 gctcatacag aca 1273 <210> 18 <211> 1733 <212> DNA
<213> Homo sapiens <220>
<221> misc_feature <223> Incyte ID No: 5324145CB1 <400> 18 gccaacattc taggattctg ctggactagt tcaactgaaa ttgtcttcat aacagatcaa 60 ggaatcgaat tttaccaggt attaccagag aaacggagtc tgaaactctt gaagagccac 120 aatctcaatg tgaattggta catgtactgc cccgagagcg ccgtgatctt gctgtctacc 180 acggtcctgg agaatgtcct gcagcctttt cactttaggg ctggcactat gtcgaagctg 240 cccaaatttg agattgaatt accagctgcg cctaagtcaa ctaaacccag cctttccgaa 300 agagacatcg caatggctac catatacggg cagctgtatg ttctcttctt gaggcatcat 360 tctcggacct ccaacagcac aggagcggag gtggtcctct atcatctacc acgagaaggt 420 gcctgtaaaa agatgcacat attgaagtta aataggacgg gaaagtttgc cctgaacgtg 480 gtggacaacc tggtagtcgt gcatcatcag gatacagaga catcggtaat attcgatatc 540 aagttacggg gagagtttga cggctccgtt accttccacc accccgtgct tcctgctcga 600 tcgatccagc cctatcagat ccccatcaca ggtcctgctg ccgtgaccag ccagtctcct 660 gttccatgta aactctattc ttcatcttgg attgtctttc aacctgacat cattatcagc 720 gcaagccaag gttacctctg gaacctccaa gtgaaacttg agcccatagt aaatctctta 780 ccagacaaag gaagactcat ggactttctc ctccagagaa aggaatgcaa gatggtcatc 840 ctgtctgtct gttcacagat gttaagtgag tcagacagag catcgctgcc cgtgatagcc 900 actgtttttg ataaactcaa ccatgagtat aaaaagtacc tggatgccga gcagagttat 960 gcgatggcgg tggaagcagg gcagagccga agcagcccgc tcctcaagag gccggtgcgg 1020 acccaggcgg tgctggacca gtcagatgtg tacacccatg tcctgtcagc ctttgtggaa 1080 aagaaggaga tgcctcataa atttgtgata gccgtgctga tggaatacat tcgttctctt 1140 aaccagtttc agattgcagt acagcattac ctacatgaac ttgttatcaa aacccttgtc 1200 cagcacaacc tcttttatat gctgcatcag ttcctgcagt accacgtcct cagcgactcc 1260 aaacctttgg cttgtctgct gttatcccta gagagtttct atcctcctgc tcatcagcta 1320 tctctggaca tgctgaagcg actttcaaca gcaaatgatg aaatagtaga agttctcctt 1380 tccaaacacc aagtgttagc tgccttaagg tttatccggg gcattggtgg ccatgacaac 1440 atttctgcac gaaaattttt agatgctgca aagcagactg aagacaacat gcttttctat 1500 acaatattcc gcttttttga acagcgaaac cagcgtttgc gagggagccc caatttcaca 1560 ccaggggaac actgtgaaga acatgttgct tttttcaaac agatttttgg agaccaagct 1620 ctaatgaggc ctacaacatt ctgaaatcac ttgctgtttt tttatataaa aatgtgtaca 1680 aagttaattt attgcattaa taaagctctt taaactataa aaaaaaaaaa aaa 1733 <210> 19 <211> 1370 <212> DNA
<213> Homo Sapiens <220>
<221> misc_feature <223> Incyte ID No: 1004646CB1 <400> 19 gcacagcctc gcaatgagct gccacaactg ctccgacccc caggtccttt gcagctccgg 60 gcagctgttc ctgcagcccc tctgggacca cctgaggagc tgggaggccc tcctacagtc 120 gcccttcttc ccggtcatct tctccatcac cacatacgtg ggcttttgcc tgcccttcgt 180 ggtcctggat atcctgtgct cctgggtgcc cgccctgcgg cgctacaaga tccaccctga 240 cttctcgcca tccgcgcagc agctgctacc ttgcctgggg cagaccctct accagcatgt 300 gatgtttgtg ttccccgtga cgctgctgca ttgggcccgc agcccggccc tcctgcccca 360 cgaagctccc gagctgctcc tgctgctgca ccacatcctg ttctgcctgc tactcttcga 420 catggagttc ttcgtgtggc acctgctgca ccacaaggtg ccctggctgt accgcacctt 480 ccacaaggtg caccaccaga actcgtcctc gttcgcgctg gcaacgcagt atatgagcgt 540 ctgggaactg ttttctttgg gcttcttcga catgatgaac gtcacactgc tcgggtgcca 600 cccgctcacc accctgacct tccacgtggt caacatctgg ctttccgtgg aggaccactc 660 cggctacaac ttcccttggt ccactcacag actggtgccc ttcgggtggt acgggggtgt 720 ggtgcaccac gacctgcatc actctcactt taactgcaac ttcgctccgt actttacaca 780 ctgggacaaa atactgggaa cgctgcggac tgcatctgtc ccagcgcggt gatgtggctg 840 cggtgggtgc ccctaagact cgggactgct gtgcctttca cacttgaatg aagagaaaca 900 cctgagctat atattttttt aaagcaacta acttattgct ttatgtttat ctatgaaaac 960 catagataaa atctgatgca tttttgtaat ctgacaaagt aatttacata ctgtttgtgt 1020 atcaatacaa ttttgtgttc ttggtattct tagtctagct cacctcaata gccttgaatc 1080 ctgcatatga attagacatt catcactggc atatttagaa tatctctaaa aggacttgtt 1140 tgtagaataa ggaattttct atgtttcaaa gtgttctaaa acctggctaa aagaatgtat 1200 ttttgtggat ggtgttgact tctgactcta aaagcaatca aacatgtttc tgctggacag 1260 tgaccaagaa ttatagtacc ttcttatatt tttttataga actgtatatt tattttgaaa 1320 gaaatgttat tcgtgcttta aaaaggaaaa aaaaccatga atcaaataaa 1370 <210> 20 <211> 1264 <212> DNA
<213> Homo Sapiens <220>
<221> misc_feature <223> Incyte ID No: 1802851CB1 <400> 20 ctcgcgtgcg gtgcgcaggg ataagagagc ggtctggaca gcgcgtggcc ggcgccgctg 60 tggggacagc atgagcggcg gttggatggc gcaggttgga gcgtggcgaa caggggctct 120 gggcctggcg ctgctgctgc tgctcggcct cggactaggc ctggaggccg ccgcgagccc 180 gctttccacc ccgacctctg cccaggccgc aggccccagc tcaggctcgt gcccacccac 240 caagttccag tgccgcacca gtggcttatg cgtgcccctc acctggcgct gcgacaggga 300 cttggactgc agcgatggca gcgatgagga ggagtgcagg attgagccat gtacccagaa 360 agggcaatgc ccaccgcccc ctggcctccc ctgcccctgc accggcgtca gtgactgctc 420 tgggggaact gacaagaaac tgcgcaactg cagccgcctg gcctgcctag caggcgagct 480 ccgttgcacg ctgagcgatg actgcattcc actcacgtgg cgctgcgacg gccacccaga 540 ctgtcccgac tccagcgacg agctcggctg tggaaccaat gagatcctcc cggaagggga 600 tgccacaacc atggggcccc ctgtgaccct ggagagtgtc acctctctca ggaatgccac 660 aaccatgggg ccccctgtga ccctggagag tgtcccctct gtcgggaatg ccacatcctc 720 ctctgccgga gaccagtctg gaagcccaac tgcctatggg gttattgcag ctgctgcggt 780 gctcagtgca agcctggtca ccgccaccct cctccttttg tcctggctcc gagcccagga 840 gcgcctccgc ccactggggt tactggtggc catgaaggag tccctgctgc tgtcagaaca 900 gaagacctcg ctgccctgag gacaagcact tgccaccacc gtcactcagc cctgggcgta 960 gccggacagg aggagagcag tgatgcggat gggtacccgg gcacaccagc cctcagagac 1020 ctgagctctt ctggccacgt ggaacctcga acccgagctc ctgcagaagt ggccctggag 1080 attgagggtc cctggacact ccctatggag atccggggag ctaggatggg gaacctgcca 1140 cagccagaac tgaggggctg gccccaggca gctcccaggg ggtagaacgg ccctgtgctt 1200 aagacactcc tgctgccccg tctgagggtg gcgattaaag ttgcttcaca tcctcaaaaa 1260 aaaa 1264 <210> 21 <211> 2566 <212> DNA
<213> Homo Sapiens <220>
<221> misc_feature <223> Incyte ID No: 2764333CB1 <400> 21 cccacgcgtc cgacgagttc tatgatgcgc tgtcagattc cgagtccgaa aggtccctga 60 gtagattgga agcagtgaca gcacgctcct ttgaagagga aggagagcat ttgggcagta 120 gaaaacacag aatgtccgaa gaaaaagact gtggtggcgg agatgctctc tccaatggca 180 tcaagaaaca cagaacaagt ttgccttctc ctatgttttc cagaaatgac ttcagtatct 240 ggagcatcct cagaaaatgt attggaatgg aactatccaa gatcacgatg ccagttatat 300 ttaatgagcc tctgagcttc ctacagcgcc taactgaata catggagcat acttacctca 360 tccacaaggc cagttcactc tctgatcctg tggaaaggat gcagtgtgta gctgcgtttg 420 ctgtatctgc tgttgcttct cagtgggaac ggactggaaa acctttcaac ccactgctgg 480 gagagactta tgaattagtg cgagatgacc ttggatttag actcatctcc gaacaggtca 540 gccatcaccc accaatcagt gcatttcatg ctgaaggatt aaacaatgac ttcatctttc 600 atggctctat ctatcccaaa ctgaaattct gggggaagag tgtagaagca gaacccaaag 660 gaaccatcac cttggagctc cttgaacaca atgaggcata tacatggaca aatcccacct 720 gctgtgtgca taatatcatt gtgggtaaac tgtggatcga acagtatggc aatgtggaaa 780 ttataaacca caagactggg gacaaatgtg tgttgaattt taagccatgt ggcctttttg 840 gtaaggaatt acacaaagtt gaaggctaca ttcaagataa aagcaaaaag aagctctgtg 900 ccctctatgg gaagtggact gaatgtttat acagtgttga ccctgccacg tttgacgctt 960 acaaaaaaaa tgataagaaa aatacagaag agaagaagaa cagcaaacag atgagcacct 1020 ctgaggagtt ggatgaaatg ccagtgccgg attctgaaag tgtattcatt atccctggaa 1080 gcgttcttct atggcgaata gccccacggc ctccaaattc tgcccagatg tataatttta 1140 ctagttttgc aatggttttg aatgaagtag acaaagacat ggagagtgtg attcccaaga 1200 cagactgcag gttacggcct gacatcagag ccatggaaaa tggagagata gatcaagcta 1260 gtgaagaaaa aaaacgactt gaggaaaaac aaagagcagc ccgcaaaaac aggtccaagt 1320 cagaagagga ctggaagacg aggtggttcc atcaaggtcc taatccctac aatggagcac 1380 aggactggat ttactctggc agctactggg acagaaatta cttcaatttg cctgacattt 1440 attaaaatgc atacaagtca gggtgtttgg ctaatctaca aataagtctt aaacctatgt 1500 ttttaaattt ttttcccttg gtttctactt atcttttaaa aaaaaaaatg aaaaaacact 1560 catgagataa ctgcatttca cccaacaaaa gcagggtata aggcgatatt ggtgatgaaa 1620 gtcttaggaa aaatgcataa ttttgctata aaatgtactt atttggaata ctattttata 1680 tagaggtaag agaacactgc tggggaatat gctttttatg gttgctgttg ccatatttac 1740 tgaaggttta tacctaaatg taactttagc tttatggaac tatatagtaa tcccaaatca 1800 agttattttg aatattttta tgctgtcatg cttgaatgtt ttagatgtaa cctttgacat 1860 atttagaact ctcctcctat acaatgttta ttctcagata tagaggttat gtcattttat 1920 aaagacttca ttgataagat ggcttttatt catactaatc ctcccaatgt taccccttcc 1980 atcttccaag aagaaaaaaa atgcctgaat attcagaata gatatttctg atttgaaaat 2040 tctaaagaat taaactggaa aagtatttca tttacttagt gctctgaatt tacttttaca 2100 gttttctgca gtcagtatca ttaaaatggt taagtttaca tttgaactga aaatatgtat 2160 aaaatctagc aattcacaaa aatgccctag aaatatagat tttaatcacc attacataat 2220 gacaaacctt gttaaatgct tccacttcca gtggcaaatg ccactaggga aagtaagttg 2280 cactcatgta agtatcaaac tatataaaag gaggccttgt gcatttcaag tttgcaaagt 2340 acctgtgtac ttaaaatatg tgtggagacc tactgtacag tagttttgcc cctttaattg 2400 gggcacattc atcttaaatc ttatagtatt tatccaccca aaccccagac tgagatactg 2460 ctcccagggg cctaggtagc tgccagtccg tgattttaat tgctgtcttg aagttaacaa 2520 gtgttataat gaaataatct acctgatgct aaataaaggc tttaga 2566 <210> 22 <211> 1848 <212> DNA
<213> Homo Sapiens <220>
<221> misc_feature <223> Incyte ID No: 2798021CB1 <400> 22 aaccagctga aaagcatgcg ccaagctgca gcagatgcaa agcctgaaag tttaatgaag 60 aggctagagg aggagataaa atttaattta tatatggtaa ctgaaaaatt tcctaaagaa 120 ttagaaaata agaaaaagga attacatttt ttacaaaaag tagtttcaga gccagctatg 180 ggccattctg atcttcttga acttgaatct aaaataaatg aaataaacac agaaattaac 240 cagttgattg aaaagaaaat gatgagaaat gagcccattg aaggcaaact ctcactgtat 300 aggcaacagg catctatcat ttcccgtaaa aaagaagcca aagctgagga acttcaggag 360 gccaaggaga agttagccag cctagagaga gaagcatcag taaagagaaa tcagacccgt 420 gaatttgatg gtactgaagt tttaaaggga gatgagttca aacgatatgt caataaactt 480 cgaagcaaga gtacagtttt caaaaagaag catcagataa tagctgaact taaagctgaa 540 ttcggtcttt tgcagaggac tgaagaactt cttaagcaac gtcatgaaaa tattcaacaa 600 caactgcaaa ctatggagga gaaaaagggt atatctggat atagttacac ccaagaagag 660 ctagaaagag tatctgcact gaagagtgaa gttgatgaaa tgaaaggacg aacattggat 720 gatatgtctg aaatggtgaa aaaactgtat tcattggtat ctgaaaagaa gtcagctctt 780 gcctcagtta taaaagagct acgacagttg cgtcaaaaat atcaagaact gacccaggag 840 tgtgatgaaa agaaatccca gtatgatagc tgtgcagcag gcctcgaaag caatcggtcc 900 aaattagaac aggaagttag aagactccgt gaagaatgtc ttcaagaaga aagtagatac 960 cattatacaa attgtatgat taagaaccta gaagttcaac ttcgtcgtgc tactgatgag 1020 atgaaggcat atatctcttc tgatcaacaa gaaaaaagaa aggcaattag ggaacagtat 1080 accaaaaata ctgctgaaca agaaaacctt ggaaagaaac ttcgggaaaa acaaaaagtt 1140 atacgagaaa gtcatggtcc aaatatgaaa caagcaaaaa tgtggcgtga tttggaacaa 1200 ttaatggaat gtaagaaaca gtgctttctg aaacaacaaa gccaaacttc cattggtcag 1260 gtaattcagg agggtgggga ggaccggcta atactgtgaa ttcttgtgtc atcgtttggg 1320 gttttacttg ataccactag ctataagcct aatctcataa tgtatttctt ttttgaaact 1380 gatttgtata gcattttgtt ttcagaagag ccattcttta ttaagttttc atagaaaata 1440 atgttaaggt agatttagtt tgaatgtttt ttcatatgaa aaagaggctt ttattctttt 1500 ccatagttta gacatcactg gcgtcttctg agttttatga gacaggaaac taagtttact 1560 atctgtaaat gtaaacatat gtccattaag aaacatgtag ttttttttta gaatgtaata 1620 acccagtggc ttactgtttt tcttaatctc ttttaaaaaa actttagaag aatcttttag 1680 gaactaatat ctcttgttct gaagaaacat ttatctgacg ttcagcagtt cctacagttt 1740 tacttcagtt tatttttctt ctgtaaaatg caagaaaatt taatattttg actaacatgt 1800 cttttctgtt tgtatcattt aaaggcatat aaacttgtcg agtattaa 1848 <210> 23 <211> 2518 <212> DNA
<213> Homo sapiens <220>
<221> misc_feature <223> Incyte ID No: 3335404CB1 <400> 23 cgcagagccg gcgagggctt gcggtcctgg acggtagggg tctgcgtctg gtgccacctt 60 ctccatcagg ctgcttgctg ggtccaccaa gcctgacccc tccagccagg aggggatcac 120 ggagtgtgtc ccccgcccgg gctgctgcct gccggaaggg ggctgggaaa ccgggatccc 180 cgcgcgccac ttgcccggca gcgtcagttt cacccaggac tggctagcgg ttccctcgtg 240 gctctgcgcg gaggtccgcc tcctcccttc cctcccccat cccgcgcgct gcctccgcct 300 ccttcttctg cctccaccct caacccccat ccccgcctga cgggagctag ccctcagtcc 360 gcccgagctg tggttgtggg cgccggacaa gtccaaggcg cctcctccca atatggacag 420 ccgctacaac agcactgcgg gcatcgggga cttgaaccag ctgagcgctg ccatcccggc 480 cacgcgggtg gaggtgtccg tgtcctgcag aaatcttctt gacagagaca cattttctaa 540 atctgatcca atttgtgtct tatatgtaca aggagttgga aataaagaat ggagagagtt 600 tggaagaact gaagtaattg ataatacttt aaatcctgat tttgtaagaa agtttattct 660 ggactacttt tttgaagaaa gagagaatct tcgttttgac ttgtatgatg ttgattcaaa 720 gagccccaac ttatccaaac atgactttct gggacaagtg ttttgtacat tgggagagat 780 cgttggttca cagggaagtc gcctggaaaa accaatagta ggaattccag ggaagaaatg 840 tggtacaatc atacttacag cagaggaatt aaactgttgc agggatgccg ttttgatgca 900 attttgtgcg aacaaattgg acaagaagga cttctttgga aaatcagatc ctttccttgt 960 attttatcga agtaatgaag atggcagttt tacaatttgt cacaagacag aagttgtcaa 1020 aaacactcta aatccagtat ggcaagcatt caagatctca gtcagagcat tatgtaatgg 1080 agactatgac agaacaatca aagtagaggt gtatgactgg gaccgagatg gaagtcatga 1140 tttcattgga gaatttacaa caagctatag ggaactttct agagggcagt cacaattcaa 1200 cgtatatgag gtggtgaatc ccaaaaagaa aggaaaaaag aaaaaatata ctaattctgg 1260 aacagtaact ttactctctt tcttggtaga aacagaagtt tcattccttg actacattaa 1320 gggagggacg caaatcaatt tcacagtggc tattgatttt acagcatcaa acgggaaccc 1380 tgctcagccc acttccctcc actacatgaa tccttaccaa ctgaatgcct atggtatggc 1440 actaaaagca gtgggagaaa ttgttcaaga ttatgacagt gataaaatgt ttccagctct 1500 aggatttggt gcaaaactgc ctccagatgg aaggatatct cacgaatttg ctttgaatgg 1560 gaatcctcaa aacccctact gtgatggcat tgagggggtc atggaggctt attacaggag 1620 tctgaaatct gtacaactat atgggcccac caactttgct cctgtaatta atcatgtagc 1680 aagatatgct tcttctgtaa aggatggctc ccagtatttt gtgcttctga ttgttacaga 1740 tggtgttatc tcagatatgg cccagactaa ggagtccata gttaatgcct caaaacttcc 1800 aatgtcaata attatagtag gtgttggacc agcagaattt gatgcaatgg tcgaattgga 1860 tggagatgat gtaagagtct cctctagagg aaaatatgct gaaagagaca ttgtgcagtt 1920 tgtgccattc agggattata ttgacagaag tggaaaccac atactgagca tggctagatt 1980 ggctaaagat gtcctagctg agatccctga gcagtttctc tcctatatga gagcccgagg 2040 aatcaagcca tcacctgcgc ctcccccata caccccacct acacatgtgt tacagactca 2100 aatatgactg tgctctgaaa tgctaatgtc aactacaaat caaaagtgct gagttaatgc 2160 tttgtgcctg gtgctctgta atgaaccagg caatgagata gttttctcag tttggtttca 2220 gcagttaatg tgctttcttg gatccaaatt taaatatctt cctaaaccaa aactgtaaat 2280 atggttgttg catgagcaac agaaaaaatt gtttaaatgc ttgaagcaaa gtatggatgt 2340 cttctctaaa tctttctttc tttttttttt tttttttttt ttaacagaaa cagctaattt 2400 ccaatgtatt gttgggaaaa agcacaaact gtgtttttaa ctcaaatatt gtcttcgact 2460 gctatgtgta taggaaagca gtctctgtat caatgtttac atgttactac tttttaaa 2518 <210> 24 <211> 1120 <212> DNA
<213> Homo Sapiens <220>
<221> misc_feature <223> Incyte ID No: 3735780CB1 <400> 24 gaaatccagt tatcaaaatt gactcaagaa gagagaacct aacagaacaa taacaatgga 60 agaaattggg aacattatca caaagctatc atcctgccaa actccaggct cagatgtcac 120 aggttaaaaa aagtccttca tgaaaaagaa agatcttaag cagcatgatg gattcagaag 180 ctcatgaaaa gaggccacca atactaacat cttcaaaaca agatatatca cctcatatta 240 caaatgttgg tgagatgaag cattacttgt gtggctgctg tgcagccttc aacaatgtcg 300 caatcacatt tcccattcag aaggtcctct ttcgacaaca gctgtatggc atcaaaaccc 360 gggatgcaat acttcagttg agaagggatg gatttcgaaa tttgtatcgt ggaatccttc 420 ccccattgat gcagaagaca actacgcttg cacttatgtt tggtctgtat gaggatttat 480 cctgccttct ccacaagcat gtcagtgctc cagagtttgc aaccagtggc gtggcggcag 540 tgcttgcagg gacaacagaa gcaattttca ctccactgga aagagttcag acattgcttc 600 aagaccacaa acatcatgac aaatttacca acacttacca ggctttcaag gcactgaaat 660 gtcatggaat tggagagtat tatcgaggct tggtgcccat tcttttccgg aatggactca 720 gcaatgtctt gtttttcggc cttcgaggtc ccattaagga gcatctgcct accgcaacga 780 ctcacagtgc tcatctggtc aatgatttta tctgtggagg tctattgggt gccatgttgg 840 gattcttgtt ttttccaatt aatgttgtaa aaactcgcat acagtctcag attggtgggg 900 aatttcagtc tttccccaag gttttccaaa aaatctggct ggaacgggac agaaaactga 960 taaatctttt cagaggtgcc catctgaatt accatcggtc cctcatctct tggggcataa 1020 tcaatgcaac ttatgagttc ttgttaaagg ttatatgaaa aaaccatcag ttaagtgcca 1080 tttatcaact gaatagacct tctaagaaga aaaaaaaaaa 1120
Claims (23)
1. An isolated polypeptide comprising an amino acid sequence selected from the group consisting of:
a) an amino acid sequence selected from the group consisting of SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:11, and SEQ ID
NO:12, b) a naturally occurring amino acid sequence having at least 90% sequence identity to an amino acid sequence selected from the group consisting of SEQ ID NO:2, SEQ ID
NO:3, SEQ ID
NO:4, SEQ ID NO:6, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:11, and SEQ ID NO:12, c) a biologically active fragment of an amino acid sequence selected from the group consisting of SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:9, SEQ ID
NO:10, SEQ ID
NO:11, and SEQ ID NO:12, and d) an immunogenic fragment of an amino acid sequence selected from the group consisting of SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:9, SEQ ID NO:10, SEQ ID
NO:11, and SEQ ID NO:12.
a) an amino acid sequence selected from the group consisting of SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:11, and SEQ ID
NO:12, b) a naturally occurring amino acid sequence having at least 90% sequence identity to an amino acid sequence selected from the group consisting of SEQ ID NO:2, SEQ ID
NO:3, SEQ ID
NO:4, SEQ ID NO:6, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:11, and SEQ ID NO:12, c) a biologically active fragment of an amino acid sequence selected from the group consisting of SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:9, SEQ ID
NO:10, SEQ ID
NO:11, and SEQ ID NO:12, and d) an immunogenic fragment of an amino acid sequence selected from the group consisting of SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:9, SEQ ID NO:10, SEQ ID
NO:11, and SEQ ID NO:12.
2. An isolated polypeptide of claim 1 selected from the group consisting of SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:9, SEQ ID NO:10, SEQ ID
NO:11, and SEQ
ID NO:12.
NO:11, and SEQ
ID NO:12.
3. An isolated polynucleotide encoding a polypeptide of claim 1.
4. An isolated polynucleotide of claim 3 selected from the group consisting of SEQ ID NO:13, SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:18, SEQ ID NO:21, SEQ ID
NO:22, SEQ ID NO:23, and SEQ ID NO:24.
NO:22, SEQ ID NO:23, and SEQ ID NO:24.
5. A recombinant polynucleotide comprising a promoter sequence operably linked to a polynucleotide of claim 3.
6. A cell transformed with a recombinant polynucleotide of claim 5
7. A transgenic organism comprising a recombinant polynucleotide of claim 5.
8. A method for producing a polypeptide of claim 1, the method comprising:
a) culturing a cell under conditions suitable for expression of the polypeptide, wherein said cell is transformed with a recombinant polynucleotide, and said recombinant polynucleotide comprises a promoter sequence operably linked to a polynucleotide encoding the polypeptide of claim 1, and b) recovering the polypeptide so expressed.
a) culturing a cell under conditions suitable for expression of the polypeptide, wherein said cell is transformed with a recombinant polynucleotide, and said recombinant polynucleotide comprises a promoter sequence operably linked to a polynucleotide encoding the polypeptide of claim 1, and b) recovering the polypeptide so expressed.
9. An isolated antibody which specifically binds to a polypeptide of claim 1.
10. An isolated polynucleotide comprising a polynucleotide sequence selected from the group consisting of:
a) a polynucleotide sequence selected from the group consisting of SEQ ID
NO:13, SEQ ID
NO:14, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:18, SEQ ID NO:21, SEQ ID NO:22, SEQ ID
NO:23, and SEQ ID NO:24, b) a naturally occurring polynucleotide sequence having at least 90% sequence identity to a polynucleotide sequence selected from the group consisting of SEQ ID NO:13, SEQ ID NO:14, SEQ
ID NO:15, SEQ ID NO:16, SEQ ID NO:18, SEQ ID NO:21, SEQ ID NO:22, SEQ ID
NO:23, and SEQ
ID NO:24, c) a polynucleotide sequence complementary to a), d) a polynucleotide sequence complementary to b), and e) an RNA equivalent of a)-d).
a) a polynucleotide sequence selected from the group consisting of SEQ ID
NO:13, SEQ ID
NO:14, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:18, SEQ ID NO:21, SEQ ID NO:22, SEQ ID
NO:23, and SEQ ID NO:24, b) a naturally occurring polynucleotide sequence having at least 90% sequence identity to a polynucleotide sequence selected from the group consisting of SEQ ID NO:13, SEQ ID NO:14, SEQ
ID NO:15, SEQ ID NO:16, SEQ ID NO:18, SEQ ID NO:21, SEQ ID NO:22, SEQ ID
NO:23, and SEQ
ID NO:24, c) a polynucleotide sequence complementary to a), d) a polynucleotide sequence complementary to b), and e) an RNA equivalent of a)-d).
11. An isolated polynucleotide comprising at least 60 contiguous nucleotides of a polynucleotide of claim 10.
12. A method for detecting a target polynucleotide in a sample, said target polynucleotide having a sequence of a polynucleotide of claim 10, the method comprising:
a) hybridizing the sample with a probe comprising at least 16 contiguous nucleotides comprising a sequence complementary to said target polynucleotide in the sample, and which probe specifically hybridizes to said target polynucleotide, under conditions whereby a hybridization complex is formed between said probe and said target polynucleotide, and b) detecting the presence or absence of said hybridization complex, and, optionally, if present, the amount thereof.
a) hybridizing the sample with a probe comprising at least 16 contiguous nucleotides comprising a sequence complementary to said target polynucleotide in the sample, and which probe specifically hybridizes to said target polynucleotide, under conditions whereby a hybridization complex is formed between said probe and said target polynucleotide, and b) detecting the presence or absence of said hybridization complex, and, optionally, if present, the amount thereof.
13. A method of claim 12, wherein the probe comprises at least 30 contiguous nucleotides.
14. A method of claim 12, wherein the probe comprises at least 60 contiguous nucleotides.
15. A pharmaceutical composition comprising an effective amount of a polypeptide of claim 1 and a pharmaceutically acceptable excipient.
16. A method for treating a disease or condition associated with decreased expression of functional LIPAP, comprising administering to a patient in need of such treatment the pharmaceutical composition of claim 15.
17. A method for screening a compound for effectiveness as an agonist of a polypeptide of claim 1, the method comprising:
a) exposing a sample comprising a polypeptide of claim 1 to a compound, and b) detecting agonist activity in the sample.
a) exposing a sample comprising a polypeptide of claim 1 to a compound, and b) detecting agonist activity in the sample.
18. A pharmaceutical composition comprising an agonist compound identified by a method of claim 17 and a pharmaceutically acceptable excipient.
19. A method for treating a disease or condition associated with decreased expression of functional LIPAP, comprising administering to a patient in need of such treatment a pharmaceutical composition of claim 18.
20. A method for screening a compound for effectiveness as an antagonist of a polypeptide of claim 1, the method comprising:
a) exposing a sample comprising a polypeptide of claim 1 to a compound, and b) detecting antagonist activity in the sample.
a) exposing a sample comprising a polypeptide of claim 1 to a compound, and b) detecting antagonist activity in the sample.
21. A pharmaceutical composition comprising an antagonist compound identified by a method of claim 20 and a pharmaceutically acceptable excipient.
22. A method for treating a disease or condition associated with overexpression of functional LIPAP, comprising administering to a patient in need of such treatment a pharmaceutical composition of claim 21.
23. A method for screening a compound for effectiveness in altering expression of a target polynucleotide, wherein said target polynucleotide comprises a sequence of claim 4, the method comprising:
a) exposing a sample comprising the target polynucleotide to a compound, and b) detecting altered expression of the target polynucleotide.
a) exposing a sample comprising the target polynucleotide to a compound, and b) detecting altered expression of the target polynucleotide.
Applications Claiming Priority (5)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US12070399P | 1999-02-19 | 1999-02-19 | |
US60/120,703 | 1999-02-19 | ||
US14276299P | 1999-07-08 | 1999-07-08 | |
US60/142,762 | 1999-07-08 | ||
PCT/US2000/004160 WO2000049043A2 (en) | 1999-02-19 | 2000-02-18 | Human lipid-associated proteins |
Publications (1)
Publication Number | Publication Date |
---|---|
CA2362300A1 true CA2362300A1 (en) | 2000-08-24 |
Family
ID=26818670
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CA002362300A Abandoned CA2362300A1 (en) | 1999-02-19 | 2000-02-18 | Human lipid-associated proteins |
Country Status (5)
Country | Link |
---|---|
EP (1) | EP1153035A2 (en) |
JP (1) | JP2002541772A (en) |
AU (1) | AU3001200A (en) |
CA (1) | CA2362300A1 (en) |
WO (1) | WO2000049043A2 (en) |
Families Citing this family (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7972802B2 (en) | 2005-10-31 | 2011-07-05 | University Of Washington | Lipoprotein-associated markers for cardiovascular disease |
US8241861B1 (en) | 2008-07-08 | 2012-08-14 | Insilicos, Llc | Methods and compositions for diagnosis or prognosis of cardiovascular disease |
Family Cites Families (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5472858A (en) * | 1991-06-04 | 1995-12-05 | Wisconsin Alumni Research Foundation | Production of recombinant proteins in insect larvae |
CA2200794A1 (en) * | 1997-03-24 | 1998-09-24 | The Governing Council Of The University Of Toronto | Gene and protein related to alzheimer's disease |
JP2000050878A (en) * | 1998-08-12 | 2000-02-22 | Kazusa Dna Kenkyusho | New gene and protein encoded by the same |
-
2000
- 2000-02-18 CA CA002362300A patent/CA2362300A1/en not_active Abandoned
- 2000-02-18 WO PCT/US2000/004160 patent/WO2000049043A2/en not_active Application Discontinuation
- 2000-02-18 JP JP2000599780A patent/JP2002541772A/en active Pending
- 2000-02-18 EP EP00908718A patent/EP1153035A2/en not_active Withdrawn
- 2000-02-18 AU AU30012/00A patent/AU3001200A/en not_active Abandoned
Also Published As
Publication number | Publication date |
---|---|
JP2002541772A (en) | 2002-12-10 |
WO2000049043A3 (en) | 2001-02-01 |
WO2000049043A2 (en) | 2000-08-24 |
AU3001200A (en) | 2000-09-04 |
EP1153035A2 (en) | 2001-11-14 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CA2408315A1 (en) | Aminoacyl trna synthetases | |
EP1358329A2 (en) | Lipid-associated molecules | |
CA2374745A1 (en) | Proteases and protease inhibitors | |
CA2383907A1 (en) | Human hydrolytic enzymes | |
CA2364739A1 (en) | Regulators of intracellular phosphorylation | |
US20030143588A1 (en) | Phosphodiesterases | |
EP1131443A2 (en) | Human hydrolase proteins | |
EP1257635A2 (en) | Lipid metabolism enzymes and polynucleotides encoding them | |
CA2370195A1 (en) | Human membrane-associated proteins | |
CA2410953A1 (en) | Adenylyl and guanylyl cyclases | |
CA2362300A1 (en) | Human lipid-associated proteins | |
CA2398003A1 (en) | Phosphodiesterases | |
CA2414219A1 (en) | Lipid metabolism molecules | |
CA2423953A1 (en) | Hydrolases | |
CA2365421A1 (en) | Vesicle associated proteins | |
CA2401660A1 (en) | Lipid metabolism enzymes | |
US20040171009A1 (en) | Lipid-associated molecules | |
CA2369342A1 (en) | Carbohydrate-modifying enzymes | |
US7022507B1 (en) | Isolated polynucleotide encoding a human PSST subunit of the NADH:ubiquinone oxidoreductase complex | |
CA2417787A1 (en) | Lipid metabolism enzymes | |
WO2003083081A2 (en) | Lipid-associated molecules | |
CA2425767A1 (en) | Lipocalins | |
US20040054138A1 (en) | Phosphodiesterases | |
CA2349586A1 (en) | Coenzyme a-utilizing enzymes | |
WO2002094988A2 (en) | Lipid-associated molecules |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
FZDE | Dead |