CN116782878A - Improved compositions for delivery of codon optimized mRNA - Google Patents
Improved compositions for delivery of codon optimized mRNA Download PDFInfo
- Publication number
- CN116782878A CN116782878A CN202180089274.0A CN202180089274A CN116782878A CN 116782878 A CN116782878 A CN 116782878A CN 202180089274 A CN202180089274 A CN 202180089274A CN 116782878 A CN116782878 A CN 116782878A
- Authority
- CN
- China
- Prior art keywords
- mrna
- lipid
- protein
- codon
- cationic
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 108020004999 messenger RNA Proteins 0.000 title claims abstract description 365
- 108020004705 Codon Proteins 0.000 title claims abstract description 161
- 238000012384 transportation and delivery Methods 0.000 title claims abstract description 77
- 239000000203 mixture Substances 0.000 title claims description 183
- 230000001976 improved effect Effects 0.000 title abstract description 7
- 150000002632 lipids Chemical class 0.000 claims abstract description 240
- -1 cationic lipid Chemical class 0.000 claims abstract description 188
- 239000002105 nanoparticle Substances 0.000 claims abstract description 108
- 239000008194 pharmaceutical composition Substances 0.000 claims abstract description 38
- 230000002685 pulmonary effect Effects 0.000 claims abstract description 34
- 238000000034 method Methods 0.000 claims description 181
- 108090000623 proteins and genes Proteins 0.000 claims description 162
- 102000004169 proteins and genes Human genes 0.000 claims description 157
- 125000002091 cationic group Chemical group 0.000 claims description 130
- 108010079245 Cystic Fibrosis Transmembrane Conductance Regulator Proteins 0.000 claims description 105
- 239000002502 liposome Substances 0.000 claims description 90
- HVYWMOMLDIMFJA-DPAQBDIFSA-N cholesterol Chemical compound C1C=C2C[C@@H](O)CC[C@]2(C)[C@@H]2[C@@H]1[C@@H]1CC[C@H]([C@H](C)CCCC(C)C)[C@@]1(C)CC2 HVYWMOMLDIMFJA-DPAQBDIFSA-N 0.000 claims description 66
- 210000004072 lung Anatomy 0.000 claims description 34
- 235000012000 cholesterol Nutrition 0.000 claims description 33
- 239000004094 surface-active agent Substances 0.000 claims description 17
- 210000002919 epithelial cell Anatomy 0.000 claims description 11
- 238000002663 nebulization Methods 0.000 claims description 10
- 241000124008 Mammalia Species 0.000 claims description 8
- 108010063296 Kinesin Proteins 0.000 claims description 7
- 102000010638 Kinesin Human genes 0.000 claims description 7
- MWRBNPKJOOWZPW-GPADLTIESA-N 1,2-di-[(9E)-octadecenoyl]-sn-glycero-3-phosphoethanolamine Chemical compound CCCCCCCC\C=C\CCCCCCCC(=O)OC[C@H](COP(O)(=O)OCCN)OC(=O)CCCCCCC\C=C\CCCCCCCC MWRBNPKJOOWZPW-GPADLTIESA-N 0.000 claims description 5
- 101000801640 Homo sapiens Phospholipid-transporting ATPase ABCA3 Proteins 0.000 claims description 4
- 102100033623 Phospholipid-transporting ATPase ABCA3 Human genes 0.000 claims description 4
- 102000015395 alpha 1-Antitrypsin Human genes 0.000 claims description 4
- 108010050122 alpha 1-Antitrypsin Proteins 0.000 claims description 4
- 230000001939 inductive effect Effects 0.000 claims description 3
- 102100023419 Cystic fibrosis transmembrane conductance regulator Human genes 0.000 claims 1
- 108090000765 processed proteins & peptides Proteins 0.000 abstract description 24
- 229920001184 polypeptide Polymers 0.000 abstract description 15
- 102000004196 processed proteins & peptides Human genes 0.000 abstract description 15
- 102000012605 Cystic Fibrosis Transmembrane Conductance Regulator Human genes 0.000 description 104
- 125000003729 nucleotide group Chemical group 0.000 description 85
- 150000001875 compounds Chemical class 0.000 description 82
- 235000002639 sodium chloride Nutrition 0.000 description 82
- 239000002773 nucleotide Substances 0.000 description 81
- OIRDTQYFTABQOQ-KQYNXXCUSA-N adenosine Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O OIRDTQYFTABQOQ-KQYNXXCUSA-N 0.000 description 80
- 150000003839 salts Chemical class 0.000 description 77
- 101000907783 Homo sapiens Cystic fibrosis transmembrane conductance regulator Proteins 0.000 description 61
- YMWUJEATGCHHMB-UHFFFAOYSA-N Dichloromethane Chemical compound ClCCl YMWUJEATGCHHMB-UHFFFAOYSA-N 0.000 description 60
- 108700010070 Codon Usage Proteins 0.000 description 44
- 108020004414 DNA Proteins 0.000 description 43
- 210000004027 cell Anatomy 0.000 description 42
- OPTASPLRGRRNAP-UHFFFAOYSA-N cytosine Chemical compound NC=1C=CNC(=O)N=1 OPTASPLRGRRNAP-UHFFFAOYSA-N 0.000 description 42
- 239000002126 C01EB10 - Adenosine Substances 0.000 description 39
- 108091028043 Nucleic acid sequence Proteins 0.000 description 39
- 229960005305 adenosine Drugs 0.000 description 39
- 150000007523 nucleic acids Chemical class 0.000 description 36
- NOIRDLRUNWIUMX-UHFFFAOYSA-N 2-amino-3,7-dihydropurin-6-one;6-amino-1h-pyrimidin-2-one Chemical compound NC=1C=CNC(=O)N=1.O=C1NC(N)=NC2=C1NC=N2 NOIRDLRUNWIUMX-UHFFFAOYSA-N 0.000 description 33
- 150000001413 amino acids Chemical class 0.000 description 33
- 108091033319 polynucleotide Proteins 0.000 description 32
- 102000040430 polynucleotide Human genes 0.000 description 32
- 239000002157 polynucleotide Substances 0.000 description 32
- 239000003981 vehicle Substances 0.000 description 32
- 230000000694 effects Effects 0.000 description 31
- 102000039446 nucleic acids Human genes 0.000 description 31
- 108020004707 nucleic acids Proteins 0.000 description 31
- 241000282414 Homo sapiens Species 0.000 description 29
- 239000000243 solution Substances 0.000 description 28
- 108010065868 RNA polymerase SP6 Proteins 0.000 description 25
- 125000003275 alpha amino acid group Chemical group 0.000 description 25
- 230000015572 biosynthetic process Effects 0.000 description 25
- 239000011541 reaction mixture Substances 0.000 description 25
- 238000002560 therapeutic procedure Methods 0.000 description 24
- 210000001519 tissue Anatomy 0.000 description 24
- 238000001727 in vivo Methods 0.000 description 22
- 238000003786 synthesis reaction Methods 0.000 description 22
- OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical compound OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 description 21
- 230000004048 modification Effects 0.000 description 20
- 238000012986 modification Methods 0.000 description 20
- 230000001965 increasing effect Effects 0.000 description 18
- 229910052757 nitrogen Inorganic materials 0.000 description 18
- 238000006243 chemical reaction Methods 0.000 description 17
- 229920001223 polyethylene glycol Polymers 0.000 description 17
- 238000011282 treatment Methods 0.000 description 17
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 16
- 238000000338 in vitro Methods 0.000 description 16
- 238000005457 optimization Methods 0.000 description 16
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 15
- XEKOWRVHYACXOJ-UHFFFAOYSA-N Ethyl acetate Chemical compound CCOC(C)=O XEKOWRVHYACXOJ-UHFFFAOYSA-N 0.000 description 15
- ZMXDDKWLCZADIW-UHFFFAOYSA-N N,N-Dimethylformamide Chemical compound CN(C)C=O ZMXDDKWLCZADIW-UHFFFAOYSA-N 0.000 description 15
- 241000699670 Mus sp. Species 0.000 description 14
- 230000006978 adaptation Effects 0.000 description 14
- 125000000217 alkyl group Chemical group 0.000 description 14
- 239000000126 substance Substances 0.000 description 14
- 239000002202 Polyethylene glycol Substances 0.000 description 13
- MWRBNPKJOOWZPW-CLFAGFIQSA-N dioleoyl phosphatidylethanolamine Chemical compound CCCCCCCC\C=C/CCCCCCCC(=O)OCC(COP(O)(=O)OCCN)OC(=O)CCCCCCC\C=C/CCCCCCCC MWRBNPKJOOWZPW-CLFAGFIQSA-N 0.000 description 13
- 201000010099 disease Diseases 0.000 description 13
- 238000001890 transfection Methods 0.000 description 13
- 230000014616 translation Effects 0.000 description 13
- 201000003883 Cystic fibrosis Diseases 0.000 description 12
- 101000991410 Homo sapiens Nucleolar and spindle-associated protein 1 Proteins 0.000 description 12
- 102100030991 Nucleolar and spindle-associated protein 1 Human genes 0.000 description 12
- VYPSYNLAJGMNEJ-UHFFFAOYSA-N Silicium dioxide Chemical compound O=[Si]=O VYPSYNLAJGMNEJ-UHFFFAOYSA-N 0.000 description 12
- 229930182558 Sterol Natural products 0.000 description 12
- 150000003432 sterols Chemical class 0.000 description 12
- 235000003702 sterols Nutrition 0.000 description 12
- 238000004458 analytical method Methods 0.000 description 11
- 238000003556 assay Methods 0.000 description 11
- 238000000746 purification Methods 0.000 description 11
- 108091026890 Coding region Proteins 0.000 description 10
- 108091026898 Leader sequence (mRNA) Proteins 0.000 description 10
- 238000009472 formulation Methods 0.000 description 10
- 239000012634 fragment Substances 0.000 description 10
- 102000056427 human CFTR Human genes 0.000 description 10
- CTSLXHKWHWQRSH-UHFFFAOYSA-N oxalyl chloride Chemical compound ClC(=O)C(Cl)=O CTSLXHKWHWQRSH-UHFFFAOYSA-N 0.000 description 10
- 229920000642 polymer Polymers 0.000 description 10
- 238000013519 translation Methods 0.000 description 10
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 9
- 241000699666 Mus <mouse, genus> Species 0.000 description 9
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 9
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 9
- 239000007853 buffer solution Substances 0.000 description 9
- 230000005714 functional activity Effects 0.000 description 9
- 229910052739 hydrogen Inorganic materials 0.000 description 9
- 210000005265 lung cell Anatomy 0.000 description 9
- VLKZOEOYAKHREP-UHFFFAOYSA-N n-Hexane Chemical compound CCCCCC VLKZOEOYAKHREP-UHFFFAOYSA-N 0.000 description 9
- 239000013612 plasmid Substances 0.000 description 9
- 239000002904 solvent Substances 0.000 description 9
- GHTGYZMBQPXTCQ-UHFFFAOYSA-N CC1(C)Cc2c(sc(NC(=O)c3ccn[nH]3)c2C(N)=O)C(C)(C)O1 Chemical compound CC1(C)Cc2c(sc(NC(=O)c3ccn[nH]3)c2C(N)=O)C(C)(C)O1 GHTGYZMBQPXTCQ-UHFFFAOYSA-N 0.000 description 8
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 8
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 8
- 108091028664 Ribonucleotide Proteins 0.000 description 8
- 108091036066 Three prime untranslated region Proteins 0.000 description 8
- HEDRZPFGACZZDS-MICDWDOJSA-N Trichloro(2H)methane Chemical compound [2H]C(Cl)(Cl)Cl HEDRZPFGACZZDS-MICDWDOJSA-N 0.000 description 8
- 238000007792 addition Methods 0.000 description 8
- 239000000872 buffer Substances 0.000 description 8
- RAXXELZNTBOGNW-UHFFFAOYSA-N imidazole Natural products C1=CNC=N1 RAXXELZNTBOGNW-UHFFFAOYSA-N 0.000 description 8
- 239000002777 nucleoside Substances 0.000 description 8
- 239000002336 ribonucleotide Substances 0.000 description 8
- 125000002652 ribonucleotide group Chemical group 0.000 description 8
- 239000011550 stock solution Substances 0.000 description 8
- ZAYHVCMSTBRABG-JXOAFFINSA-N 5-methylcytidine Chemical compound O=C1N=C(N)C(C)=CN1[C@H]1[C@H](O)[C@H](O)[C@@H](CO)O1 ZAYHVCMSTBRABG-JXOAFFINSA-N 0.000 description 7
- 102000004190 Enzymes Human genes 0.000 description 7
- 108090000790 Enzymes Proteins 0.000 description 7
- 229920002873 Polyethylenimine Polymers 0.000 description 7
- 125000003342 alkenyl group Chemical group 0.000 description 7
- 238000011156 evaluation Methods 0.000 description 7
- 238000001914 filtration Methods 0.000 description 7
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 7
- 230000035772 mutation Effects 0.000 description 7
- 230000005855 radiation Effects 0.000 description 7
- 230000002829 reductive effect Effects 0.000 description 7
- 238000012360 testing method Methods 0.000 description 7
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 7
- LMDZBCPBFSXMTL-UHFFFAOYSA-N 1-Ethyl-3-(3-dimethylaminopropyl)carbodiimide Substances CCN=C=NCCCN(C)C LMDZBCPBFSXMTL-UHFFFAOYSA-N 0.000 description 6
- ZDTFMPXQUSBYRL-UUOKFMHZSA-N 2-Aminoadenosine Chemical compound C12=NC(N)=NC(N)=C2N=CN1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O ZDTFMPXQUSBYRL-UUOKFMHZSA-N 0.000 description 6
- VHYFNPMBLIVWCW-UHFFFAOYSA-N 4-Dimethylaminopyridine Chemical compound CN(C)C1=CC=NC=C1 VHYFNPMBLIVWCW-UHFFFAOYSA-N 0.000 description 6
- 102000053602 DNA Human genes 0.000 description 6
- IGEOJNMYRZUKIK-IBGZPJMESA-N N-(benzenesulfonyl)-6-[3-[2-[1-(trifluoromethyl)cyclopropyl]ethoxy]pyrazol-1-yl]-2-[(4S)-2,2,4-trimethylpyrrolidin-1-yl]pyridine-3-carboxamide Chemical compound C[C@@H]1CN(C2=NC(=CC=C2C(=O)NS(=O)(=O)C2=CC=CC=C2)N2C=CC(OCCC3(CC3)C(F)(F)F)=N2)C(C)(C)C1 IGEOJNMYRZUKIK-IBGZPJMESA-N 0.000 description 6
- JUJWROOIHBZHMG-UHFFFAOYSA-N Pyridine Chemical compound C1=CC=NC=C1 JUJWROOIHBZHMG-UHFFFAOYSA-N 0.000 description 6
- 108091023045 Untranslated Region Proteins 0.000 description 6
- 239000012230 colorless oil Substances 0.000 description 6
- 238000005538 encapsulation Methods 0.000 description 6
- 238000002474 experimental method Methods 0.000 description 6
- LEQAOMBKQFMDFZ-UHFFFAOYSA-N glyoxal Chemical compound O=CC=O LEQAOMBKQFMDFZ-UHFFFAOYSA-N 0.000 description 6
- 239000001257 hydrogen Substances 0.000 description 6
- 150000002431 hydrogen Chemical class 0.000 description 6
- 150000003833 nucleoside derivatives Chemical class 0.000 description 6
- 239000003921 oil Substances 0.000 description 6
- 235000019198 oils Nutrition 0.000 description 6
- 238000012216 screening Methods 0.000 description 6
- 235000000346 sugar Nutrition 0.000 description 6
- 208000024891 symptom Diseases 0.000 description 6
- 238000013518 transcription Methods 0.000 description 6
- 230000035897 transcription Effects 0.000 description 6
- 238000012546 transfer Methods 0.000 description 6
- FBWMYSQUTZRHAT-HZJYTTRNSA-N (9z,12z)-octadeca-9,12-dienoyl chloride Chemical compound CCCCC\C=C/C\C=C/CCCCCCCC(Cl)=O FBWMYSQUTZRHAT-HZJYTTRNSA-N 0.000 description 5
- ZAYHVCMSTBRABG-UHFFFAOYSA-N 5-Methylcytidine Natural products O=C1N=C(N)C(C)=CN1C1C(O)C(O)C(CO)O1 ZAYHVCMSTBRABG-UHFFFAOYSA-N 0.000 description 5
- VEXZGXHMUGYJMC-UHFFFAOYSA-M Chloride anion Chemical compound [Cl-] VEXZGXHMUGYJMC-UHFFFAOYSA-M 0.000 description 5
- XKMLYUALXHKNFT-UUOKFMHZSA-N Guanosine-5'-triphosphate Chemical compound C1=2NC(N)=NC(=O)C=2N=CN1[C@@H]1O[C@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)[C@@H](O)[C@H]1O XKMLYUALXHKNFT-UUOKFMHZSA-N 0.000 description 5
- 241000282412 Homo Species 0.000 description 5
- 241001465754 Metazoa Species 0.000 description 5
- PMZURENOXWZQFD-UHFFFAOYSA-L Sodium Sulfate Chemical compound [Na+].[Na+].[O-]S([O-])(=O)=O PMZURENOXWZQFD-UHFFFAOYSA-L 0.000 description 5
- 239000000427 antigen Substances 0.000 description 5
- 108091007433 antigens Proteins 0.000 description 5
- 102000036639 antigens Human genes 0.000 description 5
- 210000003123 bronchiole Anatomy 0.000 description 5
- 229910052799 carbon Inorganic materials 0.000 description 5
- 239000003795 chemical substances by application Substances 0.000 description 5
- KRKNYBCHXYNGOX-UHFFFAOYSA-N citric acid Chemical compound OC(=O)CC(O)(C(O)=O)CC(O)=O KRKNYBCHXYNGOX-UHFFFAOYSA-N 0.000 description 5
- 238000004440 column chromatography Methods 0.000 description 5
- 238000009826 distribution Methods 0.000 description 5
- 210000000981 epithelium Anatomy 0.000 description 5
- 238000003384 imaging method Methods 0.000 description 5
- 239000012535 impurity Substances 0.000 description 5
- 239000007788 liquid Substances 0.000 description 5
- 108010017391 lysylvaline Proteins 0.000 description 5
- 238000004519 manufacturing process Methods 0.000 description 5
- 125000002496 methyl group Chemical group [H]C([H])([H])* 0.000 description 5
- 210000003097 mucus Anatomy 0.000 description 5
- OYHQOLUKZRVURQ-UHFFFAOYSA-M octadeca-9,12-dienoate Chemical compound CCCCCC=CCC=CCCCCCCCC([O-])=O OYHQOLUKZRVURQ-UHFFFAOYSA-M 0.000 description 5
- 239000012044 organic layer Substances 0.000 description 5
- 239000000047 product Substances 0.000 description 5
- 239000011535 reaction buffer Substances 0.000 description 5
- 239000011780 sodium chloride Substances 0.000 description 5
- HPALAKNZSZLMCH-UHFFFAOYSA-M sodium;chloride;hydrate Chemical class O.[Na+].[Cl-] HPALAKNZSZLMCH-UHFFFAOYSA-M 0.000 description 5
- 238000006467 substitution reaction Methods 0.000 description 5
- 230000001225 therapeutic effect Effects 0.000 description 5
- 230000032258 transport Effects 0.000 description 5
- 238000005160 1H NMR spectroscopy Methods 0.000 description 4
- OMCCXGCDOABVEE-BBWANDEASA-N 2-[[1,3-bis[[(9Z,12Z)-octadeca-9,12-dienoyl]oxy]-2-[[(9Z,12Z)-octadeca-9,12-dienoyl]oxymethyl]propan-2-yl]amino]ethanesulfonic acid Chemical compound CCCCC/C=C\C/C=C\CCCCCCCC(OCC(COC(CCCCCCC/C=C\C/C=C\CCCCC)=O)(COC(CCCCCCC/C=C\C/C=C\CCCCC)=O)NCCS(O)(=O)=O)=O OMCCXGCDOABVEE-BBWANDEASA-N 0.000 description 4
- GJTBSTBJLVYKAU-XVFCMESISA-N 2-thiouridine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C(=S)NC(=O)C=C1 GJTBSTBJLVYKAU-XVFCMESISA-N 0.000 description 4
- HBZAZSCNDMDWEU-WREZULKGSA-N 3,5-diamino-6-chloro-n-[n'-[4-[4-[2-[hexyl-[(2s,3r,4r,5r)-2,3,4,5,6-pentahydroxyhexyl]amino]ethoxy]phenyl]butyl]carbamimidoyl]pyrazine-2-carboxamide Chemical compound C1=CC(OCCN(CCCCCC)C[C@H](O)[C@@H](O)[C@H](O)[C@H](O)CO)=CC=C1CCCCNC(=N)NC(=O)C1=NC(Cl)=C(N)N=C1N HBZAZSCNDMDWEU-WREZULKGSA-N 0.000 description 4
- MRPGNIOZXFPJGB-UHFFFAOYSA-N 3-(dimethylamino)propyl 4-hydroxy-3,5-dimethoxybenzoate Chemical compound COC1=CC(C(=O)OCCCN(C)C)=CC(OC)=C1O MRPGNIOZXFPJGB-UHFFFAOYSA-N 0.000 description 4
- UMOGNCVNHXWFIX-VIFPVBQESA-N 3-amino-N-[(2S)-2-hydroxypropyl]-5-[4-(trifluoromethoxy)phenyl]sulfonylpyridine-2-carboxamide Chemical compound NC=1C(=NC=C(C=1)S(=O)(=O)C1=CC=C(C=C1)OC(F)(F)F)C(=O)NC[C@H](C)O UMOGNCVNHXWFIX-VIFPVBQESA-N 0.000 description 4
- QUDOHCFOJCNKPK-QGZVFWFLSA-N 8-methyl-2-(3-methyl-1-benzofuran-2-yl)-5-[(1R)-1-(oxan-4-yl)ethoxy]quinoline-4-carboxylic acid Chemical compound O1CCC(CC1)[C@@H](C)OC1=C2C(=CC(=NC2=C(C=C1)C)C=1OC2=C(C=1C)C=CC=C2)C(=O)O QUDOHCFOJCNKPK-QGZVFWFLSA-N 0.000 description 4
- 108010051219 Cre recombinase Proteins 0.000 description 4
- 229940126130 GLPG2451 Drugs 0.000 description 4
- NYHBQMYGNKIUIF-UUOKFMHZSA-N Guanosine Chemical compound C1=NC=2C(=O)NC(N)=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O NYHBQMYGNKIUIF-UUOKFMHZSA-N 0.000 description 4
- WZUVPPKBWHMQCE-UHFFFAOYSA-N Haematoxylin Chemical compound C12=CC(O)=C(O)C=C2CC2(O)C1C1=CC=C(O)C(O)=C1OC2 WZUVPPKBWHMQCE-UHFFFAOYSA-N 0.000 description 4
- 108060001084 Luciferase Proteins 0.000 description 4
- TWRXJAOTZQYOKJ-UHFFFAOYSA-L Magnesium chloride Chemical compound [Mg+2].[Cl-].[Cl-] TWRXJAOTZQYOKJ-UHFFFAOYSA-L 0.000 description 4
- JGFZNNIVVJXRND-UHFFFAOYSA-N N,N-Diisopropylethylamine (DIPEA) Chemical compound CCN(C(C)C)C(C)C JGFZNNIVVJXRND-UHFFFAOYSA-N 0.000 description 4
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 4
- TYQIFWXBQYAKCR-UHFFFAOYSA-N N-[5-hydroxy-2,4-bis(trimethylsilyl)phenyl]-4-oxo-1H-quinoline-3-carboxamide Chemical compound C[Si](C)(C)C1=CC(=C(NC(=O)C2=CNC3=C(C=CC=C3)C2=O)C=C1O)[Si](C)(C)C TYQIFWXBQYAKCR-UHFFFAOYSA-N 0.000 description 4
- 101710163270 Nuclease Proteins 0.000 description 4
- XPEHHUISIBFLHX-RAIGVLPGSA-N O[C@H](C)C1=NN=C(O1)[C@@H]1C[C@H](C1)NC(=O)C1=CC(=NO1)C1=CC=CC=C1 Chemical compound O[C@H](C)C1=NN=C(O1)[C@@H]1C[C@H](C1)NC(=O)C1=CC(=NO1)C1=CC=CC=C1 XPEHHUISIBFLHX-RAIGVLPGSA-N 0.000 description 4
- WYURNTSHIVDZCO-UHFFFAOYSA-N Tetrahydrofuran Chemical compound C1CCOC1 WYURNTSHIVDZCO-UHFFFAOYSA-N 0.000 description 4
- IQFYYKKMVGJFEH-XLPZGREQSA-N Thymidine Chemical compound O=C1NC(=O)C(C)=CN1[C@@H]1O[C@H](CO)[C@@H](O)C1 IQFYYKKMVGJFEH-XLPZGREQSA-N 0.000 description 4
- DRTQHJPVMGBUCF-XVFCMESISA-N Uridine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-XVFCMESISA-N 0.000 description 4
- 239000012190 activator Substances 0.000 description 4
- 210000001552 airway epithelial cell Anatomy 0.000 description 4
- 108010087924 alanylproline Proteins 0.000 description 4
- 125000000304 alkynyl group Chemical group 0.000 description 4
- 239000003012 bilayer membrane Substances 0.000 description 4
- 239000000969 carrier Substances 0.000 description 4
- 230000015556 catabolic process Effects 0.000 description 4
- 229940106189 ceramide Drugs 0.000 description 4
- 238000004587 chromatography analysis Methods 0.000 description 4
- 238000012937 correction Methods 0.000 description 4
- 238000006731 degradation reaction Methods 0.000 description 4
- 108020001507 fusion proteins Proteins 0.000 description 4
- 102000037865 fusion proteins Human genes 0.000 description 4
- QVDYQHXNAQHIKH-TZIWHRDSSA-N galicaftor Chemical compound FC1(OC2=C(O1)C=CC(=C2)C1(CC1)C(=O)N[C@@H]1C[C@@H](OC2=CC(=CC=C12)OC(F)F)C1=CC=C(C(=O)O)C=C1)F QVDYQHXNAQHIKH-TZIWHRDSSA-N 0.000 description 4
- 239000000499 gel Substances 0.000 description 4
- 238000001502 gel electrophoresis Methods 0.000 description 4
- UYTPUPDQBNUYGX-UHFFFAOYSA-N guanine Chemical compound O=C1NC(N)=NC2=C1N=CN2 UYTPUPDQBNUYGX-UHFFFAOYSA-N 0.000 description 4
- 210000005260 human cell Anatomy 0.000 description 4
- 230000006872 improvement Effects 0.000 description 4
- 230000000670 limiting effect Effects 0.000 description 4
- 238000002156 mixing Methods 0.000 description 4
- PURKAOJPTOLRMP-ASMGOKTBSA-N n-[2-tert-butyl-4-[1,1,1,3,3,3-hexadeuterio-2-(trideuteriomethyl)propan-2-yl]-5-hydroxyphenyl]-4-oxo-1h-quinoline-3-carboxamide Chemical compound C1=C(O)C(C(C([2H])([2H])[2H])(C([2H])([2H])[2H])C([2H])([2H])[2H])=CC(C(C)(C)C)=C1NC(=O)C1=CNC2=CC=CC=C2C1=O PURKAOJPTOLRMP-ASMGOKTBSA-N 0.000 description 4
- 230000007935 neutral effect Effects 0.000 description 4
- NHOUNZMCSIHKHJ-FQEVSTJZSA-N olacaftor Chemical compound C1(=CC=CC=C1)S(=O)(=O)NC(=O)C=1C(=NC(=CC=1)C1=CC(=CC(=C1)OCC(C)C)F)N1C(C[C@@H](C1)C)(C)C NHOUNZMCSIHKHJ-FQEVSTJZSA-N 0.000 description 4
- 239000002245 particle Substances 0.000 description 4
- 239000012071 phase Substances 0.000 description 4
- 125000002467 phosphate group Chemical group [H]OP(=O)(O[H])O[*] 0.000 description 4
- 238000002360 preparation method Methods 0.000 description 4
- 229920006395 saturated elastomer Polymers 0.000 description 4
- 239000000741 silica gel Substances 0.000 description 4
- 229910002027 silica gel Inorganic materials 0.000 description 4
- 239000000377 silicon dioxide Substances 0.000 description 4
- 239000007787 solid Substances 0.000 description 4
- 241000894007 species Species 0.000 description 4
- 150000008163 sugars Chemical class 0.000 description 4
- 239000000725 suspension Substances 0.000 description 4
- LWIHDJKSTIGBAC-UHFFFAOYSA-K tripotassium phosphate Chemical compound [K+].[K+].[K+].[O-]P([O-])([O-])=O LWIHDJKSTIGBAC-UHFFFAOYSA-K 0.000 description 4
- 239000002691 unilamellar liposome Substances 0.000 description 4
- WUCYXAGSMRNPKY-UHFFFAOYSA-N 10-hydroxy-9-octylnonadecane-8,9,10-tricarboxylic acid Chemical compound CCCCCCCCC(C(O)=O)C(O)(C(O)=O)C(CCCCCCCC)(CCCCCCCC)C(O)=O WUCYXAGSMRNPKY-UHFFFAOYSA-N 0.000 description 3
- JKMHFZQWWAIEOD-UHFFFAOYSA-N 2-[4-(2-hydroxyethyl)piperazin-1-yl]ethanesulfonic acid Chemical compound OCC[NH+]1CCN(CCS([O-])(=O)=O)CC1 JKMHFZQWWAIEOD-UHFFFAOYSA-N 0.000 description 3
- ZISVTYVLWSZJAL-UHFFFAOYSA-N 3,6-bis[4-[bis(2-hydroxydodecyl)amino]butyl]piperazine-2,5-dione Chemical compound CCCCCCCCCCC(O)CN(CC(O)CCCCCCCCCC)CCCCC1NC(=O)C(CCCCN(CC(O)CCCCCCCCCC)CC(O)CCCCCCCCCC)NC1=O ZISVTYVLWSZJAL-UHFFFAOYSA-N 0.000 description 3
- 229960000549 4-dimethylaminophenol Drugs 0.000 description 3
- 101710117545 C protein Proteins 0.000 description 3
- 108010008532 Deoxyribonuclease I Proteins 0.000 description 3
- 102000007260 Deoxyribonuclease I Human genes 0.000 description 3
- QKCZZAZNMMVICF-DCAQKATOSA-N Gln-Leu-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O QKCZZAZNMMVICF-DCAQKATOSA-N 0.000 description 3
- ZRALSGWEFCBTJO-UHFFFAOYSA-N Guanidine Chemical compound NC(N)=N ZRALSGWEFCBTJO-UHFFFAOYSA-N 0.000 description 3
- 239000007995 HEPES buffer Substances 0.000 description 3
- 229930010555 Inosine Natural products 0.000 description 3
- UGQMRVRMYYASKQ-KQYNXXCUSA-N Inosine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C2=NC=NC(O)=C2N=C1 UGQMRVRMYYASKQ-KQYNXXCUSA-N 0.000 description 3
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 3
- 241000880493 Leptailurus serval Species 0.000 description 3
- SBANPBVRHYIMRR-GARJFASQSA-N Leu-Ser-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N SBANPBVRHYIMRR-GARJFASQSA-N 0.000 description 3
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 3
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 3
- SJRJJKPEHAURKC-UHFFFAOYSA-N N-Methylmorpholine Chemical compound CN1CCOCC1 SJRJJKPEHAURKC-UHFFFAOYSA-N 0.000 description 3
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 3
- 206010028980 Neoplasm Diseases 0.000 description 3
- UNBFGVQVQGXXCK-KKUMJFAQSA-N Phe-Ser-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O UNBFGVQVQGXXCK-KKUMJFAQSA-N 0.000 description 3
- 239000004698 Polyethylene Substances 0.000 description 3
- 229930185560 Pseudouridine Natural products 0.000 description 3
- PTJWIQPHWPFNBW-UHFFFAOYSA-N Pseudouridine C Natural products OC1C(O)C(CO)OC1C1=CNC(=O)NC1=O PTJWIQPHWPFNBW-UHFFFAOYSA-N 0.000 description 3
- 241000700159 Rattus Species 0.000 description 3
- IDQFQFVEWMWRQQ-DLOVCJGASA-N Ser-Ala-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O IDQFQFVEWMWRQQ-DLOVCJGASA-N 0.000 description 3
- MFQMZDPAZRZAPV-NAKRPEOUSA-N Ser-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CO)N MFQMZDPAZRZAPV-NAKRPEOUSA-N 0.000 description 3
- 229910004298 SiO 2 Inorganic materials 0.000 description 3
- UIIMBOGNXHQVGW-UHFFFAOYSA-M Sodium bicarbonate Chemical class [Na+].OC([O-])=O UIIMBOGNXHQVGW-UHFFFAOYSA-M 0.000 description 3
- NYDLOCKCVISJKK-WRBBJXAJSA-N [3-(dimethylamino)-2-[(z)-octadec-9-enoyl]oxypropyl] (z)-octadec-9-enoate Chemical compound CCCCCCCC\C=C/CCCCCCCC(=O)OCC(CN(C)C)OC(=O)CCCCCCC\C=C/CCCCCCCC NYDLOCKCVISJKK-WRBBJXAJSA-N 0.000 description 3
- 230000004913 activation Effects 0.000 description 3
- 125000002252 acyl group Chemical group 0.000 description 3
- 108010005233 alanylglutamic acid Proteins 0.000 description 3
- 125000001931 aliphatic group Chemical group 0.000 description 3
- 125000004450 alkenylene group Chemical group 0.000 description 3
- 125000002947 alkylene group Chemical group 0.000 description 3
- 108010062796 arginyllysine Proteins 0.000 description 3
- 238000000668 atmospheric pressure chemical ionisation mass spectrometry Methods 0.000 description 3
- WGDUUQDYDIIBKT-UHFFFAOYSA-N beta-Pseudouridine Natural products OC1OC(CN2C=CC(=O)NC2=O)C(O)C1O WGDUUQDYDIIBKT-UHFFFAOYSA-N 0.000 description 3
- 239000012267 brine Substances 0.000 description 3
- 201000011510 cancer Diseases 0.000 description 3
- 238000005251 capillar electrophoresis Methods 0.000 description 3
- 238000009295 crossflow filtration Methods 0.000 description 3
- 239000012043 crude product Substances 0.000 description 3
- 229940104302 cytosine Drugs 0.000 description 3
- 238000012217 deletion Methods 0.000 description 3
- 230000037430 deletion Effects 0.000 description 3
- 208000035475 disorder Diseases 0.000 description 3
- 238000001035 drying Methods 0.000 description 3
- 230000006870 function Effects 0.000 description 3
- 239000007789 gas Substances 0.000 description 3
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 3
- 229940015043 glyoxal Drugs 0.000 description 3
- 125000000623 heterocyclic group Chemical group 0.000 description 3
- 238000000265 homogenisation Methods 0.000 description 3
- 238000003364 immunohistochemistry Methods 0.000 description 3
- 229960003786 inosine Drugs 0.000 description 3
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 3
- 210000004185 liver Anatomy 0.000 description 3
- 230000033001 locomotion Effects 0.000 description 3
- 125000003588 lysine group Chemical group [H]N([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])(N([H])[H])C(*)=O 0.000 description 3
- 108010054155 lysyllysine Proteins 0.000 description 3
- 238000005259 measurement Methods 0.000 description 3
- 230000007246 mechanism Effects 0.000 description 3
- 239000002609 medium Substances 0.000 description 3
- 125000003835 nucleoside group Chemical group 0.000 description 3
- 239000000546 pharmaceutical excipient Substances 0.000 description 3
- 230000008569 process Effects 0.000 description 3
- WGYKZJWCGVVSQN-UHFFFAOYSA-N propylamine Chemical compound CCCN WGYKZJWCGVVSQN-UHFFFAOYSA-N 0.000 description 3
- 230000012846 protein folding Effects 0.000 description 3
- PTJWIQPHWPFNBW-GBNDHIKLSA-N pseudouridine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1C1=CNC(=O)NC1=O PTJWIQPHWPFNBW-GBNDHIKLSA-N 0.000 description 3
- UMJSCPRVCHMLSP-UHFFFAOYSA-N pyridine Natural products COC1=CC=CN=C1 UMJSCPRVCHMLSP-UHFFFAOYSA-N 0.000 description 3
- 239000000523 sample Substances 0.000 description 3
- 239000011734 sodium Substances 0.000 description 3
- 229910052938 sodium sulfate Inorganic materials 0.000 description 3
- 235000011152 sodium sulphate Nutrition 0.000 description 3
- 238000000527 sonication Methods 0.000 description 3
- 230000009885 systemic effect Effects 0.000 description 3
- 230000008685 targeting Effects 0.000 description 3
- 108010061238 threonyl-glycine Proteins 0.000 description 3
- 238000003325 tomography Methods 0.000 description 3
- 125000003258 trimethylene group Chemical group [H]C([H])([*:2])C([H])([H])C([H])([H])[*:1] 0.000 description 3
- 108010051110 tyrosyl-lysine Proteins 0.000 description 3
- 239000013598 vector Substances 0.000 description 3
- 238000001262 western blot Methods 0.000 description 3
- YBJHBAHKTGYVGT-ZKWXMUAHSA-N (+)-Biotin Chemical compound N1C(=O)N[C@@H]2[C@H](CCCCC(=O)O)SC[C@@H]21 YBJHBAHKTGYVGT-ZKWXMUAHSA-N 0.000 description 2
- RIFDKYBNWNPCQK-IOSLPCCCSA-N (2r,3s,4r,5r)-2-(hydroxymethyl)-5-(6-imino-3-methylpurin-9-yl)oxolane-3,4-diol Chemical compound C1=2N(C)C=NC(=N)C=2N=CN1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O RIFDKYBNWNPCQK-IOSLPCCCSA-N 0.000 description 2
- 125000006708 (C5-C14) heteroaryl group Chemical group 0.000 description 2
- SCYULBFZEHDVBN-UHFFFAOYSA-N 1,1-Dichloroethane Chemical compound CC(Cl)Cl SCYULBFZEHDVBN-UHFFFAOYSA-N 0.000 description 2
- KILNVBDSWZSGLL-KXQOOQHDSA-N 1,2-dihexadecanoyl-sn-glycero-3-phosphocholine Chemical compound CCCCCCCCCCCCCCCC(=O)OC[C@H](COP([O-])(=O)OCC[N+](C)(C)C)OC(=O)CCCCCCCCCCCCCCC KILNVBDSWZSGLL-KXQOOQHDSA-N 0.000 description 2
- SLKDGVPOSSLUAI-PGUFJCEWSA-N 1,2-dihexadecanoyl-sn-glycero-3-phosphoethanolamine zwitterion Chemical compound CCCCCCCCCCCCCCCC(=O)OC[C@H](COP(O)(=O)OCCN)OC(=O)CCCCCCCCCCCCCCC SLKDGVPOSSLUAI-PGUFJCEWSA-N 0.000 description 2
- SNKAWJBJQDLSFF-NVKMUCNASA-N 1,2-dioleoyl-sn-glycero-3-phosphocholine Chemical compound CCCCCCCC\C=C/CCCCCCCC(=O)OC[C@H](COP([O-])(=O)OCC[N+](C)(C)C)OC(=O)CCCCCCC\C=C/CCCCCCCC SNKAWJBJQDLSFF-NVKMUCNASA-N 0.000 description 2
- NRJAVPSFFCBXDT-HUESYALOSA-N 1,2-distearoyl-sn-glycero-3-phosphocholine Chemical compound CCCCCCCCCCCCCCCCCC(=O)OC[C@H](COP([O-])(=O)OCC[N+](C)(C)C)OC(=O)CCCCCCCCCCCCCCCCC NRJAVPSFFCBXDT-HUESYALOSA-N 0.000 description 2
- LVNGJLRDBYCPGB-UHFFFAOYSA-N 1,2-distearoylphosphatidylethanolamine Chemical compound CCCCCCCCCCCCCCCCCC(=O)OCC(COP([O-])(=O)OCC[NH3+])OC(=O)CCCCCCCCCCCCCCCCC LVNGJLRDBYCPGB-UHFFFAOYSA-N 0.000 description 2
- BIABMEZBCHDPBV-MPQUPPDSSA-N 1,2-palmitoyl-sn-glycero-3-phospho-(1'-sn-glycerol) Chemical compound CCCCCCCCCCCCCCCC(=O)OC[C@H](COP(O)(=O)OC[C@@H](O)CO)OC(=O)CCCCCCCCCCCCCCC BIABMEZBCHDPBV-MPQUPPDSSA-N 0.000 description 2
- KBPLFHHGFOOTCA-UHFFFAOYSA-N 1-Octanol Chemical compound CCCCCCCCO KBPLFHHGFOOTCA-UHFFFAOYSA-N 0.000 description 2
- RKSLVDIXBGWPIS-UAKXSSHOSA-N 1-[(2r,3r,4s,5r)-3,4-dihydroxy-5-(hydroxymethyl)oxolan-2-yl]-5-iodopyrimidine-2,4-dione Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)NC(=O)C(I)=C1 RKSLVDIXBGWPIS-UAKXSSHOSA-N 0.000 description 2
- QLOCVMVCRJOTTM-TURQNECASA-N 1-[(2r,3r,4s,5r)-3,4-dihydroxy-5-(hydroxymethyl)oxolan-2-yl]-5-prop-1-ynylpyrimidine-2,4-dione Chemical compound O=C1NC(=O)C(C#CC)=CN1[C@H]1[C@H](O)[C@H](O)[C@@H](CO)O1 QLOCVMVCRJOTTM-TURQNECASA-N 0.000 description 2
- PISWNSOQFZRVJK-XLPZGREQSA-N 1-[(2r,4s,5r)-4-hydroxy-5-(hydroxymethyl)oxolan-2-yl]-5-methyl-2-sulfanylidenepyrimidin-4-one Chemical compound S=C1NC(=O)C(C)=CN1[C@@H]1O[C@H](CO)[C@@H](O)C1 PISWNSOQFZRVJK-XLPZGREQSA-N 0.000 description 2
- UHDGCWIWMRVCDJ-UHFFFAOYSA-N 1-beta-D-Xylofuranosyl-NH-Cytosine Natural products O=C1N=C(N)C=CN1C1C(O)C(O)C(CO)O1 UHDGCWIWMRVCDJ-UHFFFAOYSA-N 0.000 description 2
- JRYMOPZHXMVHTA-DAGMQNCNSA-N 2-amino-7-[(2r,3r,4s,5r)-3,4-dihydroxy-5-(hydroxymethyl)oxolan-2-yl]-1h-pyrrolo[2,3-d]pyrimidin-4-one Chemical compound C1=CC=2C(=O)NC(N)=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O JRYMOPZHXMVHTA-DAGMQNCNSA-N 0.000 description 2
- NEZDNQCXEZDCBI-UHFFFAOYSA-N 2-azaniumylethyl 2,3-di(tetradecanoyloxy)propyl phosphate Chemical compound CCCCCCCCCCCCCC(=O)OCC(COP(O)(=O)OCCN)OC(=O)CCCCCCCCCCCCC NEZDNQCXEZDCBI-UHFFFAOYSA-N 0.000 description 2
- FPQQSJJWHUJYPU-UHFFFAOYSA-N 3-(dimethylamino)propyliminomethylidene-ethylazanium;chloride Chemical compound Cl.CCN=C=NCCCN(C)C FPQQSJJWHUJYPU-UHFFFAOYSA-N 0.000 description 2
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 description 2
- LMMLLWZHCKCFQA-UGKPPGOTSA-N 4-amino-1-[(2r,3r,4s,5r)-3,4-dihydroxy-5-(hydroxymethyl)-2-prop-1-ynyloxolan-2-yl]pyrimidin-2-one Chemical compound C1=CC(N)=NC(=O)N1[C@]1(C#CC)O[C@H](CO)[C@@H](O)[C@H]1O LMMLLWZHCKCFQA-UGKPPGOTSA-N 0.000 description 2
- XXSIICQLPUAUDF-TURQNECASA-N 4-amino-1-[(2r,3r,4s,5r)-3,4-dihydroxy-5-(hydroxymethyl)oxolan-2-yl]-5-prop-1-ynylpyrimidin-2-one Chemical compound O=C1N=C(N)C(C#CC)=CN1[C@H]1[C@H](O)[C@H](O)[C@@H](CO)O1 XXSIICQLPUAUDF-TURQNECASA-N 0.000 description 2
- AGFIRQJZCNVMCW-UAKXSSHOSA-N 5-bromouridine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)NC(=O)C(Br)=C1 AGFIRQJZCNVMCW-UAKXSSHOSA-N 0.000 description 2
- FHIDNBAQOFJWCA-UAKXSSHOSA-N 5-fluorouridine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)NC(=O)C(F)=C1 FHIDNBAQOFJWCA-UAKXSSHOSA-N 0.000 description 2
- KDOPAZIWBAHVJB-UHFFFAOYSA-N 5h-pyrrolo[3,2-d]pyrimidine Chemical compound C1=NC=C2NC=CC2=N1 KDOPAZIWBAHVJB-UHFFFAOYSA-N 0.000 description 2
- BXJHWYVXLGLDMZ-UHFFFAOYSA-N 6-O-methylguanine Chemical compound COC1=NC(N)=NC2=C1NC=N2 BXJHWYVXLGLDMZ-UHFFFAOYSA-N 0.000 description 2
- HCAJQHYUCKICQH-VPENINKCSA-N 8-Oxo-7,8-dihydro-2'-deoxyguanosine Chemical compound C1=2NC(N)=NC(=O)C=2NC(=O)N1[C@H]1C[C@H](O)[C@@H](CO)O1 HCAJQHYUCKICQH-VPENINKCSA-N 0.000 description 2
- XQGIRPGAVLFKBJ-CIUDSAMLSA-N Ala-Asn-Lys Chemical compound N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)O XQGIRPGAVLFKBJ-CIUDSAMLSA-N 0.000 description 2
- OMSKGWFGWCQFBD-KZVJFYERSA-N Ala-Val-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OMSKGWFGWCQFBD-KZVJFYERSA-N 0.000 description 2
- CIWBSHSKHKDKBQ-JLAZNSOCSA-N Ascorbic acid Chemical compound OC[C@H](O)[C@H]1OC(=O)C(O)=C1O CIWBSHSKHKDKBQ-JLAZNSOCSA-N 0.000 description 2
- XPGVTUBABLRGHY-BIIVOSGPSA-N Asp-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N XPGVTUBABLRGHY-BIIVOSGPSA-N 0.000 description 2
- YDNKGFDKKRUKPY-JHOUSYSJSA-N C16 ceramide Natural products CCCCCCCCCCCCCCCC(=O)N[C@@H](CO)[C@H](O)C=CCCCCCCCCCCCCC YDNKGFDKKRUKPY-JHOUSYSJSA-N 0.000 description 2
- 125000005915 C6-C14 aryl group Chemical group 0.000 description 2
- DRSHXJFUUPIBHX-UHFFFAOYSA-N COc1ccc(cc1)N1N=CC2C=NC(Nc3cc(OC)c(OC)c(OCCCN4CCN(C)CC4)c3)=NC12 Chemical compound COc1ccc(cc1)N1N=CC2C=NC(Nc3cc(OC)c(OC)c(OCCCN4CCN(C)CC4)c3)=NC12 DRSHXJFUUPIBHX-UHFFFAOYSA-N 0.000 description 2
- 241000282472 Canis lupus familiaris Species 0.000 description 2
- 108091006146 Channels Proteins 0.000 description 2
- MIKUYHXYGGJMLM-GIMIYPNGSA-N Crotonoside Natural products C1=NC2=C(N)NC(=O)N=C2N1[C@H]1O[C@@H](CO)[C@H](O)[C@@H]1O MIKUYHXYGGJMLM-GIMIYPNGSA-N 0.000 description 2
- UHDGCWIWMRVCDJ-PSQAKQOGSA-N Cytidine Natural products O=C1N=C(N)C=CN1[C@@H]1[C@@H](O)[C@@H](O)[C@H](CO)O1 UHDGCWIWMRVCDJ-PSQAKQOGSA-N 0.000 description 2
- NYHBQMYGNKIUIF-UHFFFAOYSA-N D-guanosine Natural products C1=2NC(N)=NC(=O)C=2N=CN1C1OC(CO)C(O)C1O NYHBQMYGNKIUIF-UHFFFAOYSA-N 0.000 description 2
- HMFHBZSHGGEWLO-SOOFDHNKSA-N D-ribofuranose Chemical class OC[C@H]1OC(O)[C@H](O)[C@@H]1O HMFHBZSHGGEWLO-SOOFDHNKSA-N 0.000 description 2
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 2
- LYCAIKOWRPUZTN-UHFFFAOYSA-N Ethylene glycol Chemical compound OCCO LYCAIKOWRPUZTN-UHFFFAOYSA-N 0.000 description 2
- OHCQJHSOBUTRHG-KGGHGJDLSA-N FORSKOLIN Chemical compound O=C([C@@]12O)C[C@](C)(C=C)O[C@]1(C)[C@@H](OC(=O)C)[C@@H](O)[C@@H]1[C@]2(C)[C@@H](O)CCC1(C)C OHCQJHSOBUTRHG-KGGHGJDLSA-N 0.000 description 2
- 108090000331 Firefly luciferases Proteins 0.000 description 2
- LVNILKSSFHCSJZ-IHRRRGAJSA-N Gln-Gln-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N LVNILKSSFHCSJZ-IHRRRGAJSA-N 0.000 description 2
- OZEQPCDLCDRCGY-SOUVJXGZSA-N Gln-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CCC(=O)N)N)C(=O)O OZEQPCDLCDRCGY-SOUVJXGZSA-N 0.000 description 2
- HUWSBFYAGXCXKC-CIUDSAMLSA-N Glu-Ala-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O HUWSBFYAGXCXKC-CIUDSAMLSA-N 0.000 description 2
- RJONUNZIMUXUOI-GUBZILKMSA-N Glu-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N RJONUNZIMUXUOI-GUBZILKMSA-N 0.000 description 2
- OQXDUSZKISQQSS-GUBZILKMSA-N Glu-Lys-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OQXDUSZKISQQSS-GUBZILKMSA-N 0.000 description 2
- FQFWFZWOHOEVMZ-IHRRRGAJSA-N Glu-Phe-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O FQFWFZWOHOEVMZ-IHRRRGAJSA-N 0.000 description 2
- 102000005720 Glutathione transferase Human genes 0.000 description 2
- 108010070675 Glutathione transferase Proteins 0.000 description 2
- DTRUBYPMMVPQPD-YUMQZZPRSA-N Gly-Gln-Arg Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O DTRUBYPMMVPQPD-YUMQZZPRSA-N 0.000 description 2
- JZNWSCPGTDBMEW-UHFFFAOYSA-N Glycerophosphorylethanolamin Natural products NCCOP(O)(=O)OCC(O)CO JZNWSCPGTDBMEW-UHFFFAOYSA-N 0.000 description 2
- JGFWUKYIQAEYAH-DCAQKATOSA-N His-Ser-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O JGFWUKYIQAEYAH-DCAQKATOSA-N 0.000 description 2
- 102000002265 Human Growth Hormone Human genes 0.000 description 2
- 108010000521 Human Growth Hormone Proteins 0.000 description 2
- 239000000854 Human Growth Hormone Substances 0.000 description 2
- QICVAHODWHIWIS-HTFCKZLJSA-N Ile-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N QICVAHODWHIWIS-HTFCKZLJSA-N 0.000 description 2
- BEWFWZRGBDVXRP-PEFMBERDSA-N Ile-Glu-Asn Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O BEWFWZRGBDVXRP-PEFMBERDSA-N 0.000 description 2
- JNLSTRPWUXOORL-MMWGEVLESA-N Ile-Ser-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N JNLSTRPWUXOORL-MMWGEVLESA-N 0.000 description 2
- 208000026350 Inborn Genetic disease Diseases 0.000 description 2
- 108010065920 Insulin Lispro Proteins 0.000 description 2
- XEEYBQQBJWHFJM-UHFFFAOYSA-N Iron Chemical compound [Fe] XEEYBQQBJWHFJM-UHFFFAOYSA-N 0.000 description 2
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 2
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 2
- ILJREDZFPHTUIE-GUBZILKMSA-N Leu-Asp-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ILJREDZFPHTUIE-GUBZILKMSA-N 0.000 description 2
- FQZPTCNSNPWHLJ-AVGNSLFASA-N Leu-Gln-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(O)=O FQZPTCNSNPWHLJ-AVGNSLFASA-N 0.000 description 2
- KGCLIYGPQXUNLO-IUCAKERBSA-N Leu-Gly-Glu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O KGCLIYGPQXUNLO-IUCAKERBSA-N 0.000 description 2
- AOFYPTOHESIBFZ-KKUMJFAQSA-N Leu-His-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O AOFYPTOHESIBFZ-KKUMJFAQSA-N 0.000 description 2
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 2
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 2
- 208000019693 Lung disease Diseases 0.000 description 2
- SKRGVGLIRUGANF-AVGNSLFASA-N Lys-Leu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SKRGVGLIRUGANF-AVGNSLFASA-N 0.000 description 2
- 101710175625 Maltose/maltodextrin-binding periplasmic protein Proteins 0.000 description 2
- CRJGESKKUOMBCT-VQTJNVASSA-N N-acetylsphinganine Chemical compound CCCCCCCCCCCCCCC[C@@H](O)[C@H](CO)NC(C)=O CRJGESKKUOMBCT-VQTJNVASSA-N 0.000 description 2
- JOCBASBOOFNAJA-UHFFFAOYSA-N N-tris(hydroxymethyl)methyl-2-aminoethanesulfonic acid Chemical compound OCC(CO)(CO)NCCS(O)(=O)=O JOCBASBOOFNAJA-UHFFFAOYSA-N 0.000 description 2
- 241000283973 Oryctolagus cuniculus Species 0.000 description 2
- 229910019142 PO4 Inorganic materials 0.000 description 2
- KNYPNEYICHHLQL-ACRUOGEOSA-N Phe-Leu-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 KNYPNEYICHHLQL-ACRUOGEOSA-N 0.000 description 2
- GPLWGAYGROGDEN-BZSNNMDCSA-N Phe-Phe-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O GPLWGAYGROGDEN-BZSNNMDCSA-N 0.000 description 2
- GLUUGHFHXGJENI-UHFFFAOYSA-N Piperazine Chemical compound C1CNCCN1 GLUUGHFHXGJENI-UHFFFAOYSA-N 0.000 description 2
- 239000004743 Polypropylene Substances 0.000 description 2
- WCUXLLCKKVVCTQ-UHFFFAOYSA-M Potassium chloride Chemical compound [Cl-].[K+] WCUXLLCKKVVCTQ-UHFFFAOYSA-M 0.000 description 2
- ATUOYWHBWRKTHZ-UHFFFAOYSA-N Propane Chemical compound CCC ATUOYWHBWRKTHZ-UHFFFAOYSA-N 0.000 description 2
- 108010007568 Protamines Proteins 0.000 description 2
- 102000007327 Protamines Human genes 0.000 description 2
- 102000009609 Pyrophosphatases Human genes 0.000 description 2
- 108010009413 Pyrophosphatases Proteins 0.000 description 2
- 108091034057 RNA (poly(A)) Proteins 0.000 description 2
- PYMYPHUHKUWMLA-LMVFSUKVSA-N Ribose Natural products OC[C@@H](O)[C@@H](O)[C@@H](O)C=O PYMYPHUHKUWMLA-LMVFSUKVSA-N 0.000 description 2
- PURRNJBBXDDWLX-ZDLURKLDSA-N Ser-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CO)N)O PURRNJBBXDDWLX-ZDLURKLDSA-N 0.000 description 2
- 108020004682 Single-Stranded DNA Proteins 0.000 description 2
- VMHLLURERBWHNL-UHFFFAOYSA-M Sodium acetate Chemical compound [Na+].CC([O-])=O VMHLLURERBWHNL-UHFFFAOYSA-M 0.000 description 2
- UIIMBOGNXHQVGW-DEQYMQKBSA-M Sodium bicarbonate-14C Chemical compound [Na+].O[14C]([O-])=O UIIMBOGNXHQVGW-DEQYMQKBSA-M 0.000 description 2
- DEGCBBCMYWNJNA-RHYQMDGZSA-N Thr-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O DEGCBBCMYWNJNA-RHYQMDGZSA-N 0.000 description 2
- AHERARIZBPOMNU-KATARQTJSA-N Thr-Ser-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O AHERARIZBPOMNU-KATARQTJSA-N 0.000 description 2
- VFJIWSJKZJTQII-SRVKXCTJSA-N Tyr-Asp-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O VFJIWSJKZJTQII-SRVKXCTJSA-N 0.000 description 2
- 101150044878 US18 gene Proteins 0.000 description 2
- IZFVRRYRMQFVGX-NRPADANISA-N Val-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N IZFVRRYRMQFVGX-NRPADANISA-N 0.000 description 2
- YFOCMOVJBQDBCE-NRPADANISA-N Val-Ala-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N YFOCMOVJBQDBCE-NRPADANISA-N 0.000 description 2
- VXDSPJJQUQDCKH-UKJIMTQDSA-N Val-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N VXDSPJJQUQDCKH-UKJIMTQDSA-N 0.000 description 2
- YMTOEGGOCHVGEH-IHRRRGAJSA-N Val-Lys-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O YMTOEGGOCHVGEH-IHRRRGAJSA-N 0.000 description 2
- 241000700605 Viruses Species 0.000 description 2
- DSNRWDQKZIEDDB-GCMPNPAFSA-N [(2r)-3-[2,3-dihydroxypropoxy(hydroxy)phosphoryl]oxy-2-[(z)-octadec-9-enoyl]oxypropyl] (z)-octadec-9-enoate Chemical compound CCCCCCCC\C=C/CCCCCCCC(=O)OC[C@H](COP(O)(=O)OCC(O)CO)OC(=O)CCCCCCC\C=C/CCCCCCCC DSNRWDQKZIEDDB-GCMPNPAFSA-N 0.000 description 2
- 239000000654 additive Substances 0.000 description 2
- 239000000443 aerosol Substances 0.000 description 2
- 210000005058 airway cell Anatomy 0.000 description 2
- 108010041407 alanylaspartic acid Proteins 0.000 description 2
- 125000003282 alkyl amino group Chemical group 0.000 description 2
- HMFHBZSHGGEWLO-UHFFFAOYSA-N alpha-D-Furanose-Ribose Natural products OCC1OC(O)C(O)C1O HMFHBZSHGGEWLO-UHFFFAOYSA-N 0.000 description 2
- BFNBIHQBYMNNAN-UHFFFAOYSA-N ammonium sulfate Chemical compound N.N.OS(O)(=O)=O BFNBIHQBYMNNAN-UHFFFAOYSA-N 0.000 description 2
- 229910052921 ammonium sulfate Inorganic materials 0.000 description 2
- 235000011130 ammonium sulphate Nutrition 0.000 description 2
- PYMYPHUHKUWMLA-WDCZJNDASA-N arabinose Chemical class OC[C@@H](O)[C@@H](O)[C@H](O)C=O PYMYPHUHKUWMLA-WDCZJNDASA-N 0.000 description 2
- PYMYPHUHKUWMLA-UHFFFAOYSA-N arabinose Natural products OCC(O)C(O)C(O)C=O PYMYPHUHKUWMLA-UHFFFAOYSA-N 0.000 description 2
- 108010013835 arginine glutamate Proteins 0.000 description 2
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 2
- 108010093581 aspartyl-proline Proteins 0.000 description 2
- 108010092854 aspartyllysine Proteins 0.000 description 2
- SRBFZHDQGSBBOR-UHFFFAOYSA-N beta-D-Pyranose-Lyxose Natural products OC1COC(O)C(O)C1O SRBFZHDQGSBBOR-UHFFFAOYSA-N 0.000 description 2
- DRTQHJPVMGBUCF-PSQAKQOGSA-N beta-L-uridine Natural products O[C@H]1[C@@H](O)[C@H](CO)O[C@@H]1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-PSQAKQOGSA-N 0.000 description 2
- 210000000424 bronchial epithelial cell Anatomy 0.000 description 2
- 238000004364 calculation method Methods 0.000 description 2
- 125000004452 carbocyclyl group Chemical group 0.000 description 2
- 210000000170 cell membrane Anatomy 0.000 description 2
- 238000005119 centrifugation Methods 0.000 description 2
- ZVEQCJWYRWKARO-UHFFFAOYSA-N ceramide Natural products CCCCCCCCCCCCCCC(O)C(=O)NC(CO)C(O)C=CCCC=C(C)CCCCCCCCC ZVEQCJWYRWKARO-UHFFFAOYSA-N 0.000 description 2
- 238000007385 chemical modification Methods 0.000 description 2
- 239000012320 chlorinating reagent Substances 0.000 description 2
- 210000000254 ciliated cell Anatomy 0.000 description 2
- 235000015165 citric acid Nutrition 0.000 description 2
- 238000013145 classification model Methods 0.000 description 2
- 229910052681 coesite Inorganic materials 0.000 description 2
- 229940125782 compound 2 Drugs 0.000 description 2
- 229940126214 compound 3 Drugs 0.000 description 2
- 239000000356 contaminant Substances 0.000 description 2
- 238000013270 controlled release Methods 0.000 description 2
- 229910052906 cristobalite Inorganic materials 0.000 description 2
- UHDGCWIWMRVCDJ-ZAKLUEHWSA-N cytidine Chemical compound O=C1N=C(N)C=CN1[C@H]1[C@H](O)[C@@H](O)[C@H](CO)O1 UHDGCWIWMRVCDJ-ZAKLUEHWSA-N 0.000 description 2
- 231100000135 cytotoxicity Toxicity 0.000 description 2
- 230000003013 cytotoxicity Effects 0.000 description 2
- 230000029087 digestion Effects 0.000 description 2
- 239000003814 drug Substances 0.000 description 2
- 230000009881 electrostatic interaction Effects 0.000 description 2
- 239000000839 emulsion Substances 0.000 description 2
- YQGOJNYOYNNSMM-UHFFFAOYSA-N eosin Chemical compound [Na+].OC(=O)C1=CC=CC=C1C1=C2C=C(Br)C(=O)C(Br)=C2OC2=C(Br)C(O)=C(Br)C=C21 YQGOJNYOYNNSMM-UHFFFAOYSA-N 0.000 description 2
- 150000002148 esters Chemical class 0.000 description 2
- 238000000605 extraction Methods 0.000 description 2
- 230000004927 fusion Effects 0.000 description 2
- 208000016361 genetic disease Diseases 0.000 description 2
- 108010010147 glycylglutamine Proteins 0.000 description 2
- 108010037850 glycylvaline Proteins 0.000 description 2
- 230000012010 growth Effects 0.000 description 2
- 229940029575 guanosine Drugs 0.000 description 2
- 229910052736 halogen Inorganic materials 0.000 description 2
- 150000002367 halogens Chemical class 0.000 description 2
- 210000003494 hepatocyte Anatomy 0.000 description 2
- 150000002402 hexoses Chemical class 0.000 description 2
- 238000004128 high performance liquid chromatography Methods 0.000 description 2
- 238000010348 incorporation Methods 0.000 description 2
- 239000007924 injection Substances 0.000 description 2
- 238000002347 injection Methods 0.000 description 2
- 238000003780 insertion Methods 0.000 description 2
- 230000037431 insertion Effects 0.000 description 2
- 238000009830 intercalation Methods 0.000 description 2
- 238000001990 intravenous administration Methods 0.000 description 2
- 108010034529 leucyl-lysine Proteins 0.000 description 2
- 239000003446 ligand Substances 0.000 description 2
- 238000004020 luminiscence type Methods 0.000 description 2
- 229910001629 magnesium chloride Inorganic materials 0.000 description 2
- 235000011147 magnesium chloride Nutrition 0.000 description 2
- HQKMJHAJHXVSDF-UHFFFAOYSA-L magnesium stearate Chemical compound [Mg+2].CCCCCCCCCCCCCCCCCC([O-])=O.CCCCCCCCCCCCCCCCCC([O-])=O HQKMJHAJHXVSDF-UHFFFAOYSA-L 0.000 description 2
- 108010056582 methionylglutamic acid Proteins 0.000 description 2
- PSHKMPUSSFXUIA-UHFFFAOYSA-N n,n-dimethylpyridin-2-amine Chemical compound CN(C)C1=CC=CC=N1 PSHKMPUSSFXUIA-UHFFFAOYSA-N 0.000 description 2
- VVGIYYKRAMHVLU-UHFFFAOYSA-N newbouldiamide Natural products CCCCCCCCCCCCCCCCCCCC(O)C(O)C(O)C(CO)NC(=O)CCCCCCCCCCCCCCCCC VVGIYYKRAMHVLU-UHFFFAOYSA-N 0.000 description 2
- 125000002347 octyl group Chemical group [H]C([*])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])[H] 0.000 description 2
- 229910052760 oxygen Inorganic materials 0.000 description 2
- 230000036961 partial effect Effects 0.000 description 2
- GJVFBWCTGUSGDD-UHFFFAOYSA-L pentamethonium bromide Chemical compound [Br-].[Br-].C[N+](C)(C)CCCCC[N+](C)(C)C GJVFBWCTGUSGDD-UHFFFAOYSA-L 0.000 description 2
- 108010012581 phenylalanylglutamate Proteins 0.000 description 2
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 2
- 108010051242 phenylalanylserine Proteins 0.000 description 2
- 235000021317 phosphate Nutrition 0.000 description 2
- WTJKGGKOPKCXLL-RRHRGVEJSA-N phosphatidylcholine Chemical compound CCCCCCCCCCCCCCCC(=O)OC[C@H](COP([O-])(=O)OCC[N+](C)(C)C)OC(=O)CCCCCCCC=CCCCCCCCC WTJKGGKOPKCXLL-RRHRGVEJSA-N 0.000 description 2
- 150000004713 phosphodiesters Chemical class 0.000 description 2
- 150000003904 phospholipids Chemical class 0.000 description 2
- 229920001155 polypropylene Polymers 0.000 description 2
- 239000004810 polytetrafluoroethylene Substances 0.000 description 2
- 229920001343 polytetrafluoroethylene Polymers 0.000 description 2
- 230000004481 post-translational protein modification Effects 0.000 description 2
- 229910000160 potassium phosphate Inorganic materials 0.000 description 2
- 235000011009 potassium phosphates Nutrition 0.000 description 2
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 2
- 230000002035 prolonged effect Effects 0.000 description 2
- 108010029020 prolylglycine Proteins 0.000 description 2
- 239000001294 propane Substances 0.000 description 2
- 229940048914 protamine Drugs 0.000 description 2
- 238000003259 recombinant expression Methods 0.000 description 2
- 108010054624 red fluorescent protein Proteins 0.000 description 2
- 238000009256 replacement therapy Methods 0.000 description 2
- 238000011160 research Methods 0.000 description 2
- 108091008146 restriction endonucleases Proteins 0.000 description 2
- 239000003161 ribonuclease inhibitor Substances 0.000 description 2
- 102220199585 rs1057521790 Human genes 0.000 description 2
- 230000003248 secreting effect Effects 0.000 description 2
- 210000002955 secretory cell Anatomy 0.000 description 2
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 2
- 235000012239 silicon dioxide Nutrition 0.000 description 2
- 239000001632 sodium acetate Substances 0.000 description 2
- 235000017281 sodium acetate Nutrition 0.000 description 2
- 239000001509 sodium citrate Substances 0.000 description 2
- NLJMYIDDQXHKNR-UHFFFAOYSA-K sodium citrate Chemical compound O.O.[Na+].[Na+].[Na+].[O-]C(=O)CC(O)(CC([O-])=O)C([O-])=O NLJMYIDDQXHKNR-UHFFFAOYSA-K 0.000 description 2
- 235000011083 sodium citrates Nutrition 0.000 description 2
- 238000004611 spectroscopical analysis Methods 0.000 description 2
- ATHGHQPFGPMSJY-UHFFFAOYSA-N spermidine Chemical compound NCCCCNCCCN ATHGHQPFGPMSJY-UHFFFAOYSA-N 0.000 description 2
- 238000003756 stirring Methods 0.000 description 2
- 229910052682 stishovite Inorganic materials 0.000 description 2
- 108010018381 streptavidin-binding peptide Proteins 0.000 description 2
- 229910052717 sulfur Inorganic materials 0.000 description 2
- JMSVCTWVEWCHDZ-UHFFFAOYSA-N syringic acid Chemical compound COC1=CC(C(O)=O)=CC(OC)=C1O JMSVCTWVEWCHDZ-UHFFFAOYSA-N 0.000 description 2
- 238000012385 systemic delivery Methods 0.000 description 2
- 238000010381 tandem affinity purification Methods 0.000 description 2
- YLQBMQCUIZJEEH-UHFFFAOYSA-N tetrahydrofuran Natural products C=1C=COC=1 YLQBMQCUIZJEEH-UHFFFAOYSA-N 0.000 description 2
- RYYWUUFWQRZTIU-UHFFFAOYSA-K thiophosphate Chemical compound [O-]P([O-])([O-])=S RYYWUUFWQRZTIU-UHFFFAOYSA-K 0.000 description 2
- 230000001988 toxicity Effects 0.000 description 2
- 231100000419 toxicity Toxicity 0.000 description 2
- QORWJWZARLRLPR-UHFFFAOYSA-H tricalcium bis(phosphate) Chemical compound [Ca+2].[Ca+2].[Ca+2].[O-]P([O-])([O-])=O.[O-]P([O-])([O-])=O QORWJWZARLRLPR-UHFFFAOYSA-H 0.000 description 2
- 229910052905 tridymite Inorganic materials 0.000 description 2
- 239000001226 triphosphate Substances 0.000 description 2
- 235000011178 triphosphate Nutrition 0.000 description 2
- 238000010798 ubiquitination Methods 0.000 description 2
- 230000034512 ubiquitination Effects 0.000 description 2
- 238000011144 upstream manufacturing Methods 0.000 description 2
- DRTQHJPVMGBUCF-UHFFFAOYSA-N uracil arabinoside Natural products OC1C(O)C(CO)OC1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-UHFFFAOYSA-N 0.000 description 2
- 229940045145 uridine Drugs 0.000 description 2
- 230000003612 virological effect Effects 0.000 description 2
- VLAFRQCSFRYCLC-FXQIFTODSA-N (2s)-2-[[(2s)-2-[[2-[[(2s)-2-aminopropanoyl]amino]acetyl]amino]-3-hydroxypropanoyl]amino]pentanedioic acid Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O VLAFRQCSFRYCLC-FXQIFTODSA-N 0.000 description 1
- LWPSUELGZBFZHK-BCTRXSSUSA-N (6z,9z)-octadeca-6,9-diene Chemical compound CCCCCCCC\C=C/C\C=C/CCCCC LWPSUELGZBFZHK-BCTRXSSUSA-N 0.000 description 1
- 125000006649 (C2-C20) alkynyl group Chemical group 0.000 description 1
- 102000040650 (ribonucleotides)n+m Human genes 0.000 description 1
- TZCPCKNHXULUIY-RGULYWFUSA-N 1,2-distearoyl-sn-glycero-3-phosphoserine Chemical compound CCCCCCCCCCCCCCCCCC(=O)OC[C@H](COP(O)(=O)OC[C@H](N)C(O)=O)OC(=O)CCCCCCCCCCCCCCCCC TZCPCKNHXULUIY-RGULYWFUSA-N 0.000 description 1
- NGOTYFZFWPHNSU-UHFFFAOYSA-N 1,3-dioxol-4-amine Chemical compound NC1=COCO1 NGOTYFZFWPHNSU-UHFFFAOYSA-N 0.000 description 1
- UVBYMVOUBXYSFV-XUTVFYLZSA-N 1-methylpseudouridine Chemical compound O=C1NC(=O)N(C)C=C1[C@H]1[C@H](O)[C@H](O)[C@@H](CO)O1 UVBYMVOUBXYSFV-XUTVFYLZSA-N 0.000 description 1
- IIZPXYDJLKNOIY-JXPKJXOSSA-N 1-palmitoyl-2-arachidonoyl-sn-glycero-3-phosphocholine Chemical compound CCCCCCCCCCCCCCCC(=O)OC[C@H](COP([O-])(=O)OCC[N+](C)(C)C)OC(=O)CCC\C=C/C\C=C/C\C=C/C\C=C/CCCCC IIZPXYDJLKNOIY-JXPKJXOSSA-N 0.000 description 1
- 238000001644 13C nuclear magnetic resonance spectroscopy Methods 0.000 description 1
- YKBGVTZYEHREMT-KVQBGUIXSA-N 2'-deoxyguanosine Chemical compound C1=NC=2C(=O)NC(N)=NC=2N1[C@H]1C[C@H](O)[C@@H](CO)O1 YKBGVTZYEHREMT-KVQBGUIXSA-N 0.000 description 1
- CKTSBUTUHBMZGZ-SHYZEUOFSA-N 2'‐deoxycytidine Chemical compound O=C1N=C(N)C=CN1[C@@H]1O[C@H](CO)[C@@H](O)C1 CKTSBUTUHBMZGZ-SHYZEUOFSA-N 0.000 description 1
- LDGWQMRUWMSZIU-LQDDAWAPSA-M 2,3-bis[(z)-octadec-9-enoxy]propyl-trimethylazanium;chloride Chemical compound [Cl-].CCCCCCCC\C=C/CCCCCCCCOCC(C[N+](C)(C)C)OCCCCCCCC\C=C/CCCCCCCC LDGWQMRUWMSZIU-LQDDAWAPSA-M 0.000 description 1
- KSXTUUUQYQYKCR-LQDDAWAPSA-M 2,3-bis[[(z)-octadec-9-enoyl]oxy]propyl-trimethylazanium;chloride Chemical compound [Cl-].CCCCCCCC\C=C/CCCCCCCC(=O)OCC(C[N+](C)(C)C)OC(=O)CCCCCCC\C=C/CCCCCCCC KSXTUUUQYQYKCR-LQDDAWAPSA-M 0.000 description 1
- NGNBDVOYPDDBFK-UHFFFAOYSA-N 2-[2,4-di(pentan-2-yl)phenoxy]acetyl chloride Chemical compound CCCC(C)C1=CC=C(OCC(Cl)=O)C(C(C)CCC)=C1 NGNBDVOYPDDBFK-UHFFFAOYSA-N 0.000 description 1
- PGYFLJKHWJVRMC-ZXRZDOCRSA-N 2-[4-[[(3s,8s,9s,10r,13r,14s,17r)-10,13-dimethyl-17-[(2r)-6-methylheptan-2-yl]-2,3,4,7,8,9,11,12,14,15,16,17-dodecahydro-1h-cyclopenta[a]phenanthren-3-yl]oxy]butoxy]-n,n-dimethyl-3-[(9z,12z)-octadeca-9,12-dienoxy]propan-1-amine Chemical compound C([C@@H]12)C[C@]3(C)[C@@H]([C@H](C)CCCC(C)C)CC[C@H]3[C@@H]1CC=C1[C@]2(C)CC[C@H](OCCCCOC(CN(C)C)COCCCCCCCC\C=C/C\C=C/CCCCC)C1 PGYFLJKHWJVRMC-ZXRZDOCRSA-N 0.000 description 1
- DQVAZKGVGKHQDS-UHFFFAOYSA-N 2-[[1-[2-[(2-amino-4-methylpentanoyl)amino]-4-methylpentanoyl]pyrrolidine-2-carbonyl]amino]-4-methylpentanoic acid Chemical compound CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(=O)NC(CC(C)C)C(O)=O DQVAZKGVGKHQDS-UHFFFAOYSA-N 0.000 description 1
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 1
- XQCZBXHVTFVIFE-UHFFFAOYSA-N 2-amino-4-hydroxypyrimidine Chemical compound NC1=NC=CC(O)=N1 XQCZBXHVTFVIFE-UHFFFAOYSA-N 0.000 description 1
- NMAKWCABHXLADB-UHFFFAOYSA-N 2-amino-n,n-dioctadecylacetamide Chemical compound CCCCCCCCCCCCCCCCCCN(C(=O)CN)CCCCCCCCCCCCCCCCCC NMAKWCABHXLADB-UHFFFAOYSA-N 0.000 description 1
- MWBWWFOAEOYUST-UHFFFAOYSA-N 2-aminopurine Chemical compound NC1=NC=C2N=CNC2=N1 MWBWWFOAEOYUST-UHFFFAOYSA-N 0.000 description 1
- HBJGQJWNMZDFKL-UHFFFAOYSA-N 2-chloro-7h-purin-6-amine Chemical compound NC1=NC(Cl)=NC2=C1NC=N2 HBJGQJWNMZDFKL-UHFFFAOYSA-N 0.000 description 1
- RHFUOMFWUGWKKO-XVFCMESISA-N 2-thiocytidine Chemical compound S=C1N=C(N)C=CN1[C@H]1[C@H](O)[C@H](O)[C@@H](CO)O1 RHFUOMFWUGWKKO-XVFCMESISA-N 0.000 description 1
- 108020005345 3' Untranslated Regions Proteins 0.000 description 1
- PYSGFFTXMUWEOT-UHFFFAOYSA-N 3-(dimethylamino)propan-1-ol Chemical compound CN(C)CCCO PYSGFFTXMUWEOT-UHFFFAOYSA-N 0.000 description 1
- IUNZBFGNHKODEU-UHFFFAOYSA-N 3-(dimethylamino)propanoyl chloride Chemical compound CN(C)CCC(Cl)=O IUNZBFGNHKODEU-UHFFFAOYSA-N 0.000 description 1
- QPIVLXFKBIEJTR-UHFFFAOYSA-N 3-[2-[2-[2-[bis[3-oxo-3-(undecylamino)propyl]amino]ethyl-[3-oxo-3-(undecylamino)propyl]amino]ethylamino]ethyl-[3-oxo-3-(undecylamino)propyl]amino]-n-undecylpropanamide Chemical compound CCCCCCCCCCCNC(=O)CCN(CCC(=O)NCCCCCCCCCCC)CCN(CCC(=O)NCCCCCCCCCCC)CCNCCN(CCC(=O)NCCCCCCCCCCC)CCC(=O)NCCCCCCCCCCC QPIVLXFKBIEJTR-UHFFFAOYSA-N 0.000 description 1
- 125000004080 3-carboxypropanoyl group Chemical group O=C([*])C([H])([H])C([H])([H])C(O[H])=O 0.000 description 1
- YGTHBVOCTPFJHQ-UHFFFAOYSA-N 4-(dimethylamino)butanamide Chemical compound CN(C)CCCC(N)=O YGTHBVOCTPFJHQ-UHFFFAOYSA-N 0.000 description 1
- OXOWTLDONRGYOT-UHFFFAOYSA-M 4-(dimethylamino)butanoate Chemical compound CN(C)CCCC([O-])=O OXOWTLDONRGYOT-UHFFFAOYSA-M 0.000 description 1
- JIMHYXZZCWVCMI-ZSOIEALJSA-N 4-[(z)-[4-oxo-2-sulfanylidene-3-[3-(trifluoromethyl)phenyl]-1,3-thiazolidin-5-ylidene]methyl]benzoic acid Chemical compound C1=CC(C(=O)O)=CC=C1\C=C/1C(=O)N(C=2C=C(C=CC=2)C(F)(F)F)C(=S)S\1 JIMHYXZZCWVCMI-ZSOIEALJSA-N 0.000 description 1
- LQLQRFGHAALLLE-UHFFFAOYSA-N 5-bromouracil Chemical compound BrC1=CNC(=O)NC1=O LQLQRFGHAALLLE-UHFFFAOYSA-N 0.000 description 1
- UJBCLAXPPIDQEE-UHFFFAOYSA-N 5-prop-1-ynyl-1h-pyrimidine-2,4-dione Chemical compound CC#CC1=CNC(=O)NC1=O UJBCLAXPPIDQEE-UHFFFAOYSA-N 0.000 description 1
- DCPSTSVLRXOYGS-UHFFFAOYSA-N 6-amino-1h-pyrimidine-2-thione Chemical compound NC1=CC=NC(S)=N1 DCPSTSVLRXOYGS-UHFFFAOYSA-N 0.000 description 1
- FHVDTGUDJYJELY-UHFFFAOYSA-N 6-{[2-carboxy-4,5-dihydroxy-6-(phosphanyloxy)oxan-3-yl]oxy}-4,5-dihydroxy-3-phosphanyloxane-2-carboxylic acid Chemical compound O1C(C(O)=O)C(P)C(O)C(O)C1OC1C(C(O)=O)OC(OP)C(O)C1O FHVDTGUDJYJELY-UHFFFAOYSA-N 0.000 description 1
- VKKXEIQIGGPMHT-UHFFFAOYSA-N 7h-purine-2,8-diamine Chemical compound NC1=NC=C2NC(N)=NC2=N1 VKKXEIQIGGPMHT-UHFFFAOYSA-N 0.000 description 1
- 102000007469 Actins Human genes 0.000 description 1
- 108010085238 Actins Proteins 0.000 description 1
- GFFGJBXGBJISGV-UHFFFAOYSA-N Adenine Chemical compound NC1=NC=NC2=C1N=CN2 GFFGJBXGBJISGV-UHFFFAOYSA-N 0.000 description 1
- 229930024421 Adenine Natural products 0.000 description 1
- CXRCVCURMBFFOL-FXQIFTODSA-N Ala-Ala-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CXRCVCURMBFFOL-FXQIFTODSA-N 0.000 description 1
- SKHCUBQVZJHOFM-NAKRPEOUSA-N Ala-Arg-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SKHCUBQVZJHOFM-NAKRPEOUSA-N 0.000 description 1
- TTXMOJWKNRJWQJ-FXQIFTODSA-N Ala-Arg-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N TTXMOJWKNRJWQJ-FXQIFTODSA-N 0.000 description 1
- XCVRVWZTXPCYJT-BIIVOSGPSA-N Ala-Asn-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N XCVRVWZTXPCYJT-BIIVOSGPSA-N 0.000 description 1
- MBWYUTNBYSSUIQ-HERUPUMHSA-N Ala-Asn-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N MBWYUTNBYSSUIQ-HERUPUMHSA-N 0.000 description 1
- MCKSLROAGSDNFC-ACZMJKKPSA-N Ala-Asp-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MCKSLROAGSDNFC-ACZMJKKPSA-N 0.000 description 1
- WDIYWDJLXOCGRW-ACZMJKKPSA-N Ala-Asp-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WDIYWDJLXOCGRW-ACZMJKKPSA-N 0.000 description 1
- YSMPVONNIWLJML-FXQIFTODSA-N Ala-Asp-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(O)=O YSMPVONNIWLJML-FXQIFTODSA-N 0.000 description 1
- XYTNPQNAZREREP-XQXXSGGOSA-N Ala-Glu-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XYTNPQNAZREREP-XQXXSGGOSA-N 0.000 description 1
- BLIMFWGRQKRCGT-YUMQZZPRSA-N Ala-Gly-Lys Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN BLIMFWGRQKRCGT-YUMQZZPRSA-N 0.000 description 1
- IFKQPMZRDQZSHI-GHCJXIJMSA-N Ala-Ile-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O IFKQPMZRDQZSHI-GHCJXIJMSA-N 0.000 description 1
- TZDNWXDLYFIFPT-BJDJZHNGSA-N Ala-Ile-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O TZDNWXDLYFIFPT-BJDJZHNGSA-N 0.000 description 1
- HHRAXZAYZFFRAM-CIUDSAMLSA-N Ala-Leu-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O HHRAXZAYZFFRAM-CIUDSAMLSA-N 0.000 description 1
- SUMYEVXWCAYLLJ-GUBZILKMSA-N Ala-Leu-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O SUMYEVXWCAYLLJ-GUBZILKMSA-N 0.000 description 1
- AWZKCUCQJNTBAD-SRVKXCTJSA-N Ala-Leu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN AWZKCUCQJNTBAD-SRVKXCTJSA-N 0.000 description 1
- QUIGLPSHIFPEOV-CIUDSAMLSA-N Ala-Lys-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O QUIGLPSHIFPEOV-CIUDSAMLSA-N 0.000 description 1
- KQESEZXHYOUIIM-CQDKDKBSSA-N Ala-Lys-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KQESEZXHYOUIIM-CQDKDKBSSA-N 0.000 description 1
- NLOMBWNGESDVJU-GUBZILKMSA-N Ala-Met-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NLOMBWNGESDVJU-GUBZILKMSA-N 0.000 description 1
- RAAWHFXHAACDFT-FXQIFTODSA-N Ala-Met-Asn Chemical compound CSCC[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CC(N)=O)C(O)=O RAAWHFXHAACDFT-FXQIFTODSA-N 0.000 description 1
- DHBKYZYFEXXUAK-ONGXEEELSA-N Ala-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 DHBKYZYFEXXUAK-ONGXEEELSA-N 0.000 description 1
- MAZZQZWCCYJQGZ-GUBZILKMSA-N Ala-Pro-Arg Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MAZZQZWCCYJQGZ-GUBZILKMSA-N 0.000 description 1
- FEGOCLZUJUFCHP-CIUDSAMLSA-N Ala-Pro-Gln Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O FEGOCLZUJUFCHP-CIUDSAMLSA-N 0.000 description 1
- DCVYRWFAMZFSDA-ZLUOBGJFSA-N Ala-Ser-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DCVYRWFAMZFSDA-ZLUOBGJFSA-N 0.000 description 1
- VJVQKGYHIZPSNS-FXQIFTODSA-N Ala-Ser-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N VJVQKGYHIZPSNS-FXQIFTODSA-N 0.000 description 1
- SYIFFFHSXBNPMC-UWJYBYFXSA-N Ala-Ser-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N SYIFFFHSXBNPMC-UWJYBYFXSA-N 0.000 description 1
- XMIAMUXIMWREBJ-HERUPUMHSA-N Ala-Trp-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(=O)N)C(=O)O)N XMIAMUXIMWREBJ-HERUPUMHSA-N 0.000 description 1
- RIPMDCIXRYWXSH-KNXALSJPSA-N Ala-Trp-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N3CCC[C@@H]3C(=O)O)N RIPMDCIXRYWXSH-KNXALSJPSA-N 0.000 description 1
- AOAKQKVICDWCLB-UWJYBYFXSA-N Ala-Tyr-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N AOAKQKVICDWCLB-UWJYBYFXSA-N 0.000 description 1
- GCTANJIJJROSLH-GVARAGBVSA-N Ala-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C)N GCTANJIJJROSLH-GVARAGBVSA-N 0.000 description 1
- XSLGWYYNOSUMRM-ZKWXMUAHSA-N Ala-Val-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XSLGWYYNOSUMRM-ZKWXMUAHSA-N 0.000 description 1
- LYILPUNCKACNGF-NAKRPEOUSA-N Ala-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C)N LYILPUNCKACNGF-NAKRPEOUSA-N 0.000 description 1
- 108010088751 Albumins Proteins 0.000 description 1
- 102000009027 Albumins Human genes 0.000 description 1
- 235000019489 Almond oil Nutrition 0.000 description 1
- 108091029845 Aminoallyl nucleotide Proteins 0.000 description 1
- 108091023037 Aptamer Proteins 0.000 description 1
- VWVPYNGMOCSSGK-GUBZILKMSA-N Arg-Arg-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O VWVPYNGMOCSSGK-GUBZILKMSA-N 0.000 description 1
- KJGNDQCYBNBXDA-GUBZILKMSA-N Arg-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N)CN=C(N)N KJGNDQCYBNBXDA-GUBZILKMSA-N 0.000 description 1
- MUXONAMCEUBVGA-DCAQKATOSA-N Arg-Arg-Gln Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(N)=O)C(O)=O MUXONAMCEUBVGA-DCAQKATOSA-N 0.000 description 1
- IIABBYGHLYWVOS-FXQIFTODSA-N Arg-Asn-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O IIABBYGHLYWVOS-FXQIFTODSA-N 0.000 description 1
- JSHVMZANPXCDTL-GMOBBJLQSA-N Arg-Asp-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JSHVMZANPXCDTL-GMOBBJLQSA-N 0.000 description 1
- FEZJJKXNPSEYEV-CIUDSAMLSA-N Arg-Gln-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O FEZJJKXNPSEYEV-CIUDSAMLSA-N 0.000 description 1
- YHQGEARSFILVHL-HJGDQZAQSA-N Arg-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N)O YHQGEARSFILVHL-HJGDQZAQSA-N 0.000 description 1
- JQFJNGVSGOUQDH-XIRDDKMYSA-N Arg-Glu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CCCN=C(N)N)N)C(O)=O)=CNC2=C1 JQFJNGVSGOUQDH-XIRDDKMYSA-N 0.000 description 1
- AGVNTAUPLWIQEN-ZPFDUUQYSA-N Arg-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N AGVNTAUPLWIQEN-ZPFDUUQYSA-N 0.000 description 1
- FFEUXEAKYRCACT-PEDHHIEDSA-N Arg-Ile-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCNC(N)=N)[C@@H](C)CC)C(O)=O FFEUXEAKYRCACT-PEDHHIEDSA-N 0.000 description 1
- GXXWTNKNFFKTJB-NAKRPEOUSA-N Arg-Ile-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O GXXWTNKNFFKTJB-NAKRPEOUSA-N 0.000 description 1
- COXMUHNBYCVVRG-DCAQKATOSA-N Arg-Leu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O COXMUHNBYCVVRG-DCAQKATOSA-N 0.000 description 1
- CLICCYPMVFGUOF-IHRRRGAJSA-N Arg-Lys-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O CLICCYPMVFGUOF-IHRRRGAJSA-N 0.000 description 1
- RIQBRKVTFBWEDY-RHYQMDGZSA-N Arg-Lys-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RIQBRKVTFBWEDY-RHYQMDGZSA-N 0.000 description 1
- DPLFNLDACGGBAK-KKUMJFAQSA-N Arg-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N DPLFNLDACGGBAK-KKUMJFAQSA-N 0.000 description 1
- AOHKLEBWKMKITA-IHRRRGAJSA-N Arg-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N AOHKLEBWKMKITA-IHRRRGAJSA-N 0.000 description 1
- VENMDXUVHSKEIN-GUBZILKMSA-N Arg-Ser-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O VENMDXUVHSKEIN-GUBZILKMSA-N 0.000 description 1
- ICRHGPYYXMWHIE-LPEHRKFASA-N Arg-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ICRHGPYYXMWHIE-LPEHRKFASA-N 0.000 description 1
- XOZYYXMHMIEJET-XIRDDKMYSA-N Arg-Trp-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(O)=O XOZYYXMHMIEJET-XIRDDKMYSA-N 0.000 description 1
- QMQZYILAWUOLPV-JYJNAYRXSA-N Arg-Tyr-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)CC1=CC=C(O)C=C1 QMQZYILAWUOLPV-JYJNAYRXSA-N 0.000 description 1
- ISVACHFCVRKIDG-SRVKXCTJSA-N Arg-Val-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O ISVACHFCVRKIDG-SRVKXCTJSA-N 0.000 description 1
- 208000002109 Argyria Diseases 0.000 description 1
- XYOVHPDDWCEUDY-CIUDSAMLSA-N Asn-Ala-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O XYOVHPDDWCEUDY-CIUDSAMLSA-N 0.000 description 1
- HAJWYALLJIATCX-FXQIFTODSA-N Asn-Asn-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N)CN=C(N)N HAJWYALLJIATCX-FXQIFTODSA-N 0.000 description 1
- DAPLJWATMAXPPZ-CIUDSAMLSA-N Asn-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(N)=O DAPLJWATMAXPPZ-CIUDSAMLSA-N 0.000 description 1
- KXFCBAHYSLJCCY-ZLUOBGJFSA-N Asn-Asn-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O KXFCBAHYSLJCCY-ZLUOBGJFSA-N 0.000 description 1
- VJTWLBMESLDOMK-WDSKDSINSA-N Asn-Gln-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O VJTWLBMESLDOMK-WDSKDSINSA-N 0.000 description 1
- PPMTUXJSQDNUDE-CIUDSAMLSA-N Asn-Glu-Arg Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PPMTUXJSQDNUDE-CIUDSAMLSA-N 0.000 description 1
- KLKHFFMNGWULBN-VKHMYHEASA-N Asn-Gly Chemical compound NC(=O)C[C@H](N)C(=O)NCC(O)=O KLKHFFMNGWULBN-VKHMYHEASA-N 0.000 description 1
- CTQIOCMSIJATNX-WHFBIAKZSA-N Asn-Gly-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O CTQIOCMSIJATNX-WHFBIAKZSA-N 0.000 description 1
- IICZCLFBILYRCU-WHFBIAKZSA-N Asn-Gly-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O IICZCLFBILYRCU-WHFBIAKZSA-N 0.000 description 1
- SPCONPVIDFMDJI-QSFUFRPTSA-N Asn-Ile-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O SPCONPVIDFMDJI-QSFUFRPTSA-N 0.000 description 1
- BXUHCIXDSWRSBS-CIUDSAMLSA-N Asn-Leu-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BXUHCIXDSWRSBS-CIUDSAMLSA-N 0.000 description 1
- FHETWELNCBMRMG-HJGDQZAQSA-N Asn-Leu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FHETWELNCBMRMG-HJGDQZAQSA-N 0.000 description 1
- NUCUBYIUPVYGPP-XIRDDKMYSA-N Asn-Leu-Trp Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CC(N)=O)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O NUCUBYIUPVYGPP-XIRDDKMYSA-N 0.000 description 1
- FBODFHMLALOPHP-GUBZILKMSA-N Asn-Lys-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O FBODFHMLALOPHP-GUBZILKMSA-N 0.000 description 1
- JWKDQOORUCYUIW-ZPFDUUQYSA-N Asn-Lys-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JWKDQOORUCYUIW-ZPFDUUQYSA-N 0.000 description 1
- WXVGISRWSYGEDK-KKUMJFAQSA-N Asn-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)N)N WXVGISRWSYGEDK-KKUMJFAQSA-N 0.000 description 1
- RVHGJNGNKGDCPX-KKUMJFAQSA-N Asn-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N RVHGJNGNKGDCPX-KKUMJFAQSA-N 0.000 description 1
- UYCPJVYQYARFGB-YDHLFZDLSA-N Asn-Phe-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O UYCPJVYQYARFGB-YDHLFZDLSA-N 0.000 description 1
- DOURAOODTFJRIC-CIUDSAMLSA-N Asn-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N DOURAOODTFJRIC-CIUDSAMLSA-N 0.000 description 1
- QUMKPKWYDVMGNT-NUMRIWBASA-N Asn-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O QUMKPKWYDVMGNT-NUMRIWBASA-N 0.000 description 1
- WSWYMRLTJVKRCE-ZLUOBGJFSA-N Asp-Ala-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O WSWYMRLTJVKRCE-ZLUOBGJFSA-N 0.000 description 1
- NJIKKGUVGUBICV-ZLUOBGJFSA-N Asp-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O NJIKKGUVGUBICV-ZLUOBGJFSA-N 0.000 description 1
- KVMPVNGOKHTUHZ-GCJQMDKQSA-N Asp-Ala-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KVMPVNGOKHTUHZ-GCJQMDKQSA-N 0.000 description 1
- SOYOSFXLXYZNRG-CIUDSAMLSA-N Asp-Arg-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O SOYOSFXLXYZNRG-CIUDSAMLSA-N 0.000 description 1
- WSOKZUVWBXVJHX-CIUDSAMLSA-N Asp-Arg-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O WSOKZUVWBXVJHX-CIUDSAMLSA-N 0.000 description 1
- XACXDSRQIXRMNS-OLHMAJIHSA-N Asp-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N)O XACXDSRQIXRMNS-OLHMAJIHSA-N 0.000 description 1
- TVVYVAUGRHNTGT-UGYAYLCHSA-N Asp-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O TVVYVAUGRHNTGT-UGYAYLCHSA-N 0.000 description 1
- RYKWOUUZJFSJOH-FXQIFTODSA-N Asp-Gln-Glu Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N RYKWOUUZJFSJOH-FXQIFTODSA-N 0.000 description 1
- VAWNQIGQPUOPQW-ACZMJKKPSA-N Asp-Glu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VAWNQIGQPUOPQW-ACZMJKKPSA-N 0.000 description 1
- RRKCPMGSRIDLNC-AVGNSLFASA-N Asp-Glu-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RRKCPMGSRIDLNC-AVGNSLFASA-N 0.000 description 1
- YDJVIBMKAMQPPP-LAEOZQHASA-N Asp-Glu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O YDJVIBMKAMQPPP-LAEOZQHASA-N 0.000 description 1
- WSGVTKZFVJSJOG-RCOVLWMOSA-N Asp-Gly-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O WSGVTKZFVJSJOG-RCOVLWMOSA-N 0.000 description 1
- KLYPOCBLKMPBIQ-GHCJXIJMSA-N Asp-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N KLYPOCBLKMPBIQ-GHCJXIJMSA-N 0.000 description 1
- RTXQQDVBACBSCW-CFMVVWHZSA-N Asp-Ile-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RTXQQDVBACBSCW-CFMVVWHZSA-N 0.000 description 1
- KYQNAIMCTRZLNP-QSFUFRPTSA-N Asp-Ile-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O KYQNAIMCTRZLNP-QSFUFRPTSA-N 0.000 description 1
- AYFVRYXNDHBECD-YUMQZZPRSA-N Asp-Leu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O AYFVRYXNDHBECD-YUMQZZPRSA-N 0.000 description 1
- IMGLJMRIAFKUPZ-FXQIFTODSA-N Asp-Met-Cys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N IMGLJMRIAFKUPZ-FXQIFTODSA-N 0.000 description 1
- JXGJJQJHXHXJQF-CIUDSAMLSA-N Asp-Met-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O JXGJJQJHXHXJQF-CIUDSAMLSA-N 0.000 description 1
- WOPJVEMFXYHZEE-SRVKXCTJSA-N Asp-Phe-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O WOPJVEMFXYHZEE-SRVKXCTJSA-N 0.000 description 1
- GPPIDDWYKJPRES-YDHLFZDLSA-N Asp-Phe-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O GPPIDDWYKJPRES-YDHLFZDLSA-N 0.000 description 1
- KESWRFKUZRUTAH-FXQIFTODSA-N Asp-Pro-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O KESWRFKUZRUTAH-FXQIFTODSA-N 0.000 description 1
- CUQDCPXNZPDYFQ-ZLUOBGJFSA-N Asp-Ser-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O CUQDCPXNZPDYFQ-ZLUOBGJFSA-N 0.000 description 1
- BRRPVTUFESPTCP-ACZMJKKPSA-N Asp-Ser-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O BRRPVTUFESPTCP-ACZMJKKPSA-N 0.000 description 1
- QSFHZPQUAAQHAQ-CIUDSAMLSA-N Asp-Ser-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O QSFHZPQUAAQHAQ-CIUDSAMLSA-N 0.000 description 1
- MJJIHRWNWSQTOI-VEVYYDQMSA-N Asp-Thr-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O MJJIHRWNWSQTOI-VEVYYDQMSA-N 0.000 description 1
- OTKUAVXGMREHRX-CFMVVWHZSA-N Asp-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(O)=O)CC1=CC=C(O)C=C1 OTKUAVXGMREHRX-CFMVVWHZSA-N 0.000 description 1
- NWAHPBGBDIFUFD-KKUMJFAQSA-N Asp-Tyr-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O NWAHPBGBDIFUFD-KKUMJFAQSA-N 0.000 description 1
- GFYOIYJJMSHLSN-QXEWZRGKSA-N Asp-Val-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O GFYOIYJJMSHLSN-QXEWZRGKSA-N 0.000 description 1
- 241000894006 Bacteria Species 0.000 description 1
- DWRXFEITVBNRMK-UHFFFAOYSA-N Beta-D-1-Arabinofuranosylthymine Natural products O=C1NC(=O)C(C)=CN1C1C(O)C(O)C(CO)O1 DWRXFEITVBNRMK-UHFFFAOYSA-N 0.000 description 1
- 241000283690 Bos taurus Species 0.000 description 1
- OGKCAZDGZCTSHD-WRBBJXAJSA-N C(CCCCCCC\C=C/CCCCCCCC)(=O)C1(OCC(O1)CCN(C)C)C(CCCCCCC\C=C/CCCCCCCC)=O Chemical compound C(CCCCCCC\C=C/CCCCCCCC)(=O)C1(OCC(O1)CCN(C)C)C(CCCCCCC\C=C/CCCCCCCC)=O OGKCAZDGZCTSHD-WRBBJXAJSA-N 0.000 description 1
- 125000005865 C2-C10alkynyl group Chemical group 0.000 description 1
- 125000003358 C2-C20 alkenyl group Chemical group 0.000 description 1
- 125000004406 C3-C8 cycloalkylene group Chemical group 0.000 description 1
- 108091033409 CRISPR Proteins 0.000 description 1
- 102000000584 Calmodulin Human genes 0.000 description 1
- 108010041952 Calmodulin Proteins 0.000 description 1
- 241000282461 Canis lupus Species 0.000 description 1
- 108090000565 Capsid Proteins Proteins 0.000 description 1
- 102100023321 Ceruloplasmin Human genes 0.000 description 1
- 229920001661 Chitosan Polymers 0.000 description 1
- 108020004638 Circular DNA Proteins 0.000 description 1
- 102000008186 Collagen Human genes 0.000 description 1
- 108010035532 Collagen Proteins 0.000 description 1
- 206010010774 Constipation Diseases 0.000 description 1
- 229920002261 Corn starch Polymers 0.000 description 1
- 206010011224 Cough Diseases 0.000 description 1
- 108091029523 CpG island Proteins 0.000 description 1
- 108091029430 CpG site Proteins 0.000 description 1
- 229920000858 Cyclodextrin Polymers 0.000 description 1
- YZFCGHIBLBDZDA-ZLUOBGJFSA-N Cys-Asp-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O YZFCGHIBLBDZDA-ZLUOBGJFSA-N 0.000 description 1
- KABHAOSDMIYXTR-GUBZILKMSA-N Cys-Glu-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CS)N KABHAOSDMIYXTR-GUBZILKMSA-N 0.000 description 1
- PDRMRVHPAQKTLT-NAKRPEOUSA-N Cys-Ile-Val Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O PDRMRVHPAQKTLT-NAKRPEOUSA-N 0.000 description 1
- OZSBRCONEMXYOJ-AVGNSLFASA-N Cys-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CS)N OZSBRCONEMXYOJ-AVGNSLFASA-N 0.000 description 1
- WKKKNGNJDGATNS-QEJZJMRPSA-N Cys-Trp-Glu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(O)=O WKKKNGNJDGATNS-QEJZJMRPSA-N 0.000 description 1
- NGOIQDYZMIKCOK-NAKRPEOUSA-N Cys-Val-Ile Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NGOIQDYZMIKCOK-NAKRPEOUSA-N 0.000 description 1
- FBPFZTCFMRRESA-FSIIMWSLSA-N D-Glucitol Natural products OC[C@H](O)[C@H](O)[C@@H](O)[C@H](O)CO FBPFZTCFMRRESA-FSIIMWSLSA-N 0.000 description 1
- IGXWBGJHJZYPQS-SSDOTTSWSA-N D-Luciferin Chemical compound OC(=O)[C@H]1CSC(C=2SC3=CC=C(O)C=C3N=2)=N1 IGXWBGJHJZYPQS-SSDOTTSWSA-N 0.000 description 1
- XULFJDKZVHTRLG-JDVCJPALSA-N DOSPA trifluoroacetate Chemical compound [O-]C(=O)C(F)(F)F.CCCCCCCC\C=C/CCCCCCCCOCC(C[N+](C)(C)CCNC(=O)C(CCCNCCCN)NCCCN)OCCCCCCCC\C=C/CCCCCCCC XULFJDKZVHTRLG-JDVCJPALSA-N 0.000 description 1
- CYCGRDQQIOGCKX-UHFFFAOYSA-N Dehydro-luciferin Natural products OC(=O)C1=CSC(C=2SC3=CC(O)=CC=C3N=2)=N1 CYCGRDQQIOGCKX-UHFFFAOYSA-N 0.000 description 1
- SUZLHDUTVMZSEV-UHFFFAOYSA-N Deoxycoleonol Natural products C12C(=O)CC(C)(C=C)OC2(C)C(OC(=O)C)C(O)C2C1(C)C(O)CCC2(C)C SUZLHDUTVMZSEV-UHFFFAOYSA-N 0.000 description 1
- CKTSBUTUHBMZGZ-UHFFFAOYSA-N Deoxycytidine Natural products O=C1N=C(N)C=CN1C1OC(CO)C(O)C1 CKTSBUTUHBMZGZ-UHFFFAOYSA-N 0.000 description 1
- 229920002307 Dextran Polymers 0.000 description 1
- LTMHDMANZUZIPE-AMTYYWEZSA-N Digoxin Natural products O([C@H]1[C@H](C)O[C@H](O[C@@H]2C[C@@H]3[C@@](C)([C@@H]4[C@H]([C@]5(O)[C@](C)([C@H](O)C4)[C@H](C4=CC(=O)OC4)CC5)CC3)CC2)C[C@@H]1O)[C@H]1O[C@H](C)[C@@H](O[C@H]2O[C@@H](C)[C@H](O)[C@@H](O)C2)[C@@H](O)C1 LTMHDMANZUZIPE-AMTYYWEZSA-N 0.000 description 1
- BWGNESOTFCXPMA-UHFFFAOYSA-N Dihydrogen disulfide Chemical compound SS BWGNESOTFCXPMA-UHFFFAOYSA-N 0.000 description 1
- 206010061818 Disease progression Diseases 0.000 description 1
- CYQFCXCEBYINGO-DLBZAZTESA-N Dronabinol Natural products C1=C(C)CC[C@H]2C(C)(C)OC3=CC(CCCCC)=CC(O)=C3[C@H]21 CYQFCXCEBYINGO-DLBZAZTESA-N 0.000 description 1
- 208000000059 Dyspnea Diseases 0.000 description 1
- 206010013975 Dyspnoeas Diseases 0.000 description 1
- 238000002965 ELISA Methods 0.000 description 1
- 108010042407 Endonucleases Proteins 0.000 description 1
- 102000004533 Endonucleases Human genes 0.000 description 1
- 244000148064 Enicostema verticillatum Species 0.000 description 1
- 241000283073 Equus caballus Species 0.000 description 1
- 241000206602 Eukaryota Species 0.000 description 1
- 108060002716 Exonuclease Proteins 0.000 description 1
- XZWYTXMRWQJBGX-VXBMVYAYSA-N FLAG peptide Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(O)=O)CC1=CC=C(O)C=C1 XZWYTXMRWQJBGX-VXBMVYAYSA-N 0.000 description 1
- 241000242711 Fasciola hepatica Species 0.000 description 1
- 241000282326 Felis catus Species 0.000 description 1
- BJGNCJDXODQBOB-UHFFFAOYSA-N Fivefly Luciferin Natural products OC(=O)C1CSC(C=2SC3=CC(O)=CC=C3N=2)=N1 BJGNCJDXODQBOB-UHFFFAOYSA-N 0.000 description 1
- 108010010803 Gelatin Proteins 0.000 description 1
- INKFLNZBTSNFON-CIUDSAMLSA-N Gln-Ala-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O INKFLNZBTSNFON-CIUDSAMLSA-N 0.000 description 1
- KZKBJEUWNMQTLV-XDTLVQLUSA-N Gln-Ala-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KZKBJEUWNMQTLV-XDTLVQLUSA-N 0.000 description 1
- RGXXLQWXBFNXTG-CIUDSAMLSA-N Gln-Arg-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O RGXXLQWXBFNXTG-CIUDSAMLSA-N 0.000 description 1
- WOACHWLUOFZLGJ-GUBZILKMSA-N Gln-Arg-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O WOACHWLUOFZLGJ-GUBZILKMSA-N 0.000 description 1
- PRBLYKYHAJEABA-SRVKXCTJSA-N Gln-Arg-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O PRBLYKYHAJEABA-SRVKXCTJSA-N 0.000 description 1
- WMOMPXKOKASNBK-PEFMBERDSA-N Gln-Asn-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WMOMPXKOKASNBK-PEFMBERDSA-N 0.000 description 1
- AAOBFSKXAVIORT-GUBZILKMSA-N Gln-Asn-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O AAOBFSKXAVIORT-GUBZILKMSA-N 0.000 description 1
- WQWMZOIPXWSZNE-WDSKDSINSA-N Gln-Asp-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O WQWMZOIPXWSZNE-WDSKDSINSA-N 0.000 description 1
- CRRFJBGUGNNOCS-PEFMBERDSA-N Gln-Asp-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CRRFJBGUGNNOCS-PEFMBERDSA-N 0.000 description 1
- KZEUVLLVULIPNX-GUBZILKMSA-N Gln-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N KZEUVLLVULIPNX-GUBZILKMSA-N 0.000 description 1
- JKPGHIQCHIIRMS-AVGNSLFASA-N Gln-Asp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N JKPGHIQCHIIRMS-AVGNSLFASA-N 0.000 description 1
- DRDSQGHKTLSNEA-GLLZPBPUSA-N Gln-Glu-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DRDSQGHKTLSNEA-GLLZPBPUSA-N 0.000 description 1
- JHPFPROFOAJRFN-IHRRRGAJSA-N Gln-Glu-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N)O JHPFPROFOAJRFN-IHRRRGAJSA-N 0.000 description 1
- MFJAPSYJQJCQDN-BQBZGAKWSA-N Gln-Gly-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O MFJAPSYJQJCQDN-BQBZGAKWSA-N 0.000 description 1
- KKCJHBXMYYVWMX-KQXIARHKSA-N Gln-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N KKCJHBXMYYVWMX-KQXIARHKSA-N 0.000 description 1
- XFAUJGNLHIGXET-AVGNSLFASA-N Gln-Leu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XFAUJGNLHIGXET-AVGNSLFASA-N 0.000 description 1
- IULKWYSYZSURJK-AVGNSLFASA-N Gln-Leu-Lys Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O IULKWYSYZSURJK-AVGNSLFASA-N 0.000 description 1
- YPMDZWPZFOZYFG-GUBZILKMSA-N Gln-Leu-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YPMDZWPZFOZYFG-GUBZILKMSA-N 0.000 description 1
- QKWBEMCLYTYBNI-GVXVVHGQSA-N Gln-Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(N)=O QKWBEMCLYTYBNI-GVXVVHGQSA-N 0.000 description 1
- LUGUNEGJNDEBLU-DCAQKATOSA-N Gln-Met-Arg Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N LUGUNEGJNDEBLU-DCAQKATOSA-N 0.000 description 1
- LVRKAFPPFJRIOF-GARJFASQSA-N Gln-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N LVRKAFPPFJRIOF-GARJFASQSA-N 0.000 description 1
- UESYBOXFJWJVSB-AVGNSLFASA-N Gln-Phe-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O UESYBOXFJWJVSB-AVGNSLFASA-N 0.000 description 1
- DOQUICBEISTQHE-CIUDSAMLSA-N Gln-Pro-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O DOQUICBEISTQHE-CIUDSAMLSA-N 0.000 description 1
- OKARHJKJTKFQBM-ACZMJKKPSA-N Gln-Ser-Asn Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OKARHJKJTKFQBM-ACZMJKKPSA-N 0.000 description 1
- ZGHMRONFHDVXEF-AVGNSLFASA-N Gln-Ser-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZGHMRONFHDVXEF-AVGNSLFASA-N 0.000 description 1
- ININBLZFFVOQIO-JHEQGTHGSA-N Gln-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N)O ININBLZFFVOQIO-JHEQGTHGSA-N 0.000 description 1
- ARYKRXHBIPLULY-XKBZYTNZSA-N Gln-Thr-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ARYKRXHBIPLULY-XKBZYTNZSA-N 0.000 description 1
- XMWNHGKDDIFXQJ-NWLDYVSISA-N Gln-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O XMWNHGKDDIFXQJ-NWLDYVSISA-N 0.000 description 1
- RBSKVTZUFMIWFU-XEGUGMAKSA-N Gln-Trp-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(O)=O RBSKVTZUFMIWFU-XEGUGMAKSA-N 0.000 description 1
- YMCPEHDGTRUOHO-SXNHZJKMSA-N Gln-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCC(=O)N)N YMCPEHDGTRUOHO-SXNHZJKMSA-N 0.000 description 1
- CVRUVYDNRPSKBM-QEJZJMRPSA-N Gln-Trp-Ser Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)N)N CVRUVYDNRPSKBM-QEJZJMRPSA-N 0.000 description 1
- OGMQXTXGLDNBSS-FXQIFTODSA-N Glu-Ala-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O OGMQXTXGLDNBSS-FXQIFTODSA-N 0.000 description 1
- RLZBLVSJDFHDBL-KBIXCLLPSA-N Glu-Ala-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RLZBLVSJDFHDBL-KBIXCLLPSA-N 0.000 description 1
- WOMUDRVDJMHTCV-DCAQKATOSA-N Glu-Arg-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WOMUDRVDJMHTCV-DCAQKATOSA-N 0.000 description 1
- NLKVNZUFDPWPNL-YUMQZZPRSA-N Glu-Arg-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O NLKVNZUFDPWPNL-YUMQZZPRSA-N 0.000 description 1
- VTTSANCGJWLPNC-ZPFDUUQYSA-N Glu-Arg-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VTTSANCGJWLPNC-ZPFDUUQYSA-N 0.000 description 1
- KKCUFHUTMKQQCF-SRVKXCTJSA-N Glu-Arg-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O KKCUFHUTMKQQCF-SRVKXCTJSA-N 0.000 description 1
- WOSRKEJQESVHGA-CIUDSAMLSA-N Glu-Arg-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O WOSRKEJQESVHGA-CIUDSAMLSA-N 0.000 description 1
- AFODTOLGSZQDSL-PEFMBERDSA-N Glu-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N AFODTOLGSZQDSL-PEFMBERDSA-N 0.000 description 1
- VAIWPXWHWAPYDF-FXQIFTODSA-N Glu-Asp-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O VAIWPXWHWAPYDF-FXQIFTODSA-N 0.000 description 1
- PXHABOCPJVTGEK-BQBZGAKWSA-N Glu-Gln-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O PXHABOCPJVTGEK-BQBZGAKWSA-N 0.000 description 1
- MIQCYAJSDGNCNK-BPUTZDHNSA-N Glu-Gln-Trp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O MIQCYAJSDGNCNK-BPUTZDHNSA-N 0.000 description 1
- NKLRYVLERDYDBI-FXQIFTODSA-N Glu-Glu-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NKLRYVLERDYDBI-FXQIFTODSA-N 0.000 description 1
- LGYZYFFDELZWRS-DCAQKATOSA-N Glu-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O LGYZYFFDELZWRS-DCAQKATOSA-N 0.000 description 1
- KASDBWKLWJKTLJ-GUBZILKMSA-N Glu-Glu-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O KASDBWKLWJKTLJ-GUBZILKMSA-N 0.000 description 1
- PHONAZGUEGIOEM-GLLZPBPUSA-N Glu-Glu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PHONAZGUEGIOEM-GLLZPBPUSA-N 0.000 description 1
- PXXGVUVQWQGGIG-YUMQZZPRSA-N Glu-Gly-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N PXXGVUVQWQGGIG-YUMQZZPRSA-N 0.000 description 1
- OGNJZUXUTPQVBR-BQBZGAKWSA-N Glu-Gly-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OGNJZUXUTPQVBR-BQBZGAKWSA-N 0.000 description 1
- CUXJIASLBRJOFV-LAEOZQHASA-N Glu-Gly-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CUXJIASLBRJOFV-LAEOZQHASA-N 0.000 description 1
- LRPXYSGPOBVBEH-IUCAKERBSA-N Glu-Gly-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O LRPXYSGPOBVBEH-IUCAKERBSA-N 0.000 description 1
- KRGZZKWSBGPLKL-IUCAKERBSA-N Glu-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)O)N KRGZZKWSBGPLKL-IUCAKERBSA-N 0.000 description 1
- RAUDKMVXNOWDLS-WDSKDSINSA-N Glu-Gly-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O RAUDKMVXNOWDLS-WDSKDSINSA-N 0.000 description 1
- JGHNIWVNCAOVRO-DCAQKATOSA-N Glu-His-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O JGHNIWVNCAOVRO-DCAQKATOSA-N 0.000 description 1
- ZPASCJBSSCRWMC-GVXVVHGQSA-N Glu-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N ZPASCJBSSCRWMC-GVXVVHGQSA-N 0.000 description 1
- CXRWMMRLEMVSEH-PEFMBERDSA-N Glu-Ile-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O CXRWMMRLEMVSEH-PEFMBERDSA-N 0.000 description 1
- GRHXUHCFENOCOS-ZPFDUUQYSA-N Glu-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCC(=O)O)N GRHXUHCFENOCOS-ZPFDUUQYSA-N 0.000 description 1
- PJBVXVBTTFZPHJ-GUBZILKMSA-N Glu-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)O)N PJBVXVBTTFZPHJ-GUBZILKMSA-N 0.000 description 1
- IRXNJYPKBVERCW-DCAQKATOSA-N Glu-Leu-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IRXNJYPKBVERCW-DCAQKATOSA-N 0.000 description 1
- IVGJYOOGJLFKQE-AVGNSLFASA-N Glu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N IVGJYOOGJLFKQE-AVGNSLFASA-N 0.000 description 1
- WNRZUESNGGDCJX-JYJNAYRXSA-N Glu-Leu-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WNRZUESNGGDCJX-JYJNAYRXSA-N 0.000 description 1
- FBEJIDRSQCGFJI-GUBZILKMSA-N Glu-Leu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FBEJIDRSQCGFJI-GUBZILKMSA-N 0.000 description 1
- SJJHXJDSNQJMMW-SRVKXCTJSA-N Glu-Lys-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O SJJHXJDSNQJMMW-SRVKXCTJSA-N 0.000 description 1
- CUPSDFQZTVVTSK-GUBZILKMSA-N Glu-Lys-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(O)=O CUPSDFQZTVVTSK-GUBZILKMSA-N 0.000 description 1
- BCYGDJXHAGZNPQ-DCAQKATOSA-N Glu-Lys-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O BCYGDJXHAGZNPQ-DCAQKATOSA-N 0.000 description 1
- ZGEJRLJEAMPEDV-SRVKXCTJSA-N Glu-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)O)N ZGEJRLJEAMPEDV-SRVKXCTJSA-N 0.000 description 1
- YRMZCZIRHYCNHX-RYUDHWBXSA-N Glu-Phe-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O YRMZCZIRHYCNHX-RYUDHWBXSA-N 0.000 description 1
- SYWCGQOIIARSIX-SRVKXCTJSA-N Glu-Pro-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O SYWCGQOIIARSIX-SRVKXCTJSA-N 0.000 description 1
- NNQDRRUXFJYCCJ-NHCYSSNCSA-N Glu-Pro-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O NNQDRRUXFJYCCJ-NHCYSSNCSA-N 0.000 description 1
- DMYACXMQUABZIQ-NRPADANISA-N Glu-Ser-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O DMYACXMQUABZIQ-NRPADANISA-N 0.000 description 1
- MWTGQXBHVRTCOR-GLLZPBPUSA-N Glu-Thr-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MWTGQXBHVRTCOR-GLLZPBPUSA-N 0.000 description 1
- YQAQQKPWFOBSMU-WDCWCFNPSA-N Glu-Thr-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O YQAQQKPWFOBSMU-WDCWCFNPSA-N 0.000 description 1
- KIEICAOUSNYOLM-NRPADANISA-N Glu-Val-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O KIEICAOUSNYOLM-NRPADANISA-N 0.000 description 1
- YQPFCZVKMUVZIN-AUTRQRHGSA-N Glu-Val-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O YQPFCZVKMUVZIN-AUTRQRHGSA-N 0.000 description 1
- SOYWRINXUSUWEQ-DLOVCJGASA-N Glu-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O SOYWRINXUSUWEQ-DLOVCJGASA-N 0.000 description 1
- JBRBACJPBZNFMF-YUMQZZPRSA-N Gly-Ala-Lys Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN JBRBACJPBZNFMF-YUMQZZPRSA-N 0.000 description 1
- RQZGFWKQLPJOEQ-YUMQZZPRSA-N Gly-Arg-Gln Chemical compound C(C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)CN)CN=C(N)N RQZGFWKQLPJOEQ-YUMQZZPRSA-N 0.000 description 1
- OGCIHJPYKVSMTE-YUMQZZPRSA-N Gly-Arg-Glu Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O OGCIHJPYKVSMTE-YUMQZZPRSA-N 0.000 description 1
- MXXXVOYFNVJHMA-IUCAKERBSA-N Gly-Arg-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)CN MXXXVOYFNVJHMA-IUCAKERBSA-N 0.000 description 1
- VXKCPBPQEKKERH-IUCAKERBSA-N Gly-Arg-Pro Chemical compound NC(N)=NCCC[C@H](NC(=O)CN)C(=O)N1CCC[C@H]1C(O)=O VXKCPBPQEKKERH-IUCAKERBSA-N 0.000 description 1
- KKBWDNZXYLGJEY-UHFFFAOYSA-N Gly-Arg-Pro Natural products NCC(=O)NC(CCNC(=N)N)C(=O)N1CCCC1C(=O)O KKBWDNZXYLGJEY-UHFFFAOYSA-N 0.000 description 1
- DTPOVRRYXPJJAZ-FJXKBIBVSA-N Gly-Arg-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N DTPOVRRYXPJJAZ-FJXKBIBVSA-N 0.000 description 1
- UXJHNZODTMHWRD-WHFBIAKZSA-N Gly-Asn-Ala Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O UXJHNZODTMHWRD-WHFBIAKZSA-N 0.000 description 1
- DWUKOTKSTDWGAE-BQBZGAKWSA-N Gly-Asn-Arg Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DWUKOTKSTDWGAE-BQBZGAKWSA-N 0.000 description 1
- XCLCVBYNGXEVDU-WHFBIAKZSA-N Gly-Asn-Ser Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O XCLCVBYNGXEVDU-WHFBIAKZSA-N 0.000 description 1
- KQDMENMTYNBWMR-WHFBIAKZSA-N Gly-Asp-Ala Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O KQDMENMTYNBWMR-WHFBIAKZSA-N 0.000 description 1
- PEZZSFLFXXFUQD-XPUUQOCRSA-N Gly-Cys-Val Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O PEZZSFLFXXFUQD-XPUUQOCRSA-N 0.000 description 1
- BIRKKBCSAIHDDF-WDSKDSINSA-N Gly-Glu-Cys Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(O)=O BIRKKBCSAIHDDF-WDSKDSINSA-N 0.000 description 1
- XTQFHTHIAKKCTM-YFKPBYRVSA-N Gly-Glu-Gly Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O XTQFHTHIAKKCTM-YFKPBYRVSA-N 0.000 description 1
- KMSGYZQRXPUKGI-BYPYZUCNSA-N Gly-Gly-Asn Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(N)=O KMSGYZQRXPUKGI-BYPYZUCNSA-N 0.000 description 1
- QPTNELDXWKRIFX-YFKPBYRVSA-N Gly-Gly-Gln Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O QPTNELDXWKRIFX-YFKPBYRVSA-N 0.000 description 1
- XMPXVJIDADUOQB-RCOVLWMOSA-N Gly-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C([O-])=O)NC(=O)CNC(=O)C[NH3+] XMPXVJIDADUOQB-RCOVLWMOSA-N 0.000 description 1
- KAJAOGBVWCYGHZ-JTQLQIEISA-N Gly-Gly-Phe Chemical compound [NH3+]CC(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 KAJAOGBVWCYGHZ-JTQLQIEISA-N 0.000 description 1
- FQKKPCWTZZEDIC-XPUUQOCRSA-N Gly-His-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)CN)CC1=CN=CN1 FQKKPCWTZZEDIC-XPUUQOCRSA-N 0.000 description 1
- MVORZMQFXBLMHM-QWRGUYRKSA-N Gly-His-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CN=CN1 MVORZMQFXBLMHM-QWRGUYRKSA-N 0.000 description 1
- SXJHOPPTOJACOA-QXEWZRGKSA-N Gly-Ile-Arg Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N SXJHOPPTOJACOA-QXEWZRGKSA-N 0.000 description 1
- VIIBEIQMLJEUJG-LAEOZQHASA-N Gly-Ile-Gln Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O VIIBEIQMLJEUJG-LAEOZQHASA-N 0.000 description 1
- HKSNHPVETYYJBK-LAEOZQHASA-N Gly-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)CN HKSNHPVETYYJBK-LAEOZQHASA-N 0.000 description 1
- SCWYHUQOOFRVHP-MBLNEYKQSA-N Gly-Ile-Thr Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SCWYHUQOOFRVHP-MBLNEYKQSA-N 0.000 description 1
- ULZCYBYDTUMHNF-IUCAKERBSA-N Gly-Leu-Glu Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ULZCYBYDTUMHNF-IUCAKERBSA-N 0.000 description 1
- PTIIBFKSLCYQBO-NHCYSSNCSA-N Gly-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)CN PTIIBFKSLCYQBO-NHCYSSNCSA-N 0.000 description 1
- MHXKHKWHPNETGG-QWRGUYRKSA-N Gly-Lys-Leu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O MHXKHKWHPNETGG-QWRGUYRKSA-N 0.000 description 1
- NTBOEZICHOSJEE-YUMQZZPRSA-N Gly-Lys-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NTBOEZICHOSJEE-YUMQZZPRSA-N 0.000 description 1
- IGOYNRWLWHWAQO-JTQLQIEISA-N Gly-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 IGOYNRWLWHWAQO-JTQLQIEISA-N 0.000 description 1
- IBYOLNARKHMLBG-WHOFXGATSA-N Gly-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 IBYOLNARKHMLBG-WHOFXGATSA-N 0.000 description 1
- FEUPVVCGQLNXNP-IRXDYDNUSA-N Gly-Phe-Phe Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 FEUPVVCGQLNXNP-IRXDYDNUSA-N 0.000 description 1
- ZLCLYFGMKFCDCN-XPUUQOCRSA-N Gly-Ser-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CO)NC(=O)CN)C(O)=O ZLCLYFGMKFCDCN-XPUUQOCRSA-N 0.000 description 1
- DBUNZBWUWCIELX-JHEQGTHGSA-N Gly-Thr-Glu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O DBUNZBWUWCIELX-JHEQGTHGSA-N 0.000 description 1
- HUFUVTYGPOUCBN-MBLNEYKQSA-N Gly-Thr-Ile Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HUFUVTYGPOUCBN-MBLNEYKQSA-N 0.000 description 1
- LLWQVJNHMYBLLK-CDMKHQONSA-N Gly-Thr-Phe Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LLWQVJNHMYBLLK-CDMKHQONSA-N 0.000 description 1
- TVTZEOHWHUVYCG-KYNKHSRBSA-N Gly-Thr-Thr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O TVTZEOHWHUVYCG-KYNKHSRBSA-N 0.000 description 1
- GULGDABMYTYMJZ-STQMWFEESA-N Gly-Trp-Asp Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(O)=O)C(O)=O GULGDABMYTYMJZ-STQMWFEESA-N 0.000 description 1
- GWNIGUKSRJBIHX-STQMWFEESA-N Gly-Tyr-Arg Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)CN)O GWNIGUKSRJBIHX-STQMWFEESA-N 0.000 description 1
- SBVMXEZQJVUARN-XPUUQOCRSA-N Gly-Val-Ser Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O SBVMXEZQJVUARN-XPUUQOCRSA-N 0.000 description 1
- AFMOTCMSEBITOE-YEPSODPASA-N Gly-Val-Thr Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AFMOTCMSEBITOE-YEPSODPASA-N 0.000 description 1
- 102100031181 Glyceraldehyde-3-phosphate dehydrogenase Human genes 0.000 description 1
- ZWZWYGMENQVNFU-UHFFFAOYSA-N Glycerophosphorylserin Natural products OC(=O)C(N)COP(O)(=O)OCC(O)CO ZWZWYGMENQVNFU-UHFFFAOYSA-N 0.000 description 1
- AEMRFAOFKBGASW-UHFFFAOYSA-N Glycolic acid Polymers OCC(O)=O AEMRFAOFKBGASW-UHFFFAOYSA-N 0.000 description 1
- JBJNKUOMNZGQIM-PYJNHQTQSA-N His-Arg-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JBJNKUOMNZGQIM-PYJNHQTQSA-N 0.000 description 1
- VTMLJMNQHKBPON-QWRGUYRKSA-N His-Gly-His Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 VTMLJMNQHKBPON-QWRGUYRKSA-N 0.000 description 1
- 108010093488 His-His-His-His-His-His Proteins 0.000 description 1
- SKYULSWNBYAQMG-IHRRRGAJSA-N His-Leu-Arg Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SKYULSWNBYAQMG-IHRRRGAJSA-N 0.000 description 1
- VFBZWZXKCVBTJR-SRVKXCTJSA-N His-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N VFBZWZXKCVBTJR-SRVKXCTJSA-N 0.000 description 1
- BXOLYFJYQQRQDJ-MXAVVETBSA-N His-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CN=CN1)N BXOLYFJYQQRQDJ-MXAVVETBSA-N 0.000 description 1
- RNMNYMDTESKEAJ-KKUMJFAQSA-N His-Leu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 RNMNYMDTESKEAJ-KKUMJFAQSA-N 0.000 description 1
- TWROVBNEHJSXDG-IHRRRGAJSA-N His-Leu-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O TWROVBNEHJSXDG-IHRRRGAJSA-N 0.000 description 1
- CKRJBQJIGOEKMC-SRVKXCTJSA-N His-Lys-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O CKRJBQJIGOEKMC-SRVKXCTJSA-N 0.000 description 1
- FHKZHRMERJUXRJ-DCAQKATOSA-N His-Ser-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 FHKZHRMERJUXRJ-DCAQKATOSA-N 0.000 description 1
- ZHHLTWUOWXHVQJ-YUMQZZPRSA-N His-Ser-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZHHLTWUOWXHVQJ-YUMQZZPRSA-N 0.000 description 1
- DQZCEKQPSOBNMJ-NKIYYHGXSA-N His-Thr-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O DQZCEKQPSOBNMJ-NKIYYHGXSA-N 0.000 description 1
- 108010033040 Histones Proteins 0.000 description 1
- WUEIUSDAECDLQO-NAKRPEOUSA-N Ile-Ala-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)O)N WUEIUSDAECDLQO-NAKRPEOUSA-N 0.000 description 1
- MKWSZEHGHSLNPF-NAKRPEOUSA-N Ile-Ala-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O)N MKWSZEHGHSLNPF-NAKRPEOUSA-N 0.000 description 1
- YOTNPRLPIPHQSB-XUXIUFHCSA-N Ile-Arg-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOTNPRLPIPHQSB-XUXIUFHCSA-N 0.000 description 1
- HDODQNPMSHDXJT-GHCJXIJMSA-N Ile-Asn-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O HDODQNPMSHDXJT-GHCJXIJMSA-N 0.000 description 1
- QYOGJYIRKACXEP-SLBDDTMCSA-N Ile-Asn-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N QYOGJYIRKACXEP-SLBDDTMCSA-N 0.000 description 1
- QSPLUJGYOPZINY-ZPFDUUQYSA-N Ile-Asp-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N QSPLUJGYOPZINY-ZPFDUUQYSA-N 0.000 description 1
- DMZOUKXXHJQPTL-GRLWGSQLSA-N Ile-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N DMZOUKXXHJQPTL-GRLWGSQLSA-N 0.000 description 1
- KUHFPGIVBOCRMV-MNXVOIDGSA-N Ile-Gln-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(C)C)C(=O)O)N KUHFPGIVBOCRMV-MNXVOIDGSA-N 0.000 description 1
- YBJWJQQBWRARLT-KBIXCLLPSA-N Ile-Gln-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O YBJWJQQBWRARLT-KBIXCLLPSA-N 0.000 description 1
- PHIXPNQDGGILMP-YVNDNENWSA-N Ile-Glu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N PHIXPNQDGGILMP-YVNDNENWSA-N 0.000 description 1
- IGJWJGIHUFQANP-LAEOZQHASA-N Ile-Gly-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N IGJWJGIHUFQANP-LAEOZQHASA-N 0.000 description 1
- MQFGXJNSUJTXDT-QSFUFRPTSA-N Ile-Gly-Ile Chemical compound N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)O MQFGXJNSUJTXDT-QSFUFRPTSA-N 0.000 description 1
- NYEYYMLUABXDMC-NHCYSSNCSA-N Ile-Gly-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)O)N NYEYYMLUABXDMC-NHCYSSNCSA-N 0.000 description 1
- LWWILHPVAKKLQS-QXEWZRGKSA-N Ile-Gly-Met Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CCSC)C(=O)O)N LWWILHPVAKKLQS-QXEWZRGKSA-N 0.000 description 1
- UQXADIGYEYBJEI-DJFWLOJKSA-N Ile-His-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N UQXADIGYEYBJEI-DJFWLOJKSA-N 0.000 description 1
- SJLVSMMIFYTSGY-GRLWGSQLSA-N Ile-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SJLVSMMIFYTSGY-GRLWGSQLSA-N 0.000 description 1
- DMSVBUWGDLYNLC-IAVJCBSLSA-N Ile-Ile-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 DMSVBUWGDLYNLC-IAVJCBSLSA-N 0.000 description 1
- TWYOYAKMLHWMOJ-ZPFDUUQYSA-N Ile-Leu-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O TWYOYAKMLHWMOJ-ZPFDUUQYSA-N 0.000 description 1
- HUORUFRRJHELPD-MNXVOIDGSA-N Ile-Leu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N HUORUFRRJHELPD-MNXVOIDGSA-N 0.000 description 1
- DBXXASNNDTXOLU-MXAVVETBSA-N Ile-Leu-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N DBXXASNNDTXOLU-MXAVVETBSA-N 0.000 description 1
- HPCFRQWLTRDGHT-AJNGGQMLSA-N Ile-Leu-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O HPCFRQWLTRDGHT-AJNGGQMLSA-N 0.000 description 1
- PHRWFSFCNJPWRO-PPCPHDFISA-N Ile-Leu-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N PHRWFSFCNJPWRO-PPCPHDFISA-N 0.000 description 1
- WVUDHMBJNBWZBU-XUXIUFHCSA-N Ile-Lys-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)O)N WVUDHMBJNBWZBU-XUXIUFHCSA-N 0.000 description 1
- MASWXTFJVNRZPT-NAKRPEOUSA-N Ile-Met-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(=O)O)N MASWXTFJVNRZPT-NAKRPEOUSA-N 0.000 description 1
- FJWALBCCVIHZBS-QXEWZRGKSA-N Ile-Met-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)NCC(=O)O)N FJWALBCCVIHZBS-QXEWZRGKSA-N 0.000 description 1
- FTUZWJVSNZMLPI-RVMXOQNASA-N Ile-Met-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N FTUZWJVSNZMLPI-RVMXOQNASA-N 0.000 description 1
- NPAYJTAXWXJKLO-NAKRPEOUSA-N Ile-Met-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)O)N NPAYJTAXWXJKLO-NAKRPEOUSA-N 0.000 description 1
- SAVXZJYTTQQQDD-QEWYBTABSA-N Ile-Phe-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SAVXZJYTTQQQDD-QEWYBTABSA-N 0.000 description 1
- XLXPYSDGMXTTNQ-UHFFFAOYSA-N Ile-Phe-Leu Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(CC(C)C)C(O)=O)CC1=CC=CC=C1 XLXPYSDGMXTTNQ-UHFFFAOYSA-N 0.000 description 1
- VEPIBPGLTLPBDW-URLPEUOOSA-N Ile-Phe-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N VEPIBPGLTLPBDW-URLPEUOOSA-N 0.000 description 1
- ZNOBVZFCHNHKHA-KBIXCLLPSA-N Ile-Ser-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZNOBVZFCHNHKHA-KBIXCLLPSA-N 0.000 description 1
- AGGIYSLVUKVOPT-HTFCKZLJSA-N Ile-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N AGGIYSLVUKVOPT-HTFCKZLJSA-N 0.000 description 1
- ZDNNDIJTUHQCAM-MXAVVETBSA-N Ile-Ser-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N ZDNNDIJTUHQCAM-MXAVVETBSA-N 0.000 description 1
- WCNWGAUZWWSYDG-SVSWQMSJSA-N Ile-Thr-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)O)N WCNWGAUZWWSYDG-SVSWQMSJSA-N 0.000 description 1
- QHUREMVLLMNUAX-OSUNSFLBSA-N Ile-Thr-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)O)N QHUREMVLLMNUAX-OSUNSFLBSA-N 0.000 description 1
- VBGCPJBKUXRYDA-DSYPUSFNSA-N Ile-Trp-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCCCN)C(=O)O)N VBGCPJBKUXRYDA-DSYPUSFNSA-N 0.000 description 1
- DZMWFIRHFFVBHS-ZEWNOJEFSA-N Ile-Tyr-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)N DZMWFIRHFFVBHS-ZEWNOJEFSA-N 0.000 description 1
- ZGKVPOSSTGHJAF-HJPIBITLSA-N Ile-Tyr-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CO)C(=O)O)N ZGKVPOSSTGHJAF-HJPIBITLSA-N 0.000 description 1
- DGAQECJNVWCQMB-PUAWFVPOSA-M Ilexoside XXIX Chemical compound C[C@@H]1CC[C@@]2(CC[C@@]3(C(=CC[C@H]4[C@]3(CC[C@@H]5[C@@]4(CC[C@@H](C5(C)C)OS(=O)(=O)[O-])C)C)[C@@H]2[C@]1(C)O)C)C(=O)O[C@H]6[C@@H]([C@H]([C@@H]([C@H](O6)CO)O)O)O.[Na+] DGAQECJNVWCQMB-PUAWFVPOSA-M 0.000 description 1
- 108010050904 Interferons Proteins 0.000 description 1
- 108010065805 Interleukin-12 Proteins 0.000 description 1
- 102000013462 Interleukin-12 Human genes 0.000 description 1
- 108010065637 Interleukin-23 Proteins 0.000 description 1
- 102000013264 Interleukin-23 Human genes 0.000 description 1
- 102100033503 Interleukin-36 gamma Human genes 0.000 description 1
- 101710195086 Interleukin-36 gamma Proteins 0.000 description 1
- 108091092195 Intron Proteins 0.000 description 1
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 1
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 1
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 1
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 1
- TYYLDKGBCJGJGW-UHFFFAOYSA-N L-tryptophan-L-tyrosine Natural products C=1NC2=CC=CC=C2C=1CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 TYYLDKGBCJGJGW-UHFFFAOYSA-N 0.000 description 1
- GUBGYTABKSRVRQ-QKKXKWKRSA-N Lactose Natural products OC[C@H]1O[C@@H](O[C@H]2[C@H](O)[C@@H](O)C(O)O[C@@H]2CO)[C@H](O)[C@@H](O)[C@H]1O GUBGYTABKSRVRQ-QKKXKWKRSA-N 0.000 description 1
- XIRYQRLFHWWWTC-QEJZJMRPSA-N Leu-Ala-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XIRYQRLFHWWWTC-QEJZJMRPSA-N 0.000 description 1
- BQSLGJHIAGOZCD-CIUDSAMLSA-N Leu-Ala-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O BQSLGJHIAGOZCD-CIUDSAMLSA-N 0.000 description 1
- HXWALXSAVBLTPK-NUTKFTJISA-N Leu-Ala-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC(C)C)N HXWALXSAVBLTPK-NUTKFTJISA-N 0.000 description 1
- YOZCKMXHBYKOMQ-IHRRRGAJSA-N Leu-Arg-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOZCKMXHBYKOMQ-IHRRRGAJSA-N 0.000 description 1
- GPXFZVUVPCFTMG-AVGNSLFASA-N Leu-Arg-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(C)C GPXFZVUVPCFTMG-AVGNSLFASA-N 0.000 description 1
- UCOCBWDBHCUPQP-DCAQKATOSA-N Leu-Arg-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O UCOCBWDBHCUPQP-DCAQKATOSA-N 0.000 description 1
- XYUBOFCTGPZFSA-WDSOQIARSA-N Leu-Arg-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 XYUBOFCTGPZFSA-WDSOQIARSA-N 0.000 description 1
- CUXRXAIAVYLVFD-ULQDDVLXSA-N Leu-Arg-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 CUXRXAIAVYLVFD-ULQDDVLXSA-N 0.000 description 1
- IGUOAYLTQJLPPD-DCAQKATOSA-N Leu-Asn-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IGUOAYLTQJLPPD-DCAQKATOSA-N 0.000 description 1
- WGNOPSQMIQERPK-UHFFFAOYSA-N Leu-Asn-Pro Natural products CC(C)CC(N)C(=O)NC(CC(=O)N)C(=O)N1CCCC1C(=O)O WGNOPSQMIQERPK-UHFFFAOYSA-N 0.000 description 1
- FIJMQLGQLBLBOL-HJGDQZAQSA-N Leu-Asn-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FIJMQLGQLBLBOL-HJGDQZAQSA-N 0.000 description 1
- DLFAACQHIRSQGG-CIUDSAMLSA-N Leu-Asp-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O DLFAACQHIRSQGG-CIUDSAMLSA-N 0.000 description 1
- MYGQXVYRZMKRDB-SRVKXCTJSA-N Leu-Asp-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN MYGQXVYRZMKRDB-SRVKXCTJSA-N 0.000 description 1
- KWURTLAFFDOTEQ-GUBZILKMSA-N Leu-Cys-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KWURTLAFFDOTEQ-GUBZILKMSA-N 0.000 description 1
- VQPPIMUZCZCOIL-GUBZILKMSA-N Leu-Gln-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O VQPPIMUZCZCOIL-GUBZILKMSA-N 0.000 description 1
- DXYBNWJZJVSZAE-GUBZILKMSA-N Leu-Gln-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N DXYBNWJZJVSZAE-GUBZILKMSA-N 0.000 description 1
- KAFOIVJDVSZUMD-DCAQKATOSA-N Leu-Gln-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-DCAQKATOSA-N 0.000 description 1
- KAFOIVJDVSZUMD-UHFFFAOYSA-N Leu-Gln-Gln Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)NC(CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-UHFFFAOYSA-N 0.000 description 1
- RSFGIMMPWAXNML-MNXVOIDGSA-N Leu-Gln-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RSFGIMMPWAXNML-MNXVOIDGSA-N 0.000 description 1
- CQGSYZCULZMEDE-UHFFFAOYSA-N Leu-Gln-Pro Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)N1CCCC1C(O)=O CQGSYZCULZMEDE-UHFFFAOYSA-N 0.000 description 1
- WMTOVWLLDGQGCV-GUBZILKMSA-N Leu-Glu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N WMTOVWLLDGQGCV-GUBZILKMSA-N 0.000 description 1
- KVMULWOHPPMHHE-DCAQKATOSA-N Leu-Glu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KVMULWOHPPMHHE-DCAQKATOSA-N 0.000 description 1
- OXRLYTYUXAQTHP-YUMQZZPRSA-N Leu-Gly-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](C)C(O)=O OXRLYTYUXAQTHP-YUMQZZPRSA-N 0.000 description 1
- BABSVXFGKFLIGW-UWVGGRQHSA-N Leu-Gly-Arg Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N BABSVXFGKFLIGW-UWVGGRQHSA-N 0.000 description 1
- FMEICTQWUKNAGC-YUMQZZPRSA-N Leu-Gly-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O FMEICTQWUKNAGC-YUMQZZPRSA-N 0.000 description 1
- HYMLKESRWLZDBR-WEDXCCLWSA-N Leu-Gly-Thr Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HYMLKESRWLZDBR-WEDXCCLWSA-N 0.000 description 1
- PBGDOSARRIJMEV-DLOVCJGASA-N Leu-His-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O PBGDOSARRIJMEV-DLOVCJGASA-N 0.000 description 1
- VZBIUJURDLFFOE-IHRRRGAJSA-N Leu-His-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VZBIUJURDLFFOE-IHRRRGAJSA-N 0.000 description 1
- OHZIZVWQXJPBJS-IXOXFDKPSA-N Leu-His-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OHZIZVWQXJPBJS-IXOXFDKPSA-N 0.000 description 1
- DBSLVQBXKVKDKJ-BJDJZHNGSA-N Leu-Ile-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O DBSLVQBXKVKDKJ-BJDJZHNGSA-N 0.000 description 1
- QLDHBYRUNQZIJQ-DKIMLUQUSA-N Leu-Ile-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QLDHBYRUNQZIJQ-DKIMLUQUSA-N 0.000 description 1
- LIINDKYIGYTDLG-PPCPHDFISA-N Leu-Ile-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LIINDKYIGYTDLG-PPCPHDFISA-N 0.000 description 1
- TVEOVCYCYGKVPP-HSCHXYMDSA-N Leu-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC(C)C)N TVEOVCYCYGKVPP-HSCHXYMDSA-N 0.000 description 1
- DSFYPIUSAMSERP-IHRRRGAJSA-N Leu-Leu-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DSFYPIUSAMSERP-IHRRRGAJSA-N 0.000 description 1
- IAJFFZORSWOZPQ-SRVKXCTJSA-N Leu-Leu-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IAJFFZORSWOZPQ-SRVKXCTJSA-N 0.000 description 1
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 1
- ZRHDPZAAWLXXIR-SRVKXCTJSA-N Leu-Lys-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O ZRHDPZAAWLXXIR-SRVKXCTJSA-N 0.000 description 1
- HVHRPWQEQHIQJF-AVGNSLFASA-N Leu-Lys-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HVHRPWQEQHIQJF-AVGNSLFASA-N 0.000 description 1
- FIICHHJDINDXKG-IHPCNDPISA-N Leu-Lys-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O FIICHHJDINDXKG-IHPCNDPISA-N 0.000 description 1
- PKKMDPNFGULLNQ-AVGNSLFASA-N Leu-Met-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O PKKMDPNFGULLNQ-AVGNSLFASA-N 0.000 description 1
- FLNPJLDPGMLWAU-UWVGGRQHSA-N Leu-Met-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC(C)C FLNPJLDPGMLWAU-UWVGGRQHSA-N 0.000 description 1
- JVTYXRRFZCEPPK-RHYQMDGZSA-N Leu-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC(C)C)N)O JVTYXRRFZCEPPK-RHYQMDGZSA-N 0.000 description 1
- NJMXCOOEFLMZSR-AVGNSLFASA-N Leu-Met-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O NJMXCOOEFLMZSR-AVGNSLFASA-N 0.000 description 1
- QMKFDEUJGYNFMC-AVGNSLFASA-N Leu-Pro-Arg Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QMKFDEUJGYNFMC-AVGNSLFASA-N 0.000 description 1
- KWLWZYMNUZJKMZ-IHRRRGAJSA-N Leu-Pro-Leu Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O KWLWZYMNUZJKMZ-IHRRRGAJSA-N 0.000 description 1
- PWPBLZXWFXJFHE-RHYQMDGZSA-N Leu-Pro-Thr Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O PWPBLZXWFXJFHE-RHYQMDGZSA-N 0.000 description 1
- JIHDFWWRYHSAQB-GUBZILKMSA-N Leu-Ser-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JIHDFWWRYHSAQB-GUBZILKMSA-N 0.000 description 1
- RGUXWMDNCPMQFB-YUMQZZPRSA-N Leu-Ser-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RGUXWMDNCPMQFB-YUMQZZPRSA-N 0.000 description 1
- ADJWHHZETYAAAX-SRVKXCTJSA-N Leu-Ser-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N ADJWHHZETYAAAX-SRVKXCTJSA-N 0.000 description 1
- BRTVHXHCUSXYRI-CIUDSAMLSA-N Leu-Ser-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O BRTVHXHCUSXYRI-CIUDSAMLSA-N 0.000 description 1
- PPGBXYKMUMHFBF-KATARQTJSA-N Leu-Ser-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PPGBXYKMUMHFBF-KATARQTJSA-N 0.000 description 1
- ZDJQVSIPFLMNOX-RHYQMDGZSA-N Leu-Thr-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZDJQVSIPFLMNOX-RHYQMDGZSA-N 0.000 description 1
- DAYQSYGBCUKVKT-VOAKCMCISA-N Leu-Thr-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O DAYQSYGBCUKVKT-VOAKCMCISA-N 0.000 description 1
- ODRREERHVHMIPT-OEAJRASXSA-N Leu-Thr-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ODRREERHVHMIPT-OEAJRASXSA-N 0.000 description 1
- GZRABTMNWJXFMH-UVOCVTCTSA-N Leu-Thr-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZRABTMNWJXFMH-UVOCVTCTSA-N 0.000 description 1
- AIQWYVFNBNNOLU-RHYQMDGZSA-N Leu-Thr-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O AIQWYVFNBNNOLU-RHYQMDGZSA-N 0.000 description 1
- FPFOYSCDUWTZBF-IHPCNDPISA-N Leu-Trp-Leu Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H]([NH3+])CC(C)C)C(=O)N[C@@H](CC(C)C)C([O-])=O)=CNC2=C1 FPFOYSCDUWTZBF-IHPCNDPISA-N 0.000 description 1
- UFPLDOKWDNTTRP-ULQDDVLXSA-N Leu-Tyr-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CC=C(O)C=C1 UFPLDOKWDNTTRP-ULQDDVLXSA-N 0.000 description 1
- BGGTYDNTOYRTTR-MEYUZBJRSA-N Leu-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC(C)C)N)O BGGTYDNTOYRTTR-MEYUZBJRSA-N 0.000 description 1
- YQFZRHYZLARWDY-IHRRRGAJSA-N Leu-Val-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN YQFZRHYZLARWDY-IHRRRGAJSA-N 0.000 description 1
- VKVDRTGWLVZJOM-DCAQKATOSA-N Leu-Val-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O VKVDRTGWLVZJOM-DCAQKATOSA-N 0.000 description 1
- QESXLSQLQHHTIX-RHYQMDGZSA-N Leu-Val-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QESXLSQLQHHTIX-RHYQMDGZSA-N 0.000 description 1
- 235000010643 Leucaena leucocephala Nutrition 0.000 description 1
- 240000007472 Leucaena leucocephala Species 0.000 description 1
- OYHQOLUKZRVURQ-HZJYTTRNSA-N Linoleic acid Chemical compound CCCCC\C=C/C\C=C/CCCCCCCC(O)=O OYHQOLUKZRVURQ-HZJYTTRNSA-N 0.000 description 1
- 239000005089 Luciferase Substances 0.000 description 1
- DDWFXDSYGUXRAY-UHFFFAOYSA-N Luciferin Natural products CCc1c(C)c(CC2NC(=O)C(=C2C=C)C)[nH]c1Cc3[nH]c4C(=C5/NC(CC(=O)O)C(C)C5CC(=O)O)CC(=O)c4c3C DDWFXDSYGUXRAY-UHFFFAOYSA-N 0.000 description 1
- FZIJIFCXUCZHOL-CIUDSAMLSA-N Lys-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN FZIJIFCXUCZHOL-CIUDSAMLSA-N 0.000 description 1
- MPGHETGWWWUHPY-CIUDSAMLSA-N Lys-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN MPGHETGWWWUHPY-CIUDSAMLSA-N 0.000 description 1
- VHFFQUSNFFIZBT-CIUDSAMLSA-N Lys-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCCN)N VHFFQUSNFFIZBT-CIUDSAMLSA-N 0.000 description 1
- WSXTWLJHTLRFLW-SRVKXCTJSA-N Lys-Ala-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O WSXTWLJHTLRFLW-SRVKXCTJSA-N 0.000 description 1
- VHXMZJGOKIMETG-CQDKDKBSSA-N Lys-Ala-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCCCN)N VHXMZJGOKIMETG-CQDKDKBSSA-N 0.000 description 1
- GAOJCVKPIGHTGO-UWVGGRQHSA-N Lys-Arg-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O GAOJCVKPIGHTGO-UWVGGRQHSA-N 0.000 description 1
- DGAAQRAUOFHBFJ-CIUDSAMLSA-N Lys-Asn-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O DGAAQRAUOFHBFJ-CIUDSAMLSA-N 0.000 description 1
- HQVDJTYKCMIWJP-YUMQZZPRSA-N Lys-Asn-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O HQVDJTYKCMIWJP-YUMQZZPRSA-N 0.000 description 1
- NCTDKZKNBDZDOL-GARJFASQSA-N Lys-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N)C(=O)O NCTDKZKNBDZDOL-GARJFASQSA-N 0.000 description 1
- DGWXCIORNLWGGG-CIUDSAMLSA-N Lys-Asn-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O DGWXCIORNLWGGG-CIUDSAMLSA-N 0.000 description 1
- HKCCVDWHHTVVPN-CIUDSAMLSA-N Lys-Asp-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O HKCCVDWHHTVVPN-CIUDSAMLSA-N 0.000 description 1
- WGCKDDHUFPQSMZ-ZPFDUUQYSA-N Lys-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCCCN WGCKDDHUFPQSMZ-ZPFDUUQYSA-N 0.000 description 1
- IWWMPCPLFXFBAF-SRVKXCTJSA-N Lys-Asp-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O IWWMPCPLFXFBAF-SRVKXCTJSA-N 0.000 description 1
- YFGWNAROEYWGNL-GUBZILKMSA-N Lys-Gln-Asn Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O YFGWNAROEYWGNL-GUBZILKMSA-N 0.000 description 1
- ZXEUFAVXODIPHC-GUBZILKMSA-N Lys-Glu-Asn Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZXEUFAVXODIPHC-GUBZILKMSA-N 0.000 description 1
- PBIPLDMFHAICIP-DCAQKATOSA-N Lys-Glu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PBIPLDMFHAICIP-DCAQKATOSA-N 0.000 description 1
- KZOHPCYVORJBLG-AVGNSLFASA-N Lys-Glu-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCCN)N KZOHPCYVORJBLG-AVGNSLFASA-N 0.000 description 1
- VQXAVLQBQJMENB-SRVKXCTJSA-N Lys-Glu-Met Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O VQXAVLQBQJMENB-SRVKXCTJSA-N 0.000 description 1
- GQZMPWBZQALKJO-UWVGGRQHSA-N Lys-Gly-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O GQZMPWBZQALKJO-UWVGGRQHSA-N 0.000 description 1
- GQFDWEDHOQRNLC-QWRGUYRKSA-N Lys-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN GQFDWEDHOQRNLC-QWRGUYRKSA-N 0.000 description 1
- ZXFRGTAIIZHNHG-AJNGGQMLSA-N Lys-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N ZXFRGTAIIZHNHG-AJNGGQMLSA-N 0.000 description 1
- QOJDBRUCOXQSSK-AJNGGQMLSA-N Lys-Ile-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCCN)C(O)=O QOJDBRUCOXQSSK-AJNGGQMLSA-N 0.000 description 1
- WVJNGSFKBKOKRV-AJNGGQMLSA-N Lys-Leu-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WVJNGSFKBKOKRV-AJNGGQMLSA-N 0.000 description 1
- AIRZWUMAHCDDHR-KKUMJFAQSA-N Lys-Leu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O AIRZWUMAHCDDHR-KKUMJFAQSA-N 0.000 description 1
- ORVFEGYUJITPGI-IHRRRGAJSA-N Lys-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCCCN ORVFEGYUJITPGI-IHRRRGAJSA-N 0.000 description 1
- VSTNAUBHKQPVJX-IHRRRGAJSA-N Lys-Met-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O VSTNAUBHKQPVJX-IHRRRGAJSA-N 0.000 description 1
- ALEVUGKHINJNIF-QEJZJMRPSA-N Lys-Phe-Ala Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 ALEVUGKHINJNIF-QEJZJMRPSA-N 0.000 description 1
- HYSVGEAWTGPMOA-IHRRRGAJSA-N Lys-Pro-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O HYSVGEAWTGPMOA-IHRRRGAJSA-N 0.000 description 1
- YTJFXEDRUOQGSP-DCAQKATOSA-N Lys-Pro-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O YTJFXEDRUOQGSP-DCAQKATOSA-N 0.000 description 1
- LOGFVTREOLYCPF-RHYQMDGZSA-N Lys-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCCN LOGFVTREOLYCPF-RHYQMDGZSA-N 0.000 description 1
- MIROMRNASYKZNL-ULQDDVLXSA-N Lys-Pro-Tyr Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 MIROMRNASYKZNL-ULQDDVLXSA-N 0.000 description 1
- ZUGVARDEGWMMLK-SRVKXCTJSA-N Lys-Ser-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN ZUGVARDEGWMMLK-SRVKXCTJSA-N 0.000 description 1
- MIFFFXHMAHFACR-KATARQTJSA-N Lys-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CCCCN MIFFFXHMAHFACR-KATARQTJSA-N 0.000 description 1
- DLCAXBGXGOVUCD-PPCPHDFISA-N Lys-Thr-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DLCAXBGXGOVUCD-PPCPHDFISA-N 0.000 description 1
- RPWTZTBIFGENIA-VOAKCMCISA-N Lys-Thr-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RPWTZTBIFGENIA-VOAKCMCISA-N 0.000 description 1
- YCJCEMKOZOYBEF-OEAJRASXSA-N Lys-Thr-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O YCJCEMKOZOYBEF-OEAJRASXSA-N 0.000 description 1
- RMOKGALPSPOYKE-KATARQTJSA-N Lys-Thr-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O RMOKGALPSPOYKE-KATARQTJSA-N 0.000 description 1
- VWPJQIHBBOJWDN-DCAQKATOSA-N Lys-Val-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O VWPJQIHBBOJWDN-DCAQKATOSA-N 0.000 description 1
- VWJFOUBDZIUXGA-AVGNSLFASA-N Lys-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCCCN)N VWJFOUBDZIUXGA-AVGNSLFASA-N 0.000 description 1
- 239000004472 Lysine Substances 0.000 description 1
- JLVVSXFLKOJNIY-UHFFFAOYSA-N Magnesium ion Chemical compound [Mg+2] JLVVSXFLKOJNIY-UHFFFAOYSA-N 0.000 description 1
- 208000003289 Meconium Ileus Diseases 0.000 description 1
- ACYHZNZHIZWLQF-BQBZGAKWSA-N Met-Asn-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O ACYHZNZHIZWLQF-BQBZGAKWSA-N 0.000 description 1
- TZLYIHDABYBOCJ-FXQIFTODSA-N Met-Asp-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O TZLYIHDABYBOCJ-FXQIFTODSA-N 0.000 description 1
- PTYVBBNIAQWUFV-DCAQKATOSA-N Met-Cys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCSC)N PTYVBBNIAQWUFV-DCAQKATOSA-N 0.000 description 1
- OXHSZBRPUGNMKW-DCAQKATOSA-N Met-Gln-Arg Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OXHSZBRPUGNMKW-DCAQKATOSA-N 0.000 description 1
- HLYIDXAXQIJYIG-CIUDSAMLSA-N Met-Gln-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HLYIDXAXQIJYIG-CIUDSAMLSA-N 0.000 description 1
- AETNZPKUUYYYEK-CIUDSAMLSA-N Met-Glu-Asn Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O AETNZPKUUYYYEK-CIUDSAMLSA-N 0.000 description 1
- DGNZGCQSVGGYJS-BQBZGAKWSA-N Met-Gly-Asp Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O DGNZGCQSVGGYJS-BQBZGAKWSA-N 0.000 description 1
- LQMHZERGCQJKAH-STQMWFEESA-N Met-Gly-Phe Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 LQMHZERGCQJKAH-STQMWFEESA-N 0.000 description 1
- ORRNBLTZBBESPN-HJWJTTGWSA-N Met-Ile-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ORRNBLTZBBESPN-HJWJTTGWSA-N 0.000 description 1
- PZUUMQPMHBJJKE-AVGNSLFASA-N Met-Leu-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCNC(N)=N PZUUMQPMHBJJKE-AVGNSLFASA-N 0.000 description 1
- JOYFULUKJRJCSX-IUCAKERBSA-N Met-Met-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O JOYFULUKJRJCSX-IUCAKERBSA-N 0.000 description 1
- CRVSHEPROQHVQT-AVGNSLFASA-N Met-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)O)N CRVSHEPROQHVQT-AVGNSLFASA-N 0.000 description 1
- LUYURUYVNYGKGM-RCWTZXSCSA-N Met-Pro-Thr Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O LUYURUYVNYGKGM-RCWTZXSCSA-N 0.000 description 1
- DBMLDOWSVHMQQN-XGEHTFHBSA-N Met-Ser-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DBMLDOWSVHMQQN-XGEHTFHBSA-N 0.000 description 1
- ZBLSZPYQQRIHQU-RCWTZXSCSA-N Met-Thr-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O ZBLSZPYQQRIHQU-RCWTZXSCSA-N 0.000 description 1
- JHVNNUIQXOGAHI-KJEVXHAQSA-N Met-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCSC)N)O JHVNNUIQXOGAHI-KJEVXHAQSA-N 0.000 description 1
- 108060004795 Methyltransferase Proteins 0.000 description 1
- 102000016397 Methyltransferase Human genes 0.000 description 1
- 229920000168 Microcrystalline cellulose Polymers 0.000 description 1
- 206010027626 Milia Diseases 0.000 description 1
- 241000699660 Mus musculus Species 0.000 description 1
- FXHOOIRPVKKKFG-UHFFFAOYSA-N N,N-Dimethylacetamide Chemical compound CN(C)C(C)=O FXHOOIRPVKKKFG-UHFFFAOYSA-N 0.000 description 1
- JMOXSQYGVIXBBZ-UHFFFAOYSA-N N,N-dimethyl-beta-alanine Chemical compound CN(C)CCC(O)=O JMOXSQYGVIXBBZ-UHFFFAOYSA-N 0.000 description 1
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 1
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 1
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 1
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 1
- 108010079364 N-glycylalanine Proteins 0.000 description 1
- CHJJGSNFBQVOTG-UHFFFAOYSA-N N-methyl-guanidine Natural products CNC(N)=N CHJJGSNFBQVOTG-UHFFFAOYSA-N 0.000 description 1
- 238000005481 NMR spectroscopy Methods 0.000 description 1
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 1
- 206010028748 Nasal obstruction Diseases 0.000 description 1
- 108091005461 Nucleic proteins Proteins 0.000 description 1
- 108091034117 Oligonucleotide Proteins 0.000 description 1
- 108700026244 Open Reading Frames Proteins 0.000 description 1
- 241001494479 Pecora Species 0.000 description 1
- 108091093037 Peptide nucleic acid Proteins 0.000 description 1
- CGOMLCQJEMWMCE-STQMWFEESA-N Phe-Arg-Gly Chemical compound NC(N)=NCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 CGOMLCQJEMWMCE-STQMWFEESA-N 0.000 description 1
- YYRCPTVAPLQRNC-ULQDDVLXSA-N Phe-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CC1=CC=CC=C1 YYRCPTVAPLQRNC-ULQDDVLXSA-N 0.000 description 1
- CDNPIRSCAFMMBE-SRVKXCTJSA-N Phe-Asn-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O CDNPIRSCAFMMBE-SRVKXCTJSA-N 0.000 description 1
- UEEVBGHEGJMDDV-AVGNSLFASA-N Phe-Asp-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 UEEVBGHEGJMDDV-AVGNSLFASA-N 0.000 description 1
- RIYZXJVARWJLKS-KKUMJFAQSA-N Phe-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 RIYZXJVARWJLKS-KKUMJFAQSA-N 0.000 description 1
- CPTJPDZTFNKFOU-MXAVVETBSA-N Phe-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC1=CC=CC=C1)N CPTJPDZTFNKFOU-MXAVVETBSA-N 0.000 description 1
- WFDAEEUZPZSMOG-SRVKXCTJSA-N Phe-Cys-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O WFDAEEUZPZSMOG-SRVKXCTJSA-N 0.000 description 1
- YEEFZOKPYOUXMX-KKUMJFAQSA-N Phe-Gln-Met Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O YEEFZOKPYOUXMX-KKUMJFAQSA-N 0.000 description 1
- MPFGIYLYWUCSJG-AVGNSLFASA-N Phe-Glu-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 MPFGIYLYWUCSJG-AVGNSLFASA-N 0.000 description 1
- PSKRILMFHNIUAO-JYJNAYRXSA-N Phe-Glu-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N PSKRILMFHNIUAO-JYJNAYRXSA-N 0.000 description 1
- JWQWPTLEOFNCGX-AVGNSLFASA-N Phe-Glu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 JWQWPTLEOFNCGX-AVGNSLFASA-N 0.000 description 1
- YZJKNDCEPDDIDA-BZSNNMDCSA-N Phe-His-Lys Chemical compound C([C@@H](C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CN=CN1 YZJKNDCEPDDIDA-BZSNNMDCSA-N 0.000 description 1
- VZFPYFRVHMSSNA-JURCDPSOSA-N Phe-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=CC=C1 VZFPYFRVHMSSNA-JURCDPSOSA-N 0.000 description 1
- GYEPCBNTTRORKW-PCBIJLKTSA-N Phe-Ile-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O GYEPCBNTTRORKW-PCBIJLKTSA-N 0.000 description 1
- BWTKUQPNOMMKMA-FIRPJDEBSA-N Phe-Ile-Phe Chemical compound C([C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 BWTKUQPNOMMKMA-FIRPJDEBSA-N 0.000 description 1
- IEOHQGFKHXUALJ-JYJNAYRXSA-N Phe-Met-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IEOHQGFKHXUALJ-JYJNAYRXSA-N 0.000 description 1
- OHIYMVFLQXTZAW-UFYCRDLUSA-N Phe-Met-Phe Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O OHIYMVFLQXTZAW-UFYCRDLUSA-N 0.000 description 1
- KAJLHCWRWDSROH-BZSNNMDCSA-N Phe-Phe-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=CC=C1 KAJLHCWRWDSROH-BZSNNMDCSA-N 0.000 description 1
- YMTMNYNEZDAGMW-RNXOBYDBSA-N Phe-Phe-Trp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N[C@@H](CC3=CNC4=CC=CC=C43)C(=O)O)N YMTMNYNEZDAGMW-RNXOBYDBSA-N 0.000 description 1
- GZGPMBKUJDRICD-ULQDDVLXSA-N Phe-Pro-His Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)N)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O GZGPMBKUJDRICD-ULQDDVLXSA-N 0.000 description 1
- AFNJAQVMTIQTCB-DLOVCJGASA-N Phe-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=CC=C1 AFNJAQVMTIQTCB-DLOVCJGASA-N 0.000 description 1
- JXQVYPWVGUOIDV-MXAVVETBSA-N Phe-Ser-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JXQVYPWVGUOIDV-MXAVVETBSA-N 0.000 description 1
- MRWOVVNKSXXLRP-IHPCNDPISA-N Phe-Ser-Trp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O MRWOVVNKSXXLRP-IHPCNDPISA-N 0.000 description 1
- XNQMZHLAYFWSGJ-HTUGSXCWSA-N Phe-Thr-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XNQMZHLAYFWSGJ-HTUGSXCWSA-N 0.000 description 1
- GNRMAQSIROFNMI-IXOXFDKPSA-N Phe-Thr-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O GNRMAQSIROFNMI-IXOXFDKPSA-N 0.000 description 1
- YFXXRYFWJFQAFW-JHYOHUSXSA-N Phe-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N)O YFXXRYFWJFQAFW-JHYOHUSXSA-N 0.000 description 1
- 102000004160 Phosphoric Monoester Hydrolases Human genes 0.000 description 1
- 108090000608 Phosphoric Monoester Hydrolases Proteins 0.000 description 1
- 229920000954 Polyglycolide Polymers 0.000 description 1
- 241000288906 Primates Species 0.000 description 1
- ZCXQTRXYZOSGJR-FXQIFTODSA-N Pro-Asp-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZCXQTRXYZOSGJR-FXQIFTODSA-N 0.000 description 1
- WGAQWMRJUFQXMF-ZPFDUUQYSA-N Pro-Gln-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WGAQWMRJUFQXMF-ZPFDUUQYSA-N 0.000 description 1
- AUQGUYPHJSMAKI-CYDGBPFRSA-N Pro-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 AUQGUYPHJSMAKI-CYDGBPFRSA-N 0.000 description 1
- HBBBLSVBQGZKOZ-GUBZILKMSA-N Pro-Met-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O HBBBLSVBQGZKOZ-GUBZILKMSA-N 0.000 description 1
- JLMZKEQFMVORMA-SRVKXCTJSA-N Pro-Pro-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 JLMZKEQFMVORMA-SRVKXCTJSA-N 0.000 description 1
- POQFNPILEQEODH-FXQIFTODSA-N Pro-Ser-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O POQFNPILEQEODH-FXQIFTODSA-N 0.000 description 1
- FNGOXVQBBCMFKV-CIUDSAMLSA-N Pro-Ser-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O FNGOXVQBBCMFKV-CIUDSAMLSA-N 0.000 description 1
- ITUDDXVFGFEKPD-NAKRPEOUSA-N Pro-Ser-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ITUDDXVFGFEKPD-NAKRPEOUSA-N 0.000 description 1
- QKDIHFHGHBYTKB-IHRRRGAJSA-N Pro-Ser-Phe Chemical compound N([C@@H](CO)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C(=O)[C@@H]1CCCN1 QKDIHFHGHBYTKB-IHRRRGAJSA-N 0.000 description 1
- FDMCIBSQRKFSTJ-RHYQMDGZSA-N Pro-Thr-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O FDMCIBSQRKFSTJ-RHYQMDGZSA-N 0.000 description 1
- XNJVJEHDZPDPQL-BZSNNMDCSA-N Pro-Trp-Arg Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@H](Cc1c[nH]c2ccccc12)NC(=O)[C@@H]1CCCN1)C(O)=O XNJVJEHDZPDPQL-BZSNNMDCSA-N 0.000 description 1
- FZXSYIPVAFVYBH-KKUMJFAQSA-N Pro-Tyr-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O FZXSYIPVAFVYBH-KKUMJFAQSA-N 0.000 description 1
- DYJTXTCEXMCPBF-UFYCRDLUSA-N Pro-Tyr-Phe Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CC3=CC=CC=C3)C(=O)O DYJTXTCEXMCPBF-UFYCRDLUSA-N 0.000 description 1
- DGDCSVGVWWAJRS-AVGNSLFASA-N Pro-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@@H]2CCCN2 DGDCSVGVWWAJRS-AVGNSLFASA-N 0.000 description 1
- KHRLUIPIMIQFGT-AVGNSLFASA-N Pro-Val-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KHRLUIPIMIQFGT-AVGNSLFASA-N 0.000 description 1
- XRGIDCGRSSWCKE-SRVKXCTJSA-N Pro-Val-Met Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O XRGIDCGRSSWCKE-SRVKXCTJSA-N 0.000 description 1
- FIODMZKLZFLYQP-GUBZILKMSA-N Pro-Val-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FIODMZKLZFLYQP-GUBZILKMSA-N 0.000 description 1
- YDTUEBLEAVANFH-RCWTZXSCSA-N Pro-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 YDTUEBLEAVANFH-RCWTZXSCSA-N 0.000 description 1
- 206010036790 Productive cough Diseases 0.000 description 1
- 108020005067 RNA Splice Sites Proteins 0.000 description 1
- 230000006819 RNA synthesis Effects 0.000 description 1
- 238000011529 RT qPCR Methods 0.000 description 1
- 208000037656 Respiratory Sounds Diseases 0.000 description 1
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 1
- 108091081021 Sense strand Proteins 0.000 description 1
- LVVBAKCGXXUHFO-ZLUOBGJFSA-N Ser-Ala-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O LVVBAKCGXXUHFO-ZLUOBGJFSA-N 0.000 description 1
- KYKKKSWGEPFUMR-NAKRPEOUSA-N Ser-Arg-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KYKKKSWGEPFUMR-NAKRPEOUSA-N 0.000 description 1
- CNIIKZQXBBQHCX-FXQIFTODSA-N Ser-Asp-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O CNIIKZQXBBQHCX-FXQIFTODSA-N 0.000 description 1
- BTPAWKABYQMKKN-LKXGYXEUSA-N Ser-Asp-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BTPAWKABYQMKKN-LKXGYXEUSA-N 0.000 description 1
- MOVJSUIKUNCVMG-ZLUOBGJFSA-N Ser-Cys-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O)N)O MOVJSUIKUNCVMG-ZLUOBGJFSA-N 0.000 description 1
- SWIQQMYVHIXPEK-FXQIFTODSA-N Ser-Cys-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O SWIQQMYVHIXPEK-FXQIFTODSA-N 0.000 description 1
- YRBGKVIWMNEVCZ-WDSKDSINSA-N Ser-Glu-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O YRBGKVIWMNEVCZ-WDSKDSINSA-N 0.000 description 1
- LALNXSXEYFUUDD-GUBZILKMSA-N Ser-Glu-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LALNXSXEYFUUDD-GUBZILKMSA-N 0.000 description 1
- GRSLLFZTTLBOQX-CIUDSAMLSA-N Ser-Glu-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N GRSLLFZTTLBOQX-CIUDSAMLSA-N 0.000 description 1
- OHKFXGKHSJKKAL-NRPADANISA-N Ser-Glu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OHKFXGKHSJKKAL-NRPADANISA-N 0.000 description 1
- SFTZWNJFZYOLBD-ZDLURKLDSA-N Ser-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO SFTZWNJFZYOLBD-ZDLURKLDSA-N 0.000 description 1
- BKZYBLLIBOBOOW-GHCJXIJMSA-N Ser-Ile-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O BKZYBLLIBOBOOW-GHCJXIJMSA-N 0.000 description 1
- RIAKPZVSNBBNRE-BJDJZHNGSA-N Ser-Ile-Leu Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O RIAKPZVSNBBNRE-BJDJZHNGSA-N 0.000 description 1
- MOINZPRHJGTCHZ-MMWGEVLESA-N Ser-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N MOINZPRHJGTCHZ-MMWGEVLESA-N 0.000 description 1
- ZOPISOXXPQNOCO-SVSWQMSJSA-N Ser-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CO)N ZOPISOXXPQNOCO-SVSWQMSJSA-N 0.000 description 1
- FUMGHWDRRFCKEP-CIUDSAMLSA-N Ser-Leu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O FUMGHWDRRFCKEP-CIUDSAMLSA-N 0.000 description 1
- GJFYFGOEWLDQGW-GUBZILKMSA-N Ser-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GJFYFGOEWLDQGW-GUBZILKMSA-N 0.000 description 1
- UBRMZSHOOIVJPW-SRVKXCTJSA-N Ser-Leu-Lys Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O UBRMZSHOOIVJPW-SRVKXCTJSA-N 0.000 description 1
- VZQRNAYURWAEFE-KKUMJFAQSA-N Ser-Leu-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VZQRNAYURWAEFE-KKUMJFAQSA-N 0.000 description 1
- IXZHZUGGKLRHJD-DCAQKATOSA-N Ser-Leu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IXZHZUGGKLRHJD-DCAQKATOSA-N 0.000 description 1
- PPNPDKGQRFSCAC-CIUDSAMLSA-N Ser-Lys-Asp Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPNPDKGQRFSCAC-CIUDSAMLSA-N 0.000 description 1
- JLPMFVAIQHCBDC-CIUDSAMLSA-N Ser-Lys-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N JLPMFVAIQHCBDC-CIUDSAMLSA-N 0.000 description 1
- LRWBCWGEUCKDTN-BJDJZHNGSA-N Ser-Lys-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LRWBCWGEUCKDTN-BJDJZHNGSA-N 0.000 description 1
- XUDRHBPSPAPDJP-SRVKXCTJSA-N Ser-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CO XUDRHBPSPAPDJP-SRVKXCTJSA-N 0.000 description 1
- OCWWJBZQXGYQCA-DCAQKATOSA-N Ser-Lys-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O OCWWJBZQXGYQCA-DCAQKATOSA-N 0.000 description 1
- XKFJENWJGHMDLI-QWRGUYRKSA-N Ser-Phe-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O XKFJENWJGHMDLI-QWRGUYRKSA-N 0.000 description 1
- MQUZANJDFOQOBX-SRVKXCTJSA-N Ser-Phe-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O MQUZANJDFOQOBX-SRVKXCTJSA-N 0.000 description 1
- FKYWFUYPVKLJLP-DCAQKATOSA-N Ser-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO FKYWFUYPVKLJLP-DCAQKATOSA-N 0.000 description 1
- OZPDGESCTGGNAD-CIUDSAMLSA-N Ser-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CO OZPDGESCTGGNAD-CIUDSAMLSA-N 0.000 description 1
- PYTKULIABVRXSC-BWBBJGPYSA-N Ser-Ser-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PYTKULIABVRXSC-BWBBJGPYSA-N 0.000 description 1
- UQGAAZXSCGWMFU-UBHSHLNASA-N Ser-Trp-Asp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CO)N UQGAAZXSCGWMFU-UBHSHLNASA-N 0.000 description 1
- UKKROEYWYIHWBD-ZKWXMUAHSA-N Ser-Val-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O UKKROEYWYIHWBD-ZKWXMUAHSA-N 0.000 description 1
- HNDMFDBQXYZSRM-IHRRRGAJSA-N Ser-Val-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HNDMFDBQXYZSRM-IHRRRGAJSA-N 0.000 description 1
- JGUWRQWULDWNCM-FXQIFTODSA-N Ser-Val-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O JGUWRQWULDWNCM-FXQIFTODSA-N 0.000 description 1
- HSWXBJCBYSWBPT-GUBZILKMSA-N Ser-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CO)C(C)C)C(O)=O HSWXBJCBYSWBPT-GUBZILKMSA-N 0.000 description 1
- DBMJMQXJHONAFJ-UHFFFAOYSA-M Sodium laurylsulphate Chemical compound [Na+].CCCCCCCCCCCCOS([O-])(=O)=O DBMJMQXJHONAFJ-UHFFFAOYSA-M 0.000 description 1
- 229920002125 Sokalan® Polymers 0.000 description 1
- 229920002472 Starch Polymers 0.000 description 1
- 241000282898 Sus scrofa Species 0.000 description 1
- 101710137500 T7 RNA polymerase Proteins 0.000 description 1
- 108700026226 TATA Box Proteins 0.000 description 1
- CYQFCXCEBYINGO-UHFFFAOYSA-N THC Natural products C1=C(C)CCC2C(C)(C)OC3=CC(CCCCC)=CC(O)=C3C21 CYQFCXCEBYINGO-UHFFFAOYSA-N 0.000 description 1
- 102000002933 Thioredoxin Human genes 0.000 description 1
- ZUXQFMVPAYGPFJ-JXUBOQSCSA-N Thr-Ala-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN ZUXQFMVPAYGPFJ-JXUBOQSCSA-N 0.000 description 1
- DWYAUVCQDTZIJI-VZFHVOOUSA-N Thr-Ala-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O DWYAUVCQDTZIJI-VZFHVOOUSA-N 0.000 description 1
- MQBTXMPQNCGSSZ-OSUNSFLBSA-N Thr-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)O)CCCN=C(N)N MQBTXMPQNCGSSZ-OSUNSFLBSA-N 0.000 description 1
- VFEHSAJCWWHDBH-RHYQMDGZSA-N Thr-Arg-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O VFEHSAJCWWHDBH-RHYQMDGZSA-N 0.000 description 1
- NAXBBCLCEOTAIG-RHYQMDGZSA-N Thr-Arg-Lys Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CCCCN)C(O)=O NAXBBCLCEOTAIG-RHYQMDGZSA-N 0.000 description 1
- GZYNMZQXFRWDFH-YTWAJWBKSA-N Thr-Arg-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N)O GZYNMZQXFRWDFH-YTWAJWBKSA-N 0.000 description 1
- CEXFELBFVHLYDZ-XGEHTFHBSA-N Thr-Arg-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O CEXFELBFVHLYDZ-XGEHTFHBSA-N 0.000 description 1
- NRUPKQSXTJNQGD-XGEHTFHBSA-N Thr-Cys-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NRUPKQSXTJNQGD-XGEHTFHBSA-N 0.000 description 1
- ASJDFGOPDCVXTG-KATARQTJSA-N Thr-Cys-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O ASJDFGOPDCVXTG-KATARQTJSA-N 0.000 description 1
- LAFLAXHTDVNVEL-WDCWCFNPSA-N Thr-Gln-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O LAFLAXHTDVNVEL-WDCWCFNPSA-N 0.000 description 1
- DXNUZQGVOMCGNS-SWRJLBSHSA-N Thr-Gln-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O DXNUZQGVOMCGNS-SWRJLBSHSA-N 0.000 description 1
- BIENEHRYNODTLP-HJGDQZAQSA-N Thr-Glu-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCSC)C(=O)O)N)O BIENEHRYNODTLP-HJGDQZAQSA-N 0.000 description 1
- XOTBWOCSLMBGMF-SUSMZKCASA-N Thr-Glu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOTBWOCSLMBGMF-SUSMZKCASA-N 0.000 description 1
- DJDSEDOKJTZBAR-ZDLURKLDSA-N Thr-Gly-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O DJDSEDOKJTZBAR-ZDLURKLDSA-N 0.000 description 1
- SIMKLINEDYOTKL-MBLNEYKQSA-N Thr-His-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C)C(=O)O)N)O SIMKLINEDYOTKL-MBLNEYKQSA-N 0.000 description 1
- AYCQVUUPIJHJTA-IXOXFDKPSA-N Thr-His-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O AYCQVUUPIJHJTA-IXOXFDKPSA-N 0.000 description 1
- UYTYTDMCDBPDSC-URLPEUOOSA-N Thr-Ile-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N UYTYTDMCDBPDSC-URLPEUOOSA-N 0.000 description 1
- MEJHFIOYJHTWMK-VOAKCMCISA-N Thr-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)[C@@H](C)O MEJHFIOYJHTWMK-VOAKCMCISA-N 0.000 description 1
- MECLEFZMPPOEAC-VOAKCMCISA-N Thr-Leu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MECLEFZMPPOEAC-VOAKCMCISA-N 0.000 description 1
- NCXVJIQMWSGRHY-KXNHARMFSA-N Thr-Leu-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O NCXVJIQMWSGRHY-KXNHARMFSA-N 0.000 description 1
- YOOAQCZYZHGUAZ-KATARQTJSA-N Thr-Leu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YOOAQCZYZHGUAZ-KATARQTJSA-N 0.000 description 1
- MGJLBZFUXUGMML-VOAKCMCISA-N Thr-Lys-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MGJLBZFUXUGMML-VOAKCMCISA-N 0.000 description 1
- KDGBLMDAPJTQIW-RHYQMDGZSA-N Thr-Met-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)O)N)O KDGBLMDAPJTQIW-RHYQMDGZSA-N 0.000 description 1
- MXDOAJQRJBMGMO-FJXKBIBVSA-N Thr-Pro-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O MXDOAJQRJBMGMO-FJXKBIBVSA-N 0.000 description 1
- OLFOOYQTTQSSRK-UNQGMJICSA-N Thr-Pro-Phe Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OLFOOYQTTQSSRK-UNQGMJICSA-N 0.000 description 1
- WPSKTVVMQCXPRO-BWBBJGPYSA-N Thr-Ser-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WPSKTVVMQCXPRO-BWBBJGPYSA-N 0.000 description 1
- COYHRQWNJDJCNA-NUJDXYNKSA-N Thr-Thr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O COYHRQWNJDJCNA-NUJDXYNKSA-N 0.000 description 1
- BJJRNAVDQGREGC-HOUAVDHOSA-N Thr-Trp-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O BJJRNAVDQGREGC-HOUAVDHOSA-N 0.000 description 1
- AUYYCJSJGJYCDS-LBPRGKRZSA-N Thyrolar Chemical class IC1=CC(C[C@H](N)C(O)=O)=CC(I)=C1OC1=CC=C(O)C(I)=C1 AUYYCJSJGJYCDS-LBPRGKRZSA-N 0.000 description 1
- 102000004357 Transferases Human genes 0.000 description 1
- 108090000992 Transferases Proteins 0.000 description 1
- 239000007983 Tris buffer Substances 0.000 description 1
- BIJDDZBDSJLWJY-PJODQICGSA-N Trp-Ala-Val Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O BIJDDZBDSJLWJY-PJODQICGSA-N 0.000 description 1
- MVHHTXAUJCIOMZ-WDSOQIARSA-N Trp-Arg-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N MVHHTXAUJCIOMZ-WDSOQIARSA-N 0.000 description 1
- LTLBNCDNXQCOLB-UBHSHLNASA-N Trp-Asp-Ser Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 LTLBNCDNXQCOLB-UBHSHLNASA-N 0.000 description 1
- LGEPIBQBGZTBHL-SXNHZJKMSA-N Trp-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N LGEPIBQBGZTBHL-SXNHZJKMSA-N 0.000 description 1
- CZWIHKFGHICAJX-BPUTZDHNSA-N Trp-Glu-Glu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O)=CNC2=C1 CZWIHKFGHICAJX-BPUTZDHNSA-N 0.000 description 1
- OJKVFAWXPGCJMF-BPUTZDHNSA-N Trp-Pro-Ser Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)N[C@@H](CO)C(=O)O OJKVFAWXPGCJMF-BPUTZDHNSA-N 0.000 description 1
- DTPWXZXGFAHEKL-NWLDYVSISA-N Trp-Thr-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O DTPWXZXGFAHEKL-NWLDYVSISA-N 0.000 description 1
- HTGJDTPQYFMKNC-VFAJRCTISA-N Trp-Thr-Leu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)[C@@H](C)O)=CNC2=C1 HTGJDTPQYFMKNC-VFAJRCTISA-N 0.000 description 1
- 102000004243 Tubulin Human genes 0.000 description 1
- 108090000704 Tubulin Proteins 0.000 description 1
- OOEUVMFKKZYSRX-LEWSCRJBSA-N Tyr-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N OOEUVMFKKZYSRX-LEWSCRJBSA-N 0.000 description 1
- MICSYKFECRFCTJ-IHRRRGAJSA-N Tyr-Arg-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O MICSYKFECRFCTJ-IHRRRGAJSA-N 0.000 description 1
- GFHYISDTIWZUSU-QWRGUYRKSA-N Tyr-Asn-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GFHYISDTIWZUSU-QWRGUYRKSA-N 0.000 description 1
- AYHSJESDFKREAR-KKUMJFAQSA-N Tyr-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AYHSJESDFKREAR-KKUMJFAQSA-N 0.000 description 1
- UXUFNBVCPAWACG-SIUGBPQLSA-N Tyr-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N UXUFNBVCPAWACG-SIUGBPQLSA-N 0.000 description 1
- HDSKHCBAVVWPCQ-FHWLQOOXSA-N Tyr-Glu-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HDSKHCBAVVWPCQ-FHWLQOOXSA-N 0.000 description 1
- CNLKDWSAORJEMW-KWQFWETISA-N Tyr-Gly-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](C)C(O)=O CNLKDWSAORJEMW-KWQFWETISA-N 0.000 description 1
- HIINQLBHPIQYHN-JTQLQIEISA-N Tyr-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 HIINQLBHPIQYHN-JTQLQIEISA-N 0.000 description 1
- FNWGDMZVYBVAGJ-XEGUGMAKSA-N Tyr-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC1=CC=C(C=C1)O)N FNWGDMZVYBVAGJ-XEGUGMAKSA-N 0.000 description 1
- AZGZDDNKFFUDEH-QWRGUYRKSA-N Tyr-Gly-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AZGZDDNKFFUDEH-QWRGUYRKSA-N 0.000 description 1
- CTDPLKMBVALCGN-JSGCOSHPSA-N Tyr-Gly-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O CTDPLKMBVALCGN-JSGCOSHPSA-N 0.000 description 1
- BSCBBPKDVOZICB-KKUMJFAQSA-N Tyr-Leu-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BSCBBPKDVOZICB-KKUMJFAQSA-N 0.000 description 1
- DWAMXBFJNZIHMC-KBPBESRZSA-N Tyr-Leu-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O DWAMXBFJNZIHMC-KBPBESRZSA-N 0.000 description 1
- ZOBLBMGJKVJVEV-BZSNNMDCSA-N Tyr-Lys-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N)O ZOBLBMGJKVJVEV-BZSNNMDCSA-N 0.000 description 1
- AVFGBGGRZOKSFS-KJEVXHAQSA-N Tyr-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O AVFGBGGRZOKSFS-KJEVXHAQSA-N 0.000 description 1
- PHKQVWWHRYUCJL-HJOGWXRNSA-N Tyr-Phe-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O PHKQVWWHRYUCJL-HJOGWXRNSA-N 0.000 description 1
- VYQQQIRHIFALGE-UWJYBYFXSA-N Tyr-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 VYQQQIRHIFALGE-UWJYBYFXSA-N 0.000 description 1
- LVFZXRQQQDTBQH-IRIUXVKKSA-N Tyr-Thr-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O LVFZXRQQQDTBQH-IRIUXVKKSA-N 0.000 description 1
- HZDQUVQEVVYDDA-ACRUOGEOSA-N Tyr-Tyr-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=C(O)C=C1 HZDQUVQEVVYDDA-ACRUOGEOSA-N 0.000 description 1
- MJUTYRIMFIICKL-JYJNAYRXSA-N Tyr-Val-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MJUTYRIMFIICKL-JYJNAYRXSA-N 0.000 description 1
- KLOZTPOXVVRVAQ-DZKIICNBSA-N Tyr-Val-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 KLOZTPOXVVRVAQ-DZKIICNBSA-N 0.000 description 1
- NVJCMGGZHOJNBU-UFYCRDLUSA-N Tyr-Val-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N NVJCMGGZHOJNBU-UFYCRDLUSA-N 0.000 description 1
- 101150013568 US16 gene Proteins 0.000 description 1
- 101150071882 US17 gene Proteins 0.000 description 1
- 101150114976 US21 gene Proteins 0.000 description 1
- 101800001117 Ubiquitin-related Proteins 0.000 description 1
- ASQFIHTXXMFENG-XPUUQOCRSA-N Val-Ala-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O ASQFIHTXXMFENG-XPUUQOCRSA-N 0.000 description 1
- JIODCDXKCJRMEH-NHCYSSNCSA-N Val-Arg-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N JIODCDXKCJRMEH-NHCYSSNCSA-N 0.000 description 1
- CWOSXNKDOACNJN-BZSNNMDCSA-N Val-Arg-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N CWOSXNKDOACNJN-BZSNNMDCSA-N 0.000 description 1
- UDNYEPLJTRDMEJ-RCOVLWMOSA-N Val-Asn-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N UDNYEPLJTRDMEJ-RCOVLWMOSA-N 0.000 description 1
- JLFKWDAZBRYCGX-ZKWXMUAHSA-N Val-Asn-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N JLFKWDAZBRYCGX-ZKWXMUAHSA-N 0.000 description 1
- QHDXUYOYTPWCSK-RCOVLWMOSA-N Val-Asp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N QHDXUYOYTPWCSK-RCOVLWMOSA-N 0.000 description 1
- TZVUSFMQWPWHON-NHCYSSNCSA-N Val-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N TZVUSFMQWPWHON-NHCYSSNCSA-N 0.000 description 1
- BMGOFDMKDVVGJG-NHCYSSNCSA-N Val-Asp-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BMGOFDMKDVVGJG-NHCYSSNCSA-N 0.000 description 1
- YODDULVCGFQRFZ-ZKWXMUAHSA-N Val-Asp-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O YODDULVCGFQRFZ-ZKWXMUAHSA-N 0.000 description 1
- OVLIFGQSBSNGHY-KKHAAJSZSA-N Val-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N)O OVLIFGQSBSNGHY-KKHAAJSZSA-N 0.000 description 1
- ZEVNVXYRZRIRCH-GVXVVHGQSA-N Val-Gln-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N ZEVNVXYRZRIRCH-GVXVVHGQSA-N 0.000 description 1
- AGKDVLSDNSTLFA-UMNHJUIQSA-N Val-Gln-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N AGKDVLSDNSTLFA-UMNHJUIQSA-N 0.000 description 1
- XGJLNBNZNMVJRS-NRPADANISA-N Val-Glu-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O XGJLNBNZNMVJRS-NRPADANISA-N 0.000 description 1
- CVIXTAITYJQMPE-LAEOZQHASA-N Val-Glu-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CVIXTAITYJQMPE-LAEOZQHASA-N 0.000 description 1
- XWYUBUYQMOUFRQ-IFFSRLJSSA-N Val-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N)O XWYUBUYQMOUFRQ-IFFSRLJSSA-N 0.000 description 1
- PMDOQZFYGWZSTK-LSJOCFKGSA-N Val-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C PMDOQZFYGWZSTK-LSJOCFKGSA-N 0.000 description 1
- URIRWLJVWHYLET-ONGXEEELSA-N Val-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C URIRWLJVWHYLET-ONGXEEELSA-N 0.000 description 1
- YTUABZMPYKCWCQ-XQQFMLRXSA-N Val-His-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N2CCC[C@@H]2C(=O)O)N YTUABZMPYKCWCQ-XQQFMLRXSA-N 0.000 description 1
- ZIGZPYJXIWLQFC-QTKMDUPCSA-N Val-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](C(C)C)N)O ZIGZPYJXIWLQFC-QTKMDUPCSA-N 0.000 description 1
- SDUBQHUJJWQTEU-XUXIUFHCSA-N Val-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C(C)C)N SDUBQHUJJWQTEU-XUXIUFHCSA-N 0.000 description 1
- APEBUJBRGCMMHP-HJWJTTGWSA-N Val-Ile-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 APEBUJBRGCMMHP-HJWJTTGWSA-N 0.000 description 1
- JZWZACGUZVCQPS-RNJOBUHISA-N Val-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N JZWZACGUZVCQPS-RNJOBUHISA-N 0.000 description 1
- APQIVBCUIUDSMB-OSUNSFLBSA-N Val-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N APQIVBCUIUDSMB-OSUNSFLBSA-N 0.000 description 1
- FEXILLGKGGTLRI-NHCYSSNCSA-N Val-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N FEXILLGKGGTLRI-NHCYSSNCSA-N 0.000 description 1
- BTWMICVCQLKKNR-DCAQKATOSA-N Val-Leu-Ser Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C([O-])=O BTWMICVCQLKKNR-DCAQKATOSA-N 0.000 description 1
- SYSWVVCYSXBVJG-RHYQMDGZSA-N Val-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N)O SYSWVVCYSXBVJG-RHYQMDGZSA-N 0.000 description 1
- RWOGENDAOGMHLX-DCAQKATOSA-N Val-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N RWOGENDAOGMHLX-DCAQKATOSA-N 0.000 description 1
- ZRSZTKTVPNSUNA-IHRRRGAJSA-N Val-Lys-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)C(C)C)C(O)=O ZRSZTKTVPNSUNA-IHRRRGAJSA-N 0.000 description 1
- MHHAWNPHDLCPLF-ULQDDVLXSA-N Val-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=CC=C1 MHHAWNPHDLCPLF-ULQDDVLXSA-N 0.000 description 1
- RYQUMYBMOJYYDK-NHCYSSNCSA-N Val-Pro-Glu Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RYQUMYBMOJYYDK-NHCYSSNCSA-N 0.000 description 1
- QSPOLEBZTMESFY-SRVKXCTJSA-N Val-Pro-Val Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O QSPOLEBZTMESFY-SRVKXCTJSA-N 0.000 description 1
- KSFXWENSJABBFI-ZKWXMUAHSA-N Val-Ser-Asn Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KSFXWENSJABBFI-ZKWXMUAHSA-N 0.000 description 1
- VHIZXDZMTDVFGX-DCAQKATOSA-N Val-Ser-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N VHIZXDZMTDVFGX-DCAQKATOSA-N 0.000 description 1
- CEKSLIVSNNGOKH-KZVJFYERSA-N Val-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](C(C)C)N)O CEKSLIVSNNGOKH-KZVJFYERSA-N 0.000 description 1
- GVNLOVJNNDZUHS-RHYQMDGZSA-N Val-Thr-Lys Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O GVNLOVJNNDZUHS-RHYQMDGZSA-N 0.000 description 1
- PDDJTOSAVNRJRH-UNQGMJICSA-N Val-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](C(C)C)N)O PDDJTOSAVNRJRH-UNQGMJICSA-N 0.000 description 1
- QPJSIBAOZBVELU-BPNCWPANSA-N Val-Tyr-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N QPJSIBAOZBVELU-BPNCWPANSA-N 0.000 description 1
- RLVTVHSDKHBFQP-ULQDDVLXSA-N Val-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=C(O)C=C1 RLVTVHSDKHBFQP-ULQDDVLXSA-N 0.000 description 1
- ODUHAIXFXFACDY-SRVKXCTJSA-N Val-Val-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)C(C)C ODUHAIXFXFACDY-SRVKXCTJSA-N 0.000 description 1
- XNLUVJPMPAZHCY-JYJNAYRXSA-N Val-Val-Phe Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 XNLUVJPMPAZHCY-JYJNAYRXSA-N 0.000 description 1
- 102000005789 Vascular Endothelial Growth Factors Human genes 0.000 description 1
- 108010019530 Vascular Endothelial Growth Factors Proteins 0.000 description 1
- 206010047897 Weight gain poor Diseases 0.000 description 1
- 206010047924 Wheezing Diseases 0.000 description 1
- 108010017070 Zinc Finger Nucleases Proteins 0.000 description 1
- ONVRFFJRASWLTG-UHFFFAOYSA-N [(1-amino-2-naphthalen-1-ylethylidene)amino] morpholine-4-carboxylate Chemical compound C=1C=CC2=CC=CC=C2C=1CC(N)=NOC(=O)N1CCOCC1 ONVRFFJRASWLTG-UHFFFAOYSA-N 0.000 description 1
- HCAJCMUKLZSPFT-KWXKLSQISA-N [3-(dimethylamino)-2-[(9z,12z)-octadeca-9,12-dienoyl]oxypropyl] (9z,12z)-octadeca-9,12-dienoate Chemical compound CCCCC\C=C/C\C=C/CCCCCCCC(=O)OCC(CN(C)C)OC(=O)CCCCCCC\C=C/C\C=C/CCCCC HCAJCMUKLZSPFT-KWXKLSQISA-N 0.000 description 1
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 1
- UOXMAJQKZWCZOR-UHFFFAOYSA-N [O-][Si]([O-])(O)O.P.[Ca+2] Chemical compound [O-][Si]([O-])(O)O.P.[Ca+2] UOXMAJQKZWCZOR-UHFFFAOYSA-N 0.000 description 1
- 230000002159 abnormal effect Effects 0.000 description 1
- 238000002835 absorbance Methods 0.000 description 1
- 230000009056 active transport Effects 0.000 description 1
- 150000003838 adenosines Chemical class 0.000 description 1
- 238000012387 aerosolization Methods 0.000 description 1
- 239000000556 agonist Substances 0.000 description 1
- 208000037883 airway inflammation Diseases 0.000 description 1
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 1
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 1
- 108010069490 alanyl-glycyl-seryl-glutamic acid Proteins 0.000 description 1
- 108010045023 alanyl-prolyl-tyrosine Proteins 0.000 description 1
- 108010047495 alanylglycine Proteins 0.000 description 1
- 108010070944 alanylhistidine Proteins 0.000 description 1
- 108010070783 alanyltyrosine Proteins 0.000 description 1
- 229940072056 alginate Drugs 0.000 description 1
- 235000010443 alginic acid Nutrition 0.000 description 1
- 229920000615 alginic acid Polymers 0.000 description 1
- 125000004419 alkynylene group Chemical group 0.000 description 1
- 208000026935 allergic disease Diseases 0.000 description 1
- 239000008168 almond oil Substances 0.000 description 1
- DTOSIQBPPRVQHS-PDBXOOCHSA-N alpha-linolenic acid Chemical compound CC\C=C/C\C=C/C\C=C/CCCCCCCC(O)=O DTOSIQBPPRVQHS-PDBXOOCHSA-N 0.000 description 1
- 235000020661 alpha-linolenic acid Nutrition 0.000 description 1
- 125000000539 amino acid group Chemical group 0.000 description 1
- 150000001414 amino alcohols Chemical class 0.000 description 1
- 150000008064 anhydrides Chemical group 0.000 description 1
- 239000005557 antagonist Substances 0.000 description 1
- 239000008346 aqueous phase Substances 0.000 description 1
- 239000007864 aqueous solution Substances 0.000 description 1
- 108010008355 arginyl-glutamine Proteins 0.000 description 1
- 108010069926 arginyl-glycyl-serine Proteins 0.000 description 1
- 108010068991 arginyl-threonyl-prolyl-prolyl-prolyl-seryl-glycine Proteins 0.000 description 1
- 108010060035 arginylproline Proteins 0.000 description 1
- 125000000732 arylene group Chemical group 0.000 description 1
- 235000010323 ascorbic acid Nutrition 0.000 description 1
- 239000011668 ascorbic acid Substances 0.000 description 1
- 229960005070 ascorbic acid Drugs 0.000 description 1
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 1
- 108010038633 aspartylglutamate Proteins 0.000 description 1
- 238000000429 assembly Methods 0.000 description 1
- 230000000712 assembly Effects 0.000 description 1
- 108010028263 bacteriophage T3 RNA polymerase Proteins 0.000 description 1
- 230000004888 barrier function Effects 0.000 description 1
- 238000010009 beating Methods 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- IQFYYKKMVGJFEH-UHFFFAOYSA-N beta-L-thymidine Natural products O=C1NC(=O)C(C)=CN1C1OC(CO)C(O)C1 IQFYYKKMVGJFEH-UHFFFAOYSA-N 0.000 description 1
- 239000011230 binding agent Substances 0.000 description 1
- 238000004166 bioassay Methods 0.000 description 1
- 230000004071 biological effect Effects 0.000 description 1
- 229960002685 biotin Drugs 0.000 description 1
- 235000020958 biotin Nutrition 0.000 description 1
- 239000011616 biotin Substances 0.000 description 1
- 230000006287 biotinylation Effects 0.000 description 1
- 238000007413 biotinylation Methods 0.000 description 1
- 230000000903 blocking effect Effects 0.000 description 1
- 210000000621 bronchi Anatomy 0.000 description 1
- 239000000337 buffer salt Substances 0.000 description 1
- 210000004899 c-terminal region Anatomy 0.000 description 1
- FUFJGUQYACFECW-UHFFFAOYSA-L calcium hydrogenphosphate Chemical compound [Ca+2].OP([O-])([O-])=O FUFJGUQYACFECW-UHFFFAOYSA-L 0.000 description 1
- 239000001506 calcium phosphate Substances 0.000 description 1
- 229910000389 calcium phosphate Inorganic materials 0.000 description 1
- 235000011010 calcium phosphates Nutrition 0.000 description 1
- 229940022399 cancer vaccine Drugs 0.000 description 1
- 238000009566 cancer vaccine Methods 0.000 description 1
- 239000002775 capsule Substances 0.000 description 1
- 238000004113 cell culture Methods 0.000 description 1
- 230000003915 cell function Effects 0.000 description 1
- 239000013592 cell lysate Substances 0.000 description 1
- 230000001413 cellular effect Effects 0.000 description 1
- 230000004700 cellular uptake Effects 0.000 description 1
- 239000001913 cellulose Substances 0.000 description 1
- 235000010980 cellulose Nutrition 0.000 description 1
- 229920002678 cellulose Polymers 0.000 description 1
- 150000001783 ceramides Chemical class 0.000 description 1
- 229930183167 cerebroside Natural products 0.000 description 1
- 150000001784 cerebrosides Chemical class 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 238000012512 characterization method Methods 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- ZYWFEOZQIUMEGL-UHFFFAOYSA-N chloroform;3-methylbutan-1-ol;phenol Chemical compound ClC(Cl)Cl.CC(C)CCO.OC1=CC=CC=C1 ZYWFEOZQIUMEGL-UHFFFAOYSA-N 0.000 description 1
- 150000001841 cholesterols Chemical class 0.000 description 1
- 210000004081 cilia Anatomy 0.000 description 1
- OHCQJHSOBUTRHG-UHFFFAOYSA-N colforsin Natural products OC12C(=O)CC(C)(C=C)OC1(C)C(OC(=O)C)C(O)C1C2(C)C(O)CCC1(C)C OHCQJHSOBUTRHG-UHFFFAOYSA-N 0.000 description 1
- 229920001436 collagen Polymers 0.000 description 1
- 239000003086 colorant Substances 0.000 description 1
- 238000004891 communication Methods 0.000 description 1
- 239000002299 complementary DNA Substances 0.000 description 1
- 229940125904 compound 1 Drugs 0.000 description 1
- 238000012790 confirmation Methods 0.000 description 1
- 239000000470 constituent Substances 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 238000001816 cooling Methods 0.000 description 1
- 239000008120 corn starch Substances 0.000 description 1
- 229940099112 cornstarch Drugs 0.000 description 1
- 230000002596 correlated effect Effects 0.000 description 1
- 239000002537 cosmetic Substances 0.000 description 1
- 125000000392 cycloalkenyl group Chemical group 0.000 description 1
- 210000000805 cytoplasm Anatomy 0.000 description 1
- 238000002784 cytotoxicity assay Methods 0.000 description 1
- 231100000263 cytotoxicity test Toxicity 0.000 description 1
- SPTYHKZRPFATHJ-HYZXJONISA-N dT6 Chemical compound O=C1NC(=O)C(C)=CN1[C@@H]1O[C@H](COP(O)(=O)O[C@@H]2[C@H](O[C@H](C2)N2C(NC(=O)C(C)=C2)=O)COP(O)(=O)O[C@@H]2[C@H](O[C@H](C2)N2C(NC(=O)C(C)=C2)=O)COP(O)(=O)O[C@@H]2[C@H](O[C@H](C2)N2C(NC(=O)C(C)=C2)=O)COP(O)(=O)O[C@@H]2[C@H](O[C@H](C2)N2C(NC(=O)C(C)=C2)=O)COP(O)(=O)O[C@@H]2[C@H](O[C@H](C2)N2C(NC(=O)C(C)=C2)=O)CO)[C@@H](O)C1 SPTYHKZRPFATHJ-HYZXJONISA-N 0.000 description 1
- CYQFCXCEBYINGO-IAGOWNOFSA-N delta1-THC Chemical compound C1=C(C)CC[C@H]2C(C)(C)OC3=CC(CCCCC)=CC(O)=C3[C@@H]21 CYQFCXCEBYINGO-IAGOWNOFSA-N 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 238000000151 deposition Methods 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 239000003599 detergent Substances 0.000 description 1
- 230000001627 detrimental effect Effects 0.000 description 1
- 238000003745 diagnosis Methods 0.000 description 1
- 125000004663 dialkyl amino group Chemical group 0.000 description 1
- 235000019700 dicalcium phosphate Nutrition 0.000 description 1
- UMGXUWVIJIQANV-UHFFFAOYSA-M didecyl(dimethyl)azanium;bromide Chemical compound [Br-].CCCCCCCCCC[N+](C)(C)CCCCCCCCCC UMGXUWVIJIQANV-UHFFFAOYSA-M 0.000 description 1
- 230000004069 differentiation Effects 0.000 description 1
- 230000001079 digestive effect Effects 0.000 description 1
- 102000038379 digestive enzymes Human genes 0.000 description 1
- 108091007734 digestive enzymes Proteins 0.000 description 1
- 210000002249 digestive system Anatomy 0.000 description 1
- LTMHDMANZUZIPE-PUGKRICDSA-N digoxin Chemical compound C1[C@H](O)[C@H](O)[C@@H](C)O[C@H]1O[C@@H]1[C@@H](C)O[C@@H](O[C@@H]2[C@H](O[C@@H](O[C@@H]3C[C@@H]4[C@]([C@@H]5[C@H]([C@]6(CC[C@@H]([C@@]6(C)[C@H](O)C5)C=5COC(=O)C=5)O)CC4)(C)CC3)C[C@@H]2O)C)C[C@@H]1O LTMHDMANZUZIPE-PUGKRICDSA-N 0.000 description 1
- 229960005156 digoxin Drugs 0.000 description 1
- LTMHDMANZUZIPE-UHFFFAOYSA-N digoxine Natural products C1C(O)C(O)C(C)OC1OC1C(C)OC(OC2C(OC(OC3CC4C(C5C(C6(CCC(C6(C)C(O)C5)C=5COC(=O)C=5)O)CC4)(C)CC3)CC2O)C)CC1O LTMHDMANZUZIPE-UHFFFAOYSA-N 0.000 description 1
- 125000002147 dimethylamino group Chemical group [H]C([H])([H])N(*)C([H])([H])[H] 0.000 description 1
- SWSQBOPZIKWTGO-UHFFFAOYSA-N dimethylaminoamidine Natural products CN(C)C(N)=N SWSQBOPZIKWTGO-UHFFFAOYSA-N 0.000 description 1
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 1
- 230000005750 disease progression Effects 0.000 description 1
- 239000007884 disintegrant Substances 0.000 description 1
- 231100000673 dose–response relationship Toxicity 0.000 description 1
- 229960004242 dronabinol Drugs 0.000 description 1
- 229940079593 drug Drugs 0.000 description 1
- 239000013583 drug formulation Substances 0.000 description 1
- 239000012636 effector Substances 0.000 description 1
- 230000008030 elimination Effects 0.000 description 1
- 238000003379 elimination reaction Methods 0.000 description 1
- 229940121437 eluforsen Drugs 0.000 description 1
- 239000003995 emulsifying agent Substances 0.000 description 1
- 150000002081 enamines Chemical class 0.000 description 1
- 210000002889 endothelial cell Anatomy 0.000 description 1
- 230000003511 endothelial effect Effects 0.000 description 1
- 230000002708 enhancing effect Effects 0.000 description 1
- CCIVGXIOQKPBKL-UHFFFAOYSA-N ethanesulfonic acid Chemical compound CCS(O)(=O)=O CCIVGXIOQKPBKL-UHFFFAOYSA-N 0.000 description 1
- 235000019441 ethanol Nutrition 0.000 description 1
- 238000012869 ethanol precipitation Methods 0.000 description 1
- BEFDCLMNVWHSGT-UHFFFAOYSA-N ethenylcyclopentane Chemical compound C=CC1CCCC1 BEFDCLMNVWHSGT-UHFFFAOYSA-N 0.000 description 1
- 125000001495 ethyl group Chemical group [H]C([H])([H])C([H])([H])* 0.000 description 1
- 210000003527 eukaryotic cell Anatomy 0.000 description 1
- 238000001704 evaporation Methods 0.000 description 1
- 230000029142 excretion Effects 0.000 description 1
- 230000001747 exhibiting effect Effects 0.000 description 1
- 102000013165 exonuclease Human genes 0.000 description 1
- 210000001808 exosome Anatomy 0.000 description 1
- 238000013265 extended release Methods 0.000 description 1
- 238000001125 extrusion Methods 0.000 description 1
- 239000003925 fat Substances 0.000 description 1
- 235000019197 fats Nutrition 0.000 description 1
- 239000000945 filler Substances 0.000 description 1
- 239000000796 flavoring agent Substances 0.000 description 1
- 235000019634 flavors Nutrition 0.000 description 1
- GNBHRKFJIUUOQI-UHFFFAOYSA-N fluorescein Chemical compound O1C(=O)C2=CC=CC=C2C21C1=CC=C(O)C=C1OC1=CC(O)=CC=C21 GNBHRKFJIUUOQI-UHFFFAOYSA-N 0.000 description 1
- 229960002143 fluorescein Drugs 0.000 description 1
- 235000003599 food sweetener Nutrition 0.000 description 1
- 150000002243 furanoses Chemical group 0.000 description 1
- 230000000799 fusogenic effect Effects 0.000 description 1
- 108010006664 gamma-glutamyl-glycyl-glycine Proteins 0.000 description 1
- 150000002270 gangliosides Chemical class 0.000 description 1
- 229920000159 gelatin Polymers 0.000 description 1
- 239000008273 gelatin Substances 0.000 description 1
- 235000019322 gelatine Nutrition 0.000 description 1
- 235000011852 gelatine desserts Nutrition 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 108060003196 globin Proteins 0.000 description 1
- 102000018146 globin Human genes 0.000 description 1
- 108010049041 glutamylalanine Proteins 0.000 description 1
- 108010079547 glutamylmethionine Proteins 0.000 description 1
- 108020004445 glyceraldehyde-3-phosphate dehydrogenase Proteins 0.000 description 1
- 125000003827 glycol group Chemical group 0.000 description 1
- 108010027668 glycyl-alanyl-valine Proteins 0.000 description 1
- 108010038983 glycyl-histidyl-lysine Proteins 0.000 description 1
- 108010077435 glycyl-phenylalanyl-glycine Proteins 0.000 description 1
- 108010089804 glycyl-threonine Proteins 0.000 description 1
- 108010050848 glycylleucine Proteins 0.000 description 1
- 239000001963 growth medium Substances 0.000 description 1
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 1
- 238000002657 hormone replacement therapy Methods 0.000 description 1
- 230000002209 hydrophobic effect Effects 0.000 description 1
- WGCNASOHLSPBMP-UHFFFAOYSA-N hydroxyacetaldehyde Natural products OCC=O WGCNASOHLSPBMP-UHFFFAOYSA-N 0.000 description 1
- 208000008384 ileus Diseases 0.000 description 1
- 150000002466 imines Chemical class 0.000 description 1
- 230000002519 immonomodulatory effect Effects 0.000 description 1
- 238000010166 immunofluorescence Methods 0.000 description 1
- 238000010569 immunofluorescence imaging Methods 0.000 description 1
- 230000002163 immunogen Effects 0.000 description 1
- 238000011532 immunohistochemical staining Methods 0.000 description 1
- 238000012750 in vivo screening Methods 0.000 description 1
- 208000015181 infectious disease Diseases 0.000 description 1
- 230000002757 inflammatory effect Effects 0.000 description 1
- 231100000824 inhalation exposure Toxicity 0.000 description 1
- 230000002401 inhibitory effect Effects 0.000 description 1
- 230000000977 initiatory effect Effects 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 229940117681 interleukin-12 Drugs 0.000 description 1
- 229940124829 interleukin-23 Drugs 0.000 description 1
- 230000000968 intestinal effect Effects 0.000 description 1
- 210000000936 intestine Anatomy 0.000 description 1
- 238000007918 intramuscular administration Methods 0.000 description 1
- 238000007912 intraperitoneal administration Methods 0.000 description 1
- 239000007928 intraperitoneal injection Substances 0.000 description 1
- 238000007913 intrathecal administration Methods 0.000 description 1
- 238000007914 intraventricular administration Methods 0.000 description 1
- 150000002500 ions Chemical class 0.000 description 1
- 229910052742 iron Inorganic materials 0.000 description 1
- 230000007794 irritation Effects 0.000 description 1
- 238000002955 isolation Methods 0.000 description 1
- 108010031424 isoleucyl-prolyl-proline Proteins 0.000 description 1
- 125000001449 isopropyl group Chemical group [H]C([H])([H])C([H])(*)C([H])([H])[H] 0.000 description 1
- PURKAOJPTOLRMP-UHFFFAOYSA-N ivacaftor Chemical compound C1=C(O)C(C(C)(C)C)=CC(C(C)(C)C)=C1NC(=O)C1=CNC2=CC=CC=C2C1=O PURKAOJPTOLRMP-UHFFFAOYSA-N 0.000 description 1
- 108010053037 kyotorphin Proteins 0.000 description 1
- 239000008101 lactose Substances 0.000 description 1
- 238000011031 large-scale manufacturing process Methods 0.000 description 1
- 239000010410 layer Substances 0.000 description 1
- 239000000787 lecithin Substances 0.000 description 1
- 235000010445 lecithin Nutrition 0.000 description 1
- 229940067606 lecithin Drugs 0.000 description 1
- 108010077158 leucinyl-arginyl-tryptophan Proteins 0.000 description 1
- 108010076756 leucyl-alanyl-phenylalanine Proteins 0.000 description 1
- 108010083708 leucyl-aspartyl-valine Proteins 0.000 description 1
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 1
- 108010000761 leucylarginine Proteins 0.000 description 1
- 235000020778 linoleic acid Nutrition 0.000 description 1
- OYHQOLUKZRVURQ-IXWMQOLASA-N linoleic acid Natural products CCCCC\C=C/C\C=C\CCCCCCCC(O)=O OYHQOLUKZRVURQ-IXWMQOLASA-N 0.000 description 1
- 229960004488 linolenic acid Drugs 0.000 description 1
- KQQKGWQCNNTQJW-UHFFFAOYSA-N linolenic acid Natural products CC=CCCC=CCC=CCCCCCCCC(O)=O KQQKGWQCNNTQJW-UHFFFAOYSA-N 0.000 description 1
- 239000012669 liquid formulation Substances 0.000 description 1
- 238000011068 loading method Methods 0.000 description 1
- 230000004807 localization Effects 0.000 description 1
- 239000000314 lubricant Substances 0.000 description 1
- 239000006166 lysate Substances 0.000 description 1
- 108010003700 lysyl aspartic acid Proteins 0.000 description 1
- 108010009298 lysylglutamic acid Proteins 0.000 description 1
- 108010038320 lysylphenylalanine Proteins 0.000 description 1
- 210000002540 macrophage Anatomy 0.000 description 1
- 229910001425 magnesium ion Inorganic materials 0.000 description 1
- 235000019359 magnesium stearate Nutrition 0.000 description 1
- 210000004962 mammalian cell Anatomy 0.000 description 1
- 239000012528 membrane Substances 0.000 description 1
- 210000004379 membrane Anatomy 0.000 description 1
- 230000004060 metabolic process Effects 0.000 description 1
- 108010068488 methionylphenylalanine Proteins 0.000 description 1
- 229940095102 methyl benzoate Drugs 0.000 description 1
- 229920000609 methyl cellulose Polymers 0.000 description 1
- 239000001923 methylcellulose Substances 0.000 description 1
- 108091070501 miRNA Proteins 0.000 description 1
- 239000000693 micelle Substances 0.000 description 1
- 244000005700 microbiome Species 0.000 description 1
- 235000019813 microcrystalline cellulose Nutrition 0.000 description 1
- 239000008108 microcrystalline cellulose Substances 0.000 description 1
- 229940016286 microcrystalline cellulose Drugs 0.000 description 1
- 230000003278 mimic effect Effects 0.000 description 1
- 239000003607 modifier Substances 0.000 description 1
- DILRJUIACXKSQE-UHFFFAOYSA-N n',n'-dimethylethane-1,2-diamine Chemical compound CN(C)CCN DILRJUIACXKSQE-UHFFFAOYSA-N 0.000 description 1
- NFQBIAXADRDUGK-KWXKLSQISA-N n,n-dimethyl-2,3-bis[(9z,12z)-octadeca-9,12-dienoxy]propan-1-amine Chemical compound CCCCC\C=C/C\C=C/CCCCCCCCOCC(CN(C)C)OCCCCCCCC\C=C/C\C=C/CCCCC NFQBIAXADRDUGK-KWXKLSQISA-N 0.000 description 1
- GLGLUQVVDHRLQK-WRBBJXAJSA-N n,n-dimethyl-2,3-bis[(z)-octadec-9-enoxy]propan-1-amine Chemical compound CCCCCCCC\C=C/CCCCCCCCOCC(CN(C)C)OCCCCCCCC\C=C/CCCCCCCC GLGLUQVVDHRLQK-WRBBJXAJSA-N 0.000 description 1
- 239000002088 nanocapsule Substances 0.000 description 1
- 239000002159 nanocrystal Substances 0.000 description 1
- 210000002850 nasal mucosa Anatomy 0.000 description 1
- 230000017074 necrotic cell death Effects 0.000 description 1
- 239000002353 niosome Substances 0.000 description 1
- IJGRMHOSHXDMSA-UHFFFAOYSA-N nitrogen Substances N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 1
- 239000012038 nucleophile Substances 0.000 description 1
- 230000000269 nucleophilic effect Effects 0.000 description 1
- 210000000056 organ Anatomy 0.000 description 1
- 210000000496 pancreas Anatomy 0.000 description 1
- 244000052769 pathogen Species 0.000 description 1
- 230000037361 pathway Effects 0.000 description 1
- 230000002572 peristaltic effect Effects 0.000 description 1
- 108010084572 phenylalanyl-valine Proteins 0.000 description 1
- 108010018625 phenylalanylarginine Proteins 0.000 description 1
- 108010073101 phenylalanylleucine Proteins 0.000 description 1
- 239000010452 phosphate Substances 0.000 description 1
- 150000008104 phosphatidylethanolamines Chemical class 0.000 description 1
- 150000003013 phosphoric acid derivatives Chemical group 0.000 description 1
- 230000010287 polarization Effects 0.000 description 1
- 229920000771 poly (alkylcyanoacrylate) Polymers 0.000 description 1
- 229920000723 poly(D-arginine) polymer Polymers 0.000 description 1
- 229920000747 poly(lactic acid) Polymers 0.000 description 1
- 229920000058 polyacrylate Polymers 0.000 description 1
- 230000008488 polyadenylation Effects 0.000 description 1
- 229920001610 polycaprolactone Polymers 0.000 description 1
- 239000004632 polycaprolactone Substances 0.000 description 1
- 229920000036 polyvinylpyrrolidone Polymers 0.000 description 1
- 239000001267 polyvinylpyrrolidone Substances 0.000 description 1
- 235000013855 polyvinylpyrrolidone Nutrition 0.000 description 1
- 231100000683 possible toxicity Toxicity 0.000 description 1
- 239000001103 potassium chloride Substances 0.000 description 1
- 235000011164 potassium chloride Nutrition 0.000 description 1
- 229920001592 potato starch Polymers 0.000 description 1
- 239000000843 powder Substances 0.000 description 1
- 239000002244 precipitate Substances 0.000 description 1
- 230000002028 premature Effects 0.000 description 1
- 239000003755 preservative agent Substances 0.000 description 1
- 230000002335 preservative effect Effects 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 108010031719 prolyl-serine Proteins 0.000 description 1
- 108010090894 prolylleucine Proteins 0.000 description 1
- 230000000069 prophylactic effect Effects 0.000 description 1
- 125000001436 propyl group Chemical group [H]C([*])([H])C([H])([H])C([H])([H])[H] 0.000 description 1
- QELSKZZBTMNZEB-UHFFFAOYSA-N propylparaben Chemical compound CCCOC(=O)C1=CC=C(O)C=C1 QELSKZZBTMNZEB-UHFFFAOYSA-N 0.000 description 1
- 229960003415 propylparaben Drugs 0.000 description 1
- 230000004952 protein activity Effects 0.000 description 1
- 108010030416 proteoliposomes Proteins 0.000 description 1
- 230000017854 proteolysis Effects 0.000 description 1
- 125000004076 pyridyl group Chemical group 0.000 description 1
- 238000011002 quantification Methods 0.000 description 1
- 238000003762 quantitative reverse transcription PCR Methods 0.000 description 1
- 238000010791 quenching Methods 0.000 description 1
- 108020003175 receptors Proteins 0.000 description 1
- 102000005962 receptors Human genes 0.000 description 1
- 230000006798 recombination Effects 0.000 description 1
- 238000005215 recombination Methods 0.000 description 1
- 230000000306 recurrent effect Effects 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 239000012925 reference material Substances 0.000 description 1
- 230000001105 regulatory effect Effects 0.000 description 1
- RWWYLEGWBNMMLJ-YSOARWBDSA-N remdesivir Chemical compound NC1=NC=NN2C1=CC=C2[C@]1([C@@H]([C@@H]([C@H](O1)CO[P@](=O)(OC1=CC=CC=C1)N[C@H](C(=O)OCC(CC)CC)C)O)O)C#N RWWYLEGWBNMMLJ-YSOARWBDSA-N 0.000 description 1
- 210000004994 reproductive system Anatomy 0.000 description 1
- 230000000241 respiratory effect Effects 0.000 description 1
- 210000002345 respiratory system Anatomy 0.000 description 1
- 230000000717 retained effect Effects 0.000 description 1
- 238000012552 review Methods 0.000 description 1
- RHFUOMFWUGWKKO-UHFFFAOYSA-N s2C Natural products S=C1N=C(N)C=CN1C1C(O)C(O)C(CO)O1 RHFUOMFWUGWKKO-UHFFFAOYSA-N 0.000 description 1
- 210000003296 saliva Anatomy 0.000 description 1
- HFHDHCJBZVLPGP-UHFFFAOYSA-N schardinger α-dextrin Chemical compound O1C(C(C2O)O)C(CO)OC2OC(C(C2O)O)C(CO)OC2OC(C(C2O)O)C(CO)OC2OC(C(O)C2O)C(CO)OC2OC(C(C2O)O)C(CO)OC2OC2C(O)C(O)C1OC2CO HFHDHCJBZVLPGP-UHFFFAOYSA-N 0.000 description 1
- 230000028327 secretion Effects 0.000 description 1
- 239000004065 semiconductor Substances 0.000 description 1
- 210000002966 serum Anatomy 0.000 description 1
- 108010069117 seryl-lysyl-aspartic acid Proteins 0.000 description 1
- 208000013220 shortness of breath Diseases 0.000 description 1
- 239000002356 single layer Substances 0.000 description 1
- 238000004513 sizing Methods 0.000 description 1
- 150000003384 small molecules Chemical class 0.000 description 1
- 229910052708 sodium Inorganic materials 0.000 description 1
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 1
- 229910001415 sodium ion Inorganic materials 0.000 description 1
- 235000019333 sodium laurylsulphate Nutrition 0.000 description 1
- 239000001488 sodium phosphate Substances 0.000 description 1
- 229910000162 sodium phosphate Inorganic materials 0.000 description 1
- 235000011008 sodium phosphates Nutrition 0.000 description 1
- SNOOUWRIMMFWNE-UHFFFAOYSA-M sodium;6-[(3,4,5-trimethoxybenzoyl)amino]hexanoate Chemical compound [Na+].COC1=CC(C(=O)NCCCCCC([O-])=O)=CC(OC)=C1OC SNOOUWRIMMFWNE-UHFFFAOYSA-M 0.000 description 1
- 239000002047 solid lipid nanoparticle Substances 0.000 description 1
- 235000010199 sorbic acid Nutrition 0.000 description 1
- 239000004334 sorbic acid Substances 0.000 description 1
- 229940075582 sorbic acid Drugs 0.000 description 1
- 239000000600 sorbitol Substances 0.000 description 1
- 229940063673 spermidine Drugs 0.000 description 1
- 150000003408 sphingolipids Chemical class 0.000 description 1
- 238000001694 spray drying Methods 0.000 description 1
- 210000003802 sputum Anatomy 0.000 description 1
- 208000024794 sputum Diseases 0.000 description 1
- 230000006641 stabilisation Effects 0.000 description 1
- 238000011105 stabilization Methods 0.000 description 1
- 239000003381 stabilizer Substances 0.000 description 1
- 238000010186 staining Methods 0.000 description 1
- 239000008107 starch Substances 0.000 description 1
- 235000019698 starch Nutrition 0.000 description 1
- 230000003637 steroidlike Effects 0.000 description 1
- 230000004936 stimulating effect Effects 0.000 description 1
- 238000007920 subcutaneous administration Methods 0.000 description 1
- 239000000375 suspending agent Substances 0.000 description 1
- 210000004243 sweat Anatomy 0.000 description 1
- 239000003765 sweetening agent Substances 0.000 description 1
- 230000002195 synergetic effect Effects 0.000 description 1
- YIBXWXOYFGZLRU-UHFFFAOYSA-N syringic aldehyde Natural products CC12CCC(C3(CCC(=O)C(C)(C)C3CC=3)C)C=3C1(C)CCC2C1COC(C)(C)C(O)C(O)C1 YIBXWXOYFGZLRU-UHFFFAOYSA-N 0.000 description 1
- 239000006188 syrup Substances 0.000 description 1
- 235000020357 syrup Nutrition 0.000 description 1
- 239000000454 talc Substances 0.000 description 1
- 229910052623 talc Inorganic materials 0.000 description 1
- 239000013077 target material Substances 0.000 description 1
- 210000001138 tear Anatomy 0.000 description 1
- 230000002123 temporal effect Effects 0.000 description 1
- 239000010409 thin film Substances 0.000 description 1
- 150000007970 thio esters Chemical class 0.000 description 1
- 108060008226 thioredoxin Proteins 0.000 description 1
- 229940094937 thioredoxin Drugs 0.000 description 1
- 108010071097 threonyl-lysyl-proline Proteins 0.000 description 1
- 108010072986 threonyl-seryl-lysine Proteins 0.000 description 1
- 229940104230 thymidine Drugs 0.000 description 1
- 210000001685 thyroid gland Anatomy 0.000 description 1
- 239000005495 thyroid hormone Substances 0.000 description 1
- 229940036555 thyroid hormone Drugs 0.000 description 1
- 230000000699 topical effect Effects 0.000 description 1
- 210000003437 trachea Anatomy 0.000 description 1
- 230000002103 transcriptional effect Effects 0.000 description 1
- 238000011830 transgenic mouse model Methods 0.000 description 1
- 108010036387 trimethionine Proteins 0.000 description 1
- 238000009966 trimming Methods 0.000 description 1
- UNXRWKVEANCORM-UHFFFAOYSA-N triphosphoric acid Chemical compound OP(O)(=O)OP(O)(=O)OP(O)(O)=O UNXRWKVEANCORM-UHFFFAOYSA-N 0.000 description 1
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 1
- RYFMWSXOAZQYPI-UHFFFAOYSA-K trisodium phosphate Chemical compound [Na+].[Na+].[Na+].[O-]P([O-])([O-])=O RYFMWSXOAZQYPI-UHFFFAOYSA-K 0.000 description 1
- 108010084932 tryptophyl-proline Proteins 0.000 description 1
- 108010038745 tryptophylglycine Proteins 0.000 description 1
- 108010044292 tryptophyltyrosine Proteins 0.000 description 1
- 108010017949 tyrosyl-glycyl-glycine Proteins 0.000 description 1
- 238000004704 ultra performance liquid chromatography Methods 0.000 description 1
- 241001515965 unidentified phage Species 0.000 description 1
- 108010073969 valyllysine Proteins 0.000 description 1
- 235000015112 vegetable and seed oil Nutrition 0.000 description 1
- 239000008158 vegetable oil Substances 0.000 description 1
- 229920002554 vinyl polymer Polymers 0.000 description 1
Landscapes
- Medicines That Contain Protein Lipid Enzymes And Other Medicines (AREA)
- Pharmaceuticals Containing Other Organic And Inorganic Compounds (AREA)
- Medicinal Preparation (AREA)
Abstract
The present application provides, inter alia, improved pharmaceutical compositions comprising a codon optimized mRNA encoding a peptide or polypeptide encapsulated in a lipid nanoparticle comprising one or more of a cationic lipid that is particularly effective for pulmonary delivery.
Description
Cross Reference to Related Applications
The present application claims priority from U.S. provisional application Ser. No. 63/111,321, filed 11/9/2020, and U.S. provisional application Ser. No. 63/195,581, filed 6/2021, the disclosures of each of which are hereby incorporated by reference.
Sequence listing
The description refers to the sequence listing (electronic submission of txt file under the name "MRT-2205WO_ST25" at month 9 of 2021). The txt file was generated at 2021, 11, 5, and was 92KB in size. The entire contents of the sequence listing are incorporated herein by reference.
Background
Messenger RNA Therapy (MRT) is becoming an increasingly important method for treating a variety of diseases. MRT involves administering messenger RNA (mRNA) to a patient in need of therapy for producing a protein encoded by the mRNA in the patient. Lipid nanoparticles are commonly used to encapsulate mRNA for efficient in vivo delivery of mRNA.
The success of an mRNA therapeutic will depend on deliberate and elaborate delivery systems that should direct mRNA into the desired compartments of the selected cells. However, humans and other organisms have developed natural barriers to protect their bodies against different kinds of pathogens or intruders. For example, the lungs contain mucus which traps microorganisms and particles, removing them from the lungs via synergistic beating of the motor cilia.
Disclosure of Invention
The invention provides, inter alia, pharmaceutical compositions comprising messenger RNA (mRNA) and methods of making and using the same. Notably, the cationic lipids described herein exhibit increased efficacy of pulmonary delivery and increased protein expression. In addition, the mRNAs described herein are codon optimized and exhibit increased levels of protein expression and activity. These pharmaceutical compositions may be used for improved treatment of pulmonary diseases.
The invention provides, inter alia, pharmaceutical compositions for delivery to a subject (e.g., a human subject or a cell of a human subject or a cell treated and delivered to a human subject) comprising a codon-optimized mRNA molecule encoding a peptide or polypeptide encapsulated in a lipid nanoparticle comprising one or more of the cationic lipids described herein.
In one aspect, the invention provides, inter alia, a composition for pulmonary delivery comprising mRNA encapsulated in a lipid nanoparticle, wherein the lipid nanoparticle comprises a cationic lipid as GL-TES-SA-DMP-E18-2.
In one aspect, the invention provides, inter alia, a composition for pulmonary delivery comprising mRNA encapsulated in a lipid nanoparticle, wherein the lipid nanoparticle comprises a cationic lipid that is GL-TES-SA-DME-E18-2.
In one aspect, the invention provides, inter alia, a composition for pulmonary delivery comprising mRNA encapsulated in a lipid nanoparticle, wherein the lipid nanoparticle comprises a cationic lipid that is TL 1-01D-DMA.
In one aspect, the invention provides, inter alia, a composition for pulmonary delivery comprising mRNA encapsulated in a lipid nanoparticle, wherein the lipid nanoparticle comprises a cationic lipid that is TL 1-04D-DMA.
In one aspect, the invention provides, inter alia, a composition for pulmonary delivery comprising mRNA encapsulated in a lipid nanoparticle, wherein the lipid nanoparticle comprises a cationic lipid that is SY-3-E14-DMAPr.
In one aspect, the invention provides, inter alia, a composition for pulmonary delivery comprising mRNA encapsulated in a lipid nanoparticle, wherein the lipid nanoparticle comprises a cationic lipid that is TL 1-10D-DMA.
In one aspect, the invention provides, inter alia, a composition for pulmonary delivery comprising mRNA encapsulated in a lipid nanoparticle, wherein the lipid nanoparticle comprises a cationic lipid as HEP-E3-E10.
In one aspect, the invention provides, inter alia, a composition for pulmonary delivery comprising mRNA encapsulated in a lipid nanoparticle, wherein the lipid nanoparticle comprises a cationic lipid as HEP-E4-E10.
In one aspect, the invention provides, inter alia, a composition for pulmonary delivery comprising mRNA encapsulated in a lipid nanoparticle, wherein the lipid nanoparticle comprises a cationic lipid as a Guan-SS-Chol.
In some embodiments, the mRNA is codon optimized.
In some embodiments, the lipid nanoparticle comprises one or more non-cationic lipids and one or more PEG-modified lipids. In some embodiments, the liposome comprises no more than three different lipid components. In some embodiments, the liposome comprises no more than four different lipid components.
In some embodiments, the liposome comprises four different lipid components, namely cationic lipids, non-cationic lipids, cholesterol, and PEG-modified lipids. In some embodiments, the molar ratio of cationic lipid to non-cationic lipid to cholesterol to PEG-modified lipid is between about 30-60:25-35:20-30:1-15, respectively.
In some embodiments, the liposome comprises three different lipid components, namely a cationic lipid (typically a sterol-based cationic lipid), a non-cationic lipid, and a PEG-modified lipid. In some embodiments, the molar ratio of cationic lipid to non-cationic lipid to PEG-modified lipid is about 60:35:5, respectively.
In some embodiments, the lipid nanoparticle comprises one or more cholesterol-based lipids.
In some embodiments, the lipid nanoparticle comprises a non-cationic lipid as DOPE.
In some embodiments, the lipid nanoparticle comprises a non-cationic lipid as DEPE.
In some embodiments, the lipid nanoparticle comprises a PEG-modified lipid as DMG-PEG 2K.
In some embodiments, the mRNA encodes a cystic fibrosis transmembrane conductance regulator (CFTR). In some embodiments, the mRNA encodes an ATP-binding cassette subfamily a member 3 protein. In some embodiments, the mRNA encodes a kinesin axin medium chain 1 (DNAI 1) protein. In some embodiments, the mRNA encodes a motor protein shaft heavy chain 5 (DNAH 5) protein. In some embodiments, the mRNA encodes an alpha-1-antitrypsin protein, a fork box P3 (FOXP 3) protein. In some embodiments, the mRNA encodes one or more surfactant proteins. In some embodiments, the mRNA encodes a surfactant a protein, a surfactant B protein, a surfactant C protein, or a surfactant D protein.
In certain embodiments, the invention provides a pharmaceutical composition for delivery to the lung or lung cells of a subject, the pharmaceutical composition comprising a codon-optimized mRNA molecule encoding a peptide or polypeptide encapsulated in a lipid nanoparticle comprising one or more of the cationic lipids described herein. In certain embodiments, the codon-optimized mRNA encapsulated in the lipid nanoparticle encodes a cystic fibrosis transmembrane conductance regulator (CFTR) protein. In certain embodiments, the codon-optimized mRNA encapsulated in the lipid nanoparticle encodes an ATP-binding cassette subfamily a member 3 protein. In certain embodiments, the codon-optimized mRNA encapsulated in the lipid nanoparticle encodes a kinesin axin medium-chain 1 (DNAI 1) protein. In certain embodiments, the codon-optimized mRNA encapsulated in the lipid nanoparticle encodes a kinesin shaft heavy chain 5 (DNAH 5) protein. In certain embodiments, the codon-optimized mRNA encapsulated in the lipid nanoparticle encodes an alpha-1-antitrypsin protein. In certain embodiments, the codon-optimized mRNA encapsulated in the lipid nanoparticle encodes a fork box P3 (FOXP 3) protein. In certain embodiments, the codon-optimized mRNA encapsulated in such lipid nanoparticles encodes one or more surfactant proteins, such as one or more of surfactant a protein, surfactant B protein, surfactant C protein, and surfactant D protein.
In certain embodiments, the codon-optimized mRNA encapsulated in such lipid nanoparticles encodes an antigen. In certain embodiments, the codon-optimized mRNA encapsulated in such lipid nanoparticles encodes an antigen associated with cancer in a subject or an antigen identified from cancer cells in a subject. In certain embodiments, the codon-optimized mRNA encapsulated in such lipid nanoparticles encodes an antigen determined from the subject's own cancer cells, i.e., to provide a personalized cancer vaccine.
In certain embodiments, the codon-optimized mRNA encapsulated in such lipid nanoparticles encodes an antibody. In certain embodiments, the antibody may be a bispecific antibody. In certain embodiments, the antibody may be part of a fusion protein. In certain embodiments, the codon-optimized mRNA encapsulated in such lipid nanoparticles encodes an antibody to OX 40. In certain embodiments, the codon-optimized mRNA encapsulated in such lipid nanoparticles encodes an antibody to VEGF. In certain embodiments, the codon-optimized mRNA encapsulated in such lipid nanoparticles encodes an antibody to tissue necrosis factor alpha. In certain embodiments, the codon-optimized mRNA encapsulated in such lipid nanoparticles encodes an antibody to CD 3. In certain embodiments, the codon-optimized mRNA encapsulated in such lipid nanoparticles encodes an antibody to CD 19.
In certain embodiments, the codon-optimized mRNA encapsulated in the lipid nanoparticle encodes an immunomodulatory agent. In certain embodiments, the codon-optimized mRNA encapsulated in the lipid nanoparticle encodes interleukin 12. In certain embodiments, the codon-optimized mRNA encapsulated in the lipid nanoparticle encodes interleukin 23. In certain embodiments, the codon-optimized mRNA encapsulated in the lipid nanoparticle encodes interleukin 36 γ. In certain embodiments, the codon-optimized mRNA encapsulated in such lipid nanoparticles encodes a constitutively active variant of one or more interferon gene stimulating factor (STING) proteins.
In certain embodiments, the codon optimized mRNA encapsulated in the lipid nanoparticle encodes an endonuclease. In certain embodiments, the codon optimized mRNA encapsulated in such lipid nanoparticles encodes an RNA-guided DNA endonuclease protein, such as a Cas 9 protein. In certain embodiments, the codon-optimized mRNA encapsulated in the lipid nanoparticle encodes a meganuclease protein. In certain embodiments, the codon-optimized mRNA encapsulated in the lipid nanoparticle encodes a transcriptional activator-like effector nuclease protein. In certain embodiments, the codon optimized mRNA encapsulated in the lipid nanoparticle encodes a zinc finger nuclease protein.
In some embodiments, the mRNA molecule comprises a 5' untranslated region (UTR). In some embodiments, the mRNA molecule comprises a 3' untranslated region (UTR). In some embodiments, the 5' untranslated region (UTR) comprises SEQ ID NO:12. In some embodiments, the 3' untranslated region (UTR) comprises SEQ ID NO:13. In some embodiments, the 3' untranslated region (UTR) comprises SEQ ID NO:14.
In some embodiments, the mRNA molecule further comprises a poly a tail. In some embodiments, the mRNA molecule further comprises a poly a tail that is at least 70 residues in length. In some embodiments, the mRNA molecule further comprises a poly a tail of at least 100 residues in length. In some embodiments, the mRNA molecule further comprises a poly a tail of at least 120 residues in length. In some embodiments, the mRNA molecule further comprises a poly a tail that is at least 150 residues in length. In some embodiments, the mRNA molecule further comprises a poly a tail that is at least 200 residues in length. In some embodiments, the mRNA molecule further comprises a poly a tail of at least 250 residues in length.
In some embodiments, the mRNA molecule comprises a 5' cap.
In some embodiments, the mRNA molecule comprises at least one non-standard nucleobase. In some embodiments, the nonstandard nucleobase is selected from one or more of 5-methyl-cytidine, pseudouridine, and 2-thio-uridine.
In some embodiments, the mRNA molecules are used to induce functional CFTR expression in a mammal or mammalian cell.
In some embodiments, the functional protein expression induced by the codon optimized mRNA molecule is at least 1.2-fold greater than the protein expression induced by the non-codon optimized mRNA. In some embodiments, the functional protein expression induced by the codon-optimized mRNA molecule is at least 1.5-fold greater than the protein expression induced by an un-codon-optimized mRNA molecule. In some embodiments, the functional protein expression induced by the codon-optimized mRNA molecule is at least 1.8-fold greater than the protein expression induced by an un-codon-optimized mRNA molecule. In some embodiments, the functional protein expression induced by the codon-optimized mRNA molecule is at least 2-fold greater than the protein expression induced by an un-codon-optimized mRNA molecule. In some embodiments, the functional protein expression induced by the codon-optimized mRNA molecule is at least 2.3-fold greater than the protein expression induced by an un-codon-optimized mRNA molecule. In some embodiments, the functional protein expression induced by the codon-optimized mRNA molecule is at least 2.5-fold greater than the protein expression induced by an un-codon-optimized mRNA molecule. In some embodiments, the functional protein expression induced by the codon-optimized mRNA molecule is at least 2.8-fold greater than the protein expression induced by an un-codon-optimized mRNA molecule. In some embodiments, the functional protein expression induced by the codon optimized mRNA molecule is at least 3.0 fold greater than the protein expression induced by the non-codon optimized mRNA. In some embodiments, the functional protein expression induced by the codon-optimized mRNA molecule is at least 3.2-fold greater than the protein expression induced by an un-codon-optimized mRNA molecule. In some embodiments, the functional protein expression induced by the codon-optimized mRNA molecule is at least 3.5-fold greater than the protein expression induced by an un-codon-optimized mRNA molecule. In some embodiments, the functional protein expression induced by the codon-optimized mRNA molecule is at least 3.7-fold greater than the protein expression induced by an un-codon-optimized mRNA molecule. In some embodiments, the functional protein expression induced by the codon-optimized mRNA molecule is at least 4.0-fold greater than the protein expression induced by an un-codon-optimized mRNA molecule. In some embodiments, the functional protein expression induced by the codon-optimized mRNA molecule is at least 4.5-fold greater than the protein expression induced by an un-codon-optimized mRNA molecule. In some embodiments, the functional protein expression induced by the codon-optimized mRNA molecule is at least 5.0-fold greater than the protein expression induced by an un-codon-optimized mRNA molecule.
In some embodiments, the polynucleotide is a linear polynucleotide comprising deoxynucleotide residues. In some embodiments, the polynucleotide is a circular polynucleotide comprising deoxynucleotide residues.
In some embodiments, the codon optimized mRNA is encapsulated within a nanoparticle. In some embodiments, the nanoparticle is a liposome.
In some embodiments, the liposome comprises one or more cationic lipids, one or more non-cationic lipids, and one or more PEG-modified lipids. In some embodiments, the liposome comprises one or more cholesterol-based lipids. In some embodiments, the liposome comprises one or more cationic lipids, one or more non-cationic lipids, one or more cholesterol-based lipids, and one or more PEG-modified lipids. In some embodiments, the liposome comprises no more than three different lipid components. In some embodiments, one of the different lipid components is a sterol-based cationic lipid. In some embodiments, the sterol-based cationic lipid is an Imidazole Cholesterol Ester (ICE).
In some embodiments, the one or more cationic lipids comprise GL-TES-SA-DME-E18-2. In some embodiments, the one or more cationic lipids comprise TL1-01D-DMA. In some embodiments, the one or more cationic lipids comprise SY-3-E14-DMAPR. In some embodiments, the one or more cationic lipids comprise TL1-10D-DMA. In some embodiments, the one or more cationic lipids comprise Guan-SS-Chol. In some embodiments, the one or more cationic lipids comprise GL-TES-SA-DMP-E18-2. In some embodiments, the one or more cationic lipids comprise HEP-E4-E10. In some embodiments, the one or more cationic lipids comprise HEP-E3-E10. In some embodiments, the one or more cationic lipids comprise TL1-04D-DMA.
In some embodiments, the lipid nanoparticle comprises a cationic lipid that is GL-TES-SA-DME-E18-2. In some embodiments, the lipid nanoparticle comprises a cationic lipid that is TL1-01D-DMA. In some embodiments, the lipid nanoparticle comprises a cationic lipid that is SY-3-E14-DMAPR. In some embodiments, the lipid nanoparticle comprises a cationic lipid that is TL1-10D-DMA. In some embodiments, the lipid nanoparticle comprises a cationic lipid as Guan-SS-Chol. In some embodiments, the lipid nanoparticle comprises a cationic lipid that is GL-TES-SA-DMP-E18-2. In some embodiments, the lipid nanoparticle comprises a cationic lipid that is HEP-E4-E10. In some embodiments, the lipid nanoparticle comprises a cationic lipid that is HEP-E3-E10. In some embodiments, the lipid nanoparticle comprises a cationic lipid that is TL1-04D-DMA.
In some embodiments, the liposome has a size of less than about 200 nm. In some embodiments, the liposomes have a size of less than about 150 nm. In some embodiments, the liposomes have a size of less than about 120 nm. In some embodiments, the liposomes have a size of less than about 110 nm. In some embodiments, the liposome has a size of less than about 100 nm. In some embodiments, the liposome has a size of less than about 80 nm. In some embodiments, the liposomes have a size of less than about 60 nm. In some embodiments, the liposomes have a size of less than about 50 nm. In some embodiments, the liposomes have a size of less than about 40 nm. In some embodiments, the liposome has a size of less than about 30 nm.
In some embodiments, the pharmaceutical composition further comprises a CFTR potentiator. In some embodiments, the pharmaceutical composition further comprises a CFTR corrector. In some embodiments, the pharmaceutical composition further comprises a CFTR activator. In some embodiments, the pharmaceutical composition further comprises a CFTR potentiator, a correction agent, and/or an activator. Suitable CFTR potentiators, correction and/or activators include ivacaine (trade name ) Lu Maka Torr (trade name->) Tizakator, vX-659, vX-445, vX-152, vX-440, vX-371, vX-561, GLPG1837, GLPG2222, GLPG2737, GLPG2451, GLPG1837, PTI-428, PTI-801, PTI-808, and Ai Lufu sen (eluforsen). In some embodiments, the pharmaceutical composition further comprises ivacaine. In some embodiments, the pharmaceutical composition further comprises Lu Maka torr. In some embodiments, the pharmaceutical composition further comprises tizakapton. In some embodiments, the pharmaceutical composition further comprises ivacaine, lu Maka torr, tizalcine, or a combination. In some embodiments, the pharmaceutical composition further comprises VX-659. In some embodiments, the pharmaceutical composition further comprises VX-445. In some embodiments, the pharmaceutical composition further comprises VX-152. In some embodiments, the pharmaceutical composition further comprises VX-440. In some embodiments, the pharmaceutical composition further comprises VX-371. In some embodiments, the pharmaceutical composition further comprises VX-561. In some embodiments, the pharmaceutical composition further comprises GLPG1837. In some embodiments, the pharmaceutical composition further comprises GLPG2222. In some embodiments, the pharmaceutical composition further comprises GLPG2737. In one place In some embodiments, the pharmaceutical composition further comprises GLPG2451. In some embodiments, the pharmaceutical composition further comprises GLPG1837. In some embodiments, the pharmaceutical composition further comprises PTI-428. In some embodiments, the pharmaceutical composition further comprises PTI-801. In some embodiments, the pharmaceutical composition further comprises PTI-808. In some embodiments, the pharmaceutical composition further comprises Ai Lufu sen. In some embodiments, the pharmaceutical composition further comprises any combination of CFTR potentiators, correction agents, and/or activators.
In one aspect, the invention provides a method of inducing protein expression in epithelial cells in the lung of a mammal, the method comprising the step of contacting the epithelial cells in the lung of the mammal with a pharmaceutical composition of the invention.
In some embodiments, the codon optimized mRNA is administered via pulmonary delivery. In some embodiments, the codon optimized mRNA is administered via intravenous delivery. In some embodiments, the codon optimized mRNA is administered via: oral, rectal, vaginal, transmucosal or intestinal administration; parenteral delivery, including intradermal, transdermal (topical), intramuscular, subcutaneous, intramedullary injections, as well as intrathecal, direct intraventricular, intravenous, intraperitoneal, and/or intranasal administration.
In some embodiments, the pulmonary delivery is nebulization. In some embodiments, the codon optimized mRNA is administered via aerosolization.
In some embodiments, treating the subject is achieved at a lower therapeutically effective dose as compared to treating the subject with a non-codon optimized mRNA encoding a wild-type protein.
In some embodiments, treating a subject in need thereof results in a shorter nebulization time to administer a therapeutically effective dose compared to treatment with a non-codon optimized mRNA encoding a wild-type protein.
Drawings
The drawings are for illustration purposes only and are not intended to be limiting.
FIGS. 1A and 1B illustrate a method of generating an optimized nucleotide sequence according to the present invention. As shown in fig. 1A, the method receives an amino acid sequence of interest and a first codon usage table reflecting the frequency of each codon in a given organism (e.g., mammal or human). If a codon is associated with a codon usage frequency below a threshold frequency (e.g., 10%), the method removes the codon from the first codon usage table. The codon usage frequency of the codons not removed in the first step is normalized to generate a normalized codon usage table. The method uses a standardized codon usage table to generate a list of optimized nucleotide sequences. Each optimized nucleotide sequence encodes an amino acid sequence of interest. As shown in fig. 1B, the list of optimized nucleotide sequences was further processed by applying a motif screening filter, a guanine-cytosine (GC) content analysis filter, and a Codon Adaptation Index (CAI) analysis filter in the following order to generate an updated list of optimized nucleotide sequences.
FIG. 2A shows an exemplary Western blot for determining the protein expression yield of CFTR protein encoded by an optimized nucleotide sequence generated according to the method of the invention in a time course experiment after transfection of the optimized nucleotide sequence into human cells. Fig. 2B shows an exemplary line graph depicting quantification of western blot data depicted in fig. 2A.
Fig. 3A shows an exemplary plot of data obtained from a bioassay for testing mRNA containing optimized nucleotide sequences encoding hCFTR. It depicts the short circuit current (I) in the Uwing (using) epithelial voltage clamp device for each mRNA tested SC ) And outputting. Fig. 3B shows an exemplary bar graph demonstrating hCFTR activity change as depicted in fig. 3A, expressed as a percentage of the activity of a reference mRNA encoding hCFTR.
Fig. 4A is an exemplary gel depicting the banding pattern of various CFTR sequences. Fig. 4B is a bar graph depicting the relative expression of various CFTR sequences in the C-band. Fig. 4C is a bar graph depicting CFTR mRNA efficacy of various CFTR sequences.
Fig. 5A and 5B are exemplary graphs showing short circuit conductivities of various codon optimized and non-codon optimized CFTR constructs measured by ewing cells. Fig. 5C is an exemplary bar graph of the maximum activation current of various CFTR constructs in the ews chamber assay at 22 and 44 hours.
Fig. 6 is an exemplary bar graph depicting the results of an assay to evaluate cytotoxicity of various CFTR mRNA sequences.
FIG. 7 is an exemplary bar graph depicting the amount of radiation produced by luciferase protein expressed in mice after administration of mRNA-LNP (each comprising a different cationic lipid component). In the figure at 10 4 p/s/cm 2 The horizontal line above and below sr represents the historical radiation/expression of pulmonary delivered (e.g., aerosolized) FFL mRNA encapsulated in LNP comprising ICE as cationic lipid. In the figure at 10 6 p/s/cm 2 The horizontal line above and below sr represents the historical radiation/expression of pulmonary delivered (e.g., aerosolized) FFL mRNA encapsulated in LNP comprising ML2 as a cationic lipid. These thresholds may be used to screen lipids for pulmonary delivery.
Fig. 8A is an exemplary bar graph depicting the amount of mRNA delivered to lung tissue as determined by RT-qPCR. Fig. 8B is an exemplary graph depicting the amount of mCherry protein produced by each amount of mCherry mRNA delivered to the lung (x-axis).
Fig. 9A is an exemplary image of a whole mouse showing the radiation of firefly luciferase expressed by delivered mRNA-LNP. Radiation showed that mRNA-LNP was efficiently delivered to the mouse lung for in vivo expression. Fig. 9B is an exemplary imaging of mice by cryofluorescence tomography, which shows Cre recombinase mRNA expression. Imaging showed that expression of the delivered protein was detected in the branches of the lung and airways, as indicated by the arrows. Fig. 9C is an exemplary immunohistochemistry of the mouse lung administered saline or Cre-mRNA LNP via nebulization. Positive bronchiole epithelial cells include secretory and/or ciliated cells (arrow with "1") and type I and type II lung cells are indicated by arrow with "2". Fig. 9D is an exemplary immunofluorescence imaging of lung sections at 40x and 100x magnification. The CFTR protein expressed by the delivered mRNA is evident at the apical surface of the airways, as indicated by the arrow.
Fig. 10A is an exemplary HBEC-ALI (human bronchial epithelial cell-gas liquid interface) model and time axis for ALI culture growth. Fig. 10B is an exemplary image of differentiated epithelium in HBEC-ALI model after staining with hematoxylin and eosin (H & E).
FIG. 11A is an exemplary bar graph depicting the amount of luminescence (on a logarithmic scale) produced by luciferase protein in cells in the HBEC-ALI model after transfection with mRNA-LNP. Fig. 11B is an exemplary bar graph depicting cell integrity as measured by transepithelial electrical resistance (TEER), which is a strong indicator of epithelial integrity.
Fig. 12 is an exemplary ROC curve (receiver operating profile) demonstrating that the HBEC-ALI model shows meaningful performance as a classification model for screening and filtering lipids prior to in vivo evaluation.
Fig. 13A is an exemplary graph showing the relative concentration of lipids in the HBEC-ALI model after transfection with mRNA-LNP. The half-life determined by the graph is about 2.9 hours. Fig. 13B is an exemplary table showing half-life values determined by mouse lung and human lung homogenates.
Definition of the definition
In order that the invention may be more readily understood, certain terms are first defined below. Additional definitions of the following terms and other terms are set forth throughout the specification. Publications and other reference materials mentioned herein to describe the background of the invention and to provide additional details concerning the practice of the invention are hereby incorporated by reference.
About or about: as used herein, the term "about" or "approximately" as applied to one or more destination values refers to values similar to the reference value. In certain embodiments, the term "about" or "approximately" refers to a range of values that fall within 25%, 20%, 19%, 18%, 17%, 16%, 15%, 14%, 13%, 12%, 11%, 10%, 9%, 8%, 7%, 6%, 5%, 4%, 3%, 2%, 1% or less of any direction (greater than or less than) of the recited reference value, unless otherwise indicated or otherwise apparent from the context (unless such numbers exceed 100% of the possible values).
As used herein, the term "batch" refers to the amount or quantity of mRNA synthesized at one time (e.g., produced according to a single manufacturing order during the same manufacturing cycle). A batch may refer to the amount of mRNA synthesized in a single reaction that is performed under a set of conditions via a single aliquot of enzyme and/or a single aliquot of DNA template for continuous synthesis. In some embodiments, a batch will include mRNA produced by a reaction in which not all reagents and/or components are replenished and/or replenished as the reaction proceeds. The term "not in a single batch" does not mean that the mRNA synthesized at different times are combined to achieve the desired amount.
Delivery: as used herein, the term "delivery" encompasses both local and systemic delivery. For example, delivery of mRNA encompasses situations where mRNA is delivered to a target tissue and the encoded protein is expressed and retained within the target tissue (also referred to as "local distribution" or "local delivery"); and cases where mRNA is delivered to a target tissue and the encoded protein is expressed and secreted into the patient's circulatory system (e.g., serum), and is distributed systemically and taken up by other tissues (also referred to as "systemic distribution" or "systemic delivery"). In some embodiments, the delivery is pulmonary delivery, including, for example, nebulization.
Encapsulation: as used herein, the term "encapsulation" or grammatical equivalents refers to the process of confining mRNA molecules within a nanoparticle.
Engineered or mutant: as used herein, the term "engineered" or "mutant" or grammatical equivalents refers to a nucleotide or protein sequence that contains one or more modifications as compared to its naturally occurring sequence, including but not limited to deletions, insertions of heterologous nucleic acids or amino acids, inversions, substitutions, or combinations thereof.
Expression: as used herein, "expression" of a nucleic acid sequence refers to translation of mRNA into a polypeptide, assembly of multiple polypeptides (e.g., heavy or light chains of an antibody) into an intact protein (e.g., an antibody), and/or post-translational modification of the polypeptide or the fully assembled protein (e.g., an antibody). In the present application, the terms "express" and "produce" and their grammatical equivalents are used interchangeably.
Functionality: as used herein, a "functional" biomolecule is a biomolecule in a form in which it exhibits its characteristic properties and/or activity.
Half-life period: as used herein, the term "half-life" is the time required for an amount, such as nucleic acid or protein concentration or activity, to drop to half its value as measured at the beginning of a time period.
Improvement, increase or decrease: as used herein, the terms "improve," "increase," or "decrease," or grammatical equivalents, refer to a value relative to a baseline measurement, such as a measurement in the same individual prior to initiation of a treatment described herein, or a measurement in a control subject (or multiple control subjects) in the absence of a treatment described herein. A "control subject" is a subject afflicted with the same form of disease as the subject being treated, and is approximately the same age as the subject being treated.
Impurity: as used herein, the term "impurity" refers to a substance within a limited amount of liquid, gas, or solid that differs from the chemical composition of the target material or compound. Impurities are also known as contaminants.
In vitro: as used herein, the term "in vitro" refers to events that occur in an artificial environment (e.g., in a tube or reaction vessel, in cell culture, etc.), rather than within a multicellular organism.
In vivo: as used herein, the term "in vivo" refers to events that occur within multicellular organisms (e.g., humans and non-human animals). In the case of a cell-based system, the term may be used to refer to events that occur within living cells (as opposed to, for example, in vitro systems).
Separating: as used herein, the term "isolated" refers to the following substances and/or entities: has been (1) separated from at least some of the components to which it was originally associated (in nature and/or in the experimental environment), and/or (2) artificially created, prepared and/or manufactured. The isolated substance and/or entity may be separated from about 10%, about 20%, about 30%, about 40%, about 50%, about 60%, about 70%, about 80%, about 90%, about 91%, about 92%, about 93%, about 94%, about 95%, about 96%, about 97%, about 98%, about 99%, or more than about 99% of the other components to which it was originally attached. In some embodiments, the isolated agent is about 80%, about 85%, about 90%, about 91%, about 92%, about 93%, about 94%, about 95%, about 96%, about 97%, about 98%, about 99%, or more than about 99% pure. As used herein, a substance is "pure" if the substance is substantially free of other components. As used herein, calculation of the percent purity of an isolated substance and/or entity should not include excipients (e.g., buffers, solvents, water, etc.).
Messenger RNA (mRNA): as used herein, the term "messenger RNA (mRNA)" refers to a polynucleotide encoding at least one polypeptide. mRNA as used herein encompasses both modified and unmodified RNAs. An mRNA may contain one or more coding and non-coding regions. mRNA can be purified from natural sources, produced using recombinant expression systems and optionally purified, chemically synthesized, and the like. Where appropriate, for example, in the case of a chemically synthesized molecule, the mRNA may comprise nucleoside analogs (e.g., analogs of bases or sugars with chemical modifications), backbone modifications, and the like. Unless otherwise indicated, mRNA sequences are presented in the 5 'to 3' direction.
Nucleic acid: as used herein, the term "nucleic acid" is used in its broadest sense to refer to any compound and/or substance that is or can be incorporated into a polynucleotide strand. In some embodiments, the nucleic acid is a compound and/or substance that is incorporated or can be incorporated into the polynucleotide strand via a phosphodiester linkage. In some embodiments, "nucleic acid" refers to individual nucleic acid residues (e.g., nucleotides and/or nucleosides). In some embodiments, "nucleic acid" refers to a polynucleotide strand comprising individual nucleic acid residues. In some embodiments, "nucleic acid" encompasses RNA as well as single and/or double stranded DNA and/or cDNA. Furthermore, the terms "nucleic acid," "DNA," "RNA," and/or similar terms include nucleic acid analogs, i.e., analogs having a backbone other than a phosphodiester. For example, so-called "peptide nucleic acids" known in the art and having peptide bonds in the backbone in place of phosphodiester bonds are considered to be within the scope of the present invention. The term "nucleotide sequence encoding an amino acid sequence" includes all nucleotide sequences that are degenerate versions of each other and/or encode the same amino acid sequence. The nucleotide sequence encoding the protein and/or RNA may include introns. Nucleic acids can be purified from natural sources, produced using recombinant expression systems and optionally purified, chemically synthesized, and the like. Where appropriate, for example, in the case of a chemically synthesized molecule, the nucleic acid may comprise nucleoside analogs (e.g., analogs of bases or sugars with chemical modifications), backbone modifications, and the like. Unless otherwise indicated, the nucleic acid sequences are presented in the 5 'to 3' direction. In some embodiments, the nucleic acid is or comprises a natural nucleoside (e.g., adenosine, thymidine, guanosine, cytidine, uridine, deoxyadenosine, deoxythymidine, deoxyguanosine, and deoxycytidine); nucleoside analogs (e.g., 2-aminoadenosine, 2-thiothymidine, inosine, pyrrolo-pyrimidine, 3-methyladenosine, 5-methylcytidine, C-5 propynyl-cytidine, C-5 propynyl-uridine, 2-aminoadenosine, C5-bromouridine, C5-fluorouridine, C5-iodouridine, C5-propynyl-uridine, C5-propynyl-cytidine, C5-methylcytidine, 2-aminoadenosine, 7-deadenosine, 7-deazaguanosine, 8-oxo-guanosine, O (6) -methylguanine, and 2-thiocytidine); chemically modified bases; biologically modified bases (e.g., methylated bases); an intercalating base; modified sugars (e.g., 2 '-fluororibose, ribose, 2' -deoxyribose, arabinose, and hexose); and/or modified phosphate groups (e.g., phosphorothioate and 5' -N-phosphoramidite linkages). In some embodiments, the invention relates specifically to "unmodified nucleic acids," meaning nucleic acids (e.g., polynucleotides and residues, including nucleotides and/or nucleosides) that have not been chemically modified to facilitate or effect delivery. In some embodiments, the nucleotides T and U are used interchangeably in the sequence description.
Patient: as used herein, the term "patient" or "subject" refers to any organism to which the provided compositions can be administered, e.g., for experimental, diagnostic, prophylactic, cosmetic, and/or therapeutic purposes. Typical patients include animals (e.g., mammals such as mice, rats, rabbits, non-human primates, and/or humans). In some embodiments, the patient is a human. Humans include prenatal and postnatal forms.
Pharmaceutically acceptable: the term "pharmaceutically acceptable" as used herein refers to the following: suitable for use in contact with human and animal tissue without undue toxicity, irritation, allergic response, or other problem or complication, commensurate with a reasonable benefit/risk ratio, within the scope of sound medical judgment.
Stable: as used herein, the term "stable" protein or grammatical equivalents thereof refers to a protein that retains its physical stability and/or biological activity. In one embodiment, protein stability is determined at a low percentage of degraded (e.g., fragmented) and/or aggregated protein based on the percentage of monomeric protein in the solution. In one embodiment, the stabilized engineered protein retains or exhibits an increased half-life as compared to the wild-type protein. In one embodiment, the stabilized engineered protein is less prone to ubiquitination resulting in proteolysis than the wild-type protein.
The subject: as used herein, the term "subject" refers to a human or any non-human animal (e.g., mouse, rat, rabbit, dog, cat, cow, pig, sheep, horse, or primate). Humans include prenatal and postnatal forms. In many embodiments, the subject is a human. The subject may be a patient, which refers to a person presented to a medical provider for diagnosis or treatment of a disease. The term "subject" is used interchangeably herein with "individual" or "patient. The subject may be afflicted with or susceptible to a disease or disorder, but may or may not exhibit symptoms of the disease or disorder.
Basically: as used herein, the term "substantially" refers to a qualitative condition that exhibits an overall or near-overall range or degree of the characteristic or feature of interest. Those of ordinary skill in the biological arts will appreciate that biological and chemical phenomena are rarely, if ever, accomplished and/or proceed to completion or achieve or avoid absolute results. Thus, the term "substantially" is used herein to capture the potential lack of integrity inherent in many biological and chemical phenomena.
Treatment: as used herein, the term "treatment" or "treatment" refers to any method for partially or completely alleviating, ameliorating, alleviating, inhibiting, preventing, delaying the onset of, reducing the severity of, and/or reducing the incidence of one or more symptoms or features of a particular disease, disorder, and/or condition. The treatment may be administered to a subject that does not exhibit signs of disease and/or exhibits only early signs of disease for the purpose of reducing the risk of developing a condition associated with the disease.
Detailed Description
The present invention provides pharmaceutical compositions for delivery to a subject (e.g., a human subject or cells of a human subject or cells treated and delivered to a human subject) comprising a codon-optimized mRNA molecule encoding a peptide or polypeptide encapsulated in a lipid nanoparticle comprising one or more of the cationic lipids described herein. The pharmaceutical compositions described herein comprising codon optimized mRNA and/or cationic lipids are particularly effective in treating diseases by pulmonary administration.
Thus, in certain embodiments, the invention provides a pharmaceutical composition for delivery to the lung or lung cells of a subject, the pharmaceutical composition comprising a codon-optimized mRNA molecule encoding a peptide or polypeptide encapsulated in a lipid nanoparticle comprising one or more of the cationic lipids described herein. In certain embodiments, the codon-optimized mRNA encapsulated in the lipid nanoparticle encodes a cystic fibrosis transmembrane conductance regulator (CFTR) protein. In certain embodiments, the codon-optimized mRNA encapsulated in the lipid nanoparticle encodes an ATP-binding cassette subfamily a member 3 protein. In certain embodiments, the codon-optimized mRNA encapsulated in the lipid nanoparticle encodes a kinesin axin medium-chain 1 (DNAI 1) protein. In certain embodiments, the codon-optimized mRNA encapsulated in the lipid nanoparticle encodes a kinesin shaft heavy chain 5 (DNAH 5) protein. In certain embodiments, the codon-optimized mRNA encapsulated in the lipid nanoparticle encodes an alpha-1-antitrypsin protein. In certain embodiments, the codon-optimized mRNA encapsulated in the lipid nanoparticle encodes a fork box P3 (FOXP 3) protein. In certain embodiments, the codon-optimized mRNA encapsulated in such lipid nanoparticles encodes one or more surfactant proteins, such as one or more of surfactant a protein, surfactant B protein, surfactant C protein, and surfactant D protein.
Cystic fibrosis
The invention may be used to treat subjects suffering from or susceptible to cystic fibrosis. Cystic fibrosis is a genetic disorder characterized by mutations in the cystic fibrosis transmembrane conductance regulator (CFTR) gene. CFTR proteins function as channels across the cell membrane, which produce mucus, sweat, saliva, tears, and digestive enzymes. The channels transport negatively charged particles, called chloride ions, into and out of the cell. The transport of chloride ions helps control the movement of water in the tissue, which is necessary to produce free flowing dilute mucus. Mucus is a slippery substance that lubricates and protects the lining of the airways, digestive system, reproductive system, and other organs and tissues.
Respiratory symptoms of cystic fibrosis include: continuous cough, wheezing, shortness of breath, exercise intolerance, recurrent pulmonary infections and nasal airway inflammation or nasal obstruction that produce thick mucus (sputum). Digestive symptoms of cystic fibrosis include: malodor, oily stool, poor weight gain and growth, ileus (especially in newborns (meconium ileus)) and severe constipation.
Codon optimized mRNA
In some embodiments, the invention provides methods and compositions for delivering codon-optimized mRNA encoding a peptide or polypeptide to a subject for treating a disease. Suitable codon-optimized mRNAs encode any full length, fragment, or portion of a protein that can replace naturally occurring protein activity and/or reduce the intensity, severity, and/or frequency of one or more symptoms associated with a disease.
According to an increasing number of studies, mRNA contains multiple layers of information overlapping with the amino acid code. Traditionally, codon optimization has been used to remove rare codons that are believed to have rate limitations for protein expression. While both fast growing bacteria and yeast exhibit Jiang Mima seed bias in highly expressed genes, higher eukaryotes exhibit significantly less codon bias, which makes it more difficult to discern codons that may be rate limiting. Furthermore, it has been found that codon bias does not necessarily produce high expression per se, but other features are required.
For example, rare codons are associated with slowing translation and creating a pause site, which may be necessary for proper protein folding. Thus, changes in codon usage can provide a mechanism to fine tune the extended temporal pattern, thereby increasing the time available for the protein to make its correct confirmation. Codon optimisation may interfere with this trimming mechanism, resulting in reduced protein translation efficiency or increased numbers of misfolded proteins. Similarly, codon optimization can disrupt the normal pattern of homologous and oscillating tRNA usage, thereby affecting protein structure and function, as the mechanism of achieving proper protein folding can likewise be chosen to rely on the slowing of oscillating extension.
Various methods of performing codon optimization are known in the art, however, each has significant drawbacks and limitations from a computational and/or therapeutic standpoint. In particular, known codon optimization methods generally involve, for each amino acid, replacing each codon with the highest codon used for that amino acid, such that the "optimized" sequence contains only one codon encoding each amino acid (and thus may be referred to as a one-to-one sequence).
Despite these obstacles, the inventors have obtained improved codon optimized sequences that enhance protein expression at least twice as much as the coding sequence of the wild-type gene. The observed improvement in codon optimized protein coding sequence expression is expected to result in improved, more cost-effective mRNA replacement therapy for patients with disease, as it does not require the use of modified nucleotides to make mRNA, and allows treatment at reduced doses and/or at prolonged dosing intervals.
The genetic code has 64 possible codons. Each codon comprises a sequence of three nucleotides. The frequency of use of each codon in the protein coding region of the genome can be counted by: the number of instances of a particular codon present in the protein-encoding region of the genome is determined, and the obtained value is then divided by the total number of codons encoding the same amino acid within the protein-encoding region of the genome.
Regarding the frequency with which each codon is used to encode a certain amino acid for a particular biological source that generates a codon usage table, the codon usage table contains experimentally obtained data. For each codon, this information is expressed as a percentage (0 to 100%) or fraction (0 to 1) of the frequency with which the codon is used to encode a certain amino acid relative to the total number of times the codon encodes the amino acid.
The codon usage tables are stored in publicly available databases such as the codon usage database (Nakamura et al (2000) Nucleic Acids Research (1), 292; available online: https:// www.kazusa.or.jp/codon /) and the high performance integrated virtual environment-codon usage table (HIVE-CUT) database (Athey et al, (2017), BMC Bioinformatics (1), 391; available online: http:// HIVE. Biochem. Gwu. Edu/review/codon).
During the first step of codon optimization, if a codon is associated with a frequency of codon usage below a threshold frequency (e.g., 10%), the codon is removed from a first codon usage table reflecting the frequency of each codon in a given organism (e.g., mammal or human). The codon usage frequency of the codons not removed in the first step is normalized to generate a normalized codon usage table. An optimized nucleotide sequence encoding an amino acid sequence of interest is generated by selecting codons for each amino acid in the amino acid sequence based on the frequency of use of one or more codons associated with the given amino acid in the standardized codon usage table. The probability of selecting a certain codon for a given amino acid is equal to the frequency of codon usage in the standardized codon usage table that is related to that amino acid.
The codon optimized sequences of the invention are generated by computer-implemented methods for generating optimized nucleotide sequences. The method comprises the following steps: (i) Receiving an amino acid sequence, wherein the amino acid sequence encodes a peptide, polypeptide, or protein; (ii) Receiving a first codon usage table, wherein the first codon usage table comprises a list of amino acids, wherein each amino acid in the table is associated with at least one codon, and each codon is associated with a frequency of use; (iii) Removing any codons from the codon usage table that are associated with a frequency of use below a threshold frequency; (iv) Generating a normalized codon usage table by normalizing the frequency of use of codons not removed in step (iii); and (v) generating an optimized nucleotide sequence encoding the amino acid sequence by selecting codons for each amino acid in the amino acid sequence based on the frequency of use of the one or more codons in the standardized codon usage table associated with the amino acid sequence. The threshold frequency may be in the range of 5% -30%, in particular 5%, 10%, 15%, 20%, 25% or 30%. In the case of the present invention, the threshold frequency is typically 10%.
The step of generating a standardized codon usage table comprises: (a) Assigning the frequency of use of each codon associated with a first amino acid and removed in step (iii) to the remaining codons associated with the first amino acid; and (b) repeating step (a) for each amino acid to produce a standardized codon usage table. In some embodiments, the frequency of use of the removed codons is equally divided among the remaining codons. In some embodiments, the frequency of use of the removed codons is apportioned between the remaining codons based on the frequency of use of each remaining codon. "assignment" in this case may be defined as taking the combined magnitude of the frequency of use of the removed codons associated with a certain amino acid and apportioning a portion of that combined frequency to each of the remaining codons encoding the certain amino acid.
The step of selecting a codon for each amino acid comprises: (a) Identifying one or more codons associated with a first amino acid of the amino acid sequence in the standardized codon usage table; (b) Selecting codons associated with the first amino acid, wherein the probability of selecting a codon is equal to the frequency of use of codons in the standardized codon usage table that are related to the first amino acid; and (c) repeating steps (a) and (b) until a codon for each amino acid in the amino acid sequence has been selected.
The step of generating an optimized nucleotide sequence by selecting the codon of each amino acid in the amino acid sequence (step (v) in the above method) is performed n times to generate a list of optimized nucleotide sequences.
Motif screening
Motif screening filters were applied to the list of optimized nucleotide sequences. The optimized nucleotide sequences encoding any known negative cis-regulatory elements and weight bearing components are removed from the list to produce an updated list.
For each optimized nucleotide sequence in the list, it is also determined whether it contains a termination signal. Removing any nucleotide sequences containing one or more termination signals from the list, resulting in an updated list. In some embodiments, the termination signal has the following nucleotide sequence: 5' -X 1 ATCTX 2 TX 3 -3', wherein X 1 、X 2 And X 3 Independently selected from A, C, T or G. In some embodiments, the termination signal has one of the following nucleotide sequences: TATCTGTT; and/or TTTTTT; and/or AAGCTT; and/or gaagag; and/or tctag a. In some embodiments, the termination signal has the following nucleotide sequence: 5' -X 1 AUCUX 2 UX 3 -3', whereinX 1 、X 2 And X 3 Independently selected from A, C, U or G. In some embodiments, the termination signal has one of the following nucleotide sequences: UAUCUGUU; and/or uuuuuuuu; and/or AAGCUU; and/or gaagag; and/or UCUAGA.
Guanine-cytosine (GC) content
The method further includes determining a guanine-cytosine (GC) content of each optimized nucleotide sequence in the updated list of optimized nucleotide sequences. The GC content of a sequence is the percentage of bases in the nucleotide sequence that are guanine or cytosine. The list of optimized nucleotide sequences is further updated by: if the GC content of any nucleotide sequence falls outside a predetermined GC content range, the sequence is removed from the list.
Determining the GC content of each optimized nucleotide sequence includes, for each nucleotide sequence: determining the GC content of one or more additional portions of the nucleotide sequence, wherein the additional portions do not overlap each other and the first portion, and wherein updating the list of optimized sequences comprises: if the GC content of any portion falls outside a predetermined GC content range, the nucleotide sequence is removed, optionally wherein determining the GC content of the nucleotide sequence is stopped when the GC content of any portion is determined to be outside the predetermined GC content range. In some embodiments, the first portion and/or one or more additional portions of the nucleotide sequence comprises a predetermined number of nucleotides, optionally wherein the predetermined number of nucleotides is within the following range: 5 to 300 nucleotides, or 10 to 200 nucleotides, or 15 to 100 nucleotides, or 20 to 50 nucleotides. In the case of the present invention, the predetermined number of nucleotides is typically 30 nucleotides. The predetermined GC content range may be 15% -75%, or 40% -60%, or 30% -70%. In the case of the present invention, the predetermined GC content range is typically 30% -70%.
In the case of the present invention a suitable GC content filter may first analyze the first 30 nucleotides of the optimized nucleotide sequence, i.e. nucleotides 1 to 30 of the optimized nucleotide sequence. The analyzing may include determining the number of nucleotides in the portion as G or C, and the determining the GC content of the portion may include dividing the number of G or C nucleotides in the portion by the total number of nucleotides in the portion. The result of this analysis will provide a value describing the proportion of nucleotides in the portion as G or C and may be a percentage, for example 50%, or a fraction, for example 0.5. If the GC content of the first portion falls outside a predetermined GC content range, the optimized nucleotide sequence may be removed from the list of optimized nucleotide sequences.
The GC content filter may then analyze the second portion of the optimized nucleotide sequence if the GC content of the first portion falls within the predetermined GC content range. In this example, this may be the second 30 nucleotides of the optimized nucleotide sequence, i.e., nucleotides 31 to 60. The partial analysis may be repeated for each portion until either of the following occurs: a portion of the GC content was found to fall outside a predetermined GC content range, in which case the optimized nucleotide sequence can be removed from the list; or the entire optimized nucleotide sequence has been analyzed and no such part is found, in which case the GC content filter retains the optimized nucleotide sequence in the list and can move to the next optimized nucleotide sequence in the list.
Codon Adaptation Index (CAI)
The method further includes determining a codon adaptation index for each optimized nucleotide sequence in the most recently updated list of optimized nucleotide sequences. The codon adaptation index of a sequence is a measure of codon usage bias and may be a value between 0 and 1. The most recently updated list of optimized nucleotide sequences is further updated by: any nucleotide sequence is removed if its codon usage index is less than or equal to a predetermined codon usage index threshold. The codon adaptation index threshold may be 0.7, or 0.75, or 0.8, or 0.85, or 0.9. The inventors have found that optimized nucleotide sequences with codon adaptation indexes equal to or greater than 0.8 deliver very high protein yields. Thus, in the case of the present invention, the codon usage index threshold is typically 0.8.
For each optimized nucleotide sequence, the codon adaptation index may be calculated in any manner apparent to a person skilled in the art, for example as described in the following documents: "The codon adaptation index-a measure of directional synonymous codon usage bias, and its potential applications" (Sharp and Li,1987.Nucleic Acids Research 15 (3), pages 1281-1295).
Implementing the codon adaptation index calculation may include following or similar methods. For each amino acid in the sequence, the weight of each codon in the sequence can be determined by what is known as relative fitness (w i ) Is a parameter representation of (c). The relative fitness can be calculated from the set of reference sequences as the observed frequency f of the codon for that amino acid i Frequency f of the synonymous codon most frequently used j The ratio between. The codon adaptation index of the sequence can then be calculated as a geometric mean of the weights associated with each codon in the sequence length (measured in codons). The set of reference sequences used to calculate the codon adaptation index may be the same set of reference sequences from which the codon usage table used with the method of the invention was derived.
Codon optimized CFTR mRNA
In some embodiments, a suitable codon optimized mRNA sequence is an mRNA sequence encoding the human CFTR (hCFTR) protein of SEQ ID NO. 1.
TABLE 1 exemplary codon optimized human CFTR
In some embodiments, a suitable mRNA can be a codon optimized sequence as shown in SEQ ID NOS.2-11.
In some embodiments, a suitable mRNA sequence may be an mRNA sequence of a homolog or analog of a human CFTR protein. For example, a homolog or analog of a human CFTR protein may be a modified human CFTR protein that contains one or more amino acid substitutions, deletions, and/or insertions as compared to a wild-type or naturally occurring human CFTR protein, while substantially retaining the activity of the CFTR protein. In some embodiments, mRNAs suitable for the present invention encode an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% homologous or more homologous to SEQ ID NO. 1. In some embodiments, mRNA suitable for the present invention encodes a protein that is substantially identical to a human CFTR protein. In some embodiments, mRNAs suitable for the present invention encode an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% identical or more identical to SEQ ID NO. 1. In general, mRNA according to the invention encodes a CFTR protein having the same amino acid sequence as SEQ ID NO. 1.
In some embodiments, mRNA suitable for the present invention encodes a fragment or portion of a human CFTR protein. In some embodiments, mRNA suitable for the present invention encodes a fragment or portion of a human CFTR protein, wherein the fragment or portion of the protein still retains CFTR activity similar to that of a wild-type protein.
In some embodiments, a suitable mRNA encodes a fusion protein comprising the full length, fragment, or portion of a CFTR protein fused (e.g., N-or C-terminal fused) to another protein. In some embodiments, the protein encodes a signal or cell targeting sequence fused to mRNA encoding the full length, fragment or portion of the CFTR protein.
In some embodiments, mRNAs suitable for the present invention comprise nucleotide sequences that are at least 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% identical or more to SEQ ID NO. 2, SEQ ID NO. 3, SEQ ID NO. 4, SEQ ID NO. 5, SEQ ID NO. 6, SEQ ID NO. 7, SEQ ID NO. 8, SEQ ID NO. 9, SEQ ID NO. 10 or SEQ ID NO. 11.
mRNA synthesis
mRNA according to the present invention can be synthesized according to any of a variety of known methods. For example, mRNA according to the invention may be synthesized by In Vitro Transcription (IVT). Briefly, IVT is typically performed with: a linear or circular DNA template containing a promoter, a pool of ribonucleotides triphosphates, a buffer system that can include DTT and magnesium ions, and a suitable RNA polymerase (e.g., T3, T7, or SP6RNA polymerase), dnase I, pyrophosphatase, and/or an rnase inhibitor. The exact conditions will vary depending on the particular application.
Exemplary codon-optimized human cystic fibrosis transmembrane conductance regulator (CFTR) mRNA
Construct design:
X-SEQ ID NO:1-Y
5 'and 3' UTR sequences:
x (5' utr sequence) =
GGACAGAUCGCCUGGAGACGCCAUCCACGCUGUUUUGACCUCCAUAGAAGAC
ACCGGGACCGAUCCAGCCUCCGCGGCCGGGAACGGUGCAUUGGAACGCGGAUUCCCCGUGCCAAGAGUGACUCACCGUCCUUGACACG(SEQ ID NO:12)
Y (3' utr sequence) =
CGGGUGGCAUCCCUGUGACCCCUCCCCAGUGCCUCUCCUGGCCCUGGAAGUU
GCCACUCCAGUGCCCACCAGCCUUGUCCUAAUAAAAUUAAGUUGCAUCAAGCU(SEQ ID NO:13)
Or (b)
GGGUGGCAUCCCUGUGACCCCUCCCCAGUGCCUCUCCUGGCCCUGGAAGUUG
CCACUCCAGUGCCCACCAGCCUUGUCCUAAUAAAAUUAAGUUGCAUCAAAGCU(SEQ ID NO:14)
Exemplary codon optimized human CFTR mRNA sequences include any of SEQ ID NO. 2 through SEQ ID NO. 11, as described in the detailed description section.
In some embodiments, the activity of the CFTR protein is assessed by an ews chamber assay. In some embodiments, the duration of activity of the CFTR protein is assessed by a time course ews assay. In some embodiments, protein expression and stability are assessed by pulse-chase. In some embodiments, protein expression and stability are assessed by surface biotinylation.
In some embodiments, for the preparation of mRNA according to the invention, the DNA template is transcribed in vitro. Suitable DNA templates typically have a promoter for in vitro transcription (e.g., a T3, T7, or SP6 promoter) followed by the desired nucleotide sequence for the desired mRNA and a termination signal.
Synthesis of mRNA using SP6RNA polymerase
In some embodiments, the CFTR mRNA is produced using SP6RNA polymerase. SP6RNA polymerase is a DNA-dependent RNA polymerase that has high sequence specificity for the SP6 promoter sequence. SP6 polymerase catalyzes 5'→3' in vitro RNA synthesis on single-stranded DNA or double-stranded DNA downstream of its promoter; it incorporates natural ribonucleotides and/or modified ribonucleotides and/or labeled ribonucleotides into polymeric transcripts. Examples of such labeled ribonucleotides include biotin, fluorescein, digoxin, amino allyl, and isotopically labeled nucleotides.
The sequence of phage SP6RNA polymerase was originally described (GenBank: Y00105.1) as having the following amino acid sequence:
MQDLHAIQLQLEEEMFNGGIRRFEADQQRQIAAGSESDTAWNRRLLSELIAPMAEGIQ
AYKEEYEGKKGRAPRALAFLQCVENEVAAYITMKVVMDMLNTDATLQAIAMSVAER
IEDQVRFSKLEGHAAKYFEKVKKSLKASRTKSYRHAHNVAVVAEKSVAEKDADFDR
WEAWPKETQLQIGTTLLEILEGSVFYNGEPVFMRAMRTYGGKTIYYLQTSESVGQWIS
AFKEHVAQLSPAYAPCVIPPRPWRTPFNGGFHTEKVASRIRLVKGNREHVRKLTQKQ
MPKVYKAINALQNTQWQINKDVLAVIEEVIRLDLGYGVPSFKPLIDKENKPANPVPVE
FQHLRGRELKEMLSPEQWQQFINWKGECARLYTAETKRGSKSAAVVRMVGQARKYS
AFESIYFVYAMDSRSRVYVQSSTLSPQSNDLGKALLRFTEGRPVNGVEALKWFCINGA
NLWGWDKKTFDVRVSNVLDEEFQDMCRDIAADPLTFTQWAKADAPYEFLAWCFEY
AQYLDLVDEGRADEFRTHLPVHQDGSCSGIQHYSAMLRDEVGAKAVNLKPSDAPQDI
YGAVAQVVIKKNALYMDADDATTFTSGSVTLSGTELRAMASAWDSIGITRSLTKKPV
MTLPYGSTRLTCRESVIDYIVDLEEKEAQKAVAEGRTANKVHPFEDDRQDYLTPGAA
YNYMTALIWPSISEVVKAPIVAMKMIRQLARFAAKRNEGLMYTLPTGFILEQKIMATE
MLRVRTCLMGDIKMSLQVETDIVDEAAMMGAAAPNFVHGHDASHLILTVCELVDKG
VTSIAVIHDSFGTHADNTLTLRVALKGQMVAMYIDGNALQKLLEEHEVRWMVDTGIEVPEQGEFDLNEIMDSEYVFA(SEQ ID NO:16)。
the SP6RNA polymerase suitable for the present invention may be any enzyme having substantially the same polymerase activity as the bacteriophage SP6RNA polymerase. Thus, in some embodiments, SP6RNA polymerase suitable for the present invention may be modified from SEQ ID NO. 16. For example, a suitable SP6RNA polymerase may contain one or more amino acid substitutions, deletions or additions. In some embodiments, a suitable SP6RNA polymerase has an amino acid sequence that is about 99%, 98%, 97%, 96%, 95%, 94%, 93%, 92%, 91%, 90%, 89%, 88%, 87%, 86%, 85%, 84%, 83%, 82%, 81%, 80%, 75%, 70%, 65%, or 60% identical or homologous to SEQ ID NO. 16. In some embodiments, a suitable SP6RNA polymerase may be a truncated protein (from the N-terminus, the C-terminus, or internally), but retains polymerase activity. In some embodiments, a suitable SP6RNA polymerase is a fusion protein.
SP6RNA polymerase suitable for the present invention may be commercially available products, such as those from Aldevron, ambion, new England Biolabs (NEB), promega and Roche. SP6 may be ordered and/or custom designed from commercial or non-commercial sources based on the amino acid sequence of SEQ ID NO. 16 or a variant of SEQ ID NO. 16 as described herein. SP6 may be a standard fidelity polymerase; or may be high fidelity/high efficiency/high capacity, which has been modified to promote RNA polymerase activity, such as mutations in the SP6RNA polymerase gene or post-translational modification of the SP6RNA polymerase itself. Examples of such modified SP6 include SP6RNA Polymerase-Plus from Ambion TM HiScribe SP6 from NEB and RiboMAX from Promega TM Andthe system.
In some embodiments, a suitable SP6RNA polymerase is a fusion protein. For example, the SP6RNA polymerase may comprise one or more tags to facilitate isolation, purification or solubility of the enzyme. Suitable tags may be located at the N-terminus, the C-terminus and/or internally. Non-limiting examples of suitable tags include Calmodulin Binding Protein (CBP); the fasciola hepatica 8kDa antigen (Fh 8); a FLAG tag peptide; glutathione-S-transferase (GST); a histidine tag (e.g., hexahistidine tag (His 6)); maltose Binding Protein (MBP); nitrogen-utilizing substance (NusA); a small molecule ubiquitin-related modifier (SUMO) fusion tag; streptavidin binding peptide (STREP); tandem Affinity Purification (TAP); and thioredoxin (TrxA). Other labels may be used in the present invention. These and other fusion tags have been described, for example, in the following documents: costa et al Frontiers in Microbiology (2014): 63 and PCT/US16/57044, the contents of which are incorporated herein by reference in their entirety. In certain embodiments, the His tag is located at the N-terminus of SP 6.
DNA template
Typically, CFTR DNA templates are fully double-stranded, or mostly single-stranded and have a double-stranded SP6 promoter sequence.
Linearized plasmid DNA (linearized via one or more restriction enzymes), linearized genomic DNA fragments (via restriction enzymes and/or physical means), PCR products and/or synthetic DNA oligonucleotides can be used as templates for SP6 in vitro transcription, provided that they contain a double stranded SP6 promoter upstream (and in the correct orientation) of the DNA sequence to be transcribed.
In some embodiments, the linearized DNA template has blunt ends.
In some embodiments, the DNA sequence to be transcribed may be optimized to promote more efficient transcription and/or translation. For example, the DNA sequence may be optimized with respect to: cis-regulatory elements (e.g., TATA box, termination signal, and protein binding sites), artificial recombination sites, χ sites, cpG dinucleotide content, negative CpG islands, GC content, polymerase slip sites, and/or other elements related to transcription; the DNA sequence may be optimized with respect to: cryptic splice sites, mRNA secondary structure, stable free energy of mRNA, repeat sequence, RNA instability motifsAnd/or other factors related to mRNA processing and stability; the DNA sequence may be optimized with respect to: codon usage bias, codon adaptation, internal χ sites, ribosome binding sites (e.g., IRES), premature poly-A sites, shine-Dalgarno (SD) sequences and/or other elements associated with translation; and/or the DNA sequence may be optimized with respect to: a sequence upstream and downstream of the codon, a codon-anticodon interaction, a translation pause site, and/or other elements related to protein folding. Optimization methods known in the art, such as thermo Fisher and optimumGene, may be used in the present invention TM Is described in US 20110081708, the contents of which are incorporated herein by reference in their entirety.
In some embodiments, the DNA template comprises 5 'and/or 3' untranslated regions. In some embodiments, the 5' untranslated region includes one or more elements that affect the stability or translation of the mRNA, such as an iron-responsive element. In some embodiments, the 5' untranslated region may be between about 50 and 500 nucleotides in length.
In some embodiments, the 3' untranslated region comprises one or more of the following: polyadenylation signals, binding sites for proteins affecting the stability of the position of mRNA in a cell, or one or more binding sites for mirnas. In some embodiments, the 3' untranslated region may be between 50 and 500 nucleotides in length or longer.
Exemplary 3 'and/or 5' utr sequences may be derived from a stable mRNA molecule (e.g., globin, actin, GAPDH, tubulin, histone, or citrate-circulating enzyme) to increase stability of the sense mRNA molecule. For example, the 5' utr sequence may include a partial sequence of the CMV immediate early 1 (IE 1) gene or fragment thereof to improve nuclease resistance and/or to improve the half-life of the polynucleotide. It is also contemplated that a sequence encoding human growth hormone (hGH) or fragment thereof is contained in the 3' or untranslated region of a polynucleotide (e.g., mRNA) to further stabilize the polynucleotide. In general, such modifications may improve the stability and/or pharmacokinetic properties (e.g., half-life) of the polynucleotide relative to the unmodified counterpart of the polynucleotide, and include, for example, modifications made to improve the resistance of such polynucleotides to nuclease digestion in vivo.
Large-scale mRNA synthesis
The present invention relates to the large-scale production of codon optimized CFTR mRNA. In some embodiments, the method according to the invention synthesizes at least 100mg, 150mg, 200mg, 300mg, 400mg, 500mg, 600mg, 700mg, 800mg, 900mg, 1g, 5g, 10g, 25g, 50g, 75g, 100g, 250g, 500g, 750g, 1kg, 5kg, 10kg, 50kg, 100kg, 1000kg or more mRNA in a single batch. As used herein, the term "batch" refers to the amount or quantity of mRNA synthesized at one time (e.g., produced according to a single manufacturing set-up). A batch may refer to the amount of mRNA synthesized in a single reaction that is performed under a set of conditions via a single aliquot of enzyme and/or a single aliquot of DNA template for continuous synthesis. mRNA synthesized in a single batch will not include mRNA synthesized at different times combined to achieve the desired amount. Typically, the reaction mixture includes SP6RNA polymerase, a linear DNA template, and an RNA polymerase reaction buffer (which may include ribonucleotides or may require the addition of ribonucleotides).
According to the invention, 1-100mg of SP6 polymerase per gram (g) of mRNA is generally produced. In some embodiments, about 1-90mg, 1-80mg, 1-60mg, 1-50mg, 1-40mg, 10-100mg, 10-80mg, 10-60mg, 10-50mg of SP6 polymerase is typically used per gram of mRNA produced. In some embodiments, about 5-20mg of the SP6 polymerase is used to produce about 1 gram of mRNA. In some embodiments, about 0.5 to 2 grams of SP6 polymerase is used to produce about 100 grams of mRNA. In some embodiments, about 5 to 20 grams of SP6 polymerase is used to produce about 1 kilogram of mRNA. In some embodiments, at least 5mg of the SP6 polymerase is used to produce at least 1 gram of mRNA. In some embodiments, at least 500mg of the SP6 polymerase is used to produce at least 100 grams of mRNA. In some embodiments, at least 5 grams of SP6 polymerase is used to produce at least 1 kilogram of mRNA. In some embodiments, about 10mg, 20mg, 30mg, 40mg, 50mg, 60mg, 70mg, 80mg, 90mg, or 100mg of plasmid DNA is used per gram of mRNA produced. In some embodiments, about 10-30mg plasmid DNA is used to produce about 1 gram of mRNA. In some embodiments, about 1 to 3 grams of plasmid DNA is used to produce about 100 grams of mRNA. In some embodiments, about 10 to 30 grams of plasmid DNA is used for about 1 kilogram of mRNA. In some embodiments, at least 10mg of plasmid DNA is used to produce at least 1 gram of mRNA. In some embodiments, at least 1 gram of plasmid DNA is used to produce at least 100 grams of mRNA. In some embodiments, at least 10 grams of plasmid DNA is used to produce at least 1 kilogram of mRNA.
In some embodiments, the concentration of SP6RNA polymerase in the reaction mixture may be about 1 to 100nM, 1 to 90nM, 1 to 80nM, 1 to 70nM, 1 to 60nM, 1 to 50nM, 1 to 40nM, 1 to 30nM, 1 to 20nM, or about 1 to 10nM. In certain embodiments, the concentration of the SP6RNA polymerase is about 10 to 50nM, 20 to 50nM, or 30 to 50nM. SP6RNA polymerase concentrations of 100 to 10000 units/ml may be used, for example the following concentrations may be used: 100 to 9000 units/ml, 100 to 8000 units/ml, 100 to 7000 units/ml, 100 to 6000 units/ml, 100 to 5000 units/ml, 100 to 1000 units/ml, 200 to 2000 units/ml, 500 to 1000 units/ml, 500 to 2000 units/ml, 500 to 3000 units/ml, 500 to 4000 units/ml, 500 to 5000 units/ml, 500 to 6000 units/ml, 1000 to 7500 units/ml, and 2500 to 5000 units/ml.
The concentration of each ribonucleotide (e.g., ATP, UTP, GTP and CTP) in the reaction mixture is between about 0.1mM and about 10mM, such as between about 1mM and about 10mM, between about 2mM and about 10mM, between about 3mM and about 10mM, between about 1mM and about 8mM, between about 1mM and about 6mM, between about 3mM and about 10mM, between about 3mM and about 8mM, between about 3mM and about 6mM, between about 4mM and about 5mM. In some embodiments, each ribonucleotide is about 5mM in the reaction mixture. In some embodiments, the total concentration of rtp (e.g., ATP, GTP, CTP and UTP combined) used in the reaction is in the range between 1mM and 40 mM. In some embodiments, the total concentration of rtp (e.g., ATP, GTP, CTP and UTP combined) used in the reaction is in the range between 1mM and 30mM or between 1mM and 28mM or between 1mM and 25mM or between 1mM and 20mM. In some embodiments, the total rtp concentration is less than 30mM. In some embodiments, the total rtp concentration is less than 25mM. In some embodiments, the total rtp concentration is less than 20mM. In some embodiments, the total rtp concentration is less than 15mM. In some embodiments, the total rtp concentration is less than 10mM.
RNA polymerase reaction buffers typically include salts/buffers such as Tris, HEPES, ammonium sulfate, sodium bicarbonate, sodium citrate, sodium acetate, potassium phosphate, sodium chloride, and magnesium chloride.
The pH of the reaction mixture may be between about 6 to 8.5, 6.5 to 8.0, 7.0 to 7.5, and in some embodiments, the pH is 7.5.
The linear or linearized DNA template (e.g., as described above and in an amount/concentration sufficient to provide the desired amount of RNA), the RNA polymerase reaction buffer, and the SP6RNA polymerase are combined to form a reaction mixture. The reaction mixture is incubated at between about 37 ℃ and about 42 ℃ for thirty minutes to six hours, for example about sixty minutes to about ninety minutes.
In some embodiments, about 5mM NTP, about 0.05mg/mL RNA polymerase, and about 0.1mg/mL DNA template are incubated in a suitable SP6 polymerase reaction buffer (final reaction mixture pH of about 7.5) at about 37℃to about 42℃for sixty to ninety minutes.
In some embodiments, the reaction mixture contains a linearized double stranded DNA template with an SP6 polymerase specific promoter, SP6RNA polymerase, rnase inhibitor, pyrophosphatase, 29mM NTP, 10mM DTT, and reaction buffer (800 mM HEPES, 20mM spermidine, 250mM MgCl when at 10x 2 (pH 7.7)) and with rnase free water sufficient (QS) to the desired reaction volume; the reaction mixture was then incubated at 37℃for 60 minutes. Then by adding DNase I and DNase I buffer (100 mM Tris-HCl, 5mM MgCl when at 10X) 2 And 25mM CaCl 2 (pH 7.6)) to promote digestion of double stranded DNA template to quench the polymerase reaction in preparation for purification. This embodiment has been shown to be sufficient to produce 100 grams of mRNA.
In some embodiments, the reaction mixture comprises NTP at a concentration ranging from 1 to 10mM, DNA template at a concentration ranging from 0.01 to 0.5mg/ml, and SP6RNA polymerase at a concentration ranging from 0.01 to 0.1mg/ml, e.g., the reaction mixture comprises NTP at a concentration of 5mM, DNA template at a concentration of 0.1mg/ml, and SP6RNA polymerase at a concentration of 0.05 mg/ml.
Nucleotide(s)
According to the present invention, various naturally occurring or modified nucleosides can be used to produce mRNA. In some embodiments, the mRNA is or comprises a natural nucleoside (e.g., adenosine, guanosine, cytidine, uridine); nucleoside analogs (e.g., 2-aminoadenosine, 2-thiothymidine, inosine, pyrrolo-pyrimidine, 3-methyladenosine, 5-methylcytidine, C-5 propynyl-cytidine, C-5 propynyl-uridine, 2-aminoadenosine, C5-bromouridine, C5-fluorouridine, C5-iodouridine, C5-propynyl-uridine, C5-propynyl-cytidine, C5-methylcytidine, 2-aminoadenosine, 7-deadenosine, 7-deazaguanosine, 8-oxo-guanosine, O (6) -methylguanine, pseudouridine (e.g., N-1-methyl-pseudouridine), 2-thiouridine, and 2-thiocytosine); chemically modified bases; biologically modified bases (e.g., methylated bases); an intercalating base; modified sugars (e.g., 2 '-fluororibose, ribose, 2' -deoxyribose, arabinose, and hexose); and/or modified phosphate groups (e.g., phosphorothioate and 5' -N-phosphoramidite linkages).
In some embodiments, the mRNA comprises one or more non-standard nucleotide residues. Non-standard nucleotide residues may include, for example, 5-methyl-cytidine ("5 mC"), pseudouridine ("ψu"), and/or 2-thiouridine ("2 sU"). For a discussion of such residues and their incorporation into mRNA see, e.g., U.S. patent No. 8,278,036 or WO 2011012316.mRNA can be RNA which is defined as RNA in which 25% of the U residues are 2-thiouridine and 25% of the C residues are 5-methylcytidine. Teachings regarding the use of RNA are disclosed in U.S. patent publication US 20120195936 and international publication WO 2011012316, both of which are incorporated herein by reference in their entirety. The presence of non-standard nucleotide residues may render the mRNA more stable and/or less immunogenic than a control mRNA having the same sequence but containing only standard residues. In other embodiments, the mRNA may comprise one or more non-standard nucleotide residues selected from the group consisting of: isocytosine, pseudoisocytosine, 5-bromouracil, 5-propynyluracil, 6-aminopurine, 2-aminopurine, inosine, diaminopurine, and 2-chloro-6-aminopurine cytosine, and combinations of these modifications and other nucleobase modifications. Some embodiments may further include additional modifications to the furanose ring or nucleobase. Additional modifications may include, for example, sugar modifications or substitutions (e.g., one or more of 2' -O-alkyl modifications, locked Nucleic Acids (LNAs)). In some embodiments, the RNA may be complexed or hybridized to additional polynucleotides and/or peptide Polynucleotides (PNAs). In some embodiments where the sugar modification is a 2 '-O-alkyl modification, such modifications may include, but are not limited to, 2' -deoxy-2 '-fluoro modifications, 2' -O-methyl modifications, 2 '-O-methoxyethyl modifications, and 2' -deoxy modifications. In some embodiments, any of these modifications may be present in 0-100% of the nucleotides, alone or in combination, e.g., more than 0%, 1%, 10%, 25%, 50%, 75%, 85%, 90%, 95% or 100% of the constituent nucleotides.
Post synthesis treatment
Typically, the 5 'cap and/or 3' tail may be added after synthesis. The presence of the cap is important to provide resistance to nucleases found in most eukaryotic cells. The presence of a "tail" serves to protect the mRNA from exonuclease degradation.
The 5' cap is typically added as follows: first, RNA terminal phosphatase removes one of the terminal phosphate groups from the 5' nucleotide, leaving two terminal phosphates; guanosine Triphosphate (GTP) is then added to the terminal phosphate via guanylate transferase, resulting in a 5'5 triphosphate linkage; the 7-nitrogen of guanine is then methylated by methyltransferase. Examples of cap structures include, but are not limited to, m7G (5 ') ppp (5' (a, G (5 ') ppp (5') a) and G (5 ') ppp (5') G. Additional cap structures are described in published U.S. application nos. US 2016/0032356 and U.S. provisional application 62/464,327 filed on 27 months 2 in 2017, which applications are incorporated herein by reference.
Typically, the tail structure comprises poly (a) and/or poly (C) tails. The poly a or poly C tail on the 3' end of an mRNA typically comprises at least 50 adenosine or cytosine nucleotides, at least 150 adenosine or cytosine nucleotides, at least 200 adenosine or cytosine nucleotides, at least 250 adenosine or cytosine nucleotides, at least 300 adenosine or cytosine nucleotides, at least 350 adenosine or cytosine nucleotides, at least 400 adenosine or cytosine nucleotides, at least 450 adenosine or cytosine nucleotides, at least 500 adenosine or cytosine nucleotides, at least 550 adenosine or cytosine nucleotides, at least 600 adenosine or cytosine nucleotides, at least 650 adenosine or cytosine nucleotides, at least 700 adenosine or cytosine nucleotides, at least 750 adenosine or cytosine nucleotides, at least 800 adenosine or cytosine nucleotides, at least 850 adenosine or cytosine nucleotides, at least 900 adenosine or cytosine nucleotides, at least 950 adenosine or cytosine nucleotides, or at least 1kb adenosine or cytosine nucleotides, respectively. In some embodiments, the poly a or poly C tail can be about 10 to 800 adenosine or cytosine nucleotides (e.g., about 10 to 200 adenosine or cytosine nucleotides, about 10 to 300 adenosine or cytosine nucleotides, about 10 to 400 adenosine or cytosine nucleotides, about 10 to 500 adenosine or cytosine nucleotides, about 10 to 550 adenosine or cytosine nucleotides, about 10 to 600 adenosine or cytosine nucleotides, about 50 to 600 adenosine or cytosine nucleotides, about 100 to 600 adenosine or cytosine nucleotides, about 150 to 600 adenosine or cytosine nucleotides, about 200 to 600 adenosine or cytosine nucleotides, about 250 to 600 adenosine or cytosine nucleotides, about 300 to 600 adenosine or cytosine nucleotides, about 350 to 600 adenosine or cytosine nucleotides, about 400 to 600 adenosine or cytosine nucleotides, about 450 to 600 adenosine or cytosine nucleotides, about 500 to 600 adenosine or cytosine nucleotides, about 10 to 20 to 60 adenosine or about 20 to cytosine nucleotides). In some embodiments, the tail structure comprises a combination of poly (a) and poly (C) tails of various lengths as described herein. In some embodiments, the tail structure comprises at least 50%, 55%, 65%, 70%, 75%, 80%, 85%, 90%, 92%, 94%, 95%, 96%, 97%, 98% or 99% adenosine nucleotides. In some embodiments, the tail structure comprises at least 50%, 55%, 65%, 70%, 75%, 80%, 85%, 90%, 92%, 94%, 95%, 96%, 97%, 98% or 99% cytosine nucleotides.
As described herein, the addition of a 5 'cap and/or 3' tail helps to detect abortive transcripts generated during in vitro synthesis, as the size of those prematurely aborted mRNA transcripts may be too small to be detected without capping and/or tailing. Thus, in some embodiments, a 5 'cap and/or 3' tail is added to the synthesized mRNA prior to testing the purity of the mRNA (e.g., the level of abortive transcripts present in the mRNA). In some embodiments, the 5 'cap and/or 3' tail is added to the synthesized mRNA prior to purifying the mRNA as described herein. In other embodiments, the 5 'cap and/or 3' tail is added to the synthesized mRNA after purification of the mRNA as described herein.
mRNA synthesized according to the present invention can be used without further purification. In particular, mRNA synthesized according to the present invention can be used without the step of removing short bodies (shortmers). In some embodiments, mRNA synthesized according to the invention may be further purified. Various methods can be used to purify the mRNA synthesized according to the present invention. For example, purification of mRNA can be performed using centrifugation, filtration, and/or chromatography. In some embodiments, the synthesized mRNA is purified by ethanol precipitation or filtration or chromatography or gel purification or any other suitable means. In some embodiments, the mRNA is purified by HPLC. In some embodiments, mRNA is extracted from a standard phenol-chloroform-isoamyl alcohol solution as is well known to those skilled in the art. In some embodiments, the mRNA is purified using tangential flow filtration. Suitable purification methods include those described in the following PCT application: PCT application PCT/US18/19954 entitled "METHODS FOR PURIFICATION OF MESSENGER RNA" filed on US 2016/0040154, US 2015/0376220, 27, 2, 2018, and PCT application PCT/US18/19978 entitled "METHODS FOR PURIFICATION OF MESSENGER RNA" filed on 27, 2, 2018, which are incorporated herein by reference in their entirety; and may be used to practice the invention.
In some embodiments, the mRNA is purified prior to capping and tailing. In some embodiments, the mRNA is purified after capping and tailing. In some embodiments, the mRNA is purified both before and after capping and tailing.
In some embodiments, the mRNA is purified by centrifugation before or after capping and tailing, or both.
In some embodiments, the mRNA is purified by filtration before or after capping and tailing, or both.
In some embodiments, the mRNA is purified by Tangential Flow Filtration (TFF) before or after capping and tailing, or both.
In some embodiments, the mRNA is purified by chromatography before or after capping and tailing or both.
Characterization of mRNA
Full-length or abortive transcripts of mRNA may be detected and quantified using any method available in the art. In some embodiments, the synthetic mRNA molecules are detected using the following method: blotting, capillary electrophoresis, chromatography, fluorescence, gel electrophoresis, HPLC, silver staining, spectroscopy, ultraviolet (UV) or UPLC or combinations thereof. Other detection methods known in the art are included in the present invention. In some embodiments, the synthesized mRNA molecules are detected using UV absorbance spectroscopy and separated by capillary electrophoresis. In some embodiments, mRNA is first denatured by glyoxal dye prior to gel electrophoresis ("glyoxal gel electrophoresis"). In some embodiments, the synthesized mRNA is characterized prior to capping or tailing. In some embodiments, the synthesized mRNA is characterized after capping and tailing.
In some embodiments, the mRNA produced by the methods disclosed herein comprises less than 10%, less than 9%, less than 8%, less than 7%, less than 6%, less than 5%, less than 4%, less than 3%, less than 2%, less than 1%, less than 0.5%, less than 0.1% of impurities other than full-length mRNA. The impurities include IVT contaminants such as proteins, enzymes, free nucleotides and/or short bodies.
In some embodiments, the mRNA produced according to the invention is substantially free of short bodies or abortive transcripts. In particular, the mRNA produced according to the invention contains short bodies or abortive transcripts at levels undetectable by capillary electrophoresis or glyoxal gel electrophoresis. As used herein, the term "short body" or "abortive transcript" refers to any transcript that is less than full length. In some embodiments, a "short body" or "abortive transcript" is less than 100 nucleotides in length, less than 90, less than 80, less than 70, less than 60, less than 50, less than 40, less than 30, less than 20, or less than 10 nucleotides in length. In some embodiments, short bodies are detected or quantified after addition of the 5 '-cap and/or 3' -poly a tail.
mRNA solution
In some embodiments, mRNA may be provided in a solution to be mixed with a lipid solution, such that the mRNA may be encapsulated in a lipid nanoparticle. Suitable mRNA solutions may be any aqueous solution containing the mRNA to be encapsulated at different concentrations. For example, a suitable mRNA solution may contain mRNA at the following concentrations: is at or above about 0.01mg/ml, 0.05mg/ml, 0.06mg/ml, 0.07mg/ml, 0.08mg/ml, 0.09mg/ml, 0.1mg/ml, 0.15mg/ml, 0.2mg/ml, 0.3mg/ml, 0.4mg/ml, 0.5mg/ml, 0.6mg/ml, 0.7mg/ml, 0.8mg/ml, 0.9mg/ml or 1.0mg/ml. In some embodiments, suitable mRNA solutions may contain mRNA in the following concentration ranges: about 0.01-1.0mg/ml, 0.01-0.9mg/ml, 0.01-0.8mg/ml, 0.01-0.7mg/ml, 0.01-0.6mg/ml, 0.01-0.5mg/ml, 0.01-0.4mg/ml, 0.01-0.3mg/ml, 0.01-0.2mg/ml, 0.01-0.1mg/ml, 0.05-1.0mg/ml, 0.05-0.9mg/ml, 0.05-0.8mg/ml, 0.05-0.7mg/ml, 0.05-0.6mg/ml, 0.05-0.5mg/ml, 0.05-0.4mg/ml, 0.05-0.3mg/ml, 0.05-0.2mg/ml, 0.05-0.1mg/ml, 0.1-0.2 mg/ml, 0.05-0.8mg/ml, 0.05-0.7mg/ml, 0.0.7 mg/ml, 0.0.3-0.5 mg/ml. In some embodiments, a suitable mRNA solution may contain mRNA at the following concentrations: up to about 5.0mg/ml, 4.0mg/ml, 3.0mg/ml, 2.0mg/ml, 1.0mg/ml, 0.09mg/ml, 0.08mg/ml, 0.07mg/ml, 0.06mg/ml or 0.05mg/ml.
In general, suitable mRNA solutions may also contain buffers and/or salts. Typically, buffers may include HEPES, ammonium sulfate, sodium bicarbonate, sodium citrate, sodium acetate, potassium phosphate, and sodium phosphate. In some embodiments, suitable concentrations of buffer may range from about 0.1mM to 100mM, 0.5mM to 90mM, 1.0mM to 80mM, 2mM to 70mM, 3mM to 60mM, 4mM to 50mM, 5mM to 40mM, 6mM to 30mM, 7mM to 20mM, 8mM to 15mM, or 9 to 12mM. In some embodiments, suitable concentrations of buffers are or greater than the following: about 0.1mM, 0.5mM, 1mM, 2mM, 4mM, 6mM, 8mM, 10mM, 15mM, 20mM, 25mM, 30mM, 35mM, 40mM, 45mM or 50mM.
Exemplary salts may include sodium chloride, magnesium chloride, and potassium chloride. In some embodiments, suitable salt concentrations in the mRNA solution may range from about 1mM to 500mM, 5mM to 400mM, 10mM to 350mM, 15mM to 300mM, 20mM to 250mM, 30mM to 200mM, 40mM to 190mM, 50mM to 180mM, 50mM to 170mM, 50mM to 160mM, 50mM to 150mM, or 50mM to 100mM. Suitable salt concentrations in the mRNA solutions are at or above about 1mM, 5mM, 10mM, 20mM, 30mM, 40mM, 50mM, 60mM, 70mM, 80mM, 90mM or 100mM.
In some embodiments, suitable mRNA solutions may have a pH in the range of about 3.5-6.5, 3.5-6.0, 3.5-5.5, 3.5-5.0, 3.5-4.5, 4.0-5.5, 4.0-5.0, 4.0-4.9, 4.0-4.8, 4.0-4.7, 4.0-4.6, or 4.0-4.5. In some embodiments, a suitable mRNA solution may have a pH of or not greater than about 3.5, 4.0, 4.1, 4.2, 4.3, 4.4, 4.5, 4.6, 4.7, 4.8, 4.9, 5.0, 5.2, 5.4, 5.6, 5.8, 6.0, 6.1, 6.3, and 6.5.
Various methods can be used to prepare mRNA solutions suitable for the present invention. In some embodiments, mRNA can be directly dissolved in a buffer solution as described herein. In some embodiments, the mRNA solution may be generated by mixing the mRNA stock solution with a buffer solution prior to mixing with the lipid solution for encapsulation. In some embodiments, the mRNA solution may be generated by mixing the mRNA stock solution with the buffer solution immediately after mixing with the lipid solution for encapsulation. In some embodiments, a suitable stock solution of mRNA may contain mRNA in water at the following concentrations: is at or above about 0.2mg/ml, 0.4mg/ml, 0.5mg/ml, 0.6mg/ml, 0.8mg/ml, 1.0mg/ml, 1.2mg/ml, 1.4mg/ml, 1.5mg/ml, or 1.6mg/ml, 2.0mg/ml, 2.5mg/ml, 3.0mg/ml, 3.5mg/ml, 4.0mg/ml, 4.5mg/ml, or 5.0mg/ml.
In some embodiments, the mRNA stock solution is mixed with the buffer solution using a pump. Exemplary pumps include, but are not limited to, gear pumps, peristaltic pumps, and centrifugal pumps.
Typically, the buffer solution is mixed at a rate greater than the rate of the mRNA stock solution. For example, the buffer solution may be mixed at a rate at least 1x, 2x, 3x, 4x, 5x, 6x, 7x, 8x, 9x, 10x, 15x, or 20x greater than the rate of the mRNA stock solution. In some embodiments, the buffer solution is mixed at a flow rate ranging between about 100-6000 ml/min (e.g., about 100-300 ml/min, 300-600 ml/min, 600-1200 ml/min, 1200-2400 ml/min, 2400-3600 ml/min, 3600-4800 ml/min, 4800-6000 ml/min, or 60-420 ml/min). In some embodiments, the buffer solution is mixed at a flow rate of at or greater than about 60 ml/min, 100 ml/min, 140 ml/min, 180 ml/min, 220 ml/min, 260 ml/min, 300 ml/min, 340 ml/min, 380 ml/min, 420 ml/min, 480 ml/min, 540 ml/min, 600 ml/min, 1200 ml/min, 2400 ml/min, 3600 ml/min, 4800 ml/min, or 6000 ml/min.
In some embodiments, the mRNA stock solution is mixed at a flow rate ranging between about 10-600 ml/min (e.g., about 5-50 ml/min, about 10-30 ml/min, about 30-60 ml/min, about 60-120 ml/min, about 120-240 ml/min, about 240-360 ml/min, about 360-480 ml/min, or about 480-600 ml/min). In some embodiments, the mRNA stock solution is mixed at a flow rate of about or greater than 5 ml/min, 10 ml/min, 15 ml/min, 20 ml/min, 25 ml/min, 30 ml/min, 35 ml/min, 40 ml/min, 45 ml/min, 50 ml/min, 60 ml/min, 80 ml/min, 100 ml/min, 200 ml/min, 300 ml/min, 400 ml/min, 500 ml/min, or 600 ml/min.
Delivery vehicle
According to the invention, mRNA encoding a peptide, polypeptide, or protein (e.g., full length, fragment, or portion of a protein) as described herein can be delivered via a delivery vehicle. As used herein, the terms "delivery vehicle," "transfer vehicle," "nanoparticle," or grammatical equivalents are used interchangeably.
The delivery vehicle may be formulated in combination with one or more additional nucleic acids, carriers, targeting ligands or stabilizing agents, or in a pharmaceutical composition in which the vehicle is admixed with a suitable excipient. Techniques for drug formulation and administration can be found in the following documents: "Remington's Pharmaceutical Sciences," Mack Publishing Co., easton, pa., latest edition. The particular delivery vehicle is selected based on its ability to facilitate transfection of the nucleic acid into the target cell.
In some embodiments, the delivery vehicle comprising mRNA is administered by pulmonary delivery (e.g., including nebulization). In these embodiments, the delivery vehicle may be in the form of an aerosolized composition that can be inhaled. In some embodiments, mRNA is expressed in tissue to which the delivery vehicle is administered, e.g., nasal, tracheal, bronchial, bronchiole, and/or other pulmonary system-related cells or tissues. Other teachings of pulmonary delivery and nebulization are described in related international application PCT/US17/61100 filed on 10/11/2017, entitled "NOVEL ICE-BASED LIPID NANOPARTICLE FORMULATION FOR DELIVERY OF MRNA" by the applicant, and U.S. provisional application USSN 62/507,061, each of which is incorporated by reference in its entirety.
In some embodiments, mRNA encoding a protein may be delivered via a single delivery vehicle. In some embodiments, mRNA encoding a protein may be delivered via one or more delivery vehicles (each having a different composition). According to various embodiments, suitable delivery vehicles include, but are not limited to, polymer-based carriers (such as Polyethylenimine (PEI), lipid nanoparticles and liposomes, nanoliposomes, ceramide-containing nanoliposomes, proteoliposomes, exosomes of both natural and synthetic origin, natural, synthetic and semisynthetic lamellar bodies, nanoparticles, calcium phosphor-silicate nanoparticles, calcium phosphate nanoparticles, silica nanoparticles, nanocrystal particles, semiconductor nanoparticles, poly (D-arginine), sol-gel, nanodendrimers, starch-based delivery systems, micelles, emulsions, lipid vesicles (niosomes), multidomain-block polymers (vinyl polymers, polyacrylic acid polymers, dynamic poly-conjugates), dry powder formulations, plasmids, viruses, calcium phosphate nucleotides, aptamers, peptides and other vector tags, the use of biological nanocapsules and other viral capsid protein assemblies as suitable transfer vehicles is also contemplated (Hum. Gene. Th.2008; 19 (9): 887-95).
The delivery vehicle comprising mRNA can be administered and dosed according to current medical practice, taking into account the clinical condition of the subject, the site and method of administration (e.g., local and systemic, including oral, pulmonary, and via injection), the schedule of administration, the age, sex, weight of the subject, and other factors relevant to the clinician of ordinary skill in the art. An "effective amount" for purposes herein may be determined by such relevant considerations as are known to those of ordinary skill in the experimental clinical study, pharmacology, clinical and medical arts. In some embodiments, the amount administered is effective to achieve at least some stabilization, improvement or elimination of symptoms and other indicia, as selected by one of skill in the art as an appropriate measure of disease progression, regression or improvement. For example, suitable amounts and dosing regimens are those that result in at least temporary protein production.
In some embodiments, CFTR mRNA is administered in combination with one or more CFTR potentiators and/or correctors. Suitable CFTR potentiators and/or correction agents include ivacaine (trade name) Lu Maka Torr (trade name)) Or a combination of ivacaine and Lu Maka torr. In some embodiments, CFTR mRNA is combined with one or more other CF therapies (e.g., hormone replacement therapy, thyroid hormone during treatment Replacement therapy, non-steroidal inflammatory drugs and prescribed dronabinol ++>) And (3) combined application.
In some embodiments, the human subject receives concomitant CFTR modulator therapy. In some embodiments, the concomitant CFTR modulator therapy comprises ivacaine. In some embodiments, the concomitant CFTR modulator therapy comprises Lu Maka torr. In some embodiments, the concomitant CFTR modulator therapy comprises tizakatuo. In some embodiments, the concomitant CFTR modulator therapy is selected from ivacaine, lu Maka torr, tizalcine, or a combination. In some embodiments, the concomitant CFTR modulator therapy comprises VX-659. In some embodiments, the concomitant CFTR modulator therapy comprises VX-445. In some embodiments, the concomitant CFTR modulator therapy comprises VX-152. In some embodiments, the concomitant CFTR modulator therapy comprises VX-440. In some embodiments, the concomitant CFTR modulator therapy comprises VX-371. In some embodiments, the concomitant CFTR modulator therapy comprises VX-561. In some embodiments, the concomitant CFTR modulator therapy comprises GLPG1837. In some embodiments, the concomitant CFTR modulator therapy comprises GLPG2222. In some embodiments, the concomitant CFTR modulator therapy comprises GLPG2737. In some embodiments, the concomitant CFTR modulator therapy comprises GLPG2451. In some embodiments, the concomitant CFTR modulator therapy comprises GLPG1837. In some embodiments, the concomitant CFTR modulator therapy comprises PTI-428. In some embodiments, the concomitant CFTR modulator therapy comprises PTI-801. In some embodiments, the concomitant CFTR modulator therapy comprises PTI-808. In some embodiments, the concomitant CFTR modulator therapy comprises Ai Lufu sen.
In some embodiments, the human subject is not suitable for treatment with one or more of ivacaine, lu Maka torr, tizacal torr, VX-659, VX-445, VX-152, VX-440, VX-371, VX-561, VX-659, or a combination thereof. In some embodiments, the human subject is not suitable for treatment with one or more of ivacaide, lu Maka torr, tizacaide, VX-659, VX-445, VX-152, VX-440, VX-371, VX-561, VX-659, GLPG1837, GLPG2222, GLPG2737, GLPG2451, GLPG1837, PTI-428, PTI-801, PTI-808, ai Lufu sen, or a combination thereof.
In some embodiments, the delivery vehicles are formulated such that they are suitable for prolonged release of the mRNA contained therein. Such extended release compositions can be conveniently administered to a subject at extended dosing intervals.
Liposome delivery vehicles
In some embodiments, a suitable delivery vehicle is a liposomal delivery vehicle, such as a lipid nanoparticle. As used herein, a liposome delivery vehicle (e.g., a lipid nanoparticle) is generally characterized as a microvesicle having an internal aqueous space separated from an external medium by one or more bilayer membranes. Bilayer membranes of liposomes are typically formed from amphiphilic molecules, such as lipids of synthetic or natural origin, comprising spatially separated hydrophilic and hydrophobic domains (Lasic, trends biotechnology, 16:307-321,1998). The bilayer membrane of the liposome may also be formed from an amphiphilic polymer and a surfactant (e.g., a polymer body, a lipid vesicle, etc.). In the context of the present invention, a liposome delivery vehicle is typically used to transport the desired mRNA to a target cell or tissue. In some embodiments, the nanoparticle delivery vehicle is a liposome. In some embodiments, the liposome comprises one or more cationic lipids, one or more non-cationic lipids, one or more cholesterol-based lipids, and one or more PEG-modified lipids. In some embodiments, the liposome comprises no more than three different lipid components. In some embodiments, one of the different lipid components is a sterol-based cationic lipid.
Cationic lipids
As used herein, the phrase "cationic lipid" refers to any of a number of lipid species that have a net positive charge at a selected pH, such as physiological pH.
Suitable cationic lipids for use in the compositions and methods of the present invention include cationic lipids as described in International patent publication WO 2010/144740, which is incorporated herein by reference. In certain embodiments, the compositions and methods of the present invention comprise cationic lipids (6 z,9z,28z,31 z) -heptadecen-6,9,28,31-tetraen-19-yl 4- (dimethylamino) butyrate having the following compound structure:
and pharmaceutically acceptable salts thereof.
Other suitable cationic lipids for use in the compositions and methods of the present invention include ionizable cationic lipids as described in international patent publication WO 2013/149440, which is incorporated herein by reference. In some embodiments, the compositions and methods of the invention comprise a cationic lipid of one of the following formulas:
or a pharmaceutically acceptable salt thereof, wherein R 1 And R is 2 Each independently selected from hydrogen, optionally substituted, variably saturated or unsaturated C 1 -C 20 Alkyl and optionally substituted, variably saturated or unsaturated C 6 -C 20 An acyl group; wherein L is 1 And L 2 Each independently selected from hydrogen, optionally substituted C 1 -C 30 Alkyl, optionally substituted, variably unsaturated C 1 -C 30 Alkenyl and optionally substituted C 1 -C 30 Alkynyl; wherein m and o are each independently selected from zero and any positive integer (e.g., where m is three); and wherein n is zero or any positive integer (e.g., where n is one). In certain embodiments, the compositions and methods of the present invention comprise a cationic lipid (15Z, 18Z) -N, N-dimethyl-6- (9Z, 12Z) -octadecane-9, 12-dien-l-yl) tetracosyl-15, 18-dien-1-amine ("HGT 5000") having the following compound structure:
and pharmaceutically acceptable salts thereof. In certain embodiments, the compositions and methods of the present invention comprise a cationic lipid (15 z,18 z) -N, N-dimethyl-6- ((9 z,12 z) -octadeca-9, 12-dien-1-yl) tetracosan-4,15,18-trien-l-amine ("HGT 5001") having the following compound structure:
and pharmaceutically acceptable salts thereof. In certain embodiments, the compositions and methods of the present invention comprise a cationic lipid and (15 z,18 z) -N, N-dimethyl-6- ((9 z,12 z) -octadeca-9, 12-dien-1-yl) tetracosan-5,15,18-trien-1-amine ("HGT 5002") having the following compound structure:
And pharmaceutically acceptable salts thereof.
Other suitable cationic lipids for use in the compositions and methods of the present invention include the cationic lipids described as aminoalcohol lipids (lipidoid) in international patent publication WO 2010/053572, which is incorporated herein by reference. In certain embodiments, the compositions and methods of the present invention comprise a cationic lipid having the following compound structure:
and pharmaceutically acceptable salts thereof.
Other suitable cationic lipids for use in the compositions and methods of the present invention include cationic lipids as described in International patent publication WO 2016/118725, which is incorporated herein by reference. In certain embodiments, the compositions and methods of the present invention comprise a cationic lipid having the following compound structure:
and pharmaceutically acceptable salts thereof.
Other suitable cationic lipids for use in the compositions and methods of the present invention include cationic lipids as described in International patent publication WO 2016/118724, which is incorporated herein by reference. In certain embodiments, the compositions and methods of the present invention comprise a cationic lipid having the following compound structure:
And pharmaceutically acceptable salts thereof.
Other suitable cationic lipids for use in the compositions and methods of the present invention include cationic lipids having the formula 14, 25-ditridecyl 15,18,21,24-tetraaza-trioctadecyl, and pharmaceutically acceptable salts thereof.
Other suitable cationic lipids for use in the compositions and methods of the present invention include cationic lipids as described in International patent publications WO 2013/063284 and WO 2016/205691, each of which is incorporated herein by reference. In some embodiments, the compositions and methods of the invention comprise a cationic lipid of the formula:
or a pharmaceutically acceptable salt thereof, wherein R L Independently of each occurrence of (2) is optionally substituted C 6 -C 40 Alkenyl groups. In certain embodiments, the compositions and methods of the present invention compriseA cationic lipid having the structure of:
and pharmaceutically acceptable salts thereof. In certain embodiments, the compositions and methods of the present invention comprise a cationic lipid having the following compound structure:
and pharmaceutically acceptable salts thereof. In certain embodiments, the compositions and methods of the present invention comprise a cationic lipid having the following compound structure:
And pharmaceutically acceptable salts thereof. In certain embodiments, the compositions and methods of the present invention comprise a cationic lipid having the following compound structure:
and pharmaceutically acceptable salts thereof.
Other suitable cationic lipids for use in the compositions and methods of the present invention include cationic lipids as described in International patent publication WO 2015/184356, which is incorporated herein by reference. In some embodiments, the compositions and methods of the invention comprise a cationic lipid of the formula:
or a pharmaceutically acceptable salt thereof, wherein each X is independently O or S; each Y is independently O or S; each m is independently 0 to 20; each of which isn is independently 1 to 6; each R A Independently is hydrogen, optionally substituted C1-50 alkyl, optionally substituted C2-50 alkenyl, optionally substituted C2-50 alkynyl, optionally substituted C3-10 carbocyclyl, optionally substituted 3-14 membered heterocyclyl, optionally substituted C6-14 aryl, optionally substituted 5-14 membered heteroaryl, or halogen; and each R B Independently is hydrogen, optionally substituted C1-50 alkyl, optionally substituted C2-50 alkenyl, optionally substituted C2-50 alkynyl, optionally substituted C3-10 carbocyclyl, optionally substituted 3-14 membered heterocyclyl, optionally substituted C6-14 aryl, optionally substituted 5-14 membered heteroaryl, or halogen. In certain embodiments, the compositions and methods of the present invention comprise a cationic lipid "target 23" having the following compound structure:
And pharmaceutically acceptable salts thereof.
Other suitable cationic lipids for use in the compositions and methods of the present invention include cationic lipids as described in International patent publication WO 2016/004202, which is incorporated herein by reference. In some embodiments, the compositions and methods of the present invention comprise a cationic lipid having the following compound structure:
or a pharmaceutically acceptable salt thereof. In some embodiments, the compositions and methods of the present invention comprise a cationic lipid having the following compound structure:
or a pharmaceutically acceptable salt thereof. In some embodiments, the compositions and methods of the present invention comprise a cationic lipid having the following compound structure:
or a pharmaceutically acceptable salt thereof.
Other suitable cationic lipids for use in the compositions and methods of the present invention include cationic lipids as described in U.S. provisional patent application Ser. No. 62/758,179, which is incorporated herein by reference. In some embodiments, the compositions and methods of the invention comprise a cationic lipid of the formula:
or a pharmaceutically acceptable salt thereof, wherein each R 1 And R is 2 Independently H or C 1 -C 6 An aliphatic group; each m is independently an integer having a value of 1 to 4; each a is independently a covalent bond or arylene; each L 1 Independently an ester, thioester, disulfide, or anhydride group; each L 2 Independently C 2 -C 10 An aliphatic group; each X is 1 Independently H or OH; and each R 3 Independently C 6 -C 20 An aliphatic group. In some embodiments, the compositions and methods of the invention comprise a cationic lipid of the formula:
or a pharmaceutically acceptable salt thereof. In some embodiments, the compositions and methods of the invention comprise a cationic lipid of the formula:
or a pharmaceutically acceptable salt thereof. In some embodiments, the compositions and methods of the invention comprise a cationic lipid of the formula:
or a pharmaceutically acceptable salt thereof.
Other suitable cationic lipids for use in the compositions and methods of the present invention include cationic lipids as described in the following documents: J.McClellan, M.C.King, cell 2010,141,210-217 and Whitehead et al, nature Communications (2014) 5:4277, which are incorporated herein by reference. In certain embodiments, the cationic lipids of the compositions and methods of the present invention include cationic lipids having the following compound structure:
and pharmaceutically acceptable salts thereof.
Other suitable cationic lipids for use in the compositions and methods of the present invention include cationic lipids as described in International patent publication WO 2015/199952, which is incorporated herein by reference. In some embodiments, the compositions and methods of the present invention comprise a cationic lipid having the following compound structure:
And pharmaceutically acceptable salts thereof. In some embodiments, the compositions and methods of the present invention comprise a cationic lipid having the following compound structure:
and pharmaceutically acceptable salts thereof. In some embodiments, the compositions and methods of the present invention comprise a cationic lipid having the following compound structure:
and pharmaceutically acceptable salts thereof. In some embodiments, the compositions and methods of the present invention comprise a cationic lipid having the following compound structure:
and pharmaceutically acceptable salts thereof. In some embodiments, the compositions and methods of the present invention comprise a cationic lipid having the following compound structure:
and pharmaceutically acceptable salts thereof. In some embodiments, the compositions and methods of the present invention comprise a cationic lipid having the following compound structure:
and pharmaceutically acceptable salts thereof. In some embodiments, the compositions and methods of the present invention comprise a cationic lipid having the following compound structure:
and pharmaceutically acceptable salts thereof. In some embodiments, the compositions and methods of the present invention comprise a cationic lipid having the following compound structure:
and pharmaceutically acceptable salts thereof. In some embodiments, the compositions and methods of the present invention comprise a cationic lipid having the following compound structure:
And pharmaceutically acceptable salts thereof. In some embodiments, the compositions and methods of the present invention comprise a cationic lipid having the following compound structure:
and pharmaceutically acceptable salts thereof. In some embodiments, the compositions and methods of the present invention comprise a cationic lipid having the following compound structure:
and pharmaceutically acceptable salts thereof. In some embodiments, the compositions and methods of the present invention comprise a cationic lipid having the following compound structure:
and pharmaceutically acceptable salts thereof. In some embodiments, the compositions and methods of the present invention comprise a cationic lipid having the following compound structure:
and pharmaceutically acceptable salts thereof.
Other suitable cationic lipids for use in the compositions and methods of the present invention include cationic lipids as described in International patent publication WO 2017/004143, which is incorporated herein by reference. In some embodiments, the compositions and methods of the present invention comprise a cationic lipid having the following compound structure:
and pharmaceutically acceptable salts thereof. In some embodiments, the compositions and methods of the present invention comprise a cationic lipid having the following compound structure:
And pharmaceutically acceptable salts thereof. In some embodiments, the compositions and methods of the present invention comprise a cationic lipid having the following compound structure:
/>
and pharmaceutically acceptable salts thereof. In some embodiments, the compositions and methods of the present invention comprise a cationic lipid having the following compound structure:
and pharmaceutically acceptable salts thereof. In some embodiments, the compositions and methods of the present invention comprise a cationic lipid having the following compound structure:
and pharmaceutically acceptable salts thereof. In some embodiments, the compositions and methods of the present invention comprise a cationic lipid having the following compound structure:
and pharmaceutically acceptable salts thereof. In some embodiments, the compositions and methods of the present invention comprise a cationic lipid having the following compound structure:
and pharmaceutically acceptable salts thereof. In some embodiments, the compositions and methods of the present invention comprise a cationic lipid having the following compound structure:
and pharmaceutically acceptable salts thereof. In some embodiments, the compositions and methods of the present invention comprise a cationic lipid having the following compound structure:
and pharmaceutically acceptable salts thereof. In some embodiments, the compositions and methods of the present invention comprise a cationic lipid having the following compound structure:
And pharmaceutically acceptable salts thereof. In some embodiments, the compositions and methods of the present invention comprise a cationic lipid having the following compound structure:
and pharmaceutically acceptable salts thereof. In some embodiments, the compositions and methods of the present invention comprise a cationic lipid having the following compound structure:
and pharmaceutically acceptable salts thereof. In some embodiments, the compositions and methods of the present invention comprise a cationic lipid having the following compound structure:
and pharmaceutically acceptable salts thereof. In some embodiments, the compositions and methods of the present invention comprise a cationic lipid having the following compound structure:
and pharmaceutically acceptable salts thereof. In some embodiments, the compositions and methods of the present invention comprise a cationic lipid having the following compound structure:
and pharmaceutically acceptable salts thereof. In some embodiments, the compositions and methods of the present invention comprise a cationic lipid having the following compound structure:
and pharmaceutically acceptable salts thereof. In some embodiments, the compositions and methods of the present invention comprise a cationic lipid having the following compound structure:
and pharmaceutically acceptable salts thereof.
Other suitable cationic lipids for use in the compositions and methods of the present invention include cationic lipids as described in International patent publication WO 2017/075531, which is incorporated herein by reference. In some embodiments, the compositions and methods of the invention comprise a cationic lipid of the formula:
or a pharmaceutically acceptable salt thereof, wherein L 1 Or L 2 One of them is-O (c=o) -, - (c=o) O-, -C (=o) -, -O-, -S (O) x 、-S-S-、-C(=O)S-、-SC(=O)-、-NR a C(=O)-、-C(=O)NR a -、NR a C(=O)NR a -、-OC(=O)NR a -or-NR a C (=o) O-; and L is 1 Or L 2 The other of (C=O) -, - (C=O) O-, -C (=O) -, -O-, -S (O) x 、-S-S-、-C(=O)S-、SC(=O)-、-NR a C(=O)-、-C(=O)NR a -、NR a C(=O)NR a -、-OC(=O)NR a -or-NR a C (=o) O-or a direct bond; g 1 And G 2 Each independently is unsubstituted C 1 -C 12 Alkylene or C 1 -C 12 Alkenylene; g 3 Is C 1 -C 24 Alkylene, C 1 -C 24 Alkenylene, C 3 -C 8 Cycloalkylene, C 3 -C 8 A cycloalkenyl group; r is R a Is H or C 1 -C 12 An alkyl group; r is R 1 And R is 2 Each independently is C 6 -C 24 Alkyl or C 6 -C 24 Alkenyl groups; r is R 3 Is H, OR 5 、CN、-C(=O)OR 4 、-OC(=O)R 4 or-NR 5 C(=O)R 4 ;R 4 Is C 1 -C 12 An alkyl group; r is R 5 Is H or C 1 -C 6 An alkyl group; and x is 0, 1 or 2.
Other suitable cationic lipids for use in the compositions and methods of the present invention include cationic lipids as described in International patent publication WO 2017/117528, which is incorporated herein by reference. In some embodiments, the compositions and methods of the present invention comprise a cationic lipid having the following compound structure:
And pharmaceutically acceptable salts thereof. In some embodiments, the compositions and methods of the present invention comprise a cationic lipid having the following compound structure:
and pharmaceutically acceptable salts thereof. In some embodiments, the compositions and methods of the present invention comprise a cationic lipid having the following compound structure:
and pharmaceutically acceptable salts thereof.
Other suitable cationic lipids for use in the compositions and methods of the present invention include cationic lipids as described in international patent publication WO 2017/049245, which is incorporated herein by reference. In some embodiments, the cationic lipids of the compositions and methods of the present invention include compounds of one of the following formulas:
and pharmaceutically acceptable salts thereof. For any of these four formulas, R 4 Independently selected from- (CH) 2 ) n Q and- (CH) 2 ) n CHQR; q is selected from the group consisting of-OR, -OH, -O (CH) 2 ) n N(R) 2 、-OC(O)R、-CX 3 、-CN、-N(R)C(O)R、-N(H)C(O)R、-N(R)S(O) 2 R、-N(H)S(O) 2 R、-N(R)C(O)N(R) 2 、-N(H)C(O)N(R) 2 、-N(H)C(O)N(H)(R)、-N(R)C(S)N(R) 2 、-N(H)C(S)N(R) 2 -N (H) C (S) N (H) (R) and heterocycle; and n is 1, 2 or 3. In certain embodiments, the compositions and methods of the present invention comprise a cationic lipid having the following compound structure:
and pharmaceutically acceptable salts thereof. In certain embodiments, the compositions and methods of the present invention comprise a cationic lipid having the following compound structure:
And pharmaceutically acceptable salts thereof. In certain embodiments, the compositions and methods of the present invention comprise a cationic lipid having the following compound structure:
and pharmaceutically acceptable salts thereof. In certain embodiments, the compositions and methods of the present invention comprise a cationic lipid having the following compound structure:
and pharmaceutically acceptable salts thereof.
Other suitable cationic lipids for use in the compositions and methods of the present invention include cationic lipids as described in International patent publications WO 2017/173054 and WO 2015/095340, each of which is incorporated herein by reference. In certain embodiments, the compositions and methods of the present invention comprise a cationic lipid having the following compound structure:
and pharmaceutically acceptable salts thereof. In certain embodiments, the compositions and methods of the present invention comprise a cationic lipid having the following compound structure:
and pharmaceutically acceptable salts thereof. In certain embodiments, the compositions and methods of the present invention comprise a cationic lipid having the following compound structure:
and pharmaceutically acceptable salts thereof. In certain embodiments, the compositions and methods of the present invention comprise a cationic lipid having the following compound structure:
And pharmaceutically acceptable salts thereof.
Other suitable cationic lipids for use in the compositions and methods of the present invention include cleavable cationic lipids as described in international patent publication WO 2012/170889, which is incorporated herein by reference. In some embodiments, the compositions and methods of the invention comprise a cationic lipid of the formula:
wherein R is 1 Selected from imidazole, guanidine, amino, imine, enamine, optionally substituted alkylamino (e.g., alkylamino, such as dimethylamino), and pyridinyl; wherein R is 2 Selected from one of the following two formulas:
and wherein R is 3 And R is 4 Each independently selected from optionally substituted, variably saturated or unsaturated C 6 -C 20 Alkyl and optionally substituted, variably saturated or unsaturated C 6 -C 20 An acyl group; and wherein n is zero or any positive integer (e.g., one, two, three, four, five, six, seven, eight, nine, ten, eleven, twelve, thirteen, fourteen, fifteen, sixteen, seventeen, eighteen, nineteen, twenty or more). In certain embodiments, the compositions and methods of the present invention comprise a cationic lipid, "HGT4001", having the following compound structure:
and pharmaceutically acceptable salts thereof. In certain embodiments, the compositions and methods of the present invention comprise a cationic lipid ("HGT 4002" (also referred to herein as "Guan-SS-Chol")) a compound having the structure:
And pharmaceutically acceptable salts thereof. In certain embodiments, the compositions and methods of the present invention comprise a cationic lipid, "HGT4003", having the following compound structure:
and pharmaceutically acceptable salts thereof. In certain embodiments, the compositions and methods of the present invention comprise a cationic lipid, "HGT4004", having the following compound structure:
and pharmaceutically acceptable salts thereof. In certain embodiments, the compositions and methods of the present invention comprise a cationic lipid, "HGT4005", having the following compound structure:
and pharmaceutically acceptable salts thereof.
Other suitable cationic lipids for use in the compositions and methods of the present invention include cleavable cationic lipids as described in U.S. provisional application No. 62/672,194 filed 5/16 a 2018, and which application is incorporated herein by reference. In certain embodiments, the compositions and methods of the present invention include cationic lipids having any of the general formulas or any of structures (1 a) - (21 a) and (1 b) - (21 b) and (22) - (237) described in U.S. provisional application No. 62/672,194. In certain embodiments, the compositions and methods of the present invention comprise cationic lipids having a structure according to formula (I'),
Wherein:
R X independently is-H, -L 1 -R 1 or-L 5A -L 5B -B';
L 1 、L 2 And L 3 Each of which is independently a covalent bond, -C (O) -, -C (O) O-, -C (O) S-, or-C (O) NR L -;
Each L 4A And L 5A Is independently-C (O) -, -C (O) O-or-C (O) NR L -;
Each L 4B And L 5B Independently C 1 -C 20 An alkylene group; c (C) 2 -C 20 Alkenylene; or C 2 -C 20 Alkynylene;
each of B and B' is NR 4 R 5 Or a 5 to 10 membered nitrogen containing heteroaryl;
each R 1 、R 2 And R is 3 Independently C 6 -C 30 Alkyl, C 6 -C 30 Alkenyl or C 6 -C 30 Alkynyl;
each R 4 And R is 5 Independently hydrogen, C 1 -C 10 An alkyl group; c (C) 2 -C 10 Alkenyl groups; or C 2 -C 10 Alkynyl; and is also provided with
Each R L Independently hydrogen, C 1 -C 20 Alkyl, C 2 -C 20 Alkenyl or C 2 -C 20 Alkynyl groups.
In certain embodiments, the compositions and methods of the present invention include a cationic lipid of compound (139) as 62/672,194 having the following compound structure:
in some embodiments, the compositions and methods of the present invention include the cationic lipid N- [ l- (2, 3-dioleoyloxy) propyl ] -N, N, N-trimethylammonium chloride ("DOTMA"). (Feigner et al Proc. Nat' l Acad. Sci.84,7413 (1987); U.S. Pat. No. 4,897,355, incorporated herein by reference). Other cationic lipids suitable for the compositions and methods of the present invention include, for example, 5-carboxy sperminyl (spinyl) glycine dioctadecylamide ("DOGS"); 2, 3-dioleoyloxy-N- [2 (spermine-carboxamide) ethyl ] -N, N-dimethyl-l-propylamine ("DOSPA") (Behr et al proc. Nat.' l acad. Sci.86,6982 (1989), U.S. patent No. 5,171,678; U.S. patent No. 5,334,761); l, 2-dioleoyl-3-dimethylammonium-propane ("DODAP"); l, 2-dioleoyl-3-trimethylammonium-propane ("DOTAP").
Additional exemplary cationic lipids suitable for the compositions and methods of the present invention also include: l, 2-distearoyloxy-N, N-dimethyl-3-aminopropane ("DSDMA"); 1, 2-dioleoyloxy-N, N-dimethyl-3-aminopropane ("DODMA"); 1, 2-dioleoyloxy-N, N-dimethyl-3-aminopropane ("DLinDMA"); l, 2-di-linolenyloxy-N, N-dimethyl-3-aminopropane ("DLenDMA"); N-dioleoyl-N, N-dimethyl ammonium chloride ("DODAC"); n, N-distearoyl-N, N-dimethyl ammonium bromide ("DDAB"); n- (l, 2-dimyristoxyprop-3-yl) -N, N-dimethyl-N-hydroxyethylammonium bromide ("dmriie"); 3-dimethylamino-2- (cholest-5-en-3- β -oxybut-4-oxy) -l- (cis, cis-9, 12-octadecadienyloxy) propane ("CLinDMA"); 2- [5'- (cholest-5-en-3- β -oxy) -3' -oxapentoxy) -3-dimethyl-l- (cis, cis-9 ', l-2' -octadecadienoxy) propane ("CpLinDMA"); n, N-dimethyl-3, 4-dioleoyloxybenzylamine ("DMOBA"); 1,2-N, N' -dioleylcarbamoyl-3-dimethylaminopropane ("DOcarbDAP"); 2, 3-dioleoyloxy-n, n-dimethylpropylamine ("DLinDAP"); l,2-N, N' -dioleylcarbamoyl-3-dimethylaminopropane ("DLincarbDAP"); l, 2-dioleoyl carbamoyl-3-dimethylaminopropane ("dlindcap"); 2, 2-dioleoyl-4-dimethylaminomethyl- [ l,3] -dioxolane ("DLin-K-DMA"); 2- ((8- [ (3P) -cholest-5-en-3-yloxy ] octyl) oxy) -N, N-dimethyl-3- [ (9 z,12 z) -octadec-9, 12-dien-1-yloxy ] propan-1-amine ("octyl-CLinDMA"); (2R) -2- ((8- [ (3β) -cholest-5-en-3-yloxy ] octyl) oxy) -N, N-dimethyl-3- [ (9 z,12 z) -octadec-9, 12-dien-1-yloxy ] propan-1-amine ("octyl-CLinDMA (2R)"); (2S) -2- ((8- [ (3P) -cholest-5-en-3-yloxy ] octyl) oxy) -N, fsl-dimethyl 3- [ (9 z,12 z) -octadec-9, 12-dien-1-yloxy ] propan-1-amine ("octyl-CLinDMA (2S)"); 2, 2-dioleoyl-4-dimethylaminoethyl- [ l,3] -dioxolane ("DLin-K-XTC 2-DMA"); and 2- (2, 2-di ((9Z, 12Z) -octadecane-9, l 2-dien-1-yl) -l, 3-dioxolan-4-yl) -N, N-dimethylethylamine ("DLin-KC 2-DMA") (see, WO 2010/042877, which is incorporated herein by reference; semple et al, nature Biotech.28:172-176 (2010)). (Heyes, J. Et al J Controlled Release 107:276-287 (2005); morrissey, DV. et al Nat. Biotechnol.23 (8): 1003-1007 (2005); international patent publication WO 2005/121348). In some embodiments, the one or more cationic lipids comprise at least one of an imidazole, a dialkylamino, or a guanidine moiety. In some embodiments, one or more cationic lipids suitable for the compositions and methods of the present invention include 2, 2-dioleoyl-4-dimethylaminoethyl- [1,3] -dioxolane ("XTC"); (3 aR,5s,6 aS) -N, N-dimethyl-2, 2-bis ((9Z, 12Z) -octadecane-9, 12-dienyl) tetrahydro-3 aH-cyclopenta [ d ] [1,3] dioxol-5-amine ("ALNY-100") and/or 4,7, 13-tris (3-oxo-3- (undecylamino) propyl) -N1, N16-bis undecyl-4, 7,10, 13-tetraazahexadecane-1, 16-diamide ("NC 98-5").
In some embodiments, one or more cationic lipids suitable for the compositions and methods of the present invention include cationic lipids as TL1-04D-DMA having the following compound structure:
in some embodiments, one or more cationic lipids suitable for the compositions and methods of the present invention include a cationic lipid as GL-TES-SA-DME-E18-2 having the following compound structure:
in some embodiments, one or more cationic lipids suitable for the compositions and methods of the present invention include cationic lipids as SY-3-E14-DMAPR, which have the following compound structure:
in some embodiments, one or more cationic lipids suitable for the compositions and methods of the present invention include cationic lipids as TL1-01D-DMA having the following compound structure:
in some embodiments, one or more cationic lipids suitable for the compositions and methods of the present invention include cationic lipids as TL1-10D-DMA having the following compound structure:
in some embodiments, one or more cationic lipids suitable for the compositions and methods of the present invention include cationic lipids as GL-TES-SA-DMP-E18-2 having the following compound structure:
In some embodiments, one or more cationic lipids suitable for the compositions and methods of the present invention include cationic lipids as HEP-E4-E10 having the following compound structure:
in some embodiments, one or more cationic lipids suitable for the compositions and methods of the present invention include cationic lipids as HEP-E3-E10 having the following compound structure:
in some embodiments, the compositions of the present invention comprise one or more cationic lipids that constitute at least about 5%, 10%, 20%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, or 70% of the total lipid content in the composition (e.g., lipid nanoparticle), as measured by weight. In some embodiments, the compositions of the present invention comprise one or more cationic lipids, measured in mol%, that constitute at least about 5%, 10%, 20%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, or 70% of the total lipid content in the composition (e.g., lipid nanoparticle). In some embodiments, the compositions of the present invention comprise one or more cationic lipids, which constitute about 30-70% (e.g., about 30-65%, about 30-60%, about 30-55%, about 30-50%, about 30-45%, about 30-40%, about 35-50%, about 35-45%, or about 35-40%) of the total lipid content in the composition (e.g., lipid nanoparticle), by weight. In some embodiments, the compositions of the present invention include one or more cationic lipids, measured in mol%, that constitute about 30-70% (e.g., about 30-65%, about 30-60%, about 30-55%, about 30-50%, about 30-45%, about 30-40%, about 35-50%, about 35-45%, or about 35-40%) of the total lipid content in the composition (e.g., lipid nanoparticle).
Non-cationic/helper lipids
In some embodiments, provided liposomes contain one or more non-cationic ("helper") lipids. As used herein, the phrase "non-cationic lipid" refers to any neutral, zwitterionic, or anionic lipid. As used herein, the phrase "anionic lipid" refers to any of a number of lipid species that carry a net negative charge at a selected pH (e.g., physiological pH). Non-cationic lipids include, but are not limited to, distearoyl phosphatidylcholine (DSPC), dioleoyl phosphatidylcholine (DOPC), dipalmitoyl phosphatidylcholine (DPPC), dioleoyl phosphatidylglycerol (DOPG), dipalmitoyl phosphatidylglycerol (DPPG), dioleoyl phosphatidylethanolamine (DOPE), palmitoyl Oleoyl Phosphatidylcholine (POPC), palmitoyl Oleoyl Phosphatidylethanolamine (POPE), dioleoyl phosphatidylethanolamine 4- (N-maleimidomethyl) -cyclohexane-l-formate (DOPE-mal), dipalmitoyl phosphatidylethanolamine (DPPE), dimyristoyl phosphatidylethanolamine (DMPE), distearoyl phosphatidylethanolamine (DSPE), phosphatidylserine, sphingolipids, cerebrosides, gangliosides, 16-O-monomethyl PE, 16-O-dimethyl PE, 18-1-trans PE, l-stearoyl-2-oleoyl-phosphatidylethanolamine (SOPE), or mixtures thereof.
In some embodiments, such non-cationic lipids may be used alone, but preferably in combination with other lipids (e.g., cationic lipids). In some embodiments, the non-cationic lipid may comprise the following molar ratios of total lipids present in the liposome: about 5% to about 90%, or about 10% to about 70%. In some embodiments, the non-cationic lipid is a neutral lipid, i.e., a lipid that does not carry a net charge under the conditions of formulation and/or administration of the composition. In some embodiments, the percentage of non-cationic lipids in the liposomes can be greater than about 5%, greater than 10%, greater than 20%, greater than 30%, or greater than 40%.
Cholesterol-based lipids
In some embodiments, provided liposomes comprise one or more cholesterol-based lipids. Suitable cholesterol-based cationic lipids include, for example, DC-Choi (N, N-dimethyl-N-ethylcarboxamido cholesterol), l, 4-bis (3-N-oleylamino-propyl) piperazine (Gao et al biochem. Biophys. Res. Comm.179,280 (1991); wolf et al BioTechniques 23,139 (1997); U.S. Pat. No. 5,744,335), or ICE. In some embodiments, the cholesterol-based lipid may comprise the following molar ratios of total lipids present in the liposome: about 2% to about 30%, or about 5% to about 20%. In some embodiments, the percentage of cholesterol-based lipids in the lipid nanoparticle is greater than 5%, greater than 10%, greater than 20%, greater than 30%, or greater than 40%.
PEG modified lipids
The present invention also contemplates the use of polyethylene glycol (PEG) -modified phospholipids and derivatized lipids, such as derivatized ceramides (PEG-CER), including N-octanoyl-sphingosine-1- [ succinyl (methoxypolyethylene glycol) -2000](C8 PEG-2000 ceramide), alone or preferably in combination with other lipid formulations that constitute a transfer vehicle (e.g., lipid nanoparticles). Contemplated PEG modified lipids include, but are not limited to, covalent attachment to a polypeptide having one or more C' s 6 -C 20 A polyethylene glycol chain of length maximum S kDa of the lipid of the alkyl chain of length. The addition of such components can prevent duplicationThe complexes aggregate and may also provide a means for increasing the circulation life of the lipid-nucleic acid composition and increasing delivery of the lipid-nucleic acid composition to the target tissue (Klibanov et al (1990) FEBS Letters,268 (1): 235-237), or they may be selected for rapid exchange from the formulation in vivo (see us patent No. 5,885,613). Particularly useful exchangeable lipids are PEG-ceramides with a shorter acyl chain (e.g., C14 or C18). The PEG-modified phospholipids and derivatized lipids of the invention may comprise the following molar ratios of total lipids present in the liposomal transfer vehicle: about 0% to about 20%, about 0.5% to about 20%, about 1% to about 15%, about 4% to about 10% or about 2%.
According to various embodiments, the selection of cationic lipids, non-cationic lipids, and/or PEG-modified lipids comprising the lipid nanoparticle, as well as the relative molar ratio of such lipids with respect to each other, is based on the characteristics of one or more selected lipids, the nature of the intended target cell, the characteristics of the MCNA to be delivered. Additional considerations include, for example, saturation of alkyl chains, as well as the size, charge, pH, pKa, fusogenic (fusogenicity), and toxicity of one or more selected lipids. Whereby the molar ratio can be adjusted accordingly.
Polymer
In some embodiments, suitable delivery vehicles are formulated using polymers as carriers, either alone or in combination with other carriers, including the various lipids described herein. Thus, in some embodiments, liposome delivery vehicles as used herein also encompass nanoparticles comprising a polymer. Suitable polymers may include, for example, polyacrylates, polyalkylcyanoacrylates, polylactides, polylactide-polyglycolide copolymers, polycaprolactone, dextran, albumin, gelatin, alginate, collagen, chitosan, cyclodextrin, protamine, pegylated protamine, PLL, pegylated PLL, and Polyethylenimine (PEI). In the presence of PEI, it may be branched PEI with a molecular weight in the range of 10 to 40kDa, such as 25kDa branched PEI (Sigma # 408727).
Liposomes suitable for use in the present invention
Suitable liposomes for use in the present invention can include one or more of any of the cationic lipids, non-cationic lipids, cholesterol lipids, PEG-modified lipids and/or polymers described herein in varying ratios. As non-limiting examples, suitable liposome formulations can include a combination selected from the group consisting of: cKK-E12, DOPE, cholesterol and DMG-PEG2K; c12-200, DOPE, cholesterol and DMG-PEG2K; HGT4003, DOPE, cholesterol, and DMG-PEG2K; ICE, DOPE, cholesterol and DMG-PEG2K; or ICE, DOPE and DMG-PEG2K.
In various embodiments, the cationic lipid (e.g., cKK-E12, C12-200, ICE, and/or HGT 4003) comprises about 30-60% (e.g., about 30-55%, about 30-50%, about 30-45%, about 30-40%, about 35-50%, about 35-45%, or about 35-40%) of the liposome. In some embodiments, the percentage of cationic lipids (e.g., cKK-E12, C12-200, ICE, and/or HGT 4003) is at or above about 30%, about 35%, about 40%, about 45%, about 50%, about 55%, or about 60% of the liposome.
In some embodiments, the ratio of the one or more cationic lipids to the one or more non-cationic lipids to the one or more cholesterol-based lipids to the one or more PEG-modified lipids may be between about 30-60:25-35:20-30:1-15, respectively. In some embodiments, the ratio of the one or more cationic lipids to the one or more non-cationic lipids to the one or more cholesterol-based lipids to the one or more PEG-modified lipids is about 40:30:20:10, respectively. In some embodiments, the ratio of the one or more cationic lipids to the one or more non-cationic lipids to the one or more cholesterol-based lipids to the one or more PEG-modified lipids is about 40:30:25:5, respectively. In some embodiments, the ratio of the one or more cationic lipids to the one or more non-cationic lipids to the one or more cholesterol-based lipids to the one or more PEG-modified lipids is about 40:32:25:3, respectively. In some embodiments, the ratio of the one or more cationic lipids to the one or more non-cationic lipids to the one or more cholesterol-based lipids to the one or more PEG-modified lipids is about 50:25:20:5.
In particular embodiments, the liposomes for use in the present invention include a lipid component consisting of a cationic lipid, a non-cationic lipid (e.g., DOPE or DEPE), a PEG-modified lipid (e.g., DMG-PEG 2K), and optionally cholesterol. Cationic lipids particularly suitable for inclusion in such liposomes include GL-TES-SA-DME-E18-2, TL1-01D-DMA, SY-3-E14-DMAPR, TL1-10D-DMA, HGT4002 (also referred to herein as Guan-SS-Chol), GL-TES-SA-DMP-E18-2, HEP-E4-E10, HEP-E3-E10 and TL1-04D-DMA. These cationic lipids have been found to be particularly suitable for use in liposomes for administration by pulmonary delivery via nebulization. Among these cationic lipids, HEP-E4-E10, HEP-E3-E10, GL-TES-SA-DME-E18-2, GL-TES-SA-DMP-E18-2, TL1-01D-DMA and TL1-04D-DMA perform particularly well.
Exemplary liposomes include one of GL-TES-SA-DME-E18-2, TL1-01D-DMA, SY-3-E14-DMAPR, TL1-10D-DMA, GL-TES-SA-DMP-E18-2, HEP-E4-E10, HEP-E3-E10, and TL1-04D-DMA as a cationic lipid component, DOPE as a non-cationic lipid component, cholesterol as a helper lipid component, and DMG-PEG2K as a PEG modified lipid component. In some embodiments, the molar ratio of cationic lipid to non-cationic lipid to cholesterol to PEG-modified lipid may be between about 30-60:25-35:20-30:1-15, respectively. In some embodiments, the molar ratio of cationic lipid to non-cationic lipid to cholesterol to PEG-modified lipid is about 40:30:20:10, respectively. In some embodiments, the molar ratio of cationic lipid to non-cationic lipid to cholesterol to PEG-modified lipid is about 40:30:25:5, respectively. In some embodiments, the molar ratio of cationic lipid to non-cationic lipid to cholesterol to PEG-modified lipid is about 40:32:25:3, respectively. In some embodiments, the molar ratio of cationic lipid to non-cationic lipid to cholesterol to PEG-modified lipid is about 50:25:20:5, respectively.
In some embodiments, the lipid component of liposomes particularly suitable for pulmonary delivery consists of HGT4002 (also referred to herein as Guan-SS-Chol), DOPE, and DMG-PEG 2K. In some embodiments, the molar ratio of cationic lipid to non-cationic lipid to PEG-modified lipid is about 60:35:5.
Ratios of different lipid components
In embodiments where the lipid nanoparticle comprises three and no more than three different components of lipid, the ratio of total lipid content (i.e., the ratio of lipid component (1): lipid component (2): lipid component (3)) may be expressed as x: y: z, where
(y+z)=100-x。
In some embodiments, "x", "y" and "z" each represent mole percentages of three different components of the lipid, and the ratio is a molar ratio.
In some embodiments, "x", "y" and "z" each represent weight percentages of three different components of the lipid, and the ratio is a weight ratio.
In some embodiments, lipid component (1) represented by variable "x" is a sterol-based cationic lipid.
In some embodiments, lipid component (2) represented by variable "y" is a helper lipid.
In some embodiments, lipid component (3) represented by the variable "z" is a PEG lipid.
In some embodiments, the variable "x" representing the mole percent of lipid component (1) (e.g., a sterol-based cationic lipid) is at least about 10%, about 20%, about 30%, about 40%, about 50%, about 55%, about 60%, about 65%, about 70%, about 75%, about 80%, about 85%, about 90%, or about 95%.
In some embodiments, the variable "x" representing the mole percent of lipid component (1) (e.g., a sterol-based cationic lipid) is no more than about 95%, about 90%, about 85%, about 80%, about 75%, about 70%, about 65%, about 60%, about 55%, about 50%, about 40%, about 30%, about 20%, or about 10%. In embodiments, the variable "x" is no more than about 65%, about 60%, about 55%, about 50%, about 40%.
In some embodiments, the variable "x" representing the mole percent of lipid component (1) (e.g., a sterol-based cationic lipid) is: at least about 50% but less than about 95%; at least about 50% but less than about 90%; at least about 50% but less than about 85%; at least about 50% but less than about 80%; at least about 50% but less than about 75%; at least about 50% but less than about 70%; at least about 50% but less than about 65%; or at least about 50% but less than about 60%. In embodiments, the variable "x" is at least about 50% but less than about 70%; at least about 50% but less than about 65%; or at least about 50% but less than about 60%.
In some embodiments, the variable "x" representing the weight percent of lipid component (1) (e.g., a sterol-based cationic lipid) is at least about 10%, about 20%, about 30%, about 40%, about 50%, about 55%, about 60%, about 65%, about 70%, about 75%, about 80%, about 85%, about 90%, or about 95%.
In some embodiments, the variable "x" representing the weight percent of lipid component (1) (e.g., a sterol-based cationic lipid) is no more than about 95%, about 90%, about 85%, about 80%, about 75%, about 70%, about 65%, about 60%, about 55%, about 50%, about 40%, about 30%, about 20%, or about 10%. In embodiments, the variable "x" is no more than about 65%, about 60%, about 55%, about 50%, about 40%.
In some embodiments, the variable "x" representing the weight percent of lipid component (1) (e.g., a sterol-based cationic lipid) is: at least about 50% but less than about 95%; at least about 50% but less than about 90%; at least about 50% but less than about 85%; at least about 50% but less than about 80%; at least about 50% but less than about 75%; at least about 50% but less than about 70%; at least about 50% but less than about 65%; or at least about 50% but less than about 60%. In embodiments, the variable "x" is at least about 50% but less than about 70%; at least about 50% but less than about 65%; or at least about 50% but less than about 60%.
In some embodiments, the variable "z" representing the mole percent of lipid component (3) (e.g., a PEG lipid) is no more than about 1%, 2%, 3%, 4%, 5%, 6%, 7%, 8%, 9%, 10%, 15%, 20%, or 25%. In embodiments, the variable "z" representing the mole percentage of lipid component (3) (e.g., PEG lipid) is about 1%, 2%, 3%, 4%, 5%, 6%, 7%, 8%, 9%, 10%. In embodiments, the variable "z" representing the mole percent of lipid component (3) (e.g., PEG lipid) is from about 1% to about 10%, from about 2% to about 10%, from about 3% to about 10%, from about 4% to about 10%, from about 1% to about 7.5%, from about 2.5% to about 10%, from about 2.5% to about 7.5%, from about 2.5% to about 5%, from about 5% to about 7.5%, or from about 5% to about 10%.
In some embodiments, the variable "z" representing the weight percentage of lipid component (3) (e.g., PEG lipid) is no more than about 1%, 2%, 3%, 4%, 5%, 6%, 7%, 8%, 9%, 10%, 15%, 20%, or 25%. In embodiments, the variable "z" representing the weight percent of lipid component (3) (e.g., PEG lipid) is about 1%, 2%, 3%, 4%, 5%, 6%, 7%, 8%, 9%, 10%. In embodiments, the variable "z" representing the weight percent of lipid component (3) (e.g., PEG lipid) is from about 1% to about 10%, from about 2% to about 10%, from about 3% to about 10%, from about 4% to about 10%, from about 1% to about 7.5%, from about 2.5% to about 10%, from about 2.5% to about 7.5%, from about 2.5% to about 5%, from about 5% to about 7.5%, or from about 5% to about 10%.
For compositions having three and only three different lipid components, the variables "x", "y" and "z" may be in any combination, so long as the sum of the three variables adds up to 100% of the total lipid content.
Formation of liposomes encapsulating mRNA
Liposome transfer vehicles for use in the compositions of the present invention can be prepared by a variety of techniques presently known in the art. Liposomes for use in the provided compositions can be prepared by a variety of techniques presently known in the art. For example, multilamellar vesicles (MLVs) can be prepared according to conventional techniques, such as by depositing the selected lipid on the inner wall of a suitable container or vessel (by dissolving the lipid in a suitable solvent, then evaporating the solvent to leave a thin film inside the vessel) or by spray drying. An aqueous phase may then be added to the vessel with a swirling motion, which results in the formation of MLVs. Unilamellar vesicles (ULV) can then be formed by homogenization, sonication, or extrusion of the multilamellar vesicles. Alternatively, unilamellar vesicles may be formed by detergent removal techniques.
In certain embodiments, provided compositions comprise liposomes, wherein mRNA is both associated on the surface of the liposome and encapsulated within the same liposome. For example, during the preparation of the compositions of the invention, cationic liposomes can associate with mRNA via electrostatic interactions. For example, during the preparation of the compositions of the invention, cationic liposomes can associate with mRNA via electrostatic interactions.
In some embodiments, the compositions and methods of the invention comprise mRNA encapsulated in liposomes. In some embodiments, one or more mRNA species may be encapsulated in the same liposome. In some embodiments, one or more mRNA species may be encapsulated in different liposomes. In some embodiments, mRNA is encapsulated in one or more liposomes that differ in their lipid composition, molar ratio of lipid components, size, charge (zeta potential), targeting ligand, and/or combinations thereof. In some embodiments, one or more liposomes can have different compositions of sterol-based cationic lipids, neutral lipids, PEG-modified lipids, and/or combinations thereof. In some embodiments, one or more of the liposomes can have different molar ratios of cholesterol-based cationic lipids, neutral lipids, and PEG-modified lipids used to produce the liposomes.
The process of incorporating the desired mRNA into liposomes is often referred to as "loading". Exemplary methods are described in Lasic et al, FEBS lett, 312:255-258,1992, which is incorporated herein by reference. The nucleic acid incorporated into the liposome may be located wholly or partially within the interior space of the liposome, within the bilayer membrane of the liposome, or attached to the outer surface of the liposome membrane. Incorporation of a nucleic acid into a liposome is referred to herein as "encapsulation," wherein the nucleic acid is contained entirely within the interior space of the liposome. The purpose of incorporating mRNA into a transfer vehicle (e.g., a liposome) is often to protect the nucleic acid from the environment, which may contain enzymes or chemicals that degrade the nucleic acid and/or systems or receptors that cause rapid excretion of the nucleic acid. Thus, in some embodiments, a suitable delivery vehicle is capable of enhancing the stability of the mRNA contained therein and/or facilitating delivery of the mRNA to a target cell or tissue.
Suitable liposomes according to the invention can be made in a variety of sizes. In some embodiments, provided liposomes can be made smaller than previously known mRNA-encapsulating liposomes. In some embodiments, reduced liposome size is associated with more efficient delivery of mRNA. The selection of the appropriate liposome size may take into account the site of the target cell or tissue and, to some extent, the application for which the liposome is being prepared.
In some embodiments, liposomes of suitable size are selected to promote systemic distribution of antibodies encoded by the mRNA. In some embodiments, it may be desirable to limit transfection of mRNA to certain cells or tissues. For example, to target hepatocytes, liposomes can be sized such that they are smaller than the perforations of the endothelial intimal liver antrum in the liver; in this case, the liposomes can readily penetrate such endothelial cells to reach the target hepatocytes.
Alternatively or additionally, the liposomes may be sized such that the size of the liposomes has a sufficiently large diameter to limit or clearly avoid distribution into certain cells or tissues.
Various alternative methods known in the art may be used to determine the size of the liposome population. One such sizing method is described in U.S. patent No. 4,737,323 (incorporated herein by reference). The liposome suspension is sonicated by bath or probe sonication to gradually reduce the size to a small ULV of less than about 0.05 microns in diameter. Homogenization is another method that relies on shear energy to break up large liposomes into smaller ones. In a typical homogenization procedure, the MLV is recirculated through a standard emulsion homogenizer until a selected liposome size (typically between about 0.1 and 0.5 microns) is observed. The size of the liposomes can be determined by quasi-electro-optical scattering (QELS) as described in bloom field, ann.rev. Biophys. Bioeng.,10:421-150 (1981), incorporated herein by reference. The average liposome diameter can be reduced by sonicating the formed liposomes. Intermittent sonication cycles can be alternated with QELS assessment to direct efficient liposome synthesis.
Therapeutic use of compositions
In one aspect, the invention provides, inter alia, a method of inducing expression of a protein in vivo by administering a codon optimized nucleic acid encoding the protein, or by administering the protein. In some embodiments, the composition comprises a nucleic acid encapsulated with or complexed with a delivery vehicle. In some embodiments, the delivery vehicle is selected from the group consisting of liposomes, lipid nanoparticles, solid-lipid nanoparticles, polymers, viruses, sol-gels, and nanogels. In some embodiments, the codon-optimized nucleic acid encoding the protein is packaged in a viral particle.
For oral administration, the pharmaceutical formulation is in the form of, for example, a tablet or capsule prepared by known methods with pharmaceutically acceptable excipients such as binders (e.g., pregelatinized cornstarch, polyvinylpyrrolidone, or methylcellulose); fillers (e.g., lactose, microcrystalline cellulose, or calcium hydrogen phosphate); additives (e.g., magnesium stearate, talc, silica); disintegrants (e.g., potato starch); and/or a lubricant (e.g., sodium lauryl sulfate). The tablets may be coated using known methods. Liquid formulations for oral administration have the form of solutions, syrups or suspensions, for example, or may be in the form of a dry product which may be dissolved in water or another liquid prior to use. The formulations are prepared by known methods with pharmaceutically acceptable additives such as suspending agents (e.g., sorbitol, cellulose derivatives, edible hydrogenated fats); emulsifying agents (e.g., lecithin or acacia); a non-aqueous liquid (e.g., almond oil, oily esters, ethyl alcohol, or fractionated vegetable oil); and/or a preservative (e.g., methyl or propyl hydroxybenzoate, sorbic acid, or ascorbic acid). Where appropriate, the formulations may also contain buffer salts, colorants, flavors and/or sweeteners.
Formulations for oral administration are formulated in known manner to provide controlled release of the active compound.
Examples
Although certain compounds, compositions, and methods of the present invention have been specifically described according to certain embodiments, the following examples are illustrative of the compounds of the present invention and are not intended to be limiting.
Example 1 production of optimized nucleotide sequences
This example illustrates a method of producing an optimized nucleotide sequence according to the invention that is optimized to produce full length transcripts and result in high levels of expression of the encoded protein during in vitro synthesis.
The method combines the codon optimization method of fig. 1A with a series of filtering steps shown in fig. 1B to generate a list of optimized nucleotide sequences. Specifically, as shown in FIG. 1A, the method receives an amino acid sequence of interest and a first codon usage table reflecting the frequency of each codon in a given organism (i.e., human codon usage bias in the case of the present example). If a codon is associated with a codon usage frequency below a threshold frequency (10%), then the method removes the codon from the first codon usage table. The codon usage frequency of the codons not removed in the first step is normalized to generate a normalized codon usage table.
Normalizing the codon usage table involves reassigning the frequency of use value for each removed codon; the frequency of use of a certain removed codon is added to the frequency of use of other codons sharing amino acids with the removed codon. In this example, the reassignment is proportional to the magnitude of the frequency of use of codons not removed from the table. The method uses a standardized codon usage table to generate a list of optimized nucleotide sequences. Each optimized nucleotide sequence encodes an amino acid sequence of interest.
As shown in fig. 1B, the list of optimized nucleotide sequences was further processed by applying a motif screening filter, a guanine-cytosine (GC) content analysis filter, and a Codon Adaptation Index (CAI) analysis filter in the following order to generate an updated list of optimized nucleotide sequences.
As shown in the examples below, this method results in an optimized nucleotide sequence encoding the amino acid sequence of interest. The nucleotide sequences produce full length transcripts during in vitro synthesis and result in high levels of expression of the encoded proteins.
Example 2 codon optimization of CFTR mRNA sequence to increase CAI resulted in higher protein expression
This example demonstrates that a codon optimized protein coding sequence with a Codon Adaptation Index (CAI) of about 0.8 or higher is superior to a codon optimized protein coding sequence with a CAI of less than 0.8.
Human cystic fibrosis transmembrane conductance regulator (hCFTR) was codon optimized as described in example 1. hCFTR is encoded by a sequence of 4440 nucleotides.
Mutations in the gene encoding the hCFTR protein lead to Cystic Fibrosis (CF), which is the most common genetic disease in the caucasian population. It is characterized by abnormal transport of chloride and sodium ions in epithelial cells, resulting in thick, viscous secretions that most severely affect the lung, as well as the pancreas, liver and intestine. mRNA encoding the codon optimized hCFTR coding sequence is being developed as a novel therapeutic for the treatment of CF.
The natural hCFTR amino acid sequence was codon optimized according to the method of the invention as shown in example 1. Three sequences designated hCFTR#1 (SEQ ID NO: 16), hCFTR#2 (SEQ ID NO: 2) and hCFTR#3 (SEQ ID NO: 3) were selected for further analysis. For reference, nucleotide sequences having hCFTR coding sequences codon optimized with different algorithms are provided (SEQ ID NO: 15). The reference nucleotide sequence (SEQ ID NO: 15) has been previously experimentally verified both in vitro and in vivo. It has been found that the reference nucleotide sequence provides superior protein yields relative to other earlier tested codon-optimised nucleotide sequences encoding hCFTR proteins. The CAI and GC content% of the codon optimized hcftr#2 and hcftr#3 sequences are significantly increased when compared to the reference nucleotide sequence. Furthermore, their codon frequency assignment (CFD)% was 0% compared to 6% of the reference nucleotide sequence, indicating that rare codon clusters detrimental to translation efficiency were successfully removed. Additional filtration to remove the negative regulatory motif resulted in a significant reduction in the number of negative CIS-regulatory (CIS) elements in hcftr#2 and hcftr#3 (see table 2).
TABLE 2
To test the protein yield from each codon optimized sequence, 4 nucleic acid vectors were prepared, each comprising an expression cassette containing one of the 4 nucleotide sequences encoding hCFTR protein flanked by identical 3 'and 5' untranslated sequences (3 'and 5' utrs) and preceded by an RNA polymerase promoter. These nucleic acid vectors were used as templates for in vitro transcription reactions to provide 4 batches of mRNA containing 4 codon-optimized nucleotide sequences (reference and hCFTR #1 to # 3). Capping and tailing are performed separately.
Each capped and tailed mRNA was transfected individually into a cell line (HEK 293). Cell lysates were collected 24 and 48 hours after transfection. Protein samples were extracted and processed for SDS-PAGE. The expression level of the encoded hCFTR protein was assessed by western blotting. Proteins were visualized and quantified using the LI-COR system. Protein yield is expressed as Relative Fluorescence Units (RFU). The results of this experiment are summarized in fig. 2A. The codon optimized nucleotide sequences hcftr#2 and hcftr#3 (both having a CAI of 0.89) produced significantly higher yields of encoded hCFTR protein than the reference nucleotide sequence and hcftr#1 (both having a CAI of 0.7). This effect was more pronounced at the 24 hour time point (see fig. 2B), probably due to relatively rapid degradation of mRNA in HEK293 cells after transfection.
The data in this example demonstrate that codon optimization of the treatment-related nucleotide sequence (hCFTR) to achieve CAI of about 0.8 or higher results in higher protein yields, especially when combined with optimization of its CFD and GC content as well as with removal of any negative CIS elements from the nucleic acid sequence. The data in this example also demonstrate that codon optimization of mRNA according to the method of the invention results in very high protein yields in human cells compared to nucleotide sequences that are codon optimized with different algorithms.
Example 3 codon optimization of cftr nucleotide sequence leads to increased functional activity in cells
This example illustrates that codon optimization of the nucleotide sequence encoding a protein of interest according to the method of the invention does not affect the functional activity of the protein in human cells. To illustrate this, mRNA encoding CFTR was used in this study.
Administration of hCFTR mRNA is intended to result in uptake of hCFTR mRNA by airway epithelial cells of CF patients, followed by internalization into the cytoplasm of the target cells. After cellular uptake is achieved, hCFTR mRNA is translated into normal hCFTR protein, which is then processed by the cell's endogenous secretory pathway, resulting in localization of the hCFTR protein in the apical cell membrane. In this way, hCFTR mRNA administration produces functional hCFTR protein in airway epithelial cells, thereby correcting the lack of functional CFTR in the lungs of CF patients. Codon optimization of the hCFTR mRNA nucleotide sequence can increase expression of functional hCFTR protein, which is thought to result in higher amounts of functional hCFTR protein in target airway epithelial cells of CF patients.
Codon optimisation has been reported to be possible at the expense of reduced functional activity and associated loss of efficacy of the encoded protein, as the process may remove information encoded in the nucleotide sequence important to control translation of the protein and ensure correct folding of the nascent polypeptide chain (Mauro and Chappell, trends Mol Med.2014;20 (11): 604-13). To test the functional activity of hCFTR protein expressed from a codon optimized sequence generated using the codon optimization method as shown in example 1, hCFTR mRNA produced in example 2 was tested in an ewing cell assay. The assay uses an epithelial voltage clamp to monitor epithelium transfected with hCFTR mRNAThe chloride transport function of the cells to assess the functional activity of the protein expressed by the mRNA. Specifically, the functional activity of hCFTR protein expressed by mRNA having the control hCFTR coding sequence (SEQ ID NO: 15) or the coding sequence of hCFTR#1 (SEQ ID NO: 16), hCFTR#2 (SEQ ID NO: 2) or hCFTR#3 (SEQ ID NO: 3) was measured in Fischer Rat Thyroid (FRT) epithelial cells. FRT epithelial cells are commonly used as models to study human airway epithelial cell function. FRT epithelial cells in Snapwell TM Monolayers on filter inserts were grown and transfected with 4 hCFTR mRNA. 4 hCFTR mRNA were generated as described in example 2. Control mRNA has been previously validated in this assay and used as a reference standard.
When CFTR agonists (forskolin and VX-770) are administered) When correctly translated and located hCFTR protein produced by hCFTR mRNA increases short circuit current (I) in ews epithelial voltage clamp device SC ) And outputting. The administration of CFTR antagonist CFTRinh-172 brings hCFTR into a blocking state. I in the assay SC Current polarity switching records the apical to basolateral sodium current and basolateral to apical chloride current as negative values, and thus if transfection with test hCFTR mRNA generates high negative values, it can be concluded that: the encoded hCFTR protein is functional (fig. 3A). Furthermore, by transfecting an equal amount of mRNA, it can be assessed whether the mRNA produces a higher hCFTR protein yield, as protein yield and activity are related. Transfection of FRT epithelial cells with mRNA having hcftr#1 coding sequence resulted in activity comparable to that obtained by transfection with mRNA having control hCFTR coding sequence (fig. 3B). The nucleotide sequence encoding hCFTR produced by the method of the invention encodes mRNA resulting in significantly increased activity. Consistent with the higher protein yields observed in example 2, the activity resulting from hCFTR protein produced from mRNA encoding hcftr#2 was more than 2 times that of the control mRNA, and the activity resulting from hCFTR protein produced from mRNA encoding hcftr#3 was 3 times that of the control mRNA. This demonstrates the higher results observed in example 2 for hcftr#2 and hcftr#3 Protein yield is directly related to higher functional activity, indicating that codon optimization according to the methods of the invention does not negatively affect the functional activity of the encoded protein.
In summary, codon optimisation according to the method of the invention results in higher expression of the encoded protein in human cells and the expressed protein provides complete functional activity in a model system, which is a highly relevant model for human therapy.
Example 4 evaluation of codon optimized wild-type CFTR constructs against activated CFTR constructs
In this example, to examine the efficacy of the codon optimized mRNA of the present invention, the expression and activity of the codon optimized wild-type CFTR construct was compared to the non-codon optimized wild-type CFTR construct and the activated CFTR mutant construct.
The inventors developed mRNA encoding an engineered or mutant CFTR protein that showed increased activity and/or stability. In particular, the engineered CFTR protein may contain one or more modifications that mimic phosphorylated residues in the R domain (R domain phosphomimetic mutations). These mutations result in activation and opening of CFTR chloride channels. Another strategy for engineering activation mutants of CFTR is to mutate residues involved in ATP gating (e.g., E1371Q). CFTR proteins undergo ubiquitination at lysine residues. Amino acid mutations at lysine residues that result in substitution of lysine for another amino acid residue result in enhanced stability and protein expression of CFTR proteins (e.g., K14R). Example 3 shows that the codon optimized wild type CFTR construct of the invention has higher activity than the reference CFTR mRNA construct. To further evaluate the expression and activity of the codon optimized CFTR constructs of the invention, comparisons were made in this experiment using the activated CFTR constructs listed in table 3 and non-codon optimized CFTR constructs.
TABLE 3 wild-type and mutant constructs of various CFTR
First, studies were also performed to evaluate in vitro translation of codon optimized wild-type CFTR and CFTR mutants. The data from these studies indicate that codon optimized WT sequences, i.e., 13E and E1371Q K14R variants, showed increased expression in the C band of HEK293 lysates (fig. 4A and 4B). "C-band" refers to the mature complex glycosylated form of CFTR. In addition, codon-optimized WT CFTR showed higher efficacy than non-codon-optimized WT CFTR (fig. 4C).
Next, the codon optimized CFTR sequences (both WT and mutant) were evaluated for non-codon optimized WT CFTR in an ews chamber assay. The data from these assays indicate that codon optimized WT CFTR showed increased activity in the ews chamber assay compared to non-codon optimized CFTR (fig. 5A). Notably, CO WT showed activity comparable to the codon-optimized activated CFTR mutant construct (fig. 5B). Surprisingly, it was found that the activity of wild-type CFTR protein can be significantly enhanced without introducing amino acid mutations.
The duration of activity of the various CFTR constructs listed in test table 3 was measured in Cheng Yousi chambers when used. The activity of the CFTR protein was measured at 22 and 44 hours. Short-circuit current was plotted for each CFTR protein at 22 and 44 hours (I sc ) (ion movement measured from active transport in ews chamber). Figure 5C shows that codon optimized WT CFTR has high residual activity at 44 hours, significantly higher than the non-codon optimized counterpart.
In vitro tolerance of mutant CFTR mRNA was also assessed in HEK293 cells using a commercially available cytotoxicity assay. The data from these studies indicate that none of the CFTR variants, including codon optimized WT CFTR, showed increased cytotoxicity in HEK293 cells when compared to vehicle controls (fig. 6).
Taken together, these data demonstrate that mRNAs that are codon optimized according to the present invention exhibit significantly higher activity, which is particularly useful for treating pulmonary diseases by means of mRNA therapeutics.
EXAMPLE 5 Synthesis of lipids for pulmonary delivery
The synthesis of the cationic lipids of the present invention is described in this example.
1.GL-TES-SA-DME-E18-2
Synthetic scheme
Synthetic scheme
Synthesis of (9Z, 12Z) -octadeca-9, 12-dienoyl chloride (2)
To a solution of linolenic acid (1.0 g,3.6 mmol) in 10mL of dichloromethane was added N, N-dimethylformamide (0.1 mL) and oxalyl chloride (1.2 mL,14.3 mmol) at 0deg.C. The reaction mixture was warmed to room temperature and stirred for 3h. The solvent was removed under reduced pressure and the crude product was used in the next step without further purification.
Synthesis of 2- ((1, 3-bis (((9Z, 12Z) -octadeca-9, 12-dienoyl) oxy) -2- ((((9Z, 12Z) -octadeca-9, 12-dienoyl) oxy) methyl) propan-2-yl) amino) ethane-1-sulfonic acid (3)
To a solution of (9Z, 12Z) -octadeca-9, 12-dienoyl chloride 2 (1.1 g,3.6 mmol) in anhydrous N, N-dimethylacetamide (5.0 mL) and N-methylmorpholine (3.0 mL) was added 2- ((1, 3-dihydroxy-2- (hydroxymethyl) propan-2-yl) amino) ethane-1-sulfonic acid (1, TES) (200 mg,0.87 mmol). The reaction mixture was heated to 55 ℃ for 3h. MS analysis showed the formation of the desired product. Will be reversedThe mixture was cooled to room temperature, diluted with water (100 mL) and extracted with dichloromethane (2 x 100 mL). The combined organic layers were washed with saturated brine (100 mL) and dried over anhydrous sodium sulfate. The solvent was removed under vacuum and the residue was purified by column chromatography (40 g SiO 2 : a gradient of 0 to 10% methanol in dichloromethane) to obtain 2- ((1, 3-bis (((9 z,12 z) -octadeca-9, 12-dienoyl) oxy) -2- ((((9 z,12 z) -octadeca-9, 12-dienoyl) oxy) methyl) propan-2-yl) amino) ethane-1-sulfonic acid (292 mg,47% yield) as a colorless solid.
Synthesis of 2- ((2- (chlorosulfonyl) ethyl) amino) -2- ((((9Z, 12Z) -octadeca-9, 12-dienoyl) oxy) methyl) propane-1, 3-diyl (9Z, 9'Z, 12' Z) -bis (octadeca-9, 12-dienoate) (3-Cl)
To a solution of 2- ((1, 3-bis (((9Z, 12Z) -octadeca-9, 12-dienoyl) oxy) -2- ((((9Z, 12Z) -octadeca-9, 12-dienoyl) oxy) methyl) propan-2-yl) amino) ethane-1-sulfonic acid 3 (210 mg,0.82 mmol) in anhydrous dichloromethane (5.0 mL) was added N, N-dimethylformamide (0.05 mL) and oxalyl chloride (0.08 mL,2.1 mmol) at 0deg.C. The reaction mixture was warmed to room temperature and stirred for 3h. The solvent was removed under reduced pressure to dryness to give 2- ((2- (chlorosulfonyl) ethyl) amino) -2- ((((9 z,12 z) -octadeca-9, 12-dienyloxy) methyl) propane-1, 3-diyl (9 z,9'z, 12' z) -bis (octadeca-9, 12-dienoate) which was used in the next step without further purification.
Synthesis of 2- ((2- (N- (2- (dimethylamino) ethyl) sulfamoyl) ethyl) amino) -2- ((((9Z, 12Z) -octadeca-9, 12-dienoyl) oxy) methyl) propane-1, 3-diyl (9Z, 9'Z, 12' Z) -bis (octadeca-9, 12-dienoate) (Compound I)
To 2- ((2- (chlorosulfonyl) ethyl) amino) -2- ((((9Z, 12Z) -octadeca-9, 12-diene) at 0 DEG CAcyl) oxy) methyl) propane-1, 3-diyl (9Z, 9'Z, 12' Z) -bis (octadeca-9, 12-dienoate) 3-Cl (210 mg,0.82 mmol) in dry dichloromethane (5.0 mL) was added N 1 ,N 1 Dimethylethane-1, 2-diamine (182 mg,2.1 mmol). The reaction mixture was warmed to room temperature and stirred for 3h. The reaction was quenched by the addition of water and the mixture was extracted with dichloromethane (2 x 100 ml). The combined organic layers were washed with saturated brine (100 mL) and dried over anhydrous sodium sulfate. The solvent was removed and the crude was purified by column chromatography (40 g SiO2: 0 to 15% methanol gradient in dichloromethane) to give 2- ((2- (N- (2- (dimethylamino) ethyl) sulfamoyl) ethyl) amino) -2- ((((9 z,12 z) -octadeca-9, 12-dienyloxy) methyl) propane-1, 3-diyl (9 z,9'z, 12' z) -bis (octadeca-9, 12-dienoate) (139 mg,62% yield) as a yellow oil.
1H NMR (300 MHz, chloroform-d) delta 5.26-5.44 (m, 12H), 4.09 (s, 6H), 3.06-3.18 (m, 6H), 2.75 (t, 6H), 2.47 (t, 2H), 2.32 (t, 6H), 2.24 (s, 6H), 2.00-2.10 (m, 12H), 1.52-1.65 (m, 4H), 1.20-1.40 (m, 44H), 0.88 (t, 9H).
APCI-MS analysis: c64h115N3O8S, [ m+h ] calculated = 1186.7, observed = 1186.8.
2.GL-TES-SA-DMP-E18-2
Synthetic route
/>
Synthetic scheme
Compound II was prepared following the representative procedure described above in similar yields to those obtained for compound I.
Linoleic acid is treated with a chlorinating agent (e.g., oxalyl chloride) to provide the acid chloride compound 2. The reaction of compound 2 with a nucleophilic compound (e.g., buffer compound 1) provides compound 3. Compound 3 is treated with a chlorinating agent (e.g., oxalyl chloride) to provide the electrophilic compound 3-Cl. The reaction of 3-Cl with a nucleophile such as compound 4b then provides compound II.
The reaction conditions used were as follows:
1H NMR (300 MHz, chloroform-d) delta 5.24-5.42 (m, 12H), 4.08 (s, 6H), 3.17 (t, 2H), 3.06 (bs, 4H), 2.75 (t, 6H), 2.43 (t, 2H), 2.31 (t, 6H), 2.23 (s, 6H), 1.98-2.08 (m, 12H), 1.70 (quintuple peak, 2H), 1.52-1.63 (m, 4H), 1.17-1.45 (m, 44H), 0.87 (t, 9H).
APCI-MS analysis: c65h117N3O8S, [ m+h ] calculated = 1100.7, observed = 1100.8.
3.TL1-01D-DMA
Synthetic scheme
Synthetic scheme
Synthesis of trioctyl (2-hydroxypropane-1, 2, 3-tricarboxylic acid)
To a solution of citric acid A1 (2.1 g,11.0 mmol) and 1-octanol A2-1 (9.4 g,72.6 mmol) in dichloromethane (40 mL) were added DMAP (1.34 g,11.0 mmol) and EDCI (14.3 g,72.6 mmol), and the resulting mixture was stirred at room temperature for 24h. The reaction mixture was evaporated under vacuum. The residue was dissolved in dichloromethane (200 mL) and washed with brine (100 mL x 3). In the warp anhydrous Na 2 SO 4 After drying, the solvent was evaporated and the crude product was purified by column chromatography (220 g SiO 2 : gradient of 0 to 20% ethyl acetate in hexane) to obtain a solid phaseTrioctyl (2-hydroxypropane-1, 2, 3-tricarboxylic acid) was found to be a coloured oil (5.2 g, 90%).
Synthesis of trioctyl (2- ((3- (dimethylamino) propionyl) oxy) propane-1, 2, 3-tricarboxylic acid)
To a solution of trioctyl 2-hydroxypropane-1, 2, 3-tricarboxylic acid A3-1 (0.528 g,1.0 mmol), DMAP (122 mg,1.0 mmol) and pyridine (316 mg,4.0 mmol) in 10mL of dichloromethane was added 3- (dimethylamino) propionyl chloride A4-1 (271mg, 2.0 mmol) at 0deg.C, and the resulting mixture was stirred at room temperature for 24h. The reaction mixture was evaporated under vacuum. The residue was dissolved in dichloromethane (100 mL) and washed with brine (80 mL x 3). In the warp anhydrous Na 2 SO 4 After drying, the solvent was evaporated and the crude product was purified by column chromatography (80 g SiO 2 : a gradient of 0 to 10% methanol in dichloromethane) to give trioctyl 2- ((3- (dimethylamino) propionyl) oxy) propane-1, 2, 3-tricarboxylic acid (210 mg, 33%) as a colorless oil.
Alternatively, EDCI (13.1 g,68.5 mmol) and DMAP (2.09 g,17.1 mmol) were added to a suspension of 3- (dimethylamino) propionic acid (8.02 g,68.5 mmol) in 150mL of dichloromethane at 0deg.C, and the resulting mixture was stirred at that temperature for 5min. A solution of trioctyl 2-hydroxypropane-1, 2, 3-tricarboxylic acid A3-1 (9.05 g,17.1 mmol) in 10mL of dichloromethane was added, and the resulting mixture was stirred at room temperature for 48h. The reaction mixture was diluted with dichloromethane and washed with saturated sodium bicarbonate and brine. After drying over sodium sulfate, the organic layer was evaporated under vacuum. The residue was purified by column chromatography (220 g SiO2: 0 to 10% methanol in dichloromethane gradient) to give trioctyl 2- ((3- (dimethylamino) propionyl) oxy) propane-1, 2, 3-tricarboxylic acid (4.2 g, 38%) as a colorless oil.
1 H NMR(300MHz,CDCl 3 )δ4.56(s,br.,6H),4.24(t,2H),4.12(s,2H),2.55(t,2H),2.28-2.17(m,14H),1.63-1.48(m,8H),1.25(s,br.,32H),0.86(t,12H)。
APCI-MS analysis: c35h65NO8, [ m+h ] calculated = 627.9, observed = 628.5.
4.TL1-04D-DMA
TD1-04D-DMA may be prepared in a similar manner as TD-01D-DMA described above.
5.SY-3-E14-DMAPr
Synthetic scheme
R A =R B =C 12 H 25
Synthesis of 3- (dimethylamino) propyl 4-hydroxy-3, 5-dimethoxy benzoate (6)
To a suspension of syringic acid 5 (7.5 g,0.04 mol) in 100mL of dichloromethane was added oxalyl chloride (12.8 mL,0.15 mol) at 0 ℃, followed by dimethylformamide (5 drops), and the resulting mixture was stirred at that temperature for 2h. The reaction mixture was evaporated to dryness and the residue was dissolved in 100mL of dichloromethane. After cooling to 0 ℃, 3- (dimethylamino) propan-1-ol 2 (4.5 ml,40 mmol) was slowly added and the reaction mixture was stirred at room temperature overnight. The precipitate was filtered to give 3- (dimethylamino) propyl 4-hydroxy-3, 5-dimethoxy benzoate 6 (6.2 g, 58%) as a white solid.
6.TL1-10D-DMA
TD1-04D-DMA may be prepared in a similar manner as TD-01D-DMA described above.
7.HEP-E3-E10
Synthetic scheme
Scheme 1
Synthetic scheme
[3] Is synthesized by (a)
As described in scheme 1: to a solution containing HEP [1] (0.100 g, 0.284 mmol,1.0 eq), E3-E10[2] (0.668 g,1.038mmol,2.1 eq), 1ml dimethylformamide, 3ml dichloroethane, diisopropylethylamine (0.344. Mu.L, 1.98mmol,4.0 eq) and N, N-dimethylaminopyridine (0.024 g,0.198mmol,0.4 eq) was added 1-ethyl-3- (3-dimethylaminopropyl) carbodiimide (0.284 g,1.48mmol,3.0 eq) and allowed to react overnight (18 h) at room temperature. The reaction mixture was then concentrated using a rotary evaporator and purified using a Buchi Combi-flash system on a 12g,40 μm size silica gel column using hexane/ethyl acetate as the mobile phase to give a colorless oil (70% yield).
Synthesis of HEP-E3-E10[4]
As described in scheme 1: to a 20ml polypropylene scintillation vial equipped with a PTFE stirring rod was added [3] (0.500 g,0.344mmol,1.0 eq.) together with 4ml dry tetrahydrofuran. The vial was cooled to 0 ℃ to 5 ℃ on an ice bath and HF/pyridine (1.76 ml,67.86mmol,197.3 eq.) was added dropwise. After addition, the reaction vial was allowed to warm to room temperature and stirred overnight (18 h). The reaction mixture was then neutralized with saturated sodium bicarbonate at 0 ℃. Extraction was performed using ethyl acetate (3×). The organic layers were combined, washed with saturated sodium chloride (4×), dried over sodium sulfate, filtered and rotary evaporated to give a pale yellow oil. The oil was further purified using a Buchi Combi-flash system on a 12g,40 μm size silica gel column using dichloromethane/methanol (3% methanol) as the mobile phase to give a colorless oil (60% yield).
1H NMR(400MHz,CDCl3)4.16(m,4H),3.60(m,4H),2.97(m,3H),2.78(d,3H),2.58(m,9H),2.37(m,12H),2.15(m,2H),1.78(m,4H),1.44(m,7H),1.36(m,9H),1.26(br,45H),1.05(d,6H),0.87(t,12H)。
M/Z expected = 998.59, observed = 998.0.
8.HEP-E4-E10
Synthetic scheme
Scheme 2
Synthetic scheme
[12] Is synthesized by (a)
As described in scheme 2: to a solution of HEP [1] (0.100 g, 0.284 mmol,1.0 eq), E4-E10[11] (0.683 g,1.038mmol,2.1 eq), 1ml dimethylformamide, 3ml dichloroethane, diisopropylethylamine (0.344. Mu.L, 1.98mmol,4.0 eq) and N, N-dimethylaminopyridine (0.024 g,0.198mmol,0.4 eq) was added 1-ethyl-3- (3-dimethylaminopropyl) carbodiimide (0.284 g,1.48mmol,3.0 eq) and the reaction was allowed to proceed overnight (18 h) at room temperature. The reaction mixture was then concentrated using a rotary evaporator and purified using a Buchi Combi-flash system on a 12g,40 μm size silica gel column using hexane/ethyl acetate as the mobile phase to give a colorless oil (63.3% yield).
Synthesis of HEP-E4-E10[13]
As described in scheme 2: to a 20ml polypropylene scintillation vial equipped with a PTFE stirring rod was added [12] (0.450 g,0.303mmol,1.0 eq.) together with 4ml dry tetrahydrofuran. The vial was cooled to 0 ℃ to 5 ℃ on an ice bath and HF/pyridine (1.55 ml,59.920mmol,197.3 eq.) was added dropwise. After addition, the reaction vial was allowed to warm to room temperature and stirred overnight (18 h). The reaction mixture was then neutralized with saturated sodium bicarbonate at 0 ℃. Extraction was performed using ethyl acetate (3×). The organic layers were combined, washed with saturated sodium chloride (4×), dried over sodium sulfate, filtered and rotary evaporated to give a pale yellow oil. The oil was further purified using a Buchi Combi-flash system on a 12g,40 μm size silica gel column using dichloromethane/methanol (3%) as the mobile phase to give a colorless oil (48.4% yield).
1H NMR(400MHz,CDCl3)4.16(t,4H),3.62(br,4H),2.96(q,3H),2.76(d,4H),2.56(m,8H),2.40(m,4H),2.32(t,4H),2.13(t,2H),1.61(m,4H),1.46(m,8H),1.37(m,8H),1.28(br,44H),1.03(d,6H),0.87(t,12H),
13C NMR(400MHz,CDCl3)173.65(2C),69.65(2C),68.04(2C),62.84(2C),61.82(2C),61.44(2C),60.89(2C),55.57(4C),51.55(2C),35.35(4C),34.20(2C),32.09(7C),30.00(5C),29.77(6C),29.47(6C),26.93(2C),25.84(5C),22.84(9C),17.77(2C),14.30(7C)。
M/Z expected = 1025.64, observed = 1025.8.
9.Guan-SS-Chol
Guan-SS-Chol may be prepared according to the method described in International publication No. WO 2018/089801, which is hereby incorporated by reference in its entirety. Guan-SS-Chol and formula (V) (HGT 4002) are used interchangeably.
Example 6 evaluation of cationic lipids for pulmonary delivery
In this example, the in vivo efficacy of various cationic lipids was tested when mRNA encapsulated in lipid nanoparticles (mRNA-LNP) was administered to mice by pulmonary delivery. Both the potency (as determined by the level of protein production) and the tolerability (as determined by side effects associated with clearance and metabolism) of the cationic lipids were tested.
About 150 cationic lipids were tested. (FIG. 7). Each cationic lipid was used to prepare lipid nanoparticles encapsulating mRNA (FFL mRNA) encoding firefly luciferase protein according to methods known in the art. For example, suitable methods for mRNA encapsulation include those described in international publication nos. WO 2016/004318 and WO 2018/089801, which are hereby incorporated by reference in their entireties. The lipid nanoparticle tested comprised a lipid component consisting of cationic lipid, non-cationic lipid (DOPE), PEG-modified lipid (DMG-PEG 2K), and optionally cholesterol.
Through use ofNebulization lipid nanoparticle formulations comprising FFL mRNA were administered to male CD1 mice by intratracheal single administration. Animals were dosed with luciferin by intraperitoneal injection approximately 5 hours after dosing, and all animals were imaged using an IVIS imaging system to measure luciferase production in the lungs. Figure 7 shows the different efficacy of each cationic lipid in the lung with in vivo protein expression. Some cationic lipids have significantly greater than 50-fold increase in pulmonary protein expression compared to other cationic lipids.
Based on their performance in this in vivo screen, the following nine cationic lipids were selected for further study: GL-TES-SA-DME-E18-2, TL1-01D-DMA, SY-3-E14-DMAPR, TL1-10D-DMA, HGT4002 (also referred to herein as Guan-SS-Chol), GL-TES-SA-DMP-E18-2, HEP-E4-E10, HEP-E3-E10 and TL1-04D-DMA. Among these cationic lipids, HEP-E4-E10, HEP-E3-E10, GL-TES-SA-DME-E18-2, GL-TES-SA-DMP-E18-2, TL1-01D-DMA and TL1-04D-DMA exhibited particularly high potency as determined by the average radiation detected in the mouse lungs.
Example 7 evaluation of protein expression of lipid nanoparticles
In this example, both in vivo mRNA delivery and protein expression of cationic lipids were tested to evaluate potency and biodistribution. In this study, the cationic lipids TL-10D-DMA, SY-3-E14-DMAPR and TD1-04D-DMA were used to prepare lipid nanoparticles LNP-A, LNP-B and LNP-C, respectively, encapsulating mCherry mRNA.
mRNA-LNP administration by intratracheal administrationFor mice, and the amount of mRNA delivered to lung tissue was determined. As shown in fig. 8A, all mRNA-LNPs tested more efficiently delivered mRNA to lung cells. To check if the amount of mRNA delivered to the lung cells correlated with protein expression in the lung, the amount of mCherry protein was determined by ELISA. Fig. 8B shows the amount of mRNA in lung tissue in the x-axis and the amount of protein expressed in lung in the y-axis. The data indicate that certain LNPs have higher efficacy even when an equal amount of mRNA is delivered to the tissue, as shown by increased protein expression. For example, for an equal amount of 10 delivered 5 CN/mg tissue RNA, LNP-B produced about 10pg/mg protein, and LNP-A produced about 10 2 pg/mg protein.
Example 8 evaluation of the biodistribution of lipid nanoparticles by pulmonary delivery
In this example, mRNA-LNP biodistribution of LNP encapsulated mRNA was tested when the mRNA was administered to mice by pulmonary delivery.
First, a study was conducted to examine whether the mRNA-LNP of the present invention was effectively delivered to the lung in vivo. LNP encapsulating FFL mRNA was administered to CD-1 mice by intratracheal delivery, and radiation was detected 24 hours after administration. As shown in fig. 9A, the results demonstrate that mRNA-LNP was efficiently delivered to the lungs of mice.
To identify which types of cells were transfected with mRNA-LNP in vivo, a genetically modified mouse was used whose cells expressed fluorescent tdTomato protein after successful transfection with Cre recombinase. After in vivo administration of mRNA encoding Cre recombinase, successfully transfected cells in bulk tissue can be visualized at single cell resolution by detecting Cre-induced tdTomato expression. LNP encapsulating Cre recombinase mRNA was administered via aerosol inhalation to tdmamato transgenic mice. A single inhalation exposure occurs on a nasal-only atomizing tower with liquefied aerosol product. On day 3, approximately 48 hours after exposure, mice were euthanized and whole-body 3D imaging was performed using cryofluorescence tomography (CFT). This allows for high resolution imaging and spatial distribution of the tdtometer signal. In the event of efficient delivery, cre protein is expressed and results in tdtometer expression. After 48 hours, mice were imaged by cryofluorescence tomography. Fig. 9B shows that mRNA-LNP is efficiently delivered to the lung, and protein expression is observed even in the branches of the airways, as indicated by the arrows. Along the respiratory tract, mice treated with Cre mRNA-LNP displayed positive tdthato fluorescence in nasal epithelium, trachea and bronchi. Animals treated with saline did not show Td rimto signals in these tissues. Any signal observed in both saline and TdT treated mice was considered background signal and was not specific for the sample related delivery. The tdtometer signal was identified at the cellular level using pulmonary Immunohistochemistry (IHC) while maintaining tissue architecture. As shown in fig. 9C, positive IHC staining was observed in the whole lung, with bronchiole epithelium and alveolar lung cells being specific anatomical sites exhibiting tdthato positivity. Positive bronchiole epithelial cells include secretory cells and/or ciliated cells (arrows with "1"). Type I (arrow with "2") and type II lung cells are often positive, as are the bronchioles, alveolar ducts and dispersed macrophages in the alveoli.
Next, to examine the biodistribution and expression of mRNA-LNP at high resolution, LNP encapsulating CFTR mRNA was prepared. mRNA-LNP was administered by pulmonary delivery to CFTR Knockout (KO) mice. Protein expression was detected by immunofluorescence. Fig. 9D shows that CFTR protein expressed by the delivered mRNA-LNP is present on the top surface of the airway, as indicated by the arrow, demonstrating the effectiveness of the mRNA-LNP of the invention.
Example 9 evaluation of protein expression of lipid nanoparticles by HBEC-ALI
In this example, protein expression of mRNA-encapsulated LNP was tested using the HBEC-ALI (human bronchial epithelial cell-gas liquid interface) system. The HBEC-ALI technique is advantageous because it reproduces well-differentiated airway epithelium with different functional cells, allowing it to be used as a highly translatable airway cell model.
It is important to obtain successful HBEC-ALI culture that can be used for future experiments. Briefly, human bronchial epithelial cells were seeded onto wells and grown in culture. After confluence is reached, the apical medium is removed and replaced with growth medium. Prior to performing the experiments with mRNA-LNP, cells were grown to allow polarization and differentiation, as shown in fig. 10A. An exemplary HBEC-ALI system schematic is shown in fig. 10A. Differentiated epithelium was sectioned and stained with hematoxylin and eosin (H & E), as shown in fig. 10B, which indicates the presence of multi-ciliated cells that can be used as an airway cell model.
Cationic lipids ML2, GL-TES-SA-DMP-E18-2, GL-TES-SA-DME-E18-2, TL1-01D-DMA, TL1-04D-DMA, SY-3-E14-DMAPR, HEP-E3-E10 and HEP-E4-E10 were used to prepare LNPs encapsulating FFL mRNA. LNP encapsulating FFL mRNA was added to the top layer of HBEC-ALI. Luminescence is then measured to assess the amount of luciferase protein expressed in the cells. As shown in fig. 11A, all mRNA-LNPs tested showed dose-dependent protein expression. In addition, in the HEBC-ALI model, mRNA-LNP showed robust protein expression in lung cells. To examine whether human bronchial epithelial cells in the HBEC-ALI model maintained cell integrity during the experiment, transepithelial resistance (TEER), which is a strong indicator of epithelial integrity, was measured. As shown in fig. 11B, TEER was not significantly different, indicating that the monolayer remained intact for most treatments with mRNA-LNP. Thus, the HBEC-ALI model can be used as a robust in vitro system for evaluating protein expression of mRNA-LNP in lung cells.
To further examine whether data from the HBEC-ALI model is a good predictor of protein expression in vivo, ROC curves (receiver operating characteristics) were plotted. In general, the closer the ROC curve is to the upper left corner, the more effective the test. Statistical data from ROC curves showed high AUC (area under ROC curve) (0.827) and low p-value (< 0.013) (fig. 12), indicating that data from the HBEC-ALI model was interpretable for determining in vivo efficacy of mRNA-LNP, and that the HBEC-ALI model could be used to predict mRNA-LNP that is necessary for further study of in vivo applications.
Next, the lipid degradation rate after HBEC-ALI transfection was determined and compared with the results obtained with mouse lung and human lung homogenates. It is desirable that the lipid degrade rapidly to reduce the potential toxicity of the LNP component (including cationic lipids). The concentration of lipids in HBEC-ALI sample cultures was measured over time and plotted as shown in fig. 13A. The results show that after transfection with mRNA-LNP, the lipid rapidly degraded over time with a half-life of about 2.9 hours. The half-life values determined by the HBEC-ALI model were comparable to those determined by mouse lung and human lung homogenates (4.5 hours and 3.6 hours, respectively), as shown in fig. 13B. These results demonstrate that the HBEC-ALI model is a useful indicator of in vivo expression of mRNA-LNP.
Overall, the data in this example demonstrate that HBEC-ALI shows meaningful performance as a classification model for screening and filtering lipids prior to in vivo evaluation. In addition, the mRNA-LNP of the invention has robust protein expression and rapid degradation. In combination with the in vivo data provided herein, it is predicted that the mRNA-LNP of the present invention performs extremely well in both increasing potency and improving tolerability in vivo applications involving repeated delivery of mRNA to the lung via nebulization.
Equivalent content
Those skilled in the art will recognize, or be able to ascertain using no more than routine experimentation, many equivalents to the specific embodiments of the invention described herein. The scope of the invention is not intended to be limited by the foregoing description, but is instead as set forth in the following claims:
SEQUENCE LISTING
<110> translation Bio Inc
<120> improved compositions for delivery of codon optimized mRNA
<130> MRT-2205USP1
<140> PCT/US21/58623
<141> 2021-11-09
<150> 63/111,321
<151> 2020-11-09
<150> 63/195,581
<151> 2021-06-01
<160> 17
<170> PatentIn version 3.5
<210> 1
<211> 1480
<212> PRT
<213> Homo sapiens
<400> 1
Met Gln Arg Ser Pro Leu Glu Lys Ala Ser Val Val Ser Lys Leu Phe
1 5 10 15
Phe Ser Trp Thr Arg Pro Ile Leu Arg Lys Gly Tyr Arg Gln Arg Leu
20 25 30
Glu Leu Ser Asp Ile Tyr Gln Ile Pro Ser Val Asp Ser Ala Asp Asn
35 40 45
Leu Ser Glu Lys Leu Glu Arg Glu Trp Asp Arg Glu Leu Ala Ser Lys
50 55 60
Lys Asn Pro Lys Leu Ile Asn Ala Leu Arg Arg Cys Phe Phe Trp Arg
65 70 75 80
Phe Met Phe Tyr Gly Ile Phe Leu Tyr Leu Gly Glu Val Thr Lys Ala
85 90 95
Val Gln Pro Leu Leu Leu Gly Arg Ile Ile Ala Ser Tyr Asp Pro Asp
100 105 110
Asn Lys Glu Glu Arg Ser Ile Ala Ile Tyr Leu Gly Ile Gly Leu Cys
115 120 125
Leu Leu Phe Ile Val Arg Thr Leu Leu Leu His Pro Ala Ile Phe Gly
130 135 140
Leu His His Ile Gly Met Gln Met Arg Ile Ala Met Phe Ser Leu Ile
145 150 155 160
Tyr Lys Lys Thr Leu Lys Leu Ser Ser Arg Val Leu Asp Lys Ile Ser
165 170 175
Ile Gly Gln Leu Val Ser Leu Leu Ser Asn Asn Leu Asn Lys Phe Asp
180 185 190
Glu Gly Leu Ala Leu Ala His Phe Val Trp Ile Ala Pro Leu Gln Val
195 200 205
Ala Leu Leu Met Gly Leu Ile Trp Glu Leu Leu Gln Ala Ser Ala Phe
210 215 220
Cys Gly Leu Gly Phe Leu Ile Val Leu Ala Leu Phe Gln Ala Gly Leu
225 230 235 240
Gly Arg Met Met Met Lys Tyr Arg Asp Gln Arg Ala Gly Lys Ile Ser
245 250 255
Glu Arg Leu Val Ile Thr Ser Glu Met Ile Glu Asn Ile Gln Ser Val
260 265 270
Lys Ala Tyr Cys Trp Glu Glu Ala Met Glu Lys Met Ile Glu Asn Leu
275 280 285
Arg Gln Thr Glu Leu Lys Leu Thr Arg Lys Ala Ala Tyr Val Arg Tyr
290 295 300
Phe Asn Ser Ser Ala Phe Phe Phe Ser Gly Phe Phe Val Val Phe Leu
305 310 315 320
Ser Val Leu Pro Tyr Ala Leu Ile Lys Gly Ile Ile Leu Arg Lys Ile
325 330 335
Phe Thr Thr Ile Ser Phe Cys Ile Val Leu Arg Met Ala Val Thr Arg
340 345 350
Gln Phe Pro Trp Ala Val Gln Thr Trp Tyr Asp Ser Leu Gly Ala Ile
355 360 365
Asn Lys Ile Gln Asp Phe Leu Gln Lys Gln Glu Tyr Lys Thr Leu Glu
370 375 380
Tyr Asn Leu Thr Thr Thr Glu Val Val Met Glu Asn Val Thr Ala Phe
385 390 395 400
Trp Glu Glu Gly Phe Gly Glu Leu Phe Glu Lys Ala Lys Gln Asn Asn
405 410 415
Asn Asn Arg Lys Thr Ser Asn Gly Asp Asp Ser Leu Phe Phe Ser Asn
420 425 430
Phe Ser Leu Leu Gly Thr Pro Val Leu Lys Asp Ile Asn Phe Lys Ile
435 440 445
Glu Arg Gly Gln Leu Leu Ala Val Ala Gly Ser Thr Gly Ala Gly Lys
450 455 460
Thr Ser Leu Leu Met Val Ile Met Gly Glu Leu Glu Pro Ser Glu Gly
465 470 475 480
Lys Ile Lys His Ser Gly Arg Ile Ser Phe Cys Ser Gln Phe Ser Trp
485 490 495
Ile Met Pro Gly Thr Ile Lys Glu Asn Ile Ile Phe Gly Val Ser Tyr
500 505 510
Asp Glu Tyr Arg Tyr Arg Ser Val Ile Lys Ala Cys Gln Leu Glu Glu
515 520 525
Asp Ile Ser Lys Phe Ala Glu Lys Asp Asn Ile Val Leu Gly Glu Gly
530 535 540
Gly Ile Thr Leu Ser Gly Gly Gln Arg Ala Arg Ile Ser Leu Ala Arg
545 550 555 560
Ala Val Tyr Lys Asp Ala Asp Leu Tyr Leu Leu Asp Ser Pro Phe Gly
565 570 575
Tyr Leu Asp Val Leu Thr Glu Lys Glu Ile Phe Glu Ser Cys Val Cys
580 585 590
Lys Leu Met Ala Asn Lys Thr Arg Ile Leu Val Thr Ser Lys Met Glu
595 600 605
His Leu Lys Lys Ala Asp Lys Ile Leu Ile Leu His Glu Gly Ser Ser
610 615 620
Tyr Phe Tyr Gly Thr Phe Ser Glu Leu Gln Asn Leu Gln Pro Asp Phe
625 630 635 640
Ser Ser Lys Leu Met Gly Cys Asp Ser Phe Asp Gln Phe Ser Ala Glu
645 650 655
Arg Arg Asn Ser Ile Leu Thr Glu Thr Leu His Arg Phe Ser Leu Glu
660 665 670
Gly Asp Ala Pro Val Ser Trp Thr Glu Thr Lys Lys Gln Ser Phe Lys
675 680 685
Gln Thr Gly Glu Phe Gly Glu Lys Arg Lys Asn Ser Ile Leu Asn Pro
690 695 700
Ile Asn Ser Ile Arg Lys Phe Ser Ile Val Gln Lys Thr Pro Leu Gln
705 710 715 720
Met Asn Gly Ile Glu Glu Asp Ser Asp Glu Pro Leu Glu Arg Arg Leu
725 730 735
Ser Leu Val Pro Asp Ser Glu Gln Gly Glu Ala Ile Leu Pro Arg Ile
740 745 750
Ser Val Ile Ser Thr Gly Pro Thr Leu Gln Ala Arg Arg Arg Gln Ser
755 760 765
Val Leu Asn Leu Met Thr His Ser Val Asn Gln Gly Gln Asn Ile His
770 775 780
Arg Lys Thr Thr Ala Ser Thr Arg Lys Val Ser Leu Ala Pro Gln Ala
785 790 795 800
Asn Leu Thr Glu Leu Asp Ile Tyr Ser Arg Arg Leu Ser Gln Glu Thr
805 810 815
Gly Leu Glu Ile Ser Glu Glu Ile Asn Glu Glu Asp Leu Lys Glu Cys
820 825 830
Phe Phe Asp Asp Met Glu Ser Ile Pro Ala Val Thr Thr Trp Asn Thr
835 840 845
Tyr Leu Arg Tyr Ile Thr Val His Lys Ser Leu Ile Phe Val Leu Ile
850 855 860
Trp Cys Leu Val Ile Phe Leu Ala Glu Val Ala Ala Ser Leu Val Val
865 870 875 880
Leu Trp Leu Leu Gly Asn Thr Pro Leu Gln Asp Lys Gly Asn Ser Thr
885 890 895
His Ser Arg Asn Asn Ser Tyr Ala Val Ile Ile Thr Ser Thr Ser Ser
900 905 910
Tyr Tyr Val Phe Tyr Ile Tyr Val Gly Val Ala Asp Thr Leu Leu Ala
915 920 925
Met Gly Phe Phe Arg Gly Leu Pro Leu Val His Thr Leu Ile Thr Val
930 935 940
Ser Lys Ile Leu His His Lys Met Leu His Ser Val Leu Gln Ala Pro
945 950 955 960
Met Ser Thr Leu Asn Thr Leu Lys Ala Gly Gly Ile Leu Asn Arg Phe
965 970 975
Ser Lys Asp Ile Ala Ile Leu Asp Asp Leu Leu Pro Leu Thr Ile Phe
980 985 990
Asp Phe Ile Gln Leu Leu Leu Ile Val Ile Gly Ala Ile Ala Val Val
995 1000 1005
Ala Val Leu Gln Pro Tyr Ile Phe Val Ala Thr Val Pro Val Ile
1010 1015 1020
Val Ala Phe Ile Met Leu Arg Ala Tyr Phe Leu Gln Thr Ser Gln
1025 1030 1035
Gln Leu Lys Gln Leu Glu Ser Glu Gly Arg Ser Pro Ile Phe Thr
1040 1045 1050
His Leu Val Thr Ser Leu Lys Gly Leu Trp Thr Leu Arg Ala Phe
1055 1060 1065
Gly Arg Gln Pro Tyr Phe Glu Thr Leu Phe His Lys Ala Leu Asn
1070 1075 1080
Leu His Thr Ala Asn Trp Phe Leu Tyr Leu Ser Thr Leu Arg Trp
1085 1090 1095
Phe Gln Met Arg Ile Glu Met Ile Phe Val Ile Phe Phe Ile Ala
1100 1105 1110
Val Thr Phe Ile Ser Ile Leu Thr Thr Gly Glu Gly Glu Gly Arg
1115 1120 1125
Val Gly Ile Ile Leu Thr Leu Ala Met Asn Ile Met Ser Thr Leu
1130 1135 1140
Gln Trp Ala Val Asn Ser Ser Ile Asp Val Asp Ser Leu Met Arg
1145 1150 1155
Ser Val Ser Arg Val Phe Lys Phe Ile Asp Met Pro Thr Glu Gly
1160 1165 1170
Lys Pro Thr Lys Ser Thr Lys Pro Tyr Lys Asn Gly Gln Leu Ser
1175 1180 1185
Lys Val Met Ile Ile Glu Asn Ser His Val Lys Lys Asp Asp Ile
1190 1195 1200
Trp Pro Ser Gly Gly Gln Met Thr Val Lys Asp Leu Thr Ala Lys
1205 1210 1215
Tyr Thr Glu Gly Gly Asn Ala Ile Leu Glu Asn Ile Ser Phe Ser
1220 1225 1230
Ile Ser Pro Gly Gln Arg Val Gly Leu Leu Gly Arg Thr Gly Ser
1235 1240 1245
Gly Lys Ser Thr Leu Leu Ser Ala Phe Leu Arg Leu Leu Asn Thr
1250 1255 1260
Glu Gly Glu Ile Gln Ile Asp Gly Val Ser Trp Asp Ser Ile Thr
1265 1270 1275
Leu Gln Gln Trp Arg Lys Ala Phe Gly Val Ile Pro Gln Lys Val
1280 1285 1290
Phe Ile Phe Ser Gly Thr Phe Arg Lys Asn Leu Asp Pro Tyr Glu
1295 1300 1305
Gln Trp Ser Asp Gln Glu Ile Trp Lys Val Ala Asp Glu Val Gly
1310 1315 1320
Leu Arg Ser Val Ile Glu Gln Phe Pro Gly Lys Leu Asp Phe Val
1325 1330 1335
Leu Val Asp Gly Gly Cys Val Leu Ser His Gly His Lys Gln Leu
1340 1345 1350
Met Cys Leu Ala Arg Ser Val Leu Ser Lys Ala Lys Ile Leu Leu
1355 1360 1365
Leu Asp Glu Pro Ser Ala His Leu Asp Pro Val Thr Tyr Gln Ile
1370 1375 1380
Ile Arg Arg Thr Leu Lys Gln Ala Phe Ala Asp Cys Thr Val Ile
1385 1390 1395
Leu Cys Glu His Arg Ile Glu Ala Met Leu Glu Cys Gln Gln Phe
1400 1405 1410
Leu Val Ile Glu Glu Asn Lys Val Arg Gln Tyr Asp Ser Ile Gln
1415 1420 1425
Lys Leu Leu Asn Glu Arg Ser Leu Phe Arg Gln Ala Ile Ser Pro
1430 1435 1440
Ser Asp Arg Val Lys Leu Phe Pro His Arg Asn Ser Ser Lys Cys
1445 1450 1455
Lys Ser Lys Pro Gln Ile Ala Ala Leu Lys Glu Glu Thr Glu Glu
1460 1465 1470
Glu Val Gln Asp Thr Arg Leu
1475 1480
<210> 2
<211> 4443
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic polynucleotide
<400> 2
atgcagcgtt ctcccctgga gaaggcttct gtggtgagta aacttttttt ctcctggacc 60
agacctatcc tgaggaaagg ctacaggcag agactggagc tctctgacat ataccagata 120
ccttcagtcg atagcgccga caacctgagc gagaagctgg aacgcgagtg ggacagagag 180
ctggcaagca agaagaaccc aaagctgatt aatgccctga gaaggtgttt cttctggaga 240
ttcatgttct acggaatctt tctgtatctg ggggaggtta caaaggctgt gcaacccctg 300
ctgctcggca gaatcatcgc ctcatacgat ccagacaaca aggaagaaag aagcatcgcc 360
atctacctgg gcattggcct ctgcctcctg tttattgtgc ggactctgct gctgcaccca 420
gcaattttcg ggttgcatca tattggcatg cagatgcgca ttgctatgtt ttccctcatc 480
tacaaaaaga cactgaaact cagctcccgg gtgctggaca agatctccat cggccaactg 540
gtgtctctcc tgagcaataa cttgaataag ttcgacgaag ggctggccct ggcacacttc 600
gtgtggattg cccccctgca ggtggccctg ctgatgggac tgatttggga actgctgcag 660
gctagcgctt tctgcggcct ggggttcctg atcgtgctgg cactgtttca ggcaggcctg 720
ggccgtatga tgatgaagta cagagaccag agggccggga agatctccga acggctcgtt 780
attacctctg agatgatcga gaacattcag tctgtgaaag cctactgctg ggaggaggct 840
atggagaaga tgatcgagaa tctgagacag accgagctga agctgaccag aaaggccgcc 900
tacgtgaggt acttcaacag cagtgccttc ttcttctctg ggttcttcgt tgtgtttctg 960
agcgtgctgc catacgctct catcaaaggc atcatcctgc ggaagatctt caccaccatc 1020
agcttttgca tcgtgcttag aatggccgtg acacggcagt tcccatgggc cgttcaaact 1080
tggtatgatt ccctgggcgc catcaacaaa atccaggatt tcctgcagaa gcaggaatac 1140
aagacactcg aatataacct cacaactact gaggtggtta tggagaacgt gactgccttc 1200
tgggaggagg ggttcggaga gctttttgag aaggccaaac agaataataa taaccgcaaa 1260
accagcaacg gcgacgacag cctgttcttc tccaattttt ctctcctggg aacacccgtc 1320
ctcaaagaca tcaactttaa gatcgagagg ggccagctgc tcgccgtcgc cggatccaca 1380
ggcgccggca agacctctct gctgatggtt atcatgggcg aactggagcc ctccgagggc 1440
aagattaagc actcaggaag aatctccttt tgtagccagt tcagttggat tatgcccggc 1500
actattaagg agaatatcat ttttggggtg agctatgatg agtatcggta tcggagcgtt 1560
atcaaagcct gtcagctgga ggaggatatc agcaagttcg cagagaagga taatattgtg 1620
ctgggagagg gaggaatcac cctgagcgga ggccagagag ccagaatctc actggcccgg 1680
gccgtctaca aggacgccga cctttacctt ctggacagtc cctttggata tctggatgtg 1740
ctgactgaaa aggagatctt cgagtcttgt gtgtgcaagc tgatggctaa caagacccgg 1800
atcctagtga ctagtaagat ggagcacctg aagaaggcag acaagatctt gattctgcac 1860
gagggatcct cttactttta cggcaccttt agcgagctgc agaacctcca gcccgatttc 1920
tcatctaagc tgatgggctg tgatagcttc gaccagttct ctgccgagcg cagaaacagc 1980
atcctgacag agacactgca ccggttttca ctggagggcg acgcccctgt cagctggacc 2040
gagaccaaaa agcagtcttt caagcagaca ggcgagttcg gcgagaagcg caaaaacagc 2100
atcctgaatc caatcaactc tataaggaag tttagcatcg tgcagaagac acccctccag 2160
atgaacggca tcgaagagga cagtgacgag cccctggagc ggcgcctgag cctcgtgcct 2220
gacagcgaac agggcgaggc catcctgcct aggatcagcg tgatttcaac cgggccaaca 2280
ctgcaggcta ggagaagaca gtcagtgctt aacctgatga cacatagcgt gaatcaggga 2340
cagaacatcc atcgaaaaac cacagcctct actcgcaaag tgtcactggc tcctcaggct 2400
aatctgacag agctggacat ctatagcagg aggctgagcc aggagacagg cctggagatc 2460
agtgaggaga tcaacgaaga ggacctgaag gagtgctttt tcgatgacat ggagagtatc 2520
cccgccgtca ccacctggaa tacctacctc cggtacatca cagtgcacaa gtccctcatc 2580
tttgtgctga tttggtgcct cgtgatcttt ctcgcagaag tggccgcctc cctggtggtg 2640
ctgtggctgt tggggaatac tccactgcag gacaaaggca attctacaca cagcaggaat 2700
aattcctatg ccgtgattat caccagcaca tcctcttact acgtgttcta catctacgtg 2760
ggagtggcag atactctgct tgcaatgggc ttcttcaggg ggctgcccct ggtgcacaca 2820
ctgatcacag tgtccaagat cctccaccat aaaatgctcc acagcgtgct gcaggcaccc 2880
atgagcaccc tgaacacact gaaggccggc ggcatcctga atcgcttttc caaagacatc 2940
gccatcctcg acgatctcct gccactgacc atcttcgatt ttatccagct gctgctgatc 3000
gtgatcgggg ccatcgccgt ggtggccgtg ctgcagccat acattttcgt ggctacagtg 3060
cccgtgatcg ttgcctttat catgctgaga gcctacttcc tgcagacttc tcagcagctg 3120
aagcagctgg agagcgaagg gagaagcccc atcttcactc acctggtgac aagcctgaag 3180
ggactctgga ccctgagagc cttcggccgg cagccctatt tcgagaccct gtttcacaag 3240
gccctcaacc tgcacacagc caactggttc ctctacctgt ccaccctgag gtggttccag 3300
atgaggattg aaatgatctt cgtgattttt ttcatcgccg tgacattcat tagcattctg 3360
accaccggcg agggggaggg gagagtgggc atcatcctga cccttgccat gaacattatg 3420
agcacactgc agtgggccgt gaatagtagt atcgacgtgg acagtctgat gaggtccgtg 3480
agccgggtgt tcaagttcat tgacatgccc acagaaggga aacccaccaa aagcaccaag 3540
ccctacaaga acgggcagct gtccaaggtt atgatcatcg agaactctca cgtgaagaag 3600
gacgacattt ggcccagcgg cggccagatg acagtgaaag atctgaccgc caaatacacc 3660
gagggaggca acgccatcct cgaaaacatt agcttctcta tcagccctgg acagagggtg 3720
ggcctgctgg gccggacagg ctcagggaag agtactctgc tgtcagcatt cctgaggctc 3780
ctgaacacag agggcgagat ccagattgac ggcgtgtcct gggactccat caccctgcag 3840
cagtggcgga aggctttcgg ggtgatcccc cagaaggtgt tcatctttag cggcactttc 3900
agaaagaatc tggaccctta tgagcagtgg agtgaccagg agatctggaa agtggccgat 3960
gaggtcggac tgaggagcgt gatcgagcag tttccaggga agctggactt tgtgctggtg 4020
gatggcggat gcgtgctgtc tcacggccat aaacagctga tgtgtctggc ccggtccgtg 4080
ctgtctaagg ccaagatcct gctgctggac gaaccctccg cccacctgga ccccgtgaca 4140
taccagatca tcaggagaac tctcaagcag gccttcgccg actgtaccgt gattctgtgc 4200
gagcaccgca ttgaagctat gctggagtgt cagcagttcc tggtgatcga ggaaaataag 4260
gtgaggcagt acgacagcat ccagaagctg ctgaacgagc gctccctgtt ccgccaggct 4320
atctccccat cagaccgggt gaagctcttc ccccacagaa actcctcaaa gtgcaagtcc 4380
aagccccaga tcgccgccct gaaggaggag accgaggagg aggtgcagga caccaggctg 4440
tga 4443
<210> 3
<211> 4443
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic polynucleotide
<400> 3
atgcagcgct cgcctctgga aaaggcgagc gtcgtgtcaa agctattctt ttcttggacc 60
cggcccattc tcaggaaggg ctacaggcag aggctggagt tgagcgacat ctatcagatt 120
ccttccgtgg acagcgccga caacctgagc gagaagctgg aaagggagtg ggaccgcgaa 180
ctggcaagca aaaagaaccc caagctgatc aatgccctga gaaggtgttt cttttggaga 240
ttcatgttct acgggatctt tctgtatctg ggcgaggtta caaaggctgt gcagcccctg 300
ctgctcggca gaatcatcgc ctcatacgat ccagacaaca aggaagaaag aagcatcgcc 360
atctacctgg gcattggcct ctgcctcctg tttattgtgc ggactctgct gctgcaccca 420
gcaattttcg ggttgcatca tattggcatg cagatgcgca ttgctatgtt ttccctcatc 480
tacaaaaaga cactgaaact cagctcccgg gtgctggaca agatctccat cggccaactg 540
gtgtctctcc tgagcaataa cttgaataag ttcgacgaag ggctggccct ggcacacttc 600
gtgtggattg cccccctgca ggtggccctg ctgatgggac tgatttggga actgctgcag 660
gctagcgctt tctgcggcct ggggttcctg atcgtgctgg cactgtttca ggcaggcctg 720
ggccgtatga tgatgaagta cagagaccag agggccggga agatctccga acggctcgtt 780
attacctctg agatgatcga gaacattcag tctgtgaaag cctactgctg ggaggaggct 840
atggagaaga tgatcgagaa tctgagacag accgagctga agctgaccag aaaggccgcc 900
tacgtgaggt acttcaacag cagtgccttc ttcttctctg gcttcttcgt tgtgtttctg 960
agcgtgctgc catacgctct catcaaaggc atcatcctgc ggaagatctt caccaccatc 1020
agcttttgca tcgtgcttag aatggccgtg acccggcagt tcccatgggc cgtgcaaact 1080
tggtatgatt ccctgggcgc catcaacaaa atccaggatt tcctgcagaa gcaggaatac 1140
aagacactcg aatataatct cacaactact gaggtggtta tggagaacgt gactgccttc 1200
tgggaggagg ggttcggaga gctttttgag aaggcaaaac agaataacaa caaccgcaaa 1260
accagcaacg gcgacgacag cctgttcttc tccaattttt ctctcctggg aacacccgtc 1320
ctcaaagaca tcaactttaa gatcgagagg ggacagctgc tcgcagtcgc cggatccaca 1380
ggcgccggca agacctctct gctgatggtt atcatgggcg aactggagcc atccgagggc 1440
aagattaagc acagtggaag aatctccttt tgtagccagt tcagttggat tatgcccggc 1500
actattaagg agaatatcat ttttggggtg agctatgatg agtatcggta tcggagcgtt 1560
atcaaagcct gtcagctgga ggaggatatc agcaaattcg cagagaagga taatatcgtg 1620
ctgggggagg ggggaatcac cctgagcgga ggccagagag ccagaatctc actggcccgg 1680
gccgtctaca aggacgccga cctttacctt ctggacagtc cctttggata tctggatgtg 1740
ctgactgaaa aggagatctt cgagtcttgt gtgtgcaagc tgatggctaa taagacccgg 1800
atcctagtga ccagtaagat ggagcacctg aagaaggcag acaagatctt gattctgcac 1860
gagggatcct cttactttta cggcaccttt agcgagctgc agaatctcca gcccgatttc 1920
tcatctaagc tgatgggctg tgatagcttc gaccagttct ctgccgagcg cagaaacagc 1980
atcctgacag agacactgca ccggttttca ctggagggcg acgcccctgt cagctggacc 2040
gagaccaaaa agcagtcttt caagcagaca ggcgagttcg gcgagaagcg caaaaacagc 2100
atcctgaatc caatcaactc tataaggaag tttagcatcg tgcagaagac acccctccag 2160
atgaacggca tcgaagagga cagtgacgag cccctggagc ggcgcctgag cctcgtgcct 2220
gacagcgaac agggcgaggc catcctgcct aggatcagcg tgatttcaac cgggccaaca 2280
ctgcaggcta ggagaagaca gtcagtgctt aacctgatga cacatagcgt gaatcaggga 2340
cagaacatcc atcgaaaaac cacagcctct actcgcaaag tgtcactggc tcctcaggct 2400
aatctgacag agctggacat ctatagcagg aggctgagcc aggagacagg cctggagatc 2460
agtgaggaga tcaacgaaga ggacctgaag gagtgctttt tcgatgacat ggagagtatc 2520
cccgccgtca ccacctggaa tacctacctc cggtacatca cagtgcacaa gtccctcatc 2580
tttgtgctga tttggtgcct cgtgatcttt ctcgcagaag tggccgcctc cctggtggtg 2640
ctgtggctgt tggggaatac tccactgcag gacaaaggca attctacaca cagcaggaat 2700
aattcctatg ccgtgattat caccagcaca tcctcttact acgtgttcta catctacgtg 2760
ggagtggcag atactctgct tgcaatgggc ttcttcaggg ggctgcccct ggtgcacaca 2820
ctgatcacag tgtccaagat cctccaccat aaaatgctcc acagcgtgct gcaggcaccc 2880
atgagcaccc tgaacacact gaaggccggc ggcatcctga atcgcttttc caaagacatc 2940
gccatcctcg acgatctcct gccactgacc atcttcgatt ttatccagct gctgctgatc 3000
gtgatcgggg ccatcgccgt ggtggccgtg ctgcagccat acattttcgt ggctacagtg 3060
cccgtgatcg ttgcctttat catgctgaga gcctacttcc tgcagacttc tcagcagctg 3120
aagcagctgg agagcgaagg gagaagcccc atcttcactc acctggtgac aagcctgaag 3180
ggactctgga ccctgagagc cttcggccgg cagccctatt tcgagaccct gtttcacaag 3240
gccctcaacc tgcacacagc caactggttt ctctacctgt ccaccctgag gtggttccag 3300
atgaggattg aaatgatctt cgtgattttt ttcatcgccg tgacattcat tagcattctg 3360
accaccggcg agggggaggg gagagtgggc atcatcctga cccttgccat gaacattatg 3420
tccacactgc agtgggccgt gaatagttca atcgacgtgg acagtctgat gaggtccgtg 3480
agccgggtgt tcaagttcat tgacatgccc acagagggga aacccaccaa aagcaccaag 3540
ccctacaaga acgggcagct gtccaaggtt atgatcatcg agaactctca cgtgaagaag 3600
gacgacattt ggcccagcgg cggccagatg acagtgaaag atctgaccgc caaatacacc 3660
gagggaggca acgccatcct cgaaaacatt agcttctcta tcagccctgg acagagggtg 3720
ggcctgctgg gccggacagg ctcagggaag agtactctgc tgtcagcatt cctgaggctc 3780
ctgaacacag agggcgagat ccagattgac ggcgtgtcct gggactccat caccctgcag 3840
cagtggcgga aggctttcgg ggtgatcccc cagaaggtgt tcatctttag cggcactttc 3900
agaaagaatc tggaccctta tgagcagtgg agtgaccagg agatctggaa agtggccgat 3960
gaggtcggac tgaggagcgt gatcgagcag tttccaggga agctggactt tgtgctggtg 4020
gatggcggat gcgtgctgtc tcacggccat aaacagctga tgtgtctggc ccggtccgtg 4080
ctgtctaagg ccaagatcct gctgctggac gaaccctccg cccacctgga ccccgtgaca 4140
taccagatca tcaggagaac tctcaagcag gccttcgccg actgtaccgt gattctgtgc 4200
gagcaccgca ttgaagctat gctggagtgt cagcagttcc tggtgatcga ggaaaataag 4260
gtgaggcagt acgacagcat ccagaagctg ctgaacgagc gctccctgtt ccgccaggct 4320
atctccccat cagaccgggt gaagctcttc ccccacagaa actcctcaaa gtgcaagtcc 4380
aagccccaga tcgccgccct gaaggaggag accgaggagg aggtgcagga caccaggctg 4440
tga 4443
<210> 4
<211> 4443
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic polynucleotide
<400> 4
atgcagcgtt ctcccctgga gaaggcttct gtggtgagta aacttttttt ctcctggacc 60
agacctatcc tgaggaaagg ctacaggcag agactggagc tctctgacat ataccagata 120
ccttcagtcg atagcgccga caacctgagc gagaagctgg aacgcgagtg ggacagagag 180
ctggcaagca agaagaaccc aaagctgatt aatgccctga gaaggtgttt cttctggaga 240
ttcatgttct acggaatctt tctgtatctg ggggaggtta caaaggctgt gcaacccctg 300
ctgctcggca gaatcatcgc ctcatacgat ccagacaaca aggaagaaag aagcatcgcc 360
atctacctgg gcattggcct ctgcctcctg tttattgtgc ggactctgct gctgcaccca 420
gcaattttcg ggttgcatca tattggcatg cagatgcgca ttgctatgtt ttccctcatc 480
tacaaaaaga cactgaaact cagctcccgg gtgctggaca agatctccat cggccaactg 540
gtgtctctcc tgagcaataa cttgaataag ttcgacgaag ggctggccct ggcacacttc 600
gtgtggattg cccccctgca ggtggccctg ctgatgggac tgatttggga actgctgcag 660
gctagcgctt tctgcggcct ggggttcctg atcgtgctgg cactgtttca ggcaggcctg 720
ggccgtatga tgatgaagta cagagaccag agggccggga agatctccga acggctcgtt 780
attacctctg agatgatcga gaacattcag tctgtgaaag cctactgctg ggaggaggct 840
atggagaaga tgatcgagaa tctgagacag accgagctga agctgaccag aaaggccgcc 900
tacgtgaggt acttcaacag cagtgccttc ttcttctctg ggttcttcgt tgtgtttctg 960
agcgtgctgc catacgctct catcaaaggc atcatcctgc ggaagatctt caccaccatc 1020
agcttttgca tcgtgcttag aatggccgtg acccggcagt tcccatgggc cgtgcaaact 1080
tggtatgatt ccctgggcgc catcaacaaa atccaggatt tcctgcagaa gcaggaatac 1140
aagacactcg aatataatct cacaactact gaggtggtta tggagaacgt gactgccttc 1200
tgggaggagg ggttcggaga gctttttgag aaggcaaaac agaataacaa caaccgcaaa 1260
accagcaacg gcgacgacag cctgttcttc tccaattttt ctctcctggg aacacccgtc 1320
ctcaaagaca tcaactttaa gatcgagagg ggccagctgc tcgccgtcgc cggatccaca 1380
ggcgccggca agacctctct gctgatggtt atcatgggcg aactggagcc ctccgagggc 1440
aagattaagc actcaggaag aatctccttt tgtagccagt tcagttggat tatgcccggc 1500
actattaagg agaatatcat ttttggggtg agctatgatg agtatcggta tcggagcgtt 1560
atcaaagcct gtcagctgga ggaggatatc agcaagttcg cagagaagga taatattgtg 1620
ctgggagagg gaggaatcac cctgagcgga ggccagagag ccagaatctc actggcccgg 1680
gccgtctaca aggacgccga cctttacctt ctggacagtc cctttggata tctggatgtg 1740
ctgactgaaa aggagatctt cgagtcttgt gtgtgcaagc tgatggctaa taagacccgg 1800
atcctagtga ccagtaagat ggagcacctg aagaaggcag acaagatctt gattctgcac 1860
gagggatcct cttactttta cggcaccttt agcgagctgc agaatctcca gcccgatttc 1920
tcatctaagc tgatgggctg tgatagcttc gaccagttct ctgccgagcg cagaaacagc 1980
atcctgacag agacactgca ccggttttca ctggagggcg acgcccctgt cagctggacc 2040
gagaccaaaa agcagtcttt caagcagaca ggcgagttcg gcgagaagcg caaaaacagc 2100
atcctgaatc caatcaactc tataaggaag tttagcatcg tgcagaagac acccctccag 2160
atgaacggca tcgaagagga cagtgacgag cccctggagc ggcgcctgag cctcgtgcct 2220
gacagcgaac agggcgaggc catcctgcct aggatcagcg tgatttcaac cgggccaaca 2280
ctgcaggcta ggagaagaca gtcagtgctt aacctgatga cacatagcgt gaatcaggga 2340
cagaacatcc atcgaaaaac cacagcctct actcgcaaag tgtcactggc tcctcaggct 2400
aatctgacag agctggacat ctatagcagg aggctgagcc aggagacagg cctggagatc 2460
agtgaggaga tcaacgaaga ggacctgaag gagtgctttt tcgatgacat ggagagtatc 2520
cccgccgtca ccacctggaa tacctacctc cggtacatca cagtgcacaa gtccctcatc 2580
tttgtgctga tttggtgcct cgtgatcttt ctcgcagaag tggccgcctc cctggtggtg 2640
ctgtggctgt tggggaatac tccactgcag gacaaaggca attctacaca cagcaggaat 2700
aattcctatg ccgtgattat caccagcaca tcctcttact acgtgttcta catctacgtg 2760
ggagtggcag atactctgct tgcaatgggc ttcttcaggg ggctgcccct ggtgcacaca 2820
ctgatcacag tgtccaagat cctccaccat aaaatgctcc acagcgtgct gcaggcaccc 2880
atgagcaccc tgaacacact gaaggccggc ggcatcctga atcgcttttc caaagacatc 2940
gccatcctcg acgatctcct gccactgacc atcttcgatt ttatccagct gctgctgatc 3000
gtgatcgggg ccatcgccgt ggtggccgtg ctgcagccat acattttcgt ggctacagtg 3060
cccgtgatcg ttgcctttat catgctgaga gcctacttcc tgcagacttc tcagcagctg 3120
aagcagctgg agagcgaagg gagaagcccc atcttcactc acctggtgac aagcctgaag 3180
ggactctgga ccctgagagc cttcggccgg cagccctatt tcgagaccct gtttcacaag 3240
gccctcaacc tgcacacagc caactggttt ctctacctgt ccaccctgag gtggttccag 3300
atgaggattg aaatgatctt cgtgattttt ttcatcgccg tgacattcat tagcattctg 3360
accaccggcg agggggaggg gagagtgggc atcatcctga cccttgccat gaacattatg 3420
agcacactgc agtgggccgt gaatagtagt atcgacgtgg acagtctgat gaggtccgtg 3480
agccgggtgt tcaagttcat tgacatgccc acagaaggga aacccaccaa aagcaccaag 3540
ccctacaaga acgggcagct gtccaaggtt atgatcatcg agaactctca cgtgaagaag 3600
gacgacattt ggcccagcgg cggccagatg acagtgaaag atctgaccgc caaatacacc 3660
gagggaggca acgccatcct cgaaaacatt agcttctcta tcagccctgg acagagggtg 3720
ggcctgctgg gccggacagg ctcagggaag agtactctgc tgtcagcatt cctgaggctc 3780
ctgaacacag agggcgagat ccagattgac ggcgtgtcct gggactccat caccctgcag 3840
cagtggcgga aggctttcgg ggtgatcccc cagaaggtgt tcatctttag cggcactttc 3900
agaaagaatc tggaccctta tgagcagtgg agtgaccagg agatctggaa agtggccgat 3960
gaggtcggac tgaggagcgt gatcgagcag tttccaggga agctggactt tgtgctggtg 4020
gatggcggat gcgtgctgtc tcacggccat aaacagctga tgtgtctggc ccggtccgtg 4080
ctgtctaagg ccaagatcct gctgctggac gaaccctccg cccacctgga ccccgtgaca 4140
taccagatca tcaggagaac tctcaagcag gccttcgccg actgtaccgt gattctgtgc 4200
gagcaccgca ttgaagctat gctggagtgt cagcagttcc tggtgatcga ggaaaataag 4260
gtgaggcagt acgacagcat ccagaagctg ctgaacgagc gctccctgtt ccgccaggct 4320
atctccccat cagaccgggt gaaactcttc ccccacagaa actcctcaaa gtgcaagtcc 4380
aagccccaga tcgccgccct gaaggaggag accgaggagg aggtgcagga caccaggctg 4440
tga 4443
<210> 5
<211> 4443
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic polynucleotide
<400> 5
atgcagcgct cgcctctgga aaaggcgagc gtcgtgtcaa agctattctt ttcttggacc 60
cggcccattc tcaggaaggg ctacaggcag aggctggagt tgagcgacat ctatcagatt 120
ccttccgtgg acagcgccga caacctgagc gagaagctgg aaagggagtg ggaccgcgaa 180
ctggcaagca aaaagaaccc caagctgatc aatgccctga gaaggtgttt cttttggaga 240
ttcatgttct acgggatctt tctgtatctg ggcgaggtta caaaggctgt gcagcccctg 300
ctgctcggca gaatcatcgc ctcatacgat ccagacaaca aggaagaaag aagcatcgcc 360
atctacctgg gcattggcct ctgcctcctg tttattgtgc ggactctgct gctgcaccca 420
gcaattttcg ggttgcatca tattggcatg cagatgcgca ttgctatgtt ttccctcatc 480
tacaaaaaga cactgaaact cagctcccgg gtgctggaca agatctccat cggccaactg 540
gtgtctctcc tgagcaataa cttgaataag ttcgacgaag ggctggccct ggcacacttc 600
gtgtggattg cccccctgca ggtggccctg ctgatgggac tgatttggga actgctgcag 660
gctagcgctt tctgcggcct ggggttcctg atcgtgctgg cactgtttca ggcaggcctg 720
ggccgtatga tgatgaagta cagagaccag agggccggga agatctccga acggctcgtt 780
attacctctg agatgatcga gaacattcag tctgtgaaag cctactgctg ggaggaggct 840
atggagaaga tgatcgagaa tctgagacag accgagctga agctgaccag aaaggccgcc 900
tacgtgaggt acttcaacag cagtgccttc ttcttctctg gcttcttcgt tgtgtttctg 960
agcgtgctgc catacgctct catcaaaggc atcatcctgc ggaagatctt caccaccatc 1020
agcttttgca tcgtgcttag aatggccgtg acacggcagt tcccatgggc cgttcaaact 1080
tggtatgatt ccctgggcgc catcaacaaa atccaggatt tcctgcagaa gcaggaatac 1140
aagacactcg aatataacct cacaactact gaggtggtta tggagaacgt gactgccttc 1200
tgggaggagg ggttcggaga gctttttgag aaggccaaac agaataataa taaccgcaaa 1260
accagcaacg gcgacgacag cctgttcttc tccaattttt ctctcctggg aacacccgtc 1320
ctcaaagaca tcaactttaa gatcgagagg ggacagctgc tcgcagtcgc cggatccaca 1380
ggcgccggca agacctctct gctgatggtt atcatgggcg aactggagcc atccgagggc 1440
aagattaagc acagtggaag aatctccttt tgtagccagt tcagttggat tatgcccggc 1500
actattaagg agaatatcat ttttggggtg agctatgatg agtatcggta tcggagcgtt 1560
atcaaagcct gtcagctgga ggaggatatc agcaaattcg cagagaagga taatatcgtg 1620
ctgggggagg ggggaatcac cctgagcgga ggccagagag ccagaatctc actggcccgg 1680
gccgtctaca aggacgccga cctttacctt ctggacagtc cctttggata tctggatgtg 1740
ctgactgaaa aggagatctt cgagtcttgt gtgtgcaagc tgatggctaa caagacccgg 1800
atcctagtga ctagtaagat ggagcacctg aagaaggcag acaagatctt gattctgcac 1860
gagggatcct cttactttta cggcaccttt agcgagctgc agaacctcca gcccgatttc 1920
tcatctaagc tgatgggctg tgatagcttc gaccagttct ctgccgagcg cagaaacagc 1980
atcctgacag agacactgca ccggttttca ctggagggcg acgcccctgt cagctggacc 2040
gagaccaaaa agcagtcttt caagcagaca ggcgagttcg gcgagaagcg caaaaacagc 2100
atcctgaatc caatcaactc tataaggaag tttagcatcg tgcagaagac acccctccag 2160
atgaacggca tcgaagagga cagtgacgag cccctggagc ggcgcctgag cctcgtgcct 2220
gacagcgaac agggcgaggc catcctgcct aggatcagcg tgatttcaac cgggccaaca 2280
ctgcaggcta ggagaagaca gtcagtgctt aacctgatga cacatagcgt gaatcaggga 2340
cagaacatcc atcgaaaaac cacagcctct actcgcaaag tgtcactggc tcctcaggct 2400
aatctgacag agctggacat ctatagcagg aggctgagcc aggagacagg cctggagatc 2460
agtgaggaga tcaacgaaga ggacctgaag gagtgctttt tcgatgacat ggagagtatc 2520
cccgccgtca ccacctggaa tacctacctc cggtacatca cagtgcacaa gtccctcatc 2580
tttgtgctga tttggtgcct cgtgatcttt ctcgcagaag tggccgcctc cctggtggtg 2640
ctgtggctgt tggggaatac tccactgcag gacaaaggca attctacaca cagcaggaat 2700
aattcctatg ccgtgattat caccagcaca tcctcttact acgtgttcta catctacgtg 2760
ggagtggcag atactctgct tgcaatgggc ttcttcaggg ggctgcccct ggtgcacaca 2820
ctgatcacag tgtccaagat cctccaccat aaaatgctcc acagcgtgct gcaggcaccc 2880
atgagcaccc tgaacacact gaaggccggc ggcatcctga atcgcttttc caaagacatc 2940
gccatcctcg acgatctcct gccactgacc atcttcgatt ttatccagct gctgctgatc 3000
gtgatcgggg ccatcgccgt ggtggccgtg ctgcagccat acattttcgt ggctacagtg 3060
cccgtgatcg ttgcctttat catgctgaga gcctacttcc tgcagacttc tcagcagctg 3120
aagcagctgg agagcgaagg gagaagcccc atcttcactc acctggtgac aagcctgaag 3180
ggactctgga ccctgagagc cttcggccgg cagccctatt tcgagaccct gtttcacaag 3240
gccctcaacc tgcacacagc caactggttc ctctacctgt ccaccctgag gtggttccag 3300
atgaggattg aaatgatctt cgtgattttt ttcatcgccg tgacattcat tagcattctg 3360
accaccggcg agggggaggg gagagtgggc atcatcctga cccttgccat gaacattatg 3420
tccacactgc agtgggccgt gaatagttca atcgacgtgg acagtctgat gaggtccgtg 3480
agccgggtgt tcaagttcat tgacatgccc acagagggga aacccaccaa aagcaccaag 3540
ccctacaaga acgggcagct gtccaaggtt atgatcatcg agaactctca cgtgaagaag 3600
gacgacattt ggcccagcgg cggccagatg acagtgaaag atctgaccgc caaatacacc 3660
gagggaggca acgccatcct cgaaaacatt agcttctcta tcagccctgg acagagggtg 3720
ggcctgctgg gccggacagg ctcagggaag agtactctgc tgtcagcatt cctgaggctc 3780
ctgaacacag agggcgagat ccagattgac ggcgtgtcct gggactccat caccctgcag 3840
cagtggcgga aggctttcgg ggtgatcccc cagaaggtgt tcatctttag cggcactttc 3900
agaaagaatc tggaccctta tgagcagtgg agtgaccagg agatctggaa agtggccgat 3960
gaggtcggac tgaggagcgt gatcgagcag tttccaggga agctggactt tgtgctggtg 4020
gatggcggat gcgtgctgtc tcacggccat aaacagctga tgtgtctggc ccggtccgtg 4080
ctgtctaagg ccaagatcct gctgctggac gaaccctccg cccacctgga ccccgtgaca 4140
taccagatca tcaggagaac tctcaagcag gccttcgccg actgtaccgt gattctgtgc 4200
gagcaccgca ttgaagctat gctggagtgt cagcagttcc tggtgatcga ggaaaataag 4260
gtgaggcagt acgacagcat ccagaagctg ctgaacgagc gctccctgtt ccgccaggct 4320
atctccccat cagaccgggt gaaactcttc ccccacagaa actcctcaaa gtgcaagtcc 4380
aagccccaga tcgccgccct gaaggaggag accgaggagg aggtgcagga caccaggctg 4440
tga 4443
<210> 6
<211> 4443
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic polynucleotide
<400> 6
atgcagcgtt ctcccctgga gaaggcttct gtggtgagta aacttttttt ctcctggacc 60
agacctatcc tgaggaaagg ctacaggcag agactggagc tctctgacat ataccagata 120
ccttcagtcg atagcgccga caacctgagc gagaagctgg aacgcgagtg ggacagagag 180
ctggcaagca agaagaaccc aaagctgatt aatgccctga gaaggtgttt cttctggaga 240
ttcatgttct acggaatctt tctgtatctg ggggaggtta caaaggctgt gcaacccctg 300
ctgctcggca gaatcatcgc ctcatacgat ccagacaaca aggaagaaag aagcatcgcc 360
atctacctgg gcattggcct ctgcctcctg tttattgtgc ggactctgct gctgcaccca 420
gcaattttcg ggttgcatca tattggcatg cagatgcgca ttgctatgtt ttccctcatc 480
tacaaaaaga cactgaaact cagctcccgg gtgctggaca agatctccat cggccaactg 540
gtgtctctcc tgagcaataa cttgaataag ttcgacgaag ggctggccct ggcacacttc 600
gtgtggattg cccccctgca ggtggccctg ctgatgggac tgatttggga actgctgcag 660
gctagcgctt tctgcggcct ggggttcctg atcgtgctgg cactgtttca ggcaggcctg 720
ggccgtatga tgatgaagta cagagaccag agggccggga agatctccga acggctcgtt 780
attacctctg agatgatcga gaacattcag tctgtgaaag cctactgctg ggaggaggct 840
atggagaaga tgatcgagaa tctgagacag accgagctga agctgaccag aaaggccgcc 900
tacgtgaggt acttcaacag cagtgccttc ttcttctctg gcttcttcgt tgtgtttctg 960
agcgtgctgc catacgctct catcaaaggc atcatcctgc ggaagatctt caccaccatc 1020
agcttttgca tcgtgcttag aatggccgtg acccggcagt tcccatgggc cgtgcaaact 1080
tggtatgatt ccctgggcgc catcaacaaa atccaggatt tcctgcagaa gcaggaatac 1140
aagacactcg aatataatct cacaactact gaggtggtta tggagaacgt gactgccttc 1200
tgggaggagg ggttcggaga gctttttgag aaggcaaaac agaataacaa caaccgcaaa 1260
accagcaacg gcgacgacag cctgttcttc tccaattttt ctctcctggg aacacccgtc 1320
ctcaaagaca tcaactttaa gatcgagagg ggccagctgc tcgccgtcgc cggatccaca 1380
ggcgccggca agacctctct gctgatggtt atcatgggcg aactggagcc ctccgagggc 1440
aagattaagc actcaggaag aatctccttt tgtagccagt tcagttggat tatgcccggc 1500
actattaagg agaatatcat ttttggggtg agctatgatg agtatcggta tcggagcgtt 1560
atcaaagcct gtcagctgga ggaggatatc agcaagttcg cagagaagga taatattgtg 1620
ctgggagagg gaggaatcac cctgagcgga ggccagagag ccagaatctc actggcccgg 1680
gccgtctaca aggacgccga cctttacctt ctggacagtc cctttggata tctggatgtg 1740
ctgactgaaa aggagatctt cgagtcttgt gtgtgcaagc tgatggctaa caagacccgg 1800
atcctagtga ctagtaagat ggagcacctg aagaaggcag acaagatctt gattctgcac 1860
gagggatcct cttactttta cggcaccttt agcgagctgc agaacctcca gcccgatttc 1920
tcatctaagc tgatgggctg tgatagcttc gaccagttct ctgccgagcg cagaaacagc 1980
atcctgacag agacactgca ccggttttca ctggagggcg acgcccctgt cagctggacc 2040
gagaccaaaa agcagtcttt caagcagaca ggcgagttcg gcgagaagcg caaaaacagc 2100
atcctgaatc caatcaactc tataaggaag tttagcatcg tgcagaagac acccctccag 2160
atgaacggca tcgaagagga cagtgacgag cccctggagc ggcgcctgag cctcgtgcct 2220
gacagcgaac agggcgaggc catcctgcct aggatcagcg tgatttcaac cgggccaaca 2280
ctgcaggcta ggagaagaca gtcagtgctt aacctgatga cacatagcgt gaatcaggga 2340
cagaacatcc atcgaaaaac cacagcctct actcgcaaag tgtcactggc tcctcaggct 2400
aatctgacag agctggacat ctatagcagg aggctgagcc aggagacagg cctggagatc 2460
agtgaggaga tcaacgaaga ggacctgaag gagtgctttt tcgatgacat ggagagtatc 2520
cccgccgtca ccacctggaa tacctacctc cggtacatca cagtgcacaa gtccctcatc 2580
tttgtgctga tttggtgcct cgtgatcttt ctcgcagaag tggccgcctc cctggtggtg 2640
ctgtggctgt tggggaatac tccactgcag gacaaaggca attctacaca cagcaggaat 2700
aattcctatg ccgtgattat caccagcaca tcctcttact acgtgttcta catctacgtg 2760
ggagtggcag atactctgct tgcaatgggc ttcttcaggg ggctgcccct ggtgcacaca 2820
ctgatcacag tgtccaagat cctccaccat aaaatgctcc acagcgtgct gcaggcaccc 2880
atgagcaccc tgaacacact gaaggccggc ggcatcctga atcgcttttc caaagacatc 2940
gccatcctcg acgatctcct gccactgacc atcttcgatt ttatccagct gctgctgatc 3000
gtgatcgggg ccatcgccgt ggtggccgtg ctgcagccat acattttcgt ggctacagtg 3060
cccgtgatcg ttgcctttat catgctgaga gcctacttcc tgcagacttc tcagcagctg 3120
aagcagctgg agagcgaagg gagaagcccc atcttcactc acctggtgac aagcctgaag 3180
ggactctgga ccctgagagc cttcggccgg cagccctatt tcgagaccct gtttcacaag 3240
gccctcaacc tgcacacagc caactggttc ctctacctgt ccaccctgag gtggttccag 3300
atgaggattg aaatgatctt cgtgattttt ttcatcgccg tgacattcat tagcattctg 3360
accaccggcg agggggaggg gagagtgggc atcatcctga cccttgccat gaacattatg 3420
tccacactgc agtgggccgt gaatagttca atcgacgtgg acagtctgat gaggtccgtg 3480
agccgggtgt tcaagttcat tgacatgccc acagagggga aacccaccaa aagcaccaag 3540
ccctacaaga acgggcagct gtccaaggtt atgatcatcg agaactctca cgtgaagaag 3600
gacgacattt ggcccagcgg cggccagatg acagtgaaag atctgaccgc caaatacacc 3660
gagggaggca acgccatcct cgaaaacatt agcttctcta tcagccctgg acagagggtg 3720
ggcctgctgg gccggacagg ctcagggaag agtactctgc tgtcagcatt cctgaggctc 3780
ctgaacacag agggcgagat ccagattgac ggcgtgtcct gggactccat caccctgcag 3840
cagtggcgga aggctttcgg ggtgatcccc cagaaggtgt tcatctttag cggcactttc 3900
agaaagaatc tggaccctta tgagcagtgg agtgaccagg agatctggaa agtggccgat 3960
gaggtcggac tgaggagcgt gatcgagcag tttccaggga agctggactt tgtgctggtg 4020
gatggcggat gcgtgctgtc tcacggccat aaacagctga tgtgtctggc ccggtccgtg 4080
ctgtctaagg ccaagatcct gctgctggac gaaccctccg cccacctgga ccccgtgaca 4140
taccagatca tcaggagaac tctcaagcag gccttcgccg actgtaccgt gattctgtgc 4200
gagcaccgca ttgaagctat gctggagtgt cagcagttcc tggtgatcga ggaaaataag 4260
gtgaggcagt acgacagcat ccagaagctg ctgaacgagc gctccctgtt ccgccaggct 4320
atctccccat cagaccgggt gaaactcttc ccccacagaa actcctcaaa gtgcaagtcc 4380
aagccccaga tcgccgccct gaaggaggag accgaggagg aggtgcagga caccaggctg 4440
tga 4443
<210> 7
<211> 4443
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic polynucleotide
<400> 7
atgcagcgtt ctcccctgga gaaggcttct gtggtgagta aacttttttt ctcctggacc 60
agacctatcc tgaggaaagg ctacaggcag agactggagc tctctgacat ataccagata 120
ccttcagtcg atagcgccga caacctgagc gagaagctgg aacgcgagtg ggacagagag 180
ctggcaagca agaagaaccc aaagctgatt aatgccctga gaaggtgttt cttctggaga 240
ttcatgttct acggaatctt tctgtatctg ggggaggtta caaaggctgt gcaacccctg 300
ctgctcggca gaatcatcgc ctcatacgat ccagacaaca aggaagaaag aagcatcgcc 360
atctacctgg gcattggcct ctgcctcctg tttattgtgc ggactctgct gctgcaccca 420
gcaattttcg ggttgcatca tattggcatg cagatgcgca ttgctatgtt ttccctcatc 480
tacaaaaaga cactgaaact cagctcccgg gtgctggaca agatctccat cggccaactg 540
gtgtctctcc tgagcaataa cttgaataag ttcgacgaag ggctggccct ggcacacttc 600
gtgtggattg cccccctgca ggtggccctg ctgatgggac tgatttggga actgctgcag 660
gctagcgctt tctgcggcct ggggttcctg atcgtgctgg cactgtttca ggcaggcctg 720
ggccgtatga tgatgaagta cagagaccag agggccggga agatctccga acggctcgtt 780
attacctctg agatgatcga gaacattcag tctgtgaaag cctactgctg ggaggaggct 840
atggagaaga tgatcgagaa tctgagacag accgagctga agctgaccag aaaggccgcc 900
tacgtgaggt acttcaacag cagtgccttc ttcttctctg ggttcttcgt tgtgtttctg 960
agcgtgctgc catacgctct catcaaaggc atcatcctgc ggaagatctt caccaccatc 1020
agcttttgca tcgtgcttag aatggccgtg acacggcagt tcccatgggc cgttcaaact 1080
tggtatgatt ccctgggcgc catcaacaaa atccaggatt tcctgcagaa gcaggaatac 1140
aagacactcg aatataacct cacaactact gaggtggtta tggagaacgt gactgccttc 1200
tgggaggagg ggttcggaga gctttttgag aaggccaaac agaataataa taaccgcaaa 1260
accagcaacg gcgacgacag cctgttcttc tccaattttt ctctcctggg aacacccgtc 1320
ctcaaagaca tcaactttaa gatcgagagg ggccagctgc tcgccgtcgc cggatccaca 1380
ggcgccggca agacctctct gctgatggtt atcatgggcg aactggagcc ctccgagggc 1440
aagattaagc actcaggaag aatctccttt tgtagccagt tcagttggat tatgcccggc 1500
actattaagg agaatatcat ttttggggtg agctatgatg agtatcggta tcggagcgtt 1560
atcaaagcct gtcagctgga ggaggatatc agcaagttcg cagagaagga taatattgtg 1620
ctgggagagg gaggaatcac cctgagcgga ggccagagag ccagaatctc actggcccgg 1680
gccgtctaca aggacgccga cctttacctt ctggacagtc cctttggata tctggatgtg 1740
ctgactgaaa aggagatctt cgagtcttgt gtgtgcaagc tgatggctaa caagacccgg 1800
atcctagtga ctagtaagat ggagcacctg aagaaggcag acaagatctt gattctgcac 1860
gagggatcct cttactttta cggcaccttt agcgagctgc agaacctcca gcccgatttc 1920
tcatctaagc tgatgggctg tgatagcttc gaccagttct ctgccgagcg cagaaacagc 1980
atcctgacag agacactgca ccggttttca ctggagggcg acgcccctgt cagctggacc 2040
gagaccaaaa agcagtcttt caagcagaca ggcgagttcg gcgagaagcg caaaaacagc 2100
atcctgaatc caatcaactc tataaggaag tttagcatcg tgcagaagac acccctccag 2160
atgaacggca tcgaagagga cagtgacgag cccctggagc ggcgcctgag cctcgtgcct 2220
gacagcgaac agggcgaggc catcctgcct aggatcagcg tgatttcaac cgggccaaca 2280
ctgcaggcta ggagaagaca gtcagtgctt aacctgatga cacatagcgt gaatcaggga 2340
cagaacatcc atcgaaaaac cacagcctct actcgcaaag tgtcactggc tcctcaggct 2400
aatctgacag agctggacat ctatagcagg aggctgagcc aggagacagg cctggagatc 2460
agtgaggaga tcaacgaaga ggacctgaag gagtgctttt tcgatgacat ggagagtatc 2520
cccgccgtca ccacctggaa tacctacctc cggtacatca cagtgcacaa gtccctcatc 2580
tttgtgctga tttggtgcct cgtgatcttt ctcgcagaag tggccgcctc cctggtggtg 2640
ctgtggctgt tggggaatac tccactgcag gacaaaggca attctacaca cagcaggaat 2700
aattcctatg ccgtgattat caccagcaca tcctcttact acgtgttcta catctacgtg 2760
ggagtggcag atactctgct tgcaatgggc ttcttcaggg ggctgcccct ggtgcacaca 2820
ctgatcacag tgtccaagat cctccaccat aaaatgctcc acagcgtgct gcaggcaccc 2880
atgagcaccc tgaacacact gaaggccggc ggcatcctga atcgcttttc caaagacatc 2940
gccatcctcg acgatctcct gccactgacc atcttcgatt ttatccagct gctgctgatc 3000
gtgatcgggg ccatcgccgt ggtggccgtg ctgcagccat acattttcgt ggctacagtg 3060
cccgtgatcg ttgcctttat catgctgaga gcctacttcc tgcagacttc tcagcagctg 3120
aagcagctgg agagcgaagg gagaagcccc atcttcactc acctggtgac aagcctgaag 3180
ggactctgga ccctgagagc cttcggccgg cagccctatt tcgagaccct gtttcacaag 3240
gccctcaacc tgcacacagc caactggttc ctctacctgt ccaccctgag gtggttccag 3300
atgaggattg aaatgatctt cgtgattttt ttcatcgccg tgacattcat tagcattctg 3360
accaccggcg agggggaggg gagagtgggc atcatcctga cccttgccat gaacattatg 3420
agcacactgc agtgggccgt gaatagtagt atcgacgtgg acagtctgat gaggtccgtg 3480
agccgggtgt tcaagttcat tgacatgccc acagaaggga aacccaccaa aagcaccaag 3540
ccctacaaga acgggcagct gtccaaggtt atgatcatcg agaactctca cgtgaagaag 3600
gacgacattt ggcccagcgg cggccagatg acagtgaaag atctgaccgc caaatacacc 3660
gagggaggca acgccatcct cgaaaacatt agcttctcta tcagccctgg acagagggtg 3720
ggcctgctgg gccggacagg ctcagggaag agtactctgc tgtcagcatt cctgaggctc 3780
ctgaacacag agggcgagat ccagattgac ggcgtgtcct gggactccat caccctgcag 3840
cagtggcgga aggctttcgg ggtgatcccc cagaaggtgt tcatctttag cggcactttc 3900
agaaagaatc tggaccctta tgagcagtgg agtgaccagg agatctggaa agtggccgat 3960
gaggtcggac tgaggagcgt gatcgagcag tttccaggga agctggactt tgtgctggtg 4020
gatggcggat gcgtgctgtc tcacggccat aaacagctga tgtgtctggc ccggtccgtg 4080
ctgtctaagg ccaagatcct gctgctggac gaaccctccg cccacctgga ccccgtgaca 4140
taccagatca tcaggagaac tctcaagcag gccttcgccg actgtaccgt gattctgtgc 4200
gagcaccgca ttgaagctat gctggagtgt cagcagttcc tggtgatcga ggaaaataag 4260
gtgaggcagt acgacagcat ccagaagctg ctgaacgagc gctccctgtt ccgccaggct 4320
atctccccat cagaccgggt gaagctcttt ccccacagaa actcctcaaa gtgcaagtcc 4380
aagccccaga tcgccgccct gaaggaggag accgaggagg aggtgcagga caccaggctg 4440
tga 4443
<210> 8
<211> 4443
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic polynucleotide
<400> 8
atgcagcgtt ctcccctgga gaaggcttct gtggtgagta aacttttttt ctcctggacc 60
agacctatcc tgaggaaagg ctacaggcag agactggagc tctctgacat ataccagata 120
ccttcagtcg atagcgccga caacctgagc gagaagctgg aacgcgagtg ggacagagag 180
ctggcaagca agaagaaccc aaagctgatt aatgccctga gaaggtgttt cttctggaga 240
ttcatgttct acggaatctt tctgtatctg ggggaggtta caaaggctgt gcaacccctg 300
ctgctcggca gaatcatcgc ctcatacgat ccagacaata aggaagagag atctatcgcc 360
atctacctgg gaattggcct gtgtctgctg ttcatcgtgc gcaccctgct cctccaccca 420
gccatttttg ggctgcatca catcggaatg cagatgagga ttgctatgtt ttccctgatc 480
tataagaaga ccctgaaact ctcaagcaga gtgctggaca aaatttccat tggccagctg 540
gtgtctctgc tgtccaataa tctcaataag tttgacgagg gcctggccct ggcacacttc 600
gtctggattg cccctctcca ggtcgctctg ctgatgggcc tgatctggga gctgctgcag 660
gcatccgctt tctgcggcct ggggttcctg atcgtgctgg cactgtttca ggcaggcctg 720
ggccgtatga tgatgaagta cagagaccag agggccggga agatctccga acggctcgtt 780
attacctctg agatgatcga gaacattcag tctgtgaaag cctactgctg ggaggaggct 840
atggagaaga tgatcgagaa tctgagacag accgagctga agctgaccag aaaggccgcc 900
tacgtgaggt acttcaacag cagtgccttc ttcttctctg gcttcttcgt tgtgtttctg 960
agcgtgctgc catacgctct catcaaaggc atcatcctga ggaagatctt cactacaatc 1020
tccttctgca tcgtactcag aatggccgtg acccgccagt ttccctgggc cgtgcagaca 1080
tggtacgact ccctcggcgc cattaataag atccaggatt ttctgcagaa acaggaatac 1140
aagacactgg aatacaacct gacaacaaca gaggtggtca tggaaaacgt gaccgcattt 1200
tgggaggaag gcttcggaga gctctttgaa aaagctaagc agaacaacaa taacaggaaa 1260
acctctaatg gggacgacag cctgtttttc agcaattttt ctctgctggg gacacctgtg 1320
ctgaaggaca ttaactttaa gatcgagagg ggacagctgc tcgcagtcgc cggatccaca 1380
ggcgccggca agacctctct gctgatggtt atcatgggcg aactggagcc atccgagggc 1440
aagattaagc acagtggaag aatctccttt tgtagccagt tcagttggat tatgcccggc 1500
actattaagg agaatatcat ttttggggtg agctatgatg agtatcggta tcggagcgtt 1560
atcaaagcct gtcagctgga ggaggatatc agcaaattcg cagagaagga taatatcgtg 1620
ctgggggagg ggggaatcac cctgagcgga ggccagagag ccagaatcag cctggcaagg 1680
gcagtgtata aagacgctga cctgtacttg ctggactccc cttttggcta cctggacgtg 1740
ctgaccgaaa aggaaatctt tgagtcctgc gtctgcaagc tgatggcaaa caagaccaga 1800
atcctggtga cctccaagat ggaacatctg aagaaggcag ataaaatcct catcctgcat 1860
gagggatctt cttactttta tggaactttt agcgagctgc agaacctgca gccagacttc 1920
tccagcaagc tgatgggatg cgactccttt gaccagttct ccgccgaacg gcgcaattct 1980
atcctgaccg aaaccctgca ccggttttca ctggagggcg acgcccctgt cagctggacc 2040
gagaccaaaa agcagtcttt caagcagaca ggcgagttcg gcgagaagcg caaaaacagc 2100
atcctgaatc caatcaactc tataaggaag tttagcatcg tgcagaagac acccctccag 2160
atgaacggca tcgaagagga cagtgacgag cccctggagc ggcgcctgag cctcgtgcct 2220
gacagcgaac agggcgaggc catcctgcct aggatcagcg tgatttcaac cgggccaaca 2280
ctgcaggcta ggagaagaca gtcagtgctt aacctgatga cacatagcgt gaatcaggga 2340
cagaacatcc atcgaaaaac cacagcctct actcgcaaag tgtcactggc tcctcaggct 2400
aatctgacag agctggacat ctatagcagg aggctgagcc aggagacagg cctggagatc 2460
agtgaggaga tcaacgaaga ggacctgaag gagtgctttt tcgatgacat ggagagtatc 2520
cccgccgtca ccacctggaa tacctacctc cggtacatca cagtgcacaa gtccctcatc 2580
tttgtgctga tttggtgcct cgtgatcttt ctcgcagaag tggccgcctc cctggtggtg 2640
ctgtggctgt tggggaatac tccactgcag gacaaaggca attctacaca cagcaggaat 2700
aattcctatg ccgtgattat caccagcaca tcctcttact acgtgttcta catctacgtg 2760
ggagtggcag atactctgct tgcaatgggc ttcttcaggg ggctgcccct ggtgcacaca 2820
ctgatcacag tgtccaagat cctccaccat aaaatgctcc acagcgtgct gcaggcaccc 2880
atgagcaccc tgaacacact gaaggccggc ggcatcctga atcgcttttc caaagacatc 2940
gccatcctcg acgatctcct gccactgacc atcttcgatt ttatccagct gctgctgatc 3000
gtgatcgggg ccatcgccgt ggtggccgtg ctgcagccat acattttcgt ggctacagtg 3060
cccgtgatcg ttgcctttat catgctgaga gcctacttcc tgcagacttc tcagcagctg 3120
aagcagctgg agagcgaagg gagaagcccc atcttcactc acctggtgac aagcctgaag 3180
ggactctgga ccctgagagc cttcggccgg cagccctatt tcgagaccct gtttcacaag 3240
gccctcaacc tgcacacagc caactggttt ctctacctgt ccaccctgag gtggttccag 3300
atgaggattg aaatgatctt cgtgattttt ttcatcgccg tgacattcat tagcattctg 3360
accaccggcg agggggaggg gagagtgggc atcatcctga cccttgccat gaacattatg 3420
tccacactgc agtgggccgt gaatagttca atcgacgtgg acagtctgat gaggtccgtg 3480
agccgggtgt tcaagttcat tgacatgccc acagagggga aacccaccaa aagcaccaag 3540
ccctacaaga acgggcagct gtccaaggtt atgatcatcg agaactctca cgtgaagaag 3600
gacgacattt ggcccagcgg cggccagatg acagtgaaag atctgaccgc caaatacacc 3660
gagggaggca acgccatcct cgaaaacatt agcttctcta tcagccctgg acagagggtg 3720
ggcctgctgg gccggacagg ctcagggaag agtactctgc tgtcagcatt cctgaggctc 3780
ctgaacacag agggcgagat ccagattgac ggcgtgtcct gggactccat caccctgcag 3840
cagtggcgga aggctttcgg ggtgatcccc cagaaggtgt tcatctttag cggcactttc 3900
agaaagaatc tggaccctta tgagcagtgg agtgaccagg agatctggaa agtggccgat 3960
gaggtcggac tgaggagcgt gatcgagcag tttccaggga agctggactt tgtgctggtg 4020
gatggcggat gcgtgctgtc tcacggccat aaacagctga tgtgtctggc ccggtccgtg 4080
ctgtctaagg ccaagatcct gctgctggac gaaccctccg cccacctgga ccccgtgaca 4140
taccagatca tcaggagaac tctcaagcag gccttcgccg actgtaccgt gattctgtgc 4200
gagcaccgca ttgaagctat gctggagtgt cagcagttcc tggtgatcga ggaaaataag 4260
gtgaggcagt acgacagcat ccagaagctg ctgaacgagc gctccctgtt ccgccaggct 4320
atctccccat cagaccgggt gaaactcttc ccccacagaa actcctcaaa gtgcaagtcc 4380
aagccccaga tcgccgccct gaaggaggag accgaggagg aggtgcagga caccaggctg 4440
tga 4443
<210> 9
<211> 4443
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic polynucleotide
<400> 9
atgcagcgct cgcctctgga aaaggcgagc gtcgtgtcaa agctattctt ttcttggacc 60
cggcccattc tcaggaaggg ctacaggcag aggctggagt tgagcgacat ctatcagatt 120
ccttccgtgg acagcgccga caacctgagc gagaagctgg aaagggagtg ggaccgcgaa 180
ctggcaagca aaaagaaccc caagctgatc aatgccctga gaaggtgttt cttttggaga 240
ttcatgttct acgggatctt tctgtatctg ggcgaggtta caaaggctgt gcagcccctg 300
ctgctcggca gaatcatcgc ctcatacgat ccagacaaca aggaagaaag aagcatcgcc 360
atctacctgg gcattggcct ctgcctcctg tttattgtgc ggactctgct gctgcaccca 420
gcaattttcg ggttgcatca tattggcatg cagatgcgca ttgctatgtt ttccctcatc 480
tacaaaaaga cactgaaact cagctcccgg gtgctggaca agatctccat cggccaactg 540
gtgtctctcc tgagcaataa cttgaataag ttcgacgaag ggctggccct ggcacacttc 600
gtgtggattg cccccctgca ggtggccctg ctgatgggac tgatttggga actgctgcag 660
gctagcgctt tctgcggcct ggggttcctg atcgtgctgg cactgtttca ggcaggcctg 720
ggccgtatga tgatgaagta cagagaccag agggccggga agatctccga acggctcgtt 780
attacctctg agatgatcga gaacattcag tctgtgaaag cctactgctg ggaggaggct 840
atggagaaga tgatcgagaa tctgagacag accgagctga agctgaccag aaaggccgcc 900
tacgtgaggt acttcaacag cagtgccttc ttcttctctg ggttcttcgt tgtgtttctg 960
agcgtgctgc catacgctct catcaaaggc atcatcctgc ggaagatctt caccaccatc 1020
agcttttgca tcgtgcttag aatggccgtg acacggcagt tcccatgggc cgttcaaact 1080
tggtatgatt ccctgggcgc catcaacaaa atccaggatt tcctgcagaa gcaggaatac 1140
aagacactcg aatataacct cacaactact gaggtggtta tggagaacgt gactgccttc 1200
tgggaggagg ggttcggaga gctttttgag aaggccaaac agaataataa taaccgcaaa 1260
accagcaacg gcgacgacag cctgttcttc tccaattttt ctctcctggg aacacccgtc 1320
ctcaaagaca tcaactttaa gatcgagagg ggccagctgc tcgccgtcgc cggatccaca 1380
ggcgccggca agacctctct gctgatggtt atcatgggcg aactggagcc ctccgagggc 1440
aagattaagc actcaggaag aatctccttt tgtagccagt tcagttggat tatgcccggc 1500
actattaagg agaatatcat ttttggggtg agctatgatg agtatcggta tcggagcgtt 1560
atcaaagcct gtcagctgga ggaggatatc agcaagttcg cagagaagga taatattgtg 1620
ctgggagagg gaggaatcac cctgagcgga ggccagagag ccagaatctc actggcccgg 1680
gccgtctaca aggacgccga cctttacctt ctggacagtc cctttggata tctggatgtg 1740
ctgactgaaa aggagatctt cgagtcttgt gtgtgcaagc tgatggctaa caagacccgg 1800
atcctagtga ctagtaagat ggagcacctg aagaaggcag acaagatctt gattctgcac 1860
gagggatcct cttactttta cggcaccttt agcgagctgc agaacctcca gcccgatttc 1920
tcatctaagc tgatgggctg tgatagcttc gaccagttct ctgccgagcg cagaaacagc 1980
atcctgacag agacactgca ccggttttca ctggagggcg acgcccctgt cagctggacc 2040
gagaccaaaa agcagtcttt caagcagaca ggcgagttcg gcgagaagcg caaaaacagc 2100
atcctgaatc caatcaactc tataaggaag tttagcatcg tgcagaagac acccctccag 2160
atgaacggca tcgaagagga cagtgacgag cccctggagc ggcgcctgag cctcgtgcct 2220
gacagcgaac agggcgaggc catcctgcct aggatcagcg tgatttcaac cgggccaaca 2280
ctgcaggcta ggagaagaca gtcagtgctt aacctgatga cacatagcgt gaatcaggga 2340
cagaacatcc atcgaaaaac cacagcctct actcgcaaag tgtcactggc tcctcaggct 2400
aatctgacag agctggacat ctatagcagg aggctgagcc aggagacagg cctggagatc 2460
agtgaggaga tcaacgaaga ggacctgaag gagtgctttt tcgatgacat ggagagtatc 2520
cccgccgtca ccacctggaa tacctacctc cggtacatca cagtgcacaa gtccctcatc 2580
tttgtgctga tttggtgcct cgtgatcttt ctcgcagaag tggccgcctc cctggtggtg 2640
ctgtggctgt tggggaatac tccactgcag gacaaaggca attctacaca cagcaggaat 2700
aattcctatg ccgtgattat caccagcaca tcctcttact acgtgttcta catctacgtg 2760
ggagtggcag atactctgct tgcaatgggc ttcttcaggg ggctgcccct ggtgcacaca 2820
ctgatcacag tgtccaagat cctccaccat aaaatgctcc acagcgtgct gcaggcaccc 2880
atgagcaccc tgaacacact gaaggccggc ggcatcctga atcgcttttc caaagacatc 2940
gccatcctcg acgatctcct gccactgacc atcttcgatt ttatccagct gctgctgatc 3000
gtgatcgggg ccatcgccgt ggtggccgtg ctgcagccat acattttcgt ggctacagtg 3060
cccgtgatcg ttgcctttat catgctgaga gcctacttcc tgcagacttc tcagcagctg 3120
aagcagctgg agagcgaagg gagaagcccc atcttcactc acctggtgac aagcctgaag 3180
ggactctgga ccctgagagc cttcggccgg cagccctatt tcgagaccct gtttcacaag 3240
gccctcaacc tgcacacagc caactggttc ctctacctgt ccaccctgag gtggttccag 3300
atgaggattg aaatgatctt cgtgattttt ttcatcgccg tgacattcat tagcattctg 3360
accaccggcg agggggaggg gagagtgggc atcatcctga cccttgccat gaacattatg 3420
agcacactgc agtgggccgt gaatagtagt atcgacgtgg acagtctgat gaggtccgtg 3480
agccgggtgt tcaagttcat tgacatgccc acagaaggga aacccaccaa aagcaccaag 3540
ccctacaaga acgggcagct gtccaaggtt atgatcatcg agaactctca cgtgaagaag 3600
gacgacattt ggcccagcgg cggccagatg acagtgaaag atctgaccgc caaatacacc 3660
gagggaggca acgccatcct cgaaaacatt agcttctcta tcagccctgg acagagggtg 3720
ggcctgctgg gccggacagg ctcagggaag agtactctgc tgtcagcatt cctgaggctc 3780
ctgaacacag agggcgagat ccagattgac ggcgtgtcct gggactccat caccctgcag 3840
cagtggcgga aggctttcgg ggtgatcccc cagaaggtgt tcatctttag cggcactttc 3900
agaaagaatc tggaccctta tgagcagtgg agtgaccagg agatctggaa agtggccgat 3960
gaggtcggac tgaggagcgt gatcgagcag tttccaggga agctggactt tgtgctggtg 4020
gatggcggat gcgtgctgtc tcacggccat aaacagctga tgtgtctggc ccggtccgtg 4080
ctgtctaagg ccaagatcct gctgctggac gaaccctccg cccacctgga ccccgtgaca 4140
taccagatca tcaggagaac tctcaagcag gccttcgccg actgtaccgt gattctgtgc 4200
gagcaccgca ttgaagctat gctggagtgt cagcagttcc tggtgatcga ggaaaataag 4260
gtgaggcagt acgacagcat ccagaagctg ctgaacgagc gctccctgtt ccgccaggct 4320
atctccccat cagaccgggt gaaactcttc ccccacagaa actcctcaaa gtgcaagtcc 4380
aagccccaga tcgccgccct gaaggaggag accgaggagg aggtgcagga caccaggctg 4440
tga 4443
<210> 10
<211> 4443
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic polynucleotide
<400> 10
atgcagcgtt ctcccctgga gaaggcttct gtggtgagta aacttttttt ctcctggacc 60
agacctatcc tgaggaaagg ctacaggcag agactggagc tctctgacat ataccagata 120
ccttcagtcg atagcgccga caacctgagc gagaagctgg aacgcgagtg ggacagagag 180
ctggcaagca agaagaaccc aaagctgatt aatgccctga gaaggtgttt cttctggaga 240
ttcatgttct acggaatctt tctgtatctg ggggaggtta caaaggctgt gcaacccctg 300
ctgctcggca gaatcatcgc ctcatacgat ccagacaata aggaagagag atctatcgcc 360
atctacctgg gaattggcct gtgtctgctg ttcatcgtgc gcaccctgct cctccaccca 420
gccatttttg ggctgcatca catcggaatg cagatgagga ttgctatgtt ttccctgatc 480
tataagaaga ccctgaaact ctcaagcaga gtgctggaca aaatttccat tggccagctg 540
gtgtctctgc tgtccaataa tctcaataag tttgacgagg gcctggccct ggcacacttc 600
gtctggattg cccctctcca ggtcgctctg ctgatgggcc tgatctggga gctgctgcag 660
gcatccgctt tctgcggcct ggggttcctg atcgtgctgg cactgtttca ggcaggcctg 720
ggccgtatga tgatgaagta cagagaccag agggccggga agatctccga acggctcgtt 780
attacctctg agatgatcga gaacattcag tctgtgaaag cctactgctg ggaggaggct 840
atggagaaga tgatcgagaa tctgagacag accgagctga agctgaccag aaaggccgcc 900
tacgtgaggt acttcaacag cagtgccttc ttcttctctg ggttcttcgt tgtgtttctg 960
agcgtgctgc catacgctct catcaaaggc atcatcctga ggaagatctt cactacaatc 1020
tccttctgca tcgtactcag aatggccgtg acccgccagt ttccctgggc cgtgcagaca 1080
tggtacgact ccctcggcgc cattaataag atccaggatt ttctgcagaa acaggaatac 1140
aagacactgg aatacaacct gacaacaaca gaggtggtca tggaaaacgt gaccgcattt 1200
tgggaggaag gcttcggaga gctctttgaa aaagctaagc agaacaacaa taacaggaaa 1260
acctctaatg gggacgacag cctgtttttc agcaattttt ctctgctggg gacacctgtg 1320
ctgaaggaca ttaactttaa gatcgagagg ggacagctgc tcgcagtcgc cggatccaca 1380
ggcgccggca agacctctct gctgatggtt atcatgggcg aactggagcc atccgagggc 1440
aagattaagc acagtggaag aatctccttt tgtagccagt tcagttggat tatgcccggc 1500
actattaagg agaatatcat ttttggggtg agctatgatg agtatcggta tcggagcgtt 1560
atcaaagcct gtcagctgga ggaggatatc agcaaattcg cagagaagga taatatcgtg 1620
ctgggggagg ggggaatcac cctgagcgga ggccagagag ccagaatttc tctggccaga 1680
gccgtgtaca aagatgccga cctgtacctg ctggacagcc catttggcta tctggacgtg 1740
ctgaccgaaa aagagatttt cgagtcatgc gtttgtaagc tgatggccaa caagactcgc 1800
atcctggtga cttcgaagat ggaacatctg aagaaagctg ataagattct gatcctgcac 1860
gaaggcagct cctactttta cgggaccttc tccgagctcc agaacctgca gcctgatttc 1920
agctctaagc tgatgggctg cgatagcttt gaccagttta gcgcagaaag gcgcaactct 1980
attctgactg agacactgca ccggttttca ctggagggcg acgcccctgt cagctggacc 2040
gagaccaaaa agcagtcttt caagcagaca ggcgagttcg gcgagaagcg caaaaacagc 2100
atcctgaatc caatcaactc tataaggaag tttagcatcg tgcagaagac acccctccag 2160
atgaacggca tcgaagagga cagtgacgag cccctggagc ggcgcctgag cctcgtgcct 2220
gacagcgaac agggcgaggc catcctgcct aggatcagcg tgatttcaac cgggccaaca 2280
ctgcaggcta ggagaagaca gtcagtgctt aacctgatga cacatagcgt gaatcaggga 2340
cagaacatcc atcgaaaaac cacagcctct actcgcaaag tgtcactggc tcctcaggct 2400
aatctgacag agctggacat ctatagcagg aggctgagcc aggagacagg cctggagatc 2460
agtgaggaga tcaacgaaga ggacctgaag gagtgctttt tcgatgacat ggagagtatc 2520
cccgccgtca ccacctggaa tacctacctc cggtacatca cagtgcacaa gtccctcatc 2580
tttgtgctga tttggtgcct cgtgatcttt ctcgcagaag tggccgcctc cctggtggtg 2640
ctgtggctgt tggggaatac tccactgcag gacaaaggca attctacaca cagcaggaat 2700
aattcctatg ccgtgattat caccagcaca tcctcttact acgtgttcta catctacgtg 2760
ggagtggcag atactctgct tgcaatgggc ttcttcaggg ggctgcccct ggtgcacaca 2820
ctgatcacag tgtccaagat cctccaccat aaaatgctcc acagcgtgct gcaggcaccc 2880
atgagcaccc tgaacacact gaaggccggc ggcatcctga atcgcttttc caaagacatc 2940
gccatcctcg acgatctcct gccactgacc atcttcgatt ttatccagct gctgctgatc 3000
gtgatcgggg ccatcgccgt ggtggccgtg ctgcagccat acattttcgt ggctacagtg 3060
cccgtgatcg ttgcctttat catgctgaga gcctacttcc tgcagacttc tcagcagctg 3120
aagcagctgg agagcgaagg gagaagcccc atcttcactc acctggtgac aagcctgaag 3180
ggactctgga ccctgagagc cttcggccgg cagccctatt tcgagaccct gtttcacaag 3240
gccctcaacc tgcacacagc caactggttt ctctacctgt ccaccctgag gtggttccag 3300
atgaggattg aaatgatctt cgtgattttt ttcatcgccg tgacattcat tagcattctg 3360
accaccggcg agggggaggg gagagtgggc atcatcctga cccttgccat gaacattatg 3420
tccacactgc agtgggccgt gaatagttca atcgacgtgg acagtctgat gaggtccgtg 3480
agccgggtgt tcaagttcat tgacatgccc acagagggga aacccaccaa aagcaccaag 3540
ccctacaaga acgggcagct gtccaaggtt atgatcatcg agaactctca cgtgaagaag 3600
gacgacattt ggcccagcgg cggccagatg acagtgaaag atctgaccgc caaatacacc 3660
gagggaggca acgccatcct cgaaaacatt agcttctcta tcagccctgg acagagggtg 3720
ggcctgctgg gccggacagg ctcagggaag agtactctgc tgtcagcatt cctgaggctc 3780
ctgaacacag agggcgagat ccagattgac ggcgtgtcct gggactccat caccctgcag 3840
cagtggcgga aggctttcgg ggtgatcccc cagaaggtgt tcatctttag cggcactttc 3900
agaaagaatc tggaccctta tgagcagtgg agtgaccagg agatctggaa agtggccgat 3960
gaggtcggac tgaggagcgt gatcgagcag tttccaggga agctggactt tgtgctggtg 4020
gatggcggat gcgtgctgtc tcacggccat aaacagctga tgtgtctggc ccggtccgtg 4080
ctgtctaagg ccaagatcct gctgctggac gaaccctccg cccacctgga ccccgtgaca 4140
taccagatca tcaggagaac tctcaagcag gccttcgccg actgtaccgt gattctgtgc 4200
gagcaccgca ttgaagctat gctggagtgt cagcagttcc tggtgatcga ggaaaataag 4260
gtgaggcagt acgacagcat ccagaagctg ctgaacgagc gctccctgtt ccgccaggct 4320
atctccccat cagaccgggt gaaactcttc ccccacagaa actcctcaaa gtgcaagtcc 4380
aagccccaga tcgccgccct gaaggaggag accgaggagg aggtgcagga caccaggctg 4440
tga 4443
<210> 11
<211> 4443
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic polynucleotide
<400> 11
atgcagcgct cgcctctgga aaaggcgagc gtcgtgtcaa agctattctt ttcttggacc 60
cggcccattc tcaggaaggg ctacaggcag aggctggagt tgagcgacat ctatcagatt 120
ccttccgtgg acagcgccga caacctgagc gagaagctgg aaagggagtg ggaccgcgaa 180
ctggcaagca aaaagaaccc caagctgatc aatgccctga gaaggtgttt cttttggaga 240
ttcatgttct acgggatctt tctgtatctg ggcgaggtta caaaggctgt gcagcccctg 300
ctgctcggca gaatcatcgc ctcatacgat ccagacaata aggaagagag atctatcgcc 360
atctacctgg gaattggcct gtgtctgctg ttcatcgtgc gcaccctgct cctccaccca 420
gccatttttg ggctgcatca catcggaatg cagatgagga ttgctatgtt ttccctgatc 480
tataagaaga ccctgaaact ctcaagcaga gtgctggaca aaatttccat tggccagctg 540
gtgtctctgc tgtccaataa tctcaataag tttgacgagg gcctggccct ggcacacttc 600
gtctggattg cccctctcca ggtcgctctg ctgatgggcc tgatctggga gctgctgcag 660
gcatccgctt tctgcggcct ggggttcctg atcgtgctgg cactgtttca ggcaggcctg 720
ggccgtatga tgatgaagta cagagaccag agggccggga agatctccga acggctcgtt 780
attacctctg agatgatcga gaacattcag tctgtgaaag cctactgctg ggaggaggct 840
atggagaaga tgatcgagaa tctgagacag accgagctga agctgaccag aaaggccgcc 900
tacgtgaggt acttcaacag cagtgccttc ttcttctctg gcttcttcgt tgtgtttctg 960
agcgtgctgc catacgctct catcaaaggc atcatcctga gaaaaatttt cacaaccatc 1020
tccttttgca tcgtgctgag aatggccgtg acaaggcagt tcccttgggc tgtgcagacc 1080
tggtacgaca gcctgggagc tattaataag attcaagatt tcctgcagaa gcaggaatac 1140
aaaacactgg aatacaacct gacaactact gaggtcgtta tggagaacgt gacagcattt 1200
tgggaggagg ggttcgggga actcttcgag aaggcaaagc agaacaacaa caatcggaag 1260
acatccaacg gcgacgacag cctgttcttt tccaacttca gcctgctggg aactccagtg 1320
ctcaaagaca ttaactttaa gatcgagagg ggccagctgc tcgccgtcgc cggatccaca 1380
ggcgccggca agacctctct gctgatggtt atcatgggcg aactggagcc ctccgagggc 1440
aagattaagc actcaggaag aatctccttt tgtagccagt tcagttggat tatgcccggc 1500
actattaagg agaatatcat ttttggggtg agctatgatg agtatcggta tcggagcgtt 1560
atcaaagcct gtcagctgga ggaggatatc agcaagttcg cagagaagga taatattgtg 1620
ctgggagagg gaggaatcac cctgagcgga ggccagagag ccagaattag cctcgcccgg 1680
gcagtctaca aagatgccga cctgtacctg ctggacagcc cttttggcta tttggatgtg 1740
ctgactgaaa aggaaatctt cgagagctgc gtgtgcaagc tgatggccaa caagacccgc 1800
atcctcgtca ctagcaagat ggaacacctg aagaaggccg acaagatcct gattctgcac 1860
gaggggagca gctacttcta tggcactttt tccgagctgc aaaatctcca gcctgacttc 1920
tcttccaagc tgatgggatg tgacagcttt gaccagtttt ccgctgagcg gcgcaatagc 1980
atcctgaccg aaaccctgca ccggttttca ctggagggcg acgcccctgt cagctggacc 2040
gagaccaaaa agcagtcttt caagcagaca ggcgagttcg gcgagaagcg caaaaacagc 2100
atcctgaatc caatcaactc tataaggaag tttagcatcg tgcagaagac acccctccag 2160
atgaacggca tcgaagagga cagtgacgag cccctggagc ggcgcctgag cctcgtgcct 2220
gacagcgaac agggcgaggc catcctgcct aggatcagcg tgatttcaac cgggccaaca 2280
ctgcaggcta ggagaagaca gtcagtgctt aacctgatga cacatagcgt gaatcaggga 2340
cagaacatcc atcgaaaaac cacagcctct actcgcaaag tgtcactggc tcctcaggct 2400
aatctgacag agctggacat ctatagcagg aggctgagcc aggagacagg cctggagatc 2460
agtgaggaga tcaacgaaga ggacctgaag gagtgctttt tcgatgacat ggagagtatc 2520
cccgccgtca ccacctggaa tacctacctc cggtacatca cagtgcacaa gtccctcatc 2580
tttgtgctga tttggtgcct cgtgatcttt ctcgcagaag tggccgcctc cctggtggtg 2640
ctgtggctgt tggggaatac tccactgcag gacaaaggca attctacaca cagcaggaat 2700
aattcctatg ccgtgattat caccagcaca tcctcttact acgtgttcta catctacgtg 2760
ggagtggcag atactctgct tgcaatgggc ttcttcaggg ggctgcccct ggtgcacaca 2820
ctgatcacag tgtccaagat cctccaccat aaaatgctcc acagcgtgct gcaggcaccc 2880
atgagcaccc tgaacacact gaaggccggc ggcatcctga atcgcttttc caaagacatc 2940
gccatcctcg acgatctcct gccactgacc atcttcgatt ttatccagct gctgctgatc 3000
gtgatcgggg ccatcgccgt ggtggccgtg ctgcagccat acattttcgt ggctacagtg 3060
cccgtgatcg ttgcctttat catgctgaga gcctacttcc tgcagacttc tcagcagctg 3120
aagcagctgg agagcgaagg gagaagcccc atcttcactc acctggtgac aagcctgaag 3180
ggactctgga ccctgagagc cttcggccgg cagccctatt tcgagaccct gtttcacaag 3240
gccctcaacc tgcacacagc caactggttc ctctacctgt ccaccctgag gtggttccag 3300
atgaggattg aaatgatctt cgtgattttt ttcatcgccg tgacattcat tagcattctg 3360
accaccggcg agggggaggg gagagtgggc atcatcctga cccttgccat gaacattatg 3420
agcacactgc agtgggccgt gaatagtagt atcgacgtgg acagtctgat gaggtccgtg 3480
agccgggtgt tcaagttcat tgacatgccc acagaaggga aacccaccaa aagcaccaag 3540
ccctacaaga acgggcagct gtccaaggtt atgatcatcg agaactctca cgtgaagaag 3600
gacgacattt ggcccagcgg cggccagatg acagtgaaag atctgaccgc caaatacacc 3660
gagggaggca acgccatcct cgaaaacatt agcttctcta tcagccctgg acagagggtg 3720
ggcctgctgg gccggacagg ctcagggaag agtactctgc tgtcagcatt cctgaggctc 3780
ctgaacacag agggcgagat ccagattgac ggcgtgtcct gggactccat caccctgcag 3840
cagtggcgga aggctttcgg ggtgatcccc cagaaggtgt tcatctttag cggcactttc 3900
agaaagaatc tggaccctta tgagcagtgg agtgaccagg agatctggaa agtggccgat 3960
gaggtcggac tgaggagcgt gatcgagcag tttccaggga agctggactt tgtgctggtg 4020
gatggcggat gcgtgctgtc tcacggccat aaacagctga tgtgtctggc ccggtccgtg 4080
ctgtctaagg ccaagatcct gctgctggac gaaccctccg cccacctgga ccccgtgaca 4140
taccagatca tcaggagaac tctcaagcag gccttcgccg actgtaccgt gattctgtgc 4200
gagcaccgca ttgaagctat gctggagtgt cagcagttcc tggtgatcga ggaaaataag 4260
gtgaggcagt acgacagcat ccagaagctg ctgaacgagc gctccctgtt ccgccaggct 4320
atctccccat cagaccgggt gaaactcttc ccccacagaa actcctcaaa gtgcaagtcc 4380
aagccccaga tcgccgccct gaaggaggag accgaggagg aggtgcagga caccaggctg 4440
tga 4443
<210> 12
<211> 140
<212> RNA
<213> Artificial Sequence
<220>
<223> Synthetic polynucleotide
<400> 12
ggacagaucg ccuggagacg ccauccacgc uguuuugacc uccauagaag acaccgggac 60
cgauccagcc uccgcggccg ggaacggugc auuggaacgc ggauuccccg ugccaagagu 120
gacucaccgu ccuugacacg 140
<210> 13
<211> 105
<212> RNA
<213> Artificial Sequence
<220>
<223> Synthetic polynucleotide
<400> 13
cggguggcau cccugugacc ccuccccagu gccucuccug gcccuggaag uugccacucc 60
agugcccacc agccuugucc uaauaaaauu aaguugcauc aagcu 105
<210> 14
<211> 105
<212> RNA
<213> Artificial Sequence
<220>
<223> Synthetic polynucleotide
<400> 14
ggguggcauc ccugugaccc cuccccagug ccucuccugg cccuggaagu ugccacucca 60
gugcccacca gccuuguccu aauaaaauua aguugcauca aagcu 105
<210> 15
<211> 4443
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic polynucleotide
<400> 15
atgcaacgct ctcctcttga aaaggcctcg gtggtgtcca agctcttctt ctcgtggact 60
agacccatcc tgagaaaggg gtacagacag cgcttggagc tgtccgatat ctatcaaatc 120
ccttccgtgg actccgcgga caacctgtcc gagaagctcg agagagaatg ggacagagaa 180
ctcgcctcaa agaagaaccc gaagctgatt aatgcgctta ggcggtgctt tttctggcgg 240
ttcatgttct acggcatctt cctctacctg ggagaggtca ccaaggccgt gcagcccctg 300
ttgctgggac ggattattgc ctcctacgac cccgacaaca aggaagaaag aagcatcgct 360
atctacttgg gcatcggtct gtgcctgctt ttcatcgtcc ggaccctctt gttgcatcct 420
gctattttcg gcctgcatca cattggcatg cagatgagaa ttgccatgtt ttccctgatc 480
tacaagaaaa ctctgaagct ctcgagccgc gtgcttgaca agatttccat cggccagctc 540
gtgtccctgc tctccaacaa tctgaacaag ttcgacgagg gcctcgccct ggcccacttc 600
gtgtggatcg cccctctgca agtggcgctt ctgatgggcc tgatctggga gctgctgcaa 660
gcctcggcat tctgtgggct tggattcctg atcgtgctgg cactgttcca ggccggactg 720
gggcggatga tgatgaagta cagggaccag agagccggaa agatttccga acggctggtg 780
atcacttcgg aaatgatcga aaacatccag tcagtgaagg cctactgctg ggaagaggcc 840
atggaaaaga tgattgaaaa cctccggcaa accgagctga agctgacccg caaggccgct 900
tacgtgcgct atttcaactc gtccgctttc ttcttctccg ggttcttcgt ggtgtttctc 960
tccgtgctcc cctacgccct gattaaggga atcatcctca ggaagatctt caccaccatt 1020
tccttctgta tcgtgctccg catggccgtg acccggcagt tcccatgggc cgtgcagact 1080
tggtacgact ccctgggagc cattaacaag atccaggact tccttcaaaa gcaggagtac 1140
aagaccctcg agtacaacct gactactacc gaggtcgtga tggaaaacgt caccgccttt 1200
tgggaggagg gatttggcga actgttcgag aaggccaagc agaacaacaa caaccgcaag 1260
acctcgaacg gtgacgactc cctcttcttt tcaaacttca gcctgctcgg gacgcccgtg 1320
ctgaaggaca ttaacttcaa gatcgaaaga ggacagctcc tggcggtggc cggatcgacc 1380
ggagccggaa agacttccct gctgatggtg atcatgggag agcttgaacc tagcgaggga 1440
aagatcaagc actccggccg catcagcttc tgtagccagt tttcctggat catgcccgga 1500
accattaagg aaaacatcat cttcggcgtg tcctacgatg aataccgcta ccggtccgtg 1560
atcaaagcct gccagctgga agaggatatt tcaaagttcg cggagaaaga taacatcgtg 1620
ctgggcgaag ggggtattac cttgtcgggg ggccagcggg ctagaatctc gctggccaga 1680
gccgtgtata aggacgccga cctgtatctc ctggactccc ccttcggata cctggacgtc 1740
ctgaccgaaa aggagatctt cgaatcgtgc gtgtgcaagc tgatggctaa caagactcgc 1800
atcctcgtga cctccaaaat ggagcacctg aagaaggcag acaagattct gattctgcat 1860
gaggggtcct cctactttta cggcaccttc tcggagttgc agaacttgca gcccgacttc 1920
tcatcgaagc tgatgggttg cgacagcttc gaccagttct ccgccgaaag aaggaactcg 1980
atcctgacgg aaaccttgca ccgcttctct ttggaaggcg acgcccctgt gtcatggacc 2040
gagactaaga agcagagctt caagcagacc ggggaattcg gcgaaaagag gaagaacagc 2100
atcttgaacc ccattaactc catccgcaag ttctcaatcg tgcaaaagac gccactgcag 2160
atgaacggca ttgaggagga ctccgacgaa ccccttgaga ggcgcctgtc cctggtgccg 2220
gacagcgagc agggagaagc catcctgcct cggatttccg tgatctccac tggtccgacg 2280
ctccaagccc ggcggcggca gtccgtgctg aacctgatga cccacagcgt gaaccagggc 2340
caaaacattc accgcaagac taccgcatcc acccggaaag tgtccctggc acctcaagcg 2400
aatcttaccg agctcgacat ctactcccgg agactgtcgc aggaaaccgg gctcgaaatt 2460
tccgaagaaa tcaacgagga ggatctgaaa gagtgcttct tcgacgatat ggagtcgata 2520
cccgccgtga cgacttggaa cacttatctg cggtacatca ctgtgcacaa gtcattgatc 2580
ttcgtgctga tttggtgcct ggtgattttc ctggccgagg tcgcggcctc actggtggtg 2640
ctctggctgt tgggaaacac gcctctgcaa gacaagggaa actccacgca ctcgagaaac 2700
aacagctatg ccgtgattat cacttccacc tcctcttatt acgtgttcta catctacgtc 2760
ggagtggcgg ataccctgct cgcgatgggt ttcttcagag gactgccgct ggtccacacc 2820
ttgatcaccg tcagcaagat tcttcaccac aagatgttgc atagcgtgct gcaggccccc 2880
atgtccaccc tcaacactct gaaggccgga ggcattctga acagattctc caaggacatc 2940
gctatcctgg acgatctcct gccgcttacc atctttgact tcatccagct gctgctgatc 3000
gtgattggag caatcgcagt ggtggcggtg ctgcagcctt acattttcgt ggccactgtg 3060
ccggtcattg tggcgttcat catgctgcgg gcctacttcc tccaaaccag ccagcagctg 3120
aagcaactgg aatccgaggg acgatccccc atcttcactc accttgtgac gtcgttgaag 3180
ggactgtgga ccctccgggc tttcggacgg cagccctact tcgaaaccct cttccacaag 3240
gccctgaacc tccacaccgc caattggttc ctgtacctgt ccaccctgcg gtggttccag 3300
atgcgcatcg agatgatttt cgtcatcttc ttcatcgcgg tcacattcat cagcatcctg 3360
actaccggag agggagaggg acgggtcgga ataatcctga ccctcgccat gaacattatg 3420
agcaccctgc agtgggcagt gaacagctcg atcgacgtgg acagcctgat gcgaagcgtc 3480
agccgcgtgt tcaagttcat cgacatgcct actgagggaa aacccactaa gtccactaag 3540
ccctacaaaa atggccagct gagcaaggtc atgatcatcg aaaactccca cgtgaagaag 3600
gacgatattt ggccctccgg aggtcaaatg accgtgaagg acctgaccgc aaagtacacc 3660
gagggaggaa acgccattct cgaaaacatc agcttctcca tttcgccggg acagcgggtc 3720
ggccttctcg ggcggaccgg ttccgggaag tcaactctgc tgtcggcttt cctccggctg 3780
ctgaataccg agggggaaat ccaaattgac ggcgtgtctt gggattccat tactctgcag 3840
cagtggcgga aggccttcgg cgtgatcccc cagaaggtgt tcatcttctc gggtaccttc 3900
cggaagaacc tggatcctta cgagcagtgg agcgaccaag aaatctggaa ggtcgccgac 3960
gaggtcggcc tgcgctccgt gattgaacaa tttcctggaa agctggactt cgtgctcgtc 4020
gacgggggat gtgtcctgtc gcacggacat aagcagctca tgtgcctcgc acggtccgtg 4080
ctctccaagg ccaagattct gctgctggac gaaccttcgg cccacctgga tccggtcacc 4140
taccagatca tcaggaggac cctgaagcag gcctttgccg attgcaccgt gattctctgc 4200
gagcaccgca tcgaggccat gctggagtgc cagcagttcc tggtcatcga ggagaacaag 4260
gtccgccaat acgactccat tcaaaagctc ctcaacgagc ggtcgctgtt cagacaagct 4320
atttcaccgt ccgatagagt gaagctcttc ccgcatcgga acagctcaaa gtgcaaatcg 4380
aagccgcaga tcgcagcctt gaaggaagag actgaggaag aggtgcagga cacccggctt 4440
taa 4443
<210> 16
<211> 4443
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic polynucleotide
<400> 16
atgcagcggt ccccgctcga aaaggccagt gtcgtgtcca aactcttctt ctcatggact 60
cggcctatcc ttagaaaggg gtatcggcag aggcttgagt tgtctgacat ctaccagatc 120
ccctcggtag attcggcgga taacctctcg gagaagctcg aacgggaatg ggaccgcgaa 180
ctcgcgtcta agaaaaaccc gaagctcatc aacgcactga gaaggtgctt cttctggcgg 240
ttcatgttct acggtatctt cttgtatctc ggggaggtca caaaagcagt ccaacccctg 300
ttgttgggtc gcattatcgc ctcgtacgac cccgataaca aagaagaacg gagcatcgcg 360
atctacctcg ggatcggact gtgtttgctt ttcatcgtca gaacactttt gttgcatcca 420
gcaatcttcg gcctccatca catcggtatg cagatgcgaa tcgctatgtt tagcttgatc 480
tacaaaaaga cactgaaact ctcgtcgcgg gtgttggata agatttccat cggtcagttg 540
gtgtccctgc ttagtaataa cctcaacaaa ttcgatgagg gactggcgct ggcacatttc 600
gtgtggattg ccccgttgca agtcgccctt ttgatgggcc ttatttggga actcttgcag 660
gcatctgcct tttgtggcct gggatttctg attgtgttgg cattgtttca ggctgggctt 720
gggcggatga tgatgaagta tcgcgaccag agagcgggta aaatctcgga aagactcgtc 780
atcacttcgg aaatgatcga aaacatccag tcggtcaaag cctattgctg ggaagaagct 840
atggagaaga tgattgaaaa cctccgccaa actgagctga aactgacccg caaggcggcg 900
tatgtccggt atttcaattc gtcagcgttc ttcttttccg ggttcttcgt tgtctttctc 960
tcggttttgc cttatgcctt gattaagggg attatcctcc gcaagatttt caccacgatt 1020
tcgttctgca ttgtattgcg catggcagtg acacggcaat ttccgtgggc cgtgcagaca 1080
tggtatgact cgcttggagc gatcaacaaa atccaagact tcttgcaaaa gcaagagtac 1140
aagaccctgg agtacaatct tactactacg gaggtagtaa tggagaatgt gacggctttt 1200
tgggaagagg gttttggaga gctcttcgag aaagcaaagc agaataacaa caaccgcaag 1260
acctcaaatg gggacgattc cctgtttttc tcgaacttct ccctgctcgg aacacccgtg 1320
ttgaaggaca tcaatttcaa gattgagagg ggacagcttc tcgcggtagc gggaagcact 1380
ggtgcgggaa aaactagcct cttgatggtg attatggggg agcttgagcc cagcgagggg 1440
aagattaaac actccgggcg tatctcattc tgtagccagt tttcatggat catgcccgga 1500
accattaaag agaacatcat tttcggagta tcctatgatg agtaccgata cagatcggtc 1560
attaaggcgt gccagttgga agaggacatt tctaagttcg ccgagaagga taacatcgtc 1620
ttgggagaag ggggtattac attgtcggga gggcagcgag cgcggatcag cctcgcgaga 1680
gcggtataca aagatgcaga tttgtacctg ctcgattcac cgtttggata cctcgacgta 1740
ttgacagaaa aagaaatctt cgagtcgtgc gtgtgtaaac ttatggctaa taagacgaga 1800
atcctggtga catcaaaaat ggaacacctt aagaaggcgg acaagatcct gatcctccac 1860
gaaggatcgt cctactttta cggcactttc tcagagttgc aaaacttgca gccggacttc 1920
tcaagcaaac tcatggggtg tgactcattc gaccagttca gcgcggaacg gcggaactcg 1980
atcttgacgg aaacgctgca ccgattctcg cttgagggtg atgccccggt atcgtggacc 2040
gagacaaaga agcagtcgtt taagcagaca ggagaatttg gtgagaaaag aaagaacagt 2100
atcttgaatc ctattaactc aattcgcaag ttctcaatcg tccagaaaac tccactgcag 2160
atgaatggaa ttgaagagga ttcggacgaa cccctggagc gcaggcttag cctcgtgccg 2220
gattcagagc aaggggaggc cattcttccc cggatttcgg tgatttcaac cggacctaca 2280
cttcaggcga ggcgaaggca atccgtgctc aacctcatga cgcattcggt aaaccagggg 2340
caaaacattc accgcaaaac gacggcctca acgagaaaag tgtcacttgc accccaggcg 2400
aatttgactg aactcgacat ctacagccgt aggctttcgc aagaaaccgg acttgagatc 2460
agcgaagaaa tcaatgaaga agatttgaaa gagtgtttct ttgatgacat ggaatcaatc 2520
ccagcggtga caacgtggaa cacatacttg cgttacatca cggtgcacaa gtccttgatt 2580
ttcgtcctca tttggtgcct cgtgatcttt ctcgctgagg tcgcagcgtc acttgtggtc 2640
ctctggctgc ttggtaatac gcccttgcaa gacaaaggca attctacaca ctcaagaaac 2700
aattcctatg ccgtgattat cacttctaca agctcgtatt acgtgtttta catctacgta 2760
ggagtggccg acactctgct cgcgatgggt ttcttccgag gactcccact cgttcacacg 2820
cttatcactg tctccaagat tctccaccat aagatgcttc atagcgtact gcaggctccc 2880
atgtccacct tgaatacgct caaggcggga ggtattttga atcgcttctc aaaagatatt 2940
gcaattttgg atgaccttct gcccctgacg atcttcgact tcatccagtt gttgctgatc 3000
gtgattgggg ctattgcagt agtcgctgtc ctccagcctt acatttttgt cgcgaccgtt 3060
ccggtgatcg tggcgtttat catgctgcgg gcctatttct tgcagacgtc acagcagctt 3120
aagcaactgg agtctgaagg gaggtcgcct atctttacgc atcttgtgac cagtttgaag 3180
ggattgtgga cgttgcgcgc ctttggcagg cagccctact ttgaaacact gttccacaaa 3240
gcgctgaatc tccatacggc aaattggttt ttgtatttga gtaccctccg atggtttcag 3300
atgcgcattg agatgatttt tgtgatcttc tttatcgcgg tgacttttat ctccatcttg 3360
accacgggag agggcgaggg acgggtcggt attatcctga cactcgccat gaacattatg 3420
agcactttgc agtgggcagt gaacagctcg attgatgtgg atagcctgat gaggtccgtt 3480
tcgagggtct ttaagttcat cgacatgccg acggagggaa agcccacaaa aagtacgaaa 3540
ccctataaga atgggcaatt gagtaaggta atgatcatcg agaacagtca cgtgaagaag 3600
gatgacatct ggcctagcgg gggtcagatg accgtgaagg acctgacggc aaaatacacc 3660
gagggaggga acgcaatcct tgaaaacatc tcgttcagca ttagccccgg tcagcgtgtg 3720
gggttgctcg ggaggaccgg gtcaggaaaa tcgacgttgc tgtcggcctt cttgagactt 3780
ctgaatacag agggtgagat ccagatcgac ggcgtttcgt gggatagcat caccttgcag 3840
cagtggcgga aagcgtttgg agtaatcccc caaaaggtct ttatctttag cggaaccttc 3900
cgaaagaatc tcgatcctta tgaacagtgg tcagatcaag agatttggaa agtcgcggac 3960
gaggttggcc ttcggagtgt aatcgagcag tttccgggaa aactcgactt tgtccttgta 4020
gatgggggat gcgtcctgtc gcatgggcac aagcagctca tgtgcctggc gcgatccgtc 4080
ctctctaaag cgaaaattct tctcttggat gaaccttcgg cccatctgga cccggtaacg 4140
tatcagatca tcagaaggac acttaagcag gcgtttgccg actgcacggt gattctctgt 4200
gagcatcgta tcgaggccat gctcgaatgc cagcaatttc ttgtcatcga agagaataag 4260
gtccgccagt acgactccat ccagaagctg cttaatgaga gatcattgtt ccggcaggcg 4320
atttcaccat ccgatagggt gaaacttttt ccacacagaa attcgtcgaa gtgcaagtcc 4380
aaaccgcaga tcgcggcctt gaaagaagag actgaagaag aagttcaaga cacgcgtctt 4440
taa 4443
<210> 17
<211> 874
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic polypeptide
<400> 17
Met Gln Asp Leu His Ala Ile Gln Leu Gln Leu Glu Glu Glu Met Phe
1 5 10 15
Asn Gly Gly Ile Arg Arg Phe Glu Ala Asp Gln Gln Arg Gln Ile Ala
20 25 30
Ala Gly Ser Glu Ser Asp Thr Ala Trp Asn Arg Arg Leu Leu Ser Glu
35 40 45
Leu Ile Ala Pro Met Ala Glu Gly Ile Gln Ala Tyr Lys Glu Glu Tyr
50 55 60
Glu Gly Lys Lys Gly Arg Ala Pro Arg Ala Leu Ala Phe Leu Gln Cys
65 70 75 80
Val Glu Asn Glu Val Ala Ala Tyr Ile Thr Met Lys Val Val Met Asp
85 90 95
Met Leu Asn Thr Asp Ala Thr Leu Gln Ala Ile Ala Met Ser Val Ala
100 105 110
Glu Arg Ile Glu Asp Gln Val Arg Phe Ser Lys Leu Glu Gly His Ala
115 120 125
Ala Lys Tyr Phe Glu Lys Val Lys Lys Ser Leu Lys Ala Ser Arg Thr
130 135 140
Lys Ser Tyr Arg His Ala His Asn Val Ala Val Val Ala Glu Lys Ser
145 150 155 160
Val Ala Glu Lys Asp Ala Asp Phe Asp Arg Trp Glu Ala Trp Pro Lys
165 170 175
Glu Thr Gln Leu Gln Ile Gly Thr Thr Leu Leu Glu Ile Leu Glu Gly
180 185 190
Ser Val Phe Tyr Asn Gly Glu Pro Val Phe Met Arg Ala Met Arg Thr
195 200 205
Tyr Gly Gly Lys Thr Ile Tyr Tyr Leu Gln Thr Ser Glu Ser Val Gly
210 215 220
Gln Trp Ile Ser Ala Phe Lys Glu His Val Ala Gln Leu Ser Pro Ala
225 230 235 240
Tyr Ala Pro Cys Val Ile Pro Pro Arg Pro Trp Arg Thr Pro Phe Asn
245 250 255
Gly Gly Phe His Thr Glu Lys Val Ala Ser Arg Ile Arg Leu Val Lys
260 265 270
Gly Asn Arg Glu His Val Arg Lys Leu Thr Gln Lys Gln Met Pro Lys
275 280 285
Val Tyr Lys Ala Ile Asn Ala Leu Gln Asn Thr Gln Trp Gln Ile Asn
290 295 300
Lys Asp Val Leu Ala Val Ile Glu Glu Val Ile Arg Leu Asp Leu Gly
305 310 315 320
Tyr Gly Val Pro Ser Phe Lys Pro Leu Ile Asp Lys Glu Asn Lys Pro
325 330 335
Ala Asn Pro Val Pro Val Glu Phe Gln His Leu Arg Gly Arg Glu Leu
340 345 350
Lys Glu Met Leu Ser Pro Glu Gln Trp Gln Gln Phe Ile Asn Trp Lys
355 360 365
Gly Glu Cys Ala Arg Leu Tyr Thr Ala Glu Thr Lys Arg Gly Ser Lys
370 375 380
Ser Ala Ala Val Val Arg Met Val Gly Gln Ala Arg Lys Tyr Ser Ala
385 390 395 400
Phe Glu Ser Ile Tyr Phe Val Tyr Ala Met Asp Ser Arg Ser Arg Val
405 410 415
Tyr Val Gln Ser Ser Thr Leu Ser Pro Gln Ser Asn Asp Leu Gly Lys
420 425 430
Ala Leu Leu Arg Phe Thr Glu Gly Arg Pro Val Asn Gly Val Glu Ala
435 440 445
Leu Lys Trp Phe Cys Ile Asn Gly Ala Asn Leu Trp Gly Trp Asp Lys
450 455 460
Lys Thr Phe Asp Val Arg Val Ser Asn Val Leu Asp Glu Glu Phe Gln
465 470 475 480
Asp Met Cys Arg Asp Ile Ala Ala Asp Pro Leu Thr Phe Thr Gln Trp
485 490 495
Ala Lys Ala Asp Ala Pro Tyr Glu Phe Leu Ala Trp Cys Phe Glu Tyr
500 505 510
Ala Gln Tyr Leu Asp Leu Val Asp Glu Gly Arg Ala Asp Glu Phe Arg
515 520 525
Thr His Leu Pro Val His Gln Asp Gly Ser Cys Ser Gly Ile Gln His
530 535 540
Tyr Ser Ala Met Leu Arg Asp Glu Val Gly Ala Lys Ala Val Asn Leu
545 550 555 560
Lys Pro Ser Asp Ala Pro Gln Asp Ile Tyr Gly Ala Val Ala Gln Val
565 570 575
Val Ile Lys Lys Asn Ala Leu Tyr Met Asp Ala Asp Asp Ala Thr Thr
580 585 590
Phe Thr Ser Gly Ser Val Thr Leu Ser Gly Thr Glu Leu Arg Ala Met
595 600 605
Ala Ser Ala Trp Asp Ser Ile Gly Ile Thr Arg Ser Leu Thr Lys Lys
610 615 620
Pro Val Met Thr Leu Pro Tyr Gly Ser Thr Arg Leu Thr Cys Arg Glu
625 630 635 640
Ser Val Ile Asp Tyr Ile Val Asp Leu Glu Glu Lys Glu Ala Gln Lys
645 650 655
Ala Val Ala Glu Gly Arg Thr Ala Asn Lys Val His Pro Phe Glu Asp
660 665 670
Asp Arg Gln Asp Tyr Leu Thr Pro Gly Ala Ala Tyr Asn Tyr Met Thr
675 680 685
Ala Leu Ile Trp Pro Ser Ile Ser Glu Val Val Lys Ala Pro Ile Val
690 695 700
Ala Met Lys Met Ile Arg Gln Leu Ala Arg Phe Ala Ala Lys Arg Asn
705 710 715 720
Glu Gly Leu Met Tyr Thr Leu Pro Thr Gly Phe Ile Leu Glu Gln Lys
725 730 735
Ile Met Ala Thr Glu Met Leu Arg Val Arg Thr Cys Leu Met Gly Asp
740 745 750
Ile Lys Met Ser Leu Gln Val Glu Thr Asp Ile Val Asp Glu Ala Ala
755 760 765
Met Met Gly Ala Ala Ala Pro Asn Phe Val His Gly His Asp Ala Ser
770 775 780
His Leu Ile Leu Thr Val Cys Glu Leu Val Asp Lys Gly Val Thr Ser
785 790 795 800
Ile Ala Val Ile His Asp Ser Phe Gly Thr His Ala Asp Asn Thr Leu
805 810 815
Thr Leu Arg Val Ala Leu Lys Gly Gln Met Val Ala Met Tyr Ile Asp
820 825 830
Gly Asn Ala Leu Gln Lys Leu Leu Glu Glu His Glu Val Arg Trp Met
835 840 845
Val Asp Thr Gly Ile Glu Val Pro Glu Gln Gly Glu Phe Asp Leu Asn
850 855 860
Glu Ile Met Asp Ser Glu Tyr Val Phe Ala
865 870
Claims (21)
1. A composition for pulmonary delivery, the composition comprising mRNA encapsulated in a lipid nanoparticle, wherein the lipid nanoparticle comprises a cationic lipid selected from the group consisting of: GL-TES-SA-DMP-E18-2, GL-TES-SA-DME-E18-2, TL1-01D-DMA, TL1-04D-DMA, SY-3-E14-DMAPR, TL1-10D-DMA, HEP-E3-E10, HEP-E4-E10 and Guan-SS-Chol.
2. The composition of claim 1, wherein the mRNA is codon optimized.
3. The composition of claim 1 or 2, wherein the lipid nanoparticle further comprises one or more non-cationic lipids and one or more PEG-modified lipids.
4. The composition of any one of the preceding claims, wherein the lipid nanoparticle further comprises one or more cholesterol-based lipids.
5. The composition of claim 3 or 4, wherein the non-cationic lipid is DOPE or DEPE.
6. The composition of any one of the preceding claims, wherein the liposomes have a size of less than about 100 nm.
7. The composition of any one of the preceding claims, wherein the liposome comprises no more than three different lipid components.
8. The composition of claim 7, wherein the three different lipid components are the cationic lipid, a non-cationic lipid, and a PEG-modified lipid.
9. The composition of claim 8, wherein the non-cationic lipid is DOPE or DEPE.
10. The composition of claim 8 or 9, wherein the PEG-modified lipid is DMG-PEG2K.
11. The composition of any one of claims 8-10, wherein the three different lipid components are Guan-SS-Chol, DOPE, and DMG-PEG2K.
12. The composition of claim 11, wherein the Guan-SS-Chol, DOPE, and DMG-PEG2K are present in a molar ratio of about 60:35:5, respectively.
13. The composition of any one of claims 1-6, wherein the liposome comprises four different lipid components.
14. The composition of claim 13, wherein the four different lipid components are the cationic lipid, non-cationic lipid, cholesterol, and PEG-modified lipid.
15. The method of claim 11 or 12, wherein the non-cationic lipid is DOPE or DEPE.
16. The method of claim 15, wherein the non-cationic lipid is DOPE.
17. The method of any one of claims 11-16, wherein the PEG-modified lipid is DMG-PEG2K.
18. The pharmaceutical composition of any one of claims 13-15, wherein the molar ratio of cationic lipid to non-cationic lipid to cholesterol to PEG-modified lipid is between about 30-60:25-35:20-30:1-15, respectively.
19. The composition of any one of the preceding claims, wherein the mRNA encodes a cystic fibrosis transmembrane conductance regulator, an ATP-binding cassette subfamily a member 3 protein, a kinesin shaft chain 1 (DNAI 1) protein, a kinesin shaft chain heavy chain 5 (DNAH 5) protein, an alpha-1-antitrypsin protein, a fork box P3 (FOXP 3) protein, or one or more surfactant proteins.
20. A method of inducing protein expression in epithelial cells in the lung of a mammal, the method comprising contacting epithelial cells in the lung of the mammal with the composition of any one of claims 1-19.
21. The method of claim 20, wherein the composition is administered by nebulization.
Applications Claiming Priority (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US63/111,321 | 2020-11-09 | ||
US202163195581P | 2021-06-01 | 2021-06-01 | |
US63/195,581 | 2021-06-01 | ||
PCT/US2021/058623 WO2022099194A1 (en) | 2020-11-09 | 2021-11-09 | Improved compositions for delivery of codon-optimized mrna |
Publications (1)
Publication Number | Publication Date |
---|---|
CN116782878A true CN116782878A (en) | 2023-09-19 |
Family
ID=87989997
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202180089274.0A Pending CN116782878A (en) | 2020-11-09 | 2021-11-09 | Improved compositions for delivery of codon optimized mRNA |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN116782878A (en) |
-
2021
- 2021-11-09 CN CN202180089274.0A patent/CN116782878A/en active Pending
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US11253605B2 (en) | Codon-optimized CFTR MRNA | |
US20230181619A1 (en) | Improved compositions for cftr mrna therapy | |
EP2506857A1 (en) | Delivery of mrna for the augmentation of proteins and enzymes in human genetic diseases | |
CN113272282A (en) | 2, 5-dioxopiperazine lipids with inserted ester, thioester, disulfide and anhydride moieties | |
WO2021222801A2 (en) | Nucleic acids and methods of treatment for cystic fibrosis | |
CA3165388A1 (en) | Improved process of preparing mrna-loaded lipid nanoparticles | |
EP3849617A1 (en) | Composition and methods for treatment of methylmalonic acidemia | |
US20240197825A1 (en) | mRNA Encoding Engineered CFTR | |
EP4146680A1 (en) | Composition and methods for treatment of primary ciliary dyskinesia | |
EP4114470A1 (en) | Compositions and methods for the treatment of ornithine transcarbamylase deficiency | |
US20220160633A1 (en) | Compositions for delivery of codon-optimized mrna | |
CA3198411A1 (en) | Improved process of preparing mrna-loaded lipid nanoparticles | |
CN116782878A (en) | Improved compositions for delivery of codon optimized mRNA | |
WO2022081548A9 (en) | Improved process of preparing ice-based lipid nanoparticles | |
EP4048316A1 (en) | Compositions, methods and uses of messenger rna | |
US20220287966A1 (en) | Stable Liquid Lipid Nanoparticle Formulations | |
AU2018203895B2 (en) | Delivery Of mRNA For The Augmentation Of Proteins And Enzymes In Human Genetic Diseases | |
CN118251497A (en) | Lipid compositions comprising peptide-lipid conjugates |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |