AU2021381384A1 - Compositions and methods for optimized peptide vaccines - Google Patents
Compositions and methods for optimized peptide vaccines Download PDFInfo
- Publication number
- AU2021381384A1 AU2021381384A1 AU2021381384A AU2021381384A AU2021381384A1 AU 2021381384 A1 AU2021381384 A1 AU 2021381384A1 AU 2021381384 A AU2021381384 A AU 2021381384A AU 2021381384 A AU2021381384 A AU 2021381384A AU 2021381384 A1 AU2021381384 A1 AU 2021381384A1
- Authority
- AU
- Australia
- Prior art keywords
- peptide
- seq
- hla
- sequence
- composition
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000000034 method Methods 0.000 title claims abstract description 237
- 239000000203 mixture Substances 0.000 title claims abstract description 221
- 229940023041 peptide vaccine Drugs 0.000 title description 61
- 108090000765 processed proteins & peptides Chemical group 0.000 claims abstract description 1714
- 230000002163 immunogen Effects 0.000 claims abstract description 175
- 125000003275 alpha amino acid group Chemical group 0.000 claims abstract description 139
- 150000007523 nucleic acids Chemical group 0.000 claims abstract description 123
- 108091028043 Nucleic acid sequence Proteins 0.000 claims abstract description 85
- 229960005486 vaccine Drugs 0.000 claims description 496
- 102000004196 processed proteins & peptides Human genes 0.000 claims description 405
- 230000027455 binding Effects 0.000 claims description 279
- 238000009739 binding Methods 0.000 claims description 279
- 108700028369 Alleles Proteins 0.000 claims description 243
- 230000005847 immunogenicity Effects 0.000 claims description 164
- 108091005601 modified peptides Proteins 0.000 claims description 162
- 206010028980 Neoplasm Diseases 0.000 claims description 156
- 108090000623 proteins and genes Proteins 0.000 claims description 117
- 201000011510 cancer Diseases 0.000 claims description 82
- 238000003556 assay Methods 0.000 claims description 69
- WEVYNIUIFUYDGI-UHFFFAOYSA-N 3-[6-[4-(trifluoromethoxy)anilino]-4-pyrimidinyl]benzamide Chemical compound NC(=O)C1=CC=CC(C=2N=CN=C(NC=3C=CC(OC(F)(F)F)=CC=3)C=2)=C1 WEVYNIUIFUYDGI-UHFFFAOYSA-N 0.000 claims description 58
- 150000001413 amino acids Chemical class 0.000 claims description 58
- 125000000539 amino acid group Chemical group 0.000 claims description 52
- 230000001717 pathogenic effect Effects 0.000 claims description 52
- 244000052769 pathogen Species 0.000 claims description 51
- 230000004927 fusion Effects 0.000 claims description 49
- 108010026552 Proteome Proteins 0.000 claims description 41
- 102200006539 rs121913529 Human genes 0.000 claims description 39
- 238000006467 substitution reaction Methods 0.000 claims description 38
- 102100030708 GTPase KRas Human genes 0.000 claims description 35
- 101000584612 Homo sapiens GTPase KRas Proteins 0.000 claims description 35
- 102200006531 rs121913529 Human genes 0.000 claims description 32
- 238000001727 in vivo Methods 0.000 claims description 30
- 239000012634 fragment Substances 0.000 claims description 29
- 102200006540 rs121913530 Human genes 0.000 claims description 29
- 238000001914 filtration Methods 0.000 claims description 27
- 230000014509 gene expression Effects 0.000 claims description 24
- 102200006532 rs112445441 Human genes 0.000 claims description 20
- 102200006538 rs121913530 Human genes 0.000 claims description 20
- 102000054766 genetic haplotypes Human genes 0.000 claims description 19
- 229940126685 KRAS G12R Drugs 0.000 claims description 18
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 claims description 16
- 210000000621 bronchi Anatomy 0.000 claims description 7
- 210000001072 colon Anatomy 0.000 claims description 7
- 210000004072 lung Anatomy 0.000 claims description 7
- 210000000496 pancreas Anatomy 0.000 claims description 7
- 210000000664 rectum Anatomy 0.000 claims description 7
- 210000003679 cervix uteri Anatomy 0.000 claims description 6
- 208000015181 infectious disease Diseases 0.000 claims description 6
- 210000003734 kidney Anatomy 0.000 claims description 6
- 210000004185 liver Anatomy 0.000 claims description 6
- 210000002784 stomach Anatomy 0.000 claims description 6
- 210000003932 urinary bladder Anatomy 0.000 claims description 6
- 210000004291 uterus Anatomy 0.000 claims description 6
- 101150033421 ABL gene Proteins 0.000 claims description 3
- 108020004707 nucleic acids Proteins 0.000 abstract description 9
- 102000039446 nucleic acids Human genes 0.000 abstract description 9
- 238000013461 design Methods 0.000 description 115
- 102000043129 MHC class I family Human genes 0.000 description 93
- 108091054437 MHC class I family Proteins 0.000 description 93
- 102000004169 proteins and genes Human genes 0.000 description 72
- 235000018102 proteins Nutrition 0.000 description 68
- 235000001014 amino acid Nutrition 0.000 description 63
- 108091054438 MHC class II family Proteins 0.000 description 62
- 102000043131 MHC class II family Human genes 0.000 description 60
- 230000035772 mutation Effects 0.000 description 43
- 210000004027 cell Anatomy 0.000 description 31
- 230000006870 function Effects 0.000 description 31
- 210000001744 T-lymphocyte Anatomy 0.000 description 25
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 24
- 230000004048 modification Effects 0.000 description 24
- 238000012986 modification Methods 0.000 description 24
- 108700018351 Major Histocompatibility Complex Proteins 0.000 description 23
- 201000010099 disease Diseases 0.000 description 23
- 239000008194 pharmaceutical composition Substances 0.000 description 23
- 230000020382 suppression by virus of host antigen processing and presentation of peptide antigen via MHC class I Effects 0.000 description 23
- 239000002552 dosage form Substances 0.000 description 22
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 18
- 150000001875 compounds Chemical class 0.000 description 16
- 238000009472 formulation Methods 0.000 description 16
- 238000004949 mass spectrometry Methods 0.000 description 14
- 108700021021 mRNA Vaccine Proteins 0.000 description 13
- 108020004414 DNA Proteins 0.000 description 12
- 239000011159 matrix material Substances 0.000 description 12
- 210000004881 tumor cell Anatomy 0.000 description 12
- 229940126582 mRNA vaccine Drugs 0.000 description 11
- 210000002569 neuron Anatomy 0.000 description 11
- 230000008569 process Effects 0.000 description 11
- 239000000243 solution Substances 0.000 description 11
- 208000031261 Acute myeloid leukaemia Diseases 0.000 description 10
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 10
- 239000004471 Glycine Substances 0.000 description 10
- 208000033776 Myeloid Acute Leukemia Diseases 0.000 description 10
- 239000004480 active ingredient Substances 0.000 description 10
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 9
- 206010061902 Pancreatic neoplasm Diseases 0.000 description 9
- 101150040459 RAS gene Proteins 0.000 description 9
- 239000003814 drug Substances 0.000 description 9
- 208000015486 malignant pancreatic neoplasm Diseases 0.000 description 9
- 108020004999 messenger RNA Proteins 0.000 description 9
- 201000002528 pancreatic cancer Diseases 0.000 description 9
- 208000008443 pancreatic carcinoma Diseases 0.000 description 9
- 238000011282 treatment Methods 0.000 description 9
- 108010041986 DNA Vaccines Proteins 0.000 description 8
- 229940021995 DNA vaccine Drugs 0.000 description 8
- 239000000427 antigen Substances 0.000 description 8
- 108091007433 antigens Proteins 0.000 description 8
- 102000036639 antigens Human genes 0.000 description 8
- 239000000969 carrier Substances 0.000 description 8
- 239000003937 drug carrier Substances 0.000 description 8
- 230000028993 immune response Effects 0.000 description 8
- 102000016914 ras Proteins Human genes 0.000 description 8
- 239000003981 vehicle Substances 0.000 description 8
- 108091008874 T cell receptors Proteins 0.000 description 7
- 102000016266 T-Cell Antigen Receptors Human genes 0.000 description 7
- 239000007788 liquid Substances 0.000 description 7
- 238000005457 optimization Methods 0.000 description 7
- 239000004094 surface-active agent Substances 0.000 description 7
- 230000006044 T cell activation Effects 0.000 description 6
- 238000004422 calculation algorithm Methods 0.000 description 6
- 239000003795 chemical substances by application Substances 0.000 description 6
- 238000000205 computational method Methods 0.000 description 6
- 238000003114 enzyme-linked immunosorbent spot assay Methods 0.000 description 6
- -1 polyoxyethylene Polymers 0.000 description 6
- 241000894007 species Species 0.000 description 6
- 239000007921 spray Substances 0.000 description 6
- 230000001225 therapeutic effect Effects 0.000 description 6
- 208000024893 Acute lymphoblastic leukemia Diseases 0.000 description 5
- 208000032791 BCR-ABL1 positive chronic myelogenous leukemia Diseases 0.000 description 5
- 208000037162 Ductal Breast Carcinoma Diseases 0.000 description 5
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Chemical compound OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 5
- 108010076504 Protein Sorting Signals Proteins 0.000 description 5
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 5
- 230000002159 abnormal effect Effects 0.000 description 5
- 239000002671 adjuvant Substances 0.000 description 5
- 230000000890 antigenic effect Effects 0.000 description 5
- 230000003416 augmentation Effects 0.000 description 5
- 235000019441 ethanol Nutrition 0.000 description 5
- 230000036039 immunity Effects 0.000 description 5
- 239000004615 ingredient Substances 0.000 description 5
- 238000004519 manufacturing process Methods 0.000 description 5
- 239000000463 material Substances 0.000 description 5
- 238000004806 packaging method and process Methods 0.000 description 5
- 230000037361 pathway Effects 0.000 description 5
- 239000000546 pharmaceutical excipient Substances 0.000 description 5
- 239000003755 preservative agent Substances 0.000 description 5
- 238000000746 purification Methods 0.000 description 5
- 238000012163 sequencing technique Methods 0.000 description 5
- 230000032258 transport Effects 0.000 description 5
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 5
- GUBGYTABKSRVRQ-XLOQQCSPSA-N Alpha-Lactose Chemical compound O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@H]1O[C@@H]1[C@@H](CO)O[C@H](O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-XLOQQCSPSA-N 0.000 description 4
- GUBGYTABKSRVRQ-QKKXKWKRSA-N Lactose Natural products OC[C@H]1O[C@@H](O[C@H]2[C@H](O)[C@@H](O)C(O)O[C@@H]2CO)[C@H](O)[C@@H](O)[C@H]1O GUBGYTABKSRVRQ-QKKXKWKRSA-N 0.000 description 4
- CZMRCDWAGMRECN-UGDNZRGBSA-N Sucrose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-UGDNZRGBSA-N 0.000 description 4
- 229930006000 Sucrose Natural products 0.000 description 4
- 239000000443 aerosol Substances 0.000 description 4
- WQZGKKKJIJFFOK-VFUOTHLCSA-N beta-D-glucose Chemical compound OC[C@H]1O[C@@H](O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-VFUOTHLCSA-N 0.000 description 4
- 230000008859 change Effects 0.000 description 4
- 210000000349 chromosome Anatomy 0.000 description 4
- 238000003776 cleavage reaction Methods 0.000 description 4
- 230000009260 cross reactivity Effects 0.000 description 4
- 238000002790 cross-validation Methods 0.000 description 4
- 235000011187 glycerol Nutrition 0.000 description 4
- 230000001965 increasing effect Effects 0.000 description 4
- 239000008101 lactose Substances 0.000 description 4
- 238000012545 processing Methods 0.000 description 4
- 230000000069 prophylactic effect Effects 0.000 description 4
- 230000000717 retained effect Effects 0.000 description 4
- 230000007017 scission Effects 0.000 description 4
- 230000028327 secretion Effects 0.000 description 4
- 239000005720 sucrose Substances 0.000 description 4
- 208000023275 Autoimmune disease Diseases 0.000 description 3
- 241001137350 Fratercula Species 0.000 description 3
- 102100029974 GTPase HRas Human genes 0.000 description 3
- 108010010803 Gelatin Proteins 0.000 description 3
- 101000584633 Homo sapiens GTPase HRas Proteins 0.000 description 3
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 3
- 241001465754 Metazoa Species 0.000 description 3
- DNIAPMSPPWPWGF-UHFFFAOYSA-N Propylene glycol Chemical compound CC(O)CO DNIAPMSPPWPWGF-UHFFFAOYSA-N 0.000 description 3
- VYPSYNLAJGMNEJ-UHFFFAOYSA-N Silicium dioxide Chemical compound O=[Si]=O VYPSYNLAJGMNEJ-UHFFFAOYSA-N 0.000 description 3
- 102100022596 Tyrosine-protein kinase ABL1 Human genes 0.000 description 3
- 238000010171 animal model Methods 0.000 description 3
- 230000008901 benefit Effects 0.000 description 3
- 150000001720 carbohydrates Chemical class 0.000 description 3
- 235000014633 carbohydrates Nutrition 0.000 description 3
- 230000010261 cell growth Effects 0.000 description 3
- 230000001143 conditioned effect Effects 0.000 description 3
- 238000012258 culturing Methods 0.000 description 3
- 238000012217 deletion Methods 0.000 description 3
- 230000037430 deletion Effects 0.000 description 3
- 229940079593 drug Drugs 0.000 description 3
- 239000013604 expression vector Substances 0.000 description 3
- 108020001507 fusion proteins Proteins 0.000 description 3
- 102000037865 fusion proteins Human genes 0.000 description 3
- 239000008273 gelatin Substances 0.000 description 3
- 229920000159 gelatin Polymers 0.000 description 3
- 235000019322 gelatine Nutrition 0.000 description 3
- 235000011852 gelatine desserts Nutrition 0.000 description 3
- 239000008103 glucose Substances 0.000 description 3
- 230000013595 glycosylation Effects 0.000 description 3
- 238000006206 glycosylation reaction Methods 0.000 description 3
- 108091008915 immune receptors Proteins 0.000 description 3
- 102000027596 immune receptors Human genes 0.000 description 3
- 210000000987 immune system Anatomy 0.000 description 3
- 230000006058 immune tolerance Effects 0.000 description 3
- 230000006872 improvement Effects 0.000 description 3
- 230000006698 induction Effects 0.000 description 3
- 238000003780 insertion Methods 0.000 description 3
- 230000037431 insertion Effects 0.000 description 3
- 230000003993 interaction Effects 0.000 description 3
- 210000004962 mammalian cell Anatomy 0.000 description 3
- 239000003550 marker Substances 0.000 description 3
- 230000007246 mechanism Effects 0.000 description 3
- 239000002105 nanoparticle Substances 0.000 description 3
- 239000003921 oil Substances 0.000 description 3
- 229920001184 polypeptide Polymers 0.000 description 3
- 150000003839 salts Chemical class 0.000 description 3
- 239000011780 sodium chloride Substances 0.000 description 3
- RYYKJJJTJZKILX-UHFFFAOYSA-M sodium octadecanoate Chemical compound [Na+].CCCCCCCCCCCCCCCCCC([O-])=O RYYKJJJTJZKILX-UHFFFAOYSA-M 0.000 description 3
- 239000007787 solid Substances 0.000 description 3
- 239000002904 solvent Substances 0.000 description 3
- 239000003381 stabilizer Substances 0.000 description 3
- 239000000126 substance Substances 0.000 description 3
- 238000012360 testing method Methods 0.000 description 3
- 210000001519 tissue Anatomy 0.000 description 3
- 230000003612 virological effect Effects 0.000 description 3
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 2
- CIWBSHSKHKDKBQ-JLAZNSOCSA-N Ascorbic acid Chemical compound OC[C@H](O)[C@H]1OC(=O)C(O)=C1O CIWBSHSKHKDKBQ-JLAZNSOCSA-N 0.000 description 2
- 108020004705 Codon Proteins 0.000 description 2
- FBPFZTCFMRRESA-FSIIMWSLSA-N D-Glucitol Natural products OC[C@H](O)[C@H](O)[C@@H](O)[C@H](O)CO FBPFZTCFMRRESA-FSIIMWSLSA-N 0.000 description 2
- FBPFZTCFMRRESA-KVTDHHQDSA-N D-Mannitol Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)[C@H](O)CO FBPFZTCFMRRESA-KVTDHHQDSA-N 0.000 description 2
- 102100039788 GTPase NRas Human genes 0.000 description 2
- 108010036972 HLA-A11 Antigen Proteins 0.000 description 2
- 108010056113 HLA-B55 antigen Proteins 0.000 description 2
- 108010043021 HLA-B58 Proteins 0.000 description 2
- 102000018713 Histocompatibility Antigens Class II Human genes 0.000 description 2
- 101000744505 Homo sapiens GTPase NRas Proteins 0.000 description 2
- 206010069755 K-ras gene mutation Diseases 0.000 description 2
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-Proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 description 2
- 229930195725 Mannitol Natural products 0.000 description 2
- 235000019483 Peanut oil Nutrition 0.000 description 2
- 101710177166 Phosphoprotein Proteins 0.000 description 2
- 229920003171 Poly (ethylene oxide) Polymers 0.000 description 2
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 2
- 229940022005 RNA vaccine Drugs 0.000 description 2
- 229920002684 Sepharose Polymers 0.000 description 2
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 2
- 229920002472 Starch Polymers 0.000 description 2
- 230000004913 activation Effects 0.000 description 2
- 230000002776 aggregation Effects 0.000 description 2
- 238000004220 aggregation Methods 0.000 description 2
- 239000003963 antioxidant agent Substances 0.000 description 2
- 235000006708 antioxidants Nutrition 0.000 description 2
- 238000013459 approach Methods 0.000 description 2
- 235000003704 aspartic acid Nutrition 0.000 description 2
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 description 2
- 239000000872 buffer Substances 0.000 description 2
- 235000019282 butylated hydroxyanisole Nutrition 0.000 description 2
- 238000009566 cancer vaccine Methods 0.000 description 2
- 229940022399 cancer vaccine Drugs 0.000 description 2
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 2
- 239000003729 cation exchange resin Substances 0.000 description 2
- 150000001768 cations Chemical class 0.000 description 2
- 238000006243 chemical reaction Methods 0.000 description 2
- 238000004587 chromatography analysis Methods 0.000 description 2
- 238000002648 combination therapy Methods 0.000 description 2
- 230000003247 decreasing effect Effects 0.000 description 2
- 235000014113 dietary fatty acids Nutrition 0.000 description 2
- 238000009826 distribution Methods 0.000 description 2
- 238000012377 drug delivery Methods 0.000 description 2
- 230000000694 effects Effects 0.000 description 2
- 239000000839 emulsion Substances 0.000 description 2
- MMXKVMNBHPAILY-UHFFFAOYSA-N ethyl laurate Chemical compound CCCCCCCCCCCC(=O)OCC MMXKVMNBHPAILY-UHFFFAOYSA-N 0.000 description 2
- 238000011156 evaluation Methods 0.000 description 2
- 239000000194 fatty acid Substances 0.000 description 2
- 229930195729 fatty acid Natural products 0.000 description 2
- 239000011888 foil Substances 0.000 description 2
- 125000003630 glycyl group Chemical group [H]N([H])C([H])([H])C(*)=O 0.000 description 2
- 239000008187 granular material Substances 0.000 description 2
- 230000003053 immunization Effects 0.000 description 2
- 238000002649 immunization Methods 0.000 description 2
- 238000002513 implantation Methods 0.000 description 2
- 238000010348 incorporation Methods 0.000 description 2
- 238000002347 injection Methods 0.000 description 2
- 239000007924 injection Substances 0.000 description 2
- 230000010354 integration Effects 0.000 description 2
- 230000000670 limiting effect Effects 0.000 description 2
- 150000002632 lipids Chemical class 0.000 description 2
- 239000002502 liposome Substances 0.000 description 2
- 239000006193 liquid solution Substances 0.000 description 2
- 239000008176 lyophilized powder Substances 0.000 description 2
- 238000010801 machine learning Methods 0.000 description 2
- 239000000594 mannitol Substances 0.000 description 2
- 235000010355 mannitol Nutrition 0.000 description 2
- 238000013507 mapping Methods 0.000 description 2
- 230000007935 neutral effect Effects 0.000 description 2
- 235000019198 oils Nutrition 0.000 description 2
- 239000002674 ointment Substances 0.000 description 2
- 210000000056 organ Anatomy 0.000 description 2
- 239000000312 peanut oil Substances 0.000 description 2
- 239000008177 pharmaceutical agent Substances 0.000 description 2
- 230000000144 pharmacologic effect Effects 0.000 description 2
- YBYRMVIVWMBXKQ-UHFFFAOYSA-N phenylmethanesulfonyl fluoride Chemical compound FS(=O)(=O)CC1=CC=CC=C1 YBYRMVIVWMBXKQ-UHFFFAOYSA-N 0.000 description 2
- 239000004033 plastic Substances 0.000 description 2
- 229920003023 plastic Polymers 0.000 description 2
- 235000010482 polyoxyethylene sorbitan monooleate Nutrition 0.000 description 2
- 239000000244 polyoxyethylene sorbitan monooleate Substances 0.000 description 2
- 239000000843 powder Substances 0.000 description 2
- 238000001556 precipitation Methods 0.000 description 2
- 238000002360 preparation method Methods 0.000 description 2
- 230000009145 protein modification Effects 0.000 description 2
- 230000012743 protein tagging Effects 0.000 description 2
- 230000005855 radiation Effects 0.000 description 2
- 230000001105 regulatory effect Effects 0.000 description 2
- 102200124924 rs11554290 Human genes 0.000 description 2
- 102220050123 rs121913233 Human genes 0.000 description 2
- 102200006520 rs121913240 Human genes 0.000 description 2
- 239000008159 sesame oil Substances 0.000 description 2
- 235000011803 sesame oil Nutrition 0.000 description 2
- 235000010356 sorbitol Nutrition 0.000 description 2
- 239000000600 sorbitol Substances 0.000 description 2
- 239000003549 soybean oil Substances 0.000 description 2
- 235000012424 soybean oil Nutrition 0.000 description 2
- 230000006641 stabilisation Effects 0.000 description 2
- 238000011105 stabilization Methods 0.000 description 2
- 235000019698 starch Nutrition 0.000 description 2
- 239000000829 suppository Substances 0.000 description 2
- 239000000725 suspension Substances 0.000 description 2
- 239000006188 syrup Substances 0.000 description 2
- 235000020357 syrup Nutrition 0.000 description 2
- 239000000454 talc Substances 0.000 description 2
- 229910052623 talc Inorganic materials 0.000 description 2
- 238000002560 therapeutic procedure Methods 0.000 description 2
- 238000002255 vaccination Methods 0.000 description 2
- HDTRYLNUVZCQOY-UHFFFAOYSA-N α-D-glucopyranosyl-α-D-glucopyranoside Natural products OC1C(O)C(O)C(CO)OC1OC1C(O)C(O)C(O)C(CO)O1 HDTRYLNUVZCQOY-UHFFFAOYSA-N 0.000 description 1
- PUPZLCDOIYMWBV-UHFFFAOYSA-N (+/-)-1,3-Butanediol Chemical compound CC(O)CCO PUPZLCDOIYMWBV-UHFFFAOYSA-N 0.000 description 1
- YYGNTYWPHWGJRM-UHFFFAOYSA-N (6E,10E,14E,18E)-2,6,10,15,19,23-hexamethyltetracosa-2,6,10,14,18,22-hexaene Chemical compound CC(C)=CCCC(C)=CCCC(C)=CCCC=C(C)CCC=C(C)CCC=C(C)C YYGNTYWPHWGJRM-UHFFFAOYSA-N 0.000 description 1
- WRIDQFICGBMAFQ-UHFFFAOYSA-N (E)-8-Octadecenoic acid Natural products CCCCCCCCCC=CCCCCCCC(O)=O WRIDQFICGBMAFQ-UHFFFAOYSA-N 0.000 description 1
- NWUYHJFMYQTDRP-UHFFFAOYSA-N 1,2-bis(ethenyl)benzene;1-ethenyl-2-ethylbenzene;styrene Chemical compound C=CC1=CC=CC=C1.CCC1=CC=CC=C1C=C.C=CC1=CC=CC=C1C=C NWUYHJFMYQTDRP-UHFFFAOYSA-N 0.000 description 1
- BFSVOASYOCHEOV-UHFFFAOYSA-N 2-diethylaminoethanol Chemical compound CCN(CC)CCO BFSVOASYOCHEOV-UHFFFAOYSA-N 0.000 description 1
- LQJBNNIYVWPHFW-UHFFFAOYSA-N 20:1omega9c fatty acid Natural products CCCCCCCCCCC=CCCCCCCCC(O)=O LQJBNNIYVWPHFW-UHFFFAOYSA-N 0.000 description 1
- QSBYPNXLFMSGKH-UHFFFAOYSA-N 9-Heptadecensaeure Natural products CCCCCCCC=CCCCCCCCC(O)=O QSBYPNXLFMSGKH-UHFFFAOYSA-N 0.000 description 1
- 229920001817 Agar Polymers 0.000 description 1
- 108010088751 Albumins Proteins 0.000 description 1
- 102000009027 Albumins Human genes 0.000 description 1
- 241000710929 Alphavirus Species 0.000 description 1
- 239000004475 Arginine Substances 0.000 description 1
- 241000416162 Astragalus gummifer Species 0.000 description 1
- 241000894006 Bacteria Species 0.000 description 1
- 101150049556 Bcr gene Proteins 0.000 description 1
- 239000004255 Butylated hydroxyanisole Substances 0.000 description 1
- 102100025570 Cancer/testis antigen 1 Human genes 0.000 description 1
- 206010009944 Colon cancer Diseases 0.000 description 1
- 208000001333 Colorectal Neoplasms Diseases 0.000 description 1
- 229920002261 Corn starch Polymers 0.000 description 1
- FBPFZTCFMRRESA-JGWLITMVSA-N D-glucitol Chemical compound OC[C@H](O)[C@@H](O)[C@H](O)[C@H](O)CO FBPFZTCFMRRESA-JGWLITMVSA-N 0.000 description 1
- 230000004543 DNA replication Effects 0.000 description 1
- LVGKNOAMLMIIKO-UHFFFAOYSA-N Elaidinsaeure-aethylester Natural products CCCCCCCCC=CCCCCCCCC(=O)OCC LVGKNOAMLMIIKO-UHFFFAOYSA-N 0.000 description 1
- 241000196324 Embryophyta Species 0.000 description 1
- 108010067770 Endopeptidase K Proteins 0.000 description 1
- 239000001856 Ethyl cellulose Substances 0.000 description 1
- ZZSNKZQZMQGXPY-UHFFFAOYSA-N Ethyl cellulose Chemical compound CCOCC1OC(OC)C(OCC)C(OCC)C1OC1C(O)C(O)C(OC)C(CO)O1 ZZSNKZQZMQGXPY-UHFFFAOYSA-N 0.000 description 1
- 206010064571 Gene mutation Diseases 0.000 description 1
- 108700039691 Genetic Promoter Regions Proteins 0.000 description 1
- 102100028972 HLA class I histocompatibility antigen, A alpha chain Human genes 0.000 description 1
- 102100028976 HLA class I histocompatibility antigen, B alpha chain Human genes 0.000 description 1
- 102100028971 HLA class I histocompatibility antigen, C alpha chain Human genes 0.000 description 1
- 108010075704 HLA-A Antigens Proteins 0.000 description 1
- 108010058607 HLA-B Antigens Proteins 0.000 description 1
- 108010052199 HLA-C Antigens Proteins 0.000 description 1
- 102000015789 HLA-DP Antigens Human genes 0.000 description 1
- 108010010378 HLA-DP Antigens Proteins 0.000 description 1
- 108010062347 HLA-DQ Antigens Proteins 0.000 description 1
- 102000006354 HLA-DR Antigens Human genes 0.000 description 1
- 108010058597 HLA-DR Antigens Proteins 0.000 description 1
- 241000238631 Hexapoda Species 0.000 description 1
- 101000856237 Homo sapiens Cancer/testis antigen 1 Proteins 0.000 description 1
- 108090000144 Human Proteins Proteins 0.000 description 1
- 102000003839 Human Proteins Human genes 0.000 description 1
- 108060003951 Immunoglobulin Proteins 0.000 description 1
- 206010062016 Immunosuppression Diseases 0.000 description 1
- 101150105104 Kras gene Proteins 0.000 description 1
- XUJNEKJLAYXESH-REOHCLBHSA-N L-Cysteine Chemical compound SC[C@H](N)C(O)=O XUJNEKJLAYXESH-REOHCLBHSA-N 0.000 description 1
- LEVWYRKDKASIDU-IMJSIDKUSA-N L-cystine Chemical compound [O-]C(=O)[C@@H]([NH3+])CSSC[C@H]([NH3+])C([O-])=O LEVWYRKDKASIDU-IMJSIDKUSA-N 0.000 description 1
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 description 1
- 240000007472 Leucaena leucocephala Species 0.000 description 1
- 235000010643 Leucaena leucocephala Nutrition 0.000 description 1
- 239000004472 Lysine Substances 0.000 description 1
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 1
- 241000124008 Mammalia Species 0.000 description 1
- 102000018697 Membrane Proteins Human genes 0.000 description 1
- 108010052285 Membrane Proteins Proteins 0.000 description 1
- GXCLVBGFBYZDAG-UHFFFAOYSA-N N-[2-(1H-indol-3-yl)ethyl]-N-methylprop-2-en-1-amine Chemical compound CN(CCC1=CNC2=C1C=CC=C2)CC=C GXCLVBGFBYZDAG-UHFFFAOYSA-N 0.000 description 1
- 244000061176 Nicotiana tabacum Species 0.000 description 1
- 235000002637 Nicotiana tabacum Nutrition 0.000 description 1
- 108091092724 Noncoding DNA Proteins 0.000 description 1
- 208000008589 Obesity Diseases 0.000 description 1
- 239000005642 Oleic acid Substances 0.000 description 1
- ZQPPMHVWECSIRJ-UHFFFAOYSA-N Oleic acid Natural products CCCCCCCCC=CCCCCCCCC(O)=O ZQPPMHVWECSIRJ-UHFFFAOYSA-N 0.000 description 1
- 108700020796 Oncogene Proteins 0.000 description 1
- 102000043276 Oncogene Human genes 0.000 description 1
- 240000007594 Oryza sativa Species 0.000 description 1
- 235000007164 Oryza sativa Nutrition 0.000 description 1
- 239000002202 Polyethylene glycol Substances 0.000 description 1
- 229920001213 Polysorbate 20 Polymers 0.000 description 1
- 108010029485 Protein Isoforms Proteins 0.000 description 1
- 102000001708 Protein Isoforms Human genes 0.000 description 1
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 1
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 1
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 1
- 235000019485 Safflower oil Nutrition 0.000 description 1
- 229920005654 Sephadex Polymers 0.000 description 1
- 239000012507 Sephadex™ Substances 0.000 description 1
- 208000000453 Skin Neoplasms Diseases 0.000 description 1
- 239000004147 Sorbitan trioleate Substances 0.000 description 1
- PRXRUNOAOLTIEF-ADSICKODSA-N Sorbitan trioleate Chemical compound CCCCCCCC\C=C/CCCCCCCC(=O)OC[C@@H](OC(=O)CCCCCCC\C=C/CCCCCCCC)[C@H]1OC[C@H](O)[C@H]1OC(=O)CCCCCCC\C=C/CCCCCCCC PRXRUNOAOLTIEF-ADSICKODSA-N 0.000 description 1
- 101710172711 Structural protein Proteins 0.000 description 1
- 241000282898 Sus scrofa Species 0.000 description 1
- 230000005867 T cell response Effects 0.000 description 1
- BHEOSNUKNHRBNM-UHFFFAOYSA-N Tetramethylsqualene Natural products CC(=C)C(C)CCC(=C)C(C)CCC(C)=CCCC=C(C)CCC(C)C(=C)CCC(C)C(C)=C BHEOSNUKNHRBNM-UHFFFAOYSA-N 0.000 description 1
- 208000024770 Thyroid neoplasm Diseases 0.000 description 1
- 229920001615 Tragacanth Polymers 0.000 description 1
- HDTRYLNUVZCQOY-WSWWMNSNSA-N Trehalose Natural products O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@@H]1O[C@@H]1[C@H](O)[C@@H](O)[C@@H](O)[C@@H](CO)O1 HDTRYLNUVZCQOY-WSWWMNSNSA-N 0.000 description 1
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Natural products CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 description 1
- HCHKCACWOHOZIP-UHFFFAOYSA-N Zinc Chemical compound [Zn] HCHKCACWOHOZIP-UHFFFAOYSA-N 0.000 description 1
- 230000001594 aberrant effect Effects 0.000 description 1
- DPXJVFZANSGRMM-UHFFFAOYSA-N acetic acid;2,3,4,5,6-pentahydroxyhexanal;sodium Chemical compound [Na].CC(O)=O.OCC(O)C(O)C(O)C(O)C=O DPXJVFZANSGRMM-UHFFFAOYSA-N 0.000 description 1
- 230000002378 acidificating effect Effects 0.000 description 1
- 230000001154 acute effect Effects 0.000 description 1
- 239000000654 additive Substances 0.000 description 1
- 238000001042 affinity chromatography Methods 0.000 description 1
- 239000008272 agar Substances 0.000 description 1
- 150000001298 alcohols Chemical class 0.000 description 1
- 235000010443 alginic acid Nutrition 0.000 description 1
- 239000000783 alginic acid Substances 0.000 description 1
- 229920000615 alginic acid Polymers 0.000 description 1
- 229960001126 alginic acid Drugs 0.000 description 1
- 150000004781 alginic acids Chemical class 0.000 description 1
- HDTRYLNUVZCQOY-LIZSDCNHSA-N alpha,alpha-trehalose Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@@H]1O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 HDTRYLNUVZCQOY-LIZSDCNHSA-N 0.000 description 1
- 229940037003 alum Drugs 0.000 description 1
- WNROFYMDJYEPJX-UHFFFAOYSA-K aluminium hydroxide Chemical compound [OH-].[OH-].[OH-].[Al+3] WNROFYMDJYEPJX-UHFFFAOYSA-K 0.000 description 1
- 150000003862 amino acid derivatives Chemical class 0.000 description 1
- 238000012870 ammonium sulfate precipitation Methods 0.000 description 1
- 230000003321 amplification Effects 0.000 description 1
- 239000003708 ampul Substances 0.000 description 1
- 210000004102 animal cell Anatomy 0.000 description 1
- 238000004135 animal tissue culture Methods 0.000 description 1
- 239000003957 anion exchange resin Substances 0.000 description 1
- 239000003242 anti bacterial agent Substances 0.000 description 1
- 229940088710 antibiotic agent Drugs 0.000 description 1
- 210000000612 antigen-presenting cell Anatomy 0.000 description 1
- 239000007900 aqueous suspension Substances 0.000 description 1
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 1
- 238000013528 artificial neural network Methods 0.000 description 1
- 235000010323 ascorbic acid Nutrition 0.000 description 1
- 229960005070 ascorbic acid Drugs 0.000 description 1
- 239000011668 ascorbic acid Substances 0.000 description 1
- 238000000889 atomisation Methods 0.000 description 1
- 230000003190 augmentative effect Effects 0.000 description 1
- 229960000686 benzalkonium chloride Drugs 0.000 description 1
- CADWTSSKOVRVJC-UHFFFAOYSA-N benzyl(dimethyl)azanium;chloride Chemical compound [Cl-].C[NH+](C)CC1=CC=CC=C1 CADWTSSKOVRVJC-UHFFFAOYSA-N 0.000 description 1
- 210000004369 blood Anatomy 0.000 description 1
- 239000008280 blood Substances 0.000 description 1
- 239000006172 buffering agent Substances 0.000 description 1
- CZBZUDVBLSSABA-UHFFFAOYSA-N butylated hydroxyanisole Chemical compound COC1=CC=C(O)C(C(C)(C)C)=C1.COC1=CC=C(O)C=C1C(C)(C)C CZBZUDVBLSSABA-UHFFFAOYSA-N 0.000 description 1
- 229940043253 butylated hydroxyanisole Drugs 0.000 description 1
- 239000002775 capsule Substances 0.000 description 1
- 239000001768 carboxy methyl cellulose Substances 0.000 description 1
- 230000000747 cardiac effect Effects 0.000 description 1
- 125000002091 cationic group Chemical class 0.000 description 1
- 230000001364 causal effect Effects 0.000 description 1
- 230000020411 cell activation Effects 0.000 description 1
- 238000004113 cell culture Methods 0.000 description 1
- 239000006143 cell culture medium Substances 0.000 description 1
- 239000013592 cell lysate Substances 0.000 description 1
- 230000001413 cellular effect Effects 0.000 description 1
- 239000001913 cellulose Substances 0.000 description 1
- 229920002678 cellulose Polymers 0.000 description 1
- 229920002301 cellulose acetate Polymers 0.000 description 1
- 239000003638 chemical reducing agent Substances 0.000 description 1
- 238000002512 chemotherapy Methods 0.000 description 1
- 238000011098 chromatofocusing Methods 0.000 description 1
- 230000001684 chronic effect Effects 0.000 description 1
- 208000037976 chronic inflammation Diseases 0.000 description 1
- 230000006020 chronic inflammation Effects 0.000 description 1
- 229940110456 cocoa butter Drugs 0.000 description 1
- 235000019868 cocoa butter Nutrition 0.000 description 1
- 238000007906 compression Methods 0.000 description 1
- 230000006835 compression Effects 0.000 description 1
- 238000005094 computer simulation Methods 0.000 description 1
- 239000000356 contaminant Substances 0.000 description 1
- 235000005687 corn oil Nutrition 0.000 description 1
- 239000002285 corn oil Substances 0.000 description 1
- 239000008120 corn starch Substances 0.000 description 1
- 235000012343 cottonseed oil Nutrition 0.000 description 1
- 239000002385 cottonseed oil Substances 0.000 description 1
- 239000013078 crystal Substances 0.000 description 1
- 235000018417 cysteine Nutrition 0.000 description 1
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 1
- 229960003067 cystine Drugs 0.000 description 1
- 231100000135 cytotoxicity Toxicity 0.000 description 1
- 230000003013 cytotoxicity Effects 0.000 description 1
- GVJHHUAWPYXKBD-UHFFFAOYSA-N d-alpha-tocopherol Natural products OC1=C(C)C(C)=C2OC(CCCC(C)CCCC(C)CCCC(C)C)(C)CCC2=C1C GVJHHUAWPYXKBD-UHFFFAOYSA-N 0.000 description 1
- 239000003405 delayed action preparation Substances 0.000 description 1
- 239000008121 dextrose Substances 0.000 description 1
- 238000003745 diagnosis Methods 0.000 description 1
- 235000005911 diet Nutrition 0.000 description 1
- 230000037213 diet Effects 0.000 description 1
- 239000003085 diluting agent Substances 0.000 description 1
- 230000003467 diminishing effect Effects 0.000 description 1
- 229940042399 direct acting antivirals protease inhibitors Drugs 0.000 description 1
- 208000035475 disorder Diseases 0.000 description 1
- PRAKJMSDJKAYCZ-UHFFFAOYSA-N dodecahydrosqualene Natural products CC(C)CCCC(C)CCCC(C)CCCCC(C)CCCC(C)CCCC(C)C PRAKJMSDJKAYCZ-UHFFFAOYSA-N 0.000 description 1
- 230000008030 elimination Effects 0.000 description 1
- 238000003379 elimination reaction Methods 0.000 description 1
- 239000003995 emulsifying agent Substances 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 239000003623 enhancer Substances 0.000 description 1
- 230000002708 enhancing effect Effects 0.000 description 1
- 150000002148 esters Chemical class 0.000 description 1
- 238000012869 ethanol precipitation Methods 0.000 description 1
- 235000019325 ethyl cellulose Nutrition 0.000 description 1
- 229920001249 ethyl cellulose Polymers 0.000 description 1
- LVGKNOAMLMIIKO-QXMHVHEDSA-N ethyl oleate Chemical compound CCCCCCCC\C=C/CCCCCCCC(=O)OCC LVGKNOAMLMIIKO-QXMHVHEDSA-N 0.000 description 1
- 229940093471 ethyl oleate Drugs 0.000 description 1
- 210000003527 eukaryotic cell Anatomy 0.000 description 1
- 239000000945 filler Substances 0.000 description 1
- 239000010408 film Substances 0.000 description 1
- 238000004108 freeze drying Methods 0.000 description 1
- 239000000499 gel Substances 0.000 description 1
- 238000005227 gel permeation chromatography Methods 0.000 description 1
- 238000002523 gelfiltration Methods 0.000 description 1
- 102000054767 gene variant Human genes 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 150000002334 glycols Chemical class 0.000 description 1
- 239000001963 growth medium Substances 0.000 description 1
- 238000004128 high performance liquid chromatography Methods 0.000 description 1
- 239000005556 hormone Substances 0.000 description 1
- 229940088597 hormone Drugs 0.000 description 1
- 210000005260 human cell Anatomy 0.000 description 1
- 150000004677 hydrates Chemical class 0.000 description 1
- 229920001600 hydrophobic polymer Polymers 0.000 description 1
- 230000002519 immonomodulatory effect Effects 0.000 description 1
- 230000008102 immune modulation Effects 0.000 description 1
- 102000018358 immunoglobulin Human genes 0.000 description 1
- 230000006054 immunological memory Effects 0.000 description 1
- 230000001506 immunosuppresive effect Effects 0.000 description 1
- 238000009169 immunotherapy Methods 0.000 description 1
- 238000000338 in vitro Methods 0.000 description 1
- 230000001939 inductive effect Effects 0.000 description 1
- 239000012678 infectious agent Substances 0.000 description 1
- 230000002458 infectious effect Effects 0.000 description 1
- 238000001802 infusion Methods 0.000 description 1
- 108091006086 inhibitor proteins Proteins 0.000 description 1
- 230000000266 injurious effect Effects 0.000 description 1
- 238000007915 intraurethral administration Methods 0.000 description 1
- 238000005342 ion exchange Methods 0.000 description 1
- 238000004255 ion exchange chromatography Methods 0.000 description 1
- 238000002955 isolation Methods 0.000 description 1
- QXJSBBXBKPUZAA-UHFFFAOYSA-N isooleic acid Natural products CCCCCCCC=CCCCCCCCCC(O)=O QXJSBBXBKPUZAA-UHFFFAOYSA-N 0.000 description 1
- 238000005304 joining Methods 0.000 description 1
- 210000000265 leukocyte Anatomy 0.000 description 1
- 239000003446 ligand Substances 0.000 description 1
- VTHJTEIRLNZDEV-UHFFFAOYSA-L magnesium dihydroxide Chemical compound [OH-].[OH-].[Mg+2] VTHJTEIRLNZDEV-UHFFFAOYSA-L 0.000 description 1
- 239000000347 magnesium hydroxide Substances 0.000 description 1
- 229910001862 magnesium hydroxide Inorganic materials 0.000 description 1
- 239000002609 medium Substances 0.000 description 1
- 229910052751 metal Inorganic materials 0.000 description 1
- 239000002184 metal Substances 0.000 description 1
- 125000002496 methyl group Chemical group [H]C([H])([H])* 0.000 description 1
- 235000010270 methyl p-hydroxybenzoate Nutrition 0.000 description 1
- 239000004292 methyl p-hydroxybenzoate Substances 0.000 description 1
- 239000011859 microparticle Substances 0.000 description 1
- 239000002480 mineral oil Substances 0.000 description 1
- 235000010446 mineral oil Nutrition 0.000 description 1
- 210000005087 mononuclear cell Anatomy 0.000 description 1
- 238000010172 mouse model Methods 0.000 description 1
- 239000002324 mouth wash Substances 0.000 description 1
- 239000007908 nanoemulsion Substances 0.000 description 1
- 230000003472 neutralizing effect Effects 0.000 description 1
- 231100000252 nontoxic Toxicity 0.000 description 1
- 230000003000 nontoxic effect Effects 0.000 description 1
- 238000003199 nucleic acid amplification method Methods 0.000 description 1
- 239000002773 nucleotide Substances 0.000 description 1
- 125000003729 nucleotide group Chemical group 0.000 description 1
- 235000020824 obesity Nutrition 0.000 description 1
- ZQPPMHVWECSIRJ-KTKRTIGZSA-N oleic acid Chemical compound CCCCCCCC\C=C/CCCCCCCC(O)=O ZQPPMHVWECSIRJ-KTKRTIGZSA-N 0.000 description 1
- 239000004006 olive oil Substances 0.000 description 1
- 235000008390 olive oil Nutrition 0.000 description 1
- 231100000590 oncogenic Toxicity 0.000 description 1
- 230000002246 oncogenic effect Effects 0.000 description 1
- 239000006174 pH buffer Substances 0.000 description 1
- 239000002245 particle Substances 0.000 description 1
- 235000010603 pastilles Nutrition 0.000 description 1
- 239000003961 penetration enhancing agent Substances 0.000 description 1
- 230000006919 peptide aggregation Effects 0.000 description 1
- 239000000137 peptide hydrolase inhibitor Substances 0.000 description 1
- 239000003208 petroleum Substances 0.000 description 1
- 239000008363 phosphate buffer Substances 0.000 description 1
- 230000026731 phosphorylation Effects 0.000 description 1
- 238000006366 phosphorylation reaction Methods 0.000 description 1
- 239000013612 plasmid Substances 0.000 description 1
- 229920001223 polyethylene glycol Polymers 0.000 description 1
- 229920005862 polyol Polymers 0.000 description 1
- 150000003077 polyols Chemical class 0.000 description 1
- 235000010486 polyoxyethylene sorbitan monolaurate Nutrition 0.000 description 1
- 239000000256 polyoxyethylene sorbitan monolaurate Substances 0.000 description 1
- 229940068977 polysorbate 20 Drugs 0.000 description 1
- 229920000053 polysorbate 80 Polymers 0.000 description 1
- 229940068968 polysorbate 80 Drugs 0.000 description 1
- 230000004481 post-translational protein modification Effects 0.000 description 1
- 230000001323 posttranslational effect Effects 0.000 description 1
- 229920001592 potato starch Polymers 0.000 description 1
- 230000002335 preservative effect Effects 0.000 description 1
- 230000002265 prevention Effects 0.000 description 1
- 230000003449 preventive effect Effects 0.000 description 1
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 1
- 229940021993 prophylactic vaccine Drugs 0.000 description 1
- 238000011321 prophylaxis Methods 0.000 description 1
- 235000010232 propyl p-hydroxybenzoate Nutrition 0.000 description 1
- 239000004405 propyl p-hydroxybenzoate Substances 0.000 description 1
- QELSKZZBTMNZEB-UHFFFAOYSA-N propylparaben Chemical compound CCCOC(=O)C1=CC=C(O)C=C1 QELSKZZBTMNZEB-UHFFFAOYSA-N 0.000 description 1
- 230000017854 proteolysis Effects 0.000 description 1
- 238000012797 qualification Methods 0.000 description 1
- 238000011084 recovery Methods 0.000 description 1
- 230000010076 replication Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 239000011347 resin Substances 0.000 description 1
- 229920005989 resin Polymers 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 238000004007 reversed phase HPLC Methods 0.000 description 1
- 235000009566 rice Nutrition 0.000 description 1
- 102200006616 rs104894230 Human genes 0.000 description 1
- 102220197775 rs1057519695 Human genes 0.000 description 1
- 102220053950 rs121913238 Human genes 0.000 description 1
- 102200006525 rs121913240 Human genes 0.000 description 1
- 102200124923 rs121913254 Human genes 0.000 description 1
- 102200006648 rs28933406 Human genes 0.000 description 1
- 235000005713 safflower oil Nutrition 0.000 description 1
- 239000003813 safflower oil Substances 0.000 description 1
- 150000003335 secondary amines Chemical class 0.000 description 1
- 238000007493 shaping process Methods 0.000 description 1
- 230000019491 signal transduction Effects 0.000 description 1
- 230000011664 signaling Effects 0.000 description 1
- 239000000741 silica gel Substances 0.000 description 1
- 229910002027 silica gel Inorganic materials 0.000 description 1
- 239000000377 silicon dioxide Substances 0.000 description 1
- 238000001542 size-exclusion chromatography Methods 0.000 description 1
- 235000020183 skimmed milk Nutrition 0.000 description 1
- 201000000849 skin cancer Diseases 0.000 description 1
- 229940126586 small molecule drug Drugs 0.000 description 1
- 150000003384 small molecules Chemical class 0.000 description 1
- 235000019812 sodium carboxymethyl cellulose Nutrition 0.000 description 1
- 229920001027 sodium carboxymethylcellulose Polymers 0.000 description 1
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 1
- 239000012453 solvate Substances 0.000 description 1
- 235000019337 sorbitan trioleate Nutrition 0.000 description 1
- 229960000391 sorbitan trioleate Drugs 0.000 description 1
- 239000008347 soybean phospholipid Substances 0.000 description 1
- 230000009870 specific binding Effects 0.000 description 1
- 238000001694 spray drying Methods 0.000 description 1
- 229940031439 squalene Drugs 0.000 description 1
- TUHBEKDERLKLEC-UHFFFAOYSA-N squalene Natural products CC(=CCCC(=CCCC(=CCCC=C(/C)CCC=C(/C)CC=C(C)C)C)C)C TUHBEKDERLKLEC-UHFFFAOYSA-N 0.000 description 1
- 230000010473 stable expression Effects 0.000 description 1
- 238000003153 stable transfection Methods 0.000 description 1
- 239000008107 starch Substances 0.000 description 1
- 238000003860 storage Methods 0.000 description 1
- 235000000346 sugar Nutrition 0.000 description 1
- 150000008163 sugars Chemical class 0.000 description 1
- 239000003826 tablet Substances 0.000 description 1
- 210000001550 testis Anatomy 0.000 description 1
- 229940124597 therapeutic agent Drugs 0.000 description 1
- 229940021747 therapeutic vaccine Drugs 0.000 description 1
- 201000002510 thyroid cancer Diseases 0.000 description 1
- 235000010384 tocopherol Nutrition 0.000 description 1
- 229960001295 tocopherol Drugs 0.000 description 1
- 229930003799 tocopherol Natural products 0.000 description 1
- 239000011732 tocopherol Substances 0.000 description 1
- 235000010487 tragacanth Nutrition 0.000 description 1
- 239000000196 tragacanth Substances 0.000 description 1
- 229940116362 tragacanth Drugs 0.000 description 1
- 238000003151 transfection method Methods 0.000 description 1
- 230000001052 transient effect Effects 0.000 description 1
- 238000011144 upstream manufacturing Methods 0.000 description 1
- 239000004474 valine Substances 0.000 description 1
- 239000013598 vector Substances 0.000 description 1
- 235000013311 vegetables Nutrition 0.000 description 1
- 239000001993 wax Substances 0.000 description 1
- 229910052725 zinc Inorganic materials 0.000 description 1
- 239000011701 zinc Substances 0.000 description 1
- GVJHHUAWPYXKBD-IEOSBIPESA-N α-tocopherol Chemical compound OC1=C(C)C(C)=C2O[C@@](CCC[C@H](C)CCC[C@H](C)CCCC(C)C)(C)CCC2=C1C GVJHHUAWPYXKBD-IEOSBIPESA-N 0.000 description 1
Classifications
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P35/00—Antineoplastic agents
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K39/0005—Vertebrate antigens
- A61K39/0011—Cancer antigens
- A61K39/001154—Enzymes
- A61K39/001164—GTPases, e.g. Ras or Rho
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/82—Translation products from oncogenes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y306/00—Hydrolases acting on acid anhydrides (3.6)
- C12Y306/05—Hydrolases acting on acid anhydrides (3.6) acting on GTP; involved in cellular and subcellular movement (3.6.5)
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K38/00—Medicinal preparations containing peptides
Landscapes
- Health & Medical Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Organic Chemistry (AREA)
- General Health & Medical Sciences (AREA)
- Medicinal Chemistry (AREA)
- Genetics & Genomics (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Wood Science & Technology (AREA)
- Zoology (AREA)
- Animal Behavior & Ethology (AREA)
- Pharmacology & Pharmacy (AREA)
- Public Health (AREA)
- Veterinary Medicine (AREA)
- Engineering & Computer Science (AREA)
- Biochemistry (AREA)
- Microbiology (AREA)
- Oncology (AREA)
- Immunology (AREA)
- General Engineering & Computer Science (AREA)
- Mycology (AREA)
- Epidemiology (AREA)
- Molecular Biology (AREA)
- Biophysics (AREA)
- Biotechnology (AREA)
- Gastroenterology & Hepatology (AREA)
- Biomedical Technology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Chemical Kinetics & Catalysis (AREA)
- General Chemical & Material Sciences (AREA)
- Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
- Medicines Containing Antibodies Or Antigens For Use As Internal Diagnostic Agents (AREA)
- Peptides Or Proteins (AREA)
- Medicines That Contain Protein Lipid Enzymes And Other Medicines (AREA)
Abstract
The present disclosure provides for methods, systems, and compositions of nucleic acid and peptide sequences. The present disclosure provides for a nucleic acid sequence encoding at least two amino acid sequences selected from the group consisting of SEQ ID NO: 1 to 6, SEQ ID NO: 8 to 10, SEQ ID NO: 12 to 28, and SEQ ID NO: 30 to 41. The present disclosure also provides for an immunogenic peptide composition comprising at least one peptide selected from the group consisting of SEQ ID NO: 42 to 65. The present disclosure provides for a composition comprising nucleic acid sequences encoding at least two amino acid sequences selected from the group consisting of SEQ ID NOs: 1550 to 1593. The present disclosure further provides for a nucleic acid sequence encoding at least one amino acid sequence selected from the group consisting of SEQ ID NOs: 1595 to 1661.
Description
COMPOSITIONS AND METHODS FOR OPTIMIZED PEPTIDE VACCINES [0001] This application claims the benefit of and priority under 35 U.S.C. § 119(e) to U.S. Serial No.17/336,960 filed June 2, 2021, which is a continuation of U.S. Serial No.17/100,630 filed November 20, 2020, now U.S. Patent No.11,058,751, issued July 13, 2021, the contents of each of which are hereby incorporated by reference in their entireties. [0002] This application also claims the benefit of and priority under 35 U.S.C. § 119(e) to U.S. Serial No.17/389,875 filed July 30, 2021, which is a continuation of U.S. Serial No. 17/114,237 filed December 7, 2020, now U.S. Patent No.11,161,892, issued November 2, 2021, the contents of each of which are hereby incorporated by reference in their entireties. [0003] This application also claims the benefit of U.S. Serial No.63/249,235 filed September 28, 2021, the contents of which is hereby incorporated by reference in its entirety. [0004] All patents, patent applications and publications cited herein are hereby incorporated by reference in their entirety. The disclosures of these publications in their entireties are hereby incorporated by reference into this application. [0005] This patent disclosure contains material that is subject to copyright protection. The copyright owner has no objection to the facsimile reproduction of the patent document or the patent disclosure as it appears in the U.S. Patent and Trademark Office patent file or records, but otherwise reserves any and all copyright rights. INCORPORATION BY REFERENCE [0006] All documents cited herein are incorporated herein by reference in their entireties. SEQUENCE LISTING [0007] The instant application contains a Sequence Listing which has been submitted electronically in ASCII format and is hereby incorporated by reference in its entirety. The ASCII copy, created on November 15, 2021, is named 2215269_00123WO1_SL.txt and is 23,070,263 bytes in size. FIELD OF THE INVENTION
[0008] The present invention relates generally to compositions, systems, and methods of peptide vaccines. More particularly, the present invention relates to compositions, systems, and methods of designing peptide vaccines to treat or prevent disease optimized based on predicted population immunogenicity. BACKGROUND [0009] The goal of a peptide vaccine is to train the immune system to recognize and expand its capacity to engage cells that display target peptides to improve the immune response to cancerous cells or pathogens. A peptide vaccine can also be administered to someone who is already diseased to increase their immune response to a causal cancer, other diseases, or pathogen. Alternatively, a peptide vaccine can be administered to induce the immune system to have therapeutic tolerance to one or more peptides. There exists a need for compositions, systems, and methods of peptide vaccines based on prediction of the target peptides that will be displayed to protect a host from cancer, other disease, or pathogen infection. We introduce novel prophylactic and therapeutic vaccines for cancer based upon neoantigens introduced by mutations in the RAS gene that occurs in many cancers, and the BCR-ABL gene fusion that occurs in cases of chronic myelogenous leukemia (CML), acute lymphoblastic leukemia (ALL), and acute myelogenous leukemia (AML), breast invasive ductal carcinoma, and other cancers. SUMMARY OF THE INVENTION [0010] In one aspect, the invention provides for a nucleic acid sequence encoding at least two amino acid sequences selected from the group consisting of SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO: 20, SEQ ID NO: 21, SEQ ID NO: 22, SEQ ID NO: 23, SEQ ID NO: 24, SEQ ID NO: 25, SEQ ID NO: 26, SEQ ID NO: 27, SEQ ID NO: 28, SEQ ID NO: 30, SEQ ID NO: 31, SEQ ID NO: 32, SEQ ID NO: 33, SEQ ID NO: 34, SEQ ID NO: 35, SEQ ID NO: 36, SEQ ID NO: 37, SEQ ID NO: 38, SEQ ID NO: 39, SEQ ID NO: 40, and SEQ ID NO: 41. [0011] In some embodiments, the nucleic acid sequence is an immunogenic composition. In some embodiments, the nucleic acid sequence is administered in a construct for expression in vivo. In some embodiments, the in vivo administration of the nucleic acid sequence is configured
embodiments, the at least one peptide is a modified or an unmodified fragment of a mutated KRAS protein. In some embodiments, the mutated KRAS protein is selected from the group consisting of KRAS G12D, KRAS G12V, KRAS G12R, KRAS G12C, and KRAS G13D. In some embodiments, the nucleic acid sequence is administered in an effective amount to a subject to prevent cancer. In some embodiments, the nucleic acid sequence is administered in an effective amount to a subject to treat cancer. [0012] In another aspect, the invention provides for an immunogenic peptide composition comprising at least two peptides selected from the group consisting of SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO: 20, SEQ ID NO: 21, SEQ ID NO: 22, SEQ ID NO: 23, SEQ ID NO: 24, SEQ ID NO: 25, SEQ ID NO: 26, SEQ ID NO: 27, SEQ ID NO: 28, SEQ ID NO: 30, SEQ ID NO: 31, SEQ ID NO: 32, SEQ ID NO: 33, SEQ ID NO: 34, SEQ ID NO: 35, SEQ ID NO: 36, SEQ ID NO: 37, SEQ ID NO: 38, SEQ ID NO: 39, SEQ ID NO: 40, and SEQ ID NO: 41. [0013] In some embodiments at least one peptide of the at least two peptides is displayed by an HLA class I molecule in a subject. In some embodiments, at least one peptide of the at least two peptides is a modified or unmodified fragment of a mutated KRAS protein. In some embodiments, the mutated KRAS protein is selected from the group consisting of KRAS G12D, KRAS G12V, KRAS G12R, KRAS G12C, and KRAS G13D. In some embodiments, the immunogenic peptide composition is administered in an effective amount to a subject to prevent cancer. In some embodiments, the immunogenic peptide composition is administered in an effective amount to a subject to treat cancer. In some embodiments, the immunogenic peptide composition comprises at least three peptides selected from the group consisting of SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO: 20, SEQ ID NO: 21, SEQ ID NO: 22, SEQ ID NO: 23, SEQ ID NO: 24, SEQ ID NO: 25, SEQ ID NO: 26, SEQ ID NO: 27, SEQ ID NO: 28, SEQ ID NO: 30, SEQ ID NO: 31, SEQ ID NO: 32, SEQ ID NO: 33, SEQ ID NO: 34, SEQ ID NO: 35, SEQ ID NO: 36, SEQ ID NO: 37, SEQ ID NO: 38, SEQ ID NO: 39, SEQ ID NO: 40, and SEQ ID NO: 41.
[0014] In another aspect, the invention provides for a nucleic acid sequence encoding at least one amino acid sequence selected from the group consisting of SEQ ID NO: 42, SEQ ID NO: 43, SEQ ID NO: 44, SEQ ID NO: 45, SEQ ID NO: 46, SEQ ID NO: 47, SEQ ID NO: 48, SEQ ID NO: 49, SEQ ID NO: 50, SEQ ID NO: 51, SEQ ID NO: 52, SEQ ID NO: 53, SEQ ID NO: 54, SEQ ID NO: 55, SEQ ID NO: 56, SEQ ID NO: 57, SEQ ID NO: 58, SEQ ID NO: 59, SEQ ID NO: 60, SEQ ID NO: 61, SEQ ID NO: 62, SEQ ID NO: 63, SEQ ID NO: 64, and SEQ ID NO: 65. [0015] In some embodiments, the nucleic acid sequence is an immunogenic composition. In some embodiments, the nucleic acid sequence is administered in a construct for expression in vivo. In some embodiments, the in vivo administration of the nucleic acid sequence is configured to produce at least one peptide that is displayed by an HLA class II molecule. In some embodiments, the at least one peptide is a modified fragment of a mutated KRAS protein. In some embodiments, the mutated KRAS protein is selected from the group consisting of KRAS G12D, KRAS G12V, KRAS G12R, KRAS G12C, and KRAS G13D. In some embodiments, the nucleic acid sequence is administered in an effective amount to a subject to prevent cancer. In some embodiments, the nucleic acid sequence is administered in an effective amount to a subject to treat cancer. [0016] In another aspect, the invention provides for an immunogenic peptide composition comprising at least one peptide selected from the group consisting of SEQ ID NO: 42, SEQ ID NO: 43, SEQ ID NO: 44, SEQ ID NO: 45, SEQ ID NO: 46, SEQ ID NO: 47, SEQ ID NO: 48, SEQ ID NO: 49, SEQ ID NO: 50, SEQ ID NO: 51, SEQ ID NO: 52, SEQ ID NO: 53, SEQ ID NO: 54, SEQ ID NO: 55, SEQ ID NO: 56, SEQ ID NO: 57, SEQ ID NO: 58, SEQ ID NO: 59, SEQ ID NO: 60, SEQ ID NO: 61, SEQ ID NO: 62, SEQ ID NO: 63, SEQ ID NO: 64, and SEQ ID NO: 65. [0017] In some embodiments, at least one peptide in the immunogenic peptide composition is displayed by an HLA class II molecule. In some embodiments, at least one peptide in the immunogenic peptide composition is a modified or an unmodified fragment of a mutated KRAS protein. In some embodiments, the mutated KRAS protein is selected from the group consisting of KRAS G12D, KRAS G12V, KRAS G12R, KRAS G12C, and KRAS G13D. In some embodiments, the immunogenic peptide composition is administered in an effective amount to a subject to prevent cancer. In some embodiments, the immunogenic peptide composition is
immunogenic peptide composition comprises at least two peptides selected from the group consisting of SEQ ID NO: 42, SEQ ID NO: 43, SEQ ID NO: 44, SEQ ID NO: 45, SEQ ID NO: 46, SEQ ID NO: 47, SEQ ID NO: 48, SEQ ID NO: 49, SEQ ID NO: 50, SEQ ID NO: 51, SEQ ID NO: 52, SEQ ID NO: 53, SEQ ID NO: 54, SEQ ID NO: 55, SEQ ID NO: 56, SEQ ID NO: 57, SEQ ID NO: 58, SEQ ID NO: 59, SEQ ID NO: 60, SEQ ID NO: 61, SEQ ID NO: 62, SEQ ID NO: 63, SEQ ID NO: 64, and SEQ ID NO: 65. [0018] In another aspect, the invention provides for method of forming an immunogenic peptide composition, the method comprising using a processor to perform the steps of creating a first peptide set by selecting a plurality of unmodified peptide sequences, wherein at least one peptide sequence of the plurality of unmodified peptide sequences is associated with a tumor neoantigen or a self-protein, determining a plurality of peptide-HLA binding scores for each peptide sequence in the first peptide set, determining whether each peptide sequence in the first peptide set has a peptide-HLA binding score that passes a first threshold with respect to at least three HLA alleles, creating a second peptide set comprising the first peptide set and a plurality of modified peptide sequences, wherein each modified peptide sequence of the plurality of modified peptide sequences comprises a substitution of at least one amino acid residue of a peptide sequence in the first peptide set, determining whether each modified peptide sequence in the second peptide set has a peptide-HLA binding score that passes a second threshold with respect to the at least three HLA alleles, wherein the second threshold is more constrained than the first threshold; and creating a third peptide set by selecting a subset of the second peptide set, wherein the selecting comprises computing a population coverage, wherein the computing of the population coverage comprises excluding a peptide-HLA binding score with respect to a first HLA allele for a modified peptide sequence if a peptide-HLA binding score for an unmodified peptide sequence associated with the modified peptide sequence does not pass the first threshold with respect to the first HLA allele, and wherein the selected subset has a population coverage above a third threshold, performing an experimental assay to obtain a peptide-HLA immunogenicity metric for at least one peptide sequence in the third peptide set, and forming an immunogenic peptide composition comprising the at least one peptide sequence in the third peptide set for which the experimental assay was performed. [0019] In some embodiments, selecting the plurality of unmodified peptide sequences to create the first peptide set comprises sliding a window of size n across an amino acid sequence encoding the tumor neoantigen or the self-protein, wherein n is between about 8 amino acids and
about 25 amino acids in length, and wherein n is a length of each peptide sequence of the plurality of unmodified peptide sequences in the first peptide set. In some embodiments, each peptide sequence in the first peptide set binds to an HLA class I molecule or an HLA class II molecule. In some embodiments, the method further comprises filtering the first peptide set to exclude a peptide sequence with a predicted binding core that contains a target residue in an anchor position. In some embodiments, the method further comprises substituting at least one amino acid residue of each peptide sequence in the first peptide set, wherein for at least one peptide sequence in the first peptide set the at least one amino acid residue is in an anchor position. In some embodiments, the first threshold is a binding affinity of less than about 1000 nM. In some embodiments, the second threshold is a binding affinity of less than about 500 nM. In some embodiments, the population coverage is computed based on a frequency of an HLA haplotype in a human population. In some embodiments, the population coverage is computed based on a frequency of the at least three HLA alleles in a human population. In some embodiments, the plurality of unmodified peptide sequences is derived from the tumor neoantigen or the self-protein that is present in a subject. In some embodiments, the third threshold is a proportion of a human population of between about 0.7 and about 0.8. In some embodiments, the tumor neoantigen or the self-protein is associated with a cancer, and wherein the cancer is selected from the group consisting of pancreas, colon, rectum, kidney, bronchus, lung, uterus, cervix, bladder, liver, and stomach.
[0020] In another aspect, the invention provides for method of forming an immunogenic peptide composition, the method comprising using a processor to perform the steps of creating a first peptide set by selecting a plurality of unmodified peptide sequences, wherein the plurality of unmodified peptide sequences is associated with a tumor neoantigen or a self-protein, determining a plurality of peptide-HLA immunogenicity metrics for each peptide sequence in the first peptide set, determining whether each peptide sequence in the first peptide set has a peptide-HLA immunogenicity metric that passes a first threshold with respect to at least three HLA alleles, creating a second peptide set comprising the first peptide set and a plurality of modified peptide sequences, wherein each modified peptide sequence of the plurality of modified peptide sequences comprises a substitution of at least one amino acid residue of a peptide sequence in the first peptide set, determining whether each modified peptide sequence in the second peptide set has a peptide-HLA immunogenicity metric that passes a second threshold with respect to the at least three HLA alleles, wherein the second threshold is more constrained than the first threshold, and creating a third peptide set by selecting a subset of the second
peptide set, wherein the selecting comprises computing a predicted vaccine performance, wherein the computing of the predicted vaccine performance comprises excluding a peptide- HLA immunogenicity metric with respect to a first HLA allele for a modified peptide sequence if a peptide-HLA immunogenicity metric for an unmodified peptide sequence associated with the modified peptide sequence does not pass the first threshold with respect to the first HLA allele, and wherein the predicted vaccine performance is a function of a non-excluded peptide- HLA immunogenicity metric of each peptide sequence in the third peptide set, performing an experimental assay to obtain a peptide-HLA immunogenicity metric for at least one peptide sequence in the third peptide set, and forming an immunogenic peptide composition comprising the at least one peptide sequence in the third peptide set for which the experimental assay was performed. [0021] In some embodiments, selecting the plurality of unmodified peptide sequences to create the first peptide set comprises sliding a window of size n across an amino acid sequence encoding the tumor neoantigen or the self-protein, wherein n is between about 8 amino acids and about 25 amino acids in length, and wherein n is a length of each peptide sequence of the plurality of unmodified peptide sequences in the first peptide set. In some embodiments, each peptide sequence in the first peptide set binds to an HLA class I molecule or an HLA class II molecule. In some embodiments, the method further comprises filtering the first peptide set to exclude a peptide sequence with a predicted binding core that contains a target residue in an anchor position. In some embodiments, the method further comprises substituting at least one amino acid residue of each peptide sequence in the first peptide set, wherein for at least one peptide sequence in the first peptide set the at least one amino acid residue is in an anchor position. In some embodiments, the first threshold is a binding affinity of less than about 1000 nM. [0022] In another aspect, the invention provides for method of forming an immunogenic peptide composition, the method comprising using a processor to perform the steps of creating a first peptide set by selecting a plurality of unmodified peptide sequences, wherein each peptide sequence of the plurality of unmodified peptide sequences is associated with a tumor neoantigen or a self-protein, determining a plurality of peptide-HLA binding scores for each peptide sequence in the first peptide set, determining whether each peptide sequence in the first peptide set has a peptide-HLA binding score that passes a first threshold with respect to at least three HLA alleles creating a second peptide set comprising the first peptide set and a plurality of
modified peptide sequences, wherein each modified peptide sequence of the plurality of modified peptide sequences comprises a substitution of at least one amino acid residue of a peptide sequence in the first peptide set, determining whether each modified peptide sequence in the second peptide set has a peptide-HLA binding score that passes a second threshold with respect to the at least three HLA alleles, wherein the second threshold is more constrained than the first threshold, and creating a third peptide set by selecting a subset of the second peptide set, wherein the selecting comprises computing a predicted vaccine performance, wherein the computing of the predicted vaccine performance comprises excluding a peptide-HLA binding score with respect to a first HLA allele for a modified peptide sequence if a peptide-HLA binding score for an unmodified peptide sequence associated with the modified peptide sequence does not pass the first threshold with respect to the first HLA allele, and wherein the predicted vaccine performance is a function of a non-excluded peptide-HLA binding score of each peptide sequence in the third peptide set, performing an experimental assay to obtain a peptide-HLA immunogenicity metric for at least one peptide sequence in the third peptide set, and forming an immunogenic peptide composition comprising the at least one peptide sequence in the third peptide set for which the experimental assay was performed. [0023] In some embodiments, the second threshold is based on data obtained from one or more experimental assays. In some embodiments, the predicted vaccine performance is further a function of a peptide-HLA immunogenicity metric of at least one modified peptide sequence of the plurality of modified peptide sequences bound to a second HLA allele of the at least three HLA alleles if a first peptide sequence in the first peptide set is predicted to be bound to the second HLA allele of the at least three HLA alleles with a first binding core, wherein the first binding core is a binding core of the first peptide sequence, wherein the first binding core is identical to a second binding core, wherein the first binding core and the second binding core comprise an amino acid position within a peptide sequence, and wherein the second binding core is a binding core of the at least one modified peptide sequence of the plurality of modified peptide sequences bound to the second HLA allele. [0024] In another aspect, the invention provides for method of forming an immunogenic peptide composition, the method comprising using a processor to perform the steps of creating a first peptide set by selecting a plurality of unmodified peptide sequences, wherein at least one peptide sequence of the plurality of unmodified peptide sequences is associated with a tumor neoantigen or a self-protein; determining a plurality of peptide-HLA binding scores for each
peptide sequence in the first peptide set, determining whether each peptide sequence in the first peptide set has a peptide-HLA binding score that passes a first threshold with respect to at least three HLA alleles, creating a second peptide set comprising the first peptide set and a plurality of modified peptide sequences, wherein each modified peptide sequence of the plurality of modified peptide sequences comprises a substitution of at least one amino acid residue of a peptide sequence in the first peptide set, determining whether each modified peptide sequence in the second peptide set has a peptide-HLA binding score that passes a second threshold with respect to the at least three HLA alleles, wherein the second threshold is more constrained than the first threshold, creating a third peptide set by selecting a subset of the second peptide set, wherein the selecting comprises computing a predicted vaccine performance based on an HLA type of a subject, and wherein the computing of the predicted vaccine performance comprises excluding a peptide-HLA binding score with respect to a first HLA allele for a modified peptide sequence if a peptide-HLA binding score for an unmodified peptide sequence associated with the modified peptide sequence does not pass the first threshold with respect to the first HLA allele, performing an experimental assay to obtain a peptide-HLA immunogenicity metric for at least one peptide sequence in the third peptide set, and forming an immunogenic peptide composition comprising the at least one peptide sequence in the third peptide set for which the experimental assay was performed. [0025] In some embodiments, each peptide sequence in the first peptide set binds to an HLA class I molecule or an HLA class II molecule. In some embodiments, the plurality of unmodified peptide sequences is derived from the tumor neoantigen or the self-protein that is present in a subject. In some embodiments, the at least three HLA alleles are present in the HLA type of a subject. In some embodiments, the plurality of unmodified peptide sequences is derived from the tumor neoantigen or the self-protein that is present in a subject. In some embodiments, the plurality of unmodified peptide sequences is derived from the tumor neoantigen or the self-protein that is present in the subject. In some embodiments, the immunogenic peptide composition comprises nucleic acid sequences encoding an amino acid sequence of the at least one peptide sequence in the third peptide set. In some embodiments, the immunogenic peptide composition comprises nucleic acid sequences encoding an amino acid sequence of the at least one peptide sequence in the third peptide set.
[0026] In another aspect, the invention provides for a composition comprising nucleic acid sequences encoding at least two amino acid sequences selected from the group consisting of SEQ ID NOs: 1550 to 1593. [0027] In some embodiments, the composition is immunogenic. In some embodiments, the nucleic acid sequences are administered in a construct for expression in vivo. In some embodiments, the in vivo administration of the nucleic acid sequence is configured to produce at least one peptide that is displayed by an HLA class I molecule. In some embodiments, the at least one peptide is a modified or an unmodified fragment of a BCL-ABL gene fusion. In some embodiments, the BCR-ABL gene fusion is b3a2 or b2a2. In some embodiments, the nucleic acid sequences are administered in an effective amount to a subject to prevent cancer. In some embodiments, the nucleic acid sequences are administered in an effective amount to a subject to treat cancer. [0028] In another aspect, the invention provides for a composition comprising at least two peptides selected from the group consisting of SEQ ID NOs: 1550 to 1593. [0029] In some embodiments, at least one peptide of the at least two peptides is displayed by an HLA class I molecule in a subject. In some embodiments, at least one peptide in the immunogenic peptide composition is a modified or an unmodified fragment of a BCL-ABL gene fusion. In some embodiments, the BCR-ABL gene fusion is b3a2 or b2a2. In some embodiments, the immunogenic peptide composition is administered in an effective amount to a subject to prevent cancer. In some embodiments, the immunogenic peptide composition is administered in an effective amount to a subject to treat cancer. In some embodiments, the invention provides for a composition comprising at least two peptides selected from the group consisting of SEQ ID NOs: 1550 to 1593. [0030] In another aspect, the invention provides for a nucleic acid sequence encoding at least one amino acid sequence selected from the group consisting of SEQ ID NOs: 1595 to 1661. [0031] In some embodiments, the composition is immunogenic. In some embodiments, the nucleic acid sequence is administered in a construct for expression in vivo. In some embodiments, the in vivo administration of the nucleic acid sequence is configured to produce at least one peptide that is displayed by an HLA class II molecule. In some embodiments, the at least one amino acid sequence is derived from a modified fragment of a BCL-ABL gene fusion.
nucleic acid sequence is administered in an effective amount to a subject to prevent cancer. In some embodiments, the nucleic acid sequence is administered in an effective amount to a subject to treat cancer. [0032] In another aspect, the invention provides for a nucleic acid sequence encoding at least one amino acid sequence selected from the group consisting of SEQ ID NOs: 1595 to 1661. [0033] In some embodiments, the at least one peptide is displayed by an HLA class II molecule in a subject. In some embodiments, the at least one peptide in the immunogenic peptide composition is a modified or an unmodified fragment of a BCL-ABL gene fusion. In some embodiments, the BCR-ABL gene fusion is b3a2 or b2a2. In some embodiments, the immunogenic peptide composition is administered in an effective amount to a subject to prevent cancer. In some embodiments, the immunogenic peptide composition is administered in an effective amount to a subject to treat cancer. In some embodiments, the immunogenic peptide composition comprises at least two peptides selected from the group consisting of SEQ ID NOs: 1595 to 1661. [0034] In another aspect, the invention provides a method of forming an immunogenic peptide composition, the method comprising using a processor to perform the steps of creating a first peptide set by selecting a plurality of unmodified peptide sequences, wherein at least one peptide sequence of the plurality of unmodified peptide sequences is associated with a tumor neoantigen, a pathogen proteome, or a self-protein, determining a plurality of peptide-HLA binding scores for each peptide sequence in the first peptide set, determining whether each peptide sequence in the first peptide set has a peptide-HLA binding score that passes a first threshold with respect to at least three HLA alleles, creating a second peptide set comprising the first peptide set and a plurality of modified peptide sequences, wherein each modified peptide sequence of the plurality of modified peptide sequences comprises a substitution of at least one amino acid residue of a peptide sequence in the first peptide set, determining a plurality of peptide-HLA binding scores for each peptide sequence in the second peptide set; and creating a third peptide set by selecting a subset of the second peptide set, wherein the selecting comprises computing a population coverage, wherein the computing of the population coverage comprises excluding a peptide-HLA binding score with respect to a first HLA allele for a modified peptide sequence if a peptide-HLA binding score for an unmodified peptide sequence associated with the modified peptide sequence does not pass the first threshold with respect to the first HLA
performing an experimental assay to obtain a peptide-HLA immunogenicity metric for at least one peptide sequence of the third peptide set; and forming an immunogenic peptide composition comprising the at least one peptide sequence of the third peptide set for which the experimental assay was performed. [0035] In some embodiments, selecting the plurality of unmodified peptide sequences to create the first peptide set comprises sliding a window of size n across at least a portion of an amino acid sequence encoding the tumor neoantigen, the pathogen proteome, or the self-protein, wherein n is between about 8 amino acids and about 25 amino acids in length, and wherein n is a length of each peptide sequence of the plurality of unmodified peptide sequences of the first peptide set. In some embodiments, each peptide sequence of the first peptide set binds to an HLA class I molecule or an HLA class II molecule. In some embodiments, further comprising filtering the first peptide set to exclude a peptide sequence with a predicted binding core that contains a target amino acid residue in an anchor position. In some embodiments, further comprising substituting at least one amino acid residue of each peptide sequence of the first peptide set, wherein for at least one peptide sequence of the first peptide set the at least one amino acid residue is in an anchor position. In some embodiments, the first threshold is a binding affinity of less than about 1000 nM. In some embodiments, the population coverage is computed with respect to the at least three HLA alleles. In some embodiments, the population coverage is computed based on a frequency of an HLA haplotype in a human population. In some embodiments, the population coverage is computed based on a frequency of the at least three HLA alleles in a human population. In some embodiments, the plurality of unmodified peptide sequences is derived from the tumor neoantigen, the pathogen proteome, or the self- protein that is present in a subject. In some embodiments, the second threshold is a proportion of a human population of between about 0.7 and about 0.8. In some embodiments, the tumor neoantigen or the self-protein is associated with a cancer, and wherein the cancer is selected from the group consisting of pancreas, colon, rectum, kidney, bronchus, lung, uterus, cervix, bladder, liver, and stomach. In some embodiments, the pathogen proteome is associated with a pathogen infection in a human subject. In some embodiments, the immunogenic peptide composition comprises nucleic acid sequences encoding an amino acid sequence of the at least one peptide sequence of the third peptide set. [0036] In another aspect, the invention provides a method of forming an immunogenic peptide composition, the method comprising using a processor to perform the steps of creating a
first peptide set by selecting a plurality of unmodified peptide sequences, wherein the plurality of unmodified peptide sequences is associated with a tumor neoantigen, a pathogen proteome, or a self-protein, determining a plurality of peptide-HLA immunogenicity metrics for each peptide sequence in the first peptide set, determining whether each peptide sequence in the first peptide set has a peptide-HLA immunogenicity metric that passes a threshold with respect to at least three HLA alleles, creating a second peptide set comprising the first peptide set and a plurality of modified peptide sequences, wherein each modified peptide sequence of the plurality of modified peptide sequences comprises a substitution of at least one amino acid residue of a peptide sequence in the first peptide set, determining a plurality of peptide-HLA immunogenicity metrics for each peptide sequence in the second peptide set; and creating a third peptide set by selecting a subset of the second peptide set, wherein the selecting comprises a predicted vaccine performance, wherein the computing of the predicted vaccine performance comprises excluding a peptide-HLA immunogenicity metric with respect to a first HLA allele for a modified peptide sequence if a peptide-HLA immunogenicity metric for an unmodified peptide sequence associated with the modified peptide sequence does not pass the threshold with respect to the first HLA allele, and wherein the predicted vaccine performance is a function of a peptide-HLA immunogenicity metric of each peptide sequence in the third peptide set, performing an experimental assay to obtain a peptide-HLA immunogenicity metric for at least one peptide sequence of the third peptide set; and forming an immunogenic peptide composition comprising the at least one peptide sequence of the third peptide set for which the experimental assay was performed.
[0037] In some embodiments, selecting the plurality of unmodified peptide sequences to create the first peptide set comprises sliding a window of size n across at least a portion of an amino acid sequence encoding the tumor neoantigen, the pathogen proteome, or the self-protein, wherein n is between about 8 amino acids and about 25 amino acids in length, and wherein n is a length of each peptide sequence of the plurality of unmodified peptide sequences of the first peptide set. In some embodiments, each peptide sequence of the first peptide set binds to an HLA class I molecule or an HLA class II molecule. In some embodiments, further comprising filtering the first peptide set to exclude a peptide sequence with a predicted binding core that contains a target amino acid residue in an anchor position. In some embodiments, further comprising substituting at least one amino acid residue of each peptide sequence of the first peptide set. In some embodiments, the threshold is a binding affinity of less than about 1000 nM. In some embodiments, the at least three HLA alleles are present in an HLA type of a
subject. In some embodiments, the plurality of peptide sequences is derived from the tumor neoantigen, the pathogen proteome, or the self-protein that is present in the subject. In some embodiments, the immunogenic peptide composition comprises nucleic acid sequences encoding an amino acid sequence of the at least one peptide sequence of the third peptide set. [0038] In another aspect, the invention provides a method of forming an immunogenic peptide composition, the method comprising using a processor to perform the steps of creating a first peptide set by selecting a plurality of unmodified peptide sequences, wherein each peptide sequence of the plurality of unmodified peptide sequences is associated with a tumor neoantigen, a pathogen proteome, or a self-protein, determining a plurality of peptide-HLA binding scores for each peptide sequence in the first peptide set, determining whether each peptide sequence in the first peptide set has a peptide-HLA binding score that passes a threshold with respect to at least three HLA alleles, creating a second peptide set comprising the first peptide set and a plurality of modified peptide sequences, wherein each modified peptide sequence of the plurality of modified peptide sequences comprises a substitution of at least one amino acid residue of a peptide sequence in the first peptide set, determining a plurality of peptide-HLA binding scores for each peptide sequence in the second peptide set, and creating a third peptide set by selecting a subset of the second peptide set, wherein the selecting comprises computing a predicted vaccine performance, wherein the computing of the predicted vaccine performance comprises excluding a peptide-HLA binding score with respect to a first HLA allele for a modified peptide sequence if a peptide-HLA binding score for an unmodified peptide sequence associated with the modified peptide sequence does not pass the threshold with respect to the first HLA allele, and wherein the predicted vaccine performance is a function of a peptide- HLA binding score of each peptide sequence in the third peptide set, performing an experimental assay to obtain a peptide-HLA immunogenicity metric for at least one peptide sequence of the third peptide set, and forming an immunogenic peptide composition comprising the at least one peptide sequence of the third peptide set for which the experimental assay was performed. [0039] In some embodiments, the second threshold is determined from data obtained from one or more experimental assays. In some embodiments, the predicted vaccine performance is further a function of a peptide-HLA immunogenicity metric of at least one modified peptide sequence of the second peptide set with respect to a second HLA allele if a first peptide sequence of the first peptide set is predicted to be bound to the second HLA allele with a first
binding core, wherein the first binding core is a binding core of the first peptide sequence, wherein the first binding core is identical to a second binding core, wherein the first binding core and the second binding core each comprise an amino acid position within a peptide sequence, and wherein the second binding core is a binding core of the at least one modified peptide sequence. In some embodiments, the plurality of peptide sequences is derived from the tumor neoantigen, the pathogen proteome, or the self-protein that is present in a subject. [0040] In another aspect, the invention provides a method of forming an immunogenic peptide composition, the method comprising using a processor to perform the steps of creating a first peptide set by selecting a plurality of unmodified peptide sequences, wherein at least one peptide sequence of the plurality of unmodified peptide sequences is associated with a tumor neoantigen, a pathogen proteome, or a self-protein, determining a plurality of peptide-HLA binding scores for each peptide sequence in the first peptide set, determining whether each peptide sequence in the first peptide set has a peptide-HLA binding score that passes a threshold with respect to at least three HLA alleles, creating a second peptide set comprising the first peptide set and a plurality of modified peptide sequences, wherein each modified peptide sequence of the plurality of modified peptide sequences comprises a substitution of at least one amino acid residue of a peptide sequence in the first peptide set, determining a plurality of peptide-HLA binding scores for each peptide sequence in the second peptide set, and creating a third peptide set by selecting a subset of the second peptide set, wherein the selecting comprises computing a predicted vaccine performance based on an HLA type of a subject, and wherein the computing of the predicted vaccine performance comprises excluding a peptide-HLA binding score with respect to a first HLA allele for a modified peptide sequence if a peptide-HLA binding score for an unmodified peptide sequence associated with the modified peptide sequence does not pass the threshold with respect to the first HLA allele, performing an experimental assay to obtain a peptide-HLA immunogenicity metric for at least one peptide sequence of the third peptide set, and forming an immunogenic peptide composition comprising the at least one peptide sequence of the third peptide set for which the experimental assay was performed. [0041] In some embodiments, each peptide sequence in the first peptide set binds to an HLA class I molecule or an HLA class II molecule. In some embodiments, the plurality of peptide sequences is derived from the tumor neoantigen, the pathogen proteome, or the self-protein that is present in the subject.
BRIEF DESCRIPTION OF THE DRAWINGS [0042] The following FIG.s depict illustrative embodiments of the invention. [0043] FIG.1 is a flow chart of a vaccine optimization method. [0044] FIG.2 is a flow chart of vaccine optimization method with seed set compression. [0045] FIG.3 is a graph showing predicted population coverage for MHC class I vaccines by vaccine size for KRAS G12D, KRAS G12V, KRAS G12R, KRAS G12C, and KRAS G13D targets. [0046] FIG.4 is a graph showing predicted population coverage for MHC class II vaccines by vaccine size for KRAS G12D, KRAS G12V, KRAS G12R, KRAS G12C, and KRAS G13D targets. [0047] FIG.5 is a graph showing predicted population coverage for MHC class I vaccines that include heteroclitic peptides by vaccine size for the BCR-ABL b3a2 fusion, for at least one peptide-HLA hit (circles), at least three peptide-HLA hits (triangles), and at least five peptide- HLA hits (squares). The dashed lines show the predicted population coverage of BCR-ABL b3a2 fusion vaccines without heteroclitic peptides for at least 1 (top dashed line) and 5 (bottom dashed line) peptide-HLA hits per-individual. [0048] FIG.6 is a graph showing predicted population coverage for MHC class II vaccines that include heteroclitic peptides by vaccine size for the BCR-ABL b3a2 fusion, for at least one peptide-HLA hit (circles), at least three peptide-HLA hits (triangles), and at least five peptide- HLA hits (squares). The dashed line shows the predicted population coverage of BCR-ABL b3a2 fusion vaccines without heteroclitic peptides for at least one peptide-HLA hit per- individual. [0049] FIG.7 is a graph showing predicted population coverage for MHC class I vaccines that include heteroclitic peptides by vaccine size for the BCR-ABL b2a2 fusion, for at least one peptide-HLA hit (circles), at least three peptide-HLA hits (triangles), and at least five peptide- HLA hits (squares). The dashed line shows the predicted population coverage of BCR-ABL b3a2 fusion vaccines without heteroclitic peptides for at least one peptide-HLA hit per- individual. [0050] FIG.8 is a graph showing predicted population coverage for MHC class II vaccines that include heteroclitic peptides by vaccine size for the BCR ABL b2a2 fusion for at least one
peptide-HLA hit (circles), at least three peptide-HLA hits (triangles), and at least five peptide- HLA hits (squares). The dashed lines show the predicted population coverage of BCR-ABL b3a2 fusion vaccines without heteroclitic peptides for at least 1 (top dashed line) and 5 (bottom dashed line) peptide-HLA hits per-individual. [0051] FIG.9 shows predicated peptide-HLA hits by vaccine size for a KRAS G12V vaccine for the HLA diplotype HLA-A02:03, HLA-A11:01, HLA-B55:02, HLA-B58:01, HLA- C03:02, HLA-C03:03. [0052] FIG.10 shows probabilities of disease presentations for pancreas, colon/rectum, and bronchus/lung and respective probabilities of target presentations for various mutated protein targets. [0053] FIG.11 is a flow chart for multiple target (combined) vaccine optimization methods. [0054] FIG.12 shows predicted population coverage for pancreatic cancer multiple target (combined) MHC class I vaccines by vaccine size for KRAS G12D, KRAS G12V, and KRAS G12R targets. The dashed line shows the predicted population coverage of a pancreatic cancer combined vaccine without heteroclitic peptides for MHC class I. [0055] FIG.13 shows predicted population coverage for pancreatic cancer multiple target (combined) MHC class II vaccines by vaccine size for KRAS G12D, KRAS G12V, and KRAS G12R targets. The dashed line shows the predicted population coverage of a pancreatic cancer combined vaccine without heteroclitic peptides for MHC class II. [0056] FIG.14 is a script showing an example Python implementation of the MERGEMULTI function for combined vaccine design procedures. DETAILED DESCRIPTION OF THE INVENTION [0057] In some embodiments, the disclosure provides for peptide vaccines that incorporate peptide sequences that will be displayed by Major Histocompatibility Complex (MHC) molecules on cells and train the immune system to recognize cancer or pathogen diseased cells. In some embodiments, the disclosure provides for peptide vaccines that that incorporate peptide sequences that will be displayed by Major Histocompatibility Complex (MHC) molecules on cells to induce therapeutic tolerance in antigen-specific immunotherapy for autoimmune diseases (Alhadj Ali et al., 2017, Gibson, et al.2015). In some embodiments, a peptide vaccine is a composition that consists of one or more peptides. In some embodiments, a peptide vaccine is an mRNA or DNA construct administered for expression in vivo that encodes for one or more
[0058] Peptide display by an MHC molecule is necessary, but not sufficient, for a peptide to be immunogenic and cause the recognition of the resulting peptide-MHC complex by an individual's T cells to trigger T cell activation, expansion, and immune memory. In some embodiments, ELISPOT (Slota et al., 2011) or the Multiplex Identification of Antigen-Specific T Cell Receptors Using a Combination of Immune Assays and Immune Receptor Sequencing (MIRA) assay (Klinger et al., 2015) are used to scoring peptide display (e.g., a peptide immunogenicity that requires peptide binding) by an MHC molecule (e.g , HLA allele) (e.g., measured as a peptide-HLA binding score). In some embodiments, experimental data from assays such as the ELISPOT (Slota et al, 2011) or the Multiplex Identification of Antigen- Specific T Cell Receptors Using a Combination of Immune Assays and Immune Receptor Sequencing (MIRA) assay (Klinger et al, 2015) can be used to produce a peptide-HLA immunogenicity metric with respect to a peptide and an HLA allele in a given experimental context or individual. In some embodiments, experimental data from assays such as the ELISPOT (Slota et al., 2011) or the Multiplex Identification of Antigen-Specific T Cell Receptors Using a Combination of Immune Assays and Immune Receptor Sequencing (MIRA) assay (Klinger et al., 2015) can be combined with machine learning based predictions for scoring peptide display (e.g., binding affinity) by an MHC molecule (e.g., HLA allele) (e.g., measured as a peptide-HLA binding score) or for determining a peptide-HLA immunogenicity metric. In some embodiments, the MHCflurry or NetMHCpan (Reynisson et al., 2020) computational methods (as known in the art) are used to predict MHC class I display of a peptide by an HLA allele (see Table 1). In some embodiments, the NetMHCIIpan computational method (Reynisson et al., 2020) is used to predict MHC class II display of a peptide by an HLA allele (see Table 2).
[0059] In some embodiments, computational methods such as MHCflurry (Odonnell et al., 2018, Odonnell et al., 2020, incorporated by reference in their entireties herein), NetMHCpan (Reynisson et al., 2020, incorporated by reference in its entirety herein), and NetMHCIIpan (Reynisson et al., 2020) are used to predict either MHC class I (MHCflurry, NetMHCpan) or class II (NetMHCIIpan) display of peptides by an HLA allele. In other embodiments, other methods of determining peptide-HLA binding are used as disclosed in International Publication No. WO 2005/042698, incorporated by reference in its entirety herein. NetMHCpan-4.1 and NetMHCIIpan-4.0 utilize the NNAlign_MA algorithm (Alvarez et al., 2019, incorporated by reference in its entirety herein) for predicting peptide-HLA binding. NNAlign_MA is in turn based upon the NNAlign (Nielsen et al., 2009, Nielsen et al., 2017, incorporated by reference in
their entireties herein) neural network. NetMHCpan-4.1 (Reynisson et al., 2020) uses NNAlign_MA networks with at least 180 inputs that describe the peptide sequence (9 x 20 = 180 inputs). Networks with both 56 and 66 hidden neurons and two outputs are utilized (Alvarez et al., 2019). Each network architecture (56 or 66 hidden neurons) is trained with 5 different random parameter initializations and 5-fold cross-validation resulting in a total of 50 individual trained networks (2 architectures x 5 initializations x 5 cross-validation). These 50 trained networks are used as an ensemble with 25 networks having at least 10,800 parameters (180 inputs x 56 neurons) and 25 networks consist of at least 11,880 parameters (180 inputs x 66 neurons). Thus, the ensemble of 50 networks in NetMHCpan-4.1 consists of at least 567,000 parameters that must be evaluated with at least 567,000 arithmetic operations for computing peptide-MHC binding. NetMHCIIpan-4.1 (Reynisson et al., 2020) uses NNAlign_MA networks with at least 180 inputs that describe the peptide sequence (9 x 20 = 180 inputs). Networks with 2, 10, 20, 40, and 60 hidden neurons and two outputs are utilized (Alvarez et al., 2019). Each network architecture (2, 10, 20, 40, or 60 hidden neurons) is trained with 10 different random parameter initializations and 5-fold cross-validation resulting in a total of 250 individual trained networks (5 architectures x 10 initializations x 5 cross-validation). These 250 trained networks are used as an ensemble with 50 networks having at least 360 parameters (180 inputs x 2 neurons), 50 networks having at least 1800 parameters (180 inputs x 10 neurons), 50 networks having at least 3600 parameters (180 inputs x 20 neurons), 50 networks having at least 7200 parameters (180 inputs x 40 neurons), and 50 networks having at least 10,800 parameters (180 inputs x 60 neurons). Thus, the ensemble of 250 networks in NetMHCIIpan-4.0 consists of at least 1,188,000 parameters that must be evaluated with at least 1,188,000 arithmetic operations for computing peptide-MHC binding.
[0060] A peptide is displayed by an MHC molecule when it binds within the groove of the MHC molecule and is transported to the cell surface where it can be recognized by a T cell receptor. A target peptide refers to a foreign peptide or a self-peptide. In some embodiments, a peptide that is part of the normal proteome in a healthy individual is a self-peptide, and a peptide that is not part of the normal proteome is a foreign peptide. In some embodiments, target peptides can be part of the normal proteome that exhibit aberrant expression (e.g., cancer-testis antigens such as NY-ESO-1). Foreign peptides can be generated by mutations in normal self- proteins in tumor cells that create epitopes called neoantigens, or by pathogenic infections. In some embodiments, a neoantigen is any subsequence of a human protein, where the subsequence contains one or more altered amino acids or protein modifications that do not appear in a healthy
individual. Therefore, in this disclosure, foreign peptide refers to an amino acid sequence encoding a fragment of a target protein/peptide (or a full-length protein/peptide), the target protein/peptide consisting of: a neoantigen protein, a pathogen proteome, or any other undesired protein that is non-self and is expected to be bound and displayed by an HLA allele. [0061] For example, KRAS gene mutations are the most frequently mutated oncogenes in cancer, but they have been very difficult to treat with small molecule therapeutics. The KRAS protein is part of a signaling pathway that controls cellular growth, and point mutations in the protein can cause constitutive pathway activation and uncontrolled cell growth. Single amino acid KRAS mutations result in minor changes in protein structure, making it difficult to engineer small molecule drugs that recognize a mutant specific binding pocket and inactivate KRAS signaling. KRAS oncogenic mutations include the mutation of position 12 from glycine to aspartic acid (G12D), glycine to valine (G12V), glycine to arginine (G12R), or glycine to cystine (G12C); or the mutation of position 13 from glycine to aspartic acid (G13D). The corresponding foreign peptides contain these mutations. KRAS is a member of the RAS family of genes that also includes HRAS and NRAS. KRAS, HRAS, and NRAS have identical sequences from residue 1 to residue 86. Thus, all of the vaccines and peptide sequences described herein for a mutation in one RAS family member can be used for the identical mutation in any other RAS family member (e.g., a KRAS G12D vaccine is also a vaccine for HRAS G12D). [0062] The BCR-ABL mutation is the result of the abnormal joining of the BCR gene from chromosome 22 with the ABL gene from chromosome 9 that results in a fusion of the two genes on chromosome 22. Differences in the fusion product formed result in different BCR-ABL transcripts, with b3a2 (also known as e14a2) and b2a2 (also known as e13a2) being the most prevalent. In a study of two hundred BCR-ABL affected patients 42% expressed b2a2, 41% expressed b3a2, and 18% expressed both transcripts (Jain et al., 2016). The abnormal b2a2 and b3a2 BCR-ABL fusions create novel protein sequences that contain foreign peptides at the junction of BCL and ABL. Disclosed herein is how these foreign peptides and their derivatives are used as neoantigen epitopes for vaccine design. [0063] A challenge for the design of peptide vaccines is the diversity of human MHC alleles (HLA alleles) that each have specific preferences for the peptide sequences they will display. The Human Leukocyte Antigen (HLA) loci, located within the MHC, encode the HLA class I
three loci that encode class II molecules (HLA-DR, HLA-DQ, and HLA-DP). An individual’s HLA type describes the alleles they carry at each of these loci. Peptides of length of between about 8 and about 11 residues can bind to HLA class I (or MHC class I) molecules whereas those peptides of length of between about 13 and about 25 residues bind to HLA class II (or MHC class II) molecules (Rist et al., 2013; Chicz et al., 1992). Human populations that originate from different geographies have differing frequencies of HLA alleles, and these populations exhibit linkage disequilibrium between HLA loci that result in population specific haplotype frequencies. In some embodiments, methods are disclosed for creating effective vaccines that include consideration of the HLA allelic frequency in the target population, as well as linkage disequilibrium between HLA genes to achieve a set of peptides that is likely to be robustly displayed. [0064] The present disclosure provides for compositions, systems, and methods of vaccine designs that produce immunity to single or multiple targets. In some embodiments, a target is a neoantigen protein sequence, a pathogen proteome, or any other undesired protein sequence that is non-self and is expected to be bound and displayed by an HLA molecule (also referred to herein as an HLA allele). When a target is present in an individual, it may result in multiple peptide sequences that are displayed by a variety of HLA alleles. In some embodiments, it may be desirable to create a vaccine that includes selected self-peptides, and thus these selected self- peptides are considered to be the target peptides for this purpose. [0065] The term peptide-HLA binding is defined to be the binding of a peptide to an HLA allele, and can either be computationally predicted, experimentally observed, or computationally predicted using experimental observations. The metric of peptide-HLA binding can be expressed as affinity, percentile rank, binary at a predetermined threshold, probability, or other metrics as are known in the art. The term peptide-HLA immunogenicity metric is defined as the activation of T cells based upon their recognition of a peptide when bound by an HLA allele. The term peptide-HLA immunogenicity score is another term for a peptide-HLA immunogenicity metric, and the terms are interchangeable. A peptide-HLA immunogenicity metric can vary from individual to individual, and the metric for peptide-HLA immunogenicity can be expressed as a probability, a binary indicator, or other metric that relates to the likelihood that a peptide-HLA combination will be immunogenic. In some embodiments, peptide-HLA immunogenicity is defined as the induction of immune tolerance based upon the recognition of a peptide when bound by an HLA allele. A peptide-HLA immunogenicity metric can be
computationally predicted, experimentally observed, or computationally predicted using experimental observations. In some embodiments, a peptide-HLA immunogenicity metric is based only upon peptide-HLA binding, since peptide-HLA binding is necessary for peptide- HLA immunogenicity. In some embodiments, peptide-HLA immunogenicity data or computational predictions of peptide-HLA immunogenicity can be included and combined with scores for peptide display in the methods disclosed herein. One way of combining the scores is using immunogenicity data for peptides assayed for immunogenicity in diseased or vaccinated individuals and assigning peptides to the HLA allele that displayed them in the individual by choosing the HLA allele that computational methods predict has the highest likelihood of display. For peptides that are not experimentally assayed, computational predictions of display can be used. In some embodiments, different computational methods of predicting peptide-HLA immunogenicity or peptide-HLA binding can be combined (Liu et al., 2020b). For a given set of peptides and a set of HLA alleles, the term peptide-HLA hits is the number of unique combinations of peptides and HLA alleles that exhibit peptide-HLA immunogenicity or binding at a predetermined threshold. For example, a peptide-HLA hit of 2 can mean that one peptide is predicted to be bound (or trigger T cell activation) by two different HLA alleles, two peptides are predicted to be bound (or trigger T cell activation) by two different HLA alleles, or two peptides are predicted to be bound (or trigger T cell activation) by the same HLA allele. For a given set of peptides and HLA frequencies, HLA haplotype frequencies, or HLA diplotype frequencies, the expected number of peptide-HLA hits is the average number of peptide-HLA hits in each set of HLAs that represent an individual, weighted by their frequency of occurrence. [0066] Because immunogenicity may vary from individual to individual, one method to increase the probability of vaccine efficacy is to use a diverse set of target peptides (e.g., at least two peptides) to increase the chances that some subset of them will be immunogenic in a given individual. Prior research using mouse models has shown that most MHC displayed peptides are immunogenic, but immunogenicity varies from individual to individual as described in Croft et al. (2019). In some embodiments, experimental peptide-HLA immunogenicity data are used to determine which target peptides and their modifications will be effective immunogens in a vaccine. [0067] Considerations for the design of peptide vaccines, are outlined in Liu et al., Cell Systems 11, Issue 2, p.131-146 (Liu et al., 2020) and (Liu et al., 2020b) and U.S. Patent No.
11,058,751 and U.S. Patent No.11,161,892 which are incorporated by reference in their entireties herein. [0068] Certain target peptides may not bind with high affinity to a wide range of HLA molecules. To increase the binding of target peptides to HLA molecules, their amino acid composition can be altered to change one or more anchor residues or other residues. In some embodiments, to increase the immunogenicity of a target peptide when displayed by HLA molecules, a target peptide’s amino acid composition can be altered to change one or more residues. Anchor residues are amino acids that interact with an HLA molecule and have the largest influence on the affinity of a peptide for an HLA molecule. Peptides with one or more altered amino acid residues are called heteroclitic peptides. In some embodiments, heteroclitic peptides include target peptides with residue modifications at anchor positions. In some embodiments, heteroclitic peptides include target peptides with residue modifications at non- anchor positions. In some embodiments, heteroclitic peptides include target peptides with residue modifications that include unnatural amino acids and/or amino acid derivatives. Modifications to create heteroclitic peptides can improve the binding of peptides to both MHC class I and MHC class II molecules, and the modifications required can be both peptide and MHC class specific. Since peptide anchor residues face the MHC molecule groove, they are less visible than other peptide residues to T cell receptors. Thus, heteroclitic peptides with anchor residue modifications have been observed to induce a T cell response where the stimulated T cells also respond to unmodified peptides. It has been observed that the use of heteroclitic peptides in a vaccine can improve a vaccine's effectiveness (Zirlik et al., 2006). In some embodiments, the immunogenicity of heteroclitic peptides are experimentally determined and their ability to activate T cells that also recognize the corresponding base (also called seed) peptide of the heteroclitic peptide is determined, as is known in the art (Houghton et al., 2007). In some embodiments, these assays of the immunogenicity and cross-reactivity of heteroclitic peptides are performed when the heteroclitic peptides are displayed by specific HLA alleles. Peptide vaccines to induce immunity to one or more targets [0069] In some embodiments, a method is provided for formulating peptide vaccines using a single vaccine design for one or more targets. In some embodiments, a single target is a foreign protein with a specific mutation (e.g., KRAS G12D). In some embodiments, a single target is a self-protein (e.g., a protein that is overexpressed in tumor cells such as cancer/testis antigens). In some embodiments a single target is a pathogen protein (eg a protein contained in a viral
proteome). In some embodiments, multiple targets can be used (e.g., both KRAS G12D and KRAS, or foreign peptides derived from BCL-ABL transcripts b2a2 and b3a2). [0070] In some embodiments, the method includes extracting peptides to construct a candidate set from all target proteome sequences (e.g., entire KRAS G12D protein) as described in Liu et al. (2020). [0071] FIG.s 1 and 2 depict flow charts for example vaccine design methods that can be used for MHC class I or MHC class II vaccine design. A Candidate Peptide Set (see FIG.1 and 2) is comprised of target peptides extracted by windowing an input protein sequence. In some embodiments, extracted target peptides are of amino acid length of between about 8 and about 10 (e.g., for MHC class I binding (Rist et al., 2013)). In some embodiments, the extracted target peptides presented by MHC class I molecules are longer than 10 amino acid residues, such as 11 residues (Trolle et al., 2016). In some embodiments, extracted target peptides are of length between about 13 and about 25 (e.g., for class II binding (Chicz et al., 1992)). In some embodiments, sliding windows of various size ranges described herein are used over the entire proteome. In some embodiments, other target peptide lengths for MHC class I and class II sliding windows can be utilized. In some embodiments, computational predictions of proteasomal cleavage are used to filter or select peptides in the candidate set. One computational method for predicting proteasomal cleavage is described by Nielsen et al. (2005). In some embodiments, peptide mutation rates, glycosylation, cleavage sites, or other criteria can be used to filter peptides as described in Liu et al. (2020). In some embodiments, peptides can be filtered based upon evolutionary sequence variation above a predetermined threshold. Evolutionary sequence variation can be computed with respect to other species, other pathogens, other pathogen strains, or other related organisms. In some embodiments, a first peptide set is the candidate set. [0072] In some embodiments, for the design of vaccines for foreign peptides that are generated by abnormal gene fusions, target peptides are extracted for inclusion in the Candidate Peptide Set from the gene fusion product where each extracted target peptide includes the breakpoint between the two genes. For example, in some embodiments for the design of the BCR-ABL vaccines, the BCR-ABL b3a2 (e14a2) and b2a2 (e13a2) chimeric protein sequences were obtained from NCBI (GenBank ID CAA10376.1 and CAA10377.1, respectively). For each isoform, sliding windows of length 8-11 (MHC class I) and 13-25 (MHC class II) were
procedures described herein in “MHC Class I Vaccine Design Procedure” and “MHC Class II Vaccine Design Procedure” where contacns the chimeric protein sequence, t specifies the
position of the breakpoint in the chimeric protein sequence, and s = true. For b3a2, the BCR- ABL junction disrupts a triplet codon, yielding a novel lysine (“K”) at the breakpoint (Clark et al., 2001). For b2a2, a codon disruption at the junction causes Asp to be altered to Glu, but this novel amino acid is also present at the normal a1a2 junction (Clark et al., 2001). Thus, for b3a2, all resulting windows spanning the “K” breakpoint were retained. For b2a2, only windows containing the sequence “KEE” were retained, eliminating windows that are found solely in BCR or ABL protein sequences. This procedure can be applied to generating vaccines for other abnormal gene fusions by identifying the breakpoint between the fused genes and utilizing the described windowing strategy. [0073] As shown in FIG.s 1-2, in some embodiments, the next step of the method includes scoring the target peptides in the candidate set for peptide-HLA binding to all considered HLA alleles as described in Liu et al. (2020) and Liu et al. (2020b). In some embodiments, a first peptide set is the candidate set after scoring the target peptides. Scoring can be accomplished for human HLA molecules, mouse H-2 molecules, swine SLA molecules, or MHC molecules of any species for which prediction algorithms are available or can be developed. Thus, vaccines targeted at non-human species can be designed with the method. Scoring metrics can include the affinity for a target peptide to an HLA allele in nanomolar, eluted ligand, presentation, and other scores that can be expressed as percentile rank or any other metric. The candidate set may be further filtered to exclude peptides whose predicted binding cores do not contain a particular pathogenic or neoantigen target residue of interest or whose predicted binding cores contain the target residue in an anchor position. The candidate set may also be filtered for target peptides of specific lengths, such as length 9 for MHC class I, for example. In some embodiments, scoring of target peptides is accomplished with experimental data or a combination of experimental data and computational prediction methods. When computational models are unavailable to make peptide-HLA binding predictions for particular (peptide, HLA) pairs, the binding value for such pairs can be defined by the mean, median, minimum, or maximum immunogenicity value taken over supported pairs, a fixed value (such as an indication of no binding), or inferred using other techniques, including a function of the prediction of the most similar (peptide, HLA) pair available in the scoring model.
[0074] In some embodiments, foreign peptides created by abnormal gene fusions are not eliminated when they contain a fusion breakpoint that falls on an MHC Class I or Class II anchor position for an HLA allele. For example, for the design of the BCL-ABL vaccines for MHC class I, no windows are eliminated when the BCR-ABL breakpoint falls within a peptide anchor position. For MHC class II, the scoring model requires the breakpoint to lie within the predicted 9-mer binding core for a given HLA (in any position), and scores for peptide-HLA pairs not meeting this criterion are eliminated. In some embodiments, for the design of BCR- ABL vaccines for MHC class I, windows are eliminated if the BCR-ABL breakpoint falls within a peptide anchor position. In some embodiments, for the design of BCR-ABL vaccines for MHC class II, peptide-HLA scores are eliminated if the BCR-ABL breakpoint lies within an anchor position of the predicted 9-mer binding core for a given peptide-HLA pair. In some embodiments, for the design of MHC class II vaccines, the gene fusion breakpoint can lie in any position either inside or outside of the predicted 9-mer binding core for a given peptide-HLA pair.
[0075] In some embodiments, a base set (also referred to as seed set herein) is constructed by selecting peptides from the scored candidate set using individual peptide-HLA binding or immunogenicity criteria (e.g. , first peptide set) (FIG. 1). In some embodiments, since a given peptide has multiple peptide-HLA scores, the selection can be based on the peptide-HLA binding score or peptide-HLA immunogenicity metric with the best affinity or highest immunogenicity (e.g., predicted to bind the strongest or activate T cells the most for a given HLA allele). The criteria used for scoring peptide-HLA binding during the scoring procedure can accommodate different goals during the base set selection and vaccine design phases. For example, a target peptide with peptide-HLA binding affinities of 500 nM may be displayed by an individual that is diseased, but at a lower frequency than a target peptide with a 50 nM peptide-HLA binding affinity. During the combinatorial design phase of a vaccine, a more constrained affinity criteria may be used (e.g., when selecting a third peptide set, the Vaccine for Target(s) in FIG.s 1 and 2), such a 50 nM, to increase the probability that a vaccine peptide will be found and displayed by HLA molecules. In some embodiments, a relatively less constrained threshold (e.g., less than about 1000 nM or less than about 500 nM) of peptide-HLA immunogenicity or peptide-HLA binding is used as a first threshold for filtering candidate peptide-HLA scores (the first Peptide Scoring and Score Filtering step in FIG.s 1 and 2) and a relatively more constrained second threshold (e.g., less than about 50 nM) is used for filtering expanded set peptide-HLA scores (the second Peptide Filtering and Scoring step in FIG.s 1 and
2) for their scores for specific HLA alleles. In some embodiments, specific peptide-HLA scores are not used for modified peptides for a given HLA for vaccine design when their unmodified counterpart peptide does not pass the first less constrained threshold. This filtering of peptide- HLA scores is based on the observation that peptides that are not immunogenic enough for vaccine inclusion may be antigenic (meet the first threshold) and thus recognized by T cell clonotypes expanded by a vaccine. A peptide is antigenic when it is recognized by a T cell receptor and results in a response such as CD8+ T cell cytotoxicity or CD4+ cell activation. Derivatives of an antigenic peptide may be strongly immunogenic, included in a vaccine, and thus activate and expand T cells that recognize the antigenic peptide. The expansion of T cells that recognize an unmodified antigenic peptide can provide an immune response that contributes to disease control. In some embodiments, peptides are scored for third peptide set (Vaccine for Target(s) in FIG.s 1 and 2) potential inclusion that have peptide-HLA binding affinities less than about 500 nM. In some embodiments, peptides are selected for the base set that have peptide- HLA binding affinities less than about 1000 nM for at least one HLA allele. Alternatively, predictions of peptide-HLA immunogenicity can be used to qualify target peptides for base set inclusion. In some embodiments, experimental observations of the immunogenicity of peptides in the context of their display by HLA alleles or experimental observation of the binding of peptides to HLA alleles can be used to score peptides for binding to HLA alleles or peptide- HLA immunogenicity.
[0076] In some embodiments, experimental observations of the display of peptides by specific HLA alleles in tumor cells can be used to score peptides for peptide-HLA binding or peptide-HLA immunogenicity. In some embodiments, experimental observations of the display of peptides tumor cells by a specific HLA allele can be used to score peptides for peptide-HLA binding or peptide-HLA immunogenicity for that HLA allele. In some embodiments, experimental observations of the display of peptides tumor cells can be used to score peptides for peptide-HLA binding or peptide-HLA immunogenicity, with the HLA allele(s) for a specific observed peptide selected from the HLA alleles present in the tumor that meet a predicted peptide-HLA binding or immunogenicity threshold. In some embodiments, mass spectrometry is used to experimentally determine the display of peptides by tumor cells as described by Bear et al. (2021) or Wang et al. (2019) and these data are used to score for peptide-HLA binding or peptide-HLA immunogenicity. In some embodiments, mass spectrometry is used to experimentally determine the display of peptides by tumor cells, and these experimental data are used to qualify the inclusion of base set (seed set) peptides for one or more HLA alleles for a
vaccine. In some embodiments, mass spectrometry is used to experimentally determine the display of a peptide by tumor cells, and these experimental data are used to exclude peptide- HLA binding scores or peptide-HLA immunogenicity scores for the peptide when the peptide is not observed to be displayed by an HLA allele by mass spectrometry. In some embodiments, mass spectrometry is used to experimentally determine the display of peptides by tumor cells in an individual, and these experimental data are used to qualify the inclusion of base set (seed set) peptides for that individual for one or more HLA alleles. In some embodiments, mass spectrometry is used to experimentally determine the display of a peptide by tumor cells in an individual, and these experimental data are used to exclude peptide-HLA binding scores or peptide-HLA immunogenicity scores for the peptide when the peptide is not observed to be displayed by an HLA allele by mass spectrometry. In some embodiments, computational predictions of the immunogenicity of a peptide in the context of display by HLA alleles can used for scoring such as the methods of Ogishi et al. (2019) or Bulik-Sullivan et al. (2019). [0077] In some embodiments, a peptide-HLA score or a peptide-HLA immunogenicity score for a first peptide in the base set (seed set) for a given HLA allele is eliminated and not considered during vaccine design if the wild-type peptide corresponding to the first peptide (e.g. the unmutated naturally occurring form for the peptide or a peptide in the respective species within a defined sequence edit distance) has a peptide-HLA score or a peptide-HLA immunogenicity score for the same HLA allele within a defined threshold. The threshold can be based upon the difference of the scores of the first peptide and the wild-type peptide, the ratio of the scores of the first peptide and the wild-type peptide, the score of the wild-type peptide, or other metrics. The defined threshold can be either greater than or less than a specified value. In some embodiments, the threshold is defined so that the wild-type peptide is not predicted to be presented. In some embodiments, when a peptide-HLA score or peptide-HLA immunogenicity score is eliminated for a first peptide during vaccine design, then peptide-HLA scores or peptide-HLA immunogenicity scores for all of its derivatives (e.g., heteroclitic peptide derivatives) for the same HLA allele are also eliminated and not considered during vaccine design. [0078] In some embodiments, the method further includes running the OptiVax-Robust algorithm as described in Liu et al. (2020) using the HLA haplotype frequencies of a population on the scored candidate set to construct a base set (also referred to as seed set herein) of target peptides (FIG.2). In some embodiments, HLA diplotype frequencies can be provided to
OptiVax. OptiVax-Robust includes algorithms to eliminate peptide redundancy that arises from the sliding window approach with varying window sizes, but other redundancy elimination measures can be used to enforce minimum edit distance constraints between target peptides in the candidate set. The size of the seed set is determined by a point of diminishing returns of population coverage as a function of the number of target peptides in the seed set. Other criteria can also be used, including a minimum number of vaccine target peptides, maximum number of vaccine target peptides, and desired predicted population coverage. In some embodiments, a predetermined population coverage is less than about 0.4, between about 0.4 and 0.5, between about 0.5 and 0.6, between about 0.6 and 0.7, between about 0.7 and 0.8, between about 0.8 and 0.9, or greater than about 0.9. Another possible criterion is a minimum number of expected peptide-HLA binding hits in each individual. In alternate embodiments, the method further includes running the OptiVax-Unlinked algorithm as described in Liu et al. (2020) instead of OptiVax-Robust.
[0079] The OptiVax-Robust method uses binary predictions of peptide-HLA immunogenicity, and these binary predictions can be generated as described in Liu et al. (2020b). The OptiVax-Unlinked method uses the probability of target peptide binding to HLA alleles and can be generated as described in Liu et al. (2020). In some embodiments, OptiVax- Unlinked and EvalVax-Unlinked are used with the probabilities of peptide-HLA immunogenicity. Either method can be used for the purposes described herein, and thus the term “OptiVax” refers to either the Robust or Unlinked method. In some embodiments, the observed probability of peptide-HLA immunogenicity in experimental assays can be used as the probability of peptide-HLA binding in EvalVax-Unlinked and OptiVax-Unlinked. In some embodiments, the HLA haplotype or HLA allele frequencies of a population provided to OptiVax for vaccine design describe the world’s population. In alternative embodiments, the HLA haplotype or HLA allele frequencies of a population provided to OptiVax for vaccine design are specific to a geographic region. In alternative embodiments, the HLA haplotype or HLA allele frequencies of a population provided to OptiVax for vaccine design are specific to an ancestry. In alternative embodiments, the HLA haplotype or HLA allele frequencies of a population provided to OptiVax for vaccine design are specific to a race. In alternative embodiments, the HLA haplotype or HLA allele frequencies of a population provided to OptiVax for vaccine design are specific to individuals with risk factors such as genetic indicators of risk, age, exposure to chemicals, alcohol use, chronic inflammation, diet, hormones, immunosuppression, infectious agents, obesity, radiation, sunlight, or tobacco use. In
alternative embodiments, the HLA haplotype or HLA allele frequencies of a population provided to OptiVax for vaccine design are specific to individuals that carry certain HLA alleles. In alternative embodiments, the HLA diplotypes provided to OptiVax for vaccine design describe a single individual and are used to design an individualized vaccine. [0080] In some embodiments, the base (or seed) set of target peptides (e.g., first peptide set) that results from OptiVax application to the candidate set of target peptides describes a set of unmodified target peptides that represent a possible compact vaccine design (Seed Set in FIG. 2). A base peptide is a target peptide that is included in the base or seed peptide set (e.g., first peptide set). In some embodiments, the seed set (e.g., first peptide set) is based upon filtering candidate peptide scores by predicted or observed affinity or immunogenicity with respect to HLA molecules (Seed Set in FIG.1). However, to improve the display of the target peptides in a wide range of HLA haplotypes as possible, some embodiments include modifications of the seed (or base) set. In some embodiments, experimental assays can be used to ensure that a modified seed (or base) peptide activates T cells that also recognize the base/seed peptide. [0081] For a given target peptide, the optimal anchor residue selection may depend upon the HLA allele that is binding to and displaying the target peptide and the class of the HLA allele (MHC class I or class II). A seed peptide set (e.g., first peptide set) can become an expanded set by including anchor residue modified peptides of either MHC class I or II peptides (FIG.s 1-2). Thus, one aspect of vaccine design is considering how to select a limited set of heteroclitic peptides that derive from the same target peptide for vaccine inclusion given that different heteroclitic peptides will have different and potentially overlapping population coverages. [0082] In some embodiments, all possible anchor modifications for each base set of target peptide are considered. There are typically two anchor residues in peptides bound by MHC class I molecules, typically at positions 2 and 9 for 9-mer peptides. In some embodiments, anchors for 8-mers, 10-mers, and 11-mers are found at positions 2 and n, where n is the last position (8, 10, and 11, respectively). For MHC class I molecules, the last position n is called the “C” position herein for carboxyl terminus. In some embodiments, at each anchor position, 20 possible amino acids are attempted in order to select the best heteroclitic peptides. Thus, for MHC class I binding, 400 (i.e., 20 amino acids by 2 positions = 202) minus 1 heteroclitic peptides are generated for each base target peptide. There are typically four anchor residues in peptides bound by MHC class II molecules, typically at positions 1, 4, 6, and 9 of the 9-mer
positions = 204) minus 1 heteroclitic peptides generated for each base target peptide. In some embodiments, more than two (MHC class I) or four (MHC class II) positions are considered as anchors. Other methods, including Bayesian optimization, can be used to select optimal anchor residues to create heteroclitic peptides from each seed (or base) set peptide. Other methods of selecting optimal anchor residues are presented in “Machine learning optimization of peptides for presentation by class II MHCs” by Dai et al. (2020), incorporated in its entirety herein. In some embodiments, the anchor positions are determined by the HLA allele that presents a peptide, and thus the set of heteroclitic peptides includes for each set of HLA specific anchor positions, all possible anchor modifications.
[0083] In some embodiments, for all of the target peptides in the base/seed set, new peptide sequences with all possible anchor residue modifications (e.g., MHC class I or class II) are created resulting in a new heteroclitic base set (Expanded set in FIG.s 1-2) that includes all of the modifications. In some embodiments, anchor residue modifications of a peptide are not included in the heteroclitic base set if one or more of the peptide’s anchor residue positions contains a substitution mutation that distinguishes the peptide from a self-peptide. In some embodiments, anchor residue modifications of a base/seed peptide are only included in the heteroclitic base set for peptide positions that do not contain a substitution mutation that distinguishes the base/seed peptide from a self-peptide. In some embodiments, anchor residue modifications of a peptide are not included in the heteroclitic base set when one or more of the peptide’s mutations does not occur between a pair of its adjacent anchor residues. In some embodiments, for all of the target peptides in the base/seed set, new peptide sequences with anchor residue modifications (e.g., MHC class I or class II) at selected anchor locations are created resulting in a new heteroclitic base set (Expanded set in FIG.s 1-2) that includes the selected modifications. In some embodiments, the anchor residue positions used for modifying peptides are selected from anchor residue positions determined by the HLA alleles considered during vaccine evaluation. In some embodiments, the heteroclitic base set (Expanded set in FIG.s 1-2) also includes the original seed (or base) set (Seed Peptide Set in FIG.s 1-2). In some embodiments, the heteroclitic base set includes amino acid substitutions at non-anchor residues. In some embodiments, modifications of base peptide residues is accomplished to alter binding to T cell receptors to improve therapeutic efficacy (Candia, et al. 2016). In some embodiments, the heteroclitic base set includes amino acid substitutions of non-natural amino acid analogs. The heteroclitic base set is scored for HLA affinity, peptide-HLA immunogenicity, or other metrics as described herein (another round of Peptide Scoring and Score Filtering as shown in FIG.s 1-
2). In some embodiments, the scoring predictions may be further updated for pairs of heteroclitic peptide and HLA allele, eliminating pairs where a heteroclitic peptide has a seed (or base) peptide from which it was derived that is not predicted to be displayed by the HLA allele at a specified threshold of peptide-HLA binding score or a specified peptide-HLA immunogenicity metric. In some embodiments, the peptide-HLA scores may also be filtered to ensure that predicted binding cores of the heteroclitic peptide displayed by a particular HLA allele align exactly in position with the binding cores of the respective seed (or base) set target peptide for that HLA allele. In some embodiments, the scoring predictions are filtered for an HLA allele to ensure that the heteroclitic peptides considered for that HLA allele are only modified at anchor positions determined by that HLA allele. Scoring produces a metric of peptide-HLA immunogenicity for peptides and HLA alleles that can be either binary, a probability of immunogenicity, or other metric of immunogenicity such as peptide-HLA affinity or percent rank, and can be based on computational predictions, experimental observations, or a combination of both computational predictions and experimental observations. In some embodiments, probabilities of peptide-HLA immunogenicity are utilized by OptiVax-Unlinked. In some embodiments, heteroclitic peptides are included in experimental assays such as MIRA (Klinger et al., 2015) or ELISPOT to determine their peptide-HLA immunogenicity metric with respect to specific HLA alleles. In some embodiments, the methods of Liu et al. (2020b), can be used to incorporate MIRA data for heteroclitic peptides into a model of peptide-HLA immunogenicity. In some embodiments, peptide-HLA immunogenicity metrics of heteroclitic peptides are experimentally determined and their ability to activate T cells that also recognize the corresponding seed (or base) peptide of the heteroclitic peptide is performed as is known in the art to qualify the heteroclitic peptide for vaccine inclusion (e.g., Houghton et al., 2007). In some embodiments, these assays of the immunogenicity and cross-reactivity of heteroclitic peptides are performed when the heteroclitic peptides are displayed by specific HLA alleles.
[0084] In some embodiments, experimental observations of the display of heteroclitic peptides by specific HLA alleles in cells can be used to score peptides for peptide-HLA binding or peptide-HLA immunogenicity. In some embodiments, mass spectrometry is used to experimentally determine the display of heteroclitic peptides by cells as described by Bear et al. (2021) or Wang et al. (2019) and these data are used to score for peptide-HLA binding or peptide-HLA immunogenicity. In some embodiments, mass spectrometry is used to experimentally determine the display of heteroclitic peptides by cells, and these experimental data are used to qualify the inclusion of heteroclitic peptides for inclusion in a vaccine. In some
embodiments, mass spectrometry is used to experimentally determine the display of a peptide by tumor cells, and these experimental data are used to exclude peptide-HLA binding scores or peptide-HLA immunogenicity scores for the peptide when the peptide is not observed to be displayed by an HLA allele by mass spectrometry. In some embodiments, mass spectrometry is used to experimentally determine the display of a heteroclitic peptide by cells with an HLA allele found in an individual, and these experimental data are used to qualify the inclusion of the heteroclitic peptide for inclusion in a vaccine for the individual. In some embodiments, mass spectrometry is used to experimentally determine the display of a peptide by tumor cells in an individual, and these experimental data are used to exclude peptide-HLA binding scores or peptide-HLA immunogenicity scores for the peptide when the peptide is not observed to be displayed by an HLA allele by mass spectrometry. In some embodiments, computational predictions of the immunogenicity of a heteroclitic peptide in the context of display by HLA alleles can used for scoring such as the methods of Ogishi et al. (2019) or Bulik-Sullivan et al. (2019). [0085] In some embodiments, a peptide in the heteroclitic base set is removed if (1) one of its anchor positions for an HLA allele corresponds to the location of a mutation in the base/seed peptide from which it was derived that distinguishes the base/seed peptide from a self-peptide, and (2) if the peptide-HLA binding or peptide-HLA immunogenicity of the self-peptide is stronger than a specified threshold for self-peptide binding or immunogenicity. This eliminates peptides in the heteroclitic base set that may cross-react with self-peptides as a result of sharing TCR facing residues with self-peptides. In some embodiments, the threshold for self-peptide binding is between approximately 500 nM to 1000 nM. [0086] In some embodiments, redundant peptides in the heteroclitic base set are removed. In some embodiments, a redundant peptide is a first heteroclitic peptide that has peptide-HLA immunogenicity scores or peptide-HLA binding scores that are less immunogenic for all scored HLAs than a second heteroclitic peptide in the heteroclitic base set, where both the first and second heteroclitic peptides are derived from the same base (or seed) peptide. In some embodiments, peptide redundancy is determined by only comparing peptide-HLA immunogenicity scores or peptide-HLA binding scores for HLA alleles where the peptide-HLA immunogenicity scores or peptide-HLA binding scores for both peptides for an HLA allele are more immunogenic than a given threshold (e.g., 50 nM for binding). In some embodiments, a redundant peptide is a first heteroclitic peptide that has an average peptide-HLA
immunogenicity score or peptide-HLA binding score that is less immunogenic than the average peptide-HLA immunogenicity score or peptide-HLA binding score of a second heteroclitic peptide in the heteroclitic base set, where both the first and second heteroclitic peptides are derived from the same base (or seed) peptide, and the average scores are computed for HLA alleles where the peptide-HLA immunogenicity scores or peptide-HLA binding scores for both peptides for an HLA allele are more immunogenic than a given threshold (e.g., 50 nM for binding). In some embodiments, a redundant peptide is a first heteroclitic peptide that has a weighted peptide-HLA immunogenicity score or peptide-HLA binding score that is less immunogenic than the weighted peptide-HLA immunogenicity score or peptide-HLA binding score of a second heteroclitic peptide in the heteroclitic base set, where both the first and second heteroclitic peptides are derived from the same base (or seed) peptide, and where the weighting is determined by the frequency of the HLA allele in a human population, and the weighted scores are computed for HLA alleles where the peptide-HLA immunogenicity scores or peptide- HLA binding scores for both peptides for an HLA allele are more immunogenic that a given threshold (e.g., 50 nM for binding). [0087] In some embodiments, the next step involves scoring the heteroclitic base set (the second peptide set) and filtering the resulting scores to create a second peptide set by comparing the peptide-HLA immunogenicity scores or peptide-HLA binding scores of the peptides for one or more HLA alleles to a threshold. In some embodiments, an affinity criterion of about 50 nM is used to increase the probability that a vaccine peptide will be found and displayed by HLA molecules. In some embodiments, the affinity criteria is more constrained than 50 nM (i.e., < 50 nM). In some embodiments, the affinity criteria is more constrained than about 500 nM (i.e., < 500 nM). In some embodiments, individual peptide-HLA binding scores or immunogenicity metrics are determined and thus a peptide may be retained as long as it meets the criteria for at least one HLA allele, and only peptide-HLA scores that meet the criteria are considered for vaccine design. [0088] In some embodiments, the next step involves inputting the second peptide set to OptiVax to select a compact set of vaccine peptides that maximizes predicted vaccine performance (Vaccine Performance Optimization; FIG.s 1-2). In some embodiments, predicted vaccine performance is a function of expected peptide-HLA binding affinity (e.g., a function of the distribution of peptide-HLA binding affinities across all peptide-HLA combinations for a given peptide set, or weighted by the occurrence of the HLA alleles in a population or
individual). In some embodiments, predicted vaccine performance is the expected population coverage of a vaccine. In some embodiments, predicted vaccine performance is the expected number peptide-HLA hits produced by a vaccine in a population or individual. In some embodiments, predicted vaccine performance requires a minimum expected number of peptide- HLA hits (e.g., 1, 2, 3, 4, 5, 6, 7, 8, or more) produced by a vaccine. In some embodiments, predicted vaccine performance is a function of population coverage and expected number of peptide-HLA hits desired produced by a vaccine. In some embodiments, predicted vaccine performance is a metric that describes the overall immunogenic properties of a vaccine where all of the peptides in the vaccine are scored for peptide-HLA immunogenicity for two or more HLA alleles (e.g., three or more HLA alleles). In some embodiments, predicted vaccine performance excludes immunogenicity contributions by selected HLA alleles above a maximum number of peptide-HLA hits (e.g., 1, 2, 3, 4, 5, 6, 7, 8, or more). In some embodiments, predicted vaccine performance excludes immunogenicity contributions of individual HLA diplotypes above a maximum number of peptide-HLA hits (e.g., 1, 2, 3, 4, 5, 6, 7, 8, or more). In some embodiments, predicted vaccine performance is the fraction of covered HLA alleles, which is the expected fraction of HLA alleles in each individual that have a minimum number of peptides (e.g , 1, 2, 3, 4, 5, 6, 7, 8, or more) with predicted peptide-HLA immunogenicity produced by a vaccine. In some embodiments, predicted vaccine performance is the expected fraction of HLA alleles in a single individual that have a minimum number of peptides (e.g , 1, 2, 3, 4, 5, 6, 7, 8, or more) with predicted peptide-HLA immunogenicity produced by a vaccine.
[0089] In some embodiments, a vaccine is designed by the iterative selection of peptides from the heteroclitic base set (also referred to as Expanded set as shown in FIG.s 1-2) at progressively less stringent criteria for predicted peptide immunogenicity or display. In some embodiments, a peptide is retained if at least one of its peptide-HLA scores is not eliminated by the thresholds employed. In some embodiments, OptiVax is first used to design a vaccine with a desired vaccine performance with specific peptide qualification criteria (e.g., seed HLA-peptide scores from the candidate set must bind to at least one MHC molecule at 500 nM or stronger, and peptide-HLA scores from the expanded set must bind to at least one MHC molecule at 50 nM or stronger). The vaccine that results from this application of OptiVax is then used as the foundation for vaccine augmentation with less stringent criteria (e.g., seed peptide-HLA scores from the candidate set must bind to at least one MHC molecule at 1000 nM or stronger, and peptide-HLA scores from the expanded set must bind to at least one MHC molecule at 100 nM or stronger) to further improve the desired vaccine performance. Methods for vaccine
augmentation are described in Liu et al. (2020b), incorporated by reference in its entirety herein. In some embodiments, multiple rounds of vaccine augmentation may be utilized. In some embodiments, the final augmented vaccine is the one selected. [0090] In some embodiments, selection of peptide sets to meet a desired predicted vaccine performance can be accomplished by computational algorithms other than OptiVax. In some embodiments, integer linear programming or mixed-integer linear programming is employed for selecting peptide sets instead of OptiVax. One example of an integer programming method for peptide set selection is described by Toussaint et al. (2008), incorporated by reference in its entirety herein. An example solver for mixed-integer linear programming is Python-MIP than can be used in conjunction with Toussaint et al. (2008). A second example of methods for vaccine peptide selection is described in “Maximum n-times Coverage for Vaccine Design” by Liu et al. (2021), incorporated by reference in its entirety herein. [0091] Predicted vaccine performance refers to a metric. Predicted vaccine performance can be expressed as a single numerical value, a plurality of numerical values, any number of non- numerical values, and a combination thereof. The value or values can be expressed in any mathematical or symbolic term and on any scale (e.g., nominal scale, ordinal scale, interval scale, or ratio scale). [0092] A seed (or base) peptide and all of the modified peptides that are derived from that seed (or base) peptide comprise a single peptide family. In some embodiments, in the component of vaccine performance that is based on peptide-HLA immunogenicity for a given HLA allele, a maximum number of peptides (e.g., 1, 2, 3, 4, 5, 6, 7, 8, or more) that are in the same peptide family are given computational immunogenicity credit for that HLA allele. This limit on peptide family immunogenicity limits the credit caused by many modified versions of the same base peptide. In some embodiments, the methods described herein are included for running OptiVax with an EvalVax objective function that corresponds to a desired metric of predicted vaccine performance. In some embodiments, population coverage means the proportion of a subject population that presents one or more immunogenic peptides that activate T cells responsive to a seed (or base) target peptide. The metric of population coverage is computed using the HLA haplotype frequency in a given population such as a representative human population. In some embodiments, the metric of population coverage is computed using marginal HLA frequencies in a population. Maximizing population coverage means selecting a
modified peptides; e.g., a first peptide set, second peptide set, or third peptide set) that collectively results in the greatest fraction of the population that has at least a minimum number (e.g., 1, 2, 3, 4, 5, 6, 7, 8, or more) of immunogenic peptide-HLA bindings based on proportions of HLA haplotypes in a given population (e.g., representative human population). In some embodiments, this process includes the OptiVax selection of heteroclitic peptides (as described in this disclosure) that activate T cells that respond to their corresponding seed (or base) peptide and the heteroclitic base peptides to improve population coverage. In some embodiments, the seed (or base) target peptides are always included in the final vaccine design. In some embodiments, peptides are only considered as candidates for a vaccine design (e.g., included in a first, second, and/or third peptide set) if they have been observed to be immunogenic in clinical data, animal models, or tissue culture models. [0093] Although heteroclitic peptides are used as exemplary embodiments in this disclosure, any modified peptide could be used in place of a heteroclitic peptide. A modified peptide is a peptide that has one or more amino acid substitutions of a target base/seed peptide. The amino acid substitution could be located at an anchor position or any other non-anchor position. [0094] In some embodiments, a candidate vaccine peptide (e.g., a base peptide or a modified peptide) is eliminated from vaccine inclusion if it activates T cells that recognize self-peptides (e.g., this can be achieved at the first and/or second round of Peptide Filtering and Sorting as shown in FIG.s 1-2). In some embodiments, a candidate vaccine peptide (e.g., a base peptide or a modified peptide) is computationally eliminated from vaccine inclusion if its outward facing amino acids when bound by an HLA allele are similar to outward facing self-peptide residues that are presented by the same HLA allele, where similarity can be defined by identity or defined similarity metrics such as BLOSUM matrices (BLOSUM matrices are known in the art). Testing a vaccine peptide for its ability to activate T cells that recognize self-peptides can be experimentally accomplished by the vaccination of animal models followed by ELISPOT or other immunogenicity assay or with human tissue protocols. In both cases, models with HLA alleles that present the vaccine peptide are used. In some embodiments, human primary blood mononuclear cells (PBMCs) are stimulated with a vaccine peptide, the T cells are allowed to grow, and then T cell activation with a self-peptide is assayed as described in Tapia-Calle et al. (2019) or other methods as known in the art. In some embodiments, the vaccine peptide is excluded from vaccine inclusion if the T cells are activated by the self-peptide. In some embodiments, computational predictions of the ability of a peptide to activate T cells that also
recognize self-peptides can be utilized. These predictions can be based upon the modeling of the outward facing residues from the peptide-HLA complex and their interactions with other peptide residues. In some embodiments, a candidate vaccine peptide (e.g., a base peptide or a modified peptide) is eliminated from vaccine inclusion or experimentally tested for cross-reactivity if it is predicted to activate T cells that also recognize self-peptides based upon the structural similarity of the peptide-MHC complex of the candidate peptide (e.g., a base peptide or a modified peptide) and the peptide-MHC complex of a self-peptide. One method for the prediction of peptide-MHC structure is described by Park et al. (2013). [0095] In some embodiments, the peptide-HLA binding score or peptide-HLA immunogenicity metric for a candidate heteroclitic vaccine peptide (e.g., a modified peptide) and HLA allele is eliminated from consideration during vaccine design if the candidate heteroclitic vaccine peptide does not activate T cells that recognize its corresponding base/seed target peptide (second round of Peptide Scoring and Score Filtering, FIG.s 1-2) for the given HLA allele. In some embodiments, a heteroclitic vaccine peptide (e.g., a modified peptide) is eliminated from a vaccine design if the candidate heteroclitic vaccine peptide does not activate T cells that recognize its corresponding base/seed target peptide (second round of Peptide Scoring and Score Filtering, FIG.s 1-2) for a given HLA allele. Testing a candidate heteroclitic peptide (e.g., a modified peptide) for its ability to activate T cells that recognize its corresponding seed (or base) target peptide with respect to the same HLA allele can be experimentally accomplished by the vaccination of animal models followed by ELISPOT or other immunogenicity assay or with human tissue protocols. In both cases, models with HLA alleles that present the heteroclitic peptide are used. In some embodiments, human PBMCs are stimulated with the heteroclitic peptide, the T cells are allowed to grow, and then T cell activation with the seed (or base) target peptide is assayed as described in Tapia-Calle et al. (2019) or using other methods known in the art. In some embodiments, computational predictions of the ability of a heteroclitic peptide to activate T cells that also recognize the corresponding seed (or base) target peptide can be utilized. These predictions can be based upon the modeling of the outward facing residues from the peptide-HLA complex and their interactions with other peptide residues. In some embodiments, the structural similarity of the peptide-HLA complex of a heteroclitic peptide and the peptide-HLA complex of the corresponding seed (or base) target is used to qualify heteroclitic peptides for vaccine inclusion or to require experimental immunogenicity testing before vaccine inclusion.
[0096] TCR Interface Divergence (TCRID) is the Least Root Mean Square Deviation of the difference between a first peptide’s TCR facing residues’ 3D positions and the corresponding residue positions of a second peptide with respect to a specific HLA allele. In some embodiments, other metrics are used for the TCRID instead of Least Root Mean Square Deviation. In some embodiments, other metrics are used for the TCRID that include position deviations in non-TCR facing residues and MHC residues from the specific HLA allele. In some embodiments, TCRID is used to predict if two peptides when displayed by a given HLA allele will activate the same T cell clonotypes. In some embodiments, FlexPepDock (London et al., 2011, incorporated by reference in its entirety herein) or DINC (Antunes et al., 2018, incorporated by reference in its entirety herein) in conjunction with the crystal structures of HLA molecules can be used to compute TCRID metrics for pairs of peptides given an HLA molecule. In some embodiments, TCRID is computed by (1) determining the 3D peptide-HLA structures for two different peptides bound by a specific HLA allele, (2) aligning the HLA alpha helices of the peptide-HLA structures, and (3) computing the Least Root Mean Square Deviation of the difference between the TCR facing residues of the two peptides with respect to the aligned alpha helix reference frame. [0097] In some embodiments, the second Peptide Scoring and Score Filtering step in FIG.s 1 and 2 will eliminate the peptide-HLA binding or immunogenicity score for a heteroclitic peptide for a specific HLA allele when the HLA specific TCRID between the heteroclitic peptide and its corresponding base (or seed) peptide from which it was derived is over a first TCRID threshold. In some embodiments, the second Peptide Scoring and Score Filtering step in FIG.s 1 and 2 will eliminate all peptide-HLA binding or immunogenicity scores for a heteroclitic peptide when a HLA specific TCRID between the heteroclitic peptide and its corresponding unmutated self- peptide from which it was derived is under a second TCRID threshold. In some embodiments, the first Peptide Scoring and Score Filtering step in FIG.s 1 and 2 will eliminate all peptide-HLA binding or immunogenicity scores for a candidate peptide when the HLA specific TCRID between the peptide and its corresponding unmutated self-peptide is under a third TCRID threshold. In some embodiments, any of the TCRID thresholds are determined by experimentally observing or computationally predicting the cross-reactivity of TCR molecules to peptide-HLA complexes. [0098] FIG.3 (MHC class I) and FIG.4 (MHC class II) show the predicted population coverage of OptiVax-Robust selected single target-specific vaccines with differing number of
peptides designed for the KRAS mutations G12D, G12V, G12R, G12C, and G13D. FIG.s 3-4 show that as the number of peptides increases for a vaccine, its predicted population coverage increases. The population coverage shown in FIG.s 3-4 are of those individuals that have the specific mutation that the vaccine is designed to cover. An increase in peptide count will also typically cause the average number of peptide-HLA hits in each individual to increase in the population. [0099] FIG.5 shows predicted population coverage for MHC class I vaccines by vaccine size for the BCL-ABL fusion that produces b3a2. FIG.6 shows predicted population coverage for MHC class II vaccines by vaccine size for the BCL-ABL fusion that produces b3a2. FIG.7 shows predicted population coverage for MHC class I vaccines by vaccine size for the BCL- ABL fusion that produces b2a2. FIG.8 shows predicted population coverage for MHC class II vaccines by vaccine size for the BCL-ABL fusion that produces b2a2. [0100] OptiVax can be used to design a vaccine to maximize the fraction/proportion of the population whose HLA molecules are predicted to bind to and display at least p peptides from the vaccine. In some embodiments, this prediction (e.g., scoring) includes experimental immunogenicity data to directly predict at least p peptides will be immunogenic. The number p is input to OptiVax, and OptiVax can be run multiple times with varying values for p to obtain a predicted optimal target peptide set for different peptide counts p. Larger values of p will increase the redundancy of a vaccine at the cost of more peptides to achieve a desired population coverage. In some embodiments, it may not be possible to achieve a given population coverage given a specific heteroclitic base set. In some embodiments, the number p is a function of the desired size of a vaccine. [0101] The methods described herein can be used to design separate vaccine formulations for MHC class I and class II-based immunity. [0102] In some embodiments, this procedure is used to create a vaccine for an individual. In some embodiments, the target peptides present in the individual are determined by sequencing the individual’s tumor RNA or DNA and identifying mutations that produce foreign peptides. One embodiment of this method is described in U.S. Patent No.10,738,355, incorporated in its entirety herein. In some embodiments, peptide sequencing methods are used to identify target peptides in the individual. One embodiment of this is described in U.S. Publication No. 2011/0257890 In some embodiments the target peptides used for the individual’s vaccine are
selected when a self-peptide, foreign peptide, pathogen peptide or RNA encoding a self-peptide, foreign peptide or pathogen peptide is observed in a specimen from the individual is present at a predetermined level. The target peptides in the individual are used to construct a vaccine as disclosed herein. For vaccine design, OptiVax is provided a diplotype comprising the HLA type of the individual. In an alternative embodiment, the HLA type of an individual is separated into multiple diplotypes with frequencies that sum to one, where each diplotype comprises one or more HLA alleles from the individual and a notation that the other allele positions should not be evaluated. The use of multiple diplotypes will cause OptiVax’s objective function to increase the chance that immunogenic peptides will be displayed by all of the constructed diplotypes. This achieves the objective of maximizing the number of distinct HLA alleles in the individual that exhibit peptide-HLA immunogenicity and thus improves the allelic coverage of the vaccine in the individual. [0103] FIG.9 shows the predicted vaccine performance (predicted number of peptide-HLA hits) of ten example G12V MHC class I vaccines for a single individual with the MHC class I HLA diplotype HLA-A02:03, HLA-A11:01, HLA-B55:02, HLA-B58:01, HLA-C03:02, and HLA-C03:03. OptiVax was used to design ten G12V MHC class I vaccines for this HLA diplotype with peptide counts ranging from 1 to 10. For the results in FIG.9, OptiVax was run with six synthetic diplotypes, each equally weighted, each with one HLA allele from the individual’s HLA diplotype, and the other allele positions marked to not be evaluated. The 10 peptide vaccine in FIG.9 comprises SEQ ID NO: 3 (GAVGVGKSL), SEQ ID NO: 4 (LMVVGAVGV), SEQ ID NO: 7 (VVGAVGVGK), SEQ ID NO: 14 (GPVGVGKSV), SEQ ID NO: 69 (LMVVGAVGI), SEQ ID NO: 72 (LMVVGAVGL), SEQ ID NO: 131 (GAVGVGKSM), SEQ ID NO: 138 (GPVGVGKSA), SEQ ID NO: 142 (VTGAVGVGK), and SEQ ID NO: 198 (VAGAVGVGM). Two peptides, SEQ ID NO: 3 (GAVGVGKSL) and SEQ ID NO: 131 (GAVGVGKSM), are predicted to bind two of the HLA alleles with an affinity of 50 nM or less. MHC Class I Vaccine Design Procedure [0104] In some embodiments, MHC class I vaccine design procedures consist of the following computational steps. [0105] In some embodiments, the inputs for the computation are:
P1...n- Peptide sequence (length n) containing the neoantigen or pathogenic target(s) of interest (e.g., KRAS G12D, KRAS G12V, KRAS G12R, KRAS G12C, KRAS G13D, BCR-ABL b3a2, BCR-ABL b2a2). Pt denotes the amino acid at position i. t: Position of target mutation in P, t ∈ [1, ... n] (e.g, t = 12 for KRAS G12D). s: Substitution mutation s G [true, false] is true if the mutation is a substitution, and false if the mutation is a deletion or insertion or the peptide does not contain a mutation (such as in pathogen targets). When the mutation is a deletion or insertion then tindicates the position immediately before the deletion or insertion. Threshold for potential presentation of peptides byMHC for peptide-MHC scoring
(e.g., 500 nM binding affinity) Threshold for predicted display of peptides by MHC for peptide-MHC scoring ( .g.,
50 nM binding affinity) Set of HLA alleles (for HLA-A, HLA-B, HLA-C loci)
Population haplotype frequencies (for OptiVax optimization and coverage
evaluation ).
N: Parameter for EvalVax and OptiVax objective function. Specifies minimum number of predicted per-individual hits for population coverage objective to consider the individual covered. Default = 1 (computes P(n ≥ 1) population coverage).
[0106] In some embodiments, Peptide-HLA Scoring Functions used are:
ScorePotential : Scoring function mapping a (peptide, HLA allele) pair to a
prediction of peptide-HLA display. If predicted affinity
then returns 1, else returns 0. Options include MHCfluny, NetMHCpan, PUFFIN, ensembles, or alternative metrics or software maybe used, including models calibrated against immunogenicity data.
SCOREDISPLAY : Scoring function mapping a (peptide, HLA allele) pair to a
prediction of peptide-HLA display. If predicted affinity then returns 1, else
returns 0. Options include MHCfluny, NetMHCpan, PUFFIN, ensembles, or alternative
metrics or software maybe used, including models calibrated against immunogenicity data.
[0107] Next, from the seed protein sequence ( ), a set
of windowed native peptides spanning the protein sequence(s) is constructed. only produces set members when
the subscripts are within the range of the defined seed protein P. In some embodiments, 8-mers, 9-mers, 10-mers, and 11-mers are produced, but this process can be performed with any desired window lengths and the resulting peptide sets combined. In some embodiments, only 9-mers are produced.
[0108] The second condition excludes peptides where the mutation at t
is in positions P2 or Pk of the windowed /Lmer peptide (i.e., the anchor positions) and the mutation is a substitution.
MHC Class I Vaccine Design Procedure with Defined Peptide Set P
[0109] Next, each peptide sequence in
is scored against all HLA alleles in
for potential presentation using SCOREPOTENTIAL (with threshold = 500 nM) and store results in a
Note that S is a binary matrix where 1 indicates the HLA is predicted to potentially present the peptide, and 0 indicates no potential presentation.
Thus, B contains the native peptides that are predicted to be potentially presented by at least 1 HLA.
Create a set of all heteroclitic peptides B' stemming from peptides in :
where ANCHOR-MODIFIED(b ) returns a set of all 399 anchor-modified peptides stemming from b (with all possible modifications to the amino acids at P2 and P9).
[0110] Next, all heteroclitic candidate peptides e.g., modified peptides) in B' are scored against all HLA alleles in
for predicted display using SCOREDISPLAY (with threshold = 50
nM), and store results in binary
[0111] Next, an updated scoring matrix
is computed for heteroclitic peptides conditioned on the potential presentation of the corresponding base peptides by each HLA:
where each heteroclitic peptide b' ∈ B' is a mutation of base peptide b EB. This condition enforces that if h was not predicted to potentially present b, then all heteroclitic peptides b' derived from b will not be displayed by h (even if h would otherwise be predicted to display b').
[0112] In some embodiments, OptiVax-Robust is used to design a final peptide set (e.g., third peptide set) from the union of base peptides and heteroclitic peptides B U B' (with corresponding scoring matrices S and for B and B', respectively). OptiVax will output m sets
where m is the largest vaccine size requested from OptiVax. Let Vk denote
the compact set of vaccine peptides output by OptiVax containing k peptides. Note that
is not necessarily a superset of
In alternate embodiments, OptiVax can be used to augment the base set B with peptides from B' using scoring matrix to have OptiVax return set , and the
final vaccine set consists of peptides
[0113] In some embodiments, this procedure is repeated independently for each target of interest, and the resulting independent vaccine sets can be merged into a combined vaccine as described below.
MHC Class II Vaccine Design Procedure [0114] In some embodiments, MHC class II vaccine design procedures consist of the following computational steps. [0115] In some embodiments, the inputs for the computation are:
N: Parameter for EvalVax and OptiVax objective function. Specifies minimum number of predicted per-individual hits for population coverage objective to consider the individual covered. Default = 1 (computes P(n ≥ 1) population coverage). [0116] In some embodiments, Peptide-HLA Scoring Functions used are:
Options include NetMHCIIpan, PUFFIN, ensembles, or alternative metrics or software may be used, including models calibrated against immunogenicity data.
[0117] Next, from the seed protein sequence (P), a set
of peptides spanning the protein sequence are constructed. only produces set members when the subscripts are within
the range of the defined seed protein P. Here, we extract all windowed peptides of length 13–25 spanning the target mutation, but this process can be performed using any desired window lengths (e.g., only 15-mers).
where contains all sliding windows of length k, which are combined to form ^^. Note that here (unlike MHC class I), no peptides are excluded based on binding core or anchor residue positions (for MHC class II, filtering is performed as described in this disclosure). MHC Class II Vaccine Design Procedure with Defined Peptide Set
[0118] Next, each peptide sequence in
is scored against all HLA alleles in
for potential presentation using SCOREPOTENTIAL (with threshold τ1 = 500 nM) and store results in a
Note that S1 is a binary matrix where 1 indicates the HLA is predicted to potentially present the peptide, and 0 indicates no potential presentation. [0119] For each (peptide, HLA allele) pair (p, h), identify/predict the 9-mer binding core using FINDCORE. The predicted binding core is recorded in a matrix C:
[0120] Next, if not(s) then otherwise an updated scoring matrix S2 is
computed for native peptides in
where Pt is the target residue of interest (e.g., the mutation site of KRAS G12D). This condition enforces the target residue to fall within the binding core at a non-anchor position for all (peptide, HLA allele) pairs with non-zero scores in S2 , and allows the binding core to vary by allele per peptide (as the binding cores of a particular peptide may differ based on the HLA allele presenting the peptide). Thus, for each pair (p, h), if the predicted binding core specifies the target residue Pt
at an anchor position (P1, P4, P6, or P9 of the 9-mer core), or if Pt is not contained within the binding core, then S2 [p, h] = 0. In an alternate embodiment, Pt can be located outside of the core or inside the core in a non-anchor position. In some embodiments, Pt can only be located at specific positions inside and/or outside of the core. In some embodiments, the binding core predictions in C are accompanied by prediction confidences. In some embodiments, if the confidence for predicted core is below a desired threshold (e.g., 0.5, 0.6,
0.7, 0.8, or 0.9), then S2 [p, h] = 0. [0121] Next, OptiVax-Robust is run with peptides
and scoring matrix S2 to identify a non- redundant base set of peptides . (In alternate embodiments, B can be chosen as the entire
set ^ rather than identifying a non-redundant base set.) [0122] Next, a set of all heteroclitic peptides B' is created stemming from peptides in B:
where ANCHOR-MODIFIED(b,c) returns a set of all 204 – 1 anchor-modified peptides stemming from b with all possible modifications to the amino acids at P1, P4, P6, and P9 of the 9-mer binding core c. Thus, for each base peptide b, the heteroclitic set B' contains all anchor-modified peptides b' with modifications to all unique cores of b identified for any HLA alleles that potentially present b with a valid core position as indicated by scoring matrix S2. [0123] Next, all heteroclitic candidate peptides (e.g., modified peptides) in B' are scored against all HLA alleles in for predicted display using SCOREDISPLAY
[0124] For each (heteroclitic peptide, HLA allele) pair (b',h), identify/predict the 9-mer binding core using FINDCORE. The predicted binding core is recorded in a matrix C'::
[0125] An updated scoring matrix is computed for heteroclitic peptides conditioned on
the identified binding cores of a heteroclitic and base peptides occurring at the same offset by a particular HLA:
where each heteroclitic peptide b' ∈ B' is a mutation of base peptide b ∈ B. This condition enforces the binding core of the heteroclitic peptide b' to be at the same relative position as the base peptide b, and, implicitly, enforces that the target residue Pt still falls in a non-anchor position within the 9-mer binding core (Step 3). [0126] An updated scoring matrix
is computed for heteroclitic peptides conditioned on the potential presentation of the corresponding base peptides by each HLA:
where each heteroclitic peptide b' ∈ B' is a mutation of base peptide b ∈ B. This condition enforces that if h was not predicted to display b, then all heteroclitic peptides b'
derived from b will not be displayed by h (even if h would otherwise be predicted to display b'). [0127] OptiVax-Robust is used to design a final peptide set (e.g., third peptide set) from the union of base peptides and heteroclitic peptides B ∪ B' (with corresponding scoring matrices S2 and for B and B', respectively). OptiVax will output m sets where m is
the largest vaccine size requested from OptiVax. Let denote the compact set of vaccine
peptides output by OptiVax containing k peptides. Note that
is not necessarily a superset of In alternate embodiments, OptiVax can be used to augment the base set B with peptides from B' using scoring matrix
to have OptiVax return set and the final vaccine set
consists of peptides
[0128] In some embodiments, this procedure is repeated independently for each single target of interest, and the resulting independent vaccine sets can be merged into a combined vaccine as described below. MHC Class I or Class II Vaccine Design Method Prioritizing Peptide Conservation [0129] In some embodiments, peptide sequences that are more conserved across strains, species, or other protein sources of interest are prioritized for vaccine inclusion. In some embodiments, a set of related protein sequences called protein variants are considered for vaccine design. A protein variant is one instance of a family of protein sequences, and protein variants can be sequences from various species, pathogen strains, viral strains, or other variations considered for vaccine design. In some embodiments, each protein variant has an associated probability called a protein variant probability, where the sum of all protein variant probabilities for the supplied set of protein variants is one. In some embodiments, multiple proteins of interest can be considered for the design of a single vaccine using an MHC Class I or Class II vaccine design method prioritizing peptide conservation. In these embodiments, protein variants for all proteins of interest are collectively considered for generating candidate peptides. In some embodiments, the protein variant probabilities across all of the considered multiple proteins sum to one. [0130] A set of candidate peptides are created from each protein variant using a sliding window method that parses the protein variant into peptide sequences. In some embodiments, for MHC Class I 8-mers, 9-mers, 10-mers, and/or 11-mers are produced, but this process can be
performed with any desired window lengths and the resulting peptide sets combined. In some embodiments, for MHC Class I, only 9-mers are produced. In some embodiments, for MHC Class II, all windowed peptides of length 13–25 are produced, but this process can be performed using any desired window lengths (e.g., only 15-mers). In some embodiments, peptides that are predicted to be glycosylated in a given protein variant are removed and not considered for that variant as described in Liu et al.2020 which is incorporated by reference herein in its entirety. [0131] In some embodiments, for each generated peptide sequence (MHC Class I or Class II) conservation is defined as the fraction of input protein variants where the peptide sequence occurs. For example, if a given 9-mer peptide sequence occurs in the peptides generated from 90% of the protein variants provided as input, its conservation is .90. In some embodiments, conservation is defined for each generated peptide sequence (MHC Class I or Class II) as the sum of the protein variant frequencies where the peptide sequence occurs. For example, if a given 9-mer peptide sequence occurs in the peptides generated from protein variants with protein variant probabilities of 0.10 and 0.20, its conservation is 0.30. In some embodiments, this functionality is implemented by a ComputeConservation function that computes the sum of the frequencies of the protein variants that contain a peptide sequence. In some embodiments, when sufficient protein variants are not sufficient for computing expected future conservation a method of predicting conservation can be used to implement ComputeConservation, such as the one found in Hie et al., 2021 which is incorporated by reference herein in its entirety. [0132] In some embodiments, vaccine design considers conservation by prioritizing peptides for vaccine inclusion that are more conserved than others to meet a desired vaccine performance metric. In some embodiments, the vaccine design method attempts to first design a vaccine with candidate peptides that all meet a first conservation threshold, and if the desired vaccine performance is not met, it iteratively adds additional peptides with less stringent conservation to attempt to meet the desired vaccine performance metric. In some embodiments, vaccine design prioritizing conservation proceeds by setting a vaccine design D to be an empty set, and then performing the steps of: (1) selecting candidate peptides in which each peptide passes a conservation threshold to create a candidate peptide set and is not in D, (2) selecting vaccine designs having varying peptide numbers/combinations from this candidate set to optimize a vaccine performance metric using methods disclosed herein for MHC Class I or Class II vaccine design to augment the vaccine design contained in D (one implementation of vaccine augmentation is described in (Liu et al., 2021), incorporated by reference in its entirety herein),
(3) selecting the smallest vaccine peptide set design from Step 2 that either meets the desired vaccine performance metric or where adding one more peptide to the selected set does not provide a desired minimum improvement in the vaccine performance metric, (4) if a vaccine peptide set was found in Step 3, adding the vaccine peptide set design from Step 3 to the vaccine design D, and (5) determining whether the vaccine design D meets a desired vaccine performance metric objective, and if so, return vaccine design D as the final vaccine design. If at Step 6, the vaccine design D fails to meet the desired vaccine performance metric objective, the computation continues with the following steps: (6) setting an updated conservation threshold to be lower than the current conservation threshold (less constrained) and (7) repeating the process starting at Step 1 retaining the current vaccine design D and current candidate set until either a desired vaccine performance metric objective is reached at Step 6, or the updated conservation threshold is lower than a minimum desired conservation threshold. If on any iteration, the updated conservation threshold is lower than a minimum desired conservation threshold, the latest version of vaccine design D will be used as the final vaccine design. When the process completes, the final vaccine design D includes all of the peptides that can be used in a vaccine. [0133] In some embodiments, MHC class I or class II vaccine design procedures consist of the following computational steps. [0134] In some embodiments, the inputs for the computation are:
[0135] The protein variant sequences Pj are used to produce windowed peptides that span the protein sequence(s) starting at each location m with a peptide length of k residues. The result is the set Xj that contains all of the peptide sequences in protein variant only
produces a sequence when the subscripts are within the range of the defined protein Pj. In some embodiments for MHC Class I, k is chosen to produce 8-mers, 9-mers, 10-mers, and 11-mers, but this process can be performed with any desired window lengths and the resulting peptide sets combined. In some embodiments for MHC Class I, only 9-mers are produced. In some embodiments for MHC Class II, we extract all windowed peptides of length 13–25, but this process can be performed using any desired window lengths (e.g., only 15-mers).
[0136] In some embodiments for MHC Class I, the second condition m ≠ {t – (k-1), t – 1} excludes peptides where the mutation at t is in positions P2 or Pk of the windowed k-mer peptide
(i.e., the anchor positions) and the mutation is a substitution and if for MHC Class I design. MHC Class II anchor positions are filtered in the MHC Class II design method. [0137] Create the set of all peptides B that occur in any input protein variant.
[0138] For each peptide ^^௪ in ^^ its conservation metric Cw is computed using COMPUTECONSERVATION Cw = COMPUTECONSERVATION[ Bw, X, O] [0139] The current conservation threshold is then set to the initial conservation threshold
[0140] At Step 1, candidate peptides are selected where each peptide passes a conservation threshold to create a candidate peptide set and is not in D. A set of peptide candidates
is defined such that each candidate peptide meets the current conservation threshold c^ and the peptide candidate is not already in D. D is set to empty (0 peptides) on the first iteration of the computational steps.
[0141] At Step 2, vaccine designs are selected having varying peptide numbers/combinations from the candidate set to optimize a vaccine performance metric using methods disclosed herein for MHC Class I or Class II vaccine design to augment the vaccine design contained in D. The peptide set is provided to “MHC Class I Vaccine Design
Procedure with Defined Peptide Set
for MHC Class I and “MHC Class II Vaccine Design Procedure with Defined Peptide Set
for MHC Class II. The peptide set
is provided as the set of candidates to augment the set D. Both the set
and D are provided to OptiVax which uses D as the fixed starting set and augments D with peptides from the set
using vaccine augmentation as described in (Liu et al., 2021), incorporated by reference in its entirety herein. OptiVax-Robust is used to augment the set D with peptides from
using the scoring matrices as
defined in “MHC Class I Vaccine Design Procedure with Defined Peptide Set
for MHC Class I and “MHC Class II Vaccine Design Procedure with Defined Peptide Set
for MHC Class II, and returns sets where each set is a compact set of vaccine peptides output by
OptiVax containing s peptides. In some embodiments, the steps to modify anchor positions are not utilized in the MHC Class I or MHC Class II vaccine design methods and only the base peptides B are utilized for vaccine design. In some embodiments, positions in addition to anchor positions are modified in the MHC Class I or MHC Class II vaccine design methods utilized to create B’. [0142] At Step 3, the smallest vaccine peptide set design is selected from Step 2 that either meets the desired vaccine performance metric or where adding one more peptide to the selected set does not provide a desired minimum improvement in the vaccine performance metric. A vaccine design
is chosen that meets minimum requirements. In some embodiments, the vaccine design
is chosen with the value s chosen to be the minimum value of s such that the difference in vaccine performance between and is less than
v
d. In some
embodiments, the value s is chosen to be the minimum value such that the vaccine performance metric of meets the final vaccine performance metric v. In some embodiments, is
not necessarily a superset of
[0143] At Step 4, if a vaccine peptide set was found in Step 3, it is added to the vaccine peptide set design D. If an acceptable vaccine design was found in Step 4, the vaccine
design set D is updated to consist of
[0144] At Step 5, it is determined whether the vaccine design D meets a desired vaccine performance metric objective. If the vaccine design set D meets the final vaccine performance design metric v, return D as the final design. [0145] At Step 6, the conservation threshold is updated to be lower than the current conservation threshold (less constrained). If the vaccine design set D does not meet the final vaccine performance design metric v, reduce cc
[0146] At Step 7, repeat the process starting at Step 1 retaining the current vaccine design D and current candidate set until either a desired vaccine performance metric objective is reached
threshold. If cc <cm then return design set D as the final vaccine. If not, return to Step 1 and repeat all subsequent steps. [0147] In some embodiments, this procedure is repeated independently for each pathogen gene variant or target variant of interest, and the resulting independent vaccine sets can be merged into a combined vaccine. Methods for combining multiple vaccines [0148] The above-described methods will produce an optimized target peptide set (e.g., third peptide set) for one or more individual targets. In some embodiments, a method is provided for designing separate vaccines for MHC class I and class II-based immunity for multiple targets (e.g., two or more targets such as KRAS G12D and KRAS G12V). [0149] In some embodiments, a method is disclosed for producing a combined peptide vaccine for multiple targets by using a table of presentations for a disease that is based upon empirical data from sources such as the Cancer Genome Atlas (TCGA). FIG.10 shows one embodiment for factoring disease presentation type probabilities (e.g., pancreatic cancer, colorectal cancer, and skin cancer) by probability, for each disease presentation, of target presented for various mutation targets (e.g., KRAS G12D, KRAS G12V, and KRAS G12R). A presentation is a unique set of targets that are presented by one form of a disease (e.g., distinct type of cancer or cancer indication as shown in FIG.10). For each presentation, FIG.10 shows an example of the probability of that presentation, and the probability that a given target is observed. For a given presentation, there can be one or more targets, each having a probability. In some embodiments, the method for multi-target vaccine design will allocate peptide resources for inducing disease immunity based on the presentation and respective target probabilities as shown in FIG.10, for example. In some embodiments, presentations correspond to the prevalence of targets in different human populations or different risk groups. The probability of a target in a population is computed by summing for each possible presentation the probability of that presentation times the probability of the target in that presentation. FIG.10 shows weights used for merging individual vaccines for each target (row) into combined vaccines for each disease indication (column). Values indicate the observed fraction of cases containing each target mutation. Data are from The Cancer Genome Atlas (TCGA). For each disease indication, TCGA data are filtered to cases where the Primary Site is the indication.
[0150] In some embodiments, the same vaccine design will be generated for mutations to different proteins when the base peptides generated by the mutations to the different proteins are identical. For example, in some embodiments of base peptide selection the following mutations have identical vaccine designs because they share the same set of base peptides: HRAS Q61K, NRAS Q61K, and KRAS Q61K; HRAS Q61L, NRAS Q61L, and KRAS Q61L; HRAS Q61R, NRAS Q61R, and KRAS Q61R. Referring to FIG.10, in some embodiments, when two mutations have identical individual vaccine designs, their presentation specific probabilities are added when weighting the individual vaccine design for inclusion in a combined vaccine as described below (e.g., for Thyroid Cancer NRAS Q61R and HRAS Q61R). [0151] Referring to FIG.11, in some embodiments, the method first includes designing an individual peptide vaccine for each target to create a combined vaccine design for multiple targets. This initially results in sets of target-specific vaccine designs. In some embodiments, the marginal predicted vaccine performance of each target-specific vaccine at size k is defined by predicted vaccine performance at size k minus the predicted vaccine performance of the vaccine at size k minus one. The composition of a vaccine may change as the number of peptides used in the vaccine increases, and thus for computing contributions to a combined vaccine, the marginal predicted vaccine performance of each target-specific vaccine is used instead of a specific set of peptides. [0152] In some embodiments, the weighted marginal predicted vaccine performance of a target-specific vaccine design for each target specific vaccine size is computed as shown in FIG. 11. For a given target specific vaccine size, its weighted predicted vaccine performance is computed by multiplying its predicted vaccine performance times the probability of the target in the population (e.g., by using values as shown in FIG.10). The marginal weighted predicted vaccine performance for a target specific vaccine is its weighted coverage at size k minus its coverage a size k minus one. The marginal weighted predicted vaccine performance of a target specific vaccine of size one is its weighted predicted vaccine performance. The marginal weighted predicted vaccine performances for all vaccines are combined into a single list, and the combined list is sorted from largest to least by the weighted marginal predicted vaccine performances of the target specific vaccines as shown in FIG.11. The combined vaccine of size n is then determined by the first n elements of this list. The peptides for the combined vaccine are determined by the individual peptide target vaccines whose sizes add to n and whose weighted predicted vaccine performances sums to the same sum as the first n elements of the
sorted list. This maximizes the predicted vaccine performance of the combined vaccine of size n. [0153] In some embodiments, the combined multiple target vaccine can be designed on its overall predicted coverage for the disease described depending on the presentation table used (e.g., see FIG.10), by its predicted coverage for a specific indication, and/or by its predicted coverage for a specific target by adjusting the weighting used for predicted vaccine performance accordingly. Once a desired level of coverage is selected, the peptides of the combined vaccine are determined by the contributions of target-specific designs. For example, if the combined vaccine includes a target-specific vaccine of size k, then the vaccine peptides for this target at size k are used in the combined vaccine. [0154] As an example of one embodiment, FIG.10 shows mutations (e.g., KRAS G12D, G12V, and G12R) and their respective probabilities of occurring in an individual with different cancer indications (e.g., pancreatic cancer). FIG.3 (MHC class I) and FIG.4 (MHC class II) show the population coverage of target-specific vaccines for the KRAS G12D, G12V, G12R, G12C, and G13D targets using the methods for vaccines described herein. The marginal population coverage of each target-specific vaccine at a given vaccine size is the improvement in coverage at that size and the size minus one. The coverage with no peptides is zero. The marginal coverage of each target-specific vaccine is multiplied by the probability of the target in the population as determined by the proportions as shown in FIG.10 for a selected indication (e.g., pancreatic cancer). These weighted marginal coverages of all target-specific vaccines are sorted to determine the best target-specific compositions, and the resulting list describes the composition of a combined vaccine for the selected indication at each size k by taking the first k elements of the list. As an example of one embodiment, FIG.s 12 (MHC Class I) and FIG.13 (MHC Class II) show the target specific contributions at each vaccine size for a combined KRAS vaccine for the three mutations KRAS G12D, G12V, and G12R. The methods for combined vaccine protocol described herein was used to compute the examples in FIG.12 and 13. At each combined vaccine size, different components of the target-specific vaccines are utilized for the indication illustrated. Table 1 (below) contains the peptides present in independent (single target) and combined (multiple target) MHC class I vaccine designs for the KRAS G12D, G12V, G12R, G12C, and G13D targets. Table 2 (below) contains the contains the peptides present in independent (single target) MHC class II vaccine designs for the KRAS G12D, G12V, G12R, G12C, and G13D targets, and any subset of the individual/single target
vaccines can be combined to create an MHC class II vaccine for two or more multiple targets. For alternate embodiments, Sequence Listing provides heteroclitic peptides useful in MHC class I vaccines for the KRAS G12D, G12V, G12R, G12C, and G13D targets. Combined Vaccine Design Procedure [0155] In some embodiments, the procedure described herein is used to combine individual compact vaccines optimized for different targets into a single optimized combined vaccine. [0156] In some embodiments, the computational inputs for the procedure are:
[0157] At Step 1, for each target t (individually) compute optimized vaccines of sizes 1 to m as the sets where k denotes the size of the vaccine and then compute their vaccine performance at each vaccine size. For each target t (individually) and vaccine size (peptide count) k, the unweighted population coverage is computed:
In some embodiments, for each target, t, is generally monotonically increasing and concave down for increasing values of k (each additional peptide increases coverage but with decreasing returns).
[0158] At Step 2, vaccine marginal performance is computed and weighted by each target’s prevalence weight. For each target t (individually), the marginal coverage
is computed of the k-th peptide added to the vaccine set:
In some embodiments, for each target t, mt,k should be a monotonically decreasing function in k (by Step 1 above). [0159] The weighted marginal population coverage ^^^௧,^ is computed using weights of each target in W:
The weighted marginal population coverage gives the effective marginal coverage of the k-th peptide in the vaccine weighted by the prevalence of the target in the presentation (by multiplication with the probability/weight of the target in the presentation). [0160] At Step 3, the weighted vaccine performances are merged for all targets to produce combined vaccine designs at each peptide count. The individual vaccines are combined into a combined vaccine via the MERGEMULTI procedure called on the weighted marginal population coverage lists
FIG.14 shows an example Python implementation of the MERGEMULTI function. This procedure takes as input multiple sorted (descending) lists and merges them into a single sorted (descending) list. Let M indicate the output of MERGEMULTI where each element Mk contains both the marginal weighted coverage and source (target) of the k-th peptide in the combined vaccine. The combined vaccine contains peptides from different targets. In particular, the combined vaccine with k peptides contains
peptides from target t. gives the distribution of the k
peptides in the combined vaccine across the targets). [0161] At Step 4, a vaccine with a desired performance is selected. The final vaccine size k can vary based upon the specific population coverage goals of the vaccine. The marginal weighted coverage values of the combined vaccine Mk can be cumulatively summed over k to give the overall effective (target-weighted) population coverage of the combined vaccine containing k peptides as (taking into account both the probabilities/weights of the
targets in the presentation and the expected population coverage of peptides based on HLA display). [0162] At Step 5, the vaccine peptides corresponding to the target coverage is retrieved for the final vaccine size k. The optimal combined vaccine set for the final vaccine size k is defined as:
[0163] Thus, the combined vaccine with k peptides is the combination of the optimal individual MHC class I peptide sequences for RAS vaccines [0164] In some embodiments, a peptide vaccine (single target or combined multiple target vaccine) comprises about five, ten, or twenty MHC class I peptides with each peptide consisting of 8 or more amino acids. In some embodiments, an MHC class I peptide vaccine is intended for one or more of the KRAS G12D, G12V, and G12R targets. In some embodiments, the amino acid sequence of a first peptide in a five-peptide combined vaccine comprises SEQ ID NO: 1. GADGVGKSM (SEQ ID NO: 1). In some embodiments, the amino acid sequence of a second peptide in a five-peptide combined vaccine comprises SEQ ID NO: 2. LMVVGADGV (SEQ ID NO: 2). In some embodiments, the amino acid sequence of a third peptide in a five-peptide combined vaccine comprises SEQ ID NO: 3. GAVGVGKSL (SEQ ID NO: 3). In some embodiments, the amino acid sequence of a fourth peptide in a five-peptide combined vaccine comprises SEQ ID NO: 4. LMVVGAVGV (SEQ ID NO: 4). In some embodiments, the amino acid sequence of a fifth peptide in a five-peptide combined vaccine comprises SEQ ID NO: 5. VTGARGVGK (SEQ ID NO: 5). An example combined vaccine for the KRAS G12D, G12V, and G12R targets with five peptides (SEQ ID NO: 1 to SEQ ID NO: 5) is predicted to have a weighted population coverage of 0.3620. [0165] In some embodiments, any one of the peptides (peptides 1-5) in the five-peptide vaccine comprise an amino acid sequence 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98 or 99 % identical to SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, or SEQ ID NO: 5.
[0166] In some embodiments, the amino acid sequence of peptides 1 to 5 in a ten-peptide combined vaccine comprise SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, and SEQ ID NO: 5. In some embodiments, the amino acid sequence of a sixth peptide in a ten- peptide combined vaccine comprises SEQ ID NO: 6. VMGAVGVGK (SEQ ID NO: 6). In some embodiments, the amino acid sequence of a seventh peptide in a ten-peptide combined vaccine comprises SEQ ID NO: 7. VVGAVGVGK (SEQ ID NO: 7). In some embodiments, the amino acid sequence of an eight peptide in a ten-peptide combined vaccine comprises SEQ ID NO: 8. GARGVGKSY (SEQ ID NO: 8). In some embodiments, the amino acid sequence of a ninth peptide in a ten-peptide combined vaccine comprises SEQ ID NO: 9. GPRGVGKSA (SEQ ID NO: 9). In some embodiments, the amino acid sequence of a tenth peptide in a ten- peptide combined vaccine comprises SEQ ID NO: 10. LMVVGARGV (SEQ ID NO: 10). An example combined vaccine for the KRAS G12D, G12V, and G12R targets with ten peptides (SEQ ID NO: 1 to SEQ ID NO: 10) is predicted to have a weighted population coverage of 0.4374. [0167] In some embodiments, any one of the peptides (peptides 1-10) in the ten-peptide vaccine comprise an amino acid sequence 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98 or 99 % identical to SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO: 8, SEQ ID NO: 9, or SEQ ID NO: 10. [0168] In some embodiments, the amino acid sequence of peptides 1 to 10 in a twenty- peptide combined vaccine comprise SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO: 8, SEQ ID NO: 9, and SEQ ID NO: 10. In some embodiments, the amino acid sequence of an 11th peptide in a twenty- peptide combined vaccine comprises SEQ ID NO: 11. GADGVGKSL (SEQ ID NO: 11). In some embodiments, the amino acid sequence of a 12th peptide in a twenty-peptide combined vaccine comprises SEQ ID NO: 12. GADGVGKSY (SEQ ID NO: 12). In some embodiments, the amino acid sequence of a 13th peptide in a twenty-peptide combined vaccine comprises SEQ ID NO: 13. GYDGVGKSM (SEQ ID NO: 13). In some embodiments, the amino acid sequence of a 14th peptide in a twenty-peptide combined vaccine comprises SEQ ID NO: 14. GPVGVGKSV (SEQ ID NO: 14). In some embodiments, the amino acid sequence of a 15th peptide in a twenty-peptide combined vaccine comprises SEQ ID NO: 15. LTVVGAVGV (SEQ ID NO: 15). In some embodiments, the amino acid sequence of a 16th peptide in a twenty-
peptide combined vaccine comprises SEQ ID NO: 16. VVGAVGVGR (SEQ ID NO: 16). In some embodiments, the amino acid sequence of a 17th peptide in a twenty-peptide combined vaccine comprises SEQ ID NO: 17. GARGVGKSM (SEQ ID NO: 17). In some embodiments, the amino acid sequence of an 18th peptide in a twenty-peptide combined vaccine comprises SEQ ID NO: 18. GPRGVGKSV (SEQ ID NO: 18). In some embodiments, the amino acid sequence of a 19th peptide in a twenty-peptide combined vaccine comprises SEQ ID NO: 19. LLVVGARGV (SEQ ID NO: 19). In some embodiments, the amino acid sequence of a 20th peptide in a twenty-peptide combined vaccine comprises SEQ ID NO: 20. VAGARGVGM (SEQ ID NO: 20). An example combined vaccine for the KRAS G12D, G12V, and G12R targets with twenty peptides (SEQ ID NO: 1 to SEQ ID NO: 20) is predicted to have a weighted population coverage of 0.4604. [0169] In some embodiments, any one of the peptides (peptides 1-20) in the twenty-peptide vaccine comprise an amino acid sequence 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98 or 99 % identical to SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO: 8, SEQ ID NO: 9, and SEQ ID NO: 10, , SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, or SEQ ID NO: 20 [0170] Table 1 shows MHC class I peptide sequences described herein including the respective SEQ ID NO, amino acid sequence corresponding to the SEQ ID NO, KRAS protein target (with specific mutation), the seed amino acid sequence (i.e., the amino acid sequence of the wild type KRAS fragment), the amino acid substitution (if any) for heteroclitic peptides at positions 2 and 9, and notes detailing embodiments in which the peptide may be included in a 5, 10, or 20 combined peptide vaccine as described herein. Table 1 also includes additional peptide sequences comprising SEQ ID NOs: 21-41. In some embodiments, any combination of peptides listed in Table 1 (SEQ ID NOs: 1-41) may be used to create a combined peptide vaccine having between about 2 and about 40 peptides. In some embodiments, any one of the peptides (peptides 1-41; SEQ ID NOs: 1-41) in the combined vaccine comprises an amino acid sequence 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98 or 99 % identical to any of SEQ ID NOs: 1-41.
[0171] Additional amino acid sequences of MHC class I heteroclitic peptides are provided in Sequence Listings (SEQ ID NOs: 67-1522). In some embodiments, any combination of MHC class I peptides disclosed herein (SEQ ID NOs: 1-41, 67-1522, 1524-1536, and 1547-1549) may be used to create a combined peptide vaccine having between about 2 and about 40 peptides. In some embodiments, any one of the peptides (SEQ ID NOs: 1-41, 67-1522, 1524-1536, and 1547-1549) in the combined vaccine comprises or contains an amino acid sequence 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98 or 99 % identical to any of SEQ ID NOs: 1-41, 67-1522, 1524-1536, or 1547-1549. MHC class II peptide sequences for RAS vaccines [0172] In some embodiments, a peptide vaccine (single target or combined multiple target vaccine) comprises about 2 to 40 MHC class II peptides with each peptide consisting of about 20 amino acids. In some embodiments, an MHC class II peptide vaccine is intended for one or more of the KRAS G12D, G12V, G12R, G12C, and G13D targets. [0173] Table 2 summarizes MHC class II peptide sequences described herein including the respective SEQ ID NO, amino acid sequence corresponding to the SEQ ID NO, the amino acid sequence corresponding to the peptide’s binding core, the KRAS protein target (with specific mutation), the seed amino acid sequence (i.e., the amino acid sequence of the wild type KRAS fragment), the seed amino acid sequence of the binding core, and the amino acid substitution (if any) for heteroclitic peptides at positions 1, 4, 6, and 9. Table 2 includes peptide sequences comprising SEQ ID NOs: 42-66. SEQ ID NOs: 42-65 (Table 2) encode for recombinant peptides. In some embodiments, any combination of peptides listed in Table 2 (SEQ ID NOs: 42-66) may be used to create a single target (individual) or combined peptide vaccine having
42-66; SEQ ID NOs: 42-66) in the combined vaccine comprises an amino acid sequence 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98 or 99 % identical to any of SEQ ID NOs: 42-66. Table 2. Example RAS Vaccine Peptides (MHC class II)
[0174] In some embodiments, any combination of MHC class I and/or MHC class II peptides disclosed herein (SEQ ID NOs: 1-1522 and 1524-1549) may be used to create a single target (individual) or combined peptide vaccine having between about 2 and about 40 peptides. In some embodiments, any one of the peptides (peptides 1-1522 and 1524-1549; SEQ ID NOs: 1-1522 and 1524-1549) in the combined vaccine comprises an amino acid sequence 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98 or 99 % identical to any of SEQ ID NOs: 1-1522 or 1524-1549. RAS mRNA and DNA vaccines [0175] In some embodiments, vaccine peptides are encoded as mRNA or DNA molecules and are administered for expression in vivo as is known in the art. One example of the delivery of vaccines by mRNA is found in Kranz et al. (2016) and U.S. Patent No.8,637,006, incorporated herein by reference. In one embodiment, a construct comprises 10 peptides, including a five-peptide MHC class I combined pancreatic cancer vaccine (targets: KRAS G12D, G12V, G12R) and a five-peptide MHC class II combined pancreatic cancer vaccine (targets: KRAS G12D, G12V, G12R), as optimized by the procedure described herein. Peptides are prepended with a secretion signal sequence at the N-terminus and followed by an MHC class I trafficking signal (MITD) (Kreiter et al., 2008; Sahin et al., 2017; U.S. Patent No.8,637,006). The MITD has been shown to route antigens to pathways for HLA class I and class II presentation (Kreiter et al., 2008). Here we combine all peptides of each MHC class into a single construct using non-immunogenic glycine/serine linkers from Sahin et al. (2017), though it is also plausible to construct individual constructs containing single peptides with the same secretion and MITD signals as demonstrated by Kreiter et al. (2008).
[0176] In some embodiments, the amino acid sequence encoded by the mRNA vaccine comprises SEQ ID NO: 1523. Underlined amino acids correspond to the signal peptide (or leader) sequence. Bolded amino acids correspond to MHC class I (9 amino acids in length; 5 peptides) and MHC class II (13–25 amino acids in length; 5 peptides) peptide sequences. Italicized amino acids correspond to the trafficking signal. MRVTAPRTLILLLSGALALTETWAGSGGSGGGGSGGGADGVGKSMGGSGGGGSGGL MVVGADGVGGSGGGGSGGGAVGVGKSLGGSGGGGSGGLMVVGAVGVGGSGGGGS GGVTGARGVGKGGSGGGGSGGEYKFVVLGTVGHGKSGGSGGGGSGGEYKIVVAG NVGIGKSGGSGGGGSGGEYKFVVFGSDGAGKSGGSGGGGSGGMTEYKFVVSGADGI GKSALTGGSGGGGSGGMTEYKFVVIGNRGVGKSALTGGSLGGGGSGIVGIVAGLAVL AVVVIGAVVATVMCRRKSSGGKGGSYSQAASSDSAQGSDVSLTA (SEQ ID NO: 1523). [0177] In some embodiments, the vaccine is an mRNA vaccine comprising a nucleic acids sequence encoding the amino acid sequence consisting of SEQ ID NO: 1523. In some embodiments, the nucleic acid sequence of the mRNA vaccine encodes for an amino acid sequence 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98 or 99 % identical to SEQ ID NO: 1523. [0178] In some embodiments, the vaccine is a DNA vaccine comprising a nucleic acids sequence encoding the amino acid sequence consisting of SEQ ID NO: 1523. In some embodiments, the nucleic acid sequence of the DNA vaccine encodes for an amino acid sequence 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98 or 99 % identical to SEQ ID NO: 1523. [0179] In some embodiments, mRNA encoded vaccine peptides are used as the payload of a self-amplifying RNA vaccine. In one embodiment, the mRNA sequence encoding the vaccine peptides replaces one or more structural proteins of an infectious alphavirus particle as described in Geall et al. (2012) that is incorporated herein by reference. As is described by Geall et al. (2012), self-amplifying RNA vaccines can increase the efficiency of antigen production in vivo. [0180] In some embodiments, one or more MHC class I and/or MHC class II peptides disclosed herein (SEQ ID NO: 1-1522 and 1524-1549) can be encoded in one or more mRNA or DNA molecules and administered for expression in vivo. In some embodiments between about 2 and about 40 peptide sequences are encoded in one or more mRNA constructs. In some embodiments between about 2 and about 40 peptide sequences are encoded in one or more
DNA constructs (i.e., nucleic acids encoding the amino acids sequences comprising on or more of SEQ ID NOs: 1-1522 and 1524-1549). In some embodiments, the amino acid sequence of the mRNA vaccine or the nucleic acid sequence of the DNA vaccine encodes for an amino acid sequence 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98 or 99 % identical to any of SEQ ID NOs: 1-1522 or 1524-1549.
MHC class I peptide sequences for BCR-ABL vaccines
[0181] In some embodiments, a peptide vaccine (single target or combined multiple target vaccine) comprises about 1 to 40 MHC class I peptides with each peptide consisting of 8 or more amino acids. In some embodiments, an MHC class I peptide vaccine is intended for a BCR-ABL gene fusion. In some embodiments, the BCR-ABL gene fusion is selected from the group consisting of b3a2 and b2a2. In some embodiments, an MHC class I peptide vaccine is intended to prevent cancer. In some embodiments, an MHC class I peptide vaccine is intended to treat cancer. In some embodiments, an MHC class I peptide vaccine is intended to prevent chronic myelogenous leukemia (CML), acute lymphoblastic leukemia (ALL), acute myelogenous leukemia (AML), or breast invasive ductal carcinoma. In some embodiments, an MHC class I peptide vaccine is intended to treat chronic myelogenous leukemia (CML), acute lymphoblastic leukemia (ALL), acute myelogenous leukemia (AML), or breast invasive ductal carcinoma.
[0182] In some embodiments, the amino acid sequence vaccine for a MHC class I peptide vaccine for BCR-ABL comprises one or more of the SEQ ID NOs: 1550 to 1594. In some embodiments, any one of the peptides in the BCR-ABL vaccine comprise an amino acid sequence 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98 or 99 % identical to SEQ ID NOs: 1550 to 1594.
[0183] In some embodiments, the amino acid sequence vaccine for a MHC class I peptide vaccine for BCR-ABL comprises two or more of the SEQ ID NOs: 1550 to 1594. In some embodiments, any one of the peptides in the BCR-ABL vaccine comprise an amino acid sequence 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98 or 99 % identical to SEQ ID NOs: 1550 to 1594.
[0184] Table 3 shows MHC class I peptide sequences described herein including the respective SEQ ID NO, amino acid sequence corresponding to the SEQ ID NO, the seed amino acid sequence (i.e., the amino acid sequence of the wild type BCR-ABL protein fusion
fragment), the amino acid substitution (if any) for heteroclitic peptides at positions 2 and C (carboxyl terminus), and notes detailing embodiments in which the peptide may be included in a combined peptide vaccine as described herein. SEQ ID NOs: 1550-1582, and 1594 are derived from BCR-ABL b3a2, while SEQ ID NOs: 1583-1593 are derived from BCR-ABL b2a2. In some embodiments, any combination of peptides listed in Table 3 (SEQ ID NOs: 1550 to 1594) may be used to create a combined peptide vaccine having between about 1 and about 40 peptides. In some embodiments, any one of the peptides (peptides 1550 to 1594; SEQ ID NOs: 1550 to 1594) in the combined vaccine comprises an amino acid sequence 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98 or 99 % identical to any of SEQ ID NOs: 1550 to 1594. [0185] In some embodiments, any combination of the peptides listed in Table 3 in the “b3a2 Vaccine” column (SEQ ID NOs: 1550 to 1582 and SEQ ID NO: 1594) may be used to create a combined peptide vaccine having between about 1 and about 40 peptides. In some embodiments, any one of these peptides in the combined vaccine comprises an amino acid sequence 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98 or 99 % identical to the peptides listed in Table 3 in the “b3a2 Vaccine” column (SEQ ID NOs: 1550 to 1582 and SEQ ID NO: 1594). [0186] In some embodiments, any combination of the peptides listed in Table 3 in the "b2a2 Vaccine" column (SEQ ID NOs: 1583 to 1593) may be used to create a combined peptide vaccine having between about 1 and about 40 peptides. In some embodiments, any one of these peptides in the combined vaccine comprises an amino acid sequence 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98 or 99 % identical to the peptides listed in Table 3 in the "b2a2 Vaccine" column (SEQ ID NOs: 1583 to 1593). [0187] Additional amino acid sequences of MHC class I vaccine peptides are provided in Sequence Listings (SEQ ID NOs: 1662 to 2249). In some embodiments, any combination of MHC class I peptides disclosed herein (SEQ ID NOs: 1550 to 1594 and SEQ ID NOs: 1662 to 2249) may be used to create a combined peptide vaccine having between about 1 and about 40 peptides. In some embodiments, any one of the peptides (SEQ ID NOs: 1550 to 1594 and SEQ ID NOs: 1662 to 2249) in the combined vaccine comprises or contains an amino acid sequence 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98 or 99 % identical to any of SEQ ID NOs: 1550 to 1594 or SEQ ID NOs: 1662 to 2249.
Table 3. Example BCR-ABL Vaccine Peptides (MHC class I)
MHC class II peptide sequences for BCR-ABL vaccines [0188] In some embodiments, a peptide vaccine (single target or combined multiple target vaccine) comprises about 1 to 40 MHC class II peptides with each peptide consisting of about 20 amino acids. In some embodiments, an MHC class II peptide vaccine is intended for a BCR- ABL gene fusion. In some embodiments, the BCR-ABL gene fusion is selected from the group consisting of b3a2 and b2a2. In some embodiments, an MHC class II peptide vaccine is intended to prevent cancer. In some embodiments, an MHC class II peptide vaccine is intended to treat cancer. In some embodiments, an MHC class I peptide vaccine is intended to prevent chronic myelogenous leukemia (CML), acute lymphoblastic leukemia (ALL), acute myelogenous leukemia (AML), or breast invasive ductal carcinoma. In some embodiments, an MHC class I peptide vaccine is intended to treat chronic myelogenous leukemia (CML), acute lymphoblastic leukemia (ALL), acute myelogenous leukemia (AML), or breast invasive ductal carcinoma. [0189] In some embodiments, the amino acid sequence vaccine for a MHC class II peptide vaccine for BCR-ABL comprises one or more of the SEQ ID NOs: 1595 to 1661. In some embodiments, any one of the peptides in the BCR-ABL vaccine comprise an amino acid sequence 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98 or 99 % identical to SEQ ID NOs: 1595 to 1661. [0190] In some embodiments, the amino acid sequence vaccine for a MHC class II peptide
embodiments, any one of the peptides in the BCR-ABL vaccine comprise an amino acid sequence 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98 or 99 % identical to SEQ ID NOs: 1595 to 1661. [0191] Table 4 summarizes MHC class II peptide sequences described herein including the respective SEQ ID NO, amino acid sequence corresponding to the SEQ ID NO, the amino acid sequence corresponding to the peptide's binding core, the seed amino acid sequence (i.e., the amino acid sequence of the wild type BCR-ABL protein fusion fragment), the seed amino acid sequence of the binding core, and the amino acid substitution (if any) for heteroclitic peptides at positions 1, 4, 6, and 9. Table 4 includes peptide sequences comprising SEQ ID NOs: 1595 to 1661. SEQ ID NOs: 1595 to 1661 (Table 4) encode for recombinant peptides. SEQ ID NOs: 1595-1627 are derived from BCR-ABL b3a2, while SEQ IS NOs: 1628-1661 are derived from BCR-ABL b2a2. In some embodiments, any combination of peptides listed in Table 4 (SEQ ID NOs: 1595 to 1661) may be used to create a single target (individual) or combined peptide vaccine having between about 1 and about 40 peptides. In some embodiments, any one of the peptides (peptides 1595 to 1661; SEQ ID NOs: 1595 to 1661) in the combined vaccine comprises an amino acid sequence 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98 or 99 % identical to any of SEQ ID NOs: 1595 to 1661. [0192] In some embodiments, any combination of the peptides listed in Table 4 in the “b3a2 Vaccine” column (SEQ ID NOs: 1595 to 1627) may be used to create a combined peptide vaccine having between about 1 and about 40 peptides. In some embodiments, any one of these peptides in the combined vaccine comprises an amino acid sequence 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98 or 99 % identical to the peptides listed in Table 4 in the "b3a2 Vaccine" column (SEQ ID NOs: 1595 to 1627). [0193] In some embodiments, any combination of the peptides listed in Table 4 in the “b2a2 Vaccine” column (SEQ ID NOs: 1628 to 1661) may be used to create a combined peptide vaccine having between about 1 and about 40 peptides. In some embodiments, any one of these peptides in the combined vaccine comprises an amino acid sequence 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98 or 99 % identical to the peptides listed in Table 4 in the "b2a2 Vaccine" column (SEQ ID NOs: 1628 to 1661). [0194] Additional amino acid sequences of MHC class II vaccine peptides are provided in Sequence Listings (SEQ ID NOs: 2250 to 63661) In some embodiments any combination of
MHC class II peptides disclosed herein (SEQ ID NOs: 1595 to 1661 and SEQ ID NOs: 2250 to 63661) may be used to create a combined peptide vaccine having between about 1 and about 40 peptides. In some embodiments, any one of the peptides (SEQ ID NOs: 1595 to 1661 and SEQ ID NOs: 2250 to 63661) in the combined vaccine comprises or contains an amino acid sequence 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98 or 99 % identical to any of SEQ ID NOs: 1595 to 1661 or SEQ ID NOs: 2250 to 63661. [0195] In some embodiments, any combination of MHC class I and/or MHC class II peptides disclosed herein (SEQ ID NOs: 1550 to 63662) may be used to create a single target (individual) or combined peptide vaccine having between about 2 and about 40 peptides. In some embodiments, any one of the peptides (peptides 1550 to 63662; SEQ ID NOs: 1550 to 63662) in the combined vaccine comprises an amino acid sequence 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98 or 99 % identical to any of SEQ ID NOs: 1550 to 63662. Table 4. Example BCR-ABL Vaccine Peptides (MHC class II)
BCR-ABL mRNA and DNA vaccines [0196] In one embodiment, a construct comprises 20 peptides, including a ten-peptide MHC class I BCR-ABL b3a2 vaccine and a ten-peptide MHC class II BCR-ABL b3a2 vaccine, as optimized by the procedure described herein. Peptides are prepended with a secretion signal sequence at the N-terminus and followed by an MHC class I trafficking signal (MITD) (See Kreiter et al., 2008; Sahin et al., 2017; U.S. Patent No.8,637,006, incorporated by reference in their entireties herein). The MITD has been shown to route antigens to pathways for HLA class I and class II presentation (Kreiter et al., 2008). Here we combine all peptides of each MHC class into a single construct using non-immunogenic glycine/serine linkers from Sahin et al. (2017), though it is also plausible to construct individual constructs containing single peptides with the same secretion and MITD signals as demonstrated by Kreiter et al. (2008). [0197] In some embodiments, the amino acid sequence encoded by the mRNA vaccine comprises SEQ ID NO: 63662. Underlined amino acids correspond to the signal peptide (or leader) sequence. Bolded amino acids correspond to MHC class I (8–11 amino acids in length; 10 peptides) and MHC class II (13–25 amino acids in length; 10 peptides) peptide sequences. Italicized amino acids correspond to the trafficking signal. In alternate embodiments, any number and variation of peptide sequences disclosed herein can be included in an mRNA vaccine comprising the signal peptide sequence and the trafficking signal as shown in SEQ ID NO: 63662 below. MRVTAPRTLILLLSGALALTETWAGSGGSGGGGSGGAMGFKQSSKGGSGGGGSGGGY KQSSKAMGGSGGGGSGGKQLQRPVASDYGGSGGGGSGGKTLQRPVASDWGGSGGG GSGGKYSSKALQRGGSGGGGSGGSAKALQRPMGGSGGGGSGGSAKALQRPYGGSG GGGSGGSTKALQRPLGGSGGGGSGGSTTGFKQSSKGGSGGGGSGGSTTGFKQSSRG GSGGGGSGGLNVIVHSATGIKQISAALIRPVASDGGSGGGGSGGLNVIVHSATGIKQI SSALIRPVASDGGSGGGGSGGSATGFFQSKKFLQVPVASDFGGSGGGGSGGSATGFK QFSIALRRPVASDFGGSGGGGSGGSATGFKQISRALSRPVASDFGGSGGGGSGGSATG FKQSSFALIRPVASDFGGSGGGGSGGSATGFKQSSRALSRAVANDFGGSGGGGSGGS ATGFNQSAKVLQAPVASDFGGSGGGGSGGYGFLNVIVHSATGFKQTSFALNRPVGG
SGGGGSGGYGFLNVIVHSATGIKQASNALARPVGGSLGGGGSGIVGIVAGLAVLAVVVI GAVVATVMCRRKSSGGKGGSYSQAASSDSAQGSDVSLTA (SEQ ID NO: 63662). [0198] In some embodiments, the vaccine is an mRNA vaccine comprising a nucleic acids sequence encoding the amino acid sequence consisting of SEQ ID NO: 63662. In some embodiments, the nucleic acid sequence of the mRNA vaccine encodes for an amino acid sequence 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98 or 99 % identical to SEQ ID NO: 63662. [0199] In some embodiments, the vaccine is a DNA vaccine comprising a nucleic acids sequence encoding the amino acid sequence consisting of SEQ ID NO: 63662. In some embodiments, the nucleic acid sequence of the DNA vaccine encodes for an amino acid sequence 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98 or 99 % identical to SEQ ID NO: 63662. [0200] In some embodiments, one or more MHC class I and/or MHC class II peptides disclosed herein (SEQ ID NO: 1550 to 63662) can be encoded in one or more mRNA or DNA molecules and administered for expression in vivo. In some embodiments, between about 2 and about 40 peptide sequences are encoded in one or more mRNA constructs. In some embodiments, between about 2 and about 40 peptide sequences are encoded in one or more DNA constructs (i.e., nucleic acids encoding the amino acids sequences comprising on or more of SEQ ID NOs: 1550 to 63662). In some embodiments, the amino acid sequence of the mRNA vaccine or the nucleic acid sequence of the DNA vaccine encodes for an amino acid sequence 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98 or 99 % identical to any of SEQ ID NOs: 1550 to 63662. Non-limiting embodiments of the subject matter [0201] In one aspect, the invention provides for a nucleic acid sequence encoding at least two amino acid sequences selected from the group consisting of SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO: 20, SEQ ID NO: 21, SEQ ID NO: 22, SEQ ID NO: 23, SEQ ID NO: 24, SEQ ID NO: 25, SEQ ID NO: 26, SEQ ID NO: 27, SEQ ID NO: 28, SEQ ID NO: 30, SEQ ID NO: 31, SEQ ID NO: 32, SEQ ID NO: 33,
SEQ ID NO: 34, SEQ ID NO: 35, SEQ ID NO: 36, SEQ ID NO: 37, SEQ ID NO: 38, SEQ ID NO: 39, SEQ ID NO: 40, and SEQ ID NO: 41. [0202] In some embodiments, the nucleic acid sequence is an immunogenic composition. In some embodiments, the nucleic acid sequence is administered in a construct for expression in vivo. In some embodiments, the in vivo administration of the nucleic acid sequence is configured to produce at least one peptide that is displayed by an HLA class I molecule. In some embodiments, the at least one peptide is a modified or an unmodified fragment of a mutated KRAS protein. In some embodiments, the mutated KRAS protein is selected from the group consisting of KRAS G12D, KRAS G12V, KRAS G12R, KRAS G12C, and KRAS G13D. In some embodiments, the nucleic acid sequence is administered in an effective amount to a subject to prevent cancer. In some embodiments, the nucleic acid sequence is administered in an effective amount to a subject to treat cancer. [0203] In another aspect, the invention provides for an immunogenic peptide composition comprising at least two peptides selected from the group consisting of SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO: 20, SEQ ID NO: 21, SEQ ID NO: 22, SEQ ID NO: 23, SEQ ID NO: 24, SEQ ID NO: 25, SEQ ID NO: 26, SEQ ID NO: 27, SEQ ID NO: 28, SEQ ID NO: 30, SEQ ID NO: 31, SEQ ID NO: 32, SEQ ID NO: 33, SEQ ID NO: 34, SEQ ID NO: 35, SEQ ID NO: 36, SEQ ID NO: 37, SEQ ID NO: 38, SEQ ID NO: 39, SEQ ID NO: 40, and SEQ ID NO: 41. [0204] In some embodiments at least one peptide of the at least two peptides is displayed by an HLA class I molecule in a subject. In some embodiments, at least one peptide of the at least two peptides is a modified or unmodified fragment of a mutated KRAS protein. In some embodiments, the mutated KRAS protein is selected from the group consisting of KRAS G12D, KRAS G12V, KRAS G12R, KRAS G12C, and KRAS G13D. In some embodiments, the immunogenic peptide composition is administered in an effective amount to a subject to prevent cancer. In some embodiments, the immunogenic peptide composition is administered in an effective amount to a subject to treat cancer. In some embodiments, the immunogenic peptide composition comprises at least three peptides selected from the group consisting of SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO:
NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO: 20, SEQ ID NO: 21, SEQ ID NO: 22, SEQ ID NO: 23, SEQ ID NO: 24, SEQ ID NO: 25, SEQ ID NO: 26, SEQ ID NO: 27, SEQ ID NO: 28, SEQ ID NO: 30, SEQ ID NO: 31, SEQ ID NO: 32, SEQ ID NO: 33, SEQ ID NO: 34, SEQ ID NO: 35, SEQ ID NO: 36, SEQ ID NO: 37, SEQ ID NO: 38, SEQ ID NO: 39, SEQ ID NO: 40, and SEQ ID NO: 41. [0205] In another aspect, the invention provides for a nucleic acid sequence encoding at least one amino acid sequence selected from the group consisting of SEQ ID NO: 42, SEQ ID NO: 43, SEQ ID NO: 44, SEQ ID NO: 45, SEQ ID NO: 46, SEQ ID NO: 47, SEQ ID NO: 48, SEQ ID NO: 49, SEQ ID NO: 50, SEQ ID NO: 51, SEQ ID NO: 52, SEQ ID NO: 53, SEQ ID NO: 54, SEQ ID NO: 55, SEQ ID NO: 56, SEQ ID NO: 57, SEQ ID NO: 58, SEQ ID NO: 59, SEQ ID NO: 60, SEQ ID NO: 61, SEQ ID NO: 62, SEQ ID NO: 63, SEQ ID NO: 64, and SEQ ID NO: 65. [0206] In some embodiments, the nucleic acid sequence is an immunogenic composition. In some embodiments, the nucleic acid sequence is administered in a construct for expression in vivo. In some embodiments, the in vivo administration of the nucleic acid sequence is configured to produce at least one peptide that is displayed by an HLA class II molecule. In some embodiments, the at least one peptide is a modified fragment of a mutated KRAS protein. In some embodiments, the mutated KRAS protein is selected from the group consisting of KRAS G12D, KRAS G12V, KRAS G12R, KRAS G12C, and KRAS G13D. In some embodiments, the nucleic acid sequence is administered in an effective amount to a subject to prevent cancer. In some embodiments, the nucleic acid sequence is administered in an effective amount to a subject to treat cancer. [0207] In another aspect, the invention provides for an immunogenic peptide composition comprising at least one peptide selected from the group consisting of SEQ ID NO: 42, SEQ ID NO: 43, SEQ ID NO: 44, SEQ ID NO: 45, SEQ ID NO: 46, SEQ ID NO: 47, SEQ ID NO: 48, SEQ ID NO: 49, SEQ ID NO: 50, SEQ ID NO: 51, SEQ ID NO: 52, SEQ ID NO: 53, SEQ ID NO: 54, SEQ ID NO: 55, SEQ ID NO: 56, SEQ ID NO: 57, SEQ ID NO: 58, SEQ ID NO: 59, SEQ ID NO: 60, SEQ ID NO: 61, SEQ ID NO: 62, SEQ ID NO: 63, SEQ ID NO: 64, and SEQ ID NO: 65. [0208] In some embodiments, at least one peptide in the immunogenic peptide composition is displayed by an HLA class II molecule In some embodiments at least one peptide in the
immunogenic peptide composition is a modified or an unmodified fragment of a mutated KRAS protein. In some embodiments, the mutated KRAS protein is selected from the group consisting of KRAS G12D, KRAS G12V, KRAS G12R, KRAS G12C, and KRAS G13D. In some embodiments, the immunogenic peptide composition is administered in an effective amount to a subject to prevent cancer. In some embodiments, the immunogenic peptide composition is administered in an effective amount to a subject to treat cancer. In some embodiments, the immunogenic peptide composition comprises at least two peptides selected from the group consisting of SEQ ID NO: 42, SEQ ID NO: 43, SEQ ID NO: 44, SEQ ID NO: 45, SEQ ID NO: 46, SEQ ID NO: 47, SEQ ID NO: 48, SEQ ID NO: 49, SEQ ID NO: 50, SEQ ID NO: 51, SEQ
ID NO: 52, SEQ ID NO: 53, SEQ ID NO: 54, SEQ ID NO: 55, SEQ ID NO: 56, SEQ ID NO:
57, SEQ ID NO: 58, SEQ ID NO: 59, SEQ ID NO: 60, SEQ ID NO: 61, SEQ ID NO: 62, SEQ
ID NO: 63, SEQ ID NO: 64, and SEQ ID NO: 65.
[0209] In another aspect, the invention provides for method of forming an immunogenic peptide composition, the method comprising using a processor to perform the steps of creating a first peptide set by selecting a plurality of unmodified peptide sequences, wherein at least one peptide sequence of the plurality of unmodified peptide sequences is associated with a tumor neoantigen or a self-protein, determining a plurality of peptide-HLA binding scores for each peptide sequence in the first peptide set, determining whether each peptide sequence in the first peptide set has a peptide-HLA binding score that passes a first threshold with respect to at least three HLA alleles, creating a second peptide set comprising the first peptide set and a plurality of modified peptide sequences, wherein each modified peptide sequence of the plurality of modified peptide sequences comprises a substitution of at least one amino acid residue of a peptide sequence in the first peptide set, determining whether each modified peptide sequence in the second peptide set has a peptide-HLA binding score that passes a second threshold with respect to the at least three HLA alleles, wherein the second threshold is more constrained than the first threshold; and creating a third peptide set by selecting a subset of the second peptide set, wherein the selecting comprises computing a population coverage, wherein the computing of the population coverage comprises excluding a peptide-HLA binding score with respect to a first HLA allele for a modified peptide sequence if a peptide-HLA binding score for an unmodified peptide sequence associated with the modified peptide sequence does not pass the first threshold with respect to the first HLA allele, and wherein the selected subset has a population coverage above a third threshold, performing an experimental assay to obtain a peptide-HLA immunogenicity metric for at least one peptide sequence in the third peptide set, and forming an
immunogenic peptide composition comprising the at least one peptide sequence in the third peptide set for which the experimental assay was performed. [0210] In some embodiments, selecting the plurality of unmodified peptide sequences to create the first peptide set comprises sliding a window of size n across an amino acid sequence encoding the tumor neoantigen or the self-protein, wherein n is between about 8 amino acids and about 25 amino acids in length, and wherein n is a length of each peptide sequence of the plurality of unmodified peptide sequences in the first peptide set. In some embodiments, each peptide sequence in the first peptide set binds to an HLA class I molecule or an HLA class II molecule. In some embodiments, the method further comprises filtering the first peptide set to exclude a peptide sequence with a predicted binding core that contains a target residue in an anchor position. In some embodiments, the method further comprises substituting at least one amino acid residue of each peptide sequence in the first peptide set, wherein for at least one peptide sequence in the first peptide set the at least one amino acid residue is in an anchor position. In some embodiments, the first threshold is a binding affinity of less than about 1000 nM. In some embodiments, the second threshold is a binding affinity of less than about 500 nM. In some embodiments, the population coverage is computed based on a frequency of an HLA haplotype in a human population. In some embodiments, the population coverage is computed based on a frequency of the at least three HLA alleles in a human population. In some embodiments, the plurality of unmodified peptide sequences is derived from the tumor neoantigen or the self-protein that is present in a subject. In some embodiments, the third threshold is a proportion of a human population of between about 0.7 and about 0.8. In some embodiments, the tumor neoantigen or the self-protein is associated with a cancer, and wherein the cancer is selected from the group consisting of pancreas, colon, rectum, kidney, bronchus, lung, uterus, cervix, bladder, liver, and stomach. [0211] In another aspect, the invention provides for method of forming an immunogenic peptide composition, the method comprising using a processor to perform the steps of creating a first peptide set by selecting a plurality of unmodified peptide sequences, wherein the plurality of unmodified peptide sequences is associated with a tumor neoantigen or a self-protein, determining a plurality of peptide-HLA immunogenicity metrics for each peptide sequence in the first peptide set, determining whether each peptide sequence in the first peptide set has a peptide-HLA immunogenicity metric that passes a first threshold with respect to at least three HLA alleles, creating a second peptide set comprising the first peptide set and a plurality of
modified peptide sequences, wherein each modified peptide sequence of the plurality of modified peptide sequences comprises a substitution of at least one amino acid residue of a peptide sequence in the first peptide set, determining whether each modified peptide sequence in the second peptide set has a peptide-HLA immunogenicity metric that passes a second threshold with respect to the at least three HLA alleles, wherein the second threshold is more constrained than the first threshold, and creating a third peptide set by selecting a subset of the second peptide set, wherein the selecting comprises computing a predicted vaccine performance, wherein the computing of the predicted vaccine performance comprises excluding a peptide- HLA immunogenicity metric with respect to a first HLA allele for a modified peptide sequence if a peptide-HLA immunogenicity metric for an unmodified peptide sequence associated with the modified peptide sequence does not pass the first threshold with respect to the first HLA allele, and wherein the predicted vaccine performance is a function of a non-excluded peptide- HLA immunogenicity metric of each peptide sequence in the third peptide set, performing an experimental assay to obtain a peptide-HLA immunogenicity metric for at least one peptide sequence in the third peptide set, and forming an immunogenic peptide composition comprising the at least one peptide sequence in the third peptide set for which the experimental assay was performed. [0212] In some embodiments, selecting the plurality of unmodified peptide sequences to create the first peptide set comprises sliding a window of size n across an amino acid sequence encoding the tumor neoantigen or the self-protein, wherein n is between about 8 amino acids and about 25 amino acids in length, and wherein n is a length of each peptide sequence of the plurality of unmodified peptide sequences in the first peptide set. In some embodiments, each peptide sequence in the first peptide set binds to an HLA class I molecule or an HLA class II molecule. In some embodiments, the method further comprises filtering the first peptide set to exclude a peptide sequence with a predicted binding core that contains a target residue in an anchor position. In some embodiments, the method further comprises substituting at least one amino acid residue of each peptide sequence in the first peptide set, wherein for at least one peptide sequence in the first peptide set the at least one amino acid residue is in an anchor position. In some embodiments, the first threshold is a binding affinity of less than about 1000 nM. [0213] In another aspect, the invention provides for method of forming an immunogenic peptide composition, the method comprising using a processor to perform the steps of creating a
first peptide set by selecting a plurality of unmodified peptide sequences, wherein each peptide sequence of the plurality of unmodified peptide sequences is associated with a tumor neoantigen or a self-protein, determining a plurality of peptide-HLA binding scores for each peptide sequence in the first peptide set, determining whether each peptide sequence in the first peptide set has a peptide-HLA binding score that passes a first threshold with respect to at least three HLA alleles creating a second peptide set comprising the first peptide set and a plurality of modified peptide sequences, wherein each modified peptide sequence of the plurality of modified peptide sequences comprises a substitution of at least one amino acid residue of a peptide sequence in the first peptide set, determining whether each modified peptide sequence in the second peptide set has a peptide-HLA binding score that passes a second threshold with respect to the at least three HLA alleles, wherein the second threshold is more constrained than the first threshold, and creating a third peptide set by selecting a subset of the second peptide set, wherein the selecting comprises computing a predicted vaccine performance, wherein the computing of the predicted vaccine performance comprises excluding a peptide-HLA binding score with respect to a first HLA allele for a modified peptide sequence if a peptide-HLA binding score for an unmodified peptide sequence associated with the modified peptide sequence does not pass the first threshold with respect to the first HLA allele, and wherein the predicted vaccine performance is a function of a non-excluded peptide-HLA binding score of each peptide sequence in the third peptide set, performing an experimental assay to obtain a peptide-HLA immunogenicity metric for at least one peptide sequence in the third peptide set, and forming an immunogenic peptide composition comprising the at least one peptide sequence in the third peptide set for which the experimental assay was performed.
[0214] In some embodiments, the second threshold is based on data obtained from one or more experimental assays. In some embodiments, the predicted vaccine performance is further a function of a peptide-HLA immunogenicity metric of at least one modified peptide sequence of the plurality of modified peptide sequences bound to a second HLA allele of the at least three HLA alleles if a first peptide sequence in the first peptide set is predicted to be bound to the second HLA allele of the at least three HLA alleles with a first binding core, wherein the first binding core is a binding core of the first peptide sequence, wherein the first binding core is identical to a second binding core, wherein the first binding core and the second binding core comprise an amino acid position within a peptide sequence, and wherein the second binding core is a binding core of the at least one modified peptide sequence of the plurality of modified peptide sequences bound to the second HLA allele.
[0215] In another aspect, the invention provides for method of forming an immunogenic peptide composition, the method comprising using a processor to perform the steps of creating a first peptide set by selecting a plurality of unmodified peptide sequences, wherein at least one peptide sequence of the plurality of unmodified peptide sequences is associated with a tumor neoantigen or a self-protein; determining a plurality of peptide-HLA binding scores for each peptide sequence in the first peptide set, determining whether each peptide sequence in the first peptide set has a peptide-HLA binding score that passes a first threshold with respect to at least three HLA alleles, creating a second peptide set comprising the first peptide set and a plurality of modified peptide sequences, wherein each modified peptide sequence of the plurality of modified peptide sequences comprises a substitution of at least one amino acid residue of a peptide sequence in the first peptide set, determining whether each modified peptide sequence in the second peptide set has a peptide-HLA binding score that passes a second threshold with respect to the at least three HLA alleles, wherein the second threshold is more constrained than the first threshold, creating a third peptide set by selecting a subset of the second peptide set, wherein the selecting comprises computing a predicted vaccine performance based on an HLA type of a subject, and wherein the computing of the predicted vaccine performance comprises excluding a peptide-HLA binding score with respect to a first HLA allele for a modified peptide sequence if a peptide-HLA binding score for an unmodified peptide sequence associated with the modified peptide sequence does not pass the first threshold with respect to the first HLA allele, performing an experimental assay to obtain a peptide-HLA immunogenicity metric for at least one peptide sequence in the third peptide set, and forming an immunogenic peptide composition comprising the at least one peptide sequence in the third peptide set for which the experimental assay was performed.
[0216] In some embodiments, each peptide sequence in the first peptide set binds to an HLA class I molecule or an HLA class II molecule. In some embodiments, the plurality of unmodified peptide sequences is derived from the tumor neoantigen or the self-protein that is present in a subject. In some embodiments, the at least three HLA alleles are present in the HLA type of a subject. In some embodiments, the plurality of unmodified peptide sequences is derived from the tumor neoantigen or the self-protein that is present in a subject. In some embodiments, the plurality of unmodified peptide sequences is derived from the tumor neoantigen or the self-protein that is present in the subject. In some embodiments, the immunogenic peptide composition comprises nucleic acid sequences encoding an amino acid sequence of the at least one peptide sequence in the third peptide set. In some embodiments, the
immunogenic peptide composition comprises nucleic acid sequences encoding an amino acid sequence of the at least one peptide sequence in the third peptide set. [0217] In another aspect, the invention provides for a composition comprising nucleic acid sequences encoding at least two amino acid sequences selected from the group consisting of SEQ ID NOs: 1550 to 1593. [0218] In some embodiments, the composition is immunogenic. In some embodiments, the nucleic acid sequences are administered in a construct for expression in vivo. In some embodiments, the in vivo administration of the nucleic acid sequence is configured to produce at least one peptide that is displayed by an HLA class I molecule. In some embodiments, the at least one peptide is a modified or an unmodified fragment of a BCL-ABL gene fusion. In some embodiments, the BCR-ABL gene fusion is b3a2 or b2a2. In some embodiments, the nucleic acid sequences are administered in an effective amount to a subject to prevent cancer. In some embodiments, the nucleic acid sequences are administered in an effective amount to a subject to treat cancer. [0219] In another aspect, the invention provides for a composition comprising at least two peptides selected from the group consisting of SEQ ID NOs: 1550 to 1593. [0220] In some embodiments, at least one peptide of the at least two peptides is displayed by an HLA class I molecule in a subject. In some embodiments, at least one peptide in the immunogenic peptide composition is a modified or an unmodified fragment of a BCL-ABL gene fusion. In some embodiments, the BCR-ABL gene fusion is b3a2 or b2a2. In some embodiments, the immunogenic peptide composition is administered in an effective amount to a subject to prevent cancer. In some embodiments, the immunogenic peptide composition is administered in an effective amount to a subject to treat cancer. In some embodiments, the invention provides for a composition comprising at least two peptides selected from the group consisting of SEQ ID NOs: 1550 to 1593. [0221] In another aspect, the invention provides for a nucleic acid sequence encoding at least one amino acid sequence selected from the group consisting of SEQ ID NOs: 1595 to 1661. [0222] In some embodiments, the composition is immunogenic. In some embodiments, the nucleic acid sequence is administered in a construct for expression in vivo. In some embodiments the in vivo administration of the nucleic acid sequence is configured to produce at
least one peptide that is displayed by an HLA class II molecule. In some embodiments, the at least one amino acid sequence is derived from a modified fragment of a BCL-ABL gene fusion. In some embodiments, the BCR-ABL gene fusion is b3a2 or b2a2. In some embodiments, the nucleic acid sequence is administered in an effective amount to a subject to prevent cancer. In some embodiments, the nucleic acid sequence is administered in an effective amount to a subject to treat cancer. [0223] In another aspect, the invention provides for a nucleic acid sequence encoding at least one amino acid sequence selected from the group consisting of SEQ ID NOs: 1595 to 1661. [0224] In some embodiments, the at least one peptide is displayed by an HLA class II molecule in a subject. In some embodiments, the at least one peptide in the immunogenic peptide composition is a modified or an unmodified fragment of a BCL-ABL gene fusion. In some embodiments, the BCR-ABL gene fusion is b3a2 or b2a2. In some embodiments, the immunogenic peptide composition is administered in an effective amount to a subject to prevent cancer. In some embodiments, the immunogenic peptide composition is administered in an effective amount to a subject to treat cancer. In some embodiments, the immunogenic peptide composition comprises at least two peptides selected from the group consisting of SEQ ID NOs: 1595 to 1661. [0225] In another aspect, the invention provides a method of forming an immunogenic peptide composition, the method comprising using a processor to perform the steps of creating a first peptide set by selecting a plurality of unmodified peptide sequences, wherein at least one peptide sequence of the plurality of unmodified peptide sequences is associated with a tumor neoantigen, a pathogen proteome, or a self-protein, determining a plurality of peptide-HLA binding scores for each peptide sequence in the first peptide set, determining whether each peptide sequence in the first peptide set has a peptide-HLA binding score that passes a first threshold with respect to at least three HLA alleles, creating a second peptide set comprising the first peptide set and a plurality of modified peptide sequences, wherein each modified peptide sequence of the plurality of modified peptide sequences comprises a substitution of at least one amino acid residue of a peptide sequence in the first peptide set, determining a plurality of peptide-HLA binding scores for each peptide sequence in the second peptide set; and creating a third peptide set by selecting a subset of the second peptide set, wherein the selecting comprises computing a population coverage, wherein the computing of the population coverage comprises
sequence if a peptide-HLA binding score for an unmodified peptide sequence associated with the modified peptide sequence does not pass the first threshold with respect to the first HLA allele, and wherein the selected subset has a population coverage above a second threshold, performing an experimental assay to obtain a peptide-HLA immunogenicity metric for at least one peptide sequence of the third peptide set; and forming an immunogenic peptide composition comprising the at least one peptide sequence of the third peptide set for which the experimental assay was performed.
[0226] In some embodiments, selecting the plurality of unmodified peptide sequences to create the first peptide set comprises sliding a window of size n across at least a portion of an amino acid sequence encoding the tumor neoantigen, the pathogen proteome, or the self-protein, wherein n is between about 8 amino acids and about 25 amino acids in length, and wherein n is a length of each peptide sequence of the plurality of unmodified peptide sequences of the first peptide set. In some embodiments, each peptide sequence of the first peptide set binds to an HLA class I molecule or an HLA class II molecule. In some embodiments, further comprising filtering the first peptide set to exclude a peptide sequence with a predicted binding core that contains a target amino acid residue in an anchor position. In some embodiments, further comprising substituting at least one amino acid residue of each peptide sequence of the first peptide set, wherein for at least one peptide sequence of the first peptide set the at least one amino acid residue is in an anchor position. In some embodiments, the first threshold is a binding affinity of less than about 1000 nM. In some embodiments, the population coverage is computed with respect to the at least three HLA alleles. In some embodiments, the population coverage is computed based on a frequency of an HLA haplotype in a human population. In some embodiments, the population coverage is computed based on a frequency of the at least three HLA alleles in a human population. In some embodiments, the plurality of unmodified peptide sequences is derived from the tumor neoantigen, the pathogen proteome, or the selfprotein that is present in a subject. In some embodiments, the second threshold is a proportion of a human population of between about 0.7 and about 0.8. In some embodiments, the tumor neoantigen or the self-protein is associated with a cancer, and wherein the cancer is selected from the group consisting of pancreas, colon, rectum, kidney, bronchus, lung, uterus, cervix, bladder, liver, and stomach. In some embodiments, the pathogen proteome is associated with a pathogen infection in a human subject. In some embodiments, the immunogenic peptide composition comprises nucleic acid sequences encoding an amino acid sequence of the at least one peptide sequence of the third peptide set.
[0227] In another aspect, the invention provides a method of forming an immunogenic peptide composition, the method comprising using a processor to perform the steps of creating a first peptide set by selecting a plurality of unmodified peptide sequences, wherein the plurality of unmodified peptide sequences is associated with a tumor neoantigen, a pathogen proteome, or a self-protein, determining a plurality of peptide-HLA immunogenicity metrics for each peptide sequence in the first peptide set, determining whether each peptide sequence in the first peptide set has a peptide-HLA immunogenicity metric that passes a threshold with respect to at least three HLA alleles, creating a second peptide set comprising the first peptide set and a plurality of modified peptide sequences, wherein each modified peptide sequence of the plurality of modified peptide sequences comprises a substitution of at least one amino acid residue of a peptide sequence in the first peptide set, determining a plurality of peptide-HLA immunogenicity metrics for each peptide sequence in the second peptide set; and creating a third peptide set by selecting a subset of the second peptide set, wherein the selecting comprises a predicted vaccine performance, wherein the computing of the predicted vaccine performance comprises excluding a peptide-HLA immunogenicity metric with respect to a first HLA allele for a modified peptide sequence if a peptide-HLA immunogenicity metric for an unmodified peptide sequence associated with the modified peptide sequence does not pass the threshold with respect to the first HLA allele, and wherein the predicted vaccine performance is a function of a peptide-HLA immunogenicity metric of each peptide sequence in the third peptide set, performing an experimental assay to obtain a peptide-HLA immunogenicity metric for at least one peptide sequence of the third peptide set; and forming an immunogenic peptide composition comprising the at least one peptide sequence of the third peptide set for which the experimental assay was performed.
[0228] In some embodiments, selecting the plurality of unmodified peptide sequences to create the first peptide set comprises sliding a window of size n across at least a portion of an amino acid sequence encoding the tumor neoantigen, the pathogen proteome, or the self-protein, wherein n is between about 8 amino acids and about 25 amino acids in length, and wherein n is a length of each peptide sequence of the plurality of unmodified peptide sequences of the first peptide set. In some embodiments, each peptide sequence of the first peptide set binds to an HLA class I molecule or an HLA class II molecule. In some embodiments, further comprising filtering the first peptide set to exclude a peptide sequence with a predicted binding core that contains a target amino acid residue in an anchor position. In some embodiments, further comprising substituting at least one amino acid residue of each peptide sequence of the first
peptide set. In some embodiments, the threshold is a binding affinity of less than about 1000 nM. In some embodiments, the at least three HLA alleles are present in an HLA type of a subject. In some embodiments, the plurality of peptide sequences is derived from the tumor neoantigen, the pathogen proteome, or the self-protein that is present in the subject. In some embodiments, the immunogenic peptide composition comprises nucleic acid sequences encoding an amino acid sequence of the at least one peptide sequence of the third peptide set. [0229] In another aspect, the invention provides a method of forming an immunogenic peptide composition, the method comprising using a processor to perform the steps of creating a first peptide set by selecting a plurality of unmodified peptide sequences, wherein each peptide sequence of the plurality of unmodified peptide sequences is associated with a tumor neoantigen, a pathogen proteome, or a self-protein, determining a plurality of peptide-HLA binding scores for each peptide sequence in the first peptide set, determining whether each peptide sequence in the first peptide set has a peptide-HLA binding score that passes a threshold with respect to at least three HLA alleles, creating a second peptide set comprising the first peptide set and a plurality of modified peptide sequences, wherein each modified peptide sequence of the plurality of modified peptide sequences comprises a substitution of at least one amino acid residue of a peptide sequence in the first peptide set, determining a plurality of peptide-HLA binding scores for each peptide sequence in the second peptide set, and creating a third peptide set by selecting a subset of the second peptide set, wherein the selecting comprises computing a predicted vaccine performance, wherein the computing of the predicted vaccine performance comprises excluding a peptide-HLA binding score with respect to a first HLA allele for a modified peptide sequence if a peptide-HLA binding score for an unmodified peptide sequence associated with the modified peptide sequence does not pass the threshold with respect to the first HLA allele, and wherein the predicted vaccine performance is a function of a peptide- HLA binding score of each peptide sequence in the third peptide set, performing an experimental assay to obtain a peptide-HLA immunogenicity metric for at least one peptide sequence of the third peptide set, and forming an immunogenic peptide composition comprising the at least one peptide sequence of the third peptide set for which the experimental assay was performed. [0230] In some embodiments, the second threshold is determined from data obtained from one or more experimental assays. In some embodiments, the predicted vaccine performance is further a function of a peptide-HLA immunogenicity metric of at least one modified peptide
sequence of the second peptide set with respect to a second HLA allele if a first peptide sequence of the first peptide set is predicted to be bound to the second HLA allele with a first binding core, wherein the first binding core is a binding core of the first peptide sequence, wherein the first binding core is identical to a second binding core, wherein the first binding core and the second binding core each comprise an amino acid position within a peptide sequence, and wherein the second binding core is a binding core of the at least one modified peptide sequence. In some embodiments, the plurality of peptide sequences is derived from the tumor neoantigen, the pathogen proteome, or the self-protein that is present in a subject. [0231] In another aspect, the invention provides a method of forming an immunogenic peptide composition, the method comprising using a processor to perform the steps of creating a first peptide set by selecting a plurality of unmodified peptide sequences, wherein at least one peptide sequence of the plurality of unmodified peptide sequences is associated with a tumor neoantigen, a pathogen proteome, or a self-protein, determining a plurality of peptide-HLA binding scores for each peptide sequence in the first peptide set, determining whether each peptide sequence in the first peptide set has a peptide-HLA binding score that passes a threshold with respect to at least three HLA alleles, creating a second peptide set comprising the first peptide set and a plurality of modified peptide sequences, wherein each modified peptide sequence of the plurality of modified peptide sequences comprises a substitution of at least one amino acid residue of a peptide sequence in the first peptide set, determining a plurality of peptide-HLA binding scores for each peptide sequence in the second peptide set, and creating a third peptide set by selecting a subset of the second peptide set, wherein the selecting comprises computing a predicted vaccine performance based on an HLA type of a subject, and wherein the computing of the predicted vaccine performance comprises excluding a peptide-HLA binding score with respect to a first HLA allele for a modified peptide sequence if a peptide-HLA binding score for an unmodified peptide sequence associated with the modified peptide sequence does not pass the threshold with respect to the first HLA allele, performing an experimental assay to obtain a peptide-HLA immunogenicity metric for at least one peptide sequence of the third peptide set, and forming an immunogenic peptide composition comprising the at least one peptide sequence of the third peptide set for which the experimental assay was performed. [0232] In some embodiments, each peptide sequence in the first peptide set binds to an HLA class I molecule or an HLA class II molecule. In some embodiments, the plurality of peptide
sequences is derived from the tumor neoantigen, the pathogen proteome, or the self-protein that is present in the subject.
Compositions
[0233] In some embodiments, a peptide vaccine comprises one or more peptides of this disclosure and is administered in a pharmaceutical composition that includes a pharmaceutically acceptable carrier. In some embodiments, the peptide vaccine is comprised of the third peptide set, as described in this disclosure. In some embodiments, the pharmaceutical composition is in the form of a spray, aerosol, gel, solution, emulsion, lipid nanoparticle, nanoparticle, or suspension. In some embodiments, the pharmaceutical composition is in the form of a cationic nanoemulsion, one example of which is described by Brito et al. (2014) that is incorporated herein by reference.
[0234] The composition is preferably administered to a subject with a pharmaceutically acceptable carrier. Typically, in some embodiments, an appropriate amount of a pharmaceutically acceptable salt is used in the formulation, which in some embodiments can render the formulation isotonic.
[0235] In certain embodiments, the peptides are provided as an immunogenic composition comprising any one of the peptides described herein and a pharmaceutically acceptable carrier. In certain embodiments, the immunogenic composition further comprises an adjuvant. In certain embodiments, the peptides are conjugated with other molecules to increase their effectiveness as is known by those practiced in the art. For example, peptides can be coupled to antibodies that recognize cell surface proteins on antigen presenting cells to enhance vaccine effectiveness. One such method for increasing the effectiveness of peptide delivery is described in Woodham, et al. (2018). In certain embodiments for the treatment of autoimmune disorders, the peptides are delivered with a composition and protocol designed to induce tolerance as is known in the art. Example methods for using peptides for immune tolerization are described in Alhadj Ali, et al. (2017) and Gibson, et al. (2015).
[0236] In some embodiments, the pharmaceutically acceptable carrier is selected from the group consisting of saline, Ringer's solution, dextrose solution, and a combination thereof. Other suitable pharmaceutically acceptable carriers known in the art are contemplated. Suitable carriers and their formulations are described in Remington's Pharmaceutical Sciences, 2005, Mack Publishing Co. The pH of the solution is preferably from about 5 to about 8, and more
preferably from about 7 to about 7.5. The formulation may also comprise a lyophilized powder. Further carriers include sustained release preparations such as semipermeable matrices of solid hydrophobic polymers, which matrices are in the form of shaped articles, e.g., films, liposomes or microparticles. It will be apparent to those persons skilled in the art that certain carriers may be more preferable depending upon, for instance, the route of administration and concentration of peptides being administered. [0237] The phrase pharmaceutically acceptable carrier as used herein means a pharmaceutically acceptable material, composition or vehicle, such as a liquid or solid filler, diluent, excipient, solvent or encapsulating material, involved in carrying or transporting the subject pharmaceutical agent from one organ, or portion of the body, to another organ, or portion of the body. Each carrier is acceptable in the sense of being compatible with the other ingredients of the formulation and not injurious to the patient. Some examples of materials which can serve as pharmaceutically acceptable carriers include: sugars, such as lactose, glucose and sucrose; starches, such as corn starch and potato starch; cellulose, and its derivatives, such as sodium carboxymethyl cellulose, ethyl cellulose and cellulose acetate; powdered tragacanth; malt; gelatin; talc; excipients, such as cocoa butter and suppository waxes; oils, such as peanut oil, cottonseed oil, safflower oil, sesame oil, olive oil, corn oil and soybean oil; glycols, such as butylene glycol; polyols, such as glycerin, sorbitol, mannitol and polyethylene glycol; esters, such as ethyl oleate and ethyl laurate; agar; buffering agents, such as magnesium hydroxide and aluminum hydroxide; alginic acid; pyrogen-free water; isotonic saline; Ringer’s solution; ethyl alcohol; phosphate buffer solutions; and other non-toxic compatible substances employed in pharmaceutical formulations. The term carrier denotes an organic or inorganic ingredient, natural or synthetic, with which the active ingredient is combined to facilitate the application. The components of the pharmaceutical compositions also are capable of being comingled with the compounds of the present invention, and with each other, in a manner such that there is no interaction which would substantially impair the desired pharmaceutical efficiency. The composition may also include additional agents such as an isotonicity agent, a preservative, a surfactant, and, a divalent cation, preferably, zinc. [0238] The composition can also include an excipient, or an agent for stabilization of a peptide composition, such as a buffer, a reducing agent, a bulk protein, amino acids (such as e.g., glycine or praline) or a carbohydrate. Bulk proteins useful in formulating peptide
compositions include albumin. Typical carbohydrates useful in formulating peptides include but are not limited to sucrose, mannitol, lactose, trehalose, or glucose. [0239] Surfactants may also be used to prevent soluble and insoluble aggregation and/or precipitation of peptides or proteins included in the composition. Suitable surfactants include but are not limited to sorbitan trioleate, soya lecithin, and oleic acid. In certain cases, solution aerosols are preferred using solvents such as ethanol. Thus, formulations including peptides can also include a surfactant that can reduce or prevent surface-induced aggregation of peptides by atomization of the solution in forming an aerosol. Various conventional surfactants can be employed, such as polyoxyethylene fatty acid esters and alcohols, and polyoxyethylene sorbitol fatty acid esters. Amounts will generally range between 0.001% and 4% by weight of the formulation. In some embodiments, surfactants used with the present disclosure are polyoxyethylene sorbitan mono-oleate, polysorbate 80, polysorbate 20. Additional agents known in the art can also be included in the composition. [0240] In some embodiments, the pharmaceutical compositions and dosage forms further comprise one or more compounds that reduce the rate by which an active ingredient will decay, or the composition will change in character. So called stabilizers or preservatives may include, but are not limited to, amino acids, antioxidants, pH buffers, or salt buffers. Nonlimiting examples of antioxidants include butylated hydroxy anisole (BHA), ascorbic acid and derivatives thereof, tocopherol and derivatives thereof, butylated hydroxy anisole and cysteine. Nonlimiting examples of preservatives include parabens, such as methyl or propyl p- hydroxybenzoate and benzalkonium chloride. Additional nonlimiting examples of amino acids include glycine or proline. [0241] The present invention also teaches the stabilization (preventing or minimizing thermally or mechanically induced soluble or insoluble aggregation and/or precipitation of an inhibitor protein) of liquid solutions containing peptides at neutral pH or less than neutral pH by the use of amino acids including proline or glycine, with or without divalent cations resulting in clear or nearly clear solutions that are stable at room temperature or preferred for pharmaceutical administration. [0242] In one embodiment, the composition is a pharmaceutical composition of single unit or multiple unit dosage forms. Pharmaceutical compositions of single unit or multiple unit dosage forms of the invention comprise a prophylactically or therapeutically effective amount of
one or more compositions (e.g., a compound of the invention, or other prophylactic or therapeutic agent), typically, one or more vehicles, carriers, or excipients, stabilizing agents, and/or preservatives. Preferably, the vehicles, carriers, excipients, stabilizing agents and preservatives are pharmaceutically acceptable. [0243] In some embodiments, the pharmaceutical compositions and dosage forms comprise anhydrous pharmaceutical compositions and dosage forms. Anhydrous pharmaceutical compositions and dosage forms of the invention can be prepared using anhydrous or low moisture containing ingredients and low moisture or low humidity conditions. Pharmaceutical compositions and dosage forms that comprise lactose and at least one active ingredient that comprise a primary or secondary amine are preferably anhydrous if substantial contact with moisture and/or humidity during manufacturing, packaging, and/or storage is expected. An anhydrous pharmaceutical composition should be prepared and stored such that its anhydrous nature is maintained. Accordingly, anhydrous compositions are preferably packaged using materials known to prevent exposure to water such that they can be included in suitable formulary kits. Examples of suitable packaging include, but are not limited to, hermetically sealed foils, plastics, unit dose containers (e.g., vials), blister packs, and strip packs. [0244] Suitable vehicles are well known to those skilled in the art of pharmacy, and non- limiting examples of suitable vehicles include glucose, sucrose, starch, lactose, gelatin, rice, silica gel, glycerol, talc, sodium chloride, dried skim milk, propylene glycol, water, sodium stearate, ethanol, and similar substances well known in the art. Saline solutions and aqueous dextrose and glycerol solutions can also be employed as liquid vehicles. Whether a particular vehicle is suitable for incorporation into a pharmaceutical composition or dosage form depends on a variety of factors well known in the art including, but not limited to, the way in which the dosage form will be administered to a patient and the specific active ingredients in the dosage form. Pharmaceutical vehicles can be sterile liquids, such as water and oils, including those of petroleum, animal, vegetable or synthetic origin, such as peanut oil, soybean oil, mineral oil, sesame oil and the like. [0245] The invention also provides that a pharmaceutical composition can be packaged in a hermetically sealed container such as an ampoule or sachette indicating the quantity. In one embodiment, the pharmaceutical composition can be supplied as a dry sterilized lyophilized powder in a delivery device suitable for administration to the lower airways of a patient. The
contain one or more unit dosage forms containing the active ingredient. The pack can for example comprise metal or plastic foil, such as a blister pack. The pack or dispenser device can be accompanied by instructions for administration. [0246] Methods of preparing these formulations or compositions include the step of bringing into association a compound of the present invention with the carrier and, optionally, one or more accessory ingredients. In general, the formulations are prepared by uniformly and intimately bringing into association a compound of the present invention with liquid carriers, or finely divided solid carriers, or both, and then, if necessary, shaping the product. [0247] Formulations of the invention suitable for administration may be in the form of powders, granules, or as a solution or a suspension in an aqueous or non-aqueous liquid, or as an oil-in-water or water-in-oil liquid emulsion, or as an elixir or syrup, or as pastilles (using an inert base, such as gelatin and glycerin, or sucrose and acacia) and/or as mouthwashes and the like, each containing a predetermined amount of a compound of the present invention (e.g., peptides) as an active ingredient. [0248] A liquid composition herein can be used as such with a delivery device, or they can be used for the preparation of pharmaceutically acceptable formulations comprising peptides that are prepared for example by the method of spray drying. The methods of spray freeze- drying peptides/proteins for pharmaceutical administration disclosed in Maa et al., Curr. Pharm. Biotechnol., 2001, 1, 283-302, are incorporated herein. In another embodiment, the liquid solutions herein are freeze spray dried and the spray-dried product is collected as a dispersible peptide-containing powder that is therapeutically effective when administered to an individual. [0249] The compounds and pharmaceutical compositions of the present invention can be employed in combination therapies, that is, the compounds and pharmaceutical compositions can be administered concurrently with, prior to, or subsequent to, one or more other desired therapeutics or medical procedures (e.g., peptide vaccine can be used in combination therapy with another treatment such as chemotherapy, radiation, pharmaceutical agents, and/or another treatment). The particular combination of therapies (therapeutics or procedures) to employ in a combination regimen will take into account compatibility of the desired therapeutics and/or procedures and the desired therapeutic effect to be achieved. It will also be appreciated that the therapies employed may achieve a desired effect for the same disorder (for example, the
compound of the present invention may be administered concurrently with another therapeutic or prophylactic). [0250] The invention also provides a pharmaceutical pack or kit comprising one or more containers filled with one or more of the ingredients of the pharmaceutical compositions of the invention. Optionally associated with such container(s) can be a notice in the form prescribed by a governmental agency regulating the manufacture, use or sale of pharmaceuticals or biological products, which notice reflects approval by the agency of manufacture, use or sale for human administration. [0251] The current invention provides for dosage forms comprising peptides suitable for treating cancer or other diseases. The dosage forms can be formulated, e.g., as sprays, aerosols, nanoparticles, liposomes, or other forms known to one of skill in the art. See, e.g., Remington's Pharmaceutical Sciences; Remington: The Science and Practice of Pharmacy supra; Pharmaceutical Dosage Forms and Drug Delivery Systems by Howard C., Ansel et al., Lippincott Williams & Wilkins; 7th edition (Oct.1, 1999). [0252] Generally, a dosage form used in the acute treatment of a disease may contain larger amounts of one or more of the active ingredients it comprises than a dosage form used in the chronic treatment of the same disease. In addition, the prophylactically and therapeutically effective dosage form may vary among different conditions. For example, a therapeutically effective dosage form may contain peptides that has an appropriate immunogenic action when intending to treat cancer or other disease. On the other hand, a different effective dosage may contain peptides that has an appropriate immunogenic action when intending to use the peptides of the invention as a prophylactic (e.g., vaccine) against cancer or another disease/condition. These and other ways in which specific dosage forms encompassed by this invention will vary from one another and will be readily apparent to those skilled in the art. See, e.g., Remington's Pharmaceutical Sciences, 2005, Mack Publishing Co.; Remington: The Science and Practice of Pharmacy by Gennaro, Lippincott Williams & Wilkins; 20th edition (2003); Pharmaceutical Dosage Forms and Drug Delivery Systems by Howard C. Ansel et al., Lippincott Williams & Wilkins; 7th edition (Oct.1, 1999); and Encyclopedia of Pharmaceutical Technology, edited by Swarbrick, J. & J.C. Boylan, Marcel Dekker, Inc., New York, 1988, which are incorporated herein by reference in their entirety.
[0253] The pH of a pharmaceutical composition or dosage form may also be adjusted to improve delivery and/or stability of one or more active ingredients. Similarly, the polarity of a solvent carrier, its ionic strength, or tonicity can be adjusted to improve delivery. Compounds such as stearates can also be added to pharmaceutical compositions or dosage forms to alter advantageously the hydrophilicity or lipophilicity of one or more active ingredients to improve delivery. In this regard, stearates can also serve as a lipid vehicle for the formulation, as an emulsifying agent or surfactant, and as a delivery enhancing or penetration-enhancing agent. Different salts, hydrates, or solvates of the active ingredients can be used to adjust further the properties of the resulting composition.
[0254] Compositions can be formulated with appropriate carriers and adjuvants using techniques to yield compositions suitable for immunization. The compositions can include an adjuvant, such as, for example but not limited to, alum, poly IC, MF-59, squalene-based adjuvants, or liposomal based adjuvants suitable for immunization.
[0255] In some embodiments, the compositions and methods comprise any suitable agent or immune modulation which could modulate mechanisms of host immune tolerance and release of the induced antibodies. In certain embodiments, an immunomodulatory agent is administered in at time and in an amount sufficient for transient modulation of the subject's immune response so as to induce an immune response which comprises antibodies against for example tumor neoantigens (i.e., tumor-specific antigens (TSA)).
Expression systems
[0256] In certain aspects, the invention provides culturing a cell line that expresses any one of the peptides of the invention in a culture medium comprising any of the peptides described herein.
[0257] Various expression systems for producing recombinant proteins/peptides are known in the art, and include, prokaryotic (e.g., bacteria), plant, insect, yeast, and mammalian expression systems. Suitable cell lines, can be transformed, transduced, or transfected with nucleic acids containing coding sequences for the peptides of the invention in order to produce the molecule of interest. Expression vectors containing such a nucleic acid sequence, which can be linked to at least one regulatory sequence in a manner that allows expression of the nucleotide sequence in a host cell, can be introduced via methods known in the art. Practitioners in the art understand that designing an expression vector can depend on factors, such as the choice of host
cell to be transfected and/or the type and/or amount of desired protein to be expressed. Enhancer regions, which are those sequences found upstream or downstream of the promoter region in non-coding DNA regions, are also known in the art to be important in optimizing expression. If needed, origins of replication from viral sources can be employed, such as if a prokaryotic host is utilized for introduction of plasmid DNA. However, in eukaryotic organisms, chromosome integration is a common mechanism for DNA replication. For stable transfection of mammalian cells, a small fraction of cells can integrate introduced DNA into their genomes. The expression vector and transfection method utilized can be factors that contribute to a successful integration event. For stable amplification and expression of a desired protein, a vector containing DNA encoding a protein of interest is stably integrated into the genome of eukaryotic cells (for example mammalian cells), resulting in the stable expression of transfected genes. A gene that encodes a selectable marker (for example, resistance to antibiotics or drugs) can be introduced into host cells along with the gene of interest in order to identify and select clones that stably express a gene encoding a protein of interest. Cells containing the gene of interest can be identified by drug selection wherein cells that have incorporated the selectable marker gene will survive in the presence of the drug. Cells that have not incorporated the gene for the selectable marker die. Surviving cells can then be screened for the production of the desired protein molecule. [0258] A host cell strain, which modulates the expression of the inserted sequences, or modifies and processes the nucleic acid in a specific fashion desired also may be chosen. Such modifications (for example, glycosylation and other post-translational modifications) and processing (for example, cleavage) of peptide/protein products may be important for the function of the peptide/protein. Different host cell strains have characteristic and specific mechanisms for the post-translational processing and modification of proteins and gene products. As such, appropriate host systems or cell lines can be chosen to ensure the correct modification and processing of the target protein expressed. Thus, eukaryotic host cells possessing the cellular machinery for proper processing of the primary transcript, glycosylation, and phosphorylation of the gene product may be used. [0259] Various culturing parameters can be used with respect to the host cell being cultured. Appropriate culture conditions for mammalian cells are well known in the art (Cleveland WL, et al., J lmmunol Methods, 1983, 56(2): 221-234) or can be determined by the skilled artisan (see, for example, Animal Cell Culture: A Practical Approach 2nd Ed., Rickwood, D. and Hames, B.
D., eds. (Oxford University Press: New York, 1992)). Cell culturing conditions can vary according to the type of host cell selected. Commercially available medium can be utilized. [0260] Peptides of the invention can be purified from any human or non-human cell which expresses the polypeptide, including those which have been transfected with expression constructs that express peptides of the invention. For protein recovery, isolation and/or purification, the cell culture medium or cell lysate is centrifuged to remove particulate cells and cell debris. The desired polypeptide molecule is isolated or purified away from contaminating soluble proteins and polypeptides by suitable purification techniques. Non-limiting purification methods for proteins include: size exclusion chromatography; affinity chromatography; ion exchange chromatography; ethanol precipitation; reverse phase HPLC; chromatography on a resin, such as silica, or cation exchange resin, e.g., DEAE; chromatofocusing; SDS-PAGE; ammonium sulfate precipitation; gel filtration using, e.g., Sephadex G-75, Sepharose; protein A sepharose chromatography for removal of immunoglobulin contaminants; and the like. Other additives, such as protease inhibitors (e.g., PMSF or proteinase K) can be used to inhibit proteolytic degradation during purification. Purification procedures that can select for carbohydrates can also be used, e.g., ion-exchange soft gel chromatography, or HPLC using cation- or anionexchange resins, in which the more acidic fraction(s) is/are collected. Methods of treatment [0261] In some embodiments, the subject matter disclosed herein relates to a preventive medical treatment started after following diagnosis of cancer in order to prevent the disease from worsening or curing the disease. In some embodiments, the term “prevent” refers to providing a medical treatment to a subject (e.g., by administering nucleic acids or peptides disclosed herein) to stop a potential disease onset (e.g., one or more types of cancer), but “prevent” does not necessarily mean that the disease will not occur in 100% of subjects that received the medical treatment. In one embodiment, the subject matter disclosed herein relates to prophylaxis of subjects who are believed to be at risk for cancer or have previously been diagnosed with cancer (or another disease). In one embodiment, said subjects can be administered the peptide vaccine described herein or pharmaceutical compositions thereof. The invention contemplates using any of the peptides produced by the systems and methods described herein. In one embodiment, the peptide vaccines described herein can be administered subcutaneously via syringe or any other suitable method know in the art.
[0262] The compound(s) or combination of compounds disclosed herein, or pharmaceutical compositions may be administered to a cell, mammal, or human by any suitable means. Non- limiting examples of methods of administration include, among others, (a) administration though oral pathways, which includes administration in capsule, tablet, granule, spray, syrup, or other such forms; (b) administration through non-oral pathways such as intraocular, intranasal, intraauricular, rectal, vaginal, intraurethral, transmucosal, buccal, or transdermal, which includes administration as an aqueous suspension, an oily preparation or the like or as a drip, spray, suppository, salve, ointment or the like; (c) administration via injection, including subcutaneously, intraperitoneally, intravenously, intramuscularly, intradermally, intraorbitally, intracapsularly, intraspinally, intrasternally, or the like, including infusion pump delivery; (d) administration locally such as by injection directly in the renal or cardiac area, e.g., by depot implantation; (e) administration topically; as deemed appropriate by those of skill in the art for bringing the compound or combination of compounds disclosed herein into contact with living tissue; (f) administration via inhalation, including through aerosolized, nebulized, and powdered formulations; and (g) administration through implantation. [0263] As will be readily apparent to one skilled in the art, the effective in vivo dose to be administered and the particular mode of administration will vary depending upon the age, weight and species treated, and the specific use for which the compound or combination of compounds disclosed herein are employed. The determination of effective dose levels, that is the dose levels necessary to achieve the desired result, can be accomplished by one skilled in the art using routine pharmacological methods. Typically, human clinical applications of products are commenced at lower dose levels, with dose level being increased until the desired effect is achieved. Alternatively, acceptable in vitro studies can be used to establish useful doses and routes of administration of the compositions identified by the present methods using established pharmacological methods. Effective animal doses from in vivo studies can be converted to appropriate human doses using conversion methods known in the art (e.g., see Nair AB, Jacob S. A simple practice guide for dose conversion between animals and human. Journal of basic and clinical pharmacy.2016 Mar;7(2):27.) Methods of prevention [0264] In some embodiments, the peptides prepared using methods of the invention can be used as a vaccine to promote an immune response against cancer (e.g., against tumor neoantigens) In some embodiments the invention provides compositions and methods for
induction of immune response, for example induction of antibodies to tumor neoantigens. In some embodiments, the antibodies are broadly neutralizing antibodies. In some embodiments, the peptides prepared using methods of the invention can be used as a vaccine to promote an immune response against a pathogen. In some embodiments, the peptides prepared using methods of the invention can be used to promote immune tolerance as an autoimmune disease therapeutic. [0265] In some embodiments, the peptides prepared using methods of the invention can be combined with additional vaccine components. In some embodiments, these combined vaccines can be encoded in one or more nucleic acids that encode the peptides produced with the methods described herein and additional vaccine components (e.g. peptides or proteins) that are known in the art. In some embodiments, these combined vaccines are created by adding the peptides or proteins that encode the additional vaccine components of the peptides that result from the methods described here for combined formulation and packaging. An example of the combination of vaccine components is the creation of RAS vaccines that use one or more nucleic acids to encode the components of vaccines for KRAS G12D and KRAS G12V and packaging the nucleic acids in a mRNA-LNP or DNA formulation, or separately formulating different components as mRNA-LNP or DNA and then combining them for packaging or immediately before administration to a person. [0266] The compositions, systems, and methods disclosed herein are not to be limited in scope to the specific embodiments described herein. Indeed, various modifications of the compositions, systems, and methods in addition to those described will become apparent to those of skill in the art from the foregoing description.
Claims (120)
- What is claimed is: 1. A nucleic acid sequence encoding at least two amino acid sequences selected from the group consisting of SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO: 20, SEQ ID NO: 21, SEQ ID NO: 22, SEQ ID NO: 23, SEQ ID NO: 24, SEQ ID NO: 25, SEQ ID NO: 26, SEQ ID NO: 27, SEQ ID NO: 28, SEQ ID NO: 30, SEQ ID NO: 31, SEQ ID NO: 32, SEQ ID NO: 33, SEQ ID NO: 34, SEQ ID NO: 35, SEQ ID NO: 36, SEQ ID NO: 37, SEQ ID NO: 38, SEQ ID NO: 39, SEQ ID NO: 40, and SEQ ID NO: 41.
- 2. The nucleic acid sequence of claim 1, wherein the nucleic acid sequence is an immunogenic composition.
- 3. The nucleic acid sequence of claim 1, wherein the nucleic acid sequence is administered in a construct for expression in vivo.
- 4. The nucleic acid sequence of claim 3, wherein the in vivo administration of the nucleic acid sequence is configured to produce at least one peptide that is displayed by an HLA class I molecule.
- 5. The nucleic acid sequence of claim 4, wherein the at least one peptide is a modified or an unmodified fragment of a mutated KRAS protein.
- 6. The nucleic acid sequence of claim 5, wherein the mutated KRAS protein is selected from the group consisting of KRAS G12D, KRAS G12V, KRAS G12R, KRAS G12C, and KRAS G13D.
- 7. The nucleic acid sequence of claim 3, wherein the nucleic acid sequence is administered in an effective amount to a subject to prevent cancer.
- 8. The nucleic acid sequence of claim 3, wherein the nucleic acid sequence is administered in an effective amount to a subject to treat cancer.
- 9. An immunogenic peptide composition comprising at least two peptides selected from the group consisting of SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO: 20, SEQ ID NO: 21, SEQ ID NO: 22, SEQ ID NO: 23, SEQ ID NO: 24, SEQ ID NO: 25, SEQ ID NO: 26, SEQ ID NO: 27, SEQ ID NO: 28, SEQ ID NO: 30, SEQ ID NO: 31, SEQ ID NO: 32, SEQ ID NO: 33, SEQ ID NO: 34, SEQ ID NO: 35, SEQ ID NO: 36, SEQ ID NO: 37, SEQ ID NO: 38, SEQ ID NO: 39, SEQ ID NO: 40, and SEQ ID NO: 41.
- 10. The immunogenic peptide composition of claim 9, wherein at least one peptide of the at least two peptides is displayed by an HLA class I molecule in a subject.
- 11. The immunogenic peptide composition of claim 9, wherein at least one peptide of the at least two peptides is a modified or an unmodified fragment of a mutated KRAS protein.
- 12. The immunogenic peptide composition of claim 11, wherein the mutated KRAS protein is selected from the group consisting of KRAS G12D, KRAS G12V, KRAS G12R, KRAS G12C, and KRAS G13D.
- 13. The immunogenic peptide composition of claim 9, wherein the immunogenic peptide composition is administered in an effective amount to a subject to prevent cancer.
- 14. The immunogenic peptide composition of claim 9, wherein the immunogenic peptide composition is administered in an effective amount to a subject to treat cancer.
- 15. The immunogenic peptide composition of claim 9, wherein the immunogenic peptide composition comprises at least three peptides selected from the group consisting of SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO: 20, SEQ ID NO: 21, SEQ ID NO: 22, SEQ ID NO: 23, SEQ ID NO: 24, SEQ ID NO: 25, SEQ ID NO: 26, SEQ ID NO: 27, SEQ ID NO: 28, SEQ ID NO: 30, SEQ ID NO: 31, SEQ ID NO: 32, SEQ ID NO: 33, SEQ ID NO: 34, SEQ ID NO: 35, SEQ ID NO: 36, SEQ ID NO: 37, SEQ ID NO: 38, SEQ ID NO: 39, SEQ ID NO: 40, and SEQ ID NO: 41.
- 16. A nucleic acid sequence encoding at least one amino acid sequence selected from the group consisting of SEQ ID NO: 42, SEQ ID NO: 43, SEQ ID NO: 44, SEQ ID NO: 45, SEQ ID NO: 46, SEQ ID NO: 47, SEQ ID NO: 48, SEQ ID NO: 49, SEQ ID NO: 50, SEQ ID NO: 51, SEQ ID NO: 52, SEQ ID NO: 53, SEQ ID NO: 54, SEQ ID NO: 55, SEQ ID NO: 56, SEQ ID NO: 57, SEQ ID NO: 58, SEQ ID NO: 59, SEQ ID NO: 60, SEQ ID NO: 61, SEQ ID NO: 62, SEQ ID NO: 63, SEQ ID NO: 64, and SEQ ID NO: 65.
- 17. An immunogenic composition comprising the nucleic acid sequence of claim 16.
- 18. The nucleic acid sequence of claim 16, wherein the nucleic acid sequence is administered in a construct for expression in vivo.
- 19. The nucleic acid sequence of claim 18, wherein the in vivo administration of the nucleic acid sequence is configured to produce at least one peptide that is displayed by an HLA class II molecule.
- 20. The nucleic acid sequence of claim 16, wherein the at least one peptide is a modified fragment of a mutated KRAS protein.
- 21. The nucleic acid sequence of claim 20, wherein the mutated KRAS protein is selected from the group consisting of KRAS G12D, KRAS G12V, KRAS G12R, KRAS G12C, and KRAS G13D.
- 22. The nucleic acid sequence of claim 16, wherein the nucleic acid sequence is administered in an effective amount to a subject to prevent cancer.
- 23. The nucleic acid sequence of claim 16, wherein the nucleic acid sequence is administered in an effective amount to a subject to treat cancer.
- 24. An immunogenic peptide composition comprising at least one peptide selected from the group consisting of SEQ ID NO: 42, SEQ ID NO: 43, SEQ ID NO: 44, SEQ ID NO: 45, SEQ ID NO: 46, SEQ ID NO: 47, SEQ ID NO: 48, SEQ ID NO: 49, SEQ ID NO: 50, SEQ ID NO: 51, SEQ ID NO: 52, SEQ ID NO: 53, SEQ ID NO: 54, SEQ ID NO: 55, SEQ ID NO: 56, SEQ ID NO: 57, SEQ ID NO: 58, SEQ ID NO: 59, SEQ ID NO: 60, SEQ ID NO: 61, SEQ ID NO: 62, SEQ ID NO: 63, SEQ ID NO: 64, and SEQ ID NO: 65.
- 25. The immunogenic peptide composition of claim 24, wherein at least one peptide in the immunogenic peptide composition is displayed by an HLA class II molecule.
- 26. The immunogenic peptide composition of claim 24, wherein at least one peptide in the immunogenic peptide composition is a modified or an unmodified fragment of a mutated KRAS protein.
- 27. The immunogenic peptide composition of claim 26, wherein the mutated KRAS protein is selected from the group consisting of KRAS G12D, KRAS G12V, KRAS G12R, KRAS G12C, and KRAS G13D.
- 28. The immunogenic peptide composition of claim 24, wherein the immunogenic peptide composition is administered in an effective amount to a subject to prevent cancer.
- 29. The immunogenic peptide composition of claim 24, wherein the immunogenic peptide composition is administered in an effective amount to a subject to treat cancer.
- 30. The immunogenic peptide composition of claim 24, wherein the immunogenic peptide composition comprises at least two peptides selected from the group consisting of SEQ ID NO: 42, SEQ ID NO: 43, SEQ ID NO: 44, SEQ ID NO: 45, SEQ ID NO: 46, SEQ ID NO: 47, SEQ ID NO: 48, SEQ ID NO: 49, SEQ ID NO: 50, SEQ ID NO: 51, SEQ ID NO: 52, SEQ ID NO: 53, SEQ ID NO: 54, SEQ ID NO: 55, SEQ ID NO: 56, SEQ ID NO: 57, SEQ ID NO: 58, SEQ ID NO: 59, SEQ ID NO: 60, SEQ ID NO: 61, SEQ ID NO: 62, SEQ ID NO: 63, SEQ ID NO: 64, and SEQ ID NO: 65.
- 31. A method of forming an immunogenic peptide composition, the method comprising: using a processor to perform the steps of: creating a first peptide set by selecting a plurality of unmodified peptide sequences, wherein at least one peptide sequence of the plurality of unmodified peptide sequences is associated with a tumor neoantigen or a self-protein; determining a plurality of peptide-HLA binding scores for each peptide sequence in the first peptide set; determining whether each peptide sequence in the first peptide set has a peptide- HLA binding score that passes a first threshold with respect to at least three HLA alleles; creating a second peptide set comprising the first peptide set and a plurality of modified peptide sequences, wherein each modified peptide sequence of the plurality of modified peptide sequences comprises a substitution of at least one amino acid residue of a peptide sequence in the first peptide set; determining whether each modified peptide sequence in the second peptide set has a peptide-HLA binding score that passes a second threshold with respect to the at least three HLA alleles, wherein the second threshold is more constrained than the first threshold; and creating a third peptide set by selecting a subset of the second peptide set, wherein the selecting comprises computing a population coverage, wherein the computing of the population coverage comprises excluding a peptide-HLA binding score with respect to a first HLA allele for a modified peptide sequence if a peptide-HLA binding score for an unmodified peptide sequence associated with the modified peptide sequence does not pass the first threshold with respect to the first HLA allele, and wherein the selected subset has a population coverage above a third threshold; performing an experimental assay to obtain a peptide-HLA immunogenicity metric for at least one peptide sequence in the third peptide set; and forming an immunogenic peptide composition comprising the at least one peptide sequence in the third peptide set for which the experimental assay was performed.
- 32. The method of claim 31, wherein selecting the plurality of unmodified peptide sequences to create the first peptide set comprises sliding a window of size n across an amino acid sequence encoding the tumor neoantigen or the self-protein, wherein n is between about 8 amino acids and about 25 amino acids in length, and wherein n is a length of each peptide sequence of the plurality of unmodified peptide sequences in the first peptide set.
- 33. The method of claim 31, wherein each peptide sequence in the first peptide set binds to an HLA class I molecule or an HLA class II molecule.
- 34. The method of claim 31, further comprising filtering the first peptide set to exclude a peptide sequence with a predicted binding core that contains a target residue in an anchor position.
- 35. The method of claim 31, further comprising substituting at least one amino acid residue of each peptide sequence in the first peptide set, wherein for at least one peptide sequence in the first peptide set the at least one amino acid residue is in an anchor position.
- 36. The method of claim 31, wherein the first threshold is a binding affinity of less than about 1000 nM.
- 37. The method of claim 31, when the population coverage is computed with respect to the at least three HLA alleles.
- 38. The method of claim 31, wherein the second threshold is a binding affinity of less than about 500 nM.
- 39. The method of claim 31, wherein the population coverage is computed based on a frequency of an HLA haplotype in a human population.
- 40. The method of claim 31, wherein the population coverage is computed based on a frequency of the at least three HLA alleles in a human population.
- 41. The method of claim 31, wherein the plurality of unmodified peptide sequences is derived from the tumor neoantigen or the self-protein that is present in a subject.
- 42. The method of claim 31, wherein the third threshold is a proportion of a human population of between about 0.7 and about 0.8.
- 43. The method of claim 31, wherein the tumor neoantigen or the self-protein is associated with a cancer, and wherein the cancer is selected from the group consisting of pancreas, colon, rectum, kidney, bronchus, lung, uterus, cervix, bladder, liver, and stomach.
- 44. A method of forming an immunogenic peptide composition, the method comprising: using a processor to perform the steps of: creating a first peptide set by selecting a plurality of unmodified peptide sequences, wherein the plurality of unmodified peptide sequences is associated with a tumor neoantigen or a self-protein; determining a plurality of peptide-HLA immunogenicity metrics for each peptide sequence in the first peptide set; determining whether each peptide sequence in the first peptide set has a peptide-HLA immunogenicity metric that passes a first threshold with respect to at least three HLA alleles; creating a second peptide set comprising the first peptide set and a plurality of modified peptide sequences, wherein each modified peptide sequence of the plurality of modified peptide sequences comprises a substitution of at least one amino acid residue of a peptide sequence in the first peptide set; determining whether each modified peptide sequence in the second peptide set has a peptide-HLA immunogenicity metric that passes a second threshold with respect to the at least three HLA alleles, wherein the second threshold is more constrained than the first threshold; and creating a third peptide set by selecting a subset of the second peptide set, wherein the selecting comprises computing a predicted vaccine performance, wherein the computing of the predicted vaccine performance comprises excluding a peptide-HLA immunogenicity metric with respect to a first HLA allele for a modified peptide sequence if a peptide-HLA immunogenicity metric for an unmodified peptide sequence associated with the modified peptide sequence does not pass the first threshold with respect to the first HLA allele, and wherein the predicted vaccine performance is a function of a non-excluded peptide-HLA immunogenicity metric of each peptide sequence in the third peptide set; performing an experimental assay to obtain a peptide-HLA immunogenicity metric for at least one peptide sequence in the third peptide set; and forming an immunogenic peptide composition comprising the at least one peptide sequence in the third peptide set for which the experimental assay was performed.
- 45. The method of claim 44, wherein selecting the plurality of unmodified peptide sequences to create the first peptide set comprises sliding a window of size n across an amino acid sequence encoding the tumor neoantigen or the self-protein, wherein n is between about 8 amino acids and about 25 amino acids in length, and wherein n is a length of each peptide sequence of the plurality of unmodified peptide sequences in the first peptide set.
- 46. The method of claim 44, wherein each peptide sequence in the first peptide set binds to an HLA class I molecule or an HLA class II molecule.
- 47. The method of claim 44, further comprising filtering the first peptide set to exclude a peptide sequence with a predicted binding core that contains a target residue in an anchor position.
- 48. The method of claim 44, further comprising substituting at least one amino acid residue of each peptide sequence in the first peptide set.
- 49. The method of claim 44, wherein the first threshold is a binding affinity of less than about 1000 nM.
- 50. A method of forming an immunogenic peptide composition, the method comprising: using a processor to perform the steps of: creating a first peptide set by selecting a plurality of unmodified peptide sequences, wherein each peptide sequence of the plurality of unmodified peptide sequences is associated with a tumor neoantigen or a self-protein; determining a plurality of peptide-HLA binding scores for each peptide sequence in the first peptide set; determining whether each peptide sequence in the first peptide set has a peptide-HLA binding score that passes a first threshold with respect to at least three HLA alleles; creating a second peptide set comprising the first peptide set and a plurality of the plurality of modified peptide sequences comprises a substitution of at least one amino acid residue of a peptide sequence in the first peptide set; determining whether each modified peptide sequence in the second peptide set has a peptide-HLA binding score that passes a second threshold with respect to the at least three HLA alleles, wherein the second threshold is more constrained than the first threshold; and creating a third peptide set by selecting a subset of the second peptide set, wherein the selecting comprises computing a predicted vaccine performance, wherein the computing of the predicted vaccine performance comprises excluding a peptide-HLA binding score with respect to a first HLA allele for a modified peptide sequence if a peptide- HLA binding score for an unmodified peptide sequence associated with the modified peptide sequence does not pass the first threshold with respect to the first HLA allele, and wherein the predicted vaccine performance is a function of a non-excluded peptide-HLA binding score of each peptide sequence in the third peptide set; performing an experimental assay to obtain a peptide-HLA immunogenicity metric for at least one peptide sequence in the third peptide set; and forming an immunogenic peptide composition comprising the at least one peptide sequence in the third peptide set for which the experimental assay was performed.
- 51. The method of claim 50, wherein the second threshold is based on data obtained from one or more experimental assays.
- 52. The method of claim 50, wherein the predicted vaccine performance is further a function of a peptide-HLA immunogenicity metric of at least one modified peptide sequence of the plurality of modified peptide sequences bound to a second HLA allele of the at least three HLA alleles if a first peptide sequence in the first peptide set is predicted to be bound to the second HLA allele of the at least three HLA alleles with a first binding core, wherein the first binding core is a binding core of the first peptide sequence, wherein the first binding core is identical to a second binding core, wherein the first binding core and the second binding core comprise an amino acid position within a peptide sequence and wherein the second binding core is a binding core of the at least one modified peptide sequence of the plurality of modified peptide sequences bound to the second HLA allele.
- 53. A method of forming an immunogenic peptide composition, the method comprising: using a processor to perform the steps of: creating a first peptide set by selecting a plurality of unmodified peptide sequences, wherein at least one peptide sequence of the plurality of unmodified peptide sequences is associated with a tumor neoantigen or a self-protein; determining a plurality of peptide-HLA binding scores for each peptide sequence in the first peptide set; determining whether each peptide sequence in the first peptide set has a peptide- HLA binding score that passes a first threshold with respect to at least three HLA alleles; creating a second peptide set comprising the first peptide set and a plurality of modified peptide sequences, wherein each modified peptide sequence of the plurality of modified peptide sequences comprises a substitution of at least one amino acid residue of a peptide sequence in the first peptide set; determining whether each modified peptide sequence in the second peptide set has a peptide-HLA binding score that passes a second threshold with respect to the at least three HLA alleles, wherein the second threshold is more constrained than the first threshold; creating a third peptide set by selecting a subset of the second peptide set, wherein the selecting comprises computing a predicted vaccine performance based on an HLA type of a subject, and wherein the computing of the predicted vaccine performance comprises excluding a peptide-HLA binding score with respect to a first HLA allele for a modified peptide sequence if a peptide-HLA binding score for an unmodified peptide sequence associated with the modified peptide sequence does not pass the first threshold with respect to the first HLA allele; performing an experimental assay to obtain a peptide-HLA immunogenicity metric for at least one peptide sequence in the third peptide set; and forming an immunogenic peptide composition comprising the at least one peptide sequence in the third peptide set for which the experimental assay was performed.
- 54. The method of claim 53, wherein each peptide sequence in the first peptide set binds to an HLA class I molecule or an HLA class II molecule.
- 55. The method of claim 54, wherein the plurality of unmodified peptide sequences is derived from the tumor neoantigen or the self-protein that is present in a subject.
- 56. The method of claim 54, wherein the at least three HLA alleles are present in the HLA type of a subject.
- 57. The method of claim 53, wherein the plurality of unmodified peptide sequences is derived from the tumor neoantigen or the self-protein that is present in a subject.
- 58. The method of claim 53, wherein the plurality of unmodified peptide sequences is derived from the tumor neoantigen or the self-protein that is present in the subject.
- 59. The method of claim 53, wherein the immunogenic peptide composition comprises nucleic acid sequences encoding an amino acid sequence of the at least one peptide sequence in the third peptide set.
- 60. The method of claim 53, wherein the immunogenic peptide composition comprises nucleic acid sequences encoding an amino acid sequence of the at least one peptide sequence in the third peptide set.
- 61. A composition comprising nucleic acid sequences encoding at least two amino acid sequences selected from the group consisting of SEQ ID NOs: 1550 to 1593.
- 62. The composition of claim 61, wherein the composition is immunogenic.
- 63. The composition of claim 61, wherein the nucleic acid sequences are
- 64. The composition of claim 63, wherein the in vivo administration of the nucleic acid sequence is configured to produce at least one peptide that is displayed by an HLA class I molecule.
- 65. The composition of claim 64, wherein the at least one peptide is a modified or an unmodified fragment of a BCL-ABL gene fusion.
- 66. The composition of claim 65, wherein the BCR-ABL gene fusion is b3a2 or b2a2.
- 67. The composition of claim 63, wherein the nucleic acid sequences are administered in an effective amount to a subject to prevent cancer.
- 68. The composition of claim 63, wherein the nucleic acid sequences are administered in an effective amount to a subject to treat cancer.
- 69. An immunogenic peptide composition comprising at least two peptides selected from the group consisting of SEQ ID NOs: 1550 to 1593.
- 70. The immunogenic peptide composition of claim 69, wherein at least one peptide of the at least two peptides is displayed by an HLA class I molecule in a subject.
- 71. The immunogenic peptide composition of claim 69, wherein at least one peptide in the immunogenic peptide composition is a modified or an unmodified fragment of a BCL- ABL gene fusion.
- 72. The immunogenic peptide composition of claim 71, wherein the BCR-ABL gene fusion is b3a2 or b2a2.
- 73. The immunogenic peptide composition of claim 69, wherein the immunogenic peptide composition is administered in an effective amount to a subject to prevent cancer.
- 74. The immunogenic peptide composition of claim 69, wherein the immunogenic peptide composition is administered in an effective amount to a subject to treat cancer.
- 75. The immunogenic peptide composition of claim 69, wherein the immunogenic peptide composition comprises at least three peptides selected from the group consisting of SEQ ID NOs: 1550 to 1593
- 76. A nucleic acid sequence encoding at least one amino acid sequence selected from the group consisting of SEQ ID NOs: 1595 to 1661.
- 77. An immunogenic composition comprising the nucleic acid sequence of claim 76.
- 78. The nucleic acid sequence of claim 76, wherein the nucleic acid sequence is administered in a construct for expression in vivo.
- 79. The nucleic acid sequence of claim 78, wherein the in vivo administration of the nucleic acid sequence is configured to produce at least one peptide that is displayed by an HLA class II molecule.
- 80. The nucleic acid sequence of claim 76, wherein the at least one amino acid sequence is derived from a modified fragment of a BCL-ABL gene fusion.
- 81. The nucleic acid sequence of claim 80, wherein the BCR-ABL gene fusion is b3a2 or b2a2.
- 82. The nucleic acid sequence of claim 76, wherein the nucleic acid sequence is administered in an effective amount to a subject to prevent cancer.
- 83. The nucleic acid sequence of claim 76, wherein the nucleic acid sequence is administered in an effective amount to a subject to treat cancer.
- 84. An immunogenic peptide composition comprising at least one peptide selected from the group consisting of SEQ ID NOs: 1595 to 1661.
- 85. The immunogenic peptide composition of claim 84, wherein the at least one peptide is displayed by an HLA class II molecule in a subject.
- 86. The immunogenic peptide composition of claim 84, wherein the at least one peptide in the immunogenic peptide composition is a modified or an unmodified fragment of a BCL-ABL gene fusion.
- 87. The immunogenic peptide composition of claim 86, wherein the BCR-ABL gene fusion is b3a2 or b2a2.
- 88. The immunogenic peptide composition of claim 84, wherein the immunogenic
- 89. The immunogenic peptide composition of claim 84, wherein the immunogenic peptide composition is administered in an effective amount to a subject to treat cancer.
- 90. The immunogenic peptide composition of claim 84, wherein the immunogenic peptide composition comprises at least two peptides selected from the group consisting of SEQ ID NOs: 1595 to 1661.
- 91. A method of forming an immunogenic peptide composition, the method comprising: using a processor to perform the steps of: creating a first peptide set by selecting a plurality of unmodified peptide sequences, wherein at least one peptide sequence of the plurality of unmodified peptide sequences is associated with a tumor neoantigen, a pathogen proteome, or a self-protein; determining a plurality of peptide-HLA binding scores for each peptide sequence in the first peptide set; determining whether each peptide sequence in the first peptide set has a peptide-HLA binding score that passes a first threshold with respect to at least three HLA alleles; creating a second peptide set comprising the first peptide set and a plurality of modified peptide sequences, wherein each modified peptide sequence of the plurality of modified peptide sequences comprises a substitution of at least one amino acid residue of a peptide sequence in the first peptide set; determining a plurality of peptide-HLA binding scores for each peptide sequence in the second peptide set; and creating a third peptide set by selecting a subset of the second peptide set, wherein the selecting comprises computing a population coverage, wherein the computing of the population coverage comprises excluding a peptide-HLA binding score with respect to a first HLA allele for a modified peptide sequence if a peptide-HLA binding score for an unmodified peptide sequence associated with the modified peptide sequence does not pass the first threshold with respect to the first HLA allele, and wherein the selected subset has a population coverage above a second threshold; performing an experimental assay to obtain a peptide-HLA immunogenicity metric for at least one peptide sequence of the third peptide set; and forming an immunogenic peptide composition comprising the at least one peptide sequence of the third peptide set for which the experimental assay was performed.
- 92. The method of claim 91, wherein selecting the plurality of unmodified peptide sequences to create the first peptide set comprises sliding a window of size n across at least a portion of an amino acid sequence encoding the tumor neoantigen, the pathogen proteome, or the self-protein, wherein n is between about 8 amino acids and about 25 amino acids in length, and wherein n is a length of each peptide sequence of the plurality of unmodified peptide sequences of the first peptide set.
- 93. The method of claim 91, wherein each peptide sequence of the first peptide set binds to an HLA class I molecule or an HLA class II molecule.
- 94. The method of claim 91, further comprising filtering the first peptide set to exclude a peptide sequence with a predicted binding core that contains a target amino acid residue in an anchor position.
- 95. The method of claim 91, further comprising substituting at least one amino acid residue of each peptide sequence of the first peptide set, wherein for at least one peptide sequence of the first peptide set the at least one amino acid residue is in an anchor position.
- 96. The method of claim 91, wherein the first threshold is a binding affinity of less than about 1000 nM.
- 97. The method of claim 91, wherein the population coverage is computed with respect to the at least three HLA alleles.
- 98. The method of claim 91, wherein the population coverage is computed based on a frequency of an HLA haplotype in a human population.
- 99. The method of claim 91, wherein the population coverage is computed based on a frequency of the at least three HLA alleles in a human population.
- 100. The method of claim 91, wherein the plurality of unmodified peptide sequences is derived from the tumor neoantigen, the pathogen proteome, or the self-protein that is present in a subject.
- 101. The method of claim 91, wherein the second threshold is a proportion of a human population of between about 0.7 and about 0.8.
- 102. The method of claim 91, wherein the tumor neoantigen or the self-protein is associated with a cancer, and wherein the cancer is selected from the group consisting of pancreas, colon, rectum, kidney, bronchus, lung, uterus, cervix, bladder, liver, and stomach.
- 103. The method of claim 91, wherein the pathogen proteome is associated with a pathogen infection in a human subject.
- 104. The method of claim 91, wherein the immunogenic peptide composition comprises nucleic acid sequences encoding an amino acid sequence of the at least one peptide sequence of the third peptide set.
- 105. A method of forming an immunogenic peptide composition, the method comprising: using a processor to perform the steps of: creating a first peptide set by selecting a plurality of unmodified peptide sequences, wherein the plurality of unmodified peptide sequences is associated with a tumor neoantigen, a pathogen proteome, or a self-protein; determining a plurality of peptide-HLA immunogenicity metrics for each peptide sequence in the first peptide set; determining whether each peptide sequence in the first peptide set has a peptide-HLA immunogenicity metric that passes a threshold with respect to at least three HLA alleles; creating a second peptide set comprising the first peptide set and a plurality of modified peptide sequences, wherein each modified peptide sequence of the plurality of modified peptide sequences comprises a substitution of at least one amino acid residue of a peptide sequence in the first peptide set; determining a plurality of peptide-HLA immunogenicity metrics for each peptide sequence in the second peptide set; and creating a third peptide set by selecting a subset of the second peptide set, wherein the selecting comprises a predicted vaccine performance, wherein the computing of the predicted vaccine performance comprises excluding a peptide-HLA immunogenicity metric with respect to a first HLA allele for a modified peptide sequence if a peptide-HLA immunogenicity metric for an unmodified peptide sequence associated with the modified peptide sequence does not pass the threshold with respect to the first HLA allele, and wherein the predicted vaccine performance is a function of a peptide-HLA immunogenicity metric of each peptide sequence in the third peptide set; performing an experimental assay to obtain a peptide-HLA immunogenicity metric for at least one peptide sequence of the third peptide set; and forming an immunogenic peptide composition comprising the at least one peptide sequence of the third peptide set for which the experimental assay was performed.
- 106. The method of claim 105, wherein selecting the plurality of unmodified peptide sequences to create the first peptide set comprises sliding a window of size n across at least a portion of an amino acid sequence encoding the tumor neoantigen, the pathogen proteome, or the self-protein, wherein n is between about 8 amino acids and about 25 amino acids in length, and wherein n is a length of each peptide sequence of the plurality of unmodified peptide sequences of the first peptide set.
- 107. The method of claim 105, wherein each peptide sequence of the first peptide set binds to an HLA class I molecule or an HLA class II molecule.
- 108. The method of claim 105, further comprising filtering the first peptide set to exclude a peptide sequence with a predicted binding core that contains a target amino acid residue in an anchor position.
- 109. The method of claim 105, further comprising substituting at least one amino acid residue of each peptide sequence of the first peptide set.
- 110. The method of claim 105, wherein the threshold is a binding affinity of less than about 1000 nM.
- 111. The method of claim 105, wherein the at least three HLA alleles are present in an HL A type of a subject.
- 112. The method of claim 111, wherein the plurality of peptide sequences is derived from the tumor neoantigen, the pathogen proteome, or the self-protein that is present in the subject.
- 113. The method of claim 105, wherein the immunogenic peptide composition comprises nucleic acid sequences encoding an amino acid sequence of the at least one peptide sequence of the third peptide set.
- 114. A method of forming an immunogenic peptide composition, the method comprising: using a processor to perform the steps of: creating a first peptide set by selecting a plurality of unmodified peptide sequences, wherein each peptide sequence of the plurality of unmodified peptide sequences is associated with a tumor neoantigen, a pathogen proteome, or a self-protein; determining a plurality of peptide-HLA binding scores for each peptide sequence in the first peptide set; determining whether each peptide sequence in the first peptide set has a peptide-HLA binding score that passes a threshold with respect to at least three HLA alleles; creating a second peptide set comprising the first peptide set and a plurality of modified peptide sequences, wherein each modified peptide sequence of the plurality of modified peptide sequences comprises a substitution of at least one amino acid residue of a peptide sequence in the first peptide set; determining a plurality of peptide-HLA binding scores for each peptide sequence in the second peptide set; and creating a third peptide set by selecting a subset of the second peptide set, wherein the selecting comprises computing a predicted vaccine performance, wherein the computing of the predicted vaccine performance comprises excluding a peptide-HLA binding score with respect to a first HLA allele for a modified peptide sequence if a peptide-HLA binding score for an unmodified peptide sequence associated with the modified peptide sequence does not pass the threshold with respect to the first HLA allele, and wherein the predicted vaccine performance is a function of a peptide-HLA binding score of each peptide sequence in the third peptide set; performing an experimental assay to obtain a peptide-HLA immunogenicity metric for at least one peptide sequence of the third peptide set; and forming an immunogenic peptide composition comprising the at least one peptide sequence of the third peptide set for which the experimental assay was performed.
- 115. The method of claim 114, wherein the second threshold is determined from data obtained from one or more experimental assays.
- 116. The method of claim 114, wherein the predicted vaccine performance is further a function of a peptide-HLA immunogenicity metric of at least one modified peptide sequence of the second peptide set with respect to a second HLA allele if a first peptide sequence of the first peptide set is predicted to be bound to the second HLA allele with a first binding core, wherein the first binding core is a binding core of the first peptide sequence, wherein the first binding core is identical to a second binding core, wherein the first binding core and the second binding core each comprise an amino acid position within a peptide sequence, and wherein the second binding core is a binding core of the at least one modified peptide sequence.
- 117. The method of claim 114, wherein the plurality of peptide sequences is derived from the tumor neoantigen, the pathogen proteome, or the self-protein that is present in a subject.
- 118. A method of forming an immunogenic peptide composition, the method comprising: using a processor to perform the steps of: creating a first peptide set by selecting a plurality of unmodified peptide sequences, wherein at least one peptide sequence of the plurality of unmodified peptide sequences is associated with a tumor neoantigen, a pathogen proteome, or a self-protein; determining a plurality of peptide-HLA binding scores for each peptide sequence in the first peptide set; determining whether each peptide sequence in the first peptide set has a peptide-HLA binding score that passes a threshold with respect to at least three HLA alleles; creating a second peptide set comprising the first peptide set and a plurality of modified peptide sequences, wherein each modified peptide sequence of the plurality of modified peptide sequences comprises a substitution of at least one amino acid residue of a peptide sequence in the first peptide set; determining a plurality of peptide-HLA binding scores for each peptide sequence in the second peptide set; and creating a third peptide set by selecting a subset of the second peptide set, wherein the selecting comprises computing a predicted vaccine performance based on an HLA type of a subject, and wherein the computing of the predicted vaccine performance comprises excluding a peptide-HLA binding score with respect to a first HLA allele for a modified peptide sequence if a peptide-HLA binding score for an unmodified peptide sequence associated with the modified peptide sequence does not pass the threshold with respect to the first HLA allele; performing an experimental assay to obtain a peptide-HLA immunogenicity metric for at least one peptide sequence of the third peptide set; and forming an immunogenic peptide composition comprising the at least one peptide sequence of the third peptide set for which the experimental assay was performed.
- 119. The method of claim 118, wherein each peptide sequence in the first peptide set binds to an HLA class I molecule or an HLA class II molecule.
- 120. The method of claim 118, wherein the plurality of peptide sequences is derived from the tumor neoantigen, the pathogen proteome, or the self-protein that is present in the subject.
Applications Claiming Priority (11)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US17/100,630 US11058751B1 (en) | 2020-11-20 | 2020-11-20 | Compositions for optimized RAS peptide vaccines |
US17/100,630 | 2020-11-20 | ||
US17/114,237 | 2020-12-07 | ||
US17/114,237 US11161892B1 (en) | 2020-12-07 | 2020-12-07 | Method of compact peptide vaccines using residue optimization |
US17/336,960 | 2021-06-02 | ||
US17/336,960 US11235039B1 (en) | 2020-11-20 | 2021-06-02 | Immunogenic compositions comprising nucleic acids for RAS peptides |
US17/389,875 | 2021-07-30 | ||
US17/389,875 US11421015B2 (en) | 2020-12-07 | 2021-07-30 | Method of compact peptide vaccines using residue optimization |
US202163249235P | 2021-09-28 | 2021-09-28 | |
US63/249,235 | 2021-09-28 | ||
PCT/US2021/060013 WO2022109220A1 (en) | 2020-11-20 | 2021-11-19 | Compositions and methods for optimized peptide vaccines |
Publications (1)
Publication Number | Publication Date |
---|---|
AU2021381384A1 true AU2021381384A1 (en) | 2023-06-22 |
Family
ID=81708071
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
AU2021381384A Pending AU2021381384A1 (en) | 2020-11-20 | 2021-11-19 | Compositions and methods for optimized peptide vaccines |
Country Status (5)
Country | Link |
---|---|
EP (1) | EP4247401A1 (en) |
JP (1) | JP2023551204A (en) |
AU (1) | AU2021381384A1 (en) |
CA (1) | CA3202227A1 (en) |
WO (1) | WO2022109220A1 (en) |
Families Citing this family (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20240131133A1 (en) * | 2022-10-10 | 2024-04-25 | Think Therapeutics, Inc. | Compositions and methods for optimized kras peptide vaccines |
CN116130005B (en) * | 2023-01-30 | 2023-06-16 | 深圳新合睿恩生物医疗科技有限公司 | Tandem design method and device for multi-epitope vaccine, equipment and storage medium |
Family Cites Families (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2016154047A2 (en) * | 2015-03-20 | 2016-09-29 | Memorial Sloan-Kettering Cancer Center | Monoclonal antigen-binding proteins to intracellular oncogene products |
CN108513593A (en) * | 2015-04-23 | 2018-09-07 | 南托米克斯有限责任公司 | The new epitope of cancer |
MA42420A (en) * | 2015-05-13 | 2018-05-23 | Agenus Inc | VACCINES FOR THE TREATMENT AND PREVENTION OF CANCER |
EP3297660A2 (en) * | 2015-05-20 | 2018-03-28 | The Broad Institute Inc. | Shared neoantigens |
AU2019290127A1 (en) * | 2018-06-19 | 2021-01-28 | Biontech Us Inc. | Neoantigens and uses thereof |
US11058751B1 (en) * | 2020-11-20 | 2021-07-13 | Think Therapeutics, Inc. | Compositions for optimized RAS peptide vaccines |
-
2021
- 2021-11-19 EP EP21895640.7A patent/EP4247401A1/en active Pending
- 2021-11-19 WO PCT/US2021/060013 patent/WO2022109220A1/en active Application Filing
- 2021-11-19 JP JP2023530750A patent/JP2023551204A/en active Pending
- 2021-11-19 CA CA3202227A patent/CA3202227A1/en active Pending
- 2021-11-19 AU AU2021381384A patent/AU2021381384A1/en active Pending
Also Published As
Publication number | Publication date |
---|---|
EP4247401A1 (en) | 2023-09-27 |
WO2022109220A1 (en) | 2022-05-27 |
CA3202227A1 (en) | 2022-05-27 |
JP2023551204A (en) | 2023-12-07 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US11464837B2 (en) | Immunogenic compositions comprising nucleic acids for RAS peptides | |
AU2022264493A1 (en) | Compositions and method for optimized peptide vaccines using residue optimization | |
US11161892B1 (en) | Method of compact peptide vaccines using residue optimization | |
AU2021381384A1 (en) | Compositions and methods for optimized peptide vaccines | |
US20210388438A1 (en) | Method for selecting neoepitopes | |
US12091443B2 (en) | Method of compact peptide vaccines using residue optimization | |
US20050277161A1 (en) | Phosphopeptide antigens associated with MHC molecules | |
US11819542B2 (en) | Immunotherapeutic method for treating lung cancer by administering a polypeptide comprising an epitope of hZP3 | |
Odales et al. | Antitumor efficacy of MUC1-derived variable epitope library treatments in a mouse model of breast cancer | |
US20230099121A1 (en) | Compositions for optimized bcr-abl peptide vaccines | |
US20240131133A1 (en) | Compositions and methods for optimized kras peptide vaccines | |
KR20210064229A (en) | Epitope targeting method and system for neoantigen-based immunotherapy | |
WO2024050451A2 (en) | Compositions and methods for optimized covid peptide vaccines | |
WO2023009264A1 (en) | Compositions for optimized influenza peptide vaccines | |
US20240269250A1 (en) | Compositions and method for optimized peptide vaccines using residue optimization | |
Plunkett et al. | Protection against MUC1 expressing mouse tumours by intra‐muscular injection of MUC1 cDNA requires functional CD8+ and CD4+ T cells but does not require the MUC1 tandem repeat domain | |
EP4311834A1 (en) | Peptides and combinations of peptides for use in immunotherapy against ropharyngeal squamous cell carcinoma (opscc) and other cancers | |
CN115397519A (en) | Antigenic polypeptides and methods of use thereof | |
EA046410B1 (en) | IMMUNOGENETIC SCREENING TEST FOR CANCER |