US20230416747A1 - Safe harbor loci - Google Patents
Safe harbor loci Download PDFInfo
- Publication number
- US20230416747A1 US20230416747A1 US18/453,801 US202318453801A US2023416747A1 US 20230416747 A1 US20230416747 A1 US 20230416747A1 US 202318453801 A US202318453801 A US 202318453801A US 2023416747 A1 US2023416747 A1 US 2023416747A1
- Authority
- US
- United States
- Prior art keywords
- cell
- seq
- cells
- sgrna
- safe harbor
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 108700019146 Transgenes Proteins 0.000 claims abstract description 85
- 238000000034 method Methods 0.000 claims abstract description 80
- 230000037431 insertion Effects 0.000 claims abstract description 41
- 238000003780 insertion Methods 0.000 claims abstract description 41
- 229920002477 rna polymer Polymers 0.000 claims abstract description 9
- 210000004027 cell Anatomy 0.000 claims description 218
- 230000010354 integration Effects 0.000 claims description 131
- 210000001744 T-lymphocyte Anatomy 0.000 claims description 126
- 108090000623 proteins and genes Proteins 0.000 claims description 112
- 206010028980 Neoplasm Diseases 0.000 claims description 61
- 108010019670 Chimeric Antigen Receptors Proteins 0.000 claims description 52
- 108091033409 CRISPR Proteins 0.000 claims description 49
- 102100029452 T cell receptor alpha chain constant Human genes 0.000 claims description 41
- 201000011510 cancer Diseases 0.000 claims description 37
- 101100495925 Schizosaccharomyces pombe (strain 972 / ATCC 24843) chr3 gene Proteins 0.000 claims description 33
- 102000016266 T-Cell Antigen Receptors Human genes 0.000 claims description 25
- 108091008874 T cell receptors Proteins 0.000 claims description 24
- 102000004169 proteins and genes Human genes 0.000 claims description 22
- 239000003814 drug Substances 0.000 claims description 17
- 230000034431 double-strand break repair via homologous recombination Effects 0.000 claims description 16
- 229940124597 therapeutic agent Drugs 0.000 claims description 16
- 102000005962 receptors Human genes 0.000 claims description 14
- 108020003175 receptors Proteins 0.000 claims description 14
- 210000000130 stem cell Anatomy 0.000 claims description 14
- 101150045565 Socs1 gene Proteins 0.000 claims description 12
- 108700027336 Suppressor of Cytokine Signaling 1 Proteins 0.000 claims description 12
- 102100024779 Suppressor of cytokine signaling 1 Human genes 0.000 claims description 12
- 101000738771 Homo sapiens Receptor-type tyrosine-protein phosphatase C Proteins 0.000 claims description 10
- 101000946860 Homo sapiens T-cell surface glycoprotein CD3 epsilon chain Proteins 0.000 claims description 10
- 101710143123 Mothers against decapentaplegic homolog 2 Proteins 0.000 claims description 10
- 102100037422 Receptor-type tyrosine-protein phosphatase C Human genes 0.000 claims description 10
- 102100035794 T-cell surface glycoprotein CD3 epsilon chain Human genes 0.000 claims description 10
- 102100037513 40S ribosomal protein S23 Human genes 0.000 claims description 9
- 102100036464 Activated RNA polymerase II transcriptional coactivator p15 Human genes 0.000 claims description 9
- 102100029457 Adenine phosphoribosyltransferase Human genes 0.000 claims description 9
- 108010024223 Adenine phosphoribosyltransferase Proteins 0.000 claims description 9
- 102100029398 Calpain small subunit 1 Human genes 0.000 claims description 9
- 102100035273 E3 ubiquitin-protein ligase CBL-B Human genes 0.000 claims description 9
- 102100032450 Endothelial differentiation-related factor 1 Human genes 0.000 claims description 9
- 101001097953 Homo sapiens 40S ribosomal protein S23 Proteins 0.000 claims description 9
- 101000713904 Homo sapiens Activated RNA polymerase II transcriptional coactivator p15 Proteins 0.000 claims description 9
- 101000919194 Homo sapiens Calpain small subunit 1 Proteins 0.000 claims description 9
- 101000737265 Homo sapiens E3 ubiquitin-protein ligase CBL-B Proteins 0.000 claims description 9
- 101001016384 Homo sapiens Endothelial differentiation-related factor 1 Proteins 0.000 claims description 9
- 101000653374 Homo sapiens Methylcytosine dioxygenase TET2 Proteins 0.000 claims description 9
- 101001135804 Homo sapiens Protein tyrosine phosphatase receptor type C-associated protein Proteins 0.000 claims description 9
- 101000694402 Homo sapiens RNA transcription, translation and transport factor protein Proteins 0.000 claims description 9
- 101000700734 Homo sapiens Serine/arginine-rich splicing factor 9 Proteins 0.000 claims description 9
- 101000650652 Homo sapiens Small EDRK-rich factor 2 Proteins 0.000 claims description 9
- 101000713305 Homo sapiens Sodium-coupled neutral amino acid transporter 1 Proteins 0.000 claims description 9
- 101000831007 Homo sapiens T-cell immunoreceptor with Ig and ITIM domains Proteins 0.000 claims description 9
- 101000738413 Homo sapiens T-cell surface glycoprotein CD3 gamma chain Proteins 0.000 claims description 9
- 101001135572 Homo sapiens Tyrosine-protein phosphatase non-receptor type 2 Proteins 0.000 claims description 9
- 101000617285 Homo sapiens Tyrosine-protein phosphatase non-receptor type 6 Proteins 0.000 claims description 9
- 102100030803 Methylcytosine dioxygenase TET2 Human genes 0.000 claims description 9
- 108010011536 PTEN Phosphohydrolase Proteins 0.000 claims description 9
- 102100032543 Phosphatidylinositol 3,4,5-trisphosphate 3-phosphatase and dual-specificity protein phosphatase PTEN Human genes 0.000 claims description 9
- 102100036937 Protein tyrosine phosphatase receptor type C-associated protein Human genes 0.000 claims description 9
- 102100027122 RNA transcription, translation and transport factor protein Human genes 0.000 claims description 9
- 102100029288 Serine/arginine-rich splicing factor 9 Human genes 0.000 claims description 9
- 102100037082 Signal recognition particle 14 kDa protein Human genes 0.000 claims description 9
- 101710089523 Signal recognition particle 14 kDa protein Proteins 0.000 claims description 9
- 102100027692 Small EDRK-rich factor 2 Human genes 0.000 claims description 9
- 102100036916 Sodium-coupled neutral amino acid transporter 1 Human genes 0.000 claims description 9
- 102100024834 T-cell immunoreceptor with Ig and ITIM domains Human genes 0.000 claims description 9
- 102100025237 T-cell surface antigen CD2 Human genes 0.000 claims description 9
- 102100037911 T-cell surface glycoprotein CD3 gamma chain Human genes 0.000 claims description 9
- 102100022012 Transcription intermediary factor 1-beta Human genes 0.000 claims description 9
- 101710177718 Transcription intermediary factor 1-beta Proteins 0.000 claims description 9
- 102100033141 Tyrosine-protein phosphatase non-receptor type 2 Human genes 0.000 claims description 9
- 102100021657 Tyrosine-protein phosphatase non-receptor type 6 Human genes 0.000 claims description 9
- 108091027881 NEAT1 Proteins 0.000 claims description 8
- 108091027544 Subgenomic mRNA Proteins 0.000 claims description 8
- 239000013611 chromosomal DNA Substances 0.000 claims description 7
- 102100027314 Beta-2-microglobulin Human genes 0.000 claims description 6
- 101000937544 Homo sapiens Beta-2-microglobulin Proteins 0.000 claims description 6
- 101000934341 Homo sapiens T-cell surface glycoprotein CD5 Proteins 0.000 claims description 6
- 229910004444 SUB1 Inorganic materials 0.000 claims description 6
- 102100025244 T-cell surface glycoprotein CD5 Human genes 0.000 claims description 6
- 108091070501 miRNA Proteins 0.000 claims description 6
- 102100021062 Ferritin light chain Human genes 0.000 claims description 5
- 101000818390 Homo sapiens Ferritin light chain Proteins 0.000 claims description 5
- 101000934346 Homo sapiens T-cell surface antigen CD2 Proteins 0.000 claims description 5
- 239000002679 microRNA Substances 0.000 claims description 5
- 108091027967 Small hairpin RNA Proteins 0.000 claims description 4
- 210000004263 induced pluripotent stem cell Anatomy 0.000 claims description 4
- 210000001778 pluripotent stem cell Anatomy 0.000 claims description 3
- PZNPLUBHRSSFHT-RRHRGVEJSA-N 1-hexadecanoyl-2-octadecanoyl-sn-glycero-3-phosphocholine Chemical compound CCCCCCCCCCCCCCCCCC(=O)O[C@@H](COP([O-])(=O)OCC[N+](C)(C)C)COC(=O)CCCCCCCCCCCCCCC PZNPLUBHRSSFHT-RRHRGVEJSA-N 0.000 claims description 2
- 101001105486 Homo sapiens Proteasome subunit alpha type-7 Proteins 0.000 claims description 2
- 102100021201 Proteasome subunit alpha type-7 Human genes 0.000 claims description 2
- 108020004459 Small interfering RNA Proteins 0.000 claims description 2
- 239000004055 small Interfering RNA Substances 0.000 claims description 2
- 101710153660 Nuclear receptor corepressor 2 Proteins 0.000 claims 2
- 102000057208 Smad2 Human genes 0.000 claims 2
- 230000010473 stable expression Effects 0.000 abstract description 7
- 230000001965 increasing effect Effects 0.000 abstract description 6
- 230000014509 gene expression Effects 0.000 description 108
- 101000634853 Homo sapiens T cell receptor alpha chain constant Proteins 0.000 description 39
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 38
- 108020005004 Guide RNA Proteins 0.000 description 34
- 102000040430 polynucleotide Human genes 0.000 description 33
- 108091033319 polynucleotide Proteins 0.000 description 33
- 239000002157 polynucleotide Substances 0.000 description 33
- 108020004414 DNA Proteins 0.000 description 32
- 201000010099 disease Diseases 0.000 description 29
- 230000000694 effects Effects 0.000 description 29
- 239000000203 mixture Substances 0.000 description 29
- 102000004127 Cytokines Human genes 0.000 description 27
- 108090000695 Cytokines Proteins 0.000 description 27
- 108091028043 Nucleic acid sequence Proteins 0.000 description 26
- 150000007523 nucleic acids Chemical class 0.000 description 26
- 125000003729 nucleotide group Chemical group 0.000 description 23
- 239000002773 nucleotide Substances 0.000 description 22
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 21
- 210000002865 immune cell Anatomy 0.000 description 21
- 238000003559 RNA-seq method Methods 0.000 description 20
- 108091026890 Coding region Proteins 0.000 description 19
- 102000003735 Mesothelin Human genes 0.000 description 19
- 108090000015 Mesothelin Proteins 0.000 description 19
- 230000006870 function Effects 0.000 description 19
- 102000039446 nucleic acids Human genes 0.000 description 19
- 108020004707 nucleic acids Proteins 0.000 description 19
- 210000000349 chromosome Anatomy 0.000 description 18
- 108090000765 processed proteins & peptides Proteins 0.000 description 18
- 231100000135 cytotoxicity Toxicity 0.000 description 17
- 238000011282 treatment Methods 0.000 description 17
- 230000003013 cytotoxicity Effects 0.000 description 16
- 238000010362 genome editing Methods 0.000 description 16
- 229920001184 polypeptide Polymers 0.000 description 16
- 102000004196 processed proteins & peptides Human genes 0.000 description 16
- 230000001225 therapeutic effect Effects 0.000 description 16
- 239000013598 vector Substances 0.000 description 16
- 238000010354 CRISPR gene editing Methods 0.000 description 15
- 108010042407 Endonucleases Proteins 0.000 description 15
- 102100031780 Endonuclease Human genes 0.000 description 14
- 239000000546 pharmaceutical excipient Substances 0.000 description 14
- 239000000427 antigen Substances 0.000 description 13
- 108091007433 antigens Proteins 0.000 description 13
- 102000036639 antigens Human genes 0.000 description 13
- 229940124531 pharmaceutical excipient Drugs 0.000 description 13
- 101100118093 Drosophila melanogaster eEF1alpha2 gene Proteins 0.000 description 12
- 102100024222 B-lymphocyte antigen CD19 Human genes 0.000 description 11
- 101000980825 Homo sapiens B-lymphocyte antigen CD19 Proteins 0.000 description 11
- 102000004389 Ribonucleoproteins Human genes 0.000 description 11
- 108010081734 Ribonucleoproteins Proteins 0.000 description 11
- 230000001105 regulatory effect Effects 0.000 description 11
- 238000004458 analytical method Methods 0.000 description 10
- 238000003556 assay Methods 0.000 description 10
- 238000004520 electroporation Methods 0.000 description 10
- 239000013612 plasmid Substances 0.000 description 10
- 230000028327 secretion Effects 0.000 description 10
- 230000008685 targeting Effects 0.000 description 10
- 238000002560 therapeutic procedure Methods 0.000 description 10
- 210000001519 tissue Anatomy 0.000 description 10
- 108010077544 Chromatin Proteins 0.000 description 9
- 210000003483 chromatin Anatomy 0.000 description 9
- 208000035475 disorder Diseases 0.000 description 9
- 238000011156 evaluation Methods 0.000 description 9
- 238000002474 experimental method Methods 0.000 description 9
- 210000005260 human cell Anatomy 0.000 description 9
- 230000003612 virological effect Effects 0.000 description 9
- 102000017420 CD3 protein, epsilon/gamma/delta subunit Human genes 0.000 description 8
- 108050005493 CD3 protein, epsilon/gamma/delta subunit Proteins 0.000 description 8
- 102100025751 Mothers against decapentaplegic homolog 2 Human genes 0.000 description 8
- 238000013459 approach Methods 0.000 description 8
- 238000001415 gene therapy Methods 0.000 description 8
- 239000008194 pharmaceutical composition Substances 0.000 description 8
- 101000914514 Homo sapiens T-cell-specific surface glycoprotein CD28 Proteins 0.000 description 7
- 241000699670 Mus sp. Species 0.000 description 7
- 101710163270 Nuclease Proteins 0.000 description 7
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 7
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 7
- 102100036011 T-cell surface glycoprotein CD4 Human genes 0.000 description 7
- 102100027213 T-cell-specific surface glycoprotein CD28 Human genes 0.000 description 7
- 230000008901 benefit Effects 0.000 description 7
- 230000002759 chromosomal effect Effects 0.000 description 7
- 238000012217 deletion Methods 0.000 description 7
- 230000037430 deletion Effects 0.000 description 7
- 238000000338 in vitro Methods 0.000 description 7
- 101710149870 C-C chemokine receptor type 5 Proteins 0.000 description 6
- 230000003044 adaptive effect Effects 0.000 description 6
- 238000002659 cell therapy Methods 0.000 description 6
- 230000001419 dependent effect Effects 0.000 description 6
- 238000000684 flow cytometry Methods 0.000 description 6
- 201000005787 hematologic cancer Diseases 0.000 description 6
- 208000024200 hematopoietic and lymphoid system neoplasm Diseases 0.000 description 6
- 210000003958 hematopoietic stem cell Anatomy 0.000 description 6
- 238000001802 infusion Methods 0.000 description 6
- 230000001404 mediated effect Effects 0.000 description 6
- 230000006780 non-homologous end joining Effects 0.000 description 6
- 230000000638 stimulation Effects 0.000 description 6
- 230000035899 viability Effects 0.000 description 6
- 102100035875 C-C chemokine receptor type 5 Human genes 0.000 description 5
- 102100025278 Coxsackievirus and adenovirus receptor Human genes 0.000 description 5
- 241000725303 Human immunodeficiency virus Species 0.000 description 5
- 102100034922 T-cell surface glycoprotein CD8 alpha chain Human genes 0.000 description 5
- 108700009124 Transcription Initiation Site Proteins 0.000 description 5
- 210000004369 blood Anatomy 0.000 description 5
- 239000008280 blood Substances 0.000 description 5
- 239000003623 enhancer Substances 0.000 description 5
- 239000012634 fragment Substances 0.000 description 5
- 230000012010 growth Effects 0.000 description 5
- 238000007481 next generation sequencing Methods 0.000 description 5
- 238000006467 substitution reaction Methods 0.000 description 5
- 239000013603 viral vector Substances 0.000 description 5
- 238000010453 CRISPR/Cas method Methods 0.000 description 4
- 102000053602 DNA Human genes 0.000 description 4
- 102100034343 Integrase Human genes 0.000 description 4
- 108010061833 Integrases Proteins 0.000 description 4
- 102000000588 Interleukin-2 Human genes 0.000 description 4
- 108010002350 Interleukin-2 Proteins 0.000 description 4
- 210000000173 T-lymphoid precursor cell Anatomy 0.000 description 4
- 102100040247 Tumor necrosis factor Human genes 0.000 description 4
- 208000027418 Wounds and injury Diseases 0.000 description 4
- 230000004913 activation Effects 0.000 description 4
- 230000033228 biological regulation Effects 0.000 description 4
- 230000003915 cell function Effects 0.000 description 4
- 230000010261 cell growth Effects 0.000 description 4
- 230000008859 change Effects 0.000 description 4
- 229940127089 cytotoxic agent Drugs 0.000 description 4
- 230000006378 damage Effects 0.000 description 4
- 230000005782 double-strand break Effects 0.000 description 4
- 210000001671 embryonic stem cell Anatomy 0.000 description 4
- 230000002068 genetic effect Effects 0.000 description 4
- 230000001976 improved effect Effects 0.000 description 4
- 208000014674 injury Diseases 0.000 description 4
- 238000004519 manufacturing process Methods 0.000 description 4
- 210000003071 memory t lymphocyte Anatomy 0.000 description 4
- 238000010369 molecular cloning Methods 0.000 description 4
- 230000009438 off-target cleavage Effects 0.000 description 4
- 230000008569 process Effects 0.000 description 4
- 230000003252 repetitive effect Effects 0.000 description 4
- 238000012163 sequencing technique Methods 0.000 description 4
- 208000024891 symptom Diseases 0.000 description 4
- 238000013519 translation Methods 0.000 description 4
- 208000024893 Acute lymphoblastic leukemia Diseases 0.000 description 3
- 238000011357 CAR T-cell therapy Methods 0.000 description 3
- 208000005623 Carcinogenesis Diseases 0.000 description 3
- 201000009030 Carcinoma Diseases 0.000 description 3
- 108091007741 Chimeric antigen receptor T cells Proteins 0.000 description 3
- 241000701022 Cytomegalovirus Species 0.000 description 3
- 241000702421 Dependoparvovirus Species 0.000 description 3
- 102100030334 Friend leukemia integration 1 transcription factor Human genes 0.000 description 3
- 101001062996 Homo sapiens Friend leukemia integration 1 transcription factor Proteins 0.000 description 3
- 101000876829 Homo sapiens Protein C-ets-1 Proteins 0.000 description 3
- 101000893493 Homo sapiens Protein flightless-1 homolog Proteins 0.000 description 3
- 206010025323 Lymphomas Diseases 0.000 description 3
- 241000124008 Mammalia Species 0.000 description 3
- 208000034578 Multiple myelomas Diseases 0.000 description 3
- 241001529936 Murinae Species 0.000 description 3
- 201000003793 Myelodysplastic syndrome Diseases 0.000 description 3
- 108091092724 Noncoding DNA Proteins 0.000 description 3
- 108700020796 Oncogene Proteins 0.000 description 3
- 102000043276 Oncogene Human genes 0.000 description 3
- 108700026244 Open Reading Frames Proteins 0.000 description 3
- 206010061902 Pancreatic neoplasm Diseases 0.000 description 3
- 102100035251 Protein C-ets-1 Human genes 0.000 description 3
- 230000006044 T cell activation Effects 0.000 description 3
- 108091028113 Trans-activating crRNA Proteins 0.000 description 3
- 108010017070 Zinc Finger Nucleases Proteins 0.000 description 3
- 230000003213 activating effect Effects 0.000 description 3
- 230000002776 aggregation Effects 0.000 description 3
- 238000004220 aggregation Methods 0.000 description 3
- 239000002246 antineoplastic agent Substances 0.000 description 3
- 230000009286 beneficial effect Effects 0.000 description 3
- 230000036952 cancer formation Effects 0.000 description 3
- 231100000504 carcinogenesis Toxicity 0.000 description 3
- 230000001413 cellular effect Effects 0.000 description 3
- 238000003776 cleavage reaction Methods 0.000 description 3
- 230000003247 decreasing effect Effects 0.000 description 3
- 238000009169 immunotherapy Methods 0.000 description 3
- 238000001727 in vivo Methods 0.000 description 3
- 230000006698 induction Effects 0.000 description 3
- 230000000977 initiatory effect Effects 0.000 description 3
- 208000032839 leukemia Diseases 0.000 description 3
- 210000000265 leukocyte Anatomy 0.000 description 3
- 210000004698 lymphocyte Anatomy 0.000 description 3
- 208000015486 malignant pancreatic neoplasm Diseases 0.000 description 3
- 238000005259 measurement Methods 0.000 description 3
- 201000001441 melanoma Diseases 0.000 description 3
- 108020004999 messenger RNA Proteins 0.000 description 3
- 210000000822 natural killer cell Anatomy 0.000 description 3
- 201000002528 pancreatic cancer Diseases 0.000 description 3
- 208000008443 pancreatic carcinoma Diseases 0.000 description 3
- 210000005259 peripheral blood Anatomy 0.000 description 3
- 239000011886 peripheral blood Substances 0.000 description 3
- 239000013641 positive control Substances 0.000 description 3
- 230000008439 repair process Effects 0.000 description 3
- 230000007017 scission Effects 0.000 description 3
- 208000000649 small cell carcinoma Diseases 0.000 description 3
- 206010041823 squamous cell carcinoma Diseases 0.000 description 3
- 238000010186 staining Methods 0.000 description 3
- 150000003431 steroids Chemical class 0.000 description 3
- 230000002459 sustained effect Effects 0.000 description 3
- 208000031261 Acute myeloid leukaemia Diseases 0.000 description 2
- 206010061424 Anal cancer Diseases 0.000 description 2
- 208000032791 BCR-ABL1 positive chronic myelogenous leukemia Diseases 0.000 description 2
- 241000894006 Bacteria Species 0.000 description 2
- 206010006187 Breast cancer Diseases 0.000 description 2
- 208000026310 Breast neoplasm Diseases 0.000 description 2
- 108010041397 CD4 Antigens Proteins 0.000 description 2
- 210000001266 CD8-positive T-lymphocyte Anatomy 0.000 description 2
- 208000005243 Chondrosarcoma Diseases 0.000 description 2
- 208000010833 Chronic myeloid leukaemia Diseases 0.000 description 2
- 108020004705 Codon Proteins 0.000 description 2
- 206010009944 Colon cancer Diseases 0.000 description 2
- 206010014733 Endometrial cancer Diseases 0.000 description 2
- 206010014759 Endometrial neoplasm Diseases 0.000 description 2
- 208000000461 Esophageal Neoplasms Diseases 0.000 description 2
- 229940123611 Genome editing Drugs 0.000 description 2
- 241000701044 Human gammaherpesvirus 4 Species 0.000 description 2
- 108060003951 Immunoglobulin Proteins 0.000 description 2
- 208000031671 Large B-Cell Diffuse Lymphoma Diseases 0.000 description 2
- 102000003960 Ligases Human genes 0.000 description 2
- 108090000364 Ligases Proteins 0.000 description 2
- 108091007460 Long intergenic noncoding RNA Proteins 0.000 description 2
- 108060001084 Luciferase Proteins 0.000 description 2
- 239000005089 Luciferase Substances 0.000 description 2
- 102000043129 MHC class I family Human genes 0.000 description 2
- 108091054437 MHC class I family Proteins 0.000 description 2
- 108700018351 Major Histocompatibility Complex Proteins 0.000 description 2
- 208000002231 Muscle Neoplasms Diseases 0.000 description 2
- 208000033761 Myelogenous Chronic BCR-ABL Positive Leukemia Diseases 0.000 description 2
- 206010030155 Oesophageal carcinoma Diseases 0.000 description 2
- 206010033128 Ovarian cancer Diseases 0.000 description 2
- 206010061535 Ovarian neoplasm Diseases 0.000 description 2
- 206010035226 Plasma cell myeloma Diseases 0.000 description 2
- 208000006664 Precursor Cell Lymphoblastic Leukemia-Lymphoma Diseases 0.000 description 2
- 208000015634 Rectal Neoplasms Diseases 0.000 description 2
- 108700008625 Reporter Genes Proteins 0.000 description 2
- 241000725643 Respiratory syncytial virus Species 0.000 description 2
- 206010039491 Sarcoma Diseases 0.000 description 2
- 208000000453 Skin Neoplasms Diseases 0.000 description 2
- 108091081024 Start codon Proteins 0.000 description 2
- 208000005718 Stomach Neoplasms Diseases 0.000 description 2
- 238000010459 TALEN Methods 0.000 description 2
- 108010043645 Transcription Activator-Like Effector Nucleases Proteins 0.000 description 2
- 208000007097 Urinary Bladder Neoplasms Diseases 0.000 description 2
- 208000002495 Uterine Neoplasms Diseases 0.000 description 2
- 230000001154 acute effect Effects 0.000 description 2
- 208000009956 adenocarcinoma Diseases 0.000 description 2
- 238000011467 adoptive cell therapy Methods 0.000 description 2
- 230000002411 adverse Effects 0.000 description 2
- 230000003321 amplification Effects 0.000 description 2
- -1 and a crRNA (e.g. Proteins 0.000 description 2
- 230000002547 anomalous effect Effects 0.000 description 2
- 239000013059 antihormonal agent Substances 0.000 description 2
- 210000004436 artificial bacterial chromosome Anatomy 0.000 description 2
- 210000001106 artificial yeast chromosome Anatomy 0.000 description 2
- 210000003719 b-lymphocyte Anatomy 0.000 description 2
- 230000001580 bacterial effect Effects 0.000 description 2
- 201000006491 bone marrow cancer Diseases 0.000 description 2
- 230000033077 cellular process Effects 0.000 description 2
- 201000010881 cervical cancer Diseases 0.000 description 2
- 239000003153 chemical reaction reagent Substances 0.000 description 2
- 239000003795 chemical substances by application Substances 0.000 description 2
- 238000002512 chemotherapy Methods 0.000 description 2
- 208000006990 cholangiocarcinoma Diseases 0.000 description 2
- HVYWMOMLDIMFJA-DPAQBDIFSA-N cholesterol Chemical group C1C=C2C[C@@H](O)CC[C@]2(C)[C@@H]2[C@@H]1[C@@H]1CC[C@H]([C@H](C)CCCC(C)C)[C@@]1(C)CC2 HVYWMOMLDIMFJA-DPAQBDIFSA-N 0.000 description 2
- 230000000295 complement effect Effects 0.000 description 2
- 150000001875 compounds Chemical class 0.000 description 2
- 238000012258 culturing Methods 0.000 description 2
- 210000001151 cytotoxic T lymphocyte Anatomy 0.000 description 2
- 230000003831 deregulation Effects 0.000 description 2
- 238000011161 development Methods 0.000 description 2
- 230000018109 developmental process Effects 0.000 description 2
- 230000004069 differentiation Effects 0.000 description 2
- 206010012818 diffuse large B-cell lymphoma Diseases 0.000 description 2
- 238000009826 distribution Methods 0.000 description 2
- 230000009977 dual effect Effects 0.000 description 2
- 239000012636 effector Substances 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 201000004101 esophageal cancer Diseases 0.000 description 2
- 239000013604 expression vector Substances 0.000 description 2
- 238000001943 fluorescence-activated cell sorting Methods 0.000 description 2
- 206010017758 gastric cancer Diseases 0.000 description 2
- 238000012239 gene modification Methods 0.000 description 2
- 230000030279 gene silencing Effects 0.000 description 2
- 238000010353 genetic engineering Methods 0.000 description 2
- 230000005017 genetic modification Effects 0.000 description 2
- 235000013617 genetically modified food Nutrition 0.000 description 2
- 239000001963 growth medium Substances 0.000 description 2
- 210000002443 helper t lymphocyte Anatomy 0.000 description 2
- 230000003463 hyperproliferative effect Effects 0.000 description 2
- 230000028993 immune response Effects 0.000 description 2
- 102000018358 immunoglobulin Human genes 0.000 description 2
- 230000001939 inductive effect Effects 0.000 description 2
- 230000002401 inhibitory effect Effects 0.000 description 2
- 230000000670 limiting effect Effects 0.000 description 2
- 210000002540 macrophage Anatomy 0.000 description 2
- 210000004962 mammalian cell Anatomy 0.000 description 2
- 239000003550 marker Substances 0.000 description 2
- 239000000463 material Substances 0.000 description 2
- 230000007246 mechanism Effects 0.000 description 2
- 239000002609 medium Substances 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 201000002077 muscle cancer Diseases 0.000 description 2
- 210000000581 natural killer T-cell Anatomy 0.000 description 2
- 108091027963 non-coding RNA Proteins 0.000 description 2
- 102000042567 non-coding RNA Human genes 0.000 description 2
- 238000003199 nucleic acid amplification method Methods 0.000 description 2
- 230000009437 off-target effect Effects 0.000 description 2
- 230000036961 partial effect Effects 0.000 description 2
- 230000037361 pathway Effects 0.000 description 2
- 210000004986 primary T-cell Anatomy 0.000 description 2
- 230000037452 priming Effects 0.000 description 2
- 239000000047 product Substances 0.000 description 2
- 206010038038 rectal cancer Diseases 0.000 description 2
- 201000001275 rectum cancer Diseases 0.000 description 2
- 108091008146 restriction endonucleases Proteins 0.000 description 2
- 230000001177 retroviral effect Effects 0.000 description 2
- 238000012552 review Methods 0.000 description 2
- 210000003705 ribosome Anatomy 0.000 description 2
- 208000020989 salivary duct carcinoma Diseases 0.000 description 2
- 201000000849 skin cancer Diseases 0.000 description 2
- 201000011549 stomach cancer Diseases 0.000 description 2
- 239000006228 supernatant Substances 0.000 description 2
- 230000020382 suppression by virus of host antigen processing and presentation of peptide antigen via MHC class I Effects 0.000 description 2
- 230000004083 survival effect Effects 0.000 description 2
- 238000013518 transcription Methods 0.000 description 2
- 230000035897 transcription Effects 0.000 description 2
- 238000012546 transfer Methods 0.000 description 2
- 210000004881 tumor cell Anatomy 0.000 description 2
- 230000004614 tumor growth Effects 0.000 description 2
- 210000003171 tumor-infiltrating lymphocyte Anatomy 0.000 description 2
- 210000003932 urinary bladder Anatomy 0.000 description 2
- 201000005112 urinary bladder cancer Diseases 0.000 description 2
- 206010046766 uterine cancer Diseases 0.000 description 2
- 230000009385 viral infection Effects 0.000 description 2
- BHQCQFFYRZLCQQ-UHFFFAOYSA-N (3alpha,5alpha,7alpha,12alpha)-3,7,12-trihydroxy-cholan-24-oic acid Natural products OC1CC2CC(O)CCC2(C)C2C1C1CCC(C(CCC(O)=O)C)C1(C)C(O)C2 BHQCQFFYRZLCQQ-UHFFFAOYSA-N 0.000 description 1
- MZOFCQQQCNRIBI-VMXHOPILSA-N (3s)-4-[[(2s)-1-[[(2s)-1-[[(1s)-1-carboxy-2-hydroxyethyl]amino]-4-methyl-1-oxopentan-2-yl]amino]-5-(diaminomethylideneamino)-1-oxopentan-2-yl]amino]-3-[[2-[[(2s)-2,6-diaminohexanoyl]amino]acetyl]amino]-4-oxobutanoic acid Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN MZOFCQQQCNRIBI-VMXHOPILSA-N 0.000 description 1
- QGVQZRDQPDLHHV-DPAQBDIFSA-N (3s,8s,9s,10r,13r,14s,17r)-10,13-dimethyl-17-[(2r)-6-methylheptan-2-yl]-2,3,4,7,8,9,11,12,14,15,16,17-dodecahydro-1h-cyclopenta[a]phenanthrene-3-thiol Chemical compound C1C=C2C[C@@H](S)CC[C@]2(C)[C@@H]2[C@@H]1[C@@H]1CC[C@H]([C@H](C)CCCC(C)C)[C@@]1(C)CC2 QGVQZRDQPDLHHV-DPAQBDIFSA-N 0.000 description 1
- 102000040650 (ribonucleotides)n+m Human genes 0.000 description 1
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 description 1
- 208000030507 AIDS Diseases 0.000 description 1
- 241001156739 Actinobacteria <phylum> Species 0.000 description 1
- 206010000830 Acute leukaemia Diseases 0.000 description 1
- 208000014697 Acute lymphocytic leukaemia Diseases 0.000 description 1
- 208000036762 Acute promyelocytic leukaemia Diseases 0.000 description 1
- 208000003200 Adenoma Diseases 0.000 description 1
- 108700028369 Alleles Proteins 0.000 description 1
- 201000003076 Angiosarcoma Diseases 0.000 description 1
- 208000007860 Anus Neoplasms Diseases 0.000 description 1
- 241001142141 Aquificae <phylum> Species 0.000 description 1
- 101100407151 Arabidopsis thaliana PBL6 gene Proteins 0.000 description 1
- 108010031480 Artificial Receptors Proteins 0.000 description 1
- 206010003571 Astrocytoma Diseases 0.000 description 1
- 208000023275 Autoimmune disease Diseases 0.000 description 1
- 206010003827 Autoimmune hepatitis Diseases 0.000 description 1
- 208000010839 B-cell chronic lymphocytic leukemia Diseases 0.000 description 1
- 208000032800 BCR-ABL1 positive blast phase chronic myelogenous leukemia Diseases 0.000 description 1
- 206010004146 Basal cell carcinoma Diseases 0.000 description 1
- 208000023328 Basedow disease Diseases 0.000 description 1
- 206010004593 Bile duct cancer Diseases 0.000 description 1
- 206010005003 Bladder cancer Diseases 0.000 description 1
- 208000004860 Blast Crisis Diseases 0.000 description 1
- 206010005949 Bone cancer Diseases 0.000 description 1
- 208000018084 Bone neoplasm Diseases 0.000 description 1
- 241000283690 Bos taurus Species 0.000 description 1
- 208000003174 Brain Neoplasms Diseases 0.000 description 1
- 101150005393 CBF1 gene Proteins 0.000 description 1
- 101150017501 CCR5 gene Proteins 0.000 description 1
- 241000282832 Camelidae Species 0.000 description 1
- 241000282472 Canis lupus familiaris Species 0.000 description 1
- 241000283707 Capra Species 0.000 description 1
- 241000282693 Cercopithecidae Species 0.000 description 1
- 206010008342 Cervix carcinoma Diseases 0.000 description 1
- 239000004380 Cholic acid Substances 0.000 description 1
- 208000004378 Choroid plexus papilloma Diseases 0.000 description 1
- 108091060290 Chromatid Proteins 0.000 description 1
- 241001112695 Clostridiales Species 0.000 description 1
- 208000001333 Colorectal Neoplasms Diseases 0.000 description 1
- 208000035473 Communicable disease Diseases 0.000 description 1
- 101100329224 Coprinopsis cinerea (strain Okayama-7 / 130 / ATCC MYA-4618 / FGSC 9003) cpf1 gene Proteins 0.000 description 1
- 241000192700 Cyanobacteria Species 0.000 description 1
- 201000005171 Cystadenoma Diseases 0.000 description 1
- 208000002699 Digestive System Neoplasms Diseases 0.000 description 1
- 206010061818 Disease progression Diseases 0.000 description 1
- 206010061825 Duodenal neoplasm Diseases 0.000 description 1
- 208000005431 Endometrioid Carcinoma Diseases 0.000 description 1
- 102000004533 Endonucleases Human genes 0.000 description 1
- 241000283086 Equidae Species 0.000 description 1
- 102100038595 Estrogen receptor Human genes 0.000 description 1
- 208000006168 Ewing Sarcoma Diseases 0.000 description 1
- 108700024394 Exon Proteins 0.000 description 1
- 241000282326 Felis catus Species 0.000 description 1
- 206010016654 Fibrosis Diseases 0.000 description 1
- 241000192125 Firmicutes Species 0.000 description 1
- 208000004057 Focal Nodular Hyperplasia Diseases 0.000 description 1
- 208000022072 Gallbladder Neoplasms Diseases 0.000 description 1
- 206010018364 Glomerulonephritis Diseases 0.000 description 1
- 206010018404 Glucagonoma Diseases 0.000 description 1
- 102000003886 Glycoproteins Human genes 0.000 description 1
- 108090000288 Glycoproteins Proteins 0.000 description 1
- 208000015023 Graves' disease Diseases 0.000 description 1
- 208000002125 Hemangioendothelioma Diseases 0.000 description 1
- 208000001258 Hemangiosarcoma Diseases 0.000 description 1
- 208000002250 Hematologic Neoplasms Diseases 0.000 description 1
- 208000035186 Hemolytic Autoimmune Anemia Diseases 0.000 description 1
- 206010019629 Hepatic adenoma Diseases 0.000 description 1
- 206010073073 Hepatobiliary cancer Diseases 0.000 description 1
- 208000017604 Hodgkin disease Diseases 0.000 description 1
- 208000010747 Hodgkins lymphoma Diseases 0.000 description 1
- 241000282412 Homo Species 0.000 description 1
- 101000804764 Homo sapiens Lymphotactin Proteins 0.000 description 1
- 241000713772 Human immunodeficiency virus 1 Species 0.000 description 1
- 241000829111 Human polyomavirus 1 Species 0.000 description 1
- 206010021245 Idiopathic thrombocytopenic purpura Diseases 0.000 description 1
- 238000012404 In vitro experiment Methods 0.000 description 1
- 102100037850 Interferon gamma Human genes 0.000 description 1
- 108010074328 Interferon-gamma Proteins 0.000 description 1
- 102000003814 Interleukin-10 Human genes 0.000 description 1
- 108090000174 Interleukin-10 Proteins 0.000 description 1
- 102000004388 Interleukin-4 Human genes 0.000 description 1
- 108090000978 Interleukin-4 Proteins 0.000 description 1
- 108091092195 Intron Proteins 0.000 description 1
- 208000003456 Juvenile Arthritis Diseases 0.000 description 1
- 206010059176 Juvenile idiopathic arthritis Diseases 0.000 description 1
- 208000007766 Kaposi sarcoma Diseases 0.000 description 1
- 208000008839 Kidney Neoplasms Diseases 0.000 description 1
- 206010023825 Laryngeal cancer Diseases 0.000 description 1
- 108091026898 Leader sequence (mRNA) Proteins 0.000 description 1
- 208000018142 Leiomyosarcoma Diseases 0.000 description 1
- 206010024218 Lentigo maligna Diseases 0.000 description 1
- 206010061523 Lip and/or oral cavity cancer Diseases 0.000 description 1
- 208000036241 Liver adenomatosis Diseases 0.000 description 1
- 206010058467 Lung neoplasm malignant Diseases 0.000 description 1
- 208000031422 Lymphocytic Chronic B-Cell Leukemia Diseases 0.000 description 1
- 102100035304 Lymphotactin Human genes 0.000 description 1
- 208000032271 Malignant tumor of penis Diseases 0.000 description 1
- 208000000172 Medulloblastoma Diseases 0.000 description 1
- 208000015021 Meningeal Neoplasms Diseases 0.000 description 1
- 108020005196 Mitochondrial DNA Proteins 0.000 description 1
- 208000003445 Mouth Neoplasms Diseases 0.000 description 1
- 206010057269 Mucoepidermoid carcinoma Diseases 0.000 description 1
- 241000699666 Mus <mouse, genus> Species 0.000 description 1
- 208000033776 Myeloid Acute Leukemia Diseases 0.000 description 1
- 208000033833 Myelomonocytic Chronic Leukemia Diseases 0.000 description 1
- 208000009525 Myocarditis Diseases 0.000 description 1
- 108091061960 Naked DNA Proteins 0.000 description 1
- 206010029260 Neuroblastoma Diseases 0.000 description 1
- 206010029488 Nodular melanoma Diseases 0.000 description 1
- 208000015914 Non-Hodgkin lymphomas Diseases 0.000 description 1
- REYJJPSVUYRZGE-UHFFFAOYSA-N Octadecylamine Chemical compound CCCCCCCCCCCCCCCCCCN REYJJPSVUYRZGE-UHFFFAOYSA-N 0.000 description 1
- 108091034117 Oligonucleotide Proteins 0.000 description 1
- 241000283973 Oryctolagus cuniculus Species 0.000 description 1
- 208000037064 Papilloma of choroid plexus Diseases 0.000 description 1
- 208000004091 Parotid Neoplasms Diseases 0.000 description 1
- 241001494479 Pecora Species 0.000 description 1
- 206010061336 Pelvic neoplasm Diseases 0.000 description 1
- 206010034277 Pemphigoid Diseases 0.000 description 1
- 241000721454 Pemphigus Species 0.000 description 1
- 208000002471 Penile Neoplasms Diseases 0.000 description 1
- 206010034299 Penile cancer Diseases 0.000 description 1
- 208000031845 Pernicious anaemia Diseases 0.000 description 1
- 208000009565 Pharyngeal Neoplasms Diseases 0.000 description 1
- 208000007913 Pituitary Neoplasms Diseases 0.000 description 1
- 208000007452 Plasmacytoma Diseases 0.000 description 1
- 239000002202 Polyethylene glycol Substances 0.000 description 1
- 208000033826 Promyelocytic Acute Leukemia Diseases 0.000 description 1
- 241000192142 Proteobacteria Species 0.000 description 1
- 108091008109 Pseudogenes Proteins 0.000 description 1
- 102000057361 Pseudogenes Human genes 0.000 description 1
- 206010051807 Pseudosarcoma Diseases 0.000 description 1
- 201000004681 Psoriasis Diseases 0.000 description 1
- 201000008183 Pulmonary blastoma Diseases 0.000 description 1
- 108091030071 RNAI Proteins 0.000 description 1
- 241000700159 Rattus Species 0.000 description 1
- 108020004511 Recombinant DNA Proteins 0.000 description 1
- 206010038389 Renal cancer Diseases 0.000 description 1
- 208000006265 Renal cell carcinoma Diseases 0.000 description 1
- 201000000582 Retinoblastoma Diseases 0.000 description 1
- 108091028664 Ribonucleotide Proteins 0.000 description 1
- 206010039710 Scleroderma Diseases 0.000 description 1
- 208000021386 Sjogren Syndrome Diseases 0.000 description 1
- 206010041067 Small cell lung cancer Diseases 0.000 description 1
- 206010054184 Small intestine carcinoma Diseases 0.000 description 1
- 208000032383 Soft tissue cancer Diseases 0.000 description 1
- 102000005157 Somatostatin Human genes 0.000 description 1
- 108010056088 Somatostatin Proteins 0.000 description 1
- 241001180364 Spirochaetes Species 0.000 description 1
- 241000193996 Streptococcus pyogenes Species 0.000 description 1
- 101000910035 Streptococcus pyogenes serotype M1 CRISPR-associated endonuclease Cas9/Csn1 Proteins 0.000 description 1
- 241000282887 Suidae Species 0.000 description 1
- 206010042553 Superficial spreading melanoma stage unspecified Diseases 0.000 description 1
- 201000009594 Systemic Scleroderma Diseases 0.000 description 1
- 206010042953 Systemic sclerosis Diseases 0.000 description 1
- 230000024932 T cell mediated immunity Effects 0.000 description 1
- 108010092262 T-Cell Antigen Receptors Proteins 0.000 description 1
- 208000000389 T-cell leukemia Diseases 0.000 description 1
- 208000028530 T-cell lymphoblastic leukemia/lymphoma Diseases 0.000 description 1
- 208000024313 Testicular Neoplasms Diseases 0.000 description 1
- 206010057644 Testis cancer Diseases 0.000 description 1
- 239000004098 Tetracycline Substances 0.000 description 1
- 210000000447 Th1 cell Anatomy 0.000 description 1
- 210000000068 Th17 cell Anatomy 0.000 description 1
- 210000004241 Th2 cell Anatomy 0.000 description 1
- 241001143310 Thermotogae <phylum> Species 0.000 description 1
- 108091036066 Three prime untranslated region Proteins 0.000 description 1
- 208000031981 Thrombocytopenic Idiopathic Purpura Diseases 0.000 description 1
- 208000024770 Thyroid neoplasm Diseases 0.000 description 1
- 206010062129 Tongue neoplasm Diseases 0.000 description 1
- 102000040945 Transcription factor Human genes 0.000 description 1
- 108091023040 Transcription factor Proteins 0.000 description 1
- 108700029229 Transcriptional Regulatory Elements Proteins 0.000 description 1
- 108060008682 Tumor Necrosis Factor Proteins 0.000 description 1
- 208000023915 Ureteral Neoplasms Diseases 0.000 description 1
- 206010046392 Ureteric cancer Diseases 0.000 description 1
- 206010046431 Urethral cancer Diseases 0.000 description 1
- 206010046458 Urethral neoplasms Diseases 0.000 description 1
- 208000006105 Uterine Cervical Neoplasms Diseases 0.000 description 1
- 201000005969 Uveal melanoma Diseases 0.000 description 1
- 206010046851 Uveitis Diseases 0.000 description 1
- 208000009311 VIPoma Diseases 0.000 description 1
- 206010047115 Vasculitis Diseases 0.000 description 1
- 208000036142 Viral infection Diseases 0.000 description 1
- 241000700605 Viruses Species 0.000 description 1
- 206010047642 Vitiligo Diseases 0.000 description 1
- 208000004354 Vulvar Neoplasms Diseases 0.000 description 1
- 208000008383 Wilms tumor Diseases 0.000 description 1
- 208000012018 Yolk sac tumor Diseases 0.000 description 1
- XVIYCJDWYLJQBG-UHFFFAOYSA-N acetic acid;adamantane Chemical compound CC(O)=O.C1C(C2)CC3CC1CC2C3 XVIYCJDWYLJQBG-UHFFFAOYSA-N 0.000 description 1
- 206010000583 acral lentiginous melanoma Diseases 0.000 description 1
- 208000009621 actinic keratosis Diseases 0.000 description 1
- 239000004480 active ingredient Substances 0.000 description 1
- 208000002517 adenoid cystic carcinoma Diseases 0.000 description 1
- 201000001256 adenosarcoma Diseases 0.000 description 1
- 201000008395 adenosquamous carcinoma Diseases 0.000 description 1
- 210000001789 adipocyte Anatomy 0.000 description 1
- 239000002671 adjuvant Substances 0.000 description 1
- 210000004504 adult stem cell Anatomy 0.000 description 1
- 125000001931 aliphatic group Chemical group 0.000 description 1
- 230000000735 allogeneic effect Effects 0.000 description 1
- 208000004631 alopecia areata Diseases 0.000 description 1
- 230000004075 alteration Effects 0.000 description 1
- 230000001668 ameliorated effect Effects 0.000 description 1
- 201000007696 anal canal cancer Diseases 0.000 description 1
- 239000004037 angiogenesis inhibitor Substances 0.000 description 1
- 210000004102 animal cell Anatomy 0.000 description 1
- 238000010171 animal model Methods 0.000 description 1
- 230000000692 anti-sense effect Effects 0.000 description 1
- 201000011165 anus cancer Diseases 0.000 description 1
- 230000006907 apoptotic process Effects 0.000 description 1
- 210000004507 artificial chromosome Anatomy 0.000 description 1
- 230000001363 autoimmune Effects 0.000 description 1
- 201000000448 autoimmune hemolytic anemia Diseases 0.000 description 1
- 201000003710 autoimmune thrombocytopenic purpura Diseases 0.000 description 1
- 208000029336 bartholin gland carcinoma Diseases 0.000 description 1
- 230000037429 base substitution Effects 0.000 description 1
- 230000006399 behavior Effects 0.000 description 1
- 210000000941 bile Anatomy 0.000 description 1
- 201000009036 biliary tract cancer Diseases 0.000 description 1
- 208000020790 biliary tract neoplasm Diseases 0.000 description 1
- 230000004071 biological effect Effects 0.000 description 1
- 210000000988 bone and bone Anatomy 0.000 description 1
- 210000001185 bone marrow Anatomy 0.000 description 1
- 210000000481 breast Anatomy 0.000 description 1
- 230000000981 bystander Effects 0.000 description 1
- 230000005907 cancer growth Effects 0.000 description 1
- 238000002619 cancer immunotherapy Methods 0.000 description 1
- 208000002458 carcinoid tumor Diseases 0.000 description 1
- 210000004413 cardiac myocyte Anatomy 0.000 description 1
- 101150059443 cas12a gene Proteins 0.000 description 1
- 230000022131 cell cycle Effects 0.000 description 1
- 230000030833 cell death Effects 0.000 description 1
- 230000024245 cell differentiation Effects 0.000 description 1
- 230000010307 cell transformation Effects 0.000 description 1
- 230000004700 cellular uptake Effects 0.000 description 1
- 210000003679 cervix uteri Anatomy 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 238000000546 chi-square test Methods 0.000 description 1
- 235000012000 cholesterol Nutrition 0.000 description 1
- BHQCQFFYRZLCQQ-OELDTZBJSA-N cholic acid Chemical compound C([C@H]1C[C@H]2O)[C@H](O)CC[C@]1(C)[C@@H]1[C@@H]2[C@@H]2CC[C@H]([C@@H](CCC(O)=O)C)[C@@]2(C)[C@@H](O)C1 BHQCQFFYRZLCQQ-OELDTZBJSA-N 0.000 description 1
- 229960002471 cholic acid Drugs 0.000 description 1
- 235000019416 cholic acid Nutrition 0.000 description 1
- 210000001612 chondrocyte Anatomy 0.000 description 1
- 208000006571 choroid plexus carcinoma Diseases 0.000 description 1
- 210000004756 chromatid Anatomy 0.000 description 1
- 208000024207 chronic leukemia Diseases 0.000 description 1
- 208000032852 chronic lymphocytic leukemia Diseases 0.000 description 1
- 201000010902 chronic myelomonocytic leukemia Diseases 0.000 description 1
- 230000007882 cirrhosis Effects 0.000 description 1
- 208000019425 cirrhosis of liver Diseases 0.000 description 1
- 208000009060 clear cell adenocarcinoma Diseases 0.000 description 1
- 210000001072 colon Anatomy 0.000 description 1
- 208000029742 colonic neoplasm Diseases 0.000 description 1
- 238000002648 combination therapy Methods 0.000 description 1
- 239000002299 complementary DNA Substances 0.000 description 1
- 230000000536 complexating effect Effects 0.000 description 1
- 238000010205 computational analysis Methods 0.000 description 1
- 201000010918 connective tissue cancer Diseases 0.000 description 1
- 230000001276 controlling effect Effects 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 230000002596 correlated effect Effects 0.000 description 1
- 208000035250 cutaneous malignant susceptibility to 1 melanoma Diseases 0.000 description 1
- 230000001461 cytolytic effect Effects 0.000 description 1
- 239000000824 cytostatic agent Substances 0.000 description 1
- 231100000433 cytotoxic Toxicity 0.000 description 1
- 239000002254 cytotoxic agent Substances 0.000 description 1
- 231100000599 cytotoxic agent Toxicity 0.000 description 1
- 230000001472 cytotoxic effect Effects 0.000 description 1
- 231100000050 cytotoxic potential Toxicity 0.000 description 1
- SPTYHKZRPFATHJ-HYZXJONISA-N dT6 Chemical compound O=C1NC(=O)C(C)=CN1[C@@H]1O[C@H](COP(O)(=O)O[C@@H]2[C@H](O[C@H](C2)N2C(NC(=O)C(C)=C2)=O)COP(O)(=O)O[C@@H]2[C@H](O[C@H](C2)N2C(NC(=O)C(C)=C2)=O)COP(O)(=O)O[C@@H]2[C@H](O[C@H](C2)N2C(NC(=O)C(C)=C2)=O)COP(O)(=O)O[C@@H]2[C@H](O[C@H](C2)N2C(NC(=O)C(C)=C2)=O)COP(O)(=O)O[C@@H]2[C@H](O[C@H](C2)N2C(NC(=O)C(C)=C2)=O)CO)[C@@H](O)C1 SPTYHKZRPFATHJ-HYZXJONISA-N 0.000 description 1
- 230000009849 deactivation Effects 0.000 description 1
- 230000007123 defense Effects 0.000 description 1
- 230000002950 deficient Effects 0.000 description 1
- 238000002716 delivery method Methods 0.000 description 1
- 210000004443 dendritic cell Anatomy 0.000 description 1
- KXGVEGMKQFWNSR-UHFFFAOYSA-N deoxycholic acid Natural products C1CC2CC(O)CCC2(C)C2C1C1CCC(C(CCC(O)=O)C)C1(C)C(O)C2 KXGVEGMKQFWNSR-UHFFFAOYSA-N 0.000 description 1
- 239000005547 deoxyribonucleotide Substances 0.000 description 1
- 125000002637 deoxyribonucleotide group Chemical group 0.000 description 1
- 201000001981 dermatomyositis Diseases 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 206010012601 diabetes mellitus Diseases 0.000 description 1
- 208000024558 digestive system cancer Diseases 0.000 description 1
- 238000007865 diluting Methods 0.000 description 1
- 238000011038 discontinuous diafiltration by volume reduction Methods 0.000 description 1
- 230000005750 disease progression Effects 0.000 description 1
- 201000000312 duodenum cancer Diseases 0.000 description 1
- 230000008030 elimination Effects 0.000 description 1
- 238000003379 elimination reaction Methods 0.000 description 1
- 230000002124 endocrine Effects 0.000 description 1
- 210000000750 endocrine system Anatomy 0.000 description 1
- 208000001991 endodermal sinus tumor Diseases 0.000 description 1
- 201000003908 endometrial adenocarcinoma Diseases 0.000 description 1
- 201000003914 endometrial carcinoma Diseases 0.000 description 1
- 201000006828 endometrial hyperplasia Diseases 0.000 description 1
- 201000000330 endometrial stromal sarcoma Diseases 0.000 description 1
- 208000028730 endometrioid adenocarcinoma Diseases 0.000 description 1
- 208000029179 endometrioid stromal sarcoma Diseases 0.000 description 1
- 210000002889 endothelial cell Anatomy 0.000 description 1
- 230000007613 environmental effect Effects 0.000 description 1
- 210000002919 epithelial cell Anatomy 0.000 description 1
- 210000003238 esophagus Anatomy 0.000 description 1
- 108010038795 estrogen receptors Proteins 0.000 description 1
- 210000003527 eukaryotic cell Anatomy 0.000 description 1
- 208000024519 eye neoplasm Diseases 0.000 description 1
- 238000005206 flow analysis Methods 0.000 description 1
- 210000004186 follicle cell Anatomy 0.000 description 1
- 238000009472 formulation Methods 0.000 description 1
- 230000002538 fungal effect Effects 0.000 description 1
- 201000010175 gallbladder cancer Diseases 0.000 description 1
- 208000010749 gastric carcinoma Diseases 0.000 description 1
- 201000011555 gastric fundus cancer Diseases 0.000 description 1
- 208000015419 gastrin-producing neuroendocrine tumor Diseases 0.000 description 1
- 201000000052 gastrinoma Diseases 0.000 description 1
- 201000010231 gastrointestinal system cancer Diseases 0.000 description 1
- 230000009368 gene silencing by RNA Effects 0.000 description 1
- 210000004392 genitalia Anatomy 0.000 description 1
- 210000004602 germ cell Anatomy 0.000 description 1
- 210000004907 gland Anatomy 0.000 description 1
- 208000005017 glioblastoma Diseases 0.000 description 1
- 125000003827 glycol group Chemical group 0.000 description 1
- 230000003394 haemopoietic effect Effects 0.000 description 1
- 210000003128 head Anatomy 0.000 description 1
- 201000010235 heart cancer Diseases 0.000 description 1
- 208000024348 heart neoplasm Diseases 0.000 description 1
- 210000003566 hemangioblast Anatomy 0.000 description 1
- 201000002222 hemangioblastoma Diseases 0.000 description 1
- 201000011066 hemangioma Diseases 0.000 description 1
- 210000000777 hematopoietic system Anatomy 0.000 description 1
- 206010073071 hepatocellular carcinoma Diseases 0.000 description 1
- 231100000844 hepatocellular carcinoma Toxicity 0.000 description 1
- 210000003494 hepatocyte Anatomy 0.000 description 1
- 238000012165 high-throughput sequencing Methods 0.000 description 1
- 239000005556 hormone Substances 0.000 description 1
- 229940088597 hormone Drugs 0.000 description 1
- 210000003917 human chromosome Anatomy 0.000 description 1
- 201000002316 ileum cancer Diseases 0.000 description 1
- 230000003832 immune regulation Effects 0.000 description 1
- 210000000987 immune system Anatomy 0.000 description 1
- 238000003364 immunohistochemistry Methods 0.000 description 1
- 230000003308 immunostimulating effect Effects 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 238000010348 incorporation Methods 0.000 description 1
- 239000000411 inducer Substances 0.000 description 1
- 208000015181 infectious disease Diseases 0.000 description 1
- 238000002347 injection Methods 0.000 description 1
- 239000007924 injection Substances 0.000 description 1
- 238000012966 insertion method Methods 0.000 description 1
- 238000002743 insertional mutagenesis Methods 0.000 description 1
- 239000012212 insulator Substances 0.000 description 1
- 206010022498 insulinoma Diseases 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 238000001361 intraarterial administration Methods 0.000 description 1
- 208000020082 intraepithelial neoplasia Diseases 0.000 description 1
- 208000014899 intrahepatic bile duct cancer Diseases 0.000 description 1
- 238000007918 intramuscular administration Methods 0.000 description 1
- 238000007912 intraperitoneal administration Methods 0.000 description 1
- 230000002601 intratumoral effect Effects 0.000 description 1
- 238000001990 intravenous administration Methods 0.000 description 1
- 201000003856 jejunal cancer Diseases 0.000 description 1
- 201000009592 jejunal neoplasm Diseases 0.000 description 1
- 201000002215 juvenile rheumatoid arthritis Diseases 0.000 description 1
- 210000003734 kidney Anatomy 0.000 description 1
- 201000010982 kidney cancer Diseases 0.000 description 1
- 238000002372 labelling Methods 0.000 description 1
- 208000003849 large cell carcinoma Diseases 0.000 description 1
- 201000011061 large intestine cancer Diseases 0.000 description 1
- 206010023841 laryngeal neoplasm Diseases 0.000 description 1
- 210000000867 larynx Anatomy 0.000 description 1
- 201000004962 larynx cancer Diseases 0.000 description 1
- 208000011080 lentigo maligna melanoma Diseases 0.000 description 1
- 150000002632 lipids Chemical group 0.000 description 1
- 210000004185 liver Anatomy 0.000 description 1
- 201000007270 liver cancer Diseases 0.000 description 1
- 208000014018 liver neoplasm Diseases 0.000 description 1
- 210000004072 lung Anatomy 0.000 description 1
- 201000005249 lung adenocarcinoma Diseases 0.000 description 1
- 201000005202 lung cancer Diseases 0.000 description 1
- 208000020816 lung neoplasm Diseases 0.000 description 1
- 201000000966 lung oat cell carcinoma Diseases 0.000 description 1
- 208000003747 lymphoid leukemia Diseases 0.000 description 1
- 229920002521 macromolecule Polymers 0.000 description 1
- 210000003809 male germ line stem cell Anatomy 0.000 description 1
- 201000005831 male reproductive organ cancer Diseases 0.000 description 1
- 208000030883 malignant astrocytoma Diseases 0.000 description 1
- 208000006178 malignant mesothelioma Diseases 0.000 description 1
- 208000022006 malignant tumor of meninges Diseases 0.000 description 1
- 208000016847 malignant urinary system neoplasm Diseases 0.000 description 1
- 210000001161 mammalian embryo Anatomy 0.000 description 1
- 238000013507 mapping Methods 0.000 description 1
- 230000035800 maturation Effects 0.000 description 1
- 201000008203 medulloepithelioma Diseases 0.000 description 1
- 210000002901 mesenchymal stem cell Anatomy 0.000 description 1
- 238000010197 meta-analysis Methods 0.000 description 1
- 239000002184 metal Substances 0.000 description 1
- 229910052751 metal Inorganic materials 0.000 description 1
- 208000011645 metastatic carcinoma Diseases 0.000 description 1
- 206010061289 metastatic neoplasm Diseases 0.000 description 1
- 230000011987 methylation Effects 0.000 description 1
- 238000007069 methylation reaction Methods 0.000 description 1
- 230000000116 mitigating effect Effects 0.000 description 1
- 210000001616 monocyte Anatomy 0.000 description 1
- 210000005087 mononuclear cell Anatomy 0.000 description 1
- 201000006417 multiple sclerosis Diseases 0.000 description 1
- 230000035772 mutation Effects 0.000 description 1
- 206010028417 myasthenia gravis Diseases 0.000 description 1
- 210000000066 myeloid cell Anatomy 0.000 description 1
- 210000003739 neck Anatomy 0.000 description 1
- 230000009826 neoplastic cell growth Effects 0.000 description 1
- 201000008026 nephroblastoma Diseases 0.000 description 1
- 210000003360 nephrocyte Anatomy 0.000 description 1
- 201000011682 nervous system cancer Diseases 0.000 description 1
- 210000001178 neural stem cell Anatomy 0.000 description 1
- 210000002569 neuron Anatomy 0.000 description 1
- 201000000032 nodular malignant melanoma Diseases 0.000 description 1
- 201000008106 ocular cancer Diseases 0.000 description 1
- 230000006548 oncogenic transformation Effects 0.000 description 1
- 238000011275 oncology therapy Methods 0.000 description 1
- 201000005443 oral cavity cancer Diseases 0.000 description 1
- 201000000890 orbital cancer Diseases 0.000 description 1
- 210000004409 osteocyte Anatomy 0.000 description 1
- 201000008968 osteosarcoma Diseases 0.000 description 1
- 210000001672 ovary Anatomy 0.000 description 1
- 125000000913 palmityl group Chemical group [H]C([*])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])[H] 0.000 description 1
- 210000000496 pancreas Anatomy 0.000 description 1
- 208000021255 pancreatic insulinoma Diseases 0.000 description 1
- 201000005163 papillary serous adenocarcinoma Diseases 0.000 description 1
- 208000024641 papillary serous cystadenocarcinoma Diseases 0.000 description 1
- 201000001219 parotid gland cancer Diseases 0.000 description 1
- ONTNXMBMXUNDBF-UHFFFAOYSA-N pentatriacontane-17,18,19-triol Chemical compound CCCCCCCCCCCCCCCCC(O)C(O)C(O)CCCCCCCCCCCCCCCC ONTNXMBMXUNDBF-UHFFFAOYSA-N 0.000 description 1
- 230000002093 peripheral effect Effects 0.000 description 1
- 230000002688 persistence Effects 0.000 description 1
- 230000000144 pharmacologic effect Effects 0.000 description 1
- 201000008006 pharynx cancer Diseases 0.000 description 1
- 150000003904 phospholipids Chemical class 0.000 description 1
- 230000001766 physiological effect Effects 0.000 description 1
- 210000002706 plastid Anatomy 0.000 description 1
- 229920000768 polyamine Polymers 0.000 description 1
- 201000006292 polyarteritis nodosa Diseases 0.000 description 1
- 229920001223 polyethylene glycol Polymers 0.000 description 1
- 238000006116 polymerization reaction Methods 0.000 description 1
- 208000005987 polymyositis Diseases 0.000 description 1
- 230000003334 potential effect Effects 0.000 description 1
- 239000002243 precursor Substances 0.000 description 1
- 238000002360 preparation method Methods 0.000 description 1
- 230000002265 prevention Effects 0.000 description 1
- 230000003449 preventive effect Effects 0.000 description 1
- 208000029340 primitive neuroectodermal tumor Diseases 0.000 description 1
- 210000001236 prokaryotic cell Anatomy 0.000 description 1
- 230000035755 proliferation Effects 0.000 description 1
- 210000002307 prostate Anatomy 0.000 description 1
- 230000002685 pulmonary effect Effects 0.000 description 1
- 238000000746 purification Methods 0.000 description 1
- 210000004203 pyloric antrum Anatomy 0.000 description 1
- 238000011002 quantification Methods 0.000 description 1
- 230000005855 radiation Effects 0.000 description 1
- 238000001959 radiotherapy Methods 0.000 description 1
- 210000000664 rectum Anatomy 0.000 description 1
- 230000000306 recurrent effect Effects 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 230000002829 reductive effect Effects 0.000 description 1
- 230000022532 regulation of transcription, DNA-dependent Effects 0.000 description 1
- 210000003289 regulatory T cell Anatomy 0.000 description 1
- 230000010076 replication Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 201000007048 respiratory system cancer Diseases 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 201000009410 rhabdomyosarcoma Diseases 0.000 description 1
- 206010039073 rheumatoid arthritis Diseases 0.000 description 1
- 239000002336 ribonucleotide Substances 0.000 description 1
- 125000002652 ribonucleotide group Chemical group 0.000 description 1
- 201000007416 salivary gland adenoid cystic carcinoma Diseases 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
- 230000003248 secreting effect Effects 0.000 description 1
- 208000004548 serous cystadenocarcinoma Diseases 0.000 description 1
- 230000035939 shock Effects 0.000 description 1
- 230000011664 signaling Effects 0.000 description 1
- 208000037968 sinus cancer Diseases 0.000 description 1
- 238000005549 size reduction Methods 0.000 description 1
- 210000003491 skin Anatomy 0.000 description 1
- 201000002314 small intestine cancer Diseases 0.000 description 1
- 201000000267 smooth muscle cancer Diseases 0.000 description 1
- 239000000243 solution Substances 0.000 description 1
- 210000001988 somatic stem cell Anatomy 0.000 description 1
- NHXLMOGPVYXJNR-ATOGVRKGSA-N somatostatin Chemical compound C([C@H]1C(=O)N[C@H](C(N[C@@H](CO)C(=O)N[C@@H](CSSC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC=2C=CC=CC=2)C(=O)N[C@@H](CC=2C=CC=CC=2)C(=O)N[C@@H](CC=2C3=CC=CC=C3NC=2)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(=O)N1)[C@@H](C)O)NC(=O)CNC(=O)[C@H](C)N)C(O)=O)=O)[C@H](O)C)C1=CC=CC=C1 NHXLMOGPVYXJNR-ATOGVRKGSA-N 0.000 description 1
- 229960000553 somatostatin Drugs 0.000 description 1
- 241000894007 species Species 0.000 description 1
- 208000014618 spinal cord cancer Diseases 0.000 description 1
- 210000000952 spleen Anatomy 0.000 description 1
- 239000003381 stabilizer Substances 0.000 description 1
- 230000000087 stabilizing effect Effects 0.000 description 1
- 210000002784 stomach Anatomy 0.000 description 1
- 210000003699 striated muscle Anatomy 0.000 description 1
- 238000007920 subcutaneous administration Methods 0.000 description 1
- 208000030457 superficial spreading melanoma Diseases 0.000 description 1
- 230000009469 supplementation Effects 0.000 description 1
- 208000011580 syndromic disease Diseases 0.000 description 1
- 201000000596 systemic lupus erythematosus Diseases 0.000 description 1
- 210000002435 tendon Anatomy 0.000 description 1
- 201000003120 testicular cancer Diseases 0.000 description 1
- 210000001550 testis Anatomy 0.000 description 1
- 229960002180 tetracycline Drugs 0.000 description 1
- 229930101283 tetracycline Natural products 0.000 description 1
- 235000019364 tetracycline Nutrition 0.000 description 1
- 150000003522 tetracyclines Chemical class 0.000 description 1
- 150000003568 thioethers Chemical class 0.000 description 1
- 210000001541 thymus gland Anatomy 0.000 description 1
- 201000002510 thyroid cancer Diseases 0.000 description 1
- 206010043778 thyroiditis Diseases 0.000 description 1
- 201000006134 tongue cancer Diseases 0.000 description 1
- 230000000699 topical effect Effects 0.000 description 1
- 231100000331 toxic Toxicity 0.000 description 1
- 230000002588 toxic effect Effects 0.000 description 1
- 230000002103 transcriptional effect Effects 0.000 description 1
- 238000001890 transfection Methods 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
- 238000002054 transplantation Methods 0.000 description 1
- 230000032258 transport Effects 0.000 description 1
- ZMANZCXQSJIPKH-UHFFFAOYSA-O triethylammonium ion Chemical compound CC[NH+](CC)CC ZMANZCXQSJIPKH-UHFFFAOYSA-O 0.000 description 1
- 125000002948 undecyl group Chemical group [H]C([*])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])[H] 0.000 description 1
- 208000010576 undifferentiated carcinoma Diseases 0.000 description 1
- 241000701161 unidentified adenovirus Species 0.000 description 1
- 238000011144 upstream manufacturing Methods 0.000 description 1
- 201000011294 ureter cancer Diseases 0.000 description 1
- 201000000360 urethra cancer Diseases 0.000 description 1
- 201000004435 urinary system cancer Diseases 0.000 description 1
- 201000009825 uterine corpus cancer Diseases 0.000 description 1
- 210000004291 uterus Anatomy 0.000 description 1
- 206010046885 vaginal cancer Diseases 0.000 description 1
- 208000013139 vaginal neoplasm Diseases 0.000 description 1
- 208000008662 verrucous carcinoma Diseases 0.000 description 1
- 201000005102 vulva cancer Diseases 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/113—Non-coding nucleic acids modulating the expression of genes, e.g. antisense oligonucleotides; Antisense DNA or RNA; Triplex- forming oligonucleotides; Catalytic nucleic acids, e.g. ribozymes; Nucleic acids used in co-suppression or gene silencing
- C12N15/1136—Non-coding nucleic acids modulating the expression of genes, e.g. antisense oligonucleotides; Antisense DNA or RNA; Triplex- forming oligonucleotides; Catalytic nucleic acids, e.g. ribozymes; Nucleic acids used in co-suppression or gene silencing against growth factors, growth regulators, cytokines, lymphokines or hormones
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K2239/00—Indexing codes associated with cellular immunotherapy of group A61K39/46
- A61K2239/31—Indexing codes associated with cellular immunotherapy of group A61K39/46 characterized by the route of administration
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K2239/00—Indexing codes associated with cellular immunotherapy of group A61K39/46
- A61K2239/38—Indexing codes associated with cellular immunotherapy of group A61K39/46 characterised by the dose, timing or administration schedule
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K2239/00—Indexing codes associated with cellular immunotherapy of group A61K39/46
- A61K2239/46—Indexing codes associated with cellular immunotherapy of group A61K39/46 characterised by the cancer treated
- A61K2239/54—Pancreas
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K35/00—Medicinal preparations containing materials or reaction products thereof with undetermined constitution
- A61K35/12—Materials from mammals; Compositions comprising non-specified tissues or cells; Compositions comprising non-embryonic stem cells; Genetically modified cells
- A61K35/14—Blood; Artificial blood
- A61K35/17—Lymphocytes; B-cells; T-cells; Natural killer cells; Interferon-activated or cytokine-activated lymphocytes
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K39/46—Cellular immunotherapy
- A61K39/461—Cellular immunotherapy characterised by the cell type used
- A61K39/4611—T-cells, e.g. tumor infiltrating lymphocytes [TIL], lymphokine-activated killer cells [LAK] or regulatory T cells [Treg]
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K39/46—Cellular immunotherapy
- A61K39/463—Cellular immunotherapy characterised by recombinant expression
- A61K39/4631—Chimeric Antigen Receptors [CAR]
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K39/46—Cellular immunotherapy
- A61K39/464—Cellular immunotherapy characterised by the antigen targeted or presented
- A61K39/4643—Vertebrate antigens
- A61K39/4644—Cancer antigens
- A61K39/464402—Receptors, cell surface antigens or cell surface determinants
- A61K39/464411—Immunoglobulin superfamily
- A61K39/464412—CD19 or B4
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K39/46—Cellular immunotherapy
- A61K39/464—Cellular immunotherapy characterised by the antigen targeted or presented
- A61K39/4643—Vertebrate antigens
- A61K39/4644—Cancer antigens
- A61K39/464466—Adhesion molecules, e.g. NRCAM, EpCAM or cadherins
- A61K39/464468—Mesothelin [MSLN]
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/111—General methods applicable to biologically active non-coding nucleic acids
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/85—Vectors or expression systems specially adapted for eukaryotic hosts for animal cells
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/87—Introduction of foreign genetic material using processes not otherwise provided for, e.g. co-transformation
- C12N15/90—Stable introduction of foreign DNA into chromosome
- C12N15/902—Stable introduction of foreign DNA into chromosome using homologous recombination
- C12N15/907—Stable introduction of foreign DNA into chromosome using homologous recombination in mammalian cells
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N5/00—Undifferentiated human, animal or plant cells, e.g. cell lines; Tissues; Cultivation or maintenance thereof; Culture media therefor
- C12N5/06—Animal cells or tissues; Human cells or tissues
- C12N5/0602—Vertebrate cells
- C12N5/0634—Cells from the blood or the immune system
- C12N5/0636—T lymphocytes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/16—Hydrolases (3) acting on ester bonds (3.1)
- C12N9/22—Ribonucleases RNAses, DNAses
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2310/00—Structure or type of the nucleic acid
- C12N2310/10—Type of nucleic acid
- C12N2310/20—Type of nucleic acid involving clustered regularly interspaced short palindromic repeats [CRISPRs]
Definitions
- Cancer continues to present a significant clinical burdSen despite the substantial research efforts and scientific advances in cancer therapies.
- Blood and bone marrow cancers are frequently diagnosed cancer types, including multiple myelomas, leukemia, and lymphomas.
- Current treatment options for these cancers are not effective for all patients and/or can have substantial adverse side effects.
- Other types of cancer also remain challenging to treat using existing therapeutic options.
- Cancer immunotherapies are a promising solution because they can be highly specific, allowing for increased therapeutic effectiveness and the mitigation of side effects.
- SHS safe harbor sites
- the present disclosure is directed, inter alia, to safe harbor loci that exhibit high knock-in efficiency and stable expression of their transgenes. These safe harbor loci can be used to alter T cells for immunotherapy. These safe harbor loci are useful for the treatment of various diseases, including cancer.
- the present disclosure provides an engineered cell, comprising at least one sequence encoding a transgene, wherein the at least one sequence is inserted within a safe harbor locus, the safe harbor locus is at any one or more of the sgRNA target loci provided in Table 4; and wherein expression of the at least one sequence encoding the transgene is operatively linked to an endogenous promoter.
- the present disclosure provides an engineered cell, comprising at least one sequence encoding a transgene, wherein the at least one sequence is inserted within a safe harbor locus, the safe harbor locus is at any one or more of the sgRNA target loci provided in Table 4; and wherein expression of the at least one sequence encoding the transgene is operatively linked to an exogenous promoter.
- the target locus is selected from: chr10:33130000-33140000, chr10:72290000-72300000, chr11:128340000-128350000, chr11:65425000-65427000 (NEAT1), chr15:92830000-92840000, chr16:11220000-11230000, chr2:87460000-87470000, chr3:186510000-186520000, chr3:59450000-59460000, chr8:127980000-128000000, and chr9:7970000-7980000.
- the target locus is selected from: chr10:72290000-72300000, chr11:128340000-128350000, chr15:92830000-92840000, and chr16:11220000-11230000. In some embodiments, the target locus is chr11:128340000-128350000. In some embodiments, the target locus is chr15:92830000-92840000.
- the target locus is a gene selected from: APRT, B2M, CAPNS1, CBLB, CD2, CD3E, CD3G, CD5, EDF1, FTL, PTEN, PTPN2, PTPN6, PTPRC, PTPRCAP, RPS23, RTRAF, SERF2, SLC38A1, SMAD2, SOCS1, SRP14, SRSF9, SUB1, TET2, TIGIT, TRAC, and TRIM28.
- the safe harbor locus is selected from any one of the integration sites in Table 4 designated: GS88, GS89, GS90, GS91, GS92, GS93, GS94, GS95, GS96, GS97, GS98, GS99, GS100, GS101, GS102, GS103, GS104, GS105, GS106, GS107, GS108, GS109, GS110, GS111, GS112, GS113, GS114, GS115, GS116, GS117, GS118, GS119, GS120.
- the safe harbor locus is selected from any one of the integration sites in Table 4 designated: GS91, GS92, GS93, GS94, GS95, GS96, GS100, GS101, GS102, GS103, GS104, and GS105. In some embodiments, the safe harbor locus is selected from any one of the integration sites in Table 4 designated: GS103, GS104, and GS105. In some embodiments, the safe harbor locus is selected from any one of the integration sites in Table 4 designated: GS94, GS95, and GS96. In some embodiments, the safe harbor locus is the GS94 integration site in Table 4.
- the safe harbor locus is selected from any one of the integration sites in Table 4 designated: GS100, GS101, and GS102. In some embodiments, the safe harbor locus is the GS102 integration site in Table 4. In some embodiments, the safe harbor locus is selected from any one of the integration sites in Table 4 designated: GS91, GS92, and GS93.
- the exogenous promoter is an EF1a promoter.
- the engineered cell is a stem cell, a human cell, a primary cell, an hematopoietic cell, an adaptive immune cell, an innate immune cell, a T cell or a T cell progenitor.
- the transgene encodes a recombinant protein, optionally a therapeutic agent.
- the transgene encodes a chimeric antigen receptor (CAR).
- the present disclosure provides a composition comprising the engineered cell as described herein and a pharmaceutical excipient.
- the present disclosure provides a guide ribonucleic acids (gRNA) for editing a cell at a safe harbor locus, wherein gRNA comprises any one of the sgRNA sequences in Table 4.
- gRNA guide ribonucleic acids
- the gRNA comprises any one of SEQ ID NOS:1-120. In some embodiments, the gRNA comprises any one of SEQ ID NOS: 91-96 and 100-105. In some embodiments, the gRNA comprises SEQ ID NO:94 or SEQ ID NO:102. In some embodiments, the gRNA comprises SEQ ID NO:94. In some embodiments, the gRNA comprises SEQ ID NO:102. In some embodiments, the cell is a stem cell, a human cell, a primary cell, an hematopoietic cell, an adaptive immune cell, an innate immune cell, a T cell or a T cell progenitor.
- the present disclosure provides a method of editing a cell having chromosomal DNA, comprising inserting at least one sequence encoding a transgene within a safe harbor locus in the chromosomal DNA of the cell, wherein the safe harbor locus is any one or more of the sgRNA target loci provided in Table 4.
- the target locus is selected from: chr10:33130000-33140000, chr10:72290000-72300000, chr11:128340000-128350000, chr11:65425000-65427000 (NEAT1), chr15:92830000-92840000, chr16:11220000-11230000, chr2:87460000-87470000, chr3:186510000-186520000, chr3:59450000-59460000, chr8:127980000-128000000, and chr9:7970000-7980000.
- the target locus is selected from: chr10:72290000-72300000, chr11:128340000-128350000, chr15:92830000-92840000, and chr16:11220000-11230000. In some embodiments, the target locus is chr11:128340000-128350000. In some embodiments, the target locus is chr15:92830000-92840000.
- the target locus is a gene selected from: APRT, B2M, CAPNS1, CBLB, CD2, CD3E, CD3G, CD5, EDF1, FTL, PTEN, PTPN2, PTPN6, PTPRC, PTPRCAP, RPS23, RTRAF, SERF2, SLC38A1, SMAD2, SOCS1, SRP14, SRSF9, SUB1, TET2, TIGIT, TRAC, and TRIM28.
- the safe harbor locus is selected from any one of the integration sites in Table 4 designated: GS88, GS89, GS90, GS91, GS92, GS93, GS94, GS95, GS96, GS97, GS98, GS99, GS100, GS101, GS102, GS103, GS104, GS105, GS106, GS107, GS108, GS109, GS110, GS111, GS112, GS113, GS114, GS115, GS116, GS117, GS118, GS119, GS120.
- the safe harbor locus is selected from any one of the integration sites in Table 4 designated: GS91, GS92, GS93, GS94, GS95, GS96, GS100, GS101, GS102, GS103, GS104, and GS105. In some embodiments, the safe harbor locus is selected from any one of the integration sites in Table 4 designated: GS103, GS104, and GS105. In some embodiments, the safe harbor locus is selected from any one of the integration sites in Table 4 designated: GS94, GS95, and GS96. In some embodiments, the safe harbor locus is the GS94 integration site in Table 4.
- the safe harbor locus is selected from any one of the integration sites in Table 4 designated: GS100, GS101, and GS102. In some embodiments, the safe harbor locus is the GS102 integration site in Table 4. In some embodiments, the safe harbor locus is selected from any one of the integration sites in Table 4 designated: GS91, GS92, and GS93.
- the transgene encodes a recombinant protein, optionally a therapeutic agent. In some embodiments, the transgene encodes a chimeric antigen receptor (CAR).
- the at least one sequence comprises an exogenous promoter and the exogenous promoter is operably linked to the transgene. In some embodiments, the exogenous promoter is an EF1a promoter.
- the cell is a stem cell, a human cell, a primary cell, an hematopoietic cell, an adaptive immune cell, an innate immune cell, a T cell or T cell progenitor. In some embodiments, the at least one sequence is inserted using a homology-directed repair.
- the at least one sequence is inserted using a homology independent targeted insertion. In some embodiments, the at least one sequence is inserted using one or more guide ribonucleic acids (gRNAs) and one or more Cas9 endonucleases.
- gRNAs guide ribonucleic acids
- the one or more gRNAs comprises any one of SEQ ID NOS: 1-120. In some embodiments, the one or more gRNAs comprises any one of SEQ ID NOS: 91-96 and 100-105. In some embodiments, the gRNA comprises SEQ ID NO:94 or SEQ ID NO:102. In some embodiments, the gRNA comprises SEQ ID NO:94. In some embodiments, the gRNA comprises SEQ ID NO:102.
- the present disclosure provides a method of editing a T cell, comprising contacting a T cell with one or more guide ribonucleic acids (gRNAs), at least one sequence encoding a transgene, and one or more Cas9 endonucleases, wherein the one or more gRNAs and Cas9 endonucleases facilitate the insertion of the at least one sequence into chromosomal DNA within a safe harbor locus, wherein the safe harbor locus is selected from any one or more of the sgRNA target loci in Table 4.
- gRNAs guide ribonucleic acids
- Cas9 endonucleases facilitate the insertion of the at least one sequence into chromosomal DNA within a safe harbor locus, wherein the safe harbor locus is selected from any one or more of the sgRNA target loci in Table 4.
- the one or more gRNAs comprises a sequence selected from any one of the sgRNA sequences in Table 4. In some embodiments, the one or more gRNAs comprises any one of SEQ ID NOS: 1-120. In some embodiments, the one or more gRNAs comprises any one of SEQ ID NOS: 91-96 and 100-105. In some embodiments, the gRNA comprises SEQ ID NO:94 or SEQ ID NO:102. In some embodiments, the gRNA comprises SEQ ID NO:94. In some embodiments, the gRNA comprises SEQ ID NO:102.
- the target locus is selected from: chr10:33130000-33140000, chr10:72290000-72300000, chr11:128340000-128350000, chr11:65425000-65427000 (NEAT1), chr15:92830000-92840000, chr16:11220000-11230000, chr2:87460000-87470000, chr3:186510000-186520000, chr3:59450000-59460000, chr8:127980000-128000000, and chr9:7970000-7980000.
- the target locus is selected from: chr10:72290000-72300000, chr11:128340000-128350000, chr15:92830000-92840000, and chr16:11220000-11230000. In some embodiments, the target locus is chr11:128340000-128350000. In some embodiments, the target locus is chr15:92830000-92840000.
- the target locus is a gene selected from: APRT, B2M, CAPNS1, CBLB, CD2, CD3E, CD3G, CD5, EDF1, FTL, PTEN, PTPN2, PTPN6, PTPRC, PTPRCAP, RPS23, RTRAF, SERF2, SLC38A1, SMAD2, SOCS1, SRP14, SRSF9, SUB1, TET2, TIGIT, TRAC, and TRIM28.
- the safe harbor locus is selected from any one of the integration sites in Table 4 designated: GS88, GS89, GS90, GS91, GS92, GS93, GS94, GS95, GS96, GS97, GS98, GS99, GS100, GS101, GS102, GS103, GS104, GS105, GS106, GS107, GS108, GS109, GS110, GS111, GS112, GS113, GS114, GS115, GS116, GS117, GS118, GS119, GS120.
- the safe harbor locus is selected from any one of the integration sites in Table 4 designated: GS91, GS92, GS93, GS94, GS95, GS96, GS100, GS101, GS102, GS103, GS104, and GS105. In some embodiments, the safe harbor locus is selected from any one of the integration sites in Table 4 designated: GS103, GS104, and GS105. In some embodiments, the safe harbor locus is selected from any one of the integration sites in Table 4 designated: GS94, GS95, and GS96. In some embodiments, the safe harbor locus is the GS94 integration site in Table 4.
- the safe harbor locus is selected from any one of the integration sites in Table 4 designated: GS100, GS101, and GS102. In some embodiments, the safe harbor locus is the GS102 integration site in Table 4. In some embodiments, the safe harbor locus is selected from any one of the integration sites in Table 4 designated: GS91, GS92, and GS93
- the present disclosure provides an ex vivo method of obtaining an engineered cell or population thereof, comprising (a) obtaining a cell; and (b) genetically modifying the cell by inserting at least one sequence encoding a transgene within a safe harbor locus, wherein the safe harbor locus is selected from any one of the sgRNA target loci in Table 4.
- obtaining the cell comprises: (i) collecting a tissue sample from a subject, (ii) isolating the cells from the tissue samples, and (iii) culturing the cells in vitro.
- the tissue sample is a blood sample.
- the cell is a stem cell, a human cell, a primary cell, an hematopoietic cell, an adaptive immune cell, an innate immune cell, a T cell or T cell precursor.
- the at least one sequence is inserted using a homology-directed repair. In some embodiments, the at least one sequence is inserted using a homology independent targeted insertion.
- the genetically modifying in step (b) comprises contacting the cell with one or more guide ribonucleic acids (gRNAs), the at least one sequence, and one or more Cas9 endonucleases, wherein the one or more gRNAs and Cas9 endonucleases facilitate the insertion of the at least one sequence into chromosomal DNA within the safe harbor locus.
- the one or more gRNAs comprises a sequence selected from any one of the sgRNA sequences in Table 4.
- the transgene encodes a recombinant protein, optionally a therapeutic agent. In some embodiments, the transgene encodes a chimeric antigen receptor (CAR). In some embodiments, the at least one sequence comprises an exogenous promoter and the exogenous promoter is operably linked to the transgene. In some embodiments, the exogenous promoter is an EF1a promoter.
- the present disclosure provides a method of treating a subject having or at risk of having a disease, comprising administering to the subject an effective amount of an engineered cell as described herein, a population thereof, or a composition as described herein.
- the cell, the population thereof, or the composition is administered to the subject by infusion.
- the present disclosure provides a method of treating a subject having or at risk of having a disease, comprising (a) conducting any one of the methods described supra; and (b) administering to the subject an effective amount of a composition comprising the cell or a population thereof.
- the composition is administered to the subject by infusion.
- the disease is cancer.
- the disease is blood cancer.
- the present disclosure provides a method of identifying a safe harbor locus, comprising: (a) identifying genes or non-coding regions in a chromosome that are above a threshold level for expression across developmental cell states and/or a threshold level for accessibility of chromatin; (b) generating a linear model that correlates the gene or non-coding region from step (a) with knock-in (KI) efficiency and estimates the KI efficiency of any gene or coding region on the chromosome; and (c) selecting the safe harbor locus based on threshold parameters; wherein the safe harbor locus is selected for insertion of at least one sequence encoding a transgene within a cell.
- KI knock-in
- the threshold parameters include one or more of: stable expression of a transgene, knockout of the gene confers benefit to the function of the cell, no known function within the cell, stable transgene expression in vitro with or without CD3/CD28 stimulation, negligible off-target cleavage as detected by iGuide-Seq or CRISPR-Seq, less off-target cleavage relative to other loci as detected by iGuide-Seq or CRISPR-Seq, negligible transgene-independent cytotoxicity, negligible transgene-independent cytokine expression, negligible transgene-independent chimeric antigen receptor expression, negligible deregulation or silencing of nearby genes, and positioned outside of a cancer-related gene.
- the stable expression of a transgene at the safe harbor locus is less than or equal to 2-fold expression change over the course of at least 1, 2, 3, 4, 5, 6, or 7 days, and wherein expression change is measured by mean fluorescence intensity of a reporter gene encoded by the at least one sequence.
- the accessibility of chromatin is measured using an assay for transposase-accessible chromatin using sequencing (ATAC-seq).
- the level of expression across developmental cell states is measured using RNA sequencing (RNA-seq).
- the cell is a stem cell, a human cell, a primary cell, an hematopoietic cell, an adaptive immune cell, an innate immune cell, a T cell or T cell precursor.
- the linear model has a coefficient of determination (R 2 value) of at least 30%.
- the present disclosure provides an engineered cell, composition, gRNA or method as described herein, wherein insertion within the safe harbor locus increase cell cytotoxicity of diseased cells.
- the present disclosure provides an engineered cell, composition, gRNA or method as described herein, wherein knock-in efficiency at the safe harbor locus is increased relative to other locations along the chromosome.
- FIG. 1 includes a schematic depicting the methodology used to identify safe harbor loci in the present disclosure, in some embodiments.
- FIGS. 2 A- 2 C include schematic depicting 3 top safe harbor loci known in the art: AAVS1 ( FIG. 2 A ), CCR5 ( FIG. 2 B ), and Rosa26 ( FIG. 2 C ).
- the figures were retrieved from Sadelain, M., et al. (2012). Safe harbours for the integration of new DNA in the human genome. Nature reviews Cancer, 12(1), 51-58, the relevant disclosures of which are herein incorporated by reference in their entirety.
- FIG. 3 includes a schematic outlining the data provided in Roth, T. L., et al. 2019. Rapid discovery of synthetic DNA sequences to rewrite endogenous T cell circuits. bioRxiv, 604561, the relevant disclosures of which are herein incorporated by reference in their entirety.
- FIGS. 4 A- 4 B include heatmaps with samples clustered based on activation status and cell type, generated with processed RNA-seq data.
- FIG. 4 A includes a heatmap of samples generated with processed RNA-seq data.
- FIG. 4 B includes a heatmap sample generated with processed RNA-seq data, showing the clustering of ⁇ 20 k genes.
- FIGS. 5 A- 5 B include plots generated using processed RNA-seq data.
- FIG. 5 A includes a plot showing direct correlation between transcript expression data from Roth, T. L., et al. 2019 and transcript expression data generated by the inventors of the present disclosure.
- FIG. 5 B includes a heatmap showing clusters of the top 10% expressed genes.
- FIG. 6 includes a heatmap of samples generated with processed ATAC-seq data.
- FIGS. 7 A- 7 B includes plots depicting the signal enrichment at transcription start sites (TSS) ( FIG. 7 A ) and peak size distribution ( FIG. 7 B ), generated with processed ATAC-seq data for coding regions.
- TSS transcription start sites
- FIG. 7 B peak size distribution
- FIG. 8 includes a plot depicting the open chromatin regions around the TRAC locus.
- FIG. 9 includes a plot depicting KI (knock-in) efficiency for coding regions/genes with GFP as a reporter.
- FIG. 10 includes a plot depicting KI efficiency for coding regions/genes with tNGFR as a reporter.
- FIG. 11 includes plots showing scaled KI efficiency for 90 genes vs d2 (day-2) RNA-seq data, d4 (day-4) RNA-seq data and ATAC-seq data, utilized for the predictive linear model.
- FIG. 12 includes a plot showing chromatin accessibility for a top candidate non-coding region, measured using ATAQ sequencing.
- FIG. 13 A includes plots showing the cell counts for GFP controls used for the evaluation of candidate KI loci.
- FIG. 13 B includes pmax (GFP_high) readings for GFP controls used for the evaluation of candidate KI loci.
- FIG. 14 A includes plots showing the maximum episomal GFP expression (GFP_high) readings for non-targeting controls.
- FIG. 14 B includes the cell count for non-targeting controls from donors 1, 2, and 3.
- FIG. 15 includes the cell count and maximum GFP expression (GFP_high) readings for WT controls.
- FIG. 16 includes plots showing the cell count and maximum GFP expression (GFP_high) readings when using sgRNA5, which targets a B2M safe harbor locus, and the construct expression was driven by an endogenous promoter.
- FIG. 17 includes plots showing the cell count and maximum GFP expression (GFP_high) readings when using sgRNA5, which targets a B2M safe harbor locus, and the construct expression was driven by an EF1a promoter.
- FIG. 18 includes plots showing the cell count and maximum GFP expression (GFP_high) readings when using sgRNA79 to target a TRAC safe harbor locus, and the construct expression was driven by an endogenous promoter.
- FIG. 19 includes plots showing the cell count and maximum GFP expression (GFP_high) readings when using sgRNA79, which targets a TRAC safe harbor locus, and the construct expression was driven by an exogenous promoter.
- FIG. 20 includes plots showing TCR (T cell receptor) vs. GFP among all donors and time points when using sgRNA79, and the construct is driven by an endogenous promoter.
- FIG. 21 includes plots showing TCR vs. GFP among all donors and time points when using sgRNA79 and the construct is driven by an exogenous promoter.
- FIG. 22 includes plots showing the cell count and maximum GFP expression (GFP_high) readings when using sgRNA83, which targets a TRAC safe harbor locus, and the construct expression was driven by an endogenous promoter.
- FIG. 23 includes plots showing the cell count and maximum GFP expression (GFP_high) readings when using sgRNA83, which targets a TRAC safe harbor locus, and the construct expression was driven by an exogenous promoter.
- FIG. 24 includes plots showing TCR vs. GFP among all donors and time points when using sgRNA83 and the construct is driven by an endogenous promoter.
- FIG. 25 includes plots showing TCR vs. GFP among all donors and time points when using sgRNA83 and the construct is driven by an exogenous promoter.
- FIG. 26 includes plots illustrating potential sources of variation (e.g., edge effects and electroporation errors) observed between replicates and donors.
- potential sources of variation e.g., edge effects and electroporation errors
- FIG. 27 includes plots illustrating potential sources of variation observed between replicates and donors, e.g., relating to inherent differences between donors.
- FIG. 28 includes plots illustrating potential sources of variation observed between replicates and donors, e.g., relating to gating errors.
- FIG. 29 includes plots showing the GFP mean fluorescence intensity (GFP MFI) and KI efficiency for top KI loci evaluated with endogenous promoters.
- FIG. 30 includes plots showing the GFP mean fluorescence intensity (GFP MFI) and KI efficiency for top KI loci evaluated with the EF1a promoter.
- FIGS. 31 A- 31 C include plots showing all the significant KI loci (top KI loci) as evaluated with endogenous and EF1a promoters.
- FIG. 31 A includes a plot showing that the expression from the EF1a promoter was approximately 10 times higher than expression from an endogenous promoter.
- FIG. 31 B shows the top KI loci ranked by GFP MFI at week 3.
- FIG. 31 C shows the top integration loci ranked by GFP MFI (at weeks 3 and 4; donors 1-3).
- FIG. 32 A includes a plot showing some target loci and their measured transgene expression levels.
- FIG. 32 B includes plots showing the transgene (Prime Receptor (PrimeR)) and TCR expression for the control and insertion at GS94, GS102 and TRAC loci.
- Primary Receptor Primary Receptor
- FIG. 33 A includes a plot showing the PrimeR levels measured for the indicated integration sites.
- FIG. 33 B includes a schematic showing the GS94 integration site on Chromosome 11.
- FIG. 34 A includes plots showing CAR induction and primeR expression of engineered T cells after 48 hours of coculturing with K562-CD19 cells.
- FIG. 34 B includes plots showing the cytotoxity and cytokine secretion levels for engineered T cells 48 hours of coculturing with K562-CD19/MSLN cells.
- FIG. 35 A includes a schematic showing the experimental overview for evaluating the effect of integration site on cytotoxicity.
- FIG. 35 B includes plots showing the measured cytotoxicity for engineered T cells cocultured for 48 hours with the K562 CD19+/MSLN+ or K562 CD19 ⁇ /MSLN+ cells.
- FIG. 36 A includes a schematic showing the experimental overview for evaluating the effect of integration site on cytokine secretion.
- FIG. 36 B includes plots showing the measured cytokine levels for engineered T cells cocultured for 48 hours with K562 CD19+/MSLN+ cells.
- FIG. 37 includes a schematic showing the in vitro experiment conducted to determin the effect of integration site on primeR-independent CAR expression.
- Flow refers to flow cytometry and “restim” refers to repetitive CD3/CD28 stimulation of the engineered T cells.
- EP refers to electroporation.
- FIGS. 38 A and 38 B include plots showing the stability of PrimeR expression over time when using the indicated integration sites.
- Flow refers to flow cytometry and “restim” refers to repetitive CD3/CD28 stimualation of the engineered T cells.
- EP refers to electroporation.
- the PrimeR expression is normalized to the expression from using the TRAC integration site.
- FIG. 39 A includes a schematic showing the iGuide-Seq assay technique.
- FIG. 39 B includes a plot showing the on-target efficiency, using iGuide Seq assay, for the indicated integration sites.
- FIG. 39 C includes schematics from the iGuide-Seq analysis showing that GS94 had no reproducible putatitve off-targets across two donors
- FIG. 39 C discloses SEQ ID NOS 312-322, 312-313, and 323-331, respectively, in order of appearance.
- FIG. 40 includes a schematic showing the iGuide-Seq workflow and data.
- FIG. 41 includes a plot showing rhAmp-segTM analysis of putative off-target sites identified by iGUIDE-seq and Elevation prediction.
- FIG. 42 includes plots showing RNA-seq analysis of cells with GS94, GS102 and TRAC knock-in of CD19/MSLN circuits. Scatterplot of gene expression in cells with integration at the GS94 locus (y-axis) vs cells with integration at either the TRAC or the GS102 locus (x-axis) in two donors. The yellow dots correspond to ETS1 and FLI1. In blue are the genes that were found to be differentially expressed using edgeR (fold-change >0, FDR-corrected p-value ⁇ 0.01, average counts-per-million across compared conditions at least 2).
- FIG. 43 includes plots showing the absence of cytokine-independent growth in cells with CD19/MSLN circuit KI at GS94.
- the present disclosure provides safe harbor loci and methods for identifying safe harbor loci that exhibit high integration efficiency (e.g., high knock-in (KI) efficiency), high and constant levels of transgene expression, and such benefits independent of T cell activation/differentiation state.
- the safe harbor loci also exhibit minimal to no disruption to T cell function and/or capacity for product manufacturing.
- these loci are useful for effective and safe integration and expression of transgenes in T cells (e.g. in CAR T therapies).
- the methods described herein can be used for the identification of safe harbor loci for insertion of transgenes in other types of cells.
- FIG. 1 illustrates the overall approach that the inventors of the present disclosure used to identify safe harbor loci.
- the present disclosure provides a method comprising the identification of genes within a genome and non-coding regions with sustained expression in a treatment cell (e.g. T cell) and using a predictive model of KI efficiency as a function of T cell chromatic state and computational analysis to predict candidate integration sites. The method further comprises evaluating the candidate integration sites for actual KI efficiency, sustained levels of transgene expression of a transgene, and minimal disruption to the treatment cell phenotype (e.g., T cell function and/or capacity for treatment product expansion and manufacturing).
- the safe harbor loci allow for integration of a transgene driven by an endogenous promoter.
- the safe harbor loci allow for integration of a transgene driven by an exogenous promoter (e.g. EF1a promoter).
- the term “about” indicates and encompasses an indicated value and a range above and below that value. In certain embodiments, the term “about” indicates the designated value ⁇ 10%, ⁇ 5%, or ⁇ 1%. In certain embodiments, where applicable, the term “about” indicates the designated value(s) ⁇ one standard deviation of that value(s).
- the term “gene” refers to the basic unit of heredity, consisting of a segment of DNA arranged along a chromosome, which codes for a specific protein or segment of protein.
- a gene typically includes a promoter, a 5′ untranslated region, one or more coding sequences (exons), optionally introns, a 3′ untranslated region.
- the gene may further comprise a terminator, enhancers and/or silencers.
- locus refers to a specific, fixed physical location on a chromosome where a gene or genetic marker is located.
- target locus refers to a locus on a chromosome within which a safe harbor locus can be used for the insertion of a sequence.
- a target locus can consist of multiple potential safe harbor loci (integration sites). Examples of target loci are provided in Table 4, as sgRNA target loci.
- the notation used for the sgRNA target loci in Table 4 refers to the genomic region of the target locus, defined by the chromosome of the target locus and the coordinate range for that target locus.
- chr10:33130000-33140000 refers to a target locus on Chr10 (chromosome 10) starting from coordinate 33130000 and ending with coordinate 33140000.
- safe harbor locus refers to a locus at which genes or genetic elements can be incorporated without disruption to expression or regulation of adjacent genes. These safe harbor loci are also referred to as safe harbor sites (SHS).
- SHS safe harbor sites
- a safe harbor locus refers to an “integration site” or “knock-in site” at which a sequence encoding a transgene, as defined herein, can be inserted. In some embodiments the insertion occurs with replacement of a sequence that is located at the integration site. In some embodiments, the insertion occurs without replacement of a sequence at the integration site. Examples of integration sites contemplated are provided in Table 4.
- the term “insert” refers to a nucleotide sequence that is integrated (inserted) at a safe harbor site.
- the insert can be used to refer to the genes or genetic elements that are incorporated at the safe harbor site using, for example, homology-directed repair (HDR) CRISPR/Cas9 genome-editing or other methods for inserting nucleotide sequences into a genomic region known to those of ordinary skill in the art.
- HDR homology-directed repair
- CRISPR/Cas refers to a widespread class of bacterial systems for defense against foreign nucleic acid.
- CRISPR/Cas systems are found in a wide range of eubacterial and archaeal organisms.
- CRISPR/Cas systems include type I, II, and III sub-types. Wild-type type II CRISPR/Cas systems utilize an RNA-mediated nuclease, Cas9 in complex with guide and activating RNA to recognize and cleave foreign nucleic acid.
- Guide RNAs having the activity of both a guide RNA and an activating RNA are also known in the art. In some cases, such dual activity guide RNAs are referred to as a small guide RNA (sgRNA).
- sgRNA small guide RNA
- Cas9 homologs are found in a wide variety of eubacteria, including, but not limited to bacteria of the following taxonomic groups: Actinobacteria, Aquificae, Bacteroidetes-Chlorobi, Chlamydiae-Verrucomicrobia, Chlroflexi, Cyanobacteria, Firmicutes, Proteobacteria, Spirochaetes, and Thermotogae.
- An exemplary Cas9 protein is the Streptococcus pyogenes Cas9 protein. Additional Cas9 proteins and homologs thereof are described in, e.g., Chylinksi, et al., RNA Biol. 2013 May 1; 10(5): 726-737; Nat.
- the Cas9 nuclease domain can be optimized for efficient activity or enhanced stability in the host cell.
- RNA-mediated nuclease e.g., of bacterial or archeal orgin, or derived therefrom.
- RNA-mediated nuclases include the foregoing Cas9 proteins and homologs thereof, and include but are not limited to, CPF1 (See, e.g., Zetsche et al., Cell, Volume 163, Issue 3, p 759-771, 22 Oct. 2015).
- Cas9 ribonucleoprotein complex and the like refers to a complex between the Cas9 protein, and a crRNA (e.g., guide RNA or small guide RNA), the Cas9 protein and a trans-activating crRNA (tracrRNA), the Cas9 protein and a small guide RNA, or a combination thereof (e.g., a complex containing the Cas9 protein, a tracrRNA, and a crRNA guide RNA).
- a crRNA e.g., guide RNA or small guide RNA
- tracrRNA trans-activating crRNA
- Cas9 protein and a small guide RNA e.g., a complex containing the Cas9 protein, a tracrRNA, and a crRNA guide RNA
- T lymphocyte and “T cell” are used interchangeably and refer to cells that have completed maturation in the thymus, and identify certain foreign antigens in the body. The terms also refer to the major leukocyte types that have various roles in the immune system, including activation and deactivation of other immune cells.
- the T cell can be any T cell such as a cultured T cell, e.g., a primary T cell, or a T cell derived from a cultured T cell line, e.g., a Jurkat, SupT1, etc., or a T cell obtained from a mammal.
- the T cell can be a CD3+ cell.
- the T cell can be any type of T cell, CD4+/CD8+ double positive T cells, CD4+ helper T cells (e.g. Th1 and Th2 cells), CD8+ T cells (e.g. cytotoxic T cells), peripheral Including but not limited to blood mononuclear cells (PBMC), peripheral blood leukocytes (PBL), tumor infiltrating lymphocytes (TIL), memory T cells, naive T cells, regulatory T cells, ⁇ T cells, etc. It can be any T cell at any stage of development. Additional types of helper T cells include Th3 (Treg) cells, Th17 cells, Th9 cells, or Tfh cells.
- helper T cells include Th3 (Treg) cells, Th17 cells, Th9 cells, or Tfh cells.
- T cells such as central memory T cells (Tcm cells), effector memory T cells (Tem cells and TEMRA cells).
- a T cell can also refer to a genetically modified T cell, such as a T cell that has been modified to express a T cell receptor (TCR) or a chimeric antigen receptor (CAR). T cells can also be differentiated from stem cells or progenitor cells.
- TCR T cell receptor
- CAR chimeric antigen receptor
- CD4+ T cells refers to a subset of T cells that express CD4 on their surface and are associated with a cellular immune response.
- CD4+ T cells are characterized by a post-stimulation secretion profile that can include secretion of cytokines such as IFN- ⁇ , TNF- ⁇ , IL-2, IL-4 and IL-10.
- CD4 is a 55 kD glycoprotein originally defined as a differentiation antigen on T lymphocytes, but was also found on other cells including monocytes/macrophages.
- the CD4 antigen is a member of the immunoglobulin superfamily and has been implicated as an associative recognition element in MHC (major histocompatibility complex) class II restricted immune responses.
- MHC major histocompatibility complex
- CD8+ T cells refers to a subset of T cells that express CD8 on their surface, are MHC class I restricted, and function as cytotoxic T cells.
- the “CD8” molecule is a differentiation antigen present on thymocytes, as well as on cytotoxic and suppressor T lymphocytes.
- the CD8 antigen is a member of the immunoglobulin superfamily and is an associative recognition element in major histocompatibility complex class I restriction interactions.
- ex vivo generally includes experiments or measurements made in or on living tissue, preferably in an artificial environment outside the organism, preferably with minimal differences from natural conditions.
- construct refers to a complex of molecules, including macromolecules or polynucleotides.
- the term “integration” refers to the process of stably inserting one or more nucleotides of a construct into the cell genome, i.e., covalently linking to a nucleic acid sequence in the chromosomal DNA of the cell. It may also refer to nucleotide deletions at a site of integration. Where there is a deletion at the insertion site, “integration” may further include substitution of the endogenous sequence or nucleotide deleted with one or more inserted nucleotides.
- the term “exogenous” refers to a molecule or activity that has been introduced into a host cell and is not native to that cell.
- the molecule can be introduced, for example, by introduction of the encoding nucleic acid into host genetic material, such as by integration into a host chromosome, or as non-chromosomal genetic material, such as a plasmid.
- the term, when used in connection with expression of an encoding nucleic acid refers to the introduction of the encoding nucleic acid into a cell in an expressible form.
- the term “endogenous” refers to a molecule or activity that is present in a host cell under natural, unedited conditions.
- the term, when used in connection with expression of the encoding nucleic acid refers to expression of the encoding nucleic acid that is contained within the cell and not introduced exogenously.
- a “polynucleotide donor construct” refers to a nucleotide sequence (e.g. DNA sequence) that is genetically inserted into a polynucleotide and is exogenous to that polynucleotide.
- the polynucleotide donor construct is transcribed into RNA and optionally translated into a polypeptide.
- the polynucleotide donor construct can include prokaryotic sequences, cDNA from eukaryotic mRNA, genomic DNA sequences from eukaryotic (e.g., mammalian) DNA, and synthetic DNA sequences.
- the polynucleotide donor construct can be a miRNA, shRNA, natural polypeptide (i.e., a naturally occurring polypeptide) or fragment thereof or a variant polypeptide (e.g. a natural polypeptide having less than 100% sequence identity with the natural polypeptide) or fragments thereof.
- natural polypeptide i.e., a naturally occurring polypeptide
- variant polypeptide e.g. a natural polypeptide having less than 100% sequence identity with the natural polypeptide
- transgene refers to a polynucleotide that has been transferred naturally, or by any of a number of genetic engineering techniques from one organism to another. It is optionally translated into a polypeptide. It is optionally translated into a recombinant protein.
- a “recombinant protein” is a protein encoded by a gene—recombinant DNA—that has been cloned in a system that supports expression of the gene and translation of messenger RNA (see expression system).
- the recombinant protein can be a therapeutic agent, e.g. a protein that treats a disease or disorder disclosed herein.
- transgene can refer to a polynucleotide that encodes a polypeptide.
- a transgene can also refer to a non-encoding sequence, such as, but not limited to shRNAs, miRNAs, and miRs.
- protein protein
- polypeptide peptide
- operably linked refers to the binding of a nucleic acid sequence to a single nucleic acid fragment such that one function is affected by the other.
- a promoter is capable of affecting the expression of a coding sequence or functional RNA (i.e., the coding sequence or functional RNA is under transcriptional control by the promoter)
- the promoter is operably linked thereto.
- Coding sequences can be operably linked to control sequences in both sense and antisense orientation.
- developmental cell states refers to, for example, states when the cell is inactive, actively expressing, differentiating, senescent, etc.
- developmental cell state may also refer to a cell in a precursor state (e.g., a T cell precursor).
- the term “encoding” refers to a sequence of nucleic acids which codes for a protein or polypeptide of interest.
- the nucleic acid sequence may be either a molecule of DNA or RNA.
- the molecule is a DNA molecule.
- the molecule is a RNA molecule.
- When present as a RNA molecule it will comprise sequences which direct the ribosomes of the host cell to start translation (e.g., a start codon, ATG) and direct the ribosomes to end translation (e.g., a stop codon). Between the start codon and stop codon is an open reading frame (ORF).
- ORF open reading frame
- inserting refers to a manipulation of a nucleotide sequence to introduce a non-native sequence. This is done, for example, via the use of restriction enzymes and ligases whereby the DNA sequence of interest, usually encoding the gene of interest, can be incorporated into another nucleic acid molecule by digesting both molecules with appropriate restriction enzymes in order to create compatible overlaps and then using a ligase to join the molecules together.
- restriction enzymes and ligases whereby the DNA sequence of interest, usually encoding the gene of interest, can be incorporated into another nucleic acid molecule by digesting both molecules with appropriate restriction enzymes in order to create compatible overlaps and then using a ligase to join the molecules together.
- ligase to join the molecules together.
- the term “subject” refers to a mammalian subject. Exemplary subjects include humans, monkeys, dogs, cats, mice, rats, cows, horses, camels, goats, rabbits, pigs and sheep. In certain embodiments, the subject is a human. In some embodiments the subject has a disease or condition that can be treated with an engineered cell provided herein or population thereof. In some aspects, the disease or condition is a cancer.
- promoter refers to a nucleotide sequence (e.g. DNA sequence) capable of controlling the expression of a coding sequence or functional RNA.
- the promoter sequence consists of proximal and more distal upstream elements, the latter elements often referred to as enhancers.
- a promoter can be derived from natural genes in its entirety, can be composed of different elements from different promoters found in nature, and/or may comprise synthetic DNA segments.
- a promoter, as contemplated herein, can be endogenous to the cell of interest or exogenous to the cell of interest. It is appreciated by those skilled in the art that different promoters can induce gene expression in different tissue or cell types, or at different developmental stages, or in response to different environmental conditions.
- a promoter can be selected according to the strength of the promoter and/or the conditions under which the promoter is active, e.g., constitutive promoter, strong promoter, weak promoter, inducible/repressible promoter, tissue specific Or developmentally regulated promoters, cell cycle-dependent promoters, and the like.
- a promoter can be an inducible promoter (e.g., a heat shock promoter, tetracycline-regulated promoter, steroid-regulated promoter, metal-regulated promoter, estrogen receptor-regulated promoter, etc.).
- the promoter can be a constitutive promoter (e.g., CMV promoter, UBC promoter).
- the promoter can be a spatially restricted and/or temporally restricted promoter (e.g., a tissue specific promoter, a cell type specific promoter, etc.). See for example U.S. application Ser. No. 15/715,068, the disclosures of which are herein incorporated by reference in their entirety.
- Gene editing may involve a gene (or nucleotide sequence) knock-in or knock-out.
- knock-in refers to an addition of a DNA sequence, or fragment thereof into a genome.
- DNA sequences to be knocked-in may include an entire gene or genes, may include regulatory sequences associated with a gene or any portion or fragment of the foregoing.
- a polynucleotide donor construct encoding a recombinant protein may be inserted into the genome of a cell carrying a mutant gene.
- a knock-in strategy involves substitution of an existing sequence with the provided sequence, e.g., substitution of a mutant allele with a wild-type copy.
- knock-out refers to the elimination of a gene or the expression of a gene.
- a gene can be knocked out by either a deletion or an addition of a nucleotide sequence that leads to a disruption of the reading frame.
- a gene may be knocked out by replacing a part of the gene with an irrelevant (e.g., non-coding) sequence.
- non-homologous end joining refers to a cellular process in which cut or nicked ends of a DNA strand are directly ligated without the need for a homologous template nucleic acid. NHEJ can lead to the addition, the deletion, substitution, or a combination thereof, of one or more nucleotides at the repair site.
- the term “homology directed repair” or HDR refers to a cellular process in which cut or nicked ends of a DNA strand are repaired by polymerization from a homologous template nucleic acid.
- the homologous template nucleic acid can be provided by homologous sequences elsewhere in the genome (sister chromatids, homologous chromosomes, or repeated regions on the same or different chromosomes).
- an exogenous template nucleic acid can be introduced to obtain a specific HDR-induced change of the sequence at the target site. In this way, specific mutations can be introduced at the cut site.
- Vectors can be linear or circular. Vectors can integrate into a target genome of a host cell or replicate independently in a host cell. Vectors can comprise, for example, an origin of replication, a multicloning site, and/or a selectable marker.
- An expression vector typically comprises an expression cassette.
- Vectors and plasmids include, but are not limited to, integrating vectors, prokaryotic plasmids, eukaryotic plasmids, plant synthetic chromosomes, episomes, viral vectors, cosmids, and artificial chromosomes.
- expression cassette is a polynucleotide construct, generated recombinantly or synthetically, comprising regulatory sequences operably linked to a selected polynucleotide to facilitate expression of the selected polynucleotide in a host cell.
- the regulatory sequences can facilitate transcription of the selected polynucleotide in a host cell, or transcription and translation of the selected polynucleotide in a host cell.
- An expression cassette can, for example, be integrated in the genome of a host cell or be present in an expression vector.
- the phrase “subject in need thereof” refers to a subject that exhibits and/or is diagnosed with one or more symptoms or signs of a disease or disorder as described herein.
- chemotherapeutic agent refers to a chemical compound useful in the treatment of cancer.
- Chemotherapeutic agents include “anti-hormonal agents” or “endocrine therapeutics” which act to regulate, reduce, block, or inhibit the effects of hormones that can promote the growth of cancer.
- composition refers to a mixture that contains, e.g., an engineered cell or protein contemplated herein.
- the composition may contain additional components, such as adjuvants, stabilizers, excipients, and the like.
- composition or “pharmaceutical composition” refers to a preparation which is in such form as to permit the biological activity of an active ingredient contained therein to be effective in treating a subject, and which contains no additional components which are unacceptably toxic to the subject in the amounts provided in the pharmaceutical composition.
- the term “effective amount” refers to the amount of a compound (e.g., a compositions described herein, cells described herein) sufficient to effect beneficial or desired results.
- An effective amount can be administered in one or more administrations, applications or dosages and is not intended to be limited to a particular formulation or administration route.
- the term “treating” includes any effect, e.g., lessening, reducing, modulating, ameliorating or eliminating, that results in the improvement of the condition, disease, disorder, and the like, or ameliorating a symptom thereof.
- modulate and “modulation” refer to reducing or inhibiting or, alternatively, activating or increasing, a recited variable.
- increase and activate refer to an increase of 10%, 20%, 30%, 40%, 50%, 60%, 70%, 75%, 80%, 85%, 90%, 95%, 100%, 2-fold, 3-fold, 4-fold, 5-fold, 10-fold, 50-fold, 100-fold, or greater in a recited variable.
- reduce and “inhibit” refer to a decrease of 10%, 20%, 30%, 40%, 50%, 60%, 70%, 75%, 80%, 85%, 90%, 95%, 2-fold, 3-fold, 4-fold, 5-fold, 10-fold, 20-fold, 100-fold, or greater in a recited variable.
- Gene editing therapies include, for example, viral vector integration and site specific integration.
- Site-specific integration is a promising alternative to random integration of viral vectors, as it mitigates the risks of insertional mutagenesis or insertional oncogenesis (Kolb et al. Trends Biotechnol. 2005 23:399-406; Porteus et al. Nat Biotechnol. 2005 23:967-973; Paques et al. Curr Gen Ther. 2007 7:49-66).
- site specific integration continues to face challenges such as poor knock-in efficiency, risk of insertional oncogenesis, unstable and/or anomalous expression of adjacent genes or the transgene, low accessibility (e.g. within kB of adjacent genes), etc.
- SHS safe harbor loci or safe harbor sites
- the most widely used of the putative human safe harbor sites is the AAVS1 site on chromosome 19q, which was initially identified as a site for recurrent adenoassociated virus insertion.
- Other potential SHS have been identified on the basis of homology, with sites first identified in other species (e.g., the human homolog of the permissive murine Rosa26 locus) or among the growing number of human genes that appear non-essential under some circumstances.
- One putative SHS of this type is the CCR5 chemokine receptor gene, which, when disrupted, confers resistance to human immunodeficiency virus infection.
- Additional potential genomic SHS have been identified in human and other cell types on the basis of viral integration site mapping or gene-trap analyses, as was the original murine Rosa26 locus.
- the AAVS1 (also known as the PPP 1R12C locus) on human chromosome 19 is a known SHS for hosting transgenes (e.g. DNA transgenes) with expected function. It is at position 19q13.42. It has an open chromatin structure and is transcription-competent.
- the canonical SHS locus for AAVS1 is chr19: 55,625,241-55,629,351. See Pellenz et al. “New Human Chromosomal Sites with “Safe Harbor” Potential for Targeted Transgene Insertion.” Human gene therapy vol. 30,7 (2019): 814-828, the relevant disclosures of which are herein incorporated by reference.
- An exemplary AAVS1 target gRNA and target sequence are provided below:
- AAVS1-gRNA sequence (SEQ ID NO: 121) ggggccactagggacaggatGTTTTAGAGCTAGAA ATAGCAAGTTAAAATAAGGCTAGTCCGTTATCAAC TTGAAAAAGTGGCACCGAGTCGGTGCTTTTTTTTT AAVS1 target sequence: (SEQ ID NO: 122) ggggccactagggacaggat
- CCR5 which is located on chromosome 3 at position 3p21.31, encodes the major co-receptor for HIV-1. Disruption at this site in the CCR5 gene has been beneficial in HIV/AIDS therapy and prompted the development of zinc-finger nucleases that target its third exon.
- the canonical SHS locus for CCR5 is chr3: 46,414,443-46,414,942. See Pellenz et al. “New Human Chromosomal Sites with “Safe Harbor” Potential for Targeted Transgene Insertion.” Human gene therapy vol. 30,7 (2019): 814-828, the relevant disclosures of which are herein incorporated by reference.
- the mouse Rosa26 locus is particularly useful for genetic modification as it can be targeted with high efficiency and is expressed in most cell types tested.
- Irion et al. 2007 (“Identification and targeting of the ROSA26 locus in human embryonic stem cells.” Nature biotechnology 25.12 (2007): 1477-1482, the relevant disclosure of which are herein incorporated by reference) identified the human homolog, human ROSA26, in chromosome 3 (position 3p25.3).
- the canonical SHS locus for human Rosa26 (hRosa26) is chr3: 9,415,082-9,414,043. See Pellenz et al. “New Human Chromosomal Sites with “Safe Harbor” Potential for Targeted Transgene Insertion.” Human gene therapy vol. 30,7 (2019): 814-828, the relevant disclosures of which are herein incorporated by reference.
- the present disclosure is directed to methods for identifying safe harbor loci with benefits including, but not limited to, high knock-in efficiency and high expression of transgene.
- An example of applications of the presently disclosed methods is the identification of safe harbor loci for insertion of transgenes (e.g., chimeric antigen receptors (CAR)) into T-cells.
- the safe harbor loci of the present disclosure are useful for the insertion of a sequence encoding a transgene.
- the safe harbor sites allow for high transgene expression (sufficient to allow for transgene functionality or treatment of a disease of interest) and stable expression of the transgene over several days, weeks or months.
- knockout of the gene at the safe harbor locus confers benefit to the function of the cell, or the gene at the safe harbor locus has no known function within the cell.
- the safe harbor locus results in stable transgene expression in vitro with or without CD3/CD28 stimulation, negligible off-target cleavage as detected by iGuide-Seq or CRISPR-Seq, less off-target cleavage relative to other loci as detected by iGuide-Seq or CRISPR-Seq, negligible transgene-independent cytotoxicity, negligible transgene-independent cytokine expression, negligible transgene-independent chimeric antigen receptor expression, negligible deregulation or silencing of nearby genes, and positioned outside of a cancer-related gene.
- a “nearby gene” can refer to a gene that is within about 100 kB, about 125 kB, about 150 kB, about 175 kB, about 200 kB, about 225 kB, about 250 kB, about 275 kB, about 300 kB, about 325 kB, about 350 kB, about 375 kB, about 400 kB, about 425 kB, about 450 kB, about 475 kB, about 500 kB, about 525 kB, about 550 kB away from the safe harbor locus (integration site).
- the present disclosure contemplates inserts that comprise one or more transgenes.
- the transgene can encode a therapeutic protein, an antibody, a peptide, a suicide gene, an apoptosis gene or any other gene of interest.
- the safe harbor loci identified using the method described herein allow for transgene integration that results in, for example, enhanced therapeutic properties.
- These enhanced therapeutic properties refer to an enhanced therapeutic property of a cell when compared to a typical immune cell of the same normal cell type.
- an NK cell having “enhanced therapeutic properties” has an enhanced, improved, and/or increased treatment outcome when compared to a typical, unmodified and/or naturally occurring NK cell.
- the therapeutic properties of immune cells can include, but are not limited to, cell transplantation, transport, homing, viability, self-renewal, persistence, immune response control and regulation, survival, and cytotoxicity.
- the therapeutic properties of immune cells are also manifested by: antigen-targeted receptor expression; HLA presentation or lack thereof; tolerance to the intratumoral microenvironment; induction of bystander immune cells and immune regulation; improved target specificity with reduction; resistance to treatments such as chemotherapy.
- the term “insert size” refers to the length of the nucleotide sequence being integrated (inserted) at the safe harbor site.
- the insert size comprises at least about 100, 200, 300, 400 or 500 basepairs.
- the insert size comprises about 500 nucleotides or basepairs.
- the insert size comprises up to 0.5, 0.6, 0.7, 0.8, 0.9, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20 kbp (kilo basepairs) or the sizes in between.
- the insert size is greater than 0.5, 0.6, 0.7, 0.8, 0.9, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20 kbp or the sizes in between. In some embodiments, the insert size is within the range of 3-15 kbp or is any number in that range. In some embodiments, the insert size is within the range of 1.5-8.3 kbp or is any number in that range. In some embodiments, the insert size is within the range of 1.5-15 kbp or is any number in that range. In some embodiments, the insert size is within the range of 0.5-20 kbp or is any number in that range.
- the insert size is 0.5-10, 0.6-10, 0.7-10, 0.8-10, 0.9-10, 1-10, 2-10, 3-10, 4-10, 5-10, 6-10, 7-10, 8-10, 9-10 kbp. In some embodiments, the insert size is 0.5-11, 0.6-11, 0.7-11, 0.8-11, 0.9-11, 1-11, 2-11, 3-11, 4-11, 5-11, 6-11, 7-11, 8-11, 9-11, or 10-11 kbp. In some embodiments, the insert size is 0.5-12, 0.6-12, 0.7-12, 0.8-12, 0.9-12, 1-12, 2-12, 3-12, 4-12, 5-12, 6-12, 7-12, 8-12, 9-12, 10-12, or 11-12 kbp.
- the insert size is 0.5-13, 0.6-13, 0.7-13, 0.8-13, 0.9-13, 1-13, 2-13, 3-13, 4-13, 5-13, 6-13, 7-13, 8-13, 9-13, 10-13, 11-13, or 12-13 kbp. In some embodiments, the insert size is 0.5-14, 0.6-14, 0.7-14, 0.8-14, 0.9-14, 1-14, 2-14, 3-14, 4-14, 5-14, 6-14, 7-14, 8-14, 9-14, 10-14, 11-14, 12-14 or 13-14 kbp.
- the insert size is 0.5-15, 0.6-15, 0.7-15, 0.8-15, 0.9-15, 2-15, 3-15, 4-15, 5-15, 6-15, 7-15, 8-15, 9-15, 10-15, 11-15, 12-15, 13-15, or 14-15 kbp. In some embodiments, the insert size is 0.5-16, 0.6-16, 0.7-16, 0.8-16, 0.9-16, 1-16, 2-16, 3-16, 4-16, 5-16, 6-16, 7-16, 8-16, 9-16, 10-16, 11-16, 12-16, 13-16, 14-16 or 15-16 kbp.
- the insert size is 0.5-17, 0.6-17, 0.7-17, 0.8-17, 0.9-17, 1-17, 2-17, 3-17, 4-17, 5-17, 6-17, 7-17, 8-17, 9-17, 10-17, 11-17, 12-17, 13-17, or 14-17, 15-17 or 16-17 kbp. In some embodiments, the insert size is 0.5-18, 0.6-18, 0.7-18, 0.8-18, 0.9-18, 1-18, 2-18, 3-18, 4-18, 5-18, 6-18, 7-18, 8-18, 9-18, 10-18, 11-18, 12-18, 13-18, 14-18, 15-18, 16-18 or 17-18 kbp.
- the insert size is 0.5-19, 0.6-19, 0.7-19, 0.8-19, 0.9-19, 1-19, 2-19, 3-19, 4-19, 5-19, 6-19, 7-19, 8-19, 9-19, 10-19, 11-19, 12-19, 13-19, 14-0.7-20, 19, 15-19, 16-19, 17-19, or 18-19 kbp.
- the insert size is 0.5-20, 0.6-20, 0.7-20, 0.8-20, 0.9-20, 1-20, 2-20, 3-20, 4-20, 5-20, 6-20, 7-20, 8-20, 9-20, 10-20, 11-20, 12-20, 13-20, 14-20, 15-20, 16-20, 17-20, 18-20, or 19-20 kbp.
- the inserts of the present disclosure refer to nucleic acid molecules or polynucleotide inserted at a safe harbor site.
- the nucleotide sequence is a DNA molecule, e.g., genomic DNA, or comprises deoxy-ribonucleotides.
- the insert comprises a smaller fragment of DNA, such as a plastid DNA, mitochondrial DNA, or DNA isolated in the form of a plasmid, a fosmid, a cosmid, a bacterial artificial chromosome (BAC), a yeast artificial chromosome (YAC), and/or any other sub-genome segment of DNA.
- the insert is an RNA molecule or comprises ribonucleotides.
- nucleotides in the insert are contemplated as naturally occurring nucleotides, non-naturally occurring, and modified nucleotides.
- Nucleotides may be modified chemically or biochemically, or may contain non-natural or derivatized nucleotide bases, as will be readily appreciated by those of skill in the art. Such modifications include, for example, labels, methylation, substitution of one or more of the naturally occurring nucleotides with an analog, internucleotide modifications.
- the polynucleotides can be in any topological conformation, including single-stranded, double-stranded, partially duplexed, triplexed, hairpinned, circular conformations, and other three-dimension conformations contemplated in the art.
- the inserts can have coding and/or non-coding regions.
- the insert can comprises a non-coding sequence (e.g., control elements, e.g., a promoter sequence).
- the insert encodes transcription factors.
- the insert encodes an antigen binding receptors such as single receptors, T-cell receptors (TCRs), syn-notch, CARs, mAbs, etc.
- the inserts are RNAi molecules, including, but not limited to, miRNAs, siRNA, shRNAs, etc.
- the insert is a human sequence.
- the insert is chimeric.
- the insert is a multi-gene/multi-module therapeutic cassette.
- a multi-gene/multi-module therapeutic cassette referst to an insert or cassette having one or more than one receptor (e.g., synthetic receptors), other exogenous protein coding sequences, non-coding RNAs, transcriptional regulatory elements, and/or insulator sequences, etc.
- receptor e.g., synthetic receptors
- other exogenous protein coding sequences e.g., non-coding RNAs, transcriptional regulatory elements, and/or insulator sequences, etc.
- a cell comprising a safe harbor site and/or a cell comprising an insert at a safe harbor site as described in the present disclosure can be referred to as an engineered cell.
- the cells can include, but are not limited to, eukaryotic cells, prokaryotic cells, animal cells, plant cells, fungal cells and the like.
- the cell is a mammalian cell, for example, a human cell.
- that engineered cell is a stem cell, a human cell, a primary cell, an hematopoietic cell, an adaptive immune cell, an innate immune cell, a T cell or a T cell progenitor.
- Non-limiting examples of immune cells that are contemplated in the present disclosure include T cell, B cell, natural killer (NK) cell, NKT/iNKT cell, macrophage, myeloid cell, and dendritic cells.
- Non-limiting examples of stem cells that are contemplated in the present disclosure include pluripotent stem cells (PSCs), embryonic stem cells (ESCs), induced pluripotent stem cells (iPSCs), embryo-derived embryonic stem cells obtained by nuclear transfer (ntES; nuclear transfer ES), male germline stem cells (GS cells), embryonic germ cells (EG cells), hematopoietic stem/progenitor stem cells (HSPCs), somatic stem cells (adult stem cells), hemangioblasts, neural stem cells, mesenchymal stem cells and stem cells of other cells (including osteocyte, chondrocyte, myocyte, cardiac myocyte, neuron, tendon cell, adipocyte, pancreocyte, hepatocyte, nephrocyte and follicle cells and so on
- the engineered cells is a T cell, NK cells, iPSC, and HSPC.
- the engineered cells used in the present disclosure are human cell lines grown in vitro (e.g. deliberately immortalized cell lines, cancer cell lines, etc.).
- the methods for integrating the inserts at the safe harbor sites can be viral or non-viral delivery techniques.
- the nucleic acid sequence is inserted into the genome of the engineered cell by introducing a vector, for example, a viral vector, comprising the nucleic acid.
- a viral vector for example, a viral vector
- examples of viral vectors include, but are not limited to, adeno-associated viral (AAV) vectors, retroviral vectors or lentiviral vectors.
- the lentiviral vector is an integrase-deficient lentiviral vector.
- the nucleic acid sequence is inserted into the genome of the T cell via non-viral delivery.
- the nucleic acid can be naked DNA, or in a non-viral plasmid or vector.
- Non-viral delivery techniques can be site-specific integration techniques, as described herein or known to those of ordinary skill in the art. Examples of site-specific techniques for integration into the safe harbor loci include, without limitation, homology-dependent engineering using nucleases and homology independent targeted insertion using Cas9.
- the insert is integrated at a safe harbor site by introducing into the engineered cell, (a) a targeted nuclease that cleaves a target region in the safe harbor site to create the insertion site; and (b) the nucleic acid sequence (insert), wherein the insert is incorporated at the insertion site by, e.g., HDR.
- a targeted nuclease that cleaves a target region in the safe harbor site to create the insertion site
- the nucleic acid sequence (insert) wherein the insert is incorporated at the insertion site by, e.g., HDR.
- Chimeric antigen receptor (CAR) T cells are T cells that have been genetically engineered to produce an artificial T-cell receptor for use in immunotherapy. Chimeric antigen receptors are receptor proteins that have been engineered to confer T cells with the ability to target a specific protein.
- the genetic modification of lymphocytes (e.g. T cells) by incorporation of, for example, CARs, and administration of the engineered cells to a subject is an example of “adoptive cell therapy”.
- the term “adoptive cell therapy” refers to cell-based immunotherapy for transfusion of autologous or allogeneic lymphocytes, referred to as T cells or B cells. In this CAR therapy approach, cells are expanded and cultured ex vivo and genetically modified, prior to transfusion.
- T-cells are harvested from a subject—they can be autologous T-cells from the subject own blood or from a donor that will not be receiving the CAR therapy. Once isolated, the T-cells are genetically modified with a CAR, expanded ex vivo, and administered to the subject (i.e. patient) by, e.g. infusion.
- the CARs may be introduced into the T-cells using, for example, a viral technique (e.g., retroviral integration) or site-specific technique.
- a viral technique e.g., retroviral integration
- site-specific technique e.g. CARs
- the transgenes may be targeted to a safe harbor locus.
- site-specific techniques for integration into the safe harbor loci include, without limitation, homology-dependent engineering using nucleases and homology independent targeted insertion using Cas9.
- the engineered CAR T cells have applications to immune-oncology.
- the CAR for example, can be selected to target a specific tumor antigen.
- Examples of cancers that can be effectively targeted using CAR T cells are blood cancers.
- CAR T cell therapy can be used to treat solid tumors.
- gene editing or “genome editing”, as used herein, refer to a type of genetic manipulation in which DNA is inserted, replaced, or removed from the genome using artificially manipulated nucleases or “molecular scissors”. It is a useful tool for elucidating the function and effect of sequence-specific genes or proteins or altering cell behavior (e.g. for therapeutic purposes).
- TALENs transcription activator-like effector nucleases
- safe harbor loci e.g. the adeno-associated virus integration site 1 (AAVS1) safe harbor locus.
- AAVS1 adeno-associated virus integration site 1
- CRISPR/Cas9 clustered regularly interspaced short palindromic repeat/Cas9
- Site specific gene editing approaches can include homology dependent mechanisms or homology independent mechanisms.
- Crisp-Cas approach e.g. Crispr-Cas9
- This approach incorporates the use of a guide polynucleotide (e.g. guide ribonucleic acid or gRNA) and a cas endonuclease (e.g. Cas9 endonuclease).
- a guide polynucleotide e.g. guide ribonucleic acid or gRNA
- a cas endonuclease e.g. Cas9 endonuclease
- a polypeptide referred to as a “Cas endonuclease” or having “Cas endonuclease activity” refers to a CRISPR-related (Cas) polypeptide encoded by a Cas gene, wherein a Cas polypeptide is a target DNA sequence that can be cleaved when operably linked to one or more guide polynucleotides (see, e.g., U.S. Pat. No. 8,697,359). Also included in this definition are variants of Cas endonuclease that retain guide polynucleotide-dependent endonuclease activity.
- the Cas endonuclease used in the donor DNA insertion method detailed herein is an endonuclease that introduces double-strand breaks into DNA at the target site (e.g., within the target locus or at the safe harbor site).
- guide polynucleotide relates to a polynucleotide sequence capable of complexing with a Cas endonuclease and allowing the Cas endonuclease to recognize and cleave a DNA target site.
- the guide polynucleotide can be a single molecule or a double molecule.
- the guide polynucleotide sequence can be an RNA sequence, a DNA sequence, or a combination thereof (RNA-DNA combination sequence).
- a guide polynucleotide comprising only ribonucleic acid is also referred to as “guide RNA”.
- a polynucleotide donor construct is inserted at a safe harbor locus using a guide RNA (gRNA) in combination with a cas endonuclease (e.g. Cas9 endonuclease).
- gRNA guide RNA
- cas endonuclease e.g. Cas9 endonuclease
- the guide polynucleotide includes a first nucleotide sequence domain (also referred to as a variable targeting domain or VT domain) that is complementary to a nucleotide sequence in the target DNA, and a second nucleotide that interacts with a Cas endonuclease polypeptide.
- It can be a double molecule (also referred to as a double-stranded guide polynucleotide) comprising a sequence domain (referred to as a Cas endonuclease recognition domain or CER domain).
- the CER domain of this double molecule guide polynucleotide comprises two separate molecules that hybridize along the complementary region.
- the two separate molecules can be RNA sequences, DNA sequences and/or RNA-DNA combination sequences.
- Genome editing using CRISPR-Cas approaches relies on the repair of site-specific DNA double-strand breaks (DSBs) induced by the RNA-guided Cas endonuclease (e.g. Cas 9 endonuclease). Homology-directed repair (HDR) of these DSBs enables precise editing of the genome by introducing defined genomic changes, including base substitutions, sequence insertions, and deletions.
- HDR-based CRISPR/Cas9 genome-editing involves transfecting cells with Cas9, gRNA and donor DNA containing homologous arms matching the genomic locus of interest.
- HITI non-homologous end joining
- NHEJ non-homologous end joining
- gRNAs Guide RNAs
- donor plasmids lack homology arms and DSB repair does not occur through the HDR pathway.
- the donor polynucleotide construct can be engineered to include Cas9 cleavage site(s) flanking the gene or sequence to be inserted. This results in Cas9 cleavage at both the donor plasmid and the genomic target sequence. Both target and donor have blunt ends and the linearized donor DNA plasmid is used by the NHEJ pathway resulting integration into the genomic DSB site.
- the guide RNAs and/or mRNA (or DNA) encoding an endonuclease can be chemically linked to one or more moieties or conjugates that enhance the activity, cellular distribution, or cellular uptake of the oligonucleotide.
- Non-limiting examples of such moieties include lipid moieties such as a cholesterol moiety, cholic acid, a thioether, a thiocholesterol, an aliphatic chain (e.g., dodecandiol or undecyl residues), a phospholipid, e.g., di-hexadecyl-rac-glycerol or triethylammonium 1,2-di-O-hexadecyl-rac-glycero-3-H-phosphonate, a polyamine or a polyethylene glycol chain, adamantane acetic acid, a palmityl moiety and an octadecylamine or hexylamino-carbonyl-t oxycholesterol moiety. See for example U.S. application Ser. No. 15/715,068, the disclosures of which are herein incorporated by reference in their entirety.
- the engineered cells, populations thereof, or compositions thereof are administered to a subject, generally a mammal, generally a human, in an effective amount.
- the engineered cells may be administered to a subject by infusion (e.g., continuous infusion over a period of time) or other modes of administration known to those of ordinary skill in the art.
- infusion e.g., continuous infusion over a period of time
- other modes of administration known to those of ordinary skill in the art.
- the engineered cells provided herein can be administered as part of a pharmaceutical compositions.
- the present disclosure provides compositions comprising a guide RNA of the present disclosure.
- the pharmaceutical composition may comprise one or more pharmaceutical excipients. Any suitable pharmaceutical excipient may be used, and one of ordinary skill in the art is capable of selecting suitable pharmaceutical excipients. Accordingly, the pharmaceutical excipients provided below are intended to be illustrative, and not limiting. Additional pharmaceutical excipients include, for example, those described in the Handbook of Pharmaceutical Excipients , Rowe et al. (Eds.) 6th Ed. (2009), incorporated by reference in its entirety.
- the engineered cells provided herein not only find use in gene therapy but also in non-pharmaceutical uses such as, e.g., production of animal models and production of recombinant cell lines expressing a protein of interest.
- the engineered cells of the present disclosure can be any cell, generally a mammalian cell, generally a human cell that has been modified by integrating a transgene at a safe harbor locus described herein.
- the engineered cells are immune cells.
- the engineered cells are lymphocytes.
- the engineered cells are T cells or T cell precursors.
- the engineered cells, compositions and methods of the present disclosure are useful for therapeutic applications such as CAR T cell therapy and TCR T cell therapy.
- the insertion of a sequence encoding a transgene within a safe harbor locus maintains the TCR expression relative to instances when there is no insertion and enables transgene expression while maintaining TCR function.
- Non-limiting examples of such diseases include alopecia areata, autoimmune hemolytic anemia, autoimmune hepatitis, cancer, dermatomyositis, diabetes (type 1), certain juvenile idiopathic arthritis, glomerulonephritis, Graves' disease, Guillain Valley Syndrome, idiopathic thrombocytopenic purpura, myasthenia gravis, certain myocarditis, multiple sclerosis, pemphigus/pemphigoid, pernicious anemia, polyarteritis nodosa, polymyositis, primary bile With cirrhosis, psoriasis, rheumatoid arthritis, scleroderma/systemic sclerosis, Sjogren's syndrome, systemic lupus erythematosus, certain thyroiditis, certain uveitis, vitiligo, multiple va
- Cancers that can be treated with the engineered cells (e.g., CAR T-cells) of the present disclosure, populations thereof, or compositions thereof include blood cancers.
- the cancer treated using the engineered cells (e.g., CAR T-cells) described herein, populations thereof, or compositions thereof is a hematologic malignancy or leukemia.
- the engineered cells (e.g., CAR T-cells) described herein, populations thereof, or compositions thereof are used for the treatment of acute lymphoblastic leukemia (ALL) or diffuse large B-cell lymphoma (DLBCL).
- ALL acute lymphoblastic leukemia
- DLBCL diffuse large B-cell lymphoma
- the cancer is acute myeloid leukemia (AML), acute lymphoblastic leukemia (ALL), myelodysplasia, myelodysplastic syndromes, acute T-lymphoblastic leukemia, or acute promyelocytic leukemia, chronic myelomonocytic leukemia, or myeloid blast crisis of chronic myeloid leukemia.
- AML acute myeloid leukemia
- ALL acute lymphoblastic leukemia
- myelodysplasia myelodysplasia
- myelodysplastic syndromes myelodysplastic syndromes
- acute T-lymphoblastic leukemia or acute promyelocytic leukemia
- chronic myelomonocytic leukemia chronic myelomonocytic leukemia
- myeloid blast crisis of chronic myeloid leukemia.
- cancers treatable using the engineered cells (e.g., CAR T-cells) described herein include, without limitation, breast cancer, ovarian cancer, esophageal cancer, bladder or gastric cancer, salivary duct carcinoma, salivary duct carcinomas, adenocarcinoma of the lung or aggressive forms of uterine cancer, such as uterine serous endometrial carcinoma.
- the cancer is brain cancer, breast cancer, cervical cancer, colon cancer, colorectal cancer, endometrial cancer, esophageal cancer, leukemia, lung cancer, liver cancer, melanoma, ovarian cancer, pancreatic cancer, rectal cancer, renal cancer, stomach cancer, testicular cancer, or uterine cancer.
- the cancer is a squamous cell carcinoma, adenocarcinoma, small cell carcinoma, melanoma, neuroblastoma, sarcoma (e.g., an angiosarcoma or chondrosarcoma), larynx cancer, parotid cancer, biliary tract cancer, thyroid cancer, acral lentiginous melanoma, actinic keratoses, acute lymphocytic leukemia, acute myeloid leukemia, adenoid cystic carcinoma, adenomas, adenosarcoma, adenosquamous carcinoma, anal canal cancer, anal cancer, anorectum cancer, astrocytic tumor, bartholin gland carcinoma, basal cell carcinoma, biliary cancer, bone cancer, bone marrow cancer, bronchial cancer, bronchial gland carcinoma, carcinoid, cholangiocarcinoma, chondrosarcoma, choroid
- the present disclosure provides methods of treating a subject in need of treatment by administering to the subject a composition comprising any of the engineered cells described herein.
- the terms “treat,” “treatment,” and the like refer generally to obtaining a desired pharmacological and/or physiological effect. That effect is preventive in terms of complete or partial prevention of the disease and/or therapeutic in terms of partial or complete cure of the disease and/or adverse effects resulting from the disease.
- treatment encompasses any treatment of a disease in a subject (e.g., mammal, e.g., human).
- Treatment may also refer to the administration of the engineered cells provided herein to a subject that is susceptible to the disease but has not yet been diagnosed as suffering from it, including preventing the disease from occurring; inhibiting disease progression; or reducing the disease (i.e., causing a regression of the disease). Further, treatment may stabilize or reduce undesirable clinical symptoms in subjects (e.g., patients).
- the cells provided herein populations thereof, or compositions thereof may be administered before, during or after the occurrence of the disease or injury.
- the subject has a disease, condition, and/or injury that can be treated and/or ameliorated by cell therapy.
- the subject in need of cell therapy is a subject having an injury, disease, or condition, thereby causing cell therapy (e.g., therapy in which cellular material is administered to the subject).
- cell therapy e.g., therapy in which cellular material is administered to the subject.
- a subject in need of cell therapy includes, but is not limited to, a bone marrow transplant or stem cell transplant candidate, a subject who has received chemotherapy or radiation therapy, a hyperproliferative disease or cancer (e.g., a hematopoietic system), a subject having or at risk of developing a hyperproliferative disease or cancer), a subject having or at risk of developing a tumor (e.g., solid tumor), viral infection or virus. It is also intended to encompass subjects suffering from or at risk of suffering from a disease associated with an infection.
- a hyperproliferative disease or cancer e.g., a hematopoietic system
- a subject having or at risk of developing a hyperproliferative disease or cancer e.g., solid tumor
- a tumor e.g., solid tumor
- the present disclosure provides a composition of the present disclosure along with instructions for use.
- the instructions for use can be present in the kits as a package insert, in the labeling of the container of the kit or components thereof, or can be in digital form (e.g. on a CD-ROM, via a link on the internet).
- a kit can include one or more of a genome-targeting nucleic acid, a polynucleotide encoding a genome-targeting nucleic acid, a site-directed polypeptide, and/or a polynucleotide encoding a site-directed polypeptide. Additional components within the kits are also contemplated, for example, buffer (such as reconstituting buffer, stabilizing buffer, diluting buffer), and/or one or more control vectors.
- an engineered cells of the present disclosure or composition thereof is administered with at least one additional therapeutic agent.
- Any suitable additional therapeutic agent may be administered with an engineered cell provided herein, populations thereof, or compositions thereof.
- the additional therapeutic agent is selected from radiation, an ophthalmologic agent, a cytotoxic agent, a chemotherapeutic agent, a cytostatic agent, an anti-hormonal agent, an immunostimulatory agent, an anti-angiogenic agent, and combinations thereof.
- an engineered cell of the present disclosure or composition thereof is administered with a steroid.
- the administration of a steroid can prevent or mitigate the risk of a subject receiving the engineered cell(s) or composition thereof having an autoimmune reaction.
- the additional therapeutic agent may be administered by any suitable means.
- the engineered cells described herein, populations thereof, or compositions thereof and the additional therapeutic agent is administered in the same pharmaceutical composition, e.g. by infusion.
- the engineered cells described herein and additional therapeutic agent are included in different pharmaceutical compositions.
- the pharmaceutical composition may comprise one or more pharmaceutical excipients. Any suitable pharmaceutical excipient may be used, and one of ordinary skill in the art is capable of selecting suitable pharmaceutical excipients. Accordingly, the pharmaceutical excipients provided below are intended to be illustrative, and not limiting. Additional pharmaceutical excipients include, for example, those described in the Handbook of Pharmaceutical Excipients , Rowe et al. (Eds.) 6th Ed. (2009), incorporated by reference in its entirety.
- the additional therapeutic agent is administered by any suitable mode of administration.
- modes of administration include, without limitation, intravitreal, subretinal, suprachoroidal, intraarterial, intradermal, intramuscular, intraperitoneal, intravenous, nasal, parenteral, topical, pulmonary, and subcutaneous routes.
- administering can occur prior to, simultaneously, and/or following, administration of the additional therapeutic agent.
- the objective of the experiments in this set of examples was to identify T-cell knock-in (KI) loci in coding regions outside of the T Cell Receptor Alpha Constant (TRAC) locus.
- KI T-cell knock-in loci
- T Cell Receptor Alpha Constant T Cell Receptor Alpha Constant
- CAR T chimeric antigen receptor therapy
- FIG. 4 A and FIG. 4 B show the processed RNA-seq data, showing that samples cluster based on activation status and by cell type. Donor 4 (AKI4) was identified as an outlier, as it clustered differently than the others, and was removed from remaining analysis.
- the transcript expression data from RNA-seq experiments on the 90 genes cited in the Roth, T. L. et al. paper was correlated to transcript expression data in Roth, T. L. et al. paper. (See FIG. 5 A and FIG. 5 B ).
- FIG. 6 shows the process ATAC-seq data, which focused on the 10 kb area around the transcription start site (TSS). Again, donor 4 (AK14) was identified as an outlier, based on expression profile in both CD4 and CD8 cells, and was removed from remaining analysis. The results of the remaining ATAC-seq analysis are shown in FIGS. 7 A- 8 . The data in FIG. 7 A for TSS enrichment scores also indicated that all libraries were high quality. The data on open chromatin regions around the TRAC locus revealed the highest signal near exon 3 rather than exon 1 ( FIG. 8 ).
- the K1 efficiency results (obtained using the model described infra) based on the 5 donors, 2 gRNA, and 2 replicates/gRNAs from the Roth, T. L. et al. paper are shown in FIG. 9 and FIG. 10 .
- FIG. 11 shows KI efficiency versus day 2 (d2) RNA-seq data, day 4 (d4) RNA seq data, and d2 ATAC data.
- the linear model was applied to remaining candidate genes to estimate their KI efficiency.
- the candidate coding loci were selected by first ranking all genes in the pooled data sets by predicted KI efficiency, using RNA-seq expression data from d2 (day 2).
- the candidate loci also had to be accessible based on ATAC-seq data.
- 16 well-characterized coding genes were selected as candidate genes. The knockout of these 16 would confer a benefit to the function of CART cells (e.g. B2M, CD5, SMAD2, PTPRC, CD3E).
- An additional 12 coding genes with high predicted KI efficiency and no apparent essential function were selected as candidate genes. (See Table 2).
- the inventors here started by looking in highly accessible regions of the genome (10 kb windows). The most accessible region overlapped with (i) annotated protein coding genes (>50% accessible regions), (ii) pseudogenes and noncoding RNAs ( ⁇ 20% accessible regions), and (iii) enhancer/regulatory regions ( ⁇ 20% accessible regions).
- the candidate genes were required to be ⁇ 10 kb from the coding regions.
- a few regions that overlapped with long intergenic noncoding RNAs (lincRNAs) but did not have apparent function in T cells were also considered.
- FIG. 12 is a plot showing the normalized ATAQ SEQ data for a top candidate non-coding region.
- FACS panel GFP, TCR, Zombie
- RNA-seq data at d0, d2, and d4 (to determine stable expression)
- ATAC-seq data at d2 (to identify activated cells).
- the methods used for conducting the RNA-seq and ATAC-seq experiments are in Roth, T. L., et al. 2019. (Rapid discovery of synthetic DNA sequences to rewrite endogenous T cell circuits. bioRxiv, 604561), which is herein incorporated by reference in its entirety.
- Loci significance was calculated using robust rank aggregation, which was the methodology used for consistent ranking of loci among donors and time points.
- Methods for conducting robust rank aggregation are known in the art. (See, for example, Kolde, R., et al. (2012). Robust rank aggregation for gene list integration and meta-analysis. Bioinformatics (Oxford, England), 28(4), 573-580, the disclosure of which is herein incorporated by reference in its entirety).
- FIGS. 13 A- 25 The results from the analysis of controls and experimental groups is shown in FIGS. 13 A- 25 .
- the loci were ranked based on mean florescence intensity (MFI) of reporter gene, GFP, and KI efficiency.
- FIG. 29 shows GFP MFI and KI efficiency for all significant loci having endogenous promoters.
- the B2M locus reported the highest GFP MFI.
- the TRAC locus was among the top 10 MFI.
- the highest GFP MFI readings were reported in week 1 and reduced slightly in weeks 3 and 4.
- the results also showed most top loci have more than 1 sgRNA.
- the SOCS1 locus reported the highest KI efficiency. Overall, KI efficiency showed greater variation among donors than GFP MFI.
- FIG. 30 shows GFP MFI and KI efficiency for all significant loci having exogenous promoters (e.g. EF1a promoters).
- the TRAC locus reported the highest GFP MFI. There was comparable expression between weeks 3 and 4. The results also showed most top loci have more than 1 sgRNA.
- the SOCS1 locus reported the highest KI efficiency. Overall, KI efficiency showed less variation when driven by the exogenous promoters than when driven by endogenous promoters.
- FIGS. 31 A, 31 B, and 31 C show GFP MFI and KI efficiency for all significant loci having endogenous and exogenous (e.g., EF1a) promoters.
- endogenous and exogenous (e.g., EF1a) promoters The results showed that expression driven by EF1a promoters is about 10 ⁇ higher than endogenous promoters ( FIG. 31 A ).
- KI efficiency the SOCS1 locus reported the highest KI efficiency. At weeks 3 and 4, KI efficiency was higher when driven by the exogenous promoters than when driven by endogenous promoters.
- TCR expression was measured following circuit cassette knock in of PrimeR (Prime Receptor (Myc)).
- Myc denotes an N-terminal Myc epitope tag to facilate detection of surface expressed primer receptor.
- CD3-CD28 Dynabead-activated T cells were electroporated with sgRNA/Cas9 RNPs targeting the indicated sites (see FIGS. 32 B and 33 A ), as well as HDRTs with homology arms directing HDR-mediated integration into the indicated sites.
- sgRNA/Cas9 RNPs targeting the indicated sites (see FIGS. 32 B and 33 A ), as well as HDRTs with homology arms directing HDR-mediated integration into the indicated sites.
- At day 6 post-electroporation cells were stained with anti-TCRalpha/beta and anti-myc antibodies and analyzed on an Attune NxT flow cytometer. As shown in FIG.
- the TCR expression was maintained at GS94 and GS102 candidate integration (knock-in) sites. Only 2 ⁇ 3 of the cells that had circuit cassette knock in of PrimeR at GS79 (TRAC) locus maintained TCR expression. These results indicated that the GS94 and GS102 sites showed better potential for TCR stimulation.
- the percentage of cells showing effective knock-in was 36% ⁇ 4% when using the GS94 integration site, as compared to 32% ⁇ 5% when using the TRAC integration site.
- GS94 is a candidate integration site located on chromosome 11's distal q arm. It is within 180-350 kb of the promoters for ETS1 and FLI1 ( FIG. 33 B ), however that is considered low-risk for integration vector gene therapy. The circuit expression and function potential of the GS94 gene was evaluated.
- T cells underwent circuit cassette integration with PrimeR at GS79 (TRAC) integration site and GS94 integration site.
- the cells were cocultured with K562 C19 cells for 48 hours and then the PrimeR induced CAR MFI was compared to the PrimeR MFI.
- T cells generated as described in the Example above were cocultured with K562 CD19+/MSLN ⁇ cells at day 7 post-electroporation.
- MSLN (mesothelin) is a gene that is overexpressed in human pancreatic cancer.
- Cells were then stained with anti-FLAG antibody 48 h post initiation of coculture and analyzed on an Attune NxT flow cytometer.
- GS94 resulted in prime antigen-dependent CAR expression that was approximately two-fold higher than the expression in several other candidate integration sites as well as the TRAC integration site. Additionally, on average, the prime receptor surface expression level was no less than 50% of expression level when using the TRAC integration site.
- T cells that had undergone circuit cassette integration with PrimeR at GS79 (TRAC) integration site and GS94 integration site were cocultured with K562 C19/MSLN cells for 48 hours.
- MSLN is a gene that is overexpressed in human pancreatic cancer.
- the cells were treated at a 1:1 effector:target cell ratio (1:1 E:T)).
- T cells generated as described above were cocultured with K562 CD19+/MSLN+ cells at day 7 post-electroporation. 48 h post initiation of coculture, supernatants were collected and analyzed via Luminex® for cytokine levels.
- the cytokines measure were IL-2, INFg, and TNF. Cytotoxicity was analyzed by measuring luciferase activity of remaining target cells after 48 h. Each of the data points in FIG. 34 B represent two replicates and the lines represent the range of cytotoxicity for the replicates. As shown in FIG. 34 B , the GS94 integration sites resulted in superior cycotoxic ability and cytokine secretion following the 48-hour coculture with K562 C19/MSLN cells.
- T cells that had undergone circuit cassette integration with PrimeR at GS79 (TRAC) integration site and GS94 integration site were cocultured with K562 CD19+/MSLN+ cells or K562 CD19 ⁇ /MSLN+ cells (“K562 MSLN”) at day 7 post-electroporation for 48 hours. 0.3, 1.0, and 3.0 E:T cell ratios were tested. See FIG. 35 A and FIG. 35 B .
- cytotoxicity was analyzed by measuring luciferase activity of remaining target cells
- the GS94 integration site resulted in equivalent cytotoxic potential to the TRAC integration site and there was no prime-independent cytotoxicity.
- a group that had target cells only (E:T 0; Targets only) was compared to a group with an E:T cell ratio of 1.
- supernatant was collected and analyzed via Luminex® for cytokine levels to measure secretion of IL-2, INFg and TNF cytokines. See FIG. 36 A and FIG. 36 B .
- the GS94 integration site resulted in equivalent cytokine secretion to the TRAC integration site and there was no prime-independent secretion of IL-2, INFg or TNF.
- T cells that had undergone circuit cassette integration with PrimeR at GS79 (TRAC) integration site, GS94 integration site, and GS102 integration site were cultured in vitro for 32 days.
- the cells were treated with repetitive CD3/CD28 stimulation at days 5, 12, 19 and 28 of the experiment.
- the cells were evaluated for CAR expression using a flow cytometry assay.
- T cell activation through TCR did not result in prime R-independent CAR expression from circuit cassette integration at the candidate integration sites.
- T cells generated as described above were cultured in 96-well plates, with T cell growth medium being exchanged every 2 days. At days 5, 12, 19 and 28, T cells were stimulated with 1:1 CD3/CD28 Dynabeads. Cells were analyzed for PrimeR expression by myc epitope tag staining, and for CAR expression by FLAG epitope tag staining at the indicated time points. Flow analysis was performed on an Attune NxT flow cytometer.
- T cells that had undergone circuit cassette integration with PrimeR at the integration sites indicated in FIG. 38 were cultured in vitro for 32 days. Briefly, T cells generated as described above were cultured in 96-well plates, with T cell growth medium being exchanged every 2 days. At days 5, 12, 19 and 28, T cells were stimulated with 1:1 CD3/CD28 Dynabeads repetitive stimulation. Flow cytometry assays were run on days 16 and 32 using an Attune NxT flow cytometer. The cells were analyzed for PrimeR expression by myc epitope tag staining. As shown in FIGS. 38 A and 38 B , the GS94 integration site resulted in stable PrimeR expression over at least a 4-week period.
- iGUIDE-Seq assay was used.
- the methods used for conducting the iGUIDE-Seq assay are illustrated in FIG. 39 A and provided in Nobles et al., Genome Biology (2019), which is hereby incorporated by reference in its entirety.
- FIG. 39 B the GS94 integration site had the highest on-target editing efficiency of the evaluated candidate integration sites.
- FIG. 39 C GS94 resulted in no putative off-target editing as observed with two donors.
- Elevation prediction Computational predictions of potential off-target sites from (gs94) were performed using Elevation-search (algorithm described in Listberg et al. 2018. Prediction of off-target activities for the end-to-end design of CRISPR guide RNAs. Nat Biomed Engr 2, 37-48; software obtained from github.com/Microsoft/Elevation). All sites identified by Elevation-search were subjected to analysis using rhAmp-segTM.
- rhAmpSegTM 49 candidate off-target sites for GS94 identified by iGUIDE or the Elevation prediction algorithm and the GS94 target site were characterized by rhAmpSegTM (Integrated DNA Technologies, Inc.). This targeted amplification enables NGS-based quantification of the editing occurring at numerous sites simultaneously. Genomic DNA from T cells from at least 2 donors that had been treated singly with each of the following 7 guides: GS84, GS94, GS95, GS96, GS102, GS108, and GS138 was isolated with the GenFind® V3 DNA purification system (Beckman Coulter).
- rhAmpSegTM amplification pools Two separate rhAmpSegTM amplification pools were used to cover the 50 loci, the procedure was performed as recommended by Integrated DNA Technologies for each of the samples.
- the rhAmpSegTM libraries were sequenced on a MiniSegTM with a Mid Output Kit (300-cycles) (Illumina).
- the CRISPResso2 algorithm (github.com/pinellolab/CRISPResso2) was used to determine the percentage of insertions and deletions at each of the amplified loci.
- Statistical significance FDR-adjusted p-value ⁇ 0.001 using a chi-squared test was only observed at the GS94 site.
- RNA-seq To evaluate changes induced by GS94 integration at the transcriptional level, a CD19/MSLN circuit was integrated at the GS79 (TRAC), GS94, and GS102 integration sites. On day 6 post-integration, 1e6 edited cells were sorted using a BD FACSAria based on transgene expression. RNA was isolated from sorted T cells with the RNeasy kit (Qiagen). Purified RNA was converted into an NGS library using the TruSeqTM RNA Library Prep Kit v2 (Illumina). Libraries were sequenced on either the NovaSeqTM 6000 or NextSeqTM 550 instruments (Illumina). The STAR 2.7.3a aligner (Dobin A. et al.
- Cytokine-independent growth assay To evaluate the safety of the primary T cells with GS94 locus KI, cytokine-independent growth assay was performed to evaluate the potential for oncogenic transformation. Briefly, primary human T cells that had undergone CD19/MSLN circuit cassette integration at GS94 locus were thawed and recovered overnight. 1 ⁇ 10 6 cells were then seeded in one well of a 24 well-GRex plate, culturing for 5 days in the medium with or without cytokines. Cell number and viability were recorded at days 0, 3 and 5. As a positive control, 1 ⁇ 10 6 Jurkat cells were cultured in the medium without cytokines in parallel. As shown in FIG.
- CRISPR reagents e.g. SpCas9 complexed with sgRNA
- GS94-targeting CRISPR RNP showed the highest percentage of iGUIDE-seq oligo cassette trapping events of all candidates evaluated, and the control sgRNA sequences from the iGUIDE-seq paper showed similar specificities to what was reported in the original publication, suggesting that the assay performed as expected.
- Putative off-target sites were taken from the iGUIDE-seq output, which already suggested that the putative sites were spurious. Additional target sites were predicted by a computational approach (Elevation software package). rhAmp-segTM was used to prepare high-throughput sequencing libraries for each of the putative off-target sites, and the method was applied to DNA samples from T cells electroporated with CRISPR RNPs targeting the candidate target sites. The resulting NGS data were processed with CRISPResso2 software, and the frequency of insertions and deletions (indels) was taken as indication of CRISPR cleavage activity, as is common in the field.
- indels frequency of insertions and deletions
- T cells electroporated with GS94-targeting CRISPR RNP showed no greater frequency of indels at the set of putative off-target sites than T cells treated with CRISPR RNP targeting other sites, consistent with the GS94-targeting CRISPR RNP having no consequential or detectable off-target activity, and therefore being the most specific out of the set evaluated ( FIG. 41 ).
- RNA-seq libraries were prepared and sequenced, and analysis of the resulting Illumina sequencing data revealed no biologically or statistically significant differences in expression of any genes within 300 kb of the GS94 site in cells with integrations at GS94 compared to cells with integration at TRAC or the GS102 sites ( FIG. 42 ). Furthermore, other gene expression differences that reached statistical significance were minimal in number and in effect size, consistent with them being noise in the comparison.
- transgene integration at GS94 could confer a transformed phenotype
- cells with integrations at the GS94 site were cultured with and without cytokines in vitro. Cells remained alive and viable with cytokine addition, but died without cytokine supplementation and lost their viability ( FIG. 43 ). The positive control Jurkat cells remained viable and proliferated. Overall, this indicates that integration of a transgene at GS94 does not confer capacity for cytokine-independent growth, which is a hallmark of T cell transformation.
- T cells In vivo efficacy of T cells with a transgene cassette expressing a CAR recognizing a tumor antigen, or a CAR recognizing a tumor antigen under control of a priming receptor recognizing an antigen in the anatomical vicinity of the tumor, is assessed against human tumor cells such as K562 engineered to express the CAR antigen or to express antigens recognized by both the priming receptor and the CAR.
- Tumor cells e.g. 1e6
- Tumor growth is assessed by dimensional measurement by calipers every 2-4 days.
- mice are intravenously injected with 5e6 T cells with a CAR or prime-CAR circuit cassette integrated at a specific site by CRISPR-mediated insertion, or with T cells engineered with CRISPR RNP alone, or with PBS alone as a sham injection. Tumor growth is monitored and mice are euthanized when tumor volume reaches 2000 cubic mm. Peripheral blood is bled from mice through a retro-orbital procedure, and flow cytometry and/or ddPCR is used to observe engineered T cell expansion over time.
- spleen, blood, tumor and/or other tissue is analyzed via flow cytometry, ddPCR, and/or immunohistochemistry for the presence of engineered T cells.
- the results demonstrate that T cells engineered with cassette integration at one of the defined genomic loci lead to tumor regression and clearance in injected mice as compared to T cells without cassette integration, and that engineered T cells are detectable in the peripheral blood and tissues of injected mice.
- sgRNA_ summarized start_ sgRNA Integra- from 2 sgRNA coor Target tion donors, 2 ID sgRNA Sequence GRCH38 Loci Site primersets sgRNA_ GCACCTGAATACCACGCCTG chr16: APRT APRT 79.28 1 (SEQ ID NO: 1) 88811818 sgRNA_ CGCCTGCGATGTAGTCGATG chr16: APRT APRT 78.60 2 (SEQ ID NO: 2) 88811551 sgRNA_ CAGGACGGGCGAGATGTCCC chr16: APRT APRT 85.25 3 (SEQ ID NO: 3) 88811640 sgRNA_ CTGAATCTTTGGAGTACCTG chr15: B2M B2M 78.51 4 (SEQ ID NO: 4) 44715425 sgRNA_
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Engineering & Computer Science (AREA)
- Genetics & Genomics (AREA)
- Chemical & Material Sciences (AREA)
- Biomedical Technology (AREA)
- Zoology (AREA)
- Organic Chemistry (AREA)
- Wood Science & Technology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Health & Medical Sciences (AREA)
- Biotechnology (AREA)
- Microbiology (AREA)
- General Engineering & Computer Science (AREA)
- Immunology (AREA)
- Molecular Biology (AREA)
- Cell Biology (AREA)
- Biochemistry (AREA)
- Epidemiology (AREA)
- Medicinal Chemistry (AREA)
- Veterinary Medicine (AREA)
- Public Health (AREA)
- Animal Behavior & Ethology (AREA)
- Pharmacology & Pharmacy (AREA)
- Mycology (AREA)
- Plant Pathology (AREA)
- Biophysics (AREA)
- Physics & Mathematics (AREA)
- Oncology (AREA)
- Hematology (AREA)
- Endocrinology (AREA)
- Gastroenterology & Hepatology (AREA)
- Hospice & Palliative Care (AREA)
- Developmental Biology & Embryology (AREA)
- Virology (AREA)
- Medicines Containing Material From Animals Or Micro-Organisms (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
Abstract
Provided herein are safe harbor loci and methods for identifying and using safe harbor loci to engineer cells to express transgenes. The safe harbor loci exhibit increased transgene knock-in efficiency and allow for increased, stable expression of transgenes in engineered cells. Guide ribonucleic acids (gRNAs) may be used for insertion of transgenes in the safe harbor loci.
Description
- This application is a divisional of U.S. application Ser. No. 17/716,816 filed Apr. 8, 2022, allowed, which is a divisional of U.S. application Ser. No. 17/239,484 filed Apr. 23, 2021, allowed, which claims priority to and the benefit of U.S. Provisional Patent Application No. 63/141,926, filed on Jan. 26, 2021, and U.S. Provisional Patent Application No. 63/105,834, filed on Oct. 26, 2020, the entire contents of which are incorporated by reference herein for all purposes.
- The instant application contains a Sequence Listing which will be submitted via Patent Center and is hereby incorporated by reference in its entirety. Said XML copy, created on Aug. 29, 2023, is named ANB-203D2_SL.xml and is 845,305 bytes in size.
- Cancer continues to present a significant clinical burdSen despite the substantial research efforts and scientific advances in cancer therapies. Blood and bone marrow cancers are frequently diagnosed cancer types, including multiple myelomas, leukemia, and lymphomas. Current treatment options for these cancers are not effective for all patients and/or can have substantial adverse side effects. Other types of cancer also remain challenging to treat using existing therapeutic options. Cancer immunotherapies are a promising solution because they can be highly specific, allowing for increased therapeutic effectiveness and the mitigation of side effects.
- Genetically engineered immune cell therapy is a growing field with promising applications for the treatment of diseases including, but not limited to, cancer. Through the alteration of coding and/or non-coding genomic regions, researchers are identifying transgenes and insertion sites within cells that facilitate, for example, enhanced cell function, arrest cell growth, induced cell death, and tumor size/volume reduction. The identification of safe harbor sites (SHS) has improved outcomes of genome-engineering therapies. Well known SHS include the AAVS1 adeno-associated virus insertion site on
chromosome 19, the human homolog of the murine Rosa26 locus, and the CCR5 chemokine receptor gene—the absence of which confers HIV resistance. (See, for example, Pellenz et al., 2018, the relevant disclosures of which are herein incorporated by reference). However, there is still a need for improved guidelines for gene editing therapies and additional SHS to address challenges such as poor knock-in (KI) efficiency, insertional oncogenesis, unstable and/or anomalous expression of transgenes and/or adjacent genes, etc. - The present disclosure is directed, inter alia, to safe harbor loci that exhibit high knock-in efficiency and stable expression of their transgenes. These safe harbor loci can be used to alter T cells for immunotherapy. These safe harbor loci are useful for the treatment of various diseases, including cancer.
- In one aspect, the present disclosure provides an engineered cell, comprising at least one sequence encoding a transgene, wherein the at least one sequence is inserted within a safe harbor locus, the safe harbor locus is at any one or more of the sgRNA target loci provided in Table 4; and wherein expression of the at least one sequence encoding the transgene is operatively linked to an endogenous promoter. In another aspect, the present disclosure provides an engineered cell, comprising at least one sequence encoding a transgene, wherein the at least one sequence is inserted within a safe harbor locus, the safe harbor locus is at any one or more of the sgRNA target loci provided in Table 4; and wherein expression of the at least one sequence encoding the transgene is operatively linked to an exogenous promoter.
- In some embodiments, the target locus is selected from: chr10:33130000-33140000, chr10:72290000-72300000, chr11:128340000-128350000, chr11:65425000-65427000 (NEAT1), chr15:92830000-92840000, chr16:11220000-11230000, chr2:87460000-87470000, chr3:186510000-186520000, chr3:59450000-59460000, chr8:127980000-128000000, and chr9:7970000-7980000. In some embodiments, the target locus is selected from: chr10:72290000-72300000, chr11:128340000-128350000, chr15:92830000-92840000, and chr16:11220000-11230000. In some embodiments, the target locus is chr11:128340000-128350000. In some embodiments, the target locus is chr15:92830000-92840000. In some embodiments, the target locus is a gene selected from: APRT, B2M, CAPNS1, CBLB, CD2, CD3E, CD3G, CD5, EDF1, FTL, PTEN, PTPN2, PTPN6, PTPRC, PTPRCAP, RPS23, RTRAF, SERF2, SLC38A1, SMAD2, SOCS1, SRP14, SRSF9, SUB1, TET2, TIGIT, TRAC, and TRIM28.
- In some embodiments, the safe harbor locus is selected from any one of the integration sites in Table 4 designated: GS88, GS89, GS90, GS91, GS92, GS93, GS94, GS95, GS96, GS97, GS98, GS99, GS100, GS101, GS102, GS103, GS104, GS105, GS106, GS107, GS108, GS109, GS110, GS111, GS112, GS113, GS114, GS115, GS116, GS117, GS118, GS119, GS120. In some embodiments, the safe harbor locus is selected from any one of the integration sites in Table 4 designated: GS91, GS92, GS93, GS94, GS95, GS96, GS100, GS101, GS102, GS103, GS104, and GS105. In some embodiments, the safe harbor locus is selected from any one of the integration sites in Table 4 designated: GS103, GS104, and GS105. In some embodiments, the safe harbor locus is selected from any one of the integration sites in Table 4 designated: GS94, GS95, and GS96. In some embodiments, the safe harbor locus is the GS94 integration site in Table 4. In some embodiments, the safe harbor locus is selected from any one of the integration sites in Table 4 designated: GS100, GS101, and GS102. In some embodiments, the safe harbor locus is the GS102 integration site in Table 4. In some embodiments, the safe harbor locus is selected from any one of the integration sites in Table 4 designated: GS91, GS92, and GS93.
- In some embodiments, the exogenous promoter is an EF1a promoter. In some embodiments, the engineered cell is a stem cell, a human cell, a primary cell, an hematopoietic cell, an adaptive immune cell, an innate immune cell, a T cell or a T cell progenitor. In some embodiments, the transgene encodes a recombinant protein, optionally a therapeutic agent. In some embodiments, the transgene encodes a chimeric antigen receptor (CAR).
- In another aspect, the present disclosure provides a composition comprising the engineered cell as described herein and a pharmaceutical excipient.
- In another aspect, the present disclosure provides a guide ribonucleic acids (gRNA) for editing a cell at a safe harbor locus, wherein gRNA comprises any one of the sgRNA sequences in Table 4.
- In some embodiments, the gRNA comprises any one of SEQ ID NOS:1-120. In some embodiments, the gRNA comprises any one of SEQ ID NOS: 91-96 and 100-105. In some embodiments, the gRNA comprises SEQ ID NO:94 or SEQ ID NO:102. In some embodiments, the gRNA comprises SEQ ID NO:94. In some embodiments, the gRNA comprises SEQ ID NO:102. In some embodiments, the cell is a stem cell, a human cell, a primary cell, an hematopoietic cell, an adaptive immune cell, an innate immune cell, a T cell or a T cell progenitor.
- In another aspect, the present disclosure provides a method of editing a cell having chromosomal DNA, comprising inserting at least one sequence encoding a transgene within a safe harbor locus in the chromosomal DNA of the cell, wherein the safe harbor locus is any one or more of the sgRNA target loci provided in Table 4.
- In some embodiments, the target locus is selected from: chr10:33130000-33140000, chr10:72290000-72300000, chr11:128340000-128350000, chr11:65425000-65427000 (NEAT1), chr15:92830000-92840000, chr16:11220000-11230000, chr2:87460000-87470000, chr3:186510000-186520000, chr3:59450000-59460000, chr8:127980000-128000000, and chr9:7970000-7980000. In some embodiments, the target locus is selected from: chr10:72290000-72300000, chr11:128340000-128350000, chr15:92830000-92840000, and chr16:11220000-11230000. In some embodiments, the target locus is chr11:128340000-128350000. In some embodiments, the target locus is chr15:92830000-92840000. In some embodiments, the target locus is a gene selected from: APRT, B2M, CAPNS1, CBLB, CD2, CD3E, CD3G, CD5, EDF1, FTL, PTEN, PTPN2, PTPN6, PTPRC, PTPRCAP, RPS23, RTRAF, SERF2, SLC38A1, SMAD2, SOCS1, SRP14, SRSF9, SUB1, TET2, TIGIT, TRAC, and TRIM28.
- In some embodiments, the safe harbor locus is selected from any one of the integration sites in Table 4 designated: GS88, GS89, GS90, GS91, GS92, GS93, GS94, GS95, GS96, GS97, GS98, GS99, GS100, GS101, GS102, GS103, GS104, GS105, GS106, GS107, GS108, GS109, GS110, GS111, GS112, GS113, GS114, GS115, GS116, GS117, GS118, GS119, GS120. In some embodiments, the safe harbor locus is selected from any one of the integration sites in Table 4 designated: GS91, GS92, GS93, GS94, GS95, GS96, GS100, GS101, GS102, GS103, GS104, and GS105. In some embodiments, the safe harbor locus is selected from any one of the integration sites in Table 4 designated: GS103, GS104, and GS105. In some embodiments, the safe harbor locus is selected from any one of the integration sites in Table 4 designated: GS94, GS95, and GS96. In some embodiments, the safe harbor locus is the GS94 integration site in Table 4. In some embodiments, the safe harbor locus is selected from any one of the integration sites in Table 4 designated: GS100, GS101, and GS102. In some embodiments, the safe harbor locus is the GS102 integration site in Table 4. In some embodiments, the safe harbor locus is selected from any one of the integration sites in Table 4 designated: GS91, GS92, and GS93.
- In some embodiments, the transgene encodes a recombinant protein, optionally a therapeutic agent. In some embodiments, the transgene encodes a chimeric antigen receptor (CAR). In some embodiments, the at least one sequence comprises an exogenous promoter and the exogenous promoter is operably linked to the transgene. In some embodiments, the exogenous promoter is an EF1a promoter. In some embodiments, the cell is a stem cell, a human cell, a primary cell, an hematopoietic cell, an adaptive immune cell, an innate immune cell, a T cell or T cell progenitor. In some embodiments, the at least one sequence is inserted using a homology-directed repair. In some embodiments, the at least one sequence is inserted using a homology independent targeted insertion. In some embodiments, the at least one sequence is inserted using one or more guide ribonucleic acids (gRNAs) and one or more Cas9 endonucleases.
- In some embodiments, the one or more gRNAs comprises any one of SEQ ID NOS: 1-120. In some embodiments, the one or more gRNAs comprises any one of SEQ ID NOS: 91-96 and 100-105. In some embodiments, the gRNA comprises SEQ ID NO:94 or SEQ ID NO:102. In some embodiments, the gRNA comprises SEQ ID NO:94. In some embodiments, the gRNA comprises SEQ ID NO:102.
- In another aspect, the present disclosure provides a method of editing a T cell, comprising contacting a T cell with one or more guide ribonucleic acids (gRNAs), at least one sequence encoding a transgene, and one or more Cas9 endonucleases, wherein the one or more gRNAs and Cas9 endonucleases facilitate the insertion of the at least one sequence into chromosomal DNA within a safe harbor locus, wherein the safe harbor locus is selected from any one or more of the sgRNA target loci in Table 4.
- In some embodiments, the one or more gRNAs comprises a sequence selected from any one of the sgRNA sequences in Table 4. In some embodiments, the one or more gRNAs comprises any one of SEQ ID NOS: 1-120. In some embodiments, the one or more gRNAs comprises any one of SEQ ID NOS: 91-96 and 100-105. In some embodiments, the gRNA comprises SEQ ID NO:94 or SEQ ID NO:102. In some embodiments, the gRNA comprises SEQ ID NO:94. In some embodiments, the gRNA comprises SEQ ID NO:102.
- In some embodiments, the target locus is selected from: chr10:33130000-33140000, chr10:72290000-72300000, chr11:128340000-128350000, chr11:65425000-65427000 (NEAT1), chr15:92830000-92840000, chr16:11220000-11230000, chr2:87460000-87470000, chr3:186510000-186520000, chr3:59450000-59460000, chr8:127980000-128000000, and chr9:7970000-7980000. In some embodiments, the target locus is selected from: chr10:72290000-72300000, chr11:128340000-128350000, chr15:92830000-92840000, and chr16:11220000-11230000. In some embodiments, the target locus is chr11:128340000-128350000. In some embodiments, the target locus is chr15:92830000-92840000. In some embodiments, the target locus is a gene selected from: APRT, B2M, CAPNS1, CBLB, CD2, CD3E, CD3G, CD5, EDF1, FTL, PTEN, PTPN2, PTPN6, PTPRC, PTPRCAP, RPS23, RTRAF, SERF2, SLC38A1, SMAD2, SOCS1, SRP14, SRSF9, SUB1, TET2, TIGIT, TRAC, and TRIM28.
- In some embodiments, the safe harbor locus is selected from any one of the integration sites in Table 4 designated: GS88, GS89, GS90, GS91, GS92, GS93, GS94, GS95, GS96, GS97, GS98, GS99, GS100, GS101, GS102, GS103, GS104, GS105, GS106, GS107, GS108, GS109, GS110, GS111, GS112, GS113, GS114, GS115, GS116, GS117, GS118, GS119, GS120. In some embodiments, the safe harbor locus is selected from any one of the integration sites in Table 4 designated: GS91, GS92, GS93, GS94, GS95, GS96, GS100, GS101, GS102, GS103, GS104, and GS105. In some embodiments, the safe harbor locus is selected from any one of the integration sites in Table 4 designated: GS103, GS104, and GS105. In some embodiments, the safe harbor locus is selected from any one of the integration sites in Table 4 designated: GS94, GS95, and GS96. In some embodiments, the safe harbor locus is the GS94 integration site in Table 4. In some embodiments, the safe harbor locus is selected from any one of the integration sites in Table 4 designated: GS100, GS101, and GS102. In some embodiments, the safe harbor locus is the GS102 integration site in Table 4. In some embodiments, the safe harbor locus is selected from any one of the integration sites in Table 4 designated: GS91, GS92, and GS93
- In another aspect, the present disclosure provides an ex vivo method of obtaining an engineered cell or population thereof, comprising (a) obtaining a cell; and (b) genetically modifying the cell by inserting at least one sequence encoding a transgene within a safe harbor locus, wherein the safe harbor locus is selected from any one of the sgRNA target loci in Table 4.
- In some embodiments, obtaining the cell comprises: (i) collecting a tissue sample from a subject, (ii) isolating the cells from the tissue samples, and (iii) culturing the cells in vitro. In some embodiments, the tissue sample is a blood sample. In some embodiments, the cell is a stem cell, a human cell, a primary cell, an hematopoietic cell, an adaptive immune cell, an innate immune cell, a T cell or T cell precursor. In some embodiments, the at least one sequence is inserted using a homology-directed repair. In some embodiments, the at least one sequence is inserted using a homology independent targeted insertion.
- In some embodiments, the genetically modifying in step (b) comprises contacting the cell with one or more guide ribonucleic acids (gRNAs), the at least one sequence, and one or more Cas9 endonucleases, wherein the one or more gRNAs and Cas9 endonucleases facilitate the insertion of the at least one sequence into chromosomal DNA within the safe harbor locus. In some embodiments, the one or more gRNAs comprises a sequence selected from any one of the sgRNA sequences in Table 4.
- In some embodiments, the transgene encodes a recombinant protein, optionally a therapeutic agent. In some embodiments, the transgene encodes a chimeric antigen receptor (CAR). In some embodiments, the at least one sequence comprises an exogenous promoter and the exogenous promoter is operably linked to the transgene. In some embodiments, the exogenous promoter is an EF1a promoter.
- In yet another aspect, the present disclosure provides a method of treating a subject having or at risk of having a disease, comprising administering to the subject an effective amount of an engineered cell as described herein, a population thereof, or a composition as described herein. In some embodiments, the cell, the population thereof, or the composition is administered to the subject by infusion.
- In yet another aspect, the present disclosure provides a method of treating a subject having or at risk of having a disease, comprising (a) conducting any one of the methods described supra; and (b) administering to the subject an effective amount of a composition comprising the cell or a population thereof. In some embodiments, the composition is administered to the subject by infusion. In some embodiments, the disease is cancer. In some embodiments, the disease is blood cancer.
- In another aspect, the present disclosure provides a method of identifying a safe harbor locus, comprising: (a) identifying genes or non-coding regions in a chromosome that are above a threshold level for expression across developmental cell states and/or a threshold level for accessibility of chromatin; (b) generating a linear model that correlates the gene or non-coding region from step (a) with knock-in (KI) efficiency and estimates the KI efficiency of any gene or coding region on the chromosome; and (c) selecting the safe harbor locus based on threshold parameters; wherein the safe harbor locus is selected for insertion of at least one sequence encoding a transgene within a cell.
- In some embodiments, the threshold parameters include one or more of: stable expression of a transgene, knockout of the gene confers benefit to the function of the cell, no known function within the cell, stable transgene expression in vitro with or without CD3/CD28 stimulation, negligible off-target cleavage as detected by iGuide-Seq or CRISPR-Seq, less off-target cleavage relative to other loci as detected by iGuide-Seq or CRISPR-Seq, negligible transgene-independent cytotoxicity, negligible transgene-independent cytokine expression, negligible transgene-independent chimeric antigen receptor expression, negligible deregulation or silencing of nearby genes, and positioned outside of a cancer-related gene. In some embodiments, the stable expression of a transgene at the safe harbor locus is less than or equal to 2-fold expression change over the course of at least 1, 2, 3, 4, 5, 6, or 7 days, and wherein expression change is measured by mean fluorescence intensity of a reporter gene encoded by the at least one sequence. In some embodiments, the accessibility of chromatin is measured using an assay for transposase-accessible chromatin using sequencing (ATAC-seq). In some embodiments, the level of expression across developmental cell states is measured using RNA sequencing (RNA-seq).
- In some embodiments, the cell is a stem cell, a human cell, a primary cell, an hematopoietic cell, an adaptive immune cell, an innate immune cell, a T cell or T cell precursor. In some embodiments, the linear model has a coefficient of determination (R2 value) of at least 30%.
- In yet another aspect, the present disclosure provides an engineered cell, composition, gRNA or method as described herein, wherein insertion within the safe harbor locus increase cell cytotoxicity of diseased cells.
- In yet another aspect, the present disclosure provides an engineered cell, composition, gRNA or method as described herein, wherein knock-in efficiency at the safe harbor locus is increased relative to other locations along the chromosome.
- These and other features, aspects, and advantages of the present invention will become better understood with regard to the following description, and accompanying drawings, where:
-
FIG. 1 includes a schematic depicting the methodology used to identify safe harbor loci in the present disclosure, in some embodiments. -
FIGS. 2A-2C include schematic depicting 3 top safe harbor loci known in the art: AAVS1 (FIG. 2A ), CCR5 (FIG. 2B ), and Rosa26 (FIG. 2C ). The figures were retrieved from Sadelain, M., et al. (2012). Safe harbours for the integration of new DNA in the human genome. Nature reviews Cancer, 12(1), 51-58, the relevant disclosures of which are herein incorporated by reference in their entirety. -
FIG. 3 includes a schematic outlining the data provided in Roth, T. L., et al. 2019. Rapid discovery of synthetic DNA sequences to rewrite endogenous T cell circuits. bioRxiv, 604561, the relevant disclosures of which are herein incorporated by reference in their entirety. -
FIGS. 4A-4B include heatmaps with samples clustered based on activation status and cell type, generated with processed RNA-seq data.FIG. 4A includes a heatmap of samples generated with processed RNA-seq data.FIG. 4B includes a heatmap sample generated with processed RNA-seq data, showing the clustering of ˜20 k genes. -
FIGS. 5A-5B include plots generated using processed RNA-seq data.FIG. 5A includes a plot showing direct correlation between transcript expression data from Roth, T. L., et al. 2019 and transcript expression data generated by the inventors of the present disclosure.FIG. 5B includes a heatmap showing clusters of the top 10% expressed genes. -
FIG. 6 includes a heatmap of samples generated with processed ATAC-seq data. -
FIGS. 7A-7B includes plots depicting the signal enrichment at transcription start sites (TSS) (FIG. 7A ) and peak size distribution (FIG. 7B ), generated with processed ATAC-seq data for coding regions. -
FIG. 8 includes a plot depicting the open chromatin regions around the TRAC locus. -
FIG. 9 includes a plot depicting KI (knock-in) efficiency for coding regions/genes with GFP as a reporter. -
FIG. 10 includes a plot depicting KI efficiency for coding regions/genes with tNGFR as a reporter. -
FIG. 11 includes plots showing scaled KI efficiency for 90 genes vs d2 (day-2) RNA-seq data, d4 (day-4) RNA-seq data and ATAC-seq data, utilized for the predictive linear model. -
FIG. 12 includes a plot showing chromatin accessibility for a top candidate non-coding region, measured using ATAQ sequencing. -
FIG. 13A includes plots showing the cell counts for GFP controls used for the evaluation of candidate KI loci.FIG. 13B includes pmax (GFP_high) readings for GFP controls used for the evaluation of candidate KI loci. -
FIG. 14A includes plots showing the maximum episomal GFP expression (GFP_high) readings for non-targeting controls.FIG. 14B includes the cell count for non-targeting controls fromdonors -
FIG. 15 includes the cell count and maximum GFP expression (GFP_high) readings for WT controls. -
FIG. 16 includes plots showing the cell count and maximum GFP expression (GFP_high) readings when using sgRNA5, which targets a B2M safe harbor locus, and the construct expression was driven by an endogenous promoter. -
FIG. 17 includes plots showing the cell count and maximum GFP expression (GFP_high) readings when using sgRNA5, which targets a B2M safe harbor locus, and the construct expression was driven by an EF1a promoter. -
FIG. 18 includes plots showing the cell count and maximum GFP expression (GFP_high) readings when using sgRNA79 to target a TRAC safe harbor locus, and the construct expression was driven by an endogenous promoter. -
FIG. 19 includes plots showing the cell count and maximum GFP expression (GFP_high) readings when using sgRNA79, which targets a TRAC safe harbor locus, and the construct expression was driven by an exogenous promoter. -
FIG. 20 includes plots showing TCR (T cell receptor) vs. GFP among all donors and time points when using sgRNA79, and the construct is driven by an endogenous promoter. -
FIG. 21 includes plots showing TCR vs. GFP among all donors and time points when using sgRNA79 and the construct is driven by an exogenous promoter. -
FIG. 22 includes plots showing the cell count and maximum GFP expression (GFP_high) readings when using sgRNA83, which targets a TRAC safe harbor locus, and the construct expression was driven by an endogenous promoter. -
FIG. 23 includes plots showing the cell count and maximum GFP expression (GFP_high) readings when using sgRNA83, which targets a TRAC safe harbor locus, and the construct expression was driven by an exogenous promoter. -
FIG. 24 includes plots showing TCR vs. GFP among all donors and time points when using sgRNA83 and the construct is driven by an endogenous promoter. -
FIG. 25 includes plots showing TCR vs. GFP among all donors and time points when using sgRNA83 and the construct is driven by an exogenous promoter. -
FIG. 26 includes plots illustrating potential sources of variation (e.g., edge effects and electroporation errors) observed between replicates and donors. -
FIG. 27 includes plots illustrating potential sources of variation observed between replicates and donors, e.g., relating to inherent differences between donors. -
FIG. 28 includes plots illustrating potential sources of variation observed between replicates and donors, e.g., relating to gating errors. -
FIG. 29 includes plots showing the GFP mean fluorescence intensity (GFP MFI) and KI efficiency for top KI loci evaluated with endogenous promoters. -
FIG. 30 includes plots showing the GFP mean fluorescence intensity (GFP MFI) and KI efficiency for top KI loci evaluated with the EF1a promoter. -
FIGS. 31A-31C include plots showing all the significant KI loci (top KI loci) as evaluated with endogenous and EF1a promoters.FIG. 31A includes a plot showing that the expression from the EF1a promoter was approximately 10 times higher than expression from an endogenous promoter.FIG. 31B shows the top KI loci ranked by GFP MFI atweek 3.FIG. 31C shows the top integration loci ranked by GFP MFI (atweeks -
FIG. 32A includes a plot showing some target loci and their measured transgene expression levels.FIG. 32B includes plots showing the transgene (Prime Receptor (PrimeR)) and TCR expression for the control and insertion at GS94, GS102 and TRAC loci. -
FIG. 33A includes a plot showing the PrimeR levels measured for the indicated integration sites.FIG. 33B includes a schematic showing the GS94 integration site onChromosome 11. -
FIG. 34A includes plots showing CAR induction and primeR expression of engineered T cells after 48 hours of coculturing with K562-CD19 cells.FIG. 34B includes plots showing the cytotoxity and cytokine secretion levels for engineered T cells 48 hours of coculturing with K562-CD19/MSLN cells. -
FIG. 35A includes a schematic showing the experimental overview for evaluating the effect of integration site on cytotoxicity.FIG. 35B includes plots showing the measured cytotoxicity for engineered T cells cocultured for 48 hours with the K562 CD19+/MSLN+ or K562 CD19−/MSLN+ cells. -
FIG. 36A includes a schematic showing the experimental overview for evaluating the effect of integration site on cytokine secretion.FIG. 36B includes plots showing the measured cytokine levels for engineered T cells cocultured for 48 hours with K562 CD19+/MSLN+ cells. -
FIG. 37 includes a schematic showing the in vitro experiment conducted to determin the effect of integration site on primeR-independent CAR expression. “Flow” refers to flow cytometry and “restim” refers to repetitive CD3/CD28 stimulation of the engineered T cells. “EP” refers to electroporation. -
FIGS. 38A and 38B include plots showing the stability of PrimeR expression over time when using the indicated integration sites. “Flow” refers to flow cytometry and “restim” refers to repetitive CD3/CD28 stimualation of the engineered T cells. “EP” refers to electroporation. InFIG. 38B , the PrimeR expression is normalized to the expression from using the TRAC integration site. -
FIG. 39A includes a schematic showing the iGuide-Seq assay technique.FIG. 39B includes a plot showing the on-target efficiency, using iGuide Seq assay, for the indicated integration sites.FIG. 39C includes schematics from the iGuide-Seq analysis showing that GS94 had no reproducible putatitve off-targets across two donorsFIG. 39C discloses SEQ ID NOS 312-322, 312-313, and 323-331, respectively, in order of appearance. -
FIG. 40 includes a schematic showing the iGuide-Seq workflow and data. -
FIG. 41 includes a plot showing rhAmp-seg™ analysis of putative off-target sites identified by iGUIDE-seq and Elevation prediction. -
FIG. 42 includes plots showing RNA-seq analysis of cells with GS94, GS102 and TRAC knock-in of CD19/MSLN circuits. Scatterplot of gene expression in cells with integration at the GS94 locus (y-axis) vs cells with integration at either the TRAC or the GS102 locus (x-axis) in two donors. The yellow dots correspond to ETS1 and FLI1. In blue are the genes that were found to be differentially expressed using edgeR (fold-change >0, FDR-corrected p-value <0.01, average counts-per-million across compared conditions at least 2). -
FIG. 43 includes plots showing the absence of cytokine-independent growth in cells with CD19/MSLN circuit KI at GS94. - The present disclosure provides safe harbor loci and methods for identifying safe harbor loci that exhibit high integration efficiency (e.g., high knock-in (KI) efficiency), high and constant levels of transgene expression, and such benefits independent of T cell activation/differentiation state. In some embodiments, the safe harbor loci also exhibit minimal to no disruption to T cell function and/or capacity for product manufacturing. In some embodiments, these loci are useful for effective and safe integration and expression of transgenes in T cells (e.g. in CAR T therapies). In some embodiments, the methods described herein can be used for the identification of safe harbor loci for insertion of transgenes in other types of cells.
-
FIG. 1 illustrates the overall approach that the inventors of the present disclosure used to identify safe harbor loci. In some embodiments, the present disclosure provides a method comprising the identification of genes within a genome and non-coding regions with sustained expression in a treatment cell (e.g. T cell) and using a predictive model of KI efficiency as a function of T cell chromatic state and computational analysis to predict candidate integration sites. The method further comprises evaluating the candidate integration sites for actual KI efficiency, sustained levels of transgene expression of a transgene, and minimal disruption to the treatment cell phenotype (e.g., T cell function and/or capacity for treatment product expansion and manufacturing). In some embodiments, the safe harbor loci allow for integration of a transgene driven by an endogenous promoter. In some embodiments, the safe harbor loci allow for integration of a transgene driven by an exogenous promoter (e.g. EF1a promoter). - To facilitate an understanding of the present disclosure, a number of terms and phrases are defined below. Unless otherwise defined herein, scientific and technical terms used in this application shall have the meanings that are commonly understood by those of ordinary skill in the art. Generally, nomenclature used in connection with, and techniques of pharmacology, cell and tissue culture, molecular biology, cell and cancer biology, neurobiology, neurochemistry, virology, immunology, microbiology, genetics and protein and nucleic acid chemistry, described herein, are those well-known and commonly used in the art. In case of conflict, the present specification, including definitions, will control.
- Unless otherwise defined, all terms of art, notations and other scientific terminology used herein are intended to have the meanings commonly understood by those of skill in the art. In some cases, terms with commonly understood meanings are defined herein for clarity and/or for ready reference, and the inclusion of such definitions herein should not necessarily be construed to represent a difference over what is generally understood in the art. The techniques and procedures described or referenced herein are generally well understood and commonly employed using conventional methodologies by those skilled in the art, such as, for example, the widely utilized molecular cloning methodologies described in Sambrook et al., Molecular Cloning: A Laboratory Manual 4th ed. (2012) Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY. As appropriate, procedures involving the use of commercially available kits and reagents are generally carried out in accordance with manufacturer-defined protocols and conditions unless otherwise noted.
- As used herein, the singular forms “a,” “an,” and “the” include the plural referents unless the context clearly indicates otherwise. The terms “include,” “such as,” and the like are intended to convey inclusion without limitation, unless otherwise specifically indicated.
- As used herein, the term “comprising” also specifically includes embodiments “consisting of” and “consisting essentially of” the recited elements, unless specifically indicated otherwise.
- The term “about” indicates and encompasses an indicated value and a range above and below that value. In certain embodiments, the term “about” indicates the designated value ±10%, ±5%, or ±1%. In certain embodiments, where applicable, the term “about” indicates the designated value(s) ±one standard deviation of that value(s).
- As used herein, the term “gene” refers to the basic unit of heredity, consisting of a segment of DNA arranged along a chromosome, which codes for a specific protein or segment of protein. A gene typically includes a promoter, a 5′ untranslated region, one or more coding sequences (exons), optionally introns, a 3′ untranslated region. The gene may further comprise a terminator, enhancers and/or silencers.
- As used herein, the term “locus” refers to a specific, fixed physical location on a chromosome where a gene or genetic marker is located.
- As used herein, the term “target locus” refers to a locus on a chromosome within which a safe harbor locus can be used for the insertion of a sequence. A target locus can consist of multiple potential safe harbor loci (integration sites). Examples of target loci are provided in Table 4, as sgRNA target loci. The notation used for the sgRNA target loci in Table 4 refers to the genomic region of the target locus, defined by the chromosome of the target locus and the coordinate range for that target locus. For example, chr10:33130000-33140000 refers to a target locus on Chr10 (chromosome 10) starting from coordinate 33130000 and ending with coordinate 33140000.
- The term “safe harbor locus” refers to a locus at which genes or genetic elements can be incorporated without disruption to expression or regulation of adjacent genes. These safe harbor loci are also referred to as safe harbor sites (SHS). As used herein, a safe harbor locus refers to an “integration site” or “knock-in site” at which a sequence encoding a transgene, as defined herein, can be inserted. In some embodiments the insertion occurs with replacement of a sequence that is located at the integration site. In some embodiments, the insertion occurs without replacement of a sequence at the integration site. Examples of integration sites contemplated are provided in Table 4.
- As used herein, the term “insert” refers to a nucleotide sequence that is integrated (inserted) at a safe harbor site. The insert can be used to refer to the genes or genetic elements that are incorporated at the safe harbor site using, for example, homology-directed repair (HDR) CRISPR/Cas9 genome-editing or other methods for inserting nucleotide sequences into a genomic region known to those of ordinary skill in the art.
- The “CRISPR/Cas” system refers to a widespread class of bacterial systems for defense against foreign nucleic acid. CRISPR/Cas systems are found in a wide range of eubacterial and archaeal organisms. CRISPR/Cas systems include type I, II, and III sub-types. Wild-type type II CRISPR/Cas systems utilize an RNA-mediated nuclease, Cas9 in complex with guide and activating RNA to recognize and cleave foreign nucleic acid. Guide RNAs having the activity of both a guide RNA and an activating RNA are also known in the art. In some cases, such dual activity guide RNAs are referred to as a small guide RNA (sgRNA).
- Cas9 homologs are found in a wide variety of eubacteria, including, but not limited to bacteria of the following taxonomic groups: Actinobacteria, Aquificae, Bacteroidetes-Chlorobi, Chlamydiae-Verrucomicrobia, Chlroflexi, Cyanobacteria, Firmicutes, Proteobacteria, Spirochaetes, and Thermotogae. An exemplary Cas9 protein is the Streptococcus pyogenes Cas9 protein. Additional Cas9 proteins and homologs thereof are described in, e.g., Chylinksi, et al., RNA Biol. 2013 May 1; 10(5): 726-737; Nat. Rev. Microbiol. 2011 June; 9(6): 467-477; Hou, et al., Proc Natl Acad Sci USA. 2013 Sep. 24; 110(39):15644-9; Sampson et al., Nature. 2013 May 9; 497(7448):254-7; and Jinek, et al., Science. 2012 Aug. 17; 337(6096):816-21. The Cas9 nuclease domain can be optimized for efficient activity or enhanced stability in the host cell.
- As used herein, the term “Cas9” refers to an RNA-mediated nuclease (e.g., of bacterial or archeal orgin, or derived therefrom). Exemplary RNA-mediated nuclases include the foregoing Cas9 proteins and homologs thereof, and include but are not limited to, CPF1 (See, e.g., Zetsche et al., Cell, Volume 163,
Issue 3, p 759-771, 22 Oct. 2015). Similarly, as used herein, the term “Cas9 ribonucleoprotein” complex and the like refers to a complex between the Cas9 protein, and a crRNA (e.g., guide RNA or small guide RNA), the Cas9 protein and a trans-activating crRNA (tracrRNA), the Cas9 protein and a small guide RNA, or a combination thereof (e.g., a complex containing the Cas9 protein, a tracrRNA, and a crRNA guide RNA). - As used herein, the terms “T lymphocyte” and “T cell” are used interchangeably and refer to cells that have completed maturation in the thymus, and identify certain foreign antigens in the body. The terms also refer to the major leukocyte types that have various roles in the immune system, including activation and deactivation of other immune cells. The T cell can be any T cell such as a cultured T cell, e.g., a primary T cell, or a T cell derived from a cultured T cell line, e.g., a Jurkat, SupT1, etc., or a T cell obtained from a mammal. The T cell can be a CD3+ cell. The T cell can be any type of T cell, CD4+/CD8+ double positive T cells, CD4+ helper T cells (e.g. Th1 and Th2 cells), CD8+ T cells (e.g. cytotoxic T cells), peripheral Including but not limited to blood mononuclear cells (PBMC), peripheral blood leukocytes (PBL), tumor infiltrating lymphocytes (TIL), memory T cells, naive T cells, regulatory T cells, γδ T cells, etc. It can be any T cell at any stage of development. Additional types of helper T cells include Th3 (Treg) cells, Th17 cells, Th9 cells, or Tfh cells. Additional types of memory T cells include cells such as central memory T cells (Tcm cells), effector memory T cells (Tem cells and TEMRA cells). A T cell can also refer to a genetically modified T cell, such as a T cell that has been modified to express a T cell receptor (TCR) or a chimeric antigen receptor (CAR). T cells can also be differentiated from stem cells or progenitor cells.
- “CD4+ T cells” refers to a subset of T cells that express CD4 on their surface and are associated with a cellular immune response. CD4+ T cells are characterized by a post-stimulation secretion profile that can include secretion of cytokines such as IFN-γ, TNF-α, IL-2, IL-4 and IL-10. “CD4” is a 55 kD glycoprotein originally defined as a differentiation antigen on T lymphocytes, but was also found on other cells including monocytes/macrophages. The CD4 antigen is a member of the immunoglobulin superfamily and has been implicated as an associative recognition element in MHC (major histocompatibility complex) class II restricted immune responses. On T lymphocytes, the CD4 antigen defines a helper/inducer subset.
- “CD8+ T cells” refers to a subset of T cells that express CD8 on their surface, are MHC class I restricted, and function as cytotoxic T cells. The “CD8” molecule is a differentiation antigen present on thymocytes, as well as on cytotoxic and suppressor T lymphocytes. The CD8 antigen is a member of the immunoglobulin superfamily and is an associative recognition element in major histocompatibility complex class I restriction interactions.
- As used herein, the term “ex vivo” generally includes experiments or measurements made in or on living tissue, preferably in an artificial environment outside the organism, preferably with minimal differences from natural conditions.
- As used herein, the term “construct” refers to a complex of molecules, including macromolecules or polynucleotides.
- As used herein, the term “integration” refers to the process of stably inserting one or more nucleotides of a construct into the cell genome, i.e., covalently linking to a nucleic acid sequence in the chromosomal DNA of the cell. It may also refer to nucleotide deletions at a site of integration. Where there is a deletion at the insertion site, “integration” may further include substitution of the endogenous sequence or nucleotide deleted with one or more inserted nucleotides.
- As used herein, the term “exogenous” refers to a molecule or activity that has been introduced into a host cell and is not native to that cell. The molecule can be introduced, for example, by introduction of the encoding nucleic acid into host genetic material, such as by integration into a host chromosome, or as non-chromosomal genetic material, such as a plasmid. Thus, the term, when used in connection with expression of an encoding nucleic acid, refers to the introduction of the encoding nucleic acid into a cell in an expressible form. The term “endogenous” refers to a molecule or activity that is present in a host cell under natural, unedited conditions. Similarly, the term, when used in connection with expression of the encoding nucleic acid, refers to expression of the encoding nucleic acid that is contained within the cell and not introduced exogenously.
- As used herein, a “polynucleotide donor construct” refers to a nucleotide sequence (e.g. DNA sequence) that is genetically inserted into a polynucleotide and is exogenous to that polynucleotide. The polynucleotide donor construct is transcribed into RNA and optionally translated into a polypeptide. The polynucleotide donor construct can include prokaryotic sequences, cDNA from eukaryotic mRNA, genomic DNA sequences from eukaryotic (e.g., mammalian) DNA, and synthetic DNA sequences. For example, the polynucleotide donor construct can be a miRNA, shRNA, natural polypeptide (i.e., a naturally occurring polypeptide) or fragment thereof or a variant polypeptide (e.g. a natural polypeptide having less than 100% sequence identity with the natural polypeptide) or fragments thereof.
- As used herein, the term “transgene” refers to a polynucleotide that has been transferred naturally, or by any of a number of genetic engineering techniques from one organism to another. It is optionally translated into a polypeptide. It is optionally translated into a recombinant protein. A “recombinant protein” is a protein encoded by a gene—recombinant DNA—that has been cloned in a system that supports expression of the gene and translation of messenger RNA (see expression system). The recombinant protein can be a therapeutic agent, e.g. a protein that treats a disease or disorder disclosed herein. As used, transgene can refer to a polynucleotide that encodes a polypeptide. A transgene can also refer to a non-encoding sequence, such as, but not limited to shRNAs, miRNAs, and miRs.
- The terms “protein,” “polypeptide,” and “peptide” are used herein interchangeably.
- As used herein, the term “operably linked” refers to the binding of a nucleic acid sequence to a single nucleic acid fragment such that one function is affected by the other. For example, if a promoter is capable of affecting the expression of a coding sequence or functional RNA (i.e., the coding sequence or functional RNA is under transcriptional control by the promoter), the promoter is operably linked thereto. Coding sequences can be operably linked to control sequences in both sense and antisense orientation.
- As used herein, the term “developmental cell states” refers to, for example, states when the cell is inactive, actively expressing, differentiating, senescent, etc. developmental cell state may also refer to a cell in a precursor state (e.g., a T cell precursor).
- As used, the term “encoding” refers to a sequence of nucleic acids which codes for a protein or polypeptide of interest. The nucleic acid sequence may be either a molecule of DNA or RNA. In preferred embodiments, the molecule is a DNA molecule. In other preferred embodiments, the molecule is a RNA molecule. When present as a RNA molecule, it will comprise sequences which direct the ribosomes of the host cell to start translation (e.g., a start codon, ATG) and direct the ribosomes to end translation (e.g., a stop codon). Between the start codon and stop codon is an open reading frame (ORF). Such terms are known to one of ordinary skill in the art.
- The term “inserting” refers to a manipulation of a nucleotide sequence to introduce a non-native sequence. This is done, for example, via the use of restriction enzymes and ligases whereby the DNA sequence of interest, usually encoding the gene of interest, can be incorporated into another nucleic acid molecule by digesting both molecules with appropriate restriction enzymes in order to create compatible overlaps and then using a ligase to join the molecules together. One skilled in the art is very familiar with such manipulations and examples may be found in Sambrook et al. (Sambrook, Fritsch, & Maniatis, “Molecular Cloning: A Laboratory Manual”, 2nd ed., Cold Spring Harbor Laboratory, 1989), which is hereby incorporated by reference in its entirety including any drawings, figures and tables.
- As used herein, the term “subject” refers to a mammalian subject. Exemplary subjects include humans, monkeys, dogs, cats, mice, rats, cows, horses, camels, goats, rabbits, pigs and sheep. In certain embodiments, the subject is a human. In some embodiments the subject has a disease or condition that can be treated with an engineered cell provided herein or population thereof. In some aspects, the disease or condition is a cancer.
- As used herein, the term “promoter” refers to a nucleotide sequence (e.g. DNA sequence) capable of controlling the expression of a coding sequence or functional RNA. The promoter sequence consists of proximal and more distal upstream elements, the latter elements often referred to as enhancers. A promoter can be derived from natural genes in its entirety, can be composed of different elements from different promoters found in nature, and/or may comprise synthetic DNA segments. A promoter, as contemplated herein, can be endogenous to the cell of interest or exogenous to the cell of interest. It is appreciated by those skilled in the art that different promoters can induce gene expression in different tissue or cell types, or at different developmental stages, or in response to different environmental conditions. As is known in the art, a promoter can be selected according to the strength of the promoter and/or the conditions under which the promoter is active, e.g., constitutive promoter, strong promoter, weak promoter, inducible/repressible promoter, tissue specific Or developmentally regulated promoters, cell cycle-dependent promoters, and the like.
- A promoter can be an inducible promoter (e.g., a heat shock promoter, tetracycline-regulated promoter, steroid-regulated promoter, metal-regulated promoter, estrogen receptor-regulated promoter, etc.). The promoter can be a constitutive promoter (e.g., CMV promoter, UBC promoter). In some embodiments, the promoter can be a spatially restricted and/or temporally restricted promoter (e.g., a tissue specific promoter, a cell type specific promoter, etc.). See for example U.S. application Ser. No. 15/715,068, the disclosures of which are herein incorporated by reference in their entirety.
- Gene editing, as contemplated herein, may involve a gene (or nucleotide sequence) knock-in or knock-out. As used herein, the term “knock-in” refers to an addition of a DNA sequence, or fragment thereof into a genome. Such DNA sequences to be knocked-in may include an entire gene or genes, may include regulatory sequences associated with a gene or any portion or fragment of the foregoing. For example, a polynucleotide donor construct encoding a recombinant protein may be inserted into the genome of a cell carrying a mutant gene. In some embodiments, a knock-in strategy involves substitution of an existing sequence with the provided sequence, e.g., substitution of a mutant allele with a wild-type copy. On the other hand, the term “knock-out” refers to the elimination of a gene or the expression of a gene. For example, a gene can be knocked out by either a deletion or an addition of a nucleotide sequence that leads to a disruption of the reading frame. As another example, a gene may be knocked out by replacing a part of the gene with an irrelevant (e.g., non-coding) sequence.
- As used herein, the term “non-homologous end joining” or NHEJ refers to a cellular process in which cut or nicked ends of a DNA strand are directly ligated without the need for a homologous template nucleic acid. NHEJ can lead to the addition, the deletion, substitution, or a combination thereof, of one or more nucleotides at the repair site.
- As used herein, the term “homology directed repair” or HDR refers to a cellular process in which cut or nicked ends of a DNA strand are repaired by polymerization from a homologous template nucleic acid. Thus, the original sequence is replaced with the sequence of the template. The homologous template nucleic acid can be provided by homologous sequences elsewhere in the genome (sister chromatids, homologous chromosomes, or repeated regions on the same or different chromosomes). Alternatively, an exogenous template nucleic acid can be introduced to obtain a specific HDR-induced change of the sequence at the target site. In this way, specific mutations can be introduced at the cut site.
- The terms “vector” and “plasmid” are used interchangeably and as used herein refer to polynucleotide vehicles useful to introduce genetic material into a cell. Vectors can be linear or circular. Vectors can integrate into a target genome of a host cell or replicate independently in a host cell. Vectors can comprise, for example, an origin of replication, a multicloning site, and/or a selectable marker. An expression vector typically comprises an expression cassette. Vectors and plasmids include, but are not limited to, integrating vectors, prokaryotic plasmids, eukaryotic plasmids, plant synthetic chromosomes, episomes, viral vectors, cosmids, and artificial chromosomes.
- As used herein the term “expression cassette” is a polynucleotide construct, generated recombinantly or synthetically, comprising regulatory sequences operably linked to a selected polynucleotide to facilitate expression of the selected polynucleotide in a host cell. For example, the regulatory sequences can facilitate transcription of the selected polynucleotide in a host cell, or transcription and translation of the selected polynucleotide in a host cell. An expression cassette can, for example, be integrated in the genome of a host cell or be present in an expression vector.
- As used herein, the phrase “subject in need thereof” refers to a subject that exhibits and/or is diagnosed with one or more symptoms or signs of a disease or disorder as described herein.
- A “chemotherapeutic agent” refers to a chemical compound useful in the treatment of cancer. Chemotherapeutic agents include “anti-hormonal agents” or “endocrine therapeutics” which act to regulate, reduce, block, or inhibit the effects of hormones that can promote the growth of cancer.
- The term “composition” refers to a mixture that contains, e.g., an engineered cell or protein contemplated herein. In some embodiments, the composition may contain additional components, such as adjuvants, stabilizers, excipients, and the like. The term “composition” or “pharmaceutical composition” refers to a preparation which is in such form as to permit the biological activity of an active ingredient contained therein to be effective in treating a subject, and which contains no additional components which are unacceptably toxic to the subject in the amounts provided in the pharmaceutical composition.
- As used herein, the term “effective amount” refers to the amount of a compound (e.g., a compositions described herein, cells described herein) sufficient to effect beneficial or desired results. An effective amount can be administered in one or more administrations, applications or dosages and is not intended to be limited to a particular formulation or administration route. As used herein, the term “treating” includes any effect, e.g., lessening, reducing, modulating, ameliorating or eliminating, that results in the improvement of the condition, disease, disorder, and the like, or ameliorating a symptom thereof.
- The terms “modulate” and “modulation” refer to reducing or inhibiting or, alternatively, activating or increasing, a recited variable.
- The terms “increase” and “activate” refer to an increase of 10%, 20%, 30%, 40%, 50%, 60%, 70%, 75%, 80%, 85%, 90%, 95%, 100%, 2-fold, 3-fold, 4-fold, 5-fold, 10-fold, 50-fold, 100-fold, or greater in a recited variable.
- The terms “reduce” and “inhibit” refer to a decrease of 10%, 20%, 30%, 40%, 50%, 60%, 70%, 75%, 80%, 85%, 90%, 95%, 2-fold, 3-fold, 4-fold, 5-fold, 10-fold, 20-fold, 100-fold, or greater in a recited variable.
- 1. Safe Harbor Loci
- Gene editing therapies include, for example, viral vector integration and site specific integration. Site-specific integration is a promising alternative to random integration of viral vectors, as it mitigates the risks of insertional mutagenesis or insertional oncogenesis (Kolb et al. Trends Biotechnol. 2005 23:399-406; Porteus et al. Nat Biotechnol. 2005 23:967-973; Paques et al. Curr Gen Ther. 2007 7:49-66). However, site specific integration continues to face challenges such as poor knock-in efficiency, risk of insertional oncogenesis, unstable and/or anomalous expression of adjacent genes or the transgene, low accessibility (e.g. within kB of adjacent genes), etc. These challenges can be addressed, in part, through the identification and use of safe harbor loci or safe harbor sites (SHS), which are sites in which genes or genetic elements can be incorporated without disruption to expression or regulation of adjacent genes.
- The most widely used of the putative human safe harbor sites is the AAVS1 site on chromosome 19q, which was initially identified as a site for recurrent adenoassociated virus insertion. Other potential SHS have been identified on the basis of homology, with sites first identified in other species (e.g., the human homolog of the permissive murine Rosa26 locus) or among the growing number of human genes that appear non-essential under some circumstances. One putative SHS of this type is the CCR5 chemokine receptor gene, which, when disrupted, confers resistance to human immunodeficiency virus infection. Additional potential genomic SHS have been identified in human and other cell types on the basis of viral integration site mapping or gene-trap analyses, as was the original murine Rosa26 locus. The three top SHS, AAVS1, CCR5, and Rosa26, are in close proximity to many protein coding genes and regulatory elements. (See
FIG. 2 from Sadelain, M., et al. (2012). Safe harbours for the integration of new DNA in the human genome. Nature reviews Cancer, 12(1), 51-58, the relevant disclosures of which are herein incorporated by reference in their entirety). - The AAVS1 (also known as the PPP 1R12C locus) on
human chromosome 19 is a known SHS for hosting transgenes (e.g. DNA transgenes) with expected function. It is at position 19q13.42. It has an open chromatin structure and is transcription-competent. The canonical SHS locus for AAVS1 is chr19: 55,625,241-55,629,351. See Pellenz et al. “New Human Chromosomal Sites with “Safe Harbor” Potential for Targeted Transgene Insertion.” Human gene therapy vol. 30,7 (2019): 814-828, the relevant disclosures of which are herein incorporated by reference. An exemplary AAVS1 target gRNA and target sequence are provided below: -
AAVS1-gRNA sequence: (SEQ ID NO: 121) ggggccactagggacaggatGTTTTAGAGCTAGAA ATAGCAAGTTAAAATAAGGCTAGTCCGTTATCAAC TTGAAAAAGTGGCACCGAGTCGGTGCTTTTTTT AAVS1 target sequence: (SEQ ID NO: 122) ggggccactagggacaggat - CCR5, which is located on
chromosome 3 at position 3p21.31, encodes the major co-receptor for HIV-1. Disruption at this site in the CCR5 gene has been beneficial in HIV/AIDS therapy and prompted the development of zinc-finger nucleases that target its third exon. The canonical SHS locus for CCR5 is chr3: 46,414,443-46,414,942. See Pellenz et al. “New Human Chromosomal Sites with “Safe Harbor” Potential for Targeted Transgene Insertion.” Human gene therapy vol. 30,7 (2019): 814-828, the relevant disclosures of which are herein incorporated by reference. - The mouse Rosa26 locus is particularly useful for genetic modification as it can be targeted with high efficiency and is expressed in most cell types tested. Irion et al. 2007 (“Identification and targeting of the ROSA26 locus in human embryonic stem cells.” Nature biotechnology 25.12 (2007): 1477-1482, the relevant disclosure of which are herein incorporated by reference) identified the human homolog, human ROSA26, in chromosome 3 (position 3p25.3). The canonical SHS locus for human Rosa26 (hRosa26) is chr3: 9,415,082-9,414,043. See Pellenz et al. “New Human Chromosomal Sites with “Safe Harbor” Potential for Targeted Transgene Insertion.” Human gene therapy vol. 30,7 (2019): 814-828, the relevant disclosures of which are herein incorporated by reference.
- Additional examples of safe harbor sites are provided in Pellenz et al. “New Human Chromosomal Sites with “Safe Harbor” Potential for Targeted Transgene Insertion.” Human gene therapy vol. 30,7 (2019): 814-828, the relevant disclosures of which are herein incorporated by reference.
- The present disclosure is directed to methods for identifying safe harbor loci with benefits including, but not limited to, high knock-in efficiency and high expression of transgene. An example of applications of the presently disclosed methods is the identification of safe harbor loci for insertion of transgenes (e.g., chimeric antigen receptors (CAR)) into T-cells. In some embodiments, the safe harbor loci of the present disclosure are useful for the insertion of a sequence encoding a transgene. In some embodiments, the safe harbor sites allow for high transgene expression (sufficient to allow for transgene functionality or treatment of a disease of interest) and stable expression of the transgene over several days, weeks or months. In some embodiments, knockout of the gene at the safe harbor locus confers benefit to the function of the cell, or the gene at the safe harbor locus has no known function within the cell. In some embodiments the safe harbor locus results in stable transgene expression in vitro with or without CD3/CD28 stimulation, negligible off-target cleavage as detected by iGuide-Seq or CRISPR-Seq, less off-target cleavage relative to other loci as detected by iGuide-Seq or CRISPR-Seq, negligible transgene-independent cytotoxicity, negligible transgene-independent cytokine expression, negligible transgene-independent chimeric antigen receptor expression, negligible deregulation or silencing of nearby genes, and positioned outside of a cancer-related gene.
- As used, a “nearby gene” can refer to a gene that is within about 100 kB, about 125 kB, about 150 kB, about 175 kB, about 200 kB, about 225 kB, about 250 kB, about 275 kB, about 300 kB, about 325 kB, about 350 kB, about 375 kB, about 400 kB, about 425 kB, about 450 kB, about 475 kB, about 500 kB, about 525 kB, about 550 kB away from the safe harbor locus (integration site).
- In some embodiments, the present disclosure contemplates inserts that comprise one or more transgenes. The transgene can encode a therapeutic protein, an antibody, a peptide, a suicide gene, an apoptosis gene or any other gene of interest. The safe harbor loci identified using the method described herein allow for transgene integration that results in, for example, enhanced therapeutic properties. These enhanced therapeutic properties, as used herein, refer to an enhanced therapeutic property of a cell when compared to a typical immune cell of the same normal cell type. For example, an NK cell having “enhanced therapeutic properties” has an enhanced, improved, and/or increased treatment outcome when compared to a typical, unmodified and/or naturally occurring NK cell. The therapeutic properties of immune cells can include, but are not limited to, cell transplantation, transport, homing, viability, self-renewal, persistence, immune response control and regulation, survival, and cytotoxicity. The therapeutic properties of immune cells are also manifested by: antigen-targeted receptor expression; HLA presentation or lack thereof; tolerance to the intratumoral microenvironment; induction of bystander immune cells and immune regulation; improved target specificity with reduction; resistance to treatments such as chemotherapy.
- As used herein, the term “insert size” refers to the length of the nucleotide sequence being integrated (inserted) at the safe harbor site. In some embodiments, the insert size comprises at least about 100, 200, 300, 400 or 500 basepairs. In some embodiments, the insert size comprises about 500 nucleotides or basepairs. In some embodiments, the insert size comprises up to 0.5, 0.6, 0.7, 0.8, 0.9, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20 kbp (kilo basepairs) or the sizes in between. In some embodiments, the insert size is greater than 0.5, 0.6, 0.7, 0.8, 0.9, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20 kbp or the sizes in between. In some embodiments, the insert size is within the range of 3-15 kbp or is any number in that range. In some embodiments, the insert size is within the range of 1.5-8.3 kbp or is any number in that range. In some embodiments, the insert size is within the range of 1.5-15 kbp or is any number in that range. In some embodiments, the insert size is within the range of 0.5-20 kbp or is any number in that range. In some embodiments, the insert size is 0.5-10, 0.6-10, 0.7-10, 0.8-10, 0.9-10, 1-10, 2-10, 3-10, 4-10, 5-10, 6-10, 7-10, 8-10, 9-10 kbp. In some embodiments, the insert size is 0.5-11, 0.6-11, 0.7-11, 0.8-11, 0.9-11, 1-11, 2-11, 3-11, 4-11, 5-11, 6-11, 7-11, 8-11, 9-11, or 10-11 kbp. In some embodiments, the insert size is 0.5-12, 0.6-12, 0.7-12, 0.8-12, 0.9-12, 1-12, 2-12, 3-12, 4-12, 5-12, 6-12, 7-12, 8-12, 9-12, 10-12, or 11-12 kbp. In some embodiments, the insert size is 0.5-13, 0.6-13, 0.7-13, 0.8-13, 0.9-13, 1-13, 2-13, 3-13, 4-13, 5-13, 6-13, 7-13, 8-13, 9-13, 10-13, 11-13, or 12-13 kbp. In some embodiments, the insert size is 0.5-14, 0.6-14, 0.7-14, 0.8-14, 0.9-14, 1-14, 2-14, 3-14, 4-14, 5-14, 6-14, 7-14, 8-14, 9-14, 10-14, 11-14, 12-14 or 13-14 kbp. In some embodiments, the insert size is 0.5-15, 0.6-15, 0.7-15, 0.8-15, 0.9-15, 2-15, 3-15, 4-15, 5-15, 6-15, 7-15, 8-15, 9-15, 10-15, 11-15, 12-15, 13-15, or 14-15 kbp. In some embodiments, the insert size is 0.5-16, 0.6-16, 0.7-16, 0.8-16, 0.9-16, 1-16, 2-16, 3-16, 4-16, 5-16, 6-16, 7-16, 8-16, 9-16, 10-16, 11-16, 12-16, 13-16, 14-16 or 15-16 kbp. In some embodiments, the insert size is 0.5-17, 0.6-17, 0.7-17, 0.8-17, 0.9-17, 1-17, 2-17, 3-17, 4-17, 5-17, 6-17, 7-17, 8-17, 9-17, 10-17, 11-17, 12-17, 13-17, or 14-17, 15-17 or 16-17 kbp. In some embodiments, the insert size is 0.5-18, 0.6-18, 0.7-18, 0.8-18, 0.9-18, 1-18, 2-18, 3-18, 4-18, 5-18, 6-18, 7-18, 8-18, 9-18, 10-18, 11-18, 12-18, 13-18, 14-18, 15-18, 16-18 or 17-18 kbp. In some embodiments, the insert size is 0.5-19, 0.6-19, 0.7-19, 0.8-19, 0.9-19, 1-19, 2-19, 3-19, 4-19, 5-19, 6-19, 7-19, 8-19, 9-19, 10-19, 11-19, 12-19, 13-19, 14-0.7-20, 19, 15-19, 16-19, 17-19, or 18-19 kbp. In some embodiments, the insert size is 0.5-20, 0.6-20, 0.7-20, 0.8-20, 0.9-20, 1-20, 2-20, 3-20, 4-20, 5-20, 6-20, 7-20, 8-20, 9-20, 10-20, 11-20, 12-20, 13-20, 14-20, 15-20, 16-20, 17-20, 18-20, or 19-20 kbp.
- The inserts of the present disclosure refer to nucleic acid molecules or polynucleotide inserted at a safe harbor site. In some embodiments, the nucleotide sequence is a DNA molecule, e.g., genomic DNA, or comprises deoxy-ribonucleotides. In some embodiments, the insert comprises a smaller fragment of DNA, such as a plastid DNA, mitochondrial DNA, or DNA isolated in the form of a plasmid, a fosmid, a cosmid, a bacterial artificial chromosome (BAC), a yeast artificial chromosome (YAC), and/or any other sub-genome segment of DNA. In some embodiments, the insert is an RNA molecule or comprises ribonucleotides. The nucleotides in the insert are contemplated as naturally occurring nucleotides, non-naturally occurring, and modified nucleotides. Nucleotides may be modified chemically or biochemically, or may contain non-natural or derivatized nucleotide bases, as will be readily appreciated by those of skill in the art. Such modifications include, for example, labels, methylation, substitution of one or more of the naturally occurring nucleotides with an analog, internucleotide modifications. The polynucleotides can be in any topological conformation, including single-stranded, double-stranded, partially duplexed, triplexed, hairpinned, circular conformations, and other three-dimension conformations contemplated in the art.
- The inserts can have coding and/or non-coding regions. The insert can comprises a non-coding sequence (e.g., control elements, e.g., a promoter sequence). In some embodiments, the insert encodes transcription factors. In some embodiments, the insert encodes an antigen binding receptors such as single receptors, T-cell receptors (TCRs), syn-notch, CARs, mAbs, etc. In some embodiments, the inserts are RNAi molecules, including, but not limited to, miRNAs, siRNA, shRNAs, etc. In some embodiments, the insert is a human sequence. In some embodiments, the insert is chimeric. In some embodiments, the insert is a multi-gene/multi-module therapeutic cassette. A multi-gene/multi-module therapeutic cassette referst to an insert or cassette having one or more than one receptor (e.g., synthetic receptors), other exogenous protein coding sequences, non-coding RNAs, transcriptional regulatory elements, and/or insulator sequences, etc.
- Various cell types are contemplated as having the safe harbor sites in the present disclosure. A cell comprising a safe harbor site and/or a cell comprising an insert at a safe harbor site as described in the present disclosure can be referred to as an engineered cell. The cells can include, but are not limited to, eukaryotic cells, prokaryotic cells, animal cells, plant cells, fungal cells and the like. Optionally, the cell is a mammalian cell, for example, a human cell. In some embodiments, that engineered cell is a stem cell, a human cell, a primary cell, an hematopoietic cell, an adaptive immune cell, an innate immune cell, a T cell or a T cell progenitor. Non-limiting examples of immune cells that are contemplated in the present disclosure include T cell, B cell, natural killer (NK) cell, NKT/iNKT cell, macrophage, myeloid cell, and dendritic cells. Non-limiting examples of stem cells that are contemplated in the present disclosure include pluripotent stem cells (PSCs), embryonic stem cells (ESCs), induced pluripotent stem cells (iPSCs), embryo-derived embryonic stem cells obtained by nuclear transfer (ntES; nuclear transfer ES), male germline stem cells (GS cells), embryonic germ cells (EG cells), hematopoietic stem/progenitor stem cells (HSPCs), somatic stem cells (adult stem cells), hemangioblasts, neural stem cells, mesenchymal stem cells and stem cells of other cells (including osteocyte, chondrocyte, myocyte, cardiac myocyte, neuron, tendon cell, adipocyte, pancreocyte, hepatocyte, nephrocyte and follicle cells and so on). In some embodiments, the engineered cells is a T cell, NK cells, iPSC, and HSPC. In some embodiments, the engineered cells used in the present disclosure are human cell lines grown in vitro (e.g. deliberately immortalized cell lines, cancer cell lines, etc.).
- The methods for integrating the inserts at the safe harbor sites can be viral or non-viral delivery techniques.
- In some embodiments, the nucleic acid sequence is inserted into the genome of the engineered cell by introducing a vector, for example, a viral vector, comprising the nucleic acid. Examples of viral vectors include, but are not limited to, adeno-associated viral (AAV) vectors, retroviral vectors or lentiviral vectors. In some embodiments, the lentiviral vector is an integrase-deficient lentiviral vector.
- In some embodiments, the nucleic acid sequence is inserted into the genome of the T cell via non-viral delivery. In non-viral delivery methods, the nucleic acid can be naked DNA, or in a non-viral plasmid or vector. Non-viral delivery techniques can be site-specific integration techniques, as described herein or known to those of ordinary skill in the art. Examples of site-specific techniques for integration into the safe harbor loci include, without limitation, homology-dependent engineering using nucleases and homology independent targeted insertion using Cas9.
- In some embodiments, the insert is integrated at a safe harbor site by introducing into the engineered cell, (a) a targeted nuclease that cleaves a target region in the safe harbor site to create the insertion site; and (b) the nucleic acid sequence (insert), wherein the insert is incorporated at the insertion site by, e.g., HDR. Examples of non-viral delivery techniques that can be used in the methods of the present disclosure are provided in U.S. application Ser. Nos. 16/568,116 and 16/622,843, the relevant disclosures of which are herein incorporated by reference in their entirety.
- 2. CAR T Cell Therapy
- Chimeric antigen receptor (CAR) T cells are T cells that have been genetically engineered to produce an artificial T-cell receptor for use in immunotherapy. Chimeric antigen receptors are receptor proteins that have been engineered to confer T cells with the ability to target a specific protein. The genetic modification of lymphocytes (e.g. T cells) by incorporation of, for example, CARs, and administration of the engineered cells to a subject is an example of “adoptive cell therapy”. As used herein, the term “adoptive cell therapy” refers to cell-based immunotherapy for transfusion of autologous or allogeneic lymphocytes, referred to as T cells or B cells. In this CAR therapy approach, cells are expanded and cultured ex vivo and genetically modified, prior to transfusion.
- The expression of CARs allows the engineered T-cells to target and bind specific proteins, for example, tumor antigens. In CAR therapy, T-cells are harvested from a subject—they can be autologous T-cells from the subject own blood or from a donor that will not be receiving the CAR therapy. Once isolated, the T-cells are genetically modified with a CAR, expanded ex vivo, and administered to the subject (i.e. patient) by, e.g. infusion.
- The CARs may be introduced into the T-cells using, for example, a viral technique (e.g., retroviral integration) or site-specific technique. With site specific integration of the transgenes (e.g. CARs), the transgenes may be targeted to a safe harbor locus. Examples of site-specific techniques for integration into the safe harbor loci include, without limitation, homology-dependent engineering using nucleases and homology independent targeted insertion using Cas9.
- The engineered CAR T cells have applications to immune-oncology. The CAR, for example, can be selected to target a specific tumor antigen. Examples of cancers that can be effectively targeted using CAR T cells are blood cancers. In some embodiments, CAR T cell therapy can be used to treat solid tumors.
- 3. Gene Editing
- The terms “gene editing” or “genome editing”, as used herein, refer to a type of genetic manipulation in which DNA is inserted, replaced, or removed from the genome using artificially manipulated nucleases or “molecular scissors”. It is a useful tool for elucidating the function and effect of sequence-specific genes or proteins or altering cell behavior (e.g. for therapeutic purposes).
- Currently available genome editing tools include zinc finger nucleases (ZFN) and transcription activator-like effector nucleases (TALENs) to incorporate genes at safe harbor loci (e.g. the adeno-associated virus integration site 1 (AAVS1) safe harbor locus). The DICE (dual integrase cassette exchange) system utilizing phiC31 integrase and Bxb1 integrase is a tool for target integration. Additionally, clustered regularly interspaced short palindromic repeat/Cas9 (CRISPR/Cas9) techniques can be used for targeted gene insertion.
- Site specific gene editing approaches can include homology dependent mechanisms or homology independent mechanisms.
- All methods known in the art for targeted insertion of gene sequences are contemplated in the methods described herein to insert constructs at safe harbor loci.
- 4. Crispr-Cas Gene Editing
- One effective example of gene editing is the Crisp-Cas approach (e.g. Crispr-Cas9). This approach incorporates the use of a guide polynucleotide (e.g. guide ribonucleic acid or gRNA) and a cas endonuclease (e.g. Cas9 endonuclease).
- As used herein, a polypeptide referred to as a “Cas endonuclease” or having “Cas endonuclease activity” refers to a CRISPR-related (Cas) polypeptide encoded by a Cas gene, wherein a Cas polypeptide is a target DNA sequence that can be cleaved when operably linked to one or more guide polynucleotides (see, e.g., U.S. Pat. No. 8,697,359). Also included in this definition are variants of Cas endonuclease that retain guide polynucleotide-dependent endonuclease activity. The Cas endonuclease used in the donor DNA insertion method detailed herein is an endonuclease that introduces double-strand breaks into DNA at the target site (e.g., within the target locus or at the safe harbor site).
- As used herein, the term “guide polynucleotide” relates to a polynucleotide sequence capable of complexing with a Cas endonuclease and allowing the Cas endonuclease to recognize and cleave a DNA target site. The guide polynucleotide can be a single molecule or a double molecule. The guide polynucleotide sequence can be an RNA sequence, a DNA sequence, or a combination thereof (RNA-DNA combination sequence). A guide polynucleotide comprising only ribonucleic acid is also referred to as “guide RNA”. In some embodiments, a polynucleotide donor construct is inserted at a safe harbor locus using a guide RNA (gRNA) in combination with a cas endonuclease (e.g. Cas9 endonuclease).
- The guide polynucleotide includes a first nucleotide sequence domain (also referred to as a variable targeting domain or VT domain) that is complementary to a nucleotide sequence in the target DNA, and a second nucleotide that interacts with a Cas endonuclease polypeptide. It can be a double molecule (also referred to as a double-stranded guide polynucleotide) comprising a sequence domain (referred to as a Cas endonuclease recognition domain or CER domain). The CER domain of this double molecule guide polynucleotide comprises two separate molecules that hybridize along the complementary region. The two separate molecules can be RNA sequences, DNA sequences and/or RNA-DNA combination sequences.
- Genome editing using CRISPR-Cas approaches relies on the repair of site-specific DNA double-strand breaks (DSBs) induced by the RNA-guided Cas endonuclease (e.g.
Cas 9 endonuclease). Homology-directed repair (HDR) of these DSBs enables precise editing of the genome by introducing defined genomic changes, including base substitutions, sequence insertions, and deletions. Conventional HDR-based CRISPR/Cas9 genome-editing involves transfecting cells with Cas9, gRNA and donor DNA containing homologous arms matching the genomic locus of interest. - HITI (homology independent targeted insertion) uses a non-homologous end joining (NHEJ)-based homology-independent strategy and the method can be more efficient than HDR. Guide RNAs (gRNAs) target the insertion site. For HITI, donor plasmids lack homology arms and DSB repair does not occur through the HDR pathway. The donor polynucleotide construct can be engineered to include Cas9 cleavage site(s) flanking the gene or sequence to be inserted. This results in Cas9 cleavage at both the donor plasmid and the genomic target sequence. Both target and donor have blunt ends and the linearized donor DNA plasmid is used by the NHEJ pathway resulting integration into the genomic DSB site. (See, for example, Suzuki, K., et al. (2016). In vivo genome editing via CRISPR/Cas9 mediated homology-independent targeted integration. Nature, 540(7631), 144-149, the relevant disclosures of which are herein incorporated in their entirety).
- Methods for conducing gene editing using CRISPR-Cas approaches are known to those of ordinary skill in the art. (See, for example, U.S. application Ser. No. 16/312,676, U.S. Ser. No. 15/303,722, and U.S. Ser. No. 15/628,533, the disclosures of which are herein incorporated by reference in their entirety). Additionally, uses of endonucleases for inserting transgenes into safe harbor loci are described, for example, in U.S. application Ser. No. 13/036,343, the disclosures of which are herein incorporated by reference in their entirety.
- The guide RNAs and/or mRNA (or DNA) encoding an endonuclease can be chemically linked to one or more moieties or conjugates that enhance the activity, cellular distribution, or cellular uptake of the oligonucleotide. Non-limiting examples of such moieties include lipid moieties such as a cholesterol moiety, cholic acid, a thioether, a thiocholesterol, an aliphatic chain (e.g., dodecandiol or undecyl residues), a phospholipid, e.g., di-hexadecyl-rac-glycerol or
triethylammonium 1,2-di-O-hexadecyl-rac-glycero-3-H-phosphonate, a polyamine or a polyethylene glycol chain, adamantane acetic acid, a palmityl moiety and an octadecylamine or hexylamino-carbonyl-t oxycholesterol moiety. See for example U.S. application Ser. No. 15/715,068, the disclosures of which are herein incorporated by reference in their entirety. - 5. Therapeutic Applications
- For therapeutic applications, the engineered cells, populations thereof, or compositions thereof are administered to a subject, generally a mammal, generally a human, in an effective amount.
- The engineered cells may be administered to a subject by infusion (e.g., continuous infusion over a period of time) or other modes of administration known to those of ordinary skill in the art.
- The engineered cells provided herein can be administered as part of a pharmaceutical compositions. In some embodiments, the present disclosure provides compositions comprising a guide RNA of the present disclosure. The pharmaceutical composition may comprise one or more pharmaceutical excipients. Any suitable pharmaceutical excipient may be used, and one of ordinary skill in the art is capable of selecting suitable pharmaceutical excipients. Accordingly, the pharmaceutical excipients provided below are intended to be illustrative, and not limiting. Additional pharmaceutical excipients include, for example, those described in the Handbook of Pharmaceutical Excipients, Rowe et al. (Eds.) 6th Ed. (2009), incorporated by reference in its entirety.
- The engineered cells provided herein not only find use in gene therapy but also in non-pharmaceutical uses such as, e.g., production of animal models and production of recombinant cell lines expressing a protein of interest.
- The engineered cells of the present disclosure can be any cell, generally a mammalian cell, generally a human cell that has been modified by integrating a transgene at a safe harbor locus described herein. In some embodiments, the engineered cells are immune cells. In some embodiments, the engineered cells are lymphocytes. In some embodiments, the engineered cells are T cells or T cell precursors.
- The engineered cells, compositions and methods of the present disclosure are useful for therapeutic applications such as CAR T cell therapy and TCR T cell therapy. In some embodiments, the insertion of a sequence encoding a transgene within a safe harbor locus maintains the TCR expression relative to instances when there is no insertion and enables transgene expression while maintaining TCR function.
- Various diseases treated using the engineered cells, populations thereof, or compositions thereof are provided herein. Non-limiting examples of such diseases include alopecia areata, autoimmune hemolytic anemia, autoimmune hepatitis, cancer, dermatomyositis, diabetes (type 1), certain juvenile idiopathic arthritis, glomerulonephritis, Graves' disease, Guillain Valley Syndrome, idiopathic thrombocytopenic purpura, myasthenia gravis, certain myocarditis, multiple sclerosis, pemphigus/pemphigoid, pernicious anemia, polyarteritis nodosa, polymyositis, primary bile With cirrhosis, psoriasis, rheumatoid arthritis, scleroderma/systemic sclerosis, Sjogren's syndrome, systemic lupus erythematosus, certain thyroiditis, certain uveitis, vitiligo, multiple vasculitis (Wegener)); autoimmune disorders including, but not limited to, granulomatosis; hematopoietic tumors including but not limited to acute and chronic leukemia, lymphoma, multiple myeloma and myelodysplastic syndrome; tumors of the prostate, breast, lung, colon, uterus, skin, liver, bone, pancreas, ovary, testis, bladder, kidney, head, neck, stomach, cervix, rectum, larynx, or esophagus solid tumors; HIV (human immunodeficiency virus) related disorders, RSV (respiratory syncytial virus) related disorders; EBV (Epstein-Barr virus) related disorders; CMV (cytomegalovirus) related disorders; and infectious diseases including, but not limited to, adenovirus-related disorders and BK polyomavirus-related disorders.
- Cancers that can be treated with the engineered cells (e.g., CAR T-cells) of the present disclosure, populations thereof, or compositions thereof include blood cancers. In some embodiments, the cancer treated using the engineered cells (e.g., CAR T-cells) described herein, populations thereof, or compositions thereof is a hematologic malignancy or leukemia. In some embodiments, the engineered cells (e.g., CAR T-cells) described herein, populations thereof, or compositions thereof are used for the treatment of acute lymphoblastic leukemia (ALL) or diffuse large B-cell lymphoma (DLBCL). In some embodiments, the cancer is acute myeloid leukemia (AML), acute lymphoblastic leukemia (ALL), myelodysplasia, myelodysplastic syndromes, acute T-lymphoblastic leukemia, or acute promyelocytic leukemia, chronic myelomonocytic leukemia, or myeloid blast crisis of chronic myeloid leukemia. Examples of cancers treatable using the engineered cells (e.g., CAR T-cells) described herein include, without limitation, breast cancer, ovarian cancer, esophageal cancer, bladder or gastric cancer, salivary duct carcinoma, salivary duct carcinomas, adenocarcinoma of the lung or aggressive forms of uterine cancer, such as uterine serous endometrial carcinoma. In some other embodiments, the cancer is brain cancer, breast cancer, cervical cancer, colon cancer, colorectal cancer, endometrial cancer, esophageal cancer, leukemia, lung cancer, liver cancer, melanoma, ovarian cancer, pancreatic cancer, rectal cancer, renal cancer, stomach cancer, testicular cancer, or uterine cancer. In yet other embodiments, the cancer is a squamous cell carcinoma, adenocarcinoma, small cell carcinoma, melanoma, neuroblastoma, sarcoma (e.g., an angiosarcoma or chondrosarcoma), larynx cancer, parotid cancer, biliary tract cancer, thyroid cancer, acral lentiginous melanoma, actinic keratoses, acute lymphocytic leukemia, acute myeloid leukemia, adenoid cystic carcinoma, adenomas, adenosarcoma, adenosquamous carcinoma, anal canal cancer, anal cancer, anorectum cancer, astrocytic tumor, bartholin gland carcinoma, basal cell carcinoma, biliary cancer, bone cancer, bone marrow cancer, bronchial cancer, bronchial gland carcinoma, carcinoid, cholangiocarcinoma, chondrosarcoma, choroid plexus papilloma/carcinoma, chronic lymphocytic leukemia, chronic myeloid leukemia, clear cell carcinoma, connective tissue cancer, cystadenoma, digestive system cancer, duodenum cancer, endocrine system cancer, endodermal sinus tumor, endometrial hyperplasia, endometrial stromal sarcoma, endometrioid adenocarcinoma, endothelial cell cancer, ependymal cancer, epithelial cell cancer, Ewing's sarcoma, eye and orbit cancer, female genital cancer, focal nodular hyperplasia, gallbladder cancer, gastric antrum cancer, gastric fundus cancer, gastrinoma, glioblastoma, glucagonoma, heart cancer, hemangioblastomas, hemangioendothelioma, hemangiomas, hepatic adenoma, hepatic adenomatosis, hepatobiliary cancer, hepatocellular carcinoma, Hodgkin's disease, ileum cancer, insulinoma, intraepithelial neoplasia, interepithelial squamous cell neoplasia, intrahepatic bile duct cancer, invasive squamous cell carcinoma, jejunum cancer, joint cancer, Kaposi's sarcoma, pelvic cancer, large cell carcinoma, large intestine cancer, leiomyosarcoma, lentigo maligna melanomas, lymphoma, male genital cancer, malignant melanoma, malignant mesothelial tumors, medulloblastoma, medulloepithelioma, meningeal cancer, mesothelial cancer, metastatic carcinoma, mouth cancer, mucoepidermoid carcinoma, multiple myeloma, muscle cancer, nasal tract cancer, nervous system cancer, neuroepithelial adenocarcinoma nodular melanoma, non-epithelial skin cancer, non-Hodgkin's lymphoma, oat cell carcinoma, oligodendroglial cancer, oral cavity cancer, osteosarcoma, papillary serous adenocarcinoma, penile cancer, pharynx cancer, pituitary tumors, plasmacytoma, pseudosarcoma, pulmonary blastoma, rectal cancer, renal cell carcinoma, respiratory system cancer, retinoblastoma, rhabdomyosarcoma, sarcoma, serous carcinoma, sinus cancer, skin cancer, small cell carcinoma, small intestine cancer, smooth muscle cancer, soft tissue cancer, somatostatin-secreting tumor, spine cancer, squamous cell carcinoma, striated muscle cancer, submesothelial cancer, superficial spreading melanoma, T cell leukemia, tongue cancer, undifferentiated carcinoma, ureter cancer, urethra cancer, urinary bladder cancer, urinary system cancer, uterine cervix cancer, uterine corpus cancer, uveal melanoma, vaginal cancer, verrucous carcinoma, VIPoma, vulva cancer, well-differentiated carcinoma, or Wilms tumor.
- In some embodiments, the present disclosure provides methods of treating a subject in need of treatment by administering to the subject a composition comprising any of the engineered cells described herein. As used, the terms “treat,” “treatment,” and the like refer generally to obtaining a desired pharmacological and/or physiological effect. That effect is preventive in terms of complete or partial prevention of the disease and/or therapeutic in terms of partial or complete cure of the disease and/or adverse effects resulting from the disease. The term “treatment”, as used herein, encompasses any treatment of a disease in a subject (e.g., mammal, e.g., human). Treatment may also refer to the administration of the engineered cells provided herein to a subject that is susceptible to the disease but has not yet been diagnosed as suffering from it, including preventing the disease from occurring; inhibiting disease progression; or reducing the disease (i.e., causing a regression of the disease). Further, treatment may stabilize or reduce undesirable clinical symptoms in subjects (e.g., patients). The cells provided herein populations thereof, or compositions thereof may be administered before, during or after the occurrence of the disease or injury.
- In certain embodiments, the subject has a disease, condition, and/or injury that can be treated and/or ameliorated by cell therapy. In some embodiments, the subject in need of cell therapy is a subject having an injury, disease, or condition, thereby causing cell therapy (e.g., therapy in which cellular material is administered to the subject). However, it is contemplated that it is possible to treat, ameliorate and/or reduce the severity of at least one symptom associated with the injury, disease or condition. In certain embodiments, a subject in need of cell therapy includes, but is not limited to, a bone marrow transplant or stem cell transplant candidate, a subject who has received chemotherapy or radiation therapy, a hyperproliferative disease or cancer (e.g., a hematopoietic system), a subject having or at risk of developing a hyperproliferative disease or cancer), a subject having or at risk of developing a tumor (e.g., solid tumor), viral infection or virus. It is also intended to encompass subjects suffering from or at risk of suffering from a disease associated with an infection.
- In some embodiments, the present disclosure provides a composition of the present disclosure along with instructions for use. The instructions for use can be present in the kits as a package insert, in the labeling of the container of the kit or components thereof, or can be in digital form (e.g. on a CD-ROM, via a link on the internet). A kit can include one or more of a genome-targeting nucleic acid, a polynucleotide encoding a genome-targeting nucleic acid, a site-directed polypeptide, and/or a polynucleotide encoding a site-directed polypeptide. Additional components within the kits are also contemplated, for example, buffer (such as reconstituting buffer, stabilizing buffer, diluting buffer), and/or one or more control vectors.
- 6. Combination Therapies
- In some embodiments, an engineered cells of the present disclosure or composition thereof is administered with at least one additional therapeutic agent. Any suitable additional therapeutic agent may be administered with an engineered cell provided herein, populations thereof, or compositions thereof. In some aspects, the additional therapeutic agent is selected from radiation, an ophthalmologic agent, a cytotoxic agent, a chemotherapeutic agent, a cytostatic agent, an anti-hormonal agent, an immunostimulatory agent, an anti-angiogenic agent, and combinations thereof.
- In some embodiments, an engineered cell of the present disclosure or composition thereof is administered with a steroid. The administration of a steroid can prevent or mitigate the risk of a subject receiving the engineered cell(s) or composition thereof having an autoimmune reaction.
- The additional therapeutic agent may be administered by any suitable means. In some embodiments, the engineered cells described herein, populations thereof, or compositions thereof and the additional therapeutic agent is administered in the same pharmaceutical composition, e.g. by infusion. In some embodiments, the engineered cells described herein and additional therapeutic agent are included in different pharmaceutical compositions.
- The pharmaceutical composition may comprise one or more pharmaceutical excipients. Any suitable pharmaceutical excipient may be used, and one of ordinary skill in the art is capable of selecting suitable pharmaceutical excipients. Accordingly, the pharmaceutical excipients provided below are intended to be illustrative, and not limiting. Additional pharmaceutical excipients include, for example, those described in the Handbook of Pharmaceutical Excipients, Rowe et al. (Eds.) 6th Ed. (2009), incorporated by reference in its entirety.
- Various modes of administering the additional therapeutic agents are contemplated herein. In some embodiments, the additional therapeutic agent is administered by any suitable mode of administration. Generally, modes of administration include, without limitation, intravitreal, subretinal, suprachoroidal, intraarterial, intradermal, intramuscular, intraperitoneal, intravenous, nasal, parenteral, topical, pulmonary, and subcutaneous routes.
- In embodiments where the engineered cells provided herein and the additional therapeutic agent are included in different pharmaceutical compositions, administration of the engineered cells provided herein can occur prior to, simultaneously, and/or following, administration of the additional therapeutic agent.
- The following are examples of methods and compositions of the invention. It is understood that various other embodiments may be practiced, given the general description provided herein.
- Below are examples of specific embodiments for carrying out the present invention. The examples are offered for illustrative purposes only, and are not intended to limit the scope of the present invention in any way. Efforts have been made to ensure accuracy with respect to numbers used (e.g., amounts, temperatures, etc.), but some experimental error and deviation should, of course, be allowed for.
- The practice of the present invention will employ, unless otherwise indicated, conventional methods of protein chemistry, biochemistry, recombinant DNA techniques and pharmacology, within the skill of the art. Such techniques are explained fully in the literature. See, e.g., T. E. Creighton, Proteins: Structures and Molecular Properties (W.H. Freeman and Company, 1993); A. L. Lehninger, Biochemistry (Worth Publishers, Inc., current addition); Sambrook, et al., Molecular Cloning: A Laboratory Manual (2nd Edition, 1989); Methods In Enzymology (S. Colowick and N. Kaplan eds., Academic Press, Inc.); Remington's Pharmaceutical Sciences, 18th Edition (Easton, Pennsylvania: Mack Publishing Company, 1990).
- The objective of the experiments in this set of examples was to identify T-cell knock-in (KI) loci in coding regions outside of the T Cell Receptor Alpha Constant (TRAC) locus. To select candidate loci, the criteria and requirements shown in Table 1 were used.
-
TABLE 1 Criteria used for identification of candidate knock-in loci within coding regions Detailed Datasets Considered for Criteria Requirements identification of loci KI High expression at Knock in efficiency efficiency d2 (day 2) data on ~90 genes Similar expression derived from dynamics as TRAC Roth, T. L., et al. 2019. (stable expression (Rapid discovery of synthetic upon activation) DNA sequences to rewrite endogenous Accessible T cell circuits. bioRxiv, 604561.) chromatin Bulk RNA-seq data at do, d2, d3 and d4: gRNA editing (i) publicly available data; efficiency (ii) data derived from Roth, T. L., et al. 2019. (Rapid discovery of synthetic DNA sequences to rewrite endogenous T cell circuits. bioRxiv, 604561.); and (iii) internally generated data Bulk ATAC-seq data at do, d2, d3 and d4: (i) data derived from Roth, T. L., et al. 2019. (Rapid discovery of synthetic DNA sequences to rewrite endogenous T cell circuits. bioRxiv, 604561.); and (ii) internally generated data Safety Does not affect T cell screening data proliferation, Achilles' project on cancer cell lines data cytolytic activity or Annotated oncogenes and tumor signaling of T cells suppressors from TCGA Annotated functions in immune cells Be careful with tumor suppressors and oncogenes - Generally, the requirements for candidate KI loci within the coding regions was that the coding gene is not essential for T cell function or the knock-out of that gene would be beneficial to chimeric antigen receptor therapy (CAR T) functions.
- As shown in Table 1, some of the data utilized, for comparison purposes, to identify candidate KI loci was sourced, in part, from Roth, T. L., et al. 2019 (Rapid discovery of synthetic DNA sequences to rewrite endogenous T cell circuits. bioRxiv, 604561), the relevant disclosures of which are herein incorporated by reference in their entirety. A summary of data in the Roth, T. L. et al. paper is shown in
FIG. 3 . -
FIG. 4A andFIG. 4B show the processed RNA-seq data, showing that samples cluster based on activation status and by cell type. Donor 4 (AKI4) was identified as an outlier, as it clustered differently than the others, and was removed from remaining analysis. The transcript expression data from RNA-seq experiments on the 90 genes cited in the Roth, T. L. et al. paper was correlated to transcript expression data in Roth, T. L. et al. paper. (SeeFIG. 5A andFIG. 5B ). -
FIG. 6 shows the process ATAC-seq data, which focused on the 10 kb area around the transcription start site (TSS). Again, donor 4 (AK14) was identified as an outlier, based on expression profile in both CD4 and CD8 cells, and was removed from remaining analysis. The results of the remaining ATAC-seq analysis are shown inFIGS. 7A-8 . The data inFIG. 7A for TSS enrichment scores also indicated that all libraries were high quality. The data on open chromatin regions around the TRAC locus revealed the highest signal nearexon 3 rather than exon 1 (FIG. 8 ). - The K1 efficiency results (obtained using the model described infra) based on the 5 donors, 2 gRNA, and 2 replicates/gRNAs from the Roth, T. L. et al. paper are shown in
FIG. 9 andFIG. 10 . - A linear model was built to estimate KI efficiency of approximately 90 genes using RNA-seq and ATAC-seq data. The linear model captured 33% of the variation in the data (i.e. R2=0.33).
FIG. 11 shows KI efficiency versus day 2 (d2) RNA-seq data, day 4 (d4) RNA seq data, and d2 ATAC data. The linear model was applied to remaining candidate genes to estimate their KI efficiency. - The candidate coding loci were selected by first ranking all genes in the pooled data sets by predicted KI efficiency, using RNA-seq expression data from d2 (day 2). Candidate coding loci were required to be stably expressed during T cell activation (e.g., <=2-fold expression change relative to day 0 (d0). The candidate loci also had to be accessible based on ATAC-seq data. Using that selection process/requirements, 16 well-characterized coding genes (with known functions) were selected as candidate genes. The knockout of these 16 would confer a benefit to the function of CART cells (e.g. B2M, CD5, SMAD2, PTPRC, CD3E). An additional 12 coding genes with high predicted KI efficiency and no apparent essential function (inert coding genes) were selected as candidate genes. (See Table 2).
-
TABLE 2 KI Candidate Loci selected Genes with Known Functions Inert Coding Genes B2M (~70%) CD2 SUB1 CD3E (~50%) SMAD2 EDF1 PTPRC (~50%) SLC38A1 CAPNS1 CD3G (~30%) TIGIT SRP14 SOCS1 PTPN2 RTRAF APRT Tested in Roth, T.L. et al CBLB PTPRCAP paper (reported editing PTEN SRSF9 efficiency from Roth, PTPN6 FTL T.L. et al paper) CD5 TRIM28 TET2 SERF2 TRAC RPS23 - The criteria for selection of candidate loci in non-coding regions or gene deserts is summarized in Table 3.
-
TABLE 3 Criteria used for identification of candidate knock-in loci within non-coding regions Datasets Considered for Criteria Detailed Requirements identification of loci KI Accessible chromatin Bulk ATAC-seq data efficiency at d0, d2, d3 and d4: (i) data derived from Roth, T. L., et al. 2019. (Rapid discovery of synthetic DNA sequences to rewrite endogenous T cell circuits. bioRxiv, 604561.); and (ii) internally generated data Safety >10 kb from cancer-related Ensembl genes genes or TADS Annotated oncogenes and tumor >10 kb from any suppressors from TCGA miRNA/functional small Annotated enhancers RNAs from Encode and papers >10 kb from any 5' gene end >10 kb from any regulatory regions (ultra-conserved elements, enhancers) - To select candidate regions within the non-coding regions, the inventors here started by looking in highly accessible regions of the genome (10 kb windows). The most accessible region overlapped with (i) annotated protein coding genes (>50% accessible regions), (ii) pseudogenes and noncoding RNAs (˜20% accessible regions), and (iii) enhancer/regulatory regions (˜20% accessible regions). The candidate genes were required to be <10 kb from the coding regions. A few regions that overlapped with long intergenic noncoding RNAs (lincRNAs) but did not have apparent function in T cells were also considered.
- Examples of other criteria that have been used in the art for selection of SHS are described, for example in Pellenz, S., et al. (2019). New human chromosomal sites with “safe harbor” potential for targeted transgene insertion. Human gene therapy, 30(7), 814-828, the relevant disclosures of which are herein incorporated by reference in their entirety.
-
FIG. 12 is a plot showing the normalized ATAQ SEQ data for a top candidate non-coding region. - Using the above-described selection process/requirement, 11 candidate regions in gene deserts were selected. In total, there were 39 KI candidate loci, including those in coding and non-coding regions that would be evaluated as safe harbor loci (predicted loci).
- Materials
-
- 120 sgRNAs were ordered from Synthego, 3 sgRNAs per region, and editing efficiency was assessed by next generation sequencings (NGS).
- 87 targeted coding genes
- 33 targeted gene deserts
- 189 constructs were synthesized by Genscript
- Homology arm: 450 bp each
- eGFP was used as the reporter
- Vector backbone: pUC57-Kan backbone
- 78 with endogenous promoters
- 111 with EF1a-HTLV promoter
- Constructs were sequence verified by Genscript and majority were internally verified by sequencing. The sgRNA sequences are provided in Table 4 and the construct sequences are provided in Table 5.
- 120 sgRNAs were ordered from Synthego, 3 sgRNAs per region, and editing efficiency was assessed by next generation sequencings (NGS).
- Methods
- To evaluate the effectiveness of the predicted loci, the following were used:
-
- (87 endogenous constructs+120 exogenous construct)*2=414 wells+Controls=460
- 3 donors, 2 replicates at 3 time points: wk 1 (d6=day 6), wk 3 (d21=day 21) and wk 4 (d28=day 28)
- Controls
- DNA-only (no ribonucleoprotein (RNP)): episomal expression (n=8)
- DNA+non-targeting guides+RNP: to check if DNA delivery is more efficient with RNP (n=8)
- pMax GFP was measured to check if transfection is ok (n=3).
- WT cells with no electroporation
- WT cells with electroporation
- Controls were spread out across plates.
- Cells were electroporated without RNP/HDR to check if cell counting makes sense.
- 2 more donors were added and changed to 3 replicates at 1 time point: wk 1 (d6)
- FACS panel: GFP, TCR, Zombie
- Washed every 3 wells.
- The secondary data collected included: RNA-seq data: at d0, d2, and d4 (to determine stable expression) and ATAC-seq data: at d2 (to identify activated cells). The methods used for conducting the RNA-seq and ATAC-seq experiments are in Roth, T. L., et al. 2019. (Rapid discovery of synthetic DNA sequences to rewrite endogenous T cell circuits. bioRxiv, 604561), which is herein incorporated by reference in its entirety.
- Automated gating was performed on FACS data, using .fcs files. For constructs with endogenous promoters, all 5 donors and all 3 time points were used. For constructs with EF1a promoter, used 3 donors (donors 1-3) and 2 time points (
wk 3 and wk 4). - To summarize data, wells with a % K1 efficiency (% GFP_high cells)<=10% (likely a gating error, experimental error or minimal integration) were removed. Loci were then ranked by median expression among donors and median % K1 efficiency among donors.
- Loci significance was calculated using robust rank aggregation, which was the methodology used for consistent ranking of loci among donors and time points. Methods for conducting robust rank aggregation are known in the art. (See, for example, Kolde, R., et al. (2012). Robust rank aggregation for gene list integration and meta-analysis. Bioinformatics (Oxford, England), 28(4), 573-580, the disclosure of which is herein incorporated by reference in its entirety).
- Results
- The results from the analysis of controls and experimental groups is shown in
FIGS. 13A-25 . - The results of the pmax_GFP control experiments revealed that
donor 5 had significantly higher cell count than other donors. Generally, the GFP_high readings decreased over time, as expected. (FIGS. 13A and 13B ). - Experiments to analyze non-targeting controls for changes in episomal GFP expression, revealed that episomal GFP expression decreased to less than 0.05 over time. (
FIGS. 14A and 14B ) Some of the outliers were attributed to gating errors, which were most severe indonor 3. WT control experiments also exhibited <0.05 GFP_high readings, as expected (FIG. 15 ). - As shown in
FIG. 16 andFIG. 17 , there was a more consistent trend among donors when using sgRNAS to target the B2M safe harbor locus. - A comparison of GFP expression trends between sgRNA79 and sgRNA83 for TRAC insertion and expression with an endogenous reporter revealed that
donor 2 trends were different between sgRNA79 and sgRNA83, whiledonor 4 trends were the same (FIG. 18 andFIG. 22 ). Some potential reasons for the observed variation between replicates and donors include: -
- (1) manual handling of a large number of plates (e.g.,
FIG. 26 ):- Edge effects
- Electroporation errors
- (2) Inherent differences among donors (e.g.,
FIG. 27 ):- There were consistent differences with certain wells throughput the assay (spanning multiple weeks)
- There were variable cell numbers between donors
- (3) Gating errors (e.g.
FIG. 28 ):- 1 peak vs >2 peaks in GFP signals
- (1) manual handling of a large number of plates (e.g.,
- The loci were ranked based on mean florescence intensity (MFI) of reporter gene, GFP, and KI efficiency.
-
FIG. 29 shows GFP MFI and KI efficiency for all significant loci having endogenous promoters. The B2M locus reported the highest GFP MFI. The TRAC locus was among the top 10 MFI. The highest GFP MFI readings were reported inweek 1 and reduced slightly inweeks -
FIG. 30 shows GFP MFI and KI efficiency for all significant loci having exogenous promoters (e.g. EF1a promoters). The TRAC locus reported the highest GFP MFI. There was comparable expression betweenweeks -
FIGS. 31A, 31B, and 31C show GFP MFI and KI efficiency for all significant loci having endogenous and exogenous (e.g., EF1a) promoters. The results showed that expression driven by EF1a promoters is about 10× higher than endogenous promoters (FIG. 31A ). With regard to KI efficiency, the SOCS1 locus reported the highest KI efficiency. Atweeks - This experiment was conducted to identify target loci and integration sites that enable high transgene expression without disrupting the TCR. Some gene deserts (
e.g. gene deserts FIG. 32A ). - To evaluate and identify preferred knock-in sites, TCR expression was measured following circuit cassette knock in of PrimeR (Prime Receptor (Myc)). Myc denotes an N-terminal Myc epitope tag to facilate detection of surface expressed primer receptor. Briefly, CD3-CD28 Dynabead-activated T cells were electroporated with sgRNA/Cas9 RNPs targeting the indicated sites (see
FIGS. 32B and 33A ), as well as HDRTs with homology arms directing HDR-mediated integration into the indicated sites. Atday 6 post-electroporation, cells were stained with anti-TCRalpha/beta and anti-myc antibodies and analyzed on an Attune NxT flow cytometer. As shown inFIG. 32B , the TCR expression was maintained at GS94 and GS102 candidate integration (knock-in) sites. Only ⅔ of the cells that had circuit cassette knock in of PrimeR at GS79 (TRAC) locus maintained TCR expression. These results indicated that the GS94 and GS102 sites showed better potential for TCR stimulation. - The percentage of cells showing effective knock-in (based on measurements of PrimeR) was 36%±4% when using the GS94 integration site, as compared to 32%±5% when using the TRAC integration site. These results revealed integration sites, including GS94, that supported reproducibly high circuit cassette knockin rates. See
FIG. 33A . - GS94 is a candidate integration site located on
chromosome 11's distal q arm. It is within 180-350 kb of the promoters for ETS1 and FLI1 (FIG. 33B ), however that is considered low-risk for integration vector gene therapy. The circuit expression and function potential of the GS94 gene was evaluated. - T cells underwent circuit cassette integration with PrimeR at GS79 (TRAC) integration site and GS94 integration site. The cells were cocultured with K562 C19 cells for 48 hours and then the PrimeR induced CAR MFI was compared to the PrimeR MFI.
- Briefly, T cells generated as described in the Example above were cocultured with K562 CD19+/MSLN− cells at
day 7 post-electroporation. MSLN (mesothelin) is a gene that is overexpressed in human pancreatic cancer. Cells were then stained with anti-FLAG antibody 48 h post initiation of coculture and analyzed on an Attune NxT flow cytometer. The results revealed that GS94 yields superior CAR induction with high prime R expression following the 48-hour coculture with the K562 C19 cells. SeeFIG. 34A . GS94 resulted in prime antigen-dependent CAR expression that was approximately two-fold higher than the expression in several other candidate integration sites as well as the TRAC integration site. Additionally, on average, the prime receptor surface expression level was no less than 50% of expression level when using the TRAC integration site. - To evaluate the effect of the candidate integration site on cytotoxicity and cytokine secretion, T cells that had undergone circuit cassette integration with PrimeR at GS79 (TRAC) integration site and GS94 integration site were cocultured with K562 C19/MSLN cells for 48 hours. MSLN is a gene that is overexpressed in human pancreatic cancer. The cells were treated at a 1:1 effector:target cell ratio (1:1 E:T)). Briefly, T cells generated as described above were cocultured with K562 CD19+/MSLN+ cells at
day 7 post-electroporation. 48 h post initiation of coculture, supernatants were collected and analyzed via Luminex® for cytokine levels. The cytokines measure were IL-2, INFg, and TNF. Cytotoxicity was analyzed by measuring luciferase activity of remaining target cells after 48 h. Each of the data points inFIG. 34B represent two replicates and the lines represent the range of cytotoxicity for the replicates. As shown inFIG. 34B , the GS94 integration sites resulted in superior cycotoxic ability and cytokine secretion following the 48-hour coculture with K562 C19/MSLN cells. - To compare the effect of the candidate integration site on cytotoxicity versus prime-independent cytotoxicity, T cells that had undergone circuit cassette integration with PrimeR at GS79 (TRAC) integration site and GS94 integration site were cocultured with K562 CD19+/MSLN+ cells or K562 CD19−/MSLN+ cells (“K562 MSLN”) at
day 7 post-electroporation for 48 hours. 0.3, 1.0, and 3.0 E:T cell ratios were tested. SeeFIG. 35A andFIG. 35B . At 48 h post initiation of coculture, cytotoxicity was analyzed by measuring luciferase activity of remaining target cells As shown inFIG. 35B , the GS94 integration site resulted in equivalent cytotoxic potential to the TRAC integration site and there was no prime-independent cytotoxicity. - To compare the effect of the candidate integration site on cytoxicity versus prime-independent cytotoxicity, T cells that had undergone circuit cassette integration with PrimeR at GS79 (TRAC) integration site and GS94 integration site, generated as described above, were cocultured with K562 CD19+/MSLN+ cells. A group that had target cells only (E:T=0; Targets only) was compared to a group with an E:T cell ratio of 1. Following the 48 h coculture with the K562 CD19+/MSLN+ cells, supernatant was collected and analyzed via Luminex® for cytokine levels to measure secretion of IL-2, INFg and TNF cytokines. See
FIG. 36A andFIG. 36B . As shown inFIG. 36B , the GS94 integration site resulted in equivalent cytokine secretion to the TRAC integration site and there was no prime-independent secretion of IL-2, INFg or TNF. - To evaluate the effect of the candidate integration site on prime-independent CAR expression, T cells that had undergone circuit cassette integration with PrimeR at GS79 (TRAC) integration site, GS94 integration site, and GS102 integration site were cultured in vitro for 32 days. The cells were treated with repetitive CD3/CD28 stimulation at
days FIG. 37 , T cell activation through TCR did not result in prime R-independent CAR expression from circuit cassette integration at the candidate integration sites. - T cells generated as described above were cultured in 96-well plates, with T cell growth medium being exchanged every 2 days. At
days - To evaluate the effect of the candidate integration sites on stable (sustained) expression of PrimeR, T cells that had undergone circuit cassette integration with PrimeR at the integration sites indicated in
FIG. 38 were cultured in vitro for 32 days. Briefly, T cells generated as described above were cultured in 96-well plates, with T cell growth medium being exchanged every 2 days. Atdays FIGS. 38A and 38B , the GS94 integration site resulted in stable PrimeR expression over at least a 4-week period. - To evaluate the on-target editing efficiency of candidate knock-in sites, iGUIDE-Seq assay was used. The methods used for conducting the iGUIDE-Seq assay are illustrated in
FIG. 39A and provided in Nobles et al., Genome Biology (2019), which is hereby incorporated by reference in its entirety. As shown inFIG. 39B , the GS94 integration site had the highest on-target editing efficiency of the evaluated candidate integration sites. As shown inFIG. 39C , GS94 resulted in no putative off-target editing as observed with two donors. - Elevation prediction: Computational predictions of potential off-target sites from (gs94) were performed using Elevation-search (algorithm described in Listgarten et al. 2018. Prediction of off-target activities for the end-to-end design of CRISPR guide RNAs.
Nat Biomed Engr 2, 37-48; software obtained from github.com/Microsoft/Elevation). All sites identified by Elevation-search were subjected to analysis using rhAmp-seg™. - rhAmpSeg™: 49 candidate off-target sites for GS94 identified by iGUIDE or the Elevation prediction algorithm and the GS94 target site were characterized by rhAmpSeg™ (Integrated DNA Technologies, Inc.). This targeted amplification enables NGS-based quantification of the editing occurring at numerous sites simultaneously. Genomic DNA from T cells from at least 2 donors that had been treated singly with each of the following 7 guides: GS84, GS94, GS95, GS96, GS102, GS108, and GS138 was isolated with the GenFind® V3 DNA purification system (Beckman Coulter). Two separate rhAmpSeg™ amplification pools were used to cover the 50 loci, the procedure was performed as recommended by Integrated DNA Technologies for each of the samples. The rhAmpSeg™ libraries were sequenced on a MiniSeg™ with a Mid Output Kit (300-cycles) (Illumina). The CRISPResso2 algorithm (github.com/pinellolab/CRISPResso2) was used to determine the percentage of insertions and deletions at each of the amplified loci. Statistical significance (FDR-adjusted p-value≤0.001) using a chi-squared test was only observed at the GS94 site.
- RNA-seq: To evaluate changes induced by GS94 integration at the transcriptional level, a CD19/MSLN circuit was integrated at the GS79 (TRAC), GS94, and GS102 integration sites. On
day 6 post-integration, 1e6 edited cells were sorted using a BD FACSAria based on transgene expression. RNA was isolated from sorted T cells with the RNeasy kit (Qiagen). Purified RNA was converted into an NGS library using the TruSeq™ RNA Library Prep Kit v2 (Illumina). Libraries were sequenced on either theNovaSeq™ 6000 or NextSeq™ 550 instruments (Illumina). The STAR 2.7.3a aligner (Dobin A. et al. Bioinformatics. 2013. 29:15-21) was used to align the RNA-seq data against the reference human GRCh38 transcriptome and to obtain gene-level read counts. edgeR (Robinson M D et al. Bioinformatics. 2010. 26: 139-140) was used to compute differential expression, combining data across both donors. The only genes within 300 Kb of the GS94 site, ETS1 and FLI1, were not differentially expressed in cells with integration at the GS94 integration site compared to cells with integration at any of the other two loci. At an FDR-adjusted p-value cutoff of 0.01, the number of differentially expressed genes was minimal (<100 genes genome-wide). - Cytokine-independent growth assay: To evaluate the safety of the primary T cells with GS94 locus KI, cytokine-independent growth assay was performed to evaluate the potential for oncogenic transformation. Briefly, primary human T cells that had undergone CD19/MSLN circuit cassette integration at GS94 locus were thawed and recovered overnight. 1×106 cells were then seeded in one well of a 24 well-GRex plate, culturing for 5 days in the medium with or without cytokines. Cell number and viability were recorded at
days FIG. 43 , while GS94 KI T cells maintained good viability and total cell count when cultured with cytokine, the viability of GS94 KI T cells drastically decreased over the course of 5 days when cultured without cytokine and there was no viable cell left onday 5. The positive control Jurkat cells maintained good viability and expansion without cytokine throughout the assay. Taken together, this data shows that GS94 edited primary human T cells still depend on exogenous cytokine for growth, survival and expansion, therefore, there is no concern for cellular transformation. - The specificity of CRISPR reagents (e.g. SpCas9 complexed with sgRNA) targeting candidate loci including GS94 was evaluated by iGUIDE-seq (
FIG. 40 ). GS94-targeting CRISPR RNP showed the highest percentage of iGUIDE-seq oligo cassette trapping events of all candidates evaluated, and the control sgRNA sequences from the iGUIDE-seq paper showed similar specificities to what was reported in the original publication, suggesting that the assay performed as expected. - Putative off-target sites were taken from the iGUIDE-seq output, which already suggested that the putative sites were spurious. Additional target sites were predicted by a computational approach (Elevation software package). rhAmp-seg™ was used to prepare high-throughput sequencing libraries for each of the putative off-target sites, and the method was applied to DNA samples from T cells electroporated with CRISPR RNPs targeting the candidate target sites. The resulting NGS data were processed with CRISPResso2 software, and the frequency of insertions and deletions (indels) was taken as indication of CRISPR cleavage activity, as is common in the field. T cells electroporated with GS94-targeting CRISPR RNP showed no greater frequency of indels at the set of putative off-target sites than T cells treated with CRISPR RNP targeting other sites, consistent with the GS94-targeting CRISPR RNP having no consequential or detectable off-target activity, and therefore being the most specific out of the set evaluated (
FIG. 41 ). - Potential effects of transgene integration at the GS94 site on the regulation of the T cell transcriptome were evaluated by knocking in a large cassette to the site, growing T cells for several days, sorting cells expressing the transgene within the cassette, and then collecting RNA from the cells. RNA-seq libraries were prepared and sequenced, and analysis of the resulting Illumina sequencing data revealed no biologically or statistically significant differences in expression of any genes within 300 kb of the GS94 site in cells with integrations at GS94 compared to cells with integration at TRAC or the GS102 sites (
FIG. 42 ). Furthermore, other gene expression differences that reached statistical significance were minimal in number and in effect size, consistent with them being noise in the comparison. - To assess whether transgene integration at GS94 could confer a transformed phenotype, cells with integrations at the GS94 site were cultured with and without cytokines in vitro. Cells remained alive and viable with cytokine addition, but died without cytokine supplementation and lost their viability (
FIG. 43 ). The positive control Jurkat cells remained viable and proliferated. Overall, this indicates that integration of a transgene at GS94 does not confer capacity for cytokine-independent growth, which is a hallmark of T cell transformation. - In vivo efficacy of T cells with a transgene cassette expressing a CAR recognizing a tumor antigen, or a CAR recognizing a tumor antigen under control of a priming receptor recognizing an antigen in the anatomical vicinity of the tumor, is assessed against human tumor cells such as K562 engineered to express the CAR antigen or to express antigens recognized by both the priming receptor and the CAR. Tumor cells (e.g. 1e6) are subcutaneously injected into the flank of NSG mice (Jackson Laboratories). Tumor growth is assessed by dimensional measurement by calipers every 2-4 days. When the tumor volume reaches ˜100 cubic mm, mice are intravenously injected with 5e6 T cells with a CAR or prime-CAR circuit cassette integrated at a specific site by CRISPR-mediated insertion, or with T cells engineered with CRISPR RNP alone, or with PBS alone as a sham injection. Tumor growth is monitored and mice are euthanized when tumor volume reaches 2000 cubic mm. Peripheral blood is bled from mice through a retro-orbital procedure, and flow cytometry and/or ddPCR is used to observe engineered T cell expansion over time. At time of sacrifice, spleen, blood, tumor and/or other tissue is analyzed via flow cytometry, ddPCR, and/or immunohistochemistry for the presence of engineered T cells. The results demonstrate that T cells engineered with cassette integration at one of the defined genomic loci lead to tumor regression and clearance in injected mice as compared to T cells without cassette integration, and that engineered T cells are detectable in the peripheral blood and tissues of injected mice.
-
- Eyquem, J., Mansilla-Soto, J., Giavridis, T., van der Stegen, S. J Hamieh, M., Cunanan, K. M., . . . & Sadelain, M. (2017). Targeting a CAR to the TRAC locus with CRISPR/Cas9 enhances tumour rejection. Nature, 543(7643), 113-117.
- Sadelain, M., Papapetrou, E. P., & Bushman, F. D. (2012). Safe harbours for the integration of new DNA in the human genome. Nature reviews Cancer, 12(1), 51-58.
- Irion, S., Luche, H., Gadue, P., Fehling, H. J., Kennedy, M., & Keller, G. (2007). Identification and targeting of the ROSA26 locus in human embryonic stem cells. Nature biotechnology, 25(12), 1477-1482.
- Pellenz, S., Phelps, M., Tang, W., Hovde, B. T., Sinit, R. B., Fu, W., . . . & Monnat Jr, R. J. (2019). New human chromosomal sites with “safe harbor” potential for targeted transgene insertion. Human gene therapy, 30(7), 814-828.
- Roth, T. L., Li, P. J., Nies, J. F., Yu, R., Nguyen, M. L., Lee, Y., . . . & Nguyen, D. N. (2019). Rapid discovery of synthetic DNA sequences to rewrite endogenous T cell circuits. bioRxiv, 604561.
-
TABLE 4 SGRNA SEQUENCES USED FOR EVALUATION OF PREDICTED LOCI Median (% Modified), sgRNA_ summarized start_ sgRNA Integra- from 2 sgRNA coor Target tion donors, 2 ID sgRNA Sequence GRCH38 Loci Site primersets sgRNA_ GCACCTGAATACCACGCCTG chr16: APRT APRT 79.28 1 (SEQ ID NO: 1) 88811818 sgRNA_ CGCCTGCGATGTAGTCGATG chr16: APRT APRT 78.60 2 (SEQ ID NO: 2) 88811551 sgRNA_ CAGGACGGGCGAGATGTCCC chr16: APRT APRT 85.25 3 (SEQ ID NO: 3) 88811640 sgRNA_ CTGAATCTTTGGAGTACCTG chr15: B2M B2M 78.51 4 (SEQ ID NO: 4) 44715425 sgRNA_ GGCCACGGAGCGAGACATCT chr15: B2M B2M 94.75 5 (SEQ ID NO: 5) 44711550 sgRNA_ AAGTCAACTTCAATGTCGGA chr15: B2M B2M 70.97 6 (SEQ ID NO: 6) 44715515 sgRNA_ GCTTGGAGGCCTGATCAGCG chr19: CAPNS1 CAPNS1 89.34 7 (SEQ ID NO: 7) 36141111 sgRNA_ CTTATCTCTTCGCAGCGAGG chr19: CAPNS1 CAPNS1 91.09 8 (SEQ ID NO: 8) 36142301 sgRNA_ CACACATTACTCCAACATTG chr19: CAPNS1 CAPNS1 71.98 9 (SEQ ID NO: 9) 36142676 sgRNA_ TTCCGCAAAATAGAGCCCCA chr3: CBLB CBLB 91.55 10 (SEQ ID NO: 10) 105746019 sgRNA_ TGCACAGAACTATCGTACCA chr3: CBLB CBLB 91.43 11 (SEQ ID NO: 11) 105751622 sgRNA_ GCAATAAGACTCTTTAAAGA chr3: CBLB CBLB 76.18 12 (SEQ ID NO: 12) 105853470 sgRNA_ CAAAGAGATTACGAATGCCT chr1: CD2 CD2 89.80 13 (SEQ ID NO: 13) 116754658 sgRNA_ CAAGGCACCCCAGGTTTCCA chr1: CD2 CD2 92.70 14 (SEQ ID NO: 14) 116754663 sgRNA_ TTACGAATGCCTTGGAAACC chr1: CD2 CD2 92.82 15 (SEQ ID NO: 15) 116754666 sgRNA_ CAGAGACGCATCTGACCCTC chr11: CD3E CD3E 90.96 16 (SEQ ID NO: 16) 118315540 sgRNA_ CATGCAGTTCTCACACACTG chr11: CD3E CD3E 87.47 17 (SEQ ID NO: 17) 118313715 sgRNA_ GTGTGAGAACTGCATGGAGA chr11: CD3E CD3E 86.65 18 (SEQ ID NO: 18) 118313715 sgRNA_ TCTCATTTCAGGAAACCACT chr11: CD3G CD3G 87.24 19 (SEQ ID NO: 19) 118349748 sgRNA_ AGTCATACACCTTAACCAAG chr11: CD3G CD3G 87.99 20 (SEQ ID NO: 20) 118349754 sgRNA_ TTCAAGGAAACCAGTTGAGG chr11: CD3G CD3G 86.55 21 (SEQ ID NO: 21) 118352458 sgRNA_ GAGCCTTGCCTGGAAATCTG chr11: CD5 CD5 84.03 22 (SEQ ID NO: 22) 61118177 sgRNA_ AAGCGTCAAAAGTCTGCCAG chr11: CD5 CD5 89.19 23 (SEQ ID NO: 23) 61118324 sgRNA_ CGTTCCAACTCGAAGTGCCA chr11: CD5 CD5 83.11 24 (SEQ ID NO: 24) 61118121 sgRNA_ GAGCGACTGGGACACGGTGA chr9: EDF1 EDF1 88.84 25 (SEQ ID NO: 25) 136866246 sgRNA_ GCTGCGCAAGAAGGGCCCTA chr9: EDF1 EDF1 91.04 26 (SEQ ID NO: 26) 136866211 sgRNA_ TTGTTCTGGCCAGCAGCCCC chr9: EDF1 EDF1 85.98 27 (SEQ ID NO: 27) 136863433 sgRNA_ CTTCCAGAGCCACATCATCG chr19: FTL FTL 93.10 28 (SEQ ID NO: 28) 48965791 sgRNA_ GGGACTCACCAGAGAGAGGT chr19: FTL FTL 88.86 29 (SEQ ID NO: 29) 48965601 sgRNA_ CGGTCGAAATAGAAGCCCTA chr19: FTL FTL 93.14 30 (SEQ ID NO: 30) 48965770 sgRNA_ AAAAGGATATTGTGCAACTG chr10: PTEN PTEN 92.37 31 (SEQ ID NO: 31) 87933015 sgRNA_ TGTGCATATTTATTACATCG chr10: PTEN PTEN 90.64 32 (SEQ ID NO: 32) 87933183 sgRNA_ TTTGTGAAGATCTTGACCAA chr10: PTEN PTEN 85.36 33 (SEQ ID NO: 33) 87933087 sgRNA_ TGTCATGCTGAACCGCATTG chr18: PTPN2 PTPN2 87.94 34 (SEQ ID NO: 34) 12830972 sgRNA_ CCACTCTATGAGGATAGTCA chr18: PTPN2 PTPN2 92.45 35 (SEQ ID NO: 35) 12859219 sgRNA_ TTGACATAGAAGAGGCACAA chr18: PTPN2 PTPN2 93.96 36 (SEQ ID NO: 36) 12836828 sgRNA_ GAGTACTACACTCAGCAGCA chr12: PTPN6 PTPN6 89.61 37 (SEQ ID NO: 37) 6952098 sgRNA_ TCACGCACAAGAAACGTCCA chr12: PTPN6 PTPN6 82.74 38 (SEQ ID NO: 38) 6954872 sgRNA_ AGGTCTCGGTGAAACCACCT chr12: PTPN6 PTPN6 91.27 39 (SEQ ID NO: 39) 6951610 sgRNA_ AGCATTATCCAAAGAGTCCG chr1: PTPRC PTPRC 88.88 40 (SEQ ID NO: 40) 198696873 sgRNA_ ATATTAATTCTTACCAGTGG chr1: PTPRC PTPRC 88.95 41 (SEQ ID NO: 41) 198692370 sgRNA_ AGCTTTAAATCAAGGTTCAT chr1: PTPRC PTPRC 96.89 42 (SEQ ID NO: 42) 198756176 sgRNA_ ATCCCGAGCCCTAAGGTGCA chr11: PTPRCAP PTPRCA 84.08 43 (SEQ ID NO: 43) 67436325 P sgRNA_ GGCAGCGCGGAGGACAGCGT chr11: PTPRCAP PTPRCA 97.74 44 (SEQ ID NO: 44) 67436285 P sgRNA_ CTCAGGGGGCTACTACCACC chr11: PTPRCAP PTPRCA 91.50 45 (SEQ ID NO: 45) 67436170 P sgRNA_ GTCACCGACGAGACCAGAAG chr5: RPS23 RPS23 79.40 46 (SEQ ID NO: 46) 82277810 sgRNA_ GTCGTGGACTTCGTACTGCT chr5: RPS23 RPS23 83.07 47 (SEQ ID NO: 47) 82277843 sgRNA_ TAATTTTTAGGCAAGTGTCG chr5: RPS23 RPS23 61.94 48 (SEQ ID NO: 48) 82277860 sgRNA_ TTAGCTGTTAGACTTGAATA chr14: RTRAF RTRAF 85.50 49 (SEQ ID NO: 49) 51993810 sgRNA_ CGAGAGCCGTCAACTTGCGT chr14: RTRAF RTRAF 85.64 50 (SEQ ID NO: 50) 51989652 sgRNA_ CGGCTTCAACTGCAAAGGTG chr14: RTRAF RTRAF 88.77 51 (SEQ ID NO: 51) 51989700 sgRNA_ TATGAAAAAGCAGAGCGACT chr15: SERF2 SERF2 89.61 52 (SEQ ID NO: 52) 43793025 sgRNA_ TCTGGCGGGCGAGCTCACGC chr15: SERF2 SERF2 86.73 53 (SEQ ID NO: 53) 43792989 sgRNA_ CTCACGCTGGTTACCGCCTA chr15: SERF2 SERF2 80.57 54 (SEQ ID NO: 54) 43792977 sgRNA_ AAAGATTACGAACTTCCCTG chr12: SLC38A1 SLC38A1 92.24 55 (SEQ ID NO: 55) 46207559 sgRNA_ GTTAAAAACAGACATGCCTA chr12: SLC38A1 SLC38A1 91.51 56 (SEQ ID NO: 56) 46229232 sgRNA_ ATGCCTAAGGAGGTTGTACC chr12: SLC38A1 SLC38A1 79.48 57 (SEQ ID NO: 57) 46229246 sgRNA_ CTCCAGGTATCCCATCGAAA chr18: SMAD2 SMAD2 79.53 58 (SEQ ID NO: 58) 47869418 sgRNA_ CACCAAATACGATAGATCAG chr18: SMAD2 SMAD2 86.61 59 (SEQ ID NO: 59) 47870532 sgRNA_ TGGCGGCGTGAATGGCAAGA chr18: SMAD2 SMAD2 82.91 60 (SEQ ID NO: 60) 47896729 sgRNA_ TAGGATGGTAGCACACAACC chr16: SOCS1 SOCS1 92.25 61 (SEQ ID NO: 61) 11255478 sgRNA_ CAGCAGCAGAGCCCCGACGG chr16: SOCS1 SOCS1 83.79 62 (SEQ ID NO: 62) 11255432 sgRNA_ CGGCGTGCGAACGGAATGTG chr16: SOCS1 SOCS1 84.24 63 (SEQ ID NO: 63) 11255296 sgRNA_ TATAGACGCTGCCCGACGTC chr15: SRP14 SRP14 95.12 64 (SEQ ID NO: 64) 40038895 sgRNA_ TCCAAAGAAGGGTACTGTGG chr15: SRP14 SRP14 92.14 65 (SEQ ID NO: 65) 40038368 sgRNA_ ACAGTACCCTTCTTTGGAAT chr15: SRP14 SRP14 65.82 66 (SEQ ID NO: 66) 40038358 sgRNA_ GCGACGGGCGCATCTACGTG chr12: SRSF9 SRSF9 83.68 67 (SEQ ID NO: 67) 120469572 sgRNA_ CCCGACCTCCATAAGTCCTG chr12: SRSF9 SRSF9 92.56 68 (SEQ ID NO: 68) 120465700 sgRNA_ GGGGTCCTCGAAGCGCACGA chr12: SRSF9 SRSF9 89.94 69 (SEQ ID NO: 69) 120469426 sgRNA_ TGCTCTGTTTAGAAGATGAC chr5: SUB1 SUB1 79.36 70 (SEQ ID NO: 70) 32591641 sgRNA_ ATATTCTTTTCTAGTTAAAG chr5: SUB1 SUB1 70.93 71 (SEQ ID NO: 71) 32591566 sgRNA_ CCTGTAAAGAAACAAAAGAC chr5: SUB1 SUB1 93.66 72 (SEQ ID NO: 72) 32591614 sgRNA_ TGGAGAAAGACGTAACTTCG chr4: TET2 TET2 83.53 73 (SEQ ID NO: 73) 105234315 sgRNA_ TCTGCCCTGAGGTATGCGAT chr4: TET2 TET2 90.97 74 (SEQ ID NO: 74) 105234747 sgRNA_ ATTCCGCTTGGTGAAAACGA chr4: TET2 TET2 89.62 75 (SEQ ID NO: 75) 105235656 sgRNA_ CAGGCACAATAGAAACAACG chr3: TIGIT TIGIT 92.65 76 (SEQ ID NO: 76) 114295571 sgRNA_ CCATTTGTAATGCTGACTTG chr3: TIGIT TIGIT 60.75 77 (SEQ ID NO: 77) 114295700 sgRNA_ CTGGGTCACTTGTGCCGTGG chr3: TIGIT TIGIT 87.99 78 (SEQ ID NO: 78) 114295634 sgRNA_ GTCAGGGTTCTGGATATCTG chr14: TRAC TRAC 98.20 79 (SEQ ID NO: 79) 22547508 sgRNA_ TGGATTTAGAGTCTCTCAGC chr14: TRAC TRAC 88.15 80 (SEQ ID NO: 80) 22547541 sgRNA_ CTGCGGCTGTGGTCCAGCTG chr14: TRAC TRAC 94.77 81 (SEQ ID NO: 81) 22550661 sgRNA_ ACAAAACTGTGCTAGACATG chr14: TRAC TRAC 87.86 82 (SEQ ID NO: 82) 22547658 sgRNA_ TTCTTCCCCAGCCCAGGTAA chr14: TRAC TRAC 89.85 83 (SEQ ID NO: 83) 22547778 sgRNA_ CGTCATGAGCAGATTAAACC chr14: TRAC TRAC 95.81 84 (SEQ ID NO: 84) 22550625 sgRNA_ GAGAGCGCCTGCGACCCGAG chr19: TRIM28 TRIM28 89.44 85 (SEQ ID NO: 85) 58544980 sgRNA_ CCAGCGGGTGAAGTACACCA chr19: TRIM28 TRIM28 94.79 86 (SEQ ID NO: 86) 58544869 sgRNA_ GGAGCGCTTTTCGCCGCCAG chr19: TRIM28 TRIM28 91.81 87 (SEQ ID NO: 87) 58544839 sgRNA_ TGAGGCCTGGACCTTATGCA chr10: chr10: desert_ 69.44 88 (SEQ ID NO: 88) 33134193 33130000- 1 33140000 (GS88) sgRNA_ CCTGGTGGAGTGAACCATGA chr10: chr10: desert_ 95.25 89 (SEQ ID NO: 89) 33132917 33130000- 1 33140000 (GS89) sgRNA_ CAAGCACTTAGGTTCCCCTG chr10: chr10: desert_ 91.13 90 (SEQ ID NO: 90) 33134633 33130000- 1 33140000 (GS90) sgRNA_ GGTCTCCCTACAATTCAGCG chr10: chr10: desert_ 92.02 91 (SEQ ID NO: 91) 72294568 72290000- 2 72300000 (GS91) sgRNA_ CACAGCGCGTGACTGCAATG chr10: chr10: desert_ 90.22 92 (SEQ ID NO: 92) 72298268 72290000- 2 72300000 (GS92) sgRNA_ TCTGGGGCACCAATTCTAGG chr10: chr10: desert_ 86.35 93 (SEQ ID NO: 93) 72292786 72290000- 2 72300000 (GS93) sgRNA_ GAGCCATGCTTGGCTTACGA chr11: chr11: desert_ 91.24 94 (SEQ ID NO: 94) 128342576 128340000- 3 128350000 (GS94) sgRNA_ GTACAAGTACTTATCTCATG chr11: chr11: desert_ 89.02 95 (SEQ ID NO: 95) 128343592 128340000- 3 128350000 (GS95) sgRNA_ GAGATAACAACATAACAACA chr11: chr11: desert_ 96.47 96 (SEQ ID NO: 96) 128347170 128340000- 3 128350000 (GS96) sgRNA_ CATATTCCATAGTCTTTGGG chr11: chr11: desert_ 88.54 97 (SEQ ID NO: 97) 65425000 65425000- 4 65427000 (GS97) (NEAT1) sgRNA_ CTGCCCCTTAGCAACTTAGG chr11: chr11: desert_ 92.76 98 (SEQ ID NO: 98) 65425507 65425000- 4 65427000 (GS98) (NEAT1) sgRNA_ TGTTTAAAAATATGTTGACA chr11: chr11: desert_ 90.76 99 (SEQ ID NO: 99) 65426264 65425000- 4 65427000 (GS99) (NEAT1) sgRNA_ CCAGGAATGGAAACTCACGC chr15: chr15: desert_ 87.84 100 (SEQ ID NO: 100) 92830315 92830000- 5 92840000 (GS100) sgRNA_ GAGGCCGCTGAATTAACCCG chr15: chr15: desert_ 85.32 101 (SEQ ID NO: 101) 92831850 92830000- 5 92840000 (GS101) sgRNA_ ATACACGCACACTTGCAGAA chr15: chr15: desert_ 99.92 102 (SEQ ID NO: 102) 92831131 92830000- 5 92840000 (GS102) sgRNA_ GAGCAGACAGAAACCCAGGG chr16: chr16: desert_ 87.92 103 (SEQ ID NO: 103) 11225670 11220000- 6 11230000 (GS103) sgRNA_ TGAGTCTCCAAACAGAACAG chr16: chr16: desert_ 88.53 104 (SEQ ID NO: 104) 11226284 11220000- 6 11230000 (GS104) sgRNA_ TAATATCACTGACTTCACGG chr16: chr16: desert_ 87.65 105 (SEQ ID NO: 105) 11225029 11220000- 6 11230000 (GS105) sgRNA_ TACACACAATGTAAGCAGCA chr2: chr2: desert_ 71.79 106 (SEQ ID NO: 106) 87467461 87460000- 7 87470000 (GS106) sgRNA_ GGGAGCTCAATTCGAAACCA chr2: chr2: desert_ 65.89 107 (SEQ ID NO: 107) 87468809 87460000- 7 87470000 (GS107) sgRNA_ TTGGACAGGTGAGACAGTCG chr2: chr2: desert_ 72.64 108 (SEQ ID NO: 108) 87467001 87460000- 7 87470000 (GS108) sgRNA_ AAGCTCACTCAGATAGTGTG chr3: chr3: desert_ 76.89 109 (SEQ ID NO: 109) 186511316 186510000- 8 186520000 (GS109) sgRNA_ CAGGAGAACCACCTTACACG chr3: chr3: desert_ 86.31 110 (SEQ ID NO: 110) 186515260 186510000- 8 186520000 (GS110) sgRNA_ GGACAGACCCTGATTCACAA chr3: chr3: desert_ 85.47 111 (SEQ ID NO: 111) 186519655 186510000- 8 186520000 (GS111) sgRNA_ ACATGGCAGTCTATGAACAG chr3: chr3: desert_ 87.77 112 (SEQ ID NO: 112) 59451154 59450000- 9 59460000 (GS112) sgRNA_ CCTATAGAGAGTACTACTTG chr3: chr3: desert_ 79.33 113 (SEQ ID NO: 113) 59456416 59450000- 9 59460000 (GS113) sgRNA_ CCAACCGGGTCTTCATTACG chr3: chr3: desert_ 92.21 114 (SEQ ID NO: 114) 59457029 59450000- 9 59460000 (GS114) sgRNA_ TCAAGCGTAGAGTTCCGAGT chr8: chr8: desert_ 93.07 115 (SEQ ID NO: 115) 127993006 127980000- 10 128000000 (GS115) sgRNA_ TCATGCAATTATGGACCCAG chr8: chr8: desert_ 89.40 116 (SEQ ID NO: 116) 127994663 127980000- 10 128000000 (GS116) sgRNA_ CGGGAAAGTGACTGGCCATG chr8: chr8: desert_ 87.45 117 (SEQ ID NO: 117) 127996766 127980000- 10 128000000 (GS117) sgRNA_ TGAGATTGAAATCAAATCGG chr9: chr9: desert_ 84.84 118 (SEQ ID NO: 118) 7974159 7970000- 11 7980000 (GS118) sgRNA_ TATGCAATATTCATCACGCG chr9: chr9: desert_ 85.44 119 (SEQ ID NO: 119) 7977914 7970000- 11 7980000 (GS119) sgRNA_ AATGTGTTAAATCAAATGCA chr9: chr9: desert_ 83.48 120 (SEQ ID NO: 120) 7976895 7970000- 11 7980000 (GS120) -
TABLE 5 CONSTRUCTS USED FOR EVALUATION OF PREDICTED LOCI Construct name Sequence Length pARBI-903: ATCTCGGCCAATAAAGGAGAAAGGGCGCGGCCCGTACGCGCGCCAGGTGC 1806 APRT_1_ GTGGGCGAGACCAGCTCACGCCCCTCCTCCAGCCGCCAAGGCCCCGGCCC Endogenous_1 ACAGCTGCCTGGCTGCAGTCAGAAGCGTAGCCCGAGACAAGGAAGGGCGC CTTGACTCGCACTTTTGTCCGGTTCGAACGTTCTGctcagtggtgcgtgg aatgcgagcgcgtcttaaaatcgatggcgcctaggagtccatgaaatacg gTACAGGCTTCCGGCGACGGATGCCCCGCCCCTCACCCACGCTCCGCCCT CCGGGGATGCCCCACCCCTCGTGGCGGTCCCGCCCGTCCCCGCGCAGGCG CGCTCGGGCTGCCGCTGGCTCTTCGCACGCGGCCATGGCCGACTCCGAGC TGCAGCTGGTTGAGCAGCGGATCCGCAGCTTCCCCGACTTCCCCACCCCA tccggatccggagagggcaggggatctctccttacttgtggcgacgtgga ggagaaccccggccccATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGG TGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTC AGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCTGACCCT GAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCG TGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCAC ATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCA GGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCG AGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGC ATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAA CTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCA TCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAG CTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCT GCTGCCCGACAACCACTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACC CCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCC GGGATCACTCTCGGCATGGACGAGCTGTACAAatgattgtttattgcagc ttataatggttacaaataaagcaatagcatcacaaatttcacaaataaag catttttttcactgcattctagttgtggtttgtccaaactcatcaatgta tcttatGGCGTGGTATTCAGGTGCACGCACAGGCCGCCCTCGTGGCGCCC CGACCTGCGGGCCTACGGATGGGAGCGCGTGGCCCGCGACCTCCGGGCGG GCGGGGGGGGAACCCTCGTCTTTCGCCCCCGGGGCCCTGCCCTCCTTCGG CCCCGGCGTCACCAGGCCTGTCCTTGGGTCCAGGGACATCTCGCCCGTCC TGAAGGACCCCGCCTCCTTCCGCGCCGCCATCGGCCTCCTGGCGCGACAC CTGAAGGCGACCCACGGGGGCCGCATCGACTACATCGCAGGCGAGTGCCC AGTGGCCGCATCTAGGGCGCTTCCGCCTCTGCGCGCGCCGAGGGCAGCAC GTGGGCTCTGCGCGTCTGCTTGGGGGAGGGCCTTTGGGGTGCTTCAGGGG GCGCCGGGACGGGCGCCGTGCTTGGGTCGCCCGGGAAGGGTTGTGAGATT GAGCCC (SEQ ID NO: 123) pARBI-904: TGCCCCGCCCCTCACCCACGCTCCGCCCTCCGGGGATGCCCCACCCCTCG 1806 APRT_2_ TGGCGGTCCCGCCCGTCCCCGCGCAGGCGCGCTCGGGCTGCCGCTGGCTC Endogenous_2 TTCGCACGCGGCCATGGCCGACTCCGAGCTGCAGCTGGTTGAGCAGCGGA TCCGCAGCTTCCCCGACTTCCCCACCCCAGGCGTGGTATTCAGGTGCACG CACAGGCCGCCCTCGTGGCGCCCCGACCTGCGGGCCTACGGATGGGAGCG CGTGGCCCGCGACCTCCGGGGGGGGGGGGCGGGAACCCTCGTCTTTCGCC CCCGGGGCCCTGCCCTCCTTCGGCCCCGGCGTCACCAGGCCTGTCCTTGG GTCCAGGGACATCTCGCCCGTCCTGAAGGACCCCGCCTCCTTCCGCGCCG CCATCGGCCTCCTGGCGCGACACCTGAAGGCGACCCACGGGGGCCGCATC tccggatccggagagggcaggggatctctccttacttgtggcgacgtgga ggagaaccccggccccATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGG TGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTC AGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCTGACCCT GAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCG TGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCAC ATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCA GGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCG AGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGC ATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAA CTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCA TCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAG CTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCT GCTGCCCGACAACCACTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACC CCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCC GGGATCACTCTCGGCATGGACGAGCTGTACAAatgattgtttattgcagc ttataatggttacaaataaagcaatagcatcacaaatttcacaaataaag catttttttcactgcattctagttgtggtttgtccaaactcatcaatgta tcttatGACTACATCGCAGGCGAGTGCCCAGTGGCCGCATCTAGGGCGCT TCCGCCTCTGCGCGCGCCGAGGGCAGCACGTGGGCTCTGCGCGTCTGCTT GGGGGAGGGCCTTTGGGGTGCTTCAGGGGGCGCCGGGACGGGCGCCGTGC TTGGGTCGCCCGGGAAGGGTTGTGAGATTGAGCCCCCGAGGCCGCCGCGC TGTGCAGGCGTCCTTCCCGCAGGTTCCGGGTCCCCAGCCCAGGACAGGCG TGACCGAGTTGCCGGGTCAGTTGGTCTCCCTGGAGTGCCCAAGCTGAATC CACAGGGCCCAGCTGCCTTGCTTCTTGTTCCTTCTGCGAGCTGGTATTGA GCGCCTGCCACGAGCCAGGCCTTCCCTGGTGAAGATCACGGAATGCCCAC CCAGGGAAGGGAGGCCTGGAGGCCTCCGGGAGAGCCCAAGAGGTGGCCCA GGGAGA (SEQ ID NO: 124) pARBI-905: CGTTCTGctcagtggtgcgtggaatgcgagcgcgtcttaaaatcgatggc 1806 APRT_3_ gcctaggagtccatgaaatacggTACAGGCTTCCGGCGACGGATGCCCCG Endogenous_3 CCCCTCACCCACGCTCCGCCCTCCGGGGATGCCCCACCCCTCGTGGCGGT CCCGCCCGTCCCCGCGCAGGCGCGCTCGGGCTGCCGCTGGCTCTTCGCAC GCGGCCATGGCCGACTCCGAGCTGCAGCTGGTTGAGCAGCGGATCCGCAG CTTCCCCGACTTCCCCACCCCAGGCGTGGTATTCAGGTGCACGCACAGGC CGCCCTCGTGGCGCCCCGACCTGCGGGCCTACGGATGGGAGCGCGTGGCC CGCGACCTCCGGGCGGGCGGGGGGGGAACCCTCGTCTTTCGCCCCCGGGG CCCTGCCCTCCTTCGGCCCCGGCGTCACCAGGCCTGTCCTTGGGTCCAGG tccggatccggagagggcaggggatctctccttacttgtggcgacgtgga ggagaaccccggccccATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGG TGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTC AGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCTGACCCT GAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCG TGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCAC ATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCA GGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCG AGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGC ATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAA CTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCA TCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAG CTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCT GCTGCCCGACAACCACTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACC CCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCC GGGATCACTCTCGGCATGGACGAGCTGTACAAatgattgtttattgcagc ttataatggttacaaataaagcaatagcatcacaaatttcacaaataaag catttttttcactgcattctagttgtggtttgtccaaactcatcaatgta tcttatGACATCTCGCCCGTCCTGAAGGACCCCGCCTCCTTCCGCGCCGC CATCGGCCTCCTGGCGCGACACCTGAAGGCGACCCACGGGGGCCGCATCG ACTACATCGCAGGCGAGTGCCCAGTGGCCGCATCTAGGGCGCTTCCGCCT CTGCGCGCGCCGAGGGCAGCACGTGGGCTCTGCGCGTCTGCTTGGGGGAG GGCCTTTGGGGTGCTTCAGGGGGCGCCGGGACGGGCGCCGTGCTTGGGTC GCCCGGGAAGGGTTGTGAGATTGAGCCCCCGAGGCCGCCGCGCTGTGCAG GCGTCCTTCCCGCAGGTTCCGGGTCCCCAGCCCAGGACAGGCGTGACCGA GTTGCCGGGTCAGTTGGTCTCCCTGGAGTGCCCAAGCTGAATCCACAGGG CCCAGCTGCCTTGCTTCTTGTTCCTTCTGCGAGCTGGTATTGAGCGCCTG CCACGA (SEQ ID NO: 125) pARBI-906: AGTAATTTGATGGGGGCTATTATGAACTGAGAAATGAACTTTGAAAAGTA 1806 B2M_1_ TCTTGGGGCCAAATCATGTAGACTCTTGAGTGATGTGTTAAGGAATGCTA Endogenous_4 TGAGTGCTGAGAGGGCATCAGAAGTCCTTGAGAGCCTCCAGAGAAAGGCT CTTAAAAATGCAGCGCAATCTCCAGTGACAGAAGATACTGCTAGAAATCT GCTAGAAAAAAAACAAAAAAGGCATGTATAGAGGAATTATGAGGGAAAGA TACCAAGTCACGGTTTATTCTTCAAAATGGAGGTGGCTTGTTGGGAAGGT GGAAGCTCATTTGGCCAGAGTGGAAATGGAATTGGGAGAAATCGATGACC AAATGTAAACACTTGGTGCCTGATATAGCTTGACACCAAGTTAGCCCCAA GTGAAATACCCTGGCAATATTAATGTGTCTTTTCCCGATATTCCTCAGGT tccggatccggagagggcaggggatctctccttacttgtggcgacgtgga ggagaaccccggccccATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGG TGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTC AGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCTGACCCT GAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCG TGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCAC ATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCA GGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCG AGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGC ATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAA CTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCA TCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAG CTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCT GCTGCCCGACAACCACTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACC CCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCC GGGATCACTCTCGGCATGGACGAGCTGTACAAatgattgtttattgcagc ttataatggttacaaataaagcaatagcatcacaaatttcacaaataaag catttttttcactgcattctagttgtggtttgtccaaactcatcaatgta tcttatACTCCAAAGATTCAGGTTTACTCACGTCATCCAGCAGAGAATGG AAAGTCAAATTTCCTGAATTGCTATGTGTCTGGGTTTCATCCATCCGACA TTGAAGTTGACTTACTGAAGAATGGAGAGAGAATTGAAAAAGTGGAGCAT TCAGACTTGTCTTTCAGCAAGGACTGGTCTTTCTATCTCTTGTACTACAC TGAATTCACCCCCACTGAAAAAGATGAGTATGCCTGCCGTGTGAACCATG TGACTTTGTCACAGCCCAAGATAGTTAAGTGGGGTAAGTCTTACATTCTT TTGTAAGCTGCTGAAAGTTGTGTATGAGTAGTCATATCATAAAGCTGCTT TGATATAAAAAAGGTCTATGGCCATACTACCCTGAATGAGTCCCATCCCA TCTGATATAAACAATCTGCATATTGGGATTGTCAGGGAATGTTCTTAAAG ATCAGA (SEQ ID NO: 126) pARBI-907: AAGATCTTAATCTTCTGGGTTTCCGTTTTCTCGAATGAAAAATGCAGGTC 1806 B2M_2_ CGAGCAGTTAACTGGCTGGGGCACCATTAGCAAGTCACTTAGCATCTCTG Endogenous_5 GGGCCAGTCTGCAAAGCGAGGGGGCAGCCTTAATGTGCCTCCAGCCTGAA GTCCTAGAATGAGCGCCCGGTGTCCCAAGCTGGGGCGCGCACCCCAGATC GGAGGGCGCCGATGTACAGACAGCAAACTCACCCAGTCTAGTGCATGCCT TCTTAAACATCACGAGACTCTAAGAAAAGGAAACTGAAAACGGGAAAGTC CCTCTCTCTAACCTGGCACTGCGTCGCTGGCTTGGAGACAGGTGACGGTC CCTGCGGGCCTTGTCCTGATTGGCTGGGCACGCGTTTAATATAAGTGGAG GCGTCGCGCTGGCGGGCATTCCTGAAGCTGACAGCATTCGGGCCGAGATG tccggatccggagagggcaggggatctctccttacttgtggcgacgtgga ggagaaccccggccccATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGG TGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTC AGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCTGACCCT GAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCG TGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCAC ATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCA GGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCG AGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGC ATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAA CTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCA TCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAG CTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCT GCTGCCCGACAACCACTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACC CCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCC GGGATCACTCTCGGCATGGACGAGCTGTACAAatgattgtttattgcagc ttataatggttacaaataaagcaatagcatcacaaatttcacaaataaag catttttttcactgcattctagttgtggtttgtccaaactcatcaatgta tcttatTCTCGCTCCGTGGCCTTAGCTGTGCTCGCGCTACTCTCTCTTTC TGGCCTGGAGGCTATCCAGCGTGAGTCTCTCCTACCCTCCCGCTCTGGTC CTTCCTCTCCCGCTCTGCACCCTCTGTGGCCCTCGCTGTGCTCTCTCGCT CCGTGACTTCCCTTCTCCAAGTTCTCCTTGGTGGCCCGCCGTGGGGCTAG TCCAGGGCTGGATCTCGGGGAAGCGGGGGGTGGCCTGGGAGTGGGGAAGG GGGTGCGCACCCGGGACGCGCGCTACTTGCCCCTTTCGGCGGGGAGCAGG GGAGACCTTTGGCCTACGGCGACGGGAGGGTCGGGACAAAGTTTAGGGCG TCGATAAGCGTCAGAGCGCCGAGGTTGGGGGAGGGTTTCTCTTCCGCTCT TTCGCGGGGCCTCTGGCTCCCCCAGCGCAGCTGGAGTGGGGGACGGGTAG GCTCG (SEQ ID NO: 127) pARBI-908: AGGAATGCTATGAGTGCTGAGAGGGCATCAGAAGTCCTTGAGAGCCTCCA 1806 B2M_3_ GAGAAAGGCTCTTAAAAATGCAGCGCAATCTCCAGTGACAGAAGATACTG Endogenous_6 CTAGAAATCTGCTAGAAAAAAAACAAAAAAGGCATGTATAGAGGAATTAT GAGGGAAAGATACCAAGTCACGGTTTATTCTTCAAAATGGAGGTGGCTTG TTGGGAAGGTGGAAGCTCATTTGGCCAGAGTGGAAATGGAATTGGGAGAA ATCGATGACCAAATGTAAACACTTGGTGCCTGATATAGCTTGACACCAAG TTAGCCCCAAGTGAAATACCCTGGCAATATTAATGTGTCTTTTCCCGATA TTCCTCAGGTACTCCAAAGATTCAGGTTTACTCACGTCATCCAGCAGAGA ATGGAAAGTCAAATTTCCTGAATTGCTATGTGTCTGGGTTTCATCCATCC tccggatccggagagggcaggggatctctccttacttgtggcgacgtgga ggagaaccccggccccATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGG TGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTC AGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCTGACCCT GAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCG TGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCAC ATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCA GGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCG AGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGC ATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAA CTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCA TCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAG CTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCT GCTGCCCGACAACCACTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACC CCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCC GGGATCACTCTCGGCATGGACGAGCTGTACAAatgattgtttattgcagc ttataatggttacaaataaagcaatagcatcacaaatttcacaaataaag catttttttcactgcattctagttgtggtttgtccaaactcatcaatgta tcttatGACATTGAAGTTGACTTACTGAAGAATGGAGAGAGAATTGAAAA AGTGGAGCATTCAGACTTGTCTTTCAGCAAGGACTGGTCTTTCTATCTCT TGTACTACACTGAATTCACCCCCACTGAAAAAGATGAGTATGCCTGCCGT GTGAACCATGTGACTTTGTCACAGCCCAAGATAGTTAAGTGGGGTAAGTC TTACATTCTTTTGTAAGCTGCTGAAAGTTGTGTATGAGTAGTCATATCAT AAAGCTGCTTTGATATAAAAAAGGTCTATGGCCATACTACCCTGAATGAG TCCCATCCCATCTGATATAAACAATCTGCATATTGGGATTGTCAGGGAAT GTTCTTAAAGATCAGATTAGTGGCACCTGCTGAGATACTGATGCACAGCA TGGTTTCTGAACCAGTAGTTTCCCTGCAGTTGAGCAGGGAGCAGCAGCAG CACTTG (SEQ ID NO: 128) pARBI-909: GATCTCACATTCTAGCGCCCAGACATGTCGGGAACGCCTTACGCCAACTT 1806 CAPNS1_1_ GGGGTCCCCACCAAGAGAACCCCCACCAGATCTGCACCCTCCCCTTCACG Endogenous_7 CGTGCACCCAGTCCAGGCTCCCTCAAGCCCCACGGGTGCCTTTTAGACCT GAGGAGGTTGCAAACCTGATCCCCCATACCTGCCCCACCCATCCGCGGAC AACCCGCCCTCGCAAACTCAGACCCCCACCCGGAGGCTTCAGATTCCTCC CAGGTCCAGCTGCCGGAAATGCGTGTTTGAAGGGAGGGTGTGGGCTCAGG GGCGAAGCACCCACTGGTCCCCTTTTTTCCCCCCAGCAGTGAGTCGCAGC CATGTTCCTGGTTAACTCGTTCTTGAAGGGCGGCGGCGGCGGCGGCGGGG GAGGCGGGGGCCTGGGTGGGGGCCTGGGAAATGTGCTTGGAGGCCTGATC tccggatccggagagggcaggggatctctccttacttgtggcgacgtgga ggagaaccccggccccATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGG TGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTC AGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCTGACCCT GAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCG TGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCAC ATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCA GGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCG AGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGC ATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAA CTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCA TCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAG CTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCT GCTGCCCGACAACCACTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACC CCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCC GGGATCACTCTCGGCATGGACGAGCTGTACAAatgattgtttattgcagc ttataatggttacaaataaagcaatagcatcacaaatttcacaaataaag catttttttcactgcattctagttgtggtttgtccaaactcatcaatgta tcttatAGCGGGGCCGGGGGCGGCGGCGGCGGCGGCGGCGGCGGCGGCGG TGGTGGAGGCGGCGGTGGCGGTGGAACGGCCATGCGCATCCTAGGCGGAG TCATCAGCGCCATCAGGTAAGGCGGAGACTATCAGAGGGGGGGGCCTGGG AATGGGAGGAGCCTCAGTGAGGCGTGGTCTGGGAGGGGCGTGGTCTAAAA ATAGAATAGGATTAACCTGGAGGCTAACCTGGGTACATGAATTAGGCCGG GGAGGCCTGGTTTGAGAGTTCTGCTGTAAGGGGGTGGACCCCAGTGAGGC GGATATCAGTCATTGGGGGCGGTGCTTGATATGGGAGTAGTCTGATTGTG TGGGACCAGGACAATGTGGTCCTGAGGGGACTGATGTGGAGTTTTGGCGG GTGGGGCTTATGGGTCTGGGCTCAGCCTGCATTGGCTAACCTGGAGATGA ACGTT (SEQ ID NO: 129) pARBI-910: CGGGCATTTAGGAAGCGGGCCATGGAGTAGGGTCTTGGGCAGATGGCACC 1806 CAPNS1_2_ TGAGGAGGATAAGGCCTTGGGAAGCTGGGGCAGAGCCTCTATGAAGTGAG Endogenous_8 CCCTCCAAGGGGCGGGGTCTTTTGATTCTACGTGGGATTTTTTAGGTTTA GGGTGGCCAAGATGACTGAAATCTGCCACTGGGTAGGTGTGCCTGGCAGG AGGGGAGCCTCCCAGGGGACCGGTCTCTGGGTTTCCTCGAGGGTGGGGTT GGCCTGAGGAAGGGAGAAGAGGGGCACGACCAGGGCAGTGTGGATTGGGA CAGATGAGGACAAGAACAAATGAAAGGCACAGCAACCAAGTAAGGAAGAT AACGGCTGGGGTCTGGAGCGTTGGGGCTGATGGTTCTGTAGTGCTGCCCG TTGGAGGCCCCGCCCCTGGCACTAACCCCTCCCCCTTATCTCTTCGCAGC tccggatccggagagggcaggggatctctccttacttgtggcgacgtgga ggagaaccccggccccATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGG TGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTC AGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCTGACCCT GAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCG TGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCAC ATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCA GGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCG AGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGC ATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAA CTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCA TCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAG CTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCT GCTGCCCGACAACCACTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACC CCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCC GGGATCACTCTCGGCATGGACGAGCTGTACAAatgattgtttattgcagc ttataatggttacaaataaagcaatagcatcacaaatttcacaaataaag catttttttcactgcattctagttgtggtttgtccaaactcatcaatgta tcttatGAGGCGGCTGCGCAGTACAACCCGGAGCCCCCGGTAAGCCCCCT CTGCAACCAGACCCCCTTCTCCTGCCAAGGCCTCTTCGAGGTCCCATCCC TGTTCCTGTAGAGAAGCCCCACCTTCCTCCCCTTCTTGTGAAATTCCTCT GCCAGTTCCTCCCATGCCGTGTCTGCAGCTTCGCCATGGGTCTTAGCCAT GCCCCACATACGTGCACCCCATTCACTACCACCCCTCATTCTTTTCCTCA TCAAGCTGCCCAGCCCACTCTGACTTCCCCACCCAGGGTACCTGGGTTTG GGGAGCCGTCCTGGCCGGGTTCCCCTCCCCCTGCTCTGAGCTCTCCTCCC TTTGCAGCCCCCACGCACACATTACTCCAACATTGAGGCCAACGAGAGTG AGGAGGTCCGGCAGTTCCGGAGACTCTTTGCCCAGCTGGCTGGAGATGTA AGTAAC (SEQ ID NO: 130) pARBI-911: GCTGATGGTTCTGTAGTGCTGCCCGTTGGAGGCCCCGCCCCTGGCACTAA 1806 CAPNS1_3_ CCCCTCCCCCTTATCTCTTCGCAGCGAGGCGGCTGCGCAGTACAACCCGG Endogenous_9 AGCCCCCGGTAAGCCCCCTCTGCAACCAGACCCCCTTCTCCTGCCAAGGC CTCTTCGAGGTCCCATCCCTGTTCCTGTAGAGAAGCCCCACCTTCCTCCC CTTCTTGTGAAATTCCTCTGCCAGTTCCTCCCATGCCGTGTCTGCAGCTT CGCCATGGGTCTTAGCCATGCCCCACATACGTGCACCCCATTCACTACCA CCCCTCATTCTTTTCCTCATCAAGCTGCCCAGCCCACTCTGACTTCCCCA CCCAGGGTACCTGGGTTTGGGGAGCCGTCCTGGCCGGGTTCCCCTCCCCC TGCTCTGAGCTCTCCTCCCTTTGCAGCCCCCACGCACACATTACTCCAAC tccggatccggagagggcaggggatctctccttacttgtggcgacgtgga ggagaaccccggccccATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGG TGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTC AGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCTGACCCT GAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCG TGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCAC ATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCA GGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCG AGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGC ATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAA CTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCA TCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAG CTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCT GCTGCCCGACAACCACTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACC CCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCC GGGATCACTCTCGGCATGGACGAGCTGTACAAatgattgtttattgcagc ttataatggttacaaataaagcaatagcatcacaaatttcacaaataaag catttttttcactgcattctagttgtggtttgtccaaactcatcaatgta tcttatATTGAGGCCAACGAGAGTGAGGAGGTCCGGCAGTTCCGGAGACT CTTTGCCCAGCTGGCTGGAGATGTAAGTAACCTGGGGTCCCTGGCCCCGT CCTAACCGTTCCATCCCTTCCCTTGTGGCTGCCCTTGCACACACACCCTT GACCATGACAATCCCAGTGTTCCCATTCTCCATGACATTCTCAGACCCCT TTCAGTCACCCCTGACCTGCCCCTAACTTCCGCCCGCAGGACATGGAGGT CAGCGCCACAGAACTCATGAACATTCTCAATAAGGTTGTGACACGACGTA AGTGACCGGGGTTAAGGAATAGGGTAGATTCAGAGGCAGAGGGGTCAGAG AGGATTTGACCTCTGGCCTCTGACTTTCAACCTGTTACCCACAGACCCTG ATCTGAAGACTGATGGTTTTGGCATTGACACATGTCGCAGCATGGTGGCC GTGATG (SEQ ID NO: 131) pARBI-912: ggagagttatagcagatggagacaaaaaagggaactgctggggatcaaac 1806 CBLB_1_ cttgtaaagtccttctaagtgatatatgaggactttgacttttattcaaa Endogenous_10 gttagaaggctttgaacaaaagaattatttcatttgtgttttacaatggt cacaagtgccgtagcgagaataggctgTGTAGAGAAGAGATTTTGTGGTT TTCTTTTTTGTTCTCTGTATTCTCTTCCTGAAAAATCTGTTTTCTGCCTA ATCATATTTCTGTAAAGAACAATAGACTTGAACGTCTGCTGTTAAGTAAC AAAGGTTGACCTAAACCACATAAGGTCAGATTAACTTTTAAAATATCCAA GATAATGTTAAGTGTATTAAATATGGTTCAGTATGTAATCAAAATATTTG ATATTCTCTAGATACTAATCTCttttaatttttttattttttAGCCTTGG tccggatccggagagggcaggggatctctccttacttgtggcgacgtgga ggagaaccccggccccATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGG TGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTC AGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCTGACCCT GAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCG TGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCAC ATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCA GGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCG AGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGC ATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAA CTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCA TCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAG CTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCT GCTGCCCGACAACCACTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACC CCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCC GGGATCACTCTCGGCATGGACGAGCTGTACAAatgattgtttattgcagc ttataatggttacaaataaagcaatagcatcacaaatttcacaaataaag catttttttcactgcattctagttgtggtttgtccaaactcatcaatgta tcttatGGCTCTATTTTGCGGAATTGGAATTTCTTAGCTGTGACACATCC AGGTTACATGGCATTTCTCACATATGATGAAGTTAAAGCACGACTACAGA AATATAGCACCAAACCCGGAAGGTAAGACTTTTTAGCAGTAATATTATGT GATATATCCAAAGATGAAAAATTCTCCATGGTTTGTTCCTAAGTAAGTCA GCAATACCCGCTGATGGCATGCTTGGGAGGGGGCTAGAAAAGGTAAAAGA CCAAAATGGTGCTTCTTAACATAATGTTTTAAATTGATACATTGTGCTTA CTCTGAAACTTGGTATTCATAAAGTGAAAGGGGATATGGTCTCCTTTCTG TTTTCATGATATCCGTGTTTATGTGCAAAATTGAATGTATATATTTGAAT ACTTGAATACTGTGCTAAAGAACAAATAAATAGAACAGTCTTATGTTCAT TACTTT (SEQ ID NO: 132) pARBI-913: CATATTACTCATTTGTGCAATGAATAAAGGATTTCTATTATTATATGGAC 1806 CBLB_2_ CCATCATCTTATGACTGTTTTTAGAAAATGAGAATTAAATCTTGATTCAT Endogenous_11 TATTAAATAACCTTGTATACACACATGAAGAGGTCATGACACATAAGACT TTGACCTAATACCTTTTTTCAGATATGTTTCTATAGCTATCTTGATAGCC TAGGACTGTTTGAGAGAAAATTAATTTAAATTTTTGTCATATTCTTAGCA AATGCAAAATAATTGAGTTAAACAGAACCCACTACATTTGTTCTGAAATA TGAACAATATTCTCAAATATTTGTCTGGTAGTACAAATACAATTATATTA ATTTTATTATTGAATTTTGCTGCTTCAAAGGGAGGTATTTCTTAAATGAT ACTTGATTCAATTATTTCCCTTTTTTTCCACCTTGCACAGAACTATCGTA tccggatccggagagggcaggggatctctccttacttgtggcgacgtgga ggagaaccccggccccATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGG TGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTC AGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCTGACCCT GAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCG TGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCAC ATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCA GGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCG AGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGC ATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAA CTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCA TCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAG CTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCT GCTGCCCGACAACCACTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACC CCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCC GGGATCACTCTCGGCATGGACGAGCTGTACAAatgattgtttattgcagc ttataatggttacaaataaagcaatagcatcacaaatttcacaaataaag catttttttcactgcattctagttgtggtttgtccaaactcatcaatgta tcttatCCATGGAAAGTATTCAGACAGTGCCTTCATGAGGTCCACCAGAT TAGCTCTGGCCTGGAAGCAATGGCTCTAAAATCAACAATTGATTTAACTT GCAATGATTACATTTCAGTTTTTGAATTTGATATTTTTACCAGGCTGTTT CAGGTAAGTTTATACTTGCTTAGTCTATCCATTCCCTtcttctctctccc tgtcttttctctctgtttctctctctctctgtctctctctcacacacaca cacaTAAACAAACATATGGAAAATAGTGTCATTCCTATTAGTCTGTAATT TTATTGGTATGTTTGGTCATTGTTAAAAGTTTAGAGTGAGCCATTTGGCT TAAGTTTGGAGTTATCAGATGCTGTGAGCCTGGTGGCTGCTTGTTAGCTT TGAGAGGATCCATCCAAAAGAGCTTTGTTTTAAACTCTCTACTTAGTTTC TTACTT (SEQ ID NO: 133) pARBI-914: TTAAACTTTAGAGGTTCAAAATAAGGATATGATATAGAAAAGATTGAATG 1806 CBLB_3_ GTAGGAGAACAGAATAGGATTTATTTATATTATGACAAACGTGACTATCA Endogenous_12 GTTAATAAATTGGAGATAGTAAGTTAAGATCATTATTTATATTATGGAAA AATGAAGAAATAGGACATGGTTCTCTAACTTTTTAATTTTTCTGTGAAAT AAAATATTATGACTTATTTTATTTTTATCTTTGTTTTTAGGTAAGACTGT GCCAAAATCCCAAACTTCAGTTGAAAAATAGCCCACCATATATACTTGAT ATTTTGCCTGATACATATCAGCATTTACGACTTATATTGAGTAAATATGA TGACAACCAGAAACTTGCCCAACTCAGTGAGAATGAGTACTTTAAAATCT ACATTGATAGCCTTATGAAAAAGTCAAAACGGGCAATAAGACTCTTTAAA tccggatccggagagggcaggggatctctccttacttgtggcgacgtgga ggagaaccccggccccATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGG TGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTC AGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCTGACCCT GAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCG TGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCAC ATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCA GGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCG AGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGC ATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAA CTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCA TCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAG CTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCT GCTGCCCGACAACCACTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACC CCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCC GGGATCACTCTCGGCATGGACGAGCTGTACAAatgattgtttattgcagc ttataatggttacaaataaagcaatagcatcacaaatttcacaaataaag catttttttcactgcattctagttgtggtttgtccaaactcatcaatgta tcttatGAAGGCAAGGAGAGAATGTATGAAGAACAGTCACAGGACAGGTA AGAAGAATATTTCAGATGTTTTGGTGTAAAGGTCATTTATGTTGCTTTTT ACTTAATAGTTAACCTAAACGTCACCATGTAATTGTTTTGGGGTAAATGT GAGTCGCTTTGTATATagtcatgtggtacttaacgatggggatactttac aagaaatgcatttttagacaatttcattgttgtgtggacatgatagagtg tacttacacagagctagatggtattacctaccgcacacctaggctgtatg atgtagcctattgctcctaggctgcaaatctatacagtatgttactgtac tgaatactgtgggcagttgtaacacaatggtaagtatgtgtgtatctaaa tatacctaaacatagaaaaggtatggtaaaaatactgtataaaaggccgg gcgtgg (SEQ ID NO: 134) pARBI-915: AGATGAGAAAACCTATCCTTCCCAATTTTTTTGTGTGAGAATTAAAATGC 1806 CD2_1_ AGCAAGAAAACACACACTCATAAACACATCTGCTTTGGCAAAGGAGCACA Endogenous_13_ TCAGAAGGGCTGGCTTGTGCGCGCTCTTGCTCTCTGTGTATGTGTATTAT 14_15 GTTTTATGTTACTGTAAAAGATGTAAAGAGAGGCACGTGGTTAAGCTCTC GGGGTGTGGACTCCACCAGTCTCACTTCAGTTCCTTTTGCATGAAGAGCT CAGAATCAAAAGAGGAAACCAACCCCTAAGATGAGCTTTCCATGTAAATT TGTAGCCAGCTTCCTTCTGATTTTCAATGTTTCTTCCAAAGGTAAGCATA AGAGTCAAAGAAGTCCCAACCCAGCTTTCCCTGAAAGTGACTCTCAGTAA CTCTTTTGCTTTTTATAGGTGCAGTCTCCAAAGAGATTACGAATGCCTTG tccggatccggagagggcaggggatctctccttacttgtggcgacgtgga ggagaaccccggccccATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGG TGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTC AGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCTGACCCT GAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCG TGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCAC ATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCA GGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCG AGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGC ATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAA CTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCA TCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAG CTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCT GCTGCCCGACAACCACTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACC CCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCC GGGATCACTCTCGGCATGGACGAGCTGTACAAatgattgtttattgcagc ttataatggttacaaataaagcaatagcatcacaaatttcacaaataaag catttttttcactgcattctagttgtggtttgtccaaactcatcaatgta tcttatGAAACCTGGGGTGCCTTGGGTCAGGACATCAACTTGGACATTCC TAGTTTTCAAATGAGTGATGATATTGACGATATAAAATGGGAAAAAACTT CAGACAAGAAAAAGATTGCACAATTCAGAAAAGAGAAAGAGACTTTCAAG GAAAAAGATACATATAAGCTATTTAAAAATGGAACTCTGAAAATTAAGCA TCTGAAGACCGATGATCAGGATATCTACAAGGTATCAATATATGATACAA AAGGAAAAAATGTGTTGGAAAAAATATTTGATTTGAAGATTCAAGGTAAG TGTTCATTCCCTTAATTGCTTTATTTCAGTGTGGGTGCTATTTGGCAAGT TGGAAAATAGCATTTCTAATATTCCCCAGCGCTGACCTCTGCCTCCAGGG GGGCTATACAGGAAACCAGATGCATGTCTCCTGCCCCAGTGGAGACTGTG GTCCAA (SEQ ID NO: 135) pARBI-916: AGGATTGTTCTAGTTGATTGGTATGTGTGCACTCCTAGTTGTTAAATATT 1806 CD3E_1_ TTCACTATCACACCTGGATATACTCAACAAATATTTGTTGAGCCAAATAC Endogenous_16 TCAACACCAGCCAAACACGTAGTATTTACTTTAGCTTAAGCGAATTATTT AGCCCTGACAGAAGCCCTGGAATGTGGGTCTTTAAGTTCCTATTTTTGAG ATGGGAAAGCTGAGGCTCACGGAAGGAGGTGACCAGCTCAAGTCTCCTAC CGTCCATGCCAAATTAGAATTCCAGCCTGCCTCCTGACTTCAAGTCCAAA GTTCTTCCCACGCACTAAAGCTAGCTCTTCAGTGTCCTTTCTTAGGAGGT ACTTCCTCCCGCACCACTGACCGCCCCCTCTCTATTTCACCCCCAGCCCA TCCGGAAAGGCCAGCGGGACCTGTATTCTGGCCTGAATCAGAGACGCATC tccggatccggagagggcaggggatctctccttacttgtggcgacgtgga ggagaaccccggccccATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGG TGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTC AGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCTGACCCT GAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCG TGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCAC ATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCA GGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCG AGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGC ATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAA CTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCA TCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAG CTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCT GCTGCCCGACAACCACTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACC CCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCC GGGATCACTCTCGGCATGGACGAGCTGTACAAatgattgtttattgcagc ttataatggttacaaataaagcaatagcatcacaaatttcacaaataaag catttttttcactgcattctagttgtggtttgtccaaactcatcaatgta tcttatTGACCCTCTGGAGAACACTGCCTCCCGCTGGCCCAGGTCTCCTC TCCAGTCCCCCTGCGACTCCCTGTTTCCTGGGCTAGTCTTGGACCCCACG AGAGAGAATCGTTCCTCAGCCTCATGGTGAACTCGCGCCCTCCAGCCTGA TCCCCCGCTCCCTCCTCCCTGCCTTCTCTGCTGGTACCCAGTCCTAAAAT ATTGCTGCTTCCTCTTCCTTTGAAGCATCATCAGTAGTCACACCCTCACA GCTGGCCTGCCCTCTTGCCAGGATATTTATTTGTGCTATTCACTCCCTTC CCTTTGGATGTAACTTCTCCGTTCAGTTCCCTCCTTTTCTTGCATGTAAG TTGTCCCCCATCCCAAAGTATTCCATCTACTTTTCTATCGCCGTCCCCTT TTGCAGCCCTCTCTGGGGATGGACTGGGTAAATGTTGACAGAGGCCCTGC CCCGTT (SEQ ID NO: 136) pARBI-917: AATAATAAAATAATAACAATACTTAACATTTATTGAGTGCTTATTAAGTC 1806 CD3E_2_ TCAAGCACTGTCTGTACCCAACACTTATCAAGGATTCTTTTTCATGTAAT Endogenous_17_18 CCTCTCAACAACTATATGGGTTAAGTATCATTTTATTCCCATGAGTAAAG GGATGAGGAAACAGAGGGTTTGTGAGTTGAAAACACATTTCACGCTTCTC ACAGCTAGTGAGTAATAAAGCTGGGACTCAAACCCAGGGCTGTTTGACTC CAGTGCCTCTACCCACGGCCACCACTCTTTGCTTGTCAATGTTGTTCTAA ACATATTGAAGGGGGGGCTCTGACCGTGGCAAGCGTGTGAGTAGTAAGGG GAGAATGGCCTTCATGCACTCCCTCCTCACCTCCAGCGCCTTGTGTTTTC CTTGCTTAGTGATTTCCCCTCTCCCCACCCCACCCCCCACAGTGTGTGAG tccggatccggagagggcaggggatctctccttacttgtggcgacgtgga ggagaaccccggccccATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGG TGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTC AGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCTGACCCT GAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCG TGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCAC ATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCA GGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCG AGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGC ATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAA CTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCA TCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAG CTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCT GCTGCCCGACAACCACTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACC CCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCC GGGATCACTCTCGGCATGGACGAGCTGTACAAatgattgtttattgcagc ttataatggttacaaataaagcaatagcatcacaaatttcacaaataaag catttttttcactgcattctagttgtggtttgtccaaactcatcaatgta tcttatAACTGCATGGAGATGGATGTGATGTCGGTGGCCACAATTGTCAT AGTGGACATCTGCATCACTGGGGGCTTGCTGCTGCTGGTTTACTACTGGA GCAAGAATAGAAAGGCCAAGGCCAAGCCTGTGACACGAGGAGCGGGTGCT GGCGGCAGGCAAAGGGGTAAGGCTGTGGAGTCCAGTCAGAGGAGATTCCT GCCAAGGGGGACGACCAGCCTGGGCCAGGGTGGGTGGCAAGTCCACAGCT AGGTCAGAACAGCTTCTCTAGAGCTTCTATGCACAGCTTCTATTACTGTG ATGACAAGATCTCAACAGACGGTTTCAAATCTCACATCACTCCCCTCCTT CCCATCCTAGAAAAGTGCAAAAAAGTTTATGAAAGTGATGGGCTTCCTCA CATACCTGTCAATGCCTGCAGTCATCCGATTCCGCCCCTAAGCTGTGGGA AGAGAG (SEQ ID NO: 137) pARBI-918: ACTGTGACCCATGAACCAGTAGGTATTGGCGTCACCTGGGATCCTGACGT 1806 CD3G_1_ TAGTCTCTGTCAATCCTTCTCTTTAGTTCATCTATTCTACCCAAAGTGAT Endogenous_19_20 CTCATCATCTGGTATGCTGTTAGCAGTTTCTTACCTGTATAGTATCTTCC AAATAACATGCCCCAAAATCCCAAAGTTTTACCCCTACTAATTACAGCAA TGTCTCTTTTATTCTTCACCCCCTGACGCAGATATTGGCGTCACCCGAGA GCATGTTAGTAATGCAGAATCTCCCCTCCCCAGAACTACTAAATAGCACC TGAAATTTTAACAAGATCCCCATGTGATTCATGTGCACATCAAAGTTTGA GAAACACTACTCTAATGATCTCCTGGTATGCAGAAGCAGGGAGAATTTCA GAGGCAAGATCCTTAATAGAACCACGGCTTTTCTCATTTCAGGAAACCAC tccggatccggagagggcaggggatctctccttacttgtggcgacgtgga ggagaaccccggccccATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGG TGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTC AGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCTGACCCT GAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCG TGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCAC ATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCA GGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCG AGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGC ATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAA CTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCA TCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAG CTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCT GCTGCCCGACAACCACTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACC CCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCC GGGATCACTCTCGGCATGGACGAGCTGTACAAatgattgtttattgcagc ttataatggttacaaataaagcaatagcatcacaaatttcacaaataaag catttttttcactgcattctagttgtggtttgtccaaactcatcaatgta tcttatTTGGTTAAGGTGTATGACTATCAAGAAGATGGTTCGGTACTTCT GACTTGTGATGCAGAAGCCAAAAATATCACATGGTTTAAAGATGGGAAGA TGATCGGCTTCCTAACTGAAGATaaaaaaaaaTGGAATCTGGGAAGTAAT GCCAAGGACCCTCGAGGGATGTATCAGTGTAAAGGATCACAGAACAAGTC AAAACCACTCCAAGTGTATTACAGAAGTATGTAATCCCCTTTGGTCTGTT TGTTGTGAAATTAATCAGTATTTGCTGTTCTGGTGAGCTTTTTATCTGGG GTGAAAGTGGAAATAGATCCTCAACAGTAATATTATCGCCTGTTCTCTTA ATTTCAGCTTGCCTCTTTTAAAATACTGTAAGATACTTCCCTCACCCTAT TGAAAAACTACAGCCAGTCCTGTAAAATTTTGTTTACCTTTGGGTGGGCT CCATGG (SEQ ID NO: 138) pARBI-919: TTCGAGACAAGCCTGGCCAACATGATGAAACTCTGTCTCTACTAAAAATA 1806 CD3G_3_ CAAAAAAAATTAACCAGGCGTGGAGGCGCGCGCCTGTAGTCCCAGCTACT Endogenous_21 CAGGAGGCTGAGGCAGGAGGACCACTTGAACCCAGGAGGTCGAGGTTGCA GTGAGCTGCGATTGTGCCACTGCACTCCAGCCTGGGCAACAGAGAAAGAC TCCGTCTCAAAAAAAAAAAAGAGAGAGAGAGAAAAAGAAAAAAGACAGAG CCTCCATCTCCTTGTCCTCTTTCCATCCTCAGGACCATGAAGTACCCACT CCAAATTCTCACATATAAAAAACATTCAATAAACATGCATCAAATTAATT AATAGAGGATGGAAAAAATGACTTATGACTGTGCTGTCCTTTCCAGCCCC TCAAGGATCGAGAAGATGACCAGTACAGCCACCTTCAAGGAAACCAGTTG tccggatccggagagggcaggggatctctccttacttgtggcgacgtgga ggagaaccccggccccATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGG TGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTC AGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCTGACCCT GAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCG TGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCAC ATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCA GGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCG AGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGC ATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAA CTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCA TCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAG CTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCT GCTGCCCGACAACCACTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACC CCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCC GGGATCACTCTCGGCATGGACGAGCTGTACAAatgattgtttattgcagc ttataatggttacaaataaagcaatagcatcacaaatttcacaaataaag catttttttcactgcattctagttgtggtttgtccaaactcatcaatgta tcttatAGGAGGAATTGAACTCAGGACTCAGAGTAGGTGGGTTCTTCAAT GCCAATTCTAATAAAGGACCCTTGCATCAACTGCCCTCGCAATTGCTTCT AAGTCTAGCTCCCTTCCCTAAGCGGCTATAAGCATCAGACTCTGGGGATC AGGGATTGGGACGTGGTTTGGGGTACTCTTTTCTAAAAATTCTGGGGCCA TACTGATTGTCTTGGCCTAGGTAAATATGAATTTTATGTATCTGTAAATC CTGTCAGAGCAGGGCCTCAAGCCATAGAGATGCTGAATATTAATCTTAAC CTACATTTGAATTTCTCATTATCTACACTATTAACATTTTGGGCTAATTA ATTATTTGTGATGAGGGGCTAGCCTGTGCATTGTAGGAGTTATGGAAGCA TCCCTGGCCTCTCTCCACCAGATGCTGGTAGATTGTCCAGTGTGACAATC AAAAAT (SEQ ID NO: 139) pARBI-920: GACCTCAGGTGATCTGCCCGCCTCAGCTTCCCAAAGTGCTGGGATTACAG 1806 CD5_1_ GCATGAGCCACCACACCTGGCCTAAAATTAATTTTTAAAGATCTCTTAAA Endogenous_22 AAGCAGACACCAGCCCCAATCTCAGACCCCTTGAGACAGAATTTCCAGGA CAGGGGCCATCCTGCTGGACAGTGGGTGCCGAGAACACCTTGCCCATTTA TCTGAGCTCCCTTCTGACTCTGAAATCTGGAGCCCCACCCTCCTGGGTCT AGCTTCGGGGCTGCCTGGGTCAGGGTCCTCTGGGAAGCCCCTGCAGTGCC CCAGAAGGGACGAAGCTCACAAGGGGCAAGGCAGGCAGCCCACGGGGCAG GAGGGAGCTCAACTGGGCGTCCTAGGGAGAGGGCAGTGAGGGGTGCCAGT GGGGAACCCCTCCCAGCCTGACCCCCACCACACCTTTCTGACCCCCAGAT tccggatccggagagggcaggggatctctccttacttgtggcgacgtgga ggagaaccccggccccATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGG TGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTC AGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCTGACCCT GAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCG TGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCAC ATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCA GGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCG AGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGC ATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAA CTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCA TCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAG CTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCT GCTGCCCGACAACCACTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACC CCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCC GGGATCACTCTCGGCATGGACGAGCTGTACAAatgattgtttattgcagc ttataatggttacaaataaagcaatagcatcacaaatttcacaaataaag catttttttcactgcattctagttgtggtttgtccaaactcatcaatgta tcttatTTCCAGGCAAGGCTCACCCGTTCCAACTCGAAGTGCCAGGGCCA GCTGGAGGTCTACCTCAAGGACGGATGGCACATGGTTTGCAGCCAGAGCT GGGGCCGGAGCTCCAAGCAGTGGGAGGACCCCAGTCAAGCGTCAAAAGTC TGCCAGCGGCTGAACTGTGGGGTGCCCTTAAGCCTTGGCCCCTTCCTTGT CACCTACACACCTCAGAGCTCAATCATCTGCTACGGACAACTGGGCTCCT TCTCCAACTGCAGCCACAGCAGAAATGACATGTGTCACTCTCTGGGCCTG ACCTGCTTAGGTGGGTAACTAGCCAGCCACACGGGCACCCTGGGCCTGGG CGCCAGCCCCGAGGAGACTGCCCGAGGCCTGTGATCTAGGGTCTGAGCAG GCTGGTGGAAGGGGTGGGGGGACCCCAGTTTATAACCACTCCCCAAGACA CATACC (SEQ ID NO: 140) pARBI-921: GGACAGGGGCCATCCTGCTGGACAGTGGGTGCCGAGAACACCTTGCCCAT 1806 CD5_2_ TTATCTGAGCTCCCTTCTGACTCTGAAATCTGGAGCCCCACCCTCCTGGG Endogenous_23 TCTAGCTTCGGGGCTGCCTGGGTCAGGGTCCTCTGGGAAGCCCCTGCAGT GCCCCAGAAGGGACGAAGCTCACAAGGGGCAAGGCAGGCAGCCCACGGGG CAGGAGGGAGCTCAACTGGGCGTCCTAGGGAGAGGGCAGTGAGGGGTGCC AGTGGGGAACCCCTCCCAGCCTGACCCCCACCACACCTTTCTGACCCCCA GATTTCCAGGCAAGGCTCACCCGTTCCAACTCGAAGTGCCAGGGCCAGCT GGAGGTCTACCTCAAGGACGGATGGCACATGGTTTGCAGCCAGAGCTGGG GCCGGAGCTCCAAGCAGTGGGAGGACCCCAGTCAAGCGTCAAAAGTCTGC tccggatccggagagggcaggggatctctccttacttgtggcgacgtgga ggagaaccccggccccATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGG TGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTC AGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCTGACCCT GAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCG TGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCAC ATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCA GGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCG AGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGC ATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAA CTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCA TCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAG CTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCT GCTGCCCGACAACCACTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACC CCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCC GGGATCACTCTCGGCATGGACGAGCTGTACAAatgattgtttattgcagc ttataatggttacaaataaagcaatagcatcacaaatttcacaaataaag catttttttcactgcattctagttgtggtttgtccaaactcatcaatgta tcttatCAGCGGCTGAACTGTGGGGTGCCCTTAAGCCTTGGCCCCTTCCT TGTCACCTACACACCTCAGAGCTCAATCATCTGCTACGGACAACTGGGCT CCTTCTCCAACTGCAGCCACAGCAGAAATGACATGTGTCACTCTCTGGGC CTGACCTGCTTAGGTGGGTAACTAGCCAGCCACACGGGCACCCTGGGCCT GGGCGCCAGCCCCGAGGAGACTGCCCGAGGCCTGTGATCTAGGGTCTGAG CAGGCTGGTGGAAGGGGTGGGGGGACCCCAGTTTATAACCACTCCCCAAG ACACATACCCAGGAGGGGGACTGGAAGGGGCCAGCACCCATCTGTAGGAT GGCAATGGAGGACCTAGTTCTGCCAATCACTGACTTCATCGTCGCCTCTG AACCTCCATTCTCCCATCTGTGAAGTGGGGTGGTACTTCCCGCCTCGCAG GAGGCT (SEQ ID NO: 141) pARBI-922: tgctgggattacaggcatgagccaccacacctggcctaaaattaattttt 1806 CD5_3_ aaagaTCTCTTAAAAAGCAGACACCAGCCCCAATCTCAGACCCCTTGAGA Endogenous_24 CAGAATTTCCAGGACAGGGGCCATCCTGCTGGACAGTGGGTGCCGAGAAC ACCTTGCCCATTTATCTGAGCTCCCTTCTGACTCTGAAATCTGGAGCCCC ACCCTCCTGGGTCTAGCTTCGGGGCTGCCTGGGTCAGGGTCCTCTGGGAA GCCCCTGCAGTGCCCCAGAAGGGACGAAGCTCACAAGGGGCAAGGCAGGC AGCCCACGGGGCAGGAGGGAGCTCAACTGGGCGTCCTAGGGAGAGGGCAG TGAGGGGTGCCAGTGGGGAACCCCTCCCAGCCTGACCCCCACCACACCTT TCTGACCCCCAGATTTCCAGGCAAGGCTCACCCGTTCCAACTCGAAGTGC tccggatccggagagggcaggggatctctccttacttgtggcgacgtgga ggagaaccccggccccATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGG TGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTC AGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCTGACCCT GAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCG TGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCAC ATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCA GGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCG AGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGC ATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAA CTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCA TCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAG CTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCT GCTGCCCGACAACCACTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACC CCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCC GGGATCACTCTCGGCATGGACGAGCTGTACAAatgattgtttattgcagc ttataatggttacaaataaagcaatagcatcacaaatttcacaaataaag catttttttcactgcattctagttgtggtttgtccaaactcatcaatgta tcttatCAGGGCCAGCTGGAGGTCTACCTCAAGGACGGATGGCACATGGT TTGCAGCCAGAGCTGGGGCCGGAGCTCCAAGCAGTGGGAGGACCCCAGTC AAGCGTCAAAAGTCTGCCAGCGGCTGAACTGTGGGGTGCCCTTAAGCCTT GGCCCCTTCCTTGTCACCTACACACCTCAGAGCTCAATCATCTGCTACGG ACAACTGGGCTCCTTCTCCAACTGCAGCCACAGCAGAAATGACATGTGTC ACTCTCTGGGCCTGACCTGCTTAGGTGGGTAACTAGCCAGCCACACGGGC ACCCTGGGCCTGGGCGCCAGCCCCGAGGAGACTGCCCGAGGCCTGTGATC TAGGGTCTGAGCAGGCTGGTGGAAGGGGTGGGGGGACCCCAGTTTATAAC CACTCCCCAAGACACATACCCAGGAGGGGGACTGGAAGGGGCCAGCACCC ATCTGT (SEQ ID NO: 142) pARBI-923: GGGGGCGGCCGCACGGCTAGAGCGGAGACCCCGCGCCCCCTCCGCCCGCG 1806 EDF1_1_ TGGCCCGGCCGGGGTGCGGGGCCCGGGGAGGCACGGGGGCTGCGCGTCGG Endogenous_25 GGCGCAGCCGCCGCCCGCGTGTGCTCGGAGGCCGCGGGGCCCGGGCTCCG GGGTCCTCCCGACCTGCAGCCCCAGCGGCTACCGCGCCTCGCCAGGCCAG GCCAGGCCCCGACGTCGCCTTCCCTACGTCGCCGGCGCCCGGCCACGACG TCCCTCAGACGAGCCGAACGCCGAATGGCCCCGAGCACGGGAAGTGCCCG CCCCCCGCGTGCAGCCAGCCAATGGGACGCCGAAAGCGGGGAGGTGCCGA GGGGACGTAGCGTCGCCGCGCCAGGTCTCTAGCAGCTGCCGCTGAGCCGC CGGACGGACGCTCGTCTTCGCCCGCCATGGCCGAGAGCGACTGGGACACG tccggatccggagagggcaggggatctctccttacttgtggcgacgtgga ggagaaccccggccccATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGG TGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTC AGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCTGACCCT GAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCG TGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCAC ATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCA GGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCG AGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGC ATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAA CTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCA TCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAG CTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCT GCTGCCCGACAACCACTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACC CCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCC GGGATCACTCTCGGCATGGACGAGCTGTACAAatgattgtttattgcagc ttataatggttacaaataaagcaatagcatcacaaatttcacaaataaag catttttttcactgcattctagttgtggtttgtccaaactcatcaatgta tcttatGTGACGGTGCTGCGCAAGAAGGGCCCTACGGCCGCCCAGGCCAA ATCCAAGCAGGTGCTTTGCTGACTGAGGAGCGGCCGGGCCGGGCGGGGCG GACGCTgggccggggccacggaccgtggggcagggggctcggggaagggc aggcggggcctgggacagggcgcaggggacggggacaaggctggGGTGCC GGGGGCAGGACGGGGTTCGAAGGGCGGGGCGAATCGCAGGGGTAGGAGGA CCGGGCCAGGACaggggtgggaaggtggggggcggacccaggaggggatg gacggacccaggaggcagggggCGGGCGACTGGAGGAGGAGAGGAGCAGG GAACCGGACGGGGCAAGGGGCAGGCGATAGGGCCGGGACCAGTGCCAGGG TGGGGCGGGGAGGGGATAGGGGCAGGGACTGTTGGTGGGGACAGGTGTGA GGACAG (SEQ ID NO: 143) pARBI-924: ACGGCTAGAGCGGAGACCCCGCGCCCCCTCCGCCCGCGTGGCCCGGCCGG 1806 EDF1_2_ GGTGCGGGGCCCGGGGAGGCACGGGGGCTGCGCGTCGGGGCGCAGCCGCC Endogenous_26 GCCCGCGTGTGCTCGGAGGCCGCGGGGCCCGGGCTCCGGGGTCCTCCCGA CCTGCAGCCCCAGCGGCTACCGCGCCTCGCCAGGCCAGGCCAGGCCCCGA CGTCGCCTTCCCTACGTCGCCGGCGCCCGGCCACGACGTCCCTCAGACGA GCCGAACGCCGAATGGCCCCGAGCACGGGAAGTGCCCGCCCCCCGCGTGC AGCCAGCCAATGGGACGCCGAAAGCGGGGAGGTGCCGAGGGGACGTAGCG TCGCCGCGCCAGGTCTCTAGCAGCTGCCGCTGAGCCGCCGGACGGACGCT CGTCTTCGCCCGCCATGGCCGAGAGCGACTGGGACACGGTGACGGTGCTG tccggatccggagagggcaggggatctctccttacttgtggcgacgtgga ggagaaccccggccccATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGG TGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTC AGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCTGACCCT GAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCG TGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCAC ATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCA GGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCG AGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGC ATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAA CTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCA TCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAG CTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCT GCTGCCCGACAACCACTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACC CCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCC GGGATCACTCTCGGCATGGACGAGCTGTACAAatgattgtttattgcagc ttataatggttacaaataaagcaatagcatcacaaatttcacaaataaag catttttttcactgcattctagttgtggtttgtccaaactcatcaatgta tcttatCGCAAGAAGGGCCCTACGGCCGCCCAGGCCAAATCCAAGCAGGT GCTTTGCTGACTGAGGAGCGGCCGGGCCGGGGGGGGCGGACGCTgggccg gggccacggaccgtggggcagggggctcggggaagggcaggcggggcctg ggacagggcgcaggggacggggacaaggctggGGTGCCGGGGGCAGGACG GGGTTCGAAGGGCGGGGCGAATCGCAGGGGTAGGAGGACCGGGCCAGGAC aggggtgggaaggtggggggcggacccaggaggggatggacggacccagg aggcagggggCGGGCGACTGGAGGAGGAGAGGAGCAGGGAACCGGACGGG GCAAGGGGCAGGCGATAGGGCCGGGACCAGTGCCAGGGTGGGGCGGGGAG GGGATAGGGGCAGGGACTGTTGGTGGGGACAGGTGTGAGGACAGAGGCAG GAGGTG (SEQ ID NO: 144) pARBI-925: CCCTGGAGTCGGTGTGAGGCAAAAGCAGAGGAGAGGATTTGGAATTAACC 1806 EDF1_3_ TGAAGAAAAACCATTTCAAAGCGGTAACCAGACCCCAGAGGCTGCCTGAA Endogenous_27 CTCAAGGGGACATGGGAACCCAGGCTGTCCCAAGTTCACACACCTCAGGT CGTGGACTTCCAGAGTTTTCTCTCAGTTATGAGGACAGGCAGCTGTACTC ATGCCAACCAGAGCTGCTCTGGGAAGATGGCTGCCTCCCAGGGCTCCGGC CAGCACCGGTGCAGGCAGGCACTCAGCTGACAACGTCCCCGGGGGCGCCG CACACACCACACCCACCAGCACATGGACCCCACAGCACAGCCTCATGTTG CAAGCGGAAACACAAGTACCTACATTTCTTGGAAGTCTCCACATCTTCTC CTCGTCTCTGTGCCGCTAAGATAGCCTAGAAAATTAGAAAACATCAGTGG tccggatccggagagggcaggggatctctccttacttgtggcgacgtgga ggagaaccccggccccATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGG TGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTC AGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCTGACCCT GAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCG TGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCAC ATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCA GGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCG AGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGC ATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAA CTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCA TCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAG CTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCT GCTGCCCGACAACCACTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACC CCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCC GGGATCACTCTCGGCATGGACGAGCTGTACAAatgattgtttattgcagc ttataatggttacaaataaagcaatagcatcacaaatttcacaaataaag catttttttcactgcattctagttgtggtttgtccaaactcatcaatgta tcttatTTTCTAAAGAGAAGATGTGCTTTATACACATCAGCTCCACTAGG ACTGCTGCAAAACCAGGCGCGAAGCGTCGCCTGAGAACAGCAGCTTCTAG GGCCCCTGGGGTACGCACCCACACGCAGCTGGGTTTTGTGCGAGAGGCAG TGAGAGCCGGGCTGACCTGGCTTCCCGAGGCATTCCCTGCAGGGGAACCA GGGGGGTGGAGCCCACCCTCCCCGTGCTATCTGAACACCGGCCATCCCCC TCCCCAAACCCACAACCCCAGGAGTGAGAGCCCCGGCGCAGCCTCACCGT GGCCAGGTCCTTCTGCGTAAGCCCCTTGCTCTGCCGACCTTGCTGGATCA CCTTGCCCACCTCCAGGGTCACCCTGTCATGGTGCAGCTCCTCTGTCTCC CGGTCCAGCTTGGCCGTGTTCTTGGTAATAGAATGTTGTTTGTTCTGGCC AGCAGC (SEQ ID NO: 145) pARBI-926: CTCTTGCTTCAACAGTGTTTGGACGGAACAGATCCGGGGACTCTCTTCCA 1806 FTL_1_ GCCTCCGACCGCCCTCCGATTTCCTCTCCGCTTGCAACCTCCGGGACCAT Endogenous_2830 CTTCTCGGCCATCTCCTGCTTCTGGGACCTGCCAGCACCGTTTTTGTGGT TAGCTCCTTCTTGCCAACCAACCATGAGCTCCCAGATTCGTCAGAATTAT TCCACCGACGTGGAGGCAGCCGTCAACAGCCTGGTCAATTTGTACCTGCA GGCCTCCTACACCTACCTCTCTCTGGTGAGTCCCCAGGACGCCCCTGGCC CTAATTTCCTCCAGCTGCGCACCTCCGGCCCTCACTGCACGCGCCAGCCT TCTTTGTGCGGTCGGGTAAACAGAGGGCGGAGTCCCCTTGGCCTCGCCTC CCGCTAACCATTGTTGCCTCCATCTCTTCCCGTAGGGCTTCTATTTCGAC tccggatccggagagggcaggggatctctccttacttgtggcgacgtgga ggagaaccccggccccATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGG TGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTC AGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCTGACCCT GAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCG TGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCAC ATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCA GGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCG AGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGC ATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAA CTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCA TCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAG CTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCT GCTGCCCGACAACCACTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACC CCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCC GGGATCACTCTCGGCATGGACGAGCTGTACAAatgattgtttattgcagc ttataatggttacaaataaagcaatagcatcacaaatttcacaaataaag catttttttcactgcattctagttgtggtttgtccaaactcatcaatgta tcttatCGCGATGATGTGGCTCTGGAAGGCGTGAGCCACTTCTTCCGCGA ATTGGCCGAGGAGAAGCGCGAGGGCTACGAGCGTCTCCTGAAGATGCAAA ACCAGCGTGGCGGCCGCGCTCTCTTCCAGGACATCAAGGTAACTAGTGTG TGGGTAATGGACTACATCTCCCAGCAGGCCGTGCGCGCGAGGAGCCTTGA TTTGAGGGCGTAGGTGTCGCGTGGGCTTCTGGGAGATTGAGTTCGGTCTT GTGAGCCCTCTTAACCGCTGGAAATAGAGGCGCACCTCGTGCAGTGCCCA CAACACGCGGCAGTCCACACCGCTGCGTGGTCTTAGGGACGTATAGCTGT AAGAGCTAGGACAGGGTGCGGAGAGTGATAAATACAAGCTGTCACATGTC TTTGTGGCCTGGGCCTCTGACCCCCAACGACTCTTGGGAAATGTAGGTTT AGTTCT (SEQ ID NO: 146) pARBI-927: TGGGGCGGGGGGCTGAGACTCCTATGTGCTCCGGATTGGTCAGGCACGGC 1806 FTL_2_ CTTCGGCCCCGCCTCCTGCCACCGCAGATTGGCCGCTAGCCCTCCCCGAG Endogenous_29 CGCCCTGCCTCCGAGGGCCGGCGCACCATAAAAGAAGCCGCCCTAGCCAC GTCCCCTCGCAGTTCGGCGGTCCCGCGGGTCTGTCTCTTGCTTCAACAGT GTTTGGACGGAACAGATCCGGGGACTCTCTTCCAGCCTCCGACCGCCCTC CGATTTCCTCTCCGCTTGCAACCTCCGGGACCATCTTCTCGGCCATCTCC TGCTTCTGGGACCTGCCAGCACCGTTTTTGTGGTTAGCTCCTTCTTGCCA ACCAACCATGAGCTCCCAGATTCGTCAGAATTATTCCACCGACGTGGAGG CAGCCGTCAACAGCCTGGTCAATTTGTACCTGCAGGCCTCCTACACCTAC tccggatccggagagggcaggggatctctccttacttgtggcgacgtgga ggagaaccccggccccATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGG TGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTC AGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCTGACCCT GAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCG TGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCAC ATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCA GGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCG AGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGC ATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAA CTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCA TCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAG CTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCT GCTGCCCGACAACCACTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACC CCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCC GGGATCACTCTCGGCATGGACGAGCTGTACAAatgattgtttattgcagc ttataatggttacaaataaagcaatagcatcacaaatttcacaaataaag catttttttcactgcattctagttgtggtttgtccaaactcatcaatgta tcttatCTCTCTCTGGTGAGTCCCCAGGACGCCCCTGGCCCTAATTTCCT CCAGCTGCGCACCTCCGGCCCTCACTGCACGCGCCAGCCTTCTTTGTGCG GTCGGGTAAACAGAGGGCGGAGTCCCCTTGGCCTCGCCTCCCGCTAACCA TTGTTGCCTCCATCTCTTCCCGTAGGGCTTCTATTTCGACCGCGATGATG TGGCTCTGGAAGGCGTGAGCCACTTCTTCCGCGAATTGGCCGAGGAGAAG CGCGAGGGCTACGAGCGTCTCCTGAAGATGCAAAACCAGCGTGGCGGCCG CGCTCTCTTCCAGGACATCAAGGTAACTAGTGTGTGGGTAATGGACTACA TCTCCCAGCAGGCCGTGCGCGCGAGGAGCCTTGATTTGAGGGCGTAGGTG TCGCGTGGGCTTCTGGGAGATTGAGTTCGGTCTTGTGAGCCCTCTTAACC GCTGGA (SEQ ID NO: 147) pARBI-928: CTTTGCATAGTTTATCCTATTAGTAATCTATTCTGTCTTTGGAATATGTT 1806 PTEN_1_ TTGTGATGATGAAATAAATACTATAAATAGTATTATTCCTTTTGCATTGA Endogenous_31 GAGTCCTGACGAAATGTCCATGTGACAGTTCATTTTGGGTTTAGCTCTAC CTCTAATATGTGACCTATGCTACCAGTCCGTATAGCGTAAATTCCCAGAA TATATCCTCCTGAATAAAATGGGGGAAAATAATACCTGGCTTCCTTAATG ATTATATTTAAGACTTATCAAGAGACTATTTTCTATTTAACAATTAGAAA GTTAAGCAATACATTATTTTTCTCTGGAATCCAGTGTTTCTTTTAAATAC CTGTTAAGTTTGTATGCAACATTTCTAAAGTTACCTACTTGTTAATTAAA AATTCAAGAGTTTTTTTTTCTTATTCTGAGGTTATCTTTTTACCACAGTT tccggatccggagagggcaggggatctctccttacttgtggcgacgtgga ggagaaccccggccccATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGG TGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTC AGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCTGACCCT GAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCG TGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCAC ATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCA GGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCG AGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGC ATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAA CTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCA TCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAG CTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCT GCTGCCCGACAACCACTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACC CCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCC GGGATCACTCTCGGCATGGACGAGCTGTACAAatgattgtttattgcagc ttataatggttacaaataaagcaatagcatcacaaatttcacaaataaag catttttttcactgcattctagttgtggtttgtccaaactcatcaatgta tcttatGCACAATATCCTTTTGAAGACCATAACCCACCACAGCTAGAACT TATCAAACCCTTTTGTGAAGATCTTGACCAATGGCTAAGTGAAGATGACA ATCATGTTGCAGCAATTCACTGTAAAGCTGGAAAGGGACGAACTGGTGTA ATGATATGTGCATATTTATTACATCGGGGCAAATTTTTAAAGGCACAAGA GGCCCTAGATTTCTATGGGGAAGTAAGGACCAGAGACAAAAAGGTAAGTT ATTTTTTGATGTTTTTCCTTTCCTCTTCCTGGATCTGAGAATTTATTGGA AAACAGATTTTGGGTTTCTTTTTTTCCTTCAGTTTTATTGAGGTGTAATT GACAAGTAAAAATTATATATAAATACAATGTATAATATGATGTTTTGATG TATGTGTATATACATTGTGAAATGATTACTACAGTCAAACTACTTAACAT ATTCAT (SEQ ID NO: 148) pARBI-929: GCTACCAGTCCGTATAGCGTAAATTCCCAGAATATATCCTCCTGAATAAA 1806 PTEN_2_ ATGGGGGAAAATAATACCTGGCTTCCTTAATGATTATATTTAAGACTTAT Endogenous_32 CAAGAGACTATTTTCTATTTAACAATTAGAAAGTTAAGCAATACATTATT TTTCTCTGGAATCCAGTGTTTCTTTTAAATACCTGTTAAGTTTGTATGCA ACATTTCTAAAGTTACCTACTTGTTAATTAAAAATTCAAGAGTTTTTTTT TCTTATTCTGAGGTTATCTTTTTACCACAGTTGCACAATATCCTTTTGAA GACCATAACCCACCACAGCTAGAACTTATCAAACCCTTTTGTGAAGATCT TGACCAATGGCTAAGTGAAGATGACAATCATGTTGCAGCAATTCACTGTA AAGCTGGAAAGGGACGAACTGGTGTAATGATATGTGCATATTTATTACAT tccggatccggagagggcaggggatctctccttacttgtggcgacgtgga ggagaaccccggccccATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGG TGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTC AGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCTGACCCT GAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCG TGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCAC ATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCA GGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCG AGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGC ATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAA CTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCA TCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAG CTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCT GCTGCCCGACAACCACTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACC CCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCC GGGATCACTCTCGGCATGGACGAGCTGTACAAatgattgtttattgcagc ttataatggttacaaataaagcaatagcatcacaaatttcacaaataaag catttttttcactgcattctagttgtggtttgtccaaactcatcaatgta tcttatCGGGGCAAATTTTTAAAGGCACAAGAGGCCCTAGATTTCTATGG GGAAGTAAGGACCAGAGACAAAAAGGTAAGTTATTTTTTGATGTTTTTCC TTTCCTCTTCCTGGATCTGAGAATTTATTGGAAAACAGATTTTGGGTTTC TTTTTTTCCTTCAGTTTTATTGAGGTGTAATTGACAAGTAAAAATTATAT ATAAATACAATGTATAATATGATGTTTTGATGTATGTGTATATACATTGT GAAATGATTACTACAGTCAAACTACTTAACATATTCATCACCTCACATAA TTATTATTCTCCCCCCAGGGTGAAAGCATTTAAGATCTACAAGCTACAAT TTTCAATTATACAATGTTATTATTAACTATAGTCACTATGCTGTCCAGTA GAGCTTCAGATCTTGTTCATCTTGTGTTCCTCCCTCCCCACCCTCAGTCC CTGGAA (SEQ ID NO: 149) pARBI-930: ATAAATAGTATTATTCCTTTTGCATTGAGAGTCCTGACGAAATGTCCATG 1806 PTEN_3_ TGACAGTTCATTTTGGGTTTAGCTCTACCTCTAATATGTGACCTATGCTA Endogenous_33 CCAGTCCGTATAGCGTAAATTCCCAGAATATATCCTCCTGAATAAAATGG GGGAAAATAATACCTGGCTTCCTTAATGATTATATTTAAGACTTATCAAG AGACTATTTTCTATTTAACAATTAGAAAGTTAAGCAATACATTATTTTTC TCTGGAATCCAGTGTTTCTTTTAAATACCTGTTAAGTTTGTATGCAACAT TTCTAAAGTTACCTACTTGTTAATTAAAAATTCAAGAGTTTTTTTTTCTT ATTCTGAGGTTATCTTTTTACCACAGTTGCACAATATCCTTTTGAAGACC ATAACCCACCACAGCTAGAACTTATCAAACCCTTTTGTGAAGATCTTGAC tccggatccggagagggcaggggatctctccttacttgtggcgacgtgga ggagaaccccggccccATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGG TGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTC AGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCTGACCCT GAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCG TGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCAC ATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCA GGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCG AGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGC ATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAA CTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCA TCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAG CTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCT GCTGCCCGACAACCACTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACC CCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCC GGGATCACTCTCGGCATGGACGAGCTGTACAAatgattgtttattgcagc ttataatggttacaaataaagcaatagcatcacaaatttcacaaataaag catttttttcactgcattctagttgtggtttgtccaaactcatcaatgta tcttatCAATGGCTAAGTGAAGATGACAATCATGTTGCAGCAATTCACTG TAAAGCTGGAAAGGGACGAACTGGTGTAATGATATGTGCATATTTATTAC ATCGGGGCAAATTTTTAAAGGCACAAGAGGCCCTAGATTTCTATGGGGAA GTAAGGACCAGAGACAAAAAGGTAAGTTATTTTTTGATGTTTTTCCTTTC CTCTTCCTGGATCTGAGAATTTATTGGAAAACAGATTTTGGGTTTCTTTT TTTCCTTCAGTTTTATTGAGGTGTAATTGACAAGTAAAAATTATATATAA ATACAATGTATAATATGATGTTTTGATGTATGTGTATATACATTGTGAAA TGATTACTACAGTCAAACTACTTAACATATTCATCACCTCACATAATTAT TATTCTCCCCCCAGGGTGAAAGCATTTAAGATCTACAAGCTACAATTTTC AATTAT (SEQ ID NO: 150) pARBI-931: TAGTGACGGTATCAAGAGCAGCATCCCAGGTTGACTTTCTTTGGGGCAGG 1806 PTPN2_1_ ACCTTGCTTGTGTCTTAACCCTTGGTTCCTACccaagtttgtctctctag Endogenous_34 tcctcaactctgaaagccacttgatatcttacacaattccttttttgcct gagaaaataaaagtcagttttgatcatttacaaccaaaaatcctaactaa cacaGACCTGTTTTTGAAATCAGGTAGATTGGAAGTCTTGGTTTACTTCT TATAAGCCCGTCCGCTGTCTGTCTTGTAACATCCCAGGAGCAGACTTAAC AAGGCCTTCCTGGAGCCTTGCTCTTTCTGTCTGCTTCCCTCATCTGctct ctctcctctctctTCACAGGGTCCACTTCCTAACACATGCTGCCATTTCT GGCTTATGGTTTGGCAGCAGAAGACCAAAGCAGTTGTCATGCTGAACCGC tccggatccggagagggcaggggatctctccttacttgtggcgacgtgga ggagaaccccggccccATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGG TGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTC AGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCTGACCCT GAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCG TGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCAC ATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCA GGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCG AGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGC ATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAA CTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCA TCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAG CTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCT GCTGCCCGACAACCACTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACC CCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCC GGGATCACTCTCGGCATGGACGAGCTGTACAAatgattgtttattgcagc ttataatggttacaaataaagcaatagcatcacaaatttcacaaataaag catttttttcactgcattctagttgtggtttgtccaaactcatcaatgta tcttatATTGTGGAGAAAGAATCGGTGAGTAATATTTAATATTTACAACT TAGTATACTGTATGTGATCATTAGCATATAAAGATTTTCATTTTGGGGCC AATATTCATTCCTTAGTTTTGGCCTTTAATTGCTAAAGGCTAGCGGTCAA AGTGTTTCTGTCCGGAAAAACTCATGTATCCTTTTCCTTTCTTAAGTATT TTTATAACAATTGCAGACCTTTCATCTCCAAGATAGAAATACAATGGAAT AAATATGGACTAGCAGTGACATATAGATCTTTGGAATCCCTAGGAATCCT TATTGACCTATTGTCAATAAGTCCTCTTCCAGCTTTAAGATCTTGTACTG TTCTACTTCAGATTTCTTACTTTATTCTACATTTCATAATTGCTGGTTTT TAATGATGATGAAAATTTCCTAATCCACTTCTAATTAGATGTGGCAATGA CATGCC (SEQ ID NO: 151) pARBI-932: CAGAATTTGAATATGATGTGGCTGACCATAGATACCTCCATTGCATTAAT 1806 PTPN2_2_ TCTGCTTCATAAAGTGTTTGATGCTTGCTTGTAGGTGTTGAATAGAGCCT Endogenous_35 AATTAGCAGTAAGTAAACAGATCTagagcacagactttggcatctgaagg accttggttcaatgctgggactgccacttactagctatgggatctttggt ggatttttaacctttgaaagcctttcatccctcacagatataatggggaa gaaaatcgtttccattagtattgtgaggattaaatgattaattgatgtaa acaatgtgctttccatagtacttggcatatcataagtgcttcatGTTATT TTACGCTGGCTGGGAAGATAAGTTTTGCTGTGGAGAATTTAAGAGGGATA TTAAAATATATTTTTGTTATTTTAAGGAAATTCGAAATGAGTCCCATGAC tccggatccggagagggcaggggatctctccttacttgtggcgacgtgga ggagaaccccggccccATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGG TGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTC AGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCTGACCCT GAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCG TGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCAC ATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCA GGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCG AGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGC ATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAA CTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCA TCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAG CTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCT GCTGCCCGACAACCACTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACC CCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCC GGGATCACTCTCGGCATGGACGAGCTGTACAAatgattgtttattgcagc ttataatggttacaaataaagcaatagcatcacaaatttcacaaataaag catttttttcactgcattctagttgtggtttgtccaaactcatcaatgta tcttatTATCCTCATAGAGTGGCCAAGTTTCCAGAAAACAGAAATCGAAA CAGATACAGAGATGTAAGCCCATGTAAGTACTTGTGGGTTTGTGTGCATG TGTATTTTTTGGTTTGTTTTAGGAGGCAGGATGTAGTGAAAAGGAGGGCT TGGGGTCAGTGTTGATAAATACAGTAATTTTTTTCCTATTTACGTAAGAC ACGTTCTTTAAGAATTTAAAGGTGTAGAATAGTGGAAGGAAAAATAATTA CCCATAATTATTTCATCCTTCAAACATAGGCACCATTATTTTGGTGTATT TCCCACAACCTGTTTTTCTTATGCACAGTTTTCCTTTCTTTAAAATAATT GTAATTATAGATGTCTATATAATATCCATATGTATTTCCTTTTCATATCA TTCCATCCCCTTTGTAAGTTTTGCTTAAAATAGTTATTGGTGTAGTTTGG TTTTTT (SEQ ID NO: 152) pARBI-933: ACCAGTTATCGTTTTGTAGGTAGACCAATCAATTCTTAGGTACCAAGAAT 1806 PTPN_3_ CTGAATGATCCTTTTACTTTTAGGAAGGTAAGTACTTTATTCATTTAACT Endogenous_36 ATGTTTACTCCTGGTCGATTTTTCAAGTCACAATGGCTAATGTGCTACAA ACAGAAGTGCTTCCTTTCCAGCACTATAGTAATAAACAAGATTGCATTTT ATACTCCTTACAAAAAAAGTAGAGAATAGTTTAGGATTTGTCTCAACTCT ATTATGATGCTAATTCATTTTTCTTATTTCTTCTGTCTTTTTATAAATAC CTAGAATATTAATATGAAGATAAAAACAGTAATTTTGAAATGACAGTTGT GGTTTATCATTCTCTATTTTCAGATGATCACAGTCGTGTTAAACTGCAAA ATGCTGAGAATGATTATATTAATGCCAGTTTAGTTGACATAGAAGAGGCA tccggatccggagagggcaggggatctctccttacttgtggcgacgtgga ggagaaccccggccccATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGG TGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTC AGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCTGACCCT GAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCG TGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCAC ATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCA GGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCG AGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGC ATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAA CTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCA TCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAG CTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCT GCTGCCCGACAACCACTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACC CCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCC GGGATCACTCTCGGCATGGACGAGCTGTACAAatgattgtttattgcagc ttataatggttacaaataaagcaatagcatcacaaatttcacaaataaag catttttttcactgcattctagttgtggtttgtccaaactcatcaatgta tcttatCAAAGGAGTTACATCTTAACACAGGCAAGTAATACGATACCACG CAAATGTCTGAAAATGATGTGTTTGTGCTTGTTCTGCTTTAAATCATAGG TAAAAGTATATCTTGACTTCTTTTGGAAATGAAATGGTATTTCAGTTTTT TTCTGACGATGTAATGAATTATGGACATTATAAGGTTTTGAAGCTTTGAG TATTTAAGATAAAAGGCAAGTTATTTTTGATATTACACGTTCTGTGAAGG AAAATTCTTAGGAAATGGCTTAGGCCAGTTCTTTGGCAGATTGTGTTCCT TACATTATATCTGACACAGAGTGCTGCTTATGCTTCTTAGTGTCTTCTTT TCTCTTCCCATACCCTGTGGCAAAACCAGAGGCCCTGGACAGCTCTTCTG TAGCTCCCCCTGCCTCGCATATATCTTCAGAGAGTACACCAAGCCCTGGA TGGTGT (SEQ ID NO: 153) pARBI-934: CCTGCTCAAGGGCCGAGGTGTCCACGGTAGCTTCCTGGCTCGGCCCAGTC 1806 PTPN6_1_ GCAAGAACCAGGGTGACTTCTCGCTCTCCGTCAGGTAGGTGGGCCCCCCG Endogenous_37 CAACCCCGGGCATTTTGGCCACTCTCTTGTGCCATCCAGGCCCTGAACCA CTCATTCCTGGTTCCCCGTGGCAGTGCTGACTCCCCGTCTGTTCCCTTGC CCCCAACCCCCACACTCCCCATCCCTGTCTGTGCCCACCCATGCCCATGT GTGCCCCCACCCAGGACCTCAGCCGATCCCTGCCCTCCTGCCTCTACTCC TGCACCGACTGGCCTCACCGCCTGGTGCCCTGCAGGGTGGGGGATCAGGT GACCCATATTCGGATCCAGAACTCAGGGGATTTCTATGACCTGTATGGAG GGGAGAAGTTTGCGACTCTGACAGAGCTGGTGGAGTACTACACTCAGCAG tccggatccggagagggcaggggatctctccttacttgtggcgacgtgga ggagaaccccggccccATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGG TGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTC AGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCTGACCCT GAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCG TGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCAC ATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCA GGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCG AGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGC ATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAA CTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCA TCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAG CTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCT GCTGCCCGACAACCACTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACC CCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCC GGGATCACTCTCGGCATGGACGAGCTGTACAAatgattgtttattgcagc ttataatggttacaaataaagcaatagcatcacaaatttcacaaataaag catttttttcactgcattctagttgtggtttgtccaaactcatcaatgta tcttatCAGGGTGTCCTGCAGGACCGCGACGGCACCATCATCCACCTCAA GTACCCGCTGAACTGCTCCGATCCCACTAGTGAGAGGTGAGGGCTCCGCA CCCCCGCCATTCCCAAGCAGGGATGAGCCGGCTCCCACCCTGAACAGCCA GGGAGGCAGGGAGACTGGCAGCCGGCGCTGCCTACCCTCCATCCCCTCCC CTCCCTGCACCAGCTGGGGCTCTCAATGTCCCTCCTCCCTGCTGTCCTGG GACCTGGTGTCTCAGAGCCTAACCTACCACCCTTTCCACCTAACCCCGAG GAAGCCACAGAAAGCTGCCTCGCCCTACTCCGGGAGCCCTGGCCGCTGCA ACCCAGGTCCCACTGGAGACAGGGAGGCCACTGCTGGTGGCCAGCATGTC GTGCAGGCCAGCTCTGTTGTTAGAAAGCTCTTCTTCCTCTGGAATCGAGC CTGCCT (SEQ ID NO: 154) pARBI-935: CTGGGTTCGAAGCCCGGTTAGAACTCTGGAGGCTAGGATGGCTTGAACCT 1806 PTPN6_2_ GGGAGGTCGAGGCTGCAGAGAGCTGTAACCGCGCCACTGCACTCCAGCCT Endogenous_38 GGGCAACAGAGCTCTGGAAGCTTGCCCTAGAGTCAGTCAAGGGCCCTAGG CCAGTGAGTAACAGCTCAGCGTCAGTTTCCTCATCTATAAAATGGGGGTA ATATCATACCTAGCTCTCAGCATGTTTGTGAGAGACCTAAATGAGGTGGT GGATTTGGAAGCATGTAGCGCAGTGCCTGGCACACAGTAGGTGCTTGATT TCCGGCCCCTCTCTGTGAATGTCTCTGCTCAGCGCCTTCCCCTGTGGCCT GGGTCTTACCTTCCCTGACGCTGCCTTCTCTAGGTGGTACCATGGCCACA TGTCTGGCGGGCAGGCAGAGACGCTGCTGCAGGCCAAGGGCGAGCCCTGG tccggatccggagagggcaggggatctctccttacttgtggcgacgtgga ggagaaccccggccccATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGG TGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTC AGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCTGACCCT GAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCG TGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCAC ATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCA GGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCG AGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGC ATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAA CTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCA TCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAG CTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCT GCTGCCCGACAACCACTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACC CCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCC GGGATCACTCTCGGCATGGACGAGCTGTACAAatgattgtttattgcagc ttataatggttacaaataaagcaatagcatcacaaatttcacaaataaag catttttttcactgcattctagttgtggtttgtccaaactcatcaatgta tcttatACGTTTCTTGTGCGTGAGAGCCTCAGCCAGCCTGGAGACTTCGT GCTTTCTGTGCTCAGTGACCAGCCCAAGGCTGGCCCAGGCTCCCCGCTCA GGGTCACCCACATCAAGGTCATGTGCGAGGTAAGGCAGCCAGGCGGCGGG GGAGCCTCTGCTGAGGCTCCTGTCTGTGACCACAGTGTGGGTGGCAGGGA GGGTCTGCCTGGGCTTGAATTCAAGGCTGGGGACCCAGGGAGGGAGACTC AAGTCCTGTGAATGGCCTAATTTGGCTCCCCCCAGGGTGGACGCTACACA GTGGGTGGTTTGGAGACCTTCGACAGCCTCACGGACCTGGTGGAGCATTT CAAGAAGACGGGGATTGAGGAGGCCTCAGGCGCCTTTGTCTACCTGCGGC AGGTCAGGGGTGGGCCCAGCTGCCTCCCCACTTCCCCTGAGCTGTCCCCC AGATGT (SEQ ID NO: 155) pARBI-936: TGTCACATATGTGCAATGCCATGCTCCTGAGCCTTTGATTGCAGACGTGT 1806 PTPN6_3_ GGGAAGTGGGCCCCGTCCCCACCCCCAGTGCCACCCTGCTCTGCTTCTCT Endogenous_39 TCCCTTGCTGTGCTCTAAAACGAGAAGTACAAGTGAGTTCCCCCAAGGGG TCGGCCGCGCCTCTTCCTGTCCCCGCCCTGCCGGCTGCCCCAGGCCAGTG GAGTGGCAGCCCCAGAACTGGGACCACCGGGGGTGGTGAGGCGGCCCGGC ACTGGGAGCTGCATCTGAGGCTTAGTCCCTGAGCTCTCTGCCTGCCCAGA CTAGCTGCACCTCCTCATTCCCTGCGCCCCCTTCCTCTCCGGAAGCCCCC AGGATGGTGAGGTAAGGGCCTGCCACCCACGGTAGACAGGAGGCAAGGGT GCCTGGTGCCCACGGGACCCCTCCTCACTGCCCTGCCTGGGCCGCCCAGG tccggatccggagagggcaggggatctctccttacttgtggcgacgtgga ggagaaccccggccccATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGG TGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTC AGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCTGACCCT GAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCG TGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCAC ATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCA GGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCG AGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGC ATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAA CTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCA TCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAG CTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCT GCTGCCCGACAACCACTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACC CCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCC GGGATCACTCTCGGCATGGACGAGCTGTACAAatgattgtttattgcagc ttataatggttacaaataaagcaatagcatcacaaatttcacaaataaag catttttttcactgcattctagttgtggtttgtccaaactcatcaatgta tcttatTGGTTTCACCGAGACCTCAGTGGGCTGGATGCAGAGACCCTGCT CAAGGGCCGAGGTGTCCACGGTAGCTTCCTGGCTCGGCCCAGTCGCAAGA ACCAGGGTGACTTCTCGCTCTCCGTCAGGTAGGTGGGCCCCCCGCAACCC CGGGCATTTTGGCCACTCTCTTGTGCCATCCAGGCCCTGAACCACTCATT CCTGGTTCCCCGTGGCAGTGCTGACTCCCCGTCTGTTCCCTTGCCCCCAA CCCCCACACTCCCCATCCCTGTCTGTGCCCACCCATGCCCATGTGTGCCC CCACCCAGGACCTCAGCCGATCCCTGCCCTCCTGCCTCTACTCCTGCACC GACTGGCCTCACCGCCTGGTGCCCTGCAGGGTGGGGGATCAGGTGACCCA TATTCGGATCCAGAACTCAGGGGATTTCTATGACCTGTATGGAGGGGAGA AGTTTG (SEQ ID NO: 156) pARBI-937: TCTTTAGCTAATGTTCTGTAGCTTGTTTTAGATCTGTGTCTACATATTAT 1806 PTPRC_1_ ATGTATCTCTGTATGCATCAATATTTGTATATATGCATGCATATATGTGT Endogenous_40 ATGTATATTTATGAATACATATACACACCATATACATACACTTATGTATG GTGTGTGCATATGTATGTGTAGATATATGTACATACACACTATAGTGGAC TGGGGAGTTAGTATACTGGGAGGAGCATACATTTAGGGTATGATTCACAT ATTTATTTTGTCCTTCTCCCATTTTCCATTAATTAACAGGATTGACTACA GCAAAGATGCCCAGTGTTCCACTTTCAAGTGACCCCTTACCTACTCACAC CACTGCATTCTCACCCGCAAGCACCTTTGAAAGAGAAAATGACTTCTCAG AGACCACAACTTCTCTTAGTCCAGACAATACTTCCACCCAAGTATCCCCG tccggatccggagagggcaggggatctctccttacttgtggcgacgtgga ggagaaccccggccccATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGG TGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTC AGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCTGACCCT GAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCG TGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCAC ATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCA GGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCG AGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGC ATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAA CTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCA TCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAG CTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCT GCTGCCCGACAACCACTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACC CCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCC GGGATCACTCTCGGCATGGACGAGCTGTACAAatgattgtttattgcagc ttataatggttacaaataaagcaatagcatcacaaatttcacaaataaag catttttttcactgcattctagttgtggtttgtccaaactcatcaatgta tcttatGACTCTTTGGATAATGCTAGTGCTTTTAATACCACAGGTTGGCA CACAAAAGTTGTTAACTTAAATATCAGGGAATGTCATTTAGAAAATTCTA CAGTTATCAGTACAACTTGTCTTTAAATTATTTGCACAGTTTCTAAGTAT GTGATTTTATTCAAGTGCAGAAATTGCAGGAAATTAGTATCTGTGAAATA TAGATCGACTGAAGTAATTAATGCATGTTGTTAGGGAGTGGAGAGAGAAA AGAAGGGAAGCAAGATCTCCAAGGACAATCAGGAGGGGAAATTTGTTCTA GTATCCTCTGATCTATACACACTCGCCATGATTCTCCCGCTTGGCTTCCC GCCACCTGGACAGATGAGAATTTCCCTAGTTCAGAGATTATCAGTTCTAC TCTCCTTGGAAGGTGTCTTAAATGGGAGTCTTCCCATTTCTTTGTTTCAC TCTAGA (SEQ ID NO: 157) pARBI-938: AGGGAATTTTTATATGTTGGTCATATTATATCCCTAGCATGTGGCATAAT 1806 PTPRC_2_ GTCTAGCACAAAATAGGTGCTCAATTAATCATCATTATCTAAATAAATAA Endogenous_41 TGCATTTGGGAAAAAAAAGTTTCAAAAGTTTTTCAAAAGTCTTTTGCAGG CTTGAAATTAATCCCAATAGTGATCCTTTAGTCTGTTATGTTTCTGATTT AGCCTGGGGATTCAAAAAATAAATAACATAATTTTGATATATTTGGGCTT TGTAAACATGGTACTAAGAGAGGAAATATAGTTTCATTAGGGTAAAAGCT ACTGAAAATTGCCACTTGGTGAATGTTCTATCATAGACTTGAGGTACATA TAAAAATCTAATATATGTTTACATTAATATGAATGAAATTTGAAATTTTC TAAGAGATTTTTGTTTCTTCTTTGCAGGGCAAAGCCCAACACCTTCCCCC tccggatccggagagggcaggggatctctccttacttgtggcgacgtgga ggagaaccccggccccATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGG TGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTC AGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCTGACCCT GAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCG TGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCAC ATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCA GGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCG AGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGC ATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAA CTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCA TCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAG CTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCT GCTGCCCGACAACCACTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACC CCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCC GGGATCACTCTCGGCATGGACGAGCTGTACAAatgattgtttattgcagc ttataatggttacaaataaagcaatagcatcacaaatttcacaaataaag catttttttcactgcattctagttgtggtttgtccaaactcatcaatgta tcttatACTGGTAAGAATTAATATTTATATTTTTACTAATTTTATTTTCT TGTTGCAAAGTTTATATATTTAACTACAATTTTCTATTATTAACACTGAA ATTATTTTTAAGGATAAATTTTATAATCATGAGTGATTCTTGACATTCAC TTGTTCTTAAACTTTCTGCTTATACGTTATAGAGTTTAATAACTACCTAA ACATGTTATTAAATTTGTATATATATTTTGTGTATAAATAGTAACTTTTC CCAAACTTGACAGTAAATCACACAACAGGTTTCTACTCTCTTTTAATATT TTAAGACTATAAAAAAATGCATTTAAATTAGATAACAAAATTTTATAGTC TGAAAGCAGGTTAACAGCTGTCTATGTATGTTATAGATATGTAGATAACA GATTTGCATATGTCTATATTTCTTTAAGAGTATGTTGCTTTTTTCAATGG TATGCA (SEQ ID NO: 158) pARBI-939: CCTTTAGCACCATAAAGAAACTAAATTATTTAGATGTTTTTATGAGAACA 1806 PTPRC_3_ TATCAAAAAGTACTTTTCTGTCATCCAATACTTCCACAAATAAATCATTA Endogenous_42 GTTCTTGCTAATCTTCATCTGGCATAAAAATAATGACATCAACTTTCTTC ATGTAATTTCCCACTTAATTCCTTTACTAGGAGCAATATCAATTCCTATA TGACGTCATTGCCAGCACCTACCCTGCTCAGAATGGACAAGTAAAGAAAA ACAACCATCAAGAAGATAAAATTGAATTTGATAATGAAGTGGACAAAGTA AAGCAGGATGCTAATTGTGTTAATCCACTTGGTGCCCCAGAAAAGCTCCC TGAAGCAAAGGAACAGGCTGAAGGTTCTGAACCCACGAGTGGCACTGAGG GGCCAGAACATTCTGTCAATGGTCCTGCAAGTCCAGCTTTAAATCAAGGT tccggatccggagagggcaggggatctctccttacttgtggcgacgtgga ggagaaccccggccccATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGG TGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTC AGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCTGACCCT GAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCG TGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCAC ATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCA GGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCG AGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGC ATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAA CTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCA TCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAG CTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCT GCTGCCCGACAACCACTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACC CCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCC GGGATCACTCTCGGCATGGACGAGCTGTACAAatgattgtttattgcagc ttataatggttacaaataaagcaatagcatcacaaatttcacaaataaag catttttttcactgcattctagttgtggtttgtccaaactcatcaatgta tcttatTCATAGGAAAAGACATAAATGAGGAAACTCCAAACCTCCTGTTA GCTGTTATTTCTATTTTTGTAGAAGTAGGAAGTGAAAATAGGTATACAGT GGATTAATTAAATGCAGCGAACCAATATTTGTAGAAGGGTTATATTTTAC TACTGTGGAAAAATATTTAAGATAGTTTTGCCAGAACAGTTTGTACAGAC GTATGCTTATTTTAAAATTTTATCTCTTATTCAGTAAAAAACAACTTCTT TGTAATCGTTATGTGTGTATATGTATGTGTGTATGGGTGTGTGTTTGTGT GAGAGACAGAGAAAGAGAGAGAATTCTTTCAAGTGAATCTAAAAGCTTTT GCTTTTCCTTTGTTTTTATGAAGAAAAAATACATTTTATATTAGAAGTGT TAACTTAGCTTGAAGGATCTGTTTTTAAAAATCATAAACTGTGTGCAGAC TCAATA (SEQ ID NO: 159) pARBI-940: aaggaaaaaggcactgagtgctggggggtgctggggtgggctgcagtgat 1806 PTPRCAP_1_ agacatcagggtagaggttaaggtcaggttcagcctcactggggtgaagt Endogenous_43 ttgagcacggtgagcaggccatgcagcccgggggaggggaggatgggagg aggtggagctttccgggcagagggaacagccagtgcgaaggccccaggca ggtggcttaatgcagctgttgggggaggtgagtggtagggaggaggctgg agggatgggggctgatctcacagggccagagcctggttgaccaaataagg ccttggccttttctGCTTGGCTGTCCCAAGAGGATCCCAAAGAGAAAAAA ACGAAAGTGGTCTTGGTCACCCAGCCTGCCCCACACCAGGCCCCACCCCA GGTGCTGAGCCCTCTGAGCCCCTGCCTGTCTCCCACAGGCTCTGCCCTGC tccggatccggagagggcaggggatctctccttacttgtggcgacgtgga ggagaaccccggccccATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGG TGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTC AGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCTGACCCT GAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCG TGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCAC ATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCA GGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCG AGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGC ATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAA CTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCA TCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAG CTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCT GCTGCCCGACAACCACTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACC CCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCC GGGATCACTCTCGGCATGGACGAGCTGTACAAatgattgtttattgcagc ttataatggttacaaataaagcaatagcatcacaaatttcacaaataaag catttttttcactgcattctagttgtggtttgtccaaactcatcaatgta tcttatACCTTAGGGCTCGGGATGCTGCTGGCCCTGCCAGGGGCCTTGGG CTCGGGTGGCAGCGCGGAGGACAGCGTGGGCTCCAGCTCTGTCACCGTTG TCctgctgctgctgctgctcctactgctgGCCACTGGCCTAGCACTGGCC TGGCGCCGCCTCAGCCGTGACTCAGGGGGCTACTACCACCCGGCCCGCCT AGGTGCCGCGCTGTGGGGCCGCACGCGGCGCCTGCTCTGGGCCAGCCCCC CAGGTCGCTGGCTGCAGGCCCGAGCTGAGCTGGGGTCCACAGACAATGAC CTTGAGCGACAGGAGGATGAGCAGGACACAGACTATGACCACGTCGCGGA TGGTGGCCTGCAGGCTGACCCTGGGGAAGGCGAGCAGCAATGTGGAGAGG CGTCCAGCCCAGAGCAGGTCCCCGTGCGGGCTGAGGAAGCCAGAGACAGT GACACG (SEQ ID NO: 160) pARBI-941: aaggtcaggttcagcctcactggggtgaagtttgagcacggtgagcaggc 1806 PTPRCAP_2_ catgcagcccgggggaggggaggatgggaggaggtggagctttccgggca Endogenous_44 gagggaacagccagtgcgaaggccccaggcaggtggcttaatgcagctgt tgggggaggtgagtggtagggaggaggctggagggatgggggctgatctc acagggccagagcctggttgaccaaataaggccttggccttttctGCTTG GCTGTCCCAAGAGGATCCCAAAGAGAAAAAAACGAAAGTGGTCTTGGTCA CCCAGCCTGCCCCACACCAGGCCCCACCCCAGGTGCTGAGCCCTCTGAGC CCCTGCCTGTCTCCCACAGGCTCTGCCCTGCACCTTAGGGCTCGGGATGC TGCTGGCCCTGCCAGGGGCCTTGGGCTCGGGTGGCAGCGCGGAGGACAGC tccggatccggagagggcaggggatctctccttacttgtggcgacgtgga ggagaaccccggccccATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGG TGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTC AGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCTGACCCT GAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCG TGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCAC ATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCA GGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCG AGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGC ATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAA CTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCA TCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAG CTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCT GCTGCCCGACAACCACTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACC CCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCC GGGATCACTCTCGGCATGGACGAGCTGTACAAatgattgtttattgcagc ttataatggttacaaataaagcaatagcatcacaaatttcacaaataaag catttttttcactgcattctagttgtggtttgtccaaactcatcaatgta tcttatGTGGGCTCCAGCTCTGTCACCGTTGTCctgctgctgctgctgct cctactgctgGCCACTGGCCTAGCACTGGCCTGGCGCCGCCTCAGCCGTG ACTCAGGGGGCTACTACCACCCGGCCCGCCTAGGTGCCGCGCTGTGGGGC CGCACGCGGCGCCTGCTCTGGGCCAGCCCCCCAGGTCGCTGGCTGCAGGC CCGAGCTGAGCTGGGGTCCACAGACAATGACCTTGAGCGACAGGAGGATG AGCAGGACACAGACTATGACCACGTCGCGGATGGTGGCCTGCAGGCTGAC CCTGGGGAAGGCGAGCAGCAATGTGGAGAGGCGTCCAGCCCAGAGCAGGT CCCCGTGCGGGCTGAGGAAGCCAGAGACAGTGACACGGAGGGCGACCTGG TCCTCGGCTCCCCAGGACCAGCGAGCGCAGGGGGCAGTGCTGAGGCCCTG CTGAGT (SEQ ID NO: 161) pARBI-942: cagtgcgaaggccccaggcaggtggcttaatgcagctgttgggggaggtg 1806 PTPRCAP_3_ agtggtagggaggaggctggagggatgggggctgatctcacagggccaga Endogenous 45 gcctggttgaccaaataaggccttggccttttctGCTTGGCTGTCCCAAG AGGATCCCAAAGAGAAAAAAACGAAAGTGGTCTTGGTCACCCAGCCTGCC CCACACCAGGCCCCACCCCAGGTGCTGAGCCCTCTGAGCCCCTGCCTGTC TCCCACAGGCTCTGCCCTGCACCTTAGGGCTCGGGATGCTGCTGGCCCTG CCAGGGGCCTTGGGCTCGGGTGGCAGCGCGGAGGACAGCGTGGGCTCCAG CTCTGTCACCGTTGTCctgctgctgctgctgctcctactgctgGCCACTG GCCTAGCACTGGCCTGGCGCCGCCTCAGCCGTGACTCAGGGGGCTACTAC tccggatccggagagggcaggggatctctccttacttgtggcgacgtgga ggagaaccccggccccATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGG TGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTC AGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCTGACCCT GAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCG TGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCAC ATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCA GGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCG AGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGC ATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAA CTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCA TCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAG CTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCT GCTGCCCGACAACCACTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACC CCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCC GGGATCACTCTCGGCATGGACGAGCTGTACAAatgattgtttattgcagc ttataatggttacaaataaagcaatagcatcacaaatttcacaaataaag catttttttcactgcattctagttgtggtttgtccaaactcatcaatgta tcttatCACCCGGCCCGCCTAGGTGCCGCGCTGTGGGGCCGCACGCGGCG CCTGCTCTGGGCCAGCCCCCCAGGTCGCTGGCTGCAGGCCCGAGCTGAGC TGGGGTCCACAGACAATGACCTTGAGCGACAGGAGGATGAGCAGGACACA GACTATGACCACGTCGCGGATGGTGGCCTGCAGGCTGACCCTGGGGAAGG CGAGCAGCAATGTGGAGAGGCGTCCAGCCCAGAGCAGGTCCCCGTGCGGG CTGAGGAAGCCAGAGACAGTGACACGGAGGGCGACCTGGTCCTCGGCTCC CCAGGACCAGCGAGCGCAGGGGGCAGTGCTGAGGCCCTGCTGAGTGACCT GCACGCCTTTGCTGGCAGCGCAGCCTGGGATGACAGCGCCAGGGCAGCTG GGGGCCAGGGCCTCCATGTCACCGCACTGTAGAGGCCGGTCTTGGTGTCC CATCCC (SEQ ID NO: 162) pARBI-943: ATAGAGTAGGGCGGGGGATGCCATGGAGAGGCTCCATGGGGGAGGGCCGG 1806 RPS23_1_ GGAAGCGCCGCTCCAGGAGGCACGTGGTCCGGCGCGGAAGGGGCCCATGA Endogenous_46 GGCGTGGAGGCCGCCGAGGTCGGGGTACCGAGGGACGCAGGGAGGCCAGC GCTTCCTCCCGGGCATTCGAGCGGGGCCTCGTCCTTCGGGAGAACACATT CTCCGGAGCCCTCTTCGAACGTTTATTAGTCGGTTCAGGGCAACTTGAAG GCCAAATGTTTGGCCCACAGGCCAATAAATAGTACGAGAGCCAATCGGCT TAAGGGTTTATTCCAGGTGAGGCGAGTGTCTTAGAAGATGGGAAACACGT AGATGGCGTGTTTTTACGGAAGAACTAAAATATTTAATTTTTAGGCAAGT GTCGTGGACTTCGTACTGCTAGGAAGCTCCGTAGTCACCGACGAGACCAG tccggatccggagagggcaggggatctctccttacttgtggcgacgtgga ggagaaccccggccccATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGG TGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTC AGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCTGACCCT GAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCG TGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCAC ATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCA GGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCG AGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGC ATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAA CTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCA TCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAG CTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCT GCTGCCCGACAACCACTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACC CCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCC GGGATCACTCTCGGCATGGACGAGCTGTACAAatgattgtttattgcagc ttataatggttacaaataaagcaatagcatcacaaatttcacaaataaag catttttttcactgcattctagttgtggtttgtccaaactcatcaatgta tcttatAAGTGGCATGATAAACAGTATAAGAAAGCTCATTTGGGCACAGC CCTAAAGGCCAACCCTTTTGGAGGTGCTTCTCATGCAAAAGGAATCGTGC TGGAAAAAGTGTAAGTCCATTGCTCCCGTCAAGTTTTAGTTTATTATAGG AATTCGAGACATGAACTTACGAATTCTTGTTTTGAAAGTAATTGCAGGTT TTTGTGTAGTAGTATTCATTTGGGCATTGTGGGGTAAAATTGCAAAGCGT TTGTTCTATTTAAAAGTTGGTAAAATTAGTTTTTGGGAATTAGGTAGTTA AGGTTTTAATTTAACGTTGGCCTGGAAGGAATTGGAGAAGATACTAGCAA TGATGAAGTAAAGGACACAAACACCTTTACTGTGGGAGTTGTTATAAGTA AATGGCACGTGTCAGCTATTGAACTTTATCGACTTGATAAAACTAAGGTG AAGAGA (SEQ ID NO: 163) pARBI-944: CAAGCGGGACTTGGGGTCTTGGGGACGGGCGGGCGGATGCGAATAGAGTA 1806 RPS23_2_ GGGCGGGGGATGCCATGGAGAGGCTCCATGGGGGAGGGCCGGGGAAGCGC Endogenous_47_48 CGCTCCAGGAGGCACGTGGTCCGGCGCGGAAGGGGCCCATGAGGCGTGGA GGCCGCCGAGGTCGGGGTACCGAGGGACGCAGGGAGGCCAGCGCTTCCTC CCGGGCATTCGAGCGGGGCCTCGTCCTTCGGGAGAACACATTCTCCGGAG CCCTCTTCGAACGTTTATTAGTCGGTTCAGGGCAACTTGAAGGCCAAATG TTTGGCCCACAGGCCAATAAATAGTACGAGAGCCAATCGGCTTAAGGGTT TATTCCAGGTGAGGCGAGTGTCTTAGAAGATGGGAAACACGTAGATGGCG TGTTTTTACGGAAGAACTAAAATATTTAATTTTTAGGCAAGTGCCGCGGA tccggatccggagagggcaggggatctctccttacttgtggcgacgtgga ggagaaccccggccccATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGG TGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTC AGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCTGACCCT GAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCG TGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCAC ATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCA GGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCG AGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGC ATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAA CTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCA TCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAG CTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCT GCTGCCCGACAACCACTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACC CCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCC GGGATCACTCTCGGCATGGACGAGCTGTACAAatgattgtttattgcagc ttataatggttacaaataaagcaatagcatcacaaatttcacaaataaag catttttttcactgcattctagttgtggtttgtccaaactcatcaatgta tcttatCTTCGTACTGCTAGGAAGCTCCGTAGTCACCGACGAGACCAGAA GTGGCATGATAAACAGTATAAGAAAGCTCATTTGGGCACAGCCCTAAAGG CCAACCCTTTTGGAGGTGCTTCTCATGCAAAAGGAATCGTGCTGGAAAAA GTGTAAGTCCATTGCTCCCGTCAAGTTTTAGTTTATTATAGGAATTCGAG ACATGAACTTACGAATTCTTGTTTTGAAAGTAATTGCAGGTTTTTGTGTA GTAGTATTCATTTGGGCATTGTGGGGTAAAATTGCAAAGCGTTTGTTCTA TTTAAAAGTTGGTAAAATTAGTTTTTGGGAATTAGGTAGTTAAGGTTTTA ATTTAACGTTGGCCTGGAAGGAATTGGAGAAGATACTAGCAATGATGAAG TAAAGGACACAAACACCTTTACTGTGGGAGTTGTTATAAGTAAATGGCAC GTGTCA (SEQ ID NO: 164) pARBI-945: TTGTATATCCTTTTTAAAGTTATTCTTTTTAGTTAATTGCTCTATGTGTA 1806RTRAF_1_ TTGAGACTAGGAATCAGAAAGCTTAGATTCTAGTCCCAGGTGTAAGTTGT Endogenous_49 GTAACCCTTGGCAAGTGTCAATCTCTAGGCCTCAGCTTTCTCATCTATAA AATGAGGAAGTTGTCGTATTCTATTTTTTTTCTTAAGATGATACACTTAA ATGTTCCCTTCTGTTGGGTTATATAATTGCATCAAAAGTGTAGTAATGTT ATTAAAAAATTGTTAGAGATCCAAACTAAGGTCTCTTTCAACTCTCCCAT TCTTTTTTCTGTGACTTTATGGTAATAATGAAACTGGTGGTTTTCTTTTC TTCCCCCTCACAGTATCTCAGAGATGTTAACTGTCCTTTCAAGATTCAAG ATCGACAAGAAGCTATTGACTGGCTTCTTGGTTTAGCTGTTAGACTTGAA tccggatccggagagggcaggggatctctccttacttgtggcgacgtgga ggagaaccccggccccATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGG TGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTC AGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCTGACCCT GAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCG TGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCAC ATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCA GGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCG AGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGC ATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAA CTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCA TCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAG CTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCT GCTGCCCGACAACCACTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACC CCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCC GGGATCACTCTCGGCATGGACGAGCTGTACAAatgattgtttattgcagc ttataatggttacaaataaagcaatagcatcacaaatttcacaaataaag catttttttcactgcattctagttgtggtttgtccaaactcatcaatgta tcttatTATGGAGATAATGGTACGTTTTGTGGGGAATGTGTATTTTAAAG AGAGAGGAAAGATGGGAAAGGGAGTGTGAAAATGTAGGGAACTTTGCAGT TTGTTTTGTCTAGTACTATTTTACCTTTGGTTTATTCTTATCACAAGTTA AAAGCACTTTTATTGTCTTTCATTGGTGTTTATATATTTCTGTTAGAATT TGGAAATGGTGCCCTCTGGAGAAGGCTAATTGACTGTCTTCTCACAGAGT AACACTACTTTGATAATATGGTCTGCACCTTAGCCTTTCAAATTAAATTG TTTTTAGTGTCCCAGAATTGATGGGACTTTGAAGTGTTGTTGCAGTAGGT AATTTCTCAAAAGACTGAAACATGTCTAATGCCAATATACATTAATACTC TATAGGCCAGAATATATTACTTATGTTTACTGTCTTAGCACAGATACTTC TATGGT (SEQ ID NO: 165) pARBI-946: GCACCTGTTATGTAGGAGGAGTAATAAAATGAATGAATGCACTCAAAACA 1806 RTRAF_2_ CTAAACAGTAATTCTGTAATCCAACGGGAGGTACAGCGAATACCAAAAGC Endogenous_50 CTACATATACTATTCTCTGCATGCTATAGCAAGAAAGAAAGGAAAGTGGC TTCCCGGTGGTTTTCTGCCTATTGTACAACCAGGAAGCTGACAATAAAGT TTATTTGAGCGTCGACGTGCGCCGACGTGGCCCCGCCTCCCCAGCCGGAG CCGCGATTGGTGGGCATTTGCCGGCGGCCACCGCTTTTAAGCCACGATTG GCGAAGGCCGCCGTCATTTCGGAGCGACTCAGCGCCTGCCCGCCCTCTCG CCGCGTCGCCGGTGCCTGCGCCTCCCGCTCCACCTCGCTTCTTCTCTCCC GGCCGAGGCCCGGGGGACCAGAGCGAGAAGCGGGGACCATGTTCCGACGC tccggatccggagagggcaggggatctctccttacttgtggcgacgtgga ggagaaccccggccccATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGG TGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTC AGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCTGACCCT GAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCG TGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCAC ATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCA GGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCG AGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGC ATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAA CTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCA TCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAG CTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCT GCTGCCCGACAACCACTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACC CCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCC GGGATCACTCTCGGCATGGACGAGCTGTACAAatgattgtttattgcagc ttataatggttacaaataaagcaatagcatcacaaatttcacaaataaag catttttttcactgcattctagttgtggtttgtccaaactcatcaatgta tcttatAAGTTGACGGCTCTCGACTACCACAACCCCGCCGGCTTCAACTG CAAAGGTGAGGCGGCGGCCTCAGCCCGGCCGCGTGTCCCTGACCTGGGCG GAGGTCCCAGCCTCAGTGCCCGCACCCCACCTCCCCGTCGGGACCCTCGG CGGCCTGGTTTCCGCCGGCAGCCTCCGGGCCCCTCTCCTCTGGGTCGCCA CGTACCTCGGCTCTTCGCCGCCCCTTCCCGCCTTTAAAGCCCTCTCACCT ACTCCTGTCTCGGCATGTTACTTTCTGCACTTGCTTAACTCCAAGCATCA CGTAACTACCTTCTCTGTACATAAAAGGGAGAGCATTCGTCTTTCTCACT CACTATTCAACTCCATGGTTCCCTGGGTAATTAGGCGATACCTTGAGCAC CTGCTAATTATGGGCCAGCGCGGTGCTGGATTCTGAGGAAGGTGCTGAGT AACTTG (SEQ ID NO: 166) pARBI-947: CACTAAACAGTAATTCTGTAATCCAACGGGAGGTACAGCGAATACCAAAA 1806 RTRAF_3_ GCCTACATATACTATTCTCTGCATGCTATAGCAAGAAAGAAAGGAAAGTG Endogenous_51 GCTTCCCGGTGGTTTTCTGCCTATTGTACAACCAGGAAGCTGACAATAAA GTTTATTTGAGCGTCGACGTGCGCCGACGTGGCCCCGCCTCCCCAGCCGG AGCCGCGATTGGTGGGCATTTGCCGGCGGCCACCGCTTTTAAGCCACGAT TGGCGAAGGCCGCCGTCATTTCGGAGCGACTCAGCGCCTGCCCGCCCTCT CGCCGCGTCGCCGGTGCCTGCGCCTCCCGCTCCACCTCGCTTCTTCTCTC CCGGCCGAGGCCCGGGGGACCAGAGCGAGAAGCGGGGACCATGTTCCGAC GCAAGTTGACGGCTCTCGACTACCACAACCCCGCCGGCTTCAACTGCAAA tccggatccggagagggcaggggatctctccttacttgtggcgacgtgga ggagaaccccggccccATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGG TGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTC AGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCTGACCCT GAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCG TGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCAC ATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCA GGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCG AGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGC ATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAA CTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCA TCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAG CTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCT GCTGCCCGACAACCACTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACC CCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCC GGGATCACTCTCGGCATGGACGAGCTGTACAAatgattgtttattgcagc ttataatggttacaaataaagcaatagcatcacaaatttcacaaataaag catttttttcactgcattctagttgtggtttgtccaaactcatcaatgta tcttatGGTGAGGCGGCGGCCTCAGCCCGGCCGCGTGTCCCTGACCTGGG CGGAGGTCCCAGCCTCAGTGCCCGCACCCCACCTCCCCGTCGGGACCCTC GGCGGCCTGGTTTCCGCCGGCAGCCTCCGGGCCCCTCTCCTCTGGGTCGC CACGTACCTCGGCTCTTCGCCGCCCCTTCCCGCCTTTAAAGCCCTCTCAC CTACTCCTGTCTCGGCATGTTACTTTCTGCACTTGCTTAACTCCAAGCAT CACGTAACTACCTTCTCTGTACATAAAAGGGAGAGCATTCGTCTTTCTCA CTCACTATTCAACTCCATGGTTCCCTGGGTAATTAGGCGATACCTTGAGC ACCTGCTAATTATGGGCCAGCGCGGTGCTGGATTCTGAGGAAGGTGCTGA GTAACTTGAAGACTAGTTCACTGCCTGCCAGGAGCTAAAGGGACGAGGGG TGGAAG (SEQ ID NO: 167) pARBI-948: CCGGGTTAGGCATAGGCCCTCCCGGATCTTCCGCGGTGTAAGGAGAAGGC 1806 SERF2_1_ CAGCGCCCCTGTGATAGGCCCAGAGCCCCCACTCCACAAGCCAGCCCATC Endogenous_52 CCCCACGGAGACCCAAACCGTCCACACACACCTTGCCAGCTGTTTGGGCC CCACGCCGCCCCAAAACACGCCTCCAGCTGGCCCCTTGGGACCTCCCTTC TCTAGTCCGTATTTTGACCTGGCCCGTGGCAGATTCGCCACTCCCCCCTA CCCCAAGCAGCCTGGGCCTCGATGGGCCGTTGTCGGGGCCCGGAGATTGA AGTGGTGTTGGATCCTGCTGCTGGCCGCGCTGGGGTAGAAGGGTCGCCGG TGTGTGGGCAGAGCGGCCCCCGCGTCTCACCTTTAATTTTCTTTCCTTAG GCGGTAACCAGCGTGAGCTCGCCCGCCAGAAGAATATGAAAAAGCAGAGC tccggatccggagagggcaggggatctctccttacttgtggcgacgtgga ggagaaccccggccccATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGG TGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTC AGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCTGACCCT GAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCG TGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCAC ATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCA GGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCG AGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGC ATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAA CTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCA TCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAG CTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCT GCTGCCCGACAACCACTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACC CCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCC GGGATCACTCTCGGCATGGACGAGCTGTACAAatgattgtttattgcagc ttataatggttacaaataaagcaatagcatcacaaatttcacaaataaag catttttttcactgcattctagttgtggtttgtccaaactcatcaatgta tcttatGACTCGGTTAAGGGAAAGCGCCGAGATGACGGGCTTTCTGCTGC CGCCCGCAAGCAGAGGTAGCCCCAGGGAGGGGAGGGAAAGGGACGGTGGA GACCTGGGTTAGACCAAGGGTTATAGAAGGAAAGAGAGCTACCTCAGGGC TTGAATGTGGACTAGTCGTGAGGAGCAGAGTGCATTGCTTCCTCTAGGGT TTTATTTCCTCCCCACCCTCCAAATTGTTAGCTCACAGCCTTACAGGAAA GGACGGGGGCGGGCGCCTGCCCTCAGTCTGATTTCTGAGCGTCCCTGGGT CTGACCTTAAGGGCAAGGGCAGGGAGCTTCACATTTCAAATACAGTTGTG GTTACGGCAGCCCAGTACTTTTGGCCCTCCTTGCTGTTCGGTTCTCCTCC CTTCTCCCAACCTCCTCACTGGTGTTGCTGGGTGTGGTCCTCAATACAGA ATAGAG (SEQ ID NO: 168) pARBI-949: AATCGAGCCCTTTGCCCACGGCTACTTCACGGGACCACCCTCCCGGGTTA 1806 SERF2_2_ GGCATAGGCCCTCCCGGATCTTCCGCGGTGTAAGGAGAAGGCCAGCGCCC Endogenous_ CTGTGATAGGCCCAGAGCCCCCACTCCACAAGCCAGCCCATCCCCCACGG 53_54 AGACCCAAACCGTCCACACACACCTTGCCAGCTGTTTGGGCCCCACGCCG CCCCAAAACACGCCTCCAGCTGGCCCCTTGGGACCTCCCTTCTCTAGTCC GTATTTTGACCTGGCCCGTGGCAGATTCGCCACTCCCCCCTACCCCAAGC AGCCTGGGCCTCGATGGGCCGTTGTCGGGGCCCGGAGATTGAAGTGGTGT TGGATCCTGCTGCTGGCCGCGCTGGGGTAGAAGGGTCGCCGGTGTGTGGG CAGAGCGGCCCCCGCGTCTCACCTTTAATTTTCTTTCCTTAGGCGGTAAC tccggatccggagagggcaggggatctctccttacttgtggcgacgtgga ggagaaccccggccccATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGG TGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTC AGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCTGACCCT GAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCG TGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCAC ATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCA GGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCG AGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGC ATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAA CTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCA TCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAG CTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCT GCTGCCCGACAACCACTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACC CCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCC GGGATCACTCTCGGCATGGACGAGCTGTACAAatgattgtttattgcagc ttataatggttacaaataaagcaatagcatcacaaatttcacaaataaag catttttttcactgcattctagttgtggtttgtccaaactcatcaatgta tcttatCAGCGCGAGCTTGCCCGCCAGAAGAATATGAAAAAGCAGAGCGA CTCGGTTAAGGGAAAGCGCCGAGATGACGGGCTTTCTGCTGCCGCCCGCA AGCAGAGGTAGCCCCAGGGAGGGGAGGGAAAGGGACGGTGGAGACCTGGG TTAGACCAAGGGTTATAGAAGGAAAGAGAGCTACCTCAGGGCTTGAATGT GGACTAGTCGTGAGGAGCAGAGTGCATTGCTTCCTCTAGGGTTTTATTTC CTCCCCACCCTCCAAATTGTTAGCTCACAGCCTTACAGGAAAGGACGGGG GCGGGCGCCTGCCCTCAGTCTGATTTCTGAGCGTCCCTGGGTCTGACCTT AAGGGCAAGGGCAGGGAGCTTCACATTTCAAATACAGTTGTGGTTACGGC AGCCCAGTACTTTTGGCCCTCCTTGCTGTTCGGTTCTCCTCCCTTCTCCC AACCTC (SEQ ID NO: 169) pARBI-950: TTCAAAAGGAATCTTTTTGTTTCAACTATTAGGATCTTTTTAAATCAAAT 1806 SLC38A1_1_ ATGTATTTTAATGGTGTACACCAGACCTGAAGAAAAAGATCACAGAAGGA Endogenous_55 ATTTCCCCTTTGTAAGATAGAGGTAGTTAAAAATAAGCCTTATGACCTAA TAGGTTAATTGTAATGCTCTGGTAGCCAACTTGAAAAAGGAGAAATGATG GTTGCATCTCACATTTTAAAATGTTAAGAATTTGTTTTGTAATGAGGATA CCTTAACCCCTGAAGGCCAAAATGACTATTTGTGTGAGTTTGAGAAAGGC ACATAGTAACTTGGGGAACAGTCAATAGAATGACAAAATCTTGACTATTT TAACTTTCTGAACCCTGTCATTTCTTGTGTTCTCAAATTTGATTTTTAAA TAGGCTGCATGGTGTATGAAAAGCTGGGGGAACAAGTCTTTGGCACCACA tccggatccggagagggcaggggatctctccttacttgtggcgacgtgga ggagaaccccggccccATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGG TGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTC AGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCTGACCCT GAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCG TGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCAC ATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCA GGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCG AGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGC ATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAA CTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCA TCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAG CTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCT GCTGCCCGACAACCACTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACC CCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCC GGGATCACTCTCGGCATGGACGAGCTGTACAAatgattgtttattgcagc ttataatggttacaaataaagcaatagcatcacaaatttcacaaataaag catttttttcactgcattctagttgtggtttgtccaaactcatcaatgta tcttatGGGAAGTTCGTAATCTTTGGAGCCACCTCTCTACAGAACACTGG AGGTAAAAAGAACATGCTTTTCTTTACATAACTTGAAACTAATTCTGGTG GATGAGCGGCCACATGAATTTTAACATAATTCAAGCAGTTATCATCCTCA TTGCTAAAATGGCACAGGGAAAGTAAAGCAGAGACAGCAGTCACTTATTT AAAGCCACAAATCCTGCTAGAGTAGCTGAAGTTGCCTTTGTGTCTTACTG ACAGTTGGTCTAAGAACGGGCTGATAACTTTTTATGTAGCTTGCAACATA AGCCTTCGTATCTTCTTTCTAAAAATGTTCTCATTTTCCTTCAGCAATGC TGAGCTACCTCTTCATCGTAAAAAATGAACTACCCTCTGCCATAAAGTTT CTAATGGGAAAGGAAGAGACATTTTCGTAAGTTATAGACCATTTTTTTAT GATATA (SEQ ID NO: 170) pARBI-951: TAAAGATGTCAAGCTTCAAATCATTATCATAAGGTTAAATATATTACGCA 1806 SLC38A1_2_ TGTCTTTACAGCAAGTTTATTTCTGATCGTGAAAGTAGAAGAAGTCTCAC Endogenous_56_57 AAACAGCCATTTGGAAAAAAAGAAGTGTGATGAGTATGTAAGTATCATTA TAATGAAGTATGCTTATTTTTTTAATCATATAAATTGGGACAAATCTTTT TAATTCAAGTAGCATAGTACCAAAAGCATAACTACCTGTATTCACCAGGA TATTCCTCATACCTTATGTATTTTCTCTTGAAATCCATTCAAGGAAATAA CTAGATAACTAAGAATGGATTTTCTTGAATTTTACTTCCATATTGGTTCC TGAGAGTGCATTAAAGGCTTTGGCTTGATGAACTTTCTTGGGGATTATTT TGCTTAGTGTCAATGTTTGTTTCTTTGTTCAGATTCCAGGTACAACCTCC tccggatccggagagggcaggggatctctccttacttgtggcgacgtgga ggagaaccccggccccATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGG TGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTC AGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCTGACCCT GAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCG TGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCAC ATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCA GGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCG AGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGC ATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAA CTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCA TCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAG CTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCT GCTGCCCGACAACCACTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACC CCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCC GGGATCACTCTCGGCATGGACGAGCTGTACAAatgattgtttattgcagc ttataatggttacaaataaagcaatagcatcacaaatttcacaaataaag catttttttcactgcattctagttgtggtttgtccaaactcatcaatgta tcttatTTAGGCATGTCAGTTTTTAACCTAAGCAACGCCATTATGGGCAG TGGGATTTTGGGACTCGCCTTTGCCCTGGCAAACACTGGAATCCTACTTT TTCTGTGAGTATTGACGTGCCGGTACTTTCCATTTTAAAACTGAACTTTT TGTATGTTTCTGTTATTACATAGAAGAAATGTGAAATATTTATAAATAGT TTTTTTATTAACTTGGCTAAGTTACAAGTACATACTTTGATATTTATTGC TTGAGTGATTTTCCAAACTGTACCTAATGCTTACAACAAAGAGGAAGGAG AAAATCTTTTTATTAATCAAAAATACTGAAATAAGCATTTGTGAAAAGGA TTAAAACATTTGAAAATAACTTTTTTTGGTATTTTTGGAGTCTACCTGTG ACTTTAAATCTCAGTTAAAATATTAAGAGGTGTTTAACCCCAGCCAGTCA CCACTT (SEQ ID NO: 171) pARBI-952: ATCATTCTTCACACTAGTTCCTTTTCTGACTCTTCAGTCACCTCCAGAAT 1806 SMAD2_1_ TAATATCCTTAGAAGCATCTCTAAAATGAGTTTGATTTACTAAACATAGA Endogenous_58 AGTTGACAAAGTTTTTCAAGTATAACTGCATAAACTCTCTTTTTTAAAAA ATCAGTTTTTCCCAAGTTGTTAGCCTTTCATTGGCATTTGAGTCATTTAT GTGACATATTTATAAGAAACATCTGCTAGTGCTGCTGCATACTTTTATCA GATCCATTAAATAGTACATTTTTGTTCTTGATTCTCACTAAAACTAACTA AATGGCTGTCATTCTTTTCTTTATGCTTTTATTGTACTAATTTAGCCCAT TTGACTGCACttttttttttttttttttAAAGTTCCCTCCTTTCTTTTCC CCTTGCTTCCCAACAGGTCTCTTGATGGTCGTCTCCAGGTATCCCATCGA tccggatccggagagggcaggggatctctccttacttgtggcgacgtgga ggagaaccccggccccATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGG TGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTC AGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCTGACCCT GAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCG TGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCAC ATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCA GGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCG AGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGC ATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAA CTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCA TCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAG CTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCT GCTGCCCGACAACCACTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACC CCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCC GGGATCACTCTCGGCATGGACGAGCTGTACAAatgattgtttattgcagc ttataatggttacaaataaagcaatagcatcacaaatttcacaaataaag catttttttcactgcattctagttgtggtttgtccaaactcatcaatgta tcttatAAAGGATTGCCACATGTTATATATTGCCGATTATGGCGCTGGCC TGATCTTCACAGTCATCATGAACTCAAGGCAATTGAAAACTGCGAATATG CTTTTAATCTTAAAAAGGATGAAGTATGTGTAAACCCTTACCACTATCAG AGAGTTGAGACACCAGGTAGGAATATTGCAAGTTTTTTTCTTGGTTTTGA ATTAAATGCCTGAACTTCAGTATATTTAAGTACTCTTGTGACCCAGGAAA ATTTTAGTGTAAATTCTAATAAATTACCTTAAATTGTGCATATTATTTGG TGGTAATTAAAAtttttttatttttaaaaatttttAATGAGAAAACATAT CTGTTTCTTGGAATAGCATATGATTTGGCATGAAAGTGCATATCCAGGAA ATCCTGTAGATTGGCAGTCTGCTCAGCGCTATGCTGATGTGCTTCTACAT GTCCCT (SEQ ID NO: 172) pARBI-953: TTCTATATTAAAAATCAGTTGACATCTGTTTTTAGATAATTTTTTAAAAA 1806 SMAD2_2_ ACTGGATTATAAAATGAAGCACTTAGTAAAGCTGATGGCAGAGGTTTTAG Endogenous_59 ATCTTTTGAATACAGGAAAGTGGTAAGGGCATTATAGGTATGATTCATTG AGCTAGAGGGCGTTGGTGTTCACATTTTAAAAACATAATCATGACTTATC TGGAGTCACATGTTCCATTTTATTGGCTTCCTTCATATAGAAAATATAGT AACCAGCACTACATGCCTGTGAAAAAGTGAAGCAATTGTATATTTTCTGG GTGAAGGAAGTATTCTGTACATTCTCCATGTTTTACATCATGGTATTTTG AATAACCATGCTTCCATGTTCACATCAATTTTTTGTTTTTCAATTTATGC ACAGCACTTGCTCTGAAATTTGGGGACTGAGTACACCAAATACGATAGAT tccggatccggagagggcaggggatctctccttacttgtggcgacgtgga ggagaaccccggccccATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGG TGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTC AGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCTGACCCT GAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCG TGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCAC ATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCA GGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCG AGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGC ATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAA CTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCA TCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAG CTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCT GCTGCCCGACAACCACTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACC CCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCC GGGATCACTCTCGGCATGGACGAGCTGTACAAatgattgtttattgcagc ttataatggttacaaataaagcaatagcatcacaaatttcacaaataaag catttttttcactgcattctagttgtggtttgtccaaactcatcaatgta tcttatCAGTGGGATACAACAGGCCTTTACAGCTTCTCTGAACAAACCAG GTGAATATTCTGCCCTCTGTCGAATCTTAGAGATCTTGTGGGAGGGGGGT ATATTTTGAAAGACCTATATGGGGTTGCTTAGTATAATTTTGCCTAGGAT GTTTCCTTAATGTAAAATAAGGCATAGCATTTAAAATATCTGCTTGCCAG TATAAAAAATATATTAAATGTTTCAGCTGATGTTTTAATCAATGCAATTC TAGTTTGAATCTTCTTTAAATTACTGTATCCCCTAAAGAATAATAATTTT GATAACTATATATTTATTTTAGCTTGAGATCAGATTATAATCTGTTGTTG GCCTAATTTTTAAGTAGATACATGATGAGTTTTGCTAAATTTCTTGTTAT CAGAATCTCATCTTTATACTAATAAATACATACTTAATGAACCCCTTATA TCACAA (SEQ ID NO: 173) pARBI-954: TCAATGTATTTTTGTATAGTCTTTTGAGATTAGAGTGAAGTTCTAAAAAG 1806 SMAD2_3_ TAAGAGAACCTTTTGTGAAAATATTTTAATTGACACTACTAAGAGTTTAA Endogenous_60 AATAGTATTGGTGGTGAAAAATCGTTAAAAGTATCTGATTTTACTTGCAA AATTACTGTATTTTCCCACAAGAGGAGTCCTTACAACATTCTTGTTTTTA GAAGGGTTTGTTTCCATCATTCATTTAAATTCATAAAGATAACGTTTTCA TGGGTGGAGAAGTCTATTGGGAAAGTCATAATTCGAATTTCTATCTTGCT TTGCAGTTTGCTTTCTATGATAAGTTGAAATTATTACTTGATGTTCAAGG TAGTCTCTACATCATCCTTTCAATATTTCTGCTAGGTTCGATACAAGAGG CTGTTTTCCTAGCGTGGCTTGCTGCCTTTGGTAAGAACATGTCGTCCATC tccggatccggagagggcaggggatctctccttacttgtggcgacgtgga ggagaaccccggccccATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGG TGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTC AGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCTGACCCT GAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCG TGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCAC ATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCA GGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCG AGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGC ATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAA CTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCA TCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAG CTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCT GCTGCCCGACAACCACTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACC CCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCC GGGATCACTCTCGGCATGGACGAGCTGTACAAatgattgtttattgcagc ttataatggttacaaataaagcaatagcatcacaaatttcacaaataaag catttttttcactgcattctagttgtggtttgtccaaactcatcaatgta tcttatTTGCCATTCACGCCGCCAGTIGTGAAGAGACTGCTGGGATGGAA GAAGTCAGCTGGTGGGTCTGGAGGAGCAGGCGGAGGAGAGCAGAATGGGC AGGAAGAAAAGTGGTGTGAGAAAGCAGTGAAAAGTCTGGTGAAGAAGCTA AAGAAAACAGGACGATTAGATGAGCTTGAGAAAGCCATCACCACTCAAAA CTGTAATACTAAATGTGTTACCATACCAAGGTAAGTTTTGTTAGATCCCA GGTTTGATCAAATTATGTCAAGGAATCTGAAGGAAAGTTACTGAATTTGT GTTCCTTTCAAGTTGCCTGTAAAAAGTGATGATTGAAATATGACTGTTTT TAACCTTGTATAAATTGTTTTTGCTAGCTGACTTGTTTTATAAATTATTT TCTTGAATAGTGAGGTTTAATCAAGCTAAATAAGATACTTAGATTTATTA TCTTCA (SEQ ID NO: 174) pARBI-955: GGGCTCCCGCAGCACAGCCTTTCTCCGGCCCTAGCCCAAATCGCCCAGAC 1806 SOCS1_1_ CAGGCGCGGATCCCAGCCTGGCCAGCAGGCGGCGGGCGCGGGGGGCGAGC Endogenous_61 CGGGGCCGGACGGCTGGAGCCAGAACCGGCTGCTCTCCACGCCCCCCTCT CGGTGCTGCCCGGAGGCCGGACTCCGCCTCCACCGAGCCCCCACCCGCCG GGAAGAGCTCCGCGGAGTACAGAGCCCATTTTCTAGCTGTGTCCACTGAG GCTGAACGGATCCGCGCGGACTTGGTGCTCCGTGCTCGCCCCCTAGGGCC GGGTCCGCCGGGAGCGCCGCCCTCCGGAGTTGTCCGGCCGGCGCACACCT GCCCGGCCCCGCAGCGCCCCAGCTCACCTCTTTGTCTCTCCCGCAGCGCA CCCCCGGACGCTATGGCCCACCCCTCCGGCTGGCCCCTTCTGTAGGATGt ccggatccggagagggcaggggatctctccttacttgtggcgacgtggag gagaaccccggccccATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGT GGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCA GCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTG AAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGT GACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACA TGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAG GAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCGA GGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCA TCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAAC TACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCAT CAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAGC TCGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTG CTGCCCGACAACCACTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCC CAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCG GGATCACTCTCGGCATGGACGAGCTGTACAAatgattgtttattgcagct tataatggttacaaataaagcaatagcatcacaaatttcacaaataaagc atttttttcactgcattctagttgtggtttgtccaaactcatcaatgtat cttatGTAGCACACAACCAGGTGGCAGCCGACAATGCAGTCTCCACAGCA GCAGAGCCCCGACGGCGGCCAGAACCTTCCTCCTCTTCCTCCTCCTcgcc cgcggcccccgcgcgcccgcggccgtgccccgcggtcccggccccggCCC CCGGCGACACGCACTTCCGCACATTCCGTTCGCACGCCGATTACCGGCGC ATCACGCGCGCCAGCGCGCTCCTGGACGCCTGCGGATTCTACTGGGGGCC CCTGAGCGTGCACGGGGCGCACGAGCGGCTGCGCGCCGAGCCCGTGGGCA CCTTCCTGGTGCGCGACAGCCGCCAGCGGAACTGCTTTTTCGCCCTTAGC GTGAAGATGGCCTCGGGACCCACGAGCATCCGCGTGCACTTTCAGGCCGG CCGCTTTCACCTGGATGGCAGCCGCGAGAGCTTCGACTGCCTCTTCGAGC TGCTG (SEQ ID NO: 175) pARBI-956: GGATCCCAGCCTGGCCAGCAGGCGGCGGGCGCGGGGCGGCGAGCCGGGGC 1806 SOCS1_2_ CGGACGGCTGGAGCCAGAACCGGCTGCTCTCCACGCCCCCCTCTCGGTGC Endogenous_62 TGCCCGGAGGCCGGACTCCGCCTCCACCGAGCCCCCACCCGCCGGGAAGA GCTCCGCGGAGTACAGAGCCCATTTTCTAGCTGTGTCCACTGAGGCTGAA CGGATCCGCGCGGACTTGGTGCTCCGTGCTCGCCCCCTAGGGCCGGGTCC GCCGGGAGCGCCGCCCTCCGGAGTTGTCCGGCCGGCGCACACCTGCCCGG CCCCGCAGCGCCCCAGCTCACCTCTTTGTCTCTCCCGCAGCGCACCCCCG GACGCTATGGCCCACCCCTCCGGCTGGCCCCTTCTGTAGGATGGTAGCAC ACAACCAGGTGGCAGCCGACAATGCAGTCTCCACAGCAGCAGAGCCCCGA tccggatccggagagggcaggggatctctccttacttgtggcgacgtgga ggagaaccccggccccATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGG TGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTC AGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCTGACCCT GAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCG TGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCAC ATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCA GGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCG AGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGC ATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAA CTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCA TCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAG CTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCT GCTGCCCGACAACCACTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACC CCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCC GGGATCACTCTCGGCATGGACGAGCTGTACAAatgattgtttattgcagc ttataatggttacaaataaagcaatagcatcacaaatttcacaaataaag catttttttcactgcattctagttgtggtttgtccaaactcatcaatgta tcttatCGGCGGCCAGAACCTTCCTCCTCTTCCTCCTCCTcgcccgcggc ccccgcgcgcccgcggccgtgccccgcggtcccggccccggCCCCCGGCG ACACGCACTTCCGCACATTCCGTTCGCACGCCGATTACCGGCGCATCACG CGCGCCAGCGCGCTCCTGGACGCCTGCGGATTCTACTGGGGGCCCCTGAG CGTGCACGGGGCGCACGAGCGGCTGCGCGCCGAGCCCGTGGGCACCTTCC TGGTGCGCGACAGCCGCCAGCGGAACTGCTTTTTCGCCCTTAGCGTGAAG ATGGCCTCGGGACCCACGAGCATCCGCGTGCACTTTCAGGCCGGCCGCTT TCACCTGGATGGCAGCCGCGAGAGCTTCGACTGCCTCTTCGAGCTGCTGG AGCACTACGTGGCGGCGCCGCGCCGCATGCTGGGGGCCCCGCTGCGCCAG CGCCGC (SEQ ID NO: 176) pARBI-957: CGGACTCCGCCTCCACCGAGCCCCCACCCGCCGGGAAGAGCTCCGCGGAG 1806 SOCS1_3_ TACAGAGCCCATTTTCTAGCTGTGTCCACTGAGGCTGAACGGATCCGCGC Endogenous_63 GGACTTGGTGCTCCGTGCTCGCCCCCTAGGGCCGGGTCCGCCGGGAGCGC CGCCCTCCGGAGTTGTCCGGCCGGCGCACACCTGCCCGGCCCCGCAGCGC CCCAGCTCACCTCTTTGTCTCTCCCGCAGCGCACCCCCGGACGCTATGGC CCACCCCTCCGGCTGGCCCCTTCTGTAGGATGGTAGCACACAACCAGGTG GCAGCCGACAATGCAGTCTCCACAGCAGCAGAGCCCCGACGGCGGCCAGA ACCTTCCTCCTCTTCCTCCTCCTcgcccgcggcccccgcgcgcccgcggc cgtgccccgcggtcccggccccggCCCCCGGCGACACGCACTTCCGCACA tccggatccggagagggcaggggatctctccttacttgtggcgacgtgga ggagaaccccggccccATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGG TGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTC AGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCTGACCCT GAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCG TGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCAC ATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCA GGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCG AGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGC ATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAA CTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCA TCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAG CTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCT GCTGCCCGACAACCACTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACC CCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCC GGGATCACTCTCGGCATGGACGAGCTGTACAAatgattgtttattgcagc ttataatggttacaaataaagcaatagcatcacaaatttcacaaataaag catttttttcactgcattctagttgtggtttgtccaaactcatcaatgta tcttatTTCCGTTCGCACGCCGATTACCGGCGCATCACGCGCGCCAGCGC GCTCCTGGACGCCTGCGGATTCTACTGGGGGCCCCTGAGCGTGCACGGGG CGCACGAGCGGCTGCGCGCCGAGCCCGTGGGCACCTTCCTGGTGCGCGAC AGCCGCCAGCGGAACTGCTTTTTCGCCCTTAGCGTGAAGATGGCCTCGGG ACCCACGAGCATCCGCGTGCACTTTCAGGCCGGCCGCTTTCACCTGGATG GCAGCCGCGAGAGCTTCGACTGCCTCTTCGAGCTGCTGGAGCACTACGTG GCGGCGCCGCGCCGCATGCTGGGGGCCCCGCTGCGCCAGCGCCGCGTGCG GCCGCTGCAGGAGCTGTGCCGCCAGCGCATCGTGGCCACCGTGGGCCGCG AGAACCTGGCTCGCATCCCCCTCAACCCCGTCCTCCGCGACTACCTGAGC TCCTTC (SEQ ID NO: 177) pARBI-958: GGCAGACAAACAGTTAAAACAGAGGAGTGGGGTAAGTGATGACCTAACCC 1806 SRP14_1_ AGCCCCGCTGCTCTAGTTTTGTTAAACACCTCATCTGCTATCACGGTCTG Endogenous_64 GCGAAACCCTGGAGAAACTATTATTTCCACGACGGAAACATGTAATATCG AAGCACGTTAAATTCCACTCAAAGTGGCGGCGTCTGGGATCCCTGACAAA GAGCCCTGGGCTGCTTGGGGCCCATTGTTACCAGAAGCAACCTAGGCGGC TCAGTGCTGGGGACTGAGCCAGCTACAATCCCTACTATTTTCCGGGCCCG AAGCCCCGAATGTGCTCAGTACAGGGTGGGGAAAGGTAGAGGAAGGCGGC GGTCCGCGGCAGACAGACTCCGGTGGCTCCCAGACACCGGGAACCCAGGG AGCATCGCCCGGCTCCCCTCCCGCTGCTTACACTTCTTCAAGGTGATATA tccggatccggagagggcaggggatctctccttacttgtggcgacgtgga ggagaaccccggccccATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGG TGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTC AGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCTGACCCT GAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCG TGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCAC ATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCA GGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCG AGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGC ATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAA CTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCA TCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAG CTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCT GCTGCCCGACAACCACTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACC CCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCC GGGATCACTCTCGGCATGGACGAGCTGTACAAatgattgtttattgcagc ttataatggttacaaataaagcaatagcatcacaaatttcacaaataaag catttttttcactgcattctagttgtggtttgtccaaactcatcaatgta tcttatGACGCTGCCCGACGTCCGGCACTTCTGGAAAAGTCTGGTCAGCT CCGTCAGGAACTGGAAAACAACAGGCCCAGCCCCGTTAGCCAGCCCCAAA TCCTCGTCCTGCCGCGTCAAGGCCCTGGTCCTCCTGCAGGAGGCACAGGT CTCGAGTAACGCCTGAGCCGCCCCCTTCCCTCGGCCGGCCAGGCCTAGCC ATACCTGCTCGCTCTCCAACAACACCATCGCGGCGACGCTGGCTCGACTC CCTCCGCTTAAGCCCCTAGCAGTGAGAGCCGGAAGTTCGGCCTAGGCTGG GGGGACTTCCGCTACTAGACTTCCATTGTCTTCCACCAATCTCTTCTCTT CCACCAATCCCGGCCTGCGCCCTCCCCCCTCCCCGCCCGCCTAGCGCCCG CGCCCTGGGACGTCCGGGGGCCTGTGACCCGGAGGCGCTGGGGCTGTCCT GGGTC (SEQ ID NO: 178) pARBI-959: GTCTGTCTGCCGCGGACCGCCGCCTTCCTCTACCTTTCCCCACCCTGTAC 1806 SRP14_2_ TGAGCACATTCGGGGCTTCGGGCCCGGAAAATAGTAGGGATTGTAGCTGG Endogenous_ CTCAGTCCCCAGCACTGAGCCGCCTAGGTTGCTTCTGGTAACAATGGGCC 65_66 CCAAGCAGCCCAGGGCTCTTTGTCAGGGATCCCAGACGCCGCCACTTTGA GTGGAATTTAACGTGCTTCGATATTACATGTTTCCGTCGTGGAAATAATA GTTTCTCCAGGGTTTCGCCAGACCGTGATAGCAGATGAGGTGTTTAACAA AACTAGAGCAGCGGGGCTGGGTTAGGTCATCACTTACCCCACTCCTCTGT TTTAACTGTTTGTCTGCCGCTCAGCCACGTACGCGGCCGTGTTCACCAGG ATGCATTTTTTCCTTCAGATGACGGTCGAACCAAACCCATTCCAAAGAAG tccggatccggagagggcaggggatctctccttacttgtggcgacgtgga ggagaaccccggccccATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGG TGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTC AGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCTGACCCT GAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCG TGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCAC ATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCA GGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCG AGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGC ATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAA CTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCA TCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAG CTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCT GCTGCCCGACAACCACTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACC CCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCC GGGATCACTCTCGGCATGGACGAGCTGTACAAatgattgtttattgcagc ttataatggttacaaataaagcaatagcatcacaaatttcacaaataaag catttttttcactgcattctagttgtggtttgtccaaactcatcaatgta tcttatGGTACTGTGGAGGGCTTTGAGCCCGCAGACAACAAGTGTCTGTT AAGAGCTACCGATGGGAAGAAGAAGATCAGCACTGTGGTGAGCTTAATTT CTAAGGCTGTCTTTTGAAATGTAAAGACTTGAACTTAACAGAGGATGGGG CGTTTCTGAACCAGGCTTTTATTTGTTTTTCCCTTTTGCCCTGTGTGGCT ATTTTTGAGACCAGGACCTTCCTATACTTTAGTAGTGGAAACCTCAAGAA TAAAATAAGAAGGTAGAGGTCAGACAGTCGTTAGTTCTGCTAAAGCTCTT GTGGAAATGAAAGTAGCATATTGGTCCTTATTCGTAGTACTGTAAGGAGC AGCTGGCATAAATATTTGATTCCTTAGCCCCTTACTGTGCTGGGCCTTCA ATAAGCAGTTTCTTGGTACCAGGTGATGCTATTATTTCCTTATCTTTGTG GTTCAC (SEQ ID NO: 179) pARBI-960: CCCACGTCAGTGAGTGCTCACAAGTCTGTGAGGTAGGCGGCGCCACAGAG 1806 SRSF9_1_ AGAAACTGAGGCGCGGGAGCCCAAGCCACTTGTCCAGCGCCTCAGCCGAG Endogenous_67 CGCGAACCGCGCTCGGGGATGGCATCCACACGGCCCGGCCCCAGGCTCTC CCTGTCAGCGCCCGAAGGCCCTTCGCACCTCCAGGGGGCGCCGGCCTGCG CGCACGCGCAATGGTCGCAGCCGCGTTCTCTTTAAGAGGACTCCTTTTGC CTCCGCCGACCCCTTCGCTTCCGCTCCGCGTTCCCACAATGCAGTGCGGC TGAGCGCCTCGGAGCCCGCGGGGACGCTGCGGGGGGACCCGTGCTGAggc ggcggcggcgacgtgggctgcggcgggcccgcggcgtcgggcggtgcgga tgtcgggctgggcggacgagcgcggcggcgAGGGCGACGGGCGCATCTAC tccggatccggagagggcaggggatctctccttacttgtggcgacgtgga ggagaaccccggccccATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGG TGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTC AGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCTGACCCT GAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCG TGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCAC ATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCA GGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCG AGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGC ATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAA CTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCA TCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAG CTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCT GCTGCCCGACAACCACTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACC CCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCC GGGATCACTCTCGGCATGGACGAGCTGTACAAatgattgtttattgcagc ttataatggttacaaataaagcaatagcatcacaaatttcacaaataaag catttttttcactgcattctagttgtggtttgtccaaactcatcaatgta tcttatGTGGGGAACCTTCCGACCGACGTGCGCGAGAAGGACTTGGAGGA CCTGTTCTACAAGTACGGCCGCATCCGCGAGATCGAGCTCAAGAACCGGC ACGGCCTCGTGCCCTTCGCCTTCGTGCGCTTCGAGGACCCCCGGTGAGGc ccccgcgcccctgccctctcctcctcggtgcctgaggcccccgccttccc tgctgtcccctcccccagggctcccctccccccggcctcccctccccccg cctccgcgcagacccctcaCGGCGCCCCCTCACGGGTGGAGGATGAGGCA GCCTCTCCTCGCAGGCCCGGGCCGTCCTTCGCGCCGTCGTCACTTCCTTT ATTTTTATTATTCCAATATTTTACTTAGAAACCCAAAAGCTGAGCCTTTG GAGGGCCCAAGCCCGCCTGCACCGGCCTCGGGGGGTCCAAATGAGCCTTG TCCGCC (SEQ ID NO: 180) pARBI-961: TGTGGTGTGCTTTTCGAGAAAATGCTCACGAAGATTAGAAAAACACTGAT 1806 SRSF9_2_ GGTTTATTGGCAAAATCAATAATCTGTCAGCAGTTGATTCttttttttCT Endogenous_68 CTGAATTAGCCCTCTTATGAGTAATCTGTTTGCTCATTCATTATAATTTC ATACCTCTAAAACTGCGTGTGACAGCTGTAAAGGTTAATTCCAGTATCAT GAAACGTCTCCAAACCAAAGCAGAAGTGCTTCAGGATCCTGATTTGTGTG TTTTTTTCTTCACTCTAGGTTTCCCTTTTATGCTTACTACTCATGCCCCT CACTTGGAAAGTCACTTGGCCTCCTGAACAGCACTAACTCCAAACGtttt ttttgttgttgttgttttttttAGAGATGCAGAGGATGCTATTTATGGAA GAAATGGTTATGATTATGGCCAGTGTCGGCTTCGTGTGGAGTTCCCCAGG tccggatccggagagggcaggggatctctccttacttgtggcgacgtgga ggagaaccccggccccATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGG TGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTC AGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCTGACCCT GAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCG TGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCAC ATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCA GGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCG AGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGC ATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAA CTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCA TCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAG CTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCT GCTGCCCGACAACCACTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACC CCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCC GGGATCACTCTCGGCATGGACGAGCTGTACAAatgattgtttattgcagc ttataatggttacaaataaagcaatagcatcacaaatttcacaaataaag catttttttcactgcattctagttgtggtttgtccaaactcatcaatgta tcttatACTTATGGAGGTCGGGGTGGGTGGCCCCGTGGTGGGAGGAATGG GCCTCCTACAAGAAGATCTGATTTCCGAGTTCTTGTTTCAGGTATGTTCC TTTCAAACAGaatgagatgatacatgtaaaatacttaacacagagtctgt cttccaagaaatgatagctgttattcttCAGTGCATGGGACACGGGGGCT TTCTTTTCAATAGCCTGTGTGAAGCCTTGCCCTGGATTGCCAATGAGGAA AGTATCCTGCAAATGAAATTGCGCTGGGAGTGCAGCCTTGGAAGAACATA ACCATATTTCTTGTAAAGGAGTTTTCTAGTGGTGAGAAGGAAAGATGATG GGAAAACTTGAGCTACAATTCTAAAGATGCTTCTTTTGGAATATACTTGG CATCAGACATGGTAGAAAGGCATTCAAGGAGCCAGATTTGAACAACTTAC CCAGCC (SEQ ID NO: 181) pARBI-962: GATGGCATCCACACGGCCCGGCCCCAGGCTCTCCCTGTCAGCGCCCGAAG 1806 SRSF9_3_ GCCCTTCGCACCTCCAGGGGGCGCCGGCCTGCGCGCACGCGCAATGGTCG Endogenous_69 CAGCCGCGTTCTCTTTAAGAGGACTCCTTTTGCCTCCGCCGACCCCTTCG CTTCCGCTCCGCGTTCCCACAATGCAGTGCGGCTGAGCGCCTCGGAGCCC GCGGGGACGCTGCGGGGGGACCCGTGCTGAggcggcggcggcgacgtggg ctgcggcgggcccgcggcgtcgggcggtgcggatgtcgggctgggcggac gagcgcggcggcgAGGGCGACGGGCGCATCTACGTGGGGAACCTTCCGAC CGACGTGCGCGAGAAGGACTTGGAGGACCTGTTCTACAAGTACGGCCGCA TCCGCGAGATCGAGCTCAAGAACCGGCACGGCCTCGTGCCCTTCGCCTTC tccggatccggagagggcaggggatctctccttacttgtggcgacgtgga ggagaaccccggccccATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGG TGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTC AGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCTGACCCT GAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCG TGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCAC ATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCA GGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCG AGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGC ATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAA CTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCA TCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAG CTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCT GCTGCCCGACAACCACTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACC CCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCC GGGATCACTCTCGGCATGGACGAGCTGTACAAatgattgtttattgcagc ttataatggttacaaataaagcaatagcatcacaaatttcacaaataaag catttttttcactgcattctagttgtggtttgtccaaactcatcaatgta tcttatGTGCGCTTCGAGGACCCCCGGTGAGGcccccgcgcccctgccct ctcctcctcggtgcctgaggcccccgccttccctgctgtcccctccccca gggctcccctccccccggcctcccctccccccgcctcegcgcagacccct caCGGCGCCCCCTCACGGGTGGAGGATGAGGCAGCCTCTCCTCGCAGGCC CGGGCCGTCCTTCGCGCCGTCGTCACTTCCTTTATTTTTATTATTCCAAT ATTTTACTTAGAAACCCAAAAGCTGAGCCTTTGGAGGGCCCAAGCCCGCC TGCACCGGCCTCGGGGGGTCCAAATGAGCCTTGTCCGCCTCCTGCCTGGG GCAGCACCGTAGGGGGAAGCGGCCGCGGGGCAGCGCGGGGGTCGCCGTTC GCCCTTCCCGCTCGCCTCTCCCCCGGCCCGTGCTCGCCGTGGCTGGAGAG CAAGCT (SEQ ID NO: 182) pARBI-963: GTAACTGACTAGCCTTAGTAGACTGTGGTTTCCAGGTTTATTCAGAAGTA 1806 SUB1_1_ GCAAGATCCCTCCAtttttttttCTACCAAAGAAATCGTATGTGGGATCC Endogenous_70 CAAACCACAAAATAACCGTTCCTGTGGTTAATACTACTATAATGCCTGAA GTGTCTTTTGGGATCCTGAGAACAGAGTTTGAAAACATTACTAGACAGAA GGATTGGTTAGATTCATAGTTTTGTTGTTGAGTGAAACTTGCTTATGTAT ATATTTATGATATTTTGGATGTAGTCTTTTGATTGTTTAAATCTTAAAAA GTAATGGGATCTTTTGACACTGGGGTATGTTTTATTTTTATGTGTGCAAA TTTTAACCATATTCTTTTCTAGTTAAAGAGGAAAAAGCAAGTTGCTCCAG AAAAACCTGTAAAGAAACAAAAGACAGGTGAGACTTCGAGAGCCCTGTCA tccggatccggagagggcaggggatctctccttacttgtggcgacgtgga ggagaaccccggccccATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGG TGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTC AGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCTGACCCT GAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCG TGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCAC ATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCA GGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCG AGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGC ATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAA CTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCA TCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAG CTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCT GCTGCCCGACAACCACTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACC CCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCC GGGATCACTCTCGGCATGGACGAGCTGTACAAatgattgtttattgcagc ttataatggttacaaataaagcaatagcatcacaaatttcacaaataaag catttttttcactgcattctagttgtggtttgtccaaactcatcaatgta tcttatTCTTCTAAACAGAGCAGCAGCAGCAGAGATGATAACATGTTTCA GGTAAAGTTGGCTAttttttttttttttttttttgacatggagtcatgct ctgtcacccaggctggagtgcagtggcgccatctcggctcactgcaacct cagcctcctgagttcaagcagttctctgcctcagcctcccgagtagctag gattacaggcatccgccaccagacctggctaatttttgtatttttagtag agatggggtttcaccatcttggccaggctggtcttgaactcctgaccttg tgatccaactgcctcagcctccaaaagtgctgggtttacaggtgtgagcc accatgccttgccAAAGTTGGCTGTTTCTTTAGATTCAGAGGAATTATTA TCTGGCTTGATCTGAAGAATGTTAAAAGTACTATGATCTGATAATTGCCT AATATG (SEQ ID NO: 183) pARBI-964: ATAAATAAACATATCCTGGAAATGGAATAAGTTGGTTATATTCTTTTTAA 1806 SUB1_2_ TTAGTATATCTGCTTCGTAAAATAAGTAACTGACTAGCCTTAGTAGACTG Endogenous_71 TGGTTTCCAGGTTTATTCAGAAGTAGCAAGATCCCTCCATTTTTTTTTCT ACCAAAGAAATCGTATGTGGGATCCCAAACCACAAAATAACCGTTCCTGT GGTTAATACTACTATAATGCCTGAAGTGTCTTTTGGGATCCTGAGAACAG AGTTTGAAAACATTACTAGACAGAAGGATTGGTTAGATTCATAGTTTTGT TGTTGAGTGAAACTTGCTTATGTATATATTTATGATATTTTGGATGTAGT CTTTTGATTGTTTAAATCTTAAAAAGTAATGGGATCTTTTGACACTGGGG TATGTTTTATTTTTATGTGTGCAAATTTTAACCATATTCTTTTCTAGTTA tccggatccggagagggcaggggatctctccttacttgtggcgacgtgga ggagaaccccggccccATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGG TGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTC AGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCTGACCCT GAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCG TGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCAC ATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCA GGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCG AGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGC ATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAA CTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCA TCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAG CTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCT GCTGCCCGACAACCACTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACC CCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCC GGGATCACTCTCGGCATGGACGAGCTGTACAAatgattgtttattgcagc ttataatggttacaaataaagcaatagcatcacaaatttcacaaataaag catttttttcactgcattctagttgtggtttgtccaaactcatcaatgta tcttatAAGAGGAAAAAGCAAGTTGCTCCAGAAAAACCTGTAAAGAAACA AAAGACAGGTGAGACTTCGAGAGCCCTGTCATCTTCTAAACAGAGCAGCA GCAGCAGAGATGATAACATGTTTCAGGTAAAGTTGGCTATTTTTTTTTTT TTTTTTTTTGACATGGAGTCATGCTCTGTCACCCAGGCTGGAGTGCAGTG GCGCCATCTCGGCTCACTGCAACCTCAGCCTCCTGAGTTCAAGCAGTTCT CTGCCTCAGCCTCCCGAGTAGCTAGGATTACAGGCATCCGCCACCAGACC TGGCTAATTTTTGTATTTTTAGTAGAGATGGGGTTTCACCATCTTGGCCA GGCTGGTCTTGAACTCCTGACCTTGTGATCCAACTGCCTCAGCCTCCAAA AGTGCTGGGTTTACAGGTGTGAGCCACCATGCCTTGCCAAAGTTGGCTGT TTCTTT (SEQ ID NO: 184) pARBI-965: AATTAGTATATCTGCTTCGTAAAATAAGTAACTGACTAGCCTTAGTAGAC 1806 SUB1_3_ TGTGGTTTCCAGGTTTATTCAGAAGTAGCAAGATCCCTCCATTTTTTTTT Endogenous_72 CTACCAAAGAAATCGTATGTGGGATCCCAAACCACAAAATAACCGTTCCT GTGGTTAATACTACTATAATGCCTGAAGTGTCTTTTGGGATCCTGAGAAC AGAGTTTGAAAACATTACTAGACAGAAGGATTGGTTAGATTCATAGTTTT GTTGTTGAGTGAAACTTGCTTATGTATATATTTATGATATTTTGGATGTA GTCTTTTGATTGTTTAAATCTTAAAAAGTAATGGGATCTTTTGACACTGG GGTATGTTTTATTTTTATGTGTGCAAATTTTAACCATATTCTTTTCTAGT TAAAGAGGAAAAAGCAAGTTGCTCCAGAAAAACCTGTAAAGAAACAAAAG tccggatccggagagggcaggggatctctccttacttgtggcgacgtgga ggagaaccccggccccATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGG TGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTC AGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCTGACCCT GAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCG TGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCAC ATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCA GGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCG AGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGC ATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAA CTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCA TCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAG CTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCT GCTGCCCGACAACCACTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACC CCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCC GGGATCACTCTCGGCATGGACGAGCTGTACAAatgattgtttattgcagc ttataatggttacaaataaagcaatagcatcacaaatttcacaaataaag catttttttcactgcattctagttgtggtttgtccaaactcatcaatgta tcttatACAGGTGAGACTTCGAGAGCCCTGTCATCTTCTAAACAGAGCAG CAGCAGCAGAGATGATAACATGTTTCAGGTAAAGTTGGCTATTTTTTTTT TTTTTTTTTTTGACATGGAGTCATGCTCTGTCACCCAGGCTGGAGTGCAG TGGCGCCATCTCGGCTCACTGCAACCTCAGCCTCCTGAGTTCAAGCAGTT CTCTGCCTCAGCCTCCCGAGTAGCTAGGATTACAGGCATCCGCCACCAGA CCTGGCTAATTTTTGTATTTTTAGTAGAGATGGGGTTTCACCATCTTGGC CAGGCTGGTCTTGAACTCCTGACCTTGTGATCCAACTGCCTCAGCCTCCA AAAGTGCTGGGTTTACAGGTGTGAGCCACCATGCCTTGCCAAAGTTGGCT GTTTCTTTAGATTCAGAGGAATTATTATCTGGCTTGATCTGAAGAATGTT AAAAGT (SEQ ID NO: 185) pARBI-966: CACATTTTAATTTTTGTTTCCATGCTCTTTAGAATTCAACTAGAGGGCAG 1806 TET2_1_ CCTTGTGGATGGCCCCGAAGCAAGCCTGATGGAACAGGATAGAACCAACC Endogenous_73 ATGTTGAGGGCAACAGACTAAGTCCATTCCTGATACCATCACCTCCCATT TGCCAGACAGAACCTCTGGCTACAAAGCTCCAGAATGGAAGCCCACTGCC TGAGAGAGCTCATCCAGAAGTAAATGGAGACACCAAGTGGCACTCTTTCA AAAGTTATTATGGAATACCCTGTATGAAGGGAAGCCAGAATAGTCGTGTG AGTCCTGACTTTACACAAGAAAGTAGAGGGTATTCCAAGTGTTTGCAAAA TGGAGGAATAAAACGCACAGTTAGTGAACCTTCTCTCTCTGGGCTCCTTC AGATCAAGAAATTGAAACAAGACCAAAAGGCTAATGGAGAAAGACGTAAC tccggatccggagagggcaggggatctctccttacttgtggcgacgtgga ggagaaccccggccccATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGG TGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTC AGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCTGACCCT GAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCG TGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCAC ATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCA GGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCG AGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGC ATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAA CTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCA TCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAG CTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCT GCTGCCCGACAACCACTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACC CCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCC GGGATCACTCTCGGCATGGACGAGCTGTACAAatgattgtttattgcagc ttataatggttacaaataaagcaatagcatcacaaatttcacaaataaag catttttttcactgcattctagttgtggtttgtccaaactcatcaatgta tcttatTTCGGGGTAAGCCAAGAAAGAAATCCAGGTGAAAGCAGTCAACC AAATGTCTCCGATTTGAGTGATAAGAAAGAATCTGTGAGTTCTGTAGCCC AAGAAAATGCAGTTAAAGATTTCACCAGTTTTTCAACACATAACTGCAGT GGGCCTGAAAATCCAGAGCTTCAGATTCTGAATGAGCAGGAGGGGAAAAG TGCTAATTACCATGACAAGAACATTGTATTACTTAAAAACAAGGCAGTGC TAATGCCTAATGGTGCTACAGTTTCTGCCTCTTCCGTGGAACACACACAT GGTGAACTCCTGGAAAAAACACTGTCTCAATATTATCCAGATTGTGTTTC CATTGCGGTGCAGAAAACCACATCTCACATAAATGCCATTAACAGTCAGG CTACTAATGAGTTGTCCTGTGAGATCACTCACCCATCGCATACCTCAGGG CAGATC (SEQ ID NO: 186) pARBI-967: AATGGAGAAAGACGTAACTTCGGGGTAAGCCAAGAAAGAAATCCAGGTGA 1806 TET2_2_ AAGCAGTCAACCAAATGTCTCCGATTTGAGTGATAAGAAAGAATCTGTGA Endogenous_74 GTTCTGTAGCCCAAGAAAATGCAGTTAAAGATTTCACCAGTTTTTCAACA CATAACTGCAGTGGGCCTGAAAATCCAGAGCTTCAGATTCTGAATGAGCA GGAGGGGAAAAGTGCTAATTACCATGACAAGAACATTGTATTACTTAAAA ACAAGGCAGTGCTAATGCCTAATGGTGCTACAGTTTCTGCCTCTTCCGTG GAACACACACATGGTGAACTCCTGGAAAAAACACTGTCTCAATATTATCC AGATTGTGTTTCCATTGCGGTGCAGAAAACCACATCTCACATAAATGCCA TTAACAGTCAGGCTACTAATGAGTTGTCCTGTGAGATCACTCACCCATCG tccggatccggagagggcaggggatctctccttacttgtggcgacgtgga ggagaaccccggccccATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGG TGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTC AGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCTGACCCT GAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCG TGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCAC ATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCA GGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCG AGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGC ATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAA CTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCA TCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAG CTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCT GCTGCCCGACAACCACTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACC CCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCC GGGATCACTCTCGGCATGGACGAGCTGTACAAatgattgtttattgcagc ttataatggttacaaataaagcaatagcatcacaaatttcacaaataaag catttttttcactgcattctagttgtggtttgtccaaactcatcaatgta tcttatCATACCTCAGGGCAGATCAATTCCGCACAGACCTCTAACTCTGA GCTGCCTCCAAAGCCAGCTGCAGTGGTGAGTGAGGCCTGTGATGCTGATG ATGCTGATAATGCCAGTAAACTAGCTGCAATGCTAAATACCTGTTCCTTT CAGAAACCAGAACAACTACAACAACAAAAATCAGTTTTTGAGATATGCCC ATCTCCTGCAGAAAATAACATCCAGGGAACCACAAAGCTAGCGTCTGGTG AAGAATTCTGTTCAGGTTCCAGCAGCAATTTGCAAGCTCCTGGTGGCAGC TCTGAACGGTATTTAAAACAAAATGAAATGAATGGTGCTTACTTCAAGCA AAGCTCAGTGTTCACTAAGGATTCCTTTTCTGCCACTACCACACCACCAC CACCATCACAATTGCTTCTTTCTCCCCCTCCTCCTCTTCCACAGGTTCCT CAGCTT (SEQ ID NO: 187) pARBI-968: GGAAAAAGCACTCTGAATGGTGGAGTTTTAGAAGAACACCACCACTACCC 1806 TET2_3_ CAACCAAAGTAACACAACACTTTTAAGGGAAGTGAAAATAGAGGGTAAAC Endogenous_75 CTGAGGCACCACCTTCCCAGAGTCCTAATCCATCTACACATGTATGCAGC CCTTCTCCGATGCTTTCTGAAAGGCCTCAGAATAATTGTGTGAACAGGAA TGACATACAGACTGCAGGGACAATGACTGTTCCATTGTGTTCTGAGAAAA CAAGACCAATGTCAGAACACCTCAAGCATAACCCACCAATTTTTGGTAGC AGTGGAGAGCTACAGGACAACTGCCAGCAGTTGATGAGAAACAAAGAGCA AGAGATTCTGAAGGGTCGAGACAAGGAGCAAACACGAGATCTTGTGCCCC CAACACAGCACTATCTGAAACCAGGATGGATTGAATTGAAGGCCCCTCGT tccggatccggagagggcaggggatctctccttacttgtggcgacgtgga ggagaaccccggccccATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGG TGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTC AGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCTGACCCT GAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCG TGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCAC ATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCA GGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCG AGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGC ATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAA CTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCA TCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAG CTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCT GCTGCCCGACAACCACTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACC CCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCC GGGATCACTCTCGGCATGGACGAGCTGTACAAatgattgtttattgcagc ttataatggttacaaataaagcaatagcatcacaaatttcacaaataaag catttttttcactgcattctagttgtggtttgtccaaactcatcaatgta tcttatTTTCACCAAGCGGAATCCCATCTAAAACGTAATGAGGCATCACT GCCATCAATTCTTCAGTATCAACCCAATCTCTCCAATCAAATGACCTCCA AACAATACACTGGAAATTCCAACATGCCTGGGGGGCTCCCAAGGCAAGCT TACACCCAGAAAACAACACAGCTGGAGCACAAGTCACAAATGTACCAAGT TGAAATGAATCAAGGGCAGTCCCAAGGTACAGTGGACCAACATCTCCAGT TCCAAAAACCCTCACACCAGGTGCACTTCTCCAAAACAGACCATTTACCA AAAGCTCATGTGCAGTCACTGTGTGGCACTAGATTTCATTTTCAACAAAG AGCAGATTCCCAAACTGAAAAACTTATGTCCCCAGTGTTGAAACAGCACT TGAATCAACAGGCTTCAGAGACTGAGCCATTTTCAAACTCACACCTTTTG CAACAT (SEQ ID NO: 188) pARBI-969: AAAGGGAGGTTGAGATGGGCTGAGGTCTTCTAGGAGGCGGGGAGGGAGTG 1806 TIGIT_1_ CAGCCTTGACAAGCCTCCCTGTGGGCGAGGTGTAAAGAGGGGCAGAAGTC Endogenous_76 AGCTCTGGAAGCATAGGGCAGTGGGTGGGGAGGAGATGGGCTGGGCTGGG CTGGAGTAGAGATGTGTGGAGAAGGGTGAGAAGACTGGAAAGACAACCTG AATGGGGGACTGGGAGCCTTGAATAACAGGCATGGAGAGGAGCGTCTCTT GAAATGGAAGAAACAGGAAATAATTACAGCCTCTATGGAGGAGCAACAGG ATGGACTGGAGAAACTATCATTCCAAAATCCAGTTGGGGCCTCAAAGGCC CTTAGAATTTTTCTAGGAAGGTTGAAGGCCAGCTGCTGACCCAGGACTCA CATGTGCTTCGTCCTCTTCCCTAGGAATGATGACAGGCACAATAGAAACA tccggatccggagagggcaggggatctctccttacttgtggcgacgtgga ggagaaccccggccccATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGG TGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTC AGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCTGACCCT GAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCG TGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCAC ATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCA GGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCG AGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGC ATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAA CTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCA TCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAG CTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCT GCTGCCCGACAACCACTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACC CCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCC GGGATCACTCTCGGCATGGACGAGCTGTACAAatgattgtttattgcagc ttataatggttacaaataaagcaatagcatcacaaatttcacaaataaag catttttttcactgcattctagttgtggtttgtccaaactcatcaatgta tcttatACGGGGAACATTTCTGCAGAGAAAGGTGGCTCTATCATCTTACA ATGTCACCTCTCCTCCACCACGGCACAAGTGACCCAGGTCAACTGGGAGC AGCAGGACCAGCTTCTGGCCATTTGTAATGCTGACTTGGGGTGGCACATC TCCCCATCCTTCAAGGATCGAGTGGCCCCAGGTCCCGGCCTGGGCCTCAC CCTCCAGTCGCTGACCGTGAACGATACAGGGGAGTACTTCTGCATCTATC ACACCTACCCTGATGGGACGTACACTGGGAGAATCTTCCTGGAGGTCCTA GAAAGCTCAGGTATTCCTGCTGGAGCAAGTTGGTGGATAAACCTCTCCCT CTAGCATAGAAAATGCAATCCTGAAACACTGCACAGCAGGGCTTCTCAAT TCGGGATCACATTTGAATCACCTGAGGAGATTTTAAATCATACTGATGCC GAGGCC (SEQ ID NO: 189) pARBI-970: GAGGAGATGGGCTGGGCTGGGCTGGAGTAGAGATGTGTGGAGAAGGGTGA 1806 TIGIT_2_ GAAGACTGGAAAGACAACCTGAATGGGGGACTGGGAGCCTTGAATAACAG Endogenous_77 GCATGGAGAGGAGCGTCTCTTGAAATGGAAGAAACAGGAAATAATTACAG CCTCTATGGAGGAGCAACAGGATGGACTGGAGAAACTATCATTCCAAAAT CCAGTTGGGGCCTCAAAGGCCCTTAGAATTTTTCTAGGAAGGTTGAAGGC CAGCTGCTGACCCAGGACTCACATGTGCTTCGTCCTCTTCCCTAGGAATG ATGACAGGCACAATAGAAACAACGGGGAACATTTCTGCAGAGAAAGGTGG CTCTATCATCTTACAATGTCACCTCTCCTCCACCACGGCACAAGTGACCC AGGTCAACTGGGAGCAGCAGGACCAGCTTCTGGCCATTTGTAATGCTGAC tccggatccggagagggcaggggatctctccttacttgtggcgacgtgga ggagaaccccggccccATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGG TGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTC AGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCTGACCCT GAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCG TGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCAC ATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCA GGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCG AGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGC ATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAA CTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCA TCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAG CTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCT GCTGCCCGACAACCACTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACC CCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCC GGGATCACTCTCGGCATGGACGAGCTGTACAAatgattgtttattgcagc ttataatggttacaaataaagcaatagcatcacaaatttcacaaataaag catttttttcactgcattctagttgtggtttgtccaaactcatcaatgta tcttatTTGGGGTGGCACATCTCCCCATCCTTCAAGGATCGAGTGGCCCC AGGTCCCGGCCTGGGCCTCACCCTCCAGTCGCTGACCGTGAACGATACAG GGGAGTACTTCTGCATCTATCACACCTACCCTGATGGGACGTACACTGGG AGAATCTTCCTGGAGGTCCTAGAAAGCTCAGGTATTCCTGCTGGAGCAAG TTGGTGGATAAACCTCTCCCTCTAGCATAGAAAATGCAATCCTGAAACAC TGCACAGCAGGGCTTCTCAATTCGGGATCACATTTGAATCACCTGAGGAG ATTTTAAATCATACTGATGCCGAGGCCTCACCCAGACCAATTCAATCAGA ATCCCTAATAGCAGAGCTAAACAAGGGTAAGGTCTAAAAGCATTTCCAGG TGATTCTAATGGGCAGCCAATACTGAGAACCACTGTTCTTATGTAAGAAG CACATC (SEQ ID NO: 190) pARBI-971: CCTCCCTGTGGGCGAGGTGTAAAGAGGGGCAGAAGTCAGCTCTGGAAGCA 1806 TIGIT_3_ TAGGGCAGTGGGTGGGGAGGAGATGGGCTGGGCTGGGCTGGAGTAGAGAT Endogenous_78 GTGTGGAGAAGGGTGAGAAGACTGGAAAGACAACCTGAATGGGGGACTGG GAGCCTTGAATAACAGGCATGGAGAGGAGCGTCTCTTGAAATGGAAGAAA CAGGAAATAATTACAGCCTCTATGGAGGAGCAACAGGATGGACTGGAGAA ACTATCATTCCAAAATCCAGTTGGGGCCTCAAAGGCCCTTAGAATTTTTC TAGGAAGGTTGAAGGCCAGCTGCTGACCCAGGACTCACATGTGCTTCGTC CTCTTCCCTAGGAATGATGACAGGCACAATAGAAACAACGGGGAACATTT CTGCAGAGAAAGGTGGCTCTATCATCTTACAATGTCACCTCTCCTCCACC tccggatccggagagggcaggggatctctccttacttgtggcgacgtgga ggagaaccccggccccATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGG TGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTC AGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCTGACCCT GAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCG TGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCAC ATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCA GGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCG AGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGC ATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAA CTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCA TCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAG CTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCT GCTGCCCGACAACCACTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACC CCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCC GGGATCACTCTCGGCATGGACGAGCTGTACAAatgattgtttattgcagc ttataatggttacaaataaagcaatagcatcacaaatttcacaaataaag catttttttcactgcattctagttgtggtttgtccaaactcatcaatgta tcttatACGGCACAAGTGACCCAGGTCAACTGGGAGCAGCAGGACCAGCT TCTGGCCATTTGTAATGCTGACTTGGGGTGGCACATCTCCCCATCCTTCA AGGATCGAGTGGCCCCAGGTCCCGGCCTGGGCCTCACCCTCCAGTCGCTG ACCGTGAACGATACAGGGGAGTACTTCTGCATCTATCACACCTACCCTGA TGGGACGTACACTGGGAGAATCTTCCTGGAGGTCCTAGAAAGCTCAGGTA TTCCTGCTGGAGCAAGTTGGTGGATAAACCTCTCCCTCTAGCATAGAAAA TGCAATCCTGAAACACTGCACAGCAGGGCTTCTCAATTCGGGATCACATT TGAATCACCTGAGGAGATTTTAAATCATACTGATGCCGAGGCCTCACCCA GACCAATTCAATCAGAATCCCTAATAGCAGAGCTAAACAAGGGTAAGGTC TAAAAG (SEQ ID NO: 191) pARBI-972: TAATGCCCAGCCTAAGTTGGGGAGACCACTCCAGATTCCAAGATGTACAG 1806 TRAC_1_ TTTGCTTTGCTGGGCCTTTTTCCCATGCCTGCCTTTACTCTGCCAGAGTT Endogenous_79 ATATTGCTGGGGTTTTGAAGAAGATCCTATTAAATAAAAGAATAAGCAGT ATTATTAAGTAGCCCTGCATTTCAGGTTTCCTTGAGTGGCAGGCCAGGCC TGGCCGTGAACGTTCACTGAAATCATGGCCTCTTGGCCAAGATTGATAGC TTGTGCCTGTCCCTGAGTCCCAGTCCATCACGAGCAGCTGGTTTCTAAGA TGCTATTTCCCGTATAAAGCATGAGACCGTGACTTGCCAGCCCCACAGAG CCCCGCCCTTGTCCATCACTGGCATCTGGACTCCAGCCTGGGTTGGGGCA AAGAGGGAAATGAGATCATGTCCTAACCCTGATCCTCTTGTCCCACAGAT tccggatccggagagggcaggggatctctccttacttgtggcgacgtgga ggagaaccccggccccATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGG TGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTC AGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCTGACCCT GAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCG TGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCAC ATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCA GGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCG AGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGC ATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAA CTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCA TCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAG CTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCT GCTGCCCGACAACCACTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACC CCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCC GGGATCACTCTCGGCATGGACGAGCTGTACAAatgattgtttattgcagc ttataatggttacaaataaagcaatagcatcacaaatttcacaaataaag catttttttcactgcattctagttgtggtttgtccaaactcatcaatgta tcttatATCCAGAACCCTGACCCTGCCGTGTACCAGCTGAGAGACTCTAA ATCCAGTGACAAGTCTGTCTGCCTATTCACCGATTTTGATTCTCAAACAA ATGTGTCACAAAGTAAGGATTCTGATGTGTATATCACAGACAAAACTGTG CTAGACATGAGGTCTATGGACTTCAAGAGCAACAGTGCTGTGGCCTGGAG CAACAAATCTGACTTTGCATGTGCAAACGCCTTCAACAACAGCATTATTC CAGAAGACACCTTCTTCCCCAGCCCAGGTAAGGGCAGCTTTGGTGCCTTC GCAGGCTGTTTCCTTGCTTCAGGAATGGCCAGGTTCTGCCCAGAGCTCTG GTCAATGATGTCTAAAACTCCTCTGATTGGTGGTCTCGGCCTTATCCATT GCCACCAAAACCCTCTTTTTACTAAGAAACAGTGAGCCTTGTTCTGGCAG TCCAGA (SEQ ID NO: 192) pARBI-973: GATTCCAAGATGTACAGTTTGCTTTGCTGGGCCTTTTTCCCATGCCTGCC 1806 TRAC_2_ TTTACTCTGCCAGAGTTATATTGCTGGGGTTTTGAAGAAGATCCTATTAA Endogenous_80 ATAAAAGAATAAGCAGTATTATTAAGTAGCCCTGCATTTCAGGTTTCCTT GAGTGGCAGGCCAGGCCTGGCCGTGAACGTTCACTGAAATCATGGCCTCT TGGCCAAGATTGATAGCTTGTGCCTGTCCCTGAGTCCCAGTCCATCACGA GCAGCTGGTTTCTAAGATGCTATTTCCCGTATAAAGCATGAGACCGTGAC TTGCCAGCCCCACAGAGCCCCGCCCTTGTCCATCACTGGCATCTGGACTC CAGCCTGGGTTGGGGCAAAGAGGGAAATGAGATCATGTCCTAACCCTGAT CCTCTTGTCCCACAGATATCCAGAACCCTGACCCTGCCGTGTACCAGCTG tccggatccggagagggcaggggatctctccttacttgtggcgacgtgga ggagaaccccggccccATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGG TGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTC AGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCTGACCCT GAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCG TGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCAC ATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCA GGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCG AGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGC ATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAA CTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCA TCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAG CTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCT GCTGCCCGACAACCACTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACC CCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCC GGGATCACTCTCGGCATGGACGAGCTGTACAAatgattgtttattgcagc ttataatggttacaaataaagcaatagcatcacaaatttcacaaataaag catttttttcactgcattctagttgtggtttgtccaaactcatcaatgta tcttatAGAGACTCTAAATCCAGTGACAAGTCTGTCTGCCTATTCACCGA TTTTGATTCTCAAACAAATGTGTCACAAAGTAAGGATTCTGATGTGTATA TCACAGACAAAACTGTGCTAGACATGAGGTCTATGGACTTCAAGAGCAAC AGTGCTGTGGCCTGGAGCAACAAATCTGACTTTGCATGTGCAAACGCCTT CAACAACAGCATTATTCCAGAAGACACCTTCTTCCCCAGCCCAGGTAAGG GCAGCTTTGGTGCCTTCGCAGGCTGTTTCCTTGCTTCAGGAATGGCCAGG TTCTGCCCAGAGCTCTGGTCAATGATGTCTAAAACTCCTCTGATTGGTGG TCTCGGCCTTATCCATTGCCACCAAAACCCTCTTTTTACTAAGAAACAGT GAGCCTTGTTCTGGCAGTCCAGAGAATGACACGGGAAAAAAGCAGATGAA GAGAAG (SEQ ID NO: 193) pARBI-974: TTCCTGTGGTAGCGGAACTCACTAAGGGGCCCATCTGGACCCGAGGTATT 1806 TRAC_3_ GTGATGATAAATTCTGAGCACCTACCCCATCCCCAGAAGGGCTCAGAAAT Endogenous_81 AAAATAAGAGCCAAGTCTAGTCGGTGTTTCCTGTCTTGAAACACAATACT GTTGGCCCTGGAAGAATGCACAGAATCTGTTTGTAAGGGGATATGCACAG AAGCTGCAAGGGACAGGAGGTGCAGGAGCTGCAGGCCTCCCCCACCCAGC CTGCTCTGCCTTGGGGAAAACCGTGGGTGTGTCCTGCAGGCCATGCAGGC CTGGGACATGCAAGCCCATAACCGCTGTGGCCTCTTGGTTTTACAGATAC GAACCTAAACTTTCAAAACCTGTCAGTGATTGGGTTCCGAATCCTCCTCC TGAAAGTGGCCGGGTTTAATCTGCTCATGACGCTGCGGCTGTGGTCCAGC tccggatccggagagggcaggggatctctccttacttgtggcgacgtgga ggagaaccccggccccATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGG TGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTC AGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCTGACCCT GAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCG TGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCAC ATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCA GGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCG AGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGC ATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAA CTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCA TCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAG CTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCT GCTGCCCGACAACCACTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACC CCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCC GGGATCACTCTCGGCATGGACGAGCTGTACAAatgattgtttattgcagc ttataatggttacaaataaagcaatagcatcacaaatttcacaaataaag catttttttcactgcattctagttgtggtttgtccaaactcatcaatgta tcttatTGAGGTGAGGGGCCTTGAAGCTGGGAGTGGGGTTTAGGGACGCG GGTCTCTGGGTGCATCCTAAGCTCTGAGAGCAAACCTCCCTGCAGGGTCT TGCTTTTAAGTCCAAAGCCTGAGCCCACCAAACTCTCCTACTTCTTCCTG TTACAAATTCCTCTTGTGCAATAATAATGGCCTGAAACGCTGTAAAATAT CCTCATTTCAGCCGCCTCAGTTGCACTTCTCCCCTATGAGGTAGGAAGAA CAGTTGTTTAGAAACGAAGAAACTGAGGCCCCACAGCTAATGAGTGGAGG AAGAGAGACACTTGTGTACACCACATGCCTTGTGTTGTACTTCTCTCACC GTGTAACCTCCTCATGTCCTCTCTCCCCAGTACGGCTCTCTTAGCTCAGT AGAAAGAAGACATTACACTCATATTACACCCCAATCCTGGCTAGAGTCTC CGCACC (SEQ ID NO: 194) pARBI-975: ATTATTAAGTAGCCCTGCATTTCAGGTTTCCTTGAGTGGCAGGCCAGGCC 1806 TRAC_4_ TGGCCGTGAACGTTCACTGAAATCATGGCCTCTTGGCCAAGATTGATAGC Endogenous_82 TTGTGCCTGTCCCTGAGTCCCAGTCCATCACGAGCAGCTGGTTTCTAAGA TGCTATTTCCCGTATAAAGCATGAGACCGTGACTTGCCAGCCCCACAGAG CCCCGCCCTTGTCCATCACTGGCATCTGGACTCCAGCCTGGGTTGGGGCA AAGAGGGAAATGAGATCATGTCCTAACCCTGATCCTCTTGTCCCACAGAT ATCCAGAACCCTGACCCTGCCGTGTACCAGCTGAGAGACTCTAAATCCAG TGACAAGTCTGTCTGCCTATTCACCGATTTTGATTCTCAAACAAATGTGT CACAAAGTAAGGATTCTGATGTGTATATCACAGACAAAACTGTGCTAGAC tccggatccggagagggcaggggatctctccttacttgtggcgacgtgga ggagaaccccggccccATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGG TGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTC AGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCTGACCCT GAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCG TGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCAC ATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCA GGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCG AGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGC ATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAA CTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCA TCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAG CTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCT GCTGCCCGACAACCACTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACC CCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCC GGGATCACTCTCGGCATGGACGAGCTGTACAAatgattgtttattgcagc ttataatggttacaaataaagcaatagcatcacaaatttcacaaataaag catttttttcactgcattctagttgtggtttgtccaaactcatcaatgta tcttatATGAGGTCTATGGACTTCAAGAGCAACAGTGCTGTGGCCTGGAG CAACAAATCTGACTTTGCATGTGCAAACGCCTTCAACAACAGCATTATTC CAGAAGACACCTTCTTCCCCAGCCCAGGTAAGGGCAGCTTTGGTGCCTTC GCAGGCTGTTTCCTTGCTTCAGGAATGGCCAGGTTCTGCCCAGAGCTCTG GTCAATGATGTCTAAAACTCCTCTGATTGGTGGTCTCGGCCTTATCCATT GCCACCAAAACCCTCTTTTTACTAAGAAACAGTGAGCCTTGTTCTGGCAG TCCAGAGAATGACACGGGAAAAAAGCAGATGAAGAGAAGGTGGCAGGAGA GGGCACGTGGCCCAGCCTCAGTCTCTCCAACTGAGTTCCTGCCTGCCTGC CTTTGCTCAGACTGTTTGCCCCTTACTGCTCTTCTAGGCCTCATTCTAAG CCCCTT (SEQ ID NO: 195) pARBI-976: TTCATTTCCATTTGAGTTGTTCTTATTGAGTCATCCTTCCTGTGGTAGCG 1806 TRAC_6_ GAACTCACTAAGGGGCCCATCTGGACCCGAGGTATTGTGATGATAAATTC Endogenous_84 TGAGCACCTACCCCATCCCCAGAAGGGCTCAGAAATAAAATAAGAGCCAA GTCTAGTCGGTGTTTCCTGTCTTGAAACACAATACTGTTGGCCCTGGAAG AATGCACAGAATCTGTTTGTAAGGGGATATGCACAGAAGCTGCAAGGGAC AGGAGGTGCAGGAGCTGCAGGCCTCCCCCACCCAGCCTGCTCTGCCTTGG GGAAAACCGTGGGTGTGTCCTGCAGGCCATGCAGGCCTGGGACATGCAAG CCCATAACCGCTGTGGCCTCTTGGTTTTACAGATACGAACCTAAACTTTC AAAACCTGTCAGTGATTGGGTTCCGAATCCTCCTCCTGAAAGTGGCCGGG tccggatccggagagggcaggggatctctccttacttgtggcgacgtgga ggagaaccccggccccATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGG TGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTC AGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCTGACCCT GAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCG TGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCAC ATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCA GGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCG AGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGC ATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAA CTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCA TCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAG CTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCT GCTGCCCGACAACCACTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACC CCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCC GGGATCACTCTCGGCATGGACGAGCTGTACAAatgattgtttattgcagc ttataatggttacaaataaagcaatagcatcacaaatttcacaaataaag catttttttcactgcattctagttgtggtttgtccaaactcatcaatgta tcttatTTTAATCTGCTCATGACGCTGCGGCTGTGGTCCAGCTGAGGTGA GGGGCCTTGAAGCTGGGAGTGGGGTTTAGGGACGCGGGTCTCTGGGTGCA TCCTAAGCTCTGAGAGCAAACCTCCCTGCAGGGTCTTGCTTTTAAGTCCA AAGCCTGAGCCCACCAAACTCTCCTACTTCTTCCTGTTACAAATTCCTCT TGTGCAATAATAATGGCCTGAAACGCTGTAAAATATCCTCATTTCAGCCG CCTCAGTTGCACTTCTCCCCTATGAGGTAGGAAGAACAGTTGTTTAGAAA CGAAGAAACTGAGGCCCCACAGCTAATGAGTGGAGGAAGAGAGACACTTG TGTACACCACATGCCTTGTGTTGTACTTCTCTCACCGTGTAACCTCCTCA TGTCCTCTCTCCCCAGTACGGCTCTCTTAGCTCAGTAGAAAGAAGACATT ACACTC (SEQ ID NO: 196) pARBI-977: CAGTCCATCACGAGCAGCTGGTTTCTAAGATGCTATTTCCCGTATAAAGC 1806 TRAC_ ATGAGACCGTGACTTGCCAGCCCCACAGAGCCCCGCCCTTGTCCATCACT Endogenous_83 GGCATCTGGACTCCAGCCTGGGTTGGGGCAAAGAGGGAAATGAGATCATG TCCTAACCCTGATCCTCTTGTCCCACAGATATCCAGAACCCTGACCCTGC CGTGTACCAGCTGAGAGACTCTAAATCCAGTGACAAGTCTGTCTGCCTAT TCACCGATTTTGATTCTCAAACAAATGTGTCACAAAGTAAGGATTCTGAT GTGTATATCACAGACAAAACTGTGCTAGACATGAGGTCTATGGACTTCAA GAGCAACAGTGCTGTGGCCTGGAGCAACAAATCTGACTTTGCATGTGCAA ACGCCTTCAACAACAGCATTATTCCAGAAGACACCTTCTTCCCCAGCCCA tccggatccggagagggcaggggatctctccttacttgtggcgacgtgga ggagaaccccggccccATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGG TGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTC AGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCTGACCCT GAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCG TGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCAC ATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCA GGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCG AGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGC ATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAA CTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCA TCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAG CTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCT GCTGCCCGACAACCACTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACC CCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCC GGGATCACTCTCGGCATGGACGAGCTGTACAAatgattgtttattgcagc ttataatggttacaaataaagcaatagcatcacaaatttcacaaataaag catttttttcactgcattctagttgtggtttgtccaaactcatcaatgta tcttatGGTAAGGGCAGCTTTGGTGCCTTCGCAGGCTGTTTCCTTGCTTC AGGAATGGCCAGGTTCTGCCCAGAGCTCTGGTCAATGATGTCTAAAACTC CTCTGATTGGTGGTCTCGGCCTTATCCATTGCCACCAAAACCCTCTTTTT ACTAAGAAACAGTGAGCCTTGTTCTGGCAGTCCAGAGAATGACACGGGAA AAAAGCAGATGAAGAGAAGGTGGCAGGAGAGGGCACGTGGCCCAGCCTCA GTCTCTCCAACTGAGTTCCTGCCTGCCTGCCTTTGCTCAGACTGTTTGCC CCTTACTGCTCTTCTAGGCCTCATTCTAAGCCCCTTCTCCAAGTTGCCTC TCCTTATTTCTCCCTGTCTGCCAAAAAATCTTTCCCAGCTCACTAAGTCA GTCTCACGCAGTCACTCATTAACCCACCAATCACTGATTGTGCCGGCACA TGAATG (SEQ ID NO: 197) pARBI-978: TGAGGAAGAAGCGCGGGCGGCGCCTTCGGGAGGCGAGCAGGCAGCAGTTG 1806 TRIM28_1_ GCCGTGCCGTAGCAGCGTCCCGCGCGCGGCGGGCAGCGGCCCAGGAGGCG Endogenous_85 CGTGGCGGCGCTCGGCCTCGCGGCGGCGGCGGCGGCAGCGGCCCAGCAGT TGGCGGCGAGCGCGTCTGCGCCTGCGCGGCGGGCCCCGCGCCCCTCCTCC CCCCCTGGGCGCCCCCGGCGGCGTGTGAATGGCGGCCTCCGCGGCGGCAG CCTCGGCAGCAGCGGCCTCGGCCGCCTCTGGCAGCCCGGGCCCGGGCGAG GGCTCCGCTGGCGGCGAAAAGCGCTCCACCGCCCCTTCGGCCGCAGCCTC GGCCTCTGCCTCAGCCGCGGCGTCGTCGCCCGCGGGGGGCGGCGCCGAGG CGCTGGAGCTGCTGGAGCACTGCGGCGTGTGCAGAGAGCGCCTGCGACCC tccggatccggagagggcaggggatctctccttacttgtggcgacgtgga ggagaaccccggccccATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGG TGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTC AGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCTGACCCT GAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCG TGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCAC ATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCA GGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCG AGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGC ATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAA CTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCA TCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAG CTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCT GCTGCCCGACAACCACTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACC CCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCC GGGATCACTCTCGGCATGGACGAGCTGTACAAatgattgtttattgcagc ttataatggttacaaataaagcaatagcatcacaaatttcacaaataaag catttttttcactgcattctagttgtggtttgtccaaactcatcaatgta tcttatGAGAGGGAGCCCCGCCTGCTGCCCTGTTTGCACTCGGCCTGTAG TGCCTGCTTAGGGCCCGCGGCCCCCGCCGCCGCCAACAGCTCGGGGGACG GCGGGGCGGCGGGCGACGGCACCGGTAAGTACGAAGTGATCGGTGCCACC CCTCCCCCTACTCTCTGCCTTTGATTCCGACTGGGTGCAGAGATGAGGAT GCCACCTGGGCGAGAGGATGGGGGCCCGGACAGGGCACGGGAAATACTTT CTGGGTCCTGCATACGAACGTGGGTTTGTGCTGGCCGCTGAGATGGGACA TCTGACTAAAGTTGGAGAAAAGAAGGCTCGGGGAGGGGAGGGGCTGGTTC GCTGCGGGATAATGGTCGGGGGCCCACCCAGCAGGGGAATGGTGGGGGCC ATAACCTGGGTGGGAACTTGTAACAGTCTCCCACATCCCTGCTTCTCGAA GTGGTG (SEQ ID NO: 198) pARBI-979: TCGAAGTGGTGGACTGTCCCGTGTGCAAGCAACAGTGCTTCTCCAAAGAC 1806 TRIM28_2_ ATCGTGGAGAATTATTTCATGCGTGATAGTGGCAGCAAGGCTGCCACCGA Endogenous_86 CGCCCAGGATGCGAACCAGGTGCGTCCTATCTCAGCAACCACAAGGAGGT TTCTGGGGAGGGGGCATCTGCGCAGGAGGAGCTTGGCACCAGCTCCAGGC TGTTACTCCACTTTCCCAAGGCTCTGGGTGGGCTGCCTAGGTTGGGTCAA GGGACCAATCTTAAATCTCCGGTTGTATTTTCTGGGATGTAAACGTGGAT CTATCAAGTIGTCTTGCCTTCTCTGACCCTGCCTTTGTCTGGCAGTGCTG CACTAGCTGTGAGGATAATGCCCCAGCCACCAGCTACTGTGTGGAGTGCT CGGAGCCTCTGTGTGAGACCTGTGTAGAGGCGCACCAGCGGGTGAAGTAC tccggatccggagagggcaggggatctctccttacttgtggcgacgtgga ggagaaccccggccccATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGG TGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTC AGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCTGACCCT GAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCG TGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCAC ATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCA GGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCG AGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGC ATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAA CTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCA TCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAG CTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCT GCTGCCCGACAACCACTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACC CCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCC GGGATCACTCTCGGCATGGACGAGCTGTACAAatgattgtttattgcagc ttataatggttacaaataaagcaatagcatcacaaatttcacaaataaag catttttttcactgcattctagttgtggtttgtccaaactcatcaatgta tcttatACCAAGGACCATACTGTGCGCTCTACTGGTACATGAGGCTGAGG GGGGCTGTTGGAGTTGTTCTCCCATGTGTGCCCTCAGTTGCTTTTATGAT GTTGGTTGCATCTGGTGGATGGGTCCTAGAGTTCTCTAGGGGGTGCCGCC CGAAGGGCCCGAGGGCAGAACTCCAGAAGCAGAAAAACTGGGGTTGTGGT GTGTAGTCTCTAGGGCCTAGGTGGGAGAGGGTGGGAAGGGGAAATAAGGG AGACCTTAATGTGCTGCAGGGAGGTACAAAGGGTTTGAGAAGGCTTATCA GGGAGTTGTCAGACCTGGTGGTGAAGGGCTCAGCATATGCAAACAGGGAA AGGCATGGTGTGAAGGGCTTTCTGGGTTTGTGGTTCCCTGGCACACATCT GGTATAAGATGCTGCTAGGGAAAGTAGACTGTGGGTCCATGGATTATAAG TGATGA (SEQ ID NO: 199) pARBI-980: GGCGCCCAATGCGCGTGCGCGGCGGCGTCGGCGCCAGTTATTTCTGTCCC 1806 TRIM28_3_ GCCCCCCGGCCTCGGCTCTTTCTGCGAGCGGGCGCGCGGGCGAGCGGTTG EndogenouS_87 TGCTTGTGCTTGTGGCGCGTGGTGCGGGTTTCGGCGGCGGCTGAGGAAGA AGCGCGGGCGGCGCCTTCGGGAGGCGAGCAGGCAGCAGTTGGCCGTGCCG TAGCAGCGTCCCGCGCGCGGCGGGCAGCGGCCCAGGAGGCGCGTGGCGGC GCTCGGCCTCGCGGCGGCGGCGGCGGCAGCGGCCCAGCAGTTGGCGGCGA GCGCGTCTGCGCCTGCGCGGCGGGCCCCGCGCCCCTCCTCCCCCCCTGGG CGCCCCCGGCGGCGTGTGAATGGCGGCCTCCGCGGCGGCAGCCTCGGCAG CAGCGGCCTCGGCCGCCTCTGGCAGCCCGGGCCCGGGCGAGGGCTCCGCT tccggatccggagagggcaggggatctctccttacttgtggcgacgtgga ggagaaccccggccccATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGG TGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTC AGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCTGACCCT GAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCG TGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCAC ATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCA GGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCG AGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGC ATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAA CTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCA TCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAG CTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCT GCTGCCCGACAACCACTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACC CCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCC GGGATCACTCTCGGCATGGACGAGCTGTACAAatgattgtttattgcagc ttataatggttacaaataaagcaatagcatcacaaatttcacaaataaag catttttttcactgcattctagttgtggtttgtccaaactcatcaatgta tcttatGGCGGCGAAAAGCGCTCCACCGCCCCTTCGGCCGCAGCCTCGGC CTCTGCCTCAGCCGCGGCGTCGTCGCCCGCGGGGGGCGGCGCCGAGGCGC TGGAGCTGCTGGAGCACTGCGGCGTGTGCAGAGAGCGCCTGCGACCCGAG AGGGAGCCCCGCCTGCTGCCCTGTTTGCACTCGGCCTGTAGTGCCTGCTT AGGGCCCGCGGCCCCCGCCGCCGCCAACAGCTCGGGGGACGGCGGGGCGG CGGGCGACGGCACCGGTAAGTACGAAGTGATCGGTGCCACCCCTCCCCCT ACTCTCTGCCTTTGATTCCGACTGGGTGCAGAGATGAGGATGCCACCTGG GCGAGAGGATGGGGGCCCGGACAGGGCACGGGAAATACTTTCTGGGTCCT GCATACGAACGTGGGTTTGTGCTGGCCGCTGAGATGGGACATCTGACTAA AGTTGG (SEQ ID NO: 200) pARBI-596: ATCTCGGCCAATAAAGGAGAAAGGGCGCGGCCCGTACGCGCGCCAGGTGC 2541 APRT_1_ GTGGGCGAGACCAGCTCACGCCCCTCCTCCAGCCGCCAAGGCCCCGGCCC Exogenous_1 ACAGCTGCCTGGCTGCAGTCAGAAGCGTAGCCCGAGACAAGGAAGGGCGC CTTGACTCGCACTTTTGTCCGGTTCGAACGTTCTGctcagtggtgcgtgg aatgcgagcgcgtcttaaaatcgatggcgcctaggagtccatgaaatacg gTACAGGCTTCCGGCGACGGATGCCCCGCCCCTCACCCACGCTCCGCCCT CCGGGGATGCCCCACCCCTCGTGGCGGTCCCGCCCGTCCCCGCGCAGGCG CGCTCGGGCTGCCGCTGGCTCTTCGCACGCGGCCATGGCCGACTCCGAGC TGCAGCTGGTTGAGCAGCGGATCCGCAGCTTCCCCGACTTCCCCACCCCA tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggggggcaggacagcaagggggaggattgggaagacaatagca ggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGGT GCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGGG GGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTAA ACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGGG GGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCAA CGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCTC CTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAGT CGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCGT CTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTCC CTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCTG CTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACAG ATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccacc ATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGGT CGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAGG GCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCACC ACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCTA CGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGACT TCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTTC TTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGGG CGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAGG ACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAAC GTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCAA GATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTACC AGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCAC TACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCGA TCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGCA TGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaaa taaagcaatagcatcacaaatttcacaaataaagcatttttttcactgca ttctagttgtggtttgtccaaactcatcaatgtatcttatGGCGTGGTAT TCAGGTGCACGCACAGGCCGCCCTCGTGGCGCCCCGACCTGCGGGCCTAC GGATGGGAGCGCGTGGCCCGCGACCTCCGGGGGGCGGGGCGGGAACCCTC GTCTTTCGCCCCCGGGGCCCTGCCCTCCTTCGGCCCCGGCGTCACCAGGC CTGTCCTTGGGTCCAGGGACATCTCGCCCGTCCTGAAGGACCCCGCCTCC TTCCGCGCCGCCATCGGCCTCCTGGCGCGACACCTGAAGGCGACCCACGG GGGCCGCATCGACTACATCGCAGGCGAGTGCCCAGTGGCCGCATCTAGGG CGCTTCCGCCTCTGCGCGCGCCGAGGGCAGCACGTGGGCTCTGCGCGTCT GCTTGGGGGAGGGCCTTTGGGGTGCTTCAGGGGGCGCCGGGACGGGCGCC GTGCTTGGGTCGCCCGGGAAGGGTTGTGAGATTGAGCCC (SEQ ID NO: 201) pARBI-597: TGCCCCGCCCCTCACCCACGCTCCGCCCTCCGGGGATGCCCCACCCCTCG 2541 APRT_2_ TGGCGGTCCCGCCCGTCCCCGCGCAGGCGCGCTCGGGCTGCCGCTGGCTC Exogenous_2 TTCGCACGCGGCCATGGCCGACTCCGAGCTGCAGCTGGTTGAGCAGCGGA TCCGCAGCTTCCCCGACTTCCCCACCCCAGGCGTGGTATTCAGGTGCACG CACAGGCCGCCCTCGTGGCGCCCCGACCTGCGGGCCTACGGATGGGAGCG CGTGGCCCGCGACCTCCGGGGGGGGGGGGCGGGAACCCTCGTCTTTCGCC CCCGGGGCCCTGCCCTCCTTCGGCCCCGGCGTCACCAGGCCTGTCCTTGG GTCCAGGGACATCTCGCCCGTCCTGAAGGACCCCGCCTCCTTCCGCGCCG CCATCGGCCTCCTGGCGCGACACCTGAAGGCGACCCACGGGGGCCGCATC tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggtggggcaggacagcaagggggaggattgggaagacaatagc aggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGG TGCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGG GGGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTA AACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGG GGGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCA ACGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCT CCTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAG TCGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCG TCTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTC CCTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCT GCTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACA GATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccac cATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGG TCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAG GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCT ACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGAC TTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTT CTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGG GCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAG GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAA CGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCA AGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTAC CAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCA CTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCG ATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGC ATGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaa ataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgc attctagttgtggtttgtccaaactcatcaatgtatcttatGACTACATC GCAGGCGAGTGCCCAGTGGCCGCATCTAGGGCGCTTCCGCCTCTGCGCGC GCCGAGGGCAGCACGTGGGCTCTGCGCGTCTGCTTGGGGGAGGGCCTTTG GGGTGCTTCAGGGGGCGCCGGGACGGGCGCCGTGCTTGGGTCGCCCGGGA AGGGTTGTGAGATTGAGCCCCCGAGGCCGCCGCGCTGTGCAGGCGTCCTT CCCGCAGGTTCCGGGTCCCCAGCCCAGGACAGGCGTGACCGAGTTGCCGG GTCAGTTGGTCTCCCTGGAGTGCCCAAGCTGAATCCACAGGGCCCAGCTG CCTTGCTTCTTGTTCCTTCTGCGAGCTGGTATTGAGCGCCTGCCACGAGC CAGGCCTTCCCTGGTGAAGATCACGGAATGCCCACCCAGGGAAGGGAGGC CTGGAGGCCTCCGGGAGAGCCCAAGAGGTGGCCCAGGGAGA (SEQ ID NO: 202) pARBI-598: CGTTCTGctcagtggtgcgtggaatgcgagcgcgtcttaaaatcgatggc 2541 APRT_3_ gcctaggagtccatgaaatacggTACAGGCTTCCGGCGACGGATGCCCCG Exogenous_3 CCCCTCACCCACGCTCCGCCCTCCGGGGATGCCCCACCCCTCGTGGCGGT CCCGCCCGTCCCCGCGCAGGCGCGCTCGGGCTGCCGCTGGCTCTTCGCAC GCGGCCATGGCCGACTCCGAGCTGCAGCTGGTTGAGCAGCGGATCCGCAG CTTCCCCGACTTCCCCACCCCAGGCGTGGTATTCAGGTGCACGCACAGGC CGCCCTCGTGGCGCCCCGACCTGCGGGCCTACGGATGGGAGCGCGTGGCC CGCGACCTCCGGGCGGGCGGGGGGGGAACCCTCGTCTTTCGCCCCCGGGG CCCTGCCCTCCTTCGGCCCCGGCGTCACCAGGCCTGTCCTTGGGTCCAGG tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggggggcaggacagcaagggggaggattgggaagacaatagca ggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGGT GCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGGG GGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTAA ACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGGG GGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCAA CGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCTC CTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAGT CGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCGT CTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTCC CTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCTG CTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACAG ATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccacc ATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGGT CGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAGG GCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCACC ACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCTA CGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGACT TCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTTC TTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGGG CGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAGG ACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAAC GTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCAA GATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTACC AGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCAC TACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCGA TCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGCA TGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaaa taaagcaatagcatcacaaatttcacaaataaagcatttttttcactgca ttctagttgtggtttgtccaaactcatcaatgtatcttatGACATCTCGC CCGTCCTGAAGGACCCCGCCTCCTTCCGCGCCGCCATCGGCCTCCTGGCG CGACACCTGAAGGCGACCCACGGGGGCCGCATCGACTACATCGCAGGCGA GTGCCCAGTGGCCGCATCTAGGGCGCTTCCGCCTCTGCGCGCGCCGAGGG CAGCACGTGGGCTCTGCGCGTCTGCTTGGGGGAGGGCCTTTGGGGTGCTT CAGGGGGCGCCGGGACGGGCGCCGTGCTTGGGTCGCCCGGGAAGGGTTGT GAGATTGAGCCCCCGAGGCCGCCGCGCTGTGCAGGCGTCCTTCCCGCAGG TTCCGGGTCCCCAGCCCAGGACAGGCGTGACCGAGTTGCCGGGTCAGTTG GTCTCCCTGGAGTGCCCAAGCTGAATCCACAGGGCCCAGCTGCCTTGCTT CTTGTTCCTTCTGCGAGCTGGTATTGAGCGCCTGCCACGA (SEQ ID NO: 203) pARBI-599: AGTAATTTGATGGGGGCTATTATGAACTGAGAAATGAACTTTGAAAAGTA 2541 B2M_1_ TCTTGGGGCCAAATCATGTAGACTCTTGAGTGATGTGTTAAGGAATGCTA Exogenous_4 TGAGTGCTGAGAGGGCATCAGAAGTCCTTGAGAGCCTCCAGAGAAAGGCT CTTAAAAATGCAGCGCAATCTCCAGTGACAGAAGATACTGCTAGAAATCT GCTAGAAAAAAAACAAAAAAGGCATGTATAGAGGAATTATGAGGGAAAGA TACCAAGTCACGGTTTATTCTTCAAAATGGAGGTGGCTTGTTGGGAAGGT GGAAGCTCATTTGGCCAGAGTGGAAATGGAATTGGGAGAAATCGATGACC AAATGTAAACACTTGGTGCCTGATATAGCTTGACACCAAGTTAGCCCCAA GTGAAATACCCTGGCAATATTAATGTGTCTTTTCCCGATATTCCTCAGGT tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggtggggcaggacagcaagggggaggattgggaagacaatagc aggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGG TGCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGG GGGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTA AACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGG GGGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCA ACGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCT CCTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAG TCGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCG TCTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTC CCTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCT GCTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACA GATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccac cATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGG TCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAG GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCT ACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGAC TTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTT CTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGG GCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAG GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAA CGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCA AGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTAC CAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCA CTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCG ATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGC ATGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaa ataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgc attctagttgtggtttgtccaaactcatcaatgtatcttatACTCCAAAG ATTCAGGTTTACTCACGTCATCCAGCAGAGAATGGAAAGTCAAATTTCCT GAATTGCTATGTGTCTGGGTTTCATCCATCCGACATTGAAGTTGACTTAC TGAAGAATGGAGAGAGAATTGAAAAAGTGGAGCATTCAGACTTGTCTTTC AGCAAGGACTGGTCTTTCTATCTCTTGTACTACACTGAATTCACCCCCAC TGAAAAAGATGAGTATGCCTGCCGTGTGAACCATGTGACTTTGTCACAGC CCAAGATAGTTAAGTGGGGTAAGTCTTACATTCTTTTGTAAGCTGCTGAA AGTTGTGTATGAGTAGTCATATCATAAAGCTGCTTTGATATAAAAAAGGT CTATGGCCATACTACCCTGAATGAGTCCCATCCCATCTGATATAAACAAT CTGCATATTGGGATTGTCAGGGAATGTTCTTAAAGATCAGA (SEQ ID NO: 204) pARBI-600: AAGATCTTAATCTTCTGGGTTTCCGTTTTCTCGAATGAAAAATGCAGGTC 2541 B2M_2_ CGAGCAGTTAACTGGCTGGGGCACCATTAGCAAGTCACTTAGCATCTCTG Exogenous_5 GGGCCAGTCTGCAAAGCGAGGGGGCAGCCTTAATGTGCCTCCAGCCTGAA GTCCTAGAATGAGCGCCCGGTGTCCCAAGCTGGGGCGCGCACCCCAGATC GGAGGGCGCCGATGTACAGACAGCAAACTCACCCAGTCTAGTGCATGCCT TCTTAAACATCACGAGACTCTAAGAAAAGGAAACTGAAAACGGGAAAGTC CCTCTCTCTAACCTGGCACTGCGTCGCTGGCTTGGAGACAGGTGACGGTC CCTGCGGGCCTTGTCCTGATTGGCTGGGCACGCGTTTAATATAAGTGGAG GCGTCGCGCTGGCGGGCATTCCTGAAGCTGACAGCATTCGGGCCGAGATG tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggggggcaggacagcaagggggaggattgggaagacaatagca ggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGGT GCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGGG GGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTAA ACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGGG GGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCAA CGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCTC CTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAGT CGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCGT CTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTCC CTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCTG CTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACAG ATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccacc ATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGGT CGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAGG GCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCACC ACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCTA CGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGACT TCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTTC TTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGGG CGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAGG ACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAAC GTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCAA GATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTACC AGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCAC TACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCGA TCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGCA TGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaaa taaagcaatagcatcacaaatttcacaaataaagcatttttttcactgca ttctagttgtggtttgtccaaactcatcaatgtatcttatTCTCGCTCCG TGGCCTTAGCTGTGCTCGCGCTACTCTCTCTTTCTGGCCTGGAGGCTATC CAGCGTGAGTCTCTCCTACCCTCCCGCTCTGGTCCTTCCTCTCCCGCTCT GCACCCTCTGTGGCCCTCGCTGTGCTCTCTCGCTCCGTGACTTCCCTTCT CCAAGTTCTCCTTGGTGGCCCGCCGTGGGGCTAGTCCAGGGCTGGATCTC GGGGAAGCGGCGGGGTGGCCTGGGAGTGGGGAAGGGGGTGCGCACCCGGG ACGCGCGCTACTTGCCCCTTTCGGCGGGGAGCAGGGGAGACCTTTGGCCT ACGGCGACGGGAGGGTCGGGACAAAGTTTAGGGCGTCGATAAGCGTCAGA GCGCCGAGGTTGGGGGAGGGTTTCTCTTCCGCTCTTTCGCGGGGCCTCTG GCTCCCCCAGCGCAGCTGGAGTGGGGGACGGGTAGGCTCG (SEQ ID NO: 205) pARBI-601: AGGAATGCTATGAGTGCTGAGAGGGCATCAGAAGTCCTTGAGAGCCTCCA 2541 B2M_3_ GAGAAAGGCTCTTAAAAATGCAGCGCAATCTCCAGTGACAGAAGATACTG Exogenous_6 CTAGAAATCTGCTAGAAAAAAAACAAAAAAGGCATGTATAGAGGAATTAT GAGGGAAAGATACCAAGTCACGGTTTATTCTTCAAAATGGAGGTGGCTTG TTGGGAAGGTGGAAGCTCATTTGGCCAGAGTGGAAATGGAATTGGGAGAA ATCGATGACCAAATGTAAACACTTGGTGCCTGATATAGCTTGACACCAAG TTAGCCCCAAGTGAAATACCCTGGCAATATTAATGTGTCTTTTCCCGATA TTCCTCAGGTACTCCAAAGATTCAGGTTTACTCACGTCATCCAGCAGAGA ATGGAAAGTCAAATTTCCTGAATTGCTATGTGTCTGGGTTTCATCCATCC tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggtggggcaggacagcaagggggaggattgggaagacaatagc aggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGG TGCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGG GGGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTA AACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGG GGGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCA ACGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCT CCTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAG TCGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCG TCTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTC CCTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCT GCTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACA GATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccac cATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGG TCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAG GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCT ACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGAC TTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTT CTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGG GCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAG GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAA CGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCA AGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTAC CAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCA CTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCG ATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGC ATGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaa ataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgc attctagttgtggtttgtccaaactcatcaatgtatcttatGACATTGAA GTTGACTTACTGAAGAATGGAGAGAGAATTGAAAAAGTGGAGCATTCAGA CTTGTCTTTCAGCAAGGACTGGTCTTTCTATCTCTTGTACTACACTGAAT TCACCCCCACTGAAAAAGATGAGTATGCCTGCCGTGTGAACCATGTGACT TTGTCACAGCCCAAGATAGTTAAGTGGGGTAAGTCTTACATTCTTTTGTA AGCTGCTGAAAGTTGTGTATGAGTAGTCATATCATAAAGCTGCTTTGATA TAAAAAAGGTCTATGGCCATACTACCCTGAATGAGTCCCATCCCATCTGA TATAAACAATCTGCATATTGGGATTGTCAGGGAATGTTCTTAAAGATCAG ATTAGTGGCACCTGCTGAGATACTGATGCACAGCATGGTTTCTGAACCAG TAGTTTCCCTGCAGTTGAGCAGGGAGCAGCAGCAGCACTTG (SEQ ID NO: 206) pARBI-602: GATCTCACATTCTAGCGCCCAGACATGTCGGGAACGCCTTACGCCAACTT 2541 CAPNS1_1_ GGGGTCCCCACCAAGAGAACCCCCACCAGATCTGCACCCTCCCCTTCACG Exogenous_7 CGTGCACCCAGTCCAGGCTCCCTCAAGCCCCACGGGTGCCTTTTAGACCT GAGGAGGTTGCAAACCTGATCCCCCATACCTGCCCCACCCATCCGCGGAC AACCCGCCCTCGCAAACTCAGACCCCCACCCGGAGGCTTCAGATTCCTCC CAGGTCCAGCTGCCGGAAATGCGTGTTTGAAGGGAGGGTGTGGGCTCAGG GGCGAAGCACCCACTGGTCCCCTTTTTTCCCCCCAGCAGTGAGTCGCAGC CATGTTCCTGGTTAACTCGTTCTTGAAGGGCGGCGGCGGCGGCGGCGGGG GAGGCGGGGGCCTGGGTGGGGGCCTGGGAAATGTGCTTGGAGGCCTGATC tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggtggggcaggacagcaagggggaggattgggaagacaatagc aggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGG TGCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGG GGGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTA AACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGG GGGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCA ACGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCT CCTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAG TCGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCG TCTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTC CCTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCT GCTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACA GATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccac cATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGG TCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAG GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCT ACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGAC TTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTT CTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGG GCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAG GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAA CGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCA AGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTAC CAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCA CTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCG ATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGC ATGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaa ataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgc attctagttgtggtttgtccaaactcatcaatgtatcttatAGCGGGGCC GGGGGCGGCGGCGGCGGCGGCGGCGGCGGCGGCGGTGGTGGAGGCGGCGG TGGCGGTGGAACGGCCATGCGCATCCTAGGCGGAGTCATCAGCGCCATCA GGTAAGGCGGAGACTATCAGAGGGGCGGGGCCTGGGAATGGGAGGAGCCT CAGTGAGGCGTGGTCTGGGAGGGGCGTGGTCTAAAAATAGAATAGGATTA ACCTGGAGGCTAACCTGGGTACATGAATTAGGCCGGGGAGGCCTGGTTTG AGAGTTCTGCTGTAAGGGGGTGGACCCCAGTGAGGCGGATATCAGTCATT GGGGGCGGTGCTTGATATGGGAGTAGTCTGATTGTGTGGGACCAGGACAA TGTGGTCCTGAGGGGACTGATGTGGAGTTTTGGCGGGTGGGGCTTATGGG TCTGGGCTCAGCCTGCATTGGCTAACCTGGAGATGAACGTT (SEQ ID NO: 207) pARBI-603: CGGGCATTTAGGAAGCGGGCCATGGAGTAGGGTCTTGGGCAGATGGCACC 2541 CAPNS1_2_ TGAGGAGGATAAGGCCTTGGGAAGCTGGGGCAGAGCCTCTATGAAGTGAG Exogenous_8 CCCTCCAAGGGGCGGGGTCTTTTGATTCTACGTGGGATTTTTTAGGTTTA GGGTGGCCAAGATGACTGAAATCTGCCACTGGGTAGGTGTGCCTGGCAGG AGGGGAGCCTCCCAGGGGACCGGTCTCTGGGTTTCCTCGAGGGTGGGGTT GGCCTGAGGAAGGGAGAAGAGGGGCACGACCAGGGCAGTGTGGATTGGGA CAGATGAGGACAAGAACAAATGAAAGGCACAGCAACCAAGTAAGGAAGAT AACGGCTGGGGTCTGGAGCGTTGGGGCTGATGGTTCTGTAGTGCTGCCCG TTGGAGGCCCCGCCCCTGGCACTAACCCCTCCCCCTTATCTCTTCGCAGC tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggtggggcaggacagcaagggggaggattgggaagacaatagc aggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGG TGCCCGTCAGTGGGCAGAGCGCACAT CGCCCACAGTCCCCGAGAAGTTGGGGGGAGGGGTCGGCAATTGAACGGGT GCCTAGAGAAGGTGGCGCGGGGTAAACTGGGAAAGTGATGTCGTGTACTG GCTCCGCCTTTTTCCCGAGGGTGGGGGAGAACCGTATATAAGTGCAGTAG TCGCCGTGAACGTTCTTTTTCGCAACGGGTTTGCCGCCAGAACACAGCTG AAGCTTCGAGGGGCTCGCATCTCTCCTTCACGCGCCCGCCGCCCTACCTG AGGCCGCCATCCACGCCGGTTGAGTCGCGTTCTGCCGCCTCCCGCCTGTG GTGCCTCCTGAACTGCGTCCGCCGTCTAGGTAAGTTTAAAGCTCAGGTCG AGACCGGGCCTTTGTCCGGCGCTCCCTTGGAGCCTACCTAGACTCAGCCG GCTCTCCACGCTTTGCCTGACCCTGCTTGCTCAACTCTACGTCTTTGTTT CGTTTTCTGTTCTGCGCCGTTACAGATCCAAGCTGTGACCGGCGCCTACT CTAGAGCTAGCGAATTgccgccaccATGGTGAGCAAGGGCGAGGAGCTGT TCACCGGGGTGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGC CACAAGTTCAGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAA GCTGACCCTGAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGC CCACCCTCGTGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTAC CCCGACCACATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGG CTACGTCCAGGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGA CCCGCGCCGAGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAG CTGAAGGGCATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCT GGAGTACAACTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGA AGAACGGCATCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACGGC AGCGTGCAGCTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGG CCCCGTGCTGCTGCCCGACAACCACTACCTGAGCACCCAGTCCAAGCTGA GCAAAGACCCCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTG ACCGCCGCCGGGATCACTCTCGGCATGGACGAGCTGTACAAatgattgtt tattgcagcttataatggttacaaataaagcaatagcatcacaaatttca caaataaagcatttttttcactgcattctagttgtggtttgtccaaactc atcaatgtatcttatGAGGCGGCTGCGCAGTACAACCCGGAGCCCCCGGT AAGCCCCCTCTGCAACCAGACCCCCTTCTCCTGCCAAGGCCTCTTCGAGG TCCCATCCCTGTTCCTGTAGAGAAGCCCCACCTTCCTCCCCTTCTTGTGA AATTCCTCTGCCAGTTCCTCCCATGCCGTGTCTGCAGCTTCGCCATGGGT CTTAGCCATGCCCCACATACGTGCACCCCATTCACTACCACCCCTCATTC TTTTCCTCATCAAGCTGCCCAGCCCACTCTGACTTCCCCACCCAGGGTAC CTGGGTTTGGGGAGCCGTCCTGGCCGGGTTCCCCTCCCCCTGCTCTGAGC TCTCCTCCCTTTGCAGCCCCCACGCACACATTACTCCAACATTGAGGCCA ACGAGAGTGAGGAGGTCCGGCAGTTCCGGAGACTCTTTGCCCAGCTGGCT GGAGATGTAAGTAAC (SEQ ID NO: 208) pARBI-604: GCTGATGGTTCTGTAGTGCTGCCCGTTGGAGGCCCCGCCCCTGGCACTAA 2541 CAPNS1_3_ CCCCTCCCCCTTATCTCTTCGCAGCGAGGCGGCTGCGCAGTACAACCCGG Exogenous_9 AGCCCCCGGTAAGCCCCCTCTGCAACCAGACCCCCTTCTCCTGCCAAGGC CTCTTCGAGGTCCCATCCCTGTTCCTGTAGAGAAGCCCCACCTTCCTCCC CTTCTTGTGAAATTCCTCTGCCAGTTCCTCCCATGCCGTGTCTGCAGCTT CGCCATGGGTCTTAGCCATGCCCCACATACGTGCACCCCATTCACTACCA CCCCTCATTCTTTTCCTCATCAAGCTGCCCAGCCCACTCTGACTTCCCCA CCCAGGGTACCTGGGTTTGGGGAGCCGTCCTGGCCGGGTTCCCCTCCCCC TGCTCTGAGCTCTCCTCCCTTTGCAGCCCCCACGCACACATTACTCCAAC tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggtggggcaggacagcaagggggaggattgggaagacaatagc aggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGG TGCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGG GGGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTA AACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGG GGGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCA ACGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCT CCTTCACGCGCCCGCC GCCCTACCTGAGGCCGCCATCCACGCCGGTTGAGTCGCGTTCTGCCGCCT CCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCGTCTAGGTAAGTTTAAA GCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTCCCTTGGAGCCTACCTA GACTCAGCCGGCTCTCCACGCTTTGCCTGACCCTGCTTGCTCAACTCTAC GTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACAGATCCAAGCTGTGACC GGCGCCTACTCTAGAGCTAGCGAATTgccgccaccATGGTGAGCAAGGGC GAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGGTCGAGCTGGACGGCGA CGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAGGGCGAGGGCGATGCCA CCTACGGCAAGCTGACCCTGAAGTTCATCTGCACCACCGGCAAGCTGCCC GTGCCCTGGCCCACCCTCGTGACCACCCTGACCTACGGCGTGCAGTGCTT CAGCCGCTACCCCGACCACATGAAGCAGCACGACTTCTTCAAGTCCGCCA TGCCCGAAGGCTACGTCCAGGAGCGCACCATCTTCTTCAAGGACGACGGC AACTACAAGACCCGCGCCGAGGTGAAGTTCGAGGGCGACACCCTGGTGAA CCGCATCGAGCTGAAGGGCATCGACTTCAAGGAGGACGGCAACATCCTGG GGCACAAGCTGGAGTACAACTACAACAGCCACAACGTCTATATCATGGCC GACAAGCAGAAGAACGGCATCAAGGTGAACTTCAAGATCCGCCACAACAT CGAGGACGGCAGCGTGCAGCTCGCCGACCACTACCAGCAGAACACCCCCA TCGGCGACGGCCCCGTGCTGCTGCCCGACAACCACTACCTGAGCACCCAG TCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCGATCACATGGTCCTGCT GGAGTTCGTGACCGCCGCCGGGATCACTCTCGGCATGGACGAGCTGTACA Aatgattgtttattgcagcttataatggttacaaataaagcaatagcatc acaaatttcacaaataaagcatttttttcactgcattctagttgtggttt gtccaaactcatcaatgtatcttatATTGAGGCCAACGAGAGTGAGGAGG TCCGGCAGTTCCGGAGACTCTTTGCCCAGCTGGCTGGAGATGTAAGTAAC CTGGGGTCCCTGGCCCCGTCCTAACCGTTCCATCCCTTCCCTTGTGGCTG CCCTTGCACACACACCCTTGACCATGACAATCCCAGTGTTCCCATTCTCC ATGACATTCTCAGACCCCTTTCAGTCACCCCTGACCTGCCCCTAACTTCC GCCCGCAGGACATGGAGGTCAGCGCCACAGAACTCATGAACATTCTCAAT AAGGTTGTGACACGACGTAAGTGACCGGGGTTAAGGAATAGGGTAGATTC AGAGGCAGAGGGGTCAGAGAGGATTTGACCTCTGGCCTCTGACTTTCAAC CTGTTACCCACAGACCCTGATCTGAAGACTGATGGTTTTGGCATTGACAC ATGTCGCAGCATGGTGGCCGTGATG (SEQ ID NO: 209) pARBI-605: ggagagttatagcagatggagacaaaaaagggaactgctggggatcaaac 2541 CBLB_1_ cttgtaaagtccttctaagtgatatatgaggactttgacttttattcaaa Exogenous_10 gttagaaggctttgaacaaaagaattatttcatttgtgttttacaatggt cacaagtgccgtagcgagaataggctgTGTAGAGAAGAGATTTTGTGGTT TTCTTTTTTGTTCTCTGTATTCTCTTCCTGAAAAATCTGTTTTCTGCCTA ATCATATTTCTGTAAAGAACAATAGACTTGAACGTCTGCTGTTAAGTAAC AAAGGTTGACCTAAACCACATAAGGTCAGATTAACTTTTAAAATATCCAA GATAATGTTAAGTGTATTAAATATGGTTCAGTATGTAATCAAAATATTTG ATATTCTCTAGATACTAATCTCttttaatttttttattttttAGCCTTGG tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggtggggcaggacagcaagggggaggattgggaagacaatagc aggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGG TGCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGG GGGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTA AACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGG GGGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCA ACGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCT CCTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAG TCGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCG TCTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTC CCTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCT GCTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACA GATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccac cATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGG TCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAG GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCT ACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGAC TTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTT CTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGG GCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAG GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAA CGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCA AGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTAC CAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCA CTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCG ATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGC ATGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaa ataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgc attctagttgtggtttgtccaaactcatcaatgtatcttatGGCTCTATT TTGCGGAATTGGAATTTCTTAGCTGTGACACATCCAGGTTACATGGCATT TCTCACATATGATGAAGTTAAAGCACGACTACAGAAATATAGCACCAAAC CCGGAAGGTAAGACTTTTTAGCAGTAATATTATGTGATATATCCAAAGAT GAAAAATTCTCCATGGTTTGTTCCTAAGTAAGTCAGCAATACCCGCTGAT GGCATGCTTGGGAGGGGGCTAGAAAAGGTAAAAGACCAAAATGGTGCTTC TTAACATAATGTTTTAAATTGATACATTGTGCTTACTCTGAAACTTGGTA TTCATAAAGTGAAAGGGGATATGGTCTCCTTTCTGTTTTCATGATATCCG TGTTTATGTGCAAAATTGAATGTATATATTTGAATACTTGAATACTGTGC TAAAGAACAAATAAATAGAACAGTCTTATGTTCATTACTTT (SEQ ID NO: 210) pARBI-606: CATATTACTCATTTGTGCAATGAATAAAGGATTTCTATTATTATATGGAC 2541 CBLB_2_ CCATCATCTTATGACTGTTTTTAGAAAATGAGAATTAAATCTTGATTCAT Exogenous_11 TATTAAATAACCTTGTATACACACATGAAGAGGTCATGACACATAAGACT TTGACCTAATACCTTTTTTCAGATATGTTTCTATAGCTATCTTGATAGCC TAGGACTGTTTGAGAGAAAATTAATTTAAATTTTTGTCATATTCTTAGCA AATGCAAAATAATTGAGTTAAACAGAACCCACTACATTTGTTCTGAAATA TGAACAATATTCTCAAATATTTGTCTGGTAGTACAAATACAATTATATTA ATTTTATTATTGAATTTTGCTGCTTCAAAGGGAGGTATTTCTTAAATGAT ACTTGATTCAATTATTTCCCTTTTTTTCCACCTTGCACAGAACTATCGTA tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggggggtggggcaggacagcaagggggaggattgggaagacaatagca ggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGGT GCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGGG GGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTAA ACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGGG GGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCAA CGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCTC CTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAGT CGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCGT CTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTCC CTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCTG CTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACAG ATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccacc ATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGGT CGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAGG GCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCACC ACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCTA CGGCGTGCAGTGCTTCAGCCGCTACCCCGACCAC ATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCA GGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCG AGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGC ATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAA CTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCA TCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAG CTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCT GCTGCCCGACAACCACTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACC CCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCC GGGATCACTCTCGGCATGGACGAGCTGTACAAatgattgtttattgcagc ttataatggttacaaataaagcaatagcatcacaaatttcacaaataaag catttttttcactgcattctagttgtggtttgtccaaactcatcaatgta tcttatCCATGGAAAGTATTCAGACAGTGCCTTCATGAGGTCCACCAGAT TAGCTCTGGCCTGGAAGCAATGGCTCTAAAATCAACAATTGATTTAACTT GCAATGATTACATTTCAGTTTTTGAATTTGATATTTTTACCAGGCTGTTT CAGGTAAGTTTATACTTGCTTAGTCTATCCATTCCCTtcttctctctccc tgtcttttctctctgtttctctctctctctgtctctctctcacacacaca cacaTAAACAAACATATGGAAAATAGTGTCATTCCTATTAGTCTGTAATT TTATTGGTATGTTTGGTCATTGTTAAAAGTTTAGAGTGAGCCATTTGGCT TAAGTTTGGAGTTATCAGATGCTGTGAGCCTGGTGGCTGCTTGTTAGCTT TGAGAGGATCCATCCAAAAGAGCTTTGTTTTAAACTCTCTACTTAGTTTC TTACTT (SEQ ID NO: 211) pARBI-607: TTAAACTTTAGAGGTTCAAAATAAGGATATGATATAGAAAAGATTGAATG 2541 CBLB_3_ GTAGGAGAACAGAATAGGATTTATTTATATTATGACAAACGTGACTATCA Exogenous_12 GTTAATAAATTGGAGATAGTAAGTTAAGATCATTATTTATATTATGGAAA AATGAAGAAATAGGACATGGTTCTCTAACTTTTTAATTTTTCTGTGAAAT AAAATATTATGACTTATTTTATTTTTATCTTTGTTTTTAGGTAAGACTGT GCCAAAATCCCAAACTTCAGTTGAAAAATAGCCCACCATATATACTTGAT ATTTTGCCTGATACATATCAGCATTTACGACTTATATTGAGTAAATATGA TGACAACCAGAAACTTGCCCAACTCAGTGAGAATGAGTACTTTAAAATCT ACATTGATAGCCTTATGAAAAAGTCAAAACGGGCAATAAGACTCTTTAAA tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggtggggcaggacagcaagggggaggattgggaagacaatagc aggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGG TGCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGG GGGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTA AACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGG GGGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCA ACGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCT CCTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAG TCGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCG TCTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTC CCTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCT GCTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACA GATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccac cATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGG TCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAG GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCT ACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGAC TTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTT CTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGG GCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAG GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAA CGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCA AGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTAC CAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCA CTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCG ATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGC ATGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaa ataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgc attctagttgtggtttgtccaaactcatcaatgtatcttatGAAGGCAAG GAGAGAATGTATGAAGAACAGTCACAGGACAGGTAAGAAGAATATTTCAG ATGTTTTGGTGTAAAGGTCATTTATGTTGCTTTTTACTTAATAGTTAACC TAAACGTCACCATGTAATTGTTTTGGGGTAAATGTGAGTCGCTTTGTATA Tagtcatgtggtacttaacgatggggatactttacaagaaatgcattttt agacaatttcattgttgtgtggacatgatagagtgtacttacacagagct agatggtattacctaccgcacacctaggctgtatgatgtagcctattgct cctaggctgcaaatctatacagtatgttactgtactgaatactgtgggca gttgtaacacaatggtaagtatgtgtgtatctaaatatacctaaacatag aaaaggtatggtaaaaatactgtataaaaggccgggcgtgg (SEQ ID NO: 212) pARBI-608: AGATGAGAAAACCTATCCTTCCCAATTTTTTTGTGTGAGAATTAAAATGC 2541 CD2_1_ AGCAAGAAAACACACACTCATAAACACATCTGCTTTGGCAAAGGAGCACA Exogenous_13_ TCAGAAGGGCTGGCTTGTGCGCGCTCTTGCTCTCTGTGTATGTGTATTAT 14_15 GTTTTATGTTACTGTAAAAGATGTAAAGAGAGGCACGTGGTTAAGCTCTC GGGGTGTGGACTCCACCAGTCTCACTTCAGTTCCTTTTGCATGAAGAGCT CAGAATCAAAAGAGGAAACCAACCCCTAAGATGAGCTTTCCATGTAAATT TGTAGCCAGCTTCCTTCTGATTTTCAATGTTTCTTCCAAAGGTAAGCATA AGAGTCAAAGAAGTCCCAACCCAGCTTTCCCTGAAAGTGACTCTCAGTAA CTCTTTTGCTTTTTATAGGTGCAGTCTCCAAAGAGATTACGAATGCCTTG tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggtggggcaggacagcaagggggaggattgggaagacaatagc aggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGG TGCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGG GGGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTA AACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGG GGGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCA ACGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCT CCTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAG TCGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCG TCTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTC CCTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCT GCTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACA GATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccac cATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGG TCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAG GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCT ACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGAC TTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTT CTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGG GCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAG GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAA CGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCA AGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTAC CAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCA CTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCG ATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGC ATGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaa ataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgc attctagttgtggtttgtccaaactcatcaatgtatcttatGAAACCTGG GGTGCCTTGGGTCAGGACATCAACTTGGACATTCCTAGTTTTCAAATGAG TGATGATATTGACGATATAAAATGGGAAAAAACTTCAGACAAGAAAAAGA TTGCACAATTCAGAAAAGAGAAAGAGACTTTCAAGGAAAAAGATACATAT AAGCTATTTAAAAATGGAACTCTGAAAATTAAGCATCTGAAGACCGATGA TCAGGATATCTACAAGGTATCAATATATGATACAAAAGGAAAAAATGTGT TGGAAAAAATATTTGATTTGAAGATTCAAGGTAAGTGTTCATTCCCTTAA TTGCTTTATTTCAGTGTGGGTGCTATTTGGCAAGTTGGAAAATAGCATTT CTAATATTCCCCAGCGCTGACCTCTGCCTCCAGGGGGGCTATACAGGAAA CCAGATGCATGTCTCCTGCCCCAGTGGAGACTGTGGTCCAA (SEQ ID NO: 213) pARBI-609: AGGATTGTTCTAGTTGATTGGTATGTGTGCACTCCTAGTTGTTAAATATT 2541 CD3_ TTCACTATCACACCTGGATATACTCAACAAATATTTGTTGAGCCAAATAC E_1_ TCAACACCAGCCAAACACGTAGTATTTACTTTAGCTTAAGCGAATTATTT Exogenous_16 AGCCCTGACAGAAGCCCTGGAATGTGGGTCTTTAAGTTCCTATTTTTGAG ATGGGAAAGCTGAGGCTCACGGAAGGAGGTGACCAGCTCAAGTCTCCTAC CGTCCATGCCAAATTAGAATTCCAGCCTGCCTCCTGACTTCAAGTCCAAA GTTCTTCCCACGCACTAAAGCTAGCTCTTCAGTGTCCTTTCTTAGGAGGT ACTTCCTCCCGCACCACTGACCGCCCCCTCTCTATTTCACCCCCAGCCCA TCCGGAAAGGCCAGCGGGACCTGTATTCTGGCCTGAATCAGAGACGCATC tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggggggtggggcaggacagcaagggggaggattgggaagacaatagca ggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGGT GCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGGG GGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTAA ACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGGG GGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCAA CGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCTC CTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAGT CGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCGT CTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTCC CTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCTG CTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACAG ATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccacc ATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGGT CGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAGG GCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCACC ACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCTA CGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGACT TCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTTC TTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGGG CGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAGG ACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAAC GTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCAA GATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTACC AGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCAC TACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCGA TCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGCA TGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaaa taaagcaatagcatcacaaatttcacaaataaagcatttttttcactgca ttctagttgtggtttgtccaaactcatcaatgtatcttatTGACCCTCTG GAGAACACTGCCTCCCGCTGGCCCAGGTCTCCTCTCCAGTCCCCCTGCGA CTCCCTGTTTCCTGGGCTAGTCTTGGACCCCACGAGAGAGAATCGTTCCT CAGCCTCATGGTGAACTCGCGCCCTCCAGCCTGATCCCCCGCTCCCTCCT CCCTGCCTTCTCTGCTGGTACCCAGTCCTAAAATATTGCTGCTTCCTCTT CCTTTGAAGCATCATCAGTAGTCACACCCTCACAGCTGGCCTGCCCTCTT GCCAGGATATTTATTTGTGCTATTCACTCCCTTCCCTTTGGATGTAACTT CTCCGTTCAGTTCCCTCCTTTTCTTGCATGTAAGTIGTCCCCCATCCCAA AGTATTCCATCTACTTTTCTATCGCCGTCCCCTTTTGCAGCCCTCTCTGG GGATGGACTGGGTAAATGTTGACAGAGGCCCTGCCCCGTT (SEQ ID NO: 214) pARBI-610: AATAATAAAATAATAACAATACTTAACATTTATTGAGTGCTTATTAAGTC 2541 CD3_E_2_ TCAAGCACTGTCTGTACCCAACACTTATCAAGGATTCTTTTTCATGTAAT Exogenous_17_18 CCTCTCAACAACTATATGGGTTAAGTATCATTTTATTCCCATGAGTAAAG GGATGAGGAAACAGAGGGTTTGTGAGTTGAAAACACATTTCACGCTTCTC ACAGCTAGTGAGTAATAAAGCTGGGACTCAAACCCAGGGCTGTTTGACTC CAGTGCCTCTACCCACGGCCACCACTCTTTGCTTGTCAATGTTGTTCTAA ACATATTGAAGGGGGGGCTCTGACCGTGGCAAGCGTGTGAGTAGTAAGGG GAGAATGGCCTTCATGCACTCCCTCCTCACCTCCAGCGCCTTGTGTTTTC CTTGCTTAGTGATTTCCCCTCTCCCCACCCCACCCCCCACAGTGTGTGAG tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggggggcaggacagcaagggggaggattgggaagacaatagca ggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGGT GCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGGG GGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTAA ACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGGG GGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCAA CGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCTC CTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAGT CGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCGT CTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTCC CTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCTG CTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACAG ATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccacc ATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGGT CGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAGG GCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCACC ACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCTA CGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGACT TCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTTC TTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGGG CGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAGG ACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAAC GTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCAA GATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTACC AGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCAC TACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCGA TCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGCA TGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaaa taaagcaatagcatcacaaatttcacaaataaagcatttttttcactgca ttctagttgtggtttgtccaaactcatcaatgtatcttatAACTGCATGG AGATGGATGTGATGTCGGTGGCCACAATTGTCATAGTGGACATCTGCATC ACTGGGGGCTTGCTGCTGCTGGTTTACTACTGGAGCAAGAATAGAAAGGC CAAGGCCAAGCCTGTGACACGAGGAGCGGGTGCTGGCGGCAGGCAAAGGG GTAAGGCTGTGGAGTCCAGTCAGAGGAGATTCCTGCCAAGGGGGACGACC AGCCTGGGCCAGGGTGGGTGGCAAGTCCACAGCTAGGTCAGAACAGCTTC TCTAGAGCTTCTATGCACAGCTTCTATTACTGTGATGACAAGATCTCAAC AGACGGTTTCAAATCTCACATCACTCCCCTCCTTCCCATCCTAGAAAAGT GCAAAAAAGTTTATGAAAGTGATGGGCTTCCTCACATACCTGTCAATGCC TGCAGTCATCCGATTCCGCCCCTAAGCTGTGGGAAGAGAG (SEQ ID NO: 215) pARBI-611: ACTGTGACCCATGAACCAGTAGGTATTGGCGTCACCTGGGATCCTGACGT 2541 CD3G_1_ TAGTCTCTGTCAATCCTTCTCTTTAGTTCATCTATTCTACCCAAAGTGAT Exogenous_19_20 CTCATCATCTGGTATGCTGTTAGCAGTTTCTTACCTGTATAGTATCTTCC AAATAACATGCCCCAAAATCCCAAAGTTTTACCCCTACTAATTACAGCAA TGTCTCTTTTATTCTTCACCCCCTGACGCAGATATTGGCGTCACCCGAGA GCATGTTAGTAATGCAGAATCTCCCCTCCCCAGAACTACTAAATAGCACC TGAAATTTTAACAAGATCCCCATGTGATTCATGTGCACATCAAAGTTTGA GAAACACTACTCTAATGATCTCCTGGTATGCAGAAGCAGGGAGAATTTCA GAGGCAAGATCCTTAATAGAACCACGGCTTTTCTCATTTCAGGAAACCAC tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggggggtggggcaggacagcaagggggaggattgggaagacaatagca ggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGGT GCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGGG GGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTAA ACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGGG GGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCAA CGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCTC CTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAGT CGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCGT CTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTCC CTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCTG CTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACAG ATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccacc ATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGGT CGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAGG GCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCACC ACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCTA CGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGACT TCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTTC TTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGGG CGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAGG ACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAAC GTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCAA GATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTACC AGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCAC TACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCGA TCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGCA TGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaaa taaagcaatagcatcacaaatttcacaaataaagcatttttttcactgca ttctagttgtggtttgtccaaactcatcaatgtatcttatTTGGTTAAGG TGTATGACTATCAAGAAGATGGTTCGGTACTTCTGACTTGTGATGCAGAA GCCAAAAATATCACATGGTTTAAAGATGGGAAGATGATCGGCTTCCTAAC TGAAGATaaaaaaaaaTGGAATCTGGGAAGTAATGCCAAGGACCCTCGAG GGATGTATCAGTGTAAAGGATCACAGAACAAGTCAAAACCACTCCAAGTG TATTACAGAAGTATGTAATCCCCTTTGGTCTGTTTGTTGTGAAATTAATC AGTATTTGCTGTTCTGGTGAGCTTTTTATCTGGGGTGAAAGTGGAAATAG ATCCTCAACAGTAATATTATCGCCTGTTCTCTTAATTTCAGCTTGCCTCT TTTAAAATACTGTAAGATACTTCCCTCACCCTATTGAAAAACTACAGCCA GTCCTGTAAAATTTTGTTTACCTTTGGGTGGGCTCCATGG (SEQ ID NO: 216) pARBI-612: TTCGAGACAAGCCTGGCCAACATGATGAAACTCTGTCTCTACTAAAAATA 2541 CD3G_3_ CAAAAAAAATTAACCAGGCGTGGAGGCGCGCGCCTGTAGTCCCAGCTACT Exogenous_21 CAGGAGGCTGAGGCAGGAGGACCACTTGAACCCAGGAGGTCGAGGTTGCA GTGAGCTGCGATTGTGCCACTGCACTCCAGCCTGGGCAACAGAGAAAGAC TCCGTCTCAAAAAAAAAAAAGAGAGAGAGAGAAAAAGAAAAAAGACAGAG CCTCCATCTCCTTGTCCTCTTTCCATCCTCAGGACCATGAAGTACCCACT CCAAATTCTCACATATAAAAAACATTCAATAAACATGCATCAAATTAATT AATAGAGGATGGAAAAAATGACTTATGACTGTGCTGTCCTTTCCAGCCCC TCAAGGATCGAGAAGATGACCAGTACAGCCACCTTCAAGGAAACCAGTTG tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggtggggcaggacagcaagggggaggattgggaagacaatagc aggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGG TGCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGG GGGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTA AACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGG GGGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCA ACGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCT CCTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAG TCGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCG TCTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTC CCTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCT GCTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACA GATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccac cATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGG TCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAG GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCT ACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGAC TTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTT CTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGG GCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAG GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAA CGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCA AGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTAC CAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCA CTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCG ATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGC ATGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaa ataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgc attctagttgtggtttgtccaaactcatcaatgtatcttatAGGAGGAAT TGAACTCAGGACTCAGAGTAGGTGGGTTCTTCAATGCCAATTCTAATAAA GGACCCTTGCATCAACTGCCCTCGCAATTGCTTCTAAGTCTAGCTCCCTT CCCTAAGCGGCTATAAGCATCAGACTCTGGGGATCAGGGATTGGGACGTG GTTTGGGGTACTCTTTTCTAAAAATTCTGGGGCCATACTGATTGTCTTGG CCTAGGTAAATATGAATTTTATGTATCTGTAAATCCTGTCAGAGCAGGGC CTCAAGCCATAGAGATGCTGAATATTAATCTTAACCTACATTTGAATTTC TCATTATCTACACTATTAACATTTTGGGCTAATTAATTATTTGTGATGAG GGGCTAGCCTGTGCATTGTAGGAGTTATGGAAGCATCCCTGGCCTCTCTC CACCAGATGCTGGTAGATTGTCCAGTGTGACAATCAAAAAT (SEQ ID NO: 217) pARBI-613: GACCTCAGGTGATCTGCCCGCCTCAGCTTCCCAAAGTGCTGGGATTACAG 2541 CD5_1_ GCATGAGCCACCACACCTGGCCTAAAATTAATTTTTAAAGATCTCTTAAA Exogenous_22 AAGCAGACACCAGCCCCAATCTCAGACCCCTTGAGACAGAATTTCCAGGA CAGGGGCCATCCTGCTGGACAGTGGGTGCCGAGAACACCTTGCCCATTTA TCTGAGCTCCCTTCTGACTCTGAAATCTGGAGCCCCACCCTCCTGGGTCT AGCTTCGGGGCTGCCTGGGTCAGGGTCCTCTGGGAAGCCCCTGCAGTGCC CCAGAAGGGACGAAGCTCACAAGGGGCAAGGCAGGCAGCCCACGGGGCAG GAGGGAGCTCAACTGGGCGTCCTAGGGAGAGGGCAGTGAGGGGTGCCAGT GGGGAACCCCTCCCAGCCTGACCCCCACCACACCTTTCTGACCCCCAGAT tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggtggggcaggacagcaagggggaggattgggaagacaatagc aggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGG TGCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGG GGGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTA AACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGG GGGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCA ACGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCT CCTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAG TCGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCG TCTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTC CCTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCT GCTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACA GATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccac cATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGG TCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAG GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCT ACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGAC TTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTT CTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGG GCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAG GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAA CGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCA AGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTAC CAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCA CTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCG ATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGC ATGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaa ataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgc attctagttgtggtttgtccaaactcatcaatgtatcttatTTCCAGGCA AGGCTCACCCGTTCCAACTCGAAGTGCCAGGGCCAGCTGGAGGTCTACCT CAAGGACGGATGGCACATGGTTTGCAGCCAGAGCTGGGGCCGGAGCTCCA AGCAGTGGGAGGACCCCAGTCAAGCGTCAAAAGTCTGCCAGCGGCTGAAC TGTGGGGTGCCCTTAAGCCTTGGCCCCTTCCTTGTCACCTACACACCTCA GAGCTCAATCATCTGCTACGGACAACTGGGCTCCTTCTCCAACTGCAGCC ACAGCAGAAATGACATGTGTCACTCTCTGGGCCTGACCTGCTTAGGTGGG TAACTAGCCAGCCACACGGGCACCCTGGGCCTGGGCGCCAGCCCCGAGGA GACTGCCCGAGGCCTGTGATCTAGGGTCTGAGCAGGCTGGTGGAAGGGGT GGGGGGACCCCAGTTTATAACCACTCCCCAAGACACATACC (SEQ ID NO: 218) pARBI-614: GGACAGGGGCCATCCTGCTGGACAGTGGGTGCCGAGAACACCTTGCCCAT 2541 CD5_2_ TTATCTGAGCTCCCTTCTGACTCTGAAATCTGGAGCCCCACCCTCCTGGG Exogenous_23 TCTAGCTTCGGGGCTGCCTGGGTCAGGGTCCTCTGGGAAGCCCCTGCAGT GCCCCAGAAGGGACGAAGCTCACAAGGGGCAAGGCAGGCAGCCCACGGGG CAGGAGGGAGCTCAACTGGGCGTCCTAGGGAGAGGGCAGTGAGGGGTGCC AGTGGGGAACCCCTCCCAGCCTGACCCCCACCACACCTTTCTGACCCCCA GATTTCCAGGCAAGGCTCACCCGTTCCAACTCGAAGTGCCAGGGCCAGCT GGAGGTCTACCTCAAGGACGGATGGCACATGGTTTGCAGCCAGAGCTGGG GCCGGAGCTCCAAGCAGTGGGAGGACCCCAGTCAAGCGTCAAAAGTCTGC tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggtggggcaggacagcaagggggaggattgggaagacaatagc aggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGG TGCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGG GGGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTA AACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGG GGGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCA ACGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCT CCTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAG TCGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCG TCTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTC CCTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCT GCTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACA GATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccac cATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGG TCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAG GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCT ACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGAC TTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTT CTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGG GCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAG GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAA CGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCA AGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTAC CAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCA CTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCG ATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGC ATGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaa ataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgc attctagttgtggtttgtccaaactcatcaatgtatcttatCAGCGGCTG AACTGTGGGGTGCCCTTAAGCCTTGGCCCCTTCCTTGTCACCTACACACC TCAGAGCTCAATCATCTGCTACGGACAACTGGGCTCCTTCTCCAACTGCA GCCACAGCAGAAATGACATGTGTCACTCTCTGGGCCTGACCTGCTTAGGT GGGTAACTAGCCAGCCACACGGGCACCCTGGGCCTGGGCGCCAGCCCCGA GGAGACTGCCCGAGGCCTGTGATCTAGGGTCTGAGCAGGCTGGTGGAAGG GGTGGGGGGACCCCAGTTTATAACCACTCCCCAAGACACATACCCAGGAG GGGGACTGGAAGGGGCCAGCACCCATCTGTAGGATGGCAATGGAGGACCT AGTTCTGCCAATCACTGACTTCATCGTCGCCTCTGAACCTCCATTCTCCC ATCTGTGAAGTGGGGTGGTACTTCCCGCCTCGCAGGAGGCT (SEQ ID NO: 219) pARBI-615: tgctgggattacaggcatgagccaccacacctggcctaaaattaattttt 2541 CD5_3_ aaagaTCTCTTAAAAAGCAGACACCAGCCCCAATCTCAGACCCCTTGAGA Exogenous_24 CAGAATTTCCAGGACAGGGGCCATCCTGCTGGACAGTGGGTGCCGAGAAC ACCTTGCCCATTTATCTGAGCTCCCTTCTGACTCTGAAATCTGGAGCCCC ACCCTCCTGGGTCTAGCTTCGGGGCTGCCTGGGTCAGGGTCCTCTGGGAA GCCCCTGCAGTGCCCCAGAAGGGACGAAGCTCACAAGGGGCAAGGCAGGC AGCCCACGGGGCAGGAGGGAGCTCAACTGGGCGTCCTAGGGAGAGGGCAG TGAGGGGTGCCAGTGGGGAACCCCTCCCAGCCTGACCCCCACCACACCTT TCTGACCCCCAGATTTCCAGGCAAGGCTCACCCGTTCCAACTCGAAGTGC tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggggggcaggacagcaagggggaggattgggaagacaatagca ggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGGT GCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGGG GGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTAA ACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGGG GGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCAA CGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCTC CTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAGT CGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCGT CTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTCC CTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCTG CTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACAG ATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccacc ATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGGT CGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAGG GCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCACC ACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCTA CGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGACT TCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTTC TTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGGG CGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAGG ACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAAC GTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCAA GATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTACC AGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCAC TACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCGA TCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGCA TGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaaa taaagcaatagcatcacaaatttcacaaataaagcatttttttcactgca ttctagttgtggtttgtccaaactcatcaatgtatcttatCAGGGCCAGC TGGAGGTCTACCTCAAGGACGGATGGCACATGGTTTGCAGCCAGAGCTGG GGCCGGAGCTCCAAGCAGTGGGAGGACCCCAGTCAAGCGTCAAAAGTCTG CCAGCGGCTGAACTGTGGGGTGCCCTTAAGCCTTGGCCCCTTCCTTGTCA CCTACACACCTCAGAGCTCAATCATCTGCTACGGACAACTGGGCTCCTTC TCCAACTGCAGCCACAGCAGAAATGACATGTGTCACTCTCTGGGCCTGAC CTGCTTAGGTGGGTAACTAGCCAGCCACACGGGCACCCTGGGCCTGGGCG CCAGCCCCGAGGAGACTGCCCGAGGCCTGTGATCTAGGGTCTGAGCAGGC TGGTGGAAGGGGTGGGGGGACCCCAGTTTATAACCACTCCCCAAGACACA TACCCAGGAGGGGGACTGGAAGGGGCCAGCACCCATCTGT (SEQ ID NO: 220) pARBI-616: TGTCCAAAGTGTGTGTCACCTGCTCTGTGTTCACCCACTGGGTTCATCAG 2541 chr2_1_ AGGGGCACTTGAACAAAACAGGAATGTCACTGGAGAGGGGAACAAATGCT Exogenous_106 GTCCCCAAGTCCTTCCCTGCCGTTTAGAGTAACGCAGCCCCTGTCCGCAC ATCCTCACCCCCATCTGCCCTCTGCTTGCCAGAATTATCAGAATCTCTCA ATTCTAATTTCAGCCATTCTCTGTGAAAATGAAAAGTTAGTGGCAGGCAA TGGACACCCTATTGAAATAGAAATTGTTCTTCATATGTCAGAAAAGTCCC AGAGTAATCTGGGAGTAATCTGAGCAAACTGATTCAAAGGATTATTACAA TTTCTTAGTCTTCTACTGCTGCAAAAGTGGCACTCTTTGATTCTCATGTT TGAGGTTCTAATTACTACCTCATGCAGGACTTACAAGAGATCAGCCATGC tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggtggggcaggacagcaagggggaggattgggaagacaatagc aggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGG TGCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGG GGGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTA AACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGG GGGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCA ACGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCT CCTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAG TCGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCG TCTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTC CCTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCT GCTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACA GATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccac cATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGG TCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAG GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCT ACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGAC TTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTT CTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGG GCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAG GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAA CGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCA AGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTAC CAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCA CTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCG ATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGC ATGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaa ataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgc attctagttgtggtttgtccaaactcatcaatgtatcttatTGCTTACAT TGTGTGTAGTGAATCAATGTTATTCTGAAGGGCCTCCTGGAAGCAGTAGG GATTGGAGAATGTATGTCTTTTAACATGATCATTCCCATCCTATGGGGGT TGGGCACTCTTAACTCATGTTGCAGATGAGTAAACTGAGGCTTTTGAGAG TTTGAGGGTGCACAGCTTGAGGCAGAGGTGGAATTCCCAGCTTCATCCAA GAAATCACGAATCTCCAATGGCTTGCACACATCCAGGTTTGCTGGATCCT ATTTGTCAAGACCACATGCAGATTAGGAGCCTTCAAACAAATTATTTCCA AGGGAGGGCATCTTTCTCTGTAACCCCTACCCTGGCAATTCTCCATGGCC TCAGGGAGTCCAGCCGGAAGCTCTGCAATGAGTCCTGGGGAACTCTCATC TGGAAAAAGGCGCTCATGACAGGTGGTGCCAACCTTCCACG (SEQ ID NO: 221) pARBI-617: CTACAAGCATGCATCACCAGGCCTGGCTAATTTTTCTATTTTTTGTAGGG 2541 chr2_2_ ACAGGGTTTTGCTGTGTTGCCCAGGCTGGTCTCAATTTCCTGGACACAAG Exogenous_107 CAATCTGCCTGCCTCAGCCTCATAATGTTTTTTTTTAATGTTAAACCCCT TCTTGTGTTGAGAGGCCTGGCTTGTGTGTTTGAGCGCAGCTGCCTACCTT TGCAGCTGTAAATGTCAGGGCTCCAGGAGGGCTGGCTGGGTCTCCCCTCC TCCCCTGTTGGGTGGGTGGGGCCCTGAGCTTATCCAGGCATCCCTAGGTG GATGGATGGGTGGATGGGTAGGCCCTGGCCTTCAGGGAATGCACTGTGTG TGGTCTTCCAGCCCCAGCAAAGTCTCAGGGGGCCCGAGCGGGAAGTCGCC ATCCTCAGGGGGTCCGAGTGGGGAGTCACCATCCTCACGGTCAGCCGTGG tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggtggggcaggacagcaagggggaggattgggaagacaatagc aggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGG TGCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGG GGGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTA AACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGG GGGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCA ACGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCT CCTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAG TCGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCG TCTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTC CCTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCT GCTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACA GATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccac cATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGG TCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAG GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCT ACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGAC TTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTT CTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGG GCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAG GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAA CGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCA AGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTAC CAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCA CTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCG ATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGC ATGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaa ataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgc attctagttgtggtttgtccaaactcatcaatgtatcttatTTTCGAATT GAGCTCCCAGGGGCGGAGACTCCATTCCCCCGGGGAACAATCTGATAGCC AGAAATGAGAGCAGTGAGTGACTTGGACGCTGTGTGATTCAGGAGGGTCC CAGTCCCTCAGAGTCGCCTGGACACCAGATTGTATTTACAAAACAGATGG ATGAGGAAATGGAGATTGGGTCTTCAAAAGAATTAACCTCAGATTTTATT TATTTATTTTAAAAATTTTAATTTTTCCCAAGGACCCTTTCCAAGTCAAG AATTGTTTTAAAAAGAAGCCTGGTGAAGGCAAATGCTGTCTGTGGCCGAA GGCACCGTGAGGAGCTGAGGCTGACTTATGTTTCTCCTCTGGGCTATGTG CGCCTCTAAGGAGTTCACACACTTTAACCCCCTTAGGAACCCTACATGAT CATCCTCGTTTCTTAAAGAGGAAGCAGAGCCAGCAAACAAG (SEQ ID NO: 222) pARBI-618: TCTTGATGGATACCCATCCTTCTCTGCTGTTCTCTCTTTCCTCCATCCTC 2541 chr2_3_ AAGCACAGGGAGCTTTTGGCGAGGTTAGCTAACTTCGGGGCAGCTGTGAA Exogenous_108 TTAGATACTGGCTGTAAGTTACCTCCAACATCCAGAGCATTTAAACGCTC GGCTGTCAATGATTGTCATTTTCATTGTTTACATATAGTCACATCCTCAG TTTGTGGGGTGGATGCATTTGATTCAATTCAGATATGCCTTTCAAGCTGG CCCTGCTGAGCTGGGCCTGAAAGGCACATGGCTCATCTCATCTGCATGTG ACAGTCTCTTTGAAGTCCCCAGAAGTGCCTGCTCAGAGTCATGCTCGGGA GGGAAGAGGTAAGGCACAGAATGAGTACAGACTGCCACCCAGCCGCAGGA AAGGTGACTGCACACCAGAGGTTCCTGGTAAGGAGGTGACTCACCCGCGA tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggtggggcaggacagcaagggggaggattgggaagacaatagc aggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGG TGCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGG GGGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTA AACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGG GGGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCA ACGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCT CCTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAG TCGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCG TCTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTC CCTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCT GCTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACA GATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccac cATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGG TCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAG GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCT ACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGAC TTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTT CTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGG GCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAG GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAA CGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCA AGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTAC CAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCA CTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCG ATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGC ATGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaa ataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgc attctagttgtggtttgtccaaactcatcaatgtatcttatCTGTCTCAC CTGTCCAAAGTGTGTGTCACCTGCTCTGTGTTCACCCACTGGGTTCATCA GAGGGGCACTTGAACAAAACAGGAATGTCACTGGAGAGGGGAACAAATGC TGTCCCCAAGTCCTTCCCTGCCGTTTAGAGTAACGCAGCCCCTGTCCGCA CATCCTCACCCCCATCTGCCCTCTGCTTGCCAGAATTATCAGAATCTCTC AATTCTAATTTCAGCCATTCTCTGTGAAAATGAAAAGTTAGTGGCAGGCA ATGGACACCCTATTGAAATAGAAATTGTTCTTCATATGTCAGAAAAGTCC CAGAGTAATCTGGGAGTAATCTGAGCAAACTGATTCAAAGGATTATTACA ATTTCTTAGTCTTCTACTGCTGCAAAAGTGGCACTCTTTGATTCTCATGT TTGAGGTTCTAATTACTACCTCATGCAGGACTTACAAGAGA (SEQ ID NO: 223) pARBI-619: TGAGCAAATAGTTCTATCCTTTTTGACACATTTTCCACAGTTTGTATAAT 2541 chr3_1_ TTATTCATAGAATCCTTCTTAGACTACTAATAAATTCCTATGGGACATAG Exogenous_112 TCCTTTTATATGCGGAGGAATCCACCACCATGAAATGTTCACAAGAAATG ACTTTTAAAATTTTATTTGTGGCCATGCTGATTATGAATTTCTGAATTAT GTCTGGGCTGCTCTTTTTCTCGGAATATGCAAAGTGGGATACTGGCATCT CTGTATCTGGAGAGTTACAGCTTCTGAGTTAAAAGATGAATGGGAAGTAG CAGGAGAGGGAGAGATCAGAAATCTCCACTCCCTGAAACACAGACACTTC ACATACCCAGGAGGTAAATTCCTGGCATGAATTGTAACAAGTATACACAG GCATTTGTACATTGTTTGCTTATGTGCTAGATGACATGGCAGTCTATGAA tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggtggggcaggacagcaagggggaggattgggaagacaatagc aggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGG TGCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGG GGGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTA AACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGG GGGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCA ACGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCT CCTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAG TCGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCG TCTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTC CCTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCT GCTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACA GATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccac cATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGG TCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAG GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCT ACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGAC TTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTT CTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGG GCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAG GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAA CGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCA AGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTAC CAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCA CTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCG ATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGC ATGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaa ataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgc attctagttgtggtttgtccaaactcatcaatgtatcttatCAGGGGCCC CACATCTGCAGGCAGGACATTCAGCCCTGTATCTTGCCACTACCATGAAA TCTGAGGTTCTTTCATTTTTGTTTTTCTATTCTGATGAGATTGAGGTACA CTGACTCACTGGTAAAGAAGACAATTTATTTGTGAACTTGTAGTCACTCC ATTTGACGGCCTCCCTCTACTCACTGAGGCACAGCACGTGTGGAACAGGA AAAAGGCCTCTGGAGAGCTTGTGTTCCACTGGTTTCTTTGACTTTTGCCA ACCAACTTTGCAAATGTTCTCAAAAGGCATTCATTACCTCTGCAGGTGTG AAATAACTCTGCAGTTCAAGGGTTTGTGAAATCACGTGAAACAGCTGAAA GCTGTTCTGGCAAAGCGACCAGAGTCAGAGTTAGCTGTCACCAAGCCACA TCCTGGCTTAAACGATGTTATTTATATCTAACTGCACTCGA (SEQ ID NO: 224) pARBI-620: AGATGCCCATTGTACCACCTACCATTGCTTGACCCCCGCCCATGAGCTGT 2541 chr3_2_ TCCTTCATACTCATAACTAAAATGCCAAATGTGTGACTTGAGTAATTAAC Exogenous_113 TAGAGACCTCCAGAAGGCTTTCCTTCTTATTCTGGGTACATTGATATTTA CTTTAGTATTTGTTTTGCCTTTAAAGTCATGTTCCCTTTTGATTCTTTAA TATCATATCCAAAACGAAGTCCTTTTAACCTATAACAATGCAAATCAAAA GTGCTGTCTTTCAGCTACCTCAAGGATTTAGCAAGCGTTCCTGGAAATCT AACCAACCTTTTAAAAAGTATTTCTATGGGAATGTATTTCTGAGCCCTAA ATTATGGATTTTGAAACTAAAATAGAAGAATACTGTAAGTGCAAAACTTC CAGTAGATAACAGAATTTGCAAAATCTCTAATCTCTTGTATGTGCCCCAA tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggggggcaggacagcaagggggaggattgggaagacaatagca ggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGGT GCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGGG GGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTAA ACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGGG GGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCAA CGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCTC CTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAGT CGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCGT CTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTCC CTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCTG CTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACAG ATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccacc ATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGGT CGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAGG GCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCACC ACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCTA CGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGACT TCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTTC TTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGGG CGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAGG ACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAAC GTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCAA GATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTACC AGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCAC TACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCGA TCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGCA TGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaaa taaagcaatagcatcacaaatttcacaaataaagcatttttttcactgca ttctagttgtggtttgtccaaactcatcaatgtatcttatGTAGTACTCT CTATAGGCCATTGGAACAGAACATCTGGCTTAAAATAATCTCAGCACCAA AAACGGTCTAACTATGGTCTCCACAACATAACCCGATGGTTCTCATATTT TAGATGCTTGATGCACAAGTTGAACTGGCTAACAATTCAGACTTCAGGAC TTGACTCCCCTGGGAATCTGCAGTTGTTTCCCTCCCCTTGAGTGAGACTG ACTGTTGGGAGTAGAATAACCGCATGCTCTGGGTCAGAATGCCTGGCCCT TCAGAGTAACGGCAGCCATGATCGCTGAGCTCCTACTGTGAGGCACATAC TTCGCCACACTAACACATGGAATCCTTGCAAGCTCCTTACATAATCAGTA TTTTATGCACTTTGCAAATGAGAATACTAAGGCTCAGAAGAGGCAAGCTC CTCTCCCACTGTTAAACAGCTAGTGATGAACAGAGCTGGG (SEQ ID NO: 225) pARBI-621: ACTCCCCTGGGAATCTGCAGTTGTTTCCCTCCCCTTGAGTGAGACTGACT 2541 chr3_3_ GTTGGGAGTAGAATAACCGCATGCTCTGGGTCAGAATGCCTGGCCCTTCA Exogenous_114 GAGTAACGGCAGCCATGATCGCTGAGCTCCTACTGTGAGGCACATACTTC GCCACACTAACACATGGAATCCTTGCAAGCTCCTTACATAATCAGTATTT TATGCACTTTGCAAATGAGAATACTAAGGCTCAGAAGAGGCAAGCTCCTC TCCCACTGTTAAACAGCTAGTGATGAACAGAGCTGGGACCTAAACTTGTC TGATTTCCCACACTGCTGCTCCATCAGATTGTTTGCCAGGTTAAGGATTC ATGCCAGGACCTTAAATCCCTCTGAATCGACAGAGCTAACCTAAAGGGCA CCTTGATTTCAGGTTTTGTTCCCAAACCTATGACTGTGGCCTGGCCACGT tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggtggggcaggacagcaagggggaggattgggaagacaatagc aggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGG TGCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGG GGGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTA AACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGG GGGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCA ACGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCT CCTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAG TCGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCG TCTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTC CCTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCT GCTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACA GATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccac cATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGG TCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAG GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCT ACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGAC TTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTT CTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGG GCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAG GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAA CGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCA AGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTAC CAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCA CTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCG ATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGC ATGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaa ataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgc attctagttgtggtttgtccaaactcatcaatgtatcttatAATGAAGAC CCGGTTGGTTCTAGCTTCACAATGAGCTCATTGGCATTGTCCCCACAACC ACCACCCCCTCTCCACCTCTGATCATCCACTATCATTTCTATATGGCCTT TGCCAGGCTTGTCCTCATATCAAAGCTTCTGCCCTCTATGTTTAAATGCA TAAGCTTGACAAATGTTTTCATTTGAGAAGGCAAGCTGCCTCCCCGTGCT TGGTCTAATGTGAGCTTGGTAGAGTGACACTGCTGGATTCAGTTTTAGTC ATCAACACTTCCAGCTGATGCTGATCTGAGTAATTGTCCTATTTAGACTC TGACAGGTGAAAGTCATAGATTCAGGAAACGGAGACTCCCCTCACCCTGC TATGAAGGGGAGTTTGAGAGGGTTGGGCAGCTCTTCCCGGCAGTGGCAGG CAGTGTGGGGACACACGTATGGATAAATTGTTAATGCTCTT (SEQ ID NO: 226) pARBI-622: CTGCAGGTGGCAGACTTACTCATTCTCATGCTTCAGCTCCAGGGGTTCCC 2541 chr3: CTGTATGAGTCTTCTCTGCCCGCTCCCACACCCCAGGTAGCATTGCCCCT 1865_ GCTTGCTTTGTGCCCTCACTGTCTCCACATATATACACATCTCTTACAAT Exogenous_109 AATATTAAATACCAGCCAACATTTATATAGTACTTCTTATGTACCAAGTA CTGTTGTCTATCTATCTATCATCTATCTATCTATCTATGTACACACACAC TCATATTATCTTCACAATGCTATGAAATAGGTATTATTGCTGTCCATTTT ACAGATGGGGAAACCAATCTATAATTGGTTTAGTGAGATTTACTGAGCAT ATCTACAGGTATTTGTCCTTGGTTATAGGAAACCAATCTGTCACTGGTTT CCCCTTCTGTAAAATAAGTTAAATAATTTGCTTAAGCTCACTCAGATAGT tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggggggcaggacagcaagggggaggattgggaagacaatagca ggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGGT GCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGGG GGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTAA ACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGGG GGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCAA CGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCTC CTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAGT CGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCGT CTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTCC CTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCTG CTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACAG ATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccacc ATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGGT CGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAGG GCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCACC ACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCTA CGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGACT TCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTTC TTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGGG CGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAGG ACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAAC GTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCAA GATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTACC AGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCAC TACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCGA TCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGCA TGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaaa taaagcaatagcatcacaaatttcacaaataaagcatttttttcactgca ttctagttgtggtttgtccaaactcatcaatgtatcttatGTGTGGTTGA GTCATAATCTGAGCTACTCTCTGGCCCAAACTGCTTATTGTAACCACCAC ATCATATGGCCACTTCAGCACCAGTCACACCTCATTTTACTTGTTGATGT CTCAGTTTTCCCCTCTATTCTGCGGGCTACTTGAGAAGACTTCTGTCTTC TTCCTCTAGGTATCTCTGTGCTTGACTCAGTATCGAGACCATAGGTGTGT GGGTGGGGTTAATCCACACTTGCTGAACACATGTGTCCGTGCCCAATGTC CCTGTGCATAGTGGCTACTTCCCATGGTTCACACGCATCTTCATGGCTCT TGTTTCTTCCATTCACCCATCCCATGTCCCTGGCCCTGTTCTCAGTGCTG AGGATGGAACCTCATTGCTGCTGCTTCCAGCCCTGGCCCTGCCCCTGTTT GGGGCTCAGTGCTTCATCCCGTGGAGCTCAAGGGGTGCTG (SEQ ID NO: 227) pARBI-623: CTTCTCATCCTCACACCCATTCATTTCTTAGTTTCCCATACTCTGACTTC 2541 chr3: TACTACCCCAAGTTCATAAAGACCTCATGAACACAACCTGGAGAACGCCC 1865_ TCAGCCTCATCTCACCTCCCTGCAACACATGGCTCACTTGTCTGCTTCTT Exogenous_110 CCTTCTTAGAGTCCTCTCTCTGGTTGACCTGTCAAAGATGGGATTCTCCT GTTAACGACCCTGACTCCTTTGTCTCCTTCTACAAAGATAATGCACTGAC TTTCCAAATGTACTAACTAGTATGATCATCACTGTCTACTGTTGCATCGA ACCACTACTAACCTCAATAGGGAAGGCACCAGGTTCAAGAGGCCAAAGAA GAGACCCAGAGCCAGCAAATGAGACATGGGGTTTTATGAGGGGCTCACAT AGAGGGGAGAGAGTCCAGTGGCAGTGGGCTGGGCAGGAGAACCACCTTAC tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggtggggcaggacagcaagggggaggattgggaagacaatagc aggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGG TGCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGG GGGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTA AACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGG GGGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCA ACGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCT CCTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAG TCGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCG TCTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTC CCTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCT GCTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACA GATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccac cATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGG TCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAG GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCT ACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGAC TTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTT CTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGG GCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAG GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAA CGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCA AGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTAC CAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCA CTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCG ATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGC ATGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaa ataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgc attctagttgtggtttgtccaaactcatcaatgtatcttatACGTGGTCC AGTGGTGGGAGGCTGGAAAAGAGATCTGTCTTGCCTACAGTTCAGTGGTG GCGGGTTGGGCAGGAAAACCACAACTGCCTGCAAACATCATGCAGTTTAC ATAACATTGTCACTTAGCAGCCTCCCCTAACGACCTCCACCTGGCAATCT TCATTTAACCCAAAACTCAAGGCCTCAATCCCCTGTATGGACCATGTTTC ATGGGACAGGCCAGGGGCTCAGAAGTTTATCATAGATAAGGAATGAATCT CCGGGTTGGCCACTCCTGGATTCCCTAGCTTGGAACACACATTCAAGTGT ATCTGCCATTCTGAGTGTATTCTTCAATTATTGCTGTCAGGTGTGTTGAC CCTCTATCTACATAAGTATTTCTCAAACCAGGGCCCTACGATCAGCATCA GCATCACCTGGGGTATATGTGAAAAATGCAGCTCCTGGACC (SEQ ID NO: 228) pARBI-624: GGGCCAAGCAAGCTCTGAGACAAGCTGTCTCACCTCCTCATCTCCTGAGC 2541 chr3: ACAACTCACCCACAGCTCTGCCCTCGCCTCCACCTACATCATGCCTTCCT 1865_ GCATGCCAACTTCCCTCTGAGGTTCCCTTCACCCACTTACAACCACAGCC Exogenous_111 TCCTTCCGGACCTCCCCTACACGATGAACAGTCTGCCAGTTAATATACAT CAAGCTGCCACCATGAAACAGCTCGGAAAGCCCCATTTGTTCTCAATAGC ATTCCAGCTAGAAAGACCATCCACCACCCATGGCCACTGAAGAAGGTCCT CCCCTGAGTGCCAGGCAGTGGGGCTCCTAACTCTGGGCCCCAATTAGCTT AGCTTGGGGGAGCAAAGGACAGGAGCACTGTTCACCCAGTTGGACTAACA GCTCCACACTGTGCCTAATAGGAACCTCCTTCTCACAGGTGGCACCTTTG tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggtggggcaggacagcaagggggaggattgggaagacaatagc aggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGG TGCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGG GGGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTA AACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGG GGGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCA ACGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCT CCTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAG TCGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCG TCTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTC CCTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCT GCTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACA GATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccac cATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGG TCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAG GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCT ACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGAC TTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTT CTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGG GCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAG GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAA CGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCA AGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTAC CAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCA CTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCG ATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGC ATGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaa ataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgc attctagttgtggtttgtccaaactcatcaatgtatcttatTGAATCAGG GTCTGTCCTGAGCCACCCTCCCTTGGAGAGATCTGAGCACTGAGAACCAC AGAGCCTGGGACACCAGCACAGGGTTTGAAAAACTCCACTCCCAGTCAGC TGCGGTGGCTCATGCCTGTAATCCCAGCACTTTGGGAGGCTGAGGTGGGT GGATCACAAGGTCAGGAGTTCGAGACAGCTTGACCAGCATGGTAAAACCC CATCTCTACTAAAAGTACAAAAATTAGCCAGGCATGGTGTTGGGCACCTA TAATCCCAGCTACTCGGGAGGCTGAGGCAGGGGAATTGTTTGAACCTGGG AGCTGGAGGTTGCAGTGAGCCGAGATCGCACCACTGCACTCCAGCCTGGG GGCTGAGCGAGATTCTGTCTCAAAAAAATGGGAAAAACATCATTCCCATG TGTGTCCTGGTTGCTTGTCTCTTTCAAAGTACTGCATACT (SEQ ID NO: 229) pARBI-625: CCACATCATGGACCAGAAGCCACTTGGTCTGGTTGTGAGGATGACGGAAC 2541 chr8_1_ CTTGCCCGAGGAAATGGGCAATGAACAGGAGCCTGTTTGTTTCTCCTTCC Exogenous_115 AGCTCCTGACCTGTCTAGCTCCTGACCCTCTCAGTGTCATTCTTGCTCAC CCCTGGCCCTGCTCCCTGGATATGCAGACAGGATGGGCTTCTTCCCTTTA GACCTTCACTTTGCTCTTTTGGACTCCTCTTCTGGCCTTGCCTTCCCCTG GTGTCATCTGCTGGCTCCCTGCCTGATACTGCCACACTGGTCATGCTGGC CTCCTGGCCTTTGGTGACACCGCATCAGCAGGCTCCTGCTTCCCTGTCAA GGGATACAAGCCAGCACCGGTCTTTCGCCACAAGGTATGTGTGTAAAGAT TGCTTCCTGTACTTGGTGTCCACAGGATTTTAAAGAACTCCTGGCCCACT tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggtggggcaggacagcaagggggaggattgggaagacaatagc aggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGG TGCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGG GGGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTA AACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGG GGGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCA ACGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCT CCTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAG TCGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCG TCTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTC CCTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCT GCTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACA GATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccac cATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGG TCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAG GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCT ACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGAC TTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTT CTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGG GCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAG GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAA CGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCA AGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTAC CAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCA CTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCG ATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGC ATGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaa ataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgc attctagttgtggtttgtccaaactcatcaatgtatcttatCGGAACTCT ACGCTTGAAACAGAAACATCTTCTCAAAACACCTCTGGTATTGGCCCATT TCTCTATTACTGTTTCTTGTGGTTTCATACTGAGGTTTTTGGTGCATTGC AATTGCCCTGGAGTTTATTTTTAGTAATAAAGACAGAGTTCAATTAATGT CAAAACATGAGGGTAATTGCAGAGGAAGGATTAGTAAGTCTAGGGGAAAA TGTGCCCAACTTTTTTTCTTCTGATAACATCTTTCACTGATCATACACCG TCACACCCACAGACACGGAGCTTCCTCCGTGATGTCTGCAATGCACTAGG CCCAGTCGGGGAGCAGTGGCTGCGCCACCACCTGAAAACGAAAGCATTTC TGAGTCTCTTTCAGTCCAGCTCATCTAGCAGCCAGCCACACAGGTGACCA CCCACCGTAGCTGTCACTATGGGTGATGATTATCATTATGG (SEQ ID NO: 230) pARBI-626: AGGATCTTTCTCTCTACAGTGCCTCTTGTGGGACGGGTTCTGCTGATGGG 2541 chr8_2_ GGCTGTGTTTGGGATCTCGCCTGGAAACACGATCTCTCCAACTTTCCCTG Exogenous_116 GGAGCAGAGCCTTGGGTCTCACTGAACCCACCTCCCCAGCACCTTGTGTG GTGACTGGCAGGAGGTAGCCATTCCCTCCTGTCCTCTCTGCCCTTGTCCA CTTTCCTGTTATGAGTCAGTTGCTGAGATAGGCTATGAGGAGTTAACAGA GTGGATAAAAAAGAGCTTTGTCCTTCTTTAAAGCCTTCAAAATATACAGA TGTATTAGTCAGGGCTTTCCAGAGAAACAGGACCAATAGAACATGAAGAT AGATGCGTATCCATGTCTATATCAATATCTACCTTGATATTTATATCTAT ATCTAGGAGATTTATTTATTATAAGGTATTGGCTCATGCAATTATGGACC tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggtggggcaggacagcaagggggaggattgggaagacaatagc aggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGG TGCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGG GGGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTA AACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGG GGGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCA ACGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCT CCTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAG TCGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCG TCTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTC CCTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCT GCTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACA GATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccac cATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGG TCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAG GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCT ACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGAC TTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTT CTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGG GCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAG GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAA CGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCA AGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTAC CAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCA CTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCG ATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGC ATGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaa ataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgc attctagttgtggtttgtccaaactcatcaatgtatcttatCAGAGGGGT CCTGTGATTTGCCACCTGCAGGCTGGAGACCTAGGAAACAGCTGTCGTGT CACAAAGGGCAGAGGGCTGGAGAACCTGTAGTGTAGATTCCAGTCCAGAT TCGAAGGCCTGAGGACCAGGAATGCCAAGGGCAGGAGAAGATCGATGTCC CAGAACAAGCAATCAGGCAGAGATGAAATTCAACCTTCCTGTGCCTTTCT GTTCTATTCAGGCCTTCAGTGGGTTGAATGAGGCCCAACCCACATTGAAG AGGACCATCTGCTTTACTCAGTCTCCTGGTTGAAGTGATAATCTCATCTG GACATACTCAGAAATGCTGTTTAACCAGCTCTCTGGAAATCCTGTGGCCC AGTCAAGTTGACACACAAAATTAACCATCACACTGCAGAGACACAGTCTA TCCCAATACCTGTATTTACTGGGTAACGTGCATCTCAGCTC (SEQ ID NO: 231) pARBI-627: TAATAATGACTCTGCTGGTAATTTGTGTCCTTCTGCTTGGAACTGTTTCC 2541 chr8_3_ TTTTTAGTTTGGTCACCCTCCCAGAGCTGGTTTCAATGGGGGCATACCCA Exogenous_117 TTATGGGATGCAGGGCATCCTGCATCCTGAGGAATTTTTTTTCCTCCAAA AATGAAACCTTGAAATGAGGACATTGTCCTGTCCACGGACTGCACAACAA CACTGAGCCTCAAGGACTCATACTGGCATTTTTCTTCTTTTGCAGAGTGT GGGCACCCTGGCTTCAAGCTCACGAGAAACCAGGTCGGGATTTAAACAAT GTTGGGTTAAAGCAAAGTTTCATAAAGACAGAATCAAGAAAAAAAGAAAG AGAAACCAATCTAAGTGCCATCCTCCCTGAGTTGCATCTTACCTGAGTCT TCAGCCGCCGCCCCCTGCTGCTGTGGGAGGAAACGGGAAAGTGACTGGCC tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggtggggcaggacagcaagggggaggattgggaagacaatagc aggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGG TGCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGG GGGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTA AACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGG GGGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCA ACGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCT CCTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAG TCGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCG TCTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTC CCTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCT GCTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACA GATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccac cATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGG TCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAG GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCT ACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGAC TTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTT CTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGG GCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAG GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAA CGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCA AGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTAC CAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCA CTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCG ATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGC ATGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaa ataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgc attctagttgtggtttgtccaaactcatcaatgtatcttatATGGGGACA GGGGTCGGGGGTGTGAGGGTTAGTGGTAGCAGGGAGGTAGCACTTCTGTG AAGCCGGAAAGGAGACCTGAACCTGGCTCTCTGCTTTGCTGCTTGCCAGC CCTCTGGCTGGGAAGAAGGCTCTTGGCTCTTGTGAGCCTCGGTTTCCTCT GCAGAGTGGCCTGTTCTGAGGACAGGCTGACATTGCTGTCCATCAGTCTT CCAGGACCCTGAGGATTTCAGCTCTACCCATGGGCACTGAACTGGAACAT TCACTGGTCATTTGCTTTCGTTTTTTTTTTTTGTTTGTTTGTTTTTTGAT ACAGAGTTTCACTCTTGTTGCCTAGGGTGGAGTACAATGGCGTGATCTCA GCTCACCACAAGCTCTGCCTCCCGGGGTCAAGCGATTCTCCCCAGTAGCT GAGATTACAGGCATGCGCCACCACGTCCGGCTAATTTTGTA (SEQ ID NO: 232) pARBI-628: AAGTCAACAACAGGCTCCAGAAAAAAATGGAAATATAAATGGTCCCCAAT 2541 chr9_1_ TTGGCATTTTGGAATTTGAGGCTTATATAGTATGAGCAAAATAAACTCCT Exogenous_118 CCTGCTTCTATGAATTCTTTGCACTTTTTCTCTATAATTCCTTATCAAGC AATGGGTTTGTCTGTTAACCAAGACTATACATAAAATGTATTCCCGTAGT TCCTTTTATTACATTTTTTCTTTGTTGTTACTAGTGGAGAAGCTAACAAG AGCAGAATTTCCTGCCCAACCGTAGCAAATACAGAAAATCAAAAAGAGAA GCTTTGATACAGCCAGTGTAGATACAGCTAGGACGTCATGAATCAGAGCT CCCACACACGATGATAATGGAAACATCTTATCCATATCTGATTTGCTCAC ATATAGTTATTTAGACAACAAATTTTACATTCCTGGATAGGGAGCCTCCG tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggggggtggggcaggacagcaagggggaggattgggaagacaatagca ggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGGT GCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGGG GGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTAA ACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGGG GGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCAA CGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCTC CTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAGT CGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCGT CTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTCC CTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCTG CTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACAG ATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccacc ATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGGT CGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAGG GCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCACC ACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCTA CGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGACT TCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTTC TTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGGG CGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAGG ACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAAC GTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCAA GATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTACC AGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCAC TACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCGA TCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGCA TGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaaa taaagcaatagcatcacaaatttcacaaataaagcatttttttcactgca ttctagttgtggtttgtccaaactcatcaatgtatcttatATTTGATTTC AATCTCATAGTCCAGGTAGGTGTAAAATGACTGCCAAGCCAGCAGACATA AATCTGTATTATCTCATTTTCATCACAGTCTCTGCATTGGGAAAAAAATG TGAGTGAAAAGACATTTTCTCCCACCTTGTTAAGTACACAATTACATAGT TTAAAGAAGATAAGAATTTCAAATTGTTAGAATGATATCATTATTCAAAT GATTCCCTGAGAACTTCAAGTATTCCCCTTCCCCCAAATGTAGGCAGGGC AAAATAGAACCAGTAGGAATGAAAAATGTTAATTGTTTCAGGTTCTGTAA TTTTGGGTTTAATGACTCAGGCTATGTGCTACTGAGTAAAATACTCATCT GGAGAAAAAGACTGTTGCATGACATATTTCTGATTAAGGACAATTTAAAC TAATTCTAAGAAAACTCTTACATACTGCACATGCCATAAA (SEQ ID NO: 233) pARBI-629: GATTAGGTCTGGAATAAGATTGCACCATGACTAAAATGACTTCTTTAGCC 2541 chr9_2_ AATGACATTCCTGAAGATTGGATCAGAGTAAAGTTAAACTAACCTACTCA Exogenous_119 AGGAAGACTTTCCTAAATCAGCCCCTTTGCTGTATAATTTAGTATGGAGA GCTGATTGTGGGTTTTTCTTGTTTCATTGTTTAATACATAGAGGTTTCAC ATTTCATTTTGAAACAGAAAGAGGGACTGGAAGAGATAAGTTAGTATGAG TGAGGAAAAGATTAGAAGGTAGAGATAGGATCGCAATGCTGTTATCACCT TTGCTTTATTGATAATCTCAGGAAGGACTAGACTGCTAAGAAAGAGGAGC CCAGATTGGAATAAAAGAGACGATGCCAGGATCAGCCTGTGAGAGGATGT CTCACTTAGTACACCTCAAGGATCATGTCAGCTGAGCATCTATACCCCGC tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggtggggcaggacagcaagggggaggattgggaagacaatagc aggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGG TGCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGG GGGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTA AACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGG GGGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCA ACGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCT CCTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAG TCGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCG TCTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTC CCTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCT GCTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACA GATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccac cATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGG TCG AGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAGGGC GAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCACCAC CGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCTACG GCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGACTTC TTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTTCTT CAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGGGCG ACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAGGAC GGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAACGT CTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCAAGA TCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTACCAG CAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCACTA CCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCGATC ACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGCATG GACGAGCTGTACAAatgattgtttattgcagcttataatggttacaaata aagcaatagcatcacaaatttcacaaataaagcatttttttcactgcatt ctagttgtggtttgtccaaactcatcaatgtatcttatGTGATGAATATT GCATACAGACTCTGCCTTCAAGTCATCCCCCATTCGACAAGCATTTATTT AAGGTTCTTACATTCCAGGACTTGTACTAGATGCTAGAAACAAACAGATA AGGAAAACGTAGGAAAATGCTTCTCTCAGGAAGCAATATTGTATATGAGT GGTGAAGAGCTTCAAATCTGGGGCCAAACTGCTGGGTTTGAAATTTGGCT TGGCCACCTACTGGCTGTGAAATCTTGGGCAACTTTCTTAATCACTCTGT GCCTCAGTTTCTTCATCTGTAAAATGAAATAGCCATAGTACTTATTACAA ATAGGTAATAATTAGTGAGAATTAATTTGAGTCAATACATAAAAATGACT CAGAATAAAAGACCAGCTAAGAATATATACACACATACTTAGCTACTTAT AATCTAGTAGGAAAGATGGGTATGTTATCAGAGGACTT (SEQ ID NO: 234) pARBI-630: CAGATAAAAAAAGCTGAAGCTCAGGGATATTGTCATTTACCCATGAGCTC 2541 chr9_3_ ACAGGCAGGAAGTGGTGAAACTATCATTAAACTACGGTTGGCCTTGTTCT Exogenous_120 TTTGTGGTTTCATTTCACTAAGCCTCTGACACCCTGACCTCAACTTCCTT CTGTAGATATGTCACCTGGCCTCCTTCAGTCTAAGACACTACACACTGGC CACACACTTTGTCATCCTCATAGACTGACAACCTGCTCTCACATACCCTG CTTTCTTACTGTGTAGGCTCTCTTTTCTTGCTCAGAATGTCCTTCCCCAC TTCTTCATGTGCCAACTCCTAGTCATCCTACAAGAACCACATTCACTTTA AGTTTCTTTTGAAGCCTTTTCAGTGTCCCATACCCTGATCCCATTACCTC TTCTATGTATGCAGGGCTTGCTTCATGCTTGTGTAACCCAAGGGCCCTGC tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggggggtggggcaggacagcaagggggaggattgggaagacaatagca ggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGGT GCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGGG GGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTAA ACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGGG GGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCAA CGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCTC CTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAGT CGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCGT CTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTCC CTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCTG CTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACAG ATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccacc ATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGGT CGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAGG GCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCACC ACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCTA CGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGACT TCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTTC TTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGGG CGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAGG ACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAAC GTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCAA GATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTACC AGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCAC TACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCGA TCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGCA TGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaaa taaagcaatagcatcacaaatttcacaaataaagcatttttttcactgca ttctagttgtggtttgtccaaactcatcaatgtatcttatATTTGATTTA ACACATTTTAAAATTCTTAACAATTTATGAAAAAGGAAACCCATATGTTC ATTCTGCCCTGGACCCTGCAAATTATGCAGCCAATCCTGTGTGTGTTTTT TCATGTTGCGTAGACTCTATTACAGTCAGTAACATATTGAGAAAGAGTGT GTGTCTCCTCTGATAGTCTATGAGCTCCTTGAAGCAAGAGGCTTGTCTTA GTCATTGCTGTATCCTCACACCAGACTCTACGGGATGCTCAGGATTCACC TGTTGATCTGTGAGGAAGCCCAAGGCTCAGTTAATGCCCCCCACCCTCCT TCCATAACAGTCTCCTGATTAGGAAACAGTGCTTGACCTCTAGACAATCT GAGTATCCACTCCAGGAAGTGGAACATTGCCTTCCCAATGTTGAAACTGT TAGCACTCAATCATGTTTCAGGAACATATAGAAGGTATGT (SEQ ID NO: 235) pARBI-631: TGGACCTTATGCAGGGTAATAAAAACCAACACGTGGGGCCTGTGCTGTAA 2541 chr10_1_ GGTGGCAGAGGGCGATGACTCACCCCACTCATGCAGGTTGAGTCATTTCA Exogenous_90 AGGCAGAGTTGTGCCAGTTCAGTACCCAGTAATATTTTCCAGTCGACGAG TATCAGTGAACAGGAGATAACCAGTCATTTCTAGATTCTGCTCAGAGTCC CAGCTTAGAGGCTCCACCAGCTCAAAGAGACGGGATGGCAAAACAGCCAC CTTAATTTCCAGATCATCTGCCAGCTCATCTGCCAGCTGACACACAGCCC ATGGCTCCACTATTGCCAGTGCTGTAGCTGCACCAGACCAATCTGGTTCA ACTTTTATGTAACAAAGTTGTGATTTGTTTTTCAGTTGCCCGGGACTCCC AGGTTGAAGATCATGAGCATGCCCTGATGAGCCAAGGCATGCAACCACAG tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggtggggcaggacagcaagggggaggattgggaagacaatagc aggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGG TGCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGG GGGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTA AACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGG GGGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCA ACGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCT CCTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAG TCGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCG TCTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTC CCTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCT GCTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACA GATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccac cATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGG TCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAG GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCT ACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGAC TTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTT CTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGG GCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAG GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAA CGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCA AGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTAC CAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCA CTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCG ATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGC ATGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaa ataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgc attctagttgtggtttgtccaaactcatcaatgtatcttatGGGAACCTA AGTGCTTGGACTGAGGAGTGGGGACTGAAATAAGAAGCGGACACTGCAGG GCAGGATCAGGATCCAATCAGATCGAGCCCTGGCATCACCTCATGGCAGG ATCTAGTCAGATCGTGGCTTTTGACATCACTTCATTGTGAAATCCAATCA GATCACACCTTATTACCCTATCCTTATAAAACCGGACCTAGCCCCCAGCT CGAGGAGGCACTGCTTTGGGAGTTATCCCAGGTGTTCTTGTTACTAGTTG CAAGGAATAAAATCCTCTTGCTAAATCCTCCTTGGTTGTGCAGATGTTGG CATGGATGTGGAGAGAAGGGAACACTTACGCTTTGTTGGTGAGGATGTAA ATTAGCTCAACTCTGTGATAAACAGTATGGAGAGTTTTCAAAGAACTAAA AATAGAACTTCTACTCAACCCAGCAATCCTATGAGTGCATG (SEQ ID NO: 236) pARBI-632: ACATCTAATCCCACACCATGACTAATAAACATGGCATGCTGCAATGGGCA 2541 chr10_2_ TTAATTTATGGGTAACAGAAGCAGATCCAAGCCTTTTAGCTAATTAAATA Exogenous_88 AAAATGCACCATACAGCAACCATCCTGCCTGCTTGGGTGTAAAATAGAGA GGAGAAAGATGGCATGCTGGTCATTGAGAGATAATTACCTGCAATTTCAG AGCTGGAATCTAGTGACAACATAAGAAAAATAATAACCTCTCTGGCTGTT AGGATTCTCAGTTCTAGCTGCGAGCAAGGAGCAAAGGCACCCACAGACTT GCCAAACCTATGTCAGGAATGACGAATAGAAGGCAGAAATCCACATCCCC AGGTGGTAAAATTGTTATCCACCTTCGTCACCTCTTTCCTAATGGAGGAA GTGAGGACAGGCAGCCTTGGAGTCCTACTTGAATGAGGCCTGGACCTTAT tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggtggggcaggacagcaagggggaggattgggaagacaatagc aggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGG TGCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGG GGGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTA AACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGG GGGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCA ACGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCT CCTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAG TCGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCG TCTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTC CCTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCT GCTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACA GATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccac cATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGG TCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAG GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCT ACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGAC TTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTT CTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGG GCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAG GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAA CGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCA AGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTAC CAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCA CTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCG ATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGC ATGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaa ataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgc attctagttgtggtttgtccaaactcatcaatgtatcttatGCAGGGTAA TAAAAACCAACACGTGGGGCCTGTGCTGTAAGGTGGCAGAGGGCGATGAC TCACCCCACTCATGCAGGTTGAGTCATTTCAAGGCAGAGTTGTGCCAGTT CAGTACCCAGTAATATTTTCCAGTCGACGAGTATCAGTGAACAGGAGATA ACCAGTCATTTCTAGATTCTGCTCAGAGTCCCAGCTTAGAGGCTCCACCA GCTCAAAGAGACGGGATGGCAAAACAGCCACCTTAATTTCCAGATCATCT GCCAGCTCATCTGCCAGCTGACACACAGCCCATGGCTCCACTATTGCCAG TGCTGTAGCTGCACCAGACCAATCTGGTTCAACTTTTATGTAACAAAGTT GTGATTTGTTTTTCAGTTGCCCGGGACTCCCAGGTTGAAGATCATGAGCA TGCCCTGATGAGCCAAGGCATGCAACCACAGGGGAACCTAA (SEQ ID NO: 237) pARBI-633: TGAGGTCTTGGTATGTTGCCCAGGATGGTCTCAAACTCCTGGGCTCAAGC 2541 chr10_3_ AATCCTCCTGCCTCCACCTCCCAAAGCGCTGAGATTATAGGCATGAGACA Exogenous_89 CAGAGCCCAGCCACTGTCCGCATTTCTTTCCTGGCTTCCTTACTGGGTTC TCTACTTTCACTCTTGCCTTCTTCCTCTCTTCCACCAGCATAAGTTTTAG AAACTGCACTTCTGATCATGCTGCTCTGCTGCTTAAAATCCTTCATGCAG ATCATGTAAGACCTTCCTGATCCAGTGCTGCCTTGCCCTCTAGAACCCAG GGTGCCTTTCTCCCCTCTGCAGTTCTACTGCGCCACCTGCAGTTTCTCTC TCCTGTGTTCATGTGTCTACTCCCTCTGAATGGTAGGCTCACACCCTCCA CCTGGCCAACTCTTACTAAATCTTGGGCTTCAACTCAAGGATCACCTTCA tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggggggcaggacagcaagggggaggattgggaagacaatagca ggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGGT GCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGGG GGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTAA ACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGGG GGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCAA CGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCTC CTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAGT CGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCGT CTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTCC CTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCTG CTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACAG ATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccacc ATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGGT CGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAGG GCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCACC ACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCTA CGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGACT TCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTTC TTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGGG CGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAGG ACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAAC GTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCAA GATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTACC AGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCAC TACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCGA TCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGCA TGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaaa taaagcaatagcatcacaaatttcacaaataaagcatttttttcactgca ttctagttgtggtttgtccaaactcatcaatgtatcttatTGGTTCACTC CACCAGGCTGGGCTGTGTGTCCTGCCCAGTTCCTATAGTGCCCTAACTCA CTTTTTTAAAAAAATACTTTAAGTTCTGGGGTACATGTGCAGAACATGCA GGTTTGTTACATAGGTATACACTTGCCATGGTGGTTTGCTGAAGCCATCA ATGTCATCTACATTAGGCATTTCTCCTAATGCTGTCCCTCCCCTAGCCTC CCACCCCCCAACAGGCCCCAGTGTGTGATGTTCCCCTTCCTGTGTCCATG TGTTCTCGTTGTTCAGCTTCCACTTATGAGTGAGAACATGCGGTGTTTGG TTTTCTGTTCCTGTGTTAGTTTGCTGAGAATGATGCTTTCCAGCTTCATC CATGTCCCTGCAAAGGACATGAACTCATTCTTTTTTATGGCTGTCCATTA TTCCATGGTGTATGTGTGCCACATTTTATTTATCCAGTCT (SEQ ID NO: 238) pARBI-634: CCTTCCACCCCCTTCCTTAATAGTTTTCCTCCTCCGCCTCCTGCACGGCC 2541 chr10_1_ TCCGCCTCCGCCTGACCGTCCTCGGAAACCTCCCAGCCGCAGCCCGGCCC Exogenous_92 TCCCCAGCGGCCTGAGTGGCCTCCTCCCGCGGCGCCCCCCAGCTCCGCAC CTCCCCAGCCCCGCCCGAGGCGCGCCCAGCGAGCGTGACCCCGACCCAGG GCCACGGCTGCGCCCGTAGGATCGCGGGCGCGCGGGTCTCCATGAGGGAC TGGGAGAGATGCGTCACCCGCTAATCCCTCAACTGGGTCGCACCCGGTCT GCCTTGGTTGGTCCCTTCTGATGGCGGAGGCGGGACCTCGACTTGGTGGC CAATGAGGAGCCTCGTGGGGGTGCTCTCGGCTGCCATGGCAACGGAGGGG CCGCCCGCTTCCCCGCGGGGCGAGGCCGGCTCTCCCAGGACTGGCCACAT tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggtggggcaggacagcaagggggaggattgggaagacaatagc aggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGG TGCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGG GGGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTA AACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGG GGGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCA ACGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCT CCTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAG TCGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCG TCTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTC CCTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCT GCTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACA GATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccac cATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGG TCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAG GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCT ACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGAC TTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTT CTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGG GCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAG GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAA CGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCA AGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTAC CAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCA CTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCG ATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGC ATGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaa ataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgc attctagttgtggtttgtccaaactcatcaatgtatcttatTGCAGTCAC GCGCTGTGGCCCCAGTCCTGTTAGCATGAGGTGGGGGTCAGAAGTTGAGG AACAACTCGGACCTAAAGCCTTGACAATTGCCAACAGTGTAGACAAGTCC CAGCATGCCCAAAATTTATAAGAGAAGGCTGCCTGGAAGAGGGACACGAT GAGGAGGGTCTGCTGAAACGGACAAAGAAACCCAGTTCAGTGCAGTGGAG TGGACATGGCACGGGCAGGGCACGGAGTGGGGTCGGTGCTTAGTAAGCGG ATTAGGGTTTCCAAGCCAAGACCCACCCAGGGCAGGATGGGGAACCAAGG TGAGCAGGTGCGCTGAATTGTGGGTCTCACACTGTCCTTCAGTGTCCCGC AGGAAGAGAGAGCTAAGGATCACCAGGGTCATTCCCCTGCCTCAGGCAGC CTGCACTCCACCTTAGGTACAGTCATGTCCGGTCTTCAGCA (SEQ ID NO: 239) pARBI-635: TGATCCTCCCACCTCAGTCTCCCAAAGAGCTGGGATTACAGGCATAAGCC 2541 chr10_2_ ACCTTGCCCCGCCAGTTTTTAGCCTGTCAGCTCTTTGAGGGACACTTGTT Exogenous_91 ATGGTCATACCCACAGCTTTGACCCTTACCAGGTGGTCAGGTGGATGAAT GAATGAGTAGAAAGAAGGAAGAGAAGGCAGGAGGGAGAAGAGGAAGAATG AAGGAAGGGTTGAGTTTGTTCTGGAAGCTCCCAGCTTCCATCCGCACCAC ACTCTCCCTCTGTCCCCAGGACACCAGGGAACTGACTCAAGGCTCTCCTC TTACCAGGTGCTGTGTGGGGTCCAGGGAGAAGAGCCCATTTGAAATGGGG CACTGGTCTAAACTCTGGACCTGCTGCTCACTATTGACTGAACTTAGCCA CTTAAATATTCTCCTCTATAAAATGGGAGTGACCATAGCAACTACCTCGC tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggtggggcaggacagcaagggggaggattgggaagacaatagc aggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGG TGCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGG GGGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTA AACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGG GGGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCA ACGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCT CCTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAG TCGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCG TCTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTC CCTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCT GCTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACA GATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccac cATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGG TCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAG GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCT ACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGAC TTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTT CTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGG GCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAG GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAA CGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCA AGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTAC CAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCA CTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCG ATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGC ATGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaa ataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgc attctagttgtggtttgtccaaactcatcaatgtatcttatTGAATTGTA GGGAGACCCAAAAAATGTATATGAATACATTTGTCAACTAAAGAGTGTTA GGGATTATCTCAGTGGAGGGATAGGCAGTGCAGGCTCTGGAACCAGACTT CTTGTGTTCACATCCCTGCTCCACTGCTTGGAGACCTTTGGCAAGGTCCT TAACTCCTCTGGACCTCAGTTTTCTCATCTGTAAAACAGGGATTATACTA GTTCCTACCTCACAGAGTTGTGAAGATTAAATGAGATAATGCATGAAAGG CTTTTCAATAAGAGCAGGGCACACATTAGGCATGATGTCCATGATCAAGT GCCTAATATGTGCCTGCTCTCAGGAAGGATGCTGTGAACTACTTCACACT AAAGTGTAAATGAATGATAAGATCTACTATGCTCCACTCATGCCTGTAAT CCTAGCACTTTGGGAGGCCGAAGCAAGTGGATCACCTGAGG (SEQ ID NO: 240) pARBI-636: GGTGAGGTCACCCTGGGTACCTGGAGTTACTCAGAGTGACACTAGTACAC 2541 chr10_3_ GCCTGGCAGGAGTGATGGATGAGCTAGGCTCCAGGAAAGCAGGTGGGGCT Exogenous_93 GGAAACCACCCAAGACAAGGAAATTGTTGGGTTTGAAATTCCAGGTAAGC AGAGTATGGCCTGAGAGCAGCGGCAGAGGCTGAAAGTGGGCTTGGTCTCG GTATTTTTGAGGCTGAGGTAGCTCCAACTCTCTCCCTCTGAGTCAGCACC CCACTTAAGACAGAAATACTGAGATCTGCGACCCCAGCCTCCCCCACAGC AACTCCACAGTCCCCAAACCTTTGCTCCGCTGGGTCCTCCCAAGTATGGT AGCTGACACAGCCCTGAGCAGCCCACTGTCCTCTGCCCTGCCAAGTCTTC TCACTCTTGTCACTTTGTCCTACATCATTTCTTTGTCTGGCACCCCTCCT tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggtggggcaggacagcaagggggaggattgggaagacaatagc aggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGG TGCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGG GGGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTA AACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGG GGGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCA ACGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCT CCTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAG TCGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCG TCTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTC CCTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCT GCTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACA GATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccac cATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGG TCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAG GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCT ACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGAC TTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTT CTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGG GCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAG GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAA CGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCA AGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTAC CAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCA CTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCG ATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGC ATGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaa ataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgc attctagttgtggtttgtccaaactcatcaatgtatcttatAGAATTGGT GCCCCAGAGGTGGGAAAAACCTACCCTGCTCACTGCTCTATCTCCTGGGC TTAGCTCAGACTCTGGCATACGTGAGTGCTTAGTAGAAACTTGATGGGGT ATTAAAATTGAGACCTGGGTGGGTTCTCACATTAAATTTCTAACACTCCT TAGGTGGCTTGGGCCCACCCCTCTGAGCCCTGTTTCCACATCTGTAAAGT AGCAAACCATCGCCATGAAGAACAGGGAAGAACCAGGTCAGCAGGCAGGG CGGATGAAGCATCCAGCCAGTTCTAGGAAGTTTCCGATGGCTAATCCTTC AGATGCAGCTTGTAAGCAGGTCAAATACAGAATAGGAAACTGAGGCGCAA CGAAGTCATGCTTAGAAGTCTGTCCAGGGAGACCAGTCAAGGCCTCCCCC ACACCCTGCAAGCCCTTCTAGGAGCTCTGGTTAAGTCACTC (SEQ ID NO: 241) pARBI-637: ATCTTCAAATTGTAGTCTTTGTAACAACCAAATAACCTTTTGTGGTCACT 2541 chr11_1_ GTAAAATTAATATTTGGTAGACAGAATCCATGTACCTTTGCTAAGGTTAG Exogenous_99 AATGAATAATTTATTGTATTTTTAATTTGAATGTTTGTGCTTTTTAAATG AGCCAAGACTAGAGGGGAAACTATCACCTAAAATCAGTTTGGAAAACAAG ACCTAAAAAGGGAAGGGGATGGGGATTGTGGGGAGAGAGTGGGCGAGGTG CCTTTACTACATGTGTGATCTGAAAACCCTGCTTGGTTCTGAGCTGCGTC TATTGAATTGGTAAAGTAATACCAATGGCTTTTTATCATTTCCTTCTTCC CTTTAAGTTTCACTTGAAATTTTAAAAATCATGGTTATTTTTATCGTTGG GATCTTTCTGTCTTCTGGGTTCCATTTTTTAAATGTTTAAAAATATGTTG tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggtggggcaggacagcaagggggaggattgggaagacaatagc aggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGG TGCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGG GGGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTA AACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGG GGGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCA ACGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCT CCTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAG TCGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCG TCTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTC CCTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCT GCTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACA GATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccac cATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGG TCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAG GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCT ACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGAC TTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTT CTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGG GCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAG GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAA CGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCA AGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTAC CAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCA CTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCG ATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGC ATGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaa ataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgc attctagttgtggtttgtccaaactcatcaatgtatcttatACATGGTAG TTCAGTTCTTAACCAATGACTTGGGGATGATGCAAACAATTACTGTCGTT GGGATTTAGAGTGTATTAGTCACGCATGTATGGGGAAGTAGTCTCGGGTA TGCTGTTGTGAAATTGAAACTGTAAAAGTAGATGGTTGAAAGTACTGGTA TGTTGCTCTGTATGGTAAGAACTAATTCTGTTACGTCATGTACATAATTA CTAATCACTTTTCTTCCCCTTTACAGCACAAATAAAGTTTGAGTTCTAAA CTCATTAGAATTGTTGTATTGCTATGTTACATTTCTCGACCCCTATCACA TTGCCTTCATAACGACTTTGGATGTATCTTCATATTGTAGATTTAGGTCT AGATTTGCTAGCTCCAAGTAATTAAGGCCATGTAGGAGAGCATGGTAACC ACAGATAGAACTGGTATTATCCCAAGTGGTCTGCAGACTGC (SEQ ID NO: 242) pARBI-638: TTACAGCACAAATAAAGTTTGAGTTCTAAACTCATTAGAATTGTTGTATT 2541 chr11_2_ GCTATGTTACATTTCTCGACCCCTATCACATTGCCTTCATAACGACTTTG Exogenous_97 GATGTATCTTCATATTGTAGATTTAGGTCTAGATTTGCTAGCTCCAAGTA ATTAAGGCCATGTAGGAGAGCATGGTAACCACAGATAGAACTGGTATTAT CCCAAGTGGTCTGCAGACTGCTGAGTGGGGATGGGATCTGCTCTCTGTTG AGAGTTGGTAATCATTGGTTTGAAATGTGATGAAACCACTCAAGCCAATG AAGGTGGGTGTGTAGGTGGGGAGTACTTTGCCATAATATTTTAAAACATT ACCTGGTTAGAGTTCTAAGTGGTACTTATTTTTGTTTGGTTAGGGGAAAG CCTGAATAAAAACAGAAATGGACACATAATATGCATATTCCATAGTCTTT tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggtggggcaggacagcaagggggaggattgggaagacaatagc aggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGG TGCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGG GGGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTA AACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGG GGGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCA ACGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCT CCTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAG TCGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCG TCTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTC CCTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCT GCTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACA GATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccac cATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGG TCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAG GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCT ACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGAC TTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTT CTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGG GCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAG GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAA CGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCA AGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTAC CAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCA CTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCG ATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGC ATGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaa ataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgc attctagttgtggtttgtccaaactcatcaatgtatcttatGGGAGGCTG GAATGTGCCTGGGATTTGGGTCTAAGTGTATGCGTAATTCTTACCTCACT AAAGAATTTGCCTTGTTTTTTTCCTTTTGGTGAGTGACTAAAACGTCTGG GCTTCCCTGTGTGCGTGCTACAGTAAGCAAGCAGAGGCTGTGCAAAGGTG TGAGCAGGATCACGTGGAATCTGGAGGATACATCTTGGCTTGCAAACTGC CTCTGTCTCCTGGGTGGGACTGTTCTGTCCTTGCACTGCTGTTCTGTGTT ACCTCTTGGGGTGTAAGGTTTTGCTTACAGGAGACAAACTTTGGGCGTAG AATGGAAGCCACTGCCAGCCTCTGTGCTGAGAAGGAAGGTGCTTGTTTCA AAGGGAGCAGCAAGGGAGGCTTGTTCTACTCACCTGGGCCTGTTTGCCTG AGAAGGGGAGATAAGGGCTGAACTGGGACTAGCCAGGGGGA (SEQ ID NO: 243) pARBI-639: TGGGGCTTTGGGGAATTTAGTGCGTGGGTGAGCCAAGAAAATACTAATTA 2541 chr11_3_ ATAATAGTAAGTTGTTAGTGTTGGTTAAGTTGTTGCTTGGAAGTGAGAAG Exogenous_98 TTGCTTAGAAACTTTCCAAAGTGCTTAGAACTTTAAGTGCAAACAGACAA ACTAACAAACAAAAATTGTTTTGCTTTGCTACAAGGTGGGGAAGACTGAA GAAGTGTTAACTGAAAACAGGTGACACAGAGTCACCAGTTTTCCGAGAAC CAAAGGGAGGGGTGTGTGATGCCATCTCACAGGCAGGGGAAATGTCTTTA CCAGCTTCCTCCTGGTGGCCAAGACAGCCTGTTTCAGAGGGTTGTTTTGT TTGGGGTGTGGGTGTTATCAAGTGAATTAGTCACTTGAAAGATGGGCGTC AGACTTGCATACGCAGCAGATCAGCATCCTTCGCTGCCCCTTAGCAACTT tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggtggggcaggacagcaagggggaggattgggaagacaatagc aggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGG TGCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGG GGGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTA AACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGG GGGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCA ACGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCT CCTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAG TCGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCG TCTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTC CCTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCT GCTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACA GATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccac cATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGG TCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAG GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCT ACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGAC TTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTT CTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGG GCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAG GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAA CGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCA AGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTAC CAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCA CTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCG ATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGC ATGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaa ataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgc attctagttgtggtttgtccaaactcatcaatgtatcttatAGGTGGTTG ATTTGAAACTGTGAAGGTGTGATTTTTTCAGGAGCTGGAAGTCTTAGAAA AGCCTTGTAAATGCCTATATTGTGGGCTTTTAACGTATTTAAGGGACCAC TTAAGACGAGATTAGATGGGCTCTTCTGGATTTGTTCCTCATTTGTCACA GGTGTCTTGTGATTGAAAATCATGAGCGAAGTGAAATTGCATTGAATTTC AAGGGAATTTAGTATGTAAATCGTGCCTTAGAAACACATCTGTTGTCTTT TCTGTGTTTGGTCGATATTAATAATGGCAAAATTTTTGCCTATCTAGTAT CTTCAAATTGTAGTCTTTGTAACAACCAAATAACCTTTTGTGGTCACTGT AAAATTAATATTTGGTAGACAGAATCCATGTACCTTTGCTAAGGTTAGAA TGAATAATTTATTGTATTTTTAATTTGAATGTTTGTGCTTT (SEQ ID NO: 244) pARBI-640: CGACCAACCCATCAAACTCCCCGCCCCCAGCACTTTTATTTCTCCTCTTT 2541 chr11_1_ AGGAAGTACACTTCAGTATCTTTGGCACAGTGCATGAGCACGACTAAAGT Exogenous_94 AAAACATCGCAGAAAACATAGCTTTAGTCTACCCTTCGTGTCCTAAAAGG AAAACCAGTAGCTTCCCAGGCCACCGGAAGGGCAACACATGTCCTCTGCA GTTTCTGCACACGGGAAGGTAAAGACAGAGAGAGGACCTACTCCTCAACA CAGAAACATTTCAAAATCTTTCCTCGCCTGCAACCCAAGCTGAAGTCATT CTCCCCAGAAATAACAAAAGTTGGAAGAGAAGCCGGAGACAGGATAGGTG CAGGAAGCCCACACTTTGAGGGCAGCACTCAGACACCCTCTCCTGTGTGC AGGACGTGCCGAATGTTCAGGTGCAATGAGAATGAGCCATGCTTGGCTTA tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggtggggcaggacagcaagggggaggattgggaagacaatagc aggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGG TGCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGG GGGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTA AACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGG GGGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCA ACGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCT CCTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAG TCGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCG TCTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTC CCTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCT GCTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACA GATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccac cATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGG TCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAG GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCT ACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGAC TTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTT CTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGG GCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAG GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAA CGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCA AGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTAC CAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCA CTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCG ATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGC ATGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaa ataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgc attctagttgtggtttgtccaaactcatcaatgtatcttatCGAGGGCAA TCTGGCCCATCAAGTGGCCTTCGCCTCTGGGAGTAACAAAAATGCACTTC AAAATAGCTTCTGTAATCAAGCTGCATGGGTGGAGTACTCCCCAGCTGAC TCCAGGAAGTTCTCTATCCAAAGCTATTCATTAGGCCAGAGCTGTGCAAA TAATTAGTCACCCACTTGCTCCATAACCCTCCATGACAGCCCAGGCATTG AGTCCAGGTGGGACCATCAAGCCATGCTCTGGTGGCTCATGCATTATCAT AGAAATGGGAGGCTTTATTTATTTTACTAAAAAGAACAAAAACAACAGAC TGCTGTCCTTTAGACAATAGGATCACGTCATCTGAGCCCTCTGTGCCCCA GGTGACAAGCCCAGCCCCAAGTTCTCTTTCCTCAGCCTCCCCACACATGT TCTGGAGGAGATGGGCCCAGCAGGCTGCTCTGAGGCCTGGC (SEQ ID NO: 245) pARBI-641: TCACAGAATCTTGTTTGAGCTAACAACCAGTTATTCTACCCAAAGTAAAG 2541 chr11_2_ AGTAACTTAAATTTTCCCATTGTTCTTCCAGGCTGGAAGATGTAAGAAAC Exogenous_96 ACACACAGTATAGTGTAGGTTTCCTCCATAGTTAAGTACCCCAGAGCTAG GCTGAAATCCAGTGTTGGATTGTTTCCAACTTATAGTAGGGAACCGCCAA TGACATGAAAGAGCATTACCTTACCGGACTGATTCATATTCTACTTCCAG TCATAGTACAAATGATCACATGTCCACACCCACATGTGCTCTGATAAGTA GTCAATTGAGAGGAGTGAGTCAGGGAGCCGTAACTTTGCAGCCAACCTGC CAAGAAGAACTGATGCTTTTAGAAATTCAGCAGCTTTCATTGGCATAGAA GTTTCAGCCTTATAGGCCCAGGGTTTCTCTGAGGCAGAGGCAAACCCTGT tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggtggggcaggacagcaagggggaggattgggaagacaatagc aggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGG TGCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGG GGGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTA AACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGG GGGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCA ACGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCT CCTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAG TCGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCG TCTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTC CCTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCT GCTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACA GATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccac cATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGG TCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAG GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCT ACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGAC TTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTT CTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGG GCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAG GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAA CGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCA AGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTAC CAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCA CTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCG ATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGC ATGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaa ataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgc attctagttgtggtttgtccaaactcatcaatgtatcttatTGTTATGTT GTTATCTCAGGATTCTTTTTTTTTTTTTTGAGACAGAGTCTCACTCTGTT GCCCAGGCTGGAGTGCAGTGGCACAATCTCGGCTCACTGCAAGCTCCGCC TCCCGGGTTCACGCCATTCTCCTGCCTCAGCCTCCCGAGTAGCTGGGACT ACAGGCGCCCGCCACCACGCCTTTCTAAATTTTGTATTTTTAGTAGAGAC GGGATTTCACCATGTTAGCCAGGATGGTCTCGATCTCCTGACCTCGTGAT CCGCCGCCCTGGCCTCCCAAAGTGCTGGGATTACAGGCGTGAGCCACTGC GCCCAGCCTATCTCAGGATTCTTAATGTAAAAACAGTTCAGTATGGAAAG CGTACTTATGACAGAGCTGTTGCAGAATGTGAAATCACCCATGTGCCTAA GTGTCTCCCTACTACCTGGAACACTAGACCACATGCAGCAG (SEQ ID NO: 246) pARBI-642: CAGCCCTGAACTCAGCCTTCACGGCCCTCCCTAAACACATGGGCCAGCCA 2541 chr11_3_ CAGGGAAGAAAAAAATGCCCTGTAGCACAGGAATCTAAAACATTGCTGTT Exogenous_95 GTTCTAGTTAATAAAAGGACTTAGAGATTGAAAAGAAGTACCATCTGTAT GGCTGGCAACAGACAGCATGATTTTGATGACTGACTCTGGGGAGATCTGA GAAAACAGTACATAATATGTCCTTACCTCTGCTCCATCATTCATGACCCA AGCTCCTGCCCTGTCTTCTCAGGAAGATTCTTACAGTAACTCCAGTTTGG AATTCCCCTTTCATATTTGAAATAACAACAGTAATAGTGACCACTAATGC TTACCGAGTGTCCACTCTGTGCCAAGCATCGTGTTTTTATCATCTTTTTT GTTAGGCACTGTACATGGATTAATACAATTACAACCCACCACAACCCCAT tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggtggggcaggacagcaagggggaggattgggaagacaatagc aggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGG TGCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGG GGGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTA AACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGG GGGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCA ACGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCT CCTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAG TCGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCG TCTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTC CCTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCT GCTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACA GATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccac cATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGG TCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAG GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCT ACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGAC TTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTT CTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGG GCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAG GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAA CGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCA AGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTAC CAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCA CTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCG ATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGC ATGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaa ataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgc attctagttgtggtttgtccaaactcatcaatgtatcttatGAGATAAGT ACTTGTACTATCCTCATTTTATGGGTGATGATACAAGAAACAGAAAGGTT CAGCAACATGCCTAAAGCCACACAGCTACTATGAGGCAAAGTCTAGGTTT GAACTCAGAAATTCTGACTTCAGACTCACTTATCTTATTATACTTCACTT AGTATCTCCCATTAATTAATTCATTCATTCTTCCTTTTATTTGTTAATTC ACCAAATATTCTGGGAAGCTACTTGTCTAGACACTGTATGAAAAATGAGA ATGGATACAAAGCAACCTCAGACATGAACTCAAGGTCCTTACAAGCTTAA ACGTATATGATACATTTCCAAATAGCAATACTATGAGGACAGCTGATACT CACAACATAAGTCTTGGTGGTCACAGGAGATCTATCATAGGATTTCACGA AGAACCTGACACTGAAAACCAGAACTTGAGGGAAATAAAAC (SEQ ID NO: 247) pARBI-643: GTGTCCATCCTGCATGTCTCAAGTCACAATCCTCCAGATATTCAACCAAA 2541 chr15_1_ GTGCTTGATTCTGTTGGCACAAGGACCATGAAATCTTCAGTAGCTAAGGA Exogenous_100 GACTCTTTGACAAATAGATTTATTGCATATGATGTAAAAGGGTCTCATTC AGAGAACTACACATTTACTACACATTTACTAGCTGTTAGATGTGAGAAAT TGAAGCTTTGAGAATTGTCATTTTGTAAGCACGCTGTAATAATTGTGTTC TCTACAAAAATGGAGAAATACTTAATCTTTGCAGGTTTTTATTAGCTTCG AAAAAGGGAATACAAATATTTTGATACCTGACTTTATTTTAAAGATACTG GCAAGCAAGGAAAGATACTCAGTTTGCTAAATTATAGCATAGATTTGAAA GACGGAAATTTTTTTTTGTTATTTTGATTCTTTCTGATTCAGAACCTGCG tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggtggggcaggacagcaagggggaggattgggaagacaatagc aggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGG TGCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGG GGGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTA AACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGG GGGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCA ACGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCT CCTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAG TCGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCG TCTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTC CCTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCT GCTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACA GATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccac cATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGG TCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAG GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCT ACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGAC TTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTT CTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGG GCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAG GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAA CGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCA AGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTAC CAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCA CTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCG ATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGC ATGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaa ataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgc attctagttgtggtttgtccaaactcatcaatgtatcttatTGAGTTTCC ATTCCTGGGATAGAGAGCCCTGCCAAAGCCAACTCAGTCTCCTGTTATTG CAGAAATTTAAAGGTTACTTGAGCTGATTGATACCAATTAAAATAAAAGC ATCTACTGCCATTTTATTAAACGCCCTATTTAATCTCCACATCCCAAGTT CATTCTCTCTCTCTTTTAAGGCTATACATGGGCCTCAACATTCTCTCTTC TTCAGTATCAGCTGTCCTCATCCCTATTCAAAGTCAGAATGACTTGGAAT CTATTCAGGGCTGCTACTTAATTGGATGAAGAAAGTACTCTGTTGCATGC TAACCTCCCAGATTTGAATTAGAACCAAGGTCCTGAATCTACACAAGACC TTTTTCCCTTTCCTTTCTGCATTTATTTCTTTTTTTTCTACATCTTTCTT TTTGAATTTTTTTTTTTTTTTTTTTCTGAGATAGAGTCTTG (SEQ ID NO: 248) pARBI-644: ATCATCAAAATCTTCATTTATACAAATCACGGGTATTAGAAATAAAATCA 2541 chr15_2_ GGTATTGTGTACAGTCAGATCCCAGGTTTACACAAACTAGAGGTTTCAAT Exogenous_101 TAAGTTTCCTTCATTTTATAAAACCATGACACAAGCTGTAAATAAAGGAG CTCTGGCTTTAGGCCTTCTTCTGTTTATAGACAGTAAATTTTAATCCTCT GTCCCTGCAGACCAACTCAAAGCAGTAACCTTGGCTACCGTTGCCGCAGA AGAAACACACCCTGCAGATTTCCAGTTCTATATTCAGTCTTAGAAAATTG GTCATGTTGAAAGGGAAAAAACAATTTTACTTTGCCTCAGTTATTGACAC CAAGAAGCACTCCTCAGCCTGTTCAGAGGGGAAACAATGAGTCATTCATG CTGTCAGCCCAGGTGGTGCATGAATTCCAGACAGAGGCCGCTGAATTAAC tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggtggggcaggacagcaagggggaggattgggaagacaatagc aggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGG TGCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGG GGGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTA AACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGG GGGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCA ACGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCT CCTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAG TCGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCG TCTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTC CCTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCT GCTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACA GATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccac cATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGG TCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAG GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCT ACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGAC TTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTT CTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGG GCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAG GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAA CGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCA AGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTAC CAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCA CTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCG ATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGC ATGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaa ataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgc attctagttgtggtttgtccaaactcatcaatgtatcttatCCGTGGAGG CGTCTCTCTGAGCAGAGCCCGCAATGCGCCTGCTTGGGGCTCCCTGCAGC CTCTGGGGGAGGCAGGGCGGCCCAGAGCAGGCCTGTGCTGGAAAGGAACG CGAAGCCCTGTAACCAAGCCTGTACCTCTGCAGTGCTAGTCCCAAGGGGC CTCCGAGCTGTTTGTCACCATGTGATTGGCTCAGGAGAGGGGTGGAGAAA TGAAAACACTCTGCCCAGGATATATTTAGTTGAAGTGCAGCTGGGGAAGT GCTTAAACAAGGGAGCTTTTGTCCTTATGTTGAAGTGTTTTTCTTAACTC CTCAAGGGTGAAAAACTTGAGCCACGTACTGATCCCATTCCCCCCCACCA CCCCCAATATATTTTCTCTTCTTTAGGAAATGCTCTTATTCTGAAACTTT AGAATTTTCTAGGGTTTGTTGCATAAGAGGAAACTGAATAA (SEQ ID NO: 249) pARBI-645: CTTTCCTTTCTGCATTTATTTCTTTTTTTTCTACATCTTTCTTTTTGAAT 2541 chr15_3_ TTTTTTTTTTTTTTTTTTCTGAGATAGAGTCTTGCTCTGTCGCCAGGCTG Exogenous_102 GAGTGCAGTGGTGTGATCTTGGCTCACTGCAGTCTCCAACTCTGGTTCAA GCAATTCTCCTGCCTCAGCCTCCGGGTAGCTGGGACTACAGGCATACACC ACCATGCCCAGCTAATTTTTGTATTTTTAGTAGAGATGGGGTTTCACCAT AGTGGCCAGGATGGTCTCCATCTCTTGACCTCGTGACCTGACCCCCTCGG CCTCCCAAAGTGCTGAGATTACAGGCATGAGCCACCGCGCCCGGCCTCTT TTTGAATTTTTTTAAAAAAACACCTAAAGTTTAGGAAAGTATAAGAGGCC AAAGAAACAAGAGTTCAAAGAAACAAGAGTGTTATACACGCACACTTGCA tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggggggtggggcaggacagcaagggggaggattgggaagacaatagca ggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGGT GCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGGG GGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTAA ACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGGG GGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCAA CGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCTC CTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAGT CGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCGT CTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTCC CTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCTG CTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACAG ATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccacc ATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGGT CGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAGG GCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCACC ACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCTA CGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGACT TCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTTC TTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGGG CGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAGG ACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAAC GTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCAA GATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTACC AGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCAC TACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCGA TCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGCA TGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaaa taaagcaatagcatcacaaatttcacaaataaagcatttttttcactgca ttctagttgtggtttgtccaaactcatcaatgtatcttatGAAAGGTAGT AAAATATCTGTATAGTTCTGGCTGTCAAGCTTTTGGACTGAGATATGCGT AAGCAAGGCAAAAAGATCCCATAGTCCAGGAATATCAACCAGTCCAGTTT CCCAAGGAAACTAAAAGTAACTGAAAATGAGAAGGGTGCCATATAGGAAA ATTGAAGGTGGGGCATAATTATTAATACACTGCTTTGGAAATGCTGGTAT GGAAGATTCATATATGGACTTCAAAGCATACATGTCTTATACTTAGCTAT GGGAAAATTATCATCAAAATCTTCATTTATACAAATCACGGGTATTAGAA ATAAAATCAGGTATTGTGTACAGTCAGATCCCAGGTTTACACAAACTAGA GGTTTCAATTAAGTTTCCTTCATTTTATAAAACCATGACACAAGCTGTAA ATAAAGGAGCTCTGGCTTTAGGCCTTCTTCTGTTTATAGA (SEQ ID NO: 250) pARBI-646: TGTCTTCACGGCACGGCCCCCGCTTCACCAGCTGCACTGTTTGATGAGCT 2541 chr16_1_ TTGCAGGTCCCAGATCTTATAAGCTCATGGTGATTGATCCAAATGATGCA Exogenous_103 GAGGTCGGCCTAAAGTTAGAAGTGGGCCCCTCTCTGCCCCAAGACAGCCC TTCACCCCAATTCCATTCCCACAGTTTGGGCATCCACCCAGGCTGCCAAG CCAAGCGGGGGCTGCCCGGGTTAGCAGGGACCTGGCCATGGGCCTCCTCA GCTAGGGGCCGCCTCTCTGAGGAGTGGGTGCCCCGCCCCTCCGTGGGCCG CCTGGTTTCTCTTATTGGCAGCGTCTGTAGTCCCCTGGCTCTGTCACCCG CAGCTATTCTAGGCCTCTGGTTCCTTTCACTTTCTGCTCTGCGTCTTCCT GCCTCGGTGGTTCCCCACCATCCAGCCTGAGACCTCCCCGTTGCCCGCCC tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggggggggggcaggacagcaagggggaggattgggaagacaatagcag gcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGGTG CCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGGGG GGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTAAA CTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGGGG GAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCAAC GGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCTCC TTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAGTC GCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCGTC TAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTCCC TTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCTGC TTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACAGA TCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccaccA TGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGGTC GAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAGGG CGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCACCA CCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCTAC GGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGACTT CTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTTCT TCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGGGC GACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAGGA CGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAACG TCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCAAG ATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTACCA GCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCACT ACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCGAT CACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGCAT GGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaaat aaagcaatagcatcacaaatttcacaaataaagcatttttttcactgcat tctagttgtggtttgtccaaactcatcaatgtatcttatTGGGTTTCTGT CTGCTCCTCTCCGTGGTGGCCGGCCCTGGCTCTGCCCCAAAGCCATGGAG AAGGCCAAATGCAGGCTGTAGAAAGGTCTGGGATCAGATCCCCCCATCCT TCCATGGCCACGTGACCGGGCCCGCCTCTTCCCTCTGAGCCTCACATCCT CATCTGAAGAGGGAATGCTGCTCATGCCCACCTTGCGGGGATGTGGGGAG GGGGTGCTGCCAGTGGAGACTCCCAGGGGACCCTCCAGGAGTCTCATCCT GACTCGAGACTGGGCACCTGGTGGGGGCACAGGCCCCCTCTCCCGCCATG GGACCGCCACATCCAGGCTCTCGTGGGTTGAGATGCTGAGGAAAGGAAAG AACAAAACCTGCTGGAGGAGCAGCGAACACGGCCCTCAGCTGGGTGACCT TGGGCAAGTCACTTCCCCTCTCTGGGCCTCAGTTTCCAT (SEQ ID NO: 251) pARBI-647: CTGAAGAGGGAATGCTGCTCATGCCCACCTTGCGGGGATGTGGGGAGGGG 2541 chr16_2_ GTGCTGCCAGTGGAGACTCCCAGGGGACCCTCCAGGAGTCTCATCCTGAC Exogenous_104 TCGAGACTGGGCACCTGGTGGGGGCACAGGCCCCCTCTCCCGCCATGGGA CCGCCACATCCAGGCTCTCGTGGGTTGAGATGCTGAGGAAAGGAAAGAAC AAAACCTGCTGGAGGAGCAGCGAACACGGCCCTCAGCTGGGTGACCTTGG GCAAGTCACTTCCCCTCTCTGGGCCTCAGTTTCCATATCTGTGAATGAGG CAGCTGGACTGAAGAATAGAGAGTCACAAAGCAGAGTGACACAAACTGGG CTACAGGAAACCCTGGAGGCCCTCGACCCAGGCTGGGGCCGGGGATGGTT CTGCAAAGCCTCTTGGGAGAGGGTTGCCTGAGCTGAGTCTCCAAACAGAA tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggtggggcaggacagcaagggggaggattgggaagacaatagc aggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGG TGCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGG GGGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTA AACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGG GGGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCA ACGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCT CCTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAG TCGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCG TCTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTC CCTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCT GCTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACA GATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccac cATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGG TCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAG GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCT ACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGAC TTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTT CTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGG GCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAG GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAA CGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCA AGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTAC CAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCA CTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCG ATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGC ATGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaa ataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgc attctagttgtggtttgtccaaactcatcaatgtatcttatCAGTGGCCT TAAGTCACGGGAGATGAGCCAGAGCAGTCCAGAGGGCACGGTCAGCACAT GCTAAGGTCCTGGGGCACAGAGTACAGGGCTTGAGAAGGGTTCCTCTTGC TCCATGTGACTTTCATCATCCAACAGCCTGTGGTGGCACTTGCCTGTATC CCAGCTATCTAGGAGGCTGAGGTGGGAGGATTGCTTGAGTCTGAGAGGTC GGGCTTGTTCTCCCGGTGTTGGCAGAGTCCAGAGGGAGCAAGTGGAAGCC ACAGGTGTCTTGAGGCCTAGGCTGAGATCTAGCACATCACCGCGTCACTT CATCCCTTCATTTCATCCTATTGGCCAAAGCAGCCCAATTCTAGGGATGG GGATCTAGACTCAACTTCTTGATGGGAGAAGTAGCAAAGTCACACTGCCA AGGGCAGGGACACAGGGAGGGATGAAGAGTTAGGAATGGTT (SEQ ID NO: 252) pARBI-648: AAGCCACCTCCCTGCTCTGTTGCTGGCTCTCCTTGAAGACAAGTGTTTTT 2541 chr16_3_ CAATTAACCAAAGCGGTGCATGGCAATGTGATAGGGGAATGAAACACAGC Exogenous_105 AACAGAATCAATGCCCCACGCTGGGCAATAGCCGACTTTCTGTTCGCTCC CATCCCTTCCTTCCCTCCCCGCCTCCTTCCCCTGCTCCTTCCATTCCACA AACATTTATTGAGCACCCGCTGTGTGCCAGCCACTGTTCTAGGCCCTGAA GACACAGAAGTGAACAAAAAAAAAGAGTCCCTGTGCACATCCTGGAGGGA CTTTCCCCGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTCTATACAGTGTA TGGAGACAGTGGATAATAAAAGCTGTTTATGAGGCACTTTCTCTGAGCTG TTCCATGTGCTTCACTTATACTCATTCAGCTAATCCTCAGGACACCCCCG tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggtggggcaggacagcaagggggaggattgggaagacaatagc aggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGG TGCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGG GGGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTA AACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGG GGGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCA ACGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCT CCTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAG TCGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCG TCTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTC CCTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCT GCTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACA GATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccac cATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGG TCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAG GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCT ACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGAC TTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTT CTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGG GCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAG GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAA CGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCA AGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTAC CAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCA CTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCG ATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGC ATGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaa ataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgc attctagttgtggtttgtccaaactcatcaatgtatcttatTGAAGTCAG TGATATTAGTCACAGAGGCCCAGAGAAGTGAAGTGACTTGCCCAAGGTCA CACAGCCAGCAAGAGGCCAAGCCCGGATTGAACCCCAGCCGCCTGGCTCT GGAGCCTGCAGCATAACCACAGCAGGGAACTGCCACCGGAGACAGACATC GACAGCCACTAGGAGATGTTAACCAACAGGCTTGTCTTCACGGCACGGCC CCCGCTTCACCAGCTGCACTGTTTGATGAGCTTTGCAGGTCCCAGATCTT ATAAGCTCATGGTGATTGATCCAAATGATGCAGAGGTCGGCCTAAAGTTA GAAGTGGGCCCCTCTCTGCCCCAAGACAGCCCTTCACCCCAATTCCATTC CCACAGTTTGGGCATCCACCCAGGCTGCCAAGCCAAGCGGGGGCTGCCCG GGTTAGCAGGGACCTGGCCATGGGCCTCCTCAGCTAGGGGC (SEQ ID NO: 253) pARBI-649: GGGGGCGGCCGCACGGCTAGAGCGGAGACCCCGCGCCCCCTCCGCCCGCG 2541 EDF1_1_ TGGCCCGGCCGGGGTGCGGGGCCCGGGGAGGCACGGGGGCTGCGCGTCGG Exogenous_25 GGCGCAGCCGCCGCCCGCGTGTGCTCGGAGGCCGCGGGGCCCGGGCTCCG GGGTCCTCCCGACCTGCAGCCCCAGCGGCTACCGCGCCTCGCCAGGCCAG GCCAGGCCCCGACGTCGCCTTCCCTACGTCGCCGGCGCCCGGCCACGACG TCCCTCAGACGAGCCGAACGCCGAATGGCCCCGAGCACGGGAAGTGCCCG CCCCCCGCGTGCAGCCAGCCAATGGGACGCCGAAAGCGGGGAGGTGCCGA GGGGACGTAGCGTCGCCGCGCCAGGTCTCTAGCAGCTGCCGCTGAGCCGC CGGACGGACGCTCGTCTTCGCCCGCCATGGCCGAGAGCGACTGGGACACG tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggtggggcaggacagcaagggggaggattgggaagacaatagc aggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGG TGCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGG GGGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTA AACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGG GGGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCA ACGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCT CCTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAG TCGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCG TCTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTC CCTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCT GCTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACA GATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccac cATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGG TCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAG GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCT ACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGAC TTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTT CTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGG GCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAG GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAA CGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCA AGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTAC CAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCA CTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCG ATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGC ATGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaa ataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgc attctagttgtggtttgtccaaactcatcaatgtatcttatGTGACGGTG CTGCGCAAGAAGGGCCCTACGGCCGCCCAGGCCAAATCCAAGCAGGTGCT TTGCTGACTGAGGAGCGGCCGGGCCGGGCGGGGCGGACGCTgggccgggg ccacggaccgtggggcagggggctcggggaagggcaggcggggcctggga cagggcgcaggggacggggacaaggctggGGTGCCGGGGGCAGGACGGGG TTCGAAGGGCGGGGCGAATCGCAGGGGTAGGAGGACCGGGCCAGGACagg ggtgggaaggtggggggcggacccaggaggggatggacggacccaggagg cagggggCGGGCGACTGGAGGAGGAGAGGAGCAGGGAACCGGACGGGGCA AGGGGCAGGCGATAGGGCCGGGACCAGTGCCAGGGTGGGGGGGGGAGGGG ATAGGGGCAGGGACTGTTGGTGGGGACAGGTGTGAGGACAG (SEQ ID NO: 254) pARBI-650: ACGGCTAGAGCGGAGACCCCGCGCCCCCTCCGCCCGCGTGGCCCGGCCGG 2541 EDF1_2_ GGTGCGGGGCCCGGGGAGGCACGGGGGCTGCGCGTCGGGGCGCAGCCGCC Exogenous_26 GCCCGCGTGTGCTCGGAGGCCGCGGGGCCCGGGCTCCGGGGTCCTCCCGA CCTGCAGCCCCAGCGGCTACCGCGCCTCGCCAGGCCAGGCCAGGCCCCGA CGTCGCCTTCCCTACGTCGCCGGCGCCCGGCCACGACGTCCCTCAGACGA GCCGAACGCCGAATGGCCCCGAGCACGGGAAGTGCCCGCCCCCCGCGTGC AGCCAGCCAATGGGACGCCGAAAGCGGGGAGGTGCCGAGGGGACGTAGCG TCGCCGCGCCAGGTCTCTAGCAGCTGCCGCTGAGCCGCCGGACGGACGCT CGTCTTCGCCCGCCATGGCCGAGAGCGACTGGGACACGGTGACGGTGCTG tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggtggggcaggacagcaagggggaggattgggaagacaatagc aggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGG TGCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGG GGGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTA AACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGGGG GGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCAA CGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCTC CTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAGT CGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCGT CTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTCC CTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCTG CTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACAG ATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccacc ATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGGT CGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAGG GCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCACC ACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCTA CGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGACT TCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTTC TTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGGG CGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAGG ACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAAC GTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCAA GATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTACC AGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCAC TACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCGA TCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGCA TGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaaa taaagcaatagcatcacaaatttcacaaataaagcatttttttcactgca ttctagttgtggtttgtccaaactcatcaatgtatcttatCGCAAGAAGG GCCCTACGGCCGCCCAGGCCAAATCCAAGCAGGTGCTTTGCTGACTGAGG AGCGGCCGGGCCGGGCGGGGCGGACGCTgggccggggccacggaccgtgg ggcagggggctcggggaagggcaggcggggcctgggacagggcgcagggg acggggacaaggctggGGTGCCGGGGGCAGGACGGGGTTCGAAGGGCGGG GCGAATCGCAGGGGTAGGAGGACCGGGCCAGGACaggggtgggaaggtgg ggggcggacccaggaggggatggacggacccaggaggcagggggCGGGCG ACTGGAGGAGGAGAGGAGCAGGGAACCGGACGGGGCAAGGGGCAGGCGAT AGGGCCGGGACCAGTGCCAGGGTGGGGCGGGGAGGGGATAGGGGCAGGGA CTGTTGGTGGGGACAGGTGTGAGGACAGAGGCAGGAGGTG (SEQ ID NO: 255) pARBI-651: CCCTGGAGTCGGTGTGAGGCAAAAGCAGAGGAGAGGATTTGGAATTAACC 2541 EDF1_3_ TGAAGAAAAACCATTTCAAAGCGGTAACCAGACCCCAGAGGCTGCCTGAA Exogenous_27 CTCAAGGGGACATGGGAACCCAGGCTGTCCCAAGTTCACACACCTCAGGT CGTGGACTTCCAGAGTTTTCTCTCAGTTATGAGGACAGGCAGCTGTACTC ATGCCAACCAGAGCTGCTCTGGGAAGATGGCTGCCTCCCAGGGCTCCGGC CAGCACCGGTGCAGGCAGGCACTCAGCTGACAACGTCCCCGGGGGCGCCG CACACACCACACCCACCAGCACATGGACCCCACAGCACAGCCTCATGTTG CAAGCGGAAACACAAGTACCTACATTTCTTGGAAGTCTCCACATCTTCTC CTCGTCTCTGTGCCGCTAAGATAGCCTAGAAAATTAGAAAACATCAGTGG tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggtggggcaggacagcaagggggaggattgggaagacaatagc aggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGG TGCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGG GGGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTA AACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGG GGGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCA ACGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCT CCTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAG TCGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCG TCTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTC CCTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCT GCTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACA GATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccac cATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGG TCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAG GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCT ACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGAC TTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTT CTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGG GCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAG GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAA CGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCA AGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTAC CAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCA CTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCG ATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGC ATGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaa ataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgc attctagttgtggtttgtccaaactcatcaatgtatcttatTTTCTAAAG AGAAGATGTGCTTTATACACATCAGCTCCACTAGGACTGCTGCAAAACCA GGCGCGAAGCGTCGCCTGAGAACAGCAGCTTCTAGGGCCCCTGGGGTACG CACCCACACGCAGCTGGGTTTTGTGCGAGAGGCAGTGAGAGCCGGGCTGA CCTGGCTTCCCGAGGCATTCCCTGCAGGGGAACCAGGGGGGTGGAGCCCA CCCTCCCCGTGCTATCTGAACACCGGCCATCCCCCTCCCCAAACCCACAA CCCCAGGAGTGAGAGCCCCGGCGCAGCCTCACCGTGGCCAGGTCCTTCTG CGTAAGCCCCTTGCTCTGCCGACCTTGCTGGATCACCTTGCCCACCTCCA GGGTCACCCTGTCATGGTGCAGCTCCTCTGTCTCCCGGTCCAGCTTGGCC GTGTTCTTGGTAATAGAATGTTGTTTGTTCTGGCCAGCAGC (SEQ ID NO: 256) pARBI-652: CTCTTGCTTCAACAGTGTTTGGACGGAACAGATCCGGGGACTCTCTTCCA 2541 FTL_1_ GCCTCCGACCGCCCTCCGATTTCCTCTCCGCTTGCAACCTCCGGGACCAT Exogenous_2830 CTTCTCGGCCATCTCCTGCTTCTGGGACCTGCCAGCACCGTTTTTGTGGT TAGCTCCTTCTTGCCAACCAACCATGAGCTCCCAGATTCGTCAGAATTAT TCCACCGACGTGGAGGCAGCCGTCAACAGCCTGGTCAATTTGTACCTGCA GGCCTCCTACACCTACCTCTCTCTGGTGAGTCCCCAGGACGCCCCTGGCC CTAATTTCCTCCAGCTGCGCACCTCCGGCCCTCACTGCACGCGCCAGCCT TCTTTGTGCGGTCGGGTAAACAGAGGGCGGAGTCCCCTTGGCCTCGCCTC CCGCTAACCATTGTTGCCTCCATCTCTTCCCGTAGGGCTTCTATTTCGAC tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggggggcaggacagcaagggggaggattgggaagacaatagca ggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGGT GCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGGG GGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTAA ACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGGG GGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCAA CGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCTC CTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAGT CGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCGT CTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTCC CTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCTG CTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACAG ATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccacc ATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGGT CGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAGG GCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCACC ACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCTA CGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGACT TCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTTC TTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGGG CGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAGG ACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAAC GTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCAA GATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTACC AGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCAC TACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCGA TCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGCA TGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaaa taaagcaatagcatcacaaatttcacaaataaagcatttttttcactgca ttctagttgtggtttgtccaaactcatcaatgtatcttatCGCGATGATG TGGCTCTGGAAGGCGTGAGCCACTTCTTCCGCGAATTGGCCGAGGAGAAG CGCGAGGGCTACGAGCGTCTCCTGAAGATGCAAAACCAGCGTGGCGGCCG CGCTCTCTTCCAGGACATCAAGGTAACTAGTGTGTGGGTAATGGACTACA TCTCCCAGCAGGCCGTGCGCGCGAGGAGCCTTGATTTGAGGGCGTAGGTG TCGCGTGGGCTTCTGGGAGATTGAGTTCGGTCTTGTGAGCCCTCTTAACC GCTGGAAATAGAGGCGCACCTCGTGCAGTGCCCACAACACGCGGCAGTCC ACACCGCTGCGTGGTCTTAGGGACGTATAGCTGTAAGAGCTAGGACAGGG TGCGGAGAGTGATAAATACAAGCTGTCACATGTCTTTGTGGCCTGGGCCT CTGACCCCCAACGACTCTTGGGAAATGTAGGTTTAGTTCT (SEQ ID NO: 257) pARBI-653: TGGGGCGGGGGGCTGAGACTCCTATGTGCTCCGGATTGGTCAGGCACGGC 2541 FTL_2_ CTTCGGCCCCGCCTCCTGCCACCGCAGATTGGCCGCTAGCCCTCCCCGAG Exogenous_29 CGCCCTGCCTCCGAGGGCCGGCGCACCATAAAAGAAGCCGCCCTAGCCAC GTCCCCTCGCAGTTCGGCGGTCCCGCGGGTCTGTCTCTTGCTTCAACAGT GTTTGGACGGAACAGATCCGGGGACTCTCTTCCAGCCTCCGACCGCCCTC CGATTTCCTCTCCGCTTGCAACCTCCGGGACCATCTTCTCGGCCATCTCC TGCTTCTGGGACCTGCCAGCACCGTTTTTGTGGTTAGCTCCTTCTTGCCA ACCAACCATGAGCTCCCAGATTCGTCAGAATTATTCCACCGACGTGGAGG CAGCCGTCAACAGCCTGGTCAATTTGTACCTGCAGGCCTCCTACACCTAC tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggtggggcaggacagcaagggggaggattgggaagacaatagc aggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGG TGCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGG GGGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTA AACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGG GGGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCA ACGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCT CCTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAG TCGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCG TCTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTC CCTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCT GCTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACA GATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccac cATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGG TCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAG GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCT ACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGAC TTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTT CTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGG GCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAG GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAA CGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCA AGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTAC CAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCA CTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCG ATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGC ATGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaa ataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgc attctagttgtggtttgtccaaactcatcaatgtatcttatCTCTCTCTG GTGAGTCCCCAGGACGCCCCTGGCCCTAATTTCCTCCAGCTGCGCACCTC CGGCCCTCACTGCACGCGCCAGCCTTCTTTGTGCGGTCGGGTAAACAGAG GGCGGAGTCCCCTTGGCCTCGCCTCCCGCTAACCATTGTTGCCTCCATCT CTTCCCGTAGGGCTTCTATTTCGACCGCGATGATGTGGCTCTGGAAGGCG TGAGCCACTTCTTCCGCGAATTGGCCGAGGAGAAGCGCGAGGGCTACGAG CGTCTCCTGAAGATGCAAAACCAGCGTGGCGGCCGCGCTCTCTTCCAGGA CATCAAGGTAACTAGTGTGTGGGTAATGGACTACATCTCCCAGCAGGCCG TGCGCGCGAGGAGCCTTGATTTGAGGGCGTAGGTGTCGCGTGGGCTTCTG GGAGATTGAGTTCGGTCTTGTGAGCCCTCTTAACCGCTGGA (SEQ ID NO: 258) pARBI-654: CTTTGCATAGTTTATCCTATTAGTAATCTATTCTGTCTTTGGAATATGTT 2541 PTEN_1_ TTGTGATGATGAAATAAATACTATAAATAGTATTATTCCTTTTGCATTGA Exogenous_31 GAGTCCTGACGAAATGTCCATGTGACAGTTCATTTTGGGTTTAGCTCTAC CTCTAATATGTGACCTATGCTACCAGTCCGTATAGCGTAAATTCCCAGAA TATATCCTCCTGAATAAAATGGGGGAAAATAATACCTGGCTTCCTTAATG ATTATATTTAAGACTTATCAAGAGACTATTTTCTATTTAACAATTAGAAA GTTAAGCAATACATTATTTTTCTCTGGAATCCAGTGTTTCTTTTAAATAC CTGTTAAGTTTGTATGCAACATTTCTAAAGTTACCTACTTGTTAATTAAA AATTCAAGAGTTTTTTTTTCTTATTCTGAGGTTATCTTTTTACCACAGTT tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggtggggcaggacagcaagggggaggattgggaagacaatagc aggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGG TGCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGG GGGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTA AACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGG GGGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCA ACGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCT CCTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAG TCGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCG TCTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTC CCTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCT GCTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACA GATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccac cATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGG TCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAG GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCT ACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGAC TTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCG CACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGA AGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGAC TTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAA CAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGG TGAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCC GACCACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCC CGACAACCACTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACG AGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATC ACTCTCGGCATGGACGAGCTGTACAAatgattgtttattgcagcttataa tggttacaaataaagcaatagcatcacaaatttcacaaataaagcatttt tttcactgcattctagttgtggtttgtccaaactcatcaatgtatcttat GCACAATATCCTTTTGAAGACCATAACCCACCACAGCTAGAACTTATCAA ACCCTTTTGTGAAGATCTTGACCAATGGCTAAGTGAAGATGACAATCATG TTGCAGCAATTCACTGTAAAGCTGGAAAGGGACGAACTGGTGTAATGATA TGTGCATATTTATTACATCGGGGCAAATTTTTAAAGGCACAAGAGGCCCT AGATTTCTATGGGGAAGTAAGGACCAGAGACAAAAAGGTAAGTTATTTTT TGATGTTTTTCCTTTCCTCTTCCTGGATCTGAGAATTTATTGGAAAACAG ATTTTGGGTTTCTTTTTTTCCTTCAGTTTTATTGAGGTGTAATTGACAAG TAAAAATTATATATAAATACAATGTATAATATGATGTTTTGATGTATGTG TATATACATTGTGAAATGATTACTACAGTCAAACTACTTAACATATTCAT (SEQ ID NO: 259) pARBI-655: GCTACCAGTCCGTATAGCGTAAATTCCCAGAATATATCCTCCTGAATAAA 2541 PTEN_2_ ATGGGGGAAAATAATACCTGGCTTCCTTAATGATTATATTTAAGACTTAT Exogenous_32 CAAGAGACTATTTTCTATTTAACAATTAGAAAGTTAAGCAATACATTATT TTTCTCTGGAATCCAGTGTTTCTTTTAAATACCTGTTAAGTTTGTATGCA ACATTTCTAAAGTTACCTACTTGTTAATTAAAAATTCAAGAGTTTTTTTT TCTTATTCTGAGGTTATCTTTTTACCACAGTTGCACAATATCCTTTTGAA GACCATAACCCACCACAGCTAGAACTTATCAAACCCTTTTGTGAAGATCT TGACCAATGGCTAAGTGAAGATGACAATCATGTTGCAGCAATTCACTGTA AAGCTGGAAAGGGACGAACTGGTGTAATGATATGTGCATATTTATTACAT tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggggggcaggacagcaagggggaggattgggaagacaatagca ggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGGT GCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGGG GGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTAA ACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGGG GGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCAA CGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCTC CTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAGT CGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCGT CTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTCC CTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCTG CTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACAG ATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccacc ATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGGT CGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAGG GCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCACC ACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCTA CGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGACT TCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTTC TTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGGG CGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAGG ACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAAC GTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCAA GATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTACC AGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCAC TACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCGA TCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGCA TGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaaa taaagcaatagcatcacaaatttcacaaataaagcatttttttcactgca ttctagttgtggtttgtccaaactcatcaatgtatcttatCGGGGCAAAT TTTTAAAGGCACAAGAGGCCCTAGATTTCTATGGGGAAGTAAGGACCAGA GACAAAAAGGTAAGTTATTTTTTGATGTTTTTCCTTTCCTCTTCCTGGAT CTGAGAATTTATTGGAAAACAGATTTTGGGTTTCTTTTTTTCCTTCAGTT TTATTGAGGTGTAATTGACAAGTAAAAATTATATATAAATACAATGTATA ATATGATGTTTTGATGTATGTGTATATACATTGTGAAATGATTACTACAG TCAAACTACTTAACATATTCATCACCTCACATAATTATTATTCTCCCCCC AGGGTGAAAGCATTTAAGATCTACAAGCTACAATTTTCAATTATACAATG TTATTATTAACTATAGTCACTATGCTGTCCAGTAGAGCTTCAGATCTTGT TCATCTTGTGTTCCTCCCTCCCCACCCTCAGTCCCTGGAA (SEQ ID NO: 260) pARBI-656: ATAAATAGTATTATTCCTTTTGCATTGAGAGTCCTGACGAAATGTCCATG 2541 PTEN_3_ TGACAGTTCATTTTGGGTTTAGCTCTACCTCTAATATGTGACCTATGCTA Exogenous_33 CCAGTCCGTATAGCGTAAATTCCCAGAATATATCCTCCTGAATAAAATGG GGGAAAATAATACCTGGCTTCCTTAATGATTATATTTAAGACTTATCAAG AGACTATTTTCTATTTAACAATTAGAAAGTTAAGCAATACATTATTTTTC TCTGGAATCCAGTGTTTCTTTTAAATACCTGTTAAGTTTGTATGCAACAT TTCTAAAGTTACCTACTTGTTAATTAAAAATTCAAGAGTTTTTTTTTCTT ATTCTGAGGTTATCTTTTTACCACAGTTGCACAATATCCTTTTGAAGACC ATAACCCACCACAGCTAGAACTTATCAAACCCTTTTGTGAAGATCTTGAC tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggtggggcaggacagcaagggggaggattgggaagacaatagc aggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGG TGCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGG GGGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTA AACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGG GGGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCA ACGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCT CCTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAG TCGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCG TCTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTC CCTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCT GCTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACA GATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccac cATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGG TCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAG GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCT ACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGAC TTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTT CTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGG GCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAG GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAA CGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCA AGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTAC CAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCA CTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCG ATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGC ATGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaa ataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgc attctagttgtggtttgtccaaactcatcaatgtatcttatCAATGGCTA AGTGAAGATGACAATCATGTTGCAGCAATTCACTGTAAAGCTGGAAAGGG ACGAACTGGTGTAATGATATGTGCATATTTATTACATCGGGGCAAATTTT TAAAGGCACAAGAGGCCCTAGATTTCTATGGGGAAGTAAGGACCAGAGAC AAAAAGGTAAGTTATTTTTTGATGTTTTTCCTTTCCTCTTCCTGGATCTG AGAATTTATTGGAAAACAGATTTTGGGTTTCTTTTTTTCCTTCAGTTTTA TTGAGGTGTAATTGACAAGTAAAAATTATATATAAATACAATGTATAATA TGATGTTTTGATGTATGTGTATATACATTGTGAAATGATTACTACAGTCA AACTACTTAACATATTCATCACCTCACATAATTATTATTCTCCCCCCAGG GTGAAAGCATTTAAGATCTACAAGCTACAATTTTCAATTAT (SEQ ID NO: 261) pARBI-657: TAGTGACGGTATCAAGAGCAGCATCCCAGGTTGACTTTCTTTGGGGCAGG 2541 PTPN2_1_ ACCTTGCTTGTGTCTTAACCCTTGGTTCCTACccaagtttgtctctctag Exogenous_34 tcctcaactctgaaagccacttgatatcttacacaattccttttttgcct gagaaaaaaaagtcagttttgatcatttacaaccaaaaatcctaactaac acaGACCTGTTTTTGAAATCAGGTAGATTGGAAGTCTTGGTTTACTTCTT ATAAGCCCGTCCGCTGTCTGTCTTGTAACATCCCAGGAGCAGACTTAACA AGGCCTTCCTGGAGCCTTGCTCTTTCTGTCTGCTTCCCTCATCTGctctc tctcctctctctTCACAGGGTCCACTTCCTAACACATGCTGCCATTTCTG GCTTATGGTTTGGCAGCAGAAGACCAAAGCAGTTGTCATGCTGAACCGCt gacgactgtgccttctagttgccagccatctgttgtttgcccctcccccg tgccttccttgaccctggaaggtgccactcccactgtcctttcctaataa aatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctggg gggtggggtggggcaggacagcaagggggaggattgggaagacaatagca ggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGGT GCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGGG GGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTAA ACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGGG GGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCAA CGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCTC CTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAGT CGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCGT CTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTCC CTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCTG CTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACAG ATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccacc ATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGGT CGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAGG GCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCACC ACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCTA CGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGACT TCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTTC TTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGGG CGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAGG ACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAAC GTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCAA GATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTACC AGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCAC TACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCGA TCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGCA TGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaaa taaagcaatagcatcacaaatttcacaaataaagcatttttttcactgca ttctagttgtggtttgtccaaactcatcaatgtatcttatATTGTGGAGA AAGAATCGGTGAGTAATATTTAATATTTACAACTTAGTATACTGTATGTG ATCATTAGCATATAAAGATTTTCATTTTGGGGCCAATATTCATTCCTTAG TTTTGGCCTTTAATTGCTAAAGGCTAGCGGTCAAAGTGTTTCTGTCCGGA AAAACTCATGTATCCTTTTCCTTTCTTAAGTATTTTTATAACAATTGCAG ACCTTTCATCTCCAAGATAGAAATACAATGGAATAAATATGGACTAGCAG TGACATATAGATCTTTGGAATCCCTAGGAATCCTTATTGACCTATTGTCA ATAAGTCCTCTTCCAGCTTTAAGATCTTGTACTGTTCTACTTCAGATTTC TTACTTTATTCTACATTTCATAATTGCTGGTTTTTAATGATGATGAAAAT TTCCTAATCCACTTCTAATTAGATGTGGCAATGACATGCC (SEQ ID NO: 262) pARBI-658: CAGAATTTGAATATGATGTGGCTGACCATAGATACCTCCATTGCATTAAT 2541 PTPN2_2_ TCTGCTTCATAAAGTGTTTGATGCTTGCTTGTAGGTGTTGAATAGAGCCT Exogenous_35 AATTAGCAGTAAGTAAACAGATCTagagcacagactttggcatctgaagg accttggttcaatgctgggactgccacttactagctatgggatctttggt ggatttttaacctttgaaagcctttcatccctcacagatataatggggaa gaaaatcgtttccattagtattgtgaggattaaatgattaattgatgtaa acaatgtgctttccatagtacttggcatatcataagtgcttcatGTTATT TTACGCTGGCTGGGAAGATAAGTTTTGCTGTGGAGAATTTAAGAGGGATA TTAAAATATATTTTTGTTATTTTAAGGAAATTCGAAATGAGTCCCATGAC tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggggggtggggcaggacagcaagggggaggattgggaagacaatagca ggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGGT GCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGGG GGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTAA ACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGGG GGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCAA CGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCTC CTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAGT CGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCGT CTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTCC CTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCTG CTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACAG ATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccacc ATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGGT CGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAGG GCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCACC ACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCTA CGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGACT TCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTTC TTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGGG CGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAGG ACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAAC GTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCAA GATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTACC AGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCAC TACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCGA TCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGCA TGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaaa taaagcaatagcatcacaaatttcacaaataaagcatttttttcactgca ttctagttgtggtttgtccaaactcatcaatgtatcttatTATCCTCATA GAGTGGCCAAGTTTCCAGAAAACAGAAATCGAAACAGATACAGAGATGTA AGCCCATGTAAGTACTTGTGGGTTTGTGTGCATGTGTATTTTTTGGTTTG TTTTAGGAGGCAGGATGTAGTGAAAAGGAGGGCTTGGGGTCAGTGTTGAT AAATACAGTAATTTTTTTCCTATTTACGTAAGACACGTTCTTTAAGAATT TAAAGGTGTAGAATAGTGGAAGGAAAAATAATTACCCATAATTATTTCAT CCTTCAAACATAGGCACCATTATTTTGGTGTATTTCCCACAACCTGTTTT TCTTATGCACAGTTTTCCTTTCTTTAAAATAATTGTAATTATAGATGTCT ATATAATATCCATATGTATTTCCTTTTCATATCATTCCATCCCCTTTGTA AGTTTTGCTTAAAATAGTTATTGGTGTAGTTTGGTTTTTT (SEQ ID NO: 263) pARBI-659: ACCAGTTATCGTTTTGTAGGTAGACCAATCAATTCTTAGGTACCAAGAAT 2541 PTPN2_3_ CTGAATGATCCTTTTACTTTTAGGAAGGTAAGTACTTTATTCATTTAACT Exogenous_36 ATGTTTACTCCTGGTCGATTTTTCAAGTCACAATGGCTAATGTGCTACAA ACAGAAGTGCTTCCTTTCCAGCACTATAGTAATAAACAAGATTGCATTTT ATACTCCTTACAAAAAAAGTAGAGAATAGTTTAGGATTTGTCTCAACTCT ATTATGATGCTAATTCATTTTTCTTATTTCTTCTGTCTTTTTATAAATAC CTAGAATATTAATATGAAGATAAAAACAGTAATTTTGAAATGACAGTTGT GGTTTATCATTCTCTATTTTCAGATGATCACAGTCGTGTTAAACTGCAAA ATGCTGAGAATGATTATATTAATGCCAGTTTAGTTGACATAGAAGAGGCA tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggggggcaggacagcaagggggaggattgggaagacaatagca ggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGGT GCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGGG GGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTAA ACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGGG GGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCAA CGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCTC CTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAGT CGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCGT CTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTCC CTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCTG CTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACAG ATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccacc ATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGGT CGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAGG GCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCACC ACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCTA CGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGACT TCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTTC TTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGGG CGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAGG ACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAAC GTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCAA GATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTACC AGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCAC TACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCGA TCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGCA TGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaaa taaagcaatagcatcacaaatttcacaaataaagcatttttttcactgca ttctagttgtggtttgtccaaactcatcaatgtatcttatCAAAGGAGTT ACATCTTAACACAGGCAAGTAATACGATACCACGCAAATGTCTGAAAATG ATGTGTTTGTGCTTGTTCTGCTTTAAATCATAGGTAAAAGTATATCTTGA CTTCTTTTGGAAATGAAATGGTATTTCAGTTTTTTTCTGACGATGTAATG AATTATGGACATTATAAGGTTTTGAAGCTTTGAGTATTTAAGATAAAAGG CAAGTTATTTTTGATATTACACGTTCTGTGAAGGAAAATTCTTAGGAAAT GGCTTAGGCCAGTTCTTTGGCAGATTGTGTTCCTTACATTATATCTGACA CAGAGTGCTGCTTATGCTTCTTAGTGTCTTCTTTTCTCTTCCCATACCCT GTGGCAAAACCAGAGGCCCTGGACAGCTCTTCTGTAGCTCCCCCTGCCTC GCATATATCTTCAGAGAGTACACCAAGCCCTGGATGGTGT (SEQ ID NO: 264) pARBI-660: CCTGCTCAAGGGCCGAGGTGTCCACGGTAGCTTCCTGGCTCGGCCCAGTC 2541 PTPN6_1_ GCAAGAACCAGGGTGACTTCTCGCTCTCCGTCAGGTAGGTGGGCCCCCCG Exogenous_37 CAACCCCGGGCATTTTGGCCACTCTCTTGTGCCATCCAGGCCCTGAACCA CTCATTCCTGGTTCCCCGTGGCAGTGCTGACTCCCCGTCTGTTCCCTTGC CCCCAACCCCCACACTCCCCATCCCTGTCTGTGCCCACCCATGCCCATGT GTGCCCCCACCCAGGACCTCAGCCGATCCCTGCCCTCCTGCCTCTACTCC TGCACCGACTGGCCTCACCGCCTGGTGCCCTGCAGGGTGGGGGATCAGGT GACCCATATTCGGATCCAGAACTCAGGGGATTTCTATGACCTGTATGGAG GGGAGAAGTTTGCGACTCTGACAGAGCTGGTGGAGTACTACACTCAGCAG tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggtggggcaggacagcaagggggaggattgggaagacaatagc aggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGG TGCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGG GGGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTA AACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGG GGGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCA ACGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCT CCTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAG TCGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCG TCTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTC CCTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCT GCTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACA GATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccac cATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGG TCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAG GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCT ACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGAC TTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTT CTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGG GCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAG GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAA CGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCA AGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTAC CAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCA CTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCG ATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGC ATGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaa ataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgc attctagttgtggtttgtccaaactcatcaatgtatcttatCAGGGTGTC CTGCAGGACCGCGACGGCACCATCATCCACCTCAAGTACCCGCTGAACTG CTCCGATCCCACTAGTGAGAGGTGAGGGCTCCGCACCCCCGCCATTCCCA AGCAGGGATGAGCCGGCTCCCACCCTGAACAGCCAGGGAGGCAGGGAGAC TGGCAGCCGGCGCTGCCTACCCTCCATCCCCTCCCCTCCCTGCACCAGCT GGGGCTCTCAATGTCCCTCCTCCCTGCTGTCCTGGGACCTGGTGTCTCAG AGCCTAACCTACCACCCTTTCCACCTAACCCCGAGGAAGCCACAGAAAGC TGCCTCGCCCTACTCCGGGAGCCCTGGCCGCTGCAACCCAGGTCCCACTG GAGACAGGGAGGCCACTGCTGGTGGCCAGCATGTCGTGCAGGCCAGCTCT GTTGTTAGAAAGCTCTTCTTCCTCTGGAATCGAGCCTGCCT (SEQ ID NO: 265) pARBI-661: CTGGGTTCGAAGCCCGGTTAGAACTCTGGAGGCTAGGATGGCTTGAACCT 2541 PTPN6_2_ GGGAGGTCGAGGCTGCAGAGAGCTGTAACCGCGCCACTGCACTCCAGCCT Exogenous_38 GGGCAACAGAGCTCTGGAAGCTTGCCCTAGAGTCAGTCAAGGGCCCTAGG CCAGTGAGTAACAGCTCAGCGTCAGTTTCCTCATCTATAAAATGGGGGTA ATATCATACCTAGCTCTCAGCATGTTTGTGAGAGACCTAAATGAGGTGGT GGATTTGGAAGCATGTAGCGCAGTGCCTGGCACACAGTAGGTGCTTGATT TCCGGCCCCTCTCTGTGAATGTCTCTGCTCAGCGCCTTCCCCTGTGGCCT GGGTCTTACCTTCCCTGACGCTGCCTTCTCTAGGTGGTACCATGGCCACA TGTCTGGCGGGCAGGCAGAGACGCTGCTGCAGGCCAAGGGCGAGCCCTGG tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggtggggcaggacagcaagggggaggattgggaagacaatagc aggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGG TGCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGG GGGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTA AACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGG GGGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCA ACGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCT CCTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAG TCGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCG TCTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTC CCTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCT GCTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACA GATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccac cATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGG TCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAG GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCT ACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGAC TTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTT CTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGG GCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAG GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAA CGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCA AGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTAC CAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCA CTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCG ATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGC ATGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaa ataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgc attctagttgtggtttgtccaaactcatcaatgtatcttatACGTTTCTT GTGCGTGAGAGCCTCAGCCAGCCTGGAGACTTCGTGCTTTCTGTGCTCAG TGACCAGCCCAAGGCTGGCCCAGGCTCCCCGCTCAGGGTCACCCACATCA AGGTCATGTGCGAGGTAAGGCAGCCAGGCGGCGGGGGAGCCTCTGCTGAG GCTCCTGTCTGTGACCACAGTGTGGGTGGCAGGGAGGGTCTGCCTGGGCT TGAATTCAAGGCTGGGGACCCAGGGAGGGAGACTCAAGTCCTGTGAATGG CCTAATTTGGCTCCCCCCAGGGTGGACGCTACACAGTGGGTGGTTTGGAG ACCTTCGACAGCCTCACGGACCTGGTGGAGCATTTCAAGAAGACGGGGAT TGAGGAGGCCTCAGGCGCCTTTGTCTACCTGCGGCAGGTCAGGGGTGGGC CCAGCTGCCTCCCCACTTCCCCTGAGCTGTCCCCCAGATGT (SEQ ID NO: 266) pARBI-662: TGTCACATATGTGCAATGCCATGCTCCTGAGCCTTTGATTGCAGACGTGT 2541 PTPN6_3_ GGGAAGTGGGCCCCGTCCCCACCCCCAGTGCCACCCTGCTCTGCTTCTCT Exogenous_39 TCCCTTGCTGTGCTCTAAAACGAGAAGTACAAGTGAGTTCCCCCAAGGGG TCGGCCGCGCCTCTTCCTGTCCCCGCCCTGCCGGCTGCCCCAGGCCAGTG GAGTGGCAGCCCCAGAACTGGGACCACCGGGGGTGGTGAGGCGGCCCGGC ACTGGGAGCTGCATCTGAGGCTTAGTCCCTGAGCTCTCTGCCTGCCCAGA CTAGCTGCACCTCCTCATTCCCTGCGCCCCCTTCCTCTCCGGAAGCCCCC AGGATGGTGAGGTAAGGGCCTGCCACCCACGGTAGACAGGAGGCAAGGGT GCCTGGTGCCCACGGGACCCCTCCTCACTGCCCTGCCTGGGCCGCCCAGG tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggtggggcaggacagcaagggggaggattgggaagacaatagc aggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGG TGCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGG GGGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTA AACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGG GGGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCA ACGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCT CCTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAG TCGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCG TCTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTC CCTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCT GCTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACA GATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccac cATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGG TCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAG GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCT ACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGAC TTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTT CTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGG GCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAG GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAA CGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCA AGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTAC CAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCA CTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCG ATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGC ATGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaa ataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgc attctagttgtggtttgtccaaactcatcaatgtatcttatTGGTTTCAC CGAGACCTCAGTGGGCTGGATGCAGAGACCCTGCTCAAGGGCCGAGGTGT CCACGGTAGCTTCCTGGCTCGGCCCAGTCGCAAGAACCAGGGTGACTTCT CGCTCTCCGTCAGGTAGGTGGGCCCCCCGCAACCCCGGGCATTTTGGCCA CTCTCTTGTGCCATCCAGGCCCTGAACCACTCATTCCTGGTTCCCCGTGG CAGTGCTGACTCCCCGTCTGTTCCCTTGCCCCCAACCCCCACACTCCCCA TCCCTGTCTGTGCCCACCCATGCCCATGTGTGCCCCCACCCAGGACCTCA GCCGATCCCTGCCCTCCTGCCTCTACTCCTGCACCGACTGGCCTCACCGC CTGGTGCCCTGCAGGGTGGGGGATCAGGTGACCCATATTCGGATCCAGAA CTCAGGGGATTTCTATGACCTGTATGGAGGGGAGAAGTTTG (SEQ ID NO: 267) pARBI-663: TCTTTAGCTAATGTTCTGTAGCTTGTTTTAGATCTGTGTCTACATATTAT 2541 PTPRC_1_ ATGTATCTCTGTATGCATCAATATTTGTATATATGCATGCATATATGTGT Exogenous_40 ATGTATATTTATGAATACATATACACACCATATACATACACTTATGTATG GTGTGTGCATATGTATGTGTAGATATATGTACATACACACTATAGTGGAC TGGGGAGTTAGTATACTGGGAGGAGCATACATTTAGGGTATGATTCACAT ATTTATTTTGTCCTTCTCCCATTTTCCATTAATTAACAGGATTGACTACA GCAAAGATGCCCAGTGTTCCACTTTCAAGTGACCCCTTACCTACTCACAC CACTGCATTCTCACCCGCAAGCACCTTTGAAAGAGAAAATGACTTCTCAG AGACCACAACTTCTCTTAGTCCAGACAATACTTCCACCCAAGTATCCCCG tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggtggggcaggacagcaagggggaggattgggaagacaatagc aggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGG TGCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGG GGGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTA AACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGG GGGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCA ACGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCT CCTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAG TCGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCG TCTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTC CCTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCT GCTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACA GATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccac cATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGG TCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAG GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCT ACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGAC TTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTT CTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGG GCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAG GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAA CGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCA AGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTAC CAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCA CTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCG ATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGC ATGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaa ataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgc attctagttgtggtttgtccaaactcatcaatgtatcttatGACTCTTTG GATAATGCTAGTGCTTTTAATACCACAGGTTGGCACACAAAAGTTGTTAA CTTAAATATCAGGGAATGTCATTTAGAAAATTCTACAGTTATCAGTACAA CTTGTCTTTAAATTATTTGCACAGTTTCTAAGTATGTGATTTTATTCAAG TGCAGAAATTGCAGGAAATTAGTATCTGTGAAATATAGATCGACTGAAGT AATTAATGCATGTTGTTAGGGAGTGGAGAGAGAAAAGAAGGGAAGCAAGA TCTCCAAGGACAATCAGGAGGGGAAATTTGTTCTAGTATCCTCTGATCTA TACACACTCGCCATGATTCTCCCGCTTGGCTTCCCGCCACCTGGACAGAT GAGAATTTCCCTAGTTCAGAGATTATCAGTTCTACTCTCCTTGGAAGGTG TCTTAAATGGGAGTCTTCCCATTTCTTTGTTTCACTCTAGA (SEQ ID NO: 268) pARBI-664: AGGGAATTTTTATATGTTGGTCATATTATATCCCTAGCATGTGGCATAAT 2541 PTPRC_2_ GTCTAGCACAAAATAGGTGCTCAATTAATCATCATTATCTAAATAAATAA Exogenous_41 TGCATTTGGGAAAAAAAAGTTTCAAAAGTTTTTCAAAAGTCTTTTGCAGG CTTGAAATTAATCCCAATAGTGATCCTTTAGTCTGTTATGTTTCTGATTT AGCCTGGGGATTCAAAAAATAAATAACATAATTTTGATATATTTGGGCTT TGTAAACATGGTACTAAGAGAGGAAATATAGTTTCATTAGGGTAAAAGCT ACTGAAAATTGCCACTTGGTGAATGTTCTATCATAGACTTGAGGTACATA TAAAAATCTAATATATGTTTACATTAATATGAATGAAATTTGAAATTTTC TAAGAGATTTTTGTTTCTTCTTTGCAGGGCAAAGCCCAACACCTTCCCCC tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggtggggcaggacagcaagggggaggattgggaagacaatagc aggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGG TGCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGG GGGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTA AACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGG GGGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCA ACGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCT CCTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAG TCGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCG TCTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTC CCTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCT GCTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACA GATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccac cATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGG TCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAG GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCT ACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGAC TTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTT CTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGG GCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAG GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAA CGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCA AGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTAC CAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCA CTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCG ATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGC ATGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaa ataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgc attctagttgtggtttgtccaaactcatcaatgtatcttatACTGGTAAG AATTAATATTTATATTTTTACTAATTTTATTTTCTTGTTGCAAAGTTTAT ATATTTAACTACAATTTTCTATTATTAACACTGAAATTATTTTTAAGGAT AAATTTTATAATCATGAGTGATTCTTGACATTCACTTGTTCTTAAACTTT CTGCTTATACGTTATAGAGTTTAATAACTACCTAAACATGTTATTAAATT TGTATATATATTTTGTGTATAAATAGTAACTTTTCCCAAACTTGACAGTA AATCACACAACAGGTTTCTACTCTCTTTTAATATTTTAAGACTATAAAAA AATGCATTTAAATTAGATAACAAAATTTTATAGTCTGAAAGCAGGTTAAC AGCTGTCTATGTATGTTATAGATATGTAGATAACAGATTTGCATATGTCT ATATTTCTTTAAGAGTATGTTGCTTTTTTCAATGGTATGCA (SEQ ID NO: 269) pARBI-665: CCTTTAGCACCATAAAGAAACTAAATTATTTAGATGTTTTTATGAGAACA 2541 PTPRC_3_ TATCAAAAAGTACTTTTCTGTCATCCAATACTTCCACAAATAAATCATTA Exogenous_42 GTTCTTGCTAATCTTCATCTGGCATAAAAATAATGACATCAACTTTCTTC ATGTAATTTCCCACTTAATTCCTTTACTAGGAGCAATATCAATTCCTATA TGACGTCATTGCCAGCACCTACCCTGCTCAGAATGGACAAGTAAAGAAAA ACAACCATCAAGAAGATAAAATTGAATTTGATAATGAAGTGGACAAAGTA AAGCAGGATGCTAATTGTGTTAATCCACTTGGTGCCCCAGAAAAGCTCCC TGAAGCAAAGGAACAGGCTGAAGGTTCTGAACCCACGAGTGGCACTGAGG GGCCAGAACATTCTGTCAATGGTCCTGCAAGTCCAGCTTTAAATCAAGGT tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggtggggcaggacagcaagggggaggattgggaagacaatagc aggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGG TGCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGG GGGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTA AACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGG GGGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCA ACGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCT CCTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAG TCGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCG TCTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTC CCTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCT GCTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACA GATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccac cATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGG TCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAG GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCT ACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGAC TTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTT CTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGG GCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAG GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAA CGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCA AGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTAC CAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCA CTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCG ATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGC ATGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaa ataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgc attctagttgtggtttgtccaaactcatcaatgtatcttatTCATAGGAA AAGACATAAATGAGGAAACTCCAAACCTCCTGTTAGCTGTTATTTCTATT TTTGTAGAAGTAGGAAGTGAAAATAGGTATACAGTGGATTAATTAAATGC AGCGAACCAATATTTGTAGAAGGGTTATATTTTACTACTGTGGAAAAATA TTTAAGATAGTTTTGCCAGAACAGTTTGTACAGACGTATGCTTATTTTAA AATTTTATCTCTTATTCAGTAAAAAACAACTTCTTTGTAATCGTTATGTG TGTATATGTATGTGTGTATGGGTGTGTGTTTGTGTGAGAGACAGAGAAAG AGAGAGAATTCTTTCAAGTGAATCTAAAAGCTTTTGCTTTTCCTTTGTTT TTATGAAGAAAAAATACATTTTATATTAGAAGTGTTAACTTAGCTTGAAG GATCTGTTTTTAAAAATCATAAACTGTGTGCAGACTCAATA (SEQ ID NO: 270) pARBI-666: aaggaaaaaggcactgagtgctggggggtgctggggtgggctgcagtgat PTPRCAP_1_ agacatcagggtagaggttaaggtcaggttcagcctcactggggtgaagt Exogenous_43 ttgagcacggtgagcaggccatgcagcccgggggaggggaggatgggagg aggtggagctttccgggcagagggaacagccagtgcgaaggccccaggca ggtggcttaatgcagctgttgggggaggtgagtggtagggaggaggctgg agggatgggggctgatctcacagggccagagcctggttgaccaaataagg ccttggccttttctGCTTGGCTGTCCCAAGAGGATCCCAAAGAGAAAAAA ACGAAAGTGGTCTTGGTCACCCAGCCTGCCCCACACCAGGCCCCACCCCA GGTGCTGAGCCCTCTGAGCCCCTGCCTGTCTCCCACAGGCTCTGCCCTGC tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggggggcaggacagcaagggggaggattgggaagacaatagca ggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGGT GCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGGG GGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTAA ACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGGG GGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCAA CGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCTC CTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAGT CGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCGT CTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTCC CTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCTG CTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACAG ATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccacc ATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGGT CGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAGG GCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCACC ACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCTA CGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGACT TCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTTC TTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGGG CGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAGG ACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAAC GTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCAA GATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTACC AGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCAC TACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCGA TCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGCA TGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaaa taaagcaatagcatcacaaatttcacaaataaagcatttttttcactgca ttctagttgtggtttgtccaaactcatcaatgtatcttatACCTTAGGGC TCGGGATGCTGCTGGCCCTGCCAGGGGCCTTGGGCTCGGGTGGCAGCGCG GAGGACAGCGTGGGCTCCAGCTCTGTCACCGTTGTCctgctgctgctgct gctcctactgctgGCCACTGGCCTAGCACTGGCCTGGCGCCGCCTCAGCC GTGACTCAGGGGGCTACTACCACCCGGCCCGCCTAGGTGCCGCGCTGTGG GGCCGCACGCGGCGCCTGCTCTGGGCCAGCCCCCCAGGTCGCTGGCTGCA GGCCCGAGCTGAGCTGGGGTCCACAGACAATGACCTTGAGCGACAGGAGG ATGAGCAGGACACAGACTATGACCACGTCGCGGATGGTGGCCTGCAGGCT GACCCTGGGGAAGGCGAGCAGCAATGTGGAGAGGCGTCCAGCCCAGAGCA GGTCCCCGTGCGGGCTGAGGAAGCCAGAGACAGTGACACG 2541 (SEQ ID NO: 271) pARBI-667: aaggtcaggttcagcctcactggggtgaagtttgagcacggtgagcaggc 2541 PTPRCAP_2_ catgcagcccgggggaggggaggatgggaggaggtggagctttccgggca Exogenous_44 gagggaacagccagtgcgaaggccccaggcaggtggcttaatgcagctgt tgggggaggtgagtggtagggaggaggctggagggatgggggctgatctc acagggccagagcctggttgaccaaataaggccttggccttttctGCTTG GCTGTCCCAAGAGGATCCCAAAGAGAAAAAAACGAAAGTGGTCTTGGTCA CCCAGCCTGCCCCACACCAGGCCCCACCCCAGGTGCTGAGCCCTCTGAGC CCCTGCCTGTCTCCCACAGGCTCTGCCCTGCACCTTAGGGCTCGGGATGC TGCTGGCCCTGCCAGGGGCCTTGGGCTCGGGTGGCAGCGCGGAGGACAGC tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggtggggcaggacagcaagggggaggattgggaagacaatagc aggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGG TGCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGG GGGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTA AACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGG GGGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCA ACGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCT CCTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAG TCGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCG TCTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTC CCTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCT GCTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACA GATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccac cATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGG TCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAG GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCT ACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGAC TTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTT CTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGG GCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAG GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAA CGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCA AGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTAC CAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCA CTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCG ATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGC ATGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaa ataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgc attctagttgtggtttgtccaaactcatcaatgtatcttatGTGGGCTCC AGCTCTGTCACCGTTGTCctgctgctgctgctgctcctactgctgGCCAC TGGCCTAGCACTGGCCTGGCGCCGCCTCAGCCGTGACTCAGGGGGCTACT ACCACCCGGCCCGCCTAGGTGCCGCGCTGTGGGGCCGCACGCGGCGCCTG CTCTGGGCCAGCCCCCCAGGTCGCTGGCTGCAGGCCCGAGCTGAGCTGGG GTCCACAGACAATGACCTTGAGCGACAGGAGGATGAGCAGGACACAGACT ATGACCACGTCGCGGATGGTGGCCTGCAGGCTGACCCTGGGGAAGGCGAG CAGCAATGTGGAGAGGCGTCCAGCCCAGAGCAGGTCCCCGTGCGGGCTGA GGAAGCCAGAGACAGTGACACGGAGGGCGACCTGGTCCTCGGCTCCCCAG GACCAGCGAGCGCAGGGGGCAGTGCTGAGGCCCTGCTGAGT (SEQ ID NO: 272) pARBI-668: cagtgcgaaggccccaggcaggtggcttaatgcagctgttgggggaggtg 2541 PTPRCAP_3_ agtggtagggaggaggctggagggatgggggctgatctcacagggccaga Exogenous_45 gcctggttgaccaaataaggccttggccttttctGCTTGGCTGTCCCAAG AGGATCCCAAAGAGAAAAAAACGAAAGTGGTCTTGGTCACCCAGCCTGCC CCACACCAGGCCCCACCCCAGGTGCTGAGCCCTCTGAGCCCCTGCCTGTC TCCCACAGGCTCTGCCCTGCACCTTAGGGCTCGGGATGCTGCTGGCCCTG CCAGGGGCCTTGGGCTCGGGTGGCAGCGCGGAGGACAGCGTGGGCTCCAG CTCTGTCACCGTTGTCctgctgctgctgctgctcctactgctgGCCACTG GCCTAGCACTGGCCTGGCGCCGCCTCAGCCGTGACTCAGGGGGCTACTAC tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggtggggcaggacagcaagggggaggattgggaagacaatagc aggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGG TGCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGG GGGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTA AACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGG GGGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCA ACGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCT CCTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAG TCGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCG TCTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTC CCTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCT GCTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACA GATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccac cATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGG TCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAG GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCT ACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGAC TTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTT CTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGG GCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAG GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAA CGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCA AGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTAC CAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCA CTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCG ATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGC ATGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaa ataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgc attctagttgtggtttgtccaaactcatcaatgtatcttatCACCCGGCC CGCCTAGGTGCCGCGCTGTGGGGCCGCACGCGGCGCCTGCTCTGGGCCAG CCCCCCAGGTCGCTGGCTGCAGGCCCGAGCTGAGCTGGGGTCCACAGACA ATGACCTTGAGCGACAGGAGGATGAGCAGGACACAGACTATGACCACGTC GCGGATGGTGGCCTGCAGGCTGACCCTGGGGAAGGCGAGCAGCAATGTGG AGAGGCGTCCAGCCCAGAGCAGGTCCCCGTGCGGGCTGAGGAAGCCAGAG ACAGTGACACGGAGGGCGACCTGGTCCTCGGCTCCCCAGGACCAGCGAGC GCAGGGGGCAGTGCTGAGGCCCTGCTGAGTGACCTGCACGCCTTTGCTGG CAGCGCAGCCTGGGATGACAGCGCCAGGGCAGCTGGGGGCCAGGGCCTCC ATGTCACCGCACTGTAGAGGCCGGTCTTGGTGTCCCATCCC (SEQ ID NO: 273) pARBI-669: ATAGAGTAGGGCGGGGGATGCCATGGAGAGGCTCCATGGGGGAGGGCCGG 2541 RPS23_1_ GGAAGCGCCGCTCCAGGAGGCACGTGGTCCGGCGCGGAAGGGGCCCATGA Exogenous_46 GGCGTGGAGGCCGCCGAGGTCGGGGTACCGAGGGACGCAGGGAGGCCAGC GCTTCCTCCCGGGCATTCGAGCGGGGCCTCGTCCTTCGGGAGAACACATT CTCCGGAGCCCTCTTCGAACGTTTATTAGTCGGTTCAGGGCAACTTGAAG GCCAAATGTTTGGCCCACAGGCCAATAAATAGTACGAGAGCCAATCGGCT TAAGGGTTTATTCCAGGTGAGGCGAGTGTCTTAGAAGATGGGAAACACGT AGATGGCGTGTTTTTACGGAAGAACTAAAATATTTAATTTTTAGGCAAGT GTCGTGGACTTCGTACTGCTAGGAAGCTCCGTAGTCACCGACGAGACCAG tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggtggggcaggacagcaagggggaggattgggaagacaatagc aggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGG TGCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGG GGGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTA AACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGG GGGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCA ACGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCT CCTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAG TCGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCG TCTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTC CCTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCT GCTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACA GATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccac cATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGG TCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAG GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCT ACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGAC TTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTT CTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGG GCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAG GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAA CGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCA AGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTAC CAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCA CTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCG ATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGC ATGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaa ataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgc attctagttgtggtttgtccaaactcatcaatgtatcttatAAGTGGCAT GATAAACAGTATAAGAAAGCTCATTTGGGCACAGCCCTAAAGGCCAACCC TTTTGGAGGTGCTTCTCATGCAAAAGGAATCGTGCTGGAAAAAGTGTAAG TCCATTGCTCCCGTCAAGTTTTAGTTTATTATAGGAATTCGAGACATGAA CTTACGAATTCTTGTTTTGAAAGTAATTGCAGGTTTTTGTGTAGTAGTAT TCATTTGGGCATTGTGGGGTAAAATTGCAAAGCGTTTGTTCTATTTAAAA GTTGGTAAAATTAGTTTTTGGGAATTAGGTAGTTAAGGTTTTAATTTAAC GTTGGCCTGGAAGGAATTGGAGAAGATACTAGCAATGATGAAGTAAAGGA CACAAACACCTTTACTGTGGGAGTTGTTATAAGTAAATGGCACGTGTCAG CTATTGAACTTTATCGACTTGATAAAACTAAGGTGAAGAGA (SEQ ID NO: 274) pARBI-670: CAAGCGGGACTTGGGGTCTTGGGGACGGGCGGGCGGATGCGAATAGAGTA 2541 RPS23_2_ GGGCGGGGGATGCCATGGAGAGGCTCCATGGGGGAGGGCCGGGGAAGCGC Exogenous_ CGCTCCAGGAGGCACGTGGTCCGGCGCGGAAGGGGCCCATGAGGCGTGGA 47_48 GGCCGCCGAGGTCGGGGTACCGAGGGACGCAGGGAGGCCAGCGCTTCCTC CCGGGCATTCGAGCGGGGCCTCGTCCTTCGGGAGAACACATTCTCCGGAG CCCTCTTCGAACGTTTATTAGTCGGTTCAGGGCAACTTGAAGGCCAAATG TTTGGCCCACAGGCCAATAAATAGTACGAGAGCCAATCGGCTTAAGGGTT TATTCCAGGTGAGGCGAGTGTCTTAGAAGATGGGAAACACGTAGATGGCG TGTTTTTACGGAAGAACTAAAATATTTAATTTTTAGGCAAGTGCCGCGGA tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggtggggcaggacagcaagggggaggattgggaagacaatagc aggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGG TGCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGG GGGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTA AACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGG GGGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCA ACGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCT CCTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAG TCGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCG TCTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTC CCTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCT GCTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACA GATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccac cATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGG TCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAG GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCT ACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGAC TTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTT CTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGG GCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAG GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAA CGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCA AGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTAC CAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCA CTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCG ATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGC ATGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaa ataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgc attctagttgtggtttgtccaaactcatcaatgtatcttatCTTCGTACT GCTAGGAAGCTCCGTAGTCACCGACGAGACCAGAAGTGGCATGATAAACA GTATAAGAAAGCTCATTTGGGCACAGCCCTAAAGGCCAACCCTTTTGGAG GTGCTTCTCATGCAAAAGGAATCGTGCTGGAAAAAGTGTAAGTCCATTGC TCCCGTCAAGTTTTAGTTTATTATAGGAATTCGAGACATGAACTTACGAA TTCTTGTTTTGAAAGTAATTGCAGGTTTTTGTGTAGTAGTATTCATTTGG GCATTGTGGGGTAAAATTGCAAAGCGTTTGTTCTATTTAAAAGTTGGTAA AATTAGTTTTTGGGAATTAGGTAGTTAAGGTTTTAATTTAACGTTGGCCT GGAAGGAATTGGAGAAGATACTAGCAATGATGAAGTAAAGGACACAAACA CCTTTACTGTGGGAGTTGTTATAAGTAAATGGCACGTGTCA (SEQ ID NO: 275) pARBI-671: TTGTATATCCTTTTTAAAGTTATTCTTTTTAGTTAATTGCTCTATGTGTA 2541 RTRAF_1_ TTGAGACTAGGAATCAGAAAGCTTAGATTCTAGTCCCAGGTGTAAGTTGT Exogenous_49 GTAACCCTTGGCAAGTGTCAATCTCTAGGCCTCAGCTTTCTCATCTATAA AATGAGGAAGTTGTCGTATTCTATTTTTTTTCTTAAGATGATACACTTAA ATGTTCCCTTCTGTTGGGTTATATAATTGCATCAAAAGTGTAGTAATGTT ATTAAAAAATTGTTAGAGATCCAAACTAAGGTCTCTTTCAACTCTCCCAT TCTTTTTTCTGTGACTTTATGGTAATAATGAAACTGGTGGTTTTCTTTTC TTCCCCCTCACAGTATCTCAGAGATGTTAACTGTCCTTTCAAGATTCAAG ATCGACAAGAAGCTATTGACTGGCTTCTTGGTTTAGCTGTTAGACTTGAA tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggggggtggggcaggacagcaagggggaggattgggaagacaatagca ggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGGT GCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGGG GGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTAA ACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGGG GGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCAA CGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCTC CTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAGT CGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCGT CTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTCC CTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCTG CTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACAG ATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccacc ATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGGT CGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAGG GCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCACC ACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCTA CGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGACT TCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTTC TTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGGG CGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAGG ACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAAC GTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCAA GATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTACC AGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCAC TACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCGA TCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGCA TGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaaa taaagcaatagcatcacaaatttcacaaataaagcatttttttcactgca ttctagttgtggtttgtccaaactcatcaatgtatcttatTATGGAGATA ATGGTACGTTTTGTGGGGAATGTGTATTTTAAAGAGAGAGGAAAGATGGG AAAGGGAGTGTGAAAATGTAGGGAACTTTGCAGTTTGTTTTGTCTAGTAC TATTTTACCTTTGGTTTATTCTTATCACAAGTTAAAAGCACTTTTATTGT CTTTCATTGGTGTTTATATATTTCTGTTAGAATTTGGAAATGGTGCCCTC TGGAGAAGGCTAATTGACTGTCTTCTCACAGAGTAACACTACTTTGATAA TATGGTCTGCACCTTAGCCTTTCAAATTAAATTGTTTTTAGTGTCCCAGA ATTGATGGGACTTTGAAGTGTTGTTGCAGTAGGTAATTTCTCAAAAGACT GAAACATGTCTAATGCCAATATACATTAATACTCTATAGGCCAGAATATA TTACTTATGTTTACTGTCTTAGCACAGATACTTCTATGGT (SEQ ID NO: 276) pARBI-672: GCACCTGTTATGTAGGAGGAGTAATAAAATGAATGAATGCACTCAAAACA 2541 RTRAF_2_ CTAAACAGTAATTCTGTAATCCAACGGGAGGTACAGCGAATACCAAAAGC Exogenous_50 CTACATATACTATTCTCTGCATGCTATAGCAAGAAAGAAAGGAAAGTGGC TTCCCGGTGGTTTTCTGCCTATTGTACAACCAGGAAGCTGACAATAAAGT TTATTTGAGCGTCGACGTGCGCCGACGTGGCCCCGCCTCCCCAGCCGGAG CCGCGATTGGTGGGCATTTGCCGGCGGCCACCGCTTTTAAGCCACGATTG GCGAAGGCCGCCGTCATTTCGGAGCGACTCAGCGCCTGCCCGCCCTCTCG CCGCGTCGCCGGTGCCTGCGCCTCCCGCTCCACCTCGCTTCTTCTCTCCC GGCCGAGGCCCGGGGGACCAGAGCGAGAAGCGGGGACCATGTTCCGACGC tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggtggggcaggacagcaagggggaggattgggaagacaatagc aggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGG TGCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGG GGGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTA AACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGG GGGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCA ACGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCT CCTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAG TCGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCG TCTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTC CCTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCT GCTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACA GATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccac cATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGG TCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAG GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCT ACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGAC TTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTT CTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGG GCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAG GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAA CGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCA AGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTAC CAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCA CTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCG ATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGC ATGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaa ataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgc attctagttgtggtttgtccaaactcatcaatgtatcttatAAGTTGACG GCTCTCGACTACCACAACCCCGCCGGCTTCAACTGCAAAGGTGAGGCGGC GGCCTCAGCCCGGCCGCGTGTCCCTGACCTGGGCGGAGGTCCCAGCCTCA GTGCCCGCACCCCACCTCCCCGTCGGGACCCTCGGCGGCCTGGTTTCCGC CGGCAGCCTCCGGGCCCCTCTCCTCTGGGTCGCCACGTACCTCGGCTCTT CGCCGCCCCTTCCCGCCTTTAAAGCCCTCTCACCTACTCCTGTCTCGGCA TGTTACTTTCTGCACTTGCTTAACTCCAAGCATCACGTAACTACCTTCTC TGTACATAAAAGGGAGAGCATTCGTCTTTCTCACTCACTATTCAACTCCA TGGTTCCCTGGGTAATTAGGCGATACCTTGAGCACCTGCTAATTATGGGC CAGCGCGGTGCTGGATTCTGAGGAAGGTGCTGAGTAACTTG (SEQ ID NO: 277) pARBI-673: CACTAAACAGTAATTCTGTAATCCAACGGGAGGTACAGCGAATACCAAAA 2541 RTRAF_3_ GCCTACATATACTATTCTCTGCATGCTATAGCAAGAAAGAAAGGAAAGTG Exogenous_51 GCTTCCCGGTGGTTTTCTGCCTATTGTACAACCAGGAAGCTGACAATAAA GTTTATTTGAGCGTCGACGTGCGCCGACGTGGCCCCGCCTCCCCAGCCGG AGCCGCGATTGGTGGGCATTTGCCGGCGGCCACCGCTTTTAAGCCACGAT TGGCGAAGGCCGCCGTCATTTCGGAGCGACTCAGCGCCTGCCCGCCCTCT CGCCGCGTCGCCGGTGCCTGCGCCTCCCGCTCCACCTCGCTTCTTCTCTC CCGGCCGAGGCCCGGGGGACCAGAGCGAGAAGCGGGGACCATGTTCCGAC GCAAGTTGACGGCTCTCGACTACCACAACCCCGCCGGCTTCAACTGCAAA tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggtggggcaggacagcaagggggaggattgggaagacaatagc aggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGG TGCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGG GGGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTA AACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGG GGGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCA ACGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCT CCTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAG TCGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCG TCTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTC CCTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCT GCTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACA GATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccac cATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGG TCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAG GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCT ACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGAC TTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTT CTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGG GCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAG GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAA CGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCA AGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTAC CAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCA CTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCG ATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGC ATGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaa ataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgc attctagttgtggtttgtccaaactcatcaatgtatcttatGGTGAGGCG GCGGCCTCAGCCCGGCCGCGTGTCCCTGACCTGGGCGGAGGTCCCAGCCT CAGTGCCCGCACCCCACCTCCCCGTCGGGACCCTCGGCGGCCTGGTTTCC GCCGGCAGCCTCCGGGCCCCTCTCCTCTGGGTCGCCACGTACCTCGGCTC TTCGCCGCCCCTTCCCGCCTTTAAAGCCCTCTCACCTACTCCTGTCTCGG CATGTTACTTTCTGCACTTGCTTAACTCCAAGCATCACGTAACTACCTTC TCTGTACATAAAAGGGAGAGCATTCGTCTTTCTCACTCACTATTCAACTC CATGGTTCCCTGGGTAATTAGGCGATACCTTGAGCACCTGCTAATTATGG GCCAGCGCGGTGCTGGATTCTGAGGAAGGTGCTGAGTAACTTGAAGACTA GTTCACTGCCTGCCAGGAGCTAAAGGGACGAGGGGTGGAAG (SEQ ID NO: 278) pARBI-674: CCGGGTTAGGCATAGGCCCTCCCGGATCTTCCGCGGTGTAAGGAGAAGGC 2541 SERF2_1_ CAGCGCCCCTGTGATAGGCCCAGAGCCCCCACTCCACAAGCCAGCCCATC Exogenous_52 CCCCACGGAGACCCAAACCGTCCACACACACCTTGCCAGCTGTTTGGGCC CCACGCCGCCCCAAAACACGCCTCCAGCTGGCCCCTTGGGACCTCCCTTC TCTAGTCCGTATTTTGACCTGGCCCGTGGCAGATTCGCCACTCCCCCCTA CCCCAAGCAGCCTGGGCCTCGATGGGCCGTTGTCGGGGCCCGGAGATTGA AGTGGTGTTGGATCCTGCTGCTGGCCGCGCTGGGGTAGAAGGGTCGCCGG TGTGTGGGCAGAGCGGCCCCCGCGTCTCACCTTTAATTTTCTTTCCTTAG GCGGTAACCAGCGTGAGCTCGCCCGCCAGAAGAATATGAAAAAGCAGAGC tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggtggggcaggacagcaagggggaggattgggaagacaatagc aggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGG TGCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGG GGGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTA AACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGG GGGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCA ACGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCT CCTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAG TCGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCG TCTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTC CCTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCT GCTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACA GATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccac cATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGG TCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAG GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCT ACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGAC TTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTT CTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGG GCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAG GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAA CGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCA AGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTAC CAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCA CTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCG ATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGC ATGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaa ataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgc attctagttgtggtttgtccaaactcatcaatgtatcttatGACTCGGTT AAGGGAAAGCGCCGAGATGACGGGCTTTCTGCTGCCGCCCGCAAGCAGAG GTAGCCCCAGGGAGGGGAGGGAAAGGGACGGTGGAGACCTGGGTTAGACC AAGGGTTATAGAAGGAAAGAGAGCTACCTCAGGGCTTGAATGTGGACTAG TCGTGAGGAGCAGAGTGCATTGCTTCCTCTAGGGTTTTATTTCCTCCCCA CCCTCCAAATTGTTAGCTCACAGCCTTACAGGAAAGGACGGGGGCGGGCG CCTGCCCTCAGTCTGATTTCTGAGCGTCCCTGGGTCTGACCTTAAGGGCA AGGGCAGGGAGCTTCACATTTCAAATACAGTTGTGGTTACGGCAGCCCAG TACTTTTGGCCCTCCTTGCTGTTCGGTTCTCCTCCCTTCTCCCAACCTCC TCACTGGTGTTGCTGGGTGTGGTCCTCAATACAGAATAGAG (SEQ ID NO: 279) DARBI-675: AATCGAGCCCTTTGCCCACGGCTACTTCACGGGACCACCCTCCCGGGTTA 2541 SERF2_2_ GGCATAGGCCCTCCCGGATCTTCCGCGGTGTAAGGAGAAGGCCAGCGCCC Exogenous_ CTGTGATAGGCCCAGAGCCCCCACTCCACAAGCCAGCCCATCCCCCACGG 53_54 AGACCCAAACCGTCCACACACACCTTGCCAGCTGTTTGGGCCCCACGCCG CCCCAAAACACGCCTCCAGCTGGCCCCTTGGGACCTCCCTTCTCTAGTCC GTATTTTGACCTGGCCCGTGGCAGATTCGCCACTCCCCCCTACCCCAAGC AGCCTGGGCCTCGATGGGCCGTTGTCGGGGCCCGGAGATTGAAGTGGTGT TGGATCCTGCTGCTGGCCGCGCTGGGGTAGAAGGGTCGCCGGTGTGTGGG CAGAGCGGCCCCCGCGTCTCACCTTTAATTTTCTTTCCTTAGGCGGTAAC tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggtggggcaggacagcaagggggaggattgggaagacaatagc aggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGG TGCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGG GGGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTA AACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGG GGGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCA ACGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCT CCTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAG TCGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCG TCTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTC CCTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCT GCTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACA GATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccac cATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGG TCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAG GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCT ACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGAC TTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTT CTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGG GCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAG GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAA CGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCA AGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTAC CAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCA CTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCG ATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGC ATGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaa ataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgc attctagttgtggtttgtccaaactcatcaatgtatcttatCAGCGCGAG CTTGCCCGCCAGAAGAATATGAAAAAGCAGAGCGACTCGGTTAAGGGAAA GCGCCGAGATGACGGGCTTTCTGCTGCCGCCCGCAAGCAGAGGTAGCCCC AGGGAGGGGAGGGAAAGGGACGGTGGAGACCTGGGTTAGACCAAGGGTTA TAGAAGGAAAGAGAGCTACCTCAGGGCTTGAATGTGGACTAGTCGTGAGG AGCAGAGTGCATTGCTTCCTCTAGGGTTTTATTTCCTCCCCACCCTCCAA ATTGTTAGCTCACAGCCTTACAGGAAAGGACGGGGGCGGGCGCCTGCCCT CAGTCTGATTTCTGAGCGTCCCTGGGTCTGACCTTAAGGGCAAGGGCAGG GAGCTTCACATTTCAAATACAGTTGTGGTTACGGCAGCCCAGTACTTTTG GCCCTCCTTGCTGTTCGGTTCTCCTCCCTTCTCCCAACCTC (SEQ ID NO: 280) pARBI-676: TTCAAAAGGAATCTTTTTGTTTCAACTATTAGGATCTTTTTAAATCAAAT 2541 SLC38A1_1_ ATGTATTTTAATGGTGTACACCAGACCTGAAGAAAAAGATCACAGAAGGA Exogenous_55 ATTTCCCCTTTGTAAGATAGAGGTAGTTAAAAATAAGCCTTATGACCTAA TAGGTTAATTGTAATGCTCTGGTAGCCAACTTGAAAAAGGAGAAATGATG GTTGCATCTCACATTTTAAAATGTTAAGAATTTGTTTTGTAATGAGGATA CCTTAACCCCTGAAGGCCAAAATGACTATTTGTGTGAGTTTGAGAAAGGC ACATAGTAACTTGGGGAACAGTCAATAGAATGACAAAATCTTGACTATTT TAACTTTCTGAACCCTGTCATTTCTTGTGTTCTCAAATTTGATTTTTAAA TAGGCTGCATGGTGTATGAAAAGCTGGGGGAACAAGTCTTTGGCACCACA tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggggggtggggcaggacagcaagggggaggattgggaagacaatagca ggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGGT GCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGGG GGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTAA ACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGGG GGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCAA CGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCTC CTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAGT CGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCGT CTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTCC CTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCTG CTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACAG ATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccacc ATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGGT CGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAGG GCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCACC ACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCTA CGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGACT TCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTTC TTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGGG CGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAGG ACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAAC GTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCAA GATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTACC AGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCAC TACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCGA TCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGCA TGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaaa taaagcaatagcatcacaaatttcacaaataaagcatttttttcactgca ttctagttgtggtttgtccaaactcatcaatgtatcttatGGGAAGTTCG TAATCTTTGGAGCCACCTCTCTACAGAACACTGGAGGTAAAAAGAACATG CTTTTCTTTACATAACTTGAAACTAATTCTGGTGGATGAGCGGCCACATG AATTTTAACATAATTCAAGCAGTTATCATCCTCATTGCTAAAATGGCACA GGGAAAGTAAAGCAGAGACAGCAGTCACTTATTTAAAGCCACAAATCCTG CTAGAGTAGCTGAAGTTGCCTTTGTGTCTTACTGACAGTTGGTCTAAGAA CGGGCTGATAACTTTTTATGTAGCTTGCAACATAAGCCTTCGTATCTTCT TTCTAAAAATGTTCTCATTTTCCTTCAGCAATGCTGAGCTACCTCTTCAT CGTAAAAAATGAACTACCCTCTGCCATAAAGTTTCTAATGGGAAAGGAAG AGACATTTTCGTAAGTTATAGACCATTITTTTATGATATA (SEQ ID NO: 281) pARBI-677: TAAAGATGTCAAGCTTCAAATCATTATCATAAGGTTAAATATATTACGCA 2541 SLC38A1_2_ TGTCTTTACAGCAAGTTTATTTCTGATCGTGAAAGTAGAAGAAGTCTCAC Exogenous_ AAACAGCCATTTGGAAAAAAAGAAGTGTGATGAGTATGTAAGTATCATTA 56_57 TAATGAAGTATGCTTATTTTTTTAATCATATAAATTGGGACAAATCTTTT TAATTCAAGTAGCATAGTACCAAAAGCATAACTACCTGTATTCACCAGGA TATTCCTCATACCTTATGTATTTTCTCTTGAAATCCATTCAAGGAAATAA CTAGATAACTAAGAATGGATTTTCTTGAATTTTACTTCCATATTGGTTCC TGAGAGTGCATTAAAGGCTTTGGCTTGATGAACTTTCTTGGGGATTATTT TGCTTAGTGTCAATGTTTGTTTCTTTGTTCAGATTCCAGGTACAACCTCC tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggtggggcaggacagcaagggggaggattgggaagacaatagc aggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGG TGCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGG GGGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTA AACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGG GGGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCA ACGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCT CCTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAG TCGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCG TCTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTC CCTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCT GCTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACA GATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccac cATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGG TCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAG GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCT ACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGAC TTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTT CTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGG GCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAG GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAA CGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCA AGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTAC CAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCA CTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCG ATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGC ATGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaa ataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgc attctagttgtggtttgtccaaactcatcaatgtatcttatTTAGGCATG TCAGTTTTTAACCTAAGCAACGCCATTATGGGCAGTGGGATTTTGGGACT CGCCTTTGCCCTGGCAAACACTGGAATCCTACTTTTTCTGTGAGTATTGA CGTGCCGGTACTTTCCATTTTAAAACTGAACTTTTTGTATGTTTCTGTTA TTACATAGAAGAAATGTGAAATATTTATAAATAGTTTTTTTATTAACTTG GCTAAGTTACAAGTACATACTTTGATATTTATTGCTTGAGTGATTTTCCA AACTGTACCTAATGCTTACAACAAAGAGGAAGGAGAAAATCTTTTTATTA ATCAAAAATACTGAAATAAGCATTTGTGAAAAGGATTAAAACATTTGAAA ATAACTTTTTTTGGTATTTTTGGAGTCTACCTGTGACTTTAAATCTCAGT TAAAATATTAAGAGGTGTTTAACCCCAGCCAGTCACCACTT (SEQ ID NO: 282) pARBI-678: ATCATTCTTCACACTAGTTCCTTTTCTGACTCTTCAGTCACCTCCAGAAT 2541 SMAD2_1_ TAATATCCTTAGAAGCATCTCTAAAATGAGTTTGATTTACTAAACATAGA Exogenous_58 AGTTGACAAAGTTTTTCAAGTATAACTGCATAAACTCTCTTTTTTAAAAA ATCAGTTTTTCCCAAGTTGTTAGCCTTTCATTGGCATTTGAGTCATTTAT GTGACATATTTATAAGAAACATCTGCTAGTGCTGCTGCATACTTTTATCA GATCCATTAAATAGTACATTTTTGTTCTTGATTCTCACTAAAACTAACTA AATGGCTGTCATTCTTTTCTTTATGCTTTTATTGTACTAATTTAGCCCAT TTGACTGCACttttttttttttttttttAAAGTTCCCTCCTTTCTTTTCC CCTTGCTTCCCAACAGGTCTCTTGATGGTCGTCTCCAGGTATCCCATCGA tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggggggcaggacagcaagggggaggattgggaagacaatagca ggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGGT GCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGGG GGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTAA ACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGGG GGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCAA CGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCTC CTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAGT CGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCGT CTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTCC CTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCTG CTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACAG ATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccacc ATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGGT CGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAGG GCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCACC ACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCTA CGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGACT TCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTTC TTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGGG CGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAGG ACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAAC GTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCAA GATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTACC AGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCAC TACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCGA TCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGCA TGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaaa taaagcaatagcatcacaaatttcacaaataaagcatttttttcactgca ttctagttgtggtttgtccaaactcatcaatgtatcttatAAAGGATTGC CACATGTTATATATTGCCGATTATGGCGCTGGCCTGATCTTCACAGTCAT CATGAACTCAAGGCAATTGAAAACTGCGAATATGCTTTTAATCTTAAAAA GGATGAAGTATGTGTAAACCCTTACCACTATCAGAGAGTTGAGACACCAG GTAGGAATATTGCAAGTTTTTTTCTTGGTTTTGAATTAAATGCCTGAACT TCAGTATATTTAAGTACTCTTGTGACCCAGGAAAATTTTAGTGTAAATTC TAATAAATTACCTTAAATTGTGCATATTATTTGGTGGTAATTAAAAtttt tttatttttaaaaatttttAATGAGAAAACATATCTGTTTCTTGGAATAG CATATGATTTGGCATGAAAGTGCATATCCAGGAAATCCTGTAGATTGGCA GTCTGCTCAGCGCTATGCTGATGTGCTTCTACATGTCCCT (SEQ ID NO: 283) pARBI-679: TTCTATATTAAAAATCAGTTGACATCTGTTTTTAGATAATTTTTTAAAAA 2541 SMAD2_2_ ACTGGATTATAAAATGAAGCACTTAGTAAAGCTGATGGCAGAGGTTTTAG Exogenous_59 ATCTTTTGAATACAGGAAAGTGGTAAGGGCATTATAGGTATGATTCATTG AGCTAGAGGGCGTTGGTGTTCACATTTTAAAAACATAATCATGACTTATC TGGAGTCACATGTTCCATTTTATTGGCTTCCTTCATATAGAAAATATAGT AACCAGCACTACATGCCTGTGAAAAAGTGAAGCAATTGTATATTTTCTGG GTGAAGGAAGTATTCTGTACATTCTCCATGTTTTACATCATGGTATTTTG AATAACCATGCTTCCATGTTCACATCAATTTTTTGTTTTTCAATTTATGC ACAGCACTTGCTCTGAAATTTGGGGACTGAGTACACCAAATACGATAGAT tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggtggggcaggacagcaagggggaggattgggaagacaatagc aggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGG TGCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGG GGGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTA AACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGG GGGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCA ACGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCT CCTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAG TCGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCG TCTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTC CCTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCT GCTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACA GATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccac cATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGG TCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAG GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCT ACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGAC TTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTT CTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGG GCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAG GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAA CGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCA AGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTAC CAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCA CTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCG ATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGC ATGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaa ataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgc attctagttgtggtttgtccaaactcatcaatgtatcttatCAGTGGGAT ACAACAGGCCTTTACAGCTTCTCTGAACAAACCAGGTGAATATTCTGCCC TCTGTCGAATCTTAGAGATCTTGTGGGAGGGGGGTATATTTTGAAAGACC TATATGGGGTTGCTTAGTATAATTTTGCCTAGGATGTTTCCTTAATGTAA AATAAGGCATAGCATTTAAAATATCTGCTTGCCAGTATAAAAAATATATT AAATGTTTCAGCTGATGTTTTAATCAATGCAATTCTAGTTTGAATCTTCT TTAAATTACTGTATCCCCTAAAGAATAATAATTTTGATAACTATATATTT ATTTTAGCTTGAGATCAGATTATAATCTGTTGTTGGCCTAATTTTTAAGT AGATACATGATGAGTTTTGCTAAATTTCTTGTTATCAGAATCTCATCTTT ATACTAATAAATACATACTTAATGAACCCCTTATATCACAA (SEQ ID NO: 284) pARBI-680: TCAATGTATTTTTGTATAGTCTTTTGAGATTAGAGTGAAGTTCTAAAAAG 2541 SMAD2_3_ TAAGAGAACCTTTTGTGAAAATATTTTAATTGACACTACTAAGAGTTTAA Exogenous_60 AATAGTATTGGTGGTGAAAAATCGTTAAAAGTATCTGATTTTACTTGCAA AATTACTGTATTTTCCCACAAGAGGAGTCCTTACAACATTCTTGTTTTTA GAAGGGTTTGTTTCCATCATTCATTTAAATTCATAAAGATAACGTTTTCA TGGGTGGAGAAGTCTATTGGGAAAGTCATAATTCGAATTTCTATCTTGCT TTGCAGTTTGCTTTCTATGATAAGTTGAAATTATTACTTGATGTTCAAGG TAGTCTCTACATCATCCTTTCAATATTTCTGCTAGGTTCGATACAAGAGG CTGTTTTCCTAGCGTGGCTTGCTGCCTTTGGTAAGAACATGTCGTCCATC tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggggggcaggacagcaagggggaggattgggaagacaatagca ggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGGT GCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGGG GGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTAA ACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGGG GGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCAA CGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCTC CTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAGT CGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCGT CTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTCC CTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCTG CTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACAG ATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccacc ATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGGT CGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAGG GCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCACC ACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCTA CGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGACT TCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTTC TTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGGG CGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAGG ACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAAC GTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCAA GATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTACC AGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCAC TACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCGA TCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGCA TGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaaa taaagcaatagcatcacaaatttcacaaataaagcatttttttcactgca ttctagttgtggtttgtccaaactcatcaatgtatcttatTTGCCATTCA CGCCGCCAGTIGTGAAGAGACTGCTGGGATGGAAGAAGTCAGCTGGTGGG TCTGGAGGAGCAGGCGGAGGAGAGCAGAATGGGCAGGAAGAAAAGTGGTG TGAGAAAGCAGTGAAAAGTCTGGTGAAGAAGCTAAAGAAAACAGGACGAT TAGATGAGCTTGAGAAAGCCATCACCACTCAAAACTGTAATACTAAATGT GTTACCATACCAAGGTAAGTTTTGTTAGATCCCAGGTTTGATCAAATTAT GTCAAGGAATCTGAAGGAAAGTTACTGAATTTGTGTTCCTTTCAAGTTGC CTGTAAAAAGTGATGATTGAAATATGACTGTTTTTAACCTTGTATAAATT GTTTTTGCTAGCTGACTTGTTTTATAAATTATTTTCTTGAATAGTGAGGT TTAATCAAGCTAAATAAGATACTTAGATTTATTATCTTCA (SEQ ID NO: 285) pARBI-681: GGGCTCCCGCAGCACAGCCTTTCTCCGGCCCTAGCCCAAATCGCCCAGAC 2541 SOCS1_1_ CAGGCGCGGATCCCAGCCTGGCCAGCAGGCGGCGGGCGCGGGGCGGCGAG Exogenous_61 CCGGGGCCGGACGGCTGGAGCCAGAACCGGCTGCTCTCCACGCCCCCCTC TCGGTGCTGCCCGGAGGCCGGACTCCGCCTCCACCGAGCCCCCACCCGCC GGGAAGAGCTCCGCGGAGTACAGAGCCCATTTTCTAGCTGTGTCCACTGA GGCTGAACGGATCCGCGCGGACTTGGTGCTCCGTGCTCGCCCCCTAGGGC CGGGTCCGCCGGGAGCGCCGCCCTCCGGAGTTGTCCGGCCGGCGCACACC TGCCCGGCCCCGCAGCGCCCCAGCTCACCTCTTTGTCTCTCCCGCAGCGC ACCCCCGGACGCTATGGCCCACCCCTCCGGCTGGCCCCTTCTGTAGGATG tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggtggggcaggacagcaagggggaggattgggaagacaatagc aggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGG TGCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGG GGGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTA AACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGG GGGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCA ACGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCT CCTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAG TCGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCG TCTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTC CCTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCT GCTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACA GATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccac cATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGG TCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAG GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCT ACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGAC TTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTT CTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGG GCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAG GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAA CGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCA AGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTAC CAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCA CTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCG ATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGC ATGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaa ataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgc attctagttgtggtttgtccaaactcatcaatgtatcttatGTAGCACAC AACCAGGTGGCAGCCGACAATGCAGTCTCCACAGCAGCAGAGCCCCGACG GCGGCCAGAACCTTCCTCCTCTTCCTCCTCCTcgcccgcggcccccgcgc gcccgcggccgtgccccgcggtcccggccccggCCCCCGGCGACACGCAC TTCCGCACATTCCGTTCGCACGCCGATTACCGGCGCATCACGCGCGCCAG CGCGCTCCTGGACGCCTGCGGATTCTACTGGGGGCCCCTGAGCGTGCACG GGGCGCACGAGCGGCTGCGCGCCGAGCCCGTGGGCACCTTCCTGGTGCGC GACAGCCGCCAGCGGAACTGCTTTTTCGCCCTTAGCGTGAAGATGGCCTC GGGACCCACGAGCATCCGCGTGCACTTTCAGGCCGGCCGCTTTCACCTGG ATGGCAGCCGCGAGAGCTTCGACTGCCTCTTCGAGCTGCTG (SEQ ID NO: 286) pARBI-682: GGATCCCAGCCTGGCCAGCAGGCGGCGGGCGCGGGGCGGCGAGCCGGGGC 2541 SOCS1_2_ CGGACGGCTGGAGCCAGAACCGGCTGCTCTCCACGCCCCCCTCTCGGTGC Exogenous_62 TGCCCGGAGGCCGGACTCCGCCTCCACCGAGCCCCCACCCGCCGGGAAGA GCTCCGCGGAGTACAGAGCCCATTTTCTAGCTGTGTCCACTGAGGCTGAA CGGATCCGCGCGGACTTGGTGCTCCGTGCTCGCCCCCTAGGGCCGGGTCC GCCGGGAGCGCCGCCCTCCGGAGTTGTCCGGCCGGCGCACACCTGCCCGG CCCCGCAGCGCCCCAGCTCACCTCTTTGTCTCTCCCGCAGCGCACCCCCG GACGCTATGGCCCACCCCTCCGGCTGGCCCCTTCTGTAGGATGGTAGCAC ACAACCAGGTGGCAGCCGACAATGCAGTCTCCACAGCAGCAGAGCCCCGA tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggtggggcaggacagcaagggggaggattgggaagacaatagc aggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGG TGCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGG GGGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTA AACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGG GGGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCA ACGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCT CCTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAG TCGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCG TCTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTC CCTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCT GCTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACA GATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccac cATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGG TCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAG GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCT ACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGAC TTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTT CTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGG GCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAG GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAA CGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCA AGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTAC CAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCA CTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCG ATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGC ATGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaa ataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgc attctagttgtggtttgtccaaactcatcaatgtatcttatCGGCGGCCA GAACCTTCCTCCTCTTCCTCCTCCTcgcccgcggcccccgcgcgcccgcg gccgtgccccgcggtcccggccccggCCCCCGGCGACACGCACTTCCGCA CATTCCGTTCGCACGCCGATTACCGGCGCATCACGCGCGCCAGCGCGCTC CTGGACGCCTGCGGATTCTACTGGGGGCCCCTGAGCGTGCACGGGGCGCA CGAGCGGCTGCGCGCCGAGCCCGTGGGCACCTTCCTGGTGCGCGACAGCC GCCAGCGGAACTGCTTTTTCGCCCTTAGCGTGAAGATGGCCTCGGGACCC ACGAGCATCCGCGTGCACTTTCAGGCCGGCCGCTTTCACCTGGATGGCAG CCGCGAGAGCTTCGACTGCCTCTTCGAGCTGCTGGAGCACTACGTGGCGG CGCCGCGCCGCATGCTGGGGGCCCCGCTGCGCCAGCGCCGC (SEQ ID NO: 287) pARBI-683: CGGACTCCGCCTCCACCGAGCCCCCACCCGCCGGGAAGAGCTCCGCGGAG 2541 SOCS1_3_ TACAGAGCCCATTTTCTAGCTGTGTCCACTGAGGCTGAACGGATCCGCGC Exogenous_63 GGACTTGGTGCTCCGTGCTCGCCCCCTAGGGCCGGGTCCGCCGGGAGCGC CGCCCTCCGGAGTTGTCCGGCCGGCGCACACCTGCCCGGCCCCGCAGCGC CCCAGCTCACCTCTTTGTCTCTCCCGCAGCGCACCCCCGGACGCTATGGC CCACCCCTCCGGCTGGCCCCTTCTGTAGGATGGTAGCACACAACCAGGTG GCAGCCGACAATGCAGTCTCCACAGCAGCAGAGCCCCGACGGCGGCCAGA ACCTTCCTCCTCTTCCTCCTCCTcgcccgcggcccccgcgcgcccgcggc cgtgccccgcggtcccggccccggCCCCCGGCGACACGCACTTCCGCACA tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggtggggcaggacagcaagggggaggattgggaagacaatagc aggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGG TGCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGG GGGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTA AACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGG GGGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCA ACGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCT CCTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAG TCGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCG TCTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTC CCTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCT GCTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACA GATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccac cATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGG TCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAG GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCT ACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGAC TTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTT CTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGG GCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAG GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAA CGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCA AGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTAC CAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCA CTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCG ATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGC ATGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaa ataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgc attctagttgtggtttgtccaaactcatcaatgtatcttatTTCCGTTCG CACGCCGATTACCGGCGCATCACGCGCGCCAGCGCGCTCCTGGACGCCTG CGGATTCTACTGGGGGCCCCTGAGCGTGCACGGGGCGCACGAGCGGCTGC GCGCCGAGCCCGTGGGCACCTTCCTGGTGCGCGACAGCCGCCAGCGGAAC TGCTTTTTCGCCCTTAGCGTGAAGATGGCCTCGGGACCCACGAGCATCCG CGTGCACTTTCAGGCCGGCCGCTTTCACCTGGATGGCAGCCGCGAGAGCT TCGACTGCCTCTTCGAGCTGCTGGAGCACTACGTGGCGGCGCCGCGCCGC ATGCTGGGGGCCCCGCTGCGCCAGCGCCGCGTGCGGCCGCTGCAGGAGCT GTGCCGCCAGCGCATCGTGGCCACCGTGGGCCGCGAGAACCTGGCTCGCA TCCCCCTCAACCCCGTCCTCCGCGACTACCTGAGCTCCTTC (SEQ ID NO: 288) pARBI-684: GGCAGACAAACAGTTAAAACAGAGGAGTGGGGTAAGTGATGACCTAACCC 2541 SRP14_1_ AGCCCCGCTGCTCTAGTTTTGTTAAACACCTCATCTGCTATCACGGTCTG Exogenous_64 GCGAAACCCTGGAGAAACTATTATTTCCACGACGGAAACATGTAATATCG AAGCACGTTAAATTCCACTCAAAGTGGCGGCGTCTGGGATCCCTGACAAA GAGCCCTGGGCTGCTTGGGGCCCATTGTTACCAGAAGCAACCTAGGCGGC TCAGTGCTGGGGACTGAGCCAGCTACAATCCCTACTATTTTCCGGGCCCG AAGCCCCGAATGTGCTCAGTACAGGGTGGGGAAAGGTAGAGGAAGGCGGC GGTCCGCGGCAGACAGACTCCGGTGGCTCCCAGACACCGGGAACCCAGGG AGCATCGCCCGGCTCCCCTCCCGCTGCTTACACTTCTTCAAGGTGATATA tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggtggggcaggacagcaagggggaggattgggaagacaatagc aggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGG TGCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGG GGGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTA AACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGG GGGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCA ACGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCT CCTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAG TCGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCG TCTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTC CCTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCT GCTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACA GATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccac cATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGG TCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAG GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCT ACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGAC TTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTT CTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGG GCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAG GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAA CGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCA AGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTAC CAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCA CTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCG ATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGC ATGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaa ataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgc attctagttgtggtttgtccaaactcatcaatgtatcttatGACGCTGCC CGACGTCCGGCACTTCTGGAAAAGTCTGGTCAGCTCCGTCAGGAACTGGA AAACAACAGGCCCAGCCCCGTTAGCCAGCCCCAAATCCTCGTCCTGCCGC GTCAAGGCCCTGGTCCTCCTGCAGGAGGCACAGGTCTCGAGTAACGCCTG AGCCGCCCCCTTCCCTCGGCCGGCCAGGCCTAGCCATACCTGCTCGCTCT CCAACAACACCATCGCGGCGACGCTGGCTCGACTCCCTCCGCTTAAGCCC CTAGCAGTGAGAGCCGGAAGTTCGGCCTAGGCTGGGCGGGACTTCCGCTA CTAGACTTCCATTGTCTTCCACCAATCTCTTCTCTTCCACCAATCCCGGC CTGCGCCCTCCCCCCTCCCCGCCCGCCTAGCGCCCGCGCCCTGGGACGTC CGGGGGCCTGTGACCCGGAGGCGCTGGGGCTGTCCTGGGTC (SEQ ID NO: 289) pARBI-685: GTCTGTCTGCCGCGGACCGCCGCCTTCCTCTACCTTTCCCCACCCTGTAC 2541 SRP14_2_ TGAGCACATTCGGGGCTTCGGGCCCGGAAAATAGTAGGGATTGTAGCTGG Exogenous_ CTCAGTCCCCAGCACTGAGCCGCCTAGGTTGCTTCTGGTAACAATGGGCC 65_66 CCAAGCAGCCCAGGGCTCTTTGTCAGGGATCCCAGACGCCGCCACTTTGA GTGGAATTTAACGTGCTTCGATATTACATGTTTCCGTCGTGGAAATAATA GTTTCTCCAGGGTTTCGCCAGACCGTGATAGCAGATGAGGTGTTTAACAA AACTAGAGCAGCGGGGCTGGGTTAGGTCATCACTTACCCCACTCCTCTGT TTTAACTGTTTGTCTGCCGCTCAGCCACGTACGCGGCCGTGTTCACCAGG ATGCATTTTTTCCTTCAGATGACGGTCGAACCAAACCCATTCCAAAGAAG tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggtggggcaggacagcaagggggaggattgggaagacaatagc aggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGG TGCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGG GGGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTA AACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGG GGGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCA ACGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCT CCTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAG TCGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCG TCTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTC CCTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCT GCTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACA GATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccac cATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGG TCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAG GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCT ACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGAC TTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTT CTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGG GCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAG GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAA CGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCA AGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTAC CAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCA CTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCG ATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGC ATGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaa ataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgc attctagttgtggtttgtccaaactcatcaatgtatcttatGGTACTGTG GAGGGCTTTGAGCCCGCAGACAACAAGTGTCTGTTAAGAGCTACCGATGG GAAGAAGAAGATCAGCACTGTGGTGAGCTTAATTTCTAAGGCTGTCTTTT GAAATGTAAAGACTTGAACTTAACAGAGGATGGGGCGTTTCTGAACCAGG CTTTTATTTGTTTTTCCCTTTTGCCCTGTGTGGCTATTTTTGAGACCAGG ACCTTCCTATACTTTAGTAGTGGAAACCTCAAGAATAAAATAAGAAGGTA GAGGTCAGACAGTCGTTAGTTCTGCTAAAGCTCTTGTGGAAATGAAAGTA GCATATTGGTCCTTATTCGTAGTACTGTAAGGAGCAGCTGGCATAAATAT TTGATTCCTTAGCCCCTTACTGTGCTGGGCCTTCAATAAGCAGTTTCTTG GTACCAGGTGATGCTATTATTTCCTTATCTTTGTGGTTCAC (SEQ ID NO: 290) pARBI-686: CCCACGTCAGTGAGTGCTCACAAGTCTGTGAGGTAGGCGGCGCCACAGAG 2541 SRSF9_1_ AGAAACTGAGGCGCGGGAGCCCAAGCCACTTGTCCAGCGCCTCAGCCGAG Exogenous_67 CGCGAACCGCGCTCGGGGATGGCATCCACACGGCCCGGCCCCAGGCTCTC CCTGTCAGCGCCCGAAGGCCCTTCGCACCTCCAGGGGGCGCCGGCCTGCG CGCACGCGCAATGGTCGCAGCCGCGTTCTCTTTAAGAGGACTCCTTTTGC CTCCGCCGACCCCTTCGCTTCCGCTCCGCGTTCCCACAATGCAGTGCGGC TGAGCGCCTCGGAGCCCGCGGGGACGCTGCGGGGGGACCCGTGCTGAggc ggcggcggcgacgtgggctgcggcgggcccgcggcgtcgggcggtgcgga tgtcgggctgggcggacgagcgcggcggcgAGGGCGACGGGCGCATCTAC tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggggggcaggacagcaagggggaggattgggaagacaatagca ggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGGT GCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGGG GGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTAA ACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGGG GGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCAA CGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCTC CTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAGT CGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCGT CTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTCC CTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCTG CTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACAG ATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccacc ATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGGT CGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAGG GCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCACC ACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCTA CGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGACT TCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTTC TTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGGG CGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAGG ACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAAC GTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCAA GATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTACC AGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCAC TACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCGA TCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGCA TGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaaa taaagcaatagcatcacaaatttcacaaataaagcatttttttcactgca ttctagttgtggtttgtccaaactcatcaatgtatcttatGTGGGGAACC TTCCGACCGACGTGCGCGAGAAGGACTTGGAGGACCTGTTCTACAAGTAC GGCCGCATCCGCGAGATCGAGCTCAAGAACCGGCACGGCCTCGTGCCCTT CGCCTTCGTGCGCTTCGAGGACCCCCGGTGAGGcccccgcgcccctgccc tctcctcctcggtgcctgaggcccccgccttccctgctgtcccctccccc agggctcccctccccccggcctcccctccccccgcctccgcgcagacccc tcaCGGCGCCCCCTCACGGGTGGAGGATGAGGCAGCCTCTCCTCGCAGGC CCGGGCCGTCCTTCGCGCCGTCGTCACTTCCTTTATTTTTATTATTCCAA TATTTTACTTAGAAACCCAAAAGCTGAGCCTTTGGAGGGCCCAAGCCCGC CTGCACCGGCCTCGGGGGGTCCAAATGAGCCTTGTCCGCC (SEQ ID NO: 291) pARBI-687: TGTGGTGTGCTTTTCGAGAAAATGCTCACGAAGATTAGAAAAACACTGAT 2541 SRSF9_2_ GGTTTATTGGCAAAATCAATAATCTGTCAGCAGTTGATTCttttttttCT Exogenous_68 CTGAATTAGCCCTCTTATGAGTAATCTGTTTGCTCATTCATTATAATTTC ATACCTCTAAAACTGCGTGTGACAGCTGTAAAGGTTAATTCCAGTATCAT GAAACGTCTCCAAACCAAAGCAGAAGTGCTTCAGGATCCTGATTTGTGTG TTTTTTTCTTCACTCTAGGTTTCCCTTTTATGCTTACTACTCATGCCCCT CACTTGGAAAGTCACTTGGCCTCCTGAACAGCACTAACTCCAAACGtttt ttttgttgttgttgttttttttAGAGATGCAGAGGATGCTATTTATGGAA GAAATGGTTATGATTATGGCCAGTGTCGGCTTCGTGTGGAGTTCCCCAGG tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggggggtggggcaggacagcaagggggaggattgggaagacaatagca ggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGGT GCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGGG GGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTAA ACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGGG GGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCAA CGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCTC CTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAGT CGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCGT CTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTCC CTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCTG CTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACAG ATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccacc ATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGGT CGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAGG GCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCACC ACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCTA CGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGACT TCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTTC TTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGGG CGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAGG ACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAAC GTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCAA GATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTACC AGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCAC TACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCGA TCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGCA TGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaaa taaagcaatagcatcacaaatttcacaaataaagcatttttttcactgca ttctagttgtggtttgtccaaactcatcaatgtatcttatACTTATGGAG GTCGGGGTGGGTGGCCCCGTGGTGGGAGGAATGGGCCTCCTACAAGAAGA TCTGATTTCCGAGTTCTTGTTTCAGGTATGTTCCTTTCAAACAGaatgag atgatacatgtaaaatacttaacacagagtctgtcttccaagaaatgata gctgttattcttCAGTGCATGGGACACGGGGGCTTTCTTTTCAATAGCCT GTGTGAAGCCTTGCCCTGGATTGCCAATGAGGAAAGTATCCTGCAAATGA AATTGCGCTGGGAGTGCAGCCTTGGAAGAACATAACCATATTTCTTGTAA AGGAGTTTTCTAGTGGTGAGAAGGAAAGATGATGGGAAAACTTGAGCTAC AATTCTAAAGATGCTTCTTTTGGAATATACTTGGCATCAGACATGGTAGA AAGGCATTCAAGGAGCCAGATTTGAACAACTTACCCAGCC (SEQ ID NO: 292) pARBI-688: GATGGCATCCACACGGCCCGGCCCCAGGCTCTCCCTGTCAGCGCCCGAAG 2541 SRSF9_3_ GCCCTTCGCACCTCCAGGGGGCGCCGGCCTGCGCGCACGCGCAATGGTCG Exogenous_69 CAGCCGCGTTCTCTTTAAGAGGACTCCTTTTGCCTCCGCCGACCCCTTCG CTTCCGCTCCGCGTTCCCACAATGCAGTGCGGCTGAGCGCCTCGGAGCCC GCGGGGACGCTGCGGGGGGACCCGTGCTGAggcggcggcggcgacgtggg ctgcggcgggcccgcggcgtcgggcggtgcggatgtcgggctgggcggac gagcgcggcggcgAGGGCGACGGGCGCATCTACGTGGGGAACCTTCCGAC CGACGTGCGCGAGAAGGACTTGGAGGACCTGTTCTACAAGTACGGCCGCA TCCGCGAGATCGAGCTCAAGAACCGGCACGGCCTCGTGCCCTTCGCCTTC tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggtggggcaggacagcaagggggaggattgggaagacaatagc aggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGG TGCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGG GGGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTA AACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGG GGGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCA ACGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCT CCTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAG TCGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCG TCTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTC CCTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCT GCTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACA GATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccac cATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGG TCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAG GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCT ACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGAC TTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTT CTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGG GCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAG GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAA CGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCA AGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTAC CAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCA CTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCG ATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGC ATGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaa ataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgc attctagttgtggtttgtccaaactcatcaatgtatcttatGTGCGCTTC GAGGACCCCCGGTGAGGcccccgcgcccctgccctctcctcctcggtgcc tgaggcccccgccttccctgctgtcccctcccccagggctcccctccccc cggcctcccctccccccgcctccgcgcagacccctcaCGGCGCCCCCTCA CGGGTGGAGGATGAGGCAGCCTCTCCTCGCAGGCCCGGGCCGTCCTTCGC GCCGTCGTCACTTCCTTTATTTTTATTATTCCAATATTTTACTTAGAAAC CCAAAAGCTGAGCCTTTGGAGGGCCCAAGCCCGCCTGCACCGGCCTCGGG GGGTCCAAATGAGCCTTGTCCGCCTCCTGCCTGGGGCAGCACCGTAGGGG GAAGCGGCCGCGGGGCAGCGCGGGGGTCGCCGTTCGCCCTTCCCGCTCGC CTCTCCCCCGGCCCGTGCTCGCCGTGGCTGGAGAGCAAGCT (SEQ ID NO: 293) pARBI-689: GTAACTGACTAGCCTTAGTAGACTGTGGTTTCCAGGTTTATTCAGAAGTA 2541 SUB1_1_ GCAAGATCCCTCCAtttttttttCTACCAAAGAAATCGTATGTGGGATCC Exogenous_70 CAAACCACAAAATAACCGTTCCTGTGGTTAATACTACTATAATGCCTGAA GTGTCTTTTGGGATCCTGAGAACAGAGTTTGAAAACATTACTAGACAGAA GGATTGGTTAGATTCATAGTTTTGTTGTTGAGTGAAACTTGCTTATGTAT ATATTTATGATATTTTGGATGTAGTCTTTTGATTGTTTAAATCTTAAAAA GTAATGGGATCTTTTGACACTGGGGTATGTTTTATTTTTATGTGTGCAAA TTTTAACCATATTCTTTTCTAGTTAAAGAGGAAAAAGCAAGTTGCTCCAG AAAAACCTGTAAAGAAACAAAAGACAGGTGAGACTTCGAGAGCCCTGTCA tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggggggcaggacagcaagggggaggattgggaagacaatagca ggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGGT GCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGGG GGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTAA ACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGGG GGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCAA CGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCTC CTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAGT CGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCGT CTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTCC CTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCTG CTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACAG ATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccacc ATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGGT CGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAGG GCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCACC ACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCTA CGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGACT TCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTTC TTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGGG CGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAGG ACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAAC GTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCAA GATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTACC AGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCAC TACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCGA TCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGCA TGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaaa taaagcaatagcatcacaaatttcacaaataaagcatttttttcactgca ttctagttgtggtttgtccaaactcatcaatgtatcttatTCTTCTAAAC AGAGCAGCAGCAGCAGAGATGATAACATGTTTCAGGTAAAGTTGGCTAtt ttttttttttttttttttgacatggagtcatgctctgtcacccaggctgg agtgcagtggcgccatctcggctcactgcaacctcagcctcctgagttca agcagttctctgcctcagcctcccgagtagctaggattacaggcatccgc caccagacctggctaatttttgtatttttagtagagatggggtttcacca tcttggccaggctggtcttgaactcctgaccttgtgatccaactgcctca gcctccaaaagtgctgggtttacaggtgtgagccaccatgccttgccAAA GTTGGCTGTTTCTTTAGATTCAGAGGAATTATTATCTGGCTTGATCTGAA GAATGTTAAAAGTACTATGATCTGATAATTGCCTAATATG (SEQ ID NO: 294) pARBI-690: ATAAATAAACATATCCTGGAAATGGAATAAGTTGGTTATATTCTTTTTAA 2541 SUB1_2_ TTAGTATATCTGCTTCGTAAAATAAGTAACTGACTAGCCTTAGTAGACTG Exogenous_71 TGGTTTCCAGGTTTATTCAGAAGTAGCAAGATCCCTCCATTTTTTTTTCT ACCAAAGAAATCGTATGTGGGATCCCAAACCACAAAATAACCGTTCCTGT GGTTAATACTACTATAATGCCTGAAGTGTCTTTTGGGATCCTGAGAACAG AGTTTGAAAACATTACTAGACAGAAGGATTGGTTAGATTCATAGTTTTGT TGTTGAGTGAAACTTGCTTATGTATATATTTATGATATTTTGGATGTAGT CTTTTGATTGTTTAAATCTTAAAAAGTAATGGGATCTTTTGACACTGGGG TATGTTTTATTTTTATGTGTGCAAATTTTAACCATATTCTTTTCTAGTTA tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggtggggcaggacagcaagggggaggattgggaagacaatagc aggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGG TGCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGG GGGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTA AACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGG GGGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCA ACGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCT CCTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAG TCGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCG TCTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTC CCTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCT GCTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACA GATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccac cATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGG TCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAG GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCT ACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGAC TTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTT CTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGG GCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAG GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAA CGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCA AGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTAC CAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCA CTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCG ATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGC ATGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaa ataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgc attctagttgtggtttgtccaaactcatcaatgtatcttatAAGAGGAAA AAGCAAGTTGCTCCAGAAAAACCTGTAAAGAAACAAAAGACAGGTGAGAC TTCGAGAGCCCTGTCATCTTCTAAACAGAGCAGCAGCAGCAGAGATGATA ACATGTTTCAGGTAAAGTTGGCTATTTTTTTTTTTTTTTTTTTTGACATG GAGTCATGCTCTGTCACCCAGGCTGGAGTGCAGTGGCGCCATCTCGGCTC ACTGCAACCTCAGCCTCCTGAGTTCAAGCAGTTCTCTGCCTCAGCCTCCC GAGTAGCTAGGATTACAGGCATCCGCCACCAGACCTGGCTAATTTTTGTA TTTTTAGTAGAGATGGGGTTTCACCATCTTGGCCAGGCTGGTCTTGAACT CCTGACCTTGTGATCCAACTGCCTCAGCCTCCAAAAGTGCTGGGTTTACA GGTGTGAGCCACCATGCCTTGCCAAAGTTGGCTGTTTCTTT (SEQ ID NO: 295) pARBI-691: AATTAGTATATCTGCTTCGTAAAATAAGTAACTGACTAGCCTTAGTAGAC 2541 SUB1_3_ TGTGGTTTCCAGGTTTATTCAGAAGTAGCAAGATCCCTCCATTTTTTTTT Exogenous_72 CTACCAAAGAAATCGTATGTGGGATCCCAAACCACAAAATAACCGTTCCT GTGGTTAATACTACTATAATGCCTGAAGTGTCTTTTGGGATCCTGAGAAC AGAGTTTGAAAACATTACTAGACAGAAGGATTGGTTAGATTCATAGTTTT GTTGTTGAGTGAAACTTGCTTATGTATATATTTATGATATTTTGGATGTA GTCTTTTGATTGTTTAAATCTTAAAAAGTAATGGGATCTTTTGACACTGG GGTATGTTTTATTTTTATGTGTGCAAATTTTAACCATATTCTTTTCTAGT TAAAGAGGAAAAAGCAAGTTGCTCCAGAAAAACCTGTAAAGAAACAAAAG tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggtggggcaggacagcaagggggaggattgggaagacaatagc aggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGG TGCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGG GGGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTA AACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGG GGGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCA ACGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCT CCTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAG TCGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCG TCTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTC CCTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCT GCTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACA GATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccac cATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGG TCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAG GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCT ACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGAC TTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTT CTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGG GCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAG GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAA CGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCA AGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTAC CAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCA CTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCG ATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGC ATGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaa ataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgc attctagttgtggtttgtccaaactcatcaatgtatcttatACAGGTGAG ACTTCGAGAGCCCTGTCATCTTCTAAACAGAGCAGCAGCAGCAGAGATGA TAACATGTTTCAGGTAAAGTTGGCTATTTTTTTTTTTTTTTTTTTTGACA TGGAGTCATGCTCTGTCACCCAGGCTGGAGTGCAGTGGCGCCATCTCGGC TCACTGCAACCTCAGCCTCCTGAGTTCAAGCAGTTCTCTGCCTCAGCCTC CCGAGTAGCTAGGATTACAGGCATCCGCCACCAGACCTGGCTAATTTTTG TATTTTTAGTAGAGATGGGGTTTCACCATCTTGGCCAGGCTGGTCTTGAA CTCCTGACCTTGTGATCCAACTGCCTCAGCCTCCAAAAGTGCTGGGTTTA CAGGTGTGAGCCACCATGCCTTGCCAAAGTTGGCTGTTTCTTTAGATTCA GAGGAATTATTATCTGGCTTGATCTGAAGAATGTTAAAAGT (SEQ ID NO: 296) pARBI-692: CACATTTTAATTTTTGTTTCCATGCTCTTTAGAATTCAACTAGAGGGCAG 2541 TET2_1_ CCTTGTGGATGGCCCCGAAGCAAGCCTGATGGAACAGGATAGAACCAACC Exogenous_73 ATGTTGAGGGCAACAGACTAAGTCCATTCCTGATACCATCACCTCCCATT TGCCAGACAGAACCTCTGGCTACAAAGCTCCAGAATGGAAGCCCACTGCC TGAGAGAGCTCATCCAGAAGTAAATGGAGACACCAAGTGGCACTCTTTCA AAAGTTATTATGGAATACCCTGTATGAAGGGAAGCCAGAATAGTCGTGTG AGTCCTGACTTTACACAAGAAAGTAGAGGGTATTCCAAGTGTTTGCAAAA TGGAGGAATAAAACGCACAGTTAGTGAACCTTCTCTCTCTGGGCTCCTTC AGATCAAGAAATTGAAACAAGACCAAAAGGCTAATGGAGAAAGACGTAAC tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggtggggcaggacagcaagggggaggattgggaagacaatagc aggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGG TGCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGG GGGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTA AACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGG GGGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCA ACGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCT CCTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAG TCGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCG TCTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTC CCTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCT GCTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACA GATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccac cATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGG TCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAG GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCT ACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGAC TTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTT CTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGG GCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAG GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAA CGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCA AGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTAC CAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCA CTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCG ATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGC ATGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaa ataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgc attctagttgtggtttgtccaaactcatcaatgtatcttatTTCGGGGTA AGCCAAGAAAGAAATCCAGGTGAAAGCAGTCAACCAAATGTCTCCGATTT GAGTGATAAGAAAGAATCTGTGAGTTCTGTAGCCCAAGAAAATGCAGTTA AAGATTTCACCAGTTTTTCAACACATAACTGCAGTGGGCCTGAAAATCCA GAGCTTCAGATTCTGAATGAGCAGGAGGGGAAAAGTGCTAATTACCATGA CAAGAACATTGTATTACTTAAAAACAAGGCAGTGCTAATGCCTAATGGTG CTACAGTTTCTGCCTCTTCCGTGGAACACACACATGGTGAACTCCTGGAA AAAACACTGTCTCAATATTATCCAGATTGTGTTTCCATTGCGGTGCAGAA AACCACATCTCACATAAATGCCATTAACAGTCAGGCTACTAATGAGTTGT CCTGTGAGATCACTCACCCATCGCATACCTCAGGGCAGATC (SEQ ID NO: 297) pARBI-693: AATGGAGAAAGACGTAACTTCGGGGTAAGCCAAGAAAGAAATCCAGGTGA 2541 TET2_2_ AAGCAGTCAACCAAATGTCTCCGATTTGAGTGATAAGAAAGAATCTGTGA Exogenous_74 GTTCTGTAGCCCAAGAAAATGCAGTTAAAGATTTCACCAGTTTTTCAACA CATAACTGCAGTGGGCCTGAAAATCCAGAGCTTCAGATTCTGAATGAGCA GGAGGGGAAAAGTGCTAATTACCATGACAAGAACATTGTATTACTTAAAA ACAAGGCAGTGCTAATGCCTAATGGTGCTACAGTTTCTGCCTCTTCCGTG GAACACACACATGGTGAACTCCTGGAAAAAACACTGTCTCAATATTATCC AGATTGTGTTTCCATTGCGGTGCAGAAAACCACATCTCACATAAATGCCA TTAACAGTCAGGCTACTAATGAGTTGTCCTGTGAGATCACTCACCCATCG tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggtggggcaggacagcaagggggaggattgggaagacaatagc aggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGG TGCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGG GGGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTA AACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGG GGGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCA ACGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCT CCTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAG TCGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCG TCTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTC CCTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCT GCTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACA GATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccac cATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGG TCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAG GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCT ACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGAC TTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTT CTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGG GCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAG GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAA CGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCA AGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTAC CAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCA CTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCG ATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGC ATGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaa ataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgc attctagttgtggtttgtccaaactcatcaatgtatcttatCATACCTCA GGGCAGATCAATTCCGCACAGACCTCTAACTCTGAGCTGCCTCCAAAGCC AGCTGCAGTGGTGAGTGAGGCCTGTGATGCTGATGATGCTGATAATGCCA GTAAACTAGCTGCAATGCTAAATACCTGTTCCTTTCAGAAACCAGAACAA CTACAACAACAAAAATCAGTTTTTGAGATATGCCCATCTCCTGCAGAAAA TAACATCCAGGGAACCACAAAGCTAGCGTCTGGTGAAGAATTCTGTTCAG GTTCCAGCAGCAATTTGCAAGCTCCTGGTGGCAGCTCTGAACGGTATTTA AAACAAAATGAAATGAATGGTGCTTACTTCAAGCAAAGCTCAGTGTTCAC TAAGGATTCCTTTTCTGCCACTACCACACCACCACCACCATCACAATTGC TTCTTTCTCCCCCTCCTCCTCTTCCACAGGTTCCTCAGCTT (SEQ ID NO: 298) pARBI-694: GGAAAAAGCACTCTGAATGGTGGAGTTTTAGAAGAACACCACCACTACCC 2541 TET2_3_ CAACCAAAGTAACACAACACTTTTAAGGGAAGTGAAAATAGAGGGTAAAC Exogenous_75 CTGAGGCACCACCTTCCCAGAGTCCTAATCCATCTACACATGTATGCAGC CCTTCTCCGATGCTTTCTGAAAGGCCTCAGAATAATTGTGTGAACAGGAA TGACATACAGACTGCAGGGACAATGACTGTTCCATTGTGTTCTGAGAAAA CAAGACCAATGTCAGAACACCTCAAGCATAACCCACCAATTTTTGGTAGC AGTGGAGAGCTACAGGACAACTGCCAGCAGTTGATGAGAAACAAAGAGCA AGAGATTCTGAAGGGTCGAGACAAGGAGCAAACACGAGATCTTGTGCCCC CAACACAGCACTATCTGAAACCAGGATGGATTGAATTGAAGGCCCCTCGT tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggggggcaggacagcaagggggaggattgggaagacaatagca ggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGGT GCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGGG GGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTAA ACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGGG GGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCAA CGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCTC CTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAGT CGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCGT CTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTCC CTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCTG CTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACAG ATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccacc ATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGGT CGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAGG GCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCACC ACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCTA CGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGACT TCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTTC TTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGGG CGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAGG ACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAAC GTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCAA GATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTACC AGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCAC TACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCGA TCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGCA TGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaaa taaagcaatagcatcacaaatttcacaaataaagcatttttttcactgca ttctagttgtggtttgtccaaactcatcaatgtatcttatTTTCACCAAG CGGAATCCCATCTAAAACGTAATGAGGCATCACTGCCATCAATTCTTCAG TATCAACCCAATCTCTCCAATCAAATGACCTCCAAACAATACACTGGAAA TTCCAACATGCCTGGGGGGCTCCCAAGGCAAGCTTACACCCAGAAAACAA CACAGCTGGAGCACAAGTCACAAATGTACCAAGTTGAAATGAATCAAGGG CAGTCCCAAGGTACAGTGGACCAACATCTCCAGTTCCAAAAACCCTCACA CCAGGTGCACTTCTCCAAAACAGACCATTTACCAAAAGCTCATGTGCAGT CACTGTGTGGCACTAGATTTCATTTTCAACAAAGAGCAGATTCCCAAACT GAAAAACTTATGTCCCCAGTGTTGAAACAGCACTTGAATCAACAGGCTTC AGAGACTGAGCCATTTTCAAACTCACACCTTTTGCAACAT (SEQ ID NO: 299) pARBI-695: AAAGGGAGGTTGAGATGGGCTGAGGTCTTCTAGGAGGCGGGGAGGGAGTG 2541 TIGIT_1_ CAGCCTTGACAAGCCTCCCTGTGGGCGAGGTGTAAAGAGGGGCAGAAGTC Exogenous_76 AGCTCTGGAAGCATAGGGCAGTGGGTGGGGAGGAGATGGGCTGGGCTGGG CTGGAGTAGAGATGTGTGGAGAAGGGTGAGAAGACTGGAAAGACAACCTG AATGGGGGACTGGGAGCCTTGAATAACAGGCATGGAGAGGAGCGTCTCTT GAAATGGAAGAAACAGGAAATAATTACAGCCTCTATGGAGGAGCAACAGG ATGGACTGGAGAAACTATCATTCCAAAATCCAGTTGGGGCCTCAAAGGCC CTTAGAATTTTTCTAGGAAGGTTGAAGGCCAGCTGCTGACCCAGGACTCA CATGTGCTTCGTCCTCTTCCCTAGGAATGATGACAGGCACAATAGAAACA tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggtggggcaggacagcaagggggaggattgggaagacaatagc aggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGG TGCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGG GGGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTA AACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGG GGGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCA ACGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCT CCTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAG TCGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCG TCTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTC CCTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCT GCTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACA GATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccac cATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGG TCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAG GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCT ACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGAC TTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTT CTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGG GCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAG GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAA CGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCA AGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTAC CAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCA CTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCG ATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGC ATGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaa ataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgc attctagttgtggtttgtccaaactcatcaatgtatcttatACGGGGAAC ATTTCTGCAGAGAAAGGTGGCTCTATCATCTTACAATGTCACCTCTCCTC CACCACGGCACAAGTGACCCAGGTCAACTGGGAGCAGCAGGACCAGCTTC TGGCCATTTGTAATGCTGACTTGGGGTGGCACATCTCCCCATCCTTCAAG GATCGAGTGGCCCCAGGTCCCGGCCTGGGCCTCACCCTCCAGTCGCTGAC CGTGAACGATACAGGGGAGTACTTCTGCATCTATCACACCTACCCTGATG GGACGTACACTGGGAGAATCTTCCTGGAGGTCCTAGAAAGCTCAGGTATT CCTGCTGGAGCAAGTTGGTGGATAAACCTCTCCCTCTAGCATAGAAAATG CAATCCTGAAACACTGCACAGCAGGGCTTCTCAATTCGGGATCACATTTG AATCACCTGAGGAGATTTTAAATCATACTGATGCCGAGGCC (SEQ ID NO: 300) pARBI-696: GAGGAGATGGGCTGGGCTGGGCTGGAGTAGAGATGTGTGGAGAAGGGTGA 2541 TIGIT_2_ GAAGACTGGAAAGACAACCTGAATGGGGGACTGGGAGCCTTGAATAACAG Exogenous_77 GCATGGAGAGGAGCGTCTCTTGAAATGGAAGAAACAGGAAATAATTACAG CCTCTATGGAGGAGCAACAGGATGGACTGGAGAAACTATCATTCCAAAAT CCAGTTGGGGCCTCAAAGGCCCTTAGAATTTTTCTAGGAAGGTTGAAGGC CAGCTGCTGACCCAGGACTCACATGTGCTTCGTCCTCTTCCCTAGGAATG ATGACAGGCACAATAGAAACAACGGGGAACATTTCTGCAGAGAAAGGTGG CTCTATCATCTTACAATGTCACCTCTCCTCCACCACGGCACAAGTGACCC AGGTCAACTGGGAGCAGCAGGACCAGCTTCTGGCCATTTGTAATGCTGAC tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggtggggcaggacagcaagggggaggattgggaagacaatagc aggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGG TGCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGG GGGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTA AACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGG GGGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCA ACGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCT CCTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAG TCGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCG TCTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTC CCTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCT GCTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACA GATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccac cATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGG TCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAG GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCT ACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGAC TTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTT CTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGG GCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAG GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAA CGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCA AGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTAC CAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCA CTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCG ATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGC ATGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaa ataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgc attctagttgtggtttgtccaaactcatcaatgtatcttatTTGGGGTGG CACATCTCCCCATCCTTCAAGGATCGAGTGGCCCCAGGTCCCGGCCTGGG CCTCACCCTCCAGTCGCTGACCGTGAACGATACAGGGGAGTACTTCTGCA TCTATCACACCTACCCTGATGGGACGTACACTGGGAGAATCTTCCTGGAG GTCCTAGAAAGCTCAGGTATTCCTGCTGGAGCAAGTTGGTGGATAAACCT CTCCCTCTAGCATAGAAAATGCAATCCTGAAACACTGCACAGCAGGGCTT CTCAATTCGGGATCACATTTGAATCACCTGAGGAGATTTTAAATCATACT GATGCCGAGGCCTCACCCAGACCAATTCAATCAGAATCCCTAATAGCAGA GCTAAACAAGGGTAAGGTCTAAAAGCATTTCCAGGTGATTCTAATGGGCA GCCAATACTGAGAACCACTGTTCTTATGTAAGAAGCACATC (SEQ ID NO: 301) pARBI-697: CCTCCCTGTGGGCGAGGTGTAAAGAGGGGCAGAAGTCAGCTCTGGAAGCA 2541 TIGIT_3_ TAGGGCAGTGGGTGGGGAGGAGATGGGCTGGGCTGGGCTGGAGTAGAGAT Exogenous_78 GTGTGGAGAAGGGTGAGAAGACTGGAAAGACAACCTGAATGGGGGACTGG GAGCCTTGAATAACAGGCATGGAGAGGAGCGTCTCTTGAAATGGAAGAAA CAGGAAATAATTACAGCCTCTATGGAGGAGCAACAGGATGGACTGGAGAA ACTATCATTCCAAAATCCAGTTGGGGCCTCAAAGGCCCTTAGAATTTTTC TAGGAAGGTTGAAGGCCAGCTGCTGACCCAGGACTCACATGTGCTTCGTC CTCTTCCCTAGGAATGATGACAGGCACAATAGAAACAACGGGGAACATTT CTGCAGAGAAAGGTGGCTCTATCATCTTACAATGTCACCTCTCCTCCACC tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggggggcaggacagcaagggggaggattgggaagacaatagca ggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGGT GCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGGG GGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTAA ACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGGG GGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCAA CGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCTC CTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAGT CGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCGT CTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTCC CTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCTG CTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACAG ATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccacc ATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGGT CGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAGG GCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCACC ACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCTA CGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGACT TCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTTC TTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGGG CGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAGG ACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAAC GTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCAA GATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTACC AGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCAC TACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCGA TCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGCA TGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaaa taaagcaatagcatcacaaatttcacaaataaagcatttttttcactgca ttctagttgtggtttgtccaaactcatcaatgtatcttatACGGCACAAG TGACCCAGGTCAACTGGGAGCAGCAGGACCAGCTTCTGGCCATTTGTAAT GCTGACTTGGGGTGGCACATCTCCCCATCCTTCAAGGATCGAGTGGCCCC AGGTCCCGGCCTGGGCCTCACCCTCCAGTCGCTGACCGTGAACGATACAG GGGAGTACTTCTGCATCTATCACACCTACCCTGATGGGACGTACACTGGG AGAATCTTCCTGGAGGTCCTAGAAAGCTCAGGTATTCCTGCTGGAGCAAG TTGGTGGATAAACCTCTCCCTCTAGCATAGAAAATGCAATCCTGAAACAC TGCACAGCAGGGCTTCTCAATTCGGGATCACATTTGAATCACCTGAGGAG ATTTTAAATCATACTGATGCCGAGGCCTCACCCAGACCAATTCAATCAGA ATCCCTAATAGCAGAGCTAAACAAGGGTAAGGTCTAAAAG (SEQ ID NO: 302) pARBI-698: TAATGCCCAGCCTAAGTTGGGGAGACCACTCCAGATTCCAAGATGTACAG 2541 TRAC_1_ TTTGCTTTGCTGGGCCTTTTTCCCATGCCTGCCTTTACTCTGCCAGAGTT Exogenous_79 ATATTGCTGGGGTTTTGAAGAAGATCCTATTAAATAAAAGAATAAGCAGT ATTATTAAGTAGCCCTGCATTTCAGGTTTCCTTGAGTGGCAGGCCAGGCC TGGCCGTGAACGTTCACTGAAATCATGGCCTCTTGGCCAAGATTGATAGC TTGTGCCTGTCCCTGAGTCCCAGTCCATCACGAGCAGCTGGTTTCTAAGA TGCTATTTCCCGTATAAAGCATGAGACCGTGACTTGCCAGCCCCACAGAG CCCCGCCCTTGTCCATCACTGGCATCTGGACTCCAGCCTGGGTTGGGGCA AAGAGGGAAATGAGATCATGTCCTAACCCTGATCCTCTTGTCCCACAGAT tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggtggggcaggacagcaagggggaggattgggaagacaatagc aggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGG TGCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGG GGGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTA AACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGG GGGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCA ACGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCT CCTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAG TCGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCG TCTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTC CCTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCT GCTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACA GATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccac cATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGG TCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAG GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCT ACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGAC TTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTT CTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGG GCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAG GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAA CGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCA AGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTAC CAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCA CTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCG ATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGC ATGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaa ataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgc attctagttgtggtttgtccaaactcatcaatgtatcttatATCCAGAAC CCTGACCCTGCCGTGTACCAGCTGAGAGACTCTAAATCCAGTGACAAGTC TGTCTGCCTATTCACCGATTTTGATTCTCAAACAAATGTGTCACAAAGTA AGGATTCTGATGTGTATATCACAGACAAAACTGTGCTAGACATGAGGTCT ATGGACTTCAAGAGCAACAGTGCTGTGGCCTGGAGCAACAAATCTGACTT TGCATGTGCAAACGCCTTCAACAACAGCATTATTCCAGAAGACACCTTCT TCCCCAGCCCAGGTAAGGGCAGCTTTGGTGCCTTCGCAGGCTGTTTCCTT GCTTCAGGAATGGCCAGGTTCTGCCCAGAGCTCTGGTCAATGATGTCTAA AACTCCTCTGATTGGTGGTCTCGGCCTTATCCATTGCCACCAAAACCCTC TTTTTACTAAGAAACAGTGAGCCTTGTTCTGGCAGTCCAGA (SEQ ID NO: 303) pARBI-699: GATTCCAAGATGTACAGTTTGCTTTGCTGGGCCTTTTTCCCATGCCTGCC 2541 TRAC_2_ TTTACTCTGCCAGAGTTATATTGCTGGGGTTTTGAAGAAGATCCTATTAA Exogenous_80 ATAAAAGAATAAGCAGTATTATTAAGTAGCCCTGCATTTCAGGTTTCCTT GAGTGGCAGGCCAGGCCTGGCCGTGAACGTTCACTGAAATCATGGCCTCT TGGCCAAGATTGATAGCTTGTGCCTGTCCCTGAGTCCCAGTCCATCACGA GCAGCTGGTTTCTAAGATGCTATTTCCCGTATAAAGCATGAGACCGTGAC TTGCCAGCCCCACAGAGCCCCGCCCTTGTCCATCACTGGCATCTGGACTC CAGCCTGGGTTGGGGCAAAGAGGGAAATGAGATCATGTCCTAACCCTGAT CCTCTTGTCCCACAGATATCCAGAACCCTGACCCTGCCGTGTACCAGCTG tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggtggggcaggacagcaagggggaggattgggaagacaatagc aggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGG TGCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGG GGGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTA AACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGG GGGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCA ACGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCT CCTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAG TCGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCG TCTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTC CCTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCT GCTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACA GATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccac cATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGG TCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAG GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCT ACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGAC TTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTT CTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGG GCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAG GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAA CGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCA AGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTAC CAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCA CTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCG ATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGC ATGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaa ataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgc attctagttgtggtttgtccaaactcatcaatgtatcttatAGAGACTCT AAATCCAGTGACAAGTCTGTCTGCCTATTCACCGATTTTGATTCTCAAAC AAATGTGTCACAAAGTAAGGATTCTGATGTGTATATCACAGACAAAACTG TGCTAGACATGAGGTCTATGGACTTCAAGAGCAACAGTGCTGTGGCCTGG AGCAACAAATCTGACTTTGCATGTGCAAACGCCTTCAACAACAGCATTAT TCCAGAAGACACCTTCTTCCCCAGCCCAGGTAAGGGCAGCTTTGGTGCCT TCGCAGGCTGTTTCCTTGCTTCAGGAATGGCCAGGTTCTGCCCAGAGCTC TGGTCAATGATGTCTAAAACTCCTCTGATTGGTGGTCTCGGCCTTATCCA TTGCCACCAAAACCCTCTTTTTACTAAGAAACAGTGAGCCTTGTTCTGGC AGTCCAGAGAATGACACGGGAAAAAAGCAGATGAAGAGAAG (SEQ ID NO: 304) pARBI-700: TTCCTGTGGTAGCGGAACTCACTAAGGGGCCCATCTGGACCCGAGGTATT 2541 TRAC_3_ GTGATGATAAATTCTGAGCACCTACCCCATCCCCAGAAGGGCTCAGAAAT Exogenous_81 AAAATAAGAGCCAAGTCTAGTCGGTGTTTCCTGTCTTGAAACACAATACT GTTGGCCCTGGAAGAATGCACAGAATCTGTTTGTAAGGGGATATGCACAG AAGCTGCAAGGGACAGGAGGTGCAGGAGCTGCAGGCCTCCCCCACCCAGC CTGCTCTGCCTTGGGGAAAACCGTGGGTGTGTCCTGCAGGCCATGCAGGC CTGGGACATGCAAGCCCATAACCGCTGTGGCCTCTTGGTTTTACAGATAC GAACCTAAACTTTCAAAACCTGTCAGTGATTGGGTTCCGAATCCTCCTCC TGAAAGTGGCCGGGTTTAATCTGCTCATGACGCTGCGGCTGTGGTCCAGC tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggtggggcaggacagcaagggggaggattgggaagacaatagc aggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGG TGCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGG GGGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTA AACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGG GGGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCA ACGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCT CCTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAG TCGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCG TCTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTC CCTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCT GCTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACA GATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccac cATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGG TCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAG GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCT ACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGAC TTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTT CTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGG GCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAG GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAA CGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCA AGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTAC CAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCA CTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCG ATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGC ATGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaa ataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgc attctagttgtggtttgtccaaactcatcaatgtatcttatTGAGGTGAG GGGCCTTGAAGCTGGGAGTGGGGTTTAGGGACGCGGGTCTCTGGGTGCAT CCTAAGCTCTGAGAGCAAACCTCCCTGCAGGGTCTTGCTTTTAAGTCCAA AGCCTGAGCCCACCAAACTCTCCTACTTCTTCCTGTTACAAATTCCTCTT GTGCAATAATAATGGCCTGAAACGCTGTAAAATATCCTCATTTCAGCCGC CTCAGTTGCACTTCTCCCCTATGAGGTAGGAAGAACAGTTGTTTAGAAAC GAAGAAACTGAGGCCCCACAGCTAATGAGTGGAGGAAGAGAGACACTTGT GTACACCACATGCCTTGTGTTGTACTTCTCTCACCGTGTAACCTCCTCAT GTCCTCTCTCCCCAGTACGGCTCTCTTAGCTCAGTAGAAAGAAGACATTA CACTCATATTACACCCCAATCCTGGCTAGAGTCTCCGCACC (SEQ ID NO: 305) pARBI-701: ATTATTAAGTAGCCCTGCATTTCAGGTTTCCTTGAGTGGCAGGCCAGGCC 2541 TRAC_4_ TGGCCGTGAACGTTCACTGAAATCATGGCCTCTTGGCCAAGATTGATAGC Exogenous_82 TTGTGCCTGTCCCTGAGTCCCAGTCCATCACGAGCAGCTGGTTTCTAAGA TGCTATTTCCCGTATAAAGCATGAGACCGTGACTTGCCAGCCCCACAGAG CCCCGCCCTTGTCCATCACTGGCATCTGGACTCCAGCCTGGGTTGGGGCA AAGAGGGAAATGAGATCATGTCCTAACCCTGATCCTCTTGTCCCACAGAT ATCCAGAACCCTGACCCTGCCGTGTACCAGCTGAGAGACTCTAAATCCAG TGACAAGTCTGTCTGCCTATTCACCGATTTTGATTCTCAAACAAATGTGT CACAAAGTAAGGATTCTGATGTGTATATCACAGACAAAACTGTGCTAGAC tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggtggggcaggacagcaagggggaggattgggaagacaatagc aggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGG TGCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGG GGGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTA AACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGG GGGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCA ACGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCT CCTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAG TCGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCG TCTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTC CCTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCT GCTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACA GATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccac cATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGG TCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAG GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCT ACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGAC TTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTT CTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGG GCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAG GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAA CGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCA AGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTAC CAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCA CTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCG ATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGC ATGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaa ataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgc attctagttgtggtttgtccaaactcatcaatgtatcttatATGAGGTCT ATGGACTTCAAGAGCAACAGTGCTGTGGCCTGGAGCAACAAATCTGACTT TGCATGTGCAAACGCCTTCAACAACAGCATTATTCCAGAAGACACCTTCT TCCCCAGCCCAGGTAAGGGCAGCTTTGGTGCCTTCGCAGGCTGTTTCCTT GCTTCAGGAATGGCCAGGTTCTGCCCAGAGCTCTGGTCAATGATGTCTAA AACTCCTCTGATTGGTGGTCTCGGCCTTATCCATTGCCACCAAAACCCTC TTTTTACTAAGAAACAGTGAGCCTTGTTCTGGCAGTCCAGAGAATGACAC GGGAAAAAAGCAGATGAAGAGAAGGTGGCAGGAGAGGGCACGTGGCCCAG CCTCAGTCTCTCCAACTGAGTTCCTGCCTGCCTGCCTTTGCTCAGACTGT TTGCCCCTTACTGCTCTTCTAGGCCTCATTCTAAGCCCCTT (SEQ ID NO: 306) pARBI-702: TTCATTTCCATTTGAGTTGTTCTTATTGAGTCATCCTTCCTGTGGTAGCG 2541 TRAC_6_ GAACTCACTAAGGGGCCCATCTGGACCCGAGGTATTGTGATGATAAATTC Exogenous_84 TGAGCACCTACCCCATCCCCAGAAGGGCTCAGAAATAAAATAAGAGCCAA GTCTAGTCGGTGTTTCCTGTCTTGAAACACAATACTGTTGGCCCTGGAAG AATGCACAGAATCTGTTTGTAAGGGGATATGCACAGAAGCTGCAAGGGAC AGGAGGTGCAGGAGCTGCAGGCCTCCCCCACCCAGCCTGCTCTGCCTTGG GGAAAACCGTGGGTGTGTCCTGCAGGCCATGCAGGCCTGGGACATGCAAG CCCATAACCGCTGTGGCCTCTTGGTTTTACAGATACGAACCTAAACTTTC AAAACCTGTCAGTGATTGGGTTCCGAATCCTCCTCCTGAAAGTGGCCGGG tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggtggggcaggacagcaagggggaggattgggaagacaatagc aggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGG TGCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGG GGGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTA AACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGG GGGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCA ACGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCT CCTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAG TCGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCG TCTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTC CCTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCT GCTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACA GATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccac cATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGG TCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAG GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCT ACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGAC TTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTT CTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGG GCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAG GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAA CGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCA AGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTAC CAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCA CTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCG ATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGC ATGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaa ataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgc attctagttgtggtttgtccaaactcatcaatgtatcttatTTTAATCTG CTCATGACGCTGCGGCTGTGGTCCAGCTGAGGTGAGGGGCCTTGAAGCTG GGAGTGGGGTTTAGGGACGCGGGTCTCTGGGTGCATCCTAAGCTCTGAGA GCAAACCTCCCTGCAGGGTCTTGCTTTTAAGTCCAAAGCCTGAGCCCACC AAACTCTCCTACTTCTTCCTGTTACAAATTCCTCTTGTGCAATAATAATG GCCTGAAACGCTGTAAAATATCCTCATTTCAGCCGCCTCAGTTGCACTTC TCCCCTATGAGGTAGGAAGAACAGTTGTTTAGAAACGAAGAAACTGAGGC CCCACAGCTAATGAGTGGAGGAAGAGAGACACTTGTGTACACCACATGCC TTGTGTTGTACTTCTCTCACCGTGTAACCTCCTCATGTCCTCTCTCCCCA GTACGGCTCTCTTAGCTCAGTAGAAAGAAGACATTACACTC (SEQ ID NO: 307) pARBI-703: CAGTCCATCACGAGCAGCTGGTTTCTAAGATGCTATTTCCCGTATAAAGC 2541 TRAC_ ATGAGACCGTGACTTGCCAGCCCCACAGAGCCCCGCCCTTGTCCATCACT Exogenous_83 GGCATCTGGACTCCAGCCTGGGTTGGGGCAAAGAGGGAAATGAGATCATG TCCTAACCCTGATCCTCTTGTCCCACAGATATCCAGAACCCTGACCCTGC CGTGTACCAGCTGAGAGACTCTAAATCCAGTGACAAGTCTGTCTGCCTAT TCACCGATTTTGATTCTCAAACAAATGTGTCACAAAGTAAGGATTCTGAT GTGTATATCACAGACAAAACTGTGCTAGACATGAGGTCTATGGACTTCAA GAGCAACAGTGCTGTGGCCTGGAGCAACAAATCTGACTTTGCATGTGCAA ACGCCTTCAACAACAGCATTATTCCAGAAGACACCTTCTTCCCCAGCCCA tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggtggggcaggacagcaagggggaggattgggaagacaatagc aggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGG TGCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGG GGGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTA AACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGG GGGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCA ACGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCT CCTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAG TCGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCG TCTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTC CCTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCT GCTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACA GATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccac cATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGG TCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAG GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCT ACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGAC TTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTT CTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGG GCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAG GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAA CGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCA AGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTAC CAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCA CTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCG ATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGC ATGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaa ataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgc attctagttgtggtttgtccaaactcatcaatgtatcttatGGTAAGGGC AGCTTTGGTGCCTTCGCAGGCTGTTTCCTTGCTTCAGGAATGGCCAGGTT CTGCCCAGAGCTCTGGTCAATGATGTCTAAAACTCCTCTGATTGGTGGTC TCGGCCTTATCCATTGCCACCAAAACCCTCTTTTTACTAAGAAACAGTGA GCCTTGTTCTGGCAGTCCAGAGAATGACACGGGAAAAAAGCAGATGAAGA GAAGGTGGCAGGAGAGGGCACGTGGCCCAGCCTCAGTCTCTCCAACTGAG TTCCTGCCTGCCTGCCTTTGCTCAGACTGTTTGCCCCTTACTGCTCTTCT AGGCCTCATTCTAAGCCCCTTCTCCAAGTTGCCTCTCCTTATTTCTCCCT GTCTGCCAAAAAATCTTTCCCAGCTCACTAAGTCAGTCTCACGCAGTCAC TCATTAACCCACCAATCACTGATTGTGCCGGCACATGAATG (SEQ ID NO: 308) pARBI-704: TGAGGAAGAAGCGCGGGCGGCGCCTTCGGGAGGCGAGCAGGCAGCAGTTG 2541 TRIM28_1_ GCCGTGCCGTAGCAGCGTCCCGCGCGCGGCGGGCAGCGGCCCAGGAGGCG Exogenous_85 CGTGGCGGCGCTCGGCCTCGCGGCGGCGGCGGCGGCAGCGGCCCAGCAGT TGGCGGCGAGCGCGTCTGCGCCTGCGCGGCGGGCCCCGCGCCCCTCCTCC CCCCCTGGGCGCCCCCGGCGGCGTGTGAATGGCGGCCTCCGCGGCGGCAG CCTCGGCAGCAGCGGCCTCGGCCGCCTCTGGCAGCCCGGGCCCGGGCGAG GGCTCCGCTGGCGGCGAAAAGCGCTCCACCGCCCCTTCGGCCGCAGCCTC GGCCTCTGCCTCAGCCGCGGCGTCGTCGCCCGCGGGGGGCGGCGCCGAGG CGCTGGAGCTGCTGGAGCACTGCGGCGTGTGCAGAGAGCGCCTGCGACCC tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggtggggcaggacagcaagggggaggattgggaagacaatagc aggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGG TGCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGG GGGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTA AACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGG GGGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCA ACGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCT CCTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAG TCGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCG TCTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTC CCTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCT GCTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACA GATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccac cATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGG TCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAG GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCT ACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGAC TTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTT CTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGG GCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAG GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAA CGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCA AGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTAC CAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCA CTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCG ATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGC ATGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaa ataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgc attctagttgtggtttgtccaaactcatcaatgtatcttatGAGAGGGAG CCCCGCCTGCTGCCCTGTTTGCACTCGGCCTGTAGTGCCTGCTTAGGGCC CGCGGCCCCCGCCGCCGCCAACAGCTCGGGGGACGGCGGGGCGGCGGGCG ACGGCACCGGTAAGTACGAAGTGATCGGTGCCACCCCTCCCCCTACTCTC TGCCTTTGATTCCGACTGGGTGCAGAGATGAGGATGCCACCTGGGCGAGA GGATGGGGGCCCGGACAGGGCACGGGAAATACTTTCTGGGTCCTGCATAC GAACGTGGGTTTGTGCTGGCCGCTGAGATGGGACATCTGACTAAAGTTGG AGAAAAGAAGGCTCGGGGAGGGGAGGGGCTGGTTCGCTGCGGGATAATGG TCGGGGGCCCACCCAGCAGGGGAATGGTGGGGGCCATAACCTGGGTGGGA ACTTGTAACAGTCTCCCACATCCCTGCTTCTCGAAGTGGTG (SEQ ID NO: 309) pARBI-705: TCGAAGTGGTGGACTGTCCCGTGTGCAAGCAACAGTGCTTCTCCAAAGAC 2541 TRIM28_2_ ATCGTGGAGAATTATTTCATGCGTGATAGTGGCAGCAAGGCTGCCACCGA Exogenous_86 CGCCCAGGATGCGAACCAGGTGCGTCCTATCTCAGCAACCACAAGGAGGT TTCTGGGGAGGGGGCATCTGCGCAGGAGGAGCTTGGCACCAGCTCCAGGC TGTTACTCCACTTTCCCAAGGCTCTGGGTGGGCTGCCTAGGTTGGGTCAA GGGACCAATCTTAAATCTCCGGTTGTATTTTCTGGGATGTAAACGTGGAT CTATCAAGTIGTCTTGCCTTCTCTGACCCTGCCTTTGTCTGGCAGTGCTG CACTAGCTGTGAGGATAATGCCCCAGCCACCAGCTACTGTGTGGAGTGCT CGGAGCCTCTGTGTGAGACCTGTGTAGAGGCGCACCAGCGGGTGAAGTAC tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggtggggcaggacagcaagggggaggattgggaagacaatagc aggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGG TGCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGG GGGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTA AACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGG GGGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCA ACGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCT CCTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAG TCGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCG TCTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTC CCTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCT GCTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACA GATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccac cATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGG TCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAG GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCT ACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGAC TTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTT CTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGG GCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAG GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAA CGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCA AGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTAC CAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCA CTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCG ATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGC ATGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaa ataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgc attctagttgtggtttgtccaaactcatcaatgtatcttatACCAAGGAC CATACTGTGCGCTCTACTGGTACATGAGGCTGAGGGGGGCTGTTGGAGTT GTTCTCCCATGTGTGCCCTCAGTTGCTTTTATGATGTTGGTTGCATCTGG TGGATGGGTCCTAGAGTTCTCTAGGGGGTGCCGCCCGAAGGGCCCGAGGG CAGAACTCCAGAAGCAGAAAAACTGGGGTTGTGGTGTGTAGTCTCTAGGG CCTAGGTGGGAGAGGGTGGGAAGGGGAAATAAGGGAGACCTTAATGTGCT GCAGGGAGGTACAAAGGGTTTGAGAAGGCTTATCAGGGAGTTGTCAGACC TGGTGGTGAAGGGCTCAGCATATGCAAACAGGGAAAGGCATGGTGTGAAG GGCTTTCTGGGTTTGTGGTTCCCTGGCACACATCTGGTATAAGATGCTGC TAGGGAAAGTAGACTGTGGGTCCATGGATTATAAGTGATGA (SEQ ID NO: 310) pARBI-706: GGCGCCCAATGCGCGTGCGCGGCGGCGTCGGCGCCAGTTATTTCTGTCCC 2541 TRIM28_3_ GCCCCCCGGCCTCGGCTCTTTCTGCGAGCGGGCGCGCGGGCGAGCGGTTG Exogenous_87 TGCTTGTGCTTGTGGCGCGTGGTGCGGGTTTCGGCGGCGGCTGAGGAAGA AGCGCGGGCGGCGCCTTCGGGAGGCGAGCAGGCAGCAGTTGGCCGTGCCG TAGCAGCGTCCCGCGCGCGGCGGGCAGCGGCCCAGGAGGCGCGTGGCGGC GCTCGGCCTCGCGGCGGCGGCGGCGGCAGCGGCCCAGCAGTTGGCGGCGA GCGCGTCTGCGCCTGCGCGGCGGGCCCCGCGCCCCTCCTCCCCCCCTGGG CGCCCCCGGCGGCGTGTGAATGGCGGCCTCCGCGGCGGCAGCCTCGGCAG CAGCGGCCTCGGCCGCCTCTGGCAGCCCGGGCCCGGGCGAGGGCTCCGCT tgacgactgtgccttctagttgccagccatctgttgtttgcccctccccc gtgccttccttgaccctggaaggtgccactcccactgtcctttcctaata aaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg ggggtggggtggggcaggacagcaagggggaggattgggaagacaatagc aggcatgctggggatgcggtgggctctatggGGATCTGCGATCGCTCCGG TGCCCGTCAGTGGGCAGAGCGCACATCGCCCACAGTCCCCGAGAAGTTGG GGGGAGGGGTCGGCAATTGAACGGGTGCCTAGAGAAGGTGGCGCGGGGTA AACTGGGAAAGTGATGTCGTGTACTGGCTCCGCCTTTTTCCCGAGGGTGG GGGAGAACCGTATATAAGTGCAGTAGTCGCCGTGAACGTTCTTTTTCGCA ACGGGTTTGCCGCCAGAACACAGCTGAAGCTTCGAGGGGCTCGCATCTCT CCTTCACGCGCCCGCCGCCCTACCTGAGGCCGCCATCCACGCCGGTTGAG TCGCGTTCTGCCGCCTCCCGCCTGTGGTGCCTCCTGAACTGCGTCCGCCG TCTAGGTAAGTTTAAAGCTCAGGTCGAGACCGGGCCTTTGTCCGGCGCTC CCTTGGAGCCTACCTAGACTCAGCCGGCTCTCCACGCTTTGCCTGACCCT GCTTGCTCAACTCTACGTCTTTGTTTCGTTTTCTGTTCTGCGCCGTTACA GATCCAAGCTGTGACCGGCGCCTACTCTAGAGCTAGCGAATTgccgccac cATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGG TCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAG GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCT ACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGAC TTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTT CTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGG GCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAG GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAA CGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCA AGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTAC CAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCA CTACCTGAGCACCCAGTCCAAGCTGAGCAAAGACCCCAACGAGAAGCGCG ATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGC ATGGACGAGCTGTACAAatgattgtttattgcagcttataatggttacaa ataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgc attctagttgtggtttgtccaaactcatcaatgtatcttatGGCGGCGAA AAGCGCTCCACCGCCCCTTCGGCCGCAGCCTCGGCCTCTGCCTCAGCCGC GGCGTCGTCGCCCGCGGGGGGCGGCGCCGAGGCGCTGGAGCTGCTGGAGC ACTGCGGCGTGTGCAGAGAGCGCCTGCGACCCGAGAGGGAGCCCCGCCTG CTGCCCTGTTTGCACTCGGCCTGTAGTGCCTGCTTAGGGCCCGCGGCCCC CGCCGCCGCCAACAGCTCGGGGGACGGCGGGGCGGCGGGCGACGGCACCG GTAAGTACGAAGTGATCGGTGCCACCCCTCCCCCTACTCTCTGCCTTTGA TTCCGACTGGGTGCAGAGATGAGGATGCCACCTGGGCGAGAGGATGGGGG CCCGGACAGGGCACGGGAAATACTTTCTGGGTCCTGCATACGAACGTGGG TTTGTGCTGGCCGCTGAGATGGGACATCTGACTAAAGTTGG (SEQ ID NO: 311)
Claims (21)
1. A method of editing a cell having chromosomal DNA, comprising inserting at least one sequence comprising a transgene within a safe harbor locus in the chromosomal DNA of the cell, wherein the safe harbor locus is at any one or more of an sgRNA target loci selected from: chr11:128340000-128350000, chr15:92830000-92840000, chr10:33130000-33140000, chr10:72290000-72300000, chr11:65425000-65427000 (NEAT1), chr16: 11220000-11230000, chr2:87460000-87470000, chr3:186510000-186520000, chr3:59450000-59460000, chr8:127980000-128000000, chr9:7970000-7980000, APRT, B2M, CAPNS1, CBLB, CD2, CD3E, CD3G, CD5, EDF1, FTL, PTEN, PTPN2, PTPN6, PTPRC, PTPRCAP, RPS23, RTRAF, SERF2, SLC38A1, SMAD2, SOCS1, SRP14, SRSF9, SUB1, TET2, TIGIT, TRAC, or TRIM28.
2. The method of claim 1 , wherein the sgRNA target locus is chr11:128340000-128350000 or chr15:92830000-92840000.
3. The method of claim 1 , wherein the sgRNA target locus comprises an integration site comprising a start coordinate of chr11:128342576.
4. The method of claim 1 , wherein the sgRNA target locus comprises an integration site comprising a start coordinate of chr15:92831131.
5. The method of claim 1 , wherein the at least one sequence is operatively linked to an endogenous promoter.
6. The method of claim 1 , wherein the at least one sequence is operatively linked to an exogenous promoter.
7. The method of claim 6 , wherein the exogenous promoter is an EF1α promoter.
8. The method of claim 1 , wherein the cell is a natural killer (NK) cell, an induced pluripotent stem cell (iPSC), a human pluripotent stem cell (HSPC), a T cell, a CD8+ T cell, or a T cell progenitor cell.
9. The method of claim 8 , wherein the cell is a T cell, a CD8+ T cell, or a T cell progenitor cell.
10. The method of claim 8 , wherein the cell is a CD8+ T cell.
11. The method of claim 13 , wherein the transgene encodes a protein, a therapeutic agent, a cancer-specific therapeutic agent, a T-cell receptor (TCR), a Prime Receptor (primeR), or a chimeric antigen receptor (CAR).
12. The method of claim 1 , wherein the transgene encodes a chimeric antigen receptor (CAR).
13. The method of claim 1 , wherein the transgene encodes a T-cell receptor (TCR).
14. The method of claim 1 , wherein the transgene encodes a Prime Receptor (primeR).
15. The method of claim 1 , wherein the transgene encodes an shRNA, an miRNA, or an siRNA.
16. The method of claim 1 , wherein the at least one sequence is inserted using a homology-directed repair or a homology independent targeted insertion.
17. The method of claim 1 , wherein the at least one sequence is inserted using one or more guide ribonucleic acids (gRNAs) and one or more Cas9 endonucleases, optionally wherein the one or more gRNAs comprises any one of SEQ ID NOS: 1-120.
18. The method of claim 17 , wherein the one or more gRNAs comprises SEQ ID NO: 94.
19. The method of claim 17 , wherein the one or more gRNAs comprises SEQ ID NO: 102.
20. A guide ribonucleic acid (gRNA) for editing a cell at a safe harbor locus, wherein the gRNA comprises any one of SEQ ID NOS:1-120.
21. An ex vivo method of obtaining an engineered cell or population thereof, comprising:
a. obtaining a cell; and
b. genetically modifying the cell by inserting at least one sequence encoding a transgene within a safe harbor locus, wherein the safe harbor locus is at any one or more of an sgRNA target loci selected from: chr10:33130000-33140000, chr10:72290000-72300000, chr11:128340000-128350000, chr11:65425000-65427000 (NEAT1), chr15:92830000-92840000, chr16:11220000-11230000, chr2:87460000-87470000, chr3:186510000-186520000, chr3:59450000-59460000, chr8:127980000-128000000, chr9:7970000-7980000, APRT, B2M, CAPNS1, CBLB, CD2, CD3E, CD3G, CD5, EDF1, FTL, PTEN, PTPN2, PTPN6, PTPRC, PTPRCAP, RPS23, RTRAF, SERF2, SLC38A1, SMAD2, SOCS1, SRP14, SRSF9, SUB1, TET2, TIGIT, TRAC, or TRIM28.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US18/453,801 US20230416747A1 (en) | 2020-10-26 | 2023-08-22 | Safe harbor loci |
Applications Claiming Priority (5)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US202063105834P | 2020-10-26 | 2020-10-26 | |
US202163141926P | 2021-01-26 | 2021-01-26 | |
US17/239,484 US11332744B1 (en) | 2020-10-26 | 2021-04-23 | Safe harbor loci |
US17/716,816 US11761004B2 (en) | 2020-10-26 | 2022-04-08 | Safe harbor loci |
US18/453,801 US20230416747A1 (en) | 2020-10-26 | 2023-08-22 | Safe harbor loci |
Related Parent Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US17/716,816 Division US11761004B2 (en) | 2020-10-26 | 2022-04-08 | Safe harbor loci |
Publications (1)
Publication Number | Publication Date |
---|---|
US20230416747A1 true US20230416747A1 (en) | 2023-12-28 |
Family
ID=81258012
Family Applications (3)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US17/239,484 Active US11332744B1 (en) | 2020-10-26 | 2021-04-23 | Safe harbor loci |
US17/716,816 Active US11761004B2 (en) | 2020-10-26 | 2022-04-08 | Safe harbor loci |
US18/453,801 Pending US20230416747A1 (en) | 2020-10-26 | 2023-08-22 | Safe harbor loci |
Family Applications Before (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US17/239,484 Active US11332744B1 (en) | 2020-10-26 | 2021-04-23 | Safe harbor loci |
US17/716,816 Active US11761004B2 (en) | 2020-10-26 | 2022-04-08 | Safe harbor loci |
Country Status (1)
Country | Link |
---|---|
US (3) | US11332744B1 (en) |
Families Citing this family (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US11332744B1 (en) * | 2020-10-26 | 2022-05-17 | Arsenal Biosciences, Inc. | Safe harbor loci |
WO2023064928A2 (en) | 2021-10-14 | 2023-04-20 | Arsenal Biosciences, Inc. | Immune cells having co-expressed shrnas and logic gate systems |
WO2024100604A1 (en) | 2022-11-09 | 2024-05-16 | Juno Therapeutics Gmbh | Methods for manufacturing engineered immune cells |
WO2024161021A1 (en) | 2023-02-03 | 2024-08-08 | Juno Therapeutics Gmbh | Methods for non-viral manufacturing of engineered immune cells |
Family Cites Families (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20040142325A1 (en) | 2001-09-14 | 2004-07-22 | Liat Mintz | Methods and systems for annotating biomolecular sequences |
US20070161031A1 (en) | 2005-12-16 | 2007-07-12 | The Board Of Trustees Of The Leland Stanford Junior University | Functional arrays for high throughput characterization of gene expression regulatory elements |
US20130227715A1 (en) | 2010-02-26 | 2013-08-29 | Cellectis | Use of endonucleases for inserting transgenes into safe harbor loci |
AU2013355327A1 (en) | 2012-12-05 | 2015-06-11 | Sangamo Therapeutics, Inc. | Methods and compositions for regulation of metabolic disorders |
US8697359B1 (en) | 2012-12-12 | 2014-04-15 | The Broad Institute, Inc. | CRISPR-Cas systems and methods for altering expression of gene products |
SG10201809157VA (en) | 2014-04-18 | 2018-11-29 | Editas Medicine Inc | Crispr-cas-related methods, compositions and components for cancer immunotherapy |
US20170362635A1 (en) | 2016-06-20 | 2017-12-21 | University Of Washington | Muscle-specific crispr/cas9 editing of genes |
US20190185849A1 (en) | 2016-06-29 | 2019-06-20 | Crispr Therapeutics Ag | Compositions and methods for gene editing |
WO2018058064A1 (en) | 2016-09-23 | 2018-03-29 | Casebia Therapeutics Limited Liability Partnership | Compositions and methods for gene editing |
EP3759217A4 (en) | 2018-03-02 | 2022-05-11 | Generation Bio Co. | Closed-ended dna (cedna) vectors for insertion of transgenes at genomic safe harbors (gsh) in humans and murine genomes |
US11332744B1 (en) * | 2020-10-26 | 2022-05-17 | Arsenal Biosciences, Inc. | Safe harbor loci |
-
2021
- 2021-04-23 US US17/239,484 patent/US11332744B1/en active Active
-
2022
- 2022-04-08 US US17/716,816 patent/US11761004B2/en active Active
-
2023
- 2023-08-22 US US18/453,801 patent/US20230416747A1/en active Pending
Also Published As
Publication number | Publication date |
---|---|
US20220127616A1 (en) | 2022-04-28 |
US20220315928A1 (en) | 2022-10-06 |
US11332744B1 (en) | 2022-05-17 |
US11761004B2 (en) | 2023-09-19 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP7101419B2 (en) | Targeted substitution of endogenous T cell receptors | |
US20230416747A1 (en) | Safe harbor loci | |
JP6929791B2 (en) | Compositions and methods for epigenome editing | |
JP7385281B2 (en) | Method for producing low antigenicity cells | |
US20210317406A1 (en) | Gene targets for t-cell-based immunotherapy | |
IL297881A (en) | Selection by essential-gene knock-in | |
WO2022093846A1 (en) | Safe harbor loci | |
TW202235617A (en) | Compositions and methods for reducing mhc class ii in a cell | |
WO2021197342A1 (en) | Active dna transposon systems and methods for use thereof | |
JP2020528046A (en) | Compositions and Methods for Enhancing the Efficacy of T Cell-Based Immunotherapy | |
US20240307444A1 (en) | Methods and compositions for improved immunotherapies | |
CN116546993A (en) | Safe harbor gene locus | |
TW202417626A (en) | Immune cells having co-expressed tgfbr shrnas | |
CN118369110A (en) | CD38 compositions and methods for immunotherapy | |
WO2024123235A1 (en) | Safe harbour loci for cell engineering |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
STPP | Information on status: patent application and granting procedure in general |
Free format text: DOCKETED NEW CASE - READY FOR EXAMINATION |