WO2017009126A1 - Générateur de codes à barres d'adn aléatoire génétique pour le traçage in vivo de cellules - Google Patents

Générateur de codes à barres d'adn aléatoire génétique pour le traçage in vivo de cellules Download PDF

Info

Publication number
WO2017009126A1
WO2017009126A1 PCT/EP2016/065932 EP2016065932W WO2017009126A1 WO 2017009126 A1 WO2017009126 A1 WO 2017009126A1 EP 2016065932 W EP2016065932 W EP 2016065932W WO 2017009126 A1 WO2017009126 A1 WO 2017009126A1
Authority
WO
WIPO (PCT)
Prior art keywords
polynucleotide
barcoding
recombinase
host cell
sequence
Prior art date
Application number
PCT/EP2016/065932
Other languages
English (en)
Inventor
Hans-Reimer Rodewald
Thorsten B. FEYERABEND
Weike PEI
Original Assignee
Deutsches Krebsforschungszentrum
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Deutsches Krebsforschungszentrum filed Critical Deutsches Krebsforschungszentrum
Publication of WO2017009126A1 publication Critical patent/WO2017009126A1/fr

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6809Methods for determination or identification of nucleic acids involving differential detection

Definitions

  • the present invention relates to a barcoding polynucleotide comprising at least five, preferably at least ten core structures, wherein each of said core structures comprises two uniquely identifiable sequences separated by a recombinase recognition sequence, wherein said recombinase is a recombinase capable of (i) mediating excision or (ii) mediating excision or inversion of a polynucleotide flanked by two recognition sequences of said recombinase; and to a vector, a host cell, a host organism and an experimental animal comprising said barcoding polynucleotide.
  • the present invention relates to kits, methods, and uses related to said barcoding polynucleotide.
  • the means and methods of the invention are useful in barcoding of cell lines or of cells within an organism.
  • Recombinases have been used extensively to manipulate and modify DNA.
  • Cre a site-specific recombinase, was found to be particularly useful in genome editing, e.g. inducible and/or tissue-specific excision of DNA fragments from chromosomal DNA, by inverting and/or depleting DNA fragments (Branda et al. (2004), Dev Cell 6(1): 7; PMID: 14723844).
  • the present invention relates to a barcoding polynucleotide comprising at least five, preferably at least ten core structures, wherein each of said core structures comprises two uniquely identifiable sequences separated by a recombinase recognition sequence, wherein said recombinase is a recombinase capable of (i) mediating excision or (ii) mediating excision or inversion of a polynucleotide flanked by two recognition sequences of said recombinase.
  • the terms “have”, “comprise” or “include” or any arbitrary grammatical variations thereof are used in a non-exclusive way. Thus, these terms may both refer to a situation in which, besides the feature introduced by these terms, no further features are present in the entity described in this context and to a situation in which one or more further features are present.
  • the expressions “A has B”, “A comprises B” and “A includes B” may both refer to a situation in which, besides B, no other element is present in A (i.e. a situation in which A solely and exclusively consists of B) and to a situation in which, besides B, one or more further elements are present in entity A, such as element C, elements C and D or even further elements.
  • the terms “preferably”, “more preferably”, “most preferably”, “particularly”, “more particularly”, “specifically”, “more specifically” or similar terms are used in conjunction with optional features, without restricting alternative possibilities.
  • features introduced by these terms are optional features and are not intended to restrict the scope of the claims in any way.
  • the invention may, as the skilled person will recognize, be performed by using alternative features.
  • features introduced by “in an embodiment of the invention” or similar expressions are intended to be optional features, without any restriction regarding alternative embodiments of the invention, without any restrictions regarding the scope of the invention and without any restriction regarding the possibility of combining the features introduced in such way with other optional or non-optional features of the invention.
  • the term “about”, if not noted otherwise, relates to the indicated value ⁇ 20 %.
  • barcoding polynucleotide as used in accordance with the present invention relates to a DNA polynucleotide comprising the sequence features as described herein below.
  • the barcoding polynucleotide has the activity that an appropriate recombinase excises or inverts a sequence flanked by two recombinase recognition sequences. Suitable assays for measuring said activity are described in the accompanying examples or in Abremski & Hoess (1984), JBC 259(3): 1509, PMID: 6319400).
  • the barcoding polynucleotide comprises at least one, preferably at least two, more preferably at least three, still more preferably at least four, most preferably at least five of the nucleic acid sequences shown in SEQ ID NO: 1 to 30. More preferably, the barcoding polynucleotide comprises at least one, preferably at least two, more preferably at least three, still more preferably at least four, most preferably at least five of the nucleic acid sequences shown in SEQ ID NO: 1 to 10.
  • the barcoding polynucleotide comprises or consists of the nucleotide sequence of SEQ ID NO: 31 (PolyloxP2.0 w/o restriction sites) or 32 (PolyloxP2.0 with restriction sites). Most preferably, the barcoding polynucleotide comprises or consists of the nucleotide sequence of SEQ ID NO: 33.
  • the term "barcoding polynucleotide” as used in accordance with the present invention further encompasses variants of the aforementioned specific polynucleotides, provided that said variants still are polynucleotides having the activity as specified above.
  • the barcoding polynucleotide variants preferably, comprise a nucleic acid sequence characterized in that the sequence can be derived from the aforementioned specific nucleic acid sequences by at least one nucleotide substitution, addition and/or deletion.
  • Variants also encompass polynucleotides comprising a nucleic acid sequence which is capable of hybridizing to the aforementioned specific nucleic acid sequences, preferably, under stringent hybridization conditions. These stringent conditions are known to the skilled worker and can be found in Current Protocols in Molecular Biology, John Wiley & Sons, N. Y. (1989), 6.3.1-6.3.6.
  • SSC 6x sodium chloride/sodium citrate
  • 0.2x SSC 0.1% SDS at 50 to 65°C.
  • the skilled worker knows that these hybridization conditions differ depending on the type of nucleic acid and, for example when organic solvents are present, with regard to the temperature and concentration of the buffer.
  • the temperature differs depending on the type of nucleic acid between 42°C and 58°C in aqueous buffer with a concentration of 0.1 to 5x SSC (pH 7.2). If organic solvent is present in the abovementioned buffer, for example 50% formamide, the temperature under standard conditions is approximately 42°C.
  • the hybridization conditions for DNA:DNA hybrids are preferably for example O. lx SSC and 20°C to 45°C, preferably between 30°C and 45°C.
  • the hybridization conditions for DNA:R A hybrids are preferably, for example, O. lx SSC and 30°C to 55°C, preferably between 45°C and 55°C.
  • DNA or cDNA from bacteria, fungi, plants or animals may be used.
  • variants include polynucleotides comprising nucleic acid sequences which are at least 70%>, preferably at least 80%, more preferably at least 90%, still more preferably at least 95%, most preferably at least 98% to at least one, preferably at least two, more preferably at least three, still more preferably at least four, most preferably at least five of the nucleic acid sequences shown in SEQ ID NO: 1 to 10.
  • the percent identity values are, preferably, calculated over the entire nucleic acid sequence region. More preferably, the percent identity values are calculated over the recombinase recognition sequences, i.e.
  • the sequence identity values recited above in percent (%) are to be determined, preferably, using the program GAP over the entire sequence region with the following settings: Gap Weight: 50, Length Weight: 3, Average Match: 10.000 and Average Mismatch: 0.000, which, unless otherwise specified, shall always be used as standard settings for sequence alignments.
  • the barcoding polynucleotides of the present invention either essentially consist of the aforementioned nucleic acid sequences or comprise the aforementioned nucleic acid sequences. Thus, they may contain further nucleic acid sequences as well.
  • the barcoding polynucleotide of the present invention shall be provided, preferably, either as an isolated polynucleotide (i.e. isolated from its natural context) or in genetically modified form.
  • the term encompasses single as well as double stranded polynucleotides.
  • comprised are also chemically modified polynucleotides including naturally occurring modified polynucleotides such as glycosylated or methylated polynucleotides or artificially modified ones such as biotinylated polynucleotides.
  • the uniquely identifiable sequence is a sequence unique, i.e., preferably, occurring only once, within the barcoding polynucleotide of the present invention. More preferably, a uniquely identifiable sequence is a sequence not occurring within a host cell; i.e., preferably, occurring only once within a host cell after introducing the barcoding polynucleotide of the invention into the host cell. Most preferably, a uniquely identifiable sequence is a sequence not occurring within a host organism; i.e., preferably, occurring only once within a host organism after introducing the barcoding polynucleotide of the invention into the host organism.
  • the uniquely identifiable sequences separated by a recombinase recognition sequence in a core structure of the present invention are uniquely identifiable sequences differing in sequence.
  • the uniquely identifiable sequence has a length of at least 5 nucleotides, preferably at least 10 nucleotides, more preferably at least 25 nucleotides, even more preferably at least 50 nucleotides, most preferably at least 75 nucleotides.
  • the barcoding polynucleotide of the present invention comprises at least five, preferably at least seven, more preferably at least nine, most preferably at least 10 core structures; accordingly, the barcoding polynucleotide of the present invention comprises at least ten, preferably at least 14, more preferably at least 18, most preferably at least 20 uniquely identifiable sequences.
  • each uniquely identifiable sequence comprised in a barcoding polynucleotide of the present invention differs from all other uniquely identifiable sequences present in said barcoding polynucleotide by at least one deletion, insertion, or, preferably, substitution.
  • recombinase relates to a DNA recombinase capable of (i) mediating excision or (ii) mediating excision or inversion of a polynucleotide flanked by two recognition sequences of said recombinase.
  • excision is known to the skilled person and relates to the removal of a polynucleotide fragment from a polynucleotide and reestablishing covalent bonding for the remaining polynucleotide, i.e., preferably, re-sealing of the double-strand break formally generated by removing the polynucleotide fragment.
  • the term "inversion" of a polynucleotide fragment is known to the skilled person and relates to inverting the polynucleotide fragment relative to the surrounding nucleic acid sequence. Preferably, said inversion is mediated by excision of said polynucleotide fragment and its reinsertion in the inverse orientation.
  • Suitable recombinases known in the art are listed in Table 1.
  • the term “recombinase” includes variants of the aforesaid recombinases, wherein said variants are active in excising polynucleotide fragments flanked by recombinase recognition sequences of said recombinase. Preferably, activity is established in the excision assay as specified elsewhere herein.
  • Tn5053 res transponson Tn5053 Thomson and Ow, Genesis, 2006
  • the recombinase is Cre, Dre, or Flp. More preferably, the recombinase is Cre.
  • recombinase recognition sequence is, in principle, known to the skilled person. Specific recognition sequences for particular recombinases can, e.g., preferably be obtained from the references cited in Table 1. Moreover, the term, preferably, also includes variants of the naturally occurring recombinase recognition sequence, provided that said variants still are functional in mediating excision as specified herein above. Preferably, said variant comprises at least one point mutation relative to the naturally occurring recombinase recognition sequence, more preferably at least one substitution mutation. As will be understood by the skilled person, e.g.
  • a loxP site which is a recombinase recognition sequence for Cre recombinase, comprises a 13 base pair 5' recognition sequence, an 8 base pair spacer region, and a 13 base pair 3' recognition sequence, wherein the 3' recognition sequence is the inverse complement of the 5' recognition sequence. Since the 8 base pair spacer region is less conserved in loxP sequences, mutations are preferably located in said spacer region. However, functional mutations in a 5' or 3' recognition sequence are also known, e.g. from Missirlis et al. (BMC Genomic(2006), 7:73).
  • activity of a variant recombinase recognition sequence in mediating excision is established by incubating 1 ⁇ g of a linearized polynucleotide comprising two of said variant recombinase recognition sequences at a distance of 500 base pairs for 12 hours at 37°C with 1 unit of recombinase, wherein 1 unit of recombinase is the amount of recombinase polypeptide mediating excision of at least 20% of a corresponding wild type loxP-flanked polynucleotide within 1 hour under the same conditions.
  • a variant recombinase recognition sequence is considered active if at least 10 % of fragments flanked by said variant recombinase recognition sequence were excised in said assay ("excision assay").
  • activity of a variant recombinase recognition sequence in mediating inversion is established by incubating 1 ⁇ g of a linearized polynucleotide comprising two of said variant recombinase recognition sequences at a distance of 500 base pairs for 12 hours at 37°C with 1 unit of recombinase, wherein 1 unit of recombinase is the amount of recombinase polypeptide mediating excision of at least 20% of a corresponding wild type loxP-flanked polynucleotide within 1 hour under the same conditions.
  • a variant recombinase recognition sequence is considered active if in at least 10 % of plasmids thus treated, the fragment flanked by said variant recombinase recognition sequences is inverted in said assay ("inversion assay"), wherein inversion is preferably tested for by digesting said plasmid with one or more appropriate restriction enzyme(s).
  • catalysis of both inversion and excision is, in general, a property of the recombinase, not of the recombinase recognition sequence; accordingly, preferably, either the excision assay or the inversion assay is performed with respect to a given recombinase recognition sequence, and catalysis of both reaction types, or not, on said recombinase recognition sequence is deduced from the result of said assay.
  • the expression "recognition sequence of a recombinase” relates to a recombinase recognition sequence corresponding to said recombinase, i.e., to a recombinase recognition sequence recognized, preferably specifically recognized, by said recombinase.
  • each recombinase recognition sequence is separated from its neighboring recombinase recognition sequence or recombinase recognition sequences by at least 10, 25, or 50, more preferably at least 82, even more preferably at least 94, most preferably at least 178 nucleotides.
  • the recombinase recognition sequences comprised in the barcoding polynucleotide of the present invention are recombinase recognition sequences of the same recombinase.
  • a recombinase may recognize recognition sequences differing in sequence by one or more nucleotide exchanges; e.g. Cre recombinase recognizes sequences corresponding to the generic consensus of SEQ ID NO:34.
  • a specific recombinase may be compatible, i.e.
  • the barcoding polynucleotide may comprise various recombinase recognition sequences of a recombinase, varying in nucleic acid sequence, wherein, preferably, said recombinase recognition sequences with different sequences are incompatible with each other.
  • the recombinase recognition sequences comprised in the barcoding polynucleotide of the present invention are mutually compatible recombinase recognition sequences, wherein, preferably, a first and a second recombinase recognition sequence are mutually compatible if they are active in the aforesaid excision assay, in which the fragment to be excised is flanked by the first and second recombinase recognition sequence.
  • all recombinase recognition sequences comprised in the barcoding polynucleotide of the present invention are identical.
  • the recombinase recognition sequence comprises the sequence of SEQ ID NO: 34; more preferably, the recombinase recognition sequence comprises the sequence of SEQ ID NO: 35.
  • core structure relates to a polynucleotide comprising two uniquely identifiable sequences as specified herein above, separated by a recombinase recognition sequence as specified herein above.
  • the core structure comprises exactly one recombinase recognition sequence per recombinase, i.e. preferably, per recombinase, one recombinase recognition sequence intervenes between the two uniquely identifiable sequences.
  • the core structure comprises exactly one recombinase recognition sequence.
  • the one recombinase recognition sequence comprised in said core structure is a Cre recombinase recognition sequence, more preferably comprising or having the sequence of SEQ ID NO: 34, more preferably of SEQ ID NO: 35.
  • the barcoding polynucleotide of the present invention comprises at least five of the aforesaid core structures.
  • the barcoding polynucleotide of the present invention comprises at least seven, more preferably at least nine, most preferably at least 10 of the aforesaid core structures.
  • the upper limit of core structures is essentially only determined by practical aspects, e.g. size of the barcoding polynucleotide, and/or stability of the barcoding polynucleotide in cloning procedures and during amplification in, e.g. a plasmid.
  • the diversity detectable from the products of the barcoding polynucleotide of the present invention depends on the method of detection. Accordingly, the barcoding polynucleotide of the present invention, preferably, comprises at most 250 core structures, more preferably at most 50 core structures, even more preferably at most 20, most preferably at most 15 core structures, preferably in case the barcoding polynucleotide shall be used in a method comprising cleaving of the product(s) of recombination with a restriction enzyme.
  • the barcoding polynucleotide of the present invention comprises at most 100 core structures, more preferably at most 25 core structures, even more preferably at most 17, most preferably at most 15 core structures, preferably in case the barcoding polynucleotide shall be used in a method comprising sequencing the product(s) of recombination on the whole (in toto).
  • the orientation of the recombinase recognition sequence in at least one core structure is inverted relative to the orientation of the neighboring recombinase recognition sequences or the neighboring recombinase recognition sequence.
  • the orientation of the recombinase recognition sequence in at least two core structures is inverted relative to the orientation of the neighboring recombinase recognition sequences or the neighboring recombinase recognition sequence.
  • the orientation of each recombinase recognition sequence is inverted relative to the orientation of the neighboring recombinase recognition sequences or the neighboring recombinase recognition sequence; i.e., preferably, the orientations of the recombinase recognition sequences are alternating.
  • the recombinase recognition sequence comprised in at least one core structure as determined starting from the starting nucleotide of the 5' uniquely identifiable sequence is shifted by at least one nucleotide as compared to at least one further core structure comprised the barcoding polynucleotide of the invention. More preferably, the recombinase recognition sequence of each core structure of the barcoding polynucleotide of the invention as determined starting from the starting nucleotide of the 5' uniquely identifiable sequence is shifted by at least one nucleotide as compared to any further core structure comprised in said barcoding polynucleotide.
  • the barcoding polynucleotide of the present invention may comprise additional sequence elements, e.g. preferably, one or more restriction enzyme recognition sequences and/or sequencing primer annealing sites as specified elsewhere herein.
  • the restriction enzyme recognition sequences are spaced such that sequencing of the complete fragments arising from cleaving the barcoding polynucleotide with an appropriate restriction enzyme is feasible with the selected sequencing method; more preferably, the restriction enzyme recognition sequences are arranged between the core structures of the barcoding polynucleotide; most preferably, all core structures of the barcoding polynucleotide are flanked by at least one restriction enzyme recognition site on both sides.
  • the sequencing primer annealing sites are spaced such that sequencing of the complete barcoding polynucleotide is feasible with the selected sequencing method; more preferably, the sequencing primer annealing sites are arranged between the core structures of the barcoding polynucleotide; most preferably, all core structures of the barcoding polynucleotide are flanked by a sequencing primer annealing site.
  • the core structures are separated by at most 1000 base pairs, more preferably at most 100 base pairs in the barcoding polynucleotide of the invention. Most preferably, the core structures are directly linked in the barcoding polynucleotide of the invention.
  • restriction enzyme and "restriction enzyme recognition sequence” are known to the skilled person.
  • the restriction enzyme is a Type II or Type III restriction enzyme. More preferably, the restriction enzyme is a Type IIS restriction enzyme, i.e. a restriction enzyme cleaving DNA at a defined distance outside of its own recognition sequence, but, preferably, not more than 50 base pairs away from said recognition sequence.
  • Type IIS restriction enzymes are also known to the skilled person as "outside cutters”. More preferably, the restriction enzyme is Bsgl and/or BciVI.
  • a restriction enzyme recognition sequence may be present at least once in a barcoding polynucleotide of the present invention.
  • the restriction enzyme recognition sequence is present at least once in each core structure of the barcoding polynucleotide of the present invention. More preferably, the restriction enzyme recognition sequence is present at essentially the same location relative to the other elements in each core structure of the barcoding polynucleotide. Still more preferably, each core structure is flanked at its 5' side and at its 3' side by at least one restriction enzyme recognition sequence. Most preferably, each core structure is flanked at its 5' side and at its 3' side by two restriction enzyme recognition sequences.
  • the restriction enzyme recognition sequence is arranged such that the restriction enzyme cleavage site lies between the restriction enzyme recognition sequence and the most proximal uniquely identifiable sequence.
  • two restriction enzyme recognition sequences are positioned next to each other and oriented such that the restriction enzyme recognition sequences are located in between the restriction enzyme cleavage sites; i.e., preferably, the restriction enzyme recognition sequence of the second restriction enzyme lies between the restriction enzyme cleavage site and the restriction enzyme recognition sequence of the first enzyme, and the restriction enzyme recognition sequence of the first restriction enzyme lies between the restriction enzyme cleavage site and the restriction enzyme recognition sequence of the second enzyme.
  • Said latter arrangement is particularly advantageous in certain applications, since by cleaving with both restriction enzymes, either simultaneously or consecutively, the restriction enzyme recognition sequences can be removed from the core structures.
  • a restriction enzyme recognition sequence or restriction enzyme recognition sequences are, preferably, present only once per core structure border.
  • the restriction enzyme recognition site is a restriction enzyme recognition site not present in any core element of the barcoding polynucleotide of present invention; more preferably, the restriction enzyme recognition site is a restriction enzyme recognition site not naturally present in the barcoding polynucleotide of the present invention.
  • each core element comprises a sequencing primer annealing site. More preferably, only the first and the last core element of a consecutive series of core elements comprise a sequencing primer annealing site.
  • the sequencing primer annealing sites are oriented such that amplification and/or sequencing of the core elements comprised between said sequencing primer annealing sites is possible.
  • the sequencing primer annealing site comprised in the first core structure and the sequencing primer annealing site comprised in the last core structure are, preferably, non-identical.
  • recombinases like Cre do not act processively, but randomly excise and/or invert DNA fragments flanked by appropriate recognition sites from polynucleotides comprising a multitude of recognition sites. Accordingly, appropriate constructs can be used for generating random combinations of barcodes, which can be used to uniquely label cells. Moreover, it was found that, after removal of the recombinase, the random barcode combinations are stable and are suitable to track the progeny of labeled cells.
  • the present invention further relates to a vector comprising a barcoding polynucleotide according to the present invention.
  • vector encompasses phage, plasmid, and viral (including, e.g. retroviral) vectors, as well as artificial chromosomes, such as bacterial, yeast, and mammalian artificial chromosomes. Moreover, the term also relates to targeting constructs which allow for random or site-directed integration of the targeting construct into genomic DNA. Such targeting constructs, preferably, comprise DNA of sufficient length for either homologous or heterologous recombination.
  • the vector encompassing the barcoding polynucleotide of the present invention preferably, further comprises a selectable marker for propagation and/or selection in a host cell. The vector may be incorporated into a host cell by various techniques well known in the art.
  • a plasmid vector can be introduced in a precipitate such as a calcium phosphate precipitate or rubidium chloride precipitate, or in a complex with a charged lipid or in carbon-based clusters, such as fullerenes.
  • a plasmid vector may be introduced by heat shock or electroporation techniques.
  • the vector may be packaged in vitro using an appropriate packaging cell line prior to administration to host cells. Suitable vectors are known in the art.
  • the vector is a plasmid vector
  • said plasmid has an origin of replication providing for medium or, preferably, medium copy number in a bacterial host cell.
  • the vector is a gene transfer or targeting vector.
  • the vector is a vector comprising or consisting of the nucleotide sequence of SEQ ID NO: 47 (PolyloxP2.0_Rosa26 targeting vector, pWP-AG).
  • the present invention further relates to a host cell comprising a barcoding polynucleotide according to the present invention and/or a vector according to the present invention.
  • the term "host cell”, as used herein, relates to any bacterial, archeal, or eukaryotic cell.
  • the cell is bacterial cell, more preferably an Escherichia cell (e.g. E. coli) or a Bacillus cell (e.g. B. subtilis).
  • the host cell is a eukaryotic cell; more preferably, the eukaryotic cell is a fungal cell, most preferably a yeast cell, e.g. a cell of Saccharomyces cerevisiae.
  • the eukaryotic cell is a mammalian cell, more preferably a cell of an experimental animal, e.g.
  • the host cell is a human cell.
  • the cell is a proliferation competent cell; more preferably, the cell is a cultured cell, preferably a cell line, more preferably a tumor cell line. It is, however, also envisaged that the host cell is a primary cell, preferably a primary tumor cell. Even more preferably, the host cell is a stem cell, most preferably an embryonic or somatic stem cell.
  • the host cell is a human embryonic stem cell, said human embryonic stem cell was obtained by blastocyst biopsy.
  • the host cell is not a human embryonic stem cell.
  • the present invention relates to a host organism comprising a barcoding polynucleotide according to the present invention, a vector according to the present invention, and/or a host cell according to the present invention.
  • the term "host organism” relates to a multicellular organism, preferably an experimental animal.
  • Preferred experimental animals are: insects, preferably of the genus Drosophila, more preferably D. melanogaster; nematodes, preferably of the genus Caenorhabditis, more preferably C. elegans; Amphibia, preferably of the genus Xenopus, more preferably X. laevis; fishes, preferably of the genus Danio, more preferably D. rerio; mammals, of which dogs, cats, horses, sheep, goats, and cattle are preferred. More preferably, the experimental animal is a rat, mouse, guinea pig, pig, or hamster. Preferably, the host organism is non-human.
  • the present invention also relates to a method of labeling a host cell, comprising introducing a barcoding polynucleotide according to the present invention into said host cell, contacting said barcoding polynucleotide with a corresponding recombinase in said host cell, and thereby labeling a host cell.
  • the method of labeling a host cell of the present invention preferably, is an in vitro method. It is, however, understood by the skilled person, that the method may also be performed on a cell comprised in an organism; i.e. the method of labeling a host cell may also be an in vivo method. Moreover, the method may comprise steps in addition to those explicitly mentioned above. For example, further steps may relate, e.g., to detecting the combination of uniquely identifiable sequences after propagation of the cell. Moreover, one or more of said steps may be performed by automated equipment.
  • introducing a barcoding polynucleotide is stably introducing said barcoding polynucleotide, i.e. preferably, eliciting integration of said barcoding polynucleotide into the genome of the host cell, preferably by homologous recombination or other integration mechanisms known to the skilled person.
  • the barcoding polynucleotide of the present invention is preferably flanked by sequences of sufficient length homologous to the intended insertion site in case insertion by homologous recombination is to be obtained.
  • the recombinase recognition sequence comprised in said barcoding polynucleotide is a loxP sequence as specified elsewhere herein and wherein said corresponding recombinase is Cre as specified elsewhere herein.
  • Methods for contacting a barcoding polynucleotide with a recombinase in a host cell are known to the skilled person.
  • the recombinase protein is introduced into the host cell.
  • a polynucleotide comprising an expression construct for said recombinase is introduced into the host cell.
  • said contacting a barcoding polynucleotide with a recombinase in a host cell comprises transient transfection of an expression construct for said recombinase into the host cell.
  • said contacting a barcoding polynucleotide with a recombinase in a host cell comprises stably introducing a regulable expression construct for said recombinase, or comprises stably introducing an expression construct for a regulable variant of said recombinase.
  • stable introduction is, preferably, achieved by transfecting said expression construct into a host cell under conditions which allow for integration of the expression construct into the genome of the host cell. It is, however, also envisaged by the present invention, that the expression construct is present in the genome of an experimental animal and the expression construct is introduced into said host cell by cross-breeding.
  • a first experimental animal stably carrying the barcoding polynucleotide of the present invention and a second experimental animal stably carrying the, preferably regulable, expression construct for the recombinase, or the expression construct for the regulable variant of the recombinase are crossed.
  • the barcoding polynucleotide is contacted with the recombinase at least once each on at least two or, more preferably, at least three, preferably consecutive, days.
  • contacting a barcoding polynucleotide with a recombinase in a host cell comprises providing recombinase activity within the host cell for at least three, more preferably at least 6, most preferably, at least 12 hours.
  • contacting a barcoding polynucleotide with a recombinase in a host cell comprises providing recombinase activity within the host cell for a period of from at most seven days, preferably at most five days, still more preferably at most four days, most preferably, at most three days.
  • contacting a barcoding polynucleotide with a recombinase in a host cell comprises providing recombinase activity within the host cell for of from three hours to seven days, more preferably of from six hours to five days, still more preferably of from twelve hours to four days, most preferably of from one day to three days.
  • the aforesaid values may require adjustment depending on the activity of the recombinase within the specific host cell; and adjustment can be accomplished according to the methods as provided in the examples elsewhere herein.
  • Methods of providing a regulable expression construct are known to the skilled person and, preferably, comprise cloning of a gene encoding a polypeptide comprising the recombinase in appropriate orientation downstream of a regulable promotor.
  • Regulable promotors are well known in the art and include, e.g. tetracyclin-inducible or tetracyclin-repressible promoters.
  • Methods for obtaining regulable variants of the recombinase are also known in the art.
  • the regulable variant is fusion polypeptide comprising the recombinase and a polypeptide inhibiting recombinase activity and/or preventing the recombinase from contacting its substrate DNA.
  • the regulable variant of the recombinase comprises a polypeptide preventing the recombinase from accessing the nucleus in a eukaryotic, preferably mammalian, host cell. Even more preferably, the regulable variant of the recombinase comprises an estrogen receptor polypeptide, most preferably an estrogen receptor polypeptide as specified herein in the Examples.
  • the barcoding polynucleotide introduced in the method further comprises restriction enzyme recognition sequences as specified herein above, and the method further comprises cleaving said barcoding polynucleotide at said restriction enzyme recognition sequences prior to sequencing the uniquely identifiable sequences.
  • the present invention relates to a method of identifying a host cell, comprising labeling said cell according to the method of labeling a host cell of the present invention and further comprising sequencing uniquely identifiable sequences comprised in said barcoding polynucleotide after contacting said barcoding polynucleotide with said recombinase.
  • a barcoding polynucleotide of the present invention by incubating a barcoding polynucleotide of the present invention with a corresponding recombinase, a vast diversity of sequences is generated, comprising random deletions and inversions of sequences intervening two recombinase recognition sequences, of which each, if the number of core elements is selected appropriately, is unique for a specific cell, even if all cells of an organism are barcoded by the method of the present invention. Accordingly, the polynucleotide remaining from the barcoding polynucleotide of the present invention after recombination is also referred to as "barcoded polynucleotide".
  • the diversity created lies with the specific barcodes remaining or newly created, as well as the relative arrangement of the barcodes to each other. Accordingly, a method comprising sequencing the barcoded polynucleotide as such (on the whole) will preserve the complete complexity, whereas a method comprising cleaving the barcoded polynucleotide with a restriction enzyme may only reveal complexity as far as the barcodes remaining or newly created are concerned, but not their relative arrangement. Accordingly, if high complexity is required, a method comprising sequencing the barcoded polynucleotide on the whole (in toto) is preferred.
  • a method comprising cleaving the barcoded polynucleotide with a restriction enzyme is technically less demanding and amenable to current high throughput sequencing equipment.
  • a barcoded polynucleotide comprising restriction enzyme recognition sequences may also be used for single molecule sequencing.
  • the barcoded polynucleotide remaining after recombinase action is sequenced in its entirety, preferably by long-range sequencing over the whole polynucleotide, or, also preferably, by sequencing overlapping fragments preferably using, e.g. uniquely identifiable sequences as sequencing primer binding sites, or by providing dedicated sequencing primer binding sites within the barcoding polynucleotide of the invention.
  • restriction enzyme recognition sites intervening the core sequences are included in the barcoding polynucleotide of the invention as specified herein above and, after isolating DNA comprising the barcode, said DNA is cleaved by a restriction enzyme active on said restriction enzyme recognition sequences.
  • sequencing adapters comprising primer binding sites are ligated to the fragments generated by restriction enzyme digestion and fragments are sequenced, more preferably by a high-throughput method.
  • the method of labeling a host cell of the present invention enables several methods requiring barcoding of cells:
  • the present invention relates to a method for providing an experimental animal comprising a labeled population of cells, comprising labeling at least one host cell in said experimental animal, preferably a stem cell, more preferably a non-embryonic stem cell, according to the method of labeling a host cell of the present invention and keeping said experimental animal under conditions allowing proliferation of said host cell, thereby providing an experimental animal comprising a labeled population of cells.
  • labeling at least one host cell in an experimental animal comprises introducing a barcoding polynucleotide of the present invention into a cell of said experimental animal, e.g., preferably, by transfection or infection with a viral vector.
  • labeling at least one host cell in an experimental animal comprises introducing a barcoding polynucleotide of the present invention into a cultured cell, preferably a cultured cell capable of proliferating in an experimental animal, and introducing said host cell into said experimental animal.
  • a barcoding polynucleotide of the present invention into a cultured cell, preferably a cultured cell capable of proliferating in an experimental animal, and introducing said host cell into said experimental animal.
  • at least ten, more preferably at least 1000, most preferably at least a million, host cells are labeled. It is, however, also envisaged that all cells of a specific tissue or organ or that all cells of a host organism are labeled.
  • the present invention also relates to a method for barcoding host cells to study population dynamics in cell populations or in bacteria in vivo or, preferably, in vitro, comprising the method of labeling a host cell of the present invention.
  • host cells are labeled and population growth dynamics is analyzed, preferably under various conditions like, preferably antibiotic treatment, niche competition, or limited nutrients.
  • the present invention also relates to a method for barcoding of cells and tissues in mice.
  • hematopoietic stem cells HSCs
  • breeding knockin mice comprising the barcoding polynucleotide of the present invention, preferably stably integrated into the genome of at least one cell, with mice comprising a, preferably inducible, recombinase expression sequence.
  • said inducible recombinase expression sequence comprises an HSC specific promoter (e.g. Tie2).
  • said method is used to study clonal dynamics of native hematopoiesis and lineage relationships in diverse cell compartments (i.e., preferably, T- and B-cells, macrophages, granulocytes, dendritic cells etc.). Furthermore, using said method, development and maintenance of essentially any organ for which recombinase, preferably, Cre-specific inducible drivers exist can be examined. In addition to these studies aiming at developing tissue development maps under physiological conditions, the method can also be applied to address population dynamics of body cells during pathological conditions, e.g. inflammation, degeneration, regeneration, aging or functional adaptation.
  • pathological conditions e.g. inflammation, degeneration, regeneration, aging or functional adaptation.
  • the present invention relates to a method for barcoding for studies of cancer development, progression and metastasis.
  • Models of cancer development are known to the skilled person, e.g. using switching on oncogenes in a tissue- and time-defined manner.
  • mice bearing the barcoding polynucleotide of the present invention together with an inducible recombinase preferably under the control of tumor specific markers, tumor cells are tagged initially by the barcodes.
  • the population dynamics of tumor development, progression and/or metastasis is studied.
  • e.g. relationships of metastasis- forming cells are established.
  • the present invention also relates to a kit comprising the barcoding polynucleotide according to the present invention, the vector according to the present invention, the host cell according to the present invention, or the experimental animal according to the present invention; and (a) a corresponding recombinase (i) mediating excision or (ii) mediating excision and inversion of a polynucleotide flanked by recognition sequences comprised in said barcoding polynucleotide; and/or (b) a polynucleotide encoding the corresponding recombinase of (a).
  • the present invention relates to a kit comprising (a) the barcoding polynucleotide according to the present invention or the vector according to the present invention; and (b) means for introducing said barcoding polynucleotide or vector into a host cell.
  • kit refers to a collection of the aforementioned components.
  • said components are combined with additional components, preferably within an outer container.
  • the outer container also preferably, comprises instructions for carrying out a method of the present invention. Examples for such components of the kit as well as methods for their use have been given in this specification.
  • the kit preferably, contains the aforementioned components in a ready-to-use formulation.
  • the kit may additionally comprise instructions, e.g., a user's manual for applying the components with respect to the applications provided by the methods of the present invention. Details are to be found elsewhere in this specification. Additionally, such user's manual may provide instructions about correctly using the components of the kit.
  • a user's manual may be provided in paper or electronic form, e.g., stored on CD or CD ROM.
  • the present invention also relates to the use of said kit in any of the methods according to the present invention.
  • the present invention further relates to a host cell, obtained or obtainable by the method of labeling a host cell of the present invention.
  • the present invention also relates to an experimental animal, obtained or obtainable by a method comprising the method of labeling a host cell of the present invention or by the method for providing an experimental animal comprising a labeled population of cells of the present invention.
  • the present invention relates to the use of a barcoding polynucleotide according to the present invention and/or the vector according to the present invention for labeling a host cell.
  • a barcoding polynucleotide comprising at least five, preferably at least ten core structures, wherein each of said core structures comprises two uniquely identifiable sequences separated by a recombinase recognition sequence, wherein said recombinase is a recombinase capable of (i) mediating excision or (ii) mediating excision or inversion of a polynucleotide flanked by two recognition sequences of said recombinase.
  • each of said uniquely identifiable sequences differs from all other uniquely identifiable sequences present in said barcoding polynucleotide by at least one deletion, insertion, or, preferably, substitution.
  • each of said core structures is flanked at its 5' side and at its 3' side by at least one, preferably two, restriction enzyme recognition sequence(s).
  • restriction enzyme recognition sequence is or wherein said restriction enzyme recognition sequences are identical for all core structures comprised in said barcoding polynucleotide.
  • said restriction enzyme recognition sequences are recognition sequences of type IIS restriction enzymes, preferably Bsgl and/or BciVI restriction enzyme recognition sequences.
  • each of said recombinase recognition sequences is separated from any neighboring recombinase recognition site or recombinase recognition sites by at least 10, 25, or 50, preferably at least 82, more preferably at least 94 nucleotides.
  • a vector comprising a barcoding polynucleotide according to any one of embodiments 1 to 15.
  • a host cell comprising a barcoding polynucleotide according to any one of
  • a host organism comprising a barcoding polynucleotide according to any one of embodiments 1 to 15, a vector according to embodiment 16, or a host cell according to embodiment 17.
  • a method of labeling a host cell comprising introducing a barcoding polynucleotide according to any one of embodiments 1 to 15 into said host cell, contacting said barcoding polynucleotide with a corresponding recombinase in said cell, and thereby labeling a host cell.
  • polynucleotide is contacted with said recombinase at least once each on at least two or, preferably, at least three, preferably consecutive, days. 25.
  • a method of identifying a host cell comprising labeling said cell according to the method of any one of embodiments 19 to 23 and further comprising sequencing uniquely identifiable sequences comprised in said barcoding polynucleotide after said barcoding polynucleotide was contacted with said recombinase.
  • a method for providing an experimental animal comprising a labeled population of cells comprising labeling at least one host cell in said experimental animal, preferably a stem cell, more preferably a non-embryonic stem cell, according to the method of any one of embodiments 19 to 24 and keeping said experimental animal under conditions allowing proliferation of said host cell, thereby providing an experimental animal comprising a labeled population of cells.
  • a kit comprising the barcoding polynucleotide according to any one of embodiments 1 to 15, the vector of embodiment 16, the host cell of embodiment 17, or the experimental animal of embodiment 18; and
  • kits comprising (a) the barcoding polynucleotide according to any one of embodiments 1 to 15 or the vector according to embodiment 16; and (b) means for introducing said barcoding polynucleotide or vector into a host cell.
  • a labeled host cell obtained or obtainable by the method of any one of embodiments 19 to 24.
  • An experimental animal obtained or obtainable by a method comprising the method of any one or embodiments 19 to 24 or by the method of any one of embodiments 27 to 29.
  • a method for obtaining a barcoded polynucleotide comprising contacting a barcoding polynucleotide according to any one of embodiments 1 to 15 or a vector according to embodiment 16 with a corresponding recombinase.
  • a barcoded polynucleotide comprising at least two, preferably at least four, more preferably at least six, even more preferably at least eight, most preferably at least ten barcode sequences, obtained or obtainable by the method according to any one of embodiments 35 to 37.
  • a cell comprising a barcoded polynucleotide at least two, preferably at least four, more preferably at least six, even more preferably at least eight, most preferably at least ten barcode sequences, obtained or obtainable by the method according to any one of
  • Triangles represent loxP recombination recognition sequences. Uniquely identifiable sequences are numbered from 1 to 20.
  • Two examples for Cre-mediated recombination are depicted. Cre recombinase randomly selects two loxP recognition sites and the DNA sequence in between is either deleted (e.g. in recombination between the first and the third loxP site) or inverted (e.g. in recombination between the first and the second loxP site), depending on whether the selected loxP sites were in parallel or opposite orientation, respectively. As a result, new arrangements of the numbered uniquely identifiable sequences are formed.
  • the dotted lines indicate specific restriction sites (Bsgl/BciVI) for the fragmentation of the barcoding polynucleotide into barcodes for "next generation sequencing”.
  • Reporter mice are obtained by combining an allele of the PolyloxP2.0 barcoding cassette and a tissue specific, inducible Cre recombinase gene.
  • the promoter controlling Cre expression determines which tissue or type of cells will initially be barcoded. Treatment with tamoxifen enables the translocation of Cre into the nucleus, leading to the random recombination of the barcoding cassette in the cell population of interest.
  • the specific barcodes of these cells will be inherited to their daughter cells and descending lineages. Different cell populations are isolated e.g. by FACS sorting and the genomic DNA is extracted.
  • the barcoded cassette is amplified by PCR. The occurrence of Cre-mediated deletion events can be visualized by gel electrophoresis.
  • PCR products from individual cells are analyzed by traditional Sanger sequencing or the pool of sequences is subjected to third generation sequencing e.g. on the PacBio platform.
  • the barcoded polynucleotides are digested into barcode sequences and then analyzed for recombination pairs of unique DNA elements via Illumina sequencing (e.g. 250 bp paired-end MiSeq analysis).
  • FIG. 3 Gene targeting in ES cells and generation of ROSA26(PolyloxP2.0) knockin mice.
  • SA short arm of targeting vector (1.08kb).
  • LA long arm of targeting vector (4.32 kb).
  • Neo neomycin resistance gene for positive selection.
  • PGK-DTA diphtheria toxin subunit A with PGK promoter for negative selection.
  • Genotying of the offspring from chimeric mice for detecting germline transmission of the PolyloxP2.0 cassette Genomic DNA from PolyloxP2.0-targeted ES cells and Rosa26RFP knock-in mice were used as PCR controls. The arrow indicates the lane with the PCR positive sample from the founder animal of the B6.ROSA26(PolyloxP2.0) mouse line.
  • Figure 4 In vitro recombination of PolyloxP2.0 on plasmid DNA.
  • B) Cre-mediated PolyloxP2.0 recombination products were separated by gel electrophoresis. The five different size fragments that are obtained by Cre-mediated excision are boxed.
  • C) Recombination products were amplified by PCR, digested into barcodes and sequenced. The obtained frequencies of individual barcodes relative to the total number of barcodes sequenced are depicted in a heatmap. Each row is representing an individual barcode and different frequencies are coded by different gray scales.
  • 'WT' lists the ten different unrecombined barcodes and 'Recombination' the 90 recombined barcodes. The overall frequencies of recombined or unrecombined barcodes are depicted in the table below.
  • Figure 5 In vitro recombination of PolyloxP2.0 on genomic DNA.
  • Genomic DNA was extracted from PolyloxP2.0-targeted ES cells and incubated with Cre recombinase, in vitro. Recombination products were amplified by PCR, separated by gel electrophoresis, and digested barcodes were sequenced.
  • C) PCR-amplified recombination products were digested into barcodes and sequenced.
  • the obtained frequencies of individual barcodes relative to the total number of barcodes sequenced are depicted in a heatmap. Each row is representing an individual barcode and different frequencies are coded by different gray scales.
  • 'WT' lists the ten different unrecombined barcodes and 'Recombination' the 90 recombined barcodes. The overall frequencies of recombined or unrecombined barcodes are depicted in the table below.
  • Figure 6 Tamoxifen-induced recombination of the PolyloxP2.0 cassette in ES cells.
  • the barcoded PolyloxP2.0 cassette was amplified by PCR and products were separated by gel electrophoresis. Water (H 2 0) and wild-type ES cell DNA (El 4) were used as PCR negative control. Genomic DNA from PolyloxP2.0 ES cells treated with Cre in vitro served as PCR positive control. Negative control for the 4-OHT treatment were vehicle (EtOH) treated MerCreMer-transfected PolyloxP2.0 ES cells. C) Heat map of the barcode distribution in 4-OHT treated ES cells.
  • Figure 7 Pulse-chase recombination in MerCreMer-transfected PolyloxP2.0 ES cells.
  • ES cells were induced for 3 hours with a single pulse of 100 nM 4-OHT, then washed and genomic DNA was prepared from aliquots of the cells at the indicated time points for PCR amplification of the PolyloxP2.0 cassette. Recombination could already be detected by PCR after 3 hours of incubation (0 days) and was even more pronounced after 3 days of chase.
  • Figure 8 Full-length sequencing of the recombined PolyloxP2.0 cassette (barcoded polynucleotide) from single cells.
  • Figure 10 Barcode probability in PCR sampling controls
  • each data point represents one barcode and its abundance in either of two PCR samples and is plotted on its respective axis.
  • Unrecombined "WT" barcodes are depicted as open squares, recombined barcodes are shown as closed black circles.
  • Figure 11 Barcode probability plots from different samples.
  • each data point represents one barcode and its abundance in either of two PCR samples and is plotted on its respective axis.
  • Unrecombined "WT" barcodes are depicted as open squares, recombined barcodes are shown as closed black circles.
  • Figure 12 Plasmid map of the PolyloxP2.0_Rosa26 targeting vector.
  • the plasmid has the sequence of SEQ ID NO: 47.
  • Figure 13 PolyloxP2.0 full-length read coding.
  • terminology of the unique DNA barcode elements used in restriction digest-based Illumina sequencing experiments (upper row, numbers 1-20 in boxes) used in Figures 1-11 is compared to the simplified terminology (lower row, letters A-I in boxes) used for the DNA segments obtained by full-length read coding (FRC) based PacBio sequencing experiments (Example 8, Table 3).
  • Example 1 Design of a PolyloxP2.0 cassette and cell- free recombination assay on plasmid DNA
  • the PolyloxP2.0 recombination cassette consists of 10 loxP sites with alternating orientation and a fixed distance of 178 bp between each two neighboring loxP sites. This distance was calculated according to the optimal closest distance of 94bp reported by Hoess et al. (1985), Gene 40(2-3):325 plus an extension of eight DNA windings of 10.5bp.
  • the alternating orientation of the loxP sites allows an increased recombination product diversity upon Cre activity compared to an "all-same-direction" construct, because it enables both: excisions and inversions of DNA segments between loxP sites with parallel or opposite directions, respectively.
  • Illumina MiSeq provides a highly efficient sequencing technology, since, with an average of 25 mio. reads per flow cell, it even allows for the simultaneous analysis of several multiplexed sample libraries in a single run. However, with a current maximal read length of 500-600 bp (300 bp paired-end) it cannot provide the analysis of the entire PolyloxP2.0 cassette in one piece. Instead, it requires the analysis of specific fragments of it.
  • Each core fragment has a size of 192 bp and consists of one loxP site as well as one upstream and one downstream uniquely identifiable sequence. As shown in Fig. 1, the combinations of these uniquely identifiable sequences are indicative for specific recombination events.
  • a second feature that was considered for the same technical reason was a positional +1 bp shift of the loxP sites in comparison to the position of the loxP site in the previous core fragment.
  • PolyloxP2.0 targeting vector ( ⁇ g) carrying the PolyloxP2.0 cassette was linearized by SacII and Ascl (NEB), and then incubated with purified Cre recombinase (NEB, M0298L) for overnight at 37°C. The reaction was terminated by heating at 70°C for 10 min. Afterwards the mixture was sequentially digested by Bsgl and BciVI, and the digestion products were purified by gel extraction (QIAquick Gel Extraction Kit, QIAGEN, 28706). Finally, extracted DNA was quantified (Qubit 2.0) and analyzed for its purity (Agilent 2100 Bioanalyzer) before being used for subsequent library preparation and DNA sequencing.
  • genomic DNA was first purified by phenol-chloroform extraction and isopropanol precipitation. Purified genomic DNA was then incubated with Cre recombinase (NEB, M0298L) for overnight at 37°C and the reaction was terminated by heating at 70°C for 10 min. Afterwards, the PolyloxP2.0 cassette (together with some flanking Rosa26 sequence) was amplified by PCR with the Expand Long Template PCR System (Roche, 11759060001).
  • Cre recombinase NEB, M0298L
  • Stepl 5 min 95°C
  • Step2 30s 95°C
  • Step3 30s 62°C
  • Step4 5 min 72°C
  • Forward primer #493 5 '-GC AAGC ACGTTTCCGACTTGAG-3 ' (SEQ ID NO: 36)
  • Reverse primer #2427 5 * -CATACCTTAGAGAAAGCCTGTCGAG-3 * (SEQ ID NO: 37).
  • the PCR products were first purified by PCR purification kit (QIAGEN, 28106), digested by Bsgl and BciVI, and finally the 200 bp "core-fragments" were purified from the digestion mixture by gel extraction (QIAquick Gel Extraction Kit, QIAGEN, 28706).
  • Example 3 Induction and analysis of intracellular recombination
  • the pANMerCreMerpuro expression plasmid carrying the tamoxifen-inducible MerCreMer under the control of human beta-actin promoter (ACTB) was transfected into PolyloxP2.0 targeted ES cells to enable the induction of intracellular recombination.
  • Transfected ES cells were selected with puromycin and screened for stable integration by PCR using the following conditions:
  • Stepl 5 min 95°C; Step2: 30s 95°C, Step3: 30s 54°C, Step4: 4 min 72°C, repeat Steps 2-4 for 35 times, continue to final Step5: 10 min 72°C.
  • PolyloxP2.0 targeted ES cells were used to generate B6-Gt(ROSA)26Sor tmI TM ylox ⁇ Hrr mice (short name Rosa26 PolyloxP/+ ). Those mice were subsequently crossed to Rosa26 Cre_ERT2 mice (B6.129-Gt(ROSA)26Sortml(cre/ESRl)Tyj/J). Mice were injected intraperitoneally with 1 mg tamoxifen once or daily on 4 consecutive days. Stock solution was prepared by dissolving 1 g tamoxifen (Sigma T5648) in 36 mL peanut oil (Sigma P2144) and 4 mL absolute ethanol at 55°C. Afterwards, genomic DNA was prepared from thymus, spleen and lung and amplified for subsequent sequencing as described under "Cell-free recombination assay on genomic DNA”.
  • Example 5 DNA library preparation and multiplexing for Illumina sequencing
  • Sequencing libraries from individual DNA samples (10ng) were prepared using NEBNext High-Fidelity 2X PCR Master Mix (NEB, E6240). End-repair, dA-tailing and adaptor ligation were done according to the manufacturer's standard protocol. Afterwards, adaptor- ligated DNA was directly used for PCR enrichment (10 PCR cycles) and indexing without size selection. Index primers were provided in NEBNext Multiplex for Illumina (NEB E7335). Distinct DNA libraries were normalized to prepare an equimolar multiplex (10 nM) and mixed together with 5% PhiX carrier DNA. Sequencing was performed using Illumina Miseq V2: 250bp paired-end sequencing platform.
  • Example 6 Single Cell sequencing PolyloxP2.0 targeted ES cells carrying inducible Cre (pANMerCreMerpuro transfected clone) were treated with lOOnM 4-hydroxtamoxifen for one day, then washed and cultured for 6 days. Next, single cells were sorted into individual PCR tubes by FACS sorting (FACSAria III, BD Biosciences) and digested by Proteinase K treatment at 55°C for 1 h. The reaction was terminated by heating at 95°C for 10 min and the mixture was directly used for the amplification of PolyloxP2.0 recombination products by nested PCR.
  • FACS sorting FACS sorting
  • Stepl 5 min 95°C; Step2: 30s 95°C, Step3: 30s 56°C, Step4: 5 min 72°C, repeat Steps 2-4 for 35 times, continue to final Step5: 10 min 72°C.
  • Forward primer #2426 5 * -CGACGACACTGCCAAAGATTTC-3 * (SEQ ID NO: 42) and Reverse primer #2427 (SEQ ID NO: 37).
  • PCR products were purified by PCR purification kit (QIAGEN, 28106) and sent to Sanger sequencing using oligos #2426 and #2427 as sequencing primers.
  • Murine embryonic stem (ES) cells clone JM8A3 derived from C57BL/6N (Pettitt et al, Nat Meth. 2009, PMID 19525957), were cultured on neomycin-resistant embryonic fibroblasts in Knockout DMEM (GIBCO) supplemented with 10 % FCS (Hyclone), 2mM GlutaMAX (GIBCO), 1 mM sodium pyruvate (GIBCO), 0.1 mM nonessential amino acids (GIBCO), lOO U/ml penicillin, 100 ⁇ streptomycin (GIBCO), 25 ⁇ 2-mercaptoethanol (GIBCO) and 1000 U/ml LIF (Chemicon).
  • GlutaMAX g., 2mM GlutaMAX
  • GBCO 1 mM sodium pyruvate
  • GIBCO 0.1 mM nonessential amino acids
  • lOO U/ml penicillin 100 ⁇ streptomycin (GIBCO
  • ES cells were electroporated with 30 ⁇ g of the linearized targeting vector (pWP-AG, PolyloxP2.0_Rosa26 targeting vector) at 500 ⁇ , 0.24 kV (Gene Pulser Xcell, BioRad). Starting one day after electroporation, cells were selected by adding 150 ⁇ g/mL Geneticin (G418, GIBCO) to the culture medium.
  • pWP-AG PolyloxP2.0_Rosa26 targeting vector
  • clones were randomly picked, expanded, and screened for correct homologous recombination by PCR using one external forward primer HL16: 5 * -CCTAAAGAAGAGGCTGTGCTTTGG-3 * (SEQ ID NO:43) and a vector specific reverse primer HL15: 5'- AAGACCGCGAAGAGTTTGTCC-3 * (SEQ ID NO:44) (Luche et al. (2007), EJI 37(1):43, PMID: 17171761). Correct homologous recombination was confirmed by Southern blot using a biotinylated 822 bp probe located upstream of the first exon and the short arm of homology.
  • the probe was obtained by PCR amplification (01igo2424: 5'- GCAAAGGCGCCCGATAGAATAA-3 * (SEQ ID NO: 45) and 01igo2425: 5'- CCGGGGGAAAGAAGGGTCAC-3 * (SEQ ID NO: 46) of genomic C57BL/6 DNA and was labeled by random prime labeling (North2South Biotin Random Prime Labeling Kit, Thermo Scientific).
  • Example 8 Exploiting the full combinatorial potential of the PolyloxP2.0 cassette by third generation sequencing.
  • the PolyloxP2.0 cassette can undergo recombination events at multiple sites within the same molecule.
  • the recombined cassette preferably is sequenced on the whole as a single molecule.
  • SMRT® Single Molecule Real Time
  • PacBio RS platform Pacific Biosciences
  • FRCs full-length read codes
  • the nine DNA segments spacing the ten loxP sites are denominated with the capital letters A-I.
  • the DNA sequence before the first loxP site (barcode 1) defines the 5' end of the code
  • the DNA sequence after the last loxP site (barcode 20) defines its 3' end.
  • the affected DNA segments are labeled with their respective small letters a-i.
  • Fig. 13 there are four exemplary FRCs depicted: the unrecombined full-length code 5'-ABCDEFGHI-3' (Fig. l3B), excision of the first two segments 5'-CDEFGHI-3' (Fig.
  • mice were treated with tamoxifen (1 mg i.p.) and bone marrow cells were isolated several weeks after induction. Cell suspensions were stained with antibodies against CD4 (Invitrogen, RM4.5), CD8 (Pharmingen, 53-6.7), CDl lb (eBioscience, Ml/70), CD19 (Pharmingen, 1D3) and Gr-1 (Pharmingen, RB6-8C5).
  • Granulocytes were isolated as Gr-1 + CD1 lb + CD4 " CD8 " CD 19 " cells on a FACSArialll (BD Biosciences) cell sorter. Genomic DNA extraction and PCR amplification from sorted granulocytes were in principle done as described under Example 2 "Cell- free recombination assay on genomic DNA”.
  • Stepl 5 min 95°C
  • Step2 30s 95°C
  • Step3 30s 56°C
  • Step4 5 min 72°C
  • Step4 5 min 72°C
  • Step5 10 min 72°C.
  • Forward primer #2450 5 * -TGTGGTATGGCTGATTATGATCAG-3 * (SEQ ID NO: 40)
  • Reverse primer #2427 5 * -CATACCTTAGAGAAAGCCTGTCGAG-3 * (SEQ ID NO: 37).
  • PCR products were purified and size selected using the Agencourt AMPure PB beads system according to manufacturer's protocol. During this procedure, PCR products were split into a "small fragments” and a "large fragments” fraction by extraction with 0.9x and 0.4x AMPure beads, respectively. Both fractions underwent library preparation procedure according to the manufacturer's protocol (SMRTbell Template Prep Kit, Pacific Biosciences 100-259-100) and were sequenced on the PacBio RS platform ( Pacific Biosciences) using standard protocols. The "small fragments” library was loaded by diffusion mode, the "large fragments” library by MagBead mode. The obtained circular consensus sequence (CCS) reads of both fragment libraries were merged and the PolyloxP2.0 FRC barcode for each of the reads was determined. A summary of the codes detected in the experiment is listed in Table 3.
  • barcode 11 agcttcaccaggctctgaggttcctctttgacgtattgtgatgcagacttcatgatcgattttctgtgtacattaa barcode 12 tgtcgaaattcctacggtgtgatgattcttttgatatacatgtttctgatatgtattcggatagacatgctcttatcg tgccccatctac
  • barcode 15 atatggaagtgttgcatggaaagaccgtatggaagtatggaagaaacaacaaatagaaaagctacaagtcg ttaa
  • PolyloxP2 caacaggaatgaattcgttctcattaacgccgacgacactgccaaagatttctcaattttgttcttttgagaaac .0 w/o aagaatgataacttcgtatagcatacattatacgaagttattatcttctgtttggatcgatctgggtttggagattg restriction aatctgcgttttctaaaacagaagtcttttatggaaatttcacctgctgtcatattgcatgatttatacgtgtcggat sites atgatcaatgagacaaaagttagtgtcagtttcagactcatctccatgcggtttataacttcgtataatgtatgct atacgaagttatgtcagcttgtgtgagtatttaa

Landscapes

  • Chemical & Material Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Organic Chemistry (AREA)
  • Analytical Chemistry (AREA)
  • Zoology (AREA)
  • Wood Science & Technology (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Health & Medical Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Microbiology (AREA)
  • Immunology (AREA)
  • Molecular Biology (AREA)
  • Biotechnology (AREA)
  • Biophysics (AREA)
  • Physics & Mathematics (AREA)
  • Biochemistry (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Genetics & Genomics (AREA)
  • Micro-Organisms Or Cultivation Processes Thereof (AREA)

Abstract

La présente invention concerne un polynucléotide de codage à barres comprenant au moins cinq, de préférence au moins dix, structures centrales, chacune desdites structures centrales comprenant deux séquences identifiables de manière unique, séparées par une séquence de reconnaissance de recombinase, ladite recombinase étant une recombinase capable (i) de conditionner une excision ou (ii) de conditionner l'excision ou l'inversion d'un polynucléotide flanqué de deux séquences de reconnaissance de ladite recombinase; et un vecteur, une cellule hôte, un organisme hôte et un animal expérimental comprenant ledit polynucléotide de codage à barres. La présente invention concerne en outre des kits, des procédés et des utilisations relatives audit polynucléotide de codage à barres. Les moyens et les procédés selon l'invention sont utilisés, en particulier, pour l'attribution de codes à barres de lignées cellulaires ou de cellules au sein d'un organisme.
PCT/EP2016/065932 2015-07-10 2016-07-06 Générateur de codes à barres d'adn aléatoire génétique pour le traçage in vivo de cellules WO2017009126A1 (fr)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
EP15176251.5 2015-07-10
EP15176251 2015-07-10

Publications (1)

Publication Number Publication Date
WO2017009126A1 true WO2017009126A1 (fr) 2017-01-19

Family

ID=53776307

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/EP2016/065932 WO2017009126A1 (fr) 2015-07-10 2016-07-06 Générateur de codes à barres d'adn aléatoire génétique pour le traçage in vivo de cellules

Country Status (1)

Country Link
WO (1) WO2017009126A1 (fr)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113528506A (zh) * 2021-07-09 2021-10-22 天津大学 一种dna反转系统及其应用以及一种目标dna反转方法

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20140272988A1 (en) * 2013-03-14 2014-09-18 Cold Spring Harbor Laboratory Trans-splicing transcriptome profiling

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20140272988A1 (en) * 2013-03-14 2014-09-18 Cold Spring Harbor Laboratory Trans-splicing transcriptome profiling

Non-Patent Citations (7)

* Cited by examiner, † Cited by third party
Title
ALICE GERRITS ET AL: "Cellular barcoding tool for clonal analysis in the hematopoietic system", BLOOD, vol. 115, 1 January 2010 (2010-01-01), pages 2610 - 2618, XP055136898, DOI: 10.1182/blood-2009-06- *
ANTHONY M. ZADOR ET AL: "Sequencing the Connectome", PLOS BIOLOGY, vol. 10, no. 10, 23 October 2012 (2012-10-23), pages e1001411, XP055280511, DOI: 10.1371/journal.pbio.1001411 *
BRANDA C S ET AL: "Talking about a revolution: The impact of site-specific recombinase on genetic analyses in mice", DEVELOPMENTAL CELL, CELL PRESS, CAMBRIDGE, MA, US, vol. 6, no. 1, 1 January 2004 (2004-01-01), pages 7 - 28, XP002994211, ISSN: 1097-4172, DOI: 10.1016/S1534-5807(03)00399-X *
I. D. PEIKON ET AL: "In vivo generation of DNA sequence diversity for cellular barcoding", NUCLEIC ACIDS RESEARCH, vol. 42, no. 16, 15 September 2014 (2014-09-15), GB, pages e127 - e127, XP055304273, ISSN: 0305-1048, DOI: 10.1093/nar/gku604 *
JONATHAN D. POLLOCK ET AL: "Molecular neuroanatomy: a generation of progress", TRENDS IN NEUROSCIENCE., vol. 37, no. 2, 1 February 2014 (2014-02-01), NL, pages 106 - 123, XP055304020, ISSN: 0166-2236, DOI: 10.1016/j.tins.2013.11.001 *
S. D. COLLOMS ET AL: "Rapid metabolic pathway assembly and modification using serine integrase site-specific recombination", NUCLEIC ACIDS RESEARCH, vol. 42, no. 4, 12 November 2013 (2013-11-12), GB, pages e23 - e23, XP055304294, ISSN: 0305-1048, DOI: 10.1093/nar/gkt1101 *
TOM S. WEBER ET AL: "Site-specific recombinatorics: in situ cellular barcoding with the Cre Lox system", BMC SYSTEMS BIOLOGY, vol. 7, no. 3, 30 June 2016 (2016-06-30), pages 33529, XP055304267, DOI: 10.1186/s12918-016-0290-3 *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113528506A (zh) * 2021-07-09 2021-10-22 天津大学 一种dna反转系统及其应用以及一种目标dna反转方法
CN113528506B (zh) * 2021-07-09 2022-12-20 天津大学 一种dna反转系统及其应用以及一种目标dna反转方法

Similar Documents

Publication Publication Date Title
JP7101211B2 (ja) ガイドrnaのペアを使用したターゲティングによる遺伝子改変の方法及び組成物
US10959414B2 (en) Efficient non-meiotic allele introgression
AU2017268458B2 (en) Methods for breaking immunological tolerance using multiple guide RNAS
ES2980050T3 (es) Método para fabricar células eucariotas editadas con ADN
ES2901074T3 (es) Métodos y composiciones para modificaciones genéticas objetivo y métodos de uso
ES3019688T3 (en) Methods and compositions for modifying a targeted locus
Fei et al. Application and optimization of CRISPR–Cas9-mediated genome engineering in axolotl (Ambystoma mexicanum)
ES2699848T3 (es) Acido nucleico CRISPR clase 2 de tipo cruzado modificado que se dirige a ácidos nucleicos
US20150128300A1 (en) Methods and compositions for generating conditional knock-out alleles
CA3001683A1 (fr) Procedes et compositions pour generer des arn guide crispr/cas
US9125385B2 (en) Site-directed integration of transgenes in mammals
JP2019507610A (ja) CRISPR−Casゲノム編集に基づく、Fel d1ノックアウト並びに関連組成物及び方法
US20150064149A1 (en) Materials and methods for correcting recessive mutations in animals
JP2017184639A (ja) Cas9タンパク質を哺乳動物の受精卵に導入する方法
WO2019173248A1 (fr) Acides nucléiques ciblant un acide nucléique modifié
CN106754949A (zh) 猪肌抑素基因编辑位点864‑883及其应用
CA2379055A1 (fr) Vecteur piege et procede de piegeage de genes utilisant ledit vecteur
WO2017009126A1 (fr) Générateur de codes à barres d'adn aléatoire génétique pour le traçage in vivo de cellules
WO2013139994A1 (fr) Nouvelle méthode de production d'un ovocyte portant une séquence cible modifiée au sein de son génome
US20220112509A1 (en) Gene knock-in method, gene knock-in cell fabrication method, gene knock-in cell, malignant transformation risk evaluation method, cancer cell production method, and kit for use in these
US20240100184A1 (en) Methods of precise genome editing by in situ cut and paste (icap)
Tasan New tools for live cell imaging of endogenous loci in mammalian cells
JP2007124915A (ja) リコンビニアリングコンストラクト、及びジーンターゲティングコンストラクト作製用ベクター
JP2016198044A (ja) 細胞の作製方法および該作製方法で作製された細胞
NZ718194B2 (en) Efficient non-meiotic allele introgression

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 16736431

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 16736431

Country of ref document: EP

Kind code of ref document: A1