WO2017150677A1 - 標的遺伝子の塩基配列を決定する方法 - Google Patents
標的遺伝子の塩基配列を決定する方法 Download PDFInfo
- Publication number
- WO2017150677A1 WO2017150677A1 PCT/JP2017/008320 JP2017008320W WO2017150677A1 WO 2017150677 A1 WO2017150677 A1 WO 2017150677A1 JP 2017008320 W JP2017008320 W JP 2017008320W WO 2017150677 A1 WO2017150677 A1 WO 2017150677A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- base sequence
- target gene
- gene
- mrna
- target
- Prior art date
Links
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6869—Methods for sequencing
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K16/00—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q2600/00—Oligonucleotides characterized by their use
- C12Q2600/158—Expression markers
Definitions
- the present invention relates to a method for determining the base sequence of a target gene.
- hybridoma production technology is widely accepted as a means of producing large amounts of monoclonal antibodies and applying them to research and medicine.
- the culture of the hybridoma is continued, the reactivity of the antibody produced by somatic mutation may change.
- 5 ′ Rapid Amplification of cDNA Ends 5 ′ RACE method, degenerate PCR method, and the like have been used to determine the base sequence of an antibody gene (see, for example, Non-Patent Document 1).
- the 5'RACE method requires a large amount of total RNA and may be difficult to implement.
- the original base sequence may be lost due to mishybridization of degenerate primers.
- An object of the present invention is to provide a new technique that can easily and accurately determine the base sequence of a target gene expressed in a target cell.
- the present invention includes the following aspects.
- a method for determining the base sequence of a target gene expressed in a target cell comprising comprehensively determining the base sequence of mRNA in the target cell, and among the determined base sequence of the mRNA, Identifying a base sequence having a partial base sequence of the target gene, wherein the identified base sequence is the base sequence of the target gene.
- the target gene has a rank of 1 to 10 when all genes expressed in the target cell are ranked in descending order of the number of mRNA molecules.
- the target cell is an antibody-producing cell
- the target gene is an antibody heavy chain gene
- a partial base sequence of the target gene is a partial base sequence of a constant region of the antibody heavy chain gene
- the target gene is an antibody light chain gene
- the partial base sequence of the target gene is a partial base sequence of a constant region of the antibody light chain gene
- Experimental Example 1 it is a graph in which transcriptomes are arranged in descending order of expression level.
- A is a figure which shows the base sequence (SEQ ID NO: 28) and the deduced amino acid sequence (SEQ ID NO: 29) of the hybridoma clone HD1 in Experimental Example 2.
- B shows the amino acid sequence of the IgH protein of clone HD1 whose nucleotide sequence was determined in Experimental Example 2, and the amino acid sequence of the constant region of known rat IgH (IgG2b) (accession number: AAA6078, SEQ ID NO: 30).
- FIG. (A) is a figure which shows the base sequence (sequence number 31) and the deduced amino acid sequence (sequence number 32) of Igk of hybridoma clone HD1 in Experimental example 2.
- FIG. (B) is a diagram in which the amino acid sequence of the IgK protein of clone HD1 whose base sequence was determined in Experimental Example 2 and the amino acid sequence of a known constant region of rat IgK (accession number: CAA24558, SEQ ID NO: 33) are aligned. It is.
- (A) is a graph which shows the result of having calculated the reconstruction rate of Igh at the time of carrying out de novo assembly from the lead
- FIG. (B) is a graph showing the result of calculating the reconstruction rate of Igk when de novo assembly is performed from leads of each number of leads in Experimental Example 4.
- (A)-(d) is the figure which put together the base sequence determination method which
- the present invention is a method for determining a base sequence of a target gene expressed in a target cell, comprising comprehensively determining a base sequence of mRNA in the target cell, and the determined mRNA A base sequence having a partial base sequence of the target gene is specified, and the specified base sequence is the base sequence of the target gene.
- the base sequence of the target gene can be determined easily and accurately.
- the method of this embodiment can also be performed using only about 0.1 ⁇ g of total RNA. Therefore, for example, the base sequence of the target gene can be determined at the single cell level using one target cell as a sample.
- the step of comprehensively determining the base sequence of mRNA is preferably performed by next-generation sequencing. More specifically, the method of this embodiment can be effectively carried out by mRNA-seq that comprehensively determines the base sequence of mRNA by next-generation sequencing.
- Next generation sequencing is a term used in contrast to a first generation sequencer represented by a fluorescent capillary sequencer using a sequencing method based on the Sanger method.
- Next-generation sequencing actually includes a variety of devices and technologies, and various forms will be devised in the future.
- the maximum number of samples that can be processed at one time was limited to about 96.
- DNA molecules to be samples for sequencing must be prepared individually by cloning or amplified by the PCR method in advance, and a great deal of labor was required at that stage.
- DNA fragments containing various sequences are applied in parallel by applying amplification techniques such as emulsion PCR and bridge PCR and high-sensitivity detection techniques such as single-molecule observation. Analyze the sequence. For this reason, it is possible to easily determine a large-scale base sequence.
- next-generation sequencers include, for example, MiSeq, HiSeq, NovaSeq (Illumina); Genetic Analyzer V2.0, Ion Proton (Thermo Fisher Scientific); MinION, Promethion (Nanopore).
- a library is prepared according to the method of the next-generation sequencer used.
- the average read length is 50 to 100 bp
- the number of reads is 30,000 to 50
- Sequencing at 000 is sufficient. That is, the number of reads of the base sequence by next-generation sequencing may be 50,000 or less.
- the base sequence of the target gene can be determined even when the number of reads is as small as this.
- the base sequence data obtained by sequencing is assembled (bonded) by any method to obtain a contig.
- contig means a longer base sequence obtained by pasting short leads, for example, a base sequence of assembled full-length mRNA.
- the base sequence of mRNA in the target cell is determined. For example, it can be assembled by a technique such as de novo assembly that does not require a reference sequence.
- a base sequence having a partial base sequence of the target gene is specified.
- the base sequence specified in this way is the base sequence of the target gene.
- the base sequence of the constant region of the antibody heavy chain (Igh), the base sequence of the constant region of the antibody ⁇ light chain (Igl) or antibody ⁇ light chain (Igk) can be used as a partial base sequence. More specifically, for example, the base sequences described in SEQ ID NOs: 11 to 14 can be used as a partial base sequence of the constant region of rat Igh. Further, for example, the base sequences described in SEQ ID NOs: 15 to 16 can be used as a partial base sequence of the constant region of rat Igl. For example, the base sequence set forth in SEQ ID NO: 17 can be used as a partial base sequence of the constant region of rat Igk.
- the base sequences described in SEQ ID NOs: 18 to 22 can be used as a partial base sequence of the constant region of mouse Igh.
- the base sequences described in SEQ ID NOs: 23 to 26 can be used as a partial base sequence of the constant region of mouse Igl.
- the base sequence set forth in SEQ ID NO: 27 can be used as a partial base sequence of the constant region of mouse Igk.
- the base sequence of the target gene can be extracted more efficiently.
- the amino acid sequence of Igh includes 400 or more amino acid residues. Therefore, a contig having a base sequence having a length of 1200 bp or more necessary for encoding this amino acid sequence may be specified. Thereby, contigs of the full-length target gene can be efficiently extracted.
- the target gene for determining the base sequence is a gene having a rank of 1 to 10 when all the genes expressed in the target cell are ranked in descending order of the number of mRNA molecules. Is preferred. A target gene having a large number of mRNA molecules in the above range can easily determine the base sequence.
- the expression level of the target gene is preferably 5,000 FPKM (fragments per kilobase of exon per million mapped fragments) or more.
- the target gene having such an expression level can easily determine the base sequence.
- the upper limit of the expression level of the target gene is not particularly limited, but generally the upper limit is often about 30,000 FPKM.
- target genes include, but are not limited to, antibody genes, T cell receptor genes, B cell receptor genes, and the like.
- the target cell may be an antibody-producing cell
- the target gene may be an antibody heavy chain gene
- the partial base sequence of the target gene may be a partial base sequence of the constant region of the antibody heavy chain gene.
- the target cell may be an antibody-producing cell
- the target gene may be an antibody light chain gene
- the partial base sequence of the target gene may be a partial base sequence of the constant region of the antibody light chain gene.
- antibodies may be produced using animals such as skunks, for example, whose genome sequence has not been clarified.
- the base sequence of the genome may not be used as a reference sequence for determining the base sequence.
- the base sequence of the antibody gene can be determined even in such a case.
- the base sequence of an antibody gene only the base sequence of the variable region could be specified.
- the full length of the base sequence of the target gene including the constant region can be determined. Therefore, as will be described later in the Examples, when the target gene is an antibody gene, it is possible to specify the antibody isotype and subclass. Further, for example, it is possible to detect a small number of mutants due to somatic mutation of the antibody gene.
- T cell receptor gene For example, in adoptive cancer immunotherapy and the like, there is a demand for determining the base sequence of a T cell receptor. Therefore, for example, even if the target cell is a T cell, the target gene is a T cell receptor gene, and the partial base sequence of the target gene is a partial base sequence of the constant region of the T cell receptor gene. Good.
- the target cell is an immature B cell
- the target gene is a B cell receptor weight chain gene
- a part of the base sequence of the target gene is a part of the constant region of the B cell receptor weight chain gene.
- the target cell is an immature B cell
- the target gene is a B cell receptor light chain gene
- a partial base sequence of the target gene is a partial base of the constant region of the B cell receptor light chain gene It may be an array.
- the target gene is not limited to the above gene, and may be any gene.
- SNVs single nucleotide variants
- SNPs single nucleotide polymorphisms
- Indel insertion / deletion
- splicing variants etc.
- Hybridoma cell lines (clones HD1, HD2, HD3 and HD4) established by the inventors were used in the experiments. Each hybridoma is 10% fetal bovine serum (FBS), 1.2% penicillin-streptomycin-glutamine (Gibco), hybridoma serum-free medium (Gibco) supplemented with 1 ng / mL interleukin (IL) -6, or Culturing was performed using GIT medium (Wako Pure Chemical Industries) supplemented with 1 ng / mL IL-6.
- FBS fetal bovine serum
- Gibco penicillin-streptomycin-glutamine
- Gibco hybridoma serum-free medium
- IL interleukin
- Culturing was performed using GIT medium (Wako Pure Chemical Industries) supplemented with 1 ng / mL IL-6.
- RNA-seq Total RNA was prepared from each hybridoma cell line using a commercially available kit (model “AllPrep DNA / RNA Mini Kit”, Qiagen). Using 1 ⁇ g of total RNA, a library was prepared using a commercially available kit (model “NEBNext Ultra Directional RNA Library Prep Kit”, New England Biolabs). In this kit, the total RNA used can be reduced to about 0.1 ⁇ g to prepare a library.
- mRNA-seq was performed by paired end sequencing with an average read length of 50 bp using a next-generation sequencer (model “HiSeq 1500”, Illumina). For each hybridoma cell line, base sequence data with the number of reads of 40 ⁇ 10 6 or more was obtained.
- Custom transcriptome reference sequences include the bases of the mouse transcript, rat transcript, rat immunoglobulin heavy chain (Igh) constant region, immunoglobulin lambda light chain (Igl) constant region and immunoglobulin ⁇ light chain (Igk) constant region. Contained sequences.
- BWA-MEM which is a mapping program
- leads were mapped with parameters of -t8-P-L10000.
- the TIGAR2 program was used with default settings.
- the expression level of each gene was quantified by FPKM (fragments per kilobase of exon per million mapped fragments).
- the base sequence (CDS) coding for Igh and Igl / Igk contains a characteristic base sequence in the constant region of 20-30 bp of Igh or Igl / Igk, as a result of filtering (assembled base sequence). And an appropriate length (over 1200 bp for Igh and over 600 bp for Igl / Igk).
- RNA of each hybridoma was purified by phenol / chloroform extraction.
- a reverse transcription reaction was performed using a commercially available kit (model “PrimeScript TM II 1st strand cDNA Synthesis Kit”, Takara Bio Inc.).
- PCR was performed using an enzyme (model “KOD Plus”, Toyobo Co., Ltd.) and a thermal cycler.
- the PCR product was purified by gel extraction to remove nonspecific amplification products. Thereafter, the base sequence was determined by the Sanger method. Table 1 below shows the nucleotide sequence numbers of the primers used for PCR.
- FIG. 1 is a graph in which transcriptomes are arranged in descending order of expression level.
- the base sequence encoding Igh was extracted by filtering.
- the filtering contigs having a base sequence of 20-30 bp characteristic of the constant region of Igh were extracted.
- the sequence numbers of the base sequences used for filtering are shown in Table 2 below.
- Full length IgH has more than 400 amino acids. Therefore, a base sequence of 1,395 bp was identified as a base sequence of Igh having a length of 1,200 bp or more and including a base sequence of 24 bp characteristic of Ighg2b. The identified Igh base sequence was identical to the Igh base sequence of clone HD1 whose base sequence was determined by the Sanger method.
- FIG. 2 (a) is a diagram showing the Igh base sequence (SEQ ID NO: 28) and the deduced amino acid sequence (SEQ ID NO: 29) of clone HD1 whose base sequence was determined by de novo transcriptome assembly.
- FIG. 2 (b) shows the amino acid sequence of the IgH protein of clone HD1 whose base sequence was determined by de novo transcriptome assembly and the amino acid sequence of the constant region of known rat IgH (IgG2b) (accession number: AAA6078, It is the figure which aligned with sequence number 30).
- IgG2b accession number: AAA6078, It is the figure which aligned with sequence number 30.
- the base sequence encoding Igl / Igk was extracted by filtering.
- the filtering contigs having a base sequence of 20 to 30 bp characteristic of the constant region of Igl / Igk were extracted.
- the sequence numbers of the base sequences used for filtering are shown in Table 3 below.
- Full length IgK has 200 or more amino acids. Therefore, a base sequence of 705 bp was identified as a base sequence of Igk having a length of 600 bp or more and including a base sequence characteristic of Igk. The identified Igk nucleotide sequence was identical to the Igk nucleotide sequence of clone HD1 whose nucleotide sequence was determined by the Sanger method.
- FIG. 3 (a) is a diagram showing the Igk base sequence (SEQ ID NO: 31) and the deduced amino acid sequence (SEQ ID NO: 32) of clone HD1 whose base sequence was determined by de novo transcriptome assembly.
- FIG. 3 (b) shows the amino acid sequence of the IgK protein of clone HD1 whose nucleotide sequence was determined by de novo transcriptome assembly and the amino acid sequence of the known rat IgK constant region (accession number: CAA24558, SEQ ID NO: 33). ).
- the amino acid sequence of 129 to 234 out of the amino acid sequence of IgK produced by clone HD1 matches the amino acid sequence of a known rat IgK constant region.
- the inventors similarly identified the nucleotide sequence of the antibody gene produced by clone HD2 (Ighg2a / Igk), clone HD3 (Ighg2a / Igk), and clone HD4 (Ighg2a / Igk).
- FIG. 4A is a graph showing the result of calculating the reconstruction rate of Igh when de novo assembly is performed from the leads of each number of leads. As a result, it was revealed that in all 4 clones, the Igh base sequence can be completely identified with the number of reads exceeding 30 ⁇ 10 3 reads.
- FIG. 4B is a graph showing the result of calculating the reconstruction rate of Igk when de novo assembly is performed from leads of each number of leads. As a result, it was revealed that the Igk base sequence could be completely identified with the number of reads exceeding 10 ⁇ 10 3 reads in all 4 clones.
- FIG. 5 summarizes the current technique.
- mRNA is extracted from the cells, and the base sequence is determined by next-generation sequencing (NGS).
- NGS next-generation sequencing
- FIG. 5 (c) a contig is created by de novo assembly of data obtained by mRNA-seq.
- FIG. 5D a contig having a specific base sequence is specified by filtering to obtain a base sequence of the target gene.
Landscapes
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Organic Chemistry (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Health & Medical Sciences (AREA)
- Engineering & Computer Science (AREA)
- Microbiology (AREA)
- Biochemistry (AREA)
- Biotechnology (AREA)
- Molecular Biology (AREA)
- Biophysics (AREA)
- Analytical Chemistry (AREA)
- Physics & Mathematics (AREA)
- Immunology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
- Peptides Or Proteins (AREA)
Abstract
Description
[1]対象細胞が発現する標的遺伝子の塩基配列を決定する方法であって、前記対象細胞中のmRNAの塩基配列を網羅的に決定する工程と、決定された前記mRNAの塩基配列のうち、前記標的遺伝子の一部の塩基配列を有する塩基配列を特定する工程と、を含み、特定された前記塩基配列が標的遺伝子の塩基配列である、方法。
[2]前記標的遺伝子は、前記対象細胞が発現する全遺伝子をmRNAの分子数が多いものから順に順位付けした場合の順位が1~10位である、[1]に記載の方法。
[3]前記対象細胞が抗体産生細胞であり、前記標的遺伝子が抗体重鎖遺伝子であり、前記標的遺伝子の一部の塩基配列が抗体重鎖遺伝子の定常領域の一部の塩基配列であるか、又は、前記標的遺伝子が抗体軽鎖遺伝子であり、前記標的遺伝子の一部の塩基配列が抗体軽鎖遺伝子の定常領域の一部の塩基配列である、[1]又は[2]に記載の方法。
[4]mRNAの塩基配列を網羅的に決定する前記工程が、次世代シーケンシングにより行われる、[1]~[3]のいずれかに記載の方法。
[5]次世代シーケンシングにおける塩基配列のリード数が50,000リード以下である、[4]に記載の方法。
例えば、対象細胞が抗体産生細胞であり、標的遺伝子が抗体重鎖遺伝子であり、標的遺伝子の一部の塩基配列が抗体重鎖遺伝子の定常領域の一部の塩基配列であってもよい。あるいは、対象細胞が抗体産生細胞であり、標的遺伝子が抗体軽鎖遺伝子であり、標的遺伝子の一部の塩基配列が抗体軽鎖遺伝子の定常領域の一部の塩基配列であってもよい。
例えば、癌の養子免疫療法等において、T細胞受容体の塩基配列を決定する需要がある。そこで、例えば、対象細胞がT細胞であり、標的遺伝子がT細胞受容体遺伝子であり、標的遺伝子の一部の塩基配列がT細胞受容体遺伝子の定常領域の一部の塩基配列であってもよい。
例えば、対象細胞が未成熟なB細胞であり、標的遺伝子がB細胞受容体重鎖遺伝子であり、標的遺伝子の一部の塩基配列がB細胞受容体重鎖遺伝子の定常領域の一部の塩基配列であってもよい。あるいは、対象細胞が未成熟なB細胞であり、標的遺伝子がB細胞受容体軽鎖遺伝子であり、標的遺伝子の一部の塩基配列がB細胞受容体軽鎖遺伝子の定常領域の一部の塩基配列であってもよい。
標的遺伝子は、上記の遺伝子に限られず、任意の遺伝子であってもよい。本実施形態の方法により、例えば、任意の標的遺伝子について、single nucleotide variants(SNVs)、single nucleotide polymorphysms(SNPs)、insertion/deletion(Indel)、スプライシングバリアント等を容易に解析することができる。
(細胞株)
発明者らが樹立したハイブリドーマ細胞株(クローンHD1、HD2、HD3及びHD4)を実験に用いた。各ハイブリドーマは、10%ウシ胎児血清(FBS)、1.2%ペニシリン-ストレプトマイシン-グルタミン(ギブコ社)、1ng/mL インターロイキン(IL)-6を添加したハイブリドーマ無血清培地(ギブコ社)、又は、1ng/mL IL-6を添加したGIT培地(和光純薬)を使用して培養した。
各ハイブリドーマ細胞株から、市販のキット(型式「AllPrep DNA/RNA Mini Kit」、キアゲン社)を使用して全RNAを調製した。1μgの全RNAを使用し、市販のキット(型式「NEBNext Ultra Directional RNA Library Prep Kit」、ニュー・イングランド・バイオラボ社)を用いてライブラリーを作製した。なお、このキットは、使用する全RNAを約0.1μgにまで減らしてライブラリーを作製することができる。
取得されたリードは、発明者らのカスタムトランスクリプトーム参照配列に対してマッピングした。カスタムトランスクリプトーム参照配列は、マウストランスクリプト、ラットトランスクリプト、ラットイムノグロブリン重鎖(Igh)定常領域、イムノグロブリンλ軽鎖(Igl)定常領域及びイムノグロブリンκ軽鎖(Igk)定常領域の塩基配列を含んでいた。
全リード又は「fastq-sample」プログラム(http://homes.cs.washington.edu/~dcjones/fastq-tools)によりサブサンプリングしたリードを、「Trinity」プログラムを用いてデノボ・アセンブリした。CPU及びmax-memoryパラメータはリード数に応じて変更した。例えば、40×106リードの場合、CPUパラメータは8、max-memoryパラメータは52Gに設定した。また、例えば、1×106リードの場合、CPUパラメータは2、max-memoryパラメータは12Gに設定した。
各ハイブリドーマのRNAをフェノール/クロロホルム抽出により精製した。市販のキット(型式「PrimeScript(商標)II 1st strand cDNA Synthesis Kit」、タカラバイオ社)を用いて逆転写反応を行った。酵素(型式「KOD Plus」、東洋紡社)及びサーマルサイクラーを用いてPCRを行った。PCR産物はゲル抽出により精製し、非特異的な増幅産物を除去した。その後サンガー法により塩基配列を決定した。PCR使用したプライマーの塩基配列の配列番号を下記表1に示す。
(ハイブリドーマのmRNA-seq解析)
ラットBリンパ球とマウスミエローマ細胞株SP2との細胞融合により樹立したハイブリドーマ細胞株である、クローンHD1、HD2、HD3及びHD4のmRNA-seqをそれぞれ行った。平均リード長50bpでペアードエンドシーケンシングを行った。
(ラットハイブリドーマのIgh及びIgl/Igkの塩基配列のアセンブリ)
実験例1で得られたmRNA-seqデータのデノボ・トランスクリプトーム・アセンブリにより、Igh及びIgl/Igkの塩基配列の再構成を試みた。
(マウスハイブリドーマのIgh及びIgl/Igkの塩基配列のアセンブリ)
実験例2と同様にして、マウスハイブリドーマであるクローン8A2及び13C7のIgh及びIgkの塩基配列を決定した。マウスIgh及びIgl/Igkのフィルタリングに用いた塩基配列の配列番号を下記表4に示す。
(抗体遺伝子の塩基配列決定のためのデノボ・アセンブリ条件の最適化)
ハイブリドーマのmRNA-seqデータを用いてIgh及びIgl/Igk遺伝子の塩基配列を決定する条件の最適化を試みた。まず、抗体遺伝子の塩基配列の決定に必要なリード数を検討した。
(他のハイブリドーマが産生する抗体の遺伝子の塩基配列の決定)
今回の塩基配列決定方法を、更に多数のハイブリドーマに適用し、抗体遺伝子の塩基配列を決定した。その結果、96種類以上のハイブリドーマについて、1サンプルあたり200×103リードでmRNA-seqを行い、抗体遺伝子の塩基配列を正確に決定することができた。
Claims (5)
- 対象細胞が発現する標的遺伝子の塩基配列を決定する方法であって、
前記対象細胞中のmRNAの塩基配列を網羅的に決定する工程と、
決定された前記mRNAの塩基配列のうち、前記標的遺伝子の一部の塩基配列を有する塩基配列を特定する工程と、
を含み、特定された前記塩基配列が標的遺伝子の塩基配列である、方法。 - 前記標的遺伝子は、前記対象細胞が発現する全遺伝子をmRNAの分子数が多いものから順に順位付けした場合の順位が1~10位である、請求項1に記載の方法。
- 前記対象細胞が抗体産生細胞であり、
前記標的遺伝子が抗体重鎖遺伝子であり、前記標的遺伝子の一部の塩基配列が抗体重鎖遺伝子の定常領域の一部の塩基配列であるか、又は
前記標的遺伝子が抗体軽鎖遺伝子であり、前記標的遺伝子の一部の塩基配列が抗体軽鎖遺伝子の定常領域の一部の塩基配列である、
請求項1又は2に記載の方法。 - mRNAの塩基配列を網羅的に決定する前記工程が、次世代シーケンシングにより行われる、請求項1~3のいずれか一項に記載の方法。
- 次世代シーケンシングにおける塩基配列のリード数が50,000リード以下である、請求項4に記載の方法。
Priority Applications (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2018503404A JP6869550B2 (ja) | 2016-03-02 | 2017-03-02 | 標的遺伝子の塩基配列を決定する方法 |
US16/079,745 US20190048414A1 (en) | 2016-03-02 | 2017-03-02 | Method for determining nucleic acid sequence of target gene |
EP17760140.8A EP3425057A4 (en) | 2016-03-02 | 2017-03-02 | METHOD FOR DETERMINING A NUCLEIC ACID SEQUENCE OF A TARGET GENE |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US201662302196P | 2016-03-02 | 2016-03-02 | |
US62/302,196 | 2016-03-02 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2017150677A1 true WO2017150677A1 (ja) | 2017-09-08 |
Family
ID=59743069
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/JP2017/008320 WO2017150677A1 (ja) | 2016-03-02 | 2017-03-02 | 標的遺伝子の塩基配列を決定する方法 |
Country Status (4)
Country | Link |
---|---|
US (1) | US20190048414A1 (ja) |
EP (1) | EP3425057A4 (ja) |
JP (1) | JP6869550B2 (ja) |
WO (1) | WO2017150677A1 (ja) |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2014210585A2 (en) * | 2013-06-29 | 2014-12-31 | Firmenich Sa | Methods of identifying, isolating and using odorant and aroma receptors |
Family Cites Families (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20130195843A1 (en) * | 2010-06-23 | 2013-08-01 | British Columbia Cancer Agency Branch | Biomarkers for Non-Hodgkin Lymphomas and Uses Thereof |
AU2013331212A1 (en) * | 2012-10-19 | 2015-05-07 | Adaptive Biotechnologies Corp. | Monitoring clonotypes of plasma cell proliferative disorders in peripheral blood |
US10175242B2 (en) * | 2014-02-28 | 2019-01-08 | H. Lee Moffitt Cancer Center And Research Institute, Inc. | Personalized myeloma detection |
-
2017
- 2017-03-02 WO PCT/JP2017/008320 patent/WO2017150677A1/ja active Application Filing
- 2017-03-02 US US16/079,745 patent/US20190048414A1/en not_active Abandoned
- 2017-03-02 EP EP17760140.8A patent/EP3425057A4/en not_active Withdrawn
- 2017-03-02 JP JP2018503404A patent/JP6869550B2/ja active Active
Patent Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2014210585A2 (en) * | 2013-06-29 | 2014-12-31 | Firmenich Sa | Methods of identifying, isolating and using odorant and aroma receptors |
Non-Patent Citations (2)
Title |
---|
See also references of EP3425057A4 |
ZHOU, H. ET AL.: "Optimization of primer sequences for mouse scFv repertoire display library construction", NUCLEIC ACIDS RESEARCH, vol. 22, no. 5, 1994, pages 888 - 889 |
Also Published As
Publication number | Publication date |
---|---|
JP6869550B2 (ja) | 2021-05-12 |
EP3425057A4 (en) | 2019-07-24 |
US20190048414A1 (en) | 2019-02-14 |
EP3425057A1 (en) | 2019-01-09 |
JPWO2017150677A1 (ja) | 2018-12-27 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Oikonomopoulos et al. | Methodologies for transcript profiling using long-read technologies | |
Da Costa et al. | Molecular approaches to diagnose Diamond-Blackfan anemia: The EuroDBA experience | |
Di Genova et al. | Whole genome comparison between table and wine grapes reveals a comprehensive catalog of structural variants | |
Ji et al. | Isolation and analyses of genes preferentially expressed during early cotton fiber development by subtractive PCR and cDNA array | |
Beckmann et al. | Effects of high passage cultivation on CHO cells: a global analysis | |
Wuest et al. | Genomics in mammalian cell culture bioprocessing | |
Ohshima et al. | Inference for the initial stage of domain shuffling: tracing the evolutionary fate of the PIPSL retrogene in hominoids | |
TW201321518A (zh) | 微量核酸樣本的庫製備方法及其應用 | |
Kubo et al. | Single-cell transcriptome analysis of Physcomitrella leaf cells during reprogramming using microcapillary manipulation | |
EP3414255B1 (en) | Preparative electrophoretic method for targeted purification of genomic dna fragments | |
US20140302559A1 (en) | Rapid method for cloning and expression of cognate antibody variable region gene segments | |
Cole et al. | Highly accurate sequencing of Full-Length immune repertoire amplicons using Tn5-Enabled and molecular Identifier–Guided amplicon assembly | |
Chen et al. | Barcoded sequencing workflow for high throughput digitization of hybridoma antibody variable domain sequences | |
JP6904953B2 (ja) | 細胞クローン性を決定する方法 | |
CN109112217B (zh) | 一种与猪体长和乳头数显著关联的遗传标记及应用 | |
US20200355698A1 (en) | Use of nanoexpression to interrogate antibody repertoires | |
Marx et al. | The octatricopeptide repeat protein Raa8 is required for chloroplast trans splicing | |
WO2022018055A1 (en) | Circulation method to sequence immune repertoires of individual cells | |
Rubio et al. | Small RNA populations reflect the complex dialogue established between heterograft partners in grapevine | |
Srila et al. | Glutamine synthetase (GS) knockout (KO) using CRISPR/Cpf1 diversely enhances selection efficiency of CHO cells expressing therapeutic antibodies | |
Subas Satish et al. | NAb-seq: an accurate, rapid, and cost-effective method for antibody long-read sequencing in hybridoma cell lines and single B cells | |
Vollmers et al. | Novel exons and splice variants in the human antibody heavy chain identified by single cell and single molecule sequencing | |
Hamashima et al. | Nematode-specific tRNAs that decode an alternative genetic code for leucine | |
WO2017150677A1 (ja) | 標的遺伝子の塩基配列を決定する方法 | |
Bauer et al. | An arrayed CRISPR screen reveals Myc depletion to increase productivity of difficult-to-express complex antibodies in CHO cells |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
WWE | Wipo information: entry into national phase |
Ref document number: 2018503404 Country of ref document: JP |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2017760140 Country of ref document: EP |
|
ENP | Entry into the national phase |
Ref document number: 2017760140 Country of ref document: EP Effective date: 20181002 |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 17760140 Country of ref document: EP Kind code of ref document: A1 |