WO2014153753A1 - Gene mutant and use thereof - Google Patents

Gene mutant and use thereof Download PDF

Info

Publication number
WO2014153753A1
WO2014153753A1 PCT/CN2013/073365 CN2013073365W WO2014153753A1 WO 2014153753 A1 WO2014153753 A1 WO 2014153753A1 CN 2013073365 W CN2013073365 W CN 2013073365W WO 2014153753 A1 WO2014153753 A1 WO 2014153753A1
Authority
WO
WIPO (PCT)
Prior art keywords
nucleic acid
sample
mutation
seq
sequencing
Prior art date
Application number
PCT/CN2013/073365
Other languages
French (fr)
Chinese (zh)
Inventor
张建国
方明艳
王海荣
杜萨特科瓦•佩特拉
王威
王俊
Original Assignee
深圳华大基因科技有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 深圳华大基因科技有限公司 filed Critical 深圳华大基因科技有限公司
Priority to CN201380074760.0A priority Critical patent/CN105263944B/en
Priority to PCT/CN2013/073365 priority patent/WO2014153753A1/en
Publication of WO2014153753A1 publication Critical patent/WO2014153753A1/en
Priority to HK16101540.5A priority patent/HK1213573A1/en

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07HSUGARS; DERIVATIVES THEREOF; NUCLEOSIDES; NUCLEOTIDES; NUCLEIC ACIDS
    • C07H21/00Compounds containing two or more mononucleotide units having separate phosphate or polyphosphate groups linked by saccharide radicals of nucleoside groups, e.g. nucleic acids
    • C07H21/04Compounds containing two or more mononucleotide units having separate phosphate or polyphosphate groups linked by saccharide radicals of nucleoside groups, e.g. nucleic acids with deoxyribosyl as saccharide radical
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6876Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
    • C12Q1/6883Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q2600/00Oligonucleotides characterized by their use
    • C12Q2600/156Polymorphic or mutational markers

Definitions

  • the present invention relates to gene mutants and uses thereof.
  • the present invention relates to an isolated nucleic acid, an isolated polypeptide, a method of screening a biological sample of adult-onset diabetes in a susceptible young person, a system for screening a biological sample of adult-onset diabetes in a susceptible young person, and A kit for screening biological samples of adult onset diabetes in susceptible young people. Background technique
  • Maturity onset diabetes of the young is a group of monogenic diseases with high genetic and clinical phenotype heterogeneity.
  • Fajans and Tattersall analyzed and named the series according to the series since 1950, and summarized the following clinical features: (1) involving 3 or more generations of family members, autosomal dominant, not related to human leukocyte antigen (HLA) (2) There are usually more than 2 patients in the family who are ill before the age of 25; (3) The penetrance rate is high, which can exceed 90%; (4) The disease progresses slowly, and can be asymptomatic or only exhibits glucose tolerance during adolescence. Reduced; (5) General ketoacidosis, at least for 2 years after the onset of insulin treatment.
  • the present invention aims to solve at least one of the technical problems existing in the prior art. To this end, it is an object of the present invention to provide a method for efficiently screening biological samples of adult onset diabetes in susceptible young people.
  • the present invention was accomplished based on the inventors' work in that the inventors identified new mutations in the causative genes of adult onset diabetes in young adults by high-throughput exome sequencing combined with candidate gene mutation verification.
  • the invention proposes an isolated nucleic acid. According to an embodiment of the invention,
  • the nucleic acid has a C.703 OT mutation compared to SEQ ID NO: 1, or the nucleic acid has at least one mutation selected from the group consisting of c.338G>T and c.358A>G compared to SEQ ID NO:3.
  • the inventors identified HNF1B and HNF1A gene mutants, which are closely related to the onset of adult-onset diabetes in young people, thereby detecting the presence or absence of these new mutants in biological samples. It can effectively detect whether biological samples are susceptible to adult-onset diabetes in young people.
  • the invention provides an isolated polypeptide.
  • SEQ ID NO: 2 the isolated polypeptide has a p. Arg235Trp mutation; or compared to SEQ ID NO: 4, the isolated polypeptide has at least one mutation selected from the group consisting of: p.Trpl l3Leu and p.Lysl20Glu .
  • the invention provides a method of screening a biological sample of adult onset diabetes in a susceptible young person.
  • the method comprises the steps of: extracting a nucleic acid sample from a biological sample; determining a nucleic acid sequence of the nucleic acid sample; having a nucleic acid sequence of the nucleic acid sample or a complementary sequence thereof, having SEQ ID NO: 1
  • An at least one mutation selected from the group consisting of C. 703 OT mutations or having SEQ ID NO: 3 and having at least one mutation selected from the group consisting of c.338G>T and c.358A>G is an adult-onset type in a susceptible young person of the biological sample.
  • Instructions for diabetes By screening a biological sample of adult-onset diabetes in a susceptible young person according to an embodiment of the present invention, a biological sample of adult-onset diabetes in a susceptible young person can be effectively screened.
  • the invention provides a system for screening biological samples of adult onset diabetes in susceptible young people.
  • the system includes: a nucleic acid extraction device for extracting a nucleic acid sample from the biological sample; a nucleic acid sequence determining device, the nucleic acid sequence determining device being coupled to the nucleic acid extraction device, For analyzing the nucleic acid sample to determine a nucleic acid sequence of the nucleic acid sample; a judging device, the judging device being coupled to the nucleic acid sequence determining device, based on the nucleic acid sequence of the nucleic acid sample or a complementary sequence thereof, Having a C.703OT mutation compared to SEQ ID NO: 1, or having at least one mutation selected from c.338G>T and c.358A>G compared to SEQ ID NO: 3, determining whether the biological sample is easy Adults with age-onset diabetes.
  • the present invention provides a kit for screening a biological sample of adult onset diabetes in a susceptible young person.
  • the kit comprises: an agent suitable for detecting at least one of the HNF1B and HNFL4 gene mutants, wherein the HN ⁇ B gene mutant has C.703OT compared to SEQ ID NO: 1. Mutation; compared to SEQ ID NO: 3, the HNFL4 gene mutant has at least one mutation selected from the group consisting of: c.338G>T and c.358A>G.
  • Figure 1 is a schematic view showing a system for screening a biological sample of adult-onset diabetes mellitus in a susceptible young person, and a component thereof, according to an embodiment of the present invention, wherein
  • A is a line for screening biological samples of adult-onset diabetes in susceptible young people according to an embodiment of the present invention.
  • B is a schematic diagram of a nucleic acid extraction device according to an embodiment of the present invention.
  • C is a schematic diagram of a nucleic acid sequence determining device according to an embodiment of the present invention.
  • Figure 2 shows the family diagram of the MODY patient family CZ01 and CZ02 according to one embodiment of the present invention, wherein
  • A is a family diagram of the CZ01 family according to an embodiment of the present invention.
  • B is a family diagram of a CZ02 family according to an embodiment of the present invention.
  • Fig. 3 is a view showing a Sanger sequencing verification peak of a HNF1B gene mutation site in a patient and a normal human in a family of CZ01 in a family of MODY patients according to an embodiment of the present invention
  • Figure 4 shows a Sanger sequencing verification peak of the HNF1A gene mutation site in patients and their normal humans in the family of CZ02 in the MODY patient family according to one embodiment of the present invention
  • Figure 5 shows a Sanger sequencing validation peak map of the HNFL4 gene mutation site in MODY sporadic patients and off-normal subjects in accordance with one embodiment of the present invention. Detailed description of the invention
  • the invention proposes an isolated nucleic acid.
  • the nucleic acid has a C.703OT mutation compared to SEQ ID NO: 1 or, in contrast to SEQ ID NO: 3, the nucleic acid has at least one mutation selected from the group consisting of: c.338G> T and c.358A>G.
  • the isolated nucleic acid of the present invention encodes a HNF1B or HNF1A mutant, which may also be referred to as a "nucleic acid encoding a HNF1B or HNF1A mutant", that is, the nucleic acid may be understood to correspond to a gene encoding a HNF1B or HNF1A mutant.
  • the nucleic acid substance, that is, the type of the nucleic acid is not particularly limited, and may be any polymer containing deoxyribonucleotides and/or ribonucleotides corresponding to the coding genes of HNF1B and HNF1A, including but not limited to DNA, RA. Or cDNA.
  • the nucleic acid encoding the HNF1B and HNF1A mutants described above is DNA.
  • the inventors identified a novel mutant of the HA ⁇ JB p HNFM gene, which is closely related to the onset of adult-onset diabetes in young people, thereby detecting the mutant in a biological sample. Whether it exists, can effectively detect whether the biological sample is susceptible to adult-onset diabetes in young people, or can detect whether the organism is prevalent in the organism, and can effectively predict whether the organism is susceptible to adulthood in young people. Onset diabetes.
  • HNF1B and HNF1A mutants are new in the inventors of the present application by high-throughput exome sequencing combined with candidate gene mutation verification, and identified new genes in the pathogenic genes of adult onset diabetes in young people. Mutation, and the C.703OT mutation on the HN B gene is not seen in the prior art, and the c.338G>T, c.358A>G mutation on the HNFL4 gene is associated with adult onset diabetes in young people. Report.
  • nucleotide sequence of the cDNA of the wild type HNF1B gene is shown below (1674 nt):
  • TDTSSISTLTNMSSSKQCPLQAW (SEQ ID NO: 2).
  • nucleotide sequence of the cDNA of the wild type HNFIA gene is shown below (1896 nt):
  • the HNF1B gene mutant discovered by the inventors has a C.703OT (exon 3 ) mutation compared to SEQ ID NO: 1, that is, the 703th C in the cDNA of the HNF1B gene mutant of the present invention relative to the wild type HNF1B gene.
  • Mutation to T in exon 3
  • the encoded product has a p.Arg235Trp (R235W) mutation compared to hepatocyte nuclear factor 1 ⁇ (HNF IB) (SEQ ID NO: 2) That is, its 235 amino acid is mutated from Arg to Trp.
  • the HNFJ4 gene mutant discovered by the inventors has at least one mutation of c. 338G>T (exon 2 ) and c.358A>G (exon 2 ) compared to SEQ ID NO: 3, ie, relative to wild type
  • the HNF1A gene, in the cDNA of the HNF1A gene mutant of the present invention has a G mutation at position 338 of T (in exon 2), and an A mutation at position 358 is G (in exon 2), whereby the encoded product thereof Hepatocyte nuclear factor 1 ⁇ (HNF1A) ( SEQ ID NO: 4 ) has p.Trpl l3Leu ( W113L ) or p ⁇ ysl20Glu (K120E) mutation, ie its 113 amino acid is mutated from Trp to Leu Or its 120 amino acid is mutated from Lys to Glu.
  • the inventors of the present invention first proposed that the HNF1B C.703OT mutation may cause clinical manifestations of hyperglycemia in individuals without renal function defects, and the c.338G>T and c.358A>G mutations of the H HNF1A gene cause the patient to appear in young people.
  • Adult onset diabetes symptoms The inventors of the present invention first proposed that the HNF1B C.703OT mutation may cause clinical manifestations of hyperglycemia in individuals without renal function defects, and the c.338G>T and c.358A>G mutations of the H HNF1A gene cause the patient to appear in young people.
  • Adult onset diabetes symptoms may cause clinical manifestations of hyperglycemia in individuals without renal function defects.
  • the invention proposes an isolated polypeptide. According to an embodiment of the invention, with SEQ
  • the isolated polypeptide has a p.Arg235Trp mutation; or compared to SEQ ID NO: 4, the isolated polypeptide has at least one mutation selected from the group consisting of p.Trpl l3Leu and p ⁇ ysl20Glu.
  • the polypeptide having the p.Arg235Trp mutation is encoded by the aforementioned isolated nucleic acid encoding the HNF1B mutant, and the polypeptide having the p.Trpl l3Leu and p ⁇ ysl20Glu mutations is the HNF1A mutant encoded by the foregoing. Nucleic acid encoded.
  • the invention provides a method of screening a biological sample of adult onset diabetes in a susceptible young person.
  • the method of screening a biological sample of adult onset diabetes in a susceptible young person may comprise the following steps:
  • a nucleic acid sample is extracted from a biological sample.
  • the type of the biological sample is not particularly limited as long as a nucleic acid sample reflecting the presence or absence of a mutation in the biological samples HNF1B and HNF1A can be extracted from the biological sample.
  • the biological sample may be at least one selected from the group consisting of human blood, skin, and subcutaneous tissue. Thereby, sampling and detection can be conveniently performed, thereby further improving the efficiency of screening biological samples of adult-onset diabetes in susceptible young people.
  • nucleic acid sample as used herein shall be understood broadly, and may be any sample capable of reflecting the presence or absence of a mutation in HA ⁇ JB pHNFM in a biological sample, for example, may be directly extracted from a biological sample.
  • Whole genome DNA which may also be part of the whole genome comprising the HNF1B and HNF1A coding sequences, may be total RA extracted from a biological sample, or may be a NA NA extracted from a biological sample.
  • the nucleic acid sample is whole genome DNA.
  • extracting a nucleic acid sample from a biological sample for using RA as a nucleic acid sample may further include: extracting an RNA sample from the biological sample, preferably the RA sample is mRNA; and based on the obtained RNA sample, The transcription reaction, obtaining a cDNA sample, and the obtained cDNA sample constitutes a nucleic acid sample.
  • the nucleic acid sample can be analyzed to enable determination of the nucleic acid sequence of the resulting nucleic acid sample.
  • the method and apparatus for determining the nucleic acid sequence of the obtained nucleic acid sample are not particularly limited.
  • the nucleic acid sequence of the nucleic acid sample can be determined by sequencing methods.
  • the method and apparatus that can be used for sequencing according to embodiments of the present invention are not particularly limited.
  • second generation sequencing techniques can be employed, and third generation and fourth generation or more advanced sequencing techniques can also be employed.
  • the nucleic acid sequence can be sequenced using at least one selected from the group consisting of Hiseq2000, SOLiD, 454, and a single molecule sequencing device. Therefore, combined with the latest sequencing technology, high sequencing depth can be achieved for a single site, detection sensitivity and accuracy are greatly improved, and thus the high-throughput and deep sequencing characteristics of these sequencing devices can be utilized to further improve nucleic acid samples. The efficiency of the test analysis. Thereby, the accuracy and accuracy of subsequent analysis of the sequenced data can be improved.
  • determining the nucleic acid sequence of the nucleic acid sample may further comprise: first, constructing a nucleic acid sequencing library for the obtained nucleic acid sample; and sequencing the obtained nucleic acid sequencing library to obtain a plurality of The sequencing results of the sequencing data.
  • the resulting nucleic acid sequencing library can be sequenced using at least one selected from the group consisting of Hiseq2000, SOLiD, 454, and a single molecule sequencing device.
  • nucleic acid samples can be screened and enriched for HNF1B and HNFJ4 exons, which can be used in the process of constructing a sequencing library, or constructing a test before constructing a sequencing library.
  • the sequence library is followed.
  • constructing the nucleic acid sequencing library for the nucleic acid sample further comprises: performing PCR amplification on the nucleic acid sample using at least one selected from the group consisting of HN B and HNFM gene exon-specific primers; Amplification product, construct a nucleic acid sequencing library.
  • the HNF1B and HNF1A exons can be enriched by PCR amplification, thereby further improving the efficiency of screening biological samples of adult-onset diabetes in susceptible young people.
  • the sequence of the HNF1B HNFM gene exon-specific primer is not particularly limited, and according to a preferred embodiment of the present invention, the HN B gene exon-specific primer may have SEQ ID NOs: 5-6 The nucleotide sequence shown, the HNF1A gene exon-specific primer may have a nucleotide sequence as shown in SEQ ID NOS: 7-8.
  • the inventors have surprisingly found that by using the primers shown in SEQ ID NOS: 5-6, amplification of the exon 3 sequence of HA ⁇ JB, particularly C.703OT, can be performed in the PCR reaction system significantly and efficiently.
  • the primers shown in SEQ ID NOS: 7-8 can be used to significantly and efficiently amplify the exon 2 sequence of HNF1A, particularly c.338G>T and c.358A>G. It is to be noted that these nucleotide sequences shown by SEQ ID NOS: 5-8 were unexpectedly obtained by the inventors of the present invention after exerting laborious labor.
  • the method and apparatus for extracting a nucleic acid sample from a biological sample are also not particularly limited, and can be carried out using a commercially available nucleic acid extraction kit.
  • nucleic acid sequence as used herein should be understood in a broad sense, which may be the complete nucleic acid sequence information obtained after assembling the sequencing data obtained by sequencing the nucleic acid sample, or may be direct ⁇ The sequencing data obtained by sequencing the nucleic acid sample is used as the nucleic acid sequence as long as the nucleic acid sequence contains the coding sequences corresponding to HNF1B and HNF1A.
  • the corresponding reference sequence of the nucleic acid sequence of the obtained nucleic acid sample is aligned, and when the obtained nucleic acid sequence has at least one of the aforementioned three mutations, the biological sample is indicated Adults with age-onset diabetes.
  • the nucleic acid sequence of the nucleic acid sample is compared with the sequence of SEQ ID NO: 1, if the obtained nucleic acid is A C.703OT mutation in the sequence indicates that the biological sample is susceptible to adult onset diabetes in young people.
  • the nucleic acid sequence of the nucleic acid sample is aligned with the sequence of SEQ ID NO: 3, if it has c in the obtained nucleic acid sequence At least one mutation of .338G>T and c.358A>G indicates that the biological sample is susceptible to adult-onset diabetes in young people.
  • obtained nuclear The acid sample is obtained by amplification of the exon-specific primers of the HNFL4 and HN B genes, and the nucleic acid sequence of the nucleic acid sample is compared with the sequences of SEQ ID NO: 1 and SEQ ID NO: 3, respectively.
  • the resulting nucleic acid sequence has at least one mutation of c.703C>T, c.338G>T and c.358A>G, indicating that the biological sample is susceptible to adult onset diabetes in young people.
  • a biological sample of adult-onset diabetes in a susceptible young person can be effectively screened.
  • the method and apparatus for aligning a nucleic acid sequence with SEQ ID NO: 1 and SEQ ID NO: 3 are not particularly limited, and may be operated by any conventional software, according to the specific For example, SOAPALIGNER/SOAP2 can be used for comparison.
  • the use of the "method of screening biological samples of adult-onset diabetes among susceptible young people" according to an embodiment of the present invention is not particularly limited, and for example, it can be used as a screening method for non-diagnostic purposes.
  • System and kit for screening biological samples of adult-onset diabetes in susceptible young people is not particularly limited, and for example, it can be used as a screening method for non-diagnostic purposes.
  • the present invention provides a system capable of effectively performing the above-described method of screening a biological sample of adult-onset diabetes in a susceptible young person.
  • a system 1000 for screening a biological sample of adult-onset diabetes in a susceptible young person comprises: a nucleic acid extraction device 100, a nucleic acid sequence determining device 200, and a judging device 300.
  • the nucleic acid extraction device 100 is for extracting a nucleic acid sample from a biological sample.
  • the type of the nucleic acid sample is not particularly limited, and for RA as a nucleic acid sample, the nucleic acid extraction device further includes an RA extraction unit 101 and a reverse transcription unit 102, wherein The unit 101 is for extracting an RA sample from a biological sample, and the reverse transcription unit 102 is connected to the RA extraction unit 101 for performing a reverse transcription reaction on the RA sample to obtain a cDNA sample, and the obtained cDNA sample constitutes a nucleic acid sample.
  • the nucleic acid sequence determining device 200 is coupled to the nucleic acid extraction device 100 for analyzing a nucleic acid sample to determine a nucleic acid sequence of the nucleic acid sample.
  • the nucleic acid sequence of the nucleic acid sample can be determined using sequencing methods.
  • the nucleic acid sequence determining apparatus 200 may further include: a library construction unit 201 and a sequencing unit 202.
  • the library construction unit 201 is configured to construct a nucleic acid sequencing library for the nucleic acid sample; the sequencing unit 202 is connected to the library construction unit 201 for sequencing the nucleic acid sequencing library to obtain a sequencing result composed of a plurality of sequencing data.
  • the HNF1B and HNF1A exons can be amplified by PCR to further improve the efficiency of screening biological samples of adult-onset diabetes in susceptible young people.
  • the library construction unit 201 may further include a PCR amplification module (not shown) in which at least one selected from the group consisting of HNF1B and HNF1A gene exon-specific primers is provided in order to utilize At least one of the HNF1B and HNF1A gene exon-specific primers, the nucleic acid sample is subjected to PCR amplification, and according to a specific embodiment of the present invention, the HNF1B gene exon-specific primer has SEQ ID NOs: 5-6 The nucleotide sequence shown, the HNF1A gene exon-specific primer has the nucleotide sequence shown in SEQ ID NOS: 7-8.
  • the sequencing unit 202 may include at least one selected from the group consisting of HISEQ2000, SOLiD, 454, and a single molecule sequencing device. Therefore, combined with the latest sequencing technology, a high sequencing depth can be achieved for a single site, and the detection sensitivity and accuracy are greatly improved, so that the high-throughput and deep sequencing characteristics of these sequencing devices can be utilized to further improve the The efficiency of detection and analysis of nucleic acid samples. Thereby, the accuracy and accuracy of subsequent analysis of the sequenced data are improved.
  • the determining device 300 is coupled to the nucleic acid sequence determining device 200, and is adapted to align the nucleic acid sequences of the nucleic acid samples so as to be based on the nucleic acid sequences of the nucleic acid samples and the SEQ ID NO: 1 and SEQ ID NO: Distinguish whether the biological sample is susceptible to adult-onset diabetes in young people. Specifically, based on the nucleic acid sequence of the nucleic acid sample or its complementary sequence, it has a C.703OT mutation compared to SEQ ID NO: 1, or has a selection from c.338G>T and c compared to SEQ ID NO:3.
  • At least one mutation of 358A>G determines whether the biological sample is susceptible to adult-onset diabetes in young people.
  • the nucleic acid sequence of the nucleic acid sample or the complement thereof has a C.703 OT mutation selected from SEQ ID NO: 1, or has a selection compared to SEQ ID NO: 3.
  • At least one mutation from c.338G>T and c.358A>G is indicative of adult-onset diabetes in a biologically susceptible young person.
  • the apparatus for aligning the nucleic acid sequence with SEQ ID NO: 1 and SEQ ID NO: 3 is not particularly limited and may be operated by any conventional software, for example, according to Specific examples of the present invention can be aligned using SOAPALIGNER/SOAP2.
  • the above-described method for screening a biological sample of adult-onset diabetes in a susceptible young person can be effectively carried out, thereby effectively screening a biological sample of adult-onset diabetes in a susceptible young person.
  • the present invention provides a kit for screening a biological sample of adult onset diabetes in a susceptible young person.
  • the kit for screening a biological sample of adult onset diabetes in a susceptible young person comprises: an agent suitable for detecting at least one of the HNF1B and HNF1A gene mutants, wherein SEQ ID NO The HA ⁇ JB gene mutant has a C.703OT mutation compared to SEQ ID NO: 3, and the HNFL4 gene mutant has at least one mutation selected from the group consisting of c.338G>T and c. 358A>G.
  • the kit according to the embodiment of the present invention it is possible to effectively screen biological samples of adult-onset diabetes in a susceptible young person.
  • the term "agent suitable for detecting at least one of the HNF1B and HNF1A gene mutants" should be understood broadly, that is, an agent that detects at least one of the HA ⁇ JB and HNFM-encoding genes, or It is an agent that detects at least one of the HNF1B and HNF1A mutants, and for example, an antibody that recognizes a specific site can be used.
  • the reagent is a nucleic acid probe.
  • FIG. 2 shows the family diagrams of the CZ01 and CZ02 families.
  • 2A is a family diagram of the CZ01 family
  • FIG. 2B is a family diagram of the CZ02 family.
  • O indicates normal women
  • Qin indicates that it is impossible to determine whether a woman is ill
  • LJ indicates a normal male
  • the figure indicates a male who is unable to determine whether the disease is ill
  • the country indicates a male patient; indicates a female patient; indicates a deceased male Patient
  • indicates a deceased female patient;
  • indicates a deceased normal male;
  • indicates a deceased normal female;
  • the arrow indicates a proband.
  • CZ01 family has 3 generations of survival, a total of 8 members, including 5 patients
  • CZ02 family has 3 generations of survival, a total of 6 members, including 2 patients.
  • the inventors conducted a thorough and detailed physical examination of all patients in CZ01. The results showed that all patients had typical clinical features of diabetes. Among them, no renal dysfunction was detected in the ultrasound examination of the proband, and the mother and grandfather passed. Repeated nephropathy was found to have a horseshoe kidney. Since the renal dysfunction reported in the clinical phenotype is a dominant feature, the HNF1B gene was not detected before the exon was sequenced.
  • the patient first detected diabetes symptoms at 17 years of age, late insulin therapy and oral hypoglycemic agents. More than three patients in the family have developed before the age of 25, involving 3 or more generations of family members, which are autosomal dominant; the penetrance rate is high, which can exceed 90%.
  • CZ02 family proband, female, 21 years old, body mass index 28.6.
  • the patient first detected the symptoms of diabetes at the age of 17 and was treated with oral hypoglycemic agents.
  • Family members of the family with 3 or more generations are autosomal dominant; the penetrance rate is high, which can exceed 90%.
  • the pathological diagnosis of the two probands is: adult onset diabetes in young people.
  • the inventors collected DNA samples from three patients and three normal persons surviving the above CZ01 family, two patients surviving in the CZ02 family, and four normal human DNA samples.
  • the whole exome group is sequenced to determine the pathogenic genes and mutation sites.
  • the inventors used the Agilent Sureselect 44M Kit in combination with Solexa high-throughput sequencing technology to perform the above-mentioned probands in the family of CZ01 patients and their diseased mothers and probands in the families of normal fathers and CZ02 patients, and their affected mothers and normal fathers.
  • Exon sequencing the specific steps are as follows:
  • genomic DNA was extracted by conventional salting out method, and the concentration and purity of DNA were measured by spectrophotometer.
  • the OD260/OD280 of the genomic DNA of the specimen is located between 1.7 and 2.0, the concentration is not less than 200 ng ⁇ l, and the total amount is not less than 30 ⁇ .
  • the raw sequencing data obtained above was processed using Illumina basecalling Software 1.7, and after filtration and decontamination, SOAPaligner/SOAP2 was used (see: Li R, Li Y, Kristiansen K, et al, SOAP: short oligonucleotide alignment program. Bioinformatics 2008 , 24(5): 713-714; Li R, Yu C, Li Y, ea al, SOAP2: an improved ultrafast tool for short read alignment. Bioinformatics 2009, 25(15): 1966-1967, by reference Incorporating into the reference genome UCSC NCBI37/hgl9, in order to obtain a unique aligned sequence aligned to the genome.
  • the remaining non-synonymous/splicing site mutations and microinteger deletions are preferentially selected according to the following characteristics: (a) Selection of isogenic homologous and mutant mutations shared between the two patients, filtering out mutations present in normal humans There are 343 SNP mutation sites and 39 Indel mutations in the CZ01 family; there are 336 SNP mutation sites and 45 Indel mutations in the CZ02 family; (b) Among these mutation sites, the MODY known genes are preferentially searched. Mutation, the C.703OT mutation of the HNF1B gene was found in the CZ01 family; the c.338G>T mutation of the HNF1A gene was found in the CZ02 family.
  • the peripheral venous blood of 3 patients and 3 normal family members, 2 patients in CZ02, 4 normal family members, 1 sporadic patient, and 95 normal off-campus normal persons in the above-mentioned family CZ01 were collected.
  • the genomic DNA in the peripheral blood leukocytes was extracted by conventional salting out method, and the concentration and purity of the extracted DNA were measured by a spectrophotometer.
  • the OD260/OD280 of each specimen genomic DNA was between 1.7 and 2.0, and the concentration was not Less than 200 ng/ ⁇ l, the total amount is not less than 6 g, whereby 108 DNA samples are obtained and used.
  • the exon 3 of the HNF1B gene having the nucleotide sequence shown by SEQ ID NOS: 5-6 is designed, and has the SEQ ID NOs: 7-8
  • the HNF1A gene exon 2 specific primer of the nucleotide sequence is shown in the following table.
  • each PCR reaction system of each genomic DNA sample was prepared and subjected to a PCR reaction according to the following ratios.
  • each PCR reaction system of each genomic DNA sample was prepared according to the following ratios (different mutation sites use the same reaction system):
  • each PCR reaction system of the exon 2 of the HNF1B gene exon 2 HNFL4 gene obtained is subjected to PCR reaction according to the following reaction conditions (different mutation sites are used for the same reaction conditions): Reaction conditions:
  • PCR amplification products were obtained from 3 patients in the above-mentioned family CZ01 and 3 normal persons in the family, 2 patients in CZ02, 4 normal persons in the family, 1 sporadic patient, and 95 normal off-campus normal persons. .
  • each PCR reaction system of each genomic DNA sample was prepared for DNA sequencing according to the following ratios (different mutation sites, the same reaction system, BigDye® Terminator v3.1 Cycle Sequncing kit, Applied Biosystems):
  • each PCR reaction system for preparing the exon 2 of the HNF1B exon HNFJ4 gene obtained was subjected to a PCR reaction under the following reaction conditions and then subjected to sequencing using DNA sequencer ABI Prism 3130x1 (Applied Biosystems, Warrington, UK) (different The mutation site uses the same reaction conditions): Reaction conditions: 96 °C 1 min
  • HN B HNFM gene coding sequence alignment was performed on each of the above samples.
  • the inventors found that patients in their families CZ01 and CZ02 and their normal families had co-segregation in c.703C>T (HNF1B) and c.338G>T (HNF1A), respectively, and patients in families CZ01 and CZ02 carried C. 703OT and c.338G>T mutations, and c.358A>G mutation (HNF1A) was found in a sporadic patient.
  • HNF1B gene was found in 95 out-of-home normal humans without C.703OT mutation; 95 out-of-home normals None of the HNF1A genes were found to have c.338G>T and c.358A>G mutations. Thus, the C.703OT mutation of the HNF1B gene and the c.338G>T and c.358A>G mutations of the HNF1A gene were found to be young. A pathogenic mutation in adult onset diabetes.
  • Figure 3 shows the Sanger sequencing verification peak of the HA ⁇ JB gene mutation site in the CZ01 family of MODY patients
  • Figure 4 shows the HNF1A gene mutation site in the CZ02 family and normal subjects.
  • the Sanger sequencing verification peak map shows the Sanger sequencing verification peak of the HNF1A gene mutation site in MODY spora patients and off-normal subjects.
  • the HNF1B gene of the CZ01 family has a C.703OT mutation, and the normal person in the family does not have the mutation;
  • the HNFL4 gene of the CZ02 family has a c.338G>T mutation, and the family The normal person does not have this mutation;
  • the HNFJ4 gene of the sporadic patient has a c.358A>G mutation, and the normal human outside the family does not have the mutation at the corresponding site.
  • the inventors performed mutations on the sequence of exon 3 of HNF1B gene and exon 2 of HNF1A gene in patient family members, and found that three patients of family CZ01 had C.703OT (HNF1B) mutation, and two of family CZ02.
  • the patient had a c.338G>T (HNF1A) mutation; a c.358A>G (HNF1A) mutation was found in the sporadic patients; the mutation was not found in 95 out of normal families from the same region as the patient.
  • the C.703OT mutation of the HNF1B group and the c.338G>T and c.358A>G mutation of the HNF1A gene are pathogenic mutations in adult onset diabetes in young people.
  • HNF1A hepatocyte nuclear factor l ⁇
  • the region and the DNA binding region are constructed.
  • the gene is located on chromosome 12q24.2, with a total length of 3241 bp, wherein the protein coding region is 1 896 bp long.
  • the HNF1A DNA binding region contains a POU-homology domain sequence, and the dimeric region forms a homodimer and HNF1B forms a heterodimer to collectively regulate target gene expression.
  • Hepatocyte nuclear factor 1 ⁇ is a transcription factor that plays an important role in the formation of pancreatic cells and the maintenance of blood glucose homeostasis.
  • HNF1B is an HNF1A analog expressed in vertebrate, highly homologous to HNF1A, and its biological activity is significantly lower than that of HNF1A. When HNF1A expression is down-regulated, HNF1B can be compensated for.
  • This study may be applied to, but is not limited to, HNF1B gene mutation screening and clinical phenotypic analysis in special patients with no renal structural/dysfunction in diabetic patients; on the other hand, HN B gene mutations will be evaluated in Czech young people. The relationship between susceptibility to adult-onset diabetes.
  • the inventors further demonstrated that the HA ⁇ JB and HNFL4 genes are pathogenicity-related genes for adult-onset diabetes in young adults, the C.703OT mutation of the HNF1B gene, and the c.338G>T and c.358A of the HNF1A gene.
  • the G mutation is a causative mutation in adult-onset diabetes in young people.
  • HNF1B C.703OT
  • HNF1A c.338G>T
  • HNF1A c.358A>G
  • HNF1A the exon 3 of HNF1B gene was utilized.
  • the exon 2 primer sequence of the HNFL4 gene was subjected to the corresponding method of Example 2, and the genomic DNA samples of 95 out-of-home normal humans obtained in Example 2 were subjected to HNF1B m exon 2 of the exon 2 HNFL4 gene. Proton mutation site detection. As a result, the inventors found that the above mutations were not found in 95 normal off-campus normal persons.
  • the inventors further demonstrated that the C.703OT mutation of the HA ⁇ £ gene, the c.338G>T and the c.358A>G mutation of the HNFL4 gene are pathogenic mutations in adult onset diabetes in young people.
  • the isolated nucleic acid, the isolated polypeptide of the present invention, the method, system and kit for screening biological samples of adult onset diabetes in susceptible young people can be effectively applied to detect whether a biological sample is susceptible to adult onset in young people Type 2 diabetes, and the results obtained are accurate and reliable.

Landscapes

  • Chemical & Material Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Organic Chemistry (AREA)
  • Health & Medical Sciences (AREA)
  • Biochemistry (AREA)
  • Engineering & Computer Science (AREA)
  • Genetics & Genomics (AREA)
  • Molecular Biology (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Biotechnology (AREA)
  • General Health & Medical Sciences (AREA)
  • Wood Science & Technology (AREA)
  • Analytical Chemistry (AREA)
  • Zoology (AREA)
  • Pathology (AREA)
  • Immunology (AREA)
  • Microbiology (AREA)
  • Biophysics (AREA)
  • Physics & Mathematics (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • General Engineering & Computer Science (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)

Abstract

Provided are an isolated nucleic acid, an isolated polypeptide, a method for screening a biological sample of maturity onset diabetes of the susceptible young, a system for screening a biological sample of maturity onset diabetes of the susceptible young, and a kit for screening a biological sample of maturity onset diabetes of the susceptible young, wherein the isolated nucleic acid has c.703C>T mutation in comparison with SEQ ID NO: 1, or has at least one mutation selected from the group consisting of c.338G>T and c.358A>G compared with SEQ ID NO: 3.

Description

基因突变体及其应用 优先权信息  Gene Mutants and Their Applications Priority Information
无 技术领域  No technical field
本发明涉及基因突变体及其应用。 具体地, 本发明涉及分离的核酸、 分离的多肽、 筛选易感青年人中的成年发病型糖尿病的生物样品的方法、筛选易感青年人中的成年发 病型糖尿病的生物样品的系统以及用于筛选易感青年人中的成年发病型糖尿病的生物 样品的试剂盒。 背景技术  The present invention relates to gene mutants and uses thereof. In particular, the present invention relates to an isolated nucleic acid, an isolated polypeptide, a method of screening a biological sample of adult-onset diabetes in a susceptible young person, a system for screening a biological sample of adult-onset diabetes in a susceptible young person, and A kit for screening biological samples of adult onset diabetes in susceptible young people. Background technique
青年人中的成年发病型糖尿病 (maturity onset diabetes of the young, MODY) , 是一 组具有高度遗传及临床表型异质性的单基因疾病。 1975 年由 Fajans 和 Tattersall 依据 1950 年以来系列报道分析并命名, 并总结其以下临床特征: ( 1 ) 累及 3代或以上家族 成员, 呈常染色体显性遗传, 而与人类白细胞抗原(HLA)无关; (2 ) 家族中一般有 2 个以上患者在 25岁以前发病; (3 ) 外显率高, 可超过 90%; ( 4 ) 病程进展緩慢, 在 青年期可以无症状或仅表现为糖耐量减低; (5 ) —般无酮症酸中毒, 至少在发病 2年 内不依赖胰岛素治疗。  Maturity onset diabetes of the young (MODY) is a group of monogenic diseases with high genetic and clinical phenotype heterogeneity. In 1975, Fajans and Tattersall analyzed and named the series according to the series since 1950, and summarized the following clinical features: (1) involving 3 or more generations of family members, autosomal dominant, not related to human leukocyte antigen (HLA) (2) There are usually more than 2 patients in the family who are ill before the age of 25; (3) The penetrance rate is high, which can exceed 90%; (4) The disease progresses slowly, and can be asymptomatic or only exhibits glucose tolerance during adolescence. Reduced; (5) General ketoacidosis, at least for 2 years after the onset of insulin treatment.
现阶段, 寻找致病基因对疾病诊断或治疗意义重大。 虽然目前已发现一些青年人中 的成年发病型糖尿病的致病基因, 例如 GCK、 HNF1A、 HNF4A、 HNF1B 基因, 但仍 存在着相当一部分未知致病基因位点。  At this stage, finding a disease-causing gene is of great significance for the diagnosis or treatment of the disease. Although some of the pathogenic genes of adult-onset diabetes mellitus have been found in young people, such as the GCK, HNF1A, HNF4A, and HNF1B genes, there are still a number of unknown pathogenic gene loci.
因而, 目前对青年人中的成年发病型糖尿病的研究仍有待深入。 发明内容  Therefore, the current research on adult-onset diabetes in young people remains to be deepened. Summary of the invention
本发明旨在至少解决现有技术中存在的技术问题之一。 为此, 本发明的一个目的在 于提出一种能够有效筛选易感青年人中的成年发病型糖尿病的生物样品的方法。  The present invention aims to solve at least one of the technical problems existing in the prior art. To this end, it is an object of the present invention to provide a method for efficiently screening biological samples of adult onset diabetes in susceptible young people.
本发明是基于发明人的下列工作而完成的:发明人通过高通量外显子组测序联合候 选基因突变验证的方法确定了青年人中的成年发病型糖尿病的致病基因上的新突变。  The present invention was accomplished based on the inventors' work in that the inventors identified new mutations in the causative genes of adult onset diabetes in young adults by high-throughput exome sequencing combined with candidate gene mutation verification.
根据本发明的第一方面, 本发明提出了一种分离的核酸。 根据本发明的实施例, 与 According to a first aspect of the invention, the invention proposes an isolated nucleic acid. According to an embodiment of the invention,
SEQ ID NO: 1相比, 该核酸具有 C.703OT突变; 或者与 SEQ ID NO: 3相比, 该核 酸具有选自下列的至少一种突变: c.338G>T和 c.358A>G。 #居本发明的实施例, 发明 人确定了 HNF1B和 HNF1A基因突变体, 这些新突变体与青年人中的成年发病型糖尿 病的发病密切相关,从而通过检测这些新突变体在生物样品中是否存在, 可以有效地检 测生物样品是否易感青年人中的成年发病型糖尿病。 The nucleic acid has a C.703 OT mutation compared to SEQ ID NO: 1, or the nucleic acid has at least one mutation selected from the group consisting of c.338G>T and c.358A>G compared to SEQ ID NO:3. In the examples of the present invention, the inventors identified HNF1B and HNF1A gene mutants, which are closely related to the onset of adult-onset diabetes in young people, thereby detecting the presence or absence of these new mutants in biological samples. It can effectively detect whether biological samples are susceptible to adult-onset diabetes in young people.
根据本发明的第二方面, 本发明提出了一种分离的多肽。 根据本发明的实施例, 与 SEQ ID NO: 2相比, 该分离的多肽具有 p.Arg235Trp突变; 或者与 SEQ ID NO: 4相比, 该分离的多肽具有选自下列的至少一种突变: p.Trpl l3Leu和 p.Lysl20Glu。 通过检测生物 样品中是否表达该多肽,可以有效地检测生物样品是否易感青年人中的成年发病型糖尿 病。 According to a second aspect of the invention, the invention provides an isolated polypeptide. According to an embodiment of the invention, SEQ ID NO: 2, the isolated polypeptide has a p. Arg235Trp mutation; or compared to SEQ ID NO: 4, the isolated polypeptide has at least one mutation selected from the group consisting of: p.Trpl l3Leu and p.Lysl20Glu . By detecting whether or not the polypeptide is expressed in the biological sample, it is possible to effectively detect whether the biological sample is susceptible to adult-onset diabetes in young people.
根据本发明的第三方面,本发明提出了一种筛选易感青年人中的成年发病型糖尿病 的生物样品的方法。 根据本发明的实施例, 该方法包括以下步骤: 从生物样品提取核酸 样本; 确定所述核酸样本的核酸序列; 所述核酸样本的核酸序列或其互补序列, 与 SEQ ID NO: 1相比具有选自 C.703OT突变, 或者与 SEQ ID NO: 3相比具有选自 c.338G>T 和 c.358A>G的至少一种突变, 是所述生物样品易感青年人中的成年发病型糖尿病的指 示。 通过根据本发明实施例的筛选易感青年人中的成年发病型糖尿病的生物样品的方 法, 可以有效地筛选易感青年人中的成年发病型糖尿病的生物样品。  According to a third aspect of the invention, the invention provides a method of screening a biological sample of adult onset diabetes in a susceptible young person. According to an embodiment of the invention, the method comprises the steps of: extracting a nucleic acid sample from a biological sample; determining a nucleic acid sequence of the nucleic acid sample; having a nucleic acid sequence of the nucleic acid sample or a complementary sequence thereof, having SEQ ID NO: 1 An at least one mutation selected from the group consisting of C. 703 OT mutations or having SEQ ID NO: 3 and having at least one mutation selected from the group consisting of c.338G>T and c.358A>G is an adult-onset type in a susceptible young person of the biological sample. Instructions for diabetes. By screening a biological sample of adult-onset diabetes in a susceptible young person according to an embodiment of the present invention, a biological sample of adult-onset diabetes in a susceptible young person can be effectively screened.
根据本发明的第四方面,本发明提出了一种筛选易感青年人中的成年发病型糖尿病 的生物样品的系统。 根据本发明的实施例, 该系统包括: 核酸提取装置, 所述核酸提取 装置用于从所述生物样品提取核酸样本; 核酸序列确定装置, 所述核酸序列确定装置与 所述核酸提取装置相连, 用于对所述核酸样本进行分析, 以便确定所述核酸样本的核酸 序列; 判断装置, 所述判断装置与所述核酸序列确定装置相连, 以便基于所述核酸样本的 核酸序列或其互补序列, 与 SEQ ID NO: 1相比具有 C.703OT突变, 或者与 SEQ ID NO: 3相比具有选自 c.338G>T和 c.358A>G的至少一种突变, 判断所述生物样品是否易感青年 人中的成年发病型糖尿病。 利用该系统, 能够有效地实施前述筛选易感青年人中的成年 发病型糖尿病的生物样品的方法,从而可以有效地筛选易感青年人中的成年发病型糖尿 病的生物样品。  According to a fourth aspect of the invention, the invention provides a system for screening biological samples of adult onset diabetes in susceptible young people. According to an embodiment of the present invention, the system includes: a nucleic acid extraction device for extracting a nucleic acid sample from the biological sample; a nucleic acid sequence determining device, the nucleic acid sequence determining device being coupled to the nucleic acid extraction device, For analyzing the nucleic acid sample to determine a nucleic acid sequence of the nucleic acid sample; a judging device, the judging device being coupled to the nucleic acid sequence determining device, based on the nucleic acid sequence of the nucleic acid sample or a complementary sequence thereof, Having a C.703OT mutation compared to SEQ ID NO: 1, or having at least one mutation selected from c.338G>T and c.358A>G compared to SEQ ID NO: 3, determining whether the biological sample is easy Adults with age-onset diabetes. With this system, the above-described method of screening biological samples of adult-onset diabetes among susceptible young people can be effectively carried out, thereby effectively screening biological samples of adult-onset diabetes in susceptible young people.
根据本发明的第五方面,本发明提出了一种用于筛选易感青年人中的成年发病型糖 尿病的生物样品的试剂盒。 根据本发明的实施例, 该试剂盒含有: 适于检测 HNF1B和 HNFL4基因突变体的至少一种的试剂, 其中与 SEQ ID NO: 1相比, 所述 HN ^B基因 突变体具有 C.703OT突变; 与 SEQ ID NO: 3相比, 所述 HNFL4基因突变体具有选自 下列的至少一种突变: c.338G>T和 c.358A>G。 利用 #居本发明的实施例的试剂盒, 能 够有效地筛选易感青年人中的成年发病型糖尿病的生物样品。  According to a fifth aspect of the present invention, the present invention provides a kit for screening a biological sample of adult onset diabetes in a susceptible young person. According to an embodiment of the present invention, the kit comprises: an agent suitable for detecting at least one of the HNF1B and HNFL4 gene mutants, wherein the HN ^B gene mutant has C.703OT compared to SEQ ID NO: 1. Mutation; compared to SEQ ID NO: 3, the HNFL4 gene mutant has at least one mutation selected from the group consisting of: c.338G>T and c.358A>G. Using the kit of the embodiment of the present invention, it is possible to efficiently screen biological samples of adult-onset diabetes in susceptible young people.
本发明的附加方面和优点将在下面的描述中部分给出,部分将从下面的描述中变得 明显, 或通过本发明的实践了解到。 附图说明  The additional aspects and advantages of the invention will be set forth in part in the description which follows. DRAWINGS
本发明的上述和 /或附加的方面和优点从结合下面附图对实施例的描述中将变得明 显和容易理解, 其中:  The above and/or additional aspects and advantages of the present invention will become apparent and readily understood from
图 1 : 显示了根据本发明一个实施例的筛选易感青年人中的成年发病型糖尿病的生 物样品的系统及其组成部分的示意图, 其中,  Figure 1 is a schematic view showing a system for screening a biological sample of adult-onset diabetes mellitus in a susceptible young person, and a component thereof, according to an embodiment of the present invention, wherein
A 为根据本发明实施例的筛选易感青年人中的成年发病型糖尿病的生物样品的系 统的示意图, A is a line for screening biological samples of adult-onset diabetes in susceptible young people according to an embodiment of the present invention. Schematic diagram,
B为根据本发明实施例的核酸提取装置的示意图,  B is a schematic diagram of a nucleic acid extraction device according to an embodiment of the present invention,
C为根据本发明实施例的核酸序列确定装置的示意图;  C is a schematic diagram of a nucleic acid sequence determining device according to an embodiment of the present invention;
图 2:显示了根据本发明一个实施例的 MODY患者家系 CZ01和 CZ02的家系图语, 其中,  Figure 2: shows the family diagram of the MODY patient family CZ01 and CZ02 according to one embodiment of the present invention, wherein
A为根据本发明实施例的 CZ01家系的家系图谱,  A is a family diagram of the CZ01 family according to an embodiment of the present invention,
B为根据本发明实施例的 CZ02家系的家系图谱;  B is a family diagram of a CZ02 family according to an embodiment of the present invention;
图 3 : 显示了根据本发明一个实施例, MODY患者家系 CZ01家系内的患者及正常 人的 HNF1B基因突变位点的 Sanger测序验证峰图;  Fig. 3 is a view showing a Sanger sequencing verification peak of a HNF1B gene mutation site in a patient and a normal human in a family of CZ01 in a family of MODY patients according to an embodiment of the present invention;
图 4: 显示了根据本发明一个实施例, MODY患者家系 CZ02家系内的患者及正常 人的 HNF1A基因突变位点的 Sanger测序验证峰图;  Figure 4: shows a Sanger sequencing verification peak of the HNF1A gene mutation site in patients and their normal humans in the family of CZ02 in the MODY patient family according to one embodiment of the present invention;
图 5 : 显示了根据本发明一个实施例, MODY散发患者及家系外正常人的 HNFL4基 因突变位点的 Sanger测序验证峰图。 发明详细描述  Figure 5: shows a Sanger sequencing validation peak map of the HNFL4 gene mutation site in MODY sporadic patients and off-normal subjects in accordance with one embodiment of the present invention. Detailed description of the invention
下面详细描述本发明的实施例, 所述实施例的示例在附图中示出。 下面通过参考附 图描述的实施例是示例性的, 仅用于解释本发明, 而不能理解为对本发明的限制。  Embodiments of the invention are described in detail below, examples of which are illustrated in the accompanying drawings. The embodiments described below with reference to the drawings are intended to be illustrative only and not to limit the invention.
HNF1B及 HNF1A基因突变体  HNF1B and HNF1A gene mutants
根据本发明的第一方面,本发明提出了一种分离的核酸。根据本发明的实施例,与 SEQ ID NO: 1相比, 该核酸具有 C.703OT突变; 或者与 SEQ ID NO: 3相比, 该核酸具有选 自下列的至少一种突变: c.338G>T和 c.358A>G。 需要说明的是, 本发明的分离的核酸编码 HNF1B或者 HNF1A突变体, 也可以称为 "编码 HNF1B或者 HNF1A突变体的核酸", 即该 核酸可以理解为与编码 HNF1B或者 HNF1A突变体的基因相对应的核酸物质, 即核酸的类 型不受特别限制, 可以是任何包含与 HNF1B和 HNF1A的编码基因相对应的脱氧核糖核苷 酸和 /或核糖核苷酸的聚合物, 包括但不限于 DNA、 R A或 cDNA。根据本发明的一个具体 示例, 前面所述的编码 HNF1B和 HNF1A突变体的核酸为 DNA。 才艮据本发明的实施例, 发 明人确定了 HA^JB p HNFM基因的新突变体, 该突变体与青年人中的成年发病型糖尿病 的发病密切相关, 从而通过检测该突变体在生物样品中是否存在, 可以有效地检测生物样 品是否易感青年人中的成年发病型糖尿病, 也可以通过检测该突变体在生物体中是否存在, 可以有效地预测生物体是否易感青年人中的成年发病型糖尿病。  According to a first aspect of the invention, the invention proposes an isolated nucleic acid. According to an embodiment of the invention, the nucleic acid has a C.703OT mutation compared to SEQ ID NO: 1 or, in contrast to SEQ ID NO: 3, the nucleic acid has at least one mutation selected from the group consisting of: c.338G> T and c.358A>G. It should be noted that the isolated nucleic acid of the present invention encodes a HNF1B or HNF1A mutant, which may also be referred to as a "nucleic acid encoding a HNF1B or HNF1A mutant", that is, the nucleic acid may be understood to correspond to a gene encoding a HNF1B or HNF1A mutant. The nucleic acid substance, that is, the type of the nucleic acid is not particularly limited, and may be any polymer containing deoxyribonucleotides and/or ribonucleotides corresponding to the coding genes of HNF1B and HNF1A, including but not limited to DNA, RA. Or cDNA. According to a specific example of the present invention, the nucleic acid encoding the HNF1B and HNF1A mutants described above is DNA. According to an embodiment of the present invention, the inventors identified a novel mutant of the HA^JB p HNFM gene, which is closely related to the onset of adult-onset diabetes in young people, thereby detecting the mutant in a biological sample. Whether it exists, can effectively detect whether the biological sample is susceptible to adult-onset diabetes in young people, or can detect whether the organism is prevalent in the organism, and can effectively predict whether the organism is susceptible to adulthood in young people. Onset diabetes.
这些编码 HNF1B和 HNF1A突变体的核酸, 是本申请的发明人通过高通量外显子组测 序联合候选基因突变验证的方法, 确定的青年人中的成年发病型糖尿病的致病基因上的新 突变, 并且在现有技术中并未见到 HN B基因上的 C.703OT突变, 以及 HNFL4基因上的 c.338G>T、 c.358A>G突变与青年人中的成年发病型糖尿病相关的报道。  These nucleic acids encoding HNF1B and HNF1A mutants are new in the inventors of the present application by high-throughput exome sequencing combined with candidate gene mutation verification, and identified new genes in the pathogenic genes of adult onset diabetes in young people. Mutation, and the C.703OT mutation on the HN B gene is not seen in the prior art, and the c.338G>T, c.358A>G mutation on the HNFL4 gene is associated with adult onset diabetes in young people. Report.
野生型 HNF1B基因的 cDNA的核苷酸序列如下所示 (1674 nt):
Figure imgf000005_0001
Figure imgf000006_0001
The nucleotide sequence of the cDNA of the wild type HNF1B gene is shown below (1674 nt):
Figure imgf000005_0001
Figure imgf000006_0001
TDTSSISTLTNMSSSKQCPLQAW ( SEQ ID NO: 2 )。  TDTSSISTLTNMSSSKQCPLQAW (SEQ ID NO: 2).
野生型 HNFIA基因的 cDNA的核苷酸序列如下所示 (1896 nt): The nucleotide sequence of the cDNA of the wild type HNFIA gene is shown below (1896 nt):
NGQSHLLPSNHSVIETFISTQMASSSQ ( SEQ ID NO: 4 ), NGQSHLLPSNHSVIETFISTQMASSSQ ( SEQ ID NO: 4 ),
发明人发现的 HNF1B基因突变体与 SEQ ID NO: 1相比, 具有 C.703OT ( exon 3 ) 突 变, 即相对于野生型 HNF1B基因, 本发明的 HNF1B基因突变体的 cDNA中第 703位的 C 突变为 T(位于 exon 3 ),由此,其所编码的产物与肝细胞核因子 1 β (hepatocyte nuclear factorl β , HNF IB) ( SEQ ID NO: 2 )相比, 具有 p.Arg235Trp ( R235W ) 突变, 即其 235位氨基 酸从 Arg突变为 Trp。  The HNF1B gene mutant discovered by the inventors has a C.703OT (exon 3 ) mutation compared to SEQ ID NO: 1, that is, the 703th C in the cDNA of the HNF1B gene mutant of the present invention relative to the wild type HNF1B gene. Mutation to T (in exon 3), whereby the encoded product has a p.Arg235Trp (R235W) mutation compared to hepatocyte nuclear factor 1 β (HNF IB) (SEQ ID NO: 2) That is, its 235 amino acid is mutated from Arg to Trp.
此外, 发明人发现的 HNFJ4基因突变体与 SEQ ID NO: 3相比, 具有 c. 338G>T ( exon 2 )和 c.358A>G ( exon 2 )的至少一种突变, 即相对于野生型 HNF1A基因,本发明的 HNF1A 基因突变体的 cDNA中第 338位的 G突变为 T (位于 exon 2 )、 第 358位的 A突变为 G (位 于 exon 2 ),由此,其所编码的产物与肝细胞核因子 1 a (hepatocyte nuclear factorl α , HNF1A) ( SEQ ID NO: 4 )相比, 具有 p.Trpl l3Leu ( W113L )或者 p丄 ysl20Glu (K120E)突变, 即 其 113位氨基酸从 Trp突变为 Leu或者其 120位氨基酸从 Lys突变为 Glu。  Furthermore, the HNFJ4 gene mutant discovered by the inventors has at least one mutation of c. 338G>T (exon 2 ) and c.358A>G (exon 2 ) compared to SEQ ID NO: 3, ie, relative to wild type The HNF1A gene, in the cDNA of the HNF1A gene mutant of the present invention, has a G mutation at position 338 of T (in exon 2), and an A mutation at position 358 is G (in exon 2), whereby the encoded product thereof Hepatocyte nuclear factor 1 α (HNF1A) ( SEQ ID NO: 4 ) has p.Trpl l3Leu ( W113L ) or p丄ysl20Glu (K120E) mutation, ie its 113 amino acid is mutated from Trp to Leu Or its 120 amino acid is mutated from Lys to Glu.
本发明的发明人首次提出 HNF1B C.703OT突变可能导致无肾脏功能缺陷的个体 出现临床表现高血糖症状, H HNFlA基因的 c.338G>T及 c.358A>G突变导致患者出现青 年人中的成年发病型糖尿病症状。  The inventors of the present invention first proposed that the HNF1B C.703OT mutation may cause clinical manifestations of hyperglycemia in individuals without renal function defects, and the c.338G>T and c.358A>G mutations of the H HNF1A gene cause the patient to appear in young people. Adult onset diabetes symptoms.
根据本发明的第二方面,本发明提出了一种分离的多肽。根据本发明的实施例,与 SEQ According to a second aspect of the invention, the invention proposes an isolated polypeptide. According to an embodiment of the invention, with SEQ
ID NO: 2相比, 该分离的多肽具有 p.Arg235Trp突变; 或者与 SEQ ID NO: 4相比, 该分 离的多肽具有选自下列的至少一种突变: p.Trpl l3Leu和 p丄 ysl20Glu。根据本发明的一些具 体示例, 具有 p.Arg235Trp突变的多肽是由前述分离的编码 HNF1B突变体的核酸编码的, 具有 p.Trpl l3Leu和 p丄 ysl20Glu突变的多肽是由前述分离的编码 HNF1A突变体的核酸编 码的。 通过检测生物样品中是否表达该多肽, 可以有效地检测生物样品是否易感青年人中 的成年发病型糖尿病, 也可以通过检测这些多肽在生物体中是否存在, 可以有效地预测生 物体是否易感青年人中的成年发病型糖尿病。 筛选易感青年人中的成年发病型糖尿病的生物样品的方法 In contrast to ID NO: 2, the isolated polypeptide has a p.Arg235Trp mutation; or compared to SEQ ID NO: 4, the isolated polypeptide has at least one mutation selected from the group consisting of p.Trpl l3Leu and p丄ysl20Glu. According to some specific examples of the present invention, the polypeptide having the p.Arg235Trp mutation is encoded by the aforementioned isolated nucleic acid encoding the HNF1B mutant, and the polypeptide having the p.Trpl l3Leu and p丄ysl20Glu mutations is the HNF1A mutant encoded by the foregoing. Nucleic acid encoded. By detecting whether the polypeptide is expressed in the biological sample, it is possible to effectively detect whether the biological sample is susceptible to adult onset diabetes in young people, or to detect the presence of these polypeptides in the organism, and can effectively predict the growth. Whether the object is susceptible to adult-onset diabetes in young people. Method for screening biological samples of adult onset diabetes in susceptible young people
根据本发明的第三方面, 本发明提出了一种筛选易感青年人中的成年发病型糖尿病的 生物样品的方法。 根据本发明的实施例, 该筛选易感青年人中的成年发病型糖尿病的生物 样品的方法可以包括以下步骤:  According to a third aspect of the invention, the invention provides a method of screening a biological sample of adult onset diabetes in a susceptible young person. According to an embodiment of the present invention, the method of screening a biological sample of adult onset diabetes in a susceptible young person may comprise the following steps:
首先, 从生物样品提取核酸样本。 根据本发明的实施例, 生物样品的类型并不受特别 限制, 只要从该生物样品中能够提取到反映生物样品 HNF1B和 HNF1A是否存在突变的核 酸样本即可。 根据本发明的实施例, 生物样品可以为选自人体血液、 皮肤、 皮下组织的至 少一种。 由此, 可以方便地进行取样和检测, 从而能够进一步提高筛选易感青年人中的成 年发病型糖尿病的生物样品的效率。 根据本发明的实施例, 这里所使用的术语 "核酸样本" 应做广义理解,其可以是任何能够反映生物样品中 HA^JB pHNFM是否存在突变的样本, 例如可以是从生物样品中直接提取的全基因组 DNA, 也可以是该全基因组中包含 HNF1B 和 HNF1A编码序列的一部分, 可以是从生物样品中提取的总 R A, 也可以是从生物样品 中提取的 m NA。 根据本发明的一个实施例, 所述核酸样本为全基因组 DNA。 由此, 可以 扩大生物样品的来源范围, 并且可以同时对生物样品的多种信息进行确定, 从而能够提高 筛选易感青年人中的成年发病型糖尿病的生物样品的效率。 另外, 根据本发明的实施例, 针对釆用 R A作为核酸样本, 从生物样品提取核酸样本可以进一步包括: 从生物样品提取 RNA样本, 优选 R A样本为 mRNA; 以及基于所得到的 RNA样本, 通过反转录反应, 获 得 cDNA样本, 所得到的 cDNA样本构成核酸样本。 由此, 可以进一步提高利用 R A作为 核酸样本筛选易感青年人中的成年发病型糖尿病的生物样品的效率。  First, a nucleic acid sample is extracted from a biological sample. According to an embodiment of the present invention, the type of the biological sample is not particularly limited as long as a nucleic acid sample reflecting the presence or absence of a mutation in the biological samples HNF1B and HNF1A can be extracted from the biological sample. According to an embodiment of the present invention, the biological sample may be at least one selected from the group consisting of human blood, skin, and subcutaneous tissue. Thereby, sampling and detection can be conveniently performed, thereby further improving the efficiency of screening biological samples of adult-onset diabetes in susceptible young people. According to an embodiment of the present invention, the term "nucleic acid sample" as used herein shall be understood broadly, and may be any sample capable of reflecting the presence or absence of a mutation in HA^JB pHNFM in a biological sample, for example, may be directly extracted from a biological sample. Whole genome DNA, which may also be part of the whole genome comprising the HNF1B and HNF1A coding sequences, may be total RA extracted from a biological sample, or may be a NA NA extracted from a biological sample. According to an embodiment of the invention, the nucleic acid sample is whole genome DNA. Thereby, the source range of the biological sample can be expanded, and various information of the biological sample can be simultaneously determined, thereby improving the efficiency of screening biological samples of adult-onset diabetes in susceptible young people. In addition, according to an embodiment of the present invention, extracting a nucleic acid sample from a biological sample for using RA as a nucleic acid sample may further include: extracting an RNA sample from the biological sample, preferably the RA sample is mRNA; and based on the obtained RNA sample, The transcription reaction, obtaining a cDNA sample, and the obtained cDNA sample constitutes a nucleic acid sample. Thereby, the efficiency of screening a biological sample of adult-onset diabetes in a susceptible young person using R A as a nucleic acid sample can be further improved.
接下来, 在得到核酸样本之后, 可以对核酸样本进行分析, 从而能够确定所得到核酸 样本的核酸序列。 才艮据本发明的实施例, 确定所得到核酸样本的核酸序列的方法和设备并 不受特别限制。 根据本发明的具体实施例, 可以通过测序方法, 确定核酸样本的核酸序列。 根据本发明的实施例, 可以用于进行测序的方法和设备并不受特别限制。 根据本发明的实 施例, 可以釆用第二代测序技术, 也可以釆用第三代以及第四代或者更先进的测序技术。 根据本发明的具体示例, 可以利用选自 Hiseq2000、 SOLiD、 454和单分子测序装置的至少 一种对核酸序列进行测序。 由此, 结合最新的测序技术, 针对单个位点可以达到较高的测 序深度, 检测灵敏度和准确性大大提高, 因而能够利用这些测序装置的高通量、 深度测序 的特点, 进一步提高对核酸样本进行检测分析的效率。 从而, 能够提高后续对测序数据进 行分析时的精确性和准确度。 由此, 根据本发明的实施例, 确定核酸样本的核酸序列可以 进一步包括: 首先, 针对所得到的核酸样本, 构建核酸测序文库; 以及对所得到的核酸测 序文库进行测序, 以便获得由多个测序数据构成的测序结果。 根据本发明的一些实施例, 可以釆用选自 Hiseq2000、 SOLiD、 454和单分子测序装置的至少一种对所得到的核酸测序 文库进行测序。 另外, 根据本发明的实施例, 可以对核酸样本进行筛选, 富集 HNF1B 和 HNFJ4外显子, 该筛选富集可以在构建测序文库之前, 构建测序文库过程中, 或者构建测 序文库之后进行。 根据本发明的一个实施例, 针对核酸样本, 构建核酸测序文库进一步包 括: 利用选自 HN B和 HNFM基因外显子特异性引物的至少一种, 对核酸样本进行 PCR 扩增; 以及针对所得到的扩增产物, 构建核酸测序文库。 由此, 可以通过 PCR扩增, 富集 HNF1B和 HNF1A外显子, 从而能够进一步提高筛选易感青年人中的成年发病型糖尿病的 生物样品的效率。 根据本发明的实施例, HNF1B HNFM基因外显子特异性引物的序列 不受特别限制, 根据本发明的优选实施例, HN B基因外显子特异性引物可以具有如 SEQ ID NO: 5-6所示的核苷酸序列, HNF1A基因外显子特异性引物可以具有如 SEQ ID NO: 7-8 所示的核苷酸序列。 发明人惊奇地发现, 通过釆用 SEQ ID NO: 5-6所示的引物, 可以在 PCR反应体系中显著有效地完成对 HA^JB尤其是 C.703OT 所在的 3号外显子序列的扩增; 釆用 SEQ ID NO: 7-8所示的引物, 可以显著有效地完成对 HNF1A 尤其是 c.338G>T及 c.358A>G所在的 2号外显子序列的扩增。 需要说明的是, 这些 SEQ ID NO: 5-8所示的核 苷酸序列是本发明的发明人在付出了艰苦的劳动后, 意外获得的。 Next, after the nucleic acid sample is obtained, the nucleic acid sample can be analyzed to enable determination of the nucleic acid sequence of the resulting nucleic acid sample. According to an embodiment of the present invention, the method and apparatus for determining the nucleic acid sequence of the obtained nucleic acid sample are not particularly limited. According to a particular embodiment of the invention, the nucleic acid sequence of the nucleic acid sample can be determined by sequencing methods. The method and apparatus that can be used for sequencing according to embodiments of the present invention are not particularly limited. According to embodiments of the present invention, second generation sequencing techniques can be employed, and third generation and fourth generation or more advanced sequencing techniques can also be employed. According to a specific example of the present invention, the nucleic acid sequence can be sequenced using at least one selected from the group consisting of Hiseq2000, SOLiD, 454, and a single molecule sequencing device. Therefore, combined with the latest sequencing technology, high sequencing depth can be achieved for a single site, detection sensitivity and accuracy are greatly improved, and thus the high-throughput and deep sequencing characteristics of these sequencing devices can be utilized to further improve nucleic acid samples. The efficiency of the test analysis. Thereby, the accuracy and accuracy of subsequent analysis of the sequenced data can be improved. Thus, according to an embodiment of the present invention, determining the nucleic acid sequence of the nucleic acid sample may further comprise: first, constructing a nucleic acid sequencing library for the obtained nucleic acid sample; and sequencing the obtained nucleic acid sequencing library to obtain a plurality of The sequencing results of the sequencing data. According to some embodiments of the invention, the resulting nucleic acid sequencing library can be sequenced using at least one selected from the group consisting of Hiseq2000, SOLiD, 454, and a single molecule sequencing device. In addition, according to an embodiment of the present invention, nucleic acid samples can be screened and enriched for HNF1B and HNFJ4 exons, which can be used in the process of constructing a sequencing library, or constructing a test before constructing a sequencing library. The sequence library is followed. According to an embodiment of the present invention, constructing the nucleic acid sequencing library for the nucleic acid sample further comprises: performing PCR amplification on the nucleic acid sample using at least one selected from the group consisting of HN B and HNFM gene exon-specific primers; Amplification product, construct a nucleic acid sequencing library. Thus, the HNF1B and HNF1A exons can be enriched by PCR amplification, thereby further improving the efficiency of screening biological samples of adult-onset diabetes in susceptible young people. According to an embodiment of the present invention, the sequence of the HNF1B HNFM gene exon-specific primer is not particularly limited, and according to a preferred embodiment of the present invention, the HN B gene exon-specific primer may have SEQ ID NOs: 5-6 The nucleotide sequence shown, the HNF1A gene exon-specific primer may have a nucleotide sequence as shown in SEQ ID NOS: 7-8. The inventors have surprisingly found that by using the primers shown in SEQ ID NOS: 5-6, amplification of the exon 3 sequence of HA^JB, particularly C.703OT, can be performed in the PCR reaction system significantly and efficiently. The primers shown in SEQ ID NOS: 7-8 can be used to significantly and efficiently amplify the exon 2 sequence of HNF1A, particularly c.338G>T and c.358A>G. It is to be noted that these nucleotide sequences shown by SEQ ID NOS: 5-8 were unexpectedly obtained by the inventors of the present invention after exerting laborious labor.
Figure imgf000009_0001
Figure imgf000009_0001
关于针对核酸样本, 构建测序文库的方法和流程, 本领域技术人员可以根据不同的测 序技术进行适当选择, 关于流程的细节,可以参见测序仪器的厂商例如 Illumina公司所提供 的规程, 例 ¾口参见 Illumina公司 Multiplexing Sample Preparation Guide ( Part#1005361; Feb 2010 )或 Paired-End SamplePrep Guide ( Part# 1005063; Feb 2010 ), 通过参照将其并入本文。 根据本发明的实施例, 从生物样品提取核酸样本的方法和设备, 也不受特别限制, 可以釆 用商品化的核酸提取试剂盒进行。  For methods and procedures for constructing sequencing libraries for nucleic acid samples, those skilled in the art can appropriately select according to different sequencing technologies. For details of the processes, refer to the procedures provided by manufacturers of sequencing instruments such as Illumina, for example. Illumina Corporation Multiplexing Sample Preparation Guide (Part #1005361; Feb 2010) or Paired-End SamplePrep Guide (Part # 1005063; Feb 2010), which is incorporated herein by reference. According to an embodiment of the present invention, the method and apparatus for extracting a nucleic acid sample from a biological sample are also not particularly limited, and can be carried out using a commercially available nucleic acid extraction kit.
需要说明的是, 在这里所使用的术语 "核酸序列" 应作广义理解, 其可以是在对核酸 样本进行测序得到的测序数据进行组装后, 得到的完整的核酸序列信息, 也可以是直接釆 用通过对核酸样本进行测序所得到的测序数据 (reads )作为核酸序列, 只要这些核酸序列 中含有对应 HNF1B和 HNF1A的编码序列即可。  It should be noted that the term "nucleic acid sequence" as used herein should be understood in a broad sense, which may be the complete nucleic acid sequence information obtained after assembling the sequencing data obtained by sequencing the nucleic acid sample, or may be direct 釆The sequencing data obtained by sequencing the nucleic acid sample is used as the nucleic acid sequence as long as the nucleic acid sequence contains the coding sequences corresponding to HNF1B and HNF1A.
最后, 在确定核酸样本的核酸序列之后, 将所得到的核酸样本的核酸序列相应的参考 序列进行比对, 当所得到的核酸序列中具有前述的三种突变的至少之一时, 即指示生物样 品易感青年人中的成年发病型糖尿病。 具体地, 当获得的核酸样本是釆用 HN B基因外显 子特异性引物扩增获得时, 将该核酸样本的核酸序列与 SEQ ID NO: 1的序列相比对, 如果 在所得到的核酸序列中具有 C.703OT突变, 则指示生物样品易感青年人中的成年发病型糖 尿病。 当获得的核酸样本是釆用 HNFL4基因外显子特异性引物扩增获得时, 将该核酸样本 的核酸序列与 SEQ ID NO: 3的序列相比对, 如果在所得到的核酸序列中具有 c.338G>T和 c.358A>G至少一种突变, 则指示生物样品易感青年人中的成年发病型糖尿病。 当获得的核 酸样本是釆用 HNFL4及 HN B基因外显子特异性引物扩增获得时, 将该核酸样本的核酸 序列分别与 SEQ ID NO: 1和 SEQ ID NO: 3的序列相比对, 如果在所得到的核酸序列中具 有 c.703C>T、 c.338G>T和 c.358A>G至少一种突变, 则指示生物样品易感青年人中的成年 发病型糖尿病。 由此, 通过根据本发明实施例的筛选易感青年人中的成年发病型糖尿病的 生物样品的方法, 可以有效地筛选易感青年人中的成年发病型糖尿病的生物样品。 根据本 发明的实施例, 对核酸序列与 SEQ ID NO: 1 及 SEQ ID NO: 3进行比对的方法和设备并 不受特别限制, 可以釆用任意常规的软件进行操作, 根据本发明的具体实例, 可以釆用 SOAPALIGNER/SOAP2进行比对。 Finally, after determining the nucleic acid sequence of the nucleic acid sample, the corresponding reference sequence of the nucleic acid sequence of the obtained nucleic acid sample is aligned, and when the obtained nucleic acid sequence has at least one of the aforementioned three mutations, the biological sample is indicated Adults with age-onset diabetes. Specifically, when the obtained nucleic acid sample is obtained by amplification using an HN B gene exon-specific primer, the nucleic acid sequence of the nucleic acid sample is compared with the sequence of SEQ ID NO: 1, if the obtained nucleic acid is A C.703OT mutation in the sequence indicates that the biological sample is susceptible to adult onset diabetes in young people. When the obtained nucleic acid sample is obtained by amplification with an exon-specific primer of the HNFL4 gene, the nucleic acid sequence of the nucleic acid sample is aligned with the sequence of SEQ ID NO: 3, if it has c in the obtained nucleic acid sequence At least one mutation of .338G>T and c.358A>G indicates that the biological sample is susceptible to adult-onset diabetes in young people. When obtained nuclear The acid sample is obtained by amplification of the exon-specific primers of the HNFL4 and HN B genes, and the nucleic acid sequence of the nucleic acid sample is compared with the sequences of SEQ ID NO: 1 and SEQ ID NO: 3, respectively. The resulting nucleic acid sequence has at least one mutation of c.703C>T, c.338G>T and c.358A>G, indicating that the biological sample is susceptible to adult onset diabetes in young people. Thus, by screening a biological sample of adult-onset diabetes in a susceptible young person according to an embodiment of the present invention, a biological sample of adult-onset diabetes in a susceptible young person can be effectively screened. According to an embodiment of the present invention, the method and apparatus for aligning a nucleic acid sequence with SEQ ID NO: 1 and SEQ ID NO: 3 are not particularly limited, and may be operated by any conventional software, according to the specific For example, SOAPALIGNER/SOAP2 can be used for comparison.
需要说明的是, 根据本发明实施例的 "筛选易感青年人中的成年发病型糖尿病的生物 样品的方法" 的用途不受特别限制, 例如可以用作非诊断目的的筛选方法。 筛选易感青年人中的成年发病型糖尿病的生物样品的系统和试剂盒  It is to be noted that the use of the "method of screening biological samples of adult-onset diabetes among susceptible young people" according to an embodiment of the present invention is not particularly limited, and for example, it can be used as a screening method for non-diagnostic purposes. System and kit for screening biological samples of adult-onset diabetes in susceptible young people
根据本发明的第四方面, 本发明提出了一种能够有效实施上述筛选易感青年人中的成 年发病型糖尿病的生物样品的方法的系统。  According to a fourth aspect of the present invention, the present invention provides a system capable of effectively performing the above-described method of screening a biological sample of adult-onset diabetes in a susceptible young person.
参考图 1 , 根据本发明的实施例, 该筛选易感青年人中的成年发病型糖尿病的生物样品 的系统 1000包括: 核酸提取装置 100、 核酸序列确定装置 200以及判断装置 300。  Referring to Fig. 1, a system 1000 for screening a biological sample of adult-onset diabetes in a susceptible young person according to an embodiment of the present invention comprises: a nucleic acid extraction device 100, a nucleic acid sequence determining device 200, and a judging device 300.
根据本发明的实施例, 核酸提取装置 100用于从生物样品提取核酸样本。 如前所述, 根据本发明的实施例, 核酸样本的类型并不受特别限制, 对于釆用 R A作为核酸样本, 则 核酸提取装置进一步包括 R A提取单元 101和反转录单元 102, 其中, 提取单元 101用于 从生物样品提取 R A样本,反转录单元 102与 R A提取单元 101相连, 用于对 R A样本 进行反转录反应, 以便获得 cDNA样本, 所得到的 cDNA样本构成核酸样本。  According to an embodiment of the invention, the nucleic acid extraction device 100 is for extracting a nucleic acid sample from a biological sample. As described above, according to an embodiment of the present invention, the type of the nucleic acid sample is not particularly limited, and for RA as a nucleic acid sample, the nucleic acid extraction device further includes an RA extraction unit 101 and a reverse transcription unit 102, wherein The unit 101 is for extracting an RA sample from a biological sample, and the reverse transcription unit 102 is connected to the RA extraction unit 101 for performing a reverse transcription reaction on the RA sample to obtain a cDNA sample, and the obtained cDNA sample constitutes a nucleic acid sample.
才艮据本发明的实施例, 核酸序列确定装置 200与核酸提取装置 100相连, 用于对核酸 样本进行分析, 以便确定核酸样本的核酸序列。 如前所示, 可以釆用测序的方法确定核酸 样本的核酸序列。 由此, 根据本发明的一个实施例, 所述核酸序列确定装置 200可以进一 步包括: 文库构建单元 201以及测序单元 202。 文库构建单元 201用于针对核酸样本, 构建 核酸测序文库; 测序单元 202与文库构建单元 201相连, 用于对核酸测序文库进行测序, 以便获得由多个测序数据构成的测序结果。 如前所述, 可以通过 PCR扩增, 富臬 HNF1B 和 HNF1A外显子, 进一步提高筛选易感青年人中的成年发病型糖尿病的生物样品的效率。 由此, 文库构建单元 201可以进一步包括 PCR扩增模块(图中未示出), 在该 PCR扩增模 块中设置有选自 HNF1B和 HNF1A基因外显子特异性引物的至少一种,以便利用 HNF1B和 HNF1A基因外显子特异性引物的至少一种, 对所述核酸样本进行 PCR扩增,根据本发明的 具体实施例, HNF1B基因外显子特异性引物具有如 SEQ ID NO: 5-6所示的核苷酸序列, HNF1A基因外显子特异性引物具有如 SEQ ID NO: 7-8所示的核苷酸序列。 根据本发明的 实施例, 测序单元 202可以包括选自 HISEQ2000、 SOLiD、 454和单分子测序装置的至少一 种。 由此, 结合最新的测序技术, 针对单个位点可以达到较高的测序深度, 检测灵敏度和 准确性大大提高, 因而能够利用这些测序装置的高通量、 深度测序的特点, 进一步提高对 核酸样本进行检测分析的效率。 从而, 提高后续对测序数据进行分析时的精确性和准确度。 根据本发明的实施例, 判断装置 300与核酸序列确定装置 200相连, 适于将核酸样本 的核酸序列进行比对, 以便基于核酸样本的核酸序列与 SEQ ID NO: 1及 SEQ ID NO: 3的 区别判断生物样品是否易感青年人中的成年发病型糖尿病。 具体地, 基于所述核酸样本的 核酸序列或其互补序列, 与 SEQ ID NO: 1相比具有 C.703OT突变, 或者与 SEQ ID NO: 3相比具有选自 c.338G>T和 c.358A>G的至少一种突变, 判断生物样品是否易感青年人中 的成年发病型糖尿病。 如前所述, 根据本发明的一个实施例, 核酸样本的核酸序列或其互 补序列, 与 SEQ ID NO: 1相比具有选自 C.703OT突变, 或者与 SEQ ID NO: 3相比具有 选自 c.338G>T和 c.358A>G的至少一种突变, 是生物样品易感青年人中的成年发病型糖尿 病的指示。如前所述,根据本发明的实施例,对核酸序列与 SEQ ID NO: 1以及 SEQ ID NO: 3进行比对的设备并不受特别限制, 可以釆用任意常规的软件进行操作, 例如根据本发明的 具体实例, 可以釆用 SOAPALIGNER/SOAP2进行比对。 According to an embodiment of the present invention, the nucleic acid sequence determining device 200 is coupled to the nucleic acid extraction device 100 for analyzing a nucleic acid sample to determine a nucleic acid sequence of the nucleic acid sample. As indicated previously, the nucleic acid sequence of the nucleic acid sample can be determined using sequencing methods. Thus, according to an embodiment of the present invention, the nucleic acid sequence determining apparatus 200 may further include: a library construction unit 201 and a sequencing unit 202. The library construction unit 201 is configured to construct a nucleic acid sequencing library for the nucleic acid sample; the sequencing unit 202 is connected to the library construction unit 201 for sequencing the nucleic acid sequencing library to obtain a sequencing result composed of a plurality of sequencing data. As described above, the HNF1B and HNF1A exons can be amplified by PCR to further improve the efficiency of screening biological samples of adult-onset diabetes in susceptible young people. Thus, the library construction unit 201 may further include a PCR amplification module (not shown) in which at least one selected from the group consisting of HNF1B and HNF1A gene exon-specific primers is provided in order to utilize At least one of the HNF1B and HNF1A gene exon-specific primers, the nucleic acid sample is subjected to PCR amplification, and according to a specific embodiment of the present invention, the HNF1B gene exon-specific primer has SEQ ID NOs: 5-6 The nucleotide sequence shown, the HNF1A gene exon-specific primer has the nucleotide sequence shown in SEQ ID NOS: 7-8. According to an embodiment of the present invention, the sequencing unit 202 may include at least one selected from the group consisting of HISEQ2000, SOLiD, 454, and a single molecule sequencing device. Therefore, combined with the latest sequencing technology, a high sequencing depth can be achieved for a single site, and the detection sensitivity and accuracy are greatly improved, so that the high-throughput and deep sequencing characteristics of these sequencing devices can be utilized to further improve the The efficiency of detection and analysis of nucleic acid samples. Thereby, the accuracy and accuracy of subsequent analysis of the sequenced data are improved. According to an embodiment of the present invention, the determining device 300 is coupled to the nucleic acid sequence determining device 200, and is adapted to align the nucleic acid sequences of the nucleic acid samples so as to be based on the nucleic acid sequences of the nucleic acid samples and the SEQ ID NO: 1 and SEQ ID NO: Distinguish whether the biological sample is susceptible to adult-onset diabetes in young people. Specifically, based on the nucleic acid sequence of the nucleic acid sample or its complementary sequence, it has a C.703OT mutation compared to SEQ ID NO: 1, or has a selection from c.338G>T and c compared to SEQ ID NO:3. At least one mutation of 358A>G determines whether the biological sample is susceptible to adult-onset diabetes in young people. As described above, according to one embodiment of the present invention, the nucleic acid sequence of the nucleic acid sample or the complement thereof has a C.703 OT mutation selected from SEQ ID NO: 1, or has a selection compared to SEQ ID NO: 3. At least one mutation from c.338G>T and c.358A>G is indicative of adult-onset diabetes in a biologically susceptible young person. As described above, according to an embodiment of the present invention, the apparatus for aligning the nucleic acid sequence with SEQ ID NO: 1 and SEQ ID NO: 3 is not particularly limited and may be operated by any conventional software, for example, according to Specific examples of the present invention can be aligned using SOAPALIGNER/SOAP2.
由此, 利用该系统, 能够有效地实施前述筛选易感青年人中的成年发病型糖尿病的生 物样品的方法, 从而可以有效地筛选易感青年人中的成年发病型糖尿病的生物样品。  Thus, with the system, the above-described method for screening a biological sample of adult-onset diabetes in a susceptible young person can be effectively carried out, thereby effectively screening a biological sample of adult-onset diabetes in a susceptible young person.
根据本发明的第五方面, 本发明提出了一种用于筛选易感青年人中的成年发病型糖尿 病的生物样品的试剂盒。 根据本发明的实施例, 该用于筛选易感青年人中的成年发病型糖 尿病的生物样品的试剂盒包括:适于检测 HNF1B和 HNF1A基因突变体的至少一种的试剂, 其中与 SEQ ID NO: 1相比, 该 HA^JB基因突变体具有 C.703OT突变; 与 SEQ ID NO: 3 相比, 该 HNFL4基因突变体具有选自下列的至少一种突变: c.338G>T和 c.358A>G。 利用 根据本发明的实施例的试剂盒, 能够有效地筛选易感青年人中的成年发病型糖尿病的生物 样品。 在本文中, 所使用的术语 "适于检测 HNF1B和 HNF1A基因突变体的至少一种的试 剂" 应做广义理解, 即可以是检测 HA^JB和 HNFM编码基因的至少一种的试剂, 也可以 是检测 HNF1B和 HNF1A突变体的至少一种的试剂,例如可以釆用识别特异性位点的抗体。 根据本发明的一个实施例, 该试剂为核酸探针。 由此, 可以高效地筛选易感青年人中的成 年发病型糖尿病的生物样品。  According to a fifth aspect of the present invention, the present invention provides a kit for screening a biological sample of adult onset diabetes in a susceptible young person. According to an embodiment of the present invention, the kit for screening a biological sample of adult onset diabetes in a susceptible young person comprises: an agent suitable for detecting at least one of the HNF1B and HNF1A gene mutants, wherein SEQ ID NO The HA^JB gene mutant has a C.703OT mutation compared to SEQ ID NO: 3, and the HNFL4 gene mutant has at least one mutation selected from the group consisting of c.338G>T and c. 358A>G. With the kit according to the embodiment of the present invention, it is possible to effectively screen biological samples of adult-onset diabetes in a susceptible young person. As used herein, the term "agent suitable for detecting at least one of the HNF1B and HNF1A gene mutants" should be understood broadly, that is, an agent that detects at least one of the HA^JB and HNFM-encoding genes, or It is an agent that detects at least one of the HNF1B and HNF1A mutants, and for example, an antibody that recognizes a specific site can be used. According to an embodiment of the invention, the reagent is a nucleic acid probe. Thereby, biological samples of adult-onset diabetes in susceptible young people can be efficiently screened.
需要说明的是, 在本文前面筛选易感青年人中的成年发病型糖尿病的生物样品的方法 部分中所描述的特征和优点, 同样适用于筛选易感青年人中的成年发病型糖尿病的生物样 品的系统或者试剂盒, 在此不再赘述。  It should be noted that the features and advantages described in the method section for screening biological samples of adult-onset diabetes in susceptible young people are also applicable to screening biological samples of adult-onset diabetes in susceptible young people. The system or kit will not be described here.
此外,还需要说明的是,根据本发明实施例的筛选易感青年人中的成年发病型糖尿病 的生物样品的方法、 系统以及试剂盒, 是本申请的发明人经过艰苦的创造性劳动和优化工 作才完成的。 下面将结合实施例对本发明的方案进行解释。 本领域技术人员将会理解, 下面的实施 例仅用于说明本发明, 而不应视为限定本发明的范围。 实施例中未注明具体技术或条件的, 按照本领域内的文献所描述的技术或条件 (例如参考 J.萨姆布鲁克等著, 黄培堂等译的《分 子克隆实验指南》, 第三版, 科学出版社)或者按照产品说明书进行。 所用试剂或仪器未注 明生产厂商者, 均为可以通过市购获得的常规产品, 例如可以釆购自 Illumina公司。 In addition, it should be noted that the method, system and kit for screening biological samples of adult-onset diabetes in susceptible young people according to embodiments of the present invention are the inventors of the present application undergoing arduous creative labor and optimization work. Only completed. The solution of the present invention will be explained below in conjunction with the embodiments. Those skilled in the art will understand that the following examples are merely illustrative of the invention and should not be construed as limiting the scope of the invention. In the examples, the specific techniques or conditions are not indicated, according to the techniques or conditions described in the literature in the field (for example, refer to J. Sambrook et al., Huang Peitang et al., Molecular Cloning Experimental Guide, Third Edition, Science Press) or in accordance with the product manual. The reagent or instrument used is not marked Ming manufacturers are all conventional products that can be obtained through the market, for example, they can be purchased from Illumina.
实施例 1 全外显子组测序确定致病基因及突变位点  Example 1 Whole exome sequencing to determine pathogenic genes and mutation sites
1、 样本收集:  1. Sample collection:
发明人收集到 2个捷克 MODY患者家系 CZ01及 CZ02。 其中, 图 2分别显示了 CZ01 及 CZ02家系的家系图语。 其中, 图 2A为 CZ01家系的家系图谱, 图 2B为 CZ02家系的 家系图语。 如图 2所示, O表示正常女性; 秦表示无法确定是否患病的女性; LJ表示正 常男性; 圖表示无法确定是否患病的男性; 國表示男性患者; 着表示女性患者; 表示 已故男性患者; ^表示已故女性患者; ^表示已故正常男性; ^表示已故正常女性; 箭头所指为先证者。 其中 CZ01家系现有 3代存活, 共 8个成员, 其中患者 5人; CZ02家 系现有 3代存活, 共 6个成员, 其中患者 2人。 The inventor collected two families of Czech MODY patients, CZ01 and CZ02. Among them, Figure 2 shows the family diagrams of the CZ01 and CZ02 families. 2A is a family diagram of the CZ01 family, and FIG. 2B is a family diagram of the CZ02 family. As shown in Figure 2, O indicates normal women; Qin indicates that it is impossible to determine whether a woman is ill; LJ indicates a normal male; the figure indicates a male who is unable to determine whether the disease is ill; the country indicates a male patient; indicates a female patient; indicates a deceased male Patient; ^ indicates a deceased female patient; ^ indicates a deceased normal male; ^ indicates a deceased normal female; the arrow indicates a proband. Among them, CZ01 family has 3 generations of survival, a total of 8 members, including 5 patients; CZ02 family has 3 generations of survival, a total of 6 members, including 2 patients.
此外, 发明人对 CZ01现有所有患者进行了全面仔细的体检, 结果显示所有患者均具有 典型的糖尿病临床特征, 其中先证者肾脏超声波检测中未检测到任何肾脏功能异常, 其母 亲及外祖父经过反复肾病检测发现患有马蹄肾。 由于在临床表型中报道的肾脏功能异常是 显性特征, 故对先证者进行外显子测序前未做 HNF1B基因的检测。  In addition, the inventors conducted a thorough and detailed physical examination of all patients in CZ01. The results showed that all patients had typical clinical features of diabetes. Among them, no renal dysfunction was detected in the ultrasound examination of the proband, and the mother and grandfather passed. Repeated nephropathy was found to have a horseshoe kidney. Since the renal dysfunction reported in the clinical phenotype is a dominant feature, the HNF1B gene was not detected before the exon was sequenced.
如图 2所示, CZ01家系, 先证者, 女, 26岁, 体质量指数 21.2。 患者首次检测到糖尿 症状在 17岁, 后期胰岛素治疗及口服降糖药治疗。 家系内出现有 3个以上患者在 25岁以 前发病, 累及 3 代或以上家族成员, 呈常染色体显性遗传; 外显率高, 可超过 90%。  As shown in Figure 2, CZ01 family, proband, female, 26 years old, body mass index 21.2. The patient first detected diabetes symptoms at 17 years of age, late insulin therapy and oral hypoglycemic agents. More than three patients in the family have developed before the age of 25, involving 3 or more generations of family members, which are autosomal dominant; the penetrance rate is high, which can exceed 90%.
CZ02家系, 先证者, 女, 21岁, 体质量指数 28.6。 患者首次检测到糖尿症状在 17岁, 后期口服降糖药治疗。 家系内累及 3 代或以上家族成员, 呈常染色体显性遗传; 外显率高, 可超过 90%。  CZ02 family, proband, female, 21 years old, body mass index 28.6. The patient first detected the symptoms of diabetes at the age of 17 and was treated with oral hypoglycemic agents. Family members of the family with 3 or more generations are autosomal dominant; the penetrance rate is high, which can exceed 90%.
两个先证者的病理诊断均为: 青年人中的成年发病型糖尿病。  The pathological diagnosis of the two probands is: adult onset diabetes in young people.
发明人收集获得上述 CZ01家系中存活的 3个患者以及 3个正常人的 DNA样本, CZ02 家系中存活的 2个患者以及 4个正常人的 DNA样本。  The inventors collected DNA samples from three patients and three normal persons surviving the above CZ01 family, two patients surviving in the CZ02 family, and four normal human DNA samples.
2、 全外显子组测序确定致病基因及突变位点  2. The whole exome group is sequenced to determine the pathogenic genes and mutation sites.
发明人利用 Agilent Sureselect 44M Kit结合 Solexa高通量测序技术对上述 CZ01患者家 系中的先证者及其患病母亲和正常父亲和 CZ02 患者家系中的先证者及其患病母亲和正常 父亲进行了外显子测序, 具体步骤如下:  The inventors used the Agilent Sureselect 44M Kit in combination with Solexa high-throughput sequencing technology to perform the above-mentioned probands in the family of CZ01 patients and their diseased mothers and probands in the families of normal fathers and CZ02 patients, and their affected mothers and normal fathers. Exon sequencing, the specific steps are as follows:
2.1 样品制备  2.1 Sample preparation
分别取 CZ01和 CZ02患者家系中的先证者及其患病母亲和正常父亲的外周血, 利用常 规盐析法抽提基因组 DNA, 并利用分光光度计测量 DNA的浓度及纯度, 所得的每个标本 基因组 DNA的 OD260/OD280均位于 1.7-2.0之间,浓度不少于 200ng^l,总量不少于 30μ§, 备用。  Peripheral blood of the probands of CZ01 and CZ02 patients and their diseased mothers and normal fathers were taken separately, genomic DNA was extracted by conventional salting out method, and the concentration and purity of DNA were measured by spectrophotometer. The OD260/OD280 of the genomic DNA of the specimen is located between 1.7 and 2.0, the concentration is not less than 200 ng^l, and the total amount is not less than 30 μ§.
2.2 文库构建及测序  2.2 Library construction and sequencing
利用超声波仪 ( CovarisS2, Massachusetts, USA ) 将各基因组 DNA样本随机打断成 200-300bp左右的片段, 随后按照制造商提供的操作说明书, 在片段两端分别连接上接头制 备文库(可参见: http://www.illumina.com/提供的 Illumina/Solexa标准建库说明书, 通过参 照将其全文并入本文)。 文库经纯化后经过 Ligation-mediated PCR (LM-PCR)的线性扩增与 SureSelect Biotiny lated R A Library (BAITS)进行杂交富集, 再经过 LM-PCR的线性扩增, 文库检测合格后即可上机测序, 以便获得原始测序数据。 其中, 参照 Illumina标准的成簇和 测序的 protocol进行测序, 测序平台为 Illumina Hiseq 2000 , 读取长度为 90bp , 样本的平均 测序深度为 170x。 Randomly interrupted each genomic DNA sample using a sonicator (CovarisS2, Massachusetts, USA) Fragments of about 200-300 bp were subsequently ligated to the ends of the fragments to prepare libraries (see the Illumina/Solexa standard library instructions provided at http://www.illumina.com/, according to the manufacturer's instructions. This is incorporated herein by reference in its entirety. The library was purified and linearized by Ligation-mediated PCR (LM-PCR) and purified by SureSelect Biotiny lated RA Library (BAITS). After linear amplification by LM-PCR, the library was tested and qualified. Sequencing to obtain raw sequencing data. Among them, the Illumina Hiseq 2000 was sequenced according to the Illumina standard clustering and sequencing protocol. The read length was 90 bp, and the average sequencing depth of the sample was 170x.
2.3 变异检测、 注释及数据库比较  2.3 Variation detection, annotation and database comparison
利用 Illumina basecalling Software 1.7对上述获得的原始测序数据进行处理, 经过过滤 去污染后, 使用 SOAPaligner/SOAP2 (可参见: Li R, Li Y, Kristiansen K, et al, SOAP: short oligonucleotide alignment program. Bioinformatics 2008, 24(5):713-714; Li R, Yu C, Li Y, ea al, SOAP2:an improved ultrafast tool for short read alignment.Bioinformatics 2009, 25(15): 1966-1967, 通过参照将其全文并入本文) 比对到参考基因组 UCSC NCBI37/hgl9, 以便获得比对到基因组上的唯一比对序列。 然后利用 SOAPsnp (可参见: Li R, Li Y, Fang X, Yang H, et al, SNP detection for massively parallel whole-genome resequencing . Genome Res 2009, 19(6): 1124-1132, 通过参照将其全文并入本文)确定靶区域的基因型。  The raw sequencing data obtained above was processed using Illumina basecalling Software 1.7, and after filtration and decontamination, SOAPaligner/SOAP2 was used (see: Li R, Li Y, Kristiansen K, et al, SOAP: short oligonucleotide alignment program. Bioinformatics 2008 , 24(5): 713-714; Li R, Yu C, Li Y, ea al, SOAP2: an improved ultrafast tool for short read alignment. Bioinformatics 2009, 25(15): 1966-1967, by reference Incorporating into the reference genome UCSC NCBI37/hgl9, in order to obtain a unique aligned sequence aligned to the genome. Then use SOAPsnp (see: Li R, Li Y, Fang X, Yang H, et al, SNP detection for massively parallel whole-genome resequencing. Genome Res 2009, 19(6): 1124-1132, by reference Incorporate herein to determine the genotype of the target region.
在得到测序结果之后, 对非同义突变、 剪接受体 /供体位点突变、 编码区插入和缺失突 变这三类最有可能与病理相关的突变进行研究。 结果, 发明人通过测序分析发现, 在家系 CZ01中的 3个测序样本中平均发现 67745个单核苷酸多态性( SNPs ) 和 4784处的插入 / 缺失, 在家系 CZ02中的的 3个测序样本中平均发现 69246个单核苷酸多态性( SNPs ) 和 4840 处 的 插 入 / 缺 失 。 随 后 通 过 dbSNP132 数 据 库 ( http://www.ncbi.nlm.nih.gov/projects/SNP/snp_summary.cgi )、 Exome Variant Server数据库 ( http://evs.gs.washington.edu/EVS/ )<千人基因组数据库 ( www.1000genomes.org )、 HapMa 数 据库 ( http:〃hapmap .ncbi.nlm.nih.gov ) 等公共数据库以及糖尿病基因变异 LuCamp数据库 (http://www.lucamp.org)的过滤, 去掉所有已知的且在数据库中等位基因频率大于 0.005 的变 异。 剩下的非同义 /剪接位点突变和微小插入缺失根据以下特征进行优先选择: ( a )选择两 个患者中共有的同基因同位点同突变型的突变, 过滤掉正常人中存在的突变, CZ01家系还 剩下 343个 SNP突变位点, 39个 Indel突变; CZ02家系还剩下 336个 SNP突变位点, 45 个 Indel突变; (b )这些突变位点中, 优先查找 MODY已知基因的突变, 在 CZ01家系中发 现 HNF1B基因的 C.703OT突变; 在 CZ02家系中发现 HNF1A基因的 c.338G>T突变。  After sequencing results, the most likely pathologically relevant mutations were investigated for non-synonymous mutations, splice acceptor/donor site mutations, coding region insertions, and deletion mutations. As a result, the inventors found through sequencing analysis that an average of 67,745 single nucleotide polymorphisms (SNPs) and 4,784 insertions/deletions were found in three sequencing samples in the CZ01 family, and three sequencing in the home line CZ02. On average, 69,624 single nucleotide polymorphisms (SNPs) and 4840 insertions/deletions were found in the sample. Then through the dbSNP132 database (http://www.ncbi.nlm.nih.gov/projects/SNP/snp_summary.cgi), Exome Variant Server database (http://evs.gs.washington.edu/EVS/)<thousands Filtration of public databases such as the Human Genome Database (www.1000genomes.org), the HapMa database (http:〃hapmap.ncbi.nlm.nih.gov), and the LuCamp database of diabetes genetic variants (http://www.lucamp.org), Remove all known variants with a gene allele frequency greater than 0.005 in the database. The remaining non-synonymous/splicing site mutations and microinteger deletions are preferentially selected according to the following characteristics: (a) Selection of isogenic homologous and mutant mutations shared between the two patients, filtering out mutations present in normal humans There are 343 SNP mutation sites and 39 Indel mutations in the CZ01 family; there are 336 SNP mutation sites and 45 Indel mutations in the CZ02 family; (b) Among these mutation sites, the MODY known genes are preferentially searched. Mutation, the C.703OT mutation of the HNF1B gene was found in the CZ01 family; the c.338G>T mutation of the HNF1A gene was found in the CZ02 family.
由于外显子组测序存在一定程度的假阳性, 接下来, 发明人又利用 Sanger测序方法, 对上述确定的 2个 SNP突变在家系 CZ01和 CZ021 的先证者分别进行了验证, 结果显示 fflV ^B基因的 C.703OT突变及 HNFL4基因的 c.338G>T突变均为真实的突变。 由此, 表 明 HNF1B 和 HNF1A 基因为青年人中的成年发病型糖尿病的致病相关基因, C.703OT ( HNF1B ), c.338G>T ( HNFJ4 ) 突变是青年人中的成年发病型糖尿病的致病突变。 实施例 2 Sanger法测序验证青年人中的成年发病型糖尿病的致病基因 Since the exome sequencing has a certain degree of false positives, the inventors then used the Sanger sequencing method to verify the two SNP mutations identified above in the family CZ01 and CZ021. The results show that fflV ^ The C.703OT mutation of the B gene and the c.338G>T mutation of the HNFL4 gene are all true mutations. Thus, HNF1B and HNF1A genes are involved in the pathogenesis-related genes of adult-onset diabetes mellitus in young adults. C.703OT (HNF1B), c.338G>T (HNFJ4) mutation is a cause of adult-onset diabetes in young adults. Disease mutation. Example 2 Sanger method sequencing to verify the pathogenic genes of adult onset diabetes in young people
分别对 MODY患者家系 CZ01中的 6人( 3个女性患者及 3个家系内正常人) CZ02中 的 6人(2个女性患者及 4个家系内正常人)(见图 2)、 1个散发患者及 95个家系外正常人 的 HNF1B和 HNF1A基因进行检测, 针对 HNF1B基因的 3号外显子 HNFL4基因 2号外 显子序列设计引物, 然后通过 PCR扩增、 产物纯化和测序的方法获得 HNF1B基因的 3号 外显子 HNFL4基因 2号外显子有关序列,根据确定序列测定结果属于突变型还是野生型, 证 HNF1B和 HNF1A与青年人中的成年发病型糖尿病之间的相关性。具体方法步骤如下: 6 of the CZ01 family of the MODY patients (3 female patients and 3 normal family members) 6 of the CZ02 (2 female patients and 4 normal family members) (see Figure 2), 1 scatter The HNF1B and HNF1A genes were detected in the patients and 95 normal off-campus normal subjects. Primers were designed for the exon 2 sequence of exon 3 HNFL4 gene of HNF1B gene, and then HNF1B gene was obtained by PCR amplification, product purification and sequencing. Sequence No. 2 of the exon 2 of the exon HNFL4 gene, whether it is a mutant or a wild type according to the determined sequence, confirms the correlation between HNF1B and HNF1A and adult onset diabetes in young people. The specific method steps are as follows:
1、 DNA提取 1, DNA extraction
分别釆集上述家系 CZ01中的 3名患者和 3名家系内正常人、 CZ02中的 2名患者和 4 名家系内正常人、 1个散发患者以及 95个家系外正常人的外周静脉血, 然后利用常规盐析 法抽提外周血白细胞中的基因组 DNA, 并利用分光光度计测量所提取 DNA的浓度及纯度, 所得的每个标本基因组 DNA的 OD260/OD280均位于 1.7-2.0之间,浓度不少于 200ng/微升, 总量不少于 6 g, 由此, 获得 108份 DNA样品, 备用。  The peripheral venous blood of 3 patients and 3 normal family members, 2 patients in CZ02, 4 normal family members, 1 sporadic patient, and 95 normal off-campus normal persons in the above-mentioned family CZ01 were collected. The genomic DNA in the peripheral blood leukocytes was extracted by conventional salting out method, and the concentration and purity of the extracted DNA were measured by a spectrophotometer. The OD260/OD280 of each specimen genomic DNA was between 1.7 and 2.0, and the concentration was not Less than 200 ng/μl, the total amount is not less than 6 g, whereby 108 DNA samples are obtained and used.
2、 引物设计及 PCR反应  2. Primer design and PCR reaction
首先, 参考人类基因组序列数据库 GRCh37.1/hgl9, 设计得到具有 SEQ ID NO: 5-6所 示的核苷酸序列的 HNF1B基因 3号外显子, 以及具有 SEQ ID NO: 7-8所示的核苷酸序列 的 HNF1A基因 2号外显子序列特异性引物, 具体见下表。  First, with reference to the human genome sequence database GRCh37.1/hgl9, the exon 3 of the HNF1B gene having the nucleotide sequence shown by SEQ ID NOS: 5-6 is designed, and has the SEQ ID NOs: 7-8 The HNF1A gene exon 2 specific primer of the nucleotide sequence is shown in the following table.
HNF1B和 HNF1A基因外显子特异性 |物  HNF1B and HNF1A gene exon specificity
Figure imgf000014_0001
Figure imgf000014_0001
接着,分别按照以下配比配制各基因组 DNA样本的 PCR反应体系以及进行 PCR反应。 针对 HNF1B基因 3号外显子 HNFL4基因 2号外显子序列, 按以下配比分别配制各 基因组 DNA样本的各 PCR反应体系 (不同的突变位点釆用相同的反应体系):  Next, the PCR reaction system of each genomic DNA sample was prepared and subjected to a PCR reaction according to the following ratios. For the exon 2 sequence of HNF1B gene exon 3 HNFL4 gene, each PCR reaction system of each genomic DNA sample was prepared according to the following ratios (different mutation sites use the same reaction system):
反应体系: 10 μΐ  Reaction system: 10 μΐ
Figure imgf000014_0002
DNA 1
Figure imgf000014_0002
DNA 1
总体积 10  Total volume 10
然后, 将配制获得的 HNF1B基因 3号外显子 HNFL4基因 2号外显子序列的各 PCR 反应体系按照以下反应条件分别进行 PCR反应 (不同的突变位点釆用相同的反应条件 ): 反应条件:  Then, each PCR reaction system of the exon 2 of the HNF1B gene exon 2 HNFL4 gene obtained is subjected to PCR reaction according to the following reaction conditions (different mutation sites are used for the same reaction conditions): Reaction conditions:
10个循环 10 cycles
Figure imgf000015_0001
Figure imgf000015_0001
由此, 获得上述家系 CZ01中的 3名患者和 3名家系内正常人, CZ02中的 2名患者和 4名家系内正常人, 1个散发患者以及 95个家系外正常人的 PCR扩增产物。  Thus, PCR amplification products were obtained from 3 patients in the above-mentioned family CZ01 and 3 normal persons in the family, 2 patients in CZ02, 4 normal persons in the family, 1 sporadic patient, and 95 normal off-campus normal persons. .
3、 测序 3. Sequencing
将步骤 2中获得的获自家系 CZ01中的 3名患者和 3名家系内正常人, CZ02中的 2名 患者和 4名家系内正常人,散发患者以及 95个家系外正常人的 PCR扩增产物,按以下配比 分别配制各基因组 DNA样本的各 PCR反应体系进行 DNA测序(不同的突变位点釆用相同 的反应体系, BigDye® Terminator v3.1 Cycle Sequncing kit, Applied Biosystems ):  PCR amplification of 3 patients and 3 family members in the CZ01 obtained in step 2, 2 patients in CZ02, 4 normal subjects in the family, sporadic patients and 95 normal off-campus normal subjects For the products, each PCR reaction system of each genomic DNA sample was prepared for DNA sequencing according to the following ratios (different mutation sites, the same reaction system, BigDye® Terminator v3.1 Cycle Sequncing kit, Applied Biosystems):
Figure imgf000015_0002
然后, 将配制获得的 HNF1B 3号外显子 HNFJ4基因 2号外显子的各 PCR反应 体系按照以下反应条件分别进行 PCR反应然后利用 DNA sequencer ABI Prism 3130x1 (Applied Biosystems, Warrington, UK)进行测序 (不同的突变位点釆用相同的反应条件 ): 反应条件: 96 °C 1 min
Figure imgf000015_0002
Then, each PCR reaction system for preparing the exon 2 of the HNF1B exon HNFJ4 gene obtained was subjected to a PCR reaction under the following reaction conditions and then subjected to sequencing using DNA sequencer ABI Prism 3130x1 (Applied Biosystems, Warrington, UK) (different The mutation site uses the same reaction conditions): Reaction conditions: 96 °C 1 min
96 °c 10 s  96 °c 10 s
55 °c 5 s J" 25  55 °c 5 s J" 25
63 °c 4 min  63 °c 4 min
72 °c 7 min  72 °c 7 min
15 。c oo  15 . c oo
结果显示, HNF1B基因的 c.703C>T突变在 CZ01家系内出现共分离, HNF1A基因的 c.338G>T突变在 CZ02家系中出现共分离。  The results showed that the c.703C>T mutation of HNF1B gene was co-segregated in the CZ01 family, and the c.338G>T mutation of HNF1A gene was co-segregated in the CZ02 family.
接着, 基于测序结果, 对上述各样本进行 HN B HNFM基因编码序列比对。 发明 人发现,家系 CZ01和 CZ02内患者及其家系内正常人分别在 c.703C>T( HNF1B )和 c.338G>T ( HNF1A )出现共分离现象, 家系 CZ01和 CZ02内患者分别携带 C.703OT和 c.338G>T突 变, 并在一个散发患者中发现了 c.358A>G突变 ( HNF1A , 95个家系外正常人的 HNF1B 基因均未发现具有 C.703OT 突变; 95 个家系外正常人的 HNF1A 基因均未发现具有 c.338G>T和 c.358A>G突变。 由此, 证明 HNF1B基因的 C.703OT突变、 HNF1A基因的 c.338G>T和 c.358A>G突变是青年人中的成年发病型糖尿病的致病突变。  Next, based on the sequencing results, the HN B HNFM gene coding sequence alignment was performed on each of the above samples. The inventors found that patients in their families CZ01 and CZ02 and their normal families had co-segregation in c.703C>T (HNF1B) and c.338G>T (HNF1A), respectively, and patients in families CZ01 and CZ02 carried C. 703OT and c.338G>T mutations, and c.358A>G mutation (HNF1A) was found in a sporadic patient. HNF1B gene was found in 95 out-of-home normal humans without C.703OT mutation; 95 out-of-home normals None of the HNF1A genes were found to have c.338G>T and c.358A>G mutations. Thus, the C.703OT mutation of the HNF1B gene and the c.338G>T and c.358A>G mutations of the HNF1A gene were found to be young. A pathogenic mutation in adult onset diabetes.
其中, 图 3显示了 MODY患者家系 CZ01家系内的患者及正常人的 HA^JB基因突变 位点的 Sanger测序验证峰图, 图 4显示了 CZ02家系内的患者及正常人的 HNF1A基因突 变位点的 Sanger测序验证峰图,图 5显示了 MODY散发患者及家系外正常人的 HNF1A基 因突变位点的 Sanger测序验证峰图。 由图 3-5可知, CZ01家系中患者的 HNF1B基因具有 C.703OT突变, 而该家系中的正常人不具有该突变; CZ02家系中患者的 HNFL4基因具有 c.338G>T突变,而该家系中的正常人不具有该突变;散发患者的 HNFJ4基因具有 c.358A>G 突变, 而家系外正常人在相应位点上均不存在该突变。  Among them, Figure 3 shows the Sanger sequencing verification peak of the HA^JB gene mutation site in the CZ01 family of MODY patients, and Figure 4 shows the HNF1A gene mutation site in the CZ02 family and normal subjects. The Sanger sequencing verification peak map, Figure 5 shows the Sanger sequencing verification peak of the HNF1A gene mutation site in MODY spora patients and off-normal subjects. As can be seen from Figures 3-5, the HNF1B gene of the CZ01 family has a C.703OT mutation, and the normal person in the family does not have the mutation; the HNFL4 gene of the CZ02 family has a c.338G>T mutation, and the family The normal person does not have this mutation; the HNFJ4 gene of the sporadic patient has a c.358A>G mutation, and the normal human outside the family does not have the mutation at the corresponding site.
由此, 发明人在患者家族成员中对 HNF1B基因 3号外显子和 HNF1A基因 2号外显子 的序列进行突变排查发现家系 CZ01的 3个患者具有 C.703OT ( HNF1B )突变, 家系 CZ02 的 2个患者具有 c.338G>T ( HNF1A ) 突变; 散发患者中发现了一例 c.358A>G ( HNF1A ) 突变; 而在与患者来自同一区域的 95个家系外正常人中均未发现上述突变。 由此, 进一步 证明 HNF1B基 的 C.703OT突变、 HNF1A基因的 c.338G>T和 c.358A>G突变是青年人 中的成年发病型糖尿病的致病突变。  Thus, the inventors performed mutations on the sequence of exon 3 of HNF1B gene and exon 2 of HNF1A gene in patient family members, and found that three patients of family CZ01 had C.703OT (HNF1B) mutation, and two of family CZ02. The patient had a c.338G>T (HNF1A) mutation; a c.358A>G (HNF1A) mutation was found in the sporadic patients; the mutation was not found in 95 out of normal families from the same region as the patient. Thus, it was further demonstrated that the C.703OT mutation of the HNF1B group and the c.338G>T and c.358A>G mutation of the HNF1A gene are pathogenic mutations in adult onset diabetes in young people.
需要说明的是, 已知, 肝细 ϋ包核因子 l a (hepatocyte nuclear factorl α , HNF1A)属于 POU -同源结构域家族, 由 N末端的类肌球蛋白二聚体区、 C末端的转录激活区和 DNA结 合区构成。其基因定位于染色体 12q24. 2,全长 3241bp,其中蛋白编码区长 1 896 bp。HNFlA DNA结合区含 POU -同源结构域序列, 二聚体区可形成同源二聚体和 HNF1B 形成异源 二聚体共同调节靶基因表达。 肝细胞核因子 1 β (hepatocyte nuclear factorl β , HNF1B)是对 胰腺细胞形成和维持血糖稳态具有重要作用的转录因子。 HNF1B是 HNF1A类似物,在脊推 动物中表达, 与 HNF1A 高度同源, 其生物学活性明显低于 HNF1A。 HNF1A表达下调时, HNF1B 可代偿性增加。本研究可适用却不局限于在糖尿病患者无肾脏结构 /功能异常症状的 特殊患者中进行 HNF1B基因突变筛查和临床表型分析; 另一方面, 将评价 HN B基因变 异与捷克人青年人中的成年发病型糖尿病易感性的关系。 It should be noted that hepatocyte nuclear factor l α (HNF1A) belongs to the POU-homology domain family, which is activated by the N-terminal myosin dimer region and the C-terminus. The region and the DNA binding region are constructed. The gene is located on chromosome 12q24.2, with a total length of 3241 bp, wherein the protein coding region is 1 896 bp long. The HNF1A DNA binding region contains a POU-homology domain sequence, and the dimeric region forms a homodimer and HNF1B forms a heterodimer to collectively regulate target gene expression. Hepatocyte nuclear factor 1 β (HNF1B) is a transcription factor that plays an important role in the formation of pancreatic cells and the maintenance of blood glucose homeostasis. HNF1B is an HNF1A analog expressed in vertebrate, highly homologous to HNF1A, and its biological activity is significantly lower than that of HNF1A. When HNF1A expression is down-regulated, HNF1B can be compensated for. This study may be applied to, but is not limited to, HNF1B gene mutation screening and clinical phenotypic analysis in special patients with no renal structural/dysfunction in diabetic patients; on the other hand, HN B gene mutations will be evaluated in Czech young people. The relationship between susceptibility to adult-onset diabetes.
综上, 发明人进一步证明了 HA^JB和 HNFL4基因为青年人中的成年发病型糖尿病的 致病相关基因, HNF1B基因的 C.703OT突变、 HNF1A基因的 c.338G>T和 c.358A>G突变 是青年人中的成年发病型糖尿病的致病突变。 实施例 3 Sanger法测序验证青年人中的成年发病型糖尿病的致病突变  In conclusion, the inventors further demonstrated that the HA^JB and HNFL4 genes are pathogenicity-related genes for adult-onset diabetes in young adults, the C.703OT mutation of the HNF1B gene, and the c.338G>T and c.358A of the HNF1A gene. The G mutation is a causative mutation in adult-onset diabetes in young people. Example 3 Sanger Method Sequencing Verification of Pathogenic Mutations in Adult-onset Diabetes in Young People
分别对上述得到的 HNF1B和 HNF1A基因上 C.703OT ( HNF1B ). c.338G>T ( HNF1A ) 和 c.358A>G ( HNF1A )的致病突变进行验证, 即利用 HNF1B基因的 3号外显子 HNFL4 基因的 2号外显子序列引物, 分别按照实施例 2中的相应方法, 对实施例 2中获得的 95个 家系外正常人的基因组 DNA样本进行 HNFlB m的 3号外显子 HNFL4基因的 2号外显 子突变位点检测。 结果, 发明人发现, 95个家系外正常人中均未发现上述突变。  The pathogenic mutations of C.703OT (HNF1B). c.338G>T (HNF1A) and c.358A>G (HNF1A) on the HNF1B and HNF1A genes obtained above were verified, that is, the exon 3 of HNF1B gene was utilized. The exon 2 primer sequence of the HNFL4 gene was subjected to the corresponding method of Example 2, and the genomic DNA samples of 95 out-of-home normal humans obtained in Example 2 were subjected to HNF1B m exon 2 of the exon 2 HNFL4 gene. Proton mutation site detection. As a result, the inventors found that the above mutations were not found in 95 normal off-campus normal persons.
由此, 发明人进一步证明了 HA^ £基因的 C.703OT突变、 HNFL4基因的 c.338G>T 和 c.358A>G突变是青年人中的成年发病型糖尿病的致病突变。 工业实用性  Thus, the inventors further demonstrated that the C.703OT mutation of the HA^ £ gene, the c.338G>T and the c.358A>G mutation of the HNFL4 gene are pathogenic mutations in adult onset diabetes in young people. Industrial applicability
本发明的分离的核酸、 分离的多肽、 筛选易感青年人中的成年发病型糖尿病的生物 样品的方法、 系统以及试剂盒, 能够有效地应用于检测生物样品是否易感青年人中的成 年发病型糖尿病, 并且获得的结果准确可靠。 尽管本发明的具体实施方式已经得到详细的描述, 本领域技术人员将会理解。 根据已 经公开的所有教导, 可以对那些细节进行各种修改和替换, 这些改变均在本发明的保护范 围之内。 本发明的全部范围由所附权利要求及其任何等同物给出。  The isolated nucleic acid, the isolated polypeptide of the present invention, the method, system and kit for screening biological samples of adult onset diabetes in susceptible young people can be effectively applied to detect whether a biological sample is susceptible to adult onset in young people Type 2 diabetes, and the results obtained are accurate and reliable. Although specific embodiments of the invention have been described in detail, those skilled in the art will understand. Various modifications and alterations of those details are possible in light of the teachings of the invention. The full scope of the invention is given by the appended claims and any equivalents thereof.
在本说明书的描述中, 参考术语 "一个实施例"、 "一些实施例"、 "示意性实施例"、 "示 例"、 "具体示例"、 或 "一些示例" 等的描述意指结合该实施例或示例描述的具体特征、 结 构、 材料或者特点包含于本发明的至少一个实施例或示例中。 在本说明书中, 对上述术语 的示意性表述不一定指的是相同的实施例或示例。 而且, 描述的具体特征、 结构、 材料或 者特点可以在任何的一个或多个实施例或示例中以合适的方式结合。  In the description of the present specification, the description of the terms "one embodiment", "some embodiments", "illustrative embodiment", "example", "specific example", or "some examples", etc. Particular features, structures, materials or features described in the examples or examples are included in at least one embodiment or example of the invention. In the present specification, the schematic representation of the above terms does not necessarily mean the same embodiment or example. Furthermore, the particular features, structures, materials, or characteristics described may be combined in a suitable manner in any one or more embodiments or examples.

Claims

权利要求书 claims
1、 一种分离的核酸, 其特征在于, 1. An isolated nucleic acid, characterized by:
与 SEQ ID NO: 1相比, 所述核酸具有 C.703OT突变; 或者 Compared with SEQ ID NO: 1, the nucleic acid has a C.703OT mutation; or
与 SEQ ID NO: 3相比,所述核酸具有选自下列的至少一种突变: c.338G>T和 c.358A>G, 任选地, 所述核酸为 DNA。 Compared with SEQ ID NO: 3, the nucleic acid has at least one mutation selected from the following: c.338G>T and c.358A>G, optionally, the nucleic acid is DNA.
2、 一种分离的多肽, 其特征在于, 2. An isolated polypeptide, characterized by:
与 SEQ ID NO: 2相比, 所述分离的多肽具有 p.Arg235Trp突变; 或者 Compared with SEQ ID NO: 2, the isolated polypeptide has a p.Arg235Trp mutation; or
与 SEQ ID NO: 4相比, 所述分离的多肽具有选自下列的至少一种突变: p.Trpl l3Leu 和 p.Lysl20Glu, Compared to SEQ ID NO: 4, the isolated polypeptide has at least one mutation selected from the group consisting of: p.Trpl 13Leu and p.Lysl20Glu,
任选地, 所述多肽是由权利要求 1所述的核酸编码的。 Optionally, the polypeptide is encoded by the nucleic acid of claim 1.
3、 一种筛选易感青年人中的成年发病型糖尿病的生物样品的方法, 其特征在于, 包括 以下步骤: 3. A method for screening biological samples for adult-onset diabetes in susceptible young people, characterized by including the following steps:
从所述生物样品提取核酸样本; extract a nucleic acid sample from the biological sample;
确定所述核酸样本的核酸序列; Determine the nucleic acid sequence of the nucleic acid sample;
所述核酸样本的核酸序列或其互补序列, 与 SEQ ID NO: 1相比具有选自 C.703OT突 变, 或者与 SEQ ID NO: 3相比具有选自 c.338G>T和 c.358A>G的至少一种突变, 是所述 生物样品易感青年人中的成年发病型糖尿病的指示, The nucleic acid sequence of the nucleic acid sample or its complementary sequence, compared with SEQ ID NO: 1, has a mutation selected from the group consisting of C.703OT, or compared with SEQ ID NO: 3, has a mutation selected from the group consisting of c.338G>T and c.358A> at least one mutation of G that is indicative of adult-onset diabetes in a susceptible young adult in said biological sample,
任选地, 所述生物样品为选自人体血液、 皮肤、 毛发和肌肉的至少一种, Optionally, the biological sample is at least one selected from human blood, skin, hair and muscle,
任选地, 所述核酸样本为全基因组 DNA。 Optionally, the nucleic acid sample is whole genome DNA.
4、 根据权利要求 3所述的方法, 其特征在于, 从所述生物样品提取核酸样本进一步包 括: 4. The method of claim 3, wherein extracting nucleic acid samples from the biological samples further includes:
从所述生物样品提取 R A样本, 优选所述 R A样本为 mR A; 以及 Extracting an RA sample from the biological sample, preferably the RA sample is mRA; and
基于所述 R A样本, 通过反转录反应, 获得 cDNA样本, 所述 cDNA样本构成所述核 酸样本。 Based on the RA sample, a cDNA sample is obtained through a reverse transcription reaction, and the cDNA sample constitutes the nucleic acid sample.
5、 根据权利要求 3所述的方法, 其特征在于, 确定所述核酸样本的核酸序列进一步包 括: 5. The method of claim 3, wherein determining the nucleic acid sequence of the nucleic acid sample further includes:
针对所述核酸样本, 构建核酸测序文库; 以及 Construct a nucleic acid sequencing library for the nucleic acid sample; and
对所述核酸测序文库进行测序, 以便获得由多个测序数据构成的测序结果, Sequencing the nucleic acid sequencing library to obtain a sequencing result composed of multiple sequencing data,
任选地, 釆用选自 Hiseq2000、 SOLiD、 454和单分子测序装置的至少一种对所述核酸 测序文库进行测序, Optionally, the nucleic acid sequencing library is sequenced using at least one selected from the group consisting of Hiseq2000, SOLiD, 454 and a single molecule sequencing device,
任选地, 针对所述核酸样本, 构建核酸测序文库进一步包括: Optionally, for the nucleic acid sample, constructing a nucleic acid sequencing library further includes:
利用选自 HNF1B和 HNF1A基因外显子特异性引物的至少一种, 对所述核酸样本进行 PCR扩增; 以及 Using at least one selected from the group consisting of HNF1B and HNF1A gene exon-specific primers, the nucleic acid sample is subjected to PCR amplification; and
针对所得到的扩增产物, 构建所述核酸测序文库, 任选地, 所述 HNF1B基因外显子特异性引物具有如 SEQ ID NO: 5-6所示的核苷酸序 列, 所述 HNF1A基因外显子特异性引物具有如 SEQ ID NO: 7-8所示的核苷酸序列。 Construct the nucleic acid sequencing library based on the obtained amplification product, Optionally, the HNF1B gene exon-specific primer has a nucleotide sequence as shown in SEQ ID NO: 5-6, and the HNF1A gene exon-specific primer has a nucleotide sequence as SEQ ID NO: 7-8 The nucleotide sequence shown.
6、一种筛选易感青年人中的成年发病型糖尿病的生物样品的系统, 其特征在于, 包括: 核酸提取装置, 所述核酸提取装置用于从所述生物样品提取核酸样本; 6. A system for screening biological samples for adult-onset diabetes in susceptible young people, characterized by comprising: a nucleic acid extraction device, the nucleic acid extraction device being used to extract nucleic acid samples from the biological samples;
核酸序列确定装置, 所述核酸序列确定装置与所述核酸提取装置相连, 用于对所述核 酸样本进行分析, 以便确定所述核酸样本的核酸序列; Nucleic acid sequence determination device, the nucleic acid sequence determination device is connected to the nucleic acid extraction device, and is used to analyze the nucleic acid sample to determine the nucleic acid sequence of the nucleic acid sample;
判断装置, 所述判断装置与所述核酸序列确定装置相连, 以便基于所述核酸样本的核 酸序列或其互补序列, 与 SEQ ID NO: 1相比具有 C.703OT突变, 或者与 SEQ ID NO: 3 相比具有选自 c.338G>T和 c.358A>G的至少一种突变, 判断所述生物样品是否易感青年人 中的成年发病型糖尿病。 A judgment device, the judgment device is connected to the nucleic acid sequence determination device, so that based on the nucleic acid sequence of the nucleic acid sample or its complementary sequence, it has a C.703OT mutation compared with SEQ ID NO: 1, or has a C.703OT mutation compared with SEQ ID NO: 1. 3 Determine whether the biological sample is susceptible to adult-onset diabetes in young adults compared to having at least one mutation selected from c.338G>T and c.358A>G.
7、 根据权利要求 6所述的系统, 其特征在于, 所述核酸提取装置进一步包括: R A提取单元, 所述 R A提取单元用于从所述生物样品提取 R A样本; 以及 反转录单元, 所述反转录单元与所述 R A提取单元相连, 用于对所述 RNA样本进行 反转录反应, 以便获得 cDNA样本, 所述 cDNA样本构成所述核酸样本。 7. The system according to claim 6, wherein the nucleic acid extraction device further includes: an RA extraction unit, the RA extraction unit is used to extract an RA sample from the biological sample; and a reverse transcription unit, The reverse transcription unit is connected to the RA extraction unit and is used to perform a reverse transcription reaction on the RNA sample to obtain a cDNA sample, and the cDNA sample constitutes the nucleic acid sample.
8、 根据权利要求 6所述的系统, 其特征在于, 所述核酸序列确定装置进一步包括: 文库构建单元, 所述文库构建单元用于针对所述核酸样本, 构建核酸测序文库; 以及 测序单元, 所述测序单元与所述文库构建单元相连, 用于对所述核酸测序文库进行测 序, 以便获得由多个测序数据构成的测序结果, 8. The system according to claim 6, wherein the nucleic acid sequence determination device further includes: a library construction unit, the library construction unit is used to construct a nucleic acid sequencing library for the nucleic acid sample; and a sequencing unit, The sequencing unit is connected to the library construction unit and is used to sequence the nucleic acid sequencing library to obtain a sequencing result composed of multiple sequencing data,
任选地, 所述文库构建单元进一步包括: Optionally, the library building blocks further include:
PCR扩增模块,所述 PCR扩增模块中设置有选自 HNF1B和 HNF1A基因外显子特异性 引物的至少一种, 以便利用 HA^JB和 HNFL4基因外显子特异性引物的至少一种, 对所述 核酸样本进行 PCR扩增, A PCR amplification module, the PCR amplification module is provided with at least one selected from the group consisting of HNF1B and HNF1A gene exon-specific primers, so as to utilize at least one type of HNF1B and HNFL4 gene exon-specific primers, Perform PCR amplification of the nucleic acid sample,
任选地, 所述 HNF1B基因外显子特异性引物具有如 SEQ ID NO: 5-6所示的核苷酸序 列, 所述 HNF1A基因外显子特异性引物具有如 SEQ ID NO: 7-8所示的核苷酸序列, Optionally, the HNF1B gene exon-specific primer has a nucleotide sequence as shown in SEQ ID NO: 5-6, and the HNF1A gene exon-specific primer has a nucleotide sequence as SEQ ID NO: 7-8 The nucleotide sequence shown is,
任选地, 所述测序单元包括选自 HISEQ2000、 SOLiD、 454和单分子测序装置的至少一 种。 Optionally, the sequencing unit includes at least one selected from the group consisting of HISEQ2000, SOLiD, 454 and single molecule sequencing devices.
9、一种用于筛选易感青年人中的成年发病型糖尿病的生物样品的试剂盒,其特征在于, 含有: 9. A kit for screening biological samples for adult-onset diabetes in susceptible young people, characterized by containing:
适于检测 HNF1B和 HNF1A基因突变体的至少一种的试剂, 其中 A reagent suitable for detecting at least one of HNF1B and HNF1A gene mutants, wherein
与 SEQ ID NO: 1相比, 所述 fflV^JB基因突变体具有 C.703OT突变; Compared with SEQ ID NO: 1, the fflV^JB gene mutant has a C.703OT mutation;
与 SEQ ID NO: 3 相比, 所述 HNF1A 基因突变体具有选自下列的至少一种突变: c.338G>T和 c.358A>G, Compared with SEQ ID NO: 3, the HNF1A gene mutant has at least one mutation selected from the following: c.338G>T and c.358A>G,
任选地, 所述试剂为核酸探针。 Optionally, the reagent is a nucleic acid probe.
PCT/CN2013/073365 2013-03-28 2013-03-28 Gene mutant and use thereof WO2014153753A1 (en)

Priority Applications (3)

Application Number Priority Date Filing Date Title
CN201380074760.0A CN105263944B (en) 2013-03-28 Gene mutation body and its application
PCT/CN2013/073365 WO2014153753A1 (en) 2013-03-28 2013-03-28 Gene mutant and use thereof
HK16101540.5A HK1213573A1 (en) 2013-03-28 2016-02-12 Gene mutant and use thereof

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2013/073365 WO2014153753A1 (en) 2013-03-28 2013-03-28 Gene mutant and use thereof

Publications (1)

Publication Number Publication Date
WO2014153753A1 true WO2014153753A1 (en) 2014-10-02

Family

ID=51622389

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2013/073365 WO2014153753A1 (en) 2013-03-28 2013-03-28 Gene mutant and use thereof

Country Status (2)

Country Link
HK (1) HK1213573A1 (en)
WO (1) WO2014153753A1 (en)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1998011254A1 (en) * 1996-09-10 1998-03-19 Arch Development Corporation MUTATIONS IN THE DIABETES SUSCEPTIBILITY GENES HEPATOCYTE NUCLEAR FACTOR (HNF) 1 ALPHA (α), HNF-1β AND HNF-4$g(a)
CN1496412A (en) * 2001-03-14 2004-05-12 ���Ĵ���ѧ Method for estimating danger of diabetes typ B developed in the human species of Chinese bloodline and composition
CN1495191A (en) * 2002-09-16 2004-05-12 三星电子株式会社 HNF-1 alpha gene variant with new mononucleotide polymorphism and its coded protein variant
US20050181406A1 (en) * 2004-02-13 2005-08-18 Jeong Sung-Young HNF-1alpha gene including novel single-nucleotide polymorphism, protein encoded by the HNF-1alpha gene, and polynucleotide, microarray, kit, and method for diagnosis of MODY3
CN1254549C (en) * 2001-12-18 2006-05-03 三星电子株式会社 Multiple PCR primer group for human HNF-1 alpha gene amplification

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1998011254A1 (en) * 1996-09-10 1998-03-19 Arch Development Corporation MUTATIONS IN THE DIABETES SUSCEPTIBILITY GENES HEPATOCYTE NUCLEAR FACTOR (HNF) 1 ALPHA (α), HNF-1β AND HNF-4$g(a)
CN1496412A (en) * 2001-03-14 2004-05-12 ���Ĵ���ѧ Method for estimating danger of diabetes typ B developed in the human species of Chinese bloodline and composition
CN1254549C (en) * 2001-12-18 2006-05-03 三星电子株式会社 Multiple PCR primer group for human HNF-1 alpha gene amplification
CN1495191A (en) * 2002-09-16 2004-05-12 三星电子株式会社 HNF-1 alpha gene variant with new mononucleotide polymorphism and its coded protein variant
US20050181406A1 (en) * 2004-02-13 2005-08-18 Jeong Sung-Young HNF-1alpha gene including novel single-nucleotide polymorphism, protein encoded by the HNF-1alpha gene, and polynucleotide, microarray, kit, and method for diagnosis of MODY3

Also Published As

Publication number Publication date
HK1213573A1 (en) 2016-07-08
CN105263944A (en) 2016-01-20

Similar Documents

Publication Publication Date Title
KR102622309B1 (en) Detection of chromosomal interactions
CN104120132A (en) FBN1 genetic mutant and application thereof
CN111423503B (en) Novel mutant protein related to long QT syndrome, novel mutant gene and application thereof
AU2016351311B9 (en) SCAP gene mutant and the application thereof
EP3927844B1 (en) Piezo type mechanosensitive ion channel component 1 (piezo1) variants and uses thereof
CN106282195A (en) Gene mutation body and application thereof
CN113355332B (en) HEG1 gene mutant and application thereof
CN104745592B (en) CYP4V2 gene mutant and application thereof
CN104073499B (en) TMC1 gene mutation body and its application
CN115786356B (en) Arrhythmia right ventricular dysplasia cardiomyopathy variant gene CDH2 and application thereof
CN116004642A (en) Long QT syndrome variant gene KCNH2 and application thereof
CN106701709A (en) ATM (ataxia telangiectasia mutated) gene mutant and application thereof
CN104099338B (en) MYO15A gene mutation body and its application
WO2013078690A1 (en) Ankylosing spondylitis susceptibility and mononucleotide polymorphism detection method, kit and use thereof
JP4119490B2 (en) Genetic diagnosis of diseases involving TNF-α promoter
CN104164424A (en) CC2D2A gene mutant and application thereof
CN104745591B (en) CYP4V2 gene mutant and application thereof
WO2014153753A1 (en) Gene mutant and use thereof
CN110511992B (en) TARDBP mutant gene, detection primer and kit
CN105802974B (en) BCS1L gene mutant and application thereof
CN105263944B (en) Gene mutation body and its application
CN115948533B (en) Reagent for detecting MYBPC3 mutant gene and application thereof
CN114381513B (en) Biomarker related to X-linked lymphoproliferative disorder and application thereof
CN105255902B (en) Early-onset diabetes gene mutant and application thereof
CN115992145A (en) Long QT syndrome variant gene KCNQ1 and application thereof

Legal Events

Date Code Title Description
WWE Wipo information: entry into national phase

Ref document number: 201380074760.0

Country of ref document: CN

121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 13880491

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205N DATED 11/02/2016)

122 Ep: pct application non-entry in european phase

Ref document number: 13880491

Country of ref document: EP

Kind code of ref document: A1