CN113308527A - Gene composition, chip and kit for screening refractory hereditary bone diseases - Google Patents
Gene composition, chip and kit for screening refractory hereditary bone diseases Download PDFInfo
- Publication number
- CN113308527A CN113308527A CN202110255299.5A CN202110255299A CN113308527A CN 113308527 A CN113308527 A CN 113308527A CN 202110255299 A CN202110255299 A CN 202110255299A CN 113308527 A CN113308527 A CN 113308527A
- Authority
- CN
- China
- Prior art keywords
- kit
- gene
- hereditary
- library
- sequencing
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 108090000623 proteins and genes Proteins 0.000 title claims abstract description 53
- 208000020084 Bone disease Diseases 0.000 title claims abstract description 33
- 239000000203 mixture Substances 0.000 title claims abstract description 23
- 238000012216 screening Methods 0.000 title claims abstract description 23
- 239000000523 sample Substances 0.000 claims abstract description 30
- 238000000034 method Methods 0.000 claims abstract description 21
- 230000001717 pathogenic effect Effects 0.000 claims abstract description 16
- 238000001514 detection method Methods 0.000 claims abstract description 10
- 238000012163 sequencing technique Methods 0.000 claims description 38
- 238000012408 PCR amplification Methods 0.000 claims description 15
- 239000012634 fragment Substances 0.000 claims description 15
- -1 CHST 10 Proteins 0.000 claims description 12
- 230000002068 genetic effect Effects 0.000 claims description 10
- 101001120236 Crotalus durissus cumanensis Basic phospholipase A2 10 Proteins 0.000 claims description 9
- 230000007918 pathogenicity Effects 0.000 claims description 9
- 238000004458 analytical method Methods 0.000 claims description 8
- 238000009396 hybridization Methods 0.000 claims description 8
- 230000008569 process Effects 0.000 claims description 8
- 238000006243 chemical reaction Methods 0.000 claims description 7
- 101001003584 Homo sapiens Prelamin-A/C Proteins 0.000 claims description 6
- 102100026531 Prelamin-A/C Human genes 0.000 claims description 6
- 238000002474 experimental method Methods 0.000 claims description 5
- 238000012165 high-throughput sequencing Methods 0.000 claims description 5
- 238000002360 preparation method Methods 0.000 claims description 5
- 101001034831 Homo sapiens Interferon-induced transmembrane protein 5 Proteins 0.000 claims description 4
- 102100039731 Interferon-induced transmembrane protein 5 Human genes 0.000 claims description 4
- 108091081062 Repeated sequence (DNA) Proteins 0.000 claims description 4
- 238000001914 filtration Methods 0.000 claims description 4
- 102100030489 15-hydroxyprostaglandin dehydrogenase [NAD(+)] Human genes 0.000 claims description 3
- IGRCWJPBLWGNPX-UHFFFAOYSA-N 3-(2-chlorophenyl)-n-(4-chlorophenyl)-n,5-dimethyl-1,2-oxazole-4-carboxamide Chemical compound C=1C=C(Cl)C=CC=1N(C)C(=O)C1=C(C)ON=C1C1=CC=CC=C1Cl IGRCWJPBLWGNPX-UHFFFAOYSA-N 0.000 claims description 3
- SXXLKZCNJHJYFL-UHFFFAOYSA-N 4,5,6,7-tetrahydro-[1,2]oxazolo[4,5-c]pyridin-5-ium-3-olate Chemical compound C1CNCC2=C1ONC2=O SXXLKZCNJHJYFL-UHFFFAOYSA-N 0.000 claims description 3
- 102100032295 A disintegrin and metalloproteinase with thrombospondin motifs 10 Human genes 0.000 claims description 3
- 108091022879 ADAMTS Proteins 0.000 claims description 3
- 102000029750 ADAMTS Human genes 0.000 claims description 3
- 108091005668 ADAMTS10 Proteins 0.000 claims description 3
- 102100034111 Activin receptor type-1 Human genes 0.000 claims description 3
- 102100036601 Aggrecan core protein Human genes 0.000 claims description 3
- 102100025683 Alkaline phosphatase, tissue-nonspecific isozyme Human genes 0.000 claims description 3
- 102100034112 Alkyldihydroxyacetonephosphate synthase, peroxisomal Human genes 0.000 claims description 3
- 102100035028 Alpha-L-iduronidase Human genes 0.000 claims description 3
- 102100034273 Annexin A7 Human genes 0.000 claims description 3
- 102100031491 Arylsulfatase B Human genes 0.000 claims description 3
- 102100025440 BMP-binding endothelial regulator protein Human genes 0.000 claims description 3
- 102100036597 Basement membrane-specific heparan sulfate proteoglycan core protein Human genes 0.000 claims description 3
- 101001042041 Bos taurus Isocitrate dehydrogenase [NAD] subunit beta, mitochondrial Proteins 0.000 claims description 3
- 108010050543 Calcium-Sensing Receptors Proteins 0.000 claims description 3
- 102100035673 Centrosomal protein of 290 kDa Human genes 0.000 claims description 3
- 101710198317 Centrosomal protein of 290 kDa Proteins 0.000 claims description 3
- 102100029318 Chondroitin sulfate synthase 1 Human genes 0.000 claims description 3
- UDMBCSSLTHHNCD-UHFFFAOYSA-N Coenzym Q(11) Natural products C1=NC=2C(N)=NC=NC=2N1C1OC(COP(O)(O)=O)C(O)C1O UDMBCSSLTHHNCD-UHFFFAOYSA-N 0.000 claims description 3
- 108010016788 Cyclin-Dependent Kinase Inhibitor p21 Proteins 0.000 claims description 3
- 102100033270 Cyclin-dependent kinase inhibitor 1 Human genes 0.000 claims description 3
- 101100480904 Drosophila melanogaster tctn gene Proteins 0.000 claims description 3
- 102100021242 Dymeclin Human genes 0.000 claims description 3
- 102100035650 Extracellular calcium-sensing receptor Human genes 0.000 claims description 3
- 102100039991 Heparan-alpha-glucosaminide N-acetyltransferase Human genes 0.000 claims description 3
- 102100030308 Homeobox protein Hox-A11 Human genes 0.000 claims description 3
- 102100030307 Homeobox protein Hox-A13 Human genes 0.000 claims description 3
- 102100040227 Homeobox protein Hox-D13 Human genes 0.000 claims description 3
- 101001126430 Homo sapiens 15-hydroxyprostaglandin dehydrogenase [NAD(+)] Proteins 0.000 claims description 3
- 101100323309 Homo sapiens ANXA7 gene Proteins 0.000 claims description 3
- 101000799140 Homo sapiens Activin receptor type-1 Proteins 0.000 claims description 3
- 101000999998 Homo sapiens Aggrecan core protein Proteins 0.000 claims description 3
- 101000574445 Homo sapiens Alkaline phosphatase, tissue-nonspecific isozyme Proteins 0.000 claims description 3
- 101000799143 Homo sapiens Alkyldihydroxyacetonephosphate synthase, peroxisomal Proteins 0.000 claims description 3
- 101001019502 Homo sapiens Alpha-L-iduronidase Proteins 0.000 claims description 3
- 101000923070 Homo sapiens Arylsulfatase B Proteins 0.000 claims description 3
- 101000934632 Homo sapiens BMP-binding endothelial regulator protein Proteins 0.000 claims description 3
- 101001000001 Homo sapiens Basement membrane-specific heparan sulfate proteoglycan core protein Proteins 0.000 claims description 3
- 101100383804 Homo sapiens CHSY1 gene Proteins 0.000 claims description 3
- 101000817629 Homo sapiens Dymeclin Proteins 0.000 claims description 3
- 101001035092 Homo sapiens Heparan-alpha-glucosaminide N-acetyltransferase Proteins 0.000 claims description 3
- 101001083158 Homo sapiens Homeobox protein Hox-A11 Proteins 0.000 claims description 3
- 101001037168 Homo sapiens Homeobox protein Hox-D13 Proteins 0.000 claims description 3
- 101000840540 Homo sapiens Iduronate 2-sulfatase Proteins 0.000 claims description 3
- 101001002634 Homo sapiens Interleukin-1 alpha Proteins 0.000 claims description 3
- 101001044336 Homo sapiens Intraflagellar transport protein 122 homolog Proteins 0.000 claims description 3
- 101000960200 Homo sapiens Intraflagellar transport protein 140 homolog Proteins 0.000 claims description 3
- 101000960114 Homo sapiens Intraflagellar transport protein 172 homolog Proteins 0.000 claims description 3
- 101000961164 Homo sapiens Intraflagellar transport protein 43 homolog Proteins 0.000 claims description 3
- 101001010727 Homo sapiens Intraflagellar transport protein 80 homolog Proteins 0.000 claims description 3
- 101000960234 Homo sapiens Isocitrate dehydrogenase [NADP] cytoplasmic Proteins 0.000 claims description 3
- 101000599886 Homo sapiens Isocitrate dehydrogenase [NADP], mitochondrial Proteins 0.000 claims description 3
- 101001042362 Homo sapiens Leukemia inhibitory factor receptor Proteins 0.000 claims description 3
- 101000973618 Homo sapiens NF-kappa-B essential modulator Proteins 0.000 claims description 3
- 101000913761 Homo sapiens Serine/threonine-protein kinase ICK Proteins 0.000 claims description 3
- 101000657845 Homo sapiens Small nuclear ribonucleoprotein-associated proteins B and B' Proteins 0.000 claims description 3
- 101000809875 Homo sapiens TYRO protein tyrosine kinase-binding protein Proteins 0.000 claims description 3
- 101000620880 Homo sapiens Tartrate-resistant acid phosphatase type 5 Proteins 0.000 claims description 3
- 101000799461 Homo sapiens Thrombopoietin Proteins 0.000 claims description 3
- 101001067244 Homo sapiens Transcription factor HES-7 Proteins 0.000 claims description 3
- 101000979190 Homo sapiens Transcription factor MafB Proteins 0.000 claims description 3
- 101000635938 Homo sapiens Transforming growth factor beta-1 proprotein Proteins 0.000 claims description 3
- 101000625842 Homo sapiens Tubulin-specific chaperone E Proteins 0.000 claims description 3
- 101000655533 Homo sapiens dTDP-D-glucose 4,6-dehydratase Proteins 0.000 claims description 3
- 102100029199 Iduronate 2-sulfatase Human genes 0.000 claims description 3
- 102100026214 Indian hedgehog protein Human genes 0.000 claims description 3
- 101710139099 Indian hedgehog protein Proteins 0.000 claims description 3
- 102100020881 Interleukin-1 alpha Human genes 0.000 claims description 3
- 102100021502 Intraflagellar transport protein 122 homolog Human genes 0.000 claims description 3
- 102100039927 Intraflagellar transport protein 140 homolog Human genes 0.000 claims description 3
- 102100039929 Intraflagellar transport protein 172 homolog Human genes 0.000 claims description 3
- 102100039342 Intraflagellar transport protein 43 homolog Human genes 0.000 claims description 3
- 102100030002 Intraflagellar transport protein 80 homolog Human genes 0.000 claims description 3
- 102100039905 Isocitrate dehydrogenase [NADP] cytoplasmic Human genes 0.000 claims description 3
- 102100037845 Isocitrate dehydrogenase [NADP], mitochondrial Human genes 0.000 claims description 3
- 102100021747 Leukemia inhibitory factor receptor Human genes 0.000 claims description 3
- 102100025889 Lon protease homolog 2, peroxisomal Human genes 0.000 claims description 3
- 101710167388 Lon protease homolog, mitochondrial Proteins 0.000 claims description 3
- 101100129193 Mus musculus Man2b1 gene Proteins 0.000 claims description 3
- 101100485095 Mus musculus Wnt10b gene Proteins 0.000 claims description 3
- 102100022219 NF-kappa-B essential modulator Human genes 0.000 claims description 3
- 102000003480 SLC35D Human genes 0.000 claims description 3
- 108060008450 SLC35D Proteins 0.000 claims description 3
- 102100026621 Serine/threonine-protein kinase ICK Human genes 0.000 claims description 3
- 102100034683 Small nuclear ribonucleoprotein-associated proteins B and B' Human genes 0.000 claims description 3
- 102100029538 Structural maintenance of chromosomes protein 1A Human genes 0.000 claims description 3
- 102000003563 TRPV Human genes 0.000 claims description 3
- 108060008564 TRPV Proteins 0.000 claims description 3
- 102100038717 TYRO protein tyrosine kinase-binding protein Human genes 0.000 claims description 3
- 102100022919 Tartrate-resistant acid phosphatase type 5 Human genes 0.000 claims description 3
- 102100034195 Thrombopoietin Human genes 0.000 claims description 3
- 102100034423 Transcription factor HES-7 Human genes 0.000 claims description 3
- 102100023234 Transcription factor MafB Human genes 0.000 claims description 3
- 102100030742 Transforming growth factor beta-1 proprotein Human genes 0.000 claims description 3
- 206010051956 Trichorhinophalangeal syndrome Diseases 0.000 claims description 3
- 102100024769 Tubulin-specific chaperone E Human genes 0.000 claims description 3
- 108060008683 Tumor Necrosis Factor Receptor Proteins 0.000 claims description 3
- 101100156738 Xenopus laevis wnt10 gene Proteins 0.000 claims description 3
- UDMBCSSLTHHNCD-KQYNXXCUSA-N adenosine 5'-monophosphate Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](COP(O)(O)=O)[C@@H](O)[C@H]1O UDMBCSSLTHHNCD-KQYNXXCUSA-N 0.000 claims description 3
- 229950006790 adenosine phosphate Drugs 0.000 claims description 3
- 102100032354 dTDP-D-glucose 4,6-dehydratase Human genes 0.000 claims description 3
- 108010021685 homeobox protein HOXA13 Proteins 0.000 claims description 3
- ONCZDRURRATYFI-QTCHDTBASA-N methyl (2z)-2-methoxyimino-2-[2-[[(e)-1-[3-(trifluoromethyl)phenyl]ethylideneamino]oxymethyl]phenyl]acetate Chemical compound CO\N=C(/C(=O)OC)C1=CC=CC=C1CO\N=C(/C)C1=CC=CC(C(F)(F)F)=C1 ONCZDRURRATYFI-QTCHDTBASA-N 0.000 claims description 3
- APVPOHHVBBYQAV-UHFFFAOYSA-N n-(4-aminophenyl)sulfonyloctadecanamide Chemical compound CCCCCCCCCCCCCCCCCC(=O)NS(=O)(=O)C1=CC=C(N)C=C1 APVPOHHVBBYQAV-UHFFFAOYSA-N 0.000 claims description 3
- 108010004731 structural maintenance of chromosome protein 1 Proteins 0.000 claims description 3
- 208000024891 symptom Diseases 0.000 claims description 3
- 102000003298 tumor necrosis factor receptor Human genes 0.000 claims description 3
- 238000004140 cleaning Methods 0.000 claims description 2
- 108020004414 DNA Proteins 0.000 claims 4
- 101000711796 Homo sapiens Sclerostin Proteins 0.000 claims 1
- 102100034201 Sclerostin Human genes 0.000 claims 1
- 101710106660 Shutoff alkaline exonuclease Proteins 0.000 claims 1
- 239000000047 product Substances 0.000 abstract description 7
- 238000003745 diagnosis Methods 0.000 abstract description 6
- 238000007405 data analysis Methods 0.000 abstract description 5
- 230000000694 effects Effects 0.000 abstract description 5
- 238000012268 genome sequencing Methods 0.000 abstract description 3
- 239000013589 supplement Substances 0.000 abstract description 2
- 201000010099 disease Diseases 0.000 description 10
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 10
- 230000035772 mutation Effects 0.000 description 8
- 239000011324 bead Substances 0.000 description 5
- 206010064571 Gene mutation Diseases 0.000 description 4
- OIRDTQYFTABQOQ-KQYNXXCUSA-N adenosine Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O OIRDTQYFTABQOQ-KQYNXXCUSA-N 0.000 description 4
- 230000003321 amplification Effects 0.000 description 4
- 238000003199 nucleic acid amplification method Methods 0.000 description 4
- 239000002096 quantum dot Substances 0.000 description 4
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 3
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 3
- 108060002716 Exonuclease Proteins 0.000 description 3
- 208000020221 Short stature Diseases 0.000 description 3
- 238000000246 agarose gel electrophoresis Methods 0.000 description 3
- 238000009826 distribution Methods 0.000 description 3
- 102000013165 exonuclease Human genes 0.000 description 3
- 238000013467 fragmentation Methods 0.000 description 3
- 238000006062 fragmentation reaction Methods 0.000 description 3
- 238000000746 purification Methods 0.000 description 3
- 230000008439 repair process Effects 0.000 description 3
- YBJHBAHKTGYVGT-ZKWXMUAHSA-N (+)-Biotin Chemical group N1C(=O)N[C@@H]2[C@H](CCCCC(=O)O)SC[C@@H]21 YBJHBAHKTGYVGT-ZKWXMUAHSA-N 0.000 description 2
- 239000002126 C01EB10 - Adenosine Substances 0.000 description 2
- 102000012410 DNA Ligases Human genes 0.000 description 2
- 108010061982 DNA Ligases Proteins 0.000 description 2
- 102000004190 Enzymes Human genes 0.000 description 2
- 108090000790 Enzymes Proteins 0.000 description 2
- 208000004286 Osteochondrodysplasias Diseases 0.000 description 2
- 229960005305 adenosine Drugs 0.000 description 2
- 238000007796 conventional method Methods 0.000 description 2
- 230000007547 defect Effects 0.000 description 2
- 230000006870 function Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 239000013610 patient sample Substances 0.000 description 2
- 125000002467 phosphate group Chemical group [H]OP(=O)(O[H])O[*] 0.000 description 2
- 238000011160 research Methods 0.000 description 2
- 230000000717 retained effect Effects 0.000 description 2
- 238000005406 washing Methods 0.000 description 2
- 108700028369 Alleles Proteins 0.000 description 1
- 206010070918 Bone deformity Diseases 0.000 description 1
- 101150082216 COL2A1 gene Proteins 0.000 description 1
- 102100033601 Collagen alpha-1(I) chain Human genes 0.000 description 1
- 102100036213 Collagen alpha-2(I) chain Human genes 0.000 description 1
- 208000013558 Developmental Bone disease Diseases 0.000 description 1
- 206010058314 Dysplasia Diseases 0.000 description 1
- 206010053759 Growth retardation Diseases 0.000 description 1
- 208000016621 Hearing disease Diseases 0.000 description 1
- 208000028782 Hereditary disease Diseases 0.000 description 1
- 101000875067 Homo sapiens Collagen alpha-2(I) chain Proteins 0.000 description 1
- 101000891031 Homo sapiens Peptidyl-prolyl cis-trans isomerase FKBP10 Proteins 0.000 description 1
- 208000026350 Inborn Genetic disease Diseases 0.000 description 1
- 208000024556 Mendelian disease Diseases 0.000 description 1
- 206010031243 Osteogenesis imperfecta Diseases 0.000 description 1
- 102100040349 Peptidyl-prolyl cis-trans isomerase FKBP10 Human genes 0.000 description 1
- 206010072610 Skeletal dysplasia Diseases 0.000 description 1
- 230000002159 abnormal effect Effects 0.000 description 1
- 230000005856 abnormality Effects 0.000 description 1
- 108010029483 alpha 1 Chain Collagen Type I Proteins 0.000 description 1
- 229960002685 biotin Drugs 0.000 description 1
- 235000020958 biotin Nutrition 0.000 description 1
- 239000011616 biotin Substances 0.000 description 1
- 239000000872 buffer Substances 0.000 description 1
- 239000007853 buffer solution Substances 0.000 description 1
- 230000022159 cartilage development Effects 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- 238000003776 cleavage reaction Methods 0.000 description 1
- 238000003759 clinical diagnosis Methods 0.000 description 1
- 230000019771 cognition Effects 0.000 description 1
- 230000000295 complement effect Effects 0.000 description 1
- 238000005520 cutting process Methods 0.000 description 1
- 238000001962 electrophoresis Methods 0.000 description 1
- 238000011156 evaluation Methods 0.000 description 1
- 210000001808 exosome Anatomy 0.000 description 1
- 231100000221 frame shift mutation induction Toxicity 0.000 description 1
- 230000037433 frameshift Effects 0.000 description 1
- 239000000499 gel Substances 0.000 description 1
- 231100000001 growth retardation Toxicity 0.000 description 1
- 208000006750 hematuria Diseases 0.000 description 1
- 238000003384 imaging method Methods 0.000 description 1
- 230000009916 joint effect Effects 0.000 description 1
- 230000003340 mental effect Effects 0.000 description 1
- 206010028197 multiple epiphyseal dysplasia Diseases 0.000 description 1
- 239000002773 nucleotide Substances 0.000 description 1
- 125000003729 nucleotide group Chemical group 0.000 description 1
- 238000001303 quality assessment method Methods 0.000 description 1
- 238000012372 quality testing Methods 0.000 description 1
- 230000003252 repetitive effect Effects 0.000 description 1
- 238000007480 sanger sequencing Methods 0.000 description 1
- 230000007017 scission Effects 0.000 description 1
- 230000012488 skeletal system development Effects 0.000 description 1
- 206010062920 spondyloepiphyseal dysplasia Diseases 0.000 description 1
- 201000002962 spondyloepiphyseal dysplasia with congenital joint dislocations Diseases 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 230000009897 systematic effect Effects 0.000 description 1
- 231100000378 teratogenic Toxicity 0.000 description 1
- 230000003390 teratogenic effect Effects 0.000 description 1
- 238000012549 training Methods 0.000 description 1
- 238000012795 verification Methods 0.000 description 1
- 208000029257 vision disease Diseases 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6876—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
- C12Q1/6883—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6869—Methods for sequencing
- C12Q1/6874—Methods for sequencing involving nucleic acid arrays, e.g. sequencing by hybridisation
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q2600/00—Oligonucleotides characterized by their use
- C12Q2600/156—Polymorphic or mutational markers
Landscapes
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Organic Chemistry (AREA)
- Health & Medical Sciences (AREA)
- Zoology (AREA)
- Engineering & Computer Science (AREA)
- Wood Science & Technology (AREA)
- Genetics & Genomics (AREA)
- Analytical Chemistry (AREA)
- Microbiology (AREA)
- Molecular Biology (AREA)
- Immunology (AREA)
- Biotechnology (AREA)
- Biophysics (AREA)
- Biochemistry (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Physics & Mathematics (AREA)
- Pathology (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
Abstract
The invention provides a gene composition, a chip and a kit for screening refractory hereditary bone diseases, wherein the gene composition comprises 322 hereditary bone disease related pathogenic genes. Aiming at the requirement of difficult diagnosis and treatment of difficult osteopathia in China, the group of genome compositions related to hereditary osteopathia and related products (gene chips and kits) provided by the invention are developed by combining genome sequencing and big data analysis on the basis of the existing knowledge of hereditary osteopathia and can be used for diagnosing various hereditary osteopathia, and the kit has the advantages that: the probe can be flexibly customized, the target gene and region can be captured specifically, and especially the pathogenic region not contained in WES can also be captured and sequenced; the method can supplement and optimize areas which cannot be captured by WES, achieves better capturing effect, and is high in coverage rate and high in variation detection precision; the cost performance is higher, the clinical detection is quicker, and the cost is lower.
Description
Technical Field
The invention relates to the technical field of biology, in particular to a gene composition, a chip and a kit for screening difficult hereditary bone diseases.
Background
Hereditary osteopathy refers to a hereditary disease with abnormal skeletal development caused by genetic factor change, and the clinical phenotype includes growth retardation, short and small stature, long bone deformity, limited joint activity and the like, and the clinical and genetic heterogeneity is large. Although the incidence of diseases is low, the diseases are teratogenic and disabling, most of the diseases are born in young children and are inherited generation by generation or generation after generation, and huge mental and economic burdens are brought to patients and family members of the patients. Therefore, the definite diagnosis has great significance for the subsequent treatment and the prepotency of the patients.
At present, the diagnosis of hereditary bone diseases by clinicians is generally based on clinical history data, laboratory examination indexes, imaging examination and genetic detection of patients, but most of patients with hereditary bone diseases are not diagnosed and treated completely, and the following reasons specifically exist: 1. 461 hereditary bone diseases are discovered in total in 2019, the bone diseases are various and rare, most of the bone diseases are often only shown as skeletal deformity without hematuria biochemical abnormality, doctors have no systematic training, and the diseases cannot be diagnosed clearly due to insufficient cognition; 2. the phenotype of partial osteopathia is overlapped, most of the hereditary osteopathia can show short stature, but causes of short stature are different, and the hereditary chondrogenesis is imperfecta, spondyloepiphyseal dysplasia and pseudochondrogenesis imperfecta can show short stature; 3. the genetic heterogeneity of hereditary osteopathy is high, the same disease can have a plurality of pathogenic genes, for example, osteogenesis imperfecta can be caused by a plurality of gene mutations such as COL1A1, COL1A2, IFITM5, FKBP10, and the like, and the clinical diagnosis can only be carried out on the whole according to the medical history data of patients, and the individual treatment cannot be carried out by specific typing; different types of diseases caused by the same gene mutation, for example, COL2A1 gene mutation can cause spinal epiphyseal dysplasia, multiple epiphyseal dysplasia with vision and hearing disorders, Kniest bone dysplasia and the like, and great difficulty is added to diagnosis.
The pathogenic genes of hereditary bone diseases are more, doctors often cannot accurately locate the pathogenic genes, and the pathogenic genes or the pathogenic genes are determined by sequencing one by one candidate genes, so that the sequencing cost and time are increased invisibly. In the sequencing process, the sequencing data has the defects of overlarge N content, low sequencing quality, high base number proportion, low reads quality caused by sequence pollution and the like, influence on subsequent analysis, and poor coverage rate and low capture rate caused by high GC content.
Therefore, there is a need for an optimized product and method for detecting a gene associated with hereditary bone disease.
Disclosure of Invention
Aiming at the defects of the prior art, the invention provides a gene composition, a chip and a kit for screening difficult hereditary osteopathia, and the kit is suitable for screening hereditary osteopathia of Chinese.
In order to achieve the purpose, the invention adopts the following technical scheme:
in a first aspect, the present invention provides a genomic composition for screening refractory hereditary bone diseases, which comprises the following 322 genes:
ACAN, ACP5, ACVR1, ADAMTS10, ADAMTSL 10, AGA, AGPS, AKT 10, ALPL, ALX 10, AMER 10, ANKH, ANO 10, ANTXR 10, ARHGAP 10, ARSB, ATP6V0A 10, B3GALT 10, B4GALT 10, BHA LH72, BMP 10, BMPER, BMPR1 10, CA 10, CANT 10, CASR, CC2D 10, CCDC 10, CDC 10, CDH 10, CDKN1 10, CDT 10, CEP290, CHST 10, CHSY 72, CK3672, CACKAP 210, GRECD 10, CAFFC 10, CDC 10, CDCOGN 10, CDC 10, CAFFC 10, HES7, HGSNAT, HOXA11, HOXA13, HOXD13, HPGD, HSPG2, ICK, IDH1, IDH2, IDS, IDUA, IFITM5, IFT122, IFT140, IFT172, IFT43, IFT80, IHH, IKBKG, IL1 80, IMPAD 80, INPPL 80, KAT6 80, KIF 80, LBR, LEMD 80, LEPRE 80, LIFR, LMNA 80, LMNA 1 LMX 80, LONP 80, LPIN 80, LRP 80, LTBP 80, MAFB 80, MAN2B 80, MAN2C 80, MANBA, MATN 80, MENFN 80, MENFR 6863672, HO 80, HONP 72, PHNFP 80, NFP 80, NFR 80, NFP 80, NFR 80, NFP 36NRNPNFP 80, 36NRNPNFP 80, 36NRPSN 80, 36NRNPNFP 80, 363672, 80, 36363672, 80, 36NRNPNFP 36P 80, 36P 80, 363636P 36P 80, 36P 80, 36P 80, 36P 80, 363672, 80, 36P 80, 36P 80, 36P 80, 36P 80, 36P 80, 36P 80, 36P 80, 36P 80, 36P 80, 36P 80, 36P 80, 36P 80, 36P 80, 36P 80, 36P 80, 36P 80, 36P 80, 36P 80, 36P 80, 36P, SKI, SLC17A, SLC26A, SLC29A, SLC34A, SLC35D, SLC39A, SLCO2A, SLCO5A, SMAD, SMARCAL, SMC1, SMC, SNRPB, SNX, SOST, SOX, SP, SULF, SUMF, TBCE, TBX, TBXAS, TCF, TCIRG, TCOF, TCTN, TGDS, TGFB, TGFBR, THPO, TMZEM 216, TMEM38, TMEM, TNFRSF11, TNFR, TP, TRAPPC, TREM, TRIP, TRPS, TRPV, TTC21, TWIST, TYROBP, WDR, WISP, WNT10, WNT, WNSF 5, WNT, YZT, XZT, XLT and XLT.
In a second aspect, the invention provides an application of the above genome composition in preparing a kit for screening refractory hereditary bone diseases.
In a third aspect, the present invention also provides a gene chip for screening refractory hereditary bone diseases, which comprises a probe for the above gene composition.
In a fourth aspect, the invention also provides a targeted capture sequencing kit comprising probes against the above genomic composition.
In a fifth aspect, the present invention also provides a kit for screening problematic hereditary bone diseases, which comprises the above gene chip or a probe for the above gene composition.
Further, the kit further comprises the following instructions:
a. establishing a DNA library comprising the gene composition;
quality detection and high-throughput sequencing of the DNA library to obtain FastQ data; and
c. the FastQ data described above was analyzed.
Further, the instruction a specifically includes the following steps:
(1) detecting the quality of the sample DNA;
(2) sorting, purifying and fragmenting DNA;
(3) carrying out end repairing reaction;
(4) adding single adenylic acid at the 3' end;
(5) connecting a sequencing joint;
(6) screening gene library fragments;
(7) PCR amplifying the gene library;
(8) hybridizing the target area chips;
(9) cleaning and purifying the hybridization library; and
(10) the hybridization library was amplified by PCR.
Furthermore, the method adopted by the sample DNA quality detection comprises agarose gel electrophoresis and Nanodrop 2000/Qubit.
Further, the instruction c specifically includes the following steps:
(1) obtaining a preliminary comparison result by using BWA software; and
(2) the Picard software is used for counting the primary alignment results, and the statistics include the number and the proportion of the sequences in the alignment, the proportion of redundant sequences (Duplicate Reads) caused by PCR amplification in the sample preparation and capture experiment processes, the proportion of sequences Q20 and Q30, the average coverage depth and the coverage proportion of the target area from 1X to 100X.
Further, the kit further comprises the following instructions:
d. and (3) correcting the primary comparison result: the method adopts a GATK standard flow to correct the repeated sequence caused by PCR amplification, error comparison generated by InDel and base quality, accurately identifies SNV and InDel, and greatly reduces false positive and false negative generated in the sequencing and comparison processes;
e. SNV filtering is carried out on the high-throughput sequencing data, and then pathogenicity analysis is carried out to evaluate by combining clinical symptoms, so that a pathogenic gene is determined.
Further, the above pathogenicity analysis uses software or databases including SIFT, POLYPHen V2, MutationTaster, and Cadd Raw.
By adopting the technical scheme, compared with the prior art, the invention has the following technical effects:
aiming at the requirement of difficult diagnosis and treatment of difficult osteopathia in China, the group of genome compositions related to hereditary osteopathia and related products (gene chips and kits) provided by the invention are developed by combining genome sequencing and big data analysis on the basis of the existing knowledge of hereditary osteopathia and can be used for diagnosing various hereditary osteopathia, and the kit has the advantages that: the probe can be flexibly customized, the target gene and region can be captured specifically, and especially the pathogenic region not contained in WES can also be captured and sequenced; the method can supplement and optimize areas which cannot be captured by WES, achieves better capturing effect, and is high in coverage rate and high in variation detection precision; the cost performance is higher, the clinical detection is quicker, and the cost is lower.
Drawings
FIG. 1 is a graph of base quality distribution of raw data for 15 patients according to an embodiment of the present invention.
Detailed Description
The invention provides a gene composition, a chip and a kit for screening refractory hereditary bone diseases according to the characteristics of hereditary bone diseases of Chinese people, wherein the gene composition comprises the following 322 genes:
ACAN, ACP5, ACVR1, ADAMTS10, ADAMTSL 10, AGA, AGPS, AKT 10, ALPL, ALX 10, AMER 10, ANKH, ANO 10, ANTXR 10, ARHGAP 10, ARSB, ATP6V0A 10, B3GALT 10, B4GALT 10, BHA LH72, BMP 10, BMPER, BMPR1 10, CA 10, CANT 10, CASR, CC2D 10, CCDC 10, CDC 10, CDH 10, CDKN1 10, CDT 10, CEP290, CHST 10, CHSY 72, CK3672, CACKAP 210, GRECD 10, CAFFC 10, CDC 10, CDCOGN 10, CDC 10, CAFFC 10, HES7, HGSNAT, HOXA11, HOXA13, HOXD13, HPGD, HSPG2, ICK, IDH1, IDH2, IDS, IDUA, IFITM5, IFT122, IFT140, IFT172, IFT43, IFT80, IHH, IKBKG, IL1 80, IMPAD 80, INPPL 80, KAT6 80, KIF 80, LBR, LEMD 80, LEPRE 80, LIFR, LMNA 80, LMNA 1 LMX 80, LONP 80, LPIN 80, LRP 80, LTBP 80, MAFB 80, MAN2B 80, MAN2C 80, MANBA, MATN 80, MENFN 80, MENFR 6863672, HO 80, HONP 72, PHNFP 80, NFP 80, NFR 80, NFP 80, NFR 80, NFP 36NRNPNFP 80, 36NRNPNFP 80, 36NRPSN 80, 36NRNPNFP 80, 363672, 80, 36363672, 80, 36NRNPNFP 36P 80, 36P 80, 363636P 36P 80, 36P 80, 36P 80, 36P 80, 363672, 80, 36P 80, 36P 80, 36P 80, 36P 80, 36P 80, 36P 80, 36P 80, 36P 80, 36P 80, 36P 80, 36P 80, 36P 80, 36P 80, 36P 80, 36P 80, 36P 80, 36P 80, 36P 80, 36P 80, 36P, SKI, SLC17A, SLC26A, SLC29A, SLC34A, SLC35D, SLC39A, SLCO2A, SLCO5A, SMAD, SMARCAL, SMC1, SMC, SNRPB, SNX, SOST, SOX, SP, SULF, SUMF, TBCE, TBX, TBXAS, TCF, TCIRG, TCOF, TCTN, TGDS, TGFB, TGFBR, THPO, TMZEM 216, TMEM38, TMEM, TNFRSF11, TNFR, TP, TRAPPC, TREM, TRIP, TRPS, TRPV, TTC21, TWIST, TYROBP, WDR, WISP, WNT10, WNT, WNSF 5, WNT, YZT, XZT, XLT and XLT.
The specific position and coverage information of the 322 genetic bone disease related pathogenic genes are shown in the following table 1:
TABLE 1322 detailed location and overlay information of hereditary osteopathy-related pathogenic genes
In addition, the gene chip and the kit comprise probes for the 322 genes, and more than 400 known hereditary bone diseases can be detected.
The present invention will be described in detail and specifically with reference to the following examples to facilitate better understanding of the present invention, but the following examples do not limit the scope of the present invention.
In the examples, the conventional methods were used unless otherwise specified, and reagents used were those conventionally commercially available or formulated according to the conventional methods without specifically specified.
Example 1
The embodiment provides a method for screening refractory hereditary bone diseases, which is based on the principle that a target capture sequencing kit is customized according to the 322 genes, and comprises the following steps:
DNA library establishment:
1) sample DNA Quality detection (Evaluation of DNA Quality): high quality genomic DNA is a prerequisite for conducting genome sequencing experiments. The quality detection of the genome DNA is carried out by the following two methods: agarose gel electrophoresis for genomic DNA integrity: the electrophoretic bands are required to be clearly visible without obvious tailing; the method comprises the following steps of (1) detecting the concentration and the quality of genome DNA by using a Nanodrop 2000/Qubit: the required concentration is more than or equal to 50 ng/mu L, the total amount is more than or equal to 2 mu g, and OD260/280 is 1.8-2.0.
2) Sorting, purifying and fragmenting DNA (DNA Purification and cleavage): taking genome DNA for ultrasonic fragmentation. After fragmentation, the DNA length is mostly between 100-500 bp.
3) End Repair reaction (End Repair): adding a terminal completion system into the purified DNA fragment, wherein the single-stranded protrusion at the 3 'end is digested by the Exonuclease (Exonuclease) activity of T4 DNA Polymerase (T4 DNA Polymerase), and the 5' end protrusion is completed by the Polymerase (Polymerase) activity; meanwhile, phosphate groups necessary for subsequent ligation reaction are added to the 5 'end of Phosphokinase (PNK), and the mixture is purified by Agencourt AMpure XP magnetic beads to finally obtain a blunt-end DNA short fragment library containing the phosphate groups at the 5' end.
4) Reaction with "A" at the 3 'end (Adenlylate 3' Ends): to the above system was added an "A" buffer reaction system at the 3' end. The single adenosine A is added at the end of the double-stranded DNA3 'with the end modified, so that the blunt end self-connection between DNA fragments is prevented, and the single adenosine A can also be matched with the single T protrusion at the 5' end of the sequencing adaptor in the next step in a complementary manner, so that the accurate connection is realized, and the self-series connection between the library fragments is effectively reduced.
5) Ligation sequencing linker (Adapter Ligation): adding a connecting buffer solution and a double-stranded sequencing adaptor into the reaction system, and connecting the illumina sequencing adaptor to both ends of the library DNA by using T4 DNA ligase.
6) Library fragment screening (Size Selection): for the adaptor-added library, fragments of different sizes were separated by agarose gel electrophoresis, and fragments of appropriate length were recovered by gel cutting and used for the next PCR amplification. The purified library eliminates excessive sequencing joint and joint self-connection products in the system, avoids ineffective amplification in the PCR process and eliminates the influence on-machine sequencing.
7) PCR Amplification of DNA library (PCR Amplification): the original library was amplified using high fidelity polymerase to ensure sufficient library throughput. In addition, since only the DNA fragments with adapters attached to both ends can be amplified, this step can effectively enrich this DNA. The number of PCR amplification cycles is controlled to be between 10 and 12, and the deviation caused by overlarge number of amplification cycles is reduced on the premise of ensuring enough products; finally, the concentration of each library was accurately determined using Qubit.
8) Target area chip Hybridization (Library Hybridization): the library was hybridized to probes using the Agilent surelectxt custom Kit for region of interest capture.
9) Hybrid Library washing and Purification (Library Wash and Purification): by means of coupled biotin groups on the probe, usingMyOneTMStreptavidin T1 magnetic beads enriched probes. After the hybridization step in step 8), the library DNA to which the probe specifically binds is enriched. After further washing to remove DNA non-specifically bound to the magnetic beads, the DNA belonging to the region of interest in the library is enriched.
10) PCR Amplification of DNA library (PCR Amplification): the original library was amplified using high fidelity polymerase to ensure sufficient library throughput. The number of PCR amplification cycles is controlled between 10 and 12. And on the premise of ensuring that the product is enough, the bias introduced due to the overlarge number of amplification cycles is reduced. And purifying the amplified library by magnetic beads to obtain the sequencing library which can be operated on a computer.
11) Quality testing of the Library (Library Quality Assessment): the library concentration and the library fragment length distribution were measured using the Qubit and Agilent 2100Bioanalyzer, respectively. The required concentration was >5 ng/. mu.l and the fragment lengths were centered between 300 and 400 bp.
12) Sequencing on a machine (Sequencing): the library was finally subjected to high throughput sequencing on the Illumina Hiseq platform in a2 × 150bp paired-end sequencing mode to obtain FastQ data.
2. And (3) data analysis: BWA software (http:// bio-bw. source form. net /) can accurately and rapidly align short sequences to a reference genome, and the software is continuously updated to indicate that a document is complete, so that the method is currently the alignment method of sequencing data recommended by the Broad center. And (3) comparing the raw data of each sample with a reference genome by using a mem algorithm of BWA to obtain a primary comparison result in a BAM format. The alignment results of each sample are counted by Picard software (https:// branched. githu. io/Picard /), including the number and ratio of sequences aligned, the ratio of redundant sequences (Duplicate Reads) caused by PCR amplification during sample preparation and capture experiments, the ratio of Q20 and Q30 sequences, average coverage depth, the coverage ratio of the target region from 1X to 100X, and the like. Generally, when the base coverage depth of a site is 10X or more, the SNV (single nucleotide variation) detected at the site is considered to be relatively reliable.
3. And (3) correcting the data comparison result: the primary alignment result obtained by correcting BWA software by adopting a GATK standard process is mainly corrected aiming at a repetitive sequence caused by PCR amplification, error alignment generated by InDel and base quality. Thus, the processed comparison result can more accurately identify SNV and InDel, and the false positive a and the false negative b generated in the sequencing and comparison processes are greatly reduced. a: the sources of false positives include (1) high sequencing error rate, which is likely to cause erroneous judgment when the sequencing depth is low; (2) the randomness of sequencing causes the detected times of two alleles to deviate from 1:1, and causes errors in genotype interpretation; (3) false positive sites are easily caused when repeated sequences, STR and the like exist; (4) the presence of homologous sequences results in partial reads being mis-spliced. b: sources of false negatives include (1) imbalances in sequencing, especially at lower sequencing depths, where partial regions fail to be covered; (2) influence of GC content.
4. Screening the sequencing data: SNV were filtered according to the following criteria: 1) population database frequency filtering. 1000genome < ═ 0.01, ExAC03_ EAS < ═ 0.01, gnomAD _ exosome _ EAS < ═ 0.01, geneskyexodb _ Freq < > 0.05 (sky hao 300 exon control database). 2) And (4) filtering functional mutation. The mutations in the exon or splicing region are retained, and nonsynonymous SNV type mutations such as missense mutation and frameshift mutation are retained. 3) The reported pathogenic mutations remain, if the mutation is DM (Disease-using mutations) at HGMD _ site _ class (pathogenicity classification of HGMD database), the mutation remains, when 1), 2 is not considered). And (3) carrying out pathogenicity analysis on the filtered data, carrying out pathogenicity prediction on the data by using SIFT, POLYPHen V2, MutationTaster, Cadd Raw and the like, determining the pathogenicity of the data, carrying out Sanger sequencing verification on the proband and family members of the proband and evaluating by combining clinical symptoms to determine a pathogenic gene and determine the carrying condition of a mutant gene of the family members.
Example 2
In this embodiment, for 15 suspected patients with hereditary bone diseases, the screening of candidate gene mutations related to problematic hereditary bone diseases is implemented by the following steps:
step one, customizing a target capture sequencing kit aiming at the 322 genetic bone disease related pathogenic genes, wherein the kit comprises a probe of the 322 genes, and the gene sequencing kit can detect more than 400 known genetic bone diseases;
step two, establishing a high-quality library for sequencing analysis, and specifically comprising the following steps: (1) taking genome DNA for ultrasonic fragmentation, adding exonuclease of T4 DNA polymerase and phosphokinase to repair the tail end, adding single adenylic acid at the tail end of DNA3' to prevent the tail end of the DNA fragments from self-connecting, and connecting an illumina sequencing joint to the two ends of the library DNA by using T4 DNA ligase. Fragments of different sizes were separated by electrophoresis, and excess sequencing adapters and adapter self-ligated products were removed. (2) The original library was amplified with high fidelity enzyme, hybridized with probes using the Agilent sureselectXT custom Kit, and the region of interest captured. By usingMyOneTMStreptavidin T1 magnetic bead enrichment probe, after hybridization, non-specifically bound DNA fragments are discarded, and the original library is amplified again by high fidelity enzyme, so that the total amount of the library is ensured.
Step three, data analysis: BWA software (http:// bio-bw. source form. net /) can accurately and rapidly align short sequences to a reference genome, and the software is continuously updated to indicate that a document is complete, so that the method is currently the alignment method of sequencing data recommended by the Broad center. And (3) comparing the original data of each sample with the reference genome by using a mem algorithm of BWA to obtain a primary comparison result of the BAM format file. The alignment results for each sample, including the number and ratio of sequences aligned, the ratio of redundant sequences (Duplicate Reads) resulting from PCR amplification during sample preparation and capture experiments, the ratio of Q20 and Q30 sequences, the average coverage depth, the ratio of coverage of the target region 1X-100X, etc., were counted by Picard software (https:// branched amplification. Generally, when the base coverage depth of a site is 10X or more, the SNV detected at the site is considered to be relatively reliable.
TABLE 215 statistical table of sequencing data quality for patients suspected of having hereditary bone diseases
Whether the sequencing results are balanced or not is evaluated according to the base mass distribution (shown in figure 1) of the raw data of each sample, and samples with larger differences are rejected.
Step four, a mature data analysis system: the preliminary alignment results obtained using GATK standard protocol to calibrate BWA software were corrected primarily for repeat sequences resulting from PCR amplification, incorrect alignments by InDel, and base quality, with SNV/InDel numbers for 15 patient samples as shown in table 3 below.
TABLE 315 SNV/InDel number of patient samples
And step five, comparing and analyzing all SNV/InDel sites with known information such as a newly released group database, a function database, a disease database and the like, and evaluating the variation frequency, the function characteristics, the conservation, the pathogenicity and the like of the SNV/InDel sites. Therefore, the SNV/InDel locus with the most biological significance can be quickly found from a large amount of variation information, the analysis range is quickly reduced to hundreds or even tens, and the research efficiency is greatly improved on the premise of ensuring the research accuracy. 8-20 candidate disease-causing genes were initially selected for each sample (as shown in Table 4 below).
TABLE 415 candidate disease genes after preliminary screening of patient chip capture data
And step six, screening the candidate genes from the aspects of genetic mode, family cosegregation, clinical phenotype and the like to finally obtain more accurate diagnosis results (as shown in the following table 5).
Table 515 patients screened specific information of pathogenic gene
The embodiments of the present invention have been described in detail, but the embodiments are merely examples, and the present invention is not limited to the embodiments described above. Any equivalent modifications and substitutions to those skilled in the art are also within the scope of the present invention. Accordingly, equivalent changes and modifications made without departing from the spirit and scope of the present invention should be covered by the present invention.
Claims (10)
1. A gene composition for screening refractory hereditary bone diseases, which is characterized by comprising the following 322 genes:
ACAN, ACP5, ACVR1, ADAMTS10, ADAMTSL 10, AGA, AGPS, AKT 10, ALPL, ALX 10, AMER 10, ANKH, ANO 10, ANTXR 10, ARHGAP 10, ARSB, ATP6V0A 10, B3GALT 10, B4GALT 10, BHA LH72, BMP 10, BMPER, BMPR1 10, CA 10, CANT 10, CASR, CC2D 10, CCDC 10, CDC 10, CDH 10, CDKN1 10, CDT 10, CEP290, CHST 10, CHSY 72, CK3672, CACKAP 210, GRECD 10, CAFFC 10, CDC 10, CDCOGN 10, CDC 10, CAFFC 10, HES7, HGSNAT, HOXA11, HOXA13, HOXD13, HPGD, HSPG2, ICK, IDH1, IDH2, IDS, IDUA, IFITM5, IFT122, IFT140, IFT172, IFT43, IFT80, IHH, IKBKG, IL1 80, IMPAD 80, INPPL 80, KAT6 80, KIF 80, LBR, LEMD 80, LEPRE 80, LIFR, LMNA 80, LMNA 1 LMX 80, LONP 80, LPIN 80, LRP 80, LTBP 80, MAFB 80, MAN2B 80, MAN2C 80, MANBA, MATN 80, MENFN 80, MENFR 6863672, HO 80, HONP 72, PHNFP 80, NFP 80, NFR 80, NFP 80, NFR 80, NFP 36NRNPNFP 80, 36NRNPNFP 80, 36NRPSN 80, 36NRNPNFP 80, 363672, 80, 36363672, 80, 36NRNPNFP 36P 80, 36P 80, 363636P 36P 80, 36P 80, 36P 80, 36P 80, 363672, 80, 36P 80, 36P 80, 36P 80, 36P 80, 36P 80, 36P 80, 36P 80, 36P 80, 36P 80, 36P 80, 36P 80, 36P 80, 36P 80, 36P 80, 36P 80, 36P 80, 36P 80, 36P 80, 36P 80, 36P, SKI, SLC17A, SLC26A, SLC29A, SLC34A, SLC35D, SLC39A, SLCO2A, SLCO5A, SMAD, SMARCAL, SMC1, SMC, SNRPB, SNX, SOST, SOX, SP, SULF, SUMF, TBCE, TBX, TBXAS, TCF, TCIRG, TCOF, TCTN, TGDS, TGFB, TGFBR, THPO, TMZEM 216, TMEM38, TMEM, TNFRSF11, TNFR, TP, TRAPPC, TREM, TRIP, TRPS, TRPV, TTC21, TWIST, TYROBP, WDR, WISP, WNT10, WNT, WNSF 5, WNT, YZT, XZT, XLT and XLT.
2. Use of the genomic composition according to claim 1 for the preparation of a kit for screening for conditions associated with refractory hereditary bone diseases.
3. A gene chip for screening refractory hereditary bone diseases, which comprises a probe against the gene composition of claim 1.
4. A targeted capture sequencing kit comprising probes directed against the genetic composition of claim 1.
5. A kit for screening for problematic hereditary bone disease, comprising a probe for the gene composition of claim 1 or the gene chip of claim 3.
6. The kit of claim 5, further comprising instructions to:
a. creating a DNA library comprising said genetic composition;
quality detection and high-throughput sequencing of the DNA library to obtain FastQ data; and
c. the FastQ data was analyzed.
7. The kit according to claim 5, wherein the instructions a comprise in particular the steps of:
(1) detecting the quality of the sample DNA;
(2) sorting, purifying and fragmenting DNA;
(3) carrying out end repairing reaction;
(4) adding single adenylic acid at the 3' end;
(5) connecting a sequencing joint;
(6) screening gene library fragments;
(7) PCR amplifying the gene library;
(8) hybridizing the target area chips;
(9) cleaning and purifying the hybridization library; and
(10) the hybridization library was amplified by PCR.
8. The kit according to claim 5, characterized in that said instructions c comprise in particular the steps of:
(1) obtaining a preliminary comparison result by using BWA software; and
(2) and counting the primary alignment result by using Picard software, wherein the counted values comprise the number and the proportion of the sequences in alignment, the proportion of redundant sequences caused by PCR amplification in the sample preparation and capture experiment processes, the proportion of sequences Q20 and Q30, the average coverage depth and the coverage proportion of a target area from 1X to 100X.
9. The kit of claim 8, further comprising instructions to:
d. correcting the preliminary comparison result: the method adopts a GATK standard flow to correct the repeated sequence caused by PCR amplification, error comparison generated by InDel and base quality, accurately identifies SNV and InDel, and greatly reduces false positive and false negative generated in the sequencing and comparison processes;
e. SNV filtering is carried out on the high-throughput sequencing data, and then pathogenicity analysis is carried out to evaluate by combining clinical symptoms, so that a pathogenic gene is determined.
10. The kit of claim 9, wherein the pathogenicity analysis employs software or databases comprising SIFT, polyphren V2, mutationmaster, and Cadd Raw.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110255299.5A CN113308527A (en) | 2021-03-09 | 2021-03-09 | Gene composition, chip and kit for screening refractory hereditary bone diseases |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110255299.5A CN113308527A (en) | 2021-03-09 | 2021-03-09 | Gene composition, chip and kit for screening refractory hereditary bone diseases |
Publications (1)
Publication Number | Publication Date |
---|---|
CN113308527A true CN113308527A (en) | 2021-08-27 |
Family
ID=77371877
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202110255299.5A Pending CN113308527A (en) | 2021-03-09 | 2021-03-09 | Gene composition, chip and kit for screening refractory hereditary bone diseases |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN113308527A (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN117025747A (en) * | 2023-05-16 | 2023-11-10 | 山东大学齐鲁医院 | Application of reagent for detecting SLC34A3 gene variation in sample in preparation of product for screening low-blood-phosphorus rickets/osteomalacia patients |
Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2004018642A2 (en) * | 2002-08-24 | 2004-03-04 | Creighton University | Methods for detecting genes associated with bone properties and/or height |
US20060030541A1 (en) * | 2002-04-05 | 2006-02-09 | Proskelia | Genes involved in osteogenesis, and methods of use |
WO2011149352A1 (en) * | 2010-05-28 | 2011-12-01 | Universiteit Twente | Osteoarthrosis markers |
CN103993096A (en) * | 2014-06-09 | 2014-08-20 | 中国医学科学院血液病医院(血液学研究所) | Kit for diagnosing congenital bone marrow failure diseases |
CN105586389A (en) * | 2014-10-21 | 2016-05-18 | 天津华大基因科技有限公司 | Kit and application thereof in detection on hereditary bone disease genes |
CN108570496A (en) * | 2017-03-07 | 2018-09-25 | 上海交通大学医学院附属上海儿童医学中心 | A kind of molecular diagnosis method and kit of constitutional bone disease |
CN109897894A (en) * | 2018-12-27 | 2019-06-18 | 黄欢 | A kind of pathogenic mutation and its detection reagent of the infull disease of Osteogenic developmental |
CN110739073A (en) * | 2019-10-18 | 2020-01-31 | 中国医学科学院北京协和医院 | Computer intelligent diagnosis system for osteogenesis imperfecta |
-
2021
- 2021-03-09 CN CN202110255299.5A patent/CN113308527A/en active Pending
Patent Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20060030541A1 (en) * | 2002-04-05 | 2006-02-09 | Proskelia | Genes involved in osteogenesis, and methods of use |
WO2004018642A2 (en) * | 2002-08-24 | 2004-03-04 | Creighton University | Methods for detecting genes associated with bone properties and/or height |
WO2011149352A1 (en) * | 2010-05-28 | 2011-12-01 | Universiteit Twente | Osteoarthrosis markers |
CN103993096A (en) * | 2014-06-09 | 2014-08-20 | 中国医学科学院血液病医院(血液学研究所) | Kit for diagnosing congenital bone marrow failure diseases |
CN105586389A (en) * | 2014-10-21 | 2016-05-18 | 天津华大基因科技有限公司 | Kit and application thereof in detection on hereditary bone disease genes |
CN108570496A (en) * | 2017-03-07 | 2018-09-25 | 上海交通大学医学院附属上海儿童医学中心 | A kind of molecular diagnosis method and kit of constitutional bone disease |
CN109897894A (en) * | 2018-12-27 | 2019-06-18 | 黄欢 | A kind of pathogenic mutation and its detection reagent of the infull disease of Osteogenic developmental |
CN110739073A (en) * | 2019-10-18 | 2020-01-31 | 中国医学科学院北京协和医院 | Computer intelligent diagnosis system for osteogenesis imperfecta |
Non-Patent Citations (4)
Title |
---|
GEERT R. MORTIER等: "Nosology and classification of genetic skeletal disorders: 2019 revision", AM J MED GENET, vol. 179, no. 12, pages 2394 * |
JUN-SEOK BAE等: "Comprehensive genetic exploration of skeletal dysplasia using targeted exome sequencing", GENETICS IN MEDICINE, vol. 18, no. 6, pages 2 - 3 * |
夏欣一等: "遗传性骨病的分子遗传学研究进展", 中国优生与遗传杂志, vol. 17, no. 12, pages 122 * |
张治国等: "骨质疏松症遗传相关基因的生物信息学研究", 中国骨质疏松杂志, vol. 17, no. 04, pages 278 * |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN117025747A (en) * | 2023-05-16 | 2023-11-10 | 山东大学齐鲁医院 | Application of reagent for detecting SLC34A3 gene variation in sample in preparation of product for screening low-blood-phosphorus rickets/osteomalacia patients |
CN117025747B (en) * | 2023-05-16 | 2024-04-09 | 山东大学齐鲁医院 | Application of reagent for detecting SLC34A3 gene variation in sample in preparation of product for screening low-blood-phosphorus rickets/osteomalacia patients |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN103874767B (en) | Presumptive area in sample of nucleic acid is carried out the method and system of gene type | |
KR102339760B1 (en) | Diagnosing fetal chromosomal aneuploidy using massively parallel genomic sequencing | |
Macken et al. | Applying genomic and transcriptomic advances to mitochondrial medicine | |
CN105648045B (en) | The method and apparatus for determining fetus target area haplotype | |
US20230002821A1 (en) | High-throughput detection method for rare mutation of gene | |
CN110029158B (en) | Marfan syndrome detection panel and application thereof | |
US20150344977A1 (en) | Method And System For Detection Of An Organism | |
CN105555970B (en) | Method and system for simultaneous haplotyping and chromosomal aneuploidy detection | |
CN105331606A (en) | Nucleic acid molecule quantification method applied to high-throughput sequencing | |
EP3564391A1 (en) | Method, device and kit for detecting fetal genetic mutation | |
US20210024999A1 (en) | Method of identifying risk for autism | |
CN110229897A (en) | MED12 gene mutation detection kit and its application | |
EP3797418B1 (en) | Method for determining the probability of the risk of chromosomal and genetic disorders from free dna of fetal origin | |
CN114317728B (en) | Primer group, kit, method and system for detecting multiple mutations in SMA | |
CN108060227A (en) | A kind of amplimer, kit and its detection method for detecting PAH gene mutations | |
CN113564266B (en) | SNP typing genetic marker combination, detection kit and application | |
CN113308527A (en) | Gene composition, chip and kit for screening refractory hereditary bone diseases | |
CN108728515A (en) | A kind of analysis method of library construction and sequencing data using the detection ctDNA low frequencies mutation of duplex methods | |
CN116246704B (en) | System for noninvasive prenatal detection of fetuses | |
CN114717303B (en) | Primer group, kit and application for detecting osteogenesis imperfecta related genes based on multiplex PCR and high throughput sequencing technology | |
US20190309345A1 (en) | Clinical application of cell free dna technologies to non-invasive prenatal diagnosis and other liquid biopsies | |
CN112662754B (en) | Methods of using compositions for predicting the probability of occurrence of small ear deformities | |
CN110993024B (en) | Method and device for establishing fetal concentration correction model and method and device for quantifying fetal concentration | |
CN111218514A (en) | ABO gene full-length sequence determination method based on NGS technology and reagent pack | |
Qian et al. | Noninvasive prenatal screening for common fetal aneuploidies using single-molecule sequencing |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |