WO2019206176A1 - 结核分枝杆菌H37Rv编码基因及其应用 - Google Patents

结核分枝杆菌H37Rv编码基因及其应用 Download PDF

Info

Publication number
WO2019206176A1
WO2019206176A1 PCT/CN2019/084047 CN2019084047W WO2019206176A1 WO 2019206176 A1 WO2019206176 A1 WO 2019206176A1 CN 2019084047 W CN2019084047 W CN 2019084047W WO 2019206176 A1 WO2019206176 A1 WO 2019206176A1
Authority
WO
WIPO (PCT)
Prior art keywords
mycobacterium tuberculosis
gene
rv1796c
sequence
encoding gene
Prior art date
Application number
PCT/CN2019/084047
Other languages
English (en)
French (fr)
Inventor
张瑶
徐平
武舒佳
孙金帅
王富强
何崔同
常蕾
Original Assignee
北京蛋白质组研究中心
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 北京蛋白质组研究中心 filed Critical 北京蛋白质组研究中心
Publication of WO2019206176A1 publication Critical patent/WO2019206176A1/zh

Links

Images

Classifications

    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/195Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria
    • C07K14/35Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria from Mycobacteriaceae (F)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N1/00Microorganisms, e.g. protozoa; Compositions thereof; Processes of propagating, maintaining or preserving microorganisms or compositions thereof; Processes of preparing or isolating a composition containing a microorganism; Culture media therefor
    • C12N1/20Bacteria; Culture media therefor
    • C12N1/205Bacterial isolates
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/11DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/02Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving viable microorganisms
    • C12Q1/04Determining presence or kind of microorganism; Use of selective media for testing antibiotics or bacteriocides; Compositions containing a chemical indicator therefor
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6844Nucleic acid amplification reactions
    • C12Q1/686Polymerase chain reaction [PCR]
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6876Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
    • C12Q1/6888Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for detection or identification of organisms
    • C12Q1/689Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for detection or identification of organisms for bacteria
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q2600/00Oligonucleotides characterized by their use
    • C12Q2600/166Oligonucleotides used as internal standards, controls or normalisation probes
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12RINDEXING SCHEME ASSOCIATED WITH SUBCLASSES C12C - C12Q, RELATING TO MICROORGANISMS
    • C12R2001/00Microorganisms ; Processes using microorganisms
    • C12R2001/01Bacteria or Actinomycetales ; using bacteria or Actinomycetales
    • C12R2001/32Mycobacterium

Definitions

  • the invention relates to the field of gene detection, in particular to the identification of pathogenic species.
  • Mycobacterium tuberculosis is a pathogen that causes tuberculosis in humans. It can invade all organs of the body, but tuberculosis is the most common. Tuberculosis is an infectious disease that is extremely important to date and poses a serious threat to human health. According to the WHO, about 8 million new cases occur each year, and at least 3 million people die from the disease. The clinical strain of MTB is difficult to culture, grows slowly, can be cross-infected with other mycobacteria, and the symptoms of tuberculosis and other respiratory infections are difficult to distinguish, which brings great difficulties to clinical rapid diagnosis and treatment. Therefore, rapid, accurate, specific, sensitive and inexpensive methods for detecting tuberculosis are established. Many of them are necessary prerequisites for effective treatment and control of the spread of tuberculosis, and are also new challenges and new tasks for the detection of mycobacteria in clinical laboratories.
  • Mycobacterium tuberculosis complex including M. tuberculosis, M. africanum, M. orgis, M. bovis, M. microti, M. canettii, M. caprae, M. pinnipedii, M. suricattae Mycobacteria, such as M.mungi, which cause tuberculosis in humans and other living organisms.
  • identification methods of MTBC at home and abroad are mainly divided into the following three categories: traditional isolation and culture methods; molecular level detection (IS6110, restriction fragment length polymorphism analysis, multi-locus variable number repeat fragment polymorphism analysis, etc.); Chromatographic analysis of bacterial components (fatty acids, mycolic acids).
  • MTB H37Rv completed genome-wide sequencing in 1998 and was the first MTB strain to complete full-sequence sequencing. Since then, researchers in various countries have been perfecting and supplementing the H37Rv gene annotation database based on algorithms such as algorithm optimization, annotation software update, transcriptomics and proteomics. However, since MTB belongs to prokaryotes, due to the deficiency of prokaryotic genome annotation technology, there may be annotation errors in genome annotation (over-annotation, gene boundary error and ORF initiation, termination site error, alternative splicing, ribosome). Displacement and leakage of annotations have brought problems to the in-depth and accurate analysis of biological mechanisms.
  • Protein Genomics Proteogenomics
  • H37Rv annotated genes
  • the traditional M. tuberculosis complex (MTBC) identification strategy has the defects of long cycle, cumbersome steps, low specificity and sensitivity.
  • MTBC M. tuberculosis complex
  • An object of the present invention is to provide a novel coding gene for Mycobacterium tuberculosis H37Rv, which is a H37Rv leak-annotation coding gene Rv1796c (-
  • Other objects of the present invention include providing a specific PCR primer that can be used to amplify the above-described coding gene and providing a detection method for detecting or identifying the presence or absence of a mycobacterial complex in a sample; the present invention also provides a gene associated with the above-described coding gene. Detection kit and application of the above genes.
  • a protein coding sequence in H37Rv that is difficult to be discovered by gene prediction software is found, which is effective in distinguishing MTBC from other species belonging to the same genus.
  • the gene is a missing annotation gene of Mycobacterium tuberculosis H37Rv, which is Rv1796c (-
  • Rv1796c -
  • NCBI-BLASTP no sequence is aligned in the database, which is a functionally unknown protein.
  • the comparative genomics study found that the gene sequence can distinguish the Mycobacterium tuberculosis complex (MTBC) strain from other species of Mycobacterium.
  • MTBC Mycobacterium tuberculosis complex
  • ) gene of MTBC is designed, which is a primer proposed in the present invention, and the primer sequence is:
  • MTBC can be quickly and accurately identified.
  • the present invention specifically establishes a method for detecting or identifying a Mycobacterium tuberculosis complex based on the above novel standard coding gene of M. tuberculosis H37Rv, the steps of which are as follows:
  • R 5'-GCAAGATAGCCCCGATCGACAACCC-3' (SEQ ID NO. 5).
  • step (2) (3) performing gel electrophoresis analysis or sequencing on the DNA product amplified in step (2);
  • step (3) The result of the step (3) is compared with the barcode gene Rv1796c (-
  • the above detection method according to the principle of DNA barcode, preliminary electrophoresis analysis of the PCR product, if the strain to be tested does not have a target band, indicating that the strain is not MTBC; if there is a band, further sequencing verification, sequencing will be sequenced Homologous comparison and alignment with the standard sequence of R79796 (HZR) of Rv1796c (-
  • HZR standard sequence of R79796
  • Rv1796c -
  • the detection method can be used for the identification of strains of Mycobacterium tuberculosis complex group, and can also be used for clinical rapid test.
  • the sample to be tested may be H37Rv strain, other MTBC, non-tuberculous mycobacteria, common respiratory pathogens, common viral strains of the respiratory tract; or direct use of sputum, saliva or blood of tuberculosis and other respiratory patients.
  • the present invention also provides a detection kit, which is provided with a reagent for detecting a new standard coding gene of Mycobacterium tuberculosis H37Rv, and at the same time, may be provided by a government drug regulatory agency, Information on the manufacture, use and sale of pharmaceuticals or biological products.
  • ) gene in the sample may contain an amplification primer, dNTP, a DNA polymerase for PCR reaction, a buffer thereof, and a digestion reaction. And/or one or more of the reagents required for sequencing the reaction, and the like.
  • the primers may employ the above specific PCR primers
  • the DNA polymerase used for the PCR reaction is an enzyme that can be used for PCR amplification.
  • Detection of the coding genes of the invention can also be provided in an integrated manner, such as a gene chip.
  • the present invention provides a standard gene and molecular identification method for molecular identification of Mycobacterium tuberculosis complex (MTBC), which can effectively distinguish MTBC from other species belonging to the same genus.
  • the identification method of the gene overcomes the shortcomings of the primer design and the poor repeatability of the existing Mycobacterium tuberculosis complex group, and has the characteristics of universal, easy amplification and easy comparison, and can accurately classify the group. It is identified from other mycobacteria or other respiratory infections with close kinship, providing powerful technical means and research tools for tuberculosis epidemiology investigation and rapid diagnosis and identification of clinical tuberculosis patients.
  • Figure 1 Peptide mapping matching evidence supporting the discovery of new coding genes
  • Figure 2 Comparison of the mass spectrum of the synthetic peptide with the original identified peptide mass spectrum
  • Figure 3 Corresponding diagram of the protein sequence encoded by the ORF in the region of the peptide; the underlined portion is a peptide that is proteomically identified and verified by the synthetic peptide;
  • Figure 5 BLASTP results of the protein sequence corresponding to the H37Rv strain Rv1796c (-
  • FIG. 6 Agarose gel electrophoresis results of PCR amplification products of Rv1796c (-
  • Figure 7 Comparison of PCR amplification sequencing results and standard sequences of Rv1796c (-
  • Example 1 Search for the leak-annotated coding gene of the H37Rv strain genome
  • the deep coverage of the proteome was performed on the H37Rv strain using the high-coverage proteome technique.
  • the genome was annotated with the pFind 3 engine for gene validation.
  • the pAnno software In order to discover the new protein coding region, we used the pAnno software to perform a six-frame database translation of H37Rv's genome-wide (NC_000962.3) file published by NCBI using pAnno software, and used this database to perform new peptides on mass spectrometry data. Identification of segments and new proteins.
  • we used three filtering methods for estimating the class FDR separately for the annotated peptide and the new peptide in the data filtering process namely S-FDR, T-FDR I and T-FDR II.
  • the gene theory encodes the amino acid sequence of the product as shown in SEQ ID NO. 3:
  • NCBI-BLASTP analysis was performed on the amino acid sequence of the product encoded by the theoretical gene shown in SEQ ID NO. 3, and only one M. xenopi 4042 serine protease domain protein (EUA42684.1) with a low homology was aligned in the database. The sequence coverage is 36% and the similarity is 70%. (See Figure 5). This indicated that the Rv1796c (-
  • Example 2 Establishing a method for identifying MTBC complexes
  • PCR primers were designed using Oligo7.0.
  • the primer sequences are as follows:
  • the DNA fragment was amplified, subjected to a polymerase chain reaction (PCR) reaction, and amplified using the above F/R primer.
  • PCR polymerase chain reaction
  • the PCR system (25 ⁇ L) was dd H 2 O (9.5 ⁇ L), 2XTaq PCR MasterMix (TIANGEN, 12.5 ⁇ L), primer F (10 ⁇ M, 1 ⁇ L), primer R (10 ⁇ M, 1 ⁇ L), DNA template (1 ⁇ L);
  • the amplification procedure pre-denaturation at 94 ° C for 3 min, denaturation at 94 ° C for 30 s, annealing at 58 ° C for 30 s, extension at 72 ° C for 1 min, 35 cycles, and extension at 72 ° C for 5 min.
  • the amplified product was electrophoresed and detected by electrophoresis in agarose gel and 1 ⁇ TBE electrophoresis solution.
  • the MTBC and the positive control group showed an amplification band at 118 bp, and the amplification result was in agreement with the expectation, and the specificity was 100%.

Landscapes

  • Chemical & Material Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Organic Chemistry (AREA)
  • Health & Medical Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Wood Science & Technology (AREA)
  • Zoology (AREA)
  • Genetics & Genomics (AREA)
  • Molecular Biology (AREA)
  • Biotechnology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Analytical Chemistry (AREA)
  • General Health & Medical Sciences (AREA)
  • Biochemistry (AREA)
  • Biophysics (AREA)
  • General Engineering & Computer Science (AREA)
  • Microbiology (AREA)
  • Physics & Mathematics (AREA)
  • Immunology (AREA)
  • Biomedical Technology (AREA)
  • Chemical Kinetics & Catalysis (AREA)
  • Medicinal Chemistry (AREA)
  • Gastroenterology & Hepatology (AREA)
  • Virology (AREA)
  • Toxicology (AREA)
  • Tropical Medicine & Parasitology (AREA)
  • Plant Pathology (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)

Abstract

提供了一种结核分枝杆菌H37Rv编码基因及其应用,具体为新的漏注释编码基因Rv1796c(-|2034439-2034570|),其可用作结核分枝杆菌复合群分子鉴定的标准基因,用于结核分枝杆菌复合群的分子鉴定及临床检测

Description

结核分枝杆菌H37Rv编码基因及其应用 技术领域
本发明涉及基因检测领域,具体涉及对病原菌物种进行鉴定。
背景技术
结核分枝杆菌(Mycobacterium tuberculosis,MTB)是引起人类结核病的病原菌。可入额侵全身各器官,但以肺结核为最多见。结核病是至今极为重要的传染病,严重威胁人类生命健康。据WHO报道,每年约有800万新病例发生,至少有300万人死于该病。MTB的临床菌株难培养、生长缓慢、与其它分枝杆菌能交叉感染、结核病与其它呼吸道感染症状难区分等特征,给临床快速诊断和治疗带来了极大的困难。故建立快速、准确、特异、敏感、廉价的结核病检测方法,许多是有效治疗、控制结核病蔓延的必要前提,也是临床实验室分枝杆菌检测面临的新挑战和新任务。
结核分枝杆菌复合群(Mycobacterium tuberculosis complex,MTBC),包括M.tuberculosis、M.africanum、M.orygis、M.bovis、M.microti、M.canettii、M.caprae、M.pinnipedii、M.suricattae、M.mungi等分枝杆菌类群,这些物种均会引起人和其它生命体结核病。目前国内外对MTBC的鉴定方法主要分为以下三类:传统分离培养法;分子水平检测(IS6110、限制性片段长度多态性分析、多位点可变数目重复片段多态性分析等);菌体成分(脂肪酸、分枝菌酸)色谱分析方法。三类方法虽都有各自的优点,但也有不足之处,如传统分离培养周期长和菌体可培养率低;目前分子水平检测在特异性、灵敏性和简便性方面尚差些;菌体成分特性分析成本较高、操作复杂。
MTB H37Rv于1998年完成全基因组测序,是最早完成全图测序的MTB菌株。自此,各国研究者们基于算法优化、注释软件更新、转录组学和蛋白质组学等策略,一直在完善、补充H37Rv基因注释数据库。然而,由于MTB属于原核生物,由于原核生物基因组注释技术本身的不足,在基因组注释中尚可能存在注释错误(过度注释、基因边界错误和ORF起始、终止位点错误、可变剪接、核糖体移位、漏注释),给深入、准确解析生物学机制带来了困扰。为解决此难题,蛋白质基因组学(Proteogenomics)虽已被用于H37Rv已注释基因的校 正,然而,高比例假阳性、常规技术难以进行注释基因预测、新基因验证、新基因功能分析及其应用等是该领域所面临的难题。
总的来说,传统结核分枝杆菌复合群(MTBC)鉴定策略具有周期长、步骤繁琐、特异性和灵敏度不高等缺陷。为进一步完善对H37Rv全基因组重新注释,发现H37Rv中遗漏注释基因,确保H37Rv全基因组遗漏注释基因及其在MTBC分子鉴定中的应用技术得到有效保护,开发利用H37Rv新基因在MTBC类群中快速精准鉴别的方法势在必行。
发明内容
本发明的一个目的是提供一种结核分枝杆菌H37Rv新的编码基因,该基因为H37Rv漏注释编码基因Rv1796c(-|2034439-2034570|),其可用作结核分枝杆菌复合群条形码分子标记,用于检测结核分枝杆菌复合群,其序列如SEQ ID NO.1所示。
本发明的其他目的包括提供可用于扩增上述编码基因的特异性PCR引物以及提供一种检测或鉴定样品中是否存在结合分枝杆菌复合群的检测方法;本发明还提供与上述编码基因相关的检测试剂盒和上述基因的应用。
根据本发明的一个方面,通过比较蛋白质基因组学研究技术,发现了H37Rv中一个难以被基因预测软件发现的蛋白编码序列,该基因能有效地将MTBC与同属的其它物种区分开来。该基因是一个结核分枝杆菌(Mycobacterium tuberculosis H37Rv)的遗漏注释基因,即Rv1796c(-|2034439-2034570|),经NCBI-BLASTP后,数据库中没有比对到任何序列,属于功能未知蛋白。经比较基因组学研究发现该基因序列能将结核分枝杆菌复合群(MTBC)菌株与分枝杆菌属的其它种鉴别开来。
具体地,设计能对MTBC的Rv1796c(-|2034439-2034570|)基因实现特异性扩增引物,即为本发明所提出的引物,引物序列为:
F:5’-ATGGAGATTATCTCGACACCGGGGG-3’;
R:5’-GCAAGATAGCCCCGATCGACAACCC-3’。
根据待测样品中的该基因DNA序列PCR产物的有无或DNA序列的差异,可以快速准确鉴定MTBC。
根据本发明的另一个方面,基于上述结核分枝杆菌H37Rv新的标准编码基 因,本发明具体地建立了检测或鉴定结核分枝杆菌复合群的方法,步骤如下:
(1)从待测样品中分离提取基因组DNA;
(2)以步骤(1)获得的DNA为模板,采用下述引物进行PCR扩增:
F:5’-ATGGAGATTATCTCGACACCGGGGG-3’(SEQ ID NO.4);
R:5’-GCAAGATAGCCCCGATCGACAACCC-3’(SEQ ID NO.5)。
(3)对步骤(2)扩增得到的DNA产物进行凝胶电泳分析或进行测序;
(4)将步骤(3)的结果与条形码基因Rv1796c(-|2034439-2034570|)进行比对,如果同源性大于99%,判定待测样品含有结核分枝杆菌复合群。
进一步地,上述检测方法,根据DNA条形码原理,初步对PCR产物进行电泳分析,如果待测菌株没有目标条带,说明该菌株不是MTBC;如果有条带,则可进一步测序验证,将测序得到序列与H37Rv的Rv1796c(-|2034439-2034570|)的标准序列进行同源比较和比对,获得序列间的相似性,若序列同源性大于99%,即可判定菌株可能为MTBC;根据待鉴定鉴定菌株的DNA条形码序列与标准序列聚类情况来区分MTBC家族与非结核分枝杆菌、呼吸道常见病原菌及呼吸道常见病毒。
该检测方法既可用于对结核分枝杆菌复合群的菌种鉴定研究,也可用于临床快速检验。待测样品可以是H37Rv菌株、其它MTBC、非结核分枝杆菌、呼吸道常见病原菌、呼吸道常见病毒菌株;或者直接使用结核病和其它呼吸道患者痰液、唾液或者血液。
在上述方法的基础上,本发明也提供检测试剂盒,试剂盒容器内装有用以检测结核分枝杆菌H37Rv新的标准编码基因的试剂,与之同时提供的可以是经政府药物管理机构审核的、有关药品或生物制品的制造、使用及销售信息。例如,采用PCR扩增后,直接检测样品中Rv1796c(-|2034439-2034570|)基因的试剂,例如可含有扩增引物、dNTP、用于PCR反应的DNA聚合酶及其缓冲液、酶切反应和/或测序反应所需试剂等的一种或多种。本领域技术人员已知,以上组分仅是示意性的,例如,所述引物可以采用上述特异性PCR引物,所述的用于PCR反应的DNA聚合酶是能够用于PCR扩增的酶。本发明的编码基因的检测也可以以集成的例如基因芯片的方式提供。
有益效果:本发明提供了一种用作结核分枝杆菌复合群(Mycobacterium tuberculosis complex,MTBC)分子鉴定的标准基因及分子鉴定方法,该基因能有 效地将MTBC与同属的其它物种区分开来,应用该基因的鉴定方法克服了现有结核分枝杆菌复合群鉴定过程中的引物设计多重性、结果重复性差等缺点,具有通用、易扩增、易比对的特点,可以准确地将该类群从亲缘关系很近的其它分枝杆菌或其它呼吸道感染病菌中鉴定出来,为结核流行病学调查及临床结核病患者快速诊断、鉴别提供有力的技术手段和研究工具。
附图说明
图1:支持发现的新编码基因的肽谱匹配证据;
图2:合成肽段质谱图与原鉴定肽段质谱图对比;
图3:肽段坐落区域ORF编码的蛋白质序列对应图;下划线部分为蛋白质组学鉴定并被合成肽段验证的肽段;
图4:Rv1796c(-|2034439-2034570|)标准基因序列同源性比较;
图5:H37Rv菌株Rv1796c(-|2034439-2034570|)基因所对应的蛋白序列BLASTP结果;
图6:Rv1796c(-|2034439-2034570|)特异引物PCR扩增产物琼脂糖凝胶电泳结果;
其中,各泳道样品具体信息见表1。
图7:Rv1796c(-|2034439-2034570|)基因PCR扩增测序结果和标准序列比较。
具体实施方式
下面结合具体实施方式对本发明做进一步说明,但不限制本发明权利要求范围。本发明所用试剂均为市售。
实施例1:寻找H37Rv菌株基因组的漏注释编码基因
1.1对H37Rv菌株基因组的高覆盖蛋白质组验证
利用高覆盖蛋白质组技术对H37Rv菌株进行了蛋白质组的深度覆盖研究。基于Tuberculosis(20160307)数据库,使用pFind 3引擎对其基因组进行了注释编码基因验证。为了发现新的蛋白编码区,我们基于蛋白质基因组学技术,用pAnno软件对H37Rv在NCBI发表的全基因组(NC_000962.3)文件进行六阅读框数据库翻译,并利用这个数据库对质谱数据进行了新肽段和新蛋白质的 鉴定。为了降低假阳性率,我们在数据过滤的过程中使用了3种对已注释肽段和新肽段分开估计类别FDR的过滤方法,分别是S-FDR,T-FDR I和T-FDR II。
经数据分析,我们共鉴定到3238个H37Rv已注释基因,覆盖度高达该菌株的80%以上,这是至今报道最大的H37Rv蛋白质谱数据。此外,经3种FDR≤1过滤后,我们获得新肽段。为了进一步确保新肽段质量,我们对上述过滤剩余的新肽段所对应的谱图进行了谱图质量筛选,最终保留了一些谱图质量好的肽段。为进一步排查这些谱图质量较高的肽段并非由于已注释肽段发生单个氨基酸突变所致,我们进行了氨基酸突变核查,确保这些新肽段为H37Rv新鉴定肽段。
1.2对Rv1796c(-|2034439-2034570|)基因的编码蛋白和数据库验证
经过高覆盖蛋白质组验证后,我们发现一些疑似的新的漏注释肽段,对上述高可信得疑似新肽段进行肽段合成验证,据新肽段原始谱和肽段合成谱相似度打分≥0.9作为相似度阈值,经打分筛选后,有1条肽段通过验证,对应于新开放阅读框(Open Reading Frame,ORF),即目前的H37Rv菌株的潜在漏注释基因。
其中,我们发现新的漏注释基因Rv1796c(-|2034439-2034570|),经BLASTP比较,仅比对到M.xenopi 4042同源性很低的一个丝氨酸蛋白酶结构域蛋白(EUA42684.1),序列覆盖度36%,相似性70%,Rv1796c可能与其功能相近。我们检测到肽段GLSIGAILR(SEQ ID NO.6),且对应于新基因Rv1796c(-|2034439-2034570|),如图1所示,谱图质量很好,b/y离子连续匹配,杂峰信号较低,结果很可信。
为进一步确证这个鉴定结果,我们按照我们新鉴定肽段的氨基酸序列化学合成了该肽段,并利用上述的质谱分析条件产生了该合成肽段的二级谱图。
我们对合成肽段产生的高能量碰撞MS 2进行了核实,一级母离子和二级子离子均符合理论值,表明我们合成的肽段序列正确;在此基础上,我们手工检查了根据大规模蛋白质组数据鉴定到的新肽段序列的合成肽段的MS 2和大规模鉴定新肽段谱图,两者几乎完全一致以子离子相似性获得的cosin值为0.99,证明我们从H37Rv中鉴定到的新肽段正确无误。(图2)。
在确认上述漏注释肽段的序列后,根据上述肽段所在的基因位置,以前一个终止密码子和后一个终止密码子包括的区域为界,得到包含上述新漏注释肽 段的开放阅读框(ORF)DNA序列,如SEQ ID NO.2所示。
Figure PCTCN2019084047-appb-000001
该开放阅读框编码与氨基酸序列的对应关系如图3所示。
进一步翻译验证,发现真实的基因序列(SEQ ID NO.1)从上述开放阅读框DNA(SEQ ID NO.2)中的 ATG开始,共132bp,编码43个氨基酸,其理论分子量4.04kDa,即为Rv1796c(-|2034439-2034570|)基因。
Figure PCTCN2019084047-appb-000002
该基因理论编码产物氨基酸序列如SEQ ID NO.3所示:
Figure PCTCN2019084047-appb-000003
对该SEQ ID NO.3所示理论基因编码产物的氨基酸顺序进行NCBI-BLASTP分析,数据库中仅比对到一个同源性很低的M.xenopi 4042丝氨酸蛋白酶结构域蛋白(EUA42684.1),序列覆盖度36%,相似性70%。(见图5)。表明我们检测到的Rv1796c(-|2034439-2034570|)基因产物在H37Rv菌株数据库中被遗漏注释。
我们将该Rv1796c(-|2034439-2034570|)基因的DNA序列进行比较基因组本地BLAST分析,如图5所示,结果表明Rv1796c(-|2034439-2034570|)基因序列属于MTBC家族特异性基因,在其它物种中没有同源性较高的序列,意味着我们在H37Rv菌株中发现的Rv1796c(-|2034439-2034570|)基因序列具有较好的序列特异性,可将MTBC与同属内其它分枝杆菌及其它呼吸道感染细菌区分开。
实施例2:建立鉴定MTBC复合群的方法
(1)设计引物:
基于如SEQ ID NO.1所示的Rv1796c(-|2034439-2034570|)基因的CDS序列,采用Oligo7.0设计了PCR引物,引物序列如下:
F:5’-ATGGAGATTATCTCGACACCGGGGG-3’(SEQ ID NO.4);
R:5’-GCAAGATAGCCCCGATCGACAACCC-3’(SEQ ID NO.5)
上述引物在与Rv1796c(-|2034439-2034570|)基因的位置关系如下所示,其中引物对应位置下标单划线。
Figure PCTCN2019084047-appb-000004
(2)提取包括M.tuberculosis H37Rv在内的待测菌株的总DNA,40株分枝杆菌属标准菌株由中国医学细菌菌种保藏管理中心(CMCC)保藏,其余16株非结核分枝杆菌是中国人民解放军309医院临床分离株,已经完成菌种16S RNA基因测序、比对及NCBI序列提交工作,待测菌株如表1所示:
表1.选用的相关菌株
Figure PCTCN2019084047-appb-000005
Figure PCTCN2019084047-appb-000006
(3)扩增DNA片段,进行聚合酶链式(PCR)反应,采用上述F/R引物进行扩增。
PCR体系(25μL)为dd H 2O(9.5μL)、2XTaq PCR MasterMix(TIANGEN,12.5μL)、引物F(10μM,1μL)、引物R(10μM,1μL)、DNA模板(1μL);
扩增程序:94℃预变性3min、94℃变性30s、58℃退火30s、72℃延伸1min、35个循环,72℃延伸5min。
(4)扩增产物电泳检测,在琼脂糖凝胶、1×TBE电泳液中电泳检测。结果如图6所示,MTBC和阳性对照组在118bp处出现了扩增条带,且扩增结果和 预期相符,特异性为100%。
(5)为了进一步验证扩增的DNA的序列,我们对扩增序列进行了测序并和漏注释序列比较,如图7所示,结果与预期完全相符,序列正确无误,这进一步验证了新漏注释基因的存在。这表明基于Rv1796c(-|2034439-2034570|)基因进行MTBC复合群鉴定的方法真实可靠。

Claims (10)

  1. 一种结核分枝杆菌H37Rv编码基因Rv1796c(-|2034439-2034570|),所述编码基因的核苷酸序列如SEQ ID NO.1所示。
  2. 权利要求1所述的结核分枝杆菌H37Rv编码基因Rv1796c(-|2034439-2034570|),其特征在于所述基因编码如SEQ ID NO.3序列所示的氨基酸。
  3. 一种条形码分子标记,用作检测和/或鉴定结核分枝杆菌复合群,其包含作为标准检测基因的权利要求1所述的结核分枝杆菌H37Rv编码基因Rv1796c(-|2034439-2034570|)。
  4. 一种特异性PCR引物,用于扩增权利要求1所述的结核分枝杆菌H37Rv编码基因Rv1796c(-|2034439-2034570|)。
  5. 如权利要求4所述的PCR引物,其特征在于,所述引物的序列为:
    F:5’-ATGGAGATTATCTCGACACCGGGGG-3’;
    R:5’-GCAAGATAGCCCCGATCGACAACCC-3’。
  6. 一种结核分枝杆菌复合群鉴定的检测方法,包括如下步骤:
    (1)从待测样品中分离提取基因组DNA;
    (2)以步骤(1)获得的DNA为模板,加入扩增引物,进行聚合酶链式反应;
    (3)对步骤(2)扩增得到的DNA产物进行凝胶电泳分析或进行测序;
    (4)将步骤(3)的结果与权利要求1所述的作为标准检测基因的Rv1796c(-|2034439-2034570|)进行比对,根据其同源性判定待测样品中是否存在结核分枝杆菌复合群。
  7. 如权利要求6所述的检测方法,其中步骤(2)中所述的扩增引物序列为:
    F:5’-ATGGAGATTATCTCGACACCGGGGG-3’;
    R:5’-GCAAGATAGCCCCGATCGACAACCC-3’。
  8. 如权利要求6所述的检测方法,其中在步骤(4)中,如果同源性大于99%,则判定待测样品中含有结核分枝杆菌复合群。
  9. 一种检测试剂盒,包含权利要求1所述的结核分枝杆菌H37Rv编码基因Rv1796c(-|2034439-2034570|)和/或权利要求4所述的特异性PCR引物。
  10. 权利要求1所述的结核分枝杆菌H37Rv编码基因 Rv1796c(-|2034439-2034570|)在结核流行病学调查和/或临床结核病患者快速诊断和鉴别中的应用。
PCT/CN2019/084047 2018-04-28 2019-04-24 结核分枝杆菌H37Rv编码基因及其应用 WO2019206176A1 (zh)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201810403798.2 2018-04-28
CN201810403798.2A CN110408632B (zh) 2018-04-28 2018-04-28 结核分枝杆菌H37Rv编码基因及其应用

Publications (1)

Publication Number Publication Date
WO2019206176A1 true WO2019206176A1 (zh) 2019-10-31

Family

ID=68293763

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2019/084047 WO2019206176A1 (zh) 2018-04-28 2019-04-24 结核分枝杆菌H37Rv编码基因及其应用

Country Status (2)

Country Link
CN (1) CN110408632B (zh)
WO (1) WO2019206176A1 (zh)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113403327A (zh) * 2020-03-17 2021-09-17 北京蛋白质组研究中心 结核分枝杆菌H37Rv新基因Rv2706及其编码蛋白和应用

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2006035317A2 (en) * 2004-09-30 2006-04-06 Institut Pasteur Immunogenic glycopeptides for diagnosing pathogenic microorganisms infections
CN107304231A (zh) * 2016-04-18 2017-10-31 华中农业大学 一种结核分枝杆菌融合蛋白及应用

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2006008575A2 (en) * 2004-07-12 2006-01-26 Avestha Gengraine Technologies Pvt. Ltd. Construction of a comparative database and identificaiton of virulence factors through comparison of polymorphic regions in clinical isolates of infectious organisms

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2006035317A2 (en) * 2004-09-30 2006-04-06 Institut Pasteur Immunogenic glycopeptides for diagnosing pathogenic microorganisms infections
CN107304231A (zh) * 2016-04-18 2017-10-31 华中农业大学 一种结核分枝杆菌融合蛋白及应用

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
AXEL-CLAUDE GARAPIN ET AL.: "Mixed immune response induced in rodents by two naked DNA genes coding for mycobacterial glycosylated proteins", VACCINE, 6 April 2001 (2001-04-06), XP027322000, ISSN: 0264-410X *

Also Published As

Publication number Publication date
CN110408632B (zh) 2021-01-19
CN110408632A (zh) 2019-11-05

Similar Documents

Publication Publication Date Title
Ha et al. Application of the whole genome-based bacterial identification system, TrueBac ID, using clinical isolates that were not identified with three matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOF MS) systems
Carbonnelle et al. Rapid identification of Staphylococci isolated in clinical microbiology laboratories by matrix-assisted laser desorption ionization-time of flight mass spectrometry
Cherkaoui et al. Comparison of two matrix-assisted laser desorption ionization-time of flight mass spectrometry methods with conventional phenotypic identification for routine identification of bacteria to the species level
WO2019206172A1 (zh) 结核分枝杆菌H37Rv编码基因及其应用
US20200294628A1 (en) Creation or use of anchor-based data structures for sample-derived characteristic determination
CN110408630B (zh) 结核分枝杆菌H37Rv编码基因及其应用
CN116171198A (zh) 用于快速早期检测感染的宿主rna生物标志物和早期鉴定人的covid-19冠状病毒感染的系统、方法和组合物
Baoshan et al. Combined nucleic acid assays for diagnosis of A19 vaccine‐caused human brucellosis
WO2019208691A1 (ja) カンジダ・アウリス検出用プライマーセット、カンジダ・アウリス検出キットおよびカンジダ・アウリスの検出方法
CN108004253B (zh) 结核分枝杆菌H37Rv编码基因及其应用
CN108165562B (zh) 结核分枝杆菌H37Rv编码基因及其应用
CN108165561B (zh) 结核分枝杆菌H37Rv编码基因及其应用
WO2019206176A1 (zh) 结核分枝杆菌H37Rv编码基因及其应用
CN108165560B (zh) 结核分枝杆菌H37Rv编码基因及其应用
CN108165564B (zh) 结核分枝杆菌H37Rv编码基因及其应用
CN108165565B (zh) 结核分枝杆菌H37Rv编码基因及其应用
CN111286551A (zh) 结核分枝杆菌的快速检测引物、试剂盒及其使用方法
RAO Infection of Mycobacterium mageritense at surgical site: a first case report of India
CN108165563B (zh) 结核分枝杆菌H37Rv编码基因及其应用
CN110408631B (zh) 结核分枝杆菌H37Rv编码基因及其应用
CN114196779A (zh) 基于靶向测序的病原微生物检测方法及试剂盒
Chen et al. Improved targeting of the 16S rDNA nanopore sequencing method enables rapid pathogen identification in bacterial pneumonia in children
JP7023465B2 (ja) 検体中の細菌数の定量方法
CN114107454A (zh) 基于宏基因/宏转录组测序的呼吸道感染病原检测方法
CN113174443B (zh) 一种分枝杆菌鉴定方法及其生物材料

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 19792050

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 11/03/2021)

122 Ep: pct application non-entry in european phase

Ref document number: 19792050

Country of ref document: EP

Kind code of ref document: A1