TWI640636B - A method for simultaneous performing gene locus, chromosome and linkage analysis - Google Patents

A method for simultaneous performing gene locus, chromosome and linkage analysis Download PDF

Info

Publication number
TWI640636B
TWI640636B TW105137959A TW105137959A TWI640636B TW I640636 B TWI640636 B TW I640636B TW 105137959 A TW105137959 A TW 105137959A TW 105137959 A TW105137959 A TW 105137959A TW I640636 B TWI640636 B TW I640636B
Authority
TW
Taiwan
Prior art keywords
snp
reaction
amplification
analysis
sample
Prior art date
Application number
TW105137959A
Other languages
Chinese (zh)
Other versions
TW201728758A (en
Inventor
曉亮 謝
喬傑
陸思嘉
閆麗盈
湯富酬
黃蕾
Original Assignee
上海序康醫療科技有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 上海序康醫療科技有限公司 filed Critical 上海序康醫療科技有限公司
Publication of TW201728758A publication Critical patent/TW201728758A/en
Application granted granted Critical
Publication of TWI640636B publication Critical patent/TWI640636B/en

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6869Methods for sequencing
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids

Landscapes

  • Chemical & Material Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Organic Chemistry (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Zoology (AREA)
  • Wood Science & Technology (AREA)
  • Health & Medical Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Microbiology (AREA)
  • Immunology (AREA)
  • Biotechnology (AREA)
  • Molecular Biology (AREA)
  • Biophysics (AREA)
  • Analytical Chemistry (AREA)
  • Physics & Mathematics (AREA)
  • Biochemistry (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Genetics & Genomics (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)

Abstract

本發明涉及一種同時完成基因位點、染色體及連鎖分析的方法,具體包括胚胎細胞樣本的獲取、全基因組擴增、目的基因突變位點擴增、全基因組擴增產物及目的基因突變位點的資料庫建立、高通量定序和數據分析各主要步驟,藉由利用全基因組擴增技術結合高通量定序,一步完成多項綜合性檢測,避免了使用多方法、多步驟分別進行單基因遺傳病突變位點、染色體疾病和連鎖分析的檢測。本發明所提供的方法為微量樣本提供了有利條件,不僅可用於PGD檢測,確定胚胎是否攜帶致病基因和染色體複製數異常情況;也適用於反覆流產、高齡婦女胚胎的遺傳性篩選,實現一個步驟完成單個樣本多項檢測。因其操作簡單、週期短、可行性強,有利於推廣和應用。The invention relates to a method for simultaneously performing gene locus, chromosome and linkage analysis, which particularly comprises obtaining embryonic cell sample, whole genome amplification, amplification of target gene mutation site, whole genome amplification product and target gene mutation site. The main steps of database establishment, high-throughput sequencing and data analysis, by using whole-genome amplification technology combined with high-throughput sequencing, complete multiple comprehensive detections in one step, avoiding the use of multiple methods, multiple steps to separate single genes Detection of genetic mutation sites, chromosomal disorders, and linkage analysis. The method provided by the invention provides favorable conditions for a small amount of samples, and can be used not only for PGD detection, but also for determining whether an embryo carries a disease-causing gene and an abnormality of chromosome replication; and is also suitable for genetic screening of abortion, older women's embryos, and realizing a The step completes a single sample multiple test. Because of its simple operation, short cycle and strong feasibility, it is conducive to promotion and application.

Description

一種同時完成基因位點、染色體及連鎖分析的方法A method for simultaneously performing gene locus, chromosome and linkage analysis

本發明涉及基因組序列分析領域和生物資訊學領域,具體涉及一種利用MALBAC擴增技術結合高通量定序,一步完成單細胞單基因疾病、染色體疾病及連鎖分析的方法。The invention relates to the field of genomic sequence analysis and the field of bioinformatics, and particularly relates to a method for realizing single-cell monogenic diseases, chromosomal diseases and linkage analysis by using MALBAC amplification technology combined with high-throughput sequencing.

遺傳疾病是指人體中的遺傳物質(染色體或者粒線體DNA上的基因序列)發生改變而引起的疾病,近年來遺傳病的發生率逐年增高。在我國,每年有多達數萬例染色體異常的患兒出生,染色體異常患兒至今仍無法治癒。而目前已經發現的單基因遺傳疾病有7000多種,其中已經明確致病基因的有4000多種,雖然單基因遺傳病的單個病種發生率較低,但由於其種類繁多,所以在出生活嬰中的整體發生率和人群中的整體盛行率並不低,並且大部分單基因遺傳病具有致死性、致殘性或致畸性,除少部分可以藉由某些治療手段進行校正外,大部分至今尚無有效的治療手段。解決上述問題的根本途徑就是進行出生前或胚胎移植前診斷,避免此類患兒的出生,因此,胚胎著床前遺傳診斷對預防單基因遺傳病及染色體遺傳病的發生和傳遞具有重要的科學及社會意義。Genetic diseases refer to diseases caused by changes in genetic material (gene sequences on chromosomes or mitochondrial DNA) in humans. In recent years, the incidence of genetic diseases has increased year by year. In China, as many as tens of thousands of children with chromosomal abnormalities are born each year, and children with chromosomal abnormalities are still incurable. At present, there are more than 7,000 single-gene genetic diseases, of which more than 4,000 have been identified. Although the incidence of single disease in single-gene genetic diseases is low, due to its wide variety, it is in the life of infants. The overall incidence and overall prevalence in the population are not low, and most monogenic genetic diseases are lethal, disabling or teratogenic, with the exception of a small number of corrections that can be corrected by some treatments, most There is no effective treatment to date. The fundamental way to solve the above problems is to pre-natal or pre-embryo diagnosis to avoid the birth of such children. Therefore, pre-implantation genetic diagnosis has an important science to prevent the occurrence and transmission of monogenic genetic diseases and chromosomal genetic diseases. And social significance.

遺傳病的胚胎植入前遺傳學診斷(PGD)是生殖醫學重要內容,可以阻斷在胚胎著床前,避免患兒出生,從而避免和減少遺傳病的發生,減輕家庭和社會的經濟負擔。目前,胚胎著床前遺傳學診斷主要採用FISH和單細胞PCR技術,以及近幾年應用到臨床PGD的CGH array和SNP array技術。其中,應用FISH進行胚胎篩選受到探針的限制,不能同時檢測所有染色體狀態;單細胞固定易導致訊息丟失,單細胞PCR技術易造成等位基因丟失(ADO),從而引起誤診;CGH array只能診斷染色體數目改變及大片段染色體結構(非平衡易位,重複、缺失),而SNP array雖能進行所有CGH能診斷的染色體異常,分辨率較CGH提高,但不能同時完成單基因遺傳病和染色體疾病的診斷。Preimplantation genetic diagnosis (PGD) of genetic diseases is an important part of reproductive medicine. It can block the birth of embryos before implantation, avoiding and reducing the occurrence of genetic diseases, and reducing the economic burden of family and society. At present, preimplantation genetic diagnosis mainly uses FISH and single cell PCR technology, as well as CGH array and SNP array technology applied to clinical PGD in recent years. Among them, the application of FISH for embryo screening is limited by probes, and it is impossible to detect all chromosome states at the same time; single cell fixation is easy to cause loss of information, single-cell PCR technology is easy to cause allele loss (ADO), which causes misdiagnosis; CGH array can only Diagnosing changes in chromosome number and large fragment chromosome structure (non-equilibrium translocation, duplication, deletion), while SNP array can perform all CGH-diagnosed chromosomal abnormalities with higher resolution than CGH, but cannot simultaneously complete single-gene genetic diseases and chromosomes. Diagnosis of the disease.

可見,現有的胚胎植入前遺傳學診斷技術仍存在各種不足,無法滿足實際需求,在使用和推廣上均受到一定的限制。It can be seen that the existing preimplantation genetic diagnosis technology still has various deficiencies, which cannot meet the actual needs, and is limited in use and promotion.

此外,連鎖分析方法現有的主要是採用STR和SNP兩種遺傳標記。多重螢光PCR技術將多重PCR結合螢光探針檢測STR分型,由於STR連鎖標記較少,應用到具體案例中可能會沒有可用STR位點,所以進行臨床檢測前需要做大量預實驗工作,STR遺傳標記大多距離致病位點距離較遠,易發生染色體重組導致誤診,且檢測成本較高,故此技術無法進行批量檢測。而晶片和探針捕獲SNP的方法,與普通PCR捕獲SNP方法相比較,成本偏高,週期偏長。In addition, the existing linkage analysis method mainly uses two genetic markers, STR and SNP. Multiplex PCR technology combines multiplex PCR with fluorescent probes to detect STR typing. Since there are few STR linkage markers, there may be no STR sites available in specific cases, so a lot of pre-experimental work is needed before clinical testing. Most of the STR genetic markers are far away from the pathogenic site, which is prone to misdiagnosis due to chromosome recombination, and the detection cost is high. Therefore, the technique cannot perform batch detection. The method of capturing SNP by the wafer and the probe is higher in cost and longer in cycle than the conventional PCR capture SNP method.

綜上所述,即使對於大量細胞的樣本,現有技術無法藉由一步檢測同時完成單基因遺傳病、染色體異常和連鎖分析。另外,對於少量細胞的樣本(尤其是用於胚胎植入前遺傳學診斷的微量細胞),更無法有效地進行同時完成單基因遺傳病、染色體異常和連鎖分析。In summary, even for samples of a large number of cells, the prior art cannot simultaneously perform single-gene genetic diseases, chromosomal abnormalities, and linkage analysis by one-step detection. In addition, for samples of a small number of cells (especially for micro-cells used for preimplantation genetic diagnosis), it is even more difficult to perform simultaneous single-gene genetic diseases, chromosomal abnormalities, and linkage analysis.

因此,本領域急需一種開發在僅採用微量細胞就能同時進行單基因遺傳病、染色體異常和連鎖分析的PGD檢測方法。Therefore, there is an urgent need in the art to develop a PGD detection method that can simultaneously perform single gene genetic diseases, chromosomal abnormalities, and linkage analysis using only a small number of cells.

本發明的目的就是提供一種開發僅基於微量細胞(如3-10個細胞或更少細胞)就能同時進行單基因遺傳病、染色體異常和連鎖分析的PGD檢測方法。It is an object of the present invention to provide a PGD assay for the development of single gene genetic diseases, chromosomal abnormalities and linkage analysis based on only a small number of cells (e.g., 3-10 cells or less).

在本發明的第一方面,提供了一種同時完成基因位點、染色體及連鎖分析的方法,包括以下各步驟: (1) 胚胎細胞樣本的獲得:藉由單精子注射獲得受精卵,培養至囊胚期在外滋養層細胞分離獲得包含3-10個細胞的樣本; (2) 全基因組擴增:在樣本中加入裂解液放入PCR儀中進行裂解、失活蛋白酶,對獲得的細胞裂解樣本進行全基因組擴增; (3) 目的基因突變位點擴增:在完成引子設計、引子評估之後,將目的基因突變位點進行普通PCR擴增獲得PCR產物,將同一樣本的PCR產物與步驟(2)中獲得的片段化的基因組DNA按照一定比例混合; (4) 全基因組擴增產物與目的基因突變位點進行資料庫建立; (5) 採用高通量定序平臺,對樣本進行定序; (6) 數據分析 (6.1) 將步驟(5)獲得的定序結果去掉轉接子以及低品質數據,採用比對軟體比對到參考基因組; (6.2) 下機的數據經品質控制後比對到參考序列上,統計位點的等位基因的分佈及其頻率,檢測目的基因位點突變資訊; (6.3) 下機的數據經品質控制後比對到參考序列上,序列按照一定視窗進行GC校正,以幾千例全基因組擴增的單細胞數據為數據庫進行修正,最終檢測出染色體複製數變異(CNV); (6.4) 採用SNP-單套型進行連鎖分析。In a first aspect of the invention, there is provided a method for simultaneously performing gene locus, chromosome and linkage analysis, comprising the following steps: (1) obtaining an embryonic cell sample: obtaining a fertilized egg by single sperm injection, and culturing to a capsule The embryonic stage is isolated from the outer trophoblast cells to obtain a sample containing 3-10 cells; (2) Whole genome amplification: the lysate is added to the sample and placed in a PCR machine for lysis, inactivating protease, and the obtained cell lysis sample is subjected to Whole genome amplification; (3) Amplification of target gene mutation site: After completion of primer design and primer evaluation, the target gene mutation site is subjected to ordinary PCR amplification to obtain PCR product, and the PCR product and step of the same sample (2) The fragmented genomic DNA obtained in the mixture is mixed according to a certain ratio; (4) the whole genome amplification product and the target gene mutation site are used for database establishment; (5) the sample is sequenced by using a high-throughput sequencing platform; (6) Data analysis (6.1) The sequence results obtained in step (5) are removed from the adapter and low quality data, and the comparison software is used to compare the reference genome; (6.2) After the quality control, the data of the lower machine is compared to the reference sequence, the distribution of the alleles of the statistical locus and its frequency, and the mutation information of the target gene locus is detected; (6.3) The data of the lower machine is compared by the quality control. On the reference sequence, the sequence was GC-corrected according to a certain window, and the single-cell data of thousands of whole-genome amplifications were corrected for the database, and the chromosome replication number variation (CNV) was finally detected; (6.4) SNP-single-type type was used. Linkage analysis.

在另一優選例中,所述方法是非診斷性、非治療性的。In another preferred embodiment, the method is non-diagnostic, non-therapeutic.

在另一優選例中,所述步驟(3)中,普通PCR擴增的模版選自下組:(i)細胞裂解樣本的核酸、(ii)步驟(2)的全基因組擴增反應的產物、或(iii)由(i)和(ii)混合的形成的混合模版。In another preferred embodiment, in the step (3), the template for ordinary PCR amplification is selected from the group consisting of: (i) nucleic acid of the cell lysis sample, and (ii) the product of the whole genome amplification reaction of the step (2). Or (iii) a mixed stencil formed by mixing (i) and (ii).

在另一優選例中,所述的普通PCR擴增的模版為步驟(2)的全基因組擴增反應的產物、或細胞裂解樣本的核酸與步驟(2)的全基因組擴增反應的產物的混合物(其中,(ii)與(i)的混合比為10-10000:1,較佳地100-1000:1)。In another preferred embodiment, the common PCR amplified template is the product of the whole genome amplification reaction of step (2), or the nucleic acid of the cell lysis sample and the product of the whole genome amplification reaction of step (2). The mixture (wherein the mixing ratio of (ii) to (i) is from 10 to 10000:1, preferably from 100 to 1,000:1).

在另一優選例中,所述步驟(2)的全基因組擴增由多重黏合與成環擴增循環(MALBAC)實現,包括第一輪線性擴增和第二輪指數擴增。In another preferred embodiment, the whole genome amplification of step (2) is achieved by a multiple binding and loop-forming amplification cycle (MALBAC), including a first round of linear amplification and a second round of exponential amplification.

在另一優選例中,所述步驟(2)中第一輪線性擴增的條件為: (1) 90-98℃反應90s-5min; (2) 15-25℃反應30s-1min; (3) 25-35℃反應30s-1min; (4) 35-45℃反應20s-1min; (5) 45-55℃反應20s-1min; (6) 55-65℃反應20s-1min; (7) 65-85℃反應2min-6min; (8) 90-98℃反應10-40s; (9) 45-65℃反應5-20s; (10) 重複步驟(2)到步驟(9) 5至20個循環。In another preferred embodiment, the conditions of the first round of linear amplification in the step (2) are: (1) 90-98 ° C reaction for 90 s - 5 min; (2) 15 - 25 ° C reaction for 30 s - 1 min; 25-35 ° C reaction 30s-1min; (4) 35-45 ° C reaction 20s-1min; (5) 45-55 ° C reaction 20s-1min; (6) 55-65 ° C reaction 20s-1min; (7) 65 -85 ° C reaction 2min-6min; (8) 90-98 ° C reaction 10-40s; (9) 45-65 ° C reaction 5-20s; (10) repeat steps (2) to step (9) 5 to 20 cycles .

在另一優選例中,所述步驟(2)中第一輪線性擴增的條件優選為: (1) 94℃反應3min; (2) 20℃反應40s; (3) 30℃反應40s; (4) 40℃反應30s; (5) 50℃反應30s; (6) 60℃反應30s; (7) 70℃反應4min; (8) 95℃反應20s; (9) 58℃反應10s; (10) 重複步驟(2)到步驟(9) 8個循環。In another preferred embodiment, the conditions of the first round of linear amplification in the step (2) are preferably: (1) reaction at 94 ° C for 3 min; (2) reaction at 20 ° C for 40 s; (3) reaction at 30 ° C for 40 s; 4) 40 ° C reaction for 30 s; (5) 50 ° C reaction for 30 s; (6) 60 ° C reaction for 30 s; (7) 70 ° C reaction for 4 min; (8) 95 ° C reaction for 20 s; (9) 58 ° C reaction for 10 s; (10) Repeat steps (2) through (9) for 8 cycles.

在另一優選例中,所述步驟(2)中第二輪指數擴增的條件為: (1) 90-98℃反應10s-50s; (2) 90-98℃反應10s-40s; (3) 45-65℃反應20s-45s; (4) 65-80℃反應75s-5min; (5) 重複步驟(2)到步驟(4) 10至30個循環; (6) 將擴增後的產物在0-5℃保存。In another preferred embodiment, the conditions of the second round of exponential amplification in the step (2) are: (1) 90-98 ° C reaction for 10 s - 50 s; (2) 90-98 ° C reaction for 10 s - 40 s; 45-65 ° C reaction 20s-45s; (4) 65-80 ° C reaction 75s-5min; (5) repeat steps (2) to step (4) 10 to 30 cycles; (6) the product after amplification Store at 0-5 °C.

在另一優選例中,所述步驟(2)中第二輪指數擴增的條件優選為: (1) 94℃反應30s; (2) 94℃反應20s; (3) 58℃反應30s; (4) 72℃反應3min; (5) 重複步驟(2)到步驟(4) 17個循環; (6) 將擴增後的產物在4℃保存。In another preferred embodiment, the condition of the second round of exponential amplification in the step (2) is preferably: (1) reaction at 94 ° C for 30 s; (2) reaction at 94 ° C for 20 s; (3) reaction at 58 ° C for 30 s; 4) React at 72 ° C for 3 min; (5) Repeat steps (2) to (4) for 17 cycles; (6) Store the amplified product at 4 ° C.

在另一優選例中,MALBAC擴增後的產物在300-2000bp之間或250-2000bp之間。In another preferred embodiment, the product after MALBAC amplification is between 300-2000 bp or between 250-2000 bp.

在另一優選例中,所述步驟(3)中的引子設計在設計時,引子長度為20-25bp,擴增片段大小根據定序用的reads來進行調整設計,單端的不能超過reads長度,雙端的不能超過2倍長。In another preferred embodiment, the primer design in the step (3) is designed to have a length of 20-25 bp, and the size of the amplified fragment is adjusted according to the order of the read, and the single-ended cannot exceed the read length. The double end can't be more than 2 times longer.

在另一優選例中,所述步驟(3)中PCR產物與片段化的基因組DNA的混合比例為1:(1-100);較佳地為1:(10-100);更佳地1:(20-50)。In another preferred embodiment, the mixing ratio of the PCR product to the fragmented genomic DNA in the step (3) is 1: (1-100); preferably 1: (10-100); more preferably 1 :(20-50).

在另一優選例中,所述步驟(5)中的高通量定序如果只進行單基因致病位點和染色體複製數變異(CNV)分析,每個樣本定序平均深度最低為基因組的0.1倍;如果要同時獲得SNP連鎖資訊,每個樣本定序平均深度最低為基因組的2倍。In another preferred embodiment, the high-throughput sequencing in the step (5) is performed only on a single gene pathogenic site and a chromosome replication number variation (CNV) analysis, and each sample has a minimum average order depth of the genome. 0.1 times; if SNP linkage information is to be obtained at the same time, the average depth of sequencing of each sample is at least 2 times that of the genome.

在另一優選例中,所述步驟(6.4)採用SNP-單套型進行連鎖分析具體操作如下: (1) 選擇SNP區域,界定在目的基因上下游1M範圍內; (2) 用軟體獲得目標區域的SNP,所述軟體選自:samtools mpileup,GATK,FreeBayes,VarScan; (3) SNP過濾:等位基因雜合度差異最低為10%,除去潛在錯誤的SNP; (4) 篩選區分型SNP,並構建父本和母本SNP-單套型:同一SNP位點,在父本和母本的4個等位基因鹼基中,有1個鹼基不同於其他3個即可區分; (5) 分析SNP-單套型:胚胎SNP-單套型有2個,分別遺傳自父本和母本各1條,根據區分型SNP和孟德爾遺傳原理,判斷其SNP-單套型具體哪個是父源遺傳,哪個是母源遺傳; (6) 結果分析:根據步驟(4)確定父本和母本SNP-單套型,區分出與致病位點連鎖的單套型,將胚胎所在SNP的具體等位基因對比,根據孟德爾遺傳原理,判斷胚胎是否攜帶致病基因位點。In another preferred embodiment, the step (6.4) uses a SNP-single set to perform linkage analysis as follows: (1) selecting a SNP region, defined within a range of 1 M upstream and downstream of the target gene; (2) obtaining a target with software The SNP of the region, the soft body is selected from the group consisting of: samtools mpileup, GATK, FreeBayes, VarScan; (3) SNP filtration: the lowest allergy heterozygosity difference is 10%, removing potentially erroneous SNPs; (4) screening differentiated SNPs, And construct the male and female SNP-single type: the same SNP locus, in the four allele bases of the male and female parent, one base is different from the other three to distinguish; Analysis of SNP-single-type: There are 2 embryonic SNP-single-type types, which are inherited from the father and the female, respectively. According to the distinguishing SNP and Mendelian genetic principle, it is judged which SNP-single type is specific. Parental inheritance, which is maternal inheritance; (6) Analysis of results: According to step (4) to determine the male and female SNP-single type, distinguish the single set of linkages with the pathogenic site, the SNP where the embryo is located Specific allelic comparison, according to Mendelian genetic principle, to determine whether the embryo carries a causative basis Site.

在另一優選例中,步驟(5)的胚胎SNP-單套型分析中,區分型SNP最低為10個,若有3個以上SNP錯誤,此胚胎SNP-單套型數據視為數據量不足,難以進行分析。In another preferred embodiment, in the embryonic SNP-single set analysis of step (5), the minimum SNP is 10, and if there are more than 3 SNP errors, the embryo SNP-single set data is regarded as insufficient data. It is difficult to analyze.

應理解,在本發明範圍內中,本發明的上述各技術特徵和在下文(如實施例)中具體描述的各技術特徵之間都可以互相組合,從而構成新的或優選的技術方案。限於篇幅,在此不再一一累述。It is to be understood that within the scope of the present invention, the various technical features of the present invention and the various technical features specifically described hereinafter (as in the embodiments) may be combined with each other to constitute a new or preferred technical solution. Due to space limitations, we will not repeat them here.

較佳實施例之詳細說明Detailed description of the preferred embodiment

本發明人經過長期廣泛而深入的研究,藉由大量的試驗,首次開發了一種針對微量細胞樣本(如僅3-10個細胞)、基於MALBAC擴增技術結合高通量定序、簡便、快速、準確地對同一樣本一步完成胚胎單基因遺傳病、染色體疾病及連鎖分析等綜合性檢測的方法。具體地,本發明方法藉由對微量樣本(如胚胎樣本例如3-10個滋養層細胞)進行全基因組擴增(MALBAC擴增),然後以樣本核酸(優選全基因組擴增產物)作為模版進行普通PCR擴增(即突變位點PCR擴增),將來自同一樣本的突變位點PCR擴增產物與全基因組擴增產物以一定比例混合[例如1:(10-100)]後一起建立資料庫,然後進行高通量定序,並對定序數據進行比對分析,實現對同一微量樣本一步完成胚胎單基因遺傳病、染色體疾病及連鎖分析等多項檢測。在此基礎上,本發明人完成了本發明。檢測方法 The inventors have long-term extensive and in-depth research, through a large number of experiments, for the first time developed a micro-cellular sample (such as only 3-10 cells), based on MALBAC amplification technology combined with high-throughput sequencing, simple, fast A comprehensive method for comprehensive detection of embryonic monogenic genetic diseases, chromosomal diseases and linkage analysis in the same sample in one step. Specifically, the method of the present invention performs whole genome amplification (MALBAC amplification) on a trace sample (such as an embryo sample such as 3-10 trophoblast cells), and then uses a sample nucleic acid (preferably a whole genome amplification product) as a template. Ordinary PCR amplification (ie, PCR amplification of the mutation site), and the PCR amplification product of the mutation site from the same sample is mixed with the whole genome amplification product in a certain ratio [eg 1: (10-100)] to establish data together. The library then performs high-throughput sequencing and performs comparative analysis on the sequenced data to achieve multiple detections of embryonic monogenic genetic diseases, chromosomal diseases, and linkage analysis in one step. On the basis of this, the inventors completed the present invention. Detection method

本發明提供了一種如第一方面中所述的同時完成基因位點、染色體及連鎖分析的方法。The present invention provides a method of simultaneously performing gene locus, chromosome and linkage analysis as described in the first aspect.

典型地,一種同時完成基因位點、染色體及連鎖分析的方法包括以下各步驟: (1) 胚胎細胞樣本的獲得:藉由單精子注射獲得受精卵,培養至囊胚期在外滋養層細胞分離獲得包含3-10個細胞的樣本,其中胚胎細胞可以選自人或者其他哺乳動物。 (2) 全基因組擴增:在樣本中加入裂解液放入PCR儀中進行裂解、失活蛋白酶,對獲得的細胞裂解樣本進行全基因組擴增,優選採用多重黏合與成環擴增循環(MALBAC)實現;具體而言,在細胞裂解樣本中加入預擴增混合液進行第一輪線性擴增,之後再加入擴增混合液進行第二輪指數擴增,將擴增產物進行純化;MALBAC擴增後的產物在300-2000bp (或在250-2000bp)之間。 典型地,第一輪線性擴增的條件為: (1) 90-98℃反應90s-5min; (2) 15-25℃反應30s-1min; (3) 25-35℃反應30s-1min; (4) 35-45℃反應20s-1min; (5) 45-55℃反應20s-1min; (6) 55-65℃反應20s-1min; (7) 65-85℃反應2min-6min; (8) 90-98℃反應10-40s; (9) 45-65℃反應5-20s; (10) 重複步驟(2)到步驟(9) 5至20個循環。 第二輪指數擴增的條件為: (1) 90-98℃反應10s-50s; (2) 90-98℃反應10s-40s; (3) 45-65℃反應20s-45s; (4) 65-80℃反應75s-5min; (5) 重複步驟(2)到步驟(4) 10至30個循環; (6) 將擴增後的產物在0-5℃保存。 (3) 目的基因突變位點擴增:在完成引子設計、引子評估之後,將目的基因突變位點進行普通PCR擴增獲得PCR產物,將同一樣本的PCR產物與步驟(2)中獲得的片段化的基因組DNA按照1:(1-100)[較佳地1:(10-100)]的質量比混合。其中在進行引子設計時,引子長度為20-25bp,擴增片段大小根據定序用的reads來進行調整設計,單端的不能超過reads長度,雙端的不能超過2倍長。 (4) MALBAC擴增產物與目的基因突變位點進行基因庫的建構。 (5) 採用高通量定序平臺,對樣本進行定序;如果只進行單基因致病位點和染色體複製數變異(CNV)分析,每個樣本定序平均深度最低為基因組的0.1倍;如果要同時獲得SNP連鎖資訊,每個樣本需定序平均深度最低為基因組的2倍。 (6) 數據分析 (6.1) 將步驟(5)獲得的定序結果去掉轉接子以及低品質數據,採用比對軟體比對到參考基因組; (6.2) 下機的數據經品質控制後比對到參考序列上,統計位點的等位基因的分佈及其頻率,檢測目的基因位點突變資訊; (6.3) 下機的數據經品質控制後比對到參考序列上,序列按照一定視窗進行GC校正,以幾千例MALBAC擴增的單細胞數據為數據庫進行修正,最終檢測出染色體複製數變異(CNV); (6.4) 採用SNP-單套型進行連鎖分析: (6.4.1) 選擇SNP區域,界定在目的基因上下游1M範圍內; (6.4.2) 用軟體獲得目標區域的SNP,所述軟體包括但不限於samtools mpileup、GATK、FreeBayes、VarScan中的至少一種; (6.4.3) SNP過濾:等位基因雜合度差異最低為10%,除去潛在錯誤的SNP; (6.4.4) 篩選區分型SNP,並構建父本和母本SNP-單套型:同一SNP位點,在父本和母本的4個等位基因鹼基中,有1個鹼基不同於其他3個即可區分; (6.4.5) 分析SNP-單套型:胚胎SNP-單套型有2個,分別遺傳自父本和母本各1條,根據區分型SNP和孟德爾遺傳原理,判斷其SNP-單套型具體哪個是父源遺傳,哪個是母源遺傳;其中區分型SNP最低為10個,若有3個以上SNP錯誤,此胚胎SNP-單套型數據視為數據量不足,難以進行分析; (6.4.6) 結果分析:根據步驟(6.4.4)確定父本和母本SNP-單套型,區分出與致病位點連鎖的單套型,將胚胎所在SNP的具體等位基因對比,根據孟德爾遺傳原理,判斷胚胎是否攜帶致病基因位點。Typically, a method for simultaneously performing gene locus, chromosome, and linkage analysis includes the following steps: (1) Obtaining an embryonic cell sample: obtaining a fertilized egg by single sperm injection, and culturing until blastocyst stage is obtained in an outer trophoblast cell. A sample comprising 3-10 cells, wherein the embryonic cells can be selected from a human or other mammal. (2) Whole genome amplification: lysate is added to the sample to be lysed, inactivated, and the whole cell is amplified. The obtained cell lysate sample is subjected to whole genome amplification, preferably using multiple binding and loop-forming amplification cycles (MALBAC). Implementation; specifically, a pre-amplification mixture is added to the cell lysis sample for the first round of linear amplification, and then the amplification mixture is added for the second round of exponential amplification to purify the amplified product; MALBAC expansion The increased product is between 300-2000 bp (or between 250-2000 bp). Typically, the first round of linear amplification conditions are: (1) 90-98 ° C reaction 90s-5min; (2) 15-25 ° C reaction 30s-1min; (3) 25-35 ° C reaction 30s-1min; 4) 35-45 ° C reaction 20s-1min; (5) 45-55 ° C reaction 20s-1min; (6) 55-65 ° C reaction 20s-1min; (7) 65-85 ° C reaction 2min-6min; (8) Reaction at 90-98 ° C for 10-40 s; (9) Reaction at 45-65 ° C for 5-20 s; (10) Repeat steps (2) to (9) for 5 to 20 cycles. The second round of exponential amplification conditions are: (1) 90-98 ° C reaction 10s-50s; (2) 90-98 ° C reaction 10s-40s; (3) 45-65 ° C reaction 20s-45s; (4) 65 -80 ° C reaction for 75 s - 5 min; (5) Repeat steps (2) to (4) 10 to 30 cycles; (6) Store the amplified product at 0-5 ° C. (3) Amplification of the target gene mutation site: After completing the primer design and the primer evaluation, the target gene mutation site is subjected to ordinary PCR amplification to obtain a PCR product, and the PCR product of the same sample and the fragment obtained in the step (2) are obtained. The genomic DNA is mixed in a mass ratio of 1: (1-100) [preferably 1: (10-100)]. In the design of the primer, the length of the primer is 20-25 bp, and the size of the amplified fragment is adjusted according to the order of the read. The single-ended can not exceed the length of the reads, and the double-ended can not exceed 2 times. (4) Construction of the gene pool by MALBAC amplification products and target gene mutation sites. (5) Using a high-throughput sequencing platform to sequence the samples; if only single-gene causative sites and chromosome replication number variation (CNV) analysis are performed, the average depth of sequencing of each sample is at least 0.1 times that of the genome; If you want to obtain SNP linkage information at the same time, each sample needs to be ordered to have a minimum depth of 2 times the genome. (6) Data analysis (6.1) The sequence results obtained in step (5) are removed from the adapter and low-quality data, and the comparison software is used to compare the reference genome; (6.2) The data of the lower machine is quality-controlled and compared. To the reference sequence, statistically distribute the alleles of the locus and their frequencies, and detect the mutation information of the target locus; (6.3) The data of the lower machine is compared with the reference sequence by quality control, and the sequence is performed according to a certain window. Correction, using thousands of MALBAC-expanded single-cell data as a database for correction, and finally detecting chromosome replication number variation (CNV); (6.4) SNP-single-type linkage analysis: (6.4.1) Selection of SNP regions , defined in the range of 1M upstream and downstream of the target gene; (6.4.2) obtaining the SNP of the target region by using software, including but not limited to at least one of samtools mpileup, GATK, FreeBayes, VarScan; (6.4.3) SNP Filtration: the lowest allele heterozygosity difference is 10%, remove the potential wrong SNP; (6.4.4) Screen the distinguishing SNP, and construct the male and female SNP-single type: the same SNP locus, in the male parent And 4 of the female parent Among the bases of the gene, one base is different from the other three; (6.4.5) Analysis of SNP-single type: There are 2 embryonic SNP-single type, which are inherited from the father and the female parent respectively. According to the distinguishing SNP and Mendelian genetic principle, it is judged which SNP-single type is the parental inheritance and which is the maternal inheritance; among them, the minimum SNP is 10, if there are more than 3 SNP errors The embryonic SNP-single-type data is regarded as insufficient data, which is difficult to analyze; (6.4.6) Analysis of results: According to step (6.4.4), the male and female SNP-single sets are determined, and the results are distinguished. A single set of disease-linked loci, comparing the specific alleles of the SNPs of the embryos, and judging whether the embryos carry the disease-causing gene loci according to the Mendelian genetic principle.

優選地,所述步驟(2)中第一輪線性擴增的條件為: (1) 94℃反應3min; (2) 20℃反應40s; (3) 30℃反應40s; (4) 40℃反應30s; (5) 50℃反應30s; (6) 60℃反應30s; (7) 70℃反應4min; (8) 95℃反應20s; (9) 58℃反應10s; (10) 重複步驟(2)到步驟(9) 8個循環。Preferably, the conditions of the first round of linear amplification in the step (2) are: (1) reaction at 94 ° C for 3 min; (2) reaction at 20 ° C for 40 s; (3) reaction at 30 ° C for 40 s; (4) reaction at 40 ° C 30s; (5) 50 ° C reaction 30s; (6) 60 ° C reaction 30s; (7) 70 ° C reaction 4min; (8) 95 ° C reaction 20s; (9) 58 ° C reaction 10s; (10) Repeat step (2) Go to step (9) for 8 cycles.

優選地,所述步驟(2)中第二輪指數擴增的條件為: (1) 94℃反應30s; (2) 94℃反應20s; (3) 58℃反應30s; (4) 72℃反應3min; (5) 重複步驟(2)到步驟(4) 17個循環; (6) 將擴增後的產物在4℃保存。Preferably, the conditions of the second round of exponential amplification in the step (2) are: (1) reaction at 94 ° C for 30 s; (2) reaction at 94 ° C for 20 s; (3) reaction at 58 ° C for 30 s; (4) reaction at 72 ° C 3 min; (5) Repeat steps (2) through (4) for 17 cycles; (6) Store the amplified product at 4 °C.

本發明的一個突出特點是僅利用微量細胞(3-10個細胞),就一次性獲得全面的檢測或診斷結果,這樣有助於提高PGD檢測的準確度並減少成本。為此,本發明方法中藉由對於參數和步驟的組合進行了優化,從而對於來自微量細胞的核酸樣本,既保證了常規染色體(如CNV)檢測的有效性,又充分擴展了對於發生頻率較低或極低的單基因遺傳病和/或連鎖進行有效而準確分析的檢測能力。A salient feature of the present invention is that a single measurement or diagnosis is obtained in a single use using only a small number of cells (3-10 cells), which helps to improve the accuracy and cost of PGD detection. To this end, the method of the present invention is optimized by combining parameters and steps, thereby ensuring the validity of conventional chromosome (such as CNV) detection for nucleic acid samples from a small number of cells, and fully expanding the frequency of occurrence. Low or very low monogenic genetic diseases and/or linkages for detection of effective and accurate analysis.

在本發明中,對於來自微量細胞的核酸樣本,先進行MALBAC擴增。其中,所採用的MALBAC擴增引子可以是本領域已有的擴增引子,例如WO2012/166425所述的引子。然而,為了實現對微量細胞同時進行多項檢測的目的,最適化的MALBAC擴增參數如上所述。In the present invention, for nucleic acid samples derived from a small amount of cells, MALBAC amplification is first performed. Among them, the MALBAC amplification primer used may be an amplification primer existing in the art, such as the primer described in WO2012/166425. However, in order to achieve simultaneous detection of a plurality of cells for a plurality of times, the optimized MALBAC amplification parameters are as described above.

此外,為了有效地進行單基因遺傳病和/或連鎖分析,必須獲得高品質(即高保真且擴增覆蓋區域全)擴增產物。為此,在進行針對突變位點進行常規PCR擴增時,雖然可以採用高品質的萃取自微量細胞的核酸作為模版,但是優選採用部分MALBAC擴增產物作為模版。Furthermore, in order to efficiently perform a single gene genetic disease and/or linkage analysis, it is necessary to obtain a high quality (i.e., high fidelity and amplified coverage area) amplification product. For this reason, when performing conventional PCR amplification for a mutation site, although a high-quality nucleic acid extracted from a micro cell can be used as a template, it is preferred to use a partial MALBAC amplification product as a template.

典型地,在突變位點的PCR擴增反應中,模版選自下組:(i)細胞裂解樣本的核酸、(ii)步驟(2)的全基因組擴增反應的產物(即片段化的基因組DNA)、或(iii)由(i)和(ii)混合的形成的混合模版。較佳地,所述模版為步驟(2)的全基因組擴增反應的產物、或細胞裂解樣本的核酸與步驟(2)的全基因組擴增反應的產物的混合物[其中,(ii)與(i)的混合比為10-10000:1,較佳地100-1000:1]。Typically, in a PCR amplification reaction at a mutation site, the template is selected from the group consisting of: (i) a nucleic acid of a cell lysing sample, and (ii) a product of a whole genome amplification reaction of step (2) (ie, a fragmented genome) DNA), or (iii) a mixed stencil formed by mixing (i) and (ii). Preferably, the template is a product of the whole genome amplification reaction of step (2), or a mixture of the nucleic acid of the cell lysis sample and the product of the whole genome amplification reaction of step (2) [wherein (ii) and ( The mixing ratio of i) is 10-10000:1, preferably 100-1000:1].

此外,對於步驟(2)獲得的全基因組擴增產物和步驟(3)獲得的目的基因突變位點擴增產物,在本發明中藉由將其按一定比例混合,從而可以一次性地藉由後續的資料庫建立、定序和分析,獲得全面的檢測結果。這樣,就可以避免分別建立資料庫和/或分別檢測的麻煩。Further, the whole genome amplification product obtained in the step (2) and the amplification product of the target gene mutation site obtained in the step (3) can be used in the present invention by mixing them in a certain ratio. Subsequent database establishment, sequencing and analysis will obtain comprehensive test results. In this way, it is possible to avoid the trouble of separately establishing a database and/or separately detecting.

在本發明中,在所述混合步驟中,突變位點的PCR擴增產物與片段化的基因組DNA的混合比例為1:(1-100);較佳地為1:(10-100);更佳地1:(20-50)。In the present invention, in the mixing step, the mixing ratio of the PCR amplification product of the mutation site to the fragmented genomic DNA is 1: (1-100); preferably 1: (10-100); More preferably 1: (20-50).

本發明的主要優點包括: (1) 本發明方法僅需微量樣本(來自胚胎樣本的3-10個滋養層細胞),就可完成本領域原本需要大量樣本或需要分步進行的檢測。本發明利用MALBAC擴增技術全基因組擴增並結合目的基因突變位點擴增,然後對擴增產物的混合基因庫藉由一次高通量定序,對同一樣品,一步完成胚胎單基因遺傳病、染色體疾病及連鎖分析等綜合性檢測。 (2) 本發明方法可操作性強,降低成本,操作簡單快速,適用範圍廣,避免了使用多方法、多步驟分別進行單基因遺傳病突變位點、染色體疾病和連鎖分析的檢測。 (3) 本發明方法應用廣泛,不僅可用於胚胎植入前遺傳學診斷(PGD)檢測,確定胚胎是否攜帶致病基因位點和染色體複製數異常情況;同樣適用於反覆流產、高齡婦女胚胎的遺傳性篩選,簡便、快速、準確地完成針對微量細胞樣本的多項而全面的檢測。 (4) 本發明的檢測結果準確,可有效減少等位基因脫扣(ADO)等問題。The main advantages of the present invention include: (1) The method of the present invention requires only a small amount of samples (from 3 to 10 trophoblast cells from an embryo sample) to perform a test that would otherwise require a large number of samples in the art or that requires stepwise. The invention utilizes the MALBAC amplification technology to whole-genome amplification and combine with the mutation of the target gene to amplify, and then, through a high-throughput sequencing of the mixed gene library of the amplified product, completes the embryo single genetic disease in one step for the same sample. Comprehensive tests such as chromosomal diseases and linkage analysis. (2) The method of the invention has strong operability, low cost, simple and rapid operation, wide application range, and avoids detection of single gene genetic disease mutation sites, chromosomal diseases and linkage analysis by using multiple methods and multiple steps. (3) The method of the invention is widely used, and can be used not only for preimplantation genetic diagnosis (PGD) detection, but also for determining whether an embryo carries a disease-causing gene locus and an abnormality of chromosome replication; the same is applicable to abortion, adolescent women's embryos. Hereditary screening, simple, rapid and accurate completion of multiple and comprehensive detection of micro-cell samples. (4) The detection result of the invention is accurate, and the problem of allele tripping (ADO) can be effectively reduced.

下面結合具體實施例,進一步闡述本發明。應理解,這些實施例僅用於說明本發明而不用於限制本發明的範圍。下列實施例中未註明具體條件的實驗方法,通常按照常規條件,例如Sambrook等人,分子選殖:實驗室手冊(New York: Cold Spring Harbor Laboratory Press, 1989)中所述的條件,或按照製造廠商所建議的條件。除非另外說明,否則百分比和份數是重量百分比和重量份數。試劑和設備 The invention is further illustrated below in conjunction with specific embodiments. It is to be understood that the examples are not intended to limit the scope of the invention. The experimental methods in the following examples which do not specify the specific conditions are usually manufactured according to the conditions described in the conventional conditions, for example, Sambrook et al., Molecular Colonization: Laboratory Manual (New York: Cold Spring Harbor Laboratory Press, 1989), or according to the manufacturing conditions. The conditions recommended by the manufacturer. Unless otherwise stated, percentages and parts are by weight and parts by weight. Reagents and equipment

除非另外說明,否則所有的試劑和儀器均為市售產品,或按常規方法製備。Unless otherwise stated, all reagents and instruments are commercially available or prepared in a conventional manner.

MALBAC引子參見WO2012/166425的實施例1。實施例 實施例 1 The MALBAC primer is described in Example 1 of WO2012/166425. EXAMPLES Example 1

一體染色體顯性遺傳病模型,致病基因為A。本家系中,父本及父本的父親均為攜帶者,分別攜帶Mutation突變位點(c.233delC)。經分析,胚胎9、10、11和12為父源突變攜帶者的致病性胚胎,胚胎7和8不攜帶突變位點。 1 取胚胎單細胞: 1.1 精卵受精An integrated chromosomal genetic disease model with a disease-causing gene A. In this family, both the father and the father's father are carriers, carrying the Mutation mutation site (c.233delC). After analysis, embryos 9, 10, 11 and 12 are pathogenic embryos of the parent-derived mutation carrier, and embryos 7 and 8 do not carry the mutation site. 1 Take embryonic single cells: 1.1 sperm fertilization

對MII期的卵子顯微注射單精子(ICSI),將卵子放到(G-MOPS)操作液中,轉移到顯微操作儀平臺上,進行顯微操作。 1.2 胚胎體外培養For the MII phase of the egg microinjection of single sperm (ICSI), the egg was placed in the (G-MOPS) operating solution, transferred to the micromanipulator platform for micromanipulation. 1.2 Embryo culture in vitro

將受精胚在G1培養液(Vitrolife)或Gm培養液中(Global)中培養72小時左右至5-8細胞期,在透明帶上進行雷射打孔,轉移至已經平衡過的G2培養液(Vitrolife)或Gm培養液中(Global)繼續培養至囊胚,借助雷射將孵化的囊胚外滋養層細胞分離一小團,包含3-10個細胞。The fertilized embryos are cultured in G1 broth (Vitrolife) or Gm broth (Global) for about 72 hours to 5-8 cell phase, laser perforated on a zona pellucida, and transferred to the equilibrated G2 broth ( Vitrolife or Gm medium (Global) is continued to culture to the blastocyst, and the hatched blastocyst trophoblast cells are separated by a laser, containing 3-10 cells.

其中,胚胎細胞可以選自人類或者其他哺乳動物。 2. MALBAC擴增 2.1 MALBAC擴增 2.1.1 裂解細胞Among them, the embryonic cells may be selected from humans or other mammals. 2. MALBAC amplification 2.1 MALBAC amplification 2.1.1 Lysis cells

將胚胎細胞轉移至含有5 μl裂解液的PCR管中,放入PCR儀中進行裂解,50℃ 50分鐘。 2.1.2 失活蛋白酶Embryonic cells were transferred to a PCR tube containing 5 μl of lysate and placed in a PCR machine for lysis at 50 ° C for 50 minutes. 2.1.2 Inactivating protease

將PCR管放入PCR儀中失活,80℃ 10分鐘。 2.1.3 第一輪線性擴增The PCR tube was placed in a PCR machine and inactivated at 80 ° C for 10 minutes. 2.1.3 First round of linear amplification

在5 μl的細胞裂解樣品中加入30 μL預擴增混合液。Add 30 μL of pre-amplification mixture to 5 μl of cell lysate sample.

溫度流程:94℃ 3分鐘,8×(20℃---40秒,30℃---40秒,40℃---30秒,50℃---30秒,60℃---30秒,70℃---4分鐘,95℃---20秒,58℃---10秒)。 2.1.4 第二輪指數擴增Temperature flow: 94 ° C for 3 minutes, 8 × (20 ° C - -40 seconds, 30 ° C - -40 seconds, 40 ° C - 30 seconds, 50 ° C - 30 seconds, 60 ° C - 30 seconds , 70 ° C - 4 minutes, 95 ° C - -20 seconds, 58 ° C - 10 seconds). 2.1.4 Second round of exponential amplification

在35 μl的預擴增混合液中加入30 μl擴增混合液。Add 30 μl of the amplification mixture to the 35 μl preamplification mixture.

溫度流程:94℃---30秒,17×(94℃---20秒,58℃---30秒,72℃---3分鐘),4℃保存。 2.2 MALBAC擴增結果Temperature flow: 94 ° C - -30 sec, 17 × (94 ° C - -20 sec, 58 ° C - -30 sec, 72 ° C - - 3 min), stored at 4 ° C. 2.2 MALBAC amplification results

單細胞或等量DNA可藉由單細胞擴增反應,從每65 μL的反應體系中,獲得範圍在約300-2000 bp之間的擴增產物2-4 μg(即片段化的基因組DNA)。Single-cell or equivalent DNA can obtain 2-4 μg of amplified product ranging from about 300-2000 bp (ie, fragmented genomic DNA) from each 65 μL reaction system by single cell amplification. .

產物電泳如圖1。圖中M表示2K DNA ladder,E1、E2、E3、E4表示四個胚胎擴增產物,NC表示負對照組,由圖1可知,單細胞擴增終產物大小範圍在約250-2000bp。 2.3 MALBAC產物純化The product electrophoresis is shown in Figure 1. In the figure, M represents a 2K DNA ladder, E1, E2, E3, and E4 represent four embryo amplification products, and NC represents a negative control group. As can be seen from Fig. 1, the single cell amplification end product size ranges from about 250 to 2000 bp. 2.3 MALBAC product purification

利用商業純化試劑盒,進行MALBAC產物的純化。 3. 目的基因突變位點擴增 3.1 引子設計Purification of the MALBAC product was carried out using a commercial purification kit. 3. Target gene mutation site amplification 3.1 Primer design

利用Oligo、Primer等引子設計軟體或引子專業設計網站進行引子設計,設計時,引子長度20-25bp左右,擴增片段大小根據定序用的reads來進行調整設計,單端的不能超過reads長度,雙端的不能超過2倍長。 3.2 引子評估Use Oligo, Primer and other primer design software or primer professional design website for primer design. When designing, the length of the primer is about 20-25bp. The size of the amplified fragment is adjusted according to the order of the read. The single-ended can't exceed the length of the read. The end cannot be more than 2 times longer. 3.2 Introduction evaluation

引子設計完成後採用NCBI或UCSC等專業網站進行引子特異性評估(如錯配率、髮夾結構等等),評估合格後進行引子合成。 3.3 目的基因突變位點擴增After the primer design is completed, the NCBI or UCSC and other professional websites are used for the specific evaluation of the primers (such as mismatch rate, hairpin structure, etc.), and the primers are synthesized after the evaluation is passed. 3.3 Target gene mutation site amplification

將目的基因突變位點進行普通PCR擴增。The target gene mutation site was subjected to ordinary PCR amplification.

來自同一樣本的PCR產物和片段化的基因組DNA按照一定質量比進行混合,目的基因突變位點PCR產物:基因組DNA=1:(10-100)。 4. MALBAC產物及目的基因突變位點的資料庫建立The PCR product from the same sample and the fragmented genomic DNA were mixed at a certain mass ratio, and the PCR product of the target gene mutation site: genomic DNA = 1: (10-100). 4. Establishment of a database of MALBAC products and target gene mutation sites

利用商業資料庫建立試劑盒,進行MALBAC產物及目的基因突變位點的資料庫建立。 5. 高通量定序A database of MALBAC products and target gene mutation sites was established using a commercial database to establish a kit. 5. High-throughput sequencing

採用高通量定序平臺,對樣本進行定序。定序平臺不受特別限制,第二代定序平臺包括但不限於Illumina公司的GA、GAII、GAIIx、HiSeq1000/2000/2500/3000/4000、X Ten、X Five、NextSeq500/550、MiSeq,Applied Biosystems的SOLiD,Roche的454 FLX,Thermo Fisher Scientific(Life Technologies)的Ion Torrent、Ion PGM、Ion Proton I/II;第三代單分子定序平臺:包括但不限於Helicos BioSciences公司的HeliScope系統,Pacific Bioscience的SMRT系統,Oxford Nanopore Technologies的GridION、MinION。定序類型可為單端(Single End)定序或雙端(Paired End)定序。The sample is sequenced using a high-throughput sequencing platform. The sequencing platform is not particularly limited, and the second generation sequencing platform includes, but is not limited to, Illumina's GA, GAII, GAIIx, HiSeq1000/2000/2500/3000/4000, X Ten, X Five, NextSeq500/550, MiSeq, Applied Biosystems' SOLiD, Roche's 454 FLX, Thermo Fisher Scientific (Life Technologies)'s Ion Torrent, Ion PGM, Ion Proton I/II; third generation single molecule sequencing platform: including but not limited to Helicos BioSciences' HeliScope system, Pacific Bioscience's SMRT system, GridION, MinION of Oxford Nanopore Technologies. The sequencing type can be single-ended (Single End) or double-ended (Paired End) sequencing.

如果只進行單基因致病位點和染色體複製數變異(CNV)分析,每個樣本定序平均深度為只需0.1X;如果要同時獲得SNP連鎖資訊,每個樣本需定序平均深度為2X。 6. 數據分析 6.1 參考序列比對If only single-gene causative sites and chromosome replication number variation (CNV) analysis are performed, the average depth of each sample is only 0.1X; if SNP linkage information is to be obtained at the same time, each sample needs to be ordered to have an average depth of 2X. . 6. Data Analysis 6.1 Reference Sequence Alignment

將定序結果去掉轉接子及低品質數據,比對到參考基因組。參考基因組可為全基因組、任意染色體、染色體的一部分、基因。參考基因組通常選擇已被公認確定的序列可為NCBI或UCSC的基因序列為參考序列(如人類Hg19,小鼠mm10),或任意一條染色體。比對軟體可用任何一種免費或商業軟體,如BWA (Burrows-Wheeler Alignment tool)、SOAPaligner/soap2 (Short Oligonucleotide Analysis Package)、Bowtie/Bowtie2。 6.2 目的基因突變位點分析The sequencing result is removed and the low quality data is removed and compared to the reference genome. The reference genome can be a whole genome, an arbitrary chromosome, a part of a chromosome, or a gene. The reference genome usually selects a sequence whose expression can be determined to be NCBI or UCSC as a reference sequence (eg, human Hg19, mouse mm10), or any chromosome. The comparison software can be used with any kind of free or commercial software, such as BWA (Burrows-Wheeler Alignment tool), SOAPaligner/soap2 (Short Oligonucleotide Analysis Package), Bowtie/Bowtie2. 6.2 Analysis of target gene mutation sites

下機的數據,經品質控制後比對到參考序列(如人類Hg19,小鼠mm10)上,統計位點的等位基因(Allele)分佈及頻率,檢測目的基因位點突變資訊。 6.3 CNV分析The data of the lower machine was compared with the reference sequence (such as human Hg19, mouse mm10) after quality control, and the allele distribution and frequency of the statistical locus were detected to detect the mutation information of the target gene locus. 6.3 CNV analysis

下機的數據,經品質控制後比對到參考序列(如人類Hg19,小鼠mm10)上,序列按照一定視窗(bin)進行GC校正,以幾千例MALBAC擴增的單細胞數據為數據庫進行修正,最終檢測出染色體複製數,見CNV分析示例:表1和圖2。圖2中X座標表示染色體編號,Y座標表示染色體複製數目,箭頭所示染色體複製數異常染色體。 表1. CNV分析結果 6.4 連鎖分析The data of the lower machine was compared with the reference sequence (such as human Hg19, mouse mm10) after quality control, and the sequence was GC-corrected according to a certain bin (bin), and the database of thousands of MALBAC-expanded single cells was used as a database. Corrected, the number of chromosome copies was finally detected, see CNV analysis examples: Table 1 and Figure 2. In Fig. 2, the X coordinate indicates the chromosome number, the Y coordinate indicates the number of chromosome copies, and the arrow indicates the chromosome copy number abnormal chromosome. Table 1. CNV analysis results 6.4 Linkage analysis

本方法中以SNP-單套型為示例。 6.4.1 SNP calling和SNP過濾In this method, a SNP-single-sleeve type is taken as an example. 6.4.1 SNP calling and SNP filtering

選擇SNP區域,界定在目的基因上下游1M範圍內,雜合度較高(千份個體數據庫中頻率大於0.3);SNP間距越小,重組率越小。The SNP region was selected and defined in the range of 1M upstream and downstream of the target gene, and the heterozygosity was higher (the frequency in the individual database was more than 0.3); the smaller the SNP spacing, the smaller the recombination rate.

用軟體獲得目標區域的SNP,軟體包括但不限於samtools mpileup,GATK,FreeBayes,VarScan中的至少之一。The SNP of the target area is obtained by software, including but not limited to at least one of samtools mpileup, GATK, FreeBayes, and VarScan.

SNP過濾,等位基因雜合度差異最低10%,除去潛在錯誤的SNP。 6.4.2 篩選區分型SNP,並構建父本和母本SNP-單套型。SNP filtration, the lowest allergy heterozygosity difference is 10%, removing potentially erroneous SNPs. 6.4.2 Screen the distinguishing SNPs and construct the male and female SNP-single sets.

區分型SNP,即同一SNP位點,父本和母本的4個等位基因鹼基中,有1個鹼基不同於其他3個即可區分。如表2,A-SNP1,父本等位基因型為A/G,母本等位基因型為A/A,父本的父親個體等位基因型為A/A。說明,父本此位點鹼基A與致病位點連鎖在同一單套型中,並遺傳給後代;父本此位點鹼基G與正常等位基因連鎖在同一單套型中。這個SNP位點即為區分型SNP,其他位點以此類推,詳見表2示例。 表2. A基因區域部分SNP基因型 注釋: 1. 表2中“_”表示無法得到相應SNP資料(無數據覆蓋或深度較低); 2. 粗斜體表示攜帶的c.233delC致病突變; 3. 表2中父本單套型1表示致病突變所在單套型; 4. 表2中Mutation為致病突變。 6.4.3 分析胚胎SNP-單套型A distinguishing SNP, that is, the same SNP site, one of the four allele bases of the male parent and the female parent, one base is different from the other three. As shown in Table 2, A-SNP1, the paternal allele genotype is A/G, the maternal allele genotype is A/A, and the paternal father's individual allele genotype is A/A. It is indicated that the base A of the paternal locus and the causative locus are linked in the same single set and passed to the offspring; the base G of the paternal locus is linked to the normal allele in the same single set. This SNP site is a differentiated SNP, and the other sites are deduced by analogy. See Table 2 for an example. Table 2. Partial SNP genotypes in the A gene region Notes: 1. “_” in Table 2 indicates that the corresponding SNP data cannot be obtained (no data coverage or low depth); 2. The bold italic indicates the c.233delC pathogenic mutation carried; 3. The parental single set in Table 2 Type 1 indicates a single set of disease-causing mutations; 4. Mutation in Table 2 is a disease-causing mutation. 6.4.3 Analysis of embryonic SNPs - single set

胚胎SNP-單套型有2個,分別遺傳自父本和母本各1條,根據區分型SNP和孟德爾遺傳原理,判斷其SNP-單套型具體哪個是父源遺傳,哪個是母源遺傳。胚胎SNP-單套型分析中,區分型SNP最低為10個,若有3個以上SNP錯誤,此胚胎SNP-單套型分析錯誤。 6.4.4 結果分析There are 2 embryonic SNP-single-type types, which are inherited from the father and the female, respectively. According to the distinguishing SNP and Mendelian genetic principle, it is judged which SNP-single type is the parent source and which is the mother source. Genetic. In the SNP-single-type analysis of embryos, the minimum SNP of the differentiated SNP is 10. If there are more than 3 SNP errors, the SNP-single type analysis of this embryo is wrong. 6.4.4 Analysis of results

根據6.4.2確定父本和母本SNP-單套型,區分出與致病位點連鎖的單套型,將胚胎所在SNP的具體等位基因對比。例如,分析表2 A-SNP1,胚胎7、8的等位基因中均有鹼基G,說明這2個胚胎均不攜帶父源的致病基因位點;胚胎9、10、11、12的等位基因中均有鹼基A,說明這4個胚胎均攜帶父源的致病基因位點。根據孟德爾遺傳原理,判斷胚胎是否攜帶致病基因位點。 7. 方法可行性驗證 7.1 Sanger定序The male and female SNP-single sets were determined according to 6.4.2, and a single set of linkages to the pathogenic sites was distinguished, and the specific alleles of the SNPs of the embryos were compared. For example, in Table 2 A-SNP1, the alleles of embryos 7 and 8 all have a base G, indicating that the two embryos do not carry the pathogenic gene locus of the parent; embryos 9, 10, 11, and 12 The alleles have base A, indicating that these four embryos carry the pathogenic gene locus of the father. According to Mendelian genetic principle, it is judged whether the embryo carries a disease-causing gene locus. 7. Method feasibility verification 7.1 Sanger sequencing

對目的基因突變位點設計特異性引子,進行普通PCR擴增和Sanger定序,與高通量定序結果進行比較,用於驗證本專利中目的基因突變位點檢測方法的可行性,具體結果見圖3。圖3中黑色標註位置為目的基因突變位點;Sanger定序圖中除“A/T/C/G”四種鹼基外,另有“R/S/M/W/V”等簡單鹼基,分別表示由突變引起的A/G、G/C、A/C、G/A/C雜合峰。從圖3中可以得知,Sanger定序結果與表2中高通量定序結果一致。 7.2 STR位點連鎖分析Design specific primers for the target gene mutation site, perform common PCR amplification and Sanger sequencing, and compare with high-throughput sequencing results to verify the feasibility of the target gene mutation site detection method in this patent. See Figure 3. In Figure 3, the position marked in black is the target gene mutation site; in the Sanger sequence diagram, in addition to the four bases "A/T/C/G", there are other simple bases such as "R/S/M/W/V". The bases represent the A/G, G/C, A/C, and G/A/C heterozygous peaks caused by the mutation, respectively. As can be seen from Figure 3, the Sanger sequencing results are consistent with the high-throughput sequencing results in Table 2. 7.2 STR locus linkage analysis

對目的基因連鎖兩個STR位點(STR1和STR2)設計特異性引子,進行STR位點連鎖分析,用於驗證本發明中高通量定序SNP-單套型分析方法的可行性,具體見圖4。其中“■”表示患病男性,“●”表示患病女性,“□”表示正常男性,“○”表示正常女性。圖中數字“229”、“235”和“238”表示STR1位點等位基因型,“193”、“195”和“197”表示STR2位點等位基因型。從圖4中可以得知,STR位點連鎖分析結果與表2中SNP-單套型分析結果一致。The specific primers were designed for the two STR loci (STR1 and STR2) linked to the target gene, and the linkage analysis of STR loci was carried out to verify the feasibility of the high-throughput sequencing SNP-single-sleeve analysis method in the present invention. 4. Among them, “■” means a sick male, “●” means a sick female, “□” means a normal male, and “○” means a normal female. In the figure, the numbers "229", "235" and "238" indicate the STR1 locus allele, and "193", "195" and "197" indicate the STR2 locus allele. It can be seen from Fig. 4 that the results of the linkage analysis of the STR locus are consistent with the results of the SNP-single set analysis in Table 2.

綜上,本發明中所述一次定序完成單基因、染色體和連鎖分析的方法,可行性和準確性強。討論 In summary, the method for performing single gene, chromosome and linkage analysis in a single sequence in the present invention is feasible and accurate. discuss

在本發明中,針對微量細胞(如3-10個細胞或更少細胞),利用MALBAC擴增技術結合高通量定序,一步完成胚胎單基因遺傳病、染色體疾病及連鎖分析等綜合性檢測,可操作性強,降低成本,操作簡單快速,適用範圍廣,避免了使用多方法、多步驟分別進行單基因遺傳病突變位點、染色體疾病和連鎖分析的檢測。本發明所提供的方法為微量樣本(如胚胎樣本)的多種檢測提供有利條件,應用廣泛,不僅可用於PGD檢測,確定胚胎是否攜帶致病基因位點和染色體複製數異常情況;同樣適用於反覆流產、高齡婦女胚胎的遺傳性篩選,實現一個步驟完成單個樣本多項檢測。綜上,本發明操作簡單、工作週期短,實踐可行性強,尤其利用高通量定序方法,檢測和數據分析快速,有利於推廣和應用。In the present invention, for micro-cells (such as 3-10 cells or less), using MALBAC amplification technology combined with high-throughput sequencing, one-step comprehensive detection of embryonic monogenic genetic diseases, chromosomal diseases and linkage analysis The utility model has the advantages of strong operability, low cost, simple and rapid operation, wide application range, and avoids the detection of single gene genetic disease mutation sites, chromosomal diseases and linkage analysis by using multiple methods and multiple steps. The method provided by the invention provides favorable conditions for various detections of a small sample (such as an embryo sample), and is widely used, and can be used not only for PGD detection, but also whether the embryo carries a disease-causing gene locus and an abnormality of chromosome copy number; the same applies to repeated The genetic screening of abortion, older women's embryos, to achieve a single step to complete a single sample multiple tests. In summary, the present invention is simple in operation, short in work cycle, and has strong practical feasibility, especially using a high-throughput sequencing method, which is fast in detection and data analysis, and is advantageous for promotion and application.

在本發明提及的所有文獻都在本申請中引用作為參考,就如同每一篇文獻被單獨引用作為參考那樣。此外應理解,在閱讀了本發明的上述講授內容之後,本領域技術人員可以對本發明作各種改動或修改,這些等價形式同樣落於本申請所附申請專利範圍所限定的範圍。All documents mentioned in the present application are hereby incorporated by reference in their entirety in their entireties in the the the the the the the the In addition, it should be understood that various modifications and changes may be made by those skilled in the art without departing from the scope of the invention.

圖1為本發明MALBAC擴增結果的產物電泳圖。Figure 1 is a product electrophoresis pattern of the results of MALBAC amplification of the present invention.

圖2為本發明實施例1中胚胎全基因組定序的CNV分析圖。2 is a CNV analysis diagram of whole genome sequencing of embryos in Example 1 of the present invention.

圖3為本發明實施例1中目的基因突變位點的Sanger定序圖。Figure 3 is a Sanger sequence diagram of the mutation site of the target gene in Example 1 of the present invention.

圖4為本發明實施例1中STR位點連鎖分析圖。4 is a diagram showing linkage analysis of STR sites in Example 1 of the present invention.

Claims (11)

一種同時完成基因位點、染色體及連鎖分析的方法,其特徵在於,包括以下各步驟:(1)胚胎細胞樣本的獲得:藉由單精子注射獲得受精卵,培養至囊胚期在外滋養層細胞分離獲得包含3-10個細胞的樣本;(2)全基因組擴增:在樣本中加入裂解液放入PCR儀中進行裂解、失活蛋白酶,對獲得的細胞裂解樣本進行全基因組擴增,獲得片段化的基因組DNA;(3)目的基因突變位點擴增:在完成引子設計、引子評估之後,將目的基因突變位點進行普通PCR擴增獲得PCR產物,將同一樣本的PCR產物與步驟(2)中獲得的片段化的基因組DNA按照1:(1-100)的質量比混合;(4)全基因組擴增產物與目的基因突變位點進行資料庫建立;(5)採用高通量定序平臺,對樣本進行定序;(6)數據分析(6.1)將步驟(5)獲得的定序結果去掉轉接子以及低品質數據,採用比對軟體比對到參考基因組;(6.2)下機的數據經品質控制後比對到參考序列上,統計位點的等位基因的分佈及其頻率,檢測目的基因位點突變資訊;(6.3)下機的數據經品質控制後比對到參考序列上,序列按照一定視窗進行GC校正,以幾千例全基因組擴增的單細胞數據為數據庫進行修正,最終檢測出染色體複製數變異(CNV);(6.4)採用SNP-單套型進行連鎖分析。A method for simultaneously performing gene locus, chromosome and linkage analysis, comprising the following steps: (1) obtaining embryonic cell samples: obtaining fertilized eggs by single sperm injection, and culturing to blastocyst stage in outer trophoblast cells Separate and obtain samples containing 3-10 cells; (2) Whole genome amplification: Add lysate into the sample to be lysed in the PCR instrument, inactivate the protease, and perform whole genome amplification on the obtained cell lysate sample. Fragmented genomic DNA; (3) Amplification of target gene mutation site: After completion of primer design and primer evaluation, the target gene mutation site is subjected to ordinary PCR amplification to obtain a PCR product, and the PCR product and step of the same sample are 2) The fragmented genomic DNA obtained in the mixture is mixed according to the mass ratio of 1:(1-100); (4) the whole genome amplification product and the target gene mutation site are used for database establishment; (5) high-throughput determination is adopted. Sequence platform, sequence the sample; (6) data analysis (6.1) remove the transfer result obtained from step (5) and the low quality data, compare the reference software to the reference genome; (6.2) Machine After the quality control, the data is compared to the reference sequence, the distribution of the alleles of the statistical locus and its frequency, and the mutation information of the target gene locus is detected; (6.3) the data of the lower machine is compared with the reference sequence by quality control. The sequence was GC-corrected according to a certain window, and the single-cell data of thousands of whole-genome amplifications were corrected for the database, and the chromosome replication number variation (CNV) was finally detected; (6.4) The SNP-single-type type was used for linkage analysis. 如請求項1的方法,其特徵在於,所述步驟(2)的全基因組擴增由多重黏合與成環擴增循環(MALBAC)實現,包括第一輪線性擴增和第二輪指數擴增。The method of claim 1, wherein the whole genome amplification of the step (2) is achieved by a multiple binding and loop-forming amplification cycle (MALBAC), including a first round of linear amplification and a second round of exponential amplification. . 如請求項2的方法,其特徵在於,所述步驟(2)中第一輪線性擴增的條件為:(1)90-98℃反應90s-5min;(2)15-25℃反應30s-1min;(3)25-35℃反應30s-1min;(4)35-45℃反應20s-1min;(5)45-55℃反應20s-1min;(6)55-65℃反應20s-1min;(7)65-85℃反應2min-6min;(8)90-98℃反應10-40s;(9)45-65℃反應5-20s;(10)重複步驟(2)到步驟(9)5至20個循環。The method of claim 2, wherein the first round of linear amplification in the step (2) is: (1) 90-98 ° C reaction for 90 s - 5 min; (2) 15 - 25 ° C reaction for 30 s - 1min; (3) 25-35 ° C reaction 30s-1min; (4) 35-45 ° C reaction 20s-1min; (5) 45-55 ° C reaction 20s-1min; (6) 55-65 ° C reaction 20s-1min; (7) 65-85 ° C reaction 2min-6min; (8) 90-98 ° C reaction 10-40s; (9) 45-65 ° C reaction 5-20s; (10) repeat steps (2) to step (9) 5 Up to 20 cycles. 如請求項2的方法,其特徵在於,所述步驟(2)中第二輪指數擴增的條件為:(1)90-98℃反應10s-50s;(2)90-98℃反應10s-40s;(3)45-65℃反應20s-45s;(4)65-80℃反應75s-5min;(5)重複步驟(2)到步驟(4)10至30個循環;(6)將擴增後的產物在0-5℃保存。The method of claim 2, wherein the second round of exponential amplification in the step (2) is: (1) 90-98 ° C reaction for 10 s - 50 s; (2) 90-98 ° C reaction for 10 s - 40s; (3) 45-65 ° C reaction 20s-45s; (4) 65-80 ° C reaction 75s-5min; (5) repeat steps (2) to step (4) 10 to 30 cycles; (6) will expand The added product was stored at 0-5 °C. 如請求項1的方法,其特徵在於,所述步驟(3)中,普通PCR擴增的模版選自下組:(i)細胞裂解樣本的核酸、(ii)步驟(2)的全基因組擴增反應的產物、或(iii)由(i)和(ii)混合的形成的混合模版。The method of claim 1, wherein in the step (3), the template for ordinary PCR amplification is selected from the group consisting of: (i) nucleic acid of the cell lysing sample, and (ii) whole genome expansion of the step (2) The product of the increased reaction, or (iii) a mixed stencil formed by mixing (i) and (ii). 如請求項2至5任一項所述的方法,其特徵在於,MALBAC擴增後的產物在300-2000bp之間或250-2000bp之間。The method of any one of claims 2 to 5, wherein the product after MALBAC amplification is between 300-2000 bp or between 250-2000 bp. 如請求項1的方法,其特徵在於,所述步驟(3)中的引子設計在設計時,引子長度為20-25bp,擴增片段大小根據定序用的reads來進行調整設計,單端的不能超過reads長度,雙端的不能超過2倍長。The method of claim 1, wherein the primer design in the step (3) is designed to have a length of 20-25 bp, and the size of the amplified segment is adjusted according to the order of the read, and the single-ended cannot More than the length of the read, the double end can not exceed 2 times longer. 如請求項1的方法,其特徵在於,所述步驟(3)中PCR產物與片段化的基因組DNA的混合比例為1:(10-100)。The method of claim 1, wherein the mixing ratio of the PCR product to the fragmented genomic DNA in the step (3) is 1: (10-100). 如請求項1的方法,其特徵在於,所述步驟(5)中的高通量定序如果只進行單基因致病位點和染色體複製數變異(CNV)分析,每個樣本定序平均深度最低為基因組的0.1倍;如果要同時獲得SNP連鎖資訊,每個樣本定序平均深度最低為基因組的2倍。The method of claim 1, characterized in that the high-throughput sequencing in the step (5) is performed only on a single gene pathogenic site and a chromosome copy number variation (CNV) analysis, and the average depth of each sample is determined. The lowest is 0.1 times of the genome; if you want to obtain SNP linkage information at the same time, the average depth of each sample is at least 2 times that of the genome. 如請求項1的方法,其特徵在於,所述步驟(6.4)採用SNP-單套型進行連鎖分析具體操作如下:(1)選擇SNP區域,界定在目的基因上下游1M範圍內;(2)用軟體獲得目標區域的SNP,所述軟體選自:samtools mpileup,GATK,FreeBayes,VarScan;(3)SNP過濾:等位基因雜合度差異最低為10%,除去潛在錯誤的SNP;(4)篩選區分型SNP,並構建父本和母本SNP-單套型:同一SNP位點,在父本和母本的4個等位基因鹼基中,有1個鹼基不同於其他3個即可區分;(5)分析SNP-單套型:胚胎SNP-單套型有2個,分別遺傳自父本和母本各1條,根據區分型SNP和孟德爾遺傳原理,判斷其SNP-單套型具體哪個是父源遺傳,哪個是母源遺傳;(6)結果分析:根據步驟(4)確定父本和母本SNP-單套型,區分出與致病位點連鎖的單套型,將胚胎所在SNP的具體等位基因對比,根據孟德爾遺傳原理,判斷胚胎是否攜帶致病基因位點。The method of claim 1, wherein the step (6.4) performs a linkage analysis using a SNP-single set of specific operations as follows: (1) selecting a SNP region, defined within a range of 1 M upstream and downstream of the target gene; (2) The SNP of the target region is obtained by software, which is selected from the group consisting of: samtools mpileup, GATK, FreeBayes, VarScan; (3) SNP filtration: the lowest allergy heterozygosity difference is 10%, the potential error SNP is removed; (4) screening Differentiate SNPs and construct a male and female SNP-single type: the same SNP locus. In the four allele bases of the male and female parent, one base is different from the other three. (5) Analysis of SNP-single-set type: There are 2 embryonic SNP-single-type types, which are inherited from the father and the female, respectively. According to the distinguishing SNP and Mendelian genetic principle, the SNP-single set is judged. Which type is parental inheritance and which is maternal inheritance; (6) Analysis of results: According to step (4), the male and female SNP-single sets are determined, and a single set that is linked to the pathogenic site is distinguished. Compare the specific alleles of the SNPs of the embryos, and judge whether the embryos carry the disease-causing gene according to the Mendelian genetic principle. point. 如請求項10的方法,其特徵在於,步驟(5)的胚胎SNP-單套型分析中,區分型SNP最低為10個,若有3個以上SNP錯誤,此胚胎SNP-單套型數據視為數據量不足,難以進行分析。The method of claim 10, characterized in that in the embryonic SNP-single-type analysis of step (5), the minimum SNP is 10, and if there are more than 3 SNP errors, the embryo SNP-single data is regarded as For the lack of data, it is difficult to analyze.
TW105137959A 2015-11-18 2016-11-18 A method for simultaneous performing gene locus, chromosome and linkage analysis TWI640636B (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201510794535.5A CN105543339B (en) 2015-11-18 2015-11-18 Method for simultaneously completing gene locus, chromosome and linkage analysis
??201510794535.5 2015-11-18

Publications (2)

Publication Number Publication Date
TW201728758A TW201728758A (en) 2017-08-16
TWI640636B true TWI640636B (en) 2018-11-11

Family

ID=55822896

Family Applications (1)

Application Number Title Priority Date Filing Date
TW105137959A TWI640636B (en) 2015-11-18 2016-11-18 A method for simultaneous performing gene locus, chromosome and linkage analysis

Country Status (3)

Country Link
CN (1) CN105543339B (en)
TW (1) TWI640636B (en)
WO (1) WO2017084624A1 (en)

Families Citing this family (29)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105543339B (en) * 2015-11-18 2021-07-16 上海序康医疗科技有限公司 Method for simultaneously completing gene locus, chromosome and linkage analysis
WO2018148903A1 (en) * 2017-02-16 2018-08-23 上海亿康医学检验所有限公司 Auxiliary diagnosis method for urinary system tumours
CN106834490B (en) * 2017-03-02 2021-01-22 上海亿康医学检验所有限公司 Method for identifying balanced translocation breakpoint and balanced translocation carrying state of embryo
CN107247863A (en) * 2017-04-18 2017-10-13 北京水母科技有限公司 Integrate high flux Genotyping and the biomedical Ontology integration method of clinical medicine information
CN106929595B (en) * 2017-04-28 2021-03-30 上海亿康医学检验所有限公司 System and method for identifying balanced translocation carrying state of embryo
CN108165610A (en) * 2017-12-22 2018-06-15 上海美吉生物医药科技有限公司 A kind of unicellular whole genome amplification kit
CN108410979A (en) * 2018-05-25 2018-08-17 安徽华大医学检验所有限公司 Chip that is a kind of while detecting single-gene disorder and chromosomal disorders and its application
CN110628891B (en) * 2018-06-25 2024-01-09 深圳华大智造科技股份有限公司 Method for screening embryo genetic abnormality
WO2020047694A1 (en) * 2018-09-03 2020-03-12 深圳华大智造科技有限公司 Method and device for determining genetic status of new mutation in embryo
CN110872618B (en) * 2018-09-04 2022-04-19 北京果壳生物科技有限公司 Method for judging sex of detected sample based on Illumina human whole genome SNP chip data and application
CN110305943A (en) * 2019-05-23 2019-10-08 北京新抗元生物技术有限公司 It is a kind of for assessing the internal reference system and its method for building up of gene sequencing data noise
CN110093406A (en) * 2019-05-27 2019-08-06 新疆农业大学 A kind of argali and its filial generation gene research method
CN110241262A (en) * 2019-06-26 2019-09-17 云南省第一人民医院 A kind of cleavage stage embryo's Single cell analysis HBV viral genome vertical transmission method based on PGD
CN110305968A (en) * 2019-07-14 2019-10-08 西安交通大学口腔医院 A kind of composite amplification system in the micro- haplotype domain SNP-DIP based on NGS parting for medical jurisprudence individual identification
CN111276189B (en) * 2020-02-26 2020-12-29 广州市金域转化医学研究院有限公司 Chromosome balance translocation detection and analysis system based on NGS and application thereof
CN111440857B (en) * 2020-03-11 2023-03-21 阿吉安(福州)基因医学检验实验室有限公司 Method for non-invasive genetic detection of embryo before implantation
CN113436680B (en) * 2020-05-22 2022-03-25 复旦大学附属妇产科医院 Method for simultaneously identifying chromosome structural abnormality and carrier state of pathogenic gene of embryo
CN111816248B (en) * 2020-05-22 2023-12-01 武汉菲沙基因信息有限公司 Pacbio surassemblies and Hi-C reads-based whole genome typing method
CN113035271B (en) * 2020-12-22 2024-01-23 苏州奥根诊断科技有限公司 Multiplex PCR primer design, primer set and detection method for detecting heterogenous cfDNA and application of multiplex PCR primer design
CN114220482A (en) * 2021-12-02 2022-03-22 苏州贝康医疗器械有限公司 Method, device, equipment and storage medium for carrying out chromosome microdeletion analysis across breakpoint
CN114507728B (en) * 2022-03-03 2024-03-22 苏州贝康医疗器械有限公司 Capturing primer and application thereof
CN114717337B (en) * 2022-05-18 2023-10-13 中国计量大学 Cell cross-contamination detection method and application thereof
CN114898803B (en) * 2022-05-27 2023-03-24 圣湘生物科技股份有限公司 Mutation detection analysis method, device, readable medium and apparatus
CN114836536B (en) * 2022-07-04 2022-09-30 北京大学第三医院(北京大学第三临床医学院) Method and system for screening single-cell high-amplification region based on MALBAC
CN115064210B (en) * 2022-07-27 2022-11-18 北京大学第三医院(北京大学第三临床医学院) Method for identifying chromosome cross-exchange positions in diploid embryonic cells and application
CN116153395B (en) * 2023-04-17 2023-06-30 北京大学第三医院(北京大学第三临床医学院) Method and system for detecting single-cell small fragment chromosome copy number variation
CN117230175B (en) * 2023-06-21 2024-05-28 广州序源医学科技有限公司 Embryo preimplantation genetics detection method based on third generation sequencing
CN117116344B (en) * 2023-10-25 2024-07-19 北京大学第三医院(北京大学第三临床医学院) Detection system and method for single-cell level PMP22 repeated variation
CN117721222B (en) * 2024-02-07 2024-05-10 北京大学第三医院(北京大学第三临床医学院) Method for predicting embryo implantation by single cell transcriptome and application

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104711362A (en) * 2015-03-27 2015-06-17 苏州贝康医疗器械有限公司 Method for detecting embryo chromosome abnormalities by using blastula-stage embryo cells

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2714938B1 (en) * 2011-05-27 2017-11-15 President and Fellows of Harvard College Methods of amplifying whole genome of a single cell
CN104004817A (en) * 2013-02-22 2014-08-27 哈佛大学 Selection of embryo of test tube baby through sequencing by single cell genome of polar body or embryo
CN104745718B (en) * 2015-04-23 2018-02-16 北京中仪康卫医疗器械有限公司 A kind of method for detecting human embryos microdeletion and micro- repetition
CN105543339B (en) * 2015-11-18 2021-07-16 上海序康医疗科技有限公司 Method for simultaneously completing gene locus, chromosome and linkage analysis

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104711362A (en) * 2015-03-27 2015-06-17 苏州贝康医疗器械有限公司 Method for detecting embryo chromosome abnormalities by using blastula-stage embryo cells

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
Huang L et al,"Single-Cell Whole-Genome Amplification and Sequencing: Methodology and Applications.", Annu Rev Genomics Hum Genet. 2015;16:79-102, Epub 2015 Jun 11. *
Konstantinidis M et al,"Live births following Karyomapping of human blastocysts: experience from clinical application of the method.",Reprod Biomed Online. 2015 Sep;31(3):394-403. *
Rechitsky S et al,"First systematic experience of preimplantation genetic diagnosis for single-gene disorders, and/or preimplantation human leukocyte antigen typing, combined with 24-chromosome aneuploidy testing.", Fertil Steril. 2015 Feb;103(2):503-12. *
Yan L et al,"Live births after simultaneous avoidance of monogenic diseases and chromosome abnormality by next-generation sequencing with linkage analyses.", Proc Natl Acad Sci U S A. 2015 Dec 29;112(52):15964-9. *

Also Published As

Publication number Publication date
WO2017084624A1 (en) 2017-05-26
TW201728758A (en) 2017-08-16
CN105543339B (en) 2021-07-16
CN105543339A (en) 2016-05-04

Similar Documents

Publication Publication Date Title
TWI640636B (en) A method for simultaneous performing gene locus, chromosome and linkage analysis
CN105441432B (en) Composition and its purposes in sequencing and variation detection
US20140206552A1 (en) Methods for preimplantation genetic diagnosis by sequencing
WO2013052557A2 (en) Methods for preimplantation genetic diagnosis by sequencing
WO2014127749A1 (en) Application of single cell genome sequencing in preimplantation genetic diagnosis
CN104136628A (en) Method for detecting micro-deletion and micro-repetition of chromosome
AU2018327337A1 (en) Systems and methods for non-invasive preimplantation genetic diagnosis
WO2015042980A1 (en) Method, system, and computer-readable medium for determining snp information in a predetermined chromosomal region
Liu et al. Emerging technologies for prenatal diagnosis: The application of whole genome and RNA sequencing
Deleye et al. Massively parallel sequencing of micro-manipulated cells targeting a comprehensive panel of disease-causing genes: A comparative evaluation of upstream whole-genome amplification methods
JP7333838B2 (en) Systems, computer programs and methods for determining genetic patterns in embryos
Xing et al. Long‐read Oxford nanopore sequencing reveals a de novo case of complex chromosomal rearrangement involving chromosomes 2, 7, and 13
CN106929595B (en) System and method for identifying balanced translocation carrying state of embryo
JP7446343B2 (en) Systems, computer programs and methods for determining genome ploidy
JP2014530629A (en) Method for detecting chromosomal microdeletions and microduplications
Bakhtiar et al. Omics technologies for clinical diagnosis and gene therapy: medical applications in human genetics
Vendrell et al. New protocol based on massive parallel sequencing for aneuploidy screening of preimplantation human embryos
CN117925820B (en) Method for detecting variation before embryo implantation
Tobler et al. The use of next-generation sequencing (NGS) for preimplantation genetic screening (PGS) and diagnosis (PGD)
WO2017124214A1 (en) Method for detecting chromosome robertsonian translocation
Yilmaz Structural Variants in Health and Disease
CN116814772A (en) Primer combination, kit and method for detecting FANCA gene mutation
Farslow Gene duplications during experimental evolution of Caenorhabditis elegans: duplication rates and evolutionary responses

Legal Events

Date Code Title Description
MM4A Annulment or lapse of patent due to non-payment of fees