KR101632881B1

KR101632881B1 - Sequencing method of genomic DNA end sequence using NGS

Info

Publication number: KR101632881B1
Application number: KR1020150166022A
Authority: KR
Inventors: 박홍석; 김지선; 김민영; 나윤정
Original assignee: 주식회사 지앤시바이오
Priority date: 2015-11-26
Filing date: 2015-11-26
Publication date: 2016-06-23
Also published as: WO2017090904A1

Abstract

The present invention relates to a method for analyzing a base sequence of dielectric DNA by using NGS after amplifying only a large quantity of terminal base sequences of dielectric DNA by using a primer which is specific to a dielectric DNA terminal base sequence. Terminal base sequence information of dielectric DNA inserted in a vector may be rapidly, conveniently, and cheaply mass-produced. A sequence of contigs or scaffolds may be easily determined by using a large quantity of analyzed terminal base sequence information. An ultra-large scaffold having tens of mega size may be constructed, and a precise dielectric physical map may be written. A position of a clone corresponding to analyzed terminal base sequence information may be clarified, so the method according to the present invention may be used in enhancing the quality of dielectric analysis such as sequence analysis gap connection and sequence accuracy verification.

Description

[0001] The present invention relates to a method for mass analysis of genomic DNA end sequence using a next-generation sequencing method (Sequencing method of genomic DNA end sequence using NGS)

본 발명은 NGS를 이용한 유전체 DNA 서열분석방법에 관한 것으로, 보다 구체적으로는 유전체 DNA 말단 염기서열에 특이적인 프라이머를 사용하여 유전체 DNA의 말단 염기서열만을 대량 증폭시킨 후 NGS를 이용하여 유전체 DNA의 말단 염기서열을 분석하는 방법에 관한 것이다.The present invention relates to a method for analyzing a genomic DNA sequence using NGS. More specifically, a primer specific to a DNA base sequence is amplified to amplify a large amount of a base sequence of a genomic DNA, To a method for analyzing a nucleotide sequence.

현재 주류를 이루고 있는 유전체 해독 방식인 차세대 염기서열분석(Next Generation Sequencing, NGS) 기술은 세포로부터 추출한 유전체 DNA(genomic DNA)를 단편화하여 단편화된 DNA 조각의 염기서열을 단시간에 대량으로 해독하고, 컴퓨터의 알고리즘을 이용하여 해독된 수십억 개의 DNA 단편 염기서열을 재조합하여 원래의 유전체 구조를 완성하는 방법이다. 그러나 이 방법은 NGS 기기 자체의 DNA 해독 오류와 컴퓨터 어셈블링(assembling) 프로그램 오류로 인하여 생산된 재조합 유전체 정보(scaffold, contig)의 염기배열 및 유전체의 구조가 전통적인 방법에 비하여 부정확하다는 단점을 가지고 있다. The Next Generation Sequencing (NGS) technology, which is currently the mainstream genome sequencing method, is a technique that genomic DNA extracted from a cell is fragmented to rapidly decode a fragment of the fragmented DNA fragment in a short time, The algorithm is used to reconstruct the decoded DNA fragments of several billion DNA fragments to complete the original genome structure. However, this method is disadvantageous in that the nucleotide sequence of the recombinant genome information (scaffold, contig) and the structure of the genome produced by the NGS device itself due to the DNA decoding error and the computer assembling program error are inferior to the conventional method .

이러한 NGS 기술의 단점을 보완하는 기술은 NGS 기기의 종류에 따라서 다양한데, 대부분의 NGS 기종(Illumina사, Roche사, LifeTechnology사)은 유전체 DNA를 3kb, 5kb, 10kb 등의 크기로 크게 조각내어 양쪽 말단을 해독(mate-pair sequencing)한 후, 이 염기배열 정보와 단편의 크기를 표지자(bridge sequence) 정보로 이용하여 DNA를 재조합하는 원리를 사용하고 있다. 한편, 퍼시픽 바이오(Pacific Bio)사는 10kb~20kb의 비교적 긴 DNA 단편을 한 번에 해독하여 이를 데이터 어셈블링(data assembling)에 이용하고 있다. Most of the NGS models (Illumina, Roche, LifeTechnology) sculpt the genomic DNA to a size of 3kb, 5kb, 10kb and so on, (Mate-pair sequencing), and then uses the nucleotide sequence information and the size of the fragment as a bridge sequence information to recombine the DNA. Meanwhile, Pacific Bio decodes relatively long DNA fragments of 10 kb to 20 kb at a time and uses them for data assembling.

브릿지 클론 서열(Bridge clone sequence)을 이용하여 콘틱(contig)과 스캐폴드(scaffold)를 만든다는 점은 전통적으로 인간게놈프로젝트에서 사용한 '유전체 물리지도(genome physical map)' 작성기술과 유사하다. 유전체 물리지도는 서열분석(sequencing)된 콘틱과 스캐폴드의 배열 순서를 맞추는 방법으로서, '서열 갭(sequence gap) 연결', '서열정확도' 검증, 초대형 스캐폴드 구축 등 유전체 구조의 정확도를 향상시키는데 매우 중요한 기술로 활용되고 있다. The use of bridge clone sequences to create contigs and scaffolds is similar to the 'genome physical map' technique traditionally used in the human genome project. Dielectric physical maps are a method of aligning sequences of sequenced conics and scaffolds. They improve the accuracy of dielectric structures, such as 'sequence gap connection', 'sequence accuracy' verification, and large scale scaffold construction It is used as a very important technology.

유전체 물리지도 작성에 사용되는 핵심기술은 대장균 속에서 자가 증식 가능한 플라스미드, 즉 벡터(vector)를 이용하여 연구대상의 유전체 DNA를 증식시켜 삽입된 유전체 DNA 말단의 정보를 이용하는 것이다. 일반적으로 벡터의 종류는 벡터 내로 삽입 가능한 DNA의 크기에 따라서 BAC(Bacteria Artificial Chromosome), Fosme, Cosmid 등으로 나누어진다.The core technology used in genome mapping is to utilize the information of inserted genomic DNA ends by propagating the genomic DNA of the study using a self-propagating plasmid, a vector, in E. coli. In general, the types of vectors are divided into BAC (Bacterial Artificial Chromosome), Fosme, Cosmid and the like according to the size of insertable DNA in the vector.

예를 들어 BAC 벡터의 경우, 보통 100,000~200,000 염기 크기의 유전체 DNA 삽입이 가능하다. 도 1에 나타낸 바와 같이, 유전체 DNA가 삽입된 BAC 클론은 BAC 벡터와 이에 삽입되는 유전체 DNA의 두 개의 부위로 구성되는데, BAC 벡터에 내재된 벡터 특이의 염기서열(T7 프로모터와 SP6 프로모터)은 삽입된 유전체 DNA의 양쪽 말단부위에 근접하여 위치하게 된다. 따라서 삽입된 유전체 DNA의 양쪽 말단부위의 염기서열은 이들 T7 프로모터와 SP6 프로모터를 서열분석 프라이머(sequencing primer)로 이용하여 해독이 가능하다. 삽입된 유전체 DNA의 양쪽 말단 염기서열의 방향은 서열분석에 사용한 프로모터 종류에 따라서 구별되는데, T7 프로모터로 해독된 유전체 DNA 말단 염기서열은 'T7-BAC 말단서열(T7-BAC end sequence, T7-BES)'이라고 부르고, SP6 프로모터로 해독된 유전체 DNA 말단 염기서열은 'SP6-BAC 말단서열(SP6-BES)'이라고 부른다. 이들 'T7-BES', 'SP6-BES'는 '유전체 물리지도' 작성에 필수적인 정보로서 전통적으로 인간게놈프로젝트를 비롯하여 많은 유전체 해독 연구에서 유전체 구조 및 서열분석 오류 검증을 위해 사용되었다. For example, in the case of BAC vectors, genomic DNA insertion of 100,000 to 200,000 bases is usually possible. As shown in FIG. 1, the BAC clone into which the genomic DNA is inserted is composed of two regions, a BAC vector and a genomic DNA inserted therein. The vector-specific base sequence (T7 promoter and SP6 promoter) Lt; RTI ID = 0.0 > DNA < / RTI > Therefore, it is possible to decode the base sequence of both ends of the inserted genomic DNA by using these T7 promoter and SP6 promoter as a sequencing primer. The direction of both terminal sequences of the inserted genomic DNA is distinguished according to the type of the promoter used for the sequence analysis. The terminus of the genomic DNA decoded by the T7 promoter is referred to as' T7-BAC end sequence (T7-BAC end sequence, T7-BES ), And the end of the genomic DNA terminal sequence decoded by the SP6 promoter is called 'SP6-BAC end sequence (SP6-BES)'. These 'T7-BES' and 'SP6-BES' are indispensable information for 'Genome Physics Map' and have traditionally been used to verify genome structure and sequence analysis errors in many genome detoxification studies including the Human Genome Project.

BAC 클론에 삽입된 유전체 DNA의 BES를 해독하는 방법은 전통적인 1세대 유전자분석(capillary sequencing) 방식과 NGS방식이 있다. 1세대 유전자분석은 BAC 클론을 한 개씩 나누어서 서열분석하는 방식으로서, 서열분석 후 해독한 BES 정보와 일치하는 BAC 클론의 위치 파악이 가능하기 때문에 후속 연구에서 재료로 사용할 수 있다는 장점이 있지만, 시간이 오래 걸리고 데이터 생산 비용이 매우 비싸다는 단점이 있다. 최근에는 NGS 기기를 활용하여 BES 염기해독이 가능한데, 이는 수 만개의 BAC 클론을 하나로 모아서 NGS 서열분석 라이브러리(NGS sequencing library, NXSeq library)를 만들어서 대량으로 해독하는 것이다. 그러나 이 방식은 대량 해독은 가능하지만, 최종적으로 얻어지는 'T7-BES', 'SP6-BES'량이 적기 때문에 효율이 낮을 뿐만 아니라, BES 서열분석의 최대 장점인 해독 후 BAC 클론의 염기서열 정보에 해당하는 BAC 클론의 위치 파악이 불가능하기 때문에 서열분석한 BAC 클론을 후속연구에 활용할 수가 없다는 단점이 있다.Methods for decoding BES of genomic DNA inserted in BAC clones include the traditional first-generation capillary sequencing method and the NGS method. First-generation gene analysis is a method of sequencing BAC clones one by one. Since BAC clones can be located in accordance with the decoded BES information, it is possible to use them as a material in subsequent studies. It takes a long time and the data production cost is very expensive. In recent years, it is possible to decode BES base using NGS device, which collects tens of thousands of BAC clones into NGS sequencing library (NXSeq library) to decode in large quantities. However, this method is not only low in efficiency because it is possible to mass-decipher, but the amount of 'T7-BES' and 'SP6-BES' finally obtained is small and corresponds to the nucleotide sequence information of the BAC clone after detoxification which is the greatest advantage of BES sequence analysis The BAC clones can not be used for subsequent studies because it is impossible to identify the location of the BAC clones.

1. 대한민국 특허등록 제10-1406720호1. Korean Patent Registration No. 10-1406720 2. 대한민국 특허등록 제10-1447593호2. Korean Patent Registration No. 10-1447593 3. 대한민국 특허등록 제10-1533792호3. Korean Patent Registration No. 10-1533792 4. 대한민국 특허공개 제10-2015-0017525호4. Korean Patent Publication No. 10-2015-0017525

상기 문제점을 해결하기 위하여 본 발명은 유전체 DNA의 말단 염기서열(BES)만을 특이적으로 증폭할 수 있는 프라이머를 사용하여 NGS 방식으로 벡터에 삽입된 유전체 DNA의 말단염기서열을 신속하고 저렴한 비용으로 대량 생산하고, 생산된 BES 정보에 해당하는 클론의 위치를 규명할 수 있는 방법을 제공하는 것을 목적으로 한다.In order to solve the above problems, the present invention provides a method for rapidly and cheaply disassembling the terminal nucleotide sequence of a genomic DNA inserted into a vector by the NGS method using a primer capable of specifically amplifying only the terminal base sequence (BES) of the genomic DNA And a method for identifying the position of a clone corresponding to the produced BES information.

상기 목적을 달성하기 위하여, 본 발명은 분석 대상의 유전체 DNA를 추출하여 BAC 라이브러리를 제작하고 각 BAC 클론을 384-웰 플레이트에 모아 세포 스톡을 제작하는 단계; 상기 세포 스톡용 384-웰 플레이트 20장의 BAC 클론들을 3차원적으로 조합하여 60개 저장소로 구성된 3차원 BAC 라이브러리를 제작하는 단계; 대장균 배양액을 이용하여 상기 3차원 BAC 라이브러리의 BAC 클론들을 증식시킨 후 BAC 클론 DNA를 추출하는 단계; 상기 추출한 BAC 클론 DNA를 분석에 필요한 적절한 크기로 절단하는 단계; 상기 절단한 DNA 단편의 양쪽 말단을 평활 말단화시킨 후 단편의 양쪽 말단에 Y-형 어댑터 프라이머를 부착하는 단계; BAC 벡터의 SP6 프로모터 서열을 포함하는 프라이머 또는 T7 프로모터 서열을 포함하는 프라이머를 이용하여 상기 유전체 DNA의 양쪽 말단 서열을 증폭하는 단계; 상기 증폭된 DNA를 정제하여 NGS 서열분석에 적합한 크기로 분획하는 단계; 상기 분획된 DNA를 emPCR에 의하여 증폭하는 단계; 상기 증폭된 DNA의 서열을 NGS로 분석하여 서열데이터를 얻는 단계; 및 상기 서열데이터를 이용하여 BAC 클론의 위치를 규명하는 단계를 포함하는, 차세대 염기서열분석법을 이용한 유전체 DNA 말단 서열의 대량 분석방법을 제공한다.In order to achieve the above object, the present invention provides a method for preparing a cell stock comprising the steps of: preparing a BAC library by extracting genomic DNA to be analyzed; collecting each BAC clone in a 384-well plate to prepare a cell stock; Three-dimensional BAC clones of the 384-well plate for cell stock are three-dimensionally combined to prepare a three-dimensional BAC library consisting of 60 stores; Amplifying the BAC clones of the 3-dimensional BAC library using an Escherichia coli culture and extracting BAC clone DNA; Cutting the extracted BAC clone DNA to an appropriate size necessary for analysis; Attaching a Y-type adapter primer to both ends of the fragment after smoothing both ends of the DNA fragment; Amplifying both terminal sequences of the genomic DNA using primers comprising the SP6 promoter sequence of the BAC vector or a primer comprising the T7 promoter sequence; Purifying the amplified DNA and fractionating it into a size suitable for NGS sequence analysis; Amplifying the fractionated DNA by emPCR; Analyzing the sequence of the amplified DNA with NGS to obtain sequence data; And identifying the position of the BAC clone using the sequence data. The present invention also provides a method for mass analysis of a genomic DNA end sequence using a next-generation sequencing method.

상기 방법에서, 상기 세포 스톡용 384-웰 플레이트에 X축(수평), Y축(수직) 및 Z축(플레이트)으로 수집한 유전체 DNA의 유래를 구별할 수 있도록 횡렬, 종렬 및 플레이트 단위로 표식자 번호를 붙이는 것이 바람직하다.In the above method, the 384-well plate for cell stock is coated with a marker (in a row, column and plate) so as to distinguish the origins of the genomic DNA collected on the X axis (horizontal), Y axis It is desirable to number them.

상기 방법에서, 상기 3차원 BAC 라이브러리는, In the method, the three-dimensional BAC library comprises:

세포 스톡용 384-웰 플레이트에서 동일한 번호에 해당하는 수평축(X축)의 24개의 BAC 클론들을 하나의 저장소에 모으는 단계; 세포 스톡용 384-웰 플레이트에서 동일한 번호에 해당하는 수직축(Y축)의 16개의 BAC 클론들을 하나의 저장소에 모으는 단계; 및 세포 스톡용 384-웰 플레이트 한 장에 담겨있는 384개 BAC 클론들을 하나의 저장소에 모아 플레이트 풀(Z축)을 얻는 단계를 포함하는 방법에 의하여 제작되는 것이 바람직하다.Collecting 24 BAC clones on the horizontal axis (X axis) corresponding to the same number in 384-well plates for cell stock in one reservoir; Collecting 16 BAC clones in the vertical axis (Y axis) corresponding to the same number in a 384-well plate for cell stock into one reservoir; And collecting 384 BAC clones contained in one 384-well plate for cell stock into one reservoir to obtain a plate pool (Z axis).

상기 방법에서, 상기 Y-형 어댑터 프라이머는 서열번호 1과 서열번호 2의 염기서열을 가지며 말단의 5개의 염기서열이 비상보적인 프라이머인 것이 바람직하다.In this method, it is preferable that the Y-type adapter primer has the nucleotide sequence of SEQ ID NO: 1 and SEQ ID NO: 2, and the terminal nucleotide sequence of the five nucleotide sequences is the non-complementary primer.

상기 방법에서, 상기 추출한 BAC 클론 DNA는 초음파를 사용하여 절단하는 것이 바람직하다.In the above method, the extracted BAC clone DNA is preferably cut using ultrasonic waves.

상기 방법에서, 상기 SP6 프로모터 서열을 포함하는 프라이머는 서열번호 3의 포워드 어댑터, 3차원 BAC 라이브러리에서의 위치를 표시하는 바코드 서열 및 서열번호 4의 SP6 프로모터로 이루어진 GSB-SP6 프라이머인 것이 바람직하다.In this method, the primer comprising the SP6 promoter sequence is preferably a GSB-SP6 primer consisting of the forward adapter of SEQ ID NO: 3, the bar code sequence representing the position in the 3D BAC library, and the SP6 promoter of SEQ ID NO: 4.

상기 방법에서, 상기 T7 프로모터 서열을 포함하는 프라이머는 서열번호 5의 포워드 어댑터, 3차원 BAC 라이브러리에서의 위치를 표시하는 바코드 서열 및 서열번호 6의 T7 프로모터로 이루어진 GSB-T7 프라이머인 것이 바람직하다.In this method, the primer comprising the T7 promoter sequence comprises a forward adapter of SEQ ID NO: 5, a barcode sequence representing the position in the 3D BAC library, and GSB-T7 comprising the T7 promoter of SEQ ID NO: 6 Primer is preferred.

상기 방법에서, 상기 유전체 DNA 양쪽 말단 서열의 증폭은 제6항의 GSB-SP6 프라이머와 서열번호 7의 GSB-RP 프라이머, 또는 제7항의 GSB-T7 프라이머와 서열번호 7의 GSB-RP 프라이머를 사용하여 행하는 것이 바람직하다.In this method, the amplification of both end sequences of the genomic DNA is carried out using the GSB-SP6 primer of claim 6 and the GSB-RP primer of SEQ ID NO: 7, or the GSB-T7 primer of claim 7 and the GSB-RP primer of SEQ ID NO: .

본 발명의 방법은 분석하고자 하는 유전체 DNA의 양쪽 말단 염기서열을 대량 증폭하여 NGS 기기를 이용하여 대량 해독하고 각 말단 염기서열에 해당하는 BAC 클론의 위치를 신속하게 파악할 수 있어, 벡터에 삽입된 유전체 DNA의 말단 염기서열 정보를 신속, 간편, 저렴하게 대량 생산할 수 있다.The method of the present invention can mass-amplify both terminal base sequences of a genomic DNA to be analyzed and mass-decode it using an NGS instrument and quickly locate the BAC clone corresponding to each terminal base sequence, It is possible to mass-produce the DNA base sequence information quickly, simply and inexpensively.

본 발명의 실시예에서 실시한 7,680 클론을 기준으로 비교할 때, 전통적인 1세대 유전자분석 기술(ABI3730XL 1대 기준)은 약 1개월 이상이 소요되는 반면, 본 발명의 방법을 사용하면 1일이면 분석이 가능하며, 비용 측면에서도 1세대 유전자분석 기술의 1/10 이하의 비용이 소요된다. 또한 1세대 유전자 분석기술의 경우 서열분석의 전처리 단계가 매우 복잡하기 때문에 수 명의 인력이 필요하지만, 본 발명의 방법은 1인이 전 과정을 수행할 수 있을 정도로 일이 간단하다. In comparison with the 7,680 clones performed in the examples of the present invention, the conventional first-generation gene analysis technology (based on one ABI3730XL) takes about one month or more, whereas the method of the present invention allows analysis in one day Cost of less than 1/10 of first-generation gene analysis technology. In addition, in the first generation gene analysis technique, since the preprocessing step of the sequence analysis is very complicated, several manpower is required, but the method of the present invention is simple enough to carry out the entire process.

또한 해독한 대량의 말단염기서열 정보를 이용하여 콘틱이나 스캐폴드의 순서를 손쉽게 결정할 수 있으며 수십 메가 크기의 초대형 스캐폴드 구축 등 정밀한 유전체 물리지도 작성이 가능하게 된다.In addition, it is possible to easily determine the sequence of conic or scaffold using the decoded large amount of terminal sequence information, and it is possible to make precise dielectric physical mapping such as the construction of a very large scaffold of several tens of mega size.

또한 본 발명의 방법은 해독한 말단염기서열 정보에 해당하는 클론의 위치 규명이 가능하기 때문에 서열분석 갭 연결(sequencing gap closing), 서열 정확도 검증 등 유전체 해독의 질을 고도화시키는데 이용될 수 있다.In addition, the method of the present invention can be used to improve the quality of genome detoxification such as sequencing gap closing and sequence accuracy verification because the location of the clone corresponding to the decoded terminal sequence information can be identified.

도 1은 BAC 클론의 구조이다.
도 2는 본 발명의 전체 구성도이다.
도 3은 3차원 BAC 라이브러리 제작 모식도이다.
도 4는 BAC 클론 DNA의 추출 결과를 나타낸 것이다.
도 5는 BAC 클론 DNA의 단편화 결과를 나타낸 것이다.
도 6은 GSB-YAP 프라이머 염기서열정보이며, 밑줄부위는 비상보성 염기서열을 나타낸 것이다.
도 7은 BAC 벡터에 삽입된 유전체 DNA 말단염기서열의 증폭에 사용한 어댑터 프라이머 서열이다.
도 8은 GSB-SP6/GSB-RP 및 GSB-T7/GSB-RP 프라이머를 사용한 PCR 결과를 나타낸 것이다.
도 9는 PCR DNA의 크기 분획을 나타낸 것이다.
도 10은 GS-FLX로 피로시퀀싱한 서열 판독들의 염기배열구조이다.
도 11은 서열데이터 분석 구성도이다.
도 12는 BAC 말단서열을 이용한 유전체 물리지도 작성의 예이다.Figure 1 shows the structure of a BAC clone.
2 is an overall configuration diagram of the present invention.
3 is a schematic diagram of a three-dimensional BAC library production.
Figure 4 shows the extraction results of BAC clone DNA.
Figure 5 shows fragmentation results of BAC clone DNA.
6 shows the sequence information of GSB-YAP primer and the underlined region shows the non-complementary base sequence.
7 is an adapter primer sequence used for amplification of a DNA base sequence inserted into a BAC vector.
Figure 8 shows the results of PCR using GSB-SP6 / GSB-RP and GSB-T7 / GSB-RP primers.
Figure 9 shows the size fraction of PCR DNA.
10 is a nucleotide sequence structure of sequence reads fatigued with GS-FLX.
11 is a diagram showing the structure of sequence data analysis.
Figure 12 is an example of genetic physical mapping using the BAC end sequence.

본 발명에서 벡터로는 BAC를 사용하였으나, 통상적으로 사용되는 모든 벡터를 사용할 수 있다.
In the present invention, BAC is used as a vector, but all commonly used vectors can be used.

본 발명의 방법의 전체 구성도는 도 2에 나타낸 바와 같으며, 각 단계에 대한 구체적인 내용은 다음과 같다.
The overall configuration of the method of the present invention is as shown in FIG. 2, and details of each step are as follows.

1. 세포 1. Cells 스톡(cell stock)의Of stock 제작 making

분석하고자 하는 대상의 유전체 DNA를 추출하여 BAC 라이브러리를 제작하고 각 BAC 클론을 384-웰 플레이트(384-well plate)에 모아 세포 스톡을 제작한다.
The BAC library is prepared by extracting the genomic DNA of the subject to be analyzed, and each BAC clone is collected in a 384-well plate to prepare a cell stock.

2. 3차원 2. 3D BACBAC 라이브러리의 제작 Building the Library

상기 세포 스톡용 384-웰 플레이트 20장의 BAC 클론들을 3차원적으로 조합하여(3-dimentional pooling; 3D-pool) 60개 저장소로 구성된 3차원 BAC 라이브러리를 제작한다. 3차원 BAC 라이브러리란 384-웰 플레이트에 보관된 각각의 BAC 클론을 수평축(X축), 수직축(Y축), 플레이트 풀(Plate pool, Z축) 별로 일정량씩 적출한 후, X축, Y축, Z축 별로 모아진 각각의 BAC 클론을 하나의 저장소(reservoir)에 혼합해 놓은 라이브러리를 말한다. A three-dimensional BAC library consisting of 60 repositories of 3-dimentional pooling (3D-pool) is prepared by combining 20 BAC clones of the 384-well plate for cell stocks three-dimensionally. The three-dimensional BAC library was obtained by extracting each BAC clone stored in a 384-well plate by a predetermined amount on each of a horizontal axis (X axis), a vertical axis (Y axis), and a plate pool (Z axis) , And a library in which each BAC clone collected in the Z axis is mixed into one reservoir.

3차원 BAC 라이브러리를 제작하는 구체적인 과정은 다음과 같다.The concrete procedure for producing the 3D BAC library is as follows.

(1) 세포 스톡용 384-웰 플레이트에서 동일한 번호에 해당하는 수평축(horizontal; X축)의 24개의 BAC 클론들을 하나의 저장소에 모은다.(1) Collect the 24 BAC clones on the horizontal axis (X-axis) corresponding to the same number in one store in 384-well plate for cell stock.

(2) 세포 스톡용 384-웰 플레이트에서 동일한 번호에 해당하는 수직축(vertical; Y축)의 16개의 BAC 클론들을 하나의 저장소에 모은다.(2) Collect 16 BAC clones in vertical (Y axis) corresponding to the same number in 384-well plates for cell stock into one reservoir.

(3) 세포 스톡용 384-웰 플레이트 한 장에 담겨있는 384개 BAC 클론들을 하나의 저장소에 모아 플레이트 풀(plate pool, Z축)을 얻는다.(3) Collect 384 BAC clones contained in a 384-well plate for cell stock in one reservoir to obtain a plate pool (Z axis).

상기 세포 스톡용 384-웰 플레이트에는 X축, Y축 및 Z축으로 수집한 유전체 DNA의 유래를 구별할 수 있도록 횡렬, 종렬 및 플레이트 단위로 표식자 번호를 붙이는 것이 바람직하다. 상세하게는, 384-웰 플레이트의 상단에는 수평축(X축)으로 24개의 아라비아 숫자 일련번호(1~24)를, 좌측에는 수직축(Y축)으로 16개의 알파벳(A~P)을 새기고, 혼합하고자 하는 각각의 플레이트에서 동일한 X축 열 혹은 Y축 열에 해당하는 BAC 클론들을 일정량씩 적출하여 각 열에 따라서 각각의 용기에 담고, Z축은 384-웰 플레이트 1장에 들어 있는 384개 BAC 클론에서 동량을 적출하여 하나의 용기에 담아 3차원 BAC 라이브러리를 제작한다.
Preferably, the 384-well plate for cell stock is provided with marker numbers in a row, column, and plate unit so as to distinguish the origins of the genomic DNA collected on the X axis, the Y axis, and the Z axis. In detail, 16 alphabets (A to P) are placed on the upper side of the 384-well plate with 24 Arabic numerals (1 to 24) on the horizontal axis (X axis) and a vertical axis BAC clones corresponding to the same X-axis or Y-axis sequence in each of the plates to be desired were picked out in a predetermined amount and placed in each container according to each row. The Z axis was measured in 384 BAC clones contained in one 384- And extracted into a single container to prepare a 3D BAC library.

3. 3. BACBAC 클론의 증식 및 The proliferation of clones and BACBAC 클론 Clone DNADNA 추출 extraction

대장균을 배양하는 배양액을 이용하여 상기 3차원 BAC 라이브러리의 BAC 클론들을 증식시킨 후 BAC 클론 DNA를 추출한다.
BAC clones of the 3-dimensional BAC library are amplified using a culture medium for culturing E. coli, and then BAC clone DNA is extracted.

4. 추출한 4. Extracted DNADNA 의 절단Cutting of

상기 추출한 BAC 클론 DNA를 분석에 필요한 적절한 크기로 절단한다. 이때 DNA를 절단하는 방법은 통상적으로 사용되는 다양한 방법을 사용할 수 있지만, 초음파를 사용하여 절단하는 것이 보다 바람직하다.
The extracted BAC clone DNA is cut to an appropriate size for analysis. At this time, various methods commonly used can be used for cutting the DNA, but it is more preferable to cut the DNA using ultrasonic waves.

5. Y-형 어댑터 5. Y-type adapter 프라이머(Y-type adapter primer)의Primer (Y-type adapter primer) 부착 Attach

상기 절단한 DNA 단편의 양쪽 말단을 평활 말단화(blunt end)시킨 후, DNA 단편끼리 결합하지 못하도록 단편의 양쪽 말단에 Y-형 어댑터 프라이머를 부착한다. Y-형 어댑터 프라이머는 말단에 비상보적 서열을 가지고 있어서 유전체 DNA 말단서열을 증폭하는 과정에서 유전체 DNA의 자가결합을 방지한다. Both ends of the cleaved DNA fragment are blunt-ended, and a Y-type adapter primer is attached to both ends of the fragment so that DNA fragments can not bind to each other. The Y-type adapter primer has a non-complementary sequence at its terminus to prevent self-assembly of the genomic DNA during amplification of the DNA end sequence.

Y-형 어댑터 프라이머의 한 예를 서열목록의 서열번호 1과 서열번호 2, 및 도 6에 나타내었다. 도 6에 나타낸 바와 같이, 말단 5개의 서열은 서로 비상보적이다.
An example of a Y-type adapter primer is shown in SEQ ID NO: 1, SEQ ID NO: 2, and FIG. 6 of the Sequence Listing. As shown in Fig. 6, the five terminal sequences are non-complementary to each other.

6. 유전체 6. Dielectric DNADNA 양쪽 말단 서열의 증폭 Amplification of both end sequences

BAC 벡터에 내재된 SP6 프로모터와 T7 프로모터 서열을 이용하여 유전체 DNA의 양쪽 말단 서열을 PCR을 실시하여 증폭한다. Both ends of the genomic DNA are amplified by PCR using the SP6 promoter and the T7 promoter sequence inherent in the BAC vector.

SP6 프로모터 서열을 포함하는 프라이머 또는 T7 프로모터 서열을 포함하는 프라이머를 사용하여 유전체 DNA의 양쪽 말단 서열을 증폭하는 것이 바람직하다. It is preferable to amplify both terminal sequences of the genomic DNA using a primer comprising the SP6 promoter sequence or a primer comprising the T7 promoter sequence.

SP6 프로모터 서열을 포함하는 프라이머의 한 예인 GSB-SP6 프라이머는 서열번호 3의 포워드 어댑터(30bp), 바코드 서열(10bp) 및 서열번호 4의 SP6 프로모터(18bp)로 이루어진다. T7 프로모터 서열을 포함하는 프라이머의 한 예인 GSB-T7 프라이머는 서열번호 5의 포워드 어댑터(30bp), 바코드 서열(10bp) 및 서열번호 6의 T7 프로모터(20bp)로 이루어진다. 상기 프라이머와 함께 사용되는 역방향 프라이머의 한 예인 GSB-RP 프라이머는 서열번호 7의 서열을 가진다. 이때, 바코드 서열은 3차원 BAC 라이브러리에서의 위치를 표시하는 표식자 서열이다. 상기 프라이머들의 구성을 도 7에 나타내었다.The GSB-SP6 primer, which is an example of a primer containing the SP6 promoter sequence, consists of the forward adapter (30 bp) of SEQ ID NO: 3, the bar code sequence (10 bp) and the SP6 promoter of SEQ ID NO: 4 (18 bp). The GSB-T7 primer, which is an example of a primer containing the T7 promoter sequence, consists of the forward adapter (30 bp) of SEQ ID NO: 5, the bar code sequence (10 bp) and the T7 promoter of SEQ ID NO: 6 (20 bp). The GSB-RP primer, which is an example of a reverse primer used together with the primer, has the sequence of SEQ ID NO: 7. At this time, the barcode sequence is a marker sequence indicating the position in the 3-dimensional BAC library. The construction of the primers is shown in FIG.

하나의 저장소의 DNA를 2개로 나누어 하나는 GSB-SP6 프라이머와 GSB-RP 프라이머를 넣어 PCR을 실시하고, 다른 하나는 GSB-T7 프라이머와 GSB-RP 프라이머를 넣어 PCR을 실시하는 것이 바람직하다.
PCR is performed by dividing the DNA of one reservoir into two and adding GSB-SP6 primer and GSB-RP primer, and the other one is preferably performing PCR by inserting GSB-T7 primer and GSB-RP primer.

7. 증폭된 7. Amplified DNADNA 의 정제 및 크기분획Purification and size fractionation

상기 증폭된 DNA를 정제하여 NGS 서열분석에 적합한 크기로 분획한다. 예를 들어 NGS 분석에 GS-FLX 서열분석기를 사용하는 경우에는 평균사이즈가 700bp인 것이 바람직하다.
The amplified DNA is purified and fractionated to a size suitable for NGS sequencing. For example, when using a GS-FLX sequencer for NGS analysis, the average size is preferably 700 bp.

8. 8. emPCRemPCR 에 의한 On by DNADNA 의 증폭Amplification of

상기 적합한 크기로 분획된 DNA를 emPCR에 의하여 증폭한다. DNA 증폭에는 dNTP, PCR 버퍼, 프라이머, Taq 중합효소, PPiase(Peptidyl-Prolyl Cis-Trans Isomerase)를 혼합한 프리믹스(premix)를 사용하여 PCR 오일분자를 만든 후 PCR을 실시하여 오일분자에 혼합된 DNA를 증폭한다.
The DNA fragments of the appropriate size are amplified by emPCR. PCR amplification was carried out using dNTP, PCR buffer, primer, Taq polymerase, and PPiase (Peptidyl-Prolyl Cis-Trans Isomerase) / RTI >

9. 9. NGSNGS 에 의한 서열분석&Lt; / RTI >

상기 증폭된 DNA의 서열을 NGS로 분석하여 서열데이터를 얻는다.
Sequence of the amplified DNA is analyzed by NGS to obtain sequence data.

10. 10. BACBAC 클론의 위치 규명 Location of the clone

상기 NGS 서열분석을 통해 얻은 서열데이터를 이용하여 BAC 클론의 위치를 규명한다. Using the sequence data obtained from the NGS sequence analysis, the location of the BAC clone is identified.

얻어진 서열데이터에서 3' 말단 부위의 어댑터 프라이머를 제거하고 5' 말단 부위의 어댑터 프라이머를 가진 서열데이터를 확보한다. 확보한 서열데이터를 3차원 서열 라이브러리 제작에 사용한 바코드 서열을 이용하여 60종류로 다시 분리한 후 바코드별로 모아진 서열데이터를 콘틱으로 만든다. X축, Y축 및 Z축 각각에 해당하는 콘틱들간의 상동성 검색을 실시하여 동일한 서열별로 분류한 후 SP6 프로모터 또는 T7 프로모터의 위치가 동일한 클론별로 재분류하여 BAC 클론의 위치를 규명한다.
In the obtained sequence data, the adapter primer at the 3'-terminal region is removed and the sequence data with the adapter primer at the 5'-terminal region are obtained. The obtained sequence data is divided into 60 types using the barcode sequence used for the production of the three-dimensional sequence library, and the sequence data collected for each barcode is conic. Homologies between the cones corresponding to the X axis, the Y axis and the Z axis are searched and classified by the same sequence, and the position of the SP6 promoter or T7 promoter is reclassified by the same clone to identify the position of the BAC clone.

이하, 실시예를 통하여 본 발명을 더욱 상세히 설명한다. 하기 실시예는 본 발명을 예시하기 위한 것으로 본 발명의 범위가 이들 실시예에 의해 한정되는 것은 아니다.
Hereinafter, the present invention will be described in more detail by way of examples. The following examples are intended to illustrate the present invention and the scope of the present invention is not limited by these examples.

<< 실시예Example 1> 1>

세포 cell 스톡의Stock 제작 making

거저리(Tenebrio) 유전체 DNA를 추출한 후, CopyRight v2.0 BAC 클로닝 키트(Lucigen)를 이용하여 BAC 라이브러리를 제작하고 각각의 BAC 클론을 384-웰 플레이트에 모아(picking) 세포 스톡을 만들었다. 얻어진 세포 스톡은 -80℃ 냉동고에 보관하였다.
After extracting the Tenebrio genomic DNA, a BAC library was prepared using the CopyRight v2.0 BAC Cloning Kit (Lucigen) and each BAC clone was picked into a 384-well plate to make a cell stock. The resulting cell stock was stored in a -80 ° C freezer.

<< 실시예Example 2> 2>

3차원 3D BACBAC 라이브러리(3D- The library (3D- dimentionaldimentional BACBAC librarylibrary ) 제작Production

상기 방법으로 제작한 세포 스톡인 20장의 384-웰 플레이트를 혼합하여 60개의 저장소로 구성된 3차원 BAC 라이브러리를 제작하였다. 3차원 BAC 라이브러리의 제작과정을 도 3에 나타내었으며, 구체적인 제작방법은 다음과 같다.A three-dimensional BAC library consisting of 60 reservoirs was prepared by mixing 20 384-well plates, cell stock prepared by the above method. The production process of the 3-dimensional BAC library is shown in FIG. 3, and a concrete production method is as follows.

(1) -80℃ 냉동고에 세포 스톡으로 보관된 384-웰 플레이트 20장을 4℃ 냉장고로 옮겨 해동시켰다. (1) 20 sheets of 384-wells stored as cell stocks in a -80 DEG C freezer were transferred to a 4 DEG C refrigerator and thawed.

(2) 384-웰 플레이트는 수평축(X축)으로 24개('×1'~'×24'), 수직축(Y축)으로 16개(×A~×P)의 웰이 배열된 구조로써, 본 발명에서는 20개의 384 투명웰 플레이트에 보관된 7,680개의 BAC 클론(384클론/플레이트×20플레이트)을 이용하였다.(2) The 384-well plate is a structure in which 16 (× A to × P) wells are arranged in 24 ('× 1' to '× 24') and vertical (Y-axis) , 7,680 BAC clones (384 clones / plate x 20 plates) stored in 20 384 transparent well plates were used in the present invention.

(3) 먼저, 각 플레이트의 동일한 위치에 있는 웰(예, #1 플레이트의 h1 레인(16개 웰), #2 플레이트의 h1 레인(16개 웰)...#20 플레이트의 h1 레인(16개 웰)로부터 각 2㎕씩 스톡 세포를 취하여 50㎖의 1×LB배지가 들어있는 1개의 저장소(Rx)에 혼합하여, X축으로부터 24개의 혼합물('Rx1'~'Rx24')을 만들었다. 동일한 방법으로 Y축으로부터는 16개의 혼합물('RyA'~'RyP')을 만들었다. 또한 각 플레이트의 384개 세포를 1개의 저장소(Rp)에 혼합한 플레이트 풀(plate pool)(Z축)로 20개의 혼합물('Rz1'~'Rz20')을 만들었다. 최종적으로 7,680개의 BAC 클론으로부터 60개의 혼합물 저장소(X축:24개, Y축:16개, Z축:20개)를 만들었다.(16 wells) of the # 1 plate, the h1 lane (16 wells) of the # 2 plate, the h1 lane (16 wells) of the # 20 plate, (Rx1 'to Rx24') were prepared from the X-axis by mixing stocks (Rx) containing 50 ml of 1 × LB medium. In the same manner, 16 mixtures ('RyA' to 'RyP') were prepared from the Y axis, and a plate pool (Z axis) in which 384 cells of each plate were mixed in one reservoir (X-axis: 24, Y-axis: 16, and Z-axis: 20) from 7,680 BAC clones.

(4) 각 저장소에 모아진 BAC 클론 혼합물과 동량의 DMSO 동결보존액을 섞어 준 후, -80℃ 냉동고에 보관하였다.(4) The BAC clone mixture collected in each reservoir was mixed with an equal amount of DMSO frozen stock solution and stored in a -80 ° C freezer.

(5) -80℃ 냉동고의 저장소 스톡으로부터 각각 312㎕(X축), 208㎕(Y축), 500㎕(Z축)를 취하여 50㎖의 1×LB 배지와 혼합한 후, 37℃ 배양기에서 200rpm으로 진탕(shaking)하면서 6시간 동안 배양하였다(OD=약1.0).
(5) Each of 312 占 퐇 (X axis), 208 占 퐇 (Y axis) and 500 占 퐇 (Z axis) were taken from a storage stock of a -80 占 폚 freezer and mixed with 50 ml of 1 占 LB medium. And incubated for 6 hours with shaking at 200 rpm (OD = about 1.0).

<< 실시예Example 3> 3>

BACBAC 클론 Clone DNADNA 의 추출Extraction of

BAC 클론 DNA의 추출은 HiPure 플라스미드 키트(Invitrogen)를 사용하였다. 키트에 설명된 사용방법에 따라서 배양한 용액을 원심분리방법으로 모은 균체에 현탁버퍼(suspension buffer) 4㎖, 라이시스버퍼(lysis buffer) 4㎖, 중화버퍼(neutralize buffer) 4㎖를 순차적으로 넣고 일정시간씩 반응시켜 최종적으로 균체의 세포막을 용해시켰다. 용해된 액체를 원심분리하여 취한 상층액을 칼럼에 넣고 일정시간 실온에 둔 후 칼럼을 통과시켜 BAC 클론 DNA를 추출하였다. BAC clone DNA was extracted using HiPure plasmid kit (Invitrogen). 4 ml of a suspension buffer, 4 ml of a lysis buffer and 4 ml of a neutralizing buffer were sequentially added to the cells collected by the centrifugation method in accordance with the use method described in the kit, The cells were reacted for a certain period of time to finally dissolve cell membranes. The supernatant obtained by centrifugation of the dissolved liquid was placed in a column, allowed to stand at room temperature for a certain time, and then passed through a column to extract BAC clone DNA.

BAC 클론 DNA를 추출한 결과를 도 4에 나타내었다. 도 4에서 1~6레인은 BAC 클론 DNA를 각 1㎕ 로딩(=100~120ng)한 것이며, M은 1kb 람다 래더(lambda ladder)이다. The results of extraction of BAC clone DNA are shown in Fig. In FIG. 4, 1 to 6 lanes are obtained by loading 1 각 each of BAC clone DNA (= 100 to 120 ng), and M is 1 kb lambda ladder.

상기 방법으로 총 60개의 저장소로부터 추출한 BAC 클론 DNA의 양은 각각 1~4㎍이었다.
The amount of BAC clone DNA extracted from a total of 60 reservoirs was 1 to 4 μg each.

<< 실시예Example 4> 4>

DNADNA 의 단편화Fragmentation

1.5㎖ 마이크로튜브에 추출한 BAC 클론 DNA 1㎍을 넣고 1×TE로 전체 부피를 50㎕로 조정한 후, 초음파 기기(US portable cleaners, 모델명:US-05, JEIOTECH, Korea)의 '하이 모드(high mode)'에서 30~120초간 반응시켜, DNA를 100bp~10kb 크기로 단편화시켰다. 그 결과를 도 5에 나타내었다. 각각 3㎕ 로딩(=100~120ng)한 것이고, 사이즈 마커는 100bp 래더이다.1 μg of BAC clone DNA extracted into a 1.5 ml microtube was added and the volume was adjusted to 1 μl with a total volume of 50 μl. The volume of the high (high) mode of US portable cleaners (US-05, JEIOTECH, Korea) mode) for 30 to 120 seconds to fragment the DNA to a size of 100 bp to 10 kb. The results are shown in Fig. (= 100 to 120 ng), respectively, and the size marker is a 100 bp ladder.

<< 실시예Example 5> 5>

단편화된 Fragmented DNADNA 말단부위 수복 Restoration of distal site

단편화된 DNA의 말단 부위를 평활말단(blunt form)으로 만들기 위해 말단 수복 효소 믹스 키트(end repairing enzyme mix kit, Fermentas, K0771)를 사용하였다. The end repairing enzyme mix kit (Fermentas, K0771) was used to blunt the end of the fragmented DNA.

제조회사의 키트 사용방법에 따라서 각각의 DNA가 담겨있는 1.5㎖ 마이크로튜브에 버퍼와 효소를 넣고 혼합하여 20℃ 수조블록(water bath block)에서 5분간 반응시킨 후, 페놀클로로포름(phenolchloroform)과 에탄올 침전법으로 DNA를 정제하고 멸균수 6㎕에 DNA를 녹여서 그 농도를 측정하였다.
The buffer and enzyme were mixed in a 1.5 ml microtube containing each DNA according to the manufacturer's kit method, and the mixture was reacted in a water bath block at 20 ° C for 5 minutes. Then, phenol chloroform and ethanol precipitation The DNA was purified by the method and the DNA was dissolved in 6 sterilized water and its concentration was measured.

<< 실시예Example 6> 6>

Y-형 어댑터 Y-type adapter 프라이머의Primer 결합 Combination

중합효소반응시 시발체가 되고, DNA 단편들이 자가결합(self ligation)되는 것을 막기 위해 Y-형 어댑터 프라이머(GSB-YAP)을 양쪽 말단에 결합시켜 주었다. Y-type adapter primer (GSB-YAP) was attached to both ends to prevent self-ligation of DNA fragments as a primer during the polymerase reaction.

사용한 GSB-YAP은 18bp의 이중가닥 DNA로서 말단부위 5개 염기서열이 서로 상보적이지 않는 구조를 가지고 있다. GSB-YAP 프라이머 염기서열 정보를 도 6에 나타내었으며, 도 6에서 밑줄친 부위는 비상보성 염기서열을 나타낸다.The GSB-YAP used was 18 bp double-stranded DNA. It has a structure in which the 5 nucleotide sequences at the terminal region are not complementary to each other. The GSB-YAP primer sequence information is shown in FIG. 6, and the underlined region in FIG. 6 represents the non-complementary base sequence.

평활말단화시킨 DNA 200ng과 GSB-YAP 200ng을 넣은 튜브에 리가아제(ligase) 2.5㎕를 혼합하고 16℃ 수조에서 3시간 동안 반응시켰다. 반응이 끝나면 DNA 정제키트를 사용하여 정제한 후 DNA를 멸균수 30㎕에 녹였다.
To the tube containing 200 ng of smooth-polished DNA and 200 ng of GSB-YAP, 2.5 리 of ligase was mixed and reacted in a water bath at 16 캜 for 3 hours. After the reaction was completed, the DNA was purified using a DNA purification kit, and the DNA was dissolved in 30 μl of sterilized water.

<< 실시예Example 7> 7>

BACBAC 벡터에 삽입된 유전체 Dielectric inserted into vector DNADNA 말단 염기서열의 증폭 Amplification of the terminal base sequence

삽입된 유전체 DNA의 양쪽말단 염기서열을 특이적으로 증폭하기 위하여 BAC 벡터의 SP6 프로모터 또는 T7 프로모터 서열, 바코드(Barcode) 서열, emPCR용 포워드 어댑터 프라이머(forward adaptor primer)를 결합한 프라이머(GSB-SP6과 GSB-T7)를 각각 디자인하여 제작하였다. BAC 벡터에 삽입된 유전체 DNA 말단염기서열의 증폭에 사용한 프라이머인 GSB-SP6 프라이머(서열번호 2), GSB-T7 프라이머(서열번호 3) 및 GSB_RP 프라이머(서열번호 4)의 서열을 도 7에 나타내었다.SPB promoter or T7 promoter sequence, a barcode sequence, a forward adapter primer for emPCR (forward primer) (GSB-SP6 and Bacillus spp.) In order to specifically amplify both terminal base sequences of the inserted genomic DNA GSB-T7) were designed and manufactured. The sequences of GSB-SP6 primer (SEQ ID NO: 2), GSB-T7 primer (SEQ ID NO: 3) and GSB_RP primer (SEQ ID NO: 4) which are the primers used for amplification of the DNA base sequence inserted into the BAC vector are shown in FIG. .

하나의 저장소 DNA를 두 개의 튜브로 나눈 후, 한 튜브에는 GSB-SP6 프라이머와 GSB-RP 프라이머를 넣고, 다른 튜브에는 GSB-T7 프라이머와 GSB-RP 어댑터 프라이머를 넣어 PCR을 실시하였다. GSB-SP6 primer and GSB-RP primer were inserted into one tube, and GSB-T7 primer and GSB-RP adapter primer were added to the other tube to perform PCR.

PCR 증폭반응은 ProDNi 유전자증폭기(ProDNi thermocycler, GnC Bio.)를 사용하여 실시하였으며, 각 반응은 Taq DNA 중합효소(Taq DNA polymerase) 0.25㎕, 10×버퍼 2.5㎕, 50×dNTP 0.5㎕, GSB-SP6 프라이머 10pmol, GSB-RP 프라이머 1㎕/10pmol, 주형 DNA(Template DNA) 3㎕를 함유하는 PCR 혼합액으로 총 25㎕의 양으로 수행하였다. 반응은 96℃에서 3분간 변성하고, 96℃ 20초, 50℃ 20초, 72℃ 20초로 40회 반복시킨 후 72℃에서 10분간 반응시켰다. 증폭반응 후 1% 농도의 아가로스 젤에서 전기영동한 후 에티디움 브로마이드(ethidium bromide)로 착색시켜 UV 트랜스일루미네이커(transilluminator)에서 확인하였다. 그 결과를 도 8에 나타내었다. 도 8에는 M은 100bp 사이즈 마커이다.PCR amplification was carried out using a ProDNi thermocycler (GnC Bio). Each reaction was performed by adding 0.25 μl of Taq DNA polymerase, 2.5 μl of 10 × buffer, 0.5 μl of 50 × dNTP, 10 pmol of SP6 primer, 1 mu l / 10 pmol of GSB-RP primer and 3 mu l of template DNA (Template DNA). The reaction was denatured at 96 캜 for 3 minutes, repeated 40 times at 96 캜 for 20 seconds, 50 캜 for 20 seconds, and 72 캜 for 20 seconds, followed by reaction at 72 캜 for 10 minutes. After the amplification reaction, electrophoresis was performed on agarose gel at a concentration of 1%, followed by coloring with ethidium bromide and confirmed in a UV transilluminator. The results are shown in Fig. 8, M is a 100 bp size marker.

그 결과, 도 8의 a에서 2.0kb 이하의 PCR 단편이 증폭되는 것을 확인하였으며, X축, Y축, Z축으로 수집한 60개의 저장소의 BAC 클론에 대하여 동일한 방법으로 PCR을 실시하여 도 8의 b와 같은 결과를 얻었다.
As a result, it was confirmed that the PCR fragment of 2.0 kb or less was amplified in FIG. 8 (a), and PCR was performed on BAC clones of 60 reservoirs collected on the X axis, Y axis and Z axis, b.

<< 실시예Example 8> 8>

크기 분획(Size fraction ( SizeSize fractionfraction ))

GSB-T7/GSB-RP 60종류, GSB-SP6/GSB-RP 60종류로 증폭된 120종류의 DNA를 하나의 튜브로 다시 합친 후, GS-FLX 서열분석기(sequencer)에 사용가능한 크기인 평균크기 700bp의 DNA를 추출하기 위해 크로마 스핀 TE1000 칼럼 키트(chroma spin TE1000 column kit, Clontech)와 앰퓨어 비드(Ampure bead, Beckmancoulter)로 크기 분획을 실시하였다. 키트는 제품설명서에 따라서 사용하였다. 120 DNAs amplified with 60 types of GSB-T7 / GSB-RP and 60 types of GSB-SP6 / GSB-RP were reassembled into a single tube, and then the average sizes of GS-FLX sequencers To extract 700 bp of DNA, size fractions were performed with a chroma spin TE1000 column kit (Clontech TE1000 column kit, Clontech) and Ampure bead (Beckmancoulter). The kit was used according to the product manual.

DNA 약 100㎕를 크기 분획 비드가 들어있는 TE1000 칼럼에 통과시키고 자유낙하시켜 총 10개 방울을 받아낸 뒤 전기영동으로 크기를 확인하고(도 9의 a), 큰 사이즈의 DNA가 상대적으로 적은 10번째 방울의 DNA로 다음 과정을 진행하였다. Approximately 100 DNA of DNA was passed through a TE1000 column containing size fraction beads and freely dropped. Ten total droplets were collected, and the size was confirmed by electrophoresis (Fig. 9 (a)). The second step was performed with the DNA of the second drop.

작은 크기의 DNA를 제거하기 위하여 큰 사이즈의 DNA를 제거한 10번째 방울의 DNA가 들어있는 튜브에 앰퓨어 XP 비드(AMPure XP Bead) 250㎕를 넣어 MPC 장치에 장착한 후 버퍼를 제거하고, 사이징 용액(sizing solution) 500㎕를 첨가하여 앰퓨어 XP 비드와 혼합하였다. 사이징 혼합액(sizing mix) 125㎕를 회수한 10번째 방울의 PCR 산물 50㎕와 재혼합한 후, 25℃ 수조에서 5분간 인큐베이션하고, MPC에 장착하여 상층액을 회수하였다. 회수한 상층액 125㎕를 사용하고 남은 사이징 혼합액 375㎕에 넣고 혼합하여 25℃ 수조에서 인큐베이션한 후, MPC에 다시 장착하여 상층액을 제거하였다. 상층액을 제거한 튜브에 TE 버퍼 100㎕를 첨가하여 흔들어준(vortex) 후 사이징 용액 500㎕를 첨가하여 흔들어주고 25℃ 수조에서 5분간 인큐베이션하였다. 이 용액을 MPC에 장착하여 상층액을 제거하고, 70% 에탄올로 2회 세척한 후 앰퓨어 XP 비드를 대기건조시켰다. 건조된 비드에 TE 버퍼 23㎕를 넣어 흔들어준 후, 다시 MPC에 장착하여 상층액 21㎕를 회수하였다. 회수한 DNA의 크기와 농도는 피코RNA 칩(PicoRNA chip)을 사용하여 바이오애널라이저 2100(Bioanalyzer 2100, BECKMAN Co.)으로 측정하였다. 그 결과 DNA 농도는 2.24ng/㎕, 평균 크기는 704bp임을 확인하였다(도 9의 b)
To remove small size DNA, 250 μl of Ampou XP Bead was added to a tube containing the DNA of the 10th drop from which large size DNA had been removed, and the DNA was mounted on the MPC apparatus. Then, the buffer was removed, (sizing solution) was added and mixed with amphire XP beads. 125 [mu] l of the sizing mix was collected and re-blended with 50 [mu] l of the 10th PCR product, and then incubated for 5 minutes in a water bath at 25 [deg.] C and mounted on MPC to recover the supernatant. 125 쨉 l of the recovered supernatant was added to 375 쨉 l of the remaining sizing mixture, and the mixture was incubated in a 25 째 C water bath, and then mounted in MPC to remove supernatant. 100 쨉 l of TE buffer was added to the tube from which the supernatant was removed, and vortexed. Then, 500 쨉 l of a sizing solution was added, followed by shaking, followed by incubation in a water bath at 25 캜 for 5 minutes. This solution was mounted on MPC, the supernatant was removed, washed twice with 70% ethanol, and then the amphora XP beads were air-dried. 23 쨉 l of TE buffer was added to the dried beads, followed by shaking, and then mounted on MPC to recover 21 쨉 l of the supernatant. The size and concentration of the recovered DNA were measured with a BioAnalyzer 2100 (Bioanalyzer 2100, BECKMAN Co.) using a pico RNA chip. As a result, it was confirmed that the DNA concentration was 2.24 ng / μl and the average size was 704 bp (FIG. 9 b)

<< 실시예Example 9> 9>

emPCRemPCR 을 통한 through DNADNA 의 증폭Amplification of

전과정에 필요한 시약 및 방법은 Roche사의 사용방법에 따라서 실시하였다. dNTP, PCR 버퍼, 프라이머, Taq 중합효소, PPiase를 혼합한 프리믹스(premix)를 32개 에멀젼 튜브(emulsion tube)에 분주하고, 실시예 8에서 회수하여 바이오애널라이저 PicoRNA GSB-RPnning 칩(Bioanalyzer PicoRNA GSB-RPnning chip)으로 계산한 DNA의 카피 개수와 비드의 개수를 적정량 혼합 후 비드와 단일가닥 DNA가 결합할 수 있도록 PCR 기기로 80℃에서 20℃까지 순차적으로 온도를 내리는 반응을 실시하였다. 반응이 끝난 DNA 포획 비드(DNA captured bead)에 멸균수를 넣은 후, PCR 프리믹스가 혼합된 오일(oil)과 혼합시켜 티슈라이저(Tissue lyser)를 사용하여 12Hz에서 5분 동안 셰이킹(shaking)하여 PCR 오일 분자(microreactors)를 만들었다. PCR 오일분자가 들어있는 튜브를 PCR기기에 장착하여 PCR을 실시하여 오일분자에 혼합된 DNA를 증폭하였다.
The reagents and methods required for the entire process were performed according to the Roche method. A premix prepared by mixing dNTP, PCR buffer, primer, Taq polymerase and PPiase was dispensed into 32 emulsion tubes and recovered in Example 8 to obtain a bioanalyzer PicoRNA GSB- RPnning chip) and the number of beads were mixed in an appropriate amount. Then, PCR was performed to lower the temperature sequentially from 80 ° C to 20 ° C so that the beads and single-stranded DNA could bind. Sterile water was added to the DNA-captured beads after the reaction, and the PCR premix was mixed with the mixed oil and shaken at 12 Hz for 5 minutes using a Tissue lyser PCR oil molecules (microreactors) were made. A tube containing the PCR oil molecule was attached to the PCR instrument and subjected to PCR to amplify the mixed DNA in the oil molecule.

<< 실시예Example 10> 10>

GSGS -- FLXFLX 서열분석 Sequencing

전과정에 필요한 시약 및 방법은 Roche사의 사용방법에 따라서 실시하였다. emPCR이 끝난 시료에, 이소프로판올(iso-propanol), 에탄올, 증진 유체 버퍼(enhancing fluid buffer) 등을 사용하여 스트렙토아비딘(streptoavidin)이 코팅된 비드를 회수하였다. 용융용액(Melting solution)과 어닐링버퍼(annealing buffer)로 비드를 중화시킨 후 강화 프라이머(enrichment primer)를 넣고 65℃ 수조에서 5분 동안 반응시켜 서열분석 시료를 준비하였다. 증진버퍼(enhancing buffer)로 수차례 세척한 강화 비드(enrichment bead)를 준비된 DNA 시료와 혼합하고 용융용액으로 세척하여 서열분석 프라이머가 표적 DNA가 결합된 비드만을 회수하였다. 회수한 포획비드를 PicoTiter^TM 플레이트에 넣고 12시간 동안 피로시퀀싱(pyrosequencing)을 진행하였다. 최종적으로 얻어지는 시퀀스의 구조는 도 10에 나타낸 바와 같다. 각 서열 판독들은 SP6 프로모터, T7 프로모터 및 바코드 서열로 분류할 수 있다.
The reagents and methods required for the entire process were performed according to the Roche method. Streptavidin-coated beads were recovered from the emPCR-treated samples using isopropanol, ethanol, enhancing fluid buffer and the like. After the beads were neutralized with a melting solution and an annealing buffer, enrichment primers were added and reacted for 5 minutes in a water bath at 65 ° C. to prepare a sequence analysis sample. The enrichment beads, which had been washed several times with an enhancing buffer, were mixed with the prepared DNA samples and washed with a molten solution. Only the beads bound to the target DNA were recovered by the sequencing primer. The recovered capture beads were placed in a PicoTiter ^TM plate and subjected to pyrosequencing for 12 hours. The structure of the finally obtained sequence is shown in Fig. Each sequence reading can be classified as an SP6 promoter, a T7 promoter, and a barcode sequence.

<< 실시예Example 11> 11>

서열데이터(Sequence data ( SequenceSequence datadata ) 분석) analysis

XL70 서열분석 라이브러리 키트(XL70 sequencing library kit)로 피로시퀀싱을 실시하여 647,759개의 서열분석 판독(sequencing read)을 확보한 후, BlastN 방법으로 서열 라이브러리 제작과정에서 3' 말단에 부착시킨 어댑터 서열(GSB-RP)을 모두 제거하고, 5' 말단 부위에 어댑터 프라이머로 부착시킨 GSB-SP6와 GSB-T7를 가지고 있는 서열분석 판독 171,104개와 234,984개를 각각 확보하였다. 확보한 각각의 서열분석 판독을 3D-풀 라이브러리(3D-pool library) 제작에 사용한 바코드 서열을 이용하여 60종류(X축: 24, Y축: 16, Z축: 20)로 다시 분리한 후, 바코드별로 모아진 서열분석 판독은 CAP3 어셈블러(assembler)를 사용하여 콘틱으로 만들었다. X축, Y축, Z축 각각에 해당하는 콘틱들간에 상동성 검색을 실시하여 동일한 서열별로 분류한 후, SP6 및 T7에서 위치가 동일한 클론별로 재분류하여 최종적으로 BAC 클론의 위치를 규명하였다. 서열데이터 분석구성도를 도 11에 나타내었다.The adapter sequence (GSB-1) attached to the 3'-end of the sequence library construct by the BlastN method after obtaining 647,759 sequencing readings by performing the fat-sequencing with the XL70 sequencing library kit, RP) were removed, and 171,104 and 234,984 readings of sequence analysis with GSB-SP6 and GSB-T7 attached to the 5 'end region with adapter primer, respectively, were obtained. Each of the obtained sequencing analyzes was separated again into 60 types (X-axis: 24, Y-axis: 16, Z-axis: 20) using the bar code sequence used for the 3D-pool library, Sequence analysis read by bar code was made conic using CAP3 assembler. The homology search was performed among the cones corresponding to each of the X axis, Y axis and Z axis, and classified according to the same sequence. Then, SP6 and T7 were reclassified by identical clones to finally locate the BAC clone. Fig. 11 shows the sequence data analysis structure.

BlastN을 사용하여 서열 상동성을 X축, Y축, Z축으로 상호교차검색 한 결과, 본 실험에 사용한 7,680종류(384-웰 플레이트 20장)에서 클론의 위치가 판명된 BAC 클론의 수는 총 7,108개(실험군의 93%)였으며, 각 판독들의 서열 길이는 17bp~793bp 분포를 보였으며, 평균길이는 268bp였다. 위치가 판명된 7,108개 중에서 BAC 클론의 양쪽 말단서열 SP6와 T7이 모두 확인된 클론은 5,062개(실험군의 66%), SP6만 확인된 클론은 1,153개(실험군의 15%), T7만 확인된 클론은 893개(12%)였다.
As a result of cross-searching of sequence homology using X-axis, Y-axis and Z-axis using BlastN, the number of BAC clones in which clones were found in 7,680 kinds (384-well plates 20) 7,108 (93% of the experimental group). The sequence length of each reading was 17bp ~ 793bp, and the mean length was 268bp. Of the 7,108 sites identified, 5,062 clones (66% of the experimental group), 1,153 clones (15% of the experimental group), and only T7 were identified with both end sequences SP6 and T7 of the BAC clones The clones were 893 (12%).

<< 실시예Example 12> 12>

BACBAC 말단 서열을 이용한 유전체 물리지도( Dielectric Physical Map Using Terminal Sequence genomegenome physicalphysical mapmap ) 작성) write

HiSeq 2500, GS-FLX 등 NGS 기기를 사용하여 샷건(shot gun) 서열분석 및 어셈블리를 통해 얻은 콘틱과 스캐폴드 서열에 대해 본 발명에서 얻은 BAC 말단서열을 BLASTN 방법으로 상동성 검색을 실시하여 유전체 물리지도를 작성하였으며, 그 결과를 도 12에 나타내었다. 위치가 규명된 BAC의 양쪽말단 염기서열(SP6는 검정색, T7은 빨간색)을 이용하여 유전체 구조에 맞게 스캐폴드의 순서를 규명할 수가 있었다. The BAC end sequence obtained in the present invention was subjected to homology search using the BLASTN method for the conic and scaffold sequences obtained by shot gun sequence analysis and assembly using NGS instruments such as HiSeq 2500 and GS-FLX, A map was created and the results are shown in Fig. We could identify the sequence of scaffolds according to the genome structure by using both terminal sequences (SP6 for black and T7 for red) of the identified BACs.

<110> GnC Bio Co.,LTD. <120> Sequencing method of genomic DNA end sequence using NGS <130> 000 <160> 7 <170> KopatentIn 2.0 <210> 1 <211> 18 <212> DNA <213> Artificial Sequence <220> <223> GSB-YAP primer <400> 1 ccgatctaga tcgtccga 18 <210> 2 <211> 18 <212> DNA <213> Artificial Sequence <220> <223> GSB-YAP primer <400> 2 ggctagatct agcgtcct 18 <210> 3 <211> 30 <212> DNA <213> Artificial Sequence <220> <223> forward adaptor in GSB-SP6 primer <400> 3 ccatctcatc cctgcgtgtc tccgactcag 30 <210> 4 <211> 18 <212> DNA <213> Artificial Sequence <220> <223> SP6 promoter in GSB-SP6 primer <400> 4 atttaggtga cactatag 18 <210> 5 <211> 30 <212> DNA <213> Artificial Sequence <220> <223> forward adaptor in GSB-T7 primer <400> 5 ccatctcatc cctgcgtgtc tccgactcag 30 <210> 6 <211> 20 <212> DNA <213> Artificial Sequence <220> <223> T7 promoter in GSB-T7 primer <400> 6 taatacgact cactataggg 20 <210> 7 <211> 48 <212> DNA <213> Artificial Sequence <220> <223> GSB-RP primer <400> 7 cctatcccct gtgtgccttg gcagtctcag tcctgcgatc tagatcgg 48 <110> GnC Bio Co., LTD. <120> Sequencing method of genomic DNA end sequence using NGS <130> 000 <160> 7 <170> Kopatentin 2.0 <210> 1 <211> 18 <212> DNA <213> Artificial Sequence <220> <223> GSB-YAP primer <400> 1 ccgatctaga tcgtccga 18 <210> 2 <211> 18 <212> DNA <213> Artificial Sequence <220> <223> GSB-YAP primer <400> 2 ggctagatct agcgtcct 18 <210> 3 <211> 30 <212> DNA <213> Artificial Sequence <220> <223> forward adapter in GSB-SP6 primer <400> 3 ccatctcatc cctgcgtgtc tccgactcag 30 <210> 4 <211> 18 <212> DNA <213> Artificial Sequence <220> <223> SP6 promoter in GSB-SP6 primer <400> 4 atttaggtga cactatag 18 <210> 5 <211> 30 <212> DNA <213> Artificial Sequence <220> <223> forward adapter in GSB-T7 primer <400> 5 ccatctcatc cctgcgtgtc tccgactcag 30 <210> 6 <211> 20 <212> DNA <213> Artificial Sequence <220> <223> T7 promoter in GSB-T7 primer <400> 6 taatacgact cactataggg 20 <210> 7 <211> 48 <212> DNA <213> Artificial Sequence <220> <223> GSB-RP primer <400> 7 cctatcccct gtgtgccttg gcagtctcag tcctgcgatc tagatcgg 48

Claims

Preparing a BAC library by extracting genomic DNA to be analyzed and collecting each BAC clone in a 384-well plate to prepare a cell stock;
Three-dimensional BAC clones of the 384-well plate for cell stock are three-dimensionally combined to prepare a three-dimensional BAC library consisting of 60 stores;
Amplifying the BAC clones of the 3-dimensional BAC library using an Escherichia coli culture and extracting BAC clone DNA;
Cutting the extracted BAC clone DNA to a size suitable for analysis;
Attaching a Y-type adapter primer to both ends of the fragment after smoothing both ends of the DNA fragment;
Amplifying both terminal sequences of the genomic DNA using primers comprising the SP6 promoter sequence of the BAC vector or a primer comprising the T7 promoter sequence;
Purifying the amplified DNA and fractionating it into a size suitable for NGS sequence analysis;
Amplifying the fractionated DNA by emPCR;
Analyzing the sequence of the amplified DNA with NGS to obtain sequence data; And
And identifying the location of the BAC clone using the sequence data. &Lt; Desc / Clms Page number 19 >

The method according to claim 1,
To label the 384-well plate for cell stocks with marker numbers in rows, columns and plates so as to distinguish the origins of the collected genomic DNA on the X axis (horizontal), Y axis (vertical) and Z axis Lt; / RTI >

3. The method of claim 2,
The three-dimensional BAC library comprises:
Collecting 24 BAC clones on the horizontal axis (X axis) corresponding to the same number in 384-well plates for cell stock in one reservoir;
Collecting 16 BAC clones in the vertical axis (Y axis) corresponding to the same number in a 384-well plate for cell stock into one reservoir; And
(Z axis) by collecting 384 BAC clones contained in one 384-well plate for cell stock into one reservoir.

The method according to claim 1,
Wherein the Y-type adapter primer has the nucleotide sequence of SEQ ID NO: 1 and SEQ ID NO: 2 and the terminal nucleotide sequence of the five nucleotide sequences is an unconformable primer.

The method according to claim 1,
Wherein the extracted BAC clone DNA is cut using ultrasonic waves.

The method according to claim 1,
Wherein the primer comprising the SP6 promoter sequence is a GSB-SP6 primer consisting of the forward adapter of SEQ ID NO: 3, the barcode sequence representing the position in the 3D BAC library, and the SP6 promoter of SEQ ID NO: 4.

The method according to claim 1,
The primer comprising the T7 promoter sequence comprises a forward adapter of SEQ ID NO: 5, a barcode sequence representing the position in the 3D BAC library, and GSB-T7 comprising the T7 promoter of SEQ ID NO: 6 Primer. &Lt; / RTI >

The method according to claim 1,
The amplification of both end sequences of the genomic DNA is performed using the GSB-SP6 primer of SEQ ID NO: 6, the GSB-RP primer of SEQ ID NO: 7, or the GSB-T7 primer of SEQ ID NO: 7 and the GSB-RP primer of SEQ ID NO: How to.