KR101139360B1

KR101139360B1 - Polynucleotides derived from PRKCI, MAPK10, SPP1, IQGAP2, FGFR4, NOTCH4, HLA-DRA, HLA-DOA, THBS2, DFNA5, TBXAS1, TNKS, CDH17, UBR5, KIAA0196, and NSMCE2 genes comprising single nucleotide polymorphisms, microarrays and diagnostic kits comprising the same, and analytic methods using the same

Info

Publication number: KR101139360B1
Application number: KR1020090100058A
Authority: KR
Inventors: 곽규범; 편정아; 김지혜; 이광재; 조성원
Original assignee: 차의과학대학교 산학협력단
Priority date: 2009-10-21
Filing date: 2009-10-21
Publication date: 2012-04-26
Also published as: KR20110043098A

Abstract

본 발명은 PRKCI, MAPK10, SPP1, IQGAP2, FGFR4, NOTCH4, HLA-DRA, HLA-DOA, THBS2, DFNA5, TBXAS1, TNKS, CDH17, UBR5, KIAA0196, 및 NSMCE2 유전자로부터 유래된 단일염기다형(Single-Nucleotide Polymorphism; SNP)을 포함하는 폴리뉴클레오티드 또는 그의 상보적 폴리뉴클레오티드를 제공하며, 상기 폴리뉴클레오티드 또는 그의 상보적 폴리뉴클레오티드는 장형 위암(Intestinal type of gastric cancer) 의 임상적 진단의 표지 인자로서, 사용될 수 있다.The present invention relates to single base polymorphisms (Single-Nucleotide) derived from the genes PRKCI, MAPK10, SPP1, IQGAP2, FGFR4, NOTCH4, HLA-DRA, HLA-DOA, THBS2, DFNA5, TBXAS1, TNKS, CDH17, UBR5, KIAA0196, and NSMCE2. Polymorphisms (SNPs), or complementary polynucleotides thereof, which may be used as markers for clinical diagnosis of Intestinal type of gastric cancer. .

PRKCI, MAPK10, SPP1, IQGAP2, FGFR4, NOTCH4, HLA-DRA, HLA-DOA, THBS2, DFNA5, TBXAS1, TNKS, CDH17, UBR5, KIAA0196, NSMCE2, 장형 위암 PRKCI, MAPK10, SPP1, IQGAP2, FGFR4, NOTCH4, HLA-DRA, HLA-DOA, THBS2, DFNA5, TBXAS1, TNKS, CDH17, UBR5, KIAA0196, NSMCE2, Enteric Gastric Cancer

Description

GF, SB, HC5, HC5, HC5, HC5, HC5, HC5, HC5, HC5, HC5, HC5, HC5, HC5, HC5, HC5, HC2, HC5, HC5, HC5, HC5, HC5, HC5, HC5, HC5, HC5, HC5, HC5, HC5, HC5, HC5, HC5, HC2, HC5, HC5, HC5, HC5, HC5, HC5, HC5, HC2, HC5, HC5, HC5, HC5, HC5, HC5, HC5, HC5, HC5, HC5, HC5, HC5, HC5, HC5, HC5, HC5, Microarrays and diagnostic kits included, and analysis methods using the same {Polynucleotides derived from PRKCI, MAPK10, SPP1, IQGAP2, FGFR4, NOTCH4, HLA-DRA, HLA-DOA, THBS2, DFNA5, TBXAS1, TNKS, CDH17, UBR5, KIAA0196 , and NSMCE2 genes comprising single nucleotide polymorphisms, microarrays and diagnostic kits comprising the same, and analytic methods using the same}

본 발명은 PRKCI, MAPK10, SPP1, IQGAP2, FGFR4, NOTCH4, HLA-DRA, HLA-DOA, THBS2, DFNA5, TBXAS1, TNKS, CDH17, UBR5, KIAA0196, 및 NSMCE2 유전자로부터 유래된 단일염기다형을 포함하는 폴리뉴클레오티드 또는 그의 상보적 뉴클레오티드; 상기 폴리뉴클레오티드 또는 그의 상보적 뉴클레오티드로 이루어진 장형 위암(intestinal type of gastric cancer) 진단을 위한 증폭용 프라이머 또는 장형 위암 진단용 프로브; 상기 프로브를 포함하는 마이크로어레이; 및 상기 폴리뉴클레오티드 또는 그의 상보적 뉴클레오티드를 이용한 장형 위암 진단에 필요한 정보를 제공하기 위한 분석 방법에 관한 것이다.The present invention provides polybasic polynucleotides derived from the PRKCI, MAPK10, SPP1, IQGAP2, FGFR4, NOTCH4, HLA-DRA, HLA-DOA, THBS2, DFNA5, TBXAS1, TNKS, CDH17, UBR5, KIAA0196, and NSMCE2 genes. Nucleotides or their complementary nucleotides; Amplification primers or enteric gastric cancer diagnostic probes for diagnosing intestinal type of gastric cancer comprising the polynucleotide or its complementary nucleotides; A microarray comprising the probe; And an analysis method for providing information necessary for diagnosing enteric gastric cancer using the polynucleotide or its complementary nucleotides.

인간의 게놈은 모두 22쌍의 상동 염색체와 두 개의 성염색체로 이루어져 있다. 그러나 모든 인간이 각기 다른 외모와 성격을 갖는다. 이는 개개인이 같은 수의 염색체를 가지고 있기는 하지만, 그 염색체로부터 발현되는 유전자들에 의한 표현형이 모두 다르기 때문이다. 개개인의 이러한 유전자 발현의 다양성은 곧 유전자를 가지고 있는 게놈의 유전적 다양성 때문이다. 이러한 유전적 다양성은 많은 연구가 수행되어 왔으며, 최근 인간게놈프로젝트의 완성으로 그 구조가 모두 밝혀졌다. The human genome consists of 22 pairs of homologous chromosomes and two sex chromosomes. But every human being has a different appearance and character. This is because each person has the same number of chromosomes, but all of the phenotypes of the genes expressed from that chromosome are different. The diversity of these gene expressions in individuals is due to the genetic diversity of the genome carrying the gene. Much research has been done on this genetic diversity, and the structure of the genetic genome has recently been revealed.

게놈에서의 유전적 다양성을 대표하는 것으로는 Transposable element, STR (short tandem repeats), Microsatellites, STS (sequence tagged site), VNTR (variable number tandem repeat), SNP (single nucleotide polymorphism : 단일염기다형) 등이 알려져 있다. 이중 SNP는 단일염기다형으로서, 인간 게놈 상에서 약 1000개의 뉴클레오티드 중 1개의 빈도로 나타나며 동일한 종의 개체 사이의 단일뉴클레오티드 변이의 형태를 취한다. 이러한 단일염기다형이 나타내는 유전학에서의 의미는 그 위치에 따라 각 개체에 큰 차이를 가져온다. 예를 들면, 단일염기다형이 단백질을 암호화하고 있는 위치에 존재할 경우, 단백질의 구조에 영향을 미칠 수 있게 되어 단백질 기능이 달라질 수 있게 되며, 질병과도 연관될 수 있다. 다른 예로 단일염기다형이 단백질을 암호화하지 않는 다른 유전자상에서 존재할 경우, 즉 프로모터(promoter) 또는 인트론(intron)에 존재할 경우, 각각에 대하여 단백질의 발현 수준에 차이를 가져와 그 단백질의 전체적인 활성이 줄어들 수 있고, 변성적인 인트론 제거 과정(alternative splicing)을 통하여 단백질이 비정상적으로 발현될 수도 있다. Genetic diversity in the genome is represented by transposable elements, short tandem repeats (STRs), microsatellites, sequence tagged sites (STS), variable number tandem repeats (VNTRs), and single nucleotide polymorphisms (SNPs). Known. Dual SNPs are monobasic, appearing at a frequency of about one thousand nucleotides on the human genome and taking the form of single nucleotide variations between individuals of the same species. The meaning in the genetics represented by these monobasic polymorphisms makes a big difference in each individual depending on its location. For example, when a monobasic polymorph is present at a location that encodes a protein, it may affect the structure of the protein, altering protein function and associated with disease. In another example, if a monobasic polymorph is present on another gene that does not encode a protein, ie, on a promoter or intron, the expression level of the protein may differ for each, resulting in a decrease in the overall activity of the protein. In addition, a protein may be abnormally expressed through alterative splicing.

단일염기다형들은 질병과 같은 특정 표현형(phenotype)의 지표로 사용될 수 있고, 이를 위해서는 가장 우선적으로 인간 유전체에 존재하는 대량의 단일염기다형을 발굴하고 발굴된 단일염기다형은 다수의 인간 유전체 시료에서 전체 인간집단의 1% 이상 존재하는 유전변이형 마커인지를 조사하는 검정작업을 거치게 된다. 검정된 SNP은 특정 질병 또는 약물반응과 같은 유전적 차이에 기인된 원인을 찾기 위한 연관성 연구에 사용되게 된다. 예를 들면, 특정 단일염기다형이 환자군과 대조군 간의 비교에서 유의적인 빈도의 차이가 관측된다면 아주 유용한 단일염기다형으로 선발되어 임상실험을 거치게 될 것이며 임상실험의 결과가 계속적으로 진행되어 그 결과가 좋을 때는 경제적으로 아주 유용한 가치를 지니는 유전지표로 실제 임상에 사용할 수 있게 될 것이다.Single base polymorphisms can be used as an indicator of a specific phenotype such as disease. To do this, first of all, a large number of single base polymorphisms present in the human genome are identified and the single base polymorphism is found in a number of human genome samples. An assay will be conducted to determine whether the genetic marker is present in more than 1% of the human population. Assayed SNPs will be used in association studies to find the cause due to genetic differences such as specific diseases or drug responses. For example, if a single base polymorphism is found to be significantly different in the comparison between the patient and control group, it will be selected as a very useful single base polymorphism and subjected to clinical trials. It is a genetic indicator of economically useful value and will be available for actual clinical use.

이러한 단일염기다형의 발견을 위한 종래 기술로는 제한효소를 이용한 제한 단편화 길이 다형성(Restriction Fragment Length Polymorphism; RFLP), 대립형질 특이적 혼성화(Allele-specific hybridization)를 이용한 TaqMan^TM 프로브 방법, 용융온도(melting temperature; Tm)을 이용한 역동적 대립형질 특이적 혼성화(Dynamic Allele-Specific Hybridization; DASH)방법, 중합반응을 이용한 pyrosequencing등이 있다. 최근, 마이크로어레이 기술은 단일 염기 신장(Single Base Extension), 또는 대립인자 특이적 프라이머 신장(Allele Specific Primer Extension)이라는 원리를 이용한 프로브들을 조그만 기판위에 집적화시켜 심은 후, 목적 DNA들과의 혼성화와 신장(Extension)을 시킴으로써 단일염기다형을 발견해 내고 있다. Conventional techniques for the discovery of such monobasic polymorphisms include Restriction Fragment Length Polymorphism (RFLP) using restriction enzymes, TaqMan ^™ probe method using Allele-specific hybridization, and melting temperature ( dynamic allele-specific hybridization (DASH) using melting temperature (Tm) and pyrosequencing using polymerization. Recently, microarray technology has integrated and planted probes on a small substrate using the principle of Single Base Extension or Allele Specific Primer Extension, and then hybridized and stretched with target DNAs. Single base polymorphisms are found by extension.

한편, 위암의 발생률과 사망률은 감소하고 있으나, 아직도 전 세계에서 두 번째로 많이 발생하는 악성 종양이다. 우리나라의 경우, 남성과 여성에서의 암 발생률은 위암이 1위를 차지하고 사망률의 경우 2위를 차지한다. 위암과 깊게 관련되는 Helicobacter pylori (H. pylori) 감염은 사람에서 가장 흔한 만성 감염으로서 전 세계 인구의 50% 이상이 감염되어 있다. 우리나라의 H. pylori 감염률은 전 국민에서 46.6%, 16세 이상의 성인에서는 69.4%, 40대 에서는 78.5% 라는 높은 감염률을 보인다. 위암은 로렌 분류법(Lauren classification) 에 의해 장형 위암(Intestinal type of gastric cancer) 과 미만형 위암(Diffuse type of gastric cancer)로 분류 된다. 장형 위암은 H. pylori 감염과 관련되어 만성 위염(chronic gastritis), 위축성 위염(atrophic gastritis), 장상피화생(intestinal metaplasia), 이형성증(displasia), 장형 위암(Intestinal type of gastric cancer)의 순서로 진행된다. 반면 미만형 위암은 정상적인 점막으로 부터 H. pylori 감염과 관계없이 유발 될 수 있다. 우리나라에서 H. pylori 감염률이 높다는 것은 장형 위암에 대한 위험이 높다는 것을 의미 한다. The incidence and mortality rate of gastric cancer is decreasing, but it is still the second most common malignancy in the world. In Korea, the incidence of cancer in men and women is ranked first in gastric cancer and second in mortality. Helicobacter pylori (H. pylori) infection, which is closely related to gastric cancer, is the most common chronic infection in humans, affecting more than 50% of the world's population. The prevalence of H. pylori infection in Korea is high at 46.6%, 69.4% in adults over 16 years old, and 78.5% in 40s. Gastric cancer is classified into Intestinal type of gastric cancer and Diffuse type of gastric cancer by Lauren classification. Enteric gastric cancer is associated with H. pylori infection in the order of chronic gastritis, atrophic gastritis, intestinal metaplasia, displasia, and intestinal type of gastric cancer. do. Diffuse gastric cancer, on the other hand, can be induced from normal mucous membranes regardless of H. pylori infection. A high H. pylori infection rate in Korea means a high risk for intestinal gastric cancer.

조기 위암은 5년 생존률이 95～100%로 높지만 증상을 나타내지 않는 경우가 80% 를 차지한다. 이는 조기 진단의 중요성과 함께 적절한 조기 진단법의 개발이 필요함을 뜻한다. H. pylori 감염된 100명 중 1～2명에서만 위암이 발생하기 때문에 H. pylori 감염만으로 위암 발생 위험군으로 판단 내리는 것은 적합하지 않다. 위암의 전 단계에 속하는 만성 위염 환자 사이에서 장형 위암이 발생할 위험성이 있는 사람을 미리 예측하는 것과 위암 발생에 관여하는 인자를 규명하는 것은 국민 보건에 중요하며 위암을 예방하기 위한 검사의 필수 요건이다. 장형 위암에 특이적으로 나타날 수 있는 단일염기다형을 찾아내는 것은 장형 위암을 조기에 진단할 수 있으며, 더 나아가 병태생리학적 경로를 밝히고 치료 방법을 개발하는데까지 이용될 수 있을 것이다.Early gastric cancer has a 5-year survival rate of 95 to 100%, but 80% of the cases show no symptoms. This implies the need for early diagnosis, along with the importance of early diagnosis. Since only 1 or 2 of 100 H. pylori infections develop gastric cancer, it is not appropriate to determine H. pylori infection as a risk group for gastric cancer. Predicting the risk of developing gastrointestinal cancer among patients with chronic gastritis at all stages of gastric cancer and identifying the factors involved in the development of gastric cancer is important for public health and is an essential requirement for tests to prevent stomach cancer. Identifying monobasic polymorphisms that may be specific to enteric gastric cancer can be used to diagnose early gastric cancer and further develop pathophysiological pathways and develop therapeutic methods.

본 발명자들은 한국인의 장형 위암 환자에서 특이적으로 나타나는 단일염기다형을 찾아내고자 연구를 거듭한 결과, PRKCI, MAPK10, SPP1, IQGAP2, FGFR4, NOTCH4, HLA-DRA, HLA-DOA, THBS2, DFNA5, TBXAS1, TNKS, CDH17, UBR5, KIAA0196, 및 NSMCE2 유전자로부터 장형 위암환자에 특이적으로 나타나는 단일염기다형 부위를 발견하여 본 발명을 완성하게 되었다. The present inventors conducted a study to find a single nucleotide polymorphism that appears specifically in Korean patients with gastric cancer, PRKCI, MAPK10, SPP1, IQGAP2, FGFR4, NOTCH4, HLA-DRA, HLA-DOA, THBS2, DFNA5, TBXAS1 The present invention was completed by finding a single nucleotide polymorphism site that is specific to enteric gastric cancer patients from the TNKS, CDH17, UBR5, KIAA0196, and NSMCE2 genes.

따라서, 본 발명은 장형 위암과 관련된 단일염기다형으로서 PRKCI, MAPK10, SPP1, IQGAP2, FGFR4, NOTCH4, HLA-DRA, HLA-DOA, THBS2, DFNA5, TBXAS1, TNKS, CDH17, UBR5, KIAA0196, 및 NSMCE2 유전자로부터 유래된 단일염기다형을 포함하는 폴리뉴클레오티드 또는 그의 상보적 폴리뉴클레오티드를 제공하는 것을 목적으로 한다.Accordingly, the present invention is directed to the PRKCI, MAPK10, SPP1, IQGAP2, FGFR4, NOTCH4, HLA-DRA, HLA-DOA, THBS2, DFNA5, TBXAS1, TNKS, CDH17, UBR5, KIAA0196, and NSMCE2 genes. It is an object to provide a polynucleotide comprising a monobasic polymorph derived from or a complementary polynucleotide thereof.

또한, 본 발명은 상기 폴리뉴클레오티드 또는 그의 상보적 뉴클레오티드로 이루어진 장형 위암 진단을 위한 증폭용 프라이머 또는 장형 위암 진단용 프로브를 제공하는 것을 목적으로 한다.In addition, an object of the present invention is to provide an amplification primer for diagnosing enteric gastric cancer or a probe for diagnosing enteric gastric cancer comprising the polynucleotide or its complementary nucleotides.

또한, 본 발명은 상기 프로브를 포함하는 마이크로어레이를 제공하는 것을 목적으로 한다.Another object of the present invention is to provide a microarray including the probe.

또한, 본 발명은 상기 폴리뉴클레오티드 또는 그의 상보적 뉴클레오티드를 이용한 장형 위암 진단 방법을 제공하는 것을 목적으로 한다.In addition, an object of the present invention is to provide a method for diagnosing enteric gastric cancer using the polynucleotide or its complementary nucleotide.

본 발명의 일 태양에 따라, PRKCI, MAPK10, SPP1, IQGAP2, FGFR4, NOTCH4, HLA-DRA, HLA-DOA, THBS2, DFNA5, TBXAS1, TNKS, CDH17, UBR5, KIAA0196, 및 NSMCE2 유전자로부터 유래된 특정 단일염기다형을 포함하는 폴리뉴클레오티드 또는 그의 상보적 폴리뉴클레오티드가 제공된다.According to one aspect of the invention, certain single genes derived from the PRKCI, MAPK10, SPP1, IQGAP2, FGFR4, NOTCH4, HLA-DRA, HLA-DOA, THBS2, DFNA5, TBXAS1, TNKS, CDH17, UBR5, KIAA0196, and NSMCE2 genes Polynucleotides comprising the base polymorphism or complementary polynucleotides thereof are provided.

또한, 본 발명의 다른 태양에 따라, 상기 폴리뉴클레오티드 또는 그의 상보적 폴리뉴클레오티드로 이루어진, 장형 위암 진단을 위한 증폭용 프라이머가 제공된다.In addition, according to another aspect of the present invention, an amplification primer for diagnosing enteric gastric cancer, comprising the polynucleotide or a complementary polynucleotide thereof, is provided.

또한, 본 발명의 다른 태양에 따라, 상기 폴리뉴클레오티드 또는 그의 상보적 폴리뉴클레오티드로 이루어진, 장형 위암 진단용 프로브 및 이를 포함한 마이크로어레이가 제공된다.In addition, according to another aspect of the present invention, there is provided a enteric gastric cancer diagnostic probe and a microarray comprising the polynucleotide or a complementary polynucleotide thereof.

또한, 본 발명의 다른 태양에 따라, 상기 폴리뉴클레오티드 또는 그의 상보적 폴리뉴클레오티드를 포함하는 장형 위암 진단용 키트가 제공된다.In addition, according to another aspect of the present invention, there is provided a kit for diagnosing enteric gastric cancer comprising the polynucleotide or a complementary polynucleotide thereof.

또한, 본 발명의 다른 태양에 따라, 장형 위암 진단에 필요한 정보를 제공하기 위하여, (i) 검체로부터 핵산 시료를 얻는 단계; 및 (ii) 상기 핵산 시료로부터, 서열번호 1 내지 6, 10 내지 38, 41, 및 44 내지 45의 폴리뉴클레오티드로부터 하나 이상 선택된 폴리뉴클레오티드 또는 그의 상보적 폴리뉴클레오티드의 다형성 부위의 염기 서열을 분석하는 단계를 포함하는, 검체 중의 단일염기다형을 검출하는 방법이 제공된다.In addition, according to another aspect of the invention, to provide information necessary for the diagnosis of enteric gastric cancer, (i) obtaining a nucleic acid sample from a sample; And (ii) analyzing the nucleotide sequence of the polymorphic site of at least one polynucleotide selected from polynucleotides of SEQ ID NOs: 1 to 6, 10 to 38, 41, and 44 to 45 or complementary polynucleotides thereof from the nucleic acid sample. Provided is a method comprising detecting a monobasic polymorph in a sample.

또한, 본 발명의 다른 태양에 따라, 장형 위암 진단에 필요한 정보를 제공하기 위하여, (i) 검체로부터 핵산 시료를 얻는 단계 및 (ii) 상기 핵산 시료로부터, 반수체형의 분석을 포함하는 것을 특징으로 하는, 검체 중의 단일염기다형을 검출하는 방법이 제공된다.Further, according to another aspect of the present invention, in order to provide information necessary for diagnosing enteric gastric cancer, (i) obtaining a nucleic acid sample from a sample, and (ii) analyzing the haplotype from the nucleic acid sample A method for detecting a monobasic polymorph in a sample is provided.

본 발명에 따른 폴리뉴클레오티드 (a) 내지 (al)로 이루어진 군으로부터 하나 이상 선택된 폴리뉴클레오티드 또는 그의 상보적 뉴클레오티드는 장형 위암의 진단에 유용하게 사용될 수 있다.One or more polynucleotides selected from the group consisting of polynucleotides (a) to (al) according to the present invention or complementary nucleotides thereof may be usefully used for the diagnosis of enteric gastric cancer.

또한, 상기 폴리뉴클레오티드 또는 그의 상보적 뉴클레오티드로 이루어진 프라이머 또는 프로브, 및 상기 폴리뉴클레오티드 또는 그의 상보적 뉴클레오티드를 포함하는 키트는 장형 위암의 진단에 유용하게 사용될 수 있다.In addition, a primer or probe consisting of the polynucleotide or its complementary nucleotides, and a kit comprising the polynucleotide or the complementary nucleotides thereof, may be usefully used for the diagnosis of enteric gastric cancer.

또한 본 발명의 분석방법은 장형 위암 진단에 필요한 정보를 제공하는데 유용하게 사용될 수 있다.In addition, the analysis method of the present invention can be usefully used to provide information necessary for diagnosing gastric cancer.

본 발명은 장형 위암과 관련된 것으로 새롭게 밝혀진 PRKCI, MAPK10, SPP1, IQGAP2, FGFR4, NOTCH4, HLA-DRA, HLA-DOA, THBS2, DFNA5, TBXAS1, TNKS, CDH17, UBR5, KIAA0196, NSMCE2 유전자로부터 유래하는 하기 폴리뉴클레오티드 (a) 내지 (al)로 이루어진 군으로부터 하나 이상 선택된 폴리뉴클레오티드 또는 그의 상보적 폴리뉴클레오티드를 제공한다. 즉, 본 발명은 서열번호 1의 DNA 서열로 구성된 폴리뉴클레오티드에서, 501번째 염기(다형성 부위)가 C이고, 상기 501번째 염기를 포함하는 10개 이상의 연속 DNA 서열로 구성되는 폴리뉴클레오티드 (a); 서열번호 2의 DNA 서열로 구성된 폴리뉴클레오티드에서, 501번째 염기(다형성 부위)가 G이고, 상기 501번째 염기를 포함하는 10개 이상의 연속 DNA 서열로 구성되는 폴리뉴클레오티드 (b); 서열번호 3의 DNA 서열로 구성된 폴리뉴클레오티드에서, 501번째 염기(다형성 부위)가 A이고, 상기 501번째 염기를 포함하는 10개 이상의 연속 DNA 서열로 구성되는 폴리뉴클레오티드 (c); 서열번호 4의 DNA 서열로 구성된 폴리뉴클레오티드에서, 251번째 염기(다형성 부위)가 A이고, 상기 251번째 염기를 포함하는 10개 이상의 연속 DNA 서열로 구성되는 폴리뉴클레오티드 (d); 서열번호 5의 DNA 서열로 구성된 폴리뉴클레오티드에서, 251번째 염기(다형성 부위)가 C고, 상기 251번째 염기를 포함하는 10개 이상의 연속 DNA 서열로 구성되는 폴리뉴클레오티드 (e); 서열번호 6의 DNA 서열로 구성된 폴리뉴클레오티드에서, 301번째 염기(다형성 부위)가 C이고, 상기 301번째 염기를 포함하는 10개 이상의 연속 DNA 서열로 구성되는 폴리뉴클레오티드 (f); 서열번호 10의 DNA 서열로 구성된 폴리뉴클레오티드에서, 1093번째 염기(다형성 부위)가 T이고, 상기 1093번째 염기를 포함하는 10개 이상의 연속 DNA 서열로 구성되는 폴리뉴클레오티드 (g); 서열번호 11의 DNA 서열로 구성된 폴리뉴클레오티드에서, 1018번째 염기(다형성 부위)가 A이고, 상기 1018번째 염기를 포함하는 10개 이상의 연속 DNA 서열로 구성되는 폴리뉴클레오티드 (h); 서열번호 12의 DNA 서열로 구성된 폴리뉴클레오티드에서, 201번째 염기(다형성 부위)가 A이고, 상기 201번째 염기를 포함하는 10개 이상의 연속 DNA 서열로 구성되는 폴리뉴클레오티드 (i); 서열번호 13의 DNA 서열로 구성된 폴리뉴클레오티드에서, 1001번째 염기(다형성 부위)가 G이고, 상기 1001번째 염기를 포함하는 10개 이상의 연속 DNA 서열로 구성되는 폴리뉴클레오티드 (j); 서열번호 14의 DNA 서열로 구성된 폴리뉴클레오티드에서, 345번째 염기(다형성 부위)가 G이고, 상기 345번째 염기를 포함하는 10개 이상의 연속 DNA 서열로 구성되는 폴리뉴클레오티드 (k); 서열번호 15의 DNA 서열로 구성된 폴리뉴클레오티드에서, 1001번째 염기(다형성 부위)가 A이고, 상기 1001번째 염기를 포함하는 10개 이상의 연속 DNA 서열로 구성되는 폴리뉴클레오티드 (l); 서열번호 16의 DNA 서열로 구성된 폴리뉴클레오티드에서, 251번째 염기(다형성 부위)가 G이고, 상기 251번째 염기를 포함하는 10개 이상의 연속 DNA 서열로 구성되는 폴리뉴클레오티드 (m); 서열번호 17의 DNA 서열로 구성된 폴리뉴클레오티드에서, 401번째 염기(다형성 부위)가 G이고, 상기 401번째 염기를 포함하는 10개 이상의 연속 DNA 서열로 구성되는 폴리뉴클레오티드 (n); 서열번호 18의 DNA 서열로 구성된 폴리뉴클레오티드에서, 301번째 염기(다형성 부위)가 G이고, 상기 301번째 염기를 포함하는 10개 이상의 연속 DNA 서열로 구성되는 폴리뉴클레오티드 (o); 서열번호 19의 DNA 서열로 구성된 폴리뉴클레오티드에서, 301번째 염기(다형성 부위)가 T이고, 상기 301번째 염기를 포함하는 10개 이상의 연속 DNA 서열로 구성되는 폴리뉴클레오티드 (p); 서열번호 20의 DNA 서열로 구성된 폴리뉴클레오티드에서, 251번째 염기(다형성 부위)가 C이고, 상기 251번째 염기를 포함하는 10개 이상의 연속 DNA 서열로 구성되는 폴리뉴클레오티드 (q); 서열번호 21의 DNA 서열로 구성된 폴리뉴클레오티드에서, 301번째 염기(다형성 부위)가 A이고, 상기 301번째 염기를 포함하는 10개 이상의 연속 DNA 서열로 구성되는 폴리뉴클레오티드 (r); 서열번호 22의 DNA 서열로 구성된 폴리뉴클레오티드에서, 301번째 염기(다형성 부위)가 A이고, 상기 301번째 염기를 포함하는 10개 이상의 연속 DNA 서 열로 구성되는 폴리뉴클레오티드 (s); 서열번호 23의 DNA 서열로 구성된 폴리뉴클레오티드에서, 166번째 염기(다형성 부위)가 G이고, 상기 166번째 염기를 포함하는 10개 이상의 연속 DNA 서열로 구성되는 폴리뉴클레오티드 (t); 서열번호 24의 DNA 서열로 구성된 폴리뉴클레오티드에서, 101번째 염기(다형성 부위)가 C이고, 상기 101번째 염기를 포함하는 10개 이상의 연속 DNA 서열로 구성되는 폴리뉴클레오티드 (u); 서열번호 25의 DNA 서열로 구성된 폴리뉴클레오티드에서, 301번째 염기(다형성 부위)가 A이고, 상기 301번째 염기를 포함하는 10개 이상의 연속 DNA 서열로 구성되는 폴리뉴클레오티드 (v); 서열번호 26의 DNA 서열로 구성된 폴리뉴클레오티드에서, 360번째 염기(다형성 부위)가 T이고, 상기 360번째 염기를 포함하는 10개 이상의 연속 DNA 서열로 구성되는 폴리뉴클레오티드 (w); 서열번호 27의 DNA 서열로 구성된 폴리뉴클레오티드에서, 301번째 염기(다형성 부위)가 G이고, 상기 301번째 염기를 포함하는 10개 이상의 연속 DNA 서열로 구성되는 폴리뉴클레오티드 (x); 서열번호 28의 DNA 서열로 구성된 폴리뉴클레오티드에서, 251번째 염기(다형성 부위)가 C이고, 상기 251번째 염기를 포함하는 10개 이상의 연속 DNA 서열로 구성되는 폴리뉴클레오티드 (y); 서열번호 29의 DNA 서열로 구성된 폴리뉴클레오티드에서, 251번째 염기(다형성 부위)가 C이고, 상기 251번째 염기를 포함하는 10개 이상의 연속 DNA 서열로 구성되는 폴리뉴클레오티드 (z); 서열번호 30의 DNA 서열로 구성된 폴리뉴클레오티드에서, 251번째 염기(다형성 부위)가 A이고, 상기 251번째 염기를 포함하는 10개 이상의 연속 DNA 서열로 구성되는 폴리뉴클레오티드 (aa); 서열번호 31의 DNA 서열로 구성된 폴리뉴클레오티드에서, 301번째 염기(다형성 부위)가 C이고, 상기 301번째 염기를 포함하는 10개 이상의 연속 DNA 서열로 구성되는 폴리뉴클레오티드 (ab); 서열번호 32의 DNA 서열로 구성된 폴리뉴클레오티드에서, 201번째 염기(다형성 부위)가 A이고, 상기 201번째 염기를 포함하는 10개 이상의 연속 DNA 서열로 구성되는 폴리뉴클레오티드 (ac); 서열번호 33의 DNA 서열로 구성된 폴리뉴클레오티드에서, 301번째 염기(다형성 부위)가 T이고, 상기 301번째 염기를 포함하는 10개 이상의 연속 DNA 서열로 구성되는 폴리뉴클레오티드 (ad); 서열번호 34의 DNA 서열로 구성된 폴리뉴클레오티드에서, 201번째 염기(다형성 부위)가 A이고, 상기 201번째 염기를 포함하는 10개 이상의 연속 DNA 서열로 구성되는 폴리뉴클레오티드 (ae); 서열번호 35의 DNA 서열로 구성된 폴리뉴클레오티드에서, 501번째 염기(다형성 부위)가 C이고, 상기 501번째 염기를 포함하는 10개 이상의 연속 DNA 서열로 구성되는 폴리뉴클레오티드 (af); 서열번호 36의 DNA 서열로 구성된 폴리뉴클레오티드에서, 301번째 염기(다형성 부위)가 A이고, 상기 301번째 염기를 포함하는 10개 이상의 연속 DNA 서열로 구성되는 폴리뉴클레오티드 (ag); 서열번호 37의 DNA 서열로 구성된 폴리뉴클레오티드에서, 301번째 염기(다형성 부위)가 C이고, 상기 301번째 염기를 포함하는 10개 이상의 연속 DNA 서열로 구성되는 폴리뉴클레오티드 (ah); 서열번호 38의 DNA 서열로 구성된 폴리뉴클레오티드에서, 301번째 염기(다형성 부위)가 T이고, 상기 301번째 염기를 포함하는 10개 이상의 연속 DNA 서열로 구성되는 폴리뉴클레오티드 (ai); 서열번호 41의 DNA 서열로 구성된 폴리뉴클레오티드에서, 251번째 염기(다형성 부위)가 C이고, 상기 251번째 염기를 포함하는 10개 이상의 연속 DNA 서열로 구성되는 폴리뉴클레오티드 (aj); 서열번호 44의 DNA 서열로 구성된 폴리뉴클레오티드에서, 301번째 염기(다형성 부위)가 G이고, 상기 301번째 염기를 포함하는 10개 이상의 연속 DNA 서열로 구성되는 폴리뉴클레오티드 (ak); 및 서열번호 45의 DNA 서열로 구성된 폴리뉴클레오티드에서, 490번째 염기(다형성 부위)가 T이고, 상기 490번째 염기를 포함하는 10개 이상의 연속 DNA 서열로 구성되는 폴리뉴클레오티드 (al)로 이루어진 군으로부터 하나 이상 선택된 폴리뉴클레오티드 또는 그의 상보적 폴리뉴클레오티드를 제공한다.The present invention is derived from the PRKCI, MAPK10, SPP1, IQGAP2, FGFR4, NOTCH4, HLA-DRA, HLA-DOA, THBS2, DFNA5, TBXAS1, TNKS, CDH17, UBR5, KIAA0196, NSMCE2 genes newly found to be related to enteric gastric cancer Provided are one or more polynucleotides selected from the group consisting of polynucleotides (a) to (al) or complementary polynucleotides thereof. That is, the present invention provides a polynucleotide consisting of the DNA sequence of SEQ ID NO: 1, wherein the 501th base (polymorphic site) is C, the polynucleotide consisting of 10 or more consecutive DNA sequences comprising the 501th base; A polynucleotide consisting of the DNA sequence of SEQ ID NO: 2, wherein the 501th base (polymorphic site) is G and the polynucleotide consists of 10 or more consecutive DNA sequences comprising the 501th base; A polynucleotide consisting of the DNA sequence of SEQ ID NO: 3, wherein the 501th base (polymorphic site) is A and is composed of 10 or more contiguous DNA sequences comprising the 501th base (c); A polynucleotide consisting of the DNA sequence of SEQ ID NO: 4, wherein the 251 st base (polymorphic site) is A and the polynucleotide consists of 10 or more consecutive DNA sequences comprising the 251 st base; In a polynucleotide consisting of a DNA sequence of SEQ ID NO: 5, a polynucleotide (e) consisting of ten or more consecutive DNA sequences comprising the 251 st base (polymorphic site) C and comprising the 251 st base; In a polynucleotide consisting of the DNA sequence of SEQ ID NO: 6, a polynucleotide (f) consisting of ten or more consecutive DNA sequences comprising the 301 th base (polymorphic site) is C and comprising the 301 th base; In a polynucleotide consisting of the DNA sequence of SEQ ID NO: 10, the polynucleotide (g) consisting of 10 or more consecutive DNA sequences comprising the 1093th base (T polymorphic site) is T and comprising the 1093th base; In a polynucleotide consisting of the DNA sequence of SEQ ID NO: 11, a polynucleotide consisting of 10 or more contiguous DNA sequences including the 1018th base (polymorphic site) is A and the 1018th base (h); A polynucleotide consisting of the DNA sequence of SEQ ID NO: 12, wherein the 201 base (polymorphic site) is A and the polynucleotide (i) consists of 10 or more consecutive DNA sequences comprising said 201 base; In a polynucleotide consisting of the DNA sequence of SEQ ID NO: 13, a polynucleotide (j) consisting of ten or more contiguous DNA sequences comprising a 1001 base (polymorphic site) G and comprising the 1001 base; A polynucleotide consisting of the DNA sequence of SEQ ID NO: 14, wherein the 345th base (polymorphic site) is G and the polynucleotide (k) consists of 10 or more consecutive DNA sequences comprising said 345th base; In a polynucleotide consisting of the DNA sequence of SEQ ID NO: 15, a polynucleotide (l) consisting of ten or more consecutive DNA sequences comprising the 1001th base (polymorphic site) A and comprising the 1001th base; In a polynucleotide consisting of the DNA sequence of SEQ ID NO: 16, a nucleotide of 251 (polymorphic site) is G and a polynucleotide (m) consisting of 10 or more consecutive DNA sequences comprising the 251 th base; In a polynucleotide consisting of the DNA sequence of SEQ ID NO: 17, a polynucleotide (n) consisting of ten or more contiguous DNA sequences comprising a 401 th base (polymorphic site) G and comprising the 401 th base; In a polynucleotide consisting of the DNA sequence of SEQ ID NO: 18, the polynucleotide (o) consisting of 10 or more consecutive DNA sequences comprising the 301 th base (polymorphic site) G and comprising the 301 th base; In a polynucleotide consisting of the DNA sequence of SEQ ID NO: 19, a polynucleotide (p) consisting of 10 or more contiguous DNA sequences comprising the 301 th base (polymorphic site) is T and the 301 th base; In a polynucleotide consisting of the DNA sequence of SEQ ID NO: 20, the polynucleotide (q) consisting of 10 or more consecutive DNA sequences comprising the 251 st base (polymorphic site) is C and comprising the 251 st base; In a polynucleotide consisting of the DNA sequence of SEQ ID NO: 21, a polynucleotide (r) consisting of 10 or more consecutive DNA sequences comprising the 301th base (polymorphic site) A and comprising the 301th base; In the polynucleotide consisting of the DNA sequence of SEQ ID NO: 22, the polynucleotide (s) consisting of 10 or more consecutive DNA sequences comprising the 301th base (polymorphic site) is A, the 301th base; A polynucleotide consisting of the DNA sequence of SEQ ID NO: 23, wherein the 166th base (polymorphic site) is G and the polynucleotide (t) is composed of 10 or more consecutive DNA sequences comprising the 166th base; In a polynucleotide consisting of the DNA sequence of SEQ ID NO: 24, a polynucleotide (u) consisting of ten or more consecutive DNA sequences comprising the 101 st base (polymorphic site) is C and comprising the 101 st base; In a polynucleotide consisting of the DNA sequence of SEQ ID NO: 25, the polynucleotide (v) consisting of 10 or more consecutive DNA sequences comprising the 301th base (polymorphic site) is A and the 301th base; A polynucleotide consisting of the DNA sequence of SEQ ID NO: 26, wherein the 360th base (polymorphic site) is T and the polynucleotide (w) consists of 10 or more consecutive DNA sequences comprising the 360th base; In a polynucleotide consisting of the DNA sequence of SEQ ID NO: 27, a polynucleotide (x) consisting of ten or more consecutive DNA sequences comprising the 301 th base (polymorphic site) G and comprising the 301 th base; A polynucleotide consisting of the DNA sequence of SEQ ID NO: 28, wherein the 251 st base (polymorphic site) is C and the polynucleotide (y) consists of 10 or more contiguous DNA sequences comprising the 251 st base; A polynucleotide consisting of the DNA sequence of SEQ ID NO: 29, wherein the 251 st base (polymorphic site) is C and the polynucleotide (z) consists of 10 or more contiguous DNA sequences comprising the 251 st base; In a polynucleotide consisting of the DNA sequence of SEQ ID NO: 30, a nucleotide 251 (polymorphic site) is A and a polynucleotide (aa) consisting of 10 or more consecutive DNA sequences comprising the 251 th base; In a polynucleotide consisting of the DNA sequence of SEQ ID NO: 31, the polynucleotide (ab) consisting of 10 or more consecutive DNA sequences comprising the 301 th base (polymorphic site) is C and comprising the 301 th base; A polynucleotide consisting of the DNA sequence of SEQ ID NO: 32, wherein the 201 base (polymorphic site) is A and the polynucleotide (ac) consists of 10 or more consecutive DNA sequences comprising said 201 base; A polynucleotide consisting of the DNA sequence of SEQ ID NO: 33, wherein the 301th base (polymorphic site) is T and is composed of 10 or more consecutive DNA sequences comprising the 301th base (ad); A polynucleotide consisting of the DNA sequence of SEQ ID NO: 34, wherein the 201 base (polymorphic site) is A, and the polynucleotide (ae) is composed of 10 or more consecutive DNA sequences comprising the 201 base; A polynucleotide consisting of the DNA sequence of SEQ ID NO: 35, wherein the 501th base (polymorphic site) is C and a polynucleotide (af) consisting of 10 or more consecutive DNA sequences comprising said 501th base; A polynucleotide consisting of a DNA sequence of SEQ ID NO: 36, wherein the 301th base (polymorphic site) is A and is composed of 10 or more consecutive DNA sequences comprising the 301th base (ag); A polynucleotide consisting of the DNA sequence of SEQ ID NO: 37, wherein the 301th base (polymorphic site) is C and a polynucleotide (ah) consisting of 10 or more consecutive DNA sequences comprising the 301th base; In a polynucleotide consisting of the DNA sequence of SEQ ID NO: 38, the polynucleotide (ai) consisting of 10 or more consecutive DNA sequences comprising the 301th base (T polymorphic site) is T and the 301th base; A polynucleotide consisting of the DNA sequence of SEQ ID NO: 41, wherein the 251 th base (polymorphic site) is C and a polynucleotide (aj) consisting of 10 or more consecutive DNA sequences comprising said 251 th base; A polynucleotide consisting of the DNA sequence of SEQ ID NO: 44, wherein the 301th base (polymorphic site) is G and the polynucleotide (ak) is composed of 10 or more consecutive DNA sequences comprising the 301th base; And a polynucleotide consisting of the DNA sequence of SEQ ID NO: 45, wherein the 490th base (polymorphic site) is T and one from the group consisting of polynucleotides (al) consisting of 10 or more consecutive DNA sequences comprising the 490th base Provided above is a polynucleotide selected or complementary polynucleotides thereof.

상기 서열번호 1 내지 6, 10 내지 38, 41, 및 44 내지 45의 DNA 서열은 만성 위염 환자군과 장형 위암 환자군에서 통계학적으로 유의성 있는 차이(유의수준 p value < 0.05)를 보이는 단일염기다형으로서, 이들은 PRKCI, MAPK10, SPP1, IQGAP2, FGFR4, NOTCH4, HLA-DRA, HLA-DOA, THBS2, DFNA5, TBXAS1, TNKS, CDH17, UBR5, KIAA0196, 또는 NSMCE2 유전자에 존재한다.The DNA sequences of SEQ ID NOs: 1 to 6, 10 to 38, 41, and 44 to 45 are single nucleotide polymorphisms having a statistically significant difference (significance level p value <0.05) in patients with chronic gastritis and in patients with gastrointestinal gastric cancer. They are present in the PRKCI, MAPK10, SPP1, IQGAP2, FGFR4, NOTCH4, HLA-DRA, HLA-DOA, THBS2, DFNA5, TBXAS1, TNKS, CDH17, UBR5, KIAA0196, or NSMCE2 genes.

본 발명자들은 장형 위암과 관련된 후보 유전자군으로서 PRKCI, MAPK10, SPP1, IQGAP2, FGFR4, NOTCH4, HLA-DRA, HLA-DOA, THBS2, DFNA5, TBXAS1, TNKS, CDH17, UBR5, KIAA0196, 및 NSMCE2 유전자를 선정하여, 장형 위암 진단에 있어서 분자 수준에서의 진단 지표를 개발하고자 다양한 연구를 수행하였다. 본 발명자들은 한국인 만성 위염 환자 326명과 장형 위암 환자 153명을 대상으로 유전자형 분석 및 통계학적 분석을 수행하였으며, 놀랍게도 수많은 단일염기다형 중 극히 일부의 단일염기다형만이 만성 위염 환자군과 장형 위암 환자군에서 통계적으로 유의성 있는 차이를 나타낸다는 것을 발견하였다. 특히, 서열번호 1 내지 6, 10 내지 38, 41, 및 44 내지 45의 단일염기다형은 하디-와인버그 평형 (Hardy-Weinberg Equilibrium) 및 로지스틱 회기분석 결과, 각각의 다형성 부위에 위험대립인자가 존재함을 확인하였다.We selected PRKCI, MAPK10, SPP1, IQGAP2, FGFR4, NOTCH4, HLA-DRA, HLA-DOA, THBS2, DFNA5, TBXAS1, TNKS, CDH17, UBR5, KIAA0196, and NSMCE2 genes as candidate gene groups associated with enteric gastric cancer Therefore, various studies have been conducted to develop diagnostic indicators at the molecular level in diagnosing gastric cancer. We performed genotyping and statistical analysis on 326 Korean patients with chronic gastritis and 153 patients with gastrointestinal gastric cancer. Surprisingly, only a single polymorphic polymorphic polymorphism was found in patients with chronic gastritis and intestinal gastric cancer. It was found that this represents a significant difference. In particular, the monobasic polymorphs of SEQ ID NOs: 1 to 6, 10 to 38, 41, and 44 to 45 show risk alleles at each polymorphic site as a result of Hardy-Weinberg Equilibrium and Logistic Regression It was confirmed.

본 발명은 상기 폴리뉴클레오티드 (a) 내지 (al)로 이루어진 군으로부터 하나 이상 선택된 폴리뉴클레오티드 또는 그의 상보적 뉴클레오티드와 혼성화하는 폴리뉴클레오티드를 포함하며, 상기 폴리뉴클레오티드 또는 그의 상보적 뉴클레오티드의 길이는 10 내지 100 뉴클레오티드, 바람직하게는 20 내지 60 뉴클레오티드, 더욱 바람직하게는 40 내지 60 뉴클레오티드이다. The present invention includes a polynucleotide hybridizing with at least one polynucleotide selected from the group consisting of the polynucleotides (a) to (al) or a complementary nucleotide thereof, and the length of the polynucleotide or the complementary nucleotide thereof is 10 to 100. Nucleotides, preferably 20 to 60 nucleotides, more preferably 40 to 60 nucleotides.

본 발명에 따른 상기 폴리뉴클레오티드 (a) 내지 (al)로 이루어진 군으로부터 하나 이상 선택된 폴리뉴클레오티드는 다형성 서열이다. 다형성 서열 (polymorphic sequence)이란 뉴클레오티드 서열 중에 단일염기다형을 나타내는 다형성 부위 (polymorphic site)를 포함하는 서열을 말한다. 다형성 부위 (polymorphic site)란 다형성 서열 중 단일염기다형이 일어나는 부위를 말한다. 상기 폴리뉴클레오티드는 DNA 또는 RNA가 될 수 있다.At least one polynucleotide selected from the group consisting of the polynucleotides (a) to (al) according to the present invention is a polymorphic sequence. A polymorphic sequence refers to a sequence comprising a polymorphic site representing a monobasic polymorphism in a nucleotide sequence. A polymorphic site refers to a site where a monobasic polymorphism occurs in the polymorphic sequence. The polynucleotide can be DNA or RNA.

본 발명은 또한, 상기 폴리뉴클레오티드 (a) 내지 (al)로 이루어진 군으로부터 하나 이상 선택된 폴리뉴클레오티드 또는 그의 상보적 폴리뉴클레오티드로 이루어진, 장형 위암 진단을 위한 증폭용 대립형질 특이적 프라이머를 포함한다. 여기서 "프라이머(primer)"란 적절한 버퍼 중의 적절한 조건 (예를 들면, 4개의 다른 뉴클레오시드 트리포스페이트(ATP,GTP,CTP,TTP) 및 DNA 폴리머라제) 및 적당한 온도 하에서 주형-지시 DNA 합성의 시작점으로서 작용할 수 있는 단일가닥 올리고뉴클레오티드를 말한다. 상기 프라이머의 적절한 길이는 사용 목적에 따라 달라질 수 있으나, 통상 10 내지 100, 바람직하게는 10 내지 80 뉴클레오티드이다. 짧은 프라이머 분자는 일반적으로 주형과 안정한 혼성체를 형성하기 위해서는 더 낮은 온도를 필요로 한다. 프라이머 서열은 주형과 완전하게 상보적일 필요는 없으나, 주형과 혼성화 할 정도로 충분히 상보적이어야 한다. 상기 프라이머는 3'-말단이 서열번호 1 내지 6, 10 내지 38, 41, 및 44 내지 45의 단일염기다형과 정렬하는 것이 바람직하다. 상기 프라이머는 다형성 부위를 포함하는 표적 DNA에 혼성화하고, 상기 프라이머가 완전한 상동성을 보이는 대립형질 형태의 DNA에서 증폭을 개시한다. 이 프라이머는 반대편에 혼성화하는 제2 프라이머와 쌍을 이루어 사용된다. 증폭에 의하여 두 개의 프라이머로부터 산물이 증폭되고, 증폭된 산물만을 특이적으로 염색하여 시각화한다. 이는 특정 대립형질 형태가 존재한다는 것을 의미한다. 여기서 사용되는 대립형질 특이적 폴리뉴클레오티드들이 다른 변성화 과정을 거치게 되면, GoldenGate Assay 뿐만 아니라 다른 방법들에 의해서도 사용될 수 있다. The present invention also includes an allele specific primer for amplification for enteric gastric cancer, comprising at least one polynucleotide selected from the group consisting of the polynucleotides (a) to (al) or complementary polynucleotides thereof. “Primer” herein refers to the synthesis of template-directed DNA synthesis under appropriate conditions (eg, four different nucleoside triphosphates (ATP, GTP, CTP, TTP) and DNA polymerase) in a suitable buffer and at a suitable temperature. It refers to a single stranded oligonucleotide that can act as a starting point. The appropriate length of the primer may vary depending on the purpose of use, but is usually 10 to 100, preferably 10 to 80 nucleotides. Short primer molecules generally require lower temperatures to form stable hybrids with the template. The primer sequence need not be completely complementary to the template, but should be sufficiently complementary to hybridize with the template. It is preferred that the primer 3'-end is aligned with the monobasic polymorphism of SEQ ID NO: 1 to 6, 10 to 38, 41, and 44 to 45. The primer hybridizes to the target DNA comprising the polymorphic site and initiates amplification in allelic form of DNA where the primer exhibits complete homology. This primer is used in pairs with a second primer that hybridizes to the other side. The product is amplified from two primers by amplification, and only the amplified product is specifically stained and visualized. This means that certain allelic forms exist. If the allele specific polynucleotides used herein undergo different denaturation processes, they can be used by other methods as well as GoldenGate Assay.

본 발명은 폴리뉴클레오티드 (a) 내지 (al)로 이루어진 군으로부터 하나 이상 선택된 폴리뉴클레오티드 또는 그의 상보적 폴리뉴클레오티드로 이루어진, 장형 위암 진단용 프로브 및 이를 포함하는 장형 위암 진단용 마이크로어레이를 포함한다. 또한, 본 발명은 폴리뉴클레오티드 (a) 내지 (al)로 이루어진 군으로부터 하나 이상 선택된 폴리뉴클레오티드 또는 그의 상보적 폴리뉴클레오티드를 포함하는, 장형 위암 진단용 키트를 포함한다.The present invention includes a probe for diagnosing enteric gastric cancer and a microarray for diagnosing enteric gastric cancer comprising the polynucleotide (a) to at least one selected from the group consisting of (al) or a complementary polynucleotide thereof. The present invention also includes a kit for diagnosing enteric gastric cancer, comprising a polynucleotide selected from the group consisting of polynucleotides (a) to (al) or complementary polynucleotides thereof.

본 발명은 또한 서열번호 1 내지 6, 10 내지 38, 41, 및 44 내지 45로 이루어진 군으로부터 선택된 폴리뉴클레오티드 또는 그의 상보적 폴리뉴클레오티드를 주형으로 하는 프라이머 또는 프로브를 이용한 서열번호 1 내지 6, 10 내지 38, 41, 및 44 내지 45로 이루어진 군으로부터 선택된 폴리뉴클레오티드 또는 그의 상보적 폴리뉴클레오티드의 단일염기다형의 분석방법으로서, 상기 단일염기다형은 폴리뉴클레오티드 (a) 내지 (al)로 이루어진 군으로부터 선택된 DNA 서열(단, 상기 폴리뉴클레오티드 (a) 내지 (al)은 상기에서 정의한 바와 같다)로 구성되는 것을 특징으로 하는 분석방법을 포함한다. 상기 분석 방법에서 상기 DNA 서열의 길이는 10 내지 100 뉴클레오티드, 바람직하게는 20 내지 60 뉴클레오티드, 더욱 바람직하게는 40 내지 60 뉴클레오티드일 수 있다.The invention also relates to SEQ ID NOs: 1 to 6, 10 to 10, using primers or probes based on polynucleotides selected from the group consisting of SEQ ID NOs: 1 to 6, 10 to 38, 41, and 44 to 45 or complementary polynucleotides thereof. 38, 41, and 44-45 polynucleotides selected from the group consisting of or a complementary polynucleotide thereof, wherein the monobasic polymorphism is a polynucleotide (a) to a DNA selected from the group consisting of (al) And, wherein the polynucleotides (a) to (al) are as defined above). In the analysis method, the length of the DNA sequence may be 10 to 100 nucleotides, preferably 20 to 60 nucleotides, more preferably 40 to 60 nucleotides.

본 발명은 또한 장형 위암 진단에 필요한 정보를 제공하기 위하여, (i) 검체로부터 핵산 시료를 얻는 단계; 및 (ii) 상기 핵산 시료로부터, 서열번호 1 내지 6, 10 내지 38, 41, 및 44 내지 45의 폴리뉴클레오티드로부터 하나 이상 선택된 폴리뉴클레오티드 또는 그의 상보적 폴리뉴클레오티드의 다형성 부위의 염기 서열을 분석하는 단계를 포함하는, 검체 중의 단일염기다형을 검출하는 방법을 제공한다. 상기 검출 방법에 있어서, 상기 DNA 서열의 다형성 부위는 다음 표 1과 같다.The present invention also provides a method for diagnosing gastric cancer, comprising: (i) obtaining a nucleic acid sample from a sample; And (ii) analyzing the nucleotide sequence of the polymorphic site of at least one polynucleotide selected from polynucleotides of SEQ ID NOs: 1 to 6, 10 to 38, 41, and 44 to 45 or complementary polynucleotides thereof from the nucleic acid sample. It provides a method for detecting a monobasic polymorphism in the sample. In the detection method, the polymorphic sites of the DNA sequence are shown in Table 1 below.

서열번호SEQ ID NO: 다형성 부위Polymorphic site 1One 501501 22 501501 33 501501 44 251251 55 251251 66 301301 1010 10931093 1111 10181018 1212 201201 1313 10011001 1414 345345 1515 10011001 1616 251251 1717 401401 1818 301301 1919 301301 2020 251251 2121 301301 2222 301301 2323 166166 2424 101101 2525 301301 2626 360360 2727 301301 2828 251251 2929 251251 3030 251251 3131 301301 3232 201201 3333 301301 3434 201201 3535 501501 3636 301301 3737 301301 3838 301301 4141 251251 4444 301301 4545 490490

상기 검체는 혈액, 조직, 세포 등을 포함하며, 핵산 시료는 통상의 방법에 따라 얻을 수 있다. 예를 들어 혈액에 경우 다음과 같이 핵산 시료를 얻을 수 있다. 혈액을 원심분리 실험관으로 옮기고 낮은 농도의 염 성분 완충용액(low-salt buffer solution(10mM Tris-HCl [pH 7.6], 10mM KCl, 10mM MgCl₂, and 2mM EDTA))을 5ml 첨가한다. 그리고 Nonidet P-40을 첨가한 다음, 얻어진 용액을 잘 섞어준 후, 상온에서 10분간 2,200 rpm으로 원심분리한다. 이때 얻어진 덩어리는 고농도의 염 성분 완충용액(high-salt buffer solution(10mM Tris-HCl [pH 7.6], 10mM KCl, 10mM MgCl₂, 0.4M NaCl, and 2mM EDTA))으로 재현탁한다. 게놈 DNA는 50ul의 10% SDS와 섞은 후, 55℃에서 밤새 반응시킨다. 반응 후에, 각 게놈 DNA들은 1.5ml 용량의 원심분리 실험관으로 옮겨, 6M NaCl을 0.3ml 첨가한다. 게놈 DNA들을 잘 섞어준 후, 12,000rpm에서 10분간 원심분리한다. 원심분리 후, 게놈 DNA가 들어있는 상층을 모아서 새로운 시험관으로 옮기고, 상온에서 동량의 100% 에탄올을 첨가한다. 게놈 DNA들이 침전될 때까지 잘 섞어 주고, 침전이 되면 70% 에탄올로 게놈 DNA들을 세척한다. 이 과정이 끝나면 실험관 뚜껑을 열어, 용액성분이 마르도록 놓아둔다. 그리고 TE 완충용액(pH 8.0)을 넣어 게놈 DNA를 재현탁함으로써, 핵산 시료를 얻을 수 있다.The sample includes blood, tissue, cells, and the like, and a nucleic acid sample can be obtained according to a conventional method. For example, in the case of blood, a nucleic acid sample can be obtained as follows. Transfer the blood to a centrifuge tube and add 5 ml of low concentration salt solution (10 mM Tris-HCl [pH 7.6], 10 mM KCl, 10 mM MgCl ₂ , and 2 mM EDTA). Then, after adding Nonidet P-40, the resulting solution was mixed well and centrifuged at 2,200 rpm for 10 minutes at room temperature. The obtained mass is resuspended in a high concentration of salt component buffer (10 mM Tris-HCl [pH 7.6], 10 mM KCl, 10 mM MgCl ₂ , 0.4M NaCl, and 2 mM EDTA). Genomic DNA is mixed with 50ul of 10% SDS and allowed to react overnight at 55 ° C. After the reaction, each genomic DNA is transferred to a 1.5 ml centrifuge tube and 0.3 ml of 6 M NaCl is added. After mixing the genomic DNA well, centrifuge for 10 minutes at 12,000rpm. After centrifugation, the upper layer containing genomic DNA is collected, transferred to a new test tube, and the same amount of 100% ethanol is added at room temperature. Mix the genomic DNA well until it is settled, and wash the genomic DNA with 70% ethanol when settling. At the end of this process, open the test tube lid and let the solution dry. The nucleic acid sample can be obtained by resuspending genomic DNA by adding TE buffer solution (pH 8.0).

여기에서, 상기 다형성 부위의 염기 서열을 분석하는 단계는 서열번호 1 내지 6, 10 내지 38, 41, 및 44 내지 45의 폴리뉴클레오티드로 이루어진 군으로부터 선택된 폴리뉴클레오티드 또는 그의 상보적 폴리뉴클레오티드 서열로서 다형성 부위(단, 상기 다형성 부위는 표 1에서 정의한 바와 같다)를 포함하는 서열을 주형으로 하는 프라이머 또는 프로브를 이용하여 수행될 수 있다. 상기 프라이머 또는 프로브는 상기에서 설명한 바와 같다. Here, the step of analyzing the nucleotide sequence of the polymorphic site is a polynucleotide selected from the group consisting of polynucleotides of SEQ ID NOs: 1 to 6, 10 to 38, 41, and 44 to 45 or a polymorphic site thereof as a complementary polynucleotide sequence thereof. (However, the polymorphic site is as defined in Table 1) can be carried out using a primer or a probe as a template. The primer or probe is as described above.

또한, 상기 다형성 부위의 염기 서열을 분석하는 단계는 폴리뉴클레오티드 (a) 내지 (al)로 이루어진 군으로부터 선택된 폴리뉴클레오티드 또는 그의 상보적 뉴클레오티드가 고정된 마이크로어레이에 상기 핵산 시료를 혼성화 시키는 단계; 및 얻어진 혼성화 결과를 분석하는 단계를 포함하여 수행될 수 있으며, 상기 마이크로어레이에 고정되는 폴리뉴클레오티드 또는 그의 상보적 뉴클레오티드의 DNA 서열의 길이는 각각 10 내지 100 뉴클레오티드일 수 있다.In addition, analyzing the nucleotide sequence of the polymorphic site may include hybridizing the nucleic acid sample to a microarray to which a polynucleotide selected from the group consisting of polynucleotides (a) to (al) or a complementary nucleotide thereof is immobilized; And analyzing the obtained hybridization results, wherein the lengths of the DNA sequences of the polynucleotides or complementary nucleotides immobilized on the microarrays may be 10 to 100 nucleotides, respectively.

본 발명의 검체 중의 단일염기다형을 검출하는 방법에 따라 얻어진 결과는 장형 위암 진단에 필요한 정보를 제공할 수 있으며 예를 들어, 서열번호 1 내지 6, 10 내지 38, 41, 및 44 내지 45의 다형성 부위의 염기가 각각 C, G, A, A, C, C, T, A, A, G, G, A, G, G, G, T, C, A, A, G, C, A, T, G, C, C, A, C, A, T, A, C, A, C, T, C, G, T (위험 대립인자)인 것이 하나 이상 존재하는 경우, 장형 위암에 걸릴 확률이 높은 위험군에 속하는 것으로 판정할 수 있다. 상기 위험 대립 인자를 갖는 핵산 서열이 하나의 검체에서 많이 검출되면 될수록 위험군에서 속하는 확률은 더 높은 것으로 판단할 수 있다. The results obtained according to the method for detecting monobasic polymorphisms in the sample of the present invention may provide information necessary for diagnosing gastric cancer, for example, polymorphisms of SEQ ID NOs: 1 to 6, 10 to 38, 41, and 44 to 45 Bases of the sites are C, G, A, A, C, C, T, A, A, G, G, A, G, G, G, T, C, A, A, G, C, A, T If you have more than one G, C, C, A, C, A, T, A, C, A, C, T, C, G, T (risk allele), you are more likely to develop gastric cancer It can be determined that they belong to the risk group. The more the nucleic acid sequence having the risk allele is detected in one sample, the higher the probability of belonging to the risk group can be determined.

또한, 상기 다형성 부위의 염기 서열을 분석하는 단계는 서열번호 1 내지 6, 10 내지 38, 41, 및 44 내지 45의 다형성 부위의 대립인자형(allele) 또는 유전자형(genotype) 분석을 포함할 수 있다. 바람직하게는, 상기 다형성 부위의 염기 서열을 분석하는 단계는 서열번호 1의 DNA 서열의 다형성부위의 열성 유전자형(recessive genotype) 분석을 포함하거나, 서열번호 2의 DNA 서열의 다형성 부위의 공우성 유전자형(co-dominant genotype), 열성 유전자형(recessive genotype) 분석을 포함하거나, 서열번호 3의 DNA 서열의 다형성 부위의 공우성 유전자형(co-dominant genotype), 열성 유전자형(recessive genotype) 분석을 포함하거나, 서열번호 4의 DNA 서열의 다형성 부위의 대립인자형(allele), 공우성 유전자형(co-dominant genotype), 열성 유전자형(recessive genotype) 분석을 포함하거나, 서열번호 5의 DNA 서열의 다형성 부위의 대립인자형(allele), 공우성 유전자형(co-dominant genotype), 열성 유전자형(recessive genotype) 분석을 포함하거나, 서열번호 6의 DNA 서열의 다형성 부위의 공우성 유전자형(co-dominant genotype), 우성 유전자형(dominant genotype) 분석을 포함하거나, 서열번호 10의 DNA 서열의 다형성 부위의 우성 유전자형(dominant genotype) 분석을 포함하거나, 서열번호 11의 DNA 서열의 다형성 부위의 공우성 유전자형(co-dominant genotype), 열성 유전자형(recessive genotype) 분석을 포함하거나, 서열번호 12의 DNA 서열의 다형성 부위의 우성 유전자형(dominant genotype) 분석을 포함하거나, 서열번호 13의 DNA 서열의 다형성 부위의 우성 유전자형(dominant genotype) 분석을 포함하거나, 서열번호 14의 DNA 서열의 다형성 부위의 대립인자형(allele) 분석을 포함하거나, 서열번호 15의 DNA 서열의 다형성 부위의 열성 유전자형(recessive genotype) 분석을 포함하거나, 서열번호 16의 DNA 서열의 다형성 부위의 대립인자형(allele) 분석을 포함하거나, 서열번호 17의 DNA 서열의 다형성 부위의 대립인자형(allele), 우성 유전자형(dominant genotype) 분석을 포함하거나, 서열번호 18의 DNA 서열의 다형성 부위의 대립인자형(allele) 분석을 포함하거나, 서열번호 19의 DNA 서열의 다형성 부위의 대립인자형(allele), 공우성 유전자형(co-dominant genotype), 우성 유전자형(dominant genotype) 분석을 포함하거나, 서열번호 20의 DNA 서열의 다형성 부위의 대립인자형(allele), 공우성 유전자형(co-dominant genotype), 열성 유전자형(recessive genotype) 분석을 포함하거나, 서열번호 21의 DNA 서열의 다형성 부위의 대립인자형(allele), 공우성 유전자형(co-dominant genotype), 우성 유전자형(dominant genotype) 분석을 포함하거나, 서열번호 22의 DNA 서열의 다형성 부위의 대립인자형(allele), 공우성 유전자형(co-dominant genotype), 열성 유전자형(recessive genotype) 분석을 포함하거나, 서열번호 239의 DNA 서열의 다형성 부위의 우성 유전자형(dominant genotype) 분석을 포함하거나, 서열번호 24의 DNA 서열의 다형성 부위의 우성 유전자형(dominant genotype) 분석을 포함하거나, 서열번호 25의 DNA 서열의 다형성 부위의 대립인자형(allele), 우성 유전자형(dominant genotype) 분석을 포함하거나, 서열번호 26의 DNA 서열의 다형성 부위의 대립인자형(allele), 공우성 유전자형(co-dominant genotype), 우성 유전자형(dominant genotype) 분석을 포함하거나, 서열번호 27의 DNA 서열의 다형성 부위의 대립인자형(allele), 공우성 유전자형(co-dominant genotype), 우성 유전자형(dominant genotype) 분석을 포함하거나, 서열번호 28의 DNA 서열의 다형성 부위의 열성 유전자형(recessive genotype) 분석을 포함하거나, 서열번호 29의 DNA 서열의 다형성 부위의 열성 유전자형(recessive genotype) 분석을 포함하거나, 서열번호 30의 DNA 서열의 다형성 부위의 열성 유전자형(recessive genotype) 분석을 포함하거나, 서열번호 31의 DNA 서열의 다형성 부위의 대립인자형(allele), 공우성 유전자형(co-dominant genotype) 분석을 포함하거나, 서열번호 32의 DNA 서열의 다형성 부위의 열성 유전자형(recessive genotype) 분석을 포함하거나, 서열번호 33의 DNA 서열의 다형성 부위의 열성 유전자형(recessive genotype) 분석을 포함하거나, 서열번호 34의 DNA 서열의 다형성 부위의 열성 유전자형(recessive genotype) 분석을 포함하거나, 서열번호 35의 DNA 서열의 다형성 부위의 열성 유전자형(recessive genotype) 분석을 포함하거나, 서열번호 36의 DNA 서열의 다형성 부위의 열성 유전자형(recessive genotype) 분석을 포함하거나, 서열번호 37의 DNA 서열의 다형성 부위의 우성 유전자형(dominant genotype) 분석을 포함하거나, 서열번호 38의 DNA 서열의 다형성 부위의 우성 유전자형(dominant genotype) 분석을 포함하거나, 서열번호 41의 DNA 서열의 다형성 부위의 우성 유전자형(dominant genotype) 분석을 포함하거나, 서열번호 44의 DNA 서열의 다형성 부위의 대립인자형(allele), 공우성 유전자형(co-dominant genotype), 우성 유전자형(dominant genotype) 분석을 포함하거나, 서열번호 45의 DNA 서열의 다형성 부위의 대립인자형(allele), 공우성 유전자형(co-dominant genotype), 우성 유전자형(dominant genotype) 분석을 포함할 수 있다.In addition, analyzing the base sequence of the polymorphic site may include an allele or genotype analysis of the polymorphic sites of SEQ ID NOs: 1 to 6, 10 to 38, 41, and 44 to 45. Preferably, analyzing the nucleotide sequence of the polymorphic site comprises recessive genotype analysis of the polymorphic site of the DNA sequence of SEQ ID NO: 1, or the concomitant genotype of the polymorphic site of the DNA sequence of SEQ ID NO: co-dominant genotype, recessive genotype analysis, co-dominant genotype, recessive genotype analysis of the polymorphic region of the DNA sequence of SEQ ID NO: 3, or SEQ ID NO: Allele, co-dominant genotype, recessive genotype analysis of the polymorphic site of the DNA sequence of 4, or allele of the polymorphic site of the DNA sequence of SEQ ID NO: 5 , Co-dominant genotype, recessive genotype analysis, or co-dominan of a polymorphic region of the DNA sequence of SEQ ID NO: 6 t genotype), including dominant genotype analysis, or dominant genotype analysis of the polymorphic site of the DNA sequence of SEQ ID NO: 10, or the dominant genotype of the polymorphic site of the DNA sequence of SEQ ID NO: co-dominant genotype, including recessive genotype analysis, or dominant genotype analysis of the polymorphic site of the DNA sequence of SEQ ID NO: 12, or dominant genotype of the polymorphic site of the DNA sequence of SEQ ID NO: 13 (dominant genotype) analysis, or allele analysis of the polymorphic site of the DNA sequence of SEQ ID NO: 14, or recessive genotype analysis of the polymorphic site of the DNA sequence of SEQ ID NO: 15, or Allele analysis of the polymorphic site of the DNA sequence of SEQ ID NO: 16, or allele of the polymorphic site of the DNA sequence of SEQ ID NO: 17 (a llele), including dominant genotype analysis, or allele analysis of the polymorphic site of the DNA sequence of SEQ ID NO: 18, or allele of the polymorphic site of the DNA sequence of SEQ ID NO: 19, Co-dominant genotype, dominant genotype analysis, or alleles, co-dominant genotypes, recessive genotypes of polymorphic regions of the DNA sequence include recessive genotype analysis, allele, co-dominant genotype, dominant genotype analysis of the polymorphic site of the DNA sequence of SEQ ID NO: 21, or DNA of SEQ ID NO: 22 Dominant of allelic, co-dominant genotype, recessive genotype analysis of the polymorphic site of the sequence, or dominant of the polymorphic site of the DNA sequence of SEQ ID NO: 239 Include genotype analysis, dominant genotype analysis of the polymorphic site of the DNA sequence of SEQ ID NO: 24, or allele, dominant genotype of the polymorphic site of the DNA sequence of SEQ ID NO: 25 ( DNA analysis of SEQ ID NO: 27, including dominant genotype analysis, allele, co-dominant genotype, dominant genotype analysis of the polymorphic site of the DNA sequence of SEQ ID NO: 26 Analysis of alleles, co-dominant genotypes, dominant genotypes of polymorphic regions of the sequence, or recessive genotype analysis of the polymorphic regions of the DNA sequence of SEQ ID NO: 28. A recessive genotype analysis of the polymorphic site of the DNA sequence of SEQ ID NO: 29, or a sequence of polymorphic site of the DNA sequence of SEQ ID NO: 30 Genotype analysis, including allele, co-dominant genotype analysis of the DNA sequence of SEQ ID NO: 31, or polymorphic site of the DNA sequence of SEQ ID NO: 32 Includes a recessive genotype analysis, or includes a recessive genotype analysis of a polymorphic site of a DNA sequence of SEQ ID NO: 33, or a recessive genotype analysis of a polymorphic site of a DNA sequence of SEQ ID NO: 34 Or include a recessive genotype analysis of the polymorphic site of the DNA sequence of SEQ ID NO: 35, or include a recessive genotype analysis of the polymorphic site of the DNA sequence of SEQ ID NO: 36, or the DNA of SEQ ID NO: 37 Dominant genotype analysis of the polymorphic site of the sequence, or dominant inheritance of the polymorphic site of the DNA sequence of SEQ ID NO: 38 Include dominant genotype analysis, dominant genotype analysis of the polymorphic site of the DNA sequence of SEQ ID NO: 41, or allele, co-dominant genotype of the polymorphic site of the DNA sequence of SEQ ID NO: 44 (co-dominant genotype), dominant genotype analysis, or allele, co-dominant genotype, dominant genotype of the polymorphic site of the DNA sequence of SEQ ID NO: 45 Analysis may be included.

예를 들어, 서열번호 1 내지 6, 서열번호 10 내지 13, 서열번호 15, 서열번호 17, 서열번호 19 내지 38, 서열번호 41, 서열번호 44 내지 45의 다형성 부위가 각각 유전자형이 C/C(서열번호 1), G/G(서열번호 2), A/A(서열번호 3), A/A(서열번호 4), C/C(서열번호 5), T/C와 C/C(서열번호 6), T/T(서열번호 10), A/A와 A/G(서열번호 11), G/A와 A/A(서열번호 12), G/G(서열번호 13), A/A(서열번호 15), A/G와 G/G(서열번호 17), C/T와 T/T(서열번호 19), C/C(서열번호 20), A/A(서열번호 21), A/A(서열번호 22), G/G(서열번호 23), C/C(서열번호 24), A/A(서열번호 25), C/T와 T/T(서열번호 26), T/G와 G/G(서열번호 27), C/C와 C/T(서열번호 28), C/C와 C/T(서열번호 29), A/A와 A/G(서열번호 30), T/C와 C/C(서열번호 31), A/A와 A/G(서열번호 32), T/T와 T/C(서열번호 33), A/A와 A/G(서열번호 34), C/C와 C/A(서열번호 35), A/A와 A/G(서열번호 36), T/C와 C/C(서열번호 37), C/T와 T/T(서열번호 38), T/C와 C/C(서열번호 41), T/G와 G/G(서열번호 44), C/T와 T/T(서열번호 45)가 하나 이상 존재하는 경우 장형 위암에 걸릴 위험도가 높은 것으로 판단할 수 있다. 상기 위험 유전자형을 갖는 핵산 서열이 하나의 검체에서 많이 검출되면 될수록 위험군에서 속하는 확률은 더 높은 것으로 판단할 수 있다. For example, the polymorphic sites of SEQ ID NOs: 1 to 6, SEQ ID NOs: 10 to 13, SEQ ID NO: 15, SEQ ID NO: 17, SEQ ID NOs: 19 to 38, SEQ ID NO: 41, and SEQ ID NOs: 44 to 45 are each genotype C / C ( SEQ ID NO: 1), G / G (SEQ ID NO: 2), A / A (SEQ ID NO: 3), A / A (SEQ ID NO: 4), C / C (SEQ ID NO: 5), T / C, and C / C (SEQ ID NO: Number 6), T / T (SEQ ID NO: 10), A / A and A / G (SEQ ID NO: 11), G / A and A / A (SEQ ID NO: 12), G / G (SEQ ID NO: 13), A / A (SEQ ID NO: 15), A / G and G / G (SEQ ID NO: 17), C / T and T / T (SEQ ID NO: 19), C / C (SEQ ID NO: 20), A / A (SEQ ID NO: 21) , A / A (SEQ ID NO: 22), G / G (SEQ ID NO: 23), C / C (SEQ ID NO: 24), A / A (SEQ ID NO: 25), C / T and T / T (SEQ ID NO: 26), T / G and G / G (SEQ ID NO: 27), C / C and C / T (SEQ ID NO: 28), C / C and C / T (SEQ ID NO: 29), A / A and A / G (SEQ ID NO: 30) ), T / C and C / C (SEQ ID NO: 31), A / A and A / G (SEQ ID NO: 32), T / T and T / C (SEQ ID NO: 33), A / A and A / G (SEQ ID NO: Number 34), C / C and C / A (SEQ ID NO: 35), A / A and A / G (SEQ ID NO: 36), T / C and C / C (SEQ ID NO: 34) Number 37), C / T and T / T (SEQ ID NO: 38), T / C and C / C (SEQ ID NO: 41), T / G and G / G (SEQ ID NO: 44), C / T and T / T If one or more of (SEQ ID NO: 45) is present, it may be determined that the risk of developing intestinal gastric cancer is high. The more the nucleic acid sequence having the risk genotype is detected in one sample, the higher the probability of belonging to the risk group can be determined.

본 발명은 또한, 장형위암 진단에 필요한 정보를 제공하기 위하여, 상기 핵산 시료로부터 반수체형(haplotype)을 분석하는 단계를 포함하는, 검체 중의 단일염기다형을 검출하는 방법을 포함한다. 구체적으로, 상기 반수체형 분석은 상기 핵산 시료로부터, 서열번호 1 및 2의 DNA 서열의 다형성 부위로부터 결정되는 반수체형; 서열번호 4 및 5의 DNA 서열의 다형성 부위로부터 결정되는 반수체형; 서열번호 6 내지 8의 DNA 서열의 다형성 부위로부터 결정되는 반수체형; 서열번호 9 및 10의 DNA 서열의 다형성 부위로부터 결정되는 반수체형; 서열번호 18 및 19의 DNA 서열의 다형성 부위로부터 결정되는 반수체형; 서열번호 21 및 22의 DNA 서열의 다형성 부위로부터 결정되는 반수체형; 서열번호 27 및 28의 DNA 서열의 다형성 부위로부터 결정되는 반수체형; 서열번호 30 내지 36의 DNA 서열의 다형성 부위로부터 결정되는 반수체형; 서열번호 39 및 40의 DNA 서열의 다형성 부위로부터 결정되는 반수체형; 및 서열번호 42 내지 45의 DNA 서열의 다형성 부위로부터 결정되는 반수체형으로 이루어진 군으로부터 선택된 반수체형을 분석함으로써 수행될 수 있다. 단, 반수체형 분석에 있어서, 상기 DNA 서열의 다형성 부위는 다음 표 2와 같다.The present invention also includes a method for detecting a monobasic polymorph in a sample, comprising analyzing a haplotype from the nucleic acid sample to provide information necessary for diagnosing gastric cancer. Specifically, the haplotype analysis comprises a haplotype determined from the polymorphic site of the DNA sequence of SEQ ID NOs: 1 and 2 from the nucleic acid sample; Haplotypes determined from the polymorphic sites of the DNA sequences of SEQ ID NOs: 4 and 5; Haplotype determined from the polymorphic site of the DNA sequence of SEQ ID NOs: 6-8; Haplotypes determined from the polymorphic sites of the DNA sequences of SEQ ID NOs: 9 and 10; Haplotypes determined from the polymorphic sites of the DNA sequences of SEQ ID NOs: 18 and 19; Haplotypes determined from the polymorphic sites of the DNA sequences of SEQ ID NOs: 21 and 22; Haplotypes determined from the polymorphic sites of the DNA sequences of SEQ ID NOs: 27 and 28; Haplotype determined from the polymorphic site of the DNA sequence of SEQ ID NOs: 30-36; Haplotypes determined from the polymorphic sites of the DNA sequences of SEQ ID NOs: 39 and 40; And haplotypes selected from the group consisting of haplotypes determined from the polymorphic sites of the DNA sequences SEQ ID NOs: 42-45. However, in haplotype analysis, the polymorphic sites of the DNA sequence are shown in Table 2 below.

서열번호SEQ ID NO: 다형성 부위Polymorphic site 1One 501501 22 501501 44 251251 55 251251 66 301301 77 301301 88 301301 99 401401 1010 10931093 1818 301301 1919 301301 2121 301301 2222 301301 2727 301301 2828 251251 3030 251251 3131 301301 3232 201201 3333 301301 3434 201201 3535 501501 3636 301301 3939 301301 4040 481481 4242 885885 4343 452452 4444 301301 4545 490490

즉, 서열번호 1 및 2, 서열번호 4 및 5, 서열번호 6 내지 8, 서열번호 9 및 10, 서열번호 18 및 19, 서열번호 21 및 22, 서열번호 27 및 28, 서열번호 30 내지 36, 서열번호 39 및 40, 서열번호 42 내지 45의 DNA 서열의 다형성 부위로부터 결정된 반수체형 분석결과, 서열번호 1 및 2의 DNA 서열이 C-G 반수체형을 둘 다 가지는 경우(즉, 이배체형(diplotype)을 가지는 경우), 그렇지 않은 사람보다 장형위암에 걸릴 위험이 높은 것으로 판정할 수 있으며, A-A 반수체형을 하나라도 나타내는 경우 장형 위암에 저항성을 갖는 것으로 판정할 수 있고(표 9, 도 1a 및 1b 참조), 서열번호 4 및 5의 DNA 서열이 A-C 반수체형을 갖는 경우, 그렇지 않은 사람보다 장형위암에 걸릴 위험이 높은 것으로 판정할 수 있으며, G-G 반수체형을 나타내는 경우 장형 위암에 저항성을 갖는 것으로 판정할 수 있고(표 9, 도 2a 및 2b 참조), 서열번호 6 내지 8 의 DNA 서열이 C-A-C 반수체형을 하나라도 갖는 경우, 그렇지 않은 사람보다 장형위암에 걸릴 위험이 높은 것으로 판정할 수 있고(표 9, 도 3a 및 3b 참조), 서열번호 9 및 10의 DNA 서열이 C-C 반수체형을 하나라도 갖는 경우, 그렇지 않은 사람보다 장형위암에 저항성을 갖는 것으로 판정할 수 있고(표 9, 도 4a 및 4b 참조), 서열번호 18 및 19의 DNA 서열이 G-T 반수체형을 하나라도 갖는 경우, 그렇지 않은 사람보다 장형위암에 걸릴 위험이 높은 것으로 판정할 수 있으며, A-C 반수체형을 나타내는 경우, 장형 위암에 저항성을 갖는 것으로 판정할 수 있고(표 9, 도 5a 및 5b 참조), 서열번호 21 및 22 의 DNA 서열이 A-A를 둘다 갖는 경우, 장형위암에 걸릴 위험이 높은 것으로 판정할 수 있으며, G-G 반수체형을 하나라도 갖는 경우, 장형위암에 저항성을 갖는 것으로 판정할 수 있고(표 9, 도 6a 및 6b 참조), 서열번호 27 및 28 의 DNA 서열이 G-C 반수체형을 하나라도 갖는 경우, 그렇지 않은 사람보다 장형위암에 걸릴 위험이 높은 것으로 판정할 수 있으며, T-T 반수체형을 둘다 갖는 경우, 장형위암에 저항성을 갖는 것으로 판정할 수 있고(표 9, 도 7a 및 7b 참조), 서열번호 30 내지 36의 DNA 서열이 G-T-G-G-A-G 반수체형을 둘다 갖는 경우, 그렇지 않은 사람보다 장형위암에 저항성을 갖는 것으로 판정할 수 있고(표 9, 도 8a 및 8b 참조), 서열번호 39 내지 40의 DNA 서열이 T-T 반수체형을 하나라도 갖는 경우, 그렇지 않은 사람보다 장형위암에 걸릴 위험이 높은 것으로 판정할 수 있고(표 9, 도 9a 및 9b 참조), 서열번호 42 내지 45의 DNA 서열이 A-A-T-C 반수체형을 둘다 갖는 경우, 그렇지 않은 사람보다 장형위암에 걸릴 위험이 높은 것으로 판정할 수 있다(표 9, 도 10a 및 10b 참조).That is, SEQ ID NO: 1 and 2, SEQ ID NO: 4 and 5, SEQ ID NO: 6 to 8, SEQ ID NO: 9 and 10, SEQ ID NO: 18 and 19, SEQ ID NO: 21 and 22, SEQ ID NO: 27 and 28, SEQ ID NO: 30 to 36, As a result of haplotype analysis determined from the polymorphic sites of the DNA sequences SEQ ID NOs: 39 and 40 and SEQ ID NOs: 42-45, the DNA sequences of SEQ ID NOs: 1 and 2 had both CG haplotypes (ie, diplotypes). ), It may be determined that the risk of developing gastrointestinal cancer is higher than that of others, and if any AA haplotype is indicated, it may be determined to be resistant to gastric cancer (see Table 9, FIGS. 1A and 1B). , If the DNA sequence of SEQ ID NO: 4 and 5 has an AC haplotype, it may be determined that the risk of developing gastrointestinal cancer is higher than that of those who do not, and if the GG haplotype is indicated, it may be determined that it is resistant to enteric gastric cancer. There (See Table 9, Figures 2A and 2B), if the DNA sequence of SEQ ID NOs 6 to 8 has any CAC haplotype, it can be determined that the risk of developing gastrointestinal cancer is higher than that of the other (Table 9, Figure 3a and 3b), if the DNA sequences of SEQ ID NOs: 9 and 10 have any CC haplotype, it may be determined that they are more resistant to gastric cancer than those who do not (see Table 9, Figures 4A and 4B), If the DNA sequences of SEQ ID NOs: 18 and 19 have any GT haplotype, it may be determined that the risk of developing gastrointestinal cancer is higher than that of other humans, and if it is AC haploid, it is determined to be resistant to enteric gastric cancer. (See Table 9, FIGS. 5A and 5B), if the DNA sequences of SEQ ID NOs: 21 and 22 have both AA, it can be determined that the risk of developing gastrointestinal gastric cancer is high, and any of the GG haplotypes , Long If it can be determined that it is resistant to gastric cancer (see Tables 9, 6A and 6B), and the DNA sequences of SEQ ID NOs: 27 and 28 have any GC haplotype, they are at higher risk for gastric cancer than those who do not. And having both TT haplotypes, can be determined to be resistant to gastric gastric cancer (see Tables 9, 7A and 7B), and the DNA sequences of SEQ ID NOs: 30-36 are both GTGGAG haplotypes Have a TT haplotype if the DNA sequence of SEQ ID NOs: 39 to 40 can be determined to be more resistant to gastrointestinal cancer than those who do not (see Tables 9, 8A and 8B). If the DNA sequence of SEQ ID NOS: 42-45 has both AATC haplotypes, it can be determined that the risk of developing gastric cancer is higher (see Tables 9, 9A and 9B). It can be determined that the risk of catching is high (see Table 9, FIGS. 10A and 10B).

이하, 본 발명을 실시예를 통하여 더욱 상세히 설명한다. 그러나 이들 실시예는 본 발명을 예시하기 위한 것으로, 본 발명의 범위가 이들 실시예에 한정되는 것은 아니다.Hereinafter, the present invention will be described in more detail with reference to examples. However, these examples are for illustrating the present invention, and the scope of the present invention is not limited to these examples.

실시예Example

1. 피검자 선정1. Selection of subjects

본 실험은 한국 아주대 병원으로부터 임상적으로 장형 위암으로 진단받은 153명의 환자들과 만성 위염으로 진단받은 326명의 환자들로부터 동의를 받아서 실험과 연관성 분석이 수행되었다. 479명의 환자 모두가 Helicobacter pylori에 감염된 환자이다. 사용된 군집의 크기는 다음 표 3과 같다.This study was performed with the consent of 153 patients who were clinically diagnosed as enteric gastric cancer and 326 patients who were diagnosed with chronic gastritis. All 479 patients were infected with Helicobacter pylori. The size of the clusters used is shown in Table 3 below.

그룹group 만성 위염 환자군Chronic gastritis 장형 위암 환자군Enteric Gastric Cancer Group 총합계total 조사인원Survey 326326 153153 479479

2. 게놈 DNA의 준비2. Preparation of Genomic DNA

게놈 DNA(genomic DNA; gDNA)를 준비하기 위하여 장형 위암 환자와 만성 위염 환자들로부터 혈액을 채취하였다. 전체 정맥혈의 5ml은 각 지원자들의 전주정맥으로부터 얻었고 15% EDTA가 들어있는 Vacutainer tube로 옮겨졌다. 게놈 DNA를 추출하기 위하여, 혈액을 원심분리 실험관으로 옮기고 낮은 농도의 염 성분 완충용액(low-salt buffer solution(10mM Tris-HCl [pH 7.6], 10mM KCl, 10mM MgCl₂, and 2mM EDTA))을 5ml 첨가 하였다. 그리고 Nonidet P-40을 첨가하였다. 얻어진 용액을 잘 섞어준 후, 상온에서 10분간 2,200 rpm으로 원심분리 하였다. 이 때 얻어진 덩어리는 고농도의 염 성분 완충용액(high-salt buffer solution(10mM Tris-HCl [pH 7.6], 10mM KCl, 10mM MgCl₂, 0.4M NaCl, and 2mM EDTA))으로 재현탁 하였다. 이 게놈 DNA는 50ul의 10% SDS와 섞은 후, 55℃에서 밤새 반응시켰다.Blood was collected from enteric gastric cancer patients and chronic gastritis patients to prepare genomic DNA (gDNA). 5 ml of total venous blood was obtained from each volunteer's anterior venous vein and transferred to a Vacutainer tube containing 15% EDTA. To extract genomic DNA, the blood is transferred to a centrifuge tube and a low concentration of salt buffer solution (10 mM Tris-HCl [pH 7.6], 10 mM KCl, 10 mM MgCl ₂ , and 2 mM EDTA) is added. 5 ml was added. And Nonidet P-40 was added. The resulting solution was mixed well and centrifuged at 2,200 rpm for 10 minutes at room temperature. The obtained mass was resuspended in a high concentration salt buffer solution (10 mM Tris-HCl [pH 7.6], 10 mM KCl, 10 mM MgCl ₂ , 0.4M NaCl, and 2 mM EDTA). The genomic DNA was mixed with 50ul of 10% SDS and reacted overnight at 55 ° C.

반응 후에, 각 게놈 DNA들은 1.5ml 용량의 원심분리 실험관으로 옮겨, 6M NaCl을 0.3ml 첨가하였다. 게놈 DNA들을 잘 섞어준 후, 12,000rpm에서 10분간 원심분리하였다. 원심분리 후, 게놈 DNA가 들어있는 상층을 모아서 새로운 시험관으로 옮기고, 상온에서 동량의 100% 에탄올을 첨가하였다. 게놈 DNA들이 침전될 때까지 잘 섞어 주고, 침전이 되면 70% 에탄올로 게놈 DNA들을 세척하였다. 이 과정이 끝나면 실험관 뚜껑을 열어, 용액성분이 마르도록 놓아둔다. 그리고 TE 완충용액(pH 8.0)을 넣어 게놈 DNA를 재현탁하였다.After the reaction, each genomic DNA was transferred to a 1.5 ml centrifuge tube and 0.3 ml of 6 M NaCl was added. After mixing the genomic DNA well, it was centrifuged for 10 minutes at 12,000rpm. After centrifugation, the upper layer containing genomic DNA was collected and transferred to a new test tube, and the same amount of 100% ethanol was added at room temperature. Mix well until genomic DNA precipitates and wash genomic DNA with 70% ethanol when precipitated. At the end of this process, open the test tube lid and let the solution dry. Then, TE buffer solution (pH 8.0) was added to resuspend the genomic DNA.

준비된 게놈 DNA는 GoldenGate Assay를 위해 Quant-iT^TM PicoGreen dsDNA reagent (Molecular Probes, Invitrogen)을 사용하여 250ng/5uL의 농도로 조정하였다. Prepared genomic DNA using the ^{Quant-iT TM PicoGreen dsDNA reagent (} Molecular Probes, Invitrogen) for GoldenGate Assay was adjusted to a concentration of 250ng / 5uL.

3. 게놈 DNA의 증폭 및 유전자형 분석 - Goldengate assay 3. Amplification and Genotyping of Genomic DNA-Goldengate assay

Goldengate assay 방법에 따라, Sentrix array matrix chip을 사용하여 유전자형을 분석하였다. 일루미나사(Illumina Inc.)로부터 GoldenGate assay를 위한 모든 시약과 Sentrix array matrix chip을 구입하여 사용하였다. 모든 과정은 일루미나사의 GoldenGate assay 방법에 따라 시행하였다. According to the Goldengate assay method, genotyping was performed using a Sentrix array matrix chip. All reagents and Sentrix array matrix chips for GoldenGate assay were purchased from Illumina Inc. All procedures were performed according to Illumina's GoldenGate assay method.

게놈 DNA 시료(250 ng/5 ㎕)는 96웰 플레이트에서 5 ul의 GS#-MS1 시약과 혼합하였다. 플레이트를 밀봉하고 250×g에서 1 분 동안 원심분리하고, 95 ℃ heat block에서 30 분간 반응시켰다. 여기에 5 ul GS#-SUD를 넣고 혼합한 후 5 ul 2-프로판올을 넣고, 다시 혼합한 후에 3,000×g에서 20분 동안 원심분리한 후 상등액을 제거하고 펠렛을 건조시켰다. 건조된 DNA는 10 ul의 GS#-RS1 시약을 넣고 잘 풀어주었다. 여기에 10 ul의 GS#-OPA와 30 ul의 GS#-OB1를 넣고, 잘 혼합한 후 70℃ heat block에 넣고 30℃가 될 때까지 천천히 식혔다. GS#-ASE 플레이트를 마그네틱 플레이트 위에 올리고 2분간 방치하였다. 상등액을 제거하고, 50 ul GS#-AM1을 넣어주고 잘 혼합하였다. 마그네틱 플레이트 위에 올리고 2분간 방치한 후 상등액을 제거하고, 50 ul GS#-AM1을 넣어주고 잘 혼합하였다. 위와 동일한 방법으로 50 ul GS#-UB1로 세척 후, 상등액을 제거하고, 37 ul GS#-MEL을 넣어주고 잘 혼합한 후에 45℃에서 15분간 배양하였다. 위 플레이트를 다시 마그네틱 플레이트 위에 올리고 2분간 방치한 후 상등액을 제거하고, 50 ul GS#-UB1을 넣어주었다. 다시 위 과정을 반복하고, 35 ul GS#-IP1을 넣어주고 잘 혼합한 후 95℃ heat block에서 1분간 배양(incubation)하였다. 플레이트를 마그네틱 플레이트 위에 올리고 2분간 방치하였다. 상등액 30 ul를 GS#-PCR 플레이트에 옮긴 후 64 ul의 illumina-recommended DNA polymerase와 100 ul UDG를 GS#-MMP tube에 넣어준후 잘 섞어주고, 표 4에 나타낸 조건으로 증폭시켰다. Genomic DNA samples (250 ng / 5 μl) were mixed with 5 ul of GS # -MS1 reagent in 96 well plates. The plate was sealed and centrifuged for 1 minute at 250 × g, and reacted for 30 minutes in a 95 ° C. heat block. 5 ul GS # -SUD was added thereto and mixed, 5 ul 2-propanol was added thereto, and mixed again, followed by centrifugation at 3,000 × g for 20 minutes, and then the supernatant was removed and the pellet was dried. The dried DNA was added well with 10 ul of GS # -RS1 reagent. 10 ul of GS # -OPA and 30 ul of GS # -OB1 were added thereto, mixed well, and placed in a 70 ° C. heat block and slowly cooled down to 30 ° C. The GS # -ASE plate was placed on the magnetic plate and left for 2 minutes. The supernatant was removed and 50 ul GS # -AM1 was added and mixed well. After placing on a magnetic plate and left for 2 minutes, the supernatant was removed, 50 ul GS # -AM1 was added and mixed well. After washing with 50 ul GS # -UB1 in the same manner as above, the supernatant was removed, 37 ul GS # -MEL was added and well mixed and incubated for 15 minutes at 45 ℃. The upper plate was placed again on the magnetic plate, left for 2 minutes, the supernatant was removed, and 50 ul GS # -UB1 was added thereto. Repeat the above process again, put 35 ul GS # -IP1 and mix well and incubated for 1 minute in a 95 ℃ heat block. The plate was placed on a magnetic plate and left for 2 minutes. 30 ul of the supernatant was transferred to the GS # -PCR plate, 64 ul of illumina-recommended DNA polymerase and 100 ul UDG were added to the GS # -MMP tube, mixed well, and amplified under the conditions shown in Table 4.

온도Temperature 각 온도에서의 시간Time at each temperature 37℃37 ℃ 10 min10 min 95℃95 ℃ 3 min3 min 34 사이클34 cycles 95℃95 ℃ 35 sec 35 sec 56℃56 ℃ 35 sec 35 sec 72℃72 ℃ 2 min2 min 72℃72 ℃ 10 min10 min 4℃4 ℃ 5 min5 min

반응 후에 20 ul의 GS#MPB를 넣고 잘 혼합한 후, 96웰 필터 플레이트에 옮기고, 암실에서 60분 동안 방치하였다. 필터 플레이트 어댑터를 96웰 V-bottom에 놓고, 원심 분리하여 상등액을 제거하였다. 다시 필터 플레이트에 50 ul GS#-UB2를 넣고 1000 x g에서 5 분간 원심 분리하였다. 30 ul의 GS#_MH1가 들어있는 GS#-INT 플레이트를 필터 플레이트위에 놓고, 30 ul의 0.1N NaOH를 필터 플레이트에 넣었다. 1000 xg에서 5 분간 원심 분리하여 PCR 생산물을 수거하였다.After the reaction, 20 ul of GS # MPB was added and mixed well, and then transferred to a 96 well filter plate, and left in the dark for 60 minutes. The filter plate adapter was placed in a 96 well V-bottom and centrifuged to remove the supernatant. 50 ul GS # -UB2 was put back into the filter plate and centrifuged at 1000 x g for 5 minutes. A GS # -INT plate containing 30 ul of GS # _MH1 was placed on the filter plate and 30 ul of 0.1N NaOH was placed in the filter plate. PCR products were harvested by centrifugation at 1000 xg for 5 minutes.

칩을 GS#-UB2와 NaOH에서 전 처리하고, 수거한 PCR 산물을 384웰에 옮기고 여기에 전처리한 칩을 올렸다. 60 ℃에서 30 분 동안 혼성화시키고 다시 온도를 45℃로 바꾸고 14시간 이상 혼성화시켰다. GS#UB2, GS#IS1에서 세척한 후에 상온에서 건조시킨 후, 분석을 위해서 이미지 스캐닝(image scanning)을 실시하였다.The chips were pretreated in GS # -UB2 and NaOH, and the collected PCR products were transferred to 384 wells and the pretreated chips were loaded thereon. Hybridization was performed at 60 ° C. for 30 minutes and again the temperature was changed to 45 ° C. and hybridized for at least 14 hours. After washing in GS # UB2, GS # IS1, and dried at room temperature, image scanning was performed for analysis.

유전자형 분석 이미지 파일로부터의 유전자형의 분석은 Illumina사의 Beadstudio를 사용하여 각 다형성에 대한 유전자형를 확인하였다. Beadstudio는 두 개의 대립인자를 인식하는 프로브들 사이의 발색의 세기에 대한 비율을 가지고 전형적인 유전자형의 패턴을 나타낸다.Genotyping Analysis of genotypes from image files confirmed the genotype for each polymorphism using Illumina's Beadstudio. Beadstudio exhibits a typical genotype pattern with a ratio of the intensity of color development between probes recognizing two alleles.

4. 통계학적 분석4. Statistical Analysis

단일염기다형들을 구성하는 대립인자들에 대한 집단내의 유전학적 분포여부에 대하여 하아디-와인버그 평형 검정을 하였다. 여기에서 단일염기다형의 기준이 되는 HWE p value > 0.05와 작은 대립인자 빈도(Minor Allele Frequency; MAF) > 0.1의 기준을 적용하였다. 장형 위암 환자군과 만성 위염 환자군간의 출현빈도를 이용하여 질병에 대한 각 단일염기다형들의 연관성분석은 유의수준을 < 0.05로 정하였으며, SAS software version 9.1.3를 사용하여 분석하였다. 연관성분석은 대립인자들 간의 장형 위암 환자군과 만성 위염 환자군에서의 출현빈도로 나누어 유의성 있는 차이를 비교하였으며, 유의수준을 < 0.05로 하였다. 오즈 비(Odds Ratio)는 로지스틱 회귀모형을 통하여 산출하였으며 대립인자(Allele), 우성 (Dominant), 열성 (Recessive) 및 공우성(Co-Dominant) 모형을 적용하였다. 각 모형에 따른 질병과의 유의성을 분석함에 있어 다수 대립인자를 A1이라 하고 소수 대립인자를 A2라 하면 대립인자(Allele) 모형은 A1의 빈도와 A2의 빈도를 비교하는 것이고, 우성 (Dominant) 모형은 A1A1의 빈도와 A1A2와 A2A2의 빈도에 대한 합을 비교하는 것이고, 열성 (Recessive) 형질 모형은 A1A1과 A1A2의 합과 A2A2의 빈도를 비교하는 것이고, 공우성 (Co-Dominant) 형질 모형은 A1A1, A1A2 및 A2A2에 대한 각각의 빈도를 비교하는 것이다. The Hardy-Weinberg equilibrium test was performed to determine the genetic distribution of the alleles that make up the monobasic polymorphisms. In this case, the criteria of HWE p value> 0.05 and Minor Allele Frequency (MAF)> 0.1, which are the criteria for polybasic polymorphism, were applied. Using the frequency of incidence between intestinal gastric cancer patients and chronic gastritis patients, the significance level of each polybasic polymorphism was set to <0.05 and analyzed using SAS software version 9.1.3. Correlation analysis was performed to compare significant differences between alleles in the incidence of intestinal gastric cancer and in patients with chronic gastritis. The significance level was <0.05. Odds Ratio was calculated by logistic regression model and allele, dominant, recessive and co-dominant models were applied. In analyzing the significance of each model, the majority allele is A1 and the minor allele is A2. The allele model compares the frequency of A1 with the frequency of A2, and the dominant model. Is comparing the sum of the frequency of A1A1 with the sum of the frequencies of A1A2 and A2A2, the recessive trait is comparing the sum of A1A1 and A1A2 with the frequency of A2A2, and the co-dominant trait is A1A1. , To compare the respective frequencies for A1A2 and A2A2.

단일염기다형들 간의 연관비평형 검정을 하였다. 이는 Haploview (ver. 4.1)을 사용하였다. 장형 위암 환자군과 만성 위염 환자군 모두에서 형성되는 각 단일염기다형들간의 상호연관성에 대한 분석에서 형성된 연관비평형블록으로부터 반수체형이 형성되었으며, 이들을 장형 위암 환자군과 만성 위염 환자군으로 나누어 장형 위암 환자군 특이적인 반수체형을 분석하였다. Associative non-equilibrium tests between monobasic polymorphs were performed. It used Haploview (ver. 4.1). The haplotype was formed from the association non-equilibrium block formed in the analysis of the correlation between the monobasic polymorphisms in both the gastric cancer patients and the chronic gastritis patients, and divided into the gastric cancer patients and the chronic gastritis patients. Haploid type was analyzed.

5. 결과5. Results

(1) PRKCI, MAPK10, SPP1, IQGAP2, FGFR4, NOTCH4, HLA-DRA, HLA-DOA, THBS2, DFNA5, TBXAS1, TNKS, CDH17, UBR5, KIAA0196, 및 NSMCE2 유전자에 존재하는 유의성을 나타내는 단일염기다형(1) Monobasic polymorphisms showing significance present in the PRKCI, MAPK10, SPP1, IQGAP2, FGFR4, NOTCH4, HLA-DRA, HLA-DOA, THBS2, DFNA5, TBXAS1, TNKS, CDH17, UBR5, KIAA0196, and NSMCE2 genes

하기 표 5a 및 5b는 상기 만성 위염 환자군과 장형 위암 환자군이 참여하여 장형 위암과의 연관성을 분석한 결과 유의성 있는 차이를 나타낸 서열의 목록으로서, 이 중 서열번호 1 내지 6, 10 내지 38, 41, 및 44 내지 45의 다형성 서열은 통계학적으로 유의성 있는 차이를 나타내는 것을 분석되었다.Table 5a and 5b is a list of sequences showing significant differences as a result of analyzing the association between the gastric gastritis patient group and the gastrointestinal gastric cancer patient group to analyze the association between gastric gastric cancer, SEQ ID NOs: 1 to 6, 10 to 38, 41, And polymorphic sequences of 44 to 45 were analyzed to show statistically significant differences.

ㆍ서열번호는 각 단일염기다형을 나타내는 서열번호이다.Sequence number is a sequence number which represents each monobasic polymorphism.

ㆍReference SNP ID는 표준단일염기다형의 명칭 (Reference SNP ID; rs)을 의미하며, 이는 인간게놈프로젝트 후, 미국 국립보건원 산하 생물공학정보연구소 (NCBI)의 데이터베이스에서 각 단일염기다형을 구분하기 위하여 붙인 이름이다. Reference SNP ID refers to the name of the standard single nucleotide polymorphism (Reference SNP ID; rs), which is used to distinguish each single nucleotide polymorphism from the database of the National Institute of Biotechnology Information (NCBI) under the National Institute of Health. Name given.

ㆍ대립인자는 각 단일염기다형에서의 Major allele과 Minor allele을 나타낸다.Alleles represent major and minor alleles in each monobasic polymorph.

ㆍ염색체는 각 단일염기다형들이 몇 번 염색체에 위치하고 있는지를 나타내주고 있다.The chromosome shows how many times each single base polymorph is located on the chromosome.

ㆍ위치는 염색체상에서 표준염기서열의 위치를 의미한다(NCBI Genome build 36.3).Position refers to the position of the standard base sequence on the chromosome (NCBI Genome build 36.3).

ㆍSNP 역할은 상기 단일염기다형들이 유전자상에서 어디에 위치하고 있는지를 나타내주고 있다. 따라서 각각의 단일염기다형들이 유전자의 발현과 기능에 있어서 어떻게 작용하는지를 알 수 있다. The SNP role indicates where the single base polymorphisms are located on the gene. Thus, it can be seen how each monobasic polymorphism works in gene expression and function.

ㆍ아미노산의 변화는 상기 단일염기다형들이 유전자가 발현되어 단백질을 생성할 때, 그 단백질을 이루는 아미노산에 변화를 주고 있는지에 대한 설명이다.A change in amino acid is a description of whether the monobasic polymorphs are changing the amino acids constituting the protein when the gene is expressed to produce a protein.

(2) 유전자형(genotype) 및 위험대립인자 분석(2) Genotype and Risk Allele Analysis

하기 표 6a 내지 표 6d는 서열번호 1 내지 6, 10 내지 38, 41, 및 44 내지 45의 DNA 서열의 대립인자 빈도와 각 유전자형에 대한 만성 위염 환자군과 장형 위암 환자군 각각의 빈도를 나타내며, 표 7a 내지 표 7e는 표 6a 내지 표 6d의 유전자형 빈도를 토대로 진행한 하아디-와인버그 평형 검정(Hardy-Weinberg Equilibrium Test) 결과와 장형 위암 연관성 분석[즉, 케이스-컨트롤 연관 분석(Case-control association study)] 결과로서, 하기 연관성 분석에 의해 장형 위암에 영향을 미칠 수 있는 단일염기다형들과 이들의 유전자형을 확인할 수 있다. Tables 6a to 6d below show allele frequencies of the DNA sequences of SEQ ID NOs: 1 to 6, 10 to 38, 41, and 44 to 45, and the frequencies of the chronic gastritis and enteric gastric cancer patient groups for each genotype, respectively. Table 7e shows the results of the Hardy-Weinberg Equilibrium Test based on the genotype frequencies of Tables 6a to 6d and the analysis of bowel gastric cancer (ie, Case-control association study). As a result, the following association analysis can identify monobasic polymorphisms and their genotypes that may affect intestinal gastric cancer.

ㆍ 대립인자형 빈도(allele frequency) 란에서는 집단 내에서 관찰되는 각각의 대립인자들의 빈도를 나타내고 있다. 여기서 대립인자란 단일염기다형 부위에 실제로 존재하고 있는 뉴클레오티드의 염기서열을 뜻하며, 하나의 단일염기다형은 그 특성상 두 개의 대립인자를 가질 수 있다. 이들 대립인자 빈도의 고저에 따라 다수 대립인자(Major allele)와 소수 대립인자(Minor allele)로 나뉜다. 특히 소수 대립인자의 빈도(Minor allele frequency; MAF)는 0.1을 기준으로 하여 단일염기다형과 단일염기변이(point mutation)을 나누는 유전학적 기준으로 사용된다. 이러한 기준은 다형성 자체의 안정성을 뜻하기 때문에(즉, 단일염기다형이 일어나는 부위는 일정하다라는 의미), 유전적 표지인자로써의 사용 가능성을 대변해 준다. The allele frequency column shows the frequency of each allele observed in the population. Here, an allele means a nucleotide sequence of nucleotides actually present at a single nucleotide polymorphism site, and a single nucleotide polymorphism may have two alleles due to its characteristics. According to the high and low frequency of these alleles, they are divided into major alleles and minor alleles. In particular, the minor allele frequency (MAF) is used as a genetic reference for dividing a single base polymorphism and a point mutation based on 0.1. This criterion represents the stability of the polymorphism itself (that is, the site where a monobasic polymorphism occurs is constant), thus representing its potential use as a genetic marker.

ㆍ 유전자형은 각 단일염기다형을 형성하는 각각의 대립인자들로 이루어지며, 단일염기다형 부위가 존재하는 유전자좌(locus)에 어떠한 대립인자들이 있는지를 나타내 주고 있다. Genotype consists of alleles that form each monobasic polymorphism, and shows what alleles are in the locus where the monobasic polymorphic site exists.

ㆍ 유전자형 빈도수(genotype frequency)와 유전자형 빈도 % 란은 각각의 유전자형에 해당하는 사람들의 수를 만성 위염 환자군과 장형 위암 환자군으로 나누어 나타낸다. 이를 관측치라고 할 수 있다. The genotype frequency and% genotype frequency column shows the number of people corresponding to each genotype divided into the group of patients with chronic gastritis and the patients with type gastric cancer. This can be called an observation.

표 7a 내지 표 7e는 하나의 단일염기다형을 이루는 두 개의 대립인자와 이로 인하여 형성되는 3가지 형태의 유전자형에 대한 만성 위염 환자군-장형 위암 환자군 간의 빈도차이를 비교하여 어떠한 대립인자가 또는 어떠한 유전자형이 질병과 연관되어 있는지를 분석한 결과이다. 이러한 연관성 분석은 대립인자형(allele model), 공동-우성 유전자형(Co-dominant model), 우성 유전자형(Dominant model), 그리고 열성 유전자형(Recessive model)이라는 네 가지 항목에서 실시하였다. 표 7a 내지 표 7e에 있어서 각 칼럼이 의미하는 바는 다음과 같다. Table 7a to table 7e compare the frequency differences between the two alleles that form a single base polymorphism and the three types of genotypes resulting from chronic gastritis patients and gastrointestinal cancer patients, and which alleles or genotypes It is the result of analyzing whether it is related to the disease. This association analysis was carried out in four categories: the allele model, the co-dominant genotype, the dominant genotype, and the recessive model. In Tables 7a to 7e, the meaning of each column is as follows.

ㆍ서열번호는 각각의 단일염기다형을 나타내는 번호이다. The sequence number is a number representing each monobasic polymorphism.

ㆍHWE는 하아디-와인버그 평형 (Hardy-Weinberg Equilibrium)의 상태를 나타내는 것이다. 카이제곱 (df=1) 검정에서 chi-value = 3.84(p-value=0.05, df=1)을 기준으로, 3.84보다 큰 경우에는 하아디-와인버크 평형 HWE (Hardy-Weinberg Equilibrium)으로 판단하고, 3.84보다 작은 경우에는 하아디-와인버그 비평형 (Hardy-Weinberg Disequilibrium)으로 판단하였다. 이는 만성 위염 환자군과 장형 위암 환자군 모두에서 관찰된 각 단일염기다형들의 유전적 표지인자로서의 기능과 이들로 이루어지는 유전자형의 분포가 일반적인 것들인지를 나타낸다.HWE represents the state of the Hardy-Weinberg Equilibrium. Based on chi-value = 3.84 (p-value = 0.05, df = 1) in the chi-square (df = 1) test, it is judged by Hardy-Weinberg equilibrium HWE (Hardy-Weinberg Equilibrium) if it is greater than 3.84, If less than 3.84, it was judged as Hardy-Weinberg Disequilibrium. This indicates that the genotype and distribution of genotypes of the individual monobasic polymorphisms observed in both patients with chronic gastritis and intestinal gastric cancer are common.

ㆍ오즈 비율 (odds ratio, OR)은 장형 위암 환자 중에서 위험 대립인자를 가질 오즈에 대한 만성 위염 환자군 중에서 위험 대립인자를 가질 오즈의 비율을 나타낸다. 95% 신뢰하한과 95% 신뢰상한은 오즈 비율의 95% 신뢰구간을 나타낸다. 신뢰구간은 1을 포함하는 경우에는 질병과의 연관성을 유의하다고 판단할 수 없다. 오즈비율이 1보다 크고 95%신뢰구간에 1이 포함되어 있지 않은 경우 소수 대립인자가 질환군에서 더 높은 빈도를 가지며, 오즈비율이 1보다 작은 경우에는 다수 대립인자가 질환군에서 더 높은 빈도를 가진다. The odds ratio (OR) represents the ratio of odds of risk allele among the group of chronic gastritis patients to odds of risk allele among patients with gastric cancer. The 95% confidence bounds and 95% confidence bounds represent the 95% confidence intervals for the odds ratio. Confidence intervals with a value of 1 cannot be considered significant for association with disease. If the odds ratio is greater than 1 and does not include 1 in the 95% confidence interval, minor alleles have a higher frequency in the disease group. If the odds ratio is less than 1, the majority allele has a higher frequency in the disease group. Have

ㆍ연관성 분석에서 Logit p value는 오즈비에 대한 통계적 유의성을 나타낸다. 이는 오즈비 만큼 만성 위염 환자군과 장형 위암 환자군간에 차이에 대한 유의성을 뜻한다. 여기서 유의성있는 차이가 있다고 판단하는 기준은 Logit_p value가 0.05 이하의 값을 나타낼 경우이다. 이러한 값은 로지스틱 회귀(logistic regression) 분석을 통하여 얻었다. In the correlation analysis, the logit p value represents the statistical significance of the odds ratio. This means that the difference between the patients with chronic gastritis and the patients with gastrointestinal cancer is as significant as the odds ratio. The criterion for determining that there is a significant difference is when the Logit_p value indicates a value of 0.05 or less. These values were obtained through logistic regression analysis.

ㆍ위험 대립 인자 (risk allele)는 p-value에 유의성이 있는 경우 오즈비가 1 보가 크면 소수 대립인자, 오즈비가 1 보다 작으면 다수 대립인자로 하였다.The risk allele was a small allele if the odds ratio was greater than 1 when the p-value was significant and a majority allele if the odds ratio was less than 1.

ㆍ각 네가지 분석모델에 따른 질병과의 유의성을 분석함에 있어 다수 대립인자를 A1이라 하고 소수 대립인자를 A2라 하면 우성 (Dominant) 모형은 A1A1의 빈도와 A1A2와 A2A2의 빈도에 대한 합을 비교하는 것이고, 열성 (Recessive) 형질 모형은 A1A1과 A1A2의 합과 A2A2의 빈도를 비교하는 것이고, 공우성 (Co-Dominant) 형질 모형은 A1A1, A1A2 및 A2A2에 대한 각각의 빈도를 비교하는 것이다. In analyzing the significance of the disease according to the four analysis models, if the majority allele is A1 and the minor allele is A2, the dominant model compares the sum of the frequency of A1A1 with the frequency of A1A2 and A2A2. The recessive trait model compares the sum of A1A1 and A1A2 with the frequency of A2A2, and the co-dominant trait model compares the respective frequencies for A1A1, A1A2 and A2A2.

각 네가지 분석모델에 대한 연관성 분석은 대립형질들이 만성 위염 환자군과 장형 위암 환자군 사이에서 출현되는 빈도수의 비율을 통하여 위험 대립인자가 장형 위암에 어느 정도의 연관성 (위험도)를 가지고 있는지 확인할 수 있다. 95% 신뢰구간에 1을 포함하고 있을 경우 유의성이 없기 때문에, 상기 표 7a 내지 표 7e로부터 확인할 수 있는 바와 같이, 서열번호 1 내지 6, 10 내지 38, 41, 및 44 내지 45의 다형성 부위의 염기가 각각 C, G, A, A, C, C, T, A, A, G, G, A, G, G, G, T, C, A, A, G, C, A, T, G, C, C, A, C, A, T, A, C, A, C, T, C, G, T (위험 대립인자)인 것이 하나 이상 존재하는 경우, 장형 위암에 걸릴 가능성이 높은 것으로 판단할 수 있다. Correlation analysis for each of the four analytical models shows how risk alleles have an association (risk) with intestinal gastric cancer, based on the ratio of the frequencies of alleles between the patients with chronic gastritis and those with gastric cancer. Since there is no significance when 1 is included in the 95% confidence interval, the bases of the polymorphic sites of SEQ ID NOs: 1 to 6, 10 to 38, 41, and 44 to 45, as can be seen from Tables 7a to 7e above. Are C, G, A, A, C, C, T, A, A, G, G, A, G, G, G, T, C, A, A, G, C, A, T, G, If one or more of C, C, A, C, A, T, A, C, A, C, T, C, G, T (risk alleles) is present, you may be more likely to develop gastric cancer. Can be.

서열번호 19는 공우성 유전자형(co-dominant genotype), 우성 유전자형(dominant genotype), 열성 유전자형(recessive genotype)에서 유의성을 나타내며, 서열번호 6, 서열번호 21, 서열번호 26 내지 27, 서열번호 44 내지 45는 공우성 유전자형(co-dominant genotype), 우성 유전자형(dominant genotype)에서 유의성을 보이며, 서열번호 2 내지 5, 서열번호 11, 서열번호 20, 서열번호 22는 공우성 유전자형(co-dominant genotype), 열성 유전자형(recessive genotype)에서 유의성을 나타내며, 서열번호 10, 서열번호 12 내지 13, 서열번호 17, 서열번호 23 내지 25, 서열번호 37 내지 38, 서열번호 41은 우성 유전자형(dominant genotype)에서 유의성을 보이며, 서열번호 1, 서열번호 15, 서열번호 28 내지 30, 서열번호 32 내지 36은 열성 유전자형(recessive genotype)에서 유의성을 나타내고, 서열번호 31은 공우성 유전자형(co-dominant genotype)에서 유의성을 나타내었다. 상기 결과로 부터 서열번호 1 내지 6, 서열번호 10 내지 13, 서열번호 15, 서열번호 17, 서열번호 19 내지 38, 서열번호 41, 서열번호 44 내지 45의 다형성 부위가 각각 유전자형이 C/C(서열번호 1), G/G(서열번호 2), A/A(서열번호 3), A/A(서열번호 4), C/C(서열번호 5), T/C와 C/C(서열번호 6), T/T(서열번호 10), A/A와 A/G(서열번호 11), G/A와 A/A(서열번호 12), G/G(서열번호 13), A/A(서열번호 15), A/G와 G/G(서열번호 17), C/T와 T/T(서열번호 19), C/C(서열번호 20), A/A(서열번호 21), A/A(서열번호 22), G/G(서열번호 23), C/C(서열번호 24), A/A(서열번호 25), C/T와 T/T(서열번호 26), T/G와 G/G(서열번호 27), C/C와 C/T(서열번호 28), C/C와 C/T(서열번호 29), A/A와 A/G(서열번호 30), T/C와 C/C(서열번호 31), A/A와 A/G(서열번호 32), T/T와 T/C(서열번호 33), A/A와 A/G(서열번호 34), C/C와 C/A(서열번호 35), A/A와 A/G(서열번호 36), T/C와 C/C(서열번호 37), C/T와 T/T(서열번호 38), T/C와 C/C(서열번호 41), T/G와 G/G(서열번호 44), C/T와 T/T(서열번호 45)가 하나 이상 존재하는 경우 장형 위암에 걸릴 위험도가 높은 것으로 판단할 수 있다. 상기 위험 유전자형을 갖는 핵산 서열이 하나의 검체에서 많이 검출되면 될수록 위험군에서 속하는 확률은 더 높은 것으로 판단할 수 있다. SEQ ID NO: 19 shows significance in co-dominant genotype, dominant genotype, recessive genotype, and SEQ ID NO: 6, SEQ ID NO: 21, SEQ ID NO: 26 to 27, SEQ ID NO: 44 to 45 is significant in the co-dominant genotype, dominant genotype, SEQ ID NO: 2 to 5, SEQ ID NO: 11, SEQ ID NO: 20, SEQ ID NO: 22 is the co-dominant genotype , Significance in recessive genotype, SEQ ID NO: 10, SEQ ID NO: 12 to 13, SEQ ID NO: 17, SEQ ID NO: 23 to 25, SEQ ID NO: 37 to 38, SEQ ID NO: 41 are significant in the dominant genotype SEQ ID NO: 1, SEQ ID NO: 15, SEQ ID NO: 28 to 30, SEQ ID NO: 32 to 36 indicate significance in the recessive genotype, and SEQ ID NO: 31 to the co-dominant genotype notype). From the above results, the polymorphic sites of SEQ ID NOs: 1 to 6, SEQ ID NOs: 10 to 13, SEQ ID NO: 15, SEQ ID NO: 17, SEQ ID NO: 19 to 38, SEQ ID NO: 41, SEQ ID NO: 44 to 45, respectively, were genotypes C / C ( SEQ ID NO: 1), G / G (SEQ ID NO: 2), A / A (SEQ ID NO: 3), A / A (SEQ ID NO: 4), C / C (SEQ ID NO: 5), T / C, and C / C (SEQ ID NO: Number 6), T / T (SEQ ID NO: 10), A / A and A / G (SEQ ID NO: 11), G / A and A / A (SEQ ID NO: 12), G / G (SEQ ID NO: 13), A / A (SEQ ID NO: 15), A / G and G / G (SEQ ID NO: 17), C / T and T / T (SEQ ID NO: 19), C / C (SEQ ID NO: 20), A / A (SEQ ID NO: 21) , A / A (SEQ ID NO: 22), G / G (SEQ ID NO: 23), C / C (SEQ ID NO: 24), A / A (SEQ ID NO: 25), C / T and T / T (SEQ ID NO: 26), T / G and G / G (SEQ ID NO: 27), C / C and C / T (SEQ ID NO: 28), C / C and C / T (SEQ ID NO: 29), A / A and A / G (SEQ ID NO: 30) ), T / C and C / C (SEQ ID NO: 31), A / A and A / G (SEQ ID NO: 32), T / T and T / C (SEQ ID NO: 33), A / A and A / G (SEQ ID NO: Number 34), C / C and C / A (SEQ ID NO: 35), A / A and A / G (SEQ ID NO: 36), T / C and C / C (SEQ ID NO: 37), C / T and T / T (SEQ ID NO: 38), T / C and C / C (SEQ ID NO: 41), T / G and G / G (SEQ ID NO: 44), C / T If more than one T / T (SEQ ID NO: 45) is present, it may be determined that there is a high risk of developing gastric cancer. The more the nucleic acid sequence having the risk genotype is detected in one sample, the higher the probability of belonging to the risk group can be determined.

(3) 반수체형 분석(3) haploid analysis

표 8a, 표 8b 및 도 1a 내지 도 10a는 서열번호 1 및 2, 서열번호 4 및 5, 서열번호 6 내지 8, 서열번호 9 및 10, 서열번호 18 및 19, 서열번호 21 및 22, 서열번호 27 및 28, 서열번호 30 내지 36, 서열번호 39 및 40, 서열번호 42 내지 45에 대한 다형성 부위간의 연관성을 나타내며, 도 1b 내지 도 10b는 이를 토대로, 도 1a 내지 도 10a로부터 형성된 연관 비평형 블록(Linkage Disequilibrium Block)으로부터 유전자형의 분석결과와 대비하여 형성된 반수체형(Haplotype)을 나타낸다. 표 9a 내지 표 9d는 각 블록의 반수체형에 대한 장형 위암과의 연관성 검정 결과이다.Tables 8a, 8b and FIGS. 1a to 10a show SEQ ID NOs: 1 and 2, SEQ ID NOs 4 and 5, SEQ ID NOs 6 to 8, SEQ ID NOs 9 and 10, SEQ ID NOs 18 and 19, SEQ ID NOs 21 and 22, SEQ ID NOs: 27 and 28, SEQ ID NO: 30 to 36, SEQ ID NO: 39 and 40, SEQ ID NO: 42 to 45 shows the association between the polymorphic sites, Figures 1B to 10B based on this, the association non-equilibrium block formed from Figures 1A to 10A The haplotype is formed in comparison with the result of genotyping from the Linkage Disequilibrium Block. Tables 9a to 9d show the results of the association test with enteric gastric cancer for the haplotype of each block.

표 8a 및 표 8b는 도 1a 내지 도 10a에 나타난 각 단일염기다형들 사이의 연관비평형 블록의 조밀함을 수치화하여 나타낸 것으로 D' 값과 r²값을 나타내고 있다. D' 값이 0.8 이상이 될 때, 각 단일염기다형들은 연관비평형 관계에 있다고 판단하며, 도 1a 내지 도 10a에서 보듯이 붉은 색으로 표시된다. 이러한 D' 값의 범위는 0 ～ 1까지 이며, 각 단일염기다형들 사이의 게놈 DNA 상에서의 떨어져 있는 거리에 대한 계산이 포함되어 있다. 또한 r² 값은 각 단일염기다형들 간의 상관관계를 나타내는 것으로 r² 값이 0.3 이상이면 그 값에 해당하는 단일염기다형들이 매우 조밀하게 묶여있다는 것을 뜻한다. 이는 앞서 언급한 도 1a 내지 도 10a와 표 8a 및 8b에서 뜻하는 바는 1 및 2, 서열번호 4 및 5, 서열번호 6 내지 8, 서열번호 9 및 10, 서열번호 18 및 19, 서열번호 21 및 22, 서열번호 27 및 28, 서열번호 30 내지 36, 서열번호 39 및 40, 서열번호 42 내지 45의 단일염기다형들이 서로 밀접하게 연관되어 있어 유전적 단위체를 형성한다는 것이다. 또한 이렇게 형성된 단위체들은 다음 세대로 유전될 때, 같이 묶여서 전달 되게 된다. 따라서 연관비평형 블록들이 환자군 특이적으로 나타나 질병을 일으키는데, 많은 영향을 미칠 경우, 그 블록 자체가 다음 세대로 전달되기 때문에, 다음 세대의 자손에 대한 질병이 일어날 수 있는 빈도를 예측할 수 있다. Tables 8a and 8b quantify the compactness of the associative non-equilibrium blocks between the respective polybasic polymorphs shown in Figs. 1a to 10a and show the values of D 'and r ² . When the D 'value is more than 0.8, each of the monobasic polymorphisms is determined to be in an associative non-equilibrium relationship, and is shown in red as shown in FIGS. 1A to 10A. These D 'values range from 0 to 1, and include calculations of the distance on genomic DNA between each monobasic polymorph. In addition, the r ² value represents the correlation between the single polymorphic polymorphisms. If the r ² value is 0.3 or more, it means that the single polymorphisms corresponding to the value are closely packed. 1 and 2, SEQ ID NOs: 4 and 5, SEQ ID NOs: 6 to 8, SEQ ID NOs: 9 and 10, SEQ ID NOs: 18 and 19, SEQ ID NO: 21 And single base polymorphisms of 22, SEQ ID NOs: 27 and 28, SEQ ID NOs: 30 to 36, SEQ ID NOs: 39 and 40, and SEQ ID NOs: 42 to 45 are closely related to each other to form a genetic unit. In addition, these formed units are bundled together and transferred when passed to the next generation. Thus, non-equilibrium blocks appear to be patient-specific and cause disease, and if so affected, the block itself is passed on to the next generation, thus predicting the frequency of disease in the offspring of the next generation.

연관비평형 블록들로부터 형성되는 유전적 단위체는 각 단일염기다형의 대립인자형 분석으로부터 그 유형이 결정되는데, 이를 반수체형(haplotype)이라고 한다 (도 1b 내지 도 10b). 도 1b 내지 도 10b는 각각 도 1a 내지 도 10a로부터 형성된 연관비평형 블록으로부터 유전자형의 분석결과와 대비하여 형성된 반수체형이다. 도 1a 내지 도 10a에서 나타난 한 개의 연관비평형 블록으로부터 각각 도 1b 내지 도 10b에 상응하는 반수체형이 나타나며, 각각의 블록에서 환자군에서 특이적으로 많이 나타나는 반수체형과 정상인군에서 특이적으로 많이 나타나는 반수체형이 관찰 되었다(표 9a 내지 표 9d). 반수체형은 다음 세대로 유전되기 때문에, 이러한 반수체형이 질병과 연관되는지를 분석할 필요가 있다. 총체적으로 연관비평형블록과 반수체형에 대한 질병관련 분석은 지금 세대에서의 질병의 예측 뿐 만아니라, 그 질병의 다음 세대로의 전달까지 예측할 수 있게 한다. 표 9a 내지 표 9d는 도 1b 내지 도 10b의 반수체형들의 빈도수가 환자군과 정상인군 사이에서 나타나는 빈도에 유의성 있는 차이를 조사한 표이다. Genetic units formed from associative non-equilibrium blocks are determined from an allele analysis of each monobasic polymorph, which is called a haplotype (FIGS. 1B-10B). 1b to 10b are haplotypes formed in comparison with genotyping results from associative non-equilibrium blocks formed from FIGS. 1a to 10a, respectively. The haplotypes corresponding to FIGS. 1B to 10B are shown from one of the associative non-equilibrium blocks shown in FIGS. 1A to 10A, respectively. Haploid forms were observed (Tables 9A-9D). Since haplotypes are passed down to the next generation, it is necessary to analyze whether these haplotypes are associated with disease. Overall, disease-related analyzes of associative non-equilibrium blocks and haplotypes allow us to predict not only the disease in the current generation, but also the transmission of the disease to the next generation. Tables 9a to 9d are tables that examine the significant difference in the frequency of the haplotypes of FIGS. 1b to 10b between the patient group and the normal group.

서열번호 1 및 2, 서열번호 4 및 5, 서열번호 6 내지 8, 서열번호 9 및 10, 서열번호 18 및 19, 서열번호 21 및 22, 서열번호 27 및 28, 서열번호 30 내지 36, 서열번호 39 및 40, 서열번호 42 내지 45의 DNA 서열의 다형성 부위로부터 결정된 반수체형 분석결과, 서열번호 1 및 2의 DNA 서열이 C-G 반수체형을 둘 다 가지는 경우(즉, 이배체형을 가지는 경우), 그렇지 않은 사람보다 장형위암에 걸릴 위험이 높은 것으로 판정할 수 있으며, A-A 반수체형을 하나라도 나타내는 경우 장형 위암에 저항성을 갖는 것으로 판정할 수 있고, 서열번호 4 및 5의 DNA 서열이 A-C 반수체형을 갖는 경우, 그렇지 않은 사람보다 장형위암에 걸릴 위험이 높은 것으로 판정할 수 있으며, G-G 반수체형을 나타내는 경우 장형 위암에 저항성을 갖는 것으로 판정할 수 있고, 서열번호 6 내지 8 의 DNA 서열이 C-A-C 반수체형을 하나라도 갖는 경우, 그렇지 않은 사람보다 장형위암에 걸릴 위험이 높은 것으로 판정할 수 있고, 서열번호 9 및 10의 DNA 서열이 C-C 반수체형을 하나라도 갖는 경우, 그렇지 않은 사람보다 장형위암에 저항성을 갖는 것으로 판정할 수 있고, 서열번호 18 및 19의 DNA 서열이 G-T 반수체형을 하나라도 갖는 경우, 그렇지 않은 사람보다 장형위암에 걸릴 위험이 높은 것으로 판정할 수 있으며, A-C 반수체형을 나타내는 경우, 장형 위암에 저항성을 갖는 것으로 판정할 수 있고, 서열번호 21 및 22 의 DNA 서열이 A-A를 둘다 갖는 경우, 장형위암에 걸릴 위험이 높은 것으로 판정할 수 있으며, G-G 반수체형을 하나라도 갖는 경우, 장형위암에 저항성을 갖는 것으로 판정할 수 있고, 서열번호 27 및 28 의 DNA 서열이 G-C 반수체형을 하나라도 갖는 경우, 그렇지 않은 사람보다 장형위암에 걸릴 위험이 높은 것으로 판정할 수 있으며, T-T 반수체형을 둘다 갖는 경우, 장형위암에 저항성을 갖는 것으로 판정할 수 있고, 서열번호 30 내지 36의 DNA 서열이 G-T-G-G-A-G 반수체형을 둘다 갖는 경우, 그렇지 않은 사람보다 장형위암에 저항성을 갖는 것으로 판정할 수 있고, 서열번호 39 내지 40의 DNA 서열이 T-T 반수체형을 하나라도 갖는 경우, 그렇지 않은 사람보다 장형위암에 걸릴 위험이 높은 것으로 판정할 수 있고, 서열번호 42 내지 45의 DNA 서열이 A-A-T-C 반수체형을 둘다 갖는 경우, 그렇지 않은 사람보다 장형위암에 걸릴 위험이 높은 것으로 판정할 수 있다.SEQ ID NO: 1 and 2, SEQ ID NO: 4 and 5, SEQ ID NO: 6 to 8, SEQ ID NO: 9 and 10, SEQ ID NO: 18 and 19, SEQ ID NO: 21 and 22, SEQ ID NO: 27 and 28, SEQ ID NO: 30 to 36, SEQ ID NO: Haplotype analysis determined from the polymorphic sites of the DNA sequences 39 and 40, SEQ ID NOs: 42-45, indicates that the DNA sequences of SEQ ID NOs: 1 and 2 have both CG haplotypes (ie, have a diploid); It may be determined that the risk of developing gastrointestinal cancer is higher than that of the non-human. If any AA haplotype is expressed, it may be determined to be resistant to gastrointestinal gastric cancer, and the DNA sequences of SEQ ID NOs: 4 and 5 may have an AC haplotype. In this case, it may be determined that the risk of developing gastrointestinal cancer is higher than that of the non-human. If the GG haplotype is shown, it may be determined to be resistant to gastrointestinal gastric cancer, and the DNA sequence of SEQ ID NOs. If you have at least one CAC haploid type, you can determine that you are at higher risk for gastric cancer than those who do not. If the DNA sequences of SEQ ID NOs. If the DNA sequence of SEQ ID NOs: 18 and 19 has any GT haplotype, it can be determined that the risk of developing gastrointestinal cancer is higher than that of the other person, and the AC haplotype is shown. In this case, it may be determined that resistance to intestinal gastric cancer, and if the DNA sequence of SEQ ID NO: 21 and 22 have both AA, it may be determined that the risk of developing gastrointestinal cancer is high, and if any one of the GG haplotype When the DNA sequence of SEQ ID NOs: 27 and 28 has any GC haplotype, It may be determined that the risk of developing gastrointestinal cancer is higher than that of the non-human, and if both TT haplotypes are present, it may be determined that it is resistant to gastrointestinal gastric cancer, and the DNA sequence of SEQ ID NOs: 30 to 36 is both If you have, you can be determined to be more resistant to gastric cancer than those who do not, and if the DNA sequence of SEQ ID NO: 39 to 40 has any TT haploid type, it is determined that the risk of developing gastrointestinal cancer is higher than those who do not If the DNA sequences of SEQ ID NOs: 42 to 45 have both AATC haplotypes, it may be determined that the risk of developing gastrointestinal cancer is higher than that of those who do not.

도 1a 및 1b는 각각 서열번호 1 및 2의 단일염기다형들 간의 연관비평형블록 및 상기 연관비평형블록으로부터 형성된 반수체형을 나타낸다.1A and 1B show associative non-equilibrium blocks between the monobasic polymorphs of SEQ ID NOs: 1 and 2 and haplotypes formed from the associative non-equilibrium blocks, respectively.

도 2a 및 2b는 각각 서열번호 4 및 5의 단일염기다형들 간의 연관비평형블록 및 상기 연관비평형블록으로부터 형성된 반수체형을 나타낸다.2A and 2B show associative non-equilibrium blocks between the monobasic polymorphs of SEQ ID NOs: 4 and 5 and haplotypes formed from the associative non-equilibrium blocks, respectively.

도 3a 및 3b는 각각 서열번호 6 내지 8의 단일염기다형들 간의 연관비평형블록 및 상기 연관비평형블록으로부터 형성된 반수체형을 나타낸다.3A and 3B show associative non-equilibrium blocks between the monobasic polymorphs of SEQ ID NOs: 6-8 and haplotypes formed from the associative non-equilibrium blocks, respectively.

도 4a 및 5b는 각각 서열번호 9 및 10의 단일염기다형들 간의 연관비평형블록 및 상기 연관비평형블록으로부터 형성된 반수체형을 나타낸다.4A and 5B show associative non-equilibrium blocks between the monobasic polymorphs of SEQ ID NOs: 9 and 10 and haplotypes formed from the associative non-equilibrium blocks, respectively.

도 5a 및 5b는 각각 서열번호 18 및 19의 단일염기다형들 간의 연관비평형블록 및 상기 연관비평형블록으로부터 형성된 반수체형을 나타낸다.5A and 5B show associative non-equilibrium blocks between the monobasic polymorphs of SEQ ID NOs: 18 and 19 and haplotypes formed from the associative non-equilibrium blocks, respectively.

도 6a 및 6b는 각각 서열번호 21 및 22의 단일염기다형들 간의 연관비평형블록 및 상기 연관비평형블록으로부터 형성된 반수체형을 나타낸다.6A and 6B show associative non-equilibrium blocks between the monobasic polymorphs of SEQ ID NOs: 21 and 22 and haplotypes formed from the associative non-equilibrium blocks, respectively.

도 7a 및 7b는 각각 서열번호 27 및 28의 단일염기다형들 간의 연관비평형블록 및 상기 연관비평형블록으로부터 형성된 반수체형을 나타낸다.7A and 7B show associative non-equilibrium blocks between the monobasic polymorphs of SEQ ID NOs: 27 and 28 and haplotypes formed from the associative non-equilibrium blocks, respectively.

도 8a 및 8b는 각각 서열번호 30 내지 36의 단일염기다형들 간의 연관비평형블록 및 상기 연관비평형블록으로부터 형성된 반수체형을 나타낸다.8A and 8B show associative non-equilibrium blocks between the monobasic polymorphs of SEQ ID NOs: 30 to 36 and haplotypes formed from the associative non-equilibrium blocks, respectively.

도 9a 및 9b는 각각 서열번호 39 및 40의 단일염기다형들 간의 연관비평형블록 및 상기 연관비평형블록으로부터 형성된 반수체형을 나타낸다.9A and 9B show associative non-equilibrium blocks between the monobasic polymorphs of SEQ ID NOs: 39 and 40 and haplotypes formed from the associative non-equilibrium blocks, respectively.

도 10a 및 10b는 각각 서열번호 42 내지 45의 단일염기다형들 간의 연관비평 형블록 및 상기 연관비평형블록으로부터 형성된 반수체형을 나타낸다.10A and 10B show associative equilibrium blocks between the monobasic polymorphs of SEQ ID NOs: 42 to 45 and haplotypes formed from the associative non equilibrium blocks, respectively.

<110> College of Medicine Pochon CHA University Industry-Academic Cooperation Foundation <120> Polynucleotides derived from PRKCI, MAPK10, SPP1, IQGAP2, FGFR4, NOTCH4, HLA-DRA, HLA-DOA, THBS2, DFNA5, TBXAS1, TNKS, CDH17, UBR5, KIAA0196, and NSMCE2 genes comprising single nucleotide polymorphisms, microarrays and diagnostic kits comprising the same, and analytic methods using the same <130> PN0316 <160> 45 <170> KopatentIn 1.71 <210> 1 <211> 1001 <212> DNA <213> Homo sapiens <400> 1 tccaaattga tttctttgca tgtggatatt cagttatccc accatcattt gttgaaaaga 60 ctatttttct cccatttaat tgtttgcaca attgtcaaaa atcaaatgac caaaatttaa 120 gggtttattc ctggatttta aattctattc tattgattga tttgtctatc ctttgacaat 180 aatcacagtg tcttcattac tgtagttttt tagtaacttt taaaacaagg aagtgtgagt 240 cttacaactt tttttttttt caagagtgtc ttggctattc tggatccctt acattttcat 300 gtgaatttta ggatcaactt gtcaatttct ttaagaaagc cagctgagat tttgataacg 360 attgccttga atctgtgaat cagtttgtgt ggtttttgtc atgttaacag tattaaacct 420 tccaatccat gaacatggga taacttttca tttatttagg cagcttcaat tgctttctca 480 ttttcagagt atacattttg macttctttt actcttaaat attttattct ttataatgct 540 attgtaaaca aattgttttc ttagtttcct ttttgaatta ttcacttcta ctatagagaa 600 atgcagttga tttttgcata ttgttgatat tgtattgcag aacttgttta ttaggtgaat 660 tccttaggat ttcctgcata taagactgtc attttttttt tttttttttt tgagatggag 720 tctcgctctg ttgcccaggc tggagtgcag tggcacgatc tgggctgact gcaagctccg 780 cctcctgggc tcatgccagt ctcctgcctc agcctcctga gtagctggga ctacaggcac 840 tcgccaccat gcccagctaa ttttttgtat ttttagtaga gacagggttt cactgtgtta 900 gccaggatgg tctcaatctc ctgacctcgt gatccacccg cctcggcctc ccaaagtgct 960 gggattacag gcgtgagcca ctgtgaccag ccaagactgt c 1001 <210> 2 <211> 1001 <212> DNA <213> Homo sapiens <400> 2 tctcactgtg cattgtgttg cccagggacg gagtctcgct ggagtgcagt ggtgcagtct 60 cagaggctca cagcaacctc tgcctcctgg gtttaagcga ttctcgtgcc tcagcctccg 120 gagtagctgg gattacaggt gcccgtatat tttataatat attccataat taatcgctcc 180 gtattaaact tgctgatgca ggtctccaat tactgaaaaa cctgatacca agtttttcaa 240 tgggattttt ttgtttgttt aagagatggg gtctcaccat gttgctcagg ctggccttga 300 actcctgggc tcaagtgatc ctccagcctc agcctctcaa aggctgggac tacaggcatg 360 tagcactttg cctgactcaa ggggaatttt tagtaagagt atataatcca agtctgtagc 420 agttggagga tgtcgtacca cgctgaagtg tcacacataa ctgatgactc agtcccccaa 480 gcctatttta tcatcgtaaa raaagctatt tctttgatta ttccaactct ttcagcacca 540 ggactgctgc aggggctgga ttcatgctac ctgtagttgc agtgtaaatt atgtgtcaga 600 gccaagatcc taggaaagtc aaagactagg ttacaagagc gattcttttt tttttttttt 660 tttttttttt ttgagacaga gtcttgctct gtcgcccagg ctggagtgca gtggcacagt 720 cttggctcac tgcaacctcc gcctccctgg ttcacgccat tctcctgcct tagcctcccg 780 agtagctggg actacaggcg cccgccacca cgcccggcta attttttgtg tttttagtag 840 agacggggtt tcaccgtgtt agccaggatg gtctcgatct cctgaccttg tgatccgccc 900 gcctcggcct cccaaagtgc tgggattaca ggcttgagcc accgcgcccg gccaaggacg 960 attctttcta atctttggta agccagtttt tacagaatga a 1001 <210> 3 <211> 1001 <212> DNA <213> Homo sapiens <400> 3 agctagcctg aacaaaaaga acaaaactgg agggatcaca ttctatgcct ttaaattata 60 ctacaaactg tagtaaccaa agcaggatgg tgctggcata aaaacagaca cataaaccga 120 tggaacagaa gagagaaccc agaaataaat ccatacatct agagtgaact cattttttac 180 aaaggtgcca agaacataac actggggaaa agacagcctt caataaataa gactgtcaat 240 aaatagtgct gggaaaacaa gatatccaca tgcagaagaa tgaatgaaac tagaccccta 300 tttctcgcca tatacaaaaa tcaaatcaaa atgaattaaa gacttaaatc caaaacctca 360 aactatgcaa ctactaaaag aaaaaactgt ggaaactctc catgacactg gtccaggcaa 420 gtaattcttg agtagcaccc cgcaatcaca ggtaatcaaa gcaaaaatag gtaagtggga 480 tcacatcaac ttaggtttct rcaaaacaaa gcaatcaaaa aaatgaagag acaacccaca 540 gaatgggaga aaatatttgc acactatgca tctgacaaag gattaataac cagaatagac 600 aaggagttca aacaactcaa taggaaaaaa aaaacctaat aatctgatta aataatgggc 660 aaaagatatg aatagacatt tttccttttt tttttttgga aacggatctt gctctgtcac 720 ccaggctgga gtgcagtggc gtaatcttgg ctcactgcaa cctctgcctc ctgggttcaa 780 gtagttctac tgcctcggcc tcccaagtag ctgggattgc aggcacgtgc caccatgccc 840 agctaatttt tgtagtttta gtagagacag ggtttcacca tgttggcctg gctggtctgg 900 aactcctgac ctcaagtgat ccgcccgcct cggcctccca aagtggtggg attacaagcg 960 tgagacacca tccctagccc tgaatagaca ttttttttta t 1001 <210> 4 <211> 501 <212> DNA <213> Homo sapiens <400> 4 ggcattttta acaatcaggt tatctgcaat attaaccaga aaaaaaatgg tagcaaaata 60 gagaaaatgg gccattttta ttccagggga caagctgcac aaaggaatgt tcttctattt 120 attttaaaca aatgactgcg tgtactgaat ctgactgtgt gaaataatct cagaatggca 180 gcaccactgg catggcgatg gtgcaggtgg gtgcagttcc ctgtggtctc tattgcttga 240 agagagaaag raagttccct attattatat ttaaggcagt tttcagagca ctggcattct 300 tgtttgctct gttttgggga tattctattt gccaatcagt tcacactcat caaccatcca 360 cttcctgtct ttcacttctt caggtgaaat atcaccactc tagtgagcac caacatggtc 420 agagtgcaaa taactgtgat ggatacttcg aagtgaataa gaagggaagc ctgattattg 480 ctgagatcac cttggtattg c 501 <210> 5 <211> 501 <212> DNA <213> Homo sapiens <400> 5 tggtccattt catgtgatct attactctga cataaaccca tctgtaatat attgccagta 60 tataagctgt ttagtttgtt aattgattaa actgtatgtc ttataagaaa acatgtaaag 120 ggggaatata tggggggagt gagctctctc agacccttga agatgtagct tccaaatttg 180 aatggattaa atggcacctg tataccaatt tgtagaaaga acatatgtga tacttatgta 240 aagtgtgggg sgagtgggta acagttttca ggcacaaaat ggtttggcct tctgaaggca 300 ggtgtgaata aaagctgaga gtcttttctg tgacagtaaa gagtaggaga gactggaggg 360 tagtgtgtgg tgtggactaa gactgaaagg aggttattac ccagatacta ggagacactg 420 gtgtccctgt tttagagatt tggctctata cccatttgtc atttctcaat tactggtcac 480 aacaagaggc cactggagaa a 501 <210> 6 <211> 601 <212> DNA <213> Homo sapiens <400> 6 gatgacctaa aagctagaga gtggaaaagg attaccatat tcccatccct agccgttcat 60 ataattattc ttcatttgtg ccgtgattca gtaccctgat gctacagacg aggacatcac 120 ctcacacatg gaaagcgagg agttgaatgg tgcatacaag gccatccccg ttgcccagga 180 cctgaacgcg ccttctgatt gggacagccg tgggaaggac agttatgaaa cgagtcagct 240 ggatgaccag agtgctgaaa cccacagcca caagcagtcc agattatata agcggaaagc 300 yaatgatgag agcaatgagc attccgatgt gattgatagt caggaacttt ccaaagtcag 360 ccgtgaattc cacagccatg aatttcacag ccatgaagat atgctggttg tagaccccaa 420 aagtaaggaa gaagataaac acctgaaatt tcgtatttct catgaattag atagtgcatc 480 ttctgaggtc aattaaaagg agaaaaaata caatttctca ctttgcattt agtcaaaaga 540 aaaaatgctt tatagcaaaa tgaaagagaa catgaaatgc ttctttctca gtttattggt 600 t 601 <210> 7 <211> 601 <212> DNA <213> Homo sapiens <400> 7 tgatagtcag gaactttcca aagtcagccg tgaattccac agccatgaat ttcacagcca 60 tgaagatatg ctggttgtag accccaaaag taaggaagaa gataaacacc tgaaatttcn 120 tatttctcat gaattagata gtgcatcttc tgaggtcaat taaaaggaga aaaaatacaa 180 tttctcactt tgcatttagt caaaagaaaa aatgctttat agcaaaatga aagagaacat 240 gaaatgcttc tttctcagtt tattggttga atgtgtatct atttgagtct ggaaataact 300 ratgtgtttg ataattagtt tagtttgtgg cttcatggaa actccctgta aactaaaagc 360 ttcagggtta tgtctatgtt cattctatag aagaaatgca aactatcact gtattttaat 420 atttgttatt ctctcatgaa tagaaattta tgtagangca aacaaaatac ttttacccac 480 ttaaaaagag aatataacat tttatgtcac tataatcttt tgttttttaa gttagtgtat 540 attttgttgt gattatcttt ttgtggtgtg aataaatctt ttatcttgaa tgtaataaga 600 a 601 <210> 8 <211> 601 <212> DNA <213> Homo sapiens <400> 8 caggccaatc caaaccaaat ctatcatgaa gaaggccaca aatcctagct ttatgttatt 60 aaagcagcaa aatgacaaac aaagtaaagt caaagtctta ctgtggttag cagttcttct 120 cataaagccc aaaacaccag attttcttct aatgatttca gatgtcaagt ggtaaaaatt 180 gattttacaa tgttcttgag taaataagta ctggggaaaa aaatgaggag gaaacatgtt 240 tgtttgggta aattgctaag caacaccaca catgccagaa actccttttg gatttcctct 300 mgaggtggag accgcgaaac caacgggtac ttcactttgt gtctgtttca tcagttcaca 360 ctggaaaata aaagccaaac aggcaattct aataatacaa gagtcagagg tgtggaggcc 420 actaatatta aattagactg aactgtttaa ggttgccttt taaaatggct ccttccagcg 480 tccacggtgg cactaagact gaggtgagat ttgtgcagcg tgtccaggaa gatctagcac 540 actgatgaaa aaaaaaacaa acagcagcca ggccatgtta tctatcccac tcggtatttc 600 c 601 <210> 9 <211> 801 <212> DNA <213> Homo sapiens <400> 9 aaatcttaca aacaacagac taatggctgg gaagtcacag agagcagtgg tgtcaatagt 60 accttttaca attccaatag aacatcatcg cataatcatg ggcaagcaac tacaatctgt 120 aaattataat cttttactaa atgaccttca gtattagccc caaaagtgac agaattaaaa 180 tgtggaatgt aatgaaacag gaaggtagaa tcaaactagc ttcaggatac tttgtgtgga 240 ctgagatcac caaaaaaagc ccagtgggaa actgccaaat gctgcaccat aattaagagc 300 aacactgaca ctgagtagat ttttcattgt ataaggcagt tttccaagta tacaaattga 360 gagttgtgtt gggattctgg caaggaaagt ctgtatctta yggaaatatc aatttccttc 420 atacgatcca ttacttactg ttgactttat taatattgcc ttggatttct gcttgtgtca 480 gcagttcttc ataagcatct ctttcttctt ctgaaataca gctattctgc aaaacaaagt 540 cttcactgat ttctgaggtt aaataaaata tttccaaaac tcgaaattca aatggaaaaa 600 tgtgaagagt gacaaaaacc attccctaat tatttaaatt cattgcaagc atttgtggag 660 tgcctgctct gtgcagggga ctaggggaga atgctggcaa agactacaaa gatgaatgac 720 agacccggtg actctttgaa aagggctaat tttcaaatgt ggaaaacatt agtttggacc 780 acttggtatt gccctttttt t 801 <210> 10 <211> 1293 <212> DNA <213> Homo sapiens <400> 10 ggtattgaga agtaacctcc agatagatcc gtttcattag aattgtaaaa gcaggtttca 60 tagtttgact tctgtgcttt tcatttgtga accctatagt cagtcttgtt ctctctttct 120 ctctctctct ctctctctct ctctatatat atatatatat atatatatat atatatatat 180 attttttttt tttttttttt tttttttttt ggtgggggga acagagtctt actctgtcac 240 ccaggctaga gtgcagtggt gcaatcttgg ctcactgcaa cctctgcctc ctggattcaa 300 gtgattctcc tacctcagcc tcccaagtag ctgggacttg aatagacact tcatttcttt 360 aaagatgatg tacaaaggga aaataaacac atgaaaagat gctcaaaatc attagtcact 420 agaaaaatac aaatccaaac cacagtatga taccacttca cacccactag gatggctgta 480 atgaaaaaga caataacaag tgttggtcag aatgtagaga gctactcagg aggctgaggc 540 acaagaatca cttgaacaca ggaggcggag gtggcagtga gccaagatca tgccacagca 600 ctccagcctg ggggacagag caagaatctc tcaaaaaaaa aaaataaaaa taaaaaccag 660 cagctgggac cttgatagaa gttgcattga atctgtaggt caatttaaga agtatttcca 720 tcttaacaat actaaatctt ccaattcatg aacattggct cttttcattt atttgtcttt 780 aagttttttc ccatggtgtt gtatagtttt cagtgtacat gtcttatgtt tctttcatta 840 aattatgtct aaatatttat ttataattat aatgtatttg tagttgcagt cttatgaatt 900 gaatgtgttc ttaattttat tttcagattg cttattatga gcgtatagac acataattgg 960 ttttgtgctt gcatcttgca tcctatagcc ttgctgtact agttaattag ccttttgcaa 1020 tcggtgatac taaagaagtt ataatgtact gtgacttaaa aggaaaacta cttcttcatt 1080 ttctaggagt tgyagctgta gggtacatca atgaagctat tgatgaaggg aatcctttga 1140 ggactttaga aactttgctc ctacctactg cgaatattag tgatgtggac ccagcccatg 1200 cccagcacta ccaggatgtt ttataccatg ctaaatcaca gaaactcgga gtaagtttta 1260 gtatatctgt ttcttttaaa ttttacaggc tgg 1293 <210> 11 <211> 1060 <212> DNA <213> Homo sapiens <400> 11 gttctgagtg ttctattttg gggaacagag gcgtccttgg tagcatttgg aagaggatag 60 ccagctgggg tgtgtgtaca tcacagcctg acagtaacag catccgaacc agaggtgact 120 ggctaagggc agacccaggg caacaggtta accgttctag ggccgggcac agggaggaga 180 acattccaac actctgtgtg cccagtgccg acgcacgttc tctcttttat cctcaaaaca 240 gtcctatgag gatataagcc agagagagac agagacaagg aattacaagt tggtgagagt 300 caggatttga acttggctct ggcagatgga aaattagggt ctgtattctt tacaaaaccg 360 tgtgtgcctc agatggagtt ggtgcataac aagcagaggt atccagggtc gcggtcctgc 420 ttgccacgga aggggccgcc ttgtcagttg tgaccaccca gccctggaaa tgtcagtaat 480 gctgtaagga gtggggatcg gatcagatgc catccagatg ctgaagtttg accttgtgtc 540 atttttcact ttcttttttg gctcttctgc aatcaattca tttatttagc aaaaaagaaa 600 ttatgtgtgc cgagagcatg cagaagatat gtctccgttc tctgcttccc tccaaaaaag 660 aatcccaaaa ctgctttctg tgaacgtgtg ccagggtccc agcaggactc agggagagca 720 ggaagcccag cccagacccc ttgcacaacc taccgtgggg aggccttagg ctctggctac 780 tacagagctg gttccagtct gcactgccac agcctggcca gggacttgga cacatctgct 840 ggccacttcc tgtctcagtt tccttatctg caaaataagg gaaaagcccc cacaaaggtg 900 cacgtgtagc aggagctctt ttccctccct attttaggaa ggcagttggt gggaagtcca 960 gcttgggtcc ctgagagctg tgagaaggag atgcggctgc tgctggccct gttggggrtc 1020 ctgctgagtg tgcctgggcc tccagtcttg tccctggagg 1060 <210> 12 <211> 401 <212> DNA <213> Homo sapiens <400> 12 agtaatgggg ggcggaggaa gagtggagga acactagggg ggatgaagga ggggccttct 60 gtccctgaca acccctgggg aagtaggggg aagtaggacg gtgtgcctgg agggcaggtg 120 ataggagggg agaaggaatc tcggaacccc ctgggcagtc ccagccctgc tgtttgttga 180 tctggtctct cctttctagg ratgagaatt gcaaggtggc tgccgtgtgc cccaggaggg 240 gcaggacctg gaaacaggta ttgggtggtt acagagttct gtattcctcc tcccaggaga 300 ggatgcttaa tttgccaggt tattacagat gcttctcaga gaacctgcaa cttgtcataa 360 tttgaaacca ctcaccttgg ctaaaggaac ccaggggctt c 401 <210> 13 <211> 2001 <212> DNA <213> Homo sapiens <400> 13 tcttgttgtg ctcaagtctt gtctctgtca tccatggtct cctacgaagt cattgcccta 60 agttcatgct gggggagcca gaagggaagt ccttggatat cttatacctc aatattggct 120 caatttcttg gggagggggt gctgtcagag attgttatct gaggatgtga catagatttc 180 tcagggcaca atttcaacta ctttttcagc tttagggttt ttagatacgt ttgtaccaca 240 attgagcatg ggagggagag gggtgagcct aagcagtgat ggctgatttc tgtcatgtct 300 gtcatgtgtc ccccagtacc tccagaggta actgtgctca caaacagccc tgtggaactg 360 agagagccca acgtcctcat ctgtttcata gacaagttca ccccaccagt ggtcaatgtc 420 acgtggcttc gaaatggaaa acctgtcacc acaggagtgt cagagacagt cttcctgccc 480 agggaagacc accttttccg caagttccac tatctcccct tcctgccctc aactgaggac 540 gtttacgact gcagggtgga gcactggggc ttggatgagc ctcttctcaa gcactggggt 600 atggaccaac actcaatctc ctttatttca aggtttcctc ctatgatgct tgtgtgaaac 660 tcggtgttct aactgtttca taatatctgc tacaattaat ataactgtct tctcctacta 720 tccagcttcc tccttttttt aatctgtaat tctctcaata catcattctg tcttcctctt 780 ctttaatcta tgaataactt ttctctttat taagaaccct acatttgatt ctgagtgtta 840 cttcttccca cactcattac catgtactct gccttatttc cccccagagt ttgatgctcc 900 aagccctctc ccagagacta cagagaacgt ggtgtgtgcc ctgggcctga ctgtgggtct 960 ggtgggcatc attattggga ccatcttcat catcaaggga ktgcgcaaaa gcaatgcagc 1020 agaacgcagg gggcctctgt aaggcacatg gaggtgagtt aggtgtggtc agaggaagac 1080 gtatatggag atatctgagg gaggaaaaca gggtggggaa aggaaatgta atgcatttaa 1140 gagacaaggt aggaacagat gtggctcttg atttctcttt gctagaacga atcagacatt 1200 cgtatcatct ggtatcccaa agcttcaggg tctgtcatcc ctttctatag acgggcacct 1260 tgatcacggc tccagtctta gaaatcatct ccagtaccta aaaccattgt ttcacattag 1320 aatactgagt ctagggatct agaaaataca ttagaatatg gagtctaggg atctagaaaa 1380 tactgagtct agggatctag aaaaataagc ctcaagattt gggcacatcc tagcttgtat 1440 ttcctggggc aggtcatcag ttcagaagca tttccagatc ctggctcctt tcaggttagg 1500 gtcaattcat tgcatgaaat gggaatctct tagaggccaa tgcctgcttt tgcttcttta 1560 gtctcaaatg tagtatgaga aactctaaaa aaaggtaaag catggttgct tattatgttc 1620 agttggagag tagggtatac agttagttca tgttggaaag gttagatgaa cattgaaaga 1680 attttgcaaa gtcaaaggat taagagagaa gaggaaggaa tctgaagcaa ggagctcaaa 1740 actgatcttc aactccttgg taactatgtg tgtcttgcta taggtgatgg tgtttcttag 1800 agagaagatc actgaagaaa cttctgcttt aatggcttta caaagctggc aatattacaa 1860 tccttgacct cagtgaaagc agtcatcttc agcattttcc agccctatag ccaccccaag 1920 tgtggatatg cctcttcgat tgctccgtac tctaacatct agctggcttc cctgtctatt 1980 gccttttcct gtatctattt t 2001 <210> 14 <211> 747 <212> DNA <213> Homo sapiens <400> 14 atgaagcttc tctgaattgg tgccacacga gttgtgaatg ggtgatgaat tggaggcatt 60 aatgttaagg agaattctgg ggaagatagt gtggctattg cttaatgaaa gaaggaagag 120 tagagggaag taaagatgaa gacacatgca gcgccagatc atatagagtc ttgttgacca 180 ggagggaatt tagactaagc aagaaagaaa gtctgaactg ggggagtgct tcgactgaat 240 ccaggaggga gaagaaaatc tcttcagtaa ttccatcccc agtcattctt caagctcacc 300 tcattccctg cagcctgttg ctgctgaagc ctcagaagaa gtttrttccc aaagttggga 360 atacctcctc ctacacattt cctagtaatg ctgatgtctg accctgttca tcctgcctca 420 tgtctcacaa gctggtctcc attcgctgct gagtaggaca gtgcagtgga aagaacatgg 480 gctctagcat cacacagacc ttccaaccac ctcctagctg tttgaccttg ggcaagttag 540 gtaatttgaa tcctagcttc cttctttata aaatgaaatc attatactta acttgcagat 600 tggtcataag gaagaaatga aataattcat gtatggcaga atgatgcttt tctttttcct 660 ttgtttgtat ttaggtttgt cacacaagac aatagcacat gtcaccgcca aggcctttcc 720 tcttctggct ttggatttcc caaaatc 747 <210> 15 <211> 2001 <212> DNA <213> Homo sapiens <400> 15 gccaccttat ccaagccttg ggcctctctt tatcatgcat gggactctag ggaaaagtag 60 aaaaaggaga taacatcatg gcaggatacc agtggcctct tcatacagag taaacccagg 120 caggagtgga gtccccatga gcctgtcctg tcccagcacc cttcagaact gttgttagct 180 gagggggtcg tgaggaacag gataatctgc atccctttca gggtaaacct gaactcagga 240 ggcaaatttc atgaagtcca tgtgaaatgg ctcattcaca aagtaacaca caatgggcca 300 aatggagcaa gacacacctg tgtgggcccc aggatggctg ccaatcccca gcaccacatg 360 cctcacccct ggaagaactg gccaaggcct ggaaaggaca cagtgcaaac accaccaaag 420 catttagtgc tgccagccag agctttgggt agagcaagaa tgtgtgtgtg tgttgaatgg 480 gaagggaagc tagtaggtgt ccaacaaacc ctgccatgaa tactggggcc aaaaaagagg 540 ggacccgtag gacagatgtt gatcccactc aaagtcagca cagcgggatg cacttaaagg 600 gcactgagca cgcaggggct gtcacaaacc catgaggatc tgcagggtgt ctcccacaag 660 tcatttctct cagaaggatc attacctaaa atagcagaaa acatacgatc gaggttgctc 720 aatttcaata tgctgggatc ctatctctga gtgcccacct cccccaaaac ctcactctct 780 caccccacct ctgcttcttt tctccctgcc catttctttt ctgacttctt tccccacaac 840 agaatctctg attctccacc cacgtcctgt tcagagtcat ccactttcct cccccacccc 900 ccagactccc ggggcctctg cacctgggga cactggacac atatgtgccc atgatgatga 960 ggacggtgcc cacgaggaag cccaccaggc cgatggccag rcccagggca cagaccaggg 1020 tctccatggc atctggtggt ggaataggca cctggagctc taggagagaa aggaaggagt 1080 tggtggtata tgaaaggatt ctagagtaaa ggaaacctgg ggccaggagg gtgcatgggg 1140 agggggctcc gtaccccaat gcctgaggag tggcgcatcc aggccccagt gctccacctg 1200 gcagtcatag acgtcctcgg ctgagggcac gaagggcagg tagtggaact tgcggaacaa 1260 atggtcaggc tgggaataga agctggtctg ggccactccc tcagtgacag tttggccgtt 1320 gcgcagccag gtgatattga tcacaggggg gaagatgttg tccacgatgc agatgaggat 1380 gttgggctgg cccagctcca cccgagactt ggggagcacg gtcacccgtg gaggcactag 1440 gaggaacagg ccctgagtcc acaggctcat ccctcacccc agggccttac taggactggg 1500 attaagggac gttccccctt tgtagccatc tgtgggcagg ggatgctctg gggtatccac 1560 tggggcagga gaggaggaaa acagagggag aggagactgg ggagggagtg gggacgccag 1620 gagctcctat atttgactgg tccctgggcg ggagtccggg tgagaggtgt cattcctcaa 1680 ggagaggggt gccaaagggg tctgggaaga cctggagcct cctgggaaag aaaggaacag 1740 ggcatgacag gcgcgggcgc tgagagcgcg ccccagagtg atgggagcct aggaactggg 1800 aggaagtttc tctggacctt cccgcctgac tgggtgggca gagggagggc cggtaccgtt 1860 gatggctctg ctgcggttgg agcgctccac caggatgtcc agatgggctt tgattgcggc 1920 gatgccggcc agcccgccct gcgggtcaaa gcgggcaaag tcaccaaact caggcagacg 1980 ccacacggcc tcgcttttct t 2001 <210> 16 <211> 501 <212> DNA <213> Homo sapiens <400> 16 gctcatctcc acccttagaa tcaccctggt gcaggctgca ccagacctgc ctcccagtgc 60 tggagagggg acaccagggg tagaagtccc catagacagc ccggtcagtg cagccatctc 120 ttcctcctgc gccatcacat cttttgaggg aagcatcttc acctgctcat tacctgcagg 180 ctgagatccc aggctctata cacagttgat ggactgggtt aggggcaagc gtgagtaggg 240 cccaagctcc rggcacatgt gactctgtgt cagttcccat tcctctggga agcatttttc 300 caactcccca tagctggacc ctgccttgcg tcttcaatgc aactctccct gggtgggcgc 360 tttcctgccc ccactctcca tcgccctctc caaggggggt tctatgcaca gctccctccc 420 ccaaacccct ctgcccaccc tctagctccc gctagccatt ttattattca tctaatagct 480 actccaaaac acctcaaatt a 501 <210> 17 <211> 801 <212> DNA <213> Homo sapiens <400> 17 tgtaacagaa tacatagaga ataatgagga gtttatgatg gaaccttaaa tatataatgt 60 tgccagcgat tttagttcaa tatttgttac tgttatctat ctgctgtata tggaattctt 120 ttaattcaaa cgctgaaaag aatcagcatt tagtcttgcc aggcacaccc aataatcagt 180 catgtgtaat atgcacaagt ttgtttttgt ttttgttttt tttgttggtt ggtttgtttt 240 tttgctttaa gttgcatgat ctttctgcag gaaatagtca ctcatcccac tccacataag 300 gggtttagta agagaagtct gtctgtctga tgatggatag ggggcaaatc tttttcccct 360 ttctgttaat agtcatcaca tttctatgcc aaacaggaac ratccataac tttagtctta 420 atgtacacat tgcattttga taaaattaat tttgttgttt cctttgaggt tgatcgttgt 480 gttgttgttt tgctgcactt tttacttttt tgcgtgtgga gctgtattcc cgagaccaac 540 gaagcgttgg gatacttcat taaatgtagc gactgtcaac agcgtgcagg ttttctgttt 600 ctgtgttgtg gggtcaaccg tacaatggtg tgggagtgac gatgatgtga gtatttagaa 660 tgtaccatat tttttgtaaa ttatttatgt ttttctaaac aaatttatcg tataggttga 720 tgaaacgtca tgtgttttgc caaagactgt aaatatttat ttatgtgttc acatggtcaa 780 aatttcacca ctgaaaccct g 801 <210> 18 <211> 601 <212> DNA <213> Homo sapiens <400> 18 cctgggcgga gtatccacag cttgtgtccg gagccaacct tccatcactg gtgaatgctt 60 cacagagttc atgttgtctt tctttcctct tcttcttttt aaacaccatt tttagagtct 120 tagtgcatga aggaaaacag gtcatggcag actcaggacc tatctatgac caaacctacg 180 ctggcgggcg gctgggtcta tttgtcttct ctcaagaaat ggtctatttc tcagacctca 240 agtacgaatg cagaggtgag tgtgagcact tcaggttagt ggggaaggca tgtggcaccc 300 rgaaagggaa aggggtacac agtgcgctct tccacttaca aaagctgtac atcagggcaa 360 taaacctaat gcacaatacc acacatgagc cccctgcaca ttccaagcag ccatttcatt 420 tggtcagctt ggattgtttt gtttttaatt gcatttggat gcctctggta gaacactttc 480 caattattta caggtcccac ccatcctcat tgtgccatag cctagagttt cctagatttc 540 ctccacttcc aggctcctga cgccatttta gattggtacc cttgcattag caatggcagg 600 c 601 <210> 19 <211> 601 <212> DNA <213> Homo sapiens <400> 19 agcctcctga gtagctggga ttacaggtac ccaccaccat gcccggctaa tttttgcatt 60 tttagtagag acaaggtttc accatattgg tcagaccagt ctcgaactct tgacctcagg 120 tgatctgccc acctcggcct ctcaaagtgc tgggattaca ggcatgagtc actactcccg 180 gccacctgaa aggtgtttct aacgaaacct tatggcgttg catccaggtt ttgacgagtt 240 tgggtctgtg gacttcagtg gcacattcta cgtaaacact gaccgggacg acgactatgc 300 yggcttcgtc tttggttacc agtcaagcag ccgcttctat gtggtgatgt ggaagcaggt 360 gacgcagacc tactgggagg accagcccac gcgggcctat ggctactccg gcgtgtccct 420 caaggtggtg aactccacca cggggacggg cgagcacctg aggaacgcgc tgtggcacac 480 ggggaacacg ccggggcagg tgaggttggc tggccaggcc cacacatctc ctctgcacca 540 ctgtggctgc agagtgtttc agcaggacta gcccagtcca gagaccccac ttttgtcttt 600 a 601 <210> 20 <211> 501 <212> DNA <213> Homo sapiens <400> 20 gataatatat cctatgagct gcatcaagat aactgtcctc ccttgtctct ttcaaacgca 60 aacctggaaa tgaagccaat tcctatgttg gcatcaggaa aatggggaga gaattctgtg 120 aagacacaac ctcgctccag ctgatgaggt gaaggcgtct attcatttgg cctgtggcca 180 ctgagataca agaatagtgc tgagatttga gttgaataaa tgttttcaat tgaaatcatt 240 gaaaactcca ygttatcctg tctttgtgga gctcatggaa aactcccaag gccccatgga 300 ggggactgat gaggacctga gacccgcctg cccatggctg ccctcatctg gtcagggttg 360 tgcaccaggg gcagttgtca cagtgatccc ccacaccgcc accgtccccg cgtccctctg 420 atgaggacct ccaaccccgc ctgtctgcgg ctgccctcac ctggtcaggg ttgtgcacca 480 gggggcagtt gtcacagtga t 501 <210> 21 <211> 601 <212> DNA <213> Homo sapiens <400> 21 actgaggtaa aatccgcttt tgcaaaggac tgtctttata gtgacctttg ggaactgcct 60 atgcacaaac gcttgcagga caatggcagt tacgttgaca agggctgttt ggtataaagt 120 ggcctcctgt ggacctcgct gctcctggaa cccagggcag gtgcgcggca caagctgctc 180 ctaccttctc atcggtcaca ccgtcattgt catcgtcatc atcacaggca tcgccaatcc 240 cgtccttgtc aaagtcttcc tgcccagaat ttggcagatg ggggcagtta tcctgcaatt 300 rgagaaggaa gaatacttgg aaaaacattt aggaaagcgc ctctttcagg aatgcaggtg 360 aaatgaaacc agcatggcgc cgaggaagga agcgtgcccc atcagtcctc gatctcaagc 420 gggagctcag agaggagcca ggaggccaga agcgcatgaa tttcttctca actcctgccc 480 tggaaattat gtcaccacta ctcacgaggg agacaaaaga aacttgtgtt cggagcccag 540 aatcccagtg aggttgccaa acaactgcag aaacaagaag agaatgacaa ggtattgctc 600 t 601 <210> 22 <211> 601 <212> DNA <213> Homo sapiens <400> 22 aaacaactgc agaaacaaga agagaatgac aaggtattgc tctgagagaa catgggagag 60 aaaagaggct ttagtggtgc tttccttccc acagacttca gctcataaat caaaatgcca 120 aagcttataa acgtttaacc ctcaccagtc aaaaaatctc aaagaaaaga gtctttcaaa 180 tatgtatgtg atatgttcag aaaccagggc tgagaacttg aaattatctg ctaactcagc 240 ttttcacggt gagtgggtaa tgcatgtaat tgggcaaaaa actcctttca tttcagtcca 300 rttataccat aatttaaagc tgagctgtgc agacactaac aacttgaggg aattggaaat 360 ccaggaaaag gtagagagta gcgtgtccgt ttcagtttca gtacaagagc cccaagggtg 420 tgtcatcccc gtggctcaaa gcacatcact cgttgaatca gaagacagaa cctggggtct 480 tgtgctggag cccagagttt cagcacttac agattaagaa aggagagcat gttgcttgtc 540 atggaggaca cggatcccag acgggggccc ctggcaatgg ggaggatcca gcacctgtgt 600 g 601 <210> 23 <211> 833 <212> DNA <213> Homo sapiens <400> 23 cctgctgtgc aagaaggaaa ttcacactac accaatgtcc cctgaggcac taagccactc 60 acatccgttt ttctttttca gaacaataaa tctgagaaac cctgtgctcc agcaggtgct 120 ggagggaagg aatgaggtcc tgtgcgtttt gacacagaag atcacracga tgcagaagtg 180 tgtgatctct gagcacatgc aggtcgagga gaagtgtggt ggcatcgtgg gcatccagac 240 caagacggtg caggtgcgtg cagtcgggct gccatgcgtt catggctgag cagtcctgga 300 gaccttggtt gggggcctct ttgggtggtg tcttcattag tcaggactct ttctgtagca 360 agttaacaga aacccaactc acgttgtctt aaatgagcaa gagtgtattg gctcatgtag 420 ctgaaacact tggggaaact ggctccagtc actgctggat ccaggccttg aggatatagg 480 gagctgcttt ccctgggttg atgccctctc agatgggctc tccctatggt gctggcagct 540 ccaggcttgc tcctgatttg ctgcaactcc aggggtgagg ggggcctagg actgggtctc 600 accagaccgt cccgggatat gttcattcct gaagcagggc agggttgcag gcactcgaat 660 gggccaggcg ggctggcgtc aggtgctctt tctgtgggcg cagatgtttc cccaagaaaa 720 tgaacacaca gcactgatca ggccaaatgc agaaaatccc tagagccttt caagctgggc 780 tttacaatga ttacttggaa tagctgctgc aaatcctttg tgggttgagt tag 833 <210> 24 <211> 833 <212> DNA <213> Homo sapiens <400> 24 cctgctgtgc aagaaggaaa ttcacactac accaatgtcc cctgaggcac taagccactc 60 acatccgttt ttctttttca gaacaataaa tctgagaaac mctgtgctcc agcaggtgct 120 ggagggaagg aatgaggtcc tgtgcgtttt gacacagaag atcacgacga tgcagaagtg 180 tgtgatctct gagcacatgc aggtcgagga gaagtgtggt ggcatcgtgg gcatccagac 240 caagacggtg caggtgcgtg cagtcgggct gccatgcgtt catggctgag cagtcctgga 300 gaccttggtt gggggcctct ttgggtggtg tcttcattag tcaggactct ttctgtagca 360 agttaacaga aacccaactc acgttgtctt aaatgagcaa gagtgtattg gctcatgtag 420 ctgaaacact tggggaaact ggctccagtc actgctggat ccaggccttg aggatatagg 480 gagctgcttt ccctgggttg atgccctctc agatgggctc tccctatggt gctggcagct 540 ccaggcttgc tcctgatttg ctgcaactcc aggggtgagg ggggcctagg actgggtctc 600 accagaccgt cccgggatat gttcattcct gaagcagggc agggttgcag gcactcgaat 660 gggccaggcg ggctggcgtc aggtgctctt tctgtgggcg cagatgtttc cccaagaaaa 720 tgaacacaca gcactgatca ggccaaatgc agaaaatccc tagagccttt caagctgggc 780 tttacaatga ttacttggaa tagctgctgc aaatcctttg tgggttgagt tag 833 <210> 25 <211> 601 <212> DNA <213> Homo sapiens <400> 25 aaccctcccg ccaagactac tcatcagaga ggaagctaat tcgcagggat atgaagagct 60 ttctaaaatt agttccagac gttttttgtc ttaaatctgt attatggctg ccctattttt 120 aatttacttt ttgaaaaacc acattgggac actaagtaat ctgggggccc aaaagacttt 180 gggggccacg ggaacctccc aggccctagg gggcagtagc accaccatgc agatcaccag 240 ttccacaaaa gaacggaact ccaccagtac ctagtgttat tagtataagt tttgggaaag 300 rtccttcttt tcttagctca ttagaaaaac aatcacagca tcctatggac tgggccaaag 360 tgggcagaca ttgcagtcca gcccaaagag gcctgtccca gcaggacgcc aggggacagg 420 gtcagaaaga ggcagcccac aagctgagca cagctgcaga ggttttagct tgcttactac 480 atttgaatta attgccaaac ttttaaactc aggggaattc cgcaaaaatc taggtaccag 540 cttttcttga agacaacgga agctcaggcc accctgaggc catcacctct ggcagtcacc 600 t 601 <210> 26 <211> 560 <212> DNA <213> Homo sapiens <400> 26 caaagtcccc agtccaccac tggctctgcc ttttaagctg gacataggaa agctttctgg 60 atgtcagtga ggccagacca cagcacacat gagcaggaac agcatgaagc tgctttcccc 120 agagcccctg ggccagaggg cattgctcag ttactggcca gaactggaga gtgagatggg 180 gaggatcagc agggacggct ggagtcactg aaagataggt ttggggcttt gctctctgct 240 ttcttcttgt gtgaccttgg acaacggact tggccttctt gaacctccaa gtcttcattt 300 gtacagtgag atgctactat ctctttgaca aatattttaa ttttctctat tttaggtacy 360 gctgcagcta gaatccaaat ctgccctagg tccaaaaaat ggtgtctata tcaagatcgt 420 atcccgctga cacagaaggc tgccgggtgg ggggagggca cccccaaatt caaagaaaac 480 cctaagtgtg gatgttcaga attttggaaa aatgtcactg aagtgattga aagagtgcct 540 ggcatgcaag gataagaggt 560 <210> 27 <211> 601 <212> DNA <213> Homo sapiens <400> 27 tcagccggta tttctgatgg tctctgctgg cagatgtctt tgccccaatc tttgccattg 60 gttggagtca cctcacggtc agtatgtggt agtcacgacc cctcaagcag tggcgggaat 120 ccacattcat catggcttgg tgggttgtcc ccatgttctg attggggagg gtgagtcctc 180 tgggtgcctt gcagtctctt ggtttgcagt tgcatcagct ctattctggt tagctatatc 240 catctgcata agccattaca tcattatata tgcgttgttg gctagttata cccttccata 300 kccatgcatt tcaggacttt ctgagccttt tctaaacatg gttatgactt tgggggaggg 360 caggtgttta tctcttttga gtgatatagt catatgcaaa tccgtaatgg ggtatacatg 420 tatgatttta attacttttg gaaaaaaagc ttacaaggaa aggaacattt ttggtgggaa 480 aacttgttta tcttcttttt ttaaagcaca gcaaatagat ttttatttat tcctgttaaa 540 atgtatgatg taatatcctt taatcatatg aagctagtta gtgacaaagc taggattcat 600 a 601 <210> 28 <211> 501 <212> DNA <213> Homo sapiens <400> 28 tgagctactg ggcctggccg gcaatttagt ttttaaaaat ctctgtccag cagagacttc 60 cttattttat tgttattatt attttttgta ctgatagaca caacagatgc attttgaata 120 agtctgaaca tcactacttt ctatgagtaa ctgaccactg ggagaccaga tgagatattt 180 cttatgcctc ccattccagg aggccattat ttaggggaca catggtgtgt tatggatatc 240 taacatatca ygttggatca ctcccacttc tcaatggttc tctgctgtac atcaagaagt 300 taagattaaa aaacagaagc atgaagaaga atgttgtgga ctgattttag ttgcactgat 360 gtgagttttc ctatccagct acactttttg tccggtgtct actgcgaaca atggcacatt 420 tcatcacaca gctggataat gttaagaatt ataatctgta accccagatt agatgaaaat 480 aaatgccact cgtaatacta a 501 <210> 29 <211> 501 <212> DNA <213> Homo sapiens <400> 29 gaccactggg agaccagatg agatatttct tatgcctccc attccaggag gccattattt 60 aggggacaca tggtgtgtta tggatatcta acatatcatg ttggatcact cccacttctc 120 aatggttctc tgctgtacat caagaagtta agattaaaaa acagaagcat gaagaagaat 180 gttgtggact gattttagtt gcactgatgt gagttttcct atccagctac actttttgtc 240 cggtgtctac ygcgaacaat ggcacatttc atcacacagc tggataatgt taagaattat 300 aatctgtaac cccagattag atgaaaataa atgccactcg taatactaat tatgttcctg 360 accttattgt atttacttac taaccccaaa taaactcatc tatttactta aatgggaata 420 ttaacatcac ataaatcata tttaacaatg aacacatgaa acaatttatc acctagttta 480 atattctgct tgactcatct a 501 <210> 30 <211> 501 <212> DNA <213> Homo sapiens <400> 30 aacatatcat gttggatcac tcccacttct caatggttct ctgctgtaca tcaagaagtt 60 aagattaaaa aacagaagca tgaagaagaa tgttgtggac tgattttagt tgcactgatg 120 tgagttttcc tatccagcta cactttttgt ccggtgtcta ctgcgaacaa tggcacattt 180 catcacacag ctggataatg ttaagaatta taatctgtaa ccccagatta gatgaaaata 240 aatgccactc rtaatactaa ttatgttcct gaccttattg tatttactta ctaaccccaa 300 ataaactcat ctatttactt aaatgggaat attaacatca cataaatcat atttaacaat 360 gaacacatga aacaatttat cacctagttt aatattctgc ttgactcatc taaatcttgt 420 ttctcttaca gaatttccat ctgtgtccac tctattggaa taatgctgtt tctgtatact 480 tctacaatgc ctgctttact a 501 <210> 31 <211> 601 <212> DNA <213> Homo sapiens <400> 31 cctatccagc tacacttttt gtccggtgtc tactgcgaac aatggcacat ttcatcacac 60 agctggataa tgttaagaat tataatctgt aaccccagat tagatgaaaa taaatgccac 120 tcgtaatact aattatgttc ctgaccttat tgtatttact tactaacccc aaataaactc 180 atctatttac ttaaatggga atattaacat cacataaatc atatttaaca atgaacacat 240 gaaacaattt atcacctagt ttaatattct gcttgactca tctaaatctt gtttctctta 300 yagaatttcc atctatgtcc actctattgg aataatgctg tttctgtata cttctacaat 360 gcctgcttta ctattgaaca gtaggtatga aatggagtcc tcaatgggat tatctcattc 420 atggatacat cttttgatag tagaaaaaca aaagtataca taggttgagg gaaagacaga 480 aaaagcagtt gctgaaaaat agcacctgat ggtgagtaat ggacagcaat caggtttttg 540 ttatgttttc ttgttgtttt tgtgacaggg tctccgccct ctgtcaccca ggctggagtg 600 c 601 <210> 32 <211> 701 <212> DNA <213> Homo sapiens <400> 32 tctgcttgac tcatctaaat cttgtttctc ttacagaatt tccatctgtg tccactctat 60 tggaataatg ctgtttctgt atacttctac aatgcctgct ttactattga acagtaggta 120 tgaaatggag tcctcaatgg gattatctca ttcatggata catcttttga tagtagaaaa 180 acaaaagtat acataggttg rgggaaagac agaaaaagca gttgctgaaa aatagcacct 240 gatggtgagt aatggacagc aatcaggttt ttgttatgtt ttcttgttgt ttttgtgaca 300 gggtctccgc cctctgtcac ccaggctgga gtgcagtggt gtgacctcgg ctcacagcaa 360 cctctgcctc ccgggctcaa gtgatcctcc caacagactc ctgagtagct gggactacac 420 gcatgtgcca ctttgcccag ctaatctgtg tattttttgt agagatggag cttcgccgtg 480 atggccaggc tgatcttgaa ctcccgggct caagtggccc gccggcctcg gcctcccaaa 540 gtgctaggat tacaggcatg agccaccgtg accagcctag caatcaggtt tttatctgtt 600 tcgggtgtgg gagaggctaa gctaagccag aattcaaaag aaaggaatat attcataatt 660 agttacactt agggtgaaaa ataaaatttc agagagagca g 701 <210> 33 <211> 601 <212> DNA <213> Homo sapiens <400> 33 ggtgtgggag aggctaagct aagccagaat tcaaaagaaa ggaatatatt cataattagt 60 tacacttagg gtgaaaaata aaatttcaga gagagcagaa taggaaaaaa aatcataatt 120 ttaataattc acacttattg cactattgtt atttcaattt tacagagaaa aacaggcata 180 gagtgatgaa atagcaagta catatataaa caacaaatgg cagagctgga acgccaatcc 240 tagcaatctg aaccagcatt taaaaacgag gaattaaagc acatagggtg agaaaaatta 300 yagggaaggg acaagagctc aaggggacac ctggaaaaga aatactataa tttaaaaaat 360 taccaacggg ctaattatac ttgtgtttta ctggcagagc gctttgtctg ttagctcttt 420 taatcccgct aataattctg tgaggttggt actataagtt ccatttttca gacaaggacg 480 cttaatagat gataacggac acattccctg cttgctctga tctgcatctc aacagctggc 540 tctggagcaa gcaaagggag actaagggaa acagagcgga gcagggcaaa gaaagccgac 600 t 601 <210> 34 <211> 705 <212> DNA <213> Homo sapiens <400> 34 gccatatcgt ttcctcaaac taagtgaatc caagcttgaa ttgttccagt tcaccaccca 60 tttcatatag tttggccact agccccaaga atagatattt tggcatcatc tttagttctt 120 ccttttcaaa attccctttg accaagcgtt caccatgtgt ttctcatacc catacattca 180 tccccaattt ccctaataca rtgctgtgtg ttgtgatttg tcttagattc ggttatcagc 240 agtatttatt tatttatttt tattttattt tattttattg aggagtctcg cactattgcc 300 caggctggaa tgcagtggcg tggtctcggc tcactgccat ctctgcctcc caggttcaag 360 cgattctcgt gcctcagcct cccaagtagg tgggattaca ggtgcctgcc atcacgcctg 420 gctaattttt gtggttttat tagagacggg gtatcaccat gttggccagg ctggtcttga 480 actcctgacc tcaggtgacc cgcccacctc ggcctcacac agtgctgtga gggtgtgagc 540 cactgcgccc ggcttatcag cagtatttaa ataacccttg tttctatgaa tgtagtgatg 600 agtttctata tacagttttg tgttgtgaag cctgagggga aagtgagttc ctggataagt 660 cttgtcatat ttattttgtg tttctagcac tgttgcacag cacat 705 <210> 35 <211> 806 <212> DNA <213> Homo sapiens <400> 35 ttccctttga ccaagcgttc accatgtgtt tctcataccc atacattcat ccccaatttc 60 cctaatacaa tgctgtgtgt tgtgatttgt cttagattcg gttatcagca gtatttattt 120 atttattttt attttatttt attttattga ggagtctcgc actattgccc aggctggaat 180 gcagtggcgt ggtctcggct cactgccatc tctgcctccc aggttcaagc gattctcgtg 240 cctcagcctc ccaagtaggt gggattacag gtgcctgcca tcacgcctgg ctaatttttg 300 tggttttatt agagacgggg tatcaccatg ttggccaggc tggtcttgaa ctcctgacct 360 caggtgaccc gcccacctcg gcctcacaca gtgctgtgag ggtgtgagcc actgcgcccg 420 gcttatcagc agtatttaaa taacccttgt ttctatgaat gtagtgatga gtttctatat 480 acagttttgt gttgtgaagc mtgaggggaa agtgagttcc tggataagtc ttgtcatatt 540 tattttgtgt ttctagcact gttgcacagc acatattaag tgtttagaaa atgttttttg 600 aatgaacagc ccaaagacta aaggaagagc ttccttagtt ctaaattctg tgaaatgagt 660 cagtttgaat gaagctaatg tttcttttgt tttacaccat ttactataga gtgcagtact 720 tcagttgttc atacttgttg atattgttac agatattcta atggttcttt ttacaaaatc 780 aaatatatat acaagacatt ttttcg 806 <210> 36 <211> 601 <212> DNA <213> Homo sapiens <400> 36 ttgggactca tcaacggtag tgcctcataa cttacaattg aattttactt actggaacaa 60 ctttttaata cattcattga tttaatcatt gtttgggtag tattacagca acagtctcct 120 cctctcctgc cctctccacc ctccctacgt atacacacca cataccagat gtggctgata 180 aatgctttgg attaatatat ctggatttta cttaaacagt gattttttct tttgtaggaa 240 cctccatatt taagaattcc cagtagctac tttttaccca atatggcagg cattacattc 300 rtgccttagg atatattata gtgattgagt ttctgtttct aaacattttt gatggaaaga 360 atgagtataa gaggcagttc ctttttgcag ctccacgcca ggggaccagg gttacacaca 420 ttactatgtc tgttaaactt cttggaagcc atccagtatg ttatggtgag tttcatgtgc 480 aagtggtgag ccactaaaag gcttgcacag tattataact attctgagga gaggaggcat 540 ttggaaattc agcagtaaag cagcgacagc aacaagcagc cacagtggta gtaaagagag 600 g 601 <210> 37 <211> 601 <212> DNA <213> Homo sapiens <400> 37 ctctcagtgt tacttgggct tgggtactac ctgagtgatt ttgatggggt gaggatcagt 60 tgagttttcc accatctcca caggttttgg tgctttccaa atattctctg tcactatgat 120 atccacagat gtggtatcac tgaaggaatt ctcactctgg cctcccatgt ccttcactga 180 gatcaccaga ttataggaag gattcttagc aggattcaat tcctgagatc ctgtgagaag 240 taggggagaa gggaatagga agtgtctctg gaagagacag accaaactcc ctttttctga 300 ycgagacagt cctaccaggg cacccctgaa cttaccctct cgggtaagag agatggctcc 360 cgttttgttg ttgatctgaa agtacatgac attgttgatc atgggaagct ggatgacaat 420 ctggtaataa agctggccat tgggagtggc cggatcatcc aggtctgtgg cattgacata 480 caagaagggc tttcctgttt aacaagacgt acccagaaaa aaaaaaaaaa agatattaat 540 tgaaacccaa accaaccata gctacttttt ccatctaaaa agtggagata taatagggaa 600 a 601 <210> 38 <211> 601 <212> DNA <213> Homo sapiens <400> 38 cttggaccag accttgagaa accttatttt agggaagggg ggagtgggat aaatatttat 60 ttccccactg gtgtgaattg tcataccttt ccctattata tctccacttt ttagatggaa 120 aaagtagcta tggttggttt gggtttcaat taatatcttt tttttttttt ttctgggtac 180 gtcttgttaa acaggaaagc ccttcttgta tgtcaatgcc acagacctgg atgatccggc 240 cactcccaat ggccagcttt attaccagat tgtcatccag cttcccatga tcaacaatgt 300 yatgtacttt cagatcaaca acaaaacggg agccatctct cttacccgag agggtaagtt 360 caggggtgcc ctggtaggac tgtctcggtc agaaaaaggg agtttggtct gtctcttcca 420 gagacacttc ctattccctt ctcccctact tctcacagga tctcaggaat tgaatcctgc 480 taagaatcct tcctataatc tggtgatctc agtgaaggac atgggaggcc agagtgagaa 540 ttccttcagt gataccacat ctgtggatat catagtgaca gagaatattt ggaaagcacc 600 a 601 <210> 39 <211> 601 <212> DNA <213> Homo sapiens <400> 39 attgtgccac tgcactccag cctgagcgac agagtgagac tccgtctcaa aaaaaaaaaa 60 aacctggcag aaattaaaac taggaaatag attaaaatag atgacttaga acttatacca 120 gatagccaat cctttctgtt tttggtggtt cttacagact tgagatgggg cttgtcttaa 180 ggaaacaagg tgccctgaat ctaaagactg actaaaggtg gtagaactta agctatttag 240 gtgatcctga aaccaacagg gacagttatg gtattgacta gtgggaactt ttaagagcca 300 yaagactgga gaagtcaaat gctgaccaat tatatcaaac ccacctgggg aacaaactaa 360 gtcaaaatta ctagggtggc taaaagctga gaaatgaact aggtggcagt taggactgct 420 gaacaaaccc atctcaccat agcagagaaa tattagaata cacctagagc catgtaaaga 480 ttcaggagtg ctaaaagcat gaattcccct ccttccctcc accaccccca aaataagctt 540 ctgcctgaat gaaggtaact gggaatctgc aatttttttc tttcctgcac aaataaccac 600 t 601 <210> 40 <211> 681 <212> DNA <213> Homo sapiens <400> 40 gtcaaagcag taaaacatta ctgtttataa aactagttcc taaaattttt aaatataaac 60 acaattattt atttgaccag aatctaagag atggaaacat tttaggccaa aaaaaggaca 120 agccgccagg cgcagtggct cacgcctgta atcccaacac tttgggaggc caaggtgggt 180 ggatcatgag gtcaggagat cgagaccatc ctggccaaca tggtgaaacc ccatctctac 240 taaaaataca aaaattaggt gggcgtggtg gcgcgtacct gtaatcccag ctactgggga 300 ggctgaggca ggagaatcgc ttgaaccagg gagtcaagag gttgcagtga gctgagattg 360 tgccactaca ctccagcctg gcgacagagc aagactccat ctcacacaca cacacacaca 420 aaaagtacaa gcccttctac ctcagtctgt ctcactaact cttgatccat ctttaacacc 480 kacagtatcc atctctccta tgtgtacaca cacgttcatc tctttcattc cctaactaaa 540 ctgtctatac ataagacaag cacagcagtt taaccaaatt ctcttgccag aatcagaaaa 600 gtgtatattt gctttacctg tcctccacct tcttctttac acaggtcaat tgcaaatgcc 660 aaatccattg ctgagaaaac a 681 <210> 41 <211> 501 <212> DNA <213> Homo sapiens <400> 41 cactgtatag caacatttat aaggcagccc aggatgacat caagaaacag tgagtgaaca 60 agtaagcact tgtaataccc ggcataatac cttaaaatcc gttaggattt actgagttta 120 tctatagtaa atttgaagag actatggatg ttcaaagggt atcatcataa aagttcagta 180 aatctccaaa taccatgaaa gtaaattgtt acttctgaaa gagcagtaaa atataacctg 240 gcaaatgaag ytactacttg agattctaac agggctaaaa tataacttac aaagctttta 300 caataaaaat ttcaattttt aagttttttc agttgaaaaa aattcttcac tggccaaatc 360 actagcaaaa tcagtttcaa acatcactgc cttgttttag tagagcatat ggtggaccca 420 tttgcttggt ttcgtaagat tcccatctaa ctgaccctaa acttctattt tctggttaaa 480 aactcacttt cccagatctg g 501 <210> 42 <211> 1289 <212> DNA <213> Homo sapiens <400> 42 aatttagact tacactagaa tcaagactga gtagaaaaca gaatacttct gctgatcatg 60 gtgtctcatg cctataatcc cagcacttga gaggccgagt tgggaggatc acttgagctc 120 aggagtttga gaccagcttg ggcaacacag ggagatggtc gtctctacta aaaattaaaa 180 aagaaaaatt ggccaagtgg tggcgcgcgc ctgtggtccc agctacttga gaggctaatg 240 tgggagaatc acatgagcct gggaggttga ggctgtagcg agcatgatca tgtcactgca 300 ctccagcctg ggtgacagaa caagaccctg tctcaaaaac aaaaagaaaa ccgaggccag 360 gggcagtggc tcacgcctgt aatcctagca ctttgggagg ccgaggccgg tggatcactt 420 gaggtcagga gtttgagacc agcctggcaa acatagtgaa accccgtctc tactaaaaat 480 acaaaaatta gcctggcatg tagctcccag ctactgggga ggctgaggca agagaattgc 540 atgaacctgg gaggcgtagg ttgcagtgag ccaaggtagt gccactgcac tccagcctga 600 gggacaagag caaaactcca tctcaaaaaa aaaaaaaaaa aaagaaaaga aaattgaata 660 ctggaattat atcatttggt ccttgcaggt ttgtgttttt tgattaatac ttagtactgt 720 aacaggtaaa ccccacaagt ctaaatggct ttgcccagtt gaagtttctt tcatgcatgg 780 agcctgatag gtgagtagtt ttcctccaag cagtgatgta gggccttgag ctccttccat 840 gcagtaattc tctatcatca gcatatagct ttactaaggt ccctrtgctt gcctgcatca 900 agctatcaaa tgggaaaatg tggctttttg cagattaggc ctggaaatgg ctcacatttc 960 attgccttgg gtacagtcat aggccacagc taacttcaag gaaggctgga aaatgtggag 1020 catatatagg catgtaaccg ggttgaagag gaaatggttt tggtgggcac ctgtccagta 1080 tctgtcacaa gtgttttgtc cttacctgcc ctgagaaata gtctcaagaa acagtcctag 1140 aagtttgcta ttgcatttac tataaggata tttttgagga ttattttttt cagtgtaacc 1200 cattatctca tatatgatga tccatacttg ccttgattgc catggaaagc tttatttttt 1260 tcttgagttt ctagtgttat ttgtttttt 1289 <210> 43 <211> 1155 <212> DNA <213> Homo sapiens <400> 43 tgtgatagaa acctaattgc aggaaacagt gaaggcacca tggagttagt ggtcaattct 60 agcagaggac ctttagtaga acttctgtac tggaaatgag gacattaatg atgagtaggc 120 attccctaga gagtgggaga gggtgtttct gagggtagca aagatgagaa gagctgtgag 180 cagcttggtg tcaagatctg tgaaagctaa caggttagtc tgccacagtg tggcaggatg 240 ttggcagaag gtgagagctt atgagtcaga gacaaaggtc tttatcactt gttgcacatt 300 gcacatcaaa cagtgtgagt atcatagttc accttgttgt cccctctgtc ccaagtccta 360 cattggtgag gaagggccca gggtgatact gtggagggtt tgtgtcacag ctgaggaact 420 tgggggctaa gggacccaga tttttgtaac grgcagtaaa cttccctgac ctttgccatg 480 gaggaagaca tcatcttatt atactgccct ctgatgtaga aggaggcaca acctcaatct 540 tccagggttt tgttcattat acaaacatcc ttgaaaacta taaaagggca gttagtcatc 600 caggcggggt gactcacacc tgtaattcca gcactttgag aggccaaggc gggcagatca 660 cttgaggtca ggagttcgag accagcttgg ccaacttggt gaagactcca tctcttctaa 720 aaatacagaa attaagctgg catggtggtg ggtgcctact aaggaggccg aggcaggaga 780 atcacttcaa cccagaggca gaggttgcag tgagccgaga ttgcgccata gcactccagc 840 ctgggtgaca gtgtgagact ctgtttcaaa aaaaaaaaaa cgagggggac agttattgtc 900 tttgctcaca agatgtgagg agatgtgtga gaccatgagg aatagtgtcc cagcacctgg 960 caggttcagg gaactgcatg ttaacttcag gttgggcctt tccaagaatt ttaccaccac 1020 gtattatagc ttcagcactt actacctcaa gaagatgtag taccttactt gatgcacgaa 1080 agtcctattt gctgctggat tccttctttc cctttttggc aaagagggca tcatattctg 1140 ttggctaggc agaat 1155 <210> 44 <211> 601 <212> DNA <213> Homo sapiens <400> 44 attcacttct ttatcatgaa acaatatgat ttttaacaaa tggttgctgt atatatagca 60 acatatatat gttggtacca agccatatgt ccattttgtt gactttcaga aattttcaca 120 acaacaaact gtgactctaa tagtctaaca attttacctg gagtatgacc tgagcctttt 180 ctcacgcacc attttgttaa acatatattt aatttgggga atgtaaatac aaatttttct 240 tatgtaaaca ttacaccctc ccctgacccc ctaacttaca ttgagatatt tgtgtattct 300 ktgctgcctg gatagtaaaa gctgaagggt cttctaccag aagcatgtta acccaggaga 360 tagtctatga aatatactta taagtaacta ttataaaatt aaagcatatg ttttaaaagc 420 caaatgtgaa tggactttac attatttcat tactttggaa tcatctgtgg tattattctt 480 cagagtagaa aagaaaaact gacatttgtc tgcactaaga tatatgaatg gagagtcttt 540 gagtaggttc attcccaaat gtgatgaaag aaagctttat tttcttttca tttagttatc 600 t 601 <210> 45 <211> 990 <212> DNA <213> Homo sapiens <400> 45 tccagtattc agtgagtgct ctctgtgtgc taagcactgt agctaaaatg agagtagacc 60 agaagagtgg atcaggagca gacggtcaac agctctggag gccaggttat agaactgaga 120 cctgacccaa caggcagcga gaaagacact acaagatttc aaggggtaaa gtgataatga 180 gagtttaaaa aaaatgatta ctattgttaa tgcctggagg acacacaact gagagggagg 240 gagagggcag ttgctgccat atgttaacta atggcagagg ggctggcaac tgggatgctg 300 ttttagaagc agaatcacca actcttgttg tgagaggaag acaagagagt aatgatgcct 360 ccaagatatt taatgtggac agctagtcaa atgctattgt cttgaggaaa gcagcagaaa 420 tcactgaaac tgtaagactt tgaagtgaag ttacctagta cgaagctaga aacatttaaa 480 atgagggcay tggaagaaaa agaagagaga tgcaaaagga actctgaaaa actaaagaag 540 aaaaccagca gtgaggaatg ctacagaaga aaggaaagag aaaatttcca ttagagaagt 600 gcttgtcctt tggggaaaat gatgcaagtg agcttaagta agaaaagggc taatccttaa 660 atccactgga tttaattata aggcattctg cacacagcgc tggaattatg gctccatgaa 720 gcttctctca gacaaggatt atgtccctag tgtctggccc gctgcctgtt gcacagtggg 780 tgctcagatg atgtgggtga acgagtaaat tatcagaagg ctaaattaca ctgtgttgag 840 atgagaaaag gggttaacag agttctgtgt gaaccaatga atatagtttg atggcaaagg 900 aggaacaaga tgagaagcag atcatgggaa tgagagtgct aaagatactt gtttatttgt 960 atagaggaac aggagctaga agttgttatg 990 <110> College of Medicine Pochon CHA University Industry-Academic Cooperation Foundation <120> Polynucleotides derived from PRKCI, MAPK10, SPP1, IQGAP2, FGFR4, NOTCH4, HLA-DRA, HLA-DOA, THBS2, DFNA5, TBXAS1, TNKS, CDH17, UBR5, KIAA0196, and NSMCE2 genes comprising single nucleotide polymorphisms, microarrays and diagnostic kits comprising the same, and analytic methods using the same <130> PN0316 <160> 45 <170> KopatentIn 1.71 <210> 1 <211> 1001 <212> DNA <213> Homo sapiens <400> 1 tccaaattga tttctttgca tgtggatatt cagttatccc accatcattt gttgaaaaga 60 ctatttttct cccatttaat tgtttgcaca attgtcaaaa atcaaatgac caaaatttaa 120 gggtttattc ctggatttta aattctattc tattgattga tttgtctatc ctttgacaat 180 aatcacagtg tcttcattac tgtagttttt tagtaacttt taaaacaagg aagtgtgagt 240 cttacaactt tttttttttt caagagtgtc ttggctattc tggatccctt acattttcat 300 gtgaatttta ggatcaactt gtcaatttct ttaagaaagc cagctgagat tttgataacg 360 attgccttga atctgtgaat cagtttgtgt ggtttttgtc atgttaacag tattaaacct 420 tccaatccat gaacatggga taacttttca tttatttagg cagcttcaat tgctttctca 480 ttttcagagt atacattttg macttctttt actcttaaat attttattct ttataatgct 540 attgtaaaca aattgttttc ttagtttcct ttttgaatta ttcacttcta ctatagagaa 600 atgcagttga tttttgcata ttgttgatat tgtattgcag aacttgttta ttaggtgaat 660 tccttaggat ttcctgcata taagactgtc attttttttt tttttttttt tgagatggag 720 tctcgctctg ttgcccaggc tggagtgcag tggcacgatc tgggctgact gcaagctccg 780 cctcctgggc tcatgccagt ctcctgcctc agcctcctga gtagctggga ctacaggcac 840 tcgccaccat gcccagctaa ttttttgtat ttttagtaga gacagggttt cactgtgtta 900 gccaggatgg tctcaatctc ctgacctcgt gatccacccg cctcggcctc ccaaagtgct 960 gggattacag gcgtgagcca ctgtgaccag ccaagactgt c 1001 <210> 2 <211> 1001 <212> DNA <213> Homo sapiens <400> 2 tctcactgtg cattgtgttg cccagggacg gagtctcgct ggagtgcagt ggtgcagtct 60 cagaggctca cagcaacctc tgcctcctgg gtttaagcga ttctcgtgcc tcagcctccg 120 gagtagctgg gattacaggt gcccgtatat tttataatat attccataat taatcgctcc 180 gtattaaact tgctgatgca ggtctccaat tactgaaaaa cctgatacca agtttttcaa 240 tgggattttt ttgtttgttt aagagatggg gtctcaccat gttgctcagg ctggccttga 300 actcctgggc tcaagtgatc ctccagcctc agcctctcaa aggctgggac tacaggcatg 360 tagcactttg cctgactcaa ggggaatttt tagtaagagt atataatcca agtctgtagc 420 agttggagga tgtcgtacca cgctgaagtg tcacacataa ctgatgactc agtcccccaa 480 gcctatttta tcatcgtaaa raaagctatt tctttgatta ttccaactct ttcagcacca 540 ggactgctgc aggggctgga ttcatgctac ctgtagttgc agtgtaaatt atgtgtcaga 600 gccaagatcc taggaaagtc aaagactagg ttacaagagc gattcttttt tttttttttt 660 tttttttttt ttgagacaga gtcttgctct gtcgcccagg ctggagtgca gtggcacagt 720 cttggctcac tgcaacctcc gcctccctgg ttcacgccat tctcctgcct tagcctcccg 780 agtagctggg actacaggcg cccgccacca cgcccggcta attttttgtg tttttagtag 840 agacggggtt tcaccgtgtt agccaggatg gtctcgatct cctgaccttg tgatccgccc 900 gcctcggcct cccaaagtgc tgggattaca ggcttgagcc accgcgcccg gccaaggacg 960 attctttcta atctttggta agccagtttt tacagaatga a 1001 <210> 3 <211> 1001 <212> DNA <213> Homo sapiens <400> 3 agctagcctg aacaaaaaga acaaaactgg agggatcaca ttctatgcct ttaaattata 60 ctacaaactg tagtaaccaa agcaggatgg tgctggcata aaaacagaca cataaaccga 120 tggaacagaa gagagaaccc agaaataaat ccatacatct agagtgaact cattttttac 180 aaaggtgcca agaacataac actggggaaa agacagcctt caataaataa gactgtcaat 240 aaatagtgct gggaaaacaa gatatccaca tgcagaagaa tgaatgaaac tagaccccta 300 tttctcgcca tatacaaaaa tcaaatcaaa atgaattaaa gacttaaatc caaaacctca 360 aactatgcaa ctactaaaag aaaaaactgt ggaaactctc catgacactg gtccaggcaa 420 gtaattcttg agtagcaccc cgcaatcaca ggtaatcaaa gcaaaaatag gtaagtggga 480 tcacatcaac ttaggtttct rcaaaacaaa gcaatcaaaa aaatgaagag acaacccaca 540 gaatgggaga aaatatttgc acactatgca tctgacaaag gattaataac cagaatagac 600 aaggagttca aacaactcaa taggaaaaaa aaaacctaat aatctgatta aataatgggc 660 aaaagatatg aatagacatt tttccttttt tttttttgga aacggatctt gctctgtcac 720 ccaggctgga gtgcagtggc gtaatcttgg ctcactgcaa cctctgcctc ctgggttcaa 780 gtagttctac tgcctcggcc tcccaagtag ctgggattgc aggcacgtgc caccatgccc 840 agctaatttt tgtagtttta gtagagacag ggtttcacca tgttggcctg gctggtctgg 900 aactcctgac ctcaagtgat ccgcccgcct cggcctccca aagtggtggg attacaagcg 960 tgagacacca tccctagccc tgaatagaca ttttttttta t 1001 <210> 4 <211> 501 <212> DNA <213> Homo sapiens <400> 4 ggcattttta acaatcaggt tatctgcaat attaaccaga aaaaaaatgg tagcaaaata 60 gagaaaatgg gccattttta ttccagggga caagctgcac aaaggaatgt tcttctattt 120 attttaaaca aatgactgcg tgtactgaat ctgactgtgt gaaataatct cagaatggca 180 gcaccactgg catggcgatg gtgcaggtgg gtgcagttcc ctgtggtctc tattgcttga 240 agagagaaag raagttccct attattatat ttaaggcagt tttcagagca ctggcattct 300 tgtttgctct gttttgggga tattctattt gccaatcagt tcacactcat caaccatcca 360 cttcctgtct ttcacttctt caggtgaaat atcaccactc tagtgagcac caacatggtc 420 agagtgcaaa taactgtgat ggatacttcg aagtgaataa gaagggaagc ctgattattg 480 ctgagatcac cttggtattg c 501 <210> 5 <211> 501 <212> DNA <213> Homo sapiens <400> 5 tggtccattt catgtgatct attactctga cataaaccca tctgtaatat attgccagta 60 tataagctgt ttagtttgtt aattgattaa actgtatgtc ttataagaaa acatgtaaag 120 ggggaatata tggggggagt gagctctctc agacccttga agatgtagct tccaaatttg 180 aatggattaa atggcacctg tataccaatt tgtagaaaga acatatgtga tacttatgta 240 aagtgtgggg sgagtgggta acagttttca ggcacaaaat ggtttggcct tctgaaggca 300 ggtgtgaata aaagctgaga gtcttttctg tgacagtaaa gagtaggaga gactggaggg 360 tagtgtgtgg tgtggactaa gactgaaagg aggttattac ccagatacta ggagacactg 420 gtgtccctgt tttagagatt tggctctata cccatttgtc atttctcaat tactggtcac 480 aacaagaggc cactggagaa a 501 <210> 6 <211> 601 <212> DNA <213> Homo sapiens <400> 6 gatgacctaa aagctagaga gtggaaaagg attaccatat tcccatccct agccgttcat 60 ataattattc ttcatttgtg ccgtgattca gtaccctgat gctacagacg aggacatcac 120 ctcacacatg gaaagcgagg agttgaatgg tgcatacaag gccatccccg ttgcccagga 180 cctgaacgcg ccttctgatt gggacagccg tgggaaggac agttatgaaa cgagtcagct 240 ggatgaccag agtgctgaaa cccacagcca caagcagtcc agattatata agcggaaagc 300 yaatgatgag agcaatgagc attccgatgt gattgatagt caggaacttt ccaaagtcag 360 ccgtgaattc cacagccatg aatttcacag ccatgaagat atgctggttg tagaccccaa 420 aagtaaggaa gaagataaac acctgaaatt tcgtatttct catgaattag atagtgcatc 480 ttctgaggtc aattaaaagg agaaaaaata caatttctca ctttgcattt agtcaaaaga 540 aaaaatgctt tatagcaaaa tgaaagagaa catgaaatgc ttctttctca gtttattggt 600 t 601 <210> 7 <211> 601 <212> DNA <213> Homo sapiens <400> 7 tgatagtcag gaactttcca aagtcagccg tgaattccac agccatgaat ttcacagcca 60 tgaagatatg ctggttgtag accccaaaag taaggaagaa gataaacacc tgaaatttcn 120 tatttctcat gaattagata gtgcatcttc tgaggtcaat taaaaggaga aaaaatacaa 180 tttctcactt tgcatttagt caaaagaaaa aatgctttat agcaaaatga aagagaacat 240 gaaatgcttc tttctcagtt tattggttga atgtgtatct atttgagtct ggaaataact 300 ratgtgtttg ataattagtt tagtttgtgg cttcatggaa actccctgta aactaaaagc 360 ttcagggtta tgtctatgtt cattctatag aagaaatgca aactatcact gtattttaat 420 atttgttatt ctctcatgaa tagaaattta tgtagangca aacaaaatac ttttacccac 480 ttaaaaagag aatataacat tttatgtcac tataatcttt tgttttttaa gttagtgtat 540 attttgttgt gattatcttt ttgtggtgtg aataaatctt ttatcttgaa tgtaataaga 600 a 601 <210> 8 <211> 601 <212> DNA <213> Homo sapiens <400> 8 caggccaatc caaaccaaat ctatcatgaa gaaggccaca aatcctagct ttatgttatt 60 aaagcagcaa aatgacaaac aaagtaaagt caaagtctta ctgtggttag cagttcttct 120 cataaagccc aaaacaccag attttcttct aatgatttca gatgtcaagt ggtaaaaatt 180 gattttacaa tgttcttgag taaataagta ctggggaaaa aaatgaggag gaaacatgtt 240 tgtttgggta aattgctaag caacaccaca catgccagaa actccttttg gatttcctct 300 mgaggtggag accgcgaaac caacgggtac ttcactttgt gtctgtttca tcagttcaca 360 ctggaaaata aaagccaaac aggcaattct aataatacaa gagtcagagg tgtggaggcc 420 actaatatta aattagactg aactgtttaa ggttgccttt taaaatggct ccttccagcg 480 tccacggtgg cactaagact gaggtgagat ttgtgcagcg tgtccaggaa gatctagcac 540 actgatgaaa aaaaaaacaa acagcagcca ggccatgtta tctatcccac tcggtatttc 600 c 601 <210> 9 <211> 801 <212> DNA <213> Homo sapiens <400> 9 aaatcttaca aacaacagac taatggctgg gaagtcacag agagcagtgg tgtcaatagt 60 accttttaca attccaatag aacatcatcg cataatcatg ggcaagcaac tacaatctgt 120 aaattataat cttttactaa atgaccttca gtattagccc caaaagtgac agaattaaaa 180 tgtggaatgt aatgaaacag gaaggtagaa tcaaactagc ttcaggatac tttgtgtgga 240 ctgagatcac caaaaaaagc ccagtgggaa actgccaaat gctgcaccat aattaagagc 300 aacactgaca ctgagtagat ttttcattgt ataaggcagt tttccaagta tacaaattga 360 gagttgtgtt gggattctgg caaggaaagt ctgtatctta yggaaatatc aatttccttc 420 atacgatcca ttacttactg ttgactttat taatattgcc ttggatttct gcttgtgtca 480 gcagttcttc ataagcatct ctttcttctt ctgaaataca gctattctgc aaaacaaagt 540 cttcactgat ttctgaggtt aaataaaata tttccaaaac tcgaaattca aatggaaaaa 600 tgtgaagagt gacaaaaacc attccctaat tatttaaatt cattgcaagc atttgtggag 660 tgcctgctct gtgcagggga ctaggggaga atgctggcaa agactacaaa gatgaatgac 720 agacccggtg actctttgaa aagggctaat tttcaaatgt ggaaaacatt agtttggacc 780 acttggtatt gccctttttt t 801 <210> 10 <211> 1293 <212> DNA <213> Homo sapiens <400> 10 ggtattgaga agtaacctcc agatagatcc gtttcattag aattgtaaaa gcaggtttca 60 tagtttgact tctgtgcttt tcatttgtga accctatagt cagtcttgtt ctctctttct 120 ctctctctct ctctctctct ctctatatat atatatatat atatatatat atatatatat 180 attttttttt tttttttttt tttttttttt ggtgggggga acagagtctt actctgtcac 240 ccaggctaga gtgcagtggt gcaatcttgg ctcactgcaa cctctgcctc ctggattcaa 300 gtgattctcc tacctcagcc tcccaagtag ctgggacttg aatagacact tcatttcttt 360 aaagatgatg tacaaaggga aaataaacac atgaaaagat gctcaaaatc attagtcact 420 agaaaaatac aaatccaaac cacagtatga taccacttca cacccactag gatggctgta 480 atgaaaaaga caataacaag tgttggtcag aatgtagaga gctactcagg aggctgaggc 540 acaagaatca cttgaacaca ggaggcggag gtggcagtga gccaagatca tgccacagca 600 ctccagcctg ggggacagag caagaatctc tcaaaaaaaa aaaataaaaa taaaaaccag 660 cagctgggac cttgatagaa gttgcattga atctgtaggt caatttaaga agtatttcca 720 tcttaacaat actaaatctt ccaattcatg aacattggct cttttcattt atttgtcttt 780 aagttttttc ccatggtgtt gtatagtttt cagtgtacat gtcttatgtt tctttcatta 840 aattatgtct aaatatttat ttataattat aatgtatttg tagttgcagt cttatgaatt 900 gaatgtgttc ttaattttat tttcagattg cttattatga gcgtatagac acataattgg 960 ttttgtgctt gcatcttgca tcctatagcc ttgctgtact agttaattag ccttttgcaa 1020 tcggtgatac taaagaagtt ataatgtact gtgacttaaa aggaaaacta cttcttcatt 1080 ttctaggagt tgyagctgta gggtacatca atgaagctat tgatgaaggg aatcctttga 1140 ggactttaga aactttgctc ctacctactg cgaatattag tgatgtggac ccagcccatg 1200 cccagcacta ccaggatgtt ttataccatg ctaaatcaca gaaactcgga gtaagtttta 1260 gtatatctgt ttcttttaaa ttttacaggc tgg 1293 <210> 11 <211> 1060 <212> DNA <213> Homo sapiens <400> 11 gttctgagtg ttctattttg gggaacagag gcgtccttgg tagcatttgg aagaggatag 60 ccagctgggg tgtgtgtaca tcacagcctg acagtaacag catccgaacc agaggtgact 120 ggctaagggc agacccaggg caacaggtta accgttctag ggccgggcac agggaggaga 180 acattccaac actctgtgtg cccagtgccg acgcacgttc tctcttttat cctcaaaaca 240 gtcctatgag gatataagcc agagagagac agagacaagg aattacaagt tggtgagagt 300 caggatttga acttggctct ggcagatgga aaattagggt ctgtattctt tacaaaaccg 360 tgtgtgcctc agatggagtt ggtgcataac aagcagaggt atccagggtc gcggtcctgc 420 ttgccacgga aggggccgcc ttgtcagttg tgaccaccca gccctggaaa tgtcagtaat 480 gctgtaagga gtggggatcg gatcagatgc catccagatg ctgaagtttg accttgtgtc 540 atttttcact ttcttttttg gctcttctgc aatcaattca tttatttagc aaaaaagaaa 600 ttatgtgtgc cgagagcatg cagaagatat gtctccgttc tctgcttccc tccaaaaaag 660 aatcccaaaa ctgctttctg tgaacgtgtg ccagggtccc agcaggactc agggagagca 720 ggaagcccag cccagacccc ttgcacaacc taccgtgggg aggccttagg ctctggctac 780 tacagagctg gttccagtct gcactgccac agcctggcca gggacttgga cacatctgct 840 ggccacttcc tgtctcagtt tccttatctg caaaataagg gaaaagcccc cacaaaggtg 900 cacgtgtagc aggagctctt ttccctccct attttaggaa ggcagttggt gggaagtcca 960 gcttgggtcc ctgagagctg tgagaaggag atgcggctgc tgctggccct gttggggrtc 1020 ctgctgagtg tgcctgggcc tccagtcttg tccctggagg 1060 <210> 12 <211> 401 <212> DNA <213> Homo sapiens <400> 12 agtaatgggg ggcggaggaa gagtggagga acactagggg ggatgaagga ggggccttct 60 gtccctgaca acccctgggg aagtaggggg aagtaggacg gtgtgcctgg agggcaggtg 120 ataggagggg agaaggaatc tcggaacccc ctgggcagtc ccagccctgc tgtttgttga 180 tctggtctct cctttctagg ratgagaatt gcaaggtggc tgccgtgtgc cccaggaggg 240 gcaggacctg gaaacaggta ttgggtggtt acagagttct gtattcctcc tcccaggaga 300 ggatgcttaa tttgccaggt tattacagat gcttctcaga gaacctgcaa cttgtcataa 360 tttgaaacca ctcaccttgg ctaaaggaac ccaggggctt c 401 <210> 13 <211> 2001 <212> DNA <213> Homo sapiens <400> 13 tcttgttgtg ctcaagtctt gtctctgtca tccatggtct cctacgaagt cattgcccta 60 agttcatgct gggggagcca gaagggaagt ccttggatat cttatacctc aatattggct 120 caatttcttg gggagggggt gctgtcagag attgttatct gaggatgtga catagatttc 180 tcagggcaca atttcaacta ctttttcagc tttagggttt ttagatacgt ttgtaccaca 240 attgagcatg ggagggagag gggtgagcct aagcagtgat ggctgatttc tgtcatgtct 300 gtcatgtgtc ccccagtacc tccagaggta actgtgctca caaacagccc tgtggaactg 360 agagagccca acgtcctcat ctgtttcata gacaagttca ccccaccagt ggtcaatgtc 420 acgtggcttc gaaatggaaa acctgtcacc acaggagtgt cagagacagt cttcctgccc 480 agggaagacc accttttccg caagttccac tatctcccct tcctgccctc aactgaggac 540 gtttacgact gcagggtgga gcactggggc ttggatgagc ctcttctcaa gcactggggt 600 atggaccaac actcaatctc ctttatttca aggtttcctc ctatgatgct tgtgtgaaac 660 tcggtgttct aactgtttca taatatctgc tacaattaat ataactgtct tctcctacta 720 tccagcttcc tccttttttt aatctgtaat tctctcaata catcattctg tcttcctctt 780 ctttaatcta tgaataactt ttctctttat taagaaccct acatttgatt ctgagtgtta 840 cttcttccca cactcattac catgtactct gccttatttc cccccagagt ttgatgctcc 900 aagccctctc ccagagacta cagagaacgt ggtgtgtgcc ctgggcctga ctgtgggtct 960 ggtgggcatc attattggga ccatcttcat catcaaggga ktgcgcaaaa gcaatgcagc 1020 agaacgcagg gggcctctgt aaggcacatg gaggtgagtt aggtgtggtc agaggaagac 1080 gtatatggag atatctgagg gaggaaaaca gggtggggaa aggaaatgta atgcatttaa 1140 gagacaaggt aggaacagat gtggctcttg atttctcttt gctagaacga atcagacatt 1200 cgtatcatct ggtatcccaa agcttcaggg tctgtcatcc ctttctatag acgggcacct 1260 tgatcacggc tccagtctta gaaatcatct ccagtaccta aaaccattgt ttcacattag 1320 aatactgagt ctagggatct agaaaataca ttagaatatg gagtctaggg atctagaaaa 1380 tactgagtct agggatctag aaaaataagc ctcaagattt gggcacatcc tagcttgtat 1440 ttcctggggc aggtcatcag ttcagaagca tttccagatc ctggctcctt tcaggttagg 1500 gtcaattcat tgcatgaaat gggaatctct tagaggccaa tgcctgcttt tgcttcttta 1560 gtctcaaatg tagtatgaga aactctaaaa aaaggtaaag catggttgct tattatgttc 1620 agttggagag tagggtatac agttagttca tgttggaaag gttagatgaa cattgaaaga 1680 attttgcaaa gtcaaaggat taagagagaa gaggaaggaa tctgaagcaa ggagctcaaa 1740 actgatcttc aactccttgg taactatgtg tgtcttgcta taggtgatgg tgtttcttag 1800 agagaagatc actgaagaaa cttctgcttt aatggcttta caaagctggc aatattacaa 1860 tccttgacct cagtgaaagc agtcatcttc agcattttcc agccctatag ccaccccaag 1920 tgtggatatg cctcttcgat tgctccgtac tctaacatct agctggcttc cctgtctatt 1980 gccttttcct gtatctattt t 2001 <210> 14 <211> 747 <212> DNA <213> Homo sapiens <400> 14 atgaagcttc tctgaattgg tgccacacga gttgtgaatg ggtgatgaat tggaggcatt 60 aatgttaagg agaattctgg ggaagatagt gtggctattg cttaatgaaa gaaggaagag 120 tagagggaag taaagatgaa gacacatgca gcgccagatc atatagagtc ttgttgacca 180 ggagggaatt tagactaagc aagaaagaaa gtctgaactg ggggagtgct tcgactgaat 240 ccaggaggga gaagaaaatc tcttcagtaa ttccatcccc agtcattctt caagctcacc 300 tcattccctg cagcctgttg ctgctgaagc ctcagaagaa gtttrttccc aaagttggga 360 atacctcctc ctacacattt cctagtaatg ctgatgtctg accctgttca tcctgcctca 420 tgtctcacaa gctggtctcc attcgctgct gagtaggaca gtgcagtgga aagaacatgg 480 gctctagcat cacacagacc ttccaaccac ctcctagctg tttgaccttg ggcaagttag 540 gtaatttgaa tcctagcttc cttctttata aaatgaaatc attatactta acttgcagat 600 tggtcataag gaagaaatga aataattcat gtatggcaga atgatgcttt tctttttcct 660 ttgtttgtat ttaggtttgt cacacaagac aatagcacat gtcaccgcca aggcctttcc 720 tcttctggct ttggatttcc caaaatc 747 <210> 15 <211> 2001 <212> DNA <213> Homo sapiens <400> 15 gccaccttat ccaagccttg ggcctctctt tatcatgcat gggactctag ggaaaagtag 60 aaaaaggaga taacatcatg gcaggatacc agtggcctct tcatacagag taaacccagg 120 caggagtgga gtccccatga gcctgtcctg tcccagcacc cttcagaact gttgttagct 180 gagggggtcg tgaggaacag gataatctgc atccctttca gggtaaacct gaactcagga 240 ggcaaatttc atgaagtcca tgtgaaatgg ctcattcaca aagtaacaca caatgggcca 300 aatggagcaa gacacacctg tgtgggcccc aggatggctg ccaatcccca gcaccacatg 360 cctcacccct ggaagaactg gccaaggcct ggaaaggaca cagtgcaaac accaccaaag 420 catttagtgc tgccagccag agctttgggt agagcaagaa tgtgtgtgtg tgttgaatgg 480 gaagggaagc tagtaggtgt ccaacaaacc ctgccatgaa tactggggcc aaaaaagagg 540 ggacccgtag gacagatgtt gatcccactc aaagtcagca cagcgggatg cacttaaagg 600 gcactgagca cgcaggggct gtcacaaacc catgaggatc tgcagggtgt ctcccacaag 660 tcatttctct cagaaggatc attacctaaa atagcagaaa acatacgatc gaggttgctc 720 aatttcaata tgctgggatc ctatctctga gtgcccacct cccccaaaac ctcactctct 780 caccccacct ctgcttcttt tctccctgcc catttctttt ctgacttctt tccccacaac 840 agaatctctg attctccacc cacgtcctgt tcagagtcat ccactttcct cccccacccc 900 ccagactccc ggggcctctg cacctgggga cactggacac atatgtgccc atgatgatga 960 ggacggtgcc cacgaggaag cccaccaggc cgatggccag rcccagggca cagaccaggg 1020 tctccatggc atctggtggt ggaataggca cctggagctc taggagagaa aggaaggagt 1080 tggtggtata tgaaaggatt ctagagtaaa ggaaacctgg ggccaggagg gtgcatgggg 1140 agggggctcc gtaccccaat gcctgaggag tggcgcatcc aggccccagt gctccacctg 1200 gcagtcatag acgtcctcgg ctgagggcac gaagggcagg tagtggaact tgcggaacaa 1260 atggtcaggc tgggaataga agctggtctg ggccactccc tcagtgacag tttggccgtt 1320 gcgcagccag gtgatattga tcacaggggg gaagatgttg tccacgatgc agatgaggat 1380 gttgggctgg cccagctcca cccgagactt ggggagcacg gtcacccgtg gaggcactag 1440 gaggaacagg ccctgagtcc acaggctcat ccctcacccc agggccttac taggactggg 1500 attaagggac gttccccctt tgtagccatc tgtgggcagg ggatgctctg gggtatccac 1560 tggggcagga gaggaggaaa acagagggag aggagactgg ggagggagtg gggacgccag 1620 gagctcctat atttgactgg tccctgggcg ggagtccggg tgagaggtgt cattcctcaa 1680 ggagaggggt gccaaagggg tctgggaaga cctggagcct cctgggaaag aaaggaacag 1740 ggcatgacag gcgcgggcgc tgagagcgcg ccccagagtg atgggagcct aggaactggg 1800 aggaagtttc tctggacctt cccgcctgac tgggtgggca gagggagggc cggtaccgtt 1860 gatggctctg ctgcggttgg agcgctccac caggatgtcc agatgggctt tgattgcggc 1920 gatgccggcc agcccgccct gcgggtcaaa gcgggcaaag tcaccaaact caggcagacg 1980 ccacacggcc tcgcttttct t 2001 <210> 16 <211> 501 <212> DNA <213> Homo sapiens <400> 16 gctcatctcc acccttagaa tcaccctggt gcaggctgca ccagacctgc ctcccagtgc 60 tggagagggg acaccagggg tagaagtccc catagacagc ccggtcagtg cagccatctc 120 ttcctcctgc gccatcacat cttttgaggg aagcatcttc acctgctcat tacctgcagg 180 ctgagatccc aggctctata cacagttgat ggactgggtt aggggcaagc gtgagtaggg 240 cccaagctcc rggcacatgt gactctgtgt cagttcccat tcctctggga agcatttttc 300 caactcccca tagctggacc ctgccttgcg tcttcaatgc aactctccct gggtgggcgc 360 tttcctgccc ccactctcca tcgccctctc caaggggggt tctatgcaca gctccctccc 420 ccaaacccct ctgcccaccc tctagctccc gctagccatt ttattattca tctaatagct 480 actccaaaac acctcaaatt a 501 <210> 17 <211> 801 <212> DNA <213> Homo sapiens <400> 17 tgtaacagaa tacatagaga ataatgagga gtttatgatg gaaccttaaa tatataatgt 60 tgccagcgat tttagttcaa tatttgttac tgttatctat ctgctgtata tggaattctt 120 ttaattcaaa cgctgaaaag aatcagcatt tagtcttgcc aggcacaccc aataatcagt 180 catgtgtaat atgcacaagt ttgtttttgt ttttgttttt tttgttggtt ggtttgtttt 240 tttgctttaa gttgcatgat ctttctgcag gaaatagtca ctcatcccac tccacataag 300 gggtttagta agagaagtct gtctgtctga tgatggatag ggggcaaatc tttttcccct 360 ttctgttaat agtcatcaca tttctatgcc aaacaggaac ratccataac tttagtctta 420 atgtacacat tgcattttga taaaattaat tttgttgttt cctttgaggt tgatcgttgt 480 gttgttgttt tgctgcactt tttacttttt tgcgtgtgga gctgtattcc cgagaccaac 540 gaagcgttgg gatacttcat taaatgtagc gactgtcaac agcgtgcagg ttttctgttt 600 ctgtgttgtg gggtcaaccg tacaatggtg tgggagtgac gatgatgtga gtatttagaa 660 tgtaccatat tttttgtaaa ttatttatgt ttttctaaac aaatttatcg tataggttga 720 tgaaacgtca tgtgttttgc caaagactgt aaatatttat ttatgtgttc acatggtcaa 780 aatttcacca ctgaaaccct g 801 <210> 18 <211> 601 <212> DNA <213> Homo sapiens <400> 18 cctgggcgga gtatccacag cttgtgtccg gagccaacct tccatcactg gtgaatgctt 60 cacagagttc atgttgtctt tctttcctct tcttcttttt aaacaccatt tttagagtct 120 tagtgcatga aggaaaacag gtcatggcag actcaggacc tatctatgac caaacctacg 180 ctggcgggcg gctgggtcta tttgtcttct ctcaagaaat ggtctatttc tcagacctca 240 agtacgaatg cagaggtgag tgtgagcact tcaggttagt ggggaaggca tgtggcaccc 300 rgaaagggaa aggggtacac agtgcgctct tccacttaca aaagctgtac atcagggcaa 360 taaacctaat gcacaatacc acacatgagc cccctgcaca ttccaagcag ccatttcatt 420 tggtcagctt ggattgtttt gtttttaatt gcatttggat gcctctggta gaacactttc 480 caattattta caggtcccac ccatcctcat tgtgccatag cctagagttt cctagatttc 540 ctccacttcc aggctcctga cgccatttta gattggtacc cttgcattag caatggcagg 600 c 601 <210> 19 <211> 601 <212> DNA <213> Homo sapiens <400> 19 agcctcctga gtagctggga ttacaggtac ccaccaccat gcccggctaa tttttgcatt 60 tttagtagag acaaggtttc accatattgg tcagaccagt ctcgaactct tgacctcagg 120 tgatctgccc acctcggcct ctcaaagtgc tgggattaca ggcatgagtc actactcccg 180 gccacctgaa aggtgtttct aacgaaacct tatggcgttg catccaggtt ttgacgagtt 240 tgggtctgtg gacttcagtg gcacattcta cgtaaacact gaccgggacg acgactatgc 300 yggcttcgtc tttggttacc agtcaagcag ccgcttctat gtggtgatgt ggaagcaggt 360 gacgcagacc tactgggagg accagcccac gcgggcctat ggctactccg gcgtgtccct 420 caaggtggtg aactccacca cggggacggg cgagcacctg aggaacgcgc tgtggcacac 480 ggggaacacg ccggggcagg tgaggttggc tggccaggcc cacacatctc ctctgcacca 540 ctgtggctgc agagtgtttc agcaggacta gcccagtcca gagaccccac ttttgtcttt 600 a 601 <210> 20 <211> 501 <212> DNA <213> Homo sapiens <400> 20 gataatatat cctatgagct gcatcaagat aactgtcctc ccttgtctct ttcaaacgca 60 aacctggaaa tgaagccaat tcctatgttg gcatcaggaa aatggggaga gaattctgtg 120 aagacacaac ctcgctccag ctgatgaggt gaaggcgtct attcatttgg cctgtggcca 180 ctgagataca agaatagtgc tgagatttga gttgaataaa tgttttcaat tgaaatcatt 240 gaaaactcca ygttatcctg tctttgtgga gctcatggaa aactcccaag gccccatgga 300 ggggactgat gaggacctga gacccgcctg cccatggctg ccctcatctg gtcagggttg 360 tgcaccaggg gcagttgtca cagtgatccc ccacaccgcc accgtccccg cgtccctctg 420 atgaggacct ccaaccccgc ctgtctgcgg ctgccctcac ctggtcaggg ttgtgcacca 480 gggggcagtt gtcacagtga t 501 <210> 21 <211> 601 <212> DNA <213> Homo sapiens <400> 21 actgaggtaa aatccgcttt tgcaaaggac tgtctttata gtgacctttg ggaactgcct 60 atgcacaaac gcttgcagga caatggcagt tacgttgaca agggctgttt ggtataaagt 120 ggcctcctgt ggacctcgct gctcctggaa cccagggcag gtgcgcggca caagctgctc 180 ctaccttctc atcggtcaca ccgtcattgt catcgtcatc atcacaggca tcgccaatcc 240 cgtccttgtc aaagtcttcc tgcccagaat ttggcagatg ggggcagtta tcctgcaatt 300 rgagaaggaa gaatacttgg aaaaacattt aggaaagcgc ctctttcagg aatgcaggtg 360 aaatgaaacc agcatggcgc cgaggaagga agcgtgcccc atcagtcctc gatctcaagc 420 gggagctcag agaggagcca ggaggccaga agcgcatgaa tttcttctca actcctgccc 480 tggaaattat gtcaccacta ctcacgaggg agacaaaaga aacttgtgtt cggagcccag 540 aatcccagtg aggttgccaa acaactgcag aaacaagaag agaatgacaa ggtattgctc 600 t 601 <210> 22 <211> 601 <212> DNA <213> Homo sapiens <400> 22 aaacaactgc agaaacaaga agagaatgac aaggtattgc tctgagagaa catgggagag 60 aaaagaggct ttagtggtgc tttccttccc acagacttca gctcataaat caaaatgcca 120 aagcttataa acgtttaacc ctcaccagtc aaaaaatctc aaagaaaaga gtctttcaaa 180 tatgtatgtg atatgttcag aaaccagggc tgagaacttg aaattatctg ctaactcagc 240 ttttcacggt gagtgggtaa tgcatgtaat tgggcaaaaa actcctttca tttcagtcca 300 rttataccat aatttaaagc tgagctgtgc agacactaac aacttgaggg aattggaaat 360 ccaggaaaag gtagagagta gcgtgtccgt ttcagtttca gtacaagagc cccaagggtg 420 tgtcatcccc gtggctcaaa gcacatcact cgttgaatca gaagacagaa cctggggtct 480 tgtgctggag cccagagttt cagcacttac agattaagaa aggagagcat gttgcttgtc 540 atggaggaca cggatcccag acgggggccc ctggcaatgg ggaggatcca gcacctgtgt 600 g 601 <210> 23 <211> 833 <212> DNA <213> Homo sapiens <400> 23 cctgctgtgc aagaaggaaa ttcacactac accaatgtcc cctgaggcac taagccactc 60 acatccgttt ttctttttca gaacaataaa tctgagaaac cctgtgctcc agcaggtgct 120 ggagggaagg aatgaggtcc tgtgcgtttt gacacagaag atcacracga tgcagaagtg 180 tgtgatctct gagcacatgc aggtcgagga gaagtgtggt ggcatcgtgg gcatccagac 240 caagacggtg caggtgcgtg cagtcgggct gccatgcgtt catggctgag cagtcctgga 300 gaccttggtt gggggcctct ttgggtggtg tcttcattag tcaggactct ttctgtagca 360 agttaacaga aacccaactc acgttgtctt aaatgagcaa gagtgtattg gctcatgtag 420 ctgaaacact tggggaaact ggctccagtc actgctggat ccaggccttg aggatatagg 480 gagctgcttt ccctgggttg atgccctctc agatgggctc tccctatggt gctggcagct 540 ccaggcttgc tcctgatttg ctgcaactcc aggggtgagg ggggcctagg actgggtctc 600 accagaccgt cccgggatat gttcattcct gaagcagggc agggttgcag gcactcgaat 660 gggccaggcg ggctggcgtc aggtgctctt tctgtgggcg cagatgtttc cccaagaaaa 720 tgaacacaca gcactgatca ggccaaatgc agaaaatccc tagagccttt caagctgggc 780 tttacaatga ttacttggaa tagctgctgc aaatcctttg tgggttgagt tag 833 <210> 24 <211> 833 <212> DNA <213> Homo sapiens <400> 24 cctgctgtgc aagaaggaaa ttcacactac accaatgtcc cctgaggcac taagccactc 60 acatccgttt ttctttttca gaacaataaa tctgagaaac mctgtgctcc agcaggtgct 120 ggagggaagg aatgaggtcc tgtgcgtttt gacacagaag atcacgacga tgcagaagtg 180 tgtgatctct gagcacatgc aggtcgagga gaagtgtggt ggcatcgtgg gcatccagac 240 caagacggtg caggtgcgtg cagtcgggct gccatgcgtt catggctgag cagtcctgga 300 gaccttggtt gggggcctct ttgggtggtg tcttcattag tcaggactct ttctgtagca 360 agttaacaga aacccaactc acgttgtctt aaatgagcaa gagtgtattg gctcatgtag 420 ctgaaacact tggggaaact ggctccagtc actgctggat ccaggccttg aggatatagg 480 gagctgcttt ccctgggttg atgccctctc agatgggctc tccctatggt gctggcagct 540 ccaggcttgc tcctgatttg ctgcaactcc aggggtgagg ggggcctagg actgggtctc 600 accagaccgt cccgggatat gttcattcct gaagcagggc agggttgcag gcactcgaat 660 gggccaggcg ggctggcgtc aggtgctctt tctgtgggcg cagatgtttc cccaagaaaa 720 tgaacacaca gcactgatca ggccaaatgc agaaaatccc tagagccttt caagctgggc 780 tttacaatga ttacttggaa tagctgctgc aaatcctttg tgggttgagt tag 833 <210> 25 <211> 601 <212> DNA <213> Homo sapiens <400> 25 aaccctcccg ccaagactac tcatcagaga ggaagctaat tcgcagggat atgaagagct 60 ttctaaaatt agttccagac gttttttgtc ttaaatctgt attatggctg ccctattttt 120 aatttacttt ttgaaaaacc acattgggac actaagtaat ctgggggccc aaaagacttt 180 gggggccacg ggaacctccc aggccctagg gggcagtagc accaccatgc agatcaccag 240 ttccacaaaa gaacggaact ccaccagtac ctagtgttat tagtataagt tttgggaaag 300 rtccttcttt tcttagctca ttagaaaaac aatcacagca tcctatggac tgggccaaag 360 tgggcagaca ttgcagtcca gcccaaagag gcctgtccca gcaggacgcc aggggacagg 420 gtcagaaaga ggcagcccac aagctgagca cagctgcaga ggttttagct tgcttactac 480 atttgaatta attgccaaac ttttaaactc aggggaattc cgcaaaaatc taggtaccag 540 cttttcttga agacaacgga agctcaggcc accctgaggc catcacctct ggcagtcacc 600 t 601 <210> 26 <211> 560 <212> DNA <213> Homo sapiens <400> 26 caaagtcccc agtccaccac tggctctgcc ttttaagctg gacataggaa agctttctgg 60 atgtcagtga ggccagacca cagcacacat gagcaggaac agcatgaagc tgctttcccc 120 agagcccctg ggccagaggg cattgctcag ttactggcca gaactggaga gtgagatggg 180 gaggatcagc agggacggct ggagtcactg aaagataggt ttggggcttt gctctctgct 240 ttcttcttgt gtgaccttgg acaacggact tggccttctt gaacctccaa gtcttcattt 300 gtacagtgag atgctactat ctctttgaca aatattttaa ttttctctat tttaggtacy 360 gctgcagcta gaatccaaat ctgccctagg tccaaaaaat ggtgtctata tcaagatcgt 420 atcccgctga cacagaaggc tgccgggtgg ggggagggca cccccaaatt caaagaaaac 480 cctaagtgtg gatgttcaga attttggaaa aatgtcactg aagtgattga aagagtgcct 540 ggcatgcaag gataagaggt 560 <210> 27 <211> 601 <212> DNA <213> Homo sapiens <400> 27 tcagccggta tttctgatgg tctctgctgg cagatgtctt tgccccaatc tttgccattg 60 gttggagtca cctcacggtc agtatgtggt agtcacgacc cctcaagcag tggcgggaat 120 ccacattcat catggcttgg tgggttgtcc ccatgttctg attggggagg gtgagtcctc 180 tgggtgcctt gcagtctctt ggtttgcagt tgcatcagct ctattctggt tagctatatc 240 catctgcata agccattaca tcattatata tgcgttgttg gctagttata cccttccata 300 kccatgcatt tcaggacttt ctgagccttt tctaaacatg gttatgactt tgggggaggg 360 caggtgttta tctcttttga gtgatatagt catatgcaaa tccgtaatgg ggtatacatg 420 tatgatttta attacttttg gaaaaaaagc ttacaaggaa aggaacattt ttggtgggaa 480 aacttgttta tcttcttttt ttaaagcaca gcaaatagat ttttatttat tcctgttaaa 540 atgtatgatg taatatcctt taatcatatg aagctagtta gtgacaaagc taggattcat 600 a 601 <210> 28 <211> 501 <212> DNA <213> Homo sapiens <400> 28 tgagctactg ggcctggccg gcaatttagt ttttaaaaat ctctgtccag cagagacttc 60 cttattttat tgttattatt attttttgta ctgatagaca caacagatgc attttgaata 120 agtctgaaca tcactacttt ctatgagtaa ctgaccactg ggagaccaga tgagatattt 180 cttatgcctc ccattccagg aggccattat ttaggggaca catggtgtgt tatggatatc 240 taacatatca ygttggatca ctcccacttc tcaatggttc tctgctgtac atcaagaagt 300 taagattaaa aaacagaagc atgaagaaga atgttgtgga ctgattttag ttgcactgat 360 gtgagttttc ctatccagct acactttttg tccggtgtct actgcgaaca atggcacatt 420 tcatcacaca gctggataat gttaagaatt ataatctgta accccagatt agatgaaaat 480 aaatgccact cgtaatacta a 501 <210> 29 <211> 501 <212> DNA <213> Homo sapiens <400> 29 gaccactggg agaccagatg agatatttct tatgcctccc attccaggag gccattattt 60 aggggacaca tggtgtgtta tggatatcta acatatcatg ttggatcact cccacttctc 120 aatggttctc tgctgtacat caagaagtta agattaaaaa acagaagcat gaagaagaat 180 gttgtggact gattttagtt gcactgatgt gagttttcct atccagctac actttttgtc 240 cggtgtctac ygcgaacaat ggcacatttc atcacacagc tggataatgt taagaattat 300 aatctgtaac cccagattag atgaaaataa atgccactcg taatactaat tatgttcctg 360 accttattgt atttacttac taaccccaaa taaactcatc tatttactta aatgggaata 420 ttaacatcac ataaatcata tttaacaatg aacacatgaa acaatttatc acctagttta 480 atattctgct tgactcatct a 501 <210> 30 <211> 501 <212> DNA <213> Homo sapiens <400> 30 aacatatcat gttggatcac tcccacttct caatggttct ctgctgtaca tcaagaagtt 60 aagattaaaa aacagaagca tgaagaagaa tgttgtggac tgattttagt tgcactgatg 120 tgagttttcc tatccagcta cactttttgt ccggtgtcta ctgcgaacaa tggcacattt 180 catcacacag ctggataatg ttaagaatta taatctgtaa ccccagatta gatgaaaata 240 aatgccactc rtaatactaa ttatgttcct gaccttattg tatttactta ctaaccccaa 300 ataaactcat ctatttactt aaatgggaat attaacatca cataaatcat atttaacaat 360 gaacacatga aacaatttat cacctagttt aatattctgc ttgactcatc taaatcttgt 420 ttctcttaca gaatttccat ctgtgtccac tctattggaa taatgctgtt tctgtatact 480 tctacaatgc ctgctttact a 501 <210> 31 <211> 601 <212> DNA <213> Homo sapiens <400> 31 cctatccagc tacacttttt gtccggtgtc tactgcgaac aatggcacat ttcatcacac 60 agctggataa tgttaagaat tataatctgt aaccccagat tagatgaaaa taaatgccac 120 tcgtaatact aattatgttc ctgaccttat tgtatttact tactaacccc aaataaactc 180 atctatttac ttaaatggga atattaacat cacataaatc atatttaaca atgaacacat 240 gaaacaattt atcacctagt ttaatattct gcttgactca tctaaatctt gtttctctta 300 yagaatttcc atctatgtcc actctattgg aataatgctg tttctgtata cttctacaat 360 gcctgcttta ctattgaaca gtaggtatga aatggagtcc tcaatgggat tatctcattc 420 atggatacat cttttgatag tagaaaaaca aaagtataca taggttgagg gaaagacaga 480 aaaagcagtt gctgaaaaat agcacctgat ggtgagtaat ggacagcaat caggtttttg 540 ttatgttttc ttgttgtttt tgtgacaggg tctccgccct ctgtcaccca ggctggagtg 600 c 601 <210> 32 <211> 701 <212> DNA <213> Homo sapiens <400> 32 tctgcttgac tcatctaaat cttgtttctc ttacagaatt tccatctgtg tccactctat 60 tggaataatg ctgtttctgt atacttctac aatgcctgct ttactattga acagtaggta 120 tgaaatggag tcctcaatgg gattatctca ttcatggata catcttttga tagtagaaaa 180 acaaaagtat acataggttg rgggaaagac agaaaaagca gttgctgaaa aatagcacct 240 gatggtgagt aatggacagc aatcaggttt ttgttatgtt ttcttgttgt ttttgtgaca 300 gggtctccgc cctctgtcac ccaggctgga gtgcagtggt gtgacctcgg ctcacagcaa 360 cctctgcctc ccgggctcaa gtgatcctcc caacagactc ctgagtagct gggactacac 420 gcatgtgcca ctttgcccag ctaatctgtg tattttttgt agagatggag cttcgccgtg 480 atggccaggc tgatcttgaa ctcccgggct caagtggccc gccggcctcg gcctcccaaa 540 gtgctaggat tacaggcatg agccaccgtg accagcctag caatcaggtt tttatctgtt 600 tcgggtgtgg gagaggctaa gctaagccag aattcaaaag aaaggaatat attcataatt 660 agttacactt agggtgaaaa ataaaatttc agagagagca g 701 <210> 33 <211> 601 <212> DNA <213> Homo sapiens <400> 33 ggtgtgggag aggctaagct aagccagaat tcaaaagaaa ggaatatatt cataattagt 60 tacacttagg gtgaaaaata aaatttcaga gagagcagaa taggaaaaaa aatcataatt 120 ttaataattc acacttattg cactattgtt atttcaattt tacagagaaa aacaggcata 180 gagtgatgaa atagcaagta catatataaa caacaaatgg cagagctgga acgccaatcc 240 tagcaatctg aaccagcatt taaaaacgag gaattaaagc acatagggtg agaaaaatta 300 yagggaaggg acaagagctc aaggggacac ctggaaaaga aatactataa tttaaaaaat 360 taccaacggg ctaattatac ttgtgtttta ctggcagagc gctttgtctg ttagctcttt 420 taatcccgct aataattctg tgaggttggt actataagtt ccatttttca gacaaggacg 480 cttaatagat gataacggac acattccctg cttgctctga tctgcatctc aacagctggc 540 tctggagcaa gcaaagggag actaagggaa acagagcgga gcagggcaaa gaaagccgac 600 t 601 <210> 34 <211> 705 <212> DNA <213> Homo sapiens <400> 34 gccatatcgt ttcctcaaac taagtgaatc caagcttgaa ttgttccagt tcaccaccca 60 tttcatatag tttggccact agccccaaga atagatattt tggcatcatc tttagttctt 120 ccttttcaaa attccctttg accaagcgtt caccatgtgt ttctcatacc catacattca 180 tccccaattt ccctaataca rtgctgtgtg ttgtgatttg tcttagattc ggttatcagc 240 agtatttatt tatttatttt tattttattt tattttattg aggagtctcg cactattgcc 300 caggctggaa tgcagtggcg tggtctcggc tcactgccat ctctgcctcc caggttcaag 360 cgattctcgt gcctcagcct cccaagtagg tgggattaca ggtgcctgcc atcacgcctg 420 gctaattttt gtggttttat tagagacggg gtatcaccat gttggccagg ctggtcttga 480 actcctgacc tcaggtgacc cgcccacctc ggcctcacac agtgctgtga gggtgtgagc 540 cactgcgccc ggcttatcag cagtatttaa ataacccttg tttctatgaa tgtagtgatg 600 agtttctata tacagttttg tgttgtgaag cctgagggga aagtgagttc ctggataagt 660 cttgtcatat ttattttgtg tttctagcac tgttgcacag cacat 705 <210> 35 <211> 806 <212> DNA <213> Homo sapiens <400> 35 ttccctttga ccaagcgttc accatgtgtt tctcataccc atacattcat ccccaatttc 60 cctaatacaa tgctgtgtgt tgtgatttgt cttagattcg gttatcagca gtatttattt 120 atttattttt attttatttt attttattga ggagtctcgc actattgccc aggctggaat 180 gcagtggcgt ggtctcggct cactgccatc tctgcctccc aggttcaagc gattctcgtg 240 cctcagcctc ccaagtaggt gggattacag gtgcctgcca tcacgcctgg ctaatttttg 300 tggttttatt agagacgggg tatcaccatg ttggccaggc tggtcttgaa ctcctgacct 360 caggtgaccc gcccacctcg gcctcacaca gtgctgtgag ggtgtgagcc actgcgcccg 420 gcttatcagc agtatttaaa taacccttgt ttctatgaat gtagtgatga gtttctatat 480 acagttttgt gttgtgaagc mtgaggggaa agtgagttcc tggataagtc ttgtcatatt 540 tattttgtgt ttctagcact gttgcacagc acatattaag tgtttagaaa atgttttttg 600 aatgaacagc ccaaagacta aaggaagagc ttccttagtt ctaaattctg tgaaatgagt 660 cagtttgaat gaagctaatg tttcttttgt tttacaccat ttactataga gtgcagtact 720 tcagttgttc atacttgttg atattgttac agatattcta atggttcttt ttacaaaatc 780 aaatatatat acaagacatt ttttcg 806 <210> 36 <211> 601 <212> DNA <213> Homo sapiens <400> 36 ttgggactca tcaacggtag tgcctcataa cttacaattg aattttactt actggaacaa 60 ctttttaata cattcattga tttaatcatt gtttgggtag tattacagca acagtctcct 120 cctctcctgc cctctccacc ctccctacgt atacacacca cataccagat gtggctgata 180 aatgctttgg attaatatat ctggatttta cttaaacagt gattttttct tttgtaggaa 240 cctccatatt taagaattcc cagtagctac tttttaccca atatggcagg cattacattc 300 rtgccttagg atatattata gtgattgagt ttctgtttct aaacattttt gatggaaaga 360 atgagtataa gaggcagttc ctttttgcag ctccacgcca ggggaccagg gttacacaca 420 ttactatgtc tgttaaactt cttggaagcc atccagtatg ttatggtgag tttcatgtgc 480 aagtggtgag ccactaaaag gcttgcacag tattataact attctgagga gaggaggcat 540 ttggaaattc agcagtaaag cagcgacagc aacaagcagc cacagtggta gtaaagagag 600 g 601 <210> 37 <211> 601 <212> DNA <213> Homo sapiens <400> 37 ctctcagtgt tacttgggct tgggtactac ctgagtgatt ttgatggggt gaggatcagt 60 tgagttttcc accatctcca caggttttgg tgctttccaa atattctctg tcactatgat 120 atccacagat gtggtatcac tgaaggaatt ctcactctgg cctcccatgt ccttcactga 180 gatcaccaga ttataggaag gattcttagc aggattcaat tcctgagatc ctgtgagaag 240 taggggagaa gggaatagga agtgtctctg gaagagacag accaaactcc ctttttctga 300 ycgagacagt cctaccaggg cacccctgaa cttaccctct cgggtaagag agatggctcc 360 cgttttgttg ttgatctgaa agtacatgac attgttgatc atgggaagct ggatgacaat 420 ctggtaataa agctggccat tgggagtggc cggatcatcc aggtctgtgg cattgacata 480 caagaagggc tttcctgttt aacaagacgt acccagaaaa aaaaaaaaaa agatattaat 540 tgaaacccaa accaaccata gctacttttt ccatctaaaa agtggagata taatagggaa 600 a 601 <210> 38 <211> 601 <212> DNA <213> Homo sapiens <400> 38 cttggaccag accttgagaa accttatttt agggaagggg ggagtgggat aaatatttat 60 ttccccactg gtgtgaattg tcataccttt ccctattata tctccacttt ttagatggaa 120 aaagtagcta tggttggttt gggtttcaat taatatcttt tttttttttt ttctgggtac 180 gtcttgttaa acaggaaagc ccttcttgta tgtcaatgcc acagacctgg atgatccggc 240 cactcccaat ggccagcttt attaccagat tgtcatccag cttcccatga tcaacaatgt 300 yatgtacttt cagatcaaca acaaaacggg agccatctct cttacccgag agggtaagtt 360 caggggtgcc ctggtaggac tgtctcggtc agaaaaaggg agtttggtct gtctcttcca 420 gagacacttc ctattccctt ctcccctact tctcacagga tctcaggaat tgaatcctgc 480 taagaatcct tcctataatc tggtgatctc agtgaaggac atgggaggcc agagtgagaa 540 ttccttcagt gataccacat ctgtggatat catagtgaca gagaatattt ggaaagcacc 600 a 601 <210> 39 <211> 601 <212> DNA <213> Homo sapiens <400> 39 attgtgccac tgcactccag cctgagcgac agagtgagac tccgtctcaa aaaaaaaaaa 60 aacctggcag aaattaaaac taggaaatag attaaaatag atgacttaga acttatacca 120 gatagccaat cctttctgtt tttggtggtt cttacagact tgagatgggg cttgtcttaa 180 ggaaacaagg tgccctgaat ctaaagactg actaaaggtg gtagaactta agctatttag 240 gtgatcctga aaccaacagg gacagttatg gtattgacta gtgggaactt ttaagagcca 300 yaagactgga gaagtcaaat gctgaccaat tatatcaaac ccacctgggg aacaaactaa 360 gtcaaaatta ctagggtggc taaaagctga gaaatgaact aggtggcagt taggactgct 420 gaacaaaccc atctcaccat agcagagaaa tattagaata cacctagagc catgtaaaga 480 ttcaggagtg ctaaaagcat gaattcccct ccttccctcc accaccccca aaataagctt 540 ctgcctgaat gaaggtaact gggaatctgc aatttttttc tttcctgcac aaataaccac 600 t 601 <210> 40 <211> 681 <212> DNA <213> Homo sapiens <400> 40 gtcaaagcag taaaacatta ctgtttataa aactagttcc taaaattttt aaatataaac 60 acaattattt atttgaccag aatctaagag atggaaacat tttaggccaa aaaaaggaca 120 agccgccagg cgcagtggct cacgcctgta atcccaacac tttgggaggc caaggtgggt 180 ggatcatgag gtcaggagat cgagaccatc ctggccaaca tggtgaaacc ccatctctac 240 taaaaataca aaaattaggt gggcgtggtg gcgcgtacct gtaatcccag ctactgggga 300 ggctgaggca ggagaatcgc ttgaaccagg gagtcaagag gttgcagtga gctgagattg 360 tgccactaca ctccagcctg gcgacagagc aagactccat ctcacacaca cacacacaca 420 aaaagtacaa gcccttctac ctcagtctgt ctcactaact cttgatccat ctttaacacc 480 kacagtatcc atctctccta tgtgtacaca cacgttcatc tctttcattc cctaactaaa 540 ctgtctatac ataagacaag cacagcagtt taaccaaatt ctcttgccag aatcagaaaa 600 gtgtatattt gctttacctg tcctccacct tcttctttac acaggtcaat tgcaaatgcc 660 aaatccattg ctgagaaaac a 681 <210> 41 <211> 501 <212> DNA <213> Homo sapiens <400> 41 cactgtatag caacatttat aaggcagccc aggatgacat caagaaacag tgagtgaaca 60 agtaagcact tgtaataccc ggcataatac cttaaaatcc gttaggattt actgagttta 120 tctatagtaa atttgaagag actatggatg ttcaaagggt atcatcataa aagttcagta 180 aatctccaaa taccatgaaa gtaaattgtt acttctgaaa gagcagtaaa atataacctg 240 gcaaatgaag ytactacttg agattctaac agggctaaaa tataacttac aaagctttta 300 caataaaaat ttcaattttt aagttttttc agttgaaaaa aattcttcac tggccaaatc 360 actagcaaaa tcagtttcaa acatcactgc cttgttttag tagagcatat ggtggaccca 420 tttgcttggt ttcgtaagat tcccatctaa ctgaccctaa acttctattt tctggttaaa 480 aactcacttt cccagatctg g 501 <210> 42 <211> 1289 <212> DNA <213> Homo sapiens <400> 42 aatttagact tacactagaa tcaagactga gtagaaaaca gaatacttct gctgatcatg 60 gtgtctcatg cctataatcc cagcacttga gaggccgagt tgggaggatc acttgagctc 120 aggagtttga gaccagcttg ggcaacacag ggagatggtc gtctctacta aaaattaaaa 180 aagaaaaatt ggccaagtgg tggcgcgcgc ctgtggtccc agctacttga gaggctaatg 240 tgggagaatc acatgagcct gggaggttga ggctgtagcg agcatgatca tgtcactgca 300 ctccagcctg ggtgacagaa caagaccctg tctcaaaaac aaaaagaaaa ccgaggccag 360 gggcagtggc tcacgcctgt aatcctagca ctttgggagg ccgaggccgg tggatcactt 420 gaggtcagga gtttgagacc agcctggcaa acatagtgaa accccgtctc tactaaaaat 480 acaaaaatta gcctggcatg tagctcccag ctactgggga ggctgaggca agagaattgc 540 atgaacctgg gaggcgtagg ttgcagtgag ccaaggtagt gccactgcac tccagcctga 600 gggacaagag caaaactcca tctcaaaaaa aaaaaaaaaa aaagaaaaga aaattgaata 660 ctggaattat atcatttggt ccttgcaggt ttgtgttttt tgattaatac ttagtactgt 720 aacaggtaaa ccccacaagt ctaaatggct ttgcccagtt gaagtttctt tcatgcatgg 780 agcctgatag gtgagtagtt ttcctccaag cagtgatgta gggccttgag ctccttccat 840 gcagtaattc tctatcatca gcatatagct ttactaaggt ccctrtgctt gcctgcatca 900 agctatcaaa tgggaaaatg tggctttttg cagattaggc ctggaaatgg ctcacatttc 960 attgccttgg gtacagtcat aggccacagc taacttcaag gaaggctgga aaatgtggag 1020 catatatagg catgtaaccg ggttgaagag gaaatggttt tggtgggcac ctgtccagta 1080 tctgtcacaa gtgttttgtc cttacctgcc ctgagaaata gtctcaagaa acagtcctag 1140 aagtttgcta ttgcatttac tataaggata tttttgagga ttattttttt cagtgtaacc 1200 cattatctca tatatgatga tccatacttg ccttgattgc catggaaagc tttatttttt 1260 tcttgagttt ctagtgttat ttgtttttt 1289 <210> 43 <211> 1155 <212> DNA <213> Homo sapiens <400> 43 tgtgatagaa acctaattgc aggaaacagt gaaggcacca tggagttagt ggtcaattct 60 agcagaggac ctttagtaga acttctgtac tggaaatgag gacattaatg atgagtaggc 120 attccctaga gagtgggaga gggtgtttct gagggtagca aagatgagaa gagctgtgag 180 cagcttggtg tcaagatctg tgaaagctaa caggttagtc tgccacagtg tggcaggatg 240 ttggcagaag gtgagagctt atgagtcaga gacaaaggtc tttatcactt gttgcacatt 300 gcacatcaaa cagtgtgagt atcatagttc accttgttgt cccctctgtc ccaagtccta 360 cattggtgag gaagggccca gggtgatact gtggagggtt tgtgtcacag ctgaggaact 420 tgggggctaa gggacccaga tttttgtaac grgcagtaaa cttccctgac ctttgccatg 480 gaggaagaca tcatcttatt atactgccct ctgatgtaga aggaggcaca acctcaatct 540 tccagggttt tgttcattat acaaacatcc ttgaaaacta taaaagggca gttagtcatc 600 caggcggggt gactcacacc tgtaattcca gcactttgag aggccaaggc gggcagatca 660 cttgaggtca ggagttcgag accagcttgg ccaacttggt gaagactcca tctcttctaa 720 aaatacagaa attaagctgg catggtggtg ggtgcctact aaggaggccg aggcaggaga 780 atcacttcaa cccagaggca gaggttgcag tgagccgaga ttgcgccata gcactccagc 840 ctgggtgaca gtgtgagact ctgtttcaaa aaaaaaaaaa cgagggggac agttattgtc 900 tttgctcaca agatgtgagg agatgtgtga gaccatgagg aatagtgtcc cagcacctgg 960 caggttcagg gaactgcatg ttaacttcag gttgggcctt tccaagaatt ttaccaccac 1020 gtattatagc ttcagcactt actacctcaa gaagatgtag taccttactt gatgcacgaa 1080 agtcctattt gctgctggat tccttctttc cctttttggc aaagagggca tcatattctg 1140 ttggctaggc agaat 1155 <210> 44 <211> 601 <212> DNA <213> Homo sapiens <400> 44 attcacttct ttatcatgaa acaatatgat ttttaacaaa tggttgctgt atatatagca 60 acatatatat gttggtacca agccatatgt ccattttgtt gactttcaga aattttcaca 120 acaacaaact gtgactctaa tagtctaaca attttacctg gagtatgacc tgagcctttt 180 ctcacgcacc attttgttaa acatatattt aatttgggga atgtaaatac aaatttttct 240 tatgtaaaca ttacaccctc ccctgacccc ctaacttaca ttgagatatt tgtgtattct 300 ktgctgcctg gatagtaaaa gctgaagggt cttctaccag aagcatgtta acccaggaga 360 tagtctatga aatatactta taagtaacta ttataaaatt aaagcatatg ttttaaaagc 420 caaatgtgaa tggactttac attatttcat tactttggaa tcatctgtgg tattattctt 480 cagagtagaa aagaaaaact gacatttgtc tgcactaaga tatatgaatg gagagtcttt 540 gagtaggttc attcccaaat gtgatgaaag aaagctttat tttcttttca tttagttatc 600 t 601 <210> 45 <211> 990 <212> DNA <213> Homo sapiens <400> 45 tccagtattc agtgagtgct ctctgtgtgc taagcactgt agctaaaatg agagtagacc 60 agaagagtgg atcaggagca gacggtcaac agctctggag gccaggttat agaactgaga 120 cctgacccaa caggcagcga gaaagacact acaagatttc aaggggtaaa gtgataatga 180 gagtttaaaa aaaatgatta ctattgttaa tgcctggagg acacacaact gagagggagg 240 gagagggcag ttgctgccat atgttaacta atggcagagg ggctggcaac tgggatgctg 300 ttttagaagc agaatcacca actcttgttg tgagaggaag acaagagagt aatgatgcct 360 ccaagatatt taatgtggac agctagtcaa atgctattgt cttgaggaaa gcagcagaaa 420 tcactgaaac tgtaagactt tgaagtgaag ttacctagta cgaagctaga aacatttaaa 480 atgagggcay tggaagaaaa agaagagaga tgcaaaagga actctgaaaa actaaagaag 540 aaaaccagca gtgaggaatg ctacagaaga aaggaaagag aaaatttcca ttagagaagt 600 gcttgtcctt tggggaaaat gatgcaagtg agcttaagta agaaaagggc taatccttaa 660 atccactgga tttaattata aggcattctg cacacagcgc tggaattatg gctccatgaa 720 gcttctctca gacaaggatt atgtccctag tgtctggccc gctgcctgtt gcacagtggg 780 tgctcagatg atgtgggtga acgagtaaat tatcagaagg ctaaattaca ctgtgttgag 840 atgagaaaag gggttaacag agttctgtgt gaaccaatga atatagtttg atggcaaagg 900 aggaacaaga tgagaagcag atcatgggaa tgagagtgct aaagatactt gtttatttgt 960 atagaggaac aggagctaga agttgttatg 990

Claims

delete

In a polynucleotide consisting of the DNA sequence of SEQ ID NO: 2, a polynucleotide consisting of 10 or more consecutive DNA sequences comprising the 501th base (polymorphic site) G and comprising the 501th base or a complementary polynucleotide thereof; Or a polynucleotide consisting of the DNA sequence of SEQ ID NO: 3, wherein the 501th base (polymorphic site) is A and consists of a polynucleotide consisting of 10 or more consecutive DNA sequences comprising the 501th base or a complementary polynucleotide thereof Enteric gastric cancer diagnostic microarray comprising a probe.

In a polynucleotide consisting of the DNA sequence of SEQ ID NO: 2, a polynucleotide consisting of 10 or more consecutive DNA sequences comprising the 501th base (polymorphic site) G and comprising the 501th base or a complementary polynucleotide thereof; Or a polynucleotide consisting of the DNA sequence of SEQ ID NO: 3, wherein the 501th base (polymorphic site) is A and comprises a polynucleotide consisting of 10 or more consecutive DNA sequences comprising the 501th base or a complementary polynucleotide thereof Kit for diagnosis of enteric stomach cancer.

In order to provide information necessary for diagnosing enteric gastric cancer, (i) obtaining a nucleic acid sample from a sample; And (ii) analyzing the nucleotide sequence of the polymorphic site of the polynucleotide of SEQ ID NO: 2 or 3 or its complementary polynucleotide from the nucleic acid sample, wherein the single nucleotide polymorphism in the sample is provided. The polymorphic site means the 501th base for the sequence of SEQ ID NO: 2 and the 501th base for the sequence of SEQ ID NO: 3).

The method of claim 7, wherein the analyzing of the nucleotide sequence of the polymorphic site is performed using a primer or a probe whose template is a polynucleotide of SEQ ID NO: 2 or 3 or a complementary polynucleotide sequence thereof. A method for detecting a monobasic polymorph in a sample, characterized in that the.

The method of claim 7, wherein the analyzing the base sequence of the polymorphic site comprises at least 10 tens of the polynucleotide consisting of the DNA sequence of SEQ ID NO: 2, wherein the 501th base (polymorphic site) is G and comprises the 501th base. Polynucleotides or complementary polynucleotides consisting of contiguous DNA sequences; Or a polynucleotide consisting of the DNA sequence of SEQ ID NO: 3, wherein the 501th base (polymorphic site) is A, and the polynucleotide consisting of 10 or more consecutive DNA sequences comprising the 501th base, or a complementary polynucleotide thereof is fixed Hybridizing the nucleic acid sample to the prepared microarray; And analyzing the obtained hybridization result.

10. The method of claim 9, wherein the length of the DNA sequence of the polynucleotide immobilized on the microarray or its complementary polynucleotide is 10 to 100 nucleotides each.

The method of claim 7, wherein the analyzing of the nucleotide sequence of the polymorphic site comprises genotype or allele analysis of the polymorphic site of the DNA sequence of SEQ ID NO: 2 or 3. 9. Method for detecting monobasic polymorphs.

delete