KR102193658B1

KR102193658B1 - SNP markers for diagnosing Soeumin of sasang constitution and use thereof

Info

Publication number: KR102193658B1
Application number: KR1020190074103A
Authority: KR
Inventors: 반효정; 진희정; 이시우; 백영화; 차성원; 류다은
Original assignee: 한국한의학연구원
Priority date: 2019-06-21
Filing date: 2019-06-21
Publication date: 2020-12-22

Abstract

The present invention relates to a single nucleotide polymorphism (SNP) marker for diagnosing soeumin (a type of sasang constitutional medicine) among sasang constitution. More specifically, the present invention relates to a composition for diagnosis of soeumin comprising a SNP marker specific to soeumin, a kit or a microarray containing the composition, a method for diagnosing soeumin among sasang constitution using the marker, and a method for selecting the SNP for diagnosis of soeumin.

Description

SNP markers for diagnosing Soeumin of sasang constitution and use thereof

본 발명은 사상체질 중 소음인 진단용 SNP(single nucleotide polymorphism) 마커에 관한 것으로, 보다 구체적으로 소음인 특이적 SNP 마커를 포함하는 소음인 진단용 조성물, 상기 조성물을 포함하는 키트 또는 마이크로어레이, 상기 마커를 이용한 사상체질 중 소음인 예측 또는 진단을 위한 정보의 제공방법 및 소음인 진단용 SNP를 선별하는 방법에 관한 것이다.The present invention relates to a SNP (single nucleotide polymorphism) marker for diagnosing noise in sasang constitution, and more specifically, a composition for diagnosis in noise including a specific SNP marker for noise, a kit or microarray containing the composition, and sasang constitution using the marker It relates to a method of providing information for predicting or diagnosing a noise person, and a method of selecting a SNP for diagnosis of a noise person.

사상의학은 100여년 전 이제마가 독자적으로 창설한 우리나라 고유의 체질의학으로 알려져 있다. 사상의학은 사람의 체질을 태양, 소양, 태음, 및 소음의 네 가지로 분류하는데, 이는 체질의 특성을 다소강약(多少强弱) 이라는 비례 기능적 생리현상과 성정, 행동, 태도, 용모 등을 종합하여 분류한 것이다. 이때, 다소(多少)의 의미는 기능상의 생리적 허실(虛實), 즉 강약(强弱)을 의미한다.Sasang Medicine is known as Korea's unique constitutional medicine, founded by Ijema about 100 years ago. Sasang Medicine classifies human constitution into four categories: sun, soyang, taeum, and noise, which synthesizes the proportional functional physiological phenomenon of a person's constitution, which is somewhat strong and weak, and sexuality, behavior, attitude, and appearance. And classified. At this time, the meaning of some (多少) means a functional physiological heosil (虛實), that is, strength and weakness (强弱).

상기 네 가지 체질은 약에 대한 반응성이 다를 뿐만 아니라 외모, 체형, 성격, 장부기능, 식생활 등에서도 차이를 보인다고 알려져 있다. 이제마는 1894년에 저술한 동의수세보원을 통해 인간은 천부적으로 장부허실이 있고, 이에 따른 희로애락의 성정이 작용하여 생리현상을 빚는다고 주장하면서, 각 체질에 대한 생리, 병리, 진단 감별법, 치료와 약물, 식품 섭취에 이르기까지 체질과 연관지어 이를 임상에 응용할 수 있는 새로운 방법을 제시하였다.The above four constitutions are known to show differences not only in reactivity to drugs, but also in appearance, body type, personality, book function, and diet. Ijema argued that through Donguisusebowon, which he wrote in 1894, that humans naturally have books and demeanor, and that the resulting emotions of happiness and sorrows act to create physiological phenomena, the physiology, pathology, diagnosis, discrimination, treatment and From drug to food intake, a new method was suggested that could be applied to clinical settings in connection with the constitution.

동의수세보원에 의하면, 체질은 선천적으로 정해지면 일생 동안 변하지 않고 유지되는 것으로 받아들여지고 있어, 생물학적으로는 유전적인 특성이라고 할 수 있다. 관련하여, 인구 내 체질 비율이 태음인:소양인:소음인 = 5:3:2로서 사상체질 이론이 주장된 후 100년이 넘는 기간 동안 거의 변하지 않고 있고, 쌍둥이 및 가계를 대상으로 이루어진 체질의 유전율 연구에서도 체질별로 유전적 소양이 50% 안팎에 이르는 것으로 밝혀졌다. 따라서, 모태에서 결정된 체질은 일생 동안 변하지 않는다고 보기 때문에 체질 형성과 유전적인 차이는 서로 연관성이 클 것으로 여겨진다.According to Donguisusebowon, the constitution is considered to remain unchanged for a lifetime if it is determined by nature, and it can be said that it is a biologically genetic trait. In relation to this, the constitution ratio in the population is Taeeumin: Soyangin: Soeumin = 5:3:2, which has remained almost unchanged for more than 100 years after the theory of Sasang constitution was asserted, and even in a study on the inheritance of constitution in twins and families It has been found that the genetic literacy by constitution reaches around 50%. Therefore, it is believed that the constitution determined in the mother's womb does not change throughout life, so constitution formation and genetic differences are considered to be highly related to each other.

한편, 전통적으로 사상체질의 판별은 개인의 음성, 외모, 체형, 성격 및 다양한 외형적인 생물학적 지표를 종합적으로 이용하여 수행해왔으며, 숙련된 전문가에 의하여 주관적 판단을 바탕으로 진단이 행해지고 있다. 현대에 들어서는 맥진법, 신체형태지수, 설문검사법, 안면부 형상분류법, 약물복용법, 적외선 촬영법 및 O-ring test법 등 다양한 방법이 개발되었다.On the other hand, the discrimination of the Sasang constitution has traditionally been performed by comprehensively using an individual's voice, appearance, body shape, personality, and various external biological indicators, and diagnosis is made based on subjective judgment by skilled experts. In modern times, various methods such as pulsing method, body shape index, questionnaire test method, facial shape classification method, drug use method, infrared imaging method and O-ring test method have been developed.

뿐만 아니라, 체질분류의 객관화를 위해 체질문진표를 고안하여 다양한 체질의 판정지표를 체계적으로 종합하여 과학적인 체질 진단을 가능케 하고, 체질을 유전학적인 면에서 검토하여 체질별 상이성 및 연관성을 찾아보려는 노력이 수행되어 왔다.In addition, a constitutional questionnaire was devised for objectification of constitution classification, systematically synthesizing various constitution determination indicators to enable scientific constitution diagnosis, and an effort to find differences and associations by constitution by examining constitution in terms of genetics. This has been done.

그러나, 기존의 연구들로 체질에 대한 유전적 배경은 확립되었으나, 유전적/환경적 소인이 복합된 형질의 특성 상 기존의 유전체 분석방법에 의한 분류가 어려운 실정이다. 또한, 한방병원 및 한의원에서 수집한 통계자료나 질병관리본부 및 건강보험공단에서 제공하는 환자의 정보를 이용한 체질 분류는 규모가 매우 작고 성능평가가 어렵다는 단점이 있다.However, although the genetic background of constitution has been established by the existing studies, it is difficult to classify it by the conventional genome analysis method due to the characteristics of the trait in which the genetic/environmental predisposition is complex. In addition, constitution classification using statistical data collected by oriental medicine hospitals and oriental clinics or patient information provided by the Korea Centers for Disease Control and Prevention and Health Insurance Corporation has a disadvantage in that the scale is very small and performance evaluation is difficult.

또한, 사상체질 진단과 관련된 공개, 등록 특허가 존재하며, 예시적인 특허는 아래와 같다. 등록번호 10-0455996호는 손가락을 금고리 및 은고리에 접촉시킨 후 신체 반응을 측정해 사상체질을 판별하는 장치에 관한 것이며; 등록번호 제10-0427243호는 음성의 피치를 분석하여 사상체질을 감별하는 방법에 관한 것이며; 등록번호 제10-0563127호는 체질에 맞는 식품 또는 약품 접촉 시 손목의 압력을 통해 사상체질을 판별하는 장치에 관한 것이다. 하지만, 이러한 연구에도 불구하고, 아직은 과학적이고 정밀도가 높은 사상체질 진단방법이 개발되지 못하고 있는 실정이다. 따라서, 높은 정확도와 재현성 및 신뢰도를 갖는 사상체질 진단방법의 개발이 절실하다.In addition, there are disclosed and registered patents related to the diagnosis of Sasang constitution, and exemplary patents are as follows. Registration No. 10-0455996 relates to a device for determining sasang constitution by measuring a body reaction after a finger is brought into contact with a golden ring and a silver ring; Registration No. 10-0427243 relates to a method for discriminating sasang constitution by analyzing the pitch of the voice; Registration No. 10-0563127 relates to a device that determines the sasang constitution through the pressure of the wrist when in contact with food or medicine suitable for the constitution. However, despite these studies, a scientific and highly accurate Sasang constitution diagnosis method has not yet been developed. Therefore, it is urgent to develop a method for diagnosing Sasang constitution with high accuracy, reproducibility and reliability.

이러한 배경하에 본 발명자들은 사상체질의 유전성에 근거하여 사상체질과 연관된 SNP를 대상으로 연구를 수행하였고, 과학적이고 객관적으로 사상체질을 진단할 수 있는 표지자를 찾기 위해 노력하였다. 그 결과, SNP 중 소음 집단은 특정 SNP들에서 특정 유전자형을 가지고 있음을 확인하고, 본 발명을 완성하였다.Under this background, the present inventors conducted a study on SNPs related to the Sasang constitution based on the heritability of the Sasang constitution, and tried to find a marker capable of scientifically and objectively diagnosing the Sasang constitution. As a result, it was confirmed that the noise group among SNPs has a specific genotype in specific SNPs, and the present invention was completed.

본 발명의 하나의 목적은 사상체질 중 소음인 진단용 SNP(single nucleotide polymorphism) 마커를 제공하는 것이다.One object of the present invention is to provide a diagnostic SNP (single nucleotide polymorphism) marker, which is noise in sasang constitution.

본 발명의 다른 하나의 목적은 상기 소음인 진단용 SNP 마커를 검출할 수 있는 프로브 또는 증폭할 수 있는 프라이머를 포함하는 소음인 진단용 조성물을 제공하는 것이다.Another object of the present invention is to provide a diagnostic composition for noise-in comprising a probe capable of detecting the SNP marker for diagnostic use in noise-in or a primer capable of amplifying it.

본 발명의 또 다른 하나의 목적은 피험자로부터 채취한 시료로부터 획득한 염기서열을 이용하여 소음인 특이적으로 발현하는 SNP의 검출 여부를 확인하는 단계를 포함하는 소음인 예측 또는 진단을 위한 정보의 제공방법을 제공하는 것이다.Another object of the present invention is to provide a method of providing information for predicting or diagnosing a noise person comprising the step of confirming whether or not a SNP specifically expressed as a noise person is detected using a nucleotide sequence obtained from a sample collected from a subject. To provide.

본 발명의 또 다른 하나의 목적은 피험자의 상기 소음인 진단용 조성물을 포함하는 소음인 진단용 키트를 제공하는 것이다.Another object of the present invention is to provide a kit for diagnosis of noise, comprising a composition for diagnosis of the subject's noise.

본 발명의 또 다른 하나의 목적은 피험자의 상기 소음인 진단용 조성물을 포함하는 소음인 진단용 마이크로 어레이를 제공하는 것이다.Another object of the present invention is to provide a microarray for diagnosis of noise comprising a composition for diagnosis of the noise of a subject.

본 발명의 또 다른 하나의 목적은 상기 소음인 진단용 SNP 마커를 선별하는 방법을 제공하는 것이다.Another object of the present invention is to provide a method for selecting the SNP marker for diagnosis of the noise.

이를 구체적으로 설명하면 다음과 같다. 한편, 본 발명에서 개시된 각각의 설명 및 실시형태는 각각의 다른 설명 및 실시형태에도 적용될 수 있다. 즉, 본 발명에서 개시된 다양한 요소들의 모든 조합이 본 발명의 범주에 속한다. 또한, 하기 기술된 구체적인 서술에 의하여 본 발명의 범주가 제한된다고 볼 수 없다.This will be described in detail as follows. On the other hand, each description and embodiment disclosed in the present invention can be applied to each other description and embodiment. That is, all combinations of various elements disclosed in the present invention belong to the scope of the present invention. In addition, it cannot be seen that the scope of the present invention is limited by the specific description described below.

또한, 당해 기술분야의 통상의 지식을 가진 자는 통상의 실험만을 사용하여 본 발명에 기재된 본 발명의 특정 양태에 대한 다수의 등가물을 인지하거나 확인할 수 있다. 또한, 이러한 등가물은 본 발명에 포함되는 것으로 의도된다.In addition, those of ordinary skill in the art can recognize or ascertain using only routine experimentation a number of equivalents to the specific aspects of the invention described herein. Also, such equivalents are intended to be included in the present invention.

상기 목적을 달성하기 위하여, 본 발명의 하나의 양태는 사상체질 중 소음인 진단용 SNP (single nucleotide polymorphism) 마커를 제공한다.In order to achieve the above object, one aspect of the present invention provides a diagnostic SNP (single nucleotide polymorphism) marker for noise in sasang constitution.

상기 마커는 구체적으로 서열번호 1 내지 15로 구성된 군에서 선택된 하나 이상의 폴리뉴클레오티드들에 있어서, SNP 위치인 각 서열의 16번째 염기를 포함하고, 5 내지 100개의 연속적인 염기로 구성되는 폴리뉴클레오티드로부터 선택된 하나 이상의 폴리뉴클레오티드, 또는 이의 상보적인 폴리뉴클레오티드를 포함하는, 소음인 진단용 마커일 수 있다.The marker is specifically selected from polynucleotides consisting of 5 to 100 consecutive bases, including the 16th base of each sequence, which is the SNP position, in one or more polynucleotides selected from the group consisting of SEQ ID NOs: 1 to 15 It may be a diagnostic marker for noise, including one or more polynucleotides, or complementary polynucleotides thereof.

본 발명에서 사용되는 용어, "다형성(polymorphism)"이란 하나의 유전자 좌위(locus)에 두 가지 이상의 대립 유전자(allele)가 존재하는 경우를 말하며 다형성 부위 중에서, 사람에 따라 단일 염기만이 다른 것을 단일 염기 다형성(single nucleotide polymorphism, SNP)이라 한다. 구체적인 다형성 마커는 선택된 집단에서 1% 이상, 더욱 구체적으로 5% 또는 10% 이상의 발생 빈도를 나타내는 두 가지 이상의 대립 유전자를 가진다.The term "polymorphism" used in the present invention refers to a case in which two or more alleles exist in one locus. Among polymorphic sites, only a single base differs from one person to another. It is called single nucleotide polymorphism (SNP). Specific polymorphic markers have two or more alleles that exhibit a frequency of occurrence of at least 1%, more specifically at least 5% or 10% in the selected population.

본 발명에서 사용되는 용어, "대립 유전자"는 상동 염색체의 동일한 유전자 좌위에 존재하는 한 유전자의 여러 타입을 말한다. 대립 유전자는 다형성을 나타내는데 사용되기도 하며, 예컨대, SNP는 두 종류의 대립 인자(biallele)를 갖는다.As used herein, the term "allele" refers to several types of one gene present at the same locus of a homologous chromosome. Alleles are also used to indicate polymorphism, for example, SNPs have two types of alleles.

본 발명에서 사용되는 용어, "rs_id"란 1998년부터 SNP 정보를 축적하기 시작한 NCBI가 초기에 등록되는 모든 SNP에 대하여 부여한 독립된 표지자인 rs-ID를 의미한다. 본 발명에서는 rs12667492와 같은 형태로 기재하였다. 이와 같은 표에 기재된 rs_id는 본 발명의 다형성 마커인 SNP 마커를 의미한다. 당업자라면 상기 rs_id를 이용하여 SNP의 위치 및 서열을 용이하게 확인할 수 있을 것이다. NCBI의 dbSNP (The Single Nucleotide Polymorphism Database) 번호인 rs_id에 해당하는 구체적인 서열은 시간이 지남에 따라 약간 변경될 수 있다. 본 발명의 범위가 상기 변경된 서열에도 미치는 것은 당업자에게 자명할 것이다.The term "rs_id" used in the present invention refers to rs-ID, which is an independent marker assigned to all SNPs initially registered by NCBI, which started accumulating SNP information from 1998. In the present invention, it is described in the same form as rs12667492. The rs_id described in this table means the SNP marker, which is a polymorphic marker of the present invention. Those skilled in the art will be able to easily identify the location and sequence of the SNP using the rs_id. The specific sequence corresponding to the NCBI's dbSNP (The Single Nucleotide Polymorphism Database) number, rs_id, may change slightly over time. It will be apparent to those skilled in the art that the scope of the present invention also extends to the altered sequence.

상기 "서열번호 1 내지 15"는 다형성 부위를 포함하는 다형성 서열(polymorphic sequence)이다. 다형성 서열이란 폴리뉴클레오티드 서열 중에 SNP를 포함하는 다형성 부위(polymorphic site)를 포함하는 서열을 의미한다. 상기 폴리뉴클레오티드 서열은 DNA 또는 RNA가 될 수 있다.The “SEQ ID NOs: 1 to 15” are polymorphic sequences including polymorphic sites. The polymorphic sequence refers to a sequence including a polymorphic site including SNP in a polynucleotide sequence. The polynucleotide sequence may be DNA or RNA.

본 발명의 SNP 마커는 표 3에 표시된 마커로서, 개체의 SNP 마커의 다형성 부위가 효과 대립 유전자(effect allele)로 표시된 대립 유전자일 경우, 개체의 사상체질을 소음으로 판단할 수 있는 것이다.The SNP marker of the present invention is a marker shown in Table 3, and when the polymorphic site of the SNP marker of the individual is an allele indicated as an effect allele, the sasang constitution of the individual can be determined as noise.

본 발명에서 사용되는 용어, "사상체질"이란 사상의학에 의해 사람의 체질을 태양인, 소양인, 태음인 및 소음인의 네 가지로 분류한 것을 의미하는 것으로 체질은 선천적이며 일생에 걸쳐 변하지 않아 생물학적으로 유전적인 특성을 가진 것으로 볼 수 있다. 체질별로 유전적 차이가 있어서, 사상체질 특이적 마커를 제공할 수 있다면, 보다 정확한 사상체질의 진단이 이루어 질 수 있을 것으로 기대되고 있어 본 발명자들에 의해 최초로 사상체질별로 특이적인 사상체질 진단용 SNP 마커를 제공한 것이다. 본 발명의 목적상, 한국인에 있어서 그 빈도가 매우 낮은 태양인을 제외하고 태음인, 소양인 및 소음인, 구체적으로 소음인에 특이적인 SNP 마커를 제공한다.The term "sasang constitution" as used in the present invention means that a person's constitution is classified into four categories: Taeyangin, Soyangin, Taeeumin, and Soeuminin by Sasang Medicine. The constitution is congenital and does not change over a lifetime, so it is biologically genetic. It can be seen as having characteristics. Since there is a genetic difference for each constitution, if it is possible to provide a specific marker for the sasang constitution, it is expected that a more accurate diagnosis of the sasang constitution can be made. Was provided. For the purposes of the present invention, it provides a SNP marker specific to Taeeumin, Soyangin and Soeumin, specifically Soeumin, except for Taeyangin, whose frequency is very low in Koreans.

본 발명에서 사용되는 용어, "진단"은 사상체질을 결정하거나 판별하는데 사용되는 모든 유형의 분석을 포함한다.The term "diagnosis" as used in the present invention includes all types of analysis used to determine or discriminate sasang constitution.

구체적으로 상기 SNP 마커는 소음인 진단용 SNP 마커로서, 상기 서열번호 1 내지 15로 구성된 폴리뉴클레오티드들 중, 서열번호 1로 기재되는 rs12667492에 있어서, 16번째 염기가 A 또는 G인 16번째 염기를 포함하는 5 내지 100개의 연속적인 염기로 구성되는 폴리뉴클레오티드, 또는 이의 상보적인 폴리뉴클레오티드; Specifically, the SNP marker is a diagnostic SNP marker for noise in, among the polynucleotides consisting of SEQ ID NOs: 1 to 15, in rs12667492 described in SEQ ID NO: 1, the 16th base is A or G, including the 16th base 5 A polynucleotide consisting of from to 100 consecutive bases, or a complementary polynucleotide thereof;

서열번호 2로 기재되는 rs74028907에 있어서, 16번째 염기가 T 또는 G인 16번째 염기를 포함하는 5 내지 100개의 연속적인 염기로 구성되는 폴리뉴클레오티드, 또는 이의 상보적인 폴리뉴클레오티드; In rs74028907 shown in SEQ ID NO: 2, a polynucleotide consisting of 5 to 100 consecutive bases including a 16th base in which the 16th base is T or G, or a complementary polynucleotide thereof;

서열번호 3으로 기재되는 rs4909803에 있어서, 16번째 염기가 G 또는 T인 16번째 염기를 포함하는 5 내지 100개의 연속적인 염기로 구성되는 폴리뉴클레오티드, 또는 이의 상보적인 폴리뉴클레오티드; In rs4909803 shown in SEQ ID NO: 3, a polynucleotide consisting of 5 to 100 consecutive bases including a 16th base in which the 16th base is G or T, or a complementary polynucleotide thereof;

서열번호 4로 기재되는 rs11646443에 있어서, 16번째 염기가 A 또는 G인 16번째 염기를 포함하는 5 내지 100개의 연속적인 염기로 구성되는 폴리뉴클레오티드, 또는 이의 상보적인 폴리뉴클레오티드; A polynucleotide consisting of 5 to 100 consecutive bases including a 16th base in which the 16th base is A or G, or a complementary polynucleotide thereof according to rs11646443 shown in SEQ ID NO: 4;

서열번호 5로 기재되는 rs75575584에 있어서, 16번째 염기가 A 또는 G인 16번째 염기를 포함하는 5 내지 100개의 연속적인 염기로 구성되는 폴리뉴클레오티드, 또는 이의 상보적인 폴리뉴클레오티드; In rs75575584 shown in SEQ ID NO: 5, a polynucleotide consisting of 5 to 100 consecutive bases including a 16th base in which the 16th base is A or G, or a complementary polynucleotide thereof;

서열번호 6으로 기재되는 rs17669146에 있어서, 16번째 염기가 C 또는 T인 16번째 염기를 포함하는 5 내지 100개의 연속적인 염기로 구성되는 폴리뉴클레오티드, 또는 이의 상보적인 폴리뉴클레오티드; In rs17669146 shown in SEQ ID NO: 6, a polynucleotide consisting of 5 to 100 consecutive bases including a 16th base in which the 16th base is C or T, or a complementary polynucleotide thereof;

서열번호 7로 기재되는 rs4688636에 있어서, 16번째 염기가 G 또는 A인 16번째 염기를 포함하는 5 내지 100개의 연속적인 염기로 구성되는 폴리뉴클레오티드, 또는 이의 상보적인 폴리뉴클레오티드; A polynucleotide consisting of 5 to 100 consecutive bases including a 16th base in which the 16th base is G or A, or a complementary polynucleotide thereof according to rs4688636 shown in SEQ ID NO: 7;

서열번호 8로 기재되는 rs1017409에 있어서, 16번째 염기가 C 또는 T인 16번째 염기를 포함하는 5 내지 100개의 연속적인 염기로 구성되는 폴리뉴클레오티드, 또는 이의 상보적인 폴리뉴클레오티드;In rs1017409 shown in SEQ ID NO: 8, a polynucleotide consisting of 5 to 100 consecutive bases including a 16th base in which the 16th base is C or T, or a complementary polynucleotide thereof;

서열번호 9로 기재되는 rs981760에 있어서, 16번째 염기가 T 또는 A인 16번째 염기를 포함하는 5 내지 100개의 연속적인 염기로 구성되는 폴리뉴클레오티드, 또는 이의 상보적인 폴리뉴클레오티드;In rs981760 shown in SEQ ID NO: 9, a polynucleotide consisting of 5 to 100 consecutive bases including a 16th base in which the 16th base is T or A, or a complementary polynucleotide thereof;

서열번호 10으로 기재되는 rs1671413에 있어서, 16번째 염기가 T 또는 C인 16번째 염기를 포함하는 5 내지 100개의 연속적인 염기로 구성되는 폴리뉴클레오티드, 또는 이의 상보적인 폴리뉴클레오티드;In rs1671413 shown in SEQ ID NO: 10, a polynucleotide consisting of 5 to 100 consecutive bases including a 16th base in which the 16th base is T or C, or a complementary polynucleotide thereof;

서열번호 11로 기재되는 rs7013390에 있어서, 16번째 염기가 A 또는 G인 16번째 염기를 포함하는 5 내지 100개의 연속적인 염기로 구성되는 폴리뉴클레오티드, 또는 이의 상보적인 폴리뉴클레오티드;In rs7013390 shown in SEQ ID NO: 11, a polynucleotide consisting of 5 to 100 consecutive bases including a 16th base in which the 16th base is A or G, or a complementary polynucleotide thereof;

서열번호 12로 기재되는 rs10955446에 있어서, 16번째 염기가 C 또는 T인 16번째 염기를 포함하는 5 내지 100개의 연속적인 염기로 구성되는 폴리뉴클레오티드, 또는 이의 상보적인 폴리뉴클레오티드;In rs10955446 shown in SEQ ID NO: 12, a polynucleotide consisting of 5 to 100 consecutive bases including a 16th base in which the 16th base is C or T, or a complementary polynucleotide thereof;

서열번호 13으로 기재되는 rs7015202에 있어서, 16번째 염기가 A 또는 C인 16번째 염기를 포함하는 5 내지 100개의 연속적인 염기로 구성되는 폴리뉴클레오티드, 또는 이의 상보적인 폴리뉴클레오티드;In rs7015202 shown in SEQ ID NO: 13, a polynucleotide consisting of 5 to 100 consecutive bases including a 16th base in which the 16th base is A or C, or a complementary polynucleotide thereof;

서열번호 14로 기재되는 rs1115971에 있어서, 16번째 염기가 C 또는 T인 16번째 염기를 포함하는 5 내지 100개의 연속적인 염기로 구성되는 폴리뉴클레오티드, 또는 이의 상보적인 폴리뉴클레오티드;In rs1115971 shown in SEQ ID NO: 14, a polynucleotide consisting of 5 to 100 consecutive bases including a 16th base in which the 16th base is C or T, or a complementary polynucleotide thereof;

서열번호 15로 기재되는 rs7138393에 있어서, 16번째 염기가 G 또는 C인 16번째 염기를 포함하는 5 내지 100개의 연속적인 염기로 구성되는 폴리뉴클레오티드, 또는 이의 상보적인 폴리뉴클레오티드로 이루어진 군에서 선택된 하나 이상의 폴리뉴클레오티드일 수 있다.In rs7138393 described in SEQ ID NO: 15, at least one selected from the group consisting of 5 to 100 consecutive bases including the 16th base in which the 16th base is G or C, or complementary polynucleotides thereof It can be a polynucleotide.

보다 구체적으로는 2개 이상, 보다 더 구체적으로는 3개 내지 14개 이상, 가장 구체적으로는 15개 모두의 SNP 마커 세트일 수 있다. 상기 마커는 각 SNP 마커의 16번째 염기가 표 3의 효과 대립 유전자 염기인 경우에는 소음인으로 진단할 수 있고, 개체의 SNP 마커의 수가 많을수록 정확도가 높아질 수 있다.More specifically, it may be a set of 2 or more, more specifically 3 to 14 or more, and most specifically all 15 SNP markers. When the 16th base of each SNP marker is an effect allele base of Table 3, the marker can be diagnosed as Soeumin, and the accuracy may increase as the number of SNP markers of an individual increases.

본 발명자들은 상기 SNP가 소음인 진단용 마커임을 다음과 같은 입증을 통하여 확인하였다. 구체적으로 단축형 사상체질 진단도구(KS-15)를 통해 설문(Sasang Constitut Med, 2015, 27(2): 211-221)을 기반으로 체질을 확정한 사람을 대상으로 체질과 연관성 있는 SNP들을 확보하였다.The present inventors confirmed through the following verification that the SNP is a diagnostic marker for noise. Specifically, SNPs related to constitution were obtained for those who confirmed constitution based on a questionnaire (Sasang Constitut Med, 2015, 27(2): 211-221) through the shortened Sasang Constitution Diagnosis Tool (KS-15). .

체질정보은행의 자료에서 체형과 성격, 병증의 중요항목을 선별하여 체질을 분류하는 단축형 사상체질 진단설문지(KS-15)를 개발하였고, KS-15 설문지를 통해 얻어진 체질 특성의 가중치를 부여한 점수를 기반으로 유전체 연관 분석을 실시하였다. 상기 KS-15 설문은 체형 요소, 성격, 및 증상요소를 포함한다. 즉 키와 몸무게를 통해 생성되는 체질량 지수(Body Mass Index; BMI)를 상기 체형요소 문항으로, 체질정보은행에서 수집한 성격문항과 소증 문항으로부터 상기 성격 및 증상요소 문항으로 선정하였다. 이후, 최종적으로 사상체질전문가 논의를 통해 14문항을 선정하였다. The shortened Sasang Constitutional Diagnosis Questionnaire (KS-15) was developed to classify the constitution by selecting important items of body type, personality, and condition from the data of the Constitutional Information Bank, and the score given the weight of constitution characteristics obtained through the KS-15 questionnaire Based on this, genome-related analysis was conducted. The KS-15 questionnaire includes body shape factor, personality, and symptom factor. That is, the Body Mass Index (BMI), which is generated through height and weight, was selected as the body shape factor question, and the personality and symptom factor questions from the personality questions and subtest questions collected by the Constitutional Information Bank. Afterwards, 14 questions were finally selected through a discussion of Sasang constitutional experts.

각 문항별로 체질 가중치가 부여되고 가중치의 합이 각 체질별 점수 3개로 도출되며 BMI 1개를 독립변수로 하고 체질을 종속변수로 하는 다항 로지스틱 회귀(Multinomial Logistic Regression)을 실시하였다. 분석을 통해 얻어진 회귀계수를 이용하여 체질별 예측확률을 구하였다. 상기 체질별 예측확률은 체질별 설문항의 점수에 해당하는 가중치의 합을 말한다(Sasang Constitut Med, 2015, 27(2): 211-221).Constitution weights are given for each question, and the sum of the weights is derived as three points for each constitution, and Multinomial Logistic Regression was conducted with one BMI as an independent variable and constitution as a dependent variable. Using the regression coefficient obtained through the analysis, the prediction probability for each constitution was calculated. The predicted probability for each constitution refers to the sum of weights corresponding to the score of the questionnaire for each constitution (Sasang Constitut Med, 2015, 27(2): 211-221).

상기 체질별 예측 확률을 유전형과의 연관 분석으로 유전 후보 물질을 선정하는데 기초자료로 사용하였다.The predicted probability for each constitution was used as basic data to select a genetic candidate by analyzing the association with the genotype.

즉, KS-15의 점수의 수치값과 유전형의 선형관계성을 도출하여 연관 유전 후보 물질을 선정하였고, 후보물질들의 위치 정보를 이용해 SNP의 생물학적 기능 점수를 부여하여 최종 선정에 사용하였다.In other words, the genetic candidate material was selected by deriving the linear relationship between the numerical value of the score of KS-15 and the genotype, and the biological function score of SNP was assigned using the location information of the candidate materials and used for final selection.

유전 후보 마커 SNP들은 각각 유전체 특정 부위에 위치하게 되는데, 이 경우 유전자 이외에도 유전자를 조절하는 다양한 단백질이 붙거나 후성 유전 물질이 붙게 된다. 이에, 유전자를 조절하는 기능 정보들을 모두 수집하여 기능 점수를 부여하는 절차를 추가하였다. 상기 기능 점수의 부여는 선정된 구간(block) 내에서 SNP 기능 정보 패턴을 추출하는 것으로, 학습 사례/컨트롤(case/control) 데이터 셋들을 이용하여 학습하면서 정답셋과 유사할 수록 높은 점수를 부여하여 합산하였다. 전체 점수의 합산 결과들을 분포 변환을 통해 0 내지 1의 점수로 환산하였다. 1에 가까워질수록 패턴이 정답셋들과 유사하다고 판단하였으며, 본 발명에서 선정한 마커들은 점수가 1인 것들부터 순위를 매겨 선정한 것이다.Each of the genetic candidate marker SNPs is located in a specific region of the genome. In this case, in addition to the gene, various proteins that control the gene or epigenetic material are attached. Accordingly, a procedure for assigning a function score by collecting all of the function information that controls genes was added. The assignment of the function score is to extract the SNP function information pattern within a selected block, while learning using the learning case/control data sets, the higher the score is given as the correct answer set is similar. Summed up. The summation results of all scores were converted into scores of 0 to 1 through distribution transformation. It was determined that the pattern was similar to those of the correct answer sets as it approached 1, and the markers selected in the present invention were selected by ranking from those with a score of 1.

각 SNP는 하나의 위치 정보를 가지고 있기 때문에 기능 점수를 부여하기 위해 주변 1M 영역으로 확장하여 함께 관여할 수 있는 SNP들을 모아 패턴을 분석하였다. 학습시키는 SNP의 구간(block)이 이미 체질 연관 후보 SNP들이기 때문에 이들의 패턴은 체질에 관여되는 기능 정보들임을 알 수 있다. 상기 패턴을 도출하는 과정은 하기와 같다.Since each SNP has one location information, the pattern was analyzed by collecting SNPs that can be involved together by expanding to the surrounding 1M area to give a function score. Since the block of the SNP to be trained is already constitution-related candidate SNPs, it can be seen that their patterns are functional information related to constitution. The process of deriving the pattern is as follows.

p-value를 기반으로 연관성이 가장 강한 SNP를 lead(단서) SNP로 선출하고, 하나의 lead SNP와 그 주변 1Mb 이내의 후보 SNP들 중에 p-value가 5 x 10^-4 이하인 SNP들 중에서 연관성이 높은 순으로 최대 30개의 SNP를 선정하여 한 구간(block)으로 설정하였다. 훈련, 검증, 테스팅 세트로 나누고 SNP의 기능 영역을 맵핑하였다. 딥러닝 알고리즘을 적용하여 SNP의 기능을 점수화하여 체질별 후보 SNP를 선정하였다. 체질확진자를 통한 검증절차를 거쳐 최종적으로 소음인 특이적 사상체질 진단용 SNP를 선정하였다. Based on the p-value, the SNP with the strongest correlation is selected as the lead (clue) SNP, and among the SNPs with a p-value of 5 x 10 ^-4 or less among candidate SNPs within 1 Mb of one lead SNP and its surroundings Up to 30 SNPs were selected in the highest order and set as one block. It was divided into training, validation, and testing sets, and the functional domains of SNPs were mapped. By applying a deep learning algorithm, the function of SNP was scored to select candidate SNPs for each constitution. After a verification procedure through a person diagnosed with constitution, a SNP for diagnosis of specific Sasang constitution was finally selected.

본 발명의 다른 하나의 양태는, 소음인 진단용 SNP 마커를 검출할 수 있는 프로브 또는 증폭할 수 있는 제제를 포함하는, 소음인 진단용 조성물을 제공한다.Another aspect of the present invention provides a diagnostic composition for noise-in, comprising a probe capable of detecting a SNP marker for diagnosis of noise-in or an agent capable of amplifying it.

본 발명에서 사용되는 용어, "소음인 진단용 SNP 마커를 검출할 수 있는 프로브"는 상기와 같은 유전자의 다형성 부위와 특이적으로 혼성화 반응을 통해 확인하여 사상체질 중 소음인을 진단할 수 있는 조성물을 의미하며, 이와 같은 유전자 분석의 구체적 방법은 특별한 제한이 없으며, 이 발명이 속하는 기술분야에 알려진 모든 유전자 검출 방법에 의하는 것일 수 있다.The term used in the present invention, "a probe capable of detecting a noise-in diagnostic SNP marker" refers to a composition capable of diagnosing so-eum-in among sasang constitutions by specifically identifying the polymorphic site of the above-described gene through a hybridization reaction, and , There is no particular limitation on a specific method of such gene analysis, and may be performed by any method for detecting genes known in the art to which this invention belongs.

본 발명에서 사용되는 용어, "소음인 진단용 SNP 마커를 증폭할 수 있는 프라이머"란 상기와 같은 유전자의 다형성 부위를 증폭을 통해 확인하여 소음인을 진단할 수 있는 조성물을 의미하며, 구체적으로 상기 소음인 진단용 마커의 폴리뉴클레오티드를 특이적으로 증폭할 수 있는 프라이머를 의미한다.As used herein, the term "primer capable of amplifying the SNP marker for noise-in diagnosis" refers to a composition capable of diagnosing so-eum-in by confirming the polymorphic site of the above gene through amplification, and specifically, the diagnostic marker for noise-in It refers to a primer that can specifically amplify the polynucleotide of.

상기 다형성 마커 증폭에 사용되는 프라이머는, 적절한 버퍼 중의 적절한 조건(예를 들면, 4개의 다른 뉴클레오시드 트리포스페이트 및 DNA, RNA 폴리머라제 또는 역전사 효소와 같은 중합제) 및 적당한 온도 하에서 주형-지시 DNA 합성의 시작점으로서 작용할 수 있는 단일가닥 올리고뉴클레오티드를 말한다. 상기 프라이머의 적절한 길이는 사용 목적에 따라 달라질 수 있으나, 통상 15 내지 30 뉴클레오티드이다. 짧은 프라이머 분자는 일반적으로 주형과 안정한 혼성체를 형성하기 위해서는 더 낮은 온도를 필요로 한다. 프라이머 서열은 주형과 완전하게 상보적일 필요는 없으나, 주형과 혼성화 할 정도로 충분히 상보적이어야 한다.The primers used for the amplification of the polymorphic markers are template-directed DNA under appropriate conditions (e.g., 4 different nucleoside triphosphates and a polymerizing agent such as DNA, RNA polymerase or reverse transcriptase) in an appropriate buffer. It refers to a single-stranded oligonucleotide that can serve as a starting point for synthesis. The appropriate length of the primer may vary depending on the intended use, but is usually 15 to 30 nucleotides. Short primer molecules generally require a lower temperature to form a stable hybrid with the template. The primer sequence need not be completely complementary to the template, but must be sufficiently complementary to hybridize to the template.

본 발명에서 사용되는 용어, "프라이머"는 짧은 자유 3' 말단 수산화기(free 3' hydroxyl group)를 가지는 염기 서열로 상보적인 템플레이트(template)와 염기쌍(base pair)을 형성할 수 있고 주형 가닥 복사를 위한 시작 지점으로 기능을 하는 짧은 서열을 의미한다. 프라이머는 적절한 완충용액 및 온도에서 중합반응(즉, DNA 폴리머라아제 또는 역전사 효소)을 위한 시약 및 상이한 4가지 뉴클레오사이드 트리포스페이트의 존재하에서 DNA 합성을 개시할 수 있다. PCR 증폭을 실시하여 원하는 생성물의 생성 여부를 통해 사상체질 특이적 대사증후군을 예측할 수 있다. PCR 조건, 센스 및 안티센스 프라이머의 길이는 당업계에 공지된 것을 기초로 변형할 수 있다. The term "primer" used in the present invention is a base sequence having a short free 3'hydroxyl group, and can form a base pair with a complementary template, and template strand copying is performed. It refers to a short sequence that functions as a starting point for. Primers can initiate DNA synthesis in the presence of reagents for polymerization (ie, DNA polymerase or reverse transcriptase) and four different nucleoside triphosphates at appropriate buffers and temperatures. By performing PCR amplification, it is possible to predict the specific metabolic syndrome of sasang constitution through whether or not the desired product is produced. The PCR conditions, the length of the sense and antisense primers can be modified based on those known in the art.

본 발명의 프로브 또는 프라이머는 포스포르아미다이트 고체 지지체 방법, 또는 기타 널리 공지된 방법을 사용하여 화학적으로 합성할 수 있다. 이러한 핵산 서열은 또한 당해 분야에 공지된 많은 수단을 이용하여 변형시킬 수 있다. 이러한 변형의 비-제한적인 예로는 메틸화, 캡화, 천연 뉴클레오타이드 하나 이상의 동족체로의 치환, 및 뉴클레오타이드 간의 변형, 예를 들면, 하전되지 않은 연결체(예: 메틸 포스포네이트, 포스포트리에스테르, 포스포로아미데이트, 카바메이트 등) 또는 하전된 연결체(예: 포스포로티오에이트, 포스포로디티오에이트 등)로의 변형이 있다.The probe or primer of the present invention can be chemically synthesized using a phosphoramidite solid support method, or other well known methods. Such nucleic acid sequences can also be modified using a number of means known in the art. Non-limiting examples of such modifications include methylation, encapsulation, substitution of one or more homologs of natural nucleotides, and modifications between nucleotides, e.g., uncharged linkers (e.g., methyl phosphonate, phosphorester, phosphoro Amidates, carbamates, etc.) or to charged linkers (eg phosphorothioate, phosphorodithioate, etc.).

본 발명의 또 다른 하나의 양태는, (a) 개체로부터 분리된 시료의 DNA로부터 서열번호 1 내지 12로 구성된 군에서 선택된 하나 이상의 폴리뉴클레오티드, 또는 이의 상보적인 폴리뉴클레오티드의 SNP를 포함하는 다형성 부위를 증폭하거나 프로브와 혼성화하는 단계; (b) 상기 (a) 단계의 증폭된 또는 혼성화된 다형성 부위의 염기를 결정하는 단계를 포함하는 소음인 예측 또는 진단을 위한 정보의 제공방법을 제공한다.In another aspect of the present invention, (a) a polymorphic site comprising a SNP of one or more polynucleotides selected from the group consisting of SEQ ID NOs: 1 to 12, or a complementary polynucleotide thereof from DNA of a sample isolated from an individual Amplifying or hybridizing with the probe; (b) It provides a method of providing information for prediction or diagnosis of noise comprising the step of determining the base of the amplified or hybridized polymorphic site in step (a).

본 발명의 용어 "개체"란 사상체질을 예측 또는 진단하기 위한 피험자일 수 있다. 상기 개체에서 머리카락, 뇨, 혈액, 각종 체액, 분리된 조직, 분리된 세포 또는 타액과 같은 시료 등으로부터 DNA를 수득할 수 있으나, 이에 제한되는 것은 아니다.The term "individual" in the present invention may be a subject for predicting or diagnosing sasang constitution. DNA may be obtained from samples such as hair, urine, blood, various body fluids, separated tissues, separated cells, or saliva from the individual, but is not limited thereto.

상기 (a) 단계의 DNA로부터 상기 SNP 마커의 다형성 부위를 증폭하거나 프로브와 혼성화하는 단계는 당업자에게 알려진 어떠한 방법이든 사용 가능하다. 예를 들면, 표적 핵산을 PCR을 통하여 증폭하고 이를 정제하여 얻을 수 있다. 그 외 리가제 연쇄 반응(LCR)(Wu 및 Wallace, Genomics 4, 560(1989), Landegren 등, Science 241, 1077(1988)), 전사증폭(transcription amplification)(Kwoh 등, Proc. Natl.Acad. Sci. USA 86, 1173(1989)) 및 자가유지 서열 복제(Guatelli 등, Proc. Natl. Acad. Sci. USA 87, 1874(1990)) 및 핵산에 근거한 서열 증폭(NASBA)이 사용될 수 있다.The step of amplifying the polymorphic site of the SNP marker from the DNA of step (a) or hybridizing with a probe may be performed by any method known to those skilled in the art. For example, it can be obtained by amplifying a target nucleic acid through PCR and purifying it. Other ligase chain reaction (LCR) (Wu and Wallace, Genomics 4, 560 (1989), Landegren et al., Science 241, 1077 (1988)), transcription amplification (Kwoh et al., Proc. Natl. Acad. Sci. USA 86, 1173 (1989)) and self-maintaining sequence replication (Guatelli et al., Proc. Natl. Acad. Sci. USA 87, 1874 (1990)) and nucleic acid-based sequence amplification (NASBA) can be used.

상기 방법 중 (b) 단계의 다형성 부위의 염기를 결정하는 것은 서열 분석, 마이크로어레이(microarray)에 의한 혼성화, 대립유전자 특이적인 PCR(allele specific PCR), 다이나믹 대립유전자 혼성화 기법(dynamic allele-specific hybridization, DASH), PCR 연장 분석, PCR-SSCP, PCR-RFLP 분석 또는 TaqMan 기법, SNPlex 플랫폼(Applied Biosystems), 질량 분석법(예를 들면, Sequenom의 MassARRAY 시스템), 미니-시퀀싱(mini-sequencing)방법, Bio-Plex 시스템(BioRad), CEQ and SNPstream 시스템(Beckman), Molecular Inversion Probe 어레이 기술(예를 들면, Affymetrix GeneChip), 및 BeadArray Technologies(예를 들면, Illumina GoldenGate 및 Infinium 분석법)를 포함하나, 이에 한정되지 않는다. 상기 방법들 또는 본 발명이 속하는 기술분야의 당업자에게 이용가능한 다른 방법에 의해, 마이크로새틀라이트, SNP 또는 다른 종류의 다형성 마커를 포함한, 다형성 마커에서의 하나 이상의 대립유전자가 확인될 수 있다. 이와 같은 다형성 부위의 염기를 결정하는 것은 구체적으로는 SNP 칩을 통해 수행할 수 있다.Of the above methods, determining the base of the polymorphic site in step (b) is sequence analysis, hybridization by microarray, allele specific PCR, and dynamic allele-specific hybridization. , DASH), PCR extension analysis, PCR-SSCP, PCR-RFLP analysis or TaqMan technique, SNPlex platform (Applied Biosystems), mass spectrometry (e.g., Sequenom's MassARRAY system), mini-sequencing method, Bio-Plex system (BioRad), CEQ and SNPstream system (Beckman), Molecular Inversion Probe array technology (e.g., Affymetrix GeneChip), and BeadArray Technologies (e.g., Illumina GoldenGate and Infinium assays). It doesn't work. One or more alleles in polymorphic markers, including microsatellites, SNPs or other types of polymorphic markers, can be identified by the above methods or other methods available to those skilled in the art to which the present invention pertains. Determining the base of such a polymorphic site can be specifically performed through an SNP chip.

본 발명에서 용어, "SNP 칩"은 수십만 개의 SNP의 각 염기를 한번에 확인할 수 있는 DNA 마이크로어레이의 하나를 의미한다.In the present invention, the term "SNP chip" means one of the DNA microarrays capable of identifying each base of hundreds of thousands of SNPs at once.

TaqMan 방법은 (1) 원하는 DNA 단편을 증폭할 수 있도록 프라이머 및 TaqMan 탐침을 설계 및 제작하는 단계; (2) 서로 다른 대립유전자의 탐침을 FAM 염료 및 VIC 염료로 표지(Applied Biosystems)하는 단계; (3) 상기 DNA를 주형으로 하고, 상기의 프라이머 및 탐침을 이용하여 PCR을 수행하는 단계; (4) 상기의 PCR 반응이 완성된 후, TaqMan 분석 플레이트를 핵산 분석기로 분석 및 확인하는 단계; 및 (5) 상기 분석결과로부터 단계 (1)의 폴리뉴클레오티들의 유전자형을 결정하는 단계를 포함한다.The TaqMan method includes (1) designing and producing primers and TaqMan probes to amplify a desired DNA fragment; (2) labeling probes of different alleles with FAM dye and VIC dye (Applied Biosystems); (3) using the DNA as a template and performing PCR using the primers and probes; (4) after the PCR reaction is completed, analyzing and confirming the TaqMan assay plate with a nucleic acid analyzer; And (5) determining the genotype of the polynucleotides of step (1) from the analysis result.

상기에서, 시퀀싱 분석은 염기서열 결정을 위한 통상적인 방법을 사용할 수 있으며, 자동화된 유전자분석기를 이용하여 수행될 수 있다. 또한, 대립유전자 특이적 PCR은 SNP가 위치하는 염기를 3' 말단으로 하여 고안한 프라이머를 포함한 프라이머 세트로 상기 SNP가 위치하는 DNA 단편을 증폭하는 PCR 방법을 의미한다. 상기 방법의 원리는, 예를 들어, 특정 염기가 A에서 G로 치환된 경우, 상기 A를 3' 말단염기로 포함하는 프라이머 및 적당한 크기의 DNA 단편을 증폭할 수 있는 반대 방향 프라이머를 고안하여 PCR 반응을 수행할 경우, 상기 SNP 위치의 염기가 A인 경우에는 증폭반응이 정상적으로 수행되어 원하는 위치의 밴드가 관찰되고, 상기 염기가 G로 치환된 경우에는 프라이머는 주형 DNA에 상보결합 할 수 있으나, 3' 말단 쪽이 상보결합을 하지 못함으로써 증폭 반응이 제대로 수행되지 않는 점을 이용한 것이다. DASH는 통상적인 방법으로 수행될 수 있고, 구체적으로는 프린스 등에 의한 방법에 의하여 수행될 수 있다.In the above, the sequencing analysis may be performed using a conventional method for determining the base sequence, and may be performed using an automated genetic analyzer. In addition, allele-specific PCR refers to a PCR method for amplifying a DNA fragment in which the SNP is located with a primer set including a primer designed with the base at the 3'end of the SNP. The principle of the method is, for example, when a specific base is substituted from A to G, a primer containing the A as a 3'end base and a reverse primer capable of amplifying a DNA fragment of an appropriate size are designed and PCR In the case of performing the reaction, when the base at the SNP position is A, the amplification reaction is normally performed and the band at the desired position is observed, and when the base is substituted with G, the primer can complementarily bind to the template DNA. This is due to the fact that the amplification reaction is not performed properly because the 3'end cannot perform complementary bonding. DASH may be performed by a conventional method, and specifically, may be performed by a method by a prince or the like.

한편, PCR 연장 분석은 먼저 단일염기 다형성이 위치하는 염기를 포함하는 DNA 단편을 프라이머 쌍으로 증폭을 한 다음, 반응에 첨가된 모든 뉴클레오티드를 탈인산화시킴으로써 불활성화시키고, 여기에 SNP 특이적 연장 프라이머, dNTP 혼합물, 디디옥시뉴클레오티드, 반응 완충액 및 DNA 중합효소를 첨가하여 프라이머 연장반응을 수행함으로써 이루어진다. 이때, 연장 프라이머는 SNP가 위치하는 염기의 5' 방향의 바로 인접한 염기를 3' 말단으로 삼으며, dNTP 혼합물에는 디디옥시뉴클레오티드와 동일한 염기를 갖는 핵산이 제외되고, 상기 디디옥시뉴클레오티드는 SNP를 나타내는 염기 종류 중 하나에서 선택된다. 예를 들어, A에서 G로의 치환이 있는 경우, dGTP, dCTP 및 TTP 혼합물과 ddATP를 반응에 첨가할 경우, 상기 치환이 일어난 염기에서 프라이머는 DNA 중합효소에 의하여 연장되고, 몇 염기가 지난 후 A 염기가 최초로 나타나는 위치에서 ddATP에 의하여 프라이머 연장반응이 종결된다. 만일 상기 치환이 일어나지 않았다면, 그 위치에서 연장반응이 종결되므로, 상기 연장된 프라이머의 길이를 비교함으로써 SNP를 나타내는 염기 종류를 판별할 수 있게 된다.On the other hand, in the PCR extension analysis, a DNA fragment containing a base on which a single base polymorphism is located is first amplified with a primer pair, and then all nucleotides added to the reaction are dephosphorylated to inactivate, and the SNP-specific extension primer, A dNTP mixture, didioxynucleotide, reaction buffer, and DNA polymerase are added to perform a primer extension reaction. At this time, the extension primer uses the base immediately adjacent to the 5'direction of the base where the SNP is located as the 3'end, and the nucleic acid having the same base as the didioxynucleotide is excluded from the dNTP mixture, and the didioxynucleotide represents the SNP. It is selected from one of the base types. For example, when there is a substitution from A to G, a mixture of dGTP, dCTP, and TTP and ddATP are added to the reaction, the primer at the base where the substitution occurs is extended by DNA polymerase, and after several bases, A Primer extension reaction is terminated by ddATP at the position where the base first appears. If the substitution does not occur, since the extension reaction is terminated at that position, it is possible to determine the type of base representing the SNP by comparing the length of the extended primers.

이때, 검출방법으로는 연장 프라이머 또는 디디옥시뉴클레오티드를 형광 표지한 경우에는 일반적인 염기서열 결정에 사용되는 유전자 분석기(예를 들어, ABI사의 Model 3700 등)를 사용하여 형광을 검출함으로써 상기 SNP를 검출할 수 있으며, 무-표지된 연장 프라이머 및 디디옥시뉴클레오티드를 사용할 경우에는 MALDI-TOF(matrix assisted laser desorption ionization-time of flight) 기법을 이용하여 분자량을 측정함으로써 상기 SNP를 검출할 수 있다.At this time, as a detection method, in the case of fluorescently labeled extension primers or dodioxynucleotides, the SNP can be detected by detecting fluorescence using a genetic analyzer (eg, ABI's Model 3700, etc.) used for general nucleotide sequence determination. The SNP can be detected by measuring the molecular weight using a matrix assisted laser desorption ionization-time of flight (MALDI-TOF) technique in the case of using a non-labeled extension primer and a didioxynucleotide.

구체적으로, 상기 (b) 단계에서 결정된 염기서열 중, 서열번호 1로 기재되는 rs12667492에 있어서, 16번째 염기가 A인 경우; 서열번호 2로 기재되는 rs74028907에 있어서, 16번째 염기가 T인 경우; 서열번호 3으로 기재되는 rs4909803에 있어서, 16번째 염기가 G인 경우; 서열번호 4로 기재되는 rs11646443에 있어서, 16번째 염기가 A인 경우; 서열번호 5로 기재되는 rs75575584에 있어서, 16번째 염기가 A인 경우; 서열번호 6으로 기재되는 rs17669146에 있어서, 16번째 염기가 C인 경우; 서열번호 7로 기재되는 rs4688636에 있어서, 16번째 염기가 G인 경우; 서열번호 8로 기재되는 rs1017409에 있어서, 16번째 염기가 C인 경우; 서열번호 9로 기재되는 rs981760에 있어서, 16번째 염기가 T인 경우; 서열번호 10으로 기재되는 rs1671413에 있어서, 16 번째 염기가 T인 경우; 서열번호 11로 기재되는 rs7013390에 있어서, 16번째 염기가 A인 경우; 서열번호 12로 기재되는 rs10955446에 있어서, 16 번째 염기가 C인 경우; 서열번호 13으로 기재되는 rs7015202에 있어서, 16 번째 염기가 A인 경우; 서열번호 14로 기재되는 rs1115971에 있어서, 16번째 염기가 C인 경우; 서열번호 15로 기재되는 rs7138393에 있어서, 16 번째 염기가 G인 경우, 사상체질 중 소음인으로 판단할 수 있다.Specifically, among the nucleotide sequences determined in step (b), in rs12667492 represented by SEQ ID NO: 1, when the 16th base is A; In rs74028907 shown in SEQ ID NO: 2, when the 16th base is T; In rs4909803 shown in SEQ ID NO: 3, when the 16th base is G; In rs11646443 shown in SEQ ID NO: 4, when the 16th base is A; In rs75575584 shown in SEQ ID NO: 5, when the 16th base is A; In rs17669146 shown in SEQ ID NO: 6, when the 16th base is C; In rs4688636 shown in SEQ ID NO: 7, when the 16th base is G; In rs1017409 shown in SEQ ID NO: 8, when the 16th base is C; In rs981760 shown in SEQ ID NO: 9, when the 16th base is T; In rs1671413 shown in SEQ ID NO: 10, when the 16th base is T; In rs7013390 shown in SEQ ID NO: 11, when the 16th base is A; In rs10955446 shown in SEQ ID NO: 12, when the 16th base is C; In rs7015202 shown in SEQ ID NO: 13, when the 16th base is A; In rs1115971 shown in SEQ ID NO: 14, when the 16th base is C; In rs7138393 described in SEQ ID NO: 15, when the 16th base is G, it can be determined as Soeumin among sasang constitutions.

본 발명의 또 다른 하나의 양태는, 소음인 진단용 조성물을 포함하는 소음인 진단용 키트를 제공한다.Another aspect of the present invention provides a kit for diagnosis of noise, comprising a composition for diagnosis of noise.

상기 키트는 RT-PCR 키트 또는 DNA 분석용(예, DNA 칩) 키트일 수 있다.The kit may be an RT-PCR kit or a kit for DNA analysis (eg, a DNA chip).

본 발명의 키트는 소음인 진단용 마커인 SNP 마커를 증폭을 통해 확인하거나, SNP 마커의 발현 수준을 mRNA의 발현 수준을 확인함으로써 사상체질을 진단할 수 있다. The kit of the present invention can diagnose sasang constitution by confirming the SNP marker, which is a diagnostic marker for soumin, through amplification, or by checking the expression level of the SNP marker and the mRNA expression level.

구체적인 일 구현예에서, 본 발명의 키트는 DNA 칩을 수행하기 위해 필요한 필수 요소를 포함하는 소음인 진단용 키트일 수 있다. DNA 칩 키트는, 일반적으로 편평한 고체 지지판, 전형적으로는 현미경용 슬라이드보다 크지 않은 유리 표면에 핵산 종을 격자형 배열(gridded array)로 부착한 것으로, 칩 표면에 핵산이 일정하게 배열되어, DNA 칩 상의 핵산과 칩 표면에 처리된 용액 내에 포함된 상보적인 핵산 간에 다중 혼성화(hybridization) 반응이 일어나 대량 병렬 분석이 가능하도록 하는 도구이다.In a specific embodiment, the kit of the present invention may be a diagnostic kit, which is a noise including essential elements necessary to perform a DNA chip. A DNA chip kit is a gridded array of nucleic acid species attached to a generally flat solid support plate, typically a glass surface that is not larger than a microscope slide, in which nucleic acids are uniformly arranged on the chip surface. It is a tool that enables mass-parallel analysis by causing multiple hybridization reactions between the nucleic acid on the image and the complementary nucleic acid contained in the solution treated on the chip surface.

본 발명의 또 다른 하나의 양태는, 소음인 진단용 조성물을 포함하는 소음인 진단용 마이크로어레이를 제공한다.Another aspect of the present invention provides a microarray for diagnosis of noise comprising a composition for diagnosis of noise.

상기 마이크로어레이는 DNA 또는 RNA 폴리뉴클레오티드를 포함하는 것일 수 있다. 상기 마이크로어레이는 프로브 폴리뉴클레오티드에 본 발명의 폴리뉴클레오티드를 포함하는 것을 제외하고는 통상적인 마이크로어레이로 이루어진다.The microarray may include DNA or RNA polynucleotides. The microarray is composed of a conventional microarray, except that the polynucleotide of the present invention is included in the probe polynucleotide.

프로브 폴리뉴클레오티드를 기판상에 고정화하여 마이크로어레이를 제조하는 방법은 당업계에 잘 알려져 있다. 상기 프로브 폴리뉴클레오티드는 혼성화할 수 있는 폴리뉴클레오티드를 의미하는 것으로, 핵산의 상보성 가닥에 서열 특이적으로 결합할 수 있는 올리고뉴클레오티드를 의미한다. A method of preparing a microarray by immobilizing a probe polynucleotide on a substrate is well known in the art. The probe polynucleotide refers to a polynucleotide capable of hybridizing, and refers to an oligonucleotide capable of sequence-specific binding to a complementary strand of a nucleic acid.

본 발명의 프로브는 대립 유전자 특이적 프로브로서, 같은 종의 두 구성원으로부터 유래한 핵산 단편 중에 다형성 부위가 존재하여, 한 구성원으로부터 유래한 DNA 단편에는 혼성화하나, 다른 구성원으로부터 유래한 단편에는 혼성화하지 않는다. 이 경우 혼성화 조건은 대립 유전자간의 혼성화 강도에 있어서 유의한 차이를 보여, 대립 유전자 중 하나에만 혼성화 하도록 충분히 엄격해야 한다. 이렇게 함으로써 다른 대립 유전자 형태 간에 좋은 혼성화 차이를 유발할 수 있다. The probe of the present invention is an allele-specific probe, and a polymorphic site exists in a nucleic acid fragment derived from two members of the same species, and hybridizes to a DNA fragment derived from one member, but does not hybridize to a fragment derived from another member. . In this case, the hybridization conditions show a significant difference in hybridization strength between alleles, and must be sufficiently stringent to hybridize to only one of the alleles. This can lead to good hybridization differences between different allele types.

본 발명의 상기 프로브는 대립 유전자를 검출하여 사상체질 중 소음인 진단 방법 등에 사용될 수 있다. 상기 예측 또는 진단 방법에는 서던 블롯트 등과 같은 핵산의 혼성화에 근거한 검출방법들이 포함되며, DNA 칩을 이용한 방법에서 DNA 칩의 기판에 미리 결합된 형태로 제공될 수도 있다. 상기 혼성화란 엄격한 조건, 예를 들면 1M 이하의 염 농도 및 25 ℃ 이상의 온도 하에서 보통 수행될 수 있다. 예를 들면, 5x SSPE (750 mM NaCl, 50 mM Na Phosphate, 5 mM EDTA, pH 7.4) 및 2530 ℃의 조건이 대립 유전자 특이적 프로브 혼성화에 적합할 수 있다.The probe of the present invention can be used for diagnosing noise in sasang constitution by detecting alleles. The prediction or diagnosis method includes detection methods based on hybridization of nucleic acids such as Southern blot, and may be provided in the form of being previously bound to a substrate of a DNA chip in a method using a DNA chip. The hybridization may be usually carried out under stringent conditions, for example, a salt concentration of 1M or less and a temperature of 25°C or more. For example, 5x SSPE (750 mM NaCl, 50 mM Na Phosphate, 5 mM EDTA, pH 7.4) and conditions of 2530° C. may be suitable for allele-specific probe hybridization.

본 발명의 사상체질 중 소음인 진단과 연관된 프로브 폴리뉴클레오티드의 기판상에 고정화하는 과정도 또한 이러한 종래 기술을 사용하여 용이하게 제조할 수 있다. 또한, 마이크로어레이 상에서의 핵산의 혼성화 및 혼성화 결과의 검출은 당업계에 잘 알려져 있다. 상기 검출은 예를 들면, 핵산 시료를 형광 물질 예를 들면 Cy3 및 Cy5와 같은 물질을 포함하는 검출가능한 신호를 발생시킬 수 있는 표지 물질로 표지한 다음, 마이크로어레이 상에 혼성화하고 상기 표지 물질로부터 발생하는 신호를 검출함으로써 혼성화 결과를 검출할 수 있다.The process of immobilizing the probe polynucleotide associated with the diagnosis of noise in the sasang constitution of the present invention on a substrate can also be easily prepared using this conventional technique. In addition, hybridization of nucleic acids on microarrays and detection of hybridization results are well known in the art. For the detection, for example, a nucleic acid sample is labeled with a labeling substance capable of generating a detectable signal including a fluorescent substance such as Cy3 and Cy5, and then hybridized on a microarray and generated from the labeling substance. The hybridization result can be detected by detecting the signal.

본 발명의 또 다른 하나의 양태는, 진단용 SNP 마커를 선별하는 방법을 제공한다.Another aspect of the present invention provides a method of selecting a diagnostic SNP marker.

상기 방법은 구체적으로, (a) 전장유전체 분석(Genome-Wide Association Study, GWAS)을 통해 체질과 연관성이 있는 SNP를 선별하는 단계; (b) 상기 (a) 단계의 선별된 SNP를 바탕으로 SNP 세트 선정 및 기능 영역을 맵핑하는 단계; (c) 딥러닝 알고리즘을 적용하여 SNP 기능 점수를 스코어링하는 단계; 및 (d) 상기 (c) 단계에서 기능점수가 높은 SNP를 선별하는 단계를 포함하는, 소음인 진단용 SNP 선별 방법일 수 있다.Specifically, the method comprises the steps of: (a) selecting SNPs associated with constitution through a full-length genome analysis (Genome-Wide Association Study, GWAS); (b) selecting an SNP set and mapping a functional region based on the selected SNP in step (a); (c) scoring a SNP function score by applying a deep learning algorithm; And (d) selecting SNPs having a high functional score in step (c).

상기 (a) 단계는 복잡한 데이터들로부터 체질과 연관성이 있는 SNP를 효과적으로 추출하기 위해 컨볼루션 레이어(Convolution layer)의 도입, 제한 볼츠만 머신(Restricted Boltzmann machine; RBM), 또는 오토 인코더(Auto-encoder)를 통해SNP를 선별할 수 있으나, 이에 제한되지는 않는다. 구체적으로는 각 체질별로 연관성이 P < 0.0001인 SNP들을 선별할 수 있다. Step (a) is the introduction of a convolution layer, a Restricted Boltzmann machine (RBM), or an auto-encoder in order to effectively extract the SNPs that are related to constitution from complex data. SNP can be selected through, but is not limited thereto. Specifically, SNPs having a correlation of P <0.0001 for each constitution can be selected.

상기 (b) 단계는 상기 (a) 단계에서 선별된 체질 연관 SNP를 바탕으로 SNP 세트를 선정하고 기능 영역을 맵핑할 수 있다.In the step (b), a SNP set may be selected based on the constitution-related SNP selected in step (a) and a functional region may be mapped.

통계적 유의성이 높은 SNP 주변으로 연관 불균형이 존재하는 LD 영역 내 실제 형질에 주요하게 작용하는 SNP가 존재할 수 있으므로, 하나의 SNP가 아닌 SNP 주변영역으로 확장하여 기능 패턴을 조사하였다.Since there may be SNPs that play a major role in the actual trait in the LD region in which linkage disequilibrium exists around SNPs with high statistical significance, the functional pattern was investigated by expanding to the region surrounding the SNP instead of one SNP.

상기 (c) 단계는 딥러닝 알고리즘을 적용하여 SNP 기능 점수를 스코어링할 수 있다.In the step (c), the SNP function score may be scored by applying a deep learning algorithm.

상기 (d) 단계는 상기 (c) 단계에서 기능점수가 높은 SNP를 선별할 수 있다.In the step (d), the SNP having a high functional score in step (c) may be selected.

본 발명의 SNP 마커는 사상체질 중 소음인을 진단하는 소음인 특이적 SNP 마커로서, 정확하고 객관적인 사상체질의 진단을 할 수 있을 것이다.The SNP marker of the present invention is a specific SNP marker for diagnosing Soeumin among Sasang constitutions, and it will be possible to accurately and objectively diagnose Sasang constitution.

도 1은 본 발명의 SNP를 선별하기 위한 방법의 순서도를 간략하게 나타낸 것이다.1 is a simplified flowchart of a method for selecting SNPs of the present invention.

이하, 실시예를 통하여 본 발명을 보다 상세히 설명하고자 한다. 이들 실시예는 본 발명을 보다 구체적으로 설명하기 위한 것으로, 본 발명의 범위가 이들 실시예에 의해 한정되는 것은 아니다.Hereinafter, the present invention will be described in more detail through examples. These examples are for explaining the present invention more specifically, and the scope of the present invention is not limited by these examples.

실시예 1. 대상자 선별Example 1. Subject selection

단축형 사상체질 진단도구(KS-15)를 통해 설문을 기반으로 체질이 분류된 태음인(TE), 소음인(SE), 소양인(SY) 집단을 대상으로 하였다. 각각의 체질별 집단은 5797명의 사람들로 이루어졌다.The subjects were Taeumin (TE), Soeuminin (SE), and Soyangin (SY) whose constitutions were classified based on a questionnaire through the shortened Sasang Constitutional Diagnosis Tool (KS-15). Each constitution group consisted of 5797 people.

실시예 2. 체질 연관 유전인자 선정Example 2. Selection of genetic factors related to constitution

상기 소음인 집단의 5797명에 대해 전장 유전체 분석(Genome-Wide Association Study, GWAS)을 진행하였다. 실험에서는 1000 genome project에서 제공되고 있는 ASN 패널을 사용하여 결측값을 추정(imputation)하였다. A full-length genome analysis (Genome-Wide Association Study, GWAS) was performed on 5797 people of the Soeumin group. In the experiment, the missing value was estimated using the ASN panel provided by the 1000 genome project.

전장 유전체 분석을 통해 찾은 유전형과 체질간 연관 분석(linear regression)을 통해 체질 연관 유전인자를 선정하였다. 이 때, 통계 파워를 계산하여 P-value < 0.0001을 기준으로 통계적으로 연관이 많은 유의한 SNP를 선정하였다. Genetic factors related to constitution were selected through linear regression between genotype and constitution found through full-length genome analysis. At this time, statistical power was calculated and statistically significant SNPs were selected based on P-value <0.0001.

실시예 3. SNP block 생성 및 SNP 세트 선정Example 3. SNP block generation and SNP set selection

전장 유전체에 대해서 p-value를 기반으로 연관성이 강한 순서대로 정렬한 후, 연관성이 가장 강한 SNP를 lead(단서) SNP로 선출하였고, lead SNP는 서로 1Mb 이내에 겹치지 않도록 선출하였다.After arranging the full-length genome in the order of strong correlation based on the p-value, the SNP with the strongest correlation was selected as the lead (clue) SNP, and the lead SNPs were selected so that they do not overlap within 1Mb of each other.

상기 유전체의 연관성은 유전자의 대립형(genotype allele type)에 따른 표현형(phenotype) 즉, 체질값으로 계산하였다. 상기 연관성의 정도는 오즈비(Odds Ratio)나 베타(BETA) 값의 통계량값으로 해석하였고, 유의성은 p-value 값으로 정하였다. 상기 p-value를 cut off로 설정하여 후보 가능한 SNP들을 정하였으며, 이를 정렬(sorting)하여 순차적으로 leadSNP를 선출하였다.The association of the genome was calculated as a phenotype, that is, a constitution value according to the genotype allele type. The degree of association was interpreted as a statistical value of odds ratio or beta, and significance was determined as a p-value. The p-value was set to cut off to determine possible candidate SNPs, and leadSNPs were sequentially selected by sorting them.

하나의 lead SNP와 그 주변 1Mb 이내의 후보 SNP들 중에 p-value가 5 x 10^-4 이하인 SNP들 중에서 연관성이 높은 순으로 최대 30개의 SNP를 선정하여 한 구간(block)으로 정의하였다. Among the SNPs with a p-value of 5 x 10 ^-4 or less among candidate SNPs within one lead SNP and its surrounding 1Mb, up to 30 SNPs were selected in the order of high correlation and defined as a block.

통계적 연관 변이 주변에는 통계적으로는 유의하지 않지만 유전적 연관 불균형 구간에 포함되어 있어 형질에 영향을 미치는 경우가 있으므로, 주변의 연관 SNP도 포함하기 위해 SNP block 구간 내의 SNP 목록을 하기 표 1과 같이 선정하였다.Statistical linkage variation is not statistically significant, but it is included in the genetic linkage disequilibrium section and may affect the trait, so the list of SNPs within the SNP block section is selected as shown in Table 1 below to include the surrounding linkage SNPs. I did.

또한, 훈련, 검증, 테스팅 세트 사이에 중복이 없도록 하기 위해 염색체 정보에 기반하여 데이터 세트를 구분하였다. 염색체 1번부터 10번까지는 훈련 세트로 사용하고, 모델의 최종 성능(performance)을 평가하는데 사용되는 테스팅 세트는 11번부터 14번까지를 사용하였다. 나머지 15번에서 22번까지의 염색체는 검증세트로 사용하였다.In addition, data sets were classified based on chromosome information in order not to overlap between training, verification, and testing sets. Chromosomes 1 to 10 were used as training sets, and testing sets 11 to 14 were used to evaluate the final performance of the model. The remaining chromosomes from 15 to 22 were used as validation sets.

하기 표 1의 수치는 학습에 사용된 SNP 세트의 기본 정보수로서 SNP block(구간)의 수를 의미한다.The numbers in Table 1 below represent the number of SNP blocks (sections) as the number of basic information of the SNP set used for learning.

훈련, 검증, 테스트 세트 수Number of training, validation, and test sets GWAS SNP set (out off P value < 5e^-04)GWAS SNP set (out off P value <5e ^-04 ) 소음 (SE)Noise (SE) 소양 (SY)Soyang (SY) 태음 (TE)Taeum (TE) Training Set (Chr. 1-10)Training Set (Chr. 1-10) 9292 8989 8686 Validation Set (Chr. 15-22)Validation Set (Chr. 15-22) 3333 2222 2626 Test Set (Chr.11-14)Test Set (Chr.11-14) 3030 2121 3131

실시예 4: SNP의 기능 영역 맵핑Example 4: Functional region mapping of SNP

각각의 SNP들의 위치에 존재하는 후성유전학적인 정보의 특이적 패턴들을 학습하여 SNP에 기능 점수를 부여하였다. 상기 후성유전학적인 정보는 ENCODE 프로젝트(The ENCODE Project Consortium, 2012) 및 Roadmap 에피지놈 프로젝트(Roadmap Epigenomics Consortium, 2015)에서 제공하는 DNase Hypersensitive site(DHS), 히스톤 변형, 타겟 유전자 기능, 전사조절인자들이 결합하는 부위 내에 포함되어 있는 유전자 영역이다(표 2). 상기 후성유전체 정보는 근처 유전자들의 발현 조절을 담당하는 단백질이 붙는 부위나 그 유전자가 작용하는 대사경로에 관한 것이다. 이를 기반으로 SNP에 기능점수가 부여된다. 즉, SNP가 상기 기능 부위에 해당하는 경우 1, 해당하지 않은 경우를 0으로 하는 바이너리 데이터를 구성하였다.Function points were assigned to SNPs by learning specific patterns of epigenetic information present at the positions of each SNP. The epigenetic information is the DNase Hypersensitive site (DHS) provided by the ENCODE Project (The ENCODE Project Consortium, 2012) and the Roadmap Epigenomics Consortium (2015), histone modification, target gene function, and transcriptional regulatory factors. It is a gene region contained within the region (Table 2). The epigenetic information relates to a site to which a protein responsible for regulating the expression of nearby genes is attached or a metabolic pathway through which the gene acts. Based on this, SNP is given a function score. That is, when the SNP corresponds to the functional site 1, the binary data was configured as 0 when it does not.

학습에 사용된 기능 맵핑 정보 수Number of feature mapping information used for training # feature# feature 소음 (SE)Noise (SE) 소양 (SY)Soyang (SY) 태음 (TE)Taeum (TE) DHSDHS 124124 124124 122122 119119 Histone modificationHistone modification 606606 570570 511511 453453 Target gene pathwayTarget gene pathway 301301 99 1414 1111 FMOFMO 996996 1515 77 66

실시예 5: 딥러닝 알고리즘을 적용한 SNP의 기능 점수화Example 5: Functional scoring of SNP applying deep learning algorithm

체질값과 통계적으로 연관된 유전부위에 존재하는 생물학적 기능 패턴을 추출하기 위해 컨볼루션 신경망 기반의 딥러닝 프레임워크를 활용하였다. A deep learning framework based on a convolutional neural network was used to extract biological function patterns that exist in genetic regions that are statistically related to constitutional values.

실시예 2에서 생성된 하나의 SNP block이 하나의 훈련 이미지 데이터가 되며, 이 block 내에 적어도 하나 이상의 원인 변이가 있는 것을 바탕으로 컨볼루션 신경망 프레임워크를 활용하여 원인 변이의 특징을 학습하였다.One SNP block generated in Example 2 becomes one training image data, and based on the fact that at least one cause mutation exists in this block, the features of the cause mutation were learned using the convolutional neural network framework.

학습에 사용한 모델은 패턴 탐색기 부분과 각 SNP 마다 기능 점수를 스코어링하는 부분으로 나눌 수 있으며, 하기의 과정으로 수행되었다.The model used for learning can be divided into a pattern explorer part and a part for scoring a function score for each SNP, and was performed in the following process.

5-1. 전장 유전체 연관성 분석 결과로부터 입력 이미지의 구성5-1. Composition of input image from full-length genome correlation analysis results

전장 유전체에 대해 p-value를 기반으로 연관성이 강한 순서대로 정렬한 후 가장 강한 SNP부터 시작하여 1Mb 이내에 겹치지 않도록 단서(lead) SNP를 선출하였다. 하나의 lead SNP와 그 주변 1Mb 이내에서 선택된 SNP를 포함하는 후보 SNP 중에 연관성이 높은 순으로 최대 30개 SNP까지 한 구간(block)으로 정의하였다. 이 때, 한 구간으로부터 하나의 이미지로 규정되는 훈련 이미지 데이터가 생성되며, 이 구간 내 적어도 하나 이상의 원인 변이가 있다는 것을 이용하여 이미지의 부분적인 특징을 추출할 수 있는 CNN 프레임워크를 활용하여 원인변이의 특징을 학습하였다.After sorting in the order of strong correlation based on the p-value for the full-length genome, lead SNPs were selected so that they do not overlap within 1Mb starting with the strongest SNP. Among candidate SNPs including one lead SNP and a SNP selected within 1Mb of its surroundings, up to 30 SNPs in the order of high correlation were defined as one block. At this time, the training image data defined as one image is generated from one section, and the cause variation using a CNN framework that can extract partial features of the image by using at least one causal variation in this section. I learned the characteristics of

5-2. 모델 디자인5-2. Model design

CNN 프레임워크는 조정(tuning)이 가능한 패턴 탐색기(pattern detector)들로 구성되어 있다. 상기 패턴 탐색기들은 커널(kernel), 또는 필터(filter)로 불리며, 이는 국소 연결성(local connectivity)이 있는 부분적인 패턴을 감지하고, 이를 전체 입력 필드(field)에 대해 반복적으로 연산을 수행하였다.The CNN framework consists of pattern detectors that can be tuned. The pattern searchers are called kernels or filters, which detect partial patterns with local connectivity and perform operations repeatedly on the entire input field.

5-3. Denoising-오토 인코더(DA)를 활용한 사전학습(pre-training)5-3. Denoising-pre-training using auto encoder (DA)

오토 인코더는 딥러닝에서 커널들을 미세조정(fine-tuning)하기에 앞서 훈련(training) 데이터들의 좀 더 일반적인 특징을 학습할 수 있도록, 국소 최소값에 빠지지 않도록 좀 더 최적화된 포인트에서부터 학습을 시작할 수 있도록 하는 비지도(unsupervised) 학습 기법이다. 모델이 더 강건한(robust) 특징을 학습할 수 있도록 입력값에 무작위적인 노이즈를 섞어서 학습하는 denoising-오토인코더(DA)를 활용하여 사전학습을 수행하였다.The auto-encoder enables learning more general features of training data before fine-tuning the kernels in deep learning, and starts learning from a more optimized point so as not to fall into a local minimum. It is an unsupervised learning technique. Pre-learning was performed using a denoising-auto-encoder (DA) that mixes random noise into the input value so that the model can learn more robust features.

5-4. 하이퍼 파라미터(hyper parameter)5-4. Hyper parameter

딥러닝 모델은 다수의 하이퍼 파라미터들에 의존하며, 각각의 하이퍼 파라미터들은 모델의 성능에 상당한 영향을 미친다. 하이퍼 파라미터는 DA 사전학습 과정에서의 입력값에 적용되는 노이즈 레벨(corruption level), 미세조정 과정에서의 학습률(learning rate), 모멘텀(momemtum), 엘라스틱넷(elastic net) 규제화(regularization) 과정에 사용되는 파라미터가 포함된다.Deep learning models rely on a number of hyperparameters, and each hyperparameter has a significant impact on the model's performance. Hyper-parameters are used for the noise level (corruption level) applied to the input values in the DA pre-learning process, the learning rate in the fine tuning process, the momentum, and the elastic net regularization process. Parameters are included.

무작위적으로 선출된 하이퍼 파라미터 셋에 기반하여 최적화된 조건을 찾았다.Optimized conditions were found based on a set of randomly selected hyper parameters.

5-5. SNP의 기능 점수화5-5. SNP function scoring

첫 번째 레이어(Convolution layer)(C1)에서는 한 개의 제1층 입력이미지를 규정하는 이미지 매트릭스를 입력한다. 제1층 입력이미지와 컨볼루션 커널인 제1층 커널을 사용하여 각 후보 SNP 들이 각 종류의 패턴들과 부합하는지에 대한 매트릭스를 생성하는 컨볼루션 과정을 수행하였다. 제1층 커널을 나타내는 어레이의 각 요소는 0 또는 1의 값을 갖는다.In the first layer (Convolution layer) C1, an image matrix defining one input image of the first layer is input. Using the first layer input image and the first layer kernel, which is a convolution kernel, a convolution process was performed to generate a matrix of whether each candidate SNP matches each type of pattern. Each element of the array representing the first layer kernel has a value of 0 or 1.

두 번째 레이어(Convolution layer)(C2)에서는 추출된 패턴 결과를 바탕으로 비선형적인 복잡한 패턴을 분석할 수 있었다. 제2층 커널의 어레이 각 요소는 0 또는 1의 값을 갖는다.In the second layer (Convolution layer) (C2), a nonlinear complex pattern could be analyzed based on the extracted pattern result. Each element of the array of the second layer kernel has a value of 0 or 1.

예측 점수가 1에 가까울수록 리스크 유전 구간들에서 공통적으로 나타나는 조절 패턴들이 감지되었음을 의미한다.The closer the predicted score is to 1, the more common control patterns in the risk genetic sections were detected.

마지막으로, max-pooling을 수행하여 최종 도출된 점수는 하나의 연관성 구간 내에서 리스크 패턴과 가장 잘 매칭되는 SNP의 점수로 도출하였다.Finally, the score finally derived by performing max-pooling was derived as the score of the SNP that best matches the risk pattern within one correlation section.

실시예 6: 검증절차를 통한 사상체질 진단용 SNP 선정Example 6: Selection of SNP for diagnosing Sasang constitution through verification procedure

체질 확진자를 통한 검증절차를 거쳐 최종적으로 소음인 특이적 사상체질 진단용 SNP를 선정하였다. After the verification procedure through the constitution confirmed person, SNP for the diagnosis of specific Sasang constitution was finally selected.

체질 확진자의 선정 기준은 다음과 같다.The criteria for selecting a person with constitution are as follows.

기준 1: 각 체질별 KS-15의 확률 값이 높은 대상자 우선Criterion 1: Subjects with a high probability of KS-15 for each constitution are given priority

기준 2: 체질 전문가의 진단과 KS-15 의 확률 값 진단 체질이 동일Criterion 2: Constitutional expert's diagnosis and KS-15 probability value diagnosis constitution is the same

기준 3: BMI 지수 18.5 이상Criterion 3: BMI index 18.5 or higher

기준 4: 소음인의 경우, BMI 지수 18.5 내지 25의 정상체중 집단 30명 및 BMI 지수 25 초과의 과체중 집단 30명으로 나눔Criterion 4: In the case of Soeumin, divided into 30 normal weight groups with a BMI index of 18.5 to 25 and 30 overweight groups with a BMI index of 25 or more.

기준 5: 대사증후군이 없고, 질병 과거력 중 암 및 갑상선 이상이 없을 것Criterion 5: No metabolic syndrome and no cancer or thyroid abnormalities in the history of disease

체질 확진자 선정 기준에 따라 소음인 30명, 소양인 30명, 태음인 살찐 그룹 30명, 태음인 정상체중 그룹 30명을 대상으로 WGS(Whole Genome Sequencing)를 수행하였다. 각각의 체질 확진자 그룹에서 체질별로 선정된 후보 SNP들이 존재하는지 재현성 확인을 하고, 다른 체질 값과 연관된 SNP를 제외하였다.WGS (Whole Genome Sequencing) was performed on 30 Soumin, 30 Soyangin, 30 Fatty Groups, and 30 Normal Weight Groups for Taeumin, according to the selection criteria for constitutional diagnosis. Reproducibility was checked for the presence of candidate SNPs selected for each constitution in each constitutional confirmed group, and SNPs associated with other constitutional values were excluded.

필터링 된 SNP 중, SNP의 빈도와 기능 점수를 고려하여 Minor Allel Frequency가 높고 기능 점수 값이 높은 SNP를 선정하였다.Among the filtered SNPs, a SNP with a high Minor Allel Frequency and a high function score was selected in consideration of the frequency and function score of the SNP.

후보 SNP 선정 절차 및 검증 절차를 거쳐 최종적으로 선정된 체질 특이적 SNP를 하기 표 3에 나타내었다.The constitution-specific SNPs finally selected through the candidate SNP selection procedure and verification procedure are shown in Table 3 below.

소음인 특이적 SNPSoeumin specific SNP SNP IDSNP ID Allel Type　 Allel Type Minor Allel FrequencyMinor Allel Frequency Chr.Chr. Position　 Position GWAS P-valueGWAS P-value Functional ScoreFunctional Score SE Allel FrequencySE Allel Frequency rs12667492rs12667492 A>GA>G 0.18340.1834 77 155965552155965552 0.000024370.00002437 0.990.99 0.560.56 rs74028907rs74028907 T>GT>G 0.17910.1791 1515 8766190687661906 0.000041460.00004146 0.990.99 0.60.6 rs4909803rs4909803 G>TG>T 0.15880.1588 88 139509485139509485 0.000020540.00002054 1One 0.320.32 rs11646443rs11646443 A>GA>G 0.081590.08159 1616 8427047684270476 0.000030380.00003038 0.990.99 0.50.5 rs75575584rs75575584 A>GA>G 0.054770.05477 22 4739338347393383 0.000025690.00002569 1One 0.420.42 rs17669146rs17669146 C>TC>T 0.094530.09453 33 6155209261552092 0.00047710.0004771 1One 1One rs4688636rs4688636 G>AG>A 0.085280.08528 33 6155563761555637 0.00012410.0001241 1One 1One rs1017409rs1017409 C>TC>T 0.23490.2349 55 5922562159225621 0.00015960.0001596 0.8330.833 1One rs981760rs981760 T>AT>A 0.24380.2438 55 5923431259234312 0.00011060.0001106 0.8140.814 1One rs1671413rs1671413 T>CT>C 0.0010380.001038 88 1322658813226588 0.0003070.000307 1One 1One rs7013390rs7013390 A>GA>G 0.15220.1522 88 108288033108288033 0.0002340.000234 0.950.95 1One rs10955446rs10955446 C>TC>T 0.1520.152 88 108288324108288324 0.00015930.0001593 0.860.86 1One rs7015202rs7015202 A>CA>C 0.028110.02811 88 139501740139501740 0.00007850.0000785 0.9130.913 1One rs1115971rs1115971 C>TC>T 0.13870.1387 1212 3183695031836950 0.00036180.0003618 1One 1One rs7138393rs7138393 G>CG>C 0.081390.08139 1212 3359078733590787 0.00033890.0003389 1One 1One

이상의 설명으로부터, 본 발명이 속하는 기술분야의 당업자는 본 발명이 그 기술적 사상이나 필수적 특징을 변경하지 않고서 다른 구체적인 형태로 실시될 수 있다는 것을 이해할 수 있을 것이다. 이와 관련하여, 이상에서 기술한 실시예들은 모든 면에서 예시적인 것이며 한정적인 것이 아닌 것으로 이해해야만 한다. 본 발명의 범위는 상기 상세한 설명보다는 후술하는 청구범위의 의미 및 범위 그리고 그 등가 개념으로부터 도출되는 모든 변경 또는 변형된 형태가 본 발명의 범위에 포함되는 것으로 해석되어야 한다.From the above description, those skilled in the art to which the present invention pertains will be able to understand that the present invention can be implemented in other specific forms without changing the technical spirit or essential features thereof. In this regard, it should be understood that the embodiments described above are illustrative in all respects and not limiting. The scope of the present invention should be construed that all changes or modifications derived from the meaning and scope of the claims to be described later, and equivalent concepts, rather than the detailed description above, are included in the scope of the present invention.

<110> Korea Institute of Oriental Medicine <120> SNP markers for diagnosing Soeumin of sasang constitution and use thereof <130> KPA181218-KR <160> 15 <170> KoPatentIn 3.0 <210> 1 <211> 31 <212> DNA <213> Homo sapiens <220> <221> variation <222> (16) <223> Y= A or G <400> 1 taggatgaga gttacygaat aaggtggtcc g 31 <210> 2 <211> 31 <212> DNA <213> Homo sapiens <220> <221> variation <222> (16) <223> R= T or G <400> 2 ggtgaccttt ctctcrcctg ctctccagac t 31 <210> 3 <211> 31 <212> DNA <213> Homo sapiens <220> <221> variation <222> (16) <223> Y= G or T <400> 3 ccgctgcgca ccgcgygggt agacgctgcg c 31 <210> 4 <211> 31 <212> DNA <213> Homo sapiens <220> <221> variation <222> (16) <223> S= A or G <400> 4 ccatctcgcg cagccsgttc atgcacaggc c 31 <210> 5 <211> 31 <212> DNA <213> Homo sapiens <220> <221> variation <222> (16) <223> K= A or G <400> 5 gagttcagac acattkcacc tttcacctaa t 31 <210> 6 <211> 31 <212> DNA <213> Homo sapiens <220> <221> variation <222> (16) <223> R= C or T <400> 6 gattgaaagt aacttrcttg gcattcacgt g 31 <210> 7 <211> 31 <212> DNA <213> Homo sapiens <220> <221> variation <222> (16) <223> R= G or A <400> 7 aaatgtgtct cataartctt ccaaatccct g 31 <210> 8 <211> 31 <212> DNA <213> Homo sapiens <220> <221> variation <222> (16) <223> R= C or T <400> 8 cctaagacaa gcccarctta caaaagaggc t 31 <210> 9 <211> 31 <212> DNA <213> Homo sapiens <220> <221> variation <222> (16) <223> R= T or A <400> 9 ccagccaaga tgaccratat gtctaaaacc a 31 <210> 10 <211> 31 <212> DNA <213> Homo sapiens <220> <221> variation <222> (16) <223> K= T or C <400> 10 tgattgcctc agcctktgat gatgtttaga g 31 <210> 11 <211> 31 <212> DNA <213> Homo sapiens <220> <221> variation <222> (16) <223> K= A or G <400> 11 caccttagat aatttkttgg aggcagtgct t 31 <210> 12 <211> 31 <212> DNA <213> Homo sapiens <220> <221> variation <222> (16) <223> Y= C or T <400> 12 gtgcctctga agaccyagga agcctttttt g 31 <210> 13 <211> 31 <212> DNA <213> Homo sapiens <220> <221> variation <222> (16) <223> Y= A or C <400> 13 ctccatgcct cctgtytaat ttacttttct c 31 <210> 14 <211> 31 <212> DNA <213> Homo sapiens <220> <221> variation <222> (16) <223> Y= C or T <400> 14 aaaatcagtt gttaayataa taactgatgt c 31 <210> 15 <211> 31 <212> DNA <213> Homo sapiens <220> <221> variation <222> (16) <223> Y= G or C <400> 15 caacatgctc acacaytgcc ttccactgag t 31 <110> Korea Institute of Oriental Medicine <120> SNP markers for diagnosing Soeumin of sasang constitution and use thereof <130> KPA181218-KR <160> 15 <170> KoPatentIn 3.0 <210> 1 <211> 31 <212> DNA <213> Homo sapiens <220> <221> variation <222> (16) <223> Y= A or G <400> 1 taggatgaga gttacygaat aaggtggtcc g 31 <210> 2 <211> 31 <212> DNA <213> Homo sapiens <220> <221> variation <222> (16) <223> R= T or G <400> 2 ggtgaccttt ctctcrcctg ctctccagac t 31 <210> 3 <211> 31 <212> DNA <213> Homo sapiens <220> <221> variation <222> (16) <223> Y= G or T <400> 3 ccgctgcgca ccgcgygggt agacgctgcg c 31 <210> 4 <211> 31 <212> DNA <213> Homo sapiens <220> <221> variation <222> (16) <223> S= A or G <400> 4 ccatctcgcg cagccsgttc atgcacaggc c 31 <210> 5 <211> 31 <212> DNA <213> Homo sapiens <220> <221> variation <222> (16) <223> K= A or G <400> 5 gagttcagac acattkcacc tttcacctaa t 31 <210> 6 <211> 31 <212> DNA <213> Homo sapiens <220> <221> variation <222> (16) <223> R= C or T <400> 6 gattgaaagt aacttrcttg gcattcacgt g 31 <210> 7 <211> 31 <212> DNA <213> Homo sapiens <220> <221> variation <222> (16) <223> R= G or A <400> 7 aaatgtgtct cataartctt ccaaatccct g 31 <210> 8 <211> 31 <212> DNA <213> Homo sapiens <220> <221> variation <222> (16) <223> R= C or T <400> 8 cctaagacaa gcccarctta caaaagaggc t 31 <210> 9 <211> 31 <212> DNA <213> Homo sapiens <220> <221> variation <222> (16) <223> R= T or A <400> 9 ccagccaaga tgaccratat gtctaaaacc a 31 <210> 10 <211> 31 <212> DNA <213> Homo sapiens <220> <221> variation <222> (16) <223> K= T or C <400> 10 tgattgcctc agcctktgat gatgtttaga g 31 <210> 11 <211> 31 <212> DNA <213> Homo sapiens <220> <221> variation <222> (16) <223> K= A or G <400> 11 caccttagat aatttkttgg aggcagtgct t 31 <210> 12 <211> 31 <212> DNA <213> Homo sapiens <220> <221> variation <222> (16) <223> Y= C or T <400> 12 gtgcctctga agaccyagga agcctttttt g 31 <210> 13 <211> 31 <212> DNA <213> Homo sapiens <220> <221> variation <222> (16) <223> Y= A or C <400> 13 ctccatgcct cctgtytaat ttacttttct c 31 <210> 14 <211> 31 <212> DNA <213> Homo sapiens <220> <221> variation <222> (16) <223> Y= C or T <400> 14 aaaatcagtt gttaayataa taactgatgt c 31 <210> 15 <211> 31 <212> DNA <213> Homo sapiens <220> <221> variation <222> (16) <223> Y= G or C <400> 15 caacatgctc acacaytgcc ttccactgag t 31

Claims

Includes a polynucleotide consisting of the nucleotide sequence of SEQ ID NO: 6 or a complementary polynucleotide thereof, and the nucleotide sequence includes the 16th nucleotide as SNP (single nucleotide polymorphism),
The 16th base of SEQ ID NO: 6 is C, a diagnostic microarray of noise in sasang constitution.

The method of claim 1, wherein the polynucleotide further comprises one or more polynucleotides selected from the group consisting of nucleotide sequences of SEQ ID NOs: 1 to 5 and 7 to 15, wherein the nucleotide sequence is a 16th base as a single nucleotide polymorphism (SNP). ), and
The 16th base of SEQ ID NO: 1 is A, the 16th base of SEQ ID NO: 2 is T, the 16th base of SEQ ID NO: 3 is G, the 16th base of SEQ ID NO: 4 is A, and the 16th base of SEQ ID NO: 5 is A , The 16th base of SEQ ID NO: 7 is G, the 16th base of SEQ ID NO: 8 is C, the 16th base of SEQ ID NO: 9 is T, the 16th base of SEQ ID NO: 10 is T, and the 16th base of SEQ ID NO: 11 is A , The 16th base of SEQ ID NO: 12 is C, the 16th base of SEQ ID NO: 13 is A, the 16th base of SEQ ID NO: 14 is C, the 16th base of SEQ ID NO: 15 is G.

A probe capable of binding to a polynucleotide consisting of the nucleotide sequence of SEQ ID NO: 6 or a complementary polynucleotide thereof, or a primer capable of specifically amplifying,
The 16th base of SEQ ID NO: 6 is C, a diagnostic composition for noise in sasang constitution.

The method of claim 3, wherein the polynucleotide further comprises one or more polynucleotides selected from the group consisting of nucleotide sequences of SEQ ID NOs: 1 to 5 and 7 to 15,
The 16th base of SEQ ID NO: 1 is A, the 16th base of SEQ ID NO: 2 is T, the 16th base of SEQ ID NO: 3 is G, the 16th base of SEQ ID NO: 4 is A, and the 16th base of SEQ ID NO: 5 is A , The 16th base of SEQ ID NO: 7 is G, the 16th base of SEQ ID NO: 8 is C, the 16th base of SEQ ID NO: 9 is T, the 16th base of SEQ ID NO: 10 is T, and the 16th base of SEQ ID NO: 11 is A , The 16th base of SEQ ID NO: 12 is C, the 16th base of SEQ ID NO: 13 is A, the 16th base of SEQ ID NO: 14 is C, the 16th base of SEQ ID NO: 15 is G.

A kit for diagnosis of noise in Sasang constitution comprising the composition of claim 3 or 4.

The kit of claim 5, wherein the kit is an RT-PCR kit or a DNA chip kit.

(a) amplifying a polymorphic site comprising a SNP of a nucleotide sequence of SEQ ID NO: 6 or a complementary polynucleotide thereof from DNA of a sample isolated from the individual, or hybridizing with a probe;
(b) determining the base of the amplified or hybridized polymorphic site of step (a); And
(c) Among the nucleotide sequences determined in step (b),
In rs17669146 of SEQ ID NO: 6, when the 16th base is C, the method of providing information for predicting or diagnosing noise in sasang constitution, comprising determining the individual as a soeumin.

The method of claim 7, wherein the polynucleotide further comprises one or more polynucleotides selected from the group consisting of SEQ ID NOs: 1 to 5 and 7 to 15,
Among the nucleotide sequences determined in step (b),
In rs17669146 described in SEQ ID NO: 6, the 16th base is C,
In rs12667492 shown in SEQ ID NO: 1, when the 16th base is A;
In rs74028907 shown in SEQ ID NO: 2, when the 16th base is T;
In rs4909803 shown in SEQ ID NO: 3, when the 16th base is G;
In rs11646443 shown in SEQ ID NO: 4, when the 16th base is A;
In rs75575584 shown in SEQ ID NO: 5, when the 16th base is A;
In rs4688636 shown in SEQ ID NO: 7, when the 16th base is G;
In rs1017409 shown in SEQ ID NO: 8, when the 16th base is C;
In rs981760 shown in SEQ ID NO: 9, when the 16th base is T;
In rs1671413 shown in SEQ ID NO: 10, when the 16th base is T;
In rs7013390 shown in SEQ ID NO: 11, when the 16th base is A;
In rs10955446 shown in SEQ ID NO: 12, when the 16th base is C;
In rs7015202 shown in SEQ ID NO: 13, when the 16th base is A;
In rs1115971 shown in SEQ ID NO: 14, when the 16th base is C; or
In rs7138393 described in SEQ ID NO: 15, when the 16th base is G,
Determining the subject to be a somber.

The method of claim 7 or 8, wherein the base determination in step (b) is sequence analysis, hybridization by microarray, allele specific PCR, and dynamic allele-specific hybridization. , DASH), PCR extension analysis, PCR-SSCP (PCR-single strand conformation polymorphism), PCR-RFLP (PCR-restriction fragment length polymorphism) and TaqMan method that is performed by one or more methods selected from the group consisting of .

delete