JP2021175381A

JP2021175381A - Method for detecting infant atopic dermatitis

Info

Publication number: JP2021175381A
Application number: JP2020081473A
Authority: JP
Inventors: 直人高田; Naoto Takada; 高良井上; Takayoshi Inoue; 哲矢桑野; Tetsuya Kuwano
Original assignee: Kao Corp
Current assignee: Kao Corp
Priority date: 2020-05-01
Filing date: 2020-05-01
Publication date: 2021-11-04
Also published as: JP2023157965A

Abstract

To provide a marker for detecting atopic dermatitis, and a method for detecting atopic dermatitis using the marker.SOLUTION: A method for detecting atopic dermatitis in a subject includes the step of measuring the expression level of at least one gene selected from seventeen gene groups consisting of MECR, RASA4CP, ARRDC4, EIF1AD, FDFT1, ZNF706, TEX2, TMPRSS11E, RPS6KB2, CTBP1, ZNF335, DGKA, PPP1R9B, SPDYE7P, DNASE1L1, GNB2 and CSNK1G2 or an expression product thereof in a biological sample taken from the subject.SELECTED DRAWING: None

Description

本発明は、アトピー性皮膚炎マーカーを用いたアトピー性皮膚炎の検出方法に関する。 The present invention relates to a method for detecting atopic dermatitis using an atopic dermatitis marker.

アトピー性皮膚炎（以下、「ＡＤ」とも称する）は、アトピー素因を有する者に主に発症する湿疹性皮膚疾患である。アトピー性皮膚炎の典型的な症状は、左右対側性に発生する、慢性及び反復性の痒み、皮疹、紅斑等、ならびに角化不全、バリア能低下、乾燥肌などである。アトピー性皮膚炎の多くは乳幼児に発症し、成長と共に軽快傾向を示すが、近年では成人型や難治性のアトピー性皮膚炎も増加している。 Atopic dermatitis (hereinafter, also referred to as "AD") is an eczema skin disease that mainly develops in persons having an atopic predisposition. Typical symptoms of atopic dermatitis include bilateral, chronic and recurrent itching, rashes, erythema, etc., as well as keratinization deficiency, reduced barrier capacity, and dry skin. Most atopic dermatitis develops in infants and tends to improve with growth, but in recent years, adult-type and refractory atopic dermatitis have also increased.

アトピー性皮膚炎の症度の判断は、肉眼による所見を頼りに行われているのが現状である。所見項目としては、乾燥症状、紅斑、鱗屑、丘疹、掻破痕、浮腫、痂疲の付着、小水疱、びらん、痒診結節など様々が存在するが、いずれも主観的要素が入り込むため、医師の技量により判断に差が生じる可能性を含んでいる。斯様に、アトピー性皮膚炎の症度の判断は客観性に乏しいという問題がある。 At present, the degree of atopic dermatitis is determined by relying on the findings with the naked eye. There are various findings such as dryness, erythema, scales, papules, scratches, edema, adhesion of scabs, vesicles, erosions, and pruritus nodules. There is a possibility that the judgment may differ depending on the skill. Thus, there is a problem that the judgment of the degree of atopic dermatitis is not objective.

近年、バイオマーカーを用いてアトピー性皮膚炎を検出する方法として、血液の末梢血好酸球数、血清総ＩｇＥ値、ＬＤＨ（乳酸デヒドロゲナーゼ）値、血清中のＴｈｙｍｕｓａｎｄＡｃｔｉｖａｔｉｏｎ−ＲｅｇｕｌａｔｅｄＣｈｅｍｏｋｉｎｅ（ＴＡＲＣ）やＳｑｕａｍｏｕｓｃｅｌｌｃａｒｃｉｎｏｍａａｎｔｉｇｅｎ２（ＳＣＣＡ２）を検出すること（非特許文献１、２、３）や、皮膚細菌叢における黄色ブドウ球菌のａｇｒＣ変異依存性のＲＮＡＩＩＩ遺伝子発現量を検出すること（特許文献１）等が提案されているが、必ずしも十分な精度で診断が可能であるとは云えない。 In recent years, as a method for detecting atopic dermatitis using a biomarker, peripheral blood eosinophil count in blood, total serum IgE value, LDH (lactate dehydrogenase) level, and Symus and Activation-Regulated Chemokine (TARC) in serum have been used. And Squamous cell carcinoma antigen 2 (SCCA2) (Non-Patent Documents 1, 2 and 3), and the agrC mutation-dependent RNAIII gene expression level of Staphylococcus aureus in the skin flora (Patent Document 1). ) Etc. have been proposed, but it cannot always be said that diagnosis is possible with sufficient accuracy.

一方、生体試料中のＤＮＡやＲＮＡ等の核酸の解析によりヒトの生体内の現在さらには将来の生理状態を調べる技術が開発されている。核酸を用いた解析は、網羅的な解析方法が確立されており一度の解析で豊富な情報を得られる、及び一塩基多形やＲＮＡ機能等に関する多くの研究報告に基づいて解析結果の機能的な紐付けが容易であるといった利点を有する。生体由来の核酸は、血液等の体液、分泌物、組織等から抽出することができるが、最近、皮膚表上脂質（ｓｋｉｎｓｕｒｆａｃｅｌｉｐｉｄｓ；ＳＳＬ）に含まれるＲＮＡを生体の解析用の試料として用いること、ＳＳＬから表皮、汗腺、毛包及び皮脂腺のマーカー遺伝子が検出できることが報告されている（特許文献２）。 On the other hand, a technique for investigating the current and future physiological states in a human body by analyzing nucleic acids such as DNA and RNA in a biological sample has been developed. For analysis using nucleic acids, a comprehensive analysis method has been established and abundant information can be obtained by one analysis, and the functional analysis results are based on many research reports on monobasic polymorphism and RNA function. It has the advantage of being easy to link. Nucleic acids derived from living organisms can be extracted from body fluids such as blood, secretions, tissues, etc. Recently, RNA contained in skin surface lipids (SSL) is used as a sample for analysis of living organisms. It has been reported that marker genes for epidermis, sweat glands, hair follicles and sebaceous glands can be detected from SSL (Patent Document 2).

特開２０１９−３０２７２号公報Japanese Unexamined Patent Publication No. 2019-30272 国際公開公報第２０１８／００８３１９号International Publication No. 2018/0083319

Sugawara et al., Allergy (2002) 57:180-181Sugawara et al., Allergy (2002) 57: 180-181 Ohta et al., Ann Clin Biochem.(2012) 49:277-84Ohta et al., Ann Clin Biochem. (2012) 49: 277-84 加藤ら、日皮会誌（2018）128: 2431-2502Kato et al., Nikkikai Magazine (2018) 128: 2431-2502

本発明は、アトピー性皮膚炎を検出するためのマーカー、及び該マーカーを用いたアトピー性皮膚炎の検出方法を提供することに関する。 The present invention relates to providing a marker for detecting atopic dermatitis and a method for detecting atopic dermatitis using the marker.

本発明者らは、成人のアトピー性皮膚炎患者と健常人からＳＳＬを採取し、ＳＳＬ中に含まれるＲＮＡの発現状態をシーケンス情報として網羅的に解析した結果、特定の遺伝子の発現レベルが両者の間で有意に異なり、これを指標としてアトピー性皮膚炎を検出できることを見出した。 The present inventors collected SSL from adult patients with atopic dermatitis and healthy subjects, and comprehensively analyzed the expression state of RNA contained in the SSL as sequence information. As a result, the expression levels of specific genes were both. It was found that atopic dermatitis can be detected using this as an index.

すなわち、本発明は、以下の１）〜３）に係るものである。
１）被験者から採取された生体試料について、ＭＥＣＲ、ＲＡＳＡ４ＣＰ、ＡＲＲＤＣ４、ＥＩＦ１ＡＤ、ＦＤＦＴ１、ＺＮＦ７０６、ＴＥＸ２、ＴＭＰＲＳＳ１１Ｅ、ＲＰＳ６ＫＢ２、ＣＴＢＰ１、ＺＮＦ３３５、ＤＧＫＡ、ＰＰＰ１Ｒ９Ｂ、ＳＰＤＹＥ７Ｐ、ＤＮＡＳＥ１Ｌ１、ＧＮＢ２及びＣＳＮＫ１Ｇ２からなる１７種の遺伝子群より選択される少なくとも１つの遺伝子又はその発現産物の発現レベルを測定する工程を含む、当該被験者におけるアトピー性皮膚炎の検出方法。
２）前記遺伝子と特異的にハイブリダイズするオリゴヌクレオチド、又は前記遺伝子の発現産物を認識する抗体を含有する、１）の方法に用いられるアトピー性皮膚炎を検出するための検査用キット。
３）後記表Ｂに示される２１０種の遺伝子群より選択される少なくとも１つの遺伝子又はその発現産物からなる、アトピー性皮膚炎の検出マーカー。 That is, the present invention relates to the following 1) to 3).
1) Regarding biological samples collected from subjects, MECR, RASA4CP, ARRDC4, EIF1AD, FDFT1, ZNF706, TEX2, TMPRSS11E, RPS6KB2, CTBP1, ZNF335, DGKA, PPP1R9B, SPDYE7P, DNASE1L1 A method for detecting atopic dermatitis in a subject, comprising the step of measuring the expression level of at least one gene selected from the group or an expression product thereof.
2) A test kit for detecting atopic dermatitis used in the method 1), which contains an oligonucleotide that specifically hybridizes with the gene or an antibody that recognizes an expression product of the gene.
3) A marker for detecting atopic dermatitis, which comprises at least one gene selected from the 210 gene groups shown in Table B below or an expression product thereof.

本発明によれば、簡便且つ非侵襲的に、アトピー性皮膚炎を早期に、高い精度、感度及び特異度で検出することが可能となる。 According to the present invention, it is possible to detect atopic dermatitis at an early stage with high accuracy, sensitivity and specificity in a simple and non-invasive manner.

本明細書中で引用された全ての特許文献、非特許文献、及びその他の刊行物は、その全体が本明細書中において参考として援用される。 All patented, non-patented, and other publications cited herein are hereby incorporated by reference in their entirety.

本発明において、「核酸」又は「ポリヌクレオチド」と云う用語は、ＤＮＡ又はＲＮＡを意味する。ＤＮＡには、ｃＤＮＡ、ゲノムＤＮＡ、及び合成ＤＮＡのいずれもが含まれ、「ＲＮＡ」には、ｔｏｔａｌＲＮＡ、ｍＲＮＡ、ｒＲＮＡ、ｔＲＮＡ、ｎｏｎ-ｃｏｄｉｎｇＲＮＡ及び合成のＲＮＡのいずれもが含まれる。 In the present invention, the term "nucleic acid" or "polynucleotide" means DNA or RNA. DNA includes any of cDNA, genomic DNA, and synthetic DNA, and "RNA" includes any of total RNA, mRNA, rRNA, tRNA, non-coding RNA, and synthetic RNA.

本発明において「遺伝子」とは、ヒトゲノムＤＮＡを含む２本鎖ＤＮＡの他、ｃＤＮＡを含む１本鎖ＤＮＡ（正鎖）、当該正鎖と相補的な配列を有する１本鎖ＤＮＡ（相補鎖）、及びこれらの断片を包含するものであって、ＤＮＡを構成する塩基の配列情報の中に、何らかの生物学的情報が含まれているものを意味する。
また、当該「遺伝子」は特定の塩基配列で表される「遺伝子」だけではなく、これらの同族体（すなわち、ホモログもしくはオーソログ）、遺伝子多型等の変異体、及び誘導体をコードする核酸が包含される。
本明細書中に開示される遺伝子の名称は、ＮＣＢＩ（[www.ncbi.nlm.nih.gov/]）に記載のあるＯｆｆｉｃｉａｌＳｙｍｂｏｌに従う。一方で遺伝子オントロジー（ＧＯ）に関しては、Ｓｔｒｉｎｇ（[string-db.org/]）に記載のあるＰａｔｈｗａｙＩＤ．に従う。 In the present invention, the "gene" refers to double-stranded DNA containing human genomic DNA, single-stranded DNA containing cDNA (positive strand), and single-stranded DNA having a sequence complementary to the positive strand (complementary strand). , And those fragments, which include some biological information in the sequence information of the bases constituting the DNA.
In addition, the "gene" includes not only "genes" represented by specific base sequences, but also nucleic acids encoding these homologues (that is, homologs or orthologs), mutants such as gene polymorphisms, and derivatives. Will be done.
The names of the genes disclosed herein follow the Official Symbol described in NCBI ([www.ncbi.nlm.nih.gov/]). On the other hand, regarding Gene Ontology (GO), Pathway ID. Follow.

本発明において、遺伝子の「発現産物」とは、遺伝子の転写産物及び翻訳産物を包含する概念である。「転写産物」とは、遺伝子（ＤＮＡ）から転写されて生じるＲＮＡであり、「翻訳産物」とは、ＲＮＡに基づき翻訳合成される、遺伝子にコードされたタンパク質を意味する。 In the present invention, the "expression product" of a gene is a concept including a transcript and a translation product of the gene. The "transcription product" is RNA produced by being transcribed from a gene (DNA), and the "translation product" means a protein encoded by a gene that is translated and synthesized based on RNA.

本発明において、「アトピー性皮膚炎」とは、増悪・寛解を繰り返す、そう痒のある湿疹を主病原とする疾患を指し、その患者の多くは、アトピー素因を持つとされている。アトピー素因としては、ｉ）家族歴・既往歴（気管支喘息，アレルギー性鼻炎・結膜炎，アトピー性皮膚炎のうちいずれか，あるいは複数の疾患）、又はｉｉ）ＩｇＥ抗体を産生し易い素因が挙げられる。 In the present invention, "atopic dermatitis" refers to a disease caused by pruritic eczema, which repeats exacerbations and remissions, and many of the patients are said to have an atopic predisposition. Examples of atopic predisposition include i) family history / medical history (one or more of bronchial asthma, allergic rhinitis / conjunctivitis, and atopic dermatitis), or ii) predisposition to easily produce IgE antibody. ..

本発明において、アトピー性皮膚炎の「検出」とは、アトピー性皮膚炎の存在又は不存在を明らかにする意味であり、検査、測定、判定、評価又は評価支援という用語で言い換えることもできる。なお、本明細書において「判定」又は「評価」という用語は、医師による判定や評価を含むものではない。 In the present invention, "detection" of atopic dermatitis means to clarify the presence or absence of atopic dermatitis, and can be paraphrased by the terms inspection, measurement, judgment, evaluation or evaluation support. In addition, the term "judgment" or "evaluation" in this specification does not include judgment or evaluation by a doctor.

本明細書において、「特徴量」とは機械学習における「説明変数」と同義であり、アトピー性皮膚炎検出マーカーから選択される機械学習に使用する遺伝子又はその発現産物を合わせて「特徴量遺伝子」と称する。 In the present specification, "feature amount" is synonymous with "explanatory variable" in machine learning, and a gene used for machine learning selected from atopic dermatitis detection markers or an expression product thereof is combined to form a "feature amount gene". ".

後述する実施例に示すように、成人の健常者１４名及びアトピー性皮膚炎患者２９名のＳＳＬから抽出されたＲＮＡの発現量のデータ（リードカウント値）について、ＤＥＳｅｑ２（Love MI et al. Genome Biol. 2014）を用いて補正されたカウント値（Ｎｏｒｍａｌｉｚｅｄｃｏｕｎｔ値）を用いて、健常者と比較してアトピー性皮膚炎患者における尤度比検定によるｐ値の補正値（ＦＤＲ）が０．０５未満となるＲＮＡを抽出することにより、発現上昇遺伝子４８種、発現低下遺伝子７５種（計１２３遺伝子（表１−１〜１−３）が同定された。表中、「ＵＰ」で示される遺伝子は、アトピー性皮膚炎患者で発現レベルが上がる遺伝子であり、「ＤＯＷＮ」で示される遺伝子は、アトピー性皮膚炎患者で発現レベルが下がる遺伝子である。
したがって、斯かる１２３種の遺伝子群より選択される遺伝子又はその発現産物は、アトピー性皮膚炎を検出するためのアトピー性皮膚炎マーカーとなり得る。当該遺伝子群のうち１０７遺伝子（表１−１〜１−３において＊を付し太字で表示）は、これまでにアトピー性皮膚炎との関係が報告されていない遺伝子である。 As shown in Examples described later, DESeq2 (Love MI et al. Genome) was used for RNA expression level data (read count value) extracted from the SSL of 14 healthy adult subjects and 29 atopic dermatitis patients. Using the count value (Normalized count value) corrected using Biol. 2014), the correction value (FDR) of the p value by the likelihood ratio test in patients with atopic dermatitis was 0.05 compared to healthy subjects. By extracting RNA that is less than, 48 types of up-expression genes and 75 types of down-expression genes (total of 123 genes (Tables 1-1 to 1-3) were identified. Genes indicated by "UP" in the table. Is a gene whose expression level is increased in patients with atopic dermatitis, and a gene represented by "DOWN" is a gene whose expression level is decreased in patients with atopic dermatitis.
Therefore, a gene selected from the 123 gene groups or an expression product thereof can be a marker for atopic dermatitis for detecting atopic dermatitis. Of the gene group, 107 genes (marked with * in Tables 1-1 to 1-3 and indicated in bold) are genes for which no relationship with atopic dermatitis has been reported so far.

また、被験者からの全てのＳＳＬ由来ＲＮＡの発現量のデータ（７４２９遺伝子のＬｏｇ_２（ＲＰＭ＋１）値）を説明変数とし、健常者とアトピー性皮膚炎患者を目的変数とし、機械学習アルゴリズムとしてランダムフォレスト（Breiman L. Machine Learning (2001) 45;5-32）を用いて特徴量遺伝子の抽出及び予測モデルの構築を試みた。後述する実施例に示すように、ジニ係数に基づく変数重要度の上位１５０遺伝子（表３−１〜３−４）を特徴量遺伝子として選択し、これを用いて予測モデルを構築したところ、アトピー性皮膚炎の予測が可能であることが示された。
したがって、斯かる１５０種の遺伝子群より選択される遺伝子又はその発現産物は、アトピー性皮膚炎を検出するための好適なアトピー性皮膚炎マーカーとなり得る。そして、このうち１２７遺伝子（表３−１〜３−４において＊を付し太字で表示）は、これまでにアトピー性皮膚炎との関係が報告されていない新規なアトピー性皮膚炎マーカーである。後述する実施例に示すように、これら新規のアトピー性皮膚炎マーカーを用いた予測モデルでもアトピー性皮膚炎の予測が可能である。 In addition, data on the expression levels of all SSL-derived RNAs from the subjects _{(Log 2} (RPM + 1) value of the 7429 gene) were used as explanatory variables, and healthy subjects and patients with atopic dermatitis were used as objective variables, and a random forest was used as a machine learning algorithm. Using (Breiman L. Machine Learning (2001) 45; 5-32), we attempted to extract feature-quantity genes and construct a prediction model. As shown in Examples described later, the top 150 genes with variable importance based on the Gini coefficient (Tables 3-1 to 3-4) were selected as feature genes, and a prediction model was constructed using them. As a result, atopy was constructed. It has been shown that sexual dermatitis can be predicted.
Therefore, a gene selected from such a group of 150 genes or an expression product thereof can be a suitable marker for atopic dermatitis for detecting atopic dermatitis. Of these, 127 genes (marked with * in Tables 3-1 to 3-4 and indicated in bold) are novel atopic dermatitis markers for which no relationship with atopic dermatitis has been reported so far. .. As shown in Examples described later, atopic dermatitis can be predicted even with a prediction model using these novel atopic dermatitis markers.

また、健常者とアトピー性皮膚炎患者で発現変動が見られた上記の１２３遺伝子又はそのうちの１０７遺伝子の発現量のデータ（Ｌｏｇ_２（ＲＰＭ＋１）値）を用いて、同様にランダムフォレストを用いて予測モデルの構築を試みたところ、いずれにおいてもアトピー性皮膚炎の予測が可能であることが示された。 _{In addition, using the data (Log 2} (RPM + 1) value) of the expression levels of the above 123 genes or 107 genes of which expression fluctuations were observed in healthy subjects and patients with atopic dermatitis, similarly using a random forest. When we tried to build a predictive model, it was shown that atopic dermatitis can be predicted in any case.

また、機械学習アルゴリズムとしてＢｏｒｕｔａ法（Kursa et al. Fundamental Informaticae (2010) 101;271-286）を用いて特徴量遺伝子の抽出（最大試行回数１０００回、ｐ値０．０１未満）を行ったところ、４５遺伝子（表４）が特徴量遺伝子として抽出され、後述の実施例で示すように、これを用いたランダムフォレストによる予測モデルでアトピー性皮膚炎の予測が可能であることが示された。
したがって、斯かる４５種の遺伝子群より選択される遺伝子又はその発現産物は、アトピー性皮膚炎を検出するための好適なアトピー性皮膚炎マーカーとなり得る。そして、このうち３９遺伝子（表４において＊を付し太字で表示）は、これまでにアトピー性皮膚炎との関係が報告されていない新規なアトピー性皮膚炎マーカーである。後述する実施例に示すように、これら新規のアトピー性皮膚炎マーカーを用いた予測モデルでもアトピー性皮膚炎の予測が可能である。 In addition, the feature gene was extracted using the Boruta method (Kursa et al. Fundamental Informaticae (2010) 101; 271-286) as a machine learning algorithm (maximum number of trials 1000 times, p value less than 0.01). , 45 genes (Table 4) were extracted as feature genes, and as shown in Examples described later, it was shown that atopic dermatitis can be predicted by a prediction model using a random forest using these genes.
Therefore, a gene selected from such a group of 45 genes or an expression product thereof can be a suitable marker for atopic dermatitis for detecting atopic dermatitis. Of these, 39 genes (marked with * in Table 4 and indicated in bold) are novel atopic dermatitis markers for which no relationship with atopic dermatitis has been reported so far. As shown in Examples described later, atopic dermatitis can be predicted even with a prediction model using these novel atopic dermatitis markers.

上述した発現変動解析で抽出された表１−１〜１−３で示される１２３種の遺伝子群（Ａ）と、ランダムフォレストにより特徴量遺伝子として選択された表３−１〜３−４で示される１５０種の遺伝子群（Ｂ）と、Ｂｏｒｕｔａ法により特徴量遺伝子として選択された表４で示される４５種の遺伝子群（Ｃ）の和（Ａ∪Ｂ∪Ｃ）である２４５遺伝子（表Ａ）はアトピー性皮膚炎マーカーとなり、そのうち２１０遺伝子（表Ｂ）は新規なアトピー性皮膚炎マーカーである。
The 123 gene groups (A) shown in Tables 1-1 to 1-3 extracted by the above-mentioned expression fluctuation analysis and Tables 3-1 to 3-4 selected as characteristic amount genes by a random forest are shown. 245 genes (Table A), which is the sum (A∪B∪C) of the 150 gene groups (B) and the 45 gene groups (C) shown in Table 4 selected as the feature amount genes by the Boruta method. ) Is a marker for atopic dermatitis, of which 210 genes (Table B) are novel markers for atopic dermatitis.

本発明のＭＥＣＲ、ＲＡＳＡ４ＣＰ、ＡＲＲＤＣ４、ＥＩＦ１ＡＤ、ＦＤＦＴ１、ＺＮＦ７０６、ＴＥＸ２、ＴＭＰＲＳＳ１１Ｅ、ＲＰＳ６ＫＢ２、ＣＴＢＰ１、ＺＮＦ３３５、ＤＧＫＡ、ＰＰＰ１Ｒ９Ｂ、ＳＰＤＹＥ７Ｐ、ＤＮＡＳＥ１Ｌ１、ＧＮＢ２及びＣＳＮＫ１Ｇ２からなる１７種の遺伝子は、上述した発現変動解析で抽出された表１−１〜１−３で示される１２３種の遺伝子群（Ａ）と、ランダムフォレストにより特徴量遺伝子として選択された表３−１〜３−４で示される１５０種の遺伝子群（Ｂ）と、Ｂｏｒｕｔａ法により特徴量遺伝子として選択された表４で示される４５種の遺伝子群（Ｃ）に共通する遺伝子（Ａ∩Ｂ∩Ｃ）であって、従来アトピー性皮膚炎と関連付けられていない遺伝子である（各表において＊を付し太字で表示）。したがって、これらの遺伝子群より選択される少なくとも１つの遺伝子又はその発現産物は、アトピー性皮膚炎を検出するための、新規なアトピー性皮膚炎マーカーとして特に有用である。
斯かる１７種の遺伝子はそれぞれ単独でアトピー性皮膚炎マーカーとなり得るが、２種以上、好ましくは５種以上、より好ましくは１０種以上を組み合わせて用いることが好ましく、１７種全てを用いるのがよりさらに好ましい。 MECR, RASA4CP, ARRDC4, EIF1AD, FDFT1, ZNF706, TEX2, TMPRSS11E, RPS6KB2, CTBP1, ZNF335, DGKA, PPP1R9B, SPDYE7P, DNASE1L1, GNB2 and CSNK1 123 gene groups (A) shown in Tables 1-1 to 1-3 extracted and 150 gene groups shown in Tables 3-1 to 3-4 selected as feature genes by random forest. (B) and the gene (A∩B∩C) common to the 45 gene groups (C) shown in Table 4 selected as the feature gene by the Boruta method, and are conventionally associated with atopic dermatitis. Genes that have not been identified (marked with * in each table and shown in bold). Therefore, at least one gene selected from these gene groups or an expression product thereof is particularly useful as a novel atopic dermatitis marker for detecting atopic dermatitis.
Each of these 17 genes can be a marker for atopic dermatitis alone, but it is preferable to use 2 or more, preferably 5 or more, more preferably 10 or more in combination, and it is preferable to use all 17 genes. Even more preferable.

なお、上記のアトピー性皮膚炎マーカーとなり得る遺伝子（以下、「標的遺伝子」とも称す）には、アトピー性皮膚炎を検出するためのバイオマーカーとなり得る限り、当該遺伝子を構成するＤＮＡの塩基配列と実質的に同一の塩基配列を有する遺伝子も包含される。ここで、実質的に同一の塩基配列とは、例えば、相同性計算アルゴリズムNCBI BLASTを用い、期待値＝１０；ギャップを許す；フィルタリング＝ＯＮ；マッチスコア＝１；ミスマッチスコア＝−３の条件にて検索をした場合、当該遺伝子を構成するＤＮＡの塩基配列と９０％以上、好ましくは９５％以上、さらにより好ましく９８％以上の同一性があることを意味する。 The gene that can be a marker for atopic dermatitis (hereinafter, also referred to as "target gene") includes the base sequence of the DNA that constitutes the gene as long as it can be a biomarker for detecting atopic dermatitis. Genes having substantially the same base sequence are also included. Here, with substantially the same base sequence, for example, the homology calculation algorithm NCBI BLAST is used, and the condition of expected value = 10; gap is allowed; filtering = ON; match score = 1; mismatch score = -3. It means that there is 90% or more, preferably 95% or more, and even more preferably 98% or more identity with the base sequence of the DNA constituting the gene.

本発明のアトピー性皮膚炎の検出方法は、被験者から採取された生体試料について、標的遺伝子、一態様として、ＭＥＣＲ、ＲＡＳＡ４ＣＰ、ＡＲＲＤＣ４、ＥＩＦ１ＡＤ、ＦＤＦＴ１、ＺＮＦ７０６、ＴＥＸ２、ＴＭＰＲＳＳ１１Ｅ、ＲＰＳ６ＫＢ２、ＣＴＢＰ１、ＺＮＦ３３５、ＤＧＫＡ、ＰＰＰ１Ｒ９Ｂ、ＳＰＤＹＥ７Ｐ、ＤＮＡＳＥ１Ｌ１、ＧＮＢ２及びＣＳＮＫ１Ｇ２からなる１７種の遺伝子群より選択される少なくとも１つの遺伝子又はその発現産物の発現レベルを測定する工程を含む。 The method for detecting atopic dermatitis of the present invention is a target gene, as one embodiment, MECR, RASA4CP, ARRDC4, EIF1AD, FDFT1, ZNF706, TEX2, TMPRSS11E, RPS6KB2, CTBP1, ZNF335, for a biological sample collected from a subject. It comprises a step of measuring the expression level of at least one gene selected from a group of 17 genes consisting of DGKA, PPP1R9B, SPDYE7P, DNASE1L1, GNB2 and CSNK1G2 or an expression product thereof.

本発明において用いられる生体試料としては、アトピー性皮膚炎の発症、進行に伴い本発明の遺伝子が発現変化する組織及び生体材料であればよい。具体的には臓器、皮膚、血液、尿、唾液、汗、皮膚表上脂質（ＳＳＬ）、組織浸出液等の体液、血液から調製された血清、血漿、その他、便、毛髪等が挙げられ、好ましくは皮膚または皮膚表上脂質（ＳＳＬ）、より好ましくは皮膚表上脂質（ＳＳＬ）が挙げられる。
また、生体試料を採取する被験体は、アトピー性皮膚炎の検出を必要とする人又はアトピー性皮膚炎の発症が疑われる人が好ましく、性別及び年齢は限定されないが、好ましくは成人であり、具体的には１６歳以上の人であり、好ましくは２０歳以上の人である。 The biological sample used in the present invention may be a tissue or a biomaterial whose expression changes with the onset and progression of atopic dermatitis. Specific examples thereof include body fluids such as organs, skin, blood, urine, saliva, sweat, skin surface lipids (SSL) and tissue exudates, serum prepared from blood, plasma, others, stool, hair and the like, which are preferable. Is skin or skin surface lipid (SSL), more preferably skin surface lipid (SSL).
In addition, the subject for which the biological sample is collected is preferably a person who needs to detect atopic dermatitis or a person who is suspected of developing atopic dermatitis, and is not limited in gender and age, but is preferably an adult. Specifically, it is a person 16 years or older, preferably a person 20 years or older.

ここで、「皮膚表上脂質（ＳＳＬ）」とは、皮膚の表上に存在する脂溶性画分をいい、皮脂と呼ばれることもある。一般に、ＳＳＬは、皮膚にある皮脂腺等の外分泌腺から分泌された分泌物を主に含み、皮膚表面を覆う薄い層の形で皮膚表上に存在している。ＳＳＬは、皮膚細胞で発現したＲＮＡを含む。（前記特許文献３参照）。また本明細書において、「皮膚」とは、特に限定しない限り、体表の表皮、真皮、毛包、ならびに汗腺、皮脂腺及びその他の腺等の組織を含む領域の総称である。 Here, "skin surface lipid (SSL)" refers to a fat-soluble fraction existing on the surface of the skin, and is sometimes called sebum. In general, SSL mainly contains secretions secreted from exocrine glands such as sebaceous glands in the skin, and is present on the surface of the skin in the form of a thin layer covering the skin surface. SSL contains RNA expressed in skin cells. (See Patent Document 3 above). Further, in the present specification, "skin" is a general term for regions including the epidermis, dermis, hair follicles, and tissues such as sweat glands, sebaceous glands, and other glands on the body surface, unless otherwise specified.

被験者の皮膚からのＳＳＬの採取には、皮膚からのＳＳＬの回収又は除去に用いられているあらゆる手段を採用することができる。好ましくは、後述するＳＳＬ吸収性素材、ＳＳＬ接着性素材、又は皮膚からＳＳＬをこすり落とす器具を使用することができる。ＳＳＬ吸収性素材又はＳＳＬ接着性素材としては、ＳＳＬに親和性を有する素材であれば特に限定されず、例えばポリプロピレン、パルプ等が挙げられる。皮膚からのＳＳＬの採取手順のより詳細な例としては、あぶら取り紙、あぶら取りフィルム等のシート状素材へＳＳＬを吸収させる方法、ガラス板、テープ等へＳＳＬを接着させる方法、スパーテル、スクレイパー等によりＳＳＬをこすり落として回収する方法、等が挙げられる。ＳＳＬの吸着性を向上させるため、脂溶性の高い溶媒を予め含ませたＳＳＬ吸収性素材を用いてもよい。一方、ＳＳＬ吸収性素材は、水溶性の高い溶媒や水分を含んでいるとＳＳＬの吸着が阻害されるため、水溶性の高い溶媒や水分の含有量が少ないことが好ましい。ＳＳＬ吸収性素材は、乾燥した状態で用いることが好ましい。ＳＳＬが採取される皮膚の部位としては、特に限定されず、頭、顔、首、体幹、手足等の身体の任意の部位の皮膚が挙げられ、皮脂の分泌が多い部位、例えば顔の皮膚が好ましい。 For the collection of SSL from the subject's skin, any means used to recover or remove SSL from the skin can be employed. Preferably, an SSL-absorbing material, an SSL-adhesive material, or an instrument that scrapes the SSL from the skin, which will be described later, can be used. The SSL-absorbing material or the SSL-adhesive material is not particularly limited as long as it is a material having an affinity for SSL, and examples thereof include polypropylene and pulp. More detailed examples of the procedure for collecting SSL from the skin include a method of absorbing SSL into a sheet-like material such as oil removal paper and oil removal film, a method of adhering SSL to a glass plate, tape, etc., a spatula, a scraper, etc. There is a method of scraping off the SSL and collecting the SSL. In order to improve the adsorptivity of SSL, an SSL-absorbing material containing a highly lipophilic solvent in advance may be used. On the other hand, if the SSL-absorbent material contains a highly water-soluble solvent or water, the adsorption of SSL is inhibited, so that the content of the highly water-soluble solvent or water is preferably small. The SSL-absorbent material is preferably used in a dry state. The part of the skin from which the SSL is collected is not particularly limited, and examples thereof include the skin of any part of the body such as the head, face, neck, trunk, and limbs, and the part where sebum is secreted, for example, the skin of the face. Is preferable.

被験者から採取されたＲＮＡ含有ＳＳＬは一定期間保存されてもよい。採取されたＳＳＬは、含有するＲＮＡの分解を極力抑えるために、採取後できるだけ速やかに低温条件で保存することが好ましい。本発明における該ＲＮＡ含有ＳＳＬの保存の温度条件は、０℃以下であればよく、好ましくは−２０±２０℃〜−８０±２０℃、より好ましくは−２０±１０℃〜−８０±１０℃、さらに好ましくは−２０±２０℃〜−４０±２０℃、さらに好ましくは−２０±１０℃〜−４０±１０℃、さらに好ましくは−２０±１０℃、さらに好ましくは−２０±５℃である。該ＲＮＡ含有ＳＳＬの該低温条件での保存の期間は、特に限定されないが、好ましくは１２か月以下、例えば６時間以上１２ヶ月以下、より好ましくは６ヶ月以下、例えば１日間以上６ヶ月以下、さらに好ましくは３ヶ月以下、例えば３日間以上３ヶ月以下である。 RNA-containing SSL collected from the subject may be stored for a period of time. The collected SSL is preferably stored under low temperature conditions as soon as possible after collection in order to suppress the decomposition of the contained RNA as much as possible. The storage temperature condition of the RNA-containing SSL in the present invention may be 0 ° C. or lower, preferably −20 ± 20 ° C. to −80 ± 20 ° C., more preferably −20 ± 10 ° C. to −80 ± 10 ° C. , More preferably −20 ± 20 ° C. to −40 ± 20 ° C., further preferably −20 ± 10 ° C. to −40 ± 10 ° C., further preferably −20 ± 10 ° C., still more preferably −20 ± 5 ° C. .. The storage period of the RNA-containing SSL under the low temperature condition is not particularly limited, but is preferably 12 months or less, for example, 6 hours or more and 12 months or less, more preferably 6 months or less, for example, 1 day or more and 6 months or less. More preferably, it is 3 months or less, for example, 3 days or more and 3 months or less.

本発明において、標的遺伝子又はその発現産物の発現レベルの測定対象としては、ＲＮＡから人工的に合成されたｃＤＮＡ、そのＲＮＡをエンコードするＤＮＡ、そのＲＮＡにコードされるタンパク質、該タンパク質と相互作用をする分子、そのＲＮＡと相互作用する分子、又はそのＤＮＡと相互作用する分子等が挙げられる。ここで、ＲＮＡ、ＤＮＡ又はタンパク質と相互作用する分子としては、ＤＮＡ、ＲＮＡ、タンパク質、多糖、オリゴ糖、単糖、脂質、脂肪酸、及びこれらのリン酸化物、アルキル化物、糖付加物等、及び上記いずれかの複合体が挙げられる。また、発現レベルとは、当該遺伝子又は発現産物の発現量や活性を包括的に意味する。 In the present invention, as a target for measuring the expression level of a target gene or an expression product thereof, a cDNA artificially synthesized from RNA, a DNA encoding the RNA, a protein encoded by the RNA, and an interaction with the protein are used. Examples include molecules that interact with RNA, molecules that interact with DNA, and the like. Here, examples of molecules that interact with RNA, DNA or protein include DNA, RNA, proteins, polysaccharides, oligosaccharides, monosaccharides, lipids, fatty acids, their phosphorylates, alkylated products, sugar adducts and the like, and the like. Any of the above complexes can be mentioned. The expression level comprehensively means the expression level and activity of the gene or expression product.

本発明の方法においては、好ましい態様として、生体試料としてＳＳＬが用いられるが、この場合にはＳＳＬに含まれるＲＮＡの発現レベルが解析され、具体的にはＲＮＡを逆転写によりｃＤＮＡに変換した後、該ｃＤＮＡ又はその増幅産物が測定される。
ＳＳＬからのＲＮＡの抽出には、生体試料からのＲＮＡの抽出又は精製に通常使用される方法、例えば、フェノール/クロロホルム法、ＡＧＰＣ（ａｃｉｄｇｕａｎｉｄｉｎｉｕｍｔｈｉｏｃｙａｎａｔｅ−ｐｈｅｎｏｌ−ｃｈｌｏｒｏｆｏｒｍｅｘｔｒａｃｔｉｏｎ）法、又はＴＲＩｚｏｌ（登録商標）、ＲＮｅａｓｙ（登録商標）、ＱＩＡｚｏｌ（登録商標）等のカラムを用いた方法、シリカをコーティングした特殊な磁性体粒子を用いる方法、ＳｏｌｉｄＰｈａｓｅＲｅｖｅｒｓｉｂｌｅＩｍｍｏｂｉｌｉｚａｔｉｏｎ磁性体粒子を用いる方法、ＩＳＯＧＥＮ等の市販のＲＮＡ抽出試薬による抽出等を用いることができる。 In the method of the present invention, SL is used as a biological sample as a preferred embodiment. In this case, the expression level of RNA contained in SSL is analyzed, and specifically, after RNA is converted to cDNA by reverse transcription. , The cDNA or its amplification product is measured.
For the extraction of RNA from SSL, methods commonly used for extracting or purifying RNA from biological samples, such as the phenol / chloroform method, the AGPC (acid guanidinium thiocyanate-phenyl-chromoform extraction) method, or TRIzol®. ), RNAy®, QIAzol® and other columns, silica-coated special magnetic particles, Solid Phase Reversible Imaging magnetic particles, ISOGEN and other commercially available products. Extraction with an RNA extraction reagent or the like can be used.

該逆転写には、解析したい特定のＲＮＡを標的としたプライマーを用いてもよいが、より包括的な核酸の保存及び解析のためにはランダムプライマーを用いることが好ましい。該逆転写には、一般的な逆転写酵素又は逆転写試薬キットを使用することができる。好適には、正確性及び効率性の高い逆転写酵素又は逆転写試薬キットが用いられ、その例としては、Ｍ−ＭＬＶＲｅｖｅｒｓｅＴｒａｎｓｃｒｉｐｔａｓｅ及びその改変体、あるいは市販の逆転写酵素又は逆転写試薬キット、例えばＰｒｉｍｅＳｃｒｉｐｔ（登録商標）ＲｅｖｅｒｓｅＴｒａｎｓｃｒｉｐｔａｓｅシリーズ（タカラバイオ社）、ＳｕｐｅｒＳｃｒｉｐｔ（登録商標）ＲｅｖｅｒｓｅＴｒａｎｓｃｒｉｐｔａｓｅシリーズ（ＴｈｅｒｍｏＳｃｉｅｎｔｉｆｉｃ社）等が挙げられる。ＳｕｐｅｒＳｃｒｉｐｔ（登録商標）III ＲｅｖｅｒｓｅＴｒａｎｓｃｒｉｐｔａｓｅ、ＳｕｐｅｒＳｃｒｉｐｔ（登録商標）ＶＩＬＯｃＤＮＡＳｙｎｔｈｅｓｉｓｋｉｔ（いずれもＴｈｅｒｍｏＳｃｉｅｎｔｉｆｉｃ社）等が好ましく用いられる。
該逆転写における伸長反応は、温度を好ましくは４２℃±１℃、より好ましくは４２℃±０．５℃、さらに好ましくは４２℃±０．２５℃に調整し、一方、反応時間を好ましくは６０分間以上、より好ましくは８０〜１２０分間に調整するのが好ましい。 For the reverse transcription, primers targeting the specific RNA to be analyzed may be used, but it is preferable to use random primers for more comprehensive nucleic acid storage and analysis. A general reverse transcriptase or reverse transcriptase kit can be used for the reverse transcription. Preferably, a highly accurate and efficient reverse transcriptase or reverse transcriptase kit is used, and examples thereof include M-MLV Reverse Transcriptase and its variants, or commercially available reverse transcriptase or reverse transcriptase kits. For example, PrimeScript (registered trademark) Reverse Transcriptase series (Takara Bio Co., Ltd.), SuperScript (registered trademark) Reverse Transcriptase series (Thermo Scientific) and the like can be mentioned. SuperScript (registered trademark) III Reverse Transcriptase, SuperScript (registered trademark) VILO cDNA Synthesis kit (both from Thermo Scientific) and the like are preferably used.
For the extension reaction in the reverse transcription, the temperature is preferably adjusted to 42 ° C. ± 1 ° C., more preferably 42 ° C. ± 0.5 ° C., still more preferably 42 ° C. ± 0.25 ° C., while the reaction time is preferably adjusted to 42 ° C. ± 0.25 ° C. It is preferably adjusted for 60 minutes or more, more preferably 80 to 120 minutes.

発現レベルを測定する方法は、ＲＮＡ、ｃＤＮＡ又はＤＮＡを対象とする場合、これらにハイブリダイズするＤＮＡをプライマーとしたＰＣＲ法、リアルタイムＲＴ−ＰＣＲ法、マルチプレックスＰＣＲ、ＳｍａｒｔＡｍｐ法、ＬＡＭＰ法等に代表される核酸増幅法、これらにハイブリダイズする核酸をプローブとして用いるハイブリダイゼーション法（ＤＮＡチップ、ＤＮＡマイクロアレイ、ドットブロットハイブリダイゼーション、スロットブロットハイブリダイゼーション、ノーザンブロットハイブリダイゼーション等）、塩基配列を決定する方法（シーケンシング）、又はこれらを組み合わせた方法から選ぶことができる。 When RNA, cDNA or DNA is targeted, the method for measuring the expression level is represented by the PCR method using the DNA hybridizing to these as a primer, the real-time RT-PCR method, the multiplex PCR, the SmartAmp method, the LAMP method, and the like. Nucleic acid amplification method, hybridization method using nucleic acid hybridizing to these as a probe (DNA chip, DNA microarray, dot blot hybridization, slot blot hybridization, Northern blot hybridization, etc.), method for determining base sequence ( Sequencing) or a combination of these methods can be selected.

ＰＣＲでは、解析したい特定のＤＮＡを標的としたプライマーペアを用いて該特定の1種のＤＮＡのみを増幅してもよいが、複数のプライマーペアを用いて同時に複数の特定のＤＮＡを増幅してもよい。好ましくは、該ＰＣＲはマルチプレックスＰＣＲである。マルチプレックスＰＣＲは、ＰＣＲ反応系に複数のプライマー対を同時に使用することで、複数の遺伝子領域を同時に増幅する方法である。マルチプレックスＰＣＲは、市販のキット（例えば、ＩｏｎＡｍｐｌｉＳｅｑＴｒａｎｓｃｒｉｐｔｏｍｅＨｕｍａｎＧｅｎｅＥｘｐｒｅｓｓｉｏｎＫｉｔ；ライフテクノロジーズジャパン株式会社等）を用いて実施することができる。
該ＰＣＲにおけるアニーリング及び伸長反応の温度は、使用するプライマーに依存するため一概には言えないが、上記のマルチプレックスＰＣＲキットは用いる場合、好ましくは６２℃±１℃、より好ましくは６２℃±０．５℃、さらに好ましくは６２℃±０．２５℃である。したがって、該ＰＣＲでは、好ましくはアニーリング及び伸長反応が１ステップで行われる。該アニーリング及び伸長反応のステップの時間は、増幅すべきＤＮＡのサイズ等に依存して調整され得るが、好ましくは１４〜１８分間である。該ＰＣＲにおける変性反応の条件は、増幅すべきＤＮＡに依存して調整され得るが、好ましくは９５〜９９℃で１０〜６０秒間である。上記のような温度及び時間での逆転写及びＰＣＲは、一般的にＰＣＲに使用されるサーマルサイクラーを用いて実行することができる。 In PCR, only one specific DNA may be amplified using a primer pair targeting a specific DNA to be analyzed, but a plurality of specific DNAs may be amplified at the same time using a plurality of primer pairs. May be good. Preferably, the PCR is multiplex PCR. Multiplex PCR is a method of simultaneously amplifying a plurality of gene regions by simultaneously using a plurality of primer pairs in a PCR reaction system. Multiplex PCR can be carried out using a commercially available kit (for example, Ion AmpliSeqTranscriptome Human Gene Expression Kit; Life Technologies Japan Co., Ltd., etc.).
The temperature of the annealing and extension reactions in the PCR cannot be unequivocally determined because it depends on the primers used, but when the above multiplex PCR kit is used, it is preferably 62 ° C ± 1 ° C, more preferably 62 ° C ± 0. It is 5.5 ° C., more preferably 62 ° C. ± 0.25 ° C. Therefore, in the PCR, the annealing and extension reactions are preferably carried out in one step. The time of the annealing and extension reaction steps can be adjusted depending on the size of the DNA to be amplified and the like, but is preferably 14 to 18 minutes. The conditions of the denaturation reaction in the PCR can be adjusted depending on the DNA to be amplified, but are preferably 95 to 99 ° C. for 10 to 60 seconds. Reverse transcription and PCR at temperature and time as described above can be performed using a thermal cycler commonly used for PCR.

当該ＰＣＲで得られた反応産物の精製は、反応産物のサイズ分離によって行われることが好ましい。サイズ分離により、目的のＰＣＲ反応産物を、ＰＣＲ反応液中に含まれるプライマーやその他の不純物から分離することができる。ＤＮＡのサイズ分離は、例えば、サイズ分離カラムや、サイズ分離チップ、サイズ分離に利用可能な磁気ビーズ等によって行うことができる。サイズ分離に利用可能な磁気ビーズの好ましい例としては、ＡｍｐｕｒｅＸＰ等のＳｏｌｉｄＰｈａｓｅＲｅｖｅｒｓｉｂｌｅＩｍｍｏｂｉｌｉｚａｔｉｏｎ（ＳＰＲＩ）磁性ビーズが挙げられる。 Purification of the reaction product obtained by the PCR is preferably performed by size separation of the reaction product. By size separation, the PCR reaction product of interest can be separated from the primers and other impurities contained in the PCR reaction solution. DNA size separation can be performed by, for example, a size separation column, a size separation chip, magnetic beads that can be used for size separation, or the like. Preferred examples of magnetic beads that can be used for size separation include Solid Phase Reversible Imaging (SPRI) magnetic beads such as Aple XP.

精製したＰＣＲ反応産物に対して、その後の定量解析を行うために必要なさらなる処理を施してもよい。例えば、ＤＮＡのシーケンシングのために、精製したＰＣＲ反応産物を、適切なバッファー溶液へと調製したり、ＰＣＲ増幅されたＤＮＡに含まれるＰＣＲプライマー領域を切断したり、増幅されたＤＮＡにアダプター配列をさらに付加したりしてもよい。例えば、精製したＰＣＲ反応産物をバッファー溶液へと調製し、増幅ＤＮＡに対してＰＣＲプライマー配列の除去及びアダプターライゲーションを行い、得られた反応産物を、必要に応じて増幅して、定量解析のためのライブラリーを調製することができる。これらの操作は、例えば、ＳｕｐｅｒＳｃｒｉｐｔ（登録商標）ＶＩＬＯｃＤＮＡＳｙｎｔｈｅｓｉｓｋｉｔ（ライフテクノロジーズジャパン株式会社）に付属している５×ＶＩＬＯＲＴＲｅａｃｔｉｏｎＭｉｘ、及びＩｏｎＡｍｐｌｉＳｅｑＴｒａｎｓｃｒｉｐｔｏｍｅＨｕｍａｎＧｅｎｅＥｘｐｒｅｓｓｉｏｎＫｉｔ（ライフテクノロジーズジャパン株式会社）に付属している５×ＩｏｎＡｍｐｌｉＳｅｑＨｉＦｉＭｉｘ、及びＩｏｎＡｍｐｌｉＳｅｑＴｒａｎｓｃｒｉｐｔｏｍｅＨｕｍａｎＧｅｎｅＥｘｐｒｅｓｓｉｏｎＣｏｒｅＰａｎｅｌを用いて、各キット付属のプロトコルに従って行うことができる。 The purified PCR reaction product may be subjected to further treatment necessary for subsequent quantitative analysis. For example, for DNA sequencing, the purified PCR reaction product can be prepared into a suitable buffer solution, the PCR primer region contained in the PCR-amplified DNA can be cleaved, or the adapter sequence can be added to the amplified DNA. May be further added. For example, the purified PCR reaction product is prepared into a buffer solution, the PCR primer sequence is removed and adapter ligation is performed on the amplified DNA, and the obtained reaction product is amplified as necessary for quantitative analysis. Library can be prepared. These operations are performed, for example, by 5 × VILO RT Reaction Mix attached to SuperScript (registered trademark) VILO cDNA Synthesis kit (Life Technologies Japan Co., Ltd.) and Ion AmpliSeq Transcriptome Human Gene Expression Japan Co., Ltd. It can be performed according to the protocol included in each kit using the 5 × Ion Ampliseq HiFi Mix and the Ion Ampliseq Transcriptome Human Gene Expression Core Panel included in the kit.

ノーザンブロットハイブリダイゼーション法を利用して標的遺伝子又はそれに由来する核酸の発現量を測定する場合は、例えば、まずプローブＤＮＡを放射性同位元素、蛍光物質等で標識し、次いで、得られた標識ＤＮＡを、常法に従ってナイロンメンブレン等にトランスファーした生体試料由来のＲＮＡとハイブリダイズさせる。その後、形成された標識ＤＮＡとＲＮＡとの二重鎖を、標識物に由来するシグナルを検出することにより測定する方法が挙げられる。 When measuring the expression level of a target gene or a nucleic acid derived from it using the Northern blot hybridization method, for example, the probe DNA is first labeled with a radioactive isotope, a fluorescent substance, or the like, and then the obtained labeled DNA is used. , Hybridize with RNA derived from a biological sample transferred to a nylon membrane or the like according to a conventional method. Then, a method of measuring the double chain of the formed labeled DNA and RNA by detecting a signal derived from the labeled substance can be mentioned.

ＲＴ−ＰＣＲ法を用いて標的遺伝子又はそれに由来する核酸の発現量を測定する場合は、例えば、まず生体試料由来のＲＮＡから常法に従ってｃＤＮＡを調製し、これを鋳型として本発明の標的遺伝子が増幅できるように調製した一対のプライマー（上記ｃＤＮＡ（−鎖）に結合する正鎖、＋鎖に結合する逆鎖）をこれとハイブリダイズさせる。その後、常法に従ってＰＣＲ法を行い、得られた増幅二本鎖ＤＮＡを検出する。増幅された二本鎖ＤＮＡの検出には、予めＲＩ、蛍光物質等で標識しておいたプライマーを用いて上記ＰＣＲを行うことによって産生される標識二本鎖ＤＮＡを検出する方法等を用いることができる。 When measuring the expression level of a target gene or a nucleic acid derived from the target gene using the RT-PCR method, for example, cDNA is first prepared from RNA derived from a biological sample according to a conventional method, and the target gene of the present invention is used as a template. A pair of primers prepared for amplification (a normal chain that binds to the above-mentioned cDNA (-chain) and a reverse chain that binds to the + chain) are hybridized with this. Then, the PCR method is performed according to a conventional method, and the obtained amplified double-stranded DNA is detected. To detect the amplified double-stranded DNA, a method for detecting the labeled double-stranded DNA produced by performing the above PCR using a primer labeled with RI, a fluorescent substance, or the like in advance is used. Can be done.

ＤＮＡマイクロアレイを用いて標的遺伝子又はそれに由来する核酸の発現量を測定する場合は、例えば、支持体に本発明の標的遺伝子由来の核酸（ｃＤＮＡ又はＤＮＡ）の少なくとも１種を固定化したアレイを用い、ｍＲＮＡから調製した標識化ｃＤＮＡ又はｃＲＮＡをマイクロアレイ上に結合させ、マイクロアレイ上の標識を検出することによって、ｍＲＮＡの発現量を測定することができる。
前記アレイに固定化される核酸としては、ストリンジェントな条件下に特異的（すなわち、実質的に目的の核酸のみに）にハイブリダイズする核酸であればよく、例えば、本発明の標的遺伝子の全配列を有する核酸であってもよく、部分配列からなる核酸であってもよい。ここで、「部分配列」とは、少なくとも１５〜２５塩基からなる核酸が挙げられる。ここでストリンジェントな条件は、通常「１×ＳＳＣ、０．１％ＳＤＳ、３７℃」程度の洗浄条件を挙げることができ、より厳しいハイブリダイズ条件としては「０．５×ＳＳＣ、０．１％ＳＤＳ、４２℃」程度、さらに厳しいハイブリダイズ条件としては「０．１×ＳＳＣ、０．１％ＳＤＳ、６５℃」程度の条件を挙げることができる。ハイブリダイズ条件は、J. Sambrook et al., Molecular Cloning: A Laboratory Manual, Thrd Edition, Cold Spring Harbor Laboratory Press (2001)等に記載されている。 When measuring the expression level of a target gene or a nucleic acid derived from the target gene using a DNA microarray, for example, an array in which at least one of the nucleic acids (cDNA or DNA) derived from the target gene of the present invention is immobilized on a support is used. , Labeled cDNA or cRNA prepared from mRNA can be bound on a microarray, and the expression level of mRNA can be measured by detecting the label on the microarray.
The nucleic acid immobilized on the array may be a nucleic acid that specifically hybridizes under stringent conditions (that is, substantially only to the nucleic acid of interest), and for example, all of the target genes of the present invention. It may be a nucleic acid having a sequence or a nucleic acid consisting of a partial sequence. Here, the "partial sequence" includes nucleic acids consisting of at least 15 to 25 bases. Here, stringent conditions can usually include cleaning conditions of about "1 x SSC, 0.1% SDS, 37 ° C.", and stricter hybridization conditions include "0.5 x SSC, 0.1". % SDS, 42 ° C. ”, and more severe hybridization conditions include“ 0.1 × SSC, 0.1% SDS, 65 ° C. ”. Hybridization conditions are described in J. Sambrook et al., Molecular Cloning: A Laboratory Manual, Thrd Edition, Cold Spring Harbor Laboratory Press (2001), etc.

シーケンシングによって標的遺伝子又はそれに由来する核酸の発現量を測定する場合は、例えば、次世代シーケンサー（例えばＩｏｎＳ５／ＸＬシステム、ライフテクノロジーズジャパン株式会社）が用いて解析することが挙げられる。シーケンシングで作成されたリードの数（リードカウント）に基づいて、ＲＮＡ発現を定量することができる。 When measuring the expression level of a target gene or a nucleic acid derived from the target gene by sequencing, for example, analysis using a next-generation sequencer (for example, Ion S5 / XL system, Life Technologies Japan Co., Ltd.) can be mentioned. RNA expression can be quantified based on the number of reads created by sequencing (read count).

上記の測定に用いられるプローブ又はプライマー、すなわち、本発明の標的遺伝子又はそれに由来する核酸を特異的に認識し増幅するためのプライマー、又は該ＲＮＡ又はそれに由来する核酸を特異的に検出するためのプローブがこれに該当するが、これらは、当該標的遺伝子を構成する塩基配列に基づいて設計することができる。ここで「特異的に認識する」とは、例えばノーザンブロット法において、実質的に本発明の標的遺伝子又はそれに由来する核酸のみを検出できること、また例えばＲＴ−ＰＣＲ法において、実質的に当該核酸のみが増幅される如く、当該検出物又は生成物が当該遺伝子又はそれに由来する核酸であると判断できることを意味する。
具体的には、本発明の標的遺伝子を構成する塩基配列からなるＤＮＡ又はその相補鎖に相補的な一定数のヌクレオチドを含むオリゴヌクレオチドを利用することができる。ここで「相補鎖」とは、Ａ：Ｔ（ＲＮＡの場合はＵ）、Ｇ：Ｃの塩基対からなる２本鎖ＤＮＡの一方の鎖に対する他方の鎖を指す。また、「相補的」とは、当該一定数の連続したヌクレオチド領域で完全に相補配列である場合に限られず、好ましくは８０％以上、より好ましくは９０％以上、さらに好ましくは９５％以上の塩基配列上の同一性を有すればよい。塩基配列の同一性は、前記ＢＬＡＳＴ等のアルゴリズムにより決定することができる。
斯かるオリゴヌクレオチドは、プライマーとして用いる場合には、特異的なアニーリング及び鎖伸長ができればよく、通常、例えば１０塩基以上、好ましくは１５塩基以上、より好ましくは２０塩基以上、かつ例えば１００塩基以下、好ましくは５０塩基以下、より好ましくは３５塩基以下の鎖長を有するものが挙げられる。また、プローブとして用いる場合には、特異的なハイブリダイゼーションができればよく、本発明の標的遺伝子を構成する塩基配列からなるＤＮＡ（又はその相補鎖）の少なくとも一部若しくは全部の配列を有し、例えば１０塩基以上、好ましくは１５塩基以上、かつ例えば１００塩基以下、好ましくは５０塩基以下、より好ましくは２５塩基以下の鎖長のものが用いられる。
なお、ここで、「オリゴヌクレオチド」は、ＤＮＡあるいはＲＮＡであることができ、合成されたものでも天然のものでもよい。又はイブリダイゼーションに用いるプローブは、通常標識したものが用いられる。 A probe or primer used for the above measurement, that is, a primer for specifically recognizing and amplifying the target gene of the present invention or a nucleic acid derived from it, or a primer for specifically detecting the RNA or a nucleic acid derived from it. Probes fall into this category, but they can be designed based on the nucleotide sequences that make up the target gene. Here, "specifically recognizing" means that, for example, in Northern blotting, substantially only the target gene of the present invention or a nucleic acid derived from the target gene can be detected, and in, for example, in the RT-PCR method, substantially only the nucleic acid. Means that the detection or product can be determined to be the gene or nucleic acid derived from it so that is amplified.
Specifically, an oligonucleotide containing a DNA consisting of a base sequence constituting the target gene of the present invention or a certain number of nucleotides complementary to the complementary strand thereof can be used. Here, the "complementary strand" refers to the other strand of a double-stranded DNA consisting of A: T (U in the case of RNA) and G: C base pairs with respect to one strand. Further, "complementary" is not limited to the case where the sequence is completely complementary in the fixed number of continuous nucleotide regions, and is preferably 80% or more, more preferably 90% or more, still more preferably 95% or more bases. It suffices to have the sameness on the sequence. The identity of the base sequence can be determined by an algorithm such as BLAST.
When such an oligonucleotide is used as a primer, it suffices if it can perform specific annealing and chain extension, and is usually, for example, 10 bases or more, preferably 15 bases or more, more preferably 20 bases or more, and for example 100 bases or less. Those having a chain length of 50 bases or less, more preferably 35 bases or less can be mentioned. Further, when it is used as a probe, it suffices if specific hybridization can be performed, and it has at least a part or all of a sequence of DNA (or its complementary strand) consisting of a base sequence constituting the target gene of the present invention, for example. A chain length of 10 bases or more, preferably 15 bases or more, and for example 100 bases or less, preferably 50 bases or less, more preferably 25 bases or less is used.
Here, the "oligonucleotide" can be DNA or RNA, and may be synthesized or natural. Alternatively, as the probe used for hybridization, a labeled probe is usually used.

また、本発明の標的遺伝子の翻訳産物（タンパク質）、当該タンパク質と相互作用する分子、ＲＮＡと相互作用する分子、又はＤＮＡと相互作用する分子を測定する場合は、プロテインチップ解析、免疫測定法（例えば、ＥＬＩＳＡ等）、質量分析（例えば、ＬＣ−ＭＳ／ＭＳ、ＭＡＬＤＩ−ＴＯＦ／ＭＳ）、１−ハイブリッド法（PNAS 100, 12271-12276(2003)）や２−ハイブリッド法（Biol. Reprod. 58, 302-311 (1998)）のような方法を用いることができ、対象に応じて適宜選択できる。
例えば、測定対象としてタンパク質が用いられる場合は、本発明の発現産物に対する抗体を生体試料と接触させ、当該抗体に結合した試料中のタンパク質を検出し、そのレベルを測定することによって実施される。例えば、ウェスタンブロット法によれば、一次抗体として上記の抗体を用いた後、二次抗体として放射性同位元素、蛍光物質又は酵素等で標識した一次抗体に結合する抗体を用いて、その一次抗体を標識し、これら標識物質由来のシグナルを放射線測定器、蛍光検出器等で測定することが行われる。
尚、上記翻訳産物に対する抗体は、ポリクローナル抗体であっても、モノクローナル抗体であってもよい。これらの抗体は、公知の方法に従って製造することができる。具体的には、ポリクローナル抗体は、常法に従って大腸菌等で発現し精製したタンパク質を用いて、あるいは常法に従って当該タンパク質の部分ポリペプチドを合成して、家兎等の非ヒト動物に免疫し、該免疫動物の血清から常法に従って得ることが可能である。
一方、モノクローナル抗体は、常法に従って大腸菌等で発現し精製したタンパク質又は該タンパク質の部分ポリペプチドをマウス等の非ヒト動物に免疫し、得られた脾臓細胞と骨髄腫細胞とを細胞融合させて調製したハイブリドーマ細胞から得ることができる。また、モノクローナル抗体は、ファージディスプレイを用いて作製してもよい（Griffiths, A.D.; Duncan, A.R., Current Opinion in Biotechnology, Volume 9, Number 1, February 1998 , pp. 102-108(7)）。 In addition, when measuring a translation product (protein) of the target gene of the present invention, a molecule that interacts with the protein, a molecule that interacts with RNA, or a molecule that interacts with DNA, protein chip analysis and immunoassay (immunassay) For example, ELISA, etc.), mass spectrometry (eg, LC-MS / MS, MALDI-TOF / MS), 1-hybrid method (PNAS 100, 12271-12276 (2003)) and 2-hybrid method (Biol. Reprod. 58). , 302-311 (1998)) can be used and can be appropriately selected according to the target.
For example, when a protein is used as a measurement target, it is carried out by contacting an antibody against the expression product of the present invention with a biological sample, detecting the protein in the sample bound to the antibody, and measuring the level thereof. For example, according to Western blotting, the above antibody is used as the primary antibody, and then the primary antibody is used as the secondary antibody by using an antibody that binds to the primary antibody labeled with a radioactive isotope, a fluorescent substance, an enzyme or the like. Labeling is performed, and signals derived from these labeling substances are measured with a radiation measuring device, a fluorescence detector, or the like.
The antibody against the translation product may be a polyclonal antibody or a monoclonal antibody. These antibodies can be produced according to known methods. Specifically, the polyclonal antibody immunizes non-human animals such as rabbits by using a protein expressed and purified in Escherichia coli or the like according to a conventional method or by synthesizing a partial polypeptide of the protein according to a conventional method. It can be obtained from the serum of the immunized animal according to a conventional method.
On the other hand, a monoclonal antibody is obtained by immunizing a non-human animal such as a mouse with a protein expressed and purified in Escherichia coli or the like according to a conventional method or a partial polypeptide of the protein, and fusing the obtained spleen cells and myeloma cells. It can be obtained from the prepared hybridoma cells. Monoclonal antibodies may also be prepared using a phage display (Griffiths, AD; Duncan, AR, Current Opinion in Biotechnology, Volume 9, Number 1, February 1998, pp. 102-108 (7)).

斯くして、被験者から採取された生体試料中の本発明の標的遺伝子又はその発現産物の発現レベルが測定され、当該発現レベルに基づいてアトピー性皮膚炎が検出される。検出は、具体的には、測定された本発明の標的遺伝子又はその発現産物の発現レベルを対照レベルと比較することによって行われる。
シーケンシングにより複数の標的遺伝子の発現レベルの解析を行う場合は、上記したように、発現量のデータであるリードカウント値、該リードカウント値をサンプル間の総リード数の違いを補正したＲＰＭ値、当該ＲＰＭ値を底２の対数値に変換した値（Ｌｏｇ_２ＲＰＭ値）又は整数１を加算した底２の対数値（Ｌｏｇ_２（ＲＰＭ＋１）値）、あるいはＤＥＳｅｑ２を用いて補正されたカウント値（Ｎｏｒｍａｌｉｚｅｄｃｏｕｎｔ値）又は整数１を加算した底２の対数値（Ｌｏｇ_２（ｃｏｕｎｔ＋１）値）を指標として用いるのが好ましい。また、ＲＮＡ−ｓｅｑの定量値として一般的な、ｆｒａｇｍｅｎｔｓｐｅｒｋｉｌｏｂａｓｅｏｆｅｘｏｎｐｅｒｍｉｌｌｉｏｎｒｅａｄｓｍａｐｐｅｄ（ＦＰＫＭ）、ｒｅａｄｓｐｅｒｋｉｌｏｂａｓｅｏｆｅｘｏｎｐｅｒｍｉｌｌｉｏｎｒｅａｄｓｍａｐｐｅｄ（ＲＰＫＭ）、ｔｒａｎｓｃｒｉｐｔｓｐｅｒｍｉｌｌｉｏｎ（ＴＰＭ）などによって算出される値であってもよい。また、マイクロアレイ法によって得られるシグナル値、及びその補正値であってもよい。また、ＲＴ−ＰＣＲなどにより特定の標的遺伝子のみ発現レベルの解析を行う場合には、対象遺伝子の発現量をハウスキーピング遺伝子の発現量を基準とする相対的な発現量に変換（相対定量）して解析する方法、又は標的遺伝子の領域を含むプラスミドを用いて絶対的なコピー数を定量（絶対定量）して解析する方法が好ましい。デジタルＰＣＲ法によって得られるコピー数であってもよい。
ここで、「対照レベル」とは、例えば、健常人における当該標的遺伝子又はその発現産物の発現レベルが挙げられる。健常人の発現レベルは、健常人集団から測定した当該遺伝子又はその発現産物の発現レベルの統計値（例えば平均値等）であってもよい。標的遺伝子が複数の場合は、各々の遺伝子又はその発現産物について基準発現レベルを求めることが好ましい。 Thus, the expression level of the target gene of the present invention or the expression product thereof in the biological sample collected from the subject is measured, and atopic dermatitis is detected based on the expression level. Detection is specifically performed by comparing the measured expression level of the target gene of the invention or its expression product with the control level.
When analyzing the expression levels of multiple target genes by sequencing, as described above, the read count value, which is the expression level data, and the RPM value obtained by correcting the difference in the total number of reads between the samples. , The value obtained by converting the RPM value into the logarithmic value of the base 2 (Log ₂ RPM value), the logarithmic value of the base 2 obtained by adding the integer 1 (Log ₂ (RPM + 1) value), or the count value corrected using DESeq2. It is preferable to use (Normalized count value) or the logarithmic value of the base 2 (Log ₂ (count + 1) value) obtained by adding an integer 1 as an index. Also, as a quantitative value of RNA-seq, it is commonly used as a quantitative value of RNA-seq. It may be a value. Further, it may be a signal value obtained by the microarray method and a correction value thereof. When analyzing the expression level of only a specific target gene by RT-PCR or the like, the expression level of the target gene is converted into a relative expression level based on the expression level of the housekeeping gene (relative quantification). The method of analysis by quantifying (absolute quantification) the absolute number of copies using a plasmid containing the region of the target gene is preferable. It may be the number of copies obtained by the digital PCR method.
Here, the “control level” includes, for example, the expression level of the target gene or its expression product in a healthy person. The expression level of a healthy person may be a statistical value (for example, an average value) of the expression level of the gene or its expression product measured from a healthy person population. When there are a plurality of target genes, it is preferable to determine the reference expression level for each gene or its expression product.

また、本発明におけるアトピー性皮膚炎の検出は、本発明の標的遺伝子又はその発現産物の発現レベルの上昇／減少により行うこともできる。この場合は、被験者由来の生体試料における標的遺伝子又はその発現産物の発現レベルが、各遺伝子又はその発現産物のカットオフ値（参照値）と比較される。カットオフ値は、予め健常者における当該標的遺伝子又はその発現産物の発現レベルを基準データとして取得しておき、それに基づく発現レベルの平均値や標準偏差等の統計的数値に基づき、適宜決定すれば良い。 Further, the detection of atopic dermatitis in the present invention can also be carried out by increasing / decreasing the expression level of the target gene of the present invention or its expression product. In this case, the expression level of the target gene or its expression product in the biological sample derived from the subject is compared with the cutoff value (reference value) of each gene or its expression product. The cutoff value can be determined as appropriate based on statistical values such as the average value and standard deviation of the expression level based on the expression level of the target gene or its expression product in a healthy subject obtained as reference data in advance. good.

さらに、アトピー性皮膚炎患者由来の標的遺伝子又はその発現産物の発現レベルと、健常人由来の標的遺伝子又はその発現産物の発現レベルの測定値を利用して、アトピー性皮膚炎患者と健常人とを分ける判別式（予測モデル）を構築し、当該判別式を利用して、アトピー性皮膚炎を検出することができる。すなわち、アトピー性皮膚炎患者由来の標的遺伝子又はその発現産物の発現レベルと、健常者由来の標的遺伝子又はその発現産物の発現レベルの測定値を教師サンプルとして、アトピー性皮膚炎患者と健常人を分ける判別式（予測モデル）を構築し、当該判別式に基づいてアトピー性皮膚炎患者と健常人を判別するカットオフ値（参照値）を求める。なお、判別式の作成においては、主成分分析（ＰＣＡ）により次元圧縮を行ない、主要成分を説明変数とすることができる。
そして、被験者から採取された生体試料から標的遺伝子又はその発現産物のレベルを同様に測定し、得られた測定値を当該判別式に代入し、当該判別式から得られた結果を参照値と比較することによって、被検者におけるアトピー性皮膚炎の存在又は不存在を評価できる。 Furthermore, using the measured values of the expression level of the target gene derived from the atopic dermatitis patient or its expression product and the expression level of the target gene derived from the healthy person or its expression product, the atopic dermatitis patient and the healthy person Atopic dermatitis can be detected by constructing a discriminant formula (prediction model) for dividing the disease and using the discriminant formula. That is, using the measured values of the expression level of the target gene derived from the atopic dermatitis patient or its expression product and the expression level of the target gene derived from the healthy person or its expression product as a teacher sample, the atopic dermatitis patient and the healthy person are selected. A discriminant formula (prediction model) for dividing is constructed, and a cutoff value (reference value) for discriminating between atopic dermatitis patients and healthy subjects is obtained based on the discriminant formula. In creating the discriminant, dimensional compression can be performed by principal component analysis (PCA), and the main component can be used as an explanatory variable.
Then, the level of the target gene or its expression product is measured in the same manner from the biological sample collected from the subject, the obtained measured value is substituted into the discriminant, and the result obtained from the discriminant is compared with the reference value. By doing so, the presence or absence of atopic dermatitis in the subject can be evaluated.

判別式の構築に用いる変数には説明変数と目的変数がある。説明変数としては、例えば、下記の方法で選択した標的遺伝子又はその発現産物の発現レベルを用いることができる。目的変数としては、例えば、そのサンプルが健常人由来かアトピー性皮膚炎患者由来か、を用いることができる。 The variables used to construct the discriminant include explanatory variables and objective variables. As the explanatory variable, for example, the expression level of the target gene or its expression product selected by the following method can be used. As the objective variable, for example, whether the sample is derived from a healthy person or a patient with atopic dermatitis can be used.

特徴量の選択には、判別する２群間の統計学的に有意な差異、例えば、発現レベルが２群間で有意に変動する遺伝子（発現変動遺伝子）またはその発現産物の発現レベルを用いることができる。また、機械学習に用いるアルゴリズムなどの公知のものを利用して特徴量遺伝子を抽出し、その発現レベルを用いたりすることができる。例えば、下記に示すランダムフォレストにおける変数重要度の高い遺伝子またはその発現産物の発現レベルを用いたり、Ｒ言語の“Ｂｏｒｕｔａ”パッケージなどを用いて特徴量遺伝子を抽出し、その発現レベルを用いたりすることができる。 For the selection of the feature amount, a statistically significant difference between the two groups to be discriminated, for example, a gene whose expression level significantly fluctuates between the two groups (expression fluctuation gene) or the expression level of the expression product thereof is used. Can be done. In addition, a well-known gene such as an algorithm used for machine learning can be used to extract a feature gene and its expression level can be used. For example, the expression level of a gene with high variable importance or its expression product in the random forest shown below is used, or the feature amount gene is extracted using the "Boruta" package of R language and the expression level is used. be able to.

判別式の構築におけるアルゴリズムとしては、機械学習に用いるアルゴリズムなどの公知のものを利用することができる。機械学習アルゴリズムの例としては、ランダムフォレスト（Ｒａｎｄｏｍｆｏｒｅｓｔ）、線形カーネルのサポートベクターマシン（ＳＶＭｌｉｎｅａｒ）、ｒｂｆカーネルのサポートベクターマシン（ＳＶＭｒｂｆ）ニューラルネットワーク（Ｎｅｒｕｒａｌｎｅｔ）、一般線形モデル（Ｇｅｎｅｒａｌｉｚｅｄｌｉｎｅａｒｍｏｄｅｌ）、正則化線形判別分析（Ｒｅｇｕｌａｒｉｚｅｄｌｉｎｅａｒｄｉｓｃｒｉｍｉｎａｎｔａｎａｌｙｓｉｓ）、正則化ロジスティック回帰（Ｒｅｇｕｌａｒｉｚｅｄｌｏｇｉｓｔｉｃｒｅｇｒｅｓｓｉｏｎ）などが挙げられる。構築した予測モデルに検証用のデータを入力して予測値を算出し、該予測値が実測値と最も適合するモデル、例えば正解率（Ａｃｃｕｒａｃｙ）が最も大きいモデルを最適な予測モデルとして選抜することができる。また、予測値と実測値から検出率（Ｒｅｃａｌｌ）、精度（Ｐｒｅｃｉｓｉｏｎ）、及びそれらの調和平均であるＦ値を計算し、そのＦ値が最も大きいモデルを最適な予測モデルとして選抜することができる。 As an algorithm for constructing the discriminant, a known algorithm such as an algorithm used for machine learning can be used. Examples of machine learning algorithms include random forest (Random forest), linear kernel support vector machine (SVM liner), rbf kernel support vector machine (SVM rbf) neural network (Regular net), and generalized linear model. ), Regularized linear discriminant analysis, Regularized logistic regression, and the like. Input verification data into the constructed prediction model to calculate the prediction value, and select the model whose prediction value best matches the measured value, for example, the model with the largest accuracy rate, as the optimum prediction model. Can be done. In addition, the detection rate (Recall), accuracy (Precision), and F value, which is the harmonic mean thereof, can be calculated from the predicted value and the measured value, and the model with the largest F value can be selected as the optimum prediction model. ..

判別式の構築においてランダムフォレストのアルゴリズムを使用する場合、予測モデルの精度の指標として、未知データに対する推定の誤答率（ＯＯＢｅｒｒｏｒｒａｔｅ）を算出することができる（Breiman L. Machine Learning (2001) 45;5-32）。 When the random forest algorithm is used in the construction of the discriminant, the estimated error rate (OOB rate) for unknown data can be calculated as an index of the accuracy of the prediction model (Breiman L. Machine Learning (2001)). 45; 5-32).

ランダムフォレストにおいては、ブートストラップ法という手法に従い、全サンプル中から重複を許して、サンプル数の約３分の２のサンプルをランダムに抽出し、決定木と呼ばれる分類器を作成する。この時抽出されなかったサンプルはＯｕｔｏｆｂｕｇ（ＯＯＢ）と呼ばれ、1本の決定木を用いて、ＯＯＢの目的変数の予測を行い、正解ラベルと比較することでその誤答率を算出することができる（決定木におけるＯＯＢｅｒｒｏｒｒａｔｅ）。同様の作業を５００回繰り返し行い、５００本の決定木におけるＯＯＢｅｒｒｏｒｒａｔｅの平均値をとった値を、該ランダムフォレストのモデルのＯＯＢｅｒｒｏｒｒａｔｅとすることができる。 In a random forest, a classifier called a decision tree is created by randomly extracting about two-thirds of the number of samples by allowing duplication from all the samples according to a technique called the bootstrap method. The sample not extracted at this time is called Out of bug (OOB), and the objective variable of OOB is predicted using one decision tree, and the wrong answer rate is calculated by comparing it with the correct answer label. Can be (OOB variable rate in decision tree). The same operation is repeated 500 times, and the value obtained by taking the average value of the OOB erase in 500 decision trees can be used as the OOB erase of the model of the random forest.

なお、ランダムフォレストのモデルを構築する決定木の数（ｎｔｒｅｅ値）は、デフォルトでは５００本であるが、必要に応じて任意の本数に変更することができる。さらに、1つの決定木においてサンプルの判別式の作成に用いる変数の数（ｍｔｒｙ値）は、デフォルトでは説明変数の数の平方根をとった値であるが、必要に応じて１つから全説明変数の数までの値のいずれかに変更することができる。 The number of decision trees (stream values) for constructing a random forest model is 500 by default, but it can be changed to any number as needed. Furthermore, the number of variables (mtry value) used to create the sample discriminant formula in one decision tree is the value obtained by taking the square root of the number of explanatory variables by default, but if necessary, one to all explanatory variables Can be changed to any of the values up to.

ｍｔｒｙ値の決定にはＲ言語の“ｃａｒｅｔ”パッケージを用いることができる。“ｃａｒｅｔ”パッケージのメソッドにランダムフォレストを指定し、８通りのｍｔｒｙ値を試行し、例えばＡｃｃｕｒａｃｙが最大となるｍｔｒｙ値を最適なｍｔｒｙ値として選択することができる。なお、ｍｔｒｙ値の試行回数は、必要に応じて任意の試行回数に変更することができる。 The R language "caret" package can be used to determine the mtry value. A random forest can be specified in the method of the "caret" package, eight mtry values can be tried, and for example, the mtry value that maximizes the accuracy can be selected as the optimum mtry value. The number of trials of the mtry value can be changed to any number of trials as needed.

判別式の構築においてランダムフォレストのアルゴリズムを使用する場合、モデルの構築に用いた説明変数の重要度を数値（変数重要度）化することができる。変数重要度の値には、例えば、ジニ係数の減少量（ＭｅａｎＤｅｃｒｅａｓｅＧｉｎｉ）を用いることができる。 When the random forest algorithm is used in the construction of the discriminant, the importance of the explanatory variables used in the construction of the model can be quantified (variable importance). For the value of the variable importance, for example, the amount of decrease in the Gini coefficient (Mean Decrease Gini) can be used.

カットオフ値（参照値）の決定方法は特に制限されず、公知の手法に従って決定することができる。例えば、判別式を使用して作成されたＲＯＣ（ＲｅｃｅｉｖｅｒＯｐｅｒａｔｉｎｇＣｈａｒａｃｔｅｒｉｓｔｉｃＣｕｒｖｅ）曲線より求めることができる。ＲＯＣ曲線では、縦軸に陽性患者において陽性の結果がでる確率（感度）と、横軸に陰性患者において陰性の結果がでる確率（特異度）を１から減算した値（偽陽性率）がプロットされる。ＲＯＣ曲線に示される「真陽性（感度）」及び「偽陽性（１−特異度）」に関し、「真陽性（感度）」−「偽陽性（１−特異度）」が最大となる値（Ｙｏｕｄｅｎｉｎｄｅｘ）をカットオフ値（参照値）とすることができる。 The method for determining the cutoff value (reference value) is not particularly limited, and the cutoff value (reference value) can be determined according to a known method. For example, it can be obtained from a ROC (Receiver Operating Characteristic Curve) curve created by using a discriminant. In the ROC curve, the vertical axis plots the probability of a positive result in a positive patient (sensitivity) and the horizontal axis plots the probability of a negative result in a negative patient (specificity) subtracted from 1 (false positive rate). Will be done. Regarding "true positive (sensitivity)" and "false positive (1-specificity)" shown in the ROC curve, "true positive (sensitivity)"-"false positive (1-specificity)" is the maximum value (Youden). index) can be a cutoff value (reference value).

既に述べたとおり、ＭＥＣＲ、ＲＡＳＡ４ＣＰ、ＡＲＲＤＣ４、ＥＩＦ１ＡＤ、ＦＤＦＴ１、ＺＮＦ７０６、ＴＥＸ２、ＴＭＰＲＳＳ１１Ｅ、ＲＰＳ６ＫＢ２、ＣＴＢＰ１、ＺＮＦ３３５、ＤＧＫＡ、ＰＰＰ１Ｒ９Ｂ、ＳＰＤＹＥ７Ｐ、ＤＮＡＳＥ１Ｌ１、ＧＮＢ２及びＣＳＮＫ１Ｇ２からなる１７種の遺伝子、及び当該１７種の遺伝子を含む表１−１〜１−３で示される１２３遺伝子、表３−１〜３−４で示される１５０遺伝子、又は表４で示される４５遺伝子を特徴量遺伝子として、アトピー性皮膚炎の予測が可能な予測モデルを構築することができる。 As already mentioned, MECR, RASA4CP, ARRDC4, EIF1AD, FDFT1, ZNF706, TEX2, TMPRSS11E, RPS6KB2, CTBP1, ZNF335, DGKA, PPP1R9B, SPDYE7P, DNASE1L1, SPDYE7P, DNASE1L1, GNB2 and CSN1 123 genes shown in Tables 1-1 to 1-3, 150 genes shown in Tables 3-1 to 3-4, or 45 genes shown in Table 4 containing genes are used as characteristic amount genes for atopic dermatitis. It is possible to build a predictive model that can be predicted.

したがって、上記アトピー性皮膚炎患者群と健常人群とを分ける判別式を作成する場合に、特徴量遺伝子としてＭＥＣＲ、ＲＡＳＡ４ＣＰ、ＡＲＲＤＣ４、ＥＩＦ１ＡＤ、ＦＤＦＴ１、ＺＮＦ７０６、ＴＥＸ２、ＴＭＰＲＳＳ１１Ｅ、ＲＰＳ６ＫＢ２、ＣＴＢＰ１、ＺＮＦ３３５、ＤＧＫＡ、ＰＰＰ１Ｒ９Ｂ、ＳＰＤＹＥ７Ｐ、ＤＮＡＳＥ１Ｌ１、ＧＮＢ２及びＣＳＮＫ１Ｇ２からなる１７種の遺伝子から選択される１つの遺伝子又は２種以上、好ましくは５種以上、より好ましくは１０種以上、よりさらに好ましくは１７種全て選択し、その遺伝子又はその発現産物の発現データを用いる。さらに、複数の遺伝子を選択する場合、これら遺伝子の表３−１〜３−４における変数重要度のより上位の遺伝子から順に特徴量遺伝子として選択し、判別式を作成するのが好ましい。また、当該１７種の遺伝子に、前記表Ａで示される２４５遺伝子、表１−１〜１−３で示される１２３遺伝子、表３−１〜３−４で示される１５０遺伝子又は表４で示される４５遺伝子において、当該１７種の遺伝子以外の遺伝子から選ばれる少なくとも１つ、５以上、１０以上、２０以上又は５０以上の遺伝子又はその発現産物の発現データを適宜加えることにより、判別式を作成し、アトピー性皮膚炎の検出をすることも可能である。当該１７種の遺伝子以外の遺伝子を表３−１〜３−４に示される１５０遺伝子から選択する場合は、変数重要度のより上位の遺伝子から順に、あるいは変数重要度が上位から５０位以内、好ましくは３０位以内の遺伝子から特徴量遺伝子を選択しても良い。さらに、当該１７種の遺伝子以外の遺伝子を特徴量遺伝子として選択する場合には、表１−１〜１−３、表３−１〜３−４及び表４において＊を付し太字で表示された新規のアトピー性皮膚炎マーカーから特徴量遺伝子を選択するのが好ましい。
好ましくは、上記１７遺伝子、表１−１〜１−３で示される１２３遺伝子又は１０７遺伝子（表１−１〜１−３において＊を付し太字で表示）、表３−１〜３−４で示される１５０遺伝子又は１２７遺伝子（表３−１〜３−４において＊を付し太字で表示）、又は表４で示される４５遺伝子又は３９遺伝子（表４において＊を付し太字で表示）を特徴量遺伝子とする判別式が挙げられる。 Therefore, when creating a discriminant that separates the atopic dermatitis patient group from the healthy subject group, MECR, RASA4CP, ARRDC4, EIF1AD, FDFT1, ZNF706, TEX2, TMPRSS11E, RPS6KB2, CTBP1, ZNF335, DGKA are used as characteristic gene. , PPP1R9B, SPDYE7P, DNASE1L1, GNUB2 and CSNK1G2, one gene or two or more, preferably 5 or more, more preferably 10 or more, still more preferably all 17 , The expression data of the gene or its expression product is used. Further, when a plurality of genes are selected, it is preferable to select the genes having the higher variable importance in Tables 3-1 to 3-4 of these genes as the feature amount genes in order to prepare a discriminant. In addition, the 17 kinds of genes include 245 genes shown in Table A, 123 genes shown in Tables 1-1 to 1-3, 150 genes shown in Tables 3-1 to 3-4, or Table 4. A discriminant formula is created by appropriately adding expression data of at least one, 5 or more, 10 or more, 20 or more or 50 or more genes or their expression products selected from genes other than the 17 kinds of genes among the 45 genes. However, it is also possible to detect atopic dermatitis. When selecting genes other than the 17 genes from the 150 genes shown in Tables 3-1 to 3-4, the genes having the highest variable importance are selected in order, or the variable importance is within the 50th position from the highest. Preferably, the feature amount gene may be selected from the genes within the 30th position. Further, when a gene other than the 17 kinds of genes is selected as a feature gene, * is added in Tables 1-1 to 1-3, Tables 3-1 to 3-4 and Table 4 and displayed in bold. It is preferable to select the feature gene from the novel atopic dermatitis markers.
Preferably, the above 17 genes, 123 genes or 107 genes shown in Tables 1-1 to 1-3 (indicated in bold with * in Tables 1-1 to 1-3), Tables 3-1 to 3-4. 150 genes or 127 genes shown in Table 3 to 127 genes (marked with * in Tables 3-1 to 3-4 and shown in bold), or 45 genes or 39 genes shown in Table 4 (marked with * in Table 4 and shown in bold). There is a discriminant formula in which is a feature amount gene.

本発明のアトピー性皮膚炎を検出するための検査用キットは、患者から分離した生体試料における本発明の標的遺伝子又はその発現産物の発現レベルを測定するための検査試薬を含有するものである。具体的には、本発明の標的遺伝子又はそれに由来する核酸と特異的に結合（ハイブリダイズ）するオリゴヌクレオチド（例えば、ＰＣＲ用のプライマー）を含む、核酸増幅、ハイブリダイゼーションのための試薬、或いは、本発明の標的遺伝子の発現産物（タンパク質）を認識する抗体を含む免疫学的測定のための試薬等が挙げられる。当該キットに包含されるオリゴヌクレオチド、抗体等は、上述したとおり公知の方法により得ることができる。
また、当該検査用キットには、上記抗体や核酸の他、標識試薬、緩衝液、発色基質、二次抗体、ブロッキング剤や、試験に必要な器具やポジティブコントロールやネガティブコントロールとして使用するコントロール試薬、生体試料を採取するための用具（例えば、ＳＳＬを採取するための脂取りフィルムなど）等を含むことができる。 The test kit for detecting atopic dermatitis of the present invention contains a test reagent for measuring the expression level of the target gene of the present invention or an expression product thereof in a biological sample isolated from a patient. Specifically, a reagent for nucleic acid amplification and hybridization, or a reagent containing an oligonucleotide (for example, a primer for PCR) that specifically binds (hybridizes) to the target gene of the present invention or a nucleic acid derived from the target gene. Examples thereof include reagents for immunological measurement containing an antibody that recognizes an expression product (protein) of the target gene of the present invention. The oligonucleotides, antibodies and the like included in the kit can be obtained by a known method as described above.
In addition to the above antibodies and nucleic acids, the test kit includes labeling reagents, buffer solutions, color-developing substrates, secondary antibodies, blocking agents, instruments necessary for testing, and control reagents used as positive and negative controls. A tool for collecting a biological sample (for example, a grease removing film for collecting an SSL) or the like can be included.

本発明の態様及び好ましい実施態様を以下に示す。
＜１＞被験者から採取された生体試料について、ＭＥＣＲ、ＲＡＳＡ４ＣＰ、ＡＲＲＤＣ４、ＥＩＦ１ＡＤ、ＦＤＦＴ１、ＺＮＦ７０６、ＴＥＸ２、ＴＭＰＲＳＳ１１Ｅ、ＲＰＳ６ＫＢ２、ＣＴＢＰ１、ＺＮＦ３３５、ＤＧＫＡ、ＰＰＰ１Ｒ９Ｂ、ＳＰＤＹＥ７Ｐ、ＤＮＡＳＥ１Ｌ１、ＧＮＢ２及びＣＳＮＫ１Ｇ２からなる１７種の遺伝子群より選択される少なくとも１つの遺伝子又はその発現産物の発現レベルを測定する工程を含む、当該被験者におけるアトピー性皮膚炎の検出方法。
＜２＞遺伝子又はその発現産物の発現レベルがｍＲＮＡの発現量の測定である、＜１＞の方法。
＜３＞遺伝子又はその発現産物が前記被験者の皮膚表上脂質に含まれるＲＮＡである、＜１＞又は＜２＞の方法。
＜４＞発現レベルの測定値を前記各遺伝子又はその発現産物の参照値と比較し、アトピー性皮膚炎の存在又は不存在を評価する、＜１＞〜＜３＞のいずれかの方法。
＜５＞アトピー性皮膚患者由来の前記遺伝子又はその発現産物の発現レベルと、健常者由来の同遺伝子又はその発現産物の発現レベルの測定値を教師サンプルとして、アトピー性皮膚炎患者と健常者を分ける判別式を作成し、被験者から採取された生体試料から得られた同遺伝子又はその発現産物の発現レベルの測定値を当該判別式に代入し、得られた結果を参照値と比較することによって、被検者におけるアトピー性皮膚炎の存在又は不存在を評価する、＜１＞〜＜３＞のいずれかの方法。
＜６＞判別式の構築におけるアルゴリズムが、ランダムフォレスト、線形カーネルのサポートベクターマシン、ｒｂｆカーネルのサポートベクターマシン、ニューラルネットワーク、一般線形モデル、正則化線形判別分析、又は正則化ロジスティック回帰である＜５＞の方法。
＜７＞前記１７種の遺伝子群の全ての遺伝子又はその発現産物の発現レベルが測定される、＜５＞又は＜６＞の方法。
＜８＞前記７種の遺伝子群より選択される少なくとも１つの遺伝子の他に、後記表１−１〜１−３で示される１１２３種、後記表３−１〜３−４で示される１５０種、表４で示される４５種のうち前記１７種の遺伝子を除いた遺伝子群より選択される少なくとも１つの遺伝子又はその発現産物の発現レベルが測定される、＜５＞〜＜７＞のいずれかの方法。
＜９＞後記表３−１〜３−４で示される１５０種がランダムフォレストを用いて抽出された特徴量遺伝子である、＜８＞の方法。
＜１０＞後記表４で示される４５種がＢｏｒｕｔａ法を用いて抽出された特徴量遺伝子である、＜８＞の方法。
＜１１＞前記遺伝子又はそれに由来する核酸と特異的にハイブリダイズするオリゴヌクレオチド、又は前記遺伝子の発現産物を認識する抗体を含有する、＜１＞〜＜１０＞のいずれかの方法に用いられるアトピー性皮膚炎を検出するための検査用キット。
＜１２＞前記Ｂに示される２１０種の遺伝子群より選択される少なくとも１つの遺伝子又はその発現産物からなる、アトピー性皮膚炎の検出マーカー。
＜１３＞ＭＥＣＲ、ＲＡＳＡ４ＣＰ、ＡＲＲＤＣ４、ＥＩＦ１ＡＤ、ＦＤＦＴ１、ＺＮＦ７０６、ＴＥＸ２、ＴＭＰＲＳＳ１１Ｅ、ＲＰＳ６ＫＢ２、ＣＴＢＰ１、ＺＮＦ３３５、ＤＧＫＡ、ＰＰＰ１Ｒ９Ｂ、ＳＰＤＹＥ７Ｐ、ＤＮＡＳＥ１Ｌ１、ＧＮＢ２及びＣＳＮＫ１Ｇ２からなる１７種の遺伝子群より選択される少なくとも１つの遺伝子又はその発現産物である、＜１２＞のアトピー性皮膚炎の検出マーカー。 Aspects and preferred embodiments of the present invention are shown below.
<1> Regarding biological samples collected from subjects, MECR, RASA4CP, ARRDC4, EIF1AD, FDFT1, ZNF706, TEX2, TMPRSS11E, RPS6KB2, CTBP1, ZNF335, DGKA, PPP1R9B, SPDYE7P, DNASE1L1 and GN A method for detecting atopic dermatitis in a subject, comprising the step of measuring the expression level of at least one gene selected from the gene group or an expression product thereof.
<2> The method of <1>, wherein the expression level of the gene or its expression product is a measurement of the expression level of mRNA.
<3> The method of <1> or <2>, wherein the gene or its expression product is RNA contained in the lipid on the skin surface of the subject.
<4> The method according to any one of <1> to <3>, wherein the measured value of the expression level is compared with the reference value of each gene or its expression product to evaluate the presence or absence of atopic dermatitis.
<5> Using the measured values of the expression level of the gene or its expression product derived from atopic dermatitis patients and the expression level of the gene or its expression product derived from healthy subjects as a teacher sample, atopic dermatitis patients and healthy subjects By creating a discriminant formula for division, substituting the measured value of the expression level of the gene or its expression product obtained from the biological sample collected from the subject into the discriminant formula, and comparing the obtained result with the reference value. , Any of <1> to <3> for assessing the presence or absence of atopic dermatitis in a subject.
<6> The algorithm for constructing the discriminant formula is a random forest, a support vector machine for a linear kernel, a support vector machine for an rbf kernel, a neural network, a general linear model, a regularized linear discriminant analysis, or a regularized logistic regression. > Method.
<7> The method of <5> or <6>, wherein the expression levels of all genes of the 17 gene groups or their expression products are measured.
<8> In addition to at least one gene selected from the above 7 gene groups, 1123 genes shown in Tables 1-1 to 1-3 below and 150 genes shown in Tables 3-1 to 3-4 below. , Any one of <5> to <7>, wherein the expression level of at least one gene selected from the gene group excluding the 17 genes among the 45 genes shown in Table 4 or an expression product thereof is measured. the method of.
<9> The method of <8>, wherein the 150 species shown in Tables 3-1 to 3-4 below are feature genes extracted using a random forest.
<10> The method of <8>, wherein the 45 species shown in Table 4 below are feature genes extracted by the Boruta method.
<11> Atopy used in any of the methods <1> to <10>, which contains an oligonucleotide that specifically hybridizes with the gene or a nucleic acid derived from the gene, or an antibody that recognizes an expression product of the gene. A test kit for detecting sexual dermatitis.
<12> A marker for detecting atopic dermatitis, which comprises at least one gene selected from the 210 gene groups shown in B or an expression product thereof.
<13> MECR, RASA4CP, ARRDC4, EIF1AD, FDFT1, ZNF706, TEX2, TMPRSS11E, RPS6KB2, CTBP1, ZNF335, DGKA, PPP1R9B, SPDYE7P, DNASE1L1, GNB2 and CSNK1 A marker for detecting atopic dermatitis of <12>, which is a gene or an expression product thereof.

以下、実施例に基づき本発明をさらに詳細に説明するが、本発明はこれに限定されるものではない。 Hereinafter, the present invention will be described in more detail based on Examples, but the present invention is not limited thereto.

実施例１ＳＳＬから抽出されたＲＮＡにおけるアトピー性皮膚炎関連の発現変動遺伝子の検出
１）ＳＳＬ採取
成人の健常者（ＨＬ）（２５〜５７歳、男性）１４名、及びアトピー性皮膚を有する成人（ＡＤ）（２３〜５６歳、男性）２９名を被験者とした。アトピー性皮膚炎の被検者は、皮膚科専門医により、少なくとも顔面部に皮疹を有し、かつ重症度が軽症又は中等症のアトピー性皮膚炎であるとの診断を受けている。各被験者の全顔（ＡＤ患者は皮疹部を含む）からあぶら取りフィルム（５×８ｃｍ、ポリプロピレン製、３Ｍ社）を用いて皮脂を回収後、該あぶら取りフィルムをバイアルに移し、ＲＮＡ抽出に使用するまで−８０℃で、約１ヶ月間保存した。 Example 1 Detection of expression-variable genes related to atopic dermatitis in RNA extracted from SSL 1) SSL collection 14 healthy adults (HL) (25-57 years old, male) and adults with atopic skin 29 subjects (AD) (23-56 years old, male) were used as subjects. Subjects with atopic dermatitis have been diagnosed by a dermatologist as having a rash at least on the face and with mild or moderate severity of atopic dermatitis. After collecting sebum from the entire face of each subject (including the eruption part in AD patients) using an oil removal film (5 x 8 cm, made of polypropylene, 3M company), the oil removal film is transferred to a vial and used for RNA extraction. The mixture was stored at −80 ° C. for about 1 month.

２）ＲＮＡ調製及びシーケンシング
上記１）のあぶら取りフィルムを適当な大きさに切断し、ＱＩＡｚｏｌＬｙｓｉｓＲｅａｇｅｎｔ（Ｑｉａｇｅｎ）を用いて、付属のプロトコルに準じてＲＮＡを抽出した。抽出されたＲＮＡを元に、ＳｕｐｅｒＳｃｒｉｐｔＶＩＬＯｃＤＮＡＳｙｎｔｈｅｓｉｓｋｉｔ（ライフテクノロジーズジャパン株式会社）を用いて４２℃、９０分間逆転写を行いｃＤＮＡの合成を行った。逆転写反応のプライマーには、キットに付属しているランダムプライマーを使用した。得られたｃＤＮＡから、マルチプレックスＰＣＲにより２０８０２遺伝子に由来するＤＮＡを含むライブラリーを調製した。マルチプレックスＰＣＲは、ＩｏｎＡｍｐｌｉＳｅｑＴｒａｎｓｃｒｉｐｔｏｍｅＨｕｍａｎＧｅｎｅＥｘｐｒｅｓｓｉｏｎＫｉｔ（ライフテクノロジーズジャパン株式会社）を用いて、［９９℃、２分→（９９℃、１５秒→６２℃、１６分）×２０サイクル→４℃、Ｈｏｌｄ］の条件で行った。得られたＰＣＲ産物は、ＡｍｐｕｒｅＸＰ（ベックマン・コールター株式会社）で精製した後に、バッファーの再構成、プライマー配列の消化、アダプターライゲーションと精製、増幅を行い、ライブラリーを調製した。調製したライブラリーをＩｏｎ５４０Ｃｈｉｐにローディングし、ＩｏｎＳ５／ＸＬシステム（ライフテクノロジーズジャパン株式会社）を用いてシーケンシングした。 2) RNA preparation and sequencing The oil-removing film from 1) above was cut to an appropriate size, and RNA was extracted using QIAzol Lysis Reagent (Qiagen) according to the attached protocol. Based on the extracted RNA, cDNA was synthesized by reverse transcription at 42 ° C. for 90 minutes using a SuperScript VILO cDNA Synthesis kit (Life Technologies Japan Co., Ltd.). The random primer included in the kit was used as the primer for the reverse transcription reaction. From the obtained cDNA, a library containing DNA derived from the 20802 gene was prepared by multiplex PCR. Multiplex PCR was performed using Ion AmpliSeqTranscriptome Human Gene Expression Kit (Life Technologies Japan Co., Ltd.) [99 ° C, 2 minutes → (99 ° C, 15 seconds → 62 ° C, 16 minutes) × 20 cycles → 4 ° C, Hold. ] Condition. The obtained PCR product was purified by Aple XP (Beckman Coulter, Inc.), followed by buffer reconstitution, primer sequence digestion, adapter ligation and purification, and amplification to prepare a library. The prepared library was loaded into an Ion 540 Chip and sequenced using an Ion S5 / XL system (Life Technologies Japan KK).

３）データ解析
ｉ）使用データ
上記２）で測定した被験者由来のＲＮＡの発現量のデータ（リードカウント値）において、ＤＥＳｅｑ２という手法を用いて補正した。但し、全サンプル被験者の発現量データのうち９０％以上のサンプル被験者で欠損値ではない発現量データが得られている７４２９遺伝子のみ以下の解析に使用した。解析には、ＤＥＳｅｑ２という手法を用いて補正されたカウント値（Ｎｏｒｍａｌｉｚｅｄｃｏｕｎｔ値）を用いた。 3) Data analysis i) Data used The data (read count value) of the expression level of RNA derived from the subject measured in 2) above was corrected using a method called DESeq2. However, only the 7429 gene for which 90% or more of the expression level data of all sample subjects had expression level data that was not a missing value was used in the following analysis. For the analysis, a count value (normalized count value) corrected by using a method called DESeq2 was used.

ｉｉ）ＲＮＡ発現解析
上記ｉ）で測定した健常者、及びＡＤのＳＳＬ由来ＲＮＡ発現量（Ｎｏｒｍａｌｉｚｅｄｃｏｕｎｔ値）を基に、健常者と比較してＡＤにおいて尤度比検定によるｐ値の補正値（ＦＤＲ）が０．０５未満となるＲＮＡ（発現変動遺伝子）を同定した。その結果、７５種のＲＮＡがＡＤにおいて低下（ＤＯＷＮ）、４８種のＲＮＡがＡＤにおいて上昇（ＵＰ）した（表１−１〜１−３）。 ii) RNA expression analysis Based on the healthy subjects measured in i) above and the SSL-derived RNA expression level (Normalized count value) of AD, the p-value corrected by the likelihood ratio test in AD compared with healthy subjects (corrected value of p value by likelihood ratio test). RNA (expression variation gene) having FDR) of less than 0.05 was identified. As a result, 75 kinds of RNA decreased (DOWN) in AD, and 48 kinds of RNA increased (UP) in AD (Tables 1-1 to 1-3).

表１−１〜１−３に示された１２３遺伝子について、公共データベースであるＳＴＲＩＮＧを用いて遺伝子オントロジー（ＧＯ）エンリッチメント解析によるｂｉｏｌｏｇｉｃａｌｐｒｏｃｅｓｓ（ＢＰ）の探索を行った。その結果、ＡＤ患者において発現低下した遺伝子群と関連したＢＰが２７個得られ、脂質代謝やアミノ酸代謝に関するタームが含まれることが示され（表２）。また、発現上昇した遺伝子群と関連したＢＰが４個得られ、白血球の活性化に関するタームなどが含まれていることが示された（表２）。一方で、前記表１−１〜１−３に示された１２３遺伝子のうちの１０７遺伝子（各表において＊を付し太字で表示）については、これまでにアトピー性皮膚炎との関連性を示唆する報告がないことから、新規なアトピー性皮膚炎マーカーとなり得ると判断された。 The 123 genes shown in Tables 1-1 to 1-3 were searched for biological processes (BP) by gene ontology (GO) enrichment analysis using STRING, which is a public database. As a result, 27 BPs associated with the underexpressed gene group were obtained in AD patients, and it was shown that they contained terms related to lipid metabolism and amino acid metabolism (Table 2). In addition, four BPs associated with the up-expressed gene group were obtained, and it was shown that they contained terms related to leukocyte activation (Table 2). On the other hand, 107 of the 123 genes shown in Tables 1-1 to 1-3 (indicated in bold with * in each table) have been associated with atopic dermatitis so far. Since there was no suggestive report, it was judged that it could be a novel marker for atopic dermatitis.

実施例２ランダムフォレストの変数重要度の高い遺伝子を用いた判別モデルの構築
１）使用データ
実施例１と同様の方法で、被験者からのＳＳＬ由来ＲＮＡの発現量のデータ（リードカウント値）を取得し、サンプル間の総リード数の違いを補正したＲＰＭ値に変換した。ただし、全サンプル中９０％以上のサンプルで欠損値ではない発現量データが得られている７４２９遺伝子のみ以下の解析に使用した。機械学習モデルの構築には、負の二項分布に従うＲＰＭ値を正規分布に近似するため、整数１を加算した底２の対数値（Ｌｏｇ_２（ＲＰＭ＋１）値）を用いた。 Example 2 Construction of a discrimination model using genes with high variable importance in random forest 1) Usage data Obtain data (read count value) of the expression level of SSL-derived RNA from the subject by the same method as in Example 1. Then, it was converted into an RNAM value in which the difference in the total number of reads between the samples was corrected. However, only the 7429 gene for which expression level data that was not a missing value was obtained in 90% or more of the samples was used for the following analysis. _{In constructing the machine learning model, the logarithm of the base 2 (Log 2} (RPM + 1) value) obtained by adding the integer 1 was used in order to approximate the RPM value following the negative binomial distribution to the normal distribution.

２）特徴量遺伝子の選択
ランダムフォレストのアルゴリズムを用いて特徴量遺伝子の選択を行うために、全サンプル中９０％以上のサンプルで欠損値ではない発現量データが得られている７４２９遺伝子のＬｏｇ_２（ＲＰＭ＋１）値を説明変数とし、健常者（ＨＬ）とＡＤを目的変数として用いた。Ｒ言語の“ｃａｒｅｔ”パッケージにおいてランダムフォレストのアルゴリズムをメソッドとして指定し、１回の決定木の構築に用いる変数の数（ｍｔｒｙ値）の最適値をチューニングした。チューニングによって決定したｍｔｒｙ値を用いて、ランダムフォレストのアルゴリズムを実行し、ジニ係数に基づく変数重要度の上位１５０遺伝子を算出した（表３−１〜３−４）。そして、これら１５０遺伝子もしくは、そのうちこれまでにアトピー性皮膚炎との関係が報告されていない１２７遺伝子（各表において＊を付し太字で表示）を特徴量遺伝子として選択した。 2) Selection of feature gene In order to select the feature gene using the random forest algorithm, the expression level data that is not a missing value is obtained in 90% or more of the samples. Log _{2 of the 7429 gene.} The (RPM + 1) value was used as an explanatory variable, and healthy subjects (HL) and AD were used as objective variables. The random forest algorithm was specified as a method in the "caret" package of R language, and the optimum value of the number of variables (mtry value) used for constructing one decision tree was tuned. Using the mtry value determined by tuning, a random forest algorithm was executed to calculate the top 150 genes of variable importance based on the Gini coefficient (Tables 3-1 to 3-4). Then, these 150 genes or 127 genes whose relationship with atopic dermatitis has not been reported so far (marked with * in each table and indicated in bold) were selected as feature genes.

３）モデル構築
上記１５０遺伝子または１２７遺伝子のＬｏｇ_２（ＲＰＭ＋１）値を説明変数とし、ＨＬとＡＤを目的変数として用いた。Ｒ言語の“ｃａｒｅｔ”パッケージにおいてランダムフォレストのアルゴリズムをメソッドとして指定し、１回の決定木の構築に用いる変数の数（ｍｔｒｙ値）の最適値をチューニングした。チューニングによって決定したｍｔｒｙ値を用いて、ランダムフォレストのアルゴリズムを実行し、推定誤答率（ＯＯＢｅｒｒｏｒｒａｔｅ）を算出した。その結果、１５０遺伝子を用いた場合のモデルではＯＯＢｅｒｒｏｒｒａｔｅが６．９８％、１２７遺伝子を用いた場合のモデルでは６．９８％であった。 3) Model construction The Log ₂ (RPM + 1) value of the above 150 genes or 127 genes was used as an explanatory variable, and HL and AD were used as objective variables. The random forest algorithm was specified as a method in the "caret" package of R language, and the optimum value of the number of variables (mtry value) used for constructing one decision tree was tuned. The random forest algorithm was executed using the mtry value determined by tuning, and the estimated error rate (OOB rate) was calculated. As a result, the OOB erase rate was 6.98% in the model using 150 genes and 6.98% in the model using 127 genes.

実施例３発現変動遺伝子を用いた判別モデルの構築
１）使用データ
実施例１と同様の方法で、被験者からのＳＳＬ由来ＲＮＡの発現量のデータ（リードカウント値）を取得し、サンプル間の総リード数の違いを補正したＲＰＭ値に変換した。機械学習モデルの構築には、負の二項分布に従うＲＰＭ値を正規分布に近似するため、整数１を加算した底２の対数値（ｌｏｇ_２（ＲＰＭ＋１）値）を用いた。 Example 3 Construction of a discrimination model using expression-variable genes 1) Usage data Data on the expression level of SSL-derived RNA from subjects (read count value) was obtained from the subjects in the same manner as in Example 1, and the total between the samples was obtained. It was converted into an RPM value corrected for the difference in the number of reads. _{In constructing the machine learning model, the logarithm of the base 2 (log 2} (RPM + 1) value) obtained by adding the integer 1 was used in order to approximate the RPM value following the negative binomial distribution to the normal distribution.

２）特徴量遺伝子の選択
実施例１において健常者（ＨＬ）と比較してＡＤで有意に発現が変動していた１２３遺伝子（表１−１〜１−３）もしくは、そのうちこれまでにアトピー性皮膚炎との関係が報告されていない１０７遺伝子（各表において＊を付し太字で表示）を特徴量遺伝子として選択した。 2) Selection of feature amount genes 123 genes (Tables 1-1 to 1-3) whose expression was significantly changed in AD as compared with healthy subjects (HL) in Example 1, or of which atopic properties have been obtained so far. 107 genes for which no relationship with dermatitis was reported (marked with * in each table and indicated in bold) were selected as feature genes.

３）モデル構築
上記１２３遺伝子または１０７遺伝子のＬｏｇ_２（ＲＰＭ＋１）値を説明変数とし、ＨＬとＡＤを目的変数として用いた。Ｒ言語の“ｃａｒｅｔ”パッケージにおいてランダムフォレストのアルゴリズムをメソッドとして指定し、１回の決定木の構築に用いる変数の数（ｍｔｒｙ値）の最適値をチューニングした。チューニングによって決定したｍｔｒｙ値を用いて、ランダムフォレストのアルゴリズムを実行し、ＯＯＢｅｒｒｏｒｒａｔｅを算出した。その結果、１２３遺伝子を用いた場合のモデルではＯＯＢｅｒｒｏｒｒａｔｅが１３．９５％、１０７遺伝子を用いた場合のモデルでは１３．９５％であった。 3) Model construction The Log ₂ (RPM + 1) value of the 123 gene or 107 gene was used as an explanatory variable, and HL and AD were used as objective variables. The random forest algorithm was specified as a method in the "caret" package of R language, and the optimum value of the number of variables (mtry value) used for constructing one decision tree was tuned. Using the mtry value determined by tuning, the random forest algorithm was executed and the OOB eraser rate was calculated. As a result, the OOB erase rate was 13.95% in the model using 123 genes and 13.95% in the model using 107 genes.

実施例４Ｂｏｒｕｔａ法により抽出した特徴量遺伝子を用いた判別モデルの構築
１）使用データ
実施例１と同様の方法で、被験者からのＳＳＬ由来ＲＮＡの発現量のデータ（リードカウント値）を取得し、サンプル間の総リード数の違いを補正したＲＰＭ値に変換した。ただし、全サンプル中９０％以上のサンプルで欠損値ではない発現量データが得られている７４２９遺伝子のみ以下の解析に使用した。機械学習モデルの構築には、負の二項分布に従うＲＰＭ値を正規分布に近似するため、整数１を加算した底２の対数値（ｌｏｇ_２（ＲＰＭ＋１）値）を用いた。 Example 4 Construction of a discrimination model using the feature amount gene extracted by the Boruta method 1) Usage data Data on the expression level of SSL-derived RNA (read count value) from the subject was acquired by the same method as in Example 1. , Converted to an RPM value corrected for the difference in the total number of reads between samples. However, only the 7429 gene for which expression level data that was not a missing value was obtained in 90% or more of the samples was used for the following analysis. _{In constructing the machine learning model, the logarithm of the base 2 (log 2} (RPM + 1) value) obtained by adding the integer 1 was used in order to approximate the RPM value following the negative binomial distribution to the normal distribution.

２）特徴量遺伝子の選択
全サンプル中９０％以上のサンプルで欠損値ではない発現量データが得られている７４２９遺伝子のＬｏｇ_２（ＲＰＭ+１）値を説明変数とし、健常者（ＨＬ）とＡＤを目的変数として用い、Ｒ言語の“Ｂｏｒｕｔａ”パッケージのアルゴリズムを実行した。最大試行回数を１０００回とし、ｐ値が０．０１未満である４５遺伝子を算出した（表４）。これら４５遺伝子もしくは、そのうちこれまでにアトピー性皮膚炎との関係が報告されていない３９遺伝子（表４において＊を付し太字で表示）を特徴量遺伝子として選択した。 _{2) Selection of feature amount gene The Log 2} (RPM + 1) value of the 7429 gene, for which expression level data that is not a missing value is obtained in 90% or more of all samples, is used as an explanatory variable, and a healthy subject (HL) is used. Using AD as the objective variable, the algorithm of the R language "Boruta" package was executed. The maximum number of trials was 1000, and 45 genes having a p-value of less than 0.01 were calculated (Table 4). Of these 45 genes, or 39 genes for which no relationship with atopic dermatitis has been reported so far (marked with * in Table 4 and indicated in bold) were selected as feature genes.

３）モデル構築
上記４５遺伝子または３９遺伝子のＬｏｇ_２（ＲＰＭ＋１）値を説明変数とし、ＨＬとＡＤを目的変数として用いた。Ｒ言語の“ｃａｒｅｔ”パッケージにおいてランダムフォレストのアルゴリズムをメソッドとして指定し、１回の決定木の構築に用いる変数の数（ｍｔｒｙ値）の最適値をチューニングした。チューニングによって決定したｍｔｒｙ値を用いて、ランダムフォレストのアルゴリズムを実行し、ＯＯＢｅｒｒｏｒｒａｔｅを算出した。その結果、４５遺伝子を用いた場合のモデルではＯＯＢｅｒｒｏｒｒａｔｅが６．９８％、３９遺伝子を用いた場合のモデルでは９．３％であった。 3) Model construction The Log ₂ (RPM + 1) value of the above 45 genes or 39 genes was used as an explanatory variable, and HL and AD were used as objective variables. The random forest algorithm was specified as a method in the "caret" package of R language, and the optimum value of the number of variables (mtry value) used for constructing one decision tree was tuned. Using the mtry value determined by tuning, the random forest algorithm was executed and the OOB eraser rate was calculated. As a result, the OOB erase rate was 6.98% in the model using 45 genes and 9.3% in the model using 39 genes.

実施例５複数の実施例で重複して用いられた特徴量遺伝子に基づく判別モデルの構築
１）使用データ
実施例１と同様の方法で、被験者からのＳＳＬ由来ＲＮＡの発現量のデータ（リードカウント値）を取得し、サンプル間の総リード数の違いを補正したＲＭＰ値に変換した。機械学習モデルの構築には、負の二項分布に従うＲＰＭ値を正規分布に近似するため、整数１を加算した底２の対数値（ｌｏｇ_２（ＲＰＭ＋１）値）を用いた。 Example 5 Construction of a discrimination model based on the feature amount genes duplicated in a plurality of Examples 1) Usage data Data on the expression level of SSL-derived RNA from the subject (read count) in the same manner as in Example 1. Value) was obtained and converted to an RMP value in which the difference in the total number of reads between samples was corrected. _{In constructing the machine learning model, the logarithm of the base 2 (log 2} (RPM + 1) value) obtained by adding the integer 1 was used in order to approximate the RPM value following the negative binomial distribution to the normal distribution.

２）特徴量遺伝子の選択
実施例２〜４において用いた特徴量遺伝子のうち、実施例２〜４全ての実施例において用いた遺伝子はＭＥＣＲ、ＲＡＳＡ４ＣＰ、ＨＭＧＣＳ１、ＡＲＲＤＣ４、ＥＩＦ１ＡＤ、ＦＤＦＴ１、ＺＮＦ７０６、ＴＥＸ２、ＴＭＰＲＳＳ１１Ｅ、ＲＰＳ６ＫＢ２、ＣＴＢＰ１、ＺＮＦ３３５、ＣＡＰＮ１、ＤＧＫＡ、ＰＰＰ１Ｒ９Ｂ、ＳＰＤＹＥ７Ｐ、ＤＮＡＳＥ１Ｌ１、ＧＮＢ２、ＣＳＮＫ１Ｇ２の１９遺伝子であった（表５）。これら１９遺伝子のうちこれまでにアトピー性皮膚炎との関係が報告されていない１７遺伝子（表５において＊を付し太字で表示）を特徴量遺伝子として選択した。 2) Selection of feature amount genes Among the feature amount genes used in Examples 2 to 4, the genes used in all Examples 2 to 4 are MECR, RASA4CP, HMGCS1, ARRDC4, EIF1AD, FDFT1, ZNF706, TEX2. , TMPRSS11E, RPS6KB2, CTBP1, ZNF335, CAPN1, DGKA, PPP1R9B, SPDYE7P, DNASE1L1, GNB2, CSNK1G2 (Table 5). Of these 19 genes, 17 genes whose relationship with atopic dermatitis has not been reported so far (marked with * in Table 5 and indicated in bold) were selected as feature genes.

３）モデル構築
上記１７遺伝子のＬｏｇ_２（ＲＰＭ＋１）値を説明変数とし、ＨＬとＡＤを目的変数として用いた。Ｒ言語の“ｃａｒｅｔ”パッケージにおいてランダムフォレストのアルゴリズムをメソッドとして指定し、１回の決定木の構築に用いる変数の数（ｍｔｒｙ値）の最適値をチューニングした。チューニングによって決定したｍｔｒｙ値を用いて、ランダムフォレストのアルゴリズムを実行し、ＯＯＢｅｒｒｏｒｒａｔｅを算出した。その結果、ＯＯＢｅｒｒｏｒｒａｔｅは６．９８％であった。 3) Model construction The Log ₂ (RPM + 1) value of the above 17 genes was used as an explanatory variable, and HL and AD were used as objective variables. The random forest algorithm was specified as a method in the "caret" package of R language, and the optimum value of the number of variables (mtry value) used for constructing one decision tree was tuned. Using the mtry value determined by tuning, the random forest algorithm was executed and the OOB eraser rate was calculated. As a result, the OOB error rate was 6.98%.

Claims

Regarding the biological samples collected from the subjects, MECR, RASA4CP, ARRDC4, EIF1AD, FDFT1, ZNF706, TEX2, TMPRSS11E, RPS6KB2, CTBP1, ZNF335, DGKA, PPP1R9B, SPDYE7P, DNASE1L1 A method for detecting atopic dermatitis in a subject, comprising the step of measuring the expression level of at least one gene selected or an expression product thereof.

The method according to claim 1, wherein the expression level of the gene or an expression product thereof is a measurement of the expression level of mRNA.

The method according to claim 1 or 2, wherein the gene or an expression product thereof is RNA contained in the lipid on the skin surface of the subject.

The method according to any one of claims 1 to 3, wherein the measured value of the expression level is compared with the reference value of each gene or its expression product to evaluate the presence or absence of atopic dermatitis.

Discrimination between atopic dermatitis patients and healthy subjects using the measured values of the expression level of the gene or its expression product derived from atopic dermatitis patients and the expression level of the gene or its expression products derived from healthy subjects as a teacher sample. By creating a formula, substituting the measured value of the expression level of the gene or its expression product obtained from the biological sample collected from the subject into the discrimination formula, and comparing the obtained result with the reference value, the subject is subjected to. The method according to any one of claims 1 to 4, which evaluates the presence or absence of atopic dermatitis in a examiner.

The method according to claim 5, wherein the expression level of all genes of the 17 gene groups or expression products thereof is measured.

In addition to at least one gene selected from the 17 gene groups, at least one gene selected from the gene group excluding the 17 genes from the 245 genes shown in Table A below or an expression product thereof. The method according to claim 5 or 6, wherein the expression level of is measured.

In addition to at least one gene selected from the 17 gene groups, 123 species in Tables 1-1 to 1-3 below, 150 species shown in Tables 3-1 to 3-4, or shown in Table 4 The method according to claim 5 or 6, wherein the expression level of at least one gene selected from the gene group excluding the 17 kinds of genes out of 45 kinds of genes or an expression product thereof is measured.

In addition to at least one gene selected from the 17 gene groups, at least 1 selected from the 107 gene group, 127 gene group, and 39 gene group excluding the 17 gene group in the table below. The method according to claim 7 or 8, wherein the expression level of one gene or an expression product thereof is measured.

Atopic skin used in the method according to any one of claims 1 to 9, which contains an oligonucleotide that specifically hybridizes with the gene or a nucleic acid derived from the gene, or an antibody that recognizes an expression product of the gene. Inspection kit for detecting flames.

A marker for detecting atopic dermatitis, which comprises at least one gene selected from the 210 gene groups shown in Table B below or an expression product thereof.

MECR, RASA4CP, ARRDC4, EIF1AD, FDFT1, ZNF706, TEX2, TMPRSS11E, RPS6KB2, CTBP1, ZNF335, DGKA, PPP1R9B, SPDYE7P, DNASE1L1, GNB2 and CSNK1G2. The marker for detecting atopic dermatitis according to claim 11, which is an expression product.