WO2009157251A1 - Method of diagnosing integration dysfunction syndrome - Google Patents

Method of diagnosing integration dysfunction syndrome Download PDF

Info

Publication number
WO2009157251A1
WO2009157251A1 PCT/JP2009/057861 JP2009057861W WO2009157251A1 WO 2009157251 A1 WO2009157251 A1 WO 2009157251A1 JP 2009057861 W JP2009057861 W JP 2009057861W WO 2009157251 A1 WO2009157251 A1 WO 2009157251A1
Authority
WO
WIPO (PCT)
Prior art keywords
seq
expression level
gene
gene group
schizophrenia
Prior art date
Application number
PCT/JP2009/057861
Other languages
French (fr)
Japanese (ja)
Inventor
秀幸 青島
一男 竹村
健太朗 飯嶋
浩志 林
Original Assignee
株式会社エスアールエル
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 株式会社エスアールエル filed Critical 株式会社エスアールエル
Publication of WO2009157251A1 publication Critical patent/WO2009157251A1/en

Links

Images

Classifications

    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N33/00Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
    • G01N33/48Biological material, e.g. blood, urine; Haemocytometers
    • G01N33/50Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
    • G01N33/68Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing involving proteins, peptides or amino acids
    • G01N33/6893Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing involving proteins, peptides or amino acids related to diseases not provided for elsewhere
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6876Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
    • C12Q1/6883Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q2600/00Oligonucleotides characterized by their use
    • C12Q2600/158Expression markers
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N2800/00Detection or diagnosis of diseases
    • G01N2800/30Psychoses; Psychiatry

Definitions

  • the present invention relates to a method for diagnosing schizophrenia using blood as a sample.
  • an object of the present invention is to provide means capable of objectively diagnosing schizophrenia with high accuracy using patient blood as a sample.
  • the inventors of the present invention use blood as a sample, compare the expression levels of about 55,000 genes between healthy individuals and schizophrenic patients, select genes whose expression levels vary significantly, and further describe the invention described later Narrow down to the criteria that we have devised independently, and then use the neural network to increase the variable and cross-validation method to perform the primary selection of the gene group, and the inventors of the present application further select the primary selection of the gene group
  • a low-cost and highly versatile microarray equipped with a gene group to which a large number of candidate genes for classification prediction were added, processed with a neural network in the same manner as described above, and the sensitivity of detection by the constructed classification prediction algorithm The fact that (true positive rate) and specificity (true negative rate) were 80% or more was confirmed using a large number of actual samples, and the present invention was completed.
  • the present invention provides a method for detecting schizophrenia using as an index the expression level of the following gene groups (1) to (10) in a sample isolated from a living body.
  • DLGAP3 SEQ ID NO: 1
  • KCNJ15 SEQ ID NO: 2
  • GPR30 SEQ ID NO: 3
  • NPCR SEQ ID NO: 4
  • TMED1 SEQ ID NO: 5
  • PAFAH2 SEQ ID NO: 6
  • TMEM23 SEQ ID NO: 7
  • PGRMC1 SEQ ID NO: 9)
  • INSL3 SEQ ID NO: 10
  • the present invention provides for the first time a means capable of objectively diagnosing schizophrenia with high accuracy.
  • the present invention uses the expression level of the gene groups (1) to (10) as an index.
  • the sample for measuring the expression level of each gene is not particularly limited as long as it is a sample isolated from a living body, but as described in detail in the following examples, the gene group is selected using blood as a sample. Therefore, it is preferable to use blood as a sample.
  • the gene group includes those in which the expression level is increased or decreased in schizophrenic patients. Further, in the following examples, determination can be made based on the expression levels of only the above 10 genes that have been confirmed to have a detection sensitivity (true positive rate) and specificity (true negative rate) of 80% or more. preferable.
  • the expression levels of other genes such as various genes for normalization in order to ensure measurement accuracy.
  • “based on the expression levels of only the 10 genes” means that the expression levels of only the 10 genes are used as direct variables for classification prediction.
  • the correct answer rate is (a + d) / (a + b + c + d).
  • Measurement of the expression level of each gene in the sample itself can be performed by a known method.
  • the measurement method is not particularly limited, but a method using a single-stranded oligonucleotide probe that hybridizes to the sense strand or antisense strand of each gene, preferably a DNA array on which a DNA probe is immobilized, is simple and preferred.
  • oligonucleotide probes that extract total mRNA from blood, prepare cRNA labeled with biotin from the extracted mRNA, and hybridize with cRNA derived from each gene.
  • CRNA is applied to the immobilized array, the cRNA and probe are hybridized, the array is washed, and the amount of label remaining on the substrate is measured to determine the amount of cRNA, and hence the amount of mRNA, that is, the gene expression level. Can be measured.
  • the probe to be immobilized has a size that specifically hybridizes with cRNA, and usually has a size of about 18 to 50 bases, preferably about 20 to 40 bases.
  • the probe to be immobilized is preferably completely complementary to the region of RNA to which it is hybridized, but the normal hybridization when using a DNA array as specifically described in the following examples. A small number (usually 1 or 2) of mismatches is acceptable as long as it hybridizes under the conditions. Therefore, even when a natural SNP occurs in a gene, it can be measured using the same DNA array.
  • the expression level of the gene group is SEQ ID NO: 34, SEQ ID NO: 42, SEQ ID NO: 77, SEQ ID NO: 81, SEQ ID NO: 98, SEQ ID NO: 109, SEQ ID NO: 122, SEQ ID NO: 165, SEQ ID NO: 200 and SEQ ID NO: It is measured using an oligonucleotide probe having the base sequence indicated by No. 218, and a DNA array on which these probes are immobilized can be preferably used.
  • the determination based on the expression level of the gene group is basically performed by comparing the expression level of the gene group with the expression level of the gene group in known schizophrenia patients and healthy subjects, which are measured in advance. .
  • This comparison is preferably performed by a neural network trained by the variable increment method using the expression level of the gene group in known schizophrenia patients and healthy individuals. Input the measured expression levels of the above 10 types of genes into the constructed learned neural network (construction method will be described later), output the prediction probability of the group classified into the neural network, and use this prediction probability as a criterion Schizophrenia can be detected.
  • the above comparison is preferably performed by multiple regression analysis.
  • a prediction formula multiple regression formula
  • Whether or not the subject has schizophrenia can be determined. This comparison is made, for example, based on the dependent variable calculated for each sample of the known schizophrenia patient group and the healthy subject group, by defining the value of the dependent variable that can preferably classify both groups as a cut-off value.
  • the cut-off value can be appropriately determined by routine statistical processing based on the dependent variables calculated for known schizophrenia patients and healthy individuals.
  • the technique of multiple regression analysis itself is well known, and various software and the like for performing multiple regression analysis are known, and there are many commercially available products. Any software may be used in the present invention.
  • the prediction formula can be determined once the analysis for the known patient and the healthy person is performed once, it is not necessary to perform the analysis for the known patient / healthy person group every time it is performed, and the prediction formula once obtained. Can also be used in subsequent implementations.
  • an analysis method including a step of obtaining a dependent variable of a sample using the obtained multiple regression equation is widely included, and an analysis step for obtaining a multiple regression equation includes Not necessarily included. Therefore, as described above, any method for detecting schizophrenia using the already obtained multiple regression equation is included in the “detection method for performing comparison by multiple regression analysis” in the present invention.
  • the measurement value of the expression level used in the present invention is preferably a value obtained by normalizing the measured signal intensity by a global normalization method as described in the following examples.
  • the global normalization method is a method of calculating the relative expression level by calculating the median value of the expression levels of all genes mounted on the DNA microarray and dividing the expression level of each gene by this median value. .
  • the neural network When performing the method of the present invention using a neural network, the neural network itself is well known and a commercially available neural network can be used. However, although the neural network itself can use a commercial product, in the present invention, there is a feature in the data to be learned by the neural network, and sensitivity (true positive rate) and specificity (true negative rate) can be obtained by learning any data. It is necessary to devise whether both can be increased to 80% or more (described later).
  • An optimal model of a classification prediction model using a neural network can be constructed by a method detailed in the following example, for example. Briefly, for example, the optimum model can be determined as follows. First, the expression level of various genes is measured using samples collected from many schizophrenic patients and healthy individuals. The expression level of the gene can be performed using a DNA microarray as described above. In the following examples, a commercially available DNA microarray equipped with DNA probes of about 55,000 kinds of human genes was used.
  • data cleansing is performed on the expression level measured using a DNA microarray.
  • the data cleansing can be performed, for example, by excluding probes of genes less than 30% tile or 98% tile or more of the entire expression level.
  • probes other than Quality Flag “Good”, probes of genes located on the Y chromosome, probes set distal from the mRNA 3 ′ end, etc. are excluded, and 10,498 from about 55,000 probes. Narrow down to the probe.
  • the quality “Flag” being “Good” means that the measured expression level is larger than 1.5SD of the background around the spot and can be trusted as the measurement value.
  • the gene located on the Y chromosome is present only in males, it was excluded because the sensitivity and / or specificity of detection might be lowered when females were examined.
  • Probes set distal from the mRNA 3 'end are excluded because they are subject to bias in the preparation of cRNA and are a significant variation in the measured values. Furthermore, preliminary analysis excluded those with a missing value of 25% or more, those with a large difference in expression between men and women, and those with a large difference between batches during array production.
  • the expression level of the gene derived from the RNA hybridized with each probe, measured for each probe selected in this way, is input to the neural network, and the two-group test (t test), that is, the learning example A significant difference test (t-test) is performed between schizophrenia (non-medicine) and healthy subjects.
  • t test two-group test
  • samples were also narrowed down. That is, the median value of 56 healthy subjects was calculated for each probe, the correlation of each sample was examined with the data set as an object, and the parameters of the approximate curve and the signal intensity ratio greatly separated were excluded from the analysis target.
  • a DNA microarray equipped with about 55,000 types of probes is expensive, and only one specimen can be processed with one microarray. Therefore, it is desirable to use a lower cost microarray for practical use. Therefore, in the following examples, 216 types of probes were selected from the probes including the 14 types of probes selected above and having a significant difference between schizophrenia and healthy subjects, and mounted on the substrate. Of these 216 types of probes, 202 types of probes other than the 14 types of probes described above are probes that have a significant difference between schizophrenia and healthy subjects, and patients with bipolar disorder who are similar mental disorders. Genes with statistically significant differences among schizophrenic patients were selected.
  • a probe used for global normalization and a management probe (for alignment) were also mounted (details in the following examples).
  • the global normalization was selected with small variation between arrays.
  • a plurality of chambers (16 in the following embodiment) can be formed on a single substrate, that is, 16 specimens can be simultaneously tested with a single array. Cost, and inspection cost and labor can be greatly reduced.
  • the measurement results of the 14 types of previously specified probes, measured using this practical array, are input to the neural network classification prediction model constructed previously, and the sensitivity and specificity are determined using the above test example.
  • the sensitivity and specificity of the healthy person was less than 80%, and when the sensitivity and specificity were calculated using another test example, both the sensitivity and specificity of schizophrenia (untreated) and healthy persons were calculated. It became less than 80%.
  • the probe with the highest correct answer rate is used.
  • a combination was sought.
  • the above 10 genes were identified. Note that the above 10 types of genes are different from the 14 types of genes specified earlier, and only 2 types of genes overlapped with both.
  • an oligonucleotide probe having the base sequence represented by SEQ ID NO: 42 means an oligonucleotide probe having a base sequence of tcccacatcc ccttgaatat cccaggaaa represented by SEQ ID NO: 42 and having a size of 30 bases.
  • cRNA cRNA was prepared using 0.5 ⁇ g of extracted total RNA.
  • Biotin-labeled cRNA was prepared using an iExpress kit (GE Healthcare Bioscience, Chandler, CA, USA) according to the manufacturer's instructions.
  • the quantification and quality confirmation of the prepared cRNA were performed in the same manner as the quantification and quality confirmation of the extracted total RNA. That is, the absorbance of 230, 260, and 280 nm of cRNA solution diluted 50 times was measured, and the concentration of total RNA was measured. The quality of the cRNA was confirmed using an Agilent 2100 bioanalyzer.
  • Codelink TM 55K Bioarray (GE Healthcare Bioscience) was used. Codelink (trademark) 55K Bioarray is coated with acrylamide with special chemical modification on the surface of the slide glass, and the 30mer probe is three-dimensionally fixed. It is an excellent microarray, and probes corresponding to about 55,000 human genes are immobilized.
  • cRNA 10 ⁇ g was prepared with RNase-Free H 2 O to a final volume of 20 ⁇ l, 5 ⁇ l of 5 ⁇ Fragmentation Buffer of iExpress kit was added, and then incubated at 94 ° C. for 20 minutes to fragment the cRNA.
  • the array was fixed using Hybridization® Removal Tool, the hybridization chamber was peeled off, and the array was set on Bioarray® Rack.
  • the Bioarray® Rack with the array set was transferred to a Large® Reagent reservoir containing 0.75 ⁇ TNT® Buffer at 46 ° C. and incubated at 46 ° C. for 1 hour.
  • the Bioarray® Rack was transferred to a Small® Reagent reservoir filled with 3.4 ⁇ ml of Streptavidin-Cy5 diluted solution and incubated at room temperature for 30 minutes. After staining, the Bioarray Rack was transferred to a Large Reagent reservoir filled with 240 ml of 1 ⁇ TNT Buffer, and washed by repeating the operation of incubating at room temperature for 5 minutes 4 times. Next, the Bioarray® Rack was transferred to a Large® Reagent reservoir filled with 0.1 ⁇ SSC / 0.05% Tween-20, washed for 30 seconds, the array was centrifuged and dried, and then stored in the dark until scanning.
  • Array Scanning The washed and dried arrays were scanned with an Agilent Scanner (Agilent Technologies, Santa Clara, CA, USA). The scanner settings were Red PMT [%] 70%, Dye Channel Red (Red is Cy5). The other settings are the default. The scanned array data was saved as a TIF file and digitized.
  • probes other than Quality Flag “Good”, probes located on the Y chromosome, probes set distal from the mRNA 3 ′ end, and the like were excluded. Furthermore, preliminary analysis also excluded those with a missing value of 25% or more, those with a large difference in the expression level between men and women, and those with a large difference between batches during array production. As a result, the probe was narrowed down from about 55,000 probes to 10,498 probes.
  • CodeLink (trade name) 55K Bioarray is very expensive, and only one sample can be processed and analyzed. In order to put it to practical use, a microarray that can be analyzed at a lower cost is required. Therefore, the same surface treatment as CodeLink (trade name) 55K Bioarray is applied, and this is divided into 16 chambers, so that CodeLink (trade name) 16-Assay can process and analyze up to 16 samples at a time.
  • a practical array based on Bioarray (Applied Microarrays) was designed. In addition to the probe of 216 genes described above, a probe used for global normalization (SEQ ID NO: 226 to 525) and a management probe provided by the manufacturer were added to design the following array.
  • CodeLink (trade name) 16-Assay Bioarray classification prediction based on measurement results (neural network) Based on the gene expression information of 60 untreated schizophrenia patients and 56 healthy subjects, we attempted to construct a classification prediction model using a neural network with excellent classification prediction.
  • a classification prediction algorithm is a series of algorithms that can output an optimal solution by inputting a data set whose attributes have been clarified in advance and performing “learning and training”. It is said that an algorithm with high classification accuracy can be constructed from the high learning effect.
  • Numerous algorithms are constructed by variously changing various parameters of the neural network (learning efficiency, momentum, number of repetitions, number of layers, number of neurons), and learning accuracy by using cross-validation and test examples described above for each. Verification was performed.
  • Table 5 and Table 6 show the prediction results for learning examples and test examples based on this algorithm. Further, FIG. 1 shows the result of Forward Selection for this algorithm.
  • CodeLink (trade name) 16-Assay Bioarray classification prediction based on measurement results (multiple regression analysis) Similar to the above, classification prediction by multiple regression analysis was attempted using gene expression information of 60 untreated schizophrenia patients and 56 healthy subjects. Using the expression data of the 10 probes as explanatory variables, a multiple regression analysis was performed on the learning example using commercially available software (SPSS), and a prediction formula was constructed. Multiple regression analysis was performed so that the dependent variable was increased in patients with schizophrenia. Subsequently, the dependent variable was calculated about the said test example using the constructed prediction formula. The obtained prediction formula is as follows.
  • X 1 is the gene expression level of DLGAP3 (GE54859 SEQ ID NO: 42)
  • X 2 is the gene expression level of KCN15J (GE58277 SEQ ID NO: 77)
  • X 3 is the gene expression level of GPR30 (GE80129 SEQ ID NO: 165)
  • X 4 is the gene expression level of NPCR (GE540583 SEQ ID NO: 34)
  • X 5 is gene expression level TMED1 (GE85017 SEQ ID NO: 200)
  • X 6 the gene expression level of PAFAH2 (GE62881 SEQ ID NO: 122)
  • X 7 is a gene expression level of TMEM23 (GE60313 SEQ ID NO: 98)
  • X 8 is the gene expression level of ABCG1 (GE58)
  • a 1 is 1.00019621196698 A 2 is 0.273175387505458 A 3 is 0.606651443546423 A 4 is -0.659859665599205 A 5 is, -0.287215519193429 A 6 is, -0.271285204843002 A 7 is -0.220049126802913 A 8 is -0.00285057386315785 A 9 is 0.478133475554455 A 10 is -0.169744977943406 Constant C is 0.0429404615746508 It is.
  • Tables 7 and 8 show the results of tabulating the dependent variable 50 as a cutoff value. Moreover, the dependent variable calculated about each sample of a learning example and a test example is shown in FIG.
  • Sensitivity 92.5% (37/40), Specificity: 92.1% (35/38), Correct answer rate: 92.3% (72/78)

Landscapes

  • Life Sciences & Earth Sciences (AREA)
  • Chemical & Material Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Organic Chemistry (AREA)
  • Molecular Biology (AREA)
  • Analytical Chemistry (AREA)
  • Immunology (AREA)
  • Physics & Mathematics (AREA)
  • Biomedical Technology (AREA)
  • Urology & Nephrology (AREA)
  • Microbiology (AREA)
  • Pathology (AREA)
  • Genetics & Genomics (AREA)
  • Wood Science & Technology (AREA)
  • Zoology (AREA)
  • Biochemistry (AREA)
  • Biotechnology (AREA)
  • Hematology (AREA)
  • General Health & Medical Sciences (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Cell Biology (AREA)
  • General Engineering & Computer Science (AREA)
  • Biophysics (AREA)
  • Food Science & Technology (AREA)
  • Medicinal Chemistry (AREA)
  • General Physics & Mathematics (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)

Abstract

Disclosed is a novel means whereby integration dysfunction syndrome can be highly accurately and objectively diagnosed with the use of the blood of a patient as a sample. A method of detecting integration dysfunction syndrome with the use, as indications, of the expression amounts of ten specific genes in a sample separated from living body. According to this method, integration dysfunction syndrome can be objectively diagnosed at a high accuracy. By using a number of specimens, it is confirmed that both of the detection sensitivity (i.e., true positive ratio) and the specificity (i.e., true negative ratio) exceed 80%. Since blood is usable as the specimen, this method can be conveniently carried out.

Description

統合失調症の診断方法Diagnosis method of schizophrenia
 本発明は、血液を試料とした統合失調症の診断方法に関する。 The present invention relates to a method for diagnosing schizophrenia using blood as a sample.
 日本国内における統合失調症の発症率は人口の約0.8%であり、主に青年期に発症する。該疾患の予後は多様である。概ね全体の1/3の患者については、顕著で継続的な改善をみる。1/3はいくらか改善するが,間欠性再発と残遺性障害を残す。残りの1/3は重篤で,永久的に能力が失われ、社会的機能遂行に支障をきたす重大な精神疾患である。 The incidence of schizophrenia in Japan is about 0.8% of the population, and it occurs mainly in adolescence. The prognosis for the disease varies. About 1/3 of all patients see significant and continuous improvement. 1/3 improves somewhat, but leaves intermittent recurrence and residual disability. The remaining 1/3 is a serious mental illness that is severely impaired and permanently impairs social functioning.
 統合失調症の治療では早期の治療が重要である。従来、統合失調症の診断としては、米国精神医学会(APA)が制定した精神疾患の診断・統計マニュアルであるDSM-IV (Diagnostic and Statistical Manual of Mental Disorders-IV) による包括的評価に基いて下される。しかし、このような方法では、診断医の主観や技量に大きく依存するため、該疾患を客観的かつ早期に診断することは困難である。 】 Early treatment is important in the treatment of schizophrenia. Traditionally, the diagnosis of schizophrenia is based on a comprehensive evaluation by DSM-IV (Diagnostic and Manual-of Mental-Disorders-IV), a diagnostic and statistical manual for mental disorders established by the American Psychiatric Association (APA). Be defeated. However, in such a method, since it greatly depends on the subjectivity and skill of the diagnostician, it is difficult to make an objective and early diagnosis of the disease.
 統合失調症の生物学的マーカー等を用いた客観的な診断方法が確立すれば、早期診断、早期治療が可能になり、重症化の回避や治癒率の向上が可能になる。現在までに報告されている、生物学的マーカーを用いた診断方法としては、例えば、上皮細胞成長因子の血清中濃度を指標として精神分裂病(統合失調症)を診断する方法(特許文献1)や、血液を試料として用い、特定の遺伝子の発現量を指標とする方法がある(特許文献2)。しかしながら、特許文献1及び2記載の方法は、診断の精度が未だ満足することができない。 確立 If an objective diagnostic method using biological markers for schizophrenia is established, early diagnosis and early treatment will be possible, and it will be possible to avoid the severity and improve the cure rate. Examples of diagnostic methods using biological markers that have been reported so far include a method of diagnosing schizophrenia (schizophrenia) using serum concentration of epidermal growth factor as an index (Patent Document 1). Alternatively, there is a method using blood as a sample and using the expression level of a specific gene as an index (Patent Document 2). However, the methods described in Patent Documents 1 and 2 still cannot satisfy the accuracy of diagnosis.
特許第3706913号公報Japanese Patent No. 3706913 特開2004-135667号公報JP 2004-135667 A
 従って、本発明の目的は、患者血液を試料として高い精度で客観的に統合失調症を診断することができる手段を提供することにある。 Therefore, an object of the present invention is to provide means capable of objectively diagnosing schizophrenia with high accuracy using patient blood as a sample.
 本願発明者らは、血液を試料として用い、約55000種類の遺伝子の発現量を健常人と統合失調症患者の間で比較し、発現量が有意に変動する遺伝子を選び、さらに後述する本願発明者らが独自に考え出した基準で絞込み、これをニューラルネットワークを駆使した変数増加法とcross validation法にかけて遺伝子群の一次選択を行い、一次選択された遺伝子群にさらに本願発明者らが独自に選定したかなりの数の分類予測候補遺伝子を付加した遺伝子群を搭載した低コストで汎用性の高いマイクロアレイを作成し、上記と同様にニューラルネットワークで処理し、構築された分類予測アルゴリズムにより、検出の感度(真陽性率)及び特異度(真陰性率)が80%以上となることを実際の多数の検体を用いて確認し、本発明を完成した。 The inventors of the present invention use blood as a sample, compare the expression levels of about 55,000 genes between healthy individuals and schizophrenic patients, select genes whose expression levels vary significantly, and further describe the invention described later Narrow down to the criteria that we have devised independently, and then use the neural network to increase the variable and cross-validation method to perform the primary selection of the gene group, and the inventors of the present application further select the primary selection of the gene group A low-cost and highly versatile microarray equipped with a gene group to which a large number of candidate genes for classification prediction were added, processed with a neural network in the same manner as described above, and the sensitivity of detection by the constructed classification prediction algorithm The fact that (true positive rate) and specificity (true negative rate) were 80% or more was confirmed using a large number of actual samples, and the present invention was completed.
 すなわち、本発明は、生体から分離された試料における下記(1)~(10)の遺伝子群の発現量を指標とする、統合失調症の検出方法を提供する。
(1) DLGAP3(配列番号1)
(2) KCNJ15(配列番号2)
(3) GPR30(配列番号3)
(4) NPCR(配列番号4)
(5) TMED1(配列番号5)
(6) PAFAH2(配列番号6)
(7) TMEM23(配列番号7)
(8) ABCG1(配列番号8)
(9) PGRMC1(配列番号9)
(10) INSL3(配列番号10)
That is, the present invention provides a method for detecting schizophrenia using as an index the expression level of the following gene groups (1) to (10) in a sample isolated from a living body.
(1) DLGAP3 (SEQ ID NO: 1)
(2) KCNJ15 (SEQ ID NO: 2)
(3) GPR30 (SEQ ID NO: 3)
(4) NPCR (SEQ ID NO: 4)
(5) TMED1 (SEQ ID NO: 5)
(6) PAFAH2 (SEQ ID NO: 6)
(7) TMEM23 (SEQ ID NO: 7)
(8) ABCG1 (SEQ ID NO: 8)
(9) PGRMC1 (SEQ ID NO: 9)
(10) INSL3 (SEQ ID NO: 10)
 本発明により、高い精度で客観的に統合失調症を診断することができる手段が初めて提供された。 The present invention provides for the first time a means capable of objectively diagnosing schizophrenia with high accuracy.
本発明の実施例で行った、ニューラルネットワークによる分類予測モデルにおける、プローブ数と正答率の関係を示す図である。It is a figure which shows the relationship between the number of probes and the correct answer rate in the classification | category prediction model by a neural network performed in the Example of this invention. 重回帰分析により求めた学習例及び試験例の各検体の従属変数を示す図である。It is a figure which shows the dependent variable of each sample of the learning example calculated | required by multiple regression analysis, and a test example.
 上記の通り、本発明は、(1)~(10)の遺伝子群の発現量を指標とする。各遺伝子の発現量を測定する試料は、生体から分離された試料であれば特に限定されるわけではないが、下記実施例で詳述するとおり、上記遺伝子群は、血液を試料として用いて選定されたものであるので、血液を試料とすることが好ましい。なお、上記遺伝子群には、統合失調症患者において発現量が増大しているものも減少しているものも含まれる。また、下記実施例において、検出の感度(真陽性率)及び特異度(真陰性率)が80%以上となることが確認された上記10種類の遺伝子のみの発現量に基づいて判定することが好ましい。なお、下記実施例に具体的に記載するように、測定の精度を確保する等のために、正規化のための種々の遺伝子等の他の遺伝子の発現量を同時に測定することは好ましいことであり、「上記10種類の遺伝子のみの発現量に基づく」とは、上記10種類の遺伝子のみの発現量を分類予測の直接的な変数として用いるという意味である。また、感度を表わす真陽性率は、下記表1におけるa/(a+b)、特異度を表わす真陰性率は、1-偽陽性=d/(c+d)である。また、正答率は、(a+d)/(a+b+c+d)である。 As described above, the present invention uses the expression level of the gene groups (1) to (10) as an index. The sample for measuring the expression level of each gene is not particularly limited as long as it is a sample isolated from a living body, but as described in detail in the following examples, the gene group is selected using blood as a sample. Therefore, it is preferable to use blood as a sample. In addition, the gene group includes those in which the expression level is increased or decreased in schizophrenic patients. Further, in the following examples, determination can be made based on the expression levels of only the above 10 genes that have been confirmed to have a detection sensitivity (true positive rate) and specificity (true negative rate) of 80% or more. preferable. As specifically described in the examples below, it is preferable to simultaneously measure the expression levels of other genes such as various genes for normalization in order to ensure measurement accuracy. Yes, “based on the expression levels of only the 10 genes” means that the expression levels of only the 10 genes are used as direct variables for classification prediction. The true positive rate representing sensitivity is a / (a + b) in Table 1 below, and the true negative rate representing specificity is 1-false positive = d / (c + d). The correct answer rate is (a + d) / (a + b + c + d).
Figure JPOXMLDOC01-appb-T000001
Figure JPOXMLDOC01-appb-T000001
 上記10種類の遺伝子の配列は、上記した各配列番号に記載されているが、各遺伝子のGenBankアクセション番号、遺伝子産物、下記実施例で用いた、各遺伝子の発現量の測定に用いたプローブ番号及びその配列番号を下記表2に示す。 The sequences of the above 10 types of genes are described in each of the above SEQ ID Nos. The GenBank accession number of each gene, the gene product, and the probe used for measuring the expression level of each gene used in the following Examples The numbers and their sequence numbers are shown in Table 2 below.
Figure JPOXMLDOC01-appb-T000002
Figure JPOXMLDOC01-appb-T000002
 また、これらのうち、機能がよく知られている遺伝子の機能の説明と、統合失調症患者において健常者と比較して発現量が増える(上向き矢印)のか減る(下向き矢印)のかを下記表3に示す。 In addition, among these, explanation of the function of genes whose functions are well known and whether the expression level is increased (upward arrow) or decreased (downward arrow) in schizophrenic patients compared to healthy individuals are shown in Table 3 below. Shown in
Figure JPOXMLDOC01-appb-T000003
Figure JPOXMLDOC01-appb-T000003
 試料中の各遺伝子の発現量の測定自体は公知の方法により行なうことができる。測定方法は、特に限定されないが、各遺伝子のセンス鎖又はアンチセンス鎖とハイブリダイズする一本鎖オリゴヌクレオチドプローブ、好ましくはDNAプローブが固定化されたDNAアレイを用いる方法が簡便で好ましい。例えば、下記実施例に具体的に記載するように、血液から全mRNAを抽出し、抽出したmRNAから、ビオチン等で標識されたcRNAを調製し、各遺伝子由来のcRNAとハイブリダイズするオリゴヌクレオチドプローブが固定化されたアレイにcRNAを施してcRNAとプローブとをハイブリダイズさせ、アレイを洗浄後、基板上に残留する標識量を測定することによりcRNA量、ひいてはmRNA量、すなわち遺伝子の発現量を測定することができる。 Measurement of the expression level of each gene in the sample itself can be performed by a known method. The measurement method is not particularly limited, but a method using a single-stranded oligonucleotide probe that hybridizes to the sense strand or antisense strand of each gene, preferably a DNA array on which a DNA probe is immobilized, is simple and preferred. For example, as specifically described in the Examples below, oligonucleotide probes that extract total mRNA from blood, prepare cRNA labeled with biotin from the extracted mRNA, and hybridize with cRNA derived from each gene. CRNA is applied to the immobilized array, the cRNA and probe are hybridized, the array is washed, and the amount of label remaining on the substrate is measured to determine the amount of cRNA, and hence the amount of mRNA, that is, the gene expression level. Can be measured.
 なお、固定化されるプローブは、cRNAと特異的にハイブリダイズするサイズを有するものであり、通常、18塩基~50塩基、好ましくは20塩基~40塩基程度のサイズを有する。また、固定化するプローブは、それがハイブリダイズするRNAの領域と完全に相補的であることが好ましいが、下記実施例に具体的に記載するような、DNAアレイを用いる際の通常のハイブリダイズ条件下でハイブリダイズするものであれば少数(通常、1個か2個)のミスマッチがあっても許容できる。従って、遺伝子に天然のSNPが生じている場合でも、同じDNAアレイを用いて測定可能である。 The probe to be immobilized has a size that specifically hybridizes with cRNA, and usually has a size of about 18 to 50 bases, preferably about 20 to 40 bases. In addition, the probe to be immobilized is preferably completely complementary to the region of RNA to which it is hybridized, but the normal hybridization when using a DNA array as specifically described in the following examples. A small number (usually 1 or 2) of mismatches is acceptable as long as it hybridizes under the conditions. Therefore, even when a natural SNP occurs in a gene, it can be measured using the same DNA array.
 下記実施例では、前記遺伝子群の発現量は、配列番号34、配列番号42、配列番号77、配列番号81、配列番号98、配列番号109、配列番号122、配列番号165、配列番号200及び配列番号218に示される塩基配列を有するオリゴヌクレオチドプローブを用いて測定されており、これらのプローブを固定化したDNAアレイを好ましく利用することができる。 In the following examples, the expression level of the gene group is SEQ ID NO: 34, SEQ ID NO: 42, SEQ ID NO: 77, SEQ ID NO: 81, SEQ ID NO: 98, SEQ ID NO: 109, SEQ ID NO: 122, SEQ ID NO: 165, SEQ ID NO: 200 and SEQ ID NO: It is measured using an oligonucleotide probe having the base sequence indicated by No. 218, and a DNA array on which these probes are immobilized can be preferably used.
 上記遺伝子群の発現量に基づく判定は、基本的には前記遺伝子群の発現量を、あらかじめ測定した、既知の統合失調症患者及び健常者における前記遺伝子群の発現量と対比することにより行われる。この対比は、既知の統合失調症患者及び健常者における前記遺伝子群の発現量を用いて変数増加法により学習させたニューラルネットワークにより行なうことが好ましい。構築した学習済みのニューラルネットワーク(構築方法は後述)に測定した上記10種類の遺伝子の発現量を入力し、該ニューラルネットワークに分類される群の予測確率を出力させ、この予測確率を判定基準として統合失調症を検出することができる。 The determination based on the expression level of the gene group is basically performed by comparing the expression level of the gene group with the expression level of the gene group in known schizophrenia patients and healthy subjects, which are measured in advance. . This comparison is preferably performed by a neural network trained by the variable increment method using the expression level of the gene group in known schizophrenia patients and healthy individuals. Input the measured expression levels of the above 10 types of genes into the constructed learned neural network (construction method will be described later), output the prediction probability of the group classified into the neural network, and use this prediction probability as a criterion Schizophrenia can be detected.
 あるいはまた、上記対比は、重回帰分析により行なうことも好ましい。測定した上記12種類の遺伝子の発現量を説明変数とし、既知の統合失調症患者及び健常者における前記遺伝子群の発現量を重回帰分析すると、予測式(重回帰式)を得ることができる。得られた予測式に、被検者における上記遺伝子群の発現量を入力して従属変数を求め、この従属変数の数値を既知の統合失調症患者及び健常者の従属変数と対比することで、該被検者が統合失調症か否かを判別することができる。この対比は、例えば、既知の統合失調症患者群と健常者群の各検体について計算された従属変数をもとに、両群を好ましく分類できる従属変数の値をカットオフ値として定め、被検者の従属変数をこのカットオフ値と対比することにより行なうことができる。例えば、統合失調症患者で従属変数が大きくなるように設定して発現量を分析した場合、被検者について計算された従属変数の数値がカットオフ値よりも大きければ、該被検者は統合失調症であると予測することができる。カットオフ値は、既知の統合失調症患者及び健常者について計算された従属変数をもとに、常法の統計処理により適宜定めることができる。重回帰分析の手法自体は周知であり、重回帰分析を行なうソフトウェア等も種々のものが公知で、市販品も多く存在する。本発明ではいずれのソフトウェアを用いてもよい。なお、予測式は、既知の患者及び健常者についての分析を一度行なえば定めることができるので、実施の都度既知の患者/健常者群についての分析を行なう必要はなく、一旦得られた予測式をその後に実施する際にも用いることができる。 Alternatively, the above comparison is preferably performed by multiple regression analysis. By using the measured expression levels of the 12 genes as explanatory variables and performing multiple regression analysis of the expression levels of the genes in known schizophrenic patients and healthy individuals, a prediction formula (multiple regression formula) can be obtained. By calculating the dependent variable by inputting the expression level of the gene group in the subject to the obtained prediction formula, and comparing the dependent variable with the dependent variable of a known schizophrenic patient and a healthy person, Whether or not the subject has schizophrenia can be determined. This comparison is made, for example, based on the dependent variable calculated for each sample of the known schizophrenia patient group and the healthy subject group, by defining the value of the dependent variable that can preferably classify both groups as a cut-off value. This can be done by comparing the dependent variable of this person with this cutoff value. For example, if the expression level is analyzed in a patient with schizophrenia and the dependent variable is set to be large, if the numerical value of the dependent variable calculated for the subject is greater than the cutoff value, the subject is integrated. Can be predicted to be ataxia. The cut-off value can be appropriately determined by routine statistical processing based on the dependent variables calculated for known schizophrenia patients and healthy individuals. The technique of multiple regression analysis itself is well known, and various software and the like for performing multiple regression analysis are known, and there are many commercially available products. Any software may be used in the present invention. In addition, since the prediction formula can be determined once the analysis for the known patient and the healthy person is performed once, it is not necessary to perform the analysis for the known patient / healthy person group every time it is performed, and the prediction formula once obtained. Can also be used in subsequent implementations.
 なお、本発明において「重回帰分析」といった場合には、得られた重回帰式を用いて試料の従属変数を求める工程を含む分析方法を広く包含し、重回帰式を得るための分析工程は必ずしも含まれない。従って、上記したように、既に求められた重回帰式を用いて統合失調症の検出を行う方法であれば、本発明にいう「重回帰分析により対比を行なう検出方法」に包含される。 In the present invention, in the case of “multiple regression analysis”, an analysis method including a step of obtaining a dependent variable of a sample using the obtained multiple regression equation is widely included, and an analysis step for obtaining a multiple regression equation includes Not necessarily included. Therefore, as described above, any method for detecting schizophrenia using the already obtained multiple regression equation is included in the “detection method for performing comparison by multiple regression analysis” in the present invention.
 本発明で用いる発現量の測定値は、下記実施例に記載される通り、測定されるシグナル強度をグローバルノーマライゼーション(global normalization)法により正規化したものであることが好ましい。ここで、グローバルノーマライゼーション法とは、DNAマイクロアレイ上に搭載した全遺伝子の発現量の中央値を求め、この中央値で各遺伝子の発現量を除することで相対的発現量を算出する方法である。 The measurement value of the expression level used in the present invention is preferably a value obtained by normalizing the measured signal intensity by a global normalization method as described in the following examples. Here, the global normalization method is a method of calculating the relative expression level by calculating the median value of the expression levels of all genes mounted on the DNA microarray and dividing the expression level of each gene by this median value. .
 ニューラルネットワークを用いて本発明の方法を実施する場合、ニューラルネットワーク自体は周知であり、市販のニューラルネットワークを用いることができる。もっとも、ニューラルネットワーク自体は市販品を利用できるが、本発明では、ニューラルネットワークに学習させるデータに特徴があり、いかなるデータを学習させることにより感度(真陽性率)及び特異度(真陰性率)の両者を80%以上にできるかは工夫が必要である(後述)。 When performing the method of the present invention using a neural network, the neural network itself is well known and a commercially available neural network can be used. However, although the neural network itself can use a commercial product, in the present invention, there is a feature in the data to be learned by the neural network, and sensitivity (true positive rate) and specificity (true negative rate) can be obtained by learning any data. It is necessary to devise whether both can be increased to 80% or more (described later).
 ニューラルネットワークを用いた分類予測モデルの最適モデルは、例えば下記実施例に詳述する方法により構築することができる。簡単に説明すると、例えば、次のようにして最適モデルを決定することができる。まず、多数の統合失調症患者及び健常者から採取した試料を用い、種々の遺伝子の発現量を測定する。遺伝子の発現量は、上記の通り、DNAマイクロアレイを用いて行なうことができる。下記実施例では、約55000種類のヒト遺伝子のDNAプローブが搭載された市販のDNAマイクロアレイを用いた。 An optimal model of a classification prediction model using a neural network can be constructed by a method detailed in the following example, for example. Briefly, for example, the optimum model can be determined as follows. First, the expression level of various genes is measured using samples collected from many schizophrenic patients and healthy individuals. The expression level of the gene can be performed using a DNA microarray as described above. In the following examples, a commercially available DNA microarray equipped with DNA probes of about 55,000 kinds of human genes was used.
 次に、DNAマイクロアレイを用いて測定した発現量をデータクレンジングする。ここで、データクレンジングは、例えば、全体の発現量の30%tile未満の遺伝子のプローブや98%tile以上の遺伝子のプローブを除外することにより行なうことができる。 Next, data cleansing is performed on the expression level measured using a DNA microarray. Here, the data cleansing can be performed, for example, by excluding probes of genes less than 30% tile or 98% tile or more of the entire expression level.
 多数の統合失調症患者及び健常者のDNAマイクロアレイのデータを、学習例と学習例と独立した試験例に分け、学習例をニューラルネットワークに学習させ、構築された分類予測モデルによりどの程度の感度及び特異度が達成されるかを試験例を用いて算出、評価を行うHold out cross validation法により構築した。分類予測モデル構築には、ニューラルネットワークのパラメーターを変更して行い、学習例と試験例に振り分けた検体の独立性は担保して検証を続け、最も良い成績のモデルを採用した。下記実施例では、統合失調症患者及び健常者の2/3を学習例、1/3を試験例として用いた。 Divide the DNA microarray data of a large number of schizophrenia patients and healthy individuals into study examples and test examples independent of the study examples, let the study examples be learned by the neural network, and how much sensitivity and It was constructed by the Hold out cross validation method, which calculates and evaluates whether specificity is achieved using test examples. The classification prediction model was constructed by changing the parameters of the neural network, ensuring the independence of the samples assigned to the learning examples and test examples, and continuing the verification, and adopting the model with the best results. In the following examples, 2/3 of schizophrenia patients and healthy subjects were used as learning examples, and 1/3 were used as test examples.
 まず、ニューラルネットワークに学習させる学習用データを準備する。下記実施例では、Quality Flag "Good"以外のプローブ、Y染色体上に座位する遺伝子のプローブ、mRNA 3'末端から遠位に設定されているプローブなどを除外し、約55,000個のプローブから10,498個のプローブに絞り込んだ。ここで、Quality Flagが "Good"とは、測定された発現量がスポット周囲のバックグラウンドの1.5SDより大きく、測定値として信頼できるということを意味する。またY染色体上に座位する遺伝子は男性にしか存在しないため、女性の検査を行った際に検出の感度及び/又は特異度が下がる恐れがあるので除外した。また、mRNA 3'末端から遠位に設定されているプローブは、cRNAの調製におけるバイアスを受けやすいため、測定値の大きな変動要因であるため除外した。さらに、予備分析により、欠損値が25%以上あるもの、男女間の発現量の差が大きいもの、アレイ製造時のバッチ間差が大きいものも除外した。 First, prepare the learning data to be learned by the neural network. In the following examples, probes other than Quality Flag “Good”, probes of genes located on the Y chromosome, probes set distal from the mRNA 3 ′ end, etc. are excluded, and 10,498 from about 55,000 probes. Narrow down to the probe. Here, the quality “Flag” being “Good” means that the measured expression level is larger than 1.5SD of the background around the spot and can be trusted as the measurement value. Further, since the gene located on the Y chromosome is present only in males, it was excluded because the sensitivity and / or specificity of detection might be lowered when females were examined. Probes set distal from the mRNA 3 'end are excluded because they are subject to bias in the preparation of cRNA and are a significant variation in the measured values. Furthermore, preliminary analysis excluded those with a missing value of 25% or more, those with a large difference in expression between men and women, and those with a large difference between batches during array production.
 次にこのようにして選択した各プローブについて測定された、各プローブがハイブリダイズするRNAが由来する遺伝子の発現量をニューラルネットワークに入力し、2群間検定(t検定)、すなわち、学習例の統合失調症(未投薬)と健常者群の間で有意差検定(t検定)を行なう。なお、下記実施例では、試料の絞込みも行なった。すなわち、健常者56例の中央値をプローブごとに計算し、そのデータセットを対象として各試料の相関を調べ、近似曲線のパラメータやシグナル強度比が大きく隔たったものは分析対象から除外した。 Next, the expression level of the gene derived from the RNA hybridized with each probe, measured for each probe selected in this way, is input to the neural network, and the two-group test (t test), that is, the learning example A significant difference test (t-test) is performed between schizophrenia (non-medicine) and healthy subjects. In the following examples, samples were also narrowed down. That is, the median value of 56 healthy subjects was calculated for each probe, the correlation of each sample was examined with the data set as an object, and the parameters of the approximate curve and the signal intensity ratio greatly separated were excluded from the analysis target.
 有意差検定により有意差が認められたプローブについて変数増加法(forward selection)による選択を行なう。変数増加法自体は周知であり、説明変数(各プローブの測定結果)を1つずつ足していき、目的変数(正答率)との相関が高い組合せを得ることにより行なう。コンピューターにインストールされたニューラルネットワークを用いて変数増加法を行ない、最も正答率が高いプローブ数を選択する。下記実施例では、上記した10,498個のプローブの測定値の中から、14種類のプローブの測定値を説明変数として用いた場合に正答率が最高になった。このようにして構築されたニューラルネットワークの最適モデルによれば、感度及び特異度とも80%を超えたので、この最適モデル及び上記DNAマイクロアレイを用いて統合失調症の検出を行なうことが可能であった。 プ ロ ー ブ Select the probe for which significant difference is recognized by the significant difference test by the forward selection method. The variable increasing method itself is well known, and is performed by adding explanatory variables (measurement results of each probe) one by one and obtaining a combination having a high correlation with the objective variable (correct answer rate). Using a neural network installed on the computer, the variable increase method is performed, and the number of probes with the highest correct answer rate is selected. In the following examples, the correct answer rate was the highest when the measured values of 14 types of probes were used as explanatory variables among the measured values of 10,498 probes described above. According to the optimal model of the neural network constructed in this way, both sensitivity and specificity exceeded 80%, and it was possible to detect schizophrenia using this optimal model and the DNA microarray. It was.
 しかしながら、約55000種類のプローブを搭載したDNAマイクロアレイは高価であり、1枚のマイクロアレイで1検体しか処理できないので、実用化のためにはより低コストのマイクロアレイを用いることが望まれる。そこで、下記実施例では、上記で選択された14種類のプローブを含み、統合失調症と健常者の間で有意差があるプローブから216種類のプローブを選択し、基板に搭載した。この216種類のプローブのうち、上記した14種類のプローブ以外の202種類のプローブは、統合失調症と健常者の間で有意差のあるプローブのほか、類似した精神疾患である双極性障害患者と統合失調症患者間で統計学的有意差の認められる遺伝子を選択した。さらにグローバルノーマライゼーションに用いるプローブ及び管理用プローブ(位置合わせ用)も搭載した(詳細は下記実施例)。グローバルノーマライゼーションには、アレイ間での変動が小さなものを選択した。この実用化アレイは、1枚の基板上に複数(下記実施例では16個)のチャンバーを形成することができ、すなわち、1枚のアレイで16検体の検査を同時に行なうことができ、アレイ作製のコスト並びに検査のコスト及び手間を大幅に下げることができる。 However, a DNA microarray equipped with about 55,000 types of probes is expensive, and only one specimen can be processed with one microarray. Therefore, it is desirable to use a lower cost microarray for practical use. Therefore, in the following examples, 216 types of probes were selected from the probes including the 14 types of probes selected above and having a significant difference between schizophrenia and healthy subjects, and mounted on the substrate. Of these 216 types of probes, 202 types of probes other than the 14 types of probes described above are probes that have a significant difference between schizophrenia and healthy subjects, and patients with bipolar disorder who are similar mental disorders. Genes with statistically significant differences among schizophrenic patients were selected. Furthermore, a probe used for global normalization and a management probe (for alignment) were also mounted (details in the following examples). The global normalization was selected with small variation between arrays. In this practical array, a plurality of chambers (16 in the following embodiment) can be formed on a single substrate, that is, 16 specimens can be simultaneously tested with a single array. Cost, and inspection cost and labor can be greatly reduced.
 この実用化アレイを用いて測定された、先に特定された14種類のプローブの測定結果を、先に構築したニューラルネットワークの分類予測モデルに入力し、上記試験例を用いて感度及び特異度を算出したところ、健常者の感度及び特異度が80%未満となり、さらに別の試験例を用いて感度及び特異度を算出したところ、統合失調症(未治療)及び健常者の感度及び特異度とも80%未満となってしまった。 The measurement results of the 14 types of previously specified probes, measured using this practical array, are input to the neural network classification prediction model constructed previously, and the sensitivity and specificity are determined using the above test example. As a result of calculation, the sensitivity and specificity of the healthy person was less than 80%, and when the sensitivity and specificity were calculated using another test example, both the sensitivity and specificity of schizophrenia (untreated) and healthy persons were calculated. It became less than 80%.
 そこで、実用化アレイを用いた測定値を利用して、上記と同様、コンピューターにインストールされたニューラルネットワークを用いて、cross validation法及び変数増加法を駆使して、正答率が最高となるプローブの組合せを求めた。その結果、上記した10種類の遺伝子が特定された。なお、上記10種類の遺伝子は、先に特定された14種類の遺伝子とは異なるものであり、両方に重複する遺伝子は2種類のみであった。 Therefore, using the measured values using the practical array, using the neural network installed in the computer as described above, using the cross-validation method and the variable increasing method, the probe with the highest correct answer rate is used. A combination was sought. As a result, the above 10 genes were identified. Note that the above 10 types of genes are different from the 14 types of genes specified earlier, and only 2 types of genes overlapped with both.
 上記した10種類の遺伝子のプローブの測定値を説明変数として入力し、試験例について感度及び特異度を算出すると、統合失調症及び健常者のいずれも感度及び特異度が80%を超え、高感度及び高特異度で統合失調症の検出が可能であることが確認された。 When the measured values of the above 10 kinds of gene probes are input as explanatory variables and the sensitivity and specificity are calculated for the test examples, the sensitivity and specificity of both schizophrenia and healthy subjects exceed 80%, and the sensitivity is high. It was also confirmed that schizophrenia can be detected with high specificity.
 また、上記した10種類の遺伝子のプローブの測定値を説明変数として用いて重回帰分析を行ない、試験例について感度及び特異度を算出した場合にも、統合失調症及び健常者のいずれも感度及び特異度が80%を超えた。重回帰分析によっても、上記10種類の遺伝子発現量を用いて、高感度及び高特異度で統合失調症の検出が可能であることが確認された。 In addition, when multiple regression analysis was performed using the measurement values of the above-described 10 types of gene probes as explanatory variables, and sensitivity and specificity were calculated for the test examples, both schizophrenia and healthy subjects were sensitive and Specificity exceeded 80%. Multiple regression analysis also confirmed that schizophrenia could be detected with high sensitivity and high specificity using the above 10 gene expression levels.
 なお、本発明において、「塩基配列を有する」とは、塩基がそのような順序で配列しているという意味である。従って、例えば、「配列番号42で示される塩基配列を有するオリゴヌクレオチドプローブ」とは、配列番号42に示されるtcccacatcc ccttgaatat cccaggaaaaの塩基配列を持つ、30塩基のサイズのオリゴヌクレオチドプローブを意味する。 In the present invention, “having a base sequence” means that the bases are arranged in such an order. Therefore, for example, “an oligonucleotide probe having the base sequence represented by SEQ ID NO: 42” means an oligonucleotide probe having a base sequence of tcccacatcc ccttgaatat cccaggaaaa represented by SEQ ID NO: 42 and having a size of 30 bases.
 以下、実施例に基づき本発明をより具体的に説明する。 Hereinafter, the present invention will be described more specifically based on examples.
1. プローブの絞込み
採血および試料の保管
 統合失調症患者 抗精神病薬 未投薬群58例、健常者56例、対照精神疾患である双極性障害患者41例より、PAXgene Blood RNA Kit (Qiagen, Valencia, CA, USA) を用いて採血及びRNA抽出を行なった。PAXgene Blood RNA Tubes 2本に2.5mlづつ採血し、転倒混和したあと凍結し、実験室への搬送を行った。保管は-80℃とした。
1. Probe blood collection and sample storage Schizophrenia patients Antipsychotics 58 untreated groups, 56 healthy subjects, and 41 patients with bipolar disorder, a psychotic disorder, PAXgene Blood RNA Kit (Qiagen, Valencia, Blood collection and RNA extraction were performed using CA, USA). Two PAXgene Blood RNA Tubes were collected 2.5 ml each, mixed by inversion, frozen, and transported to the laboratory. Storage was -80 ° C.
RNA抽出
 -80℃に保管しておいたPAXgene Blood RNA Tubesを室温で融解し、製造者の指示書に従ってtotal RNAを抽出した。抽出したtotal RNAは、-80℃で保管した。
RNA extraction PAXgene Blood RNA Tubes stored at -80 ° C were thawed at room temperature and total RNA was extracted according to the manufacturer's instructions. The extracted total RNA was stored at -80 ° C.
抽出したRNAの濃度、クォリティーの確認
 抽出したtotal RNAを10mM Tris-HCl(pH7.5)で50倍に希釈し、230, 260, 280nmの吸光度を測定し、total RNAの濃度を測定した。抽出したRNAのクォリティーは、Agilent 2100バイオアナライザー(Agilent Technologies, Inc. Santa Clara, CA, USA)で確認を行った。
Confirmation of concentration and quality of extracted RNA Extracted total RNA was diluted 50-fold with 10 mM Tris-HCl (pH 7.5), absorbance at 230, 260, and 280 nm was measured, and the concentration of total RNA was measured. The quality of the extracted RNA was confirmed with an Agilent 2100 bioanalyzer (Agilent Technologies, Inc. Santa Clara, CA, USA).
cRNAの調製
 抽出したtotal RNA 0.5μgを用いてcRNAを調製した。iExpress kit(GE Healthcare Bioscience, Chandler, CA, USA)を用い、製造者の指示書に従ってBiotin標識したcRNAを調製した。
Preparation of cRNA cRNA was prepared using 0.5 μg of extracted total RNA. Biotin-labeled cRNA was prepared using an iExpress kit (GE Healthcare Bioscience, Chandler, CA, USA) according to the manufacturer's instructions.
 調製したcRNAの定量およびクォリティーの確認は、抽出したtotal RNAの定量及びクォリティーの確認と同様に行った。すなわち、50倍に希釈したcRNA溶液の230, 260, 280 nmの吸光度を測定し、total RNAの濃度を測定した。cRNAのクォリティーの確認は、Agilent 2100バイオアナライザーで行った。 The quantification and quality confirmation of the prepared cRNA were performed in the same manner as the quantification and quality confirmation of the extracted total RNA. That is, the absorbance of 230, 260, and 280 nm of cRNA solution diluted 50 times was measured, and the concentration of total RNA was measured. The quality of the cRNA was confirmed using an Agilent 2100 bioanalyzer.
アレイへのハイブリダイゼーションと洗浄
 マイクロアレイとして、Codelink(商標) 55K Bioarray(GEヘルスケア バイオサイエンス)を用いた。Codelink(商標) 55K Bioarrayは、スライドガラス表面を特殊な化学修飾を施したアクリルアミドでコーティングし、30merのプローブが3次元的に固定されているため、ハイブリダイズの効率が良く、再現性や感度に優れたマイクロアレイであり、ヒトの約55,000遺伝子に対応するプローブが固定されている。
Hybridization and Washing to Array As a microarray, Codelink ™ 55K Bioarray (GE Healthcare Bioscience) was used. Codelink (trademark) 55K Bioarray is coated with acrylamide with special chemical modification on the surface of the slide glass, and the 30mer probe is three-dimensionally fixed. It is an excellent microarray, and probes corresponding to about 55,000 human genes are immobilized.
 10μgのcRNAを最終容量が20μlになるようRNase-Free H2Oで調製し、iExpress kit の5×Fragmentation Bufferを5μl添加した後、94℃で20分間インキュベートしてcRNAを断片化した。 10 μg of cRNA was prepared with RNase-Free H 2 O to a final volume of 20 μl, 5 μl of 5 × Fragmentation Buffer of iExpress kit was added, and then incubated at 94 ° C. for 20 minutes to fragment the cRNA.
 10μgの断片化したcRNA(25μl)、78μlのiExpress kitのHybridization Buffer A、130μlのiExpress kit のHybridization Buffer Bを混合し、計260μlになるように調製した。90℃で5分間インキュベートした後、氷上で5~30分間インキュベートした。 10 μg of fragmented cRNA (25 μl), 78 μl of iExpress® kit Hybridization Buffer A, and 130 μl of iExpress kit Hybridization Buffer B were mixed to prepare a total of 260 μl. After incubating at 90 ° C. for 5 minutes, it was incubated on ice for 5-30 minutes.
 250μlのハイブリダイゼーション溶液をCodeLink(商標) 55K Bioarray(GE Healthcare Bioscience, Chandler, CA, USA)のチャンバーへ注入し、CodeLink(商標) INNOVAシェイカー(GE Healthcare Bioscience, Chandler, CA, USA)を用いて、アレイを300rpmで旋回させながら、37℃で18~24時間インキュベートした。 250 μl of hybridization solution was injected into the chamber of CodeLink ™ 55K Bioarray (GE Healthcare Bioscience, Chandler, CA, USA) and using a CodeLink ™ INNOVA shaker (GE Healthcare Bioscience, Chandler, CA, USA) The array was incubated for 18-24 hours at 37 ° C. with swirling at 300 rpm.
 Hybridization Removal Toolを使用してアレイを固定し、ハイブリダイゼーションチャンバーを引き剥がし、Bioarray Rackにアレイをセットした。アレイをセットしたBioarray Rackを46℃の0.75×TNT Bufferの入ったLarge Reagentリザーバーに移し、46℃で1時間インキュベーションした。 The array was fixed using Hybridization® Removal Tool, the hybridization chamber was peeled off, and the array was set on Bioarray® Rack. The Bioarray® Rack with the array set was transferred to a Large® Reagent reservoir containing 0.75 × TNT® Buffer at 46 ° C. and incubated at 46 ° C. for 1 hour.
 Bioarray Rackを3.4 mlのStreptavidin-Cy5希釈溶液で満たしたSmall Reagentリザーバーに移し、室温で30分間インキュベートした。染色後、Bioarray Rack を240mlの1×TNT Bufferで満たしたLarge Reagentリザーバーに移し、室温で5分間インキュベートする操作を4回繰り返して洗浄した。次にBioarray Rackを0.1×SSC/0.05% Tween 20で満たしたLarge Reagentリザーバーに移し、30秒間洗浄し、アレイを遠心して乾燥した後、スキャニングまで遮光して保存した。 The Bioarray® Rack was transferred to a Small® Reagent reservoir filled with 3.4 μml of Streptavidin-Cy5 diluted solution and incubated at room temperature for 30 minutes. After staining, the Bioarray Rack was transferred to a Large Reagent reservoir filled with 240 ml of 1 × TNT Buffer, and washed by repeating the operation of incubating at room temperature for 5 minutes 4 times. Next, the Bioarray® Rack was transferred to a Large® Reagent reservoir filled with 0.1 × SSC / 0.05% Tween-20, washed for 30 seconds, the array was centrifuged and dried, and then stored in the dark until scanning.
アレイのスキャニング
 洗浄後乾燥させたアレイをAgilent Scanner (Agilent Technologies, Santa Clara, CA, USA)にてスキャニングした。スキャナーの設定は、Red PMT [%] を70%、 Dye ChannelをRed (RedはCy5)とした。これ以外の設定はデフォルトとした。スキャンしたアレイデータはTIFファイルで保存し、数値化を行った。
Array Scanning The washed and dried arrays were scanned with an Agilent Scanner (Agilent Technologies, Santa Clara, CA, USA). The scanner settings were Red PMT [%] 70%, Dye Channel Red (Red is Cy5). The other settings are the default. The scanned array data was saved as a TIF file and digitized.
アレイデータの数値化
 製造者の指示書に従い、CodeLink(商標) Expression AnalysisによりTIFファイルで保存したアレイデータの数値化とグローバルノーマライゼーションによる正規化を行った。
Digitization of array data According to the manufacturer's instructions, CodeLink (trademark) Expression Analysis was used to digitize array data saved in TIF files and normalize by global normalization.
プローブの絞込み
 上記で得られた実験結果から、Quality Flag "Good"以外のプローブ、Y染色体上に座位するプローブ、mRNA 3’末端から遠位に設定されているプローブなどを除外した。さらに、さらに、予備分析により、欠損値が25%以上あるもの、男女間の発現量の差が大きいもの、アレイ製造時のバッチ間差が大きいものも除外した。これらにより、約55,000個のプローブから10,498個のプローブに絞り込んだ。
Narrowing down probes From the experimental results obtained above, probes other than Quality Flag “Good”, probes located on the Y chromosome, probes set distal from the mRNA 3 ′ end, and the like were excluded. Furthermore, preliminary analysis also excluded those with a missing value of 25% or more, those with a large difference in the expression level between men and women, and those with a large difference between batches during array production. As a result, the probe was narrowed down from about 55,000 probes to 10,498 probes.
2. 統計処理
 以上のような条件で絞り込んだプローブから、統合失調症患者 抗精神病薬 未投薬群43検体、健常者38検体の2群間での有意差および双極性障害患者32検体と統合失調症患者 抗精神病薬 未投薬群40検体の間で統計学的有意差の認められたプローブ、さらに双極性障害患者32検体と健常者38検体の間で統計学的有意差の認められたプローブを216個抽出した。これらのプローブの配列を配列表の配列番号11~226に示す。また、各プローブが由来する遺伝子名及びGenBank Accession No.を下記表4に示す。
2. Statistical processing Based on the above-mentioned conditions, schizophrenic patients, antipsychotics, 43 untreated groups, and 38 healthy subjects, significant differences between 2 groups, and 32 patients with bipolar disorder and schizophrenia Patients who have a statistically significant difference between 40 specimens of anti-psychotic drug untreated groups and a probe that has a statistically significant difference between 32 specimens of bipolar disorder and 38 healthy subjects 216 were extracted. The sequences of these probes are shown in SEQ ID NOs: 11 to 226 in the sequence listing. In addition, Table 4 below shows the gene name and GenBank Accession No. from which each probe is derived.
Figure JPOXMLDOC01-appb-T000004
Figure JPOXMLDOC01-appb-T000004
Figure JPOXMLDOC01-appb-T000005
Figure JPOXMLDOC01-appb-T000005
Figure JPOXMLDOC01-appb-T000006
Figure JPOXMLDOC01-appb-T000006
Figure JPOXMLDOC01-appb-T000007
Figure JPOXMLDOC01-appb-T000007
Figure JPOXMLDOC01-appb-T000008
Figure JPOXMLDOC01-appb-T000008
Figure JPOXMLDOC01-appb-T000009
Figure JPOXMLDOC01-appb-T000009
Figure JPOXMLDOC01-appb-T000010
Figure JPOXMLDOC01-appb-T000010
Figure JPOXMLDOC01-appb-T000011
Figure JPOXMLDOC01-appb-T000011
Figure JPOXMLDOC01-appb-T000012
Figure JPOXMLDOC01-appb-T000012
Figure JPOXMLDOC01-appb-T000013
Figure JPOXMLDOC01-appb-T000013
3. 実用化アレイの設計
 CodeLink(商品名) 55K Bioarrayは、非常に高価であり、1枚で1検体しか処理、解析ができない。実用化をはかるためには、より低コストで解析が可能なマイクロアレイが必要となる。そこで、CodeLink(商品名) 55K Bioarrayと全く同一の表面処理を施し、これを16個のチャンバーで区切ることで、一度に最大で16検体の処理、解析が可能なCodeLink(商品名) 16-Assay Bioarray(Applied Microarrays社)をベースにした実用化アレイを設計した。上記216遺伝子のプローブの他にグローバルノーマライゼーション用に使用するプローブ(配列番号226~525)やメーカーによる管理用プローブなど追加して、以下のようなアレイを設計した。
3. Designing a practical array CodeLink (trade name) 55K Bioarray is very expensive, and only one sample can be processed and analyzed. In order to put it to practical use, a microarray that can be analyzed at a lower cost is required. Therefore, the same surface treatment as CodeLink (trade name) 55K Bioarray is applied, and this is divided into 16 chambers, so that CodeLink (trade name) 16-Assay can process and analyze up to 16 samples at a time. A practical array based on Bioarray (Applied Microarrays) was designed. In addition to the probe of 216 genes described above, a probe used for global normalization (SEQ ID NO: 226 to 525) and a management probe provided by the manufacturer were added to design the following array.
CodeLink(商品名) 16-Assay Bioarray プローブの内訳 
(合計 1714スポット/チャンバー)
・分類予測候補プローブ :  216プローブ × 4重スポット
・ノーマライゼーション用追加プローブ :  299プローブ × 2重スポット
・メーカーによる管理用プローブ :  96プローブ × 各1スポット
・ 規格上の予約プローブ :  グリッド (32) , Positive Control (60) , Negative Control (64)
CodeLink (Product Name) 16-Assay Bioarray Probe Breakdown
(Total 1714 spots / chamber)
-Classification prediction candidate probe: 216 probes x 4 spots-Additional probe for normalization: 299 probes x 2 spots-Management probe by manufacturer: 96 probes x 1 spot each- Standard reserved probe: Grid (32), Positive Control (60), Negative Control (64)
4. CodeLink(商品名) 16-Assay Bioarrayによる測定結果を基にした分類予測(ニューラルネットワーク)
 未治療統合失調症患者 60例、健常者 56例の遺伝子発現情報を基に、分類予測に優れたニューラルネットワークによる分類予測モデルの構築を試みた。
4. CodeLink (trade name) 16-Assay Bioarray classification prediction based on measurement results (neural network)
Based on the gene expression information of 60 untreated schizophrenia patients and 56 healthy subjects, we attempted to construct a classification prediction model using a neural network with excellent classification prediction.
 分類予測モデルを構築するにあたって、全体の約2/3を学習例(統合失調症患者 抗精神病薬 未投薬群40検体、健常者38検体)として分類予測モデルの構築に用い、モデルの構築に関与していない、残りの1/3を試験例(統合失調症患者 抗精神病薬 未投薬群20検体、健常者18検体)としてモデルの評価を行うHold out cross validation法により検証を行った。 In building a classification prediction model, about 2/3 of the whole is used as a learning example (40 patients with schizophrenia, antipsychotic drug untreated group, 38 healthy subjects) to build a classification prediction model, and is involved in building the model The remaining 1/3 was tested by the Hold 例 out cross validation method in which the model was evaluated as a test example (20 patients with schizophrenia, antipsychotic drug untreated group, 18 healthy subjects).
(1) 遺伝子発現情報の取得
 216プローブを含む上記実用化アレイを用いて、未治療統合失調症患者 60例、健常者 56例の遺伝子発現情報を取得した。採血、RNAの抽出、cRNAの調製、アレイへのハイブリダイゼーション、蛍光シグナル(蛍光色素Cy5)のスキャニングは、上記と同様に行なった。スキャナーで読み取った1714スポットの画像データは、CodeLink(商標) Expression Analysisソフトウェアを用いて数値化及びグローバルノーマライゼーションによる正規化を行なった。
(1) Acquisition of gene expression information Using the above practical array containing 216 probes, gene expression information of 60 untreated schizophrenia patients and 56 healthy persons was acquired. Blood collection, RNA extraction, cRNA preparation, array hybridization, and fluorescent signal (fluorescent dye Cy5) scanning were performed as described above. Image data of 1714 spots read by the scanner was digitized and normalized by global normalization using CodeLink ™ Expression Analysis software.
(2) ニューラルネットワークによる分類予測モデルの構築
 市販の解析ソフトArrayAssist(登録商標)(STRATAGENE社)に搭載されているニューラルネットワークを用いて、正規化したデータの解析を行ない、分類予測アルゴリズムの構築を試みた。分類予測アルゴリズムとは、あらかじめ属性が明らかになっているデータセットを入力し「学習、訓練」を行なうことで、最適な解を出力することができるようになる一連のアルゴリズムであり、ニューラルネットワークは、学習効果の高さから高い分類精度のアルゴリズムが構築できるとされている。
(2) Construction of classification prediction model by neural network Analyze normalized data using neural network installed in commercially available analysis software ArrayAssist (registered trademark) (STRATAGENE) to construct classification prediction algorithm Tried. A classification prediction algorithm is a series of algorithms that can output an optimal solution by inputting a data set whose attributes have been clarified in advance and performing “learning and training”. It is said that an algorithm with high classification accuracy can be constructed from the high learning effect.
 正規化されたデータの2/3(統合失調症患者 抗精神病薬 未投薬群40例、健常者38例)を学習例として、ArrayAssistのニューラルネットワークに入力し、アルゴリズムの構築を行なった。Feature Selectionは変数増加法(Forward Selection)により行ない、学習データを3組に分けたcross validation(N-fold cross validation (N=3))により分類予測アルゴリズムを構築した。具体的には、学習例のデータセットを3分割し、データセットを入れ替えながら、216個のプローブのうちで有意差のあったプローブを一つずつ使って分類予測を行ない、最も正しく分類できたプローブを分類予測に用いるプローブとして採用することとし、これを順次繰り返して採用するプローブを追加していった。このようにして、徐々に用いるプローブを増やしていき、正しく分類できた割合(Number of Class Accuracy(%))がプラトーに達したところで学習を終了させた。 2/3 of the normalized data (schizophrenic patients antipsychotic drug untreated group 40 cases, 38 healthy subjects) was input to the ArrayAssist neural network as a learning example, and an algorithm was constructed. Feature Selection was performed by the variable increment method (Forward Selection), and the classification prediction algorithm was constructed by cross validation (N-fold cross validation (N = 3)) that divided the learning data into three sets. Specifically, the data set of the learning example was divided into three parts, and the prediction was made using one of the 216 probes with significant difference one by one while changing the data set. We decided to adopt the probe as a probe to be used for classification prediction, and added probes that were repeatedly used in this order. In this way, the number of probes used was gradually increased, and when the ratio (Number 分類 of Class Accuracy (%)) that was correctly classified reached a plateau, the learning was terminated.
 次いで、このようにして学習させたアルゴリズムを用いて、試験例のデータセットを分析した。Number of Class Accuracyがプラトーに達した点で使用されているプローブセットについての正規化データを上記学習済みアルゴリズムに入力し、試験例についての該アルゴリズムによる分類が臨床診断とどの程度一致するかを検証した。 Next, the data set of the test example was analyzed using the algorithm learned in this way. Enter the normalized data for the probe set used at the point where Number of Class Accuracy reaches the plateau into the learned algorithm above to verify how well the classification of the test example matches the clinical diagnosis did.
 ニューラルネットワークの各種パラメーター(学習効率、Momentum、繰り返し数、レイヤー数、ニューロン数)を種々に変化させて多数のアルゴリズムを構築し、それぞれについて上記したcross validationによる学習及び試験例を用いた診断精度の検証を行なった。 Numerous algorithms are constructed by variously changing various parameters of the neural network (learning efficiency, momentum, number of repetitions, number of layers, number of neurons), and learning accuracy by using cross-validation and test examples described above for each. Verification was performed.
 その結果、パラメーターを下記の通りに設定したアルゴリズムにおいて、試験例を最も正しく分類することができた。
  学習効率:0.5
  Momentum:0.3
  繰り返し数:115
  レイヤー数:1
  ニューロン数:3
As a result, it was possible to classify the test examples most correctly in the algorithm in which the parameters were set as follows.
Learning efficiency: 0.5
Momentum: 0.3
Number of repetitions: 115
Number of layers: 1
Number of neurons: 3
 このアルゴリズムによる学習例及び試験例についての予測結果を表5及び表6に示す。また、このアルゴリズムについてのForward Selectionの結果を図1に示す。 Table 5 and Table 6 show the prediction results for learning examples and test examples based on this algorithm. Further, FIG. 1 shows the result of Forward Selection for this algorithm.
Figure JPOXMLDOC01-appb-T000014
Figure JPOXMLDOC01-appb-T000014
Figure JPOXMLDOC01-appb-T000015
Figure JPOXMLDOC01-appb-T000015
 図1に示される通り、構築されたアルゴリズムによれば、10個のプローブ(表2、前掲)による発現データを用いて統合失調症と健常者とを精度良く分類することができる。これらのプローブによって検出される遺伝子の生物学的機能について調査したところ、上記表3に示すように、脳神経系での作用が報告されている遺伝子があった。 As shown in FIG. 1, according to the constructed algorithm, it is possible to classify schizophrenia and a healthy person with high accuracy using expression data obtained by 10 probes (Table 2, supra). When the biological functions of genes detected by these probes were investigated, as shown in Table 3 above, there was a gene that was reported to have an action in the cranial nervous system.
5. CodeLink(商品名) 16-Assay Bioarrayによる測定結果を基にした分類予測(重回帰分析)
 上記と同様に、未治療統合失調症患者60例、健常者56例の遺伝子発現情報を用いて、重回帰分析による分類予測を試みた。上記10個のプローブによる発現データを説明変数として用いて、上記学習例について市販のソフトウェア(SPSS)を用いて重回帰分析を行ない、予測式を構築した。重回帰分析は、統合失調症患者で従属変数が大きくなるように設定して行った。次いで、構築された予測式を用いて、上記試験例について従属変数を計算した。なお、得られた予測式は以下の通りである。
Y=(A1X1+A2X2+A3X3+A4X4+A5X5+A6X6+A7X7+A8X8+A9X9+A10X10+C)×100
ここで、
X1は、DLGAP3(GE54859 配列番号42)の遺伝子発現量
X2は、KCN15J(GE58277 配列番号77)の遺伝子発現量
X3は、GPR30 (GE80129 配列番号165)の遺伝子発現量
X4は、NPCR (GE540583 配列番号34)の遺伝子発現量
X5は、TMED1(GE85017 配列番号200)の遺伝子発現量
X6は、PAFAH2 (GE62881 配列番号122)の遺伝子発現量
X7は、TMEM23 (GE60313 配列番号98)の遺伝子発現量
X8は、ABCG1 (GE586854 配列番号81)の遺伝子発現量
X9は、PGRMC1 (GE62032 配列番号109)の遺伝子発現量
X10は、INSL3 (GE88024 配列番号218)の遺伝子発現量
をそれぞれ指す。
各遺伝子の発現量に乗ずる係数は、それぞれ、
A1は、1.00019621196698
A2は、0.273175387505458
A3は、0.606651443546423
A4は、-0.659859665599205
A5は、-0.287215519193429
A6は、-0.271285204843002
A7は、-0.220049126802913
A8は、-0.00285057386315785
A9は、0.478133475554455
A10は、-0.169744977943406
定数Cは、0.0429404615746508
である。
5. CodeLink (trade name) 16-Assay Bioarray classification prediction based on measurement results (multiple regression analysis)
Similar to the above, classification prediction by multiple regression analysis was attempted using gene expression information of 60 untreated schizophrenia patients and 56 healthy subjects. Using the expression data of the 10 probes as explanatory variables, a multiple regression analysis was performed on the learning example using commercially available software (SPSS), and a prediction formula was constructed. Multiple regression analysis was performed so that the dependent variable was increased in patients with schizophrenia. Subsequently, the dependent variable was calculated about the said test example using the constructed prediction formula. The obtained prediction formula is as follows.
Y = (A 1 X 1 + A 2 X 2 + A 3 X 3 + A 4 X 4 + A 5 X 5 + A 6 X 6 + A 7 X 7 + A 8 X 8 + A 9 X 9 + A 10 X 10 + C) x 100
here,
X 1 is the gene expression level of DLGAP3 (GE54859 SEQ ID NO: 42)
X 2 is the gene expression level of KCN15J (GE58277 SEQ ID NO: 77)
X 3 is the gene expression level of GPR30 (GE80129 SEQ ID NO: 165)
X 4 is the gene expression level of NPCR (GE540583 SEQ ID NO: 34)
X 5 is gene expression level TMED1 (GE85017 SEQ ID NO: 200)
X 6, the gene expression level of PAFAH2 (GE62881 SEQ ID NO: 122)
X 7 is a gene expression level of TMEM23 (GE60313 SEQ ID NO: 98)
X 8 is the gene expression level of ABCG1 (GE586854 SEQ ID NO: 81)
X 9 is the gene expression level of PGRMC1 (GE62032 SEQ ID NO: 109)
X 10 indicates the gene expression level of INSL3 (GE88024 SEQ ID NO: 218), respectively.
The coefficient multiplied by the expression level of each gene is
A 1 is 1.00019621196698
A 2 is 0.273175387505458
A 3 is 0.606651443546423
A 4 is -0.659859665599205
A 5 is, -0.287215519193429
A 6 is, -0.271285204843002
A 7 is -0.220049126802913
A 8 is -0.00285057386315785
A 9 is 0.478133475554455
A 10 is -0.169744977943406
Constant C is 0.0429404615746508
It is.
 従属変数50をカットオフ値として集計した結果を表7及び表8に示す。また、学習例及び試験例の各検体について計算された従属変数を図2に示す。 Tables 7 and 8 show the results of tabulating the dependent variable 50 as a cutoff value. Moreover, the dependent variable calculated about each sample of a learning example and a test example is shown in FIG.
Figure JPOXMLDOC01-appb-T000016
感度:92.5%(37/40)、特異度:92.1%(35/38)、正答率:92.3%(72/78)
Figure JPOXMLDOC01-appb-T000016
Sensitivity: 92.5% (37/40), Specificity: 92.1% (35/38), Correct answer rate: 92.3% (72/78)
Figure JPOXMLDOC01-appb-T000017
感度:95.0%(19/20)、特異度:94.4%(17/18)、正答率:94.7%(36/38)
Figure JPOXMLDOC01-appb-T000017
Sensitivity: 95.0% (19/20), Specificity: 94.4% (17/18), Correct answer rate: 94.7% (36/38)
 以上の通り、カットオフ値を50とすると、試験例において感度・特異度ともに80%を超える結果となった。重回帰分析によっても、上記10種類の遺伝子発現量を用いて統合失調症と健常者とを精度良く分類することができた。 As described above, when the cutoff value is 50, both the sensitivity and specificity in the test example exceeded 80%. Multiple regression analysis was also able to classify schizophrenia and healthy subjects with high accuracy using the above 10 gene expression levels.

Claims (10)

  1.  生体から分離された試料における下記(1)~(10)の遺伝子群の発現量を指標とする、統合失調症の検出方法。
    (1) DLGAP3(配列番号1)
    (2) KCNJ15(配列番号2)
    (3) GPR30(配列番号3)
    (4) NPCR(配列番号4)
    (5) TMED1(配列番号5)
    (6) PAFAH2(配列番号6)
    (7) TMEM23(配列番号7)
    (8) ABCG1(配列番号8)
    (9) PGRMC1(配列番号9)
    (10) INSL3(配列番号10)
    A method for detecting schizophrenia, wherein the expression level of the following gene groups (1) to (10) in a sample isolated from a living body is used as an index.
    (1) DLGAP3 (SEQ ID NO: 1)
    (2) KCNJ15 (SEQ ID NO: 2)
    (3) GPR30 (SEQ ID NO: 3)
    (4) NPCR (SEQ ID NO: 4)
    (5) TMED1 (SEQ ID NO: 5)
    (6) PAFAH2 (SEQ ID NO: 6)
    (7) TMEM23 (SEQ ID NO: 7)
    (8) ABCG1 (SEQ ID NO: 8)
    (9) PGRMC1 (SEQ ID NO: 9)
    (10) INSL3 (SEQ ID NO: 10)
  2.  上記(1)~(10)の遺伝子群の発現量のみを指標とする請求項1記載の方法。 The method according to claim 1, wherein only the expression level of the gene group (1) to (10) is used as an index.
  3.  前記遺伝子群の発現量は、配列番号34、配列番号42、配列番号77、配列番号81、配列番号98、配列番号109、配列番号122、配列番号165、配列番号200及び配列番号218に示される塩基配列を有するオリゴヌクレオチドプローブにより測定される、請求項1又は2記載の方法。 The expression level of the gene group is shown in SEQ ID NO: 34, SEQ ID NO: 42, SEQ ID NO: 77, SEQ ID NO: 81, SEQ ID NO: 98, SEQ ID NO: 109, SEQ ID NO: 122, SEQ ID NO: 165, SEQ ID NO: 200 and SEQ ID NO: 218. The method according to claim 1 or 2, which is measured by an oligonucleotide probe having a base sequence.
  4.  前記試料における前記遺伝子群の発現量を、あらかじめ測定した、既知の統合失調症患者及び健常者における前記遺伝子群の発現量と対比する工程を含む、請求項1ないし3ののいずれか1項に記載の方法。 4. The method according to claim 1, comprising a step of comparing the expression level of the gene group in the sample with the expression level of the gene group in a known schizophrenia patient and a healthy person, which is measured in advance. The method described.
  5.  前記対比は、既知の統合失調症患者及び健常者における前記遺伝子群の発現量を用いて変数増加法により学習させたニューラルネットワークにより行なわれる請求項4記載の方法。 The method according to claim 4, wherein the comparison is performed by a neural network that is trained by a variable increment method using expression levels of the gene group in known schizophrenic patients and healthy individuals.
  6.  前記対比は、前記遺伝子群の発現量を説明変数として用いた重回帰分析により行なわれる請求項4記載の方法。 The method according to claim 4, wherein the comparison is performed by multiple regression analysis using the expression level of the gene group as an explanatory variable.
  7.  前記試料について計算された従属変数を、既知の統合失調症患者及び健常者について計算された従属変数に基づいて定められたカットオフ値と対比することを含む、請求項6記載の方法。 7. The method of claim 6, comprising comparing the dependent variable calculated for the sample with a cutoff value determined based on the dependent variable calculated for known schizophrenic patients and healthy individuals.
  8.  前記遺伝子群の発現量は、配列番号11~226に示される塩基配列を有するオリゴヌクレオチドプローブがスポットされたアレイにより測定される遺伝子発現量のシグナル強度をグローバルノーマライゼーション法により正規化したものである、請求項3ないし7のいずれか1項に記載の方法。 The expression level of the gene group is obtained by normalizing the signal intensity of the gene expression level measured by an array spotted with oligonucleotide probes having the nucleotide sequences shown in SEQ ID NOs: 11 to 226 by the global normalization method. 8. A method according to any one of claims 3-7.
  9.  前記アレイは配列番号227~525に示される塩基配列を有するオリゴヌクレオチドプローブをさらに含む、請求項8記載の方法。 The method according to claim 8, wherein the array further comprises oligonucleotide probes having the base sequences shown in SEQ ID NOs: 227 to 525.
  10.  前記試料は血液である請求項1ないし9のいずれか1項に記載の方法。 10. The method according to any one of claims 1 to 9, wherein the sample is blood.
PCT/JP2009/057861 2008-06-25 2009-04-20 Method of diagnosing integration dysfunction syndrome WO2009157251A1 (en)

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
JP2008-166582 2008-06-25
JP2008166582 2008-06-25
JP2008-226185 2008-09-03
JP2008226185A JP2010029171A (en) 2008-06-25 2008-09-03 Diagnostic method for integration dysfunction syndrome

Publications (1)

Publication Number Publication Date
WO2009157251A1 true WO2009157251A1 (en) 2009-12-30

Family

ID=41444321

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2009/057861 WO2009157251A1 (en) 2008-06-25 2009-04-20 Method of diagnosing integration dysfunction syndrome

Country Status (2)

Country Link
JP (1) JP2010029171A (en)
WO (1) WO2009157251A1 (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP5695328B2 (en) * 2010-02-24 2015-04-01 国立大学法人大阪大学 How to detect the likelihood of schizophrenia

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040024727A1 (en) * 2002-07-30 2004-02-05 Sandvine Incorporated Method and system of re-sharing files with modifications
US20050209181A1 (en) * 2003-11-05 2005-09-22 Huda Akil Compositions and methods for diagnosing and treating mental disorders
WO2008024114A1 (en) * 2006-08-24 2008-02-28 Genizon Biosciences Inc. Genemap of the human genes associated with schizophrenia
WO2009001095A2 (en) * 2007-06-28 2008-12-31 Mitsubishi Tanable Pharma Corporation Novel schizophrenia associated genes

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040024727A1 (en) * 2002-07-30 2004-02-05 Sandvine Incorporated Method and system of re-sharing files with modifications
US20050209181A1 (en) * 2003-11-05 2005-09-22 Huda Akil Compositions and methods for diagnosing and treating mental disorders
WO2008024114A1 (en) * 2006-08-24 2008-02-28 Genizon Biosciences Inc. Genemap of the human genes associated with schizophrenia
WO2009001095A2 (en) * 2007-06-28 2008-12-31 Mitsubishi Tanable Pharma Corporation Novel schizophrenia associated genes

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
HAKAK Y. ET AL.: "Genome-wide expression analysis reveals dysregulation of myelination- related genes in chronic schizophrenia.", PROC. NATL. ACAD. SCI. USA, vol. 98, 2001, pages 4746 - 4751 *
LEWIS D.A ET AL.: "Transcriptome alterations in schizophrenia: disturbing the functional architecture of the dorsolateral prefrontal cortex.", PROG. BRAIN RES., vol. 158, 2006, pages 141 - 152 *
YASUSHI KAJII: "Bunretsubyo Kanren Idenshi", SAISHIN SEISHIN IGAKU, vol. 6, 2001, pages 529 - 536 *

Also Published As

Publication number Publication date
JP2010029171A (en) 2010-02-12

Similar Documents

Publication Publication Date Title
KR20140044375A (en) Microrna biomarkers indicative of alzheimer's disease
US20030099979A1 (en) Method of testing for bronchial asthma
CN111662982B (en) Biomarker for early diagnosis and/or recurrence monitoring of brain glioma and application thereof
US20090136945A1 (en) Compositions and methods for assessing disorders
CN108676872A (en) A kind of and the relevant biomarker of asthma and its application
TWI682034B (en) Methods for assessing the risk of developing or diagnosing endometrial cancer
CN110229899B (en) Plasma marker combinations for early diagnosis or prognosis prediction of colorectal cancer
CN108728525B (en) Circulating microRNA biomarker for subarachnoid hemorrhage and application thereof
TW201625797A (en) Method and biomarkers for accessing the risk of having colorectal cancer
CN104087672A (en) Kit for quickly detecting number of human chromosomes 21 by multiplex real-time fluorescence quantitative PCR (polymerase chain reaction) technique
CN108300788A (en) A kind of micro RNA combination and its application for detecting light-duty brain trauma
WO2009157251A1 (en) Method of diagnosing integration dysfunction syndrome
CN104087671A (en) Kit used for detecting number of human chromosomes 21
WO2021039777A1 (en) Method for examining rheumatoid arthritis
TWI704927B (en) Diagnosis and treatment of psoriatic arthritis and kit thereof
JP7336517B2 (en) Biomarkers for diagnosis and/or prediction of frailty
WO2021060311A1 (en) Method for detecting brain tumor
WO2010026821A1 (en) Method for discrimination between bipolar disorder and schizophrenia
JP2011092100A (en) Gene marker estimating physiological state change and effect of factor giving change in physiological state, estimation method, estimation system and computer program
KR102346185B1 (en) Biomarker for predicting skin pigmentation risk and use thereof
JP2022025456A (en) Method for inspecting multiple sclerosis
KR102548873B1 (en) A method for sorting colorectal cancer and advanced neoplasia and use of the same
WO2020054474A1 (en) Method for examining rheumatoid arthritis
KR102325356B1 (en) Composition for diagnosis of malignancy of glioma using pseudogenes and use thereof
EP3940085A1 (en) Method for diagnosing celiac disease based on the level of expression of the ube2l3 gene

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 09769961

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 09769961

Country of ref document: EP

Kind code of ref document: A1