JP2009066088A

JP2009066088A - Multivariable analyzer and computer program

Info

Publication number: JP2009066088A
Application number: JP2007235895A
Authority: JP
Inventors: Nao Nemoto; 直根本; Ichiro Ando; 一郎安藤; Masako Fujiwara; 正子藤原; Kazunori Arifuku; 和紀有福
Original assignee: Jeol Ltd; National Institute of Advanced Industrial Science and Technology AIST
Current assignee: Jeol Ltd; National Institute of Advanced Industrial Science and Technology AIST
Priority date: 2007-09-11
Filing date: 2007-09-11
Publication date: 2009-04-02
Anticipated expiration: 2027-09-11
Also published as: JP5088731B2

Abstract

<P>PROBLEM TO BE SOLVED: To support a researcher to comprehensively or complementarily analyze different types of data. <P>SOLUTION: A multivariable analyzer displays multivariable analysis results of one or more types of samples (data) on the same GUI. When a user specifies a specific sample 250 of a specific type on the GUI, this multivariable analyzer extracts and highlightedly displays other types of samples 252 and 254 having commonality with the specified sample 250, and detects and highlightedly displays marker variables 256, 258 and 259 unique to the samples 250, 252 and 254. When the user specifies the specific variable 256 of the specific type on the GUI, the multivariable analyzer extracts and highlightedly displays the sample 250 unique to the specific variable 256, extracts and highlightedly displays other types of samples 252 and 254 having the commonality with the sample 250, and detects and highlightedly displays the marker variables 258 and 259 unique to the samples 252 and 254. <P>COPYRIGHT: (C)2009,JPO&INPIT

Description

本発明は、データの多変量解析を主とした各種分析を行うための装置及びコンピュータプログラムに関する。本発明は、広く様々な解析用途に適用可能であるが、その一例として、医学、創薬、食品又は生命科学などのための生命データの解析がある。 The present invention relates to an apparatus and a computer program for performing various analyzes mainly including multivariate analysis of data. The present invention can be applied to a wide variety of analysis applications, and examples thereof include analysis of life data for medicine, drug discovery, food or life science.

医学、創薬、食品又は生命科学などの研究を例にとると、例えば、フェノーム、メタボノーム、プロテオーム、トランスクリプトーム、ゲノムなどの種々の研究分野が存在する。研究分野により研究対象となるサンプルの種類及びサンプルから取得される生命データの種類（例えば、データの表す事象又はデータの構造など）は当然に異なる。サンプルから目的の生命データを最初に取得するための一次分析装置として、NMR（核磁気共鳴分析）装置、MS（質量分析）装置、ＤＮＡアレイ、MRI（磁気共鳴画像）装置などの種々のものがある。 Taking research such as medicine, drug discovery, food or life science as an example, there are various research fields such as phenome, metabonome, proteome, transcriptome, and genome. Naturally, the type of sample to be studied and the type of life data acquired from the sample (for example, the event represented by the data or the structure of the data) differ depending on the research field. Various primary analyzers for acquiring the desired life data from the sample first, such as NMR (nuclear magnetic resonance analyzer), MS (mass spectrometer), DNA array, MRI (magnetic resonance imaging) is there.

NMR装置、MS装置及びＤＮＡアレイでそれぞれ分析することによりNMRスペクトル、MSスペクトル、DNAスペクトルなど、サンプルに含まれる成分物質の異なる事象（核磁気共鳴、質量電荷比、DNAなど）に関するスペクトルデータが得られる。また、MRI装置を用いると、MRI画像と、CSI（ケミカル・シフト・イメージング）により得られるそのMR画像中の多数の部位（ボクセル）にそれぞれ対応する多数のCSIスペクトルデータ（NMRスペクトルデータの一種である）のセットが得られるが、MR画像とCSIスペクトルデータとではデータ構造が異なる（後者は多変量解析の対象となり得るが、後者は通常なり得ない）。 By analyzing with NMR equipment, MS equipment and DNA array respectively, spectral data on different events (nuclear magnetic resonance, mass-to-charge ratio, DNA, etc.) of the constituent substances contained in the sample such as NMR spectrum, MS spectrum, DNA spectrum etc. are obtained. It is done. In addition, when using an MRI system, MRI images and a number of CSI spectrum data (a kind of NMR spectrum data) corresponding to a number of sites (voxels) in the MR image obtained by CSI (Chemical Shift Imaging) respectively. The MR image and CSI spectrum data have different data structures (the latter can be the target of multivariate analysis, but the latter cannot be normal).

生命データを例えば統計数学の手法を用いて２次的に解析するための装置として、従来、特定の研究分野のための特定種類の生命データの解析に適するように構成されたものが知られている。例えば、特許文献１には、メタボノミクスのためのNMRスペクトルの多変量解析に適したデータ処理装置が開示されている。 As a device for secondary analysis of life data using, for example, a statistical mathematics method, a device that has been configured to be suitable for analyzing a specific type of life data for a specific research field has been known. Yes. For example, Patent Document 1 discloses a data processing apparatus suitable for multivariate analysis of NMR spectra for metabonomics.

従来、上述した各分野での研究の多くは、他分野での研究からは独立して行われ、そこでは、その分野独自の視点から特定種類の生命データが選択され、その特定種類の生命データついて、その分野独自の視点に基づく方法による解析と判断が行われる。 Conventionally, much of the research in each field described above has been performed independently of research in other fields, where a specific type of life data is selected from the perspective unique to that field, and that specific type of life data is selected. Then, analysis and judgment are performed by a method based on the unique viewpoint of the field.

特開２００６−３３７３５４号公報JP 2006-337354 A

研究分野、研究目的又は研究者が異なると、取り扱われる生命データの種類（事象又はデータ構造）が異なったり、解析方法や判断方法が異なったりする。また、研究者の違いによる判断のばらつきもある。各分野の研究では、他の研究分野での研究成果、他種類の生命データの解析結果、又は、他の解析方法による解析結果を、積極的に取り入れて活用するということはあまり行われていない。 If the research field, research purpose, or researcher is different, the type (event or data structure) of life data to be handled will be different, or the analysis method or judgment method will be different. There are also variations in judgment due to differences in researchers. In research in each field, it is not often done to actively incorporate and utilize research results in other research fields, analysis results of other types of life data, or analysis results by other analysis methods. .

しかし、フェノーム、メタボノーム、プロテオーム、トランスクリプトーム、ゲノムなどの異なる分野で研究対象とされる異なるレベルの生命現象は、生体内で独立して存在しているわけではなく、複雑な（そして、多くは未知の）因果関係で互いに結ばれている筈である。また、同一、類似又は関連するサンプルに関して、NMR装置、MS装置、ＤＮＡアレイ及びMRI装置などの異なる一次分析装置から得られた生命データの間に、何らかの（そして、多くは未知の）関連性が存在するであろう。 However, the different levels of life phenomena studied in different fields, such as phenome, metabonome, proteome, transcriptome, and genome, do not exist independently in vivo, but are complex (and many Are connected to each other by (unknown) causality. Also, for the same, similar or related samples, there is some (and many unknown) association between life data obtained from different primary analyzers such as NMR instruments, MS instruments, DNA arrays and MRI instruments. Will exist.

従って、もし、一人の研究者（又は一つの研究者グループ）が、複数の異なる分野の生命データ、又は複数の異なる種類の生命データを、総合的に又は相補的に解析することができたならば、より確度の高い新規なマーカが発見できたり、異分野の生命現象間又は異種類の生命データ間の未知の相関関係が解明されるなど、より有益で新規な研究成果が得られる可能性が高い。これを行うことは、取り扱うべきデータの種類が多く且つデータ数が膨大であるために、従来の解析装置では不可能である。 Therefore, if a single researcher (or a group of researchers) was able to analyze life data from different fields, or different types of life data, comprehensively or complementarily. For example, new markers with higher accuracy can be discovered, and unknown correlations between life phenomena in different fields or between different types of life data can be elucidated. Is expensive. This is not possible with a conventional analysis apparatus because the number of types of data to be handled is large and the number of data is enormous.

同様の問題は、生命データ以外の他の用途分野のデータの解析においても、存在するであろう。 Similar problems may exist in the analysis of data for other application fields other than life data.

従って、本発明の目的は、研究者が、異種類のデータを総合的に又は相補的に解析することを、支援することにある。 Accordingly, an object of the present invention is to assist researchers in analyzing different types of data comprehensively or in a complementary manner.

本発明の別の目的は、研究者が、得られた全ての情報（例えば、生命データの解析結果、解析対象の生命データとは別途に入手された付加的なデータ、その他関連する各種の知見など）から、着目の挙動の特徴や、異分野又は異種類の情報間の相関関係を視覚的に把握して、総合的な判断を下すことを、支援することにある。 Another object of the present invention is that researchers have obtained all the information obtained (for example, analysis results of life data, additional data obtained separately from the life data to be analyzed, and various other related knowledge). Etc.) to visually grasp the characteristics of the behavior of interest and the correlation between different fields or different types of information, and to make a comprehensive judgment.

本発明の一つの側面に従う多変量解析装置は、
サンプル群の対象データ群であって、個々の対象データが複数の変数の値をもつ、前記対象データ群を記憶する対象データ記憶手段と、
前記対象データ群を用いて多変量解析を行う多変量解析手段と、前記対象データ群を用いて特徴抽出処理を行う特徴抽出手段とを有する計算・制御手段と、
前記多変量解析の結果と前記特徴抽出処理の結果をグラフィカル・ユーザ・インタフェース（以下、GUIと略称する）に可視的に表示する表示手段と
を備え、
前記計算・制御手段の前記多変量解析手段は、前記多変量解析によって判明した前記サンプル群の特性と前記複数の変数の特性とを表す複数の多変量解析チャートを生成し、
前記表示手段は、前記複数のチャートを前記GUI上に表示し、
前記計算・制御手段の前記特徴抽出手段は、前記サンプル群中の任意のサンプルサブ群又は前記複数の変数中の任意の変数を指定し、前記対象データ群に基づいて、前記指定されたサンプルサブ群に特徴的な変数、若しくは前記指定された変数に特徴的なサンプルサブ群、又は、前記対象データ記憶手段に他のサンプル群の対象データ群も記憶されている場合には前記他のサンプル群中で前記指定されたサンプルサブ群に関連するサンプルサブ群、を検出し、
前記表示手段は、前記指定されたサンプルサブ群又は前記指定された変数と、前記検出された特徴的な変数、若しくは前記検出された特徴的なサンプルサブ群、又は前記検出された関連するサンプルサブ群を、前記GUI上でハイライトして表示する。 A multivariate analyzer according to one aspect of the present invention is:
Target data storage means for storing the target data group, the target data group of the sample group, each target data having a plurality of variable values;
A calculation / control unit having a multivariate analysis unit that performs multivariate analysis using the target data group; and a feature extraction unit that performs a feature extraction process using the target data group;
Display means for visually displaying the result of the multivariate analysis and the result of the feature extraction processing on a graphical user interface (hereinafter abbreviated as GUI);
The multivariate analysis means of the calculation / control means generates a plurality of multivariate analysis charts representing characteristics of the sample group and characteristics of the plurality of variables found by the multivariate analysis,
The display means displays the plurality of charts on the GUI,
The feature extraction unit of the calculation / control unit specifies an arbitrary sample sub group in the sample group or an arbitrary variable in the plurality of variables, and the specified sample sub group is based on the target data group. Variables characteristic of the group, sample subgroups characteristic of the designated variable, or other sample groups if the target data group of the other sample groups is also stored in the target data storage means Detecting a sample subgroup associated with said designated sample subgroup,
The display means includes the designated sample subgroup or the designated variable and the detected characteristic variable, or the detected characteristic sample subgroup, or the detected related sample sub. Groups are highlighted on the GUI.

本発明の多変量解析装置の一実施形態においては、
前記多変量解析手段が行う前記多変量解析には、主成分解析が含まれ、
前記GUI上に表示される前記複数の多変量解析チャートには、
A. 所定の主成分軸からなる座標空間にて、前記サンプル群のそれぞれのサンプルが、所定の主成分についてのスコアに応じた座標にプロットされたサンプル・プロット・チャート、
B. 所定の主成分軸からなる座標空間にて前記複数の変数が、所定の主成分についてのローディングに応じた座標にプロットされた変数プロット・チャート、
C. 前記複数の変数の所定の主成分ついての寄与率が図示された寄与率チャート、及び
D. 前記複数の変数の所定の主成分についてのローディングの和が図示されたローディング・チャート、
のうちの少なくとも二つが含まれ、
前記表示手段は、前記指定されたサンプルサブ群又は前記検出された関連するサンプルサブ群を、前記サンプル・プロット・チャート上でハイライトして表示し、前記検出されたマーカ変数を、前記変数プロット・チャート、前記寄与率チャート、及び前記ローディング・チャートの内の少なくとも一つのチャート上でハイライトして表示する。 In one embodiment of the multivariate analyzer of the present invention,
The multivariate analysis performed by the multivariate analysis means includes principal component analysis,
In the plurality of multivariate analysis charts displayed on the GUI,
A. A sample plot chart in which each sample of the sample group is plotted at coordinates according to a score for a predetermined principal component in a coordinate space consisting of a predetermined principal component axis,
B. Variable plot chart in which the plurality of variables are plotted in coordinates according to loading for a predetermined principal component in a coordinate space consisting of a predetermined principal component axis;
C. A contribution chart illustrating contribution ratios of the plurality of variables for a predetermined principal component, and
D. a loading chart illustrating the sum of loadings for a given principal component of the plurality of variables;
At least two of the
The display means highlights and displays the designated sample subgroup or the detected related sample subgroup on the sample plot chart, and displays the detected marker variable in the variable plot. • Highlight and display on at least one of the chart, the contribution ratio chart, and the loading chart.

本発明の多変量解析装置の一実施形態においては、前記計算・制御手段の前記特徴抽出手段は、
前記サンプル群中から任意のサンプルサブ群を指定するサンプル指定手段と、
前記対象データ群がもつ前記変数の値に基づいて、前記指定されたサンプルサブ群に共通する１以上の特徴的な変数を、前記複数の変数中からマーカ変数として検出するマーカ変数検出手段と、
を有し、
前記表示手段は、前記指定されたサンプルサブ群と前記検出されたマーカ変数とを前記GUI上でハイライトして表示する。 In one embodiment of the multivariate analysis apparatus of the present invention, the feature extraction unit of the calculation / control unit includes:
Sample designating means for designating an arbitrary sample subgroup from the sample group;
Marker variable detection means for detecting one or more characteristic variables common to the specified sample subgroup as marker variables from the plurality of variables based on the values of the variables of the target data group;
Have
The display means highlights and displays the designated sample subgroup and the detected marker variable on the GUI.

上記実施形態においては、さらに、前記マーカ変数検出手段は、前記サンプル群の対象データ群全体での各変数の値の分布を求め、求められた前記対象データ群全体での各変数の値の分布と、前記指定されたサンプルサブ群の対象データがもつ各変数の値とに基づいて、前記指定されたサンプルサブ群に共通する１以上の特徴的な変数を検出する。 In the above-described embodiment, the marker variable detecting unit further obtains a distribution of values of each variable in the entire target data group of the sample group, and distributes the values of each variable in the obtained entire target data group. And one or more characteristic variables common to the designated sample subgroup are detected based on the values of the variables of the target data of the designated sample subgroup.

上記実施形態においては、またさらに、前記多変量解析手段が行う前記多変量解析には、主成分解析が含まれ、前記マーカ変数検出手段は、前記検出されたマーカ変数に対して、前記主成分解析で得られた寄与率に基づいたフィルタリングを行なって、より確度の高いマーカ変数に絞り込む。 In the above embodiment, the multivariate analysis performed by the multivariate analysis unit includes principal component analysis, and the marker variable detection unit performs the principal component analysis on the detected marker variable. Filtering based on the contribution ratio obtained by the analysis is performed to narrow down to marker variables with higher accuracy.

本発明の多変量解析装置の一実施形態においては、
前記計算・制御手段の前記特徴抽出手段は、
前記複数の変数中から任意の変数をマーカ変数として指定するマーカ変数指定手段と、
前記サンプル群の対象データ群がもつ各変数の値に基づいて、前記指定されたマーカ変数に共通する１以上の特徴的なサンプルサブ群を検出する特徴サンプル検出手段と
を有し、
前記表示手段は、前記指定されたマーカ変数と前記検出された特徴的なサンプルサブ群とを前記GUI上でハイライトして表示する。 In one embodiment of the multivariate analyzer of the present invention,
The feature extraction means of the calculation / control means is:
Marker variable designating means for designating an arbitrary variable among the plurality of variables as a marker variable;
Characteristic sample detection means for detecting one or more characteristic sample subgroups common to the designated marker variable based on the value of each variable of the target data group of the sample group;
The display means highlights and displays the designated marker variable and the detected characteristic sample subgroup on the GUI.

上記実施形態においては、さらに、前記特徴サンプル検出手段は、前記対象データ群全体での前記指定されたマーカ変数の各々の値の分布を求め、求まった前記対象データ群全体での前記指定されたマーカ変数の各々の値の分布と、各サンプルの対象データがもつ前記指定されたマーカ変数の各々の値とに基づいて、前記特徴的なサンプルサブ群を検出する。 In the above-described embodiment, the feature sample detection means further obtains a distribution of values of each of the specified marker variables in the entire target data group, and determines the specified in the entire target data group thus obtained. The characteristic sample sub-group is detected based on the distribution of the values of the marker variables and the values of the designated marker variables included in the target data of the samples.

上記実施形態においては、またさらに、前記多変量解析手段が行う前記多変量解析には、主成分解析が含まれ、前記特徴サンプル検出手段は、前記指定されたマーカ変数に対して、前記主成分解析で得られた寄与率に基づいたフィルタリングを行なって、より確度の高いマーカ変数に絞り込む。 In the above embodiment, the multivariate analysis performed by the multivariate analysis means includes principal component analysis, and the feature sample detection means performs the principal component analysis on the designated marker variable. Filtering based on the contribution ratio obtained by the analysis is performed to narrow down to marker variables with higher accuracy.

本発明の多変量解析装置の一実施形態においては、
前記サンプル群の１種類以上の付加データ群を記憶する付加データ記憶手段をさらに備え、
前記計算・制御手段の前記特徴抽出手段は、
前記１種類以上の付加データ群中から任意の種類の付加データ群を指定する付加データ指定手段と、
前記指定された種類の付加データ群の値に基づいて、前記指定された種類の付加データ群に共通する１以上の特徴的なサンプルサブ群を、前記サンプル群中から検出する特徴サンプル検出手段と、
前記対象データ群がもつ各変数の値に基づいて、前記検出された特徴的なサンプルサブ群に共通する１以上の特徴的な変数を、前記複数の変数中からマーカ変数として検出するマーカ変数検出手段と、
を有し、
前記表示手段は、前記検出された特徴的なサンプルサブ群と前記検出されたマーカ変数とを前記GUI上でハイライトして表示する。 In one embodiment of the multivariate analyzer of the present invention,
Further comprising additional data storage means for storing one or more additional data groups of the sample group;
The feature extraction means of the calculation / control means is:
Additional data designating means for designating an arbitrary additional data group from the one or more additional data groups;
Characteristic sample detection means for detecting one or more characteristic sample subgroups common to the designated type of additional data group from the sample group based on the value of the designated type of additional data group; ,
Marker variable detection that detects one or more characteristic variables common to the detected characteristic sample subgroup as marker variables from the plurality of variables based on the values of the variables of the target data group Means,
Have
The display means highlights and displays the detected characteristic sample sub-group and the detected marker variable on the GUI.

上記実施形態においては、さらに、
前記特徴サンプル検出手段は、前記指定された種類の付加データ群全体での前記付加データの値の分布を求め、求まった前記付加データ群全体での前記付加データの値の分布と、各サンプルの付加データがもつ値とに基づいて、前記特徴的なサンプルサブ群を検出し、
前記マーカ変数検出手段は、前記対象データ群全体での各変数の値の分布を求め、求められた前記対象データ群全体での各変数の値の分布と、前記検出された特徴的なサンプルサブ群の対象データがもつ各変数の値とに基づいて、前記指定されたサンプルサブ群に共通する１以上の特徴的な変数を検出する。 In the above embodiment, further,
The feature sample detection means obtains a distribution of the value of the additional data in the entire additional data group of the specified type, obtains a distribution of the value of the additional data in the entire additional data group thus obtained, Detecting the characteristic sample subgroup based on the value of the additional data;
The marker variable detection means obtains a distribution of values of each variable in the entire target data group, and obtains a distribution of values of each variable in the obtained entire target data group and the detected characteristic sample sub One or more characteristic variables common to the designated sample subgroup are detected based on the value of each variable included in the target data of the group.

本発明の多変量解析装置の一実施形態においては、
前記サンプル群の１種類以上の付加データ群を記憶する付加データ記憶手段をさらに備え、
前記計算・制御手段の前記多変量解析手段は、前記多変量解析として少なくとも主成分解析を行い、前記主成分解析の結果に基づいて、所定の１以上の主成分軸からなる座標空間にて、前記サンプル群のそれぞれのサンプルが、所定の主成分についてのスコアに応じた座標にプロットされたサンプル・プロット・チャートを、前記多変量解析チャートの一つとして作成し、
前記表示手段は、前記サンプル・プロット・チャートを前記GUI上に表示し、
前記計算・制御手段の前記特徴抽出手段は、
前記所定の１以上の主成分軸中から任意の主成分軸を指定し、且つ、前記１種類以上の付加データ群中から任意の種類の付加データ群を指定する付加データ指定手段と、
前記主成分解析の結果と前記指定された種類の付加データ群の値とに基づいて、前記サンプル・プロット・チャート中の前記指定された主成分軸を前記指定された種類の付加データ群の値に対応する座標軸に置換してなる置換サンプル・プロット・チャートを作成する主成分軸置換手段と
を有し、
前記表示手段は、前記置換サンプル・プロット・チャートを前記GUI上に表示する。 In one embodiment of the multivariate analyzer of the present invention,
Further comprising additional data storage means for storing one or more additional data groups of the sample group;
The multivariate analysis means of the calculation / control means performs at least a principal component analysis as the multivariate analysis, and based on a result of the principal component analysis, in a coordinate space composed of one or more principal component axes, A sample plot chart in which each sample of the sample group is plotted at coordinates according to a score for a predetermined main component is created as one of the multivariate analysis charts,
The display means displays the sample plot chart on the GUI;
The feature extraction means of the calculation / control means is:
Additional data designating means for designating an arbitrary principal component axis from the predetermined one or more principal component axes and designating an optional additional data group from the one or more additional data groups;
Based on the result of the principal component analysis and the value of the specified type of additional data group, the specified principal component axis in the sample plot chart is the value of the specified type of additional data group. And a principal component axis replacement means for creating a replacement sample / plot / chart that is replaced by a coordinate axis corresponding to
The display means displays the replacement sample / plot chart on the GUI.

本発明の多変量解析装置の一実施形態においては、
前記対象データ記憶手段は、
第１のサンプル群の第１の対象データ群であって、個々の対象データが複数の変数の値をもつ、前記第１の対象データ群と、
第２のサンプル群の第２の対象データ群であって、個々の対象データが複数の変数の値をもつ、前記第２の対象データ群と
を記憶し、前記第１と第２のサンプル群間で互いに共通性のあるサンプル同士を関連づけており、
前記計算・制御手段の前記多変量解析手段は、前記第１の対象データ群を用いた第１の多変量解析と前記第２の対象データ群を用いた第２の多変量解析とを行ない、前記第１の多変量解析の結果を示す１以上の第１のチャートと、前記第２の多変量解析の結果を示す１以上の第２のチャートとを作成し、
前記表示手段は、前記第１のチャートと前記第２のチャートとを前記GUI上に表示し、
前記計算・制御手段の前記特徴抽出手段は、
前記第１のサンプル群中から任意の第１のサンプルサブ群を指定するサンプル指定手段と、
前記第１と第２のサンプル群間の関連付けに基づいて、前記指定された第１のサンプルサブ群に共通性のある第２のサンプルサブ群を、前記第２のサンプル群中から検索する関連サンプル検索手段と、
を有し、
前記表示手段は、前記指定された第１のサンプルサブ群と前記検出された第２のサンプルサブ群とを前記GUI上でハイライトして表示する。 In one embodiment of the multivariate analyzer of the present invention,
The target data storage means is
A first target data group of a first sample group, wherein each target data has a plurality of variable values;
A second target data group of the second sample group, each of the target data having a plurality of variable values, and storing the second target data group; and the first and second sample groups Between the samples that are common to each other,
The multivariate analysis means of the calculation / control means performs a first multivariate analysis using the first target data group and a second multivariate analysis using the second target data group, Creating one or more first charts showing the results of the first multivariate analysis and one or more second charts showing the results of the second multivariate analysis;
The display means displays the first chart and the second chart on the GUI,
The feature extraction means of the calculation / control means is:
Sample designating means for designating an arbitrary first sample sub-group from the first sample group;
An association for searching the second sample group for a second sample subgroup that is common to the designated first sample subgroup based on the association between the first and second sample groups Sample search means;
Have
The display means highlights and displays the designated first sample subgroup and the detected second sample subgroup on the GUI.

上記実施形態においては、さらに、
前記計算・制御手段の前記特徴抽出手段は、
前記第１の対象データ群がもつ各変数の値に基づいて、前記指定された第１のサンプルサブ群に共通する１以上の特徴的な変数を、前記第１の対象データ群がもつ前記複数の変数中から、第１のマーカ変数として検出する第１のマーカ変数検出手段と、
前記第２の対象データ群がもつ各変数の値に基づいて、前記検索された第２のサンプルサブ群に共通する１以上の特徴的な変数を、前記第２の対象データ群がもつ前記複数の変数中から、第２のマーカ変数として検出する第２のマーカ変数検出手段と
をさらに備え、
前記表示手段は、前記検出された第１と第２のマーカ変数を前記GUI上でハイライトして表示する。 In the above embodiment, further,
The feature extraction means of the calculation / control means is:
Based on the value of each variable of the first target data group, the plurality of the first target data group have one or more characteristic variables common to the designated first sample subgroup. First marker variable detecting means for detecting as a first marker variable from among the variables of
Based on the value of each variable of the second target data group, the plurality of the second target data group has one or more characteristic variables common to the searched second sample subgroup. A second marker variable detecting means for detecting as a second marker variable from the variables of
The display means highlights and displays the detected first and second marker variables on the GUI.

本発明の多変量解析装置の一実施形態においては、
前記対象データ記憶手段は、
第１のサンプル群の第１の対象データ群であって、個々の対象データが複数の変数の値をもつ、前記第１の対象データ群と、
第２のサンプル群の第２の対象データ群であって、個々の対象データが複数の変数の値をもつ、前記第２の対象データ群と
を記憶し、前記第１と第２のサンプル群間で互いに共通性のあるサンプル同士を関連づけており、
前記計算・制御手段の前記多変量解析手段は、前記第１の対象データ群を用いた第１の多変量解析と前記第２の対象データ群を用いた第２の多変量解析とを行ない、前記第１の多変量解析の結果を示す１以上の第１のチャートと、前記第２の多変量解析の結果を示す１以上の第２のチャートとを作成し、
前記表示手段は、前記第１のチャートと前記第２のチャートとを前記GUI上に表示し、
前記計算・制御手段の前記特徴抽出手段は、
前記第１の対象データ群がもつ前記複数の変数中から任意の変数をマーカ変数として指定するマーカ変数指定手段と、
前記第１の対象データ群がもつ各変数の値に基づいて、前記指定されたマーカ変数に共通する１以上の特徴的な第１のサンプルサブ群を、前記第１のサンプル群中から検出する特徴サンプル検出手段と、
前記第１と第２のサンプル群間の関連付けに基づいて、前記検出された第１のサンプルサブ群に共通性のある第２のサンプルサブ群を、前記第２のサンプル群中から検索する関連サンプル検索手段と、
を有し、
前記表示手段は、前記指定されたマーカ変数と前記検出された第１の特徴的なサンプルサブ群と前記検索された第２の特徴的なサンプルサブ群とを前記GUI上でハイライトして表示する。 In one embodiment of the multivariate analyzer of the present invention,
The target data storage means is
A first target data group of a first sample group, wherein each target data has a plurality of variable values;
A second target data group of the second sample group, each of the target data having a plurality of variable values, and storing the second target data group; and the first and second sample groups Between the samples that are common to each other,
The multivariate analysis means of the calculation / control means performs a first multivariate analysis using the first target data group and a second multivariate analysis using the second target data group, Creating one or more first charts showing the results of the first multivariate analysis and one or more second charts showing the results of the second multivariate analysis;
The display means displays the first chart and the second chart on the GUI,
The feature extraction means of the calculation / control means is:
Marker variable designating means for designating an arbitrary variable among the plurality of variables of the first target data group as a marker variable;
Based on the value of each variable of the first target data group, one or more characteristic first sample subgroups common to the designated marker variable are detected from the first sample group. A feature sample detection means;
An association for searching the second sample group for a second sample subgroup that is common to the detected first sample subgroup based on the association between the first and second sample groups Sample search means;
Have
The display means highlights and displays the designated marker variable, the detected first characteristic sample sub-group, and the searched second characteristic sample sub-group on the GUI. To do.

上記実施形態においては、さらに、
前記計算・制御手段の前記特徴抽出手段は、
前記第２の対象データ群がもつ各変数の値に基づいて、前記検索された第２のサンプルサブ群に共通する１以上の特徴的な変数を、前記第２の対象データ群がもつ前記複数の変数中から、関連マーカ変数として検出する第２のマーカ変数検出手段と
をさらに備え、
前記表示手段は、前記指定されたマーカ変数と前記検出された第１の特徴的なサンプルサブ群と前記検索された第２の特徴的なサンプルサブ群だけでなく、前記検出された関連マーカ変数も、前記GUI上でハイライトして表示する。 In the above embodiment, further,
The feature extraction means of the calculation / control means is:
Based on the value of each variable of the second target data group, the plurality of the second target data group has one or more characteristic variables common to the searched second sample subgroup. Second marker variable detecting means for detecting as a related marker variable from among the variables of
The display means includes not only the designated marker variable, the detected first characteristic sample subgroup and the searched second characteristic sample subgroup, but also the detected related marker variable. Is also highlighted on the GUI.

本発明の多変量解析装置の一実施形態においては、前記サンプル群のそれぞれのサンプルに関連付けられた複数のデータ部分を有する別構造データを記憶した別構造データ記憶手段を更に備え、
前記計算・制御手段は、前記別構造データを図示した別構造データ・チャートを作成する手段をさらに有し、
前記表示手段は、前記複数の多変量解析チャートだけでなく前記別構造データ・チャートも前記GUI上に表示し、
前記計算・制御手段の前記特徴抽出手段は、
前記別構造データの前記複数のデータ部分中から任意のデータ部分を指定するデータ部分指定手段と、
前記サンプル群中から、前記指定されたデータ部分に関連付けられたサンプルサブ群を検出する関連サンプル検出手段と、
前記対象データ群がもつ各変数の値に基づいて、前記検出されたサンプルサブ群に共通する１以上の特徴的な変数を、前記複数の変数中からマーカ変数として検出するマーカ変数検出手段と、
を有し、
前記表示手段は、前記指定されたデータ部分と前記検出されたサンプルサブ群と前記検索されたマーカ変数とを前記GUI上でハイライトして表示する。 In one embodiment of the multivariate analysis apparatus of the present invention, the multivariate analysis apparatus further comprises another structure data storage means for storing another structure data having a plurality of data portions associated with each sample of the sample group,
The calculation / control means further includes means for creating a separate structure data chart illustrating the separate structure data,
The display means displays not only the plurality of multivariate analysis charts but also the separate structure data chart on the GUI,
The feature extraction means of the calculation / control means is:
Data portion specifying means for specifying an arbitrary data portion from among the plurality of data portions of the separate structure data;
An associated sample detection means for detecting a sample subgroup associated with the designated data portion from the sample group;
Marker variable detection means for detecting one or more characteristic variables common to the detected sample subgroup as marker variables from the plurality of variables based on the values of the variables of the target data group;
Have
The display means highlights and displays the designated data portion, the detected sample subgroup, and the searched marker variable on the GUI.

本発明の多変量解析装置の一実施形態においては、
前記サンプル群のそれぞれのサンプルに関連付けられた複数のデータ部分を有する別構造データを記憶した別構造データ記憶手段を更に備え、
前記計算・制御手段は、前記別構造データを図示した別構造データ・チャートを作成する手段をさらに有し、
前記表示手段は、前記複数の多変量解析チャートだけでなく前記別構造データ・チャートも前記GUI上に表示し、
前記計算・制御手段の前記特徴抽出手段は、
前記サンプル群中から任意のサンプルサブ群を指定するサンプル指定手段と、
前記別構造データの前記複数のデータ部分中から、前記指定されたサンプルサブ群に関連付けられたデータ部分を検出する関連データ部分検出手段と、
前記対象データ群がもつ各変数の値に基づいて、前記指定されたサンプルサブ群に共通する１以上の特徴的な変数を、前記複数の変数中からマーカ変数として検出するマーカ変数検出手段と、
を有し、
前記表示手段は、前記指定されたサンプルサブ群と前記検出されたデータ部分と前記検索されたマーカ変数とを前記GUI上でハイライトして表示する。 In one embodiment of the multivariate analyzer of the present invention,
Further comprising another structure data storage means for storing another structure data having a plurality of data portions associated with each sample of the sample group,
The calculation / control means further includes means for creating a separate structure data chart illustrating the separate structure data,
The display means displays not only the plurality of multivariate analysis charts but also the separate structure data chart on the GUI,
The feature extraction means of the calculation / control means is:
Sample designating means for designating an arbitrary sample subgroup from the sample group;
Related data portion detection means for detecting a data portion associated with the designated sample subgroup from the plurality of data portions of the separate structure data;
Marker variable detection means for detecting one or more characteristic variables common to the designated sample subgroup as marker variables from the plurality of variables based on the values of the variables of the target data group;
Have
The display means highlights and displays the designated sample subgroup, the detected data portion, and the searched marker variable on the GUI.

本発明の多変量解析装置の一実施形態においては、
前記サンプル群のそれぞれのサンプルに関連付けられた複数のデータ部分を有する別構造データを記憶した別構造データ記憶手段を更に備え、
前記計算・制御手段は、前記別構造データを図示した別構造データ・チャートを作成する手段をさらに有し、
前記表示手段は、前記複数の多変量解析チャートだけでなく前記別構造データ・チャートも前記GUI上に表示し、
前記計算・制御手段の前記特徴抽出手段は、
前記複数の変数中から、任意の変数をマーカ変数として指定するマーカ変数指定手段と、
前記対象データ群がもつ各変数の値に基づいて、前記指定されたマーカ変数に共通する１以上の特徴的なサンプルサブ群を、前記サンプル群中から検出する特徴サンプル検出手段と、
前記別構造データの前記複数のデータ部分中から、前記検出された特徴的なサンプルサブ群に関連付けられたデータ部分を検出する関連データ部分検出手段と、
を有し、
前記表示手段は、前記指定されたマーカ変数と前記検出された特徴的なサンプルサブ群と前記検出されたデータ部分とを前記GUI上でハイライトして表示する。 In one embodiment of the multivariate analyzer of the present invention,
Further comprising another structure data storage means for storing another structure data having a plurality of data portions associated with each sample of the sample group,
The calculation / control means further includes means for creating a separate structure data chart illustrating the separate structure data,
The display means displays not only the plurality of multivariate analysis charts but also the separate structure data chart on the GUI,
The feature extraction means of the calculation / control means is:
Marker variable designation means for designating an arbitrary variable as a marker variable from among the plurality of variables;
Feature sample detection means for detecting one or more characteristic sample subgroups common to the designated marker variable from the sample group based on the values of the variables of the target data group;
Related data portion detection means for detecting a data portion associated with the detected characteristic sample subgroup from the plurality of data portions of the different structure data;
Have
The display means highlights and displays the designated marker variable, the detected characteristic sample subgroup, and the detected data portion on the GUI.

本発明はまた、コンピュータを上述の多変量解析装置として機能させるためのコンピュータプログラムも提供する。 The present invention also provides a computer program for causing a computer to function as the above-described multivariate analyzer.

本発明によれば、研究者にとり、異種類の事象又は次元のデータを総合的に又は相補的に解析することが容易になる。 According to the present invention, it becomes easy for researchers to analyze data of different kinds of events or dimensions comprehensively or complementarily.

以下、図面を参照して、本発明の一実施形態にかかる多変量解析装置について説明する。 Hereinafter, a multivariate analyzer according to an embodiment of the present invention will be described with reference to the drawings.

図１は、本実施形態に係る多変量解析装置の全体的な処理の流れを示す。 FIG. 1 shows the overall processing flow of the multivariate analyzer according to this embodiment.

本実施形態に係る多変量解析装置は、その機能の少なくとも一部は典型的には図１に示す処理をコンピュータに行わせるためのコンピュータプログラムがインストールされたコンピュータによって実現される。しかし、変形例として、本発明の多変量解析装置の少なくとも一部の機能が、専用のワーヤードロジック回路によって実現されてもよい、
図１に示すように、本実施形態に係る多変量解析装置は、元データ入力処理１００、元データ処理１０２、多変量解析処理１０４及び特徴抽出処理１０６を行うことができる。 In the multivariate analysis apparatus according to the present embodiment, at least a part of its functions is typically realized by a computer in which a computer program for causing the computer to perform the processing shown in FIG. 1 is installed. However, as a modification, at least a part of the functions of the multivariate analysis apparatus of the present invention may be realized by a dedicated word logic circuit.
As shown in FIG. 1, the multivariate analysis apparatus according to this embodiment can perform an original data input process 100, an original data process 102, a multivariate analysis process 104, and a feature extraction process 106.

元データ入力処理１００では、外部の任意の１以上の種類のセンシング装置（例えば、生命データの場合であれば、NMR分析装置、MS装置、ＤＮＡアレイ、MRI装置、X線CT撮影装置、又は各種の生化学検査装置など）を用いて取得された、多数のサンプルについての元データが、この多変量解析装置に入力される。 In the original data input processing 100, one or more types of external sensing devices (for example, in the case of life data, an NMR analyzer, an MS device, a DNA array, an MRI device, an X-ray CT imaging device, or various types The original data for a large number of samples obtained by using a biochemical examination apparatus or the like is input to the multivariate analysis apparatus.

元データ処理１０２では、入力された各サンプルの元データに基づいて、その元データが表す実態をユーザに視覚的に確認させるためのデータ（以下、「実態データ」という）と、多変量解析の直接の対象となるデータ（以下、「対象データ」という）とが生成される。実態データは、例えば、NMRスペクトルデータ、MSスペクトルデータ、DNAスペクトルデータ、MRI画像データ、X線CT画像データなどであり、他方、対象データは、例えば、上記各種のスペクトルデータを縮約することで得られる所定個数の変数値からなるヒストグラムデータである。各サンプルの実態データと対象データは、この多変量解析装置に記憶される。 In the original data processing 102, based on the input original data of each sample, data for visually confirming the actual state represented by the original data (hereinafter referred to as “actual data”), multivariate analysis Direct target data (hereinafter referred to as “target data”) is generated. The actual data is, for example, NMR spectrum data, MS spectrum data, DNA spectrum data, MRI image data, X-ray CT image data, etc. On the other hand, the target data is, for example, by reducing the above-mentioned various spectrum data. This is histogram data composed of a predetermined number of variable values obtained. The actual data and target data of each sample are stored in this multivariate analyzer.

元データ処理１０２の具体的な処理としては、公知のものが使用できる。例えば、元データが、NMR装置から得られたFID（自由誘導減衰）データである場合、例えば、フーリエ変換によりFIDデータを所定形式の実態データとしてのNMRスペクトルデータに変換する処理、及び、バケット積分によりNMRスペクトルデータを所定形式の対象データとしてのNMRヒストグラムデータに縮約する処理などがある。また、例えば、元データがMS装置又はＤＮＡアレイ装置から得られたスペクトルデータである場合、そのスペクトルデータを所定形式の実態データとしてのスペクトルデータに変換する処理、そのスペクトルデータを所定形式の対象データとしてのヒストグラムデータに縮約する処理などがある。また、例えば、元データがMRI装置から得られたMRI画像及びそのMRI画像中の多数のボクセルにそれぞれ対応する多数のCSIスペクトルデータ（NMRスペクトルデータ）のセットである場合、そのMRI画像を所定形式の実態データとしてのMRI画像に変換する処理、及びボクセル毎のCSIスペクトルデータを、バケット積分により所定形式の対象データとしてのCSIヒストグラムデータに変換する処理などがある。 As a specific process of the original data process 102, a known process can be used. For example, when the original data is FID (free induction decay) data obtained from an NMR apparatus, for example, processing for converting FID data into NMR spectrum data as actual data in a predetermined format by Fourier transform, and bucket integration To reduce the NMR spectrum data to NMR histogram data as target data in a predetermined format. In addition, for example, when the original data is spectrum data obtained from an MS device or a DNA array device, a process of converting the spectrum data into spectrum data as actual data in a predetermined format, and the spectrum data as target data in a predetermined format There is a process of reducing the data to histogram data. For example, when the original data is a set of an MRI image obtained from an MRI apparatus and a large number of CSI spectrum data (NMR spectrum data) corresponding to a large number of voxels in the MRI image, the MRI image is converted into a predetermined format. There is a process of converting into an MRI image as actual data and a process of converting CSI spectrum data for each voxel into CSI histogram data as target data in a predetermined format by bucket integration.

多変量解析処理１０４では、データ処理１０２で得られた多数のサンプル（サンプル群）の対象データ群（ヒストグラムデータ群）を用いて多変量解析が行われ、その解析結果がこのディスプレイスクリーンに表示される。多変量解析の具体的な手法としては、PCA（主成分）解析やSIMCA解析など公知のものが使用できる。例えば、後に具体的に説明するように、主成分解析を行ない、その解析結果に基づいて、サンプル・プロット・チャート、変数プロット・チャート、寄与率チャート、ローディング・チャート、及びスペクトル・チャートなどを作成し、それらをこの多変量解析装置のディスプレイスクリーンに表示するという作業を行うことができる。 In the multivariate analysis processing 104, multivariate analysis is performed using the target data group (histogram data group) of a large number of samples (sample groups) obtained in the data processing 102, and the analysis result is displayed on this display screen. The As a specific method of multivariate analysis, known methods such as PCA (principal component) analysis and SIMCA analysis can be used. For example, as will be described in detail later, a principal component analysis is performed, and based on the analysis results, a sample plot chart, a variable plot chart, a contribution ratio chart, a loading chart, a spectrum chart, etc. are created. Then, an operation of displaying them on the display screen of the multivariate analyzer can be performed.

特徴抽出処理１０６では、多変量解析処理１０４で得られた解析結果に基づいて、特徴抽出分析が行われる。この特徴抽出処理１０６は、後に詳細に説明する。 In the feature extraction processing 106, feature extraction analysis is performed based on the analysis result obtained in the multivariate analysis processing 104. This feature extraction processing 106 will be described in detail later.

本実施形態では、上述した処理１００〜１０６の全てを、一つのコンピュータプログラムが行う。とりわけ、元データ処理１０２と多変量解析処理１０４とを一つのコンピュータプログラムが行うことにより、多変量解析処理１０４による多変量解析結果と、元データ処理１０２により記憶された実態データ（例えば、上述した各種のスペクトルデータやMRI画像データなど）との間にリンクを設定することができ、それにより、例えば、PCA解析を行った場合、その結果として表示されたサンプル・プロット・チャート上の或るサンプルをユーザが指定することで、そのサンプルの実態データ（例えば、NMRスペクトルデータ）を読み出して、その実態データの画像（例えば、スペクトル・チャート）をサンプル・プロット・チャートと一緒に表示するというように、ユーザは、同じディスプレイスクリーン上で、多変量解析結果と同時にその解析結果に関連する所望のサンプルの実態データ（つまり、そのサンプルの元データの実質内容）を視覚で確認することができる。 In the present embodiment, a single computer program performs all of the processes 100 to 106 described above. In particular, the original data processing 102 and the multivariate analysis processing 104 are performed by a single computer program, so that the multivariate analysis result by the multivariate analysis processing 104 and the actual data stored by the original data processing 102 (for example, as described above) Link to various spectral data, MRI image data, etc.), for example, when a PCA analysis is performed, a sample on the displayed sample plot chart When the user designates, the actual data of the sample (for example, NMR spectrum data) is read, and the image of the actual data (for example, the spectrum chart) is displayed together with the sample / plot chart, etc. , The user can simultaneously display the multivariate analysis results on the same display screen. Actual data of the desired samples associated with analysis results (i.e., real contents of the original data of the sample) can be confirmed by visually.

従って、本実施形態に従う多変量解析装置は、「混合物系」サンプルの解析の全般に適しており、例えば、代謝物診断、毒性予測、マーカー検索創薬、コンビケム、食品、ポリマーなどの種々の分野・市場で利用可能である。さらに、それ以外の解析目的にも、本実施形態に従う多変量解析装置は、適用可能である。 Therefore, the multivariate analysis apparatus according to the present embodiment is suitable for the overall analysis of “mixture-based” samples. For example, various fields such as metabolite diagnosis, toxicity prediction, marker search drug discovery, combichem, food, polymer, etc. -Available in the market. Furthermore, the multivariate analyzer according to the present embodiment can be applied to other analysis purposes.

以下では、医学や生命科学等の研究分野における生命データの解析に使用される場合を例に挙げて、本実施形態に従う多変量解析装置の構成と機能について、より詳細に説明する。 In the following, the configuration and function of the multivariate analyzer according to the present embodiment will be described in more detail by taking as an example a case where it is used for analysis of life data in research fields such as medicine and life science.

この多変量解析装置がもつ基本機能は、次のとおりである。すなわち、この多変量解析装置は、任意の種類の生命データを入力することができ、入力された生命データに関して、多変量解析を主たるツールとして用いて複数種類の特徴抽出分析（図１の特徴抽出処理１０６）を行うことができる。その複数種類の特徴分析抽出には、
(１) １事象のデータの解析結果に基づく特徴変数及び相関の検出、
(２) 複数事象のデータの解析結果に基づく特徴変数及び相関の検出、及び
(３) 異構造のデータの解析結果に基づく特徴変数及び相関の検出
がある。 The basic functions of this multivariate analyzer are as follows. In other words, this multivariate analysis apparatus can input any kind of life data, and with respect to the input life data, a multivariate analysis is used as a main tool, and a plurality of types of feature extraction analysis (feature extraction of FIG. 1) is performed. Processing 106) can be performed. For the multiple types of feature analysis extraction,
(1) Detection of feature variables and correlations based on analysis results of one event data,
(2) Detection of feature variables and correlations based on analysis results of multiple event data, and
(3) There is detection of feature variables and correlations based on the analysis results of data with different structures.

図１に示した参照番号１００〜１０６の全ての処理に対するユーザによる操作及びユーザへの表示は、１つのディスプレイスクリーン上に表示された１つのユーザーインターフェイスを用いて実現される。 The user's operation and display to the user for all the processes of reference numbers 100 to 106 shown in FIG. 1 are realized by using one user interface displayed on one display screen.

図２は、この多変量解析装置の基本的な構成を示す。 FIG. 2 shows the basic configuration of this multivariate analyzer.

図２に示すように、この多変量解析装置１１０は、計算・制御モジュール１１２と表示モジュール１１４を有する。計算・制御モジュール１１２は、図１に示したような処理を行う。表示モジュール１１４は、グラフィカルなユーザインタフェースを作成してそれをディスプレイスクリーンに表示し、計算・制御モジュール１１２の処理結果を、そのユーザインタフェース上に表示する。計算・制御モジュール１１２は、表示されたグラフィカルなユーザインタフェースに対するマウスやキーボードによるユーザの操作によって入力される要求１１５に応答して、要求され処理を行う。 As shown in FIG. 2, the multivariate analyzer 110 includes a calculation / control module 112 and a display module 114. The calculation / control module 112 performs processing as shown in FIG. The display module 114 creates a graphical user interface and displays it on the display screen, and displays the processing result of the calculation / control module 112 on the user interface. The calculation / control module 112 performs the requested processing in response to a request 115 input by a user operation with a mouse or a keyboard on the displayed graphical user interface.

計算・制御モジュール１１２は、図１に示された元データ入力処理１００及び元データ処理１０２を行ったとき、それらの処理で得られた各種の実態データ（例えば、NMRスペクトルデータ、MSスペクトルデータ、DNAスペクトルデータ、CSIスペクトルデータ、及びMRI画像データ等）１１６、１１８を、多変量で解析装置１１０内の記憶装置に保存する。図２には、２種類の実態データ１１６、１１８が示されているが、その一方の種類の実態データ１１６は、所定の標準的なデータ構造（例えば、スペクトルデータという構造）をもつもの（例えば、NMRスペクトルデータ、MSスペクトルデータ、DNAスペクトルデータ、CSIスペクトルデータなど）を指し、他方の種類の実態データ１１８は、それ以外のデータ構造をもつもの（例えば、MRI画像データ、X線CT画像データなど）を指している。この実施形態では、一つの例示として、１サンプルの１事象の実態データが、１ファイル（図２において、参照番号１１６又は１１８が付された１つのブロック）として管理される。例えば、NMR装置で１サンプルを検査して得られたMNRスペクトルデータが、１ファイルとして管理される。 When the original data input process 100 and the original data process 102 shown in FIG. 1 are performed, the calculation / control module 112 performs various actual data (for example, NMR spectrum data, MS spectrum data, (DNA spectrum data, CSI spectrum data, MRI image data, etc.) 116 and 118 are stored in a storage device in the analysis device 110 in a multivariate manner. FIG. 2 shows two types of actual data 116 and 118. One type of actual data 116 has a predetermined standard data structure (for example, a structure called spectral data) (for example, a structure of spectral data). , NMR spectral data, MS spectral data, DNA spectral data, CSI spectral data, etc., and the other type of actual data 118 has other data structures (for example, MRI image data, X-ray CT image data) Etc.). In this embodiment, as one example, actual data of one event of one sample is managed as one file (in FIG. 2, one block given the reference number 116 or 118). For example, MNR spectrum data obtained by examining one sample with an NMR apparatus is managed as one file.

さらに、計算・制御モジュール１１２は、図１に示された元データ処理１０２を行ったとき、１以上の解析テーブル１２０を作成し、多変量解析装置１１０内の記憶装置に保存する。この実施形態では、一つの例示として、１事象について１つの解析テーブル１２０が作成される。例えば、NMRについて１つの解析テーブル１２０が作成され、MSについて別の一つの解析テーブル１２０が作成される。各解析テーブル１２０には、対応する１つの事象に関する対象データ群が登録されている。計算・制御モジュール１１２は、解析テーブル１２０を用いて、その基本的な解析動作を行なう。 Further, the calculation / control module 112 creates one or more analysis tables 120 when the original data processing 102 shown in FIG. 1 is performed, and stores them in the storage device in the multivariate analysis apparatus 110. In this embodiment, as an example, one analysis table 120 is created for one event. For example, one analysis table 120 is created for NMR and another analysis table 120 is created for MS. In each analysis table 120, a target data group related to one corresponding event is registered. The calculation / control module 112 performs a basic analysis operation using the analysis table 120.

図２中で点線矢印で示すように、各解析テーブル１２０に登録された個々の対象データと、それに対応する個々の実態データ１１６又は１１８とは、各解析テーブル１２０内に記述されるリンク情報（例えば、フォルダ名やファイル名などのディレクトリ名、或いは、識別符号のようなキーワードなど）により関連づけられている。計算・制御モジュール１１２は、或る解析テーブル１２０にアクセスしているとき、その解析テーブル１２０に登録された個々の対象データのリンク情報を用いて、個々の対象データに対応する個々の実態データ１１６又は１１８にアクセスすることができる。 As indicated by dotted arrows in FIG. 2, each target data registered in each analysis table 120 and each corresponding actual data 116 or 118 are linked information ( (For example, directory names such as folder names and file names, or keywords such as identification codes). When accessing the certain analysis table 120, the calculation / control module 112 uses the link information of the individual target data registered in the analysis table 120 to use the individual actual data 116 corresponding to the individual target data. Or 118 can be accessed.

また、複数の解析テーブル１２０間において、共通性のあるサンプル(例えば、同じサンプル、又は同じサンプルソースから得られた異なるサンプル（例えば同じ人間から得られた血液サンプルと尿サンプル)）の対象データ同士も、それぞれに同じキーワードを割り当てることによって、相互に関連づけられている。計算・制御モジュール１１２は、複数の解析テーブル１２０にアクセスしているとき、同じキーワードが付された対象データをそれぞれの解析テーブル１２０から検索することで、互いに共通性のあるサンプルの対象データを見つけ出すことができる。 In addition, between the plurality of analysis tables 120, the target data of the common sample (for example, the same sample or different samples obtained from the same sample source (for example, blood sample and urine sample obtained from the same person)) Are also associated with each other by assigning the same keyword to each. When accessing the plurality of analysis tables 120, the calculation / control module 112 searches the target data with the same keyword from the respective analysis tables 120 to find sample data that are common to each other. be able to.

計算・制御モジュール１１２は、ユーザインタフェースからの要求１１５に応答して、要求された事象の解析テーブル１２０に記述された対象データ群を用いて、図１に示された多変量解析処理１０４と特徴抽出処理１０６を行ことができる。表示モジュール１１４は、計算・制御モジュール１１２による多変量解析処理１０４の結果（例えば、サンプル・プロット・チャート、変数プロット・チャート、スペクトル・チャート、寄与率チャート及びローディング・チャートなど）及び特徴抽出処理１０６の結果（例えば、特徴変数や相関）を、ディスプレイスクリーン上に視覚的に表示する。 In response to the request 115 from the user interface, the calculation / control module 112 uses the target data group described in the analysis table 120 of the requested event, and features the multivariate analysis processing 104 shown in FIG. An extraction process 106 can be performed. The display module 114 is a result of the multivariate analysis process 104 by the calculation / control module 112 (for example, a sample plot chart, a variable plot chart, a spectrum chart, a contribution chart, a loading chart, etc.) and a feature extraction process 106. (Eg, feature variables and correlations) are visually displayed on the display screen.

図３は、解析テーブル１２０のフォーマットを示す。 FIG. 3 shows the format of the analysis table 120.

図３に示された解析テーブル１２０において、インデックス・コメント欄１２２には、この解析テーブル１２０に登録されたデータ群についてのモデル名称や任意コメントが記載される。 In the analysis table 120 shown in FIG. 3, the index / comment column 122 describes model names and arbitrary comments about the data group registered in the analysis table 120.

データ処理条件欄１２４には、多変量解析の対象となる及び解析の対象外となる変数の範囲（例えば、NMRヒストグラムの場合のケミカルシフトの範囲）などの解析条件が記述される。 In the data processing condition column 124, analysis conditions such as a range of variables to be subjected to multivariate analysis and a range not to be analyzed (for example, a chemical shift range in the case of an NMR histogram) are described.

データ解析条件欄１２６では、「PCA」フィールドにPCA解析におけるモデル次数、「SIMCA」フィールドにSIMCA解析におけるモデル次数、「X」フィールドにX軸に表示するPC（主成分）軸の次数、「Y」フィールドにY軸に表示するPC軸の次数、「Mean_centering」フィールドにミーン・センタリング処理の有無、及び「Auto_scaling」フィールドにオート・スケーリング処理の有無が記述される。 In the data analysis condition column 126, the “PCA” field has the model order in the PCA analysis, the “SIMCA” field has the model order in the SIMCA analysis, the “X” field has the order of the PC (principal component) axis displayed on the X axis, “Y "Field describes the degree of the PC axis displayed on the Y axis," Mean_centering "field indicates whether or not mean centering processing is performed, and" Auto_scaling "field indicates whether or not auto scaling processing is performed.

データ・サイズ欄１２８では、「Sample」フィールドに、この解析テーブル１２０に登録されたサンプル（対象データ）の総数、「Variable」欄に、対象データ群がもつ変数の総数が記述される。 In the data size column 128, the total number of samples (target data) registered in the analysis table 120 is described in the “Sample” field, and the total number of variables of the target data group is described in the “Variable” column.

データ・スケール欄１３０には、この解析テーブル１２０に登録された対象データ群がもつ各変数のスケール（バケット積分のバケットの範囲）が記述される。 In the data scale column 130, the scale of each variable (the bucket range of bucket integration) included in the target data group registered in the analysis table 120 is described.

実態データ・ファイル欄１３２には、この解析テーブル１２０に登録された各対象データに対応する実態データのディレクトリ（保存先のフォルダー名とファイル名）が記述される。 In the actual data / file column 132, the actual data directory (save folder name and file name) corresponding to each target data registered in the analysis table 120 is described.

プロット・タイプ欄１３４には、ディスプレイスクリーン（例えば、PCA解析結果のサンプル・プロット・チャート）上に各サンプル（対象データ）のマーク（ドット）を表示するときのそのマークの形状が記述され、プロット・カラー欄１３６には、そのマークの表示カラーが記述される。 The plot type field 134 describes the shape of the mark (dot) when displaying the mark (dot) of each sample (target data) on the display screen (for example, the sample plot chart of the PCA analysis result). In the color column 136, the display color of the mark is described.

キーワード欄１３８には、この解析テーブル１２０に登録された各サンプル（対象データ）を識別するためのキーワードが記述される。 The keyword column 138 describes a keyword for identifying each sample (target data) registered in the analysis table 120.

付加データ欄１４０には、この解析テーブル１２０に登録された各サンプル（対象データ）に関する付加データ（例えば、別途に入手されたそのサンプルについての各種の生化学データ（体重、血圧など））が記述される。 In the additional data column 140, additional data relating to each sample (target data) registered in the analysis table 120 (for example, various biochemical data (weight, blood pressure, etc.) about the sample obtained separately) is described. Is done.

プロット・ステータス欄１４２には、この解析テーブル１２０に登録された各サンプル（対象データ）の状態（例えば、「0」は通常、「1」は削除を意味する）が記述される。 The plot status column 142 describes the state of each sample (target data) registered in the analysis table 120 (for example, “0” normally means “1” means deletion).

対象データ欄１４４には、この解析テーブル１２０に登録された対象データ（変数値の列であるヒストグラムデータ）が記述される。図３では、１行が１サンプルの対象データに対応する。 In the target data column 144, target data (histogram data which is a column of variable values) registered in the analysis table 120 is described. In FIG. 3, one row corresponds to one sample of target data.

次に、この多変量解析装置１１０がディスプレイスクリーンに表示するグラフィカルなユーザインタフェース（GUI）の構成について説明する。 Next, the configuration of a graphical user interface (GUI) displayed on the display screen by the multivariate analyzer 110 will be described.

前述したように、この多変量解析装置１１０が行う複数種類の特徴分析抽出には、
(１) １事象のデータの解析結果に基づく特徴変数及び相関の検出、
(２) 複数事象のデータの解析結果に基づく特徴変数及び相関の検出、及び
(３) 異構造のデータの解析結果に基づく特徴変数及び相関の検出
がある。 As described above, the multiple types of feature analysis extraction performed by the multivariate analysis apparatus 110 include:
(1) Detection of feature variables and correlations based on analysis results of one event data,
(2) Detection of feature variables and correlations based on analysis results of multiple event data, and
(3) There is detection of feature variables and correlations based on the analysis results of data with different structures.

図４は、「１事象のデータの解析結果に基づく特徴変数及び相関の検出」を行う場合のGUIの例を示し、図５は、「複数事象のデータの解析結果に基づく特徴変数及び相関の検出」を行う場合のGUIの例を示し、図６は、「異構造のデータの解析結果に基づく特徴変数及び相関の検出」を行う場合のGUIの例を示す。いずれの例も、変量解析としてPCA解析を行った場合である。 FIG. 4 shows an example of a GUI in the case of performing “detection of feature variable and correlation based on analysis result of one event data”, and FIG. FIG. 6 shows an example of a GUI in the case of performing “detection of feature variables and correlations based on the analysis result of data having different structures”. In either case, PCA analysis is performed as a variable analysis.

図４〜図６に示すように、どの場合にも、GUIは複数のウィンドウを有している。どの場合にも表示される共通のウィンドウは、ユーザがこの多変量解析装置１１０に対して要求を入力するためのコントロールパネル１５０である。その他のウィンドウには、以下に具体的に説明するように、多変量解析によって判明したサンプル群の特性（例えば、PCA解析を行った場合、それぞれのサンプルのそれぞれのPC（主成分）に対するスコア）及び変数の特性（例えば、PCA解析を行った場合、それぞれ変数のそれぞれのPC（主成分）に対するろーーディング及び寄与率）をそれぞれ図示したチャートや、特定のサンプルのスペクトルを図示したチャートなどが表示される。これらのチャートはいずれも、図２に示された計算・制御モジュール１１２により作成され、表示モジュール１１４によりGUI上の複数のウィンドウに表示される。これらのチャートのどれを表示し、どれを表示しないかを、ユーザが選択することができる。 As shown in FIGS. 4 to 6, in any case, the GUI has a plurality of windows. A common window displayed in any case is a control panel 150 for a user to input a request to the multivariate analyzer 110. In the other windows, as will be described in detail below, the characteristics of the sample group determined by multivariate analysis (for example, if PCA analysis is performed, the score for each PC (principal component) of each sample) And charts showing the characteristics of the variables (for example, when PCA analysis is performed, the charts illustrating the loading and contribution ratio of each variable to each PC (principal component)) and the charts illustrating the spectra of specific samples Is done. All of these charts are created by the calculation / control module 112 shown in FIG. 2 and displayed in a plurality of windows on the GUI by the display module 114. The user can select which of these charts are displayed and which are not displayed.

図４に示すように、PCA解析を用いて「１事象のデータの解析結果に基づく特徴変数及び相関の検出」を行った場合、GUIに含まれるウィンドウには、例えば、サンプル・プロット・ウィンドウ１５２、変数プロット・ウィンドウ１５４、寄与率チャート・ウィンドウ１５６、ローディング・チャート・ウィンドウ１５８、スペクトル・チャート・ウィンドウ１６０及びビュー・ナビゲータ・ウィンドウ１６２などがある。 As shown in FIG. 4, when “detection of feature variables and correlations based on the analysis result of one event data” is performed using PCA analysis, a window included in the GUI includes, for example, a sample plot window 152 Variable plot window 154, contribution chart window 156, loading chart window 158, spectrum chart window 160, and view navigator window 162.

サンプル・プロット・ウィンドウ１５２には、PCA解析の結果として作成されるサンプル・プロット・チャートが表示される。サンプル・プロット・チャートとは、ユーザにより指定された１個以上３個以下のPC(主成分)にそれぞれ対応する座標軸（以下、PC軸という）から構成される座標空間（図示の例は、３本のPC軸による３次元空間）において、それぞれのサンプルが、上記１個以上３個以下のPCについてのスコアに応じた座標にプロットされているチャートである。図４に例示したような３次元のサンプル・プロット・チャートが表示される場合には、この多変量解析装置１１０は、その３次元のサンプル・プロット・チャートを複数のPC軸回りに回転させて、ユーザがその3次元空間サンプル・プロット・チャートをあらゆる視線方向から観察することを可能にする。それにより、ユーザは、例えば、サンプル群全体がいくつかのサブ群に分かれているというような、サンプルの分布の様子を的確に把握することができる。 The sample plot window 152 displays a sample plot chart created as a result of the PCA analysis. A sample plot chart is a coordinate space composed of coordinate axes (hereinafter referred to as PC axes) respectively corresponding to one or more and three or less PCs (principal components) specified by the user (in the example shown, 3 In the three-dimensional space of a PC axis), each sample is a chart in which coordinates are plotted at coordinates according to the scores for the above-described one or more and three or less PCs. When a three-dimensional sample / plot chart as illustrated in FIG. 4 is displayed, the multivariate analyzer 110 rotates the three-dimensional sample / plot chart around a plurality of PC axes. , Allowing the user to view the 3D spatial sample plot chart from any line of sight. Thereby, the user can grasp | ascertain exactly the mode of distribution of a sample that the whole sample group is divided into some subgroups, for example.

変数プロット・ウィンドウ１５４には、PCA解析の結果として作成される変数プロット・チャートが表示される。変数プロット・チャートとは、ユーザにより指定された１個以上３個以下のPCにそれぞれ対応する座標軸（以下、PC軸という）から構成される座標空間（図示の例は、２本のPC軸による２次元空間）において、それぞれの変数が、上記１個以上３個以下のPCについてのローディングに応じた座標にプロットされているチャートである。 The variable plot window 154 displays a variable plot chart created as a result of the PCA analysis. A variable plot chart is a coordinate space composed of coordinate axes (hereinafter referred to as PC axes) corresponding to one or more and three or less PCs designated by the user (the example shown is composed of two PC axes). In the (two-dimensional space), each variable is a chart in which coordinates are plotted at coordinates corresponding to the loading of the above one or more and three or less PCs.

寄与率チャート・ウィンドウ１５６には、PCA解析の結果として作成される寄与率チャートが表示される。寄与率チャートとは、それぞれの変数の、ユーザ指定された１個以上３個以下のPCについての寄与率を、横軸が変数に対応し縦軸が寄与率に対応するグラフ（例えば棒グラフ）の形で表したチャートである。 The contribution rate chart window 156 displays a contribution rate chart created as a result of the PCA analysis. The contribution rate chart is a graph (for example, a bar graph) in which the horizontal axis corresponds to the variable and the vertical axis corresponds to the contribution rate for each of the variables of 1 to 3 PCs specified by the user. It is a chart expressed in shape.

ローディング・チャート・ウィンドウ１５８には、PCA解析の結果として作成されるローディング・チャートが表示される。ローディング・チャートとは、それぞれの変数の、ユーザ指定された１個以上３個以下のPCについてのローディングの絶対値と残差成分との和を、横軸が変数に対応し縦軸がローディングに対応したグラフ（例えば棒グラフ）の形で表したチャートである。 The loading chart window 158 displays a loading chart created as a result of PCA analysis. The loading chart is the sum of the absolute value and the residual component of the loading for each variable of 1 to 3 PCs specified by the user, with the horizontal axis corresponding to the variable and the vertical axis representing loading. It is a chart represented in the form of a corresponding graph (for example, a bar graph).

スペクトル・チャート・ウィンドウ１６０には、スペクトル・チャートが表示される。スペクトル・チャートとは、１以上のサンプルの実態データ（スペクトル）を、横軸が変数に対応し縦軸が変数値に対応したグラフ（例えば折れ線グラフ）の形で表したチャートである。 In the spectrum chart window 160, a spectrum chart is displayed. A spectrum chart is a chart representing actual data (spectrum) of one or more samples in the form of a graph (for example, a line graph) in which the horizontal axis corresponds to a variable and the vertical axis corresponds to a variable value.

寄与率チャート・ウィンドウ１５６とローディング・チャート・ウィンドウ１５８とスペクトル・チャート・ウィンドウ１６０は、互いに横軸のスケールが一致するようにして配置される。 The contribution rate chart window 156, the loading chart window 158, and the spectrum chart window 160 are arranged so that the scales on the horizontal axis coincide with each other.

ビュー・ナビゲータ・ウィンドウ１６２には、寄与率チャート１５６とローディング・チャート１５８とスペクトル・チャート１６０に現在表示されている横軸の範囲（変数の範囲）が、全体範囲内のどの部分に該当するかを、スペクトル・チャート１６０に表示されたスペクトルの一つの全体像の中で特定して示したビュー・ナビゲータが、表示される。 In the view navigator window 162, to which part of the entire range the horizontal axis range (variable range) currently displayed in the contribution rate chart 156, loading chart 158, and spectrum chart 160 corresponds. Is displayed in the entire view of the spectrum displayed on the spectrum chart 160.

図５に示すように、PCA解析を用いて「複数事象のデータの解析結果に基づく特徴変数及び相関の検出」を行った場合には、各事象について、図４に示されたウィンドウと同種のウィンドウの全部または一部が表示される。 As shown in FIG. 5, when “detection of feature variables and correlations based on the analysis result of data of multiple events” is performed using PCA analysis, each event has the same type as the window shown in FIG. 4. All or part of the window is displayed.

例えば、図５に示されたGUI例は、同じサンプル群又は異なるサンプル群の３種の事象（A、B、C）のデータを解析した場合に対応し、そこでは、各事象について、一例として、サンプル・プロット・ウィンドウ１５２（１５２A、B、C）と寄与率チャート・ウィンドウ１５６（１５６A、B、C）とスペクトル・チャート・ウィンドウ１６０（１６０A、B、C）とビュー・ナビゲータ・ウィンドウ１６２（１６２A、B、C）が表示されている。どのような種類のウィンドウのセットを表示するかは、ユーザが選択できる。 For example, the GUI example shown in FIG. 5 corresponds to the case where the data of three types of events (A, B, C) of the same sample group or different sample groups is analyzed. Sample plot window 152 (152A, B, C), contribution chart window 156 (156A, B, C), spectrum chart window 160 (160A, B, C), and view navigator window 162 ( 162A, B, C) are displayed. The user can select what type of window set to display.

図５の例から分かるように、全ての事象について、同じ種類のウィンドウが表示され、それらが同じ位置関係で配置される。同じサンプル群について異なる事象のデータが解析された場合、スペクトル・チャート・ウィンドウ１６０（１６０A、B、C）には、ユーザ選択された１以上の同じサンプルについての異なる事象の実態データを表示することができる。ユーザにとり、複数の事象の解析結果や実態データを対比することが可能であり、それにより、複数の事象間の相関関係を見つけたり、複数の事象の視点から総合的又は相補的に判断して確度の高いマーカを見つけ出したりすることが容易になる。 As can be seen from the example of FIG. 5, the same type of windows are displayed for all the events, and they are arranged in the same positional relationship. When different event data is analyzed for the same sample group, the spectrum chart window 160 (160A, B, C) displays actual data of different events for one or more of the same samples selected by the user. Can do. For users, it is possible to compare the analysis results and actual data of multiple events, thereby finding the correlation between multiple events, or judging comprehensively or complementarily from the viewpoint of multiple events It becomes easy to find a marker with high accuracy.

図６に示すように、「異構造のデータの解析結果に基づく特徴変数及び相関の検出」を行った場合には、多変量解析の対象となった標準構造のデータ（例えば、スペクトルデータ）についての多変量解析結果（図４に示されたウィンドウと同種のウィンドウの全部又は一部）と、多変量解析の対象とならなかった別構造の実態データ（例えば画像データ）の内容を表示したウィンドウとが表示される。 As shown in FIG. 6, when “detection of feature variables and correlations based on the analysis result of data of different structure” is performed, data of a standard structure (for example, spectral data) subjected to multivariate analysis is obtained. A window displaying the results of multivariate analysis (all or part of the same type of window as shown in FIG. 4) and the contents of actual data (for example, image data) of another structure that was not subject to multivariate analysis Is displayed.

例えば、図６に示されたGUI例は、MRI画像データと、そのMRI画像内の多数のボクセル（個々のボクセルがサンプルとして取り扱われる）のCSIスペクトルデータとを解析した場合に対応し、そこには、MRI画像データを表示した別構造データ・ウィンドウ１６４と、多数のボクセルのCSIスペクトルデータを多変量解析（例えばPCA解析）した結果としてのサンプル・プロット・ウィンドウ１５２、変数プロット・ウィンドウ１５４、寄与率チャート・ウィンドウ１５６、ローディング・チャート・ウィンドウ１５８、スペクトル・チャート・ウィンドウ１６０及びビュー・ナビゲータ・ウィンドウ１６２が表示される。 For example, the GUI example shown in FIG. 6 corresponds to the case where MRI image data and CSI spectrum data of a large number of voxels (each voxel is handled as a sample) in the MRI image are analyzed. Shows a separate structure data window 164 displaying MRI image data, a sample plot window 152 as a result of multivariate analysis (for example, PCA analysis) of CSI spectrum data of many voxels, a variable plot window 154, contribution A rate chart window 156, a loading chart window 158, a spectrum chart window 160, and a view navigator window 162 are displayed.

別構造データ・ウィンドウ１６４に表示されたMRI画像データ中の任意の場所をユーザがポインティングデバイスで指定すると、その指定された場所に対応するボクセル（サンプル）のCSIスペクトルデータが、スペクトル・チャート・ウィンドウ１６０に表示される。 When the user designates an arbitrary location in the MRI image data displayed in the separate structure data window 164 with the pointing device, the CSI spectrum data of the voxel (sample) corresponding to the designated location is displayed in the spectrum chart window. 160.

ユーザにとり、異なる構造のデータ間の相関関係を見つけたり、異なる構造のデータの視点から総合的又は相補的に判断して確度の高いマーカを見つけ出したりすることが容易になる。 It becomes easy for the user to find a correlation between data having different structures, or to find a marker with high accuracy by comprehensively or complementarily judging from the viewpoint of data having different structures.

次に、上述した３種類の特徴抽出機能、すなわち
(１) 「１事象のデータの解析結果に基づく特徴変数及び相関の検出」機能、
(２) 「複数事象のデータの解析結果に基づく特徴変数及び相関の検出」機能、及び
(３) 「異構造のデータの解析結果に基づく特徴変数及び相関の検出」機能
各々について、より具体的に説明する。 Next, the three types of feature extraction functions described above, that is,
(1) “Detection of feature variables and correlations based on the analysis result of one event data” function,
(2) “Detection of feature variables and correlations based on analysis results of multiple event data” function, and
(3) Each of the “detection of feature variables and correlations based on the analysis result of data of different structure” functions will be described more specifically.

１「１事象のデータの解析結果に基づく特徴変数及び相関の検出」機能
この機能には、複数のサブ機能が含まれ、各サブ機能は、コントロール・パネル１５０へのユーザの操作の仕方に応じて選択される。それらサブ機能には、「特定サンプルの特徴変数の抽出によるマーカ変数の検出」、「指定マーカ変数による特徴サンプルの検出」、「分析データと付加データとの相補的な解析＃１」、「分析データと付加データとの相補的な解析＃２」などがある。以下、各サブ機能について説明する。 1 “Detection of Feature Variables and Correlation Based on Analysis Result of 1 Event Data” Function This function includes a plurality of sub-functions, and each sub-function depends on how the user operates the control panel 150. Selected. These sub-functions include “detection of marker variable by extracting feature variable of specific sample”, “detection of feature sample by specified marker variable”, “complementary analysis # 1 of analysis data and additional data”, “analysis Complementary analysis # 2 of data and additional data ”. Hereinafter, each sub-function will be described.

１．１「特定サンプルの特徴変数の抽出によるマーカ変数の検出」サブ機能
多変量解析などの統計的手法により、解析されたサンプル群全体の傾向を見ることが可能である。もし、このサンプル群全体が複数のサブ群に分れているようであるならば、ユーザが考える解析プロセスの次の課題の一つは、「分割されたサンプルサブ群毎に、どのような特徴があるのか、つまり、特にどの変数がそのサブ群を特徴づけるのに寄与しているのか？」であろう。この課題を解決するために、このサブ機能は、ユーザが任意に指定したサンプルのサブ群のデータと、その他のサンプルのデータとを比較して、指定されたサンプルサブ群を他のサンプルから分離するのに寄与している可能性のある幾つかの特徴変数を抽出し、更に寄与率を用いてノイズ情報を除去して、より確度の高い特徴変数を抽出する。このサブ機能により、指定されたサンプルサブ群の特異的な特徴変数が導き出される。 1.1 “Detection of Marker Variables by Extracting Feature Variables of Specific Samples” Sub-Function It is possible to see the trend of the entire analyzed sample group by statistical methods such as multivariate analysis. If this sample group seems to be divided into multiple sub-groups, one of the next challenges of the analysis process considered by the user is “what characteristics for each divided sample sub-group? Is there, in particular, which variables contribute to characterize the subgroups? ” To solve this problem, this sub-function separates the specified sample sub-group from other samples by comparing the data of the sample sub-group specified by the user with the data of other samples. Some feature variables that may contribute to the extraction are extracted, noise information is removed using the contribution rate, and feature variables with higher accuracy are extracted. This subfunction derives specific feature variables for the specified sample subgroup.

図７は、このサブ機能の処理の流れを示し、図８は、このサブ機能の処理をGUIを用いて説明している。以下、図７に示す各ステップを、図８を一緒に参照しつつ説明する。 FIG. 7 shows the flow of processing of this sub-function, and FIG. 8 explains the processing of this sub-function using the GUI. Hereinafter, each step shown in FIG. 7 will be described with reference to FIG. 8 together.

１．１．１ステップ１７０：サンプルのサブ群の指定
多変量解析装置１１０は、或る事象の或るサンプル群の対象データ群を、対応する解析テーブル１２０から読み込み、多変量解析、例えばPCA解析、を行ない、そして、その解析結果を一つのGUI上に表示する。そのGUIにおけるそのサンプル群が表示されたウィンドウ、例えば図８に示すようなPCA解析で得られたサンプル・プロット・チャートを表したサンプル・プロット・ウィンドウ１５２中から、ユーザが、マウスなどのポインティング・デバイスを用いて、興味のある任意のサンプル（例えば、他のサンプルから分離されているサンプルのサブ群）１７８を指定する。すると、多変量解析装置１１０は、指定されたサンプルサブ群１７８を、サンプル・プロット・ウィンドウ１５２上でハイライト表示する。さらに、多変量解析装置１１０は、そのサンプル群の解析テーブル１２０を参照し、その解析テーブル１２０に記述されたリンクを用いて、指定されたサンプルの実態データ（スペクトルデータ）を読み出し、そして、図８に示すように、それらの実態データに基づいて指定されたサンプルのスペクトルを、スペクトル・ウィンドウ１６０に表示する。 1.1.1 Step 170: Designation of Sample Subgroup The multivariate analysis apparatus 110 reads a target data group of a sample group of a certain event from the corresponding analysis table 120, and performs multivariate analysis such as PCA analysis. , And display the analysis result on one GUI. From the window displaying the sample group in the GUI, for example, the sample plot window 152 representing the sample plot chart obtained by the PCA analysis as shown in FIG. The device is used to specify any sample of interest (eg, a sub-group of samples that are separated from other samples) 178. Then, the multivariate analyzer 110 highlights the designated sample subgroup 178 on the sample plot window 152. Further, the multivariate analysis apparatus 110 refers to the analysis table 120 of the sample group, reads the actual data (spectrum data) of the specified sample using the link described in the analysis table 120, and As shown in FIG. 8, the spectrum of the sample designated based on the actual data is displayed in the spectrum window 160.

１．１．２ステップ１７２：指定サンプルサブ群に共通する特徴変数の抽出
ステップ１７０で指定されたサンプルに共通する特徴的ないくつかの変数が、多変量解析装置１１０により計算される。ここで、或るサンプルに「特徴的な変数（特徴変数）」とは、そのサンプルがもつその変数の値が、他のサンプルがもつその変数の値と比べて、明らかに大きいもしくは小さいと判断される変数である。そのような変数の判別方法として、統計的ないろいろな手法が採用できるが、本実施形態では、以下に説明するような平均値を利用した方法を採用する。 1.1.2 Step 172: Extracting Characteristic Variables Common to the Specified Sample Subgroup Several characteristic variables common to the samples specified in Step 170 are calculated by the multivariate analyzer 110. Here, a “characteristic variable (characteristic variable)” of a certain sample is determined that the value of the variable that the sample has is clearly larger or smaller than the value of the variable that the other sample has. Variable. As a method for discriminating such variables, various statistical methods can be employed. In this embodiment, a method using an average value as described below is employed.

d次元の（つまりd個の変数をもつ）対象データをそれぞれもつｎ個のサンプル群は次のようにｎ×ｄの多変量データ行列Xとして表わすことができる。行列X内の各行が各サンプルの対象データに対応する。 A group of n samples each having target data of d dimensions (that is, having d variables) can be represented as an n × d multivariate data matrix X as follows. Each row in the matrix X corresponds to the target data of each sample.

この行列Xにおけるｋ列目（k番目の変数）のサンプル群全体での平均値x_{k_avg}は次式で示される。

An average value x _{k_avg} in the entire sample group of the k-th column (k-th variable) in the matrix X is _expressed by the following equation.

ここで、ｋ列目（k番目の変数）のサンプル群全体内での最小値をx_{k_min}、最大値をx_{k_max}とする。また、特徴的か否かを判別するために設定される閾値レベルを、それぞれx_{k_min_th}、x_{k_max_th}とする。このサンプル群におけるｋ列目（k番目の変数）の値の分布は、例えば図９Aまたは図９Bに示されるようになる。

Here, the minimum value in the entire sample group of the k-th column (k-th variable) is x _{k_min} , and the maximum value is x _{k_max} . In addition, threshold levels set to determine whether or not they are characteristic are x _{k_min_th} and x _{k_max_th} , respectively. The distribution of the values in the k-th column (k-th variable) in this sample group is as shown in FIG. 9A or FIG. 9B, for example.

特徴変数として抽出したい変数は、x_{k_min}とx_{k_min_th}の間の領域またはx_{k_max_th}とx_{k_max}との間の領域（これらの領域を以下「特徴領域」という）に入る値をもつものである。変数によって、その値の分布は、図９Aのように正規分布に近い形を示す場合もあるし、図９Bに例示するようにかなり偏った形を示す場合もある。特徴変数を抽出するということは、図９Bのように特徴領域に集中するグループに明確に分けられた分布を持つ変数を導き出すという意味がある。 Variable to be extracted as the feature variable are those having an area value falling (called "feature space" following these regions) between the region or x _{K_max_th} and x _{k_max} between x _{K_min} and _x k_min_th. Depending on the variable, the distribution of the values may show a shape close to a normal distribution as shown in FIG. 9A, or may show a considerably biased shape as shown in FIG. 9B. Extracting feature variables means that a variable having a distribution clearly divided into groups concentrated in the feature region as shown in FIG. 9B is derived.

図９Bに示すように、そのサンプル群中から指定されたサンプル（指定サンプルサブ群）１７８の全てに共通して、特定の変数が何れかの特徴領域に所属した場合、その変数は、その指定サンプルサブ群に共通する特徴変数といえるから、そのような特徴変数が抽出される。 As shown in FIG. 9B, when a specific variable belongs to any feature region in common to all of the samples (designated sample subgroups) 178 designated from the sample group, the variable is designated. Since it can be said that the feature variable is common to the sample subgroups, such a feature variable is extracted.

従って、指定サブ群の全てに共通してその変数がいずれかの特徴領域に入るか否かをチェックする処理を、行列Xの全行（全変数）について実行し、そのチェック結果がイエスであった変数の実を抽出することにより、その指定サンプルサブ群に共通する特徴変数が抽出される。 Therefore, the process of checking whether the variable falls into any feature region in common with all the specified subgroups is executed for all rows (all variables) of the matrix X, and the check result is yes. The characteristic variables common to the designated sample subgroup are extracted by extracting the actual variables.

１．１．３ステップ１７４：寄与率によるフィルタリング
このステップ１７４では、上記ステップ１７２で得られた特徴変数の各々について、その変数が、指定サンプルサブ群をそれ以外のサンプルから差別化するという点で本当に意味のある変数であるかどうかを、寄与率のような別のパラメータを用いて評価をする。特にノイズのようにランダムで偶発的に選択された変数を、事前に求められた寄与率などの事前パラメータを用いて検出し、それを除去する。具体的には、或る変数の寄与率があるレベル以下であれば、その変数を除去する。結果として、上記観点で本当に意味のある（より確度の高い）特徴変数のみが絞り込まれる。このような特徴変数をそれぞれ以下「マーカ変数」という。 1.1.3 Step 174: Filtering by Contribution In this step 174, for each of the feature variables obtained in step 172, the variable differentiates the specified sample subgroup from the other samples. Evaluate whether it is a really meaningful variable using another parameter such as contribution rate. In particular, a random and accidentally selected variable such as noise is detected using a prior parameter such as a contribution rate obtained in advance, and is removed. Specifically, if the contribution rate of a certain variable is below a certain level, that variable is removed. As a result, only feature variables that are really meaningful (more accurate) in terms of the above are narrowed down. Such characteristic variables are hereinafter referred to as “marker variables”.

１．１．４ステップ１７６：特徴変数の表示
このステップ１７６では、上記ステップ１７４で抽出されたマーカ変数を、GUI上で視覚的に識別できるようにハイライトして表示する。例えば、図８に例示するように、スペクトル・チャート・ウィンドウ１６０、寄与率チャート・ウィンドウ１５６、ローディング・チャート・ウィンドウ１５８を通じて、抽出されたマーカ変数に対してのみ所定カラーの帯１８０が付され、また、変数プロット・ウィンドウ１５４で、それらのマーカ変数が他の変数とは異なるカラーのドット１８２で表示される。それにより、ユーザはGUI上で、それらのマーカ変数を視覚的に容易に識別可能である。好ましくは、マーカ変数が、図９Aに示された最大値に近い（つまり、指定サンプルサブ群がもつその変数の値が、他のサンプルのそれより明らかに大きい）のか、もしくは最小値に近い（つまり、指定サンプルサブ群がもつその変数の値が、他のサンプルのそれより明らかに小さい）のかを、異なるカラーを使って区別して表示する（例えば、前者に付された帯１８０のカラーはオレンジ、後者に付された帯１８０のカラーはブルー）。これにより、ユーザは、指定サンプルサブ群の量的な特徴を視覚的に容易に把握できる。 1.1.4 Step 176: Display of Feature Variable In this step 176, the marker variable extracted in step 174 is highlighted and displayed so as to be visually identifiable on the GUI. For example, as illustrated in FIG. 8, a band 180 of a predetermined color is attached only to the extracted marker variable through the spectrum chart window 160, the contribution chart window 156, and the loading chart window 158. Also, in the variable plot window 154, the marker variables are displayed as dots 182 of a different color from other variables. Thereby, the user can easily identify those marker variables visually on the GUI. Preferably, the marker variable is close to the maximum value shown in FIG. 9A (that is, the value of that variable that the specified sample subgroup has is clearly greater than that of the other samples) or close to the minimum value ( In other words, whether the value of the variable of the specified sample subgroup is clearly smaller than that of the other samples is displayed using different colors (for example, the color of the band 180 attached to the former is orange) The color of the band 180 attached to the latter is blue). As a result, the user can easily visually grasp the quantitative characteristics of the designated sample subgroup.

１．２「指定マーカ変数による特徴サンプルの検出」サブ機能
前述の「特定サンプルの特徴変数の抽出によるマーカ変数の検出」サブ機能は、指定サンプルサブ群に共通するマーカ変数を探索する手法である。他方、「指定マーカの有意差検定によるサンプル特徴分布検出」サブ機能は、前述のサブ機能と逆のアプローチにより、指定されたマーカ変数がどのサンプルサブ群に特異的に影響を与えているかを探索する手法である。このサブ機能では、指定されたマーカ変数を寄与率によりフィルタリングし、その後、指定されたマーカ変数が特徴的に表れているサンプルサブ群を抽出する。 1.2 “Detection of Feature Samples Using Specified Marker Variables” Sub-Function The above-mentioned “Detection of Marker Variables by Extracting Feature Variables of Specific Samples” sub-function is a method for searching for marker variables common to specified sample sub-groups. . On the other hand, the “Sample feature distribution detection by significant difference test of specified marker” sub-function searches for which sample sub-group the specified marker variable has a specific influence by the opposite approach to the previous sub-function. It is a technique to do. In this sub-function, the designated marker variable is filtered by the contribution rate, and then the sample subgroup in which the designated marker variable is characteristically extracted is extracted.

図１０は、このサブ機能の処理の流れを示し、図１１は、このサブ機能の処理をGUIを用いて説明している。以下、図１０に示す各ステップを、図１１を一緒に参照しつつ説明する。 FIG. 10 shows the flow of processing of this sub-function, and FIG. 11 explains the processing of this sub-function using the GUI. Hereinafter, each step shown in FIG. 10 will be described with reference to FIG. 11 together.

１．２．１ステップ１９０：マーカ変数の指定
多変量解析装置１１０は、或る事象の或るサンプル群の対象データ群を、対応する解析テーブル１２０から読み込み、多変量解析、例えばPCA解析、を行ない、そして、その解析結果を一つのGUI上に表示する。そのGUI上で変数が表示されているウィンドウ、例えば変数プロット・ウィンドウ１５４上で、ユーザが、興味ある任意の任意の１以上の変数１９８を、マーカ変数として指定する（寄与率プロット・ウィンドウ１５６またはローディング・チャート・ウィンドウ１５８でマーカ変数を指定してもよい。）。図１１に示された例では、ユーザは、変数プロット・ウィンドウ１５４上で他の変数から分離されて孤立しているような１又は複数の変数１９８に興味を持ち、その変数１９８をマーカ変数として指定している。 1.2.1 Step 190: Designation of Marker Variable The multivariate analysis apparatus 110 reads a target data group of a certain sample group of a certain event from the corresponding analysis table 120, and performs multivariate analysis such as PCA analysis. And display the analysis result on one GUI. In a window in which variables are displayed on the GUI, such as the variable plot window 154, the user designates any one or more variables 198 of interest as marker variables (contribution plot window 156 or Marker variables may be specified in the loading chart window 158.) In the example shown in FIG. 11, the user is interested in one or more variables 198 that are isolated and isolated from other variables on the variable plot window 154, and the variable 198 is used as a marker variable. It is specified.

１．２．２ステップ１９２：寄与率によるフィルタリング
上記ステップ１９０で指定された変数について、その変数に意味があるかどうかが、寄与率等の別のパラメータを用いて評価される。特にノイズのようにランダムで偶発的に指定された変数が、事前に求められた寄与率等による評価結果に基づいて検出され、マーカ変数から除外される。具体的には、寄与率などのパラメータの値が或るレベル以下である変数を排除することで、本当に意味のある変数のみがマーカ変数として絞り込まれる。 1.2.2 Step 192: Filtering by Contribution Rate For the variable specified in Step 190 above, whether or not the variable is meaningful is evaluated using another parameter such as the contribution rate. In particular, a variable that is randomly and accidentally specified, such as noise, is detected based on an evaluation result based on a contribution rate obtained in advance, and is excluded from the marker variable. Specifically, by excluding variables whose parameter values such as contribution rate are below a certain level, only variables that are truly meaningful are narrowed down as marker variables.

絞り込まれたマーカ変数が、GUI上で視覚的に識別できるように、ハイライトされて表示され。例えば、図１１に例示するように、スペクトル・チャート・ウィンドウ１６０、寄与率チャート・ウィンドウ１５６、ローディング・チャート・ウィンドウ１５８を通じて、それらのマーカ変数に対してのみ所定カラーの帯２００が付され、また、変数プロット・ウィンドウ１５４で、それらのマーカ変数が他の変数とは異なるカラーのドットで表示される。 Filtered marker variables are highlighted and displayed for visual identification on the GUI. For example, as illustrated in FIG. 11, a predetermined color band 200 is attached only to the marker variables through the spectrum chart window 160, the contribution chart window 156, and the loading chart window 158, and In the variable plot window 154, the marker variables are displayed with different colored dots from the other variables.

１．２．３ステップ１９４：指定マーカ変数に共通する特徴サンプルサブ群の抽出
上記ステップ１９２で決定されたマーカ変数に共通する特徴的な１以上のサンプル（サンプルサブ群）が検出される。ここで、或るマーカ変数に「特徴的なサンプル」とは、そのサンプルがもつそのマーカ変数の値が、他のサンプルがもつそのマーカ変数の値と比べて、明らかに大きいもしくは小さいと判断されるサンプルである。判別方法として種々の統計的な手法が採用できるが、本実施形態は、次に説明するような平均値を利用した方法を採用する。 1.2.3 Step 194: Extraction of Feature Sample Subgroup Common to Designated Marker Variable One or more characteristic samples (sample subgroup) common to the marker variable determined in step 192 are detected. Here, a “characteristic sample” for a marker variable is determined to be that the value of the marker variable of that sample is clearly larger or smaller than the value of the marker variable of another sample. This is a sample. Although various statistical methods can be adopted as the determination method, this embodiment employs a method using an average value as described below.

すなわち、サンプル群の対象データ群全部に基づいて、サンプル群全体における、各マーカ変数の値の分布が計算される。或るマーカ変数について、例えば図１２に示すような値の分布が得られる。各マーカ変数の値の分布に基づいて、図９Aと図９Bを参照して説明したのと同様な方法で、そのマーカ変数について最小値x_{k_min}側の特徴領域（x_{k_min}とx_{k_min_th}の間の領域）と最大値x_{k_max}側の特徴領域（x_{k_max_th}とx_{k_max}との間の領域）が決定される。そして、それぞれのサンプルの対象データがもつそのマーカ変数の値に基づいて、そのマーカ変数の値が何れかの特徴領域に入る（すなわち、そのサンプルがもつそのマーカ変数の値が、他のサンプルより明らかに大きいまたは小さい）サンプル２０２、２０４を抽出する。この処理を全てのマーカ変数について行い、全てのマーカ変数の値が何れかの特徴領域に入るサンプルだけを絞り込むことで、それらのマーカ変数に共通する特徴サンプルサブ群が抽出される。 That is, the distribution of the value of each marker variable in the entire sample group is calculated based on the entire target data group of the sample group. For a certain marker variable, for example, a value distribution as shown in FIG. 12 is obtained. Based on the distribution of the value of each marker variable, a feature region (between x _{k_min} and x _{k_min_th} ₎ on the side of the minimum value x _{k_min} for that marker variable in the same manner as described with reference to FIGS. 9A and 9B. Region) and a feature region on the side of the maximum value x _{k_max} (a region between x _{k_max_th} and x _{k_max} ) are determined. Then, based on the value of the marker variable of the target data of each sample, the value of the marker variable enters one of the feature areas (that is, the value of the marker variable of the sample Samples 202, 204 are extracted (obviously large or small). This process is performed for all the marker variables, and by narrowing down only the samples in which all the marker variable values fall in any feature region, a feature sample subgroup common to those marker variables is extracted.

１．２．４ステップ１９６：特徴サンプルサブ群の表示
上記ステップ１９４で最終的に抽出された特徴サンプル２０２、２０４が、GUIのサンプル・プロット・ウィンドウ１５２上に、視覚的に識別できるようにハイライトされて表示される。また、それら特徴サンプルの実態データが読み出され、それらのスペクトルがスペクトル・チャート・ウィンドウ１６０上に表示される。好ましくは、それぞれの特徴サンプル２０２、２０４のもつ指定マーカ変数の値が最大値に近いのか、もしくは最小値に近いのかが、視覚的に区別できるように、前者の特徴サンプル２０２と後者の特徴サンプル２０４とが異なる色で表示される。例えば、サンプル・プロット・ウィンドウ１５２上で、前者の特徴サンプル２０２の表にカラーがオレンジであり、後者の特徴サンプル２０４の表示カラーがブルーである。さらに、例えば、サンプル・プロット・ウィンドウ１５２上で、特徴サンプル２０２、２０４の各々がもつマーカ変数の値も表示される。これにより、指定したマーカ変数の量的な特徴が視覚的に容易に把握できる。 1.2.4 Step 196: Display of Feature Sample Subgroup Feature samples 202, 204 finally extracted in step 194 above are high on the GUI sample plot window 152 so that they can be visually identified. Written and displayed. Further, the actual data of the feature samples are read out, and their spectra are displayed on the spectrum chart window 160. Preferably, the former feature sample 202 and the latter feature sample can be visually discriminated whether the value of the designated marker variable of each feature sample 202, 204 is close to the maximum value or the minimum value. 204 is displayed in a different color. For example, on the sample plot window 152, the color of the table of the former feature sample 202 is orange, and the display color of the latter feature sample 204 is blue. Further, for example, on the sample plot window 152, the value of the marker variable of each of the feature samples 202 and 204 is also displayed. Thereby, the quantitative feature of the designated marker variable can be easily grasped visually.

１．３「分析データと付加データとの相補的な解析＃１」サブ機能
従来行われているほとんどの解析手法は、一つの観点から（例えば、一つの事象に関して、一つの一次分析装置を用いて）分析して得られたデータを基に、一意的な結論を求めようとしている。しかし、多くの異なる観点から分析を行って様々な関連する情報、データを収集し、それらの情報（知見）を総合的または相補的に解釈してこそ、より正しい結論が導き出される筈である。また、それぞれの観点から得られた情報（知見）同士の相関を見ることにより、新たな知見を得ることができるであろう。このサブ機能は、ある観点での分析データから得られた結果と、それに関連する別の１以上の観点での情報（付加データ）とを有機的に結合し、それらの相関が一つのGUIにて視覚的に理解できるように提示する機能である。このサブ機能は、指定された付加データに特徴的なサンプルを自動的に抽出し、そして、それらのサンプルに特異なマーカ変数を自動的に見つけ出す機能である。これにより、ユーザは、付加データと変数との相関関係を視覚的に容易に把握することができる。なお、付加データは、図３に示された解析テーブル１２０内の付加データ欄１４０に予め登録されている。 1.3 “Complementary analysis # 1 of analysis data and additional data” sub-function Most conventional analysis methods are based on a single point of view (eg, using one primary analyzer for one event). Based on the data obtained from the analysis, a unique conclusion is being sought. However, more accurate conclusions can be drawn only by analyzing from many different viewpoints, collecting various related information and data, and interpreting the information (knowledge) comprehensively or complementarily. Moreover, new knowledge will be acquired by seeing the correlation between the information (knowledge) obtained from each viewpoint. This sub-function organically combines the results obtained from analysis data from a certain point of view with information (additional data) from one or more other points related to it, and their correlation is combined into a single GUI. It is a function that is presented so that it can be visually understood. This sub-function is a function that automatically extracts samples characteristic of specified additional data and automatically finds marker variables specific to those samples. Thereby, the user can easily grasp the correlation between the additional data and the variable visually. The additional data is registered in advance in the additional data column 140 in the analysis table 120 shown in FIG.

図１３は、このサブ機能の処理の流れを示し、図１４は、このサブ機能の処理をGUIを用いて説明している。以下、図１３に示す各ステップを、図１４を一緒に参照しつつ説明する。この説明では、付加データとして、或る種類の生化学データを用いた場合を例にとる。 FIG. 13 shows the flow of processing of this sub-function, and FIG. 14 explains the processing of this sub-function using the GUI. Hereinafter, each step shown in FIG. 13 will be described with reference to FIG. 14 together. In this description, a case where a certain kind of biochemical data is used as additional data is taken as an example.

１．３．１ステップ２１０：付加データ（生化学データ）の指定
多変量解析装置１１０は、或る事象の或るサンプル群の対象データ群を、対応する解析テーブル１２０から読み込み、多変量解析、例えばPCA解析、を行ない、そして、その解析結果を一つのGUI上に表示する。ユーザは、そのGUIのコントロール・パネル１５０上の付加データ選択メニュー２２０を用いて、興味ある任意の種類の付加データ（例えば生化学データの一種である体重差データ）を選択する。 1.3.1 Step 210: Specifying Additional Data (Biochemical Data) The multivariate analysis apparatus 110 reads a target data group of a certain sample group of a certain event from the corresponding analysis table 120, and performs multivariate analysis, For example, PCA analysis is performed, and the analysis result is displayed on one GUI. Using the additional data selection menu 220 on the control panel 150 of the GUI, the user selects any kind of additional data of interest (for example, weight difference data which is a kind of biochemical data).

１．３．２ステップ２１２：特定サンプルサブ群の抽出
解析テーブル１２０から、サンプル群全体の指定生化学データの値が読み出される。上述の１．２．３項で説明したステップ１９４で指定マーカ変数について行われた方法（図１２を参照して説明した方法）と同様の方法で、指定された生化学データの各々についてサンプル群全体での値の分布が計算され、それらの分布に基づいて、指定された生化学データに共通する特徴的なサンプル（サンプルサブ群）２２０、２２２が抽出される。指定生化学データの値が最大値に近い特徴サンプル２２０と、最小値に近い特徴サンプル２２２とが、区別できるように別のカラーで、サンプル・プロット・ウィンドウ１５２上で表示される。特徴サンプル２２０、２２２の実態データが読み出され、それらのスペクトルが、スペクトル・チャート・ウィンドウ１６０上に表示される。 1.3.2 Step 212: Extraction of Specific Sample Subgroup From the analysis table 120, the value of designated biochemical data of the entire sample group is read. A sample group for each of the designated biochemical data in the same manner as that performed for the designated marker variable in step 194 described in section 1.2.3 above (the method described with reference to FIG. 12). The overall distribution of values is calculated, and characteristic samples (sample subgroups) 220, 222 common to the specified biochemical data are extracted based on the distribution. The feature sample 220 whose designated biochemical data value is close to the maximum value and the feature sample 222 close to the minimum value are displayed on the sample plot window 152 in different colors so that they can be distinguished. Actual data of the feature samples 220 and 222 are read out, and their spectra are displayed on the spectrum chart window 160.

１．３．３ステップ２１４：特徴サンプルサブ群における特徴変数の抽出
１．１．２項で説明したステップ１７２と同様な方法で、上記ステップ２１２で抽出された特徴サンプルサブ群に共通する特徴的な変数が抽出される。 1.3.3 Step 214: Extraction of Feature Variables in Feature Sample Subgroup Characteristic common to the feature sample subgroup extracted in step 212 in the same manner as step 172 described in section 1.1.2. Variable is extracted.

１．３．４ステップ２１６：寄与率によるフィルタリング
１．１．３項で説明したステップ１７４と同様な方法によるフィルタリングが、上記ステップ２１４で抽出された特徴変数に対して適用されて、マーカ変数が決定される。マーカ変数は、スペクトル・チャート・ウィンドウ１６０、寄与率チャート・ウィンドウ１５６及びローディング・チャート・ウィンドウ１５８を通じて、視覚的に区別できるように、例えば所定カラーの帯２２４によりハイライトされて表示される。 1.3.4 Step 216: Filtering by Contribution Ratio Filtering by the same method as step 174 described in section 1.1.3 is applied to the feature variable extracted in step 214, and the marker variable is It is determined. The marker variable is highlighted and displayed by a band 224 of a predetermined color, for example, through the spectrum chart window 160, the contribution chart window 156, and the loading chart window 158 so as to be visually distinguishable.

１．３．５マーカ変数の表示
１．１．４項で説明したステップ１７６と同様な方法で、マーカ変数が、視覚的に識別できように他の変数とは別のカラーで、変数プロット・ウィンドウ１５４上に表示される。これらのマーカ変数は、指定された付加データ（生化学データ）との相関が、他の変数より高いと推定することができる。 1.3.5 Displaying marker variables In the same manner as in step 176 described in section 1.1.4, the marker variable is displayed in a different color from the other variables so that it can be visually identified. It is displayed on the window 154. These marker variables can be estimated to have a higher correlation with designated additional data (biochemical data) than other variables.

１．４「分析データと付加データとの相補的な解析＃２」サブ機能
このサブ機能は、１．３項で説明したサブ機能と同様に、ある観点での分析データから得られた結果と、それに関連する別の１以上の観点での情報（付加データ）とを有機的に結合し、それらの相関が一つのGUIにて視覚的に理解できるように提示する機能である。このサブ機能は、PC(主成分)解析で得られたPC（主成分）と、指定された付加データとを座標軸とする座標空間に、サンプル群をプロットして、GUI上に表示する機能である。 1.4 “Complementary analysis # 2 of analysis data and additional data” sub-function This sub-function is similar to the sub-function described in section 1.3, and results obtained from analysis data from a certain point of view. This is a function that organically combines information (additional data) from one or more other viewpoints related thereto and presents such correlation so that it can be visually understood by one GUI. This sub-function is a function that plots sample groups in the coordinate space with the PC (principal component) obtained by PC (principal component) analysis and the specified additional data as coordinate axes and displays them on the GUI. is there.

図１５は、このサブ機能の処理の流れを示し、図１６は、このサブ機能の処理をGUIを用いて説明している。以下、図１５に示す各ステップを、図１６を一緒に参照しつつ説明する。この説明では、付加データとして、或る種類の生化学データを用いた場合を例にとる。 FIG. 15 shows the flow of processing of this sub-function, and FIG. 16 explains the processing of this sub-function using the GUI. Hereinafter, each step shown in FIG. 15 will be described with reference to FIG. 16 together. In this description, a case where a certain kind of biochemical data is used as additional data is taken as an example.

１．４．１ステップ２３０：付加データ（生化学データ）の指定
多変量解析装置１１０は、或る事象の或るサンプル群の対象データ群を、対応する解析テーブル１２０から読み込み、多変量解析、例えばPCA解析、を行ない、そして、その解析結果を一つのGUI上に表示する。ユーザは、そのGUIのコントロール・パネル１５０上の３つの座標軸（x、y、z軸）選択メニュー２３４を用いて、少なくとも１つの任意の座標軸に、興味ある任意の種類の付加データ（例えば生化学データの一種である体重差データ）を指定し、残りの座標軸に、PC（主成分）解析で得られた任意のPCを指定する。図１５に示された例では、ｘ軸に生化学データの一つである体重差が指定され、ｙ軸とｚ軸には或る同じPCが指定されている（ｙ軸とｚ軸に異なるPCを指定しても、勿論よい。）。 1.4.1 Step 230: Designation of Additional Data (Biochemical Data) The multivariate analysis apparatus 110 reads a target data group of a certain sample group of a certain event from the corresponding analysis table 120, multivariate analysis, For example, PCA analysis is performed, and the analysis result is displayed on one GUI. The user uses the three coordinate (x, y, z) selection menu 234 on the GUI control panel 150 to select any type of additional data of interest (eg, biochemistry) on at least one arbitrary coordinate axis. Specify weight difference data, which is a kind of data), and specify any PC obtained by PC (principal component) analysis for the remaining coordinate axes. In the example shown in FIG. 15, a weight difference, which is one of biochemical data, is designated on the x axis, and the same PC is designated on the y axis and the z axis (different between the y axis and the z axis). Of course, you can specify a PC.)

１．４．２ステップ２３２：PC軸の置換
多変量解析装置１１０は、解析対象のサンプル群の解析テーブル１２０から、サンプル群の指定付加データのデータを読み込み、そして、サンプル・プロット・ウィンドウ１５２に表示されている座標空間を構成する複数のPC軸の内、ユーザに指定されたPC軸を、ユーザに指定された付加データの値に対応する座標軸（以下、付加データ軸という）に置き換える。そして、その付加データ軸と残りのPC軸とから構成される座標空間に、そのサンプル群をプロットし直したサンプル・プロット・チャートを作成し、そして、そのサンプル・プロット・チャートを、サンプル・プロット・ウィンドウ１５２に表示する。図１５に例示された例では、ｘ軸に体重差が指定され、ｙ軸とｚ軸には同一のPCが指定されている（つまり、座標軸が実質的に2本である）ため、サンプル・プロット・ウィンドウ１５２には２次元（縦軸が体重差、横軸が指定PC）のサンプル・プロット・チャートが表示されているが、x、y、zの３軸にそれぞれ異なる成分が指定されていれば、３次元のサンプル・プロット・チャートが表示されることになる。 1.4.2 Step 232: Replacement of PC Axis The multivariate analyzer 110 reads the data of the designated additional data of the sample group from the analysis table 120 of the sample group to be analyzed, and the sample plot window 152 Of the plurality of PC axes constituting the displayed coordinate space, the PC axis specified by the user is replaced with a coordinate axis (hereinafter referred to as an additional data axis) corresponding to the value of the additional data specified by the user. Then, a sample plot chart is created by re-plotting the sample group in the coordinate space composed of the additional data axis and the remaining PC axis, and the sample plot chart is converted into the sample plot. -Display in window 152. In the example illustrated in FIG. 15, the weight difference is designated on the x-axis, and the same PC is designated on the y-axis and the z-axis (that is, the coordinate axes are substantially two). The plot window 152 displays a two-dimensional sample plot chart (vertical axis is weight difference, horizontal axis is specified PC), but different components are specified for each of the three axes x, y, and z. Then, a three-dimensional sample / plot chart is displayed.

このチャートを見ることで、ユーザは、任意の付加データと任意のPC（又は変数）との間の相関関係が視覚的に把握することが容易になる。例えば、図１５に示されたサンプル・プロット・ウィンドウ１５２では、他のサンプル群から分離して孤立したサンプルのサブ群３６が存在するが、このサンプルサブ群２３６は、体重差（縦軸）の特定値と指定PC（横軸）のスコアの特定値の近傍領域に集まっている。このことから、体重差と、このサンプルサブ群２３６に特徴的な何らかの変数との間に、意味のある関連性が存在するかもしれないという推測が得られる。そこで、ユーザは、前記１．１のサブ機能を使って、このサンプルサブ群２３６に共通するマーカ変数を検出してもよい。 By viewing this chart, the user can easily grasp the correlation between any additional data and any PC (or variable) visually. For example, in the sample plot window 152 shown in FIG. 15, there is an isolated sample subgroup 36 separated from other sample groups, and this sample subgroup 236 has a weight difference (vertical axis). It is gathered in the vicinity of the specific value and the specific value of the score of the specified PC (horizontal axis). This leads to the speculation that there may be a meaningful association between the weight difference and some variable characteristic of this sample subgroup 236. Therefore, the user may detect a marker variable common to the sample sub-group 236 using the sub-function of 1.1.

２「複数事象のデータの解析結果に基づく特徴変数及び相関の検出」機能
この機能には、「異事象の分析データ間の相関およびマーカ変数の検出」サブ機能及び「指定マーカ変数における異事象での挙動の抽出」サブ機能が含まれる。以下、これらのサブ機能について説明する。 2 “Detection of feature variables and correlations based on analysis results of multiple event data” function This function includes “Detection of correlations between analysis data of different events and marker variables” sub-function and “Detection of different events in specified marker variables” "Extract Behavior" sub-function is included. Hereinafter, these sub-functions will be described.

２．１「異事象の分析データ間の相関およびマーカ変数の検出」サブ機能
このサブ機能は、ある現象を理解するために、利用可能な複数の分析・評価法を活用して異なる事象の観点からデータを収集し分析して、その現象の全体像を探索することを容易にする。一般的に、それ一つで万能であるという分析法・評価法は存在しないので、研究者は、各手法の得手・不得手を理解した上で、各手法で得られたデータ（情報）を処理し解析し、そして、最終的に研究者の頭の中で、異なる手法で得られた情報を総合的・相補的に組み合わせながら一つの結論（推論）を導き出している。このサブ機能は、各種の手法から得られた異なる事象のデータ（情報）を、一つのGUI上で、相互の相関が把握しやすいように表示する機能である。このサブ機能は、特に、異事象間で共通性のあるサンプル（サンプルのデータ）をピックアップし、それら異事象間で共通性のあるサンプルに特異的なマーカ変数を検出し表示する。それにより、ユーザは、異なる手法で得られた情報を総合的・相補的に組み合わせることが容易になる。 2.1 “Correlation between analysis data of different events and marker variable detection” sub-function This sub-function uses multiple available analysis / evaluation methods to understand a phenomenon and Data is collected and analyzed, making it easy to explore the whole picture of the phenomenon. In general, there is no analysis method / evaluation method that is all-purpose, so researchers understand the pros and cons of each method and use the data (information) obtained by each method. It is processed and analyzed, and finally one conclusion (inference) is drawn in the researcher's head by combining information obtained by different methods comprehensively and complementarily. This sub-function is a function for displaying different event data (information) obtained from various methods so that the mutual correlation can be easily grasped on one GUI. In particular, this sub-function picks up samples (sample data) that are common between different events and detects and displays marker variables specific to samples that are common between these different events. As a result, the user can easily and comprehensively combine information obtained by different methods.

図１７は、このサブ機能の処理の流れを示し、図１８は、このサブ機能の処理をGUIを用いて説明している。以下、図１７に示す各ステップを、図１８を一緒に参照しつつ説明する。 FIG. 17 shows the flow of processing of this sub-function, and FIG. 18 explains the processing of this sub-function using the GUI. Hereinafter, each step shown in FIG. 17 will be described with reference to FIG. 18 together.

２．１．１ステップ２４０：サンプルサブ群の指定
多変量解析装置１１０は、複数の異なる事象の対象データ群を、それぞれの事象の解析テーブル１２０から読み込み、事象毎に多変量解析、例えばPCA解析、を行ない、そして、それら複数の事象の解析結果を一つのGUI上に表示する。例えば、図１７に示したGUI上には、メタボローム、プロテオーム、ゲノムの観点からの３種類の事象A〜Cについて、PCA解析結果として、サンプル・プロット・ウィンドウ１５２A〜１５２C、スペクトル・チャート・ウィンドウ１６０A〜１６０C及び寄与率チャート・ウィンドウ１５６A〜１５６Cなどが、３事象を通じて同じ配置で、表示されている。 2.1.1 Step 240: Designation of Sample Subgroup The multivariate analysis apparatus 110 reads target data groups of a plurality of different events from the analysis table 120 of each event, and performs multivariate analysis, for example, PCA analysis for each event. , And the analysis results of these multiple events are displayed on one GUI. For example, on the GUI shown in FIG. 17, sample plot windows 152A to 152C and spectrum chart window 160A are obtained as PCA analysis results for three types of events AC from the viewpoint of metabolome, proteome, and genome. ~ 160C and contribution chart windows 156A to 156C etc. are displayed in the same arrangement throughout the three events.

ユーザは、任意の１事象、例えば事象A、についてのサンプル群が表示されているウィンドウ、例えばサンプル・プロット・ウィンドウ１５２A上で、興味ある任意のサンプル（サンプルサブ群）２５０を指定する。すると、上記１．１．１で説明したステップ１７０と同様の方法で、指定されたサンプルサブ群２５０は、その事象Aのサンプル・プロット・ウィンドウ１５２A上でハイライト表示され、また、指定されたサンプルサブ群２５０の実態データのスペクトルが、その事象Aのスペクトル・チャート・ウィンドウ１６２A上に表示される。 The user designates any sample (sample subgroup) 250 of interest on a window in which the sample group for any one event, for example, event A, is displayed, for example, the sample plot window 152A. Then, the designated sample subgroup 250 is highlighted on the sample plot window 152A of the event A in the same manner as in the step 170 described in 1.1.1 above, and the designated sample subgroup 250 is designated. The spectrum of the actual data of the sample subgroup 250 is displayed on the spectrum chart window 162A of the event A.

２．１．２ステップ２４２：異事象データの連結
多変量解析装置１１０は、事象A、B、Cの解析テーブル１２０に登録されている各サンプルに割り当てられたキーワード（要するに、異事象A、B、Cのサンプル群間で、共通性のあるサンプル同士を関連づけるリンク情報）を頼りに、上記ステップ２４０で事象Aに関し指定されたサンプルサブ群２５０のデータと共通性のある他の事象B、Cのサンプルサブ群２５２、２５４のデータを検索する。ここで、異事象間で「共通性のある」サンプルのデータとは、例えば、同じサンプルソース（例えば、同じ人）から得られた異種類のサンプル（例えば血液、細胞、染色体）を分析したデータ、あるいは、同じサンプル（例えば、或る混合物）を異なる一次分析装置（例えば、NMR装置、MS装置、DNAアレイなど）で分析して得たデータなどを意味する。 2.1.2 Step 242: Linking Different Event Data The multivariate analysis apparatus 110 uses the keywords assigned to each sample registered in the analysis table 120 of events A, B, and C (in short, different events A and B). , C, and other events B, C that are common to the data of the sample subgroup 250 specified for the event A in step 240 above. The data of the sample subgroups 252 and 254 are searched. Here, sample data “common” between different events is, for example, data obtained by analyzing different types of samples (eg, blood, cells, chromosomes) obtained from the same sample source (eg, the same person). Alternatively, it means data obtained by analyzing the same sample (for example, a mixture) with different primary analyzers (for example, NMR apparatus, MS apparatus, DNA array, etc.).

そして、多変量解析装置１１０は、検索された共通性のあるサンプルサブ群２５２、２５４のデータを、それぞれの事象B、Cのサンプル・プロット・ウィンドウ１５２B、１５２C上にハイライト表示し、さらに、それら共通性のあるサンプルサブ群２５２、２５４の実態データから得られたスペクトルを、それぞれの事象B、Cのスペクトル・チャート・ウィンドウ１６０B、１６０C上に表示する。これにより、異なる事象A、B、Cに関する共通性のあるサンプルサブ群のデータが相互に関連づけられる。 Then, the multivariate analysis apparatus 110 highlights the data of the searched common sample subgroups 252 and 254 on the sample plot windows 152B and 152C of the respective events B and C, and The spectrum obtained from the actual data of the common sample subgroups 252 and 254 is displayed on the spectrum chart windows 160B and 160C of the respective events B and C. As a result, common sample sub-group data regarding different events A, B, and C are correlated with each other.

このように異なる事象A、B、C間で共通性のあるサンプルのデータを関連付ける処理を行うために、図１９に示すように、それらの事象A、B、Cの解析テーブル１２０A、１２０B、１２０Cを通じて、共通性のあるサンプルデータには、キーワードとして、同じ文字列もしくは番号が割り当てられている。例えば、図１９の例では、事象AのサンプルAと、事象BのサンプルFと、事象CのサンプルJとは、互いに共通性のあるサンプルである。 In order to perform processing for associating sample data common to different events A, B, and C in this way, as shown in FIG. 19, analysis tables 120A, 120B, 120C for those events A, B, and C are used. The same character string or number is assigned as a keyword to common sample data. For example, in the example of FIG. 19, the sample A of the event A, the sample F of the event B, and the sample J of the event C are common samples.

図１９に矢印で例示するように、一つの事象Aの解析テーブル１２０Aにて、ユーザにより指定されたサンプル名を検索し、検索されたサンプル名に割り当てられたキーワードを読み、そして、別の事象B、Cの解析テーブル１２０B、１２０Cから、そのキーワードが割り当てられたサンプル名を調べることで、異なる事象A、B、C間で共通性のあるサンプルが識別できる。 As illustrated by an arrow in FIG. 19, the sample name specified by the user is searched in the analysis table 120A of one event A, the keyword assigned to the searched sample name is read, and another event is detected. By examining the sample names to which the keywords are assigned from the analysis tables 120B and 120C for B and C, it is possible to identify samples that are common between different events A, B, and C.

２．１．３ステップ２４４A、B、C〜２４８A、B、C：事象毎のマーカ変数の検出と表示
上述１．１．２項から１．１．４項で説明したと同様の方法で、事象A、B、Cのそれぞれについて、指定サンプルサブ群（事象A）及びそれに関連付けられたサンプルサブ群（事象B、C）に共通する特徴変数の抽出、寄与率によるフィルタリング、及びマーカ変数の表示の処理が行われる。その結果、図１８に例示するように、それらの事象A、B、Cにおいて検出されたマーカ変数２５６、２５８、２５９がハイライト表示される。 2.1.3 Steps 244A, B, C to 248A, B, C: Detection and display of marker variables for each event In the same manner as described in the above section 1.1.2 to 1.1.4, For each of events A, B, and C, extraction of feature variables common to the specified sample subgroup (event A) and its associated sample subgroup (events B and C), filtering by contribution rate, and display of marker variables Is performed. As a result, as illustrated in FIG. 18, marker variables 256, 258, and 259 detected in those events A, B, and C are highlighted.

ユーザは、異なる事象A、B、C間で共通性あるサンプルに特異的なマーカ変数を視覚的に容易に確認でき、事象A、B、Cのマーカ変数間に何らかの意味ある関連性（例えば因果関係）が存在する可能性を容易に知ることができる。これにより、例えばメタボローム、プロテオーム及びゲノムなどの異レベルの研究分野間、あるいは、例えばNMR装置、MS装置、ＤＮＡアレイなどの異種類の一次分析装置で得られたデータ間の相関関係を解き明かして、新たな知見を得ようとする研究が、より容易に行える。 Users can easily visually identify marker variables specific to samples that are common between different events A, B, and C, and have some meaningful association between the marker variables of events A, B, and C (eg causal It is easy to know the possibility that a relationship) exists. In this way, for example, the correlation between different levels of research fields such as metabolome, proteome and genome, or between data obtained by different types of primary analyzers such as NMR apparatus, MS apparatus, DNA array, etc. Research to obtain new knowledge can be performed more easily.

２．２「指定マーカ変数における異事象での挙動の抽出」サブ機能
このサブ機能も、上記２．１のサブ機能と同様、ある現象を理解するために、利用可能な複数の分析・評価法を活用して異なる事象の観点からデータを収集し分析して、その現象の全体像を探索することを容易にする。このサブ機能は、特に、特定の事象における特定のマーカ変数が指定されると、そのマーカ変数により特徴付けられるサンプルのサブ群をその特定事象と他の事象のサンプル群からピックアップし、そして、それら異事象のサンプルに特異的なマーカ変数を検出し表示する。それにより、ユーザは、異なる手法で得られた情報を総合的・相補的に組み合わせることが容易になる。 2.2 “Extracting behavior of different events in specified marker variables” sub-function This sub-function, like the sub-function in 2.1 above, is also available with multiple analysis / evaluation methods that can be used to understand a phenomenon. Makes it easy to collect and analyze data from the perspective of different events and explore the whole picture of the phenomenon. This sub-function, especially when a particular marker variable in a particular event is specified, picks up a sub-group of samples characterized by that marker variable from that particular event and other event sample groups, and Detect and display marker variables specific to foreign event samples. As a result, the user can easily and comprehensively combine information obtained by different methods.

図２０は、このサブ機能の処理の流れを示し、図２１は、このサブ機能の処理をGUIを用いて説明している。以下、図２０に示す各ステップを、図２１を一緒に参照しつつ説明する。 FIG. 20 shows the flow of processing of this sub-function, and FIG. 21 explains the processing of this sub-function using the GUI. Hereinafter, each step shown in FIG. 20 will be described with reference to FIG. 21 together.

２．２．１ステップ２６０：マーカ変数の指定
多変量解析装置１１０は、上記２．１のサブ機能の場合と同様、複数の異なる事象の対象データ群を、それぞれの事象の解析テーブル１２０から読み込み、事象毎に多変量解析、例えばPCA解析、を行ない、そして、それら複数の事象の解析結果を一つのGUI上に表示する。例えば、図２１に示したGUI上には、３種類の事象A〜Cについて、PCA解析結果として、サンプル・プロット・ウィンドウ１５２A〜１５２C、スペクトル・チャート・ウィンドウ１６０A〜１６０C及び寄与率チャート・ウィンドウ１５６A〜１５６Cなどが、３事象を通じて同じ配置で、表示されている。 2.2.1 Step 260: Designation of Marker Variable The multivariate analysis apparatus 110 reads the target data groups of a plurality of different events from the analysis table 120 of each event, as in the case of the subfunction of 2.1 above. Multivariate analysis is performed for each event, for example, PCA analysis, and the analysis results of these multiple events are displayed on one GUI. For example, on the GUI shown in FIG. 21, sample plot windows 152A to 152C, spectrum chart windows 160A to 160C, and contribution rate chart window 156A are obtained as PCA analysis results for three types of events A to C. ˜156C etc. are displayed in the same arrangement throughout the three events.

ユーザは、任意の１事象、例えば事象A、について変数が表示されているウィンドウ、例えば寄与率チャート・ウィンドウ１５６A上で、興味ある任意の変数２８０をマーカ変数として指定する（変数プロット・ウィンドウ１５４またはローディング・チャート・ウィンドウ１５８を表示して、いずれかのウィンドウでマーカ変数を指定してもよい。）。 The user designates any variable 280 of interest as a marker variable on a window in which variables are displayed for any one event, such as event A, for example, the contribution chart window 156A (variable plot window 154 or (The loading chart window 158 may be displayed, and the marker variable may be specified in any window.)

２．２．２ステップ２６２〜２６６：特徴サンプルサブ群の抽出
上述１．２．２項〜１．２．４項で説明した方法と同様な方法で、上記１つの事象Aに関して、指定マーカ変数の寄与率によるフィルタリング、フィルタリングで絞り込まれたマーカ変数に共通する特徴サンプルサブ群の抽出、及び抽出された特徴サンプルサブ群の表示が行われる。図２１に例示されるように、寄与率によるフィルタリングで絞り込まれたマーカ変数２８０は、視覚的に容易に識別できるようにハイライト表示される。また、図２１に例示されるように、マーカ変数２８０の分布の最大値に近い特徴領域にそのマーカ変数２８０を有する特徴サンプルサブ群２８２と、同分布の最小値に近い特徴領域にそのマーカ変数２８０を有する特徴サンプルサブ群２８４とが抽出され、そして、その２種類の特徴サンプルサブ群２８２、２８４は視覚的に区別できるように、サンプル・プロット・ウィンドウ１５２A上で異なるカラーでハイライト表示される。さらに、その特徴サンプルサブ群２８２、２８４の実態データから得られたスペクトルが、スペクトル・チャート・ウィンドウ１６０Aに表示される。 2.2.2 Steps 262 to 266: Extraction of Feature Sample Subgroups For the one event A, the designated marker variable is extracted in the same manner as described in the above sections 1.2.2 to 1.2.4. Filtering based on the contribution ratio of, extraction of feature sample subgroups common to marker variables narrowed down by filtering, and display of the extracted feature sample subgroups are performed. As illustrated in FIG. 21, the marker variable 280 narrowed down by the filtering by the contribution rate is highlighted so that it can be easily identified visually. Further, as illustrated in FIG. 21, a feature sample sub-group 282 having the marker variable 280 in the feature region close to the maximum value of the distribution of the marker variable 280, and the marker variable in the feature region close to the minimum value of the distribution. Feature sample sub-group 284 having 280 is extracted, and the two feature sample sub-groups 282, 284 are highlighted in different colors on the sample plot window 152A so that they can be visually distinguished. The Further, the spectrum obtained from the actual data of the feature sample subgroups 282 and 284 is displayed in the spectrum chart window 160A.

２．２．３ステップ２６８：異事象データの連結
上述２．１．２項で説明した方法と同様の方法で、上記ステップ２６２〜２６４で事象Aに関して抽出された特徴サンプルサブ群２８２、２８４のデータと共通性のある他の事象B、Cのサンプルサブ群２８６、２８２；２９０、２９２のデータが検索される。そして、検索された共通性のあるサンプルサブ群２８６、２８２；２９０、２９２のデータは、それぞれの事象B、Cのサンプル・プロット・ウィンドウ１５２B、１５２C上で、同様にハイライト表示される。さらに、それら共通性のあるサンプルサブ群２８６、２８２；２９０、２９２の実態データから得られたスペクトルが、それぞれの事象B、Cのスペクトル・チャート・ウィンドウ１６０B、１６０C上に表示される。これにより、異なる事象A、B、Cに関する同じサンプルサブ群のデータが相互に関連づけられる。 2.2.3 Step 268: Concatenation of Different Event Data The feature sample subgroups 282 and 284 extracted for event A in steps 262 to 264 above in the same manner as described in section 2.1.2 above. Data for the sample subgroups 286, 282; 290, 292 of other events B, C that are in common with the data are retrieved. The data of the retrieved common sample subgroups 286, 282; 290, 292 are similarly highlighted on the respective event B, C sample plot windows 152B, 152C. Further, spectra obtained from actual data of the sample subgroups 286, 282; 290, 292 having the commonality are displayed on the spectrum chart windows 160B, 160C of the respective events B, C. This correlates data for the same sample subgroup for different events A, B, C.

２．３．４ステップ２７０B、C〜２７４B、C：他事象のマーカ変数の検出と表示
上述１．１．２項から１．１．４項で説明したと同様の方法で、上記ステップ２６８で検索された他事象B、Cのサンプルサブ群２８６、２８２；２９０、２９２のそれぞれについて、そのサンプルサブ群に共通する特徴変数の抽出、寄与率によるフィルタリング、及びマーカ変数の表示の処理が行われる。その結果、図２１に例示するように、他事象B、Cにおいて検出されたマーカ変数２９４、２９６がハイライト表示される。 2.3.4 Steps 270B, C to 274B, C: Detection and Display of Marker Variables of Other Events In the same manner as described in Sections 1.1.2 to 1.1.4 above, in Step 268 above For each of the retrieved sub-groups 286 and 282 of other events B and C; 290 and 292, extraction of feature variables common to the sample sub-groups, filtering by contribution ratio, and display of marker variables are performed. . As a result, as illustrated in FIG. 21, marker variables 294 and 296 detected in other events B and C are highlighted.

ユーザは、ハイライト表示された事象A、B、Cのマーカ変数間に何らかの意味ある関連性（例えば因果関係）が存在する可能性を容易に知ることができる。 The user can easily know the possibility that some meaningful relationship (for example, a causal relationship) exists between the marker variables of the highlighted events A, B, and C.

３「異構造のデータの解析結果に基づく特徴変数及び相関の検出」機能
この機能は、上述した異事象のデータの解析の概念をさらに発展させたもので、前述の機能で取り扱われたような多変量解析に適した構造をもつデータ（又は、そのような構造のデータから把握される現象）と、それとは別の構造をもつデータ（例えば、画像データ、又は、そのような別構造のデータから把握される現象）との間の相関を調べ、特徴抽出することにより、今までに得ることのできなかった全く新たな情報を得ることを容易にする。以下では、具体例として、MRI装置のCSI（Chemical Shift Imaging）機能により得られるCSIスペクトルデータ（多変量解析に適した構造のデータ）と、そのMRI装置から得られるMRI画像データ（別構造のデータ）とを用いた解析法を説明する。ここで、CSIスペクトルデータは、大まかにいえば、MRI画像中の多数の部位（ボクセル）の各々のケミカルなスペクトルデータ（NMRスペクトルデータの一種）である。 3 “Detection of feature variables and correlations based on analysis results of data with different structure” This function is a further development of the above-mentioned concept of analysis of data on different events, and is handled as described above. Data having a structure suitable for multivariate analysis (or a phenomenon grasped from data having such a structure) and data having a different structure (for example, image data or data having such a different structure) By examining the correlation with the phenomenon) and extracting features, it becomes easy to obtain completely new information that could not be obtained so far. Below, as a specific example, CSI spectrum data (structure data suitable for multivariate analysis) obtained by the CSI (Chemical Shift Imaging) function of the MRI apparatus and MRI image data (data of another structure) obtained from the MRI apparatus ) Will be described. Here, the CSI spectrum data is roughly chemical spectrum data (a kind of NMR spectrum data) of each of a large number of sites (voxels) in the MRI image.

この例では、多変量解析装置１１０は、MRI画像データ（図２に示された別構造の実態データ１１８に相当する）とCSIで得られたそのMRI画像中の多数部位のスペクトルデータ（図２に示された解析テーブル１２０中の対象データ群に相当する）とを入力し、入力された多数部位のスペクトルデータに多変量解析、例えばPCA解析、を適用し、そして、その多変量解析結果とMRI画像中の位置情報との相関を導き出す。 In this example, the multivariate analysis apparatus 110 includes MRI image data (corresponding to the actual data 118 of another structure shown in FIG. 2) and spectrum data of multiple parts in the MRI image obtained by CSI (FIG. 2). Corresponding to the target data group in the analysis table 120 shown in FIG. 5), multivariate analysis, for example, PCA analysis, is applied to the input spectrum data of a large number of parts, and the multivariate analysis result and The correlation with the position information in the MRI image is derived.

従来、特定のスペクトルの特徴として抽出される特定の物質・成分に専ら着目して、その多寡または増減に基づいて評価を行っている。これに対し、この機能によれば、個々のスペクトルの物質・成分だけでなく、別構造データが表すスペクトル全体のパターン（例えば、MRI画像）を認識して、その全体パターン中から特徴抽出を行うことにより、今までとは全く違う解釈を得ることが可能になる。特に、今まで困難であった異常部位の特定や異常原因の究明・診断の可能性が出てくる。 Conventionally, focusing on a specific substance / component extracted as a characteristic of a specific spectrum, evaluation is performed based on the number or increase / decrease. On the other hand, according to this function, not only the substance / component of each spectrum but also the pattern of the whole spectrum (for example, MRI image) represented by the separate structure data is recognized, and the feature is extracted from the whole pattern. This makes it possible to obtain a completely different interpretation. In particular, there is a possibility of identifying an abnormal site and investigating and diagnosing the cause of the abnormality, which have been difficult until now.

この機能には、「異構造データ間の相関の検出＃１」サブ機能、「異構造データ間の相関の検出＃２」サブ機能、及び「異構造データ間の相関の検出＃３」サブ機能がある。以下、これらのサブ機能について説明する。 This function includes a “correlation detection between different structure data # 1” sub-function, a “correlation detection between different structure data # 2” sub-function, and a “correlation detection between different structure data # 3” sub-function. There is. Hereinafter, these sub-functions will be described.

３．１「異構造データ間の相関の検出＃１」サブ機能
図２２は、このサブ機能の処理の流れを示し、図２３は、このサブ機能の処理をGUIを用いて説明している。以下、図２２に示す各ステップを、図２３を一緒に参照しつつ説明する。 3.1 “Detection of Correlation between Different Structure Data # 1” Subfunction FIG. 22 shows the flow of processing of this subfunction, and FIG. 23 explains the processing of this subfunction using the GUI. Hereinafter, each step shown in FIG. 22 will be described with reference to FIG. 23 together.

３．１．１ステップ３００：特定サンプルの指定
多変量解析装置１１０は、MRI画像データ（図２に示された別構造の実態データ１１８に相当）を読み込み、MRI画像データに基づきMRI画像を作成してこれをGUI上に表示する。また、多変量解析装置１１０は、GUI上に表示されたMRI画像中の多数の部位つまりボクセル（以下、サンプルという）のCSIヒストグラム・データ群（図２に示される対象データ群）を読み込み、CSIヒストグラム・データ群の多変量解析、例えばPCA解析、を行ない、そして、その解析結果を同じGUI上に表示する。例えば、図２３に例示したGUI上には、別構造データ・ウィンドウ１６４にMRI画像が表示され、また、サンプル・プロット・ウィンドウ１５２、変数プロット・ウィンドウ１５４、寄与率チャート・ウィンドウ１５６及びローディング・チャート・ウィンドウ１５８に、そのMRI画像内の多数のサンプル（ボクセル）のスペクトルデータのPCA解析結果を表示されている。 3.1.1 Step 300: Designation of Specific Sample The multivariate analyzer 110 reads MRI image data (corresponding to the actual data 118 of another structure shown in FIG. 2) and creates an MRI image based on the MRI image data. And display this on the GUI. Further, the multivariate analysis apparatus 110 reads CSI histogram data groups (target data groups shown in FIG. 2) of a large number of regions, that is, voxels (hereinafter referred to as samples) in the MRI image displayed on the GUI, and performs CSI. Multivariate analysis of histogram data group, for example, PCA analysis is performed, and the analysis result is displayed on the same GUI. For example, on the GUI illustrated in FIG. 23, an MRI image is displayed in a separate structure data window 164, and a sample plot window 152, a variable plot window 154, a contribution chart window 156, and a loading chart are displayed. In the window 158, PCA analysis results of spectrum data of a large number of samples (voxels) in the MRI image are displayed.

ユーザは、このGUIの別構造データ・ウィンドウ１６４に表示されたMRI画像上で、興味ある任意のサンプル（ボクセル）３１２を指定する。 The user designates an arbitrary sample (voxel) 312 of interest on the MRI image displayed in the separate structure data window 164 of the GUI.

３．１．２ステップ３０２：異構造データの連結
多変量解析装置１１０は、上記ステップ３００で指定されたサンプル（ボクセル）のMRI画像上での位置情報（識別番号）と、解析テーブル１２０のキーワードとを用いて、指定されたサンプル（ボクセル）のMRI画像上の位置と、そのサンプル（ボクセル）の対象データの多変量解析結果とを相互に関連づけて連結する。 3.1.2 Step 302: Linking Different Structure Data The multivariate analysis apparatus 110 uses the position information (identification number) on the MRI image of the sample (voxel) specified in step 300 and the keyword of the analysis table 120. And the position of the designated sample (voxel) on the MRI image and the multivariate analysis result of the target data of the sample (voxel) are associated with each other and connected.

すなわち、図２４に例示するように、CSIスペクトルの解析テーブル１２０には、予め、各サンプル（ボクセル）のキーワードとして、各サンプルのMRI画像上での位置情報（識別番号）が登録されている。多変量解析装置１１０は、ユーザが指定したMRI画像上の位置から、指定されたサンプル（ボクセル）のMRI画像上での位置情報（識別番号）を割り出し、図２４で矢印で示すように、割り出された位置情報（識別番号）に相当するキーワードをもつサンプル（ボクセル）の対象データを解析テーブル１２０から検索し、そして、指定されたサンプル（ボクセル）のMRI画像上での位置と、検索されたサンプル（ボクセル）の対象データとをGUI上で可視的に関連付ける。 That is, as illustrated in FIG. 24, in the CSI spectrum analysis table 120, position information (identification number) of each sample on the MRI image is registered in advance as a keyword of each sample (voxel). The multivariate analysis apparatus 110 calculates the position information (identification number) on the MRI image of the specified sample (voxel) from the position on the MRI image specified by the user, and assigns the position information as indicated by the arrows in FIG. The target data of the sample (voxel) having the keyword corresponding to the issued position information (identification number) is searched from the analysis table 120, and the position of the specified sample (voxel) on the MRI image is searched. The target data of the sample (voxel) is visually associated on the GUI.

上記の関連付けは、例えば、次のようになされる。すなわち、図２３に例示するように、別構造データ・ウィンドウ１６４上のMRI画像上で、指定されたサンプル（ボクセル）３１２がハイライトされて表示され、かつ、サンプル・プロット・ウィンドウ１５２上で、上記のようにして検索されたサンプル（対象データ）３１４がハイライトされて表示される。また、検索されたサンプル（ボクセル）３１４のCSIスペクトル・データ（図２に示された標準構造の実態データ１１６に相当）が読み出され、それに基づいて、検索されたサンプル（ボクセル）３１４のスペクトルの画像が生成されて、それらのスペクトル画像がスペクトル・チャート・ウィンドウ１６０に表示される。 The above association is performed as follows, for example. That is, as illustrated in FIG. 23, the designated sample (voxel) 312 is highlighted and displayed on the MRI image on the separate structure data window 164, and on the sample plot window 152, The sample (target data) 314 searched as described above is highlighted and displayed. Further, CSI spectrum data (corresponding to the actual data 116 of the standard structure shown in FIG. 2) of the retrieved sample (voxel) 314 is read out, and based on this, the spectrum of the retrieved sample (voxel) 314 is obtained. Are generated and their spectral images are displayed in the spectrum chart window 160.

３．１．３ステップ３０４〜３０８：マーカ変数の検出と表示
前述した１．１．２項から１．１．４項で説明したと同様の方法で、検索されたサンプル（ボクセル）（つまり、指定されたサンプルサブ群）３１４に共通する特徴変数が抽出され、それらについて寄与率等によるフィルタリングが行われ、それにより、最終的に指定されたサンプル（ボクセル）に特異なマーカ変数が検出され、検出されたマーカ変数が図２３に例示するようにハイライトされて表示される。 3.1.3 Steps 304-308: Detection and Display of Marker Variables Samples (voxels) retrieved in the same manner as described in the above-mentioned sections 1.1.2 to 1.1.4 (ie, Feature variables common to the designated sample sub-group) 314 are extracted, and filtering is performed on the contribution variables and the like, so that marker variables specific to the finally designated sample (voxel) are detected, The detected marker variable is highlighted and displayed as illustrated in FIG.

３．２「異構造データ間の相関の検出＃２」サブ機能
図２５は、このサブ機能の処理の流れを示し、図２６は、このサブ機能の処理をGUIを用いて説明している。以下、図２５に示す各ステップを、図２６を一緒に参照しつつ説明する。 3.2 “Detection of Correlation Between Different Structure Data # 2” Sub-Function FIG. 25 shows the flow of processing of this sub-function, and FIG. 26 explains the processing of this sub-function using the GUI. The steps shown in FIG. 25 will be described below with reference to FIG.

３．２．１ステップ３２０：特定サンプルの指定
上述した３．１項の「異構造データ間の相関の検出＃１」サブ機能の場合と同様に、MRI画像とMRI画像中の多数のサンプル（ボクセル）の対象データ群（CSIヒストグラム・データ群）の多変量解析結果がGUI上に表示されている状態において、ユーザは、このGUIのサンプル・プロット・ウィンドウ１５２上で、興味ある任意のサンプル（ボクセル）３３０を指定する。 3.2.1 Step 320: Designation of Specific Samples As in the case of “Detect Correlation between Different Structure Data # 1” sub-function in section 3.1 described above, MRI images and a number of samples in MRI images ( In the state where the multivariate analysis result of the target data group (CSI histogram data group) of the voxel is displayed on the GUI, the user can select any sample ( Voxel) 330 is designated.

３．２．２ステップ３２２：異構造データの連結
多変量解析装置１１０は、上記ステップ３２０で指定されたサンプル（ボクセル）３３０の解析テーブル１２０上のキーワードと、MRI画像上での位置情報（識別番号）とを用いて、指定されたサンプル（ボクセル）の対象データの多変量解析結果と、それらのサンプル（ボクセル）のMRI画像上の位置とを相互に関連づけて連結する。 3.2.2 Step 322: Linking Different Structure Data The multivariate analysis apparatus 110 uses the keyword on the analysis table 120 of the sample (voxel) 330 specified in step 320 and the position information (identification) on the MRI image. And the position of the sample (voxel) on the MRI image are linked to each other and linked to each other.

すなわち、図２７に例示するように、指定されたサンプル（ボクセル）のキーワードを解析テーブル１２０から読み、そのキーワードに相当する位置情報（識別番号）をもつMRI画像上のサンプル（ボクセル）を割り出し、そして、指定されたサンプル（ボクセル）の対象データと、割り出されたMRI画像上のサンプル（ボクセル）とをGUI上で可視的に関連付ける。 That is, as illustrated in FIG. 27, a keyword of a designated sample (voxel) is read from the analysis table 120, and a sample (voxel) on the MRI image having position information (identification number) corresponding to the keyword is determined. Then, the target data of the designated sample (voxel) and the sample (voxel) on the determined MRI image are visually associated on the GUI.

上記の関連付けは、例えば、次のようになされる。すなわち、図２６に例示するように、サンプル・プロット・ウィンドウ１５２上で、指定されたサンプル（対象データ）３３０がハイライトされて表示され、かつ、別構造データ・ウィンドウ１６４上で、上記のようにして割り出されたMRI画像上のサンプル（ボクセル）３３２がハイライトされて表示される。また、指定されたサンプル（ボクセル）３３０に対応するCSIスペクトル・データ（図２に示された標準構造の実態データ１１６に相当）が読み出され、それに基づいて、それらのサンプル（ボクセル）のスペクトルの画像が生成されて、それらのスペクトル画像がスペクトル・チャート・ウィンドウ１６０に表示される。 The above association is performed as follows, for example. That is, as illustrated in FIG. 26, the specified sample (target data) 330 is highlighted on the sample plot window 152 and displayed as described above on the separate structure data window 164. The sample (voxel) 332 on the determined MRI image is highlighted and displayed. Further, CSI spectrum data (corresponding to the actual data 116 of the standard structure shown in FIG. 2) corresponding to the designated sample (voxel) 330 is read out, and based on this, the spectrum of those samples (voxel) is read. Images are generated and their spectral images are displayed in the spectral chart window 160.

３．２．３ステップ３２４〜３２８：マーカ変数の検出と表示
前述した１．１．２項から１．１．４項で説明したと同様の方法で、指定されたサンプル（対象データ）（指定されたサンプルサブ群）３３０に共通する特徴変数が抽出され、それらについて寄与率等によるフィルタリングが行われ、それにより、最終的に指定されたサンプル（対象データ）に特異なマーカ変数が検出され、検出されたマーカ変数が図２６に例示するようにハイライトされて表示される。 3.2.3 Steps 324 to 328: Detection and Display of Marker Variable Specified sample (target data) (specified) in the same manner as described in the above section 1.1.2 to 1.1.4 Feature variables common to the sample subgroups 330) are extracted, and filtering is performed on the contribution variables and the like, so that marker variables specific to the finally designated sample (target data) are detected, The detected marker variable is highlighted and displayed as illustrated in FIG.

３．３「異構造データ間の相関の検出＃３」サブ機能
図２８は、このサブ機能の処理の流れを示し、図２９は、このサブ機能の処理をGUIを用いて説明している。以下、図２８に示す各ステップを、図２９を一緒に参照しつつ説明する。 3.3 “Detection of Correlation between Different Structure Data # 3” Sub-Function FIG. 28 shows the flow of processing of this sub-function, and FIG. 29 explains the processing of this sub-function using the GUI. Hereinafter, each step shown in FIG. 28 will be described with reference to FIG. 29 together.

３．３．１ステップ３４０：マーカ変数の指定
上述した３．１項の「異構造データ間の相関の検出＃１」サブ機能の場合と同様に、MRI画像とMRI画像中の多数のサンプル（ボクセル）の対象データ群（CSIヒストグラム・データ群）の多変量解析結果がGUI上に表示されている状態において、ユーザは、このGUIの変数プロット・ウィンドウ１５４上で、興味ある任意の任意の１以上の変数３６０を、マーカ変数として指定する（寄与率プロット・ウィンドウ１５６またはローディング・チャート・ウィンドウ１５８でマーカ変数を指定してもよい。）。 3.3.1 Step 340: Specify Marker Variable Similar to the above-mentioned “Detect Correlation between Different Structure Data # 1” sub-function in section 3.1, MRI image and a number of samples in MRI image ( In the state where the multivariate analysis result of the target data group (CSI histogram data group) of the voxel is displayed on the GUI, the user can select any arbitrary one of interest on the variable plot window 154 of the GUI. The above variable 360 is designated as a marker variable (the marker variable may be designated in the contribution rate plot window 156 or the loading chart window 158).

３．３．２ステップ３４２〜３４６：特徴サンプルサブ群の抽出と表示
前述の１．２．２項から１．２．３項で説明されたのと同様の方法で、指定されたマーカ変数３６０に対する寄与率等によるフィルタリングが行われ、フィルタリングで絞り込まれたマーカ変数に共通する特徴サンプルサブ群３６４、３６６（マーカ変数の値が最小値に近い特徴サンプルサブ群３６４と、最大値に近い特徴サンプルサブ群３６６）が抽出され、そして、抽出された特徴サンプルサブ群３６４、３６６がサンプル・プロット・ウィンドウ１５２上でハイライトされて表示される。このとき、マーカ変数の値が最小値に近い特徴サンプルサブ群３６４と、最大値に近い特徴サンプルサブ群３６６とは、一目で区別できるよう、例えば異なるカラーで表示される。また、抽出された特徴サンプルサブ群３６４、３６６のスペクトルの画像が、スペクトル・チャート・ウィンドウ１６０上に表示される。 3.3.2 Steps 342-346: Extraction and Display of Feature Sample Subgroups The specified marker variable 360 is processed in the same manner as described in Sections 1.2.2 to 1.2.3 above. Are filtered by the contribution rate to the feature sample sub-groups 364 and 366 (feature sample sub-groups 364 whose marker variable values are close to the minimum value and feature samples close to the maximum value). Subgroup 366) is extracted, and the extracted feature sample subgroups 364, 366 are highlighted and displayed on the sample plot window 152. At this time, the feature sample subgroup 364 whose marker variable value is close to the minimum value and the feature sample subgroup 366 close to the maximum value are displayed in different colors, for example, so that they can be distinguished at a glance. Further, the spectrum images of the extracted feature sample subgroups 364 and 366 are displayed on the spectrum chart window 160.

２．３．３ステップ３４８：異構造データの連結
前述の３．２．２項で説明されたのと同様の方法で、上記ステップ３４６で抽出された特徴サンプルサブ群３６４、３６６に対応するMRI画像上の対応する特徴サンプルサブ群（ボクセルのサブ群）３６８、３７０が割り出され、そして、抽出された特徴サンプルサブ群３６４、３６６の対象データと、割り出されたMRI画像上の特徴サンプルサブ群（ボクセルのサブ群）３６８、３７０とがGUI上で可視的に関連付けられ連結される。 2.3.3 Step 348: Linking Different Structure Data MRI corresponding to the feature sample subgroups 364 and 366 extracted in step 346 in the same manner as described in the above section 3.2.2. Corresponding feature sample subgroups (voxel subgroups) 368, 370 on the image are determined, and the target data of the extracted feature sample subgroups 364, 366 and the feature samples on the determined MRI image Subgroups (subgroups of voxels) 368 and 370 are visually associated and linked on the GUI.

２．３．４ステップ３５０：別構造データ上の関連サンプルサブ群の表示
上記ステップ３４８での関連付けの結果として、MRI画像上の特徴サンプル（ボクセル）サブ群（ボクセルのサブ群）３６８、３７０が、別構造データ・ウィンドウ１６４上でハイライトされて表示される。このとき、マーカ変数の値が最小値に近い特徴サンプルサブ群３６８と、最大値に近い特徴サンプルサブ群３７０とは、一目で区別できるように例えば異なるカラーで表示される。 2.3.4 Step 350: Display of Related Sample Subgroups on Separate Structure Data As a result of the association in step 348 above, feature samples (voxel) subgroups (voxel subgroups) 368, 370 on the MRI image are , Highlighted on the separate structure data window 164 and displayed. At this time, the feature sample subgroup 368 whose marker variable value is close to the minimum value and the feature sample subgroup 370 close to the maximum value are displayed in different colors, for example, so that they can be distinguished at a glance.

以上のような「異構造データ間の相関の検出＃１」〜「異構造データ間の相関の検出＃３」のサブ機能を活用することで、ユーザは、各種スペクトル・データのように多変量解析の対象になり得る構造のデータと、画像データのように多変量解析の対象にならない構造のデータとの間の相関を可視的に認識することができ、その結果、これらの異構造のデータを補完的又は統合的に観察し分析して新しい知見を見つけ出すことが、より容易になる。 By utilizing the sub-functions of “detection of correlation between different structure data # 1” to “detection of correlation between different structure data # 3” as described above, the user can change multivariate data like various spectrum data. It is possible to visually recognize the correlation between structural data that can be analyzed and structural data that is not subject to multivariate analysis, such as image data. It is easier to discover and discover new knowledge through complementary and integrated observation and analysis.

以上、本発明の好適な一実施形態を説明したが、これは本発明の説明のための例示であり、本発明の範囲をその実施形態にのみ限定する趣旨ではない。本発明は、その要旨を逸脱することなしに、上述の実施形態とは異なる種々の態様で実施することができる。 The preferred embodiment of the present invention has been described above, but this is an example for explaining the present invention, and the scope of the present invention is not limited to the embodiment. The present invention can be implemented in various modes different from the above-described embodiments without departing from the gist thereof.

例えば、本発明は、実施形態で例示したような生命データだけでなく、それ以外の種々のデータの解析に適用できる。 For example, the present invention can be applied to analysis of various data other than life data as exemplified in the embodiment.

本発明の一実施形態に係る多変量解析装置の全体的な処理の流れを示すフローチャート。The flowchart which shows the flow of the whole process of the multivariate analyzer which concerns on one Embodiment of this invention. この多変量解析装置の基本的な構成を示すブロック図。The block diagram which shows the basic composition of this multivariate analyzer. 解析テーブル１２０のフォーマットを示す図。The figure which shows the format of the analysis table 120. FIG. 「１事象のデータの解析結果に基づく特徴変数及び相関の検出」を行う場合のGUIの例を示す図。The figure which shows the example of GUI in the case of performing "detection of the characteristic variable and correlation based on the analysis result of 1 event data." 「複数事象のデータの解析結果に基づく特徴変数及び相関の検出」を行う場合のGUIの例を示す図。The figure which shows the example of GUI in the case of performing "detection of the characteristic variable and correlation based on the analysis result of the data of multiple events". 「異構造のデータの解析結果に基づく特徴変数及び相関の検出」を行う場合のGUIの例を示す。An example of a GUI in the case of performing “detection of feature variables and correlation based on analysis result of data of different structure” is shown. 「特定サンプルの特徴変数の抽出によるマーカ変数の検出」サブ機能の処理の流れを示ずフローチャート。The flowchart which does not show the flow of a process of the "detection of a marker variable by extraction of the characteristic variable of a specific sample" subfunction. 同サブ機能の処理をGUIを用いて説明する図。The figure explaining the process of the same subfunction using GUI. 特徴変数を抽出する処理を説明するための変数値の分布を示す図。The figure which shows distribution of the variable value for demonstrating the process which extracts a characteristic variable. 「指定マーカ変数による特徴サンプルの検出」サブ機能処理の流れを示ずフローチャート。The flowchart which does not show the flow of the "detection of the feature sample by the designated marker variable" sub function processing. 同サブ機能の処理をGUIを用いて説明する図。The figure explaining the process of the same subfunction using GUI. 特徴サンプルを抽出する処理を説明するための変数値の分布を示す図。The figure which shows distribution of the variable value for demonstrating the process which extracts a feature sample. 「分析データと付加データとの相補的な解析＃１」サブ機能の処理の流れを示すフローチャート。The flowchart which shows the flow of a process of the "complementary analysis # 1 of analysis data and additional data" subfunction. 同サブ機能の処理をGUIを用いて説明する図。The figure explaining the process of the same subfunction using GUI. 「分析データと付加データとの相補的な解析＃２」サブ機能の処理の流れを示すフローチャート。The flowchart which shows the flow of a process of a "complementary analysis # 2 of analysis data and additional data" subfunction. 同サブ機能の処理をGUIを用いて説明する図。The figure explaining the process of the same subfunction using GUI. 「異事象の分析データ間の相関およびマーカ変数の検出」サブ機能の処理の流れを示すフローチャート。The flowchart which shows the flow of a process of the "detection of the correlation between the analysis data of different events, and a marker variable" subfunction. 同サブ機能の処理をGUIを用いて説明する図。The figure explaining the process of the same subfunction using GUI. 異事象間で共通性あるサンプルのデータを検索する処理を説明する図。The figure explaining the process which searches the data of the sample which is common between different events. 「指定マーカ変数における異事象での挙動の抽出」サブ機能の処理の流れをすフローチャート。The flowchart which shows the flow of a process of the "extraction of the behavior by the different event in a designated marker variable" subfunction. 同サブ機能の処理をGUIを用いて説明する図。The figure explaining the process of the same subfunction using GUI. 「異構造データ間の相関の検出＃１」サブ機能の処理の流れを示すフローチャート。The flowchart which shows the flow of a process of the "detection of correlation between different structure data # 1" subfunction. 同サブ機能の処理をGUIを用いて説明する図。The figure explaining the process of the same subfunction using GUI. 異構造データを関連付ける処理を説明する図。The figure explaining the process which associates different structure data. 「異構造データ間の相関の検出＃２」サブ機能の処理の流れを示すフローチャート。The flowchart which shows the flow of a process of the "detection of correlation between different structure data # 2" subfunction. 同サブ機能の処理をGUIを用いて説明する図。The figure explaining the process of the same subfunction using GUI. 異構造データを関連付ける処理を説明する図。The figure explaining the process which associates different structure data. 「異構造データ間の相関の検出＃３」サブ機能の処理の流れを示すフローチャート。The flowchart which shows the flow of a process of the "detection of correlation between different structure data # 3" subfunction. 同サブ機能の処理をGUIを用いて説明する図。The figure explaining the process of the same subfunction using GUI.

Explanation of symbols

１１０多変量解析装置
１１２計算・制御モジュール
１１４表示モジュール
１１５ UIからの要求
１１６標準構造の実態データ
１１８別構造の実態データ
１２０解析テーブル
１５０コントロール・パネル
１５２サンプル・プロット・ウィンドウ
１５４変数プロット・ウィンドウ
１５６寄与率チャート・ウィンドウ
１５８ローディング・チャート・ウィンドウ
１６０スペクトル・チャート・ウィンドウ
１６２ビュー・ナビゲータ・ウィンドウ
１６４別構造データ・ウィンドウ 110 Multivariate Analyzer 112 Calculation / Control Module 114 Display Module 115 UI Request 116 Actual Data of Standard Structure 118 Actual Data of Different Structure 120 Analysis Table 150 Control Panel 152 Sample Plot Window 154 Variable Plot Window 156 Contribution Rate chart window 158 Loading chart window 160 Spectrum chart window 162 View navigator window 164 Separately structured data window

Claims

Target data storage means for storing the target data group, the target data group of the sample group, each target data having a plurality of variable values;
A calculation / control unit having a multivariate analysis unit that performs multivariate analysis using the target data group; and a feature extraction unit that performs a feature extraction process using the target data group;
Display means for visually displaying the result of the multivariate analysis and the result of the feature extraction processing on a graphical user interface (hereinafter abbreviated as GUI);
The multivariate analysis means of the calculation / control means generates a plurality of multivariate analysis charts representing characteristics of the sample group and characteristics of the plurality of variables found by the multivariate analysis,
The display means displays the plurality of charts on the GUI,
The feature extraction unit of the calculation / control unit specifies an arbitrary sample sub group in the sample group or an arbitrary variable in the plurality of variables, and the specified sample sub group is based on the target data group. Variables characteristic of the group, sample subgroups characteristic of the designated variable, or other sample groups if the target data group of the other sample groups is also stored in the target data storage means Detecting a sample subgroup associated with said designated sample subgroup,
The display means includes the designated sample subgroup or the designated variable and the detected characteristic variable, or the detected characteristic sample subgroup, or the detected related sample sub. Groups are highlighted and displayed on the GUI,
Multivariate analyzer.

The multivariate analyzer according to claim 1,
The multivariate analysis performed by the multivariate analysis means includes principal component analysis,
In the plurality of multivariate analysis charts displayed on the GUI,
A. A sample plot chart in which each sample of the sample group is plotted at coordinates according to a score for a predetermined principal component in a coordinate space consisting of a predetermined principal component axis,
B. Variable plot chart in which the plurality of variables are plotted in coordinates according to loading for a predetermined principal component in a coordinate space consisting of a predetermined principal component axis;
C. A contribution chart illustrating contribution ratios of the plurality of variables for a predetermined principal component, and
D. a loading chart illustrating the sum of loadings for a given principal component of the plurality of variables;
At least two of the
The display means highlights and displays the designated sample subgroup or the detected related sample subgroup on the sample plot chart, and displays the detected marker variable in the variable plot. Highlighting and displaying on at least one of the chart, the contribution chart, and the loading chart;
Multivariate analyzer.

In the multivariate analyzer according to claim 1 or 2,
The feature extraction means of the calculation / control means is:
Sample designating means for designating an arbitrary sample subgroup from the sample group;
Marker variable detection means for detecting one or more characteristic variables common to the specified sample subgroup as marker variables from the plurality of variables based on the values of the variables of the target data group;
Have
The display means highlights and displays the designated sample subgroup and the detected marker variable on the GUI.
Multivariate analyzer.

In the multivariate analyzer according to claim 3,
The marker variable detection means obtains the distribution of the values of each variable in the entire target data group of the sample group, the distribution of the values of each variable in the entire target data group thus obtained, and the designated sample sub Detecting one or more characteristic variables common to the designated sample subgroup based on the value of each variable of the target data of the group;
Multivariate analyzer.

In the multivariate analyzer according to claim 3 or 4,
The multivariate analysis performed by the multivariate analysis means includes principal component analysis,
The marker variable detection means performs filtering based on the contribution ratio obtained by the principal component analysis on the detected marker variable, and narrows down to a marker variable with higher accuracy.
Multivariate analyzer.

In the multivariate analyzer according to any one of claims 1 to 5,
The feature extraction means of the calculation / control means is:
Marker variable designating means for designating an arbitrary variable among the plurality of variables as a marker variable;
Characteristic sample detection means for detecting one or more characteristic sample subgroups common to the designated marker variable based on the value of each variable of the target data group of the sample group;
The display means highlights and displays the designated marker variable and the detected characteristic sample subgroup on the GUI.
Multivariate analyzer.

The multivariate analyzer according to claim 6,
The feature sample detection means obtains a distribution of each value of the designated marker variable in the entire target data group, and obtains a distribution of each value of the designated marker variable in the obtained whole target data group. And detecting the characteristic sample sub-group based on each of the designated marker variables of the target data of each sample.
Multivariate analyzer.

In the multivariate analyzer according to claim 6 or 7,
The multivariate analysis performed by the multivariate analysis means includes principal component analysis,
The feature sample detection means performs filtering based on the contribution ratio obtained by the principal component analysis for the designated marker variable, and narrows down to a marker variable with higher accuracy.
Multivariate analyzer.

In the multivariate analyzer according to any one of claims 1 to 8,
Further comprising additional data storage means for storing one or more additional data groups of the sample group;
The feature extraction means of the calculation / control means is:
Additional data designating means for designating an arbitrary additional data group from the one or more additional data groups;
Characteristic sample detection means for detecting one or more characteristic sample subgroups common to the designated type of additional data group from the sample group based on the value of the designated type of additional data group; ,
Marker variable detection that detects one or more characteristic variables common to the detected characteristic sample subgroup as marker variables from the plurality of variables based on the values of the variables of the target data group Means,
Have
The display means highlights and displays the detected characteristic sample sub-group and the detected marker variable on the GUI.
Multivariate analyzer.

The multivariate analyzer according to claim 9, wherein
The feature sample detection means obtains a distribution of the value of the additional data in the entire additional data group of the specified type, obtains a distribution of the value of the additional data in the entire additional data group thus obtained, Detecting the characteristic sample subgroup based on the value of the additional data;
The marker variable detection means obtains a distribution of values of each variable in the entire target data group, and obtains a distribution of values of each variable in the obtained entire target data group and the detected characteristic sample sub Detecting one or more characteristic variables common to the designated sample subgroup based on the value of each variable of the target data of the group;
Multivariate analyzer.

In the multivariate analyzer according to any one of claims 1 to 10,
Further comprising additional data storage means for storing one or more additional data groups of the sample group;
The multivariate analysis means of the calculation / control means performs at least a principal component analysis as the multivariate analysis, and based on a result of the principal component analysis, in a coordinate space composed of one or more principal component axes, A sample plot chart in which each sample of the sample group is plotted at coordinates according to a score for a predetermined main component is created as one of the multivariate analysis charts,
The display means displays the sample plot chart on the GUI;
The feature extraction means of the calculation / control means is:
Additional data designating means for designating an arbitrary principal component axis from the predetermined one or more principal component axes and designating an optional additional data group from the one or more additional data groups;
Based on the result of the principal component analysis and the value of the specified type of additional data group, the specified principal component axis in the sample plot chart is the value of the specified type of additional data group. And a principal component axis replacement means for creating a replacement sample / plot / chart that is replaced by a coordinate axis corresponding to
The display means displays the replacement sample / plot chart on the GUI.
Multivariate analyzer.

In the multivariate analyzer according to any one of claims 1 to 11,
The target data storage means is
A first target data group of a first sample group, wherein each target data has a plurality of variable values;
A second target data group of the second sample group, each of the target data having a plurality of variable values, and storing the second target data group; and the first and second sample groups Between the samples that are common to each other,
The multivariate analysis means of the calculation / control means performs a first multivariate analysis using the first target data group and a second multivariate analysis using the second target data group, Creating one or more first charts showing the results of the first multivariate analysis and one or more second charts showing the results of the second multivariate analysis;
The display means displays the first chart and the second chart on the GUI,
The feature extraction means of the calculation / control means is:
Sample designating means for designating an arbitrary first sample sub-group from the first sample group;
An association for searching the second sample group for a second sample subgroup that is common to the designated first sample subgroup based on the association between the first and second sample groups Sample search means;
Have
The display means highlights and displays the designated first sample subgroup and the detected second sample subgroup on the GUI.
Multivariate analyzer.

The multivariate analyzer according to claim 12,
The feature extraction means of the calculation / control means is:
Based on the value of each variable of the first target data group, the plurality of the first target data group have one or more characteristic variables common to the designated first sample subgroup. First marker variable detecting means for detecting as a first marker variable from among the variables of
Based on the value of each variable of the second target data group, the plurality of the second target data group has one or more characteristic variables common to the searched second sample subgroup. A second marker variable detecting means for detecting as a second marker variable from the variables of
The display means highlights and displays the detected first and second marker variables on the GUI.
Multivariate analyzer.

In the multivariate analyzer according to any one of claims 1 to 13,
The target data storage means is
A first target data group of a first sample group, wherein each target data has a plurality of variable values;
A second target data group of the second sample group, each of the target data having a plurality of variable values, and storing the second target data group; and the first and second sample groups Between the samples that are common to each other,
The multivariate analysis means of the calculation / control means performs a first multivariate analysis using the first target data group and a second multivariate analysis using the second target data group, Creating one or more first charts showing the results of the first multivariate analysis and one or more second charts showing the results of the second multivariate analysis;
The display means displays the first chart and the second chart on the GUI,
The feature extraction means of the calculation / control means is:
Marker variable designating means for designating an arbitrary variable among the plurality of variables of the first target data group as a marker variable;
Based on the value of each variable of the first target data group, one or more characteristic first sample subgroups common to the designated marker variable are detected from the first sample group. A feature sample detection means;
An association for searching the second sample group for a second sample subgroup that is common to the detected first sample subgroup based on the association between the first and second sample groups Sample search means;
Have
The display means highlights and displays the designated marker variable, the detected first characteristic sample sub-group, and the searched second characteristic sample sub-group on the GUI. To
Multivariate analyzer.

The multivariate analyzer according to claim 14,
The feature extraction means of the calculation / control means is:
Based on the value of each variable of the second target data group, the plurality of the second target data group has one or more characteristic variables common to the searched second sample subgroup. Second marker variable detecting means for detecting as a related marker variable from among the variables of
The display means includes not only the designated marker variable, the detected first characteristic sample subgroup and the searched second characteristic sample subgroup, but also the detected related marker variable. Is also highlighted and displayed on the GUI,
Multivariate analyzer.

The multivariate analyzer according to claim 1,
Further comprising another structure data storage means for storing another structure data having a plurality of data portions associated with each sample of the sample group,
The calculation / control means further includes means for creating a separate structure data chart illustrating the separate structure data,
The display means displays not only the plurality of multivariate analysis charts but also the separate structure data chart on the GUI,
The feature extraction means of the calculation / control means is:
Data portion specifying means for specifying an arbitrary data portion from among the plurality of data portions of the separate structure data;
An associated sample detection means for detecting a sample subgroup associated with the designated data portion from the sample group;
Marker variable detection means for detecting one or more characteristic variables common to the detected sample subgroup as marker variables from the plurality of variables based on the values of the variables of the target data group;
Have
The display means highlights and displays the designated data portion, the detected sample subgroup, and the searched marker variable on the GUI.
Multivariate analyzer.

In the multivariate analyzer according to any one of claims 1 to 16,
Further comprising another structure data storage means for storing another structure data having a plurality of data portions associated with each sample of the sample group,
The calculation / control means further includes means for creating a separate structure data chart illustrating the separate structure data,
The display means displays not only the plurality of multivariate analysis charts but also the separate structure data chart on the GUI,
The feature extraction means of the calculation / control means is:
Sample designating means for designating an arbitrary sample subgroup from the sample group;
Related data portion detection means for detecting a data portion associated with the designated sample subgroup from the plurality of data portions of the separate structure data;
Marker variable detection means for detecting one or more characteristic variables common to the designated sample subgroup as marker variables from the plurality of variables based on the values of the variables of the target data group;
Have
The display means highlights and displays the designated sample subgroup, the detected data portion, and the searched marker variable on the GUI.
Multivariate analyzer.

In the multivariate analyzer according to any one of claims 1 to 17,
Further comprising another structure data storage means for storing another structure data having a plurality of data portions associated with each sample of the sample group,
The calculation / control means further includes means for creating a separate structure data chart illustrating the separate structure data,
The display means displays not only the plurality of multivariate analysis charts but also the separate structure data chart on the GUI,
The feature extraction means of the calculation / control means is:
Marker variable designation means for designating an arbitrary variable as a marker variable from among the plurality of variables;
Feature sample detection means for detecting one or more characteristic sample subgroups common to the designated marker variable from the sample group based on the values of the variables of the target data group;
Related data portion detection means for detecting a data portion associated with the detected characteristic sample subgroup from the plurality of data portions of the different structure data;
Have
The display means highlights and displays the designated marker variable, the detected characteristic sample subgroup, and the detected data portion on the GUI.
Multivariate analyzer.

Target data storage means for storing the target data group, the target data group of the sample group, each target data having a plurality of variable values;
A calculation / control unit having a multivariate analysis unit that performs multivariate analysis using the target data group; and a feature extraction unit that performs a feature extraction process using the target data group;
Display means for visually displaying the result of the multivariate analysis and the result of the feature extraction processing on a graphical user interface (hereinafter abbreviated as GUI);
The multivariate analysis means of the calculation / control means generates a plurality of multivariate analysis charts representing characteristics of the sample group and characteristics of the plurality of variables found by the multivariate analysis,
The display means displays the plurality of charts on the GUI,
The feature extraction unit of the calculation / control unit specifies an arbitrary sample sub group in the sample group or an arbitrary variable in the plurality of variables, and the specified sample sub group is based on the target data group. Variables characteristic of the group, sample subgroups characteristic of the designated variable, or other sample groups if the target data group of the other sample groups is also stored in the target data storage means Detecting a sample subgroup associated with said designated sample subgroup,
The display means includes the designated sample subgroup or the designated variable and the detected characteristic variable, or the detected characteristic sample subgroup, or the detected related sample sub. Groups are highlighted and displayed on the GUI,
A computer program that causes a computer to function as a multivariate analyzer.