JP5415476B2

JP5415476B2 - NMR data processing apparatus and method

Info

Publication number: JP5415476B2
Application number: JP2011095717A
Authority: JP
Inventors: 慶子平川; 和紀有福; 正子藤原
Original assignee: Jeol Resonance Inc
Current assignee: Jeol Resonance Inc
Priority date: 2005-05-02
Filing date: 2011-04-22
Publication date: 2014-02-12
Anticipated expiration: 2025-09-21
Also published as: JP2011141298A

Description

本発明は、NMR(核磁気共鳴）データを処理するための装置及び方法に関わり、特に、例えばメタボローム解析（メタボノミクスまたはメタボロミクス）のように多数のNMRスペクトルの多変量解析を行なう用途に好適なものである。 The present invention relates to an apparatus and method for processing NMR (nuclear magnetic resonance) data, and is particularly suitable for use in multivariate analysis of a large number of NMR spectra, such as metabolomic analysis (metabonomics or metabolomics). Is.

近年、ゲノムやプロテオーム科学の進展に伴って、代謝物全体（メタボローム又はメタボノーム）を網羅的に解析するメタボノミクス（メタボロミクスともいう）が注目を集めている。特に、NMR装置により測定される多数の生体サンプルのプロトンスペクトル（NMRスペクトル）を用い、多変量解析によってサンプル間のパターン分類を行うことで、例えば病態や未病に対する知見を得ようとする解析方法が、メタボノミクスにおいて採用される。 In recent years, with the progress of genome and proteome science, metabonomics (also referred to as metabolomics) that comprehensively analyzes the entire metabolite (metabolome or metabonome) has attracted attention. In particular, analysis methods for obtaining knowledge about disease states and non-diseases, for example, by performing pattern classification between samples by multivariate analysis using proton spectra (NMR spectra) of a large number of biological samples measured by an NMR apparatus Are employed in metabonomics.

すなわち、生体サンプルのプロトンスペクトルは、多種の低分子代謝物や生体高分子化合物を含むので多くのピークが重なり複雑なパターンを示す。また、微弱なピークも多くて、それをノイズと区別することが難しい。そのため、プロトンスペクトルに含まれる個々のピークを同定し構造解析することが困難である。そこで、メタボノミクスにおいては、多数の生体サンプルのプロトンスペクトルを測定し、測定された多数のプロトンスペクトルのデータを用いて多変量解析のような統計的解析を行ない、生体サンプル間の構成成分種や比率の相違をパターン（特徴量）として抽出するという解析方法が採用される。この方法により、代謝物の質的相違を捉えたり、経時変化を検出したりすることが可能になる。 That is, the proton spectrum of a biological sample contains a variety of low-molecular metabolites and biopolymer compounds, so that many peaks overlap and show a complicated pattern. Moreover, there are many weak peaks, and it is difficult to distinguish them from noise. Therefore, it is difficult to identify and analyze the individual peaks included in the proton spectrum. Therefore, in metabonomics, the proton spectra of a large number of biological samples are measured, and statistical analysis such as multivariate analysis is performed using a large number of measured proton spectrum data. An analysis method of extracting a difference in ratio as a pattern (feature amount) is adopted. This method makes it possible to detect qualitative differences in metabolites and to detect changes over time.

ところで、メタボノミクスを使用した薬物発見、疾患の処置及び診断のための方法が特許文献１に開示されている。ただし、特許文献１には、NMRスペクトルに関する開示はない。また、特許文献２及び３には、メタボノミクスのためのNMRスペクトルの処理に関する技術が開示されている。また、化学データの多変量解析の種々の手法が、非特許文献１に開示されている。 Incidentally, Patent Document 1 discloses a method for drug discovery, treatment of disease and diagnosis using metabonomics. However, Patent Document 1 does not disclose an NMR spectrum. Patent Documents 2 and 3 disclose techniques relating to processing of NMR spectra for metabonomics. Non-patent literature 1 discloses various methods for multivariate analysis of chemical data.

特表２００３−５３０１３０号公報Special table 2003-530130 gazette 特表２００４−５２６１３０号公報Special table 2004-526130 gazette 特表２００４−５３８５５９号公報JP-T-2004-538559 「コンピュータ・ケミストリーシリーズ３ケモメトリックス化学パターン認識と多変量解析」宮下芳・佐々木慎一著、共立出版“Computer Chemistry Series 3 Chemometrics Chemical Pattern Recognition and Multivariate Analysis” by Miyashita Yoshi and Shinichi Sasaki, Kyoritsu Shuppan

一般的なメタボノミクスのプロセスは、NMR装置から出力される多数のサンプルのFID(Free Induction Decay：自由誘導減衰）データを得ること、多数のサンプルのFID信号をそれぞれNMRスペクトルに変換すること、及び、多数のサンプルのNMRスペクトルに関して多変量解析に代表される統計処理を行なうことが含まれる。メタボノミクスのプロセスに含まれる複雑なNMRスペクトルデータの処理は、通常、コンピュータにより実行されるが、信頼できる処理結果を得るためには、FID信号から得られるNMRスペクトルの精度を向上し、かつ、多数のNMRスペクトルを統計処理プログラムで効率よく分析できるようにするためのデータ加工技術が必要である。 A typical metabonomic process is to obtain FID (Free Induction Decay) data for a large number of samples output from the NMR instrument, to convert the FID signals of a large number of samples to NMR spectra, respectively, and , Including performing statistical processing represented by multivariate analysis on the NMR spectra of a large number of samples. Processing of complex NMR spectral data included in the metabonomics process is usually performed by a computer, but in order to obtain a reliable processing result, the accuracy of the NMR spectrum obtained from the FID signal is improved, and There is a need for data processing techniques to enable a large number of NMR spectra to be analyzed efficiently with a statistical processing program.

NMRスペクトルの精度を向上させるための従来の処理技術として、位相補正とベースライン補正がある。位相補正とベースライン補正は、いずれも、コンピュータにより自動的に行い得るが、しかし最終的には、人が手動で、コンピュータの表示スクリーンに表示されたNMRスペクトルのグラフを見ながら、補正パラメータを最も適当と思われる値に調整するための操作をコンピュータに加えるという方法で行われる。しかし、この従来の補正方法は、長い作業時間を要して非効率的であるとともに、補正結果の適正レベルが人に依存するので常に最適なNMRスペクトルを得ることが難しい。 Conventional processing techniques for improving the accuracy of NMR spectra include phase correction and baseline correction. Both phase correction and baseline correction can be performed automatically by the computer, but eventually the correction parameters can be adjusted manually by a person looking at the graph of the NMR spectrum displayed on the computer display screen. This is done by adding an operation to the computer to adjust the value to the most suitable value. However, this conventional correction method requires a long working time and is inefficient, and the appropriate level of the correction result depends on the person, so that it is difficult to always obtain an optimum NMR spectrum.

また、NMRスペクトルを統計処理プログラムで効率よく分析できるようにするために、従来、NMRスペクトルを統計処理プログラムにインプットする前に、NMRスペクトルにバケット積分を適用して、これをデータ量のより小さいヒストグラムに縮約するという処理が行われている。すなわち、NMRスペクトルのケミカルシフト軸（周波数軸）に沿った観測範囲が、一定幅(典型的には０．０４ppm）の刻みで多数のバケット（ケミカルシフトの小区間）に分割され、バケット毎にNMRスペクトルの強度の積分（バケット積分）が行われ、その結果として、多数のバケットの積分値のセットからなるヒストグラムが、NMRスペクトルの縮約として得られる。各サンプルのNMRスペクトルがヒストグラムに変換されるので、全体でサンプル数×バケット数に相当する数の積分値が得られ、それらの積分値が統計処理プログラムにインプットされる。 In addition, in order to make it possible to analyze NMR spectra efficiently with a statistical processing program, conventionally, before inputting an NMR spectrum into a statistical processing program, bucket integration is applied to the NMR spectrum, which is smaller in data volume. A process of reducing to a histogram is performed. In other words, the observation range along the chemical shift axis (frequency axis) of the NMR spectrum is divided into a large number of buckets (chemical shift subsections) in steps of a constant width (typically 0.04 ppm). Integration of the intensity of the NMR spectrum (bucket integration) is performed, and as a result, a histogram consisting of a set of integral values of multiple buckets is obtained as a reduction of the NMR spectrum. Since the NMR spectrum of each sample is converted into a histogram, a total number of integral values corresponding to the number of samples × the number of buckets is obtained, and these integral values are input to the statistical processing program.

しかしながら、従来のバケット積分の処理方法で得られるヒストグラムは、必ずしも、NMRスペクトルの特徴を十分に良く反映したものではない。すなわち、NMRスペクトル上の重要な幾つかの強度ピークが、ヒストグラム上では矮小化又は稀釈化されてしまう場合がある。この現象は、バケット積分のプロセスにおいて、一つの強度ピークが分布するケミカルシフト領域が複数のバケットに分割され、その結果、その強度ピークが複数の小さい積分値に分散されるときに生じる。 However, the histogram obtained by the conventional bucket integration processing method does not necessarily reflect the characteristics of the NMR spectrum sufficiently well. That is, some important intensity peaks on the NMR spectrum may be dwarfed or diluted on the histogram. This phenomenon occurs when a chemical shift region in which one intensity peak is distributed is divided into a plurality of buckets in the bucket integration process, and as a result, the intensity peak is dispersed into a plurality of small integrated values.

上述した位相補正とベースライン補正に関わる問題、及びバケット積分に関わる問題に対しては、特許文献１−３はいずれも、格別有用な技術を提供していない。これらの問題は、メタボノミクスのためのNMRスペクトル処理だけに限らず、他の用途のためのNMRスペクトル処理においても同様に存在する。 None of Patent Documents 1-3 provide a particularly useful technique for the above-described problems related to phase correction and baseline correction, and problems related to bucket integration. These problems exist not only in NMR spectral processing for metabonomics, but also in NMR spectral processing for other applications.

また、統計処理プログラムにより行なわれる多変量解析の処理については、上述した特許文献には格別の提案はない。 In addition, regarding the multivariate analysis processing performed by the statistical processing program, there is no particular proposal in the above-described patent document.

ここで、非特許文献１に開示されているように、多変量解析と呼ばれる分析手法には多くの種類がある。代表的な分析手法の一つはPCA（Principal component Analysis：主成分分析）である。PCAでは、サンプルの成分である多数の変数の座標軸からなる多次元空間中で、サンプル間の相違（分散）が最も顕著に現れる少数本（例えば３本程度）の新しい座標軸が定義され、そして、新しい座標軸上での各サンプルの座標値が計算される。ここで、その新しい座標軸は「主成分軸」と呼ばれ、各主成分軸に沿った変数は「主成分」と呼ばれ、そして、各サンプルの各主成分軸上の座標値は「スコア」と呼ばれる。各主成分は、多数の変数の線形一次式で定義され、その線形一次式内の各変数項の係数は「ローディング」と呼ばれる。ある主成分についての各変数のローディングは、その主成分に対する各変量の寄与度合いつまり重みを表す。 Here, as disclosed in Non-Patent Document 1, there are many types of analysis methods called multivariate analysis. One typical analysis method is PCA (Principal component Analysis). In PCA, in a multidimensional space consisting of coordinate axes of a large number of variables that are components of a sample, a few (for example, about 3) new coordinate axes in which the difference (dispersion) between samples is most prominent are defined, and The coordinate value of each sample on the new coordinate axis is calculated. Here, the new coordinate axis is called “principal component axis”, the variable along each principal component axis is called “principal component”, and the coordinate value on each principal component axis of each sample is “score” Called. Each principal component is defined by a linear linear expression of a large number of variables, and the coefficient of each variable term in the linear linear expression is called “loading”. The loading of each variable for a certain principal component represents the degree of contribution, that is, the weight of each variable to that principal component.

このようなPCAの結果に基づいて、主成分空間上に多数のサンプルをプロットすることにより、それらサンプルの性格を視覚的に把握することが容易になる。また、主成分空間上でのサンプルの配置や相互間の距離などを利用して、多数のサンプルのクラス分けを行なうことができる。 By plotting a large number of samples on the principal component space based on such PCA results, it becomes easy to visually grasp the characteristics of these samples. In addition, it is possible to classify a large number of samples by using the arrangement of samples in the principal component space and the distance between them.

また、PCAを応用したSIMCA（Soft Independent Modeling of Class Analogy：部分空間法）も、メタボノミクスにおいて有用であると考えられる。SIMCAでは、異なるクラスのサンプルに対してクラス毎にPCAを行ない、クラス毎に主成分を決定し、そして、或る主成分がそれらのクラスを分類するためにどの程度有効であるかを評価することができる。 Also, SIMCA (Soft Independent Modeling of Class Analogy) using PCA is considered useful in metabonomics. SIMCA performs PCA on different classes of samples for each class, determines principal components for each class, and evaluates how effective a principal component is to classify those classes be able to.

このようなPCA及びSIMCAの解析結果を巧く利用することにより、サンプルから得られる知見を有効に生かして、例えばマーカ変数の検出やその他の目的をより容易且つ精度良く達成できるようになることが期待される。 By skillfully using such PCA and SIMCA analysis results, it will be possible to more easily and accurately achieve, for example, marker variable detection and other purposes by effectively utilizing the knowledge obtained from the sample. Be expected.

しかしながら、上記の期待を満足させるために、特に、メタボノミクスのような応用分野で高い有用性を発揮するために、PCA及びSIMCAを具体的にどのように実行し、どのように関連させ、そして、その解析結果をどのようなにユーザに対して表示するかというような活用技術については、まだ満足できるものが従来提供されていない。そのため、多変量解析は、統計数学の専門家以外の人々にとってはまだ敷居が高く、メタボノミクスなどの応用分野で、期待されながらも、その真価を発揮するほどには十分に活用されていないというのが現状である。 However, in order to meet the above expectations, in particular how PCA and SIMCA are specifically implemented, how they relate, and in order to be highly useful in applications such as metabonomics, and As for the utilization technique such as how to display the analysis result to the user, there has not been provided a satisfactory technique. Therefore, multivariate analysis is still a threshold for non-statistical mathematicians, and is expected to be used in application fields such as metabonomics, but is not fully utilized to demonstrate its true value. is the current situation.

従って、本発明の目的は、NMRスペクトルの解析処理の精度を高めることにある。 Therefore, an object of the present invention is to increase the accuracy of NMR spectrum analysis processing.

別の目的は、NMRスペクトルの解析処理の効率を高めることにある。 Another object is to increase the efficiency of the NMR spectrum analysis process.

別の目的は、NMRスペクトルをバケット積分によりヒストグラムに縮約するとき、NMRスペクトルのピークの特徴がヒストグラムに良く反映されるようにすることにある。 Another object is to make the characteristics of NMR spectrum peaks well reflected in the histogram when the NMR spectrum is reduced to a histogram by bucket integration.

さらに別の目的は、多数サンプルのNMRスペクトルの解析において、多変量解析を有効に活かすための処理技術を提供することにある。 Yet another object is to provide a processing technique for effectively utilizing multivariate analysis in the analysis of NMR spectra of a large number of samples.

さらにまた別の目的は、メタボノミクスなどの応用分野において、多変量解析処理を活用して、より精度の高い結果又は結論を導くことを容易にすることにある。 Yet another object is to facilitate the use of multivariate analysis processes to derive more accurate results or conclusions in application fields such as metabonomics.

本発明の一つの側面に従えば、NMRデータの処理装置は、サンプルのNMR特性を示す対象スペクトルデータを生成するスペクトル生成手段と、前記対象スペクトルデータに対し、不均等な幅をもつ多数のバケットからなるバケットセットを用いたバケット積分を実行することにより、前記対象スペクトルデータをヒストグラムデータに縮約するデータ縮約手段と、前記ヒストグラムデータを記憶又は出力する手段とを備える。 According to one aspect of the present invention, an NMR data processing apparatus includes a spectrum generating unit that generates target spectrum data indicating NMR characteristics of a sample, and a plurality of buckets having an unequal width with respect to the target spectrum data. And a means for storing or outputting the histogram data. The data reduction means reduces the target spectrum data to histogram data by executing bucket integration using a bucketset consisting of:

対象スペクトルデータのバケット積分を実行する際、適切に設定された不均等な幅をもつバケットセットを用いることにより、対象スペクトルデータがもつ重要なピークの情報を良好に維持したまま、対象スペクトルデータをバケット積分データのセットであるヒストグラムに縮約できる。そのため、そのヒストグラムを用いて行われるスペクトル解析の精度が向上する。 When performing bucket integration of the target spectrum data, by using a bucket set with an appropriately set unequal width, the target spectrum data can be stored while maintaining the important peak information of the target spectrum data well. It can be reduced to a histogram that is a set of bucket integral data. Therefore, the accuracy of spectrum analysis performed using the histogram is improved.

好適な実施形態では、対象スペクトルデータがもつ重要なピークの情報を良好に維持するために、指定された１以上のピーク領域のいずれもが複数バケットに分割されないように、不均等幅のバケットセットが設定されている。さらに、対象スペクトルデータから検出された１以上のピーク領域のいずれもが複数バケットに分割されないように、不均等幅のバケットセットが設定されている。 In a preferred embodiment, a non-uniform width bucket set is used so that none of the specified one or more peak areas is divided into multiple buckets in order to maintain the important peak information of the target spectrum data well. Is set. Furthermore, a non-uniform width bucket set is set so that none of the one or more peak areas detected from the target spectrum data is divided into a plurality of buckets.

好適な実施形態では、上記のような不均等幅のバケットセットを自ら設定するバケット設定手段が、NMRデータの処理装置に備えられる。このバケット設定手段は、複数の対象スペクトルデータを入力し、それらの複数の対象スペクトルデータが投影された投影スペクトルデータを生成し、投影スペクトルデータからピーク領域を検出し、そして、検出された１以上のピーク領域のいずれもが複数バケットに分割されないようにして各バケットを設定する。さらに、このバケット設定手段は、１以上のピーク領域を指定し、指定された１以上のピーク領域のいずれもが複数バケットに分割されないようにして各バケットを設定する。さらに、このバケット設定手段は、オペレータから入力された要求に応答して、各バケットを修正することもできる。 In a preferred embodiment, the NMR data processing apparatus is provided with bucket setting means for setting a bucket set having the non-uniform width as described above. The bucket setting means inputs a plurality of target spectrum data, generates projection spectrum data obtained by projecting the plurality of target spectrum data, detects a peak region from the projection spectrum data, and detects one or more detected ones or more Each bucket is set so that none of the peak areas are divided into a plurality of buckets. Further, the bucket setting means specifies one or more peak areas, and sets each bucket so that none of the specified one or more peak areas is divided into a plurality of buckets. Further, the bucket setting means can modify each bucket in response to a request input from the operator.

好適な実施形態では、上記対象スペクトルとして、サンプルのFIDデータから得られたNMRスペクトルデータに絶対値微分を施した結果である絶対値微分スペクトルデータが用いられる。ここで、NMRスペクトルデータの絶対値微分とは、NMRスペクトルの実部の周波数（ケミカルシフト）微分の二乗と、虚部の周波数（ケミカルシフト）微分の二乗との和の平方根である。このような絶対値微分スペクトルデータを用いてスペクトル解析を行うことで、面倒な位相補正やベースライン補正を行うことなしに、効率的に精度の良い解析結果を得ることが可能になる。 In a preferred embodiment, absolute value differential spectrum data, which is a result of performing absolute value differentiation on NMR spectrum data obtained from FID data of a sample, is used as the target spectrum. Here, the absolute value differentiation of NMR spectrum data is the square root of the sum of the square of the frequency (chemical shift) derivative of the real part of the NMR spectrum and the square of the frequency (chemical shift) derivative of the imaginary part. By performing spectrum analysis using such absolute value differential spectrum data, it is possible to efficiently obtain an accurate analysis result without performing troublesome phase correction and baseline correction.

本発明の別の側面に従うNMRデータの処理装置は、サンプルのFIDデータを取得する手段と、前記FIDデータからNMRスペクトルデータを生成する手段と、前記NMRスペクトルデータに絶対値微分を施して絶対値微分スペクトルデータを生成する手段と、前記絶対値微分スペクトルデータを記憶又は出力する手段とを備える。 An apparatus for processing NMR data according to another aspect of the present invention comprises means for acquiring FID data of a sample, means for generating NMR spectrum data from the FID data, and performing absolute value differentiation on the NMR spectrum data to obtain an absolute value. Means for generating differential spectrum data, and means for storing or outputting the absolute value differential spectrum data.

本発明のまた別の側面に従うNMRスペクトルの統計処理装置は、複数のサンプルのNMRヒストグラムデータからなる多変量データ行列を入力する多変量データ行列入力手段と、入力された前記多変量データ行列についてPCA解析計算を行なうPCA解析手段と、前記PCA解析計算の結果を受けて、複数種のPCA解析結果チャートをユーザインタフェース画面に表示するPCA解析結果表示手段と、前記ユーザインタフェース画面を用いてユーザ要求を入力するユーザ要求入力手段と、入力された前記ユーザ要求が1以上のサンプルを選択するものである場合、前記選択されたサンプルのNMRスペクトルのデータを入力して、入力された前記NMRスペクトルをケミカルシフト軸上に表したNMRスペクトルチャートを前記ユーザインタフェース画面に表示するNMRスペクトル連携手段とを備える。 According to still another aspect of the present invention, there is provided a statistical processing apparatus for NMR spectra, a multivariate data matrix input means for inputting a multivariate data matrix composed of NMR histogram data of a plurality of samples, and a PCA for the input multivariate data matrix. PCA analysis means for performing analysis calculation, PCA analysis result display means for receiving a plurality of types of PCA analysis result charts on a user interface screen in response to the result of the PCA analysis calculation, and a user request using the user interface screen When the user request input means for inputting and the input user request is to select one or more samples, the NMR spectrum data of the selected sample is input, and the input NMR spectrum is chemically converted. An NMR spectrum sequence for displaying an NMR spectrum chart on the shift axis on the user interface screen. And carrying means.

前記PCA解析結果チャートには少なくとも、複数の主成分に対する前記サンプルのスコアを表したスコアプロットと、変数たる所定の複数のケミカルシフト値の前記複数の主成分に対するローディングを表したローディングプロットと、前記所定の複数のケミカルシフト値の寄与率又はローディングをケミカルシフト軸上に表した寄与率／ローディングチャートとが含まれる。前記寄与率／ローディングチャートと前記NMRスペクトルチャートとは、それぞれの前記ケミカルシフト軸のスケールが一致するようにして、前記ケミカルシフト軸に直交する方向に並べられて表示されるようになっている。 The PCA analysis result chart includes at least a score plot that represents the score of the sample with respect to a plurality of principal components, a loading plot that represents loading with respect to the plurality of principal components of a predetermined plurality of chemical shift values as variables, and A contribution ratio / loading chart representing the contribution ratio or loading of a plurality of predetermined chemical shift values on the chemical shift axis is included. The contribution ratio / loading chart and the NMR spectrum chart are arranged and displayed in a direction perpendicular to the chemical shift axis so that the scales of the chemical shift axes coincide with each other.

好適な実施形態では、入力された前記ユーザ要求が前記寄与率／ローディングチャートを拡大するものである場合、前記PCA解析結果表示手段と前記NMRスペクトル連携手段が、前記寄与率／ローディングチャートと前記NMRスペクトルチャートとの間の前記ケミカルシフト軸のスケールの一致が保持されるようにして、前記寄与率／ローディングチャートと前記NMRスペクトルチャートとを拡大して表示する。 In a preferred embodiment, when the input user request is to enlarge the contribution ratio / loading chart, the PCA analysis result display means and the NMR spectrum cooperation means include the contribution ratio / loading chart and the NMR spectrum. The contribution ratio / loading chart and the NMR spectrum chart are enlarged and displayed so that the coincidence of the scale of the chemical shift axis with the spectrum chart is maintained.

また、好適な実施形態では、入力された前記ユーザ要求が選択された1以上のサンプルを削除するものである場合、前記PCA解析手段が、前記削除されたサンプル以外のサンプルのNMRヒストグラムデータからなる多変量データ行列についてPCA解析計算を再度実行し、前記PCA解析結果表示手段が、再度実行された前記PCA解析計算の結果を受けて、前記PCA解析結果チャートを改めて表示する。 Further, in a preferred embodiment, when the input user request is to delete one or more selected samples, the PCA analysis means comprises NMR histogram data of samples other than the deleted samples. The PCA analysis calculation is executed again for the multivariate data matrix, and the PCA analysis result display means receives the result of the PCA analysis calculation executed again and displays the PCA analysis result chart again.

また、好適な実施形態では、入力された前記ユーザ要求が選択された1以上の変数たるケミカルシフト値を削除するものである場合、前記PCA解析手段が、前記削除されたケミカルシフト値以外のケミカルシフト値を用いてPCA解析計算を再度実行し、前記PCA解析結果表示手段が、再度実行された前記PCA解析計算の結果を受けて、前記PCA解析結果チャートを改めて表示する。 Further, in a preferred embodiment, when the input user request is to delete the selected chemical shift value as one or more variables, the PCA analysis means includes a chemical other than the deleted chemical shift value. The PCA analysis calculation is executed again using the shift value, and the PCA analysis result display means receives the result of the PCA analysis calculation executed again and displays the PCA analysis result chart again.

本発明のさらにまた別の側面に従うNMRスペクトルの統計処理装置は、数のクラスに割り当てられた複数のサンプルのNMRヒストグラムデータからなる複数クラスの多変量データ行列を入力する多変量データ行列入力手段と、入力された前記複数クラスの多変量データ行列についてSIMCA解析計算を行なうSIMCA解析手段と、前記SIMCA解析計算の結果を受けて、複数種のSIMCA解析結果チャートをユーザインタフェース画面に表示するSIMCA解析結果表示手段と、前記ユーザインタフェース画面を用いてユーザ要求を入力するユーザ要求入力手段と、入力された前記ユーザ要求が1以上のサンプルを選択するものである場合、前記選択されたサンプルのNMRスペクトルのデータを入力して、入力された前記NMRスペクトルをケミカルシフト軸上に表したNMRスペクトルチャートを前記ユーザインタフェース画面に表示するNMRスペクトル連携手段とを備える。 An NMR spectrum statistical processing apparatus according to still another aspect of the present invention comprises a multivariate data matrix input means for inputting a multivariate data matrix of a plurality of classes composed of NMR histogram data of a plurality of samples assigned to a number of classes. SIMCA analysis means for performing SIMCA analysis calculation for the input multivariate data matrix of the plurality of classes, and SIMCA analysis results for receiving a result of the SIMCA analysis calculation and displaying a plurality of types of SIMCA analysis result charts on a user interface screen Display means, user request input means for inputting a user request using the user interface screen, and when the input user request is to select one or more samples, the NMR spectrum of the selected sample Enter the data, and the NMR spectrum showing the entered NMR spectrum on the chemical shift axis. And a NMR spectrum linkage means for displaying the chart on the user interface screen.

前記SIMCA解析結果チャートには少なくとも、前記複数のクラスに対する前記サンプルの距離を表したクーマンプロットと、変数たる所定の複数のケミカルシフト値のモデリング力又は識別力をケミカルシフト軸上に表したモデリング力／識別力チャートとが含まれる。前記モデリング力／識別力チャートと前記NMRスペクトルチャートとは、それぞれの前記ケミカルシフト軸のスケールが一致するようにして、前記ケミカルシフト軸に直交する方向に並べられて表示されるようになっている。 The SIMCA analysis result chart includes at least a Cooman plot representing the distance of the sample with respect to the plurality of classes, and a modeling ability representing the modeling power or discriminating power of a predetermined plurality of chemical shift values as variables on the chemical shift axis. / Discrimination power chart is included. The modeling force / discriminating power chart and the NMR spectrum chart are arranged and displayed in a direction orthogonal to the chemical shift axis so that the scales of the chemical shift axes coincide with each other. .

好適な実施形態では、入力された前記ユーザ要求が前記モデリング力／識別力チャートを拡大するものである場合、前記SIMCA解析結果表示手段と前記NMRスペクトル連携手段が、前記モデリング力／識別力チャートと前記NMRスペクトルチャートとの間の前記ケミカルシフト軸のスケールの一致が保持されるようにして、前記モデリング力／識別力チャートと前記NMRスペクトルチャートとを拡大して表示する。 In a preferred embodiment, when the input user request is for enlarging the modeling power / discrimination power chart, the SIMCA analysis result display means and the NMR spectrum cooperation means include the modeling power / discrimination power chart and The modeling power / discriminating power chart and the NMR spectrum chart are enlarged and displayed so that the coincidence of the scale of the chemical shift axis with the NMR spectrum chart is maintained.

また、好適な実施形態では、入力された前記ユーザ要求が選択された1以上のサンプルを削除するものである場合、前記SIMCA解析手段が、前記削除されたサンプル以外のサンプルのNMRヒストグラムデータからなる複数クラスの多変量データ行列についてSIMCA解析計算を再度実行し、前記SIMCA解析結果表示手段が、再度実行された前記SIMCA解析計算の結果を受けて、前記SIMCA解析結果チャートを改めて表示する。 Also, in a preferred embodiment, when the input user request is to delete one or more selected samples, the SIMCA analysis means comprises NMR histogram data of samples other than the deleted samples. The SIMCA analysis calculation is executed again for a plurality of classes of multivariate data matrices, and the SIMCA analysis result display means receives the result of the SIMCA analysis calculation executed again and displays the SIMCA analysis result chart again.

また、好適な実施形態では、入力された前記ユーザ要求が、所定の閾値より前記識別力又はモデリング力が高い変数たるケミカルシフト値を選択するとともに再計算を要求するものである場合、前記SIMCA解析手段が、前記選択されたケミカルシフト値だけを用いてSIMCA解析計算を再度実行し、前記SIMCA解析結果表示手段が、再度実行された前記SIMCA解析計算の結果を受けて、前記SIMCA解析結果チャートを改めて表示する。 In a preferred embodiment, the SIMCA analysis is performed when the input user request is to select a chemical shift value that is a variable having the discriminating power or modeling power higher than a predetermined threshold and to request recalculation. The means re-executes the SIMCA analysis calculation using only the selected chemical shift value, and the SIMCA analysis result display means receives the result of the re-executed SIMCA analysis calculation, and displays the SIMCA analysis result chart. Display again.

本発明のまた更に別の側面に従うNMRスペクトルの統計処理装置は、複数のサンプルのNMRヒストグラムデータからなる多変量データ行列についてPCA解析計算を行なうPCA解析手段と、前記PCA解析計算の結果を受けて、複数種のPCA解析結果チャートをユーザインタフェース画面に表示するPCA解析結果表示手段と、複数のクラスに割り当てられた複数のサンプルのNMRヒストグラムデータからなる複数クラスの多変量データ行列を入力する多変量データ行列入力手段と、入力された前記複数クラスの多変量データ行列についてSIMCA解析計算を行なうSIMCA解析手段と、前記SIMCA解析計算の結果を受けて、複数種のSIMCA解析結果チャートをユーザインタフェース画面に表示するSIMCA解析結果表示手段と、前記ユーザインタフェース画面を用いてユーザ要求を入力するユーザ要求入力手段とを備える。 According to still another aspect of the present invention, there is provided a statistical processing apparatus for NMR spectra, PCA analysis means for performing PCA analysis calculation on a multivariate data matrix composed of NMR histogram data of a plurality of samples, and receiving the result of the PCA analysis calculation. , PCA analysis result display means to display multiple types of PCA analysis result charts on the user interface screen, and multivariate to input multiclass multivariate data matrix consisting of NMR histogram data of multiple samples assigned to multiple classes Data matrix input means, SIMCA analysis means for performing a SIMCA analysis calculation on the input multivariate data matrix of the plurality of classes, and receiving a result of the SIMCA analysis calculation, a plurality of types of SIMCA analysis result charts on a user interface screen The user request is input using the SIMCA analysis result display means to display and the user interface screen. User request input means.

前記PCA解析結果チャートには少なくとも、複数の主成分に対する前記サンプルのスコアを表したスコアプロットと、変数たる所定の複数のケミカルシフト値の前記複数の主成分に対するローディングを表したローディングプロットと、前記所定の複数のケミカルシフト値の寄与率又はローディングをケミカルシフト軸上に表した寄与率／ローディングチャートとが含まれる。前記SIMCA解析結果チャートには少なくとも、前記複数のクラスに対する前記サンプルの距離を表したクーマンプロットと、変数たる所定の複数のケミカルシフト値のモデリング力又は識別力をケミカルシフト軸上に表したモデリング力／識別力チャートとが含まれる。 The PCA analysis result chart includes at least a score plot that represents the score of the sample with respect to a plurality of principal components, a loading plot that represents loading with respect to the plurality of principal components of a predetermined plurality of chemical shift values as variables, and A contribution ratio / loading chart representing the contribution ratio or loading of a plurality of predetermined chemical shift values on the chemical shift axis is included. The SIMCA analysis result chart includes at least a Cooman plot representing the distance of the sample with respect to the plurality of classes, and a modeling ability representing the modeling power or discriminating power of a predetermined plurality of chemical shift values as variables on the chemical shift axis. / Discrimination power chart is included.

前記ユーザ要求入力手段により入力された前記ユーザ要求が、前記モデリング力／識別力チャートに表示された前記識別力又はモデリング力が所定閾値より高い変数たるケミカルシフト値を選択するとともにPCA解析計算の再実行を要求するものである場合、前記PCA解析手段が、前記選択されたケミカルシフト値だけを用いてPCA解析計算を再度実行し、前記PCA解析結果表示手段が、再度実行された前記PCA解析計算の結果を受けて、前記PCA解析結果チャートを改めて表示する。 The user request input by the user request input means selects a chemical shift value that is a variable in which the discriminating power or modeling power displayed in the modeling power / discriminating power chart is higher than a predetermined threshold and re-executes the PCA analysis calculation. In the case of requesting execution, the PCA analysis means executes the PCA analysis calculation again using only the selected chemical shift value, and the PCA analysis result display means executes the PCA analysis calculation executed again. Then, the PCA analysis result chart is displayed again.

好適な実施形態では、前記ユーザ要求入力手段により入力された前記ユーザ要求が、前記モデリング力／識別力チャートに表示された前記識別力又はモデリング力が所定閾値より高い変数たるケミカルシフト値を選択するとともに変数を追加したPCA解析計算の再実行を要求するものである場合、前記PCA解析手段が、前記選択されたケミカルシフト値を現在の変数たるケミカルシフト値に追加してなる変数を用いてPCA解析計算を再度実行し、前記PCA解析結果表示手段が、再度実行された前記PCA解析計算の結果を受けて、前記PCA解析結果チャートを改めて表示する。 In a preferred embodiment, the user request input by the user request input means selects a chemical shift value as a variable in which the discriminating power or modeling power displayed in the modeling power / discriminating power chart is higher than a predetermined threshold. When the PCA analysis calculation requesting the re-execution of the PCA analysis calculation with the variable added thereto is used, the PCA analysis means uses the variable formed by adding the selected chemical shift value to the current chemical shift value. The analysis calculation is executed again, and the PCA analysis result display means receives the result of the PCA analysis calculation executed again and displays the PCA analysis result chart again.

また、好適な実施形態では、前記ユーザ要求入力手段により入力された前記ユーザ要求が、前記モデリング力／識別力チャートに表示された前記識別力又はモデリング力が所定閾値より高い変数たるケミカルシフト値を選択するとともにSIMCA解析計算の再実行を要求するものである場合、前記SIMCA解析手段が、前記選択されたケミカルシフト値だけを用いてPCA解析計算を再度実行し、前記SIMCA解析結果表示手段が、再度実行された前記SIMCA解析計算の結果を受けて、前記SIMCA解析結果チャートを改めて表示する。 In a preferred embodiment, the user request input by the user request input means is a chemical shift value that is a variable whose discrimination power or modeling power displayed in the modeling power / discrimination power chart is higher than a predetermined threshold. When selecting and requesting re-execution of the SIMCA analysis calculation, the SIMCA analysis means executes the PCA analysis calculation again using only the selected chemical shift value, and the SIMCA analysis result display means, In response to the result of the SIMCA analysis calculation executed again, the SIMCA analysis result chart is displayed again.

本発明は更に、上述した原理に基づくNMRデータの処理方法、NMRデータ処理のためのコンピュータプログラム、及びNMRスペクトルの統計処理のためのコンピュータプログラムも提供する。 The present invention further provides a method for processing NMR data based on the above-described principle, a computer program for processing NMR data, and a computer program for statistical processing of NMR spectra.

本発明のNMRデータの処理装置及び方法によれば、多数サンプルのNMRスペクトルの解析において、多変量解析を有効に活かすことができ、そのため、メタボノミクスなどの応用分野において、より精度の高い結果又は結論を導くことが容易になる。 According to the NMR data processing apparatus and method of the present invention, multivariate analysis can be effectively utilized in the analysis of NMR spectra of a large number of samples. Therefore, in an application field such as metabonomics, more accurate results or It is easier to draw conclusions.

本発明のNMRデータの処理装置及び方法によれば、NMRスペクトルに絶対値微分を適用することで、NMRスペクトルの解析処理の効率が向上する。 According to the NMR data processing apparatus and method of the present invention, the efficiency of NMR spectrum analysis processing is improved by applying absolute value differentiation to an NMR spectrum.

本発明のNMRスペクトルの統計処理装置によれば、多数サンプルのNMRスペクトルの解析において、多変量解析を有効に活かすことができる。 According to the statistical processing apparatus for NMR spectra of the present invention, multivariate analysis can be effectively utilized in the analysis of NMR spectra of a large number of samples.

本発明のNMRスペクトルの統計処理装置によれば、これをメタボノミクスなどの応用分野で利用することにより、多変量解析処理を活用して、より精度の高い結果又は結論を導くことが容易になる。 According to the statistical processing apparatus for NMR spectrum of the present invention, by using this in an application field such as metabonomics, it becomes easy to draw a more accurate result or conclusion by utilizing multivariate analysis processing. .

以下、図面を参照して本発明の実施形態を説明する。 Hereinafter, embodiments of the present invention will be described with reference to the drawings.

図１は、本発明に従うNMRデータ処理装置の一実施形態の全体的な構成と機能を示す。 FIG. 1 shows the overall configuration and functions of an embodiment of an NMR data processing apparatus according to the present invention.

図１に示すように、本発明に従うNMRデータ処理装置としてのコンピュータシステム１００が、NMR装置１０２と関連して用いられる。コンピュータシステム１００は、プログラムされた１台コンピュータマシン(例えば、汎用のパーソナルコンピュータ、ワークステーション或いはメインフレームコンピュータマシンなど）であってもよいし、或は複数台のコンピュータマシンから構成されてもよい。なお、コンピュータシステムに代えて、１以上の専用ハードウェア回路、又は専用ハードウェア回路とコンピュータシステムとの組み合わせなどが、NMRデータ処理装置として用いられてもよい。 As shown in FIG. 1, a computer system 100 as an NMR data processing apparatus according to the present invention is used in association with an NMR apparatus 102. The computer system 100 may be a programmed single computer machine (for example, a general-purpose personal computer, a workstation, or a mainframe computer machine), or may be composed of a plurality of computer machines. Instead of a computer system, one or more dedicated hardware circuits, or a combination of a dedicated hardware circuit and a computer system may be used as the NMR data processing apparatus.

コンピュータシステム１００は、プロセッサ１０４と、プロセッサ１０４のためのコンピュータプログラム及びプロセッサ１０４により処理されるデータを格納するための記憶装置１０６とを有する。プロセッサ１０４は、記憶装置３０６に記憶されている所定のコンピュータプログラムを実行することによりNMRデータの処理を行う。NMRデータを処理する目的が異なれば、その処理の方法も当然に異なるが、ここでは、NMRデータの処理の一例として、メタボノミクスによりマーカ成分を確定することを目的として、多数のサンプルのNMRスペクトルを収集し、それらのスペクトルを統計的に解析する場合を取り上げて説明する。 The computer system 100 includes a processor 104 and a storage device 106 for storing a computer program for the processor 104 and data processed by the processor 104. The processor 104 processes NMR data by executing a predetermined computer program stored in the storage device 306. If the purpose of processing the NMR data is different, the processing method is naturally different, but here, as an example of the processing of the NMR data, the NMR spectra of a large number of samples are used for the purpose of determining the marker component by metabonomics. The case of collecting and statistically analyzing those spectra will be described.

プロセッサ１０４が行うNMRデータの処理は、大きくFIDデータ入力処理１０８、NMRスペクトル処理１１０、データ縮約処理１１２、積分結果データ保存／出力処理１１４、及び統計処理１１６に分けられる。FIDデータ入力処理１０８では、NMR装置１０２から出力される各サンプルのFID(Free Induction Decay：自由誘導減衰）データが入力され、それにより、多数のサンプルのFIDデータが収集されて記憶装置１０６に記憶される。NMRスペクトル処理１１０では、各サンプルのFIDデータがフーリエ変換によりNMRスペクトルデータに変換され、更にそのNMRスペクトルデータが本発明の原理に従った「絶対値微分法（Absolute
Differential Calculation Method)」により「絶対値微分NMRスペクトル（Absolute Differential NMR Spectrum」データに変換され、それにより、多数のサンプルの「絶対値微分NMRスペクトル」データが記憶装置１０６に記憶される。ここで、「絶対値微分法」及び「絶対値微分NMRスペクトル」という用語は、この明細書で説明の便宜上用いられる特殊な用語であり、その詳細は後に説明する。「絶対値微分NMRスペクトル」は、以下、「ADスペクトル」と略称する。 The processing of the NMR data performed by the processor 104 is roughly divided into an FID data input process 108, an NMR spectrum process 110, a data contraction process 112, an integration result data storage / output process 114, and a statistical process 116. In the FID data input process 108, FID (Free Induction Decay) data of each sample output from the NMR apparatus 102 is input, whereby FID data of a large number of samples is collected and stored in the storage device 106. Is done. In the NMR spectrum processing 110, the FID data of each sample is converted into NMR spectrum data by Fourier transform, and the NMR spectrum data is further converted into an "Absolute differential method (Absolute differential method (Absolute
“Absolute Differential NMR Spectrum” data is converted by the “Differential Calculation Method”, whereby “absolute differential NMR spectrum” data of a large number of samples is stored in the storage device 106. The terms “absolute derivative method” and “absolute derivative NMR spectrum” are special terms used for convenience in this specification, and will be described in detail later. This is abbreviated as “AD spectrum”.

次のデータ縮約処理１１２では、本発明の原理に従った方法で「最適化バケットセット」が算出され、そして、その「最適化バケットセット」を用いて多数のサンプルのADスペクトルに「最適化バケット積分」が実行され、それにより、多数のサンプルのADスペクトルの縮約データであるヒストグラム（バケット積分データのセット）が生成される。ここで、「最適化バケットセット」及び「最適化バケット積分」という用語も、この明細書で説明の便宜上用いられる特殊な用語であり、その詳細は後に説明する。積分結果データ保存／出力処理１１４では、上記のデータ縮約処理１１２により生成された多数のサンプルのヒストグラムデータ（バケット積分データのセット）が、記憶装置１０６に記憶され、また必要に応じて外部へ出力される。統計処理１１６では、多数のサンプルのヒストグラムデータを用いて、処理目的に応じた統計処理、例えばマーカ成分を確定するための検出多変量解析、が行われる。そして、その統計処理の結果を表すデータが記憶装置１０６に記憶され、また必要に応じて外部へ出力される。 In the next data reduction process 112, an “optimized bucket set” is calculated in a manner in accordance with the principles of the present invention, and the “optimized bucket set” is used to “optimize” the AD spectrum of a large number of samples. "Bucket integration" is performed, thereby generating a histogram (a set of bucket integration data) that is the reduced data of the AD spectrum of a large number of samples. Here, the terms “optimized bucketset” and “optimized bucket integral” are also special terms used for convenience of explanation in this specification, and details thereof will be described later. In the integration result data storage / output process 114, histogram data (a set of bucket integration data) of a large number of samples generated by the data reduction process 112 is stored in the storage device 106, and externally as necessary. Is output. In the statistical processing 116, statistical processing according to the processing purpose, for example, detection multivariate analysis for determining a marker component, is performed using histogram data of a large number of samples. Data representing the result of the statistical processing is stored in the storage device 106 and is output to the outside as necessary.

ここで、FIDデータ入力処理１０８、積分結果データ保存／出力処理１１４及び統計処理１１６には、それぞれ、公知の各種の処理方法が採用可能であるため、これらについての詳細な説明は省略する。以下では、絶対値微分法を用いたNMRスペクトル処理１１０と、最適化バケット積分によるデータ縮約１１４について、詳細且つ具体的に説明する。 Here, since various known processing methods can be employed for the FID data input processing 108, the integration result data storage / output processing 114, and the statistical processing 116, detailed description thereof will be omitted. Hereinafter, the NMR spectrum processing 110 using the absolute value differentiation method and the data contraction 114 using the optimized bucket integration will be described in detail and specifically.

図２は、NMRスペクトル処理１１０の流れを示す。 FIG. 2 shows the flow of the NMR spectrum processing 110.

図２に示すように、ステップ１２０で、NMR装置１０２から取得されて記憶装置１０６に保存されている各サンプルのFIDデータが、記憶装置１０６から読み込まれる。ステップ１２２で、そのFIDデータに対してウィンドウ処理が行われ、そのFIDデータのSN比が改善され、またリップルが除去される。ステップ１２４で、ウィンドウ処理１２２を経たFIDデータに対してフーリエ変換処理が行われて、そのFIDデータがNMRスペクトルデータに変換される。ステップ１２６で、そのNMRスペクトルデータに対して絶対値微分演算が行われて、そのNMRスペクトルデータがADスペクトルデータ（絶対値微分NMRスペクトルデータ）に変換される。ステップ１２８で、そのNMRスペクトルデータとADスペクトルデータとが記憶装置１０６に保存され、また、必要に応じて外部へ出力される。多数のサンプルの各々について、上記のステップ１２０〜１２８の処理が行われ、それにより、多数のサンプルのNMRスペクトルデータとADスペクトルデータが記憶装置１０６に蓄積される。 As shown in FIG. 2, in step 120, the FID data of each sample acquired from the NMR device 102 and stored in the storage device 106 is read from the storage device 106. At step 122, window processing is performed on the FID data to improve the signal-to-noise ratio of the FID data and to remove ripples. In step 124, Fourier transform processing is performed on the FID data that has passed through the window processing 122, and the FID data is converted into NMR spectrum data. In step 126, an absolute value differential operation is performed on the NMR spectrum data, and the NMR spectrum data is converted into AD spectrum data (absolute value differential NMR spectrum data). In step 128, the NMR spectrum data and AD spectrum data are stored in the storage device 106, and output to the outside as necessary. For each of a large number of samples, the processing of steps 120 to 128 described above is performed, whereby the NMR spectrum data and AD spectrum data of the large number of samples are accumulated in the storage device 106.

ステップ１２６の絶対値微分演算について、以下、具体的に説明する。 The absolute value differentiation operation in step 126 will be specifically described below.

絶対値微分の対象であるNMRスペクトルは、δをケミカルシフト、ｊを虚数記号としたとき、次式
NMRスペクトル＝R(δ)＋ｊ・I(δ) …(1)
で表現することができる。ここで、R(δ)はNMRスペクトルの実部であり、I(δ)は虚部である。そして、NMRスペクトルを絶対値微分した結果であるADスペクトルは、次式 The NMR spectrum, which is the target of absolute value differentiation, has the following formula when δ is a chemical shift and j is an imaginary symbol:
NMR spectrum = R (δ) + j · I (δ) (1)
Can be expressed as Here, R (δ) is the real part of the NMR spectrum and I (δ) is the imaginary part. And the AD spectrum which is the result of absolute value differentiation of the NMR spectrum is

で表現することができ（又は、ケミカルシフトδに代えて周波数ωで微分してもよい）。すなわち、これは、NMRスペクトルの実部R(δ)と虚部I(δ)それぞれの微分値の２乗和の平方根である。

(Or may be differentiated by frequency ω instead of chemical shift δ). That is, this is the square root of the square sum of the differential values of the real part R (δ) and the imaginary part I (δ) of the NMR spectrum.

図３Aは、NMRスペクトルの一例の実部の波形を示し、図３BはこのNMRスペクトルの絶対値微分であるADスペクトルの波形を示す。 FIG. 3A shows the waveform of the real part of an example of an NMR spectrum, and FIG. 3B shows the waveform of an AD spectrum that is an absolute value derivative of the NMR spectrum.

図３Aに示されたNMRスペクトルの実部１４０から分かるように、このNMRスペクトルには位相ずれ（例えば、溶媒信号の部分１４２に位相ずれが顕著に表れている）が含まれ、かつ、そのベースライン成分は平坦ではなく歪んでいる。FIDデータから得られたNMRスペクトルは、殆ど例外なく、位相ずれと歪んだベースライン成分を含んでいる。従来技術によれば、NMRスペクトルに位相補正とベースライン補正が施される必要がある。 As can be seen from the real part 140 of the NMR spectrum shown in FIG. 3A, this NMR spectrum contains a phase shift (for example, the phase shift is markedly shown in the portion 142 of the solvent signal), and its base. The line component is not flat but distorted. The NMR spectra obtained from FID data almost without exception contain phase shifts and distorted baseline components. According to the prior art, the NMR spectrum needs to be phase corrected and baseline corrected.

位相補正は位相ずれを除去するための操作である。位相ずれが含まれたNMRスペクトルでは、実部R(δ)と虚部I(δ)におけるピーク波形はそれぞれ図４Aに例示するような波形になる。位相補正では、位相ずれを低減して、図４Bに例示するような波形に整形する。また、ベースライン補正は、NMRスペクトルからベースライン成分を差し引くための補正である。図５Bに例示するように歪んだベースラインBLを、図５Bに例示するようなゼロで一定のベースラインBLに修正するのである。しかし、位相補正とベースライン補正は、オペレータにより手動で調整される必要があり、時間がかかり、かつ、オペレータの主観や技量のバイアスが入ってしまう。さらに、苦労して調整しても、十分満足できる結果に到達することが難しい。 Phase correction is an operation for removing a phase shift. In the NMR spectrum including the phase shift, the peak waveforms in the real part R (δ) and the imaginary part I (δ) are waveforms as illustrated in FIG. 4A. In the phase correction, the phase shift is reduced and shaped into a waveform as illustrated in FIG. 4B. Baseline correction is correction for subtracting the baseline component from the NMR spectrum. The distorted baseline BL as illustrated in FIG. 5B is corrected to a zero and constant baseline BL as illustrated in FIG. 5B. However, the phase correction and the baseline correction need to be manually adjusted by the operator, which is time consuming and introduces a bias in the subjectivity and skill of the operator. Furthermore, even if it is difficult to adjust, it is difficult to achieve a sufficiently satisfactory result.

これに対し、図３Bに例示されるように、NMRスペクトルの絶対値微分結果であるADスペクトル１４４においては、位相ずれが解消され、さらに、ベースラインもゼロ近傍でほぼ一定である。このことは、ADスペクトル１４４を用いれば、位相補正とベースライン補正が実質的に不要になることを意味する。その理由について具体的に説明する。 On the other hand, as illustrated in FIG. 3B, in the AD spectrum 144, which is the absolute value differential result of the NMR spectrum, the phase shift is eliminated, and the baseline is substantially constant in the vicinity of zero. This means that phase correction and baseline correction are substantially unnecessary if the AD spectrum 144 is used. The reason will be specifically described.

NMRスペクトルの実部R(δ)と虚部I(δ)は、次式
R(δ)＝A(δ)・cos(θ(δ)) …(3)
I(δ)＝A(δ)・sin(θ(δ)) …(4)
で表現することができる。ここで、A(δ)は振幅であり、θ(δ)は位相である。位相θ(δ)は、大雑把にはケミカルシフトδの一次関数、
θ(δ)＝K・δ＋L …(5)
で表すことができ、ここで、KとLは一定の係数である。これらの式を用いて絶対値微分を計算すると、ADスペクトルは次式、 The real part R (δ) and imaginary part I (δ) of the NMR spectrum are
R (δ) = A (δ) · cos (θ (δ)) (3)
I (δ) ＝ A (δ) ・ sin (θ (δ))… (4)
Can be expressed as Here, A (δ) is the amplitude, and θ (δ) is the phase. The phase θ (δ) is roughly a linear function of chemical shift δ,
θ (δ) = K · δ + L (5)
Where K and L are constant coefficients. Using these equations to calculate the absolute derivative, the AD spectrum is

に変換される。

Is converted to

上記(6)式から分かるように、ADスペクトルには位相θ(δ)の成分が含まれていない。また、(6)式において支配的な部分は第1項であるが、そこでは振幅A(δ)がケミカルシフトδで微分されており、この微分によりベースライン成分（これは、図５Aの例から分かるようにケミカルシフトδに対して緩やかな傾きで歪んでいる）は実質的にゼロになる。そのため、ADスペクトルでは、ベースライン成分はかなり小さく低減される。このように、絶対値微分という簡易な演算により、位相補正およびベースライン補正をすることなく高品位なスペクトルを得ることができる。 As can be seen from the above equation (6), the AD spectrum does not contain a component of phase θ (δ). In addition, the dominant part in the equation (6) is the first term, where the amplitude A (δ) is differentiated by the chemical shift δ, and the baseline component (this is the example of FIG. 5A). As can be seen from the above, the chemical shift δ is distorted with a gentle inclination), which is substantially zero. Therefore, the baseline component is reduced considerably in the AD spectrum. As described above, a high-quality spectrum can be obtained by a simple calculation called absolute value differentiation without performing phase correction and baseline correction.

また、NMRスペクトルには、図３Aに参照番号１４２で示すような溶媒（例えば軽水）信号の巨大なピークが存在する。その巨大な溶媒ピークの裾野部分は幅広く広がっており、これも、スペクトルの精度低下の主原因の一つである。しかし、ADスペクトルにおいては、ベースラインと同様に上記理由によって溶媒ピークの裾野部分も良好に低減されるので、溶媒ピークの影響を極めて小さくすることができる。 In addition, the NMR spectrum has a huge peak of a solvent (for example, light water) signal as indicated by reference numeral 142 in FIG. 3A. The base of the huge solvent peak spreads widely, which is also one of the main causes of spectrum accuracy degradation. However, in the AD spectrum, the base part of the solvent peak is also reduced well for the above reason as in the case of the base line, so that the influence of the solvent peak can be made extremely small.

なお、ADスペクトルの一つの欠点は、NMRスペクトルの絶対的な定量性が失われること、特に、ブロードなピークの信号強度が微分により減衰してしまうことである。しかしながら、複数のNMRスペクトル相互間の相対的な関係（例えば、どのケミカルシフトδの位置にピークが現れるかという点での相対的関係）は、ADスペクトルにおいても保たれるため、メタボノミクスのように多数のNMRスペクトルを解析する目的においては、ADスペクトルは非常に有用な情報である。 One drawback of the AD spectrum is that the absolute quantification of the NMR spectrum is lost, and in particular, the signal intensity of a broad peak is attenuated by differentiation. However, the relative relationship between multiple NMR spectra (for example, the relative relationship in terms of which chemical shift δ appears at the peak) is maintained in the AD spectrum. For the purpose of analyzing a large number of NMR spectra, the AD spectrum is very useful information.

再び図２を参照して、ステップ１２６で上述したADスペクトルが生成された後、ステップ１２８でそのADスペクトルが記憶装置１０６に保存される。統計処理に必要な多数のサンプルのNMRスペクトルのそれぞれについて、図２に示す処理が繰り返され、その結果、多数のサンプルのADスペクトルが記憶装置１０６に蓄積される。その後、図１に示すように、最適化バケット積分を用いたデータ縮約処理１１２が実行される。 Referring again to FIG. 2, after the above-described AD spectrum is generated in step 126, the AD spectrum is stored in the storage device 106 in step 128. The process shown in FIG. 2 is repeated for each of the NMR spectra of a large number of samples necessary for statistical processing, and as a result, the AD spectra of the large number of samples are accumulated in the storage device 106. Thereafter, as shown in FIG. 1, a data reduction process 112 using optimized bucket integration is executed.

図６は、最適化バケット積分を用いたデータ縮約処理１１２の流れを示す。 FIG. 6 shows the flow of the data reduction process 112 using the optimized bucket integration.

図６に示すように、ステップ１７０で、記憶装置１０６に蓄積された多数のADスペクトルのそれぞれについて、そのケミカルシフト（δ）軸スケールの基準点（δ基準点）（原点）が自動的に設定される。δ基準点の設定は公知の方法で行うことができ、それにより、いずれのADスペクトルについても、同じ特定物質のピークが現れるケミカルシフト軸上のポイントが基準点として設定される。 As shown in FIG. 6, in step 170, the reference point (δ reference point) (origin) of the chemical shift (δ) axis scale is automatically set for each of a large number of AD spectra accumulated in the storage device 106. Is done. The δ reference point can be set by a known method, whereby the point on the chemical shift axis where the peak of the same specific substance appears for any AD spectrum is set as the reference point.

その後、ステップ１７１で、多数のADスペクトルを共通のスペクトル上に投影する処理が行われる。このスペクトル投影処理１７１は、それらのADスペクトルのケミカルシフト軸のスケールを合わせた上で、ケミカルシフトδのポイント毎に全てのADスペクトルの最大値又は総和を計算するという処理である。その結果、多数のADスペクトルの最大値又は総和として１つの投影ADスペクトルが生成され、その投影ADスペクトルには、全てのADスペクトルに含まれる全てのピークが現れることになる。 Thereafter, in step 171, a process of projecting a large number of AD spectra onto a common spectrum is performed. The spectrum projection process 171 is a process of calculating the maximum value or sum of all AD spectra for each point of the chemical shift δ after matching the scales of the chemical shift axes of those AD spectra. As a result, one projection AD spectrum is generated as the maximum value or the sum of many AD spectra, and all peaks included in all AD spectra appear in the projection AD spectrum.

図７A,Bは、このスペクトル投影処理１７１の原理を説明する図であり、図８は、このスペクトル投影処理１７１の流れを示す。 7A and 7B are diagrams for explaining the principle of the spectrum projection processing 171, and FIG. 8 shows the flow of the spectrum projection processing 171.

スペクトル投影処理１７１では、図８に示すように、ステップ１９０で、多数のADスペクトルの中から一つのADスペクトルSaが選択される。例えば図７Aに示すようなスペクトルSaが選択される。このADスペクトルSaが、投影ADスペクトルSpに初期設定される。ステップ１９２で、別の一つのADスペクトルSbが選択される。例えば、図７Bに示すようなADスペクトルSbが選択される。 In the spectrum projection process 171, as shown in FIG. 8, in step 190, one AD spectrum Sa is selected from a large number of AD spectra. For example, the spectrum Sa as shown in FIG. 7A is selected. This AD spectrum Sa is initially set to the projected AD spectrum Sp. In step 192, another AD spectrum Sb is selected. For example, an AD spectrum Sb as shown in FIG. 7B is selected.

ステップ１９４で、両スペクトルSbとSpのケミカルシフト軸スケールを一致させた上で（例えば、両スペクトルの基準点rbとraの差分(rb-ra)だけ、スペクトルSbのδスケール（ケミカルシフト軸スケール）をシフトさせて、基準点rbとraを一致させた上で）、両スペクトルSbとSpの強度の最大値又は和がδポイント毎に計算される。具体的には、
最大値： if(Sp > Sb(rb-ra)) Sp=Sa else Sp =Sb(rb-ra)
和： Sp = Sp+Sb (rb-ra)
という計算がδポイント毎に行われる。それにより、図７Cに示すように、投影ADスペクトルSpには、処理された複数のスペクトルSaとSbに含まれている全てのピークが投影されることになる。なお、最大値と和のうち、いずれか一方のみが計算されてもよいし、或いは、最大値と和の両方が別個の投影ADスペクトルとして計算されてもよい。 In step 194, the chemical shift axis scales of both spectra Sb and Sp are matched (for example, the δ scale (chemical shift axis scale) of the spectrum Sb is set by the difference (rb-ra) between the reference points rb and ra of both spectra. ) And shifting the reference points rb and ra), the maximum value or sum of the intensities of both spectra Sb and Sp is calculated for each δ point. In particular,
Maximum value: if (Sp> Sb (rb-ra)) Sp = Sa else Sp = Sb (rb-ra)
Sum: Sp = Sp + Sb (rb-ra)
Is calculated every δ points. Thereby, as shown in FIG. 7C, all peaks included in the processed spectra Sa and Sb are projected onto the projected AD spectrum Sp. Note that only one of the maximum value and the sum may be calculated, or both the maximum value and the sum may be calculated as separate projected AD spectra.

ステップ１９６により、処理対象である全てのADスペクトルについて、上述したステップ１９２と１９４が繰り返される。全てのADスペクトルについての繰り返し処理が終わると、その全てのADスペクトルに含まれる全てのピークが投影された投影ADスペクトルSｐが完成する。ステップ１９８で、その投影ADスペクトルSｐが記憶装置１０６に保存され、また、必要に応じて外部へ出力される。 Step 196 repeats steps 192 and 194 described above for all AD spectra to be processed. When the iterative process is completed for all AD spectra, a projected AD spectrum Sp in which all the peaks included in all the AD spectra are projected is completed. In step 198, the projection AD spectrum Sp is stored in the storage device 106, and is output to the outside as necessary.

再び図６を参照する。上述したスペクトル投影処理１７１が終わると、ステップ１７２で、投影ADスペクトルのバケット積分が計算される。 Refer to FIG. 6 again. When the above-described spectral projection processing 171 ends, in step 172, the bucket integral of the projected AD spectrum is calculated.

図９は、この投影ADスペクトルのバケット積分処理１７２の原理を説明する図であり、図１０は、このバケット積分処理１７２の流れを示す。 FIG. 9 is a diagram for explaining the principle of the bucket integration processing 172 of the projected AD spectrum, and FIG. 10 shows the flow of the bucket integration processing 172.

図９において、参照番号２００は、バケット積分が実行される１単位の積分範囲を示し、「バケット」と呼ばれる。バケット２００は、このバケット積分処理１７２においては、規定された一定の幅W（例えば０．０４ppm）をもつが、後の処理において、この幅Wが修正され最適化されることになる。参照番号２０２は、バケット積分を実行すべきケミカルシフトδの全範囲を示し、例えば０ppm〜約１６ppmの範囲である。全範囲２０２は、例えば約２００個のバケット２００に分割される。このように全範囲２０２内に設定された多数のバケット２００の集合を、以下、「バケットセット」という。また、参照番号２０４は、バケット積分を実行しない範囲を示し、「ダークリジョン」と呼ばれる。 In FIG. 9, reference numeral 200 indicates an integration range of one unit in which bucket integration is performed, and is referred to as “bucket”. The bucket 200 has a specified constant width W (for example, 0.04 ppm) in the bucket integration processing 172, but the width W is corrected and optimized in the subsequent processing. Reference number 202 indicates the full range of chemical shift δ at which bucket integration is to be performed, for example in the range of 0 ppm to about 16 ppm. The entire range 202 is divided into about 200 buckets 200, for example. A set of a large number of buckets 200 set in the entire range 202 in this way is hereinafter referred to as “bucket set”. Reference numeral 204 indicates a range in which bucket integration is not performed, and is referred to as “dark region”.

図１０に示すように、バケット積分処理１７２では、ステップ２１０で、図９に示された全範囲２０２が多数の一定幅Wのバケット２００に均等に分割され、そして、バケット２００毎に投影ADスペクトルSpの積分値（すなわち、各バケットにおける投影ADスペクトルSpとケミカルシフト軸とにより挟まれた領域の面積）が計算される。それにより、バケット２００の個数に相当する個数のバケット積分データの集合が得られる。この多数のバケット積分データの集合を、以下「積分データセット」という。この積分データセットは、投影ADスペクトルSpのバケット毎のピーク面積を示す一種のヒストグラムである。 As shown in FIG. 10, in the bucket integration process 172, in step 210, the entire range 202 shown in FIG. 9 is equally divided into a large number of constant width W buckets 200, and the projected AD spectrum for each bucket 200. The integral value of Sp (that is, the area of the region sandwiched between the projected AD spectrum Sp and the chemical shift axis in each bucket) is calculated. Thereby, a set of bucket integral data corresponding to the number of buckets 200 is obtained. A set of such a large number of bucket integration data is hereinafter referred to as an “integration data set”. This integral data set is a kind of histogram indicating the peak area for each bucket of the projected AD spectrum Sp.

その後、ステップ２１２で、ダークリジョン２０４に該当するバケットと、そのバケットセットのバケット積分データが、ステップ２１０で得られたバケットセットと積分データセットの中から削除される。ステップ２１４では、その積分データセットの総和が所定値になるように、その積分データセットがノーマライズされる。そして、ノーマライズされた積分データセット（ヒストグラムデータ）と、バケットセットに含まれる全てのバケットの位置（始点と終点）を表したバケットセット情報とが、記憶装置１０６に保存され、また、必要に応じて外部出力される。 Thereafter, in step 212, the bucket corresponding to the dark region 204 and the bucket integration data of the bucket set are deleted from the bucket set and integration data set obtained in step 210. In step 214, the integral data set is normalized so that the total sum of the integral data sets becomes a predetermined value. Then, the normalized integral data set (histogram data) and bucket set information representing the positions (start point and end point) of all buckets included in the bucket set are stored in the storage device 106, and if necessary, Is output externally.

再び図６を参照する。上述したバケット積分処理１７２が終わると、ステップ１７４で、自動積分ブロック設定に基づくバケットの自動修正処理が行われる。ここでは、公知技術である自動積分ブロック設定処理により、投影ADスペクトルSp上の種々のピークが自動的に検出され、検出された各ピークが複数のバケットにより分割されないようにバケットの位置と幅Wが修正される。以下では、「ピーク」という用語は、単一の孤立したピークだけでなく、一つのピークと看做せるような一塊の複数のピークを指す意味でも用いる。 Refer to FIG. 6 again. When the above-described bucket integration process 172 is completed, in step 174, the bucket automatic correction process based on the automatic integration block setting is performed. Here, various peaks on the projected AD spectrum Sp are automatically detected by a known automatic integration block setting process, and the position and width W of the bucket are set so that each detected peak is not divided by a plurality of buckets. Is fixed. In the following, the term “peak” is used not only to indicate a single isolated peak but also to indicate a plurality of peaks in a lump that can be regarded as a single peak.

図１１は、この自動積分ブロック設定に基づくバケットの自動修正処理１７４の原理を説明する図であり、図１２は、このバケット自動修正処理１７４の流れを示す。 FIG. 11 is a diagram for explaining the principle of the bucket automatic correction processing 174 based on the automatic integration block setting, and FIG. 12 shows the flow of the bucket automatic correction processing 174.

このバケット自動修正処理１７４では、図１２に示すように、ステップ２３０で、公知技術である自動積分ブロック設定処理が投影ADスペクトルSpに対して施される。それにより、投影ADスペクトルSp上の種々のピークが自動的に検出され、そして、検出されたピークが存在するケミカルシフトδ領域（以下、「ピーク領域」という）にそれぞれ１対1の関係で積分ブロック（積分区間）が自動的に設定される。例えば、図１１Aに示されるように、ADスペクトルSp上に２つのピーク（例えば、左側のものは３つのピークの塊であり、右側は単一のピークである）が存在する場合、その２つのピークが自動的に検出され、そして、図１１Bに例示するように、左側のピーク領域には一つの積分ブロック２２０xが設定され、また、右側のピーク領域には別の一つの積分ブロック２２０yが設定される。いずれの積分ブロック２２０x、２２０yも、対応するピーク領域をカバーしており、一つのピーク領域が複数の積分ブロックにより分割されることはない。 In this bucket automatic correction process 174, as shown in FIG. 12, in step 230, an automatic integration block setting process, which is a known technique, is performed on the projection AD spectrum Sp. As a result, various peaks on the projected AD spectrum Sp are automatically detected and integrated in a one-to-one relationship with the chemical shift δ region (hereinafter referred to as “peak region”) where the detected peak exists. A block (integration interval) is automatically set. For example, as shown in FIG. 11A, if there are two peaks on the AD spectrum Sp (eg, the left one is a cluster of three peaks and the right one is a single peak), the two A peak is automatically detected, and as shown in FIG. 11B, one integration block 220x is set in the left peak region, and another integration block 220y is set in the right peak region. Is done. Each of the integration blocks 220x and 220y covers the corresponding peak region, and one peak region is not divided by a plurality of integration blocks.

これに対し、図１１Cに例示するように、現在のバケット２００a、２００b、２００ｃ、…は、図９を参照して既に説明したように、全範囲２０２を単純に一定幅W（例えばｍ０．０４ppm）で均等分割したものであるから、一つのピーク領域を複数のバケットが分割しているおそれがある。図１１A,Cに示す例では、左側のピーク領域が２つのバケット２００bと２００cにより分割されている。記憶装置１０６に現在保存されている積分データセット（ヒストグラム）は、このようなバケットセットを用いて計算されたものであるため、分割されたピークの情報が異なる積分データに分散され稀釈化されてしまっていおり、精度が良くない。そこで、この問題を解消するために、以下に述べる後続のステップが行われて、ピークを分割しないようにバケットセットが修正される。 On the other hand, as illustrated in FIG. 11C, the current buckets 200 a, 200 b, 200 c,..., As described above with reference to FIG. ), The plurality of buckets may divide one peak area. In the example shown in FIGS. 11A and 11C, the left peak region is divided by two buckets 200b and 200c. Since the integral data set (histogram) currently stored in the storage device 106 is calculated using such a bucket set, the divided peak information is distributed and diluted to different integral data. It is trapped and the accuracy is not good. Therefore, in order to solve this problem, the following steps described below are performed to modify the bucket set so as not to divide the peak.

すなわち、図１２に示すように、ステップ２３２で、それぞれの積分ブロックの始点に最も近いバケットの終点が記憶装置１０６内のバケットセット情報からサーチされ、見つかったバケットの終点が、それぞれ対応する積分ブロックの始点と同一値に修正される。さらに、ステップ２３４で、それぞれの積分ブロックの終点に最も近いバケットの始点が記憶装置１０６内のバケットセット情報からサーチされ、見つかったバケットの始点が、それぞれ対応する積分ブロックの終点と同一値に修正される。例えば、図１１B,C,Dに示された例では、左側の積分ブロック２００xの始点s(x)に最も近いバケット２００ａの終点e(a)がその始点s(x)と同一値に修正され、また、右側の積分ブロック２００yの始点s(y)に最も近いバケット２００dの終点e(d)がその始点s(y)と同一値に修正される。また、左側の積分ブロック２００xの終点e(x)に最も近いバケット２００dの始点s(d)がその終点e(x)と同一値に修正され、右側の積分ブロック２００yの終点e(y)に最も近いバケット２００fの始点s(f)がその終点e(y)と同一値に修正される。 That is, as shown in FIG. 12, at step 232, the end point of the bucket closest to the start point of each integration block is searched from the bucket set information in the storage device 106, and the end point of the found bucket is the corresponding integration block. It is corrected to the same value as the starting point. Further, in step 234, the starting point of the bucket closest to the end point of each integration block is searched from the bucket set information in the storage device 106, and the starting point of the found bucket is corrected to the same value as the end point of the corresponding integration block. Is done. For example, in the example shown in FIGS. 11B, 11C, and 11D, the end point e (a) of the bucket 200a closest to the start point s (x) of the left integration block 200x is corrected to the same value as the start point s (x). In addition, the end point e (d) of the bucket 200d closest to the start point s (y) of the right integration block 200y is corrected to the same value as the start point s (y). Also, the starting point s (d) of the bucket 200d closest to the end point e (x) of the left integration block 200x is corrected to the same value as the end point e (x), and the end point e (y) of the right integration block 200y is changed to the end point e (y). The start point s (f) of the nearest bucket 200f is corrected to the same value as the end point e (y).

上記始点／終点の修正の後、ステップ２３６で、一つの積分ブロックを分割する複数のバケットがサーチされ、その複数のバケットが一つのバケットに統合される。例えば、図１１B,C,Dに示された例では、上記始点／終点の修正の後は、左側の積分ブロック２００xだけが２つのバケット２００bと２００ｃにより分割されるので、その２つのバケット２００bと２００ｃが一つのバケット２００bcに統合される。 After the correction of the start point / end point, in step 236, a plurality of buckets dividing one integration block are searched, and the plurality of buckets are integrated into one bucket. For example, in the example shown in FIGS. 11B, 11C, and 11D, after the start point / end point correction, only the left integration block 200x is divided by the two buckets 200b and 200c. 200c is integrated into one bucket 200bc.

そして、ステップ２３８で、上述したステップ２３２〜２３６で修正された全てのバケットについてバケット積分が再計算される。そして、記憶装置１０６に現在記憶されている積分データセットのうち、該当するバケットの積分データが、再計算された積分データに置き換えられる。例えば図１１Dに示された例の場合、バケット２００a、２００bc、２００d、２００e及び２００fの全てに修正がなされたので、それらバケット２００a、２００bc、２００d、２００e及び２００fのそれぞれについてバケット積分が再計算されて、再計算された積分データが、前の積分データに置き換えられる。 In step 238, the bucket integral is recalculated for all buckets modified in steps 232 to 236 described above. Then, the integration data of the corresponding bucket in the integration data set currently stored in the storage device 106 is replaced with the recalculated integration data. For example, in the example shown in FIG. 11D, since all of the buckets 200a, 200bc, 200d, 200e, and 200f have been modified, the bucket integral is recalculated for each of the buckets 200a, 200bc, 200d, 200e, and 200f. Thus, the recalculated integration data is replaced with the previous integration data.

ステップ２３９では、以上のようにして修正された積分データセットと修正されたバケットセット情報とが記憶装置１０６に保存され、また、必要に応じて外部へ出力される。上記修正されたバケットセット情報によれば、自動積分ブロック設定処理によって投影ADスペクトルSpから自動的に検出された全てのピーク領域のいずれもが複数バケットにより分割されないように（つまり、一つのピーク領域は必ず一つのバケットによりカバーされる）ように、バケットセットが定義される。上記修正された積分データセットは、上記修正されたバケットセットに基づいているので、検出された全てのピークの情報は稀釈されておらず、よって、以前の積分データセットより精度が良い。 In step 239, the integration data set modified as described above and the modified bucket set information are stored in the storage device 106, and output to the outside as necessary. According to the modified bucket set information, none of the peak areas automatically detected from the projected AD spectrum Sp by the automatic integration block setting process is divided into multiple buckets (that is, one peak area). A bucketset is defined so that it is always covered by one bucket. Since the modified integration data set is based on the modified bucket set, the information of all detected peaks is not diluted and is therefore more accurate than the previous integration data set.

しかしながら、公知の自動積分ブロック設定処理により自動検出されたピークに基づいて設定されたケミカルシフトδ区間（積分区間）が必ずしも適切であるとはいえない。そこで、更にバケットデータセットと成分データセットを最適化するための処理が後続して行われる。すなわち、図６に示すように、上述した自動積分ブロック設定に基づくバケット自動修正処理１７４の後、更にバケットデータセットと成分データセットを最適化するために、指定ピーク情報に基づくバケット自動修正処理１７６と、バケットの手動修正処理１７８とが実行される。 However, the chemical shift δ interval (integration interval) set based on the peak automatically detected by the known automatic integration block setting process is not necessarily appropriate. Therefore, processing for further optimizing the bucket data set and the component data set is subsequently performed. That is, as shown in FIG. 6, after the bucket automatic correction processing 174 based on the above-described automatic integration block setting, in order to further optimize the bucket data set and the component data set, the bucket automatic correction processing 176 based on the designated peak information. Then, the bucket manual correction processing 178 is executed.

まず、指定ピーク情報に基づくバケット自動修正処理１７６について説明する。 First, the bucket automatic correction process 176 based on the designated peak information will be described.

この処理１７６では、既に同定されているNMRスペクトル（ADスペクトルではない）上の特定のピークが指定され、その指定されたピークのピーク領域をカバーするように積分ブロックが決定され、その積分ブロックに基づいて、指定ピークのピーク領域が複数バケットにより分割されないように、現在記憶されているバケットセットが修正される。指定されるピークは、典型的には、このスペクトル解析の用途においてマーカになり得るようなピークである。スペクトル解析の用途が異なればマーカも異なることになるから、用途に応じて、指定されるピークが異なってくる。 In this process 176, a specific peak on an already identified NMR spectrum (not an AD spectrum) is specified, and an integration block is determined so as to cover the peak area of the specified peak. Based on this, the currently stored bucketset is modified so that the peak area of the designated peak is not divided by multiple buckets. The specified peak is typically a peak that can be a marker in this spectral analysis application. Since the markers are different for different uses of spectrum analysis, the designated peak differs depending on the use.

ピークを指定するためのソースデータとしては、学会誌などで規定するNMRデータの一般的な発表フォーマット（Bull. Chem. Soc. Japan等）に従ったピーク定義データ用いることができる。そのようなフォーマットによるピーク定義データの一例を挙げると、
1H-NMR (CDCl3) δ: 8.06 (2H, t, J = 8.1 Hz, CH2), 7.24 (1H, s, CH3),…
というようなものである。このデータ例において、最初の「1H-NMR」は測定核名を示し、次の「(CDCl3)」は溶媒名を示し、次の「δ: 8.06」は１つ目のピークの中心のケミカルシフトを示し、次の「2H」は１つ目のピークのプロトン数を示し、次の「t」は１つ目のピークの分裂パターンを示し、次の「J = 8.1 Hz」は１つ目のピークのスピン結合定数を示し、次の「CH2」は１つ目のピークに関するコメントである。１つ目のピークの定義の後に、２つ目のピークの定義、３つ目のピークの定義、…が順次続く。ここで、ピークの分裂パターンには種々のものがあるが、代表的な分裂パターンは例えば以下の５種のパターン、
「s」：単一（singlet）
「d」：２分裂（doublet）
「t」：３分裂（triplet）
「q」：４分裂（qualtet）
「m」：多分裂（multiplet）
の組合せとなる。 As source data for designating a peak, peak definition data in accordance with a general presentation format (Bull. Chem. Soc. Japan, etc.) of NMR data defined in academic journals can be used. An example of peak definition data in such a format is
1H-NMR (CDCl3) δ: 8.06 (2H, t, J = 8.1 Hz, CH2), 7.24 (1H, s, CH3),…
It is something like that. In this data example, the first “1H-NMR” indicates the name of the measurement nucleus, the next “(CDCl 3)” indicates the name of the solvent, and the next “δ: 8.06” indicates the chemical shift at the center of the first peak. The next “2H” indicates the proton number of the first peak, the next “t” indicates the splitting pattern of the first peak, and the next “J = 8.1 Hz” indicates the first peak. The spin coupling constant of the peak is shown, and the next “CH2” is a comment regarding the first peak. The definition of the first peak is followed by the definition of the second peak, the definition of the third peak,. Here, there are various peak splitting patterns, but typical splitting patterns include, for example, the following five types of patterns:
“S”: single
"D": doublet
“T”: triplet
“Q”: qualtet
“M”: multiplet
It becomes a combination.

図１３Aは３分裂ピークの例を示しており、図１３Bは４分裂ピークの例を示している。 FIG. 13A shows an example of a three splitting peak, and FIG. 13B shows an example of a four splitting peak.

図１３Aに示すように、３分裂ピークの場合、上述したようなフォーマットのピーク定義データにより、その中心のケミカルシフトδ（単位：ppm）と１つのスピン結合定数J（単位：Hz）が定義される。図示は省略するが、２分裂ピークの場合も、同様に、中心のケミカルシフトδと１つのスピン結合定数Jが定義される。単一ピークの場合は、中心のケミカルシフトδが定義されるが、スピン結合定数Jは当然に定義されない。また、図１３Bに例示するように、４分裂ピークの場合、中心のケミカルシフトδと、広狭２つのスピン結合定数Ja、Jb（単位：Hz）が定義される。図示は省略するが、多分裂（５以上の分裂）ピークの場合、中心のケミカルシフトδと、分列数に応じた数の異なるスピン結合定数が定義される。なお、中心のケミカルシフトδが決め難い複数分裂ピークの場合、「δa−δb」というようにケミカルシフトδの範囲でピークが定義される場合もあり得る。 As shown in FIG. 13A, in the case of a three split peak, the center chemical shift δ (unit: ppm) and one spin coupling constant J (unit: Hz) are defined by the peak definition data in the format as described above. The Although illustration is omitted, in the case of the bisecting peak, similarly, the central chemical shift δ and one spin coupling constant J are defined. In the case of a single peak, the central chemical shift δ is defined, but the spin coupling constant J is not naturally defined. Further, as illustrated in FIG. 13B, in the case of a quadrilateral peak, a central chemical shift δ and two broad and narrow spin coupling constants Ja and Jb (unit: Hz) are defined. Although illustration is omitted, in the case of a multi-split (split of 5 or more) peak, a central chemical shift δ and a number of different spin coupling constants depending on the number of columns are defined. Note that in the case of a multiple split peak where the central chemical shift δ is difficult to determine, the peak may be defined in the range of the chemical shift δ, such as “δa−δb”.

図１３A、Bに例示するように、各ピークのピーク領域（そのピークが位置するケミカルシフトδ区間）は、上述したピーク定義データに含まれる数値、特に中心のケミカルシフトδとスピン結合定数Jとに基づいて決定することができる。例えば図１３Aに示す３分裂ピークの場合には、そのピーク領域２４０xの始点s(x)と終点e(x)は、
s(x)＝δ−（J＋PW/2)
e(x)＝δ＋（J＋PW/2)
により、また、図１３Bに示す４分裂ピークの場合には、そのピーク領域２４０yの始点s(y)と終点e(y)は、
s(y)＝δ−（Jb＋PW/2)
e(y)＝δ＋（Jb＋PW/2)
により計算することができる。ここで、「PW」はピーク幅（単位：Hz)を指し、これはNMR装置の分解能やサンプルの状態などの幾つかの条件によって異なってくるので、その条件セットに応じた適当な値を予め設定しておけばよい。図示以外の分裂パターンの場合についても、中心ケミカルシフトδとスピン結合定数J（ここで、単一ピークの場合はJ=0とすればよい）を用いてピークが定義されていれば、上記と同様のやり方でピーク領域が決定できる。また、「δa−δb」というようにケミカルシフトδの範囲でピークが定義されている場合には、そのピーク領域の始点sと終点eは、
s＝δa−PW/2
e＝δb＋PW/2
により計算することができる。 As illustrated in FIGS. 13A and 13B, the peak region of each peak (the chemical shift δ section where the peak is located) is a numerical value included in the above-described peak definition data, in particular, the chemical shift δ at the center, the spin coupling constant J, Can be determined based on For example, in the case of the triple split peak shown in FIG. 13A, the start point s (x) and end point e (x) of the peak region 240x are:
s (x) = δ− (J + PW / 2)
e (x) = δ + (J + PW / 2)
Further, in the case of the quadrilateral peak shown in FIG. 13B, the start point s (y) and the end point e (y) of the peak region 240y are
s (y) = δ− (Jb + PW / 2)
e (y) = δ + (Jb + PW / 2)
Can be calculated. Here, “PW” refers to the peak width (unit: Hz), which differs depending on several conditions such as the resolution of the NMR apparatus and the state of the sample. Just set it up. For splitting patterns other than those shown, if the peak is defined using the central chemical shift δ and the spin coupling constant J (where J = 0 for a single peak), then A peak area can be determined in a similar manner. Further, when a peak is defined in the range of chemical shift δ such as “δa−δb”, the start point s and end point e of the peak region are
s = δa−PW / 2
e = δb + PW / 2
Can be calculated.

さて、本実施形態にかかるNMRデータ処理装置１００は、指定ピークについて上述したようなフォーマットのピーク定義データを入力し、そのピーク定義データに基づいて、指定ピークのピーク領域を決定するために必要な情報（以下、「指定ピーク情報」という）を自動的に生成する。生成された指定ピーク情報は、後に何時でも利用できるように、記憶装置１０６に登録されることができる。 Now, the NMR data processing apparatus 100 according to the present embodiment inputs peak definition data in the format as described above for the designated peak, and is necessary for determining the peak area of the designated peak based on the peak definition data. Information (hereinafter referred to as “designated peak information”) is automatically generated. The generated designated peak information can be registered in the storage device 106 so that it can be used at any time later.

図１４A,Bは、記憶装置１０６に登録される指定ピーク情報のデータ構造例を示す。 14A and 14B show examples of the data structure of designated peak information registered in the storage device 106. FIG.

図１４Aに示すように、NMRスペクトル解析の異なる用途にそれぞれ対応した異なる指定ピーク情報セット２５０A、２５０B、２５０Cが、記憶装置１０６に登録されている。例えば、特定の疾病に罹患した者の血液に共通するマーカを確定しようとする用途と、特定種類の食品に共通するマーカを確定しようとする用途とでは、マーカとなり得るピークは異なるから、用途毎に異なるピークが指定されることになる。そのため、用途毎に別の指定ピーク情報セット２５０A、２５０B、２５０Cが登録される。指定ピーク情報セット２５０A、２５０B、２５０Cの各々には、指定された１以上のピークの各々のピーク指定情報が記録されている。各ピークのピーク指定情報には、そのピークを識別するためのピークIDと、そのピークのピーク領域を計算するためのケミカルシフト、分裂パターン及びスピン結合定数などの数値データが含まれている。これらの数値データは、NMRデータ処理装置１００に入力された上述したようなフォーマットのピーク定義データの中から、NMRデータ処理装置１００により自動抽出されたものである。 As shown in FIG. 14A, different designated peak information sets 250A, 250B, and 250C respectively corresponding to different uses of NMR spectrum analysis are registered in the storage device 106. For example, the peak that can be used as a marker differs between an application that attempts to determine a marker common to the blood of a person suffering from a specific disease and an application that attempts to determine a marker that is common to a specific type of food. Different peaks will be assigned to. Therefore, different designated peak information sets 250A, 250B, 250C are registered for each application. In each of the designated peak information sets 250A, 250B, 250C, peak designation information of each of one or more designated peaks is recorded. The peak designation information for each peak includes a peak ID for identifying the peak, and numerical data such as a chemical shift, a splitting pattern, and a spin coupling constant for calculating the peak area of the peak. These numerical data are automatically extracted by the NMR data processing apparatus 100 from the peak definition data in the format as described above input to the NMR data processing apparatus 100.

また、図１４Bに例示するように、予め設定されたピーク幅PWを示すピーク幅セット２５２も、記憶装置１０６に登録されている。前述したように分解能やその他の条件が異なるとピーク幅PWが異なってくるため、ピーク幅セット２５２には、異なる条件セットに対応して異なるピーク幅PWが設定されている。 Further, as illustrated in FIG. 14B, a peak width set 252 indicating a preset peak width PW is also registered in the storage device 106. As described above, when the resolution and other conditions are different, the peak width PW is different. Therefore, different peak widths PW are set in the peak width set 252 corresponding to different condition sets.

ところで、変形例として、図１４Aに例示された指定ピーク情報セット２５０A、２５０B、２５０Cには、ケミカルシフト、分裂パターン及びスピン結合定数などの数値データと共に、又はそれに代えて、それらの数値データに基づき上記計算方法により計算されたピーク領域を示すデータ（例えば、始点と終点の数値データ）が登録されてもよい。 As a modification, the designated peak information sets 250A, 250B, 250C illustrated in FIG. 14A are based on numerical data such as chemical shift, splitting pattern, and spin coupling constant, or instead of the numerical data. Data indicating the peak area calculated by the above calculation method (for example, numerical data of the start point and the end point) may be registered.

さて、以上のような指定ピーク情報を用いて、図６に示されたステップ１７６の、指定ピーク情報に基づくバケットの自動修正処理が、次の手順で行われる。 Now, using the designated peak information as described above, the bucket automatic correction processing based on the designated peak information in step 176 shown in FIG. 6 is performed in the following procedure.

図１５は、このバケット自動修正処理１７６の流れを示す。 FIG. 15 shows the flow of this bucket automatic correction processing 176.

図１５に示すように、ステップ２６０で、任意に指定された１以上のピークのそれぞれについてピーク定義情報が入力され、そのピーク定義情報に基づいて各指定ピークのピーク定義情報が生成される。或いは、予め記憶装置１０６に登録されている図１４Aに例示されたような指定ピーク情報セット２５０A、２５０B、２５０Cの中から、今回の用途に対応した指定ピーク情報セットが選択され、その選択された指定ピーク情報セットからピーク毎の指定ピーク情報が読み込まれる。さらに、予め設定されている図１４Bに例示されたような条件毎のピーク幅データの中から、今回の解析に最適なピーク幅データが読込まれる。 As shown in FIG. 15, in step 260, peak definition information is input for each of one or more arbitrarily designated peaks, and peak definition information for each designated peak is generated based on the peak definition information. Alternatively, the designated peak information set corresponding to the current use is selected from the designated peak information sets 250A, 250B, and 250C illustrated in FIG. 14A registered in the storage device 106 in advance, and the selected peak information set is selected. The specified peak information for each peak is read from the specified peak information set. Further, the optimum peak width data for the current analysis is read from the preset peak width data for each condition as illustrated in FIG. 14B.

そして、ステップ２６２で、各指定ピークの指定ピーク情報とピーク幅データに基づいて、上述したような計算方法により、各指定ピークのピーク領域が決定され、そのピーク領域がそのまま積分ブロックとして設定される。例えば、図１３A,Bに例示された２種類のピークの場合、それらのピークのピーク領域２４０x及び２４０yがそれぞれ積分ブロックとして設定される。 In step 262, the peak area of each designated peak is determined by the calculation method as described above based on the designated peak information and peak width data of each designated peak, and the peak area is set as an integration block as it is. . For example, in the case of the two types of peaks illustrated in FIGS. 13A and 13B, the peak areas 240x and 240y of these peaks are set as integration blocks, respectively.

その後、ステップ２６４で、図１２に示した自動積分ブロック設定に基づくバケット修正処理１７４のステップ２３２〜２３８と同様の手順で、記憶装置１０６に記憶されている1度修正されたバケットセットが、指定ピークに対応する積分ブロックに応じて更に修正される。この２回目のバケット修正により、いずれの指定ピークのピーク領域も複数のバケットにより分割されないようになる（つまり、いずれの指定ピークのピーク領域の必ずいずれか一つのバケット内に入るようになる）。また、ここで修正がなされたバケットについては、投影ADスペクトルSpのバケット積分が再計算され、再計算されたバケットの積分データにより、記憶装置１０６に記憶されている対応するバケットの積分データが置き換えられる。 Thereafter, in step 264, the bucket set that has been corrected once is stored in the storage device 106 in the same procedure as steps 232 to 238 of the bucket correction processing 174 based on the automatic integration block setting shown in FIG. Further correction is made according to the integration block corresponding to the peak. By the second bucket correction, the peak area of any designated peak is not divided by a plurality of buckets (that is, the peak area of any designated peak always falls within any one bucket). In addition, for the bucket modified here, the bucket integral of the projection AD spectrum Sp is recalculated, and the integral data of the corresponding bucket stored in the storage device 106 is replaced by the recalculated bucket integral data. It is done.

そして、ステップ２３９で、このように２度修正されたバケットセットと積分データセットが、記憶装置１０６に保存され、また、必要に応じて外部に出力される。 In step 239, the bucket set and the integral data set corrected twice in this way are stored in the storage device 106, and are output to the outside as necessary.

次に、図６に示されたステップ１７８の、バケットの手動修正処理について説明する。この処理１７８は、上述した２回の自動的なバケット修正だけでは満足できない場合、オペレータが手動でバケットを更に修正できるようにするためのものである。 Next, the bucket manual correction process in step 178 shown in FIG. 6 will be described. This process 178 is for allowing the operator to further modify the bucket manually if the above-described two automatic bucket modifications are not sufficient.

図１６A,Bはバケットの手動修正処理１７８の原理を示し、図１７はバケットの手動修正処理１７８の流れを示す。 16A and 16B show the principle of the manual correction processing 178 of the bucket, and FIG. 17 shows the flow of the manual correction processing 178 of the bucket.

この処理１７８において、記憶装置１０６に保存されている投影ADスペクトルSpと前述した２度の自動修正を終えたバケットセットが、NMRスペクトル処理装置１００のディスプレイスクリーンに表示される。オペレータはその表示を見ながら、任意のバケットの始点又は終点に対する修正要求をNMRスペクトル処理装置１００に入力する。例えば、図１６Aに例示するように、或るバケット２００jを修正したい場合、オペレータは、そのバケット２００jを指定した上で、その始点を終点の修正要求として、新たな始点s(j)と新たな終点e(j)を入力する。すると、図１７に示すように、ステップ２７０で、指定されたバケット２００jの始点と終点が、オペレータから入力された始点s(j)と終点e(j)と同一値に修正される。また、ステップ２７２で、指定されたバケットの始点s(j)側の隣バケットの終点e(i)が指定バケットの始点s(ｊ)と同一値に修正され、ステップ２７４で、指定バケットの終点e(j)側の隣バケットの始点s(k)が指定バケットの終点e(ｊ)と同一値に修正される。その後、ステップ２７６で、修正されたバケット毎にバケット積分が再計算され、再計算された積分データにより、記憶装置１０６に記憶されている対応するバケットの積分データが置き換えられる。そして、ステップ２７８で、手動修正の終わったバケットセットと、そのバケットセットに基づいて上記のように修正された積分データセットとが、記憶装置１０６に保存され、また必要に応じて外部へ出力される。 In this process 178, the projected AD spectrum Sp stored in the storage device 106 and the bucket set that has been subjected to the above-described two automatic corrections are displayed on the display screen of the NMR spectrum processing apparatus 100. The operator inputs a correction request for the start point or end point of an arbitrary bucket into the NMR spectrum processing apparatus 100 while viewing the display. For example, as illustrated in FIG. 16A, when a certain bucket 200j is to be corrected, the operator designates the bucket 200j and uses the start point as a request for correcting the end point, and sets a new start point s (j) and a new start point s (j). Enter the end point e (j). Then, as shown in FIG. 17, in step 270, the start point and end point of the designated bucket 200j are corrected to the same value as the start point s (j) and end point e (j) input from the operator. In step 272, the end point e (i) of the adjacent bucket on the start point s (j) side of the designated bucket is corrected to the same value as the start point s (j) of the designated bucket. In step 274, the end point of the designated bucket The start point s (k) of the adjacent bucket on the e (j) side is corrected to the same value as the end point e (j) of the designated bucket. Thereafter, in step 276, the bucket integral is recalculated for each modified bucket, and the integral data of the corresponding bucket stored in the storage device 106 is replaced with the recalculated integral data. In step 278, the manually corrected bucket set and the integration data set corrected as described above based on the bucket set are stored in the storage device 106, and output to the outside as necessary. The

以上のように、図６のステップ１７２で、全バケットが一定幅（例えば０．０４ppm）をもつ均等幅のバケットセットが初期的に設定され、その後、ステップ１７４と１７６と１７８の３段階のバケット修正が行われて、バケットにより幅の異なる不均等幅のバケットセットが完成する。なお、ステップ１７４と１７６と１７８の３つの修正段階は、必ずしもその全てが実行されなければならないわけではなく、いずれかの修正段階（例えば手動修正）が省略されても良い。いずれにしても、最終的に完成された不均等幅のバケットセットは「最適化バケットセット」と呼ばれる。 As described above, in Step 172 of FIG. 6, a uniform-width bucket set in which all buckets have a constant width (for example, 0.04 ppm) is initially set, and thereafter, three-stage buckets of Steps 174, 176, and 178 are performed. Corrections are made to complete a non-uniform bucket set with different widths depending on the bucket. Note that not all of the three correction stages of steps 174, 176, and 178 have to be executed, and any of the correction stages (for example, manual correction) may be omitted. In any case, the finally completed non-uniform width bucket set is called an “optimized bucket set”.

図１８は、記憶装置１０６に保存された最適化バケットセットのデータ構造例を示す。 FIG. 18 shows an example data structure of an optimized bucketset stored in the storage device 106.

図１８に示すように、スペクトル解析の異なる用途にそれぞれ対応して異なる最適化バケットセット２８０A、２８０B、２８０Cが、記憶装置１０６に記憶されている。用途が異なれば、既に説明したように指定ピークが異なり、また、処理対象のNMRスペクトル自体も異なるので、当然、出来上がる最適化バケットセットが異なるからである。最適化バケットセット２８０A、２８０B、２８０Cの各々には、用途を示す用途ID、ピーク幅PWを示す数値データ、ケミカルシフト軸スケールの基準点（原点）を示す数値データ、並びに、このバケットセットを構成する多数のバケットのバケット番号、始点及び終点のデータなどが登録されている。さらに、いずれかの指定ピークに対応するバケットには、その対応する指定ピークを指すピークIDも登録されている。 As illustrated in FIG. 18, different optimization bucket sets 280A, 280B, and 280C corresponding to different uses of spectrum analysis are stored in the storage device 106. This is because, if the use is different, the designated peak is different as described above, and the NMR spectrum itself to be processed is also different, so that naturally the optimized bucket set is different. Each of the optimized bucket sets 280A, 280B, and 280C includes an application ID indicating an application, numerical data indicating a peak width PW, numerical data indicating a reference point (origin) of the chemical shift axis scale, and the bucket set. The bucket number, start point and end point data of a large number of buckets are registered. Furthermore, a peak ID indicating the corresponding designated peak is also registered in the bucket corresponding to any designated peak.

上述した３段階のバケット修正処理の説明から分かるように、最適化バケットセットにおいては、全てのバケットが均等な幅（例えば０．０４ppm）をもってはおらず、それぞれに特有の不均等な幅をもっている。そして、投影ADスペクトルSpから自動的に検出された全てのピークについても、加えて、指定ピーク情報により指定された全てのピークについても、各ピークのピーク領域は必ずいずれか一つのバケットの中に入っており、いずれのピーク領域も複数バケットにより分割されてはいない。さらに、この最適化バケットセットの作成の基礎となった投影ADスペクトルSpには、解析対象である多数のADスペクトル上のほぼ全てのピークが投影されており、それらのピークのピーク領域が最適化バケットセット反映されている。さらに、解析用途のために重要であると考えられる既知の指定ピークのピーク領域も、最適化バケットセットに反映されている。従って、この最適化バケットセットを用いて、解析対象である多数のADスペクトルのそれぞれのバケット積分を行なうことにより、いずれのADスペクトルも、そこに含まれるほぼ全てのピークの情報が良好に維持された状態で、積分データセット（ヒストグラム）に変換されることになる。 As can be seen from the description of the three-stage bucket correction process described above, in the optimized bucketset, not all buckets have a uniform width (for example, 0.04 ppm), but have unique unequal widths. And for all peaks automatically detected from the projected AD spectrum Sp, in addition to all the peaks specified by the specified peak information, the peak area of each peak must be in any one bucket. And no peak area is divided by multiple buckets. Furthermore, almost all the peaks on the many AD spectra to be analyzed are projected on the projected AD spectrum Sp, which is the basis for creating this optimized bucket set, and the peak area of those peaks is optimized. The bucket set is reflected. Furthermore, the peak areas of known designated peaks that are considered important for analytical applications are also reflected in the optimized bucket set. Therefore, using this optimized bucket set, each bucket integration of a large number of AD spectra to be analyzed is performed, so that information on almost all the peaks contained in each AD spectrum is well maintained. In this state, it is converted into an integral data set (histogram).

再び図６を参照する。最適化バケットセットが完成すると、ステップ１８０の、多数ADスペクトルの最適化バケット積分処理が行われる。 Refer to FIG. 6 again. When the optimized bucket set is completed, an optimized bucket integration process of multiple AD spectra in step 180 is performed.

図１９は、多数ADスペクトルの最適化バケット積分処理１８０の流れを示す。 FIG. 19 shows a flow of optimization bucket integration processing 180 for multiple AD spectra.

図１９に示すように、ステップ２９０で、最適化バケットセットが記憶装置１０６から読込まれる。ステップ２９２で、解析対象である多数のADスペクトルの各々が記憶装置１０６から読込まれる。ステップ２９２で、読込まれたADスペクトルのケミカルシフト軸スケール（δスケール）が、最適化バケットセットのケミカルシフト軸スケール（δスケール）に合わされる（例えば、ADスペクトルの基準点が最適化バケットセットの基準点に一致するように、ADスペクトルのケミカルシフト軸スケールがシフトされる。）。そして、ステップ２９６で、そのADスペクトルのバケット積分が、最適化バケットセットを用いて実行され、ステップ２９８で、そのバケット積分で求まった積分データセット（ヒストグラムデータ）が記憶装置１０６に保存され、必要に応じて外部へ出力される。 As shown in FIG. 19, at step 290, an optimized bucket set is read from the storage device. In step 292, each of a large number of AD spectra to be analyzed is read from the storage device 106. In step 292, the chemical shift axis scale (δ scale) of the read AD spectrum is adjusted to the chemical shift axis scale (δ scale) of the optimized bucket set (eg, the reference point of the AD spectrum is The chemical shift axis scale of the AD spectrum is shifted to match the reference point.) Then, in step 296, bucket integration of the AD spectrum is executed using the optimized bucket set, and in step 298, an integration data set (histogram data) obtained by the bucket integration is stored in the storage device 106 and necessary. Depending on the output.

ステップ２９９により、解析対象の全てのADスペクトルについて、上述したステップ２９２〜２９８の処理が繰り返される。その結果、解析対象の全てのADスペクトルが、それぞれ積分データセット（ヒストグラムデータ）に縮約されて、記憶装置１０６に保存される。 In step 299, the above-described processing in steps 292 to 298 is repeated for all AD spectra to be analyzed. As a result, all AD spectra to be analyzed are reduced to integral data sets (histogram data) and stored in the storage device 106.

再び図６を参照する。その後、ステップ１８２で、解析対象の全てのADスペクトルの最適化バケット積分データセット（ヒストグラムデータ）が、次の統計処理に渡すのに適したデータ形式に整えられた上で、記憶装置１０６に保存され、必要に応じて外部へ出力される。 Refer to FIG. 6 again. After that, in step 182, the optimized bucket integral data set (histogram data) of all AD spectra to be analyzed is arranged in a data format suitable for passing to the next statistical processing and stored in the storage device 106. And output to the outside as necessary.

以上図６に示された流れに沿って説明した処理が実行されることにより、図１に示された最適化バケット積分を用いたデータの縮約処理１１２と、積分結果データの保存／出力処理１１４が完了する。その後に、図１に示される統計処理１１６が実行される。この統計処理１１６では、解析対象の全てのADスペクトルの積分データセット（ヒストグラムデータ）が記憶装置１０６から読込まれ、それらの積分データセット（ヒストグラムデータ）を用いて、所定の統計処理（例えば多変量解析法の一つである主成分分析）が実行され、そして、マーカとなるピークの確定などの解析結果が生成され出力される。統計処理の具体的な手法としては、公知の種々の方法を用いることができる。例えば、後に詳述するように、本発明の原理に従う統計処理では、いずれも多変量解析法の一つである主成分分析（Principal component Analysis：PCA）と部分空間法（Soft Independent Modeling of Class Analogy：SIMCA）とが組み合わせて用いられる。既に説明するようなしたように、統計処理の材料であるADスペクトルの積分データセット（ヒストグラムデータ）には、ADスペクトル上の重要なピークの情報が良好に含まれているので、統計処理の結果は精度の高いものとなる。 6 is executed, the data reduction processing 112 using the optimized bucket integration shown in FIG. 1 and the integration result data storage / output processing are performed. 114 is complete. Thereafter, the statistical processing 116 shown in FIG. 1 is executed. In this statistical process 116, an integrated data set (histogram data) of all AD spectra to be analyzed is read from the storage device 106, and a predetermined statistical process (for example, multivariate) is used by using these integrated data sets (histogram data). A principal component analysis which is one of analysis methods is executed, and an analysis result such as determination of a peak serving as a marker is generated and output. As a specific method of statistical processing, various known methods can be used. For example, as will be described in detail later, in the statistical processing according to the principle of the present invention, principal component analysis (PCA) and subspace method (Soft Independent Modeling of Class Analogy), both of which are multivariate analysis methods. : SIMCA). As already explained, the integrated data set (histogram data) of the AD spectrum, which is the material for statistical processing, contains information on important peaks on the AD spectrum well. Is highly accurate.

図２０は、本発明に従うNMRデータ処理装置の別の実施形態の全体的な構成と機能を示す。 FIG. 20 shows the overall configuration and function of another embodiment of the NMR data processing apparatus according to the present invention.

このNMRデータ処理装置３００は、前述のNMRデータ処理装置１００とは別の用途に使用される。すなわち、前述のNMRデータ処理装置１００の用途は、多数のサンプルのNMRスペクトルを解析し統計的なデータを得ることであるのに対し、このNMRデータ処理装置３００の用途は、基本的に一つのサンプルのNMRスペクトルを分析してそのサンプルの性質又は成分などを検査すること（例えば、或る人の血液のNMRスペクトルを分析して、その人が特定の疾病に罹患してないかどうか検査すること、あるいは、或る食品のNMRスペクトルを分析して、そこに如何なる物質が含有されているか検査すること、など）にある。このNMRデータ処理装置３００も、前述のNMRデータ処理装置１００と同様に、プログラムされたコンピュータシステムを用いて実現することができる。 The NMR data processing apparatus 300 is used for a different application from the NMR data processing apparatus 100 described above. That is, the use of the NMR data processing apparatus 100 described above is to analyze the NMR spectra of a large number of samples and obtain statistical data, whereas the use of the NMR data processing apparatus 300 is basically one. Analyzing the NMR spectrum of a sample to examine the nature or composition of the sample (eg, analyzing the NMR spectrum of a person's blood to see if the person is afflicted with a particular disease Or analyzing the NMR spectrum of a food and examining what substances it contains). This NMR data processing apparatus 300 can also be realized by using a programmed computer system in the same manner as the NMR data processing apparatus 100 described above.

図２０に示すように、NMRデータ処理装置３００のプロセッサ３０４は、記憶装置３０６に記憶されている所定のコンピュータプログラムを実行することにより、FIDデータ入力処理３０８、NMRスペクトル処理３１０、最適化バケット積分を用いたデータ縮約処理３１２、ヒストグラム分析処理３１４、分析結果データ保存／出力処理３１６を行う。記憶装置３０６には、上記コンピュータプログラムの他に、予め、図１８に例示したような用途別の最適化バケットセット２８０A、２８０B、２８０Cが記憶されている。さらに記憶装置３０６には、検査目的に対応した参照モデルデータが記憶されている。検査目的に対応した参照モデルデータとは、検査の判断を行う際に検査対象のNMRデータと対比される基準のデータである。例えば、人の血液からその人が特定の疾病に罹患してないかどうか検査するという用途の場合であれば、その特定の疾病に罹患している多数の人々の血液サンプルのNMRデータを前述のNMRデータ処理装置２００で解析することによって確定されたマーカの情報が、参照モデルデータの一つとして採用することができる。 As shown in FIG. 20, the processor 304 of the NMR data processing device 300 executes a predetermined computer program stored in the storage device 306, thereby performing an FID data input processing 308, an NMR spectrum processing 310, an optimized bucket integration. A data reduction process 312, a histogram analysis process 314, and an analysis result data storage / output process 316 are performed. In addition to the computer program, the storage device 306 stores in advance optimization bucket sets 280A, 280B, and 280C for each application as illustrated in FIG. Further, the storage device 306 stores reference model data corresponding to the inspection purpose. The reference model data corresponding to the inspection purpose is standard data to be compared with the NMR data to be inspected when the inspection is determined. For example, if the application is to test a person's blood to see if the person is afflicted with a particular disease, the NMR data of blood samples of a large number of people with the particular illness Marker information determined by analysis by the NMR data processing apparatus 200 can be adopted as one of the reference model data.

FIDデータ入力処理３１２では、NMR装置１０２から出力される検査対象の一サンプルのFIDデータが入力され、記憶装置３０６に記憶される。NMRスペクトル処理３１０では、入力されたFIDデータに対して、図２に示されたと同様の流れの処理が行われて、そのFIDデータの絶対値微分であるADスペクトルが生成され、記憶装置３０６に記憶される。 In the FID data input process 312, FID data of one sample to be examined output from the NMR apparatus 102 is input and stored in the storage device 306. In the NMR spectrum processing 310, the input FID data is processed in the same flow as shown in FIG. 2 to generate an AD spectrum that is an absolute value derivative of the FID data, and is stored in the storage device 306. Remembered.

データ縮約処理３１２では、記憶装置３０６に予め記憶されている用途に応じた最適化バケットセットを用いた最適化バケット積分が、検査対象のADスペクトルに対して実行され、それにより、検査対象のADスペクトルが最適化された積分データセット（ヒストグラム）に縮約される。ヒストグラム分析処理３１４では、検査対象の積分データセット（ヒストグラム）が、記憶装置３０６に予め記憶されている用途に応じた参照モデルデータと対比され、対比結果に基づく判断がなされて検査結果が生成される。分析結果の保存／出力処理３１６では、生成された検査結果が記憶装置３０６に保存され、また外部へ出力される。 In the data reduction process 312, an optimization bucket integration using an optimization bucket set corresponding to an application stored in advance in the storage device 306 is performed on the AD spectrum to be inspected. The AD spectrum is reduced to an optimized integral data set (histogram). In the histogram analysis process 314, the integral data set (histogram) to be inspected is compared with reference model data corresponding to the use stored in advance in the storage device 306, and a determination based on the comparison result is made to generate an inspection result. The In the analysis result storage / output process 316, the generated inspection result is stored in the storage device 306 and output to the outside.

図２１は、図２０に示された最適化バケット積分を用いたデータ縮約処理３１２の流れを示す。 FIG. 21 shows the flow of the data reduction processing 312 using the optimized bucket integration shown in FIG.

ステップ３２０で、検査対象のADスペクトルが記憶装置３０６から読込まれ、ステップ３２２で、用途に対応した最適化バケットセットが記憶装置３０６から読込まれる。ステップ３２４で、検査対象のADスペクトルのケミカルシフト軸スケール（δスケール）が最適化バケットセットのそれに一致させられる。その後、ステップ３２６で、最適化バケットセットを用いた最適化バケット積分が、検査対象のADスペクトルに対して実行され、検査対象の積分データセット（ヒストグラム）が求まる。ステップ３２８で、検査対象の積分データセット（ヒストグラム）が記憶装置３０６に保存され、必要に応じて外部へ出力される。 In step 320, the AD spectrum to be examined is read from the storage device 306, and in step 322, an optimized bucket set corresponding to the application is read from the storage device 306. At step 324, the chemical shift axis scale (δ scale) of the AD spectrum to be examined is matched to that of the optimized bucket set. Thereafter, in step 326, an optimized bucket integration using the optimized bucket set is performed on the AD spectrum to be inspected to obtain an integral data set (histogram) to be inspected. In step 328, the integral data set (histogram) to be inspected is stored in the storage device 306 and output to the outside as necessary.

図２２は、図２０に示されたヒストグラムの分析処理３１４の流れを示す。 FIG. 22 shows the flow of the histogram analysis processing 314 shown in FIG.

ステップ３３０で、検査対象の積分データセット（ヒストグラム）が記憶装置３０６から読込まれる。ステップ３３２で、検査目的に対応した参照モデルデータが記憶装置３０６から読込まれる。ステップ３３４で、検査対象の積分データセット（ヒストグラム）と、検査目的に対応した参照モデルデータとが対比され、検査結果が判断される。ステップ３３６で、検査結果が記憶装置３０６に保存され、また外部へ出力される。 In step 330, the integral data set (histogram) to be examined is read from the storage device 306. In step 332, reference model data corresponding to the inspection purpose is read from the storage device 306. In step 334, the integral data set (histogram) to be inspected is compared with the reference model data corresponding to the inspection purpose, and the inspection result is determined. In step 336, the inspection result is stored in the storage device 306 and output to the outside.

以上のようなサンプルの検査を用途とするNMRデータ処理においても、用途に応じて予め用意された最適化バケットセットを用いた最適化バケット積分がスペクトルに対して行われるので、スペクトルがもつピーク情報が良好に維持され、精度の高い検査結果を得ることができる。また、分析には、NMRスペクトルの絶対値微分であるADスペクトルが用いられるため、面倒な位相補正やベースライン補正が不要になり、処理効率が向上する。 Even in the NMR data processing for the sample inspection as described above, the optimization bucket integration using the optimization bucket set prepared in advance according to the application is performed on the spectrum, so the peak information that the spectrum has Is maintained well, and a highly accurate inspection result can be obtained. In addition, since an AD spectrum, which is an absolute value derivative of the NMR spectrum, is used for the analysis, troublesome phase correction and baseline correction become unnecessary, and processing efficiency is improved.

次に、本発明の一実施形態にかかるNMRスペクトルの統計処理装置について説明する。 Next, an NMR spectrum statistical processing apparatus according to an embodiment of the present invention will be described.

本発明の一実施形態にかかる統計処理装置は、図１に示したNMRデータ処理装置１００内に統計処理部１１６として組み込まれており、そして、典型的には、既に説明した前処理部分（図１内の処理部１０８，１１０，１１２，１１４）と一緒にコンピュータプログラムとして具現化されている。しかし、これは説明のための一つの例示にすぎず、本発明に従うNMRスペクトルの統計処理装置は、既に説明した前処理部分からは分離された単独装置として構成されることもできる。 The statistical processing apparatus according to an embodiment of the present invention is incorporated as the statistical processing unit 116 in the NMR data processing apparatus 100 shown in FIG. 1, and typically, the preprocessing part (see FIG. 1 and the processing units 108, 110, 112, and 114) in FIG. However, this is merely an example for explanation, and the NMR spectrum statistical processing apparatus according to the present invention can be configured as a single apparatus separated from the pre-processed part already described.

図２２は、本発明の一実施形態にかかるNMRスペクトルの統計処理装置、すなわち、図１に示した統計処理部１１６、の機能的な構成を示す。 FIG. 22 shows a functional configuration of an NMR spectrum statistical processing apparatus according to an embodiment of the present invention, that is, the statistical processing unit 116 shown in FIG.

図２２に示すように、統計処理部１１６は、多変量解析部３４０と、ユーザ要求入力部３４２と、表示制御部３４４とを有する。 As illustrated in FIG. 22, the statistical processing unit 116 includes a multivariate analysis unit 340, a user request input unit 342, and a display control unit 344.

多変量解析部３４０は、PCA解析部３４６とSIMCA解析部３４８とを有する。PCA解析部３４６は、記憶装置１０６に保存されている多数のサンプルのNMRスペクトルの積分データセット（以下、「NMRヒストグラム」という）を入力して、主成分分析（Principal component Analysis： PCA）による多変量解析を行なう。また、SIMCA解析部３４８は、記憶装置１０６に保存されている多数のサンプルのNMRヒストグラムを入力して、部分空間法（Soft Independent Modeling of Class Analogy：SIMCA）（以下、SIMCAという）による多変量解析を行なう。PCA解析部３４６とSIMCA解析部３４８の入力となる解析対象としての多数のサンプルのヒストグラムのデータは、記憶装置１０６内のPCAテーブル３５０に記録されている。PCAテーブル３５０の構成については、後に説明する。 The multivariate analysis unit 340 includes a PCA analysis unit 346 and a SIMCA analysis unit 348. The PCA analysis unit 346 inputs an integral data set (hereinafter referred to as “NMR histogram”) of NMR spectra of a large number of samples stored in the storage device 106, and performs multi-component analysis (Principal component Analysis: PCA). Perform a random analysis. The SIMCA analysis unit 348 inputs NMR histograms of a large number of samples stored in the storage device 106, and performs multivariate analysis using a subspace method (Soft Independent Modeling of Class Analogy: SIMCA) (hereinafter referred to as SIMCA). To do. Histogram data of a large number of samples as analysis targets that are input to the PCA analysis unit 346 and the SIMCA analysis unit 348 are recorded in the PCA table 350 in the storage device 106. The configuration of the PCA table 350 will be described later.

記憶装置１０６内には、図１に示した積分結果データの保存／出力処理１１４により書き込まれた大量のサンプルのNMRヒストグラムデータ３５６がある。PCA解析部３４６とSIMCA解析部３４８は、それぞれ、その大量のサンプルのNMRヒストグラムデータ３５６の中から、任意の多数のサンプルのNMRヒストグラムのデータを解析対象として抽出してPCAテーブル３５０に登録することができる。多変量解析部３４０は、ユーザからの要求に応じてPCAテーブル３５０の更新も行なう。また、PCA解析部３４６とSIMCA解析部３４８は、それぞれ、PCA解析結果（例えば、サンプルのスコア及び変数のローディングなど）とSIMCA解析結果（サンプルの各クラスまでの距離、変数のモデリング力及び識別力など）を、解析結果データ３５２として、記憶装置１０６内に保存することができる。 In the storage device 106, there is a large number of sample NMR histogram data 356 written by the integration result data storage / output process 114 shown in FIG. Each of the PCA analysis unit 346 and the SIMCA analysis unit 348 extracts NMR histogram data of an arbitrarily large number of samples from the large number of sample NMR histogram data 356 as an analysis target, and registers them in the PCA table 350. Can do. The multivariate analysis unit 340 also updates the PCA table 350 in response to a request from the user. Also, the PCA analysis unit 346 and the SIMCA analysis unit 348 respectively have a PCA analysis result (for example, sample score and variable loading) and a SIMCA analysis result (distance to each class of sample, variable modeling ability and discriminating ability). And the like can be stored in the storage device 106 as analysis result data 352.

SIMCA解析部３４８とPCA解析部３４６は、後に詳述するように、前者による解析結果が後者の解析にフィードバックされるように連携して動作することができる。それにより、このような多変量解析を行なう目的、例えばメタボノミクスにおいてバイオマーカを探し当てる目的が、より容易に達成されるようになる。 As will be described later in detail, the SIMCA analysis unit 348 and the PCA analysis unit 346 can operate in cooperation so that the analysis result of the former is fed back to the latter analysis. Accordingly, the purpose of performing such multivariate analysis, for example, the purpose of finding a biomarker in metabonomics can be achieved more easily.

表示制御部３４４は、PCA解析部３４６からPCA解析結果（スコア及びローディングなど）を受け、また、SIMCA解析部３４８からとSIMCA解析結果（各クラスまでの距離、モデリング力及び識別力など）を受けて、そして、そのPCA解析結果とSIMCA解析結果を表示したグラフィカルユーザインタフェース（以下、「UI」という）３４６を作成し、そのUI３４６をコンピュータシステム１００（図１）のディスプレイ装置（図示省略）に表示する。表示制御部３４４は、記憶装置１０６に記憶されている多数のサンプルのNMRスペクトルデータ３５０の中から、ユーザにより選択された１以上のサンプルのNMRスペクトルデータを読み込み、そのUI３４６上に、PCA解析結果とSIMCA解析結果と一緒に、ユーザにより選択されたサンプルのNMRスペクトルを表示する。なお、この実施形態では、表示されるNMRスペクトルデータは、元のNMRスペクトルデータではなく、それを絶対値微分して得られたADスペクトルデータであるが、これを以下では単に「NMRスペクトルデータ」と呼ぶ。 The display control unit 344 receives the PCA analysis result (score, loading, etc.) from the PCA analysis unit 346, and receives the SIMCA analysis result (distance to each class, modeling power, discriminating power, etc.) from the SIMCA analysis unit 348. Then, a graphical user interface (hereinafter referred to as “UI”) 346 displaying the PCA analysis result and the SIMCA analysis result is created, and the UI 346 is displayed on the display device (not shown) of the computer system 100 (FIG. 1). To do. The display control unit 344 reads the NMR spectrum data of one or more samples selected by the user from the NMR spectrum data 350 of a large number of samples stored in the storage device 106, and the PCA analysis result is displayed on the UI 346. Along with the SIMCA analysis results, the NMR spectrum of the sample selected by the user is displayed. In this embodiment, the displayed NMR spectrum data is not the original NMR spectrum data, but is AD spectrum data obtained by differentiating the absolute value thereof. Hereinafter, this is simply referred to as “NMR spectrum data”. Call it.

表示制御部３４４は、UI３４６上のPCA解析結果とSIMCA解析結果の表示とNMRスペクトルの表示とが連携するように、UI３４６上での両者の表示を制御する。それにより、オペレータは多変量解析結果とNMRスペクトルとの関連性が視覚的に容易に理解できるようになり、このような多変量解析を行なう目的、例えばメタボノミクスにおいてバイオマーカを探し当てる目的が、より容易に達成されるようになる。さらに、そのUI３４６上には、ユーザが各種の要求を入力するための各種の入力ボタンも表示される。そのUI３４６の詳細については、後に説明する。 The display control unit 344 controls the display on the UI 346 so that the display of the PCA analysis result, the SIMCA analysis result on the UI 346, and the display of the NMR spectrum cooperate. As a result, the operator can easily visually understand the relationship between the multivariate analysis results and the NMR spectrum, and the purpose of such multivariate analysis, for example, to find biomarkers in metabonomics, Will be easily achieved. Furthermore, on the UI 346, various input buttons for the user to input various requests are also displayed. Details of the UI 346 will be described later.

ユーザ要求入力部３４２は、ユーザによりマウスまたはキーボードなどから上記UIに入力された各種の要求を受け、その入力要求を上述した多変量解析部３４０および表示制御部３４４に渡す。多変量解析部３４０および表示制御部３４４は、ユーザから入力された要求に応答して、後述するような各種の動作を行う。 The user request input unit 342 receives various requests input to the UI from a mouse or a keyboard by the user, and passes the input requests to the multivariate analysis unit 340 and the display control unit 344 described above. The multivariate analysis unit 340 and the display control unit 344 perform various operations as described below in response to a request input from the user.

図２４は、図２３に示したPCAテーブル３５０の構成例を示す。 FIG. 24 shows a configuration example of the PCA table 350 shown in FIG.

既に述べたように、PCAテーブル３５０には、解析の対象となる多数のサンプルのNMRヒストグラムのデータが記録されている。すなわち、図２４に示すように、PCAテーブル３５０には、ヒストグラムを作成するときに行なわれたバケット積分に関するバケット積分条件４００が記録されており、これには、バケット積分が行なわれたケミカルシフトδの全範囲（図９に示した全範囲２０２に相当）、バケットの基本的な幅（図９に示した幅Ｗに相当）、自動積分ブロック設定でバケット幅の調整を行ったか否か、全範囲の合計積分値、及びダークリジョン（（図９に示したダークリジョン２０４に相当）（図示の例では、５．３ｐｐｍから４．３ｐｐｍという１つのダークリジョンだけが登録されているが、複数のダークリジョンが登録され得る）が含まれる。 As described above, in the PCA table 350, NMR histogram data of a large number of samples to be analyzed are recorded. That is, as shown in FIG. 24, the PCA table 350 records a bucket integration condition 400 related to the bucket integration performed when creating the histogram, and this includes the chemical shift δ on which the bucket integration was performed. 9 (corresponding to the entire range 202 shown in FIG. 9), the basic width of the bucket (corresponding to the width W shown in FIG. 9), whether or not the bucket width is adjusted by the automatic integration block setting, The total integrated value of the range and dark region (corresponding to the dark region 204 shown in FIG. 9) (in the example shown, only one dark region of 5.3 ppm to 4.3 ppm is registered. Dark region can be registered).

また、PCAテーブル３５０には、解析対象のサンプルの数４０２、各サンプルを構成する多数の変数の数（つまり、バケットの数）４０４、及び、各サンプルのNMRスペクトルデータが格納されているフォルダ名４０６とファイル名４０８のセット（つまり、各サンプルのNMRスペクトルデータファイルのパス名）が記録されている。ここで、前述した図２３では、図示の都合から、多数のサンプルのNMRスペクトルデータは纏めて1つのブロック３５４で示されているが、このブロック３５４は実際には、サンプル毎のNMRスペクトルデータを格納した多数のスペクトルデータファイルの集まりであり、それら多数のスペクトルデータファイルのパス名（フォルダ名４０６とファイル名４０８）がPCAテーブル３５０に記録されている。図２４では、図示の都合から、３つのサンプルのスペクトルデータファイルのパス名しか示されていないが、実際には、サンプル数４０２に示された数のサンプルの全てのスペクトルデータファイルのパス名がPCAテーブル３５０に記録されている。 In the PCA table 350, the number of samples to be analyzed 402, the number of many variables (that is, the number of buckets) 404 constituting each sample, and the name of the folder storing the NMR spectrum data of each sample are stored. A set of 406 and file name 408 (that is, the path name of the NMR spectrum data file of each sample) is recorded. Here, in FIG. 23 described above, for the convenience of illustration, the NMR spectrum data of a large number of samples are collectively shown in one block 354, but this block 354 actually shows the NMR spectrum data for each sample. It is a collection of a large number of stored spectral data files, and the path names (folder name 406 and file name 408) of the large number of spectral data files are recorded in the PCA table 350. In FIG. 24, only the path names of the spectrum data files of three samples are shown for the convenience of illustration, but actually, the path names of all the spectrum data files of the number of samples shown in the sample number 402 are shown. It is recorded in the PCA table 350.

さらに、PCAテーブル３５０には、UI３４６の画面上にプロットされるサンプル毎のマークの形状とカラーをそれぞれ指定したプロットタイプ４１０とプロットカラー４１２が定義されている。プロットタイプ４１０は、例えば、値「０」が×形マーク、値「１」が塗りつぶしの無い円形マーク、値「−１」が黒く塗りつぶされた円形マーク、…などを指定する。また、プロットカラー４１２は、３原色である赤（R)、緑（G）及び青（B)の明度（濃度）値のセットで定義される。後述するように、UI３４６上にはPCA解析結果の一つであるスコアプロットや、SIMCA解析結果の一つであるクーマンプロットが表示されるが、そのスコアプロットやクーマンプロット上では、各サンプルは、PCAテーブル３５０で指定された形状とカラーを持ったマークで表示されることになる。 Furthermore, the PCA table 350 defines a plot type 410 and a plot color 412 that specify the shape and color of the mark for each sample plotted on the UI 346 screen. The plot type 410 specifies, for example, a value “0” as an X mark, a value “1” as an unfilled circle mark, a value “−1” as a circle mark filled in black,. The plot color 412 is defined by a set of lightness (density) values of the three primary colors red (R), green (G), and blue (B). As will be described later, a score plot which is one of the PCA analysis results and a Kuman plot which is one of the SIMCA analysis results are displayed on the UI 346. On the score plot and the Kuman plot, each sample is displayed as follows. A mark having a shape and a color designated in the PCA table 350 is displayed.

さらに、PCAテーブル３５０には、サンプル毎のサンプルステータス４１４が登録されている。各サンプルのサンプルステータス４１４は、該当のサンプルが解析対象に含まれるか否かを示すものであり、例えば、値「０」が該当のサンプルが解析対象に含まれることを意味し、値「１」が該当のサンプルがユーザからの要求により解析対象から除外されたことを意味する。後に説明するように、PCAテーブル３５０に登録されているサンプルの全ては初期的には解析対象に含まれるのである（つまり、サンプルステータスが「０」である）が、ユーザは、UI３４６のスコアプロット上でクーマンプロット上で任意のサンプルを選択して、選択したサンプルを解析対象から除外することができ、そのようにして除外されたサンプルのサンプルステータスは「１」に変わる。サンプルステータスが「１」であるサンプルは、PCA解析やSIMCA解析において計算に入れられない。 Further, a sample status 414 for each sample is registered in the PCA table 350. The sample status 414 of each sample indicates whether or not the corresponding sample is included in the analysis target. For example, the value “0” means that the corresponding sample is included in the analysis target, and the value “1”. "Means that the corresponding sample has been excluded from the analysis target at the request of the user. As will be described later, all of the samples registered in the PCA table 350 are initially included in the analysis target (that is, the sample status is “0”), but the user can make a score plot on the UI 346. By selecting an arbitrary sample on the Couman plot above, the selected sample can be excluded from the analysis target, and the sample status of the sample thus excluded is changed to “1”. Samples with a sample status of “1” cannot be entered in the PCA analysis or SIMCA analysis.

さらに、PCAテーブル３５０には、実質的な解析対象である多変量データ行列４１８が登録されている。この多変量データ行列４１８には、上記積分全範囲に含まれる全バケットのそれぞれの中点のケミカルシフト値（これは、多変量解析における変数つまり記述子に相当する）４１８と、サンプル毎のNMRヒストグラムのデータ（すなわち、サンプル毎の最適化バケット積分で得られた積分データのセットであり、解析対象である多変量データ行列の正味の部分である）４２０、それぞれのバケットの始点のケミカルシフト値４２２と終点のケミカルシフト値４２４、及び、それぞれのバケットのステータス４２６が登録されている。各バケットのステータス４２６は、そのバケット（つまり、その変数）を解析に使用するか否かを示すものであり、例えば、値「０」はそのバケットを解析に使用することを意味し、値「１」は解析に使用しないことを意味する。後述するように、UI３４６上に表示されるSIMCA解析結果の一つである識別力に基づいて、ユーザは任意のレベル以上に高い識別力をもつ変数（バケット）だけを選択し、選択されたバケットだけを用いて再度PCA解析やSIMCA解析を行なうことを要求することができる。全てのバケットは、初期的にはステータスが「０」であって、解析に使用されるようになっているが、上記のようにして特定のバケットだけが後の解析のために選択された場合、選択されなかったバケットのステータスは「１」に変わり、ステータスが「１」であるバケットは後の解析では使用されないことになる。 Furthermore, a multivariate data matrix 418 that is a substantial analysis target is registered in the PCA table 350. This multivariate data matrix 418 includes chemical shift values (which correspond to variables or descriptors in multivariate analysis) 418 of each bucket included in the integration total range, and NMR for each sample. Histogram data (ie, a set of integration data obtained by optimization bucket integration for each sample and the net part of the multivariate data matrix to be analyzed) 420, the chemical shift value of the starting point of each bucket 422, the chemical shift value 424 of the end point, and the status 426 of each bucket are registered. The status 426 of each bucket indicates whether or not the bucket (that is, the variable) is used for analysis. For example, the value “0” means that the bucket is used for analysis, and the value “ “1” means not used for analysis. As will be described later, the user selects only a variable (bucket) having a discrimination power higher than an arbitrary level based on the discrimination power that is one of SIMCA analysis results displayed on the UI 346, and the selected bucket. It is possible to request that PCA analysis and SIMCA analysis be performed again using only All buckets initially have a status of "0" and are used for analysis, but only a specific bucket is selected for later analysis as described above The status of the unselected bucket is changed to “1”, and the bucket having the status “1” is not used in the later analysis.

以上のような構成を持つPCAテーブル３５０を使用して、PCA解析とSIMCA解析が実行されることになる。 PCA analysis and SIMCA analysis are executed using the PCA table 350 having the above configuration.

図２５は、図２３に示した多変量解析部３４０のPCA解析部３４６とSIMCA解析部３４８並びに表示制御部３４４の機能をより詳細に示している。 FIG. 25 shows the functions of the PCA analyzer 346, SIMCA analyzer 348, and display controller 344 of the multivariate analyzer 340 shown in FIG. 23 in more detail.

図２５に示すように、PCA解析部３４６は、PCAモデル読込・追加・保存部３６０と、PCA計算部３６２とを有する。PCAモデル読込・追加・保存部３６０は、記憶装置１０６上の大量サンプルのNMRヒストグラムデータ３５６の中から、解析対象となる多数のサンプルのNMRヒストグラムデータを選んで、その選ばれた多数のサンプルのNMRヒストグラムデータに基づいてPCAテーブルの情報を新規に作成する機能や、記憶装置１０６上の大量サンプルのNMRヒストグラムデータ３５６の中から追加のサンプルのNMRヒストグラムデータを選んで、それをPCAテーブルの多変量データ行列に追加する機能をもつ。さらに、PCAモデル読込・追加・保存部３６０は、作成されたり追加されたり或は変更されたりしたPCAテーブルの情報を、記憶装置１０６にPCAテーブル３５０として保存する機能をもつ。PCA計算部３６２は、PCAモデル読込・追加・保存部３６０により用意されたPCAテーブルの情報を用いてPCA解析の計算を行ってPCAモデルを作成し、そのPCAモデルに基づいたPCA解析結果（例えば、各サンプルのスコアや、各変数(各バケット）のローディングなど）を出力する。なお、PCA解析の計算方法それ自体は周知であるから（例えば、非特許文献１を参照）、これについての詳細な説明はこの明細書では省略する。 As illustrated in FIG. 25, the PCA analysis unit 346 includes a PCA model reading / adding / saving unit 360 and a PCA calculation unit 362. The PCA model reading / adding / storing unit 360 selects NMR histogram data of a large number of samples to be analyzed from among the NMR histogram data 356 of a large number of samples on the storage device 106, and A function for newly creating information on the PCA table based on the NMR histogram data, or selecting NMR histogram data for additional samples from the NMR histogram data 356 for a large number of samples on the storage device 106, and storing them in the PCA table. Has the ability to add to a random data matrix. Further, the PCA model reading / adding / storing unit 360 has a function of storing information on the PCA table created, added, or changed as the PCA table 350 in the storage device 106. The PCA calculation unit 362 calculates a PCA analysis by using the information in the PCA table prepared by the PCA model reading / adding / saving unit 360 to create a PCA model, and the PCA analysis result based on the PCA model (for example, , Output the score of each sample, loading of each variable (each bucket), etc.). In addition, since the calculation method of PCA analysis itself is well-known (for example, refer nonpatent literature 1), the detailed description about this is abbreviate | omitted in this specification.

SIMCA解析部３７０は、PCAモデル読込・追加・保存部３７０と、SIMCA計算部３７０を有する。PCAモデル読込・追加・保存部３７０は、記憶装置１０６上の大量サンプルのNMRヒストグラムデータ３５６の中からユーザが第1と第２のクラスにそれぞれ割り当てたサンプルのNMRヒストグラムデータを読込んで、第１と第２の２つのクラスのPCAテーブルの情報を用意する機能や、記憶装置１０６上の大量サンプルのNMRヒストグラムデータ３５６の中からユーザがテストクラスに割り当てたサンプルのNMRヒストグラムデータを読込んで、テストクラスのPCAテーブルの情報を用意する機能をもつ。さらに、PCAモデル読込・追加・保存部３７０は、各クラスのPCAテーブルの情報を、記憶装置１０６に各クラスのPCAテーブル３５０として保存する機能をもつ。SIMCA計算部３７２は、PCAモデル読込・追加・保存部３７０により用意された複数のクラスのPCAテーブルの情報を用いてSIMCA解析の計算を行って各クラスのPCAモデルを作成し、それらのPCAモデルに基づいたSIMCA解析結果（例えば、各サンプルの各クラスからの距離や、各変数(各バケット）のモデリング力や識別力など）を出力する。SIMCA解析の計算には、クラス毎のPCA計算が含まれており、このクラス毎のPCA計算を行うために、SIMCA計算部３７２は、PCA解析部３４６のPCA計算部３６２を使用する。なお、SIMCA解析の計算方法それ自体は周知であるから（例えば、非特許文献１を参照）、これについての詳細な説明はこの明細書では省略する。 The SIMCA analysis unit 370 includes a PCA model reading / adding / saving unit 370 and a SIMCA calculation unit 370. The PCA model reading / adding / saving unit 370 reads the NMR histogram data of the samples assigned by the user to the first and second classes from the NMR histogram data 356 of the large number of samples on the storage device 106, and the first And the function to prepare the PCA table information of the second two classes, and the NMR histogram data of the sample assigned by the user to the test class from the NMR histogram data 356 of a large amount of samples on the storage device 106, and the test It has a function to prepare PCA table information for classes. Further, the PCA model reading / adding / saving unit 370 has a function of saving the information of the PCA table of each class in the storage device 106 as the PCA table 350 of each class. The SIMCA calculation unit 372 creates a PCA model for each class by performing a calculation of SIMCA analysis using the information of the PCA table of a plurality of classes prepared by the PCA model reading / adding / storing unit 370. SIMCA analysis results (for example, distance from each class of each sample, modeling power and discriminating power of each variable (each bucket), etc.) are output. The SIMCA analysis calculation includes a PCA calculation for each class, and the SIMCA calculation unit 372 uses the PCA calculation unit 362 of the PCA analysis unit 346 in order to perform the PCA calculation for each class. In addition, since the calculation method of SIMCA analysis itself is well-known (for example, refer nonpatent literature 1), the detailed description about this is abbreviate | omitted in this specification.

PCA解析部３４６とSIMCA解析部３４８は、解析データ記憶部３６４と解析データ保存部３６６とを有する。解析データ記憶部３６４は、PCA解析部３４６から出力されたPCA解析結果と、SIMCA解析部３４８から出力されたSIMCA解析結果を記憶する。解析データ保存部３６６は、解析データ記憶部３６４に記憶されたPCA解析結果とSIMCA解析結果を、記憶装置１０６上に解析結果データ３５２として保存する。 The PCA analysis unit 346 and the SIMCA analysis unit 348 include an analysis data storage unit 364 and an analysis data storage unit 366. The analysis data storage unit 364 stores the PCA analysis result output from the PCA analysis unit 346 and the SIMCA analysis result output from the SIMCA analysis unit 348. The analysis data storage unit 366 stores the PCA analysis result and the SIMCA analysis result stored in the analysis data storage unit 364 as analysis result data 352 on the storage device 106.

後に詳述するように、解析データ記憶部３６４に記憶されるSIMCA解析結果の中には、或るレベル以上に高い識別力をもつ変数（バケット又はそれに対応するケミカルシフト値）を選択する情報が含まれている。この変数選択情報は、解析データ記憶部３６４を通じて、PCA計算部３６２及びSIMCA計算部３７２にフィードバックされることができる。そして、PCA計算部３６２とSIMCA計算部３７２はそれぞれ、フィードバックされた変数選択情報に従って、選択された変数（つまり、或るレベル以上に高い識別力をもつバケット又はそれに対応するケミカルシフト値）だけを用いて、再度、PCA解析とSIMCA解析を行なって、修正されたPCA解析結果とSIMCA解析結果を再度出力することができるようになっている。これにより、より望ましいPCA解析結果とSIMCA解析結果を得ることができるようになる。 As will be described in detail later, in the SIMCA analysis result stored in the analysis data storage unit 364, information for selecting a variable (bucket or a chemical shift value corresponding thereto) having a discrimination power higher than a certain level is included. include. This variable selection information can be fed back to the PCA calculation unit 362 and the SIMCA calculation unit 372 through the analysis data storage unit 364. Then, each of the PCA calculation unit 362 and the SIMCA calculation unit 372 only selects a selected variable (that is, a bucket having a discrimination power higher than a certain level or a chemical shift value corresponding thereto) according to the fed back variable selection information. The PCA analysis and SIMCA analysis are performed again, and the corrected PCA analysis result and SIMCA analysis result can be output again. As a result, more desirable PCA analysis results and SIMCA analysis results can be obtained.

表示制御部３４４は、PCA解析画面制御部３７４と、SIMCA解析画面制御部３７６と、NMRスペクトル連携部３７８とを有する。PCA解析画面制御部３７４は、PCA計算部３６２から解析データ記憶部３６４に出力されたPCA解析結果を受けて、後に具体的に説明するように、UI３４６上にPCA解析結果を示した複数種のチャート（例えば、スコアプロット、ローディングプロット、寄与率チャート及びローディングチャート）(以下、「PCA解析結果チャート」と総称する）を表示し、また、UI３４６に入力されたユーザからの選択、拡大、削除及び初期化などの要求に応答して、UI３４６上のそれらのチャートを変化させる。SIMCA解析画面制御部３７６は、SIMCA計算部３７２から解析データ記憶部３６４に出力されたSIMCA解析結果を受けて、後に具体的に説明するように、SIMCA解析結果を示す複数種のチャート（例えば、クーマンプロット、モデリング力チャート及び識別力チャート）(以下、「SIMCA解析結果チャート」と総称する）をUI３４６上に表示し、また、UI３４６に入力されたユーザからの選択、拡大、削除及び初期化などの要求に応答して、UI３４６上のそれらのチャートを変化させる。 The display control unit 344 includes a PCA analysis screen control unit 374, a SIMCA analysis screen control unit 376, and an NMR spectrum cooperation unit 378. The PCA analysis screen control unit 374 receives the PCA analysis result output from the PCA calculation unit 362 to the analysis data storage unit 364 and, as will be described in detail later, a plurality of types of PCA analysis results that are displayed on the UI 346. A chart (for example, a score plot, a loading plot, a contribution rate chart, and a loading chart) (hereinafter collectively referred to as “PCA analysis result chart”) is displayed, and selection, enlargement, deletion, and deletion from the user input to the UI 346 In response to a request such as initialization, the charts on UI 346 are changed. The SIMCA analysis screen control unit 376 receives a SIMCA analysis result output from the SIMCA calculation unit 372 to the analysis data storage unit 364 and, as will be described in detail later, a plurality of types of charts (for example, the SIMCA analysis result) (Cuman plot, modeling power chart and discriminating power chart) (hereinafter collectively referred to as “SIMCA analysis result chart”) are displayed on the UI 346, and are selected, enlarged, deleted, and initialized from the user input to the UI 346. In response to the request, the charts on the UI 346 are changed.

NMRスペクトル連携部３７８は、記憶装置１０６上の大量のサンプルのNMRスペクトルデータ３５４の中から、UI３４６上でユーザにより選択された１又は複数のサンプルのNMRスペクトルデータを読込んで、その選択されたサンプルのNMRスペクトルを示すチャートを、UI３４６上に上述したPCA解析結果チャートまたはSIMCA解析結果チャートと並べて表示する。 The NMR spectrum cooperation unit 378 reads the NMR spectrum data of one or more samples selected by the user on the UI 346 from the NMR spectrum data 354 of a large number of samples on the storage device 106, and the selected sample A chart showing the NMR spectrum is displayed on the UI 346 side by side with the PCA analysis result chart or SIMCA analysis result chart described above.

以下では、図２５に示した各部の機能を、UI３４６の画面例を参照しつつ説明する。 In the following, the function of each unit illustrated in FIG. 25 will be described with reference to a screen example of the UI 346.

図２６は、PCA解析結果を表示した場合におけるUI３４６の画面例を示す。図２７は、SIMCA解析結果を表示した場合におけるUI３４６の画面例を示す。 FIG. 26 shows a screen example of the UI 346 when the PCA analysis result is displayed. FIG. 27 shows a screen example of the UI 346 when the SIMCA analysis result is displayed.

図２６と図２７に示すように、この実施形態におけるUI３４６の画面４３０（以下、「UI画面」という）は、PCA解析結果とSIMCA解析結果とが交互に切り替え可能な別の画面に表示されるようになっているが、これは表示方法の一例にすぎない。十分に大型のディスプレイ装置を使用する場合のために、同一画面にPCA解析結果とSIMCA解析結果とを表示できるようになっていてもよい。 As shown in FIGS. 26 and 27, the screen 430 of the UI 346 (hereinafter referred to as “UI screen”) in this embodiment is displayed on another screen in which the PCA analysis result and the SIMCA analysis result can be switched alternately. However, this is only an example of the display method. In order to use a sufficiently large display device, the PCA analysis result and the SIMCA analysis result may be displayed on the same screen.

図２６と図２７に示すように、PCA解析結果とSIMCA解析結果のいずれを表示する場合にも、UI画面４３０には、コントロールパネル４３２があり、そこには、図１に示したステップ１０８〜１１４の前処理を制御するための各種要求をユーザが入力するための前処理コントロールパネル４３４と、PCA解析を制御するための各種要求をユーザが入力するためのPCAコントロールパネル４３６と、SIMCA解析を制御するための各種要求をユーザが入力するためのSIMCAコントロールパネル４３８とが図示のように配置される。 As shown in FIGS. 26 and 27, the UI screen 430 has a control panel 432 for displaying either the PCA analysis result or the SIMCA analysis result. 114, a preprocessing control panel 434 for a user to input various requests for controlling preprocessing of 114, a PCA control panel 436 for a user to input various requests for controlling PCA analysis, and SIMCA analysis. A SIMCA control panel 438 for the user to input various requests for control is arranged as shown.

このUI画面４３０上のPCAコントロールパネル４３６の一番上の「PCA」選択ラジオボタンがユーザにより操作された（例えばクリックされた）場合には、それに応答して、図２５に示したPCA解析画面制御部３７４が動作する。PCA解析画面制御部３７４は、解析データ記憶部３６４からPCA解析結果を受けて、UI画面４３０上に図２６に示すように、複数のPCA解析結果チャート、すなわち、スコアプロット４４０、ローディングプロット４４２、寄与率プロット４４４及びローディングプロットを表示する。他方、SIMCAコントロールパネル４３８の一番上の「SIMCA」選択ラジオボタンが操作された場合には、それに応答して図２５に示したSIMCA解析画面制御部３７６が動作する。SIMCA解析画面制御部３７６は、解析データ記憶部３６４からSIMCA解析結果を受けて、UI画面４３０上に図２７に示すように、複数のSIMCA解析チャート、すなわち、クーマンプロット４６２、第１のクラスに対するモデリング力チャート４６４、第２のクラスに対するモデリング力チャート４６６及び識別力チャート４６８を表示する。 When the user selects (for example, clicks) the “PCA” selection radio button at the top of the PCA control panel 436 on the UI screen 430, the PCA analysis screen shown in FIG. The control unit 374 operates. The PCA analysis screen control unit 374 receives the PCA analysis result from the analysis data storage unit 364, and a plurality of PCA analysis result charts, that is, a score plot 440, a loading plot 442, as shown in FIG. A contribution plot 444 and a loading plot are displayed. On the other hand, when the “SIMCA” selection radio button at the top of the SIMCA control panel 438 is operated, the SIMCA analysis screen control unit 376 shown in FIG. 25 operates in response thereto. The SIMCA analysis screen control unit 376 receives the SIMCA analysis result from the analysis data storage unit 364 and, as shown in FIG. 27 on the UI screen 430, a plurality of SIMCA analysis charts, that is, the Kuman plot 462 and the first class. A modeling power chart 464, a modeling power chart 466 for the second class, and a discrimination power chart 468 are displayed.

さらに、図２６と図２７に示すように、PCA解析結果とSIMCA解析結果のいずれを表示する場合にも、図２５に示したNMRスペクトル連携部３７８が動作して、UI画面４３０上に、NMRスペクトルチャート４５０を表示する。NMRスペクトルチャート４５０には、ユーザに選択された１以上のサンプルのNMRスペクトルのグラフが表示される。ここで、注目すべき点は、PCA解析結果を表示した場合、図２６に示すように、NMRスペクトルチャート４５０と寄与率チャート４４４とローディングチャート４４６とが、それぞれの横軸であるケミカルシフト軸のスケールが互に一致するようにして、ケミカルシフト軸に直交する縦軸方向に配列される点である。また、同様に、SIMCA解析結果を表示した場合、図２７に示すように、NMRスペクトルチャート４５０と識別力チャート４６８とが、それぞれの横軸であるケミカルシフト軸のスケールが互に一致するようにして、ケミカルシフト軸に直交する縦軸方向に配列される。これにより、ユーザは、寄与率、ローディング又は識別力の大きい変数（バケット又はそれに対応するケミカルシフト値）とNMRスペクトルのピークとの相関関係が視覚的に容易に把握でき、例えばメタボノミクスにおいてバイオマーカを決定することがより容易になる。 Further, as shown in FIGS. 26 and 27, when displaying either the PCA analysis result or the SIMCA analysis result, the NMR spectrum cooperation unit 378 shown in FIG. 25 operates, and the NMR screen is displayed on the UI screen 430. A spectrum chart 450 is displayed. The NMR spectrum chart 450 displays a graph of the NMR spectrum of one or more samples selected by the user. Here, it should be noted that, when the PCA analysis result is displayed, as shown in FIG. 26, the NMR spectrum chart 450, the contribution ratio chart 444, and the loading chart 446 are each of the horizontal axis of the chemical shift axis. They are arranged in the direction of the vertical axis perpendicular to the chemical shift axis so that the scales coincide with each other. Similarly, when the SIMCA analysis result is displayed, as shown in FIG. 27, the NMR spectrum chart 450 and the discriminating power chart 468 are set so that the scales of the chemical shift axes, which are the horizontal axes, coincide with each other. And arranged in the vertical axis direction orthogonal to the chemical shift axis. As a result, the user can easily visually grasp the correlation between the contribution rate, the variable having high loading or discriminating power (the bucket or the chemical shift value corresponding thereto) and the peak of the NMR spectrum, for example, biomarker in metabonomics. It will be easier to determine.

以下では、PCA解析を行なう場合と、SMCA解析を行なう場合とに分けて、それぞれの場合における動作をより具体的に説明する。まず、PCA解析を行なう場合について説明する。 In the following, the operation in each case will be described more specifically by dividing into the case of performing PCA analysis and the case of performing SMCA analysis. First, a case where PCA analysis is performed will be described.

まず、図２６に示したUI画面４３０上のPCAコントロールパネル４３６内の「PCA」選択ラジオボタンが操作された上で「Open_Model」ボタンが操作されると、図２５に示したPCAモデル読込・追加・保存部３６０が動作して、記憶装置１０６内の大量のサンプルのNMRヒストグラムデータ３５６の中から、ユーザに指定された複数のサンプルのNMRヒストグラムデータを選択して読み込み、図２４に示したような、多変量データ行列やその他の情報をからなるPCAテーブルの情報を作成する。ユーザ指定されたサンプルの数がN個で、各サンプルのNMRスペクトルを構成する変数（バケット）の数がM個であるならば、PCAテーブルの情報に含まれる多変量データ行列はN×Mのデータ行列となる。 First, when the “PCA” selection radio button in the PCA control panel 436 on the UI screen 430 shown in FIG. 26 is operated and then the “Open_Model” button is operated, the PCA model reading / adding shown in FIG. 25 is performed. The storage unit 360 operates to select and read NMR histogram data of a plurality of samples designated by the user from among a large number of sample NMR histogram data 356 in the storage device 106, as shown in FIG. Create PCA table information consisting of multivariate data matrix and other information. If the number of samples specified by the user is N and the number of variables (buckets) constituting the NMR spectrum of each sample is M, the multivariate data matrix included in the PCA table information is N × M. It becomes a data matrix.

PCAモデル読込・追加・保存部３６０は、作成したPCAテーブルの情報をPCA計算部３６２に引渡す。PCA計算部３６２は、そのPCAテーブルの情報を用いてPCA解析の計算を行なって、そのPCAテーブルの情報に含まれている多変量データ行列についてのPCAモデルを算出する。ここで、PCA解析計算において計算される主成分の数は、UI画面４３０上のPCAコントロールパネル４３６の一番上の「PCA」選択ラジオボタンの右脇に置かれた入力ボックスにユーザが入力した数で決まる（図示の例では主成分数として「６」が設定されている）。PCA計算部３６２は、算出されたPCAモデルに基づくPCA解析結果として、各主成分に対する各サンプルのスコア、そのスコアの分散、各主成分に対する各変数のローディング及び各変数の寄与率（全主成分に対するローディングの絶対値の和）などを算出する。算出されたPCA解析結果は、解析データ記憶部３６４に記憶される。 The PCA model reading / adding / saving unit 360 delivers the information of the created PCA table to the PCA calculation unit 362. The PCA calculation unit 362 performs a PCA analysis calculation using the information in the PCA table, and calculates a PCA model for the multivariate data matrix included in the information in the PCA table. Here, the number of principal components calculated in the PCA analysis calculation is input by the user in the input box placed on the right side of the “PCA” selection radio button at the top of the PCA control panel 436 on the UI screen 430. (In the example shown, “6” is set as the number of principal components). As a PCA analysis result based on the calculated PCA model, the PCA calculation unit 362 obtains the score of each sample with respect to each principal component, the variance of the score, the loading of each variable with respect to each principal component, and the contribution rate of each variable (total principal components) The sum of the absolute values of loading with respect to) is calculated. The calculated PCA analysis result is stored in the analysis data storage unit 364.

PCA解析画面制御部３７４が、解析データ記憶部３６４から上記PCA解析結果を受け取る。PCA解析画面制御部３７４は、図２６に示すように、各主成分に対する各サンプルのスコアに基づいて、スコアプロット４４０を作成してUI画面４３０上に表示し、各主成分に対する各変数のローディングに基づいて、ローディングプロット４４２とローディングチャート４４６を作成してUI画面４３０上に表示し、さらに、各変数の寄与率に基づいて、寄与率チャート４４４を作成してUI画面４３０上に表示する。 The PCA analysis screen control unit 374 receives the PCA analysis result from the analysis data storage unit 364. As shown in FIG. 26, the PCA analysis screen control unit 374 creates a score plot 440 based on the score of each sample for each principal component and displays it on the UI screen 430, and loads each variable for each principal component. Based on the above, a loading plot 442 and a loading chart 446 are created and displayed on the UI screen 430, and a contribution rate chart 444 is created and displayed on the UI screen 430 based on the contribution rate of each variable.

図２６に示すように、スコアプロット４４０では、ユーザにより選択された２つの主成分に対するそれぞれのスコアをそれぞれX軸とY軸にとり、各サンプルのその２つの主成分に対するスコアがプロットされる。X軸とY軸に相当する主成分は、PCAコントロールパネル４３６内の「X」軸プルダウンメニュー及び「Y」軸プリダウンメニューからユーザにより選ばれた番号の主成分である（図示の例では、X軸が第「１」主成分（PCA1)、Y軸が第「２」主成分（PCA2)。）。プロットされるサンプルのマークの形状と色は、上記PCAテーブル情報の中のサンプル毎のプロットタイプとプロットカラー（図２４の参照番号４１０、４１２に対応する情報）に従う。さらに、X軸とY軸には、対応する主成分番号とその分散も表示される（例：PC1(39.8%)）。さらに、計算に使用されたサンプル数も表示される。 As shown in FIG. 26, in the score plot 440, the scores for the two principal components selected by the user are taken on the X axis and the Y axis, respectively, and the scores for the two principal components of each sample are plotted. The principal components corresponding to the X axis and the Y axis are principal components of numbers selected by the user from the “X” axis pull-down menu and the “Y” axis pre-down menu in the PCA control panel 436 (in the illustrated example, The X-axis is the “first” principal component (PCA1), and the Y-axis is the “second” principal component (PCA2). The shape and color of the mark of the sample to be plotted follows the plot type and plot color (information corresponding to reference numbers 410 and 412 in FIG. 24) for each sample in the PCA table information. Furthermore, the corresponding principal component numbers and their variances are also displayed on the X-axis and Y-axis (example: PC1 (39.8%)). In addition, the number of samples used in the calculation is displayed.

スコアプロット４４０上で、複数のサンプルマーク間の位置が近いほど、それらのサンプルは特徴的に互いにより似ており、他方、位置が遠いほどより違っていることを意味する。従って、スコアプロット４４０上に、例えば互いに分離された幾つかのサンプルマークの塊が存在するならば、分析対象のサンプルが、それぞれの塊に対応する特徴的に異なる幾つかのクラスに分けられたと判断することができる。 On the score plot 440, the closer the position between the plurality of sample marks, the more similar the samples are characteristically to each other, while the farther the position, the more different. Thus, if there are several sample mark chunks separated from each other on the score plot 440, for example, the sample to be analyzed is divided into several distinct classes corresponding to each chunk. Judgment can be made.

また、各主成分におけるスコアの分散は、その値が大きいほど、その主成分がサンプルのクラス分けに寄与する度合いが高いことを意味する。例えば、X軸とY軸の主成分の分散の和が、或る値（例えば、６０％程度）より大きければ、それらの主成分を用いたPCAモデルにおいてサンプルがかなり明確にクラス分けされ得ると判断することができる。 Further, the variance of the score in each principal component means that the larger the value, the higher the degree that the principal component contributes to the classification of samples. For example, if the sum of the variances of the principal components of the X and Y axes is greater than a certain value (for example, about 60%), the sample can be classified fairly clearly in the PCA model using those principal components. Judgment can be made.

PCAコントロールパネル４３６内の「Name」チェックボックスにユーザがチェックを入れると、PCA解析画面制御部３７４は、図２８（スコアプロット４４０内の部分領域の拡大図）に示すように、スコアプロット４４０上の全てのサンプルマークの近傍に、対応するサンプルのNMRスペクトルのファイル名を表示する。マウスポインターが各サンプルの上に置かれた場合にも、スコアプロット４４０の右上に自動的に対応するファイル名が表示される。これにより、ユーザは、スコアプロット４４０上の各サンプルマークが、具体的にどのサンプルに該当するのかを容易に把握できる。 When the user checks the “Name” check box in the PCA control panel 436, the PCA analysis screen control unit 374 displays on the score plot 440 as shown in FIG. 28 (enlarged view of a partial area in the score plot 440). The file names of the NMR spectra of the corresponding samples are displayed in the vicinity of all the sample marks. Even when the mouse pointer is placed on each sample, the corresponding file name is automatically displayed on the upper right of the score plot 440. Thereby, the user can easily grasp which sample each sample mark on the score plot 440 specifically corresponds to.

図２６に示すように、ローディングプロット４４２では、PCAコントロールパネル４３６内の「X」軸プルダウンメニュー及び「Y」軸プリダウンメニューで選択された２つの主成分におけるローディングをそれぞれX軸とY軸にとり、その２つの主成分に対する各変数（バケット又はそれに対応するケミカルシフト値）のローディングが、所定の形状とカラーをもつ変数マークでプロットされる。また、X軸とY軸には、対応する主成分願号とその分散が表示される（例：PC1(39.8%)）。また、計算に使用された変数（バケット）の数も表示される。 As shown in FIG. 26, in the loading plot 442, the loadings of the two principal components selected by the “X” axis pull-down menu and the “Y” axis pre-down menu in the PCA control panel 436 are respectively taken as the X axis and the Y axis. The loading of each variable (bucket or corresponding chemical shift value) for the two principal components is plotted with variable marks having a predetermined shape and color. In addition, the corresponding principal component application numbers and their variances are displayed on the X and Y axes (example: PC1 (39.8%)). The number of variables (buckets) used for the calculation is also displayed.

ローディングプロット４４２上で、X軸とY軸の原点から遠くに存在する変数ほど、X軸とY軸にそれぞれ対応する主成分に対する寄与度が高い、つまり、サンプルのクラス分けに寄与する度合いが高いことを意味する。 On the loading plot 442, variables that are farther from the origin of the X and Y axes have a higher contribution to the principal components corresponding to the X and Y axes, that is, a higher degree of contribution to the classification of samples. Means that.

PCAコントロールパネル４３６内の「Name」チェックボックスにユーザがチェックを入れると、PCA解析画面制御部３７４は、図２９（ローディングプロット４４２内の部分領域の拡大図）に示すように、ローディングプロット４４２上の全ての変数マークの近傍に、対応する変数（バケット）のケミカルシフト値(ppm）を表示する。また、マウスポインターが各変数マークの上に置かれた場合にも、ローディングプロット４４２の右上に自動的に対応するケミカルシフト値が表示される。これにより、ユーザは、ローディングプロット４４２上の各変数マークが、具体的にどの変数（ケミカルシフト値）に該当するのかを容易に把握できる。 When the user checks the “Name” check box in the PCA control panel 436, the PCA analysis screen control unit 374 displays on the loading plot 442 as shown in FIG. 29 (an enlarged view of a partial area in the loading plot 442). The chemical shift value (ppm) of the corresponding variable (bucket) is displayed in the vicinity of all the variable marks. Also, when the mouse pointer is placed on each variable mark, the corresponding chemical shift value is automatically displayed on the upper right of the loading plot 442. Accordingly, the user can easily grasp which variable (chemical shift value) each variable mark on the loading plot 442 specifically corresponds to.

図２６に示すように、寄与率チャート４４４では、ケミカルシフトを横軸にとり、各変数（ケミカルシフト値）の第１、２、３主成分に対するローディングの絶対値と残差（residual）成分の和が、棒グラフの形で表示される。第１、２、３主成分のスコアの分散と残差成分の分散も表示される。寄与率チャート４４４において、寄与率（棒グラフの高さ）が高い変数ほど、サンプルのクラス分けに寄与する度合いが高いと判断することができる。 As shown in FIG. 26, in the contribution rate chart 444, the horizontal axis represents the chemical shift, and the sum of the absolute value of the loading and the residual component for the first, second and third principal components of each variable (chemical shift value). Is displayed in the form of a bar graph. The variance of the scores of the first, second, and third principal components and the variance of the residual component are also displayed. In the contribution rate chart 444, it can be determined that the higher the contribution rate (the height of the bar graph), the higher the degree of contribution to the sample classification.

図２６に示すように、ローディングチャート４４６では、ケミカルシフトを横軸にとり、PCAコントロールパネル４３６内の「X」軸プルダウンメニューで選択された１つの主成分に対する各変数のローディングが、棒グラフの形で表示される。また、そのX軸に選択された主成分のスコアの分散も表示される。ローディングチャート４４６において、ローディングの絶対値（棒グラフの高さ）が高い変数ほど、その主成分に寄与する度合いが高いと判断することができる。 As shown in FIG. 26, in the loading chart 446, the horizontal axis indicates the chemical shift, and the loading of each variable for one principal component selected from the “X” axis pull-down menu in the PCA control panel 436 is shown in the form of a bar graph. Is displayed. Also, the variance of the score of the selected principal component is displayed on the X axis. In the loading chart 446, it can be determined that the higher the absolute value of loading (the height of the bar graph), the higher the degree of contribution to the principal component.

図２６に示されたスコアプロット４４０、ローディングプロット４４２、寄与率プロット４４４及びローディングチャート４４６のそれぞれにおいて、選択、拡大／縮小、削除、初期化等の各種操作を行うことが可能である。 In the score plot 440, loading plot 442, contribution rate plot 444, and loading chart 446 shown in FIG. 26, various operations such as selection, enlargement / reduction, deletion, and initialization can be performed.

まず、スコアプロット４４０の各種操作について説明する。 First, various operations of the score plot 440 will be described.

図３０に示すように、スコアプロット４４０上の任意の１以上のサンプルマークをマウスのドラッグにより選択すると、PCA解析画面制御部３７４（図２５）が、その選択範囲４７０をスコアプロット４４０上に表示する。これと連携して、NMRスペクトル連携部３７８（図２５）が動作して、選択されたサンプルマークに対応するサンプルのNMRスペクトルデータを、PCMテーブルに登録されているそのサンプルのパス名とファイル名とを用いて、記憶装置１０６内のNMRスペクトルデータ３５４から読み込み、そして、選択されたサンプルのNMRスペクトルをNMRスペクトルチャート４５０上に表示する。NMRスペクトルチャート４５０には、表示されたNMRスペクトルのファイル名も表示される。 As shown in FIG. 30, when any one or more sample marks on the score plot 440 are selected by dragging the mouse, the PCA analysis screen control unit 374 (FIG. 25) displays the selection range 470 on the score plot 440. To do. In cooperation with this, the NMR spectrum cooperation unit 378 (FIG. 25) operates to store the NMR spectrum data of the sample corresponding to the selected sample mark, and the path name and file name of the sample registered in the PCM table. Are read from the NMR spectrum data 354 in the storage device 106, and the NMR spectrum of the selected sample is displayed on the NMR spectrum chart 450. The NMR spectrum chart 450 also displays the file name of the displayed NMR spectrum.

NMRスペクトルチャート４５０上で、複数のNMRスペクトルは、ケミカルシフト軸スケールを互いに一致させるようにして、ケミカルシフト軸に直交する縦軸方向に並んで表示され、且つ、規定の異なる色で区別して表示されるので、相互間の識別および相関が一目で分かる。加えて、図２６に示されているように、NMRスペクトルチャート４５０上の各NMRスペクトルと、寄与率チャート４４４とローディングチャート４４６とが、ケミカルシフト軸スケールを互いに一致させて、縦軸方向に並んで表示されるので、それらの間の相関も一目で分かる。 On the NMR spectrum chart 450, a plurality of NMR spectra are displayed side by side in the vertical axis direction orthogonal to the chemical shift axis so that the chemical shift axis scales coincide with each other, and are distinguished and displayed in different colors. Thus, the mutual identification and correlation can be seen at a glance. In addition, as shown in FIG. 26, each NMR spectrum on the NMR spectrum chart 450, the contribution ratio chart 444, and the loading chart 446 are aligned in the vertical axis direction with the chemical shift axis scales matched with each other. So that you can see the correlation between them at a glance.

ユーザがキーボードの「CTRL」キーを押しながら、スコアプロット４４０上の任意のサンプルマークをマウスにてドラッグして選択すると、PCA解析画面制御部３７４は、そのサンプルマークを選択範囲に追加し、又は、そのサンプルマークが既に選択されている場合には、そのサンプルマークを選択範囲から外す。これと連携して、NMRスペクトル連携部３７８が、選択範囲に追加されたサンプルマークに対応するサンプルのNMRスペクトルをNMRスペクトルチャート４５０上に追加表示したり、又は、選択範囲から外されたサンプルのNMRスペクトルをNMRスペクトルチャート４５０から消去する。 When the user drags and selects an arbitrary sample mark on the score plot 440 with the mouse while pressing the “CTRL” key on the keyboard, the PCA analysis screen control unit 374 adds the sample mark to the selection range, or If the sample mark is already selected, the sample mark is removed from the selection range. In cooperation with this, the NMR spectrum linking unit 378 additionally displays the NMR spectrum of the sample corresponding to the sample mark added to the selected range on the NMR spectrum chart 450, or the sample spectrum removed from the selected range. The NMR spectrum is erased from the NMR spectrum chart 450.

図３１Aに示すように、スコアプロット４４０上で選択されたサンプルマークに関し、ユーザが「Del」キーを押すなどの削除要求を行うと、PCA解析画面制御部３７４は、図３１Bに示すように、そのサンプルマークをスコアプロット４４０上から消去し、これと連携して、NMRスペクトル連携部３７８が、その選択されたサンプルのNMRスペクトルをNMRスペクトルチャート４５０から消去する。同時に、PCA計算部３６２が、その削除要求のあったサンプルのPCAテーブル上でのサンプルステータス（図２４の参照番号４１４に相当）を「０」（解析対象に含まれる）から「１」（解析対象に含まれない）に切り替え、そして、サンプルステータスが「０」（解析対象に含まれる）であるサンプルのNMRヒストグラムデータだけを用いて、PCA解析計算を再度行い、その再計算の結果を解析データ記憶部３６４に出力する。PCA解析画面制御部３７４は、解析データ記憶部３６４から再計算の結果を受けて、UI画面４３０上に、再計算の結果に基づくスコアプロット４４０、ローディングプロット４４２、寄与率チャート４４４及びローディングロット４４６を表示する（図３１Bは、再計算後に表示されたスコアプロット４４０の例を示す）。 As shown in FIG. 31A, regarding the sample mark selected on the score plot 440, when the user makes a deletion request such as pressing the “Del” key, the PCA analysis screen control unit 374, as shown in FIG. 31B, The sample mark is deleted from the score plot 440, and in cooperation with this, the NMR spectrum cooperation unit 378 deletes the NMR spectrum of the selected sample from the NMR spectrum chart 450. At the same time, the PCA calculation unit 362 changes the sample status (corresponding to the reference number 414 in FIG. 24) of the sample requested to be deleted from “0” (included in the analysis target) to “1” (analysis). And the PCA analysis calculation is performed again using only the NMR histogram data of the sample whose sample status is “0” (included in the analysis target), and the result of the recalculation is analyzed. The data is output to the data storage unit 364. The PCA analysis screen control unit 374 receives the result of the recalculation from the analysis data storage unit 364, and on the UI screen 430, the score plot 440, the loading plot 442, the contribution rate chart 444, and the loading lot 446 based on the result of the recalculation. (FIG. 31B shows an example of the score plot 440 displayed after recalculation).

例えば、スコアプロット４４０上に、解析目的からみて支障になる或いは寄与しないと考えられるサンプルが表示されていた場合、そのサンプルを上記のようにして解析対象から除外した上で再度PCA計算を行うことにより、より望ましい解析結果が得られることになる。 For example, on the score plot 440, if a sample that is considered to be an obstacle or does not contribute to the analysis purpose is displayed, the sample is excluded from the analysis target as described above, and the PCA calculation is performed again. Thus, a more desirable analysis result can be obtained.

また、ユーザがキーボードの「Shift」キーを押しながら、スコアプロット４４０上の任意の部分領域をマウスにてドラッグし選択すると、PCA解析画面制御部３７４は、選択した部分領域を拡大してスコアプロット４４０上に表示する。その後、ユーザが拡大表示されたスコアプロット４４０の画面を選択して、キーボードの「Home」キーを押すと、PCA解析画面制御部３７４は、スコアプロット４４０上の拡大率を「１」に初期化し、全てのスコア範囲を表示する。 When the user drags and selects an arbitrary partial area on the score plot 440 with the mouse while pressing the “Shift” key on the keyboard, the PCA analysis screen control unit 374 enlarges the selected partial area and displays the score plot. 440 is displayed. Thereafter, when the user selects the enlarged screen of the score plot 440 and presses the “Home” key on the keyboard, the PCA analysis screen control unit 374 initializes the enlargement ratio on the score plot 440 to “1”. , Display all score ranges.

また、ユーザがPCMコントロールパネル４３６上の「Reset_score」ボタンを操作すると、PCA計算部３６２が、PCAテーブル内の上記サンプル削除機能により「１」となっていたサンプルステータスを初期値「０」し、このように初期化されたPCAテーブルの内容に基づいて再度PCA解析計算を行い、そして、PCA解析画面制御部３７４が、そのPCA解析計算の結果に基づいたスコアプロット４４０、ローディングプロット４４２、寄与率チャート４４４及びローディングロット４４６を改めて表示する。 When the user operates the “Reset_score” button on the PCM control panel 436, the PCA calculation unit 362 sets the sample status that has been “1” by the sample deletion function in the PCA table to the initial value “0”. The PCA analysis calculation is performed again based on the contents of the PCA table initialized as described above, and the PCA analysis screen control unit 374 generates a score plot 440, a loading plot 442, a contribution rate based on the result of the PCA analysis calculation. The chart 444 and the loading lot 446 are displayed again.

次に、ローディングプロット４４２の各種操作について説明する。 Next, various operations of the loading plot 442 will be described.

図３２に示すように、ユーザがローディングプロット４４２内の任意の１以上の変数マークをマウスでドラッグして選択すると、PCA解析画面制御部３７４が、その選択範囲４７２をローディングプロット４４２上に表示する。これと連携して、PCA解析画面制御部３７４は、寄与率チャート４４４及びローディングチャート４４６内の選択された変数マークに対応する変数（ケミカルシフト値）の棒グラフを、他の変数の棒グラフとは異なる所定の選択色で表示する（図中では、選択された変数の棒グラフを黒塗りで表示している。）。これにより、ローディングプロット４４２、寄与率チャート４４４及びローディングチャート４４６上での各変数（ケミカルシフト値）の識別と相関が一目で分かる。 As shown in FIG. 32, when the user selects one or more variable marks in the loading plot 442 by dragging with the mouse, the PCA analysis screen control unit 374 displays the selection range 472 on the loading plot 442. . In conjunction with this, the PCA analysis screen control unit 374 makes the bar graph of the variable (chemical shift value) corresponding to the selected variable mark in the contribution rate chart 444 and the loading chart 446 different from the bar graphs of other variables. It is displayed in a predetermined selection color (in the figure, the bar graph of the selected variable is displayed in black). Thereby, identification and correlation of each variable (chemical shift value) on the loading plot 442, the contribution rate chart 444, and the loading chart 446 can be known at a glance.

ユーザがキーボードの「CTRL」キーを押しながら、ローディングプロット４４２上の任意の変数マークをマウスにてドラッグして選択すると、PCA解析画面制御部３７４は、その変数マークを選択範囲に追加し、又は、その変数マークが既に選択されている場合には、その変数マークを選択範囲から外す。これと連携して、PCA解析画面制御部３７４は、寄与率チャート４４４及びローディングチャート４４６上、選択範囲に追加された変数マークに対応する変数の棒グラフを追加的に選択色で表示したり、又は、選択範囲から外された変数の棒グラフの選択色表示を解除したりする。 When the user selects an arbitrary variable mark on the loading plot 442 by dragging with the mouse while pressing the “CTRL” key of the keyboard, the PCA analysis screen control unit 374 adds the variable mark to the selection range, or If the variable mark is already selected, the variable mark is removed from the selection range. In conjunction with this, the PCA analysis screen control unit 374 additionally displays a bar graph of variables corresponding to the variable marks added to the selection range in the selection color on the contribution rate chart 444 and the loading chart 446, or , Cancel the selected color display of the bar graph of the variable that has been removed from the selection range.

また、図３３Bに示すように、上記のようにして或る選択範囲４７６で選択された変数マークについて、ユーザが「Del」キーを押すなどの削除要求を行うと、PCA解析画面制御部３７４は、その変数マークをローディングプロット４４０上から消去する。同時に、PCA計算部３６２が、その削除要求のあった変数マークに対応する変数（バケット）のPCAテーブル上でのステータス（図２４の参照番号４２６に相当）を「０」（解析に使用する）から「１」（解析に使用しない）に切り替え、そして、解析対象のサンプルのNMRヒストグラムデータ中の、ステータスが「０」（解析に使用する）である変数（バケット）に対応する積分値だけを使用して、PCA解析計算を再度行い、その再計算の結果を解析データ記憶部３６４に出力する。PCA解析画面制御部３７４は、解析データ記憶部３６４から再計算の結果を受けて、UI画面４３０上に、再計算の結果に基づくスコアプロット４４０、ローディングプロット４４２、寄与率チャート４４４及びローディングロット４４６を表示する（図３３Bは、再計算後に表示されたローディングプロット４４２の例を示す。）。 Also, as shown in FIG. 33B, when the user makes a deletion request such as pressing the “Del” key for the variable mark selected in a certain selection range 476 as described above, the PCA analysis screen control unit 374 The variable mark is deleted from the loading plot 440. At the same time, the PCA calculation unit 362 sets the status (corresponding to the reference number 426 in FIG. 24) of the variable (bucket) corresponding to the variable mark requested to be deleted to “0” (used for analysis). To “1” (not used for analysis), and only the integral value corresponding to the variable (bucket) whose status is “0” (used for analysis) in the NMR histogram data of the sample to be analyzed The PCA analysis calculation is performed again, and the result of the recalculation is output to the analysis data storage unit 364. The PCA analysis screen control unit 374 receives the result of the recalculation from the analysis data storage unit 364, and on the UI screen 430, the score plot 440, the loading plot 442, the contribution rate chart 444, and the loading lot 446 based on the result of the recalculation. (FIG. 33B shows an example of a loading plot 442 displayed after recalculation.)

例えば、ローディングプロット４４０上に、解析目的からみて支障になる或いは寄与しないと考えられる変数が表示されていた場合、その変数を上記のようにして解析に使用する変数から除外した上で再度PCA計算を行うことにより、より望ましい解析結果が得られることになる。 For example, on the loading plot 440, if a variable that is considered to be an obstacle or not contribute to the analysis purpose is displayed, the variable is excluded from the variables used for the analysis as described above, and then the PCA calculation is performed again. By performing the above, a more desirable analysis result can be obtained.

また、ユーザが「Shift」キーを押しながら、ローディングプロット４４２上の任意の変数マークをマウスにてドラッグして選択すると、PCA解析画面制御部３７４は、ローディングプロット４４２上の選択範囲に相当する部分を拡大して表示する。この後、ユーザが拡大表示されたローディングプロット４４２を選択して、キーボードの「Home」キーを押すと、PCA解析画面制御部３７４は、ローディングプロット４４２の拡大率を「１」に初期化し、全てのケミカルシフト範囲を表示する。 If the user drags and selects an arbitrary variable mark on the loading plot 442 with the mouse while pressing the “Shift” key, the PCA analysis screen control unit 374 corresponds to the selection range on the loading plot 442. Magnify and display. Thereafter, when the user selects the enlarged loading plot 442 and presses the “Home” key on the keyboard, the PCA analysis screen control unit 374 initializes the enlargement ratio of the loading plot 442 to “1”. Displays the chemical shift range.

また、ユーザがPCMコントロールパネル４３６上の「Reset loading」ボタンを操作すると、PCA計算部３６２が、PCAテーブル内容の上記サンプル削除機能により「１」となっていたバケットのステータスを初期値「０」に戻し、このように初期化されたPCAテーブルの内容に基づいて再度PCA解析計算を行い、そして、PCA解析画面制御部３７４が、そのPCA解析計算の結果に基づいたスコアプロット４４０、ローディングプロット４４２、寄与率チャート４４４及びローディングロット４４６を改めて表示する。 When the user operates the “Reset loading” button on the PCM control panel 436, the PCA calculation unit 362 sets the status of the bucket that has been “1” by the sample deletion function of the PCA table contents to the initial value “0”. , The PCA analysis calculation is performed again based on the contents of the PCA table initialized in this way, and the PCA analysis screen control unit 374 performs score plot 440 and loading plot 442 based on the result of the PCA analysis calculation. The contribution rate chart 444 and the loading lot 446 are displayed again.

次に、寄与率チャート４４４とローディングチャート４４６の各種操作について説明する。 Next, various operations of the contribution rate chart 444 and the loading chart 446 will be described.

ユーザが、寄与率チャート４４４又はローディングチャート４４６のケミカルシフト軸上の任意の変数（ケミカルシフト値）範囲をマウスにてドラッグして選択すると、PCA解析画面制御部３７４が、その選択範囲に属する変数のローディングチャート４４２上の変数マークと、寄与率チャート４４４及びローディングチャート４４６上の棒グラフを、選択範囲外の変数のそれと区別した所定の選択色で表示する。それにより、ローディングプロット４４２、寄与率チャート４４４及びローディングチャート４４６上での各変数（ケミカルシフト値）の識別と相関が一目で分かる。 When the user selects an arbitrary variable (chemical shift value) range on the chemical shift axis of the contribution rate chart 444 or the loading chart 446 by dragging with the mouse, the PCA analysis screen control unit 374 selects a variable belonging to the selected range. The variable mark on the loading chart 442 and the bar graph on the contribution rate chart 444 and the loading chart 446 are displayed in a predetermined selection color that is distinguished from that of the variable outside the selection range. Thereby, identification and correlation of each variable (chemical shift value) on the loading plot 442, the contribution rate chart 444, and the loading chart 446 can be known at a glance.

また、上記のようにして寄与率チャート４４４又はローディングチャート４４６上で選択された変数（ケミカルシフト値）範囲について、ユーザが「Del」キーを押すなどの削除要求を行うと、PCA解析画面制御部３７４は、その選択範囲に属する変数の変数マークをローディングプロット４４０上から消去する。同時に、PCA計算部３６２が、その選択範囲に属する変数（バケット）のPCAテーブル上でのステータス（図２４の参照番号４２６に相当）を「０」（解析に使用する）から「１」（解析に使用しない）に切り替え、そして、解析対象のサンプルのNMRヒストグラムデータ中の、ステータスが「０」（解析に使用する）である変数（バケット）に対応する積分値だけを使用して、PCA解析計算を再度行い、その再計算の結果を解析データ記憶部３６４に出力する。PCA解析画面制御部３７４は、解析データ記憶部３６４から再計算の結果を受けて、UI画面４３０上に、再計算の結果に基づくスコアプロット４４０、ローディングプロット４４２、寄与率チャート４４４及びローディングロット４４６を表示する。 When the user makes a deletion request such as pressing the “Del” key for the variable (chemical shift value) range selected on the contribution rate chart 444 or the loading chart 446 as described above, the PCA analysis screen control unit In step 374, the variable mark of the variable belonging to the selected range is deleted from the loading plot 440. At the same time, the PCA calculation unit 362 changes the status (corresponding to the reference number 426 in FIG. 24) of variables (buckets) belonging to the selected range from “0” (used for analysis) to “1” (analysis). PCA analysis using only integration values corresponding to variables (buckets) whose status is “0” (used for analysis) in the NMR histogram data of the sample to be analyzed The calculation is performed again, and the result of the recalculation is output to the analysis data storage unit 364. The PCA analysis screen control unit 374 receives the result of the recalculation from the analysis data storage unit 364, and on the UI screen 430, the score plot 440, the loading plot 442, the contribution rate chart 444, and the loading lot 446 based on the result of the recalculation. Is displayed.

例えば、寄与率チャート４４４又はローディングチャート４４６上に、解析目的からみて支障になる或いは寄与しないと考えられる変数（ケミカルシフト値）範囲が在る場合、その範囲の変数を上記のようにして解析に使用する変数から除外した上で再度PCA計算を行うことにより、より望ましい解析結果が得られることになる。 For example, when there is a variable (chemical shift value) range on the contribution rate chart 444 or loading chart 446 that is considered to be a problem or not contributing to the analysis purpose, the variable in the range is analyzed as described above. A more desirable analysis result can be obtained by performing the PCA calculation again after excluding the variables to be used.

また、図３４Aに示すように、ユーザが「Shift」キーを押しながら、ローディングチャート４４６又は寄与率チャート４４４のケミカルシフト軸上の任意の変数（ケミカルシフト値）範囲をマウスにてドラッグして選択すると、PCA解析画面制御部３７４は、図３４Bに示すように、ローディングチャート４４６及び寄与率チャート４４４の中の選択された変数（ケミカルシフト値）範囲に相当する部分を拡大して表示する。これに連携して、NMRスペクトル連携部３７８が、NMRスペクトルチャート４５０の中の選択された変数（ケミカルシフト値）範囲に相当する部分を拡大して表示する。従って、どのような拡大率で表示した場合でも常に、ローディングチャート４４６と寄与率チャート４４４とNMRスペクトルチャート４５０はケミカルシフト軸スケールが一致した状態に維持され、それらの間の相関関係が一目で分かるようになっている。 Further, as shown in FIG. 34A, the user selects an arbitrary variable (chemical shift value) range on the chemical shift axis of the loading chart 446 or the contribution ratio chart 444 by dragging with the mouse while pressing the “Shift” key. Then, the PCA analysis screen control unit 374 enlarges and displays a portion corresponding to the selected variable (chemical shift value) range in the loading chart 446 and the contribution ratio chart 444, as shown in FIG. 34B. In cooperation with this, the NMR spectrum cooperation unit 378 enlarges and displays a portion corresponding to the selected variable (chemical shift value) range in the NMR spectrum chart 450. Therefore, the loading chart 446, the contribution ratio chart 444, and the NMR spectrum chart 450 are always maintained in a state where the chemical shift axis scales coincide with each other even when displayed at any magnification, and the correlation between them can be seen at a glance. It is like that.

このようにローディングチャート４４６及び寄与率チャート４４４並びにNMRスペクトルチャート４５０の拡大表示を行なった後、ユーザが拡大表示されたローディングチャート４４６又は寄与率チャート４４４を選択して、キーボードの「Home」キーを押すと、PCA解析画面制御部３７４とNMRスペクトル連携部３７８は、それらのチャート４４６、４４４、４５０の拡大率を「１」に初期化し、全てのケミカルシフト範囲を表示する。 After the loading chart 446, the contribution ratio chart 444 and the NMR spectrum chart 450 are enlarged and displayed in this way, the user selects the loading chart 446 or the contribution ratio chart 444 that has been enlarged, and presses the “Home” key on the keyboard. When pressed, the PCA analysis screen control unit 374 and the NMR spectrum cooperation unit 378 initialize the enlargement ratios of the charts 446, 444, and 450 to “1” and display all the chemical shift ranges.

PCA解析を行なう場合、以上のような各種の操作がUI画面４３０上で行なうことができる。このような操作を逐次に行いながらPCA解析を進めていく過程の任意の段階で、ユーザがPCAコントロールパネル４３６内の「Add_Model」ボタンを操作すると、PCAモデル読込・追加・保存部３６０が、記憶装置１０６内の大量のサンプルのNMRヒストグラムデータ３５６の中から、ユーザに指定された追加のサンプルのNMRヒストグラムデータを選択して読み込み、これを現在処理中のPCAテーブル内の多変量データ行列に追加し、その多変量データ行列が追加されたPCAテーブルの情報をPCA計算部３６２に渡す。PCA計算部３６２は、その多変量データ行列が追加されたPCAテーブルの情報を用いてPCA解析計算を再実行する。PCA解析画面制御部３７４は、その再実行されたPCA解析計算の結果を、UI画面４３０に改めて表示する。 When performing PCA analysis, various operations as described above can be performed on the UI screen 430. When the user operates the “Add_Model” button in the PCA control panel 436 at any stage in the process of proceeding with the PCA analysis while sequentially performing such operations, the PCA model reading / adding / saving unit 360 stores Select and read the NMR histogram data of the additional sample specified by the user from the NMR histogram data 356 of the large number of samples in the apparatus 106, and add this to the multivariate data matrix in the PCA table currently being processed. Then, the information of the PCA table to which the multivariate data matrix is added is passed to the PCA calculation unit 362. The PCA calculation unit 362 re-executes the PCA analysis calculation using information of the PCA table to which the multivariate data matrix is added. The PCA analysis screen control unit 374 displays the result of the re-executed PCA analysis calculation on the UI screen 430 again.

また、上記のようにPCA解析を進めていく過程の任意の段階で、ユーザがPCAコントロールパネル４３６内の「Save_model」ボタンを操作すると、PCAモデル読込・追加・保存部３６０が、その時点で処理中の多変量データ行列を始めとするPCAテーブルの情報を、記憶装置１０６内にPCAテーブル３５０として保存する。 In addition, when the user operates the “Save_model” button in the PCA control panel 436 at any stage of the process of proceeding with the PCA analysis as described above, the PCA model reading / adding / saving unit 360 performs processing at that time. Information on the PCA table including the multivariate data matrix therein is stored as the PCA table 350 in the storage device 106.

以上が、PCA解析を行なう場合の各部の動作である。上述した説明から分かるように、PCA解析を行なう場合、UI画面４３０上では、スコアプロット４４０、ローディングプロット４４２、寄与率チャート４４４、ローディングチャート４４６及びNMRスペクトルチャート４５０の全ての情報が有機的に結合されて表示され、そして、いずれかの操作が行われると、その操作結果が、それに関連する全てのチャートにリアルタイムに反映される。特に、解析対象からのサンプルの削除や変数の削除などを行なった後にPCA解析をすばやく再実行し、より精度の高い解析結果を導き出すことが容易である。また、UI画面４３０上では、PCA解析結果に連携してNMRスペクトルも表示される。そのため、PCA解析結果とNMRスペクトルの形状とを合わせて評価することができ、解析結果の意味づけを考察することがより容易になる。PCA解析過程で必要な操作を行った後の最終的に評価結果からPCAの再解析およびスペクトルの再処理をすばやく実行しより精度の高い解析結果を導き出すことが容易になる。 The above is the operation of each part when performing PCA analysis. As can be seen from the above description, when the PCA analysis is performed, all information of the score plot 440, the loading plot 442, the contribution rate chart 444, the loading chart 446, and the NMR spectrum chart 450 is organically combined on the UI screen 430. When any operation is performed, the operation result is reflected in real time on all the related charts. In particular, it is easy to quickly re-execute a PCA analysis after deleting a sample or a variable from an analysis target, and derive a more accurate analysis result. On the UI screen 430, the NMR spectrum is also displayed in cooperation with the PCA analysis result. Therefore, the PCA analysis result and the shape of the NMR spectrum can be evaluated together, and it becomes easier to consider the meaning of the analysis result. After performing the necessary operations in the PCA analysis process, it becomes easy to quickly execute PCA reanalysis and spectrum reprocessing from the evaluation results to derive more accurate analysis results.

次に、SIMCA解析を行なう場合における動作について説明する。 Next, the operation when SIMCA analysis is performed will be described.

すなわち、まず、図２６又は図２７に示したUI画面４３０上のSIMCAコントロールパネル４３８内の「SIMCA」選択ラジオボタンが操作された上で「Model1」ボタンが操作されると、図２５に示したSIMCA解析部３４８内のPCAモデル読込・追加・保存部３７０が動作して、記憶装置１０６内の大量のサンプルのNMRヒストグラムデータ３５６の中から、ユーザに指定された複数のサンプルのNMRヒストグラムデータを選択して読み込み、図２４に示したような、多変量データ行列やその他の情報からなるPCAテーブルの情報を作成する。作成されたPCAテーブルの情報は、第1のクラスのPCAテーブル情報として扱われる。同様に、「Model２」ボタンが操作されると、PCAモデル読込・追加・保存部３７０が、記憶装置１０６内のNMRヒストグラムデータ３５６の中から、ユーザに指定された複数のサンプルのNMRヒストグラムデータを選択して読み込み、第２のクラスのPCAテーブルの情報を作成する。 That is, first, when the “Model 1” button is operated after the “SIMCA” selection radio button in the SIMCA control panel 438 on the UI screen 430 shown in FIG. The PCA model reading / adding / storing unit 370 in the SIMCA analysis unit 348 operates, and the NMR histogram data of a plurality of samples specified by the user is selected from the NMR histogram data 356 of a large number of samples in the storage device 106. Select and read to create PCA table information consisting of a multivariate data matrix and other information as shown in FIG. Information on the created PCA table is handled as PCA table information of the first class. Similarly, when the “Model 2” button is operated, the PCA model reading / adding / saving unit 370 selects the NMR histogram data of a plurality of samples designated by the user from the NMR histogram data 356 in the storage device 106. Select and read to create second class PCA table information.

SIMCA解析を開始する場合には、少なくとも第１と第２クラスのPCAテーブルが定義される必要がある。更に、ユーザは、所望すればSIMCAコントロールパネル４３８内の「Model1」ボタンが操作されると、「Test_model」ボタンを操作することができる。「Test_model」ボタンが操作されると、PCAモデル読込・追加・保存部３７０が、記憶装置１０６内のNMRヒストグラムデータ３５６の中から、ユーザに指定された複数のサンプルのNMRヒストグラムデータを選択して読み込み、テストクラスのPCAテーブルの情報を作成する。 When starting the SIMCA analysis, at least first and second class PCA tables need to be defined. Further, if desired, the user can operate the “Test_model” button when the “Model1” button in the SIMCA control panel 438 is operated. When the “Test_model” button is operated, the PCA model reading / adding / saving unit 370 selects NMR histogram data of a plurality of samples designated by the user from the NMR histogram data 356 in the storage device 106. Read and create PCA table information of test class.

PCAモデル読込・追加・保存部３７０は、作成した第１クラスと第２クラスのPCAテーブルの
情報をSIMCA計算部３７２に引渡す。テストクラスのPCAテーブルが更に作成された場合には、そのテストクラスのPCAテーブルの情報もSIMCA計算部３７２に引渡される。SIMCA計算部３７２は、それらのクラスのPCAテーブルの情報を用いて、PCA計算部３６２を呼び出してクラス毎のPCAモデルを算出させ、それらのクラスのPCAモデルを用いてSIMCA解析の計算を行なう。各種クラスのPCAモデルの主成分数は、SIMCAコントロールパネル４３８内の「SIMCA」選択ラジオボタンの右脇の入力ボックスにユーザが入力した数に従う。SIMCA計算部３７２は、SIMCA解析結果として、第1と第２の各クラスに対する各サンプルの距離、第1と第２の各クラスの残差標準偏差（RSD）、第1と第２の各クラスに対する各変数（ケミカルシシフト値）のモデリング力（modeling power）、及び各変数（ケミカルシシフト値）の識別力（discrimination power）などを算出する。算出されたSIMCA解析結果は、解析データ記憶部３６４に記憶される。 The PCA model reading / adding / saving unit 370 delivers the information of the created first and second class PCA tables to the SIMCA calculation unit 372. When a test class PCA table is further created, information on the test class PCA table is also delivered to the SIMCA calculation unit 372. The SIMCA calculation unit 372 calls the PCA calculation unit 362 using the information of the PCA table of those classes, calculates the PCA model for each class, and performs the calculation of the SIMCA analysis using the PCA model of those classes. The number of principal components of PCA models of various classes follows the number input by the user in the input box on the right side of the “SIMCA” selection radio button in the SIMCA control panel 438. The SIMCA calculation unit 372 calculates the SIMCA analysis results as follows: distance of each sample to each of the first and second classes, residual standard deviation (RSD) of each of the first and second classes, and each of the first and second classes. The modeling power of each variable (chemical shift value) and the discriminating power of each variable (chemical shift value) are calculated. The calculated SIMCA analysis result is stored in the analysis data storage unit 364.

SIMCA解析画面制御部３７４が、解析データ記憶部３６４から上記SIMCA解析結果を受け取る。SIMCA解析画面制御部３７６は、図２７に示すように、第1と第２の各クラスに対する各サンプルの距離に基づいてクーマンプロット４６０を作成してUI画面４３０上に表示し、第1と第２の各クラスに対する各変数（ケミカルシシフト値）のモデリング力に基づいて、第１クラスのモデリング力チャート４６２と第２クラスのモデリング力チャート４６４を作成してUI画面４３０上に表示し、さらに、各変数（ケミカルシシフト値）の識別力に基づいて、識別力チャート４６６を作成してUI画面４３０上に表示する。 The SIMCA analysis screen control unit 374 receives the SIMCA analysis result from the analysis data storage unit 364. As shown in FIG. 27, the SIMCA analysis screen control unit 376 creates a Couman plot 460 based on the distance of each sample with respect to each of the first and second classes and displays it on the UI screen 430. Based on the modeling ability of each variable (chemical shift value) for each class of 2, a first class modeling ability chart 462 and a second class modeling ability chart 464 are created and displayed on the UI screen 430. Based on the discriminating power of each variable (chemical shift value), a discriminating power chart 466 is created and displayed on the UI screen 430.

図２７に示すように、クーマンプロット４６０では、第1クラスと第２クラスからの距離をそれぞれX軸とY軸にとり、各サンプルのその２つのクラスまでの距離がプロットされる。プロットされるサンプルのマークの形状と色は、上記PCAテーブル情報の中のサンプル毎のプロットタイプとプロットカラー（図２４の参照番号４１０、４１２に対応する情報）に従う。さらに、クーマンプロット４６０上には、第1クラスと第２クラスのそれぞれの残差標準偏差（RSD）の名称と値（例：RSD(1)0.5449）がX軸とY軸の近傍に表示され、かつ、それぞれのクラスの残差標準偏差（RSD）を示す直線（以下、「RSD」線という）４８０、４８２が表示される。 As shown in FIG. 27, in the Kuman plot 460, the distances from the first class and the second class are taken on the X axis and the Y axis, respectively, and the distances to the two classes of each sample are plotted. The shape and color of the mark of the sample to be plotted follows the plot type and plot color (information corresponding to reference numbers 410 and 412 in FIG. 24) for each sample in the PCA table information. In addition, on the Couman plot 460, the names and values (eg, RSD (1) 0.5449) of the residual standard deviation (RSD) for each of the first and second classes are displayed near the X and Y axes. In addition, straight lines (hereinafter referred to as “RSD” lines) 480 and 482 indicating the residual standard deviation (RSD) of each class are displayed.

クーマンプロット４６０上でのサンプルマークの配置状態が、次のような明快分類状態に近い状態であるほど、サンプルがより明確に第１クラスと第２クラスに分類されたと判断することができる。すなわち、その明快分類状態とは、サンプルマークが２つの塊に分かれ、一方の塊が第１クラスのRSD線４８０とY軸との間に領域に存在し、他方の塊が第２クラスのRSD線４８２とX軸との間の領域に存在するという状態である。 It can be determined that the sample is more clearly classified into the first class and the second class as the arrangement state of the sample marks on the Kuman plot 460 is closer to the following clear classification state. That is, the clear classification state is that the sample mark is divided into two chunks, one chunk is in the area between the first class RSD line 480 and the Y axis, and the other chunk is the second class RSD. It is in a state where it exists in the region between the line 482 and the X axis.

SIMCAコントロールパネル４３８内の「Name」チェックボックスにユーザがチェックを入れると、SIMCA解析画面制御部３７４は、図２８を参照して説明したスコアプロット４４０の場合と同様に、クーマンプロット４６０上の全てのサンプルマークの近傍に、対応するサンプルのNMRスペクトルのファイル名を表示する。マウスポインターが各サンプルの上に置かれた場合にも、クーマンプロット４６０の右上に自動的に対応するファイル名が表示される。これにより、ユーザは、クーマンプロット４６０の各サンプルマークが、具体的にどのサンプルに該当するのかを容易に把握できる。 When the user checks the “Name” check box in the SIMCA control panel 438, the SIMCA analysis screen control unit 374, as in the case of the score plot 440 described with reference to FIG. The file name of the NMR spectrum of the corresponding sample is displayed near the sample mark. Even when the mouse pointer is placed on each sample, the corresponding file name is automatically displayed on the upper right of the Kuman plot 460. Thereby, the user can easily grasp which sample each sample mark of the Kuman plot 460 specifically corresponds to.

図２７に示すように、第１クラスと第２クラスのモデリングチャート４６２、４６４では、ケミカルシフトを横軸にとり、各変数（ケミカルシフト値）の第１クラスと第２クラスについてのモデリング力（これは、それぞれのクラスのデータ構造を記述するための各変数の有効性を評価するための一つの指標である）が、棒グラフの形で表示される。各クラスのモデリングチャート４６２、４６４において、モデリング力（棒グラフの高さ）が高い変数ほど、各クラスのデータ構造を記述する有効性がより高いと判断することができる。 As shown in FIG. 27, in the modeling charts 462 and 464 of the first class and the second class, the chemical shift is taken on the horizontal axis, and the modeling power (this) for the first class and the second class of each variable (chemical shift value). Is an index for evaluating the effectiveness of each variable for describing the data structure of each class), and is displayed in the form of a bar graph. In the modeling charts 462 and 464 of each class, it can be determined that the variable having a higher modeling ability (height of the bar graph) is more effective in describing the data structure of each class.

図２７に示すように、識別力チャート４６６では、ケミカルシフトを横軸にとり、各変数（ケミカルシフト値）の識別力（これは、第1と第２のクラスへの分類のための各変数の有効性を評価するための一つの指標である）が、棒グラフの形で表示される。識別力チャート４６６において、識別力（棒グラフの高さ）が高い変数ほど、第1と第２のクラスへの分類のための有効性がより高いと判断することができる。 As shown in FIG. 27, in the discriminating power chart 466, the horizontal axis indicates the chemical shift, and the discriminating power of each variable (chemical shift value) (this is the classification of each variable for classification into the first and second classes). Is an indicator for evaluating effectiveness) in the form of a bar graph. In the discriminating power chart 466, it can be determined that the higher the discriminating power (the height of the bar graph), the higher the effectiveness for classification into the first and second classes.

図２７に示されたクーマンプロット４６０、モデリングチャート４６２、４６４、識別力チャート４６６のそれぞれにおいて、選択、拡大／縮小、削除、初期化等の各種操作を行うことが可能である。 In the Cooman plot 460, the modeling charts 462 and 464, and the discriminant power chart 466 shown in FIG. 27, various operations such as selection, enlargement / reduction, deletion, and initialization can be performed.

まず、クーマンプロット４６０の各種操作について説明する。 First, various operations of the Kuman plot 460 will be described.

図３５に示すように、クーマンプロット４６０上の任意の１以上のサンプルマークをマウスのドラッグにより選択すると、SIMCA解析画面制御部３７６が、その選択範囲４８４をクーマンプロット４６０上に表示する。これと連携して、NMRスペクトル連携部３７８が動作して、選択されたサンプルマークに対応するサンプルのNMRスペクトルデータを、PCMテーブルに登録されているそのサンプルのパス名とファイル名とを用いて、記憶装置１０６内のNMRスペクトルデータ３５４から読み込み、そして、選択されたサンプルのNMRスペクトルをNMRスペクトルチャート４５０上に表示する。NMRスペクトルチャート４５０には、表示されたNMRスペクトルのファイル名も表示される。 As shown in FIG. 35, when any one or more sample marks on the Couman plot 460 are selected by dragging the mouse, the SIMCA analysis screen control unit 376 displays the selection range 484 on the Couman plot 460. In cooperation with this, the NMR spectrum cooperation unit 378 operates, and the NMR spectrum data of the sample corresponding to the selected sample mark is used by using the path name and file name of the sample registered in the PCM table. The NMR spectrum data 354 in the storage device 106 is read, and the NMR spectrum of the selected sample is displayed on the NMR spectrum chart 450. The NMR spectrum chart 450 also displays the file name of the displayed NMR spectrum.

NMRスペクトルチャート４５０上で、複数のNMRスペクトルは、ケミカルシフト軸スケールを互いに一致させるようにして、ケミカルシフト軸に直交する縦軸方向に並んで表示され、且つ、規定の異なる色で区別して表示されるので、相互間の識別および相関が一目で分かる。加えて、図２７に示されているように、NMRスペクトルチャート４５０上の各NMRスペクトルと識別力チャート４６６とが、ケミカルシフト軸スケールを互いに一致させて、縦軸方向に並んで表示されるので、それらの間の相関も一目で分かる。 On the NMR spectrum chart 450, a plurality of NMR spectra are displayed side by side in the vertical axis direction orthogonal to the chemical shift axis so that the chemical shift axis scales coincide with each other, and are distinguished and displayed in different colors. Thus, the mutual identification and correlation can be seen at a glance. In addition, as shown in FIG. 27, each NMR spectrum on the NMR spectrum chart 450 and the discriminating power chart 466 are displayed side by side in the vertical axis direction with the chemical shift axis scales matched. The correlation between them can be seen at a glance.

ユーザがキーボードの「CTRL」キーを押しながら、クーマンプロット４６０上の任意のサンプルマークをマウスにてドラッグして選択すると、SIMCA解析画面制御部３７６は、そのサンプルマークを選択範囲に追加し、又は、そのサンプルマークが既に選択されている場合には、そのサンプルマークを選択範囲から外す。これと連携して、NMRスペクトル連携部３７８が、選択範囲に追加されたサンプルマークに対応するサンプルのNMRスペクトルをNMRスペクトルチャート４５０上に追加表示したり、又は、選択範囲から外されたサンプルのNMRスペクトルをNMRスペクトルチャート４５０から消去する。 When the user selects an arbitrary sample mark on the Couman plot 460 while dragging with the mouse while pressing the “CTRL” key on the keyboard, the SIMCA analysis screen control unit 376 adds the sample mark to the selection range, or If the sample mark is already selected, the sample mark is removed from the selection range. In cooperation with this, the NMR spectrum linking unit 378 additionally displays the NMR spectrum of the sample corresponding to the sample mark added to the selected range on the NMR spectrum chart 450, or the sample spectrum removed from the selected range. The NMR spectrum is erased from the NMR spectrum chart 450.

図３６Aに示すように、クーマンプロット４６０上で選択されたサンプルマークに関し、ユーザが「Del」キーを押すなどの削除要求を行うと、SIMCA解析画面制御部３７６は、図３６Bに示すように、そのサンプルマークをクーマンプロット４６０上から消去し、これと連携して、NMRスペクトル連携部３７８が、その選択されたサンプルのNMRスペクトルをNMRスペクトルチャート４５０から消去する。同時に、SIMCA計算部３７２が、その削除要求のあったサンプルの該当クラスのPCAテーブル上でのサンプルステータス（図２４の参照番号４１４に相当）を「０」（解析対象に含まれる）から「１」（解析対象に含まれない）に切り替え、そして、サンプルステータスが「０」（解析対象に含まれる）であるサンプルのNMRヒストグラムデータだけを用いて、SIMCA解析計算を再度行い、その再計算の結果を解析データ記憶部３６４に出力する。SIMCA解析画面制御部３７６は、解析データ記憶部３６４から再計算の結果を受けて、UI画面４３０上に、再計算の結果に基づくクーマンプロット４６０、モデリングプロット４６２、４６４及び識別力チャート４６６を表示する（図３６Bは、再計算後に表示されたクーマンプロット４６０の例を示す）。 As shown in FIG. 36A, when the user makes a deletion request such as pressing the “Del” key for the sample mark selected on the Kuman plot 460, the SIMCA analysis screen control unit 376, as shown in FIG. 36B, The sample mark is erased from the Couman plot 460, and in cooperation therewith, the NMR spectrum cooperation unit 378 erases the NMR spectrum of the selected sample from the NMR spectrum chart 450. At the same time, the SIMCA calculation unit 372 changes the sample status (corresponding to the reference number 414 in FIG. 24) of the corresponding class of the sample requested to be deleted from “0” (included in the analysis target) to “1”. ”(Not included in the analysis target), and the SIMCA analysis calculation is performed again using only the NMR histogram data of the sample whose sample status is“ 0 ”(included in the analysis target). The result is output to the analysis data storage unit 364. The SIMCA analysis screen control unit 376 receives the recalculation result from the analysis data storage unit 364 and displays the Couman plot 460, the modeling plots 462 and 464, and the discriminating power chart 466 based on the recalculation result on the UI screen 430. (FIG. 36B shows an example of the Couman plot 460 displayed after recalculation).

例えば、クーマンプロット４６０上に、解析目的からみて支障になる或いは寄与しないと考えられるサンプルが表示されていた場合、そのサンプルを上記のようにして解析対象から除外した上で再度SIMCA計算を行うことにより、より望ましい解析結果が得られることになる。 For example, on the Couman plot 460, if a sample that is considered to be a problem or does not contribute to the analysis purpose is displayed, the sample is excluded from the analysis target as described above, and then the SIMCA calculation is performed again. Thus, a more desirable analysis result can be obtained.

また、ユーザがキーボードの「Shift」キーを押しながら、クーマンプロット４６０上の任意の部分領域をマウスにてドラッグし選択すると、SIMCA解析画面制御部３７６は、選択した部分領域を拡大してクーマンプロット４６０上に表示する。その後、ユーザが拡大表示されたクーマンプロット４６０の画面を選択して、キーボードの「Home」キーを押すと、SIMCA解析画面制御部３７６は、クーマンプロット４６０上の拡大率を「１」に初期化し、元の表示範囲の全部を表示する。 If the user drags and selects an arbitrary partial area on the Couman plot 460 with the mouse while pressing the “Shift” key on the keyboard, the SIMCA analysis screen control unit 376 enlarges the selected partial area and sets the Couman plot. 460 is displayed on the screen. Thereafter, when the user selects the enlarged screen of the Kuman plot 460 and presses the “Home” key on the keyboard, the SIMCA analysis screen control unit 376 initializes the enlargement ratio on the Kuman plot 460 to “1”. , Display the entire original display range.

また、ユーザがSIMCAコントロールパネル４３８上の「Reset_score」ボタンを操作すると、SIMCA計算部３７２が、各クラスのPCAテーブル内の上記サンプル削除機能により「１」となっていたサンプルステータスを初期値「０」し、このように初期化されたPCAテーブルの内容に基づいて再度SIMCA解析計算を行い、そして、SIMCA解析画面制御部３７６が、そのSIMCA解析計算の結果に基づいたクーマンプロット４６０、モデリングプロット４６２、４６４及び識別力チャート４６６を改めて表示する。 When the user operates the “Reset_score” button on the SIMCA control panel 438, the SIMCA calculation unit 372 sets the sample status that has been set to “1” in the PCA table of each class to the initial value “0”. Then, the SIMCA analysis calculation is performed again based on the contents of the PCA table thus initialized, and the SIMCA analysis screen control unit 376 performs the Kuman plot 460 and the modeling plot 462 based on the result of the SIMCA analysis calculation. 464 and the discriminating power chart 466 are displayed again.

次に、モデリング力チャート４６２、４６４と識別力チャート４６６の各種操作について説明する。 Next, various operations of the modeling power charts 462 and 464 and the discrimination power chart 466 will be described.

図３７に示すように、識別力チャート４６６上で、ユーザが縦軸方向の任意の高さ位置（つまり、任意の識別力の値）をマウスでドラッグして閾値として指定すると、SMCA解析画面制御部３７６が、識別力チャート４６６上のその指定された閾値を示す閾値ライン４８６を表示するとともに、その指定された閾値より高い識別力を持つ変数のみを選択して、その選択された変数の棒グラフを、モデリング力チャート４６２、４６４及び識別力チャート４６６の全てにおいて、他の変数とは異なる所定の選択色で表示する。同時に、解析結果記憶部３６４が、選択された変数（すなわち、指定された閾値より識別力の高い変数）を特定する情報を記憶する。同様に、ユーザが第1クラス又は第２クラスモデリング力チャート４６２又は４６４上で、或るモデリング力の値を閾値として指定した場合にも、その閾値より高いモデリング力をもつ変数が他の変数とは異なる選択色で表示され、かつ、その選択された変数を特定する情報が解析結果記憶部３６４に記憶される。これにより、モデリング力チャート４６２、４６４、識別力チャート４６６及NMRスペクトルチャート４５０における選択された変数の識別や相関関係が一目で分かる。 As shown in FIG. 37, when the user drags an arbitrary height position in the vertical axis direction (that is, an arbitrary discriminating power value) as a threshold by dragging with the mouse on the discriminating power chart 466, SMCA analysis screen control is performed. The unit 376 displays a threshold line 486 indicating the designated threshold on the discriminating power chart 466, selects only variables having discriminating power higher than the designated threshold, and displays a bar graph of the selected variables Are displayed in a predetermined selection color different from other variables in all of the modeling power charts 462 and 464 and the discrimination power chart 466. At the same time, the analysis result storage unit 364 stores information that identifies the selected variable (that is, a variable having a higher discriminating power than the specified threshold value). Similarly, when the user designates a certain modeling force value as a threshold value on the first class or second class modeling force chart 462 or 464, a variable having a modeling ability higher than the threshold value is different from other variables. Are displayed in different selected colors, and information specifying the selected variable is stored in the analysis result storage unit 364. Thereby, the identification and correlation of the selected variables in the modeling power charts 462 and 464, the discriminating power chart 466 and the NMR spectrum chart 450 can be understood at a glance.

上記のようにして識別力又はモデリング力の高い変数が選択された後、ユーザがSIMCAコントロールパネル４３８内の「Set_D.P.」ボタンを操作すると、SIMCA計算部３７２が、解析データ記憶部３６４から選択された変数の特定情報を受け取り、各クラスのPCAテーブル情報中の選択されていない変数のステータスを「０」（解析に使用する）から「１」（解析に使用しない）に変更し、そして、選択された変数だけを用いて再度SIMCA計算を行なう。続いて、SIMCA解析画面制御部３７６が、図３８に示すように、SIMCA再計算の結果を受けて、それに基づきクーマンプロット４６０、モデリングプロット４６２、４６４及び識別力チャート４６６を改めて表示する。このようにして識別力又はモデリング力の高い変数だけを用いてSIMCA解析を再度行うことで、より望ましい解析結果を得ることが可能になる。 After the variable having high discriminating power or modeling power is selected as described above, when the user operates the “Set_D.P.” Button in the SIMCA control panel 438, the SIMCA calculating unit 372 reads from the analysis data storage unit 364. Receives specific information on selected variables, changes the status of unselected variables in the PCA table information for each class from “0” (used for analysis) to “1” (not used for analysis), and The SIMCA calculation is performed again using only the selected variable. Subsequently, as shown in FIG. 38, the SIMCA analysis screen control unit 376 receives the result of the SIMCA recalculation, and displays the Coomman plot 460, the modeling plots 462 and 464, and the discriminating power chart 466 again based on the result. In this way, it is possible to obtain a more desirable analysis result by performing the SIMCA analysis again using only a variable having a high discrimination power or modeling power.

また、上記のようにして識別力又はモデリング力の高い変数が選択された後、ユーザがPCAコントロールパネル４３６内の「Set_D.P.」ボタンを操作すると、PCA計算部３６２が、解析データ記憶部３６４から選択された変数の特定情報を受け取り、SIMCA解析で使用されていた全クラスのPCAテーブル情報を統合して、全クラスのサンプルの多変量データ行列をもつPCA解析用のPCAテーブル情報を作成し、そのPCA解析用のPCAテーブル情報中の選択された変数のステータスを「０」（解析に使用する）とし、選択されてない変数のステータスを「１」（解析に使用しない）とする。そして、PCA計算部３６２は、その選択された変数だけを用いて、PCA解析用のPCAテーブル情報中の全クラスのサンプルの多変量データ行列についてPCA計算を再度行なう。続いて、PCA解析画面制御部３７４が、UI画面４３０を図２７に示すようなSIMCA解析用のものから、図２６に示すようなPCA解析用のもの切り替え、上述したPCA解析の再計算の結果に基づいたスコアプロット４４０、ローディングプロット４４２、寄与率チャート４４４及びローディングチャート４４６を改めて表示する。このようにして識別力又はモデリング力の高い変数だけを用いてPCA解析を再度行うことで、より望ましい解析結果を得ることが可能になる。 When a user operates the “Set_D.P.” Button in the PCA control panel 436 after a variable having high discriminating power or modeling power is selected as described above, the PCA calculation unit 362 causes the analysis data storage unit to PCA table information for PCA analysis with multivariate data matrix of samples of all classes is created by receiving the specific information of the variable selected from 364 and integrating PCA table information of all classes used in SIMCA analysis Then, the status of the selected variable in the PCA table information for PCA analysis is set to “0” (used for analysis), and the status of the unselected variable is set to “1” (not used for analysis). Then, using only the selected variable, the PCA calculation unit 362 performs the PCA calculation again on the multivariate data matrix of the samples of all classes in the PCA table information for PCA analysis. Subsequently, the PCA analysis screen control unit 374 switches the UI screen 430 from the one for SIMCA analysis as shown in FIG. 27 to the one for PCA analysis as shown in FIG. A score plot 440, a loading plot 442, a contribution rate chart 444 and a loading chart 446 are displayed again. In this way, it is possible to obtain a more desirable analysis result by performing the PCA analysis again using only the variables having high discrimination power or modeling power.

また、図２７に示すようなSIMCA解析用のUI画面４３０において、ユーザが「Shift」キーを押しながら、モデリング力チャート４６２、４６４又は識別力チャート４６６中のケミカルシフト軸上の任意の変数（ケミカルシフト値）範囲をマウスにてドラッグして選択すると、SIMCA解析画面制御部３７６は、モデリング力チャート４６２、４６４及び識別力チャート４６６中の選択された変数（ケミカルシフト値）範囲に相当する部分を拡大して表示する。これに連携して、NMRスペクトル連携部３７８が、NMRスペクトルチャート４５０の中の選択された変数（ケミカルシフト値）範囲に相当する部分を拡大して表示する。従って、どのような拡大率で表示した場合でも常に、識別力チャート４６６とNMRスペクトルチャート４５０はケミカルシフト軸スケールが一致した状態に維持され、それらの間の相関関係が一目で分かるようになっている。 In addition, in the UI screen 430 for SIMCA analysis as shown in FIG. 27, while the user presses the “Shift” key, an arbitrary variable (chemical variable) on the chemical shift axis in the modeling force charts 462 and 464 or the discriminating power chart 466 is displayed. When the shift value) range is selected by dragging with the mouse, the SIMCA analysis screen control unit 376 selects a portion corresponding to the selected variable (chemical shift value) range in the modeling power charts 462 and 464 and the discriminating power chart 466. Enlarge and display. In cooperation with this, the NMR spectrum cooperation unit 378 enlarges and displays a portion corresponding to the selected variable (chemical shift value) range in the NMR spectrum chart 450. Therefore, the discriminating power chart 466 and the NMR spectrum chart 450 are always maintained in a state where the chemical shift axis scales coincide with each other even when displayed at any magnification, and the correlation between them can be seen at a glance. Yes.

このようにモデリング力チャート４６２、４６４、識別力チャート４６６及びNMRスペクトルチャート４５０の拡大表示を行なった後、ユーザが拡大表示されたモデリング力チャート４６２、４６４又は識別力チャート４６６を選択して、キーボードの「Home」キーを押すと、SIMCA解析画面制御部３７６とNMRスペクトル連携部３７８は、それらのチャート４６２、４６４、４６６の拡大率を「１」に初期化し、全てのケミカルシフト範囲を表示する。 After the modeling power charts 462 and 464, the discriminating power chart 466 and the NMR spectrum chart 450 are enlarged and displayed, the user selects the enlarged modeling power charts 462 and 464 or the discriminating power chart 466, and the keyboard is selected. When the “Home” key is pressed, the SIMCA analysis screen control unit 376 and the NMR spectrum cooperation unit 378 initialize the magnifications of the charts 462, 464, and 466 to “1” and display all the chemical shift ranges. .

以上が、SIMCA解析を行なう場合の各部の動作である。上述した説明から分かるように、SIMCA解析を行なう場合、UI画面４３０上では、クーマンプロット４６０、モデリングプロット４６２、４６４、識別力チャート４６６及びNMRスペクトルチャート４５０の全ての情報が有機的に結合されて表示され、そして、いずれかの操作が行われると、その操作結果が、それに関連する全てのチャートにリアルタイムに反映される。特に、解析対象からのサンプルの削除や変数の選択などを行なった後にSIMCA解析をすばやく再実行し、より精度の高い解析結果を導き出すことが容易である。また、UI画面４３０上では、SIMCA解析結果に連携してNMRスペクトルも表示される。そのため、SIMCA解析結果とNMRスペクトルの形状とを合わせて評価することができ、解析結果の意味づけを考察することがより容易になる。 The above is the operation of each unit when performing SIMCA analysis. As can be understood from the above description, when the SIMCA analysis is performed, all information of the Couman plot 460, the modeling plots 462 and 464, the discriminating power chart 466, and the NMR spectrum chart 450 is organically combined on the UI screen 430. When any operation is performed, the operation result is reflected in real time on all the related charts. In particular, it is easy to quickly re-execute SIMCA analysis after deleting a sample from the analysis target or selecting a variable, and to derive a more accurate analysis result. On the UI screen 430, an NMR spectrum is also displayed in cooperation with the SIMCA analysis result. Therefore, the SIMCA analysis result and the shape of the NMR spectrum can be evaluated together, and it becomes easier to consider the meaning of the analysis result.

また、SIMCA解析では、各変数についてモデリング力と識別力という二つの尺度が算出される。識別力は２つのクラスについてどの変数が分類に有効であるかを表している。下がって、識別力に基づいて分類に寄与する変数を抽出することができ、他の変数は分類にあまり寄与しないノイズと考えることができる。この実施形態では、上述したように分類に有効な変数のみを用いて再度PCA解析もしくはSIMCA解析を実行することができる。それにより、サンプルをより一層明確にクラス分けすることが可能となる。 In addition, in the SIMCA analysis, two measures are calculated for each variable: modeling ability and discriminating ability. The discriminating power represents which variable is effective for classification for the two classes. Based on the discriminating power, variables that contribute to classification can be extracted, and other variables can be considered as noise that does not contribute much to classification. In this embodiment, as described above, the PCA analysis or SIMCA analysis can be executed again using only the variables effective for classification. This makes it possible to classify the samples more clearly.

図３９は、SIMCA解析で選択された識別力の高い変数を使ってSIMCA解析計算又はPCA解析計算を再実行するための処理手順を示す。 FIG. 39 shows a processing procedure for re-executing SIMCA analysis calculation or PCA analysis calculation using a variable having high discriminating power selected in SIMCA analysis.

図３９に示すように、ステップ５００でSIMCA解析が実施され、ステップ５０２で、識別力チャート４６６上で識別力の閾値が指定されると、ステップ５０４で、その閾値以上の識別力をもつ変数が抽出される。その後、ステップ５０６で、SIMCAコントロールパネル４３８上の「Set_D.P.」ボタンが操作されると、ステップ５０８で、抽出された変数だけを用いてSIMCA解析計算が再度実行され、SIMCA解析結果が表示される。 As shown in FIG. 39, when a SIMCA analysis is performed in step 500 and a threshold value of discrimination power is specified on the discrimination power chart 466 in step 502, a variable having discrimination power equal to or higher than the threshold value is determined in step 504. Extracted. Thereafter, when the “Set_D.P.” Button on the SIMCA control panel 438 is operated in step 506, the SIMCA analysis calculation is executed again using only the extracted variable in step 508, and the SIMCA analysis result is displayed. Is done.

また、ステップ５０４の後、ステップ５１０で、PCAコントロールパネル４３６上の「Set_D.P.」ボタンが操作されると、ステップ５１２で、抽出された変数だけを用いて、SIMCA解析の対象となったサンプルについてPCA解析計算が再度実行され、PCA解析結果が表示される。また、ステップ５０４の後、ステップ５１４で、PCAコントロールパネル４３６上の「Add_D.P.」ボタンが操作されると、ステップ５１６で、現在の解析に使用することになっている変数に、抽出された変数を追加した上で、それらの変数を用いてSIMCA解析の対象となったサンプルについてPCA解析計算が再度実行され、PCA解析結果が表示される。 In addition, after step 504, when the “Set_D.P.” Button on the PCA control panel 436 is operated in step 510, in step 512, only the extracted variables are used for SIMCA analysis. The PCA analysis calculation is executed again for the sample, and the PCA analysis result is displayed. In addition, after step 504, when the “Add_D.P.” Button on the PCA control panel 436 is operated in step 514, it is extracted in step 516 as a variable to be used for the current analysis. After adding these variables, PCA analysis calculation is performed again for the sample that was subjected to SIMCA analysis using those variables, and the PCA analysis results are displayed.

ところで、上述した「Add_D.P.」ボタンを用いた変数の追加機能は、例えば次のような用途に利用可能である。すなわち、クラスが３つ以上存在する場合、それぞれ２つのクラスの組合せを第１と第２のクラスとしてSIMCA解析を行って、分類に有効な変数を求め、その有効な変数を、上述した「Add_D.P.」ボタン操作で逐次に加算して行くことができる。すなわち、まず、最初に選んだ２つのクラスの組合せでSIMCA解析を実施して有効な変数を選択した後、PCAコントロールパネル４３６上の「Set_D.P.」ボタン操作で、その選択された変数を使用対象に設定する。次に、別の２つのクラスの組合せでSIMCA解析を実施して有効な変数を選択した後、PCAコントロールパネル４３６上の「ADD_D.P.」ボタン操作で、その選択された変数を、最初に選択された変数に追加する。以後、その他の２つのクラスの組み合わせのそれぞれについて、SIMCA解析を実施して有効な変数を選択した後上記「ADD_D.P.」ボタン操作を繰り返す。これにより、ぞれぞれのクラスの組み合わせで有効な変数が使用対象にセットされることになり、いずれのクラスの組み合わせにおいても有効ではないとされた変数だけが、使用対象から外される。 By the way, the variable addition function using the “Add_D.P.” Button described above can be used for the following purposes, for example. That is, when there are three or more classes, SIMCA analysis is performed with each combination of the two classes as the first and second classes to obtain a variable effective for classification, and the effective variable is defined as “Add_D” described above. .P. "Button can be added sequentially. That is, first, a SIMCA analysis is performed with the combination of the two classes selected first, and a valid variable is selected. Then, by operating the “Set_D.P.” Button on the PCA control panel 436, the selected variable is selected. Set the target. Next, after performing SIMCA analysis with another two class combination and selecting a valid variable, the selected variable is first displayed by operating the “ADD_D.P.” Button on the PCA control panel 436. Append to selected variable. Thereafter, SIMCA analysis is performed for each of the other two combinations of classes to select valid variables, and then the above “ADD_D.P.” Button operation is repeated. As a result, a variable that is effective for each class combination is set as a target of use, and only a variable that is not valid for any combination of classes is excluded from the target of use.

以上、図２３〜図３９を参照して、本発明の一実施形態にかかるNMRスペクトルの統計処理装置（図１に示した統計処理部１１６）の構成、機能、動作について説明した。以下では、このNMRスペクトルの統計処理装置を特定の用途に用いる場合の使用例を紹介する。 The configuration, function, and operation of the NMR spectrum statistical processing apparatus (the statistical processing unit 116 illustrated in FIG. 1) according to an embodiment of the present invention have been described above with reference to FIGS. 23 to 39. Below, the usage example in the case of using this statistical processing apparatus of a NMR spectrum for a specific use is introduced.

図４０は、メタボノミクスにおいてバイオマーカを決定する用途にこの統計処理装置を使用する場合の使用手順を示す。 FIG. 40 shows a procedure for using this statistical processing apparatus for the purpose of determining a biomarker in metabonomics.

図４０に示すように、ステップ５２０で分析が開始される。この分析では、例えば、人からの採取物に基づいてその人が特定の疾病に罹っているか否かを判断するためのバイオマーカを決定したいという場合、既にその疾病に罹っている人々からの採取物と健康な人々からの採取物とが分析対象のサンプルとして選ばれる。クラス毎に、異なるカラーがプロットカラーとして指定される。まず、ステップ５２２で、分析対象の多数のサンプルについてPCA解析が行われる。ステップ５２４で、スコアプロット４４０などに示された第１主成分（PC1）と第２主成分（PC2）のスコアの分散の和が、ユーザが決めた或る閾値、例えば６０％、を超えているかどうかが、ユーザにより判断される。もし、上記スコアの分散の和が閾値６０％を超えているならば、ステップ５２６で、ユーザは、ステップ５２２で行ったPCA解析で得られたPCAモデルにおいて、サンプルは明快にクラス分けされていると判断する。その場合、ステップ５２８で、ユーザは、ローディングプロット４４２、寄与率チャート４４４、ローディングチャート４４６及びNMRスペクトルチャート４５０などを参照しながら、寄与率の高いケミカルシフト値又はそれに対応するNMRスペクトルのピークを選定し、それに基づきバイオマーカを決定する。 As shown in FIG. 40, analysis begins at step 520. In this analysis, for example, if you want to determine a biomarker to determine whether a person is suffering from a particular disease based on a sample from that person, a sample from a person already suffering from the disease Goods and samples from healthy people are selected as samples for analysis. Different colors are specified as plot colors for each class. First, in step 522, PCA analysis is performed on a number of samples to be analyzed. In step 524, the sum of the variances of the scores of the first principal component (PC1) and the second principal component (PC2) shown in the score plot 440 or the like exceeds a certain threshold determined by the user, for example, 60%. It is determined by the user whether or not it exists. If the sum of the variances of the scores exceeds the threshold value of 60%, in step 526, the user clearly classifies the sample in the PCA model obtained by the PCA analysis performed in step 522. Judge. In that case, in step 528, the user selects a chemical shift value having a high contribution rate or a corresponding NMR spectrum peak while referring to the loading plot 442, the contribution rate chart 444, the loading chart 446, the NMR spectrum chart 450, and the like. Then, the biomarker is determined based on that.

他方、ステップ５２４で、上記スコアの分散の和が閾値６０％以下であるならば、ユーザは、ステップ５２２で行ったPCA解析で得られたPCAモデルでは、サンプルは明快にはクラス分けされていないと判断し、次に、ステップ５３２のSIMCA解析を実行する。SIMCA解析においては、ステップ５３４で、検査対象のサンプルが第1と第２の２つのクラスに分けられる。例えば、その疾病に罹っている人々から採取されたサンプルが第1のクラスに、健康な人々から採取されたサンプルが第２のクラスに割り当てられる。この２つのクラスについてSIMCA解析が実行され、そのSIMCA解析結果が表示されると、ユーザは、ステップ５３６で、識別力チャート４６６上で識別力の閾値を設定して、その閾値以上の識別力を持った変数を選択する。その後、ユーザは、PCAコントロールパネル４３６の「Set_D.P.」ボタンを操作する。それにより、ステップ５２２で、選択された変数だけを用いて、同じサンプルに対してPCA解析計算が再度実行される。その結果、前回のPCA解析計算の結果よりも、より明確にサンプルが分散したPCAモデルが得られ、バイオマーカの決定がより容易になる。 On the other hand, in step 524, if the sum of the variances of the scores is 60% or less, the user does not clearly classify the sample in the PCA model obtained by the PCA analysis performed in step 522. Then, the SIMCA analysis in step 532 is executed. In the SIMCA analysis, in step 534, the sample to be inspected is divided into first and second classes. For example, samples taken from people suffering from the disease are assigned to a first class, and samples taken from healthy people are assigned to a second class. When the SIMCA analysis is executed for these two classes and the SIMCA analysis result is displayed, the user sets a threshold value of the discrimination power on the discrimination power chart 466 in step 536, and the discrimination power equal to or higher than the threshold value is set. Select the variable you have. Thereafter, the user operates the “Set_D.P.” Button on the PCA control panel 436. Thereby, in step 522, the PCA analysis calculation is performed again on the same sample using only the selected variable. As a result, a PCA model in which the sample is more clearly dispersed than the result of the previous PCA analysis calculation is obtained, and the determination of the biomarker becomes easier.

図４１は、テストサンプルの性向を診断する用途に、本発明の一実施形態にかかるNMRスペクトルの統計処理装置（図１に示した統計処理部１１６）を使用する場合における使用手順の一例を示す。 FIG. 41 shows an example of a use procedure when the NMR spectrum statistical processing apparatus (the statistical processing unit 116 shown in FIG. 1) according to an embodiment of the present invention is used for diagnosing the propensity of a test sample. .

図４１に示すように、ステップ５４０で診断が開始される。この診断は、例えば、或る人々について特定の疾病に罹っているか否かを診断するものである。ステップ５４２で、２つの基本クラスつまり第1と第２のクラスが設定される。例えば、既に疾病に罹っている人々から採取されたサンプルが第1のクラスに、健康な人々から採取されたサンプルが第２のクラスに割り当てられる。また、ステップ５４４で、診断対象の人々から採取されたサンプルがテストクラスとして設定される。クラス毎に、異なるカラーがプロットカラーとして指定される。 As shown in FIG. 41, diagnosis is started at step 540. This diagnosis is, for example, diagnosing whether or not a certain person has a specific disease. In step 542, two basic classes are set, the first and second classes. For example, samples taken from people who already have a disease are assigned to a first class, and samples taken from healthy people are assigned to a second class. In step 544, a sample collected from people to be diagnosed is set as a test class. Different colors are specified as plot colors for each class.

その後、ステップ５４６で、それら３つのクラスのサンプルを統合した多変量データ行列についてPCA解析が実行される。また、ステップ５４８で、それら３つのクラスのサンプルについてSIMCA解析が行なわれる。SIMCA解析では、テストクラスのサンプルの第1のクラスと第２のクラスにそれぞれに対する距離が求まり、それがクーマンプロット４４０上に表現される。スコアププロット４４０でも、クーマンプロットでも、３つのクラスのサンプルはクラス別の異なるプロットカラーで表示されるので、それらの相関が一目で分かる。 Thereafter, in step 546, a PCA analysis is performed on the multivariate data matrix integrating the three classes of samples. Also, in step 548, SIMCA analysis is performed on these three classes of samples. In the SIMCA analysis, the distance to each of the first class and the second class of the sample of the test class is obtained and expressed on the Couman plot 440. In both the score plot 440 and the Kuman plot, the three classes of samples are displayed in different plot colors by class, so that their correlation can be seen at a glance.

ステップ５５０で、ユーザは、PCA解析結果とSIMCA解析結果を参照することで、テストサンプルである診断対象の人々と、第1クラスである当該疾病に罹っている人々と、第２クラスである健康な人々との間の相関を把握し、そこから、診断対象の人々が当該疾病に罹っているのか、健康であるのか、或は未定なのかなどの判断を下す。 In step 550, the user refers to the PCA analysis result and the SIMCA analysis result, so that the test target people, the first class of people suffering from the disease, and the second class of health. From this, the person who is diagnosed has the disease, is healthy, or is undecided.

以上、本発明の実施形態を説明したが、この実施形態は本発明の説明のための例示にすぎず、本発明の範囲をこの実施形態にのみ限定する趣旨ではない。本発明は、その要旨を逸脱することなく、その他の様々な態様でも実施することができる。例えば、上記の実施形態では、多変量解析結果であるスコアプロット、ローディング及びクーマンプロットを、UI画面上で２本の座標軸からなる二次元図形の形で表示したが、それに代えて又はそれと併用して、３本の座標軸からなる三次元図形として形成し、その三次元図形を任意の視線方向から表示できるようにすることもできる。また、本発明は、メタボノミクスだけでなく、それ以外の用途にも適用することができる。 As mentioned above, although embodiment of this invention was described, this embodiment is only the illustration for description of this invention, and is not the meaning which limits the scope of the present invention only to this embodiment. The present invention can be implemented in various other modes without departing from the gist thereof. For example, in the above embodiment, the score plot, loading, and Cooman plot, which are the results of multivariate analysis, are displayed in the form of a two-dimensional figure consisting of two coordinate axes on the UI screen, but instead of or in combination with it. It is also possible to form a three-dimensional figure composed of three coordinate axes so that the three-dimensional figure can be displayed from an arbitrary line-of-sight direction. Further, the present invention can be applied not only to metabonomics but also to other uses.

本発明に従うNMRデータ処理装置の一実施形態の全体的な構成と機能を示すブロック図。The block diagram which shows the whole structure and function of one Embodiment of the NMR data processing apparatus according to this invention. NMRスペクトル処理１１０の流れを示すフローチャート。The flowchart which shows the flow of the NMR spectrum process 110. NMRスペクトル（実部）と、これに絶対値微分を施した結果であるADスペクトルの一例を示す図。The figure which shows an example of the AD spectrum which is a result of having given an NMR value (real part) and absolute value differentiation to this. 位相補正を説明する図。The figure explaining phase correction. ベースライン補正を説明する図。The figure explaining baseline correction. 最適化バケット積分を用いたデータ縮約処理１１２の流れを示すフローチャート。The flowchart which shows the flow of the data reduction process 112 using an optimization bucket integral. スペクトル投影処理１７２の原理を説明する図。The figure explaining the principle of the spectrum projection process 172. FIG. スペクトル投影処理１７２の流れを示すフローチャート。The flowchart which shows the flow of the spectrum projection process 172. 投影ADスペクトルのバケット積分処理１７２の原理を説明する図。The figure explaining the principle of the bucket integration process 172 of a projection AD spectrum. バケット積分処理１７２の流れを示すフローチャート。The flowchart which shows the flow of the bucket integration process 172. 自動積分ブロック設定に基づくバケットの自動修正処理１７４の原理を説明する図。The figure explaining the principle of the automatic correction process 174 of the bucket based on an automatic integration block setting. 自動積分ブロック設定に基づくバケットの自動修正処理１７４の流れを示すフローチャート。The flowchart which shows the flow of the automatic correction process 174 of the bucket based on an automatic integration block setting. NMRスペクトル上の３分裂と４分裂のピークの例を示す図。The figure which shows the example of the peak of 3 split and 4 split on a NMR spectrum. 指定ピーク情報のデータ構造例を示す図。The figure which shows the data structure example of designation | designated peak information. 指定ピーク情報に基づくバケット自動修正処理１７６の流れを示すフローチャート。The flowchart which shows the flow of the bucket automatic correction process 176 based on designated peak information. バケットの手動修正処理１７８の原理を示す図。The figure which shows the principle of the manual correction process 178 of a bucket. バケットの手動修正処理１７８の流れを示すフローチャート。The flowchart which shows the flow of the manual correction process 178 of a bucket. 最適化バケットセットのデータ構造例を示す図。The figure which shows the data structure example of an optimization bucketset. 多数ADスペクトルの最適化バケット積分処理１８０の流れを示す。The flow of the optimization bucket integration process 180 of many AD spectrum is shown. 本発明に従うNMRデータ処理装置の別の実施形態の全体的な構成と機能を示すブロック図The block diagram which shows the whole structure and function of another embodiment of the NMR data processing apparatus according to this invention. 最適化バケット積分を用いたデータ縮約処理３１２の流れを示すフローチャート。The flowchart which shows the flow of the data reduction process 312 using an optimization bucket integral. ヒストグラムの分析処理３１４の流れを示すフローチャート。The flowchart which shows the flow of the analysis process 314 of a histogram. 本発明の一実施形態にかかるNMRスペクトルの統計処理装置、すなわち、図１に示した統計処理部１１６、の機能的な構成を示すブロック図。The block diagram which shows the functional structure of the statistical processing apparatus of the NMR spectrum concerning one Embodiment of this invention, ie, the statistical processing part 116 shown in FIG. PCAテーブル３５０の構成例を示す図。The figure which shows the structural example of the PCA table 350. FIG. 図２３に示した多変量解析部３４０のPCA解析部３４６とSIMCA解析部３４８並びに表示制御部３４４の機能をより詳細に示したブロック図。24 is a block diagram showing in more detail the functions of the PCA analyzer 346, SIMCA analyzer 348, and display controller 344 of the multivariate analyzer 340 shown in FIG. PCA解析結果を表示した場合におけるUI３４６の画面例を示す図。The figure which shows the example of a screen of UI346 when a PCA analysis result is displayed. SIMCA解析結果を表示した場合におけるUI３４６の画面例を示す図。The figure which shows the example of a screen of UI346 when a SIMCA analysis result is displayed. ファイル名の表示機能を説明するためのスコアプロット４４０内の部分領域の拡大図。The enlarged view of the partial area | region in the score plot 440 for demonstrating the display function of a file name. ケミカルシフト値の表示機能を説明するためのローディングプロット４４２内の部分領域の拡大図。The enlarged view of the partial area | region in the loading plot 442 for demonstrating the display function of a chemical shift value. スコアプロット４４０上でのプロット選択機能とNMRスペクトル連携機能とを説明するためのスコアプロット４４０とNMRスペクトルをャート４５０の例を示す図。The figure which shows the example of the score plot 440 and NMR spectrum yart 450 for demonstrating the plot selection function and the NMR spectrum cooperation function on the score plot 440. FIG. スコアプロット４４０上でのプロットの削除機能を説明するためのスコアプロット４４０の例を示す図。The figure which shows the example of the score plot 440 for demonstrating the deletion function of the plot on the score plot 440. FIG. ローディングプロット４４２上でのプロット選択機能とそれに連携する寄与率チャート４４４とローディングチャート４４６の表示変更機能とを説明するためのローディングプロット４４２と寄与率チャート４４４とローディングチャート４４６の例を示す図。The figure which shows the example of the loading plot 442, the contribution rate chart 444, and the loading chart 446 for demonstrating the plot selection function on the loading plot 442, the contribution rate chart 444 linked to it, and the display change function of the loading chart 446. ローディングプロット４４２上でのプロットの削除機能を説明するためのローディングプロット４４２の例を示す図。The figure which shows the example of the loading plot 442 for demonstrating the deletion function of the plot on the loading plot 442. FIG. 寄与率チャート４４４又はローディングチャート４４６の拡大機能とNMRスペクトル連携機能とを説明するための寄与率チャート４４４とローディングチャート４４６とNMRスペクトルをャート４５０の例を示す図。The figure which shows the example of the chart 450 of the contribution ratio chart 444, the loading chart 446, and NMR spectrum for demonstrating the expansion function of the contribution ratio chart 444 or the loading chart 446, and an NMR spectrum cooperation function. クーマンプロット４６０上でのプロット選択機能とNMRスペクトル連携機能とを説明するためのクーマンプロット４６０とNMRスペクトルをャート４５０の例を示す図。The figure which shows the example of the Kuart plot 460 and NMR spectrum Yart 450 for demonstrating the plot selection function on the Couman plot 460, and an NMR spectrum cooperation function. クーマンプロット４６０上でのプロットの削除機能を説明するためのクーマンプロット４６０の例を示す図。The figure which shows the example of the Cooman plot 460 for demonstrating the deletion function of the plot on the Cooman plot 460. FIG. モデリングチャート４６２、４６４及び識別力チャート４６６での変数の選択機能を説明するためのモデリングチャート４６２、４６４及び識別力チャート４６６の例を示す図。The figure which shows the example of the modeling charts 462 and 464 and the discrimination power chart 466 for demonstrating the variable selection function in the modeling charts 462 and 464 and the discrimination power chart 466. 変数選択後のSIMCA解析計算の再実行を説明するためのUI画面４３０の例を示す図。The figure which shows the example of the UI screen 430 for demonstrating re-execution of the SIMCA analysis calculation after variable selection. 変数選択後のSIMCA解析計算とPCA解析計算の再実行の手順を示すフローチャート。The flowchart which shows the procedure of re-execution of SIMCA analysis calculation after variable selection and PCA analysis calculation. メタボノミクスにおいてバイオマーカを決定する用途に、本発明の一実施形態にかかるNMRスペクトルの統計処理装置（図１に示した統計処理部１１６）を使用する場合における使用手順の一例を示すフローチャート。The flowchart which shows an example of the use procedure in the case of using the statistical processing apparatus (statistical processing part 116 shown in FIG. 1) of the NMR spectrum concerning one Embodiment of this invention for the use which determines a biomarker in metabonomics. テストサンプルの性向を診断する用途に、本発明の一実施形態にかかるNMRスペクトルの統計処理装置（図１に示した統計処理部１１６）を使用する場合における使用手順の一例を示すフローチャート。The flowchart which shows an example of the use procedure in the case of using the statistical processing apparatus (statistical processing part 116 shown in FIG. 1) of the NMR spectrum concerning one Embodiment of this invention for the use which diagnoses the tendency of a test sample.

１００、３００コンピュータシステム（NMRデータ処理装置）
１０２ NMR装置
１０４、３０４プロセッサ
１０６、３０６記憶装置
１０８、３０８ FIDデータ入力処理
１１０、３１０ NMRスペクトル処理
１１２、３１２データ縮約処理
１２６絶対値微分処理（ADスペクトル生成処理）
１４０ NMRスペクトル（実部）
１４４、Sa、Sb ADスペクトル
１７１多数のADスペクトルの投影処理
１７２投影ADスペクトルのバケット積分処理
１７４自動積分ブロック設定に基づくバケットの自動修正処理
１７６指定ピーク情報に基づくバケットの自動修正処理
１７８バケットの手動修正処理
１８０多数ADスペクトルの最適化バケット積分処理
Sp 投影ADスペクトル
２００a−２００f、２００i−２００k バケット
２２０x、２２０y 自動設定された積分ブロック
２４０x、２４０y 指定ピーク情報に基づくピーク領域
２５０A−２５０C 指定ピーク情報セット
２８０A−２８０C 最適化バケットセット
３４０多変量解析部
３４２ユーザ要求入力部
３４４表示制御部
３４６ PCA解析部
３４８ SIMCA解析部
３５０ PCAテーブル
３５４ NMRスペクトルデータ
３５６ NMRヒストグラムデータ
３６０ PCAモデル読込・追加・保存部
３６２ PCA計算部
３６４解析データ記憶部
３６６解析結果保存部
３７０ PCAモデル読込・追加・保存部
３７２ SIMCA計算部
３７４ PCA解析画面制御部
３７６ SIMCA解析画面制御部
３７８ NMRスペクトル連携部
４３０ UI画面
４４０スコアプロット
４４２ローディングプロット
４４４寄与率チャート
４４６ローディングチャート
４５０ NMRスペクトルチャート
４６０クーマンプロット
４６２、４６４モデリング力プロット
４６６識別力プロット

100, 300 Computer system (NMR data processor)
102 NMR device 104, 304 Processor 106, 306 Storage device 108, 308 FID data input processing 110, 310 NMR spectrum processing 112, 312 Data reduction processing 126 Absolute value differentiation processing (AD spectrum generation processing)
140 NMR spectrum (real part)
144, Sa, Sb AD spectrum 171 Projection processing of many AD spectra 172 Bucket integration processing of projected AD spectrum 174 Automatic correction processing of bucket based on automatic integration block setting 176 Automatic correction processing of bucket based on specified peak information 178 Manual operation of bucket Correction processing 180 Multiple AD spectrum optimization bucket integration processing
Sp projected AD spectrum 200a-200f, 200i-200k bucket 220x, 220y automatically set integration block 240x, 240y peak area based on specified peak information 250A-250C specified peak information set 280A-280C optimized bucket set 340 multivariate analysis unit 342 User request input unit 344 Display control unit 346 PCA analysis unit 348 SIMCA analysis unit 350 PCA table 354 NMR spectrum data 356 NMR histogram data 360 PCA model reading / adding / saving unit 362 PCA calculation unit 364 Analysis data storage unit 366 Analysis result storage Unit 370 PCA model reading / addition / saving unit 372 SIMCA calculation unit 374 PCA analysis screen control unit 376 SIMCA analysis screen control unit 378 NMR spectrum linkage unit 430 UI screen 440 score plot 442 loading plot 4 44 Contribution chart 446 Loading chart 450 NMR spectrum chart 460 Cooman plot 462, 464 Modeling power plot 466 Discrimination power plot

Claims

NMR spectrum data defined by the mathematical formula (1) indicating the NMR characteristics of the sample, and means for performing absolute value differentiation on the NMR spectral data to obtain the absolute value differential spectral data defined by the mathematical formula (2);
  Data reduction means for reducing the absolute value differential spectrum data to histogram data by executing bucket integration using a bucket set consisting of a large number of buckets having unequal widths on the absolute value differential spectrum data. When,
  Multivariate analysis means for performing multivariate analysis on a multivariate data matrix comprising the histogram data;
With
  When δ is a chemical shift, j is an imaginary number symbol, R (δ) is a real part of the NMR spectrum, and I (δ) is an imaginary part of the NMR spectrum, Equations (1) and (2) are respectively
  NMR spectrum = R (δ) + j · I (δ) (1)

Because
The width of the bucket has a plurality of individual peak regions included in the absolute value differential spectrum.
The peak width is detected in advance so that it is not divided into buckets.
Is decided according to
NMR spectrum statistical processing device.

2. The apparatus according to claim 1, wherein the multivariate analysis means includes:
A multivariate data matrix input means for inputting a multivariate data matrix comprising the histogram data;
PCA analysis means for performing a PCA analysis calculation on the input multivariate data matrix;
PCA analysis result display means for receiving a result of the PCA analysis calculation and displaying a plurality of types of PCA analysis result charts on a user interface screen;
User request input means for inputting a user request using the user interface screen;
When the input user request is to select one or more samples, the NMR spectrum data of the selected sample is input, and the input NMR spectrum is represented on the chemical shift axis. and a NMR spectrum linkage means for displaying the chart on the user interface screen,
The PCA analysis result chart includes at least a score plot representing the score of the sample with respect to a plurality of principal components, a loading plot representing loading with respect to the plurality of principal components of a predetermined plurality of chemical shift values as variables, A contribution ratio / loading chart representing the contribution ratio or loading of a plurality of predetermined chemical shift values on the chemical shift axis, and
The contribution ratio / loading chart and the NMR spectrum chart are arranged and displayed in a direction perpendicular to the chemical shift axis so that the scales of the chemical shift axes coincide with each other.
NMR spectrum statistical processing device.

The apparatus of claim 2 .
If the input user request is to expand the contribution / loading chart,
The PCA analysis result display means and the NMR spectrum cooperation means maintain the coincidence of the scale of the chemical shift axis between the contribution rate / loading chart and the NMR spectrum chart, so that the contribution rate / loading An enlarged display of the chart and the NMR spectrum chart;
NMR spectrum statistical processing device.

The apparatus of claim 2 .
If the input user request is to delete one or more selected samples,
The PCA analysis means executes the PCA analysis calculation again for a multivariate data matrix consisting of NMR histogram data of samples other than the deleted sample,
The PCA analysis result display means receives the result of the PCA analysis calculation executed again and displays the PCA analysis result chart again.
NMR spectrum statistical processing device.

The apparatus of claim 2 .
If the input user request is to delete a chemical shift value as one or more selected variables,
The PCA analysis means executes the PCA analysis calculation again using a chemical shift value other than the deleted chemical shift value,
The PCA analysis result display means receives the result of the PCA analysis calculation executed again and displays the PCA analysis result chart again.
NMR spectrum statistical processing device.

2. The apparatus according to claim 1, wherein the multivariate analysis means includes:
A multivariate data matrix input means for inputting a multivariate data matrix of a plurality class consisting of the histogram data,
SIMCA analysis means for performing SIMCA analysis calculation on the input multivariate data matrix of the plurality of classes;
A SIMCA analysis result display means for receiving a plurality of types of SIMCA analysis result charts on a user interface screen in response to the result of the SIMCA analysis calculation;
User request input means for inputting a user request using the user interface screen;
When the input user request is to select one or more samples, the NMR spectrum data of the selected sample is input, and the input NMR spectrum is represented on the chemical shift axis. and a NMR spectrum linkage means for displaying the chart on the user interface screen,
The SIMMA analysis result chart includes at least a Cooman plot representing the distance of the sample with respect to the plurality of classes, and a modeling force representing a modeling force or discriminating power of a plurality of predetermined chemical shift values as variables on the chemical shift axis. / Discrimination chart and
The modeling force / discriminating power chart and the NMR spectrum chart are arranged and displayed in a direction orthogonal to the chemical shift axis so that the scales of the chemical shift axes coincide with each other.
NMR spectrum statistical processing device.

The apparatus of claim 6.
If the input user request is to enlarge the modeling power / discrimination power chart,
The SIMCA analysis result display means and the NMR spectrum cooperation means maintain the matching of the scale of the chemical shift axis between the modeling force / discrimination power chart and the NMR spectrum chart so that the modeling force / Displaying the discrimination power chart and the NMR spectrum chart in an enlarged manner;
NMR spectrum statistical processing device.

The apparatus of claim 6.
If the input user request is to delete one or more selected samples,
The SIMCA analysis means performs SIMCA analysis calculation again on a multivariate data matrix of a plurality of classes consisting of NMR histogram data of samples other than the deleted sample,
The SIMCA analysis result display means receives the result of the SIMCA analysis calculation performed again and displays the SIMCA analysis result chart again.
NMR spectrum statistical processing device.

The apparatus of claim 6.
When the input user request is to select a chemical shift value that is a variable whose discriminating power or modeling power is higher than a predetermined threshold and to request recalculation,
The SIMCA analysis means performs SIMCA analysis calculation again using only the selected chemical shift value,
The SIMCA analysis result display means receives the result of the SIMCA analysis calculation performed again and displays the SIMCA analysis result chart again.
NMR spectrum statistical processing device.

2. The apparatus according to claim 1, wherein the multivariate analysis means includes:
PCA analysis means for performing PCA analysis calculation on a multivariate data matrix comprising the histogram data;
PCA analysis result display means for receiving a result of the PCA analysis calculation and displaying a plurality of types of PCA analysis result charts on a user interface screen;
A multivariate data matrix input means for inputting a multivariate data matrix of a plurality of classes consisting of NMR histogram data of a plurality of samples assigned to a plurality of classes;
SIMCA analysis means for performing SIMCA analysis calculation on the input multivariate data matrix of the plurality of classes;
A SIMCA analysis result display means for receiving a plurality of types of SIMCA analysis result charts on a user interface screen in response to the result of the SIMCA analysis calculation;
User request input means for inputting a user request using the user interface screen;
Including
The PCA analysis result chart includes at least a score plot representing the score of the sample with respect to a plurality of principal components, a loading plot representing loading with respect to the plurality of principal components of a predetermined plurality of chemical shift values as variables, A contribution ratio / loading chart representing the contribution ratio or loading of a plurality of predetermined chemical shift values on the chemical shift axis, and
The SIMMA analysis result chart includes at least a Cooman plot representing the distance of the sample with respect to the plurality of classes, and a modeling force representing a modeling force or discriminating power of a plurality of predetermined chemical shift values as variables on the chemical shift axis. / Discrimination chart and
The user request input by the user request input means selects a chemical shift value that is a variable in which the discriminating power or modeling power displayed in the modeling power / discriminating power chart is higher than a predetermined threshold and re-executes the PCA analysis calculation. If it is a request for execution,
The PCA analysis means executes the PCA analysis calculation again using only the selected chemical shift value,
The PCA analysis result display means receives the result of the PCA analysis calculation executed again and displays the PCA analysis result chart again.
NMR spectrum statistical processing device.

The apparatus of claim 10 .
The PCA in which the user request input by the user request input means selects a chemical shift value that is a variable in which the discriminating power or modeling power displayed in the modeling power / discriminating power chart is higher than a predetermined threshold and adds a variable If it is a request to re-run the analysis calculation,
The PCA analysis means executes the PCA analysis calculation again using a variable obtained by adding the selected chemical shift value to the current chemical shift value.
The PCA analysis result display means receives the result of the PCA analysis calculation executed again and displays the PCA analysis result chart again.
NMR spectrum statistical processing device.

The apparatus of claim 10 .
The user request input by the user request input means selects a chemical shift value that is a variable in which the discriminating power or the modeling power displayed in the modeling power / discriminating power chart is higher than a predetermined threshold, and re-calculates SIMCA analysis calculation. If it is a request for execution,
The SIMCA analysis means performs the PCA analysis calculation again using only the selected chemical shift value,
The SIMCA analysis result display means receives the result of the SIMCA analysis calculation performed again and displays the SIMCA analysis result chart again.
NMR spectrum statistical processing device.

NMR spectrum data defined by the mathematical expression (1) indicating the NMR characteristics of the sample, and obtaining absolute value differential spectral data defined by the mathematical expression (2) by subjecting the NMR spectrum data to absolute value differentiation;
  Reducing the absolute value differential spectrum data into histogram data by performing bucket integration using a bucket set consisting of multiple buckets having unequal widths on the absolute value differential spectrum data;
  Performing a multivariate analysis on a multivariate data matrix comprising the histogram data;
With
  When δ is a chemical shift, j is an imaginary number symbol, R (δ) is a real part of the NMR spectrum, and I (δ) is an imaginary part of the NMR spectrum, Equations (1) and (2) are respectively
  NMR spectrum = R (δ) + j · I (δ) (1)

Because
The width of the bucket is determined according to the detected peak width in advance so that individual peak areas included in the absolute value differential spectrum are not divided into a plurality of buckets. NMR data processing method.

An NMR spectrum data defined by the mathematical formula (1) indicating the NMR characteristics of the sample and an absolute value differential spectral data defined by the mathematical formula (2) by performing absolute value differentiation on the NMR spectral data are obtained in a computer. A program part to be executed, and
  A step of reducing the absolute value differential spectrum data to histogram data by performing bucket integration using a bucket set consisting of a large number of buckets having unequal widths on the absolute value differential spectrum data. A program part to be executed by
  A program portion for causing a computer to execute a multivariate analysis on a multivariate data matrix comprising the histogram data;
With
  When δ is a chemical shift, j is an imaginary number symbol, R (δ) is a real part of the NMR spectrum, and I (δ) is an imaginary part of the NMR spectrum, Equations (1) and (2) are respectively
  NMR spectrum = R (δ) + j · I (δ) (1)

Because
The width of the bucket is determined according to the detected peak width in advance so that individual peak areas included in the absolute value differential spectrum are not divided into a plurality of buckets. A machine-readable computer program for processing NMR data.