JPH05143636A

JPH05143636A - Method and device for discrimination and analysis

Info

Publication number: JPH05143636A
Application number: JP3332848A
Authority: JP
Inventors: Tokumasa Yakura; 得正矢倉; Toru Fujii; 徹藤井; Hirohide Ushida; 牛田　　博英; Yasuhiro Tsutsumi; 康弘堤; Atsushi Irie; 篤入江
Original assignee: Omron Corp; Omron Tateisi Electronics Co
Current assignee: Omron Corp
Priority date: 1991-11-22
Filing date: 1991-11-22
Publication date: 1993-06-11

Abstract

PURPOSE:To surely separate populations with a linear discrimination function regardless of the nonlinear and complicated boundary form of populations. CONSTITUTION:This method consists or a discrimination function setting device 12 which receives output data of a state setting device 11 and sets a linear discrimination function for separation to plural populations, a separation end discriminating device 13 which discriminates whether sample data is completely separated by the discrimination function obtained by the discrimination function setting device 12 or not, a first discrimination area narrowing-down device 14 which sets a vertical function, which is perpendicular to the obtained discrimination function and equally divides the number of data into two, in the case of a failure of separation and separates sample data of parts divided by this vertical function and inputs these data to the discrimination function setting device, and a discriminating device 15 which receives all discrimination functions obtained by the discrimination function setting device and uses the discrimination function for inputted recollected data W and determines the population to which this data W belongs.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、複数の母集団を分離す
るための判別分析方法及び装置に関するもので、特に判
別分析を行う基準となる判別関数の作成する方法の改良
並びにそれを行うための装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a discriminant analysis method and apparatus for separating a plurality of populations, and more particularly, to improve a method for creating a discriminant function serving as a reference for discriminant analysis, and to perform the method. Of equipment.

【０００２】[0002]

【従来の技術】判別分析は、ｐ種の変量を持つデータ
が、与えられたｋ個の母集団のどこに属するかを判別す
るもので、まず予め多次元のサンプルデータから最も判
別しやすい境界、すなわち、誤判別を生じる確率和を最
小にするような境界を判別関数として取出し、次いで新
たに与えられた未知の想起用データが係る判別関数のど
ちら側に位置しているかを判断することにより属する母
集団の特定を行うようになっている。2. Description of the Related Art Discriminant analysis discriminates to which of a given k population a data having p kinds of variables belongs. First, a boundary which is most easily discriminable from multidimensional sample data in advance, That is, a boundary that minimizes the sum of the probabilities of causing misjudgment is extracted as a discriminant function, and then it is determined by determining to which side of the discriminant function the newly given unknown recall data is located. It is designed to identify the population.

【０００３】[0003]

【発明が解決しようとする課題】しかし、上記した従来
の判別分析で用いられた線形の判別関数では、母集団の
境界形状が複雑になると、その母集団を厳密に分離でき
なくなる。そのため、想起用データがどの母集団に属す
るかの判断における信頼性が低下する。また、近年、ニ
ューラルネットやファジィで代表されるような非線形分
離のニーズが高くなり、係るニューラルネットを用いれ
ば、上記した複雑な境界形状を有する母集団の分離も可
能である。しかし、ニューラルネットを用いた場合、境
界形状が複雑になればなるほど、また、入力するサンプ
ルデータ数が多くなるほど学習に時間がかかるとともに
得られた計算結果の解析がしにくいなどの問題を有す
る。However, with the linear discriminant function used in the above-described conventional discriminant analysis, when the boundary shape of the population becomes complicated, the population cannot be separated exactly. Therefore, the reliability in determining which population the recall data belongs to decreases. Further, in recent years, the need for non-linear separation represented by neural networks and fuzzy has increased, and by using such neural networks, it is possible to separate the population having the complicated boundary shape described above. However, when a neural network is used, the more complex the boundary shape and the larger the number of input sample data, the more time it takes for learning, and it is difficult to analyze the obtained calculation results.

【０００４】本発明は、上記した背景に鑑みてなされた
もので、その目的とするところは、たとえ分離する母集
団の境界形状が非線形で複雑であるような場合であって
も、線形な判別関数でもって確実に分離することがで
き、しかも、係る分離に要する学習時間等を短時間で処
理することのできる判別分析方法及び装置を提供するこ
とにある。The present invention has been made in view of the above background, and an object of the present invention is to make a linear discrimination even if the boundary shape of the population to be separated is non-linear and complicated. It is an object of the present invention to provide a discriminant analysis method and device that can be reliably separated by a function and can process learning time and the like required for the separation in a short time.

【０００５】[0005]

【課題を解決するための手段】上記した目的を達成する
ために、本発明に係る判別分析方法では、サンプルデー
タから形成される複数の母集団に対して判別分析を行っ
て判別関数を求め、係る判別関数で母集団の分離ができ
ない時には、前記判別関数に垂直な所定の垂直関数を引
き、その垂直関数で分離される領域に位置するサンプル
データ毎に判別関数を求めるようにした。In order to achieve the above object, in the discriminant analysis method according to the present invention, discriminant analysis is performed on a plurality of populations formed from sample data to obtain a discriminant function, When the population cannot be separated by such a discriminant function, a predetermined vertical function perpendicular to the discriminant function is drawn, and the discriminant function is obtained for each sample data located in the region separated by the vertical function.

【０００６】また、サンプルデータから形成される複数
の母集団に対して判別分析を行って判別関数を求め、係
る判別関数で母集団の分離ができない時に、その分離で
きない部分を挟むようにして前記判別関数と平行な２つ
の関数を設定し、それら両関数で囲まれる領域に対し、
判別関数と平行な方向に度数分布をとり、その分布状態
からデータを分離できるか否かを判断し、分離できる場
合にはその分布を分ける仕切り値を求め、一方分離でき
ない場合には前記領域内に位置するサンプルデータに対
し判別関数を行うようにしてもよい。さらには、上記２
つの方法を適宜組み合わせて実行するようにしても良
い。Further, when a discriminant function is obtained by performing discriminant analysis on a plurality of populations formed from sample data, and when the populations cannot be separated by such discriminant function, the discriminant function is sandwiched by the non-separable portions. Set two functions parallel to, and for the area surrounded by both functions,
The frequency distribution is taken in the direction parallel to the discriminant function, and it is judged whether the data can be separated from the distribution state. If it can be separated, the partition value for dividing the distribution is obtained. You may make it perform a discriminant function with respect to the sample data located in. Furthermore, the above 2
The two methods may be combined appropriately and executed.

【０００７】また、上記方法を実施するに適した判別分
析装置としては、入力されたサンプルデータに基づき複
数の母集団に分離すべく確率的に最も誤判別の少なくな
るような線形の判別関数を設定する判別関数設定装置
と、その判別関数設定装置にて得られた判別関数によっ
てサンプルデータが完全に分離されたか否かを判断する
分離終了判定装置と、サンプルデータが完全に分離され
なかった場合に、求められた判別関数に垂直な垂直関数
を引くと共に、その垂直関数で分けられた部分のサンプ
ルデータをそれぞれ分離して前記判別関数設定装置に入
力する第１判別領域絞り装置とから構成することであ
る。Further, as a discriminant analysis apparatus suitable for implementing the above method, a linear discriminant function that stochastically minimizes misjudgment so as to separate into a plurality of populations based on input sample data is used. When the discriminant function setting device to be set, the separation end determination device that determines whether or not the sample data is completely separated by the discriminant function obtained by the discriminant function setting device, and the sample data is not completely separated In addition, a vertical function perpendicular to the obtained discriminant function is subtracted, and the sample data of the portion divided by the vertical function is separated and input into the discriminant function setting device. That is.

【０００８】また、上記第１判別領域絞り装置に替え
て、求められた判別関数に平行な２つの関数によって分
離できない部分を挟み、その挟まれた部分を判別関数と
平行な方向に度数分布を取り、その分布状態からデータ
を分離できる時にはその仕切り値を設定し、分離できな
い時は前記２つの関数間に挾まれた領域に属するサンプ
ルデータを前記判別関数設定装置に入力する第２判別領
域絞り装置を設けても良い。Further, in place of the first discriminant region diaphragm device, a portion which cannot be separated is sandwiched by two functions which are parallel to the discriminant function obtained, and the sandwiched portion is subjected to frequency distribution in a direction parallel to the discriminant function. When the data can be separated from the distribution state, the partition value is set, and when the data cannot be separated, the second discriminant region diaphragm for inputting the sample data belonging to the region sandwiched between the two functions to the discriminant function setting device. A device may be provided.

【０００９】[0009]

【作用】与えられたサンプルデータに対して線形の判別
関数を設定し、その判別関数により母集団を分離できた
か否かを判断する。そして、分離できない場合には、得
られた判別関数を基準にして得られる垂直な垂直関数、
或いはその判別関数に平行な関数により新たな領域を形
成し、その新たな領域のサンプルデータに対して判別関
数を求める。そして、母集団同士を完全に分離できるま
で行う。これにより、たとえ母集団の境界形状が非線形
であったとしても、線形の判別関数でもって分離され
る。A linear discriminant function is set for given sample data, and it is determined whether the population can be separated by the discriminant function. Then, when it is not possible to separate, a vertical vertical function obtained based on the obtained discriminant function,
Alternatively, a new region is formed by a function parallel to the discriminant function, and the discriminant function is obtained for the sample data of the new region. Then, the processes are repeated until the populations are completely separated. As a result, even if the boundary shape of the population is non-linear, they are separated by a linear discriminant function.

【００１０】また、必要に応じ、上記判別関数に加え、
所定の判別関数に平行な方向に度数分布をとり、得られ
る仕切り値を用いることにより、少ない判別関数でもっ
て母集団の分離が行われる。If necessary, in addition to the above discriminant function,
By taking the frequency distribution in the direction parallel to the predetermined discriminant function and using the obtained partition value, the population can be separated with a small discriminant function.

【００１１】[0011]

【実施例】以下、本発明に係る判別分析方法及び装置の
好適な実施例を添付図面を参照にして詳述する。図１は
本発明に係る判別分析装置の第１実施例を示している。
同図に示すように、母集団の判別を行うための多変量解
析用のサンプルデータＸijを出力する状態設定装置１１
と、その状態設定装置１１の出力データを受け取り、複
数の母集団に分離すべく確率的に最も誤判別の少なくな
るような線形の判別関数を設定する判別関数設定装置１
２と、その判別関数設定装置１２にて得られた判別関数
によってサンプルデータが完全に分離されたか否かを判
断する分離終了判定装置１３と、サンプルデータが完全
に分離されなかった場合に、求められた判別関数に垂直
な分割線（垂直関数）を、データ数を２等分する位置に
引くと共に、その垂直関数で分けられた部分のサンプル
データをそれぞれ分離して判別関数設定装置１２に入力
する第１判別領域絞り装置１４と、判別関数設定装置１
２にて得られたすべての判別関数を受け取り、入力され
た想起データＷに対しその判別関数を用いてその想起デ
ータＷが属する母集団を決定する判別装置１５とを備え
ている。ここで、上記状態設定装置１１で設定される
サンプルデータＸij（ｉは変量の数（ｉ＝１〜ｎ）を、
ｊはサンプル数（ｊ＝１〜Ｎ）をそれぞれ意味する）の
具体的なデータ構造は、Ｘ1 ＝［Ｘ11，Ｘ21，…，Ｘn1］Ｘ2 ＝［Ｘ12，Ｘ22，…，Ｘn2］：：Ｘj ＝［Ｘ1j，Ｘ2j，…，Ｘnj］：：ＸN ＝［Ｘ1N，Ｘ2N，…，ＸnN］のようになっている。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS Preferred embodiments of the discriminant analysis method and apparatus according to the present invention will be described in detail below with reference to the accompanying drawings. FIG. 1 shows a first embodiment of the discriminant analysis apparatus according to the present invention.
As shown in the figure, the state setting device 11 that outputs sample data Xij for multivariate analysis for discriminating the population.
And a discriminant function setting device 1 that receives output data of the state setting device 11 and sets a linear discriminant function that stochastically minimizes misjudgment so as to be separated into a plurality of populations.
2, a separation end determination device 13 that determines whether or not the sample data is completely separated by the discriminant function obtained by the discriminant function setting device 12, and a determination result when the sample data is not completely separated. A dividing line (vertical function) perpendicular to the determined discriminant function is drawn at a position that divides the number of data into two parts, and the sample data of the portion divided by the vertical function is separated and input to the discriminant function setting device 12. First discriminant region diaphragm device 14 and discriminant function setting device 1
The discriminator 15 receives all the discriminant functions obtained in 2 and uses the discriminant function for the input recall data W to determine the population to which the recall data W belongs. Here, the sample data Xij (i is the number of variables (i = 1 to n) set by the state setting device 11 is
The specific data structure of the number of samples (j means the number of samples (j = 1 to N)) is: X1 = [X11, X21, ..., Xn1] X2 = [X12, X22, ..., Xn2] :: Xj = [X1j, X2j, ..., Xnj] :: XN = [X1N, X2N, ..., XnN].

【００１２】また、判別関数設定装置１１で形成される
判別関数Ｚk （Ｘ）のデータ構造は、Ｚk （Ｘ）＝ak1*Ｘ1+ak2*Ｘ2+… +akn*Ｘn+bk となっている。ここで、aki 並びにbkは定数項である。
また、ｋは不連続な自然数であり、判別される母集団の
番号を示す。そして、第１判別領域絞り装置１４で形成
される判別関数に垂直な分割線ＺＴ1 （Ｘ）のデータ構
造は、ＺＴk （Ｘ）＝ａtk1*Ｘ1+ａtk2*Ｘ2+…ａtkn*Ｘn+ｂtk となっている。ここで、ａtki 並びにｂtkは定数項であ
る。The data structure of the discriminant function Zk (X) formed by the discriminant function setting device 11 is Zk (X) = ak1 * X1 + ak2 * X2 + ... + akn * Xn + bk. Here, aki and bk are constant terms.
In addition, k is a discontinuous natural number and indicates the number of the determined population. The data structure of the dividing line ZT1 (X) perpendicular to the discriminant function formed by the first discriminant device 14 is ZTk (X) = atk1 * X1 + atk2 * X2 + ... atkn * Xn + btk. There is. Here, atki and btk are constant terms.

【００１３】さらに、判別関数にて分離された母集団Ａ
k とＢk は、それぞれ通し番号が付加され、この番号は
生成された順につけられるようになっている。すなわ
ち、ｓ回目の分割でＡk からできた集団はＡ2sとＡ(2s+
1)となり、同様にＢk からはＢ2sとＢ(2s+1)となる。な
お、ｓは１から始まる連続した自然数である。また、分
割された２つの母集団に対し付加する数字は、原点に近
いほうの母集団を２ｓにする。Furthermore, the population A separated by the discriminant function
Serial numbers are added to k and Bk, respectively, and these numbers are assigned in the order of generation. That is, the population formed from Ak in the sth division is A2s and A (2s +
1), and similarly, Bk becomes B2s and B (2s + 1). Note that s is a continuous natural number starting from 1. The number added to the two divided populations is 2s for the population closer to the origin.

【００１４】次に、上記した実施例を用いて本発明に係
る判別分析方法の第１実施例について図２に示すフロー
チャートを参照にして説明する。まず、所定の母集団を
分離するための判別関数の設定する手順について説明す
ると、状態設定装置１１によりすべての多変量解析用の
サンプルデータＸijを判別関数設定装置１２に入力し
（Ｓ１０１）、その判別関数設定装置１２では与えられ
たサンプルデータＸijから、２つの母集団Ａk ，Ｂk を
最も効率よく分割する線形の判別関数Ｚk （Ｘ）を求め
る（Ｓ１０２）。次いで、その判別関数Ｚk（Ｘ）を分
離終了判定装置１３に入力し、そこにおいてその得られ
た判別関数Ｚk （Ｘ）に全サンプルデータＸijを入力し
両母集団Ａ，Ｂに完全に分離され、判別可能であるか否
かが判断される（Ｓ１０３）。そして、判別可能である
場合には、すべての判別関数Ｚk （Ｘ）を判別装置１５
に入力し、処理を終了する（Ｓ１０４）。Next, a first embodiment of the discriminant analysis method according to the present invention will be described with reference to the flow chart shown in FIG. First, a procedure for setting a discriminant function for separating a predetermined population will be described. The state setting device 11 inputs all sample data Xij for multivariate analysis to the discriminant function setting device 12 (S101), The discriminant function setting device 12 obtains a linear discriminant function Zk (X) that most efficiently divides the two populations Ak and Bk from the given sample data Xij (S102). Then, the discriminant function Zk (X) is input to the separation end determination device 13, and all the sample data Xij is input to the obtained discriminant function Zk (X), and completely separated into both populations A and B. Then, it is judged whether or not the judgment is possible (S103). If it is possible to determine, all the determination functions Zk (X) are determined by the determination device 15
Is input to end the processing (S104).

【００１５】一方、判別できなかった場合には、判別関
数並びにサンプルデータを第１判別領域絞り装置１４に
送り、その判別関数Ｚk （Ｘ）に垂直で、しかも、両母
集団Ａk ，Ｂk を構成するサンプルデータの数を２等分
するような垂直関数ＺＴk （Ｘ）を求める。そして、求
められた分割線ＺＴk （Ｘ）でもって両母集団を分割す
る（Ｓ１０５）。なお、上述したごとく、この分割によ
り分けられた各領域には、通し番号が付与される。すな
わち、例えば母集団Ａ1 に対し１回目の分割によって分
割された２つの母集団は、２×１＝２，２×１＋１＝３
より、それぞれＡ2 ，Ａ3 となる。これにより４つの母
集団Ａ2s，Ａ（2s+1) ，Ｂ2s，Ｂ（2s+1) が生成される
ことになる。On the other hand, when the discrimination cannot be made, the discriminant function and the sample data are sent to the first discriminant region diaphragm device 14 and are perpendicular to the discriminant function Zk (X), and both populations Ak and Bk are constructed. A vertical function ZTk (X) that divides the number of sample data to be halved is obtained. Then, both populations are divided by the obtained dividing line ZTk (X) (S105). It should be noted that, as described above, serial numbers are given to the respective areas divided by this division. That is, for example, two populations obtained by dividing the population A1 by the first division are 2 × 1 = 2, 2 × 1 + 1 = 3.
Therefore, they become A2 and A3, respectively. As a result, four populations A2s, A (2s + 1), B2s, B (2s + 1) are generated.

【００１６】また、後の想起データの判別を行うため
に、各母集団の関連付けを明確化しておく必要がある。
よって、下記に示すようなＴａｉとＴｂｉのテーブルを
作成する。Further, in order to discriminate later recall data, it is necessary to clarify the association of each population.
Therefore, the table of Tai and Tbi as shown below is created.

【００１７】Ｔａｉ：［（Ａｉ，ＡL ，ＡL+1 ），（Ｆ），（ｄui，ｄoi）］Ｔｂｉ：［（Ｂｉ，ＢL ，ＢL+1 ），（Ｆ），（ｄui，ｄoi）］但しｉ，L ＝１，２，３，… ここで、各テーブルの意味は、第１項から第３項までは
母集団のＴｒｅｅ関係、すなわち、判別関数に直交する
垂直関数によりＡｉ（Ｂｉ）がＡL ，ＡL+1 （ＢL ，Ｂ
L+1 ）に分割されたことを意味する。なお、分岐先がな
い、すなわち対となる母集団Ａk とＢk とが分離された
場合には、上記第１項〜第３項は（０，０，０）とす
る。また、第４項目のＦは、判別関数により母集団Ａk
とＢk とが分離されたか否かのフラグを意味し、分離さ
れた場合には「１」または「−１］となり、分離されな
かった場合には「０」となる。具体的な算出手法は、母
集団Ａi の全データを判別関数Ｚi に代入し、すべての
Ｚi （Ｘ）＞０ならばＦ＝１すべてのＺi （Ｘ）＜０な
らばＦ＝−１とし、それ以外（判別関数の両側にＡiの
データが存在する）ならＦ＝０とする。なお、第５項と
第６項は、後述する第２，第３実施例における方法及び
装置に用いられるＴｒｅｅとの共通化を図るために設け
たもので、本例ではすべて（０，０）となる。したがっ
て、本実施例のみの使用であれば、第４項までにしても
良い。Tai: [(Ai, AL, AL + 1), (F), (dui, doi)] Tbi: [(Bi, BL, BL + 1), (F), (dui, doi)] where i, L = 1, 2, 3, ... Here, the meaning of each table is that from the first term to the third term, the Tree relationship of the population, that is, Ai (Bi) is defined by the vertical function orthogonal to the discriminant function. AL, AL + 1 (BL, B
L + 1). When there is no branch destination, that is, when the paired populations Ak and Bk are separated, the first to third terms are set to (0,0,0). The fourth item F is the population Ak
And Bk are flags indicating whether or not they have been separated, and are "1" or "-1" when they are separated, and "0" when they are not separated. As a specific calculation method, all the data of the population Ai are substituted into the discriminant function Zi, and if all Zi (X)> 0, then F = 1. If all Zi (X) <0, then set F = -1. Otherwise (Ai data exists on both sides of the discriminant function), F = 0. It should be noted that the fifth and sixth terms are provided for the purpose of sharing with Tree used in the methods and apparatuses in the second and third embodiments described later, and in this example, all (0, 0) Becomes Therefore, if only this embodiment is used, the items up to item 4 may be used.

【００１８】そして、上述した第１判別領域絞り装置１
４で生成された垂直関数ＺＴk （Ｘ）により分割された
母集団に含まれるサンプルデータをそれぞれ別々に分離
した状態で判別関数設定装置１２に送る（Ｓ１０６）。
すなわち、Ａ2sとＢ2sを対にし，またＡ（2s+1) とＢ
（2s+1) を対にして送る。そして、ステップ１０２に戻
り上記処理を繰り返し行う。なお、ステップ１０２で行
う判別分析は母集団に付与された通し番号の小さい方か
ら行うようになっている。Then, the above-mentioned first discrimination area diaphragm device 1
The sample data included in the population divided by the vertical function ZTk (X) generated in 4 is sent to the discriminant function setting device 12 in a state of being separated separately (S106).
That is, A2s and B2s are paired, and A (2s + 1) and B2
Send (2s + 1) as a pair. Then, the process returns to step 102 and the above process is repeated. The discriminant analysis performed in step 102 is performed from the smaller serial number assigned to the population.

【００１９】ここで、上記した学習方法の具体例を図３
を用いて説明する。まず、同図（Ａ）に示すように与え
られたサンプルデータにより２つの母集団Ａ1，Ｂ1 が
構成されているとする。すると、係るサンプルデータに
対して判別関数設定装置１２にて判別関数Ｚ1 を引く。
するとこの判別関数Ｚ1 では両母集団Ａ1 ，Ｂ1 を分離
することができないため、その判別関数Ｚ1 に直交する
所定の垂直関数ＺＴ1を引き、その垂直関数ＺＴ1 によ
り両母集団Ａ1 ，Ｂ1 をそれぞれＡ2 ，Ａ3 ，Ｂ2 ，Ｂ
3 に分割する（同図（Ｂ））。そしてこれら分割された
対となる母集団Ａ2 ，Ｂ2 に属するサンプルデータ並び
に、Ａ3 ，Ｂ3 に属するサンプルデータをそれぞれ別々
に分離した状態で判別関数設定装置１２に入力する。そ
して、判別関数設定装置１２では、通し番号の小さい母
集団すなわちＡ2 ，Ｂ2 のサンプルデータに対して判別
分析を行い判別関数Ｚ2 を引く（同図（Ｃ））。する
と、分離終了判定装置１３にて係る判別関数Ｚ2 で両母
集団Ａ2 ，Ｂ2 にて分離されたことが判断されるため、
この母集団にＡ2 ，Ｂ2 に対する判別処理は終了する。Here, a concrete example of the above-mentioned learning method is shown in FIG.
Will be explained. First, it is assumed that two populations A1 and B1 are constituted by given sample data as shown in FIG. Then, the discriminant function setting device 12 subtracts the discriminant function Z1 from the sample data.
Then, since the discriminant function Z1 cannot separate the two populations A1 and B1 from each other, a predetermined vertical function ZT1 orthogonal to the discriminant function Z1 is drawn, and the two populations A1 and B1 are respectively separated by A2 and A3, B2, B
Divide into 3 (Fig. (B)). Then, the sample data belonging to the divided populations A2 and B2 and the sample data belonging to A3 and B3 are input to the discriminant function setting device 12 in a state where they are separately separated. Then, in the discriminant function setting device 12, a discriminant analysis is performed on the sample data of the population having a small serial number, that is, sample data of A2 and B2, and the discriminant function Z2 is drawn ((C) in the same figure). Then, the separation end determination device 13 determines that the two populations A2 and B2 have been separated by the discriminant function Z2.
The discrimination process for A2 and B2 is completed for this population.

【００２０】次いで、通し番号の大きい母集団すなわち
Ａ3 ，Ｂ3 のサンプルデータに対して判別分析を行い判
別関数Ｚ3 を引く。しかし、この判別関数Ｚ3 では両母
集団Ａ3 ，Ｂ3 を分離することができないため、その判
別関数Ｚ3 に直交する所定の垂直関数ＺＴ3 を引き、そ
の垂直関数ＺＴ3 により両母集団Ａ3 ，Ｂ3 をそれぞれ
Ａ4 ，Ａ5 ，Ｂ4 ，Ｂ5 に分割する（同図（Ｄ））。Then, a discriminant analysis is performed on the sample data of a large population, that is, sample data of A3 and B3, and a discriminant function Z3 is drawn. However, since the discriminant function Z3 cannot separate the two populations A3 and B3, a predetermined vertical function ZT3 orthogonal to the discriminant function Z3 is drawn, and the two vertical populations Z3 and A3 are used to divide the two populations A3 and B3, respectively. , A5, B4, B5 (FIG. 7 (D)).

【００２１】次いで、上記と同様にＡ4 ，Ｂ4 のサンプ
ルデータに対して判別分析を行い判別関数Ｚ4 を引く
（同図（Ｅ））。また、Ａ5，Ｂ5 のサンプルデータに
対して判別分析を行い判別関数Ｚ5 を引く（同図
（Ｆ））。これにより２つの母集団Ａk ，Ｂk が完全に
分離され、上記すべての判別関数Ｚ1 ，Ｚ2 ，Ｚ3 ，Ｚ
4，Ｚ5 並びにＴｒｅｅのテーブルデータを判別装置１
５に供給し、処理を終了する。なお、上記した具体例に
おけるテーブルデータ並びにＴｒｅｅ構造は、それぞれ
図４，図５に示すようになっている。なお、母集団Ｂ側
も上記母集団Ａのものと同様であるため記載を省略す
る。Then, similarly to the above, the discriminant analysis is performed on the sample data of A4 and B4, and the discriminant function Z4 is drawn ((E) in the same figure). In addition, the discriminant analysis is performed on the sample data of A5 and B5, and the discriminant function Z5 is drawn ((F) of the same figure). As a result, the two populations Ak and Bk are completely separated, and all the above discriminant functions Z1, Z2, Z3, Z
Discriminating device 1 for table data of 4, Z5 and Tree
5, and the process is terminated. The table data and the Tree structure in the above-described specific example are as shown in FIGS. 4 and 5, respectively. The description of the population B side is omitted because it is similar to that of the population A.

【００２２】次ぎに、上記した方法により形成された判
別関数に基づいて未知の想起データがどの母集団に属す
るかの判別を行う方法について図２のフローチャートに
基づいて説明する。Next, a method of discriminating which population the unknown recall data belongs to based on the discriminant function formed by the above method will be described with reference to the flowchart of FIG.

【００２３】すなわち、想起モードを稼働させ、想起用
データＷを判別装置１５に入力する（Ｓ１１０）。する
と、判別装置１５では、与えられた想起用データＷがど
の母集団に属するかを判別する（Ｓ１１１）。具体的に
は、想起用データＷを最初の判別関数Ｚ1 に垂直な垂直
関数ＺＴ1 に代入し、その値が正または負のいずれにな
るかで（Ａ2 ，Ｂ2 ）か（Ａ3 ，Ｂ3 ）のどちらの母集
団に属するかを判断する。ここで、原点に近い母集団の
通し番号のほうを小さくしているため、ＺＴ1（Ｗ）が
負なら想起データＷは（Ａ2 ，Ｂ2 ）に属し、正なら
（Ａ3 ，Ｂ3 ）に属することになる。That is, the recall mode is activated and the recall data W is input to the discriminator 15 (S110). Then, the discrimination device 15 discriminates which population the given recall data W belongs to (S111). Specifically, the recall data W is substituted into the vertical function ZT1 which is perpendicular to the first discriminant function Z1, and whether the value is positive or negative (A2, B2) or (A3, B3). To determine if they belong to the population. Here, since the serial number of the population near the origin is made smaller, if ZT1 (W) is negative, the recall data W belongs to (A2, B2), and if it is positive, it belongs to (A3, B3). ..

【００２４】そして、想起データＷがＡk かＢk に属す
ることが分かったなら、そのＴａkとＴｂk のテーブル
を参照し、各母集団が分割されているかを調べ、分割さ
れているならそのテーブルから分割される４つの母集団
を調べ、上記と同様に判別関数の垂直関数ＺＴk に代入
し、得られた値の正負でどちら側の集団に属するかを判
断する。すなわち、ＺＴk （Ｗ）が負なら想起データＷ
は（Ａ2s，Ｂ2s）に属し、正なら（Ａ2s+1，Ｂ2s+1）に
属することになる。If it is found that the recall data W belongs to Ak or Bk, the table of Tak and Tbk is referred to, and it is checked whether each population is divided. If divided, the table is divided. The four populations described above are examined and substituted into the vertical function ZTk of the discriminant function in the same manner as above, and it is determined which side of the population the positive or negative of the obtained value belongs to. That is, if ZTk (W) is negative, recall data W
Belongs to (A2s, B2s), and if positive, it belongs to (A2s + 1, B2s + 1).

【００２５】そして、上記操作を繰り返し行い、最終的
に想起データを含んでいる母集団Ａe ，Ｂe とを特定さ
れたなら、最後にそれらどちらの母集団に属するかをＴ
ａeとＴｂe のテーブルに含まれるＦで判断する。そし
て、その分析結果を出力し（Ｓ１１３）想起処理を終了
する。When the populations Ae and Be containing the recall data are finally specified by repeating the above operation, it is finally determined which of the populations Te belongs to.
Judgment is made based on F included in the ae and Tbe tables. Then, the analysis result is output (S113), and the recall process ends.

【００２６】なお上記した第１実施例における装置及び
方法では、判別領域を絞り込む際に、判別関数に垂直な
垂直関数を、対象となるサンプルデータ数を２等分する
ような位置に引くようにしたが、本発明はこれに限ら
ず、例えば、係る垂直関数から各サンプルデータまでの
距離の和が両側で等しくなるようにしたり、距離の二乗
の和が等しくなるようにしたり、母集団の面積が２等分
なるようにしたりする位置に、垂直関数を引くなど種々
の手法をとることができる。In the apparatus and method according to the first embodiment described above, when narrowing down the discriminant region, the vertical function perpendicular to the discriminant function is drawn at a position that divides the target sample data number into two equal parts. However, the present invention is not limited to this, and for example, the sum of distances from the vertical function to each sample data is equal on both sides, the sum of squares of the distances is equal, or the area of the population is equal. It is possible to take various methods such as drawing a vertical function at a position where B is divided into two.

【００２７】図６は本発明に係る判別分析装置の第２実
施例を示している。同図に示すように基本構成は上記第
１実施例で同様であるため、同一符合を付すことにより
その説明を省略する。ここで本例では、上記実施例と相
違して、上記第１判別領域絞り装置１４に替えて、第２
判別領域絞り装置１４ａを設けている。この第２判別領
域絞り装置１４ａは、判別関数Ｚk （Ｘ）でサンプルデ
ータを完全に分離できなかった場合に、求められた判別
関数Ｚk （Ｘ）に平行な２つの関数によって分離できな
い部分を挟み、その挟まれた部分を判別関数Ｚk （Ｘ）
と平行な方向に度数分布を取り、その分布状態からデー
タを分離できる時（度数分布にオーバーラップ部分がな
い時）はそれを利用して分離（判別）し、分離できない
時は、前記判別関数に平行な２つの関数間に挾まれた領
域に属するデータを判別関数設定装置１２に入力するよ
うになっている。さらに、本装置１４ａでは、上記度数
分布により分離できた場合には、その時の分布の境界を
示す仕切り値Ｓを判別装置１５に設定する機能も有して
いる。FIG. 6 shows a second embodiment of the discriminant analysis apparatus according to the present invention. As shown in the figure, the basic structure is the same as that of the first embodiment, and therefore the description thereof is omitted by giving the same reference numeral. Here, in this example, unlike the above-described embodiment, the first determination area diaphragm device 14 is replaced by a second
A discrimination area diaphragm device 14a is provided. When the sample data cannot be completely separated by the discriminant function Zk (X), the second discriminant area diaphragm device 14a sandwiches a portion which cannot be separated by two functions parallel to the obtained discriminant function Zk (X). , The sandwiched portion is the discriminant function Zk (X)
Takes a frequency distribution in the direction parallel to, and when the data can be separated from the distribution state (when there is no overlapping part in the frequency distribution), it is used for separation (determination). Data belonging to a region sandwiched between two functions parallel to is input to the discriminant function setting device 12. Further, the present device 14a also has a function of setting the partition value S indicating the boundary of the distribution at that time to the discriminating device 15 when the frequency distribution can be separated.

【００２８】また、上記判別関数Ｚk （Ｘ）に平行な２
つの関数と判別関数Ｚk （Ｘ）との距離、すなわちマー
ジン量をｄui，ｄoiとする。但し、原点に近い関数側の
マージン量をｄuiとする。さらに、その一方の関数によ
り、母集団ＡｉがＡL とＡL+1 に分離され、他方の関数
により母集団ＢｉがＢL とＢL+1 とに分離される。な
お、分離される母集団に付与される通し番号の付け方は
マージン内すなわち、２つの関数に囲まれた領域内に位
置する母集団の方を大きくする。また、各母集団相互の
Ｔｒｅｅのテーブルは、上記第１実施例と同様にＴａｉ：（Ａｉ，ＡL ，ＡL+1 ），（Ｆ），（ｄui，ｄoi）Ｔｂｉ：（Ｂｉ，ＢL ，ＢL+1 ），（Ｆ），（ｄui，ｄoi）であらわさる。但し、第４項目のＦには、上記第１実施
例の際に用いた「−１，０，１」に加え、仕切り値を有
する時の「２」がある。また、第５項，第６項には、上
記マージン量が入力されるが、判別関数で分離可能、す
なわち、分岐先の母集団がない場合には第５，第６項は
いずれも０となる。In addition, 2 which is parallel to the discriminant function Zk (X)
The distances between the two functions and the discriminant function Zk (X), that is, the margin amounts are dui and doi. However, the function-side margin amount near the origin is dui. Further, one of the functions separates the population Ai into AL and AL + 1, and the other function separates the population Bi into BL and BL + 1. The serial numbers assigned to the separated populations are larger within the margin, that is, within the area surrounded by the two functions. Further, the table of Trees among the respective populations is: Tai: (Ai, AL, AL + 1), (F), (dui, doi) Tbi: (Bi, BL, BL +) as in the first embodiment. 1), (F), (dui, doi) However, in the fourth item F, in addition to "-1, 0, 1" used in the first embodiment, there is "2" when it has a partition value. The margin amounts are input to the fifth and sixth terms, but they can be separated by a discriminant function, that is, if there is no branch destination population, both the fifth and sixth terms are 0. Become.

【００２９】次に、上記した実施例を用いて本発明に係
る判別分析方法の第２実施例について図７に示すフロー
チャートを参照にして説明する。まず、所定の母集団を
分離するための判別関数の設定する手順について説明す
ると、状態設定装置１０によりすべての多変量解析用の
サンプルデータＸijを判別関数設定装置１２に入力し
（Ｓ２０１）、その判別関数設定装置１２では与えられ
たサンプルデータＸijから、２つの母集団Ａk ，Ｂk を
最も効率よく分割する線形の判別関数Ｚk （Ｘ）を求め
る（Ｓ２０２）。次いで、その判別関数Ｚk（Ｘ）を分
離終了判定装置１３に入力し、そこにおいてその得られ
た判別関数Ｚk （Ｘ）に全サンプルデータＸijを入力し
両母集団Ａ，Ｂに完全に分離され、判別可能であるか否
かが判断される（Ｓ２０３）。そして、判別可能である
場合には、すべての判別関数Ｚk （Ｘ）を判別装置１５
に入力し、処理を終了する（Ｓ２０４）。Next, a second embodiment of the discriminant analysis method according to the present invention will be described with reference to the flow chart shown in FIG. First, a procedure of setting a discriminant function for separating a predetermined population will be described. The state setting device 10 inputs all sample data Xij for multivariate analysis to the discriminant function setting device 12 (S201). The discriminant function setting device 12 obtains a linear discriminant function Zk (X) that most efficiently divides the two populations Ak and Bk from the given sample data Xij (S202). Then, the discriminant function Zk (X) is input to the separation end determination device 13, and all the sample data Xij is input to the obtained discriminant function Zk (X), and completely separated into both populations A and B. Then, it is judged whether or not the judgment is possible (S203). If it is possible to determine, all the determination functions Zk (X) are determined by the determination device 15
Is input to end the processing (S204).

【００３０】一方、判別できなかった場合には、判別関
数並びにサンプルデータを第２判別領域絞り装置１４ａ
に送り、判別関数Ｚk （Ｘ）から平行に所定のマージン
を取って、判別できない領域を含むように２本の平行な
関数を引く（Ｓ２０５）。そして、その２本の平行な関
数で囲まれた領域を判別関数Ｚk（Ｘ）と平行な方向に
度数分布を取り、各母集団を分離できるか否かを判断す
る（Ｓ２０６）。そして、分離できたなら、隣接する母
集団を分離する仕切り値Ｓ（平行な関数上の値）を求め
（Ｓ２０７）、その仕切り値Ｓを判別装置１５に設定し
（Ｓ２０４）、処理を終了する。On the other hand, when the discrimination cannot be made, the discriminant function and the sample data are set to the second discriminant area diaphragm device 14a.
Then, a predetermined margin is taken in parallel from the discriminant function Zk (X), and two parallel functions are drawn so as to include a region that cannot be discriminated (S205). Then, the region surrounded by the two parallel functions is subjected to frequency distribution in the direction parallel to the discriminant function Zk (X), and it is determined whether or not each population can be separated (S206). Then, if it is possible to separate, a partition value S (value on a parallel function) that separates adjacent populations is obtained (S207), the partition value S is set in the discrimination device 15 (S204), and the process is ended. ..

【００３１】また、分離できなかった場合には、上記平
行な２つの関数で囲まれた領域内のデータを判別関数設
定装置１２に設定し（Ｓ２０８）、ステップ２０２に戻
り、係るデータに対して判別関数を設定し、以下、母集
団を分離できるまで上記処理を繰り返し行う。If the data cannot be separated, the data in the area surrounded by the two parallel functions is set in the discriminant function setting device 12 (S208), the process returns to step 202, and the data is extracted. After setting the discriminant function, the above process is repeated until the population can be separated.

【００３２】ここで、上記した学習方法の具体例を図８
を用いて説明する。まず、同図（Ａ）に示すように与え
られたサンプルデータにより２つの母集団Ａ1，Ｂ1 が
構成されているとする。すると、係るサンプルデータに
対して判別関数設定装置１２にて判別関数Ｚ1 を引く。
するとこの判別関数Ｚ1 では両母集団Ａ1 ，Ｂ1 を分離
することができないため、判別関数Ｚ1 に平行で所定の
マージンｄo1，ｄu1だけ隔たれた２本の関数を引く。そ
して、両関数にて両母集団Ａ1 ，Ｂ1 がそれぞれＡ2 ，
Ａ3 ，Ｂ2 ，Ｂ3 に分割される（同図（Ｂ））。これに
より、この判別関数Ｚ1 により、母集団Ａ2 ，とＢ2 と
が分離される。Here, a concrete example of the above-mentioned learning method is shown in FIG.
Will be explained. First, it is assumed that two populations A1 and B1 are constituted by given sample data as shown in FIG. Then, the discriminant function setting device 12 subtracts the discriminant function Z1 from the sample data.
Then, the discriminant function Z1 cannot separate the two populations A1 and B1. Therefore, two functions parallel to the discriminant function Z1 and separated by predetermined margins do1 and du1 are drawn. Then, in both functions, both populations A1 and B1 are A2 and
It is divided into A3, B2, and B3 (FIG. 7B). As a result, the discriminant function Z1 separates the populations A2 and B2.

【００３３】一方、分離できなかった、両関数で囲まれ
た領域（図中ハッチングで示す）に存在する母集団Ａ3
，Ｂ3 を判別関数Ｚ1 と平行な方向に度数分布を取る
（同図（Ｃ））。本例では、母集団Ａ3 ，Ｂ3の一部が
オーバーラップしているため母集団を分離することがで
きない。よって、その母集団Ａ3 ，Ｂ3 に属するサンプ
ルデータを取り出し、それに対して判別関数Ｚ3 を引
き、上記と同様に判別関数Ｚ3 に平行で所定のマージン
ｄo3，ｄu3だけ隔たれた２本の関数を引く（同図
（Ｄ））。そして、両関数にて両母集団Ａ3 ，Ｂ3 がそ
れぞれＡ4 ，Ａ5 ，Ｂ4 ，Ｂ5 に分割される（同図
（Ｅ））。これにより、この判別関数Ｚ3 によって、２
つの母集団Ａ4 ，とＢ4 とが分離される。そして、分離
できなかった、両関数で囲まれた領域（図中ハッチング
で示す）に存在する母集団Ａ5 ，Ｂ5 に対し判別関数Ｚ
3 と平行な方向に度数分布を取る（同図（Ｆ））。する
と、同図から明らかなように、両母集団Ａ5 とＢ5 の分
布にオーバーラップ部分がなく、両母集団Ａ5 ，Ｂ5
は、仕切り値Ｓ1 ，Ｓ2 で分離される。すなわち、判別
関数Ｚ1 にて母集団Ａ2 とＢ2 とが分離され、判別関数
Ｚ3 にて母集団Ａ4 とＢ4 とが分離され、度数分布にと
もなう仕切り値Ｓ1 ，Ｓ2 にて母集団Ａ5 とＢ5 とが分
離され、完全に分離作業が終了する。On the other hand, the population A3 which cannot be separated and which exists in the area surrounded by both functions (shown by hatching in the figure)
, B3 has a frequency distribution in the direction parallel to the discriminant function Z1 ((C) in the figure). In this example, since the populations A3 and B3 are partially overlapped, the populations cannot be separated. Therefore, the sample data belonging to the populations A3 and B3 are taken out, the discriminant function Z3 is subtracted therefrom, and two functions parallel to the discriminant function Z3 and separated by predetermined margins do3 and du3 are drawn in the same manner as described above ( The same figure (D)). Then, both populations A3 and B3 are divided into A4, A5, B4, and B5 by both functions ((E) in the same figure). Therefore, this discriminant function Z3 is 2
Two populations A4 and B4 are separated. Then, the discriminant function Z is applied to the populations A5 and B5 existing in the area (indicated by hatching in the figure) surrounded by both functions that cannot be separated.
The frequency distribution is taken in the direction parallel to 3 (Fig. (F)). Then, as is clear from the figure, the distributions of the two populations A5 and B5 have no overlapping portions, and the two populations A5 and B5 have no overlap.
Are separated by partition values S1 and S2. That is, the discriminant function Z1 separates the populations A2 and B2, the discriminant function Z3 separates the populations A4 and B4, and the partition values S1 and S2 associated with the frequency distribution form the populations A5 and B5. It is separated and the separation work is completed.

【００３４】そして、上記すべての判別関数Ｚ1 ，Ｚ3
，すべての仕切り値Ｓ1 ，Ｓ2 並びにＴｒｅｅのテー
ブルデータを判別装置１５に供給し、処理を終了する。
なお、上記した具体例におけるテーブルデータ並びにＴ
ｒｅｅ構造は、それぞれ図９，図１０に示すようになっ
ている。なお、母集団Ｂ側も上記母集団Ａのものと同様
であるため記載を省略する。Then, all of the above discriminant functions Z1 and Z3
, All the partition values S1, S2 and the table data of Tree are supplied to the discriminator 15, and the process is terminated.
The table data and T in the above-mentioned specific example
The ree structure is as shown in FIGS. 9 and 10, respectively. The description of the population B side is omitted because it is similar to that of the population A.

【００３５】次ぎに、上記した方法により形成された判
別関数に基づいて未知の想起データがどの母集団に属す
るかの判別を行う方法について図７のフローチャートに
基づいて説明する。Next, a method for determining which population the unknown recall data belongs to based on the discriminant function formed by the above method will be described with reference to the flowchart of FIG.

【００３６】すなわち、想起モードを稼働させ、想起用
データＷを判別装置１５に入力する（Ｓ２１０）。する
と、判別装置１５では、与えられた想起用データＷがど
の母集団に属するかを判別する（Ｓ２１１）。具体的に
は、まずＴａ1，Ｔｂ1 のテーブルのフラグＦを見に行
き、その値が１か−１ならば、１本の判別関数Ｚ1 で母
集団が分離されることを意味するため、想起用データＷ
を最初の判別関数Ｚ1（Ｗ）に代入し、その値が正また
は負のいずれになるかでＡ1 とＢ1 のどちらの母集団に
属するかを判断し、想起処理を終了する。ここで、Ｚ1
（Ｗ）が正ならＦ＝１となっている母集団に属し、負な
ら想起データＷはＦ＝−１となっている母集団に属する
ことになる。That is, the recall mode is activated and the recall data W is input to the discriminator 15 (S210). Then, the discrimination device 15 discriminates which population the given recall data W belongs to (S211). Specifically, first look at the flag F in the table of Ta1 and Tb1, and if the value is 1 or -1, it means that the population is separated by one discriminant function Z1. Data W
Is assigned to the first discriminant function Z1 (W), it is determined whether the population is A1 or B1 depending on whether the value is positive or negative, and the recall process ends. Where Z1
If (W) is positive, it belongs to the population of F = 1, and if it is negative, the recall data W belongs to the population of F = -1.

【００３７】一方、Ｔａ1 ，Ｔｂ2 が１または−１でな
いときには（係るケースが一般的であるが）、想起用デ
ータＷを最初の判別関数Ｚ1 （Ｗ）に代入し、得られた
値とマージンｄu1，ｄo1との大小で（Ａ2 ，Ｂ2 ）か
（Ａ3 ，Ｂ3 ）のどちらの母集団の対に属するかを判断
する。ここで、母集団に付与される通し番号は平行な２
つの関数に囲まれている領域側を大きくしているため、Ｚ1 （Ｘ）＜ｄu1 ｏｒｄo1＜Ｚ1 （Ｘ）なら、想起データＷは、前記領域外に位置、すなわちこ
の判別関数Ｚ1 で判別できるＡ2 またはＢ2 に位置する
ことを意味しているため、上記と同様に得られた値とＴ
ａｉ，ＴｂｉのテーブルのフラグＦとから、Ａ2 とＢ2
のどちらの母集団に属するかを判断し、想起処理を終了
する。また、Ｚ1 （Ｘ）＞ｄu1 and ｄo1＞Ｚ1 （Ｘ）なら、想起データＷは、前記領域内に位置、すなわち、
Ａ3 またはＢ3に位置することがわかる。On the other hand, when Ta1 and Tb2 are not 1 or -1 (this case is common), the recall data W is substituted into the first discriminant function Z1 (W), and the obtained value and margin du1 , Do1 and (A2, B2) or (A3, B3) to determine which pair of populations they belong to. Here, the serial numbers assigned to the population are parallel 2
Since the area surrounded by the two functions is enlarged, if Z1 (X) <du1 or do1 <Z1 (X), the recall data W can be located outside the area, that is, by this discriminant function Z1. Since it means that it is located at A2 or B2, the value obtained as above and T
From the flag F of the ai and Tbi table, A2 and B2
It is determined which of the two populations belong to, and the recall process is terminated. If Z1 (X)> du1 and do1> Z1 (X), the recall data W is located in the area, that is,
It can be seen that it is located at A3 or B3.

【００３８】そして、想起データＷがＡ3 またはＢ3
（以下、一般データとして、付与する数値をk （Ａk か
Ｂk ）として説明する）に属することがわかったなら、
そのＴａk とＴｂk のテーブルを参照し、フラグＦが２
であるか否かを判断し、２ならば仕切り値による分離が
可能であるため、想起データＷを係る判別関数の平行方
向に投射し、仕切り値Ｓm からどの母集団に含まれるか
を判断し、想起処理を終了する。Then, the recall data W is A3 or B3.
(Hereinafter, as general data, the numerical value to be given will be described as k (Ak or Bk))
The flag F is set to 2 by referring to the table of Tak and Tbk.
If it is 2, it is possible to separate by the partition value. Therefore, the recall data W is projected in the parallel direction of the discriminant function, and it is determined which population is included from the partition value Sm. , The recall process ends.

【００３９】一方、仕切り値からの分離ができない場合
には、次ぎの判別関数Ｚk （Ｗ）に想起データを代入
し、得られた値並びにマージンｄul，ｄol ，Ｂk から
分岐された４つの母集団（Ａ2s，Ｂ2s）或いは（Ａ2s+
1，Ｂ2s+1）のどこに属するかを判断する。具体的な判
断手法は上述の通りである。このようにし、判別関数或
いは仕切り値にて想起データＷを含んでいる母集団Ａｅ
或いはＢｅが求められる。そして、得られた分析結果を
出力し、想起処理が終了する（Ｓ２１２）。On the other hand, when it is not possible to separate from the partition value, the recall data is substituted into the next discriminant function Zk (W), and the obtained values and the four populations branched from the margins dul, dol and Bk. (A2s, B2s) or (A2s +
1, B2s + 1) to determine where it belongs. The specific determination method is as described above. In this way, the population Ae including the recall data W in the discriminant function or the partition value
Alternatively, Be is required. Then, the obtained analysis result is output, and the recall process ends (S212).

【００４０】図１１は、本発明の判別分析装置の第３実
施例を示している。同図に示すように、基本構成は上記
した第１実施例と同様のものを用いており（同一符号で
示す）、その特徴は、第１実施例における分離終了判定
装置１３と、第１判別領域絞り装置１４との間に第３判
別領域絞り装置１６を挿入配置している。この第３判別
領域絞り装置１６は、第２実施例における第２判別領域
絞り装置１４ａと類似する機能を有している。すなわ
ち、求められた判別関数に平行な２つの関数によって分
離できない部分を挟み、その挟まれた部分に対し判別関
数と平行な方向に度数分布を取り、その分布状態からデ
ータを分離できる時にはその仕切り値を求め、それを判
別装置１５に設定する点では第２判別領域絞り装置１４
ａと同様である。但し、この第３判別領域絞り装置１６
では、分離できないときに出力するデータが、入力され
た判別関数をそのまま出力し、しかもそれを第１判別領
域絞り装置１４に供給するようになっている。FIG. 11 shows a third embodiment of the discriminant analysis apparatus of the present invention. As shown in the figure, the basic configuration is the same as that of the first embodiment described above (denoted by the same reference numeral), and its features are the separation end determination device 13 in the first embodiment and the first determination. A third discrimination area limiting device 16 is inserted and arranged between the area limiting device 14 and the area determining device 14. The third discrimination area diaphragm device 16 has a function similar to that of the second discrimination area diaphragm device 14a in the second embodiment. That is, a part that cannot be separated is sandwiched by two functions that are parallel to the obtained discriminant function, a frequency distribution is taken in the direction parallel to the discriminant function for the sandwiched part, and when the data can be separated from the distribution state, the partition is created. In terms of obtaining a value and setting it in the discriminator 15, the second discriminant area diaphragm device 14
The same as a. However, this third discrimination area diaphragm device 16
In the case of data that cannot be separated, the input discriminant function is output as it is, and it is supplied to the first discriminant area diaphragm device 14.

【００４１】そして、係る装置を用いて行う判別分析方
法は、上記第１，第２実施例の方法を適宜組み合わせて
実行するもので、第１実施例を実行し、その最終工程に
第２実施例の度数分布を用いた判別方法を組み込むよう
にしている。これにより、使用する判別関数の数を少な
くすることができ、システムの簡略化が図れる。The discriminant analysis method performed by using such an apparatus is a combination of the methods of the first and second embodiments described above. The first embodiment is executed and the second step is executed in the final step. A discrimination method using an example frequency distribution is incorporated. As a result, the number of discriminant functions used can be reduced, and the system can be simplified.

【００４２】また、具体的な方法について説明すると、
上記各実施例と同様に与えられたサンプルデータより判
別関数を求め、分離終了判定装置１３にてその判別関数
によって分離が終了したか否かが判断される。そして、
分離が終了していない場合には、第３判別領域絞り装置
１６に判別関数が入力され、そこにおいて判別関数に平
行な関数が引かれると共に度数分布を求め、その分布に
より分離か可能か否かが判断され、可能な場合には仕切
り値Ｓを求め、それを判別装置１５に設定する。A specific method will be described below.
A discriminant function is obtained from the given sample data as in each of the above-described embodiments, and the separation end determination device 13 determines whether or not the separation is completed by the discriminant function. And
If the separation has not been completed, a discriminant function is input to the third discriminant region diaphragm device 16, a function parallel to the discriminant function is drawn therein, and a frequency distribution is obtained. Is determined, and if possible, the partition value S is determined and set in the determination device 15.

【００４３】一方、上記度数分布では分離できない場合
には、上記第１実施例と同様に判別関数を第１判別領域
絞り装置１４に代入し、係る判別関数に垂直な所定の垂
直関数を引くと共に、その垂直関数により領域（サンプ
ルデータ）を２分しそれを判別関数設定装置１２に供給
し、それぞれに対して判別関数を行う。以後、上記操作
を繰り返し行うことにより、求められるすべての判別関
数並びに必要に応じて仕切り値を判別装置１５に設定
し、想起処理を終了する。On the other hand, if the frequency distribution cannot be separated, the discriminant function is substituted into the first discriminant region diaphragm device 14 as in the first embodiment, and a predetermined vertical function perpendicular to the discriminant function is drawn. , The area (sample data) is divided into two parts by the vertical function, and the divided area is supplied to the discriminant function setting device 12, and the discriminant function is performed for each. After that, by repeating the above-mentioned operation, all the required discriminant functions and partition values as necessary are set in the discriminator 15, and the recall process is terminated.

【００４４】[0044]

【発明の効果】以上のように、本発明に係る判別分析方
法及び装置では、たとえ分離する母集団の境界形状が非
線形であったとしても、線形の判別関数でもって分離す
ることができる。よって、ニューラルネット等に比べ、
学習時間が短く、しかも、計算結果の解析が容易とな
る。そして、未知の想起データが入力された場合に、得
られた判別関数並びに必要に応じて仕切り値に基づいて
母集団の特定が容易に行える。As described above, in the discriminant analysis method and apparatus according to the present invention, even if the boundary shapes of the populations to be separated are non-linear, they can be separated by a linear discriminant function. Therefore, compared to neural networks,
The learning time is short and the analysis of the calculation result is easy. Then, when unknown recall data is input, the population can be easily specified based on the obtained discriminant function and, if necessary, the partition value.

[Brief description of drawings]

【図１】本発明に係る判別分析装置の第１実施例を示す
ブロック構成図である。FIG. 1 is a block configuration diagram showing a first embodiment of a discriminant analysis apparatus according to the present invention.

【図２】本発明に係る判別分析方法の第１実施例を示す
フローチャートである。FIG. 2 is a flowchart showing a first embodiment of the discriminant analysis method according to the present invention.

【図３】本発明の処理工程の一例を示す図である。FIG. 3 is a diagram showing an example of processing steps of the present invention.

【図４】母集団Ａについてのテーブルの構成を示す図で
ある。4 is a diagram showing a configuration of a table for a population A. FIG.

【図５】母集団ＡについてのＴｒｅｅ構造を示す図であ
る。5 is a diagram showing a Tree structure for a population A. FIG.

【図６】本発明に係る判別分析装置の第２実施例を示す
ブロック構成図である。FIG. 6 is a block configuration diagram showing a second embodiment of the discriminant analysis apparatus according to the present invention.

【図７】本発明に係る判別分析方法の第２実施例を示す
フローチャートである。FIG. 7 is a flowchart showing a second embodiment of the discriminant analysis method according to the present invention.

【図８】本発明の処理工程の一例を示す図である。FIG. 8 is a diagram showing an example of processing steps of the present invention.

【図９】母集団Ａについてのテーブルの構成を示す図で
ある。9 is a diagram showing a configuration of a table for a population A. FIG.

【図１０】母集団ＡについてのＴｒｅｅ構造を示す図で
ある。10 is a diagram showing a Tree structure for a population A. FIG.

【図１１】本発明に係る判別分析装置の第３実施例を示
すブロック構成図である。FIG. 11 is a block diagram showing a third embodiment of the discriminant analysis apparatus according to the present invention.

[Explanation of symbols]

１１状態設定装置１２判別関数設定装置１３分離終了判定装置１４第１判別領域絞り装置１４ａ第２判別領域絞り装置１５判別装置１６第３判別領域絞り装置 11 State Setting Device 12 Discriminant Function Setting Device 13 Separation End Determination Device 14 First Discrimination Area Diaphragm Device 14a Second Discrimination Area Diaphragm Device 15 Discrimination Device 16 Third Discrimination Area Diaphragm Device

───────────────────────────────────────────────────── フロントページの続き (72)発明者堤康弘京都府京都市右京区花園土堂町10番地オムロン株式会社内 (72)発明者入江篤京都府京都市右京区花園土堂町10番地オムロン株式会社内 ─────────────────────────────────────────────────── ─── Continuation of the front page (72) Inventor Yasuhiro Tsutsumi 10 Hanazono Dodo-cho, Ukyo-ku, Kyoto City, Kyoto Prefecture Omron Co., Ltd. Within the corporation

Claims

[Claims]

1. A discriminant function is obtained by performing discriminant analysis on a plurality of populations formed from sample data, and when the discriminant function cannot separate the populations, a predetermined vertical function perpendicular to the discriminant function. And a discriminant analysis method is obtained for each sample data located in a region separated by the vertical function.

2. A discriminant function setting device that sets a linear discriminant function that stochastically minimizes misjudgment so as to separate into a plurality of populations based on input sample data.
A separation end determining device that determines whether or not the sample data is completely separated by the discriminant function obtained by the discriminant function setting device, and a discriminant function that is obtained when the sample data is not completely separated. A discriminant analysis device, comprising: a first discriminant region diaphragm device that draws a vertical vertical function and separates sample data of portions divided by the vertical function into the discriminant function setting device.

3. A discriminant function is obtained by performing discriminant analysis on a plurality of populations formed from sample data, and when the populations cannot be separated by the discriminant function, the discriminant function is sandwiched by the non-separable portions. And two functions parallel to each other are set, a frequency distribution is taken in a direction parallel to the discriminant function with respect to a region surrounded by both functions, and it is determined whether or not the data can be separated from the distribution state, and the data can be separated. In this case, a partition value for dividing the distribution is obtained, and when the partition values cannot be separated, a discriminant analysis method is performed on the sample data located in the area.

4. A discriminant function setting device that sets a linear discriminant function that stochastically minimizes misjudgment so as to separate into a plurality of populations based on input sample data.
A discriminant termination determination device that determines whether or not the sample data is completely separated by the discriminant function obtained by the discriminant function setting device, and a part that cannot be separated by two functions that are parallel to the determined discriminant function are sandwiched, A frequency distribution is taken in the sandwiched part in a direction parallel to the discriminant function, and when the data can be separated from the distribution state, the partition value is set, and when the data cannot be separated, the region sandwiched between the two functions is set. A discriminant analysis device, comprising: a second discriminant region diaphragm device for inputting belonging sample data to the discriminant function setting device.

5. When a population function cannot be separated by the obtained discriminant function at a predetermined timing during the execution of the discriminant analysis method according to claim 1, the non-separable portion is sandwiched so as to be parallel to the discriminant function. Two functions are set, a frequency distribution is taken in a direction parallel to the discriminant function for the area surrounded by both functions, and it is judged whether or not the data can be separated from the distribution state. A discriminant analysis method, wherein a value is obtained, and when the values cannot be separated, the method according to claim 1 is continued.

6. A discriminant function setting device that sets a linear discriminant function that stochastically minimizes erroneous discrimination so as to separate into a plurality of populations based on input sample data.
A discriminant termination determination device that determines whether or not the sample data is completely separated by the discriminant function obtained by the discriminant function setting device, and a part that cannot be separated by two functions that are parallel to the determined discriminant function are sandwiched, A third discriminant region diaphragm device that takes a frequency distribution in the sandwiched portion in a direction parallel to the discriminant function and sets the partition value when data can be separated from the distribution state, and the third discriminant region diaphragm device. When the partition value cannot be obtained, a vertical function perpendicular to the discriminant function is drawn, and sample data of portions divided by the vertical function are separated and input to the discriminant function setting device. And a discriminant analysis device.

7. When unknown recall data is input,
A discriminant analysis method comprising discriminating which one of the plurality of populations belongs using a discriminant function set by the method of claim 1.

8. When unknown recall data is input,
A discriminant analysis method, comprising discriminating which one of the plurality of populations belongs using the discriminant function and the partition value set by the method according to claim 3 or 5.