JP3096353B2

JP3096353B2 - How to classify data

Info

Publication number: JP3096353B2
Application number: JP04130083A
Authority: JP
Inventors: 正人戸上
Original assignee: 株式会社戸上電機製作所
Priority date: 1992-04-22
Filing date: 1992-04-22
Publication date: 2000-10-10
Anticipated expiration: 2015-10-10
Also published as: JPH0675985A

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【産業上の利用分野】本発明は、データが属性とその値
の対の集合で与えられている事例または計算結果がある
場合に、データをいくつかのカテゴリー（クラス）に分
類する方法において、特に属性に分布があり、その分布
に重なりがある場合の帰納的機械学習方法に関し、特に
パターン認識、事故診断に有用なデータの分類方法に関
する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a method for classifying data into several categories (classes) when there are cases or calculation results in which the data is given as a set of attribute-value pairs. In particular, the present invention relates to a recursive machine learning method in the case where attributes have a distribution and the distributions overlap, and particularly to a data classification method useful for pattern recognition and accident diagnosis.

【０００２】[0002]

【従来の技術】帰納的機械学習方法は、従来属性の値に
分布を持たず、離散的な属性値により識別木を作成して
いた。例えば、従来は表１のような車の種類についての
２０代の人の好みを、好むならばＰと分類し、好まない
ならばＮと分類したデータがある場合、図１のような識
別木を与えてデータのクラス分類をしていた。2. Description of the Related Art In the recursive machine learning method, conventionally, an attribute tree has no distribution, and a discrimination tree is created by discrete attribute values. For example, conventionally, if there is data that classifies the tastes of people in their twenties with respect to the type of car as shown in Table 1 as P if they like it, and classifies as N if they do not like, an identification tree as shown in FIG. To classify the data.

【０００３】[0003]

【表１】 [Table 1]

【０００４】しかしながら、表１の属性は離散的な値、
例えばオートマチックの有無などを考えているが、通常
のデータは属性が連続値を取っている場合、または離散
値であるが属性の分布を持っている場合と考えてよい場
合がある。表１では色については明度、彩度、色相の３
つの属性があり、例えば一般に青色といっても、明度、
彩度、色相にある一定の分布内で青色と認識される。す
なわち属性値が連続値を取っている場合である。また、
排気量を考えても、表１では２０００ｃｃクラス、１６
００ｃｃクラス、２８００ｃｃクラスというように離散
値を考えているが、実際には１５００ｃｃ，１６００ｃ
ｃを１６００ｃｃクラス、１８００ｃｃ，２０００ｃ
ｃ，２２００ｃｃを２０００ｃｃクラス、２５００ｃ
ｃ，２８００ｃｃ，３０００ｃｃを２８００ｃｃクラス
と考えて、離散値に分布があるにも拘わらず、代表的な
離散値を取っている場合もある。この場合、１５００ｃ
ｃ以上１８００ｃｃ未満を一つの属性値分布と考え、同
様に１８００ｃｃ以上２５００ｃｃ未満、２５００ｃｃ
以上３０００ｃｃ未満を一つの属性値分布と考えてもよ
い。[0004] However, the attributes in Table 1 are discrete values,
For example, whether or not automatic data is present is considered, but there are cases in which normal data may be considered to be a case where attributes have continuous values or a case where discrete values are present but have attribute distributions. In Table 1, the colors are 3 for brightness, saturation, and hue.
There are two attributes, for example, generally speaking blue, lightness,
Blue is recognized within a certain distribution of saturation and hue. That is, this is a case where the attribute value is a continuous value. Also,
Considering the displacement, Table 1 shows 2000cc class, 16
Though discrete values such as 00cc class and 2800cc class are considered, 1500cc and 1600c are actually used.
c in 1600cc class, 1800cc, 2000c
c, 2200cc to 2000cc class, 2500c
Assuming that c, 2800 cc, and 3000 cc are in the 2800 cc class, there may be cases where representative discrete values are taken despite the distribution of discrete values. In this case, 1500c
It is considered that one attribute value distribution is c or more and less than 1800 cc, and similarly, it is 1800 cc or more and less than 2500 cc
More than 3000 cc may be considered as one attribute value distribution.

【０００５】また、実際のデータを集計または計算して
みると、データ自体は離散値をとるにも拘わらず、統計
的処理して属性値の分布を考えた方がよい。[0005] In addition, when tabulating or calculating actual data, it is better to consider the distribution of attribute values through statistical processing even though the data itself takes discrete values.

【０００６】また、実際パターン認識及び事故診断にお
ける属性値、すなわち実際には測定値およびセンサ値
は、通常ノイズや種々のパラメータによる属性値の変化
により属性値に幅を持つ。In addition, attribute values in actual pattern recognition and accident diagnosis, that is, actually measured values and sensor values have a range of attribute values usually due to changes in attribute values due to noise or various parameters.

【０００７】このように属性値に幅を持つ場合、あるい
は属性値の分布が明確に得られる場合は、先に特願平３
−２６９４１１号において提案した方法により、識別木
学習が可能となった。[0007] In the case where the attribute value has a width or the distribution of the attribute value can be clearly obtained, the Japanese Patent Application No.
The method proposed in -269411 makes it possible to perform discrimination tree learning.

【０００８】[0008]

【発明が解決しようとする課題】ところで、先に提案し
た特願平３−２６９４１１号において提案した方法で
は、属性値が幅を持つ場合においても、カテゴリー（実
際には事故種別）とカテゴリーの持つ属性値の分布が完
全に分離している属性があった場合のみ、識別が可能で
あった（図２の（ｉ）参照）。According to the method proposed in Japanese Patent Application No. 3-269411, the category (actually the type of accident) and the category (actually, the type of accident) exist even when the attribute value has a width. Identification was possible only when there was an attribute whose attribute value distribution was completely separated (see (i) of FIG. 2).

【０００９】エキスパートシステムは、アルゴリズムが
はっきりしない悪構造問題に対して有用であり、診断に
適用されてきた。しかしエキスパートシステムでは、知
識を人間が獲得することが前提となっている。また一度
知識獲得が終わったかのように見えても、システムの信
頼度向上の要求のため、その診断知識を増加したり、修
正したりする必要がある。しかし、修正に際してはどの
ような知識を加えたらよいかの決定は難しい。既存の知
識との整合性の維持ならびに知識の検証も難しい。した
がってデータの分類において知識の獲得、修正、増加さ
らには、その知識との整合性の維持ならびに検証は、膨
大な人的労力と開発費を必要とする。Expert systems are useful for badly structured problems where the algorithm is unclear, and have been applied to diagnostics. However, expert systems assume that humans acquire knowledge. Even if it seems that knowledge acquisition has been completed, it is necessary to increase or correct the diagnostic knowledge in order to improve the reliability of the system. However, it is difficult to decide what kind of knowledge to add when making corrections. It is also difficult to maintain consistency with existing knowledge and verify knowledge. Therefore, acquiring, modifying, and increasing the knowledge in classifying data, and maintaining and verifying the consistency with the knowledge, require enormous human labor and development costs.

【００１０】エキスパートシステムではｉｆ−ｔｈｅｎ
ルールが用いられるため診断のための計算時間がかかる
ことも問題点である。In the expert system, if-then
Another problem is that it takes a long calculation time for diagnosis because rules are used.

【００１１】それに対し、識別木による機械学習では、
人間の主観が入らない診断を自動的かつ効率的に作成す
ることが可能になる。また、新しい属性値を使うことに
より、新しい識別木を機械学習により開発することが期
待される。On the other hand, in the machine learning using the identification tree,
It is possible to automatically and efficiently create a diagnosis that does not involve human subjectivity. It is expected that a new identification tree will be developed by machine learning by using new attribute values.

【００１２】最近、ニューラルネットワークを用いたデ
ータの分類方法が研究されているが、データの分類論理
が不透明であり、あいまいな結果が得られた場合もあ
り、結果の正確度についての推定は困難である。中間層
を増やせばデータの分類結果の正確度は向上するが、学
習時間が著しく増大するため、データの分類に使えるネ
ットの切り換えを容易化できない。ニューラルネットワ
ークでは目標概念の一部を構成しない属性を与えた場
合、それが概念とは無関係の属性であることを知ること
はできない。Recently, a method of classifying data using a neural network has been studied. However, the classification logic of data is opaque, and in some cases, ambiguous results have been obtained, and it is difficult to estimate the accuracy of the results. It is. Increasing the number of intermediate layers improves the accuracy of the data classification result, but significantly increases the learning time, so that it is not easy to switch the nets that can be used for data classification. In the neural network, when an attribute that does not form a part of the target concept is given, it cannot be known that the attribute is irrelevant to the concept.

【００１３】それに対し識別木による機械学習では、識
別木が複雑になればデータの分類結果の根拠についての
説明が理解しにくくはなるが、診断アルゴリズムからデ
ータの分類結果の根拠や不要な属性について知ることが
できる。また診断できないときにはその原因を推定する
ことが可能である。On the other hand, in the machine learning using the identification tree, if the identification tree becomes complicated, it becomes difficult to understand the grounds of the data classification result. You can know. If the diagnosis cannot be made, the cause can be estimated.

【００１４】本発明では、カテゴリーとカテゴリーのも
つ属性値の分布が完全に分離していなくても、属性値の
重なりのない部分を分類する方法を提案する。例えば図
３の属性値ａならびにｂはカテゴリーＣ_iと分類できる
し、属性値ｃならびにｄはカテゴリーＣ_jと分類でき
る。重なりのある部分、すなわちそれらのカテゴリーが
分類できない部分についても、属性の確率分布を求める
ことにより、重なりのある部分の出現確率を求め、カテ
ゴリーを推定する。例えば図４（１）の確率分布で斜線
部に示す部分の面積を求めることにより出現確率を算出
し、また重なりのある部分において、任意の属性値が得
られた場合の出現確率を求めることができる。図４の確
率分布で属性値ｅが得られた場合の出現確率を実線で示
す。The present invention proposes a method of classifying non-overlapping portions of attribute values even if the distribution of the category and the attribute values of the category is not completely separated. For example, the attribute values a and b in FIG. 3 can be classified as category C _i, and the attribute values c and d can be classified as category C _j . Even for overlapping parts, that is, parts in which the categories cannot be classified, the probability distribution of the attributes is obtained to determine the occurrence probability of the overlapping parts, and the category is estimated. For example, it is possible to calculate the appearance probability by calculating the area of the portion indicated by the diagonal line in the probability distribution of FIG. 4A, and to calculate the occurrence probability when an arbitrary attribute value is obtained in the overlapping portion. it can. The appearance probability when the attribute value e is obtained from the probability distribution of FIG. 4 is indicated by a solid line.

【００１５】そこで本発明が解決すべき課題は、上記の
ように属性値の幅が重なる場合も、可能な限り分類する
ことにある。分類できない場合は、分類できないカテゴ
リーとカテゴリーの出現頻度を出し、どのような状況で
分類できないかということを提示することにある。Therefore, the problem to be solved by the present invention is to classify as much as possible even when the widths of the attribute values overlap as described above. When classification is not possible, the category and the frequency of appearance of the category that cannot be classified are given, and the situation in which classification is not possible is presented.

【００１６】また、前記の先の出願で提案した方法で
は、分類する属性の組を見つけ、またその中の最も適切
な属性を属性の組の中から配置する際においても、その
評価関数が適切でないため、その効果が充分でなかっ
た。本発明はそのような問題も解決しようとするもので
ある。Further, in the method proposed in the above-mentioned prior application, when finding a set of attributes to be classified and arranging the most appropriate attribute among the set of attributes, the evaluation function is not suitable. Therefore, the effect was not sufficient. The present invention seeks to solve such a problem.

【００１７】[0017]

【課題を解決するための手段】これらの課題を解決する
ため、本発明のデータの分類方法は、（ａ）データを分類するカテゴリーＣ ₁ 〜Ｃ _i 〜Ｃ _m を設
定し、それぞれのカテゴリーが持つ属性Ｔ ₁ 〜Ｔ _j 〜Ｔ _n
毎に測定データを集計するかあるいは計算によりシミュ
レートし、その結果を、対応するカテゴリーに区分けし
て各属性毎に上限値と下限値で表された分布を表す属性
値分布テーブルとして、記憶装置に格納するステップ
と、（ｂ）前記記憶装置に格納された属性値テーブルを参照
して、各属性毎に、あるカテゴリーＣ _i と他のカテゴリ
ーＣ _j との属性値の分布の重なりの状態を分析して、少
なくとも１つの属性についての属性値の分布の状態が、
カテゴリーＣ _i の属性値の分布からカテゴリーＣ _j の属性
値の分布を完全に識別できる状態（ｉ）か、カテゴリー
Ｃ _i の属性値の分布がカテゴリーＣ _j の属性値の分布と一
部分重なりのある状態（ｉｉ）か、またはカテゴリーＣ
_j の属性値の分布がＣ _i の属性値の分布に包含される状態
（ｉｉｉ）のいずれの状態に属するかを判別する処理を
行うステップと、（ｃ）各属性が前記状態（ｉ）にあるときは１、その他
のときは０という係数を定義すると共に、各属性を論理
変数として、前記カテゴリーＣ _i と他のカテゴリーＣ _j と
を分類可能とする属性の集合を、前記係数をそれぞれ乗
じた属性の論理変数の論理和の論理式に表現し、前記論
理式を属性の集合として選択する処理を行うステップ
と、（ｄ）前記カテゴリーＣ _i とＣ _j が状態（ｉ）の組合せに
おいて、カテゴリーＣ _i と他のすべてのカテゴリーとを
分類可能とする属性の組の集合を、ステップ（ｃ）で求
めた論理式の集合の論理積で求める処理を行うステップ
と、（ｅ）前記カテゴリーＣ _i とＣ _j が状態（ｉ）の組合せに
おいて、すべてのカテゴリーを互いに分類可能とするた
めの属性の組の集合をステップ（ｄ）で求めた論理式の
集合の論理積の論理積で求める処理を行うステップと、（ｆ）前記属性の組の集合の中から識別木作成に最も効
率的な属性の組を選択するために、属性値の分布の重な
り状態とカテゴリーＣ _i の出現頻度に基づく評価関数に
より評価を行い、最も効率的な属性の組を選択する処理
を行うステップと、（ｇ）前記ステップ（ｆ）において選択した属性の組の
中で、評価が最大となる属性をそれに含まれるカテゴリ
ーと共に親ノードとして配置し、（ｇ−１）親ノードに
含まれるあるカテゴリーにおける属性の属性値の分布が
他のカテゴリーにおける当該属性の属性値の分布と重な
っていないときは前記あるカテゴリーを当該親ノードに
対する子ノードとして配置することで分類を完了させ、
（ｇ−２）重なっているときは他のカテゴリーと分類で
きなかったカテゴリーの組を当該親ノードに対する子ノ
ードとして配置し、（ｇ−３）その子ノードのカテゴリ
ーの組の間で前記ステップ（ｂ）〜（ｅ）の処理を行っ
てステップ（ｆ）で選択した属性の組の中で評価が当該
子ノードに対する親ノードでの分類に使用した属性を除
いた属性の内で最大となる属性を、当該子ノードを親ノ
ードとする子ノードの分類に使用する属性として配置
し、（ｇ−４）前記（ｇ−１）〜（ｇ−３）の処理をカ
テゴリーＣ _i と状態（ｉ）にあるカテゴリーＣ _j との間に
おいて分類すべき子ノードがなくなるまで行う処理を行
うステップと、（ｈ）前記ステップ（ｇ）において分類が完了しなかっ
た子ノードについて、カテゴリーの分割を、ある属性Ｔ
_k についての属性値の分布が、あるｓ個のカテゴリー
Ｃ ₁ ，…，Ｃ _i ，…Ｃ _s において重なりあっている場合、
ある属性Ｔ _k についての属性値の分布が、任意のカテゴ
リーＣ _i と他のすべてのカテゴリーとが重ならない部
分、任意のカテゴリーＣ _i と他の任意の１個のカテゴリ
ーが重なる部分、任意のカテゴリーＣ _i と他の任意の２
個のカテゴリーが重なる部分、・・・、任意のカテゴリ
ーＣ _i と他の任意のｓ−１個のカテゴリーが重なる部分
に分け、これらに分割したカテゴリーが属性Ｔ _k に対し
て空集合の場合以外について、分割された新たなカテゴ
リーを作る処理を行うステップと、（ｉ）属性毎に属性値の範囲の確率分布を用いて、ステ
ップ（ｈ）で作られた新たなカテゴリーＣ _i とカテゴリ
ーＣ _j の全く重なっていない領域の属性値の確率を演算
するステップと、（ｊ）前記ステップ（ａ）で記憶装置に格納された属性
値テーブルの属性値からカテゴリーの出現頻度を求め、
その出現頻度とステップ（ｉ）で求めた属性値の分布の
重なりの確率を用いた評価関数により評価を行い、子ノ
ードの分類に最も効果的な属性の選択を行うステップ
と、（ｋ）前記ステップ（ｃ）〜（ｇ）で分類できなかった
子ノードに対し、前記子ノードの分類に最も効果的な属
性を用いて前記ステップ（ｈ）で求めたカテゴリーの分
割によってできた新しいカテゴリーで分類する処理を行
うステップと、（ｌ）前記ステップ（ｂ）〜（ｋ）により作成された識
別木よりデータの分類のフローチャートを作成して記憶
装置に格納するステップとを有し、このフローチャート
によりデータの分類を行うを特徴とする。Means for Solving the Problems To solve these problems, the data classification method of the present invention comprises the steps of: (a) setting categories C _{1 to} C _{i to} C _m for classifying data;
And the attributes T _{1 to} T _{j to} T _{n of} each category
Data is aggregated or calculated by calculation
Rate and divide the results into corresponding categories
Attribute that represents the distribution represented by the upper and lower limits for each attribute
Storing in a storage device as a value distribution table
With reference to the attribute value table stored in (b) the storage device
Then, for each attribute, a certain category C _i and another category C _i
-Analysis of the state of overlap of the attribute value distribution with C _j
The state of attribute value distribution for at least one attribute is
Attributes of the category C _j from the distribution of the attribute values of category C _i
Whether the distribution of values can be completely identified (i) or category
The distribution of the attribute values of the distribution of category C _j of attribute values of C _i and one
Partial overlap (ii) or category C
state distribution of the attribute values of _j are included in the distribution of the attribute values of C _i
The process of determining which state (iii) it belongs to
And performing, 1 if (c) each attribute is in the state (i), other
In the case of, define a coefficient of 0 and logically define each attribute.
As variables, and the category C _i and other categories C _j
The set of attributes that can classify
The logical expression of the logical variable with the same attribute
Step of selecting a formula as a set of attributes
If, on the combination of; (d) Category C _i and C _j is the state (i)
, The category C _i and all other categories
In step (c ), a set of attribute sets that can be classified is determined.
Performing the process of calculating the logical product of the set of logical expressions
If, on the combination of (e) the category C _i and C _j is the state (i)
To make all categories classifiable with each other.
A set of attribute sets for the logical expression obtained in step (d)
(F) performing the process of obtaining the identification tree from the set of the attribute sets;
In order to select a rational set of attributes,
Evaluation function based on the state of appearance and the appearance frequency of category C _i
Process to evaluate more and select the most efficient attribute set
(G) setting the attribute set selected in the step (f).
Of the attributes that have the highest rating
-And placed as parent node, and (g-1) parent node
The distribution of attribute values of attributes in a category included
Overlap with the distribution of attribute values of the attribute in other categories
If not, assign the category to the parent node.
Classification is completed by placing it as a child node to
(G-2) When overlapping, use another category and classification
The set of unsuccessful categories is assigned to the child node for the parent node.
(G-3) The category of the child node
The steps (b) to (e) are performed between
The evaluation in the set of attributes selected in step (f)
Excludes attributes used for classification at parent node for child nodes.
The attribute that is the largest of the attributes
Placed as attributes used to classify child nodes as nodes
(G-4) The processing of the above (g-1) to (g-3)
Between the category C _j in Tegori C _i and state (i)
Process until there are no more child nodes to be classified in
Cormorants and steps not completed classification in (h) wherein step (g)
For each child node, the category is divided into an attribute T
The distribution of attribute values for _k is s categories
C _1, ..., if C _i, are overlapping in ... C _s,
The distribution of attribute values for a certain attribute T _k is
Part where Lee C _i does not overlap with all other categories
Min, any category C _i and any other one category
-Overlap, any category C _i and any other 2
Where the categories overlap, ..., any category
-Part where C _i and any other s-1 categories overlap
To divide, for these to split the category attribute T _k
Except for the case of the empty set Te, split a new category
Performing a process to make Lee, using the probability distribution in the range of attribute values for each (i) attribute, stearyl
New category C _i and category created in top (h)
-Calculate the attribute value probabilities of areas where C _j does not overlap at all
Step a, the attribute stored in the storage device (j) said step (a) to
From the attribute values in the value table, find the frequency of the category,
The frequency of occurrence and the distribution of attribute values obtained in step (i)
Evaluation is performed using an evaluation function that uses the probability of overlap, and
Steps to select the attributes that are most effective for classifying the code
When could not be classified in (k) the step (c) ~ (g)
For the child node, the attribute that is most effective in classifying the child node
Of the category obtained in step (h) using the property
Perform processing to classify in a new category created by splitting
And (l) the knowledge created by the steps (b) to (k).
Create and store data classification flowcharts from different trees
Storing the data in an apparatus, and classifying the data according to the flowchart.

【００１８】[0018]

【実施例】以下、本発明を、具体的に説明する。DESCRIPTION OF THE PREFERRED EMBODIMENTS The present invention will be specifically described below.

【００１９】本発明の概念的な考え方を図５に示す。ま
た、本発明の全体的なフローチャートを図６に示す。FIG. 5 shows the concept of the present invention. FIG. 6 shows an overall flowchart of the present invention.

【００２０】本発明では、送配電線事故診断の場合のデ
ータの分類方法を実施例として具体的に説明する。In the present invention, a method of classifying data in the case of transmission / distribution line fault diagnosis will be specifically described as an embodiment.

【００２１】通常、データが計算または集計により連続
値として得られる場合は、連続値で与えられる範囲で、
それぞれの属性の属性値の分布として与える。また、デ
ータが離散値で与えられる場合は、例えば統計処理によ
り、標準偏差の３σをとる値の範囲を属性値の分布とし
て与える。Normally, when data is obtained as continuous values by calculation or tabulation, within the range given by the continuous values,
It is given as the distribution of attribute values of each attribute. When the data is given as discrete values, a range of values having a standard deviation of 3σ is given as an attribute value distribution by, for example, statistical processing.

【００２２】本実施例では以下の条件により属性値の分
布を求めた。In this embodiment, the distribution of attribute values is obtained under the following conditions.

【００２３】なお、本発明の実施例では、図７に示す３
回線配電線の線路モデルを想定し、ｃ配電線上で事故が
起きたとする。各配電線の静電容量は図示の通りであ
る。またｃ配電線の変電所２次母線のインピーダンスは
０．３６２Ωならびに線路インピーダンスは０．５３６
＋ｊ１．４０７Ωとする。負荷は均等負荷と考え、電源
端の大地間電圧は３８１０Ｖ、また電流２００Ａを中心
値とし、負荷予測により、相対誤差の標準偏差を３．８
％とし、３σを考えた場合、１７７Ａ〜２２３Ａとし
た。また断線事故においては、負荷は三相負荷のみと考
え、負荷の力率は１００％と考えた。事故はｃ配電線の
電源端と受電端との間で起こったと考え、１線地絡事故
並びに２線線間短絡事故ならびに１線断線事故につい
て、定常状態で電源端での絶対値を計算した。配電線は
非接地方式とし、地絡事故時における故障点抵抗は０〜
６０００Ωとした。In the embodiment of the present invention, 3 shown in FIG.
Assuming a line model of a line distribution line, it is assumed that an accident has occurred on the line c. The capacitance of each distribution line is as shown. Further, the impedance of the secondary bus of the substation of the c distribution line is 0.362Ω and the line impedance is 0.536.
+ J 1.407Ω. The load is considered to be a uniform load, the ground voltage at the power supply end is 3810 V, and the current is 200 A. The standard deviation of the relative error is 3.8 based on the load prediction.
% And 3σ, 177A to 223A. In the event of a disconnection accident, the load was considered to be only a three-phase load, and the power factor of the load was considered to be 100%. The accident was considered to have occurred between the power supply end and the receiving end of the c distribution line, and the absolute values at the power supply end were calculated in a steady state for the 1-line ground fault, the 2-line short-circuit, and the 1-line disconnection. . The distribution line shall be ungrounded, and the fault point resistance in case of ground fault
6000Ω.

【００２４】（Ａ）第１実施例１）識別木の作成方法ならびにその配電線事故診断にお
けるデータの分類の適用例本実施例では、センサの零相電流、零相電圧、各相電
流、各相対地間電圧などのセンサ情報をもとに地絡事
故、短絡事故、断線事故などの事故を検出するアルゴリ
ズムをデータの分類識別木により作成する。ここでは、
センサ情報を属性値とし、正常及び短絡事故、地絡事故
ならびに断線事故の区別をカテゴリーとする。(A) First Embodiment 1) Method of Creating Identification Tree and Application of Data Classification in Distribution Line Fault Diagnosis In this embodiment, zero-phase current, zero-phase voltage, each phase current, Based on sensor information such as relative ground-to-ground voltage, an algorithm for detecting an accident such as a ground fault, short-circuit, or disconnection is created by a data classification tree. here,
The sensor information is used as the attribute value, and the categories of normal and short circuit accidents, ground fault accidents, and disconnection accidents are classified.

【００２５】ここで、選択すべきｍ個のカテゴリーをＣ
₁・・・Ｃ_i・・・Ｃ_mとし、これらのカテゴリーが個々
にもつｎ個の属性をＴ₁・・・Ｔ_j・・・Ｔ_nとする。Here, m categories to be selected are represented by C
And ₁ ··· C _i ··· C _m, these categories of n attributes to T ₁ ··· T _j ··· T _n having individually.

【００２６】配電線事故診断におけるデータの分類の選
択すべき事故及び正常値のカテゴリーをＣ_N：正常Ｃ_bc：ｂｃ線２線短絡事故Ｃ_ca：ｃａ線２線短絡事故Ｃ_ab：ａｂ線２線短絡事故Ｃ_a：ａ線地絡事故Ｃ_b：ｂ線地絡事故Ｃc：ｃ線地絡事故Ｃ_Da：ａ線断線事故Ｃ_Db：ｂ線断線事故Ｃ_Dc：ｃ線断線事故とする。The categories of data to be selected and the normal values in the classification of data in the distribution line accident diagnosis are as _follows : C _N : normal C _bc : bc line 2 line short circuit accident C _ca : ca line 2 line short circuit accident C _ab : ab line 2 Line short circuit accident C _a : a line ground fault accident C _b : b line ground fault accident Cc: c line ground fault accident C _Da : a line disconnection accident _CDB : b line disconnection accident C _Dc : c line disconnection accident

【００２７】また上記のカテゴリーが個々にもつ属性をＴ_V0：零相電圧Ｔ_I0：零相電流Ｔ_Ia：ａ相電流Ｔ_Ib：ｂ相電流Ｔ_Ic：ｃ相電流Ｔ_Va：ａ相対地間電圧Ｔ_Vb：ｂ相対地間電圧Ｔ_Vc：ｃ相対地間電圧とする。The attributes of each of the above categories are T _V0 : zero-phase voltage T _I0 : zero-phase current T _Ia : a-phase current T _Ib : b-phase current T _Ic : c-phase current T _Va : a relative ground Voltage T _Vb : b relative ground-to-ground voltage T _Vc : c relative ground-to-ground voltage.

【００２８】前記の配電線モデルで計算した属性値を表
２に示す。Table 2 shows the attribute values calculated by the distribution line model.

【００２９】[0029]

【表２】ここでＴ_V0，Ｔ_Va，Ｔ_Vb，Ｔ_Vcの属性値の単位はＶ、ま
たＴ_I0，Ｔ_Ia，Ｔ_Ib，Ｔ_Icの属性値の単位はＡである。[Table 2] Here, the unit of the attribute value of T _V0 , T _Va , T _Vb , and T _Vc is V, and the unit of the attribute value of T _I0 , T _Ia , T _Ib , and T _Ic is A.

【００３０】２）２−１）任意の二つのカテゴリーの分類に必要な属性の
選択（図６のフローチャートの３に相当する）すべてのカテゴリーを分類するために必要な属性を見つ
けるために、まず、任意のある一つのカテゴリーに注目
し、それを分類するのに必要な属性を求める。今、注目
しているカテゴリーをＣ_iとし、Ｃ_i以外の任意の一つＣ
_jとの属性値分布図上での相対的な分布関係を考える。
属性Ｔ_kにおける分布図上でのＣ_iから見たＣ_jの相対的
な分布関係は、図２に示すように、次の三つの状態が考
えられる。2) 2-1) Selection of Attributes Required for Classification of Arbitrary Two Categories (corresponding to 3 in the flowchart of FIG. 6) First, in order to find the attributes required for classifying all the categories, Attention is paid to one arbitrary category, and an attribute necessary for classifying it is obtained. The category of interest is C _i, and any one other than C _i C
Consider a relative distribution relationship with _j on the attribute value distribution diagram.
Relative distribution relationship C _j as viewed from the C _i in the distribution map in the attribute T _k, as shown in FIG. 2, can be considered the following three states.

【００３１】状態（ｉ）Ｃ_iの分布とＣ_jの分布は重
なっていない。状態（ｉｉ）Ｃ_iの分布はＣ_jの分布とすべて重なって
いる。状態（ｉｉｉ）Ｃ_iの分布はＣ_jの分布と一部重なってい
る。State (i) The distribution of C _{i and} the distribution of C _j do not overlap. State (ii) The distribution of C _i all overlaps with the distribution of C _j . The distribution of the state (iii) C _i partially overlaps with the distribution of C _j .

【００３２】これら三つの状態のうち、Ｃ_iとＣ_jが完全
に分類可能な状態は状態（ｉ）のみである。つまり、任
意の属性Ｔ_kでＣ_iとＣ_jが分類可能であるためには、そ
の二つのカテゴリーの属性値分布の状態が状態（ｉ）で
あることが必要条件となる。そこで属性Ｔ_kが状態
（ｉ）であるか否かを示すために式（１）に示すような
係数ａ_ikを定義する。Of these three states, the only state in which C _i and C _j can be completely classified is state (i). That is, in order for C _i and C _{j to} be categorized by an arbitrary attribute T _k , a necessary condition is that the state of the attribute value distribution of the two categories is state (i). Therefore, a coefficient _aik as shown in Expression (1) is defined to indicate whether the attribute _Tk is in the state (i).

【００３３】ａ_ik＝１Ｔ_kが状態（ｉ）０その他（１）[0033] a _ik = 1 T _k is the state (i) 0 Other (1)

【００３４】また、Ｔ_kを論理変数と考え、分類に用い
る場合には１、用いない場合には０の２値の係数を考え
る。Ｃ_iとＣ_jを分類可能とする属性値は係数ａ_ikを用い
論理和の形に表現すると次式のようになる。 _Considering that T _k is a logical variable, a binary coefficient of 1 is used when it is used for classification, and 0 when it is not used. The attribute value that allows classification of C _i and C _j is expressed by the following equation when expressed in the form of a logical sum using a coefficient a _ik .

【００３５】ｆ（Ｃ_i，Ｃ_j）＝ａ_i1Ｔ₁＋・・・＋ａ_ikＴ_k＋・・・・ａ_inＴ_n （２）[0035] _{_{f (C i, C j)}} = a i1 T 1 + ··· + a ik T k + ···· a in T n (2)

【００３６】つまり（２）式においてＣ_iとＣ_jはｆ（Ｃ
_i，Ｃ_j）＝１となる場合に分類可能となり、ｆ（Ｃ_i，
Ｃ_j）の項の少なくとも一つの属性を用いればＣ_iとＣ_j
は分類できる。That is, in equation (2), C _i and C _j are f (C
_i , C _j ) = 1, classification is possible, and f (C _i ,
If at least one attribute of the term C _j ) is used, C _i and C _j
Can be classified.

【００３７】カテゴリーＣ_Nと他のカテゴリーとを分類
するために必要な属性を選択した結果を次に示す。The result of selecting the attributes necessary for classifying the category C _N from other categories is shown below.

【００３８】ｆ（Ｃ_N，Ｃ_bc）＝Ｔ_Ib＋Ｔ_Ic＋Ｔ_Vb＋Ｔ_Vc （３）ｆ（Ｃ_N，Ｃ_ca）＝Ｔ_Ia＋Ｔ_Ic＋Ｔ_Va＋Ｔ_Vc （４）ｆ（Ｃ_N，Ｃ_ab）＝Ｔ_Ia＋Ｔ_Ib＋Ｔ_Va＋Ｔ_Vb （５）ｆ（Ｃ_N，Ｃ_a）＝Ｔ_V0＋Ｔ_I0＋Ｔ_Va＋Ｔ_Vb＋Ｔ_Vc （６）ｆ（Ｃ_N，Ｃ_b）＝Ｔ_V0＋Ｔ_I0＋Ｔ_Va＋Ｔ_Vb＋Ｔ_Vc （７）ｆ（Ｃ_N，Ｃ_c）＝Ｔ_V0＋Ｔ_I0＋Ｔ_Va＋Ｔ_Vb＋Ｔ_Vc （８）F (C _N , C _bc ) = T _Ib + T _Ic + T _Vb + T _Vc (3) f (C _N , C _ca ) = T _Ia + T _Ic + T _Va + T _Vc (4) f (C _N , C _ab) ) = T _Ia + T _Ib + T _Va + T _Vb (5) f (C _N , C _a ) = T _V0 + T _I0 + T _Va + T _Vb + T _Vc (6) f (C _N , C _b ) = T _V0 + T _I0 + T _Va + T _Vb + T _Vc (7) f (C _N , C _c ) = T _V0 + T _I0 + T _Va + T _Vb + T _Vc (8)

【００３９】しかしながら、ｆ（Ｃ_N，Ｃ_Da），ｆ
（Ｃ_N，Ｃ_Db），ｆ（Ｃ_N，Ｃ_Dc）は属性Ｔ_kが状態
（ｉ）である属性がない。先に提案した方法では、この
場合は識別木を作成することが不可能であった。However, f (C _N , C _Da ), f
(C _N , C _Db ) and f (C _N , C _Dc ) have no attribute whose attribute T _k is state (i). With the method proposed earlier, it was impossible to create an identification tree in this case.

【００４０】２−２）注目カテゴリーの分類に必要な属
性の選択（図６のフローチャートの４に相当する）ここ
では、今注目しているカテゴリーＣ_iと、ある属性Ｔ_kが
状態（ｉ）の状態のすべてのカテゴリーＣ_jを分類可能
とする属性の組を求める。2-2) Selection of Attributes Necessary for Classification of Attention Category (corresponding to 4 in the flowchart of FIG. 6) Here, the category C _{i of} interest and a certain attribute T _k are in state (i). A set of attributes that can classify all the categories C _j in the state is obtained.

【００４１】Ｃ_iと属性Ｔ_kが状態（ｉ）である状態の一
つＣ_jとを分類可能とする属性は式（２）で求まってい
る。従って、Ｃ_iと属性Ｔ_kが状態（ｉ）である状態のす
べてのカテゴリーとを分類可能とするためにはＣ_iとそ
れ以外のそれぞれのカテゴリーに対してｆ（Ｃ_i，Ｃ_j）
（ｊ＝１，・・・，ｍ，ｉ≠ｊ）の論理積を式（９）の
ように行う。The attributes C _i and attribute T _k is to be classified and one C _j state is the state (i) is been determined by equation (2). Thus, C _i and attribute T _k is f with respect to the state (i) a is all categories and the C _i is to allow classify other each category of state (C _i, C _j)
The logical product of (j = 1,..., M, i ≠ j) is performed as in Expression (9).

【００４２】ｆ（Ｃ_i）＝ｆ（Ｃ_i，Ｃ₁）・・ｆ（Ｃ_i，Ｃ_j）・・ｆ（Ｃ_i，Ｃ_m）但しｉ≠ｊ（９）F (C _i ) = f (C _i , C ₁ ) ·· f (C _i , C _j ) ·· f (C _i , C _m ) where i ≠ j (9)

【００４３】すなわち、このｆ（Ｃ_i）の演算結果にお
ける論理積の形で与えられる属性の組は、それぞれ独立
して、Ｃ_iと属性Ｔ_kが状態（ｉ）である状態のすべての
カテゴリーを分類可能とする属性の組である。That is, the set of attributes given in the form of a logical product in the operation result of f (C _i ) is independent of all the categories of the state in which C _i and the attribute T _k are in state (i). Is a set of attributes that can be classified.

【００４４】すなわち、カテゴリーＣ_iと属性Ｔ_kが状態
（ｉ）の状態でない任意のＣ_jに対してはｆ（Ｃ_i，
Ｃ_j）がｆ（Ｃ_i）の論理積の一項としては含まれないこ
とになる。但し、Ｃ_iと他のすべてのカテゴリーにおい
てｆ（Ｃ_i，Ｃ_j）＝０の場合は、ｆ（Ｃ_i）を求めず、
次の計算ステップに移る。[0044] That is, for any of C _j category C _i and attributes T _k is not in the state of the state _(i) f (C i,
C _j ) is not included as a term of the logical product of f (C _i ). However, if f (C _i , C _j ) = 0 in C _i and all other categories, f (C _i ) is not obtained, and
Move on to the next calculation step.

【００４５】以上の式により、Ｃ_Nと属性Ｔ_kが状態
（ｉ）である状態のすべてのカテゴリーＣ_jとを分類す
るために必要な属性は次の（１０）式のように式（３）
〜（８）の論理積で表すことができる。According to the above equation, the attribute necessary for classifying C _N and all the categories C _j in the state where the attribute T _k is the state (i) is expressed by the following equation (3) as in the following equation (10). )
To (8).

【００４６】ｆ（Ｃ_N）＝ｆ（Ｃ_N，Ｃ_bc）ｆ（Ｃ_N，Ｃ_ca）ｆ（Ｃ_N，Ｃ_ab）ｆ（Ｃ_N，Ｃ_a）ｆ（Ｃ_N，Ｃ_b）ｆ（Ｃ_N，Ｃ_c）＝Ｔ_IcＴ_IaＴ_V0＋Ｔ_IcＴ_VaＴ_I0＋Ｔ_IcＴ_VaＴ_V0 ＋Ｔ_VbＴ_IaＴ_I0＋Ｔ_VbＴ_IaＴ_V0＋Ｔ_VbＴ_IcＴ_I0 ＋Ｔ_VbＴ_IcＴ_V0＋Ｔ_VbＴ_Va＋Ｔ_VbＴ_Vc＋Ｔ_VcＴ_IaＴ_I0 ＋Ｔ_VcＴ_IaＴ_V0＋Ｔ_VcＴ_Va （１０）式（１０）において、１２の項のそれぞれの属性の組に
よってＣ_Nは属性Ｔ_kが状態（ｉ）である状態のすべてカ
テゴリーＣ_jを分類可能とする。F (C _N ) = f (C _N , C _bc ) f (C _N , C _ca ) f (C _N , C _ab ) f (C _N , C _a ) f (C _N , C _b ) f _{_{(C N, C c) =}} T Ic T Ia T V0 + T Ic T Va T I0 + T Ic T Va T V0 + T Vb T Ia T I0 + T Vb T Ia T V0 + T Vb T Ic T I0 + T Vb T Ic T V0 + T _Vb T _Va + T _Vb T _Vc + T _Vc T _Ia T _I0 + T _Vc T _Ia T _V0 + T _Vc T _Va (10) In the expression (10), the _CN has the attribute T _k according to each attribute set of the 12 terms. All categories _Cj in the state (i) can be classified.

【００４７】２−３）属性Ｔ_kが状態（ｉ）の状態であ
るカテゴリーすべてを分類可能な属性の選択（図６のフ
ローチャートの５に相当する）式（９）によって求まっ
た各カテゴリーが属性Ｔ_kが状態（ｉ）の状態のカテゴ
リーを分類するのに必要な属性の組から、少なくとも１
組ずつを取り出し、それらのすべてを含む属性の組を用
いれば、属性Ｔ_kが状態（ｉ）の状態のカテゴリーが分
類可能となる。つまり、属性Ｔ_kが状態（ｉ）の状態の
カテゴリーを分類可能とするために必要な属性の組は、
各々のカテゴリーに対してｆ（Ｃ_i）＝１（ｉ＝１，・
・・，ｍ）とならしめる属性を見つけることによって求
まるから、それらの論理積を式（１１）のように行う。2-3) Selection of attributes that can classify all categories in which attribute T _k is in state (i) (corresponding to 5 in the flowchart of FIG. 6) Each category determined by equation (9) is an attribute T _k is at least 1 from the set of attributes required to classify the category of state in state (i).
By taking out each set and using a set of attributes including all of them, the category of the state where the attribute T _k is state (i) can be classified. That is, a set of attributes necessary for the attribute T _k to be able to classify the state category of the state (i) is:
For each category, f (C _i ) = 1 (i = 1,.
.., m) are obtained by finding an attribute that can be expressed as: (m), and their logical product is performed as in Expression (11).

【００４８】Ｅ＝ｆ（Ｃ₁）・・・ｆ（Ｃ_i）・・・ｆ（Ｃ_m）（１１）この演算結果は、論理式の積和形となり次のように表せ
る。E = f (C ₁ )... F (C _i )... F (C _m ) (11) The operation result is a product-sum form of a logical expression and can be expressed as follows.

【００４９】Ｅ＝Ｔ ₁ ・Ｔ ₂ ・Ｔ ₃ ・・・＋・・・＋Ｔ _a ・Ｔ _b ・Ｔ _c ・・・＋・・・＋Ｔ _p ・Ｔ _q ・Ｔ _r ・・・ここでＡ ₁ ＝Ｔ ₁ ・Ｔ ₂ ・Ｔ ₃ ・・・、Ａ _x ＝Ｔ _a Ｔ _b Ｔ _c ・・・、Ａ _p ＝Ｔ _p ・Ｔ _q ・Ｔ _r ・・・とすると、Ｅ＝Ａ₁＋・・・＋Ａ_x＋・・・＋Ａ_p （１２）となる。 [0049] _{_{E = T 1 · T 2 ·}} T 3 ··· + ··· + T a · T b · T c ··· + ··· + T p · T q · T r ··· where A ₁ _{_{= T 1 · T 2 · T}} 3 ···, a x = T a T b T c ···, When _{_{_{a p = T p · T q}}} · T r ···, E = a 1 + ·· · + a _x + a ··· + a _p (12).

【００５０】従って、Ａ₁，・・・，Ａ_x，・・・，Ａ_p
は属性Ｔ_kが状態（ｉ）の状態のカテゴリーを分類可能
とするのに必要な属性の組である。Therefore, A ₁ ,..., A _x _,.
Is a set of attributes necessary for the attribute T _k to classify the category of the state of the state (i).

【００５１】以下同様に、ｆ（Ｃ_bc），ｆ（Ｃ_ca），ｆ
（Ｃ_ab），ｆ（Ｃ_a），ｆ（Ｃ_b），ｆ（Ｃ_c），ｆ（Ｃ
_Da），ｆ（Ｃ_Db），ｆ（Ｃ_Dc）を求める属性Ｔ_kが状態
（ｉ）の状態のカテゴリーを分類可能とする属性の組
は、Similarly, f (C _bc ), f (C _ca ) and f (C _ca )
(C _ab ), f (C _a ), f (C _b ), f (C _c ), f (C
A set of attributes that allows the attribute T _k for _obtaining _Da ), f (C _Db ), and f (C _Dc ) to classify the category of the state of the state (i) is:

【００５２】Ｅ＝ｆ（Ｃ_N）ｆ（Ｃ_bc）ｆ（Ｃ_ca）ｆ（Ｃ_ab）ｆ（Ｃ_a）ｆ（Ｃ_b）ｆ（Ｃ_c）ｆ（Ｃ_Da）ｆ（Ｃ_Db）ｆ（Ｃ_Dc）＝Ｔ_IaＴ_IbＴ_VaＴ_VbＴ_Vc＋Ｔ_IaＴ_IcＴ_VaＴ_VbＴ_Vc＋Ｔ_IbＴ_IcＴ_VaＴ_VbＴ_Vc （１３）となる。これを次のように置き換える。E = f (C _N ) f (C _bc ) f (C _ca ) f (C _ab ) f (C _a ) f (C _b ) f (C _c ) f (C _Da ) f (C _Db ) f (C _Dc ) = T _Ia T _Ib T _Va T _Vb T _Vc + T _Ia T _Ic T _Va T _Vb T _Vc + T _Ib T _Ic T _Va T _Vb T _Vc (13) Replace this with:

【００５３】Ａ₁＝Ｔ_IaＴ_IbＴ_VaＴ_VbＴ_Vc，Ａ₂＝Ｔ_IaＴ_IcＴ_VaＴ_VbＴ_Vc，Ａ₃＝Ｔ_IbＴ_IcＴ_VaＴ_VbＴ_Vc （１４）A ₁ = T _Ia T _Ib T _Va T _Vb T _Vc , A ₂ = T _Ia T _Ic T _Va T _Vb T _Vc , A ₃ = T _Ib T _Ic T _Va T _Vb T _Vc (14)

【００５４】つまり、これらの３組は、それぞれ独立し
て、少なくとも１つの属性Ｔ_kが状態（ｉ）である状態
のカテゴリーを分類可能とする属性の組である。また、
Ａ₃の属性の組を選択した場合、Ｔ_Ib，Ｔ_Ic，Ｔ_Va，Ｔ
_Vb，Ｔ_Vc以外の属性は、分類に必要のない属性である。In other words, these three sets are attribute sets that enable classification of a state category in which at least one attribute T _k is state (i). Also,
If you choose a set of attributes of _{_{_{A 3, T Ib, T Ic}}} , T Va, T
Attributes other than _Vb and _TVc are attributes that are not required for classification.

【００５５】今までの手続きを考察してみると、求めら
れた３組の属性の組Ａ₁，Ａ₂，Ａ₃の属性を使うことに
より、図８の実線で結んだカテゴリー同士を分類でき
る。したがってＡ₁，Ａ₂，Ａ₃の３組のそれぞれの属性
は実線で結ばれたカテゴリーは分類できるが、破線で結
ばれたカテゴリーは分類は完全にはできない。Considering the procedures so far, the categories connected by the solid line in FIG. 8 can be classified by using the attributes of the three sets of attributes A ₁ , A ₂ , and A ₃ obtained. . Therefore, for each of the three sets of attributes A ₁ , A ₂ , and A ₃ , the category connected by the solid line can be classified, but the category connected by the broken line cannot be completely classified.

【００５６】３）識別木の各ノードへの属性の配置（図
６のフローチャートの６に相当する）まず実線で結ばれ
たカテゴリーを分類するために選択した属性の組の最適
なものを選択し、さらには、最も効率的に配置するには
どうするかについて述べる。3) Arrangement of attributes in each node of the identification tree (corresponding to 6 in the flowchart of FIG. 6) First, an optimal set of attributes selected to classify the categories connected by solid lines is selected. And how to arrange them most efficiently.

【００５７】３−１）最適な属性を選ぶ評価方法（１）非重なり度合いａ_k（ｉ，ｊ）属性値分布において、他の分布と重なりが全くない領域
を多くもつ属性値は分類のための貢献度が高くなる。そ
のような属性値を多く含む属性の組を用いて識別木を構
成した方が上位のノードにおいて分類が完了する確率が
大きくなり分類時間の短縮につながる。そこで、あるカ
テゴリーＣ_iの属性値分布について全く重なっていない
領域がＣ_jの属性値分布に対してどの程度占めるかを示
す非重なり度合いａ_k（ｉ，ｊ）を次式で表す。これは
Ｔ_kがＣ_iの分類に対してどの程度Ｃ_jの貢献があるかを
示すものである。3-1) Evaluation method for selecting an optimal attribute (1) Degree of non-overlapping a _k (i, j) In an attribute value distribution, an attribute value having many areas having no overlap with other distributions is classified. Contributes more. Constructing an identification tree using a set of attributes including a large number of such attribute values increases the probability of completion of classification at a higher-level node, leading to a reduction in classification time. Therefore, a non-overlapping degree a _k (i, j) indicating how much a region that does not overlap at all with the attribute value distribution of a certain category C _i occupies the attribute value distribution of C _j is expressed by the following equation. This shows how T _k is the contribution degree C _j with respect to the classification of C _i.

【００５８】ａ_k（ｉ，ｊ）＝ｌ_ik／Ｌ（Ｃ_i）（１５）ここでｌ_ik：Ｔ_kの属性値分布において、Ｃ_iの分布に対
してＣ_jの分布により重なりがない領域の範囲（図９参
照）Ｌ（Ｃ_i）：Ｃ_iの分布の範囲A _k (i, j) = l _ik / L (C _i ) (15) Here, in the attribute value distribution of l _ik : T _k , there is no overlap with the distribution of C _{i due to} the distribution of C _j. Range of region (see FIG. 9) L (C _i ): range of distribution of C _i

【００５９】前掲の表２の測定データに基づいて非重な
り度合いａ_k（ｉ，ｊ）をＴ_V0について算出すると表３
のようになる。表３ではｉは列、ｊは行を表す。The non-overlapping degree a _k (i, j) is calculated for T _V0 based on the measurement data in Table 2 above, and Table 3
become that way. In Table 3, i represents a column and j represents a row.

【００６０】[0060]

【表３】 [Table 3]

【００６１】（２）出現頻度Ｐ₁ 次に、カテゴリー、すなわち事故の種類の出現頻度を求
める。その結果を表４に示す。(2) Appearance Frequency P ₁ Next, the category, that is, the appearance frequency of the type of accident is obtained. Table 4 shows the results.

【００６２】[0062]

【表４】 [Table 4]

【００６３】当然、出現頻度Ｐ₁は正常時≫地絡事故＞
短絡事故＞断線事故である。Naturally, the appearance frequency P ₁ is normal time≫ground fault>
Short circuit accident> Disconnection accident.

【００６４】（３）評価値Ｆ（Ｔ_k）以上挙げた２つのパラメータａ_k（ｉ，ｊ），Ｐ₁を用い
て、各属性に対して次式に示すような評価関数を定め
た。(3) Evaluation value F (T _k ) Using the two parameters a _k (i, j) and P ₁ described above, an evaluation function as shown in the following equation is determined for each attribute.

【００６５】[0065]

【数１】 (Equation 1)

【００６６】このＦ（Ｔ_k）が大きいＴ_kほど、出現頻度
の大きいカテゴリーに対して分類の可能性が大きい。[0066] The F (T _k) is larger T _k, there is a high possibility of classification for large category of frequency of occurrence.

【００６７】前述の例の場合、評価値は次のようにな
る。In the case of the above example, the evaluation values are as follows.

【００６８】[0068]

【表５】 [Table 5]

【００６９】以下、定義２で進める。Hereinafter, the process proceeds with definition 2.

【００７０】識別木作成に効果的な属性の組Ａ_effは、
Ａ₁，Ａ₂，Ａ₃のそれぞれの属性の評価値Ｆ（Ｔ_k）の積
Ｇ（Ａ_ｘ）が最大となる組である。そこで各組について
Ｇ（Ａ_ｘ）を求める。A set of attributes A _eff effective for creating an identification tree is
This is a set that maximizes the product G (A _x ) of the evaluation values F (T _k ) of the respective attributes of A ₁ , A ₂ , and A ₃ . Therefore, G (A _x ) is obtained for each set.

【００７１】Ｇ（Ａ₁）＝Ｆ（Ｔ_Ia）Ｆ（Ｔ_Ib）Ｆ（Ｔ_Va）Ｆ（Ｔ_Vb）Ｆ（Ｔ_Vc）＝５．３６４×１０^-12 （１８）Ｇ（Ａ₂）＝Ｆ（Ｔ_Ia）Ｆ（Ｔ_Ic）Ｆ（Ｔ_Va）Ｆ（Ｔ_Vb）Ｆ（Ｔ_Vc）＝５．３６４×１０^-12 （１９）Ｇ（Ａ₃）＝Ｆ（Ｔ_Ib）Ｆ（Ｔ_Ic）Ｆ（Ｔ_Va）Ｆ（Ｔ_Vb）Ｆ（Ｔ_Vc）＝８．６４８×１０^-12 （２０）Ｇ（Ａ₃）の値が最大であるため、Ｇ（Ａ₃）をとる。G (A ₁ ) = F (T _Ia ) F (T _Ib ) F (T _Va ) F (T _Vb ) F (T _Vc ) = 5.364 × 10 ⁻¹² (18) G (A ₂ ) = F (T _Ia ) F (T _Ic ) F (T _Va ) F (T _Vb ) F (T _Vc ) = 5.364 × 10 ^-12 (19) G (A ₃ ) = F (T _Ib ) F ( T _Ic ) F (T _Va ) F (T _Vb ) F (T _Vc ) = 8.648 × 10 ⁻¹² (20) Since the value of G (A ₃ ) is the maximum, G (A ₃ ) is taken.

【００７２】４）識別木の各ノードへの属性（図６のフ
ローチャートの７に相当する）識別木の各ノードへの配置は次のようにする。まず根ノ
ードに関してはＡ_effのうち評価値Ｆ（Ｔ_k）が最も大き
い方を根ノードに考える。ここではＦ（Ｔ_Va）＝Ｆ（Ｔ
_Vc）なのでＴ_Vaとする。属性の重なりの状態により、属
性の分布に重なりのない領域、属性の分布に重なる領域
に分かれる。4) Attributes to each node of the identification tree (corresponding to 7 in the flowchart of FIG. 6) The arrangement of the identification tree to each node is as follows. First, regarding the root node, the one having the largest evaluation value F (T _k ) of A _eff is considered as the root node. Here, F (T _Va ) = F (T
_Vc), so the T _Va. Depending on the state of the attribute overlap, the area is divided into an area where the attribute distribution does not overlap and an area where the attribute distribution overlaps.

【００７３】属性がこれらの重なりのない領域の値にな
った場合には、根ノードで分類が完了する。重なりのあ
る領域はカテゴリー間の分類が不可能であり、他の属性
で再度分類する。すなわち、前者は葉ノードＮ_eとし、
後者は再分類ノードＮ_cとする。Ｎ_cにおける集合Ｎ_c’
は例えば図１０に示した領域１に関しては、（Ｃ_ca，Ｃ
_ab，Ｃ_a）となる。When the attribute becomes the value of the non-overlapping area, the classification is completed at the root node. Overlapping regions cannot be classified between categories, and are re-classified with other attributes. That is, the former is the leaf node N _e,
The latter is a re-classification node _Nc . Set in N _{_c} N _c '
For example, regarding the area 1 shown in FIG. 10, (C _ca , C
_ab , C _a ).

【００７４】次に再分類ノードに配置する属性は次のよ
うに選択する。領域１を例にとればＳ_c’の要素の２つ
ずつのカテゴリーをそれぞれ分類可能とする。属性は次
式のようになる。但し、ｆ（Ｃ_i，Ｃ_j）＝ｆ（Ｃ_j，
Ｃ_i）である。Next, the attribute to be arranged in the re-classification node is selected as follows. Taking region 1 as an example, it is possible to classify two categories of elements of S _c ′. The attributes are as follows: Here, f (C _i , C _j ) = f (C _j ,
C _i ).

【００７５】ｆ（Ｃ_ca，Ｃ_ab）＝Ｔ_Ib＋Ｔ_Ic＋Ｔ_Vb＋Ｔ_Vc （２１）ｆ（Ｃ_ab，Ｃ_a）＝Ｔ_V0＋Ｔ_I0＋Ｔ_Ia＋Ｔ_Ib＋Ｔ_Vb＋Ｔ_Vc （２２）ｆ（Ｃ_a，Ｃ_ca）＝Ｔ_V0＋Ｔ_I0＋Ｔ_Ia＋Ｔ_Ic＋Ｔ_Vc （２３）F (C _ca , C _ab ) = T _Ib + T _Ic + T _Vb + T _Vc (21) f (C _ab , C _a ) = T _V0 + T _I0 + T _Ia + T _Ib + T _Vb + T _Vc (22) f (C _a , C _ca ) = T _V0 + T _I0 + T _Ia + T _Ic + T _Vc (23)

【００７６】Ｓ_c’の全要素を分類可能とする属性は、
これらの論理積により次式のように求まる。An attribute that allows all elements of S _c ′ to be classified is
The logical product of these results in the following equation.

【００７７】ｆ（Ｃ_ca，Ｃ_ab）ｆ（Ｃ_ab，Ｃ_a）ｆ（Ｃ_a，Ｃ_ca）＝Ｔ_V0Ｔ_Ib＋Ｔ_V0Ｔ_Ic＋Ｔ_I0Ｔ_Ib＋Ｔ_I0Ｔ_Ic＋Ｔ_IaＴ_Ib＋Ｔ_IaＴ_Ic ＋Ｔ_IbＴ_Ic＋Ｔ_Vc＋Ｔ_V0Ｔ_Vb＋Ｔ_I0Ｔ_Vb＋Ｔ_IaＴ_Vb＋Ｔ_IcＴ_Vb （２４）F (C _ca , C _ab ) f (C _ab , C _a ) f (C _a , C _ca ) = T _V0 T _Ib + T _V0 T _Ic + T _I0 T _Ib + T _I0 T _Ic + T _Ia T _Ib + T _Ia T _Ic + T _Ib T _Ic + T _Vc + T _V0 T _Vb + T _I0 T _Vb + T _Ia T _Vb + T _Ic T _Vb (24)

【００７８】この結果、Ａ_effの部分集合となっている
属性はＴ_Vc，Ｔ_IbＴ_Ic，Ｔ_IcＴ_Vbである。この３つの属
性の組について、それぞれの属性の評価値Ｆ（Ｔ_k）の
積Ｇ（Ａ_Ｘ）を求めるとＴ_Vcの属性の組が最大となるの
でＴ_Vcを配置する。As a result, the attributes that are a subset of A _eff are T _Vc , T _Ib T _Ic , and T _Ic T _Vb . The set of three attributes, the set of attributes of the product G (A _X) Request the T _Vc of the evaluation value F for each attribute (T _k) is to place the T _Vc since the maximum.

【００７９】以上のような操作をＡ_effの部分集合の属
性を使って行う。その結果の一部を図１１に示す。但
し、＊印をつけたノードは、これまでの手続きでは分離
することができない。The above operation is performed using the attributes of the subset of A _eff . FIG. 11 shows a part of the result. However, nodes marked with * cannot be separated by the conventional procedures.

【００８０】５）カテゴリーの分割（図６のフローチャ
ートの８に相当する）ある属性分布Ｔ_kにおいてあるｓ個のカテゴリーＣ₁，
…，Ｃ_i，…，Ｃ_sが重なりあっている場合、すなわちｓ
個のカテゴリーのすべての組み合わせが図２の状態（ｉ
ｉ）または状態（ｉｉｉ）の場合、以下の方法でカテゴ
リーの分割を行う。[0080] 5) divided categories (corresponding to 8 in the flowchart of FIG. 6) s number of category C _1, which is in the attribute distribution T _k with,
.., C _i ,..., C _s overlap, ie, s
All combinations of the categories are in the state (i
In the case of i) or state (iii), the category is divided by the following method.

【００８１】ある属性Ｔ_kにおいて任意のカテゴリーＣ_i
は他の全てのカテゴリーと重なりのない部分、任意のカ
テゴリーＣ_iと他の任意の一個のカテゴリーが重なる部
分、任意のカテゴリーＣ_iと他の任意の二個のカテゴリ
ーが重なる部分、・・・、任意のカテゴリーＣ_iと他の
任意のｓ−２個のカテゴリーが重なる部分、任意のカテ
ゴリーＣ_iと他の任意のｓ−１個のカテゴリーが重なる
部分に分けることができる。上記の分割により、分割し
た新たなカテゴリーを作ることができる。また任意のカ
テゴリーＣ_iと他の任意のｓ−ｎ個のカテゴリーが重な
る部分の組み合わせの数は_sＣ_s−n＋1で与えられる。ま
た分割したカテゴリーがすべての属性Ｔ_kに対して空集
合の場合、新たなカテゴリーは作らないとする。For any attribute T _k , any category C _i
Is a portion that does not overlap with all other categories, a portion where any category C _i overlaps with any other one category, a portion where any category C _i overlaps with any other two categories, ... , An arbitrary category C _i and another arbitrary s−2 categories overlap, and an arbitrary category C _i and another arbitrary s−1 categories overlap. With the above division, a new divided category can be created. In addition, the number of combinations of a portion where an arbitrary category C _i and another arbitrary sn category overlap with each other is given by _s C _{s−n + 1} . If the divided category is an empty set for all the attributes T _k , no new category is created.

【００８２】具体的に図１２で子ノードが３個のカテゴ
リーＣ₁，Ｃ₂，Ｃ₃が区別できない場合を考える。ここ
での属性をＴ₁，Ｔ₂とする。More specifically, consider a case in FIG. 12 in which categories C ₁ , C ₂ and C ₃ having three child nodes cannot be distinguished. The attributes here are T ₁ and T ₂ .

【００８３】ここでカテゴリーＣ_iが他の全てのカテゴ
リーと重なりのない部分によって新しくできたカテゴリ
ーをＣ_1*とする。例えば図１２の属性Ｔ₁のＣ_1*であ
り、この例のように属性の分布が分離する場合もある。
任意のカテゴリーＣ_iと他の任意の一つのＣ_jカテゴリー
が重なる部分によって、新しくできたカテゴリーをＣ_ij
とする。以下任意のカテゴリーＣ_iと他の任意の２つの
カテゴリーが重なる部分によって、新しくできたカテゴ
リーを同様に定義する。図１２の属性Ｔ₁におけるＣ_2*
ならびにＣ_3*は、空集合のため新たなカテゴリーを作ら
ないとする。このとき新たに作られたカテゴリーは、す
べての任意の２つの組み合わせにおいて状態（１）を満
たす。属性Ｔ_kは上記の方法によりカテゴリーの分割を
行うことができる。Here, a category newly created by a portion where the category C _i does not overlap with all other categories is defined as C _{1 *} . For example, C _{1 *} attributes T ₁ of the FIG. 12, there is a case where the distribution of the attribute as in this example are separated.
The newly created category is represented by C _ij by the portion where any category C _i overlaps with any one other C _j category.
And Hereinafter, a newly created category is similarly defined by a portion where an arbitrary category C _i and another arbitrary two categories overlap. C _{2 *} in attribute T ₁ in FIG.
And C _{3 *} does not create a new category due to the empty set. At this time, the newly created category satisfies state (1) in all arbitrary two combinations. The attribute T _k can be divided into categories by the above method.

【００８４】上記方法で、ノードのカテゴリーの分割を
行う。この図１１の＊印では通常２つのカテゴリーを分
類できない場合が多いが、＊１のように四つのカテゴリ
ーＣ_N，Ｃ_Da，Ｃ_Db，Ｃ_Dcが認識できない場合がある。With the above method, the category of the node is divided. Usually, two categories cannot be classified by the mark * in FIG. 11, but there are cases where the four categories C _N , C _Da , C _Db , and C _Dc cannot be recognized as shown by * 1.

【００８５】＊１について考えると、属性Ｔ_kにおいて
任意のカテゴリーＣ_iは他の全てのカテゴリーと重なり
のない部分のカテゴリー、この場合Ｃ_N*，Ｃ_Da*，
Ｃ_Db*，Ｃ_Dc*を作ることができるが、Ｃ_N*はすべての属
性に対して空集合のため、カテゴリーＣ_N*を作ることは
できない。以下同様に任意のカテゴリーＣ_iと他の任意
の一つのカテゴリーが重なる部分のカテゴリー、Ｃ_iと
他の任意の２つのカテゴリーが重なる部分のカテゴリ
ー、Ｃ_iと他の任意の三つのカテゴリーが重なる部分の
カテゴリーにより、新たな八個のカテゴリーＣ_Da*，Ｃ
_Db*，Ｃ_Dc*，Ｃ_DaDb*，Ｃ_DbDc*，Ｃ_DbDc*，Ｃ_DaDbDc*，
Ｃ_NDaDbDc*を作ることができる。そのカテゴリーの属性
値の分布を表６に示す。Considering * 1, an arbitrary category C _i in the attribute T _k is a category of a portion that does not overlap with all other categories, in this case, C _{N *} , C _{Da *} ,
Although C _{Db *} and C _{Dc *} can be created, category C _{N *} cannot be created because C _{N *} is an empty set for all attributes. Similarly, the category of a portion where any category C _i overlaps with any other one category, the category of a portion where C _i overlaps with any other two categories, and the category where C _i overlaps with any three other categories Depending on the category of the part, eight new categories C _{Da *} , C
_{Db *} , C _{Dc *} , C _{DaDb *} , C _{DbDc *} , C _{DbDc *} , C _{DaDbDc *} ,
C _{NDaDbDc *} can be made. Table 6 shows the distribution of the attribute values of the category.

【００８６】[0086]

【表６】 [Table 6]

【００８７】６）分離した属性を持たないカテゴリー分
類属性値の分布が完全に分離していないカテゴリーに対し
てカテゴリーの分割により、新たなカテゴリーを生成す
る。どの属性を使ってそれらの分類を行うかを考えるた
め、属性の確率分布を考える。6) Category Classification without Separated Attributes A new category is generated by dividing a category into categories whose attribute value distributions are not completely separated. To consider which attributes are used to classify them, consider the probability distribution of the attributes.

【００８８】６−１）属性の確率分布（図６のフローチ
ャートの９に相当する）属性値はいくつかのパラメータを用いて計算される。例
えば断線事故の電流値の場合、事故前の電流と事故点が
パラメータである。それらのパラメータの確率分布が判
れば属性の確率分布を知ることができ、ある任意の２つ
のカテゴリーにおいて、属性の分布に重なりのある場
合、それぞれのカテゴリーの重なりのある部分の確率と
重なりのない部分の確率を知ることができる。6-1) Attribute probability distribution (corresponding to 9 in the flowchart of FIG. 6) Attribute values are calculated using several parameters. For example, in the case of a current value of a disconnection accident, the current before the accident and the accident point are parameters. If the probability distribution of those parameters is known, the probability distribution of the attribute can be known. If the distribution of the attribute overlaps in any two categories, the probability of the overlapping part of each category does not overlap. You can know the probability of the part.

【００８９】ここでは離散的な、２次元の確率分布を例
にとって考え方を説明する。二つの確率変数Ｘ，Ｙは互
いに独立で、それぞれ任意の値ｘ_iとｙ_iにおいて、それ
ぞれ確率ｐ_iならびにｑ_jとすると、Here, the concept will be described using a discrete two-dimensional probability distribution as an example. The two random variables X and Y are independent of each other, and given arbitrary values x _i and y _i , respectively, the probabilities p _i and q _j are given by

【００９０】Ｐ（Ｘ＝ｘ_i）＝ｐ_i （２５）Ｐ（Ｙ＝ｙ_j）＝ｑ_j （２６）とおくことができ、任意のｉ，ｊに対してＰ（Ｘ＝ｘ_i，Ｙ＝ｙ_j）＝Ｐ（Ｘ＝ｘ_i）Ｐ（Ｙ＝ｙ_j）（２７）すなわちｐ_ij＝ｐ_iｑ_j （２８）が成り立つ。P (X = x _i ) = p _i (25) P (Y = y _j ) = q _j (26), and P (X = x _i , _{Y = y j) = P (} X = x i) P (Y = y j) (27) i.e. _{_{_{p ij = p i q j (}}} 28) holds.

【００９１】いま、属性Ｔ_kが変数Ｘ，Ｙの関数ｈとし
て表すことができるとすると、（２９）式のように表す
ことができ、Ｔ_k＝ｈ（Ｘ，Ｙ）（２９）Now, assuming that the attribute T _k can be expressed as a function h of the variables X and Y, it can be expressed as in equation (29), and T _k = h (X, Y) (29)

【００９２】したがって例えば断線事故の電流値の場
合、二つのパラメータの事故前の電流値ｘ_iと事故点ｙ_j
とその確率ｐ_iとｐ_jからｈ（Ｘ，Ｙ）とｐ_ijの総和を求
めることにより、属性Ｔ_kの確率分布を求めることがで
きる。したがって属性Ｔ_kの確率分布をＺとし、ｐ_ijの
総和の確率をｚ_kとすると、属性Ｔ_kの確率分布はＰ（Ｚ＝ｚ_k）（３０）とおくことができ、任意の属性Ｔ_kの属性分布において
ａ≦Ｚ≦ｂの範囲の確率は、Ｐ（ａ≦Ｚ≦ｂ）＝Σ＊ｐ_r （３１）ただし、Σ^*ｐ_rはａ≦Ｚ≦ｂである確率の総和を表す。
したがって任意の属性Ｔ_kのＺでの確率ならびに属性Ｔ_k
の属性分布においてａ≦Ｚ≦ｂの範囲の確率を求めるこ
とができる。[0092] Thus for example, in the case of the current value of accidental disconnection, before the accident two parameters current value x _i and the fault point y _j
By _calculating the sum of h (X, Y) and p _ij from the probability and the probabilities p _i and p _j , the probability distribution of the attribute T _k can be obtained. Therefore, _{assuming that} the probability distribution of the attribute T _k is Z and the probability of the sum of p _ij is z _k , the probability distribution of the attribute T _k can be expressed as P (Z = z _k ) (30). _In the attribute distribution of _k , the probability in the range of a ≦ Z ≦ b is as follows: P (a ≦ Z ≦ b) = Σ * _pr (31) where Σ ^* _pr is the sum of the probability that a ≦ Z ≦ b. Represent.
Therefore, the probability of any attribute T _k at Z and the attribute T _k
Can be obtained in the attribute distribution of a ≦ Z ≦ b.

【００９３】６−２）属性の選択前節で求めた確率分布を用い、子ノードの分類に効果的
な属性の選択を行う。属性値分布において、他の分布と
重なりのない部分の確率の高い属性値は分類のための貢
献度が高くなる。そこで、ある属性Ｔ_kにおけるカテゴ
リーＣ_iのＣ_jに対して全く重なっていない領域の属性値
の確率分布を求め、その確率をΣ^*ｐ_r（ｉ，ｊ）とす
る。実際には図４（１）における斜線部以外の確率であ
る。これはＴ_kがＣ_iの分類に対してどの程度Ｃ_jの影響
があるかを示すものである。確率Σ^*ｐ_r（ｉ，ｊ）とカ
テゴリーの出現頻度Ｐ_iを使い、次の評価関数を定め
る。6-2) Attribute Selection Using the probability distribution obtained in the previous section, an attribute is selected which is effective for classifying child nodes. In the attribute value distribution, an attribute value having a high probability of a portion that does not overlap with other distributions has a high contribution for classification. Therefore, a probability distribution of attribute values of a region of a certain attribute T _k that does not overlap C _j of the category C _i at all is obtained, and the probability is defined as Σ ^* _pr (i, j). Actually, it is the probability other than the shaded portion in FIG. This shows how T _k is the effect of extent C _j with respect to the classification of C _i. Probability ^{_{Σ * p r (i, j}} ) and use the frequency of occurrence P _i of the category, define the following evaluation function.

【００９４】[0094]

【数２】 (Equation 2)

【００９５】ここで（１７）式で定めたａ_k（ｉ，ｊ）
は、属性Ｔ_kの確率分布が一様に分布している場合のΣ^*
ｐ_r（ｉ，ｊ）と一致している。このＦ^*（Ｔ_k）が大き
い属性Ｔ_kを用いて、全ての属性に対し属性値の分布が
完全に分離していない子ノードの分類は８）で述べたカ
テゴリーの分割によって新しくできたカテゴリーにより
分類を行う。Here, a _k (i, j) determined by equation (17)
Is Σ ^* when the probability distribution of the attribute T _k is uniformly distributed
p _r (i, j). Using the attribute T _k having a large F ^* (T _k ), the classification of the child node in which the distribution of the attribute values is not completely separated for all the attributes is a category newly formed by the category division described in 8). Classify according to

【００９６】６−３）属性の確率分布の実施例ここでは１線断線時の各相の電流値を例とする。その電
流値は三相負荷かつ力率１００％の場合、事故点におけ
る事故時の電流は断線相で０Ａ、他の２相の電流は事故
前の電流の半分となる。また事故点以前の負荷は変動し
ない。例えば図７のｃ配電線のちょうど中央で断線した
と仮定し、正常時の電源端の電流を２００Ａとした場
合、負荷は均等負荷のため、電流端での断線相の電流
は、ｃ配電線の中央までの負荷の１００Ａが流れること
となる。他の２相の電流はｃの中央までの負荷の１００
Ａと断線点以降の５０Ａの計１５０Ａが流れることにな
る。6-3) Embodiment of attribute probability distribution Here, the current value of each phase when one line is broken is taken as an example. When the current value is a three-phase load and the power factor is 100%, the current at the time of the fault at the fault point is 0 A in the disconnection phase and the currents of the other two phases are half the current before the fault. The load before the accident point does not fluctuate. For example, assuming that the disconnection was made exactly at the center of the c-distribution line in FIG. 7 and the current at the power supply end in a normal state was 200 A, the load was an equal load. 100A of the load to the center of the current flows. The other two phase currents are 100% of the load to the center of c.
A and a total of 150A of 50A after the disconnection point flow.

【００９７】事故前の負荷予測の電流値の確率分布は正
規分布が得られ、また事故は配電線上に一様に起きると
考えられることより、事故点の確率分布は一様分布と考
えてよい。したがって、断線時の各相の電流値の確率分
布は（３０）式により上記の二つの確率変数の２次元確
率分布から算出することができる。Since the probability distribution of the current value in the load prediction before the accident is a normal distribution, and the accident is considered to occur uniformly on the distribution line, the probability distribution of the accident point may be considered to be a uniform distribution. . Therefore, the probability distribution of the current value of each phase at the time of disconnection can be calculated from the two-dimensional probability distribution of the above two random variables by the equation (30).

【００９８】ここでは確率を連続値と考え、図１３にａ
相断線時のａ，ｂ，ｃ相の電流の確率密度関数とその正
常時の確率密度関数を示す。カテゴリーＣ_Daの属性Ｔ_Ia
の属性分布がカテゴリーＣ_Nの属性Ｔ_Iaの属性分布と重
なり合わない部分の確率は図１３のグラフで０〜１７６
Ａまでの確率密度関数の面積から算出でき、その確率は
０．８８で、同様に重なりのある部分の確率は０．１２
と求めることができる。属性Ｔ_Iaの属性値が１８０Ａと
得られ、カテゴリーＣ_DaもしくはカテゴリーＣ_Nの分類
が他の属性ではできなかった場合の確率は図１３の確率
密度関数より、Ｃ_Daは０．００５、Ｃ_Nは０．００１６
と得られる。しかしながら、表４よりカテゴリーの出現
頻度Ｐ_iまで考慮すると、断線事故の確率は小さいた
め、Ｃ_Daは１．０５×１０^-４、Ｃ_Nは１．５８×１０^-4
であり、正常である確率が高い。Here, the probability is considered as a continuous value, and FIG.
The probability density functions of the currents of the a, b, and c phases at the time of phase disconnection and the probability density functions of the normal times are shown. Attribute T _{Ia of} category C _Da
The probability of the portion where the attribute distribution does not overlap with the attribute distribution of the attribute T _Ia of the category C _N is 0 to 176 in the graph of FIG.
A can be calculated from the area of the probability density function up to A, the probability is 0.88, and the probability of the overlapping part is 0.12.
Can be requested. Attribute value of the attribute T _Ia is obtained and 180A, the probability when the classification categories C _Da or category C _N is not possible with other attributes than the probability density function of FIG. 13, C _Da is 0.005, C _N Is 0.0016
Is obtained. However, considering the category occurrence frequency P _i from Table 4, the probability of a disconnection accident is small, so C _Da is 1.05 × 10 ⁻⁴ and C _N is 1.58 × 10 ^−4.
And the probability of normality is high.

【００９９】また、属性Ｔ_Iaの属性値が１５０Ａと得ら
れた場合、図１３の確率密度関数より、確率は０．００
５、表４のカテゴリーの出現頻度Ｐ_iまで考慮すると、
Ｃ_Daである確率は１．０５×１０^-4と求めることができ
る。When the attribute value of the attribute T _Ia is obtained as 150 A, the probability becomes 0.00 from the probability density function shown in FIG.
5. Considering the appearance frequency P _i of the category in Table 4,
The probability of being C _Da can be obtained as 1.05 × 10 ⁻⁴ .

【０１００】従来の断線検出の一番初歩的な方法は上述
の通り、電源端での電流値検出である。しかしながら、
この方法の最大の欠点は、末端近くで、断線事故が発生
した場合、断線事故が発生したのか、ただ負荷が減少し
たのかわからない点にある。したがって配電線の末端で
センサー情報すなわち属性値を使って断線事故を検出す
る方法もあるが、変電所までの通信線を設置しなければ
ならず、コストがかかる。電源端検出で断線事故を末端
まで完全に検出するのは原理的に不可能であることがわ
かっている。As described above, the most rudimentary method of the conventional disconnection detection is the current value detection at the power supply end. However,
The biggest disadvantage of this method is that if a disconnection accident occurs near the end, it is not known whether the disconnection accident has occurred or the load has simply decreased. Therefore, there is a method of detecting a disconnection accident using sensor information, that is, an attribute value at the end of a distribution line, but a communication line to a substation must be installed, which increases costs. It has been found in principle that it is impossible in principle to completely detect a disconnection accident to the end by detecting the power supply end.

【０１０１】したがって、断線事故か正常かより高い確
度で分類するには、負荷予測の精度を高める必要があ
る。例えば一時間前の負荷予測は、今回データとして用
いた５時間前の負荷予測よりも、予測の相対誤差が小さ
くなると考えられる。すなわち図１３の正規分布の確率
分布が、２００Ａの近くに集まることにより、Ｃ_NのＴ
_Ia、Ｔ_Ib、Ｔ_Icのとる属性分布の範囲が狭くなる。した
がって断線事故を検出する確率を高くすることができ
る。それにより、短時間負荷予測により、負荷予測の精
度を高め、時間によって変わる負荷変動を考慮にいれ、
その時々の診断アルゴリズムを識別木学習で作ることに
より、従来検出できなかったより末端に近い断線事故も
検出できるようになる。Therefore, in order to classify a disconnection accident or normal with higher accuracy, it is necessary to increase the accuracy of load prediction. For example, it is considered that the load prediction one hour ago has a smaller relative error in the prediction than the load prediction five hours ago used as the current data. That probability distribution of the normal distribution of FIG. 13, by gathering near the 200A, the C _N T
The range of attribute distributions taken by _Ia , T _Ib , and T _Ic becomes narrow. Therefore, the probability of detecting a disconnection accident can be increased. As a result, short-time load prediction improves the accuracy of load prediction and takes into account load fluctuations that change with time.
By making the diagnosis algorithm at that time by discriminating tree learning, it becomes possible to detect a disconnection accident closer to the end than was previously impossible.

【０１０２】６−４）評価関数Ｆ^*（Ｔ_k）の算出ならび
に子ノードの分類ここでは四つのカテゴリーＣ_N，Ｃ_Da，Ｃ_Db，Ｃ_Dcでの
評価関数Ｆ^*（Ｔ_k）を表４の出現頻度Ｐ_iと、（３１）
式より求めるΣ^*ｐ_r（ｉ，ｊ）により、式（３２）で評
価関数Ｆ^*（Ｔ_k）の算出を行う。Σ^*ｐ_r（ｉ，ｊ）は前
節で求めたように例えば属性Ｔ_IaのＣ_DaのＣ_Nに対して
全く重なっていない領域の確率は０．８８と得らる。そ
の結果を表７に示す。この場合、Ｆ^*（Ｔ_Va）＝Ｆ^*（Ｔ
_Vb）＝Ｆ^*（Ｔ_Vc）であるので、属性Ｔ_Vaにより分類す
る。子ノードとしてカテゴリーＣ_Da*，Ｃ_Db*，
Ｃ_DbDc*，Ｃ_NDaDbDc*を分類することができる。その結
果を図１１に示す。[0102] 6-4) Table four categories _{_{_{C N, C Da, C Db}}} , * the evaluation function F at C _Dc to (T _k), where classification calculation Narabiniko node evaluation function F ^* (T _k) 4, the appearance frequency P _i and (31)
The evaluation function F ^* (T _k ) is calculated by Expression (32) using Σ ^* _pr (i, j) obtained from the expression. As for Σ ^* _pr (i, j), for example, the probability of a region not overlapping at all with C _N of C _Da of the attribute T _Ia is obtained as 0.88 as obtained in the previous section. Table 7 shows the results. In this case, F ^* (T _Va ) = F ^* (T
_{Since Vb} ) = F ^* (T _Vc ), classification is based on the attribute T _Va . Categories C _{Da *} , C _{Db *} ,
C _{DbDc *} and C _{NDaDbDc *} can be classified. The result is shown in FIG.

【０１０３】[0103]

【表７】 [Table 7]

【０１０４】図１４〜図１９は、前述した実施例におい
て用いた数値を使用して具体的に事故診断を行うための
データの分類を行うフローチャートを示している。FIGS. 14 to 19 show flowcharts for classifying data for concretely diagnosing an accident using the numerical values used in the above-described embodiment.

【０１０５】以上、第１実施例について説明した。この
第１実施例は、最も効率的な識別木およびフローチャー
トの作成方法であるが、効率をある程度犠牲にしても同
様な分類を行うことができる。その例を以下に示す。The first embodiment has been described above. Although the first embodiment is the most efficient method for creating an identification tree and a flowchart, similar classification can be performed even if efficiency is sacrificed to some extent. An example is shown below.

【０１０６】（Ｂ）第２実施例本実施例では少なくとも一つの属性が状態（ｉ）である
カテゴリーの組合せにおいてカテゴリーＣ_iと他のすべ
てのカテゴリーを分類する属性の組を求め、さらにはす
べてのカテゴリーを分類する属性の組を求めるという２
段階のステップを行っていたが本実施例では上記ステッ
プを一度に行うところにある。第１実施例の２−１）ま
では同じなので説明を省略する。(B) Second Embodiment In this embodiment, in a combination of categories in which at least one attribute is state (i), a set of attributes for classifying category C _i and all other categories is obtained. To find a set of attributes that classify categories
Although the steps of the steps are performed, in the present embodiment, the above steps are performed at once. The description up to 2-1) of the first embodiment is omitted because it is the same.

【０１０７】２−２’）属性Ｔ_kが状態（ｉ）の状態で
あるカテゴリーすべてを分類可能な属性の選択を行う。2-2 ') Select an attribute capable of classifying all categories in which the attribute T _k is in the state (i).

【０１０８】少なくとも一つ以上の属性値の分布が完全
に分離しているカテゴリーの組み合わせを識別可能な属
性集合の組は、ｆ（Ｃ_i，Ｃ_j）＝１となるすべての組み
合わせに対してｆ（Ｃ_i，Ｃ_j）（ｉ＝１，・・・，ｎ、
ｊ＝１，・・・，ｍ、ｉ≠ｊ）の論理積をとることによ
り求めることができ、式（３３）で求めることができ
る。A set of attribute sets that can identify a combination of categories in which the distribution of at least one or more attribute values is completely separated corresponds to all combinations where f (C _i , C _j ) = 1. f (C _i , C _j ) (i = 1,..., n,
j = 1,..., m, i ≠ j), and can be obtained by equation (33).

【０１０９】Ｅ＝ｆ（Ｃ₁，Ｃ₁）・・ｆ（Ｃ₁，Ｃ_j）・・ｆ（Ｃ₁，Ｃ_m）・・・・・・ｆ（Ｃ_i，Ｃ₁）・・ｆ（Ｃ_i，Ｃ_j）・・ｆ（Ｃ_i，Ｃ_m）・・・・・・ｆ（Ｃ_n，Ｃ₁）・・ｆ（Ｃ_n，Ｃ_j）・・ｆ（Ｃ_n，Ｃ_m）但し、ｉ≠ｊ（３３）E = f (C ₁ , C ₁ ) ·· f (C ₁ , C _j ) ··· f (C ₁ , C _m ) ······· f (C _i , C ₁ ) ·· f (C _i , C _j ) f (C _i , C _m ) f (C _n , C ₁ ) f (C _n , C _j ) f (C _n , C _m) ) Where i ≠ j (33)

【０１１０】この演算結果は、論理式の積和形となり次
のように表せる。Ｅ＝Ｔ ₁ ・Ｔ ₂ ・Ｔ ₃ ・・・＋・・・＋Ｔ _a ・Ｔ _b ・Ｔ _c ・・・＋・・・＋Ｔ _p ・Ｔ _q ・Ｔ _r ・・・ここでＡ ₁ ＝Ｔ ₁ ・Ｔ ₂ ・Ｔ ₃ ・・・、Ａ _x ＝Ｔ _a Ｔ _b Ｔ _c ・・・、Ａ _p ＝Ｔ _p ・Ｔ _q ・Ｔ _r ・・・とすると、Ｅ＝Ａ₁＋・・・＋Ａ_x＋・・・＋Ａ_p （３４）となる。 The result of the operation is a product-sum form of a logical expression and can be expressed as follows. E = T ₁ · T ₂ · T ₃ ··· + + T _a · T _b · T _c ··· + + T _p · T _q · T _r ··· where A ₁ = T ₁ ··· T ₂ · T ₃ ···, A _x = T _a T _b T _c ···, A _p = T _p · T _q · T _r ··· E = A ₁ +... + A _x + ... + A _p (34) .

【０１１１】したがって、Ａ₁，・・・，Ａ_x，・・・，
Ａ_pは、少なくとも一つ以上の属性Ｔ_kの属性値の分布が
完全に分離しているカテゴリーの組み合わせを分類可能
な属性集合である。Therefore, A ₁ ,..., A _x ,.
A _p is the attribute set that can classify the combination of category distribution of the attribute values of the at least one attribute T _k are completely separated.

【０１１２】式（３３）によって少なくとも一つ以上の
属性値の分布が完全に分離しているカテゴリーの組み合
わせすべてを分類可能な属性集合の組が選択できる。According to equation (33), a set of attribute sets that can classify all combinations of categories in which the distribution of at least one or more attribute values is completely separated can be selected.

【０１１３】Ｅ＝Ｔ_IaＴ_IbＴ_VaＴ_VbＴ_Vc＋Ｔ_IaＴ_IcＴ_VaＴ_VbＴ_Vc＋Ｔ_IbＴ_IcＴ_VaＴ_VbＴ_Vc （３５）となる。これを次のように置き換える。Ａ₁＝Ｔ_IaＴ_IbＴ_VaＴ_VbＴ_Vc，Ａ₂＝Ｔ_IaＴ_IcＴ_VaＴ_VbＴ_Vc Ａ₃＝Ｔ_IbＴ_IcＴ_VaＴ_VbＴ_Vc （３６）E = T _Ia T _Ib T _Va T _Vb T _Vc + T _Ia T _Ic T _Va T _Vb T _Vc + T _Ib T _Ic T _Va T _Vb T _Vc (35) Replace this with: A ₁ = T _Ia T _Ib T _Va T _Vb T _Vc , A ₂ = T _Ia T _Ic T _Va T _Vb T _Vc A ₃ = T _Ib T _Ic T _Va T _Vb T _Vc (36)

【０１１４】つまり、これらの３組は、それぞれ独立し
て属性Ｔ_kが状態（ｉ）である状態のカテゴリーを分類
可能とする属性の組である。That is, these three sets are sets of attributes that allow the category of the state in which the attribute T _k is state (i) to be classified independently.

【０１１５】今までの手続きを考察してみると、求めら
れた３組の属性の組Ａ₁，Ａ₂，Ａ₃の属性を使うことに
より、図８の実線で結んだカテゴリー同士を分類でき
る。したがってＡ₁，Ａ₂，Ａ₃の３組のそれぞれの属性
は実線で結ばれたカテゴリーは分類できるが、破線で結
ばれたカテゴリーは分類は完全にはできない。Considering the procedure so far, the categories connected by the solid line in FIG. 8 can be classified by using the attributes of the three sets of attributes A ₁ , A ₂ , and A ₃ obtained. . Therefore, for each of the three sets of attributes A ₁ , A ₂ , and A ₃ , the category connected by the solid line can be classified, but the category connected by the broken line cannot be completely classified.

【０１１６】以下は、第１実施例における３）以降と同
様であるので、説明を省略する。The following is the same as 3) and subsequent steps in the first embodiment, and a description thereof will not be repeated.

【０１１７】（Ｃ）第３実施例本実施例は、少なくとも一つの属性が状態（ｉ）のカテ
ゴリーの組み合わせにおいて、分類する属性の組を求
め、求めた組の中から、任意に１組を選び、さらに選択
した属性の組をノードに配置する際にも、任意の属性を
配置するものである。これは、効率的な属性の組を選択
する点とさらにはその属性の組をノードに配置する際に
は効率的な属性から配置するということが考慮されず、
任意に選択配置する点が第１実施例と異なる。( C ) Third Embodiment In this embodiment, in a combination of categories in which at least one attribute is state (i), a set of attributes to be classified is obtained, and one set is arbitrarily selected from the obtained sets. When arranging a selected and further set of selected attributes in a node, an arbitrary attribute is arranged. This is because it does not take into account the point of selecting an efficient attribute set, and furthermore, when arranging the attribute set in the node, from the efficient attribute.
It differs from the first embodiment in that it is arbitrarily selected and arranged.

【０１１８】第１実施例の２−３）までは同じなので説
明を省略する。また２−３）項のあとに以下の事項を加
える。Since the steps up to 2-3) of the first embodiment are the same, the description will be omitted. The following items are added after section 2-3).

【０１１９】（１４）式のＡ₁、Ａ₂、Ａ₃の任意の一組
を選択する。ここではＡ₃を選択するものとする。Ａ₃の
属性の組の中で任意の属性を根ノードと考える。ここで
はＴ_Vaとする。An arbitrary set of A ₁ , A ₂ and A _{3 in} the equation (14) is selected. Here, it is assumed to select the A _3. Any attribute in the set of attributes of A ₃ considered the root node. Here, _let it be T _Va .

【０１２０】属性の重なりの状態により、属性の分布に
重なりのない領域、属性の分布に重なる領域に分かれ
る。Depending on the state of the attribute overlap, the area is divided into an area where the attribute distribution does not overlap and an area where the attribute distribution overlaps.

【０１２１】属性がこれらの重なりのない領域の値にな
った場合には、根ノードで分類が完了する。重なりのあ
る領域はカテゴリー間の分類が不可能であり、他の属性
で再度分類する。すなわち、前者は葉ノードＮ_eとし、
後者は再分類ノードＮ_cとする。Ｎ_cにおける集合Ｎ_c’
は例えば図１０に示した領域１に関しては、（Ｃ_ca，Ｃ
_ab，Ｃ_a）となる。When the attribute becomes the value of these non-overlapping areas, the classification is completed at the root node. Overlapping regions cannot be classified between categories, and are re-classified with other attributes. That is, the former is the leaf node N _e,
The latter is a re-classification node _Nc . Set in N _{_c} N _c '
For example, regarding the area 1 shown in FIG. 10, (C _ca , C
_ab , C _a ).

【０１２２】次に再分類ノードに配置する属性は次のよ
うに選択する。領域１を例にとればＳ_c’の要素の２つ
ずつのカテゴリーをそれぞれ分類可能とする。属性は次
式のようになる。但し、ｆ（Ｃ_i，Ｃ_j）＝ｆ（Ｃ_j，
Ｃ_i）である。Next, the attribute to be arranged in the re-classification node is selected as follows. Taking region 1 as an example, it is possible to classify two categories of elements of S _c ′. The attributes are as follows: Here, f (C _i , C _j ) = f (C _j ,
C _i ).

【０１２３】ｆ（Ｃ_ca，Ｃ_ab）＝Ｔ_Ib＋Ｔ_Ic＋Ｔ_Vb＋Ｔ_Vc （２１）ｆ（Ｃ_ab，Ｃ_a）＝Ｔ_V0＋Ｔ_I0＋Ｔ_Ia＋Ｔ_Ib＋Ｔ_Vb＋Ｔ_Vc （２２）ｆ（Ｃ_a，Ｃ_ca）＝Ｔ_V0＋Ｔ_I0＋Ｔ_Ia＋Ｔ_Ic＋Ｔ_Vc （２３）F (C _ca , C _ab ) = T _Ib + T _Ic + T _Vb + T _Vc (21) f (C _ab , C _a ) = T _V0 + T _I0 + T _Ia + T _Ib + T _Vb + T _Vc (22) f (C _a , C _ca ) = T _V0 + T _I0 + T _Ia + T _Ic + T _Vc (23)

【０１２４】Ｓ_c’の全要素を分類可能とする属性は、
これらの論理積により次式のように求まる。An attribute that allows all elements of S _c ′ to be classified is
The logical product of these results in the following equation.

【０１２５】ｆ（Ｃ_ca，Ｃ_ab）ｆ（Ｃ_ab，Ｃ_a）ｆ（Ｃ_a，Ｃ_ca）＝Ｔ_V0Ｔ_Ib＋Ｔ_V0Ｔ_Ic＋Ｔ_I0Ｔ_Ib＋Ｔ_I0Ｔ_Ic＋Ｔ_IaＴ_Ib＋Ｔ_IaＴ_Ic ＋Ｔ_IbＴ_Ic＋Ｔ_Vc＋Ｔ_V0Ｔ_Vb＋Ｔ_I0Ｔ_Vb＋Ｔ_IaＴ_Vb＋Ｔ_IcＴ_Vb （２４）F (C _ca , C _ab ) f (C _ab , C _a ) f (C _a , C _ca ) = T _V0 T _Ib + T _V0 T _Ic + T _I0 T _Ib + T _I0 T _Ic + T _Ia T _Ib + T _Ia T _Ic + T _Ib T _Ic + T _Vc + T _V0 T _Vb + T _I0 T _Vb + T _Ia T _Vb + T _Ic T _Vb (24)

【０１２６】この結果、Ａ₃の部分集合となっている属
性はＴ_Vc，Ｔ_IbＴ_Ic，Ｔ_IcＴ_Vbである。この３つの属性
の組について、ここでは任意のＴ_Vcを配置する。[0126] As a result, the attribute that is a subset of A ₃ is a _{_{_{T Vc, T Ib T Ic,}}} T Ic T Vb. For this set of three attributes, an arbitrary _TVc is placed here.

【０１２７】以上のような操作をＡ₃の部分集合の属性
を使って行う。その結果の一部を図１１に示す。但し、
＊印をつけたノードは、これまでの手続きでは分離する
ことができない。[0127] or more of such an operation performed using the attributes of a subset of A _3. FIG. 11 shows a part of the result. However,
Nodes marked with * cannot be separated by the previous procedure.

【０１２８】以下、第１実施例の５）以降と同じなので
省略する。Hereinafter, since it is the same as 5) and subsequent steps of the first embodiment, the description is omitted.

【０１２９】（Ｄ）第４実施例本実施例では、少なくとも一つの属性が状態（ｉ）のカ
テゴリーの組み合わせの分類する属性の組を求め、求め
た組の中から最も効率的な属性の組を求め、さらにその
属性の組をノードに配置する際は任意に属性を選択して
ノードに配置するものである。したがって、第１，第２
実施例とは、属性の組を選択するところまでは効率を考
慮して同じであるが、その属性の組をノードに配置する
場合において、任意に選択することとしており、その
点、効率が考慮されていない点が相違する。( D ) Fourth Embodiment In this embodiment, a set of attributes in which at least one attribute is classified as a combination of categories of state (i) is obtained, and the most efficient attribute set is obtained from the obtained sets. Is obtained, and when the attribute set is arranged in the node, the attribute is arbitrarily selected and arranged in the node. Therefore, the first and second
The embodiment is the same in consideration of efficiency up to the point of selecting a set of attributes. However, when the set of attributes is arranged in a node, it is arbitrarily selected. The difference is that it is not done.

【０１３０】第１実施例の３）までは同じなので省略す
る。Up to 3) of the first embodiment, the description is omitted because it is the same.

【０１３１】４）識別木の各ノードへの属性（図６のフ
ローチャートの７に相当する）識別木の各ノードへの配置は次のようにする。まず根ノ
ードに関してはＡ_effのうち任意のＴ_Vaとする。属性の
重なりの状態により、属性の分布に重なりのない領域、
属性の分布に重なる領域に分かれる。4) Attributes to each node of the identification tree (corresponding to 7 in the flowchart of FIG. 6) The arrangement of the identification tree to each node is as follows. First, an arbitrary T _Va of A _eff is set for the root node. Depending on the state of attribute overlap, areas where attribute distribution does not overlap,
It is divided into areas that overlap the distribution of attributes.

【０１３２】属性がこれらの重なりのない領域の値にな
った場合には、根ノードで分類が完了する。重なりのあ
る領域はカテゴリー間の分類が不可能であり、他の属性
で再度分類する。すなわち、前者は葉ノードＮ_eとし、
後者は再分類ノードＮ_cとする。Ｎ_cにおける集合Ｎ_c’
は例えば図１０に示した領域１に関しては、（Ｃ_ca，Ｃ
_ab，Ｃ_a）となる。When the attribute becomes the value of these non-overlapping areas, the classification is completed at the root node. Overlapping regions cannot be classified between categories, and are re-classified with other attributes. That is, the former is the leaf node N _e,
The latter is a re-classification node _Nc . Set in N _{_c} N _c '
For example, regarding the area 1 shown in FIG. 10, (C _ca , C
_ab , C _a ).

【０１３３】次に再分類ノードに配置する属性は次のよ
うに選択する。領域１を例にとればＳ_c’の要素の２つ
ずつのカテゴリーをそれぞれ分類可能とする。属性は次
式のようになる。但し、ｆ（Ｃ_i，Ｃ_j）＝ｆ（Ｃ_j，
Ｃ_i）である。Next, the attribute to be arranged in the re-classification node is selected as follows. Taking region 1 as an example, it is possible to classify two categories of elements of S _c ′. The attributes are as follows: Here, f (C _i , C _j ) = f (C _j ,
C _i ).

【０１３４】ｆ（Ｃ_ca，Ｃ_ab）＝Ｔ_Ib＋Ｔ_Ic＋Ｔ_Vb＋Ｔ_Vc （２１）ｆ（Ｃ_ab，Ｃ_a）＝Ｔ_V0＋Ｔ_I0＋Ｔ_Ia＋Ｔ_Ib＋Ｔ_Vb＋Ｔ_Vc （２２）ｆ（Ｃ_a，Ｃ_ca）＝Ｔ_V0＋Ｔ_I0＋Ｔ_Ia＋Ｔ_Ic＋Ｔ_Vc （２３）F (C _ca , C _ab ) = T _Ib + T _Ic + T _Vb + T _Vc (21) f (C _ab , C _a ) = T _V0 + T _I0 + T _Ia + T _Ib + T _Vb + T _Vc (22) f (C _a , C _ca ) = T _V0 + T _I0 + T _Ia + T _Ic + T _Vc (23)

【０１３５】Ｓ_c’の全要素を分類可能とする属性は、
これらの論理積により次式のように求まる。An attribute that allows all elements of S _c ′ to be classified is
The logical product of these results in the following equation.

【０１３６】ｆ（Ｃ_ca，Ｃ_ab）ｆ（Ｃ_ab，Ｃ_a）ｆ（Ｃ_a，Ｃ_ca）＝Ｔ_V0Ｔ_Ib＋Ｔ_V0Ｔ_Ic＋Ｔ_I0Ｔ_Ib＋Ｔ_I0Ｔ_Ic＋Ｔ_IaＴ_Ib＋Ｔ_IaＴ_Ic ＋Ｔ_IbＴ_Ic＋Ｔ_Vc＋Ｔ_V0Ｔ_Vb＋Ｔ_I0Ｔ_Vb＋Ｔ_IaＴ_Vb＋Ｔ_IcＴ_Vb （２４）F (C _ca , C _ab ) f (C _ab , C _a ) f (C _a , C _ca ) = T _V0 T _Ib + T _V0 T _Ic + T _I0 T _Ib + T _I0 T _Ic + T _Ia T _Ib + T _Ia T _Ic + T _Ib T _Ic + T _Vc + T _V0 T _Vb + T _I0 T _Vb + T _Ia T _Vb + T _Ic T _Vb (24)

【０１３７】この結果、Ａ_effの部分集合となっている
属性はＴ_Vc，Ｔ_IbＴ_Ic，Ｔ_IcＴ_Vbである。この３つの属
性の組について、ここでは任意のＴ_Vcを配置する。As a result, the attributes that are a subset of A _eff are T _Vc , T _Ib T _Ic , and T _Ic T _Vb . For this set of three attributes, an arbitrary _TVc is placed here.

【０１３８】以上のような操作をＡ_effの部分集合の属
性を使って行う。その結果の一部を図１１に示す。但
し、＊印をつけたノードは、これまでの手続きでは分離
することができない。The above operation is performed using the attributes of the subset of A _eff . FIG. 11 shows a part of the result. However, nodes marked with * cannot be separated by the conventional procedures.

【０１３９】以下、第１実施例の５）以降と同じなので
省略する。Hereinafter, since it is the same as 5) and subsequent steps of the first embodiment, the description is omitted.

【０１４０】（Ｅ）第５実施例本実施例では、少なくとも一つの属性が状態（ｉ）のカ
テゴリーの組み合わせの分類する属性の組を求め、求め
た組のすべての組のそれぞれの属性に対し、評価関数に
基づいて評価し、上記で求めた互いに識別可能な属性の
組の中で任意の組を選択し、その選択した属性の組をノ
ードに配置する際は、識別が効率的になるように効率的
な属性から優先して配置する。したがって本実施例では
効率的な属性の組は選択されていないが、選択された属
性の組をノードに配置する際においては効率的な属性を
優先して配置するようにしている。( E ) Fifth Embodiment In this embodiment, a set of attributes in which at least one attribute is a combination of the category of the state (i) is obtained, and the attribute of each of all the obtained sets is determined. When an arbitrary set is selected from the set of mutually identifiable attributes evaluated based on the evaluation function and determined above, and the selected set of attributes is arranged in the node, the identification becomes efficient. So that priority is given to efficient attributes. Therefore, in the present embodiment, an efficient attribute set is not selected, but when arranging the selected attribute set in the node, the efficient attribute is preferentially arranged.

【０１４１】第１実施例３）の（３）項中、定義２で進
めるところまでは同じである。（１４）式の組の中でＡ
₃を選択する。In the section (3) of the first embodiment 3), the description is the same up to the point of proceeding with the definition 2. A in the set of (14)
Select ₃ .

【０１４２】４）識別木の各ノードへの属性（図６のフ
ローチャートの７に相当する）識別木の各ノードへの配置は次のようにする。まず根ノ
ードに関してはＡ₃のうち評価値Ｆ（Ｔ_k）が最も大きい
方を根ノードに考える。ここではＦ（Ｔ_Va）＝Ｆ
（Ｔ_Vc）なのでＴ_Vaとする。属性の重なりの状態によ
り、属性の分布に重なりのない領域、属性の分布に重な
る領域に分かれる。4) Attributes to each node of the identification tree (corresponding to 7 in the flowchart of FIG. 6) The arrangement of the identification tree to each node is as follows. First, regarding the root node, the one having the largest evaluation value F (T _k ) of A ₃ is considered as the root node. Here, F (T _Va ) = F
(T _Vc ), so T _Va . Depending on the state of the attribute overlap, the area is divided into an area where the attribute distribution does not overlap and an area where the attribute distribution overlaps.

【０１４３】属性がこれらの重なりのない領域の値にな
った場合には、根ノードで分類が完了する。重なりのあ
る領域はカテゴリー間の分類が不可能であり、他の属性
で再度分類する。すなわち、前者は葉ノードＮ_eとし、
後者は再分類ノードＮ_cとする。Ｎ_cにおける集合Ｎ_c’
は例えば図１０に示した領域１に関しては、（Ｃ_ca，Ｃ
_ab，Ｃ_a）となる。When the attribute becomes the value of these non-overlapping areas, the classification is completed at the root node. Overlapping regions cannot be classified between categories, and are re-classified with other attributes. That is, the former is the leaf node N _e,
The latter is a re-classification node _Nc . Set in N _{_c} N _c '
For example, regarding the area 1 shown in FIG. 10, (C _ca , C
_ab , C _a ).

【０１４４】次に再分類ノードに配置する属性は次のよ
うに選択する。領域１を例にとればＳ_c’の要素の２つ
ずつのカテゴリーをそれぞれ分類可能とする。属性は次
式のようになる。但し、ｆ（Ｃ_i，Ｃ_j）＝ｆ（Ｃ_j，
Ｃ_i）である。Next, the attribute to be arranged in the re-classification node is selected as follows. Taking region 1 as an example, it is possible to classify two categories of elements of S _c ′. The attributes are as follows: Here, f (C _i , C _j ) = f (C _j ,
C _i ).

【０１４５】ｆ（Ｃ_ca，Ｃ_ab）＝Ｔ_Ib＋Ｔ_Ic＋Ｔ_Vb＋Ｔ_Vc （２１）ｆ（Ｃ_ab，Ｃ_a）＝Ｔ_V0＋Ｔ_I0＋Ｔ_Ia＋Ｔ_Ib＋Ｔ_Vb＋Ｔ_Vc （２２）ｆ（Ｃ_a，Ｃ_ca）＝Ｔ_V0＋Ｔ_I0＋Ｔ_Ia＋Ｔ_Ic＋Ｔ_Vc （２３）F (C _ca , C _ab ) = T _Ib + T _Ic + T _Vb + T _Vc (21) f (C _ab , C _a ) = T _V0 + T _I0 + T _Ia + T _Ib + T _Vb + T _Vc (22) f (C _a , C _ca ) = T _V0 + T _I0 + T _Ia + T _Ic + T _Vc (23)

【０１４６】Ｓ_c’の全要素を分類可能とする属性は、
これらの論理積により次式のように求まる。An attribute that allows all elements of S _c ′ to be classified is
The logical product of these results in the following equation.

【０１４７】ｆ（Ｃ_ca，Ｃ_ab）ｆ（Ｃ_ab，Ｃ_a）ｆ（Ｃ_a，Ｃ_ca）＝Ｔ_V0Ｔ_Ib＋Ｔ_V0Ｔ_Ic＋Ｔ_I0Ｔ_Ib＋Ｔ_I0Ｔ_Ic＋Ｔ_IaＴ_Ib＋Ｔ_IaＴ_Ic ＋Ｔ_IbＴ_Ic＋Ｔ_Vc＋Ｔ_V0Ｔ_Vb＋Ｔ_I0Ｔ_Vb＋Ｔ_IaＴ_Vb＋Ｔ_IcＴ_Vb （２４）F (C _ca , C _ab ) f (C _ab , C _a ) f (C _a , C _ca ) = T _V0 T _Ib + T _V0 T _Ic + T _I0 T _Ib + T _I0 T _Ic + T _Ia T _Ib + T _Ia T _Ic + T _Ib T _Ic + T _Vc + T _V0 T _Vb + T _I0 T _Vb + T _Ia T _Vb + T _Ic T _Vb (24)

【０１４８】この結果、Ａ₃の部分集合となっている属
性はＴ_Vc，Ｔ_IbＴ_Ic，Ｔ_IcＴ_Vbである。この３つの属性
の組について、それぞれの属性の評価値Ｆ（Ｔ_k）が最
も大きい方を根ノードと考える。ここではＴ_Vcの属性の
組が最大となるのでＴ_Vcを配置する。[0148] As a result, the attribute that is a subset of A ₃ is a _{_{_{T Vc, T Ib T Ic,}}} T Ic T Vb. Regarding a set of these three attributes, the one having the largest evaluation value F (T _k ) of each attribute is considered as a root node. Wherein the set of attributes of the T _Vc is to place the T _Vc since the maximum.

【０１４９】以上のような操作をＡ₃の部分集合の属性
を使って行う。その結果の一部を図１１に示す。但し、
＊印をつけたノードは、これまでの手続きでは分離する
ことができない。[0149] or more of such an operation performed using the attributes of a subset of A _3. FIG. 11 shows a part of the result. However,
Nodes marked with * cannot be separated by the previous procedure.

【０１５０】以下は第１実施例の５）以降と同じなので
省略する。The following is the same as 5) et seq. Of the first embodiment, and a description thereof will be omitted.

【０１５１】（Ｆ）第６実施例本実施例では、子ノードの分割を行う際には、最も効率
的な属性を選択せず、任意の属性を選択するものであ
る。したがって、本実施例は子ノードの分割を行う際、
効率的な属性を選択しない点が特徴である。( F ) Sixth Embodiment In this embodiment, when dividing a child node, an arbitrary attribute is selected without selecting the most efficient attribute. Therefore, in this embodiment, when dividing a child node,
The feature is that efficient attributes are not selected.

【０１５２】第１実施例の５）までは同じである。第１
実施例の６−１），６−２），６−３）を省き、６−
４）を以下のように変更する。Up to 5) of the first embodiment is the same. First
6-1), 6-2) and 6-3) of the embodiment are omitted, and 6-
4) is changed as follows.

【０１５３】６−４）子ノードの識別任意の属性Ｔ_Vaより識別する子ノードとしてカテゴリー
Ｃ_Da*，Ｃ_Db*，Ｃ_DbDc*，Ｃ_NDaDbDc*を識別することが
できる。その結果を図１１に示す。6-4) Identification of Child Nodes Categories C _{Da *} , C _{Db *} , C _{DbDc *} , and C _{NDaDbDc *} can be identified as child nodes to be identified from an arbitrary attribute T _Va . The result is shown in FIG.

【０１５４】図１４〜図１９は、前述した実施例におい
て用いた数値を使用して具体的に事故診断を行うための
データの分類を行うフローチャートを示している。FIGS. 14 to 19 show flowcharts for classifying data for concretely diagnosing an accident using the numerical values used in the above-described embodiment.

【０１５５】[0155]

【発明の効果】以上に述べたように、本発明によれば下
記の効果を奏する。As described above, the present invention has the following effects.

【０１５６】（１）任意の２つのカテゴリーにおいて、
属性値の分布が完全に分離していなくても、属性値の重
なりのない部分は分類することができる。(1) In any two categories,
Even if the distribution of the attribute values is not completely separated, a portion where the attribute values do not overlap can be classified.

【０１５７】（２）重なりのある部分、すなわちカテゴ
リーが識別できない部分についても、属性の確率分布を
求めることにより、重なりのある部分の確率を求め、カ
テゴリーの推定を行うことができる。(2) Even for an overlapping portion, that is, a portion in which a category cannot be identified, the probability of the overlapping portion can be obtained by obtaining the probability distribution of the attribute, and the category can be estimated.

【０１５８】（３）任意の属性値が得られた場合、出現
確率を求めることができ、その属性によってカテゴリー
が分離できなかった場合、どちらのカテゴリーに属する
かの確度を知ることができる。(3) When an arbitrary attribute value is obtained, an appearance probability can be obtained. When categories cannot be separated by the attribute, it is possible to know the certainty to which category the attribute belongs.

【０１５９】（４）識別木およびフローチャートより、
どんなカテゴリーが分類できないかがわかり、そのとき
の属性値の範囲を知ることができる。(4) From the identification tree and the flowchart,
You can see what categories cannot be classified, and you can know the range of attribute values at that time.

【０１６０】（５）データの属性値が分布をもつ場合、
診断、パターン認識、画像処理などいろいろな分類に適
用できる。(5) When the attribute values of data have a distribution,
It can be applied to various classifications such as diagnosis, pattern recognition, and image processing.

【０１６１】（６）シミュレータなどで属性値の分布を
求めている場合、シミュレータのパラメータを変えて
も、その変化に伴いデータの分類を機械学習により学習
させることにより、迅速に作成することができる。(6) When the distribution of attribute values is obtained by a simulator or the like, even if the parameters of the simulator are changed, the data can be quickly created by learning the classification of the data by machine learning in accordance with the change. .

【０１６２】（７）人間の主観が入らないアルゴリズム
を自動的に作成することができる。(7) It is possible to automatically create an algorithm that does not allow human subjectivity.

【０１６３】（８）効率のよいアルゴリズムを作成する
ことができる。(8) An efficient algorithm can be created.

【０１６４】（９）データに不要な属性を知ることがで
きる。(9) Attributes unnecessary for data can be known.

[Brief description of the drawings]

【図１】表１のデータに基づく識別木学習の結果を示
す説明図である。FIG. 1 is an explanatory diagram showing a result of discrimination tree learning based on data in Table 1.

【図２】属性値分布と任意の２つのカテゴリーの関係
を示す説明図である。FIG. 2 is an explanatory diagram showing a relationship between an attribute value distribution and two arbitrary categories.

【図３】一部に重なりがある場合のカテゴリーの分類
を示す説明図である。FIG. 3 is an explanatory diagram showing classification of a category in a case where there is a partial overlap.

【図４】属性に重なりのある部分の出現確率と任意の
値での出現確率を表す説明図である。FIG. 4 is an explanatory diagram showing an appearance probability of a part having an attribute overlap and an appearance probability at an arbitrary value.

【図５】本発明におけるアルゴリズムの作成の手順を
示す概念図である。FIG. 5 is a conceptual diagram showing a procedure of creating an algorithm according to the present invention.

【図６】本発明の全体的なフローチャートである。FIG. 6 is an overall flowchart of the present invention.

【図７】本発明実施例における配電線線路モデルの系
統図である。FIG. 7 is a system diagram of a distribution line model in an embodiment of the present invention.

【図８】カテゴリー間の分類可能，不可能の関係を示
す説明図である。FIG. 8 is an explanatory diagram showing a classifiable / impossible relationship between categories.

【図９】属性が重なるカテゴリーの分布の説明図であ
る。FIG. 9 is an explanatory diagram of a distribution of categories in which attributes overlap.

【図１０】各カテゴリーの属性値の分布の例を示す図
である。FIG. 10 is a diagram showing an example of a distribution of attribute values of each category.

【図１１】本発明実施例における識別木の説明図であ
る。FIG. 11 is an explanatory diagram of an identification tree in the embodiment of the present invention.

【図１２】本発明におけるカテゴリーの分割の例を示
す説明図である。FIG. 12 is an explanatory diagram showing an example of category division in the present invention.

【図１３】本発明における電流の確率分布の例を示す
グラフである。FIG. 13 is a graph showing an example of a probability distribution of a current in the present invention.

【図１４】本発明を事故診断に適用した例を示すフロ
ーチャートの（１）である。FIG. 14 is a flowchart (1) showing an example in which the present invention is applied to an accident diagnosis.

【図１５】本発明を事故診断に適用した例を示すフロ
ーチャートの（２）である。FIG. 15 is a flowchart (2) showing an example in which the present invention is applied to an accident diagnosis.

【図１６】本発明を事故診断に適用した例を示すフロ
ーチャートの（３）である。FIG. 16 is a flowchart (3) showing an example in which the present invention is applied to an accident diagnosis.

【図１７】本発明を事故診断に適用した例を示すフロ
ーチャートの（４）である。FIG. 17 is a flowchart (4) showing an example in which the present invention is applied to an accident diagnosis.

【図１８】本発明を事故診断に適用した例を示すフロ
ーチャートの（５）である。FIG. 18 is a flowchart (5) showing an example in which the present invention is applied to an accident diagnosis.

【図１９】本発明を事故診断に適用した例を示すフロ
ーチャートの（６）である。FIG. 19 is a flowchart (6) showing an example in which the present invention is applied to an accident diagnosis.

フロントページの続き (56)参考文献特許3019227（ＪＰ，Ｂ２) 情報処理学会第43回（平成３年後期) 全国大会７Ｄ−６「配電線事故診断システムにおける決定木の学習検討」戸上正人他Continued on the front page (56) References Patent No. 3019227 (JP, B2) Information Processing Society of Japan 43rd (late 1991) National Convention 7D-6 "Learning Study of Decision Tree in Distribution Line Accident Diagnosis System" Masato Togami et al.

Claims

(57) [Claims]

[Claim 1] (a) category C ₁ ~ to classify the data
C _{i to} C _m are set, and the attribute T of each category is set.
₁ through T _j through T _n aggregates the measurement data for each or calculated
And simulate the result with the corresponding category
And the upper and lower limits for each attribute
Stored in the storage device as an attribute value distribution table representing cloth
That the reference and the step, the attribute value table stored in (b) the storage device
Then, for each attribute, a certain category C _i and another category C _i
-Analysis of the state of overlap of the attribute value distribution with C _j
The state of attribute value distribution for at least one attribute is
Attributes of the category C _j from the distribution of the attribute values of category C _i
Whether the distribution of values can be completely identified (i) or category
The distribution of the attribute values of the distribution of category C _j of attribute values of C _i and one
Partial overlap (ii) or category C
state distribution of the attribute values of _j are included in the distribution of the attribute values of C _i
The process of determining which state (iii) it belongs to
And performing, 1 if (c) each attribute is in the state (i), other
In the case of, define a coefficient of 0 and logically define each attribute.
As variables, and the category C _i and other categories C _j
The set of attributes that can classify
The logical expression of the logical variable with the same attribute
Step of selecting a formula as a set of attributes
If, on the combination of; (d) Category C _i and C _j is the state (i)
, The category C _i and all other categories
A set of attribute sets that can be classified is obtained in step (c).
Performing the process of calculating the logical product of the set of logical expressions
If, on the combination of (e) the category C _i and C _j is the state (i)
To make all categories classifiable with each other.
A set of attribute sets for the logical expression obtained in step (d)
(F) performing the process of obtaining the identification tree from the set of the attribute sets;
In order to select a set of rate attributes, heavy ne of the distribution of the attribute values
Evaluation function based on the state of appearance and the appearance frequency of category C _i
Process to evaluate more and select the most efficient attribute set
(G) setting the attribute set selected in the step (f).
Of the attributes that have the highest rating
-And placed as parent node, and (g-1) parent node
The distribution of attribute values of attributes in a category included
Overlap with the distribution of attribute values of the attribute in other categories
If not, assign the category to the parent node.
Classification is completed by placing it as a child node to
(G-2) When overlapping, use another category and classification
The set of unsuccessful categories is assigned to the child node for the parent node.
(G-3) The category of the child node
The steps (b) to (e) are performed between
The evaluation in the set of attributes selected in step (f)
Excludes attributes used for classification at parent node for child nodes.
The attribute that is the largest of the attributes
Placed as attributes used to classify child nodes as nodes
(G-4) The processing of the above (g-1) to (g-3)
Between the category C _j in Tegori C _i and state (i)
Process until there are no more child nodes to be classified in
Cormorants and steps not completed classification in (h) wherein step (g)
For each child node, the category is divided into an attribute T
The distribution of attribute values for _k is s categories
C _1, ..., if C _i, are overlapping in ... C _s,
The distribution of attribute values for a certain attribute T _k is
Part where Lee C _i does not overlap with all other categories
Min, any category C _i and any other one category
-Overlap, any category C _i and any other 2
Where the categories overlap, ..., any category
-Part where C _i and any other s-1 categories overlap
To divide, for these to split the category attribute T _k
Other than the case of empty set
Performing a process to make Lee, using the probability distribution in the range of attribute values for each (i) attribute, stearyl
New category C _i and category created in top (h)
-Calculate the attribute value probabilities of areas where C _j does not overlap at all
Step a, the attribute stored in the storage device (j) said step (a) to
From the attribute values in the value table, find the frequency of the category,
The frequency of occurrence and the distribution of attribute values obtained in step (i)
Evaluation is performed using an evaluation function that uses the probability of overlap, and
Steps to select the attributes that are most effective for classifying the code
When could not be classified in (k) the step (c) ~ (g)
For the child node, the attribute that is most effective in classifying the child node
Of the category obtained in step (h) using the property
Perform processing to classify in a new category created by splitting
And (l) the knowledge created by the steps (b) to (k).
Create and store data classification flowcharts from different trees
Storing the data in a device, and classifying the data according to the flowchart.

2. (a) Categories C ₁ to C for Classifying Data
C _{i to} C _m are set, and the attribute T of each category is set.
₁ through T _j through T _n aggregates the measurement data for each or calculated
And simulate the result with the corresponding category
And the upper and lower limits for each attribute
Stored in the storage device as an attribute value distribution table representing cloth
That the reference and the step, the attribute value table stored in (b) the storage device
Then, for each attribute, a certain category C _i and another category C _i
-Analysis of the state of overlap of the attribute value distribution with C _j
The state of attribute value distribution for at least one attribute is
Attributes of the category C _j from the distribution of the attribute values of category C _i
Whether the distribution of values can be completely identified (i) or category
The distribution of the attribute values of the distribution of category C _j of attribute values of C _i and one
Partial overlap (ii) or category C
state distribution of the attribute values of _j are included in the distribution of the attribute values of C _i
The process of determining which state (iii) it belongs to
And performing, 1 if (c) each attribute is in the state (i), other
In the case of, define a coefficient of 0 and logically define each attribute.
As variables, and the category C _i and other categories C _j
The set of attributes that can classify
The logical expression of the logical variable with the same attribute
Step of selecting a formula as a set of attributes
And (d) the categories C _i and C _j are combinations of the state (i)
In all categories, all categories
In step (c), a set of attribute sets that can be classified
Step for performing the process of obtaining the logical product of the set of logical expressions obtained in
And (e) the most effective way to create an identification tree from the set of attribute sets.
In order to select a rational set of attributes,
Evaluation function based on the state of appearance and the appearance frequency of category C _i
Process to evaluate more and select the most efficient attribute set
And (f) the set of attributes selected in step (e).
Of the attributes that have the highest rating
-And placed as parent node together with (f-1) parent node
The distribution of attribute values of attributes in a category included
Overlap with the distribution of attribute values of the attribute in other categories
If not, assign the category to the parent node.
Classification is completed by placing it as a child node to
(F-2) If they overlap, use another category and classification
The set of unsuccessful categories is assigned to the child node for the parent node.
(F-3) Category of its child node
The steps (b) to (d) are performed between pairs of
Of the attribute set selected in step (e)
Excludes attributes used for classification at parent node for child nodes.
The attribute that is the largest of the attributes
Placed as attributes used to classify child nodes as nodes
(F-4) The processing of the above (f-1) to (f-3) is performed
Between the category C _j in Tegori C _i and state (i)
Process until there are no more child nodes to be classified in
Cormorants and steps not completed classification (g) In step (f)
For each child node, the category is divided into an attribute T
The distribution of attribute values for _k is s categories
C _1, ..., if C _i, are overlapping in ... C _s,
The distribution of attribute values for a certain attribute T _k is
Part where Lee C _i does not overlap with all other categories
Min, any category C _i and any other one category
-Overlap, any category C _i and any other 2
Where the categories overlap, ..., any category
-Part where C _i and any other s-1 categories overlap
To divide, for these to split the category attribute T _k
Other than the case of empty set
And the step of performing a process to make Lee, using the probability distribution of the range of attribute values for each (h) attribute, stearyl
New category C _i and category created in top (g)
-Calculate the attribute value probabilities of areas where C _j does not overlap at all
Step a, the attribute stored in the storage device by (i) step (a) to
From the attribute values in the value table, find the frequency of the category,
The frequency of occurrence and the distribution of attribute values obtained in step (h)
Evaluation is performed using an evaluation function that uses the probability of overlap, and
Steps to select the attributes that are most effective for classifying the code
When could not be classified in (j) said step (c) ~ (f)
For the child node, the attribute that is most effective in classifying the child node
Of the category obtained in step (g) using the property
Perform processing to classify in a new category created by splitting
Cormorants and step, identification created by (k) the step (b) ~ (j)
Create and store data classification flowcharts from different trees
Storing the data in a device, and classifying the data according to the flowchart.

3. (a) Categories C ₁ to C for Classifying Data
C _{i to} C _m are set, and the attribute T of each category is set.
₁ through T _j through T _n aggregates the measurement data for each or calculated
And simulate the result with the corresponding category
And the upper and lower limits for each attribute
Stored in the storage device as an attribute value distribution table representing cloth
That the reference and the step, the attribute value table stored in (b) the storage device
Then, for each attribute, a certain category C _i and another category C _i
-Analysis of the state of overlap of the attribute value distribution with C _j
The state of attribute value distribution for at least one attribute is
Attributes of the category C _j from the distribution of the attribute values of category C _i
Whether the distribution of values can be completely identified (i) or category
The distribution of the attribute values of the distribution of category C _j of attribute values of C _i and one
Partial overlap (ii) or category C
state distribution of the attribute values of _j are included in the distribution of the attribute values of C _i
The process of determining which state (iii) it belongs to
And performing, 1 if (c) each attribute is in the state (i), other
In the case of, define a coefficient of 0 and logically define each attribute.
As variables, and the category C _i and other categories C _j
The set of attributes that can classify
The logical expression of the logical variable with the same attribute
Step of selecting a formula as a set of attributes
If, on the combination of; (d) Category C _i and C _j is the state (i)
, The category C _i and all other categories
A set of attribute sets that can be classified is obtained in step (c).
Performing the process of calculating the logical product of the set of logical expressions
If, on the combination of (e) the category C _i and C _j is the state (i)
To make all categories classifiable with each other.
A set of attribute sets for the logical expression obtained in step (d)
(F) performing a process of obtaining a logical product of the logical product of the sets; and (f) selecting a set of the attribute sets obtained in the step (e).
Performing a process of selecting one set of arbitrary attributes from the set
When, the set of attributes selected in (g) said step (f)
Inside the parent attribute with any attributes
Placed as a node, and (g-1)
Distribution of attribute values of attributes in other categories
Does not overlap with the distribution of attribute values for that attribute in the
In some cases, the certain category is
(G-2)
When overlapped, it could not be classified into another category
A set of categories as child nodes for the parent node
(G-3) step between the set of its child nodes.
(B) to (e) and perform processing on the child node.
Any of the attributes except those used for classification at the parent node
Attribute is a child node having the child node as a parent node.
(G-4) of the above (g-1) to (g-3)
The process is defined as category C _i and category C _{j in} state (i).
Until there are no more child nodes to classify between
And performing a process, not completed classification in (h) wherein step (g)
For each child node, the category is divided into an attribute T
The distribution of attribute values for k is s categories
C _1, ..., if C _i, are overlapping in ... C _s,
The distribution of attribute values for a certain attribute T _k is
Part where Lee C _i does not overlap with all other categories
Min, any category C _i and any other one category
-Overlap, any category C _i and any other 2
Where the categories overlap, ..., any category
-Part where C _i and any other s-1 categories overlap
To divide, for these to split the category attribute T _k
Other than the case of empty set
Performing a process to make Lee, using the probability distribution in the range of attribute values for each (i) attribute, stearyl
New category C _i and category created in top (h)
-Calculate the attribute value probabilities of areas where C _j does not overlap at all
Step a, the attribute stored in the storage device (j) said step (a) to
From the attribute values in the value table, find the frequency of the category,
The frequency of occurrence and the distribution of attribute values found in step (i)
Evaluation is performed using an evaluation function that uses the probability of overlap, and
Steps to select the attributes that are most effective for classifying the code
When could not be classified in (k) the step (c) ~ (g)
For the child node, the attribute that is most effective in classifying the child node
Of the category obtained in step (h) using the property
Perform processing to classify in a new category created by splitting
And (l) the knowledge created by the steps (b) to (k).
Create and store data classification flowcharts from different trees
Storing the data in a device, and classifying the data according to the flowchart.

4. (a) Categories C ₁ to C for Classifying Data
C _{i to} C _m are set, and the attribute T of each category is set.
₁ through T _j through T _n aggregates the measurement data for each or calculated
And simulate the result with the corresponding category
And the upper and lower limits for each attribute
Stored in the storage device as an attribute value distribution table representing cloth
That the reference and the step, the attribute value table stored in (b) the storage device
Then, for each attribute, a certain category C _i and another category C _i
-Analysis of the state of overlap of the attribute value distribution with C _j
The state of attribute value distribution for at least one attribute is
Attributes of the category C _j from the distribution of the attribute values of category C _i
Whether the distribution of values can be completely identified (i) or category
The distribution of the attribute values of the distribution of category C _j of attribute values of C _i and one
Partial overlap (ii) or category C
state distribution of the attribute values of _j are included in the distribution of the attribute values of C _i
The process of determining which state (iii) it belongs to
And performing, 1 if (c) each attribute is in the state (i), other
In the case of, define a coefficient of 0 and logically define each attribute.
As variables, and the category C _i and other categories C _j
The set of attributes that can classify
The logical expression of the logical variable with the same attribute
Step of selecting a formula as a set of attributes
If, on the combination of; (d) Category C _i and C _j is the state (i)
, The category C _i and all other categories
A set of attribute sets that can be classified is obtained in step (c).
Performing the process of calculating the logical product of the set of logical expressions
If, on the combination of (e) the category C _i and C _j is the state (i)
To make all categories classifiable with each other.
A set of attribute sets for the logical expression obtained in step (d)
(F) performing the process of obtaining the identification tree from the set of the attribute sets;
In order to select a rational set of attributes,
Evaluation function based on the state of appearance and the appearance frequency of category C _i
Process to evaluate more and select the most efficient attribute set
(G) setting the attribute set selected in the step (f).
Inside the parent attribute with any attributes
Placed as a node, and (g-1)
Distribution of attribute values of attributes in other categories
Does not overlap with the distribution of attribute values for that attribute in the
In some cases, the certain category is
(G-2)
When overlapped, it could not be classified into another category
A set of categories as child nodes for the parent node
(G-3) between a set of categories of its child nodes
Performs the processing of steps (b) to (e)
In the attribute set selected in (f), the child node
Arbitrary attributes except those used for classification at the parent node
Is used to classify child nodes whose parent node is the child node.
(G-4) The above (g-1) ~
Ca in the (g-3) processing the category C _i and state (i)
There is no child node to classify with the category C _j
And (h) the classification is not completed in the step (g).
For each child node, the category is divided into an attribute T
The distribution of attribute values for _k is s categories
C _1, ..., if C _i, are overlapping in ... C _s,
The distribution of attribute values for a certain attribute T _k is
Part where Lee C _i does not overlap with all other categories
Min, any category C _i and any other one category
-Overlap, any category C _i and any other 2
Where the categories overlap, ..., any category
-Part where C _i and any other s-1 categories overlap
To divide, for these to split the category attribute T _k
Other than the case of empty set
Performing a process to make Lee, using the probability distribution in the range of attribute values for each (i) attribute, stearyl
New category C _i and category created in top (h)
-Calculate the attribute value probabilities of areas where C _j does not overlap at all
Step a, the attribute stored in the storage device (j) said step (a) to
From the attribute values in the value table, find the frequency of the category,
The frequency of occurrence and the distribution of attribute values obtained in step (i)
Evaluation is performed using an evaluation function that uses the probability of overlap, and
Steps to select the attributes that are most effective for classifying the code
When could not be classified in (k) the step (c) ~ (g)
For the child node, the attribute that is most effective in classifying the child node
Of the category obtained in step (h) using the property
Perform processing to classify in a new category created by splitting
And (l) the knowledge created by the steps (b) to (k).
Create and store data classification flowcharts from different trees
Storing the data in a device, and classifying the data according to the flowchart.

5. (a) Categories C ₁ to C for Classifying Data
C _{i to} C _m are set, and the attribute T of each category is set.
₁ through T _j through T _n aggregates the measurement data for each or calculated
And simulate the result with the corresponding category
And the upper and lower limits for each attribute
Stored in the storage device as an attribute value distribution table representing cloth
That the reference and the step, the attribute value table stored in (b) the storage device
Then, for each attribute, a certain category C _i and another category C _i
-Analysis of the state of overlap of the attribute value distribution with C _j
The state of attribute value distribution for at least one attribute is
Attributes of the category C _j from the distribution of the attribute values of category C _i
Whether the distribution of values can be completely identified (i) or category
The distribution of the attribute values of the distribution of category C _j of attribute values of C _i and one
Partial overlap (ii) or category C
state distribution of the attribute values of _j are included in the distribution of the attribute values of C _i
The process of determining which state (iii) it belongs to
And performing, 1 if (c) each attribute is in the state (i), other
In the case of, define a coefficient of 0 and logically define each attribute.
As variables, and the category C _i and other categories C _j
The set of attributes that can classify
The logical expression of the logical variable with the same attribute
Step of selecting a formula as a set of attributes
If, on the combination of; (d) Category C _i and C _j is the state (i)
, The category C _i and all other categories
A set of attribute sets that can be classified is obtained in step (c).
Performing the process of calculating the logical product of the set of logical expressions
If, on the combination of (e) the category C _i and C _j is the state (i)
To make all categories classifiable with each other.
A set of attribute sets for the logical expression obtained in step (d)
(F) performing a process of obtaining a logical product of the logical product of the sets; and (f) selecting a set of the attribute sets obtained in the step (e).
Performing a process of selecting one set of arbitrary attributes from the set
And (g) from the set of attributes selected in step (f) above.
Place the most efficient attributes on the nodes for creating the identification tree
In order, the distribution of attribute values overlapping state and category C _i
Perform evaluation processing using an evaluation function based on the frequency of appearance
And (h) the set of attributes selected in step (f).
And the group having the highest evaluation in step (g)
Property with its included categories as a parent node.
(H-1) in a certain category included in the parent node
Distribution of attribute values in other categories
If there is no overlap with the attribute value distribution of the attribute,
Place category as child node for the parent node
To complete the classification, and (h-2)
A group of categories that could not be classified with other categories
Is arranged as a child node to the parent node, and (h−
3) step between the set of categories of its child nodes
Perform the processing of (b) to (e) and select in step (f)
The evaluation in step (g) above is
The attribute used for classification at the parent node for the child node
Set the largest attribute among the excluded attributes to the parent node of the child node.
Arranged as attributes used to classify child nodes as nodes
(H-4) The processing of (h-1) to (h-3) is
Between the category C _j in Tegori C _i and state (i)
Process until there are no more child nodes to be classified in
Cormorants and steps not completed classification in (i) wherein step (h)
For each child node, the category is divided into an attribute T
The distribution of attribute values for _k is s categories
C _1, ..., if C _i, are overlapping in ... C _s,
The distribution of attribute values for a certain attribute T _k is
Part where Lee C _i does not overlap with all other categories
Min, any category C _i and any other one category
-Overlap, any category C _i and any other 2
Where the categories overlap, ..., any category
-Part where C _i and any other s-1 categories overlap
To divide, for these to split the category attribute T _k
Other than the case of empty set
Performing a process to make Lee, using the probability distribution in the range of attribute values for each (j) attribute, stearyl
New category C _i and category created in step (i)
-Calculate the attribute value probabilities of areas where C _j does not overlap at all
Step a, the attribute stored in the storage device (k) the step (a) to
From the attribute values in the value table, find the frequency of the category,
Of the frequency of occurrence and the distribution of attribute values found in step (j)
Evaluation is performed using an evaluation function that uses the probability of overlap, and
Steps to select the attributes that are most effective for classifying the code
When could not be classified in (l) said step (c) ~ (h)
For the child node, the attribute that is most effective in classifying the child node
Min category determined in step (i) with sex
Perform processing to classify in a new category created by splitting
Cormorants and step, identification created by (m) the step (b) ~ (l)
Create and store data classification flowcharts from different trees
Storing the data in a device, and classifying the data according to the flowchart.

6. (a) Categories C ₁ to C for Classifying Data
C _{i to} C _m are set, and the attribute T of each category is set.
₁ through T _j through T _n aggregates the measurement data for each or calculated
And simulate the result with the corresponding category
And the upper and lower limits for each attribute
Stored in the storage device as an attribute value distribution table representing cloth
That the reference and the step, the attribute value table stored in (b) the storage device
Then, for each attribute, a certain category C _i and another category C _i
-Analysis of the state of overlap of the attribute value distribution with C _j
The state of attribute value distribution for at least one attribute is
Attributes of the category C _j from the distribution of the attribute values of category C _i
Whether the distribution of values can be completely identified (i) or category
The distribution of the attribute values of the distribution of category C _j of attribute values of C _i and one
Partial overlap (ii) or category C
state distribution of the attribute values of _j are included in the distribution of the attribute values of C _i
The process of determining which state (iii) it belongs to
And performing, 1 if (c) each attribute is in the state (i), other
In the case of, define a coefficient of 0 and logically define each attribute.
As variables, and the category C _i and other categories C _j
The set of attributes that can classify
The logical expression of the logical variable with the same attribute
Step of selecting a formula as a set of attributes
If, on the combination of; (d) Category C _i and C _j is the state (i)
, The category C _i and all other categories
A set of attribute sets that can be classified is obtained in step (c).
Performing the process of calculating the logical product of the set of logical expressions
If, on the combination of (e) the category C _i and C _j is the state (i)
To make all categories classifiable with each other.
A set of set of eyes of the attribute step (d) in the obtained logical formula
(F) performing the process of obtaining the identification tree from the set of the attribute sets;
In order to select a rational set of attributes,
Evaluation function based on the state of appearance and the appearance frequency of category C _i
Process to evaluate more and select the most efficient attribute set
(G) setting the attribute set selected in the step (f).
Of the attributes that have the highest rating
-And placed as parent node, and (g-1) parent node
The distribution of attribute values of attributes in a category included
Overlap with the distribution of attribute values of the attribute in other categories
If not, assign the category to the parent node.
Classification is completed by placing it as a child node to
(G-2) When overlapping, use another category and classification
The set of unsuccessful categories is assigned to the child node for the parent node.
(G-3) The category of the child node
The steps (b) to (e) are performed between
The evaluation in the set of attributes selected in step (f)
Excludes attributes used for classification at parent node for child nodes.
The attribute that is the largest of the attributes
Placed as attributes used to classify child nodes as nodes
(G-4) The processing of the above (g-1) to (g-3)
Between the category C _j in Tegori C _i and state (i)
Process until there are no more child nodes to be classified in
Cormorants and steps not completed classification in (h) wherein step (g)
For each child node, the category is divided into an attribute T
The distribution of attribute values for _k is s categories
C _1, ..., if C _i, are overlapping in ... C _s,
The distribution of attribute values for a certain attribute T _k is
Part where Lee C _i does not overlap with all other categories
Min, any category C _i and any other one category
-Overlap, any category C _i and any other 2
Where the categories overlap, ..., any category
-Part where C _i and any other s-1 categories overlap
To divide, for these to split the category attribute T _k
Other than the case of empty set
Performing a process of making a lie, and (i) Classification failed in the above steps (c) to (g)
For the child node, use any attribute
New made by the division of the obtained category in (h)
Performing a process of classifying by category; and (j) the knowledge created in steps (b) to (i).
Create and store data classification flowcharts from different trees
Storing the data in a device, and classifying the data according to the flowchart.