JPH05303597A

JPH05303597A - Record clasififying system of information processor

Info

Publication number: JPH05303597A
Application number: JP3066042A
Authority: JP
Inventors: Yuji Honda; 祐司本多; Shigeo Tachibana; 茂生立花
Original assignee: Oki Electric Industry Co Ltd
Current assignee: Oki Electric Industry Co Ltd
Priority date: 1991-03-29
Filing date: 1991-03-29
Publication date: 1993-11-16
Anticipated expiration: 2015-01-31
Also published as: JP3004067B2

Abstract

PURPOSE:To easily specify a group that record of decision analytic processing belongs to by means of the information processor which classifies records by using the decision analytic processing. CONSTITUTION:When the decision analytic processing is performed, a group that each record being input data belongs to needs to be clear. For the designation of the group that the input record belongs to, a classification scale field prescribing the group is selected (201). Then a condition which is applied to the data in the selected classification scale field to determine which group the data belongs to is incorporated (202). Lastly, groups that respective records belong to are determined according to the incorporated condition (203 and 204). Then proceeding to the decision analytic processing is made right after the group are determined (207-210).

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は情報処理装置のレコード
分類方式に関し、特に、判別分析処理を利用した方式に
適用して好適なものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a record classification system of an information processing apparatus, and is particularly suitable for application to a system utilizing discriminant analysis processing.

【０００２】[0002]

【従来の技術】情報処理装置として、複数種類のデータ
（いわゆるフィールド）が組となったいわゆるレコード
単位に情報処理するものがある。この場合に、複数のレ
コードを所定の観点から分類し、分類毎に又はいずれか
の分類のレコードについて処理することがある。例え
ば、支店名、支店所在地、売場面積、売上高、利益、従
業員数等のフィールドでなるレコードを、売上高に基づ
いて分類し、従業員一人当りの利益に分類された群間で
差が生じているか否かを検出するようなことも行なわれ
る。2. Description of the Related Art As an information processing device, there is an information processing device that processes information in so-called record units, which is a set of a plurality of types of data (so-called fields). In this case, a plurality of records may be classified from a predetermined viewpoint, and records may be processed for each classification or for any of the classification records. For example, records that consist of fields such as branch name, branch location, sales floor area, sales, profit, and number of employees are classified based on sales, and there is a difference between groups classified as profits per employee. It is also performed to detect whether or not it is present.

【０００３】ここで、新たなレコードが入力された場合
には、どの群に属するのかを決定する。この場合、分類
先の群を規定する分類尺度フィールドにデータがあれば
その値に基づいて分類すれば良いが、分類尺度フィール
ドのデータが不明である場合には、他のフィールドのデ
ータから所属群を決定する（分類する）ことが行なわれ
る。このような所属群の決定には、多変量解析の一手法
である判別分析処理が利用されることが多い。なお、判
別分析処理は、この他に分類尺度の選定の妥当性や分類
条件の妥当性の判断等にも利用される。Here, when a new record is input, it is determined which group it belongs to. In this case, if there is data in the classification scale field that defines the group to be classified, classification may be performed based on that value. However, if the data in the classification scale field is unknown, the group belonging to other fields may be used. Is determined (classified). A discriminant analysis process, which is a method of multivariate analysis, is often used to determine such belonging groups. The discriminant analysis process is also used to judge the validity of the selection of the classification scale, the validity of the classification conditions, and the like.

【０００４】以下、判別分析処理（厳密に言えば教育判
別分析処理）の概念について図２を用いて簡単に説明す
る。図２は、２個のフィールドデータｘ1 及びｘ2 から
所属群Ｇ1 及びＧ2 を判別する場合の例である。The concept of discriminant analysis processing (strictly speaking, educational discriminant analysis processing) will be briefly described below with reference to FIG. FIG. 2 shows an example in which the belonging groups G1 and G2 are discriminated from the two field data x1 and x2.

【０００５】予め所属群Ｇ1 又はＧ2 が分かっているレ
コードＲ11、Ｒ12、…及びＲ21、Ｒ22、…から、群間の
境界を規定する平面である判別関数Ｄを決定しておく。
図２からも明らかなように、判別関数Ｄは、２次元座標
系では直線で表されるように、各フィールドデータｘ1
及びｘ2 の１次結合で表されるものであり、群間の差を
最も大きくかつ群内のばらつきを最も小さくする、各レ
コードの座標の投影面になっている。From the records R11, R12, ... And R21, R22, ... Which the belonging group G1 or G2 is known in advance, the discriminant function D which is a plane defining the boundary between the groups is determined.
As is apparent from FIG. 2, the discriminant function D is represented by a straight line in the two-dimensional coordinate system, and each field data x1
And x2 linear combination, which is the projection plane of the coordinates of each record that maximizes the difference between the groups and minimizes the variation within the group.

【０００６】なお、レコードＲ11、Ｒ12、…及びＲ21、
Ｒ22、…の所属群Ｇ1 又はＧ2 の決定は、上述のフィー
ルドデータｘ1 及びｘ2 とは異なる分類尺度フィールド
のデータ（ｘ3 ）による。The records R11, R12, ... And R21,
The determination of the belonging group G1 or G2 of R22, ... Is based on the data (x3) of the classification scale field different from the above-mentioned field data x1 and x2.

【０００７】今、分類尺度フィールドのデータ値ｘ3 が
不明なレコードＲｘが入力されたとする。このときに
は、レコードＲｘのフィールドデータｘ1 及びｘ2 から
座標系における位置を求め、この位置を、判別関数Ｄで
規定される面に投影した位置ＤＲｘに基づいて、このレ
コードＲｘの所属群（Ｇ1)を決定する。It is assumed that a record Rx whose data value x3 in the classification scale field is unknown is input. At this time, the position in the coordinate system is obtained from the field data x1 and x2 of the record Rx, and based on the position DRx obtained by projecting this position on the surface defined by the discriminant function D, the belonging group (G1) of this record Rx is determined. decide.

【０００８】従って、所属不明のレコードの所属群を決
定するための従来の一連の処理は、図３に示すようにな
る。まず、判別分析処理を起動し、次に、例えば、補助
記憶装置等に既に格納されているレコードから一部レコ
ードを選択し、選択したレコードが属する群の指定を取
込む（ステップ１０１〜１０３）。その後、各種の演算
を実行して判別関数等の決定を行ない、得られた判別関
数等の処理結果を利用者に表示認識させる（ステップ１
０４、１０５）。このような処理を終了すると、所属不
明のレコードを取込んで所属群を決定し、その結果を例
えば表形式等で表示して一連の処理を終了する（１０
６、１０７）。Therefore, a conventional series of processes for determining the belonging group of records whose belongings are unknown is shown in FIG. First, the discriminant analysis process is started, and then, for example, some records are selected from the records already stored in the auxiliary storage device, and the designation of the group to which the selected record belongs is taken in (steps 101 to 103). .. After that, various calculations are executed to determine the discriminant function and the like, and the user is caused to display and recognize the obtained processing result of the discriminant function and the like (step 1).
04, 105). When such a process is completed, a record whose affiliation is unknown is taken in, the affiliation group is determined, the result is displayed in, for example, a table format, and the series of processes is terminated (10
6, 107).

【０００９】このように、判別関数を求めるためには、
所属が既に判っている複数組のレコードを選択指定して
おくことを要する。この具体的な方法としては、既存の
データファイルに基づいて群毎の別個のデータファイル
を形成したり、また、レコードに群識別情報を付与した
りしていた。Thus, in order to obtain the discriminant function,
It is necessary to select and specify multiple sets of records whose affiliation is already known. As a concrete method, a separate data file for each group is formed based on an existing data file, or group identification information is added to a record.

【００１０】[0010]

【発明が解決しようとする課題】しかしながら、従来で
は、判別関数等を演算させるために入力するレコードの
所属群を指定する場合、いずれの具体的方法を採用した
としても、利用者がレコードの分類尺度フィールドのデ
ータ値をレコード毎に判断して、一々所属群の指定を行
なっていた。従って、多くの時間及び手間を要するもの
であった。However, in the past, when specifying a group to which a record to be input for calculating a discriminant function or the like is assigned, the user classifies the record regardless of which concrete method is adopted. The data value of the scale field was judged for each record, and the belonging group was designated one by one. Therefore, it takes a lot of time and labor.

【００１１】レコードの分類精度を向上させようとする
と、所属が指定された入力レコードが多数求められ、そ
のため、レコード毎に指定操作を行なうという上述の問
題は大きい。In order to improve the classification accuracy of records, a large number of input records whose affiliations are designated are required, so that the above-described problem of performing the designation operation for each record is great.

【００１２】本発明は、以上の点を考慮してなされたも
ので、判別分析処理の入力レコードに対する所属群の指
定を、簡単に行なうことができ、かつ一連の処理時間を
短縮化できる情報処理装置のレコード分類方式を提供し
ようとするものである。The present invention has been made in consideration of the above points, and the information processing that can easily specify the belonging group for the input record of the discriminant analysis process and can shorten the series of processing time. It is intended to provide a record classification system for a device.

【００１３】[0013]

【課題を解決するための手段】かかる課題を解決するた
め、本発明においては、所属群が不明のレコードをいず
れかの群に分類したり、所属群が指定された各レコード
がその所属群に属する確率を計算したりする判別分析処
理を実行する情報処理装置のレコード分類方式におい
て、判別分析処理の入力データたる各レコードの所属群
の指定を、以下の処理で行なうこととした。In order to solve such a problem, in the present invention, a record whose belonging group is unknown is classified into one of the groups, and each record in which the belonging group is designated belongs to that group. In the record classification method of the information processing device that executes the discriminant analysis process for calculating the probability of belonging, the affiliation group of each record that is the input data of the discriminant analysis process is specified by the following process.

【００１４】すなわち、所属群を規定する分類尺度フィ
ールドを選択する選択処理と、選択された分類尺度フィ
ールドのデータに適用してどの群に所属するかを決定す
る条件を取込む条件取込み処理と、取込まれた条件に基
づいて各レコードの所属群を決定する判定処理とによっ
て行なうこととした。That is, a selection process for selecting a classification scale field that defines a belonging group, and a condition fetching process for incorporating a condition for applying to the data of the selected classification scale field to determine which group it belongs to, The determination process is to determine the group to which each record belongs based on the captured conditions.

【００１５】[0015]

【作用】情報処理装置において、所属群が不明のレコー
ドをいずれかの群に分類したり、所属群が指定された各
レコードがその所属群に属する確率を計算したりする処
理には、判別分析処理が用いられる。判別分析処理を行
なう場合、その入力データたる各レコードについては所
属群が明らかである必要がある。In the information processing apparatus, a discriminant analysis is performed in a process of classifying a record whose belonging group is unknown into one of the groups or calculating a probability that each record in which the belonging group is designated belongs to the belonging group. Processing is used. When performing the discriminant analysis process, it is necessary that the belonging group is clear for each record that is the input data.

【００１６】本発明は、このような入力レコードの所属
群の指定に関する。本発明では、まず、所属群を規定す
る分類尺度フィールドを選択し、次に、選択された分類
尺度フィールドのデータに適用してどの群に所属するか
を決定する条件を取込み、最後に、取込まれた条件に基
づいて各レコードの所属群を決定する。そして、この所
属群の決定の後、直ちに判別分析処理に進む。The present invention relates to designation of a group to which such an input record belongs. In the present invention, first, a classification scale field that defines the belonging group is selected, and then a condition that is applied to the data of the selected classification scale field to determine which group it belongs to is taken in. Finally, The group to which each record belongs is determined based on the embedded conditions. Then, after the affiliation group is determined, the process immediately proceeds to the discriminant analysis process.

【００１７】[0017]

【実施例】以下、本発明の一実施例を図面を参照しなが
ら詳述する。この実施例に係る情報処理装置も、一般の
コンピュータシステムと同様に、図示は省略するがハー
ドウェア的にはＣＰＵやメモリやキーボードやＣＲＴデ
ィスプレイ等から構成されている。この実施例の情報処
理装置は、レコード分類の機能からは図４に示すように
構成されている。なお、図４に示す機能構成は、実際上
ソフトウェアによって構成されている。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS An embodiment of the present invention will be described in detail below with reference to the drawings. Like the general computer system, the information processing apparatus according to this embodiment is also configured by a CPU, a memory, a keyboard, a CRT display and the like in terms of hardware, although not shown. The information processing apparatus of this embodiment is configured as shown in FIG. 4 from the function of record classification. The functional configuration shown in FIG. 4 is actually configured by software.

【００１８】図４に示すように、機能構成（プログラム
構造概念）は、制御モジュール１１、入力処理モジュー
ル１２、出力処理モジュール１３、表管理処理モジュー
ル１４、強調処理モジュール１５及び統計処理モジュー
ル１６に分けられる。As shown in FIG. 4, the functional configuration (program structure concept) is divided into a control module 11, an input processing module 12, an output processing module 13, a table management processing module 14, an emphasis processing module 15, and a statistical processing module 16. Be done.

【００１９】制御モジュール１１は全体制御を司るもの
であり、各種モジュール１２〜１６はこの制御モジュー
ル１１の管理下にあって所定の処理を実行する。The control module 11 controls the overall control, and the various modules 12 to 16 are under the control of the control module 11 and execute predetermined processing.

【００２０】入力処理モジュール１２は、キーボード等
の入力装置から入力されたレコードやコマンドを取込む
ものである。例えば、分類を決定する分類尺度フィール
ドのデータを有するレコードや分類尺度フィールドのデ
ータが未知で所属群が不明のレコードを取込むものであ
る。また、分類尺度フィールドの指定を受付けたり、入
力レコードの群を決定する条件を取込んだりするもので
ある。The input processing module 12 takes in records and commands input from an input device such as a keyboard. For example, a record having data of a classification scale field that determines a classification or a record of which the data of the classification scale field is unknown and the belonging group is unknown is taken. It also accepts the designation of the classification scale field and incorporates the conditions that determine the group of input records.

【００２１】出力処理モジュール１３は、ＣＲＴディス
プレイやプリンタ等に対する出力制御を行なうものであ
る。例えば、入力レコードの所属群の指定に供する画像
を出力させたり、得られた判別関数を出力させたり、所
属不明のレコードについて得られた所属群情報を出力さ
せたりするものである。The output processing module 13 controls output to a CRT display, a printer and the like. For example, an image used for designating the affiliation group of an input record is output, the obtained discriminant function is output, or affiliation group information obtained for a record whose affiliation is unknown is output.

【００２２】表管理処理モジュール１４は、補助記録装
置や主メモリ等に格納されている複数のレコードを含む
表データを管理するものである。この実施例の場合、こ
のモジュール１４によって管理される表データには、複
数のレコードの一部を他のレコードより強調することを
表す強調情報が付与されることがある。The table management processing module 14 manages table data including a plurality of records stored in the auxiliary recording device, the main memory or the like. In this embodiment, the tabular data managed by this module 14 may be provided with emphasis information indicating that a part of a plurality of records is emphasized over other records.

【００２３】強調処理モジュール１５は、表管理処理モ
ジュール１４によって管理されている表データにおけ
る、所定の条件を満足する一部のレコードを、他のレコ
ードと区別して強調するものである。すなわち、強調さ
れたレコード群と強調されないレコード群とに２分し
て、判別関数等の決定に必要な入力レコードの所属群の
指定を行なうものである。強調処理モジュール１５は、
詳細には、条件定義部１５ａと、判定部１５ｂと、強調
処理部１５ｃとからなる。条件定義部１５ａは、入力処
理モジュール１２を起動して、分類尺度フィールドの指
定と、分類尺度フィールドのデータ値を基に強調するレ
コードか否かを決定する条件とを取込むものである。判
定部１５ｂは、条件に従って各レコードが強調すべきも
のか否かを判定するものである。強調処理部１５ｃは、
判定結果に基づいて表管理処理モジュール１４の管理下
にある表に強調情報を付与するものである。The emphasis processing module 15 emphasizes a part of the records in the table data managed by the table management processing module 14 that satisfy a predetermined condition, in distinction from other records. That is, the group to which the input record belongs, which is necessary for determining the discriminant function and the like, is designated by dividing into the emphasized record group and the unemphasized record group. The emphasis processing module 15
More specifically, the condition definition unit 15a, the determination unit 15b, and the emphasis processing unit 15c are included. The condition definition unit 15a activates the input processing module 12 and takes in the designation of the classification scale field and the condition for determining whether or not the record is an emphasis record based on the data value of the classification scale field. The determination unit 15b determines whether or not each record should be emphasized according to the condition. The emphasis processing unit 15c
Emphasized information is added to the table under the control of the table management processing module 14 based on the determination result.

【００２４】統計処理モジュール１６は、判別関数等の
算出及び所属不明レコードの所属先決定を行なうもので
ある。統計処理モジュール１６は判別分析部１６ａ及び
確率計算部１６ｂからなる。判別分析部１６ａは、強調
されたレコード群と強調されていないレコード群との双
方を用いて、判別関数等の情報を算出するものである。
確率計算部１６ｂは、所属不明のレコードが与えられた
場合に、判別分析部１６ａが算出した判別関数その他の
情報を用いて、そのレコードの所属群を決定するもので
あり、その所属群決定情報を一方の所属群に属している
確率で表すものである。なお、確率計算部１６ｂは、所
属群が指定された入力レコードに対しても、その所属確
率を演算する。The statistical processing module 16 calculates a discriminant function or the like and determines the affiliation destination of an unknown affiliation record. The statistical processing module 16 includes a discriminant analysis unit 16a and a probability calculation unit 16b. The discriminant analysis unit 16a calculates information such as a discriminant function using both the emphasized record group and the unemphasized record group.
The probability calculation unit 16b determines the group to which the record belongs by using the discriminant function calculated by the discriminant analysis unit 16a and other information when a record whose affiliation is unknown is given. Is represented by the probability of belonging to one belonging group. The probability calculator 16b also calculates the belonging probability of the input record for which the belonging group is designated.

【００２５】次に、所属不明のレコードの所属群を決定
する（レコードを分類する）ための一連のレコード分類
処理を、図１、図５ないし図７を参照しながら説明す
る。なお、図１は処理の流れを示すフローチャートであ
る。図５及び図６は判別関数の決定等に用いられる入力
レコードでなる表データを示す説明図であって、図５は
強調処理前のものを、図６は強調処理後のものを示して
いる。図７は、所属不明レコードの所属群の決定結果を
示す説明図である。図５ないし図７に示す例のものは、
４個のフィールドＡ、Ｂ、Ｃ、Ｄによって１個のレコー
ドが形成されているものである。Next, a series of record classification processing for determining the belonging group of records whose belongings are unknown (classifying the records) will be described with reference to FIGS. 1 and 5 to 7. 1. FIG. 1 is a flowchart showing the flow of processing. 5 and 6 are explanatory views showing table data which are input records used for determining a discriminant function, etc., FIG. 5 shows before emphasis processing, and FIG. 6 shows after emphasis processing. .. FIG. 7 is an explanatory diagram showing the determination result of the affiliation group of the affiliation unknown record. The example shown in FIG. 5 to FIG.
One record is formed by four fields A, B, C and D.

【００２６】まず、使用者によって指定された分類尺度
フィールドを取込む（ステップ２０１）。この場合、複
数のフィールドを分類尺度フィールドとして指定しても
良い。次に、各入力レコードを２群に振り分けるため
の、分類尺度フィールドについての条件を取込む（ステ
ップ２０２）。分類尺度フィールドとして複数のフィー
ルドが指定された場合には、各フィールドについての条
件と、その結果に対する総合条件とを取込む。そして、
取込んだ条件に基づいた判定を行ない、レコードに強調
レコードか非強調レコードかを表す情報を付与する（ス
テップ２０３）。First, the classification scale field designated by the user is fetched (step 201). In this case, a plurality of fields may be designated as the classification scale fields. Next, the condition about the classification scale field for distributing each input record into two groups is taken in (step 202). When a plurality of fields are specified as the classification scale field, the condition for each field and the total condition for the result are taken in. And
A determination is made based on the fetched conditions, and information indicating whether the record is an emphasized record or a non-emphasized record is added to the record (step 203).

【００２７】図６は、分類尺度フィールドとしてフィー
ルドＡが指定され、かつ、強調レコードの条件として
「フィールドＡの値＞６００」が指示された場合の強調
結果を表している。FIG. 6 shows an emphasis result when the field A is designated as the classification scale field and "the value of the field A>600" is specified as the condition of the emphasis record.

【００２８】その後、使用者による指定を待たずに、強
調結果に基づいてレコードを自動的に２群に振り分ける
（ステップ２０４）。図６の例の場合、レコード番号が
１、２、４、５及び６のレコード群と、レコード番号が
３、７、８、９及び１０のレコード群に分けられる。Thereafter, the records are automatically sorted into two groups based on the emphasis result without waiting for the designation by the user (step 204). In the case of the example of FIG. 6, it is divided into a record group having record numbers 1, 2, 4, 5 and 6 and a record group having record numbers 3, 7, 8, 9 and 10.

【００２９】この状態において、判別分析処理が起動さ
れ、まず振り分けられた各レコード群を自動的に判別分
析に用いる入力レコード群として指定する（ステップ２
０５、２０６）。その後、判別分析処理を実行して判別
関数の決定等を行ない、得られた判別関数等の処理結果
を利用者に表示認識させる（ステップ２０７、２０
８）。このような分類のための前処理を終了すると、所
属不明のレコードを取込んで、判別関数等に基づいて所
属群を決定し（例えば所属確率を演算し）、その結果を
例えば表形式等で表示して一連の処理を終了する（２０
９、２１０）。In this state, the discriminant analysis process is activated, and firstly, each sorted record group is automatically designated as an input record group to be used for discriminant analysis (step 2).
05, 206). After that, a discriminant analysis process is executed to determine a discriminant function and the like, and the user is caused to display and recognize the processing result of the discriminant function obtained (steps 207, 20).
8). When the preprocessing for such classification is completed, the records whose affiliation is unknown are taken in, the affiliation group is determined based on the discriminant function, etc. (for example, the affiliation probability is calculated), and the result is displayed in, for example, a table format. Display and end the series of processing (20
9, 210).

【００３０】図７は、分類尺度フィールドＡの値が未知
の所属不明のレコードが２個（レコード番号１１及び１
２）与えられた場合の結果を示しており、この例の場合
には、これらレコードは共に強調側の群（フィールドＡ
の値が６００より大きい群）に属する確率（強調確率）
の方が、非強調側の群に属する確率より大きくなってい
る。FIG. 7 shows that there are two records of which the value of the classification scale field A is unknown and whose membership is unknown (record numbers 11 and 1).
2) shows the result when given, and in the case of this example, these records are both the group on the emphasis side (field A).
Probability of belonging to a group whose value of is greater than 600) (emphasis probability)
Is greater than the probability of belonging to the non-emphasized group.

【００３１】なお、図７は、所属群が指定された入力レ
コードについても確率を示している。例えば、レコード
番号が６のレコードは非強調群に属すると指定されてい
るが、判別分析結果からは強調群に属する確率もかなり
高くなっている。このような結果は、分類尺度フィール
ドの選定や分類条件の妥当性の解析にも利用される。Note that FIG. 7 also shows the probabilities for the input records for which the affiliation group is designated. For example, although the record with the record number 6 is designated as belonging to the non-emphasized group, the probability of belonging to the emphasized group is considerably high from the result of the discriminant analysis. Such results are also used for selecting the classification scale field and analyzing the validity of the classification conditions.

【００３２】従って、上述の実施例によれば、所属不明
のレコードを分類したり、所属が指定されたレコードが
その所属群に属する確率を計算したりする判別分析処理
を実行するために、複数の入力レコードを２群以上の群
に振り分ける処理を、分類尺度フィールドの指定や条件
設定によって自動的に行なうようにしたので、処理時間
及び使用者の手間を大幅に削減することができる。すな
わち、レコードを複数群に振り分けた後に入力操作する
必要がなく、群を意識することなく入力し、その後分類
尺度フィールドの指定及び条件設定を行なえば良いこと
を意味し、使用者の使い勝手が向上する。また、既に、
記憶装置に格納されているレコードを用いる場合でも、
その所属群指定を容易に行なうことができる。Therefore, according to the above-mentioned embodiment, in order to execute the discriminant analysis process of classifying records whose affiliation is unknown and calculating the probability that the record whose affiliation is designated belongs to the affiliation group, Since the process of allocating the input record of 2 to the group of 2 or more is automatically performed by the designation of the classification scale field and the condition setting, the processing time and the labor of the user can be significantly reduced. In other words, there is no need to perform input operation after sorting records into multiple groups, it is sufficient to input without being aware of the group, and then specify the classification scale field and set the conditions, which improves the usability of the user. To do. Also, already
Even when using the record stored in the storage device,
The affiliation group can be easily specified.

【００３３】実際上、データベース装置や表計算装置等
の情報処理装置においては、多量のレコードを処理し、
必要に応じてレコードを分類したり、その分類の妥当性
を判断する必要があり、上述の実施例のように用いられ
ることが多く生じているので、上述の効果は大きなもの
である。In practice, in an information processing device such as a database device or a spreadsheet device, a large number of records are processed,
Since it is necessary to classify the records as needed and to judge the appropriateness of the classification, which is often used as in the above-mentioned embodiment, the above-described effect is great.

【００３４】因に、データベース装置や表計算装置等の
従来の情報処理装置の中には、一部レコードを他のレコ
ードと異なるように強調する処理を有しているが、この
強調処理は判別分析処理と別個に設けられている処理で
あり、そのため、判別分析処理で直接には利用できず、
使用者は強調処理を実行させた後に、判別分析処理で利
用できるように加工する必要があり、実施例に比較すれ
ば、やはり時間や手間のかかるものであった。Incidentally, some conventional information processing devices such as database devices and spreadsheet devices have a process of emphasizing some records so that they are different from other records. It is a process that is provided separately from the analysis process, so it cannot be used directly in the discriminant analysis process.
It is necessary for the user to carry out the emphasizing process and then process it so that it can be used in the discriminant analysis process, which is time-consuming and time-consuming as compared with the embodiment.

【００３５】なお、上述の実施例においては２群に判別
するものを示したが、３群以上に分類（判別）するもの
に適用することもできる。また、１レコードが４フィー
ルドデータでなるものを示したが、これに限定されるも
のではない。さらに、所属群が不明のレコードに対する
分類処理を実行しない、判別分析処理にも適用できる。
すなわち、特許請求の範囲でいうレコード分類は、所属
群が指定されたレコードに対する所属確率を演算するよ
うなことも含んでいる。In the above-mentioned embodiment, the case of discriminating into two groups is shown, but the invention can be applied to the case of discriminating (determining) into three or more groups. Further, although one record is shown to be composed of four field data, it is not limited to this. Furthermore, the present invention can be applied to the discriminant analysis process in which the classification process is not executed for records whose belonging group is unknown.
That is, the record classification referred to in the claims includes calculating the belonging probability for the record in which the belonging group is designated.

【００３６】[0036]

【発明の効果】以上のように、本発明によれば、レコー
ド分類を行なう判別分析処理で必要となる、複数群の入
力レコードの指定を、分類尺度フィールドの指定、及び
そのフィールドデータについての条件の決定によって自
動的に行なうようにしたので、使用者による操作を大幅
に削減することができると共に、入力レコードの指定を
も含めた処理時間を従来より格段的に短くすることがで
きる。As described above, according to the present invention, the specification of a plurality of groups of input records, the specification of the classification scale field, and the conditions for the field data thereof, which are necessary in the discriminant analysis process for classifying records, are performed. Since it is automatically performed according to the determination of, the operation by the user can be significantly reduced, and the processing time including the designation of the input record can be significantly shortened as compared with the conventional case.

[Brief description of drawings]

【図１】実施例方式の処理フローチャートである。FIG. 1 is a processing flowchart of an embodiment method.

【図２】判別分析処理の概略説明図である。FIG. 2 is a schematic explanatory diagram of a discriminant analysis process.

【図３】従来方式の処理フローチャートである。FIG. 3 is a processing flowchart of a conventional method.

【図４】実施例方式の機能ブロック図である。FIG. 4 is a functional block diagram of an embodiment system.

【図５】強調処理前の表データの説明図である。FIG. 5 is an explanatory diagram of table data before emphasis processing.

【図６】強調処理後の表データの説明図である。FIG. 6 is an explanatory diagram of table data after emphasis processing.

【図７】最終処理結果の説明図である。FIG. 7 is an explanatory diagram of a final processing result.

[Explanation of symbols]

１１…制御モジュール、１２…入力処理モジュール、１
３…出力処理モジュール、１４…表管理処理モジュー
ル、１５…強調処理モジュール、１６…統計処理モジュ
ール。11 ... Control module, 12 ... Input processing module, 1
3 ... Output processing module, 14 ... Table management processing module, 15 ... Emphasis processing module, 16 ... Statistical processing module.

Claims

[Claims]

1. An information processing apparatus for executing a discriminant analysis process of classifying a record whose belonging group is unknown into one of the groups, and calculating a probability that each record in which the belonging group is designated belongs to the belonging group. In the record classification method of, the selection process that selects the classification scale field that defines the belonging group, and the condition capture process that captures the condition that determines which group it belongs to by applying to the data of the selected classification scale field. The determination process for determining the affiliation group of each record based on the fetched conditions specifies the affiliation group of each record that is the input data of the discriminant analysis process, and then shifts to the discriminant analysis process. Record classification method of information processing device.