JPS61246885A

JPS61246885A - Plural dictionary producing system

Info

Publication number: JPS61246885A
Application number: JP60087529A
Authority: JP
Inventors: ▲はい▼　東善; Touzen Hai
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 1985-04-25
Filing date: 1985-04-25
Publication date: 1986-11-04

Abstract

PURPOSE:To avoid a learning pattern from being erroneously classified by producing plural dictionaries by taking into account the relationship between areas. CONSTITUTION:Areas U1 and U2 are ones for producing the dictionaries of categories alpha and beta, respectively. An area U4 is the one where no dictionary is produced. An area U5 other than the area U1, U2 and U4 is the one where the dictionary of any category can be produced. In this case, where the learning pattern beta* belonging to the category beta arises, the dictionary of the category betahas to be produced within a circle U3 with a radius alpha. However it can be produced in a blank area U5 or in the area U2 other than the overlapping area U4.

Description

【発明の詳細な説明】〔概　要〕本発明は複数辞書作成方式に係り、新たな辞書を作成す
る場合に、その辞書だけならば正分類されていた学習パ
タンか既存の他のカテゴリの新辞書により誤分類される
という悪影響をなくし、すべての学習パタンを正分類で
きるようにしたものである。[Detailed Description of the Invention] [Summary] The present invention relates to a multiple dictionary creation method, and when creating a new dictionary, learning patterns that would have been correctly classified if only the dictionary was used or new patterns of other existing categories are used. This eliminates the negative effects of misclassification by dictionaries and enables correct classification of all learning patterns.

[Industrial application field]

本発明は複数辞書作成方式に関する。一般に、人間が書
く文字はそれぞれ特有のくせを持っている。このことは
、たとえ簡単な数字であっても例外ではない。例えば数
字の「８」を書いた場合でも正確に「８」と書かれれば
問題はないが、「３」に見えたり、全く判別できないこ
ともあり、また「８」と認識できても機械が判別するに
は照合するための辞書が幾つも必要となる。The present invention relates to a multiple dictionary creation method. In general, each character written by humans has its own unique characteristics. This is no exception, even when it comes to simple numbers. For example, if you write the number ``8'', there is no problem if it is written as ``8'' correctly, but it may look like ``3'' or cannot be recognized at all, and even if the machine can recognize ``8'', it may not be a problem. In order to make a determination, several dictionaries are required for comparison.

従って、通常はある１つのカテゴリに関してもいくつか
の入力された学習パタンに関し、複数個の辞書を作成し
ておき、実際の人力パタンを照合する際にはこれら複数
個の辞書がその照合の対象となる。Therefore, normally, multiple dictionaries are created for several input learning patterns for one category, and when comparing actual human-powered patterns, these multiple dictionaries are used as the target for matching. becomes.

本発明は、かかる複数辞書の作成方式に関する。The present invention relates to a method for creating such multiple dictionaries.

〔従来の技術及び発明が解決しようとする問題点〕従来
の複数辞書作成方式は、第３図に示すように、異なる文
字間の辞書の関係は何ら考慮されていなかった。[Prior Art and Problems to be Solved by the Invention] As shown in FIG. 3, the conventional multiple dictionary creation system does not take into account the relationship between different characters in the dictionary.

第３図において、あるパターン、例えば数字の「８」と
「９」についてそれぞれ複数の辞書を作成する場合には
、先ず「８」について入力されたいくつかの学習パタン
の平均Ｍ、をとり１つの辞書を作り、それを適当に、例
えば３つに分割して「８」についての３つの辞書Ｍｅ’
　、Ｍｅ”　、Ｍ％を作成する。In Figure 3, when creating multiple dictionaries for a certain pattern, for example the numbers "8" and "9", first take the average M of several learning patterns input for "8" and Create one dictionary and divide it into three parts, for example, to create three dictionaries for "8".
, Me'', M% are created.

次に「９」に関しても同様に、１つの辞書Ｍ９から３つ
の複数辞書Ｍ９’ｌ　Ｍｅ”ｌ　Ｍｅ”ｌを作成する。Next, for "9", three plural dictionaries M9'l Me"l Me"l are similarly created from one dictionary M9.

このように作成された複数辞書を以って、入力パタンと
の照合を行う。Using the plurality of dictionaries created in this way, the input pattern is compared with the input pattern.

しかし、従来は、上記のとおり「８」に関する複数辞書
Ｍ　ｓ　’　＋　Ｍ　ｓ　”　＋　Ｍ　ｓ　３と「９」
に関する複数辞書Ｍｑ’　、Ｍｑ”　、Ｍ、”との関係
は考慮されずに、全く独立に作成されていた。However, conventionally, as mentioned above, multiple dictionaries related to "8" M s ' + M s '' + M s 3 and "9"
The relationships between the multiple dictionaries Mq', Mq'', M,'' were created completely independently.

従って、学習パタンＰが「８」である場合、本来は辞書
Ｍ８１により認識されるべき筈であるにもかかわらず、
距離ｄ、よりもｄ、の方が小さいため、辞書Ｍ　ｇ　’
によって「９」であると認識されてしまう。Therefore, when the learning pattern P is "8", although it should originally be recognized by the dictionary M81,
Since the distance d is smaller than the distance d, the dictionary M g'
Therefore, it is recognized as "9".

即ち、従来は、学習パタンか正しく分類されないという
問題点があった。That is, in the past, there was a problem that learning patterns were not classified correctly.

〔問題点を解決するための手段及び作用〕本発明は上記
問題点を解消し誤分類をな（すものであり、その手段は
先ず各カテゴリごとに仮りの複数辞書作成領域を想定し
、次にこれらの重複領域をいずれの辞書も作成できない
領域として決定し、更に直前の仮り領域との関係で誤分
類されないようにすることより成り、各領域間の関係も
十分考慮されているので従来のように学習パタンか誤分
類されることがない。[Means and operations for solving the problem] The present invention solves the above problem and eliminates misclassification.The method is to first assume a plurality of temporary dictionary creation areas for each category, and then perform the following steps. This method consists of determining these overlapping areas as areas for which no dictionary can be created, and further preventing misclassification due to the relationship with the previous temporary area. In this way, the learned pattern will not be misclassified.

〔Example〕

以下、本発明を実施例により添付図面を参照して説明す
る。Hereinafter, the present invention will be explained by way of examples with reference to the accompanying drawings.

第１図は、本発明方式の説明図である。FIG. 1 is an explanatory diagram of the system of the present invention.

今、仮りに辞書のカテゴリをα、β、例えば数字のｒ８
Ｊ、ｒ９Ｊとしそれらの辞書作成のために用いる学習パ
タンをαｈα２．α３１・・・α７ｃα、β１．β２．
β１．・・・α、６βとする。Now, let's assume that the categories in the dictionary are α, β, for example the number r8.
J, r9J, and the learning patterns used to create these dictionaries are αhα2. α31...α7cα, β1. β2.
β1. ...α, 6β.

図中、■、■は単数辞書である。In the figure, ■ and ■ are singular dictionaries.

上記各学習パタンのうち、辞書■によって、α１．α２
．α３のみが認識されたとする。Among the above learning patterns, α1. α2
．． Assume that only α3 is recognized.

ここで、学習パタンα１に着眼すれば、α１が辞書＠に
よって認識されたことは、α１を中心とし１を半径ｒ、
とする円Ｕ、の中に他のカテゴリの辞書は他に１つもな
いことを意味する。Here, if we focus on the learning pattern α1, the fact that α1 was recognized by the dictionary @ means that α1 is the center and 1 is the radius r,
This means that there is no other dictionary of other categories in circle U.

同様に、学習パタンα２．α１に関しても、半径ｒｇ、
ｒ３の円Ｕ＋ｚ、　　Ｕ１３の中には他のカテゴリの辞
書は他に存在しないことになる。Similarly, learning pattern α2. Regarding α1, the radius rg,
There are no other dictionaries of other categories in the circles U+z and U13 of r3.

従って、正分類された学習パタンαｈ　α２゜α、に関
しては、円Ｕｌｌ、　ＵＩ！＋　　ＵＩ３で包囲された
実線で示す領域Ｕ、内には■以外の辞書がない。Therefore, for the correctly classified learning pattern αh α2゜α, the circle Ull, UI! + There are no dictionaries other than ■ in the area U shown by the solid line surrounded by UI3.

即ち、上記実線領域Ｕ、内に新しい辞書を作成しようと
すればカテゴリαの辞書しか作れない。That is, if an attempt is made to create a new dictionary within the solid line area U, only a dictionary for category α can be created.

もし、他のカテゴリの辞書を作れば、それまで正分類さ
れていたカテゴリαの学習パタンかその他ノカテゴリの
辞書によって誤分類されてしまう（第２図の■）。If a dictionary for other categories is created, it will be misclassified by the learning pattern of category α, which was correctly classified up to that point, or by the dictionary of other categories (■ in Fig. 2).

同じ考え方を、カテゴリ　（βについてあてはめれば、
βについては１点鎖線で包囲された領域Ｕ２がカテゴリ
の辞書作成領域となる（第２図の■）。If we apply the same idea to the category (β),
Regarding β, the area U2 surrounded by the dashed-dotted line becomes the category dictionary creation area (■ in FIG. 2).

また、斜線を施した部分Ｕ４はカテゴリαの辞書作成領
域賜とβについてのそれＵ２との重複領域である（第２
図の■）。In addition, the shaded area U4 is an overlapping area between the dictionary creation area for category α and that for β (second
■) in the figure.

この領域Ｕ４にカテゴリβの学習パタンか入れば、■と
の距離が■より近いので、αと認識される。逆の場合は
、αがβと認識される。従って上記Ｕ、はいかなる辞書
も作成されない領域である。If a learning pattern of category β is included in this area U4, it will be recognized as α because the distance to ■ is closer than ■. In the opposite case, α is recognized as β. Therefore, the above U is an area in which no dictionary is created.

更に、上記指摘したＵ＋　、Ｕｚ　、Ｕ４以外の空白の
領域Ｕ、は、どのカテゴリの辞書も作成可能な領域であ
る（第２図の■）。Furthermore, the blank area U other than U+, Uz, and U4 mentioned above is an area in which a dictionary of any category can be created (■ in FIG. 2).

このように辞書作成領域が決定されるが、上記は必らず
しも十分な条件ではない。Although the dictionary creation area is determined in this way, the above conditions are not necessarily sufficient conditions.

即ち、カテゴリαに関する辞書領域Ｕ１が決定されたが
、他のカテゴリβの領域Ｕ２との関係は考慮されてはい
ない。That is, although the dictionary area U1 related to the category α has been determined, the relationship with the area U2 of the other category β is not taken into consideration.

例えば、カテゴリβに属する学習パタンβ１が辞書αに
より誤分類されてしまうことがある。これはβ“を中心
とした半径ｌｆＩ”−■ｌ＝ｄ内に＠以外の辞書がなか
ったからである。For example, learning pattern β1 belonging to category β may be misclassified by dictionary α. This is because there are no dictionaries other than @ within the radius lfI''-■l=d centered on β''.

従って、この半径ｄの円Ｕ、内にカテゴリβの辞書を作
成しなければならないことになる（第２図の■）。Therefore, it is necessary to create a dictionary for category β within this circle U of radius d (■ in Fig. 2).

しかも、このβの辞書は、領域Ｕ２内で重複領域Ｕ４外
又は空白領域Ｕ、に作成しなければならない（第２図の
■）。Moreover, this dictionary for β must be created within the area U2, outside the overlapping area U4, or in the blank area U (■ in FIG. 2).

上述までの動作の流れのフローを示したのが、第２図の
フローチャートである。The flowchart in FIG. 2 shows the flow of the operations up to the above.

〔Effect of the invention〕

上述のとおり、本発明によれば、先ず各カテゴリごとに
仮りの複数辞書作成領域を想定し、次にこれらの重複領
域をいずれの辞書も作成できない領域として決定し、更
に直前の仮り領域との関係で誤分類されないようになっ
ているので、各領域間の関係も十分考慮され、従来のよ
うに学習パタンか誤分類されることがなくなった。As described above, according to the present invention, a temporary multiple dictionary creation area is first assumed for each category, then these overlapping areas are determined as areas in which no dictionary can be created, and furthermore, the overlapping areas with the previous temporary area are determined as areas in which no dictionary can be created. Since it is designed to prevent misclassification based on relationships, the relationships between each area are also fully considered, and learning patterns are no longer misclassified as in the past.

[Brief explanation of the drawing]

第１図は本発明方式の説明図、第２図は本発明の実施例
を示すフローチャート、第３図は従来方式の説明図であ
る。Ｕ＋　、Ｕｚ　、Ｕｚ　、Ｕｓ・・・辞書作成領域、Ｕ
４・・・辞書作成禁止領域、ｒ、＋　　ｒｚ　ｌ　　ｒコ・・・半径、■、■・・・
カテゴリα、βの辞書、 α！、α２．α３・・・カテゴリαの学習パタン、β１
．β２．β３・・・カテゴリβの学習パタン。FIG. 1 is an explanatory diagram of the method of the present invention, FIG. 2 is a flowchart showing an embodiment of the present invention, and FIG. 3 is an explanatory diagram of the conventional method. U+, Uz, Uz, Us...Dictionary creation area, U
4... Dictionary creation prohibited area, r, + rz l r... radius, ■, ■...
Dictionary of categories α and β, α! , α2. α3...Learning pattern of category α, β1
．． β2. β3: Learning pattern for category β.

Claims

[Claims] In a method of creating a plurality of dictionaries for each category in order to classify patterns correctly, first, representative values of learning patterns belonging to the first category to the Nth category are temporarily used in the dictionary for each category. Then, the area where all the learning patterns belonging to each of the above categories are independently recognized is set as a temporary multiple dictionary creation area, and then the above temporary second category multiple dictionary creation area and the previous temporary first category multiple are set as a temporary multiple dictionary creation area. The overlapping area with the dictionary creation area is determined as a common dictionary creation prohibited area, and furthermore, a distance smaller than the distance between the learning pattern to be correctly classified by the temporary second category dictionary and the temporary first category dictionary is determined. The area where the area with radius vector and the temporary second category multiple dictionary creation area overlap is determined as the true second category multiple dictionary creation area, and the same operation is performed sequentially from the third area to the Nth area. A multiple dictionary creation method characterized by determining in advance a true multiple dictionary creation area for each category by repeating the process, and increasing the number of dictionaries within each area.