JPS61246885A - Plural dictionary producing system - Google Patents

Plural dictionary producing system

Info

Publication number
JPS61246885A
JPS61246885A JP60087529A JP8752985A JPS61246885A JP S61246885 A JPS61246885 A JP S61246885A JP 60087529 A JP60087529 A JP 60087529A JP 8752985 A JP8752985 A JP 8752985A JP S61246885 A JPS61246885 A JP S61246885A
Authority
JP
Japan
Prior art keywords
area
category
dictionary
temporary
dictionary creation
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP60087529A
Other languages
Japanese (ja)
Inventor
▲はい▼ 東善
Touzen Hai
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Fujitsu Ltd
Original Assignee
Fujitsu Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Fujitsu Ltd filed Critical Fujitsu Ltd
Priority to JP60087529A priority Critical patent/JPS61246885A/en
Publication of JPS61246885A publication Critical patent/JPS61246885A/en
Pending legal-status Critical Current

Links

Abstract

PURPOSE:To avoid a learning pattern from being erroneously classified by producing plural dictionaries by taking into account the relationship between areas. CONSTITUTION:Areas U1 and U2 are ones for producing the dictionaries of categories alpha and beta, respectively. An area U4 is the one where no dictionary is produced. An area U5 other than the area U1, U2 and U4 is the one where the dictionary of any category can be produced. In this case, where the learning pattern beta* belonging to the category beta arises, the dictionary of the category betahas to be produced within a circle U3 with a radius alpha. However it can be produced in a blank area U5 or in the area U2 other than the overlapping area U4.

Description

【発明の詳細な説明】 〔概 要〕 本発明は複数辞書作成方式に係り、新たな辞書を作成す
る場合に、その辞書だけならば正分類されていた学習パ
タンか既存の他のカテゴリの新辞書により誤分類される
という悪影響をなくし、すべての学習パタンを正分類で
きるようにしたものである。
[Detailed Description of the Invention] [Summary] The present invention relates to a multiple dictionary creation method, and when creating a new dictionary, learning patterns that would have been correctly classified if only the dictionary was used or new patterns of other existing categories are used. This eliminates the negative effects of misclassification by dictionaries and enables correct classification of all learning patterns.

〔産業上の利用分野〕[Industrial application field]

本発明は複数辞書作成方式に関する。一般に、人間が書
く文字はそれぞれ特有のくせを持っている。このことは
、たとえ簡単な数字であっても例外ではない。例えば数
字の「8」を書いた場合でも正確に「8」と書かれれば
問題はないが、「3」に見えたり、全く判別できないこ
ともあり、また「8」と認識できても機械が判別するに
は照合するための辞書が幾つも必要となる。
The present invention relates to a multiple dictionary creation method. In general, each character written by humans has its own unique characteristics. This is no exception, even when it comes to simple numbers. For example, if you write the number ``8'', there is no problem if it is written as ``8'' correctly, but it may look like ``3'' or cannot be recognized at all, and even if the machine can recognize ``8'', it may not be a problem. In order to make a determination, several dictionaries are required for comparison.

従って、通常はある1つのカテゴリに関してもいくつか
の入力された学習パタンに関し、複数個の辞書を作成し
ておき、実際の人力パタンを照合する際にはこれら複数
個の辞書がその照合の対象となる。
Therefore, normally, multiple dictionaries are created for several input learning patterns for one category, and when comparing actual human-powered patterns, these multiple dictionaries are used as the target for matching. becomes.

本発明は、かかる複数辞書の作成方式に関する。The present invention relates to a method for creating such multiple dictionaries.

〔従来の技術及び発明が解決しようとする問題点〕従来
の複数辞書作成方式は、第3図に示すように、異なる文
字間の辞書の関係は何ら考慮されていなかった。
[Prior Art and Problems to be Solved by the Invention] As shown in FIG. 3, the conventional multiple dictionary creation system does not take into account the relationship between different characters in the dictionary.

第3図において、あるパターン、例えば数字の「8」と
「9」についてそれぞれ複数の辞書を作成する場合には
、先ず「8」について入力されたいくつかの学習パタン
の平均M、をとり1つの辞書を作り、それを適当に、例
えば3つに分割して「8」についての3つの辞書Me’
 、Me” 、M%を作成する。
In Figure 3, when creating multiple dictionaries for a certain pattern, for example the numbers "8" and "9", first take the average M of several learning patterns input for "8" and Create one dictionary and divide it into three parts, for example, to create three dictionaries for "8".
, Me'', M% are created.

次に「9」に関しても同様に、1つの辞書M9から3つ
の複数辞書M9’l Me”l Me”lを作成する。
Next, for "9", three plural dictionaries M9'l Me"l Me"l are similarly created from one dictionary M9.

このように作成された複数辞書を以って、入力パタンと
の照合を行う。
Using the plurality of dictionaries created in this way, the input pattern is compared with the input pattern.

しかし、従来は、上記のとおり「8」に関する複数辞書
M s ’ + M s ” + M s 3と「9」
に関する複数辞書Mq’ 、Mq” 、M、”との関係
は考慮されずに、全く独立に作成されていた。
However, conventionally, as mentioned above, multiple dictionaries related to "8" M s ' + M s '' + M s 3 and "9"
The relationships between the multiple dictionaries Mq', Mq'', M,'' were created completely independently.

従って、学習パタンPが「8」である場合、本来は辞書
M81により認識されるべき筈であるにもかかわらず、
距離d、よりもd、の方が小さいため、辞書M g ’
によって「9」であると認識されてしまう。
Therefore, when the learning pattern P is "8", although it should originally be recognized by the dictionary M81,
Since the distance d is smaller than the distance d, the dictionary M g'
Therefore, it is recognized as "9".

即ち、従来は、学習パタンか正しく分類されないという
問題点があった。
That is, in the past, there was a problem that learning patterns were not classified correctly.

〔問題点を解決するための手段及び作用〕本発明は上記
問題点を解消し誤分類をな(すものであり、その手段は
先ず各カテゴリごとに仮りの複数辞書作成領域を想定し
、次にこれらの重複領域をいずれの辞書も作成できない
領域として決定し、更に直前の仮り領域との関係で誤分
類されないようにすることより成り、各領域間の関係も
十分考慮されているので従来のように学習パタンか誤分
類されることがない。
[Means and operations for solving the problem] The present invention solves the above problem and eliminates misclassification.The method is to first assume a plurality of temporary dictionary creation areas for each category, and then perform the following steps. This method consists of determining these overlapping areas as areas for which no dictionary can be created, and further preventing misclassification due to the relationship with the previous temporary area. In this way, the learned pattern will not be misclassified.

〔実施例〕〔Example〕

以下、本発明を実施例により添付図面を参照して説明す
る。
Hereinafter, the present invention will be explained by way of examples with reference to the accompanying drawings.

第1図は、本発明方式の説明図である。FIG. 1 is an explanatory diagram of the system of the present invention.

今、仮りに辞書のカテゴリをα、β、例えば数字のr8
J、r9Jとしそれらの辞書作成のために用いる学習パ
タンをαhα2.α31・・・α7cα、β1.β2.
β1.・・・α、6βとする。
Now, let's assume that the categories in the dictionary are α, β, for example the number r8.
J, r9J, and the learning patterns used to create these dictionaries are αhα2. α31...α7cα, β1. β2.
β1. ...α, 6β.

図中、■、■は単数辞書である。In the figure, ■ and ■ are singular dictionaries.

上記各学習パタンのうち、辞書■によって、α1.α2
.α3のみが認識されたとする。
Among the above learning patterns, α1. α2
.. Assume that only α3 is recognized.

ここで、学習パタンα1に着眼すれば、α1が辞書@に
よって認識されたことは、α1を中心とし1を半径r、
とする円U、の中に他のカテゴリの辞書は他に1つもな
いことを意味する。
Here, if we focus on the learning pattern α1, the fact that α1 was recognized by the dictionary @ means that α1 is the center and 1 is the radius r,
This means that there is no other dictionary of other categories in circle U.

同様に、学習パタンα2.α1に関しても、半径rg、
r3の円U+z、  U13の中には他のカテゴリの辞
書は他に存在しないことになる。
Similarly, learning pattern α2. Regarding α1, the radius rg,
There are no other dictionaries of other categories in the circles U+z and U13 of r3.

従って、正分類された学習パタンαh α2゜α、に関
しては、円Ull、 UI!+  UI3で包囲された
実線で示す領域U、内には■以外の辞書がない。
Therefore, for the correctly classified learning pattern αh α2゜α, the circle Ull, UI! + There are no dictionaries other than ■ in the area U shown by the solid line surrounded by UI3.

即ち、上記実線領域U、内に新しい辞書を作成しようと
すればカテゴリαの辞書しか作れない。
That is, if an attempt is made to create a new dictionary within the solid line area U, only a dictionary for category α can be created.

もし、他のカテゴリの辞書を作れば、それまで正分類さ
れていたカテゴリαの学習パタンかその他ノカテゴリの
辞書によって誤分類されてしまう(第2図の■)。
If a dictionary for other categories is created, it will be misclassified by the learning pattern of category α, which was correctly classified up to that point, or by the dictionary of other categories (■ in Fig. 2).

同じ考え方を、カテゴリ (βについてあてはめれば、
βについては1点鎖線で包囲された領域U2がカテゴリ
の辞書作成領域となる(第2図の■)。
If we apply the same idea to the category (β),
Regarding β, the area U2 surrounded by the dashed-dotted line becomes the category dictionary creation area (■ in FIG. 2).

また、斜線を施した部分U4はカテゴリαの辞書作成領
域賜とβについてのそれU2との重複領域である(第2
図の■)。
In addition, the shaded area U4 is an overlapping area between the dictionary creation area for category α and that for β (second
■) in the figure.

この領域U4にカテゴリβの学習パタンか入れば、■と
の距離が■より近いので、αと認識される。逆の場合は
、αがβと認識される。従って上記U、はいかなる辞書
も作成されない領域である。
If a learning pattern of category β is included in this area U4, it will be recognized as α because the distance to ■ is closer than ■. In the opposite case, α is recognized as β. Therefore, the above U is an area in which no dictionary is created.

更に、上記指摘したU+ 、Uz 、U4以外の空白の
領域U、は、どのカテゴリの辞書も作成可能な領域であ
る(第2図の■)。
Furthermore, the blank area U other than U+, Uz, and U4 mentioned above is an area in which a dictionary of any category can be created (■ in FIG. 2).

このように辞書作成領域が決定されるが、上記は必らず
しも十分な条件ではない。
Although the dictionary creation area is determined in this way, the above conditions are not necessarily sufficient conditions.

即ち、カテゴリαに関する辞書領域U1が決定されたが
、他のカテゴリβの領域U2との関係は考慮されてはい
ない。
That is, although the dictionary area U1 related to the category α has been determined, the relationship with the area U2 of the other category β is not taken into consideration.

例えば、カテゴリβに属する学習パタンβ1が辞書αに
より誤分類されてしまうことがある。これはβ“を中心
とした半径lfI”−■l=d内に@以外の辞書がなか
ったからである。
For example, learning pattern β1 belonging to category β may be misclassified by dictionary α. This is because there are no dictionaries other than @ within the radius lfI''-■l=d centered on β''.

従って、この半径dの円U、内にカテゴリβの辞書を作
成しなければならないことになる(第2図の■)。
Therefore, it is necessary to create a dictionary for category β within this circle U of radius d (■ in Fig. 2).

しかも、このβの辞書は、領域U2内で重複領域U4外
又は空白領域U、に作成しなければならない(第2図の
■)。
Moreover, this dictionary for β must be created within the area U2, outside the overlapping area U4, or in the blank area U (■ in FIG. 2).

上述までの動作の流れのフローを示したのが、第2図の
フローチャートである。
The flowchart in FIG. 2 shows the flow of the operations up to the above.

〔発明の効果〕〔Effect of the invention〕

上述のとおり、本発明によれば、先ず各カテゴリごとに
仮りの複数辞書作成領域を想定し、次にこれらの重複領
域をいずれの辞書も作成できない領域として決定し、更
に直前の仮り領域との関係で誤分類されないようになっ
ているので、各領域間の関係も十分考慮され、従来のよ
うに学習パタンか誤分類されることがなくなった。
As described above, according to the present invention, a temporary multiple dictionary creation area is first assumed for each category, then these overlapping areas are determined as areas in which no dictionary can be created, and furthermore, the overlapping areas with the previous temporary area are determined as areas in which no dictionary can be created. Since it is designed to prevent misclassification based on relationships, the relationships between each area are also fully considered, and learning patterns are no longer misclassified as in the past.

【図面の簡単な説明】[Brief explanation of the drawing]

第1図は本発明方式の説明図、第2図は本発明の実施例
を示すフローチャート、第3図は従来方式の説明図であ
る。 U+ 、Uz 、Uz 、Us・・・辞書作成領域、U
4・・・辞書作成禁止領域、 r、+  rz l  rコ・・・半径、■、■・・・
カテゴリα、βの辞書、 α!、α2.α3・・・カテゴリαの学習パタン、β1
.β2.β3・・・カテゴリβの学習パタン。
FIG. 1 is an explanatory diagram of the method of the present invention, FIG. 2 is a flowchart showing an embodiment of the present invention, and FIG. 3 is an explanatory diagram of the conventional method. U+, Uz, Uz, Us...Dictionary creation area, U
4... Dictionary creation prohibited area, r, + rz l r... radius, ■, ■...
Dictionary of categories α and β, α! , α2. α3...Learning pattern of category α, β1
.. β2. β3: Learning pattern for category β.

Claims (1)

【特許請求の範囲】 パタンを正分類するために各カテゴリごとに複数の辞書
を作成する方式において、 先ず、第1カテゴリから第Nカテゴリまでに属する学習
パタンの代表値を仮りに各カテゴリの辞書とし、 次いで、上記各カテゴリに属する学習パタンが独立にす
べて認識される領域を仮りの複数辞書作成領域とし、 次に、上記仮りの第2カテゴリ複数辞書作成領域と直前
の仮りの第1カテゴリ複数辞書作成領域との重複領域を
双方共通の辞書作成禁止領域として決定し、 更に、上記仮りの第2カテゴリ辞書により正分類される
べき学習パタンと上記仮りの第1カテゴリ辞書間の距離
より小さい距離を動径とする領域と、上記仮りの第2カ
テゴリ複数辞書作成領域とが重なる領域を真の第2カテ
ゴリ複数辞書作成領域として決定し、 以下、第3領域から第N領域まで順次同じ動作を繰り返
すことにより予め各カテゴリの真の複数辞書作成領域を
決定し、各領域内で辞書を増やしてゆくことを特徴とす
る複数辞書作成方式。
[Claims] In a method of creating a plurality of dictionaries for each category in order to classify patterns correctly, first, representative values of learning patterns belonging to the first category to the Nth category are temporarily used in the dictionary for each category. Then, the area where all the learning patterns belonging to each of the above categories are independently recognized is set as a temporary multiple dictionary creation area, and then the above temporary second category multiple dictionary creation area and the previous temporary first category multiple are set as a temporary multiple dictionary creation area. The overlapping area with the dictionary creation area is determined as a common dictionary creation prohibited area, and furthermore, a distance smaller than the distance between the learning pattern to be correctly classified by the temporary second category dictionary and the temporary first category dictionary is determined. The area where the area with radius vector and the temporary second category multiple dictionary creation area overlap is determined as the true second category multiple dictionary creation area, and the same operation is performed sequentially from the third area to the Nth area. A multiple dictionary creation method characterized by determining in advance a true multiple dictionary creation area for each category by repeating the process, and increasing the number of dictionaries within each area.
JP60087529A 1985-04-25 1985-04-25 Plural dictionary producing system Pending JPS61246885A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP60087529A JPS61246885A (en) 1985-04-25 1985-04-25 Plural dictionary producing system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP60087529A JPS61246885A (en) 1985-04-25 1985-04-25 Plural dictionary producing system

Publications (1)

Publication Number Publication Date
JPS61246885A true JPS61246885A (en) 1986-11-04

Family

ID=13917523

Family Applications (1)

Application Number Title Priority Date Filing Date
JP60087529A Pending JPS61246885A (en) 1985-04-25 1985-04-25 Plural dictionary producing system

Country Status (1)

Country Link
JP (1) JPS61246885A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS63265376A (en) * 1985-10-10 1988-11-01 ザ パランチ−ル コ−ポレ−シヨン Pattern sorting means used for pattern recognition system

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS63265376A (en) * 1985-10-10 1988-11-01 ザ パランチ−ル コ−ポレ−シヨン Pattern sorting means used for pattern recognition system

Similar Documents

Publication Publication Date Title
EP0307111B1 (en) Character recognition apparatus
US5418864A (en) Method for identifying and resolving erroneous characters output by an optical character recognition system
US6249786B1 (en) Data-conversion method
JPS61246885A (en) Plural dictionary producing system
Dolatian et al. Strong generative capacity of morphological processes
JP2618018B2 (en) Character recognition device
EP0608148A2 (en) Recognition template enhancement
Griffin et al. A pattern identification system using linear decision functions
Feldman et al. A class of universal linear bounded automata
PAWLAK Learning from Examples–the Case of an Imperfect Teacher
Kurohashi et al. Structural disambiguation in Japanese by evaluating case structures based on examples in a case frame dictionary
Dengel A step towards understanding paper documents
JPH02171876A (en) Pattern recognition processing system
KR20230029256A (en) Method of searching character string and system of performing the method
JPS5851390A (en) Font character recognizing device
Mills Lexicon Based Critical Tokenisation: An Algorithm
JP2639314B2 (en) Character recognition method
Clothiaux et al. Spline Templated Based Handwriting Generation
JP3476872B2 (en) Character recognition device
Uttal A reply to Walsh
Kaur et al. A Brief Approach to Applications of Artificial Intelligence
JPS61114388A (en) Character input device
AR124569A1 (en) NEURONAL NETWORK ARCHITECTURE FOR TRANSACTION DATA PROCESSING
JPH0256085A (en) Method for postprocessing for character recognition
JPS63100585A (en) Character recognizing device