JPS6349822B2

JPS6349822B2 -

Info

Publication number: JPS6349822B2
Application number: JP55178881A
Authority: JP
Inventors: Hiromichi Fujisawa; Shunichi Yajima; Yasuaki Nakano; Osamu Kunisaki; Hirohide Endo; Akizo Kadota
Original assignee: Hitachi Ltd
Current assignee: Hitachi Ltd
Priority date: 1980-12-19
Filing date: 1980-12-19
Publication date: 1988-10-05
Also published as: JPS57103581A

Description

【発明の詳細な説明】本発明はパターン認識方式に関し、特に構造解
析手法を用いる文字認識方式の改良に関する。DETAILED DESCRIPTION OF THE INVENTION The present invention relates to a pattern recognition method, and more particularly to an improvement in a character recognition method using a structural analysis method.

一般に、認識処理過程にはパラメトリツクな、
あるいは離散的な選択肢が含まれている。たとえ
ば、二値化レベルの選択、パターンに含まれる小
面積の連結成分を除去するか否かの選択、また除
去するとすればどの小成分を除去するかの選択
（特公昭46−17698号公報参照）、切れたパターン
をも読むための求済策としての太め処理をおこな
うかやらないかの選択、おこなうとすれば何ビツ
ト太めるかの選択などがある。 In general, the recognition process involves parametric,
Or it contains discrete choices. For example, selection of the binarization level, selection of whether or not to remove small-area connected components included in the pattern, and selection of which small components to remove if removed (see Japanese Patent Publication No. 17698/1983). ), the choice of whether or not to perform thickening processing as a compensation measure to read even broken patterns, and if so, the number of bits to be thickened.

構造解析手法を用いる文字認識手法においては
更に構造解析過程におけるいくつかの選択が含ま
れている。この選択肢はあいまい性なく選択でき
る場合もあるが、あいまい性が残る場合が少くな
い。あいまい性が残る場合は、どの選択肢が選ば
れても良いように、それぞれに対して辞書項目
（標準パターン）を準備しておくことが必要であ
る。 The character recognition method using the structural analysis method further includes several selections in the structural analysis process. Although there are cases where this option can be selected without ambiguity, ambiguity often remains. If ambiguity remains, it is necessary to prepare dictionary entries (standard patterns) for each option so that any option can be selected.

通常は、まず標準的な（あるいは最も信頼性の
高い）選択肢を選択し、認識処理を一通り行い、
得られた結果が十分類似性の高いものであればそ
の結果を答とし、十分高くない場合は、選択した
中で最も信頼性の低い肢の選択を変更して認識処
理を再試行する。 Usually, the standard (or most reliable) option is selected first, the recognition process is run through, and
If the obtained result has a sufficiently high degree of similarity, that result is used as the answer, and if it does not have a sufficiently high degree of similarity, the least reliable option among the selected ones is changed and the recognition process is retried.

このように、認識処理には複数の処理経路があ
ると見ることができる。この処理経路の選択は各
段階における処理パラメータによつて決定され
る。 In this way, recognition processing can be seen as having multiple processing paths. The selection of this processing path is determined by the processing parameters at each stage.

これらの再試行は従来、「ひげ取り」、「ノイズ
除去」、「太め処理」などの観点から行われており
公知である。これらの処理は一括して「摂動」と
呼ばれる。 These retrials have conventionally been performed from the viewpoints of "hair removal,""noiseremoval,""thickeningprocessing," etc., and are well known. These processes are collectively called "perturbation."

これらの方法を用いれば、第１図に示すような
ノイズのある英字「Ａ」が、あるいは切れのある
ような数字「９」が正しく読めるようになる。し
かし、この方法によるとたとえば第２図ａ，ｂの
場合に片仮名「ソ」における小面積の成分３をノ
イズと見做して除去することにより片仮名「ノ」
と誤読されたり、数字「９」における小さなルー
プ４がつぶれて数字「１」と誤読される問題があ
る。 By using these methods, the noisy alphabetic letter "A" as shown in FIG. 1 or the truncated number "9" can be read correctly. However, according to this method, in the case of Figure 2 a and b, for example, the small area component 3 of the katakana ``so'' is regarded as noise and removed, so that the katakana ``no'' is
There is also a problem that the small loop 4 in the number "9" is collapsed and misread as the number "1".

従来、この問題を解決するため、適用された摂
動の種類に応じて判定用閾値を変更させて、誤読
をなくそうとする方式がある。（特願昭55−36075
号参照）この方法によれば、小さなノイズや小さ
な切れを救済しようとしたときに発生し得る誤読
をなくす、という意味で認識方式の信頼性を向上
することができる。しかし、この方法ではたとえ
ば第２図ｃ，ｄに示すような大きなノイズ５や、
除去しても誤読にならないノイズ６の除去を積極
的に行うことは難しい。これは摂動の種類に応じ
た閾値変更などのペナルテイを各カテゴリーに一
様に与えていることに起因する。 Conventionally, in order to solve this problem, there is a method in which the determination threshold is changed depending on the type of perturbation applied to eliminate misreading. (Special application 1986-36075
According to this method, the reliability of the recognition method can be improved in the sense that it eliminates misreading that may occur when attempting to repair small noises or small cuts. However, with this method, for example, large noises 5 as shown in Fig. 2c and d,
It is difficult to actively remove noise 6 that will not result in misreading even if removed. This is due to the fact that penalties such as threshold changes depending on the type of perturbation are uniformly given to each category.

したがつて、本発明の目的は、上記の問題点を
解決し、第２図ｃ，ｄのノイズ５，６のように除
去しても良い場合は除去し、同図ａの孤立成分３
のように除去してはならない場合は除去しないよ
うにして従来以上にノイズや文字変形に対して認
識精度を向上させるとともに、大きな切れや小さ
な凸凹をもつ文字パターンを認識可能とするため
の太め処理を行いかつ、太め処理などから派生し
得る誤読をなくして、上記のようなパターンをも
信頼度高く認識する認識方式を提供することであ
る。以下、文字パターンを例にして、本発明を説
明するが、本発明は文字パターンに限定されるこ
となく様々なパターンの認識に適用され得る。 Therefore, an object of the present invention is to solve the above-mentioned problems, remove noises 5 and 6 in FIG. 2c and d when they can be removed, and remove isolated components 3 in
In cases where it should not be removed, such as in cases where it should not be removed, it is not removed to improve recognition accuracy against noise and character deformation, and thickening processing allows character patterns with large cuts and small irregularities to be recognized. It is an object of the present invention to provide a recognition method that can recognize even the above-mentioned patterns with high reliability while eliminating misreading that may result from thick processing. The present invention will be described below using character patterns as an example, but the present invention is not limited to character patterns and can be applied to recognition of various patterns.

まず、本発明の原理と特徴を説明する。 First, the principle and features of the present invention will be explained.

本発明は以下の３項の事実（F1〜F3）にもと
づいてなされたものである。 The present invention has been made based on the following three facts (F1 to F3).

(F1) 大きな孤立成分を除去することや、大きな
ビツト数の太め処理を行うことにより、同じ辞
書容量（標準パターン数）で認識できる文字パ
ターンの範囲はかなり増大する。(F1) By removing large isolated components and thickening a large number of bits, the range of character patterns that can be recognized with the same dictionary capacity (standard number of patterns) increases considerably.

(F2) 前記対策を実施した際に派生する誤読は、
かなり限られた数の辞書項目（標準パターン）
においてのみ生起する。(F2) Misreadings that occur when implementing the above measures are
A fairly limited number of dictionary entries (standard pattern)
Occurs only in

(F3) 前記の問題となる辞書項目と、誤読となる
場合の摂動の種類や程度および内容とは、かな
り相関が強く、前もつてそれらの関係を記述し
ておくことができる。(F3) There is a fairly strong correlation between the dictionary item in question and the type, degree, and content of perturbation that results in misreading, and it is possible to describe these relationships in advance.

まず、（F1）については、低品質の文字パター
ン（ボールペンや薄い筆記具によるパターン、切
り出しミスのあるパターン、自由手書文字、な
ど）を構造解析的な認識手法で認識しようとする
場合を考えれば、容易に理解できる。 First, regarding (F1), consider the case where we are trying to recognize low-quality character patterns (patterns made with ballpoint pens or thin writing instruments, patterns with cutting errors, free handwritten characters, etc.) using a structural analysis recognition method. , easily understood.

（F2）については、実験的にも示せるが、第
２図からも理解される。構造解析的認識手法にお
いては、各文字変形に対して辞書項目（標準パタ
ーン）を用意しており、たとえば、第２図ｆ，
ｇ，ｈは数字「３」に対する辞書項目（標準パタ
ーン）である。これらのうち、未知の文字パター
ンｅにおける大きなノイズ２１を除去することに
より誤つて認識される可能性のある標準パターン
はｇのみである。しかも、除去してならない領域
は、ごく限られた領域であることが多い。本例で
は、同図ｇの領域２２付近のみが除去を禁止する
領域である。 (F2) can be shown experimentally, but it can also be understood from Figure 2. In the structural analysis recognition method, dictionary entries (standard patterns) are prepared for each character transformation.
g and h are dictionary entries (standard patterns) for the number "3". Of these, only g is the standard pattern that may be erroneously recognized by removing the large noise 21 in the unknown character pattern e. Moreover, the area that must be removed is often a very limited area. In this example, only the area around area 22 in g in the figure is an area where removal is prohibited.

このように、一般的にも、問題となる辞書項目
は限られたものであり、しかも、その範囲を細か
く限定することができる。 In this way, the number of dictionary items that are problematic is generally limited, and the range can be narrowly limited.

（F3）についても、作成した辞書を用いた予
備認識実験によつて示される。各種の変形文字パ
ターンを含んだ50万〜100万字を認識し、誤読と
なつた場合の原因を自動的に分析することによ
り、問題となる辞書項目番号と、摂動の種類、摂
動の程度および内容との相関性に関する情報を収
集できる。 (F3) is also demonstrated through preliminary recognition experiments using the created dictionary. By recognizing 500,000 to 1,000,000 characters containing various deformed character patterns and automatically analyzing the causes of misreading, we can identify the dictionary entry number in question, the type of perturbation, the degree and content of the perturbation. It is possible to collect information regarding the correlation between

したがつて、本発明はこれら３項の事実にもと
づいて、認識結果を得るに到るまでの途中の処理
経路での情報にもとづいて認識処理を制御するよ
うにして、摂動にともなう前記ペナルテイを各カ
チゴリ毎に可変にしたことと等価な処理をおこな
い、その結果として従来より少ない標準パターン
数で高精度な認識ができるようにしたものであ
る。 Therefore, based on these three facts, the present invention controls the recognition process based on information in the processing path on the way to obtaining the recognition result, thereby reducing the penalty caused by perturbation. The process is equivalent to making it variable for each category, and as a result, highly accurate recognition can be achieved with a smaller number of standard patterns than before.

以下、本発明を実施例を参照して詳細に説明す
る。 Hereinafter, the present invention will be explained in detail with reference to Examples.

第３図は本発明の方式を用いる文字認識装置の
１実施例のブロツク図である。これを認識処理の
流れに沿つて説明する。 FIG. 3 is a block diagram of one embodiment of a character recognition device using the method of the present invention. This will be explained along the flow of recognition processing.

未知文字パターン１０を観測部１００によつて
観測し、二値化処理により二値パターン１１を得
る。二値化に際してはスライス・レベルがパラメ
ータとして存在し、このパラメータをP₁と記す。 The unknown character pattern 10 is observed by the observation unit 100, and a binary pattern 11 is obtained by binarization processing. During binarization, a slice level exists as a parameter, and this parameter is denoted as _P1 .

二値パターン１１は太め処理部２００で太めら
れる。太めに際しては太めビツト数がパラメータ
として存在し、このパラメータをP₂と記す。但
し、太めビツト数P₂は０であることもある。す
なわち、太めない場合も、仮想的に太めビツト数
０で太め処理を施すと言うことにする。 The binary pattern 11 is thickened by the thickening processing section 200. For thicker bits, the number of thicker bits exists as a parameter, and this parameter is written as _P2 . However, the number of thick bits _P2 may be 0. In other words, even if the data cannot be thickened, thickening processing is performed by virtually setting the number of thickening bits to 0.

太め処理を受けた二値パターン１２は構造解析
部３００へ入力され、後に詳述するように、輪郭
抽出、輪郭分割という処理を受ける。分割された
輪郭の部分をセグメントと呼ぶ。結局、構造解析
部３００は未知入力パターンの二値パターンをセ
グメントに分割し、それぞれに所定の順で固有番
号を付すという役を担う。 The binary pattern 12 that has undergone the thickening process is input to the structure analysis section 300, and is subjected to processes such as contour extraction and contour division, as will be described in detail later. The divided contour parts are called segments. Ultimately, the structure analysis unit 300 has the role of dividing the binary pattern of the unknown input pattern into segments and assigning unique numbers to each segment in a predetermined order.

特徴抽出部４００は構造解析部３００の出力を
受けて、各セグメントの形状を記述する特徴、お
よび各セグメント間の相対的位置関係を記述する
特徴を抽出する。 The feature extractor 400 receives the output of the structure analyzer 300 and extracts features that describe the shape of each segment and features that describe the relative positional relationship between the segments.

パターン整合部５００は特徴抽出部４００の出
力を受けて、同出力の特徴量と辞書９００内の文
字概念を示す各辞書項目との類似性を評価する。
類似性は所定の定義式で定められた距離で計測す
る。パターン整合部５００は、結局、未知パター
ンの特徴を入力し各辞書項目との距離を計算し、
距離の値の小さい方からＮ（Ｎ：１以上の整数）
個の候補を選出し、Ｎ組の辞書項目番号と対応す
る距離のペアを出力する。 The pattern matching unit 500 receives the output of the feature extraction unit 400 and evaluates the similarity between the output feature amount and each dictionary item indicating a character concept in the dictionary 900.
Similarity is measured by a distance determined by a predetermined definition formula. In the end, the pattern matching unit 500 inputs the features of the unknown pattern and calculates the distance from each dictionary entry.
N from the smallest distance value (N: an integer greater than or equal to 1)
Then, N pairs of dictionary entry numbers and corresponding distances are output.

正当性検定部６００は観測部１００、太め処理
部２００および構造解析部３００から送られるパ
ラメータ２０と、辞書項目番号と距離とのペア１
５を入力を禁則表１０００を参照しながら、各辞
書項目に対する正当性を検定する。禁則表１００
０の詳細については後述する。正当性を否定され
た辞書項目番号と距離のペアは捨てられて、残つ
て辞書項目番号と距離のペアのみが正当性検定部
６００の出力１６となる。 The validity testing unit 600 receives the parameters 20 sent from the observation unit 100, the thick processing unit 200, and the structure analysis unit 300, and the pair 1 of dictionary item number and distance.
5, the validity of each dictionary item is tested while referring to the prohibition table 1000. Table of prohibitions 100
The details of 0 will be described later. The pair of dictionary item number and distance whose validity is denied is discarded, and only the pair of dictionary item number and distance that remains becomes the output 16 of the validity test unit 600.

戦略決定部７００はこの出力１６を受けて、最
終判定を行うか、またはある段階のパラメータを
変更して認識処理を再試行する（摂動する）か否
かの判断をする。再試行（摂動）する場合は、ど
の段階からどのようなパラメータで再試行するか
を指令５０によつて該当する処理部へ伝達する。 Upon receiving this output 16, the strategy determining unit 700 determines whether to make a final determination or to change the parameters at a certain stage and retry (perturb) the recognition process. When retrying (perturbing), the command 50 is used to transmit to the relevant processing unit which stage and with what parameters the retry is to be performed.

戦略決定部７００が最終判定を下して良いと判
定した場合は、辞書項目番号と距離のペアの内、
各カテゴリ内の最小の距離をもつペアのみを選ん
で最終判定部８００へ転送する。 If the strategy determining unit 700 determines that it is OK to make the final determination, among the pairs of dictionary item number and distance,
Only the pair with the minimum distance within each category is selected and transferred to the final determination section 800.

最終判定部８００はカテゴリ名と距離とのペア
を距離の小さい方から２組戦略決定部７００から
入力して、最小距離が絶対閾値δより小さく、か
つ次小距離との差が相対閾値εより大きいときの
み、最小距離を与えるカテゴリ名を認識結果とし
て、それ以外のときは認識不能を示すコードを認
識結果として最終出力２０を得るものである。 The final determination unit 800 inputs two pairs of category names and distances from the strategy determining unit 700, starting from the one with the smallest distance, and determines that the minimum distance is smaller than the absolute threshold δ, and the difference from the next smallest distance is smaller than the relative threshold ε. Only when the distance is large, the category name that provides the minimum distance is used as the recognition result, and in other cases, the final output 20 is obtained using the code that indicates unrecognizable as the recognition result.

さて、主要な部分の処理内容を詳細に説明す
る。 Now, the main processing contents will be explained in detail.

観測部１００、太め処理部２００は公知の技術
で容易に実現できるので説明を省略する。（たと
えば、○如嵎源 The observation unit 100 and the thickening processing unit 200 can be easily realized using known techniques, so their explanation will be omitted. (For example, ○...Jyoga Gen

Claims

[Claims] 1. Observing an unknown pattern and converting it into an electrical signal,
Perform preprocessing on the observed unknown pattern, evaluate the similarity between the pattern corresponding to the preprocessed unknown pattern and a standard pattern registered in the dictionary, and make a predetermined judgment based on the evaluation result. In a pattern recognition method in which the unknown pattern is recognized through a series of recognition processes, a predetermined rule related to the preprocessing is assigned to each standard pattern in advance, and the unknown pattern is When evaluating the similarity between a pattern corresponding to a standard pattern and a standard pattern, it is determined whether the preprocessing performed on the unknown pattern conforms to the above rules of the standard pattern, and thereby the validity of the evaluation result is verified. Test the gender,
A pattern recognition method characterized in that a predetermined control is performed on the series of recognition processes based on the test results. 2. Paragraph 1, wherein the predetermined rule specifies whether or not at least one standard pattern extracted based on the evaluation result is allowed to be used as a candidate pattern for recognition. pattern recognition method. 3. The pattern recognition method according to item 1, wherein the standard pattern registered in the dictionary includes control information for performing the series of recognition processes. 4. The pattern recognition method according to any one of items 1 to 3, wherein the predetermined control is a control for retrying the series of recognition processes based on the test result.