JPH07200738A

JPH07200738A - Character recognition method

Info

Publication number: JPH07200738A
Application number: JP5335104A
Authority: JP
Inventors: Toshihiro Suzuki; 俊博鈴木
Original assignee: Ricoh Co Ltd
Current assignee: Ricoh Co Ltd
Priority date: 1993-12-28
Filing date: 1993-12-28
Publication date: 1995-08-04

Abstract

PURPOSE:To improve the performance of recognition on the conditions of extension possibility for the character type of an object. CONSTITUTION:Strict conditions are decided for each model. At the same time, dedicated identification processing is decided to the set of models to be matched as needed. In the case of character recognition due to a sub phase structure analyzing method, the strict conditions for each model are checked at the time of detailed classification concerning candidates due to structure matching (rough classification) or when the dedicated identification processing is decided to the set of models, that identification processing is performed.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、文字認識の技術に関す
る。より詳細には、本発明は文字認識における整合（マ
ッチング）処理に係り、特に、大分類結果を出発点とし
て詳細分類を行なう技術に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to character recognition technology. More specifically, the present invention relates to matching processing in character recognition, and more particularly to a technique for performing detailed classification using a large classification result as a starting point.

【０００２】なお、本発明は文字認識装置を初めとして
図面認識装置や文書処理装置の全般に適用できるが、本
発明の原理は音声認識や画像認識を含めたパターン認識
全般にも応用可能である。Although the present invention can be applied to a drawing recognition apparatus and a document processing apparatus in general including a character recognition apparatus, the principle of the present invention can be applied to general pattern recognition including voice recognition and image recognition. .

【０００３】[0003]

【従来の技術】図１に、文字認識におけるこれまでの識
別方法の概要を示す。以下の説明において、一文字のパ
ターン全体を評価するスカラー量を距離と仮定するが、
類似度であっても本質的に変わるものではない。距離で
あれば最も近いものを選択し、類似度であれば最も高い
ものを選択するという違いがあるだけである。2. Description of the Related Art FIG. 1 shows an outline of a conventional identification method in character recognition. In the following description, it is assumed that the scalar amount that evaluates the entire pattern of one character is the distance,
Even the degree of similarity does not essentially change. The only difference is that the closest one is selected for the distance and the highest one is selected for the similarity.

【０００４】最も基本的な形は、図１（ａ）にあるよう
に、構造マッチングによりマッチした候補クラスについ
て、モデルとの間で距離計算を行ない、最も近い距離に
あるものを識別結果とするという方法である。ここでは
大分類と詳細分類といった区分はなく、一つの識別処理
があるのみである。In the most basic form, as shown in FIG. 1A, a candidate class matched by structure matching is subjected to distance calculation with a model, and the closest one is used as an identification result. Is the method. Here, there is no classification such as large classification and detailed classification, and there is only one identification process.

【０００５】この方法では、本出願人の保有する全ての
数字データ（１５６３３７文字）に対して行なった実験
で、正読率９１.８％、拒否率３.６％、誤読率４.６％
という結果が得られた。なお、ここでの”拒否”という
のは、どのモデルにも構造的にマッチしなかったことを
意味する。In this method, in an experiment conducted on all numerical data (156337 characters) possessed by the applicant, the correct reading rate is 91.8%, the rejection rate is 3.6%, and the misreading rate is 4.6%.
The result was obtained. It should be noted that "rejection" here means that it did not structurally match any model.

【０００６】これに対して、本出願人が先に特願平１−
２４５５０７号または特願平２−３３９６９６号で提案
したような準位相構造解析法では、図１（ｂ）に示すよ
うに、構造マッチングにより得られた候補に応じて、幾
つかの特殊な処理を行なっていた。例えば数字であれ
ば、’７’のなかの、あるモデルにマッチした場合には
特殊な処理を行ない、あるものは’１’にマッチしたよ
うに変更し、あるものはリジェクトするといった具合で
ある。また、誤読を減らす目的で、距離に応じてリシェ
クトを決める閾値が導入されており、これは候補モデル
によらず一定値であった。On the other hand, the present applicant previously filed Japanese Patent Application No. 1-
In the quasi-topological structure analysis method proposed in Japanese Patent Application No. 245507 or Japanese Patent Application No. 2-339696, some special processing is performed according to the candidates obtained by the structure matching, as shown in FIG. I was doing. For example, if it is a number, a special process is performed if it matches a certain model in '7', some are changed to match '1' and some are rejected. . Moreover, in order to reduce misreading, a threshold that determines the recess according to the distance was introduced, and it was a constant value regardless of the candidate model.

【０００７】この一定値とすることの利点として、全体
の認識性能の制御が容易となることが挙げられるが、し
かし、開発の段階で認識性能が高まるにつれて、より詳
細な制御が必要となることも無視できない事実である。
この方法では、正読率９３.７％、拒否率５.５％、誤読
率０.８％という結果であった。The advantage of the constant value is that the overall recognition performance can be easily controlled. However, as the recognition performance is improved in the development stage, more detailed control is required. Is a fact that cannot be ignored.
According to this method, the correct reading rate was 93.7%, the rejection rate was 5.5%, and the misreading rate was 0.8%.

【０００８】[0008]

【発明が解決しようとする課題】さて、従来の準位相構
造解析法の詳細分類方法についてさらに検討しよう。こ
の方法は、図２に示すように、構造的にマッチングがと
れたクラスについて距離計算を行ない、しかる後、特殊
処理によって各クラスについての制約を満たさないもの
については候補から外し（図２の例２の候補クラス５
ｂ）、あるいは他の候補を生成し（図２の例１では’
１’も候補に加える）、残ったものについては先の距離
によって最適な候補を一つ残す（図２の例２の候補クラ
ス５ａ，９ｂの一つ）というものであったと言える。こ
こでは、例３のように候補が得られなかった場合は考慮
しない。The detailed classification method of the conventional quasi-phase structure analysis method will be further examined. In this method, as shown in FIG. 2, distance calculation is performed for structurally matched classes, and then those that do not satisfy the constraints for each class due to special processing are excluded from candidates (example in FIG. 2). Candidate class 5 of 2
b) or another candidate is generated (in the example 1 of FIG.
It can be said that 1'is also added to the candidates), and the remaining one leaves one optimal candidate according to the previous distance (one of the candidate classes 5a and 9b in Example 2 of FIG. 2). Here, when no candidate is obtained as in Example 3, no consideration is given.

【０００９】ここにおいて、正読を増やすという目的と
誤読を減らすという目的は、明確に分離されていたわけ
ではない。対象が特定字種のみであれば、このような形
でも十分であるが、対象字種の拡張を考慮した場合には
問題があると考えられる。というのも、誤読を減らすと
いうことは、候補として挙げられたそのモデルに本当に
マッチすべきであるか否かという極めて狭い問題である
のが主であるのに対し、正読を増やすということは、そ
のモデルのみならず、読み取り対象の字種に依存したモ
デルセット全体にかかわる問題となるからである。Here, the purpose of increasing correct reading and the purpose of reducing erroneous reading have not been clearly separated. If the target is only a specific character type, such a form is sufficient, but it is considered to be a problem when the expansion of the target character type is considered. Because reducing misreading is mostly a very narrow question of whether or not it should really match the model listed as a candidate, while increasing correct reading is This is because not only the model but also the entire model set depending on the character type to be read.

【００１０】本発明の解決しようとする課題は、文字認
識における基本的な課題である認識性能の向上、認識速
度の向上、記憶容量の削減の三つのうち、主として認識
性能の向上にかかわるものである。そして、この認識性
能の向上を、対象字種の拡張可能性という条件の下に達
成しようというのが本発明の目的とするとろこである。
また、本発明には、認識速度の向上に関する工夫も盛り
込まれている。The problem to be solved by the present invention is mainly concerned with the improvement of the recognition performance among the three basic problems in the character recognition, namely the improvement of the recognition performance, the improvement of the recognition speed, and the reduction of the storage capacity. is there. The object of the present invention is to achieve this improvement in recognition performance under the condition that the target character type can be expanded.
Further, the present invention includes a device for improving the recognition speed.

【００１１】[0011]

【課題を解決するための手段】さて、対象字種の拡張を
考慮した上で認識性能の向上をはかる場合に、どのよう
な視点が必要となるのであろうか。それは、正読を増や
す、誤認を減らすという二つの目的を見据えたうえで、
さらにそれぞれの目的とモデルセットとのかかわりを知
ったうえで、ある対象字種（例：数字）であれば必ず使
えるもの、他の対象字種（例：英字）であれば必ず使え
るもの、両者の混在するときに使えるもの、といった具
合に「目的に沿った機能の独立性を高めること」である
と考えられる。さらに言うならば、「この文字とこの文
字だけを読みたい」といった個別の要求にも対応可能で
あれば理想的である。[Means for Solving the Problems] What kind of viewpoint is needed when the recognition performance is improved in consideration of the expansion of the target character type? With the dual purpose of increasing correct reading and reducing false positives,
In addition, knowing the relationship between each purpose and model set, you can always use it with a certain target character type (example: number), you can always use it with another target character type (example: English character), both It is considered to be "to increase the independence of the function according to the purpose" such as what can be used when mixed. Furthermore, ideally, it is ideal if it can meet individual requests such as "I want to read this character and only this character."

【００１２】まず、本発明を説明するうえで必要な用語
について説明する。本発明では、大分類結果を分類・把
握するため四つの視点が導入された。これらは、詳細分
類における重要性という観点から導入されたものであ
る。First, terms necessary for describing the present invention will be described. In the present invention, four viewpoints are introduced in order to classify / understand the result of large classification. These are introduced from the viewpoint of importance in detailed classification.

【００１３】その視点とは次の四つである。The viewpoints are the following four.

【００１４】（１）パターン（２）クラス（モデル）（３）クラス組（モデル組）（４）カテゴリもちろん、これ以外の視点として、準位相構造解析法で
あれば、例えばプリミティブであるとか、プリミティブ
系列、あるいは特異点などを考えることも可能であ。個
別のモデルの中においても、ある特定のプリミティブ系
列と特異点の組み合わせという視点を考えることも可能
である。ここでは最も基本的なものとして、上記の四つ
の視点を導入したわけである。(1) Pattern (2) Class (model) (3) Class set (model set) (4) Category Of course, from another viewpoint, if it is a quasi-topological structure analysis method, for example, it is a primitive, It is also possible to consider primitive sequences or singular points. It is possible to consider the viewpoint of a combination of a specific primitive sequence and a singular point even in individual models. Here, we have introduced the above four viewpoints as the most basic ones.

【００１５】個々の視点を説明すると次のようになる。The individual viewpoints will be described below.

【００１６】「パターン」とは、ある種の形態を通して
具象化しているものである。ここでは具体的に言うと、
画像配列による文字表現である。[0016] A "pattern" is embodied through a certain form. Specifically, here
It is a character representation by an image array.

【００１７】「モデル」とは、辞書の一項目であり、文
字の表われの傾向をひとくくりにした代表と考える。A "model" is an item in a dictionary, and is considered as a representative that summarizes the tendency of the appearance of characters.

【００１８】一方、「クラス」とは、そのモデルにマッ
チするパターンの集合と考える。なお、ここでいうモデ
ルは、幾つかのプリミティブ系列と幾つかの特異点によ
って表わされており、それぞれに幾つかの選択可能性が
ある。大分類の観点から言えば、少なくともそのモデル
を候補の一つとしている場合には、そのパターンはその
クラスに属するとみなす。On the other hand, a "class" is considered as a set of patterns that match the model. It should be noted that the model here is represented by several primitive sequences and some singular points, and there are several selection possibilities for each. From the point of view of large classification, the pattern is considered to belong to the class if at least the model is one of the candidates.

【００１９】「モデル組」とは、文字どおりモデルの組
み合わせであり、構造的マッチングという大分類によっ
て同時にマッチしうるモデルの組み合わせを意味する。The "model set" is literally a combination of models, and means a combination of models that can be matched at the same time by a major classification called structural matching.

【００２０】モデル組に対応するパターンの集合を「ク
ラス組」と呼ぶ。これは、類似文字の詳細識別を考える
うえで、非常に重要な視点である。大分類の時点で同時
に一つしか候補をもたない場合もあり、これもまた一つ
のモデル組と考えることにする。A set of patterns corresponding to a model set is called a "class set". This is a very important point of view when considering detailed identification of similar characters. In some cases, there is only one candidate at the same time at the time of major classification, and this is also considered as one model set.

【００２１】「カテゴリ」とは、文字そのもの、変形を
捨象した文字そのものであり、もっとも抽象的なもので
ある。一般には、カテゴリがパターン空間における一つ
のまとまりに対応していないことが多い。そこで複数の
モデルを用意して、それらのクラスによってカテゴリを
覆うようにするというのが普通であり、ここでもその例
にならっている。図３はカテゴリとクラスの概念を説明
するものである。The "category" is the character itself, or the character itself with the transformation removed, and is the most abstract. In general, categories often do not correspond to one group in the pattern space. Therefore, it is usual to prepare multiple models and cover the categories with their classes, which is also an example here. FIG. 3 illustrates the concept of categories and classes.

【００２２】例えば、数字の’４’には上が開いたもの
もあれば閉じたものもあり、これらはカテゴリとして
は’４’という一つのカテゴリではあるが、モデルとし
ては別々に用意するいった具合である。図３において、
クラス４ａ，４ｂ，４ｃとは、それぞれ’４’という文
字カテゴリを表わす一つのモデルに対応するクラスとい
う程度の意味である。同様に、クラス９ａ，９ｂとはそ
れぞれ、’９’というカテゴリを表わす一つのモデルに
対応するクラスという意味である。For example, the numeral "4" may have an open top or a closed top. Although these are one category of "4", they are prepared separately as models. It is in good condition. In FIG.
The classes 4a, 4b, and 4c mean the classes corresponding to one model representing the character category "4". Similarly, the classes 9a and 9b mean the classes corresponding to one model representing the category "9".

【００２３】さて、本発明の目的は、既に述べたように
「目的に沿った機能の独立性を高める」という方針を具
体化することにより、対象字種の拡張可能性を考慮しつ
つ認識性能を高めることにある。これは、本発明によれ
ば、図４に示すように具体化される。The object of the present invention is to realize the recognition performance while considering the expandability of the target character type by embodying the policy of "increasing the independence of the function according to the purpose" as described above. Is to raise. This is embodied according to the invention as shown in FIG.

【００２４】本発明によれば、まず、全てのクラス（モ
デル）のそれぞれに対して拘束条件が定められる。この
拘束条件とは、そのクラスとして許容できる範囲として
の必要条件にあたる。すなわち、この拘束条件（チェッ
ク基準）に反するならば、明かに許容範囲の外にあると
いう条件である。この拘束条件はクラス（モデル）毎に
定めることが可能であり、したがってクラス毎の独立性
がきわめて高い。これは図４においては特殊な矢印記号
で示されており、全てのクラスに必ず一つ用意される。According to the present invention, first, a constraint condition is set for each of all classes (models). This constraint condition is a necessary condition as an allowable range for the class. That is, if the constraint condition (check standard) is violated, it is clearly a condition that the condition is outside the allowable range. This constraint condition can be set for each class (model), and therefore the independence of each class is extremely high. This is indicated by a special arrow symbol in FIG. 4, and one must be prepared for all classes.

【００２５】そして、本発明によれば、構造マッチング
（大分類）によって得られた候補クラスに対する詳細分
類において、対応するモデルの拘束条件のチェックが行
なわれることによって候補が絞り込まれる。According to the present invention, the candidates are narrowed down by checking the constraint conditions of the corresponding model in the detailed classification of the candidate class obtained by the structure matching (major classification).

【００２６】本発明によれば、詳細分類において、クラ
ス組（モデル組）に関して、”必要であれば”、個別の
クラス組用の識別処理が導入される。クラス組用の識別
処理とは、「このクラスとこのクラスの組については、
このように識別する」という個別の処理である。”必要
であれば”というのは、もともの分離が非常によく距離
の大小によって容易に識別できるものについては、この
限りでないという意味である。こうしたクラス組用の識
別処理というのも、クラス組毎に作成すればよく、きわ
めて独立性の高いものである。このクラス組用の識別処
理によって、構造マッチング（大分類）の候補クラスを
絞り込むことができる。According to the present invention, in the detailed classification, with respect to a class set (model set), an identification process for an individual class set is introduced "if necessary". The identification process for a class set is "For this class and this class set,
It is a separate process of “identifying in this way”. By "if necessary" is meant that this is not the case if the original separation is so good that it can be easily identified by the size of the distance. The identification processing for such class sets may be created for each class set, and is highly independent. With this class group identification processing, it is possible to narrow down candidate classes for structural matching (major classification).

【００２７】[0027]

【作用】このようにクラス毎の拘束条件及びそのチェッ
ク、クラス組の識別処理という二つの概念を分離して詳
細分類に導入することによって、対象字種の拡張への対
処が容易になる。また、認識性能の向上という面では、
分離が非常に難しいものについては、新たな特徴を導入
した識別処理を個別に作成し置き換えることが可能とな
り、きわめて好都合である。骨格表現や輪郭表現などの
レベルではなく、画像のレベルまで下がって局所的な画
像特徴を利用するもまた可能である。これによって、よ
り正確な識別が可能となり、認識性能を向上できる。As described above, by separating the two concepts of the constraint condition for each class and the checking thereof and the class group identification processing into the detailed classification, it becomes easy to deal with the expansion of the target character type. Also, in terms of improving recognition performance,
For those that are very difficult to separate, it is extremely convenient because it is possible to individually create and replace an identification process that introduces new features. It is also possible to use local image features down to the level of the image, rather than at the level of skeletal or contour representation. This enables more accurate identification and improves the recognition performance.

【００２８】[0028]

【実施例】次に、クラス毎の拘束条件と、クラス組の識
別処理とのそれぞれについて、より具体的に説明する。[Embodiment] Next, each of the constraint conditions for each class and the class group identification processing will be described more specifically.

【００２９】まず、クラス（モデル）毎の拘束条件につ
いて説明する。クラス毎の拘束条件を、ここでは二つの
段階に分けて考える。一段目がパターンの一文字全体に
かかわる条件であり、二段目が局所的な条件である。First, the constraint conditions for each class (model) will be described. The constraint conditions for each class are considered here in two stages. The first row is the condition related to the entire character of the pattern, and the second row is the local condition.

【００３０】一段目の「全体にかかわる条件」というの
は、パターンとモデルとの距離に関する条件である。こ
れは閾値として導入された。The "condition relating to the whole" in the first step is a condition relating to the distance between the pattern and the model. This was introduced as a threshold.

【００３１】従来は、モデル（クラス）によらない一定
の閾値を採用しており、これはシステム全体の能力を制
御するうえで非常に都合のよいものであった。しかし、
認識性能を高めていくうえでは、むしろ妨げとなってい
る。というのは、全てのクラスがパターン空間において
等しい密度で分布しているということはなく、あるクラ
スは広範囲にわたり、別なクラスはきわめて狭い範囲に
しか存在を許されないということが十分にあり得るから
である。また、一定の閾値の選択が容易でないことは、
実際の分析結果においても確認されている。したがっ
て、認識性能を高めるためには、距離にな関する閾値を
クラス毎に用意することが必要となるこれを一段目とし
て採用するのは、処理量の増加を押さえるためであり、
そのモデルから十分距離の離れたパターンは、局所的な
特徴をみるまでもなく取り除くことが可能であり、また
そのモデルに十分距離の近いパターンは、局所的な特徴
をみるまでもなく、候補として許容しうるであろうとの
考えに基づいている。一方、二段目の「局所的なもの」
というのは、個々のプリミティブ系列と特異点に関する
条件である。プリミティブ系列の特徴パラメータとし
て、両端点の座標、外接矩形の重心と幅と高さがある。
また、特異点の特徴パラメータとして、その座標があ
る。これらの特徴パラメータに加え、プリミティブ系列
の凹凸の度合いを表わすパラメータを導入して、それぞ
れのスカラー量について制限範囲を設けた。これらの組
み合わせをベクトル量として制限範囲を設けることも可
能である。Conventionally, a fixed threshold value which does not depend on the model (class) is adopted, which is very convenient for controlling the capacity of the entire system. But,
It is rather an obstacle to improving the recognition performance. Because not all classes are evenly distributed in the pattern space, it is quite possible that one class is widespread and another is only allowed to exist in a very narrow range. Is. Also, it is not easy to select a certain threshold
It is also confirmed in the actual analysis results. Therefore, in order to improve the recognition performance, it is necessary to prepare a threshold related to distance for each class. This is adopted as the first step in order to suppress an increase in processing amount,
Patterns that are sufficiently far from the model can be removed without looking at local features, and patterns that are sufficiently close to the model are candidates as candidates without looking at local features. It is based on the idea that it will be acceptable. On the other hand, the second-stage "local thing"
This is the condition for individual primitive sequences and singularities. The characteristic parameters of the primitive series include the coordinates of both end points, the center of gravity of the circumscribing rectangle, the width, and the height.
Further, there is a coordinate as a characteristic parameter of the singular point. In addition to these feature parameters, we introduced a parameter that represents the degree of unevenness of the primitive sequence, and set a limit range for each scalar quantity. It is also possible to set a limited range using these combinations as vector quantities.

【００３２】以上は、準位相構造解析法による実現例で
あるが、通常のパターンマッチング的な手法における特
徴ベクトルにおいても、同様に制限範囲を設けることは
可能である。さらに、必要とあれば画像レベルでの特徴
を抽出し直し、これについてもまた制限範囲を設けるこ
とは可能である。こうすることで認識性能を高めること
が可能である。Although the above is an example of implementation by the quasi-phase structure analysis method, it is possible to similarly set a limit range in the feature vector in the ordinary pattern matching method. Further, if necessary, it is possible to re-extract the feature at the image level and set a limited range also for this feature. By doing so, the recognition performance can be improved.

【００３３】次にクラス（モデル）組の識別処理につい
て説明する。クラス毎の拘束条件を導入することによ
り、実際には必要でない候補が整合を取り消されること
になるため、クラス組の種類は減少する。Next, the class (model) set identification processing will be described. By introducing a constraint for each class, candidates that are not actually needed are canceled out of matching, so the number of class sets is reduced.

【００３４】発明者の実験によると、準位相構造解析法
による手書き数字認識（モデル数６８）において、大分
類が終了した時点でのクラス組は１０６種類（モデル数
０：１種類）、モデル数１：６８種類、モデル数２：３
１種類、モデル数３：６種類、モデル数４以上：なし）
であったが、先に述べたクラス毎の拘束条件を導入する
ことによって、クラス組は８２種類（モデル数０：１種
類、モデル数１：６８種類、モデル数２：１３種類、モ
デル数３以上：なし）となった。According to the experiment by the inventor, in handwritten numeral recognition (the number of models is 68) by the quasi-topological structure analysis method, there are 106 kinds of class sets (the number of models is 0: 1) and the number of models at the time when the major classification is completed. 1:68 types, number of models 2: 3
(1 type, 3 models, 6 types, 4 or more models: none)
However, by introducing the constraint conditions for each class described above, there are 82 types of class sets (0: 1 types of models, 1:68 types of models, 2:13 types of models, 3 types of models). Above: None).

【００３５】このうち、クラス組の識別処理が必要とな
るのは、クラス数が２以上である１３種類であるが、そ
のうち５種類は距離の大小によって正しく識別すること
が可能であり、特別な識別処理を要するのは８種類のみ
であった。Of these, 13 types in which the number of classes is 2 or more require class identification processing. Of these, 5 types can be correctly identified depending on the size of the distance and are special. Only eight types required identification processing.

【００３６】この８種類のそれぞれについて、それぞれ
のモデルに対する距離の分布をもとに、識別条件を個別
に定めた。基本的にはそれぞれのモデルに対する距離に
重みを変えることによって、大部分の正しい識別が可能
となった。For each of these eight types, the identification conditions were individually determined based on the distribution of the distances for each model. Basically, by changing the weight to the distance for each model, most correct identification was possible.

【００３７】実験によれば、正読率９４．９％、拒否率
４．７％、誤読率０．４％という結果を得られた。局所
的な特徴や、画像レベルにおける新しい特徴などを導入
すれば、さらに認識性能は向上すると考えられる。According to the experiment, the correct reading rate was 94.9%, the rejection rate was 4.7%, and the misreading rate was 0.4%. It is considered that the recognition performance will be further improved by introducing local features and new features at the image level.

【００３８】以上に述べた本発明を実施するための装置
構成の一例を図５に示す。この例は、準位相構造解析法
による文字認識に本発明を適用した場合である。FIG. 5 shows an example of a device configuration for carrying out the present invention described above. In this example, the present invention is applied to character recognition by the quasi-phase structure analysis method.

【００３９】図５において、画像入力部１００によって
手書き文字等の画像を読み取り、２値イメージデータと
して入力する。特徴抽出部１０２において、２値イメー
ジの骨格線を抽出し、その曲線構造を解析して準位相的
特徴と特異点による文字形状の大局的構造を抽出する。
この大局的構造は、文字骨格線を構成する各ストローク
成分を、単調な曲線構成要素（プリミティブ）の連鎖
（プリミティブ系列）によって表現し、この系列要素の
組み合わせと系列要素間の接続関係、及び、特異点での
系列要素の接続関係などの、系列要素間の２項関係によ
り、文字形状の特徴を記述するものである。なお、この
ような大局的構造の詳細については、例えば本出願人が
先に出願した特願平１−２４５５０７号及び特願平２−
３３９６９６号に述べられているので、これ以上の説明
は省略する。In FIG. 5, an image such as a handwritten character is read by the image input unit 100 and input as binary image data. The feature extraction unit 102 extracts the skeleton line of the binary image, analyzes the curved line structure, and extracts the global structure of the character shape by the quasi-topological feature and the singular point.
This global structure expresses each stroke component that constitutes a character skeleton line by a chain (primitive series) of monotonous curve constituent elements (primitives), and the combination of the series elements and the connection relation between the series elements, and Characteristic features of a character are described by binary relations between series elements, such as connection relationships of series elements at singular points. For details of such a global structure, for example, Japanese Patent Application No. 1-245507 and Japanese Patent Application No. 2-245507 previously filed by the present applicant.
No. 339696, so further explanation is omitted.

【００４０】特徴抽出部１０２においては、さらに、大
局的構造表現をもとに、文字形状の局所的特徴のパラメ
ータベクトルを計算する。求められるパラメータとして
は、例えば、プリミティブ系列の両端点の座標、外接矩
形の重心と幅及び高さ、さらには凹凸の度合い、特異点
の座標などである。これについては、例えば特願平２−
３３９６９６号に詳しいので、これ以上の説明は割愛す
る。The feature extraction unit 102 further calculates the parameter vector of the local feature of the character shape based on the global structure expression. The parameters to be obtained include, for example, the coordinates of both end points of the primitive series, the center of gravity and the width and height of the circumscribing rectangle, the degree of unevenness, and the coordinates of the singular point. Regarding this, for example, Japanese Patent Application No. 2-
Since I am familiar with No. 339696, I will omit further explanation.

【００４１】１１０は文字認識のための辞書部であり、
ここには従来と同様に予め求められた文字形状の大局的
構造記述と局所的パラメータベクトルが登録されてい
る。また、本発明によれば、辞書部１１０に、前述した
ような認識対象字種の各文字に関する各モデル（クラ
ス）毎に定められた拘束条件と、必要なモデル（クラ
ス）組の識別処理も登録されている。Reference numeral 110 is a dictionary section for character recognition,
As in the conventional case, the global structure description of the character shape and the local parameter vector which are obtained in advance are registered therein. Further, according to the present invention, the dictionary unit 110 also performs a constraint condition determined for each model (class) regarding each character of the recognition target character type as described above, and a process of identifying a necessary model (class) set. It is registered.

【００４２】構造マッチング部（大分類部）１０４にお
いては、特徴抽出部１０２より入力する文字形状の大局
的構造と辞書部１１０に登録されている大局的構造との
マッチングをとり、距離の近い（または類似度の高い）
候補クラスを求める。In the structure matching unit (major classification unit) 104, the global structure of the character shape input from the feature extraction unit 102 and the global structure registered in the dictionary unit 110 are matched and the distances are close ( Or high similarity)
Find a candidate class.

【００４３】１０６は、本発明により導入された詳細分
類部（拘束条件チェック／モデル組認識処理部）であ
り、ここでは構造マッチング（大分類）で得られた候補
に関して、辞書部１１０に登録された対応するモデル
（クラス）毎の拘束条件に関するチェックを行ない、ま
た、必要に応じて、辞書部１１０に登録されている対応
のモデル（クラス）組の識別処理を実行することによ
り、候補の絞り込み（詳細分類）を行なう。Reference numeral 106 denotes a detailed classification section (constraint condition check / model group recognition processing section) introduced by the present invention. Here, the candidates obtained by the structure matching (major classification) are registered in the dictionary section 110. The constraint condition is checked for each corresponding model (class), and if necessary, identification processing of the corresponding model (class) set registered in the dictionary unit 110 is performed to narrow down the candidates. (Detailed classification).

【００４４】局所的特徴マッチング部１０８では、詳細
分類部１０８で絞り込んだ候補に関して、辞書部１１０
に登録されている文字形状のパラメータベクトルとのマ
ッチングをとって最終的な認識結果を求める。なお、詳
細分類の結果を最終的な認識結果とし、局所的マッチン
グ部１０８を省くことも可能である。The local feature matching unit 108 has a dictionary unit 110 for the candidates narrowed down by the detailed classification unit 108.
The final recognition result is obtained by matching with the character shape parameter vector registered in. It is also possible to omit the local matching unit 108 by setting the result of the detailed classification as the final recognition result.

【００４５】詳細分類部（拘束条件チェック／モデル組
識別処理部）１０６及び拘束条件に関して、次のような
様々な態様を採り得る。With respect to the detailed classification section (constraint condition check / model set identification processing section) 106 and the restraint conditions, various aspects as described below can be adopted.

【００４６】（１）モデル毎に定められた拘束条件のチ
ェックのみを行なう。拘束条件の一つとして、大分類の
ための特徴（大局的特徴）の距離もしくは類似度等の一
つのスカラー量を用いる。(1) Only the constraint conditions determined for each model are checked. As one of the constraint conditions, one scalar quantity such as the distance or the similarity of the feature (global feature) for the large classification is used.

【００４７】（２）モデル毎に定められた拘束条件のチ
ェックのみを行なう。拘束条件の一つとして、大分類の
ための特徴の要素（スカラー量）あるいはその組み合わ
せ（ベクトル量）毎に制限範囲を設ける。(2) Only the constraint conditions determined for each model are checked. As one of the constraint conditions, a limit range is set for each feature element (scalar amount) or combination (vector amount) of features for large classification.

【００４８】（３）モデル毎に定められた拘束条件のチ
ェックのみを行なう。拘束条件の一つとして、大分類の
ための特徴のみならず、あらかじめ定められた個別の特
殊な特徴（スカラー量またはベクトル量）を用いる。(3) Only the constraint conditions determined for each model are checked. As one of the constraint conditions, not only features for large classification but also predetermined special features (scalar amount or vector amount) are used.

【００４９】（４）モデル毎に定められた拘束条件のチ
ェックのみを行なう。モデル毎の拘束条件のなかで、大
分類のための特徴の距離や類似などの一つのスカラー量
を最初の条件とし、これがあらかじめ定められた範囲内
にあれば、他の特徴を調べることなくモデルとの整合を
取り消す。(4) Only the constraint conditions determined for each model are checked. Of the constraint conditions for each model, one scalar quantity such as the distance or similarity of features for large classification is used as the first condition, and if this is within a predetermined range, the model is examined without examining other features. Cancel the alignment with.

【００５０】（５）モデル毎に定められた拘束条件のチ
ェックのみを行なう。モデル毎の拘束条件のなかで、大
分類のための特徴の距離や類似度などの一つのスカラー
量を最初の条件とし、これがあらかじめ定められた範囲
内にあれば、他の特徴を調べることなくモデルとの整合
を許容する。(5) Only the constraint conditions determined for each model are checked. Of the constraint conditions for each model, one scalar quantity such as the distance or similarity of features for large classification is set as the first condition, and if this is within the predetermined range, other features will not be examined. Allows matching with the model.

【００５１】（６）モデル毎に定められた拘束条件のチ
ェックとモデル（クラス）組の識別処理を行なう。該当
のモデル組に関して識別処理が定められていない場合に
は、距離や類似度等の一つのスカラー量によって識別す
る。(6) The constraint condition determined for each model is checked and the model (class) set is identified. When the identification process is not defined for the model set, the identification is performed by one scalar amount such as distance or similarity.

【００５２】（７）モデル毎に定められた拘束条件のチ
ェックは行なわず、モデル組の識別処理だけを行なう。
該当のモデル組の識別処理が定められていない場合に
は、距離や類似度等の一つのスカラー量によって識別す
る。(7) The constraint condition determined for each model is not checked, but only the model group identification process is performed.
When the identification process of the corresponding model set is not defined, identification is performed by one scalar amount such as distance or similarity.

【００５３】[0053]

【発明の効果】以上の説明から理解されるように、本発
明によれば次の効果を得られる。As can be understood from the above description, according to the present invention, the following effects can be obtained.

【００５４】請求項１乃至６の発明は、モデル毎の拘束
条件をもつことにより、認識性能を高めることができ
る。According to the first to sixth aspects of the present invention, the recognition performance can be improved by having the constraint condition for each model.

【００５５】請求項２の発明は、拘束条件の一つとし
て、一般的な距離や類似度などの一つのスカラー量を用
いるので、ほとんどの文字認識技術に利用可能である。Since the invention of claim 2 uses one scalar quantity such as general distance and similarity as one of the constraint conditions, it can be used for most character recognition techniques.

【００５６】請求項３の発明は、拘束条件の一つとし
て、大分類のための特徴の要素（スカラー量あるいはそ
の組み合わせ（ベクトル量）毎に制限範囲を設けるが、
これは請求項２の発明における距離や類似度などのスカ
ラー量が与えられた一文字のパターン全体を評価する量
であるのに対して、局所的な特徴を制限するものである
から、より詳細な制御が可能となるので、認識性能を高
めることができる。According to the third aspect of the present invention, as one of the constraint conditions, a limit range is set for each element (scalar amount or combination thereof (vector amount)) of a feature for large classification.
This is an amount for evaluating the entire pattern of one character to which a scalar amount such as distance or similarity in the invention of claim 2 is given, whereas it limits a local feature, and is therefore more detailed. Since the control becomes possible, the recognition performance can be improved.

【００５７】請求項４の発明は、拘束条件として、大分
類のための特徴のみならず、あらかじめ定められた個別
の特殊な特徴（スカラー量またはベクトル量）を用いる
が、このことは、全てのモデルに対して定められた特徴
以外の、モデルに応じた特徴を導入することに相当し、
したがって柔軟な処理が可能となり、認識性能を高める
ことができる。According to the fourth aspect of the invention, not only the features for the large classification but also the specific special features (scalar amount or vector amount) that are set in advance are used as the constraint conditions. It is equivalent to introducing features according to the model other than the features specified for the model,
Therefore, flexible processing is possible, and recognition performance can be improved.

【００５８】請求項５の発明は、モデル毎の拘束条件の
なかで、距離や類似度などの与えられた一文字のパター
ン全体を評価するスカラー量をまず調べ、これがある一
定の範囲になければ他の個別の特徴を評価するまでもな
く整合を取り消すことが可能であり、こうすることによ
って処理時間の短縮を計ることができる。According to the fifth aspect of the invention, the scalar quantity for evaluating the entire pattern of a given character such as the distance or the similarity is first checked in the constraint condition for each model, and if it is not within a certain range, the other is obtained. It is possible to cancel the matching without evaluating the individual characteristics of each, and by doing so, the processing time can be shortened.

【００５９】請求項６の発明は、請求項５の発明とは逆
に、モデル毎の拘束条件のなかで、距離や類似度などの
与えられた一文字のパターン全体を評価するスカラー量
をまず調べ、これがある一定範囲にあれば他の個別の特
徴を評価するまでもなく整合を許容することが可能であ
り、こうすることによって処理時間の短縮を計ることが
できる。Contrary to the invention of claim 5, in contrast to the invention of claim 5, the amount of scalar for evaluating the entire pattern of one character given the distance, the degree of similarity and the like is first checked in the constraint condition for each model. However, if this is within a certain range, it is possible to allow matching without evaluating other individual characteristics, and by doing so, the processing time can be shortened.

【００６０】請求項７の発明は、同時に整合し得るモデ
ルの組毎に専用の識別処理を定めることが可能であるよ
うにし、専用の識別処理が定められていない場合には、
距離や類似度などの一つのスカラー量によって識別する
という構成をとることにより、識別の難しいモデルの組
の識別性能を高め、かつ不要な識別処理の導入を避ける
ことができ、認識時間と記憶容量の導入を抑えることが
できる。According to the invention of claim 7, it is possible to set a dedicated identification process for each set of models that can be matched at the same time. When the dedicated identification process is not set,
By adopting a configuration in which identification is performed using a single scalar quantity such as distance or similarity, it is possible to improve the identification performance of a set of models that are difficult to identify, and avoid the introduction of unnecessary identification processing. Can be suppressed.

【００６１】請求項８の発明は、同時に整合し得るモデ
ルの組毎に専用の識別処理を定めうえで、モデル毎に拘
束条件をもたせることで、モデル組の種類を減らすこと
が可能であり、したがって辞書の記憶容量の増加を抑え
ることが可能となる。According to the eighth aspect of the present invention, it is possible to reduce the number of types of model sets by defining a dedicated identification process for each set of models that can be matched at the same time and providing a constraint condition for each model. Therefore, it is possible to suppress an increase in the storage capacity of the dictionary.

[Brief description of drawings]

【図１】（ａ）最も基本的な文字認識方法の説明図であ
る。（ｂ）従来の準位相構造解析法の説明図である。FIG. 1A is an explanatory diagram of a most basic character recognition method. (B) It is explanatory drawing of the conventional quasi-phase structure analysis method.

【図２】従来の準位相構造解析法における識別方法の説
明図である。FIG. 2 is an explanatory diagram of an identification method in a conventional quasi-phase structure analysis method.

【図３】パターン空間とクラス／カテゴリの関係の説明
図である。FIG. 3 is an explanatory diagram of a relationship between a pattern space and classes / categories.

【図４】本発明による詳細分類方法の説明図である。FIG. 4 is an explanatory diagram of a detailed classification method according to the present invention.

【図５】本発明を実施するための装置構成の一例を示す
ブロック図である。FIG. 5 is a block diagram showing an example of a device configuration for implementing the present invention.

[Explanation of symbols]

１００画像入力部１０２特徴抽出部１０４構造マッチング部（大分類部）１０６詳細分類部（拘束条件チェック／モデル組識別
処理部）１０８局所的特徴マッチング部１１０辞書部100 image input unit 102 feature extraction unit 104 structure matching unit (major classification unit) 106 detailed classification unit (constraint condition check / model group identification processing unit) 108 local feature matching unit 110 dictionary unit

Claims

[Claims]

1. A character recognition method in which one or more models are defined for each character of a character type to be read, and a character is recognized by matching with these models. A character recognition method characterized in that a constraint condition is checked in a detailed classification of a candidate having a condition and obtained by a large classification.

2. The character recognition method according to claim 1, wherein one of the constraint conditions is one scalar quantity such as a distance or similarity of features for large classification.

3. The character recognition method according to claim 1, wherein, as one of the constraint conditions, a limit range is set for each scalar amount which is an element of a feature for large classification or for each vector amount which is a combination thereof.

4. The character according to claim 1, wherein, as one of the constraint conditions, not only a feature for a large classification, but also a scalar amount or a vector amount, which is a predetermined special feature, is used. Recognition method.

5. Among the constraint conditions for each model, one scalar quantity such as the distance or similarity of features for large classification is set as the first condition, and if this is within a predetermined range, another The character recognition method according to claim 1, wherein the matching with the model is canceled without checking the characteristics.

6. Among the constraint conditions for each model, one scalar quantity such as the distance or similarity of features for large classification is set as a first condition, and if this is within a predetermined range, another The character recognition method according to claim 1, wherein matching with the model is allowed without checking the characteristics.

7. One or more models are defined for each character of the character type to be read, and in a character recognition method for recognizing a character by matching with these models, the character classification methods are classified into two groups. Dedicated identification processing is established as needed for each set of models that can be matched, and in the detailed classification of candidates obtained by large classification, if dedicated identification processing is defined, it is identified by that identification processing The character recognition method is characterized in that when the identification processing is not specified, the identification is performed by one scalar amount such as distance or similarity.

8. A character recognition method in which one or more models are defined for each character of a character type to be read, and a character recognition method for recognizing a character by matching these models restricts each model. In addition to having conditions, each set of models that can be matched at the same time has a dedicated identification process as needed, and in the detailed classification of the candidates obtained by the large classification, the constraint conditions of the corresponding model are checked and When the identification processing is defined, the identification processing is performed, and when the dedicated identification processing is not provided, the identification is performed by one scalar amount such as distance or similarity. Character recognition method described.