JP3475422B2

JP3475422B2 - Image recognition device, image recognition method, and recording medium

Info

Publication number: JP3475422B2
Application number: JP22305497A
Authority: JP
Inventors: 久高山岸; 世紅労
Original assignee: Omron Corp
Current assignee: Omron Corp
Priority date: 1997-08-04
Filing date: 1997-08-04
Publication date: 2003-12-08
Anticipated expiration: 2017-08-04
Also published as: JPH1153475A

Description

Detailed Description of the Invention

【０００１】[0001]

【発明の属する技術分野】本発明は、画像認識装置、画
像認識方法および該画像認識方法を実行させるためのプ
ログラムを記録した記録媒体に係り、特に、複数のカテ
ゴリを持ち、入力データに対する各カテゴリの一致度
（若しくはそれに類する情報）を算出する画像認識にお
いて、一致度が高いとされた結果が正しいものか、間違
ったものか決定できない認識結果を、「認識結果が正解
と確信できるため受け付ける」，「認識結果が正解と確
信できないためリジェクトする」，「認識結果が未確定
のためリジェクトする」或いは「認識結果が不正解のた
めリジェクトする」等といった数種類の結果に確定し、
受付け正解率を向上させ、かつ誤認識率を０［％］に近
づけた画像認識装置、画像認識方法および記録媒体に関
する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to an image recognition apparatus, an image recognition method, and a recording medium having a program for executing the image recognition method recorded therein, and more particularly, it has a plurality of categories and each category for input data. In image recognition that calculates the degree of matching (or similar information), the recognition result that the degree of matching is high cannot be determined whether it is correct or incorrect, "we accept because the recognition result is confident that it is correct" , "Reject because the recognition result is not sure that it is correct", "Reject because the recognition result is undetermined", or "Reject because the recognition result is incorrect", etc.
The present invention relates to an image recognition device, an image recognition method, and a recording medium that improve the acceptance correct rate and make the false recognition rate close to 0 [%].

【０００２】[0002]

【従来の技術】従来、複数のカテゴリを持ち、入力デー
タに対する各カテゴリの一致度（若しくはそれに類する
情報）を算出する手段を備えた画像認識システムにおい
ては、画像認識の認識結果は、該認識結果が正解である
か或いは誤認識であるかを人がチェックするまで確定で
きなかった。2. Description of the Related Art Conventionally, in an image recognition system having a plurality of categories and provided with means for calculating the degree of coincidence (or similar information) of each category with respect to input data, the recognition result of image recognition is It could not be determined until a person checked whether was the correct answer or a false recognition.

【０００３】つまり、従来の画像認識システムにおける
運用方法は、以下のような手順で行われていた。すなわ
ち、まず第１に、画像認識システムで認識対象データを
認識させ、次に第２に、コンピュータ等の画面に認識結
果を表示させ、認識対象と認識結果を人が突き合わせて
確認をし、そして第３には、誤認識している結果を修正
し、さらに第４には、認識対象について最後まで突き合
わせ確認が終了した後、認識結果を確定するというもの
であった。That is, the operation method in the conventional image recognition system is performed by the following procedure. That is, first, the image recognition system is made to recognize the recognition target data, and secondly, the recognition result is displayed on the screen of a computer or the like, and a person matches the recognition target with the recognition result and confirms the result. Thirdly, the erroneously recognized result is corrected, and fourthly, the recognition result is finalized after the matching confirmation of the recognition target is completed to the end.

【０００４】[0004]

【発明が解決しようとする課題】しかしながら、上記従
来の画像認識システムの運用方法にあっては、全文を人
手によって入力することに比べると省力化が図られてい
るものの、第２の人手による認識対象と認識結果の突き
合わせ確認処理に手間がかかるという問題点があった。
例えば、一度に数百、数千の帳票などを認識させる場
合、１枚１枚を認識させた後、人がチェックするのでは
自動化を図るとは言い難いため、画像認識システムの導
入が見送られることも多かった。However, in the operation method of the conventional image recognition system described above, although the labor saving is achieved as compared with the case where the whole sentence is manually input, the second manual recognition is performed. There is a problem that it takes a lot of time to confirm the matching between the object and the recognition result.
For example, when recognizing hundreds or thousands of forms at a time, it is difficult to say that it is automated if a person checks each one after recognizing it, so the introduction of an image recognition system is postponed. There were many things.

【０００５】また、認識性能にも問題があり、従来の画
像認識システムによって認識した結果をノーチェックで
全て受け付けたとすると、正解率は８０〜９５［％］程
度であるのが現状である。すなわち、５〜２０［％］は
誤認識しているにもかかわらず正解として受付けてしま
うのである。Further, there is also a problem with the recognition performance, and if all the results recognized by the conventional image recognition system are accepted without checking, the correct answer rate is about 80 to 95 [%] at present. In other words, 5 to 20% is accepted as a correct answer even though it is erroneously recognized.

【０００６】本発明は、このような従来の問題点に着目
してなされたもので、その目的とするところは、複数の
カテゴリを持ち、入力データに対する各カテゴリの一致
度（若しくはそれに類する情報）を算出する画像認識に
おいて、一致度が高いとされた結果が正しいものか、間
違ったものか決定できない認識結果を、「認識結果が正
解と確信できるため受け付ける」，「認識結果が正解と
確信できないためリジェクトする」，「認識結果が未確
定のためリジェクトする」或いは「認識結果が不正解の
ためリジェクトする」等といった数種類の結果に確定
し、受付け正解率を向上させ、かつ誤認識率を０［％］
に近づけた画像認識装置、画像認識方法および記録媒体
を提供することにある。The present invention has been made by paying attention to such a conventional problem, and an object thereof is to have a plurality of categories and the degree of coincidence of each category with input data (or information similar thereto). In the image recognition that calculates, the recognition result that cannot be determined whether the result with a high degree of coincidence is correct or incorrect is "accepted because the recognition result is confident that it is correct", "recognition result is not correct" Therefore, it will be rejected "," Rejected because the recognition result is undetermined ", or" Rejected because the recognition result is incorrect ", etc., to improve the acceptance rate and correct the false recognition rate. [%]
To provide an image recognition device, an image recognition method and a recording medium which are close to the above.

【０００７】[0007]

【課題を解決するための手段】上記課題を解決するため
に、本出願の請求項１に記載の発明は、複数カテゴリの
認識対象を持ち、登録画像パタンと入力画像パタンとの
パタンマッチングにおける適合度（距離）に基づき入力
画像の認識を行う画像認識装置において、画像認識の認識結果およびその付随情報を含むデータか
ら、認識結果として同じ第１候補のカテゴリを持つデー
タを集め、該集めたデータから第２候補以下の特定の候
補順位における類似カテゴリの出現率の高いものを抽出
して、第１候補カテゴリに対する類似カテゴリの関連を
示す第１前段知識として保持する第１前段知識作成手段
と、入力画像の認識結果として第１候補のカテゴリおよ
び第２候補以下の特定候補順位の類似カテゴリが得られ
るとき、該第１候補のカテゴリおよび第２候補以下の特
定候補順位の類似カテゴリと前記第１前段知識に保持さ
れた関連との比較により、該第１候補のカテゴリが”正
解”か否かを判定する第１前段判定手段とを具備するこ
とを特徴とする画像認識装置にある。In order to solve the above-mentioned problems, the invention according to claim 1 of the present application has recognition objects of a plurality of categories, and conforms in pattern matching between a registered image pattern and an input image pattern. In an image recognition device that recognizes an input image based on a degree (distance), data having the same first candidate category as a recognition result is collected from data including a recognition result of image recognition and its accompanying information, and the collected data A first pre-stage knowledge creating means for extracting a high-presence rate of similar categories in specific candidate ranks below the second candidate and holding the same as first pre-stage knowledge indicating the relationship of the similar categories to the first candidate category; when similar category specification candidate ranking recognition result as follows first candidate category Oyo beauty second candidate input image is obtained, the first candidate catheter It is held in Li and the first pre-knowledge similar category of second candidate following JP constant candidate ranking
The image recognition apparatus is characterized by further comprising a first pre-stage determination means for determining whether or not the category of the first candidate is “correct answer” by comparison with the established association.

【０００８】また、本出願の請求項２に記載の発明は、
複数カテゴリの認識対象を持ち、登録画像パタンと入力
画像パタンとのパタンマッチングにおける適合度（距
離）に基づき入力画像の認識を行う画像認識装置におい
て、画像認識の認識結果およびその付随情報を含むデー
タから、認識結果として同じ第１候補のカテゴリを持つ
データを集め、該集めたデータから第１候補のカテゴリ
における適合度（距離）から、該第１候補のカテゴリ
を”正解”と確定し得る適合度（距離）の大きさを算定
し、第２前段知識として保持する第２前段知識作成手段
と、入力画像の認識結果として第１候補のカテゴリが得
られるとき、該第１候補のカテゴリにおける適合度（距
離）の大きさと前記第２前段知識に保持された適合度
（距離）の大きさとの比較により、該第１候補のカテゴ
リが”正解”か否かを判定する第２前段判定手段とを具
備することを特徴とする画像認識装置にある。The invention according to claim 2 of the present application is
It has recognition targets of multiple categories and inputs registered image patterns.
Goodness of fit in pattern matching with the image pattern (distance
Image recognition device that recognizes the input image based on
Data including the recognition result of image recognition and its accompanying information.
Have the same first candidate category as the recognition result.
Collect data, and from the collected dataFirst candidate category
From the goodness of fit (distance) inCategory of the first candidate
Calculates the degree of goodness of fit (distance) that can be determined as "correct answer"
Second pre-stage knowledge creating means for holding as second pre-stage knowledge
And as the recognition result of the input imageFirst candidateGot the category of
WhenOf the first candidateGoodness of fit (distance
The distance and the second knowledgeRetainedGoodness of fit
By comparing with the size of (distance),First candidate catego
Li"Correct answer"Whether or notA second front-stage judging means for judging
An image recognition device characterized in that it is provided.

【０００９】また、本出願の請求項３に記載の発明は、
複数カテゴリの認識対象を持ち、登録画像パタンと入力
画像パタンとのパタンマッチングにおける適合度（距
離）に基づき入力画像の認識を行う画像認識装置におい
て、画像認識の認識結果およびその付随情報を含むデー
タから、認識結果として同じ第１候補のカテゴリを持つ
データを集め、該集めたデータから第１候補のカテゴリ
と第２候補以下の特定候補順位の類似カテゴリとの適合
度（距離）差から、該第１候補のカテゴリを”正解”と
確定し得る適合度（距離）差の大きさを算定し、第３前
段知識として保持する第３前段知識作成手段と、入力画
像の認識結果として第１候補のカテゴリおよび第２候補
以下の特定候補順位の類似カテゴリが得られるとき、該
第１候補のカテゴリと第２候補以下の特定候補順位の類
似カテゴリとの適合度（距離）差の大きさと前記第３前
段知識に保持された適合度（距離）差の大きさとの比較
により、該第１候補のカテゴリが”正解”か否かを判定
する第３前段判定手段とを具備することを特徴とする画
像認識装置にある。The invention according to claim 3 of the present application is
An image recognition apparatus that has a plurality of categories of recognition targets and recognizes an input image based on a matching degree (distance) in pattern matching between a registered image pattern and an input image pattern, and includes data including a recognition result of the image recognition and its accompanying information. From the data, the data having the same first candidate category as the recognition result is collected, and from the collected data, from the difference in the matching degree (distance) between the category of the first candidate and the similar category of the specific candidate ranks below the second candidate , A third pre-stage knowledge creating unit that calculates the magnitude of the goodness-of-fit (distance) difference that can determine the category of the first candidate as “correct answer” and holds it as the third pre-stage knowledge, and the first candidate as the recognition result of the input image. categories and the second candidate
When similar categories with the following specific candidate ranks are obtained,
First candidate category and adaptability of the kind similar categories following specific candidate ranking second candidate (distance) difference in size between said held in the third front knowledge fit (distance) the size of the difference The image recognition apparatus is characterized by further comprising a third pre-stage determination means for determining whether or not the category of the first candidate is “correct answer” by comparison with the above.

【００１０】また、本出願の請求項４に記載の発明は、
複数カテゴリの認識対象を持ち、登録画像パタンと入力
画像パタンとのパタンマッチングにおける適合度（距
離）に基づき入力画像の認識を行う画像認識装置におい
て、画像認識の認識結果およびその付随情報を含むデー
タをサンプル収集し、あるカテゴリで認識結果が”正
解”とされているデータから、前記データの集合をクラ
スタリングするときの軸を、上位候補に出現率の高いカ
テゴリを該軸として、下位候補に出現率の高いカテゴリ
を該軸として、上位候補に出現率の高いカテゴリおよび
下位候補に出現率の高いカテゴリをあわせて軸として、
中位候補に出現率の高いカテゴリを該軸として、相関係
数の高いカテゴリ同士を該軸として、或いは、相関係数
の低いカテゴリ同士を該軸として、抽出する第１軸抽出
手段と、前記第１軸抽出手段で抽出された軸に基づき前
記データの集合のクラスタリングを行い、該カテゴリの
ＯＫクラスタとして生成し、生成されたＯＫクラスタの
形状を第１後段知識として保持する第１後段知識作成手
段と、入力画像の認識結果として認識結果およびその付
随情報を含むデータが得られるとき、該データが前記第
１後段知識のあるカテゴリに所属するＯＫクラスタの中
に含まれるか否かをチェックして、該ＯＫクラスタに含
まれる場合には該ＯＫクラスタが所属するカテゴリを前
記入力画像の認識結果として”正解”と判定し、何れの
ＯＫクラスタにも含まれていない場合には”未確定”と
判定する第１後段判定手段とを具備することを特徴とす
る画像認識装置にある。The invention according to claim 4 of the present application is
An image recognition apparatus that has a plurality of categories of recognition targets and recognizes an input image based on a matching degree (distance) in pattern matching between a registered image pattern and an input image pattern, and includes data including a recognition result of the image recognition and its accompanying information. From the data whose recognition result is “correct” in a certain category, the axis when clustering the set of the above data is used as the upper candidate, and the category with a high appearance rate is used as the axis and appears in the lower candidate. With the category having a high rate as the axis, the category having a high appearance rate in the upper candidates and the category having a high appearance rate in the lower candidates are combined as an axis,
A first axis extracting means for extracting a category having a high appearance rate in the middle candidates as the axis, categories having a high correlation coefficient as the axis, or categories having a low correlation coefficient as the axis; First post-stage knowledge creation in which clustering of the set of data is performed based on the axes extracted by the first axis extraction means to generate OK clusters of the category, and the shape of the generated OK cluster is retained as first post-stage knowledge. And a data including a recognition result and its accompanying information as a recognition result of the input image, it is checked whether or not the data is included in the OK cluster belonging to the category having the first latter knowledge. If it is included in the OK cluster, the category to which the OK cluster belongs is determined to be the “correct answer” as the recognition result of the input image, and any OK cluster is included. Rare in the case not is in the image recognition apparatus characterized by comprising a first second-stage determination means determines that the "undetermined".

【００１１】また、本出願の請求項５に記載の発明は、
前記第１後段判定手段は、前記データがあるカテゴリに
所属するＯＫクラスタに含まれる場合に、どの程度の確
信を持って”正解”といえるかを示す指標である確信度
を計算する確信度算出手段を具備し、前記データが複数
の別カテゴリに所属するＯＫクラスタに入った場合に
は、前記確信度に基づいてどのカテゴリを”正解”とす
るかを決定することを特徴とする請求項４に記載の画像
認識装置にある。The invention according to claim 5 of the present application is
The first post-stage determination means calculates a certainty factor that is an index indicating how much certainty the "correct answer" can be called when the data is included in an OK cluster belonging to a certain category. A means is provided, and when the data enters an OK cluster belonging to a plurality of different categories, which category is determined as "correct answer" is determined based on the certainty factor. The image recognition device described in 1.

【００１２】また、本出願の請求項６に記載の発明は、
複数カテゴリの認識対象を持ち、登録画像パタンと入力
画像パタンとのパタンマッチングにおける適合度（距
離）に基づき入力画像の認識を行う画像認識装置におい
て、画像認識の認識結果およびその付随情報を含むデー
タをサンプル収集し、あるカテゴリで認識結果が”誤
認”とされているデータから、前記データの集合をクラ
スタリングするときの軸を、上位候補に出現率の高いカ
テゴリを該軸として、下位候補に出現率の高いカテゴリ
を該軸として、上位候補に出現率の高いカテゴリおよび
下位候補に出現率の高いカテゴリをあわせて軸として、
中位候補に出現率の高いカテゴリを該軸として、相関係
数の高いカテゴリ同士を該軸として、或いは、相関係数
の低いカテゴリ同士を該軸として、抽出する第２軸抽出
手段と、前記第２軸抽出手段で抽出された軸に基づき前
記データの集合のクラスタリングを行い、該カテゴリの
ＮＧクラスタとして生成し、生成されたＮＧクラスタの
形状を第２後段知識として保持する第２後段知識作成手
段と、入力画像の認識結果として認識結果およびその付
随情報を含むデータが得られるとき、該データが前記第
２後段知識のあるカテゴリに所属するＮＧクラスタの中
に含まれるか否かをチェックして、該ＮＧクラスタに含
まれる場合には該ＮＧクラスタが所属するカテゴリを前
記入力画像の認識結果として”不正解”と判定し、何れ
のＮＧクラスタにも含まれていない場合には”正解”と
判定する第２後段判定手段とを具備することを特徴とす
る画像認識装置にある。The invention according to claim 6 of the present application is
An image recognition apparatus that has a plurality of categories of recognition targets and recognizes an input image based on a matching degree (distance) in pattern matching between a registered image pattern and an input image pattern, and includes data including a recognition result of the image recognition and its accompanying information. From the data whose recognition result is "misidentified" in a certain category, the axis when clustering the set of the above data is used as the upper candidate, and the category with a high appearance rate is used as the axis and appears in the lower candidate. With the category having a high rate as the axis, the category having a high appearance rate in the upper candidates and the category having a high appearance rate in the lower candidates are combined as an axis,
A second axis extracting means for extracting a category having a high appearance rate as a middle candidate as the axis, categories having a high correlation coefficient as the axis, or categories having a low correlation coefficient as the axis; Second post-stage knowledge creation in which clustering of the set of data is performed based on the axes extracted by the second axis extraction means to generate NG clusters of the category, and the shape of the generated NG cluster is held as second post-stage knowledge. And a data including the recognition result and its accompanying information as the recognition result of the input image, whether or not the data is included in the NG cluster belonging to the category having the second latter knowledge is checked. If it is included in the NG cluster, the category to which the NG cluster belongs is determined to be “incorrect” as the recognition result of the input image, and which NG cluster is included Included when not is the image recognition apparatus characterized by comprising a second second-stage determination means determines that the "correct answer".

【００１３】また、本出願の請求項７に記載の発明は、
複数カテゴリの認識対象を持ち、登録画像パタンと入力
画像パタンとのパタンマッチングにおける適合度（距
離）に基づき入力画像の認識を行う画像認識方法におい
て、画像認識の認識結果およびその付随情報を含むデー
タから、認識結果として同じ第１候補のカテゴリを持つ
データを集め、該集めたデータから第２候補以下の特定
の候補順位における類似カテゴリの出現率の高いものを
抽出して、第１候補カテゴリに対する類似カテゴリの関
連を示す第１前段知識として保持する第１前段知識作成
ステップと、入力画像の認識結果として第１候補のカテ
ゴリおよび第２候補以下の特定順位候補の類似カテゴリ
が得られるとき、該第１候補のカテゴリおよび特定候補
順位の類似カテゴリと前記第１前段知識に保持された関
連との比較により、該第１候補のカテゴリが”正解”か
否かを判定する第１前段判定ステップとを具備すること
を特徴とする画像認識方法にある。The invention according to claim 7 of the present application is
An image recognition method for recognizing an input image based on a matching degree (distance) in pattern matching between a registered image pattern and an input image pattern, which includes recognition objects of a plurality of categories, and data including a recognition result of the image recognition and its accompanying information. Data having the same first candidate category as the recognition result is collected, and those having a high appearance rate of the similar categories in the specific candidate ranks of the second candidate and below are extracted from the collected data to obtain the first candidate category. A first pre-stage knowledge creation step that holds the first pre-stage knowledge that indicates the relationship of similar categories, and a first candidate category as the recognition result of the input image.
When similar categories of specific rank candidates below Gori and the second candidate are obtained, the category of the first candidate and the similar category of the specific candidate rank are compared with the relation held in the first pre-stage knowledge. Whether the category of the first candidate is “correct”
And a first pre-stage determination step for determining whether or not there is an image recognition method.

【００１４】また、本出願の請求項８に記載の発明は、
複数カテゴリの認識対象を持ち、登録画像パタンと入力
画像パタンとのパタンマッチングにおける適合度（距
離）に基づき入力画像の認識を行う画像認識方法におい
て、画像認識の認識結果およびその付随情報を含むデー
タから、認識結果として同じ第１候補のカテゴリを持つ
データを集め、該集めたデータから第１候補のカテゴリ
における適合度（距離）から、該第１候補のカテゴリ
を”正解”と確定し得る適合度（距離）の大きさを算定
し、第２前段知識として保持する第２前段知識作成ステ
ップと、入力画像の認識結果として第１候補のカテゴリ
が得られるとき、該第１候補のカテゴリにおける適合度
（距離）の大きさと前記第２前段知識に保持された適合
度（距離）の大きさとの比較により、該第１候補のカテ
ゴリが”正解”か否かを判定する第２前段判定ステップ
とを具備することを特徴とする画像認識方法にある。The invention according to claim 8 of the present application is
It has recognition targets of multiple categories and inputs registered image patterns.
Goodness of fit in pattern matching with the image pattern (distance
In the image recognition method that recognizes the input image based on
Data including the recognition result of image recognition and its accompanying information.
Have the same first candidate category as the recognition result.
Collect data, and from the collected dataFirst candidate category
From the goodness of fit (distance) inFirst candidate category
Calculates the degree of goodness of fit (distance) that can be determined as "correct answer"
Then, the second pre-stage knowledge creation step that is held as the second pre-stage knowledge is
And the recognition result of the input imageFirst candidateCategories
When is obtained, theFirst candidateGoodness of fit in the category
The size of (distance) and the knowledge of the second pre-stageRetainedConform
By comparing with the magnitude of the degree (distance),First candidate category
Gori"Correct answer"Whether or notSecond pre-stage determination step for determination
An image recognition method characterized by comprising:

【００１５】また、本出願の請求項９に記載の発明は、
複数カテゴリの認識対象を持ち、登録画像パタンと入力
画像パタンとのパタンマッチングにおける適合度（距
離）に基づき入力画像の認識を行う画像認識方法におい
て、画像認識の認識結果およびその付随情報を含むデー
タから、認識結果として同じ第１候補のカテゴリを持つ
データを集め、該集めたデータから第１候補のカテゴリ
と第２候補以下の特定候補順位の類似カテゴリとの適合
度（距離）差から、該第１候補のカテゴリを”正解”と
確定し得る適合度（距離）差の大きさを算定し、第３前
段知識として保持する第３前段知識作成ステップと、入
力画像の認識結果として第１候補のカテゴリおよび第２
候補以下の特定候補順位の類似カテゴリが得られると
き、該第１候補のカテゴリと第２候補以下の特定候補順
位のカテゴリとの適合度（距離）差の大きさと前記第３
前段知識に保持された適合度（距離）差の大きさとの比
較により、該第１候補のカテゴリが”正解”か否かを判
定する第３前段判定ステップとを具備することを特徴と
する画像認識方にある。The invention according to claim 9 of the present application is
An image recognition method for recognizing an input image based on a matching degree (distance) in pattern matching between a registered image pattern and an input image pattern, which includes recognition objects of a plurality of categories, and data including a recognition result of the image recognition and its accompanying information. From the data, the data having the same first candidate category as the recognition result is collected, and from the collected data, from the difference in the matching degree (distance) between the category of the first candidate and the similar category of the specific candidate ranks below the second candidate , A third pre-stage knowledge creating step of calculating the magnitude of a goodness-of-fit (distance) difference that can determine the category of the first candidate as "correct answer", and retaining it as the third pre-stage knowledge, and the first candidate as the recognition result of the input image. categories and the second
When similar categories following specific candidate ranking candidates are obtained, the first candidate of the category and the second candidate following specific candidate sequence
The degree of conformance (distance) difference with the rank category and the third
A third pre-stage determination step of determining whether or not the category of the first candidate is “correct” by comparing with the magnitude of the goodness-of-fit (distance) difference held in the pre-stage knowledge. The image recognition method is characterized by.

【００１６】また、本出願の請求項１０に記載の発明
は、複数カテゴリの認識対象を持ち、登録画像パタンと
入力画像パタンとのパタンマッチングにおける適合度
（距離）に基づき入力画像の認識を行う画像認識方法に
おいて、画像認識の認識結果およびその付随情報を含む
データから、認識結果として同じ第１候補のカテゴリを
持つデータを集め、該集めたデータから第２候補以下の
特定の候補順位における類似カテゴリの出現率の高いも
のを抽出して、第１候補カテゴリに対する類似カテゴリ
の関連を示す第１前段知識として保持する第１前段知識
作成ステップと、画像認識の認識結果およびその付随情
報を含むデータから、認識結果として同じ第１候補のカ
テゴリを持つデータを集め、該集めたデータから第１候
補のカテゴリにおける適合度（距離）から、該第１候補
のカテゴリを”正解”と確定し得る適合度（距離）の大
きさを算定し、第２前段知識として保持する第２前段知
識作成ステップと、画像認識の認識結果およびその付随
情報を含むデータから、認識結果として同じ第１候補の
カテゴリを持つデータを集め、該集めたデータから第１
候補のカテゴリと第２候補以下の特定候補順位の類似カ
テゴリとの適合度（距離）差から、該第１候補のカテゴ
リを”正解”と確定し得る適合度（距離）差の大きさを
算定し、第３前段知識として保持する第３前段知識作成
ステップと、入力画像の認識結果として第１候補のカテ
ゴリおよび第２候補以下の特定順位候補の類似カテゴリ
が得られるとき、該第１候補のカテゴリおよび特定候補
順位の類似カテゴリと前記第１前段知識に保持された関
連との比較により、該第１候補のカテゴリが”正解”か
否かを判定する第１前段判定ステップと、前記第１前段
判定ステップで”正解”と判定された入力画像につい
て、該第１候補のカテゴリにおける適合度（距離）の大
きさと前記第２前段知識に保持された適合度（距離）の
大きさとの比較により、該第１候補のカテゴリが”正
解”か否かを判定する第２前段判定ステップと、前記第
２前段判定ステップで”正解”と判定された入力画像に
ついて、該第１候補のカテゴリと第２候補以下の特定候
補順位のカテゴリとの適合度（距離）差の大きさと前記
第３前段知識に保持された適合度（距離）差の大きさと
の比較により、該第１候補のカテゴリが”正解”か否か
を判定する第３前段判定ステップと、を具備すること、
を特徴とする画像認識方法にある。Further, the invention according to claim 10 of the present application has recognition objects of a plurality of categories, and recognizes the input image based on the matching degree (distance) in the pattern matching between the registered image pattern and the input image pattern. In the image recognition method, data having the same first candidate category as the recognition result is collected from the data including the recognition result of the image recognition and its accompanying information, and from the collected data, the second candidate and the following specific candidates are identified. A first pre-stage knowledge creating step of extracting a high-similarity appearance rate of similar categories in the candidate ranks and holding the same as first pre-stage knowledge indicating a relationship of the similar categories to the first candidate category, a recognition result of image recognition and its attendant From the data including information, data having the same first candidate category as the recognition result is collected, and the first data is collected from the collected data .
Fit in complementary category from (distance), the first candidate
From the data including the second pre-stage knowledge creation step of calculating the degree of goodness of fit (distance) that can determine the category of "correct answer" and holding it as the second pre-stage knowledge, the recognition result of image recognition and its accompanying information. , Collecting data having the same first candidate category as the recognition result, and collecting the first data from the collected data .
The categorization of the first candidate is calculated based on the difference in the degree of matching (distance) between the candidate category and the similar categories in the ranks of the second and lower candidates .
To calculate the size of the goodness of fit (distance) difference can confirm with the Li "correct answer", a third preceding knowledge generating step of holding the third preceding knowledge, catheterization of the first candidate as a recognition result of the input image
When similar categories of specific rank candidates below Gori and the second candidate are obtained, the category of the first candidate and the similar category of the specific candidate rank are compared with the relation held in the first pre-stage knowledge. Whether the category of the first candidate is “correct”
A first pre-determination step of determining whether said the determined input image to "correct" in the first pre-determination step, the size and the second pre knowledge of fit in the category of the first candidate (distance) by comparison with the size of the retained fitness (distance), the category of the first candidate and the second pre-determination step of determining whether "correct", and "correct answer" at the second pre-determination step for the determined input image, category of the first candidate and the second candidate following specific climate
Whether or not the category of the first candidate is “correct” by comparing the magnitude of the difference in fitness (distance) with the category of the complementary rank with the magnitude of the difference in fitness (distance) held in the third pre-stage knowledge
A third pre-stage determination step for determining
The image recognition method is characterized by

【００１７】また、本出願の請求項１１に記載の発明
は、複数カテゴリの認識対象を持ち、登録画像パタンと
入力画像パタンとのパタンマッチングにおける適合度
（距離）に基づき入力画像の認識を行う画像認識方法に
おいて、画像認識の認識結果およびその付随情報を含む
データをサンプル収集し、あるカテゴリで認識結果が”
正解”とされているデータから、前記データの集合をク
ラスタリングするときの軸を、上位候補に出現率の高い
カテゴリを該軸として、下位候補に出現率の高いカテゴ
リを該軸として、上位候補に出現率の高いカテゴリおよ
び下位候補に出現率の高いカテゴリをあわせて軸とし
て、中位候補に出現率の高いカテゴリを該軸として、相
関係数の高いカテゴリ同士を該軸として、或いは、相関
係数の低いカテゴリ同士を該軸として、抽出する第１軸
抽出ステップと、前記第１軸抽出ステップで抽出された
軸に基づき前記データの集合のクラスタリングを行い、
該カテゴリのＯＫクラスタとして生成し、生成されたＯ
Ｋクラスタの形状を第１後段知識として保持する第１後
段知識作成ステップと、入力画像の認識結果として認識
結果およびその付随情報を含むデータが得られるとき、
該データが前記第１後段知識のあるカテゴリに所属する
ＯＫクラスタの中に含まれるか否かをチェックして、該
ＯＫクラスタに含まれる場合には該ＯＫクラスタが所属
するカテゴリを前記入力画像の認識結果として”正解”
と判定し、何れのＯＫクラスタにも含まれていない場合
には”未確定”と判定する第１後段判定ステップとを具
備することを特徴とする画像認識方法にある。The invention according to claim 11 of the present application has a plurality of categories of recognition targets, and recognizes an input image based on a matching degree (distance) in pattern matching between a registered image pattern and an input image pattern. In the image recognition method, data including the recognition result of image recognition and its accompanying information is collected as a sample, and the recognition result is obtained in a certain category.
From the data “correct answer”, the axis when clustering the set of data is set as the upper candidate by using the category with a high appearance rate as the upper candidate as the axis and the category with a higher appearance rate by the lower candidate as the axis. A category with a high appearance rate and a category with a high appearance rate for lower candidates are combined as an axis, a category with a high appearance rate for a middle candidate is used as the axis, categories with high correlation coefficients are used as the axis, or a correlation is obtained. A first axis extracting step of extracting with categories having a low number as the axes, and clustering of the data set based on the axes extracted in the first axis extracting step,
Generated as an OK cluster of the category and generated O
When a first post-stage knowledge creating step of holding the shape of the K cluster as the first post-stage knowledge and data including the recognition result and its accompanying information as the recognition result of the input image are obtained,
It is checked whether or not the data is included in the OK cluster belonging to the category having the first latter knowledge, and if the data is included in the OK cluster, the category to which the OK cluster belongs is included in the input image. "Correct answer" as recognition result
The image recognition method is characterized by further comprising a first post-stage determination step of determining “undetermined” when the image is not included in any of the OK clusters.

【００１８】また、本出願の請求項１２に記載の発明
は、複数カテゴリの認識対象を持ち、登録画像パタンと
入力画像パタンとのパタンマッチングにおける適合度
（距離）に基づき入力画像の認識を行う画像認識方法に
おいて、画像認識の認識結果およびその付随情報を含む
データをサンプル収集し、あるカテゴリで認識結果が”
誤認”とされているデータから、前記データの集合をク
ラスタリングするときの軸を、上位候補に出現率の高い
カテゴリを該軸として、下位候補に出現率の高いカテゴ
リを該軸として、上位候補に出現率の高いカテゴリおよ
び下位候補に出現率の高いカテゴリをあわせて軸とし
て、中位候補に出現率の高いカテゴリを該軸として、相
関係数の高いカテゴリ同士を該軸として、或いは、相関
係数の低いカテゴリ同士を該軸として、抽出する第２軸
抽出ステップと、前記第２軸抽出ステップで抽出された
軸に基づき前記データの集合のクラスタリングを行い、
該カテゴリのＮＧクラスタとして生成し、生成されたＮ
Ｇクラスタの形状を第２後段知識として保持する第２後
段知識作成ステップと、入力画像の認識結果として認識
結果およびその付随情報を含むデータが得られるとき、
該データが前記第２後段知識のあるカテゴリに所属する
ＮＧクラスタの中に含まれるか否かをチェックして、該
ＮＧクラスタに含まれる場合には該ＮＧクラスタが所属
するカテゴリを前記入力画像の認識結果として”不正
解”と判定し、何れのＮＧクラスタにも含まれていない
場合には”正解”と判定する第２後段判定ステップとを
具備することを特徴とする画像認識方法にある。The invention according to claim 12 of the present application has a plurality of categories of recognition targets, and recognizes the input image based on the matching degree (distance) in the pattern matching between the registered image pattern and the input image pattern. In the image recognition method, data including the recognition result of image recognition and its accompanying information is collected as a sample, and the recognition result is obtained in a certain category.
From the data that is regarded as “misidentification”, the axis when clustering the set of the above data is set as the upper candidate by using the category having a high appearance rate as the upper candidate as the axis and the category having the higher appearance rate by the lower candidate as the axis. A category with a high appearance rate and a category with a high appearance rate for lower candidates are combined as an axis, a category with a high appearance rate for a middle candidate is used as the axis, categories with high correlation coefficients are used as the axis, or a correlation is obtained. A second axis extracting step of extracting with a category having a low number as the axis, and clustering of the set of data based on the axis extracted in the second axis extracting step,
Generated as an NG cluster of the category and generated N
When a second post-stage knowledge creating step for holding the shape of the G cluster as the second post-stage knowledge and data including the recognition result and its accompanying information as the recognition result of the input image are obtained,
It is checked whether or not the data is included in the NG cluster belonging to the category having the second latter knowledge, and if the data is included in the NG cluster, the category to which the NG cluster belongs is included in the input image. The image recognition method is characterized by comprising a second post-stage determination step of determining "incorrect answer" as a recognition result, and determining "correct answer" when it is not included in any NG cluster.

【００１９】また、本出願の請求項１３に記載の発明
は、前記第１後段判定ステップは、前記データがあるカ
テゴリに所属するＯＫクラスタに含まれる場合に、どの
程度の確信を持って”正解”といえるかを示す指標であ
る確信度を計算する確信度算出ステップを具備し、前記
データが複数の別カテゴリに所属するＯＫクラスタに入
った場合には、前記確信度に基づいてどのカテゴリを”
正解”とするかを決定することを特徴とする請求項１１
に記載の画像認識方法にある。In the invention according to claim 13 of the present application, in the first latter-stage determination step, when the data is included in an OK cluster belonging to a certain category, with certainty of confidence, "correct answer" is obtained. If the data is in an OK cluster belonging to a plurality of different categories, which category is calculated based on the certainty factor. ”
12. It is determined whether or not the answer is "correct".
The image recognition method described in 1.

【００２０】また、本出願の請求項１４に記載の発明
は、複数カテゴリの認識対象を持ち、登録画像パタンと
入力画像パタンとのパタンマッチングにおける適合度
（距離）に基づき入力画像の認識を行う画像認識方法
を、コンピュータに実行させるためのプログラムを記録
した記録媒体において、画像認識の認識結果およびその
付随情報を含むデータから、認識結果として同じ第１候
補のカテゴリを持つデータを集め、該集めたデータから
第２候補以下の特定の候補順位における類似カテゴリの
出現率の高いものを抽出して、第１候補カテゴリに対す
る類似カテゴリの関連を示す第１前段知識として保持す
る第１前段知識作成ステップと、入力画像の認識結果と
して第１候補のカテゴリおよび第２候補以下の特定候補
順位の類似カテゴリが得られるとき、該第１候補のカテ
ゴリおよび第２候補以下の特定候補順位の類似カテゴリ
と前記第１前段知識に保持された関連との比較により、
該第１候補のカテゴリが”正解”か否かを判定する第１
前段判定ステップとを具備することを特徴とする記録媒
体記録媒体にある。The invention according to claim 14 of the present application has a plurality of categories of recognition targets, and recognizes an input image based on a matching degree (distance) in pattern matching between a registered image pattern and an input image pattern. In a recording medium in which a program for causing a computer to execute the image recognition method is recorded, data having the same first candidate category as the recognition result is collected from the data including the recognition result of the image recognition and its accompanying information, and the collection is performed. From the data
A first pre-stage knowledge creating step of extracting a high-similarity appearance rate of similar categories in specific candidate ranks less than or equal to the second candidate , and retaining the same as first pre-stage knowledge indicating the relationship of the similar categories to the first candidate category; As a recognition result of the image, the category of the first candidate and the specific candidates below the second candidate
When similar categories of ranks are obtained, by comparing the similar categories of the first candidate category and specific candidate ranks equal to or lower than the second candidate with the relationships held in the first pre-stage knowledge,
First to determine whether the category of the first candidate is “correct”
A recording medium, characterized in that the recording medium comprises a first-stage determination step.

【００２１】また、本出願の請求項１５に記載の発明
は、複数カテゴリの認識対象を持ち、登録画像パタンと
入力画像パタンとのパタンマッチングにおける適合度
（距離）に基づき入力画像の認識を行う画像認識方法
を、コンピュータに実行させるためのプログラムとして
記録した記録媒体において、画像認識の認識結果および
その付随情報を含むデータから、認識結果として同じ第
１候補のカテゴリを持つデータを集め、該集めたデータ
から第１候補のカテゴリにおける適合度（距離）から、
該第１候補のカテゴリを”正解”と確定し得る適合度
（距離）の大きさを算定し、第２前段知識として保持す
る第２前段知識作成ステップと、入力画像の認識結果と
して第１候補のカテゴリが得られるとき、該第１候補の
カテゴリおける適合度（距離）の大きさと前記第２前段
知識に保持された適合度（距離）の大きさとの比較によ
り、該第１候補のカテゴリが”正解”か否かを判定する
第２前段判定ステップとを具備することを特徴とする記
録媒体にある。Further, the invention according to claim 15 of the present application has a plurality of categories of recognition targets, and recognizes the input image based on the matching degree (distance) in the pattern matching between the registered image pattern and the input image pattern. In a recording medium in which the image recognition method is recorded as a program for causing a computer to execute, data having the same first candidate category as the recognition result is collected from the data including the recognition result of the image recognition and its accompanying information, and the collection is performed. From the goodness of fit (distance) in the first candidate category ,
The category of the first candidate to calculate the size of the goodness of fit may be determined as "correct" (distance), a second pre-knowledge generating step of holding the second pre-knowledge, the first candidate as a recognition result of the input image when the category is obtained by comparison with the size of category definitive fit of the first candidate (distance) measurement and the second pre-knowledge held the fit (distance), the first A second pre-stage determination step of determining whether or not the candidate category is “correct answer” is provided.

【００２２】また、本出願の請求項１６に記載の発明
は、複数カテゴリの認識対象を持ち、登録画像パタンと
入力画像パタンとのパタンマッチングにおける適合度
（距離）に基づき入力画像の認識を行う画像認識方法
を、コンピュータに実行させるためのプログラムとして
記録した記録媒体において、画像認識の認識結果および
その付随情報を含むデータから、認識結果として同じ第
１候補のカテゴリを持つデータを集め、該集めたデータ
から第１候補のカテゴリと第２候補以下の特定候補順位
の類似カテゴリとの適合度（距離）差から、該第１候補
のカテゴリを”正解”と確定し得る適合度（距離）差の
大きさを算定し、第３前段知識として保持する第３前段
知識作成ステップと、入力画像の認識結果として第１候
補のカテゴリおよび第２候補以下の特定候補順位の類似
カテゴリが得られるとき、該第１候補のカテゴリと第２
候補以下の特定候補順位の適合度（距離）差の大きさと
前記第３前段知識における適合度（距離）差の大きさと
の比較により、該第１候補のカテゴリを”正解”また
は”不明”と判定する第３前段判定ステップとを具備す
ることを特徴とする記録媒体にある。The invention according to claim 16 of the present application has recognition objects of a plurality of categories, and recognizes an input image based on a matching degree (distance) in pattern matching between a registered image pattern and an input image pattern. In a recording medium in which the image recognition method is recorded as a program for causing a computer to execute, data having the same first candidate category as the recognition result is collected from the data including the recognition result of the image recognition and its accompanying information, and the collection is performed. The first candidate category and the second and lower candidate ranks
From the similarity (distance) with the similar category of
The third pre-stage knowledge creation step of calculating the magnitude of the goodness-of-fit (distance) difference that can determine the category of "correct answer" as the third pre-stage knowledge, and the first result as the recognition result of the input image.
When similar category of auxiliary categories and second candidate following specific candidate ranking is obtained, and the category of the first candidate second
The category of the first candidate is determined to be “correct answer” or “unknown” by comparing the magnitude of the difference in fitness (distance) of the specific candidate ranks below the candidate with the magnitude of the difference in fitness (distance) in the third preceding knowledge. And a third pre-stage determination step for making a determination.

【００２３】また、本出願の請求項１７に記載の発明
は、複数カテゴリの認識対象を持ち、登録画像パタンと
入力画像パタンとのパタンマッチングにおける適合度
（距離）に基づき入力画像の認識を行う画像認識方法
を、コンピュータに実行させるためのプログラムとして
記録した記録媒体において、画像認識の認識結果および
その付随情報を含むデータから、認識結果として同じ第
１候補のカテゴリを持つデータを集め、該集めたデータ
から第２候補以下の特定の候補順位における類似カテゴ
リの出現率の高いものを抽出して、第１候補カテゴリに
対する類似カテゴリの関連を示す第１前段知識として保
持する第１前段知識作成ステップと、画像認識の認識結
果およびその付随情報を含むデータから、認識結果とし
て同じ第１候補のカテゴリを持つデータを集め、該集め
たデータから第１候補のカテゴリにおける適合度（距
離）から、該第１候補のカテゴリを”正解”と確定し得
る適合度（距離）の大きさを算定し、第２前段知識とし
て保持する第２前段知識作成ステップと、画像認識の認
識結果およびその付随情報を含むデータから、認識結果
として同じ第１候補のカテゴリを持つデータを集め、該
集めたデータから第１候補のカテゴリと第２候補以下の
特定候補順位の類似カテゴリとの適合度（距離）差か
ら、該第１候補のカテゴリを”正解”と確定し得る適合
度（距離）差の大きさを算定し、第３前段知識として保
持する第３前段知識作成ステップと、入力画像の認識結
果として第１候補のカテゴリおよび第２候補以下の特定
候補順位の類似カテゴリが得られるとき、該第１候補の
カテゴリおよび第２候補以下の特定候補順位の類似カテ
ゴリと前記第１前段知識に保持された関連との比較によ
り、該第１候補のカテゴリが”正解”か否かを判定する
第１前段判定ステップと、前記第１前段判定ステップ
で”正解”と判定された入力画像について、該第１候補
のカテゴリにおける適合度（距離）の大きさと前記第２
前段知識に保持された適合度（距離）の大きさとの比較
により、該第１候補のカテゴリが”正解”か否かを判定
する第２前段判定ステップと、前記第２前段判定ステッ
プで”正解”と判定された入力画像について、該第１候
補のカテゴリと第２候補以下の特定候補順位の類似カテ
ゴリとの適合度（距離）差の大きさと前記第３前段知識
に保持された適合度（距離）差の大きさとの比較によ
り、該第１候補のカテゴリが”正解”か否かを判定する
第３前段判定ステップと、を具備すること、を特徴とす
る記録媒体にある。The invention according to claim 17 of the present application has a plurality of categories of recognition targets, and recognizes an input image based on a matching degree (distance) in pattern matching between a registered image pattern and an input image pattern. In a recording medium in which the image recognition method is recorded as a program for causing a computer to execute, data having the same first candidate category as the recognition result is collected from the data including the recognition result of the image recognition and its accompanying information, and the collection is performed. A first pre-stage knowledge creating step of extracting a high-presence occurrence rate of similar categories in specific candidate ranks below the second candidate from the obtained data, and retaining the same as first pre-stage knowledge indicating the relationship of similar categories to the first candidate category And the data including the recognition result of the image recognition and the accompanying information, the categorization of the same first candidate as the recognition result. Data were collected with adaptability in category of the first candidate from the collected data from the (distance), and calculate the magnitude of the goodness of fit may be determined to "correct" the category of the first candidate (distance), Data having the same first candidate category as the recognition result is collected from the second pre-stage knowledge creating step held as the second pre-stage knowledge and the data including the recognition result of the image recognition and its accompanying information, and the One candidate category and the second and following candidates
From the difference in the degree of matching (distance) with the similar category of the specific candidate order, the magnitude of the difference in the degree of matching (distance) that can determine the category of the first candidate as the “correct answer” is calculated and stored as the third pre-stage knowledge. Third pre-stage knowledge creation step, and identification of the first candidate category and second candidate and below as the recognition result of the input image
When similar category of candidate ranking is obtained by comparison with related held in the first pre-knowledge similar category of the first candidate category and the second candidate following specific candidate ranking, the first candidate category There a first pre-determination step of determining whether "correct", the determination input image to "correct" in the first pre-determination step, the first candidate
Of the goodness of fit (distance) in the category of
By comparison with the size of the preceding knowledge held the fit (distance), a second pre-determination step of determining whether the category of the first candidate is "correct", in the second pre-determination step "correct answer for "a determined input image, the first climate
By comparing the magnitude of the goodness- of- fit (distance) difference between the complementary category and the similar category of the second candidate or lower with a specific candidate rank with the magnitude of the goodness- of- fit (distance) difference held in the third preceding knowledge, And a third pre-stage determination step of determining whether or not one candidate category is “correct answer”.

【００２４】また、本出願の請求項１８に記載の発明
は、複数カテゴリの認識対象を持ち、登録画像パタンと
入力画像パタンとのパタンマッチングにおける適合度
（距離）に基づき入力画像の認識を行う画像認識方法
を、コンピュータに実行させるためのプログラムとして
記録した記録媒体において、画像認識の認識結果および
その付随情報を含むデータをサンプル収集し、あるカテ
ゴリで認識結果が”正解”とされているデータから、前
記データの集合をクラスタリングするときの軸を、上位
候補に出現率の高いカテゴリを該軸として、下位候補に
出現率の高いカテゴリを該軸として、上位候補に出現率
の高いカテゴリおよび下位候補に出現率の高いカテゴリ
をあわせて軸として、中位候補に出現率の高いカテゴリ
を該軸として、相関係数の高いカテゴリ同士を該軸とし
て、或いは、相関係数の低いカテゴリ同士を該軸とし
て、抽出する第１軸抽出ステップと、前記第１軸抽出ス
テップで抽出された軸に基づき前記データの集合のクラ
スタリングを行い、該カテゴリのＯＫクラスタとして生
成し、生成されたＯＫクラスタの形状を第１後段知識と
して保持する第１後段知識作成ステップと、入力画像の
認識結果として認識結果およびその付随情報を含むデー
タが得られるとき、該データが前記第１後段知識のある
カテゴリに所属するＯＫクラスタの中に含まれるか否か
をチェックして、該ＯＫクラスタに含まれる場合には該
ＯＫクラスタが所属するカテゴリを前記入力画像の認識
結果として”正解”と判定し、何れのＯＫクラスタにも
含まれていない場合には”未確定”と判定する第１後段
判定ステップとを具備することを特徴とする記録媒体に
ある。The invention according to claim 18 of the present application has a plurality of categories of recognition targets, and recognizes an input image based on a matching degree (distance) in pattern matching between a registered image pattern and an input image pattern. Data in which a recording result is recorded as a program for causing a computer to execute the image recognition method, data including the recognition result of the image recognition and its accompanying information is sampled, and the recognition result is "correct" in a certain category. From the above, the axis when clustering the data set is a category with a high appearance rate in the upper candidates, a category with a high appearance rate in the lower candidates as the axis, and a category with a high appearance rate in the upper candidates and a lower rank. Correlation coefficient with the category with a high appearance rate as candidates as the axis and the category with a high appearance rate as the middle candidate as the axis A first axis extracting step of extracting categories having a high category as the axis or categories having a low correlation coefficient as the axis, and clustering of the data set based on the axes extracted in the first axis extracting step. Data including the recognition result and its accompanying information as the recognition result of the input image, and the first second-stage knowledge creating step of generating the OK cluster of the category and retaining the shape of the generated OK cluster as the first second-stage knowledge. When it is obtained, it is checked whether or not the data is included in the OK cluster belonging to the category having the first latter knowledge, and if it is included in the OK cluster, the category to which the OK cluster belongs Is determined as the “correct answer” as the recognition result of the input image, and is determined as “undetermined” if it is not included in any OK cluster. In recording medium characterized by comprising a first second-stage determination step.

【００２５】また、本出願の請求項１９に記載の発明
は、複数カテゴリの認識対象を持ち、登録画像パタンと
入力画像パタンとのパタンマッチングにおける適合度
（距離）に基づき入力画像の認識を行う画像認識方法
を、コンピュータに実行させるためのプログラムとして
記録した記録媒体において、画像認識の認識結果および
その付随情報を含むデータをサンプル収集し、あるカテ
ゴリで認識結果が”誤認”とされているデータから、前
記データの集合をクラスタリングするときの軸を、上位
候補に出現率の高いカテゴリを該軸として、下位候補に
出現率の高いカテゴリを該軸として、上位候補に出現率
の高いカテゴリおよび下位候補に出現率の高いカテゴリ
をあわせて軸として、中位候補に出現率の高いカテゴリ
を該軸として、相関係数の高いカテゴリ同士を該軸とし
て、或いは、相関係数の低いカテゴリ同士を該軸とし
て、抽出する第２軸抽出ステップと、前記第２軸抽出ス
テップで抽出された軸に基づき前記データの集合のクラ
スタリングを行い、該カテゴリのＮＧクラスタとして生
成し、生成されたＮＧクラスタの形状を第２後段知識と
して保持する第２後段知識作成ステップと、入力画像の
認識結果として認識結果およびその付随情報を含むデー
タが得られるとき、該データが前記第２後段知識のある
カテゴリに所属するＮＧクラスタの中に含まれるか否か
をチェックして、該ＮＧクラスタに含まれる場合には該
ＮＧクラスタが所属するカテゴリを前記入力画像の認識
結果として”不正解”と判定し、何れのＮＧクラスタに
も含まれていない場合には”正解”と判定する第２後段
判定ステップとを具備することを特徴とする記録媒体に
ある。The invention according to claim 19 of the present application has a plurality of categories of recognition targets, and recognizes the input image based on the matching degree (distance) in the pattern matching between the registered image pattern and the input image pattern. Data in which a data including the recognition result of image recognition and its accompanying information is sampled on a recording medium in which the image recognition method is recorded as a program to be executed by a computer, and the recognition result is "misidentification" in a certain category. From the above, the axis when clustering the data set is a category with a high appearance rate in the upper candidates, a category with a high appearance rate in the lower candidates as the axis, and a category with a high appearance rate in the upper candidates and a lower rank. Correlation coefficient with the category with a high appearance rate as candidates as the axis and the category with a high appearance rate as the middle candidate as the axis A second axis extracting step of extracting with categories having a high category as the axis or categories having a low correlation coefficient as the axis, and clustering of the data set based on the axes extracted in the second axis extracting step. A second post-stage knowledge creating step of generating the NG cluster of the category and holding the shape of the generated NG cluster as the second post-stage knowledge, and the data including the recognition result and its accompanying information as the recognition result of the input image. When it is obtained, it is checked whether or not the data is included in the NG cluster belonging to the category having the second latter knowledge, and if it is included in the NG cluster, the category to which the NG cluster belongs Is determined as "incorrect answer" as the recognition result of the input image, and is determined as "correct answer" if it is not included in any NG cluster. In recording medium characterized by comprising a second second-stage determination step.

【００２６】また、本出願の請求項２０に記載の発明
は、前記第１後段判定ステップは、前記データがあるカ
テゴリに所属するＯＫクラスタに含まれる場合に、どの
程度の確信を持って”正解”といえるかを示す指標であ
る確信度を計算する確信度算出ステップを具備し、前記
データが複数の別カテゴリに所属するＯＫクラスタに入
った場合には、前記確信度に基づいてどのカテゴリを”
正解”とするかを決定することを特徴とする請求項１８
に記載の記録媒体にある。Further, in the invention according to claim 20 of the present application, in the first latter-stage determination step, when the data is included in an OK cluster belonging to a certain category, with a certain degree of confidence, the "correct answer" is given. If the data is in an OK cluster belonging to a plurality of different categories, which category is calculated based on the certainty factor. ”
20. It is determined whether the answer is "correct".
The recording medium described in 1.

【００２７】そして、この請求項１、７または１４に記
載の発明によれば、第１前段知識作成手段（第１前段知
識作成ステップ）により、画像認識の認識結果およびそ
の付随情報を含むデータから、認識結果として同じ第１
候補のカテゴリを持つデータを集め、該集めたデータか
ら第２候補以下の特定の候補順位における類似カテゴリ
の出現率の高いものを抽出して、第１候補カテゴリに対
する類似カテゴリの関連を示す第１前段知識として保持
し、第１前段判定手段（第１前段判定ステップ）では、
入力画像の認識結果として第１候補のカテゴリおよび第
２候補以下の特定候補順位の類似カテゴリが得られると
き、該第１候補のカテゴリおよび第２候補以下の特定候
補順位の類似カテゴリと前記第１前段知識に保持された
関連との比較により、該第１候補のカテゴリが”正解”
か否かを判定するようにしている。According to the invention described in claim 1, 7 or 14, the first pre-stage knowledge creating means (first pre-stage knowledge creating step) converts the data including the recognition result of the image recognition and its accompanying information. , The same as the recognition result 1st
First, a collection of data having candidate categories, a collection of data having a high appearance rate of similar categories in a specific candidate rank of the second candidate and below, from the collected data, showing a relation of the similar categories to the first candidate category. The first-stage determination means (first first-stage determination step) holds the knowledge as the first-stage knowledge.
As the recognition result of the input image, the first candidate category and the first candidate
When a similar category with a specific candidate rank of 2 candidates or less is obtained, the category of the first candidate and the similar category with a specific candidate rank of 2 or less candidates and the association held in the first pre-stage knowledge are obtained. By comparison, the category of the first candidate is “correct”
Whether or not it is decided.

【００２８】ここで、「データ」とは、画像認識システ
ムにおける認識結果とそれに付随する情報を指す。また
「カテゴリ」とは、同じ文字の集まりを指す。また、本
発明が適用される画像認識システムは、例えば、類似し
たカテゴリのうち、登録画像パタンと入力画像パタンと
のパタンマッチング（適合演算）における適合度（距
離）が最も高いものを認識システムの認識結果（第１候
補）として出力し、該認識結果について”正解”、”正
解と確信できない”、”不正解”を確定するものであ
る。尚、適合度の演算方法によっては、認識結果として
適合度（距離）が最も低いものを出力する場合もある。
また、本発明では、画像認識システムの認識結果とそれ
に付随する情報を含むデータを得て、該データを基に認
識結果の確定処理を行うが、「付随情報」とは、類似カ
テゴリの候補とその候補で演算されたカテゴリ辞書（登
録画像パタン）への距離である。Here, "data" refers to the recognition result in the image recognition system and the information accompanying it. A "category" refers to a group of the same characters. Further, the image recognition system to which the present invention is applied, for example, of the similar categories, the one having the highest degree of matching (distance) in the pattern matching (matching calculation) between the registered image pattern and the input image pattern is recognized. The recognition result is output as a recognition result (first candidate), and "correct answer", "cannot be convinced correct", and "incorrect answer" are determined for the recognition result. Depending on the calculation method of the goodness of fit, the one having the lowest goodness of fit (distance) may be output as the recognition result.
Further, in the present invention, the data including the recognition result of the image recognition system and the information associated therewith is obtained, and the recognition result confirmation process is performed based on the data. The “accompanying information” is a candidate for a similar category. It is the distance to the category dictionary (registered image pattern) calculated by the candidate.

【００２９】つまり、本発明の構成は、画像認識システ
ムにおいて、画像認識手段の後段に配置されるものであ
る。That is, the structure of the present invention is arranged in the image recognition system after the image recognition means.

【００３０】本発明によれば、正解として確定できたも
のについては人がチェックする必要がなくなり、正解と
確定できなかったものについてのみチェックするだけで
よくなることから、受付け誤認識率を低下させることが
でき、画像認識システムにおける省力化をさらに図るこ
とができる。According to the present invention, it is not necessary for a person to check what has been confirmed as a correct answer, and it suffices to check only those that cannot be determined as a correct answer, thus reducing the false recognition rate of acceptance. Therefore, labor saving in the image recognition system can be further achieved.

【００３１】また、請求項２、８または１５に記載の発
明によれば、第２前段知識作成手段（第２前段知識作成
ステップ）により、画像認識の認識結果およびその付随
情報を含むデータから、認識結果として同じ第１候補の
カテゴリを持つデータを集め、該集めたデータから第１
候補のカテゴリにおける適合度（距離）から、該第１候
補のカテゴリを”正解”と確定し得る適合度（距離）の
大きさを算定し、第２前段知識として保持し、第２前段
判定手段（第２前段判定ステップ）では、入力画像の認
識結果として第１候補のカテゴリが得られるとき、該第
１候補のカテゴリにおける適合度（距離）の大きさと前
記第２前段知識に保持された適合度（距離）の大きさと
の比較により、該第１候補のカテゴリが”正解”か否か
を判定するようにしている。これにより、正解として確
定できたものについては人がチェックする必要がなくな
り、正解と確定できなかったものについてのみチェック
するだけでよくなることから、受付け誤認識率を低下さ
せることができ、画像認識システムにおける省力化をさ
らに図ることができる。Further, according to the present invention as set forth in claim 2, 8 or 15, the second pre-stage knowledge creating means (second pre-stage knowledge creating step) converts the data including the recognition result of the image recognition and the accompanying information thereof. Data having the same first candidate category as the recognition result is collected, and the first data is collected from the collected data .
Fit in the category of the candidate from (distance), the first climate
The magnitude of the goodness of fit (distance) that can determine the complementary category as "correct answer" is calculated and held as the second pre-stage knowledge, and the second pre-stage determination means (second pre-stage determination step) recognizes the input image recognition result. when the category of the first candidate is obtained as, the first
By comparison with the size of one fit in the candidate category (distance) measurement and the goodness of fit held in the second pre-knowledge (distance), the category of the first candidate is whether "correct"
So that to determine. As a result, it is not necessary for a person to check what has been confirmed as a correct answer, and it suffices to check only those that cannot be determined as a correct answer. It is possible to further reduce labor.

【００３２】また、請求項３、９または１６に記載の発
明によれば、第３前段知識作成手段（第３前段知識作成
ステップ）により、画像認識の認識結果およびその付随
情報を含むデータから、認識結果として同じ第１候補の
カテゴリを持つデータを集め、該集めたデータから第１
候補のカテゴリと第２候補以下の特定候補順位の類似カ
テゴリとの適合度（距離）差から、該第１候補のカテゴ
リを”正解”と確定し得る適合度（距離）差の大きさを
算定して、第３前段知識補のとして保持し、第３前段判
定手段（第３前段判定ステップ）では、入力画像の認識
結果として第１候補のカテゴリと第２候補以下の特定候
補順位の類似カテゴリが得られるとき、該第１候補のカ
テゴリと第２候補以下の特定候補順位の類似カテゴリと
の適度（距離）差の大きさと前記第３前段知識に保持さ
れた適合度（距離）差の大きさとの比較により、該第１
候補のカテゴリが”正解”か否かを判定するようにして
いる。これにより、正解として確定できたものについて
は人がチェックする必要がなくなり、正解と確定できな
かったものについてのみチェックするだけでよくなるこ
とから、受付け誤認識率を低下させることができ、画像
認識システムにおける省力化をさらに図ることができ
る。Further, according to the invention described in claim 3, 9 or 16, the third pre-stage knowledge creating means (the third pre-stage knowledge creating step) is used to extract the data including the recognition result of the image recognition and the accompanying information thereof. Data having the same first candidate category as the recognition result is collected, and the first data is collected from the collected data .
The categorization of the first candidate is calculated based on the difference in the degree of matching (distance) between the candidate category and the similar categories in the ranks of the second and lower candidates .
Is calculated as the magnitude of the difference in fitness (distance) that can be determined as the “correct answer”, and is stored as the third pre-stage knowledge complement. In the third pre-stage determination means (third pre-stage determination step), As a recognition result, the category of the first candidate and the identification signs below the second candidate
When similar category of auxiliary rank is obtained, and similar categories of mosquito categories and the following specific candidate ranking second candidate of the first candidate
Moderate (distance) of the holding on the size and the third front knowledge of the difference between
By comparison with the size of the fit (distance) difference, said first
It is determined whether or not the candidate category is “correct answer”. As a result, it is not necessary for a person to check what has been confirmed as a correct answer, and it suffices to check only those that cannot be determined as a correct answer. It is possible to further reduce labor.

【００３３】また、請求項１０または１７に記載の発明
によれば、予め、第１前段知識作成手段（第１前段知識
作成ステップ）により第１前段知識（第１候補カテゴリ
に対する類似カテゴリの関連）を、第２前段知識作成手
段（第２前段知識作成ステップ）により第２前段知識
（特定順位の候補を”正解”と確定し得る適合度（距
離）の大きさ）を、第３前段知識作成手段（第３前段知
識作成ステップ）により第３前段知識（第１候補を”正
解”と確定し得る適合度（距離）差の大きさ）を、それ
ぞれ作成しておき、第１前段判定手段（第１前段判定ス
テップ）、第２前段判定手段（第２前段判定ステップ）
および第３前段判定手段（第３前段判定ステップ）で
は、これら第１、第２および第３前段知識を用いて、入
力画像の認識結果について、”正解”か否かを判別する
ようにしている。これにより、受付け誤認識率を低下さ
せることができ、画像認識システムにおける省力化をさ
らに図ることができる。また、帳票読み込みの自動化を
図ることができる。すなわち、例えば、１枚の帳票を認
識対象エリアとするとき、確定結果がすべて正解の場合
はチェックを行うことなく結果を保存することが可能と
なり、逆に、正解と確信できるもの以外の結果が１つで
も含まれる場合は、その帳票全体の認識結果を保存せず
に、後に人の入力をまとめて求めるといった運用が可能
になる。In addition, according to the invention described in claim 10 or 17, the first pre-stage knowledge (relation of similar category to the first candidate category) is performed in advance by the first pre-stage knowledge producing means (first pre-stage knowledge producing step). The second pre-stage knowledge creating step (second pre-stage knowledge creating step) produces the second pre-stage knowledge (the degree of goodness of fit (distance) at which a candidate of a specific rank can be determined as "correct answer") to the third pre-stage knowledge creating By means (third pre-stage knowledge creating step), third pre-stage knowledge (size of difference in fitness (distance) at which the first candidate can be determined as "correct answer") is created, and first pre-stage determination means ( First front stage determination step), second front stage determination means (second front stage determination step)
And the third pre-stage determination means (third pre-stage determination step) uses the first, second and third pre-stage knowledge to determine whether or not the recognition result of the input image is a "correct answer" . I am trying. As a result, the false recognition rate of acceptance can be reduced, and labor saving in the image recognition system can be further achieved. In addition, it is possible to automate the reading of forms. That is, for example, when one form is used as the recognition target area, if all confirmed results are correct answers, it is possible to save the results without checking, and conversely, results other than those that can be convinced to be correct answers When even one is included, it is possible to perform an operation such that the input of a person is collectively requested later without saving the recognition result of the entire form.

【００３４】また、請求項４、１１または１８に記載の
発明によれば、第１軸抽出手段（第１軸抽出ステップ）
により、画像認識の認識結果およびその付随情報を含む
データをサンプル収集し、あるカテゴリで認識結果が”
正解”とされているデータから、前記データの集合をク
ラスタリングするときの軸を抽出し、第１後段知識作成
手段（第１後段知識作成ステップ）により、第１軸抽出
手段（第１軸抽出ステップ）で抽出された軸に基づき前
記データの集合のクラスタリングを行い、該カテゴリの
ＯＫクラスタとして生成し、生成されたＯＫクラスタの
形状を第１後段知識として保持し、第１後段判定手段
（第１後段判定ステップ）では、入力画像の認識結果と
して認識結果およびその付随情報を含むデータが得られ
るとき、該データが第１後段知識のあるカテゴリに所属
するＯＫクラスタの中に含まれるか否かをチェックし
て、該ＯＫクラスタに含まれる場合には該ＯＫクラスタ
が所属するカテゴリを入力画像の認識結果として”正
解”と判定し、何れのＯＫクラスタにも含まれていない
場合には”未確定”と判定する。尚、第１軸抽出手段
（第１軸抽出ステップ）による軸抽出には、例えば、上
位候補に出現率の高いカテゴリを該軸として、下位候補
に出現率の高いカテゴリを該軸として、上位候補に出現
率の高いカテゴリおよび下位候補に出現率の高いカテゴ
リをあわせて軸として、中位候補に出現率の高いカテゴ
リを該軸として、相関係数の高いカテゴリ同士を該軸と
して、或いは、相関係数の低いカテゴリ同士を該軸とし
て抽出する方法等が考えられる。これにより、正解とし
て確定できたものについては人がチェックする必要がな
くなり、正解と確定できなかったものについてのみチェ
ックするだけでよくなることから、受付け誤認識率を低
下させることができ、画像認識システムにおける省力化
をさらに図ることができる。According to the invention as defined in claim 4, 11 or 18, the first axis extracting means (first axis extracting step).
Enables to collect data including the recognition result of image recognition and its accompanying information, and the recognition result can be obtained in a certain category.
The axis for clustering the set of data is extracted from the data that are “correct answer”, and the first second-stage knowledge creating unit (first second-stage knowledge creating step) extracts the first axis extracting unit (first axis extracting step). ) Clustering of the data set based on the axis extracted in step 1) to generate an OK cluster of the category, hold the shape of the generated OK cluster as first second-stage knowledge, and determine the first second-stage determination means (first In the latter-stage determination step), when data including the recognition result and its accompanying information is obtained as the recognition result of the input image, it is determined whether the data is included in the OK cluster belonging to the first knowledge category. If it is included in the OK cluster, it is determined that the category to which the OK cluster belongs is “correct” as the recognition result of the input image. If it is not included in the cluster, it is determined to be “undetermined.” In addition, for the axis extraction by the first axis extraction means (first axis extraction step), for example, a category with a high appearance rate is selected as a top candidate. As an axis, a category having a high appearance rate in the lower candidates is used as an axis, a category having a high appearance rate in the upper candidates and a category having a high appearance rate in the lower candidates are used as an axis, and a category having a high appearance rate in the middle candidate is used as the axis. As an axis, a method of extracting categories having a high correlation coefficient as the axis, or extracting categories having a low correlation coefficient as the axis can be considered. Since it becomes unnecessary to check only the items that could not be confirmed as correct answers, it is possible to reduce the false recognition rate of acceptance and reduce the image recognition system. It can be further achieved that labor saving.

【００３５】また、請求項６、１２または１９に記載の
発明によれば、第２軸抽出手段（第２軸抽出ステップ）
により、画像認識の認識結果およびその付随情報を含む
データをサンプル収集し、あるカテゴリで認識結果が”
誤認”とされているデータから、前記データの集合をク
ラスタリングするときの軸を抽出し、第２後段知識作成
手段（第２後段知識作成ステップ）により、第２軸抽出
手段（第２軸抽出ステップ）で抽出された軸に基づき前
記データの集合のクラスタリングを行い、該カテゴリの
ＮＧクラスタとして生成し、生成されたＮＧクラスタの
形状を第２後段知識として保持し、第２後段判定手段
（第２後段判定ステップ）では、入力画像の認識結果と
して認識結果およびその付随情報を含むデータが得られ
るとき、該データが第２後段知識のあるカテゴリに所属
するＮＧクラスタの中に含まれるか否かをチェックし
て、該ＮＧクラスタに含まれる場合には該ＮＧクラスタ
が所属するカテゴリを前記入力画像の認識結果として”
不正解”と判定し、何れのＮＧクラスタにも含まれてい
ない場合には”正解”と判定するようにしている。尚、
第２軸抽出手段（第２軸抽出ステップ）における軸抽出
には、上位候補に出現率の高いカテゴリを該軸として、
下位候補に出現率の高いカテゴリを該軸として、上位候
補に出現率の高いカテゴリおよび下位候補に出現率の高
いカテゴリをあわせて軸として、中位候補に出現率の高
いカテゴリを該軸として、相関係数の高いカテゴリ同士
を該軸として、或いは、相関係数の低いカテゴリ同士を
該軸として抽出する方法等が考えられる。According to the invention of claim 6, 12 or 19, second axis extracting means (second axis extracting step).
Enables to collect data including the recognition result of image recognition and its accompanying information, and the recognition result can be obtained in a certain category.
An axis for clustering the set of data is extracted from the data which is regarded as "misidentification", and the second second-stage knowledge creating unit (second second-stage knowledge creating step) extracts the second axis extracting unit (second axis extracting step). ) Clustering the set of data based on the axis extracted in), generating as an NG cluster of the category, and holding the shape of the generated NG cluster as second second-stage knowledge, a second second-stage determining means (second In the latter-stage determination step), when data including the recognition result and its accompanying information is obtained as the recognition result of the input image, it is determined whether the data is included in the NG cluster belonging to the second knowledge category. If it is checked, if it is included in the NG cluster, the category to which the NG cluster belongs is determined as the recognition result of the input image.
It is determined that the answer is "wrong", and if it is not included in any NG cluster, it is determined as "correct".
For the axis extraction in the second axis extraction means (second axis extraction step), a category having a high appearance rate in the top candidates is used as the axis.
A category with a high appearance rate in the lower candidates is used as the axis, a category with a high appearance rate in the upper candidates and a category with a high appearance rate in the lower candidates are combined as an axis, and a category with a high appearance rate in the middle candidate is used as the axis. A method of extracting categories having a high correlation coefficient as the axis, or extracting categories having a low correlation coefficient as the axis can be considered.

【００３６】これにより、正解として確定できたものに
ついては人がチェックする必要がなくなり、正解と確定
できなかったものについてのみチェックするだけでよく
なることから、受付け誤認識率を低下させることがで
き、画像認識システムにおける省力化をさらに図ること
ができる。As a result, it is not necessary for a person to check what has been confirmed as a correct answer, and it suffices to check only those that cannot be determined as a correct answer, so that the acceptance error recognition rate can be reduced. It is possible to further save labor in the image recognition system.

【００３７】また、請求項５、１３または２０に記載の
発明によれば、第１後段判定手段（第１後段判定ステッ
プ）において、前記データがあるカテゴリに所属するＯ
Ｋクラスタに含まれる場合に、どの程度の確信を持っ
て”正解”といえるかを示す指標である確信度を確信度
算出手段（確信度算出ステップ）により計算するように
し、前記データが複数の別カテゴリに所属するＯＫクラ
スタに入った場合には、確信度に基づいてどのカテゴリ
を”正解”とするかを決定し、例えば、確信度の大きさ
が所定値以下の場合には”正解と確信できない”と判定
するようにしている。これにより、あるカテゴリのＯＫ
クラスタに入ったデータをそのカテゴリに属するデータ
と決定付け、逆にどのＯＫクラスタにも入らない場合
は”不明”（どのカテゴリにも属さない）データとされ
るが、あるカテゴリに属するデータと決定付けられた場
合に、認識結果の確信度合いを算出することにより、”
不明”、”正解”、”正解と確信できない”の３つに認
識結果を分別して確定されることとなる。また、さら
に、ＮＧクラスタの少なくとも１つに入った場合は”不
正解”データと決定され、”正解”、”不正解”、”正
解と確信できない”の３つに認識結果を分別して確定さ
れることとなる。Further, according to the invention described in claim 5, 13 or 20, in the first second-stage determination means (first second-stage determination step), the data belonging to a certain category is O.
When included in the K cluster, the certainty factor, which is an index indicating how much confidence can be said to be the “correct answer”, is calculated by the certainty factor calculation means (certainty factor calculation step), and the data is stored in plural numbers. When an OK cluster belonging to another category is entered, it is determined which category is the “correct answer” based on the certainty factor. For example, when the magnitude of the certainty factor is equal to or less than a predetermined value, the “correct answer” is determined. I am not sure. " As a result, the OK of a certain category
Data that has entered the cluster is determined to belong to that category. Conversely, if it does not belong to any OK cluster, it is regarded as "unknown" (not belonging to any category) data, but it is determined to be data that belongs to a certain category. If it is attached, by calculating the confidence level of the recognition result,
The recognition result will be classified into three types, "unknown", "correct answer", and "we cannot be convinced to be correct". Furthermore, if at least one of the NG clusters is entered, it will be regarded as "incorrect" data. The recognition result is determined, and the recognition result is classified into three of “correct answer”, “incorrect answer”, and “I cannot be sure of correct answer”.

【００３８】[0038]

【発明の実施の形態】以下、本発明の画像認識装置、画
像認識方法および記録媒体の実施の形態について、図面
を参照して詳細に説明する。BEST MODE FOR CARRYING OUT THE INVENTION Embodiments of an image recognition apparatus, an image recognition method and a recording medium of the present invention will be described in detail below with reference to the drawings.

【００３９】図１は本発明の一実施形態に係る画像認識
装置の構成図である。FIG. 1 is a block diagram of an image recognition apparatus according to an embodiment of the present invention.

【００４０】図１において、本実施形態の画像認識装置
は、おおまかにＯＣＲ認識処理部１０１および認識結果
確定部１０２を備えて構成されており、認識結果確定部
１０２は、知識生成部１０３、前段判定部（第１前段判
定手段、第２前段判定手段および第３前段判定手段）１
１５、後段判定部（第１後段判定手段および第２後段判
定手段）１１６、ルール判定部１１７、認識確定部１１
８およびリジェクト部１１９を備えて構成されている。
また、知識生成部１０３は、前段知識の作成部（第１前
段知識作成手段、第２前段知識作成手段および第３前段
知識作成手段）１１１、前段知識ベース（第１前段知
識、第２前段知識および第３前段知識）１１２、後段知
識の作成部（第１後段知識作成手段および第２後段知識
作成手段）１１３および後段知識ベース（第１後段知識
および第２後段知識）１１４を備えた構成である。In FIG. 1, the image recognition apparatus of the present embodiment roughly comprises an OCR recognition processing unit 101 and a recognition result confirmation unit 102. The recognition result confirmation unit 102 includes a knowledge generation unit 103 and a preceding stage. Judgment unit (first front stage judgment means, second front stage judgment means and third front stage judgment means) 1
15, rear stage determination unit (first rear stage determination unit and second rear stage determination unit) 116, rule determination unit 117, recognition determination unit 11
8 and a reject unit 119.
In addition, the knowledge generation unit 103 includes a pre-stage knowledge creating unit (first pre-stage knowledge creating unit, second pre-stage knowledge creating unit, and third pre-stage knowledge creating unit) 111, a pre-stage knowledge base (first pre-stage knowledge, second pre-stage knowledge). And third pre-stage knowledge) 112, a post-stage knowledge creating unit (first post-stage knowledge creating means and second post-stage knowledge creating means) 113, and a post-stage knowledge base (first post-stage knowledge and second post-stage knowledge) 114. is there.

【００４１】また、図２は、本実施形態の画像認識装置
の全体的な処理の概略を説明する説明図である。同図に
示すように、本実施形態の画像認識装置は、複数カテゴ
リの認識対象を持ち、登録画像パタンと入力画像パタン
とのパタンマッチングにおける適合度（距離）に基づき
入力画像の認識を行うものであるが、認識結果確定部１
０２に特徴を持つものであり、ＯＣＲ認識処理部１０１
からの認識結果と付随情報１５１ｂについて、前段知識
ベース１１２および後段知識ベース１１４に基づく前段
判定部１１５および後段判定部１１６の判定を行うこと
によって、認識確定部１１８により、該認識結果が”未
確定（不正解）”、”正解”或いは”正解と確信が持て
ない”の何れであるかを確定するものである。FIG. 2 is an explanatory view for explaining the outline of the overall processing of the image recognition apparatus of this embodiment. As shown in the figure, the image recognition apparatus of this embodiment has recognition targets of a plurality of categories, and recognizes an input image based on a matching degree (distance) in pattern matching between a registered image pattern and an input image pattern. However, the recognition result determination unit 1
02, and the OCR recognition processing unit 101
The recognition determination unit 118 determines that the recognition result is “undetermined” by performing the determination of the recognition result and the associated information 151b from the preceding determination unit 115 and the determination unit 116 based on the upstream knowledge base 112 and the downstream knowledge base 114. (Incorrect answer), “correct answer”, or “I am not sure that the answer is correct”.

【００４２】本実施形態の画像認識装置を構成する各構
成要素および各構成要素間の引き渡しデータについて簡
単に説明する。尚、本実施形態の画像認識装置では、説
明を簡単にするため、認識対象の文字画像として、数字
（０〜９）並びに３種の記号（”（”，”）”，”
−”）、合計１３種の文字を扱うものとする。A brief description will be given of each of the components that make up the image recognition apparatus of this embodiment and the transfer data between the components. In the image recognition apparatus of the present embodiment, in order to simplify the description, the character images to be recognized are numbers (0 to 9) and three types of symbols ("(", ")", ".
-"), A total of 13 types of characters are handled.

【００４３】まず、ＯＣＲ認識処理部１０１では、類似
した文字のうち、各文字辞書に対して適合度演算を行
う。ここで、適合度演算は、登録画像パタンと入力画像
パタンとのパタンマッチングにおいて行われるもので、
適合度（以下、距離という）の高いもの順に類似文字を
並べ、距離が最も高いものをＯＣＲの認識結果として出
力する。First, the OCR recognition processing unit 101 performs a fitness calculation on each character dictionary among similar characters. Here, the fitness calculation is performed in pattern matching between the registered image pattern and the input image pattern,
Similar characters are arranged in descending order of suitability (hereinafter referred to as distance), and the one with the highest distance is output as the OCR recognition result.

【００４４】また、認識結果１５１ａ，１５１ｂは、画
像認識の認識結果およびその付随情報を含むデータであ
り、該データは、１文字あたり１３個の候補と各々の候
補辞書に対する距離により構成されている（図６参
照）。特に、知識生成用認識結果１５１ａは、知識生成
のために利用するサンプル画像について実際の運用前に
ＯＣＲ認識処理部１０１によって取得し、知識生成部１
０３に渡される。また特に、認識結果１５１ｂは、実際
のシステム運用時に、ＯＣＲ認識処理部１０１から１文
字単位で前段判定部１１５に供給される。The recognition results 151a and 151b are data including the recognition result of image recognition and its accompanying information, and the data is composed of 13 candidates per character and the distance to each candidate dictionary. (See Figure 6). In particular, the knowledge generation recognition result 151a is acquired by the OCR recognition processing unit 101 before the actual operation of the sample image used for knowledge generation, and the knowledge generation unit 1
Passed to 03. Further, in particular, the recognition result 151b is supplied from the OCR recognition processing unit 101 to the pre-stage determination unit 115 on a character-by-character basis during actual system operation.

【００４５】前段知識の作成部１１１は、上記のよう
に、第１前段知識作成手段、第２前段知識作成手段およ
び第３前段知識作成手段を備えて構成され、第１前段知
識作成手段では、画像認識の認識結果およびその付随情
報を含むデータから、認識結果として同じ第１候補のカ
テゴリを持つデータを集め、該集めたデータから第２候
補以下の特定の候補順位における類似カテゴリの出現率
の高いものを抽出して、これを第１候補カテゴリに対す
る類似カテゴリの関連を示す第１前段知識として前段知
識ベース１１２に登録する。As described above, the pre-stage knowledge creating unit 111 is configured to include the first pre-stage knowledge creating unit, the second pre-stage knowledge creating unit, and the third pre-stage knowledge creating unit. In the first pre-stage knowledge creating unit, Data having the same first candidate category as the recognition result is collected from the data including the recognition result of the image recognition and its accompanying information, and the second data is collected from the collected data .
In addition, the one with a high appearance rate of similar categories in the following specific candidate ranks is extracted, and this is registered in the upstream knowledge base 112 as the first upstream knowledge indicating the relationship of the similar categories to the first candidate category.

【００４６】また、第２前段知識作成手段では、画像認
識の認識結果およびその付随情報を含むデータから、認
識結果として同じ第１候補のカテゴリを持つデータを集
め、該集めたデータから第１候補のカテゴリにおける適
合度（距離）から、該第１候補のカテゴリを”正解”と
確定し得る適合度（距離）の大きさを算定し、第２前段
知識として前段知識ベース１１２に登録する。In the second pre-stage knowledge creating means, data having the same first candidate category as the recognition result is collected from the data including the recognition result of the image recognition and its accompanying information, and the first candidate is selected from the collected data . from goodness of fit in the category (distance), a category of the first candidate to calculate the size of the goodness of fit may be determined as "correct" (distance), and registers the preceding knowledge base 112 as second pre-knowledge.

【００４７】さらに、第３前段知識作成手段では、画像
認識の認識結果およびその付随情報を含むデータから、
認識結果として同じ第１候補のカテゴリを持つデータを
集め、該集めたデータから第１候補のカテゴリと第２候
補以下の特定の候補順位の類似カテゴリとの適合度（距
離）差から、該第１候補のカテゴリを”正解”と確定し
得る適合度（距離）差の大きさを算定して、第３前段知
識として前段知識ベース１１２に登録する。Furthermore, in the third pre-stage knowledge creating means, from the data including the recognition result of the image recognition and its accompanying information,
Data having the same first candidate category as the recognition result is collected, and from the collected data, the first candidate category and the second candidate are collected.
Complementary, the magnitude of the goodness of fit (distance) difference that can determine the category of the first candidate as the “correct answer” is calculated from the goodness of fit (distance) difference with similar categories of specific candidate ranks, It is registered in the upstream knowledge base 112 as the upstream knowledge.

【００４８】本実施形態では、認識対象の文字はカテゴ
リが１３個あり、ＯＣＲ認識処理部１０１から出力され
るデータは、その１３個の全てが必ず候補文字の中に含
まれている。そのうち第２、第３候補文字データで出現
率の高い文字（第１前段知識）を取得すると共に、第１
候補の距離データで正解と判別できる最低値（第２前段
知識）、並びに、第１候補距離と第２候補距離の差のデ
ータから正解と判別できる最低値（第３前段知識）を算
出する。ここで作成された知識は、前段判定部１１５で
使用されることとなる。尚、１３種の数字および記号以
外の文字の場合については、数千種のカテゴリがある
が、この場合、ＯＣＲ認識処理部１０１から出力される
データは、距離の大きいカテゴリの上位数十種類とし
て、これらから同様にして、第Ｘ（Ｘは任意に選択可
能）候補に出現の多いものを選択していくことになろ
う。In this embodiment, there are 13 categories of characters to be recognized, and all 13 of the data output from the OCR recognition processing unit 101 are always included in the candidate characters. Of the second and third candidate character data, a character with a high appearance rate (first knowledge) is acquired, and
The minimum value (second pre-stage knowledge) at which the correct distance can be determined from the candidate distance data, and the minimum value (third pre-stage knowledge) at which the correct answer can be determined from the data of the difference between the first candidate distance and the second candidate distance are calculated. The knowledge created here will be used by the pre-stage determination unit 115. There are thousands of categories for characters other than 13 numbers and symbols, but in this case, the data output from the OCR recognition processing unit 101 is the top tens of the categories with a large distance. From these, in the same way, it is likely that the ones that appear most frequently will be selected as the Xth (X can be arbitrarily selected) candidates.

【００４９】次に、後段知識の作成部１１３は、上記の
ように、第１後段知識作成手段および第２後段知識作成
手段を備えて構成され、第１後段知識作成手段に含まれ
る第１軸抽出手段により、画像認識の認識結果およびそ
の付随情報を含むデータをサンプル収集し、あるカテゴ
リで認識結果が”正解”とされているデータから、デー
タの集合をクラスタリングするときの軸を抽出し、第１
後段知識作成手段により、抽出した軸に基づきデータの
集合のクラスタリングを行い、該カテゴリのＯＫクラス
タとして生成し、生成されたＯＫクラスタの形状を第１
後段知識として後段知識ベース１１４に登録する。Next, the second-stage knowledge creating unit 113 is configured to include the first second-stage knowledge creating unit and the second second-stage knowledge creating unit as described above, and the first axis included in the first second-stage knowledge creating unit. The extraction means collects a sample of data including the recognition result of image recognition and its accompanying information, and extracts the axis when clustering the data set from the data whose recognition result is "correct" in a certain category, First
The latter-stage knowledge creating unit performs clustering on a set of data based on the extracted axes to generate an OK cluster of the category, and the shape of the generated OK cluster is first
It is registered in the second-stage knowledge base 114 as the second-stage knowledge.

【００５０】また、第２後段知識作成手段に含まれる第
２軸抽出手段により、画像認識の認識結果およびその付
随情報を含むデータをサンプル収集し、あるカテゴリで
認識結果が”誤認”とされているデータから、データの
集合をクラスタリングするときの軸を抽出し、第２後段
知識作成手段により、抽出した軸に基づきデータの集合
のクラスタリングを行い、該カテゴリのＮＧクラスタと
して生成し、生成されたＮＧクラスタの形状を第２後段
知識として後段知識ベース１１４に登録する。Further, the second axis extracting means included in the second latter-stage knowledge creating means collects a sample of data including the recognition result of the image recognition and its accompanying information, and the recognition result is regarded as "misidentification" in a certain category. From the existing data, an axis for clustering the data set is extracted, and the second post-stage knowledge creating unit performs clustering of the data set based on the extracted axis to generate an NG cluster of the category. The shape of the NG cluster is registered in the second-stage knowledge base 114 as the second-stage second-stage knowledge.

【００５１】尚、第１および第２軸抽出手段における軸
抽出には、上位候補に出現率の高いカテゴリを該軸とし
て（誤認識し易いカテゴリを選択）、下位候補に出現率
の高いカテゴリを該軸として（誤認識し難いカテゴリを
選択）、上位候補に出現率の高いカテゴリおよび下位候
補に出現率の高いカテゴリをあわせて軸として（誤認識
し易いカテゴリとし難いカテゴリを選択）、中位候補に
出現率の高いカテゴリを該軸として（誤認識し易くもし
難くも無いカテゴリを選択）、相関係数の高いカテゴリ
同士を該軸として、或いは、相関係数の低いカテゴリ同
士を該軸として抽出する方法等が考えられる。In the axis extraction in the first and second axis extracting means, the category having a high appearance rate is used as the upper candidate (the category which is easily misrecognized is selected), and the category having a high appearance rate is selected as the lower candidate. As the axis (select a category that is difficult to misrecognize), combine the category with a high appearance rate in the upper candidates and the category with a high appearance rate in the lower candidates (select a category that is difficult to misidentify), The category with a high appearance rate as a candidate is selected as the axis (a category that is not easily misidentified or difficult is selected), the categories with a high correlation coefficient are used as the axis, or the categories with a low correlation coefficient are used as the axis. A method of extracting may be considered.

【００５２】つまり、クラスタを生成するために軸を選
択し、その軸データを利用してクラスタを生成する。数
字であれば、カテゴリが１３個あり、候補にはその１３
個のカテゴリへの距離が大きいもの順に出力される。そ
の中から、認識確定に利用出来値特徴のある軸を選択す
ることになる。本実施形態では、上位候補に出現率が高
い文字（すなわち、誤認識し易い文字である）上位３つ
をクラスタ使用軸として選択した。そして、認識結果１
５１ａを用いてクラスタリングを実施した。作成される
クラスタ形状は楕円体となる。また１カテゴリ当たり、
複数のＯＫクラスタと複数のＮＧクラスタが存在するこ
ととなる。That is, an axis is selected to generate a cluster, and the axis data is used to generate a cluster. If it is a number, there are 13 categories, and 13 candidates
The items are output in descending order of distance to each category. From among them, the axis having the available price characteristic for the recognition confirmation is selected. In the present embodiment, the top three characters having a high appearance rate (that is, the characters that are likely to be erroneously recognized) as the top candidates are selected as the cluster use axes. And the recognition result 1
Clustering was performed using 51a. The created cluster shape is an ellipsoid. Also, per category,
There will be a plurality of OK clusters and a plurality of NG clusters.

【００５３】次に、前段判定部１１５は、上記のよう
に、第１前段判定手段、第２前段判定手段および第３前
段判定手段を備えて構成され、第１前段判定手段では、
入力画像の認識結果として第１候補のカテゴリおよび第
２候補以下の特定候補順位の類似カテゴリが得られると
き、該第１候補のカテゴリおよび第２候補以下の特定候
補順位の類似カテゴリと第１前段知識に保持された関連
との比較により、該第１候補のカテゴリを”正解”また
は”不明”と判定する。Next, the pre-stage determination unit 115 is configured to include the first pre-stage determination unit, the second pre-stage determination unit, and the third pre-stage determination unit as described above.
As the recognition result of the input image, the first candidate category and the first candidate
When similar categories with a specific candidate rank less than or equal to 2 candidates are obtained, the first candidate category and the similar categories with a specific candidate rank less than or equal to the second candidate are compared with the association held in the first pre-stage knowledge to determine the first category. 1 Candidate category is determined to be “correct answer” or “unknown”.

【００５４】また、第２前段判定手段では、該第１候補
のカテゴリにおける適合度（距離）の大きさと第２前段
知識に保持された適合度（距離）の大きさとの比較によ
り、該第１候補のカテゴリを”正解”または”不明”と
判定する。さらに、第３前段判定手段では、該第１候補
のカテゴリと第２候補以下の特定候補順位の類似カテゴ
リとの適合度（距離）差の大きさと第３前段知識に保持
された適合度（距離）差の大きさとの比較により、該第
１候補のカテゴリを”正解”または”不明”と判定す
る。In addition, the second front stage judging means uses the first candidate.
By comparison with the size of the goodness of fit (distance) measurement and retained fit the second pre-knowledge (distance) in the category, it is determined that the category of the first candidate "correct" or "unknown". Furthermore, in the third front determination means, the first candidate
Categories and holding the magnitude and third front knowledge of fitness (distance) difference between similar categories following specific candidate ranking second candidate
By comparison with the magnitude of has been fit (distance) difference, it is determined that the category of the first candidate "correct" or "unknown".

【００５５】本実施形態では、認識結果１５１ｂにおけ
る第２候補および第３候補の文字と、前段知識ベース１
１２の知識とを比較する。例えば、第１候補が”１”で
あった場合には、第２候補および第３候補にそれぞれ”
６”および”）”という文字が出現する率が高い。つま
り、第１前段知識には、第２候補および第３候補に出現
する文字が、”６”および”）”のとき、第１候補が”
１”であるものを正解とする知識を所有しており、まず
認識結果１５１ｂの第２候補および第３候補の文字種類
をチェックするのである。In the present embodiment, the characters of the second candidate and the third candidate in the recognition result 151b and the preceding knowledge base 1
Compare with 12 knowledge. For example, if the first candidate is “1”, the second candidate and the third candidate are respectively “
The characters "6" and ")" appear at a high rate. That is, when the characters appearing in the second and third candidates are "6" and ")" in the first pre-stage knowledge, the first candidate is the first candidate. But"
The person who possesses the knowledge that the correct answer is "1", and first checks the character types of the second and third candidates of the recognition result 151b.

【００５６】次に、前段知識ベース１１２で保持する第
２前段知識（第１候補の距離データで正解と判別できる
最低値）を参照して、該最低値以上の値が第１候補の距
離について算出されているかをチェックし、最低値以上
なら”正解”とする。さらに、第１候補および第２候補
の距離が非常に近い値であると、誤認識している可能性
があるため、前段知識ベース１１２で保持する第３前段
知識（第１候補距離と第２候補距離の差のデータで正解
と判別できる最低値）を参照して、該最低値以上の値で
あるかをチェックし、最低値以上なら”正解”とする。
以上の３つのチェック全てで”正解”と判断されたデー
タについてのみ、認識確定部１１８に結果（確定データ
１５５）が渡されることとなる。Next, with reference to the second pre-stage knowledge (the lowest value that can be determined to be the correct answer from the distance data of the first candidate) held in the pre-stage knowledge base 112, a value equal to or greater than the minimum value is the distance of the first candidate. Check whether it has been calculated, and if it is at least the minimum value, it is regarded as "correct". Further, since it is possible that the first candidate and the second candidate have very close distances to each other, it may be erroneously recognized. Therefore, the third upstream knowledge (first candidate distance and second The minimum value that can be determined to be the correct answer based on the data of the difference in the candidate distances) is referred to, and it is checked whether the value is equal to or more than the minimum value.
The result (confirmed data 155) is passed to the recognition confirmation unit 118 only for the data determined to be “correct” in all of the above three checks.

【００５７】したがって、確定データ１５５は前段判定
部１１５で正解と判断されたデータであり、不明データ
１５４は前段判定部１１５で正解と判断されなかったデ
ータである。Therefore, the definite data 155 is the data that the pre-stage determination unit 115 has determined to be correct, and the unknown data 154 is the data that the pre-stage determination unit 115 has not determined to be correct.

【００５８】次に、後段判定部１１６は、上記のよう
に、第１後段判定手段および第２後段判定手段を備えて
構成され、第１後段判定手段では、入力画像の認識結果
として認識結果およびその付随情報を含むデータが得ら
れるとき、該データが第１後段知識のあるカテゴリに所
属するＯＫクラスタの中に含まれるか否かをチェックし
て、該ＯＫクラスタに含まれる場合には該ＯＫクラスタ
が所属するカテゴリを入力画像の認識結果として”正
解”と判定し、何れのＯＫクラスタにも含まれていない
場合には”未確定”と判定する。また、第２後段判定手
段では、該データが第２後段知識のあるカテゴリに所属
するＮＧクラスタの中に含まれるか否かをチェックし
て、該ＮＧクラスタに含まれる場合には該ＮＧクラスタ
が所属するカテゴリを前記入力画像の認識結果として”
不正解”と判定し、何れのＮＧクラスタにも含まれてい
ない場合には”正解”と判定する。Next, as described above, the rear-stage determination section 116 is configured to include the first rear-stage determination means and the second rear-stage determination means, and the first rear-stage determination means recognizes the recognition result and the recognition result as the recognition result of the input image. When the data including the accompanying information is obtained, it is checked whether the data is included in the OK cluster belonging to the category having the first second-stage knowledge, and if the data is included in the OK cluster, the OK is determined. The category to which the cluster belongs is determined as "correct answer" as the recognition result of the input image, and if it is not included in any OK cluster, it is determined as "undetermined". Further, the second latter-stage determination means checks whether or not the data is included in the NG cluster belonging to the category having the second latter-stage knowledge, and if the data is included in the NG cluster, the NG cluster is The category to which the user belongs belongs as the recognition result of the input image.
If the answer is "wrong answer" and it is not included in any NG cluster, it is judged as "correct answer".

【００５９】つまり、前段判定部１１５で不明とされた
データ１５４を対象に、各カテゴリで生成されたクラス
タを用いて判定を行う。この後段判定部１１６には、Ｏ
ＫクラスタおよびＮＧクラスタの知識が後段知識ベース
１１４から供給され、ＯＫクラスタに入るかどうかのチ
ェック、並びに、ＮＧクラスタに入るかどうかのチェッ
クという２段階の判定を行う。That is, the data 154 which is determined to be unknown by the pre-stage determination unit 115 is subjected to the determination using the clusters generated in each category. The subsequent determination unit 116 is
Knowledge of the K clusters and the NG clusters is supplied from the post-stage knowledge base 114, and a two-stage judgment is performed to check whether or not to enter the OK cluster and to check whether or not to enter the NG cluster.

【００６０】第１後段判定手段において、データがある
カテゴリに所属するＯＫクラスタに含まれる場合には、
どの程度の確信を持って”正解”といえるかを示す指標
である確信度を当該後段判定部１１６に含まれる確信度
算出手段により計算し、データが複数の別カテゴリに所
属するＯＫクラスタに入った場合には、該確信度に基づ
いてどのカテゴリを”正解”とするかを決定する。例え
ば、確信度の大きさが所定値以下の場合には”正解と確
信できない”と判定する。When the data is included in the OK cluster belonging to a certain category in the first latter-stage determination means,
The certainty factor, which is an index indicating how much certainty can be said to be the “correct answer”, is calculated by the certainty factor calculation means included in the latter-stage determination unit 116, and the data falls within the OK cluster belonging to a plurality of different categories. If so, which category is set as the “correct answer” is determined based on the certainty factor. For example, when the degree of certainty is equal to or less than a predetermined value, it is determined that "the correct answer cannot be obtained".

【００６１】尚、確信度算出手段における確信度算出方
法には、データが、該ＯＫクラスタの重心からどの位の
距離に位置しているかを演算し、重心から近ければ高い
確信度を、重心から遠ければ低い確信度をそれぞれ算出
する第１の方法と、また、該ＯＫクラスタを構成するデ
ータの分布に基づいて、データが密度の高いエリアに存
在する場合は高い確信度を、密度の低いエリアに存在す
る場合は低い確信度をそれぞれ算出する第２の方法とが
ある。The confidence factor calculation method in the confidence factor calculation means calculates how far the data is located from the center of gravity of the OK cluster, and if it is closer to the center of gravity, a higher confidence factor is calculated. Based on the first method of calculating a low confidence degree if the distance is far, and based on the distribution of the data forming the OK cluster, a high confidence degree is set when the data exists in the high density area, and a low confidence area is set in the low density area. 2), there is a second method for calculating a low certainty factor.

【００６２】また、第２後段判定手段において、データ
があるカテゴリに所属するＮＧクラスタに含まれる場合
には、当該後段判定部１１６に含まれる確信度更新手段
により、ＯＫクラスタに含まれることで計算された確信
度を低くする処理が行われる。If the second post-stage determination means includes the data in an NG cluster belonging to a certain category, the confidence factor updating means included in the post-stage determination section 116 calculates that the data is included in the OK cluster. A process of lowering the determined certainty factor is performed.

【００６３】したがって、確定データ１５６は、少なく
とも１つのＯＫクラスタに含まれ、確信度が計算された
データであり、未確定データ１５７は、後段判定部１１
６で、どのＯＫクラスタにも含まれなかった場合のデー
タであり、認識結果をどのカテゴリにも確定することが
できなかったデータである。Therefore, the definite data 156 is data for which the certainty factor is included in at least one OK cluster, and the undecided data 157 is the post-stage determination unit 11.
6 is the data when it was not included in any OK cluster, and the recognition result could not be determined in any category.

【００６４】次に、リジェクト部１１９は、後段判定部
１１６で未確定とされたデータ１５７、或いは、ルール
判定部１１７で正確と確信できないと判定されたデータ
１５９を受ける部分である。Next, the reject unit 119 is a unit that receives the data 157 that has been undetermined by the post-stage determination unit 116 or the data 159 that has been determined by the rule determination unit 117 to be inaccurate.

【００６５】また、ルール判定部１１７は、後段判定部
１１６で算出された確信度に基づいて、正解と確信でき
ないデータとしてリジェクトするか、確定データとして
受け付けるか判定する部分である。実際には、ユーザの
設定した値との大小比較で”確定”、”正解と確信でき
ない”を分けている。The rule determination section 117 is a section for determining, based on the certainty factor calculated by the post-stage determination section 116, whether to reject the data as uncertain as the correct answer or accept it as the final data. Actually, “determined” and “not sure that the answer is correct” are classified by comparing the value with the value set by the user.

【００６６】したがって、確定データ１５８は、ルール
判定部１１７でユーザの設定した値より確信度が大きい
データであり、正解と確信できないデータ１５９は、ル
ール判定部１１７でユーザの設定した値より確信度が小
さいデータである。Therefore, the fixed data 158 is data having a higher certainty factor than the value set by the user in the rule determination unit 117, and the data 159 that cannot be surely the correct answer is higher than the certain value set by the user in the rule determination unit 117. Is small data.

【００６７】さらに、認識確定部１１８は、前段判定部
１１５またはルール判定部１１７で認識結果を確定でき
たデータ１５５，１５８を受ける部分である。Further, the recognition confirming unit 118 is a unit for receiving the data 155, 158 for which the recognition result can be confirmed by the pre-stage judging unit 115 or the rule judging unit 117.

【００６８】以上説明した各構成要素により、本実施形
態の画像認識装置では、概略的に以下のような動作が行
われる。図３、図４および図５には、本実施形態の画像
認識装置の概略的動作を説明する概略フローチャートを
示す。With the components described above, the image recognition apparatus of this embodiment performs the following operations. FIG. 3, FIG. 4 and FIG. 5 show schematic flow charts for explaining the schematic operation of the image recognition apparatus of this embodiment.

【００６９】まず、図３は知識作成処理の概略フローチ
ャートである。知識作成処理では、まずステップ３０１
で、サンプルデータについて、ＯＣＲ認識処理部１０１
により知識生成用の認識結果１５１ａの収集が行われ
る。First, FIG. 3 is a schematic flowchart of the knowledge creating process. In the knowledge creation process, first step 301
Then, for the sample data, the OCR recognition processing unit 101
Thus, the recognition result 151a for knowledge generation is collected.

【００７０】次に、ステップ３０２では、知識生成部１
０３の前段知識の作成部１１１により前段知識ベース１
１２が構築される。すなわち、第１前段知識作成手段に
より第１前段知識（第１候補カテゴリに対する類似カテ
ゴリの関連）が、第２前段知識作成手段により第２前段
知識（特定順位の候補を”正解”と確定し得る適合度
（距離）の大きさ）が、第３前段知識作成手段により第
３前段知識（第１候補を”正解”と確定し得る適合度
（距離）差の大きさ）が、それぞれ作成され登録され
る。Next, in step 302, the knowledge generation unit 1
The pre-stage knowledge base 1 by the pre-stage knowledge creation unit 111 of 03.
12 are built. That is, the first pre-stage knowledge creating unit can determine the first pre-stage knowledge (the association of the similar category with the first candidate category) and the second pre-stage knowledge creating unit can determine the second pre-stage knowledge (the candidate of the specific rank is "correct answer"). The degree of goodness of fit (distance) is created and registered by the third knowledge making means as the third knowledge of knowledge (the magnitude of the difference of goodness of fit (distance) at which the first candidate can be determined as "correct answer"). To be done.

【００７１】さらに、ステップ３０３では、知識生成部
１０３の後段知識の作成部１１３により後段知識ベース
１１４が構築される。すなわち、第１後段知識作成手段
により第１後段知識（ＯＫクラスタの形状）が、第２後
段知識作成手段により第２後段知識（ＮＧクラスタの形
状）がそれぞれ作成され登録される。Further, in step 303, the latter-stage knowledge base 114 is constructed by the latter-stage knowledge creating unit 113 of the knowledge generating unit 103. That is, the first second-stage knowledge creating unit creates and registers the first second-stage knowledge (OK cluster shape), and the second second-stage knowledge creating unit creates and registers the second second-stage knowledge (NG cluster shape).

【００７２】次に、図４および図５は、システム運用時
の処理の概略フローチャートである。まず、ステップ４
０１で、実際の認識対象画像について、ＯＣＲ認識処理
部１０１により認識処理されて、認識結果１５１ｂが認
識結果確定部１０２の前段判定部１１５に供給される。Next, FIG. 4 and FIG. 5 are schematic flow charts of processing during system operation. First, step 4
In 01, the OCR recognition processing unit 101 performs recognition processing on the actual recognition target image, and the recognition result 151b is supplied to the pre-stage determination unit 115 of the recognition result determination unit 102.

【００７３】次に、前段判定部１１５による処理が行わ
れる。まず、ステップ４０２では、前段判定部１１５の
第１前段判定手段により、第１候補のカテゴリおよび第
２候補以下の特定候補順位の類似カテゴリと第１前段知
識に保持された関連とを比較して、知識と同じであれば
該第１候補のカテゴリを”正解”と判定してステップ４
０３に進み、また知識と違っていれば”不明”と判定し
て図５のステップ５０１に進む。Next, the processing by the pre-stage determination section 115 is performed. First, in step 402, the first front stage determination means of the front stage determination section 115 determines the first candidate category and the first candidate .
The similar category of the specific candidate ranks of two candidates or less is compared with the relation held in the first-stage knowledge, and if the knowledge is the same, the category of the first candidate is determined as “correct answer” and step 4
If it is different from the knowledge, it is judged as "unknown" and the process proceeds to step 501 in FIG.

【００７４】次に、ステップ４０３では、第２前段判定
手段により、該第１候補のカテゴリにおける適合度（距
離）の大きさと第２前段知識における適合度（距離）の
大きさとを比較して、知識の距離の大きさより大きい値
であれば該第１候補を”正解”と判定してステップ４０
４に進み、また知識の距離の大きさ以下であれば”不
明”と判定して図５のステップ５０１に進む。Next, at step 403, the second pre-stage determination means compares the magnitude of the goodness of fit (distance) in the category of the first candidate with the magnitude of the goodness of fit (distance) in the second pre-stage knowledge, If the value is larger than the knowledge distance, the first candidate is determined to be "correct" and step 40
4. If it is less than the knowledge distance, it is judged as "unknown" and the process proceeds to step 501 in FIG.

【００７５】さらに、ステップ４０４では、第３前段判
定手段により、該特定の候補順位のカテゴリまたは類似
カテゴリ間における適合度（距離）差の大きさと第３前
段知識における適合度（距離）差の大きさとを比較し
て、知識の距離差の大きさより大きい値であれば該第１
候補を”正解”と判定して図５のステップ５０８に進
み、また知識の距離差の大きさ以下であれば”不明”と
判定して図５のステップ５０１に進む。Further, in step 404, the third pre-stage determination means determines the magnitude of the goodness-of-fit (distance) difference between the category or the similar category of the specific candidate rank and the magnitude of the goodness-of-fit (distance) difference in the third pre-stage knowledge. And if the value is larger than the magnitude of the difference in knowledge distance, the first
The candidate is determined to be "correct" and the process proceeds to step 508 in FIG. 5, and if it is less than or equal to the knowledge distance difference, it is determined to be "unknown" and the process proceeds to step 501 in FIG.

【００７６】次に、図５の後段判定部１１６における処
理に進む。まず、ステップ５０１では、後段判定部１１
６の第１後段判定手段により、データが第１後段知識の
あるカテゴリに所属するＯＫクラスタの中に含まれるか
否かをチェックする。ここで、ＯＫクラスタの中に含ま
れていなければ、ステップ５０９でリジェクトされる。Next, the processing proceeds to the latter stage judging section 116 in FIG. First, in step 501, the post-stage determination unit 11
The first second-stage determination unit 6 checks whether the data is included in the OK cluster belonging to the category having the first second-stage knowledge. Here, if it is not included in the OK cluster, it is rejected in step 509.

【００７７】また、ステップ５０１において、データが
あるカテゴリに所属するＯＫクラスタに含まれる場合に
は、ステップ５０２またはステップ５０３に進んで、ど
の程度の確信を持って”正解”といえるかを示す指標で
ある確信度を確信度算出手段により計算する。In step 501, if the data is included in an OK cluster belonging to a certain category, the procedure proceeds to step 502 or step 503 to show how much confidence the index is. The certainty factor is calculated by the certainty factor calculation means.

【００７８】確信度算出手段による確信度算出は、ステ
ップ５０２のように、データが、該ＯＫクラスタの重心
からどの位の距離に位置しているかを演算し、重心から
近ければ高い確信度を、重心から遠ければ低い確信度を
それぞれ算出するか、或いは、ステップ５０３のよう
に、該ＯＫクラスタを構成するデータの分布に基づい
て、データが密度の高いエリアに存在する場合は高い確
信度を、密度の低いエリアに存在する場合は低い確信度
をそれぞれ算出するか、の方法で行われる。In the confidence factor calculation by the confidence factor calculating means, as in step 502, the distance at which the data is located from the center of gravity of the OK cluster is calculated. If it is far from the center of gravity, a low confidence factor is calculated, or, as in step 503, based on the distribution of the data forming the OK cluster, a high confidence factor is calculated when the data exists in a high density area. If it exists in a low density area, low confidence is calculated for each of them.

【００７９】本実施形態では、図５に示すように、ステ
ップ５０２でクラスタ重心からの距離による確信度を求
め、ステップ５０３でクラスタ内座標値による確信度を
求め、ステップ５０４において、これら２つの算出され
た確信度を加算して最終的な確信度としている。In this embodiment, as shown in FIG. 5, in step 502, the certainty factor based on the distance from the cluster centroid is obtained, in step 503 the certainty factor based on the intra-cluster coordinate value is obtained, and in step 504, these two calculations are performed. The determined certainty factor is added to obtain the final certainty factor.

【００８０】次に、ステップ５０５では、第２後段判定
手段により、該データが第２後段知識のあるカテゴリに
所属するＮＧクラスタの中に含まれるか否かをチェック
する。ここで、ＮＧクラスタの中に含まれていなければ
ステップ５０７に進む。Next, in step 505, the second latter-stage determination means checks whether or not the data is included in the NG cluster belonging to the category having the second latter-stage knowledge. Here, if it is not included in the NG cluster, the process proceeds to step 507.

【００８１】また、ステップ５０５において、データが
あるカテゴリに所属するＮＧクラスタに含まれる場合に
は、ステップ５０６に進んで、確信度更新手段により、
ＯＫクラスタに含まれることで計算された確信度を低く
するように、確信度を調整する。In step 505, if the data is included in the NG cluster belonging to a certain category, the process proceeds to step 506, and the certainty factor updating means
The certainty factor is adjusted so that the certainty factor calculated by being included in the OK cluster is lowered.

【００８２】さらに、ステップ５０７では、ルール判定
部１１７により、後段判定部１１６で算出された確信度
とユーザ設定された値とを比較して、ユーザ設定値以下
の確信度の場合には、ステップ５０９で”正解と確信で
きない”データとしてリジェクトし、また、ユーザ設定
値より大きい確信度の場合には、ステップ５０８で”正
解”の確定データとして受け付ける。Further, in step 507, the rule determination unit 117 compares the certainty factor calculated by the latter determination unit 116 with the value set by the user, and if the certainty factor is less than or equal to the user set value, the step is performed. In step 509, the data is rejected as "uncertain that the answer is correct", and if the certainty factor is larger than the user set value, in step 508 it is accepted as the "correct answer" confirmation data.

【００８３】次に、以下では、本実施形態の画像認識装
置を構成する各構成要素について、その機能および動作
並びに用いる手法等を各説明図を参照して詳細に説明し
ていく。Next, the function and operation of each component constituting the image recognition apparatus of this embodiment, the method used, and the like will be described in detail below with reference to the drawings.

【００８４】まず、ＯＣＲ認識処理部１０１から出力さ
れる認識結果について図６を参照して説明する。認識対
象１文字当たりに出力されるデータは、図６の各行に示
すように、各行上段の候補文字と、各行下段の各候補に
ついての距離である。First, the recognition result output from the OCR recognition processing unit 101 will be described with reference to FIG. The data output for each character to be recognized is the distance between the candidate character in the upper row of each line and each candidate in the lower row of each line, as shown in each row of FIG.

【００８５】第１行目のデータを例にとり説明を加える
と、次のようになる。すなわち、第１候補は、”０”
で、”０”の距離は２３４である。第２候補は、”９”
で、”９”の距離は２１４である。…。第１３候補
は、”）”で、”）”の距離は１３５である。ここで、
距離が大きいほど、認識した文字への信頼度が高いこと
を意味する。つまり、第１行目のデータについて、ＯＣ
Ｒ認識処理部１０１は認識対象文字を”０”として認識
したことになる。The description will be as follows, taking the data of the first row as an example. That is, the first candidate is "0"
Therefore, the distance of “0” is 234. The second candidate is "9"
Therefore, the distance of "9" is 214. …. The 13th candidate is “)”, and the distance of “)” is 135. here,
The larger the distance, the higher the reliability of the recognized character. That is, regarding the data in the first row, OC
The R recognition processing unit 101 has recognized the recognition target character as “0”.

【００８６】次に、前段知識の作成部１１１による前段
知識ベース１１２の構築について説明する。まず、第１
前段知識作成手段では、認識結果１５１ａ（図６参照）
のデータのうち、第２候補および第３候補に出現する候
補文字をカウントする。そのうち出現数の上位２つの候
補文字を第１前段知識として保存する。Next, the construction of the upstream knowledge base 112 by the upstream knowledge creation unit 111 will be described. First, the first
In the first-stage knowledge creating means, the recognition result 151a (see FIG. 6)
The candidate characters appearing in the second candidate and the third candidate of the data of are counted. Among them, the two candidate characters with the highest number of appearances are saved as the first-stage knowledge.

【００８７】次に、第２および第３前段知識作成手段で
は、認識結果１５１ａ（図６参照）のデータから、第１
候補距離の大きさ並びに（第１候補距離−第２候補距
離）の大きさの２つの特徴を用いて、認識結果の”確
定”、”未確定”を判断する第２および第３前段知識
（しきい値パラメータ）を作成する。Next, in the second and third front-stage knowledge creating means, the first result is obtained from the data of the recognition result 151a (see FIG. 6).
The second and third pre-stage knowledge for judging "confirmed" or "unconfirmed" of the recognition result by using two features of the size of the candidate distance and the size of (first candidate distance-second candidate distance) Threshold parameter).

【００８８】まず第１に、知識作成用データからパラメ
ータ決定に使用するデータの選択を行う（カテゴリ毎に
実施する）。具体的には、以下のイ）からニ）の手順で
行われる。First, the data used for parameter determination is selected from the knowledge creation data (performed for each category). Specifically, the steps from a) to d) are performed.

【００８９】イ）（第１候補距離−第２候補距離）が、
０、１、２…の場合の正解サンプル数を調べる。A) (first candidate distance-second candidate distance)
The number of correct samples for 0, 1, 2, ...

【００９０】ロ）正解サンプル数と誤判別サンプル数の
比を調べる。(B) Check the ratio of the number of correct samples and the number of erroneously determined samples.

【００９１】ハ）誤判別サンプル数と正解サンプル数の
比が一定以上（例えば、４０％以上）になる距離差αを
見つける。C) Find the distance difference α at which the ratio of the number of erroneously determined samples and the number of correct samples is equal to or more than a certain value (for example, 40% or more).

【００９２】ニ）（第１候補距離−第２候補距離）がα
以下のデータは、全てリジェクトの対象として（２）の
パラメータ決定の対象から除く。D) (first candidate distance-second candidate distance) is α
All the following data are excluded from the object of parameter determination in (2) as the object of reject.

【００９３】次に（第２に）、文字（カテゴリ）ごとに
誤判別をリジェクトするのに必要な第１候補距離並びに
第１候補および第２候補の距離差のしきい値を見つけ
る。具体的には、以下のイ）からハ）の手順で行われ
る。Next (secondly), the first candidate distance necessary for rejecting misjudgment for each character (category) and the threshold value of the distance difference between the first candidate and the second candidate are found. Specifically, the steps a) to c) are performed.

【００９４】イ）誤判別サンプルの第１候補距離と（第
１候補距離−第２候補距離）を調べる。A) Check the first candidate distance and (first candidate distance-second candidate distance) of the misjudgment sample.

【００９５】ロ）正解のサンプルの第１候補距離と（第
１候補距離−第２候補距離）を調べる。(B) Check the first candidate distance and (first candidate distance-second candidate distance) of the correct sample.

【００９６】ハ）誤判別をすべてリジェクトし、かつ正
解のリジェクトを最小限に抑えることのできる第１候補
と（第１候補距離−第２候補距離）のしきい値を決め
る。C) The thresholds of the first candidate and (first candidate distance-second candidate distance) that can reject all misjudgments and minimize the rejection of correct answers are determined.

【００９７】以上のようにして得られたパラメータを図
７に示す。図中、「第１候補距離」が第２前段知識であ
り、「第１−第２候補距離」が第３前段知識であり、
「第２、第３候補に出現率の高い文字」が第１前段知識
である。The parameters obtained as described above are shown in FIG. In the figure, the "first candidate distance" is the second upstream knowledge, the "first-second candidate distance" is the third upstream knowledge,
“Characters with a high appearance rate in the second and third candidates” are the first preliminary knowledge.

【００９８】次に、前段判定部１１５による判定方法に
ついて説明する。前段判定部１１５は、知識生成部１０
３で作成した候補文字の距離データによる前段知識ベー
ス１１２を使用して、ＯＣＲ認識処理部１０１の認識結
果１５１ｂを”確定”、”未確定”に分けるものであ
る。ここで、”確定”、”未確定”に分けるためのルー
ルは、以下に述べる通りである。Next, the determination method by the upstream determination unit 115 will be described. The pre-stage determination unit 115 includes the knowledge generation unit 10
The recognition result 151b of the OCR recognition processing unit 101 is divided into "confirmed" and "unconfirmed" by using the preceding knowledge base 112 based on the distance data of the candidate character created in 3. Here, the rules for dividing into “confirmed” and “unconfirmed” are as described below.

【００９９】まず第１に（手順１）、第１候補を参照
し、第１前段知識（図７参照）からチェックに使用する
パラメータを選択する。次に（手順２）、第２候補およ
び第３候補文字が第１前段知識（出現率が高いもの）に
一致しているかチェックする。一致しないデータが入っ
ている場合は”不明”データとする。次に（手順３）、
手順２で”不明”データとされなかったデータについ
て、第１候補距離が第２前段知識（図７の第１候補距離
パラメータ）より小さければ”不明”データとする。さ
らに（手順４）、手順３で”不明”データとされなかっ
たデータについて、（第１候補距離−第２候補距離）の
チェックを行う。すなわち、（第１候補距離−第２候補
距離）が第３前段知識（図７の（第１候補距離−第２候
補距離）パラメータ）より小さければ”不明”データと
する。First (procedure 1), the first candidate is referred to, and the parameter used for checking is selected from the first pre-stage knowledge (see FIG. 7). Next (procedure 2), it is checked whether the second candidate character and the third candidate character match the first pre-stage knowledge (those with a high appearance rate). If there is data that does not match, it is treated as "unknown" data. Next (procedure 3),
If the first candidate distance is smaller than the second pre-stage knowledge (the first candidate distance parameter in FIG. 7) for the data that was not made "unknown" data in step 2, it is made "unknown" data. Further (procedure 4), for data that is not "unknown" data in procedure 3, (first candidate distance-second candidate distance) is checked. That is, if (first candidate distance-second candidate distance) is smaller than the third pre-stage knowledge ((first candidate distance-second candidate distance) parameter in FIG. 7), "unknown" data is set.

【０１００】図８には、以上の手順をフローチャートと
して示す。すなわち、まずステップ８０１では、ＯＣＲ
認識処理部１０１の認識結果１５１ｂから第１候補を取
得する。次に、ステップ８０２では、第１候補を参照
し、第１前段知識から「第２、第３候補に出現率の高い
候補文字」を抽出する。そして、ステップ８０３では、
認識結果１５１ｂの第２、第３候補と第１前段知識の第
２、第３候補とが一致するかを判断する。ここで、一致
しない場合には、ステップ８０８に進んで、認識結果１
５１ｂの第１候補を”不明”とする。FIG. 8 shows the above procedure as a flowchart. That is, first, in step 801, the OCR is performed.
The first candidate is acquired from the recognition result 151b of the recognition processing unit 101. Next, in step 802, the “first candidate” is referred to and “a candidate character having a high appearance rate in the second and third candidates” is extracted from the first preliminary knowledge. Then, in step 803,
It is determined whether the second and third candidates of the recognition result 151b match the second and third candidates of the first preliminary knowledge. If they do not match, the process proceeds to step 808, and the recognition result 1
The first candidate of 51b is set to "unknown".

【０１０１】次に、ステップ８０３において一致した場
合には、ステップ８０４に進んで、第２および第３前段
知識から「第１候補距離」パラメータαおよび「第１−
第２候補距離差」パラメータβを取得する。そして、ス
テップ８０５では、認識結果１５１ｂの第１候補距離と
「第１候補距離」パラメータαとを比較して、第１候補
距離が「第１候補距離」パラメータαより小さければス
テップ８０８に進んで、認識結果１５１ｂの第１候補
を”不明”とする。Next, if they match in step 803, the process proceeds to step 804, in which the "first candidate distance" parameter α and the "first-first distance" are calculated from the second and third pre-stage knowledge.
The second candidate distance difference ”parameter β is acquired. Then, in step 805, the first candidate distance of the recognition result 151b is compared with the “first candidate distance” parameter α, and if the first candidate distance is smaller than the “first candidate distance” parameter α, the process proceeds to step 808. , The first candidate of the recognition result 151b is "unknown".

【０１０２】また、ステップ８０５において、第１候補
距離が「第１候補距離」パラメータαより大きければ、
ステップ８０６に進んで、認識結果１５１ｂの（第１候
補距離−第２候補距離）と「第１−第２候補距離差」パ
ラメータβとを比較して、（第１候補距離−第２候補距
離）が「第１−第２候補距離差」パラメータβより小さ
ければステップ８０８に進んで、認識結果１５１ｂの第
１候補を”不明”とする。If the first candidate distance is larger than the "first candidate distance" parameter α in step 805,
In step 806, the (first candidate distance-second candidate distance) of the recognition result 151b is compared with the "first-second candidate distance difference" parameter β to calculate (first candidate distance-second candidate distance). Is smaller than the “first-second candidate distance difference” parameter β, the process proceeds to step 808, and the first candidate of the recognition result 151b is set to “unknown”.

【０１０３】また、ステップ８０６において、（第１候
補距離−第２候補距離）が「第１−第２候補距離差」パ
ラメータβより大きければステップ８０７に進んで、認
識結果１５１ｂの第１候補を”正解”と確定する。If (first candidate distance-second candidate distance) is larger than the "first-second candidate distance difference" parameter β in step 806, the process proceeds to step 807 to select the first candidate of the recognition result 151b. Confirm the "correct answer".

【０１０４】尚、図８中で用いている「第１候補距離」
パラメータαおよび「第１−第２候補距離差」パラメー
タβは、外部から変動させることができる。つまり、図
７に示したパラメータは、各文字（カテゴリ）毎に用い
る標準的なパラメータであり、これを変動させることで
確定、未確定データ数の割合を調節できる。The "first candidate distance" used in FIG.
The parameter α and the “first-second candidate distance difference” parameter β can be changed from the outside. That is, the parameters shown in FIG. 7 are standard parameters used for each character (category), and by varying this, the ratio of the number of confirmed and unconfirmed data can be adjusted.

【０１０５】すなわち、「第１候補距離」パラメータα
および「第１−第２候補距離差」パラメータβは、次式
において、Ｋｄ、第１固定値および第２固定値を可変設
定することで調整可能である。That is, the "first candidate distance" parameter α
The “first-second candidate distance difference” parameter β can be adjusted by variably setting Kd, the first fixed value, and the second fixed value in the following equation.

【０１０６】α＝第１候補距離＋第１固定値×Ｋｄ（−１．０≦Ｋｄ≦１．０） β＝（第１候補距離−第２候補距離）＋第２固定値×Ｋ
ｄ（−１．０≦Ｋｄ≦１．０）ここで、第１固定値および第２固定値の決定法は、図７
のパラメータを基準に±２の変動幅を持たせる場合に
は、第１固定値＝第２固定値＝２とする。また、図７の
パラメータを基準値とすると、パラメータαおよびβが
共に大きいほど、図８における条件（ステップ８０５，
８０６）を通過できるデータ数は少なくなるから、Ｋｄ
が１．０に近くなればなるほど厳しい設定ということに
なる。逆に、Ｋｄが−１．０に近くなればなるほど緩い
設定ということになる。Α = first candidate distance + first fixed value × Kd (−1.0 ≦ Kd ≦ 1.0) β = (first candidate distance−second candidate distance) + second fixed value × K
d (−1.0 ≦ Kd ≦ 1.0) Here, the method for determining the first fixed value and the second fixed value is shown in FIG.
When the fluctuation range of ± 2 is provided with reference to the parameter of, the first fixed value = the second fixed value = 2. Further, when the parameters in FIG. 7 are used as reference values, the larger the parameters α and β are, the conditions in FIG. 8 (step 805,
Since the number of data that can pass 806) is small, Kd
The closer to 1.0, the more severe the setting. Conversely, the closer Kd is to -1.0, the looser the setting.

【０１０７】次に、後段知識の作成部１１３による後段
知識ベース１１４の構築方法について説明する。本実施
形態では、上位候補に出現率が高い文字（すなわち、誤
認識し易い文字）の上位３つを、クラスタ使用軸として
選択し、３次元空間にクラスタを形成することとする。
つまり、図７における出現率の高い候補文字を軸として
選択したことになる。これらの軸を使用して、クラスタ
リングを実施するまでの過程を以下に簡単に述べる。Next, a method of constructing the latter-stage knowledge base 114 by the latter-stage knowledge creating unit 113 will be described. In the present embodiment, the top three characters having a high appearance rate (that is, the characters that are likely to be erroneously recognized) are selected as the top candidates as the cluster use axis, and the cluster is formed in the three-dimensional space.
That is, the candidate character having a high appearance rate in FIG. 7 is selected as the axis. The process of performing clustering using these axes will be briefly described below.

【０１０８】まず（手順１）、ＯＣＲ認識処理部１０１
から認識結果（知識生成用）１５１ａを受け取り、その
データをカテゴリ別（１３個）に分類する。次に（手順
２）、カテゴリ別に分類されたデータを、ＯＣＲ認識結
果正解データ（以下、甲群という）と誤認識結果（以
下、乙群という）に分類する。次に（手順３）、図７に
示した各カテゴリの「第２、第３候補に出現率の高い候
補文字」のデータ群を、甲群のデータから抽出して、こ
れらを出現率の高い順にＡ、ＢおよびＣとする。カテゴ
リ”０”を例に取ると、０（Ａ）、６（Ｂ）、９（Ｃ）
のカテゴリの甲群データを抽出する。First (procedure 1), the OCR recognition processing unit 101
The recognition result (for knowledge generation) 151a is received from and the data is classified into categories (13 pieces). Next (procedure 2), the data classified by category is classified into OCR recognition result correct data (hereinafter referred to as "group A") and error recognition result (hereinafter referred to as "group B"). Next (procedure 3), a data group of "candidate characters having a high appearance rate in the second and third candidates" of each category shown in FIG. 7 is extracted from the data of group A, and these are extracted with a high appearance rate. Let A, B and C be in order. Taking category "0" as an example, 0 (A), 6 (B), 9 (C)
The group A data of the category is extracted.

【０１０９】次に（手順４）、Ａのデータ群を対象に、
軸Ａ、軸Ｂおよび軸Ｃのデータを抽出してクラスタリン
グを実施する。ここで作成されたクラスタをａとする。
次に（手順５）、Ｂのデータ群を対象に、軸Ａ、軸Ｂお
よび軸Ｃのデータを抽出してクラスタリングを実施す
る。ここで作成されたクラスタをｂとする。さらに（手
順６）、Ｃのデータ群を対象に、軸Ａ、軸Ｂおよび軸Ｃ
のデータを抽出してクラスタリングを実施する。ここで
作成されたクラスタをｃとする。次に（手順７）、乙群
のデータに対して、軸Ａ、軸Ｂおよび軸Ｃのデータを抽
出してクラスタリングを実施する。Next (procedure 4), for the data group A,
The data of axis A, axis B and axis C are extracted and clustering is performed. The cluster created here is a.
Next (procedure 5), data of axis A, axis B, and axis C is extracted and clustered for the data group of B. Let the cluster created here be b. Further (procedure 6), for the data group of C, axis A, axis B, and axis C
Data is extracted and clustering is performed. Let the cluster created here be c. Next (procedure 7), data of axis A, axis B, and axis C is extracted from the data of group B to perform clustering.

【０１１０】ここで、カテゴリは１３個あるので手順１
〜手順６および手順７はそれぞれ１３回実施することに
なる。また、甲群および乙群それぞれについて、複数の
クラスタが生成されることになる。尚、手順３〜手順６
で作成されたクラスタを「ＯＫクラスタ」（ａ、ｂ、ｃ
が含まれる）と呼び、手順７で作成されたクラスタを
「ＮＧクラスタ」と呼ぶ。Here, since there are 13 categories, the procedure 1
-Procedure 6 and procedure 7 will be performed 13 times, respectively. Also, a plurality of clusters will be generated for each of group A and group B. Incidentally, steps 3 to 6
"OK cluster" (a, b, c
Is included) and the cluster created in step 7 is called an “NG cluster”.

【０１１１】次に、本実施形態で採用するクラスタリン
グ手法（超楕円体法）について説明する。クラスタ間距
離が近いものの同士を合併しクラスタを生成していく通
常のクラスタリング方法とは異なり、超楕円体法による
クラスタリングは、線型性（線型モデルをいかにして形
成できるかということを意味する）と、連続性（クラス
タを構成するデータの分布が連続的かどうかを意味す
る）とを考慮してクラスタを生成することができる。Next, the clustering method (hyperellipsoidal method) used in this embodiment will be described. Unlike the normal clustering method that merges clusters that are close to each other in distance to generate clusters, clustering by the hyperellipsoid method is linear (meaning how a linear model can be formed). And the continuity (meaning whether or not the distribution of data forming the cluster is continuous) can be taken into consideration to generate the cluster.

【０１１２】通常のクラスタリングでは、距離という規
範のみで一度クラスタが合併されると、形状は変更でき
ない。つまり、合併後、クラスタを形成している一部分
を分離するための規範が無いのである。しかし、超楕円
体クラスタリングでは、線型性と連続性の考慮により、
合併対象クラスタの変更や、合併後のクラスタ構成デー
タの移動により、定義した規範に即したクラスタ群を構
築することができる。In normal clustering, the shape cannot be changed once the clusters are merged only by the criterion of distance. That is, there is no standard for separating the parts forming the cluster after the merger. However, in hyperelliptic clustering, linearity and continuity are taken into account,
By changing the cluster to be merged and moving the cluster configuration data after the merger, it is possible to construct a cluster group according to the defined norm.

【０１１３】図９に示すクラスタ生成を例に採ると、連
続度を重視した場合には、距離の近いクラスタＡとＢが
合併され、構成データ密度の高いクラスタが生成され
る。また、線型度を重視した場合には、クラスタＢとＣ
が合併され、さらに必要があれば一点移動をおこない、
線型モデルが構築し易いデータ分布のクラスタが生成さ
れることになる。尚、超楕円体クラスタリングの規範
は、各クラスタの体積（超積）の和を最小にすることで
ある。Taking the cluster generation shown in FIG. 9 as an example, when importance is attached to the degree of continuity, the clusters A and B having a close distance are merged to generate a cluster having a high configuration data density. If linearity is emphasized, clusters B and C are
Will be merged, and if necessary, we will move one point,
A cluster having a data distribution that is easy to construct a linear model is generated. The standard of hyper-ellipsoidal clustering is to minimize the sum of the volumes (super products) of each cluster.

【０１１４】次に、超楕円体クラスタリングの処理の流
れについて、以下、簡単に説明する。超楕円体法クラス
タリングの処理の概要は、図１０に示すフローチャート
の通りである。前述した通り、この処理を文字（カテゴ
リ）毎に行い、複数のクラスタ／カテゴリを作成する。Next, the flow of the hyperelliptic clustering process will be briefly described below. The outline of the hyperelliptic method clustering processing is as shown in the flowchart in FIG. 10. As described above, this process is performed for each character (category) to create a plurality of clusters / categories.

【０１１５】以下に、処理のステップを簡単に述べる。
まず、ステップ１００１では各種パラメータを設定す
る。次に、初期のクラスタは、１［データ／クラスタ］
で形成されており、そのクラスタを、クラスタ間距離が
近いもの同士で合併し、処理クラスタ群を形成する。こ
こで用いられるのがウォード法と呼ばれる従来のクラス
タリング手法（ステップ１００２）である。The processing steps will be briefly described below.
First, in step 1001, various parameters are set. Next, the initial cluster is 1 [data / cluster]
And the clusters having a short inter-cluster distance are merged to form a processing cluster group. The conventional clustering method (step 1002) called the Ward method is used here.

【０１１６】次に、ステップ１００３では、複数［デー
タ／クラスタ］となった後（中には１［データ／クラス
タ］のものも存在する）、超楕円体の評価規範であるク
ラスタ体積和の増加を最小にするクラスタ同士を合併す
る（ステップ１００５）。ただし、合併対象のクラスタ
重心間距離は、δｕ以下でなければならない（ステップ
１００４）。Next, in step 1003, after a plurality of [data / cluster] (including 1 [data / cluster] in some cases), the cluster volume sum, which is the evaluation criterion for the hyperellipsoid, increases. The clusters that minimize? Are merged (step 1005). However, the distance between the cluster centroids to be merged must be δu or less (step 1004).

【０１１７】次に、合併後、クラスタ数を変更せずに、
構成データユニットのクラスタ間移動を行う。ここで移
動対象となるのは、超楕円体の評価規範であるクラスタ
体積和の減少量が最大となるデータである（ステップ１
００７〜１０１０）。ただし、データの移動をおこなう
クラスタは、クラスタ重心間処理がδｍ以下でなければ
ならない（ステップ１００８）。Next, after the merger, without changing the number of clusters,
Move configuration data units between clusters. Here, the data to be moved is the data in which the amount of decrease in the cluster volume sum, which is the evaluation criterion for the hyperellipsoid, is maximum (step 1
007-1010). However, the cluster between which the data is moved must have a processing between cluster centroids of δm or less (step 1008).

【０１１８】最後に、終了判定を行う。クラスタの個数
が指定した最終クラスタ数であるか（ステップ１００
６）、或いは、規範増加量が指定値以上である（ステッ
プ１０１１）とき終了する。そうでなければ、ステップ
１００５〜１０１０を繰り返す。Finally, the end judgment is made. Whether the number of clusters is the designated final cluster number (step 100
6) Alternatively, when the reference increase amount is equal to or greater than the specified value (step 1011), the process ends. Otherwise, steps 1005-1010 are repeated.

【０１１９】尚、上記説明の中で用いたパラメータδｕ
およびδｍには、次のような意味がある。すなわち、δ
ｕは合併範囲制限パラメータであり、δｕを大きくする
と線型度重視となり、δｕ小さくすると連続度重視とな
る。またδｍは移動範囲制限パラメータであり、δｍを
大きくすると線型度重視であり、δｍを小さくすると連
続度重視である。これらのパラメータδｕおよびδｍ
は、一意に決定できるものではないため、試行錯誤して
設定することが必要である。The parameter δu used in the above description
And δm have the following meanings. That is, δ
u is a merging range limiting parameter. When δu is increased, linearity is emphasized, and when δu is decreased, continuity is emphasized. Further, δm is a movement range limiting parameter. When δm is increased, linearity is emphasized, and when δm is decreased, continuity is emphasized. These parameters δu and δm
Cannot be uniquely determined, so it is necessary to set it by trial and error.

【０１２０】以上のようなクラスタリングを実行した結
果、後段知識としてファイル出力される場合のデータフ
ォーマットを図１１に示す。FIG. 11 shows a data format when a file is output as post-stage knowledge as a result of executing the above clustering.

【０１２１】次に、楕円体と各パラメータの関係につい
て説明する。クラスタ構成データで生成される共分散行
列の固有ベクトルが楕円体の広がり方向を決め、その方
向の楕円体半径は共分散行列の固有値の平方根に相当す
る。例えば、１データが１３の成分からなり、全ての成
分を使用して超楕円体クラスタリングを行うと、１３×
１３の共分散行列が形成される。そして、その固有ベク
トル、固有値はそれぞれ１３個求めることができる。た
だし、図１１のクラスタデータフォーマットに示すよう
に、共分散行列は後段知識として保存しない。Next, the relationship between the ellipsoid and each parameter will be described. The eigenvectors of the covariance matrix generated by the cluster configuration data determine the spreading direction of the ellipsoid, and the ellipsoid radius in that direction corresponds to the square root of the eigenvalues of the covariance matrix. For example, if one data consists of 13 components and hyper-ellipsoidal clustering is performed using all components, 13 ×
Thirteen covariance matrices are formed. Then, 13 eigenvectors and 13 eigenvalues can be obtained. However, as shown in the cluster data format of FIG. 11, the covariance matrix is not stored as the latter knowledge.

【０１２２】図１２には、楕円体と固有値および固有ベ
クトルの関係を説明する説明図を示す。また、楕円体
式、楕円体体積は次式であらわすことができる。FIG. 12 is an explanatory view for explaining the relationship between the ellipsoid and the eigenvalues and eigenvectors. Further, the ellipsoidal formula and the ellipsoidal volume can be represented by the following formulas.

【０１２３】[0123]

【数１】 [Equation 1]

【数２】次に、後段判定部１１６による判定方法について説明す
る。後段判定部１１６は、前述のように、前段判定部１
１５で未確定とされた不明データ１５４について、確
定、未確定を再度チェックするものである。チェック
は、後段知識の作成部１１３で作成した後段知識ベース
１１４（ＯＫクラスタ）に対象データが含まれるか否か
で行う。もし、あるカテゴリのクラスタに含まれる場合
は、その確信度（信頼度）を計算する確信度算出手段を
持つ。[Equation 2] Next, the determination method by the latter-stage determination unit 116 will be described. As described above, the latter-stage determination unit 116 is the former-stage determination unit 1
With respect to the unknown data 154 which has been undecided in 15, the undecided data is checked again. The check is performed depending on whether or not the target data is included in the subsequent knowledge base 114 (OK cluster) created by the subsequent knowledge creating unit 113. If it is included in the cluster of a certain category, it has a certainty factor calculation means for calculating the certainty factor (reliability).

【０１２４】後段判定部１１６における処理の流れを説
明するフローチャートを図１３に示す。まず、ステップ
１３０１では、確定対象データ１５４の第１候補を取得
し、その第１候補で利用する軸データ（３次元）を抽出
する。例えば、第１候補が”０”ならば、図７より、
０、６、９の軸を選択する。FIG. 13 shows a flow chart for explaining the flow of processing in the latter-stage judging section 116. First, in step 1301, the first candidate of the confirmation target data 154 is acquired, and the axis data (three-dimensional) used by the first candidate is extracted. For example, if the first candidate is “0”, then from FIG.
Select the 0, 6, and 9 axes.

【０１２５】次に、ステップ１３０３では、後段知識と
して所有しているクラスタのうち、「ＯＫクラスタ」の
全てに対して、内部に含まれるかチェックを行う。Next, in step 1303, it is checked whether all of the “OK clusters” among the clusters possessed as the latter knowledge are included in the inside.

【０１２６】ステップ１３０３において、あるクラスタ
内部に含まれる場合には、ステップ１３０４に進んで、
クラスタ重心からの距離による確信度を算出し、続いて
ステップ１３０５で、クラスタ内座標値による確信度を
算出する。そして、ステップ１３０６では、これら２つ
の確信度を加算して最終的な確信度とする。以上の処理
を全クラスタ似付いて行う（ステップ１３０２〜１３０
７）。In step 1303, if it is included in a certain cluster, the process proceeds to step 1304,
The certainty factor based on the distance from the cluster centroid is calculated, and subsequently, in step 1305, the certainty factor based on the intra-cluster coordinate values is calculated. Then, in step 1306, these two certainty factors are added to obtain the final certainty factor. The above process is performed for all clusters (steps 1302 to 130).
7).

【０１２７】ここで、算出した確信度を、そのクラスタ
を作成する際に使用したデータ群（カテゴリ）に対する
所属度合いとする。つまり、後段知識に作成されている
ａというクラスタに入った場合はＡに対する所属度合
い、また、ｂというクラスタに入った場合はＢに対する
所属度合い、さらに、ｃというクラスタに入った場合は
Ｃに対する所属度合いとなる。Here, the calculated certainty factor is the degree of belonging to the data group (category) used when creating the cluster. That is, the degree of belonging to A when entering the cluster a created in the latter knowledge, the degree of belonging to B when entering the cluster b, and the degree of belonging to C when entering the cluster c. It becomes a degree.

【０１２８】”０”というカテゴリについて、更に具体
例を述べる。”０”というカテゴリのＯＫクラスタは、
０、６、９カテゴリのデータ群それぞれについて、０
軸、６軸、９軸を用いてクラスタが作成されている。こ
のうち、０のデータ群で作成されたクラスタに入った場
合は０に対する所属度合いとなる。また６のデータ群で
作成されたクラスタに入った場合は６に対する所属度合
い、９のデータ群で作成されたクラスタに入った場合は
９に対する所属度合いとなる。A specific example will be described for the category "0". The OK cluster in the category "0" is
0 for each of the data groups of 0, 6, and 9 categories
A cluster is created using axes 6, 6 and 9. Of these, if it enters a cluster created with a data group of 0, the degree of belonging is 0. In addition, when a cluster created by the data group 6 is included, the degree of belonging to 6 is entered, and when a cluster created by the group of 9 data is entered, the degree of belonging to 9 is entered.

【０１２９】次に、ステップ１３０８では、内部に含ま
れるクラスタの有無を判断する。複数のクラスタに含ま
れる場合は、ステップ１３０９で、最大確信度を得たク
ラスタのカテゴリを抽出し、そのクラスタを生成する際
に用いたデータ群を認識結果とする。Next, in step 1308, it is determined whether or not there is a cluster contained inside. If it is included in a plurality of clusters, in step 1309, the category of the cluster having the highest certainty factor is extracted, and the data group used when generating the cluster is used as the recognition result.

【０１３０】次に、ステップ１３１２では、知識として
所有しているクラスタのうち、ステップ１３０９で得ら
れたカテゴリ（認識結果）の所有するＮＧクラスタ全て
に対して、内部に含まれるかのチェックを行う。もしＮ
Ｇクラスタ内部に含まれていた場合には、ステップ１３
１３において、ステップ１３０９で得られた確信度を、
ある規則に従って小さくなるような方向に微調整する。
内部に含まれない場合は、ステップ１３０９の結果を確
定する。Next, in step 1312, of the clusters owned as knowledge, it is checked whether all the NG clusters owned by the category (recognition result) obtained in step 1309 are included in the inside. . If N
If it is included in the G cluster, step 13
In step 13, the certainty factor obtained in step 1309 is
Fine-tune it according to a rule so that it becomes smaller.
If it is not included inside, the result of step 1309 is confirmed.

【０１３１】ここで注目すべき点は、ステップ１３０９
の処理によって、ＯＣＲ認識処理部１０１による認識結
果が覆され、認識結果を置き直して正解とすることがで
きる点である。The point to be noted here is step 1309.
This is because the recognition result by the OCR recognition processing unit 101 is overridden and the recognition result can be replaced to obtain the correct answer.

【０１３２】次に、「ＯＫクラスタ」のチェック（ステ
ップ１３０３）について詳しく説明する。確定対象デー
タがクラスタ内部に含まれるかどうかは、クラスタリン
グにおいて使用する３軸をクラスタの持つ固有ベクトル
で新座標に変換した後、楕円体クラスタを表す次の楕円
体式に代入して、算出された距離が楕円体クラスタのカ
バリング距離以下であるかどうかで判断する。尚、次式
において、γは３軸座標を固有ベクトルの座標に変換し
たものである。Next, the check of "OK cluster" (step 1303) will be described in detail. Whether or not the data to be determined is included in the cluster is calculated by converting the three axes used in clustering into new coordinates with the eigenvectors of the cluster and then substituting into the following ellipsoidal expression representing the ellipsoidal cluster. Is less than or equal to the covering distance of the ellipsoidal cluster. In the following equation, γ is the coordinate of three axes converted to the coordinate of the eigenvector.

【０１３３】[0133]

【数３】ここで、もしカバリング距離以下であれば、そのクラス
タに含まれることになり、確信度計算処理（ステップ１
３０４〜１３０６）へ進む。逆に、カバリング距離以上
であれば、他のＯＫクラスタのチェックへ進む。[Equation 3] If the distance is less than the covering distance, it is included in the cluster, and the confidence factor calculation process (step 1
304-1306). On the contrary, if the distance is equal to or more than the covering distance, the process proceeds to check another OK cluster.

【０１３４】次に、確信度の計算処理（ステップ１３０
４〜１３０６）について詳しく説明する。まず、認識結
果があるクラスタ内部に含まれると判断された時、算出
するクラスタ重心からの距離の確信度（ステップ１３０
４）についての説明を行う。Next, the confidence factor calculation process (step 130)
4 to 1306) will be described in detail. First, when it is determined that the recognition result is included in the cluster, the certainty factor of the distance from the cluster centroid to be calculated (step 130).
4) will be described.

【０１３５】概要を簡単に述べると、クラスタ知識を生
成した際に確保されたファイル（図１１参照）の中で、
ｄｉｓ＿ｍａｘ、ｄｉｓ＿ｍｅｄｉａｎ、ｄｉｓ＿ｍｉ
ｎ（以下、距離ポイントデータという）を用いてメンバ
シップ関数を生成し、その関数への適合度を計算するの
である。ここで用いる距離ポイントデータとは、クラス
タを構成するデータ全てについて中心からの距離を算出
し、その中の最大、中央値、最小値を保存したものであ
る。平均値ではなく中央値を採用したのは、あるかけ離
れた値によって値の変動が少ないデータという理由から
である。ただし、現時点ではｄｉｓ＿ｍａｘの値は保存
しているだけで使用していない。To briefly explain the outline, in the file (see FIG. 11) secured when the cluster knowledge is generated,
dis_max, dis_median, dis_mi
The membership function is generated by using n (hereinafter referred to as distance point data), and the fitness to the function is calculated. The distance point data used here is data in which the distance from the center is calculated for all data forming the cluster, and the maximum, median, and minimum values among them are stored. The reason why the median value is used instead of the average value is that the data does not fluctuate much due to some distant values. However, at this time, the value of dis_max is only stored and not used.

【０１３６】採用するメンバシップ関数の数式は、次の
通りである。式のｄｉｓに認識対象文字の距離を入れる
ことで距離が算出される。尚、ｔ＞０であり、ｄｉｓ＞
ｄｉｓ＿ｍｉｎならば適合度＝１である。The formula of the membership function adopted is as follows. The distance is calculated by inserting the distance of the recognition target character into the expression dis. Note that t> 0 and dis>
If dis_min, the degree of conformity = 1.

【０１３７】[0137]

【数４】また、中心からの距離の確信度算出で用いるメンバシッ
プ関数を図示すると図１４のようになる。ここで、図１
４の概略を述べておく。認識対象データのクラスタ中心
からの距離がｄｉｓ＿ｍｉｎ以下の時、適合度＝１を与
える。メンバシップ関数の変極点は、ｄｉｓ＿ｍｅｄｉ
ａｎである。ｔの値を変化させることで、算出される適
合度の大きさを変更できる。すなわち、ｔが大きくなる
ほど算出適合度は大きくなる。[Equation 4] Further, FIG. 14 shows the membership function used for calculating the certainty factor of the distance from the center. Here, FIG.
The outline of No. 4 will be described. When the distance from the cluster center of the recognition target data is equal to or less than dis_min, the goodness of fit = 1 is given. The inflection point of the membership function is dis_medi
It is an. The magnitude of the calculated goodness of fit can be changed by changing the value of t. That is, the greater t is, the greater the calculation suitability is.

【０１３８】尚、後段判定部１１６で用いている「距
離」とは、楕円体の固有ベクトル方向の軸長さを考慮し
たものであり、メンバシップ関数と楕円体の関係は、図
１５の適合度と楕円の関係に示す如くなる。説明に用い
るのは２次元の楕円で、ｘ、ｙ軸方向への広がりを持
つ。ここで算出された適合度を、「クラスタ重心からの
確信度」という。The "distance" used in the post-stage judging section 116 is a consideration of the axial length of the ellipsoid in the direction of the eigenvector, and the relationship between the membership function and the ellipsoid is the conformity of FIG. And ellipse. A two-dimensional ellipse is used for the explanation and has a spread in the x and y axis directions. The goodness of fit calculated here is referred to as the "certainty from the cluster centroid".

【０１３９】次に、クラスタ内座標からの確信度の算出
（ステップ１３０５）について説明する。前述のよう
に、後段知識の作成部１１３による後段知識ベース１１
４の構築において、クラスタを構成するデータの座標値
の頻度を計算している。その頻度によって確信度を割り
当てたものが、図１１に示した保存知識おけるｆｒｅｑ
ｉｅｎｃｙである。２次元の楕円体についての場合に説
明を加える。Next, the calculation of the certainty factor from the intra-cluster coordinates (step 1305) will be described. As described above, the post-stage knowledge base 11 by the post-stage knowledge creating unit 113 is used.
In the construction of No. 4, the frequency of the coordinate values of the data forming the cluster is calculated. The confidence factor assigned according to the frequency is freq in the stored knowledge shown in FIG.
iency. A description will be added in the case of a two-dimensional ellipsoid.

【０１４０】図１６に示されるようなデータの分布でク
ラスタが構成される場合、確信度を求める関数は、頻度
の高い座標値付近で高い値を取るよう設定されている。
これは、このクラスタに属するデータであれば、どの座
標値に偏って集まるかを示していることになる。図１６
では、第１象限に偏って集まることが示されている。つ
まり、第１象限に確定対象データが含まれた場合には、
高い確信度を算出する。逆に、第３、第４象限では低い
確信度が算出されることとなる。When a cluster is formed by the distribution of data as shown in FIG. 16, the function for obtaining the certainty factor is set so as to take a high value in the vicinity of frequently used coordinate values.
This indicates to which coordinate value the data belonging to this cluster is concentrated. FIG.
Shows that they are concentrated in the first quadrant. That is, when the data to be determined is included in the first quadrant,
Calculate high confidence. On the contrary, a low certainty factor is calculated in the third and fourth quadrants.

【０１４１】尚、確信度関数のピーク値は、１を使用す
る軸数（クラスタ構成軸数）で割ったものを使用する。
つまり、２軸でクラスタを構成しているならば、各軸の
確信度関数のピーク値は０．５であり、３軸の場合は
０．３３である。よって、各軸で算出された確信度を足
し合わせたときに、最大で１を取るように設定する。The peak value of the certainty factor function is obtained by dividing 1 by the number of axes used (number of cluster constituent axes).
That is, if a cluster is composed of two axes, the peak value of the certainty factor function of each axis is 0.5, and if it is three axes, it is 0.33. Therefore, when the certainty factors calculated for each axis are added together, the maximum value is set to 1.

【０１４２】次に、最大確信度の抽出処理（ステップ１
３０９）について説明する。複数クラスタに確定対象デ
ータが含まれる場合が存在する。その場合、全ての算出
された確信度の中で、最大の確信度を与えるクラスタを
生成する際に用いたデータ群（カテゴリ）をこの時点で
有効とする。前述した通り、この時点で、ＯＣＲ認識処
理部１０１による認識結果の第１候補が有効でなくなる
場合があり得る。Next, the maximum confidence factor extraction processing (step 1
309) will be described. There are cases where data to be confirmed is included in multiple clusters. In that case, the data group (category) used when generating the cluster that gives the maximum certainty factor among all the calculated certainty factors is valid at this time. As described above, the first candidate of the recognition result by the OCR recognition processing unit 101 may be invalid at this point.

【０１４３】次に、ＮＧクラスタチェック（ステップ１
３０８）について説明する。最大確信度の抽出処理（ス
テップ１３０９）において、有効なカテゴリが算出され
た場合には、ＮＧクラスタチェックを行う。ＮＧクラス
タとは、前述した通り、ＯＣＲ認識処理部１０１で誤認
識しているデータのみを集めてクラスタを生成したもの
である。クラスタの生成法は、ＯＫクラスタと同様であ
る。Next, NG cluster check (step 1
308) will be described. When a valid category is calculated in the maximum confidence factor extraction process (step 1309), an NG cluster check is performed. As described above, the NG cluster is a cluster generated by collecting only the data erroneously recognized by the OCR recognition processing unit 101. The cluster generation method is similar to the OK cluster.

【０１４４】チェック対象のＮＧクラスタは、有効なカ
テゴリとして算出されたカテゴリに属するＮＧクラスタ
全てである。つまり、２が有効クラスタと判断された場
合は、本当は２であるのに他のカテゴリに誤って認識さ
れたしまったデータを集めて作成されたＮＧクラスタす
べてでチェックを行うのである。The NG clusters to be checked are all NG clusters belonging to the category calculated as a valid category. In other words, if 2 is determined to be a valid cluster, the check is performed on all NG clusters created by collecting data that is actually 2 but has been erroneously recognized by another category.

【０１４５】次に、確信度を低くする処理（ステップ１
３１３）について説明する。ＮＧクラスタチェック（ス
テップ１３０８）で、あるＮＧクラスタに入った場合、
最大確信度の抽出処理（ステップ１３０９）において算
出された確信度を低くする処理を行う。ＮＧクラスタ
も、ＯＫクラスタと同様に、第２、第３候補に出現し易
い候補を軸として作成されている。つまり、ＯＫクラス
タ、ＮＧクラスタは、同じ空間（軸も同じ）内に存在す
ることになる。Next, the process of lowering the certainty factor (step 1
313) will be described. If a certain NG cluster is entered in the NG cluster check (step 1308),
A process of lowering the certainty factor calculated in the maximum certainty factor extraction process (step 1309) is performed. Like the OK cluster, the NG cluster is also created with the candidates that are likely to appear in the second and third candidates as the axis. That is, the OK cluster and the NG cluster exist in the same space (same axis).

【０１４６】図１７には、あるカテゴリのＯＫクラスタ
とＮＧクラスタの位置関係を示す。後段知識ベース１１
４の構築方法でも述べたように、Ａというカテゴリの第
２、第３候補に多く出現する文字（Ａ、Ｂ、Ｃ）を軸と
してとり、Ａという文字の正解データを集めたデータ群
の軸Ａ、軸Ｂ、軸Ｃから構成される「ＯＫクラスタａ」
と、Ｂという文字の正解データを集めたデータ群の軸
Ａ、軸Ｂ、軸Ｃから構成される「ＯＫクラスタｂ」と、
Ｃという文字の正解データを集めたデータ群の軸Ａ、軸
Ｂ、軸Ｃから構成される「ＯＫクラスタｃ」と、正解が
ＡなのにＢに間違えているデータ群の軸Ａ、軸Ｂ、軸Ｃ
から構成される「ＮＧクラスタ」とを、２次元に射影し
図示している。ただし、「ＯＫクラスタｃ」は、説明を
簡単にするために図示していない。FIG. 17 shows the positional relationship between OK clusters and NG clusters of a certain category. Second-stage knowledge base 11
As described in the construction method of No. 4, the character (A, B, C) that frequently appears in the second and third candidates of the category A is used as the axis, and the axis of the data group in which the correct answer data of the character A is collected. "OK cluster a" composed of A, axis B, and axis C
And “OK cluster b” composed of axis A, axis B, and axis C of a data group in which correct answer data of the letter B is collected,
"OK cluster c" composed of axis A, axis B, and axis C of the data group that collects correct answer data of the letter C, and axis A, axis B, and axis of the data group in which the correct answer is A but is mistaken for B C
The “NG cluster” composed of is shown in a two-dimensionally projected manner. However, the “OK cluster c” is not shown for the sake of simplicity.

【０１４７】図１７に示すように、ＯＫクラスタａとＯ
Ｋクラスタｂの重なるエリアに、確定対象のデータが存
在し、ＯＫクラスタチェック（ステップ１３０３）にお
いて、ＯＫクラスタｂの確信度が最大を取る場合で、Ａ
をＢと誤認識してしまったデータがあると仮定する。こ
のときそのデータは、ＮＧクラスタにも含まれるため、
図中のＮＧクラスタ内部の軸上に示す関数により、算出
された結果をＢの確信度に掛け合わせて調節する。すな
わち、ＯＫクラスタａ、ｂの重なりが大きい領域Ｒ４で
はＢの確信度はそれほど低くならないが、他の領域Ｒ
１，Ｒ２，Ｒ３ではかなり低くなる。As shown in FIG. 17, OK clusters a and O
If there is data to be confirmed in the area where the K cluster b overlaps, and the certainty factor of the OK cluster b is maximized in the OK cluster check (step 1303), A
It is assumed that there is data that has been erroneously recognized as B. At this time, since the data is also included in the NG cluster,
By the function shown on the axis inside the NG cluster in the figure, the calculated result is multiplied by the certainty factor of B to adjust. That is, in the region R4 where the overlap of the OK clusters a and b is large, the certainty factor of B is not so low, but the other region R
It becomes considerably lower for 1, R2 and R3.

【０１４８】次に、ルール判定部１１７による処理につ
いて詳しく説明する。ルール判定部１１７は、後段判定
部１１６の確信度算出手段によって算出された確信度を
元に、確定データとするか、或いは、未確定データとし
てリジェクト判定データとするか決定する処理を行うも
のである。図１８に、ルール判定部１１７による処理の
フローチャートを示す。Next, the processing by the rule determination section 117 will be described in detail. The rule determination unit 117 performs a process of determining whether the data is determined data or reject determination data as undetermined data, based on the certainty factor calculated by the certainty factor calculation unit of the subsequent determination unit 116. is there. FIG. 18 shows a flowchart of processing by the rule determination unit 117.

【０１４９】処理の内容は簡単で、ステップ１８０１に
おいて、確信度算出手段によって算出された確信度が所
定値γを越えるときは、ステップ１８０２に進んで確定
データとし、それ以下の値のときは、ステップ１８０３
に進んで正解と確信できない判定データとし、リジェク
ト対象データにする。ここで用いた所定値は、ユーザが
設定でき任意に設定できる。所定値を大きくすればする
ほど確定データは減少するようなしくみである。The content of the process is simple. In step 1801, if the certainty factor calculated by the certainty factor calculating means exceeds the predetermined value γ, the process proceeds to step 1802 to make the finalized data, and if the value is less than that, Step 1803
Proceed to step (4) and make the judgment data that cannot be convinced to be the correct answer, and set it as the rejection target data. The predetermined value used here can be set by the user and can be set arbitrarily. The larger the predetermined value, the smaller the fixed data.

【０１５０】以上説明したように、本実施形態の画像認
識装置によれば、予め、前段知識の作成部１１１におい
て、第１前段知識作成手段により第１前段知識（第１候
補カテゴリに対する類似カテゴリの関連）を、第２前段
知識作成手段により第２前段知識（特定順位の候補を”
正解”と確定し得る適合度（距離）の大きさ）を、第３
前段知識作成手段により第３前段知識（第１候補を”正
解”と確定し得る適合度（距離）差の大きさ）をそれぞ
れ前段知識ベース１１２に作成しておき、また、後段知
識の作成部１１３において、第１後段知識作成手段によ
り第１後段知識（ＯＫクラスタの形状）を、第２後段知
識作成手段により第２後段知識（ＮＧクラスタの形状）
をそれぞれ後段知識ベース１１４に作成しておき、前段
判定部（第１前段判定手段、第２前段判定手段および第
３前段判定手段）１１５では、これら前段知識ベース１
１２の第１、第２および第３前段知識を用いて、入力画
像の認識結果について、”正解”または”不明”（どの
カテゴリにも属さないデータ）の２つに認識結果を分別
して確定し、また、ここで”不明”とされた入力画像の
認識結果については、後段判定部（第１後段判定手段お
よび第２後段判定手段）１１６により、後段知識ベース
１１４の第１および第２後段知識を用いて、あるカテゴ
リのＯＫクラスタに入ったデータを該カテゴリに属す
る”正解”データと決定付け、逆にどのＯＫクラスタに
も入らない場合は”不明”（どのカテゴリにも属さな
い）データとした後、さらに、ＮＧクラスタの少なくと
も１つに入った場合は”不正解”データと決定し、３つ
に認識結果を分別して確定するようにして、再チェック
を行うようにしている。これにより、受付け誤認識率を
低下させることができ、画像認識システムにおける省力
化をさらに図ることができ、また、帳票読み込みの自動
化を図ることができる。さらに、画像認識システムの認
識結果を否定して”正解”の認識結果に置き換えること
ができるので、認識性能を向上させることができる。As described above, the image recognition of this embodiment is performed.
According to the identification device, the knowledge generation unit 111
Then, the first pre-stage knowledge is created by the first pre-stage knowledge creating means.
(Relationship of similar category to complementary category)
2nd pre-stage knowledge by knowledge creation means
The degree of goodness of fit (distance) that can be determined as “correct answer”
The third-stage knowledge (first candidate is "correct"
The degree of goodness of fit (distance) that can be determined as a "solution"
Created in the previous knowledge base 112,
In the knowledge creation unit 113, the first second-stage knowledge creation means
1st second-stage knowledge (shape of OK cluster), second second-stage knowledge
Second-stage knowledge (shape of NG cluster) by knowledge creation means
Are created in the post-stage knowledge base 114, respectively.
Determination unit (first front stage determination unit, second front stage determination unit, and
3 pre-stage determination means) 115, these pre-stage knowledge bases 1
Input image using 12 first, second and third pre-knowledge
About the recognition result of the image, "correct answer" or "unknown" (which
Classification of the recognition results into two (data that does not belong to any category)
Of the input image that was confirmed as “Unknown” here.
For the recognition result, the second-stage determination unit (first second-stage determination unit or
And second second-stage determination means) 116, the second-stage knowledge base
Using the first and second latter knowledge of 114,
Data that has entered the OK cluster of
"OK" data, and to which OK cluster
If you don't enter, "Unknown" (does not belong to any category
Data) and then at least NG clusters
If 1 is included, it is determined as "incorrect answer" data and 3
Re-check by separating and confirming recognition results
I'm trying to do. This will reduce the false recognition rate
Can be lowered, labor saving in image recognition system
Can be made even more automated, and the form can be automatically read.
Can be realized. In addition, the recognition of the image recognition system
Replacing the recognition result with a "correct answer" recognition result
As a result, the recognition performance can be improved.

【０１５１】[0151]

【発明の効果】以上の説明から明らかなように、本発明
によれば、第１前段知識作成手段（第１前段知識作成ス
テップ）により、画像認識の認識結果およびその付随情
報を含むデータから、認識結果として同じ第１候補のカ
テゴリを持つデータを集め、該集めたデータから第２候
補以下の特定の候補順位における類似カテゴリの出現率
の高いものを抽出して、第１候補カテゴリに対する類似
カテゴリの関連を示す第１前段知識として保持し、第１
前段判定手段（第１前段判定ステップ）では、入力画像
の認識結果として第１候補のカテゴリおよび第２候補以
下の特定の候補順位の類似カテゴリが得られるとき、該
第１候補のカテゴリおよび第２候補以下の特定候補順位
の類似カテゴリと前記第１前段知識に保持された関連と
の比較により、該第１候補のカテゴリが”正解”か否か
を判定することとしたので、正解として確定できたもの
については人がチェックする必要がなくなり、正解と確
定できなかったものについてのみチェックするだけでよ
くなることから、受付け誤認識率を低下させることがで
き、画像認識システムにおける省力化をさらに図り得る
画像認識装置、画像認識方法および記録媒体を提供する
ことができる。As is apparent from the above description, according to the present invention, the first pre-stage knowledge creating means (first pre-stage knowledge creating step) is used to extract the data including the recognition result of image recognition and the accompanying information thereof. Data having the same first candidate category as the recognition result is collected, and second data is collected from the collected data .
In addition, the ones with a high appearance rate of similar categories in the following specific candidate ranks are extracted and held as first-stage knowledge that indicates the relationship of similar categories to the first candidate category.
In the first-stage determination means (first first-stage determination step), as the recognition result of the input image, the category of the first candidate and the second candidate
When a lower similar category with a specific candidate rank is obtained, by comparing the category of the first candidate and the similar category with a specific candidate rank lower than or equal to the second candidate with the relation held in the first pre-stage knowledge, Whether 1 candidate category is "correct"
Since it is decided to judge, it is not necessary for a person to check what was confirmed as a correct answer, and it suffices to check only those that could not be confirmed as a correct answer. Therefore, it is possible to provide the image recognition device, the image recognition method, and the recording medium that can further save the labor in the image recognition system.

【０１５２】また、本発明によれば、第２前段知識作成
手段（第２前段知識作成ステップ）により、画像認識の
認識結果およびその付随情報を含むデータから、認識結
果として同じ第１候補のカテゴリを持つデータを集め、
該集めたデータから第１候補のカテゴリにおける適合度
（距離）から、該第１候補のカテゴリを”正解”と確定
し得る適合度（距離）の大きさを算定し、第２前段知識
として保持し、第２前段判定手段（第２前段判定ステッ
プ）では、入力画像の認識結果として第１候補のカテゴ
リにおける適合度（距離）の大きさと前記第２前段知識
に保持された適合度（距離）の大きさとの比較により、
該第１候補のカテゴリを”正解”または”不明”と判定
することとし、また、第３前段知識作成手段（第３前段
知識作成ステップ）により、画像認識の認識結果および
その付随情報を含むデータから、認識結果として同じ第
１候補のカテゴリを持つデータを集め、該集めたデータ
から第１候補のカテゴリと第２候補以下の特定候補順位
の類似カテゴリとの適合度（距離）差から、該第１候補
のカテゴリを”正解”と確定し得る適合度（距離）差の
大きさを算定して、第３前段知識として保持し、第３前
段判定手段（第３前段判定ステップ）では、入力画像の
認識結果として第１候補のカテゴリと第２候補以下の特
定候補順位の類似カテゴリが得られるとき、該第１候補
のカテゴリと第２候補以下の特定候補順位の類似カテゴ
リとの適合度（距離）差の大きさと前記第３前段知識に
保持された適合度（距離）差の大きさとの比較により、
該第１候補のカテゴリが”正解”か否かを判定すること
としたので、正解として確定できたものについては人が
チェックする必要がなくなり、正解と確定できなかった
ものについてのみチェックするだけでよくなることか
ら、受付け誤認識率を低下させることができ、画像認識
システムにおける省力化をさらに図り得る画像認識装
置、画像認識方法および記録媒体を提供することができ
る。Further, according to the present invention, by the second pre-stage knowledge creating means (second pre-stage knowledge creating step), from the data including the recognition result of the image recognition and its accompanying information, the category of the first candidate which is the same as the recognition result is obtained. Collect data with
Fit in the category of the first candidate from data collected the from (distance), and calculate the magnitude of the goodness of fit may be determined to "correct" the category of the first candidate (distance), held as second pre-knowledge Then, in the second pre-stage determination means (second pre-stage determination step), the magnitude of the goodness of fit (distance) in the category of the first candidate and the goodness of fit (distance) held in the second pre-stage knowledge as the recognition result of the input image. By comparison with the size of
And determining that the category of the first candidate "correct" or "unknown", also by the third front knowledge generating means (3 front knowledge generating step), the recognition result of the image recognition and data including the accompanying information From this, data having the same first candidate category as the recognition result is collected, and from the collected data, the category of the first candidate and the rank of the specific candidate below the second candidate
From the similarity (distance) with the similar category of
The degree of fitness (distance) difference that can determine the category of "correct answer" is calculated and held as the third pre-stage knowledge, and the third pre-stage determination means (third pre-stage determination step) recognizes the input image. As a result, the category of the first candidate and the features below the second candidate
When similar categories of constant candidate ranking is obtained, the size and the third front knowledge of fitness (distance) difference between similar categories of the first candidate category and the following specific candidate ranking second candidate
By comparison with the magnitude of the held goodness of fit (distance) difference,
Since it is determined whether or not the category of the first candidate is “correct answer”, it is not necessary for a person to check what has been confirmed as a correct answer, and only to check those that cannot be determined as a correct answer. As a result, it is possible to provide an image recognition device, an image recognition method, and a recording medium that can reduce the acceptance error recognition rate and can further reduce the labor in the image recognition system.

【０１５３】また、本発明によれば、予め、第１前段知
識作成手段（第１前段知識作成ステップ）により第１前
段知識（第１候補カテゴリに対する類似カテゴリの関
連）を、第２前段知識作成手段（第２前段知識作成ステ
ップ）により第２前段知識（特定順位の候補を”正解”
と確定し得る適合度（距離）の大きさ）を、第３前段知
識作成手段（第３前段知識作成ステップ）により第３前
段知識（第１候補を”正解”と確定し得る適合度（距
離）差の大きさ）を、それぞれ作成しておき、第１前段
判定手段（第１前段判定ステップ）、第２前段判定手段
（第２前段判定ステップ）および第３前段判定手段（第
３前段判定ステップ）では、これら第１、第２および第
３前段知識を用いて、入力画像の認識結果について、”
正解”か否かを判定することとしたので、受付け誤認識
率を低下させることができ、画像認識システムにおける
省力化をさらに図ることができ、また、帳票読み込みの
自動化を図り得る画像認識装置、画像認識方法および記
録媒体を提供することができる。Further, according to the present invention, the first pre-stage knowledge creation means (first pre-stage knowledge creation step) previously creates the first pre-stage knowledge (relationship of the similar category to the first candidate category) with the second pre-stage knowledge creation. By means (second pre-stage knowledge creation step), the second pre-stage knowledge (a candidate with a specific rank is “correct”)
The degree of goodness of fit (distance) that can be decided as follows is the goodness of fit (distance) that can be decided as third corrective knowledge (the first candidate is “correct answer”) by the third preliminary knowledge creating means (third preliminary knowledge creating step). ). The magnitude of the difference) is created respectively, and the first front stage determination means (first front stage determination step), the second front stage determination means (second front stage determination step), and the third front stage determination means (third front stage determination step) In the step), using the knowledge of the first, second and third pre-stages, the recognition result of the input image is
Since it is decided whether or not the answer is "correct", the false recognition rate of acceptance can be reduced, the labor saving in the image recognition system can be further promoted, and the image recognition apparatus capable of automating the reading of the form, An image recognition method and a recording medium can be provided.

【０１５４】また、本発明によれば、第１軸抽出手段
（第１軸抽出ステップ）により、画像認識の認識結果お
よびその付随情報を含むデータをサンプル収集し、ある
カテゴリで認識結果が”正解”とされているデータか
ら、前記データの集合をクラスタリングするときの軸を
抽出し、第１後段知識作成手段（第１後段知識作成ステ
ップ）により、第１軸抽出手段（第１軸抽出ステップ）
で抽出された軸に基づき前記データの集合のクラスタリ
ングを行い、該カテゴリのＯＫクラスタとして生成し、
生成されたＯＫクラスタの形状を第１後段知識として保
持し、第１後段判定手段（第１後段判定ステップ）で
は、入力画像の認識結果として認識結果およびその付随
情報を含むデータが得られるとき、該データが第１後段
知識のあるカテゴリに所属するＯＫクラスタの中に含ま
れるか否かをチェックして、該ＯＫクラスタに含まれる
場合には該ＯＫクラスタが所属するカテゴリを入力画像
の認識結果として”正解”と判定し、何れのＯＫクラス
タにも含まれていない場合には”未確定”と判定するこ
ととしたので、受付け誤認識率を低下させることがで
き、画像認識システムにおける省力化をさらに図り得る
画像認識装置、画像認識方法および記録媒体を提供する
ことができる。Further, according to the present invention, the first axis extracting means (first axis extracting step) samples and collects the data including the recognition result of the image recognition and its accompanying information, and the recognition result is "correct" in a certain category. The axis for clustering the set of data is extracted from the data described as ", and the first second-stage knowledge creating unit (first second-stage knowledge creating step) extracts the first axis-extracting unit (first axis-extracting step).
Clustering the set of data based on the axis extracted in step S6 to generate an OK cluster of the category,
When the shape of the generated OK cluster is held as the first post-stage knowledge and the first post-stage determination means (first post-stage determination step) obtains the data including the recognition result and the accompanying information as the recognition result of the input image, It is checked whether or not the data is included in the OK cluster belonging to the category having the first latter-stage knowledge. If the data is included in the OK cluster, the category to which the OK cluster belongs is recognized as the recognition result of the input image. As a result, it is determined as "correct answer", and when it is not included in any of the OK clusters, it is determined as "undetermined". Therefore, the false recognition rate of acceptance can be reduced, and labor saving in the image recognition system can be achieved. It is possible to provide an image recognition device, an image recognition method, and a recording medium that can further achieve the above.

【０１５５】また、本発明によれば、第２軸抽出手段
（第２軸抽出ステップ）により、画像認識の認識結果お
よびその付随情報を含むデータをサンプル収集し、ある
カテゴリで認識結果が”誤認”とされているデータか
ら、前記データの集合をクラスタリングするときの軸を
抽出し、第２後段知識作成手段（第２後段知識作成ステ
ップ）により、第２軸抽出手段（第２軸抽出ステップ）
で抽出された軸に基づき前記データの集合のクラスタリ
ングを行い、該カテゴリのＮＧクラスタとして生成し、
生成されたＮＧクラスタの形状を第２後段知識として保
持し、第２後段判定手段（第２後段判定ステップ）で
は、入力画像の認識結果として認識結果およびその付随
情報を含むデータが得られるとき、該データが第２後段
知識のあるカテゴリに所属するＮＧクラスタの中に含ま
れるか否かをチェックして、該ＮＧクラスタに含まれる
場合には該ＮＧクラスタが所属するカテゴリを前記入力
画像の認識結果として”不正解”と判定し、何れのＮＧ
クラスタにも含まれていない場合には”正解”と判定す
ることとしたので、受付け誤認識率を低下させることが
でき、画像認識システムにおける省力化をさらに図り得
る画像認識装置、画像認識方法および記録媒体を提供す
ることができる。Further, according to the present invention, the second axis extracting means (second axis extracting step) samples and collects the data including the recognition result of the image recognition and its accompanying information, and the recognition result in a certain category is "erroneous recognition". The axis for clustering the set of data is extracted from the data described as ", and the second second-stage knowledge creating means (second second-stage knowledge creating step) extracts the second axis-extracting means (second axis extracting step).
Clustering the set of data based on the axis extracted in step S6 to generate an NG cluster of the category,
When the shape of the generated NG cluster is held as second second-stage knowledge, and the second second-stage determination means (second second-stage determination step) obtains data including the recognition result and its accompanying information as the recognition result of the input image, It is checked whether or not the data is included in an NG cluster belonging to the category having the second latter knowledge, and if it is included in the NG cluster, the category to which the NG cluster belongs is recognized as the input image. As a result, it was judged as “incorrect” and which NG
If it is not included in the cluster, it is determined to be the “correct answer”, so the false recognition rate for acceptance can be reduced, and the image recognition device, the image recognition method, and the image recognition method that can further reduce the labor in the image recognition system. A recording medium can be provided.

【０１５６】また、本発明によれば、予め、第１後段知
識作成手段（第１後段知識作成ステップ）により第１後
段知識（ＯＫクラスタの形状）を、第２後段知識作成手
段（第２後段知識作成ステップ）により第２後段知識
（ＮＧクラスタの形状）を、それぞれ作成しておき、第
１後段判定手段（第１後段判定ステップ）および第２後
段判定手段（第２後段判定ステップ）では、これら第１
および第２後段知識を用いて、入力画像の認識結果につ
いて、あるカテゴリのＯＫクラスタに入ったデータを該
カテゴリに属する”正解”データと決定付け、逆にどの
ＯＫクラスタにも入らない場合は”不明”（どのカテゴ
リにも属さない）データとした後、さらに、ＮＧクラス
タの少なくとも１つに入った場合は”不正解”データと
決定され、３つに認識結果を分別して確定することとし
たので、受付け誤認識率を低下させることができ、画像
認識システムにおける省力化をさらに図ることができ、
また、帳票読み込みの自動化を図ることができ、さら
に、画像認識システムの認識結果を否定して”正解”の
認識結果に置き換えることができるので、認識性能を向
上させ得る画像認識装置、画像認識方法および記録媒体
を提供することができる。According to the present invention, the first second-stage knowledge (shape of OK cluster) is previously stored in the second second-stage knowledge creating unit (second second-stage knowledge creating unit) by the first second-stage knowledge creating unit (first second-stage knowledge creating step). The second post-stage knowledge (shape of the NG cluster) is created by the knowledge creating step), and the first post-stage determining means (first post-stage determining step) and the second post-stage determining means (second post-stage determining step) These first
Using the knowledge of the second post-stage, it is determined that the data obtained in the OK cluster of a certain category is the “correct answer” data belonging to the category in the recognition result of the input image. After the data is set to "unknown" (does not belong to any category), and if at least one of the NG clusters is entered, it is determined to be "incorrect" data, and the recognition results are classified into three and confirmed. Therefore, the false recognition rate of acceptance can be reduced, and labor saving in the image recognition system can be further achieved.
Further, it is possible to automate the reading of the form, and further, since the recognition result of the image recognition system can be negated and replaced with the “correct” recognition result, an image recognition device and an image recognition method that can improve the recognition performance. And a recording medium can be provided.

【０１５７】また、本発明によれば、予め、第１前段知
識作成手段（第１前段知識作成ステップ）により第１前
段知識（第１候補カテゴリに対する類似カテゴリの関
連）を、第２前段知識作成手段（第２前段知識作成ステ
ップ）により第２前段知識（第１候補のカテゴリを”正
解”と確定し得る適合度（距離）の大きさ）を、第３前
段知識作成手段（第３前段知識作成ステップ）により第
３前段知識（第１候補を”正解”と確定し得る適合度
（距離）差の大きさ）を、第１後段知識作成手段（第１
後段知識作成ステップ）により第１後段知識（ＯＫクラ
スタの形状）を、第２後段知識作成手段（第２後段知識
作成ステップ）により第２後段知識（ＮＧクラスタの形
状）を、それぞれ作成しておき、第１前段判定手段（第
１前段判定ステップ）、第２前段判定手段（第２前段判
定ステップ）および第３前段判定手段（第３前段判定ス
テップ）では、これら第１、第２および第３前段知識を
用いて、入力画像の認識結果について、”正解”また
は”不明”（どのカテゴリにも属さないデータ）の２つ
に認識結果を分別して確定し、また、ここで”不明”と
された入力画像の認識結果については、第１後段判定手
段（第１後段判定ステップ）および第２後段判定手段
（第２後段判定ステップ）により、第１および第２後段
知識を用いて、あるカテゴリのＯＫクラスタに入ったデ
ータを該カテゴリに属する”正解”データと決定付け、
逆にどのＯＫクラスタにも入らない場合は”不明”（ど
のカテゴリにも属さない）データとした後、さらに、Ｎ
Ｇクラスタの少なくとも１つに入った場合は”不正解”
データと決定され、３つに認識結果を分別して確定する
ようにして、再チェックを行うこととしたので、受付け
誤認識率を低下させることができ、画像認識システムに
おける省力化をさらに図ることができ、また、帳票読み
込みの自動化を図ることができ、さらに、画像認識シス
テムの認識結果を否定して”正解”の認識結果に置き換
えることができるので、認識性能を向上させ得る画像認
識装置、画像認識方法および記録媒体を提供することが
できる。Further, according to the present invention, the first pre-stage knowledge (relation of the similar category to the first candidate category) is previously created by the first pre-stage knowledge creating means (first pre-stage knowledge creating step) to the second pre-stage knowledge creating step. The second pre-stage knowledge ( the degree of goodness of fit (distance) that can determine the category of the first candidate as “correct answer”) by the means (second pre-stage knowledge creating step) is defined as the third pre-stage knowledge creating unit (third pre-stage knowledge). In the creation step), the third pre-stage knowledge (the magnitude of the difference in fitness (distance) that can determine the first candidate as the “correct answer”) is used as the first post-stage knowledge creating means (first
The first post-stage knowledge (OK cluster shape) is created by the post-stage knowledge creating step, and the second post-stage knowledge (NG cluster shape) is created by the second post-stage knowledge creating unit (second post-stage knowledge creating step). In the first front stage determination means (first front stage determination step), the second front stage determination means (second front stage determination step) and the third front stage determination means (third front stage determination step), these first, second and third Using the previous knowledge, the recognition result of the input image is classified into two, "correct answer" or "unknown" (data that does not belong to any category), and is confirmed. Regarding the recognition result of the input image obtained by the first rear-stage determination means (first rear-stage determination step) and the second rear-stage determination means (second rear-stage determination step), the first and second rear-stage knowledge is used to obtain a certain result. The entered data to the Gori of OK cluster belonging to the category "correct" data and decisions with,
On the contrary, if the data does not belong to any OK cluster, the data is set to "unknown" (does not belong to any category), and then N
"Incorrect answer" when entering at least one of G clusters
Since it is decided that the data is recognized and the recognition result is divided into three to be confirmed and the check is performed again, the false recognition rate of acceptance can be reduced and the labor saving in the image recognition system can be further achieved. In addition, it is possible to automate the reading of forms, and since the recognition result of the image recognition system can be negated and replaced with the “correct” recognition result, an image recognition device and an image that can improve recognition performance can be obtained. A recognition method and a recording medium can be provided.

【０１５８】また、本発明によれば、第１後段判定手段
（第１後段判定ステップ）において、前記データがある
カテゴリに所属するＯＫクラスタに含まれる場合に、ど
の程度の確信を持って”正解”といえるかを示す指標で
ある確信度を確信度算出手段（確信度算出ステップ）に
より計算するようにし、前記データが複数の別カテゴリ
に所属するＯＫクラスタに入った場合には、確信度に基
づいてどのカテゴリを”正解”とするかを決定すること
としたので、あるカテゴリのＯＫクラスタに入ったデー
タをそのカテゴリに属するデータと決定付け、逆にどの
ＯＫクラスタにも入らない場合は”不明”（どのカテゴ
リにも属さない）データとされるが、あるカテゴリに属
するデータと決定付けられた場合に、認識結果の確信度
合いを算出することにより、”不明”、”正解”、”正
解と確信できない”の３つに認識結果を分別して確定で
き、また、さらに、ＮＧクラスタの少なくとも１つに入
った場合は”不正解”データと決定され、”不明”、”
正解”、”不正解”、”正解と確信できない”の４つに
認識結果を分別して確定し得る画像認識装置、画像認識
方法および記録媒体を提供することができる。Further, according to the present invention, in the first second-stage determination means (first second-stage determination step), when the data is included in the OK cluster belonging to a certain category, with certainty of confidence, "correct answer" The certainty factor, which is an index indicating whether or not it can be said, is calculated by the certainty factor calculation means (certainty factor calculation step), and when the data enters an OK cluster belonging to a plurality of different categories, the certainty factor is calculated. Since it was decided based on which category the "correct answer" should be set, data that has entered an OK cluster of a certain category is determined as data belonging to that category, and conversely, if it does not belong to any OK cluster, Although it is considered as "unknown" data (which does not belong to any category), when it is determined that the data belongs to a certain category, the confidence level of the recognition result can be calculated. By this, the recognition result can be classified and determined into three, "Unknown", "Correct answer", and "I can't be sure of correct answer". Furthermore, if at least one of the NG clusters is entered, it is determined as "wrong answer" data. "Unknown", "
It is possible to provide an image recognition device, an image recognition method, and a recording medium capable of classifying and recognizing recognition results into four categories, “correct answer”, “incorrect answer”, and “not sure that the answer is correct”.

【０１５９】さらに、本発明によれば、第２後段判定手
段（第２後段判定ステップ）において、前記データがあ
るカテゴリに所属するＮＧクラスタに含まれる場合に
は、確信度更新手段（確信度更新ステップ）により、ど
の程度の確信を持って”正解”といえるかを示す指標で
ある確信度を低くすることとしたので、あるカテゴリの
ＯＫクラスタに入ったデータをそのカテゴリに属するデ
ータと決定付け、逆にどのＯＫクラスタにも入らない場
合は”不明”（どのカテゴリにも属さない）データとさ
れるが、あるカテゴリに属するデータと決定付けられた
場合に、認識結果の確信度合いを算出することによ
り、”不明”、”正解”、”正解と確信できない”の３
つに認識結果を分別して確定でき、さらに、ＮＧクラス
タの少なくとも１つに入った場合は”不正解”データと
決定されるか、算出された確信度を低くすることができ
る。Further, according to the present invention, in the second latter stage judging means (second latter stage judging step), when the data is included in the NG cluster belonging to a certain category, the confidence factor updating means (confidence factor updating) By determining the confidence level, which is an index indicating how much confidence can be said to be the “correct answer”, the data entered in the OK cluster of a certain category is determined as the data belonging to that category. On the contrary, if the data does not belong to any OK cluster, it is regarded as "unknown" data (which does not belong to any category), but if it is determined that the data belongs to a certain category, the confidence level of the recognition result is calculated. As a result, "unknown", "correct answer", and "I can't be sure of correct answer"
The recognition result can be separately classified and confirmed, and further, if at least one of the NG clusters is entered, it is determined as "incorrect answer" data or the calculated certainty factor can be lowered.

【０１６０】以上を総括して、本発明によれば、複数の
カテゴリを持ち、入力データに対する各カテゴリの一致
度（若しくはそれに類する情報）を算出する画像認識に
おいて、一致度が高いとされた結果が正しいものか、間
違ったものか決定できない認識結果を、「認識結果が正
解と確信できるため受け付ける」，「認識結果が正解と
確信できないためリジェクトする」，「認識結果が未確
定のためリジェクトする」，並びに「認識結果が不正解
のためリジェクトする」という数種類の結果に確定し、
受付け正解率を向上させ、かつ誤認識率を０［％］に近
づけた画像認識装置、画像認識方法および記録媒体を提
供することができる。Summarizing the above, according to the present invention, as a result of the image matching having a plurality of categories, the degree of matching is high in the image recognition for calculating the degree of matching (or similar information) of each category with respect to the input data. Is not correct or incorrect, accepts the recognition result because it is confident that the recognition result is correct, rejects because the recognition result is not certain that it is correct, and rejects because the recognition result is inconclusive. , "And" Rejected because the recognition result is incorrect "
It is possible to provide an image recognition device, an image recognition method, and a recording medium that improve the acceptance correct rate and make the false recognition rate close to 0 [%].

[Brief description of drawings]

【図１】本発明の一実施形態に係る画像認識装置の構成
図である。FIG. 1 is a configuration diagram of an image recognition apparatus according to an embodiment of the present invention.

【図２】実施形態の画像認識装置の全体的な処理の概略
を説明する説明図である。FIG. 2 is an explanatory diagram illustrating an overview of overall processing performed by the image recognition apparatus according to the embodiment.

【図３】実施形態の画像認識装置における知識作成処理
の概略フローチャートである。FIG. 3 is a schematic flowchart of knowledge creation processing in the image recognition apparatus of the embodiment.

【図４】実施形態の画像認識装置におけるシステム運用
時の処理の概略フローチャート（その１）である。FIG. 4 is a schematic flowchart (No. 1) of processing during system operation in the image recognition apparatus of the embodiment.

【図５】実施形態の画像認識装置におけるシステム運用
時の処理の概略フローチャート（その２）である。FIG. 5 is a schematic flowchart (part 2) of processing during system operation in the image recognition apparatus of the embodiment.

【図６】ＯＣＲ認識処理部から出力される認識結果を例
示する説明図である。FIG. 6 is an explanatory diagram illustrating a recognition result output from an OCR recognition processing unit.

【図７】第１、第２および第３前段知識のパラメータを
例示する説明図である。FIG. 7 is an explanatory diagram illustrating parameters of first, second, and third pre-stage knowledge.

【図８】前段判定部による判定処理を説明するフローチ
ャートである。FIG. 8 is a flowchart illustrating a determination process performed by a former determination unit.

【図９】クラスタ生成を例示する説明図である。FIG. 9 is an explanatory diagram illustrating cluster generation.

【図１０】超楕円体法クラスタリングの処理の概要を説
明するフローチャートである。FIG. 10 is a flowchart illustrating an outline of a hyperelliptic method clustering process.

【図１１】後段知識としてファイル出力される場合のデ
ータフォーマットを例示する説明図である。FIG. 11 is an explanatory diagram exemplifying a data format when a file is output as second-stage knowledge.

【図１２】楕円体と固有値および固有ベクトルの関係を
説明する説明図である。FIG. 12 is an explanatory diagram illustrating a relationship between an ellipsoid and eigenvalues and eigenvectors.

【図１３】後段判定部における処理の流れを説明するフ
ローチャートである。FIG. 13 is a flowchart illustrating a processing flow in a latter-stage determination unit.

【図１４】中心からの距離の確信度算出で用いるメンバ
シップ関数を例示する説明図である。FIG. 14 is an explanatory diagram illustrating a membership function used in the calculation of the certainty factor of the distance from the center.

【図１５】適合度と楕円の関係を示す説明図である。FIG. 15 is an explanatory diagram showing a relationship between a goodness of fit and an ellipse.

【図１６】楕円と確信度関数の関係を示す説明図であ
る。FIG. 16 is an explanatory diagram showing a relationship between an ellipse and a certainty factor function.

【図１７】あるカテゴリのＯＫクラスタとＮＧクラスタ
の位置関係を例示する説明図である。FIG. 17 is an explanatory diagram exemplifying a positional relationship between an OK cluster and an NG cluster of a certain category.

【図１８】ルール判定部による処理のフローチャートで
ある。FIG. 18 is a flowchart of processing by a rule determination unit.

[Explanation of symbols]

１０１ＯＣＲ認識処理部１０２認識結果確定部１０３知識生成部１１１前段知識の作成部（第１前段知識作成手段、第２
前段知識作成手段および第３前段知識作成手段）１１２前段知識ベース（第１前段知識、第２前段知識お
よび第３前段知識）１１３後段知識の作成部（第１後段知識作成手段および
第２後段知識作成手段）１１４後段知識ベース（第１後段知識および第２後段知
識）１１５前段判定部（第１前段判定手段、第２前段判定手
段および第３前段判定手段）１１６後段判定部（第１後段判定手段および第２後段判
定手段）１１７ルール判定部１１８認識確定部１１９リジェクト部１５１ａ，１５１ｂ認識結果１５２，１５３知識１５４不明データ１５５，１５６，１５８確定データ１５７未確定データ１５９正解と確信できないデータ101 OCR recognition processing unit 102 recognition result confirmation unit 103 knowledge generation unit 111 first-stage knowledge creation unit (first first-stage knowledge creation unit, second
1st-stage knowledge creation means and 3rd-stage knowledge creation means) 112 1st-stage knowledge base (1st-stage knowledge, 2nd-stage knowledge and 3rd-stage knowledge) 113 2nd-stage knowledge creation unit (1st 2nd-stage knowledge creation means and 2nd 2nd-stage knowledge) Creating means) 114 Second-stage knowledge base (first second-stage knowledge and second second-stage knowledge) 115 First-stage determination section (first front-stage determination means, second front-stage determination means and third front-stage determination means) 116 Second-stage determination section (first second-stage determination) Means and second latter-stage determination means) 117 rule determination section 118 recognition confirmation section 119 rejection section 151a, 151b recognition results 152,153 knowledge 154 unknown data 155, 156, 158 confirmed data 157 unconfirmed data 159 data that cannot be confirmed to be the correct answer

フロントページの続き (56)参考文献特開平４−88490（ＪＰ，Ａ) 特開平９−128487（ＪＰ，Ａ) 特開平１−277983（ＪＰ，Ａ) 特開平８−329194（ＪＰ，Ａ) 特開平11−15969（ＪＰ，Ａ) 特開昭63−8884（ＪＰ，Ａ) (58)調査した分野(Int.Cl.⁷，ＤＢ名) G06K 9/00 - 9/82 Continuation of the front page (56) Reference JP-A-4-88490 (JP, A) JP-A-9-128487 (JP, A) JP-A-1-277983 (JP, A) JP-A-8-329194 (JP , A) JP-A-11-15969 (JP, A) JP-A-63-8884 (JP, A) (58) Fields investigated (Int.Cl. ⁷ , DB name) G06K 9 / 00-9 / 82

Claims

(57) [Claims]

1. An image recognition apparatus which has a plurality of categories of recognition targets and recognizes an input image based on a matching degree (distance) in pattern matching between a registered image pattern and an input image pattern, and a recognition result of the image recognition and its recognition result. Data having the same first candidate category as the recognition result is collected from the data including the accompanying information, and those having a high appearance rate of similar categories in the specific candidate ranks of the second candidate and below are extracted from the collected data, First pre-stage knowledge creating means for holding as first pre-stage knowledge showing the relationship of similar categories to the first candidate category, and the first candidate category and the first candidate category as the recognition result of the input image .
When a similar category with a specific candidate rank of 2 candidates or less is obtained, the category of the first candidate and the similar category with a specific candidate rank of 2 or less candidates and the association held in the first pre-stage knowledge are obtained. By comparison, the category of the first candidate is “correct”
An image recognition apparatus comprising: a first front-stage determination means for determining whether or not

2. A recognition result of image recognition and its recognition result in an image recognition device having recognition objects of a plurality of categories and recognizing an input image based on a matching degree (distance) in pattern matching between a registered image pattern and an input image pattern. Data having the same first candidate category as the recognition result is collected from the data including the accompanying information, and the first candidate is calculated from the matching degree (distance) in the first candidate category from the collected data . The second pre-stage knowledge creating means for calculating the magnitude of the goodness of fit (distance) that can determine the category of "correct answer" as the second pre-stage knowledge, and the first candidate category as the recognition result of the input image are obtained. The degree of matching (distance) in the category of the first candidate when
The size and compared with the size of the retained fit the second pre-knowledge (distance), the category of said first candidate
An image recognition apparatus comprising: a second front-stage determining means for determining whether or not is a "correct answer".

3. A recognition result of image recognition and its recognition result in an image recognition device having recognition objects of a plurality of categories and recognizing an input image based on a matching degree (distance) in pattern matching between a registered image pattern and an input image pattern. Data having the same first candidate category as the recognition result is collected from the data including the accompanying information, and the first candidate category and the first candidate category are collected from the collected data .
From the goodness- of- fit (distance) difference with the similar category of the specific candidate rank of two candidates or less, the magnitude of the goodness-of-fit (distance) difference that can determine the category of the first candidate as “correct answer” is calculated, and the third front stage The third pre-stage knowledge creating means that holds the knowledge, and the first candidate category and the first candidate as the recognition result of the input image .
When similar categories 2 candidates following specific candidate ranking is obtained, the first candidate of the category and the second candidate following specific candidate sequence
Whether or not the category of the first candidate is the “correct answer” by comparing the magnitude of the difference in fitness (distance) with the similar category of rank and the magnitude of the difference in fitness (distance) held in the third preceding knowledge.
An image recognition apparatus, comprising: a third front-stage determination means for determining.

4. An image recognition apparatus which has a plurality of categories of recognition targets and recognizes an input image based on a matching degree (distance) in pattern matching between a registered image pattern and an input image pattern and a recognition result of the image recognition and its recognition result. Data that includes incidental information is sampled, and from the data whose recognition result is "correct" in a certain category, the axis when clustering the above-mentioned data set is the axis with the category with a high appearance rate as the top candidate. , The lower candidate is a category with a high appearance rate, the upper candidate is a category with a high appearance rate and the lower candidate is a category with a high appearance rate, and the middle candidate is a category with a high appearance rate , The first axis extractor that extracts categories with high correlation coefficient as the axis, or categories with low correlation coefficient as the axis And clustering the set of data on the basis of the axes extracted by the first axis extracting means to generate an OK cluster of the category, and retaining the shape of the generated OK cluster as the first second-stage knowledge. When data including the recognition result and its accompanying information as the recognition result of the input image is obtained, whether the data is included in the OK cluster belonging to the category having the first latter knowledge If it is included in the OK cluster, the category to which the OK cluster belongs is determined as “correct answer” as the recognition result of the input image, and if it is not included in any OK cluster, An image recognition apparatus comprising: a first rear-stage determination unit that determines "undetermined".

5. The first second-stage determination means sets a certainty factor, which is an index indicating how much confidence can be said to be a “correct answer” when the data is included in an OK cluster belonging to a certain category. A certainty factor calculating means for calculating is provided, and when the data enters an OK cluster belonging to a plurality of different categories, which category is determined as the “correct answer” is determined based on the certainty factor. The image recognition device according to claim 4.

6. An image recognition apparatus which has a plurality of categories of recognition targets and recognizes an input image based on a matching degree (distance) in pattern matching between a registered image pattern and an input image pattern, and a recognition result of the image recognition and its recognition result. Data that includes incidental information is sampled, and from the data whose recognition result is "misidentified" in a certain category, the axis when clustering the above-mentioned data sets the category with a high appearance rate as the top candidate as the axis. , The lower candidate is a category with a high appearance rate, the upper candidate is a category with a high appearance rate and the lower candidate is a category with a high appearance rate, and the middle candidate is a category with a high appearance rate , A second axis extractor for extracting categories with high correlation coefficient as the axis, or categories with low correlation coefficient as the axis And clustering the set of data based on the axes extracted by the second axis extracting means to generate NG clusters of the category, and holding the shape of the generated NG cluster as second post-stage knowledge. When the data including the recognition result and its accompanying information as the recognition result of the input image is obtained by the latter-stage knowledge creating means, whether or not the data is included in the NG cluster belonging to the category having the second latter-stage knowledge. If any of the NG clusters is included in the NG cluster, the category to which the NG cluster belongs is determined to be “incorrect” as the recognition result of the input image.
An image recognition apparatus, comprising: a second latter-stage determination unit that determines “correct answer” if it is not included in the cluster.

7. An image recognition method for recognizing an input image based on a matching degree (distance) in pattern matching between a registered image pattern and an input image pattern, the recognition result of the image recognition and its recognition result. Data having the same first candidate category as the recognition result is collected from the data including the accompanying information, and those having a high appearance rate of similar categories in the specific candidate ranks of the second candidate and below are extracted from the collected data, a first front knowledge creation step of holding the first pre-knowledge to related similar categories for the first candidate category, the category of the first candidate as a recognition result of the input image and the
When similar categories 2 candidates following specific rank candidate is obtained by comparison with related held in the first pre-knowledge similar category of the first candidate category and a specific candidate ranking, the first candidate category A first pre-stage determination step of determining whether or not is a “correct answer”, and the image recognition method.

8. An image recognition method for recognizing an input image based on a matching degree (distance) in pattern matching between a registered image pattern and an input image pattern, the recognition result of the image recognition, and the recognition result thereof. from data including accompanying information, from the recognition result as collected data with the categories of the same first candidate, Contact Keru fit from the collected data to the category of the first candidate (distance), the first candidate The second pre-stage knowledge creation step of calculating the degree of goodness of fit (distance) that can determine the category of "correct answer" as the second pre-stage knowledge, and the first candidate category as the recognition result of the input image are obtained. when it is, it fits in the category of the first candidate (distance)
The size and compared with the size of the retained fit the second pre-knowledge (distance), the category of said first candidate
And a second pre-stage determination step of determining whether or not is a "correct answer".

9. An image recognition method for recognizing an input image based on a matching degree (distance) in pattern matching between a registered image pattern and an input image pattern, the recognition result of the image recognition and its recognition result. Data having the same first candidate category as the recognition result is collected from the data including the accompanying information, and the first candidate category and the first candidate category are collected from the collected data .
From the goodness- of- fit (distance) difference with the similar category of the specific candidate rank of two candidates or less, the magnitude of the goodness-of-fit (distance) difference that can determine the category of the first candidate as “correct answer” is calculated, and the third front stage The third pre-stage knowledge creation step that holds as knowledge, and the first candidate category and the first candidate as the recognition result of the input image .
When similar categories 2 candidates following specific candidate ranking is obtained, the first candidate of the category and the second candidate following specific candidate sequence
The degree of conformance (distance) difference with the rank category and the third
A third pre-stage determination step of determining whether or not the category of the first candidate is “correct” by comparing with the magnitude of the goodness-of-fit (distance) difference held in the pre-stage knowledge. An image recognition method characterized by:

10. An image recognition method for recognizing an input image based on a matching degree (distance) in pattern matching between a registered image pattern and an input image pattern, the recognition result of the image recognition, and the recognition result thereof. Data having the same first candidate category as the recognition result is collected from the data including the accompanying information, and those having a high appearance rate of similar categories in the specific candidate ranks of the second candidate and below are extracted from the collected data, From the first pre-stage knowledge creating step for holding the first pre-stage knowledge indicating the relation of the similar category to the first candidate category, and the data including the recognition result of image recognition and its accompanying information, the same first candidate category is selected as the recognition result. having collected data, Keru fit from the collected data to the category of the first candidate from (distance) of the first candidate Categories were calculated size of the goodness of fit may be determined as "correct" (distance) from the data including a second pre knowledge generating step of holding the second pre-knowledge, the recognition result of the image recognition and the accompanying information, Data having the same first candidate category as the recognition result is collected, and the first candidate category and the first candidate category are collected from the collected data .
From the goodness- of- fit (distance) difference with the similar category of the specific candidate rank of two candidates or less, the magnitude of the goodness-of-fit (distance) difference that can determine the category of the first candidate as “correct answer” is calculated, and the third front stage The third pre-stage knowledge creation step that holds as knowledge, and the first candidate category and the first candidate as the recognition result of the input image .
When similar categories 2 candidates following specific rank candidate is obtained by comparison with related held in the first pre-knowledge similar category of the first candidate category and a specific candidate ranking, the first candidate category a first pre-determination step determines whether but "correct" or for determining input image to "correct" in the first pre-determination step, the size of the fit in the category of the first candidate (distance) by comparison with the size of the retained fit the second pre-knowledge (distance), catheterization of the first candidate
Gori and a second pre-determination step of determining whether "correct", the determination input image to "correct" in the second pre-determination step, of the first candidate category and the following second candidate
By comparing the magnitude of the difference in fitness (distance) with the category of the specific candidate rank with the magnitude of the difference in fitness (distance) held in the third preceding knowledge, the category of the first candidate is “correct”.
And a third pre-stage determination step for determining whether or not the image recognition method.

11. An image recognition method for recognizing an input image based on a matching degree (distance) in pattern matching between a registered image pattern and an input image pattern, the recognition result of the image recognition and its recognition result. Data that includes incidental information is sampled, and from the data whose recognition result is "correct" in a certain category, the axis when clustering the above-mentioned data set is the axis with the category with a high appearance rate as the top candidate. , The lower candidate is a category with a high appearance rate, the upper candidate is a category with a high appearance rate and the lower candidate is a category with a high appearance rate, and the middle candidate is a category with a high appearance rate , 1st axis extraction to extract categories with high correlation coefficient as the axis, or categories with low correlation coefficient as the axis And step performs clustering of the set of data based on the axis extracted by said first shaft extraction step, OK of the categories
When a data including a recognition result and its accompanying information is obtained as a recognition result of the input image, a first second-stage knowledge creating step of generating the cluster as an OK cluster and retaining the shape of the generated OK cluster as the first second-stage knowledge, Is included in the OK cluster belonging to the category having the first latter knowledge, and if it is included in the OK cluster, the category to which the OK cluster belongs is recognized as the recognition result of the input image. And a first post-stage determination step for determining “correct answer” and determining “undetermined” if the cluster is not included in any OK cluster.

12. An image recognition method for recognizing an input image based on a matching degree (distance) in pattern matching between a registered image pattern and an input image pattern, the recognition result of the image recognition, and the recognition result thereof. Data that includes incidental information is sampled, and from the data whose recognition result is "misidentified" in a certain category, the axis when clustering the above-mentioned data sets the category with a high appearance rate as the top candidate as the axis. , The lower candidate is a category with a high appearance rate, the upper candidate is a category with a high appearance rate and the lower candidate is a category with a high appearance rate, and the middle candidate is a category with a high appearance rate , 2nd axis extraction to extract categories with high correlation coefficient as the axis, or categories with low correlation coefficient as the axis Step and, based on the axis extracted by said second shaft extraction step performs clustering of said set of data, NG of the categories
When a data including a recognition result and its accompanying information is obtained as a recognition result of the input image, the data is generated as a cluster and holds the shape of the generated NG cluster as the second knowledge. Is included in the NG cluster belonging to the category having the second latter knowledge, and if included in the NG cluster, the category to which the NG cluster belongs is recognized as the recognition result of the input image. Is judged as “incorrect” and which NG
An image recognition method comprising: a second post-stage determination step of determining “correct answer” if the cluster is not included in the cluster.

13. The first post-stage determination step is performed to determine a certainty factor, which is an index indicating how much confidence can be said to be a “correct answer” when the data is included in an OK cluster belonging to a certain category. A certainty factor calculation step for calculating, wherein when the data enters an OK cluster belonging to a plurality of different categories, which category is determined as the “correct answer” is determined based on the certainty factor. The image recognition method according to claim 11.

14. A program for causing a computer to execute an image recognition method having recognition targets of a plurality of categories and recognizing an input image based on a matching degree (distance) in pattern matching between a registered image pattern and an input image pattern. In the recording medium in which is recorded, data having the same first candidate category as the recognition result is collected from the data including the recognition result of the image recognition and its accompanying information, and the specific candidate ranks of the second candidate and below are collected from the collected data . In the first pre-stage knowledge creation step of extracting the one having a high appearance rate of the similar category in 1) and retaining it as the first pre-stage knowledge showing the relation of the similar category to the first candidate category, and the first candidate as the recognition result of the input image . Category and number
When a similar category with a specific candidate rank of 2 candidates or less is obtained, the category of the first candidate and the similar category with a specific candidate rank of 2 or less candidates and the association held in the first pre-stage knowledge are obtained. By comparison, the category of the first candidate is “correct”
A first pre-stage determination step of determining whether or not the recording medium is provided.

15. A program for causing a computer to execute an image recognition method having recognition objects of a plurality of categories and recognizing an input image based on a matching degree (distance) in pattern matching between a registered image pattern and an input image pattern. In the recording medium recorded as, the data having the same first candidate category as the recognition result is collected from the data including the recognition result of the image recognition and its accompanying information, and the collected data is classified into the first candidate category. /> from kicking fit (distance), and the category of the first candidate to calculate the size of the goodness of fit may be determined as "correct" (distance), second pre-knowledge generating step of holding the second pre-knowledge , when the category of the first candidate is obtained as a recognition result of the input image, the size and the second pre knowledge of the first candidate category definitive fitness (distance) Lifting been fit (distance)
Of the comparison between the size, the recording medium that comprises a said second pre determination step of determining whether the first candidate category is "correct" or the features a.

16. A program for causing a computer to execute an image recognition method having recognition objects of a plurality of categories and recognizing an input image based on a matching degree (distance) in pattern matching between a registered image pattern and an input image pattern. In the recording medium recorded as, the data having the same first candidate category as the recognition result is collected from the data including the recognition result of the image recognition and its accompanying information, and the first candidate category and the first candidate category are collected from the collected data .
From the goodness- of- fit (distance) difference with the similar category of the specific candidate rank of two candidates or less, the magnitude of the goodness-of-fit (distance) difference that can determine the category of the first candidate as “correct answer” is calculated, and the third front stage The third pre-stage knowledge creation step that holds as knowledge, and the first candidate category and the first candidate as the recognition result of the input image .
When similar categories 2 candidates following specific candidate ranking is obtained, the first candidate of the category and the second candidate following specific candidate sequence
By comparison with the size of the goodness of fit (distance) difference in magnitude and said third front knowledge of position of fitness (distance) difference, said first
And a third pre-stage determination step of determining whether or not the candidate category is “correct”.

17. A program for causing a computer to execute an image recognition method having recognition targets of a plurality of categories and recognizing an input image based on a matching degree (distance) in pattern matching between a registered image pattern and an input image pattern. In the recording medium recorded as, the data having the same first candidate category as the recognition result is collected from the data including the recognition result of the image recognition and its accompanying information, and the specific candidate ranks of the second candidate and below are collected from the collected data . The first pre-stage knowledge creation step of extracting a high-similarity appearance rate of similar categories in 1) and holding it as first pre-stage knowledge showing the relationship of the similar categories to the first candidate category, the recognition result of image recognition and its accompanying information. From the included data, data having the same first candidate category as the recognition result is collected, and the collected data is collected. Et Contact Keru fit into the category of the first candidate from (distance), and calculate the magnitude of the goodness of fit may be determined to "correct" the category of the first candidate (distance), second pre-knowledge a second pre-knowledge generating step of holding the recognition result of the image recognition and the data including the accompanying information, collects data having the category of the same first candidate as a recognition result, the first candidate from the collected data category And the
From the goodness- of- fit (distance) difference with the similar category of the specific candidate rank of two candidates or less, the magnitude of the goodness-of-fit (distance) difference that can determine the category of the first candidate as “correct answer” is calculated, and the third front stage The third pre-stage knowledge creation step that holds as knowledge, and the first candidate category and the first candidate as the recognition result of the input image .
When a similar category with a specific candidate rank of 2 candidates or less is obtained, the category of the first candidate and the similar category with a specific candidate rank of 2 or less candidates and the association held in the first pre-stage knowledge are obtained. By comparison, the category of the first candidate is “correct”
A first pre-stage determination step for determining whether or not the first pre-stage determination step is performed, and for the input image determined as “correct answer” in the first pre-stage determination step, the degree of goodness of fit (distance) in the category of the first candidate and the second pre-stage by comparison with the size of the retained fitness knowledge (distance), catheterization of the first candidate
Gori and a second pre-determination step of determining whether "correct", the determination input image to "correct" in the second pre-determination step, of the first candidate category and the following second candidate
Whether the category of the first candidate is the “correct answer” by comparing the magnitude of the difference in fitness (distance) with the similar category of the specific candidate rank and the magnitude of the difference in fitness (distance) held in the third previous knowledge. A third pre-stage determination step of determining whether or not the recording medium is provided.

18. A program for causing a computer to execute an image recognition method having recognition objects of a plurality of categories and recognizing an input image based on a matching degree (distance) in pattern matching between a registered image pattern and an input image pattern. In the recording medium recorded as, a sample of data including the recognition result of image recognition and its accompanying information is sampled, and the axis when clustering the set of the data from the data whose recognition result is "correct" in a certain category , The category with high appearance rate in the upper candidates is the axis, the category with high appearance rate in the lower candidates is the axis, and the category with high appearance rate in the upper candidates and the category with high appearance rate in the lower candidates is the axis , Categories with a high appearance rate in the middle candidates as the axis, categories with high correlation coefficients as the axis, There performs the shaft lower category between correlation coefficients, a first shaft extracting, clustering of said set of data based on the axis extracted by said first shaft extraction step, OK of the categories
When a data including a recognition result and its accompanying information is obtained as a recognition result of the input image, a first second-stage knowledge creating step of generating the cluster as an OK cluster and retaining the shape of the generated OK cluster as the first second-stage knowledge, Is included in the OK cluster belonging to the category having the first latter knowledge, and if it is included in the OK cluster, the category to which the OK cluster belongs is recognized as the recognition result of the input image. And a first post-stage determination step of determining “correct answer” as “correct answer” and determining as “undetermined” when not included in any OK cluster.

19. A program for causing a computer to execute an image recognition method having recognition objects of a plurality of categories and recognizing an input image based on a matching degree (distance) in pattern matching between a registered image pattern and an input image pattern. In the recording medium recorded as, a sample of data including the recognition result of image recognition and its accompanying information is collected, and the axis when clustering the set of the data from the data whose recognition result is "misidentified" in a certain category , The category with high appearance rate in the upper candidates is the axis, the category with high appearance rate in the lower candidates is the axis, and the category with high appearance rate in the upper candidates and the category with high appearance rate in the lower candidates is the axis , Categories with a high appearance rate in the middle candidates as the axis, categories with high correlation coefficients as the axis, There is, as a shaft of the lower category between correlation coefficients, and the second shaft extracting performs clustering of the set of data based on the axis extracted by said second shaft extraction step, NG of the categories
When a data including a recognition result and its accompanying information is obtained as a recognition result of the input image, the data is generated as a cluster and holds the shape of the generated NG cluster as the second knowledge. Is included in the NG cluster belonging to the category having the second latter knowledge, and if included in the NG cluster, the category to which the NG cluster belongs is recognized as the recognition result of the input image. Is judged as “incorrect” and which NG
A recording medium, comprising: a second latter-stage determination step of determining a “correct answer” if it is not included in the cluster.

20. The first latter-stage determination step includes a certainty factor, which is an index indicating how much certainty can be said to be a “correct answer” when the data is included in an OK cluster belonging to a certain category. A certainty factor calculation step for calculating, wherein when the data enters an OK cluster belonging to a plurality of different categories, which category is determined as the “correct answer” is determined based on the certainty factor. The recording medium according to claim 18.