JPH01211090A

JPH01211090A - Recognizing system for character in information picture

Info

Publication number: JPH01211090A
Application number: JP63034970A
Authority: JP
Inventors: Atsushi Otani; 淳大谷; Shigeru Akamatsu; 茂赤松
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 1988-02-19
Filing date: 1988-02-19
Publication date: 1989-08-24
Anticipated expiration: 2014-11-15
Also published as: JP2979089B2

Abstract

PURPOSE:To eliminate the ambiguity of character recognizing processing and to consistently and uniquely extract a character pattern from a scenic picture by updating the degree of similarity by the character recognizing processing and selecting the character pattern instead of adopting the character pattern obtained from a decision by the character recognizing processing as it is. CONSTITUTION:A degree of similarity updating circuit 30 is composed of a directly related pattern calculating circuit 100, a directly related pattern storing memory 101, a degree of similarity calculating circuit 102, a memory 103 for category parameter, a memory 104 for updating degree of similarity, an indirectly related pattern calculating circuit 105, a memory 106 for storing indirectly related pattern, and a character pattern deciding circuit 107. Here, the character pattern is extracted not by using the value of the degree of similarity from the character recognizing processing as it is but by comparing basic parameters between mutually related patterns and updating the degree of similarity, and the uniquely determined processed result of the character pattern is outputted to a memory 31 for recognized result. Thus, the ambiguity of the character recognizing processing can be eliminated, and the character pattern in the scenic picture can be determined consistently and uniquely.

Description

【発明の詳細な説明】〔産業上の利用分野］この発明は、文字認識装置（以下、ＯＣＲ：０ｐｔｉｃ
ａｌ’　Ｃｈａｒａｃｔｅｒ　Ｒｅａｄｅｒ）に関する
もので、従来紙上に存在する文字だけを読取り対象とし
ていたものを、標識や看板等の実空間中に存在する文字
までを読取り対象に拡張する方法に関するものである。[Detailed Description of the Invention] [Industrial Application Field] The present invention relates to a character recognition device (hereinafter referred to as OCR: 0ptic
The present invention relates to a method for expanding the conventional method of reading only characters existing on paper to include characters existing in real space such as signs and signs.

特に、実空間中に存在する文字の文字サイズ、文字線の
濃度、書式、照明条件等の事前知識が与えられなくても
文字を認識することができる方式に関するものである。In particular, it relates to a system that can recognize characters without prior knowledge of the character size, density of character lines, format, illumination conditions, etc. of characters existing in real space.

（従来の技術）こめ発明に最も近い従来の技術としては、特開昭６２−
１７３５７８号公報「情景画像中の文字認識方式」があ
る。その原理を第９図に示す。なお、（１）〜　（４）
は各ステップを示す。(Prior art) The prior art closest to the Kome invention is JP-A-62-
No. 173578 ``Character recognition method in scene images'' is available. The principle is shown in FIG. In addition, (1) to (4)
indicates each step.

同図で、ま−ず、文字が存在する空間をＴＶカメラ等に
より情景画像としてとらえる　（１）０次に、情景画像
の領域分割処理と各領域の濃度差評価により文字線候補
領域（以下、領域という）を検出する　（２）。これら
の領域εを単独に切出すだけでは不十分である。すなわ
ち、文字には複数の領域から構成される分離文字（「旧
Ｊ、ｒｉ」等）が存在する。このため、互いに近傍に存
在する領域の組合せを考え、文字パターン候補として抽
出する　（３）。さらに、これらの文字パターン候補が
文字に対応するか否かは文字認識処理を用いて判定する
　（４）。文字と判定したならば最も類似度の大ぎいカ
テゴリを認識結果として出力する。In the same figure, first, the space in which characters exist is captured as a scene image using a TV camera, etc. (1) Next, character line candidate regions (hereinafter referred to as (2). It is not sufficient to cut out these regions ε alone. That is, there are separated characters (such as "old J, ri") that are composed of multiple areas. Therefore, we consider combinations of areas that are close to each other and extract them as character pattern candidates (3). Furthermore, whether or not these character pattern candidates correspond to characters is determined using character recognition processing (4). If it is determined to be a character, the category with the highest degree of similarity is output as the recognition result.

（発明が解決しようとする課題）このように、従来は文字パターン候補から文字パターン
を抽出するために文字認識処理を使用しているので、真
の文字パターンに対応していない文字パターン候補が偶
然いずれかのカテゴリとの類似度が高くなり、文字パタ
ーンとして抽出される誤りがしばしば生じる。これは現
状の文字認識処理の開発方針が、なるべく入カバターン
のりジェツトを防ぎ、極力いずれかのカテゴリーを文字
認識結果として出力するように構成されていることにも
起因している。(Problem to be Solved by the Invention) In this way, conventionally, character recognition processing is used to extract character patterns from character pattern candidates, so character pattern candidates that do not correspond to the true character pattern may occur by chance. The degree of similarity to one of the categories becomes high, and errors often occur where the character pattern is extracted as a character pattern. This is also due to the fact that the current development policy for character recognition processing is to prevent input pattern jets as much as possible, and to output one of the categories as a character recognition result as much as possible.

文字パターン候補は前述のように領域の組合せから求ま
る関係上、前述のような文字認識処理のあいまい性によ
る文字に対応しない文字パターン候補の誤抽出により情
景画像から抽出された文字パターンに関して本来成立す
べき以下の２つの原則に矛盾する文字パターンが抽出さ
れるという問題点があった。Since character pattern candidates are determined from combinations of regions as described above, it is possible that character pattern candidates that do not correspond to characters are incorrectly extracted due to the ambiguity of character recognition processing as described above, and that this originally holds true for character patterns extracted from scene images. There was a problem in that a character pattern that contradicted the following two principles was extracted.

（１）　　領域は１つの文字パターンに対応するか、全
く対応しないかのいずれかである。(1) A region corresponds to either one character pattern or none at all.

（２）　　ある領域がいずれかの文字パターンに対応す
る場合は、その領域の穴またはその領域を囲む領域はい
ずれの文字パターンにも対応しない。(2) If a certain area corresponds to any character pattern, the hole in that area or the area surrounding the area does not correspond to any character pattern.

これら２つのいずれかに矛盾するパターンが存在すると
、情景画像中の文字の一意的な認識が行えないという欠
点が存在する。If a pattern contradicting either of these two exists, there is a drawback that characters in the scene image cannot be uniquely recognized.

この発明の目的は、文字認識処理のあいまい性と文字パ
ターン候補の抽出法に起因した文字パターン間の矛盾を
除去し、情景画像から一意的に文字パターンを抽出し、
その認識結果を得る方法を提供することにある。The purpose of this invention is to remove inconsistencies between character patterns caused by ambiguity in character recognition processing and the extraction method of character pattern candidates, and to uniquely extract character patterns from scene images.
The purpose of this invention is to provide a method for obtaining the recognition result.

[Means to solve the problem]

この発明にかかる情景画像中の文字認識方式は、文字認
識処理により抽出された高類似度パターンの持つカテゴ
リの確からしさを比較することにより類似度の更新を行
い、互いに関連する各パターン群において最終的に最大
の類似度を有するパターンを文字パターンとして抽出し
、その類似度を与えるカテゴリを文字認識結果として出
力するものである。The character recognition method for scene images according to the present invention updates the degree of similarity by comparing the certainty of categories of high similarity patterns extracted by character recognition processing, and updates the degree of similarity in each group of mutually related patterns. The pattern with the highest degree of similarity is extracted as a character pattern, and the category giving that degree of similarity is output as a character recognition result.

[Effect]

この発明は、本来相矛盾する文字パターン同士において
、それらの文字認識結果の類似度を文字パターンの相互
関係から反復的に更新して、最終的に関連するパターン
群の各々において類似度が最大となったカテゴリをもつ
文字パターンを一意的に決定された文字パターンとして
出力する。This invention repeatedly updates the similarity of character recognition results between character patterns that are contradictory in nature based on the mutual relationship of the character patterns, and finally reaches the maximum similarity in each of the related pattern groups. The character pattern with the new category is output as a uniquely determined character pattern.

従来の技術とは、文字認識処理による判定により得られ
た文字パターンをそのまま採用するのではなく、文字認
識処理による類似度の更新を行って文字パターンを選択
するようにした点が異なる。This method differs from the conventional technology in that the character pattern is selected by updating the degree of similarity through the character recognition process, rather than directly adopting the character pattern obtained by the determination through the character recognition process.

〔Example〕

第１図はこの発明を実施するための装置の一例を示すブ
ロック図であり、２１は画像入力部、２２は画像用メモ
リ、２３は文字線候補領域検出回路、２４は領域分割結
果格納メモリ、２５は文字パターン候補抽出回路、２６
は文字パターン候補格納メモリ、２７は文字認識処理回
路、２８は文字パターン性判定回路、２９は高類似度パ
ターン格納メモリ、３ｏは類似度更新回路、３１は文字
パターンの抽出と認識結実用メモリである。FIG. 1 is a block diagram showing an example of a device for carrying out the present invention, in which 21 is an image input section, 22 is an image memory, 23 is a character line candidate area detection circuit, 24 is an area division result storage memory, 25 is a character pattern candidate extraction circuit, 26
27 is a character pattern candidate storage memory, 27 is a character recognition processing circuit, 28 is a character pattern determination circuit, 29 is a high similarity pattern storage memory, 3o is a similarity update circuit, and 31 is a practical memory for character pattern extraction and recognition. be.

この動作は、まず、ＴＶカメラ等の画像入力部２１から
文字を含む実空間を濃淡を有する情景画像としてとらえ
、画像用メモリ２２に格納する。In this operation, first, a real space including characters is captured from an image input unit 21 such as a TV camera as a scene image having shading, and is stored in the image memory 22.

次に文字線候補領域検出回路２３により画像用メモリ２
２の画像に対して文字線候補領域を検出するための領域
分割を施し、結果をメモリ２４に格納する。Next, the image memory 2 is detected by the character line candidate area detection circuit 23.
The image No. 2 is divided into regions to detect character line candidate regions, and the results are stored in the memory 24.

ここで、文字線候補領域検出回路２３の処理例としては
、第２図に示すようなものがある。すなわち、第２図（
ａ）のように画像用メモリ２２をそれぞれｎｘｍ個の画
素からなるＮＸＭ個のサブブロックに分割して、各々こ
のサブブロックで２値化のためのしきい値を第２図（ｂ
）のように独立に求め、さらに各サブブロックの中心画
素にしきい値を割り当てて線形補間を行い、第２図（Ｃ
）のように各画素のしきい値を決定する。各画素のしき
い値と原画（画像用メモリ２２の内容）の対応する画素
を比較して２値化を行い、その結果得られた第２図（ｄ
）のような２値（白、黒）画像において８連結で連続し
た白または黒画素に対してラベリング処理を行い、画像
を領域分割する。ここで、文字線に対応する可能性が高
いと考えられる領域を検出するため、第３図に示すよう
に、各領域とそれを囲む外接矩形枠の内部の濃度差を評
価して文字線候補領域を検出する。Here, an example of processing by the character line candidate area detection circuit 23 is as shown in FIG. In other words, Figure 2 (
The image memory 22 is divided into NXM subblocks each consisting of nxm pixels as shown in a), and the threshold value for binarization in each subblock is set as shown in FIG. 2(b).
), and then allocate a threshold value to the center pixel of each subblock and perform linear interpolation to obtain the result shown in Figure 2 (C
) determine the threshold value for each pixel. Binarization is performed by comparing the threshold value of each pixel with the corresponding pixel of the original image (contents of the image memory 22), and the result is shown in FIG.
) in a binary (white, black) image, a labeling process is performed on 8 consecutive white or black pixels, and the image is divided into regions. Here, in order to detect areas that are considered to have a high possibility of corresponding to character lines, as shown in Figure 3, the density difference inside each area and the circumscribed rectangular frame surrounding it is evaluated and character line candidates are detected. Detect areas.

第３図で、１３１は注目領域、１３２は前記注目領域１
３１の外接矩形枠、１３３は前記注目領域１３１を囲む
領域のうち外接矩形枠１３２に含まれる範囲である。濃
度差の評価は、注目領域１３１と範囲１３３のそれぞれ
の平均濃度り、。In FIG. 3, 131 is a region of interest, and 132 is the region of interest 1.
31 is a circumscribed rectangular frame, and 133 is a range included in the circumscribed rectangular frame 132 among the areas surrounding the attention area 131. The density difference is evaluated based on the average density of the region of interest 131 and range 133.

Ｌ２を求め、ΔＬ＝　ｌ　Ｌ、−Ｌ２　ｌで定義される
濃度差ΔＬがΔＬ〉θＬ　（θ、はしきい値）を充せば
文字線候補領域として検出し、充さなければ注目領域１
３１を各む領域に統合する。この結果は領域分割結果格
納メモリ２４に格納される。L2 is calculated, and if the density difference ΔL defined by ΔL=lL, -L2l satisfies ΔL>θL (θ is a threshold value), it is detected as a character line candidate region, and if it does not, it is detected as a region of interest 1.
31 into each area. This result is stored in the area division result storage memory 24.

文字パターン候補抽出回路２５では、領域分割結果格納
メモリ２４の内容である領域分割結果から、文字パター
ン候補を切出す。この処理内容例を第４図に示す。The character pattern candidate extraction circuit 25 extracts character pattern candidates from the region division results that are the contents of the region division result storage memory 24. An example of the contents of this process is shown in FIG.

第４図（ａ）〜（ｄ）で、６１は文字「け」、６２は文
字「いＪ、６３．６４は前記文字６１を構成する領域、
６５．６６は前記文字６２を構成する領域、６７は探索
範囲、６８〜７１は組合せパターンである。いま、領域
６５に注目しているとすると、探索範囲６７の中につい
て、領域６５の平均濃度Ｌ０に対し、平均濃度りがＩＬ
−Ｌ。In FIGS. 4(a) to (d), 61 is the character "ke", 62 is the character "iJ, 63.64 is the area constituting the character 61,
65 and 66 are areas constituting the character 62, 67 is a search range, and 68 to 71 are combination patterns. Assuming that we are currently paying attention to the region 65, in the search range 67, the average density is IL compared to the average density L0 of the region 65.
-L.

１≦θ。（θ。はしきい値）を充す領域、あるいは前述
の２値化結果における値（白または黒）をＢとし、領域
６５の値をＢＯとすると、Ｂ＝ＢＯの領域を探す。第４
図では領域６４と６６が見出される。領域６４，６５．
６６の組合せから４個の組合せパターン６８〜７１が抽
出される。ところで、文字に対応するパターンの外接矩
形枠１３２の縦横比ｒはθｌ　＜ｒ＜θｈ　（θ１．θ
ゎはｒの下限、上限値）の範囲にあるので、この式を満
しているか否かを判定すると、第４図で組合せパターン
７１は横長であるので棄却され、組合せパターン６８〜
７０が文字パターン候補として抽出され、文字パターン
候補格納メモリ２６に各文字パターン候補を構成する領
域の識別子とともに格納される。1≦θ. (θ is a threshold value) or the value (white or black) in the above-mentioned binarization result is B, and the value of the area 65 is BO, then an area where B=BO is searched. Fourth
In the figure regions 64 and 66 can be found. Areas 64, 65.
Four combination patterns 68 to 71 are extracted from the 66 combinations. By the way, the aspect ratio r of the circumscribed rectangular frame 132 of the pattern corresponding to the character is θl < r < θh (θ1.θ
Since ゎ is in the range of the lower limit and upper limit of r), when it is determined whether this formula is satisfied, combination pattern 71 in Fig. 4 is rejected because it is horizontally long, and combination patterns 68 to 68 are rejected.
70 is extracted as a character pattern candidate and stored in the character pattern candidate storage memory 26 together with the identifier of the area forming each character pattern candidate.

次に、文字パターン候補格納メモリ２６に格納された文
字パターン候補は、文字認識処理回路２７により文字認
識辞書中のカテゴリーＣＩとの類似度Ｓ（ＣＩ）（０≦
Ｓ　（ＣＩ　）≦１、値が１に近い程類似していること
を示す）を計算する。ここで類似度Ｓ（ＣＩ）は文字認
識法に応じた計算法により得られる。また、認識処理に
より各カテゴリとの距！ｌ　（ｃ＋）（ｏｒａ　（ＣＩ
）、値が０に近い程よく類似している）を用いるものが
あるが、ｓ　ｚ　　（（、）　＝＝ｅ　−ｄ　Ｌ　ＣＩ
　１等の変換によりソノ値域を０≦Ｓ’　　（ＣＩ）≦
１に変換できるので、以後は類似度を用いて説明する。Next, the character pattern candidates stored in the character pattern candidate storage memory 26 are processed by the character recognition processing circuit 27 to have a degree of similarity S (CI) (0≦) with the category CI in the character recognition dictionary.
S (CI)≦1, the closer the value is to 1, the more similar they are). Here, the degree of similarity S(CI) is obtained by a calculation method according to the character recognition method. Also, the distance between each category is determined through recognition processing! l (c+)(ora (CI
), the closer the value is to 0, the more similar) is used, but s z ((,) ==e −d L CI
The sono range is changed to 0≦S' (CI)≦ by transformation of 1st class.
Since it can be converted to 1, the following explanation will be made using similarity.

文字パターン性判定回路２日によりＳ（ＣＩ）≧Ｔ　（
ＣＩ）、（ｉ＝ｔ、２．・・・・・−、Ｎ、Ｎはカテゴ
リ数、’ｒ（ａｔ）はカテゴリＣ１における類似度のし
きい値）を評価し、これを満足するカテゴリが１つでも
あれば、高類似度パターンとしてパターンを構成する領
域の数と識別子、候補カテゴリ。By character pattern determination circuit 2nd day, S(CI)≧T (
CI), (i=t, 2...-, N, N is the number of categories, 'r(at) is the threshold of similarity in category C1), and the category that satisfies this is evaluated. If there is at least one, it is considered a high similarity pattern, including the number of regions, identifiers, and candidate categories that make up the pattern.

類似度を高類似度パターン格納メモリ２９に格納する。The similarity is stored in the high similarity pattern storage memory 29.

ここで候補カテゴリとは、上式を満すカテゴリを指す。Here, the candidate category refers to a category that satisfies the above formula.

上式を満すカテゴリが１つもない場合は、文字に対応す
る可能性がないとして棄却する。If there is no category that satisfies the above formula, the category is rejected as there is no possibility of it corresponding to the character.

ここで、もし高類似度パターン格納メモリ２９に格納さ
れた高類似度パターンが、全て正しく真の文字に対応し
ていれば問題はないが、実際には文字認識処理のあいま
い性により真の文字に対応していないものを高類似度パ
ターンとして抽出されてしまうことがしばしば起こる。Here, there would be no problem if all the high similarity patterns stored in the high similarity pattern storage memory 29 correctly correspond to true characters, but in reality, due to the ambiguity of character recognition processing, true characters It often happens that patterns that do not correspond to the pattern are extracted as high similarity patterns.

本来、画像の領域分割から得られた領域と文字パターン
との間には以下の原則が成り立つ。Originally, the following principle holds between the region obtained by region segmentation of an image and a character pattern.

（１）１つの領域は１つの文字パターンに対応するか、
いずれの文字パターンにも対応しないかのいずれかであ
る。(1) Does one area correspond to one character pattern?
Either it does not correspond to any character pattern.

（２）　　もし、ある領域が文字パターンに対応するな
らばそれと隣り合う領域は文字パターンに対応しない。(2) If a certain region corresponds to a character pattern, the adjacent region does not correspond to the character pattern.

しかし、前述のような文字認識処理のあいまい性のため
、上記　（１）と　（２）に矛盾するパターン同士が高
類似度パターンとして抽出される。また、単純な高類似
度の値の比較により上記（１）と　（２）を満足するよ
うに文字パターンを決定しようとするとやはり誤りを起
こす確立が高い。そこで、類似度更新回路３０により高
類似度パターンの相互関係から矛盾を除去しつつ類似度
の更新を行い、前記（１）と　（２）に矛盾しない形で
文字パターンの一意的決定を行う、第５図に類似度更新
回路３０の詳細を示す。However, due to the ambiguity of character recognition processing as described above, patterns that contradict the above (1) and (2) are extracted as high similarity patterns. Furthermore, if a character pattern is attempted to be determined so as to satisfy the above (1) and (2) by simply comparing high similarity values, there is a high probability that an error will occur. Therefore, the similarity update circuit 30 updates the similarity while removing contradictions from the mutual relationships of high similarity patterns, and uniquely determines the character pattern in a manner that does not contradict the above (1) and (2). FIG. 5 shows details of the similarity update circuit 30.

第５図で、１００は直接関連パターン計算回路、１０１
は直接関連パターン格納メモリ、１０２は類似度計算回
路、１０３はカテゴリパラメータ用メモリ、１０４は更
新類似度用メモリ、１０５は間接関連パターン計算回路
、１０６は間接関連パターン格納用メモリ、１０７は文
字パターン判定回路である。In FIG. 5, 100 is a directly related pattern calculation circuit, 101
102 is a memory for storing directly related patterns, 102 is a similarity calculation circuit, 103 is a memory for category parameters, 104 is a memory for updating similarity, 105 is a circuit for calculating indirect related patterns, 106 is a memory for storing indirect related patterns, 107 is a character pattern This is a judgment circuit.

類似度更新回路３０では、まず、各高類似度パターンの
直接関連パターンを直接関連パターン計算回路１００に
より求め、直接関連パターン格納メモリ１０１に格納す
る。直接関連パターン格納メモリ１０１には決定フラグ
が各パターンに対応して設けられ、初期状態ではＯＮに
なフている。In the similarity update circuit 30, first, directly related patterns of each high similarity pattern are calculated by the directly related pattern calculation circuit 100 and stored in the directly related pattern storage memory 101. A determination flag is provided in the directly related pattern storage memory 101 corresponding to each pattern, and is turned on in the initial state.

ここで、ある高類似度パターンｉの直接関連パターンｉ
′とは、以下の条件を満す集合ＲＩ、　Ｒ２。Here, a directly related pattern i of a certain high similarity pattern i
' is a set RI, R2 that satisfies the following conditions.

Ｒ３，Ｒ４のいずれかの要素である領域から構成される
パターンと定義する。It is defined as a pattern consisting of a region that is an element of either R3 or R4.

■ＰＣＲ，，■Ｒ２ＣＰ、■ｆｔ３　（Ｈ，■Ｑ　ｃ　
Ｒ４ここで、Ｐはｉを構成する領域の集合、ＨはＰの要
素である領域の穴の集合、ＱはＰの要素が全て他の領域
の穴であるときにＰの要素を囲む領域の集合を示す。こ
れらの例を第６図（ａ）、（ｂ）に示す。■PCR,, ■R2CP, ■ft3 (H, ■Q c
R4 Here, P is the set of regions constituting i, H is the set of holes in regions that are elements of P, and Q is the region surrounding the elements of P when all the elements of P are holes in other regions. Indicates a set. Examples of these are shown in FIGS. 6(a) and 6(b).

第５図の類似度計算回路１０２では、第１図の高類似度
パターン格納メモリ２９に格納された各高類似度パター
ン毎に、直接関連パターン格納メモリ１０１に格納され
た直接関連パターンとの相互関係から、以下のように類
似度の更新を行う（Ｓ−〇）（ＣＪ）がパターンｉの候
補カテゴリＣ。The similarity calculation circuit 102 shown in FIG. 5 calculates the correlation between each high similarity pattern stored in the high similarity pattern storage memory 29 shown in FIG. 1 and the directly related pattern stored in the directly related pattern storage memory 101. From the relationship, the similarity is updated as follows: (S-〇) (CJ) is candidate category C for pattern i.

の初期類似度）。initial similarity).

（ＣＪ）　ｉ　ｆ　Ｓ　ｒ”’　（ＣＪ）＜０．５ｊ＝
１．２．−−−−−−、ｍ：ｍはパターンｉの候補カテ
ゴリ数ｑ＋・（１／Ｎｔ’）Σｄ、ｌΣｒ　目’（Ｃ、（’）
　Ｓ、　′ｋ）１／　　　　　　　ｃ／（Ｃ′）Ｃ′は直接関連パターンｉ′のＮｃ個の候補カテゴリの
集合ｄｉ諺１／Ｎｅここで、ｒム１’（Ｃ＋　Ｃ’　）は適合係数であり、
パターンｉの候補カテゴリＣとパターンｉ′の候補カテ
ゴリＣ′の確からしさを比較し、前者の方が確からしい
と＋１、逆だと−１に近い値をとる。カテゴリの確から
しさの評価パラメータはいろいろ考えられるが、例えば
カテゴリの領域数。(CJ) if S r”'(CJ)<0.5j=
1.2. --------, m: m is the number of candidate categories for pattern i q+・(1/Nt') Σd, lΣr'th' (C, (')
S, 'k)1/ c/ (C') C' is the set of Nc candidate categories of directly related pattern i' di proverb 1/Ne, where rm1'(C+C') is the fitness coefficient can be,
The likelihood of candidate category C of pattern i and candidate category C' of pattern i' is compared, and if the former is more likely, the value is +1, and if the opposite is true, it takes a value close to -1. There are many possible evaluation parameters for the certainty of a category, such as the number of areas in the category.

穴の数、外接矩形枠の縦横比がある。具体的には以下の
ように計算する。There are the number of holes and the aspect ratio of the circumscribed rectangle. Specifically, it is calculated as follows.

（１）　Ｐ　ＣＲ、の関係にある直接関連パターンｉ″
ｉ′は複数の領域から構成されるので、もしｃ’　　（
ｔ’の候補カテゴリ）が単一領域文字であるならＣ′の
確からしさはＣ（ｉの候補カテゴリ）のそれより低いの
で、ｒ目（Ｃ，Ｃ’　）を正にすることによりＣの類似
度を増加させる。もしＣ′が複数流域文字ならばｒ目（
Ｃ，ｃｌ　）はｉ′とＣ′の領域数の差に応じて負の値
をとる。(1) Directly related pattern i'' in the relationship P CR
Since i′ consists of multiple regions, if c′ (
If C' (candidate category of t') is a single region character, the probability of C' is lower than that of C (candidate category of i). increase degree. If C′ is a multi-basin character, then the rth character (
C, cl) takes a negative value depending on the difference in the number of regions between i' and C'.

式で書くと、 −Ｎ’ｌ’　　ｌ／ＮＲ’　））（ＮＲ’＝２）ｅはＯ＜ｅ＜１の定数、ＮＩ、ＮＲ’　はそれぞれｉ′
とＣ′の領域数である。Written as a formula, −N'l'l/NR' )) (NR'=2) e is a constant of O<e<1, NI and NR' are each i'
and the number of regions of C'.

（２）Ｒ２ＣＰの関係にある直接関連パターンｉ′ｉ′
とｉ、Ｃ’　とＣを入れ替えて同様にＩ　ｌ／ＮＲ））
　　（ＮＨ−４）ここで、ＮＩとＮＲはｉとＣの領域数である。(2) Directly related pattern i′i′ in R2CP relationship
and i, C' and C are replaced and I l/NR))
(NH-4) Here, NI and NR are the numbers of regions of i and C.

（３）Ｒ，ＣＨの関係にある直接関連パターンｉ′注目
パターンｉは穴を有するので、もしＣに穴がなければＣ
の確からしさはＣ′よりも小さい。(3) Directly related pattern i′ in the relationship between R and CH Since the pattern of interest i has a hole, if C does not have a hole, C
The probability of is smaller than C'.

ｒ目（ｃ、ｃ’　）を負にすることによりＣの類似度は
減少する。もしＣに穴があればｒ＋＋（Ｃ。By making the r-th (c, c') negative, the similarity of C decreases. If C has a hole, r++(C.

Ｃ′）はｉとＣの穴に数に差に応じて正となる。C') becomes positive depending on the difference in the number of holes in i and C.

すなわち、ＩＩ／ＨＲ））　　　（ＮＲ≧１）ここで、ＨｌとＨＲはｉとＣの穴の数である。That is, II/HR)) (NR≧1) Here, Hl and HR are the numbers of holes i and C.

（４）ＱＥＲ４の関係にある直接関連パターンｉ′ｉと
ｉ、ＣとＣ′を入れ替えることによりＮｉｌ／ＨＲ））（Ｎ　Ｒ’≧１）ここで、Ｈ１’　とＨＲ’　はｉ′とＣ′の穴の数であ
る。(4) Nil/HR by replacing directly related patterns i'i and i, C and C' in the relationship of QER4)) (N R'≧1) Here, H1' and HR' are i' and C ′ is the number of holes.

各カテゴリの外接矩形枠の縦横比の確からしらの比較に
より以下のように計算する。The aspect ratios of the circumscribed rectangular frames of each category are calculated as follows.

ｒ＋＋’（Ｃ，Ｃ’）−Ｃｅ／４）（２−Ｉ　ＱＡ　Ｉ
　−ＱＡＣｌ）　−（ｅ／８）（ＩＱＡＩ　− ＱＡＣＩ　−Ｉ　ＱＡ　Ｉ−ＱＡＣ＋　”）ここで、Ｑ
ＡＣ，ＱＡＣ’　、ＱＡｌ、ＱＡＩ’　はＣ，Ｃ’　、
ｉ、ｉ’の量子化された縦横比ＱＡ値である。ＱＡ値は
縦横比を数段階程度に量子化した値である。この式によ
り、もしｉとＣが同じようなＱＡ値をとりｉ′　とＣ′
が異なったＱＡ値をとるとｒ目（Ｃ，Ｃ’　）は正とな
り、Ｃの類似度が大きくなる。r++'(C,C')-Ce/4)(2-I QA I
−QACl) −(e/8)(IQAI − QACI −I QA I−QAC+ ”) where, Q
AC, QAC', QAl, QAI' are C, C',
is the quantized aspect ratio QA value of i, i'. The QA value is a value obtained by quantizing the aspect ratio into several levels. According to this formula, if i and C have similar QA values, i' and C'
If they take different QA values, the r-th (C, C') becomes positive, and the similarity of C increases.

以上の処理は第７図に示すように、２段階（前半は領域
数が穴、後半は縦横比）で行われる。また、適当回数（
理想的には類似度が各々ある値に収束するまで）繰り返
し、結果を更新類似度用メモリ１０４に格納する。また
、カテゴリの各評価パラメータは、第８図に示すように
、テーブル形式でカテゴリパラメータ１０３に格納され
ているので、適宜前述の処理中ｃ、ｃ’のパラメータを
求めるのに利用する。As shown in FIG. 7, the above processing is performed in two stages (the first half is based on the number of regions, and the second half is based on the aspect ratio). Also, an appropriate number of times (
ideally until each similarity converges to a certain value), and the results are stored in the update similarity memory 104. Furthermore, as shown in FIG. 8, each category evaluation parameter is stored in the category parameter 103 in a table format, so that it is appropriately used to obtain the parameters c and c' during the above-described processing.

なお、第８図中のＱＡ値は以下のように量子化される。Note that the QA values in FIG. 8 are quantized as follows.

ＱＡ＝１　ｉ　ｆＡｐ≦０．５ＱＡ−２ｉｆ０．５＜Ａｐ≦０．８ＱＡ＝３ｉｆ０．８＜Ａｐ≦１．２５ＱＡ＝４ｉｆ１．２５＜Ａｐ≦２．０ＱＡ＝５ｉｆＡｐ＞２．０次に、文字パターンの決定を行うが、あらかじめ高類似
度パターンを間接関連パターン計算回路１０５により間
接関連パターングループに分けておく。あるパターン１
０を初期グループに入れる。次に、１０の直接関連パタ
ーン１１を、全てその初期グループの入れる。同様に、
各ｉｋ　（ｋ〉０）の直接関連パターン１　ｋ＋１を入
れる。ここで、既に入れられたものは処理しない、もし
これ以上新たに入れるパターンが無くなれば、次のまだ
いずれの初期グループにも入れられていないパターンを
新たに初期グループを設けてこれに入れ、同様の処理を
行う。これを全てのパターンがいずれかのグループに入
れられるまで繰返す。ある初期グループのパターン中に
存在する領域の集合をＲｐ　Ｉ　Ｒｐのいずれかの要素
の領域の穴の集合をＲｈ１Ｒｐの要素を囲む領域の集合
をＲ１とすると、もし異なる初期グループのいずれかの
パターン同士がＲ，、Ｒｈ、Ｒ，の要素から構成されて
いれば、これらの初期グループを結合し、間接関連パタ
ーングループを形成する。この例を第６図に示す。間接
関連パターン計算回路１０５の処理結果は間接関連パタ
ーン格納用メモリ１０６に格納される。前述の類似度の
更新後、文字パターン判定回路１０７により各パターン
ｉ毎に最大類似度Ｓ　ｌ、ｍａｘ−ｍａｘ（Ｓ　１′に
’　（Ｃ＋）、　　・・・、　　３１′に’　（Ｃ，）
）が得られる。次に、各間接関連パターングループ中の
最大Ｓ、、ｔｘａｘを有するパターンを文字パターンと
して抽出し、その類似度を与える候補カテゴリを文字認
識結果として抽出し、これらを文字パターンの抽出と認
識結実用メモリ３１に格納する全てノ間接関連パターン
グループから同様に文字パターンと認識結果を抽出する
。その後、以下のような領域を１つでも有するパターン
は対応しないとして直接関連パターン格納メモリ１０１
のそのパターンの決定フラグをＯＦＦにする。すなわち
、（ａ）　　文字パターンとして決定されたパターンを構
成する領域（ｂ）　　（ａ）の領域の穴に対応する領域（Ｃ）　　
（ａ）の領域を囲む領域のいずれかである。直接関連パターン格納メモリ１０１
の決定グラフが全てＯＦＦならば処理を停止し、ＯＮの
パターンが残っていればＯＮのものについてのみ同様の
処理を行う。QA=1 i fAp≦0.5 QA-2if0.5<Ap≦0.8 QA=3if0.8<Ap≦1.25 QA=4if1.25<Ap≦2.0 QA=5ifAp>2.0 Next Next, character patterns are determined, and the high similarity patterns are divided into indirectly related pattern groups by the indirectly related pattern calculation circuit 105 in advance. A certain pattern 1
Put 0 into the initial group. Next, all 10 directly related patterns 11 are placed in that initial group. Similarly,
Insert directly related pattern 1 k+1 for each ik (k>0). Here, the ones that have already been placed are not processed. If there are no more new patterns to add, create a new initial group and put the next pattern that has not been placed in any initial group into this, and do the same. Process. Repeat this until all patterns are placed in one of the groups. The set of regions existing in the pattern of a certain initial group is Rp I The set of holes in the region of any element of Rp is Rh1 The set of regions surrounding the elements of Rp is R1. If any pattern of a different initial group If each group is composed of elements R,,Rh,R,, these initial groups are combined to form an indirectly related pattern group. An example of this is shown in FIG. The processing results of the indirect related pattern calculation circuit 105 are stored in the indirect related pattern storage memory 106. After updating the above-mentioned similarity, the character pattern determination circuit 107 calculates the maximum similarity S l, max-max (S 1' to ' (C+), ..., 31'to' (C,) for each pattern i by the character pattern determination circuit 107.
) is obtained. Next, the pattern with the maximum S,,txax in each indirectly related pattern group is extracted as a character pattern, candidate categories that give the similarity are extracted as character recognition results, and these are used for character pattern extraction and recognition. Similarly, character patterns and recognition results are extracted from all indirectly related pattern groups stored in the memory 31. Thereafter, patterns having at least one of the following areas are deemed not to correspond and are stored in the directly related pattern storage memory 101.
The determination flag for that pattern is turned OFF. That is, (a) an area constituting a pattern determined as a character pattern (b) an area (C) corresponding to the hole in the area of (a)
This is any area surrounding the area in (a). Directly related pattern storage memory 101
If all of the decision graphs are OFF, the process is stopped, and if ON patterns remain, the same process is performed only for the ON patterns.

以上のようにして、文字パターンの抽出と認識結実用メ
モリ３１には文字パターンの一意的決定処理結果が得ら
える。As described above, the result of character pattern extraction and recognition processing is obtained in the practical memory 31.

なお、上記説明では文字線候補領域検出のため前述の方
法を用いているが、他の画像の領域分割法等を用いても
、この発明に適用可能である。In the above description, the method described above is used to detect character line candidate regions, but other image region division methods can also be applied to the present invention.

〔Effect of the invention〕

以上説明したようにこの発明は、文字認識処理の類似度
のそのものの値を信用するのではなく、基本的なパラメ
ータを互いに関連するパターン同士で比較し、類似度を
更新するようにしたため、文字認識処理のあいまい性を
除去でき、矛盾のない形で情景画像中の文字パターンの
一意的決定が行えるという利点がある。As explained above, this invention does not rely on the value of the similarity itself in character recognition processing, but rather compares basic parameters between mutually related patterns and updates the similarity. This method has the advantage that ambiguity in recognition processing can be removed and character patterns in a scene image can be uniquely determined without contradiction.

したがって、この発明は車のナンバープレートの文字１
名札の文字認識によるパトロールロボット、荷物の行先
の認識による仕分は作業の自動化、ブック形成の文書に
書かれた文字認識等に広く適用できる。Therefore, this invention is based on the letter 1 of the car license plate.
It can be widely applied to patrol robots that recognize characters on name tags, sorting by recognizing the destination of packages, automation of work, and character recognition written on documents in book formation.

[Brief explanation of the drawing]

第１図はこの発明を実施する装置の一例を示すブロック
図、第２図は２値化処理の例を示す図、第３図は濃度差
の評価法を説明する図、第４図は文字パターン候補の抽
出原理の説明図、第５図は類似度の更新回路の実施例を
示すブロック図、第６図は直接関連パターンと間接関連
パターングループの例を示す図、第７図は類似度更新の
ための結合係数の求め方を示す流れ図、第８図は評価用
パラメータテーブルの一例を示す図、第９図は従来技術
の説明図である。図中、２１は画像入力部、２２は画像用メモリ、２３は
文字線候補領域検出回路、２４は領域分割結果格納メモ
リ、２５は文字パターン候補抽出回路、２６は文字パタ
ーン候補格納メモリ、２７は文字認識処理回路、２８は
文字パターン性判定回路、２日は高類似度パターン格納
メモリ、３ｏは類似度更新回路、３１は文字パターンの
抽出と認識結実用メモリ、６１は文字「け」、６２は文
字「いＪ、６３．６４は文字６１を構成する領域、６５
．６８は文字６２を構成する領域、６７は探索範囲、６
８〜７１は組合せパターン、１００は直接関連パターン
計算回路、１０１は直接関連パターン格納メモリ、１０
２は類似度計算回路、１０３はカテゴリパラメータ用メ
モリ、１０４は更新類似度用メモリ、１０５は間接関連
パターン計算回路、１０６は間接関連パターン格納用メ
ーＴ−リ、１０７は文字パターン判定回路である。第１図第２図（ａ）　　　　（ｂ）（Ｃ）　　　　（ｄ）第３図第４図第５図ユ立第６図（ａ）第６図（ｂ）Fig. 1 is a block diagram showing an example of a device implementing the present invention, Fig. 2 is a diagram showing an example of binarization processing, Fig. 3 is a diagram explaining a density difference evaluation method, and Fig. 4 is a text diagram. An explanatory diagram of the principle of extracting pattern candidates, FIG. 5 is a block diagram showing an example of a similarity updating circuit, FIG. 6 is a diagram showing an example of a directly related pattern and an indirectly related pattern group, and FIG. 7 is a diagram showing the similarity. FIG. 8 is a flowchart showing how to obtain a coupling coefficient for updating, FIG. 8 is a diagram showing an example of an evaluation parameter table, and FIG. 9 is an explanatory diagram of the prior art. In the figure, 21 is an image input unit, 22 is an image memory, 23 is a character line candidate area detection circuit, 24 is a region division result storage memory, 25 is a character pattern candidate extraction circuit, 26 is a character pattern candidate storage memory, and 27 is a character pattern candidate storage memory. Character recognition processing circuit, 28 character pattern determination circuit, 2nd high similarity pattern storage memory, 3o similarity update circuit, 31 character pattern extraction and recognition practical memory, 61 character "ke", 62 is the character ``J, 63.64 is the area that makes up the character 61, 65
．． 68 is an area forming the character 62, 67 is a search range, 6
8 to 71 are combination patterns, 100 is a directly related pattern calculation circuit, 101 is a directly related pattern storage memory, 10
2 is a similarity calculation circuit, 103 is a memory for category parameters, 104 is a memory for updated similarity, 105 is an indirect related pattern calculation circuit, 106 is a memory for storing indirect related patterns, and 107 is a character pattern determination circuit. . Figure 1 Figure 2 (a) (b) (C) (d) Figure 3 Figure 4 Figure 5 Standing Figure 6 (a) Figure 6 (b)

Claims

[Claims]

A scene containing characters existing on a two-dimensional surface or three-dimensional space is input as a gray scale image, areas that are likely to correspond to character lines are detected as regions, and combinations of regions that exist in the vicinity of each other are used as character pattern candidates. In a character recognition processing method for determining whether or not the character pattern candidate corresponds to a character, the character pattern candidate is extracted as A scene image characterized by updating the degree of similarity, finally extracting a pattern having the maximum degree of similarity among mutually related pattern groups as a character pattern, and outputting a category giving that degree of similarity as a character recognition result. Character recognition method inside.