JPH0458383A

JPH0458383A - Multi font character recognizing device

Info

Publication number: JPH0458383A
Application number: JP2170980A
Authority: JP
Inventors: Satoru Arita; 悟有田
Original assignee: Omron Corp; Omron Tateisi Electronics Co
Current assignee: Omron Corp
Priority date: 1990-06-27
Filing date: 1990-06-27
Publication date: 1992-02-25

Abstract

PURPOSE: To provide noise-tolerant recognition that works reliably even on figures with underlines and the like, by matching a template against the image of the character string to be recognized while scanning the template, and by repeating the matching for fonts with a high degree of coincidence. CONSTITUTION: Templates of the fonts to be recognized are stored in a template storage means 4, and a matching processing means 5 performs template matching while scanning each template over the image of the character string to be recognized. A control means 3 controls the matching processing means 5 so that matching is performed successively for the fonts with a high degree of coincidence. As a result, the characters to be recognized need not be segmented, the font can be recognized regardless of the conditions under which the characters are written, and the recognition time is shortened.

Description

DETAILED DESCRIPTION OF THE INVENTION [Field of Industrial Application] The present invention relates to a multi-font character recognition device that recognizes which of a plurality of fonts the characters in a character string image correspond to.

[Prior Art] In general, the same character exists in many fonts. Kanji, for example, can be written in styles such as kaisho (block script), gyosho (semi-cursive script), and sosho (cursive script). Even within kaisho alone, various typefaces such as Mincho and Gothic are in use.

Font recognition devices exist to recognize such character types. To recognize the font of a character string such as the one shown in FIG. 6, for example when recognizing the character "c" enclosed by the dotted line, such a device prepares templates of font 1, font 2, font 3, and so on, written in the recognizable fonts as shown in FIG. 7. It matches the target character against the templates of all characters of all fonts and recognizes the font with the highest degree of matching as the font of the character string.

That is, the character "c" to be recognized is first cut out, and the extracted character "c" is matched in turn against the template of the first character "a" of font 1, then the template of the next character "b", and so on. When matching against all characters of font 1 is finished, the same matching is performed against all characters of font 2. In the same way, matching is performed against all characters of all fonts.

The remaining characters of the string, "h", "a", "r", "a", and so on, are then matched in turn in the same way.

By matching every character against every font in this way, the font of the character string is recognized.

[Problem to Be Solved by the Invention] However, such a conventional device matches a large number of templates, covering all characters of all fonts, so the amount of data to process is large and the processing time is long. In addition, it presupposes that the characters to be recognized can be cut out. If noise is mixed in as shown in FIG. 8(a), the image signal becomes as shown in FIG. 8(b); the noisy portion cannot be cut out, so correct recognition is not possible.

Likewise, for underlined characters as shown in FIG. 9 and rotated characters as shown in FIG. 10, the characters to be recognized cannot be cut out, so matching cannot be judged correctly.

The present invention therefore provides a multi-font character recognition device that can recognize the font regardless of the conditions under which the characters to be recognized are written, and that has a short recognition time.

[Means for Solving the Problem] To solve these problems, the multi-font character recognition device of the present invention comprises: template storage means for storing templates of a plurality of fonts; matching processing means for selecting the template of a predetermined character of a predetermined font, scanning it over the character string image to be recognized, and successively judging the match between the character string image and the template; and control means for selecting fonts with a high degree of matching according to the output of the matching processing means and controlling the matching processing of the matching processing means.

[Operation] Templates of the plurality of fonts to be recognized are stored in the template storage means, and the matching processing means performs template matching by scanning each template over the image of the character string to be recognized. The control means controls the matching processing means so that matching is performed successively for the fonts with a high degree of matching.

[Embodiment] FIG. 1 is a block diagram showing an embodiment of the present invention. In the figure, 1 is an image input unit that captures image data of the characters to be recognized and of the templates; 2 is an image memory that stores the output data of the image input unit 1; 3 is a CPU (control means) that performs overall control; 4 is a template memory (template storage means) that stores the image data of the templates; and 5 is a matching processing unit (matching processing means) that matches the image data of the character string to be recognized against the template data read from the template memory 4 and calculates the degree of image matching at each scanning position.

The CPU 3 performs general control, such as loading data from the image input unit 1 into the image memory 2, loading data from the image memory 2 into the template memory 4, and supplying data from the template memory 4 to the matching processing unit 5. The CPU 3 also counts, for the templates of a given character in several fonts, the number of matches found in the character string to be recognized, selects the fonts whose count is at or above a predetermined threshold, and controls the device so that matching continues successively for those fonts.

6 is an operation panel for performing predetermined operations, and 7 is an external interface unit (external I/F) that exchanges signals with a host unit 8.

FIG. 2 is a flowchart showing the operation of this device.

In step 100, the image input unit 1 captures an image containing the character string to be recognized and images of the templates of the plurality of fonts. Next, in step 101, the scan flags of all fonts are turned on. Then, in step 102, the template data of a predetermined character of a predetermined font is loaded from the template pattern memory 4 into the matching processing unit 5.

Then, in step 103, the loaded template is scanned over the image map containing the character string to be recognized, and the X and Y coordinates at which the degree of matching is at or above a threshold are extracted. In step 104, the extracted pixels are merged to identify the positions and the number of occurrences of the target character, and in step 105 the scan flag of a font is turned off if its template yielded too few occurrences.
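The text does not say how step 104 merges the extracted coordinates. As a minimal sketch, one could assume that hits lying within a small distance of each other belong to the same character occurrence. The function name `merge_hits` and the `radius` parameter are hypothetical and not taken from the patent.

```python
def merge_hits(hits, radius=1):
    """Group nearby (x, y) hits and return one position per group.

    Assumption: two hits whose x and y each differ by at most `radius`
    pixels come from the same character occurrence.
    """
    clusters = []
    for x, y in sorted(hits):
        for cluster in clusters:
            if any(abs(x - cx) <= radius and abs(y - cy) <= radius
                   for cx, cy in cluster):
                cluster.append((x, y))
                break
        else:
            # No existing cluster is close enough: start a new one.
            clusters.append([(x, y)])
    # Report the first hit of each cluster as the character position.
    return [cluster[0] for cluster in clusters]


# Hits around two separate character positions.
hits = [(0, 0), (1, 0), (0, 1), (10, 0), (11, 1)]
positions = merge_hits(hits)
count = len(positions)
```

Here `positions` gives the location of each occurrence and `count` is the number that step 105 would compare against its criterion. A greedy pass like this is only an illustration; a connected-component labelling would be a more robust choice.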

Next, in step 106, it is determined whether steps 102 to 105 have been completed for all fonts whose flag is on, for the currently selected character. If any such font remains unprocessed, the flow returns to step 102, and the processing is repeated until steps 102 to 105 have been completed for all fonts whose flag is on.

When it is determined in step 106 that steps 102 to 105 have been completed for all flag-on fonts for the selected character, step 107 then determines whether font recognition has been completed for all characters.

If font recognition has not been completed for all characters, the flow repeats steps 102 to 107. When font recognition has been completed for all characters, the entire process ends.

The above processing is described more concretely below.

Suppose that, in order to recognize the font of the character string shown in FIG. 3, templates of a plurality of fonts are available as shown in FIGS. 4(a), (b), (c), (d), (e), and so on. First, the character to be scanned first (for example, the character "a") is selected. Next, one of the fonts whose flag is on (for example, font 1) is selected, and the template data of the selected character "a" in that font is read out. From the character string image to be recognized, the data in a range such as the one shown by the dotted line in FIG. 3 is also read out. This range is set to correspond to the size of the template; FIG. 5(a) shows an example, in which the character "a" is written in a range of 10×10 pixels. The data in this range is compared with the template data.

This comparison is performed by judging the agreement between corresponding pixels of the 10×10-pixel template data and the 10×10-pixel data of the selected range. As shown in FIG. 5(a), the pixel data consists of black portions (marked with * in the figure) and white portions (unmarked in the figure). A black pixel that corresponds to template data of logic 1 counts as a match, as does a white pixel that corresponds to template data of logic 0. A black pixel that corresponds to logic 0, or a white pixel that corresponds to logic 1, counts as a mismatch. The number of matching pixels is counted, and when this count exceeds a certain threshold, the image data in that range is judged to match the template data (that is, to be the character of that template).
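The pixel comparison described above can be sketched as follows. This is an illustration only: it assumes black pixels are stored as 1 and white pixels as 0, uses a 3×3 window instead of 10×10 for brevity, and the names `agreement` and `is_match` are hypothetical.

```python
def agreement(window, template):
    """Count pixels where the window and the template agree.

    Black on logic 1 and white on logic 0 both count as agreement.
    """
    return sum(
        1
        for w_row, t_row in zip(window, template)
        for w, t in zip(w_row, t_row)
        if w == t
    )


def is_match(window, template, threshold):
    """The window is judged to be the template's character when the
    agreement count exceeds the threshold."""
    return agreement(window, template) > threshold


template = [
    [1, 1, 1],
    [1, 0, 0],
    [1, 1, 1],
]
window = [
    [1, 1, 1],
    [1, 0, 0],
    [1, 1, 0],  # one pixel differs, e.g. because of noise
]
score = agreement(window, template)  # 8 of 9 pixels agree
```

With a threshold of 7 the noisy window is still accepted, which shows how a count-based criterion tolerates a small amount of noise; the patent does not state which threshold value is used.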

When this judgment is finished, the range shown by the dotted line in FIG. 3 is shifted one pixel to the right, and the data in the new range is compared with the template data. The range is moved horizontally to the right in the same way, one pixel at a time. When it reaches the right edge, it is returned to the left edge, moved down by one pixel, and again moved step by step to the right. In this way the entire character string image is scanned. This finds the characters in the character string of FIG. 3 that match the character "a" of font 1, and the number of such characters is counted.
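The raster scan described above (one pixel to the right at a time, then one pixel down) might look like the following sketch. It assumes binary images stored as lists of rows, and the function name `scan` and the tiny 2×2 template are hypothetical choices made for illustration.

```python
def scan(image, template, threshold):
    """Slide the template over the image and return every top-left (x, y)
    position where the number of agreeing pixels exceeds the threshold."""
    th, tw = len(template), len(template[0])
    ih, iw = len(image), len(image[0])
    hits = []
    for y in range(ih - th + 1):        # move down one pixel per row pass
        for x in range(iw - tw + 1):    # move right one pixel at a time
            score = sum(
                image[y + dy][x + dx] == template[dy][dx]
                for dy in range(th)
                for dx in range(tw)
            )
            if score > threshold:
                hits.append((x, y))
    return hits


template = [
    [1, 0],
    [0, 1],
]
image = [
    [1, 0, 0, 1, 0],
    [0, 1, 0, 0, 1],
    [0, 0, 0, 0, 0],
]
# Require all 4 pixels to agree (count > 3).
hits = scan(image, template, threshold=3)
```

The pattern occurs twice in this image, so `hits` holds two positions. Because every position is tested, no prior segmentation of individual characters is needed, which is the point the description makes about noise and underlines.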

When this is done, the number of characters in the string that match the character "a" of font 2 is detected in the same way. The character string of FIG. 3 contains no such match, so the match count in this case is zero. When the match count does not reach a predetermined reference value (for example, 1), the scan flag of font 2 is turned off. Continuing this processing for font 3, font 4, font 5, and so on up to the last font turns off the scan flag of every font that does not match for the character "a".

When recognition of the character "a" has been completed up to the last font, the same processing is performed for the character "b".

At this point, matching is skipped for the fonts whose scan flag was turned off while scanning the character "a", and is performed only for the fonts whose scan flag is still on. In other words, if an earlier comparison found no matching character for a font, the character string is taken not to be in that font.

When the comparison for the character "b" has been completed for the fonts whose scan flag is on, the comparison continues with the characters "c", "d", and so on up to "Z". As before, whenever a font yields no match, its scan flag is turned off.

Then, once the last character has been compared, the font with the largest number of matches is determined to be the font of the character string.
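The scan-flag bookkeeping of steps 101 to 107 can be sketched as below. This is a literal reading of the description under stated assumptions: the per-template scan is abstracted into a hypothetical callback `count_matches(font, char)`, the match counts in `counts` are made-up illustration values, and `min_count` stands for the "predetermined reference value (for example, 1)".

```python
def recognize_font(fonts, chars, count_matches, min_count=1):
    """Return the font with the most matches, skipping fonts whose
    scan flag has been turned off by an earlier character."""
    scan_flag = {font: True for font in fonts}   # step 101: all flags on
    totals = {font: 0 for font in fonts}
    for ch in chars:                             # outer loop: step 107
        for font in fonts:                       # inner loop: step 106
            if not scan_flag[font]:
                continue                         # flag off: skip matching
            n = count_matches(font, ch)          # steps 102 to 104
            if n < min_count:
                scan_flag[font] = False          # step 105
            else:
                totals[font] += n
    return max(fonts, key=lambda f: totals[f])


# Made-up match counts for illustration; missing pairs count as zero.
counts = {
    ("font1", "a"): 2, ("font3", "a"): 1,
    ("font1", "c"): 2,
    ("font1", "h"): 1,
}
calls = []


def count_matches(font, ch):
    calls.append((font, ch))     # record how many scans were performed
    return counts.get((font, ch), 0)


best = recognize_font(["font1", "font2", "font3"], ["a", "c", "h"], count_matches)
```

In this toy run only 6 of the 9 possible font and character combinations are scanned, which illustrates where the claimed reduction in recognition time comes from.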

Although the above description compares the characters in order from "a" to "Z", the recognition time becomes shorter if the comparison starts from the characters that occur most frequently. The degree of matching can also be detected by differentiating both the character string image and the template image to obtain the information shown in FIG. 5(b), and counting the matches among the pixels marked with *. In this case, matches in the white portions need not be counted.
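The patent does not specify the differential operator. As a rough sketch, one could assume it yields the outline of each stroke, that is, black pixels with at least one white 4-neighbour, and then count only positions where both outlines are set. The names `outline` and `edge_agreement` are hypothetical.

```python
def outline(img):
    """Return a binary image marking black pixels that touch a white
    pixel (or the image border) in one of the 4 directions.

    Assumption: this stands in for the 'differential processing' that
    produces the * pixels of FIG. 5(b).
    """
    h, w = len(img), len(img[0])

    def px(y, x):
        # Pixels outside the image are treated as white.
        return img[y][x] if 0 <= y < h and 0 <= x < w else 0

    return [
        [
            1 if img[y][x] == 1 and min(
                px(y - 1, x), px(y + 1, x), px(y, x - 1), px(y, x + 1)
            ) == 0 else 0
            for x in range(w)
        ]
        for y in range(h)
    ]


def edge_agreement(a, b):
    """Count positions where both outline images are set; white
    positions are ignored."""
    return sum(p == 1 and q == 1
               for row_a, row_b in zip(a, b)
               for p, q in zip(row_a, row_b))


# A solid 3x3 block inside a 5x5 image: its outline is the 8-pixel ring.
block = [
    [0, 0, 0, 0, 0],
    [0, 1, 1, 1, 0],
    [0, 1, 1, 1, 0],
    [0, 1, 1, 1, 0],
    [0, 0, 0, 0, 0],
]
edges = outline(block)
score = edge_agreement(edges, edges)
```

Counting only outline pixels reduces the number of comparisons per window, since the interior and the white background no longer contribute to the score.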

[Effects of the Invention] As described above, the multi-font character recognition device of the present invention matches a template against the image of the character string to be recognized by scanning the template, and repeats the matching for the fonts with a high degree of matching. Cut-out processing is therefore unnecessary, the device is robust against noise, and even figures with underlines and the like can be recognized reliably. Furthermore, because recognition proceeds while narrowing down the fonts with a high degree of matching, the comparison no longer has to be performed for all characters, which increases the processing speed.

[Brief Description of the Drawings]

FIG. 1 is a block diagram showing the configuration of an embodiment of the multi-font character recognition device of the present invention; FIG. 2 is a flowchart showing its operation; FIG. 3 shows a character string to be recognized by the present invention; FIG. 4 illustrates the fonts used in the present invention; FIG. 5 illustrates the detection of the degree of matching; FIG. 6 shows a character string to be recognized by a conventional device; FIG. 7 illustrates the fonts used in the conventional device; and FIGS. 8 to 10 illustrate the cut-out operation of the conventional device.

1: image input unit; 2: image memory; 3: CPU (control means); 4: template pattern memory (template storage means); 5: matching processing unit (matching processing means); 6: operation panel; 7: external I/F; 8: host unit.

Claims

[Scope of Claims] A multi-font character recognition device comprising: template storage means for storing templates of a plurality of fonts; matching processing means for selecting the template of a predetermined character of a predetermined font, scanning it over a character string image to be recognized, and successively judging the match between the character string image and the template; and control means for selecting fonts with a high degree of matching according to the output of the matching processing means and controlling the matching processing of the matching processing means.