JPS6318484A

JPS6318484A - Method for recognizing printed character

Info

Publication number: JPS6318484A
Application number: JP61161684A
Authority: JP
Inventors: Koichi Ejiri; 公一江尻; Hajime Sato; 元佐藤
Original assignee: Ricoh Co Ltd
Current assignee: Ricoh Co Ltd
Priority date: 1986-07-09
Filing date: 1986-07-09
Publication date: 1988-01-26

Abstract

PURPOSE:To rapidly recognize various kinds of characters with the small number of templates by correcting the height of a detected character to a character pattern height in a character dictionary, and at every generation of a reject, referring a template with a different font and a general template. CONSTITUTION:A character feature pattern is extracted from a character string read out by a scanner 11 in a optical input device 2 by a character extracting unit 12 and a feature extracting unit 13 and matched with the content of a template memory 15 by a feature matching unit 14 controlled by a processor such as a CPU to recognize the character. At that time, the character height in the feature pattern is corrected to the character pattern height of the character dictionary, templates respectively corresponding to various fonts are successively referred at every generation of a reject, and a reject is still generated, the general template including characters with low using frequency is referred. Thereby, various kinds of characters can be recognized with the small number of templates and characters in a sentence can be rapidly recognized.

Description

【発明の詳細な説明】［技術分野］本発明は、文字認識技術に関し、特に、種々な文字種を
含む和文、英文等の活字文字を認識する技術に関するも
のである。DETAILED DESCRIPTION OF THE INVENTION [Technical Field] The present invention relates to character recognition technology, and particularly to technology for recognizing printed characters such as Japanese and English including various character types.

［従来技術］従来、多種の文字を読取るためには、次のような手順を
使っていた０例えば、文字の大きさで文字種を大分類し
た後、パターンマツチングのような通常の手続で文字認
識を行い、もし、リジェクト状態になった時は、異なる
フォントのテンプレート（辞書）と比較していた。すな
わち、第６図に示すように、まず１文字列を認識すると
き１段階５１で、ある幅以上の空白部を検出して文字パ
ターンを切出し、段階５２で文字切出しが終了したかを
判断し、終了していれば処理終了（ＥＮＤ）となる６文
字切出しが終了していなれば１段Ｎ５３に移る。[Prior Art] Conventionally, in order to read a wide variety of characters, the following procedure was used. Recognition is performed, and if the font is rejected, it is compared with a template (dictionary) of a different font. That is, as shown in FIG. 6, when one character string is recognized, in step 1 51, a blank space of a certain width or more is detected and a character pattern is cut out, and in step 52 it is determined whether character cutting is completed. If the process has been completed, the process ends (END).If the six character extraction has not been completed, the process moves to the first stage N53.

この段階５３で切出された文字パターンが特徴抽出・マ
ツチング処理される（第６図では、特徴抽出処理につい
てのフローは通常のものを用いるので省略している）、
そして、段階５４でマツチング結果がリジェクトである
か否かの判断を行い、その結果がリジェクトならば（Ｙ
ＥＳ）、段階５５でテンプレートを変更し、リジェクト
でなければ（Ｎｏ）、段階５６で前記マツチング結果を
出力して段階５１に戻す。The character pattern cut out in this step 53 is subjected to feature extraction and matching processing (in FIG. 6, the flow for feature extraction processing is omitted because a normal flow is used).
Then, in step 54, it is determined whether the matching result is a reject, and if the result is a reject (Y
ES), the template is changed in step 55, and if it is not rejected (No), the matching result is output in step 56 and the process returns to step 51.

段階５５でテンプレートを変更した結果、段階５７で変
更するテンプレートがないか否かを判断し、変更するテ
ンプレートがなければ（ＹＥＳ）。As a result of changing the template in step 55, it is determined in step 57 whether or not there is any template to be changed, and if there is no template to be changed (YES).

段階５８でリジェクトコードを出力して段階５１に戻す
、また、変更するテンプレートがあれば（Ｎｏ）、段階
５３に戻す、このように、フォントを変更してみて、マ
ツチングするまで、あるいは最大マツチングするまで候
補テンプレートをサーチしていた。Output a reject code in step 58 and return to step 51. If there is a template to be changed (No), return to step 53. Try changing the font in this way until matching is achieved or maximum matching is achieved. I was searching for candidate templates until now.

しかしながら、印刷物のように文字種が多くなると、全
てのフォント（ｆｏｎｔ）を用意しなければならないの
で、テンプレート（辞書）が膨大になる。However, when there are many types of characters as in printed matter, all the fonts must be prepared, so the template (dictionary) becomes enormous.

そのために１文字を認識するのに時間がかかり過ぎると
いう問題があった。Therefore, there was a problem in that it took too much time to recognize one character.

［目的］本発明の目的は、多様な文字種を含む文章の認識を少な
いテンプレート（辞書）で行うことができる技術を提供
することにある。[Objective] An object of the present invention is to provide a technology that can recognize sentences including various character types using a small number of templates (dictionaries).

本発明の他の目的は、多様な文字種を含む文章の認識を
迅速に行うことができる技術を提供することにある。Another object of the present invention is to provide a technique that can quickly recognize sentences containing various character types.

本発明の前記ならびにその他の目的と新規な特徴は１本
明細書の以下の記述及び添付図面によって説明する。The above and other objects and novel features of the present invention will be explained by the following description of the specification and the accompanying drawings.

［構成］本発明は、多様なフォントに対応するフォント別テンプ
レート及び利用頻度の少ない文字パターンをも含む汎用
テンプレートを用いて活字文字を認識する活字文字認識
方法であって、認識すべく文字の高さを検出する段階と
、該検出された文字の高さを前記各辞書の文字パターン
の高さに修正する段階と、リジェクトが発生する毎に、
異なるフォント別テンプレートを参照し、それでもリジ
ェクトの場合、汎用テンプレートを利用して文字を認識
する段階とを具備したことを特徴とするものである。[Structure] The present invention is a printed character recognition method for recognizing printed characters using font-specific templates corresponding to various fonts and general-purpose templates including character patterns that are rarely used. a step of detecting the height of the detected character, a step of correcting the height of the detected character to the height of the character pattern of each of the dictionaries, and each time a rejection occurs,
The present invention is characterized by the step of referring to templates for different fonts and, if the template is still rejected, recognizing the character using a general-purpose template.

［実施例］以下、本発明の一実施例を図面を用いて具体的に説明す
る。[Example] Hereinafter, an example of the present invention will be specifically described using the drawings.

なお、実施例を説明するための全回において、同一機能
を有するものは同一符号を付け、その繰り返しの説明は
省略する。Note that throughout the description of the embodiments, parts having the same functions are given the same reference numerals, and repeated explanations thereof will be omitted.

第１図は、本発明の一実施例の活字文字認識方法に係る
ＯＣＲ入力装置の概略構成を示すブロック図、第２図は、第１図に示すＯＣＲ入力装置を用いた多機能
活字文字情報処理装置の概略構成を示すブロック図、第３図は、本発明の一実施例の活字文字認識方法のフロ
ーチャートである。FIG. 1 is a block diagram showing a schematic configuration of an OCR input device according to a printed character recognition method according to an embodiment of the present invention, and FIG. 2 shows multifunctional printed character information using the OCR input device shown in FIG. FIG. 3 is a block diagram showing a schematic configuration of the processing device. FIG. 3 is a flowchart of a printed character recognition method according to an embodiment of the present invention.

本実施例の活字文字認識方法を適用した活字文字情報処
理装置において、第２図に示すように。In the printed character information processing apparatus to which the printed character recognition method of this embodiment is applied, as shown in FIG.

キーボード１は１文字を入力する他に各種のモード（仮
名漢字変換、漢字仮名変換、ＯＣＲ文字認識、英文等）
を指定するものに用いる。ＯＣＲ入力装置２は、原稿を
光学的に読取り入力する。処理装置３は、キーボード１
やＯＣＲ入力装置２からの入力情報について、指定され
たモードに従った処理を実行し、出力袋Ｗ１４に出力す
る。出力装置４は、ディスプレイ装置、プリンタ等を総
称して示したものである。処理装置３の処理に必要なプ
ログラムメモリ（ＲＯＭ）５に格納されるが、キーボー
ド入力による仮名漢字変換、ＯＣＲ文字認識の後処理、
ＯＣＲ入力された文字列の仮名漢字変換や漢字仮名変換
についてできるだけ共通のアルゴリズムが利用される。In addition to inputting a single character, keyboard 1 can also be used in various modes (kana-kanji conversion, kanji-kana conversion, OCR character recognition, English text, etc.)
Used to specify. The OCR input device 2 optically reads and inputs a document. The processing device 3 includes a keyboard 1
The input information from the OCR input device 2 is processed according to the specified mode, and outputted to the output bag W14. The output device 4 is a general term for a display device, a printer, etc. It is stored in the program memory (ROM) 5 necessary for the processing of the processing device 3, and includes post-processing such as kana-kanji conversion by keyboard input, OCR character recognition,
A common algorithm is used as much as possible for the kana-kanji and kanji-kana conversions of character strings input by OCR.

データメモリ（ＲＡＭ）６は、処理装置３での処理途中
のデータやパラメータを格納するのに用いられる。単語
辞書メモリ７には読み表記対応データを付加した単語辞
書が格納されている。A data memory (RAM) 6 is used to store data and parameters that are being processed by the processing device 3. The word dictionary memory 7 stores a word dictionary to which reading orthography correspondence data is added.

前記第２図に示すＯＣＲ入力装置２は、第１図に示す°
ように、光源と電荷結合素子（ＣＣＤ）等からなる光学
的スキャナー１１により、原稿上の文字等の画像情報を
読み取って入力する。この入力された仮名文字列又は仮
名漢字混合文字列、英字列等の画像情報を１文字切出し
ユニット１２により、第５図に示すように、１文字毎に
切出され。The OCR input device 2 shown in FIG. 2 is the OCR input device 2 shown in FIG.
Image information such as characters on a document is read and input using an optical scanner 11 comprising a light source, a charge-coupled device (CCD), and the like. The input image information such as a kana character string, a kana-kanji mixed character string, or an alphabetic character string is cut out character by character by a character cutting unit 12, as shown in FIG.

特徴抽出ユニット１３でその切出された文字の特徴を抽
出する。この抽出されたデータは、特徴マツチングユニ
ット１４でテンプレートメモリ（ＲＯＭ又はＲＡＭ）１
５に格納されている特徴辞書データ（フォント別テンプ
レート、汎用テンプレート）とのマツチングがとられる
。マツチングがとれれば、入力文字が認識され処理装置
３に送られる。マツチングがリジェクトとなった場合に
は、特徴マツチングユニット１４からリジェクト信号が
発生して前記文字切出しユニット１２に送られる。A feature extraction unit 13 extracts the features of the extracted characters. This extracted data is stored in a template memory (ROM or RAM) 1 in a feature matching unit 14.
Matching is performed with the feature dictionary data (font-specific templates, general-purpose templates) stored in 5. If matching is achieved, the input characters are recognized and sent to the processing device 3. If the matching is rejected, a reject signal is generated from the feature matching unit 14 and sent to the character cutting unit 12.

次に、本実施例のＯＣＲ入力装置用文字認識方法の処理
プロセスを、第３図に示すフローチャートに従って説明
する。Next, the processing process of the character recognition method for an OCR input device of this embodiment will be explained according to the flowchart shown in FIG.

まず、第４図に示すような文字列を認識するとき１段階
１０１である幅以上の空白部を検出して■の範囲の複数
の文字パターンを切り出し、次に個々の文字パターンに
分割する。この文字切出し方法は、投影法が一般的であ
る。前記■の範囲の文字列パターンにおいて、ＨＯ（最
頻の文字の高さ）＝３０．Ｈ，（次の頻度の文字の高さ
）＝２２．Ｐ。First, when recognizing a character string as shown in FIG. 4, a blank space having a width equal to or greater than the first stage 101 is detected, a plurality of character patterns within the range of ■ are cut out, and then divided into individual character patterns. A projection method is generally used as this character extraction method. In the character string pattern in the range of ■, HO (height of the most frequent character) = 30. H, (height of letters with next frequency)=22. P.

（最頻の文字の幅）＝１９．Ｐ、（次の頻度の文字の幅
）＝１５が抽出されたとする。ここで数値の単位は１画
素数である。(Width of most frequent character)=19. Suppose that P, (width of the character with the next frequency)=15 is extracted. Here, the unit of numerical value is one pixel.

次に１段階１０２で文字切出しが終了したかを判断し、
終了していれば処理終了（ＥＮＤ）となる。Next, in step 1 102, it is determined whether character cutting is completed,
If the process has ended, the process ends (END).

文字切出しが終了していなければ１段階１０３に移る。If character segmentation has not been completed, the process moves to step 1 103.

この段階１０３で最初のパターン大文字Ｔが特徴抽出・
マツチング処理される（第３図では、特徴抽出を省略し
である）、このとき、マツチングに必要な辞書は１文字
の高さＨ，、Ｈｌ、文字の幅（ピッチ）Ｐ、、Ｐ、であ
り、これらと抽出された情報）ｌｏｗ　ｈ＞＊　ｐａｙ
　Ｐｉとをそれぞれ比較し、抽出された情報）ｌｅｅ　
）ｌｔｔ　Ｐａ、Ｐｌが前記辞書のＨｏ。At this stage 103, the first pattern capital letter T is used for feature extraction.
Matching processing is performed (feature extraction is omitted in Figure 3).At this time, the dictionary required for matching is the height of one character, H,, Hl, and the width (pitch) of character, P,, P. Yes, these and extracted information) low h>* pay
Pi and the extracted information) lee
) ltt Pa, Pl is Ho in the dictionary.

Ｈ，、Ｐ、、　Ｐｌの範囲内であれば、以後のマツチン
グを実行する。もし、範囲外であれば１段階１０４に移
る０段階１０４でマツチング結果がリジェクトであるか
否かの判断を行い、その結果がリジェクト（ＹＥＳ）な
らば、段階１０５でＨ，Ｐを含む辞書内で次候補に変更
し、リジェクトでなければ（ＮＯ）１段階１０６で前記
マツチング結果を出力して段階１０７に移る。If it is within the range of H, , P, , Pl, subsequent matching is executed. If it is outside the range, proceed to step 104. At step 104, it is determined whether or not the matching result is a reject. If the result is reject (YES), at step 105, if the matching result is in the dictionary containing H and P. If the candidate is not rejected (NO), the matching result is output in step 106 and the process moves to step 107.

前記マツチング後の処理は、従来法を用いる。The post-matching process uses a conventional method.

ここで、前記テンプレートメモリ１５に格納されている
フォント別テンプレート及び汎用テンプレートは、第５
図に示すように、まず、辞書識別コード２１が配置され
、次にこの辞書の文字の高さＨ，、Ｈ，２２が配置され
る。その次にこの辞書の文字の幅（ピッチ）ｐ、、ｐ工
２３が配置され１次にこの辞書の特徴パラメータが配置
されている。Here, the font-specific templates and general-purpose templates stored in the template memory 15 are the fifth
As shown in the figure, the dictionary identification code 21 is placed first, and then the heights H, , H, 22 of the characters of this dictionary are placed. Next, the character width (pitch) p, .

汎用テンプレートは、フォントコード及び文字の高さＨ
，文字の幅Ｐの部分には、０コードが入っているものと
する。The general template has font code and character height H
, it is assumed that a 0 code is included in the width P portion of the character.

同様にして、第４図に示す■の部分を認識し。In the same way, the part marked ■ shown in Fig. 4 is recognized.

次に■の部分を認識する。■の部分のうち、最初の６パ
ターンは、上述の方法でよいが、第７，８パターンでは
マツチングに失敗し、前記フローチャートの段階１０４
でリジェクトが発生する可能性がある。このとき、前記
段階１０５で変更する次候補がない場合には、段階１０
８に移り１次候補に変更する辞書がないかの判断を行い
、変更する辞書がなければ１段階１０９に移動し、抽出
された情報ｈａｓ　ｈ１＋　Ｐａｔ　ｐｔを利用して、
これを含むテンプレートを順次選定しＣｈＥＥＨ，ｐＣ
Ｐ）。Next, recognize the part marked ■. The above-mentioned method may be used for the first six patterns in the part (3), but matching fails in the seventh and eighth patterns, and step 104 of the flowchart
Rejection may occur. At this time, if there is no next candidate to be changed in step 105, step 10
8, it is determined whether there is a dictionary to be changed to the primary candidate, and if there is no dictionary to be changed, the process moves to step 109, and using the extracted information has h1+ Pat pt,
Templates containing this are sequentially selected and ChEEH, pC
P).

適当なフォントが見つかるまで、これを繰り返す。Repeat this until you find a suitable font.

利用頻度の高いフォントは５通常辞書化（テンプレート
）されているから、見出されたフォントが認識される。Frequently used fonts are usually converted into dictionaries (templates), so the found fonts can be recognized.

第４図に示す■の部分のような例外的な文字パターンが
あるときは、文字パターンを正規化した（例えば汎用テ
ンプレートと同じ高さ。When there is an exceptional character pattern, such as the part marked ■ in Figure 4, the character pattern is normalized (for example, the same height as the general template).

幅の統一したサイズに変換する）後、フォントコード０
のテンプレートによって、段階１１０で汎用テンプレー
ト（正規化イメージの辞書）とマツチングを°試みる。After converting to a size with a uniform width), the font code is 0.
At step 110, matching with the general template (dictionary of normalized images) is attempted using the template.

そのマツチング結果について段階１１１でリジェクトで
あるか否かを判断し、リジェクトでなけ九ば（Ｎ　Ｏ）
、段ＦＩ１１０７に移り、すジェクトであれば（ＹＥＳ
）、段階１１２に移り。It is determined whether or not the matching result is rejected in step 111, and if it is not rejected, the result is 9 (NO).
, move to stage FI1107, and if it is a project (YES
), proceed to step 112.

リジェクトコードを出力し１段階１０７に移る。A reject code is output and the process moves to step 1 107.

段階１０７で画像情報パターンから次の注目画像パター
ンの切出しを行い１段階１１３に移る１段階１１３で次
の注目画像パターンがないか否かの判断を行い、次の注
目画像パターンがあれば（ＮＯ）、段階１０４に戻し１
次の注目画像パターンがなければ（Ｎｏ）、段階１０１
に戻す６本実施例のＯＣＲ入力装置用文字認識方法の処
理プロセスを終了する。In step 107, the next image pattern of interest is cut out from the image information pattern, and the process proceeds to step 113.In step 113, it is determined whether or not there is the next image pattern of interest. ), return to step 104 1
If there is no next image pattern of interest (No), step 101
6. The process of the character recognition method for the OCR input device of this embodiment is ended.

以上、本発明を実施例にもとすき具体的に説明したが１
本発明は、前記実施例に限定されるものではなく、その
要旨を逸脱しない範囲において種々変更可能であること
は言うまでもない。The present invention has been specifically explained above using examples.
It goes without saying that the present invention is not limited to the embodiments described above, and can be modified in various ways without departing from the spirit thereof.

〔Effect of the invention〕

以上、説明したように、本発明によれば、認識すべく文
字の高さを検出する段階と、該検出された文字の高さを
前記各辞書の文字パターンの高さに修正する段階と、リ
ジェクトが発生する毎に。As described above, according to the present invention, there are a step of detecting the height of a character to be recognized, a step of correcting the height of the detected character to the height of the character pattern of each dictionary, Every time a rejection occurs.

異・なるフォント別テンプレートを参照し、それでもリ
ジェクトの場合、汎用テンプレートを利用して文字を認
識する段階とを備えたので、多様な文字種を含む文章の
認識を少ないテンプレートで行うことができ、かつ多様
な文字種を含む文章の認識を迅速に行うことができる。The system includes a step of referring to templates for different fonts and recognizing characters using a general-purpose template if the template is still rejected, so that sentences containing a variety of character types can be recognized with a small number of templates. It is possible to quickly recognize sentences containing a variety of character types.

[Brief explanation of the drawing]

第１図は５本発明の一実施例の活字文字認識方法に係る
ＯＣＲ入力装置の概略構成を示すブロック図。第２図は、第１図に示すＯＣＲ入力装置を用いた多機能
活字文字情報処理装置の概略構成を示すブロック図。第３図は１本発明の一実施例の活字文字認識方法のフロ
ーチャート、第４図は、切出された複数文字のパターン例を示す図。第５図は、本実施例に係るテンプレートメモリの内容・
を示す図、第６図は、従来のＯＣＲ入力装置用文字認識方法を説明
するためのフローチャートである。図中、３・・・処理装置、１１・・・スキャナー・　１
２・・・文字切出しユニット、１３・・・特徴抽出ユニ
ット・１４・・・特徴マツチングユニット、１５・・・
テンプレートメモリである。FIG. 1 is a block diagram showing a schematic configuration of an OCR input device according to a printed character recognition method according to an embodiment of the present invention. FIG. 2 is a block diagram showing a schematic configuration of a multifunctional printed character information processing device using the OCR input device shown in FIG. 1. FIG. 3 is a flowchart of a printed character recognition method according to an embodiment of the present invention, and FIG. 4 is a diagram showing an example of a pattern of a plurality of cut out characters. FIG. 5 shows the contents of the template memory according to this embodiment.
FIG. 6 is a flowchart for explaining a conventional character recognition method for an OCR input device. In the figure, 3...processing device, 11...scanner 1
2... Character extraction unit, 13... Feature extraction unit, 14... Feature matching unit, 15...
This is template memory.

Claims

[Claims]

(1) A printed character recognition method that recognizes printed characters using font-specific templates that support a variety of fonts and general-purpose templates that include character patterns that are less frequently used, and which detects the height of characters to be recognized. a step of correcting the height of the detected character to the height of the character pattern of each dictionary, and a step of referencing a different template for each font each time a rejection occurs, and if a rejection still occurs, a general template is used. A printed character recognition method characterized by comprising a step of recognizing characters using the method.

(2) The printed character recognition method according to claim 1, wherein each dictionary has a typical character size (height, width) in addition to its identification code.