JPH0516632B2

JPH0516632B2 -

Info

Publication number: JPH0516632B2
Application number: JP59148725A
Authority: JP
Inventors: Makoto Kato; Hirohisa Sone; Hiroyasu Takahashi
Original assignee: International Business Machines Corp
Current assignee: International Business Machines Corp
Priority date: 1984-07-19
Filing date: 1984-07-19
Publication date: 1993-03-04
Also published as: JPS6132187A

Description

[Detailed description of the invention]

［産業上の利用分野］本発明は文字認識方式に関し、更に詳細にいえ
ば、促音、拗音を含む手書きを文字を複数のタイ
プに類別して正規化するようにした文字認識方式
に関する。［従来技術］最近は、OA化の波の中で、日本語情報処理シ
ステムが種々開発されているが、日本語情報処理
システムで大きなネツクになつているのはデータ
入力の部分である。これまで、日本語入力方式と
して、タブレツト方式、キータツチ方式、キーボ
ードを用いた仮名−漢字変換方式などに提案さ
れ、ある程度成功をおさめているが、一般に入力
速度の点で問題があり、例えば統計調査などの大
量データの入力には不向きである。従つて、
OCRによる文字認識、特に手書き文字認識は日
本語データ入力方式の一翼を担うものとして大い
に期待されている。特に今後は、OCR装置をOA機器と組合わせ、
手書き文字をOCRで読取つて表示し編集し処理
しプリントするアプリケーシヨンが盛んになると
考えられるが、このようなアプリケーシヨンでは
大文字と促拗音とを明確に区別し認識処理できる
機能を持つことが不可欠であると考えられる。しかしながら従来の英数字・仮名用のOCRで
は、通常、仮名の大文字と小文字（促拗音）の区
別がなく、同じ字形の大文字と小文字とを同一の
文字として処理している。また、これまでの手書
き漢字OCRは住所、氏名などを読取るアプリケ
ーシヨンが主体であつたため、促拗音の認識につ
いてはあまり考慮されていなかつた。大文字と小文字を区別するための１つの方法と
しては、入力文字パターンの大きさ、位置を判定
し、この情報を１つの特徴データとして利用して
分類識別を行なう方法が考えられるが、この方法
は文字の大きさ及び位置を付加的な特徴データと
して用いなければならないため、認識処理が複雑
になり、ハードウエア化しにくく、また認識時間
も長くなりやすいという問題がある。また一般に、OCRによる手書き文字の認識は、
文字読取り、前処理、特徴抽出、分類識別という
手順で行なわれ、前処理では入力文字パターンの
大きさの正規化を行なうが、大文字、小文字を意
識しない従来のOCRではすべての入力文字パタ
ーンを画一的に正規化するのが普通である。画一
的正規化を用いた場合は、もともと小さな寸法を
有する小文字あるいはピリオド、コンマなどの記
号を不必要に拡大するため、無駄な処理を含むだ
けでなく、線の微小な凹凸など無用な特性を拡大
し、また認識対象としての文字パターンのデータ
量を不必要に増やすことになるため、特徴抽出、
分類識別でのデータ処理量が増えるという問題が
ある。特開昭55−10624号公報は大文字英字、数字、
特殊記号の識別を行なう文字識別処理方法を示し
ている。この処理方法では、先ず第１ステツプで
ピリオド、コンマなどの背丈の低い特殊記号を分
離してそのまま識別を行ない、残つたものについ
て高さの正規化を行なつて、大分類、識別を行な
い、次に、更に残つたものについて幅の正規化を
行なつて更に大分類、識別を行なうようにしてい
る。この特開昭は文字の高さと幅の正規化を区別
して用いているが、これは前の識別結果に基いて
段階的に一定の大きさに正規化するものであつ
て、本発明のように入力文字パターンの大きさ、
位置に基づいて異なる大きさ、位置に正規化する
ことは示しておらず、また大文字、促拗音の識別
についても示していない。［発明が解決しようとする問題点］したがつて本発明の目的は、認識処理を複雑に
することなく、簡単に且つ高速に、大文字と小文
字（特に促拗音）を区別して認識できるようにす
ることである。［問題点を解決するための手段］本発明は、前処理の正規化の段階で、入力文字
の大きさ及び位置の要素に基いて入力文字を複数
のタイプに類別し正規化するものである。即ち、
読取つた２値化入力文字パターンの大きさと位置
に基いて入力文字を通常文字、促拗音などの複数
のタイプに類別し、各タイプ毎に、大きさ及び位
置の少なくとも一方を異ならせて正規化イメージ
領域に正規化し、そしてこの正規化イメージ領域
全体を対象として特徴抽出するようにしたもので
ある。したがつて本発明によれば、入力文字パタ
ーンの大きさと位置のパラメータは正規化された
文字パターンそのものの中に内在することになる
ため、以後の特徴抽出、分類識別の処理では、
入力文字の大きさ、位置のパラメータを考慮する
必要がなくなる。したがつてすべての文字カテゴ
リーに対して統一的認識アルゴリズムを使用で
き、特徴抽出、分類識別処理が簡単になり、認識
速度を高めることができる。また、すべての文字
カテゴリーに対して同一の認識処理の使用が可能
となるため、文字認識システムのハードウエア化
が容易であり、一層認識速度を高めることが可能
となる。［実施例］次に図面を参照して本発明の良好な実施例につ
いて説明する。第１図は本発明による文字認識方式の機能的ブ
ロツク図である。先ず、文字読取り部では、帳票
に手書きされた文字がOCRスキヤナによつて読
取られる。この例では、帳票は10mm×10mmの文字
枠のものが用いられた。読取られた２値化文字パ
ターンデータは一旦メモリにドツト・パターンと
して入れられ、文字切出し部で文字枠部分が切出
される。OCRスキヤナの解像度を８ドツト／mm
とすると、切出された文字枠の大きさは80×80ド
ツトとなる。外周検出部では、第２図に示すよう
に、切出した入力文字パターンに外接する長方形
のＸ、Ｙ座標の最小値と最大値（Xmin、Xmax、
Ymin、Ymax）が検出される。切出した文字枠
の左上の角の座標が（０、０）にされている。上
記の文字読取り、文字切出し、外周検出は周知の
一般的処理であるから、これらについての詳しい
説明は省略することにする。第１図は正規化分類部及び正規化部が本発明に
よつて改良された部分である。正規化分類部は外周検出部によつて得られた外
接長方形のXmax、Xmin、Ymax、Yminに基づ
いて以下の如くＸ、Ｙ方向の幅及びＸ、Ｙ方向の
重心を求める。Ｘ方向幅ΔX＝Xmax−Xmin＋１Ｙ方向幅ΔY＝Ymax−Ymin＋１Ｘ方向重心Gx＝１／２（Xmax＋Xmin）Ｙ方向重心Gy＝１／２（Ymax＋Ymin）そして正規化分類部は、求められた入力文字パ
ターンの大きさ及び位置に基づいて入力文字を複
数のタイプに類別する。下表１は、読取りスキヤ
ナの解像度を８ドツト／mm、文字枠を80×80ドツ
トとしたときの分類の一例を示している。 [Industrial Field of Application] The present invention relates to a character recognition system, and more specifically, to a character recognition system that classifies and normalizes handwritten characters including consonants and consonants into a plurality of types. [Prior Art] Recently, with the wave of OA, various Japanese information processing systems have been developed, but the major bottleneck in Japanese information processing systems is data input. Until now, various Japanese input methods have been proposed, such as the tablet method, key touch method, and kana-to-kanji conversion method using a keyboard, and have achieved some degree of success. It is not suitable for inputting large amounts of data such as Therefore,
Character recognition using OCR, especially handwritten character recognition, is highly expected to play a role in Japanese data input methods. In particular, in the future, OCR devices will be combined with OA equipment,
Applications that use OCR to read handwritten characters, display them, edit them, process them, and print them are likely to become popular, but it is essential for such applications to have the ability to clearly distinguish between capital letters and accents, and to recognize and process them. It is thought that. However, conventional OCR for alphanumeric characters and kana usually does not distinguish between uppercase and lowercase letters (press sounds) of kana, and processes uppercase and lowercase letters of the same shape as the same character. Furthermore, since the handwritten kanji OCR to date has mainly been used for applications such as reading addresses and names, little consideration has been given to the recognition of accentuated sounds. One possible method for distinguishing between uppercase and lowercase letters is to determine the size and position of the input character pattern and use this information as feature data for classification and identification. Since the size and position of the characters must be used as additional feature data, there are problems in that the recognition process becomes complicated, difficult to implement in hardware, and the recognition time tends to be long. In general, handwritten character recognition using OCR is
It is performed through the steps of character reading, preprocessing, feature extraction, and classification identification.The preprocessing normalizes the size of the input character pattern, but in conventional OCR, which does not consider uppercase or lowercase letters, all input character patterns are It is common to normalize all at once. When uniform normalization is used, lowercase letters or symbols such as periods and commas that originally have small dimensions are unnecessarily enlarged, which not only involves unnecessary processing but also eliminates unnecessary characteristics such as minute irregularities in lines. , and unnecessarily increases the amount of character pattern data to be recognized.
There is a problem that the amount of data processing for classification identification increases. JP-A No. 55-10624 uses uppercase letters, numbers,
A character identification processing method for identifying special symbols is shown. In this processing method, first, in the first step, short special symbols such as periods and commas are separated and identified as they are, and the remaining symbols are normalized in height and then roughly classified and identified. Next, the width of the remaining items is normalized for further classification and identification. This patent application uses the normalization of character height and width separately, but this is normalization to a constant size in stages based on the previous identification result, and unlike the present invention, the size of the input character pattern,
It does not show normalization to different sizes and positions based on position, nor does it show the identification of capital letters and accents. [Problems to be Solved by the Invention] Therefore, an object of the present invention is to easily and quickly distinguish between uppercase and lowercase letters (especially accents) and to recognize them without complicating the recognition process. That's true. [Means for Solving the Problems] The present invention classifies and normalizes input characters into a plurality of types based on the size and position of the input characters at the normalization stage of preprocessing. . That is,
Based on the size and position of the read binary input character pattern, the input characters are classified into multiple types such as regular characters and accent sounds, and normalized by making at least one of the size and position different for each type. The image area is normalized, and features are extracted from the entire normalized image area. Therefore, according to the present invention, the size and position parameters of the input character pattern are inherent in the normalized character pattern itself, so in the subsequent feature extraction, classification, and identification processes,
There is no need to consider parameters such as the size and position of input characters. Therefore, a unified recognition algorithm can be used for all character categories, simplifying feature extraction and classification/identification processing, and increasing recognition speed. Furthermore, since the same recognition process can be used for all character categories, it is easy to implement the character recognition system in hardware, and the recognition speed can be further increased. [Example] Next, a preferred example of the present invention will be described with reference to the drawings. FIG. 1 is a functional block diagram of a character recognition system according to the present invention. First, in the character reading section, characters handwritten on a form are read by an OCR scanner. In this example, the form used had a 10mm x 10mm character frame. The read binary character pattern data is temporarily stored in a memory as a dot pattern, and a character frame portion is cut out by a character cutting section. Increase OCR scanner resolution to 8 dots/mm
Then, the size of the cut out character frame will be 80 x 80 dots. The outer circumference detection section detects the minimum and maximum values (Xmin, Xmax,
Ymin, Ymax) are detected. The coordinates of the upper left corner of the cut out character frame are set to (0, 0). Since the above-mentioned character reading, character cutting, and outer circumference detection are well-known general processes, a detailed explanation thereof will be omitted. FIG. 1 shows the normalization classification section and normalization section improved by the present invention. The normalization classification section calculates the width in the X and Y directions and the center of gravity in the X and Y directions as follows based on the Xmax, Xmin, Ymax, and Ymin of the circumscribed rectangle obtained by the perimeter detection section. X-direction width ΔX = Xmax-Xmin+1 Y-direction width ΔY = Ymax-Ymin+1 X-direction center of gravity Gx = 1/2 (Xmax + Xmin) Y-direction center of gravity Gy = 1/2 (Ymax + Ymin) Classifying input characters into multiple types based on pattern size and position. Table 1 below shows an example of classification when the resolution of the reading scanner is 8 dots/mm and the character frame is 80 x 80 dots.

【表】表１において、重心Gxの欄の「左部」は文字
枠をＸ方向に３等分したときの左部を表わし、重
心Gyの欄の「上部」、「中部」、「下部」は文字枠
をＹ方向に３等分したときの、上部、中部、下部
を表わす。また、空白の欄は“don′t care”であ
る。ΔX、ΔY、Gx、Gyの４つの条件（空白の欄
は除く）が満たされた場合、それに対応するタイ
プ番号が選択される。ここで、上記10個のタイプの意味について説明
する。この例では、JIS C6235日本語入力用文字
盤配列に含まれる漢字、仮名、促拗音、英字大文
字（小文字除く）、数字、及び特殊文字の一部を
基本の認識対象としているが、任意の選択した文
字あるいは記号の組合わせを使用しその位置、大
きさに基いて種々タイプ分けできることは理解さ
れよう。タイプ１は文字枠の上部に小さく書かれ
る単一引用符「‘」、「’」などの特殊文字用、タ
イプ２は枠の中央部に小さく書かれる中点「・」
などの特殊文字用、タイプ３は枠の下部に小さく
書かれるピリオド「．」などの特殊文字用である。
タイプ４は枠の上部にやや大きく書かれる２重引
用符「“」、「”」などの特殊文字用、タイプ５は枠
の中央部に書かれる漢字の繰り返し記号「〃」
用、タイプ６は促拗音、句読点などのためのもの
である。タイプ７は漢字の「」、感嘆符「！」、
コロン「：」、数字の「１」など縦長の文字のた
めのものであり、タイプ８は漢字のイチ「一」、
負符号「−」などのためのものであり、タイプ９
はアンダライン「」用である。タイプ10はその
他の漢字、英字大文字、仮名大文字、数字、及び
「％」、「￥」などの大きめの特殊文字用である。
タイプ５はタイプ４、６よりも小さな寸法にされ
ているが、これは、文字枠の中央部に記入した文
字は多少小さめでも通常文字である可能性が高
く、これらの通常文字がタイプ５の繰り返し記号
「〃」として分類されるのを防止するためである。例えば数字「１」のようにタイプ７及びタイプ
10の両方に該当する場合が生じうるが、この場合
はプライオリテイ・エンコーダによつてタイプ番
号の若い方を優先させる。また、特殊文字及び促拗音では、大きさと文字
枠内の位置が重要な要素となる。したがつて、帳
票記入に当つては、記入者に対し記入要領のガイ
ドを与えておくのが望ましいが、このタイプ分け
の場合は、「促拗音、句読点は文字枠の左下隅に
１／４程度の大きさで記す」及び「特殊文字はその
種類に応じて文字枠内の上部、中部、下部に区別
して記す」程度のガイドを与えれば十分である。第１図の正規化部は、正規化分類部において判
定された入力文字のタイプに応じて入力文字を、
タイプ毎に大きさ及び位置の少なくとも一方を異
ならせて所定の正規化イメージ領域に正規化す
る。この例では、切出された80×80ビツトの文字
枠内の２値化文字パターンを64×64ビツトの正規
化イメージ領域に正規化するものとしている。第
３図は、各タイプの文字が64×64ビツトの正規化
イメージ領域にどのように正規化されるかを例示
している。斜線の部分は正規化されたイメージが
書込まれる領域を示し、その回りの白地の部分は
白イメージ領域を示している。タイプ１〜６は小
文字サイズの文字を対象としており、タイプ１〜
６では正規化サイズ30が用いられ、タイプ７〜10
では正規化サイズ60が用いられる。但し、細長い
文字パターンのタイプ１、２、３、７、８、９で
は、これらの文字パターンを30×30または60×60
に正規化すると、細長いという形状上の特性が失
われ、特に正規化後細線化する場合は、長さ方向
の線縁の微小凹凸が拡大によつて強調され、本来
の特徴を歪めることになるので、タイプ１、２、
３、７ではΔXの値をそのまま用い、タイプ８、
９ではΔYの値をそのまま用いて正規化してい
る。上記のように正規化した後、正規化された文字
パターンは、特徴抽出部において、まわりの白地
を含めた64×64ビツトの正規化イメージ領域全体
を対象として特徴抽出される。次に、分類識別部
において、抽出された特徴と、同様に正規化され
た標準文字パターンについての予じめ用意した特
徴とのパターン・マツチングが行なわれ、入力文
字パターンの識別が行なわれる。特徴抽出の手法
としては、一定の正規化イメージ領域の中に正規
化した文字パターンの大きさ、位置の差が、この
正規化イメージ領域全体を対象として特徴抽出し
たときに特徴の差として反映するものであれば、
即ち、特徴が大きさ及び位置に依存するものであ
れば、任意のものを使用できる。従来公知のもの
としては、Ｘ方向、Ｙ方向または斜め方向から見
たときの位置別あるいは区画別の黒ドツトの数ま
たは分布状態を求める方法、細線化した文字パタ
ーンの方向別、区画別の水平、垂直、斜め方向の
黒ドツト連結数を求める方法、上下左右の各辺か
ら文字パターンまでの白ドツトの数（深さ）また
は一定の深さまでの白ドツトの総数（面積）を求
める方法などが知られているが、一例としては方
向別、区画別の黒ドツト連結数及び各辺に沿つた
白領域の面積を求める特開昭58−201184公報があ
る。これらの特徴抽出、分類識別の処理自体はパ
ターン・マツチング的な認識手法において一般的
なものであるので、詳しい説明は省略する。必要
があれば、上記特開昭を参照されたい。次に、第４図を参照して本発明による良好な正
規化機構について説明する。タイプ・セレクタ１
０は第１図の正規化分類部に対応し、その他の部
分は第１図の正規化部に対応する。タイプ・セレ
クタ１０は10mm×10mm（80×80ドツト）の文字枠
内の入力文字パターン外接長方形の座標値、
Ymin、Ymax、Xmin、Xmaxに基づいて、入力
文字のタイプ（表１のタイプ１〜タイプ10）を判
別し、各タイプに応じて正規化に必要な制御信号
を発生する。これらの制御信号は以後明らかとな
ろう。正規化機構は正規化ROM18、28、128×128ビ
ツトのイメージ・バツフア２２、及び正規化イメ
ージ領域となる64×64ビツトの正規化イメージ・
バツフア５２を有する。イメージ・バツフア２２
は切出された文字枠の入力文字パターンを含む。
入力文字パターンは、イメージ・バツフア２２の
左上の角の座標を（０、０）としたとき、文字枠
の左上の角を座標（０、０）に合わせて記憶され
ている。アドレツシングを容易にするためイメー
ジ・バツフア２２は128×128ビツトにされている
が、文字枠を含む大きさであれば任意の大きさで
よい。この正規化機構の目的は、入力文字の判別され
たタイプに応じて入力文字パターンを第３図に示
す所定の大きさ及び位置に正規化して正規化イメ
ージ・バツフア５２に記憶することである。正規
化ROM１８，２８はそれぞれ２つの正規化用マ
トリクスＡ，Ｂを有し、マトリクスＡは正規化サ
イズ30用、マトリクスＢは正規化サイズ60用であ
る。マトリクスＡ，Ｂは判別されたタイプに応じ
てタイプ・セレクタ１０によつて選択される。正
規化ROM１８，２８の働きは、イメージ・バツ
フア２２の文字パターンを縮小／拡大により正規
化サイズ30または60に正規化してバツフア５２に
記憶する場合、文字パターンの大きさに応じて、
文字パターンのどのドツトを正規化バツフア５２
に書込むかを指示するアドレスを発生することで
ある。縮小の場合は間引いて読取られ、拡大の場
合は選択された同じドツトが反復して読取られ、
あるいは場合によつては拡大／縮小なしにそのま
まイメージ・バツフア２２から出力される。そのために、YminとYmax（それぞれ７ビツ
ト）は減算器１２に与えられ、その出力（ΔY−
１）はマルチプレクサMPX１４のＡ入力を介し
て、ROM１８の行位置（Ｙ位置）を選択する上
位アドレス(H)に与えられる。ROM１８の列位置
（Ｘ位置）を選択する下位アドレス(L)はアドレ
ス・カウンタ３０の上位ビツト2⁶〜2¹¹から与え
られる。MPX１４のＢ入力“59”は正規化サイ
ズ60の場合にタイプ・セレクタによつて選択さ
れ、これはイメージ・バツフア２２の文字パター
ンをＹ方向拡大比率＝１でそのまま出力する場合
に用いられる。正規化ROM１８の出力は文字パ
ターン外接長方形におけるＹアドレスを示すた
め、ROM１８の出力は加算器２０でYminと加
算され、イメージ・バツフア２２を実際にアドレ
スするためのＹアドレスに変換される。同様に、XminとXmax（それぞれ７ビツト）は
減算器２４に与えられ、その出力（ΔX−１）は
マルチプレクサMPX２６のＡ入力を介して、
ROM２８の行位置（Ｙ位置）を選択する上位ア
ドレス(H)に与えられる。ROM２８の列位置（Ｘ
位置）を選択する下位アドレス(L)はアドレス・カ
ウンタ３０の下位ビツト2⁰〜2⁵から与えられる。
MPX２６のＢ入力“59”は正規化サイズ60の場
合にイメージ・バツフア２２の文字パターンをＸ
方向拡大比率＝１でそのまま出力するときタイ
プ・セレクタ１０によつて選択される。MPX２
６のＣ入力“29”は正規化サイズ30の場合にイメ
ージ・バツフア２２の文字パターンをＸ方向拡大
比率＝１で出力するときタイプ・セレクタ１０に
よつて選択される。ROM２８の出力も外接長方
形におけるＸアドレスを示すため、加算器３２で
Xminと加算され、イメージ・バツフア２２を実
際にアドレスするためのＸアドレスに変換され
る。ここで、第５図を参照して正規化ROMマトリ
クスの構成について説明する。この例では、説明
を簡単にするため、文字枠の寸法を10×10ビツト
（第５図Ａ）、正規化サイズを４×４ビツト（第５
図Ｂ）としたときの正規化ROMマトリクス１８
または２８（第５図Ｃ）の構成を示している。正規化マトリクスの第ｋ行、第ｌ列の要素Ekl
の値は Ekl＝INT（ｋ×ｌ／Ｌ＋0.5）で求められるものである。ここで、Ｌはアドレ
ス・インデツクスのとりうる最大値（第５図Ｃで
は３）であり、INT（ｘ）はｘの整数部分であ
る。行位置の０〜９はΔX−１またはΔY−１に
対応するサイズ・インデツクスであり、正規化マ
トリツクスの上位アドレスに相当する。列位置の
０〜３は正規化マトリツクスの下位アドレスに相
当する。一般に、Ｘ方向に正規化するためのマトリクス
の行数は文字枠のＸ方向のビツト数（第５図の場
合は10）、列数は正規化サイズのＸ方向のビツト
数（第５図の場合は４）に等しく、Ｙ方向に正規
化するためのマトリクスの行数は文字枠のＹ方向
のビツト数（第５図の場合は10）、列数は正規化
サイズのＹ方向のビツト数（第５図の場合は４）
に等しい、従つて、第４図の実際例では正規化マ
トリクス１８Ａ，２８Ａの行数は80、列数は30に
され、正規化マトリクス１８Ｂ，２８Ｂの行数は
80、列数は60にされる。第５図の場合、文字枠及び正規化サイズのＸ、
Ｙ方向のビツト数は等しいから、第５図Ｃの正規
化マトリクスはＸ、Ｙ方向で共用できる。動作に
おいて、例えば、入力文字パターンのＸ方向の寸
法Xmax−Xmin＝ΔX−１（サイズ・インデツク
ス）が２の場合は、Ｘ正規化マトリクスの値０、
１、１、２が読取られ、したがつてＸアドレス＝
１が２回発生されて拡大される。サイズ・インデ
ツクス＝３では外接長方形のＸ方向の寸法＝Ｘ方
向の正規化サイズであり、拡大比率１に相当す
る。サイズ・インデツクス＝９の場合はＸアドレ
ス＝０、３、６、９が発生され、間引き読取りが
行なわれる。Ｙ方向正規化マトリクスも同様に動
作する。しかし正規化マトリクスは外接長方形の左上の
角のアドレスを（０、０）として文字パターンを
正規化サイズに変換するアドレスを発生するの
で、イメージ・バツフア２２をアクセスするため
の実際のＸ、ＹアドレスAx、Ayは次式によつて
与えられる。 Ax＝Xmin＋マトリクス（ΔX−１、ix） Ay＝Ymin＋マトリクス（ΔY−１、iy）ここで、マトリクス（ΔX−１、ix）、マトリ
クス（ΔY−１、iy）はそれぞれΔX−１、ΔY−
１をサイズ・インデツクス、ix、iyをアドレス・
インデツクスとして得られるマトリクス値であ
る。第４図の加算器２０，３２はこの加算を行な
う。したがつて、64×64ビツトの正規化バツフア５
２を用いる第４図の場合正規化ROM１８，２８
は4096回＝（＝64×64）アドレス信号を発生して
イメージ・バツフア２２を読取る。しかし、第３
図のように正規化して正規化バツフア５２に書込
むためには一工夫が必要である。比較器CMP４
２，４４及び加算器４６，４８はそのためのもの
である。先ず、第６図を参照して原理を説明する。正規
化バツフア５２のＸ、ＹアドレスAnx、Anyを次
式によつて与えるものとする。 Anx＝ixαx Any＝iyαy ここで、は２進数（第６図では２ビツト、第
４図では６ビツト）の加算で、オーバフローを無
視したものを示す。したがつて、例えばαx＝０、
αy＝０で第６図Ａのように書込まれるデータは、
αx＝１、αy＝２とした場合は第６図Ｂのように
（１、２）を始点としてラツプアラウンドして書
込むことができ、またix＞１、iy＞１で書込むデ
ータをマスクすれば第６図Ｃのように書込まれる
ことになる。即ち、αx、αyで正規化バツフア５
２の書込み始点を指定し、ix＞limx、iy＞limy
で書込みデータをマスクすればイメージ・バツフ
ア２２の読取り出力を正規化バツフア５２の任意
の位置に、任意の大きさでマスクして書込むこと
ができる。このときの限界値limx、limyをリミ
ツト・カウント値とする。加算器４６，４８及び比較器４２，４４は上記
の原理で正規化バツフア５２への書込みを制御す
るものである。加算器４６，４８は一方の入力と
して上記のiy、ixに相当するアドレスをアドレ
ス・カウンタ３０から受取り、他方の入力として
αy、αxに相当するアドレスをマルチプレクサ
MPX３８，４０から受取る。タイプ・セレクタ
１０はMPX３８，４０を制御し、判別されたタ
イプに応じて、対応する始点アドレスαx（２、
17、23または27）、αy（２、17、25、32または47）
を加算器４８，４６へゲートする。加算器４６は
正規化バツフア５２のＹアドレスを選択する上位
アドレス(H)を与え、加算器４８はＸアドレスを選
択する下位アドレス(L)を与える。したがつてイメ
ージ・バツフア２２から読取られたドツト・デー
タは始点（αx、αy）から書込まれる。比較器４２，４４は一方の入力としてiy、ixに
対応するアドレスをアドレス・カウンタ３０から
受取り、他方の入力としてリミツト・カウント値
をマルチプレクサMPX３４，３６から受取る。
タイプ・セレクタ１０は判別されたタイプに応じ
てリミツト・カウント値をMPX３４，３６へゲ
ートする。比較器CMP４２，４４はiy、ixがリ
ミツト・カウント値以下のときANDゲート５０
を付勢し、リミツト・カウント値を超えたとき
ANDゲート５０を禁止する。なお、タイプ・セ
レクタ１０の出力“２”、“17”、“23”、“25”、
“27”、“29”、“32”、“47”、“59”、“ΔX−１
”、
“ΔY−１”、はタイプ判別に応じてこれらの出力
が発生されたとき、MPX14、26、34、36、38、
40を制御して対応番号の入力を選択するものであ
る。下表２は各タイプ１〜10の正規化で用いられる
ROMマトリクス（正規化サイズ30用……Ａ、正
規化サイズ60用……Ｂ）、サイズ・インデツクス
（Ｘ、Ｙ）、リミツト・カウント値（Ｘ、Ｙ）、始
点（Ｘ、Ｙ）を示している。[Table] In Table 1, "left part" in the column for center of gravity Gx represents the left part when the character frame is divided into three equal parts in the X direction, and "upper", "middle", and "lower" in the column for center of gravity Gy represents the upper, middle, and lower parts when the character frame is divided into three equal parts in the Y direction. Also, a blank column means "don't care." If the four conditions ΔX, ΔY, Gx, and Gy (excluding blank columns) are satisfied, the corresponding type number is selected. Here, we will explain the meanings of the above 10 types. In this example, the basic recognition targets are kanji, kana, kanji, uppercase letters (excluding lowercase letters), numbers, and some special characters included in the JIS C6235 Japanese input dial layout. It will be appreciated that various types can be created based on location and size using combinations of letters or symbols. Type 1 is for special characters such as single quotation marks "'" and "'" written in small letters at the top of the text frame, and Type 2 is for special characters such as single quotation marks "'" and "'" written in small letters at the center of the frame.
Type 3 is for special characters such as a period "." written in small size at the bottom of the frame.
Type 4 is for special characters such as double quotation marks "", "'', which are written slightly larger at the top of the frame, and type 5 is for special characters such as double quotation marks "" and "'', which are written in the center of the frame.
Type 6 is for accents, punctuation marks, etc. Type 7 is the kanji "", exclamation mark "!",
Type 8 is for vertically long characters such as the colon ":" and the number "1", and type 8 is for the kanji ichi "ichi",
It is for negative sign "-" etc., type 9
is an underline ” is for. Type 10 is for other kanji, uppercase English letters, uppercase kana letters, numbers, and larger special characters such as "%" and "\".
Type 5 has smaller dimensions than Types 4 and 6, but this is because the characters written in the center of the character frame are likely to be regular characters even if they are a little smaller, and these regular characters are smaller than Type 5. This is to prevent it from being classified as a repeating symbol "〃". For example, type 7 and type like the number "1"
A case may occur where both of the above 10 apply, but in this case, the priority encoder gives priority to the one with the smaller type number. Furthermore, for special characters and accentuated sounds, the size and position within the character frame are important factors. Therefore, when filling in forms, it is desirable to give the filler a guide on how to fill in the forms, but in the case of this type classification, it is recommended to It is sufficient to provide guides such as "Special characters should be written in appropriate sizes" and "Special characters should be written separately at the top, middle, and bottom of the character frame depending on their type." The normalization unit in FIG. 1 converts input characters into
At least one of size and position is made different for each type and normalized to a predetermined normalized image area. In this example, the binarized character pattern within the extracted 80 x 80 bit character frame is normalized to a 64 x 64 bit normalized image area. FIG. 3 illustrates how each type of character is normalized into a 64 x 64 bit normalized image area. The hatched area indicates the area where the normalized image is written, and the white area around it indicates the white image area. Types 1 to 6 are for lowercase sized characters;
6 uses a normalized size of 30, types 7 to 10
In this case, a normalized size of 60 is used. However, for types 1, 2, 3, 7, 8, and 9 of elongated character patterns, these character patterns are 30x30 or 60x60.
When normalized to , the shape characteristic of elongation is lost, and especially when thinning the line after normalization, minute irregularities on the line edge in the length direction are emphasized by enlarging, distorting the original characteristics. Therefore, types 1, 2,
In 3 and 7, the value of ΔX is used as is, and in type 8,
9, the value of ΔY is used as is for normalization. After being normalized as described above, the normalized character pattern is subjected to feature extraction in the feature extraction section from the entire 64×64 bit normalized image area including the surrounding white background. Next, in the classification identification section, pattern matching is performed between the extracted features and features prepared in advance for standard character patterns that have been similarly normalized, and the input character pattern is identified. As a feature extraction method, differences in the size and position of normalized character patterns within a certain normalized image area are reflected as feature differences when features are extracted from the entire normalized image area. If it is something,
That is, any feature that is size and position dependent can be used. Conventionally known methods include methods for determining the number or distribution of black dots by position or by section when viewed from the X direction, Y direction, or diagonal direction, and methods for determining the number or distribution of black dots by position or by section when viewed from the , how to find the number of connected black dots in vertical and diagonal directions, how to find the number of white dots from each side (top, bottom, left and right) to the character pattern (depth), or the total number of white dots (area) up to a certain depth. This is known, but one example is Japanese Patent Application Laid-Open No. 58-201184, which calculates the number of connected black dots for each direction and section and the area of a white region along each side. These feature extraction and classification/identification processes themselves are common in pattern matching-type recognition methods, so detailed explanations will be omitted. If necessary, please refer to the above-mentioned JP-A-Sho. Next, a good normalization mechanism according to the present invention will be explained with reference to FIG. Type selector 1
0 corresponds to the normalization classification section in FIG. 1, and the other sections correspond to the normalization section in FIG. The type selector 10 is the coordinate value of the input character pattern circumscribing rectangle within the 10 mm x 10 mm (80 x 80 dots) character frame,
Based on Ymin, Ymax, Xmin, and Xmax, the type of input character (type 1 to type 10 in Table 1) is determined, and a control signal necessary for normalization is generated according to each type. These control signals will become clear later. The normalization mechanism includes a normalization ROM 18, 28, 128 x 128 bit image buffer 22, and a 64 x 64 bit normalization image area which is the normalization image area.
It has a buffer 52. image battle 22
contains the input character pattern of the cut out character frame.
The input character pattern is stored with the upper left corner of the character frame aligned with the coordinates (0, 0) when the coordinates of the upper left corner of the image buffer 22 are (0, 0). The image buffer 22 is set to 128 x 128 bits to facilitate addressing, but it may be of any size as long as it includes the character frame. The purpose of this normalization mechanism is to normalize the input character pattern to a predetermined size and position shown in FIG. 3 in accordance with the determined type of input character and store it in the normalized image buffer 52. The normalization ROMs 18 and 28 each have two normalization matrices A and B, where matrix A is for normalized size 30 and matrix B is for normalized size 60. Matrices A and B are selected by the type selector 10 according to the determined type. The functions of the normalization ROMs 18 and 28 are such that when a character pattern in the image buffer 22 is normalized to a normalized size of 30 or 60 by reduction/enlargement and stored in the buffer 52, depending on the size of the character pattern,
Normalize buffer 52 which dots in the character pattern
It is to generate an address that indicates what to write to. In the case of reduction, the dots are thinned out and read, and in the case of enlargement, the same selected dots are read repeatedly,
Alternatively, in some cases, the image may be outputted from the image buffer 22 as it is without being enlarged/reduced. For this purpose, Ymin and Ymax (7 bits each) are given to the subtracter 12, and its output (ΔY−
1) is applied to the upper address (H) that selects the row position (Y position) of the ROM 18 via the A input of the multiplexer MPX14. The lower address (L) for selecting the column position (X position) of the ROM 18 is given from the upper bits ²⁶ to ²¹¹ of the address counter 30. The B input "59" of the MPX 14 is selected by the type selector when the normalized size is 60, and is used when the character pattern of the image buffer 22 is output as is with the Y direction enlargement ratio=1. Since the output of the normalization ROM 18 indicates the Y address in the character pattern circumscribing rectangle, the output of the ROM 18 is added to Ymin in the adder 20 and converted into a Y address for actually addressing the image buffer 22. Similarly, Xmin and Xmax (7 bits each) are given to the subtracter 24, whose output (ΔX-1) is passed through the A input of the multiplexer MPX26 as
It is given to the upper address (H) that selects the row position (Y position) of the ROM 28. ROM28 column position (X
The lower address (L) for selecting the location) is given from lower bits ²⁰ to ²⁵ of address counter 30.
The B input “59” of MPX26 converts the character pattern of image buffer 22 to X when the normalized size is 60.
This is selected by the type selector 10 when outputting as is with the direction enlargement ratio=1. MPX2
The C input "29" of No. 6 is selected by the type selector 10 when outputting the character pattern of the image buffer 22 with the X direction enlargement ratio=1 in the case of the normalized size of 30. The output of the ROM 28 also indicates the X address in the circumscribed rectangle, so the adder 32
It is added to Xmin and converted into an X address for actually addressing the image buffer 22. Here, the configuration of the normalized ROM matrix will be explained with reference to FIG. In this example, to simplify the explanation, the character frame dimensions are 10 x 10 bits (Figure 5A), and the normalized size is 4 x 4 bits (Figure 5A).
Normalized ROM matrix 18 for Figure B)
28 (FIG. 5C). Element Ekl of the kth row and lth column of the normalization matrix
The value of is determined by Ekl=INT(k×l/L+0.5). Here, L is the maximum value that the address index can take (3 in FIG. 5C), and INT(x) is the integer part of x. Row positions 0 to 9 are size indices corresponding to ΔX-1 or ΔY-1, and correspond to the upper addresses of the normalization matrix. Column positions 0 to 3 correspond to lower addresses of the normalization matrix. Generally, the number of rows in a matrix for normalization in the X direction is the number of bits in the X direction of the character frame (10 in the case of Figure 5), and the number of columns is the number of bits in the X direction of the normalized size (in the case of Figure 5). 4), the number of rows of the matrix for normalization in the Y direction is the number of bits in the Y direction of the character frame (10 in the case of Figure 5), and the number of columns is the number of bits in the Y direction of the normalized size. (4 in case of Figure 5)
Therefore, in the practical example shown in FIG. 4, the number of rows of the normalization matrices 18A, 28A is 80, the number of columns is 30, and the number of rows of the normalization matrices 18B, 28B is
80, and the number of columns is set to 60. In the case of Figure 5, the character frame and normalized size X,
Since the number of bits in the Y direction is the same, the normalization matrix shown in FIG. 5C can be used in common in the X and Y directions. In operation, for example, if the dimension in the X direction of the input character pattern Xmax-Xmin=ΔX-1 (size index) is 2, the value of the X normalization matrix is 0,
1, 1, 2 are read, so X address =
1 is generated twice and expanded. When the size index = 3, the dimension in the X direction of the circumscribed rectangle = the normalized size in the X direction, which corresponds to an enlargement ratio of 1. When the size index=9, X addresses=0, 3, 6, 9 are generated and thinning reading is performed. The Y-direction normalization matrix operates similarly. However, since the normalization matrix generates an address that converts the character pattern to the normalized size by setting the address of the upper left corner of the circumscribed rectangle as (0, 0), the actual X, Y address for accessing the image buffer 22 is Ax and Ay are given by the following equations. Ax = Xmin + matrix (ΔX-1, ix) Ay = Ymin + matrix (ΔY-1, iy) Here, matrix (ΔX-1, ix) and matrix (ΔY-1, iy) are ΔX-1 and ΔY-, respectively.
1 is the size index, ix, iy are the address
This is a matrix value obtained as an index. Adders 20 and 32 in FIG. 4 perform this addition. Therefore, the normalization buffer 5 of 64×64 bits
In the case of Fig. 4 using 2, normalization ROM18, 28
generates an address signal 4096 times (=64×64) and reads the image buffer 22. However, the third
In order to normalize the data and write it into the normalization buffer 52 as shown in the figure, some ingenuity is required. Comparator CMP4
2, 44 and adders 46, 48 are for that purpose. First, the principle will be explained with reference to FIG. Let the X and Y addresses Anx and Any of the normalization buffer 52 be given by the following equations. Anx=ixαx Any=iyαy Here, indicates addition of binary numbers (2 bits in FIG. 6, 6 bits in FIG. 4), ignoring overflow. Therefore, for example αx=0,
The data written as shown in Figure 6A when αy=0 is
When αx = 1 and αy = 2, data can be written by wrapping around starting points (1, 2) as shown in Figure 6B, and data to be written when ix > 1 and iy > 1 can be written. If masked, it will be written as shown in FIG. 6C. That is, the normalized buffer 5 with αx and αy
Specify the writing start point of 2, ix > limx, iy > limy
By masking the write data, the read output of the image buffer 22 can be masked and written to any position in the normalization buffer 52 with any size. The limit values limx and limy at this time are taken as limit count values. Adders 46, 48 and comparators 42, 44 control writing to normalization buffer 52 based on the above principle. Adders 46 and 48 receive addresses corresponding to the above iy and ix as one input from the address counter 30, and receive addresses corresponding to αy and αx as the other inputs from the multiplexer.
Receive from MPX38,40. The type selector 10 controls the MPXs 38 and 40 and selects the corresponding starting point address αx(2,
17, 23 or 27), αy (2, 17, 25, 32 or 47)
to adders 48 and 46. Adder 46 provides an upper address (H) for selecting the Y address of normalization buffer 52, and adder 48 provides a lower address (L) for selecting the X address. Therefore, the dot data read from the image buffer 22 is written from the starting point (αx, αy). Comparators 42, 44 receive as one input the addresses corresponding to iy, ix from address counter 30, and the limit count value from multiplexer MPX 34, 36 as the other input.
Type selector 10 gates the limit count value to MPX 34, 36 depending on the determined type. Comparators CMP42 and CMP44 operate AND gate 50 when iy and ix are less than the limit count value.
energized and exceeds the limit count value
AND gate 50 is prohibited. Note that the outputs of the type selector 10 are “2”, “17”, “23”, “25”,
“27”, “29”, “32”, “47”, “59”, “ΔX-1
”,
“ΔY−1” is MPX14, 26, 34, 36, 38, when these outputs are generated according to the type determination.
40 to select input of the corresponding number. Table 2 below is used for normalization of each type 1 to 10.
Indicates the ROM matrix (for normalized size 30...A, for normalized size 60...B), size index (X, Y), limit count value (X, Y), and starting point (X, Y). There is.

【表】第３図に示したように、タイプ１、２、３、７
ではイメージ・バツフア２２の文字パターンのＸ
方向の拡大比率を１にしてそのまま出力し、タイ
プ８、９ではＹ方向の拡大比率を１にしてそのま
ま出力する。したがつて、タイプ１、２、３、７
のＸリミツト・カウント値はΔX−１、タイプ
８、９のリミツト・カウント値はΔY−１にされ
ている。また、拡大比率１はサイズ・インデツク
スが正規化マトリクスのアドレス・インデツクス
の最大値（正規化サイズ30では29、正規化サイズ
60では59）に等しい場合に得られるから、正規化
サイズ30を用いるタイプ１、２、３のＸサイズ・
インデツクスは29、正規化サイズ60のタイプ７の
Ｘサイズ・インデツクスは59、正規化サイズ60の
タイプ８、９のＹサイズ・インデツクスは59にさ
れている。そのため、タイプ・セレクタ１０はタ
イプ１、２、３のときはマルチプレクサ２６にお
いてサイズ・インデツクス２９を選択しＸ方向拡
大比率１として正規化マトリクス２８Ａをアクセ
スし、タイプ７のときはマルチプレクサ２６にお
いてサイズ・インデツクス５９を選択しＸ方向拡
大比率１として正規化マトリクス２８Ｂをアクセ
スし、またタイプ８、９のときはマルチプレクサ
１４においてサイズ・インデツクス５９を選択
し、Ｙ方向拡大比率１として正規化マトリクス１
８Ｂをアクセスする。以上の正規化機構によれば、アドレス・カウン
タが4096（＝2¹²）カウントする間に、イメージ・
バツフア２２の１つの入力文字パターンがそのタ
イプに応じた大きさ及び位置をもつて正規化バツ
フア５２に正規化して記憶されることになる。また80×80ビツトの文字枠の場合正規化ROM
１８，２８の各Ｘ、Ｙアドレスは７ビツト・バイ
トで十分であるが、１ビツトを付加して８ビツ
ト・バイトとし、この１ビツトを出力ANDゲー
ト５０の制御に使用すると、不用意な拡大を防止
することができる。即ち、例えばΔX＝26、ΔY
＝５、Gy重心＝中部というような小さく横に細
長い文字パターンが入力された場合、この文字パ
ターンは表１に従いタイプ・セレクタ１０におい
てタイプ５と判定される。従つてこの文字パター
ンは30×30ビツトに正規化され、その結果細長い
パターンが正方形に変形され、元の細長いパター
ンの形状上の特徴が失われることになる。このと
きのＹ方向の拡大率は６（＝30／５）であるが、
正規化マトリクスの作成時に、正規化マトリクス
の同じ行に同じアドレスが所定数以上並ばないよ
うにし、残りの位置のアドレスには上記の付加１
ビツトによつて出力ANDゲートの禁止を指定し
ておけば、不用意な拡大を防止できる。第７図は拡大率を最大２に抑えた正規化マトリ
クスの簡単な例を示している。第７図のＡは拡大
率を抑えない場合であり、１ドツト幅の線（サイ
ズ・インデツクス＝０）の場合は６倍に拡大さ
れ、２ドツト幅の線（サイズ・インデツクス＝
１）の場合は３倍に拡大される。第７図のＢは拡
大率を最大２に抑えた場合であり、出力ANDゲ
ート禁止ビツトＨをセツトしたアドレスを両端に
配置し、このアドレスが正規化マトリクスから読
取られたとき出力ANDゲートを禁止するもので
ある。したがつて、サイズ・インデツクス＝０、
１においても２倍の拡大に抑えられる。第４図に
おいて、出力ANDゲート５０への線１８Ｈ，２
８Ｈはこの出力ゲート禁止ビツトである。出力ゲ
ート禁止ビツトを有するアドレスを正規化マトリ
クスの両端に配置すれば、文字パターンを第３図
の正規化領域（斜線部分）の中心部に位置合わせ
して正規化できる。なお、正規化段階で文字のタイプを特定する本
発明によれば、タイプ分エラーによる誤認識が１
つの問題となるが、予じめ記入者に大体の記入ガ
イドを与えておき、普通の注意力で記入してもら
つた実験結果によれば、十分実用しうることがわ
かつた。更に認識率を高める場合は、例えば
OCR帳票読取り用フオーマツト・プログラムに
おいてフイールド毎の文字種を規定し、小文字を
許さないフイールドではタイプ１〜６の選択を禁
止し、文字枠に小さめに書かれた大文字が小文字
サイズに正規化されて誤認識されるのを防止した
り、あるいは、タイプ分けエラーの生じやすい文
字パターンに対しては、類別される可能性のある
タイプでそれぞれ正規化したときの特徴を含むテ
ンプレート複数個用意しておくこともできよう。正規化後、正規化バツフア５２の文字パターン
は読出されて特徴抽出され、分類識別処理され
る。［発明の効果］本発明は入力文字パターンの大きさ、位置に基
いて文字パターンを複数のタイプに分け、入力文
字パターンをタイプ毎に異なる大きさ及び位置を
もつて正規化イメージ領域に正規化するから、正
規化イメージ領域内の文字パターンそのものが大
きさ、位置の情報を包含することになり、したが
つて以後の特徴抽出、分類識別の段階では入力文
字パターンの大きさ、位置を一切考慮する必要が
ないから、認識処理が簡単且つ高速になり、しか
も大文字と促拗音、小文字とを明確に区別でき
る。[Table] As shown in Figure 3, types 1, 2, 3, 7
Now, the character pattern of image buffer 22 is X.
The enlargement ratio in the direction is set to 1 and the data is output as is.For types 8 and 9, the enlargement ratio in the Y direction is set to 1 and the data is output as is. Therefore, types 1, 2, 3, 7
The X limit count value of type 8 and type 9 is set to ΔX-1, and the limit count value of types 8 and 9 is set to ΔY-1. Also, when the expansion ratio is 1, the size index is the maximum value of the address index of the normalized matrix (29 for normalized size 30,
60 is equal to 59), so the X size of types 1, 2, and 3 using normalized size 30.
The index is 29, the X size index of type 7 with normalized size 60 is 59, and the Y size index of types 8 and 9 with normalized size 60 is 59. Therefore, for types 1, 2, and 3, the type selector 10 selects the size index 29 in the multiplexer 26 and accesses the normalization matrix 28A with an X-direction enlargement ratio of 1; The index 59 is selected and the normalization matrix 28B is accessed with the X-direction enlargement ratio of 1, and in the case of types 8 and 9, the size index 59 is selected in the multiplexer 14 and the normalization matrix 28B is accessed with the Y-direction enlargement ratio of 1.
Access 8B. According to the above normalization mechanism, while the address counter counts 4096 (=2 ¹² ), the image
One input character pattern in the buffer 22 is normalized and stored in the normalization buffer 52 with a size and position depending on its type. Also, in the case of a character frame of 80 x 80 bits, the normalization ROM
A 7-bit byte is sufficient for each of the X and Y addresses 18 and 28, but adding 1 bit to make an 8-bit byte and using this 1 bit to control the output AND gate 50 will result in unintentional expansion. can be prevented. That is, for example ΔX=26, ΔY
When a small, horizontally elongated character pattern such as =5, Gy center of gravity = middle part is input, this character pattern is determined to be type 5 by the type selector 10 according to Table 1. Therefore, this character pattern is normalized to 30 x 30 bits, resulting in the elongated pattern being transformed into a square and losing the geometric features of the original elongated pattern. At this time, the magnification ratio in the Y direction is 6 (=30/5),
When creating a normalization matrix, make sure that no more than a predetermined number of the same addresses are lined up in the same row of the normalization matrix, and add the above addition 1 to the addresses in the remaining positions.
By specifying inhibition of the output AND gate using a bit, careless expansion can be prevented. FIG. 7 shows a simple example of a normalization matrix with a maximum magnification of 2. A in Figure 7 shows the case where the magnification rate is not suppressed; a line with a width of 1 dot (size index = 0) is expanded six times, and a line with a width of 2 dots (size index = 0) is expanded six times.
In the case of 1), the image is expanded three times. B in Figure 7 shows the case where the magnification rate is suppressed to a maximum of 2, and an address with the output AND gate prohibition bit H set is placed at both ends, and when this address is read from the normalization matrix, the output AND gate is prohibited. It is something to do. Therefore, size index=0,
1, the expansion can be suppressed to 2 times. In FIG. 4, lines 18H, 2 to output AND gate 50
8H is this output gate inhibition bit. By arranging addresses having output gate inhibit bits at both ends of the normalization matrix, the character pattern can be normalized by aligning it with the center of the normalization area (shaded area) in FIG. According to the present invention, which specifies the character type at the normalization stage, the number of misrecognitions due to type errors is reduced to 1.
However, according to the results of an experiment in which a rough guide was given to the person filling out the form in advance and the person filled it out with normal attentiveness, it was found that it was sufficiently practical. If you want to further increase the recognition rate, for example
In the format program for reading OCR forms, the character type is specified for each field, and selection of types 1 to 6 is prohibited in fields that do not allow lowercase letters, and uppercase letters written smaller in the character frame are normalized to lowercase size and erroneously. To prevent character patterns from being recognized or to easily cause type classification errors, prepare multiple templates that include the characteristics when normalized for each type that may be classified. I can do it too. After normalization, the character patterns in the normalization buffer 52 are read out, features are extracted, and classification and identification processing is performed. [Effects of the Invention] The present invention divides the input character pattern into a plurality of types based on the size and position of the input character pattern, and normalizes the input character pattern into a normalized image area with a different size and position for each type. Therefore, the character pattern itself in the normalized image area includes size and position information, and therefore, the size and position of the input character pattern are not taken into account at all in the subsequent feature extraction and classification/identification stages. Since there is no need to do so, recognition processing becomes simple and fast, and uppercase letters, accents, and lowercase letters can be clearly distinguished.

[Brief explanation of the drawing]

第１図は本発明による正規化技術を用いた文字
認識方式の機能的ブロツク図、第２図は文字枠に
おける文字パターンの外接長方形を示す図、第３
図は、種々の正規化タイプを示す図、第４図は本
発明による正規化装置を示す図、第５図はイメー
ジ・バツフア、正規化イメージ・バツフア、及び
正規化ROMマトリクスを示す図、第６図は正規
化イメージ・バツフアの記憶パターンを示す図、
第７図は正規化ROMマトリクスの記憶パターン
を示す図である。 Fig. 1 is a functional block diagram of a character recognition method using the normalization technology according to the present invention, Fig. 2 is a diagram showing a circumscribed rectangle of a character pattern in a character frame, and Fig. 3 is a diagram showing a circumscribing rectangle of a character pattern in a character frame.
4 is a diagram showing a normalization device according to the present invention; FIG. 5 is a diagram showing an image buffer, a normalized image buffer, and a normalized ROM matrix; FIG. Figure 6 shows the storage pattern of the normalized image buffer.
FIG. 7 is a diagram showing a storage pattern of a normalized ROM matrix.

Claims

[Claims] 1. Binarization of a recognition target character pattern including special characters, small characters including consonants and consonant characters, and large characters occupying a large area in a character frame, and binarization of cut out characters. In a character recognition method that recognizes characters by normalizing character patterns, extracting features from the normalized character patterns, and classifying and identifying characters based on the extracted features, the above-mentioned binary character patterns are divided into sizes and character frames. means for classifying into a plurality of types based on the position of at least a predetermined special character type, a predetermined lower case type including consonant and consonant characters, and a predetermined upper case type; and means for normalizing the normalized character pattern into a normalized image area of a constant size by varying at least one of the size and position of the character pattern normalized area for each type, the entire normalized image area. A character recognition method characterized by extracting features from the target.