JP2003346083A

JP2003346083A - Character recognition device and method, program, storage medium, and character recognition system

Info

Publication number: JP2003346083A
Application number: JP2002152019A
Authority: JP
Inventors: Hiroaki Ikeda; 裕章池田
Original assignee: Canon Inc
Current assignee: Canon Inc
Priority date: 2002-05-27
Filing date: 2002-05-27
Publication date: 2003-12-05

Abstract

<P>PROBLEM TO BE SOLVED: To achieve a high recognition rate even for characters of high flatness by using a recognition dictionary for standard patterns with fixed vertical and horizontal dimensions. <P>SOLUTION: The aspect ratio R of a character block is determined and compared with predetermined thresholds T1, T2, T3 and T4. By comparing with the thresholds, the character block is determined to be either a character block within the range of standard aspect ratios or a flat (portrait or landscape) character block or a character block that could be both the character block with the standard aspect ratios or the flat character block. Then normalization suited for each of the blocks is performed and character recognition is carried out. <P>COPYRIGHT: (C)2004,JPO

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、原稿を光学的に読
み取り文字を認識する文字認識装置、文字認識方法、プ
ログラムおよび記憶媒体、および文字認識システムに関
するものであり、特に文字認識の処理中に行われる文字
の正規化に関するものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a character recognition device, a character recognition method, a program and a storage medium, and a character recognition system for optically reading a document and recognizing characters. It concerns the character normalization to be performed.

【０００２】[0002]

【従来の技術】光学的文字認識装置（ＯＣＲ）では、濃
度射影（ヒストグラム）により文字行を切出し（抽
出）、さらに１文字単位の文字ブロック切出し（抽出）
を行う。文字ブロックの切出しに際しては、文字行方向
に濃度射影を取り、濃度射影値の変化に基づいて文字行
を分離し、さらに、各文字行について、文字行と垂直方
向に濃度射影を取ることで個々の文字ブロックを抽出す
る。また必要に応じて、標準的な文字サイズや文字ピッ
チの推定値、および行と垂直方向に濃度射影値等の情報
を用いて、１文字単位の文字画像となる、最終的な文字
ブロックを切出す。切出された文字ブロックは、縦横寸
法の正規化を行った後に、所定の特徴データ抽出の処理
が施される。認識対象となる文字はその画像の特徴デー
タが予め求められており、標準パタンとして登録されて
いる。特徴データが抽出された個々の文字ブロックは、
標準パタンとの類似度が計算され、最も類似度の高い文
字が認識結果とされる。標準パタンの集合は認識辞書と
呼ばれる。2. Description of the Related Art In an optical character recognition device (OCR), a character line is cut out (extracted) by density projection (histogram), and a character block is cut out (extracted) for each character.
I do. When extracting a character block, density projection is performed in the character line direction, character lines are separated based on changes in the density projection value, and density projection is performed for each character line in the direction perpendicular to the character line. Extract the character block of. If necessary, the final character block, which becomes a character image in units of one character, is cut using information such as a standard character size and character pitch estimation value and a density projection value in a direction perpendicular to the line. put out. The extracted character block is subjected to predetermined feature data extraction processing after normalizing the vertical and horizontal dimensions. For the character to be recognized, the feature data of the image is obtained in advance and registered as a standard pattern. Each character block from which feature data is extracted is
The similarity with the standard pattern is calculated, and the character with the highest similarity is regarded as the recognition result. A set of standard patterns is called a recognition dictionary.

【０００３】文字ブロックの正規化は、標準パタンの形
状に対応した縦横寸法（正規化サイズ）とする処理であ
り、従来、特開平７−２７１９０８号、特許第３１２１
０９１号、特許第３２１６９８３号に記載の処理が知ら
れていた。（１）特開平７−２７１９０８号記載の処理（従来例１
という。）縦横比によって、正規化サイズを多段階に分け、縦横比
が最も近い正規化サイズに正規化する。（２）特許第３１２１０９１号記載の処理（従来例２と
いう。）縦横比が所定範囲内であれば、所定の縦横寸法に一致す
るように文字画像を拡大・縮小して正方形の文字枠に正
規化し、縦横比が所定範囲よりも大または小のとき、す
なわち縦長の文字や横長の文字については、縦長、横長
の特徴を保存し得る正規化サイズを採用する。縦長、横
長の文字画像は、それぞれ所定の係数を乗じた縦横比と
する。（３）特許第３２１６９８３号記載の処理（従来例３と
いう。）縦横比によって、文字画像を多段階に分けるとともに、
各段階に応じたスケーリングを行い、正規化サイズを緩
やかに変化させる。The normalization of a character block is a process of setting the vertical and horizontal dimensions (normalized size) corresponding to the shape of a standard pattern. Conventionally, Japanese Patent Laid-Open No. Hei 7-271908 and Japanese Patent No. 3121
No. 091 and Japanese Patent No. 32169983 are known. (1) Processing described in JP-A-7-271908 (conventional example 1
That. According to the aspect ratio, the normalized size is divided into multiple stages, and the normalized size is normalized to the closest normalized size. (2) Processing described in Japanese Patent No. 3121091 (referred to as Conventional Example 2) If the aspect ratio is within a predetermined range, the character image is enlarged / reduced so as to match a predetermined vertical / horizontal dimension, and is normally formed into a square character frame. When the aspect ratio is larger or smaller than a predetermined range, that is, for a vertically long character or a horizontally long character, a normalized size capable of preserving the vertically and horizontally long characteristics is adopted. The lengthwise and widthwise character images each have an aspect ratio multiplied by a predetermined coefficient. (3) Processing described in Japanese Patent No. 3216983 (referred to as Conventional Example 3) Character images are divided into multiple stages according to the aspect ratio.
Scaling is performed according to each stage, and the normalized size is gradually changed.

【０００４】[0004]

【発明が解決しようとする課題】しかし、従来例１、２
では同一字種においても、文字によって正規化サイズが
異なり、一方認識辞書における各標準パタンは一定の縦
横寸法に設定されているため、類似度の判定の処理が複
雑になるとともに、認識率が低下する。However, in the conventional examples 1 and 2,
In the same character type, the normalized size differs depending on the character.On the other hand, since each standard pattern in the recognition dictionary is set to a fixed vertical and horizontal dimension, the process of determining similarity becomes complicated and the recognition rate decreases. I do.

【０００５】また従来例３は、文字の扁平度が著しいと
き、すなわち縦長、横長の程度が顕著になると、変形が
無視できなくなり、認識率が低下する。In Conventional Example 3, when the flatness of a character is remarkable, that is, when the degree of length and width becomes remarkable, deformation cannot be ignored and the recognition rate decreases.

【０００６】本発明はこのような従来の問題を解決すべ
く創案されたものであり、一定縦横寸法の標準パタンの
認識辞書を用いて、扁平度の大きい文字に対しても高い
認識率を得ることを目的とする。The present invention has been made in order to solve such a conventional problem, and obtains a high recognition rate even for a character having a large flatness by using a recognition dictionary of a standard pattern having a constant vertical and horizontal dimension. The purpose is to:

【０００７】[0007]

【課題を解決するための手段】上記課題を解決するため
に本発明の文字認識装置は、文字の標準パタンの特徴デ
ータを格納した認識辞書と、文字画像を含む画像データ
から文字ブロックを抽出する文字ブロック抽出手段と、
前記抽出された文字ブロックの縦横比を算出し、この縦
横比に基づいて、正規化手順を決定する正規化手順決定
手段と、前記正規化手順決定手段により決定された正規
化手順を用いて、前記文字ブロックの特徴データを得る
特徴データ取得手段と、前記特徴データ取得手段で得た
特徴データと前記標準パタンの特徴データとの比較によ
り類似度を算出する類似度算出手段と、を備えた文字認
識装置であって、前記正規化手順決定手段は、前記算出
した縦横比に基づいて、前記文字ブロック抽出手段で抽
出された文字ブロックを、少なくとも、標準パタンの縦
横比に略等しい縦横比（以下「標準縦横比」と呼ぶ。）
の文字ブロックと、扁平な文字ブロックと標準縦横比の
文字ブロックとのどちらの可能性もある文字ブロックと
のうち、いずれの文字ブロックであるか判断し、前記標
準縦横比の文字ブロックとして判断された文字ブロック
については標準縦横比の文字ブロック用の正規化手順を
用いるように決定し、前記扁平な文字ブロックと標準縦
横比の文字ブロックとのどちらの可能性もある文字ブロ
ックとして判断された文字ブロックについては前記標準
縦横比の文字ブロック用の正規化手順と扁平な文字ブロ
ック用の正規化手順の両方を用いるように決定する。SUMMARY OF THE INVENTION In order to solve the above-mentioned problems, a character recognition apparatus of the present invention extracts a character block from a recognition dictionary storing characteristic data of a character standard pattern and image data including a character image. Character block extracting means;
Calculating the aspect ratio of the extracted character block, using a normalization procedure determining means for determining a normalization procedure based on the aspect ratio, using a normalization procedure determined by the normalization procedure determination means, A character data obtaining unit that obtains the characteristic data of the character block; and a similarity calculating unit that calculates a similarity by comparing the characteristic data obtained by the characteristic data obtaining unit with the characteristic data of the standard pattern. In the recognition apparatus, the normalization procedure determining means converts the character block extracted by the character block extracting means into an aspect ratio (hereinafter, referred to as an aspect ratio (hereinafter, referred to as an aspect ratio) substantially equal to a standard pattern) based on the calculated aspect ratio. It is called "standard aspect ratio.")
Character block, and a character block that may be either a flat character block or a character block having a standard aspect ratio, which character block is determined, and is determined as the character block having the standard aspect ratio. The character block determined to use the normalization procedure for a character block having a standard aspect ratio is determined as a character block that may be either the flat character block or the character block having a standard aspect ratio. For the block, it is determined to use both the normalization procedure for the character block having the standard aspect ratio and the normalization procedure for the flat character block.

【０００８】上記課題を解決するために本発明の文字認
識方法は、文字画像を含む画像データから文字ブロック
を抽出する文字ブロック抽出ステップと、前記抽出され
た文字ブロックの縦横比を算出し、この縦横比に基づい
て、正規化手順を決定する正規化手順決定ステップと、
前記正規化手順決定ステップで決定された正規化手順を
用いて、前記文字ブロックの特徴データを得る特徴デー
タ取得ステップと、前記特徴データ取得ステップで得た
特徴データと、認識辞書に格納されている標準パタンの
特徴データとの比較により類似度を算出する類似度算出
ステップと、を備えた文字認識方法であって、前記正規
化手順決定ステップでは、前記算出した縦横比に基づい
て、前記文字ブロック抽出ステップで抽出された文字ブ
ロックを、少なくとも、標準パタンの縦横比に略等しい
縦横比（以下「標準縦横比」と呼ぶ。）の文字ブロック
と、扁平な文字ブロックと標準縦横比の文字ブロックと
のどちらの可能性もある文字ブロックとのうち、いずれ
の文字ブロックであるか判断し、前記標準縦横比の文字
ブロックとして判断された文字ブロックについては標準
縦横比の文字ブロック用の正規化手順を用いるように決
定し、前記扁平な文字ブロックと標準縦横比の文字ブロ
ックとのどちらの可能性もある文字ブロックとして判断
された文字ブロックについては前記標準縦横比の文字ブ
ロック用の正規化手順と扁平な文字ブロック用の正規化
手順の両方を用いるように決定する。In order to solve the above problem, a character recognition method according to the present invention comprises: a character block extracting step of extracting a character block from image data including a character image; and calculating an aspect ratio of the extracted character block. A normalization procedure determining step of determining a normalization procedure based on the aspect ratio;
Using the normalization procedure determined in the normalization procedure determination step, a feature data obtaining step for obtaining the feature data of the character block, the feature data obtained in the feature data obtaining step, and stored in a recognition dictionary. A similarity calculation step of calculating a similarity by comparing with the characteristic data of the standard pattern, wherein the normalization procedure determination step includes the step of determining the character block based on the calculated aspect ratio. The character blocks extracted in the extraction step are divided into at least a character block having an aspect ratio substantially equal to the aspect ratio of the standard pattern (hereinafter referred to as “standard aspect ratio”), a flat character block and a character block having the standard aspect ratio. It is determined whether the character block is a character block having the above-mentioned standard aspect ratio. The determined character block is determined to use a normalization procedure for a standard aspect ratio character block, and is determined to be a character block that may be either the flat character block or the standard aspect ratio character block. For the character block, it is determined to use both the normalization procedure for the standard aspect ratio character block and the flat character block normalization procedure.

【０００９】上記課題を解決するために本発明の文字認
識システムのための文字認識装置は、文字画像を含む画
像データを受信し得る通信手段と、参照可能な認識辞書
と、受信した画像データから文字ブロックを抽出する文
字ブロック抽出手段と、前記抽出された文字ブロックの
縦横比を算出し、この縦横比に基づいて、正規化手順を
決定する正規化手順決定手段と、前記正規化手順決定手
段により決定された正規化手順を用いて、前記文字ブロ
ックの特徴データを得る特徴データ取得手段と、前記特
徴データ取得手段で得た特徴データと前記認識辞書に格
納されている標準パタンの特徴データとの比較により類
似度を算出する類似度算出手段と、を備えた、文字認識
システムのための文字認識装置であって、前記正規化手
順決定手段は、前記算出した縦横比に基づいて、前記文
字ブロック抽出手段で抽出された文字ブロックを、少な
くとも、標準パタンの縦横比に略等しい縦横比（以下
「標準縦横比」と呼ぶ。）の文字ブロックと、扁平な文
字ブロックと標準縦横比の文字ブロックとのどちらの可
能性もある文字ブロックとのうち、いずれの文字ブロッ
クであるか判断し、前記標準縦横比の文字ブロックとし
て判断された文字ブロックについては標準縦横比の文字
ブロック用の正規化手順を用いるように決定し、前記扁
平な文字ブロックと標準縦横比の文字ブロックとのどち
らの可能性もある文字ブロックとして判断された文字ブ
ロックについては前記標準縦横比の文字ブロック用の正
規化手順と扁平な文字ブロック用の正規化手順の両方を
用いるように決定する、ことを特徴とする。In order to solve the above-mentioned problems, a character recognition device for a character recognition system according to the present invention comprises: a communication unit capable of receiving image data including a character image; a recognition dictionary that can be referred to; A character block extracting unit for extracting a character block; a normalization procedure determining unit for calculating an aspect ratio of the extracted character block; and determining a normalization procedure based on the aspect ratio; and the normalization procedure determining unit. A feature data obtaining unit that obtains feature data of the character block using the normalization procedure determined by the above, and feature data of a standard pattern stored in the recognition dictionary and the feature data obtained by the feature data obtaining unit. A similarity calculating means for calculating a similarity by comparing the character recognition apparatus for a character recognition system, wherein the normalization procedure determining means comprises: Based on the calculated aspect ratio, the character block extracted by the character block extraction means is divided into a character block having an aspect ratio that is at least substantially equal to the aspect ratio of the standard pattern (hereinafter, referred to as “standard aspect ratio”) and a flat shape. Character block that is likely to be a character block having a normal aspect ratio or a character block having a standard aspect ratio. It is determined to use the normalization procedure for the character block having the aspect ratio, and the character block determined as a character block that may be either the flat character block or the character block having the standard aspect ratio is the standard aspect ratio. Deciding to use both the normalization procedure for the ratio character block and the normalization procedure for the flat character block, That.

【００１０】[0010]

【発明の実施の形態】次に本発明に係る文字認識装置、
文字認識方法、プログラムおよび記憶媒体、および文字
認識システムの実施の形態を図面に基づいて説明する。DESCRIPTION OF THE PREFERRED EMBODIMENTS Next, a character recognition device according to the present invention,
An embodiment of a character recognition method, a program and a storage medium, and a character recognition system will be described with reference to the drawings.

【００１１】［文字認識装置］図１は、本発明に係る光
学的文字認識装置（ＯＣＲ）等の文字認識装置の一実施
形態を示すブロック図である。[Character Recognition Apparatus] FIG. 1 is a block diagram showing an embodiment of a character recognition apparatus such as an optical character recognition apparatus (OCR) according to the present invention.

【００１２】文字認識装置は、磁気ディスク等の外部記
憶装置１０４およびイメージスキャナ１０８を備え、原
稿の画像データを外部記憶装置１０４から読み込み、あ
るいは原稿を直接イメージスキャナ１０８で読み取る。
文字認識装置は全体制御のためのＣＰＵ１０１を備え、
ＣＰＵ１０１の制御プログラムはＲＯＭ１０２に格納さ
れている。イメージスキャナ１０８から入力された画像
はＲＡＭ１０３に保持され、ＲＯＭ１０２内の制御プロ
グラムにより、文字認識のための処理を実行する。文字
認識のための認識辞書は制御プログラムとともにＲＯＭ
１０２に格納され、あるいは必要な字種ごとに、外部記
憶装置１０４からＲＡＭ１０３に読み込まれる。なお必
要に応じ、文字認識に先立ってノイズ除去等の前処理を
実行する。The character recognition device includes an external storage device 104 such as a magnetic disk and an image scanner 108, and reads image data of an original from the external storage device 104 or directly reads an original with the image scanner 108.
The character recognition device includes a CPU 101 for overall control,
The control program of the CPU 101 is stored in the ROM 102. The image input from the image scanner 108 is held in the RAM 103, and a process for character recognition is executed by a control program in the ROM 102. Recognition dictionary for character recognition is ROM with control program
The data is stored in the RAM 102 or read from the external storage device 104 into the RAM 103 for each required character type. If necessary, preprocessing such as noise removal is performed prior to character recognition.

【００１３】図１において、１０５はディスプレイ、１
０６はキーボード、１０７はマウス等のポインティング
デバイスである。また、１０９はネットワークインター
フェースであり、図示しない遠隔地に存在する装置と通
信し、プログラム、認識辞書、データ等を送受信し得
る。In FIG. 1, reference numeral 105 denotes a display,
06 is a keyboard, 107 is a pointing device such as a mouse. A network interface 109 communicates with a remote device (not shown) and can transmit and receive programs, recognition dictionaries, data, and the like.

【００１４】次に、文字認識装置において実行される文
字認識方法を説明する。Next, a character recognition method executed by the character recognition device will be described.

【００１５】［文字認識方法］［第１の実施形態］図２
は、図１の文字認識装置が実行する文字認識処理の第１
の実施形態を示すフローチャートである。[Character Recognition Method] [First Embodiment] FIG.
Is the first character recognition process performed by the character recognition device of FIG.
It is a flowchart which shows embodiment.

【００１６】第１の実施形態では、文字画像の縦横比に
応じて正規化方法を変更し、正規化方法の決定が容易で
ない場合に、暫定的に複数の正規化方法を用いて類似度
を計算し、計算結果に基づいて最終的な正規化方法を決
定する。なお、本実施形態では、正規化サイズの縦横比
は１：１であるものとする。In the first embodiment, the normalization method is changed according to the aspect ratio of the character image, and when it is not easy to determine the normalization method, the similarity is provisionally determined using a plurality of normalization methods. Calculate and determine the final normalization method based on the calculation result. In the present embodiment, the aspect ratio of the normalized size is 1: 1.

【００１７】ステップＳ２０１：まずスキャナ１０８で
読み取った画像あるいは外部記憶１０４に記憶された画
像を入力する。Step S201: First, an image read by the scanner 108 or an image stored in the external storage 104 is input.

【００１８】ステップＳ２０２：入力された画像から文
字行を抽出する。文字行抽出に際しては、文字行方向を
検出し、文字行方向の濃度射影を取り、濃度射影値（ヒ
ストグラム）を得る。この濃度射影値の変化に基づい
て、濃度射影値が所定値以上の個所を文字行として抽出
する。Step S202: A character line is extracted from the input image. At the time of character line extraction, the character line direction is detected, the density projection in the character line direction is taken, and the density projection value (histogram) is obtained. Based on the change in the density projection value, a portion where the density projection value is equal to or more than a predetermined value is extracted as a character line.

【００１９】ステップＳ２０３：抽出された文字行から
１文字単位の文字ブロックを抽出する。ステップＳ２０
２で抽出されたそれぞれの文字行に対して、文字行と垂
直方向に濃度射影を取り、濃度射影値の変化に基づいて
濃度射影値が所定値以上の個所を文字ブロックとして抽
出する。原稿によっては各文字ブロックが１文字単位の
文字画像となることもあるが、文字ピッチが小さい原稿
等では１つの文字が分割されてしまっている場合もある
ので、さらに各文字ブロックについて、文字サイズや文
字ピッチを推定しつつ、文字行と垂直方向の濃度射影値
を評価し、１文字単位の文字画像が最終的な文字ブロッ
クとして抽出されるように制御する。Step S203: A character block of one character unit is extracted from the extracted character line. Step S20
A density projection is performed on each of the character lines extracted in step 2 in a direction perpendicular to the character line, and a portion where the density projection value is equal to or more than a predetermined value is extracted as a character block based on a change in the density projection value. Depending on the document, each character block may be a character image in units of one character. However, in a document having a small character pitch, one character may be divided. The density projection value in the vertical direction with respect to the character line is evaluated while estimating the character pitch and the character pitch, and control is performed so that a character image in units of one character is extracted as a final character block.

【００２０】ステップＳ２０４：抽出された文字ブロッ
クの縦横比を求め、所定の閾値Ｔ１、Ｔ２、Ｔ３、Ｔ４
と比較する。ここで、図３の文字ブロック３０１（ひら
がなの「つ」１文字の画像を示す。）において、その高
さをｈ、幅をｗとするとき、縦横比Ｒ＝ｗ／ｈと定義す
る。閾値Ｔ１、Ｔ２、Ｔ３、Ｔ４はＴ１＜Ｔ２＜Ｔ３＜
Ｔ４の関係にあり、ＲがＴ１より小のとき（式１）、文
字ブロックは縦長、ＲがＴ４より大のとき（式２）、文
字ブロックは横長、ＲがＴ２以上であり、かつＴ３以下
のとき（式３）、文字ブロックは標準パタンの縦横比に
略等しい縦横比（以下「標準縦横比」と呼ぶ。）である
と判断する。Step S204: The aspect ratio of the extracted character block is determined, and predetermined threshold values T1, T2, T3, T4
Compare with Here, in the character block 301 in FIG. 3 (showing an image of one character of Hiragana), when the height is h and the width is w, an aspect ratio R = w / h is defined. The threshold values T1, T2, T3, and T4 are T1 <T2 <T3 <
When R is smaller than T1 (Equation 1), the character block is vertically long, and when R is larger than T4 (Equation 2), the character block is horizontal, R is T2 or more, and T3 or less. At this time (Equation 3), it is determined that the character block has an aspect ratio substantially equal to the aspect ratio of the standard pattern (hereinafter, referred to as “standard aspect ratio”).

【００２１】[0021]

【数１】文字ブロックが標準縦横比であったときはステップＳ２
０５に移行し、縦長あるいは横長であったときはステッ
プＳ２０６に移行する。(Equation 1) If the character block has the standard aspect ratio, step S2
The process proceeds to step S206, and if it is portrait or landscape, the process proceeds to step S206.

【００２２】縦横比ＲがＴ１以上であり、かつＴ２より
小のとき（式４）、文字ブロックは縦長あるいは標準縦
横比と判断し、そのいずれであるかの最終判断は保留す
る。また縦横比ＲがＴ３より大であり、かつＴ４以下の
とき（式５）、文字ブロックは横長あるいは標準縦横比
と判断し、そのいずれであるかの最終判断は保留する。
以後ステップＳ２０８に移行する。When the aspect ratio R is equal to or greater than T1 and smaller than T2 (Equation 4), the character block is determined to be a portrait or a standard aspect ratio, and the final determination of which is the character block is suspended. When the aspect ratio R is greater than T3 and equal to or less than T4 (Equation 5), the character block is determined to be landscape or standard aspect ratio, and the final determination as to which is the character block is suspended.
Thereafter, the process proceeds to step S208.

【００２３】[0023]

【数２】 (Equation 2)

【００２４】漢字を含む文字原稿については、閾値を例
えばＴ１＝０．４、Ｔ２＝０．６、Ｔ３＝１．４、Ｔ４
＝２．５と設定して良好な認識結果が得られている。For character originals containing kanji, the threshold values are, for example, T1 = 0.4, T2 = 0.6, T3 = 1.4, T4
= 2.5, a good recognition result is obtained.

【００２５】図３の文字ブロック３０１は縦横比Ｒ＝
２.０の例であり、ステップＳ２０８により処理され
る。The character block 301 shown in FIG.
2.0, which is processed in step S208.

【００２６】ステップＳ２０５：文字ブロックが標準縦
横比の範囲内であるので、文字ブロックの縦横寸法がそ
れぞれ正規化サイズになるように正規化を実行する。こ
の場合、文字ブロックは多少の変形が生じることもある
が、認識率に悪影響が生じない程度の変形であるので問
題が生じることはない。Step S205: Since the character block is within the range of the standard aspect ratio, normalization is performed so that the vertical and horizontal dimensions of the character block become the normalized size. In this case, the character block may be slightly deformed, but there is no problem because the character block is deformed to such an extent that the recognition rate is not adversely affected.

【００２７】ステップＳ２０６：縦長または横長の文字
ブロックについては、より長い方の長さ、すなわち縦長
の文字ブロックではその高さ、横長の文字ブロックでは
その幅を正規化サイズに一致させ、縦横比Ｒを保持する
ように他の長さを決定する。これによって文字の縦横比
Ｒの特徴が保存され、文字ブロック変形による認識率低
下を防止し得る。なお、正規化サイズに対して、縦長文
字では左右、横長文字では上下の長さが足りないので、
文字が中央に位置するように左右余白、または上下余白
を付加して正規化サイズに一致させて良好な認識結果を
得られるようにする。Step S206: For a vertically long or horizontally long character block, the longer length, that is, the height of a vertically long character block, and the width of a horizontally long character block are matched with the normalized size, and the aspect ratio R Determine the other length to hold As a result, the characteristics of the character's aspect ratio R are preserved, and a reduction in the recognition rate due to the character block deformation can be prevented. Note that, for the normalized size, the vertical characters are not enough left and right, and the horizontal characters are not enough up and down.
Left and right margins or upper and lower margins are added so that the character is located at the center, and the characters are made to match the normalized size so that good recognition results can be obtained.

【００２８】ステップＳ２０８：文字ブロックが縦長あ
るいは標準縦横比であると判断したとき、または、横長
あるいは標準縦横比であると判断したとき、ステップＳ
２０５における標準縦横比の正規化処理と同様の正規化
処理と、ステップＳ２０６における縦長文字ブロック
（または横長文字ブロック）の正規化処理と同様の正規
化処理との両者を実行する。Step S208: When it is determined that the character block has the portrait or standard aspect ratio, or when it is determined that the character block has the landscape or standard aspect ratio, the process proceeds to step S208.
Both normalization processing similar to the normalization processing of the standard aspect ratio in 205 and normalization processing similar to the normalization processing of the vertical character block (or the horizontal character block) in step S206 are executed.

【００２９】例えば、図３の文字ブロック３０１（Ｒ＝
２．０）については、標準縦横比に対する正規化処理
（正規化結果を図３のａの正規化画像３０２で示す。）
と横長文字ブロックに対する正規化処理（正規化結果を
図３のｂの正規化画像３０３で示す。）を実行する。For example, the character block 301 (R = R
2.0), the normalization process for the standard aspect ratio (the normalization result is shown by the normalized image 302 in FIG. 3A).
And a normalization process for the horizontally long character block (the normalization result is shown by a normalized image 303 in FIG. 3B).

【００３０】ステップＳ２０７：ステップＳ２０５、ま
たはステップＳ２０６の正規化処理が終了すると、ステ
ップＳ２０７で正規化後の画像を識別演算する。識別演
算は、正規化画像から特徴データ抽出を行い、認識辞書
に格納された字種ごとの標準パタンとの類似度を計算
し、ステップＳ２１１に移行して、最も類似度が高い文
字を認識結果とする。Step S207: When the normalization processing in step S205 or step S206 is completed, in step S207, the image after the normalization is identified and calculated. In the identification operation, feature data is extracted from the normalized image, the similarity with the standard pattern for each character type stored in the recognition dictionary is calculated, and the process proceeds to step S211 to determine the character having the highest similarity. And

【００３１】ステップＳ２０９：ステップＳ２０８の正
規化処理で２つの正規化画像が得られたら、ステップＳ
２０９で両方の正規化画像を識別演算し、ステップＳ２
１０に移行する。Step S209: If two normalized images are obtained by the normalization processing in step S208, step S209
In step 209, both normalized images are identified and calculated, and in step S2
Move to 10.

【００３２】ステップＳ２１０：ステップＳ２０９の２
つの識別演算結果を比較し、いずれか類似度が高かった
方の正規化結果、類似度を採用し、ステップＳ２１１に
移行する。Step S210: Step S209-2
The two identification calculation results are compared, and the normalized result with the higher similarity and the similarity are adopted, and the process proceeds to step S211.

【００３３】ステップＳ２１１：正規化選択、識別演算
の結果に基づいて文字ブロックに最も類似する文字を決
定する。Step S211: The character most similar to the character block is determined based on the results of the normalization selection and the identification operation.

【００３４】以上の文字認識処理の処理過程において
は、２通りの正規化処理方法が実行されることがある
が、該文字認識処理に用いる認識辞書を作成する際は正
規化方法が複数存在すると辞書の構成が著しく複雑にな
り、あるいは標準パタンを一様に決定できないことがあ
るので、本実施形態では認識辞書を作成する際は字種ご
とに一定の正規化方法を選択（つまり、同じ文字コード
となる複数の学習用文字画像データを用いて認識辞書を
作成するが、同じ文字コードとなる学習データに対して
は同じ正規化方法を選択）するものとする。正規化方法
の選択に際しては、字種ごとに認識辞書の学習データに
おける縦横比の最頻値あるいは平均値に近い縦横比を採
用し得る。なお経験則に基づいた縦横比決定も可能であ
る。In the above-described character recognition processing, two normalization methods may be executed. However, when a recognition dictionary used for the character recognition processing is created, there are a plurality of normalization methods. In this embodiment, when a recognition dictionary is created, a certain normalization method is selected for each character type (that is, the same character is used) because the configuration of the dictionary becomes significantly complicated or the standard pattern cannot be determined uniformly. A recognition dictionary is created using a plurality of learning character image data serving as codes, but the same normalization method is selected for learning data having the same character code. When selecting the normalization method, an aspect ratio close to the mode value or the average value of the aspect ratio in the learning data of the recognition dictionary can be adopted for each character type. It is also possible to determine the aspect ratio based on a rule of thumb.

【００３５】以上説明したように、第１実施形態では、
文字ブロックごとに、認識辞書との類似度を考慮した最
適な正規化方法が選択されるので、一定縦横比の標準パ
タンよりなる認識辞書を用いても、高い認識率を確保し
得る。［第２の実施形態］図４は、図１の文字認識装置が実行
する文字認識方法の第２の実施形態を示すフローチャー
トである。As described above, in the first embodiment,
Since an optimal normalization method is selected for each character block in consideration of the degree of similarity with the recognition dictionary, a high recognition rate can be ensured even if a recognition dictionary having a standard pattern with a fixed aspect ratio is used. [Second Embodiment] FIG. 4 is a flowchart showing a second embodiment of the character recognition method executed by the character recognition device of FIG.

【００３６】第２の実施形態は文字ブロックそのもので
はなく特徴データを正規化するもので、文字画像の縦横
比に応じて正規化方法を変更し、正規化方法の決定が容
易でない場合に、暫定的に複数の正規化方法を用いて類
似度を計算し、計算結果に基づいて最終的な正規化方法
を決定する。The second embodiment normalizes feature data instead of the character block itself, and changes the normalization method according to the aspect ratio of the character image. The similarity is calculated using a plurality of normalization methods, and the final normalization method is determined based on the calculation results.

【００３７】ステップＳ４０１：第１実施形態のステッ
プＳ２０１と同様、画像を入力する。Step S401: An image is input as in step S201 of the first embodiment.

【００３８】ステップＳ４０２：第１実施形態のステッ
プＳ２０２と同様、文字行を抽出する。Step S402: Similar to step S202 of the first embodiment, a character line is extracted.

【００３９】ステップＳ４０３：第１実施形態のステッ
プＳ２０３と同様、文字ブロックを抽出する。Step S403: As in step S203 of the first embodiment, a character block is extracted.

【００４０】ステップＳ４０４：抽出した文字ブロック
の縦横比Ｒを求めて、縦横比Ｒを所定の閾値Ｔ１、Ｔ２
（Ｔ１＜Ｔ２）と比較する。ＲがＴ１以上であり、かつ
Ｔ２以下のとき（式６）、文字ブロックは標準縦横比の
範囲内であると判断する。ＲがＴ１より小のとき（式
７）、文字ブロックは縦長あるいは標準縦横比のいずれ
かに属すると判断し、ＲがＴ２より大のとき（式８）、
文字ブロックは横長あるいは標準縦横比のいずれかに属
すると判断する。文字ブロックが標準縦横比であると判
断したとき（Ｔ１≦Ｒ≦Ｔ２）はステップＳ４０５に移
行し、縦長のとき（Ｒ＜Ｔ１）はステップＳ４０８、横
長のとき（Ｒ＞Ｔ２）はステップＳ４０９に移行する。
漢字を含む文字原稿については、閾値を例えばＴ１＝
０．４、Ｔ２＝２．５と設定して良好な認識結果が得ら
れている。Step S404: The aspect ratio R of the extracted character block is determined, and the aspect ratio R is set to a predetermined threshold value T1, T2.
(T1 <T2). When R is equal to or greater than T1 and equal to or less than T2 (Equation 6), it is determined that the character block is within the range of the standard aspect ratio. When R is smaller than T1 (Equation 7), it is determined that the character block belongs to either the portrait or standard aspect ratio. When R is larger than T2 (Equation 8),
It is determined that the character block belongs to either the landscape or the standard aspect ratio. If it is determined that the character block has the standard aspect ratio (T1 ≦ R ≦ T2), the process proceeds to step S405. If the character block is portrait (R <T1), the process proceeds to step S408. If it is landscape (R> T2), the process proceeds to step S409. Transition.
For a character document containing kanji, the threshold value is set to, for example,
Good recognition results are obtained by setting 0.4 and T2 = 2.5.

【００４１】[0041]

【数３】ステップＳ４０５：文字ブロック内の画像の輪郭を抽出
し、輪郭を形成する境界画素に、図８の方向指数を与え
ることによって、文字ブロックの特徴データを抽出す
る。文字ブロック内の境界画素は左から右の走査を上方
から下方に順次進める順序で評価され、注目されている
境界画素Ｐ０に続く画素の方向は、注目画素の周囲８画
素のうち対向しない４方向（例えば下、右下、右、右
上）のいずれに存在するか表現すれば、該特徴データを
表現するのに十分である。これらの方向は、例えばＣ１
（下）、Ｃ２（右下）、Ｃ３（右）、Ｃ４（右上）の方
向指数によって表現される。(Equation 3) Step S405: The contour of the image in the character block is extracted, and the characteristic data of the character block is extracted by giving the direction index of FIG. 8 to the boundary pixels forming the contour. The boundary pixels in the character block are evaluated in the order in which the scanning from left to right proceeds sequentially from top to bottom, and the direction of the pixel following the boundary pixel P0 of interest is the four non-opposite directions among the eight pixels surrounding the pixel of interest. (Eg, lower, lower right, right, upper right) is sufficient to express the feature data. These directions are, for example, C1
(Lower), C2 (lower right), C3 (right), C4 (upper right) are represented by directional indices.

【００４２】図７に示すように、文字ブロック７０１
（ひらがな「い」１文字の画像を示す。）の幅をｗ、高
さをｈとするとき、文字ブロック７０１はその幅、高さ
を所定数に分割し、複数の小領域を形成する。分割数は
例えば、幅方向分割数ｎ＝４、高さ方向分割数ｍ＝４で
ある。As shown in FIG. 7, a character block 701
Assuming that the width of (a single character image of Hiragana "i") is w and the height is h, the character block 701 divides the width and height into a predetermined number to form a plurality of small areas. The number of divisions is, for example, n = 4 in the width direction and m = 4 in the height direction.

【００４３】図７では１個の小領域７０２を代表的に示
し、小領域内の画素の内、境界画素に斜線を付し、かつ
Ｐ１、Ｐ２、Ｐ３、Ｐ４、Ｐ５、Ｐ６、Ｐ７、Ｐ８の符
号を与えている。これら境界画素は、左から右、上から
下の走査順序でみるとき、画素Ｐ１に続いて画素Ｐ２が
存在し、画素Ｐ２に続いて画素Ｐ３が存在する。また画
素Ｐ４に続いて画素Ｐ５が存在し、画素Ｐ５に続いて画
素Ｐ６が存在し、画素Ｐ６に続いて画素Ｐ７が存在し、
画素Ｐ７に続いて画素Ｐ８が存在する。この連続状態
は、画素Ｐ１に方向指数Ｃ２、画素Ｐ２に方向指数Ｃ
２、画素Ｐ４に方向指数Ｃ２、画素Ｐ５に方向指数Ｃ
３、画素Ｐ６に方向指数Ｃ２、画素Ｐ７に方向指数Ｃ２
を与えることによって表現される。In FIG. 7, one small area 702 is shown as a representative, and among the pixels in the small area, boundary pixels are shaded, and P1, P2, P3, P4, P5, P6, P7, P8 Is given. When these boundary pixels are viewed in the scanning order from left to right and from top to bottom, a pixel P2 follows the pixel P1, and a pixel P3 follows the pixel P2. Further, a pixel P5 exists following the pixel P4, a pixel P6 exists following the pixel P5, a pixel P7 exists following the pixel P6,
Following the pixel P7, there is a pixel P8. In this continuous state, the direction index C2 is applied to the pixel P1, and the direction index C
2. Direction index C2 for pixel P4, direction index C for pixel P5
3, the direction index C2 for the pixel P6 and the direction index C2 for the pixel P7
Is expressed by giving

【００４４】各小領域では方向指数を統計的に処理し、
例えば各方向指数の個数をその小領域の特徴データとす
る。小領域７０２ではＣ２が５回、Ｃ３が１回生じてい
るので特徴データを｛Ｃ１，Ｃ２，Ｃ３，Ｃ４｝とする
と、｛０，５，１，０｝となる。あるいは、最頻の方向
指数を該小領域の特徴データとすると、画像の大局的方
向が表現される。In each small area, the direction index is statistically processed,
For example, the number of each direction index is set as the feature data of the small area. Since C2 occurs five times and C3 occurs once in the small area 702, if the feature data is {C1, C2, C3, C4}, it will be {0, 5, 1, 0}. Alternatively, if the most frequent direction index is the feature data of the small area, the global direction of the image is expressed.

【００４５】ステップＳ４０６：ステップＳ４０５で求
められた小領域ごとの特徴データを正規化しつつ、文字
ブロック全体の特徴データを算出する。図３と同様に正
規化サイズをＮ×Ｎとするとき、文字ブロックは横方向
に（Ｎ／ｗ）倍、縦方向に（Ｎ／ｈ）倍に拡大される。
この拡大率に応じて特徴データにバイアスをかけること
により、特徴データを正規化し得る。斜め方向の方向指
数については、文字ブロック７０１と等面積の正方形を
想定し、この正方形と正規化サイズとの比率を乗じるも
のとする。すなわち、各方向指数に式９、式１０、式１
１、式１２示す係数を乗じて、正規化された特徴データ
Ｃｎ１、Ｃｎ２、Ｃｎ３、Ｃｎ４を算出する。Step S406: The feature data of the entire character block is calculated while normalizing the feature data of each small area obtained in step S405. Assuming that the normalized size is N × N as in FIG. 3, the character block is enlarged (N / w) times in the horizontal direction and (N / h) times in the vertical direction.
By biasing the feature data according to the enlargement ratio, the feature data can be normalized. Regarding the direction index in the oblique direction, a square having the same area as the character block 701 is assumed, and the ratio between the square and the normalized size is multiplied. That is, Equation 9, Equation 10, Equation 1 are assigned to each direction index.
1. The normalized feature data Cn1, Cn2, Cn3, and Cn4 are calculated by multiplying the coefficients shown in Equation 12 by 1.

【００４６】[0046]

【数４】ステップＳ４０７：ステップＳ４０６で正規化された特
徴データに基づき文字ブロックの識別演算を実行する。
識別演算は、正規化された特徴データについて、認識辞
書に格納された字種ごとの標準パタンとの類似度を計算
し、ステップＳ４１４に移行して最も類似度が高い文字
を認識結果とする。(Equation 4) Step S407: A character block identification calculation is performed based on the feature data normalized in step S406.
The discrimination calculation calculates the similarity between the normalized feature data and the standard pattern for each character type stored in the recognition dictionary, and proceeds to step S414 to determine the character having the highest similarity as the recognition result.

【００４７】ステップＳ４０８：縦横比ＲがＴ１より小
のとき、文字ブロックは縦長あるいは標準縦横比のいず
れかと判断し、そのいずれであるかの最終判断は保留す
る。図６に示すように、文字ブロックａが縦長の可能性
があるときは、文字ブロックａの左右に文字ブロック高
さｈに応じた余白をそれぞれ追加した文字ブロックｂを
生成する。各余白の幅は（ｈ×Ｔ１−ｗ）／２であり、
結果的に文字ブロックの幅はｈ×Ｔ１となる。これは高
さｈのときに標準縦横比範囲内とされる最小幅であり、
余白追加後の文字ブロックは縦横比Ｒ＝Ｔ１となる。以
後ステップＳ４１０に移行する。Step S408: When the aspect ratio R is smaller than T1, the character block is determined to be either the portrait or the standard aspect ratio, and the final decision as to which is the character block is suspended. As shown in FIG. 6, when there is a possibility that the character block a is vertically long, a character block b is generated by adding a margin corresponding to the character block height h to the left and right of the character block a. The width of each margin is (h × T1-w) / 2,
As a result, the width of the character block is h × T1. This is the minimum width that is within the standard aspect ratio range at height h,
The character block after adding the margin has the aspect ratio R = T1. Thereafter, the process proceeds to step S410.

【００４８】ステップＳ４０９：縦横比ＲがＴ２より大
のとき、文字ブロックは横長あるいは標準縦横比のいず
れかと判断し、そのいずれであるかの最終判断は保留す
る。図５に示すように、文字ブロックａが横長の可能性
があるときは、文字ブロックａの上下に文字ブロック幅
ｗに応じた余白をそれぞれ追加した文字ブロックｂを生
成する。各余白の高さは（ｗ／Ｔ２−ｈ）／２であり、
結果的に文字ブロックｂの幅はｗ／Ｔ２となる。これは
幅ｗのときに標準縦横比範囲内とされる最小高さであ
り、余白追加後の文字ブロックは縦横比Ｒ＝Ｔ２とな
る。以後ステップＳ４１０に移行する。Step S409: When the aspect ratio R is larger than T2, the character block is determined to be either the landscape or the standard aspect ratio, and the final determination as to which is the character block is suspended. As shown in FIG. 5, when there is a possibility that the character block a is horizontally long, a character block b is generated by adding a margin corresponding to the character block width w above and below the character block a. The height of each margin is (w / T2-h) / 2,
As a result, the width of the character block b becomes w / T2. This is the minimum height within the standard aspect ratio range when the width is w, and the character block after adding the margin has the aspect ratio R = T2. Thereafter, the process proceeds to step S410.

【００４９】ステップＳ４１０：ステップＳ４０８、Ｓ
４０９において余白追加の処理がされた文字ブロック
（図５、図６における文字ブロックｂ）および余白が追
加されていない文字ブロック（図５、図６における文字
ブロックａ）の両者について特徴データを抽出する。特
徴データ抽出の処理はステップＳ４０５と同様である。
以後ステップＳ４１１に移行する。Step S410: Step S408, S
At 409, feature data is extracted for both the character block to which margin addition processing has been performed (character block b in FIGS. 5 and 6) and the character block to which no margin has been added (character block a in FIGS. 5 and 6). . The process of extracting characteristic data is the same as in step S405.
Thereafter, the process proceeds to step S411.

【００５０】ステップＳ４１１：余白追加処理がされた
文字ブロック（図５、図６における文字ブロックｂ）お
よび余白追加のない文字ブロック（図５、図６における
文字ブロックａ）の両者について特徴データを正規化す
る。正規化の処理はステップＳ４０６と同様である。以
後ステップＳ４１２に移行する。Step S411: Normalize the characteristic data for both the character block (character block b in FIGS. 5 and 6) subjected to the margin addition processing and the character block without the margin addition (character block a in FIGS. 5 and 6). Become The normalization process is the same as in step S406. Thereafter, the process proceeds to step S412.

【００５１】ステップＳ４１２：余白追加処理がされた
文字ブロック（図５、図６における文字ブロックｂ）お
よび余白追加のない文字ブロック（図５、図６における
文字ブロックａ）の両者について識別演算を実行する。
識別演算はステップＳ４０７と同様である。以後ステッ
プＳ４１３に移行する。Step S412: Identification calculation is performed for both the character block subjected to the margin addition processing (character block b in FIGS. 5 and 6) and the character block without margin addition (character block a in FIGS. 5 and 6). I do.
The identification calculation is the same as in step S407. Thereafter, the flow shifts to step S413.

【００５２】ステップＳ４１３：余白追加処理がされた
文字ブロック（図５、図６における文字ブロックｂ）お
よび余白追加のない文字ブロック（図５、図６における
文字ブロックａ）の両者について識別演算結果、すなわ
ち類似度を比較し、より類似度が高い文字ブロックを採
用する。Step S413: The result of the discrimination calculation is performed on both the character block subjected to the margin addition processing (the character block b in FIGS. 5 and 6) and the character block without the margin addition (the character block a in FIGS. 5 and 6). That is, the similarities are compared, and a character block having a higher similarity is adopted.

【００５３】ステップＳ４０８、Ｓ４０９では縦横比に
対する判断は保留されていたが、この段階で最終的な判
断がなされ、以後ステップＳ４１４に移行する。In steps S408 and S409, the judgment on the aspect ratio has been suspended, but a final judgment is made at this stage, and the process proceeds to step S414.

【００５４】ステップＳ４１４：ステップＳ４０７で算
出され、またはステップＳ４１３で選択された識別演算
結果に基づいて、文字認識結果を決定する。Step S414: A character recognition result is determined based on the result of the identification calculation calculated in step S407 or selected in step S413.

【００５５】第２実施形態では文字ブロック画像自体を
正規化しないので、文字ブロック内画像の輪郭における
微細な凹凸等が拡大する等、ノイズ成分増幅の悪影響は
生じない。In the second embodiment, since the character block image itself is not normalized, there is no adverse effect of noise component amplification, such as enlargement of fine irregularities in the outline of the image in the character block.

【００５６】また縦長、横長の文字ブロックには必要最
小限の余白追加により標準縦横比の文字ブロックと同様
の処理を可能とするので、画像を変形させることなく、
一定縦横寸法の標準パタンよりなる認識辞書を採用でき
る。従って文字ブロック変形に起因した認識率の低下を
招くことはない。Also, by adding the minimum necessary margins to the vertically and horizontally long character blocks, the same processing as for the character blocks having the standard aspect ratio can be performed.
It is possible to use a recognition dictionary composed of standard patterns having a constant vertical and horizontal dimension. Therefore, the recognition rate does not decrease due to the character block deformation.

【００５７】なお、第２実施形態では、ステップＳ４０
４において、所定の閾値Ｔ１，Ｔ２を用いて、標準縦横
比範囲内の文字ブロックと、扁平（縦長または横長）あ
るいは標準縦横比のいずれかに属する文字ブロックとに
分けて判断するようにしたが、第１実施形態と同様に所
定の閾値Ｔ１，Ｔ２，Ｔ３，Ｔ４の４つの閾値を用い
て、標準縦横批判以内の文字ブロックと、扁平（縦長ま
たは横長）な文字ブロックと、扁平（縦長または横長）
あるいは標準縦横比のいずれかに属する文字ブロックと
に分けて処理するようにしてもよい。その場合、扁平な
文字ブロックと判断された場合は、上述したように余白
追加処理を行って特徴データを抽出し、更に該特徴デー
タを正規化してから識別演算を行うことになる。In the second embodiment, step S40
In 4, the determination is made by using the predetermined threshold values T1 and T2 separately for character blocks within the standard aspect ratio range and character blocks belonging to either the flat (vertical or horizontal) or standard aspect ratio. In the same manner as in the first embodiment, using the four threshold values T1, T2, T3, and T4, a character block within standard vertical and horizontal criticism, a flat (vertical or horizontal) character block, and a flat (vertical or horizontal) Landscape)
Alternatively, the processing may be performed separately for character blocks belonging to any of the standard aspect ratios. In this case, if it is determined that the block is a flat character block, the margin addition processing is performed as described above to extract the characteristic data, and the characteristic data is further normalized before performing the identification operation.

【００５８】なお、第２実施形態における特徴データ、
正規化は一例を示したものであり、特徴データとして画
像密度等を用いてもよく、また文字ブロックの分割数や
分割方法、正規化方法などはこれに限るものではない。
例えば、上述した例では方向指数として、Ｃ１（下）、
Ｃ２（右下）、Ｃ３（右）、Ｃ４（右上）を用いていた
が、Ｃ１（左下）、Ｃ２（下）、Ｃ３（右下）、Ｃ４
（右）というような方向指数を用いて評価するようにし
ても構わない。Note that the feature data in the second embodiment
The normalization is an example, and the image density or the like may be used as the feature data. The number of divisions, the division method, and the normalization method of the character block are not limited thereto.
For example, in the example described above, the direction index is C1 (lower),
C2 (lower right), C3 (right) and C4 (upper right) were used, but C1 (lower left), C2 (lower), C3 (lower right), C4
The evaluation may be performed using a direction index such as (right).

【００５９】［プログラム］図１の文字認識装置におけ
る制御プログラムには文字認識方法の第１の実施形態お
よび／または第２の実施形態を実行するためのプログラ
ムが含まれる。第１の実施形態を実施するためのプログ
ラムはステップＳ２０１〜Ｓ２１１の処理を実行するル
ーチンを含み、第２の実施形態を実施するためのプログ
ラムはステップＳ４０１〜Ｓ４１４の処理を実行するル
ーチンを含む。[Program] The control program in the character recognition apparatus shown in FIG. 1 includes a program for executing the first and / or second embodiment of the character recognition method. The program for implementing the first embodiment includes a routine for executing the processing of steps S201 to S211 and the program for implementing the second embodiment includes a routine for executing the processing of steps S401 to S414.

【００６０】［記憶媒体］本発明にかかる文字認識方法
は図１の文字認識装置以外の装置、例えば汎用的なコン
ピュータでも実施可能であり、その場合、記憶媒体等で
提供される制御プログラムを汎用コンピュータの外部記
憶装置にセットし、オペレータ等の指示によりＣＰＵ等
で実行してもよい。[Storage Medium] The character recognition method according to the present invention can be implemented by a device other than the character recognition device shown in FIG. 1, for example, a general-purpose computer. It may be set in an external storage device of a computer and executed by a CPU or the like in accordance with an instruction from an operator or the like.

【００６１】図９は、文字認識方法の第１および第２の
実施形態を実行するためのプログラムを記憶した記憶媒
体を示すメモリマップである。FIG. 9 is a memory map showing a storage medium storing a program for executing the first and second embodiments of the character recognition method.

【００６２】図９においてアドレスＡ０乃至Ａ１にはデ
ィレクトリ情報が格納され、Ａ１の次のアドレスからア
ドレスＡ２までの領域には第１の実施形態を実行するた
めのプログラムが格納され、Ａ２の次のアドレスからア
ドレスＡ３までの領域には第２の実施形態を実行するた
めのプログラムが格納されている。In FIG. 9, directory information is stored in addresses A0 to A1, a program for executing the first embodiment is stored in an area from an address next to A1 to address A2, and a program next to A2 is stored. A program for executing the second embodiment is stored in an area from an address to an address A3.

【００６３】特に図示しないが、各種プログラムに従属
するデータもディレクトリ情報として管理し、また、各
種プログラムをコンピュータにインストールするための
プログラム等をディレクトリ情報内に記憶してもよい。Although not shown, data dependent on various programs may be managed as directory information, and a program for installing various programs in a computer may be stored in the directory information.

【００６４】［文字認識システム］図１０は、本発明に
係る文字認識方法を複数の機器により実行する文字認識
システムの一実施形態を示すブロック図である。[Character Recognition System] FIG. 10 is a block diagram showing an embodiment of a character recognition system in which the character recognition method according to the present invention is executed by a plurality of devices.

【００６５】この実施形態では、インターネット・サー
ビス・プロバイダにおいて、文字認識のための認識辞書
を含むプログラムがサーバＳＶ１に格納されている。イ
ンターネット・サービス・プロバイダは通信手段を介し
てインターネットに接続され、インターネットからサー
バＳＶ１にアクセスすることによって、複数のユーザが
文字認識プログラムを利用し得る。In this embodiment, in the Internet service provider, a program including a recognition dictionary for character recognition is stored in the server SV1. The Internet service provider is connected to the Internet via communication means, and a plurality of users can use the character recognition program by accessing the server SV1 from the Internet.

【００６６】ユーザはスキャナＳＣ１、パーソナル・コ
ンピュータＰＣ１を有し、スキャナＳＣ１で読み取った
原稿の画像をサーバＳＶ１に送信する。サーバにおいて
は、原稿の画像を入力画像として文字認識の処理を実行
する。The user has a scanner SC1 and a personal computer PC1, and transmits an image of a document read by the scanner SC1 to the server SV1. The server executes character recognition processing using the image of the document as an input image.

【００６７】なお認識辞書は、必ずしも文字認識プログ
ラムと同一のコンピュータ上に存在する必要はなく、通
信手段等を介して参照可能であればよい。例えば、文字
認識プログラムをサーバＳＶ１、認識辞書を別個のサー
バＳＶ２に設け、別個のサービスとすることも可能であ
る。この場合文字認識プログラムの保有者、認識辞書の
保有者それぞれがプログラム、辞書という知的資産を内
部に保有でき、違法コピー等を防止できる。Note that the recognition dictionary does not necessarily need to be present on the same computer as the character recognition program, and it is sufficient that the recognition dictionary can be referred to via communication means or the like. For example, the character recognition program may be provided on the server SV1 and the recognition dictionary may be provided on the separate server SV2, and may be provided as separate services. In this case, each of the holder of the character recognition program and the holder of the recognition dictionary can hold intellectual assets such as programs and dictionaries therein, thereby preventing illegal copying and the like.

【００６８】[0068]

【発明の効果】本発明によれば、一定縦横寸法の標準パ
タンの認識辞書を用いて、扁平度の大きい文字に対して
も高い認識率を得ることができる。According to the present invention, a high recognition rate can be obtained even for a character having a large flatness by using a recognition dictionary of a standard pattern having a constant vertical and horizontal dimension.

[Brief description of the drawings]

【図１】本発明に係る文字認識装置の一実施形態を示
すブロック図である。FIG. 1 is a block diagram showing an embodiment of a character recognition device according to the present invention.

【図２】図１の文字認識装置において実施される文字
認識方法の第１実施形態を示すフローチャートである。FIG. 2 is a flowchart illustrating a first embodiment of a character recognition method performed by the character recognition device of FIG. 1;

【図３】図２の文字認識方法における正規化の概要を
示す概念図である。FIG. 3 is a conceptual diagram showing an outline of normalization in the character recognition method of FIG.

【図４】図１の文字認識装置において実施される文字
認識方法の第２実施形態を示すフローチャートである。FIG. 4 is a flowchart illustrating a second embodiment of a character recognition method performed by the character recognition device of FIG. 1;

【図５】図４の文字認識方法における横長文字の正規
化方法の詳細を示す概念図である。FIG. 5 is a conceptual diagram showing details of a method for normalizing a horizontally long character in the character recognition method of FIG. 4;

【図６】図４の文字認識方法における縦長文字の正規
化方法の詳細を示す概念図である。FIG. 6 is a conceptual diagram showing details of a method for normalizing vertically long characters in the character recognition method of FIG. 4;

【図７】図４の文字認識方法における特徴量を示す概
念図である。FIG. 7 is a conceptual diagram showing feature amounts in the character recognition method of FIG.

【図８】図７の特徴量設定のための方向指数を示す概
念図である。FIG. 8 is a conceptual diagram showing a direction index for setting the feature amount in FIG. 7;

【図９】図１の文字認識装置で実行される各種データ
処理プログラムを格納する記憶媒体のメモリマップを示
す概念図である。9 is a conceptual diagram showing a memory map of a storage medium that stores various data processing programs executed by the character recognition device of FIG.

【図１０】本発明に係る文字認識方法を複数の機器に
より実行する文字認識システムを示すブロック図であ
る。FIG. 10 is a block diagram showing a character recognition system in which a character recognition method according to the present invention is executed by a plurality of devices.

[Explanation of symbols]

１０１・・・ＣＰＵ１０２・・・ＲＯＭ１０３・・・ＲＡＭ１０４・・・外部記憶装置１０５・・・ディスプレイ１０６・・・キーボード１０７・・・ポインティングデバイス１０８・・・イメージスキャナ１０９・・・ネットワークインターフェース 101 ... CPU 102 ROM 103 RAM 104: External storage device 105 ・・・ Display 106 ... keyboard 107 Pointing device 108 ・・・ Image scanner 109 ・・・ Network interface

Claims

[Claims]

1. A recognition dictionary storing characteristic data of a standard pattern of characters, a character block extracting means for extracting a character block from image data including a character image, and calculating an aspect ratio of the extracted character block; A normalization procedure determining unit that determines a normalization procedure based on the aspect ratio; and a feature data obtaining unit that obtains feature data of the character block using the normalization procedure determined by the normalization procedure determination unit. A similarity calculating unit that calculates a similarity by comparing the feature data obtained by the feature data obtaining unit with the feature data of the standard pattern, wherein the normalization procedure determining unit includes: Based on the calculated aspect ratio, the character block extracted by the character block extracting means is converted into an aspect ratio at least substantially equal to the aspect ratio of the standard pattern (hereinafter, referred to as the standard pattern). It is determined whether the character block is a character block of a character block having a standard aspect ratio or a flat character block or a character block having a standard aspect ratio. For the character block determined as the character block of the aspect ratio, it is determined to use the normalization procedure for the character block of the standard aspect ratio, and the possibility of both the flat character block and the character block of the standard aspect ratio is determined. A character recognition apparatus characterized in that a character block determined as a certain character block is determined to use both the normalization procedure for the standard aspect ratio character block and the normalization procedure for a flat character block. .

2. The normalization procedure determining unit determines a character block extracted by the character block extracting unit based on the calculated aspect ratio as a character block having a standard aspect ratio,
Judge which character block is a flat character block, or a character block that can be either a flat character block or a character block having a standard aspect ratio, and judge it as a character block having the standard aspect ratio. The determined character block is determined to use a normalization procedure for a character block having a standard aspect ratio, and the character block determined to be a flat character block is determined to use a normalization procedure for a flat character block. The character block which is determined as a character block which may be either the flat character block or the standard aspect ratio character block is determined by the normalization procedure for the standard aspect ratio character block and the flat character block. 2. The character recognition apparatus according to claim 1, wherein the determination is made to use both of the normalization procedures for use.

3. The feature data obtaining means extracts the feature data after normalizing the character block using the normalization procedure determined by the normalization procedure determination means, thereby obtaining the feature of the character block. 3. The character recognition device according to claim 1, wherein data is obtained.

4. The feature data obtaining unit extracts the feature data from the character block using the normalization procedure determined by the normalization procedure determination unit, and then normalizes the feature data, thereby obtaining the feature data. 3. The character recognition device according to claim 1, wherein characteristic data of the character block is obtained.

5. The feature data acquiring means for acquiring feature data using a normalization procedure for a character block having a standard aspect ratio means that a character block is deformed so that an aspect ratio matches a standard pattern. Is to obtain the feature data by extracting the feature data from the normalized character block image obtained by the above, and obtaining the feature data by using the normalization procedure for the flat character block, Normalization obtained by matching the longer side of the vertical or horizontal length to the corresponding side length of the standard pattern while maintaining the aspect ratio of, and adding a margin so that the shorter side becomes the normalized size 4. The character recognition device according to claim 3, wherein the characteristic data is obtained by extracting characteristic data from the character block image. Sense device.

6. The feature data acquiring means for acquiring feature data using a normalization procedure for a character block having a standard aspect ratio includes extracting feature data from the character block and extracting the extracted feature data. Acquiring the feature data by normalizing, and acquiring the feature data using the normalization procedure for the flat character block is to add a predetermined margin to the short side of the character block. 5. The character recognition apparatus according to claim 4, wherein the characteristic data is obtained by extracting characteristic data from the character block to which the margin has been added, and then normalizing the extracted characteristic data.

7. The normalization of the feature data is to apply a bias to the feature data extracted from the character block based on the ratio between the vertical and horizontal dimensions of the character block and the vertical and horizontal dimensions of the standard pattern. The character recognition device according to claim 4, wherein

8. The apparatus according to claim 1, wherein the feature data is a set of feature data obtained by dividing a character block into a plurality of small areas and obtaining the data for each of the small areas. Character recognition device.

9. The character recognition device according to claim 1, wherein the feature data is obtained by expressing a continuous state of boundary pixels of a character outline by a direction index.

10. A character block extracting step of extracting a character block from image data including a character image, calculating an aspect ratio of the extracted character block, and determining a normalization procedure based on the aspect ratio. A feature data obtaining step of obtaining feature data of the character block using the normalization procedure determined in the normalization procedure determining step; a feature data obtained in the feature data obtaining step; A similarity calculation step of calculating a similarity by comparing with the feature data of the standard pattern stored in the dictionary, wherein the normalization procedure determining step includes: The character block extracted in the character block extraction step is based on the aspect ratio (at least approximately equal to the aspect ratio of the standard pattern). Below, referred to as a “standard aspect ratio”), and a character block that can be either a flat character block or a character block with a standard aspect ratio. For the character block determined as the character block having the standard aspect ratio, it is determined to use the normalization procedure for the character block having the standard aspect ratio, and either the flat character block or the character block having the standard aspect ratio is possible. A character block determined as having a characteristic character block is determined to use both the normalization procedure for the character block having the standard aspect ratio and the normalization procedure for the flat character block. Recognition method.

11. In the normalization procedure determining step, the character block extracted in the character block extracting step is converted into a character block having a standard aspect ratio, a flat character block, and a flat character block based on the calculated aspect ratio. Character block that is likely to be a character block having a normal aspect ratio or a character block having a standard aspect ratio. Determined to use the normalization procedure for the character block of the aspect ratio, for the character block determined as the flat character block, determined to use the normalization procedure for the flat character block,
For the character block determined as a character block that may be either the flat character block or the standard aspect ratio character block, the normalization procedure for the standard aspect ratio character block and the normalization procedure for the flat character block The character recognition method according to claim 10, wherein a determination is made to use both of the conversion procedures.

12. The feature data obtaining step includes: extracting the feature data after normalizing the character block by using the normalization procedure determined in the normalization procedure determination step; 12. The character recognition method according to claim 10, wherein data is obtained.

13. In the feature data obtaining step, the feature data is extracted from the character block using the normalization procedure determined in the normalization procedure determination step, and then the feature data is normalized. 12. The character recognition method according to claim 10, wherein characteristic data of a character block is obtained.

14. A computer-executable program having a program code for causing a computer to execute each step of the character recognition method according to claim 10.

15. A storage medium storing the program according to claim 14.

16. A communication unit capable of receiving image data including a character image, a reference recognition dictionary, a character block extraction unit for extracting a character block from the received image data, and a vertical and horizontal direction of the extracted character block. A normalization procedure determining unit that calculates a ratio and determines a normalization procedure based on the aspect ratio; and using the normalization procedure determined by the normalization procedure determination unit, obtains the characteristic data of the character block. Character data comprising: a characteristic data obtaining unit; and a similarity calculating unit configured to calculate a similarity by comparing the characteristic data obtained by the characteristic data obtaining unit with the characteristic data of a standard pattern stored in the recognition dictionary. A character recognition device for a recognition system, wherein the normalization procedure determination means includes a sentence extracted by the character block extraction means based on the calculated aspect ratio. A character block may be a character block having at least an aspect ratio substantially equal to the aspect ratio of a standard pattern (hereinafter referred to as a “standard aspect ratio”), a flat character block and a character block having a standard aspect ratio. It is determined which of the character blocks is a character block, and a character block determined as the character block having the standard aspect ratio is determined to use a normalization procedure for a character block having a standard aspect ratio. For the character block determined as a character block that may be either the flat character block or the standard aspect ratio character block, the normalization procedure for the standard aspect ratio character block and the normalization procedure for the flat character block Character recognition device for a character recognition system, characterized in that both are determined to be used.