JP2972443B2

JP2972443B2 - Character recognition device

Info

Publication number: JP2972443B2
Application number: JP4173166A
Authority: JP
Inventors: 義弘北村
Original assignee: Consejo Superior de Investigaciones Cientificas CSIC
Current assignee: Consejo Superior de Investigaciones Cientificas CSIC
Priority date: 1992-06-30
Filing date: 1992-06-30
Publication date: 1999-11-08
Anticipated expiration: 2014-11-08
Also published as: JPH0620096A

Description

【発明の詳細な説明】DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【産業上の利用分野】本発明は、読取った文字画像から
特徴データを抽出し、辞書に予め格納されている標準文
字画像と照合することによって、文字を認識する文字認
識装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a character recognition apparatus for recognizing characters by extracting characteristic data from a read character image and comparing the extracted characteristic data with a standard character image stored in a dictionary in advance.

【０００２】[0002]

【従来の技術】文書の文字情報をコンピュータ処理によ
って認識する文字認識装置として、認識しようとする文
字情報、たとえば日本語文字をＣＣＤ（電荷結合素子）
などを用いて光電変換し、該光電変換された電気信号を
１文字画像単位で切出し、認識部において所定の認識論
理に従って１文字画像ずつ認識を行う、光学的文字認識
装置（ＯＣＲ）が知られている。2. Description of the Related Art As a character recognition device for recognizing character information of a document by computer processing, a CCD (Charge Coupled Device) converts character information to be recognized, for example, Japanese characters.
2. Description of the Related Art An optical character recognition device (OCR) is known, which performs photoelectric conversion by using such a method, cuts out the photoelectrically converted electric signal in units of one character image, and recognizes each character image in accordance with a predetermined recognition logic in a recognition unit. ing.

【０００３】認識の方法としては、文字の始点から輪郭
を追跡し、方向コードを付与し、これを、方向コード列
として辞書中の同様の標準パターンと照合し、類似度順
に候補をあげ、文字コードを出力するなどの認識方法が
ある。[0003] As a recognition method, a contour is traced from the starting point of a character, a direction code is given, and this is collated with a similar standard pattern in a dictionary as a direction code sequence, candidates are listed in order of similarity, and characters are listed. There is a recognition method such as outputting a code.

【０００４】[0004]

【発明が解決しようとする課題】この種の文字認識装置
においては、認識対象となる全ての文字に関して、大文
字、小文字双方の標準パターンを持っており、文字画像
と文字辞書の全ての標準パターンとの照合後に認識され
た結果（文字コード）を出力する認識方法が実施されて
いる。この方法では、大文字、小文字双方の標準パター
ンを持つので、標準パターンの数が増加し、標準パター
ンとの照合に時間がかかるといった欠点がある。In this type of character recognition apparatus, all characters to be recognized have both uppercase and lowercase standard patterns. A recognition method for outputting a result (character code) recognized after the collation is implemented. In this method, since both uppercase and lowercase standard patterns are used, the number of standard patterns increases, and there is a disadvantage that it takes time to collate with the standard patterns.

【０００５】また前述のように、輪郭特徴データを抽出
して文字認識を行う場合、輪郭の多い漢字などの認識に
は好適であるが、ひらがな、かたかな、アルファベット
のうち、大文字と小文字とが相似形である相似形文字の
場合、たとえば「ゆ」と「ゅ」では輪郭の区別がつきに
くく、認識率が低くなってしまう。As described above, when character recognition is performed by extracting contour feature data, it is suitable for recognizing kanji having many contours. However, hiragana, katakana, uppercase and lowercase letters in the alphabet are preferable. Is a similar character with similar shapes, for example, “Y” and “ゅ”, it is difficult to distinguish the outline, and the recognition rate is low.

【０００６】本発明の目的は、前述のような欠点を解消
し、文字認識における認識処理の高速化を図り、大文
字、小文字のより正確な識別を行えるような文字認識装
置を提供することである。SUMMARY OF THE INVENTION An object of the present invention is to solve the above-mentioned drawbacks, speed up the recognition process in character recognition, and provide a character recognition device capable of more accurately distinguishing uppercase and lowercase characters. .

【０００７】[0007]

【課題を解決するための手段】本発明は、複数の文字が
表示されている原稿を読取って画像データに変換して出
力する読取手段と、読取手段からの画像データを記憶す
る画像メモリと、前記画像メモリから各文字画像を順次
抽出する文字抽出手段と、抽出文字画像と照合する標準
文字画像および対応する文字コードが記憶され、幾何学
的に相似の関係にある大文字と小文字とを持つ相似形文
字に関しては、大文字または小文字のいずれか一方の標
準文字画像のみが記憶されている辞書メモリと、抽出文
字画像の大きさを検出する検出手段と、辞書メモリの記
憶内容に基づいて、抽出文字画像を認識し、文字コード
を出力する認識手段と、認識手段からの文字コードと、
前記検出手段からの大きさとを認識結果として記憶する
認識結果メモリと、認識結果メモリの内容に基づいて、
相似形文字の文字コード毎に頻度を計数し、頻度が１以
上である文字の大きさに基づく頻度のヒストグラムを作
成し、該ヒストグラムの分割点をしきい値として大文字
であるか小文字であるかの判定を行う判定手段とを含む
ことを特徴とする文字認識装置である。According to the present invention, there is provided a reading means for reading an original on which a plurality of characters are displayed, converting the original into image data, and outputting the image data; an image memory for storing image data from the reading means; A character extracting means for sequentially extracting each character image from the image memory, a standard character image to be matched with the extracted character image and a corresponding character code are stored, and a similar character having a geometrically similar uppercase and lowercase character is stored. As for the shape characters, a dictionary memory in which only one of the standard character images of uppercase or lowercase is stored, a detecting means for detecting the size of the extracted character image, and an extracted character based on the storage contents of the dictionary memory A recognition unit that recognizes an image and outputs a character code, a character code from the recognition unit,
A recognition result memory for storing the size from the detection means as a recognition result, based on the contents of the recognition result memory,
The frequency is counted for each character code of the similar character, a histogram of the frequency based on the size of the character having the frequency of 1 or more is created, and whether the division point of the histogram is uppercase or lowercase is used as a threshold value. And a determination unit for determining the character.

【０００８】また本発明は、前記判定手段は、特定の相
似形文字に関して、文字コードの頻度が１未満であって
ヒストグラムを作成できないとき、または、文字の大き
さが大文字のみもしくは小文字のみであってヒストグラ
ムに基づいてしきい値を決定できないとき、前記特定相
似形文字以外の相似形文字のしきい値に基づいて、判定
を行うことを特徴とする。In the present invention, the determination means may determine whether the frequency of a character code is less than 1 and a histogram cannot be created for a specific similar character, or if the character size is only uppercase or lowercase. When the threshold value cannot be determined based on the histogram, the determination is performed based on the threshold value of similar characters other than the specific similar character.

【０００９】[0009]

【００１０】[0010]

【作用】本発明に従えば、文字認識装置は、読取手段に
おいて、複数の文字が表示されている原稿を読取り、画
像データに変換し、読取手段からの画像データを画像メ
モリに記憶して、文字抽出手段によって、前記画像メモ
リから各文字画像を順次抽出し、認識手段においては、
辞書メモリの内容に基づいて、抽出文字画像を認識し、
文字コードを出力する。辞書メモリは、抽出文字画像と
照合する標準文字画像および対応する文字コードを記憶
し、幾何学的に相似の関係にある大文字と小文字とを持
つ相似形文字に関しては、大文字または小文字のいずれ
か一方の標準文字画像のみを記憶している。さらに、文
字認識装置は検出手段によって抽出文字画像の大きさを
検出し、認識手段からの文字コードとともに認識結果と
して認識結果メモリに記憶する。前記文字認識装置は、
判定手段において、認識結果メモリの内容に基づいて、
１行分にそれぞれ含まれる相似形文字の文字コード毎に
頻度を計数し、頻度が１以上である文字の大きさに基づ
くヒストグラムを作成し、該ヒストグラムの分割点をし
きい値として大文字であるか小文字であるかの判定を行
う。According to the present invention, the character recognition device reads an original on which a plurality of characters are displayed, converts the image into image data, and stores the image data from the reading unit in an image memory. By character extracting means, each character image is sequentially extracted from the image memory, and in the recognizing means,
Recognize the extracted character image based on the contents of the dictionary memory,
Output character code. The dictionary memory stores a standard character image to be matched with the extracted character image and a corresponding character code. For similar characters having a geometrically similar upper case and lower case, one of upper case and lower case is used. Only the standard character image is stored. Further, the character recognizing device detects the size of the extracted character image by the detecting means, and stores it in the recognition result memory together with the character code from the recognizing means. The character recognition device,
In the determining means, based on the contents of the recognition result memory,
The frequency is counted for each character code of the similar character included in one line, a histogram is created based on the size of the character having the frequency of 1 or more, and the division point of the histogram is used as a threshold value to be a capital letter. Or lowercase.

【００１１】また本発明に従えば、文字認識装置は前記
判定手段が特定の相似形文字に関してヒストグラムを作
成できないとき、すなわち入力された相似形文字が１個
未満であるとき、または、しきい値を決定できないと
き、すなわち相似形文字の大きさが大または小にかたよ
って入力されたとき、前記特定の相似形文字以外の相似
形文字のしきい値に基づいて、大文字、小文字の判定を
行う。Further, according to the present invention, the character recognizing device may be configured to determine whether a histogram cannot be created for a specific similar character, that is, when the number of similar characters input is less than one, or When it is not possible to determine, that is, when the size of the similar character is large or small, the uppercase and lowercase are determined based on the threshold value of the similar character other than the specific similar character. .

【００１２】[0012]

【００１３】[0013]

【実施例】図１は、本発明の一実施例である文字認識装
置１６の構成を簡略化して示すブロック図である。入力
部１は、スキャナなどの光電変換可能な入力装置であ
る。制御部２は、文字認識装置１６全体の制御を行う部
分であり、制御部として独立している必要はない。表示
部３は、入力された文字画像について行われた認識の結
果を活字文字で表示する。認識部４は、後述の認識辞書
１５中の標準文字画像と、入力された文字画像から特徴
抽出された文字画像とを照合し、その認識結果を後述の
認識結果メモリ９に出力する。FIG. 1 is a block diagram showing a simplified configuration of a character recognition device 16 according to an embodiment of the present invention. The input unit 1 is an input device capable of photoelectric conversion, such as a scanner. The control unit 2 is a part that controls the entire character recognition device 16 and does not need to be independent as a control unit. The display unit 3 displays the result of recognition performed on the input character image in printed characters. The recognition unit 4 collates a standard character image in a recognition dictionary 15 described later with a character image whose features are extracted from the input character image, and outputs the recognition result to a recognition result memory 9 described later.

【００１４】言語処理部５は、認識部４での結果を解析
し、言語情報を生かし、文字認識精度を向上させるもの
である。言語処理部５は場合によっては不要であり、認
識部４に包含されていても構わない。出力部６は、イン
ターフェイスであり、フロッピィディスク、ＲＳ−２３
２Ｃ、ＬＡＮ（ローカルエリアネットワーク）などに情
報を出力する。The language processing section 5 analyzes the result of the recognition section 4 and utilizes the linguistic information to improve the character recognition accuracy. The language processing unit 5 is unnecessary in some cases, and may be included in the recognition unit 4. The output unit 6 is an interface, a floppy disk, RS-23
Output information to 2C, LAN (local area network), etc.

【００１５】図２は、図１において示される制御部２の
構成を簡略化して示すブロック図である。制御部２は、
ＣＰＵ（中央処理装置）もしくは専用ＬＳＩ（大規模集
積回路）７、画像メモリ８、認識結果メモリ９、大文字
小文字識別メモリ１０から成る。ＣＰＵ７および大文字
小文字識別メモリ１０によって、大文字小文字判定部１
１が構成される。画像メモリ８は、入力部１で得られた
２値画像（多値画像でもよい）を蓄積する。FIG. 2 is a simplified block diagram showing the configuration of control unit 2 shown in FIG. The control unit 2
It comprises a CPU (Central Processing Unit) or a dedicated LSI (Large Scale Integrated Circuit) 7, an image memory 8, a recognition result memory 9, and a case identification memory 10. By the CPU 7 and the case identification memory 10, the case determination unit 1 is used.
1 is configured. The image memory 8 stores the binary image (may be a multi-valued image) obtained by the input unit 1.

【００１６】ＣＰＵ７は、画像メモリ８にある画像から
文字画像を切出し、切出した文字画像とその画像が位置
する座標（Ｘ１，Ｙ１），（Ｘ２，Ｙ２）を認識部４に
転送する。また、ＣＰＵ７は、認識部４からの認識結果
を受け取る。さらに、ＣＰＵ７は、後述する方法で大文
字、小文字の判定を行う。画像メモリ８は、ＣＰＵ７が
必要とする作業バッファを含んでいる。含まない場合に
は、ＣＰＵ７に直結された作業バッファを必要とする。The CPU 7 cuts out a character image from the image in the image memory 8 and transfers the cut-out character image and the coordinates (X1, Y1) and (X2, Y2) where the image is located to the recognizing unit 4. Further, the CPU 7 receives a recognition result from the recognition unit 4. Further, the CPU 7 determines uppercase and lowercase letters by a method described later. The image memory 8 includes a work buffer required by the CPU 7. If not included, a work buffer directly connected to the CPU 7 is required.

【００１７】認識結果メモリ９は、認識部４から得られ
た認識結果、たとえば、文字コード、文字画像の位置、
文字画像の縦横の大きさを順次蓄積するメモリである。
大文字小文字識別メモリ１０は、個々の文字画像のヒス
トグラム、頻度表を蓄積すると同時に、一部は作業バッ
ファとして用いてもよい。The recognition result memory 9 stores the recognition results obtained from the recognition unit 4 such as a character code, a position of a character image,
This is a memory for sequentially storing the vertical and horizontal sizes of the character image.
The case identification memory 10 may store a histogram and a frequency table of each character image, and at the same time, may partially use it as a work buffer.

【００１８】文字切出し部１２は、ＣＰＵ７が画像メモ
リ８から１文字に対応する文字画像の切出しを行わない
場合のみ必要とする。文字切出し部１２が存在する場
合、作業バッファ１３への出力は、文字画像とその画像
が位置する座標（Ｘ１，Ｙ１），（Ｘ２，Ｙ２）であ
る。作業バッファ１３は、画像メモリ８を用いてＣＰＵ
７が処理を行い、得られる文字画像と座標とを蓄積する
メモリであり、照合処理の作業バッファとして用いる。The character extracting section 12 is necessary only when the CPU 7 does not extract a character image corresponding to one character from the image memory 8. When the character cutout unit 12 exists, the output to the work buffer 13 is a character image and coordinates (X1, Y1) and (X2, Y2) where the image is located. The work buffer 13 includes a CPU using the image memory 8.
Reference numeral 7 denotes a memory that performs processing and stores the obtained character image and coordinates, and is used as a work buffer for the collation processing.

【００１９】照合部１４は、認識辞書１５を参照して、
予め定める認識論理に従って照合し、制御部２の認識結
果メモリ９に文字画像に対応すると考えられる文字コー
ドを類似度順に並べて、その先頭から複数個（１位から
ｎ位まで）分の、文字画像の位置、文字画像の縦横の大
きさを送る。The collating unit 14 refers to the recognition dictionary 15 and
Matching is performed according to a predetermined recognition logic, and character codes considered to correspond to the character images are arranged in the recognition result memory 9 of the control unit 2 in order of similarity, and a plurality of character images (from the first place to the n-th place) from the top are displayed. And the size of the character image in the vertical and horizontal directions.

【００２０】本実施例では、認識辞書１５に大文字、小
文字が相似である相似形文字に関しては小文字を入れな
いことによって、照合部１４での処理を軽くしている。In the present embodiment, the processing in the collating unit 14 is reduced by not putting lowercase letters in the recognition dictionary 15 for similar characters whose uppercase letters and lowercase letters are similar.

【００２１】図３は、ＣＰＵ７が切出した文字画像と座
標軸との関係を示す図である。Ｘ軸およびＹ軸は、画像
メモリ８に予め設定されており、各１文字に対してＸ
軸、Ｙ軸に対応する２辺が平行となるように、外接矩形
を選ぶ。この外接矩形の、座標原点Ｏから最も近い点Ｐ
の座標（Ｘ１，Ｙ１）と、座標原点Ｏから最も遠い点Ｑ
の座標（Ｘ２，Ｙ２）とを、この文字画像の位置データ
として認識結果メモリ９に格納する。FIG. 3 is a diagram showing the relationship between the character image cut out by the CPU 7 and the coordinate axes. The X axis and the Y axis are set in the image memory 8 in advance.
A circumscribed rectangle is selected such that two sides corresponding to the axis and the Y axis are parallel. The point P of this circumscribed rectangle that is closest to the coordinate origin O
(X1, Y1) and the point Q farthest from the coordinate origin O
(X2, Y2) are stored in the recognition result memory 9 as position data of the character image.

【００２２】図４は、文字認識装置１６における文字認
識の動作を示すフローチャートである。まず、ステップ
ｋ１で入力部１において、スキャナなどの入力装置から
画像データが入力され、制御部２の画像メモリ８に格納
される。次のステップｋ２で、画像メモリ８内に文字画
像データがあれば、ステップｋ３に移り、ＣＰＵ７ある
いは文字切出し部１２は、前述の図３に示されるような
外接矩形で１文字画像を切出すと同時に、その文字画像
が原稿の何行、何列目に位置するかを画像メモリ８内に
記憶する。FIG. 4 is a flowchart showing the operation of character recognition in the character recognition device 16. First, in step k1, the input unit 1 receives image data from an input device such as a scanner, and stores the image data in the image memory 8 of the control unit 2. In the next step k2, if there is character image data in the image memory 8, the process proceeds to step k3, where the CPU 7 or the character cutout unit 12 cuts out one character image with the circumscribed rectangle as shown in FIG. At the same time, the image memory 8 stores the row and column of the original where the character image is located.

【００２３】ステップｋ４では、ＣＰＵ７は画像メモリ
８内の作業バッファ、あるいは作業バッファ１３を用い
て１文字分の画像データから文字としての特徴を抽出し
た文字画像データと、その文字画像の位置データ（Ｘ
１，Ｙ１），（Ｘ２，Ｙ２）とを認識部４に出力する。
ステップｋ５では、認識部４の照合部１４は、作業バッ
ファ１３から送られてきた入力文字データと標準文字パ
ターンとを照合し、標準文字パターンと対応した文字コ
ードを、入力文字データとの類似度順に並べかえ、次の
ステップｋ６でその第１位から第ｎ位までの複数個の文
字コードを認識結果メモリ９に出力する。ただし、照合
に使用する標準文字パターンは、相似形文字に関しては
大文字のみ（あるいは小文字のみ）である。１文字分の
認識処理が終了するとステップｋ２に戻り、画像メモリ
８内の次の１文字画像データについて、ステップｋ２〜
ｋ６の処理を行う。In step k4, the CPU 7 uses the work buffer in the image memory 8 or the work buffer 13 to extract character image data of one character from image data of one character, and position data of the character image ( X
1, Y1) and (X2, Y2) are output to the recognition unit 4.
In step k5, the collating unit 14 of the recognizing unit 4 collates the input character data sent from the work buffer 13 with the standard character pattern, and determines the character code corresponding to the standard character pattern with the similarity to the input character data. In the next step k6, the character codes from the first place to the n-th place are output to the recognition result memory 9. However, the standard character pattern used for collation is only uppercase (or only lowercase) for similar characters. When the recognition process for one character is completed, the process returns to step k2, and for the next one character image data in the image memory 8, steps k2 to k2 are performed.
The processing of k6 is performed.

【００２４】ステップｋ２において、画像メモリ８に文
字画像データがない場合は、次のステップｋ７に移る。
ステップｋ７で認識結果メモリ９に認識結果データがあ
ればステップｋ８に移り、データがなければ終了する。In step k2, if there is no character image data in the image memory 8, the operation proceeds to the next step k7.
If there is any recognition result data in the recognition result memory 9 in step k7, the process proceeds to step k8, and if there is no data, the process ends.

【００２５】ステップｋ８で、ＣＰＵ７は大文字小文字
判定部１１で使用する頻度表を作成し、この頻度表に基
づいて、相似形文字の抽出文字画像の大きさのヒストグ
ラムを作成する。次に、ステップｋ９において、ＣＰＵ
７は、前述のように作成されたヒストグラムから、大文
字か小文字かを判定する基準となる分割点の文字画像の
大きさ、すなわち、しきい値ｃを決定する。In step k8, the CPU 7 creates a frequency table to be used by the upper / lower case determination unit 11, and creates a histogram of the size of the extracted character image of similar characters based on the frequency table. Next, in step k9, the CPU
Reference numeral 7 determines the size of the character image at the division point serving as a reference for determining whether the case is uppercase or lowercase, that is, the threshold value c, from the histogram created as described above.

【００２６】ステップｋ１０で、ＣＰＵ７はステップｋ
９で得られた分割点の値ｃと、各文字画像の位置座標か
ら計算された文字の大きさを照合し、大文字か小文字か
を判定する。ステップｋ１１では、ＣＰＵ７は、小文字
と判定されたステップｋ６の認識結果に関して、対応す
る文字コードを修正した後、認識結果を表示部３や出力
部６から出力する。At step k10, the CPU 7 determines at step k
The value c of the division point obtained in 9 is compared with the size of the character calculated from the position coordinates of each character image, and it is determined whether the character is uppercase or lowercase. In step k11, the CPU 7 outputs the recognition result from the display unit 3 or the output unit 6 after correcting the corresponding character code with respect to the recognition result in step k6 determined to be lowercase.

【００２７】図５は大文字小文字判定部１１で実行され
る頻度表作成の手順を示すフローチャートであり、図６
は図５のフローチャートに示す動作で作成される頻度表
のデータの構成例を示す図である。まず、ステップｍ１
で図６で示すテーブルの頻度欄および大きさ欄をクリア
する。頻度表を作成する対象となるカテゴリは、大文字
と小文字とが相似形である相似形文字、すなわち「あ、
い、う、え、お、つ、や、ゆ、よ、わ、ア、イ、ウ、
エ、オ、ツ、ヤ、ユ、ヨ、ワ、カ、ケ、Ｃ、Ｏ、Ｓ、
Ｖ、Ｗ、Ｘ、Ｚ」である。FIG. 5 is a flowchart showing a procedure for creating a frequency table executed by the upper / lower case judging section 11, and FIG.
FIG. 6 is a diagram showing a configuration example of data of a frequency table created by the operation shown in the flowchart of FIG. 5. First, step m1
Clear the frequency column and the size column of the table shown in FIG. The categories for which the frequency table is created are similar characters whose uppercase and lowercase are similar, that is, "A,
I, u, e, o, tsu, ya, yu, yo, wow, a, i, u,
D, o, t, ya, yu, yo, wa, ka, ke, C, O, S,
V, W, X, Z ".

【００２８】ステップｍ２で、認識結果メモリ９内に認
識結果データがあるかをチェックする。認識結果メモリ
９内にデータがない場合は、処理を終了する。ステップ
ｍ３で、認識結果メモリ９から１文字ずつの認識結果、
たとえば、文字コードおよび座標を読取る。ステップｍ
４で、認識結果メモリ９から読取った文字コードが
「あ、い、う、え、お、つ、や、ゆ、よ、わ、ア、イ、
ウ、エ、オ、ツ、ヤ、ユ、ヨ、ワ、カ、ケ、Ｃ、Ｏ、
Ｓ、Ｖ、Ｗ、Ｘ、Ｚ」等の相似形文字でない場合はステ
ップｍ２に戻る。前記文字コードが相似形文字である場
合は、次のステップｍ５に移る。ステップｍ５では、対
応する相似形文字の文字コードの頻度欄に１を加えて計
数する。ステップｍ６では対応する大きさ欄の末尾に座
標から引算で求まる縦横の大きさを書加え、ステップｍ
２へ戻る。At step m2, it is checked whether or not there is recognition result data in the recognition result memory 9. If there is no data in the recognition result memory 9, the process ends. At step m3, the recognition result of each character is recognized from the recognition result memory 9;
For example, a character code and coordinates are read. Step m
In 4, the character code read from the recognition result memory 9 is "A, I, U, E, O, T, Y, Y, Y, W, A, I,
U, d, o, t, ya, yu, yo, wa, ka, ke, C, O,
If it is not a similar character such as "S, V, W, X, Z", the process returns to step m2. If the character code is a similar character, the process proceeds to the next step m5. At step m5, 1 is added to the frequency field of the character code of the corresponding similar character, and counting is performed. In step m6, the vertical and horizontal sizes obtained by subtracting from the coordinates are added to the end of the corresponding size column, and step m6 is performed.
Return to 2.

【００２９】図７は図６に示される相似形文字の頻度表
のデータに基づいてヒストグラムを作成し、大文字・小
文字判別の分割点を求める動作を示すフローチャートで
あり、図８は図６に示される相似形文字の頻度表のデー
タをもとに作成されるヒストグラムである。まず、ステ
ップｎ１で前記頻度表にデータがあるかどうかを判断
し、データがない場合は、処理を終了する。データがあ
る場合は次のステップｎ２に進み、図６の前記頻度表の
文字コード１行分のデータを読む。ステップｎ３で、読
んだ行の頻度が１未満の場合、ステップｎ１に戻り、次
の文字コードの行のデータを読む。ステップｎ３で読ん
だ文字コードの行の頻度が１以上の場合、ステップｎ４
でテーブルに記述されている大きさを用い、図８に示す
ような水平軸を大きさ、垂直軸を個数すなわち頻度とし
たヒストグラムを作成し、さらに、図８に示すような値
ａ，ｂを求める。FIG. 7 is a flow chart showing an operation for creating a histogram based on the data of the frequency table of similar characters shown in FIG. 6 and obtaining a division point for distinguishing between uppercase and lowercase characters, and FIG. 8 is shown in FIG. 4 is a histogram created based on data of a frequency table of similar characters. First, it is determined in step n1 whether or not there is data in the frequency table. If there is no data, the process is terminated. If there is data, the process proceeds to the next step n2, where data of one line of the character code in the frequency table of FIG. 6 is read. If the frequency of the read line is less than 1 at step n3, the process returns to step n1 to read the data of the line of the next character code. If the frequency of the line of the character code read in step n3 is 1 or more, step n4
Using the size described in the table, a histogram is created in which the horizontal axis is the size and the vertical axis is the number, that is, the frequency as shown in FIG. 8, and the values a and b as shown in FIG. Ask.

【００３０】さらに、ステップｎ５では、ｃ＝（ａ＋
ｂ）／２を満足する値ｃを決定する。この際、入力原稿
画像が横書きならば、横の大きさヒストグラムで値ｃを
求める。縦書きならば、縦の大きさヒストグラムから値
ｃを求めることとする。Further, at step n5, c = (a +
b) Determine a value c that satisfies / 2. At this time, if the input document image is written horizontally, the value c is obtained using a horizontal size histogram. In the case of vertical writing, the value c is determined from the vertical size histogram.

【００３１】なお、相似形文字の頻度表の特定文字に関
して１行分のデータの頻度が１未満であって、当該特定
文字に関してヒストグラムを作成できないとき、また
は、大文字ばかりや小文字ばかりで値ａ，ｂが検出でき
ないとき、分割点の値をカテゴリ内の他の相似形文字の
データに関して得られた分割点の値を分割点として用い
る。When the frequency of data for one line is less than 1 for a specific character in the frequency table of similar characters and a histogram cannot be created for the specific character, or when the value a, When b cannot be detected, the value of the division point is used as the division point for the value of the division point obtained with respect to the data of other similar characters in the category.

【００３２】分割点の値ｃが決定された後、認識結果メ
モリ９の内容を縦もしくは横の大きさと分割点の値ｃと
を比べ、小文字の文字コードに変換する。値ｃより大き
さが小さい場合、小文字となる。After the value c of the division point is determined, the contents of the recognition result memory 9 are compared with the vertical or horizontal size and the value c of the division point, and are converted into lowercase character codes. If the size is smaller than the value c, it becomes lowercase.

【００３３】図９は、文字を含む原稿２０が読取られ、
一旦文字画像として記憶され、認識後、再び、原稿２０
と同様の形態で出力される様子を示す図である。図９
（１）において、入力部１が読取る原稿２０の一例を示
す。スキャナなどの入力装置である入力部１は、原稿２
０の仮想線２２で囲まれた領域の画像２１を読取り、光
電変換などによって画像メモリ８に２値画像として出力
する。２値画像とは、入力信号の振幅、すなわち光の強
度や印刷濃度などのヒストグラムを求め、あるレベル、
すなわちしきい値より振幅が大きいとき１に、小さいと
き０として表す画像である。たとえば、ある点の濃度が
１のとき黒、０のとき白にそれぞれ対応して表される。FIG. 9 shows a case where a document 20 including characters is read,
Once stored as a character image and recognized,
It is a figure showing signs that it is outputted in the form similar to. FIG.
1A shows an example of a document 20 read by the input unit 1. An input unit 1 which is an input device such as a scanner includes an original 2
The image 21 in the area surrounded by the virtual line 22 of 0 is read and output as a binary image to the image memory 8 by photoelectric conversion or the like. A binary image is obtained by obtaining a histogram of the amplitude of an input signal, that is, a light intensity or a print density, and obtaining a certain level,
That is, the image is represented as 1 when the amplitude is larger than the threshold value and as 0 when the amplitude is smaller than the threshold value. For example, when the density of a certain point is 1, it is expressed in black, and when it is 0, it is expressed in white.

【００３４】図９（２）は、画像メモリ８内に入力され
た２値画像２１の一部を示す。画像メモリ８内では、２
値画像２１がＸ座標およびＹ座標に対応して記憶され、
ＣＰＵ４または文字切出し部１２は、切出す文字の外接
矩形２３の位置を表す点Ｐおよび点Ｑの各座標（Ｘ１，
Ｙ１），（Ｘ２，Ｙ２）とともに、この座標値から計算
された１文字の縦横の大きさ、および切出す文字が何行
何列目にあるかを認識結果メモリ９に出力する。FIG. 9B shows a part of the binary image 21 inputted into the image memory 8. In the image memory 8, 2
The value image 21 is stored corresponding to the X coordinate and the Y coordinate,
The CPU 4 or the character cutout unit 12 sets the coordinates (X1, X1) of the points P and Q representing the position of the
Along with Y1) and (X2, Y2), the vertical and horizontal sizes of one character calculated from the coordinate values and the number and line of the character to be cut out are output to the recognition result memory 9.

【００３５】図９（３）は、読込まれた原稿２０が、文
字認識装置１６で認識処理された後、出力部６を経て記
録紙３２に印刷出力された一例を示す。文字認識装置１
６で認識処理された文字は、以下に示す図１０（１）の
認識結果メモリ８内の位置を表す行と列のデータ３０に
基づいて、配列される。FIG. 9C shows an example in which the read document 20 is subjected to recognition processing by the character recognition device 16 and then printed out on the recording paper 32 via the output unit 6. Character recognition device 1
The characters recognized in step 6 are arranged on the basis of row and column data 30 indicating positions in the recognition result memory 8 shown in FIG.

【００３６】図１０は、認識結果メモリ９内に記憶され
ている各入力文字に関するデータを示す図である。FIG. 10 is a view showing data relating to each input character stored in the recognition result memory 9.

【００３７】図１０（１）は、照合部１４において認識
辞書１５との照合を受けた後、認識結果メモリ９内に格
納された入力文字データを示す。認識結果メモリ９内に
は、各１文字分に対応する種々のデータが格納されてい
る記憶領域のアドレス２４、切出された文字画像が照合
部１４で認識辞書１５との照合を受けた結果、類似度順
に並べられ出力された第１文字コード２５〜第ｎ位文字
コード２６、図３および図９（２）に示される、切出さ
れた１文字画像の位置座標Ｐ（Ｘ１，Ｙ１）２７および
Ｑ（Ｘ２，Ｙ２）２８、１文字画像の縦横の大きさ２
９、その文字が入力された入力画像２１の何行何列目に
位置するかを示す位置（行と列）３０などのデータが各
１文字に対応して記憶されている。FIG. 10A shows input character data stored in the recognition result memory 9 after the collating unit 14 has collated with the recognition dictionary 15. In the recognition result memory 9, an address 24 of a storage area in which various data corresponding to each one character is stored, and a result obtained when the cut-out character image is compared with the recognition dictionary 15 by the matching unit 14. The first character code 25 to the n-th character code 26 arranged and output in the order of similarity, and the position coordinates P (X1, Y1) of the cut-out one-character image shown in FIGS. 3 and 9 (2). 27 and Q (X2, Y2) 28, vertical and horizontal size 2 of one character image
9. Data such as a position (row and column) 30 indicating the row and column of the input image 21 where the character is input is stored in correspondence with each character.

【００３８】図１０（２）は、図１０（１）の第１位文
字コード２５の部分を抜出した図である。認識結果とし
て出力される第１位文字コードについて、図１０（１）
に示す文字コード２５は、相似形文字の照合に関して大
文字のみの標準パターンが使用されているので、小文字
「_ッ」に対して大文字「ツ」の文字コード０５３６が与
えられている。また、同様に、小文字「_ィ」に対して、
大文字「イ」の文字コード０５０４が与えられている。FIG. 10B is a diagram in which the first character code 25 in FIG. 10A is extracted. Regarding the first character code output as the recognition result, FIG.
Character code 25 shown in, since the standard pattern of the upper case only for matching shape similar characters are used, the character code 0536 uppercase "Tsu" is given for lower case _"Tsu". Similarly, for the lowercase letter " _i ",
The character code 0504 of the capital letter “A” is given.

【００３９】図１０（３）は、入力文字画像がヒストグ
ラムから求められたしきい値に基づいて、大文字小文字
判定部１１で大文字か小文字かを判定された後、書直さ
れた文字コードのうち、第１位文字コード３１を示す。FIG. 10 (3) shows a case where an uppercase / lowercase determination unit 11 determines whether an input character image is uppercase or lowercase based on a threshold value obtained from a histogram. , The first character code 31.

【００４０】図１０（２）で、それぞれ大文字の文字コ
ードを与えられていた「_ッ」、「_ィ」は、それぞれ小文
字のコード０５３５，０５０３に書直されている。[0040] In FIG. 10 (2), respectively, had been given a character code of a capital _"Tsu", _"I" are respectively healed written in lower case letters of the code 0535,0503.

【００４１】このように認識辞書１５には、標準パター
ンとして幾何学的に相似の関係にある大文字と小文字と
を持つ相似形文字に関しては、大文字のみの標準パター
ンを記憶し、この相似形文字に関しては、文字認識に際
して照合部１４による大文字か小文字かの判定を行わな
い。これによって照合する標準パターンの数を減らすこ
とができ、より高速に文字認識を行うことができる。ま
た相似形文字に関しては、抽出文字画像の大きさに基づ
いて大文字か小文字かを判定するので、標準パターンと
の照合では正確な認識がむずかしい相似形文字の認識精
度を高めることができる。As described above, the recognition dictionary 15 stores a standard pattern consisting of only uppercase characters for similar characters having uppercase and lowercase characters that are geometrically similar to each other as a standard pattern. Does not determine whether uppercase or lowercase by the collating unit 14 in character recognition. As a result, the number of standard patterns to be compared can be reduced, and character recognition can be performed faster. In addition, for similar characters, whether uppercase or lowercase is determined based on the size of the extracted character image, it is possible to increase the recognition accuracy of similar characters that are difficult to accurately recognize in comparison with a standard pattern.

【００４２】[0042]

【発明の効果】以上のように本発明によれば、辞書メモ
リには標準文字画像として幾何学的に相似の関係にある
大文字と小文字とを持つ相似形文字に関して大文字ある
いは小文字のいずれか一方の標準文字画像を記憶し、相
似形文字に関しては、文字認識に際して認識手段による
大文字か小文字かの判定を行わない。標準文字画像は、
相似形文字であるか否かに拘わらず、全抽出文字画像に
対して照合が行われるので、標準文字画像の数を減少
し、文字認識の高速化を図ることができる。また、相似
形文字に関しては、別途、抽出文字画像の大きさを検出
手段において検出し、認識結果メモリの内容に基づい
て、相似形文字の頻度表の１行分にそれぞれ含まれる相
似形文字の文字コード毎に頻度を計数し、頻度が１以上
である文字の大きさに基づく頻度のヒストグラムを作成
して分割点を求め、この大きさをしきい値として大文字
であるか小文字であるかを判定するので、従来、標準文
字画像との照合では正確な認識が困難である該相似形文
字の認識を、より正確に行うことができる。As described above, according to the present invention, one of uppercase and lowercase characters is stored in a dictionary memory as a standard character image with respect to similar characters having a geometrically similar uppercase and lowercase character. A standard character image is stored, and the recognition unit does not determine whether uppercase or lowercase characters are similar for similar characters. The standard character image is
Regardless of whether the characters are similar characters or not, matching is performed for all extracted character images, so that the number of standard character images can be reduced and character recognition can be speeded up. For similar characters, the size of the extracted character image is separately detected by the detection means, and based on the contents of the recognition result memory, the similar character characters included in one line of the frequency table of similar characters are respectively detected. Count the frequency for each character code, create a histogram of the frequency based on the size of the character whose frequency is 1 or more, find the division point, and use this size as a threshold to determine whether it is uppercase or lowercase. Since the determination is made, it is possible to more accurately recognize the similar-shaped character, which is conventionally difficult to accurately recognize in the collation with the standard character image.

【００４３】また本発明によれば、特定の相似形文字に
関して、ヒストグラムを作成できないとき、またはヒス
トグラムに基づいてしきい値を決定できないときは、前
記特定相似形文字以外の相似形文字のしきい値に基づい
て、抽出文字画像が大文字か小文字かの判定を行うの
で、文字の誤認識の発生頻度を低減することができる。According to the present invention, when a histogram cannot be created for a specific similar character or when a threshold value cannot be determined based on the histogram, the threshold of similar characters other than the specific similar character is determined. Since it is determined whether the extracted character image is uppercase or lowercase based on the value, the frequency of occurrence of erroneous character recognition can be reduced.

【００４４】[0044]

[Brief description of the drawings]

【図１】本発明の一実施例である文字認識装置１６の構
成を簡略化して示すブロック図である。FIG. 1 is a simplified block diagram showing a configuration of a character recognition device 16 according to an embodiment of the present invention.

【図２】図１に示される制御部２の構成を簡略化して示
すブロック図である。FIG. 2 is a simplified block diagram showing a configuration of a control unit 2 shown in FIG.

【図３】ＣＰＵ７が切出した文字画像と座標軸との関係
を示す図である。FIG. 3 is a diagram illustrating a relationship between a character image cut out by a CPU 7 and coordinate axes.

【図４】文字認識装置１６における文字認識の動作を示
すフローチャートである。FIG. 4 is a flowchart showing an operation of character recognition in the character recognition device 16;

【図５】大文字小文字判定部１１で実行される頻度表作
成の手順を示すフローチャートである。FIG. 5 is a flowchart illustrating a procedure for creating a frequency table, which is executed by the case determination unit 11;

【図６】図５のフローチャートに示す動作で作成される
頻度表のデータの構成例を示す図である。FIG. 6 is a diagram showing a configuration example of data of a frequency table created by the operation shown in the flowchart of FIG. 5;

【図７】大文字・小文字判別の分割点ｃを求める手順を
示すフローチャートである。FIG. 7 is a flowchart showing a procedure for obtaining a division point c for upper / lower case discrimination.

【図８】図６に示される頻度表のデータに基づいて作成
されるヒストグラムである。8 is a histogram created based on the data of the frequency table shown in FIG.

【図９】文字を含む原稿２０が読取られ、一旦文字画像
として記憶され、認識後、再び、原稿２０と同様の形態
で出力される様子を示す図である。FIG. 9 is a diagram showing a state in which a document 20 including characters is read, temporarily stored as a character image, recognized, and output again in the same form as the document 20.

【図１０】認識結果メモリ９内に記憶されている各入力
文字に関するデータの一例を示す図である。FIG. 10 is a diagram showing an example of data relating to each input character stored in a recognition result memory 9;

[Explanation of symbols]

１入力部２制御部４認識部７ＣＰＵ８画像メモリ９認識結果メモリ１０大文字小文字識別メモリ１１大文字小文字判定部１２文字切出し部１４照合部１５認識辞書１６文字認識装置 DESCRIPTION OF SYMBOLS 1 Input part 2 Control part 4 Recognition part 7 CPU 8 Image memory 9 Recognition result memory 10 Uppercase / lowercase identification memory 11 Uppercase / lowercase determination part 12 Character extraction part 14 Collation part 15 Recognition dictionary 16 Character recognition device

───────────────────────────────────────────────────── フロントページの続き (58)調査した分野(Int.Cl.⁶，ＤＢ名) G06K 9/46 G06K 9/62 ──────────────────────────────────────────────────続き Continued on the front page (58) Field surveyed (Int.Cl. ⁶ , DB name) G06K 9/46 G06K 9/62

Claims

(57) [Claims]

A reading unit that reads a document on which a plurality of characters are displayed, converts the document into image data, and outputs the image data; an image memory that stores image data from the reading unit; A character extracting means for sequentially extracting, and a standard character image to be matched with the extracted character image and a corresponding character code are stored. For similar characters having a geometrically similar uppercase and lowercase character, uppercase or lowercase characters are used. A dictionary memory in which only one of the standard character images is stored, a detecting means for detecting the size of the extracted character image, and a character code which recognizes the extracted character image based on the storage contents of the dictionary memory and converts the character code. A recognition means for outputting, a recognition result memory for storing a character code from the recognition means, and a size from the detection means as a recognition result; , The frequency is counted for each character code of the similar character, a histogram of the frequency based on the size of the character having the frequency of 1 or more is created, and the division point of the histogram is used as a threshold to determine whether the character is uppercase. A character recognition device comprising: a determination unit configured to determine whether the character is a lowercase letter.

2. The method according to claim 1, wherein the frequency of the character code is less than 1 and a histogram cannot be created for the specific similar character, or the character size is only uppercase or lowercase and the histogram is not included. The character recognition device according to claim 1, wherein when a threshold value cannot be determined based on the threshold value, a determination is made based on a threshold value of a similar character other than the specific similar character.