JPH08279022A

JPH08279022A - Character recognition device

Info

Publication number: JPH08279022A
Application number: JP7080376A
Authority: JP
Inventors: Meishiyuku Kin; 明淑金
Original assignee: Teac Corp
Current assignee: Teac Corp
Priority date: 1995-04-05
Filing date: 1995-04-05
Publication date: 1996-10-22

Abstract

PURPOSE: To perform an optimum normalizing process at a high speed as for a character recognition device which recognizes a character by normalizing character data of a document. CONSTITUTION: A rectangle data processing means 13 generates circumscribed rectangle data from image data fetched into an input means 12 and a matching process means 14 processes the circumscribed rectangle data by individual, merging, and separation coordination and temporarily stores the result and rectangle data in a rectangle data processing means 13. After a data extracting process means 15 extracts normal character data from the image data on the basis of the rectangle data, a recognizing process means 16 performs character recognition from the character data.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、文書の文字データを正
規化処理して文字認識を行う文字認識装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a character recognition device for normalizing character data of a document to perform character recognition.

【０００２】[0002]

【従来の技術】近年、パーソナルコンピュータやファク
シミリの普及に伴って、文字認識装置や文字認識のプロ
グラムソフトの需要が高まっており、文字認識において
正確さと高速性が要求されている。2. Description of the Related Art In recent years, along with the widespread use of personal computers and facsimiles, the demand for character recognition devices and program software for character recognition has increased, and character recognition is required to be accurate and fast.

【０００３】従来、文字認識装置は、スキャナ等で文字
をドットイメージとして読み込み、読み込んだドットイ
メージデータから文字データを切り出す正規化処理を行
う。そして、切り出したドット文字データを認識し、認
識した文字が適合しているか否かの文法上のチェックを
行うものである。Conventionally, a character recognition device performs a normalization process of reading a character as a dot image with a scanner or the like and cutting out character data from the read dot image data. Then, the cut-out dot character data is recognized, and a grammatical check is made as to whether or not the recognized character matches.

【０００４】上記のような正規化処理は次の認識処理を
より確実に行うためのものとして必要な処理であり、偏
った文字に対しても配慮がなされている。このような偏
った文字に対する正規化処理として、特開平３−６５７
８１号公報に記載されている事項が知られている。これ
は、文字記号の最大高さ又は幅の一方を基準辺とし、他
辺を文字記号パターンのドット存在幅として矩形領域で
注目文字記号パターンを切り出し、正規処理のときに正
規化サイズに対する基準辺の比で、当該注目文字記号パ
ターンの高さ及び幅方向の大きさを変倍する。すなわ
ち、句点“。”等のような平均的な漢字と大きくかけ離
れたドット分布を持つものに対して、その位置情報を失
うことなく正規化させるものである。The above-described normalization processing is necessary to more reliably perform the next recognition processing, and consideration is given to biased characters. Japanese Patent Laid-Open No. 3-657 discloses a normalization process for such a biased character.
The matters described in Japanese Patent No. 81 are known. This is one of the maximum height or width of the character symbol as the reference side, the other side as the dot existence width of the character symbol pattern, and the target character symbol pattern is cut out in the rectangular area. The size of the target character / symbol pattern in the height direction and the width direction is scaled by the ratio. That is, a character having a dot distribution that is far apart from an average Chinese character such as a punctuation mark "." Is normalized without losing its position information.

【０００５】[0005]

【発明が解決しようとする課題】しかし、特開平３−６
５７８１号公報に記載のものは、一行に存在するドット
を左側（又は右側）からスキャンしてドットの存在しな
いところを文字間隔として矩形切り出しを行うことか
ら、分離文字や合併文字等に対して誤認識を生じるとい
う問題がある。However, Japanese Unexamined Patent Publication (Kokai) No. 3-6.
The method described in Japanese Patent No. 5781 scans the dots existing in one line from the left side (or the right side) and performs rectangular cutout using the spaces where the dots do not exist as character intervals. There is a problem of causing recognition.

【０００６】例えば、文字“は”のように一文字中で分
離している文字が存在すると分離された状態で正規化処
理が行われ、一方文字間隔の少ない漢字同士の場合に誤
って一文字として切り出すことになる。また、上記誤認
識によるエラー発生では、改めて正規化処理をやり直す
こととなり、時間を要するという問題がある。For example, if there is a separated character such as the character "wa" in one character, the normalization process is performed in the separated state. On the other hand, in the case of Chinese characters with small character spacing, they are erroneously cut out as one character. It will be. Further, when an error occurs due to the above-mentioned erroneous recognition, the normalization process is performed again, which takes time.

【０００７】そこで、本発明は上記課題に鑑みなされた
もので、高速かつ最適に正規化処理を行う文字認識装置
を提供することを目的とする。Therefore, the present invention has been made in view of the above problems, and an object of the present invention is to provide a character recognition device that performs normalization processing at high speed and optimally.

【０００８】[0008]

【課題を解決するための手段】図１に、本発明の原理構
成図を示す。図１において、請求項１に示すように上記
課題を解決するために、文字認識装置１１が、文書の文
字データをイメージデータとして取り込む入力手段１２
と、該入力手段１２からの該イメージデータより該文字
データのうちの独立する構成部分の外接矩形データを作
成して格納すると共に、該外接矩形データに基づいて整
合化された矩形データが書き込まれる矩形データ処理手
段１３と、該矩形データ処理手段１３から読み出した該
外接矩形データに基づいて、前記イメージデータの文字
データ部分より検出された所定のデータを用いて独立、
合併、分離の処理を行うことにより、一文字に相当する
前記矩形データに整合化して該矩形データ処理手段１３
に格納させる整合化処理手段１４と、該整合化処理手段
１４の矩形データに基づいて、前記入力手段１２に取り
込まれたイメージデータより正規の前記文字データを抽
出するデータ抽出処理手段１５と、該文字データ抽出処
理手段１５からの正規の文字データより文字認識を行う
認識処理手段１６と、を有して構成される。FIG. 1 is a block diagram showing the principle of the present invention. In FIG. 1, in order to solve the above problems as described in claim 1, the character recognition device 11 inputs the character data of a document as image data.
Then, the circumscribing rectangle data of the independent constituent part of the character data is created and stored from the image data from the input means 12, and the rectangle data matched based on the circumscribing rectangle data is written. Independently using the rectangular data processing means 13 and predetermined data detected from the character data portion of the image data based on the circumscribed rectangular data read from the rectangular data processing means 13,
By performing the merging and separation processing, the rectangular data corresponding to one character is matched and the rectangular data processing means 13
And the data extraction processing means 15 for extracting the regular character data from the image data taken in by the input means 12 based on the rectangular data of the matching processing means 14, And a recognition processing means 16 for performing character recognition from the regular character data from the character data extraction processing means 15.

【０００９】[0009]

【作用】上述のように本発明では、入力手段１２に取り
込まれたイメージデータより外接矩形データを矩形デー
タ処理手段１３で作成し、該外接矩形データを整合化処
理手段１４で単独、合併、分離の整合化の処理を行っ
て、矩形データとして該矩形データ処理手段１３に一担
格納し、該矩形データに基づいてイメージデータより正
規の文字データをデータ抽出処理手段１５により抽出し
た後に、該文字データより認識処理手段１６で文字認識
を行う。これにより、分離文字や合併文字等に対して最
適な正規化処理が行われ、誤認識が減少して高速化を図
ることが可能となる。As described above, in the present invention, the circumscribed rectangular data is created by the rectangular data processing means 13 from the image data taken in the input means 12, and the circumscribed rectangular data is independently, merged or separated by the matching processing means 14. Of the character data, and the character data is stored in the rectangular data processing means 13 as rectangular data, and the normal character data is extracted from the image data by the data extraction processing means 15 based on the rectangular data. Character recognition is performed by the recognition processing means 16 from the data. As a result, optimal normalization processing is performed on separated characters, merged characters, etc., and it is possible to reduce erroneous recognition and increase speed.

【００１０】[0010]

【実施例】図２に、本発明の一実施例の機能ブロック図
を示す。図２は文字認識装置１１の機能ブロック図を示
したもので、例えばパーソナルコンピュータにスキャナ
等が接続されたシステムとして構成される。FIG. 2 shows a functional block diagram of an embodiment of the present invention. FIG. 2 is a functional block diagram of the character recognition device 11, which is configured as a system in which a scanner or the like is connected to a personal computer, for example.

【００１１】図２において、文書イメージ入力手段２１
は文書の文字データをイメージデータとして読み込むも
ので、例えばスキャナやファクシミリモデムなどの文書
をドットイメージとして入力する機器が使用される。こ
の読み込まれたイメージデータはイメージデータ格納手
段２２に格納されて取り込まれる。イメージデータ格納
手段２２には、例えばパーソナルコンピュータのメモリ
（ＲＡＭ：Random Access Memory）又はＨＤＤ（ハード
ディスクドライブ) が使用される。この文書イメージ入
力手段２１及びイメージデータ格納手段２２により入力
手段１２が構成される。In FIG. 2, document image input means 21
Reads character data of a document as image data, and a device such as a scanner or a facsimile modem for inputting a document as a dot image is used. The read image data is stored and fetched in the image data storage means 22. As the image data storage means 22, for example, a memory (RAM: Random Access Memory) of a personal computer or an HDD (hard disk drive) is used. The document image input means 21 and the image data storage means 22 constitute the input means 12.

【００１２】例えば、文書イメージ入力手段２１にスキ
ャナを使用した場合、スキャナにおいて文書のイメージ
データが作成されてパーソナルコンピュータの所定のバ
スに入力され、上記ＲＡＭにバイナリで「１」（例えば
黒い部分）と「０」（例えば白い部分）として記憶され
る。この場合、バスには、ＲＳ（Recommended Standar
d) ２３２Ｃのようなシリアルポートやセントロニクス
仕様、又はＳＣＳＩ（Small Computer System Interfac
e)やＧＰＩＢ（General Purpose Interface Bus)などの
パラレルパポート等が使用される。そして、イメージデ
ータの１ラインの終端にはＣＲＬＦなどのレコートセパ
レータに相当するデータが付加される。For example, when a scanner is used as the document image input means 21, image data of a document is created by the scanner and input to a predetermined bus of a personal computer, and is binary "1" (for example, a black portion) in the RAM. And “0” (for example, white part) are stored. In this case, RS (Recommended Standar)
d) Serial port such as 232C, Centronics specification, or SCSI (Small Computer System Interfac)
e) or a parallel port such as GPIB (General Purpose Interface Bus) is used. Then, data corresponding to a recording separator such as CRLF is added to the end of one line of the image data.

【００１３】なお、専用のインタフェースボードを設け
てもよく、また文書イメージ入力手段としてファクシミ
リやファクシミリモデム等を使用した場合も同様であ
る。また、ＲＡＭには、イメージデータの総てのデータ
がＲＡＭに格納される必要はなく、一旦ＨＤＤにファイ
ルとして蓄えられたイメージデータから、正規化処理や
文字認識処理の際に必要なデータが適宜ロードされ、格
納される形式であってもよい。この場合、ＲＡＭの中は
ビットとドットが対応して格納されており、文書が記載
されている「紙の幅」に相当する１ラインの長さのデー
タも格納される。A dedicated interface board may be provided, and the same applies when a facsimile or facsimile modem is used as the document image input means. Further, it is not necessary for all the image data to be stored in the RAM in the RAM, and the data necessary for the normalization processing and the character recognition processing can be appropriately selected from the image data once stored as a file in the HDD. It may be loaded and stored. In this case, bits and dots are stored in the RAM in association with each other, and one-line length data corresponding to the "paper width" in which the document is written is also stored.

【００１４】続いて、行幅認識手段２３はイメージデー
タ格納手段２２からイメージデータを読み込んで行幅デ
ータを作成して、例えば上記ＲＡＭ（又はＨＤＤ）上の
一部としての行幅データ格納手段２４に格納するもので
あり、データの小容量、繁雑な書き換え等を考慮してＲ
ＡＭに格納することが望ましい。外接矩形データ作成手
段２５はイメージデータ格納手段２２からイメージデー
タを読み出し、また行幅データ格納手段２４から行幅デ
ータを読み出して文字データのうちの独立する構成部分
の外接領域のデータとしての外接矩形データを作成し、
矩形データ格納手段２６の一部に格納される。この行幅
認識手段２３、行幅データ格納手段２４、外接矩形デー
タ作成手段２５、及び矩形データ格納手段２６により矩
形データ処理手段１３が構成される。Subsequently, the line width recognition means 23 reads the image data from the image data storage means 22 to create line width data, and the line width data storage means 24 as a part of the RAM (or HDD), for example. It is stored in R, considering small capacity of data, complicated rewriting, etc.
It is desirable to store in AM. The circumscribing rectangle data creating means 25 reads out the image data from the image data storing means 22, and reads out the line width data from the line width data storing means 24 to define a circumscribing rectangle as data of the circumscribing area of the independent constituent part of the character data. Create data,
It is stored in a part of the rectangular data storage means 26. The line width recognition unit 23, the line width data storage unit 24, the circumscribing rectangular data creation unit 25, and the rectangular data storage unit 26 constitute a rectangular data processing unit 13.

【００１５】一方、標準矩形データ作成作成手段２７
は、行幅データ格納手段２４より行幅データを読み出
し、行幅データで得られる文字データの文字高さに対応
する文字幅を決定してワードプロセッサ文書の全角文字
の標準矩形データを作成し、これをＲＡＭ（又はＨＤ
Ｄ）の一部分に割り当てられた標準矩形データ格納手段
２８に格納する。On the other hand, the standard rectangular data creating / creating means 27
Reads the line width data from the line width data storage means 24, determines the character width corresponding to the character height of the character data obtained by the line width data, and creates standard rectangular data of full-width characters of the word processor document. RAM (or HD
The data is stored in the standard rectangular data storage means 28 assigned to a part of D).

【００１６】また、矩形データ整合化手段２９は、標準
矩形データ格納手段２８から読み出した標準矩形データ
と、矩形データ格納手段２６から読み出した外接矩形デ
ータとにより、またイメージデータの文字データ部分よ
り検出された所定のデータと該外接矩形データとによ
り、単独、合併、分離の処理を行って一文字に相当する
矩形データに整合化して、矩形データ格納手段（メモリ
（又はＨＤＤ）の一部）２６に格納させる。すなわち、
矩形データ格納手段２６には外部接矩形データと矩形デ
ータの両方が格納される。これら標準矩形データ作成手
段２７、標準矩形データ２８、及び矩形データ整合化手
段２９により、整合化処理手段１４が構成される。The rectangular data matching means 29 detects the standard rectangular data read from the standard rectangular data storage means 28, the circumscribed rectangular data read from the rectangular data storage means 26, and the character data portion of the image data. The specified data and the circumscribed rectangular data are processed individually, merged, and separated to be matched with rectangular data corresponding to one character, and are stored in the rectangular data storage means (a part of the memory (or HDD)) 26. Store. That is,
The rectangular data storage means 26 stores both externally connected rectangular data and rectangular data. The standardized rectangular data creating means 27, the standard rectangular data 28, and the rectangular data matching means 29 constitute the matching processing means 14.

【００１７】また、文字データ抽出手段３０はデータ抽
出処理手段１５であり、矩形データ格納手段２６より読
み出した矩形データに基づいて、イメージデータ格納手
段２２より読み出したイメージデータから正規の文字デ
ータを抽出する。この正規の文字データが文字認識手段
３１に供給されて、ここで予め格納されている参照文字
データと比較して文字認識を行う。ディスク３２は認識
した文字データを記憶し、ＣＲＴ３２は認識した文字デ
ータを文字として適宜表示を行う。この文字認識手段３
１、ディスク３２及びＣＲＴ３３により認識処理手段１
６が構成される。The character data extraction means 30 is the data extraction processing means 15, and extracts the normal character data from the image data read from the image data storage means 22 based on the rectangular data read from the rectangular data storage means 26. To do. This regular character data is supplied to the character recognizing means 31 and compared with the reference character data stored here in advance to perform character recognition. The disk 32 stores the recognized character data, and the CRT 32 appropriately displays the recognized character data as a character. This character recognition means 3
1, a disc 32 and a CRT 33 for recognition processing means 1
6 is configured.

【００１８】なお、行幅認識手段２３、外接矩形データ
作成手段２５、標準矩形データ作成手段２７、矩形デー
タ整合化手段２９、文字データ抽出手段３０、及び文字
認識手段３１はプログラム処理である。そこで、図３
に、図２の文字認識装置による文字切り出しの過程の説
明図を示す。図３において、まず、正規化処理を行う文
字行の検出を行い（ステップ（Ｓ１））、該当ラインで
文字データのうちの独立する構成部分を外接矩形データ
として検出する（Ｓ２）。この各独立部分の文字高さや
文字幅の統計処理による全部文字（ワードプロセッサ文
字の全角文字）を標準矩形データにより判別して抽出し
（Ｓ３）、この抽出以外の一部文字のうちの抽出可能な
記号を抽出する（Ｓ４）。また、Ｓ４で抽出した以外の
一部文字のうち、分裂文字の合併を行い（Ｓ５）、接触
文字部分の切り出しを行う（Ｓ６）。そして、上記以外
の文字データを半角文字として切り出して総ての切り出
しを行い（Ｓ７）、次ラインに移行して同様の切り出し
処理を繰り返す。The line width recognizing means 23, the circumscribing rectangular data creating means 25, the standard rectangular data creating means 27, the rectangular data matching means 29, the character data extracting means 30, and the character recognizing means 31 are program processes. Therefore, FIG.
FIG. 2 is an explanatory diagram of a process of character extraction by the character recognition device of FIG. In FIG. 3, first, a character line to be normalized is detected (step (S1)), and an independent component part of the character data on the corresponding line is detected as circumscribed rectangle data (S2). All the characters (full-width characters of word processor characters) obtained by the statistical processing of the character height and character width of each independent portion are discriminated and extracted by the standard rectangular data (S3), and some of the partial characters other than this extraction can be extracted. The symbol is extracted (S4). Further, among the partial characters other than those extracted in S4, the split characters are merged (S5), and the contact character portion is cut out (S6). Then, character data other than the above is cut out as half-width characters and all the cut-outs are performed (S7), the process moves to the next line, and the same cut-out processing is repeated.

【００１９】ここで、図４に図３の文字切り出しを行う
文書例の説明図を示すと共に、図５に図４の文字切り出
し過程の説明図を示す。図４（Ａ）に示すように、文書
４１に各行で文字が記録されており、例えば図４（Ｂ）
に示す１行目の「本製品は、湿度１０Here, FIG. 4 is an explanatory diagram of an example of a document for performing the character segmentation of FIG. 3, and FIG. 5 is an explanatory diagram of the character segmentation process of FIG. As shown in FIG. 4 (A), characters are recorded in each line in the document 41. For example, FIG. 4 (B)
The first line shown in "This product has a humidity of 10

【００２０】[0020]

【外１】 [Outside 1]

【００２１】３５％を」の文字切り出しを行うものと
し、図５に示すような切り出しが行われる。図５は、図
３におけるＳ１〜Ｓ７と対応するもので、まずＳ１１
（Ｓ１）のように１ラインの文字行の総てが検出され、
この文字行で独立した構成部分がＳ１２（Ｓ２）のよう
に検出される。この場合、「は」は分裂して検出され
る。It is assumed that the character segmentation of "35%" is performed, and the segmentation as shown in FIG. 5 is performed. FIG. 5 corresponds to S1 to S7 in FIG.
All the character lines of one line are detected as in (S1),
Independent parts of this character line are detected as in S12 (S2). In this case, the “ha” is detected in pieces.

【００２２】続いて、Ｓ１３（Ｓ３）に示すように上述
の統計処理（詳細は後述する）で全角文字が抽出され、
Ｓ１４（Ｓ４）に示すように全角文字以外の抽出可能な
記号「、」が抽出される。その後、Ｓ１５（Ｓ５）に示
すように分裂文字「は」の合併が行われると共に、Ｓ１
６（Ｓ６）に示すように接触文字の切り出し（この例で
は存在しない）が行われる。そして、上記切り出した以
外のものを半角文字としてＳ１７（Ｓ７）のように切り
出しが行われるものである。Subsequently, as shown in S13 (S3), full-width characters are extracted by the above-described statistical processing (details will be described later),
As shown in S14 (S4), the extractable symbols “,” other than the double-byte characters are extracted. After that, as shown in S15 (S5), the split character "ha" is merged and S1
6 (S6), the contact character is cut out (not present in this example). Then, the characters other than the cutout are used as half-width characters and the cutout is performed as in S17 (S7).

【００２３】次に、文字切り出しの詳細を説明する。図
６に、外接矩形データ作成の説明図を示す。前段階とし
て、イメージデータ格納手段２２のイメージデータが格
納されている領域以外に、ＲＡＭ上に文字データをバイ
ナリデータとして記憶する空のデータ領域２２ａ（大き
さは紙幅と同じ）と空のフラグ領域２２ｂとが用意され
る。図６では、例えば上記の「本」の場合について説明
する。図６に示すように、ドット列の１ライン毎にドッ
トが存在しないときは「０」とし、一旦ドットの存在が
確認されると「１」としてドット存在を検出する。Next, the details of character segmentation will be described. FIG. 6 shows an explanatory diagram of creating circumscribed rectangle data. As a preliminary step, in addition to the area where the image data of the image data storage means 22 is stored, an empty data area 22a (the size is the same as the paper width) and an empty flag area for storing character data as binary data on the RAM. 22b are prepared. In FIG. 6, for example, the case of the above-mentioned “book” will be described. As shown in FIG. 6, when there is no dot for each line of the dot row, it is set to "0", and once it is confirmed that there is a dot, "1" is detected.

【００２４】一旦ドットの存在が確認できると、行方向
の左端に相当するＲＡＭ上のアドレスを書いて、行の始
まりとみなし、フラグを立てる。フラグはドットの存在
を示すものであり、次々に行方向に１ラインずつチェッ
クを行った際に、ドットが確認できなくなると、そこで
フラグを下ろす。そして、フラグを下ろした際の行の左
端に相当するアドレスを格納する。Once the presence of dots can be confirmed, the address on the RAM corresponding to the left end in the row direction is written, regarded as the beginning of a row, and a flag is set. The flag indicates the existence of dots, and when the dots cannot be confirmed when the lines are successively checked one by one in the row direction, the flag is set there. Then, the address corresponding to the left end of the line when the flag is set is stored.

【００２５】一方、ラインをスキャンすると、ドットの
存在が検出されることから、ドットが現れた際に、「行
の左端からの距離」に相当するドットにも対応するビッ
トを立てる。これにより、図６のように、文字のドット
分布を行方向（用紙の縦方向）に見るような形で、デー
タが作成される。このデータが、行方向の文字の存在範
囲を示す行幅データとなり、また前述のドット開始アド
レスとドット終了アドレスとの差が文字幅となる。図６
のようなデータが、行を認識する毎に増えていく形とな
る。行幅データは行幅データ格納手段２４としてのＲＡ
Ｍ上の別領域（図示せず）に格納される。On the other hand, when a line is scanned, the presence of dots is detected. Therefore, when a dot appears, a bit corresponding to the "distance from the left end of the row" is set. As a result, as shown in FIG. 6, data is created such that the dot distribution of characters is viewed in the row direction (vertical direction of the paper). This data is line width data indicating the range of characters in the line direction, and the difference between the dot start address and the dot end address is the character width. Figure 6
Data such as will increase every time a line is recognized. The line width data is RA as the line width data storage means 24.
It is stored in another area (not shown) on M.

【００２６】なお、この処理によって、ノイズの多いデ
ータであっても、行データ毎のドットの存在率を見るこ
とにより、自ずとノイズを除去することが可能となる。
すなわち、行幅データの分布を調べることにより、極端
に行幅の少ないデータはノイズとみなすことができるも
のである。このようにノイズ成分を除去した後、改めて
行幅の平均値を取ると、文字高さの平均値が求められる
ものである。そして、上記行幅データと文字幅データを
組み合わせて外接矩形データが作成される。By this processing, it is possible to automatically remove the noise even if the data has a lot of noise by checking the dot existence rate for each row data.
That is, by examining the distribution of line width data, data with extremely small line width can be regarded as noise. After removing the noise component in this way, the average value of the line width is calculated again to obtain the average value of the character height. Then, the line width data and the character width data are combined to create circumscribed rectangle data.

【００２７】そこで、図７に、外接矩形データによる独
立部分検出の説明図を示す。また、図８に、外接矩形デ
ータのメモリ（又はＲＡＭ）への格納の説明図を示す。
図７は図４（Ｂ）に示した一行分の文字データを、図６
の方法で作成した外接矩形データの外接矩形領域を示し
たものである。そして、図７の外接矩形データが図８の
ようにメモリ（又はＲＡＭ）上にテーブルとして作成さ
れ、総ての独立部分に対して、先頭アドレス、オフセッ
ト（最大７とする）幅（文字幅）、全角文字フラグ、半
角文字フラグとして格納される。なお、全角文字フラグ
及び半角文字フラグは、後述する整合化処理で順次立て
られていくものである。Therefore, FIG. 7 shows an explanatory view of the independent portion detection by the circumscribed rectangle data. Further, FIG. 8 shows an explanatory view of storing the circumscribed rectangle data in the memory (or RAM).
FIG. 7 shows the character data for one line shown in FIG.
3 shows the circumscribed rectangular area of the circumscribed rectangular data created by the above method. Then, the circumscribed rectangle data of FIG. 7 is created as a table on the memory (or RAM) as shown in FIG. 8, and the head address and offset (maximum 7) width (character width) are set for all independent parts. , A full-width character flag and a half-width character flag are stored. The full-width character flag and the half-width character flag are sequentially set in the matching process described later.

【００２８】そこで、全角文字、分離文字、合併文字及
び半角文字の認識を行う整合化処理について説明する。
この場合、上述のように取得した文字高さの平均値をそ
のまま文字幅の平均値とみなしている。これは、特にワ
ードプロセッサ文書のような活字文字は、２バイト文字
（全角文字）については縦と横の比が略１：１であり、
活字文字の文字ピッチは略均等であると共に、活字文字
の高さは略平均であるということに基づくものである。The matching process for recognizing full-width characters, separated characters, merged characters and half-width characters will be described.
In this case, the average value of the character height obtained as described above is regarded as it is as the average value of the character width. This is especially true for type characters such as word processor documents, where the ratio of height to width is approximately 1: 1 for double-byte characters (double-byte characters).
This is based on the fact that the character pitches of printed characters are substantially uniform and the heights of printed characters are approximately average.

【００２９】全角文字の認識は、まず、文字幅データｍ
と文字幅データの平均値（すなわち文字高さの平均値）
ｈとの比ｘが、０．７＜ｘ＜１．４に合致するデータを
抜き出し、それらにフラグ（図８の全角文字フラグ）を
立てる。これはこの範囲に含まれる文字は正規の全角文
字であろう、という予測に基づいている。To recognize double-byte characters, first, character width data m
And average value of character width data (that is, average value of character height)
Data whose ratio x with h matches 0.7 <x <1.4 is extracted, and a flag (full-width character flag in FIG. 8) is set on them. This is based on the prediction that the characters contained in this range will be regular double-byte characters.

【００３０】全角文字以外の認識は、上記フラグを立て
た矩形データの文字幅のみを対象に平均値を取り、より
正確な文字幅の平均値Ｗを算出すると共に、最大値Ｗｍ
ａｘを抜き出す。上記フラグが立たなかった矩形データ
を対象に、上記で求めた平均値Ｗと最大値Ｗｍａｘとを
比較して、文字の分離処理を行う。すなわち、Ｗ≦ｍ≦
Ｗｍａｘの場合は１文字として抽出し、２Ｗ≦ｍ≦２Ｗ
ｍａｘの場合は２文字に分離して抽出する。そして、ｍ
＞２Ｗｍａｘの場合は、（ｍ／Ｗｍａｘ）の整数部分の
数の文字に分離しこれらの抽出処理で認識された外接矩
形データにフラグを立てる。そして、未だフラグの立た
ないものに対するデータは、ｍ＜Ｗの条件に合致してい
ることになる。すなわち全角文字の１文字分の幅に満た
ない外接矩形データである。For recognition of characters other than full-width characters, an average value is calculated only for the character width of the flagged rectangular data, a more accurate average value W of the character width is calculated, and the maximum value Wm is calculated.
Extract ax. For the rectangular data for which the flag is not set, the average value W and the maximum value Wmax obtained above are compared to perform character separation processing. That is, W ≦ m ≦
In case of Wmax, it is extracted as one character, and 2W ≦ m ≦ 2W
In case of max, it is separated into two characters and extracted. And m
When> 2 Wmax, the circumscribed rectangle data recognized by the extraction process is separated by dividing the character into the integer part of (m / Wmax) and a flag is set. Then, the data for the ones for which the flag is not yet set matches the condition of m <W. That is, the circumscribed rectangle data is less than the width of one full-width character.

【００３１】ここで、図９に、全角文字以下の外接矩形
データの認識の説明図を示す。図９（Ａ）は、説明上例
えば「馴」の文字を示したもので、独立部分の文字幅ｍ
（＜Ｗ）の外接矩形に対して、隣接する外接矩形データ
との距離をＷ₁，Ｗ₂，…Ｗ _n（図９ではＷ₁〜Ｗ₄）
として、Ｗ₁≧Ｗｍａｘの場合は記号であると思われる
ことから１文字として抽出する。また、Ｗ≦Ｗ_n≦Ｗｍ
ａｘ（ｎ＝２，３，４，…）の場合、Ｗ_nまでを１文字
として抽出する。図９（Ａ）ではＷ₄までを１文字とし
て抽出することが示される。Here, FIG. 9 shows a circumscribed rectangle of full-width characters or less.
An explanatory view of recognition of data is shown. FIG. 9A shows an example for explanation.
For example, it shows a "custom" character, and the character width m of the independent part
The circumscribed rectangle data adjacent to the circumscribed rectangle of (<W)
Distance to W₁, W₂, ... W _n(W in FIG. 9₁~ W_Four)
As W₁If ≧ Wmax, it seems to be a symbol
Therefore, it is extracted as one character. Also, W ≦ W_n≤ Wm
In the case of ax (n = 2, 3, 4, ...), W_nUp to 1 character
To extract. W in FIG. 9 (A)_FourUp to 1 character
It is shown to extract.

【００３２】これを、図９（Ｂ）に示すように、「は」
に当てはめると、Ｗ₂までが１文字として抽出されるこ
とになるものである。従って、図８の（４），（５）に
示すデータは、整合化により図９（Ｃ）に示すように全
角文字として認識されてフラグが立てられて矩形データ
整合化手段２９より矩形データ格納手段２６に矩形デー
タとして書き込みが行われる。As shown in FIG. 9B, this is "ha".
Then, up to W ₂ will be extracted as one character. Therefore, the data shown in (4) and (5) of FIG. 8 are recognized as full-width characters as shown in FIG. 9 (C) by matching and are flagged, and the rectangular data matching means 29 stores the rectangular data. Writing to the means 26 is performed as rectangular data.

【００３３】以上の処理でフラグが立たなかった外接矩
形データは、半角英数文字等の、漢字とは異なる規則の
文字と判断して独立文字として抽出してフラグ（半角文
字フラグ）が立てられて、書き込みが行われる。図８で
は、「１」，「０」，「３」，「５」が１文字として半
角文字フラグが立てられるものである。The circumscribed rectangle data which has not been flagged by the above processing is judged as a character having a rule different from Kanji such as half-width alphanumeric characters and is extracted as an independent character and a flag (half-width character flag) is set. Is written. In FIG. 8, "1", "0", "3", and "5" are set as one character and a half-width character flag is set.

【００３４】このように、外接矩形データが修正されて
矩形データとして矩形データ格納手段２６に書き込まれ
て格納されるものである。そして、文字データ抽出手段
３０において上記矩形データに基づいて、イメージデー
タ格納手段２２から読み出したイメージデータから文字
データを抽出し、後段の文字認識手段３１が認識し易い
ように適宜拡大又は縮小の正規化処理を行うための倍率
を決めて該文字認識手段３１に処理を移すものである。In this way, the circumscribed rectangular data is modified and written and stored as rectangular data in the rectangular data storage means 26. Then, the character data extraction unit 30 extracts character data from the image data read from the image data storage unit 22 based on the rectangular data, and appropriately enlarges or reduces the character data so that the character recognition unit 31 in the subsequent stage can easily recognize the character data. The scaling factor for the conversion processing is determined and the processing is transferred to the character recognition means 31.

【００３５】このように、分離文字や合併文字を最適に
正規化処理することができ、しかもプログラム処理が大
きな処理でないことから、高速処理を実現することがで
きるものである。また、一行内で異なる大きさの文字フ
ォントが使われている文書の場合は、一行の中を同じ大
きさの文字同士で一旦分離してから処理を行うことによ
り、最適に正規化処理を行うことができる。例えば、一
行分の文字高さ位置をフラグ領域２２ｂの各ドット（フ
ラグ）毎に順次格納しておき、これに対して正規分布等
の統計処理を行うことにより、複数の大きさの文字フォ
ントを使用した場合の各々のフォントの文字高さを得る
ことができるものである。In this way, the separated characters and the merged characters can be optimally normalized, and since the program processing is not large, high-speed processing can be realized. In addition, in the case of a document in which character fonts of different sizes are used in one line, normalization processing is optimally performed by first separating the characters in one line with characters of the same size and then processing. be able to. For example, a character height position for one line is sequentially stored for each dot (flag) of the flag area 22b, and statistical processing such as normal distribution is performed on the character height position to generate character fonts of a plurality of sizes. The character height of each font when used can be obtained.

【００３６】[0036]

【発明の効果】以上のように本発明によれば、入力手段
に取り込まれたイメージデータより外接矩形データを矩
形データ処理手段で作成し、該外接矩形データを整合化
処理手段で単独、合併、分離の整合化の処理を行って、
矩形データとして該矩形データ処理手段に一担格納し、
該矩形データに基づいてイメージデータより正規の文字
データをデータ抽出処理手段により抽出した後に、該文
字データより認識処理手段で文字認識を行うことによ
り、分離文字や合併文字等に対して最適な正規化処理が
行われ、誤認識が減少して高速化を図ることができる。As described above, according to the present invention, the circumscribed rectangular data is created by the rectangular data processing means from the image data taken in the input means, and the circumscribed rectangular data is independently or merged by the matching processing means. By performing the process of separation matching,
The rectangular data is stored in the rectangular data processing means as a rectangular data,
After the normal character data is extracted from the image data based on the rectangular data by the data extraction processing means, the recognition processing means performs character recognition from the character data, so that the optimum normal character for separated characters and merged characters can be obtained. As a result of the conversion processing being performed, erroneous recognition is reduced and speeding up can be achieved.

[Brief description of drawings]

【図１】本発明の原理構成図である。FIG. 1 is a principle configuration diagram of the present invention.

【図２】本発明の一実施例の機能ブロック図である。FIG. 2 is a functional block diagram of an embodiment of the present invention.

【図３】図２の文字認識装置による文字切出しの過程の
説明図である。FIG. 3 is an explanatory diagram of a process of extracting a character by the character recognition device of FIG.

【図４】図３の文字切出しを行なう文書例の説明図であ
る。FIG. 4 is an explanatory diagram of an example of a document that cuts out characters in FIG.

【図５】図４の文字切出し過程の説明図である。FIG. 5 is an explanatory diagram of a character cutting process of FIG.

【図６】外接矩形データ作成の説明図である。FIG. 6 is an explanatory diagram of creating circumscribed rectangle data.

【図７】外接矩形[Fig. 7] Bounding rectangle

【図８】外接矩形データのメモリへの格納の説明図であ
る。FIG. 8 is an explanatory diagram of storing circumscribed rectangle data in a memory.

【図９】全角文字以外の外接矩形データの認識の説明図
である。FIG. 9 is an explanatory diagram of recognition of circumscribed rectangle data other than full-width characters.

[Explanation of symbols]

１１文字認識装置１２入力手段１３矩形データ処理手段１４整合化処理手段１５データ抽出処理手段１６認識処理手段２１文字イメージ入力手段２２イメージデータ格納手段２３行幅認識手段２４行幅データ格納手段２５外接矩形データ作成手段２６矩形データ格納手段２７標準矩形データ格納手段２８標準矩形データ整合化手段２９矩形データ整合化手段３０文字データ抽出手段３１文字認識手段 11 Character Recognition Device 12 Input Means 13 Rectangular Data Processing Means 14 Matching Processing Means 15 Data Extraction Processing Means 16 Recognition Processing Means 21 Character Image Input Means 22 Image Data Storage Means 23 Line Width Recognition Means 24 Line Width Data Storage Means 25 Circumscribing Rectangle Data creating means 26 Rectangular data storing means 27 Standard rectangular data storing means 28 Standard rectangular data matching means 29 Rectangular data matching means 30 Character data extracting means 31 Character recognition means

Claims

[Claims]

1. Input means for capturing character data of a document as image data, and circumscribing rectangle data of independent constituent parts of the character data is created and stored from the image data from the input means. Rectangular data processing means in which rectangular data matched based on the circumscribed rectangular data is written, and a predetermined data detected from the character data portion of the image data based on the circumscribed rectangular data read from the rectangular data processing means. A matching processing unit that matches the rectangular data corresponding to one character and stores it in the rectangular data processing unit by performing independent processing, merging, and separation processing using the data, and the rectangular data of the matching processing unit. And data extraction processing means for extracting the regular character data from the image data taken in by the input means, A recognition processing means for performing character recognition from the regular character data from the character data extraction processing means.