JPH0785215A

JPH0785215A - Character recognizing device

Info

Publication number: JPH0785215A
Application number: JP5252268A
Authority: JP
Inventors: Kazumasa Nakazawa; 一政中澤; Hideo Wakabayashi; 英夫若林
Original assignee: Japan Digital Laboratory Co Ltd
Current assignee: Japan Digital Laboratory Co Ltd
Priority date: 1993-09-14
Filing date: 1993-09-14
Publication date: 1995-03-31

Abstract

PURPOSE:To provide a character recognizing device capable of accurately reproducing the format image of a read original by segmenting the positions, sizes and ruled fines of characters together with the characters at the time of reading out an original and converting the segmented data into the one to be utilized by a character editing device or the like. CONSTITUTION:In this character recognizing device, an original is read out by an image scanner 1 and stored in an image memory 2 as image data. When an operator specifies an area, a CPU 3 executes layout analysis, character recognition and data conversion as recognizing processing. Ira the layout analysis, whether the specified area is a character field or a table field is judged. In the case of a table field table ruled line (coordinate) is segmented. In the case of a character field, a character string is extracted, the character field is segmented in each line and its inclination is corrected (similar processing is executed also when a character string exists in a table field). In the character recognition, recognition dictionary data are loaded from a storage device to a work memory 5 and characters in the specified area are recognized. In the data conversion processing, the result of the layout analysis is collated with that of the character recognition, processed data are converted into data to be edited by a document editing device and the converted data are stored in a recognized data file as recognized data.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は文字認識装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a character recognition device.

【０００２】[0002]

【従来の技術】従来の文字認識装置は、活字文書や手書
き文書の指定された範囲の文字部分を切り出して１文字
のドットイメージで読取り、その読み取ったデータを解
析してから文字認識用の辞書から適切な文字を探し出
し、その文字を自動的に文字コード（テキストデータ）
に変換していた。2. Description of the Related Art A conventional character recognition device is a dictionary for character recognition in which a character portion within a designated range of a print document or a handwritten document is cut out and read as a dot image of one character, and the read data is analyzed. Find the appropriate character from the, and automatically character code that character (text data)
Had been converted to.

【０００３】[0003]

【発明が解決しようとする課題】上述したような従来技
術においてはコード変換されたデータを文字編集装置
（或いは、コード変換された文字コード（データ）を当
該文字認識装置の編集機能（以下、文字編集装置等と記
す）で修正・追加等の編集操作をするため文字編集装置
等）で呼び出した場合に、文字データのみが表示され読
取り原稿のイメージ（文字の位置やサイズ、或いは罫線
や表）を再現することができないため再度修正や編集を
し直さなければならないという問題点があった。In the prior art as described above, the code-converted data is converted into a character editing device (or the code-converted character code (data) is converted into an editing function of the character recognition device (hereinafter referred to as a character). When referred to by a character editing device, etc.) for performing editing operations such as corrections and additions), only character data is displayed and the image of the read document (position and size of characters, ruled lines or table) is displayed. There was a problem in that since it could not be reproduced, it had to be corrected or edited again.

【０００４】即ち、従来の文字認識装置では、文書の文
字コードのみの変換を行っており書式レイアウトの再現
（文字のサイズ、文字の位置、罫線）ができなかった。
そこで、従来は書式レイアウトの再現には文字コード変
換後のデータ（認識データ）をワープロなどで編集し
て、文字の大きさや行ピッチ及び罫線等の書式等を再現
することを必要としていた。That is, in the conventional character recognition apparatus, only the character code of the document is converted, and the format layout cannot be reproduced (character size, character position, ruled line).
Therefore, conventionally, in order to reproduce the format layout, it has been necessary to edit the data (recognition data) after the character code conversion with a word processor or the like to reproduce the character size, line pitch, ruled line, and other formats.

【０００５】本発明は上記問題点に鑑みてなされたもの
であり、原稿読取りの際に文字と一緒に文字の位置やサ
イズ及び罫線、或いは表を切り出し、データとして読み
込み文字編集装置等で利用できるデータ（文書編集装置
編集データ）に変換して文字編集装置等で呼び出した場
合、読取り原稿そのままの書式イメージを再現すること
により再度の修正や編集（文字の位置合わせやサイズ変
更・罫線（表）の入力）を必要とない文字認識装置を提
供することを目的とする。The present invention has been made in view of the above problems, and when reading a document, the character position, size, ruled line, or table is cut out together with the character and can be used as data in a character editing device or the like. When it is converted to data (editing data by the text editing device) and called by a text editing device, the format image of the read original is reproduced to make corrections and edits again (character alignment, size change, ruled lines (table)). It is an object of the present invention to provide a character recognition device that does not require input).

【０００６】[0006]

【課題を解決するための手段】上記の目的を達成するた
めに第１の発明の文字認識装置は、記入或いは印字され
た文字および／または罫線からなる表を含む文書を読み
取ってイメージデータとして出力するイメージリーダー
と、イメージデータを記憶するイメージメモリと、認識
処理手段による認識処理を実行する処理装置と、文字認
識に用いる認識辞書データを格納する認識辞書データ格
納装置と、前記認識処理により得た認識データを格納す
る格納装置と、操作情報を入力する入力装置と、を有す
る文字認識装置において、認識処理手段が、イメージデ
ータの領域を入力装置により指定する領域指定手段と、
領域指定手段により指定された指定領域ごとに当該指定
領域のレイアウトを解析してレイアウト解析結果を得る
レイアウト解析手段と、領域指定手段により指定された
指定領域の中の文字フィールドの文字列を認識辞書デー
タに基づいて文字認識して文字認識結果を得る文字認識
手段と、文字認識結果とレイアウト解析結果を合成して
書式情報を付加し、書式文書として認識データを作成す
るデータ変換手段と、を有することを特徴とする。In order to achieve the above object, the character recognition device of the first invention reads a document including a table composed of written or printed characters and / or ruled lines and outputs it as image data. An image reader for storing image data, a processing device for executing recognition processing by the recognition processing means, a recognition dictionary data storage device for storing recognition dictionary data used for character recognition, and a recognition device obtained by the recognition processing. In a character recognition device having a storage device for storing recognition data and an input device for inputting operation information, a recognition processing means, an area designating means for designating a region of image data by the input device,
Layout analysis means for analyzing the layout of the designated area for each designated area designated by the area designating means to obtain a layout analysis result, and a dictionary for recognizing character strings of character fields in the designated area designated by the area designating means Character recognition means for recognizing characters based on data to obtain character recognition results, and data conversion means for synthesizing character recognition results and layout analysis results to add format information and creating recognition data as format documents It is characterized by

【０００７】第２の発明は上記第１の発明の文字認識装
置において、レイアウト解析手段が、指定領域のフィー
ルド種別を判別するフィールド種別判別手段と、上記フ
ィールド種別が罫線または表フィールドのとき、罫線を
切り出す罫線切り出し手段と、上記フィールド種別が文
字列を含む表フィールドのとき、該表フィールドから文
字列を切り出す第１の文字列切り出し手段と、上記フィ
ールド種別が文字フィールドのとき、該文字フィールド
から文字列を切り出す第２の文字列切り出し手段と、を
有することを特徴とする。According to a second aspect of the present invention, in the character recognition apparatus according to the first aspect of the present invention, the layout analysis means determines the field type of the designated area, and the ruled line when the field type is a ruled line or a table field. A ruled line cutting-out means for cutting out a character string, a first character string cutting-out means for cutting out a character string from the table field when the field type is a character field, and a character field when the field type is a character field A second character string slicing means for slicing a character string.

【０００８】第３の発明は上記第１の発明の文字認識装
置において、レイアウト解析手段が、文字または文書全
体の傾きを検出する傾き検出手段と、指定領域のフィー
ルド種別を判別するフィールド種別判別手段と、上記フ
ィールド種別が表フィールドのとき、罫線を切り出す罫
線切り出し手段と、上記フィールド種別が文字列を含む
表フィールドのとき、該表フィールドから文字列を切り
出し、該文字列から１行分毎の文字列を切り出す第１の
文字列切り出し手段と、上記フィールド種別が文字フィ
ールドのとき、該文字フィールドから文字列を切り出
し、該文字フィールドから１行分毎の文字列を切り出す
第２の文字列切り出し手段と、文字列１行毎に傾きを補
正する傾き補正手段と、を有することを特徴とする。According to a third aspect of the present invention, in the character recognition apparatus of the first aspect, the layout analysis means detects the inclination of the character or the entire document, and the field type determination means which determines the field type of the designated area. When the field type is a table field, a ruled line cutting-out means for cutting out a ruled line, and when the field type is a table field including a character string, a character string is cut out from the table field and every one line is extracted from the character string. First character string cutting means for cutting out a character string, and second character string cutting out for cutting out a character string from the character field and cutting out a character string for each line from the character field when the field type is the character field And a tilt correction means for correcting the tilt for each line of the character string.

【０００９】第４の発明は上記第１の発明の文字認識装
置において、書式文書として編集可能な認識データが、ａ．罫線データ長と、少なくとも、罫線種別コード、開
始座標、及び終了座標とからなる個別罫線情報の組の繰
返しからなる罫線情報と、ｂ．文字データ長、文字ピッチ、行間隔を示すコード
と、少なくとも、制御コード及び文字コードの組の繰返
しからなる文字情報と、を含むことを特徴とする。According to a fourth aspect of the present invention, in the character recognition device according to the first aspect of the present invention, the recognition data editable as a format document is a. Ruled line information which is a repetition of a set of ruled line data and at least a set of individual ruled line information including a ruled line type code, a start coordinate, and an end coordinate, and b. It is characterized in that it includes a code indicating a character data length, a character pitch, and a line interval, and at least character information including a repetition of a set of a control code and a character code.

【００１０】第５の発明は上記第１ないし第４の発明の
文字認識装置において、更に、認識データを格納装置に
格納する認識データ格納手段を有することを特徴とす
る。A fifth aspect of the present invention is the character recognition apparatus according to any of the first to fourth aspects, further comprising a recognition data storage means for storing the recognition data in a storage device.

【００１１】第６の発明は上記第１ないし第４の発明の
文字認識装置において、更に、認識データを格納装置に
格納する認識データ格納手段と、イメージデータを格納
装置に格納するイメージデータ格納手段を有することを
特徴とする。A sixth aspect of the present invention is the character recognition device according to any one of the first to fourth aspects, further comprising a recognition data storage means for storing the recognition data in a storage device and an image data storage means for storing the image data in the storage device. It is characterized by having.

【００１２】[0012]

【作用】上記構成により第１の発明の文字認識装置は、
認識処理手段が、領域指定手段により入力装置でイメー
ジデータの領域を指定し、レイアウト解析手段により領
域指定手段で指定された指定領域ごとに当該指定領域の
レイアウトを解析してレイアウト解析結果を得て、文字
認識手段により領域指定手段により指定された指定領域
の中の文字フィールドの文字列を認識辞書データに基づ
いて文字認識して文字認識結果を得て、データ変換手段
により文字認識結果とレイアウト解析結果を合成して書
式情報を付加し、書式文書として認識データを作成す
る。With the above construction, the character recognition device of the first invention is
The recognition processing means specifies the area of the image data with the input device by the area specifying means, analyzes the layout of the specified area for each specified area specified by the area specifying means by the layout analyzing means, and obtains a layout analysis result. The character recognition unit obtains a character recognition result by performing character recognition on the character string of the character field in the designated area designated by the area designation unit based on the recognition dictionary data, and the character conversion result and layout analysis by the data conversion unit. The results are combined and format information is added to create recognition data as a format document.

【００１３】第２の発明は上記第１の発明の文字認識装
置において、レイアウト解析手段が、フィールド種別判
別手段により指定領域のフィールド種別を判別し、フィ
ールド種別に基づいてデータ列の切り出しを行う。この
場合フィールド種別が表フィールドのときは罫線切り出
し手段により罫線を切り出し、文字列を含む表フィール
ドのときは第１の文字列切り出し手段によりその表フィ
ールドから文字列を切り出し、フィールド種別が文字フ
ィールドのときは第２の文字列切り出し手段によりその
文字フィールドから文字列を切り出す。According to a second aspect of the present invention, in the character recognition apparatus according to the first aspect, the layout analyzing means discriminates the field type of the designated area by the field type discriminating means and cuts out a data string based on the field type. In this case, when the field type is a table field, the ruled line cutting means cuts out a ruled line, and when the field type includes a character string, the first character string cutting means cuts out a character string from the table field. In this case, the character string is cut out from the character field by the second character string cutting means.

【００１４】第３の発明は上記第１の発明の文字認識装
置において、レイアウト解析手段が、傾き検出手段によ
り文字または文書全体の傾きを検出し、フィールド種別
判別手段により指定領域のフィールド種別を判別する。
そして、フィールド種別が表フィールドのときは罫線を
切り出し手段により罫線を切り出し、文字列を含む表フ
ィールドのときは第１の文字列切り出し手段によりその
表フィールドから文字列を切り出した後、その文字列か
ら１行分毎の文字列を切り出し、文字フィールドのとき
は第２の文字列切り出し手段によりその文字フィールド
から文字列を切り出した後、その文字フィールドから１
行分毎の文字列を切り出す。更に、傾き補正手段により
文字列１行毎に文字の傾きを補正する。According to a third aspect of the present invention, in the character recognition apparatus of the first aspect, the layout analysis means detects the inclination of the character or the entire document by the inclination detection means, and the field type determination means determines the field type of the designated area. To do.
Then, when the field type is a table field, the ruled line is cut out by the cutting out means, and when the field type includes a character string, the character string is cut out from the table field by the first character string cutting out means, and then the character string A character string for each line from the character field, and in the case of a character field, the character string is cut out from the character field by the second character string cutting means, and then 1 from the character field.
Cut out a character string for each line. Further, the inclination correction means corrects the inclination of the character for each line of the character string.

【００１５】第４の発明は上記第１の発明の文字認識装
置において、データ変換手段レイアウト解析結果と認識
結果に基づいて、ａ．罫線データ長と、少なくとも、罫線種別コード、開
始座標、及び終了座標とからなる個別罫線情報の組の繰
返しからなる罫線情報と、ｂ．文字データ長、文字ピッチ、行間隔を示すコード
と、少なくとも、制御コード及び文字コードの組の繰返
しからなる文字情報と、を含む、書式文書として編集可能な認識データを作成す
る。A fourth aspect of the present invention is the character recognition device according to the first aspect of the present invention, wherein a. Ruled line information which is a repetition of a set of ruled line data and at least a set of individual ruled line information including a ruled line type code, a start coordinate, and an end coordinate, and b. A recognition data editable as a format document is created, which includes a code indicating a character data length, a character pitch, and a line spacing, and at least character information including a repetition of a set of a control code and a character code.

【００１６】第５の発明は上記第１ないし第４の発明の
文字認識装置において、更に、認識データ格納手段によ
り認識データを格納装置に格納する。A fifth aspect of the present invention is the character recognition device according to any one of the first to fourth aspects, wherein the recognition data storage means further stores the recognition data in the storage device.

【００１７】第６の発明は上記第１ないし第４の発明の
文字認識装置において、更に、認識データ格納手段によ
り認識データを、イメージデータ格納手段によりイメー
ジデータを格納装置に格納する。According to a sixth aspect of the present invention, in the character recognition device according to the first to fourth aspects, the recognition data storage means stores the recognition data, and the image data storage means stores the image data in the storage device.

【００１８】[0018]

【実施例】図１は、本発明の文字認識装置の一実施例の
構成図であり、図２は本発明の文字認識装置の他の実施
例の構成図である。図２の装置は文字（及び書式）原稿
を連続的に読み取って認識データを保管装置に保管し、
読取り処理終了後に一括して文字編集装置等（図示せ
ず）で修正するパッチ処理を行う機能を有する。1 is a block diagram of an embodiment of the character recognition device of the present invention, and FIG. 2 is a block diagram of another embodiment of the character recognition device of the present invention. The apparatus of FIG. 2 continuously reads character (and format) originals and stores recognition data in a storage device.
After the reading process is completed, it has a function of collectively performing a patch process for correcting with a character editing device or the like (not shown).

【００１９】図１及び図２で、１は活字原稿或いは手書
き原稿の文字を読み取るイメージスキャナ（画像読取り
装置）、２はイメージスキャナで読み取ったイメージデ
ータを一時的に保存（記憶）するイメージメモリ、３は
文字認識装置の動作の制御及び後述する認識処理手段に
よる認識処理の実行等を行うＣＰＵ（中央演算装置）、
４は認識処理により認識された認識データ及びイメージ
メモリ２からのイメージデータを格納する記憶（格納）
装置、５はＣＰＵ３による認識処理等において必要とす
る各処理手段（プログラム）、データ、辞書データ、あ
るいは設定値ないしパラメータ等を一時的に記憶するワ
ークメモリ（内部メモリ）、６はイメージリーダ１で読
み取った文字を認識するための認識辞書データからなる
認識辞書ファイルを格納した格納装置、７は文字認識装
置の操作に必要な操作コマンド、メニュー或いはアイコ
ン等の操作情報を表示する表示装置、８は文字認識装置
の操作に必要な操作情報を入力するキーボード、或いは
マウス等のポインティングデバイスを有する入力装置で
ある。In FIGS. 1 and 2, reference numeral 1 is an image scanner (image reading device) for reading characters of a printed original or handwritten original, 2 is an image memory for temporarily storing (storing) image data read by the image scanner, Reference numeral 3 denotes a CPU (central processing unit) that controls the operation of the character recognition device and executes recognition processing by the recognition processing means described later.
4 is a memory for storing the recognition data recognized by the recognition process and the image data from the image memory 2.
A device 5 is a work memory (internal memory) for temporarily storing each processing means (program), data, dictionary data, or set values or parameters required for recognition processing by the CPU 3, and 6 is an image reader 1. A storage device that stores a recognition dictionary file composed of recognition dictionary data for recognizing the read characters, 7 is a display device that displays operation information necessary for operating the character recognition device, operation information such as menus or icons, and 8 is The input device has a keyboard or a pointing device such as a mouse for inputting operation information necessary for operating the character recognition device.

【００２０】また、図２で、９は出力装置（プリンタ或
いは認識データを格納する磁気ディスク）である。な
お、格納装置４と６は物理的に同じ装置でもよい。ま
た、実施例ではレイアウト解析手段、文字認識手段、デ
ータ変換手段等の認識処理手段はプログラム手段として
格納装置４に格納されており、文字認識装置の起動時に
読み出されワークメモリ５に装置が駆動の間保持され、
ＣＰＵ３により適時実行される。Further, in FIG. 2, 9 is an output device (printer or magnetic disk for storing recognition data). The storage devices 4 and 6 may be physically the same device. Further, in the embodiment, the recognition processing means such as the layout analysis means, the character recognition means, the data conversion means, etc. are stored in the storage device 4 as the program means, and are read out when the character recognition device is activated and the devices are driven into the work memory 5. Held for
It is executed by the CPU 3 in a timely manner.

【００２１】図３は文字読取り装置の動作ブロック図で
あり、図１１は読取り原稿の例である。以下、図３及び
図１１に従って本発明の文字読取り装置の動作について
説明する。なお、図１１では説明上、文章（文字群）１
１１と表１１２を原稿の上下に示してあるが、表１１２
の中に文字が書かれている場合も本発明を適用できる。FIG. 3 is an operation block diagram of the character reading device, and FIG. 11 is an example of a read document. The operation of the character reading device of the present invention will be described below with reference to FIGS. In addition, in FIG. 11, for the sake of explanation, a sentence (character group) 1
11 and Table 112 are shown above and below the document, but Table 112
The present invention can be applied to the case where a character is written inside.

【００２２】設定ブロック３０では認識条件の設定を行
う。認識条件としては、例えば原稿の種類（例えば、ワ
ープロ印字、書籍（印刷文字）等の別）、文字方向の判
別条件（自動判別、手動等）、書式再現の可否、罫線再
現の可否、表内文字再現の可否等があり、これらの条件
は設定時に表示装置７に示され、操作者によりマウスで
指定され入力される。入力された各設定条件はＣＰＵ３
でチェックされた後にワークメモリ５の所定のアドレス
に記憶される。読取りブロック３１ではイメージスキャ
ナ１にセットされた原稿１１０（図１１参照）を読取り
バッファ（５４Ｋバイト）単位で読取りながらイメージ
メモリ２にイメージデータを描画し、同時にイメージデ
ータを原稿全体ごと表示装置７に表示する。The setting block 30 sets recognition conditions. The recognition conditions include, for example, the type of document (for example, word processing printing, book (printed characters), etc.), character direction determination conditions (automatic determination, manual operation, etc.), whether or not format can be reproduced, whether ruled lines can be reproduced, or in the table. Whether or not characters can be reproduced, etc., and these conditions are displayed on the display device 7 at the time of setting, and are specified and input by the operator with the mouse. Each input setting condition is CPU3
It is stored in a predetermined address of the work memory 5 after being checked in. In the reading block 31, the image data is drawn in the image memory 2 while reading the original 110 (see FIG. 11) set in the image scanner 1 in units of a reading buffer (54 Kbytes), and at the same time, the entire image of the original is displayed on the display device 7. indicate.

【００２３】次に、領域指定ブロック３２では領域指定
手段により画面上に表示された矩形の始点、終点の位置
を入力装置８のマウスで指定し入力する。図１２は図１
１の原稿１１０のイメージ表示であり領域指定を行った
例である。図１２では指定領域として２つの領域Ｆ１
（ＦＩＸ１，ＦＩＹ１）、Ｆ２（Ｆ２Ｘ１、Ｆ２Ｙ１）
が示されている。Next, in the area designating block 32, the position of the starting point and the ending point of the rectangle displayed on the screen by the area designating means are designated by the mouse of the input device 8 and input. FIG. 12 shows FIG.
This is an example of image display of one document 110 and area designation. In FIG. 12, two areas F1 are designated areas.
(FIX1, FIY1), F2 (F2X1, F2Y1)
It is shown.

【００２４】この場合、領域は領域指定を行った順に認
識される。なお、領域指定をしない場合は原稿全体が一
領域として指定されたことになる。認識ブロック３３で
は図４のフローチャートに基づいてレイアウト解析、文
字認識、データ変換等の処理が実行される。なお、図５
は図４のレイアウト解析（ステップＳ３）の詳細フロー
チャート、図６は図４の文字認識（ステップＳ１３）の
詳細フローチャート、図７は図４のデータ変換（ステッ
プＳ１５）の詳細フローチャートである。In this case, the areas are recognized in the order in which the areas are designated. When the area is not designated, the entire original is designated as one area. In the recognition block 33, processes such as layout analysis, character recognition, and data conversion are executed based on the flowchart of FIG. Note that FIG.
4 is a detailed flowchart of the layout analysis (step S3) of FIG. 4, FIG. 6 is a detailed flowchart of the character recognition (step S13) of FIG. 4, and FIG. 7 is a detailed flowchart of the data conversion (step S15) of FIG.

【００２５】図４でステップＳ１，Ｓ２は上述した領域
指定ブロックによる領域確保の動作であり、説明を省略
する。ステップＳ３ではＣＰＵ３は図５に示すレイアウ
ト解析手段を実行して設定ブロック３０で設定されワー
クメモリ５に記憶されている各設定条件と領域設定ブロ
ック３２で指定された領域の座標を基にレイアウト（la
y-out）解析を行う。レイアウト解析は図５のフローチ
ャートに示すように最初にステップＳ３０の前処理で未
知データのうち黒点の集まっている領域を探し出し矩形
で囲む（内部的処理）。In FIGS. 4A and 4B, steps S1 and S2 are the operation of securing the area by the above-mentioned area designating block, and the description thereof is omitted. In step S3, the CPU 3 executes the layout analysis unit shown in FIG. 5 to perform layout (based on the setting conditions set in the setting block 30 and stored in the work memory 5 and the coordinates of the area specified in the area setting block 32). la
y-out) Perform analysis. In the layout analysis, as shown in the flowchart of FIG. 5, first, in the preprocessing of step S30, a region where black dots are gathered in unknown data is searched for and enclosed by a rectangle (internal processing).

【００２６】次に、ステップＳ３１で文字の傾きをチェ
ックする。±５°以上の場合は読取りやり直しとする。
文字の傾きは図８に示すように原稿の位置が正常であっ
て原稿内の文字そのものが傾いている場合と、図９に示
すように原稿の傾きに困るもの及び両者が組合わさった
ものがある。Next, in step S31, the inclination of the character is checked. If ± 5 ° or more, read again.
As shown in FIG. 8, there are two types of inclination of characters: the case where the position of the original is normal and the characters themselves are inclined, and the case where the original is not inclined as shown in FIG. 9 and a combination of both. is there.

【００２７】原稿が傾いている場合には図９（ａ）のよ
うに左上座標（ｘ１，ｙ１）、左下座標（ｘ２，ｙ２）
だけでは傾きの判定が難しいので、図９（ｂ）に示すよ
うに左下座標（ｘ１，ｙ４）、右上座標（ｘ２，ｙ３）
を用いて傾きを判定し全体として右下がりか左下がりか
を判定する。傾きチェックがＯＫの場合、図５のステッ
プＳ３２で読取り原稿の指定領域が文字フィールドか表
フィールドかの判別を行う。表フィールドが判別された
場合にはステップＳ３３〜Ｓ３５の処理を行った後にス
テップＳ３６に移り、表フィールドでない場合にはステ
ップＳ３６の処理に移る。表フィールドか否かの判定は
線分が所定の大きさより大きい（長い）場合に罫線の抽
出を行いその数が所定数を越える時に表と判定する。ま
た、罫線の抽出はラングレス（黒点の連絡長）による。When the document is tilted, the upper left coordinates (x1, y1) and the lower left coordinates (x2, y2) as shown in FIG. 9A.
Since it is difficult to determine the inclination only by itself, the lower left coordinates (x1, y4) and the upper right coordinates (x2, y3) as shown in FIG. 9B.
Is used to determine the inclination, and whether it is falling to the right or to the left is determined as a whole. If the tilt check is OK, it is determined in step S32 of FIG. 5 whether the designated area of the read document is a character field or a front field. If the table field is discriminated, the process proceeds to steps S33 to S35 and then moves to step S36. If it is not the table field, the process proceeds to step S36. Whether the field is a table field is determined by extracting ruled lines when the line segment is larger (longer) than a predetermined size, and when the number exceeds a predetermined number, it is determined as a table. In addition, ruled lines are extracted by rungless (black dot contact length).

【００２８】ステップＳ３３では表の中に文字列があれ
ば、ステップＳ３４で周辺分布により文字列を抽出し、
文字フィールドを１行毎に切り出し傾き補正をする。文
字列がない場合にはステップＳ３５に移行する。If there is a character string in the table in step S33, the character string is extracted from the marginal distribution in step S34,
The character field is cut line by line and the inclination is corrected. If there is no character string, the process proceeds to step S35.

【００２９】図８は文字が傾いている場合の補正（なな
め補正）の場合であり、（ａ）は右下がりの場合を示
す。右下がりの場合には座標を正の方向（図中の矢印の
方向）に原点０を中心として回転移動（座標変換）して
ｘ，ｙ座標を補正する。また、右下がりの場合には座標
を負の方向（図中の矢印の方向）に原点０を中心として
回転移動してｘ，ｙ座標を補正する。原稿が傾いている
場合にも全体として右下がりか左下がりかを判定した後
に、例えば座標（ｘ１，ｙ１）を原点として回転移動し
てｘ，ｙ座標を変換する。FIG. 8 shows a case of correction (licking correction) when a character is inclined, and FIG. 8A shows a case of downward sloping. In the case of downward sloping, the coordinates are rotationally moved (coordinate conversion) about the origin 0 in the positive direction (the direction of the arrow in the drawing) to correct the x and y coordinates. In the case of downward sloping, the coordinates are rotationally moved in the negative direction (the direction of the arrow in the figure) about the origin 0 to correct the x, y coordinates. Even if the document is tilted, it is determined whether the document is on the right or left as a whole, and then, for example, the coordinate (x1, y1) is rotated as the origin to convert the x, y coordinates.

【００３０】ステップＳ３５では表罫線（座標）の切り
出しのみを行い、ステップＳ３６で罫線かどうか周辺分
布の平坦正及び外接矩形の縦横比で判定する。判定の結
果が罫線の場合はステップＳ３７で罫線の切り出しを行
いステップＳ３８に移行する。判定の結果罫線以外の場
合は直ちにステップＳ３８に移行する。In step S35, only the table ruled lines (coordinates) are cut out, and in step S36 it is determined whether or not the ruled line is the flat distribution of the peripheral distribution and the aspect ratio of the circumscribed rectangle. If the determination result is a ruled line, the ruled line is cut out in step S37, and the process proceeds to step S38. If the result of determination is other than a ruled line, the process immediately proceeds to step S38.

【００３１】ステップＳ３８では矩形の大きさにより文
字か否かを判定する。文字の場合はステップＳ３９で周
辺分布により文字列を抽出し、文字フィールドを１行毎
に切り出し傾き補正をしてステップＳ４０に移行する。
文字がない場合は図または写真の場合であり、レイアウ
ト解析を終了しステップＳ４に移行する（図４）。In step S38, it is determined whether the character is a character based on the size of the rectangle. In the case of a character, a character string is extracted from the peripheral distribution in step S39, the character field is cut out line by line, and the inclination is corrected, and the process proceeds to step S40.
The case where there is no character is the case of a figure or a photograph, the layout analysis is ended, and the process proceeds to step S4 (FIG. 4).

【００３２】ステップＳ４０ではレイアウト解析が済ん
でいない指定領域の有無をチェックし、レイアウト解析
が済んでいない指定領域がある場合は次の指定領域の処
理を行うためにステップＳ３２〜Ｓ３９を繰り返す。指
定領域のレイアウト解析が全く済んでいる場合はレイア
ウト解析を終了しステップＳ４に移行する（図４）。レ
イアウト解析結果は図１８に示すようなフォーマットで
ワークメモリ５の所定の領域に保存される。なお、レイ
アウト解析結果の結果の行フィールドの座標はななめ補
正をしていない行座標（元のイメージに忠実な行座標）
である。In step S40, it is checked whether or not there is a designated area for which layout analysis has not been completed, and if there is a designated area for which layout analysis has not been completed, steps S32 to S39 are repeated to process the next designated area. If the layout analysis of the designated area has been completed, the layout analysis is terminated and the process proceeds to step S4 (FIG. 4). The layout analysis result is stored in a predetermined area of the work memory 5 in a format as shown in FIG. The coordinates of the row field that is the result of the layout analysis are the row coordinates that have not been licked (the row coordinates that are faithful to the original image).
Is.

【００３３】ここで、ステップＳ３２〜Ｓ３９について
具体例として、図１１の原稿１１０の場合を例にとる
と、ステップＳ３２で領域Ｆ１が表フィールドかを判定
する。領域Ｆ１は表ではないのでステップＳ３６で罫線
かどうかを判定する。また、領域Ｆ１は罫線ではないの
でステップＳ３８で文字列か否かの判定を行う。領域Ｆ
１には文字列しか存在しないのでステップＳ３９で文字
フィールド１行毎の切り出し及び行毎の傾き補正を行
う。なお、図１６，１７に図１１の原稿１１０を例とす
るレイアウト解析結果等を示す。ここで、図１７はレイ
アウト解析情報（原稿１１０のイメージデータ）であ
り、図１６はレイアウト解析結果（原稿１１０の行座
標、罫線座標）である。Here, as a concrete example of steps S32 to S39, taking the case of the document 110 of FIG. 11 as an example, it is determined in step S32 whether the area F1 is a front field. Since the area F1 is not a table, it is determined in step S36 whether it is a ruled line. Further, since the area F1 is not a ruled line, it is determined in step S38 whether it is a character string. Area F
Since there is only a character string in No. 1, the character field is cut out line by line and inclination is corrected line by line in step S39. Note that FIGS. 16 and 17 show layout analysis results and the like using the document 110 of FIG. 11 as an example. Here, FIG. 17 shows layout analysis information (image data of the original 110), and FIG. 16 shows layout analysis results (row coordinates and ruled line coordinates of the original 110).

【００３４】Ｓ４０の指定領域存在判定の結果領域Ｆ２
がまだ未処理領域のためステップＳ３２に戻り領域Ｆ２
が表フィールドかを判定する。領域Ｆ２は表であるため
ステップＳ３３で表内に文字列があるか否かを判定す
る。この例の場合領域Ｆ２には文字がないと判定され、
ステップＳ３５で罫線座標の切り出しを行う。領域Ｆ２
の罫線座標は次のようになる。Resulting area F2 of the presence judgment of the designated area in S40
Is an unprocessed area, the process returns to step S32 and the area F2
Is a table field. Since the area F2 is a table, it is determined in step S33 whether there is a character string in the table. In this example, it is determined that there are no characters in the area F2,
In step S35, ruled line coordinates are cut out. Area F2
The ruled line coordinates of are as follows.

【００３５】罫線１Ｋ１（Ｋ１Ｘ，Ｋ１Ｙ），Ｋ２（Ｋ２Ｘ，Ｋ２
Ｙ）罫線２Ｋ３（Ｋ３Ｘ，Ｋ３Ｙ），Ｋ４（Ｋ２Ｘ，Ｋ４
Ｙ）罫線３Ｋ１（Ｋ１Ｘ，Ｋ１Ｙ），Ｋ３（Ｋ３Ｘ，Ｋ３
Ｙ）罫線４Ｋ２（Ｋ１Ｘ，Ｋ１Ｙ），Ｋ４（Ｋ４Ｘ，Ｋ４
Ｙ）図４のステップＳ４でＣＰＵ３はレイアウト解析エラー
の有無を判定し、レイアウト解析エラーがある場合には
指定領域（インターフェイス領域）を解放して処理を終
了する（操作者はエラー表示によりブロック３２の領域
設定からやり直す）。レイアウト解析エラーがない場合
にはステップＳ５でＣＰＵ３は表示装置７の画面上に、
例えば、文字列を矩形で囲みレイアウト解析結果を表示
する（図１３）。Ruled Line 1 K1 (K1X, K1Y), K2 (K2X, K2
Y) Ruled line 2 K3 (K3X, K3Y), K4 (K2X, K4
Y) Ruled line 3 K1 (K1X, K1Y), K3 (K3X, K3
Y) Ruled line 4 K2 (K1X, K1Y), K4 (K4X, K4
Y) In step S4 of FIG. 4, the CPU 3 determines whether or not there is a layout analysis error, and if there is a layout analysis error, releases the designated area (interface area) and terminates the process (the operator displays a block 32 due to an error display). Start again from the area setting). If there is no layout analysis error, the CPU 3 displays on the screen of the display device 7 in step S5.
For example, the character string is enclosed in a rectangle and the layout analysis result is displayed (FIG. 13).

【００３６】次に、ステップＳ６〜Ｓ１４で文字の認識
処理を実行する。ステップＳ６ではＣＰＵ３は文字認識
処理実行の可否を問うメッセージを表示装置７に表示
し、操作者による支持入力を促す。操作者はマウスによ
り入力指示を行う。実行不可の場合は指定領域（インタ
ーフェイス領域）を解放して処理を終了する。実行可の
場合はステップＳ７でＣＰＵ３は格納装置６に格納され
ている認識辞書ファイルから認識辞書データをワークメ
モリ５にロードし（ブロック３４）、ステップ８でロー
ドエラーの有無をチェックした後にステップＳ９で表示
装置７のレイアウト解析画面を認識画面に切換えステッ
プＳ１０に移行する。例えば、図１２の領域Ｆ１の行Ｋ
１の部分を反転表示する。Next, in steps S6 to S14, character recognition processing is executed. In step S6, the CPU 3 displays a message asking whether to execute the character recognition process on the display device 7 and prompts the operator to input the support. The operator gives an input instruction with a mouse. If it cannot be executed, the designated area (interface area) is released and the processing is terminated. If it is executable, the CPU 3 loads the recognition dictionary data from the recognition dictionary file stored in the storage device 6 into the work memory 5 in step S7 (block 34), checks whether or not there is a load error in step 8, and then executes step S9. Then, the layout analysis screen of the display device 7 is switched to the recognition screen and the process proceeds to step S10. For example, row K of area F1 in FIG.
The part 1 is highlighted.

【００３７】ステップＳ１０ではＣＰＵ３は未認識の領
域があるか否かを判定する。未認識の領域がある場合は
ステップ１１に移行し、未認識の領域がない場合はステ
ップＳ１７に移行してワークエリア中の認識辞書データ
を解放しステップＳ１８で指定領域を解放して次のブロ
ックに移る。In step S10, the CPU 3 determines whether or not there is an unrecognized area. If there is an unrecognized area, the process proceeds to step 11, and if there is no unrecognized region, the process proceeds to step S17 to release the recognition dictionary data in the work area and release the designated area in step S18 to move to the next block. Move on to.

【００３８】ステップＳ１１ではＣＰＵ３は認識処理を
続行するか否かの判定を行い続行しない場合にはステッ
プＳ１７に移行してワークエリア中の認識辞書データを
解放しステップＳ１８で指定領域を解放して次のブロッ
クに移る。ステップＳ１２ではＣＰＵ３は認識対象のイ
メージ行を表示装置７に表示する。ステップＳ１３では
ＣＰＵはレイアウト結果を基に図６のフローチャートに
基づく文字認識手段を実行して文字認識を行う。具体例
として図１３の領域Ｆ１の行Ｋ１の部分の文字認識につ
いて図６を基に説明する。In step S11, the CPU 3 determines whether or not to continue the recognition process. If not, the process proceeds to step S17, the recognition dictionary data in the work area is released, and the designated area is released in step S18. Move on to the next block. In step S12, the CPU 3 displays the image line to be recognized on the display device 7. In step S13, the CPU executes character recognition means based on the layout result based on the flowchart of FIG. 6 to perform character recognition. As a specific example, the character recognition of the portion of the row K1 of the area F1 of FIG. 13 will be described with reference to FIG.

【００３９】ステップＳ１３１でＣＰＵは行Ｌ１から文
字を切り出し文字数（この例では１ステップＳ１３１で
ＣＰＵは行Ｌ１から文字を切り出し文字数（この例では
１０個；図１３では４個目以降は具体的文字の表示を説
明上省略（スペース）表示してある）、文字座標Ｃ１
（Ｃ１Ｘ１，Ｃ１Ｙ１），Ｃ２（Ｃ２Ｘ１，Ｃ２Ｙ
１），Ｃ２（Ｃ２Ｘ１，Ｃ２Ｙ１），……Ｃ１０（Ｃ１
０Ｘ１，Ｃ１０Ｙ１）、及び文字の形（全角；１０字
分）をワークメモリ５に記憶する。In step S131, the CPU cuts out characters from the line L1 (1 in this example, the CPU cuts out characters from the line L1 in step S131 (10 in this example; in FIG. 13, the fourth and subsequent characters are specific characters). Is displayed for the sake of explanation (space is displayed), the character coordinate C1
(C1X1, C1Y1), C2 (C2X1, C2Y
1), C2 (C2X1, C2Y1), ... C10 (C1
0X1, C10Y1) and the character shape (full-width; 10 characters) are stored in the work memory 5.

【００４０】次に、ステップＳ１３２で文字座標Ｃ１か
ら順に１文字ずつＣ１０まで前処理（大きさの正規化、
位置の正規化、ノイズ除去、ドット欠落の修正）を、ス
テップＳ１３３で１文字の認識（前処理したイメージデ
ータから特徴量を抽出し、認識辞書と比較）を行い文字
コードと認識精度を得る。Next, in step S132, preprocessing (normalization of size,
Position normalization, noise removal, and dot loss correction) are performed in step S133 to recognize one character (extract a feature amount from preprocessed image data and compare it with a recognition dictionary) to obtain a character code and recognition accuracy.

【００４１】ステップＳ１３４では認識結果が所定の閾
値外の場合に認識結果をリジェクトしステップＳ１３１
で次の行（Ｌ２）の認識処理に移行する。リジェクトさ
れた文字座標は例えば、“黒塗りの四角文字”としてコ
ード化される。In step S134, if the recognition result is outside the predetermined threshold, the recognition result is rejected and step S131
Then, the process proceeds to the recognition process of the next line (L2). The rejected character coordinates are coded, for example, as “black square characters”.

【００４２】ステップＳ１３２では原稿行（Ｌ１）に未
認識文字がある場合はステップＳ１３２に戻り次の文字
の認識処理（前処理、文字認識）を繰り返す。文字フィ
ールド１行（具体例では行Ｌ１）について図６の文字認
識が終わると図４のステップＳ１４でＣＰＵ３は認識エ
ラーの有無を判定し認識エラーがない場合はステップＳ
１５に移り図７のフローチャートに示すようなデータ変
換手段を実行してデータ変換処理を行う。なお、図１７
は認識結果の例（原稿１１０を例とする文字コード、文
字座標）である。認識結果は図１７に示すようなフォー
マットでワークメモリ５に保存されている。If there is an unrecognized character in the original line (L1) in step S132, the process returns to step S132 to repeat the recognition process (preprocessing, character recognition) of the next character. When the character recognition of FIG. 6 is completed for one line of the character field (line L1 in the specific example), the CPU 3 determines in step S14 of FIG. 4 whether there is a recognition error.
15, the data conversion processing is performed by executing the data conversion means as shown in the flowchart of FIG. Note that FIG.
Is an example of a recognition result (a character code and a character coordinate taking the document 110 as an example). The recognition result is stored in the work memory 5 in the format shown in FIG.

【００４３】データ変換処理は図７のステップＳ１５１
でステップＳ３のレイアウト解析結果とステップＳ１３
の文字認識の結果を突き合わせ、ステップＳ１５２，Ｓ
１５３で文書編集装置（図示せず）で編集可能なデータ
（文書編集装置編集データ、図１８参照）に変換する。
変換された文書編集装置編集データは認識データファイ
ルに認識データとして格納すると共に表示装置７にその
認識データを表示する。The data conversion process is step S151 in FIG.
Then, the layout analysis result of step S3 and step S13
Results of character recognition are compared, and steps S152, S
At 153, the data is converted into data (document editing device edit data, see FIG. 18) that can be edited by the document editing device (not shown).
The converted document editing device edit data is stored in the recognition data file as recognition data and the display device 7 displays the recognition data.

【００４４】具体的には、罫線座標については罫線情報
を作成し、行座標については行フィールドの算出、改行
コードの付加を行い、文字座標については文字制御コー
ド（文字ポイント）、文字ピッチの算出及びピッチコー
ドの付加、スペースコードの付加等を行う。なお、文字
ポイントの算出は文字座標から大きさを算出することで
行う。Specifically, ruled line information is created for ruled line coordinates, line fields are calculated for line coordinates, line feed codes are added, and character control codes (character points) and character pitches are calculated for character coordinates. Also, the pitch code and space code are added. The calculation of the character points is performed by calculating the size from the character coordinates.

【００４５】認識データは図１８に示すように基本的に
は文書データのテキスト情報（下記参照）とグラフィッ
ク罫線情報（下記参照）からなる簡単な構造を有してい
る。（１）テキスト情報文字データ長文字ピッチ行フィールド制御コード、文字コード…（組として繰り返す）（２）罫線情報罫線データ長罫線種、頁、罫線コード、開始座標（Ｘ，Ｙ）、終了座
標（Ｘ，Ｙ）…（組として繰り返す）As shown in FIG. 18, the recognition data basically has a simple structure composed of text information (see below) of document data and graphic ruled line information (see below). (1) Text information Character data length Character pitch Line field Control code, character code ... (Repeated as a set) (2) Ruled line information Ruled line data length Ruled line type, page, ruled line code, start coordinate (X, Y), end coordinate ( (X, Y) ... (repeat as a group)

【００４６】ステップＳ１５で認識文字を表示させる場
合は、行は認識行座標のｙ座標を使用し桁は認識文字座
標のＸ座標を使用する。認識文字座標は図１０（ａ）の
下段のように、文字のレイアウト結果の密接矩形の座標
をななめ補正して文字認識したものであり、上記認識文
字座標（図１０（ａ）参照）をそのまま認識結果の表示
座標として用いるとＹ座標が文字毎に異なることとな
り、行がバラバラになる可能性があるのでＣＰＵ３は行
毎に全ての認識文字座標がそのの収まる矩形を算出し、
それを認識行座標とする。When the recognition character is displayed in step S15, the line uses the y coordinate of the recognition line coordinate and the digit uses the X coordinate of the recognition character coordinate. As shown in the lower part of FIG. 10A, the recognized character coordinates are obtained by performing character recognition by licking and correcting the coordinates of the close rectangle of the layout result of the characters, and the recognized character coordinates (see FIG. 10A) are not changed. If used as the display coordinates of the recognition result, the Y coordinate will be different for each character, and the lines may be disjointed. Therefore, the CPU 3 calculates a rectangle in which all the recognized character coordinates are included in each line,
Let it be the recognition line coordinate.

【００４７】ＣＰＵ３は１行分のデータ変換、認識デー
タの格納、及び表示が終わるとステップＳ１６で次の行
を処理するためステップＳ１１〜Ｓ１５を繰り返す。な
お、具体例で文字領域Ｆ１のデータ変換が終了すると表
（罫線）領域Ｆ２の処理に移るがこの場合は表内に文字
列がないので文字認識（Ｓ１３，Ｓ１４）をすることな
くレイアウト結果のみに基づきステップＳ１５で罫線情
報を作成し、認識データファイルに認識データ（文書編
集装置編集データ：図１８参照）として格納する。読み
取った原稿の全ての指定領域の処理が終了するとステッ
プＳ１７で認識辞書を解放し、ステップＳ１８で指定領
域の解放を行って次のブロックの処理に移行する。When the data conversion for one line, the storage of the recognition data, and the display are completed, the CPU 3 repeats steps S11 to S15 to process the next line in step S16. In the specific example, when the data conversion of the character area F1 is completed, the process proceeds to the table (ruled line) area F2. In this case, since there is no character string in the table, only the layout result is obtained without performing character recognition (S13, S14). Based on the above, ruled line information is created in step S15 and stored as recognition data (document editing device edit data: see FIG. 18) in the recognition data file. When the processing of all the designated areas of the read document is completed, the recognition dictionary is released in step S17, the designated area is released in step S18, and the processing of the next block is performed.

【００４８】なお、図３でブロック３３の認識処理の終
了後、認識データ（文書編集装置編集データ）を表示装
置７に逐次表示し（図１４の表示例参照）、図１の文字
認識装置では引き続いて修正処理（リジェクト文字の変
更或いは削除、文字の追加等）、を行うが、図２の文字
認識装置では次の原稿の読取りのためにブロック３１に
戻りブロック３１〜３３の処理を繰り返す（修正は全て
の原稿の読取り及び認識データの格納終了後、一括して
別途行う）ことができる。また、図２の一括処理（バッ
チ処理）の場合はブロック３１で読み取ったイメージデ
ータを格納装置に格納する。After the recognition process of block 33 in FIG. 3 is completed, the recognition data (document editing device edit data) is sequentially displayed on the display device 7 (see the display example of FIG. 14), and the character recognition device of FIG. Subsequently, correction processing (change or deletion of rejected characters, addition of characters, etc.) is performed, but the character recognition apparatus of FIG. 2 returns to block 31 to read the next document and repeats the processing of blocks 31 to 33 ( The correction can be collectively performed separately after reading all the originals and storing the recognition data). Further, in the case of batch processing (batch processing) of FIG. 2, the image data read in block 31 is stored in the storage device.

【００４９】以上説明したように第１の発明の文字認識
装置によれば、ワードプロセッサ等の文書編集装置で編
集可能な認識データ（文書編集装置編集データ）に変換
された認識データを得ることができるので、読み取った
書式のままのレイアウト再現（罫線、文字の大きさ、文
字のピッチ等の再現）が可能となる。また、第２の発明
の文字認識装置によれば、認識データの修正をワードプ
ロセッサ等の文書編集装置で別途修正できる。As described above, according to the character recognition device of the first invention, it is possible to obtain the recognition data converted into the recognition data (document editing device editing data) which can be edited by the document editing device such as a word processor. Therefore, it is possible to reproduce the layout in the read format (reproduction of ruled lines, character size, character pitch, etc.). Further, according to the character recognition device of the second invention, the correction of the recognition data can be separately corrected by the document editing device such as a word processor.

【００５０】[0050]

【発明の効果】以上説明したように第１の発明の文字認
識装置によれば、ワードプロセッサ等の文書編集装置で
編集可能なデータ（文書編集装置編集データ）に変換さ
れた認識データを得ることができるので、読み取った書
式のままのレイアウト再現（罫線、文字の大きさ、文字
のピッチ等の再現）が可能となる。したがって、従来の
ように罫線を付加したり、文字の大きさを再編集するた
めの編集作業が不要となる。また、第２の発明の文字認
識装置によれば、認識データの修正をワードプロセッサ
等の文書編集装置で別途修正できるので、大量の文書
（例えば、図書）等のデータベース化などの場合に修正
作業を複数人に分散でき、作業期間の短縮に寄与し得
る。As described above, according to the character recognition device of the first invention, it is possible to obtain the recognition data converted into the data editable by the document editing device such as a word processor (document editing device editing data). As a result, it is possible to reproduce the layout in the read format (reproduction of ruled lines, character size, character pitch, etc.). Therefore, it is not necessary to add ruled lines and edit the size of characters again as in the conventional case. Further, according to the character recognition device of the second invention, since the correction of the recognition data can be separately corrected by the document editing device such as a word processor, it is necessary to perform the correction work in the case of making a database of a large number of documents (for example, books). It can be distributed to multiple people and can contribute to shortening the work period.

[Brief description of drawings]

【図１】本発明の文字認識装置の一実施例の構成図であ
る。FIG. 1 is a configuration diagram of an embodiment of a character recognition device of the present invention.

【図２】本発明の文字認識装置の他の実施例の構成図で
ある。FIG. 2 is a configuration diagram of another embodiment of the character recognition device of the present invention.

【図３】文字読取り装置の動作ブロック図である。FIG. 3 is an operation block diagram of the character reading device.

【図４】本発明の文字認識装置の認識動作を示すフロー
チャートである。FIG. 4 is a flowchart showing a recognition operation of the character recognition device of the present invention.

【図５】図４のレイアウト解析の詳細フローチャートで
ある。5 is a detailed flowchart of the layout analysis of FIG.

【図６】図４の文字認識の詳細フローチャートである。6 is a detailed flowchart of the character recognition of FIG.

【図７】図４のデータ変換の詳細フローチャートであ
る。FIG. 7 is a detailed flowchart of the data conversion of FIG.

【図８】文字の傾き補正の説明図である。FIG. 8 is an explanatory diagram of the inclination correction of a character.

【図９】原稿の傾き補正の説明図である。FIG. 9 is an explanatory diagram of document skew correction.

【図１０】認識文字座標及び認識行座標の説明図であ
る。FIG. 10 is an explanatory diagram of recognized character coordinates and recognized line coordinates.

【図１１】読み取る原稿の例である。FIG. 11 is an example of a document to be read.

【図１２】領域指定の例である。FIG. 12 is an example of area designation.

【図１３】レイアウト解析結果の表示例である。FIG. 13 is a display example of a layout analysis result.

【図１４】文書編集装置編集データの表示例である。FIG. 14 is a display example of edit data of a document editing device.

【図１５】レイアウト解析情報の例である。FIG. 15 is an example of layout analysis information.

【図１６】レイアウト解析結果の例である。FIG. 16 is an example of a layout analysis result.

【図１７】認識結果の例である。FIG. 17 is an example of a recognition result.

【図１８】データ変換された文書編集装置編集データの
例である。FIG. 18 is an example of the data edited by the document editing apparatus.

[Explanation of symbols]

１イメージスキャナ２イメージメモリ３ＣＰＵ４格納装置５ワークメモリ６認識辞書ファイル７表示装置８入力装置 1 Image Scanner 2 Image Memory 3 CPU 4 Storage Device 5 Work Memory 6 Recognition Dictionary File 7 Display Device 8 Input Device

Claims

[Claims]

1. An image reader for reading a document including a table of written or printed characters and / or ruled lines and outputting it as image data, an image memory for storing the image data, and a recognition process by a recognition processing means. Character recognition having a processing device, a recognition dictionary data storage device that stores recognition dictionary data used for character recognition, a storage device that stores recognition data obtained by the recognition process, and an input device that inputs operation information. In the apparatus, the recognition processing means analyzes the layout of the designated area for each designated area designated by the area designating means, the area designating means designating the area of the image data by the input device, and a layout analysis result. A layout analysis means for obtaining the specified area specified by the area specifying means. Character recognition means for recognizing a character string in a character field based on the recognition dictionary data to obtain a character recognition result, and combining the character recognition result and the layout analysis result to add format information and recognizing as a format document. A character recognition device comprising: a data conversion unit that creates data.

2. The character recognition device according to claim 1, wherein
When the layout analysis means determines the field type of the designated area, and the field type is a ruled line or a table field,
A ruled line cutting-out means for cutting out a ruled line; a first character string cutting-out means for cutting out a character string from the table field when the field type is a table field containing a character string; and a character field when the field type is a character field A second character string cutout unit for cutting out a character string from the character recognition device.

3. The character recognition device according to claim 1, wherein
The layout analysis unit detects a character or the inclination of the entire document, a field type determination unit that determines the field type of the designated area, and a ruled line cutting unit that cuts out a ruled line when the field type is a table field. When the field type is a table field including a character string, a first character string cutout unit that cuts out a character string from the table field and cuts out a character string for each line from the character string, and the field type is a character field At this time, a second character string cutout unit that cuts out a character string from the character field and a character string for each line from the character field, and a tilt correction unit that corrects the tilt for each line of the character string are provided. A character recognition device having.

4. The character recognition device according to claim 1,
The recognition data that can be edited as a format document includes a. Ruled line information which is a repetition of a set of ruled line data and at least a set of individual ruled line information including a ruled line type code, a start coordinate, and an end coordinate, and b. A character recognition device comprising: a character data length, a character pitch, a code indicating a line spacing, and at least character information including a repetition of a set of a control code and a character code.

5. The character recognition device according to claim 1, further comprising a recognition data storage means for storing the recognition data in a storage device.

6. The character recognition device according to claim 1, further comprising a recognition data storage means for storing recognition data in a storage device, and an image data storage means for storing image data in the storage device. A character recognition device comprising: