JPH08335248A

JPH08335248A - Document reader

Info

Publication number: JPH08335248A
Application number: JP7164681A
Authority: JP
Inventors: Tetsuo Nakamura; 哲夫中村; Kiyoshi Ishihara; 清志石原
Original assignee: Oki Electric Industry Co Ltd
Current assignee: Oki Electric Industry Co Ltd
Priority date: 1995-06-07
Filing date: 1995-06-07
Publication date: 1996-12-17

Abstract

PURPOSE: To provide a document reader that does not perform unnecessary re-recognition. CONSTITUTION: A layout data confirmation and correction part 10a sets all the recognition flags of the layout data ON when confirmation and correction are performed for the first time, and otherwise sets ON only the flags of the corrected parts. A recognition part 5 recognizes only the parts whose recognition flags are ON. A recognition data confirmation and correction part 10b instructs the layout data confirmation and correction part 10a to confirm and correct the layout data when the layout data need to be corrected after the recognition processing. Therefore, in the second and succeeding passes only the corrected parts of the layout data are recognized, and parts that have already been recognized correctly are not recognized again.

Description

Detailed Description of the Invention

[0001]

[Field of Industrial Application] The present invention relates to a document reading apparatus that reads characters and images (non-character graphics, photographs, ruled lines, and the like) recorded on a document.

[0002]

[Prior Art] Document reading apparatuses for documents in which characters and images are mixed have been developed. These apparatuses perform the following processing. The layout, that is, the positions on the document of the data areas to be recognized, is analyzed, and the operator confirms and corrects the result. Character recognition and the like are then performed according to the confirmed and corrected layout data, and the operator confirms and corrects that result as well. If an error in the layout analysis result is found while the recognition result is being confirmed, processing returns to the confirmation and correction of the layout analysis, the layout data is corrected, and recognition is performed again on the basis of the corrected layout data (see, for example, Japanese Examined Patent Publication No. Hei 4-73192).

[0003]

[Problems to Be Solved by the Invention] In the conventional document reading apparatus described above, however, layout data that has not changed since the previous recognition cannot be distinguished from corrected layout data when re-recognition is performed. As a result, re-recognition repeats the same recognition on the uncorrected parts, which is redundant processing. In addition, recognition results that the operator had previously corrected are recognized again and revert to their pre-correction state, so the same corrections have to be made a second time.

[0004] For these reasons, a document reading apparatus that performs no wasteful processing during re-recognition has been desired.

[0005]

[Means for Solving the Problems] To solve the above problems, the present invention adopts the following configuration. That is, the document reading apparatus of the present invention comprises: a layout analysis unit that analyzes, from the image data of a document, the layout of the positions on the document of the data areas to be recognized, and obtains layout data for the document; a layout data confirmation/correction unit that displays the layout data obtained by the layout analysis unit superimposed on the image data of the document on a display unit, accepts confirmation and correction of the layout data, holds a recognition flag for each area, and, when the confirmation/correction processing has been performed two or more times, sets the recognition flag of each area so processed to a state different from that of the areas on which no confirmation/correction was performed, and outputs the layout data with these recognition flags attached; a recognition unit that, on the basis of the recognition flags of the layout data, recognizes all areas, including character recognition, in the first recognition of the target document, and in subsequent recognitions recognizes only the areas corrected by the layout data confirmation/correction unit; and a recognition data confirmation/correction unit that causes the display unit to present a selection of whether the layout data needs to be corrected after the recognition processing by the recognition unit, and, when correction is designated as necessary, instructs the layout data confirmation/correction unit to confirm and correct the layout data.

[0006]

[Operation] In the document reading apparatus of the present invention, the layout data confirmation/correction unit performs confirmation/correction processing on the layout data obtained by the layout analysis unit. When this processing is performed for the second or a later time, the unit sets the recognition flag of each area so processed to a state different from that of the areas on which no confirmation/correction was performed, and outputs layout data containing these recognition flags. On the basis of the recognition flags, the recognition unit recognizes all areas in the first recognition and, in the second and subsequent recognitions, recognizes only the areas corrected by the layout data confirmation/correction unit. The recognition data confirmation/correction unit prompts for a selection of whether the layout data needs to be corrected after recognition and, when correction is designated as necessary, instructs the layout data confirmation/correction unit to perform confirmation and correction. As a result, when recognition is performed again, only the parts whose layout data was corrected are recognized, and parts that have already been recognized correctly are not processed again.

[0007]

[Embodiments] Embodiments of the present invention will now be described in detail with reference to the drawings. The first embodiment is described first.

[0008] <<Configuration of the First Embodiment>> FIG. 1 is a block diagram of the document reading apparatus of the present invention. The apparatus shown comprises an image input unit 1, an image memory 2, a layout analysis unit 3, a layout memory 4, a recognition unit 5, a recognition memory 6, an output unit 7, a display unit 8, an operation unit 9, and a control unit 10. The image input unit 1 through the output unit 7 and the control unit 10 are connected to one another via a data bus 11, and the image input unit 1, the layout analysis unit 3, the recognition unit 5, the output unit 7, and the control unit 10 are connected via a control bus 12.

[0009] The image input unit 1 comprises an image scanner or the like. It optically scans the document to be read, converts the characters and images recorded on the document into an image signal by photoelectric conversion, and further converts this image signal into binary image data. The image memory 2 stores the binary image data output from the image input unit 1. The layout analysis unit 3 divides the image data in the image memory 2 into areas of characters and images by a technique such as using the peripheral distribution (projection profile) of black pixels, and classifies each area as a character area or an image area from its geometric features and the like. For each character area, it cuts out lines by a technique such as using the peripheral distribution of black pixels, and cuts out characters from each line. It also analyzes the reading order of the character areas, lines, and characters. The layout data consists of these areas, lines, and characters, together with their reading order and the like.

[0010] The layout memory 4 stores the layout data output from the layout analysis unit 3. The recognition unit 5 has a recognition dictionary, which records the recognition features of standard characters, and a word dictionary. It reads the layout data from the layout memory 4, performs character recognition according to that layout data by pattern matching or the like using the recognition dictionary, and then performs post-processing, for example with the word dictionary and grammar rules, so that the text is recognized correctly as a document. The recognition data consists of character codes, candidate character codes and their respective confidence levels, and the post-processing results, namely character strings and candidate words with their confidence levels. The recognition memory 6 stores the recognition data output from the recognition unit 5.

[0011] The output unit 7 reads the image data from the image memory 2, the layout data from the layout memory 4, and the recognition data from the recognition memory 6, converts the character strings in the recognition data and the image data into document data of an arbitrary format according to the layout data, and outputs the result to an output device within the output unit. This output device is, for example, a magnetic disk device that stores the document data or a printer that prints it. The display unit 8 comprises a CRT or the like, and the operation unit 9 comprises a keyboard, a mouse, and the like. The display unit 8 and the operation unit 9 serve as input/output devices that provide the interface with the operator: issuing instructions to start processing in the image input unit 1, the layout analysis unit 3, and the recognition unit 5; displaying the processing results (image data, layout data, and recognition data); and confirming and correcting those results.

[0012] The control unit 10 controls the overall operation of the units and memories described above, and includes a layout data confirmation/correction unit 10a and a recognition data confirmation/correction unit 10b. The layout data confirmation/correction unit 10a displays the layout data obtained by the layout analysis unit 3 superimposed on the image data from the image input unit 1 on the display unit 8. When the operator enters confirmation or correction details from the operation unit 9, it accepts the input and performs the corresponding processing. When the confirmation/correction processing is the second or a later pass, it performs the confirmation and correction on the basis of the previous layout data, attaches recognition flags that identify whether or not each part was corrected again, and outputs the result.

[0013] The recognition data confirmation/correction unit 10b causes the display unit 8 to present a selection of whether the layout data needs to be corrected after the recognition processing by the recognition unit 5, and, when the operator designates from the operation unit 9 that correction is necessary, instructs the layout data confirmation/correction unit 10a to confirm and correct the layout data. The layout data confirmation/correction unit 10a and the recognition data confirmation/correction unit 10b are implemented by a dedicated processor, a program, or the like.

[0014] <<Operation of the First Embodiment>> FIG. 2 is a flowchart showing the operation of the document reading apparatus of the present invention. First, an image is input (step S1). The image input unit 1 scans the document to be read, converts the characters and images recorded on it into an image signal by photoelectric conversion, and converts this image signal into binary image data. The image data is stored in the image memory 2 and displayed on the display unit 8.

[0015] When image input is complete, layout analysis is performed (step S2). Layout analysis begins with area analysis, which proceeds as follows. The layout analysis unit 3 extracts the circumscribing frames (area frames) of the character and image areas from the image data in the image memory 2 using the peripheral distribution of black pixels, and classifies each area as a character area or an image area from its geometric features. For each character area, it uses the peripheral distribution of the black pixels to determine whether the text is written vertically or horizontally. If vertical writing predominates, the character areas are assigned a reading order running from the upper right to the lower left; if horizontal writing predominates, they are assigned a reading order running from the upper left to the lower right. The area frames, area types, vertical/horizontal writing of the character areas, reading order, and so on make up the area data.
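
The reading-order rule for character areas can be sketched as a sort over area frames. This is only an illustration under stated assumptions: the `Area` type, the coordinate convention (x increasing to the right, y increasing downward), and the sort keys are hypothetical. The text gives only the overall direction (upper right to lower left, or upper left to lower right) and does not say how areas at similar positions are ordered.

```python
from dataclasses import dataclass


@dataclass
class Area:
    """Hypothetical character-area frame: bounding box plus writing direction."""
    name: str
    left: int
    top: int
    right: int
    bottom: int
    vertical: bool  # True for vertical writing, False for horizontal


def assign_reading_order(areas):
    """Return the areas sorted into an assumed reading order.

    If vertical areas outnumber horizontal ones, the order runs from the
    upper right to the lower left (right edge descending, then top ascending).
    Otherwise it runs from the upper left to the lower right
    (top ascending, then left ascending).
    """
    n_vertical = sum(a.vertical for a in areas)
    if n_vertical > len(areas) - n_vertical:
        return sorted(areas, key=lambda a: (-a.right, a.top))
    return sorted(areas, key=lambda a: (a.top, a.left))
```

A real implementation would probably group areas into columns or bands before sorting; the simple keys here are chosen only to make the two directions visible.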

[0016] Next, line segmentation is performed on the character areas in this area data. First, the circumscribing frame of each line (line frame) is extracted using the peripheral distribution of the black pixels in the character area. Then, according to the vertical/horizontal writing of the character area determined in the area analysis, the lines are assigned a reading order from right to left for vertical writing and from top to bottom for horizontal writing. The line frames, reading order, and so on make up the line data. Finally, character segmentation is performed on this line data. Here too, the circumscribing frame of each character (character frame) is first extracted using the peripheral distribution of the black pixels in the line. Then, according to the vertical/horizontal writing of the character area, the characters are assigned a reading order from top to bottom for vertical writing and from left to right for horizontal writing. The character frames, reading order, and so on make up the character data. The area, line, and character data together make up the layout data.
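
A minimal sketch of line segmentation, assuming that the "peripheral distribution of black pixels" is the count of black pixels per row (or per column), that is, a projection profile. The function names and the rule that a zero count separates two lines are assumptions made for illustration; the text does not specify thresholds or noise handling.

```python
def find_runs(profile):
    """Return (start, end) pairs, end exclusive, of consecutive nonzero entries."""
    runs, start = [], None
    for i, count in enumerate(profile):
        if count > 0 and start is None:
            start = i
        elif count == 0 and start is not None:
            runs.append((start, i))
            start = None
    if start is not None:
        runs.append((start, len(profile)))
    return runs


def segment_lines(image, vertical=False):
    """Split a binary image (list of rows, 1 = black) into line extents in reading order.

    Horizontal writing: project onto the y axis and read lines top to bottom.
    Vertical writing: project onto the x axis and read lines right to left.
    """
    if vertical:
        profile = [sum(col) for col in zip(*image)]
        return list(reversed(find_runs(profile)))
    profile = [sum(row) for row in image]
    return find_runs(profile)
```

The same `find_runs` helper could be applied to the other axis within each line to cut out character frames, read top to bottom for vertical writing and left to right for horizontal writing.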

[0017] When line and character segmentation has been completed for all character areas, the resulting layout data is stored in the layout memory 4.

[0018] When layout analysis is complete, the layout data is confirmed and corrected (step S3). In this processing, the layout data confirmation/correction unit 10a reads the image data from the image memory 2 and the layout data from the layout memory 4, and displays the layout data superimposed on the image data on the display unit 8. The operator examines this display and uses the operation unit 9 to correct any part of the layout data that needs correction.

[0019] FIG. 3 is an explanatory diagram of the layout data confirmation/correction processing. In FIG. 3, G1 is the image data of the document to be read, and G2 is the confirmation/correction screen for the area data. In the drawing, the character area frames are drawn slightly larger than the character strings so that labels such as "1 horizontal", which indicate the reading order G24 and the vertical/horizontal writing classification G26 of a character area, do not overlap character strings such as "document" inside the character area frame G21. In practice, each frame fits closely around the outside of its character string.

[0020] First, the character area frames G21 and G22, the image area frame G23 (character area frames and image area frames are distinguished by display color), the reading orders G24 and G25 of the character areas, and the vertical/horizontal writing classifications G26 and G27 of the character areas are displayed superimposed on the image data G1. The operator examines this display and uses the operation unit 9 to confirm and correct the position, size, and type of each area frame, the reading order, the vertical/horizontal writing, and so on. When character area data is corrected, the line and character segmentation data for that character area differ from what they were before the correction, so line and character segmentation is performed again for the corrected character area, and the layout (area, line, and character) data after correction and re-segmentation is stored in the layout memory 4.

[0021] When confirmation and correction of the area data is complete, screen G3 is displayed and the line data is confirmed and corrected. Screen G3 shows the line data confirmation/correction screen for the character area G21. Here, the line frames G31 and G32 and the reading orders G33 and G34 are displayed superimposed on the image data of the character area G21. The operator examines this display and uses the operation unit 9 to confirm and correct the position and size of each line frame, the reading order, and so on. When line data is corrected, the character segmentation data for that line differs from what it was before the correction, so character segmentation is performed again for the corrected line, and the layout (line and character) data after correction and re-segmentation is stored in the layout memory 4. The same line data confirmation and correction is performed for the character area G22.

[0022] When confirmation and correction of the line data is complete, screen G4 is displayed and the character data is confirmed and corrected. Screen G4 shows the character data confirmation/correction screen for the line frame G31. Here, the character frames G41, G42, G43, and G44 and the reading orders G45, G46, G47, and G48 are displayed superimposed on the image data of the line frame G31. The operator examines this display and uses the operation unit 9 to confirm and correct the position and size of each character frame, the reading order, and so on. The corrected layout (character) data is then stored in the layout memory 4. The same character data confirmation and correction is performed for the line frame G32 and for the lines of the character area G22.

[0023] In the layout data confirmation/correction processing described above, every level of the layout analysis hierarchy (areas, lines, and characters) was processed. If character segmentation produces almost no errors, however, the confirmation/correction processing can end after the areas and lines. Likewise, it can end after the areas alone. In this way only the levels that need confirmation and correction are processed, which reduces the confirmation and correction work.

[0024] When processing moves from confirming and correcting the area data to confirming and correcting the line data, it is also possible, instead of processing everything in reading order as described above, for the operator to designate the character areas whose line data needs confirmation and correction. The same applies when moving from line data to character data. In this way only the parts that need confirmation and correction are processed, which reduces the confirmation and correction work.

[0025] FIG. 4 is an explanatory diagram showing an example of the structure of the layout data and the recognition data. The layout data follows the hierarchy of the layout analysis processing (area analysis → line segmentation → character segmentation) and is structured as page attributes and reading frame → area attributes and area frame → line attributes and line frame → character attributes and character frame. The attributes differ by level and include the area type, the reading order, the vertical/horizontal writing distinction, the recognition flag, and so on. Each frame is coordinate data or the like representing a circumscribing frame. At the first confirmation and correction of the layout data, all recognition flags are set ON and the layout data is stored in the layout memory 4 in that state (step S4 → step S5). When the layout data is confirmed and corrected again after recognition, only the flags of the corrected parts are set ON, and the parts that were not corrected are stored with their flags OFF (step S4 → step S6).
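
The hierarchy and the flag policy of steps S4 to S6 might be modeled as follows. This is a sketch, not the structure shown in FIG. 4: the `Node` class, its field names, and the `set_flags` helper are hypothetical, and the frame is assumed to be a (left, top, right, bottom) tuple.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    """One level of the layout hierarchy (page, area, line, or character)."""
    kind: str                 # "page", "area", "line", or "char"
    frame: tuple              # circumscribing frame (left, top, right, bottom)
    reading_order: int = 0
    recognize: bool = False   # the recognition flag
    children: list = field(default_factory=list)


def walk(node):
    """Yield the node and all of its descendants, depth first."""
    yield node
    for child in node.children:
        yield from walk(child)


def set_flags(page, first_pass, corrected=()):
    """Apply the flag policy of steps S4 to S6.

    First pass (S5): every flag ON.
    Later passes (S6): ON only for nodes listed in `corrected`, OFF elsewhere.
    """
    corrected_ids = {id(n) for n in corrected}
    for node in walk(page):
        node.recognize = first_pass or id(node) in corrected_ids
```

Keeping the flag on every node, rather than only on areas, lets a later pass target a single line or character, which matches the per-level attributes described above.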

[0026] When confirmation and correction of the layout data is complete, the characters in the image data are recognized, according to the layout data, for the parts whose recognition flags are ON (step S7). In this recognition processing, the recognition unit 5 first applies pattern matching with its recognition dictionary to the characters of the image data, as located by the layout data, and obtains character recognition data (candidate character codes and their respective confidence levels) for each character. It then post-processes the character recognition data using its word dictionary and grammar rules so that the text is recognized correctly as sentences, and creates post-processing data (the resulting character strings, and candidate words with their confidence levels). The character recognition data and the post-processing data are together called the recognition data. When post-processing is complete, the recognition data is stored in the recognition memory 6.
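
A sketch of the character-recognition part of step S7, showing how results from earlier passes (including operator corrections) survive when a flag is OFF. The `CharResult` type, the dictionary layout, and the `classify` callable are assumptions for illustration. The dictionary-based pattern matching and the word and grammar post-processing are not modeled here.

```python
from dataclasses import dataclass, field


@dataclass
class CharResult:
    """Hypothetical per-character recognition data."""
    candidates: list = field(default_factory=list)  # [(character, confidence), ...], best first
    text: str = ""                                   # character currently accepted


def recognize_flagged(chars, results, classify):
    """Run recognition only for characters whose flag is ON.

    chars    : dict of char_id -> {"flag": bool, "image": ...}
    results  : dict of char_id -> CharResult from earlier passes (updated in place)
    classify : callable returning a list of (character, confidence) pairs
    Returns the ids that were actually recognized in this pass.
    """
    done = []
    for char_id, info in chars.items():
        if not info["flag"]:
            continue  # keep the earlier result, including operator corrections
        ranked = sorted(classify(info["image"]), key=lambda c: c[1], reverse=True)
        best = ranked[0][0] if ranked else ""
        results[char_id] = CharResult(candidates=ranked, text=best)
        done.append(char_id)
    return done
```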

[0027] When recognition is complete, the recognition data confirmation/correction unit 10b first sets the recognition flags OFF (step S8) and then confirms and corrects the recognition data (step S9). In this processing, it reads the recognition data from the recognition memory 6, the layout data from the layout memory 4, and the image data from the image memory 2, and displays the recognition data confirmation/correction screen on the display unit 8.

[0028] FIG. 5 shows this recognition data confirmation/correction screen. In FIG. 5, R1 is the recognition data confirmation/correction screen. R2 is a text screen that displays the post-processed character string R21; the character to be confirmed or corrected is indicated with the mouse cursor R22, and the indicated character is shown in reverse video. In the figure, the character "読" ("read") enclosed in a frame represents the reverse-video state. R3 is a character image screen that displays the character image R31 of the character indicated by the mouse cursor R22, with its character frame R32 superimposed. R4 is a candidate screen that displays the candidate characters (or candidate words) R41 for the character indicated by the mouse cursor R22. The operator examines this display, compares the character string R21 with the character image R31 to check the character string, and, if there is an error, confirms and corrects the recognition data by, for example, replacing the character with a candidate character R41 or entering the correct character from the operation unit 9.

[0029] Next, the recognition data confirmation/correction unit 10b determines whether the layout data needs to be corrected (step S10). FIG. 6 is an explanatory diagram of this determination. In FIG. 6 as well, the character area frames are drawn slightly larger than the character strings so that labels such as "1 horizontal" do not overlap character strings such as "document" inside the frames; in practice, each frame fits closely around the outside of its character string. For example, as shown in FIG. 6(b), area analysis may have mistaken a character area for an image area (area F23, which is a character area, was classified as an image area). Or, as shown in FIG. 6(c), two lines may have been mistakenly cut out as one (a portion consisting of two lines was cut out as the single line F31) and recognized as such.

[0030] In such cases, correcting the recognition data would involve too much work, so the operator judges whether any part needs to be re-recognized after correcting the layout data. That is, the recognition data confirmation/correction unit 10b displays on the display unit 8 a screen for selecting whether the layout data needs to be corrected, and the operator makes the designation from the operation unit 9. For example, when a character area has been classified as an image area, correcting the recognition data would require entering the characters in that area from the operation unit 9, which takes a great deal of time and effort if the area contains many characters. In such a case, therefore, the layout data itself is corrected.

[0031] When correction is designated as necessary in step S10, the process returns to step S3, and the layout data confirmation/correction unit 10a corrects the layout data.

[0032] FIGS. 7 to 9 are explanatory diagrams of the contents of the layout data at the time of the layout data correction determination; they show the contents of the layout data of FIG. 6. FIG. 7 shows the layout data after the need for correction has been determined in step S10 of FIG. 2 and the process has returned to step S3, but before the layout data is corrected. At this point all areas, lines and characters have already been recognized, so their recognition flags have been set to "off" by the processing of step S9.

[0033] Next, when, in correcting the layout data of FIG. 6(b), the image area F23 is changed to a character area and line and character cut-out is performed on that character area, the layout data is corrected from FIG. 7 to FIG. 8. Since this is not the first confirmation/correction of the layout data, the process proceeds to step S6, and the area, line and character recognition flags are set to "on" only for the corrected portion L1. Further, when, in correcting the layout data of FIG. 6(c), line F31 is changed from one line to two lines and character cut-out is performed on these two lines, the layout data is corrected from FIG. 8 to FIG. 9. Similarly, for the corrected portion L2, the line and character recognition flags are set to "on".
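The flag handling for the two corrections above can be sketched as follows. This is a minimal illustration, not the patent's implementation: the class names `Char`, `Line` and `Area`, the helper functions, the variable `area_with_f31` and the character counts are all assumptions made for the example.

```python
from dataclasses import dataclass, field
from typing import List

@dataclass
class Char:
    flag: bool = False          # recognition flag: True = "on"

@dataclass
class Line:
    chars: List[Char] = field(default_factory=list)
    flag: bool = False

@dataclass
class Area:
    kind: str                   # "text" or "image"
    lines: List[Line] = field(default_factory=list)
    flag: bool = False

def cut_lines(char_counts):
    """Freshly cut-out lines and characters, all flagged for recognition."""
    return [Line([Char(True) for _ in range(n)], True) for n in char_counts]

def image_to_text(area, char_counts):
    """Correction like L1: area, line and character flags all go on."""
    area.kind = "text"
    area.lines = cut_lines(char_counts)
    area.flag = True

def split_line(area, index, char_counts):
    """Correction like L2: only the re-cut lines and their characters go on."""
    area.lines[index:index + 1] = cut_lines(char_counts)

# State like FIG. 7: everything already recognized, so every flag is off.
# The character counts below are made up for illustration.
f23 = Area("image")
area_with_f31 = Area("text", [Line([Char() for _ in range(6)]),
                              Line([Char() for _ in range(4)])])

image_to_text(f23, [5, 3])             # FIG. 7 -> FIG. 8
split_line(area_with_f31, 0, [3, 3])   # FIG. 8 -> FIG. 9
```

After these two calls, every flag under `f23` is on, while in `area_with_f31` only the two re-cut lines and their characters are on; the area flag and the untouched line stay off.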

[0034] After that, the recognition processing by the recognition unit 5, the confirmation/correction of the recognition data by the recognition data confirmation/correction unit 10b, and the determination of whether the layout data needs correction are performed again. When it is determined in step S10 that no correction is needed, the process moves to step S11, where the output unit 7 converts and outputs the recognition, layout and image data. The output unit 7 reads the image data from the image memory 2, the layout data from the layout memory 4, and the character strings from the recognition memory 6, converts the character strings and image data into document data of an arbitrary format in accordance with the layout data, and outputs the result to an output device in the output unit 7 such as a magnetic disk device or a printer.
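The overall loop of steps S3 to S11 can be sketched as a small driver. This is a simplified sketch under stated assumptions: regions are flat (no line or character level), the operator's decisions are supplied up front as `correction_rounds`, and the names `read_document`, `regions` and `recognize` are hypothetical.

```python
def read_document(regions, recognize, correction_rounds):
    """Hypothetical driver for the correct / recognize / confirm loop.

    regions: dict mapping a region name to its content
    recognize: callable applied to a region's content
    correction_rounds: list of dicts; each dict holds the regions the
        operator corrected in one pass of layout correction
    """
    flags = {name: True for name in regions}   # first pass: every flag on
    results, log = {}, []
    pending = list(correction_rounds)
    while True:
        # Recognition: only regions whose flag is on; the flag is then
        # cleared, as step S9 does.
        for name in regions:
            if flags[name]:
                results[name] = recognize(regions[name])
                log.append(name)
                flags[name] = False
        # Step S8 (operator confirms / corrects recognition data) is omitted.
        # Step S10: does the layout data still need correction?
        if not pending:
            return results, log                # step S11: hand over for output
        # Back to step S3, then S6: only corrected regions get their flag on.
        corrected = pending.pop(0)
        regions.update(corrected)
        for name in corrected:
            flags[name] = True

regions = {"F21": "title", "F22": "body", "F23": "<image>"}
results, log = read_document(regions, str.upper, [{"F23": "caption"}])
```

Here `log` records which regions were passed to `recognize`: all three on the first pass, then only `F23` after its correction.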

[0035] <<Effects of First Embodiment>> As described above, in the first embodiment, recognition flags are provided in the area attributes, line attributes and character attributes, and when the layout data is corrected again, the recognition flags of corrected portions are set to "on" and those of uncorrected portions to "off", so that corrected and uncorrected portions can be distinguished. Because only characters whose recognition flag is "on" are re-recognized, redundant recognition is avoided (re-recognizing a character whose flag is "off" would only reproduce the previous result) and processing is efficient. Moreover, portions whose layout data needed no correction after the previous confirmation/correction of the recognition data are left untouched, so their recognition data does not revert to its pre-correction state and the operator does not have to correct it a second time.
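The second effect, that operator corrections survive re-recognition, can be illustrated with a short sketch. The dictionary layout, the function name `rerecognize` and the stub recognizer `ocr` (which deliberately misreads one character) are assumptions made for this example only.

```python
def rerecognize(chars, ocr):
    """Run ocr only on characters whose recognition flag is on."""
    for ch in chars:
        if ch["flag"]:
            ch["text"] = ocr(ch["image"])
            ch["flag"] = False      # recognized, so the flag goes off
    return chars

calls = []

def ocr(image):
    """Stub recognizer: misreads the image of a zero as the letter O."""
    calls.append(image)
    return {"zero": "O"}.get(image, image)

chars = [
    {"image": "A", "text": "A", "flag": False},
    # The operator already fixed this character from "O" to "0".
    {"image": "zero", "text": "0", "flag": False},
    # This character belongs to a corrected layout portion.
    {"image": "B", "text": "", "flag": True},
]
rerecognize(chars, ocr)
text = "".join(ch["text"] for ch in chars)
```

With the flags respected, the recognizer is called once and the operator's "0" is kept; recognizing every character again would have replaced it with the stub's "O".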

[0036] Next, a second embodiment will be described. <<Configuration of Second Embodiment>> In the second embodiment, the configuration as drawn is the same as that of the first embodiment shown in FIG. 1, but the function of the recognition data confirmation/correction unit 10b differs. Specifically, in the second embodiment, when displaying the layout data to be recognized, the recognition data confirmation/correction unit 10b controls the display so that areas that are re-recognized are distinguished from areas that are not. The rest of the configuration is the same as in the first embodiment and is not described again here.

[0037] <<Operation of Second Embodiment>> Only the parts of the operation that differ from the first embodiment are described below; everything else is the same as in the first embodiment. In the repeated confirmation/correction of the recognition data in step S8 of FIG. 2, the portions re-recognized in step S7 and the portions not re-recognized are displayed so that they can be told apart. For example, the portions not re-recognized are shaded (reverse video, a different display color, blinking, underlining, bold display and the like may be used instead of shading) to show that they were already handled in the previous confirmation/correction of the recognition data. The re-recognized portions are shown in normal display to indicate that they are the target of the current confirmation/correction of the recognition data.
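The display rule can be sketched for a text terminal. This example assumes reverse video (one of the alternatives listed above) rendered with ANSI escape sequences; the function name `render` and the input format are hypothetical.

```python
from itertools import groupby

REVERSE, RESET = "\x1b[7m", "\x1b[0m"   # ANSI reverse video on / attributes off

def render(chars):
    """chars: iterable of (text, rerecognized) pairs.

    Runs that were NOT re-recognized are shown in reverse video to mark
    them as already handled; re-recognized runs use normal display.
    """
    out = []
    for rerecognized, run in groupby(chars, key=lambda c: c[1]):
        text = "".join(t for t, _ in run)
        out.append(text if rerecognized else f"{REVERSE}{text}{RESET}")
    return "".join(out)

line = render([("d", False), ("o", False), ("c", True), ("s", True)])
```

Grouping consecutive characters with the same state keeps the escape sequences to one pair per run instead of one pair per character.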

[0038] FIG. 10 shows this recognition data confirmation/correction screen. FIG. 10(a) is the confirmation/correction screen for the recognition data corresponding to the corrected portion of the layout data of FIG. 9, and FIG. 10(b) is the confirmation/correction screen corresponding to the correction of the layout data of FIG. 8. In the figure, the framed characters 「域」 and 「認」 indicate reverse-video display.

[0039] <<Effects of Second Embodiment>> As described above, according to the second embodiment, when confirming/correcting the re-recognition results the operator can tell whether a given result was produced from the corrected layout data or is a previously confirmed result, which reduces the operator's burden in the confirmation/correction work.

[0040] In each of the above embodiments, the image input unit 1 is a device such as a scanner that optically reads the document, but it may instead be configured to receive image data via a communication line. Likewise, the output unit 7 is not limited to a magnetic disk device or a printer and may be configured as an external output to a communication line.

[0041]

[Effects of the Invention] As described above, according to the document reading device of the present invention, only the portions whose layout data has been corrected again are recognized in repeated recognition processing, so processing efficiency is improved without redundant recognition. In addition, because recognition is not performed on portions whose layout data needs no correction, the recognition data does not revert to its pre-correction state, which spares the operator wasted effort.

[Brief description of drawings]

FIG. 1 is a configuration diagram of the document reading device of the present invention.

FIG. 2 is an operation flowchart of the document reading device of the present invention.

FIG. 3 is an explanatory diagram of the layout data confirmation/correction processing in the document reading device of the present invention.

FIG. 4 is an explanatory diagram showing a configuration example of the layout data and recognition data in the document reading device of the present invention.

FIG. 5 shows a recognition data confirmation/correction screen in the document reading device of the present invention.

FIG. 6 is an explanatory diagram of the layout data correction determination in the document reading device of the present invention.

FIG. 7 is an explanatory diagram (part 1) of the contents of the layout data at the time of the layout data correction determination in the document reading device of the present invention.

FIG. 8 is an explanatory diagram (part 2) of the contents of the layout data at the time of the layout data correction determination in the document reading device of the present invention.

FIG. 9 is an explanatory diagram (part 3) of the contents of the layout data at the time of the layout data correction determination in the document reading device of the present invention.

FIG. 10 is an explanatory diagram of the recognition data confirmation/correction screen in the second embodiment of the document reading device of the present invention.

[Explanation of symbols]

1 Image input unit
3 Layout analysis unit
5 Recognition unit
8 Display unit
9 Operation unit
10 Control unit
10a Layout data confirmation/correction unit
10b Recognition data confirmation/correction unit

Claims

[Claims]

1. A document reading device comprising: a layout analysis unit that analyzes, from image data of a document, the layout of the positions on the document of data areas to be recognized, and obtains layout data of the document; a layout data confirmation/correction unit that displays the layout data obtained by the layout analysis unit and the image data superimposed on a display unit to accept confirmation/correction of the layout data, provides a recognition flag corresponding to each area, sets, when the confirmation/correction is performed for the second or a subsequent time, the recognition flag of each area on which correction was performed to a state different from that of areas on which no correction was performed, and outputs the layout data with these recognition flags attached; a recognition unit that, based on the recognition flags of the layout data, recognizes all areas, including character recognition, in the first recognition of the target document, and in subsequent recognition recognizes only the areas corrected by the layout data confirmation/correction unit; and a recognition data confirmation/correction unit that, after the recognition processing in the recognition unit, causes the display unit to display a selection as to whether the layout data needs to be corrected and, when correction is designated, instructs the layout data confirmation/correction unit to confirm/correct the layout data.

2. The document reading device according to claim 1, comprising a layout data confirmation/correction unit that, when the layout data to be recognized is displayed, displays the areas to be re-recognized and the areas not to be re-recognized so that they can be distinguished from each other.