JPH04147391A - Text recognition apparatus

Info

Publication number: JPH04147391A
Application number: JP2271865A
Authority: JP
Inventors: Noboru Shimizu; 昇清水
Original assignee: Fuji Xerox Co Ltd
Current assignee: Fujifilm Business Innovation Corp
Priority date: 1990-10-09
Filing date: 1990-10-09
Publication date: 1992-05-20
Anticipated expiration: 2014-05-17
Also published as: JP2890788B2

Abstract

PURPOSE: To improve the efficiency of correcting recognition results in document image recognition by displaying the layout information and the character recognition results and by reprocessing the recognition based on layout correction information. CONSTITUTION: A layout analysis section 2 extracts the layout of a document image. A character recognition section 3 recognizes the character portions based on the layout information and outputs candidate characters together with the recognition result. A knowledge processing section 4 performs context processing on the character recognition results. A correction interface section 51 of a correction section 5 displays the layout information and the character recognition results and allows the layout to be corrected. A reprocessing instruction section 52 instructs the knowledge processing section 4 to perform reprocessing based on information from the correction interface section 51. The operator corrects the document using an input device such as a keyboard/mouse 511 while viewing a recognition result display section 512.

Description

DETAILED DESCRIPTION OF THE INVENTION

[Field of industrial application]

The present invention relates to an apparatus that recognizes document images and creates electronic documents.

[Conventional technology]

With the development of character recognition technology in recent years, various OCR (Optical Character Reader) devices have been developed.

Methods proposed so far for extracting the elements that make up a page, such as characters, charts, or photographs, without using prior information about the format include those that use the run lengths of both black and white pixels, those that use the pixel density within a window of fixed size, and those that use the neighboring line density method. The main purpose of these methods is to separate grayscale figures from binary figures, and chart regions from character regions, on the premise of image storage and editing or image transmission.
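As a rough illustration of the first family of methods, the run lengths of black and white pixels along one scan line can be computed as below. This is a minimal sketch, not the procedure of any of the methods mentioned; the function name `run_lengths` and the convention that 1 means black and 0 means white are assumptions made for the example.

```python
from itertools import groupby

def run_lengths(row):
    """Return (pixel_value, run_length) pairs for one scan line of a binary image."""
    return [(value, sum(1 for _ in group)) for value, group in groupby(row)]

# Assumed convention: 1 = black pixel, 0 = white pixel.
scan_line = [0, 0, 1, 1, 1, 0, 1, 0, 0, 0]
runs = run_lengths(scan_line)
```

How the resulting runs are classified into text, chart, or photograph regions differs from method to method and is not specified here.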

Meanwhile, as a method for extracting character strings with character recognition in mind, a method that traces connected components of black pixels has been reported. There is also a method in which rough information about the format, such as column positions, is given in advance and the page components are extracted using that information as a clue.

Furthermore, the paper by 本田 and 増圧, "A method for extracting page components without format specification information," Transactions of the Institute of Electronics and Communication Engineers of Japan, 83/1, Vol. J66-D, No. 1, pp. 111-118, proposes a method that, from a page whose components are field separators (linear figures that forcibly divide regions on the page) and the character strings of headlines and body text, automatically extracts these page components without using any prior information about their positions on the page.

When the arrangement of page components, that is, the layout, is recognized automatically in this way, it is sometimes recognized incorrectly. For this reason, a human has had to correct the automatically recognized layout.

In a conventional document recognition apparatus, recognition results were corrected by correcting only the layout when the layout was being corrected, and only the characters when the characters were being corrected. Therefore, even when character recognition was wrong because the layout structure had been recognized incorrectly, the correction ended with the layout alone, and the character portion was never corrected automatically.

To give a concrete example, consider an original image consisting of two columns of text with characters such as line numbers between them, as shown in Fig. 3(a). The layout should be analyzed as two columns plus the characters between them, as in Fig. 3(b). However, because the intermediate characters and the character lines in the blocks on either side are close together horizontally, the intermediate characters may be merged with the lines of the adjacent columns during layout analysis, as in Fig. 3(c). In this case, as indicated by region 6 enclosed by a broken line in Fig. 3(d), one line of the left column, the intermediate characters, and one line of the right column are regarded as a single continuous line, and context processing is performed on that line. As a result, the portion that should have been processed as 「気です。明日１０　ローフでゴー」 is incorrectly processed as 「気です。明日１００７でゴー」, as shown in Fig. 3(e).
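The failure mode described above can be reproduced with a toy line grouper. The patent does not say how the layout analysis section groups characters into lines, so the gap-threshold rule, the function name `merge_close_segments`, and the coordinates below are all assumptions chosen only to illustrate the effect.

```python
def merge_close_segments(segments, max_gap):
    """Merge horizontal extents (left, right) whose gap is at most max_gap."""
    merged = []
    for left, right in sorted(segments):
        if merged and left - merged[-1][1] <= max_gap:
            prev_left, prev_right = merged[-1]
            merged[-1] = (prev_left, max(prev_right, right))
        else:
            merged.append((left, right))
    return merged

# Hypothetical x-extents on one text row: left column, line number, right column.
row = [(0, 100), (104, 112), (116, 220)]
too_loose = merge_close_segments(row, max_gap=8)  # gaps of 4 are bridged
tight = merge_close_segments(row, max_gap=2)      # gaps of 4 are kept
```

With the looser threshold the three extents collapse into one line, which corresponds to the merged region 6 of Fig. 3(d); with the tighter one they stay separate, as in Fig. 3(b).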

Context processing produces this result because, as described later, recognition as digits may be given priority.

When layout analysis is performed incorrectly in this way, the knowledge processing, which applies context processing to the character recognition results to produce text that makes sense, cannot work properly, and an incorrect recognition result such as that shown in Fig. 3(e) is output.

In such a case the operator corrects the layout manually. Conventionally, however, even in cases where knowledge processing of the character recognition results would have given the correct answer had the layout been analyzed accurately, the character recognition results were not corrected automatically when the operator corrected the layout.

[Problem to be solved by the invention]

Thus, in a conventional document recognition apparatus, even when an error in layout analysis was corrected by the operator, the correction had no beneficial effect on the character recognition results.

The present invention has been made to solve the above problem, and its object is to improve the efficiency of correcting recognition results.

[Means to solve the problem]

To achieve the above object, the document recognition apparatus of the present invention comprises: an image input section that inputs a document as an image; a layout analysis section that extracts the layout of the document image input by the image input section; a character recognition section that recognizes the character portions based on the layout information obtained from the layout analysis section and outputs candidate characters together with the recognition result; a knowledge processing section that performs context processing on the character recognition results of the character recognition section; a correction interface section that displays the layout information from the layout analysis section and the character recognition results of the character recognition section and allows the layout to be corrected; and a reprocessing instruction section that instructs the knowledge processing section to perform reprocessing based on the layout correction information from the correction interface section.

[Operation]

According to the present invention, a general document is input as a digital image from the image input section, the layout analysis section extracts the layout of the original image input from the image input section, and the character recognition section performs character recognition, which also outputs candidate characters, on the regions that layout analysis has determined to be character blocks.

Next, the knowledge processing section performs context processing on the character recognition results, including the candidate characters, to produce text that makes sense.

Next, the correction interface section displays, on the recognition result display section, the results of the character recognition section that have passed through the layout analysis section and the knowledge processing section, and the operator makes corrections through an input device such as a keyboard or mouse. When the layout information is changed as a result, the reprocessing instruction section feeds the layout correction information back to the knowledge processing section, and the newly processed result is displayed on the recognition result display section. By repeating this correction process, the intended document is created.

[Example]

The features of the present invention are described in detail below based on the embodiment shown in the drawings.

Fig. 1 is a block diagram of the document recognition apparatus of this embodiment, and Fig. 2 is a block diagram of the correction section used in that apparatus.

The document recognition apparatus comprises an image input section 1 that inputs a document as an image, a layout analysis section 2 that extracts the layout of the input document image, a character recognition section 3 that recognizes the character portions based on the obtained layout information and outputs candidate characters together with the recognition result, a knowledge processing section 4 that performs grammatical processing and the like on the character recognition results, a correction interface section 51 that displays the layout information and the character recognition results and allows the layout to be corrected, and a reprocessing instruction section 52 that instructs the knowledge processing section 4 to perform reprocessing based on the correction information.

The correction interface section 51 and the reprocessing instruction section 52 together constitute the correction section 5. The correction interface section 51 consists of a keyboard/mouse 511 serving as the input device, a recognition result display section 512 that displays the recognition results of the character recognition section 3, and a correction processing control section 513 that receives the character recognition results. The correction processing control section 513 also has a function for creating documents based on input from the keyboard/mouse 511, that is, a word processor function.

Next, the operation of the document recognition apparatus described above is explained.

First, the document image input and binarized by the image input section 1 is analyzed by the layout analysis section 2. The analysis uses a method such as the one proposed in the reference cited above, "A method for extracting page components without format specification information," that is, a method that, from a page whose components are field separators (linear figures that forcibly divide regions on the page) and the character strings of headlines and body text, automatically extracts these page components without using any prior information about their positions on the page. The character recognition section 3 then performs character recognition on the blocks judged to be character-only regions.

A known method is used for character recognition. In addition to the top-ranked recognized character, the character recognition section 3 also outputs several candidate characters that are likely matches, ranked by recognition confidence. The knowledge processing section 4 performs grammatical processing and the like on the recognition results output from the character recognition section 3 and converts them into character strings that conform to the prepared grammar.

An example of this processing is described using the image shown in Fig. 3. Suppose that the recognition results, including candidate characters, output from the character recognition section 3 for the original character string are as shown in Table 1.

Given recognition results with candidate characters such as those in Table 1, if the character immediately preceding the character of interest in the top-ranked character string is a digit, the character of interest is also likely to be a digit. Therefore, if the top-ranked candidate is not a digit and the second-ranked candidate is a digit, the second-ranked candidate is output as the recognition result. The same is done for kanji, hiragana, and katakana. In Table 1, the character string 「１０ローフ」 is recognized correctly as 「１０ローフ」 when it is composed only of the top-ranked characters. However, if the second-ranked candidates for the recognition result 「ローフ」 are 「０−７」, the knowledge processing described above combines them with the intermediate characters, that is, the line number 「１０」, and generates the character string 「１００−７」. In this way the knowledge processing section 4 automatically corrects the results of the character recognition section 3.
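The rule described above can be sketched as follows. This is an illustrative reading, not the patent's implementation: the candidate lists stand in for Table 1 and are hypothetical, the rule is applied left to right against the previously emitted character, and treating the ASCII hyphen as part of the numeric class is an assumption made so that the sketch yields the string given in the example.

```python
def char_class(ch):
    """Classify a character; '-' is counted as numeric (assumption)."""
    if ch.isdigit() or ch == "-":
        return "numeric"
    if "\u3040" <= ch <= "\u309f":
        return "hiragana"
    if "\u30a0" <= ch <= "\u30ff":
        return "katakana"
    if "\u4e00" <= ch <= "\u9fff":
        return "kanji"
    return "other"

def apply_context_rule(candidates):
    """Pick the 2nd-ranked candidate when it matches the previous character's class
    and the top-ranked candidate does not."""
    out = []
    for cands in candidates:
        choice = cands[0]
        if out and len(cands) > 1:
            prev = char_class(out[-1])
            if (prev != "other"
                    and char_class(cands[0]) != prev
                    and char_class(cands[1]) == prev):
                choice = cands[1]
        out.append(choice)
    return "".join(out)

# Hypothetical candidate lists, each ordered from most to least likely.
right_column = [["ロ", "0"], ["ー", "-"], ["フ", "7"]]
line_number = [["1"], ["0"]]

merged_result = apply_context_rule(line_number + right_column)
separate_result = apply_context_rule(right_column)
```

When the line number and the right-column text are processed as one string, the digits pull the following characters toward their numeric candidates; when the right-column text is processed on its own, the top-ranked characters are kept.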

However, if the character string to be processed has not been extracted correctly, that is, if the layout analysis section 2 has not produced the intended layout, this processing lowers the recognition rate, as shown in Fig. 3. The results of layout analysis (block types, the position and size of each character, and so on), the character recognition results after knowledge processing (including candidate characters), and a table indicating which position in which block each recognized character corresponds to are sent to the correction section 5.

In the correction section 5, the correction processing control section 513 of the correction interface section 51 first receives the layout analysis results and the character recognition results after knowledge processing, and displays the layout and the character recognition results on the recognition result display section 512.

The operator then makes corrections using an input device such as the keyboard/mouse 511 while viewing the recognition result display section 512.

In the display, blocks can be color-coded, for example with character-only blocks in blue and figure blocks in red, so that the operator can easily tell the block types apart.

Corrected and uncorrected blocks can also be distinguished by color. As for the correction procedure, the operator first corrects the layout, which is easy to assess visually. Corrections change the type of a block (character-only block, figure block, and so on), its position, its size, and the like. The table held by the correction processing control section 513, which contains information such as the position and size of each block and the characters it contains, is rewritten accordingly.

When layout correction is finished and the intended layout has been obtained, the reprocessing instruction section 52 instructs the knowledge processing section 4 to reprocess the portions of the layout that were changed.
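One way to picture the block table and the reprocessing instruction is sketched below. The patent gives no data format, so the `Block` fields, the `changed` flag, and the helper names are hypothetical, and `top_candidates` is only a stand-in for real knowledge processing.

```python
from dataclasses import dataclass

@dataclass
class Block:
    block_id: int
    kind: str               # "text" or "figure"
    bbox: tuple             # (left, top, right, bottom)
    candidates: list        # per-character candidate lists from recognition
    text: str = ""
    changed: bool = False   # set when the operator edits this block

def correct_layout(table, old_ids, new_blocks):
    """Replace blocks in the table and mark the replacements as changed."""
    for block_id in old_ids:
        del table[block_id]
    for block in new_blocks:
        block.changed = True
        table[block.block_id] = block

def reprocess_changed(table, knowledge_process):
    """Re-run knowledge processing only on text blocks the operator changed."""
    reprocessed = []
    for block in table.values():
        if block.changed and block.kind == "text":
            block.text = knowledge_process(block.candidates)
            block.changed = False
            reprocessed.append(block.block_id)
    return reprocessed

def top_candidates(candidates):
    """Stand-in for knowledge processing: take each top-ranked candidate."""
    return "".join(c[0] for c in candidates)

merged = [["1"], ["0"], ["ロ", "0"], ["ー", "-"], ["フ", "7"]]
table = {
    1: Block(1, "text", (104, 0, 220, 20), merged, text="100-7"),
    2: Block(2, "figure", (0, 30, 220, 90), []),
}
# The operator splits the wrongly merged block 1 into two blocks.
correct_layout(table, old_ids=[1], new_blocks=[
    Block(3, "text", (104, 0, 112, 20), merged[:2]),
    Block(4, "text", (116, 0, 220, 20), merged[2:]),
])
done = reprocess_changed(table, top_candidates)
```

Only the two blocks created by the correction are reprocessed; the untouched figure block is skipped.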

As a result, when a document with the layout of Fig. 3(a) has been analyzed as in Fig. 3(c), once the operator corrects the layout to that of Fig. 3(b), the knowledge processing section can perform knowledge processing correctly even on region 6, where the layout was analyzed incorrectly.

Finally, remaining errors in the character recognition results are corrected using the word processor function of the correction processing control section 513.

Through these steps, the document intended by the operator is created.

The above example describes an embodiment for document creation only, but the apparatus can also be used for form (slip) entry.

In the above embodiment, the instruction to reprocess is issued to knowledge processing after all layout corrections have been completed, but it is also possible to issue the instruction each time an individual layout correction is made.

The above embodiment describes only reprocessing by knowledge processing, but reprocessing by character recognition is also possible.

A reprocessing instruction is issued to character recognition when segmentation of individual characters, the smallest unit in layout analysis, has failed.

Specifically, if the character 「林」 is not segmented correctly, it is recognized as the two characters 「木」 and 「木」. In this case, the operator corrects the two rectangles enclosing 「木」 and 「木」 to indicate that they form a single character, and a reprocessing instruction is thereby issued to the character recognition section.
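The rectangle correction can be sketched as merging the two boxes into their common bounding box and recognizing that region again. The box coordinates, the function names, and the recognizer stub are hypothetical; a real recognizer would classify the image pixels inside the merged box.

```python
def merge_boxes(boxes):
    """Return the smallest (left, top, right, bottom) rectangle enclosing all boxes."""
    lefts, tops, rights, bottoms = zip(*boxes)
    return (min(lefts), min(tops), max(rights), max(bottoms))

def rerecognize_as_one(boxes, recognize):
    """Merge the operator-selected boxes and request recognition of the merged region."""
    merged = merge_boxes(boxes)
    return merged, recognize(merged)

def stub_recognize(box):
    """Hypothetical recognizer stub used only to make the sketch runnable."""
    return "林" if box == (10, 0, 30, 20) else "?"

halves = [(10, 0, 19, 20), (21, 0, 30, 20)]  # two boxes, each read as 「木」
merged_box, character = rerecognize_as_one(halves, stub_recognize)
```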

[Effect of the invention]

As described above, according to the present invention, the results of layout correction are processed by the knowledge processing section, so knowledge processing is carried out fully and the efficiency of character correction during document creation is markedly improved.

[Brief description of the drawings]

Fig. 1 is a block diagram of a document recognition apparatus showing an embodiment of the present invention, Fig. 2 is a block diagram of the correction section used in that apparatus, and Fig. 3 is an explanatory diagram of the problems in the conventional example.

1: Image input section
2: Layout analysis section
3: Character recognition section
4: Knowledge processing section
5: Correction section
51: Correction interface section
511: Keyboard/mouse
512: Recognition result display section
513: Correction processing control section
52: Reprocessing instruction section
6: Incorrectly analyzed layout region

Patent applicant: Fuji Xerox Co., Ltd.
Agent: 小掘 益

Claims

[Claims]

1. A document recognition apparatus comprising: an image input section that inputs a document as an image; a layout analysis section that extracts the layout of the document image input by the image input section; a character recognition section that recognizes the character portions based on the layout information obtained from the layout analysis section and outputs candidate characters together with the recognition result; a knowledge processing section that performs context processing on the character recognition results of the character recognition section; a correction interface section that displays the layout information from the layout analysis section and the character recognition results of the character recognition section and allows the layout to be corrected; and a reprocessing instruction section that instructs the knowledge processing section to perform reprocessing based on the layout correction information from the correction interface section.