JPH08202824A

JPH08202824A - Document picture recognition device

Info

Publication number: JPH08202824A
Application number: JP7010751A
Authority: JP
Inventors: Masafumi Shimoyama (下山 雅史); Kazuhiro Ishikawa (石川 和弘)
Original assignee: Oki Electric Industry Co Ltd
Current assignee: Oki Electric Industry Co Ltd
Priority date: 1995-01-26
Filing date: 1995-01-26
Publication date: 1996-08-09

Abstract

PURPOSE: To edit and reconstruct a document automatically by writing instruction symbols on the document to designate editing operations. CONSTITUTION: The device comprises: a color extraction unit 1 that separates input image data containing characters and instruction symbols into a character binary image and an instruction symbol binary image according to color; an instruction symbol cutout unit 2 that extracts instruction symbol position/size information from the instruction symbol binary image; an instruction symbol recognition unit 3 that recognizes instruction information for the editing process based on the instruction symbol binary image and the instruction symbol position/size information; a character cutout unit 4 that extracts character position/size information from the character binary image; a character recognition unit 5 that recognizes character information based on the character binary image and the character position/size information; and a document composition unit 6 that performs the editing process corresponding to the instruction information, based on the instruction symbol position/size information, the instruction information, the character position/size information, and the character information, and then reconstructs and outputs the document.

Description

Detailed Description of the Invention

[0001]

[Field of Industrial Application] The present invention relates to a document image recognition device that reads characters from document image data of, for example, newspapers, magazines, and other general documents, and converts them into text data.

[0002]

[Prior Art] A conventional document image recognition device performs processing such as character/figure separation on the input document image and, in each region judged to contain characters, cuts out and recognizes the character patterns one character at a time. With this conventional technique, the input document image is simply converted into text data as it is. If the document is to be converted into data not as it stands but with corrections or edits applied to part of the text, the recognition process is performed first and the resulting text is then corrected by hand.

[0003]

[Problem to Be Solved by the Invention] However, with the conventional technique, manual intervention is required in the correction step that follows document image recognition, so originals cannot be batch-processed continuously and efficiency is poor.

[0004]

[Means for Solving the Problem] The present invention is characterized by comprising: a color extraction unit that separates input image data, consisting of characters and instruction symbols that have a color different from that of the characters and specify editing of those characters, into a character binary image and an instruction symbol binary image according to color; an instruction symbol cutout unit that extracts instruction symbol position/size information from the instruction symbol binary image; an instruction symbol recognition unit that recognizes instruction information for the editing process based on the instruction symbol binary image and the instruction symbol position/size information; a character cutout unit that extracts character position/size information from the character binary image; a character recognition unit that recognizes character information based on the character binary image and the character position/size information; and a document composition unit that, based on the instruction symbol position/size information, the instruction information, the character position/size information, and the character information, performs the editing process corresponding to the instruction information, reconstructs the document, and outputs it.

[0005]

[Operation] The color extraction unit divides the input image data by color into an instruction symbol binary image and a character binary image. The instruction symbol cutout unit takes the instruction symbol binary image as input and outputs instruction symbol position/size information. The instruction symbol recognition unit takes the instruction symbol binary image and the instruction symbol position/size information as input, recognizes the instruction information, and outputs it to the document composition unit.

[0006] The character cutout unit takes the character binary image as input and outputs character position/size information. The character recognition unit takes the character binary image and the character position/size information as input, recognizes the characters, and outputs them to the document composition unit. The document composition unit takes the instruction symbol position/size information, the instruction information, the character position/size information, and the character information as input, edits the character information according to the editing operation designated by each instruction symbol, and thereby reconstructs and outputs the document.

[0007]

[Embodiment] An embodiment of the present invention is described below with reference to the drawings. FIG. 1 is a block diagram of the embodiment and shows an outline of its configuration. In the figure, reference numeral 1 denotes a color extraction unit. Its input is the multi-valued image data of the red, green, and blue components of a color document image obtained by scanning a color document with, for example, a color scanner (not shown). The character color and the instruction symbol color differ from each other so that they can be distinguished. The unit extracts the character color component and the instruction symbol color component from the input image data, obtains a binary image for each, and outputs them as the character binary image and the instruction symbol binary image, respectively.

[0008] A color document here means a document in which the color used to write the text differs from the color of the instruction symbols applied to that text, for example a document written in black to which instruction symbols have been added with a colored pen. Any colors may be used as long as the text color, the instruction symbol color, and the background color can be distinguished from one another. Reference numeral 2 denotes an instruction symbol cutout unit, which takes the instruction symbol binary image output from the color extraction unit 1 as input, determines the position and size of each instruction symbol, and outputs them as instruction symbol position/size information.

[0009] Reference numeral 3 denotes an instruction symbol recognition unit, which takes as input the instruction symbol binary image output from the color extraction unit 1 and the instruction symbol position/size information output from the instruction symbol cutout unit 2, recognizes each instruction symbol by a technique such as pattern recognition, and outputs the result as instruction information. Reference numeral 4 denotes a character cutout unit, which takes the character binary image output from the color extraction unit 1 as input, determines the position and size of each character, and outputs them as character position/size information.

[0010] Reference numeral 5 denotes a character recognition unit, which takes as input the character binary image output from the color extraction unit 1 and the character position/size information output from the character cutout unit 4, recognizes each character by a technique such as pattern recognition, and outputs character information such as the character code corresponding to that character. Reference numeral 6 denotes a document composition unit, which takes as input the instruction symbol position/size information output from the instruction symbol cutout unit 2, the instruction information output from the instruction symbol recognition unit 3, the character position/size information output from the character cutout unit 4, and the character information output from the character recognition unit 5, reconstructs the document from these inputs, and outputs the document data, for example as text data.
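The data flow among units 1 to 6 can be summarized in a short sketch. This is only an illustration of the wiring described above, not the device's actual implementation: the function name `recognize_document` and the six callables passed to it are hypothetical placeholders for the six units.

```python
def recognize_document(image, extract_colors, cut_symbols, recognize_symbols,
                       cut_chars, recognize_chars, compose):
    """Wire the six units together in the order given in the description.

    All six callables are hypothetical stand-ins for units 1 to 6.
    """
    # Unit 1: split the color image into two binary images.
    char_image, symbol_image = extract_colors(image)

    # Units 2 and 3: locate the instruction symbols, then recognize them.
    symbol_boxes = cut_symbols(symbol_image)
    symbol_codes = recognize_symbols(symbol_image, symbol_boxes)

    # Units 4 and 5: locate the characters, then recognize them.
    char_boxes = cut_chars(char_image)
    char_codes = recognize_chars(char_image, char_boxes)

    # Unit 6: apply the edits and rebuild the document from all four inputs.
    return compose(symbol_boxes, symbol_codes, char_boxes, char_codes)
```

Passing the units in as callables keeps the sketch independent of any particular cutout or recognition technique, which the description leaves open.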

[0011] FIG. 2 is an explanatory diagram of an input image. As an example document, the figure shows an image of the character string "あいえお" (a, i, e, o). Between "い" and "え" there is a symbol meaning insertion, as an example of an instruction symbol, and the character "う" (u) is written directly above this symbol. The instruction symbol is actually written in a color different from the character color. After it is read, insertion is carried out according to the procedure described below, and the character string "あいうえお" is output as the reconstructed document.

[0012] First, an original is read with a color scanner or the like (not shown) to obtain a color document image. This image is input to the color extraction unit 1 as multi-valued image data based on, for example, the intensities of the red, green, and blue components of each pixel (hereinafter R, G, and B, respectively). The color extraction unit 1 compares the multi-valued input data with thresholds set in advance for each of R, G, and B, and thereby classifies each pixel as belonging to the background color, the character color, or the instruction symbol color.
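As a minimal sketch of this per-pixel classification, assume black text, a red pen for the instruction symbols, and a white background. The threshold values, the label names, and the rule that any other color counts as background are illustrative assumptions; the description only says that thresholds are set in advance for each of R, G, and B.

```python
BACKGROUND, CHARACTER, SYMBOL = "background", "character", "symbol"

# Hypothetical per-channel thresholds on a 0-255 intensity scale.
T_R, T_G, T_B = 128, 128, 128


def classify_pixel(r, g, b):
    """Classify one pixel by comparing each channel with its threshold."""
    bright = (r >= T_R, g >= T_G, b >= T_B)
    if bright == (False, False, False):
        return CHARACTER   # dark in every channel: assumed black ink
    if bright == (True, False, False):
        return SYMBOL      # only red is strong: assumed red pen
    return BACKGROUND      # everything else is treated as background


def classify_image(pixels):
    """Apply classify_pixel to a row-major grid of (r, g, b) tuples."""
    return [[classify_pixel(*p) for p in row] for row in pixels]
```

Other color combinations would need a different decision table, but the structure (three threshold comparisons per pixel, one of three labels out) stays the same.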

[0013] FIG. 3 is an explanatory diagram of the character binary image, and FIG. 4 is an explanatory diagram of the instruction symbol binary image. The character binary image is the image in which only the pixels classified as the character color by the above per-pixel classification are treated as "black" and all other pixels as "white". Likewise, the instruction symbol binary image is the image in which only the pixels classified as the instruction symbol color are treated as "black" and all other pixels as "white". The terms "white" and "black" used here for the pixels of a binary image do not denote actual colors; they denote binarized data that could equally be expressed as, for example, "0" and "1".
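The two binary images can be derived from per-pixel labels as in the following sketch. It assumes the labels are the strings "background", "character", and "symbol" (hypothetical names) and encodes "black" as 1 and "white" as 0, which is one of the encodings the text allows.

```python
def to_binary_images(labels):
    """Build the character and instruction symbol binary images.

    labels: row-major grid of per-pixel labels (hypothetical string names).
    Returns (char_image, symbol_image), each with 1 for "black", 0 for "white".
    """
    char_image = [[1 if v == "character" else 0 for v in row] for row in labels]
    symbol_image = [[1 if v == "symbol" else 0 for v in row] for row in labels]
    return char_image, symbol_image
```

Because each pixel carries exactly one label, no pixel is "black" in both images.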

[0014] The resulting instruction symbol binary image is output to the instruction symbol cutout unit 2 and the instruction symbol recognition unit 3, and the character binary image to the character cutout unit 4 and the character recognition unit 5. The character cutout unit 4 takes the character binary image as input, applies processing such as character/figure separation, and cuts out the characters in the regions judged to contain characters, thereby determining the position and size of each character and extracting it as a character pattern. The character recognition unit 5 recognizes each pattern, for example by pattern recognition against a recognition dictionary in which feature data for each character pattern is registered, and obtains the corresponding character code as character information. In this way the position and size of each character are obtained together with its character information. In FIG. 3, for example, the position/size information and the character code are obtained for each of characters 7, 8, 9, 10, and 11.
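The description does not fix a cutout algorithm. One simple possibility, shown here purely as an assumption, is to treat each 8-connected group of "black" pixels as one pattern and report its circumscribing rectangle as (x, y, width, height). Real characters such as "い" consist of several separate strokes, so a practical cutout step would also have to merge nearby components; that is omitted here.

```python
from collections import deque


def bounding_boxes(image):
    """Return (x, y, width, height) for each 8-connected component of 1-pixels.

    image: row-major grid of 0/1 values; y grows downward.
    Boxes are listed in the order their first pixel is met in a row-major scan.
    """
    height = len(image)
    width = len(image[0]) if height else 0
    seen = [[False] * width for _ in range(height)]
    boxes = []
    for y in range(height):
        for x in range(width):
            if image[y][x] != 1 or seen[y][x]:
                continue
            # Breadth-first flood fill from this unvisited black pixel.
            seen[y][x] = True
            queue = deque([(x, y)])
            x0 = x1 = x
            y0 = y1 = y
            while queue:
                cx, cy = queue.popleft()
                x0, x1 = min(x0, cx), max(x1, cx)
                y0, y1 = min(y0, cy), max(y1, cy)
                for dy in (-1, 0, 1):
                    for dx in (-1, 0, 1):
                        nx, ny = cx + dx, cy + dy
                        if (0 <= nx < width and 0 <= ny < height
                                and image[ny][nx] == 1 and not seen[ny][nx]):
                            seen[ny][nx] = True
                            queue.append((nx, ny))
            boxes.append((x0, y0, x1 - x0 + 1, y1 - y0 + 1))
    return boxes
```

The same function could be applied to the instruction symbol binary image, since the text states that symbol cutout uses the same technique as character cutout.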

[0015] Meanwhile, the instruction symbol cutout unit 2 takes the instruction symbol binary image as input, cuts out each instruction symbol from it, and determines its position and size. Based on the pattern of the cut-out symbol, the instruction symbol recognition unit 3 recognizes the instruction symbol. Instruction symbol cutout and recognition use the same techniques as the character cutout and recognition described above, applied to instruction symbols instead of characters. For this purpose a dedicated recognition dictionary for instruction symbols is prepared, analogous to the recognition dictionary used for character recognition, and is used to recognize the symbols.
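A dictionary-based recognizer can be sketched as nearest-template matching. This is an assumed stand-in for the unspecified "pattern recognition" step: the dictionary contents, the codes "INSERT" and "DELETE", the 3x3 pattern size, and the pixel-mismatch distance are all hypothetical, and the sketch assumes every pattern has already been normalized to the template size.

```python
# Hypothetical recognition dictionary for instruction symbols:
# code -> binary template of a fixed, normalized size.
SYMBOL_DICTIONARY = {
    "INSERT": [[0, 1, 0],
               [1, 0, 1],
               [1, 0, 1]],
    "DELETE": [[0, 0, 0],
               [1, 1, 1],
               [0, 0, 0]],
}


def mismatch(a, b):
    """Count the pixels at which two equally sized binary patterns differ."""
    return sum(pa != pb for row_a, row_b in zip(a, b) for pa, pb in zip(row_a, row_b))


def recognize(pattern, dictionary=SYMBOL_DICTIONARY):
    """Return the code whose template differs from the pattern in the fewest pixels."""
    return min(dictionary, key=lambda code: mismatch(pattern, dictionary[code]))
```

The same `recognize` function works for characters if it is given a character dictionary instead, which mirrors the statement that the two recognizers differ only in their dictionaries.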

[0016] In this way the position and size of each instruction symbol are obtained together with instruction information such as a code identifying the symbol. In FIG. 4, for example, the position/size information of instruction symbol 12 and the code corresponding to it are obtained. The insertion symbol used here as an example is defined in advance to mean that the character directly above it is to be inserted after the character directly to its lower left, and a code is assigned to this insertion symbol.

[0017] FIG. 5 is an explanatory diagram of position/size information. FIG. 5(a) illustrates the instruction symbol position/size information, using the insertion symbol as an example. The position and size of an instruction symbol are expressed, for example, as the start point coordinates, width, and height of the rectangle circumscribing the symbol. For this instruction symbol, the start point coordinates are shown as (x1, y1), the width as w1, and the height as h1.

[0018] FIG. 5(b) illustrates the character position/size information. The position and size of a character are expressed as the start point coordinates, width, and height of the rectangle circumscribing the character. For this character, the start point coordinates are shown as (x2, y2), the width as w2, and the height as h2. As an example coordinate system, the description here uses one in which the positive x direction points to the right of the figure and the positive y direction points downward.

[0019] Based on the instruction symbol position/size information, instruction information, character position/size information, and character information obtained as described above, the document composition unit 6 examines the instruction symbols one at a time and identifies each symbol, for example by checking its code. In the illustrated example, identifying the code shows that the instruction symbol is an insertion symbol, and the corresponding editing operation is interpreted as inserting the character directly above the insertion symbol after the character directly to its lower left.

[0020] Once the meaning of the instruction symbol is known, the characters targeted by the symbol are searched for. In this embodiment, for the instruction symbol, the center coordinates (x3, y3) of the upper side of its circumscribing rectangle are obtained as

    x3 = x1 + w1/2
    y3 = y1

and, for each character, the center coordinates (x4, y4) of the lower side of its circumscribing rectangle are obtained as

    x4 = x2 + w2/2
    y4 = y2 + h2
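These two anchor points follow directly from the circumscribing rectangle, as the sketch below shows. The class name `Box` and the sample coordinates are hypothetical; the bottom-center uses the rectangle's own width, which is what the geometric definition implies.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Box:
    """Circumscribing rectangle; (x, y) is the start point, y grows downward."""
    x: float
    y: float
    w: float
    h: float

    def top_center(self):
        # Center of the upper side: (x + w/2, y)
        return (self.x + self.w / 2, self.y)

    def bottom_center(self):
        # Center of the lower side: (x + w/2, y + h)
        return (self.x + self.w / 2, self.y + self.h)


# Hypothetical layout: a character sitting just above an insertion symbol.
symbol = Box(x=40, y=30, w=10, h=8)
char = Box(x=38, y=10, w=14, h=18)
x3, y3 = symbol.top_center()     # (45.0, 30)
x4, y4 = char.bottom_center()    # (45.0, 28)
```

In this layout the two points differ by 0 in x and 2 in y, so they lie close together, which is the situation the search for the "directly above" character looks for.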

[0021] If any input character satisfies

    |x3 - x4| < xs1 and |y3 - y4| < ys1

it is interpreted as the character directly above the instruction symbol, where xs1 and ys1 are constants. Next, for the instruction symbol, the center coordinates (x5, y5) of the lower side of its circumscribing rectangle are obtained as

    x5 = x1 + w1/2
    y5 = y1 + h1

and, for each character, the coordinates (x6, y6) of the upper-right vertex of its circumscribing rectangle are obtained as

    x6 = x2 + w2
    y6 = y2

[0022] If any input character satisfies

    |x5 - x6| < xs2 and |y5 - y6| < ys2

it is interpreted as the character directly to the lower left, where xs2 and ys2 are constants. In this way, for each instruction symbol, the character directly above it and the character directly to its lower left are identified, and the character directly above is inserted after the character directly to the lower left. In FIG. 2, "う" is judged to be the character directly above the insertion symbol and "い" the character directly to its lower left. Inserting the character "う" after the character "い" yields the character string "あいうえお" as the output document data.
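The whole search-and-insert step can be sketched as follows for a single horizontal line of text. The class name `Glyph`, the tolerance values standing in for xs1, ys1, xs2, and ys2, and the coordinates in the usage example are all assumptions chosen to mimic the FIG. 2 layout; only the anchor points and the two inequality tests come from the description.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Glyph:
    code: str   # character code, or a symbol code such as "INSERT"
    x: float    # start point (upper left); y grows downward
    y: float
    w: float
    h: float


# Hypothetical values for the constants xs1, ys1, xs2, ys2.
XS1, YS1 = 6, 6
XS2, YS2 = 6, 6


def find_above(symbol, chars):
    """Character whose bottom-center lies near the symbol's top-center."""
    x3, y3 = symbol.x + symbol.w / 2, symbol.y
    for c in chars:
        x4, y4 = c.x + c.w / 2, c.y + c.h
        if abs(x3 - x4) < XS1 and abs(y3 - y4) < YS1:
            return c
    return None


def find_lower_left(symbol, chars):
    """Character whose upper-right vertex lies near the symbol's bottom-center."""
    x5, y5 = symbol.x + symbol.w / 2, symbol.y + symbol.h
    for c in chars:
        x6, y6 = c.x + c.w, c.y
        if abs(x5 - x6) < XS2 and abs(y5 - y6) < YS2:
            return c
    return None


def reconstruct(symbol, chars):
    """Insert the character above the symbol after the one to its lower left."""
    above = find_above(symbol, chars)
    left = find_lower_left(symbol, chars)
    # Body text: every character except the one to be inserted, left to right.
    body = sorted((c for c in chars if c is not above), key=lambda c: c.x)
    if above is not None and left is not None and left is not above:
        body.insert(body.index(left) + 1, above)
    return "".join(c.code for c in body)


# Layout modeled on FIG. 2: "あいえお" on one line, "う" written above the mark.
chars = [
    Glyph("あ", 0, 40, 20, 20),
    Glyph("い", 20, 40, 20, 20),
    Glyph("え", 40, 40, 20, 20),
    Glyph("お", 60, 40, 20, 20),
    Glyph("う", 30, 8, 20, 20),
]
mark = Glyph("INSERT", 35, 30, 10, 10)
print(reconstruct(mark, chars))  # あいうえお
```

Sorting by x alone is enough only because the example has one line of horizontal text; a multi-line document would need line grouping first.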

[0023] There may be no character directly to the lower left of the insertion symbol, for example when a character is inserted at the head of a character string. In that case the character directly above the insertion symbol is inserted before the character directly to the lower right of the symbol. The illustrated example deals with horizontal writing, but an insertion symbol for vertical writing can also be defined and registered as one of the instruction symbols, for example by specifying that the character directly to the right of the symbol is inserted after the character directly above the symbol.
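The head-of-line fallback can be expressed as a small rule on indices. In this sketch the line is a list of characters in reading order and `lower_left` / `lower_right` are the indices found by the positional search, or `None` when no character matched; the function name and the error raised when neither neighbor exists are assumptions, and vertical writing is not covered.

```python
def insert_with_fallback(line, inserted, lower_left=None, lower_right=None):
    """Insert after the lower-left character, else before the lower-right one.

    line: list of characters in reading order (horizontal writing assumed).
    lower_left / lower_right: index into line, or None if no character matched.
    """
    if lower_left is not None:
        position = lower_left + 1      # normal case: after the lower-left character
    elif lower_right is not None:
        position = lower_right         # head of line: before the lower-right character
    else:
        raise ValueError("no neighboring character found for the insertion symbol")
    return line[:position] + [inserted] + line[position:]
```

For example, inserting "あ" at the head of "いうえお" uses `lower_right=0` and yields "あいうえお".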

[0024] The above description used the insertion symbol as an example, but the invention can be applied in the same way to any instruction symbol, such as symbols meaning deletion or replacement, by registering the shape of the symbol and the procedure of its editing process. As described above, by writing instruction symbols into the document in a color different from that of the text, recognizing those symbols, and performing the editing operations automatically, reading and editing can be carried out continuously without manual intervention, and processing efficiency improves.
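Registering an editing procedure per instruction symbol could look like the following sketch. The codes "INSERT", "DELETE", and "REPLACE", the exact meaning given to deletion and replacement, and the calling convention (a target index plus an optional payload character) are all hypothetical; the description only states that a symbol's shape and editing procedure are registered.

```python
def insert_after(line, target, payload):
    """Insert payload after the character at index target."""
    return line[:target + 1] + [payload] + line[target + 1:]


def delete_at(line, target, payload=None):
    """Remove the character at index target; payload is unused."""
    return line[:target] + line[target + 1:]


def replace_at(line, target, payload):
    """Replace the character at index target with payload."""
    return line[:target] + [payload] + line[target + 1:]


# Hypothetical registry: instruction code -> editing procedure.
EDIT_PROCEDURES = {
    "INSERT": insert_after,
    "DELETE": delete_at,
    "REPLACE": replace_at,
}


def apply_edit(code, line, target, payload=None):
    """Look up the procedure registered for code and apply it to the line."""
    return EDIT_PROCEDURES[code](line, target, payload)
```

Supporting a new symbol then means adding one dictionary entry here and one template to the symbol recognition dictionary, without touching the rest of the pipeline.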

[0025]

[Effects of the Invention] As described in detail above, by writing instruction symbols into the document, recognizing those symbols, and performing the editing operations automatically, reading and editing can be executed continuously without manual intervention, which improves processing efficiency.

[Brief description of drawings]

[FIG. 1] Block diagram of the embodiment.

[FIG. 2] Explanatory diagram of an input image.

[FIG. 3] Explanatory diagram of the character binary image.

[FIG. 4] Explanatory diagram of the instruction symbol binary image.

[FIG. 5] Explanatory diagram of position/size information.

[Explanation of symbols]

1: Color extraction unit
2: Instruction symbol cutout unit
3: Instruction symbol recognition unit
4: Character cutout unit
5: Character recognition unit
6: Document composition unit

Claims

[Claims]

1. A document image recognition device comprising: a color extraction unit that separates input image data, consisting of characters and instruction symbols that have a color different from that of the characters and specify editing of those characters, into a character binary image and an instruction symbol binary image according to color; an instruction symbol cutout unit that extracts instruction symbol position/size information from the instruction symbol binary image; an instruction symbol recognition unit that recognizes instruction information for the editing process based on the instruction symbol binary image and the instruction symbol position/size information; a character cutout unit that extracts character position/size information from the character binary image; a character recognition unit that recognizes character information based on the character binary image and the character position/size information; and a document composition unit that, based on the instruction symbol position/size information, the instruction information, the character position/size information, and the character information, performs the editing process corresponding to the instruction information, reconstructs the document, and outputs it.

2. The document image recognition device according to claim 1, wherein the document composition unit performs the editing process by identifying the character targeted by the instruction symbol based on the positional relationship between the instruction symbol and the characters.