JP3761923B2

JP3761923B2 - Image processing apparatus and method

Info

Publication number: JP3761923B2
Application number: JP15149095A
Authority: JP
Inventors: 忠則中塚
Original assignee: Canon Inc
Current assignee: Canon Inc
Priority date: 1995-06-19
Filing date: 1995-06-19
Publication date: 2006-03-29
Anticipated expiration: 2021-03-29
Also published as: JPH096898A

Description

【０００１】
【産業上の利用分野】
本発明は画像処理装置、詳しくは入力した画像データの属性の判定或いは訂正を行ない、下位の処理に渡す画像処理装置及び方法に関するものである。
【０００２】
【従来の技術】
一般に文字認識装置においては、イメージスキャナ等から入力した文書画像中の文字領域を抽出し、その領域毎の属性（縦書きか横書きか等の判定）が誤っているか否か、或いは、操作者が１つ１つの領域に対する属性を指定した後、文字認識処理を行なわせている。
【０００３】
【発明が解決しようとする課題】
しかしながら、各領域毎に属性を一々指定したのでは、多くの領域が存在する場合にはその作業が繁雑になるし、非常に面倒でもある。
【０００４】
【課題を解決するための手段】
本発明はかかる問題点に鑑みなされたものであり、画像中の各領域の属性の指定を簡単な操作でもって設定できる画像処理装置及び方法を提供しようとするものである。
【０００５】
この課題を解決するため、例えば本発明の画像処理装置は以下の構成を備える。すなわち、
画像を複数の領域に分割する領域分割手段と、
前記分割された領域毎の属性を判定する属性判定手段と、
前記分割された領域毎に前記判定された属性を示すシンボルを関連づけて、前記画像とシンボルとを表示するように制御する表示手段と、
前記表示されている画像中から、ユーザからの選択指示に応じて複数の領域を選択する選択手段と、
前記選択された複数の領域のいずれかに関連づけられたシンボルに対してユーザから属性変更の指示があった場合、当該属性変更の指示を受けたシンボルに関連づけられた領域の属性を変更するとともに、前記選択手段により選択された複数の領域を前記属性変更された領域の属性と同じ属性に変更する属性変更手段とを有する。
【０００６】
また、本発明の好適な実施態様に従えば、更に、入力した画像中の各領域に毎の枠を表示する枠表示手段と、
各領域毎の属性情報を表示する属性情報表示手段とを備え、
前記指定手段は、所望とする枠を指定し、前記属性指定手段は変更指定する複数の中の１つの属性情報を指定して属性を指定することが望ましい。
【０００７】
これによって、視覚的な枠単位に属性が指定できるようになり、格別な知識を有さなくても操作することが可能になる。
【０００８】
また、この場合においては、更に、前記属性指定手段で属性が指定された場合、前記指定手段で指定された複数の枠内に表示された属性情報を指定された属性に応じて更新する属性情報更新手段を備えるが望ましい。これによって、変更後の属性が確認できるようになる。
【０００９】
また、前記属性情報表示手段で表示される属性情報は、シンボルパターンであるが望ましい。これによって、文字等で属性を表示するのに対して、少ない表示領域でもって属性を操作者に報知することが可能になる。
【００１０】
また、前記指定手段は、全領域を一括して指定することを含むことが望ましい。これによって単一の属性からなる画像を入力した際に、一度に全部の属性を指定できるようになる。
【００１１】
また、前記下位の処理は、文字認識処理であることが望ましい。これによって、正しい認識処理を行なわせる、もしくは効率の良い文字認識を行なわせることが可能になる。
【００１２】
また、前記属性には、文字イメージが縦書き、横書きの種別情報が含まれることが望ましい。これによって文字認識する際の認識の順序が、文字列の読み方向として認識されるようになる。
【００１３】
【実施例】
以下、添付図面に従って本発明にかかる実施例を詳細に説明する。
【００１４】
＜文字認識装置の説明（図１）＞
図１は実施例の文字認識装置の概略構成を示すブロック図である。図１において、１は文字認識装置であり、２は文書画像を読取って入力する入力部で、例えばスキャナ等により構成され、原稿画像データをデジタルデータとして入力している。３は装置１全体を制御する演算処理用の中央処理装置（以下、ＣＰＵという）を示している。４はＲＯＭで、ＣＰＵ３の図３以降のフローチャートで示された制御プログラムや各種データを格納している。５はメモリ（ＲＡＭ）で、ＣＰＵ３のワークエリアとして使用されると共に、入力部２より入力された文書画像データを記憶する領域、前処理部７で切り出された文字パターンの位置、大きさを記憶する領域、更にはイメージの中で認識を行う場所すなわちブロック枠座標情報を格納する場合にも使用する。６は属性変更部で、不図示のポインティングデバイス等による操作によって、ブロックシンボル（詳細は後述する）をクリックする事によってその領域の選択及び属性の変更を行う。７は前処理部で、メモリ５に記憶された文書画像の１文字のパターンを切出す。８は認識部で、前処理部７によって前処理されたパターンから幾何学的特徴を抽出し、あらかじめ認識用辞書９に格納されている標準パターンと照合して文書画像の文字認識を行っている。１０は表示部で、入力された文書画像を表示するイメージ表示部１４、認識結果表示部１５、ブロック枠表示部１６、属性表示部１７からなる。１１はシステムバスで、ＣＰＵ３よりのデータバス、アドレスバスおよび制御信号バス等を含んでいる。１２は外部の出力装置、例えばプリンタ１３等とのインタフェース制御を行うインタフェース部を示している。
【００１５】
上記構成において、入力部２から原稿画像を入力しメモリ５に格納する。そして、格納された画像データとは別個に縮小画像をメモリ５上に生成する。縮小画像は、例えば原画像のｎ×ｎの画素ブロックを１つの縮小画素とするもので、ｎ×ｎの画素ブロック中に１つでも黒画素が存在する場合に縮小画素を黒画素として決定するものである。この処理を行うと、結局、原画像を縦横それぞれ１／ｎ倍にしたことと同じになり、且つ、連接する文字イメージ（文字パターンのドットイメージ）が互いに連結された状態になる。
【００１６】
この連結されたドット分布に外接する矩形を順次定義していき、領域分割していく。尚、領域分割処理で得られた領域の大きさ及びその中に含まれる黒画素の割合等で、処理ブロックの大方の属性（文字か図形か、文字であれば縦書きか或いは横書きか等）は判断できるようになっている。
【００１７】
以下、実施例では、領域分割処理を上記のようにして処理したものとして説明するが、領域分割処理はこれに限らず如何なる処理でも行ってよく、属性判定処理もこれに限るものではない。
【００１８】
＜イメージウインドウの説明（図２）＞
図２は実施例の文字認識装置１に入力された文書画像（メモリ５に格納されている）と領域分割結果及び領域の属性を示すイメージウインドウの一例を示す図である。
【００１９】
図２において、２１は入力部２で読取られた文書画像を示すイメージウインドウである。２２は領域分割の結果である枠（１つの領域のサイズを明示する枠）であり、ここではテキスト部分の段落の位置を示している。２３及び２４は領域の属性を示すブロックシンボルである。ブロックシンボル２３は右側を向いていることで、そのブロック枠内の文章は横書きとして認定されたことを示している。中の数字１は文字認識させる順序を示している。また、ブロックシンボル２４は該当する領域が縦書きであることを示している。
【００２０】
すなわち、図２は、本来、全ての領域が横書きとして認定すべきところ、いくつかのブロック枠内の文章が縦書きであると認定されてしまっている状態を示している。
【００２１】
かかる状況になった場合において、実施例では、誤判定された属性を、一括して訂正或いは指定するものである。
【００２２】
以下、実施例における文字認識処理を図３のフローチャートに基づいて説明する。尚、この処理を実行する制御プログラムはＲＯＭ４に記憶されている。
【００２３】
まずステップＳ３１で、操作者によりセットされた文書画像を入力部２（スキャナ）により読取って入力し、メモリ５に記憶する。
【００２４】
次にステップＳ３２に進み、領域分割を行う。領域分割によってテキストの段落、図、表、罫線などを取り出す。
【００２５】
次にステップＳ３３に進み、領域の属性を判定し、結果をメモリ５に記憶し、属性表示部１７で属性をブロックシンボルという形式で表示する。
【００２６】
次にステップＳ３４に進み、マウスなどを使って領域の属性を修正する。そして、ステップＳ３５に進み、修正された属性に従って、文字認識を行う。最後に、ステップＳ３６に進み、認識結果を表示する。
【００２７】
図４は、図３のステップＳ３４の対応イメージ表示の処理を示すフローチャートである。
【００２８】
まずステップＳ４１で、複数のブロック枠を選択する。実施例では、例えば、図５に示す様に、ポインティングデバイスを操作して、表示画面の上部に表示されたメニューバー中の“編集”のプルダウンメニューを表示させ、“すべてのブロック枠を選択”を指定することで、イメージウインドウ内の全てのブロック枠の属性が編集対象として設定できる。選択されたブロック枠は図６に示す様に４隅に灰色の四角からなる印が付いて選択されていないブロック枠と区別される。ここでは、すべてのブロック枠が選択されている。
【００２９】
次にステップＳ４２で、マウスでひとつのブロックシンボルをポインティングデバイスで指定（クリック）するとその変更に合わせて、選択されているすべての属性を一度に修正する。例えば、図６に示す様に認識順序３，５，６番目のブロック枠は組方向を誤って判定して縦書きになっているとすると、３，５，６番目のいずれかのブロックシンボルがクリックすることでブロックの属性、ここでは組方向を縦書きから横書きに変更する。その際に選択されたすべてのブロックワークの属性を横書きにする。つまり、ここでは６番目のブロック枠のブロックシンボルをクリックする事で、３，５番目の属性も一度に修正できる。他のブロック枠は、元々横書きであるとして認定されているわけであるから、何等変更は行われない。
【００３０】
以上の指定が行われると、ステップＳ４３では、変更されたすべてのブロック枠のブロックシンボルの表示を変更する。その結果が図７である。ここではすべてのブロックシンボルが右側を向いて、横書きである事を示している。
【００３１】
以上の結果、少ない操作でもって複数のブロック枠内の情報の属性を変更もしくは修正することが可能になる。
【００３２】
後は、メニューバー中の“認識”を指定することで、その時に選択されていた（四隅に四角の印がつけられた）ブロック枠のイメージ情報と共に、その属性情報が下位の文字認識処理に渡され、認識処理が行われることになる。文字認識処理は、その時点で選択されているブロック番号の小さいものから順番にブロック枠の内部にある文字イメージを受け、決定された方向（属性）に従って順次文字認識処理を行っていく。
【００３３】
尚、上記説明では、一度に全てのブロック枠内の属性を変更対象にする例を示したが、所望とするブロック枠を順次選択しても良い。例えば、マウス等を使用するのであれは、右ボタンを押下しながら（もしくはキーボードの所定のキーを押しながら）、所望とするブロック枠を左ボタンでクリックすることで、処理対象のブロック枠を追加的に選択する。このようにすることで、所望とするブロック枠のみ（複数のブロック枠）を選択することが可能になる。
【００３４】
また、上記説明では、ブロック枠内の属性として縦書き、横書きの２つの状態についてのみ説明した。すなわち、縦書きとして誤認定されたブロック枠のシンボルを単に指定することで、一義的にそれを横書きに修正した。しかしながら、属性の種別が３以上ある場合には、変更後の属性を一義的に指定することはできない。そこで、属性の候補が３以上ある場合には、変更しようとするブロック枠（複数）を選択した際に、どのような属性するのかを問い合わせるメニューウインドウを表示させ、そのメニューの中の１つもしくは複数を選択し、対応する属性に変更するようにしても良い。
【００３５】
例えば、原稿が文章（文字）、写真、表や図形等が混在している場合を考え、文字領域であると判定された場合のシンボルは上記のように５角形（縦・横向きがある）、写真の場合には円、表や図形の場合に三角形として表示されるとする。このとき、文字であるにもかかわらず、いくつかの領域が写真と認定されている場合にはそのブロック枠のシンボル（この場合には円）を指定したとき、属性として文字、表図形のいずれにするのかのメニューウインドウを表示し、“文字”であることを指定する。
【００３６】
また、特に、文字であるにもかかわらず、写真、及び表や図形と認定されている場合には、残りの属性は“文字”のみであるので、このときには単に変更対象のブロック枠の属性を指定した際に自動的に“文字”としての属性を与えることもできる。
【００３７】
属性としては、上記文字、写真、図形に限定されるものではない。例えば文字の場合には、先に説明したように、横書き、縦書きがあるが、その他としては日本語、英語等の言語の種別も含めることもできる。尚、属性のおおざっぱな判定処理は、如何なる手法を取り入れても良い。
【００３８】
また、認識結果は、テキストデータとして記憶保持することも、編集することも、さらには印刷することも可能になるのは言うまでもない。
【００３９】
以上説明したように本実施例によれば、原稿内に領域が細かく複数存在し、属性の修正が多い場合でも容易に修正ができる、ユーザにとって使い易い文字認識装置を実現することができる。
【００４０】
特に、一度に全てのブロック枠を選択することを可能にすることで、単一の属性を持った原稿（例えば横書きの文章のみの原稿）の場合には、非常に少ない操作でもって、全属性を一括して変更、修正或いは指定することが可能になる。
【００４１】
尚、表（罫線を含む）や図形等の属性を付されたブロック枠内のイメージ、或いは写真等の属性が付されたブロック内のイメージについての下位の処理としては、ファイルとして保存する際の圧縮処理の種別、変倍するときの変倍の種別等に適応できる。
【００４２】
＜第２の実施例＞
第１の実施例では、図４のステップＳ４２においてマウスを用いて属性を変更しているが、キーやメニューなどを使うなど、属性変更できれば何でも良い。以上の説明により第２の実施例においても前述の第１の実施例と同様の作用・効果を得ることができる。
【００４３】
＜第３の実施例＞
第１の実施例では、ブロックの属性を示すブロックシンボルを５角形等であるとして説明したがが、この形や位置、色、大きさ等何でも良い。但し、操作者から見て、それらが互いに区別可能になっていることが望ましい。
【００４４】
以上の説明により第３の実施例においても前述の第１の実施例と同様の作用・効果を得ることができる。
【００４５】
以上説明した様に本実施例によれば、原稿内に領域が細かく複数存在し、属性の修正が多い場合でも容易に修正ができる。このため、文書原稿を読取って認識し修正する際、使い易く文字認識の操作性が向上する効果がある。
【００４６】
また、実施例では、処理対象の原稿画像はイメージスキャナ等から入力するものとして説明したが、イメージを記憶した記憶媒体（例えばフロッピーディスク等）や、回線を介して受信した場合にも適応できるので、上記例によって本願発明が限定されるものではない。
【００４７】
また、更に、実施例では、単独の装置、或いは複数の機器から構成されるシステムに適応しても、装置或いはシステムに外部からプログラムを供給することによっても達成できるのは言うまでもないことでである。従って、本願発明を実現するプログラムを内蔵させた記憶媒体にも容易に適応することが可能になる。
【００４８】
【発明の効果】
以上説明した様に本発明によれば、入力した画像中の各領域の属性の指定を簡単な操作でもって設定できるようになる。
【００４９】
【図面の簡単な説明】
【図１】第１の実施例の文字認識装置の概略構成を示すブロック図である。
【図２】第１の実施例の文字認識装置における表示部及びメニューを示す図である。
【図３】第１の実施例の文字認識装置の処理を示すフローチャートである。
【図４】第１の実施例のステップＳ３４の詳細を示すフローチャートである。
【図５】第１の実施例の文字認識装置における表示部及びメニューを示す図である。
【図６】第１の実施例の文字認識装置における表示部及びメニューを示す図である。
【図７】第１の実施例の文字認識装置における表示部及びメニューを示す図である。
【符号の説明】
１文字認識装置
２入力部
３ＣＰＵ
４ＲＯＭ
５メモリ
６属性変更部
７前処理部
８認識部
９認識用辞書
１０表示部
１１システムバス
１２インタフェース部
１３プリンタ
１４イメージ表示部
１５認識結果表示部
１６ブロック枠表示部
１７属性表示部
２１イメージウインドウ
２２ブロック枠
２３，２４ブロックシンボル[0001]
[Industrial application fields]
The present invention relates to an image processing apparatus, and more particularly to an image processing apparatus and method for determining or correcting attributes of input image data and passing them to a lower-level process.
[0002]
[Prior art]
Generally, in a character recognition device, a character area in a document image input from an image scanner or the like is extracted, and whether or not the attribute for each area (judgment of vertical writing or horizontal writing) is incorrect, or the operator After the attribute for each area is specified, the character recognition process is performed.
[0003]
[Problems to be solved by the invention]
However, if one attribute is specified for each area, when there are many areas, the work becomes complicated and very troublesome.
[0004]
[Means for Solving the Problems]
The present invention has been made in view of such problems, and an object of the present invention is to provide an image processing apparatus and method capable of setting the designation of the attribute of each area in an image by a simple operation.
[0005]
In order to solve this problem, for example, an image processing apparatus of the present invention has the following configuration. That is,
Area dividing means for dividing the image into a plurality of areas;
Attribute determination means for determining an attribute for each of the divided areas;
Display means for controlling the display of the image and the symbol by associating the symbol indicating the determined attribute for each of the divided areas;
A selection means for selecting a plurality of areas in response to a selection instruction from the user from the displayed image;
When there is an instruction to change the attribute for the symbol associated with any of the selected plurality of areas, the attribute of the area associated with the symbol that has received the instruction to change the attribute is changed, Attribute changing means for changing a plurality of areas selected by the selecting means to the same attribute as the attribute of the area whose attribute has been changed.
[0006]
Further, according to a preferred embodiment of the present invention, a frame display means for displaying a frame for each region in the input image,
Attribute information display means for displaying attribute information for each area,
Preferably, the designation means designates a desired frame, and the attribute designation means designates an attribute by designating one attribute information among a plurality of alteration designations.
[0007]
As a result, attributes can be specified in visual frame units, and operations can be performed without having special knowledge.
[0008]
In this case, further, when the attribute is specified by the attribute specifying unit, the attribute information displayed in the plurality of frames specified by the specifying unit is updated according to the specified attribute. It is desirable to provide updating means. As a result, the changed attribute can be confirmed.
[0009]
The attribute information displayed by the attribute information display means is preferably a symbol pattern. As a result, the attribute can be displayed to the operator with a small display area, while the attribute is displayed with characters or the like.
[0010]
Further, it is preferable that the specifying means includes specifying all areas at once. As a result, when an image having a single attribute is input, all attributes can be specified at once.
[0011]
Moreover, it is desirable that the lower-order process is a character recognition process. This makes it possible to perform correct recognition processing or to perform efficient character recognition.
[0012]
In addition, it is desirable that the attribute includes type information of vertical writing and horizontal writing of a character image. As a result, the recognition order for character recognition is recognized as the reading direction of the character string.
[0013]
【Example】
Embodiments according to the present invention will be described below in detail with reference to the accompanying drawings.
[0014]
<Description of Character Recognition Device (FIG. 1)>
FIG. 1 is a block diagram illustrating a schematic configuration of a character recognition apparatus according to an embodiment. In FIG. 1, reference numeral 1 denotes a character recognition device, and 2 denotes an input unit for reading and inputting a document image, which is composed of, for example, a scanner and inputs document image data as digital data. Reference numeral 3 denotes a central processing unit (hereinafter referred to as a CPU) for arithmetic processing that controls the entire apparatus 1. Reference numeral 4 denotes a ROM, which stores the control program and various data shown in the flowchart of FIG. A memory (RAM) 5 is used as a work area of the CPU 3 and stores an area for storing document image data input from the input unit 2 and a position and size of a character pattern cut out by the preprocessing unit 7. This is also used for storing the area to be recognized, and the place where the recognition is performed in the image, that is, the block frame coordinate information. An attribute change unit 6 is used to select a region and change an attribute by clicking a block symbol (details will be described later) by an operation using a pointing device (not shown). A pre-processing unit 7 cuts out a one-character pattern of the document image stored in the memory 5. A recognition unit 8 extracts a geometric feature from the pattern preprocessed by the preprocessing unit 7 and performs character recognition of the document image by comparing with a standard pattern stored in the recognition dictionary 9 in advance. . A display unit 10 includes an image display unit 14 for displaying an input document image, a recognition result display unit 15, a block frame display unit 16, and an attribute display unit 17. A system bus 11 includes a data bus from the CPU 3, an address bus, a control signal bus, and the like. Reference numeral 12 denotes an interface unit that performs interface control with an external output device such as a printer 13.
[0015]
In the above configuration, a document image is input from the input unit 2 and stored in the memory 5. Then, a reduced image is generated on the memory 5 separately from the stored image data. The reduced image is, for example, an n × n pixel block of the original image as one reduced pixel, and when at least one black pixel exists in the n × n pixel block, the reduced pixel is determined as a black pixel. Is. When this process is performed, the result is the same as when the original image is 1 / n times in length and width, and the connected character images (dot images of character patterns) are connected to each other.
[0016]
The rectangle circumscribing the connected dot distribution is sequentially defined, and the region is divided. It should be noted that most attributes of the processing block, such as the size of the area obtained by the area division process and the ratio of black pixels included in the area (character or graphic, vertical writing or horizontal writing if character, etc.) Can be judged.
[0017]
In the following description, the region division process is described as being performed as described above. However, the region division process is not limited to this, and any process may be performed, and the attribute determination process is not limited to this.
[0018]
<Description of image window (FIG. 2)>
FIG. 2 is a diagram illustrating an example of an image window indicating a document image (stored in the memory 5) input to the character recognition device 1 of the embodiment, a region division result, and region attributes.
[0019]
In FIG. 2, reference numeral 21 denotes an image window showing a document image read by the input unit 2. Reference numeral 22 denotes a frame (a frame that clearly indicates the size of one area) as a result of area division, and shows the position of a paragraph in the text portion. Reference numerals 23 and 24 are block symbols indicating region attributes. The block symbol 23 faces the right side, indicating that the text in the block frame is recognized as horizontal writing. The number 1 in the middle indicates the order of character recognition. The block symbol 24 indicates that the corresponding area is vertically written.
[0020]
That is, FIG. 2 shows a state in which all the areas should be recognized as horizontal writing, but sentences in some block frames have been recognized as vertical writing.
[0021]
In such a situation, in the embodiment, misjudged attributes are corrected or designated collectively.
[0022]
Hereinafter, the character recognition processing in the embodiment will be described based on the flowchart of FIG. A control program for executing this process is stored in the ROM 4.
[0023]
First, in step S31, a document image set by the operator is read and input by the input unit 2 (scanner) and stored in the memory 5.
[0024]
Next, it progresses to step S32 and area division is performed. Extract text paragraphs, diagrams, tables, ruled lines, etc. by area division.
[0025]
In step S33, the attribute of the area is determined, the result is stored in the memory 5, and the attribute display unit 17 displays the attribute in the form of a block symbol.
[0026]
In step S34, the region attributes are corrected using a mouse or the like. In step S35, character recognition is performed according to the corrected attribute. Finally, the process proceeds to step S36, and the recognition result is displayed.
[0027]
FIG. 4 is a flowchart showing the corresponding image display processing in step S34 of FIG.
[0028]
First, in step S41, a plurality of block frames are selected. In the embodiment, for example, as shown in FIG. 5, the pointing device is operated to display the “Edit” pull-down menu in the menu bar displayed at the top of the display screen, and “Select all block frames”. By specifying, the attributes of all the block frames in the image window can be set as edit targets. As shown in FIG. 6, the selected block frame is marked with a gray square at the four corners to be distinguished from the unselected block frame. Here, all the block frames are selected.
[0029]
In step S42, when one block symbol is designated (clicked) with the pointing device with the mouse, all the selected attributes are corrected at once according to the change. For example, as shown in FIG. 6, if the third, fifth, and sixth block frames in the recognition order are erroneously determined in the set direction and are vertically written, any of the third, fifth, and sixth block symbols is Click to change the block attribute, here the group direction, from vertical writing to horizontal writing. All the blockwork attributes selected at that time are written horizontally. In other words, the third and fifth attributes can be corrected at once by clicking the block symbol in the sixth block frame. Since the other block frames are originally certified as horizontal writing, no changes are made.
[0030]
When the above designation is performed, in step S43, the display of the block symbols of all the changed block frames is changed. The result is shown in FIG. Here, all block symbols turn to the right, indicating that they are written horizontally.
[0031]
As a result, it is possible to change or modify the attributes of information in a plurality of block frames with few operations.
[0032]
After that, by specifying “Recognition” in the menu bar, the attribute information is used for lower-level character recognition processing along with the image information of the block frame selected at that time (with square marks at the four corners). Then, recognition processing is performed. In the character recognition process, the character images in the block frame are received in order from the smallest block number selected at that time, and the character recognition process is sequentially performed according to the determined direction (attribute).
[0033]
In the above description, the example in which the attributes in all the block frames are changed at a time is shown. However, the desired block frames may be sequentially selected. For example, if you use a mouse or the like, add the block frame to be processed by clicking the desired block frame with the left button while pressing the right button (or pressing a predetermined key on the keyboard) To choose. By doing so, it becomes possible to select only a desired block frame (a plurality of block frames).
[0034]
In the above description, only two states of vertical writing and horizontal writing have been described as attributes in the block frame. That is, by simply designating a block frame symbol that was erroneously recognized as vertical writing, it was uniquely corrected to horizontal writing. However, when there are three or more attribute types, the changed attribute cannot be uniquely specified. Therefore, when there are three or more candidate attributes, when a block frame (plural) to be changed is selected, a menu window for inquiring about the attribute is displayed, and one of the menus or A plurality may be selected and changed to the corresponding attribute.
[0035]
For example, considering a case where a manuscript is a mixture of text (characters), photographs, tables, figures, etc., the symbol when it is determined to be a character area is a pentagon (has portrait and landscape orientations) as described above, In the case of a photograph, it is assumed to be displayed as a circle in the case of a circle, a table or a figure. At this time, if some areas are recognized as photographs even though they are characters, when the symbol of the block frame (in this case, a circle) is specified, either a character or a table graphic will be used as an attribute. A menu window is displayed to specify whether it is “character”.
[0036]
In particular, even if it is a character, even if it is recognized as a photo, a table, or a figure, the remaining attribute is only “character”. At this time, simply change the attribute of the block frame to be changed. When specified, an attribute as "character" can be automatically given.
[0037]
The attributes are not limited to the above characters, photos, and figures. For example, in the case of characters, there are horizontal writing and vertical writing as described above, but other types such as Japanese and English can also be included. Note that any method may be adopted for the rough determination process of the attribute.
[0038]
Needless to say, the recognition result can be stored and held as text data, edited, and printed.
[0039]
As described above, according to the present embodiment, it is possible to realize a character recognition device that is easy to use for a user, in which a plurality of regions exist in a document and correction can be easily performed even when there are many attribute corrections.
[0040]
In particular, by making it possible to select all block frames at once, a document with a single attribute (for example, a document with only horizontal writing) can be selected with a very small number of operations. Can be changed, modified or specified in a batch.
[0041]
In addition, as a low-level process for images in block frames with attributes such as tables (including ruled lines) and graphics, or images in blocks with attributes such as photographs, the processing at the time of saving as a file This can be applied to the type of compression processing, the type of scaling when scaling.
[0042]
<Second embodiment>
In the first embodiment, the attribute is changed by using the mouse in step S42 in FIG. 4, but anything can be used as long as the attribute can be changed, such as using a key or a menu. As described above, also in the second embodiment, the same operations and effects as those in the first embodiment can be obtained.
[0043]
<Third embodiment>
In the first embodiment, the block symbol indicating the attribute of the block has been described as being a pentagon or the like, but any shape, position, color, size, etc. may be used. However, it is desirable that they can be distinguished from each other as viewed from the operator.
[0044]
As described above, also in the third embodiment, the same operations and effects as those in the first embodiment can be obtained.
[0045]
As described above, according to the present embodiment, even when there are a plurality of fine areas in the document and there are many attribute corrections, the correction can be easily performed. For this reason, when the document original is read, recognized and corrected, there is an effect that it is easy to use and the operability of character recognition is improved.
[0046]
In the embodiments, the document image to be processed is described as being input from an image scanner or the like. However, the present invention can also be applied to a storage medium (for example, a floppy disk) storing an image or a case where the image is received via a line. The present invention is not limited by the above example.
[0047]
Furthermore, in the embodiment, it goes without saying that it can be achieved by supplying a program to the apparatus or system from the outside even if it is adapted to a system composed of a single apparatus or a plurality of devices. . Therefore, it is possible to easily adapt to a storage medium in which a program for realizing the present invention is incorporated.
[0048]
【The invention's effect】
As described above, according to the present invention, the designation of attributes of each area in the input image can be set by a simple operation.
[0049]
[Brief description of the drawings]
FIG. 1 is a block diagram showing a schematic configuration of a character recognition apparatus according to a first embodiment.
FIG. 2 is a diagram illustrating a display unit and a menu in the character recognition device according to the first embodiment.
FIG. 3 is a flowchart showing processing of the character recognition device according to the first embodiment;
FIG. 4 is a flowchart showing details of step S34 in the first embodiment;
FIG. 5 is a diagram illustrating a display unit and a menu in the character recognition device according to the first embodiment.
FIG. 6 is a diagram illustrating a display unit and a menu in the character recognition device according to the first embodiment.
FIG. 7 is a diagram illustrating a display unit and a menu in the character recognition device according to the first embodiment.
[Explanation of symbols]
1 Character recognition device 2 Input unit 3 CPU
4 ROM
DESCRIPTION OF SYMBOLS 5 Memory 6 Attribute change part 7 Preprocessing part 8 Recognition part 9 Recognition dictionary 10 Display part 11 System bus 12 Interface part 13 Printer 14 Image display part 15 Recognition result display part 16 Block frame display part 17 Attribute display part 21 Image window 22 Block frame 23, 24 Block symbol

Claims

Area dividing means for dividing the image into a plurality of areas;
Attribute determination means for determining an attribute for each of the divided areas;
Display means for controlling the display of the image and the symbol by associating the symbol indicating the determined attribute for each of the divided areas;
A selection means for selecting a plurality of areas in response to a selection instruction from the user from the displayed image;
When there is an instruction to change the attribute for the symbol associated with any of the selected plurality of areas, the attribute of the area associated with the symbol that has received the instruction to change the attribute is changed, An image processing apparatus comprising: attribute changing means for changing a plurality of areas selected by the selecting means to the same attributes as the attributes of the changed areas.

The display means controls to display a frame indicating the range of the divided area, and further, when the area is selected by the selection means, the selected area can be identified. The image processing apparatus according to claim 1, wherein control is performed so that a display form of a frame indicating a range of the selected area is changed and displayed.

The image processing apparatus according to claim 1, wherein the selection unit can select all regions at once.

The image processing apparatus according to claim 1, wherein each region of the image and the attribute changed by the attribute changing unit are passed to a character recognition process.

The image processing apparatus according to claim 1, wherein the attribute includes information indicating a type of whether the character image is vertical writing or horizontal writing.

The image processing apparatus according to claim 1, wherein the attribute includes information indicating a type of a character or a graphic.

The attribute changing means controls to display a menu of attribute options when the user gives an instruction to change the attribute for a symbol associated with any of the plurality of areas selected by the selecting means. The user selects a desired attribute from the menu, changes the attribute of the area associated with the symbol for which the attribute change instruction has been received, and changes the plurality of areas selected by the selection unit to the menu-selected attribute. The image processing apparatus according to claim 1, wherein the attribute is changed to the same attribute as the attribute of the changed area.

A region dividing step for dividing the image into a plurality of regions;
An attribute determination step for determining an attribute for each of the divided areas;
A display step of controlling to display the image and the symbol by associating a symbol indicating the determined attribute for each of the divided areas;
A selection step of selecting a plurality of regions in accordance with a selection instruction from the user from among the displayed images;
When there is an instruction to change the attribute for the symbol associated with any of the selected plurality of areas, the attribute of the area associated with the symbol that has received the instruction to change the attribute is changed, An image processing method comprising: an attribute change step of changing a plurality of areas selected in the selection step to the same attribute as the attribute of the attribute-changed area.

In the display step, control is performed so that a frame indicating the range of the divided area is displayed, and when the area is selected in the selection step, the selected area is identified so that the selected area can be identified. 9. The image processing method according to claim 8, wherein control is performed such that the display form of a frame indicating the range of the selected area is changed and displayed.

The image processing method according to claim 8, wherein in the selection step, it is possible to select all regions at once.

9. The image processing method according to claim 8, wherein each region of the image and the attribute changed in the attribute changing step are passed to a character recognition process.

The image processing method according to claim 8, wherein the attribute includes information indicating a type of whether the character image is vertical writing or horizontal writing.

The image processing method according to claim 8, wherein the attribute includes information indicating a type of character or graphic.

In the attribute change step, control is performed to display a menu of attribute options when an instruction to change the attribute is given from the user to a symbol associated with any of the plurality of areas selected in the selection step. The user selects a desired attribute from the menu, changes the attribute of the area associated with the symbol for which the attribute change instruction has been received, and changes the plurality of areas selected in the selection step to the menu-selected attribute. The image processing method according to claim 8, wherein the attribute is changed to the same attribute as the attribute of the changed area.