JPH08221515A

JPH08221515A - Image processor

Info

Publication number: JPH08221515A
Application number: JP7030612A
Authority: JP
Inventors: Toshio Miyazawa; 利夫宮澤; Toshifumi Yamaai; 敏文山合
Original assignee: Ricoh Co Ltd
Current assignee: Ricoh Co Ltd
Priority date: 1995-02-20
Filing date: 1995-02-20
Publication date: 1996-08-30

Abstract

PURPOSE: To improve the character recognition rate by highly accurately and stably binarizing the character area. CONSTITUTION: An area identifying part 104 roughly identifies the character/ table area and the other areas on an input document. A threshold value control part 120 decides the binarization threshold value based on a density histogram at the entire part or one part of the character/table area. While using the binarization threshold value set by the threshold value control part 120, a multilevel/binary converting part 108 converts the multilevel expression image data in the character/table area on the input document to a binary expression by performing a simply binarizing processing. The binarization threshold value is fixed in the same area, at least. The multilevel expression image data in the other areas are converted into the binary expressions by bither processing by a multilevel/binary converting part 110.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、多値表現の画像データ
を扱う画像処理装置に係り、特に、多値表現の画像デー
タを２値表現の画像データに変換するための画像処理装
置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to an image processing apparatus for handling multi-valued image data, and more particularly to an image processing apparatus for converting multi-valued image data into binary image data.

【０００２】[0002]

【従来の技術】近年、文字や写真等の属性の異なる領域
が混在した文書等の原稿をスキャナーで読み取って得ら
れるような多値表現の画像データを扱う分野が拡大して
いる。特に画像認識、文字認識、画像通信、画像蓄積等
々の様々な分野において、そのような多値表現の画像デ
ータを２値表現の画像データへ変換する処理がしばしば
必要とされ、そのための処理手法が多々提案されてい
る。2. Description of the Related Art In recent years, the field of handling multi-valued image data obtained by scanning a document such as a document in which areas having different attributes such as characters and photographs are mixed is expanding. Particularly in various fields such as image recognition, character recognition, image communication, image storage, etc., a process of converting such multivalued image data into binary image data is often required, and a processing method therefor is required. Many have been proposed.

【０００３】例えば４×４画素、３×３画素といった小
さなブロック毎に、２値画像もしくは文字画像である
か、または濃淡画像もしくは絵柄であるかの属性判別を
行ない、２値画像もしくは文字画像であると判別したブ
ロックについては単一閾値による単純２値化処理を適用
し、濃淡画像もしくは絵柄であると判別したブロックに
ついてはディザ処理を適用する処理手法が知られている
（鉄谷信二ほか，”２値画像と濃淡画像の混在する原稿
の２値化処理法”，信学論Ｖol．Ｊ６７−ＢＮo.７，p
p.７８１−７８８、及び、大内敏ほか，”文字／網点／
写真混在画像の像域分離方法”，信学技報ＩＥ９０−３
２）。For example, for each small block such as 4 × 4 pixels or 3 × 3 pixels, the attribute determination is performed to determine whether the image is a binary image or a character image, or a grayscale image or a pattern. A processing method is known in which simple binarization processing with a single threshold is applied to blocks that are determined to be present, and dither processing is applied to blocks that are determined to be grayscale images or patterns (Shinji Tetsuya et al. "Binarization processing method for manuscripts in which binary images and grayscale images coexist", SIJ Vol.J67-BNo.7, p.
p.781-788 and Satoshi Ouchi et al., "Character / Dot /
Image Area Separation Method for Mixed Photo Image ", IEICE Technical Report IE90-3
2).

【０００４】また、多値表現画像の構造情報である尾根
点と谷点を検出し、尾根点と谷点の２値化エラーが少な
くなるように走査ライン毎に２値化閾値を調整する方法
が知られている（大内敏ほか，”尾根点・谷点方式によ
る文書画像の前処理付入力装置”，信学技報ＥＣ８２−
５）。Further, a method of detecting a ridge point and a valley point, which are structural information of a multivalued image, and adjusting a binarization threshold value for each scanning line so that a binarization error of the ridge point and the valley point is reduced. Is known (Satoshi Ouchi et al., "Input device with document image preprocessing by ridge point / valley method", IEICE Tech.
5).

【０００５】[0005]

【発明が解決しようとする課題】しかし、上述の小さな
ブロック単位に２値化処理方法を切り替える手法では、
原稿上の同じ属性の領域の内部で２値化処理方法が切り
替わるという不都合がしばしば起こる。２値化処理後の
画像データを文字認識に用いる場合、原稿上の文字領域
の内部で部分的とはいっても不適切な２値化処理方法が
選択されると、その部分での誤認識の発生に直結するた
め、極めて重大な問題である。However, in the above-mentioned method of switching the binarization processing method in small block units,
The inconvenience that the binarization processing method is switched within the area of the same attribute on the document often occurs. When the image data after the binarization processing is used for character recognition, if an improper binarization processing method is selected within the character area on the original, it may cause erroneous recognition at that portion. It is a very serious problem because it is directly related to the occurrence.

【０００６】上述の走査ライン単位で２値化閾値を適応
的に制御する方法にも、同一の領域内で２値化閾値が変
動するため、同様の問題が起こりがちである。さらに、
２値化閾値の決定に際して領域を考慮しないので、文字
領域や写真領域等が混在している場合には、文字領域を
最適な閾値で２値化できるとは限らないという問題もあ
る。In the above-mentioned method of adaptively controlling the binarization threshold value for each scanning line, the binarization threshold value fluctuates within the same region, and therefore the same problem tends to occur. further,
Since the region is not taken into consideration when determining the binarization threshold value, there is a problem that the character region cannot be binarized with the optimum threshold value when the character region, the photo region, and the like are mixed.

【０００７】本発明は、多値表現画像データの２値表現
への変換処理に関する上述の如き問題点を解決した画像
処理装置を提供しようとするものであり、特に、文字認
識装置の前処理装置として利用するのに好適で、文字認
識率の向上に寄与できる画像処理装置を提供することを
目的とするものである。The present invention is intended to provide an image processing apparatus which solves the above-mentioned problems relating to the conversion processing of multi-valued expression image data into binary expression, and in particular, a pre-processing apparatus for a character recognition apparatus. It is an object of the present invention to provide an image processing device that is suitable for use as a computer and can contribute to an improvement in the character recognition rate.

【０００８】[0008]

【課題を解決するための手段】上記目的を達成するた
め、請求項１ないし請求項５に記載の発明による画像処
理装置は次のような特徴を有するものである。In order to achieve the above-mentioned object, the image processing apparatus according to the invention described in claims 1 to 5 has the following features.

【０００９】請求項１記載の発明の特徴は、原稿の多値
表現画像データより該原稿上の文字が書かれている領域
とその他の領域とを大局的に識別する領域識別手段と、
該領域識別手段により識別された文字が書かれている領
域の多値表現画像データを、一定の処理方法によって２
値表現画像データへ変換する多値−２値変換手段とを有
することである。According to a first aspect of the present invention, there is provided area identifying means for globally identifying an area in which characters are written on the original and other areas from multivalued image data of the original.
The multi-valued representation image data of the area in which the character identified by the area identification means is written is converted into 2 by a certain processing method.
And multi-value / binary conversion means for converting the image data into the value-represented image data.

【００１０】請求項２記載の発明の特徴は、請求項１記
載の発明の構成において、多値−２値変換手段が、一定
の２値化閾値による単純２値化処理によって多値表現画
像データを２値表現画像データへ変換する手段であるこ
とである。According to a second aspect of the present invention, in the configuration of the first aspect of the invention, the multi-value / two-value conversion means performs multi-value representation image data by simple binarization processing with a constant binarization threshold value. Is a means for converting the image data into binary representation image data.

【００１１】請求項３記載の発明の特徴は、原稿の多値
表現画像データより該原稿上の文字が書かれている領域
とその他の領域とを大局的に識別する領域識別手段と、
該領域識別手段により識別された文字が書かれている領
域の多値表現画像データを単純２値化処理により２値表
現画像データへ変換する多値−２値変換手段と、該領域
識別手段によって識別された文字が書かれている領域の
多値表現画像データから濃度ヒストグラムを求め、該濃
度ヒストグラムに基づいて該多値−２値変換手段のため
の２値化閾値を決定する手段とを有し、該２値化閾値は
少なくとも同一領域内では固定されることである。According to a third aspect of the present invention, there is provided area identifying means for globally identifying an area in which characters are written on the original and other areas from multivalued image data of the original.
The multi-valued binary conversion means for converting the multi-valued expression image data of the area in which the character identified by the area identification means is written into the binary expression image data by the simple binarization processing, and the area identification means. A density histogram is obtained from the multivalued image data of the region in which the identified character is written, and means for determining a binarization threshold value for the multivalue-binary conversion means is provided based on the density histogram. However, the binarization threshold is fixed at least in the same region.

【００１２】請求項４記載の発明の特徴は、原稿の多値
表現画像データより該原稿上の文字が書かれている領域
とその他の領域とを大局的に識別する領域識別手段と、
該領域識別手段により識別された文字が書かれている領
域の多値表現画像データを、単純２値化処理によって２
値表現画像データへ変換する多値−２値変換手段と、該
領域識別手段によって識別された文字が書かれている領
域の中より一部の領域を選択し、該選択した領域の多値
表現画像データより濃度ヒストグラムを求め、該濃度ヒ
ストグラムに基づいて該多値−２値変換手段のための２
値化閾値を決定する手段とを有し、該２値化閾値は少な
くとも同一領域内では固定されることである。According to a fourth aspect of the present invention, there is provided area identifying means for globally identifying an area in which characters are written on the original and other areas from multivalued image data of the original.
The multi-valued representation image data of the area in which the character identified by the area identification means is written is converted into 2 by simple binarization processing.
A multivalue-binary conversion means for converting into value expression image data, and a part of the area in which the character identified by the area identification means is written is selected, and the multivalued representation of the selected area A density histogram is obtained from the image data, and based on the density histogram, 2 for the multi-value / binary conversion means is obtained.
Means for determining a binarization threshold value, and the binarization threshold value is fixed at least in the same region.

【００１３】請求項５記載の発明の特徴は、請求項３ま
たは４記載の発明の構成において、閾値を決定する手段
が領域領域の内部の白または黒ベタ部を除外して濃度ヒ
ストグラムを求めることである。According to a fifth aspect of the present invention, in the structure of the third or fourth aspect of the invention, the means for determining the threshold value obtains the density histogram by excluding the solid white or black solid portion inside the area region. Is.

【００１４】[0014]

【作用】請求項１記載の発明によれば、原稿上の文字が
書かれている領域（後記実施例に関連して詳述するよう
に、文字のみからなる”文字領域”と罫線を含む”表領
域”とがある）が大局的に識別され、その多値表現画像
データは一定の処理方法によって２値表現へ変換され
る。また、請求項２記載の発明によれば、文字が書かれ
た領域の多値表現画像データは、一定の単純２値化処理
によって２値表現画像データへ変換される。したがっ
て、前記した従来の局所的方法や走査線毎に２値化閾値
を制御する方法のように、文字が書かれた領域の内部で
処理方法が切り替わったり２値化閾値が変動することが
なく、安定した２値表現画像データを得ることができ
る。According to the first aspect of the present invention, the area on the manuscript in which the characters are written (as will be described in detail in connection with the embodiments described later, "character area" including only characters and ruled lines ") A table area ") is globally identified, and the multivalued representation image data is converted into a binary representation by a certain processing method. According to the second aspect of the present invention, the multi-valued representation image data of the area in which the character is written is converted into the binary representation image data by a certain simple binarization process. Therefore, unlike the conventional local method or the method of controlling the binarization threshold value for each scanning line, the processing method does not change or the binarization threshold value does not change within the area where the character is written. Therefore, stable binary representation image data can be obtained.

【００１５】請求項３記載の発明によれば、文字が書か
れた領域の多値表現−２値表現変換は単純２値化処理に
よってなされるとともに、その２値化閾値は文字が書か
れた領域の濃度ヒストグラムに基づいて最適化され、そ
して、少なくとも同一の領域内では２値化閾値は固定さ
れる。したがって、文字が書かれた領域について、極め
て高精度かつ安定な２値化が可能である。２値化閾値と
領域との対応については、後記実施例に関連して述べる
ように、各領域毎に、その濃度ヒストグラムに基づいて
２値化閾値を決定する態様や、全領域の領域の濃度ヒス
トグラムに基づいて全領域に対する２値化閾値を決定す
る態様が含まれる。According to the third aspect of the invention, the multivalued representation / binary representation conversion of the area in which the character is written is performed by a simple binarization process, and the binarization threshold value is the character written. It is optimized based on the density histogram of the area, and the binarization threshold is fixed at least in the same area. Therefore, it is possible to perform highly accurate and stable binarization on a region in which characters are written. Regarding the correspondence between the binarization threshold value and the area, as will be described in connection with the embodiment described later, a mode in which the binarization threshold value is determined based on the density histogram of each area, and the density of all the area areas A mode in which the binarization threshold value for the entire region is determined based on the histogram is included.

【００１６】請求項４記載の発明によれば、２値化閾値
を決定するために、文字が書かれた領域の中より選択さ
れた一部の領域の濃度ヒストグラムのみを利用するの
で、２値化閾値決定のための処理量が少なくてすみ、処
理を高速化できる。また、後記実施例に関連して説明す
るように、文字数が多いと推定される一つまたは所定数
の領域を選択すれば、利用する領域数が少なくとも、２
値化閾値を十分に最適化し高精度の２値化が可能であ
る。According to the fourth aspect of the present invention, in order to determine the binarization threshold value, only the density histogram of a partial area selected from the area in which the character is written is used. The amount of processing for determining the activation threshold can be small, and the processing can be speeded up. Further, as will be described in connection with the embodiment described later, if one area or a predetermined number of areas estimated to have a large number of characters is selected, the number of areas to be used is at least 2.
It is possible to optimize the binarization threshold sufficiently and perform highly accurate binarization.

【００１７】文字が書かれている領域の内部に面積の大
きな白または黒ベタ領域が存在すると、濃度ヒストグラ
ムが本来の分布からずれる可能性がある。しかし、請求
項５記載の発明によれば、文字が書かれている領域内部
の白または黒ベタ部を除外して濃度ヒストグラムを求
め、それに基づいて２値化閾値を決定するので、文字が
書かれている領域内に白または黒ベタ部が存在しても、
最適な２値化閾値により高精度な２値化が可能となる。If a large white or black solid area exists inside the area in which the characters are written, the density histogram may deviate from the original distribution. However, according to the fifth aspect of the invention, the density histogram is obtained by excluding the white or black solid portion inside the area where the character is written, and the binarization threshold value is determined based on the density histogram. Even if there are white or black solid areas in the area,
Highly accurate binarization is possible with the optimal binarization threshold.

【００１８】[0018]

【実施例】以下、本発明の実施例について添付図面を用
いて説明する。＜実施例１＞図１は本実施例による画像処理装置の概略
ブロック図である。図１において、図示されないスキャ
ナー等から文書等の原稿の多値表現の画像データ１００
が入力され、この多値表現画像データ１００は画像メモ
リ１０２に格納される。Embodiments of the present invention will be described below with reference to the accompanying drawings. <Embodiment 1> FIG. 1 is a schematic block diagram of an image processing apparatus according to this embodiment. In FIG. 1, image data 100 of a multivalued representation of a document such as a document is read from a scanner (not shown) or the like.
Is input, and the multivalued representation image data 100 is stored in the image memory 102.

【００１９】領域識別部１０４は、画像メモリ１０２内
の多値表現画像データを読み込み、原稿上の属性の違う
各領域を識別して矩形の領域として切り出す。図２は、
この領域識別の説明図である。図示の文書２００の多値
表現画像データが入力された場合、同文書上の文字領域
２０２〜２０５と、表領域２０６と、その他領域（囲み
枠２０７，水平セパレータ２０８，垂直セパレータ２０
９，図２１０、さらには写真や網点画像領域）とが識別
され、それぞれが矩形の領域として切り出される。The area identification unit 104 reads the multi-valued expression image data in the image memory 102, identifies each area having different attributes on the document, and cuts it out as a rectangular area. Figure 2
It is explanatory drawing of this area | region identification. When the multi-valued representation image data of the illustrated document 200 is input, the character areas 202 to 205, the table area 206, and other areas (enclosing frame 207, horizontal separator 208, vertical separator 20) on the document are input.
9, FIG. 210, and further, a photograph and a halftone dot image area) are identified and each is cut out as a rectangular area.

【００２０】この例から理解されるように、領域識別部
１０４は大局的な領域識別、つまり、原稿上の文字行ま
たは複数の文字行列の集まった部分（コラム）を囲む矩
形領域や、縦横罫線からなる表を囲む矩形領域といっ
た、ある程度の大きさを持った領域を識別し切り出すも
ので、前記従来技術にみられるような小さなブロックを
単位にして属性を識別するというものではない。なお、
文字の書かれている文字領域と表領域は文字認識の対象
となる領域であるが、その区別を必要としないときに
は、文字領域と表領域とを同一属性の領域として切り出
してもよい。As can be understood from this example, the area identification unit 104 performs global area identification, that is, a rectangular area surrounding a portion (column) of a character line or a plurality of character matrices on a document, and vertical and horizontal ruled lines. It identifies and cuts out a region having a certain size, such as a rectangular region surrounding a table consisting of, and does not identify the attribute in units of small blocks as in the prior art. In addition,
The character area in which the characters are written and the table area are areas for character recognition, but when it is not necessary to distinguish them, the character area and the table area may be cut out as areas having the same attribute.

【００２１】このような大局的な領域識別は様々な方法
によって行なうことができるが、例えば本出願人の特願
平５−１５９１９０号または特願平５−３３１２５２号
の方法を用いることができる。両特許出願の方法の詳細
はその明細書及び図面を参照されたいが、その概略は次
のとおりである。まず、入力画像データは多値表現であ
るので、準備として適当な閾値による単純２値化によっ
て２値表現画像データに変換する。次に２値表現の画像
データを適当な倍率で圧縮し、得られた圧縮画像データ
より黒画素連結成分に外接する矩形を抽出し、得られた
矩形を文字の矩形、その他の矩形に分類する。そして、
文字の矩形を統合することによって文字領域を抽出す
る。その他の矩形についても必要に応じ分類と統合を行
なって、その領域を抽出する。この処理中の２値表現画
像データや圧縮画像データの記憶領域として、画像メモ
リ１０２を使用してもよい。Such global area identification can be performed by various methods, and for example, the method of the applicant's Japanese Patent Application No. 5-159190 or Japanese Patent Application No. 5-331252 can be used. The details of the methods of both patent applications should be referred to the specification and drawings, and the outline thereof is as follows. First, since the input image data has a multi-valued representation, as preparation, it is converted into binary representation image data by simple binarization with an appropriate threshold value. Next, the binary-represented image data is compressed with an appropriate magnification, a rectangle circumscribing the black pixel connected component is extracted from the obtained compressed image data, and the obtained rectangle is classified into a character rectangle and other rectangles. . And
A character area is extracted by integrating the character rectangles. The other rectangles are also classified and integrated as necessary to extract the area. The image memory 102 may be used as a storage area for the binary representation image data and the compressed image data during this processing.

【００２２】また、その他領域の中で、網点画像または
写真の領域を積極的に識別するようにしてもよい。例え
ば、前記の”文字／網点／写真混在画像の像域分離方
法”と同様に、ブロック単位で網点／写真の判定を行な
い、網点または写真のブロックを統合することによっ
て、それぞれの領域を抽出可能である。Further, among the other areas, the area of the halftone dot image or the photograph may be positively identified. For example, similar to the above-mentioned “image area separation method of mixed image of characters / dots / photos”, determination of dots / photos on a block-by-block basis and integration of dots or blocks of photo to obtain respective areas Can be extracted.

【００２３】このような領域識別の結果は、装置各部を
制御する制御部１０６へ渡される。制御部１０６は、領
域識別処理が済むと、画像メモリ１０２より多値表現画
像データを順次読み出して３種類の多値−２値変換部１
０８，１０９，１１０に与えるが、この際に、そのデー
タの所属領域の種類に応じて３種類の多値−２値変換部
１０８，１０９，１１０の一つを選択し有効にする。す
なわち、文字領域内の画素のデータが読み出された時に
は多値−２値変換部１０８が、表領域内の画素のデータ
が読み出された時には多値−２値変換部１０９が、その
他領域内の画素のデータが読み出された時には多値−２
値変換部１１０が、それぞれ有効になる。３種類の多値
−２値変換部１０８，１０９，１１０はそれぞれ多値表
現を２値表現に変換するものであるが、文字領域用の多
値−２値変換部１０８と表領域用の多値−２値変換部１
０９はそれぞれ、予め指定された一つの閾値による単純
２値化処理によって多値表現画像データを２値表現画像
データに変換し、その他領域用の多値−２値変換部１１
０は例えばディザ処理によって多値表現から２値表現へ
の変換を行なう。The result of such area identification is passed to the control section 106 which controls each section of the apparatus. When the area identification processing is completed, the control unit 106 sequentially reads the multi-valued expression image data from the image memory 102 and outputs three types of multi-valued / two-value conversion units 1.
08, 109, 110. At this time, one of the three types of multi-value / binary conversion units 108, 109, 110 is selected and validated according to the type of the region to which the data belongs. That is, when the pixel data in the character area is read, the multi-value / binary conversion unit 108 is read, and when the pixel data in the table area is read, the multi-value / binary conversion unit 109 is used, and the other area is read. When the data of the pixel inside is read out, it is multi-valued-2
The value converters 110 are enabled respectively. The three types of multi-value / binary conversion units 108, 109, and 110 respectively convert multi-valued expressions into binary expressions. Value-to-binary converter 1
Reference numerals 09 respectively convert the multivalued representation image data into the binary representation image data by a simple binarization process using one threshold value designated in advance, and the multivalued-to-binary conversion unit 11 for other areas.
A value of 0 performs conversion from multi-valued representation to binary representation by, for example, dither processing.

【００２４】なお、その他領域を、図・セパレータ領域
と網点／写真領域とに分類した場合、多値−２値変換部
１１０において、網点／写真領域のデータをディザ処理
によって変換し、図・セパレータ領域のデータを単純２
値化処理によって変換することもできる。また、本装置
の用途によっては、その他領域の多値表現画像データを
そのまま出力させてもよい。When the other areas are classified into a figure / separator area and a halftone / photo area, the multi-value / binary conversion unit 110 converts the data of the halftone / photo area by dithering, -Simple data for separator area 2
It can also be converted by a digitization process. Further, depending on the use of this device, the multi-valued representation image data of other areas may be output as it is.

【００２５】以上のように、大局的な領域識別によって
切り出された原稿上の各領域は、それぞれ全体が同じ処
理方法によって２値表現へ変換されるため、従来の局所
的方法におけるような同じ領域内で処理方法が切り替わ
るということがなく、文字領域及び表領域について、安
定した２値表現画像データを得られる。したがって、本
装置を文字認識装置の前処理装置として用い、多値−２
値変換部１０８，１０９により得られた２値表現画像デ
ータを文字認識装置に入力すれば、２値化不良による誤
認識を減らし認識率の向上を期待できる。As described above, since the respective areas on the original cut out by the global area identification are converted into the binary representation by the same processing method, the same areas as in the conventional local method are used. It is possible to obtain stable binary representation image data for the character area and the table area without switching the processing method inside. Therefore, this device is used as a pre-processing device for a character recognition device and multivalued-2
By inputting the binary representation image data obtained by the value conversion units 108 and 109 to the character recognition device, it is possible to reduce the erroneous recognition due to the binarization failure and expect to improve the recognition rate.

【００２６】＜実施例２＞図３は本実施例による画像処
理装置の概略ブロック図である。図３において、図１と
同一の符号は同一部または対応部を示すので、その説明
を省略する。図１に示した前記実施例１との相違点は、
文字認識の対象となる文字領域及び表領域のデータの２
値表現への変換を、共通の多値−２値変換部１１１にお
いて単純２値化処理によって行なうようにした点と、こ
の多値−２値変換部１１１で用いられる２値化閾値を閾
値制御部１２０によって制御するようにした点である。<Second Embodiment> FIG. 3 is a schematic block diagram of an image processing apparatus according to the present embodiment. In FIG. 3, the same reference numerals as those in FIG. 1 indicate the same parts or corresponding parts, and the description thereof will be omitted. The difference from the first embodiment shown in FIG.
2 of the data of the character area and the table area that are the target of character recognition
The common multi-value / two-value conversion unit 111 performs the conversion into the value expression by a simple binarization process, and the binarization threshold value used in the multi-value / two-value conversion unit 111 is threshold-controlled. The point is that the control is performed by the unit 120.

【００２７】本装置は、文字／表領域データの２値化閾
値の制御に関し次に述べるモード〜を有し、利用者
が任意に一つのモードを指定できる。This device has the following modes for controlling the binarization threshold of character / table area data, and the user can arbitrarily specify one mode.

【００２８】モード：文字及び表の各領域毎に、当該
領域の濃度ヒストグラムに基づいて閾値を求める。この
モードにおいて、文字領域の濃度ヒストグラムを求める
際に、領域内の白または黒ベタ領域を除外するかしない
かを指定することができるが、これは文字領域の濃度ヒ
ストグラムを用いて閾値を設定する他のモードでも
同様である。Mode: For each area of the character and the table, a threshold value is obtained based on the density histogram of the area. In this mode, when obtaining the density histogram of the character area, you can specify whether to exclude the white or black solid area in the area, but this sets the threshold value using the density histogram of the character area. The same applies to other modes.

【００２９】モード：全ての文字領域の濃度ヒストグ
ラムに基づいて閾値を求め、この閾値を全ての文字領域
及び表領域に適用する。Mode: A threshold is obtained based on the density histograms of all character areas, and this threshold is applied to all character areas and table areas.

【００３０】モード：一つの文字領域を選択し、この
文字領域の濃度ヒストグラムに基づいて閾値を求め、こ
の閾値を全ての文字領域及び表領域に適用する。このモ
ードにおいて、文字領域の選択方法として、面積が最大
の文字領域を選択させる方法、行数が最大の文字領域を
選択させる方法、文字数が最大の文字領域を選択させる
方法を指定できる。Mode: One character area is selected, a threshold value is obtained based on the density histogram of this character area, and this threshold value is applied to all character areas and table areas. In this mode, as a character area selection method, a method of selecting a character area with the largest area, a method of selecting a character area with the largest number of lines, and a method of selecting a character area with the largest number of characters can be specified.

【００３１】モード：全ての表領域の濃度ヒストグラ
ムに基づいて閾値を求め、この閾値を全ての文字領域及
び表領域に適用する。Mode: A threshold value is obtained based on the density histograms of all table areas, and this threshold value is applied to all character areas and table areas.

【００３２】モード：面積が最大の表領域を選択し、
その濃度ヒストグラムに基づいて閾値を求め、この閾値
を全ての文字領域及び表領域に適用する。Mode: Select the table area with the largest area,
A threshold value is obtained based on the density histogram, and this threshold value is applied to all character areas and table areas.

【００３３】図４は閾値制御部１２０のブロック図であ
る。図４において、１２２は文字領域内の白または黒ベ
タ部の除去を行なうベタ部除去部、１２３は文字領域ま
たは表領域の濃度ヒストグラムを求めるヒストグラム作
成部、１２４は作成された濃度ヒストグラムに基づいて
閾値を算出する閾値計算部、１２５は算出された閾値を
多値−２値変換部１１１に対して設定する閾値設定部で
ある。以下、前記各モードの閾値制御動作を説明する。FIG. 4 is a block diagram of the threshold controller 120. In FIG. 4, 122 is a solid area removing unit for removing solid white or black areas in a character area, 123 is a histogram creating section for obtaining a density histogram of a character area or a surface area, and 124 is based on the created density histogram. A threshold value calculation unit that calculates a threshold value and a threshold value setting unit 125 that sets the calculated threshold value to the multi-value to binary conversion unit 111. The threshold control operation in each mode will be described below.

【００３４】まず、モードが指定された場合について
説明する。制御部１０６は、領域識別部１０４によって
識別された文字領域及び表領域を順に１領域ずつ選択
し、その多値表現画像データを画像メモリ１０２より読
み出して閾値制御部１２０へ入力する。First, the case where the mode is designated will be described. The control unit 106 sequentially selects the character region and the table region identified by the region identification unit 104 one by one, reads the multivalued representation image data from the image memory 102, and inputs the multivalued representation image data to the threshold value control unit 120.

【００３５】閾値制御部１２０に入力された画像データ
はベタ部除去部１２２を経由してヒストグラム作成部１
２３へ送られるが、文字領域内の白または黒ベタ部の除
去を指定された場合には、文字領域の画像データが入力
された時にベタ部除去部１２２は一定以上の面積の白ま
たは黒ベタ部を除いた画像データだけをヒストグラム作
成部１２３へ送る。The image data input to the threshold control unit 120 passes through the solid area removing unit 122 and the histogram creating unit 1
However, if the removal of the white or black solid portion in the character area is designated, the solid portion removal unit 122 receives the image data of the character area, and the solid portion removal unit 122 has a certain area or more solid white or black area. Only the image data excluding the copy is sent to the histogram creation unit 123.

【００３６】図５の文字領域２３０の例では、行長の異
なる文字行２３１〜２３５を囲む矩形の範囲が一つの領
域として切り出されているので、文字の存在しない、か
なり広い白ベタ部２３６，２３７が含まれる。このよう
な白ベタ部には裏写り等によるノイズが乗りやすいの
で、面積の広い白ベタ部を含めて濃度ヒストグラムを求
めると、極く薄い濃度値で濃度ヒストグラムにピークが
発生するなど、濃度ヒストグラムに基づいて閾値を決定
する上で悪影響を受けやすい。また、広い黒ベタ部を残
した場合にも本来の濃度ヒストグラムからのずれが大き
くなり、閾値決定に悪影響を及ぼしやすい。したがっ
て、一定以上の面積の白または黒ベタ部を除去するわけ
である。In the example of the character area 230 of FIG. 5, since the rectangular range enclosing the character lines 231 to 235 having different line lengths is cut out as one area, a fairly wide white solid portion 236, in which no character exists, is formed. 237 is included. Since noise due to show-through etc. is likely to be introduced into such white solid areas, if a density histogram is obtained including a large white area, a peak appears in the density histogram at an extremely thin density value. It is likely to be adversely affected when determining the threshold value based on. Further, even when a wide black solid portion is left, the deviation from the original density histogram becomes large, and the threshold value determination is likely to be adversely affected. Therefore, a white or black solid part having a certain area or more is removed.

【００３７】なお、文字領域内に一定以上の面積の白ま
たは黒ベタ部が含まれないように、領域識別部１０４に
おいて文字領域の再分割を行なわせることによって、同
様の効果を得ることもでき、これも本発明の実施態様に
含まれる。図５に示した文字領域２３０の例では、文字
行２３１〜２３３を囲む領域と、文字行２３４，２３５
を囲む領域とに分割すれば、面積の大きな白ベタ部２３
６，２３７が取り除かれる。It should be noted that the same effect can be obtained by subdividing the character area in the area identifying unit 104 so that the character area does not include a solid white or black area having a certain area or more. , Which is also included in the embodiments of the present invention. In the example of the character area 230 shown in FIG. 5, the area surrounding the character lines 231 to 233 and the character lines 234 and 235.
The white solid part 23 having a large area
6,237 are removed.

【００３８】さて、ヒストグラム作成部１２３は、ベタ
部除去部１２２経由で入力した文字または表の領域の画
像データの濃度ヒストグラムを作成する。この濃度ヒス
トグラムに基づいて、閾値計算部１２４は２値化のため
の閾値を算定する。The histogram creating section 123 creates a density histogram of image data of the character or table area input via the solid area removing section 122. The threshold calculator 124 calculates a threshold for binarization based on this density histogram.

【００３９】一般に、文字領域（または表領域）では、
図６の（ａ）に示すような双峰性の濃度ヒストグラムが
得られるが、写真領域等では例えば図６の（ｂ）に示す
ような濃度ヒストグラムが得られる。このように濃度分
布が全く異なる領域について同じ２値化方法もしくは閾
値を適用しても良好な結果を得られない。そこで、文字
認識の性能向上を目的とした場合、文字領域または表領
域のための２値化閾値は、その領域の濃度分布をもとに
決定するわけである。Generally, in the character area (or table area),
Although a bimodal density histogram as shown in FIG. 6A is obtained, a density histogram as shown in FIG. 6B is obtained in a photographic region or the like. Even if the same binarization method or threshold value is applied to the regions having completely different density distributions as described above, good results cannot be obtained. Therefore, for the purpose of improving the character recognition performance, the binarization threshold value for the character area or the table area is determined based on the density distribution of the area.

【００４０】さて、基本的には、図６（ａ）のような濃
度ヒストグラムの谷の付近の濃度値を２値化閾値に選べ
ばよいが、実際に得られる濃度ヒストグラムは必ずしも
図６（ａ）に示したほどには谷が明確でないことが少な
くない。そこで、従来より濃度ヒストグラムに基づいて
最適な２値化閾値を求めるための各種方法が考案されて
いる。閾値計算部１２４は、そのような従来の適当な閾
値決定方法を採用してもよいし、あるいは他の方法を採
用してもよい。ここでは、計算量が少ないことに着目し
て、濃度ヒストグラムの０次と１次の累積モーメントを
用いて閾値を決定する公知の方法（大津展之，”判別お
よび最小２乗基準に基づく自動しきい値選定法”，信学
論 ’８０／４Ｖol.Ｊ６３−ＤＮo.４）によって、
閾値を決定するものとする。処理の内容は同論文に詳し
いので、その記載を援用し説明にかえる。Now, basically, it is sufficient to select the density value near the valley of the density histogram as shown in FIG. 6A as the binarization threshold value, but the density histogram actually obtained is not necessarily shown in FIG. Often, the valleys are not as clear as shown in (). Therefore, conventionally, various methods have been devised for obtaining the optimum binarization threshold value based on the density histogram. The threshold value calculation unit 124 may adopt such a conventional appropriate threshold value determining method or another method. Here, paying attention to the fact that the amount of calculation is small, a known method of determining the threshold value by using the 0th and 1st order cumulative moments of the density histogram (Otsuno Nobuyuki, “discrimination and an automatic method based on the least squares criterion”). Threshold value selection method ", Theological theory '80 / 4 Vol.J63-D No.4)
The threshold shall be determined. The details of the process are detailed in the same paper, so the description is incorporated by reference.

【００４１】このようにして決定された閾値は、領域と
対応付けて閾値設定部１２５の内部に記憶される。全て
の文字領域及び表領域に対する閾値が決定されると、制
御部１０６は画像メモリ１０２内の画像データを順次読
み出して多値−２値変換部１１０，１１１へ送るととも
に、その画像データが文字または表の領域のデータの時
に多値−２値変換部１１１を有効にし、その他領域のデ
ータの時に多値−２値変換部１１０を有効にする。ま
た、文字領域または表領域のデータの読み出し時に、当
該領域の識別情報を閾値制御部１２０へ与える。閾値制
御部１２０の閾値設定部１２５は、この識別情報により
いずれの文字または表の領域のデータを処理中であるか
判断し、当該領域に対応した閾値（内部に記憶してい
る）を多値−２値変換部１１１に対し設定する。The threshold value thus determined is stored in the threshold value setting section 125 in association with the area. When the thresholds for all the character areas and the table areas are determined, the control unit 106 sequentially reads out the image data in the image memory 102 and sends the image data to the multi-value / binary conversion units 110 and 111. The multi-value to binary conversion unit 111 is enabled when the data is in the area of the table, and the multi-value to binary conversion unit 110 is enabled when the data is in the other area. Further, when the data of the character area or the table area is read, the identification information of the area is given to the threshold control unit 120. The threshold value setting unit 125 of the threshold value control unit 120 determines which character or table area data is being processed based on this identification information, and multi-values the threshold value (stored inside) corresponding to the area. It is set in the -2 value conversion unit 111.

【００４２】このように、このモードによれば、文字ま
たは表の各領域毎に２値化のための閾値が当該各領域の
濃度ヒストグラムに応じて適応的に制御され、しかも各
領域内では閾値は一定しているため、領域によって濃度
分布に違いがあるような場合でも、文字領域及び表領域
の画像データについて極めて精度のよい、かつ安定な２
値化が可能である。したがって、多値−２値変換部１１
１の出力データを文字認識装置に入力するようにすれ
ば、文字認識率の大幅な向上を期待できる。As described above, according to this mode, the threshold value for binarization is adaptively controlled according to the density histogram of each area of each character or table, and the threshold value is set within each area. Is constant, the image data of the character area and the table area is highly accurate and stable even when the density distribution varies depending on the area.
It can be valued. Therefore, the multi-value / binary conversion unit 11
If the output data of 1 is input to the character recognition device, a significant improvement in the character recognition rate can be expected.

【００４３】次に、モードが指定された場合について
説明する。制御部１０６は、領域識別部１０４によって
識別された文字領域を順に１領域ずつ選択し、その多値
表現画像データを画像メモリ１０２より読み出して閾値
制御部１２０へ入力する。閾値制御部１２０に入力され
た画像データはベタ部除去部１２２を経由してヒストグ
ラム作成部１２３へ送られる。文字領域内の白または黒
ベタ部の除去を指定された場合には、文字領域の画像デ
ータが入力された時には、ベタ部除去部１２２によって
一定以上の面積の白または黒ベタ部を除いた画像データ
だけがヒストグラム作成部１２３へ送られる。ヒストグ
ラム作成部１２４は入力された画像データより、その領
域を区別せずに濃度ヒストグラムの作成を行なう。全文
字領域の画像データが入力され、濃度ヒストグラムが得
られると、閾値計算部１２５は、この濃度ヒストグラム
に基づいて、前述の方法によって閾値を求める。この閾
値は閾値設定部１２５の内部に記憶される。Next, the case where the mode is designated will be described. The control unit 106 sequentially selects the character regions identified by the region identification unit 104 one by one, reads the multi-valued representation image data from the image memory 102, and inputs it to the threshold value control unit 120. The image data input to the threshold value control unit 120 is sent to the histogram creation unit 123 via the solid portion removal unit 122. When the removal of the white or black solid portion in the character area is designated, when the image data of the character area is input, the solid portion removing unit 122 removes the image of the white or black solid portion having a certain area or more. Only the data is sent to the histogram creation unit 123. The histogram creation unit 124 creates a density histogram from the input image data without distinguishing the area. When the image data of all character areas is input and the density histogram is obtained, the threshold calculation unit 125 obtains the threshold value by the above-mentioned method based on this density histogram. This threshold is stored inside the threshold setting unit 125.

【００４４】このようにして閾値が決定されると、制御
部１０６は画像メモリ１０２内の画像データを順次読み
出して多値−２値変換部１１０，１１１へ送るととも
に、その画像データが文字または表の領域のデータの時
に多値−２値変換部１１１を有効にし、その他領域のデ
ータの時に多値−２値変換部１１０を有効にする。閾値
制御部１２０の閾値設定部１２５は、記憶している閾値
を多値−２値変換部１１１に設定する。When the threshold value is thus determined, the control unit 106 sequentially reads the image data in the image memory 102 and sends it to the multi-value / binary conversion units 110 and 111, and the image data is a character or a display. The multi-value to binary conversion unit 111 is enabled for the data of the area, and the multi-value to binary conversion unit 110 is enabled for the data of the other area. The threshold value setting unit 125 of the threshold value control unit 120 sets the stored threshold value in the multi-value / binary conversion unit 111.

【００４５】このように、このモードによれば、文字及
び表の各領域の２値化のための閾値が、全文字領域の濃
度ヒストグラムに応じて適応的に制御され、しかも各領
域内で閾値は一定している。したがって、文字領域及び
表領域の精度のよい、かつ安定な２値化が可能である。
ただし、全文字領域について平均化した濃度分布に基づ
いて閾値が制御されるので、領域によって濃度分布のば
らつきが激しい場合には、モードに比べて２値化の精
度は悪くなる。As described above, according to this mode, the threshold value for binarizing each area of the character and the table is adaptively controlled according to the density histogram of the entire character area, and the threshold value is set within each area. Is constant. Therefore, it is possible to perform accurate and stable binarization of the character area and the table area.
However, since the threshold value is controlled based on the density distribution averaged over all the character areas, the binarization accuracy becomes worse than that of the mode when the density distribution greatly varies depending on the area.

【００４６】なお、全部の文字領域について濃度ヒスト
グラムを求めるのではなく、予め定めた一定数の文字領
域を例えば文書の先頭のものから選び、その濃度ヒスト
グラムをもとに閾値を決定するようにしてもよく、これ
も本発明の実施態様に含まれる。It should be noted that instead of obtaining the density histograms for all the character areas, a predetermined fixed number of character areas are selected, for example, from the beginning of the document, and the threshold value is determined based on the density histogram. Of course, this is also included in the embodiments of the present invention.

【００４７】モードが指定された場合について説明す
る。制御部１０６は、領域識別部１０４から必要な情報
を取得し、濃度ヒストグラムを作成するための文字領域
を一つ選択する。面積が最大の文字領域の選択を指定さ
れたときには、領域識別部１０４より取得している各文
字領域の位置情報から面積を把握し、それを比較するこ
とによって、一つの文字領域を選択する。行数最大の文
字領域の選択を指定されたときには、領域識別部１０４
より各文字領域に含まれている文字行数を取得し、これ
が最大の文字領域を選択する。最大文字数の文字領域の
選択を指定された場合、領域識別部１０４において領域
識別中に得られた各文字領域内の矩形数と文字サイズと
から、各文字領域内の文字数を推定し、それが最大の文
字領域を選択する。つまり、できるだけ文字数が多い文
字領域を選びだそうとしているわけであるが、この選択
のための処理の負担は当然、面積を利用する方法が最も
軽く、行数を利用する方法が次に軽く、文字数を直接推
定する方法が最も重い。The case where the mode is designated will be described. The control unit 106 acquires necessary information from the region identification unit 104 and selects one character region for creating a density histogram. When the selection of the character area with the largest area is designated, one area is selected by grasping the area from the position information of each character area acquired from the area identification unit 104 and comparing them. When the selection of the character area with the maximum number of lines is designated, the area identification unit 104
The number of character lines contained in each character area is obtained, and this selects the largest character area. When the selection of the character area of the maximum number of characters is designated, the number of characters in each character area is estimated from the number of rectangles and the character size in each character area obtained during area identification in the area identification unit 104, Select the largest character area. In other words, we are trying to select a character area with as many characters as possible, but the processing load for this selection is naturally the lightest using the area, and the lightest using the number of lines, The method of directly estimating the number of characters is the heaviest.

【００４８】このようにして一つの文字領域を選択する
と、この文字領域の画像データを画像メモリ１０２より
読み出して閾値制御部１２０へ送る。閾値制御部１２０
において、入力された画像データは、ベタ部除去部１２
２経由でそのままヒストグラム作成部１２３へ送られ、
あるいは、ベタ部除去を指定されているときには、文字
領域の白または黒ベタ部のデータを除去されてヒストグ
ラム作成部１２３へ送られる。そして、ヒストグラム作
成部１２３で濃度ヒストグラムが作成され、閾値計算部
１２４で濃度ヒストグラムに基づき閾値が決定され、こ
れが閾値設定部１２５の内部に記憶され、多値−２値変
換部１１１に設定される。When one character area is selected in this way, the image data of this character area is read from the image memory 102 and sent to the threshold value controller 120. Threshold control unit 120
In the input image data, the solid image removal unit 12
2 is sent to the histogram creation unit 123 as it is,
Alternatively, when the solid portion removal is designated, the data of the white or black solid portion of the character area is removed and sent to the histogram creation unit 123. Then, the histogram creation unit 123 creates a density histogram, the threshold calculation unit 124 determines a threshold value based on the density histogram, and the threshold value is stored in the threshold value setting unit 125 and set in the multi-value / binary conversion unit 111. .

【００４９】次に制御部１０６は画像メモリ１０２より
画像データを順次読み出して多値−２値変換部１１０，
１１１へ送るが、文字または表の領域のデータの読み出
し時には多値−２値変換部１１１を有効にし、その他領
域のデータの読み出し時に多値−２値変換部１１０を有
効にする。Next, the control unit 106 sequentially reads out the image data from the image memory 102, and the multi-value / binary conversion unit 110,
Although the data is sent to 111, the multi-value to binary conversion unit 111 is enabled when reading the data of the character or table area, and the multi-value to binary conversion unit 110 is enabled when reading the data of the other area.

【００５０】このモードは、一つの文字領域だけを用い
て閾値を決定するので、閾値決定のための処理量が少な
くてすみ処理を高速化することができる。しかも、文字
数が最も多いと推定される文字領域を代表として選ぶの
で、トータルの認識率を向上させるような閾値を決定す
ることが可能である。すなわち、閾値決定のために代表
として選ばれる文字領域は、本文の領域の一つであるの
が普通である。見出し等の本文以外の文字領域は、領域
サイズが小さいことが多く、文字サイズも本文より大き
いことが多く、また、ゴシックのような特別な書体で書
かれていることが多いので、本文の領域の濃度ヒストグ
ラムに基づいて決定した閾値で２値化した場合、見出し
等の認識率は低下する可能性がある。しかし、見出し等
の文字数が原稿全体の総文字数に占める割合は小さいの
で、原稿全体としてみれば高い認識率を得られる。In this mode, since the threshold value is determined by using only one character area, the processing amount for determining the threshold value is small and the corner processing can be speeded up. Moreover, since the character area estimated to have the largest number of characters is selected as a representative, it is possible to determine a threshold value that improves the total recognition rate. That is, the character area selected as a representative for determining the threshold value is usually one of the body areas. The text area other than the text, such as headings, is often small in size, and the text size is often larger than the text. In addition, it is often written in a special typeface such as Gothic. When binarization is performed using the threshold value determined based on the density histogram of, the recognition rate of headings and the like may decrease. However, since the ratio of the number of characters such as a headline to the total number of characters of the entire manuscript is small, a high recognition rate can be obtained for the entire manuscript.

【００５１】なお、面積、行数または文字数が大きい文
字領域を１個ではなく、２個以上の所定数選び、それら
の濃度ヒストグラムに基づいて閾値を決定することも可
能であり、これも本発明の実施態様に含まれる。It is also possible to select not a single character area having a large area, a large number of lines or a large number of characters but a predetermined number of two or more, and determine the threshold value based on the density histograms thereof. Embodiment of the present invention.

【００５２】次にモードの動作であるが、文字領域を
表領域に置き換えたこと以外はモードと同じである。
また、モードの動作は、閾値決定のために文字領域に
代えて面積が最大の表領域を一つ用いること以外はモー
ドと同様である。モードとモードは、表とそのキ
ャプションのみの原稿のような、表領域のウエイトの高
い原稿を扱う場合に適する。Next, the operation of the mode is the same as that of the mode except that the character area is replaced with the table area.
The operation of the mode is the same as that of the mode except that one table area having the largest area is used instead of the character area for determining the threshold value. Modes and modes are suitable for handling documents with a high weight in the table area, such as a document with only a table and its caption.

【００５３】＜実施例３＞図７は本実施例による画像処
理装置の概略ブロック図である。図７において、図３と
同一の符号は同一部または対応部を示すので、その説明
を省略する。図３に示した前記実施例２との相違は、文
字認識の対象外の、その他領域も、文字領域及び表領域
と同じ多値−２値変換部１１２において単純２値化処理
により２値化するようにした点のみである。この多値−
２値変換部１１２の２値化閾値は閾値制御部１２０によ
り設定されるが、前記実施例２におけるモード〜と
同様のモードを有し、その任意のモードを利用者が指定
できる。なお、多値−２値変換部１１１は、２値化処理
中、常に有効とされる。<Third Embodiment> FIG. 7 is a schematic block diagram of an image processing apparatus according to the present embodiment. In FIG. 7, the same reference numerals as those in FIG. 3 indicate the same parts or corresponding parts, and the description thereof will be omitted. The difference from the second embodiment shown in FIG. 3 is that other areas, which are not the target of character recognition, are binarized by simple binarization processing in the same multi-value / binary conversion unit 112 as the character area and the table area. It is only the point that I tried to do. This multivalue-
The binarization threshold value of the binary conversion unit 112 is set by the threshold value control unit 120, but it has the same modes as the modes 1 to 3 in the second embodiment, and the user can specify any of the modes. The multi-value / binary conversion unit 111 is always valid during the binarization process.

【００５４】本実施例では、写真や網点画像のような領
域の画像データは、単一閾値を用いる単純２値化法によ
って２値表現に変換されるので、ディザ処理等による場
合に比べて写真や網点画像等の画像品質の劣化が避けら
れない。しかし、本装置を文字認識装置の前処理装置と
して使用する場合、文字と表を除いた領域はもともと文
字認識の対象外であるので、その品質悪化は文字認識性
能に格別の悪影響を及ぼさない。In this embodiment, image data of a region such as a photograph or a halftone dot image is converted into a binary representation by a simple binarization method using a single threshold, so that it is compared with the case of dithering or the like. Inevitable deterioration of image quality such as photographs and halftone images. However, when this apparatus is used as a pre-processing apparatus for a character recognition apparatus, since the area excluding the characters and the table is originally outside the object of character recognition, the quality deterioration does not have a particularly bad effect on the character recognition performance.

【００５５】＜実施例４＞図８は本実施例による画像処
理装置の概略ブロック図である。本装置は、図１に示し
た前記実施例１の構成に、図３に示した前記実施例２の
閾値制御部１２０と同様の構成の閾値制御部１３０を追
加し、この閾値制御部１３０によって多値−２値変換部
（単純２値化処理部）１０８，１０９の２値化閾値を制
御するようにした構成である。<Fourth Embodiment> FIG. 8 is a schematic block diagram of an image processing apparatus according to the present embodiment. This apparatus adds a threshold control unit 130 having the same configuration as the threshold control unit 120 of the second embodiment shown in FIG. 3 to the configuration of the first embodiment shown in FIG. The configuration is such that the binarization thresholds of the multi-value / binary conversion units (simple binarization processing units) 108 and 109 are controlled.

【００５６】本装置は、文字／表領域の２値化閾値の制
御に関して次のモード(10)〜(16)を有し、利用者が任意
のモードを指定することができる。This apparatus has the following modes (10) to (16) for controlling the binarization threshold of the character / table area, and the user can specify any mode.

【００５７】モード(10)：前記実施例２のモードに対
応するモードである。各文字領域毎に、領域の濃度ヒス
トグラムに基づき各文字領域用の閾値を決定し、また各
表領域毎に、領域の濃度ヒストグラムに基づき各表領域
用の閾値を決定し、文字領域の閾値を多値−２値変換部
１０８に設定し，表領域に対する閾値を多値−２値変換
部１０９に設定する。このモード、及び後記モード(1
1)，(12)，(13)，(16)において、文字領域の濃度ヒスト
グラムを求める際に、白または黒ベタ領域を除外するか
否かの指定が可能である。Mode (10): This mode corresponds to the mode of the second embodiment. For each character area, the threshold for each character area is determined based on the density histogram of the area, and for each table area, the threshold for each table area is determined based on the density histogram of the area. The multi-value to binary conversion unit 108 is set, and the threshold for the table area is set to the multi-value to binary conversion unit 109. This mode and the mode described later (1
In 1), (12), (13), and (16), it is possible to specify whether to exclude solid white or black areas when obtaining the density histogram of the character area.

【００５８】モード(11)：前記実施例２のモードに対
応するモードである。全文字領域の濃度ヒストグラムに
基づき閾値を決定し、この閾値を多値−２値変換部１０
８，１０９に対して設定される。Mode (11): This mode corresponds to the mode of the second embodiment. A threshold is determined based on the density histogram of all character areas, and this threshold is set to the multi-value / binary conversion unit 10.
It is set for 8,109.

【００５９】モード(12)：前記実施例２のモードとモ
ードを組み合わせたモードに相当する。すなわち、全
文字領域の濃度ヒストグラムに基づいて文字領域用の閾
値を決定し、これを多値−２値変換部１０８に対して設
定し、全表領域の濃度ヒストグラムに基づいて表領域用
の閾値を決定し、これを多値−２値変換部１０９に対し
て設定する。Mode (12): Corresponds to a mode obtained by combining the modes of the second embodiment with the modes. That is, the threshold value for the character area is determined based on the density histogram of all the character areas, this is set for the multi-value / binary conversion unit 108, and the threshold value for the table area is set based on the density histogram of all the table areas. Is determined and set in the multi-value / binary conversion unit 109.

【００６０】モード(13)：前記実施例２のモードに対
応するモードである。面積、行数または文字数が最大の
一つの文字領域を選び、その濃度ヒストグラムに基づい
て閾値を決定し、この閾値を多値−２値変換部１０８，
１０９に対して設定する。Mode (13): This mode corresponds to the mode of the second embodiment. One character area having the largest area, number of lines or number of characters is selected, a threshold value is determined based on the density histogram, and the threshold value is set to the multi-value / binary conversion unit 108.
Set to 109.

【００６１】モード(14)：前記実施例２のモードに対
応するモードである。全表領域の濃度ヒストグラムに基
づいて閾値を決定し、この閾値を多値−２値変換部１０
８，１０９に対して設定する。Mode (14): This mode corresponds to the mode of the second embodiment. A threshold value is determined based on the density histograms of all the surface areas, and this threshold value is set to the multi-value / binary conversion unit 10.
Set for 8,109.

【００６２】モード(15)：前記実施例２のモードに対
応するモードである。面積が最大の一つの表領域を選び
だし、その濃度ヒストグラムに基づいて閾値を決定し、
この閾値を多値−２値変換部１０８，１０９に設定す
る。Mode (15): This mode corresponds to the mode of the second embodiment. Select one table area with the largest area, determine the threshold value based on the density histogram,
This threshold is set in the multi-value / binary conversion units 108 and 109.

【００６３】モード(16)：前記実施例２のモードとモ
ードを組み合わせてモードに対応する。面積、行数ま
たは文字数が最大の一つの文字領域を選びだし、その濃
度ヒストグラムに基づいて閾値を決定し、これを多値−
２値変換部１０８に設定する。また、面積が最大の一つ
の表領域を選びだし、その濃度ヒストグラムに基づいて
閾値を設定し、これを多値−２値変換部１０９に設定す
る。Mode (16): Corresponds to the mode by combining the modes of the second embodiment with the modes. Select one character area with the maximum area, number of lines or number of characters, determine the threshold value based on the density histogram, and use this as a multi-value
It is set in the binary conversion unit 108. Further, one table area having the largest area is selected, a threshold value is set based on the density histogram, and this is set in the multi-value / binary conversion unit 109.

【００６４】以上の各モードにおける動作は前記各実施
例から容易に理解し得るであろうから、ここでは説明を
繰り返さない。本実施例によれば、文字領域と表領域の
２値化閾値を別個に制御するモード(10)，(12)，(16)が
あるため、前記実施例２または前記実施例３に比べ、文
字領域及び表領域の画像データの２値化をより高精度に
行なうことができる。The operation in each of the above modes can be easily understood from each of the above-mentioned embodiments, and therefore the description will not be repeated here. According to the present embodiment, there are modes (10), (12), and (16) for controlling the binarization thresholds of the character area and the table area separately, so that compared to the second embodiment or the third embodiment, It is possible to binarize the image data of the character area and the table area with higher accuracy.

【００６５】[0065]

【発明の効果】以上、詳細に説明したように、請求項１
乃至請求項５の各項記載の発明によれば、原稿上の文字
が書かれた各領域について、同一の処理方法または一定
の２値化閾値の単純２値化処理によって、画像データの
２値化を安定に行なうことができる。請求項３または４
記載の発明によれば、文字が書かれた領域の濃度ヒスト
グラムに基づいて２値化閾値が最適化されるため、文字
が書かれた領域について安定かつ高精度な２値化が可能
である。請求項５記載の発明によれば、文字が書かれた
領域の内部に白または黒ベタ領域が存在しても、その影
響を排除して２値化閾値を最適化し高精度の２値化を行
なうことができる。また、請求項４記載の発明によれ
ば、２値化閾値の決定のために一部の領域を選択し、そ
の濃度ヒストグラムを利用するので、２値化閾値決定の
ための処理量を減らし処理を高速化でき、しかも、文字
数が多いと推定される領域を選択するならば、２値化閾
値を十分に最適化して高精度かつ安定な２値化が可能で
ある。かかる本発明による画像処理装置を文字認識装置
の前処理装置として利用すれば、原稿上の文字が書かれ
ている領域について高精度かつ安定な２値化データを文
字認識装置に入力することができるので、認識率の大幅
な向上を期待できる。As described in detail above, claim 1 is as follows.
According to the invention described in each of the fifth to fifth aspects, the binarization of the image data is performed by the same processing method or the simple binarization processing with a constant binarization threshold for each area in which the characters are written on the document. Can be stabilized. Claim 3 or 4
According to the described invention, the binarization threshold value is optimized based on the density histogram of the area where the character is written, so that the area where the character is written can be stably and highly accurately binarized. According to the invention described in claim 5, even if there is a white or black solid area inside the area in which characters are written, the influence thereof is eliminated and the binarization threshold value is optimized to perform highly accurate binarization. Can be done. Further, according to the invention as set forth in claim 4, since a part of the region is selected for determining the binarization threshold and the density histogram thereof is used, the processing amount for determining the binarization threshold is reduced. If the area where the number of characters is estimated to be high can be selected and the area where the number of characters is estimated to be large is selected, the binarization threshold value can be sufficiently optimized to perform highly accurate and stable binarization. If the image processing apparatus according to the present invention is used as a preprocessing apparatus for a character recognition apparatus, highly accurate and stable binarized data can be input to the character recognition apparatus in a region where a character is written on a document. Therefore, a significant improvement in recognition rate can be expected.

【図面の簡単な説明】[Brief description of drawings]

【図１】実施例１の装置構成を示す概略ブロック図であ
る。FIG. 1 is a schematic block diagram showing a device configuration of a first embodiment.

【図２】領域識別の説明図である。FIG. 2 is an explanatory diagram of area identification.

【図３】実施例２の装置構成を示す概略ブロック図であ
る。FIG. 3 is a schematic block diagram showing a device configuration of a second embodiment.

【図４】閾値制御部の構成を示すブロック図である。FIG. 4 is a block diagram showing a configuration of a threshold control unit.

【図５】白ベタ部を含む文字領域の例を示す図である。FIG. 5 is a diagram showing an example of a character area including a white solid portion.

【図６】（ａ）文字領域の濃度ヒストグラムの例を示す
図である。（ｂ）写真や網点画像の領域の濃度ヒストグラムの例を
示す図である。FIG. 6A is a diagram showing an example of a density histogram of a character area. (B) It is a figure which shows the example of the density histogram of the area | region of a photograph or a halftone image.

【図７】実施例３の装置構成を示す概略ブロック図であ
る。FIG. 7 is a schematic block diagram showing a device configuration according to a third embodiment.

【図８】実施例４の装置構成を示す概略ブロック図であ
る。FIG. 8 is a schematic block diagram showing a device configuration of a fourth embodiment.

[Explanation of symbols]

１００多値表現画像データ１０２画像メモリ１０４領域識別部１０６制御部１０８文字領域用の多値−２値変換部（単純２値化処
理部）１０９表領域用の多値−２値変換部（単純２値化処理
部）１１０その他領域用の多値−２値変換部（ディザ処理
部）１１１文字／表領域用の多値−２値変換部（単純２値
化処理部）１１２全領域用の多値−２値変換部（単純２値化処理
部）１２０閾値制御部１２２ベタ部除去部１２３ヒストグラム作成部１２４閾値計算部１２５閾値設定部１３０閾値制御部２００入力文書２０２〜２０５文字領域２０６表領域２０８，２０９セパレータ２１０図２３０文字領域２３１〜２３５文字行２３６，２３７白ベタ部Reference Signs List 100 multi-valued representation image data 102 image memory 104 area identification section 106 control section 108 multi-valued / binary conversion section for character area (simple binarization processing section) 109 multi-valued / binary conversion section for table area (simple Binarization processing unit) 110 Multi-value to 2-value conversion unit for other areas (dither processing unit) 111 Multi-value to 2-value conversion unit for character / table areas (Simple binary conversion processing unit) 112 For all areas Multi-value to binary conversion unit (simple binary conversion processing unit) 120 Threshold control unit 122 Solid part removal unit 123 Histogram creation unit 124 Threshold calculation unit 125 Threshold setting unit 130 Threshold control unit 200 Input document 202 to 205 Character area 206 Table Area 208,209 Separator 210 Figure 230 Character area 231 to 235 Character line 236,237 White solid part

Claims

[Claims]

1. An area discriminating means for globally discriminating an area in which characters are written and other areas on the original from multivalued image data of the original, and a character discriminated by the area discriminating means. Multi-valued representation image data of the written area,
An image processing apparatus comprising: a multi-value / two-value conversion means for converting into binary expression image data by a constant processing method.

2. The image processing apparatus according to claim 1,
The multi-value to binary conversion means is a simple binary with a constant binarization threshold.
An image processing apparatus, which converts multi-valued expression image data into binary expression image data by a binarization process.

3. An area identifying means for globally identifying an area in which characters are written on the original and other areas from multivalued image data of the original, and a character identified by the area identifying means. Multi-value / binary conversion means for converting multi-valued expression image data in the written area into binary expression image data by simple binarization processing, and an area in which the character identified by the area identification means is written And a means for determining a binarization threshold value for the multivalue-to-binary conversion means based on the density histogram, and the binarization threshold value is at least the same. An image processing device characterized in that it is fixed in the area.

4. An area discriminating means for globally discriminating an area in which characters are written and other areas on the original from multivalued image data of the original, and a character discriminated by the area discriminating means. Multi-valued representation image data of the written area,
The multi-value / binary conversion means for converting into binary representation image data by the simple binarization processing and a part of the area in which the character identified by the area identification means is written are selected, A density histogram is obtained from the multivalued image data of the selected region, and a binarization threshold value for the multivalue-binary conversion means is determined based on the density histogram. Is fixed in at least the same area.

5. The image processing apparatus according to claim 3 or 4, wherein the means for determining the threshold value obtains a density histogram by excluding white or black solid portions inside the area.