JPH04317261A

JPH04317261A - Image area identification system for image processor

Info

Publication number: JPH04317261A
Application number: JP3085239A
Authority: JP
Inventors: Yuzuru Suzuki; 譲鈴木
Original assignee: Fuji Xerox Co Ltd
Current assignee: Fujifilm Business Innovation Corp
Priority date: 1991-04-17
Filing date: 1991-04-17
Publication date: 1992-11-09

Abstract

PURPOSE:To provide an image area identification system which is capable of identifying with high performance the respective area of an image in which color characters and intermediate tones are mixed and which has excellent hardware realization. CONSTITUTION:A blocking means 14 for plural picture elements, characteristic value detection means 16 to 18 detecting plural characteristic values from the picture element within the block, and area identification means 20 to 24 identifying the area of characters/intermediate tones from plural character values are provided. The plural characteristic values are available in three kinds: an average value within the block, and the total values of high level picture elements and intermediate level picture elements within the block, respectively. The area identification means 20 to 24 are provided with quantization means 20 to 22 for plural characteristic values, a LUT23 preliminarily set to a threshold value for the total value of the intermediate level picture elements for the area identification and a comparison means 24, and obtains the threshold value for the total value of the intermediate picture elements from the LUT23 while using a value obtained by quantization and synthesizing the average value and the total value of the intermediate level picture elements within the block as an address, compares the threshold with the actually obtained total value of the intermediate level picture elements and identifies the areas in the block.

Description

[Detailed description of the invention]

【０００１】0001

【産業上の利用分野】本発明は、カラーの文字／中間調
画像の領域識別を行う画像処理装置の画像領域識別方式
に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to an image area identification method for an image processing apparatus that identifies areas of color characters/halftone images.

【０００２】0002

【従来の技術】カラー複写機等のカラー画像処理装置で
は、Ｙ（イエロー）、Ｍ（マゼンタ）、Ｃ（シアン）、
Ｋ（黒）からなる４色のトナー現像器を使用し、これら
の各画像を重ね合わせてフルカラーの原稿を再現してい
る。そのため、各トナーの現像プロセスに合わせて繰り
返し４回のスキャンを行い、その都度、原稿を読み取っ
たフルカラーの画像データを処理している。[Prior Art] Color image processing devices such as color copying machines use Y (yellow), M (magenta), C (cyan),
A four-color toner developer consisting of K (black) is used, and these images are superimposed to reproduce a full-color original. Therefore, scanning is repeated four times in accordance with the development process of each toner, and the full-color image data obtained by reading the document is processed each time.

【０００３】図８は従来のカラー画像処理装置の構成例
及び画像領域識別回路の構成例を示す図である。FIG. 8 is a diagram showing an example of the configuration of a conventional color image processing device and an example of the configuration of an image area identification circuit.

【０００４】一般にカラー画像処理装置では、ラインセ
ンサーを用いて原稿を光学的に読み取り、Ｂ（青）、Ｇ
（緑）、Ｒ（赤）の色分解信号により画像データを取り
込むと、図８（イ）に示すようにその画像データをＥＮ
Ｄ変換３１、カラーマスキング（カラーコレクション）
３２を通してカラーのトナー信号ＹＭＣに変換している
。そして、ＵＣＲ３３により墨版（Ｋ）生成、下色除去
を行い、色相分離型非線型フィルタ部、ＴＲＣ（トーン
調整）４０、ＳＧ（スクリーンジェネレータ）４１を通
して現像色のトナー信号Ｘをオン／オフの２値化データ
にし、この２値化データでレーザ光を制御して帯電した
感光体を露光し各色の画像を形成している。Generally, in a color image processing device, a line sensor is used to optically read a document, and
When image data is captured using the color separation signals of green (green) and red (R), the image data is converted to EN as shown in FIG.
D conversion 31, color masking (color correction)
32, it is converted into color toner signals YMC. Then, the UCR 33 generates a black plate (K) and removes the undercolor, and turns on/off the toner signal The data is converted into binary data, and a laser beam is controlled using the binary data to expose a charged photoreceptor to form an image of each color.

【０００５】色相分離型非線型フィルタ部には、ＵＣＲ
３３において墨版生成、下色除去処理を行って生成され
たＹＭＣＫの信号から現像プロセスにしたがって現像色
の画像データＸがセレクトされ、この画像データＸが入
力されて２系統に分岐される。そのうちの一方には平滑
化フィルタ３４、他方にはγ変換回路３６、エッジ検出
フィルタ３７、エッジ強調用ＬＵＴ３５が接続され、前
者の系統で中間調画像に対する平滑化の処理が行われ、
後者の系統で文字画像に対するエッジ強調が行われる。そして、これらの出力が加算器３９で合成されて非線型
フィルタ処理後の信号として出力される。この場合、エ
ッジ処理では、色相検出回路３５により入力画像の色相
を検出し、そのときの現像色が必要色か否かの判定を行
う。ここで、もし入力画像が黒領域である場合には、Ｙ
、Ｍ、Ｃの有彩色信号のエッジ強調は行わずに、Ｋのみ
をエッジ量に応じて強調するように制御する。[0005] The hue separation type nonlinear filter section includes a UCR
In step 33, image data X of the developed color is selected according to the development process from the YMCK signals generated by performing black plate generation and undercolor removal processing, and this image data X is input and branched into two systems. A smoothing filter 34 is connected to one of them, and a γ conversion circuit 36, an edge detection filter 37, and an edge enhancement LUT 35 are connected to the other, and the former system performs smoothing processing on the halftone image.
In the latter system, edge enhancement is performed on character images. Then, these outputs are combined by an adder 39 and output as a signal after nonlinear filter processing. In this case, in edge processing, the hue of the input image is detected by the hue detection circuit 35, and it is determined whether the developed color at that time is a required color. Here, if the input image is a black area, Y
, M, and C, but only K is controlled to be emphasized according to the edge amount.

【０００６】上記のように非線形フィルタ処理を導入し
、種類の異なった文字、線画等の２値画像と、写真や網
点印刷物等の原稿に対して、文字、線画等の２値画像は
、エッジを強調して鮮鋭度を高めるようにし、写真や網
点印刷物等の中間調画像は、平滑化して画像の滑らかさ
や粒状性を高めるようにするため非線形フィルタ処理を
行っている。[0006] By introducing non-linear filter processing as described above, binary images such as characters and line drawings of different types and original documents such as photographs and halftone printed matter are processed. Nonlinear filter processing is performed to emphasize edges to increase sharpness, and to smooth halftone images such as photographs and halftone printed materials to improve image smoothness and graininess.

【０００７】しかし、文字、線画等の２値画像と、写真
や網点印刷物等の中間調画像において、それぞれの原稿
や領域を予め指定することが容易である場合には、その
原稿毎に或いは領域毎に画像種を指定することによって
、それぞれに最適なパラメータを選択することができ、
再現性を高めることができるが、これらの混在画像の場
合には、２値画像にも中間調画像にもそれなりに再現で
きるパラメータが選択されることになる。この場合には
、２値画像に対しても中間調画像に対しても最適な処理
にはならないため、いずれにも満足できる画像を得るこ
とは難しい。However, if it is easy to specify in advance the document or area for binary images such as characters and line drawings, and halftone images such as photographs and halftone printed materials, the By specifying the image type for each area, you can select the optimal parameters for each area.
Although reproducibility can be improved, in the case of mixed images, parameters that can be reproduced to a certain degree as both binary images and halftone images are selected. In this case, since the processing is not optimal for either binary images or halftone images, it is difficult to obtain an image that is satisfactory for both.

【０００８】例えば２値画像では、エッジ強調が弱くぼ
けて、文字が鮮明でなくなったり、また、黒文字では、
エッジ部や小文字部に濁りが生じてしまうという問題が
ある。他方、中間調画像については、エッジ検出周波数
の近傍が強調されてしまうため、中間調画像の滑らかさ
がなくなり、変なモアレやエッジが変に強調された荒い
画像になってしまう。そのため、黒文字と色文字と中間
調の３種の画像領域を識別し、フィルタ部のパラメータ
の切り換え等を行うことが必要であり、この種の識別処
理方式が既に数多く提案されている。[0008] For example, in a binary image, the edge emphasis is weak and blurry, and the characters are not clear, and in the case of black characters,
There is a problem in that the edges and lowercase letters become cloudy. On the other hand, in the case of a halftone image, since the vicinity of the edge detection frequency is emphasized, the smoothness of the halftone image is lost, resulting in a rough image with strange moiré and edges being strangely emphasized. Therefore, it is necessary to identify three types of image areas, black characters, color characters, and halftones, and to switch the parameters of the filter section, and many identification processing methods of this type have already been proposed.

【０００９】例えば図８（ロ）に示す構成（特願平２ー
２９４０号参照）は、ランレングスによる識別方式を導
入したものである。色相検出部４１は、各画素毎に８色
（Ｙ、Ｍ、Ｃ、Ｋ、Ｗ、Ｂ、Ｇ、Ｒ）の色相のいずれか
を検出するものであり、コンパレータ４２は、ＹＭＣの
うち最大のものが閾値ｔｈｍａｘ　以上か否かを検出す
るものである。色相検出部４１では、低い濃度のノイズ
等も検出するので、この場合には、後述するランレング
スが短くなって、文字領域と誤って認識してしまうとい
う問題が生じる。そこで、少なくともある程度以上の濃
度を有する画素を文字候補とするために閾値ｔｈｍａｘ
　との比較判定を行い、ノイズを除去するのがコンパレ
ータ４２である。For example, the configuration shown in FIG. 8(b) (see Japanese Patent Application No. 2-2940) introduces an identification method based on run length. The hue detection unit 41 detects one of eight colors (Y, M, C, K, W, B, G, R) for each pixel, and the comparator 42 detects one of eight colors (Y, M, C, K, W, B, G, R), and the comparator 42 detects one of the hues of eight colors (Y, M, C, K, W, B, G, R). This is to detect whether or not the value is greater than or equal to a threshold value thmax. Since the hue detection unit 41 also detects low-density noise, etc., in this case, a problem arises in that the run length, which will be described later, becomes short and it is mistakenly recognized as a character area. Therefore, in order to select pixels having at least a certain level of density as character candidates, a threshold value thmax is set.
The comparator 42 performs a comparative judgment with the noise and removes noise.

【００１０】ブロック化部４３は、数画素例えば４×４
〜８×８に各画素をブロック化するものであり、ブロッ
ク判定部４４は、７色判定を行うものである。７色判定
は、Ｗを除く７色（Ｙ、Ｍ、Ｃ、Ｋ、Ｂ、Ｇ、Ｒ）につ
いてブロック内の最大頻度色を判定し、それをブロック
色とするものである。例えば４×４画素を１ブロックと
してそれぞれの色の頻度をカウントし、Ｋ＝６，Ｍ＝２
，Ｒ＝１，Ｃ＝１，Ｗ＝６になったとすると、Ｗを除い
た色／黒画素で最大である色Ｋをブロック色として採用
する。The blocking unit 43 blocks several pixels, for example 4×4.
Each pixel is divided into blocks of ~8×8, and the block determination unit 44 performs seven-color determination. Seven-color determination is to determine the maximum frequency color within a block for seven colors (Y, M, C, K, B, G, R) excluding W, and use that as the block color. For example, count the frequency of each color with 4 x 4 pixels as one block, K = 6, M = 2
, R=1, C=1, and W=6, then the color K, which is the largest among the colors excluding W and black pixels, is adopted as the block color.

【００１１】主走査方向カラーランカウント部４６は、
ブロック色判定部４４からＷ（０）か、それ以外の色黒
（１）かを表す１ビットをもらい、主走査方向の色／黒
ブロックのランレングスをカウントするものであり、コ
ンパレータ４７は、各ブロックのカウント値（ランレン
グス）が閾値ｔｈｒｕｎ　より短いか否かを判定するも
のである。ここでは、色／黒ブロックのランレングスが
閾値ｔｈｒｕｎ　より短いと文字の候補となる。The main scanning direction color run count section 46 includes:
The comparator 47 receives one bit indicating whether it is W (0) or other black (1) from the block color determination unit 44 and counts the run length of the color/black block in the main scanning direction. It is determined whether the count value (run length) of each block is shorter than the threshold value thrun. Here, if the run length of a color/black block is shorter than the threshold thrun, it becomes a character candidate.

【００１２】ランレングスで見た場合、画素単位では網
点領域も文字領域と同様にランレングスが短くなり文字
との切り分けが困難であるが、ブロック単位では網点領
域のランレングスが長くなり、文字領域の場合は、文字
と文字との間が網点のピッチより長いためブロックラン
レングスは短くなる。このようにブロックランレングス
でみると文字と網点領域との切り分けが可能になる。When viewed in terms of run length, the run length of the halftone dot area is short in pixel units, just like the character area, and it is difficult to distinguish it from the characters, but in the block unit, the run length of the halftone dot area is long, In the case of a character area, the block run length is short because the distance between characters is longer than the pitch of halftone dots. In this way, by looking at the block run length, it becomes possible to separate the characters and the halftone dot area.

【００１３】副走査方向エッジ検出部４５は、副走査方
向の数ブロックの範囲でエッジがあるか否かを検出する
ものであり、エッジがある場合に文字の候補とする。こ
れは、文字領域で用いられる横線も長いランレングスと
なるため、上記の主走査方向のランレングスでは文字の
候補とならないので、このような場合にも文字の候補と
して検出するためのものである。The sub-scanning direction edge detection section 45 detects whether or not there is an edge within a range of several blocks in the sub-scanning direction, and if an edge is present, it is determined as a character candidate. This is because the horizontal lines used in the character area also have a long run length, so the above run length in the main scanning direction does not serve as a character candidate, so this is to detect it as a character candidate in such cases as well. .

【００１４】オアゲート４８とアンドゲート４９は、上
記の検出結果から主走査方向か副走査方向のいずれかで
文字の候補が存在し、且つブロックのｍａｘフラグが１
であれば、このブロックを文字領域と判定し、ブロック
色判定部４４で判定したブロック色信号を出力するもの
である。なお、ｍａｘフラグは、コンパレータ４２でＹ
ＭＣのうち最大のものが閾値ｔｈｍａｘ　以上であると
判定された画素がブロック内に１以上あることを示す信
号である。The OR gate 48 and the AND gate 49 are used when a character candidate exists in either the main scanning direction or the sub-scanning direction from the above detection result, and the max flag of the block is 1.
If so, this block is determined to be a character area, and the block color signal determined by the block color determining section 44 is output. Note that the max flag is set to Y by the comparator 42.
This is a signal indicating that there is one or more pixels in the block whose maximum MC is determined to be equal to or greater than the threshold value thmax.

【００１５】上記の例は、この文字部周辺のランレング
スが短いことに着目したものである。すなわち、文字部
は、文字のブロックのかたまりと背景のかたまりで構成
され、地肌背景中にあるため、濃度変化が急峻であるの
に対し、画像部は、画像背景中にあり濃度が緩やかであ
る。そのため、ランレングスを観察すると、文字領域で
はランレングスが短く、中間調領域ではランレングスが
長くなる。しかも、画素単位では網点領域と文字領域と
の識別が困難であったが、網点の場合にはブロック化す
ることによりランレングスが長くなるので、文字領域で
はなく中間調領域で認識することができる。つまり、ブ
ロック化によって高い周波数を有する網点パッチ部もラ
ンレングスが長い色ブロックとして発生しやすくなり、
中間調領域として取り込むことができる。また、主走査
方向カラーランカウント部４６とコンパレータ４７によ
り文字の候補を検出するだけでは、主走査方向にランレ
ングスの長いライン等を誤認してしまうため、副走査方
向エッジ検出部４５によりエッジがある場合も文字の候
補とすることによって認識精度を上げている。The above example focuses on the fact that the run length around the character portion is short. In other words, the text area consists of a block of characters and a block of background, and because it is in the background, the density changes sharply, whereas the image area is in the background of the image, and the density changes gradually. . Therefore, when observing the run length, the run length is short in the character area and long in the halftone area. Moreover, it was difficult to distinguish between halftone dot areas and text areas on a pixel basis, but in the case of halftone dots, the run length becomes longer by dividing them into blocks, so it is now possible to recognize them in halftone areas rather than in text areas. Can be done. In other words, due to blocking, halftone patch areas with high frequencies also tend to occur as color blocks with long run lengths.
It can be captured as a halftone area. Furthermore, if only the main scanning direction color run count section 46 and the comparator 47 detect character candidates, lines with long run lengths in the main scanning direction will be mistakenly recognized. In some cases, recognition accuracy is improved by using the characters as candidates.

【００１６】また、図８（ハ）に示す構成（特願平２ー
１３６０７１号参照）は、ブロック単位の複数画素から
最大値と最小値とを検出し、それらの差が所定の値より
大きいか否かにより文字領域を識別すると共に、さらに
周囲のブロックの領域判定結果から誤り補正を行うもの
であり、領域識別回路は、大きく分けて一次識別と二次
識別からなる。一次識別は、８×８ブロック化回路６１
で８×８画素のブロック化を行って文字領域の識別を行
うものであり、そのために、ブロックの８×８画素から
最大値検出回路６２、最小値検出回路６３によりそれぞ
れ最大値と最小値を検出し、演算回路６４でそれらの差
を求める。そして、コンパレータ６５で演算回路６４の
出力、すなわち最大値と最小値との差を閾値ｔｈと比較
し、該差が閾値ｔｈより大きい場合に文字領域の識別信
号、例えば論理「１」を出力する。二次識別は、さらに
３×３のブロックで一次識別によるブロックの識別領域
の誤り補正を行うものである。Furthermore, the configuration shown in FIG. 8(C) (see Japanese Patent Application No. 2-136071) detects the maximum value and minimum value from a plurality of pixels in each block, and detects the difference between them when the difference is larger than a predetermined value. In addition to identifying the character area based on whether or not the character area is the same, error correction is also performed based on the area determination results of surrounding blocks.The area identification circuit is broadly divided into primary identification and secondary identification. The primary identification is an 8×8 blocking circuit 61
The character area is identified by forming blocks of 8 x 8 pixels into blocks, and for this purpose, the maximum value and minimum value are determined from the 8 x 8 pixels of the block by a maximum value detection circuit 62 and a minimum value detection circuit 63, respectively. The arithmetic circuit 64 calculates the difference between them. Then, a comparator 65 compares the output of the arithmetic circuit 64, that is, the difference between the maximum value and the minimum value, with a threshold th, and if the difference is larger than the threshold th, outputs a character area identification signal, for example, logic "1". . In the secondary identification, errors in the identification area of the block based on the primary identification are further corrected using 3×3 blocks.

【００１７】例えば注目ブロックが一次識別では文字領
域と識別された場合において、周囲ブロックのほとんど
が文字領域でないときは、注目ブロックの識別内容を非
文字領域に補正する。同様に、注目ブロックが一次識別
では非文字領域と識別された場合において、周囲ブロッ
クのほとんどが文字領域であるときは、注目ブロックの
識別内容を文字領域に補正する。For example, when the block of interest is identified as a character area in the primary identification, if most of the surrounding blocks are not character areas, the identification content of the block of interest is corrected to be a non-character area. Similarly, when the block of interest is identified as a non-text area in the primary identification, and most of the surrounding blocks are text areas, the identification content of the block of interest is corrected to be a text area.

【００１８】その他にも、所定の画素ブロック内の平均
値と標準偏差等を使うもの（例えば特開昭６３ー２０５
７８３号公報）や位相の異なる複数のディザ変換で変換
した２値出力を使うもの（例えば特開昭６３ー１９３７
７０号公報）、Ｒ、Ｇ、Ｂの色分解信号のそれぞれに対
してブロック内の最大値と最小値を求め、少なくとも１
つの色成分において、その差が予め定められた値以上の
とき文字領域と判定するもの（例えば特公昭６３ー５１
６３１号公報参照）等、種々の方式がこれまでに提案さ
れている。In addition, there are also methods that use the average value and standard deviation within a predetermined pixel block (for example, Japanese Patent Laid-Open No. 63-205
No. 783) and those that use binary output converted by multiple dither conversions with different phases (for example, Japanese Patent Application Laid-Open No. 63-1937).
No. 70), the maximum and minimum values within the block are determined for each of the R, G, and B color separation signals, and at least 1
When the difference between two color components is greater than a predetermined value, it is determined to be a character area (for example,
Various methods have been proposed so far, such as (see Publication No. 631).

【００１９】[0019]

【発明が解決しようとする課題】しかしながら、上記従
来の各種領域識別方式でも、確かにブロック判定を行う
ことによって識別性能を向上させるようにはなっている
が、現実には、装置の高速化を図りかつ高画質のカラー
コピーを得ようとすると、原稿読取を高速で動作させる
ようにし、読み取った画像データも高速で処理しなけれ
ばならないため、例えば画像処理装置の機構的な動作に
伴う振動やノイズにより現像プロセスで領域認識にバラ
ツキが生じ、黒文字の途切れ等のディフェクトが現れる
という問題がある。[Problem to be Solved by the Invention] However, although the various conventional area identification methods described above certainly improve identification performance by performing block judgment, in reality, it is difficult to increase the speed of the device. In order to obtain high-quality color copies, the document must be scanned at high speed, and the scanned image data must also be processed at high speed. There is a problem in that noise causes variations in area recognition during the development process, resulting in defects such as interruptions in black characters.

【００２０】すなわち、カラーコピーでは、先に述べた
ように４回のスキャンでＹ、Ｍ、Ｃ、Ｋのトナー画像を
重ねることによってフルカラー画像を再現しているため
、最大最小値の差、ランレングス、平均値、標準偏差等
のそれぞれの特徴量を用い、ブロック単位で領域の判定
を行うようにしても、Ｙ、Ｍ、Ｃ、Ｋの４回の現像プロ
セスの中で異なった判定が出てしまうことがあり、毎回
確実に同じ判定を得ることが難しい。That is, in color copying, as mentioned above, a full color image is reproduced by overlapping Y, M, C, and K toner images in four scans, so the difference between the maximum and minimum values, the run Even if the area is judged block by block using each characteristic value such as length, average value, standard deviation, etc., different judgments may occur during the four development processes of Y, M, C, and K. It is difficult to reliably obtain the same judgment each time.

【００２１】例えばＭ、Ｃの現像プロセスでは黒文字領
域の判定であるにもかかわらず、たまたまＹの現像プロ
セスとＫの現像プロセスで黒文字領域の判定でなかった
場合には、そのブロックで黒文字が途切れてＹで出力さ
れることになる。つまり、Ｍ、Ｃは、黒文字領域の判定
であるため出力が抑えられ、また、Ｋも黒文字領域以外
の判定であるため出力が抑えられ、Ｙのみが出力される
。しかも、ブロックで判定し出力されるため、画素単位
の出力に比べて黒文字のところどころが色文字になって
しまう様子がより目立ってしまう。このようなディフェ
クトは色文字や中間調の領域においても同様に現れるこ
とになるので、著しい画質の劣化を招くという問題があ
る。For example, if the M and C development processes determine a black character area, but by chance the Y and K development processes do not determine a black character area, the black characters may be interrupted in that block. It will be output as Y. That is, the output of M and C is suppressed because the judgment is for a black text area, and the output of K is also suppressed because the judgment is for a non-black text area, and only Y is output. Moreover, since the determination is made in blocks and output, it becomes more noticeable that some black characters become colored characters compared to output in pixel units. Since such defects similarly appear in color characters and halftone areas, there is a problem of significant deterioration of image quality.

【００２２】また、従来方式では、ハードウエア構成が
複雑であったり、逆に簡単であれば識別性能が悪かった
りして満足されるものではなかった。特に、カラー画像
に対応するための黒文字／色文字識別のハードウエア整
合がとりにくく、領域識別と別系統での構成とならざる
を得なかった。[0022]Furthermore, the conventional system is unsatisfactory because the hardware configuration is complicated, and conversely, if the hardware configuration is simple, the identification performance is poor. In particular, it has been difficult to match the hardware for black character/color character recognition in order to handle color images, and the system has had to be configured in a separate system from area recognition.

【００２３】本発明は、上記の課題を解決するものであ
って、カラー文字／中間調の混在画像のそれぞれの領域
を高い性能で識別でき、ハードウエア実現性に優れ文字
部における黒／色識別処理とのハードウエア整合性の良
好な画像処理装置の画像領域識別方式を提供することを
目的とするものである。The present invention solves the above-mentioned problems, and is capable of identifying each region of a color text/halftone mixed image with high performance, and is excellent in hardware implementation and is capable of black/color discrimination in text areas. It is an object of the present invention to provide an image area identification method for an image processing apparatus that has good hardware compatibility with processing.

【００２４】[0024]

【課題を解決するための手段及び作用】そのために本発
明の画像処理装置の画像領域識別方式は、複数画素をブ
ロック化するブロック化手段と、ブロック内の画素から
複数の特性値を検出する特性値検出手段と、複数の特性
値から文字／中間調の領域識別を行う領域識別手段とを
備え、複数画素をブロック化して当該ブロック内の複数
の特性値を検出し、該複数の特性値から原稿画像の文字
／中間調領域の識別を行うように構成したことを特徴と
し、ブロック内の複数の特性値は、ブロック内平均値と
画像信号を２つの閾値で３つのレベルに量子化したとき
の高レベル画素及び中間レベル画素のそれぞれのブロッ
ク内の総和値との３種とし、これらの空間で文字領域の
識別を行うことを特徴とする。このことにより、３次元
空間での判定を行うことができるので、領域識別の精度
を高めることができる。[Means and effects for solving the problem] To achieve this, the image area identification method of the image processing device of the present invention includes a blocking means that blocks a plurality of pixels, and a characteristic that detects a plurality of characteristic values from pixels in the block. The method includes a value detection means and an area identification means for identifying a character/halftone area from a plurality of characteristic values. The feature is that the character/halftone area of the original image is identified, and the plurality of characteristic values within a block are determined by quantizing the average value within the block and the image signal into three levels using two thresholds. , and the total sum value of high-level pixels and intermediate-level pixels in each block, and character areas are identified in these spaces. This allows determination to be made in three-dimensional space, thereby increasing the accuracy of region identification.

【００２５】また、領域識別手段は、識別結果をＬＵＴ
に格納し、複数の特性値を合成した値をアドレスとして
ＬＵＴの中の識別結果を読み出すように構成し、或いは
複数の特性値の量子化手段を備え、領域識別手段は、複
数の特性値を非線形量子化しそれぞれの語長を圧縮した
後に合成してＬＵＴのアドレスデータとするように構成
し、複数の特性値の量子化手段と、予め領域識別のため
の中間レベル画素の総和値の閾値をセットしたＬＵＴと
、該ＬＵＴにセットした閾値と中間レベル画素の総和値
とを比較する比較手段とを備え、ブロック内平均値と高
レベル画素の総和値とを量子化し合成した値をアドレス
としてＬＵＴをひくことによって中間レベル画素の総和
値の閾値を求め、該閾値と実際に求めた中間レベル画素
の総和値とを比較してブロック内の領域を識別するよう
に構成したので、識別性能を低下させることなくハード
ウエア規模を縮小させることができ、前記中間レベル画
素の総和値の閾値をセットしたＬＵＴの後段に加減算器
を設け、前記ＬＵＴの閾値と調整値とを加減算して前記
比較手段における判定閾値を調整できるように構成した
ので、文字識別の判定領域を調整することができる。[0025] Furthermore, the area identification means stores the identification results in an LUT.
The area identification means stores the plurality of characteristic values in the LUT and reads out the identification result in the LUT by using a value obtained by combining the plurality of characteristic values as an address, or comprises means for quantizing the plurality of characteristic values. It is configured to perform non-linear quantization, compress the word lengths of each word, and then synthesize it to create address data for the LUT. It is equipped with a set LUT, a comparison means for comparing the threshold value set in the LUT and the sum value of intermediate level pixels, and uses the quantized and synthesized value of the intra-block average value and the sum value of high level pixels as an address. By subtracting , a threshold value for the sum of intermediate level pixels is obtained, and the area within the block is identified by comparing this threshold with the actually obtained sum of intermediate level pixels, which reduces the identification performance. An adder/subtractor is provided after the LUT in which the threshold value of the sum of the intermediate level pixels is set, and the threshold value of the LUT and the adjustment value are added/subtracted to reduce the hardware scale without causing any problems. Since the determination threshold value is configured to be adjustable, the determination area for character identification can be adjusted.

【００２６】ブロック化手段の前に画素データを量子化
する量子化手段を設け、ブロック内平均値を求める際の
画素データの語長を圧縮するように構成したことを特徴
とするので、ブロック化手段のサイズを縮小することが
でき、ブロック化手段は、ライン同期信号にしたがって
間引くことによってブロックの副走査方向の観測幅を拡
大できるように構成したことを特徴とするので、拡大時
の識別精度の低下を防ぐことができる。The present invention is characterized in that a quantization means for quantizing pixel data is provided before the blocking means, and the word length of the pixel data is compressed when calculating the intra-block average value. The size of the blocking means can be reduced, and the blocking means is characterized in that it can expand the observation width of the block in the sub-scanning direction by thinning out according to the line synchronization signal, so that the identification accuracy during expansion can be improved. can prevent a decline in

【００２７】さらに、画素単位で色相の識別を行う色相
識別手段と、複数画素をブロック化するブロック化手段
と、ブロック単位の色相信号をカウントして多数決判定
によりブロック全体の色相を決定するブロック色相識別
手段を備え、画素単位の色相信号をブロック化した後、
各色相信号のブロック内のカウント値を求め、その多数
決判定によりブロック全体の色相を決定することを特徴
とする。[0027]Furthermore, there is a hue identification means for identifying hue in pixel units, a blocking means for dividing a plurality of pixels into blocks, and a block hue determining means for counting the hue signals of each block and determining the hue of the entire block by majority decision. After having a discrimination means and dividing the hue signal of each pixel into blocks,
The method is characterized in that the count value of each hue signal within the block is determined, and the hue of the entire block is determined by a majority decision.

【００２８】そして、文字／中間調領域の識別信号と色
相識別信号から原稿画像の黒文字領域、色文字領域、中
間調領域の３種の画像領域を識別する領域識別手段を備
えたことを特徴とし、領域識別用信号と色相識別用信号
を量子化する量子化手段を備え、領域識別用信号と色相
識別用信号を合成した信号の語長がブロック化手段の１
画素の有限語長内に収まるように制限して領域識別用信
号と色相識別用信号を同時に同一のブロック化手段でブ
ロック化できるように構成したことを特徴とする。[0028] The present invention is characterized in that it includes an area identification means for identifying three types of image areas of a document image, a black character area, a color character area, and a halftone area, from the character/halftone area identification signal and the hue identification signal. , comprising a quantization means for quantizing the area identification signal and the hue identification signal, and the word length of the signal obtained by combining the area identification signal and the hue identification signal is one of the block forming means.
The present invention is characterized in that the area identification signal and the hue identification signal are restricted to be within a finite word length of a pixel and can be simultaneously blocked by the same blocking means.

【００２９】ブロック内の画素と背景判定のための閾値
とを比較する手段を備え、該閾値以上の画素が少なくと
もブロック内に１つ以上あることを条件に文字領域と判
断された場合の識別結果を採用することを特徴とする。[0029] A means is provided for comparing pixels in a block with a threshold value for background determination, and an identification result is obtained when the block is determined to be a character area on the condition that there is at least one pixel in the block that is equal to or greater than the threshold value. It is characterized by the adoption of

【００３０】[0030]

【実施例】以下、本発明の実施例を図面を参照して説明
する。Embodiments Hereinafter, embodiments of the present invention will be described with reference to the drawings.

【００３１】図１は本発明に係る画像処理装置の画像領
域識別方式の１実施例を説明するための図、図２はカラ
ー文字／中間調画像の分布を説明するための図、図３は
文字領域きの分布状態の例を説明するための図、図４は
パラメータ最適化条件を説明するための図である。FIG. 1 is a diagram for explaining one embodiment of the image area identification method of the image processing apparatus according to the present invention, FIG. 2 is a diagram for explaining the distribution of color characters/halftone images, and FIG. 3 is a diagram for explaining the distribution of color characters/halftone images. FIG. 4 is a diagram for explaining an example of the distribution state of character areas, and FIG. 4 is a diagram for explaining parameter optimization conditions.

【００３２】線画や文字等の２値画像と写真や網点の中
間調画像が混在する原稿において、ブロック化した領域
で文字と中間調の領域の特徴量を採取するため、先に述
べたように従来も種々の方法が採用されているが、それ
らは、ランレングスや最大値と最小値との差のようにあ
る１つに限定した特徴量を採用している。これに対して
本発明は、所定の画素数のブロック化を行ってブロック
内平均値Ｐａとブロック内高レベル画素数Ｐｈとブロッ
ク内中間レベル画素数Ｐｍを採取し、その３次元的な分
布状態から領域を判定するものであり、その基本構成例
を示したのが図１である。[0032] In a document containing a mixture of binary images such as line drawings and characters, and halftone images such as photographs and halftone dots, in order to collect the feature quantities of the text and halftone regions in block areas, as described above, Although various methods have been used in the past, these methods use only one feature, such as run length or the difference between the maximum value and the minimum value. In contrast, in the present invention, a predetermined number of pixels are divided into blocks, the intra-block average value Pa, the intra-block high-level pixel number Ph, and the intra-block intermediate-level pixel number Pm are collected, and their three-dimensional distribution state is An example of the basic configuration is shown in FIG. 1.

【００３３】図１（イ）に示す実施例は、まず、８ビッ
トのブロック内平均値Ｐａと６ビットのブロック内高レ
ベル画素数Ｐｈと６ビットのブロック内中間レベル画素
数Ｐｍを量子化器１〜３でそれぞれ３ビット、４ビット
、４ビットに量子化した後、ブロック内平均値Ｐａとブ
ロック内高レベル画素数Ｐｈから７ビットのＬＵＴ４で
閾値ＴＨを読み出し、その閾値を比較器５でブロック内
中間レベル画素数Ｐｍと比較して文字／中間調の領域判
定信号を得るものである。In the embodiment shown in FIG. 1A, first, an 8-bit intra-block average value Pa, a 6-bit intra-block high-level pixel number Ph, and a 6-bit intra-block intermediate-level pixel number Pm are converted into a quantizer. After quantizing to 3 bits, 4 bits, and 4 bits in 1 to 3, respectively, the threshold TH is read out using the 7-bit LUT 4 from the intra-block average value Pa and the intra-block high-level pixel number Ph, and the threshold value is read out using the comparator 5. A character/halftone area determination signal is obtained by comparing with the number Pm of intermediate level pixels in the block.

【００３４】これに対し、同（ロ）に示す実施例は、量
子化したブロック内平均値Ｐａとブロック内高レベル画
素数Ｐｈとブロック内中間レベル画素数Ｐｍを使って１
１ビットのＬＵＴ６で文字／中間調の領域判定信号を得
るものであり、また、同（ハ）に示す実施例は、量子化
することなくそのままのブロック内平均値Ｐａとブロッ
ク内高レベル画素数Ｐｈとブロック内中間レベル画素数
Ｐｍを使って２０ビットのＬＵＴ７で文字／中間調の領
域判定信号を得るものである。On the other hand, the embodiment shown in (b) uses the quantized intra-block average value Pa, the intra-block high-level pixel number Ph, and the intra-block intermediate-level pixel number Pm to calculate 1.
A 1-bit LUT6 is used to obtain a character/halftone area determination signal, and the embodiment shown in (c) uses the intra-block average value Pa and the number of high-level pixels within the block as they are without quantization. A 20-bit LUT 7 uses Ph and the number of intermediate level pixels in the block Pm to obtain a character/halftone area determination signal.

【００３５】文字／中間調の混在する原稿について考察
すると、図２（イ）に示すように高レベルの閾値ｔｈ１
と低レベルの閾値ｔｈ２を設けた場合、文字は、例えば
Ａのように低レベルの背景から急峻に立ち上がるため、
閾値ｔｈ１を越える高レベルの画素と閾値ｔｈ２に達し
ない低レベルの画素が多く、閾値ｔｈ１と閾値ｔｈ２の
間の中間レベルの画素は少ない。つまり、相対的には文
字部の高レベルより背景の低レベルの方が多い。これに
対して中間調は、例えばＢのように各レベルに万遍なく
分布するが、領域でみると低レベルと中間レベルに分布
する場合、中間レベルに分布する場合、中間レベルと高
レベルに分布する場合、高レベルに分布する場合になる
。したがって、平均値（ブロック内平均値Ｐａ）と頻度
で見た場合には、（ロ）に示すように文字は平均値の低
い方に分布し、中間調は、平均値の高い方に分布する。[0035] Considering a manuscript containing a mixture of characters and halftones, as shown in FIG.
If a low-level threshold th2 is set, a character, such as A, rises steeply from a low-level background, so
There are many pixels with a high level exceeding the threshold th1 and pixels with a low level that does not reach the threshold th2, and there are few pixels with an intermediate level between the threshold th1 and the threshold th2. In other words, relatively speaking, there are more low levels in the background than high levels in the text area. On the other hand, intermediate tones are evenly distributed at each level, for example B, but when viewed in terms of areas, they are distributed between low and intermediate levels, when they are distributed at intermediate levels, when they are distributed between intermediate and high levels. If it is distributed, it will be distributed at a high level. Therefore, when looking at the average value (intra-block average value Pa) and frequency, as shown in (b), characters are distributed on the lower side of the average value, and halftones are distributed on the higher side of the average value. .

【００３６】すなわち、それぞれの特徴量をみると、以
下のようになる。ブロック内平均値Ｐａでは、文字領域
の場合、文字部の高レベルより背景が多く存在するため
低いところにある。また、ブロック内高レベル画素数Ｐ
ｈでは、画素信号を複数レベル（高中低の３レベル）に
量子化した際の高レベルであった画素のブロック内総数
を表すので、文字領域の場合、最低１画素以上存在し、
なおかつ低レベル画素も最低１画素以上存在する。この
条件を満たさなければ、例えばブロック内全てが高レベ
ル画素の場合は、中間調領域となる。そして、ブロック
内中間レベル画素数Ｐｍでは、画素信号を複数レベル（
高中低の３レベル）に量子化した際の中間レベルであっ
た画素のブロック内総数を表すので、文字領域では、そ
の濃度分布が急峻に変化するため少ない。That is, looking at each feature amount, it is as follows. The intra-block average value Pa is low in the case of a text area because there is more background than the high level of the text area. Also, the number of high-level pixels in the block P
h represents the total number of pixels in the block that were at a high level when the pixel signal was quantized into multiple levels (3 levels of high, medium and low), so in the case of a character area, at least one pixel exists,
Furthermore, there is also at least one low level pixel. If this condition is not met, for example, if all pixels in the block are high-level pixels, the area will be a halftone area. Then, in the number of intermediate level pixels Pm within the block, pixel signals are converted to multiple levels (
It represents the total number of pixels in a block that were at an intermediate level when quantized into three levels (high, medium, and low), and is small in character areas because the density distribution changes sharply.

【００３７】したがって、上記の３つの特徴量を３次元
的にみると、文字領域の分布状態は図３に示すようにな
る。すなわち、３種の特性値の合成により文字である範
囲がかなり限定できることがわかる。[0037] Therefore, when the above three feature quantities are viewed three-dimensionally, the distribution state of the character area is as shown in FIG. That is, it can be seen that the range of characters can be considerably limited by combining three types of characteristic values.

【００３８】今、ブロック内全画素総数をＰｔとすると
、文字領域は、少なくとも以下の条件を満たしていなけ
ればならない。Now, assuming that the total number of all pixels in a block is Pt, the character area must satisfy at least the following conditions.

【００３９】■　　Ｐｈ＋Ｐｍ≦Ｐｔ−１■　　１≦Ｐ
ｈ≦Ｐｔ−１つまり、ＰｈとＰｍからなる平面でみると、文字領域は
図４に示す黒地の領域に分布し、中間調領域は白地の領
域に分布する。したがって、■でかつ■を満足している
ときに初めて文字領域となる。[0039]■ Ph+Pm≦Pt-1■ 1≦P
h≦Pt-1 That is, when viewed on a plane consisting of Ph and Pm, the character area is distributed in the black background area shown in FIG. 4, and the halftone area is distributed in the white background area. Therefore, it becomes a character area only when it is ■ and satisfies ■.

【００４０】さらに、実際の識別パラメータは、各画像
データを統計的に解析し、上記の条件も加味して決定す
る。これを実現するための構成を示したのが先に説明し
た図１であり、（ハ）が３次元ＬＵＴに予め識別結果を
格納しておき、Ｐａ、Ｐｈ、Ｐｍの各８、６、６ビット
の結果を合成した２０ビットをＬＵＴのアドレスとして
入力するように構成したものである。これに対して（ロ
）はＬＵＴの容量を削減するため、Ｐａ、Ｐｈ、Ｐｍに
量子化器を挿入し、それぞれのデータ語長を圧縮してＬ
ＵＴの容量を減らしたものである。つまり、ＬＳＩ化す
るためには、内部でなるべくメモリを小さくすることが
必要であり、この場合、３、４、４の１１ビットまでの
削減を実現している。勿論、この量子化は単純に適用で
きないが、Ｐａ、Ｐｈ、Ｐｍの分布状態と量子化を非線
形にすることによって、この程度の圧縮をしても識別性
能に支障を来さないことが確認できた。そして、（イ）
はＰｍの特性に着目して（ロ）のハードウエア規模をさ
らに縮小したものであり、２次元のＬＵＴと１つの比較
器を用いた構成となっている。これは、図３、図４を見
たときに、文字領域のＰｍの分布状態が必ずＰｍ＝０を
最小値とし、Ｐｍ≦Ｐｔ−Ｐｈ−１を最大値とする範囲
に入ることに着目している。これによりＰｍの値がＰａ
、Ｐｈによって予め決定されたＰｍの最大値の閾値と比
較して、小さければ文字領域、大きければ中間調領域と
いう判定方式が実現できることが分かる。これにより、
識別性能を落とさず、ハードウエア実現性を向上できる
。Furthermore, the actual identification parameters are determined by statistically analyzing each image data and taking into account the above conditions. The configuration for realizing this is shown in FIG. 1 described above, in which (c) stores the identification results in a three-dimensional LUT in advance, The configuration is such that 20 bits obtained by combining the bit results are input as an address of the LUT. On the other hand, in (b), in order to reduce the capacity of the LUT, a quantizer is inserted into Pa, Ph, and Pm, and the data word length of each is compressed.
This is a UT with reduced capacity. In other words, in order to implement an LSI, it is necessary to make the internal memory as small as possible, and in this case, the reduction to 3, 4, 4 bits to 11 has been achieved. Of course, this quantization cannot be applied simply, but by making the distribution state and quantization of Pa, Ph, and Pm nonlinear, it can be confirmed that even with this degree of compression, there will be no problem with the identification performance. Ta. And (a)
The hardware scale of (b) is further reduced by focusing on the characteristics of Pm, and has a configuration using a two-dimensional LUT and one comparator. This is because when looking at Figures 3 and 4, it is noted that the distribution state of Pm in the character area always falls within the range where Pm = 0 is the minimum value and Pm≦Pt-Ph-1 is the maximum value. ing. As a result, the value of Pm becomes Pa
, Ph can be compared with the threshold value of the maximum value of Pm predetermined by Ph, and it is possible to realize a determination method in which the character region is determined if the value is smaller, and the halftone region is determined if the value is larger. This results in
Hardware feasibility can be improved without reducing identification performance.

【００４１】図５は図１（イ）の実施例を適用した処理
回路の構成例を示す図、図６は図１（ロ）の実施例を適
用した処理回路の構成例を示す図、図７は図１（ハ）の
実施例を適用した処理回路の構成例を示す図である。FIG. 5 is a diagram showing a configuration example of a processing circuit to which the embodiment of FIG. 1(A) is applied, and FIG. 6 is a diagram showing a configuration example of a processing circuit to which the embodiment of FIG. 1(B) is applied. 7 is a diagram showing a configuration example of a processing circuit to which the embodiment of FIG. 1(c) is applied.

【００４２】図５において、入力信号Ｌ＊　、ａ＊　、
ｂ＊　は、システムバリューで表現された８ビットの信
号であり、Ｌ＊　軸で明度を表し、これと直交するａ＊
　軸とｂ＊軸の２次元平面で彩度と色相を表すものであ
る。非線形量子化器１１は、各画素の８ビットの信号Ｌ
＊　を４ビットに圧縮するものであり、量子化器１２は
、各画素の８ビットの信号Ｌ＊　を閾値ｔｈ１、ｔｈ２
で高レベル／中間レベル／低レベルにレベル分けした２
ビットの信号を出力するものであり、色相識別器１３は
、各画素の８ビットの信号Ｌ＊　、ａ＊　、ｂ＊　から
色か黒か白かを識別した２ビットの色相信号を出力する
ものである。In FIG. 5, input signals L*, a*,
b* is an 8-bit signal expressed as a system value, and the L* axis represents brightness, and a* is orthogonal to this.
Saturation and hue are expressed on a two-dimensional plane of the axis and b* axis. The nonlinear quantizer 11 receives an 8-bit signal L of each pixel.
* to 4 bits, and the quantizer 12 converts the 8-bit signal L* of each pixel into thresholds th1 and th2.
Classified into high level / intermediate level / low level 2
The hue discriminator 13 outputs a 2-bit hue signal that identifies color, black, or white from the 8-bit signals L*, a*, b* of each pixel. It is.

【００４３】ブロック化ＦＩＦＯ１４は、副走査方向に
４ライン、主走査方向に８画素のデータを保持して４×
８のブロック化を行い、また、８ラインの入力ラインか
ら１ラインずつ間引きして４ラインを保持することによ
って８×８のサイズに相当するブロック化を行うもので
ある。つまり、精度を上げるためにブロックサイズを大
きくしようとするときに、間引きブロック化が採用され
る。ブロック化ＦＩＦＯ１４に保持する１画素のデータ
は、図２（ハ）に示すように圧縮した４ビットの信号Ｌ
＊　と２ビットのレベル分け信号と２ビットの色相信号
からなる８ビットであり、それぞれ４ビットの信号Ｌ＊
　は平均値算出回路１６と比較器１５に、２ビットのレ
ベル分け信号はカウンタ１７と１８に、２ビットの色相
信号はブロック色相識別器１９に分配される。The blocking FIFO 14 holds data of 4 lines in the sub-scanning direction and 8 pixels in the main scanning direction, and
In addition, by thinning out one line at a time from the eight input lines and retaining four lines, blocks corresponding to the size of 8×8 are performed. In other words, when attempting to increase the block size to improve accuracy, thinning blocks is employed. One pixel data held in the blocking FIFO 14 is a compressed 4-bit signal L as shown in FIG.
* It is an 8-bit signal consisting of a 2-bit level division signal and a 2-bit hue signal, each of which is a 4-bit signal L*.
is distributed to an average value calculation circuit 16 and a comparator 15, a 2-bit level classification signal is distributed to counters 17 and 18, and a 2-bit hue signal is distributed to a block hue discriminator 19.

【００４４】平均値算出回路１６は、ブロック内画素の
信号Ｌ＊　を加算してブロック内平均値Ｐａを算出する
ものである。カウンタ１７はレベル分け信号から高レベ
ル画素数Ｐｈをカウントするものであり、カウンタ１８
は中間レベル画素数Ｐｍをカウントするものである。量
子化器１２において、低レベルの閾値をｔｈ１、高レベ
ルの閾値をｔｈ２とし、信号Ｌ＊　が閾値ｔｈ１とｔｈ
２との間にある場合に２ビットのうちの下位ビットのみ
を「１」に、閾値ｔｈ２を越える場合には上位ビットの
みを「１」にすると、カウンタ１７は上位ビットをカウ
ントし、カウンタ１８は下位ビットをカウントすること
になる。The average value calculation circuit 16 calculates the intra-block average value Pa by adding the signals L* of the pixels within the block. The counter 17 counts the number of high level pixels Ph from the level division signal, and the counter 18
is for counting the number of intermediate level pixels Pm. In the quantizer 12, the low level threshold is set to th1, the high level threshold is set to th2, and the signal L* is set to the thresholds th1 and th.
2, the lower bit of the 2 bits is set to "1", and when it exceeds the threshold th2, only the upper bit is set to "1", the counter 17 counts the upper bits, and the counter 18 will count the lower bits.

【００４５】そして、図１（イ）と同様の非線型量子化
器２０〜２２でそれぞれのデータを圧縮した後、ＬＵＴ
２３、比較器２４により文字／中間調の判定信号を出力
する。ＬＵＴ２３は、図３に示す３次元空間のＰａ─Ｐ
ｈ平面（図示左側平面）で文字領域とされる図示実線の
ブロックにおけるＰｍの閾値を読み出すものであり、比
較器２４でこの閾値の方が大きい場合には、図３に示す
３次元空間において実線のブロック内に入り、文字領域
の判定信号が出力される。ＬＵＴ２３と比較器２４との
間に挿入接続された加算器は、バイアスＴＨｂｉａｓに
よりＬＵＴ２３から読み出された閾値のバイアスを調整
するものであり、文字の領域を調整するものである。After each data is compressed by nonlinear quantizers 20 to 22 similar to those shown in FIG. 1(a), the LUT
23, the comparator 24 outputs a character/halftone determination signal. LUT23 is Pa-P in the three-dimensional space shown in Figure 3.
This is to read the threshold value of Pm in the block indicated by the solid line in the figure, which is a character area in the h plane (left plane in the figure), and if this threshold value is larger in the comparator 24, the solid line in the three-dimensional space shown in FIG. 3 is read out. enters the block, and a character area determination signal is output. The adder inserted and connected between the LUT 23 and the comparator 24 is used to adjust the bias of the threshold value read from the LUT 23 using the bias THbias, and is used to adjust the character area.

【００４６】比較器１５とフラグ検出器２５は、背景ブ
ロックか否かを検出するものであって、比較器１５で圧
縮した４ビットの信号Ｌ＊　を閾値ＴＨａｖと比較し、
フラグ検出器で閾値ＴＨａｖを越える画素がブロック内
にあったか否かを検出するものである。したがってここ
では、ブロック内の全ての画素が閾値ＴＨａｖより小さ
い場合には、背景であると判定する。つまり、閾値ＴＨ
ａｖを越えるものが最低１画素以上ないと文字／中間調
のいずれでもなく背景とするものである。誤り補正回路
２６は、文字／中間調の判定信号と背景検出信号を入力
して例えば周囲のブロックの判定信号と比較してさらに
大きなブロック単位での補正を行うものであり、文字／
中間調の判定信号を出力する。The comparator 15 and flag detector 25 are for detecting whether it is a background block or not, and compare the 4-bit signal L* compressed by the comparator 15 with a threshold value THav.
A flag detector detects whether there are pixels in the block that exceed the threshold value THav. Therefore, here, if all the pixels in a block are smaller than the threshold THav, it is determined that the block is the background. In other words, the threshold TH
If there is not at least one pixel that exceeds av, it is treated as a background rather than a character or halftone. The error correction circuit 26 inputs a character/halftone determination signal and a background detection signal, and performs correction in larger block units by comparing it with, for example, the determination signals of surrounding blocks.
Outputs a halftone judgment signal.

【００４７】ブロック色相識別器１９は、ブロック化Ｆ
ＩＦＯ１４でブロック化された各画素の色相信号から例
えば多数決判定によりブロック単位でカラーか白黒かの
色相を識別するものであり、ディレイ回路２８は、カラ
ー／白黒のブロック色相信号をディレイさせて誤り補正
回路２６の文字／中間調の判定信号とを同期をとるもの
である。そして、最終判定回路２７は、文字／中間調の
判定信号とカラー／白黒のブロック色相信号から最終的
に色文字／黒文字／中間調のＴ／Ｉ（テキストイメージ
）領域判定信号を出力するものである。[0047] The block hue discriminator 19
The hue signal of each pixel divided into blocks by the IFO 14 is used to identify whether the hue is color or black and white in each block by majority decision, and the delay circuit 28 corrects errors by delaying the color/black and white block hue signal. This is to synchronize the character/halftone determination signal of the circuit 26. The final determination circuit 27 finally outputs a color character/black character/halftone T/I (text image) region determination signal from the character/halftone determination signal and the color/monochrome block hue signal. be.

【００４８】図６に示す例は、信号Ｌ＊　の非線型量子
化器を省き上位４ビットだけをブロック化ＦＩＦＯの入
力とし、平均値算出器１６、カウンタ１７、１８で求め
たブロック内平均値Ｐａとブロック内中間レベル画素数
Ｐｍとブロック内高レベル画素数Ｐｈをそれぞれ非線型
量子化器（ＬＵＴ）２０〜２２で圧縮した後、これらを
直接判定用のＬＵＴ２３の入力とし１１ビットの入力で
２ビットの出力を得るように構成したものである。In the example shown in FIG. 6, the nonlinear quantizer of the signal L* is omitted, only the upper 4 bits are input to the blocking FIFO, and the intra-block average value calculated by the average value calculator 16 and counters 17 and 18 is calculated. After compressing Pa, the number of mid-level pixels in the block Pm, and the number of high-level pixels in the block Ph using nonlinear quantizers (LUTs) 20 to 22, these are input to the LUT 23 for direct determination, and are input with 11 bits. It is configured to obtain a 2-bit output.

【００４９】また、図７に示す例は、トナー信号ｙｍｃ
を入力信号とし、最大値検出回路３１を使って先の実施
例と同様の信号を採取すると共に、ブロック内平均値Ｐ
ａとブロック内中間レベル画素数Ｐｍとブロック内高レ
ベル画素数Ｐｈを圧縮することなく、直接判定用のＬＵ
Ｔ３２の入力とし２０ビットの入力で２ビットの出力を
得るように構成したものである。Further, in the example shown in FIG. 7, the toner signal ymc
is used as an input signal, the maximum value detection circuit 31 is used to collect the same signal as in the previous embodiment, and the intra-block average value P
LU for direct determination without compressing a, the number of intermediate level pixels in the block Pm, and the number of high level pixels in the block Ph.
It is configured to obtain a 2-bit output with a 20-bit input to T32.

【００５０】なお、本発明は、上記の実施例に毛低され
るものではなく、種々の変形が可能である。例えば上記
の実施例では、ブロック化回路で所定のサイズにブロッ
ク化を行ってブロック単位の領域識別信号を順次出力す
るように構成したが、判定するブロックをラップさせる
ようにしてもよい。このようにすることによって、判定
の安定性、判定精度の向上を計ることができる。また、
領域検出用信号として明度、彩度、色相の情報からなる
信号Ｌ＊　、ａ＊、ｂ＊　を入力し処理する構成を基本
に説明したが、図７に示す例のようにトナー信号ＹＭＣ
を入力とする他、光色分解信号ＢＧＲを入力とし、最大
値或いは最小値を用いるように構成してもよいし、輝度
信号を用いるようにしてもよいことはいうまでもない。さらに、ブロックサイズは、縮拡率に応じて変化させる
ようにしてもよい。例えば拡大時には副走査方向のライ
ン数が少なくなるので、識別精度が低下するという問題
が生じるが、ブロックサイズを変化させることによって
このような問題を解消することができる。Note that the present invention is not limited to the above-described embodiments, and various modifications are possible. For example, in the above-mentioned embodiment, the blocking circuit blocks the blocks to a predetermined size and sequentially outputs area identification signals for each block, but the blocks to be determined may be wrapped. By doing so, it is possible to improve the stability of determination and the accuracy of determination. Also,
The basic explanation has been based on a configuration in which signals L*, a*, b* consisting of lightness, saturation, and hue information are input and processed as area detection signals, but as shown in the example shown in FIG.
It goes without saying that, in addition to the input, the light color separation signal BGR may be input, and the maximum value or minimum value may be used, or the luminance signal may be used. Furthermore, the block size may be changed depending on the reduction/enlargement ratio. For example, when enlarging, the number of lines in the sub-scanning direction decreases, resulting in a problem of reduced identification accuracy, but this problem can be solved by changing the block size.

【００５１】[0051]

【発明の効果】以上の説明から明らかなように、本発明
によれば、複数画素をブロック化し、そのブロック内平
均値Ｐａとブロック内高レベル画素数Ｐｈとブロック内
中間レベル画素数Ｐｍという３つの特徴量を採取して、
これらからなる３次元空間で文字／中間調の領域判定を
行うので、領域判定の設定の自由度が増し領域判定の精
度を上げることができる。また、非線型量子化器やＬＵ
Ｔ等の圧縮器を用いることによって判定値の調整を容易
にし、ハードウエア規模の縮小、簡素化を図ることがで
きる。さらに、入力信号を量子化してブロック化するこ
とにより、ブロック化のためのラインメモリ（ＦＩＦＯ
）の数を削減することができる。As is clear from the above description, according to the present invention, a plurality of pixels are divided into blocks, and three values are calculated: the average value Pa within the block, the number Ph of high-level pixels within the block, and the number Pm of intermediate-level pixels within the block. Collect two features and
Since character/halftone area determination is performed in a three-dimensional space consisting of these, the degree of freedom in setting area determination is increased, and the accuracy of area determination can be improved. In addition, nonlinear quantizers and LU
By using a compressor such as T, it is possible to easily adjust the determination value and to reduce and simplify the hardware scale. Furthermore, by quantizing the input signal and creating blocks, line memory (FIFO) for blocking is used.
) can be reduced.

[Brief explanation of drawings]

【図１】　　本発明に係る画像処理装置の画像領域識別
方式の１実施例を説明するための図である。FIG. 1 is a diagram for explaining one embodiment of an image area identification method of an image processing apparatus according to the present invention.

【図２】　　カラー文字／中間調画像の分布を説明する
ための図である。FIG. 2 is a diagram for explaining the distribution of color characters/halftone images.

【図３】　　文字領域きの分布状態の例を説明するため
の図である。FIG. 3 is a diagram for explaining an example of the distribution state of character areas.

【図４】　　パラメータ最適化条件を説明するための図
である。FIG. 4 is a diagram for explaining parameter optimization conditions.

【図５】　　図１（イ）の実施例を適用した処理回路の
構成例を示す図である。FIG. 5 is a diagram showing a configuration example of a processing circuit to which the embodiment of FIG. 1(A) is applied.

【図６】　　図１（ロ）の実施例を適用した処理回路の
構成例を示す図である。FIG. 6 is a diagram illustrating a configuration example of a processing circuit to which the embodiment of FIG. 1(b) is applied.

【図７】　　図１（ハ）の実施例を適用した処理回路の
構成例を示す図である。7 is a diagram showing an example of the configuration of a processing circuit to which the embodiment of FIG. 1(c) is applied; FIG.

【図８】　　従来のカラー画像処理装置の構成例及び画
像領域識別回路の構成例を示す図である。FIG. 8 is a diagram showing a configuration example of a conventional color image processing device and a configuration example of an image area identification circuit.

[Explanation of symbols]

１〜３、１１、１２、２０〜２２…量子化器、４、６、
７、２３…ＬＵＴ、５、１５、２４…比較器、１３…色
相識別器、１４…ブロック化ＦＩＦＯ、１６…平均値算
出器、１７、１８…カウンタ、１９…ブロック色相識別
器、２５…フラグ検出器1 to 3, 11, 12, 20 to 22...quantizer, 4, 6,
7, 23... LUT, 5, 15, 24... Comparator, 13... Hue discriminator, 14... Blocking FIFO, 16... Average value calculator, 17, 18... Counter, 19... Block hue discriminator, 25... Flag Detector

Claims

[Claims]

1. Blocking means for forming a plurality of pixels into blocks; characteristic value detection means for detecting a plurality of characteristic values from pixels within the block; and area identification for identifying character/halftone areas from the plurality of characteristic values. The method is characterized in that it is configured to block a plurality of pixels, detect a plurality of characteristic values in the block, and identify characters/halftone regions of the original image from the plurality of characteristic values. An image area identification method for an image processing device.

2. The plurality of characteristic values within a block are the average value within the block and the sum total value within each block of high level pixels and intermediate level pixels when the image signal is quantized into three levels using two threshold values. 2. An image area identification method for an image processing apparatus according to claim 1, characterized in that there are three types.

3. The area identification means stores the identification result in an LUT, and uses a value obtained by combining a plurality of characteristic values as an address in the LUT.
3. The image area identification method for an image processing apparatus according to claim 1, wherein the method is configured to read out identification results in the UT.

4. A plurality of characteristic value quantization means are provided, and the area identification means is configured to quantize the plurality of characteristic values, compress the respective word lengths, and then synthesize the plurality of characteristic values to obtain address data of the LUT. 4. An image area identification method for an image processing apparatus according to claim 3.

5. The region identification means includes quantization means for a plurality of characteristic values, an LUT in which a threshold value of the sum of intermediate level pixels for region identification is set in advance, and a threshold value set in the LUT and the intermediate level pixels. and a comparison means for comparing the total sum value of the intermediate level pixels, and calculates the threshold value of the total sum value of the intermediate level pixels by pulling the LUT using the quantized and synthesized value of the intra-block average value and the total sum value of the high level pixels as an address, 3. The image area identification method for an image processing apparatus according to claim 2, wherein the area within the block is identified by comparing the threshold value with a total value of intermediate level pixels actually determined.

6. An adder/subtractor is provided at a subsequent stage of the LUT in which a threshold value for the sum of the intermediate level pixels is set, and the LUT
6. The image area identification method for an image processing apparatus according to claim 5, wherein the determination threshold in the comparison means can be adjusted by adding or subtracting the threshold and the adjustment value.

7. Image region identification of an image processing apparatus according to claim 1 or 2, characterized in that quantization means for quantizing pixel data for obtaining a plurality of specific values is provided before the blocking means. method.

8. Image area identification of the image processing apparatus according to claim 1, wherein the blocking means is configured to expand the observation width of the block in the sub-scanning direction by thinning out the blocks in accordance with the line synchronization signal. method.

9. Hue identification means for identifying hue on a pixel-by-pixel basis; blocking means for forming blocks from a plurality of pixels;
It is equipped with a block hue identification means that counts the hue signals of each block and determines the hue of the entire block by majority decision. An image area identification method for an image processing device, characterized in that the hue of the entire block is determined by majority decision.

10. The present invention is characterized by comprising an area identification means for identifying three types of image areas of a document image, a black character area, a color character area, and a halftone area, from a character/halftone area identification signal and a hue identification signal. An image area identification method for an image processing apparatus according to claim 1 or 9.

11. Quantization means for quantizing the area identification signal and the hue identification signal, wherein the word length of the signal obtained by combining the area identification signal and the hue identification signal is a finite word of one pixel of the blocking means. Claim 1 characterized in that the area identification signal and the hue identification signal are configured so that they can be simultaneously blocked by the same blocking means by limiting the signal to within a length.
Alternatively, the image area identification method of the image processing apparatus according to 10.

12. Means for comparing pixels in a block with a threshold value for background determination, wherein the block is determined to be a character area on the condition that there is at least one pixel equal to or greater than the threshold value in the block. The image area identification method for an image processing apparatus according to claim 1 or 10, characterized in that the identification result is adopted.