JPH03153167A

JPH03153167A - Character area separation system

Info

Publication number: JPH03153167A
Application number: JP1291353A
Authority: JP
Inventors: Kaoru Imao; 今尾　薫; Satoshi Ouchi; 敏大内
Original assignee: Ricoh Co Ltd
Current assignee: Ricoh Co Ltd
Priority date: 1989-11-10
Filing date: 1989-11-10
Publication date: 1991-07-01

Abstract

PURPOSE:To extract both a white background character part and a dot character part as a character area by converting an input picture into a digital multi- value density data, separating the area into a dot part, a non-dot part, a photograph part and a non-photograph part respectively and ANDing the non-dot part and the non-photograph part. CONSTITUTION:An inputted picture signal is converted into a digital multi-value density data at an A/D converter 1 and fed respectively to a dot/non-dot separate circuit 2, a photograph/non-photograph separate circuit 3 and a dot- character/non-dot-character separate circuit 4. An overall discrimination circuit 5 ANDs the result of dot/non-dot separation outputted from the dot/non-dot separate circuit 2 and the result of photograph/non-photograph separation outputted from the photograph/non-photograph separate circuit 3 and extracts a picture area being a non-dot and a non-photograph part as a white background character part. Then the circuit 5 discriminates the white background character part and the dot character part separated by the dot-character/non-dot-character separate circuit 4 as the character area and discriminates other areas as the non-character area and outputs an area discrimination signal representing character/non-character corresponding to each area.

Description

【発明の詳細な説明】［産業上の利用分野〕本発明は、文字、網点、写真の混在する画像中から文字
部を精度良く分離するための文字領域分離方式に関する
。DETAILED DESCRIPTION OF THE INVENTION [Field of Industrial Application] The present invention relates to a character area separation method for accurately separating character parts from an image containing a mixture of characters, halftone dots, and photographs.

[Conventional technology]

一般に、文書画像中には、線画からなる文字、ドツトか
らなる網点、連続階調の写真などが混在するのが普通で
ある。このような混在画像を、例えばディジタル式複写
機やファクシミリなどで画像再生する場合、再生画像の
画質向上を図るために、その前処理として、文字部には
エツジ強調による鮮鋭化処理を、またそれ以外の非文字
部にはモアレ除去・階調性再現のためのデイザ処理など
を適応的に施すことが望ましい。そのためには、まず、
入力画像を文字部とそれ以外の非文字部とに正確に分離
する必要がある。In general, a document image usually contains a mixture of characters consisting of line drawings, halftone dots consisting of dots, continuous tone photographs, and the like. When such a mixed image is reproduced using a digital copying machine or facsimile, for example, in order to improve the image quality of the reproduced image, pre-processing includes sharpening processing by edge emphasis on the text, and It is desirable to adaptively apply dither processing to remove moiré and reproduce gradation for non-text portions other than the above. To do this, first,
It is necessary to accurately separate the input image into text parts and non-text parts.

従来、上記した文字、ｙｊ点、写真の混在する画像中か
ら文字部を分離するために多くの方法が提案されている
。これら従来の方法は、いずれも、基本的には、写真部
か非写真部かを判定するブロック、網点部か非網点部か
を判定するブロック、これら２つのプロ・；・りにおけ
る判定結果に基づいて、非写真部でかつ非網点部と判定
された領域を文字部と判定するブロックにより構成して
いるのが普通である。Conventionally, many methods have been proposed for separating text from an image containing the above-mentioned text, yj points, and photographs. All of these conventional methods basically consist of a block for determining whether it is a photo area or a non-photo area, a block for determining whether it is a halftone area or a non-halftone area, and a block for determining whether it is a halftone area or a non-halftone area. Based on the results, areas that are determined to be non-photograph areas and non-halftone areas are usually configured with blocks that are determined to be character areas.

例えば、従来の文字領域分離方式の１例を挙げれば、ｒ
文字・写真・網点印刷の混在する画像の２値化処理方法
ｊ　（南口俊彦；平成元年度画像電子学会全国大会予稿
２２　＋　ｐｐ９１−ｐｐ９４）がある。For example, to give an example of a conventional character region separation method, r
There is a binarization processing method for images containing a mixture of text, photographs, and halftone dot printing (Toshihiko Minamiguchi; Proceedings of the National Conference of the Institute of Image Electronics Engineers, 1989, 22 + pp91-pp94).

この方法は、まず画像中の画素の白黒変化点のエツジを
検出することにより、画像を写真部と非写真部とに分離
し、さらにこの分離した非写真部を２値化し、この２値
化画像に対して黒画素と白画素の分布状態に基づいたパ
ターンマツチングを行うことにより、文字部（非網点部
）と網点部を分離するものである。This method first separates the image into photographic areas and non-photographic areas by detecting the edges of black and white transition points of pixels in the image, then binarizes the separated non-photographic areas, and then converts the image into two parts. By performing pattern matching on an image based on the distribution state of black pixels and white pixels, character areas (non-halftone areas) and halftone areas are separated.

上記したように文字部（非網点部）と網点部を分離する
場合に、その分離精度をさらに向上しようとすると、上
記南口の提案方法に限らず、どのような文字領域分離方
法においても、−船釣にその分離結果に対して周辺の画
素の分離結果に基づいた領域補正を行う必要がある。こ
の周辺画素に基づいた領域補正を行うことにより、網点
部を文字部にまた文字部を網点部にそれぞれ誤って分離
する確率をさらに減少することができる。When separating text areas (non-halftone areas) and halftone areas as described above, in order to further improve the separation accuracy, it is difficult to improve the separation accuracy not only with the method proposed by Minamiguchi above but also with any text area separation method. , - It is necessary to perform area correction on the separation result based on the separation result of surrounding pixels during boat fishing. By performing area correction based on the surrounding pixels, it is possible to further reduce the probability of erroneously separating a halftone dot part into a text part and a text part into a halftone dot part.

[Problem to be solved by the invention]

ところで、画像中の文字には、背景が白地の文字部（以
下「白地文字部Ｊという）と、背景が網点の文字部（以
下「網地文字部」という）が存在する。このような画像
に対し、上記領域補正を行うと、網地文字部は背景が網
点からなるため、まず文字のまわりの網点画素が網点と
して１次判定され、この文字周辺の網点画素の判定結果
に基づいて文字画素の領域補正が行われるため、網点上
に存在する文字画素が網点領域として誤判定される確率
が極めて大きく、網地文字部を正確に抽出することがで
きなくなるという問題があった。By the way, the characters in the image include a character part with a white background (hereinafter referred to as "white character part J") and a character part with a background of halftone dots (hereinafter referred to as "halftone character part"). When the above area correction is performed on such an image, since the background of the halftone text part consists of halftone dots, the halftone dot pixels around the character are first determined as halftone dots, and the halftone dots around this character are first determined as halftone dots. Since area correction of character pixels is performed based on the pixel determination results, there is an extremely high probability that character pixels existing on halftone dots will be incorrectly determined as halftone dot areas, making it difficult to accurately extract halftone text. The problem was that I couldn't do it.

本発明は上記事情に基づきなされたもので、その目的と
するところは、文字、網点、写真の混在する画像中から
白地文字部および網地文字部の両者を精度良く抽出する
ことのできる文字領域分離方式を提供することである。The present invention has been made based on the above circumstances, and an object of the present invention is to enable accurate extraction of both white text and halftone text from an image containing a mixture of text, halftone dots, and photographs. The purpose is to provide a region separation method.

[Means to solve the problem]

本発明の文字領域分離方式は、第１図にその原理を示す
ように、入力画像をディジタル多値濃度データに変換し
た後、該ディジタル多値濃度データに基づいて、入力画
像を、網点部と非網点部、写真部と非写真部、背景が網
点からなる網地文字部と背景がそれ以外の非網地文字部
にそれぞれ領域分離し、非網点部と非写真部の論理積を
とることにより非網点部でかつ非写真部となる画像領域
を背景が白地からなる白地文字部と判定し、該白地文字
部と前記網地文字部の両者を文字領域と判定して抽出す
るようにしたものである。The character area separation method of the present invention, as shown in the principle in FIG. The areas are separated into a halftone and non-halftone part, a photographic part and a non-photographic part, a halftone text part whose background is halftone dots, and a non-halftone text part whose background is other than that, and the logic of the non-halftone part and non-photographic part is divided into By calculating the product, an image area that is a non-halftone area and a non-photo area is determined to be a white text area with a white background, and both the white background text area and the halftone text area are determined to be character areas. It was designed to be extracted.

〔作　用］入力画像は、ディジタル多値濃度データに変換され、網
点部と非網点部、写真部と非写真部に領域分離される。[Operation] The input image is converted into digital multivalued density data and separated into halftone dot areas and non-halftone dot areas, photographic areas and non-photographic areas.

入力画像を網点部と非網点部に分熱するは、ディジタル
多値濃度データに基づいて濃度の山と谷の変化点を与え
る極点画素を検出し、所定の小領域における極点画素の
密度情報から注目画素が網点候補領域または非網点候補
領域のいずれに属するかを判定ｊ７、周囲の画素の判定
結果に基づいて所定の領域内の注目画素が網点部または
非網点部のいずれであるかを判定ずればよい。To divide the input image into halftone and non-halftone areas, the extreme pixels that give the changing points of the peaks and troughs of density are detected based on the digital multilevel density data, and the density of the extreme pixels in a predetermined small area is calculated. It is determined from the information whether the pixel of interest belongs to a halftone dot candidate area or a non-halftone dot candidate areaj7, and based on the determination results of surrounding pixels, whether the pixel of interest in a predetermined area belongs to a halftone dot area or a non-halftone dot area. All you have to do is decide which one it is.

また、入力画像を写真部と非写真部とに分離するには、
ディジタル多値濃度データに基づいて入力画像を黒画素
と白画素に５分類し、該分類した黒画素群と自画素群中
から連結する黒画素および連結する白画素をそれぞれ検
出Ｌ７、注目画素の近傍に存在する連結黒画素と連結白
画素の数に基づいて注目画素が写真部または非写真部の
いずれであるかを判定すればよい。Also, to separate the input image into photo and non-photo parts,
The input image is classified into five black pixels and white pixels based on digital multilevel density data, and connected black pixels and connected white pixels are respectively detected from the classified black pixel group and the own pixel group. It may be determined whether the pixel of interest is in a photographic area or a non-photographic area based on the number of connected black pixels and connected white pixels existing in the vicinity.

上記のようにして分離された非網点部には、網点以外の
画像たる文字部と写真部とが含まれている。一方、非写
真部には、写真以外の画像たる文字部と網点部とが含ま
れている。したがって、この非網点部と非写真部の論理
積をとり、非編点部でかつ非写真部となる画像領域のみ
を抽出すれば、画像中から文字部のみを抽出することが
できる。The non-halftone dot area separated as described above includes a text area and a photographic area, which are images other than halftone dots. On the other hand, the non-photograph area includes a text area and a halftone dot area, which are images other than photographs. Therefore, by performing the logical product of the non-halftone dot area and the non-photograph area and extracting only the image area that is the non-knit area and the non-photograph area, only the text area can be extracted from the image.

しかし、上記のようにして抽出された文字部は、背景が
白地のいわゆる白地文字部だけであり、背景が網点から
なるいわゆる絹地文字部は抽出されないい何故なら、網
点部と非網点部の分離処理において、分離精度を上げる
ために、前述した領域補正が行われるからである。すな
わち、網点部と非網点部の分離処理において、画像中の
各画素が網点候補領域または非網点候？ｎｆ領域のいず
れム、Ｊａするか否かを判定１〜だ後、さらに、その周
辺の複数の画素の判定結果に基づいて網点部または非網
点部のいずれであるか判定する、：二とにより領域補正
を行っている。このため１、背景が編点からなる網地文
字部は網点として領域補正されてしまい、。However, the text extracted as described above is only the so-called white text with a white background, and the so-called silk text with a background of halftone dots is not extracted. This is because the above-mentioned area correction is performed in order to improve the separation accuracy in the separation processing of the area. That is, in the separation process of halftone dot areas and non-halftone dot areas, is each pixel in the image a halftone dot candidate area or a non-halftone dot candidate? After determining whether any part of the nf area is Ja or not, it is further determined whether it is a halftone part or a non-halftone part based on the determination results of a plurality of surrounding pixels. Area correction is performed by For this reason, 1. The halftone text portion whose background is made of knitting dots is corrected as a halftone dot.

網点上に存在する文字を分離することができないからで
ある。This is because characters existing on the halftone dots cannot be separated.

そこで、白地文字部に加え、背景がｍ点からなる網地文
字部も分離抽出するために、上記白地文４字部の分離処
理とは別に網地文字部のための分離処理を付加する。こ
の網地文字部とそれ以外の井網地文字部とを分離するに
は、ディジタル多値濃度データを所定の重み係数で平滑
化した後、所定の局所領域内の濃度変化に基づいて画像
中の尾根画素を検出し、尾根画素の密度情報に基づいて
注目の尾根画素が背景網点の文字部または背景非網点の
文字部のいずれであるかを判定すればよい。Therefore, in order to separate and extract not only the white text part but also the halftone text part whose background is made up of m points, a separation process for the halftone text part is added in addition to the separation process for the four character parts on a white background. In order to separate this halftone text portion from other halftone text portions, the digital multilevel density data is smoothed using a predetermined weighting coefficient, and then the density changes in a predetermined local area are calculated based on the density changes within the image. , and determine whether the ridge pixel of interest is a character part of a background halftone dot or a character part of a non-halftone dot background based on the density information of the ridge pixel.

そして、上記のようにして抽出された白地文字部と網地
文字部とを文字領域として判定出力する。Then, the white text portion and halftone text portion extracted as described above are determined and output as a character region.

これ／、、’：よ、゛ン＼入力画像。中の白地文字部と
網地文字部の肉汁を積度ハ！＜分離することができる。This/,,':Yo, \input image. Add the juices from the white text and net text inside! <Can be separated.

（実施例）以下、図面６尾仝照ｊ２、ｌ−本発明の実施例につき説
明する。(Embodiments) Hereinafter, embodiments of the present invention will be described with reference to Figures 6 and 6.

第２図は、本発明の１実施例のブロック図を示す。図中
、１はＡ／Ｄ変換回路、２は網点／非網点分離回路、３
は写真／非写真分離回路、４は網地文字／非網地文字分
離回路、５は総合判定回路である。FIG. 2 shows a block diagram of one embodiment of the invention. In the figure, 1 is an A/D conversion circuit, 2 is a halftone/non-halftone separation circuit, and 3
4 is a photo/non-photo separation circuit, 4 is a screen character/non-screen character separation circuit, and 5 is a comprehensive determination circuit.

入力した画像信号は、Ａ／Ｄ変換回路１においてディジ
タル多値濃度データに変換された後、網点／非網点分離
回路２、写真／非写真分離回路３、網地文字／非網地文
字分離回路４にそれぞれ送られる。The input image signal is converted into digital multi-level density data in the A/D conversion circuit 1, and then processed through a halftone/non-halftone separation circuit 2, a photo/non-photo separation circuit 3, and halftone/non-halftone text. The signals are respectively sent to the separation circuit 4.

網点／非網点分離回路２は、ディジタル多値濃度データ
に基づき、後述するようにして入力画像を網点部と非網
点部とに分離する。また、写真／非写真分離回路３は、
ディジタル多値濃度データに基づき、後述するようにし
て入力画像を写真部と非写真部とに分離する。さらに、
網地文字／非網地文字分離回路４は、ディジタル多値濃
度データに基づき、後述するようにして入力画像を網地
文字部と非網地文字部とに分離する。これら各分離回路
の分離結果は総合判定回路５へ送られる。The halftone dot/non-halftone dot separation circuit 2 separates the input image into a halftone dot portion and a non-halftone dot portion based on the digital multilevel density data, as will be described later. Further, the photo/non-photo separation circuit 3 is
Based on the digital multilevel density data, the input image is separated into photographic parts and non-photographic parts as will be described later. moreover,
The halftone text/non-halftone character separation circuit 4 separates the input image into halftone text portions and non-halftone text portions based on the digital multilevel density data, as will be described later. The separation results of each of these separation circuits are sent to the comprehensive judgment circuit 5.

総合判定回路５は、網点／非網点分離回路２の出力する
網点／非網点の分離結果と、写真／非写真分離回路３の
出力する写真／非写真の分離結果の論理積（ＡＮＤ）演
算を行い、非網点でかつ非写真部となる画像領域を白地
文字部として取り出す。そして、この白地文字部と、前
記網地文字／非網地文字分離回路４で分離された網地文
字部の両者を文字領域、またそれ以外の領域を非文字領
域と判定し、各領域に対応して文字／非文字の領域判定
信号を出力する。したがって、この領域判定信号に従っ
て、入力画像の各画素に対し、文字のための鮮鋭化処理
あるいは網点および写真のためのデイザ処理を施せば、
高品質の再生画像を得ることができる。The comprehensive judgment circuit 5 calculates the logical product ( AND) is performed, and the image area that is a non-halftone dot and non-photograph area is extracted as a blank character area. Then, both this white text area and the halftone text area separated by the halftone text/non-halftone text separation circuit 4 are determined to be character areas, and the other areas are determined to be non-text areas, and each area is Correspondingly, a character/non-character area determination signal is output. Therefore, if each pixel of the input image is subjected to sharpening processing for characters or dithering processing for halftone dots and photographs, according to this area determination signal,
High quality reproduced images can be obtained.

次に、網点／非網点分離回路２における網点部と非網点
部の分離処理について、第３図〜第６図を参照して説明
する。Next, the separation process of the halftone dot portion and the non-halftone dot portion in the halftone dot/non-halftone dot separation circuit 2 will be explained with reference to FIGS. 3 to 6.

第３図は、網点／非網点分離回路２の具体例のブロック
図を示す。極点画素検出回路２０１は、所定の局所領域
内の濃度情報を利用して、入力画像中の網点の山および
谷の極点に相当する画素（ピーク画素）を検出する。す
なわち、例えば、第４図に示す３×３のブロックにおい
て、中心画素の濃度レベルがその周囲の８画素の濃度レ
ベルに比べて最大（最小）であるとき、当該中心画素を
濃度の山（谷）を示す極点画素として検出する。FIG. 3 shows a block diagram of a specific example of the halftone/non-halftone dot separation circuit 2. As shown in FIG. The extreme point pixel detection circuit 201 detects pixels (peak pixels) corresponding to the extreme points of the peaks and valleys of halftone dots in the input image, using density information within a predetermined local area. That is, for example, in the 3×3 block shown in FIG. 4, when the density level of the center pixel is the maximum (minimum) compared to the density level of the surrounding eight pixels, the center pixel is defined as the density peak (valley). ) is detected as an extreme point pixel.

網点候補領域検出回路２０２は、上記極点画素検出回路
２０１で検出した山と谷の極点画素の所定の小領域にお
ける密度情報から注目画素が網点候補領域または非網点
候補領域のいずれに属するかを判定する。例えば、第５
図（ａ）または（ｂ）に示すように、４×４の小領域内
の山と谷の極点画素を計数する。そして、山と谷の極点
画素の合計値が所定のしきい値を越えるとき、図（ａ）
のブロック単位で処理する場合には、当該小領域内のす
べての画素を網点候補領域として検出し、また、画素単
位で処理する図（ｂ）の場合には、当該小領域内の所定
の注目画素（ハツチング画素）を網点候補領域として検
出する。The halftone candidate area detection circuit 202 determines whether the pixel of interest belongs to a halftone candidate area or a non-halftone candidate area based on the density information in a predetermined small area of the peak and valley extreme pixels detected by the extreme pixel detection circuit 201. Determine whether For example, the fifth
As shown in Figure (a) or (b), the peak and valley peak pixels within a 4x4 small area are counted. Then, when the total value of the peak and valley peak pixels exceeds a predetermined threshold, as shown in Figure (a)
When processing in block units, all pixels in the small area are detected as halftone dot candidate areas, and in case of processing in pixel units as shown in figure (b), predetermined pixels in the small area are detected as halftone dot candidate areas. A pixel of interest (hatched pixel) is detected as a halftone dot candidate area.

網点領域検出回路２０３は、上記網点候補領域検出回路
２０２の検出結果に基づいて、所定の領域内の注目画素
が網点部または非網点部のいずれに属するかを判定する
。例えば、第６図（ａ）に示すように、ブロック単位で
判定する場合、注目の小領域とその周囲の小領域のうち
で、網点候補領域とされた小領域の個数が所定のしきい
値よりも大きいときは、注目の小領域内のすべての画素
を網点領域として判定する。また、第６図（ｂ）に示す
ように、画素単位で判定する場合、注目画素（／％ラッ
チング素）とその周囲の画素のうちで、網点候補領域と
された画素の個数が所定のしきい値よりも大きいときは
、注目画素を網点領域として判定する。これにより、入
力画像中の全画素について網点部と非網点部とが分離さ
れる。The halftone dot area detection circuit 203 determines, based on the detection result of the halftone dot candidate area detection circuit 202, whether the pixel of interest in a predetermined area belongs to a halftone area or a non-halftone area. For example, as shown in FIG. 6(a), when the determination is made in block units, the number of small areas that are halftone candidate areas among the small area of interest and the surrounding small areas reaches a predetermined threshold. When it is larger than the value, all pixels in the small area of interest are determined to be a halftone area. Furthermore, as shown in FIG. 6(b), when the determination is made pixel by pixel, the number of pixels designated as halftone dot candidate areas among the pixel of interest (/% latching pixel) and its surrounding pixels is If it is larger than the threshold, the pixel of interest is determined to be a halftone dot area. As a result, halftone dots and non-halftone dots are separated for all pixels in the input image.

次に、写真／非写真分離回路３における写真部と非写真
部の分離処理の例について、第７図〜第９図を参照して
説明する。Next, an example of separation processing of a photograph part and a non-photo part in the photograph/non-photo separation circuit 3 will be explained with reference to FIGS. 7 to 9.

第７図は、写真／非写真分離回路３の具体例のブロック
図を示す。２値化回路３０１は、入力するディジタル多
値濃度データを特定のしきい値で２値化し、白黒２値画
像に変換する。すなわち、この２値化によって入力画像
上の全画素を白画素または黒画素に分類する。FIG. 7 shows a block diagram of a specific example of the photo/non-photo separation circuit 3. As shown in FIG. The binarization circuit 301 binarizes input digital multi-value density data using a specific threshold value and converts it into a black and white binary image. That is, by this binarization, all pixels on the input image are classified as white pixels or black pixels.

黒画素パターンマツチング回路３０２は、前記２値画像
から連結黒画素を検出するもので、注目画素を中心とし
て例えば３×３のマトリックス内の白黒パターンが、第
８図に示す画素パターンのいずれかとマツチングしたと
きに注目画素を連結黒画素と判断し、出力を発生する。The black pixel pattern matching circuit 302 detects connected black pixels from the binary image, and determines whether a black and white pattern in a 3×3 matrix centering on the pixel of interest is one of the pixel patterns shown in FIG. When matched, the pixel of interest is determined to be a connected black pixel, and an output is generated.

白画素パターンマツチング回路３０３は、前記２値画像
から連結白画素を検出するもので、注目画素を中心とし
て例えば３×３のマトリックス内の白黒パターンが、第
９図に示す画素パターンのいずれかとマツチングしたと
きに注目画素を連結白画素と判断し、出力を発生する。The white pixel pattern matching circuit 303 detects connected white pixels from the binary image, and determines whether a black and white pattern in a 3×3 matrix centered around the pixel of interest is one of the pixel patterns shown in FIG. When matched, the pixel of interest is determined to be a connected white pixel, and an output is generated.

計数回路３０４は、黒画素パターンマツチング回路３０
２の出力信号より、注目画素を中心とした３×３のマト
リックス内に存在する連結黒画素を計数し、計数値が例
えば２以」二となった時に出力を発生する。同様に、計
数回路３０５は、注目画素を中心とした３×３のマトリ
ックス内に存在する連結白画素を計数し、計数値が例え
ば２以トとなった時に出力を発生する。The counting circuit 304 is a black pixel pattern matching circuit 30
Based on the output signal No. 2, connected black pixels existing in a 3×3 matrix centered on the pixel of interest are counted, and an output is generated when the count value becomes, for example, 2 or more. Similarly, the counting circuit 305 counts the connected white pixels existing in a 3×3 matrix centered on the pixel of interest, and generates an output when the count value becomes 2 or more, for example.

そｊ−で、この計数回路３０４と３０５のＡＮＤをとり
、注「１画素を中心として３×３のマトリックス内に２
個以上の連結黒画素および連結白画素が同時に存在する
場合には注目画素を非写真候補画素と判定し、またそれ
以外の場合には写真候補画素と判定し、ＡＮＤ回路３０
６からそれぞれの場合に対応した出力を発生する。Then, the counting circuits 304 and 305 are ANDed, and two pixels are placed in a 3x3 matrix with one pixel as the center.
If more than 10 connected black pixels and connected white pixels exist at the same time, the pixel of interest is determined to be a non-photo candidate pixel; otherwise, it is determined to be a photograph candidate pixel, and the AND circuit 30
6 generates an output corresponding to each case.

判定回路３０７は、注目画素を中心とした例えば５×５
のマトリックス内に、例えば非写真候補画素が１個以上
存在するときは、当該注目画素を非写真画素と判定し、
１個も存在しないときは、当該注目画素を写真画素と判
定する。これにより入力画像中の全画素について写真部
と非写真部とが分離される。The determination circuit 307 uses, for example, a 5×5 pixel centered on the pixel of interest.
For example, if there is one or more non-photo candidate pixels in the matrix, the pixel of interest is determined to be a non-photo pixel,
If there is no such pixel, the pixel of interest is determined to be a photographic pixel. As a result, photographic parts and non-photographic parts are separated for all pixels in the input image.

次に、網地文字／非網地文字分離回路４における網地文
字部と非網地文字部の分離処理の例について、第１０図
へ・第１２図を参照して説明する。Next, an example of separation processing between a screen character part and a non-screen character part in the screen character/non-screen character separation circuit 4 will be described with reference to FIG. 10 and FIG. 12.

第１０図は、網地文字／非網地文字分離回路４のブロッ
ク図を示す。平滑化処理回路４０１は、入力するディジ
タル多値濃度データを所定の重み計数で平滑化する。FIG. 10 shows a block diagram of the screen character/non-screen character separation circuit 4. As shown in FIG. The smoothing processing circuit 401 smoothes input digital multi-level density data using a predetermined weighting factor.

尾根画素検出回路４０２は、上記平滑化されたディジタ
ル多値濃度データに対し、例えば第１１図に示ず５×５
の尾根画素検出パターンを適用し１、二の５×５のパタ
ーン内において、例えばＬｃＬ、＋≧ＴＨ１かつＬｅ−Ｌｚｚ≧Ｔｌ＋またはＬｃ　　Ｌｓ≧ＴＩ、かつＬｃ＝Ｌ＊＊≧ＴＩまたはＬｃ　　Ｌｓ≧ＴＨ，かつＬｅ−Ｌｚｎ≧ＴＨまたはＬｃ−Ｌ、、≧Ｔｌ＋、かつＬｃ−１，，４≧ＴＩ但し
　ＴＨ：Ｌきい値の−組でも成立するとき、その中心画）；そＩ５（・を
、第１２図に示すような文字の尾根画素と１７で検出す
る。The ridge pixel detection circuit 402 detects the smoothed digital multilevel density data, for example, in a 5×5 pixel (not shown in FIG. 11).
For example, LcL, +≧TH1 and Le-Lzz≧Tl+ or Lc Ls≧TI, and Lc=L**≧TI or Lc Ls≧ TH, and Le-Lzn≧TH or Lc-L, ,≧Tl+, and Lc-1,,4≧TI However, if the − pair of TH:L threshold also holds, then its center image); , 17 is detected as the ridge pixel of the character as shown in FIG.

判定回路４０３は、尾根画素検出回路４０２で検出され
た上記尾根画素を、例えば９×９のブロック単位で計数
し、その計数値が所定のしきい値を越えたときに、当該
ブロック内の注目画素（中心画素またはブロック内の全
画素）を網点上の文字画素として判定する。このように
、入力画像を所定の重み計数によって平滑化した後、処
理を行うことにより、背景が網点からなる網地文字部を
画像中から分離することができる。The determination circuit 403 counts the ridge pixels detected by the ridge pixel detection circuit 402 in units of, for example, 9×9 blocks, and when the counted value exceeds a predetermined threshold value, the determination circuit 403 counts the ridge pixels detected by the ridge pixel detection circuit 402, and when the counted value exceeds a predetermined threshold, the A pixel (center pixel or all pixels in a block) is determined as a character pixel on a halftone dot. In this way, by performing processing after smoothing the input image using a predetermined weight factor, it is possible to separate the halftone text portion whose background is halftone dots from the image.

〔Effect of the invention〕

以上述べたところから明らかなように、本発明によると
きは、画像中の白地文字部および網地文字部の両者を文
字領域として正確に抽出することができるので、分離精
度の高い文字分離方式を捷供することができる。As is clear from the above, when the present invention is used, it is possible to accurately extract both the white character part and the halftone character part in an image as character areas, so a character separation method with high separation accuracy can be used. Can be offered.

[Brief explanation of the drawing]

第１図は本発明の）京理図、第２図は本発明の実施例のブロック図、第３図は網点／
非網点分離回路の具体例のブロック図、第４図は局所Ｎ域と極点画素の例を示す図、第５図は網
点領域候補の検出の説明図、第６図は網点領域の検出の
説明図、第７図は写真／非写真分離回路の具体例のブロック図、第８図は連結黒画素検出のためのパターンを示す図、第９図は連結白画素検出のためのパターンを示す図、第１θ図は網地文字／非網地文字分離回路の具体例のブ
ロック図、第１１図は尾根画素検出パターンの例を示す図、第１２
図は尾根画素の説明図である。ｌ・・・Ａ／Ｄ変換回路、２・・・網点／非網点分離回
路、３・・・写真／非写真分離回路、４・・・網地文字
／非網地文字分離回路、５・・・総合判定回路。Figure 1 is a Kyori diagram of the present invention, Figure 2 is a block diagram of an embodiment of the present invention, and Figure 3 is a halftone dot/
A block diagram of a specific example of a non-halftone dot separation circuit, FIG. 4 is a diagram showing an example of a local N area and a pole pixel, FIG. 5 is an explanatory diagram of detection of halftone dot area candidates, and FIG. An explanatory diagram of detection. Figure 7 is a block diagram of a specific example of a photo/non-photo separation circuit. Figure 8 is a diagram showing a pattern for detecting connected black pixels. Figure 9 is a pattern for detecting connected white pixels. Figure 1θ is a block diagram of a specific example of a screen character/non-screen character separation circuit; Figure 11 is a diagram showing an example of a ridge pixel detection pattern; Figure 12 is a diagram showing an example of a ridge pixel detection pattern;
The figure is an explanatory diagram of ridge pixels. l... A/D conversion circuit, 2... Halftone/non-halftone separation circuit, 3... Photo/non-photo separation circuit, 4... Halftone text/non-halftone text separation circuit, 5 ... Comprehensive judgment circuit.

Claims

[Claims]

(1) After converting the input image into digital multi-value density data, based on the digital multi-value density data, the input image is divided into halftone dot areas and non-halftone areas, photographic areas and non-photographic areas, and the background is halftone dots. The halftone text area and the background are separated into the other non-halftone text areas, and by taking the AND of the non-halftone area and the non-photograph area, an image that becomes a non-halftone area and a non-photograph area is created. A character region separation method characterized in that the region is determined to be a white character portion having a white background, and both the white character portion and the halftone character portion are determined to be character regions and extracted.

(2) When separating the input image into halftone areas and non-halftone areas, detect the extreme point pixels that give the changing points of the peaks and troughs of density based on the digital multilevel density data, and select the extreme points in a predetermined small area. It is determined whether the pixel of interest belongs to a halftone dot candidate area or a non-halftone dot candidate area based on pixel density information, and based on the determination results of surrounding pixels, the pixel of interest within a predetermined area is determined to be a halftone dot candidate area or a non-halftone dot candidate area. 2. The character area separation method according to claim 1, wherein the character area separation method determines whether the character area is a dot part.

(3) When separating the input image into photographic parts and non-photographic parts, the input image is classified into black pixels and white pixels based on digital multilevel density data, and from the classified black pixel group and white pixel group, The system detects connected black pixels and connected white pixels, respectively, and determines whether the pixel of interest is in a photo area or a non-photo area based on the number of connected black pixels and connected white pixels that exist in the vicinity of the pixel of interest. A character area separation method according to claim (1), characterized in that:

(4) When separating the input image into character parts whose background is halftone dots and character parts whose background is non-halftone dots, after smoothing the digital multilevel density data with a predetermined weighting coefficient, The ridge pixel in the image is detected based on the density change, and based on the density information of the ridge pixel, it is determined whether the ridge pixel of interest is a background halftone dot character part or a background non-halftone dot character part. The character area separation method according to claim (1), characterized in that: