JP2000295468A

JP2000295468A - Image processor

Info

Publication number: JP2000295468A
Application number: JP11094981A
Authority: JP
Inventors: Hiroki Sugano; 浩樹菅野; Naoaki Kodaira; 直朗小平; Hiroaki Kubota; 浩明久保田
Original assignee: Toshiba Corp; Toshiba TEC Corp
Current assignee: Toshiba Corp; Toshiba TEC Corp
Priority date: 1999-04-01
Filing date: 1999-04-01
Publication date: 2000-10-20

Abstract

PROBLEM TO BE SOLVED: To efficiently perform image processing by suppressing degradation of an image. SOLUTION: Image of a document is fetched by an image input part 101, character, photograph, figure, chart and dot areas are extracted from fetched image data by an area identifying part 102, an image class discriminating part 103 discriminates which kind of document the inputted image data are by finding a feature amount such as complexity as a document structure from the presence/ absence of an area to be a key, such as dots or photographs and the degrees of overlapping rectangular areas from the result identified from the characters, photographs, figures, chart or dot for each area. A data transforming part 104 performs transformation processing of an image data format according to the discriminated document kind, and an image output part 105 outputs the output image of the transformed image data format.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】この発明は、文書などの画像
をファイリングあるいは複写する際に行うデータ形式や
濃度値など変換処理のための画像処理装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to an image processing apparatus for performing a conversion process such as a data format and a density value when filing or copying an image such as a document.

【０００２】[0002]

【従来の技術】文書画像を画像データとして取り込んで
その画像をハードコピーとして出力したり、イメージフ
ァイルとして保存したりする技術として、複写機および
パソコンにおけるイメージ取り込みに利用される技術
や、ファイリングシステムやデータベースに利用される
技術がある。2. Description of the Related Art Techniques for capturing a document image as image data and outputting the image as a hard copy or saving the image file as an image file include a technique used for image capture in a copying machine and a personal computer, a filing system and the like. There are technologies used for databases.

【０００３】ハードコピーとして出力する場合、文書画
像として利用者が見やすくするために画像処理を施す場
合がある。例えば、文書画像に文字が含まれている場
合、ハイパスフィルタ処理を施すことによって、文字の
エッジ部分が強調され、くっきりとした出力になり、読
みやすい文字となる。また、写真の場合、滑らかな階調
を再現するためにローパスフィルタを施すことによっ
て、ざらつき感が無くなり、奇麗な出力となる。同じ写
真であっても、網点で構成された網点写真であるのか、
あるいは、銀塩写真であるのかによって処理を切り替え
ることも可能である。以上のように、文書画像の種類に
よって処理手法を変更することは、ハードコピーとして
出力する際も、イメージファイルとして保存する際にも
非常に有効である。[0003] In the case of outputting as a hard copy, image processing is sometimes performed to make it easy for a user to view as a document image. For example, when a character is included in the document image, the edge portion of the character is emphasized by performing the high-pass filter processing, the output becomes clear, and the character becomes easy to read. In addition, in the case of a photograph, a low-pass filter is applied to reproduce a smooth gradation, thereby eliminating a feeling of roughness and providing a beautiful output. If the same photo is a halftone dot photo,
Alternatively, the processing can be switched depending on whether the photograph is a silver halide photograph. As described above, changing the processing method according to the type of the document image is very effective both when outputting as a hard copy and when saving as an image file.

【０００４】文書画像は通常、文字、図表、写真等の領
域がそれぞれ単独にあるいは重なり合って構成されてい
る。したがって、画像処理を行なう場合、文書画像にど
のような領域がどのように構成されているかを検知する
必要がある。[0004] A document image is usually composed of regions such as characters, figures, photographs, etc., singly or overlapping. Therefore, when performing image processing, it is necessary to detect what area and how are configured in the document image.

【０００５】従来、文字、網点、写真等の文書画像を構
成する要素を識別するために、特願平８−３４７０２号
に記載されているように、レイアウト解析を利用して、
文書画像の構造を抽出する方法がある。この方法では、
２値化処理を行なった後に画素の連結性を調べ、連結し
ている画素同士を領域として抽出し、その位置や大きさ
等の特徴量を利用して識別するものである。この方法の
場合、前処理として２値化を行なっているため、写真等
の中間的な濃度値を含んだ領域を正確に抽出することが
困難であった。Conventionally, in order to identify elements constituting a document image such as a character, a halftone dot, and a photograph, a layout analysis is used as described in Japanese Patent Application No. 8-34702.
There is a method for extracting the structure of a document image. in this way,
After performing the binarization process, the connectivity of pixels is checked, connected pixels are extracted as regions, and identification is performed using feature amounts such as positions and sizes. In the case of this method, since binarization is performed as preprocessing, it has been difficult to accurately extract a region including an intermediate density value such as a photograph.

【０００６】さらに、この方法を拡張した方法として特
願平１０−０５３３１７号に記載されているように、多
値画像を複数枚の２値画像に変換し、各々の２値画像に
対してレイアウト解析を行うことにより、写真等の中間
的な濃度値を含んだ領域を正確に抽出するものがある。
この方法では、網点上にある文字といった領域が重なり
合った場合においても、領域の抽出が正しく行えるとい
う利点がある。Further, as described in Japanese Patent Application No. 10-053317, which is an extension of this method, a multivalued image is converted into a plurality of binary images, and a layout is applied to each of the binary images. In some cases, an area including an intermediate density value such as a photograph is accurately extracted by performing analysis.
This method has an advantage that the region can be correctly extracted even when the regions such as characters on the halftone dot overlap.

【０００７】さて、実際の文書原稿は複雑なレイアウト
構造を持つことがある。例えば、雑誌の表紙やグラフィ
ックが多用された広告など、文字や図、写真が複雑に重
なり合っているものがあげられる。このような文書原稿
に対して、特願平１０−０５３３１７号に記載されてい
るような矩形による領域識別を行うと、領域が不自然に
分割される場合がある。その領域識別の結果を用いて画
像処理を施すと、逆にユーザにとって見にくいものとな
る恐れがある。[0007] An actual document manuscript may have a complicated layout structure. For example, there are magazine covers and advertisements that frequently use graphics, in which characters, figures, and photographs are complicatedly overlapped. If such a document document is subjected to rectangular area identification as described in Japanese Patent Application No. 10-053317, the area may be unnaturally divided. If image processing is performed using the result of the area identification, it may be difficult for the user to see the image.

【０００８】また、これとは反対に、非常に単純なレイ
アウト構造を持つ文書原稿も存在する。例えば、写真の
みで構成されていたり、文字のみあるいは文字と網点下
地のみで構成されていたりする場合などがこれに該当す
る。このような場合、文書原稿全体に対して同じ画像処
理を施すことで済むにもかかわらず、領域毎の識別結果
を用いて処理を行うため、処理時間のコストがかかるこ
とが予想される。On the other hand, there is a document manuscript having a very simple layout structure. For example, this case corresponds to a case where the image is composed only of a photograph, or only a character, or only a character and a halftone dot base. In such a case, although the same image processing may be performed on the entire document document, the processing is performed using the identification result for each area, so that it is expected that processing time will be costly.

【０００９】このように原稿の種別に関わらず、同様な
処理を行うため不都合が生じる場合があった。As described above, the same processing is performed irrespective of the type of the original, which may cause inconvenience.

【００１０】[0010]

【発明が解決しようとする課題】上記したように、従来
の画像処理装置において、多値画像を複数枚の２値画像
に変換し、各々の２値画像に対してレイアウト解析を利
用して、文書画像の構造を領域として抽出し、その領域
毎の属性にしたがって画像処理を行うため、不自然な領
域が発生して画像の劣化を起こしたり、不要な処理時間
がかかったり効率が悪いといった問題があった。As described above, in a conventional image processing apparatus, a multivalued image is converted into a plurality of binary images, and layout analysis is performed on each of the binary images by using a layout analysis. Since the structure of a document image is extracted as an area and image processing is performed according to the attributes of each area, unnatural areas occur, causing image degradation, unnecessary processing time, and poor efficiency. was there.

【００１１】そこで、この発明は、画像の劣化を少なく
して効率的に画像処理を行うことのできる画像処理装置
を提供することを目的とする。SUMMARY OF THE INVENTION It is an object of the present invention to provide an image processing apparatus capable of performing image processing efficiently with less deterioration of an image.

【００１２】[0012]

【課題を解決するための手段】この発明の画像処理装置
は、文書画像を入力する入力手段と、この入力手段で入
力された画像データから画素の特徴を用いて領域を抽出
し、この抽出された領域の属性を識別する領域識別手段
と、この領域識別手段で識別された属性毎の領域の分布
から上記画像データの種類を判定する判定手段とから構
成されている。SUMMARY OF THE INVENTION An image processing apparatus according to the present invention extracts an area using input means for inputting a document image, and features of pixels from image data input by the input means. And a determination unit for determining the type of the image data from the distribution of the area for each attribute identified by the area identification unit.

【００１３】この発明の画像処理装置は、文書画像を入
力する入力手段と、この入力手段で入力された画像デー
タから画素の特徴を用いて領域を抽出し、この抽出され
た領域の属性を識別する領域識別手段と、この領域識別
手段で識別された属性毎の領域の分布から上記画像デー
タの種類を判定する判定手段と、この判定手段で判定さ
れた種類に応じて上記画像データの解像度、圧縮率、色
数などを変換する変換手段と、この変換手段で変換され
た画像データを出力する出力手段とから構成されてい
る。An image processing apparatus according to the present invention is characterized in that an input means for inputting a document image, an area is extracted from the image data input by the input means using characteristics of pixels, and an attribute of the extracted area is identified. Area identifying means, determining means for determining the type of the image data from the distribution of the area for each attribute identified by the area identifying means, resolution of the image data according to the type determined by the determining means, It comprises a conversion means for converting the compression ratio, the number of colors, and the like, and an output means for outputting the image data converted by the conversion means.

【００１４】この発明の画像処理装置は、文書画像を入
力する入力手段と、この入力手段で入力された画像デー
タから画素の特徴を用いて領域を抽出し、この抽出され
た領域の属性を識別する領域識別手段と、この領域識別
手段で識別された属性毎の領域の分布から上記画像デー
タの種類を判定する判定手段と、この判定手段で判定さ
れた種類に応じて濃度変換あるいはフィルタ処理を施し
て上記画像データを補正する補正手段と、この補正手段
で補正された画像データを出力する出力手段とから構成
されている。An image processing apparatus according to the present invention is characterized in that an input means for inputting a document image, an area is extracted from the image data input by the input means using characteristics of pixels, and an attribute of the extracted area is identified. Area identifying means for performing the determination, determining means for determining the type of the image data from the distribution of the area for each attribute identified by the area identifying means, and performing density conversion or filter processing according to the type determined by the determining means. And correcting means for correcting the image data by applying the image data, and outputting means for outputting the image data corrected by the correcting means.

【００１５】この発明の画像処理装置は、文書画像を入
力する入力手段と、この入力手段で入力された画像デー
タから画素の特徴を用いて領域を抽出し、この抽出され
た領域の属性を識別する領域識別手段と、この領域識別
手段で識別された属性毎の領域の分布から上記画像デー
タの種類を判定する判定手段と、この判定手段で判定さ
れた種類に応じて上記画像データの画像形式を変更する
変更手段と、この変更手段で変更された画像形式の画像
データを記憶する記憶手段とから構成されている。An image processing apparatus according to the present invention is characterized in that an input means for inputting a document image, an area is extracted from the image data input by the input means using characteristics of pixels, and an attribute of the extracted area is identified. Area determining means for determining the type of the image data from the distribution of the area for each attribute identified by the area identifying means; and an image format of the image data according to the type determined by the determining means. And storage means for storing the image data of the image format changed by the changing means.

【００１６】この発明の画像処理装置は、文書画像を入
力する入力手段と、この入力手段で入力された画像デー
タから画素の特徴を用いて領域を抽出し、この抽出され
た領域の属性を識別する領域識別手段と、この領域識別
手段で識別された属性毎の領域の分布から上記画像デー
タの種類を判定する判定手段と、この判定手段で判定さ
れた種類に応じて、画像ファイリング、印刷、送信、レ
イアウト理解、文字認識、罫線抽出などの画像処理のア
プリケーションを選択し、パラメータを設定後、アプリ
ケーションを起動させ上記画像データの画像処理を行う
処理手段と、この処理手段で画像処理された画像データ
を出力する出力手段とから構成されている。An image processing apparatus according to the present invention is characterized in that an input means for inputting a document image, an area is extracted from the image data input by the input means using characteristics of pixels, and an attribute of the extracted area is identified. Area identifying means, a determining means for determining the type of the image data from the distribution of the area for each attribute identified by the area identifying means, and, according to the type determined by the determining means, image filing, printing, Processing means for selecting an image processing application such as transmission, layout understanding, character recognition, ruled line extraction, etc., setting parameters, activating the application, and performing image processing on the image data, and an image processed by this processing means And output means for outputting data.

【００１７】[0017]

【発明の実施の形態】以下、この発明の一実施の形態に
ついて図面を参照して説明する。An embodiment of the present invention will be described below with reference to the drawings.

【００１８】第１実施例について説明する。The first embodiment will be described.

【００１９】図１は、本発明の第１実施例に係る画像処
理装置の概略構成を示すものである。図１において、１
０１は画像入力部、１０２は領域識別部、１０３は画像
種別判定部、１０４はデータ変換部、１０５は画像出力
部である。FIG. 1 shows a schematic configuration of an image processing apparatus according to a first embodiment of the present invention. In FIG. 1, 1
01 is an image input unit, 102 is an area identification unit, 103 is an image type determination unit, 104 is a data conversion unit, and 105 is an image output unit.

【００２０】画像入力部１０１は画像データを入力する
装置であり、書類を読み取って画像データに変換する装
置であるイメージスキャナ等の画像入力装置により、文
書等の書類から描かれているものを取り込む装置であ
る。なお、本発明における画像入力部はイメージスキャ
ナ等の読み取り装置で構成されたものでも良いし、既に
画像データとしてファイリング装置等に保存された画像
イメージを取り込む装置で構成されても良い。An image input unit 101 is a device for inputting image data. An image input device such as an image scanner, which is a device for reading a document and converting it into image data, captures an image drawn from a document such as a document. Device. Note that the image input unit in the present invention may be configured by a reading device such as an image scanner, or may be configured by a device that captures an image already stored in a filing device or the like as image data.

【００２１】領域識別部１０２は、画像入力部１０１に
おいて入力された画像データから文字を持った画素を領
域として識別するものである。まず、画像入力部１０１
において入力された画像データに対して、周辺画素の濃
度差や彩度などの状態によって複数の２値の画像データ
に分離し、その画像より文字や図形等が物理的にあるい
は論理的に連結されている各領域に分割して抽出し、そ
の領域の位置、大きさ、形状、構造、濃度分布等の特徴
量を計測して、文書要素としての属性を決定する。文書
要素の種類として、文字、写真、図、表、網点などがあ
げられる。The area identification section 102 identifies pixels having characters from the image data input by the image input section 101 as areas. First, the image input unit 101
The input image data is separated into a plurality of binary image data according to the state such as the density difference and the saturation of peripheral pixels, and characters or graphics are physically or logically connected from the image. Each region is divided and extracted, and the feature amount such as the position, size, shape, structure, density distribution and the like of the region is measured, and the attribute as a document element is determined. The types of document elements include text, photographs, figures, tables, and halftone dots.

【００２２】複数の２値の画像データに分離する具体的
な手法として、既に公知となっているものがあり、例え
ば、特願平１０−０５３３１７号で開示されている手法
により実現しても良い。この場合、画像分離部１０２の
出力として、文字画像、中間調画像、下地画像、網点画
像、カラー画像、グレー画像、黒画像の７つの２値分離
画像データが生成される。As a specific method of separating the image data into a plurality of binary image data, there is a known method. For example, the method may be realized by a method disclosed in Japanese Patent Application No. 10-053317. . In this case, as the output of the image separating unit 102, seven binary separated image data of a character image, a halftone image, a background image, a halftone image, a color image, a gray image, and a black image are generated.

【００２３】また、文書要素の抽出および識別の具体的
な手法として、既に公知となっているものがあり、例え
ば、特願平８−３４７０２号で開示されている手法によ
り実現しても良い。領域識別部１０２では、単一画像デ
ータからだけではなく、複数画像データでの特徴量をル
ールにしたがって統合し、入力された画像データの領域
属性を決定する。例えば、文字画像と中間調画像の双方
から同じ位置に領域が抽出された場合、その領域の種類
や領域の大きさはどれだけなのかを決定する。具体的な
例として、文字画像上に写真領域が存在し、同じ位置に
中間調画像に中間調画素が存在する場合、連続階調写真
領域と決定する。複数画像データからの領域属性決定の
具体的手法として、既に公知となっているものがあり、
例えば、特願平１０−０５３３１７号で開示されている
手法により実現しても良い。As a specific method for extracting and identifying document elements, there is a known method, and for example, the method may be realized by a method disclosed in Japanese Patent Application No. 8-34702. The region identification unit 102 integrates not only single image data but also feature amounts of a plurality of image data according to rules, and determines a region attribute of the input image data. For example, when an area is extracted at the same position from both the character image and the halftone image, the type of the area and the size of the area are determined. As a specific example, when a photographic area exists on a character image and a halftone pixel exists on the halftone image at the same position, it is determined to be a continuous tone photographic area. As a specific method of determining an area attribute from a plurality of image data, there is a known method,
For example, it may be realized by a method disclosed in Japanese Patent Application No. 10-053317.

【００２４】画像種別判定部１０３は、詳細は後述する
が、領域識別部１０２において文字、写真、図、表、網
点などと領域毎に識別された結果から、網点、写真とい
ったキーとなる領域の有無や、矩形領域の重なりの度合
いから文書構造としての複雑さなどといった特徴量を求
めることにより、入力された画像データがどのような種
類の文書であるのか判定を行う。As will be described in detail later, the image type determination unit 103 becomes a key such as a halftone dot or a photograph based on the result of the region identification unit 102 identifying each region as a character, photograph, figure, table, halftone dot, or the like. By determining a feature amount such as the complexity of the document structure based on the presence or absence of a region and the degree of overlap of the rectangular regions, it is determined what type of document the input image data is.

【００２５】ここで言う画像データの種類とは、例え
ば、１下地が一様であって、文字、図、表などしか存在
しないもの。The type of image data referred to here is, for example, one in which the background is uniform and only characters, figures, tables, and the like are present.

【００２６】２下地に網点があり、文字、図、表などし
か存在しないもの。(2) Halftone dots on the base and only characters, figures, tables, etc. are present.

【００２７】３網点写真しか存在しないもの。One in which only three halftone photographs exist.

【００２８】４連続写真しか存在しないもの。One in which only four consecutive photographs exist.

【００２９】５文字、図、表のうち少なくとも一つと網
点写真が混在していて、なおかつ矩形で領域を抽出する
ことが可能なもの、または文字、図、表、網点のうち少
なくとも一つと連続写真が混在し、なおかつ矩形で領域
を抽出することが可能なもの。At least one of five characters, figures, tables, and halftone photographs are mixed, and a rectangular area can be extracted, or at least one of characters, figures, tables, and halftone dots A series of continuous photos, and a rectangular area can be extracted.

【００３０】６文字、図、表のうち少なくとも一つと網
点写真が混在していて、なおかつ矩形で領域を抽出する
ことが不可能なもの、または文字、図、表、網点のうち
少なくとも一つと連続写真が混在し、なおかつ矩形で領
域を抽出することが不可能なもの。At least one of the six characters, figures, tables, and halftone photographs are mixed, and a rectangular area cannot be extracted, or at least one of characters, figures, tables, and halftone dots One is a mixture of one and a series of photographs, and it is impossible to extract a rectangular area.

【００３１】などである。And so on.

【００３２】データ変換部１０４は、詳細は後述する
が、画像種別判定部１０３において判定された文書の種
類にしたがって入力画像データの形式の変更を行う。画
像データの形式の変更とは、例えば、解像度、圧縮率、
色数などを変更することであっても良い。また、属性の
領域情報を画像に変換したものでも良い。属性の領域情
報を画像に変換する具体的な手法として、既に公知とな
っているものがあり、例えば、特願平１０−０５３３１
７号で開示されている手法により実現しても良い。The data conversion unit 104 changes the format of the input image data according to the type of the document determined by the image type determination unit 103, which will be described in detail later. Changing the format of image data includes, for example, resolution, compression ratio,
The number of colors may be changed. Alternatively, attribute area information may be converted into an image. As a specific method of converting area information of an attribute into an image, there is a known method, for example, Japanese Patent Application No. 10-05331.
7 may be realized.

【００３３】画像出力部１０５は、データ変換部１０４
によって変換された画像データを出力するものである。
出力形態としては、ハードコピーであったり、イメージ
ファイルとしてストレイジに保存したりしても良い。The image output unit 105 includes a data conversion unit 104
The image data converted by the above is output.
The output form may be a hard copy or may be stored in storage as an image file.

【００３４】次に、このような構成において、本発明の
画像処理装置の処理を説明する。Next, the processing of the image processing apparatus of the present invention in such a configuration will be described.

【００３５】図２は、図１に示した画像処理装置におけ
る文書処理手順の例を示すフローチャートである。この
フローチャートを参照しながら本発明による画像処理装
置の処理の流れを説明する。FIG. 2 is a flowchart showing an example of a document processing procedure in the image processing apparatus shown in FIG. The processing flow of the image processing apparatus according to the present invention will be described with reference to this flowchart.

【００３６】まず、書類の画像を画像入力部１０１によ
り取り込む（ステップＳＴ２０１）。すなわち、スキャ
ナ等の画像入力装置を利用して書類から画像を読み取っ
たり、またはファイリングシステム等の画像ファイルデ
ータを入力したりしたものを、画像入力部１０１により
画像データに変換する。First, an image of a document is fetched by the image input unit 101 (step ST201). That is, the image input unit 101 converts an image read from a document using an image input device such as a scanner or image file data input from a filing system or the like into image data.

【００３７】領域識別部１０２において、画像入力装置
から１ライン分、または数ライン分ずつ読み取り、周辺
画素の濃度差や彩度などの画素ごとの状態によって複数
の２値画像データに分離され、文字や図形等が物理的に
あるいは論理的に連結されている各領域毎に分割して抽
出し、その領域の位置、大きさ、形状、構造、濃度分布
等の特徴量を計測して領域の種類や重要度等の識別を行
ない、その結果、文字、写真、図、表、網点領域を抽出
する（ステップＳＴ２０２）。入力された画像の全画素
に対して処理が終了するまで繰り返す（ステップＳＴ２
０３）。The area discriminating unit 102 reads one line or several lines from the image input device, and separates the data into a plurality of binary image data according to the state of each pixel such as the density difference and saturation of peripheral pixels. And extract each area where data, graphics, etc. are physically or logically connected, and measure the amount of features such as position, size, shape, structure, density distribution, etc. of the area, and type of area Then, the character, the photograph, the figure, the table, and the dot area are extracted as a result (step ST202). Repeat until the processing is completed for all pixels of the input image (step ST2
03).

【００３８】すべての画素属性が決定した後、画像種別
判定部１０３において、文字、写真、図、表、網点など
と領域毎に識別された結果から、網点、写真といったキ
ーとなる領域の有無や、矩形領域の重なりの度合いから
文書構造としての複雑さなどといった特徴量を求めるこ
とにより、入力された画像データがどのような種類の文
書であるのか判定を行う（ステップＳＴ２０４）。な
お、画像種別判定部１０３における具体的な処理の内容
の詳細は、図３を参照して後述する。After all pixel attributes have been determined, the image type determination unit 103 identifies characters, photographs, figures, tables, halftone dots, and the like for each area, and determines the key areas such as halftone dots and photographs. By determining feature quantities such as the presence / absence and the degree of overlap of rectangular areas, such as the complexity of the document structure, it is determined what type of document the input image data is (step ST204). The details of the specific processing in the image type determination unit 103 will be described later with reference to FIG.

【００３９】入力された画像データの文書としての種類
を決定した後、データ変換部１０４において、文書種類
にしたがった画像データ形式の変換処理を行う（ステッ
プＳＴ２０５）。なお、データ変換部１０４における具
体的な処理の内容の詳細は、図５を参照して後述する。After determining the type of the input image data as a document, the data conversion unit 104 performs a conversion process of an image data format according to the document type (step ST205). The details of the specific processing in the data conversion unit 104 will be described later with reference to FIG.

【００４０】最後に画像出力部１０５において、画像デ
ータ形式を変換した出力画像を出力する（ステップＳＴ
２０６）。Finally, the image output unit 105 outputs an output image whose image data format has been converted (step ST).
206).

【００４１】以上が本画像処理装置の大まかな処理動作
である。次に個々の要素の処理の詳細を説明する。The rough processing operation of the image processing apparatus has been described above. Next, details of processing of each element will be described.

【００４２】図３は、画像処理装置の画像種別判定部１
０３における一例としての画像種別判定処理の詳細を示
すフローチャートであり、図２に示したステップＳＴ２
０４で行われる処理のフローチャートである。画像種別
判定処理は、画素毎の領域識別結果を利用して、入力さ
れた画像データが文書としてどのような種類なのかを判
定する処理である。FIG. 3 shows an image type determining unit 1 of the image processing apparatus.
FIG. 3 is a flowchart illustrating details of an example of an image type determination process in Step ST2 shown in FIG.
14 is a flowchart of a process performed in 04. The image type determination process is a process of determining what type of input image data is a document by using a region identification result for each pixel.

【００４３】まず、本処理において分類する文書種類の
一例を説明する。First, an example of a document type to be classified in this processing will be described.

【００４４】一様下地原稿の文書とは、写真領域と網点
領域が存在しないものをさす。したがって文字領域の
み、または何もかかれていない白紙の状態のいずれかが
これに該当する。A document with a uniform background original is one in which a photograph area and a halftone area do not exist. Therefore, either the character area alone or a blank state where nothing is applied corresponds to this.

【００４５】網点下地原稿とは、網点領域が存在し、か
つ網点領域内に文字領域が存在するものをさす。The halftone base document refers to a document having a halftone area and a character area within the halftone area.

【００４６】網点写真原稿とは、網点領域が存在し、か
つ網点領域内に写真領域が存在し、かつ文字領域が存在
しないものをさす。The halftone photo original is one in which a halftone dot region exists, a photograph region exists in the halftone dot region, and no character region exists.

【００４７】連続階調写真原稿とは、網点領域と文字領
域が存在せず、かつ写真領域のみが存在するものをさ
す。A continuous tone photographic original is one in which a halftone dot area and a character area do not exist and only a photographic area exists.

【００４８】矩形識別原稿とは、一様下地原稿、網点下
地原稿、網点写真原稿、連続階調原稿のいずれの条件に
当てはまらなく、かつ複雑なレイアウト構造を持ってい
ないで、文字、網点、写真、図表等の領域毎に分割が可
能なものをさす。The rectangular identification document does not satisfy any of the conditions of a uniform background document, a halftone background document, a halftone photograph document, and a continuous tone document, and does not have a complicated layout structure. Points that can be divided into areas such as points, photographs, charts, etc.

【００４９】なお、複雑なレイアウト構造を持ち、文
字、網点、写真、図表等の領域毎に分割が不可能なもの
は識別不能原稿として、他の原稿と区別する。A document which has a complicated layout structure and cannot be divided into regions such as characters, halftone dots, photographs, charts, etc. is distinguished from other documents as an unidentifiable document.

【００５０】図３において、画素属性データ１５１は領
域識別部１０２において識別された複数画像データの領
域の種類や重要度を用いて、入力された画像データの画
素ごとの属性を決定したもので、各々の文書要素ごとの
領域として表されている。In FIG. 3, pixel attribute data 151 is obtained by determining the attribute of each pixel of the input image data using the type and importance of the region of the plurality of image data identified by the region identification unit 102. It is represented as an area for each document element.

【００５１】まず、画素属性データ１５１からデータを
読み取り、網点領域が存在するか調べる（ステップＳＴ
３０１）。網点領域が存在する場合、次に写真領域が存
在するか調べる（ステップＳＴ３０２）。写真領域が存
在する場合、さらに文字領域が存在するか調べる（ステ
ップＳＴ３０３）。もし文字領域が存在するなら、矩形
識別候補原稿１５２となる。矩形識別候補原稿１５２
は、この後レイアウト構造の複雑さを検証する処理を行
い、矩形識別原稿または識別不能原稿に分類する。この
処理の詳細な説明は図４を用いて後述する。First, data is read from the pixel attribute data 151 to check whether a halftone dot area exists (step ST).
301). If a halftone dot region exists, it is checked whether or not a photograph region exists (step ST302). If a photograph area exists, it is checked whether or not a character area further exists (step ST303). If there is a character area, it becomes the rectangle identification candidate document 152. Rectangle identification candidate document 152
Performs a process of verifying the complexity of the layout structure, and classifies the document into a rectangular identification document or an unidentifiable document. A detailed description of this processing will be described later with reference to FIG.

【００５２】ステップＳＴ３０３にて文字領域が存在し
ない場合、網点写真原稿１５３となる。If there is no character area in step ST303, a halftone photographic original 153 is obtained.

【００５３】ステップＳＴ３０２にて写真領域が存在し
ない場合、網点領域内に文字領域が存在するか調べる
（ステップＳＴ３０４）。網点領域内に文字領域が存在
する場合、網点下地原稿１５４となる。網点領域内に文
字領域が存在しない場合、さらに網点領域外に文字領域
が存在しないか調べる（ステップＳＴ３０５）。網点領
域外に文字領域が存在しない場合、網点写真原稿１５３
となる。網点領域外に文字領域が存在する場合、矩形識
別候補原稿１５２となる。If no photograph area exists in step ST302, it is checked whether a character area exists in the halftone dot area (step ST304). If a character area exists in the halftone dot area, the halftone dot original document 154 is obtained. If no character area exists within the halftone dot area, it is further checked whether a character area exists outside the halftone dot area (step ST305). If there is no character area outside the halftone dot area,
Becomes If a character area exists outside the halftone dot area, the document becomes the rectangle identification candidate document 152.

【００５４】ステップＳＴ３０１にて網点領域が存在し
ない場合、写真領域が存在するか調べる（ステップＳＴ
３０６）。写真領域が存在しない場合、一様下地原稿１
５６となる。写真領域が存在する場合、さらに文字領域
が存在するか調べる（ステップＳＴ３０７）。文字領域
が存在しない場合、連続階調写真原稿１５５となる。文
字領域が存在する場合、矩形識別候補原稿１５２とな
る。If no halftone dot region exists in step ST301, it is checked whether a photograph region exists (step ST301).
306). If there is no photo area, uniform base document 1
56. If a photograph area exists, it is checked whether or not a character area further exists (step ST307). If there is no character area, a continuous tone photographic original 155 is obtained. If a character area exists, the document becomes a rectangular identification candidate document 152.

【００５５】以上の処理によって、入力画像データは、
矩形識別候補原稿１５２、網点写真原稿宇１５３、網点
下地原稿１５４、連続階調写真原稿１５５、一様下地原
稿１５６のいずれかに決定される。By the above processing, the input image data is
It is determined to be one of a rectangle identification candidate document 152, a halftone photo document 153, a halftone background document 154, a continuous tone photo document 155, and a uniform background document 156.

【００５６】図４は、本発明の画像処理装置の画像種別
判定部１０３における一例としての画像種別判定処理の
詳細を示すフローチャートであり、図３に示した矩形識
別候補原稿１５２に対して行われる処理のフローチャー
トである。FIG. 4 is a flowchart showing the details of an example of the image type determination processing in the image type determination section 103 of the image processing apparatus of the present invention, which is performed on the rectangular identification candidate document 152 shown in FIG. It is a flowchart of a process.

【００５７】矩形識別候補原稿１５２と判定された場
合、レイアウト構造の複雑さを検証する処理を行い、矩
形識別原稿または識別不能原稿に分類するが、その際の
レイアウト構造の複雑さを測る指標の一例として、矩形
領域の重なり度合いを計測しても良い。直接、矩形の重
なりを検証しても良いが、矩形領域の数が多くなると、
比較する回数が劇的に増加することが想定される。そこ
で重なり度合いを予め用意した２次元の投票空間に矩形
サイズを投票し、その投票空間の状態から推定しても良
い（ステップＳＴ４０１）。If it is determined that the document is a rectangle identification candidate document 152, a process for verifying the complexity of the layout structure is performed, and the document is classified as a rectangle identification document or an unidentifiable document. As an example, the degree of overlap of the rectangular areas may be measured. Although the overlap of rectangles may be directly verified, if the number of rectangular areas increases,
It is expected that the number of comparisons will increase dramatically. Therefore, the rectangular size may be voted in a two-dimensional voting space in which the degree of overlap is prepared in advance, and may be estimated from the state of the voting space (step ST401).

【００５８】この投票空間は、矩形領域の重なり度合い
を計測するためのものであるため、解像度は元の画像の
解像度より低いもので良い。投票は、矩形領域の座標位
置を投票空間の座標位置に変換した個所にその領域の矩
形サイズなどの特徴を加算することによりなされる。矩
形サイズの例としては、矩形の幅や高さや面積値などの
いずれかを用いることが可能である。また、矩形領域の
属性、つまり、文字、写真、図表、網点などの属性にし
たがって投票値に重みを付けても良い。例えば、写真や
図表のように、レイアウト構造に大きな影響を与える属
性は重み付けを大きくし、文字などは重み付けを小さく
すると、写真や図表の重なった領域の投票値は大きくな
る。レイアウト構造の複雑さを考慮すると、写真や図表
の重なりというのは複雑なものに分類される。したがっ
て、この場合は複雑なものと見做しやすくなる。また、
処理をより高速に行うために、投票する矩形領域を制限
することも可能である。例えば、小さな文字領域はレイ
アウト構造を判定する際に大きな影響を与えるものでは
ないことがある。このような場合、投票する矩形サイズ
を制限することで、投票回数の削減が図られ、その結果
高速になることが予想される。ただし、画像データに存
在する文字領域が小さな文字領域のみで構成されている
場合は、一概に除外することはできない。そのような場
合、領域識別部１０２の出力である画素属性データの状
態によって制限を調整することも可能である。Since this voting space is for measuring the degree of overlap of the rectangular areas, the resolution may be lower than the resolution of the original image. The voting is performed by adding a feature such as the rectangular size of the rectangular area to a position where the coordinate position of the rectangular area is converted into the coordinate position of the voting space. As an example of the rectangle size, any of the width, height, area value, and the like of the rectangle can be used. Also, the voting value may be weighted according to the attributes of the rectangular area, that is, attributes such as characters, photographs, charts, and halftone dots. For example, if an attribute that greatly affects the layout structure, such as a photograph or a chart, is given a higher weight and a character or the like is given a lower weight, the voting value of an area where the picture or chart overlaps becomes larger. Considering the complexity of the layout structure, overlapping of photographs and charts is classified as complicated. Therefore, in this case, it can be easily regarded as complicated. Also,
In order to perform the processing at a higher speed, it is possible to limit the rectangular area to be voted. For example, a small character area may not have a significant effect when determining a layout structure. In such a case, by limiting the size of the rectangle to be voted, the number of votes can be reduced, and as a result, it is expected that the speed will be increased. However, when the character area existing in the image data is composed of only a small character area, it cannot be simply excluded. In such a case, the restriction can be adjusted according to the state of the pixel attribute data output from the area identification unit 102.

【００５９】投票空間への矩形サイズの投票がすべて終
了した後、投票空間の特徴量の計測を行う（ステップＳ
Ｔ４０２）。特徴量としては、例えば最大値や分散など
の統計量を用いても良いし、投票空間を多値画像と見做
して、２値化処理を行った後、領域の抽出を行っても良
い。前者の場合、算出した結果と予め設定しておいた閾
値との比較によって判定することが可能であり、後者の
場合、抽出された領域の図形的特徴を計測することで判
定することが可能である。ステップＳＴ４０２では、前
者の例を用いている。この場合、最大値ｍａｘまたは分
散値ｖａｒが閾値ａまたはｂより小さいとき、レイアウ
ト構造は複雑ではないと見做し矩形識別原稿１５７と判
定し、逆に大きい場合、レイアウト構造は複雑と見做し
識別不能原稿１５８と判定する（ステップＳＴ４０
３）。After all rectangular-size voting in the voting space is completed, the feature amount of the voting space is measured (step S).
T402). As the feature amount, for example, a statistic amount such as a maximum value or a variance may be used, or a voting space may be regarded as a multi-valued image, and after performing a binarization process, an area may be extracted. . In the former case, the determination can be made by comparing the calculated result with a preset threshold. In the latter case, the determination can be made by measuring the graphic features of the extracted region. is there. In step ST402, the former example is used. In this case, when the maximum value max or the variance value var is smaller than the threshold values a or b, the layout structure is considered to be uncomplicated and the rectangular identification document 157 is determined, and when it is larger, the layout structure is considered to be complex. It is determined that the document 158 cannot be identified (step ST40).
3).

【００６０】以上が画像種別判定部１０３の詳細な説明
であり、その結果、入力画像データは、網点写真原稿１
５３、網点下地原稿１５４、連続階調写真原稿１５５、
一様下地原稿１５６、矩形識別原稿１５７、識別不能原
稿１５８のいずれかに決定される。The above is the detailed description of the image type determination unit 103. As a result, the input image data is
53, halftone base document 154, continuous tone photographic document 155,
The original is determined to be one of the uniform background original 156, the rectangular identification original 157, and the unidentifiable original 158.

【００６１】図５は、画像処理装置のデータ変換部１０
４における一例としてのデータ変換処理の詳細を示すフ
ローチャートであり、図２に示したステップＳＴ２０５
で行われる処理のフローチャートである。データ変換処
理は、画像種別判定結果を利用して、入力された画像デ
ータの形式を変更する処理である。FIG. 5 shows a data conversion unit 10 of the image processing apparatus.
FIG. 4 is a flowchart showing details of an example of data conversion processing in FIG. 4 and is a step ST205 shown in FIG.
3 is a flowchart of the process performed in step S1. The data conversion process is a process of changing the format of input image data using the image type determination result.

【００６２】画像種別情報１６１は、画像種別判定部１
０３の出力で、入力画像データの種類を表し、網点写真
原稿１５３、網点下地原稿１５４、連続階調写真原稿１
５５、一様下地原稿１５６、矩形識別原稿１５７、識別
不能原稿１５８のいずれかである。The image type information 161 is stored in the image type
03 indicates the type of the input image data, and includes a halftone photo original 153, a halftone background original 154, and a continuous tone photo original 1
55, a uniform base document 156, a rectangular identification document 157, or an unidentifiable document 158.

【００６３】まず、画像種別情報１６１が識別不能原稿
であるか調べる（ステップＳＴ５０１）。識別不能原稿
でない場合、次に矩形識別原稿であるか調べる（ステッ
プＳＴ５０２）。矩形識別原稿である場合、領域識別部
１０２で得られた画素属性の情報を利用して、入力画像
データに対して矩形領域毎にデータ変換処理を行う（ス
テップＳＴ５０３）。First, it is checked whether or not the image type information 161 is an unidentifiable document (step ST501). If the original is not an unidentifiable original, it is checked whether the original is a rectangular identification original (step ST502). If the document is a rectangular identification document, data conversion processing is performed on the input image data for each rectangular area using the information on the pixel attributes obtained by the area identification unit 102 (step ST503).

【００６４】データ変換処理として、例えば、解像度、
圧縮率、色数などがある。予め画素属性毎にデータ変換
処理のルールを設定しておき、そのルールにしたがって
データ変換処理を行わせても良い。例えば、網点写真領
域や連続写真領域の場合、解像度を低くする変換を行っ
たり、圧縮率を上げても著しい画像劣化は見られない場
合があり、情報量の削減が可能となる。また、文字領域
で黒文字しか存在しない場合、多値データを２値に変換
することで同様に情報量の削減が可能となる。このよう
に予め画素属性毎にデータ変換処理のルールを設定して
おくことで、効率の良い画像データに変換することが可
能となる。As data conversion processing, for example, resolution,
There are a compression ratio and the number of colors. A rule of the data conversion process may be set in advance for each pixel attribute, and the data conversion process may be performed according to the rule. For example, in the case of a halftone dot photograph region or a continuous photograph region, remarkable image deterioration may not be observed even if conversion for lowering the resolution is performed or the compression ratio is increased, so that the amount of information can be reduced. If only black characters exist in the character area, the amount of information can be reduced by converting the multi-valued data into binary. By setting the rules of the data conversion process for each pixel attribute in advance, it is possible to convert the data into efficient image data.

【００６５】データ変換処理の結果、変換画像データ１
６２として出力する。As a result of the data conversion processing, converted image data 1
Output as 62.

【００６６】また、ステップＳＴ５０２で矩形識別原稿
でない場合、画像全体に一様なデータ変換処理を行う
（ステップＳＴ５０４）。一様なデータ変換処理とし
て、例えば、画像種別情報１６１が網点写真原稿や連続
階調写真であった場合、解像度を低くする変換を行った
り、圧縮率を上げても著しい画像劣化は見られない場合
があり、情報量の削減が可能となる。また、画像種別情
報１６１が一様下地原稿で黒文字しか存在しない場合、
多値データを２値に変換することで同様に情報量の削減
が可能となる。このようにステップＳＴ５０３と同様に
画像の種類毎にデータ変換処理のルールを設定しておく
ことで、効率の良い画像データに変換することが可能と
なる。なおこの場合、ステップＳＴ５０３のように矩形
領域毎にデータ変換処理を行わないで、画像全体に同一
の処理を施すため、高速に行えるという利点がある。If it is determined in step ST502 that the document is not a rectangular identification document, uniform data conversion processing is performed on the entire image (step ST504). As a uniform data conversion process, for example, when the image type information 161 is a halftone photo original or a continuous tone photograph, remarkable image deterioration is observed even if conversion for lowering the resolution is performed or the compression ratio is increased. In some cases, the amount of information can be reduced. If the image type information 161 is a uniform background document and only black characters exist,
By converting multi-valued data into binary, the amount of information can be reduced in the same manner. By setting the data conversion rule for each type of image in the same manner as in step ST503, it is possible to convert the image data into efficient image data. In this case, since the same processing is performed on the entire image without performing the data conversion processing for each rectangular area as in step ST503, there is an advantage that the processing can be performed at high speed.

【００６７】データ変換処理の結果、ステップＳＴ５０
３と同様に変換画像データ１６２として出力する。As a result of the data conversion processing, step ST50
3, and output as converted image data 162.

【００６８】また、ステップＳＴ５０１で識別不能原稿
である場合、領域識別部１０２で得られた画素属性の情
報を利用したデータ変換処理は行わないで、入力画像デ
ータをそのまま補正画像データとして出力する（ステッ
プＳＴ５０５）。なお、データ変換処理（ステップＳＴ
５０３およびステップＳＴ５０４）は解像度、圧縮率、
色数など入力画像データに対して処理を行うものだけで
はなく、先に説明したように、属性の領域情報を画像デ
ータに変換する処理でも良い。If the document is an unidentifiable document in step ST501, the input image data is output as it is as the corrected image data without performing the data conversion process using the information of the pixel attribute obtained by the area identifying section 102 ( Step ST505). The data conversion process (step ST
503 and step ST504) are resolution, compression ratio,
Not only processing for input image data such as the number of colors but also processing for converting attribute area information into image data as described above may be used.

【００６９】このように、領域識別した結果から入力画
像データの種類を判定して、判定された画像データの種
類にしたがってデータ変換処理を行うことで、画質の劣
化が少ない効率的な画像データに変換することが可能と
なる。As described above, the type of the input image data is determined from the result of the region identification, and the data conversion process is performed according to the determined type of the image data. It can be converted.

【００７０】次に、第２実施例について説明する。Next, a second embodiment will be described.

【００７１】第２実施例は、画像の種類の判別結果にし
たがって、適切な濃度変換あるいはフィルタ処理を施し
画像データを補正する画像処理装置である。The second embodiment is an image processing apparatus that corrects image data by performing appropriate density conversion or filter processing in accordance with the result of determining the type of an image.

【００７２】図６は、第２実施例に係る画像処理装置の
概略構成を示すものである。図６において、１０１は画
像入力部、１０２は領域識別部、１０３は画像種別判定
部、１０６は画像補正部、１０５は画像出力部であり、
図１で示した第１実施例におけるデータ変換部１０４を
画像補正部１０６とした形態をなしている。画像入力部
１０１、領域識別部１０２、画像種別判定部１０３、画
像出力部１０５は既に説明した通りであるので、ここで
は画像補正部１０６について説明する。FIG. 6 shows a schematic configuration of an image processing apparatus according to the second embodiment. 6, 101 is an image input unit, 102 is an area identification unit, 103 is an image type determination unit, 106 is an image correction unit, 105 is an image output unit,
In this embodiment, the data converter 104 in the first embodiment shown in FIG. Since the image input unit 101, the region identification unit 102, the image type determination unit 103, and the image output unit 105 have already been described, the image correction unit 106 will be described here.

【００７３】図７は、図６に示した画像処理装置におけ
る文書処理手順の例を示すフローチャートである。図２
に示した図１における構成のフローチャートのうち、デ
ータ変換処理（ステップＳＴ２０５）を画像補正処理
（ステップＳＴ２０７）に変更した形態をなす。FIG. 7 is a flowchart showing an example of a document processing procedure in the image processing apparatus shown in FIG. FIG.
In the flowchart of the configuration shown in FIG. 1, the data conversion processing (step ST205) is changed to an image correction processing (step ST207).

【００７４】画像種別判定部１０３で入力された画像デ
ータの文書としての種類を決定した後、画像補正部１０
６において、文書の種類にしたがった適切な濃度変換あ
るいはフィルタ処理を施して画像データを補正する処理
を行う（ステップＳＴ２０７）。After determining the type of the image data input by the image type determination unit 103 as a document, the image correction unit 10
In step 6, the image data is corrected by performing appropriate density conversion or filter processing according to the type of the document (step ST207).

【００７５】図８は、画像補正部１０６における一例と
しての画像補正処理の詳細を示すフローチャートであ
り、図７に示したステップＳＴ２０７で行われる処理の
フローチャートである。FIG. 8 is a flowchart showing details of an example of image correction processing in image correction section 106, and is a flowchart of processing performed in step ST207 shown in FIG.

【００７６】画像種別情報１６１は、画像種別判定部１
０３の出力で、入力画像データの種類を表し、網点写真
原稿１５３、網点下地原稿１５４、連続階調写真原稿１
５５、一様下地原稿１５６、矩形識別原稿１５７、識別
不能原稿１５８のいずれかである。The image type information 161 is stored in the image type
03 indicates the type of the input image data, and includes a halftone photo original 153, a halftone background original 154, and a continuous tone photo original 1
55, a uniform base document 156, a rectangular identification document 157, or an unidentifiable document 158.

【００７７】まず、画像種別情報１６１が識別不能原稿
であるか調べる（ステップＳＴ６０１）。識別不能原稿
でない場合、次に矩形識別原稿であるか調べる（ステッ
プＳＴ６０２）。矩形識別原稿である場合、領域識別部
１０２で得られた画素属性の情報を利用して、入力画像
データに対して矩形領域毎に画像補正処理を行う（ステ
ップＳＴ６０３）。First, it is checked whether or not the image type information 161 is an unidentifiable document (step ST601). If the document is not an unidentifiable document, it is checked whether the document is a rectangular identification document (step ST602). If the document is a rectangular identification document, image correction processing is performed on input image data for each rectangular region using the pixel attribute information obtained by the region identification unit 102 (step ST603).

【００７８】画像補正処理として、例えば、濃度変換処
理やフィルタ処理などがある。予め画素属性毎に画像補
正処理のルールを設定しておき、そのルールにしたがっ
て画像補正処理を行わせても良い。例えば、文字領域で
あるならば文字が見やすいようにハイパスフィルタ処理
を施したり、網点写真領域ならばモアレがない滑らかな
画像にするためにローパスフィルタ処理を施しても良
い。その結果、品質の良い画像データに補正され、補正
画像データ１６３として出力することが可能となる。こ
のように予め画素属性毎に画像補正処理のルールを設定
しておくことで、品質の良い画像データに補正すること
が可能となる。The image correction processing includes, for example, density conversion processing and filter processing. A rule of the image correction processing may be set in advance for each pixel attribute, and the image correction processing may be performed according to the rule. For example, a high-pass filter process may be performed so that the characters are easy to see in a character region, and a low-pass filter process may be performed in a halftone dot photograph region to obtain a smooth image without moiré. As a result, the image data is corrected to high-quality image data, and can be output as corrected image data 163. By setting the image correction processing rule for each pixel attribute in advance in this way, it is possible to correct image data of high quality.

【００７９】また、ステップＳＴ６０２で矩形識別原稿
でない場合、画像全体に一様な画像補正処理を行う（ス
テップＳＴ６０４）。一様な画像補正処理として、一様
下地原稿であるならば文字が見やすいように画像全体に
ハイパスフィルタ処理を施したり、網点写真原稿である
ならばモアレがない滑らかな画像にするために画像全体
にローパスフィルタを施しても良い。このようにステッ
プＳＴ６０３と同様に画像の種類毎に画像補正処理のル
ールを設定しておくことで、品質の良い画像データに補
正することが可能となる。なおこの場合、ステップＳＴ
６０３のように矩形領域毎に画像補正処理を行わない
で、画像全体に同一の処理を施すため、高速に行えると
いう利点がある。画像補正処理の結果、ステップＳＴ６
０３と同様に補正画像データ１６３として出力する。If the document is not a rectangular identification document in step ST602, uniform image correction processing is performed on the entire image (step ST604). As a uniform image correction process, if the document is a uniform background document, the entire image is subjected to high-pass filter processing so that the characters are easy to see. A low-pass filter may be applied to the whole. As described above, by setting the rule of the image correction process for each type of image as in step ST603, it is possible to correct the image data to high-quality image data. In this case, step ST
Since the same processing is performed on the entire image without performing image correction processing for each rectangular area as in 603, there is an advantage that high-speed processing can be performed. As a result of the image correction processing, step ST6
In the same manner as in step S03, the image data is output as corrected image data 163.

【００８０】また、ステップＳＴ６０１で識別不能原稿
である場合、領域識別部１０２で得られた画素属性の情
報を利用した画像補正処理は行わないで、入力画像デー
タをそのまま補正画像データとして出力する（ステップ
ＳＴ６０５）。If the document is an unidentifiable document in step ST601, the input image data is directly output as corrected image data without performing the image correction process using the pixel attribute information obtained by the area identification section 102 ( Step ST605).

【００８１】このように、領域識別した結果から入力画
像データの種類を判定して、判定された画像データの種
類にしたがって画像補正処理を行うことで、品質の良い
画像データを得ることが可能となる。As described above, by determining the type of the input image data from the result of the region identification and performing the image correction processing according to the determined type of the image data, it is possible to obtain high-quality image data. Become.

【００８２】次に、第３実施例について説明する。Next, a third embodiment will be described.

【００８３】第３実施例は、画像をファイリングする
際、画像の種類の判別結果にしたがって、格納する画像
形式を選択し、その形式に変換し、格納する画像処理装
置である。The third embodiment is an image processing apparatus for selecting an image format to be stored in accordance with the result of discriminating the type of the image, converting the format into the format, and storing the format in filing the image.

【００８４】図９は、第３実施例に係る画像処理装置の
概略構成を示すものである。図９において、１０１は画
像入力部、１０２は領域識別部、１０３は画像種別判定
部、１０７は画像形式変更部、１０５は画像記憶部であ
り、図１で示した第１実施例におけるデータ変換部１０
４を画像形式変更部１０７とし、画像出力部１０５を画
像記憶部１０８とした形態をなしている。画像入力部１
０１、領域識別部１０２、画像種別判定部１０３は既に
説明した通りであるので説明を省略する。画像記憶部１
０８は画像形式を変更後の出力先であるため、実質的に
は画像出力部１０５と同様の動作を行うとみなせる。し
たがって、ここでは画像形式変更部１０７ついて説明す
る。FIG. 9 shows a schematic configuration of an image processing apparatus according to the third embodiment. 9, reference numeral 101 denotes an image input unit, 102 denotes an area identification unit, 103 denotes an image type determination unit, 107 denotes an image format change unit, and 105 denotes an image storage unit, and data conversion in the first embodiment shown in FIG. Part 10
4 as an image format change unit 107 and an image output unit 105 as an image storage unit 108. Image input unit 1
01, the region identification unit 102, and the image type determination unit 103 are the same as those already described, and a description thereof will be omitted. Image storage unit 1
08 is the output destination after changing the image format, it can be considered that the same operation as the image output unit 105 is performed. Therefore, here, the image format changing unit 107 will be described.

【００８５】図１０は、図９に示した画像処理装置にお
ける文書処理手順の例を示すフローチャートである。図
２に示した図１における構成のフローチャートのうち、
データ変換処理（ステップＳＴ２０５）を画像形式変更
処理（ステップＳＴ２０８）に変更した形態をなす。FIG. 10 is a flowchart showing an example of a document processing procedure in the image processing apparatus shown in FIG. In the flowchart of the configuration in FIG. 1 shown in FIG. 2,
The data conversion process (step ST205) is changed to an image format change process (step ST208).

【００８６】画像種別判定部１０３で入力された画像デ
ータの文書としての種類を決定した後、画像形式変更部
１０７において、文書の種類にしたがって画像記憶部１
０８に格納するのに適した画像形式を選択し、その形式
に変更する処理を行う（ステップＳＴ２０８）。After the type of the image data input as a document is determined by the image type determining unit 103, the image format changing unit 107 stores the image data in the image storage unit 1 according to the type of the document.
Then, a process of selecting an image format suitable for storing the data in the format 08 is performed (step ST208).

【００８７】図１１は、画像形式変更部１０７における
一例としての画像形式変更処理の詳細を示すフローチャ
ートであり、図１０に示したステップＳＴ２０８で行わ
れる処理のフローチャートである。FIG. 11 is a flowchart showing details of an example of the image format changing process in the image format changing unit 107, and is a flowchart of the process performed in step ST208 shown in FIG.

【００８８】画像種別情報１６１は、画像種別判定部１
０３の出力で、入力画像データの種類を表し、網点写真
原稿１５３、網点下地原稿１５４、連続階調写真原稿１
５５、一様下地原稿１５６、矩形識別原稿１５７、識別
不能原稿１５８のいずれかである。The image type information 161 is stored in the image type
03 indicates the type of the input image data, and includes a halftone photo original 153, a halftone background original 154, and a continuous tone photo original 1
55, a uniform base document 156, a rectangular identification document 157, or an unidentifiable document 158.

【００８９】まず、画像種別情報１６１が識別不能原稿
であるか調べる（ステップＳＴ７０１）。識別不能原稿
でない場合、次に矩形識別原稿であるか調べる（ステッ
プＳＴ７０２）。矩形識別原稿である場合、領域識別部
１０２で得られた画素属性の情報を利用して、入力画像
データに対して矩形領域毎に画像形式変更処理を行う
（ステップＳＴ７０３）。First, it is checked whether the image type information 161 is an unidentifiable document (step ST701). If the original is not an unidentifiable original, it is checked whether the original is a rectangular identification original (step ST702). If the document is a rectangular identification document, an image format change process is performed on the input image data for each rectangular region using the pixel attribute information obtained by the region identification unit 102 (step ST703).

【００９０】画像形式としては、例えば、ＪＰＥＧ形式
やＴＩＦＦ（Ｇ４）形式などがある。予め画素属性毎に
画像形式の変更のルールを設定しておき、そのルールに
したがって画像形式変更処理を行わせても良い。例え
ば、文字領域であるならばＴＩＦＦ（Ｇ４）形式に変更
することでデータ量を削減でき、効率の良い画像データ
に変更される。また、網点写真領域や連続階調写真領域
であるならばＪＰＥＧ形式にすることで、画質の劣化を
抑えつつデータ量の圧縮が可能となり、この場合も効率
の良い画像データに変更される。その結果、形式変更画
像データ１６４として格納することが可能となる。この
場合、ひとつの画像データから複数の形式を持った画像
データが生成される。したがって格納する際は、それぞ
れ別個に格納されても良い。このとき、元の画像データ
の構造とそこから発生した形式変更画像データ名を参照
ファイルとして保持しておき、復元する際はその参照フ
ァイルを参照して形式変更画像データ１６４から元の画
像データを再構成することも可能である。このように予
め画素属性毎に画像形式変更処理のルールを設定してお
くことで、データ量を削減した効率の良い画像データ形
式に変更することが可能となる。The image format includes, for example, the JPEG format and the TIFF (G4) format. A rule for changing the image format may be set in advance for each pixel attribute, and the image format may be changed according to the rule. For example, in the case of a character area, the data amount can be reduced by changing to the TIFF (G4) format, and the image data is changed to efficient image data. Further, if the area is a halftone photographic area or a continuous tone photographic area, the data amount can be compressed while suppressing the deterioration of the image quality by using the JPEG format. In this case, the image data is changed to efficient image data. As a result, the image data can be stored as the format-changed image data 164. In this case, image data having a plurality of formats is generated from one image data. Therefore, when storing, they may be stored separately. At this time, the structure of the original image data and the name of the format-changed image data generated therefrom are held as a reference file, and when restoring, the original image data is converted from the format-changed image data 164 by referring to the reference file. Reconfiguration is also possible. By setting the rule of the image format change process for each pixel attribute in advance in this way, it is possible to change to an efficient image data format with a reduced data amount.

【００９１】また、ステップＳＴ７０２で矩形識別原稿
でない場合、画像全体に一様な画像形式変更処理を行う
（ステップＳＴ７０４）。一様な画像形式変更処理とし
て、一様下地原稿であるならば写真がないのでＴＩＦＦ
（Ｇ４）形式に変更することでデータ量を削減すること
がで、網点写真原稿や連続階調写真原稿であるならばＪ
ＰＥＧ形式にすることで、画質の劣化を抑えつつデータ
量の圧縮を図ること可能となる。このようにステップＳ
Ｔ７０３と同様に画像の種類毎に画像形式変更処理のル
ールを設定しておくことで、データ量を削減した効率の
良い画像データ形式に変更することが可能となる。な
お、この場合、ステップＳＴ７０３のように矩形領域毎
に画像形式変更処理を行わないで、画像全体に同一の処
理を施すため、画像形式変更処理が高速に行え、かつ複
数の画像形式が発生することがないため参照ファイルが
不要であり、格納した形式変更画像データからもとの画
像データを復元する際も高速に行えるという利点があ
る。画像形式変更処理の結果、ステップＳＴ７０３と同
様に形式変更画像データ１６４として格納することが可
能となる。If it is determined in step ST702 that the document is not a rectangular identification document, a uniform image format change process is performed on the entire image (step ST704). As uniform image format change processing, if there is no photo if the original is a uniform background document, TIFF
By changing to the (G4) format, the data amount can be reduced.
By adopting the PEG format, it is possible to compress the data amount while suppressing the deterioration of the image quality. Thus, step S
By setting the rule of the image format change process for each type of image in the same manner as in T703, it is possible to change to an efficient image data format with a reduced data amount. In this case, since the same processing is performed on the entire image without performing the image format change processing for each rectangular area as in step ST703, the image format change processing can be performed at high speed, and a plurality of image formats are generated. Since there is no need for a reference file, there is an advantage that the original image data can be restored at high speed from the stored format-changed image data. As a result of the image format change processing, it is possible to store the format change image data 164 as in step ST703.

【００９２】また、ステップＳＴ７０１で識別不能原稿
である場合、領域識別部１０２で得られた画素属性の情
報を利用した画像形式変更処理は行わないで、入力画像
データをそのまま形式変更画像データとして出力する
（ステップＳＴ７０５）。If the original is not identifiable in step ST701, the input image data is output as is as the format-changed image data without performing the image format change process using the pixel attribute information obtained by the area identification unit 102. (Step ST705).

【００９３】このように、領域識別した結果から入力画
像データの種類を判定して、判定された画像データの種
類にしたがって画像形式変更処理を行うことで、効率の
良い画像データに変更することが可能となり、格納する
際の情報量の削減につながる。As described above, the type of the input image data is determined from the result of the region identification, and the image format is changed in accordance with the determined type of the image data. This makes it possible to reduce the amount of information when storing.

【００９４】次に、第４実施例について説明する。Next, a fourth embodiment will be described.

【００９５】第４実施例は、画像の種類の判別結果にし
たがって、起動するアプリケーションを選択し、起動パ
ラメータを決定し、アプリケーションを起動する画像処
理装置である。The fourth embodiment is an image processing apparatus that selects an application to be started, determines start parameters, and starts the application according to the result of determining the type of image.

【００９６】図１２は、第４実施例に係る画像処理装置
の概略構成を示すものである。図１２において、１０１
は画像入力部、１０２は領域識別部、１０３は画像種別
判定部、１０９はアプリケーション起動部、１１０はア
プリケーション処理結果出力部であり、図１で示した第
１実施例におけるデータ変換部１０４をアプリケーショ
ン起動部１０９とし、画像出力部１０５をアプリケーシ
ョン処理結果出力部１１０とした形態をなしている。画
像入力部１０１、領域識別部１０２、画像種別判定部１
０３は既に説明した通りであるので説明を省略する。ア
プリケーション処理結果出力部１１０は、アプリケーシ
ョンを起動し、処理が行われた結果の出力先であるた
め、実質的には画像出力部１０５と同様の動作を行うと
みなせる。したがって、ここではアプリケーション起動
部１０９について説明する。FIG. 12 shows a schematic configuration of an image processing apparatus according to the fourth embodiment. In FIG. 12, 101
Is an image input unit, 102 is an area identification unit, 103 is an image type determination unit, 109 is an application activation unit, and 110 is an application processing result output unit. The data conversion unit 104 in the first embodiment shown in FIG. The starting unit 109 is used, and the image output unit 105 is used as an application processing result output unit 110. Image input unit 101, area identification unit 102, image type determination unit 1
03 is as described above, and the description is omitted. The application processing result output unit 110 starts the application and is the output destination of the result of the processing. Therefore, it can be considered that the operation is substantially the same as that of the image output unit 105. Therefore, here, the application starting unit 109 will be described.

【００９７】図１３は、図１２に示した画像処理装置に
おける文書処理手順の例を示すフローチャートである。
図２に示した図１における構成のフローチャートのう
ち、データ変換処理（ステップＳＴ２０５）をアプリケ
ーション起動処理（ステップＳＴ２０９）に、画像出力
処理（ステップＳＴ２０６）をアプリケーション処理結
果出力処理（ステップＳＴ２１０）に変更した形態をな
す。FIG. 13 is a flowchart showing an example of a document processing procedure in the image processing apparatus shown in FIG.
In the flowchart of the configuration in FIG. 1 shown in FIG. 2, the data conversion process (step ST205) is changed to the application activation process (step ST209), and the image output process (step ST206) is changed to the application process result output process (step ST210). Form.

【００９８】画像種別判定部１０３で入力された画像デ
ータの文書としての種類を決定した後、アプリケーショ
ン起動部１０９において、文書の種類にしたがって画像
処理を行うのに適したアプリケーションを選択し、起動
パラメータを決定し、アプリケーションを起動する処理
を行う（ステップＳＴ２０９）。After the type of the image data input as a document is determined by the image type determination unit 103, an application suitable for performing image processing according to the type of the document is selected by the application activation unit 109, and the activation parameter Is determined, and processing for activating the application is performed (step ST209).

【００９９】図１４は、アプリケーション起動部１０９
における一例としてアプリケーション起動処理の詳細を
示すフローチャートであり、図１３に示したステップＳ
Ｔ２０９で行われる処理のフローチャートである。FIG. 14 shows the application starting unit 109.
FIG. 14 is a flowchart showing details of an application activation process as an example in FIG.
It is a flowchart of the process performed in T209.

【０１００】画像種別情報１６１は、画像種別判定部１
０３の出力で、入力画像データの種類を表し、網点写真
原稿１５３、網点下地原稿１５４、連続階調写真原稿１
５５、一様下地原稿１５６、矩形識別原稿１５７、識別
不能原稿１５８のいずれかである。The image type information 161 is stored in the image type
03 indicates the type of the input image data, and includes a halftone photo original 153, a halftone background original 154, and a continuous tone photo original 1
55, a uniform base document 156, a rectangular identification document 157, or an unidentifiable document 158.

【０１０１】まず、画像種別情報１６１が識別不能原稿
であるか調べる（ステップＳＴ８０１）。識別不能原稿
でない場合、次に矩形識別原稿であるか調べる（ステッ
プＳＴ８０２）。矩形識別原稿である場合、領域識別部
１０２で得られた画素属性の情報を利用して、入力画像
データに対して矩形領域毎にアプリケーションを起動
し、各々の画像処理を行う（ステップＳＴ８０３）。First, it is checked whether the image type information 161 is an unidentifiable document (step ST801). If the original is not an unidentifiable original, it is checked whether the original is a rectangular identification original (step ST802). If the document is a rectangular identification document, an application is started for each rectangular region on the input image data using the information on the pixel attribute obtained by the region identification unit 102, and each image processing is performed (step ST803).

【０１０２】アプリケーションとしては、例えば、画像
ファイリング、印刷、ＦＡＸ送信、レイアウト理解、文
字認識、罫線抽出などの画像処理をおこなうものでも良
い。予め画素属性毎に起動させるアプリケーションのル
ールを設定しておき、そのルールにしたがってアプリケ
ーション起動処理を行わせても良い。例えば、文字領域
であるならばレイアウト理解を行って、段組などの文書
構造の認識を行い、さらに文字認識によって文字をコー
ド化しても良い。また、網点写真領域や連続階調写真領
域であるならば、イメージ保存のために画像ファイリン
グを起動させても良く、さらに印刷する際、その領域が
カラーであるならば、カラーという起動パラメータを付
属させることで、出力先のプリンタをカラープリンタに
自動的に変更させても良い。アプリケーションによって
処理結果は、アプリケーション処理結果データ１６５と
なり、プリントアウト、ファイル、文字認識結果などさ
まざまな形態をとることも可能である。このように予め
画素属性毎にアプリケーション起動処理のルールを設定
しておくことで、画像データが入力され次第、自動的に
さまざまな処理を行うことが可能となる。The application may be one that performs image processing such as image filing, printing, facsimile transmission, layout understanding, character recognition, and ruled line extraction. A rule of an application to be activated may be set in advance for each pixel attribute, and the application activation processing may be performed according to the rule. For example, if it is a character area, the layout may be understood, a document structure such as a column may be recognized, and the character may be encoded by character recognition. If the area is a halftone picture area or a continuous tone photographic area, the image filing may be activated to save the image. By attaching the printer, the output destination printer may be automatically changed to a color printer. Depending on the application, the processing result is application processing result data 165, which can take various forms such as a printout, a file, and a character recognition result. By setting the rules of the application start processing for each pixel attribute in advance in this way, various processing can be automatically performed as soon as image data is input.

【０１０３】また、ステップＳＴ８０２で矩形識別原稿
でない場合、アプリケーションを起動して画像全体に一
様な画像処理を行う（ステップＳＴ８０４）。一様な画
像形式変換処理として、一様下地原稿であるならば写真
がないのでレイアウト理解を行って、パラグラフなどの
文書構造の認識を行い、さらに文字認識によって文字を
コード化する処理を行ったりすることができ、網点写真
原稿や連続階調写真原稿であるならばイメージ保存のた
めに画像ファイリングを起動させたり、直接印刷やＦＡ
Ｘ送信を行ったりすることが可能である。アプリケーシ
ョンによっての処理の結果は、ステップＳＴ８０３と同
様にアプリケーション処理結果データ１６５となり、プ
リントアウト、ファイル、文字認識結果などさまざまな
形態をとることも可能である。If it is determined in step ST802 that the document is not a rectangular identification document, the application is activated to perform uniform image processing on the entire image (step ST804). As a uniform image format conversion process, if there is no photo if it is a uniform background document, understand the layout, recognize the document structure such as paragraphs, and perform the character encoding process by character recognition. If it is a halftone or continuous tone photo original, activate image filing to save the image, print directly
X transmission can be performed. The result of the processing by the application is the application processing result data 165 as in step ST803, and can take various forms such as a printout, a file, and a character recognition result.

【０１０４】また、ステップＳＴ８０１で識別不能原稿
である場合、領域識別部１０２で得られた画素属性の情
報を利用したアプリケーション起動処理は行わないで、
入力画像データをそのままアプリケーション処理結果デ
ータとして出力するか、画像ファイリングなどを起動し
て画像データをそのままファイルとして格納し、後から
ユーザの判断で処理を選択させることも可能である（ス
テップＳＴ８０５）。If the document is an unidentifiable document in step ST801, the application starting process using the information on the pixel attribute obtained by the area identifying unit 102 is not performed.
It is also possible to output the input image data as it is as the application processing result data, or to activate the image filing or the like, store the image data as a file as it is, and then allow the user to select the processing later (step ST805).

【０１０５】このように、領域識別した結果から入力画
像データの種類を判定して、判定された画像データの種
類にしたがって画像処理を行うアプリケーションを起動
させることで、画像データが入力され次第、自動的にさ
まざまな処理を行うことが可能となり、省力化につなが
る。As described above, the type of input image data is determined from the result of area identification, and an application for performing image processing according to the determined type of image data is activated. Various processing can be performed in a specific manner, leading to labor saving.

【０１０６】このように本発明の画像処理装置は、文書
画像を画像入力部１０１によって取り込み、これに対し
て領域識別部１０２において各画素および近傍画素のデ
ータの状況から物理的あるいは論理的に連結しているも
のを一つの領域として抽出したのち、個々の領域の画像
上の位置、大きさ、形状、構造、濃度分布等の特徴量を
計測し、その計測結果を予め定められたルールに基づい
て文書構成要素として文字、写真、網点などの属性を持
った領域として識別を行う。そして、その属性情報等を
もとに画像種別判定部１０３において入力された文書画
像全体の複雑さなどの特徴から文書の種類を決定し、文
書の種類にしたがってデータ変換部１０４にて画像デー
タ形式の変換を行うことを可能とするものである。As described above, in the image processing apparatus of the present invention, a document image is fetched by the image input unit 101, and the area identification unit 102 physically or logically connects the document image based on the state of the data of each pixel and neighboring pixels. After extracting what is done as one area, measure the feature amount such as position, size, shape, structure, density distribution etc. of each area on the image, and based on the measurement results based on predetermined rules Then, identification is performed as an area having attributes such as characters, photographs, and halftone dots as document constituent elements. Then, based on the attribute information and the like, the image type determination unit 103 determines the type of the document from the characteristics such as the complexity of the entire document image input, and the data conversion unit 104 determines the image data format according to the type of the document. Can be converted.

【０１０７】また、画像補正部１０６において、文書の
種類にしたがった適切な濃度変換あるいはフィルタ処理
を施し画像データを補正する処理を行い、画像形式変更
部１０７において、文書の種類にしたがって画像記憶部
１０８に格納するのに適した画像形式を選択し、その形
式に変更する処理を行うことを可能にする。The image correcting section 106 performs an appropriate density conversion or filter processing in accordance with the type of the document to correct the image data, and the image format changing section 107 stores the image in the image storage section according to the type of the document. In this case, it is possible to select an image format suitable for storing in the storage unit 108 and to perform processing for changing the format.

【０１０８】さらに、アプリケーション起動部１０９に
おいて、文書の種類にしたがって画像処理を行うのに適
したアプリケーションを選択し、起動パラメータを決定
し、アプリケーションを起動する処理を行うことも可能
にしたものである。Further, in the application starting unit 109, it is possible to select an application suitable for performing image processing according to the type of document, determine a starting parameter, and perform a process of starting the application. .

【０１０９】したがって、本装置により入力された文書
画像データに対し、文字、写真、図表、網点等の異なる
文書要素を区別して領域抽出し、網点、写真といったキ
ーとなる領域の有無や、矩形領域の重なりの度合いから
文書構造としての複雑さなどといった特徴量を求めるこ
とにより、入力された画像データがどのような種類の文
書であるのか判定を行うことが可能となり、この判定結
果を利用することによって、画質の劣化が少ない効率的
な画像データに変換したり、品質の良い画像データを得
るための画像補正を行えるようになる。Therefore, in the document image data input by the present apparatus, areas are extracted by distinguishing different document elements such as characters, photographs, charts, and halftone dots. By determining a feature amount such as complexity as a document structure from the degree of overlap of the rectangular areas, it is possible to determine what kind of document the input image data is, and use the determination result. By doing so, it is possible to convert the image data into efficient image data with little deterioration in image quality, and to perform image correction for obtaining high-quality image data.

【０１１０】また、イメージファイルとして保存する際
も、画像データに合ったフォーマットを自動的に選択で
き、効率的なファイリングができる。Also, when saving as an image file, a format suitable for image data can be automatically selected, and efficient filing can be performed.

【０１１１】さらに、文書画像データが入力され次第、
自動的に画像処理アプリケーションを選択・起動するた
め、省力化につながる。Further, as soon as the document image data is input,
Since the image processing application is automatically selected and started, labor is saved.

【０１１２】以上説明したように上記発明の実施の形態
によれば、文書をハードコピーしたり、イメージデータ
に変換して保存しようとする場合、入力された文書画像
データに対し、文字、図、表、写真、網点等の異なる文
書要素を区別して抽出・識別し、網点、写真といったキ
ーとなる領域の有無や、矩形領域の重なりの度合いから
文書構造としての複雑さなどといった特徴量を求めるこ
とにより、入力された画像データがどのような種類の文
書であるのか判定を行うことが可能となる。この判定結
果を利用して、解像度や圧縮率や色数などのデータ変換
処理やフィルタリング処理を施したり、イメージファイ
ルとして保存する際にも最適な画像フォーマットが選択
でき、効率的なファイリングが行えるようになる。As described above, according to the embodiment of the present invention, when a document is to be hard-copyed or converted into image data and stored, characters, figures, Different document elements such as tables, photos, halftone dots, etc. are distinguished and extracted and identified, and features such as the presence / absence of key areas such as halftone dots, photos, and the degree of overlap of rectangular areas, such as the complexity of the document structure, are determined. By obtaining, it is possible to determine what kind of document the input image data is. By using this determination result, the most suitable image format can be selected when performing data conversion processing such as resolution, compression ratio, number of colors, filtering processing, and saving as an image file, so that efficient filing can be performed. become.

【０１１３】また、入力画像データを処理するのに適し
た画像処理アプリケーションを起動させることができ
る。Also, an image processing application suitable for processing input image data can be started.

【０１１４】[0114]

【発明の効果】以上詳述したようにこの発明によれば、
画像の劣化を少なくして効率的に画像処理を行うことの
できる画像処理装置を提供することができる。As described in detail above, according to the present invention,
It is possible to provide an image processing apparatus capable of performing image processing efficiently with less deterioration of an image.

【図面の簡単な説明】[Brief description of the drawings]

【図１】本発明の第１実施例に係わる画像処理装置の概
略構成を示すブロック図。FIG. 1 is a block diagram illustrating a schematic configuration of an image processing apparatus according to a first embodiment of the present invention.

【図２】第１実施例の画像処理装置における画像処理の
例を説明するためのフローチャート。FIG. 2 is a flowchart illustrating an example of image processing in the image processing apparatus according to the first embodiment.

【図３】第１実施例の画像処理装置における画像種別判
定処理を説明するためのフローチャート。FIG. 3 is a flowchart illustrating an image type determination process in the image processing apparatus according to the first embodiment.

【図４】第１実施例の画像処理装置における画像種別判
定処理を説明するためのフローチャート。FIG. 4 is a flowchart illustrating an image type determination process in the image processing apparatus according to the first embodiment.

【図５】第１実施例の画像処理装置におけるデータ変換
処理を説明するためのフローチャート。FIG. 5 is a flowchart illustrating data conversion processing in the image processing apparatus according to the first embodiment.

【図６】本発明の第２実施例に係わる画像処理装置の概
略構成を示すブロック図。FIG. 6 is a block diagram illustrating a schematic configuration of an image processing apparatus according to a second embodiment of the present invention.

【図７】第２実施例の画像処理装置における画像処理の
例を説明するためのフローチャート。FIG. 7 is a flowchart illustrating an example of image processing in the image processing apparatus according to the second embodiment.

【図８】第２実施例の画像処理装置における画像補正処
理を説明するためのフローチャート。FIG. 8 is a flowchart illustrating an image correction process in the image processing apparatus according to the second embodiment.

【図９】本発明の第３実施例に係わる画像処理装置の概
略構成を示すブロック図。FIG. 9 is a block diagram illustrating a schematic configuration of an image processing apparatus according to a third embodiment of the present invention.

【図１０】第３実施例の画像処理装置における画像処理
の例を説明するためのフローチャート。FIG. 10 is a flowchart illustrating an example of image processing in the image processing apparatus according to the third embodiment.

【図１１】第３実施例の画像処理装置における画像形式
変更処理を説明するためのフローチャート。FIG. 11 is a flowchart illustrating an image format change process in the image processing apparatus according to the third embodiment.

【図１２】本発明の第４実施例に係わる画像処理装置の
概略構成を示すブロック図。FIG. 12 is a block diagram illustrating a schematic configuration of an image processing apparatus according to a fourth embodiment of the present invention.

【図１３】第４実施例の画像処理装置における画像処理
の例を説明するためのフローチャート。FIG. 13 is a flowchart illustrating an example of image processing in the image processing apparatus according to the fourth embodiment.

【図１４】第４実施例の画像処理装置におけるアプリケ
ーション起動処理を説明するためのフローチャート。FIG. 14 is a flowchart illustrating an application activation process in the image processing apparatus according to the fourth embodiment.

[Explanation of symbols]

１０１…画像入力部（入力手段）１０２…領域識別部（識別手段）１０３…画像種別判定部（判定手段）１０４…データ変換部（変換手段）１０５…画像出力部（出力手段）１０６…画像補正部（補正手段）１０７…画像形式変更部（変更手段）１０８…画像記憶部（記憶手段）１０９…アプリケーション起動部（起動処理手段）１１０…アプリケーション処理結果出力部（出力手段） 101 image input unit (input means) 102 area discriminating unit (identifying means) 103 image type determining unit (determining means) 104 data conversion unit (converting means) 105 image output unit (output means) 106 image correction Unit (correction unit) 107 ... Image format change unit (change unit) 108 ... Image storage unit (storage unit) 109 ... Application start unit (start processing unit) 110 ... Application processing result output unit (output unit)

フロントページの続き (72)発明者小平直朗神奈川県川崎市幸区小向東芝町１番地株式会社東芝研究開発センター内 (72)発明者久保田浩明神奈川県川崎市幸区小向東芝町１番地株式会社東芝研究開発センター内Ｆターム(参考） 5B050 BA10 BA15 BA16 BA18 BA19 DA02 DA03 EA04 EA06 EA14 FA01 5B057 BA11 CA08 CA12 CA16 CB08 CB12 CB16 CE06 CE11 CE17 DA08 DB02 DC01 DC36 5C077 LL18 LL19 MP01 MP02 MP05 MP06 PP15 PP20 PP27 PP28 PP37 PQ22 5L096 BA12 BA17 EA03 FA43 FA44 FA45 LA05 Continuing from the front page (72) Inventor Naorou Kodaira 1 Tokoba, Komukai Toshiba-cho, Kawasaki-shi, Kanagawa Prefecture Inside the Toshiba R & D Center (72) Inventor Hiroaki Kubota 1, Komukai Toshiba-cho, Koyuki-ku, Kawasaki-shi, Kanagawa F-term in Toshiba Research and Development Center Co., Ltd. (reference) PP28 PP37 PQ22 5L096 BA12 BA17 EA03 FA43 FA44 FA45 LA05

Claims

[Claims]

1. An input means for inputting a document image, an area identifying means for extracting an area from image data input by the input means using characteristics of pixels, and identifying an attribute of the extracted area; Determining means for determining the type of the image data from the distribution of the area for each attribute identified by the area identifying means.

2. The image processing apparatus according to claim 1, wherein the image data of the uniform background, the image data of the entire image having the halftone background, the image data of the halftone photograph only, the image data of the continuous tone photograph only, and the area identification based on the rectangle are provided. 2. The image processing apparatus according to claim 1, wherein at least one type is determined from possible image data and image data in which a region cannot be identified by a rectangle.

3. An input means for inputting a document image, an area identifying means for extracting an area from the image data input by the input means by using a feature of a pixel, and identifying an attribute of the extracted area; Determining means for determining the type of the image data from the distribution of the area for each attribute identified by the area identifying means; resolution, compression ratio, number of colors, etc. of the image data according to the type determined by the determining means An image processing apparatus comprising: a conversion unit configured to convert the image data; and an output unit configured to output image data converted by the conversion unit.

4. An input means for inputting a document image, an area identifying means for extracting an area from the image data input by the input means using a feature of a pixel, and identifying an attribute of the extracted area; Determining means for determining the type of the image data from the distribution of the area for each attribute identified by the area identifying means; and performing density conversion or filter processing according to the type determined by the determining means to convert the image data. An image processing apparatus comprising: a correcting unit for correcting; and an output unit for outputting image data corrected by the correcting unit.

5. An input means for inputting a document image, an area extracting means for extracting an area from the image data input by the input means using characteristics of pixels, and identifying an attribute of the extracted area; Determining means for determining the type of the image data from the distribution of the area for each attribute identified by the area identifying means; changing means for changing the image format of the image data according to the type determined by the determining means; A storage unit for storing image data in the image format changed by the changing unit.

6. An input means for inputting a document image, an area extracting means for extracting an area from the image data input by the input means using characteristics of pixels, and identifying an attribute of the extracted area; Determining means for determining the type of the image data from the distribution of the area for each attribute identified by the area identifying means; and image filing, printing, transmission, layout understanding, and text according to the type determined by the determining means. Processing means for selecting an application for image processing such as recognition and ruled line extraction, setting parameters, and activating the application to perform image processing of the image data; and output means for outputting image data image-processed by the processing means An image processing apparatus, comprising: