JP2003338935A

JP2003338935A - Image processing equipment and method therefor

Info

Publication number: JP2003338935A
Application number: JP2002145062A
Authority: JP
Inventors: Tomotoshi Kanatsu; 知俊金津
Original assignee: Canon Inc
Current assignee: Canon Inc
Priority date: 2002-05-20
Filing date: 2002-05-20
Publication date: 2003-11-28

Abstract

<P>PROBLEM TO BE SOLVED: To provide image processing equipment wherein image is compressed in such a manner that high picture quality is maintained in a target size, and to provide its method. <P>SOLUTION: In image compression processing, region dividing is performed to a color document image, a binary MMR compression code is used for a latter region part, and a JPEG compression code is used for a substratum part extracting letters. Data are obtained wherein information regarding both of the latter region and the substratum, region position information and latter color information are synthesized, so that high compression ratio is obtained while picture quality is maintained to be high. In the processing, resolution is determined while resolution conversion is performed repeatedly in such a manner that the whole compression code size is accommodated in a previously designated size. <P>COPYRIGHT: (C)2004,JPO

Description

Detailed Description of the Invention

【０００１】[0001]

【発明の属する技術分野】本発明は画像処理装置、及び
方法に関するものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to an image processing apparatus and method.

【０００２】[0002]

【従来の技術】近年、スキャナの普及により文書の電子
化が進んでいる。電子化された文書をフルカラービット
マップ形式で記憶しようとすると、例えば。Ａ４サイズ
の場合では３００ｄｐｉで約２４Ｍバイトにもなり、必
要なメモリが膨大になる。このような大容量のデータ
は、メイルに添付して送信するのに適したサイズとはい
えない。そこで、フルカラー画像を圧縮することが通常
行われており、その圧縮方式としてＪＰＥＧが知られて
いる。ＪＰＥＧは写真などの自然画像を圧縮するには非
常に効果も高く、画質も良い。しかし一方で、文字部な
どの高周波部分をＪＰＥＧ圧縮すると、モスキートノイ
ズと呼ばれる画像劣化が発生し、圧縮率も悪い。そこ
で、領域分割を行い、文字領域を抜いた下地部分のＪＰ
ＥＧ圧縮と、色情報付き文字領域部分のＭＭＲ圧縮を作
成し、解凍時は白部分はＪＰＥＧ画像を透過し、黒部分
は代表文字色を載せて表現する方法があった。2. Description of the Related Art In recent years, the spread of scanners has promoted the digitization of documents. If you try to store a digitized document in full color bitmap format, for example: In the case of A4 size, 300 dpi requires about 24 Mbytes, which requires a huge amount of memory. Such a large amount of data cannot be said to have a size suitable for being attached to a mail and transmitted. Therefore, compression of a full-color image is usually performed, and JPEG is known as a compression method thereof. JPEG is very effective in compressing a natural image such as a photograph, and has good image quality. On the other hand, however, when a high frequency portion such as a character portion is JPEG-compressed, image deterioration called mosquito noise occurs and the compression rate is poor. Therefore, the JP of the background part is extracted by dividing the area and removing the character area.
There is a method in which EG compression and MMR compression of a character area portion with color information are created, and when decompressing, a white portion transmits a JPEG image and a black portion is represented by a representative character color.

【０００３】[0003]

【発明が解決しようとする課題】文書画像送信の為の圧
縮においては、高画質と低データ量という相反するふた
つの事項が要求される。特に、あらかじめユーザがデー
タ量を指定している場合、そのデータサイズを上限とし
て、最も高画質が得られる圧縮方法で圧縮を行うことが
要求される。In the compression for transmitting a document image, two contradictory matters, high image quality and low data amount, are required. In particular, when the user specifies the amount of data in advance, it is required to perform compression by the compression method that gives the highest image quality, with the data size as the upper limit.

【０００４】本発明は上記のように、定められたサイズ
内に圧縮して文書画像を送信しようとする際に、より高
画質が得られるような圧縮方法で圧縮をおこなう画像処
理装置、並びにその方法を提供することを目的とする。As described above, the present invention is an image processing apparatus which compresses a document image by compressing it within a predetermined size by a compression method so as to obtain higher image quality, and an image processing apparatus therefor. The purpose is to provide a method.

【０００５】[0005]

【課題を解決するための手段】上記目的を達成するた
め、本発明に係る装置は、多値画像データから文字領域
を抽出し、該文字領域の位置データを生成する抽出手段
と、前記文字領域の多値画像データを二値化して、文字
領域二値画像データを生成する二値化手段と、前記文字
領域の代表色を算出し、文字の色データを生成する色算
出手段と、文字以外の領域の多値画像データを用いて、
前記文字領域の多値画像データを変換し、文字なし多値
画像データを生成する変換手段と、前記文字なし多値画
像データの解像度を落として縮小文字なし多値画像デー
タを生成する解像度変換手段と、前記縮小文字なし多値
画像データを圧縮する第１圧縮手段と、前記文字領域二
値画像データを圧縮する第２圧縮手段と、を備え、あら
かじめ定められた要求総圧縮データサイズと、文字領域
二値画像データの差より、文字なし多値画像データの解
像度変換手段を制御することを特徴とする。In order to achieve the above object, an apparatus according to the present invention comprises an extracting means for extracting a character area from multi-valued image data and generating position data of the character area, and the character area. Binarizing means for binarizing the multivalued image data to generate character area binary image data, color calculating means for calculating a representative color of the character area and generating character color data, Using the multi-valued image data of the area
Conversion means for converting the multi-valued image data of the character area to generate multi-valued image data without characters, and resolution conversion means for reducing the resolution of the multi-valued image data without characters to generate reduced multi-valued image data without characters. And a first compression unit for compressing the reduced character-less multivalued image data and a second compression unit for compressing the character region binary image data, and a predetermined required total compressed data size and a character It is characterized in that the resolution conversion means for characterless multi-valued image data is controlled based on the difference between the area binary image data.

【０００６】[0006]

【発明の実施の形態】以下に、図面を参照して、この発
明の好適な実施の形態を例示的に詳しく説明する。ただ
し、この実施の形態に記載されている構成要素はあくま
で例示であり、この発明の範囲をそれらのみに限定する
趣旨のものではない。BEST MODE FOR CARRYING OUT THE INVENTION Preferred embodiments of the present invention will be illustratively described in detail below with reference to the drawings. However, the constituent elements described in this embodiment are merely examples, and the scope of the present invention is not intended to be limited thereto.

【０００７】図１に本発明の第一の実施の形態のブロッ
ク図を示す。FIG. 1 shows a block diagram of a first embodiment of the present invention.

【０００８】１０１は入力した原画像から文字領域を検
出して複数の文字領域の座標１０９を作成する文字領域
検出部である。１０２は上記文字領域座標を入力し、原
画像の文字領域部分の二値画像１０７を作成する二値化
部である。１０３は上記二値画像の黒部分と原画像を参
照しながら黒部分の代表色１１０を算出する色算出部で
ある。１０４は上記二値画像（黒）の領域を原画像から
抜いて、その周りの色で塗りつぶした下地画像１０８を
作成する文字部塗りつぶし部である。１０５は上記複数
の二値画像を圧縮して複数の圧縮コードＤ（１１２）を
作成する二値画像圧縮部である。１０６は下地画像１０
８を入力し、圧縮コードＣ（１１２）を作成する下地画
像圧縮部である。１０６は、縮小を行う１６１解像度変
換部、ＪＰＥＧ圧縮を行う１６２、および１６１、１６
２の動作を制御する圧縮制御部からなる。最終的に破線
で囲まれた、１０９から１１２までのデータを結合して
これが圧縮データとなる。Reference numeral 101 denotes a character area detection unit which detects a character area from an input original image and creates coordinates 109 of a plurality of character areas. Reference numeral 102 denotes a binarizing unit that inputs the above-mentioned character area coordinates and creates a binary image 107 of the character area portion of the original image. Reference numeral 103 denotes a color calculation unit that calculates the representative color 110 of the black portion with reference to the black portion of the binary image and the original image. Reference numeral 104 denotes a character portion filling portion that creates the background image 108 by removing the area of the binary image (black) from the original image and filling the background image 108 with the surrounding color. Reference numeral 105 denotes a binary image compression unit that compresses the plurality of binary images to create a plurality of compression codes D (112). 106 is the base image 10
8 is a base image compression unit that inputs 8 and creates a compression code C (112). Reference numeral 106 denotes a 161 resolution conversion unit that performs reduction, 162 that performs JPEG compression, and 161, 16
2 is composed of a compression controller for controlling the operation. Finally, the data from 109 to 112 surrounded by the broken line are combined to form the compressed data.

【０００９】図３は、文字領域検出部での処理を説明す
るフローチャートである。FIG. 3 is a flow chart for explaining the processing in the character area detecting section.

【００１０】ステップＳ３０１にてカラー画像を入力
し、間引いて解像度を落しながら輝度変換を行い、輝度
画像Ｊを作成する。例えば原画像がＲＧＢ２４ビット３
００ｄｐｉだとすると、縦方向、横方向とも４画素ごと
にＹ＝０．２９９Ｒ＋０．５８７Ｇ＋０．１１４Ｂの演算を行い、新しい画像Ｊを作成すると画像ＪはＹ８
ビット７５ｄｐｉの画像となる。ステップＳ３０２にて
輝度データのヒストグラムを取り、二値化閾値Ｔを算出
する。次にステップＳ３０３にて輝度画像ＪをＴにて二
値化し、二値画像Ｋを作成する。更にステップＳ３０４
にて黒画素の輪郭線追跡を行いすべての黒領域をラベル
付けする。次に、ステップＳ３０５にて黒領域中の文字
らしい領域を判定する。ステップＳ３０６にて形や位置
から結合するものを結合する。In step S301, a color image is input, luminance conversion is performed while thinning to reduce the resolution, and a luminance image J is created. For example, the original image is RGB 24-bit 3
If it is 00 dpi, Y = 0.299R + 0.587G + 0.114B is calculated every four pixels in the vertical and horizontal directions, and when a new image J is created, the image J becomes Y8.
It becomes an image of bit 75 dpi. In step S302, the histogram of the brightness data is taken and the binarization threshold value T is calculated. Next, in step S303, the luminance image J is binarized by T to create a binary image K. Further step S304
The contour lines of black pixels are traced and all black areas are labeled. Next, in step S305, a character-like area in the black area is determined. In step S306, shapes and positions to be combined are combined.

【００１１】一例を示す。例えば図４に示すカラー原稿
を入力し、間引いて輝度変換したもののヒストグラムを
取ると図５のようになる。このヒストグラムから平均、
分散、などのデータを利用して閾値Ｔ＝１５０を算出
し、二値化した画像は図６のようになる。図６の黒画素
の輪郭線追跡を行い、すべてをラベリングして、例え
ば、横幅が閾値以下、または高さが閾値以下の黒画素の
集まりのみ文字として許すと図７に示す黒画素の集まり
が文字領域となる（実際にこのような画像が作成される
わけではなく、イメージである）。An example will be shown. For example, the color original shown in FIG. 4 is input, and the histogram of the thinned and converted luminance is obtained as shown in FIG. Average from this histogram,
A threshold value T = 150 is calculated using data such as variance, and the binarized image is as shown in FIG. If the contour lines of the black pixels in FIG. 6 are traced and all are labeled and, for example, only the collection of black pixels whose width is less than or equal to the threshold value or height is less than the threshold value is allowed as characters, the collection of black pixels shown in FIG. It becomes a character area (it is an image, not such an image is actually created).

【００１２】これらの黒画素の集まりを位置の近さや横
幅、高さの一致からグループ化していくと、図８に示す
ような１７個の文字領域が検出できる。これらの座標デ
ータが図１の１０９に格納される。When these groups of black pixels are grouped according to the closeness of position, width and height, 17 character areas as shown in FIG. 8 can be detected. These coordinate data are stored in 109 of FIG.

【００１３】なお上記処理において、カラー画像の二値
化をおこなうのではなく、微分フィルタをかけ、すべて
の画素の近隣の画素とのエッジ量を算出し、そのエッジ
量を二値化することにより得られた二値画像を同様に輪
郭線追跡をして文字領域を検出してもよい。In the above process, the color image is not binarized, but a differential filter is applied to calculate edge amounts of all pixels with neighboring pixels, and the edge amounts are binarized. The obtained binary image may be similarly contour-tracked to detect the character region.

【００１４】二値化部１０２にて、上記の方法にて得ら
れた１７個の文字領域の二値画像を作成する。この二値
画像は、例えば文字領域検出部にて算出したＴにて二値
化しても良いし、その領域ごとにヒストグラムを取って
その文字領域により最適な二値化閾値を算出しても良
い。図５に示した全面のヒストグラムに比べて、文字領
域一部分の輝度ヒストグラムは、図９のようなシンプル
な形が期待できるので、閾値の決定は容易である。９０
１は下地色の集合であり、９０２は文字色の集合であ
る。The binarizing unit 102 creates a binary image of 17 character areas obtained by the above method. This binary image may be binarized by, for example, T calculated by the character area detection unit, or a histogram may be taken for each area to calculate an optimum binarization threshold for the character area. . As compared with the histogram of the entire surface shown in FIG. 5, the luminance histogram of a part of the character area can be expected to have a simple shape as shown in FIG. 9, so that the threshold value can be easily determined. 90
Reference numeral 1 is a set of base colors, and 902 is a set of character colors.

【００１５】上記二値化結果を利用した文字部塗りつぶ
し部１０４の処理の一例を図１０と図１１を用いて説明
する。An example of the processing of the character portion filling portion 104 utilizing the binarization result will be described with reference to FIGS. 10 and 11.

【００１６】グラデーション画像を背景とし、ＡＢＣと
いう青色の文字が中央付近に描かれた、図１０（ａ）の
ような画像を原画像とする。この原画像から（ｂ）のよ
うな１つの文字領域の二値画像を得たとする。本実施の
形態では、全画像を３２×３２の領域（以下、パーツ）
に分割し、パーツごとに処理をおこなう。図１０（ｃ）
にパーツごとに分けた様子を示す。この図では簡単に説
明するため、５×４のパーツに分割した状態を示してい
る。各領域の左上の数字はパーツ番号を示す。このよう
に分割した場合、パーツ００〜０４，１０，１４，２
０，２４，３０〜３５は、ステップＳ１１０３において
文字領域がないと判断されるため、処理は行われずに、
次のパーツに進む。パーツ１１に対しては、ステップＳ
１１０４に進み、対応する二値画像を参照し、二値画像
の白部分に対応するカラー画像のＲＧＢ値（またはＹＵ
Ｖ等でも良い）の平均値ａｖｅ＿ｃｏｌｏｒを算出す
る。次にステップＳ１１０５において、対応する二値画
像を参照し、黒画素に対応する画素の濃度データを上記
ａｖｅ＿ｃｏｌｏｒとする。An image as shown in FIG. 10A, in which a blue character ABC is drawn near the center with a gradation image as a background, is an original image. It is assumed that a binary image of one character area as shown in (b) is obtained from this original image. In the present embodiment, the entire image is a 32 × 32 area (hereinafter, parts)
Divide into and process each part. Figure 10 (c)
Shows how the parts are divided. For the sake of simplicity, this figure shows a state of being divided into 5 × 4 parts. The number on the upper left of each area shows the part number. When divided in this way, parts 00-04,10,14,2
Nos. 0, 24, 30 to 35 are determined to have no character area in step S1103, and thus no processing is performed,
Go to the next part. Step S for parts 11
Proceeding to 1104, the corresponding binary image is referenced, and the RGB value (or YU) of the color image corresponding to the white part of the binary image (or YU
The average value ave_color of V) may be calculated. Next, in step S1105, the corresponding binary image is referred to, and the density data of the pixel corresponding to the black pixel is set as the ave_color.

【００１７】以上の処理を文字領域の存在するパーツ１
２，１３，２１，２２，２３に繰り返す。The above processing is performed for the part 1 having a character area.
Repeat steps 2, 13, 21, 22, 23.

【００１８】このようにして、文字の存在する部分の周
りの画素の平均値をもって、文字の存在する画素を塗り
つぶすことができ、見掛け上自然に文字のみが取り除か
れた、下地画像１０８が生成される。In this manner, the pixel having the character can be filled with the average value of the pixels around the portion having the character, and the background image 108 in which only the character is apparently removed naturally is generated. It

【００１９】一方、上記二値化結果を利用した文字色算
出部の一例を図１２に示す。本実施の形態では部分二値
化結果１０７の結果を利用したがその限りでなく、例え
ば文字領域の座標とカラー画像のみ入力し、カラー画像
を改めて二値化した結果を利用して色算出処理を行って
も良い。抽出されたすべての文字領域ごとに処理を行う
ので、ステップＳ１２０１では未処理の文字座標がある
かどうかチェックし、あったらステップＳ１２０２にす
すみ、無かったらｅｎｄにすすむ。ステップＳ１２０２
にてその文字座標が参照する二値画像の細線化処理を行
い、スキャナ読み込み時の下地から文字部への変化部に
あたる黒を減らしていき、新しい二値画像ｎｅｗｂｉを
作成する。次にステップＳ１２０３にてｎｅｗｂｉの黒
画素に対応する原画像のＲＧＢのそれぞれの値のヒスト
グラムを取る（もちろん、ＹＵＶ等の他の色空間でも構
わない）。ステップＳ１２０４にてＲＧＢそれぞれの代
表値を算出する。例えば、一番大きな値でも良い。また
はヒストグラムのステップ数を少なくして大まかなヒス
トグラムで一番大きな値を得た後に、このヒストグラム
に存在する細かなヒストグラムで一番大きな値を得る方
法でもよい。以上の処理をすべての文字座標に繰り返す
ことにより、すべての文字座標に１つずつ代表色が算出
される。On the other hand, FIG. 12 shows an example of a character color calculation unit utilizing the above binarization result. In the present embodiment, the result of the partial binarization result 107 is used, but the present invention is not limited to this. For example, only the coordinates of the character area and the color image are input, and the color calculation process is performed using the result of binarizing the color image again. You may go. Since processing is performed for all the extracted character areas, it is checked in step S1201 whether or not there is an unprocessed character coordinate, and if there is, it proceeds to step S1202, and if there is none, it proceeds to end. Step S1202
The thinning process of the binary image referred to by the character coordinates is performed to reduce the black corresponding to the changing portion from the background to the character portion at the time of reading by the scanner, and a new binary image newbi is created. Next, in step S1203, a histogram of each value of RGB of the original image corresponding to the black pixel of newbi is obtained (of course, another color space such as YUV may be used). In step S1204, representative values of RGB are calculated. For example, it may be the largest value. Alternatively, the number of steps in the histogram may be reduced to obtain the largest value in the rough histogram, and then the largest value in the fine histogram existing in this histogram may be obtained. By repeating the above process for all the character coordinates, one representative color is calculated for all the character coordinates.

【００２０】１０５にて、部分二値画像１０７のそれぞ
れをＭＭＲ圧縮して圧縮コードＤを作成する。At 105, each of the partial binary images 107 is MMR compressed to generate a compression code D.

【００２１】１０６にて、下地画像に対し解像度変換お
よびＪＰＥＧ圧縮を行い圧縮コードＣを作成する。At 106, resolution conversion and JPEG compression are performed on the base image to create a compression code C.

【００２２】ここで、１０６における下地画像圧縮部の
処理の詳細を図１３のフローチャートを用いて説明す
る。The details of the processing of the background image compression section 106 will be described with reference to the flowchart of FIG.

【００２３】ステップＳ１３０１にて、あらかじめ与え
られた総圧縮コードサイズＳと、部分二値画像を圧縮し
た圧縮コードＤのコードサイズＢの差より、下地圧縮コ
ードの目標サイズＰを計算する。In step S1301, the target size P of the base compression code is calculated from the difference between the total compression code size S given in advance and the code size B of the compression code D obtained by compressing the partial binary image.

【００２４】ステップＳ１３０２にて、下地多値画像に
対してＪＰＥＧ圧縮を行い、仮圧縮コードＸを作成す
る。In step S1302, the background multi-valued image is subjected to JPEG compression to create a temporary compression code X.

【００２５】ステップＳ１３０３にて、Ｘのサイズ≦Ｐ
のとき、Ｘを最終的な圧縮コードＣとして出力して終了
する。それ以外の場合は、ステップＳ１３０４に進む。In step S1303, the size of X ≦ P
At that time, X is output as the final compressed code C, and the process ends. Otherwise, the process proceeds to step S1304.

【００２６】ステップＳ１３０４では、下地多値画像に
対して１／２の解像度変換を施す。その後Ｓ１３０２に
戻ってＳ１３０２からＳ１３０４までの処理を繰り返
す。In step S1304, 1/2 resolution conversion is applied to the background multi-valued image. After that, the process returns to S1302 and the processes from S1302 to S1304 are repeated.

【００２７】最終的に、下地画像に対してはその目標サ
イズＰを上限として可能な限り解像度の高い圧縮コード
Ｃが生成される。Finally, for the background image, the compression code C having the highest possible resolution is generated with the target size P as the upper limit.

【００２８】最後に、文字領域座標（１０９）、文字部
代表色（１１０）、圧縮コードＣ（１１１）、圧縮コー
ドＤ（１１２）の４つをまとめたフォーマットを必要な
らば作成する。Finally, if necessary, a format in which the character region coordinates (109), the character portion representative color (110), the compression code C (111), and the compression code D (112) are combined is created.

【００２９】このフォーマットは、ＰＤＦであってもよ
いし、ＸＭＬであてってもよい。The format may be PDF or XML.

【００３０】図２に伸長時の構成図を示す。FIG. 2 shows a configuration diagram at the time of expansion.

【００３１】２０１は圧縮コードＣを入力し、ＪＰＥＧ
伸長処理を行い多値画像Ｅを作成するＪＰＥＧ伸長部で
ある。２０２は圧縮コードを入力し二値画像Ｆを作成す
るＭＭＲ伸長部である。２０３は多値画像Ｅを入力し、
拡大して画像Ｇを作成する拡大部である。２０４は文字
領域の座標と代表色を入力し、上記二値画像Ｆを参照し
ながら白部分は画像Ｇを、黒部分は代表色を選択し、最
終的な画像である画像Ｈ２０７を作成する画像合体部で
ある。A compressed code C is input to 201, and JPEG is input.
This is a JPEG decompression unit that performs decompression processing and creates a multi-valued image E. Reference numeral 202 denotes an MMR decompression unit that inputs a compression code and creates a binary image F. 203 inputs the multi-valued image E,
An enlargement unit that enlarges and creates an image G. Reference numeral 204 is an image for inputting coordinates of a character area and a representative color, selecting the image G for the white portion and the representative color for the black portion while referring to the binary image F, and creating an image H207 as a final image. It is a united part.

【００３２】図１４に合体処理２０４の結果例を示す。
まず図１４（ａ）に圧縮コードＣのＪＰＥＧ伸長結果を
示す。ＪＰＥＧ圧縮の量子化非可逆方式を利用すると図
１０（ｃ）とは微妙に画素値が異なるデータとなってい
る。しかし、文字部を抜く前の原画像をＪＰＥＧ非可逆
圧縮方式で圧縮する場合と比較して、同じ量子化テーブ
ルを利用した場合において画素値の変化は少ない。二値
画像（ｂ）を参照して黒画素の対応するところの画像
（ａ）上に代表色（２０，３０，２５５）データをの
せ、最終的に（ｃ）のような画像ができ上がる。これが
伸長画像２０７となる。FIG. 14 shows an example of the result of the merge processing 204.
First, FIG. 14A shows a JPEG decompression result of the compressed code C. When the quantized lossy method of JPEG compression is used, the pixel values are slightly different from those in FIG. 10C. However, compared to the case where the original image before the character part is removed is compressed by the JPEG lossy compression method, the pixel value changes less when the same quantization table is used. By referring to the binary image (b), representative color (20, 30, 255) data is placed on the image (a) corresponding to the black pixel, and finally an image as shown in (c) is completed. This becomes the expanded image 207.

【００３３】以上、本発明によれば、文書の電子化送信
においてあらかじめ定められた圧縮コードサイズを目標
に圧縮を行う場合に、文書中で重要な文字部分に対して
は高い解像度により常に情報量を高く保ち、文字部分以
外に対しては必要に応じて解像度を落とすことで、目標
サイズ内で効率良く高画質な状態の圧縮コードを提供す
る画像処理装置が実現できる。As described above, according to the present invention, in the case of performing compression with a predetermined compression code size as a target in the electronic transmission of a document, the information amount is always provided to the important character portion in the document by the high resolution. By keeping the value high and reducing the resolution as needed for portions other than the character portion, it is possible to realize an image processing apparatus that efficiently provides a compressed code of a high image quality within a target size.

【００３４】［その他の実施の形態］第一の実施の形態
では、下地画像圧縮部において、解像度変換を繰り返し
おこないながら目標サイズに漸近する圧縮コードを得て
いたが、同時にＪＰＥＧ圧縮の際の圧縮率を変化させ
て、より目標サイズに近い圧縮コードを得るようにして
もよい。[Other Embodiments] In the first embodiment, the background image compression section obtains a compression code that asymptotically approaches the target size while repeatedly performing resolution conversion. However, at the same time, compression in JPEG compression is performed. The rate may be changed to obtain a compressed code closer to the target size.

【００３５】本発明第二の実施の形態として、その場合
の下地画像圧縮部の動作を図１５のフローチャートを用
いて説明する。As a second embodiment of the present invention, the operation of the background image compression unit in that case will be described with reference to the flowchart of FIG.

【００３６】ステップＳ１５０１において、下地画像の
圧縮コードＣの目標サイズＰを、目標総サイズＳと圧縮
コードDのサイズＢから求める。In step S1501, the target size P of the compression code C of the background image is obtained from the target total size S and the size B of the compression code D.

【００３７】ステップＳ１５０２において、下地多値画
像のＪＰＥＧ圧縮を行い、仮圧縮コードＸを得る。ここ
では圧縮率は低めの量子化テーブルを用いる。In step S1502, the background multi-valued image is subjected to JPEG compression to obtain a temporary compression code X. Here, a quantization table with a low compression rate is used.

【００３８】ステップＳ１５０３において、Ｘのサイズ
≦Ｐならば、Ｘを最終的な圧縮コードＣとして終了す
る。それ以外の場合は、ステップＳ１５０４に進む。In step S1503, if the size of X ≦ P, X is set as the final compressed code C, and the process ends. Otherwise, the process advances to step S1504.

【００３９】ステップＳ１５０４において、下地多値画
像のＪＰＥＧ圧縮を行い、仮圧縮コードＸ'を得る。こ
こではＳ１５０２よりも圧縮率が高くなるような量子化
テーブルを用いる。In step S1504, the background multi-valued image is subjected to JPEG compression to obtain a temporary compression code X '. Here, a quantization table that has a higher compression rate than S1502 is used.

【００４０】ステップＳ１５０５において、Ｘ'のサイ
ズ≦Ｐならば、Ｘ'を最終的な圧縮コードＣとして終了
する。それ以外の場合は、ステップＳ１５０６に進む。In step S1505, if the size of X'≤P, X'is set as the final compressed code C, and the process ends. Otherwise, the process proceeds to step S1506.

【００４１】ステップＳ１３０４では、下地多値画像に
対して１／２の解像度変換を施す。その後Ｓ１５０２に
戻ってＳ１５０２からＳ１３０４までの処理を繰り返
す。In step S1304, 1/2 resolution conversion is applied to the background multi-valued image. After that, the process returns to S1502 and the processes from S1502 to S1304 are repeated.

【００４２】なお、本説明ではＪＰＥＧの圧縮率を、低
高２段階としたが、更に細かな段階に分けてもよい。In this description, the compression ratio of JPEG is set to two levels, low and high, but it may be divided into more detailed stages.

【００４３】以上、説明したように、本発明第二の実施
の形態においても、文書の電子化送信においてあらかじ
め定められた圧縮コードサイズを目標に圧縮を行う場合
に、文書中で重要な文字部分に対しては高い解像度によ
り常に情報量を高く保ち、文字部分以外に対しては必要
に応じて解像度を落とすとともに、ＪＰＥＧの圧縮率を
選択することで、目標サイズ内で効率良く高画質な状態
の圧縮コードを提供する画像処理装置が実現できる。As described above, also in the second embodiment of the present invention, when the compression is performed with the predetermined compression code size as the target in the electronic transmission of the document, the important character portion in the document For high resolution, the amount of information is always kept high, and for non-text portions, the resolution is reduced as necessary, and the JPEG compression rate is selected, so that high image quality can be achieved efficiently within the target size. An image processing apparatus that provides the compressed code of

【００４４】（他の実施形態）以上、本発明の実施形態
について詳述したが、本発明は、複数の機器から構成さ
れるシステムに適用しても良いし、また、一つの機器か
らなる装置に適用しても良い。(Other Embodiments) Although the embodiments of the present invention have been described in detail above, the present invention may be applied to a system composed of a plurality of devices, or an apparatus composed of a single device. May be applied to.

【００４５】なお、本発明は、前述した実施形態の機能
を実現するソフトウェアのプログラムを、システム或い
は装置に直接或いは遠隔から供給し、そのシステム或い
は装置のコンピュータが該供給されたプログラムコード
を読み出して実行することによっても達成される場合を
含む。その場合、プログラムの機能を有していれば、形
態は、プログラムである必要はない。In the present invention, a software program for realizing the functions of the above-described embodiments is supplied to a system or device directly or remotely, and the computer of the system or device reads the supplied program code. Including cases that can be achieved by executing. In that case, the form need not be a program as long as it has the functions of the program.

【００４６】従って、本発明の機能処理をコンピュータ
で実現するために、該コンピュータにインストールされ
るプログラムコード自体も本発明を実現するものであ
る。つまり、本発明のクレームでは、本発明の機能処理
を実現するためのコンピュータプログラム自体も含まれ
る。Therefore, the program code itself installed in the computer to implement the functional processing of the present invention by the computer also implements the present invention. That is, the claims of the present invention include the computer program itself for realizing the functional processing of the present invention.

【００４７】その場合、プログラムの機能を有していれ
ば、オブジェクトコード、インタプリタにより実行され
るプログラム、ＯＳに供給するスクリプトデータ等、プ
ログラムの形態を問わない。In this case, the program may take any form such as an object code, a program executed by an interpreter, or script data supplied to an OS as long as it has the function of the program.

【００４８】プログラムを供給するための記録媒体とし
ては、例えば、フロッピー（登録商標）ディスク、ハー
ドディスク、光ディスク、光磁気ディスク、ＭＯ、ＣＤ
−ＲＯＭ、ＣＤ−Ｒ、ＣＤ−ＲＷ、磁気テープ、不揮発
性のメモリカード、ＲＯＭ、ＤＶＤ（ＤＶＤ−ＲＯＭ，
ＤＶＤ−Ｒ）などがある。A recording medium for supplying the program is, for example, a floppy (registered trademark) disk, a hard disk, an optical disk, a magneto-optical disk, an MO, a CD.
-ROM, CD-R, CD-RW, magnetic tape, non-volatile memory card, ROM, DVD (DVD-ROM,
DVD-R).

【００４９】その他、プログラムの供給方法としては、
クライアントコンピュータのブラウザを用いてインター
ネットのホームページに接続し、該ホームページから本
発明のコンピュータプログラムそのもの、もしくは圧縮
され自動インストール機能を含むファイルをハードディ
スク等の記録媒体にダウンロードすることによっても供
給できる。また、本発明のプログラムを構成するプログ
ラムコードを複数のファイルに分割し、それぞれのファ
イルを異なるホームページからダウンロードすることに
よっても実現可能である。つまり、本発明の機能処理を
コンピュータで実現するためのプログラムファイルを複
数のユーザに対してダウンロードさせるＷＷＷサーバ
も、本発明のクレームに含まれるものである。In addition, as a method of supplying the program,
It can also be supplied by connecting to a homepage on the Internet using a browser of a client computer, and downloading the computer program itself of the present invention or a compressed file having an automatic installation function from the homepage to a recording medium such as a hard disk. It can also be realized by dividing the program code constituting the program of the present invention into a plurality of files and downloading each file from different homepages. That is, a WWW server that allows a plurality of users to download a program file for implementing the functional processing of the present invention on a computer is also included in the claims of the present invention.

【００５０】また、本発明のプログラムを暗号化してＣ
Ｄ−ＲＯＭ等の記憶媒体に格納してユーザに配布し、所
定の条件をクリアしたユーザに対し、インターネットを
介してホームページから暗号化を解く鍵情報をダウンロ
ードさせ、その鍵情報を使用することにより暗号化され
たプログラムを実行してコンピュータにインストールさ
せて実現することも可能である。The program of the present invention is encrypted to C
By storing the information in a storage medium such as a D-ROM and distributing it to the user, and having the user who satisfies the predetermined conditions download the key information for decrypting the encryption from the home page via the Internet, and by using the key information It is also possible to execute the encrypted program and install the program in a computer to realize it.

【００５１】また、コンピュータが、読み出したプログ
ラムを実行することによって、前述した実施形態の機能
が実現される他、そのプログラムの指示に基づき、コン
ピュータ上で稼動しているＯＳなどが、実際の処理の一
部または全部を行ない、その処理によっても前述した実
施形態の機能が実現され得る。Further, the computer executes the read program to realize the functions of the above-described embodiments, and the OS running on the computer executes the actual processing based on the instructions of the program. The function of the above-described embodiment can be realized also by performing a part or all of the above.

【００５２】さらに、記録媒体から読み出されたプログ
ラムが、コンピュータに挿入された機能拡張ボードやコ
ンピュータに接続された機能拡張ユニットに備わるメモ
リに書き込まれた後、そのプログラムの指示に基づき、
その機能拡張ボードや機能拡張ユニットに備わるＣＰＵ
などが実際の処理の一部または全部を行ない、その処理
によっても前述した実施形態の機能が実現される。Further, after the program read from the recording medium is written in the memory provided in the function expansion board inserted into the computer or the function expansion unit connected to the computer, based on the instruction of the program,
CPU provided on the function expansion board or function expansion unit
Performs a part or all of the actual processing, and the processing realizes the functions of the above-described embodiments.

【００５３】[0053]

【発明の効果】本発明によれば、カラー文書画像に対し
て領域分割をおこない、文字領域部は二値のＭＭＲ圧縮
コード、文字を抜いた下地部分はＪＰＥＧ圧縮コードと
した上で、両者および領域位置情報、文字色情報を合成
したデータにすることで画質を高く保ちながら高い圧縮
率を得る画像圧縮処理において、ユーザが所望する総コ
ードサイズの制限内で、より高画質な画像が得られる画
像処理装置を実現することができる。According to the present invention, a color document image is divided into areas, a character area portion is a binary MMR compression code, and a background portion without characters is a JPEG compression code. In image compression processing that obtains a high compression rate while maintaining high image quality by combining area position information and character color information, a higher quality image can be obtained within the limit of the total code size desired by the user. An image processing device can be realized.

[Brief description of drawings]

【図１】本発明の第１の実施の形態に係る圧縮装置のブ
ロック図である。FIG. 1 is a block diagram of a compression device according to a first embodiment of the present invention.

【図２】本発明の第１の実施の形態に係る伸長装置のブ
ロック図である。FIG. 2 is a block diagram of a decompression device according to the first embodiment of the present invention.

【図３】本発明の第１の実施の形態の文字領域検出部の
処理を示すフローチャートである。FIG. 3 is a flowchart showing processing of a character area detecting unit according to the first embodiment of this invention.

【図４】本発明の第１の実施の形態の文字領域検出処理
を説明するための原画像の例を示す図である。FIG. 4 is a diagram showing an example of an original image for explaining a character area detection process according to the first embodiment of the present invention.

【図５】本発明の第１の実施の形態の文字領域検出処理
を説明するためのヒストグラムを示す図である。FIG. 5 is a diagram showing a histogram for explaining a character area detection process according to the first embodiment of the present invention.

【図６】本発明の第１の実施の形態の文字領域検出処理
を説明するための二値画像の例を示す図である。FIG. 6 is a diagram showing an example of a binary image for explaining a character area detection process according to the first embodiment of the present invention.

【図７】本発明の第１の実施の形態の文字領域検出処理
を説明するための文字領域画像の例を示す図である。FIG. 7 is a diagram showing an example of a character area image for explaining the character area detection processing according to the first embodiment of the present invention.

【図８】本発明の第１の実施の形態の文字領域検出処理
を説明するための文字領域部分の例を示す図である。FIG. 8 is a diagram showing an example of a character area portion for explaining a character area detecting process according to the first embodiment of the present invention.

【図９】本発明の第１の実施の形態の二値化部１０９に
入力される文字領域のヒストグラム結果の例を示す図で
ある。FIG. 9 is a diagram showing an example of a histogram result of a character area input to the binarizing unit 109 according to the first embodiment of this invention.

【図１０】本発明の第１の実施の形態の文字部塗りつぶ
し処理を説明するための図である。FIG. 10 is a diagram illustrating a character part filling process according to the first embodiment of this invention.

【図１１】本発明の第１の実施の形態の文字部塗りつぶ
し処理を説明するためのフローチャートである。FIG. 11 is a flowchart illustrating a character part filling process according to the first embodiment of this invention.

【図１２】本発明の第１の実施の形態に係る色算出部の
処理を説明するためのフローチャートである。FIG. 12 is a flowchart illustrating a process of a color calculation unit according to the first embodiment of the present invention.

【図１３】本発明の第１の実施の形態に係る下地画像圧
縮部の処理を説明するためのフローチャートである。FIG. 13 is a flowchart illustrating a process of a background image compression unit according to the first embodiment of the present invention.

【図１４】本発明の第１の実施の形態に係る伸長装置の
合体部２０４の説明をするための図である。FIG. 14 is a diagram for explaining a united portion 204 of the extension device according to the first embodiment of the present invention.

【図１５】本発明の第２の実施の形態に係る下地画像圧
縮部の処理を説明するためのフローチャートである。FIG. 15 is a flowchart illustrating a process of a background image compression unit according to the second embodiment of the present invention.

───────────────────────────────────────────────────── フロントページの続き (51)Int.Cl.⁷ 識別記号ＦＩテーマコート゛(参考）Ｈ０４Ｎ 1/40 Ｈ０４Ｎ 1/40 ＦＦターム(参考） 5B057 CA01 CA08 CA16 CB01 CB08 CB16 CD05 CE12 CE16 CG01 CH07 DA08 DB06 DB09 DC23 DC25 5C076 AA21 AA22 BA06 5C077 LL19 MP08 PP27 PP28 PP32 PQ19 PQ23 RR02 RR21 5C078 BA27 BA57 CA02 CA21 CA27 DA01 DA07 5L096 AA02 FA06 FA37 FA44 GA34─────────────────────────────────────────────────── ─── Continuation of front page (51) Int.Cl. ⁷ Identification code FI theme code (reference) H04N 1/40 H04N 1/40 FF term (reference) 5B057 CA01 CA08 CA16 CB01 CB08 CB16 CD05 CE12 CE16 CG01 CH07 DA08 DB06 DB09 DC23 DC25 5C076 AA21 AA22 BA06 5C077 LL19 MP08 PP27 PP28 PP32 PQ19 PQ23 RR02 RR21 5C078 BA27 BA57 CA02 CA21 CA27 DA01 DA07 5L096 AA02 FA06 FA37 FA44 GA34

Claims

[Claims]

1. A character area is extracted from multi-valued image data,
Extraction means for generating position data of the character area, binarization means for binarizing the multi-valued image data of the character area to generate character area binary image data, and calculation of a representative color of the character area Then, the color calculating means for generating the color data of the character and the converting means for converting the multivalued image data of the character area by using the multivalued image data of the area other than the character to generate the multivalued image data without character. A resolution conversion means for reducing the resolution of the characterless multi-valued image data to generate reduced characterless multi-valued image data; a first compression means for compressing the reduced characterless multi-valued image data; An image processing apparatus comprising: a second compression unit that compresses the value image data.

2. The image processing apparatus according to claim 1, wherein the first compression unit complies with JPEG compression.

3. The image processing apparatus according to claim 1, wherein the second compression unit complies with MMR compression.

4. The image processing apparatus according to claim 1, wherein the resolution conversion unit determines the resolution by repeatedly performing resolution conversion so that the resolution falls within a total compression code size designated in advance.

5. The resolution conversion means and the first compression means determine the resolution and the compression ratio by repeating the resolution conversion and the compression of a plurality of compression degrees so as to be within the total compression code size designated in advance. The image processing apparatus according to claim 1, wherein the image processing apparatus is an image processing apparatus.

6. An image processing apparatus for decompressing an image compressed by the image processing apparatus according to claim 1, wherein the image is compressed by the first compression means. A first decompression unit for decompressing the reduced character-less multivalued image data, a second decompression unit for decompressing the character region binary image data compressed by the second compression unit, and a reduced characterless multivalued image data A resolution conversion unit for increasing the resolution to generate the characterless multi-valued image data, the position data and the color data are input, and the multi-valued image is obtained from the character area binary image data and the characterless multi-valued image data. An image processing device, comprising: an image merging unit that generates data.