JP2003338934A

JP2003338934A - Image processing equipment and method therefor

Info

Publication number: JP2003338934A
Application number: JP2002145061A
Authority: JP
Inventors: Tomotoshi Kanatsu; 知俊金津
Original assignee: Canon Inc
Current assignee: Canon Inc
Priority date: 2002-05-20
Filing date: 2002-05-20
Publication date: 2003-11-28

Abstract

<P>PROBLEM TO BE SOLVED: To provide image processing equipment wherein image is compressed effectively while picture quality is maintained, and to provide its method. <P>SOLUTION: In image compression processing, region dividing is performed to a color document image, a binary MMR compression code is used for a latter region part, and a JPEG compression code is used for a substratum part extracting letters. Data are obtained wherein information regarding both of the latter region and the substratum, region position information and latter color information are synthesized, so that high compression ratio is obtained while picture quality is maintained to be high. In the processing, a quantization table in the case of producing the JPEG compression code is changed in accordance with areas of a letter region and a not-latter region of an original document image. <P>COPYRIGHT: (C)2004,JPO

Description

Detailed Description of the Invention

【０００１】[0001]

【発明の属する技術分野】本発明は画像処理装置、及び
方法に関するものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to an image processing apparatus and method.

【０００２】[0002]

【従来の技術】近年、スキャナの普及により文書の電子
化が進んでいる。電子化された文書をフルカラービット
マップ形式で記憶しようとすると、例えば。Ａ４サイズ
の場合では３００ｄｐｉで約２４Ｍバイトにもなり、必
要なメモリが膨大になる。このような大容量のデータ
は、メイルに添付して送信するのに適したサイズとはい
えない。そこで、フルカラー画像を圧縮することが通常
行われており、その圧縮方式としてＪＰＥＧが知られて
いる。ＪＰＥＧは写真などの自然画像を圧縮するには非
常に効果も高く、画質も良い。しかし一方で、文字部な
どの高周波部分をＪＰＥＧ圧縮すると、モスキートノイ
ズと呼ばれる画像劣化が発生し、圧縮率も悪い。そこ
で、領域分割を行い、文字領域を抜いた下地部分のＪＰ
ＥＧ圧縮と、色情報付き文字領域部分のＭＭＲ圧縮を作
成し、解凍時は白部分はＪＰＥＧ画像を透過し、黒部分
は代表文字色を載せて表現する方法があった。2. Description of the Related Art In recent years, the spread of scanners has promoted the digitization of documents. If you try to store a digitized document in full color bitmap format, for example: In the case of A4 size, 300 dpi requires about 24 Mbytes, which requires a huge amount of memory. Such a large amount of data cannot be said to have a size suitable for being attached to a mail and transmitted. Therefore, compression of a full-color image is usually performed, and JPEG is known as a compression method thereof. JPEG is very effective in compressing a natural image such as a photograph, and has good image quality. On the other hand, however, when a high frequency portion such as a character portion is JPEG-compressed, image deterioration called mosquito noise occurs and the compression rate is poor. Therefore, the JP of the background part is extracted by dividing the area and removing the character area.
There is a method in which EG compression and MMR compression of a character area portion with color information are created, and when decompressing, a white portion transmits a JPEG image and a black portion is represented by a representative character color.

【０００３】さらに、画質をできるだけ落さずに高い圧
縮率を実現する為に、画像下地に残った高周波成分の大
小により、下地解像度を変化させる技術がある。Further, in order to realize a high compression rate without degrading the image quality as much as possible, there is a technique for changing the background resolution depending on the magnitude of the high frequency component remaining in the image background.

【０００４】[0004]

【発明が解決しようとする課題】しかし、上記方法で
は、解像度変換によって下地画像の全体が一律に縮小さ
れてしまうため、比較的高周波成分の少ない部分までも
過剰に情報削減されるおそれがあった。However, in the above method, since the entire base image is uniformly reduced by the resolution conversion, there is a possibility that information may be excessively reduced even in a portion having a relatively high frequency component. .

【０００５】また実際、電子画像を効率よく作成／表示
するために、解像度変換は整数倍率でおこなわれること
が好まれる。よって、下地画像のサイズは大まかにしか
変更できず、画質と圧縮率のバランスを要求される場合
などに柔軟に対応することが困難だった。In fact, in order to efficiently create / display an electronic image, it is preferable that the resolution conversion is performed at an integer magnification. Therefore, the size of the background image can be roughly changed, and it is difficult to flexibly deal with a case where a balance between image quality and compression rate is required.

【０００６】本発明は上記従来技術の課題を解決するた
めに成されたものであり、画質を保ちながら効率良く画
像を圧縮する画像処理装置、並びにその方法を提供する
ことを目的とする。The present invention has been made to solve the above-mentioned problems of the prior art, and an object of the present invention is to provide an image processing apparatus and method for efficiently compressing an image while maintaining the image quality.

【０００７】[0007]

【課題を解決するための手段】上記目的を達成するた
め、本発明に係る装置は、多値画像データから文字領域
を抽出し、該文字領域の位置データを生成する抽出手段
と、前記文字領域の多値画像データを二値化して、文字
領域二値画像データを生成する二値化手段と、前記文字
領域の代表色を算出し、文字の色データを生成する色算
出手段と、文字以外の領域の多値画像データを用いて、
前記文字領域の多値画像データを変換し、文字なし多値
画像データを生成する変換手段と、前記文字なし多値画
像データの解像度を落として縮小文字なし多値画像デー
タを生成する解像度変換手段と、前記縮小文字なし多値
画像データを圧縮する第１圧縮手段と、前記文字領域二
値画像データを圧縮する第２圧縮手段と、を備え、前記
第１圧縮手段において、文字領域と非文字領域の面積に
よってＪＰＥＧ圧縮の量子化テーブルを変化させること
を特徴とする。In order to achieve the above object, an apparatus according to the present invention comprises an extracting means for extracting a character area from multi-valued image data and generating position data of the character area, and the character area. Binarizing means for binarizing the multivalued image data to generate character area binary image data, color calculating means for calculating a representative color of the character area and generating character color data, Using the multi-valued image data of the area
Conversion means for converting the multi-valued image data of the character area to generate multi-valued image data without characters, and resolution conversion means for reducing the resolution of the multi-valued image data without characters to generate reduced multi-valued image data without characters. A first compression means for compressing the reduced character-less multi-valued image data and a second compression means for compressing the character area binary image data, wherein the first compression means It is characterized in that the quantization table for JPEG compression is changed according to the area of the region.

【０００８】[0008]

【発明の実施の形態】以下に、図面を参照して、この発
明の好適な実施の形態を例示的に詳しく説明する。ただ
し、この実施の形態に記載されている構成要素はあくま
で例示であり、この発明の範囲をそれらのみに限定する
趣旨のものではない。BEST MODE FOR CARRYING OUT THE INVENTION Preferred embodiments of the present invention will be illustratively described in detail below with reference to the drawings. However, the constituent elements described in this embodiment are merely examples, and the scope of the present invention is not intended to be limited thereto.

【０００９】図１に本発明の第一の実施の形態のブロッ
ク図を示す。FIG. 1 shows a block diagram of a first embodiment of the present invention.

【００１０】１０１は入力した原画像から文字領域を検
出して複数の文字領域の座標１０９を作成する文字領域
検出部である。１０２は上記文字領域座標を入力し、原
画像の文字領域部分の二値画像１０８を作成する二値化
部である。１０３は上記二値画像の黒部分と原画像を参
照しながら黒部分の代表色１１０を算出する色算出部で
ある。１０４は上記二値画像（黒）の領域を原画像から
抜いて、その周りの色で塗りつぶし画像Ａを作成する文
字部塗りつぶし部である。１０５は画像Ａを入力し、縮
小して画像Ｂを作成する縮小部である。１０６は画像Ｂ
を入力し、ＪＰＥＧ圧縮して圧縮コードＣ（１１２）を
作成するＪＰＥＧ圧縮部である。１０７は上記複数の二
値画像を入力しＭＭＲ圧縮して複数の圧縮コードＤ（１
１２）を作成するＭＭＲ圧縮部である。最終的に破線で
囲まれた、１０９から１１２までのデータを結合してこ
れが圧縮データとなる。Reference numeral 101 denotes a character area detection unit which detects a character area from an input original image and creates coordinates 109 of a plurality of character areas. Reference numeral 102 denotes a binarization unit that inputs the above-mentioned character area coordinates and creates a binary image 108 of the character area portion of the original image. Reference numeral 103 denotes a color calculation unit that calculates the representative color 110 of the black portion with reference to the black portion of the binary image and the original image. Reference numeral 104 denotes a character portion filling portion that extracts the area of the binary image (black) from the original image and creates a filled image A with the surrounding color. Reference numeral 105 denotes a reduction unit that inputs the image A and reduces it to create the image B. 106 is image B
Is input and JPEG compressed to create a compression code C (112). A plurality of compression codes D (1
12) is an MMR compressing unit for producing. Finally, the data from 109 to 112 surrounded by the broken line are combined to form the compressed data.

【００１１】図３は、文字領域検出部での処理を説明す
るフローチャートである。FIG. 3 is a flow chart for explaining the processing in the character area detecting section.

【００１２】ステップＳ３０１にてカラー画像を入力
し、間引いて解像度を落しながら輝度変換を行い、輝度
画像Ｊを作成する。例えば原画像がＲＧＢ２４ビット３
００ｄｐｉだとすると、縦方向、横方向とも４画素ごと
にＹ＝０．２９９Ｒ＋０．５８７Ｇ＋０．１１４Ｂの演
算を行い、新しい画像Ｊを作成すると画像ＪはＹ８ビッ
ト７５ｄｐｉの画像となる。ステップＳ３０２にて輝度
データのヒストグラムを取り、二値化閾値Ｔを算出す
る。次にステップＳ３０３にて輝度画像ＪをＴにて二値
化し、二値画像Ｋを作成する。更にステップＳ３０４に
て黒画素の輪郭線追跡を行いすべての黒領域をラベル付
けする。次に、ステップＳ３０５にて黒領域中の文字ら
しい領域を判定する。ステップＳ３０６にて形や位置か
ら結合するものを結合する。In step S301, a color image is input, luminance conversion is performed while thinning to reduce the resolution, and a luminance image J is created. For example, the original image is RGB 24-bit 3
If it is 00 dpi, Y = 0.299R + 0.587G + 0.114B is calculated for every four pixels in the vertical and horizontal directions, and when a new image J is created, the image J becomes an image of Y8 bits 75 dpi. In step S302, the histogram of the brightness data is taken and the binarization threshold value T is calculated. Next, in step S303, the luminance image J is binarized by T to create a binary image K. Further, in step S304, contour lines of black pixels are traced to label all black regions. Next, in step S305, a character-like area in the black area is determined. In step S306, shapes and positions to be combined are combined.

【００１３】一例を示す。例えば図４に示すカラー原稿
を入力し、間引いて輝度変換したもののヒストグラムを
取ると図５のようになる。このヒストグラムから平均、
分散、などのデータを利用して閾値Ｔ＝１５０を算出
し、二値化した画像は図６のようになる。図６の黒画素
の輪郭線追跡を行い、すべてをラベリングして、例え
ば、横幅が閾値以下、または高さが閾値以下の黒画素の
集まりのみ文字として許すと図７に示す黒画素の集まり
が文字領域となる（実際にこのような画像が作成される
わけではなく、イメージである）。An example is shown. For example, the color original shown in FIG. 4 is input, and the histogram of the thinned and converted luminance is obtained as shown in FIG. Average from this histogram,
A threshold value T = 150 is calculated using data such as variance, and the binarized image is as shown in FIG. If the contour lines of the black pixels in FIG. 6 are traced and all are labeled and, for example, only the collection of black pixels whose width is less than or equal to the threshold value or height is less than the threshold value is allowed as characters, the collection of black pixels shown in FIG. It becomes a character area (it is an image, not such an image is actually created).

【００１４】これらの黒画素の集まりを位置の近さや横
幅、高さの一致からグループ化していくと、図８に示す
ような１７個の文字領域が検出できる。これらの座標デ
ータが図１の１０９に格納される。When these groups of black pixels are grouped according to the closeness of positions, the width and height, 17 character areas as shown in FIG. 8 can be detected. These coordinate data are stored in 109 of FIG.

【００１５】なお上記処理において、カラー画像の二値
化をおこなうのではなく、微分フィルタをかけ、すべて
の画素の近隣の画素とのエッジ量を算出し、そのエッジ
量を二値化することにより得られた二値画像を同様に輪
郭線追跡をして文字領域を検出してもよい。In the above process, the color image is not binarized, but a differential filter is applied, edge amounts of all pixels with neighboring pixels are calculated, and the edge amounts are binarized. The obtained binary image may be similarly contour-tracked to detect the character region.

【００１６】二値化部１０２にて、上記の方法にて得ら
れた１７個の文字領域の二値画像を作成する。この二値
画像は、例えば文字領域検出部にて算出したＴにて二値
化しても良いし、その領域ごとにヒストグラムを取って
その文字領域により最適な二値化閾値を算出しても良
い。図５に示した全面のヒストグラムに比べて、文字領
域一部分の輝度ヒストグラムは、図９のようなシンプル
な形が期待できるので、閾値の決定は容易である。９０
１は下地色の集合であり、９０２は文字色の集合であ
る。The binarization unit 102 creates a binary image of 17 character areas obtained by the above method. This binary image may be binarized by, for example, T calculated by the character area detection unit, or a histogram may be taken for each area to calculate an optimum binarization threshold for the character area. . As compared with the histogram of the entire surface shown in FIG. 5, the luminance histogram of a part of the character area can be expected to have a simple shape as shown in FIG. 9, so that the threshold value can be easily determined. 90
Reference numeral 1 is a set of base colors, and 902 is a set of character colors.

【００１７】上記二値化結果を利用した文字部塗りつぶ
し部１０４の処理の一例を図１０と図１１を用いて説明
する。An example of the processing of the character portion filling portion 104 using the binarization result will be described with reference to FIGS. 10 and 11.

【００１８】グラデーション画像を背景とし、ＡＢＣと
いう青色の文字が中央付近に描かれた、図１０（ａ）の
ような画像を原画像とする。この原画像から（ｂ）のよ
うな１つの文字領域の二値画像を得たとする。本実施の
形態では、全画像を３２×３２の領域（以下、パーツ）
に分割し、パーツごとに処理をおこなう。図１０（ｃ）
にパーツごとに分けた様子を示す。この図では簡単に説
明するため、５×４のパーツに分割した状態を示してい
る。各領域の左上の数字はパーツ番号を示す。このよう
に分割した場合、パーツ００〜０４，１０，１４，２
０，２４，３０〜３５は、ステップＳ１１０３において
文字領域がないと判断されるため、処理は行われずに、
次のパーツに進む。パーツ１１に対しては、ステップＳ
１１０４に進み、対応する二値画像を参照し、二値画像
の白部分に対応するカラー画像のＲＧＢ値（またはＹＵ
Ｖ等でも良い）の平均値ａｖｅ＿ｃｏｌｏｒを算出す
る。次にステップＳ１１０５において、対応する二値画
像を参照し、黒画素に対応する画素の濃度データを上記
ａｖｅ＿ｃｏｌｏｒとする。An image as shown in FIG. 10A in which a blue character ABC is drawn near the center with a gradation image as a background is an original image. It is assumed that a binary image of one character area as shown in (b) is obtained from this original image. In the present embodiment, the entire image is a 32 × 32 area (hereinafter, parts)
Divide into and process each part. Figure 10 (c)
Shows how the parts are divided. For the sake of simplicity, this figure shows a state of being divided into 5 × 4 parts. The number on the upper left of each area shows the part number. When divided in this way, parts 00-04,10,14,2
Nos. 0, 24, 30 to 35 are determined to have no character area in step S1103, and thus no processing is performed,
Go to the next part. Step S for parts 11
Proceeding to 1104, the corresponding binary image is referenced, and the RGB value (or YU) of the color image corresponding to the white part of the binary image (or YU
The average value ave_color of V) may be calculated. Next, in step S1105, the corresponding binary image is referred to, and the density data of the pixel corresponding to the black pixel is set as the ave_color.

【００１９】以上の処理を文字領域の存在するパーツ１
２，１３，２１，２２，２３に繰り返す。The above processing is performed for a part 1 having a character area.
Repeat steps 2, 13, 21, 22, 23.

【００２０】このようにして、文字の存在する部分の周
りの画素の平均値をもって、文字の存在する画素を塗り
つぶすことができ、見掛け上自然に文字のみが取り除か
れた、画像Aが生成される。In this way, the pixel having the character can be filled with the average value of the pixels around the portion having the character, and the image A in which only the character is apparently removed naturally is generated. .

【００２１】一方、上記二値化結果を利用した文字色算
出部の一例を図１２に示す。本実施の形態では部分二値
化結果１０８の結果を利用したがその限りでなく、例え
ば文字領域の座標とカラー画像のみ入力し、カラー画像
を改めて二値化した結果を利用して色算出処理を行って
も良い。抽出されたすべての文字領域ごとに処理を行う
ので、ステップＳ１２０１では未処理の文字座標がある
かどうかチェックし、あったらステップＳ１２０２にす
すみ、無かったらｅｎｄにすすむ。ステップＳ１２０２
にてその文字座標が参照する二値画像の細線化処理を行
い、スキャナ読み込み時の下地から文字部への変化部に
あたる黒を減らしていき、新しい二値画像ｎｅｗｂｉを
作成する。次にステップＳ１２０３にてｎｅｗｂｉの黒
画素に対応する原画像のＲＧＢのそれぞれの値のヒスト
グラムを取る（もちろん、ＹＵＶ等の他の色空間でも構
わない）。ステップＳ１２０４にてＲＧＢそれぞれの代
表値を算出する。例えば、一番大きな値でも良い。また
はヒストグラムのステップ数を少なくして大まかなヒス
トグラムで一番大きな値を得た後に、このヒストグラム
に存在する細かなヒストグラムで一番大きな値を得る方
法でもよい。以上の処理をすべての文字座標に繰り返す
ことにより、すべての文字座標に１つずつ代表色が算出
される。On the other hand, FIG. 12 shows an example of a character color calculation unit utilizing the above binarization result. Although the result of the partial binarization result 108 is used in the present embodiment, the present invention is not limited to this. For example, only the coordinates of the character region and the color image are input, and the color calculation process is performed using the result of binarizing the color image again. You may go. Since processing is performed for all the extracted character areas, it is checked in step S1201 whether or not there is an unprocessed character coordinate, and if there is, it proceeds to step S1202, and if there is none, it proceeds to end. Step S1202
The thinning process of the binary image referred to by the character coordinates is performed to reduce the black corresponding to the changing portion from the background to the character portion at the time of reading by the scanner, and a new binary image newbi is created. Next, in step S1203, a histogram of each value of RGB of the original image corresponding to the black pixel of newbi is obtained (of course, another color space such as YUV may be used). In step S1204, representative values of RGB are calculated. For example, it may be the largest value. Alternatively, the number of steps of the histogram may be reduced to obtain the largest value in the rough histogram, and then the largest value in the fine histogram existing in this histogram may be obtained. By repeating the above process for all the character coordinates, one representative color is calculated for all the character coordinates.

【００２２】１０７にて、部分二値画像１０８のそれぞ
れをＭＭＲ圧縮して、圧縮コードＤを作成する。At 107, each of the partial binary images 108 is MMR compressed to create a compressed code D.

【００２３】１０５では、文字がぬりつぶされた画像A
の解像度変換を行い、画像Bを生成する。これは文字を
除いた状態で残った、画、写真、背景、飾りなどの各種
情報は、文字ほど高い解像度を必要としないという一般
的原則に基くものである。ここでは一律に１／２縮小を
施すことにするが、要求される画質と、最終的な生成画
像のサイズ次第では、１／４縮小、あるいは縮小せずに
１／１のままにしてもよい。In the case of 105, an image A in which characters are filled
Then, the resolution is converted to generate image B. This is based on the general principle that the various information such as images, photographs, backgrounds, and decorations that remain without the characters require a higher resolution than the characters. Here, the size is uniformly reduced by 1/2, but depending on the required image quality and the size of the final generated image, it may be reduced to 1/4 or may be left as 1/1 without being reduced. .

【００２４】次に１０６において、縮小画像Ｂに対しＪ
ＰＥＧ圧縮を行い圧縮コードＣを作成する。Next, at 106, J is applied to the reduced image B.
PEG compression is performed to create a compression code C.

【００２５】ＪＰＥＧ圧縮処理は一般的なものを用い
る。簡単に説明すると、画像をYUVの各成分に分割し、
それぞれを小領域（たとえば８x８pixel）ごとにDCT変
換し、得られた変換係数を量子化し、符号化することで
圧縮コードを得る。A general JPEG compression process is used. Briefly, divide the image into each component of YUV,
A DCT transform is performed for each small area (for example, 8 × 8 pixels), and the obtained transform coefficient is quantized and encoded to obtain a compressed code.

【００２６】上記量子化は、DCT変換の各係数を、図１
３のような量子化テーブルの対応項ごとにおのおの除算
し、小数点以下の余りを切り捨てることで行う。すなわ
ち量子化テーブルの各要素値が大きいほど粗い量子化と
なり、情報量が削減されるが高い圧縮率が得られる。In the above quantization, each coefficient of DCT transformation is converted into
It is performed by dividing each corresponding term of the quantization table as shown in 3 and truncating the remainder below the decimal point. That is, the larger the value of each element of the quantization table, the coarser the quantization becomes, and the amount of information is reduced, but a high compression rate is obtained.

【００２７】本実施の形態では画像の内容、具体的に
は、図３のＳ３０５にて、文字らしいとされた黒画素集
合の総面積ｍとそれ以外の黒画素集合の総面積ｐからな
る比率p/(m+p)の値によって量子化テーブルを変化させ
る。In the present embodiment, the content of the image, specifically, the ratio of the total area m of the black pixel set which is considered to be the character and the total area p of the other black pixel set in S305 of FIG. The quantization table is changed according to the value of p / (m + p).

【００２８】例を挙げると、p/(m+p)が０.１以上の場合
は、値の大きい量子化テーブルを用いて圧縮率を大きく
する。０.１未満の場合は、値の小さい量子化テーブル
を用いて圧縮率を小さくする。As an example, when p / (m + p) is 0.1 or more, the compression rate is increased by using a quantization table having a large value. When it is less than 0.1, the compression rate is reduced by using a quantization table having a small value.

【００２９】後者の場合、m＞＞pであり、画像内の大部
分が文字領域であり、かつ画像B上で該当領域は文字塗
りつぶしにより平坦になっているので、量子化テーブル
の如何にかかわらず高い圧縮率が得られる。一方残る文
字以外の領域、おそらく写真や絵の領域は、値の小さい
量子化テーブルのおかげで情報量の欠落が少なくなり画
質が向上する。一方圧縮率も低下するが、該当部分の総
面積は小さいので、画像全体での圧縮コードサイズが著
しく増大することは避けられる。In the latter case, m >> p, most of the image area is a character area, and the area on image B is flattened by character filling. High compression ratio can be obtained. On the other hand, in the area other than the remaining characters, perhaps in the area of a picture or a picture, the loss of information amount is reduced and the image quality is improved due to the quantization table having a small value. On the other hand, although the compression rate also decreases, the total area of the corresponding portion is small, so that the compression code size of the entire image can be prevented from significantly increasing.

【００３０】なお、文字、非文字領域と量子化テーブル
設定の関係は上例の限りではないことは言うまでもな
く、p/(m+p)をn段階（n＞２）に分け、それぞれに量子
化テーブルを設定してもよい。p/(m+p)ではなくp/mの
値、あるいはｐ単独の面積から量子化テーブルを決定し
てもよい。Needless to say, the relationship between the character and non-character areas and the quantization table setting is not limited to the above example, p / (m + p) is divided into n stages (n> 2), and each is quantized. The conversion table may be set. The quantization table may be determined from the value of p / m instead of p / (m + p) or the area of p alone.

【００３１】さらに、圧縮率の大小は最終的な圧縮コー
ド総量の目標値まで鑑みて決定してもよい。たとえば、
非文字領域の面積と圧縮後のコードのサイズと関係に対
し、複数の量子化テーブルに対してあらかじめ実験でも
とめておき、実際の処理時には、先に得られた文字部分
の二値画像圧縮コードＤのサイズと目標総サイズとの差
より、最適な量子化テーブルを選択するようにすればよ
い。Further, the magnitude of the compression rate may be determined in consideration of the final target value of the total compression code amount. For example,
For the relationship between the area of the non-character area and the size of the code after compression, we determined in advance the experiment for multiple quantization tables, and during the actual processing, the binary image compression code of the character part obtained earlier The optimum quantization table may be selected based on the difference between the size of D and the target total size.

【００３２】最後に、文字領域座標（１０９）、文字部
代表色（１１０）、圧縮コードＣ（１１１）、圧縮コー
ドＤ（１１２）の４つをまとめたフォーマットを必要な
らば作成する。Finally, if necessary, a format in which the four character region coordinates (109), the character part representative color (110), the compression code C (111), and the compression code D (112) are put together is created.

【００３３】このフォーマットは、ＰＤＦであってもよ
いし、ＸＭＬであてってもよい。The format may be PDF or XML.

【００３４】図２に伸長時の構成図を示す。FIG. 2 shows a block diagram at the time of expansion.

【００３５】２０１は圧縮コードＣを入力し、ＪＰＥＧ
伸長処理を行い多値画像Ｅを作成するＪＰＥＧ伸長部で
ある。２０２は圧縮コードを入力し二値画像Ｆを作成す
るＭＭＲ伸長部である。２０３は多値画像Ｅを入力し、
拡大して画像Ｇを作成する拡大部である。２０４は文字
領域の座標と代表色を入力し、上記二値画像Ｆを参照し
ながら白部分は画像Ｇを、黒部分は代表色を選択し、最
終的な画像である画像Ｈ２０７を作成する画像合体部で
ある。A compressed code C is input to 201, and JPEG is input.
This is a JPEG decompression unit that performs decompression processing and creates a multi-valued image E. Reference numeral 202 denotes an MMR decompression unit that inputs a compression code and creates a binary image F. 203 inputs the multi-valued image E,
An enlargement unit that enlarges and creates an image G. Reference numeral 204 is an image for inputting coordinates of a character area and a representative color, selecting the image G for the white portion and the representative color for the black portion while referring to the binary image F, and creating an image H207 as a final image. It is a united part.

【００３６】図１４に合体処理２０４の結果例を示す。
まず図１４（ａ）に圧縮コードＣのＪＰＥＧ伸長結果を
示す。ＪＰＥＧ圧縮の量子化非可逆方式を利用すると図
１０（ｃ）とは微妙に画素値が異なるデータとなってい
る。しかし、文字部を抜く前の原画像をＪＰＥＧ非可逆
圧縮方式で圧縮する場合と比較して、同じ量子化テーブ
ルを利用した場合において画素値の変化は少ない。二値
画像（ｂ）を参照して黒画素の対応するところの画像
（ａ）上に代表色（２０，３０，２５５）データをの
せ、最終的に（ｃ）のような画像ができ上がる。これが
伸長画像２０７となる。FIG. 14 shows an example of the result of the merge processing 204.
First, FIG. 14A shows a JPEG decompression result of the compressed code C. When the quantized lossy method of JPEG compression is used, the pixel values are slightly different from those in FIG. 10C. However, compared to the case where the original image before the character part is removed is compressed by the JPEG lossy compression method, the pixel value changes less when the same quantization table is used. By referring to the binary image (b), representative color (20, 30, 255) data is placed on the image (a) corresponding to the black pixel, and finally an image as shown in (c) is completed. This becomes the expanded image 207.

【００３７】以上、本発明によれば、画質を保ちながら
効率良く画像を圧縮する画像処理装置が実現できる。As described above, according to the present invention, it is possible to realize an image processing apparatus which efficiently compresses an image while maintaining the image quality.

【００３８】［その他の実施の形態］第一の実施の形態
では、量子化テーブルを選択する際に、主に圧縮前画像
の非文字領域の面積に注目していたが、非文字画素の形
状に注目して量子化テーブルの選択をおこなってもよ
い。[Other Embodiments] In the first embodiment, when the quantization table is selected, the focus is mainly on the area of the non-character region of the pre-compression image. The quantization table may be selected by paying attention to.

【００３９】例えば、表組みを含む文書画像において、
非文字の画素中から表罫線とみなされる画素群が抽出さ
れた場合、画像Bに施す圧縮に小さい値の量子化テーブ
ルを用いるようにする。For example, in a document image including a table,
When a pixel group regarded as a table ruled line is extracted from non-character pixels, a quantization table having a small value is used for compression applied to the image B.

【００４０】その結果、線部分の周辺に発生するノイズ
が少なくなり画質は向上する。一方で、圧縮率は低下す
るが、罫線の存在する部分は表全体の面積に比して疎な
ので、全体の圧縮コードサイズが著しく増大することは
ない。As a result, the noise generated around the line portion is reduced and the image quality is improved. On the other hand, although the compression rate decreases, the part where the ruled line exists is sparse compared to the area of the entire table, so the overall compressed code size does not increase significantly.

【００４１】なお、上記中の表罫線画素か否かの判定に
おいては、既知の領域分割アルゴリズムを用いるものと
して、ここでの説明は省略する。It is to be noted that a known area division algorithm is used for determining whether the pixel is a table ruled line pixel in the above description, and a description thereof will be omitted here.

【００４２】この場合でも、画質を保ちながら効率良く
画像を圧縮する画像処理装置が実現できる。Even in this case, it is possible to realize an image processing apparatus which efficiently compresses an image while maintaining the image quality.

【００４３】（他の実施形態）以上、本発明の実施形態
について詳述したが、本発明は、複数の機器から構成さ
れるシステムに適用しても良いし、また、一つの機器か
らなる装置に適用しても良い。(Other Embodiments) Although the embodiments of the present invention have been described in detail above, the present invention may be applied to a system including a plurality of devices, and an apparatus including one device. May be applied to.

【００４４】なお、本発明は、前述した実施形態の機能
を実現するソフトウェアのプログラムを、システム或い
は装置に直接或いは遠隔から供給し、そのシステム或い
は装置のコンピュータが該供給されたプログラムコード
を読み出して実行することによっても達成される場合を
含む。その場合、プログラムの機能を有していれば、形
態は、プログラムである必要はない。The present invention supplies a software program for realizing the functions of the above-described embodiments to a system or apparatus directly or remotely, and a computer of the system or apparatus reads the supplied program code. Including cases that can be achieved by executing. In that case, the form need not be a program as long as it has the functions of the program.

【００４５】従って、本発明の機能処理をコンピュータ
で実現するために、該コンピュータにインストールされ
るプログラムコード自体も本発明を実現するものであ
る。つまり、本発明のクレームでは、本発明の機能処理
を実現するためのコンピュータプログラム自体も含まれ
る。Therefore, the program code itself installed in the computer to implement the functional processing of the present invention by the computer also implements the present invention. That is, the claims of the present invention include the computer program itself for realizing the functional processing of the present invention.

【００４６】その場合、プログラムの機能を有していれ
ば、オブジェクトコード、インタプリタにより実行され
るプログラム、ＯＳに供給するスクリプトデータ等、プ
ログラムの形態を問わない。In this case, the program may take any form such as an object code, a program executed by an interpreter, or script data supplied to an OS as long as it has the function of the program.

【００４７】プログラムを供給するための記録媒体とし
ては、例えば、フロッピー（登録商標）ディスク、ハー
ドディスク、光ディスク、光磁気ディスク、ＭＯ、ＣＤ
−ＲＯＭ、ＣＤ−Ｒ、ＣＤ−ＲＷ、磁気テープ、不揮発
性のメモリカード、ＲＯＭ、ＤＶＤ（ＤＶＤ−ＲＯＭ，
ＤＶＤ−Ｒ）などがある。As a recording medium for supplying the program, for example, a floppy (registered trademark) disk, hard disk, optical disk, magneto-optical disk, MO, CD
-ROM, CD-R, CD-RW, magnetic tape, non-volatile memory card, ROM, DVD (DVD-ROM,
DVD-R).

【００４８】その他、プログラムの供給方法としては、
クライアントコンピュータのブラウザを用いてインター
ネットのホームページに接続し、該ホームページから本
発明のコンピュータプログラムそのもの、もしくは圧縮
され自動インストール機能を含むファイルをハードディ
スク等の記録媒体にダウンロードすることによっても供
給できる。また、本発明のプログラムを構成するプログ
ラムコードを複数のファイルに分割し、それぞれのファ
イルを異なるホームページからダウンロードすることに
よっても実現可能である。つまり、本発明の機能処理を
コンピュータで実現するためのプログラムファイルを複
数のユーザに対してダウンロードさせるＷＷＷサーバ
も、本発明のクレームに含まれるものである。In addition, as a program supply method,
It can also be supplied by connecting to a homepage on the Internet using a browser of a client computer, and downloading the computer program itself of the present invention or a compressed file having an automatic installation function from the homepage to a recording medium such as a hard disk. It can also be realized by dividing the program code constituting the program of the present invention into a plurality of files and downloading each file from different homepages. That is, a WWW server that allows a plurality of users to download a program file for implementing the functional processing of the present invention on a computer is also included in the claims of the present invention.

【００４９】また、本発明のプログラムを暗号化してＣ
Ｄ−ＲＯＭ等の記憶媒体に格納してユーザに配布し、所
定の条件をクリアしたユーザに対し、インターネットを
介してホームページから暗号化を解く鍵情報をダウンロ
ードさせ、その鍵情報を使用することにより暗号化され
たプログラムを実行してコンピュータにインストールさ
せて実現することも可能である。The program of the present invention is encrypted to C
By storing the information in a storage medium such as a D-ROM and distributing it to the user, and having the user who satisfies the predetermined conditions download the key information for decrypting the encryption from the home page via the Internet, and by using the key information It is also possible to execute the encrypted program and install the program in a computer to realize it.

【００５０】また、コンピュータが、読み出したプログ
ラムを実行することによって、前述した実施形態の機能
が実現される他、そのプログラムの指示に基づき、コン
ピュータ上で稼動しているＯＳなどが、実際の処理の一
部または全部を行ない、その処理によっても前述した実
施形態の機能が実現され得る。Further, the computer executes the read program to realize the functions of the above-described embodiments, and the OS and the like running on the computer execute the actual processing based on the instructions of the program. The function of the above-described embodiment can be realized also by performing a part or all of the above.

【００５１】さらに、記録媒体から読み出されたプログ
ラムが、コンピュータに挿入された機能拡張ボードやコ
ンピュータに接続された機能拡張ユニットに備わるメモ
リに書き込まれた後、そのプログラムの指示に基づき、
その機能拡張ボードや機能拡張ユニットに備わるＣＰＵ
などが実際の処理の一部または全部を行ない、その処理
によっても前述した実施形態の機能が実現される。Furthermore, after the program read from the recording medium is written in the memory provided in the function expansion board inserted into the computer or the function expansion unit connected to the computer, based on the instructions of the program,
CPU provided on the function expansion board or function expansion unit
Performs a part or all of the actual processing, and the processing realizes the functions of the above-described embodiments.

【００５２】[0052]

【発明の効果】本発明によれば、カラー文書画像に対し
て領域分割をおこない、文字領域部は二値のＭＭＲ圧縮
コード、文字を抜いた下地部分はＪＰＥＧ圧縮コードと
した上で、両者および領域位置情報、文字色情報を合成
したデータにすることで画質を高く保ちながら高い圧縮
率を得る画像圧縮処理において、ＪＰＥＧ圧縮コード生
成時の量子化テーブルを、元文書画像の文字・非文字領
域の面積に応じて変化させることで、より柔軟に画質・
圧縮率の両立が可能な画像処理装置を実現することがで
きる。According to the present invention, a color document image is divided into areas, a character area portion is a binary MMR compression code, and a background portion without characters is a JPEG compression code. In the image compression processing that obtains a high compression rate while maintaining high image quality by combining the area position information and the character color information, the quantization table at the time of JPEG compression code generation is set to the character / non-character area of the original document image. More flexible image quality by changing according to the area of
It is possible to realize an image processing device that can achieve both compression rates.

[Brief description of drawings]

【図１】発明の第１の実施の形態に係る圧縮装置のブロ
ック図である。FIG. 1 is a block diagram of a compression device according to a first embodiment of the invention.

【図２】発明の第１の実施の形態に係る伸長装置のブロ
ック図である。FIG. 2 is a block diagram of a decompression device according to the first embodiment of the invention.

【図３】発明の第１の実施の形態の文字領域検出部の処
理を示すフローチャートである。FIG. 3 is a flowchart showing processing of a character area detection unit according to the first embodiment of the invention.

【図４】発明の第１の実施の形態の文字領域検出処理を
説明するための原画像の例を示す図である。FIG. 4 is a diagram showing an example of an original image for explaining a character area detection process according to the first embodiment of the invention.

【図５】発明の第１の実施の形態の文字領域検出処理を
説明するためのヒストグラムを示す図である。FIG. 5 is a diagram showing a histogram for explaining a character area detection process according to the first embodiment of the invention.

【図６】発明の第１の実施の形態の文字領域検出処理を
説明するための二値画像の例を示す図である。FIG. 6 is a diagram showing an example of a binary image for explaining a character area detection process according to the first embodiment of the invention.

【図７】発明の第１の実施の形態の文字領域検出処理を
説明するための文字領域画像の例を示す図である。FIG. 7 is a diagram showing an example of a character area image for explaining the character area detection processing according to the first embodiment of the invention.

【図８】発明の第１の実施の形態の文字領域検出処理を
説明するための文字領域部分の例を示す図である。FIG. 8 is a diagram illustrating an example of a character area portion for explaining a character area detection process according to the first embodiment of the invention.

【図９】発明の第１の実施の形態の二値化部１０９に入
力される文字領域のヒストグラム結果の例を示す図であ
る。FIG. 9 is a diagram showing an example of a histogram result of a character area input to the binarization unit 109 according to the first embodiment of this invention.

【図１０】発明の第１の実施の形態の文字部塗りつぶし
処理を説明するための図である。FIG. 10 is a diagram illustrating a character part filling process according to the first embodiment of this invention.

【図１１】発明の第１の実施の形態の文字部塗りつぶし
処理を説明するためのフローチャートである。FIG. 11 is a flowchart illustrating a character part filling process according to the first embodiment of this invention.

【図１２】発明の第１の実施の形態に係る色算出部の処
理を説明するためのフローチャートである。FIG. 12 is a flowchart for explaining processing of a color calculation unit according to the first embodiment of the invention.

【図１３】発明の第１の実施の形態に係るＪＰＥＧ圧縮
処理部の量子化テーブルの例を示す図である。FIG. 13 is a diagram showing an example of a quantization table of a JPEG compression processing unit according to the first embodiment of the invention.

【図１４】発明の第１の実施の形態に係る合体処理の結
果例を示す図である。FIG. 14 is a diagram showing an example of a result of a combination process according to the first embodiment of the invention.

───────────────────────────────────────────────────── フロントページの続き (51)Int.Cl.⁷ 識別記号ＦＩテーマコート゛(参考）Ｈ０４Ｎ 1/40 Ｈ０４Ｎ 1/40 ＦＦターム(参考） 5B057 CA01 CA08 CA16 CB01 CB08 CB16 CD05 CE12 CE16 CG01 CH07 DA08 DB06 DB09 DC23 DC25 5C076 AA21 AA22 BA06 5C077 MP08 PP27 PP28 PP32 PQ19 PQ23 RR02 RR21 5C078 BA27 BA57 CA02 CA21 CA27 DA02 5L096 AA02 FA06 FA37 FA44 GA34─────────────────────────────────────────────────── ─── Continued Front Page (51) Int.Cl. ⁷ Identification Code FI Theme Coat (Reference) H04N 1/40 H04N 1/40 FF Term (Reference) 5B057 CA01 CA08 CA16 CB01 CB08 CB16 CD05 CE12 CE16 CG01 CH07 DA08 DB06 DB09 DC23 DC25 5C076 AA21 AA22 BA06 5C077 MP08 PP27 PP28 PP32 PQ19 PQ23 RR02 RR21 5C078 BA27 BA57 CA02 CA21 CA27 DA02 5L096 AA02 FA06 FA37 FA44 GA34

Claims

[Claims]

1. A character area is extracted from multi-valued image data,
Extraction means for generating position data of the character area, binarization means for binarizing the multi-valued image data of the character area to generate character area binary image data, and calculation of a representative color of the character area Then, the color calculating means for generating the color data of the character and the converting means for converting the multivalued image data of the character area by using the multivalued image data of the area other than the character to generate the multivalued image data without character. A resolution conversion means for reducing the resolution of the characterless multi-valued image data to generate reduced characterless multi-valued image data; a first compression means for compressing the reduced characterless multi-valued image data; An image processing apparatus comprising: a second compression unit that compresses the value image data.

2. The image processing apparatus according to claim 1, wherein the first compression unit complies with JPEG compression.

3. The image processing apparatus according to claim 1, wherein the second compression unit complies with MMR compression.

4. The image processing apparatus according to claim 1, wherein in the JPEG compression in the first compression means, a quantization table for performing quantization is variable.

5. The image processing apparatus according to claim 1, wherein the quantization table is selected according to the areas of the character area and the non-character area obtained by the character area extracting unit.

6. The quantization table is selected from the areas of the character area and the non-character area and the target compression size obtained by the character area extracting means.
The image processing device according to item 1.

7. The image processing apparatus according to claim 1, wherein the quantization table is selected according to the shape of the non-character area obtained by the character area extracting means.

8. An image processing apparatus for decompressing an image compressed by the image processing apparatus according to claim 1, wherein the image is compressed by the first compression unit. A first decompression unit for decompressing the reduced character-less multivalued image data, a second decompression unit for decompressing the character region binary image data compressed by the second compression unit, and a reduced characterless multivalued image data A resolution conversion unit for increasing the resolution to generate the characterless multi-valued image data, the position data and the color data are input, and the multi-valued image is obtained from the character area binary image data and the characterless multi-valued image data. An image processing device, comprising: an image merging unit that generates data.