JP2010067157A

JP2010067157A - Image processor

Info

Publication number: JP2010067157A
Application number: JP2008234870A
Authority: JP
Inventors: Tamon Sadasue; 多聞貞末
Original assignee: Ricoh Co Ltd
Current assignee: Ricoh Co Ltd
Priority date: 2008-09-12
Filing date: 2008-09-12
Publication date: 2010-03-25
Anticipated expiration: 2028-09-12
Also published as: JP5019477B2

Abstract

<P>PROBLEM TO BE SOLVED: To make inconspicuous image misalignment and fluctuation when displaying a decoded enlarged image instead of a transitional zooming display operation. <P>SOLUTION: When an enlargement operation is started by a user, a transitional zooming display operation is performed in which an interpolating and enlarging part 106 interpolates and enlarges a decoded image displayed in a display part 110 at that time to display the decoded image while the enlargement operation is continued. The interpolating and enlarging part 106 can select a mode A for arranging a reference pixel between dots of a pixel of an interpolated and enlarged image or a mode B for arranging the reference pixel on a dot of the pixel of the interpolated and enlarged image. A mode selecting part 116 selects either mode A or B in accordance with an encoding method of a code or the like. When the enlargement operation ends, a decoding part 102 decodes the code to be enlarged to decode the enlarged image, and displays the image to be overwritten on the interpolated and enlarged image. <P>COPYRIGHT: (C)2010,JPO&INPIT

Description

本発明は、圧縮された符号として与えられた画像を復号して表示する画像処理装置に係り、特に、表示画像の拡大操作が可能な画像処理装置に関する。 The present invention relates to an image processing apparatus that decodes and displays an image given as a compressed code, and more particularly to an image processing apparatus that can perform an enlargement operation of a display image.

画像データベース内の画像を閲覧する場合、ユーザが目的とする画像を検索する必要がある。画像を検索する手法としては、検索対象画像の画像名（ファイル名）が既知であれば、画像名の一覧から検索する場合もあるが、多くの場合はサムネイルの一覧表示が使用される。例えば、文書画像を検索する場合、キーワード検索した後にキーワードにヒットした候補画像をサムネイル一覧表示して、最終的に検索対象画像を選択する方法や、最初からサムネイル一覧表示のみを使用して検索対象画像を探し出したりする方法が用いられている。このようなサムネイル一覧表示では、解像度を落とした縮小画像の状態なので画像の詳細な内容を把握する事が難しいため、画像を拡大表示させる必要がある。 When browsing an image in the image database, the user needs to search for a target image. As a method of searching for an image, if the image name (file name) of the search target image is known, the image may be searched from a list of image names. In many cases, a list display of thumbnails is used. For example, when searching for a document image, a candidate image that has been hit by a keyword after a keyword search is displayed in a thumbnail list, and finally a search target image is selected. A method of searching for an image is used. In such a thumbnail list display, it is difficult to grasp the detailed contents of the image because it is a reduced image state with a reduced resolution, so it is necessary to display the image in an enlarged manner.

このように画像の拡大表示が必要とされる場合は少なくない。しかし、通常、画像は圧縮された符号として記憶・伝送されるため、低解像度から段階的に高い解像度へと拡大操作を行った場合に、ある解像度の画像を得るための符号の復号処理を行い、次に、より高い解像度の画像を得るための符号の復号処理を行い、というように連続して何度も復号処理を行うのでは処理に時間がかかり、高速なズーミングが難しい。そこで、拡大操作の途中では低解像度の復号画像を画像補間により拡大した画像を表示する「過渡的なズーミング表示動作」を行い、拡大操作が終了した時点で、最終的な高解像度の画像表示に必要な符号の復号処理を行い、復号した高解像度画像で表示を更新するという手法が用いられる（例えば特許文献１を参照）。 There are many cases where enlargement display of an image is required in this way. However, since an image is usually stored and transmitted as a compressed code, a code decoding process is performed to obtain an image of a certain resolution when an enlargement operation is performed from a low resolution to a high resolution in stages. Next, if a code decoding process for obtaining an image with a higher resolution is performed, and the decoding process is performed many times in succession, the process takes time, and high-speed zooming is difficult. Therefore, during the enlargement operation, a “transient zooming display operation” that displays an image obtained by enlarging the low-resolution decoded image by image interpolation is performed, and when the enlargement operation is completed, the final high-resolution image display is performed. A technique of performing necessary code decoding processing and updating the display with the decoded high-resolution image is used (see, for example, Patent Document 1).

特開平１０−１９８３３７号公報JP-A-10-198337

上に述べた「過渡的なズーミング表示動作」による補間拡大画像から、高解像度の復号画像の表示へと切り替えた瞬間に、画像が全体的にずれる、もしくは揺れる現象が起きることがある。本発明の目的は、そのような画像の「ずれ」もしくは「揺れ」の発生を抑えることにある。 There may be a phenomenon in which the entire image shifts or shakes at the moment of switching from the interpolated enlarged image by the “transient zooming display operation” described above to the display of a decoded image of high resolution. An object of the present invention is to suppress the occurrence of such “shift” or “sway” of an image.

まず、従来、過渡的ズーミング表示動作で表示された補間拡大画像から高解像度復号画像に置き換わった瞬間に画像がずれたり、揺れたりするように見える現象の発生メカニズムについて簡単に説明する。 First, a mechanism of occurrence of a phenomenon that appears to be shifted or shaken at the moment when an interpolation enlarged image displayed by a transient zooming display operation is replaced with a high-resolution decoded image will be briefly described.

例えばＪＰＥＧ２０００のロスレス圧縮モードは、５／３フィルタによる可逆な離散ウェーブレット変換（ＤＷＴ：Discrete Wavelet Transform）を用いており、垂直方向、水平方向にそれぞれ１次元可逆離散ウェーブレット変換がかけられる。通常は離散ウェーブレット変換後のダウンサンプリングを考慮して計算を省いたリフティング演算が用いられる、このリフティング演算は以下のような式で与えられる。 For example, the lossless compression mode of JPEG2000 uses a reversible discrete wavelet transform (DWT) using a 5/3 filter, and a one-dimensional reversible discrete wavelet transform is applied in the vertical direction and the horizontal direction, respectively. Usually, a lifting operation that omits the calculation in consideration of the downsampling after the discrete wavelet transform is used. This lifting operation is given by the following equation.

垂直、水平方向にリフティング演算を行った後に、図４のようにデインターリーブされサブバンドに分けられる。図４において、ＬＬは垂直水平ともに低周波の成分、ＨＬは垂直方向の低周波成分から水平方向の高周波成分を抽出した信号、ＬＨは垂直方向の高周波成分から水平方向の低周波成分を抽出した信号、ＨＨは垂直水平ともに高周波成分を抽出した信号である。 After performing the lifting operation in the vertical and horizontal directions, it is deinterleaved and divided into subbands as shown in FIG. In FIG. 4, LL is a low frequency component in both the vertical and horizontal directions, HL is a signal obtained by extracting a horizontal high frequency component from a vertical low frequency component, and LH is a horizontal low frequency component extracted from a vertical high frequency component. A signal, HH, is a signal obtained by extracting high-frequency components in both vertical and horizontal directions.

低解像度の画像を生成するときはＬＬの成分のみを使う。ズーミング中に低解像度の画素値を使って補間拡大するということは、このＬＬ成分の値を画素値として画像補間することに他ならない。 When generating a low-resolution image, only the LL component is used. Interpolating and enlarging using low-resolution pixel values during zooming is nothing other than image interpolation using the LL component values as pixel values.

ここで簡単のために信号を１次元に限定して考える。図５に示すような1次元のオリジナルデータに５／３可逆ＤＷＴを施し低周波成分を抜き出すと、図６のようになる。抜き出された低周波成分がズーミング前の画素値であり、この画素値を使って２倍に補間拡大するものとする。 Here, for simplicity, the signal is limited to one dimension. When 5/3 reversible DWT is applied to the one-dimensional original data as shown in FIG. 5 to extract low frequency components, the result is as shown in FIG. The extracted low frequency component is a pixel value before zooming, and the pixel value is used to perform interpolation enlargement twice.

画像補間方法として、ニアレストネイバーを用いた場合のオリジナルデータとの比較を図７に示す。ニアレストネイバーでは、オリジナルデータの偶数番目の画素と同じ値をその隣の奇数番目の画素にも使うので、そもそも画質が汚い印象を与えるうえに、高解像度の画像を重ねたときに奇数番目の画素値が本来の値に修正される形になるため、左にずれるように見える。１次元データで左にずれて見える現象は、２次元の画像では左上にずれて見えることになる。 FIG. 7 shows a comparison with original data in the case of using a nearest neighbor as an image interpolation method. Nearest neighbor uses the same value as the even-numbered pixel of the original data for the odd-numbered pixel next to it, so in addition to giving the impression that the image quality is dirty, the odd-numbered pixel when overlapping high-resolution images in the first place Since the pixel value is corrected to the original value, it appears to shift to the left. A phenomenon that appears to be shifted to the left in the one-dimensional data appears to be shifted to the upper left in the two-dimensional image.

５／３可逆ＤＷＴを２回適用し、抜き出された低周波成分をニアレストネイバー法で補間拡大したデータとオリジナルデータを比較すると図８のようになり、さらにオリジナルデータとのズレが大きく見える。 When the 5/3 reversible DWT is applied twice and the extracted low-frequency component is interpolated and expanded by the nearest neighbor method and compared with the original data, the result is as shown in FIG. .

ここまではニアレストネイバーを用いる例について説明したが、複数の参照画素の値を用いた演算により補間画素の値を決定する画像補間法を用いる場合も同様の現象が生じる。そのような画像補間法として広く使われているバイリニア法で補間拡大したデータは、通常、図９のようになり、補間拡大したデータに比べてオリジナルデータ（高解像度の画像）は全体的に左側にずれる。 The example using the nearest neighbor has been described so far, but the same phenomenon occurs when using an image interpolation method that determines the value of an interpolation pixel by calculation using the values of a plurality of reference pixels. The data that has been interpolated and enlarged by the bilinear method widely used as such an image interpolation method is usually as shown in FIG. 9, and the original data (high-resolution image) is generally on the left side compared to the data that has been interpolated and enlarged. Sneak away.

このことについて、さらに説明する。バイリニア法では、補間画素の４近傍の参照画素の値から線形近似によって補間画素値を決める。入力画像ｆ（ｉ，ｊ）を水平、垂直方向にＡ倍する場合、４つの参照画素（左上のものをf00、右上のものをf10、左下のものをf01、右下のものをf11とする）などを次式のように定義する。 This will be further described. In the bilinear method, an interpolation pixel value is determined by linear approximation from the values of reference pixels in the vicinity of 4 of the interpolation pixel. When the input image f (i, j) is multiplied by A in the horizontal and vertical directions, four reference pixels (the upper left one is f00, the upper right one is f10, the lower left one is f01, and the lower right one is f11). ) Etc. are defined as follows:

補間画素の値は次式により計算される。

The value of the interpolated pixel is calculated by the following equation.

ここで、バイリニア法では、参照画素と補間画素の位置が同一の格子を使って示されているのではなく、図１０に示すように、参照画素が、補間拡大画像の画素の格子点の間に配置されるのが通常である（図１０はＡ＝２の例である）。したがって、この場合、例えば補間画素g00の値は次式で計算される。 Here, in the bilinear method, the position of the reference pixel and the interpolation pixel is not shown using the same grid, but as shown in FIG. 10, the reference pixel is located between the grid points of the pixels of the interpolation enlarged image. (FIG. 10 shows an example of A = 2). Therefore, in this case, for example, the value of the interpolation pixel g00 is calculated by the following equation.

このような参照画素の配置は、参照画素をオリジナルデータ（高解像度の画像）の２×２画素の中央に仮想的に存在する代表画素のように扱っていることを意味する。しかるに、５／３可逆ＤＷＴで得られる低周波成分は、ダウンサンプリングされる画素を中心とした画素から計算されており、いわば偶数番目の画素値に近い値が低周波成分となるため、５／３可逆ＤＷＴで得られる低周波成分を用いて、図１０のような補間画像の格子点の間に参照画素を配置するような画像補間を行うと、図９に見られるように、オリジナルデータ（高解像度の画像）と補間拡大画像の間にギャップが生じてしまい、補間拡大画像に高解像度画像を上書き表示した時に、画像が左上にずれたように見えてしまうわけである。 Such an arrangement of the reference pixels means that the reference pixels are handled like representative pixels that exist virtually in the center of 2 × 2 pixels of the original data (high resolution image). However, the low-frequency component obtained by the 5/3 reversible DWT is calculated from the pixels centered on the down-sampled pixel, so that a value close to the even-numbered pixel value becomes the low-frequency component. When image interpolation is performed such that reference pixels are arranged between lattice points of an interpolated image as shown in FIG. 10 using low frequency components obtained by 3 reversible DWT, as shown in FIG. A gap is generated between the high-resolution image) and the interpolated enlarged image, and when the high-resolution image is overwritten and displayed on the interpolated enlarged image, the image appears to be shifted to the upper left.

このような画像の「ずれ」もしくは「揺れ」は、参照画素を図１１のように補間拡大画像の画素の格子点上に配置することで防止することができる（図１１ではＡ＝２としている）。このように参照画素を配置する場合、補間画素g00，g10，g01，g11の算出式は次のように表され、偶数番目の補間画素値として参照画素の値がそのまま使われ、奇数番目の補間画素の値は両隣の参照画素の値を用いて決定される。 Such “shift” or “shake” of the image can be prevented by arranging the reference pixel on the grid point of the pixel of the interpolation enlarged image as shown in FIG. 11 (A = 2 in FIG. 11). ). When the reference pixels are arranged in this way, the calculation formulas for the interpolation pixels g00, g10, g01, and g11 are expressed as follows, and the value of the reference pixel is used as the even-numbered interpolation pixel value, and the odd-numbered interpolation is performed. The pixel value is determined using the values of the reference pixels on both sides.

このように、補間拡大画像の画素の格子点上に参照画素を配置してバイリニア法により補間拡大を施した結果は、図１２のようになり、オリジナルデータとほぼ同じ変化をしている。したがって、補間拡大画像に高解像度画像を上書きした瞬間のずれ、もしくは揺れは目立たない。 As described above, the result of performing the interpolation enlargement by the bilinear method by arranging the reference pixel on the lattice point of the pixel of the interpolation enlarged image is as shown in FIG. 12, and changes almost the same as the original data. Therefore, the momentary shift or shake of overwriting the high-resolution image on the interpolated enlarged image is inconspicuous.

なお、ＪＰＥＧ２０００では、周波数変換として５／３可逆ＤＷＴと９／７非可逆ＤＷＴを使用できるが、そのいずれを選択するとしても、高い解像度の画像の奇数個の画素値から低い解像度の画像の画素値を生成する変換である点で共通している。 In JPEG2000, 5/3 reversible DWT and 9/7 irreversible DWT can be used as frequency conversion. Regardless of which one is selected, pixels of a low resolution image are obtained from an odd number of pixel values of a high resolution image. This is common in that it is a transformation that generates a value.

次に、高い解像度の画像の偶数個の画素値から低い解像度の画像の画素値を生成する変換が用いられる符号化方式の画像符号の場合について説明する。このような符号化方式の例として、現在標準化が進められているＪＰＥＧ−ＸＲがある。 Next, a description will be given of a case of an image code of an encoding method in which conversion for generating a pixel value of a low resolution image from an even number of pixel values of a high resolution image is used. An example of such an encoding method is JPEG-XR, which is currently being standardized.

ＪＰＥＧ−ＸＲによる画像符号の場合、１つの符号から、最高解像度及び１／４のスケールの画像を得られる符号と、１／１６のスケールの画像を得られる符号とを抜き出すことができる。ＪＰＥＧ−ＸＲの画像符号の場合、図１３に示すような周波数階層となっており、４画素×４画素のブロック単位で直交変換を行った後の係数を、４×４ブロック（マクロブロック）単位で集めたものを最高解像度の係数HPとし、マクロブロックの各ブロックの係数のDC成分を集めたものを１段階低い解像度の係数LPとし、LPに対して4×4の単位で直交変換した結果得られるDC成分を最も低い解像度の係数DCとして扱う。 In the case of an image code based on JPEG-XR, a code that can obtain an image with the highest resolution and a 1/4 scale and a code that can obtain an image with a 1/16 scale can be extracted from one code. In the case of a JPEG-XR image code, the frequency hierarchy is as shown in FIG. 13, and the coefficient after orthogonal transformation is performed in units of 4 × 4 pixel blocks is expressed in units of 4 × 4 blocks (macro blocks). The result of the orthogonal transformation in units of 4x4 with respect to LP as the coefficient HP of the highest resolution and the coefficient DC of the coefficient of each block of the macroblock as the coefficient LP of one step lower resolution. Treat the resulting DC component as the lowest resolution coefficient DC.

ここで、LPは４画素×４画素の各ブロックを平坦化したようなものを表す係数であるので、LPのみを使って復号した低い解像度の画像をバイリニア法で補間拡大した画像に、HP係数を全て使って復号した高解像度の画像を上書きした時に画像のずれを生じさせないためには、ＪＰＥＧ２０００の場合とは逆に、図１０に示すように、補間拡大画像の画素の格子点の間に参照画素を配置して補間拡大を行えばよい。
同様の符号の別の例としてＦｌａｓｈＰｉｘ符号がある。 Here, LP is a coefficient representing a flattened block of 4 pixels × 4 pixels. Therefore, an HP coefficient is added to an image obtained by interpolating and enlarging a low resolution image decoded using only LP using the bilinear method. In order to prevent the image from being shifted when the high-resolution image decoded using all of the above is overwritten, contrary to the case of JPEG2000, as shown in FIG. Interpolation enlargement may be performed by arranging reference pixels.
Another example of a similar code is the FlashPix code.

以上では、圧縮された画像が解像度スケーラブルの符号として与えられる場合を想定して説明した。しかし、高い解像度の画像の奇数個の画素値から低い解像度の画像の画素値を生成する解像度変換により得られた複数の異解像度画像それぞれの独立した符号の集まりとして与えられる圧縮画像は、ＪＰＥＧ２０００符号などと同様に、補間拡大画像の画素の格子点上に参照画素を配置して補間拡大を行えば、補間拡大画像を高解像度の復号画像で上書き表示した際の画像のずれが目立たなくなる。一方、高い解像度の画像の偶数個の画素値から低い解像度の画像の画素値を生成する解像度変換により得られた複数の異解像度画像それぞれの独立した符号の集まりとして与えられる圧縮画像は、ＪＰＥＧ−ＸＲ符号などと同様に扱うことができ、補間拡大画像の画素の格子点の間に参照画素を配置して補間拡大を行うとよい。 In the above description, it is assumed that a compressed image is given as a resolution scalable code. However, a compressed image given as a collection of independent codes of a plurality of different resolution images obtained by resolution conversion that generates pixel values of a low resolution image from odd pixel values of a high resolution image is a JPEG2000 code. Similar to the above, if reference enlargement is performed by arranging reference pixels on the pixel grid points of the interpolation enlarged image, the image shift when the interpolation enlarged image is overwritten with the high-resolution decoded image becomes inconspicuous. On the other hand, a compressed image given as a collection of independent codes of a plurality of different resolution images obtained by resolution conversion that generates pixel values of a low resolution image from an even number of pixel values of a high resolution image is JPEG- It can be handled in the same way as an XR code or the like, and interpolation enlargement may be performed by arranging reference pixels between lattice points of pixels of the interpolation enlarged image.

以上、画像補間法としてバイリニア法を例にしたが、バイキュービック法などが用いられる場合も同様である。 As described above, the bilinear method is taken as an example of the image interpolation method, but the same applies when the bicubic method or the like is used.

本発明は、以上の考察に基づき、過渡的なズーミング表示動作による補間拡大画像から高解像度の復号画像の表示へと切り替えた瞬間の画像のずれもしくは揺れの発生を抑えた画像処理装置を実現するものであり、その特徴は次の通りである。 Based on the above considerations, the present invention realizes an image processing device that suppresses the occurrence of image shift or shaking at the moment of switching from an interpolated enlarged image by a transient zooming display operation to a display of a high-resolution decoded image. The features are as follows.

請求項１記載の発明は、
拡大操作の対象である符号の復号処理により得られた復号画像を表示している状態でユーザによる拡大操作が始まると、当該拡大操作の継続中、当該復号画像の補間拡大処理を行い、それにより得た補間拡大画像を表示させる過渡的ズーミング表示動作を行い、当該拡大操作が終了すると拡大操作の対象である符号の復号処理を行い、それにより得た拡大後の復号画像を、当該過渡的ズーミング表示動作の最後に表示された補間拡大画像に上書きする形で表示する画像処理装置であって、
前記補間拡大処理は、補間拡大画像の各画素の値を複数の参照画素の値を用いた演算により決定する画像補間法による処理であり、
前記補間拡大処理を行う補間拡大手段は、参照画素と補間拡大画像の画素との位置関係に関連した動作モードとして、補間拡大画像の画素の格子点の間に参照画素を配置するＡモード、又は、補間拡大画像の画素の格子点上に参照画素を配置するＢモードを選択可能であり、
ユーザの指示に従って、前記補間拡大手段の動作モードとして前記Ａモード、Ｂモードのいずれかを選択するモード選択手段を有することを特徴とする画像処理装置である。 The invention described in claim 1
When the enlargement operation by the user is started in a state where the decoded image obtained by the decoding process of the code that is the target of the enlargement operation is displayed, an interpolation enlargement process of the decoded image is performed while the enlargement operation is continued. A transient zooming display operation for displaying the obtained interpolated enlarged image is performed, and when the enlargement operation is completed, the code that is the target of the enlargement operation is decoded, and the obtained decoded image is converted to the transient zooming An image processing apparatus for displaying in an overwritten form on the interpolation enlarged image displayed at the end of the display operation,
The interpolation enlargement process is a process by an image interpolation method that determines the value of each pixel of the interpolation enlarged image by calculation using the values of a plurality of reference pixels.
The interpolation enlargement means for performing the interpolation enlargement process is an A mode in which the reference pixels are arranged between the lattice points of the pixels of the interpolation enlarged image as an operation mode related to the positional relationship between the reference pixels and the pixels of the interpolation enlarged image, or The B mode in which the reference pixel is arranged on the lattice point of the pixel of the interpolation enlarged image can be selected.
An image processing apparatus comprising mode selection means for selecting either the A mode or the B mode as an operation mode of the interpolation enlargement means in accordance with a user instruction.

請求項２記載の発明は、
拡大操作の対象である符号の復号処理により得られた復号画像を表示している状態でユーザによる拡大操作が始まると、当該拡大操作の継続中、当該復号画像の補間拡大処理を行い、それにより得た補間拡大画像を表示させる過渡的ズーミング表示動作を行い、当該拡大操作が終了すると拡大操作の対象である符号の復号処理を行い、それにより得た拡大後の復号画像を、当該過渡的ズーミング表示動作の最後に表示された補間拡大画像に上書きする形で表示する画像処理装置であって、
前記補間拡大処理は、補間拡大画像の各画素の値をその近傍の複数の参照画素の値を用いた演算により決定する画像補間法による処理であり、
前記補間拡大処理を行う補間拡大手段は、参照画素と補間拡大画像の画素との位置関係に関連した動作モードとして、補間拡大画像の画素の格子点の間に参照画素を配置するＡモード、又は、補間拡大画像の画素の格子点上に参照画素を配置するＢモードを選択可能であり、
前記補間拡大手段の動作モードとして前記Ａモード、Ｂモードのいずれかを自動的に選択するモード選択手段を有することを特徴とする画像処理装置である。 The invention according to claim 2
When the enlargement operation by the user is started in a state where the decoded image obtained by the decoding process of the code that is the target of the enlargement operation is displayed, an interpolation enlargement process of the decoded image is performed while the enlargement operation is continued. A transient zooming display operation for displaying the obtained interpolated enlarged image is performed, and when the enlargement operation is completed, the code that is the target of the enlargement operation is decoded, and the obtained decoded image is converted to the transient zooming An image processing apparatus for displaying in an overwritten form on the interpolation enlarged image displayed at the end of the display operation,
The interpolation enlargement process is a process by an image interpolation method that determines the value of each pixel of the interpolation enlarged image by calculation using the values of a plurality of reference pixels in the vicinity thereof,
The interpolation enlargement means for performing the interpolation enlargement process is an A mode in which the reference pixels are arranged between the lattice points of the pixels of the interpolation enlarged image as an operation mode related to the positional relationship between the reference pixels and the pixels of the interpolation enlarged image, or The B mode in which the reference pixel is arranged on the lattice point of the pixel of the interpolation enlarged image can be selected.
An image processing apparatus comprising mode selection means for automatically selecting one of the A mode and B mode as an operation mode of the interpolation enlargement means.

請求項３記載の発明は、請求項２記載の発明に係る画像処理装置において、
前記モード選択手段は、高い解像度の画像の奇数個の画素値から低い解像度の画像の画素値を生成する変換が用いられる符号化方式による解像度スケーラブルの符号が拡大操作の対象である場合には、前記補間拡大手段の動作モードとしてＢモードを選択し、高い解像度の画像の偶数個の画素値から低い解像度の画像の画素値を生成する変換が用いられる符号化方式による解像度スケーラブルの符号が拡大操作の対象である場合には、前記補間拡大手段の動作モードとしてＡモードを選択することを特徴とする。 According to a third aspect of the present invention, in the image processing apparatus according to the second aspect of the present invention,
The mode selection means, when a resolution scalable code by an encoding method using a conversion that generates a pixel value of a low resolution image from an odd number of pixel values of a high resolution image is an object of an enlargement operation, The B mode is selected as the operation mode of the interpolation enlarging means, and the resolution scalable code by the encoding method using the conversion that generates the pixel value of the low resolution image from the even number of pixel values of the high resolution image is enlarged. In the case of A, the A mode is selected as the operation mode of the interpolation enlarging means.

請求項４記載の発明は、請求項２記載の発明に係る画像処理装置において、
前記モード選択手段は、高い解像度の画像の奇数個の画素値から低い解像度の画像の画素値を生成する解像度変換により得られた複数の異解像度画像それぞれの独立した符号が拡大操作の対象である場合には、前記補間拡大手段の動作モードとしてＢモードを選択し、高い解像度の画像の偶数個の画素値から低い解像度の画像の画素値を生成する解像度変換により得られた複数の異解像度画像それぞれの独立した符号が拡大操作の対象である場合には、前記補間拡大手段の動作モードとしてＡモードを選択することを特徴とする。 According to a fourth aspect of the present invention, in the image processing apparatus according to the second aspect of the present invention,
In the mode selection means, an independent code of each of a plurality of different resolution images obtained by resolution conversion that generates pixel values of a low resolution image from an odd number of pixel values of a high resolution image is an object of an enlargement operation. In this case, the B mode is selected as the operation mode of the interpolation enlargement means, and a plurality of different resolution images obtained by resolution conversion for generating pixel values of a low resolution image from an even number of pixel values of a high resolution image. When each independent code is an object of the enlargement operation, the A mode is selected as the operation mode of the interpolation enlargement means.

請求項５記載の発明は、請求項３又は４に記載の発明に係る画像処理装置において、
拡大操作の対象が複数ある場合、それら対象ごとに、前記モード選択手段は前記補間拡大手段の動作モードとしてＡモード又はＢモードのいずれかを自動的に選択することを特徴とする。 According to a fifth aspect of the present invention, in the image processing apparatus according to the third or fourth aspect of the present invention,
When there are a plurality of enlargement operation targets, the mode selection means automatically selects either the A mode or the B mode as the operation mode of the interpolation enlargement means for each target.

本発明によれば、過渡的なズーミング表示動作による補間拡大画像から高解像度の復号画像の表示へと切り替えた瞬間の画像のずれもしくは揺れの発生を抑えることが可能な画像処理装置を実現することができる。 According to the present invention, it is possible to realize an image processing apparatus capable of suppressing the occurrence of image shift or shaking at the moment of switching from an interpolated enlarged image by a transient zooming display operation to a display of a high-resolution decoded image. Can do.

図１は、本発明の一実施形態に係る画像処理装置を説明するためのブロック図である。この画像処理装置は、画像の符号を記憶するための符号記憶部１００、符号記憶部１００に記憶されている符号を読み込んで復号処理を行い復号画像を出力する復号部１０２、この復号画像を一時的に記憶するための復号画像記憶部１０４、復号画像記憶部１０４より復号画像を読み込んで補間拡大処理を行い補間拡大画像を出力する補間拡大部１０６、この補間拡大画像を一時的に記憶するための補間拡大画像記憶部１０８、復号画像記憶部１０４又は補間拡大画像記憶部１０８に記憶されている画像を画面に表示する表示部１１０、復号部１０２や補間拡大部１０６、表示部１１０の制御を行う制御部１１２、ユーザが拡大操作などを行うためのユーザ操作部１１４を有する。表示部１１０の画面には、画像が１つだけ表示されることも複数が同時に表示されることもある。ユーザは、ユーザ操作部１１４による拡大操作により、画面に表示されている１つの画像だけを拡大表示させることも複数の画像を同時に拡大表示させることもできる。 FIG. 1 is a block diagram for explaining an image processing apparatus according to an embodiment of the present invention. The image processing apparatus includes a code storage unit 100 for storing a code of an image, a decoding unit 102 that reads a code stored in the code storage unit 100, performs a decoding process, and outputs a decoded image, and temporarily stores the decoded image. A decoded image storage unit 104 for storing the decoded image, an interpolation expansion unit 106 that reads the decoded image from the decoded image storage unit 104, performs an interpolation enlargement process, and outputs an interpolation enlarged image, and temporarily stores the interpolation enlarged image Control of the display unit 110, the decoding unit 102, the interpolation enlargement unit 106, and the display unit 110 that display the image stored in the interpolation enlarged image storage unit 108, the decoded image storage unit 104, or the interpolation enlarged image storage unit 108 on the screen. The control unit 112 performs a user operation unit 114 for a user to perform an enlargement operation or the like. On the screen of the display unit 110, only one image may be displayed or a plurality of images may be displayed simultaneously. The user can enlarge only one image displayed on the screen or enlarge a plurality of images at the same time by an enlargement operation by the user operation unit 114.

符号記憶部１００に記憶される符号としては、通常、ＪＰＥＧ２０００符号、ＪＰＥＧ−ＸＲ符号、ＦｌａｓｈＰｉｘ符号のような解像度スケーラブルの符号が想定される。ただし、高い解像度の画像に低い解像度へと変換する解像度変換を適用することに得られた複数の異解像度画像それぞれの独立した符号（例えばＪＰＥＧ符号）の集まりであってもよい。 As codes stored in the code storage unit 100, resolution-scalable codes such as JPEG2000 codes, JPEG-XR codes, and FlashPix codes are usually assumed. However, it may be a collection of independent codes (for example, JPEG codes) of a plurality of different resolution images obtained by applying resolution conversion for converting a high resolution image to a low resolution.

補間拡大部１０６は、補間画素値をその近傍の複数の参照画素の値を用いた演算により決定する画像補間法、例えばバイリニア法を利用するものである。そして、補間拡大部１０６は、参照画素と補間拡大画像の画素との位置関係に関連した動作モードとして、図１０に例示したように補間拡大画像の画素の格子点の間に参照画素を配置するＡモードを選択することも、図１１に例示したように補間拡大画像の画素の格子点上に参照画素を配置するＢモードを選択することもできる。 The interpolation enlargement unit 106 uses an image interpolation method, for example, a bilinear method, in which an interpolation pixel value is determined by calculation using values of a plurality of reference pixels in the vicinity thereof. Then, the interpolation enlargement unit 106 arranges the reference pixels between the lattice points of the pixels of the interpolation enlarged image as illustrated in FIG. 10 as the operation mode related to the positional relationship between the reference pixels and the pixels of the interpolation enlarged image. The A mode can be selected, or the B mode in which the reference pixels are arranged on the grid points of the pixels of the interpolation enlarged image as illustrated in FIG. 11 can be selected.

制御部１１２は、補間拡大部１０６をＡ，Ｂモードのいずれで動作させるかのモード選択を行うモード選択部１１６を含んでいる。このモード選択は、ユーザからの指示に基づいて行わせることも、対象とする符号に応じて自動的に行わせることも可能である。 The control unit 112 includes a mode selection unit 116 that selects a mode for operating the interpolation enlargement unit 106 in either the A or B mode. This mode selection can be performed based on an instruction from the user, or can be automatically performed according to a target code.

このような画像処理装置は、例えば、図２に示すようなＣＰＵ２００、メモリ２０２、ディスプレイ２０４、マウス２０６、キーボード２０８、ハードディスク２１０、ネットワーク・インターフェース２１２などから構成される一般的なコンピュータを利用しソフトウェアによって実現される。各記憶部１００，１０４，１０８としてはメモリ２０２の特定の記憶域が利用される。ユーザ操作部１１４としては例えばマウス２０６が利用され、ユーザの拡大操作は例えばマウス２０６のホイールを回転させる操作である。表示される画像の符号は、ハードディスク２１０に蓄積されているものであったり、ネットワーク・インターフェース２１２を通じリモート・サーバ等からダウンロードされたものであったりする。 Such an image processing apparatus uses, for example, a general computer including a CPU 200, a memory 202, a display 204, a mouse 206, a keyboard 208, a hard disk 210, a network interface 212, etc. as shown in FIG. It is realized by. A specific storage area of the memory 202 is used as each storage unit 100, 104, 108. For example, the mouse 206 is used as the user operation unit 114, and the user's enlargement operation is an operation of rotating the wheel of the mouse 206, for example. The displayed image code may be stored in the hard disk 210 or downloaded from a remote server or the like through the network interface 212.

例えば、解像度スケーラブルの符号の復号処理により得られた低解像度の復号画像（復号画像記憶部１０４に記憶されている）が表示部１１０の画面に表示されている状態でユーザの拡大操作が行われる場合の動作について、図３の状態遷移図を参照し、説明する。 For example, the user's enlargement operation is performed in a state where a low-resolution decoded image (stored in the decoded image storage unit 104) obtained by the decoding process of the resolution scalable code is displayed on the screen of the display unit 110. The operation in this case will be described with reference to the state transition diagram of FIG.

拡大操作の対象である符号の復号処理により得られた低解像度の復号画像が表示されている操作待ちの状態Ｓ０で、ユーザがユーザ操作部１１４で拡大操作を開始すると、制御部１１２は拡大操作中の状態Ｓ１に遷移する。この拡大操作とは、ここではマウス２０６のホイールを回転させる操作であるとする。 When the user starts the enlargement operation with the user operation unit 114 in the operation waiting state S0 in which the low-resolution decoded image obtained by the decoding process of the code that is the target of the enlargement operation is displayed, the control unit 112 performs the enlargement operation. Transition to the middle state S1. Here, the enlargement operation is an operation for rotating the wheel of the mouse 206.

状態Ｓ１では、制御部１１２は、補間拡大部１０６に、拡大倍率を指定して復号画像記憶部１０４内の復号画像の補間拡大処理を行わせて補間拡大画像を補間拡大画像記憶部１０８に出力させ、この補間拡大画像を表示部１１０の画面に表示させる。拡大操作が継続中は、拡大倍率を徐々に上げて補間拡大処理と補間拡大画像の表示を繰り返し行わせる。すなわち、過渡的ズーミング表示動作が行われるわけである。 In state S <b> 1, the control unit 112 causes the interpolation enlargement unit 106 to specify the enlargement magnification and perform the interpolation enlargement process of the decoded image in the decoded image storage unit 104, and outputs the interpolation enlarged image to the interpolation enlarged image storage unit 108. The interpolated enlarged image is displayed on the screen of the display unit 110. While the enlargement operation continues, the enlargement magnification is gradually increased to repeatedly perform the interpolation enlargement process and the display of the enlarged interpolation image. That is, a transient zooming display operation is performed.

ユーザの拡大操作が終了すると（具体的にはマウス２０６のホイールからユーザが指を離すと）、制御部１１２は状態Ｓ２に遷移する。制御部１１２は、当該符号について、過渡的ズーミング表示動作の最後で表示された補間拡大画像のサイズに対応した高い解像度の画像（拡大後の画像）を得るための復号処理を復号部１０２に実行させ、復号部１０２より出力され復号画像記憶部１０４に記憶されている高解像度の復号画像を、表示部１１０の画面に、それまで表示されていた補間拡大画像に上書きする形で表示させる。この処理が終わると、操作待ち状態Ｓ０に戻る。 When the user's enlargement operation is completed (specifically, when the user releases his / her finger from the wheel of the mouse 206), the control unit 112 transitions to the state S2. The control unit 112 performs a decoding process on the code to obtain a high-resolution image (enlarged image) corresponding to the size of the interpolated enlarged image displayed at the end of the transient zooming display operation for the code. Then, the high-resolution decoded image output from the decoding unit 102 and stored in the decoded image storage unit 104 is displayed on the screen of the display unit 110 so as to overwrite the interpolated enlarged image displayed so far. When this process ends, the process returns to the operation waiting state S0.

さて、過渡的ズーミング表示動作において補間拡大部１０６がＡモード、Ｂモードのいずれのモードで動作するかはモード選択部１１６で選択される。拡大操作の対象となる符号の符号化方式が予め決まっているような場合、その符号化方式に適した特定のモードをモード選択部１１６で選択させるように、ユーザ側で例えばユーザ操作部１１４の操作を通じて予め指示することができる。 The mode selection unit 116 selects whether the interpolation enlargement unit 106 operates in the A mode or the B mode in the transient zooming display operation. When the encoding method of the code that is the target of the enlargement operation is determined in advance, the user side, for example, the user operation unit 114 may select a specific mode suitable for the encoding method by the mode selection unit 116. An instruction can be given in advance through the operation.

すなわち、高い解像度の画像の奇数個の画素値から低い解像度の画像の画素値を生成する変換が用いられる符号化方式による解像度スケーラブルの符号、例えばＪＰＥＧ２０００符号が拡大操作の対象である場合には（もしくは、同様の解像度変換により得られた複数の異解像度画像それぞれの独立した符号が拡大操作の対象である場合には）、補間拡大画像の画素の格子点上に参照画素を配置するＢモードを選択するように指示すればよい。このようなモード選択を行えば、過渡的ズーミング表示動作で最後に表示された補間拡大画像に高解像度復号画像を上書き表示した時の画像のずれは目立たなくなる。 In other words, when a resolution scalable code such as a JPEG2000 code using an encoding method that uses a conversion that generates a pixel value of a low resolution image from an odd number of pixel values of a high resolution image is the target of an enlargement operation ( Alternatively, in the case where independent codes of a plurality of different resolution images obtained by the same resolution conversion are objects of enlargement operation), the B mode in which the reference pixels are arranged on the lattice points of the pixels of the interpolation enlarged image is selected. It only has to be instructed to select. If such a mode is selected, the image shift when the high-resolution decoded image is overwritten and displayed on the interpolated enlarged image displayed last in the transient zooming display operation becomes inconspicuous.

一方、高い解像度の画像の偶数個の画素値から低い解像度の画像の画素値を生成する変換が用いられる符号化方式による解像度スケーラブルの符号、例えばＪＰＥＧ−ＸＲ符号やＦｌａｓｈＰｉｘ符号が拡大操作の対象である場合には（もしくは、同様の解像度変換により得られた複数の異解像度画像それぞれの独立した符号が拡大操作の対象である場合には）、補間拡大画像の画素の格子点の間に参照画素を配置するＡモードを選択するように指示すればよい。このようなモード選択を行えば、過渡的ズーミング表示動作の最後に表示された補間拡大画像に高解像度復号画像を上書き表示した時の画像のずれ、もしくは揺れは目立たなくなる。 On the other hand, a resolution scalable code such as a JPEG-XR code or a FlashPix code is used as an enlargement operation by an encoding method in which a conversion for generating a pixel value of a low resolution image from an even number of pixel values of a high resolution image is used. In some cases (or in the case where independent codes of a plurality of different resolution images obtained by the same resolution conversion are the targets of the enlargement operation), the reference pixel is between the lattice points of the interpolated enlarged image pixels. May be instructed to select the A mode in which to place the If such a mode selection is performed, the image shift or shake when the high resolution decoded image is overwritten and displayed on the interpolated enlarged image displayed at the end of the transient zooming display operation becomes inconspicuous.

また、ユーザはユーザ操作部１１４の操作を通じて、モード選択部１１６で、拡大操作の対象となる符号に適したモードを自動的に選択させるように指示することができる。この場合、モード選択部１１６では、高い解像度の画像の奇数個の画素値から低い解像度の画像の画素値を生成する変換が用いられる符号化方式による解像度スケーラブルの符号、例えばＪＰＥＧ２０００符号が拡大操作の対象である場合には（もしくは同様の解像度変換により得られた複数の異解像度画像それぞれの独立した符号が拡大操作の対象である場合には）Ｂモードを自動選択し、高い解像度の画像の偶数個の画素値から低い解像度の画像の画素値を生成する変換が用いられる符号化方式による解像度スケーラブルの符号、例えばＪＰＥＧ−ＸＲ符号やＦｌａｓｈＰｉｘ符号が拡大操作の対象である場合には（もしくは、同様の解像度変換により得られた複数の異解像度画像それぞれの独立した符号が拡大操作の対象である場合には）Ａモードを自動選択する。 In addition, the user can instruct the mode selection unit 116 to automatically select a mode suitable for the code to be subjected to the enlargement operation through the operation of the user operation unit 114. In this case, the mode selection unit 116 performs an enlargement operation on a resolution-scalable code such as a JPEG2000 code based on an encoding method that uses conversion that generates a pixel value of a low-resolution image from an odd number of pixel values of a high-resolution image. If it is the target (or if the independent codes of a plurality of different resolution images obtained by the same resolution conversion are the targets of the enlargement operation), the B mode is automatically selected, and an even number of high resolution images If a resolution-scalable code based on an encoding method that uses conversion that generates pixel values of a low-resolution image from a single pixel value, for example, a JPEG-XR code or a FlashPix code, is the target of an enlargement operation (or similar When the independent codes of multiple different resolution images obtained by resolution conversion are subject to enlargement operation ) To automatically select the A mode.

このような自動的なモード選択は、例えば、符号のヘッダに記述されている符号化方式に関する情報や、符号のヘッダに予め記述したモード選択に関する情報、あるいは、符号ごとにＡ，Ｂモードのいずれを適用するかを管理するために用意したテーブルの情報などを参照することにより、容易に行うことができる。そして、このような自動的なモード選択を行わせるならば、対象とする符号の符号化方式などをユーザが意識しなくとも、過渡的ズーミング表示動作で表示された補間拡大画像に高解像度復号画像を上書き表示した時の画像のずれ、もしくは揺れを目立たなくすることができる。 Such automatic mode selection is, for example, information on the encoding method described in the code header, information on the mode selection previously described in the code header, or A or B mode for each code. This can be easily done by referring to information in a table prepared for managing whether to apply. If such automatic mode selection is performed, the high-resolution decoded image is displayed on the interpolated enlarged image displayed by the transient zooming display operation without the user being aware of the encoding method of the target code. It is possible to make the shift or shake of the image inconspicuous when overwritten.

なお、ユーザの拡大操作により、表示部１１０の画面に表示されている複数の画像を同時に拡大表示させることもできる。このように拡大操作の対象が複数ある場合においても、モード選択部１１６により、過渡的ズーミング表示における補間拡大処理のモードを拡大操作の対象ごとに適切に選択することにより、補間拡大画像に高解像度復号画像を上書き表示した際の画像のずれを目立たなくすることができる。拡大操作の対象に符号化方式などの異なる画像が混在している場合には、画像ごとに適したモードを選択させる必要があるが、これは上述の自動的なモード選択により可能である。 A plurality of images displayed on the screen of the display unit 110 can be enlarged and displayed at the same time by the user's enlargement operation. As described above, even when there are a plurality of enlargement operation targets, the mode selection unit 116 appropriately selects the interpolation enlargement processing mode in the transient zooming display for each enlargement operation target, thereby obtaining a high resolution for the interpolation enlargement image. It is possible to make the image shift inconspicuous when the decoded image is overwritten and displayed. When different images such as encoding methods are mixed in the enlargement operation target, it is necessary to select a mode suitable for each image, but this is possible by the automatic mode selection described above.

なお、本発明は以上に述べた実施形態のみに限定されるものではなく、様々に変形して実施し得るものである。 The present invention is not limited to the above-described embodiment, and can be implemented with various modifications.

本発明の画像処理装置のブロック図である。1 is a block diagram of an image processing apparatus of the present invention. 本発明の画像処理装置が実現されるコンピュータの一例を示すブロック図である。It is a block diagram which shows an example of the computer by which the image processing apparatus of this invention is implement | achieved. 拡大操作に関連した動作を説明するための状態遷移図である。It is a state transition diagram for demonstrating the operation | movement relevant to expansion operation. ２次元ＤＷＴを１回適用した後のサブバンド分割を示す図である。It is a figure which shows the subband division | segmentation after applying two-dimensional DWT once. ５／３ＤＷＴの対象となる１次元のオリジナルデータを示す図である。It is a figure which shows the one-dimensional original data used as the object of 5 / 3DWT. オリジナルデータと、それに５／３ＤＷＴを適用した後のデータの低周波成分とをプロットした図である。It is the figure which plotted the original data and the low frequency component of the data after applying 5 / 3DWT to it. オリジナルデータと、それに５／３ＤＷＴを適用した後のデータの低周波成分をニアリストネイバーにより２倍拡大したデータとをプロットした図である。It is the figure which plotted the original data and the data which expanded the low frequency component of the data after applying 5 / 3DWT to it 2 times by the near neighbor. オリジナルデータと、それに５／３ＤＷＴを２回適用した後のデータの低周波成分をニアレストレイバーにより２倍拡大したデータとをプロットした図である。It is the figure which plotted the original data and the data which expanded the low frequency component of the data after applying 5 / 3DWT twice to it by the near-less tray bar twice. オリジナルデータと、それに５／３ＤＷＴを適用した後のデータの低周波成分をバイリニアで２倍拡大したデータとをプロットした図である。It is the figure which plotted the original data and the data which expanded the low frequency component of the data after applying 5 / 3DWT to it twice bilinearly. バイリニア法において補間画像の画素の格子点の間に参照画素を配置するモードの説明図である。It is explanatory drawing of the mode which arrange | positions a reference pixel between the lattice points of the pixel of an interpolation image in a bilinear method. バイリニア法において補間画像の画素の格子点上に参照画素を配置するモードの説明図である。It is explanatory drawing of the mode which arrange | positions a reference pixel on the lattice point of the pixel of an interpolation image in a bilinear method. オリジナルデータと、それに５／３ＤＷＴを適用した後のデータの低周波成分を、バイリニア法により、補間画像の画素の格子点上に参照画素を配置するモードで２倍拡大したデータとをプロットした図である。A plot of original data and data obtained by enlarging the low-frequency component of the data after applying 5 / 3DWT to it twice in a mode in which reference pixels are arranged on the lattice points of the pixels of the interpolation image by the bilinear method. It is. ＪＰＥＧ−ＸＲの周波数階層の説明図である。It is explanatory drawing of the frequency hierarchy of JPEG-XR.

Explanation of symbols

１００符号記憶部
１０２復号部
１０４復号画像記憶部
１０６補間拡大部
１０８補間拡大画像記憶部
１１０表示部
１１２制御部
１１４ユーザ操作部
１１６モード選択部 100 Code storage unit 102 Decoding unit 104 Decoded image storage unit 106 Interpolation enlargement unit 108 Interpolation enlarged image storage unit 110 Display unit 112 Control unit 114 User operation unit 116 Mode selection unit

Claims

When the enlargement operation by the user is started in a state where the decoded image obtained by the decoding process of the code that is the target of the enlargement operation is displayed, an interpolation enlargement process of the decoded image is performed while the enlargement operation is continued. A transient zooming display operation for displaying the obtained interpolated enlarged image is performed, and when the enlargement operation is completed, the code that is the target of the enlargement operation is decoded, and the obtained decoded image is converted to the transient zooming. An image processing apparatus for displaying in an overwritten form on the interpolation enlarged image displayed at the end of the display operation,
The interpolation enlargement process is a process by an image interpolation method that determines the value of each pixel of the interpolation enlarged image by calculation using the values of a plurality of reference pixels.
The interpolation enlargement means for performing the interpolation enlargement process is an A mode in which the reference pixels are arranged between the lattice points of the pixels of the interpolation enlarged image as an operation mode related to the positional relationship between the reference pixels and the pixels of the interpolation enlarged image, or The B mode in which the reference pixel is arranged on the lattice point of the pixel of the interpolation enlarged image can be selected.
An image processing apparatus comprising mode selection means for selecting either the A mode or the B mode as an operation mode of the interpolation enlargement means in accordance with a user instruction.

When the enlargement operation by the user is started in a state where the decoded image obtained by the decoding process of the code that is the target of the enlargement operation is displayed, an interpolation enlargement process of the decoded image is performed while the enlargement operation is continued. A transient zooming display operation for displaying the obtained interpolated enlarged image is performed, and when the enlargement operation is completed, the code that is the target of the enlargement operation is decoded, and the obtained decoded image is converted to the transient zooming. An image processing apparatus for displaying in an overwritten form on the interpolation enlarged image displayed at the end of the display operation,
The interpolation enlargement process is a process by an image interpolation method that determines the value of each pixel of the interpolation enlarged image by calculation using the values of a plurality of reference pixels in the vicinity thereof,
The interpolation enlargement means for performing the interpolation enlargement process is an A mode in which the reference pixels are arranged between the lattice points of the pixels of the interpolation enlarged image as an operation mode related to the positional relationship between the reference pixels and the pixels of the interpolation enlarged image, or The B mode in which the reference pixel is arranged on the lattice point of the pixel of the interpolation enlarged image can be selected.
An image processing apparatus comprising mode selection means for automatically selecting one of the A mode and B mode as an operation mode of the interpolation enlargement means.

The mode selection means, when a resolution scalable code by an encoding method using a conversion that generates a pixel value of a low resolution image from an odd number of pixel values of a high resolution image is an object of an enlargement operation, The B mode is selected as the operation mode of the interpolation enlarging means, and the resolution scalable code by the encoding method using the conversion that generates the pixel value of the low resolution image from the even number of pixel values of the high resolution image is enlarged. The image processing apparatus according to claim 2, wherein an A mode is selected as an operation mode of the interpolation enlarging means.

In the mode selection means, an independent code of each of a plurality of different resolution images obtained by resolution conversion that generates pixel values of a low resolution image from an odd number of pixel values of a high resolution image is an object of an enlargement operation. In this case, the B mode is selected as the operation mode of the interpolation enlargement means, and a plurality of different resolution images obtained by resolution conversion for generating pixel values of a low resolution image from an even number of pixel values of a high resolution image. The image processing apparatus according to claim 2, wherein when each independent code is an object of an enlargement operation, an A mode is selected as an operation mode of the interpolation enlargement unit.

4. When there are a plurality of enlargement operation targets, for each of the targets, the mode selection unit automatically selects either the A mode or the B mode as the operation mode of the interpolation enlargement unit. 5. The image processing apparatus according to 4.