JP2000197044A

JP2000197044A - Image processing unit, its method and computer-readable memory

Info

Publication number: JP2000197044A
Application number: JP37224198A
Authority: JP
Inventors: Hirochika Matsuoka; 寛親松岡
Original assignee: Canon Inc
Current assignee: Canon Inc
Priority date: 1998-12-28
Filing date: 1998-12-28
Publication date: 2000-07-14

Abstract

PROBLEM TO BE SOLVED: To provide an image processing unit that can easily composite plural images and generate a composite image with satisfactory image quality, and to provide its method and a computer-readable memory. SOLUTION: This image processing unit extracts a characteristic of a background from coded data of at least one background image, extracts a characteristic of an object, including statistical information of the image information, from coded data of at least one object image, decodes the coded data of the background image to generate a background reproduction image, and decodes the coded data of the object image to generate an object reproduction image. A correction device 216 corrects the object reproduction image on the basis of the characteristics of the background and the object. An image synthesizer 217 composites the background reproduction image and the corrected object reproduction image.

Description

DETAILED DESCRIPTION OF THE INVENTION

[0001]

[Technical Field of the Invention] The present invention relates to an image processing apparatus and method for compositing a plurality of images, and to a computer-readable memory.

[0002]

[Description of the Related Art] H.261, MPEG-1, and MPEG-2 are known as conventional encoding schemes for moving images. These schemes have been internationally standardized by the ITU and ISO and are documented as Recommendation H.261, ISO 11172, and ISO 13818, respectively. Motion JPEG encoding, which encodes a moving image by applying still-image encoding (for example, JPEG encoding) to each frame, is also known.

[0003] Hereinafter, an encoding system that encodes a video signal as a moving image using MPEG-1 will be described with reference to FIG. 19.

[0004] FIG. 19 is a diagram showing the configuration of a conventional encoding system.

[0005] A video signal input from a TV camera 1001 is supplied to an input terminal 1003 of a moving image encoding apparatus 1002 and passed to an A/D converter 1004. The video signal, converted into a digital signal by the A/D converter 1004, is input to a block former 1005, which forms 16 × 16 macroblocks in order from the upper left to the lower right of the image based on the video signal. MPEG-1 has I-frames, which are intra-frame coded; P-frames, which are inter-frame coded from a past frame; and B-frames, which are inter-frame coded from past and future frames. A frame mode unit 1015 determines the mode of each frame. The frame mode is determined in consideration of the encoding bit rate, the prevention of image quality degradation caused by accumulated DCT calculation errors, image editing, and scene changes.
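
By way of illustration only, the raster-order formation of 16 × 16 macroblocks described in paragraph [0005] may be sketched in Python as follows. The function name `macroblock_origins` is hypothetical, and the sketch assumes image dimensions that are multiples of the macroblock size; handling of partial blocks is not shown.

```python
def macroblock_origins(width, height, size=16):
    """Yield the (x, y) top-left corner of each macroblock in raster order.

    Assumes width and height are multiples of `size` (an assumption of
    this sketch, not a statement from the source).
    """
    for y in range(0, height, size):        # rows of macroblocks, top to bottom
        for x in range(0, width, size):     # left to right within one row
            yield (x, y)
```

For a 352 × 240 image this yields 22 × 15 = 330 macroblock positions, starting at the upper left and ending at the lower right.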

[0006] In an I-frame, the motion compensator 1006 does not operate and outputs 0. The subtractor 1007 subtracts the output of the motion compensator 1006 from the output of the block former 1005 and inputs the result to the DCT transformer 1008. The DCT transformer 1008 applies the DCT to the input signal in units of 8 × 8 blocks, and the transformed signal is quantized by the quantizer 1009. The encoder 1010 then rearranges the quantized signal into one dimension and determines codes from the zero-run lengths and values. The encoded signal is output from the terminal 1011 and is recorded on a storage medium or transmitted via a network, a line, or the like. The output of the quantizer 1009 is also inversely quantized by the inverse quantizer 1012, inverse DCT-transformed by the inverse DCT transformer 1013, added to the output of the motion compensator 1006 by the adder 1014, and stored in the frame memory 1015 or 1016.
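
The one-dimensional reordering and zero-run coding performed by the encoder 1010 may be sketched as follows. This is a simplified illustration, not the encoder of the source: it assumes the conventional zigzag scan, emits plain (zero run, value) pairs instead of variable-length codes, uses an illustrative `"EOB"` end marker, and does not treat the DC coefficient separately. The function names are hypothetical.

```python
def zigzag_order(n=8):
    """Return the (row, col) positions of an n x n block in zigzag scan order."""
    order = []
    for s in range(2 * n - 1):                      # s = row + col selects one anti-diagonal
        diag = [(r, s - r) for r in range(n) if 0 <= s - r < n]
        # Even diagonals run from bottom-left to top-right, odd ones the reverse.
        order.extend(reversed(diag) if s % 2 == 0 else diag)
    return order


def run_length_pairs(block):
    """Return (zero_run, value) pairs for the nonzero coefficients, then "EOB"."""
    pairs, run = [], 0
    for r, c in zigzag_order(len(block)):
        value = block[r][c]
        if value == 0:
            run += 1                                # count zeros preceding the next nonzero value
        else:
            pairs.append((run, value))
            run = 0
    pairs.append("EOB")                             # trailing zeros are implied by the end marker
    return pairs
```

Because quantization leaves most high-frequency coefficients at zero, the zigzag order tends to group those zeros at the end of the sequence, where they cost almost nothing to encode.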

[0007] In a P-frame, the motion compensator 1006 operates. The output of the block former 1005 is input to the motion compensator 1006, which performs motion compensation against the frame memory 1015 or 1016 holding the image of the temporally preceding frame and outputs a motion vector and a predicted macroblock. The subtractor 1007 computes the difference between the input from the block former 1005 and the predicted macroblock and inputs it to the DCT transformer 1008. The DCT transformer 1008 applies the DCT to the input signal, and the transformed signal is quantized by the quantizer 1009. The encoder 1010 determines the code for the quantized signal together with the motion vector, and the code is output from the terminal 1011. The output of the quantizer 1009 is also inversely quantized by the inverse quantizer 1012, inverse DCT-transformed by the inverse DCT transformer 1013, added to the output of the motion compensator 1006 by the adder 1014, and stored in the frame memory 1015 or 1016.
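
The source does not state how the motion compensator 1006 finds the motion vector. As a minimal sketch, assuming an exhaustive block-matching search that minimizes the sum of absolute differences (SAD), the P-frame prediction step could look like this. All names and the small search range are hypothetical.

```python
def sad(cur, ref, bx, by, dx, dy, size):
    """Sum of absolute differences between a current block and a displaced reference block."""
    return sum(abs(cur[by + y][bx + x] - ref[by + dy + y][bx + dx + x])
               for y in range(size) for x in range(size))


def best_motion_vector(cur, ref, bx, by, size=16, search=2):
    """Return the (dx, dy) displacement into `ref` with the lowest SAD."""
    height, width = len(ref), len(ref[0])
    best = None
    for dy in range(-search, search + 1):
        for dx in range(-search, search + 1):
            # Skip candidates whose reference block would leave the frame.
            if by + dy < 0 or bx + dx < 0:
                continue
            if by + dy + size > height or bx + dx + size > width:
                continue
            cost = sad(cur, ref, bx, by, dx, dy, size)
            if best is None or cost < best[0]:
                best = (cost, dx, dy)
    return best[1], best[2]


def residual(cur, ref, bx, by, mv, size=16):
    """Difference between the current block and its motion-compensated prediction."""
    dx, dy = mv
    return [[cur[by + y][bx + x] - ref[by + dy + y][bx + dx + x]
             for x in range(size)] for y in range(size)]
```

The residual corresponds to the output of the subtractor 1007; when the prediction is good it is close to zero and produces very little code after the DCT and quantization.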

[0008] In a B-frame, motion compensation is performed as in a P-frame, except that the motion compensator 1006 performs motion compensation from both frame memories 1015 and 1016 to generate a predicted macroblock, and encoding is then performed.

[0009]

[Problems to Be Solved by the Invention] However, the conventional method described above, which encodes the entire image, must repeatedly send images of motionless portions such as the background, which wastes code length. For example, in a videophone or video conference, only the person actually moves; the background does not. An I-frame, which is sent at regular intervals, also carries this motionless background image, so code is wasted. FIG. 20 shows an example.

[0010] FIG. 20 shows a frame in which a person in a room faces a television camera. The person 1051 and the background 1050 are encoded in the same way within the same frame. Because the background 1050 does not move, almost no code is generated for it when motion compensation is performed, but it is encoded whenever an I-frame is sent. Code is therefore sent repeatedly even for portions with no motion, which is wasteful. Furthermore, when the person 1051 moves substantially and encoding generates a large amount of code, a sufficient code length cannot be allotted to the subsequent I-frame. The quantization coefficients for that I-frame must then be made coarser, which degrades the image quality even of the motionless background.

[0011] Coding efficiency could therefore be improved by encoding the background and the target separately, as in MPEG-4. In that case, a target photographed at another location can also be composited, so that, for example, a frame can be constructed in which one more person 1052 is composited into the frame of FIG. 20, as shown in FIG. 21.

[0012] However, because of color casts arising from the characteristics of the photographing equipment, the composited image (person 1052) may look unnatural and give the viewer a sense of incongruity. For example, if the person 1052 is photographed with equipment that tends toward a green cast while the person 1051 is photographed with equipment that tends toward a red cast, the color casts stand out conspicuously in the composite image, which looks very unnatural.

[0013] Likewise, differences in contrast arising from differences in environment, such as lighting conditions and the characteristics of the photographing equipment, can leave the composited image unnatural and give the viewer a sense of incongruity. For example, when the person 1052 is photographed in sunlight and the person 1051 under artificial light, the contrast of the two differs greatly, and the result is very unnatural.

[0014] The present invention has been made in view of the above problems, and its object is to provide an image processing apparatus and method, and a computer-readable memory, that can easily composite a plurality of images and generate a composite image of good image quality.

[0015]

[Means for Solving the Problems] To achieve the above object, an image processing apparatus according to the present invention has the following configuration. That is, an image processing apparatus for compositing a plurality of images comprises: background feature extraction means for extracting a background feature from encoded data of at least one background image; target feature extraction means for extracting, from encoded data of at least one target image, a target feature that includes statistical information of image information; background decoding means for decoding the encoded data of the background image to generate a background reproduced image; target decoding means for decoding the encoded data of the target image to generate a target reproduced image; correction means for correcting the target reproduced image on the basis of the background feature and the target feature; and compositing means for compositing the background reproduced image and the target reproduced image corrected by the correction means.
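
As a rough sketch of the final compositing step named in paragraph [0015], the corrected target reproduced image can be pasted onto the background reproduced image wherever a mask marks target pixels. The mask convention (1 for target, 0 for background), the placement offsets `top` and `left`, and the function name are assumptions made for illustration; the sketch also assumes the target fits inside the background.

```python
def composite(background, target, mask, top, left):
    """Return a copy of `background` with masked `target` pixels pasted at (top, left)."""
    out = [row[:] for row in background]            # leave the input image untouched
    for y, (target_row, mask_row) in enumerate(zip(target, mask)):
        for x, (pixel, is_target) in enumerate(zip(target_row, mask_row)):
            if is_target:                           # background pixels of the target region are skipped
                out[top + y][left + x] = pixel
    return out
```

Correction is applied to the target before this step, so that the pasted pixels already match the color and contrast of the background.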

[0016] Preferably, the target feature extraction means includes calculation means for calculating a histogram based on the statistical information of the image information, and the correction means determines the correction method for the target image on the basis of the histogram.
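
A histogram of the kind referred to in paragraph [0016] can be built from any list of sample values, for example one value per block. The following is a minimal sketch; the bin count, the value range 0 to 255, and the function name are assumptions, and the rule by which the correction means chooses a correction method from the histogram is not shown.

```python
def histogram(values, bins=8, lo=0, hi=256):
    """Count how many values fall into each of `bins` equal-width bins over [lo, hi)."""
    counts = [0] * bins
    width = (hi - lo) / bins
    for v in values:
        index = int((v - lo) / width)
        index = max(0, min(index, bins - 1))        # clamp out-of-range samples to the end bins
        counts[index] += 1
    return counts
```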

[0017] Preferably, the target feature extraction means extracts DC information of block images included in the encoded data as the statistical information of the image information.
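
The DC information is useful as a statistic because, for the orthonormal 8 × 8 DCT, the DC coefficient equals eight times the mean pixel value of the block, so a block mean can be read from the encoded data without a full inverse transform. The sketch below checks this relationship numerically. It ignores quantization and any level shift, and the function names are hypothetical.

```python
import math


def dct_coefficient(block, u, v):
    """Return coefficient (u, v) of the orthonormal 2-D DCT-II of a square block."""
    n = len(block)
    scale_u = math.sqrt((1 if u == 0 else 2) / n)
    scale_v = math.sqrt((1 if v == 0 else 2) / n)
    total = 0.0
    for y in range(n):
        for x in range(n):
            total += (block[y][x]
                      * math.cos((2 * y + 1) * u * math.pi / (2 * n))
                      * math.cos((2 * x + 1) * v * math.pi / (2 * n)))
    return scale_u * scale_v * total


def block_mean_from_dc(dc, n=8):
    """Recover the mean pixel value of an n x n block from its DC coefficient."""
    return dc / n
```

Collecting one such mean per block gives a low-resolution view of the image from which statistics such as a histogram, maximum, minimum, mean, or variance can be estimated.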

[0018] Preferably, the target feature extraction means extracts low-frequency information of block images included in the encoded data as the statistical information of the image information.

[0019] Preferably, one or both of the background decoding means and the target decoding means comprise decoding means for decoding quantized data from the encoded data, inverse quantization means for calculating frequency-domain data from the quantized data, and fast inverse discrete cosine transform means for calculating spatial-domain data from the frequency-domain data. The fast inverse discrete cosine transform means includes output means for outputting the radix butterfly operation results of an arbitrary number of stages, and the target feature extraction means extracts the radix butterfly operation results of that arbitrary number of stages as low-frequency information of the image information.

[0020] Preferably, the correction means includes time-series adaptation means for gradually changing, over time, the input/output relationship between the signal the correction means receives and the signal it outputs.
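
Paragraph [0020] does not say how the gradual change is realized. One simple possibility, shown here purely as an assumption, is first-order exponential smoothing of the correction parameters (for example a gain and an offset) from frame to frame, so that the correction never jumps abruptly. The function name and the `rate` parameter are hypothetical.

```python
def smooth_parameters(previous, target, rate=0.1):
    """Move each correction parameter a fraction `rate` of the way toward its target."""
    return tuple(p + rate * (t - p) for p, t in zip(previous, target))
```

Calling this once per frame with the newly estimated parameters as `target` makes the applied correction converge smoothly; a smaller `rate` gives a slower, less visible transition.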

[0021] Preferably, the target feature extraction means extracts, as the statistical information of the image information, the maximum and minimum pixel values from the DC information or low-frequency information of block images included in the encoded data.

[0022] Preferably, the target feature extraction means extracts, as the statistical information of the image information, the variance and mean of pixel values from the DC information or low-frequency information of block images included in the encoded data. Preferably, the correction means applies a linear transformation to the target image. Preferably, the correction means applies a piecewise spline transformation to the target image.
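
The statistics named in paragraphs [0021] and [0022] and the linear transformation can be sketched as follows. The input is assumed to be a list of per-block values such as block means; the population variance, the clamping range 0 to 255, and the function names are choices made for this illustration. The piecewise spline transformation is not shown.

```python
def block_statistics(values):
    """Return (minimum, maximum, mean, population variance) of a non-empty list of values."""
    mean = sum(values) / len(values)
    variance = sum((v - mean) ** 2 for v in values) / len(values)
    return min(values), max(values), mean, variance


def linear_transform(pixel, gain, offset, lo=0, hi=255):
    """Apply y = gain * x + offset and clamp the rounded result to [lo, hi]."""
    return max(lo, min(hi, round(gain * pixel + offset)))
```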

[0023] Preferably, the correction means includes detection means for detecting, from the target feature extracted by the target feature extraction means, whether a significant color bias is present, and color correction means for correcting the color bias on the basis of the detection result of the detection means. Preferably, the detection means determines, from the statistical information included in the extracted target feature, whether the condition is satisfied that the absolute difference of the means and the absolute difference of the variances between the color signals are at or below a threshold and, when the condition is satisfied, further detects whether a significant color bias is present in a specific region of the histogram based on the statistical information. Preferably, the color correction means applies a linear correction so that the maximum values of the color signals become equal. Preferably, the color correction means does not correct the blue signal.
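
One reading of the last two sentences of paragraph [0023] is that the red and green signals are scaled linearly so that their maxima match the maximum of the uncorrected blue signal. The sketch below implements that reading only; it is an interpretation, not a statement of the embodiment, and it omits the detection step. The function name is hypothetical.

```python
def equalize_channel_maxima(pixels):
    """Scale R and G so their maxima equal the blue maximum; blue is left unchanged.

    `pixels` is a non-empty list of (r, g, b) tuples with values in 0..255.
    """
    r_max = max(p[0] for p in pixels)
    g_max = max(p[1] for p in pixels)
    b_max = max(p[2] for p in pixels)

    def scale(value, channel_max):
        if channel_max == 0:                        # nothing to scale in an all-zero channel
            return value
        return min(255, round(value * b_max / channel_max))

    return [(scale(r, r_max), scale(g, g_max), b) for r, g, b in pixels]
```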

[0024] Preferably, the correction means includes detection means for detecting a significant difference in contrast from the target feature extracted by the target feature extraction means and the background feature extracted by the background feature extraction means, and contrast correction means for correcting the contrast on the basis of the detection result of the detection means. Preferably, the detection means extracts the maximum and minimum pixel values obtained from each of the target feature and the background feature, and, for a target image and a background image whose maximum or minimum values differ, the contrast correction means performs correction so that the absolute difference between the maximum pixel values and the absolute difference between the minimum pixel values both decrease. Preferably, the detection means extracts the maximum and minimum pixel values obtained from each of the target feature and the background feature, and, for a target image and a background image whose maximum or minimum values are approximately equal, the contrast correction means performs correction so that the absolute difference between the variances decreases.
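
The two contrast corrections of paragraph [0024] can each be expressed as a linear map y = gain * x + offset applied to the target image. The following is a sketch under stated assumptions: the `strength` parameter (the fraction of the gap to close), the full variance match, and both function names are hypothetical and are not taken from the source.

```python
import math


def range_matching_map(obj_min, obj_max, bg_min, bg_max, strength=1.0):
    """Return (gain, offset) moving the target's [min, max] toward the background's."""
    new_min = obj_min + strength * (bg_min - obj_min)
    new_max = obj_max + strength * (bg_max - obj_max)
    if obj_max == obj_min:                          # flat target: only a shift is possible
        return 1.0, new_min - obj_min
    gain = (new_max - new_min) / (obj_max - obj_min)
    return gain, new_min - gain * obj_min


def variance_matching_map(obj_mean, obj_var, bg_var):
    """Return (gain, offset) that keeps the target mean and matches the background variance."""
    if obj_var == 0:
        return 1.0, 0.0
    gain = math.sqrt(bg_var / obj_var)              # variance scales with the square of the gain
    return gain, obj_mean * (1 - gain)
```

With `strength` below 1 the differences in maximum and minimum are reduced rather than removed, which matches the wording "so that the absolute difference decreases".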

[0025] To achieve the above object, an image processing method according to the present invention has the following configuration. That is, an image processing method for compositing a plurality of images comprises: a background feature extraction step of extracting a background feature from encoded data of at least one background image; a target feature extraction step of extracting, from encoded data of at least one target image, a target feature that includes statistical information of image information; a background decoding step of decoding the encoded data of the background image to generate a background reproduced image; a target decoding step of decoding the encoded data of the target image to generate a target reproduced image; a correction step of correcting the target reproduced image on the basis of the background feature and the target feature; and a compositing step of compositing the background reproduced image and the target reproduced image corrected in the correction step.

[0026] To achieve the above object, a computer-readable memory according to the present invention has the following configuration. That is, a computer-readable memory storing program code for image processing that composites a plurality of images comprises: program code for a background feature extraction step of extracting a background feature from encoded data of at least one background image; program code for a target feature extraction step of extracting, from encoded data of at least one target image, a target feature that includes statistical information of image information; program code for a background decoding step of decoding the encoded data of the background image to generate a background reproduced image; program code for a target decoding step of decoding the encoded data of the target image to generate a target reproduced image; program code for a correction step of correcting the target reproduced image on the basis of the background feature and the target feature; and program code for a compositing step of compositing the background reproduced image and the target reproduced image corrected in the correction step.

[Embodiments of the Invention] Preferred embodiments of the present invention will be described below in detail with reference to the drawings.

<Embodiment 1> FIG. 1 is a block diagram showing the configuration of a moving image transmission system according to Embodiment 1 of the present invention.

[0027] Embodiment 1 is described using the following case as an example: encoded images, obtained by encoding images captured at a plurality of locations with different shooting environments and then transmitted, and encoded data stored in advance in a storage medium such as a database are each decoded and composited at a host that manages the database, and the result is transmitted to other terminals or networks.

[0028] In FIG. 1, reference numeral 101 denotes a TV camera that shoots a moving image against a solid blue background (blue back). The TV camera 101 is a moving image input means and may be any such means, such as a TV camera or another storage medium. Here it is assumed to capture the person 1052 of FIG. 21. Reference numeral 102 denotes a TV camera that shoots a moving image; it too may be any moving image input means. Here it is assumed to capture the background image 1050 and the person 1051 of FIG. 20. Reference numeral 103 denotes a target extractor that extracts the person 1052, which is the target image, from the blue back. Reference numeral 105 denotes a target encoding unit that encodes the extracted target image; here it performs MPEG-4 encoding.

[0029] Reference numeral 104 denotes an encoder that encodes the moving image captured by the TV camera 102. The encoding scheme is not particularly limited; MPEG-1 encoding is used here as an example. Reference numerals 106 and 107 denote transmitters that transmit encoded data, and 108 and 109 denote communication lines. Reference numerals 110 and 111 denote receivers that receive encoded data. Reference numeral 116 denotes a storage device that stores encoded data encoded in advance. Reference numeral 112 denotes a moving image editing apparatus according to the present invention. Reference numeral 113 denotes an encoder that encodes the editing result of the moving image editing apparatus 112; MPEG-1 encoding is used here as an example. The encoding scheme used by the encoder 113 is not limited to this and may be any scheme that encodes moving images, such as MPEG-4, MPEG-2, or H.263. Reference numeral 114 denotes a transmitter that transmits the encoded data obtained from the encoder 113. Reference numeral 115 denotes a communication network such as a public line or broadcast radio waves.

[0030] In this configuration, the TV camera 101 captures the person 1052, the imaging target, against the blue back. The target extractor 103 extracts the person 1052, which is the target image, from the input moving image. FIGS. 2 to 4 illustrate this process.

[0031] As shown in FIG. 2, the person 1052 to be imaged is cut out as a rectangular region 1200. The blue-back portion is then extracted to generate mask information 1201 as shown in FIG. 3. The image data of the region 1200 and the mask information 1201 are input to the target encoding unit 105. FIG. 4 shows an image obtained through processing by the target encoding unit 105, the details of which are described later.
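
The generation of mask information from a blue-back image can be sketched as a simple per-pixel threshold test. The thresholds, the mask convention (0 for blue background, 1 for target), and the function name are assumptions made for illustration; the source does not specify how the target extractor 103 decides which pixels are blue back.

```python
def blue_back_mask(region, blue_min=160, other_max=100):
    """Return a mask with 0 for blue-back pixels and 1 for target pixels.

    `region` is a list of rows of (r, g, b) tuples. A pixel counts as
    blue back when blue is strong and red and green are both weak.
    """
    return [[0 if (b >= blue_min and r <= other_max and g <= other_max) else 1
             for (r, g, b) in row]
            for row in region]
```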

[0032] The storage device 116 consists of, for example, a CD-ROM, a magnetic disk, or a tape storage device, and can store data without being limited to a specific encoding scheme. Here it is assumed in particular to be a storage device that holds encoded data consisting of sequences encoded with MPEG-4 and to store, for example, a previously extracted person 1053 as shown in FIG. 22.

[0033] Next, the detailed configuration of the target encoding unit 105 of Embodiment 1 will be described with reference to FIG. 5.

[0034] FIG. 5 is a block diagram showing the detailed configuration of the target encoding unit according to Embodiment 1 of the present invention.

[0035] Reference numerals 121 and 122 denote terminals. From the target extractor 103 of FIG. 1, the terminal 122 receives the image data of the region 1200 of the target image to be encoded, and the terminal 121 receives the mask information 1201. Reference numeral 123 denotes a mask memory that stores the mask information 1201. Reference numeral 124 denotes a mask encoder that encodes the mask information 1201. Reference numeral 125 denotes a target memory that stores the image data of the region 1200. Reference numeral 126 denotes an average value calculator that obtains the average of the pixel values of the target image. Reference numeral 127 denotes a block former that divides the target image into blocks, which are the units of encoding. Reference numeral 128 denotes a frame mode setter that selects the I, P, or B frame mode as appropriate, according to a predetermined cycle, as the encoding mode of each frame.
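
A frame mode setter that follows a predetermined cycle can be sketched as a repeating pattern of frame types. The pattern `"IBBPBBPBB"` used as the default is only an example and is not given in the source; the source states only that the mode follows a predetermined cycle. The function name is hypothetical.

```python
from itertools import cycle, islice


def frame_modes(count, pattern="IBBPBBPBB"):
    """Return the frame mode ('I', 'P' or 'B') for each of `count` frames."""
    return list(islice(cycle(pattern), count))
```

Any pattern passed in should begin with `'I'`, since the first frame has no earlier frame to predict from.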

[0036] Reference numeral 129 denotes a subtractor. Reference numeral 130 denotes a DCT transformer that performs the DCT (discrete cosine transform). Reference numeral 131 denotes a quantizer that quantizes the output of the DCT transformer 130. Reference numeral 132 denotes an encoder that arranges the quantization result in one dimension, assigns codes to the zero-run lengths and values, and encodes them. Reference numeral 133 denotes a combiner that combines the encoded data generated by the mask encoder 124 and the encoder 132. Reference numeral 134 denotes a terminal that outputs the final encoded data. Reference numeral 135 denotes an inverse quantizer that performs inverse quantization. Reference numeral 136 denotes an inverse DCT transformer that performs the inverse DCT. Reference numeral 137 denotes an adder, and 138 and 139 denote target memories that store reproduced image data. Reference numeral 140 denotes a motion compensator that performs motion compensation from the input from the block former 127 and the contents of the target memories 138 and 139.

[0037] In the above configuration, when encoding starts, each memory is cleared and each component is initialized. The frame mode setter 128 designates an I-frame when the first frame is encoded. At this time, the motion compensator 140 does not operate and outputs 0 as the motion compensation prediction value. The image data of the region 1200 is read from the terminal 122 and the mask information 1201 from the terminal 121, in synchronization, and they are stored in the target memory 125 and the mask memory 123, respectively.

[0038] When one frame has been stored, the mask encoder 124 encodes the mask information 1201 and outputs it to the combiner 133. The average value calculator 126 determines, in accordance with the mask information 1201, whether each input pixel belongs to the background or to the target image, and calculates the average value m of the person 1052, which is the target image. The block former 127 reads the image data of the region 1200 and the mask information 1201 in synchronization, block by block. If the mask information 1201 of an input pixel indicates a background pixel, the block former replaces the pixel with the average value m; otherwise it outputs the input pixel value unchanged, and 8 × 8 blocks are formed in this way. Across the whole image, the background portion is thus replaced with the average value m, as shown in FIG. 4. Since the motion compensation prediction value is 0, the subtractor 129 outputs its input unchanged. This output is DCT-transformed by the DCT transformer 130, and the coefficients are quantized by the quantizer 131. The quantization result is assigned codes by the encoder 132 and output to the combiner 133. The combiner 133 adds the necessary headers to, and orders, the encoded data generated by the mask encoder 124 and the encoder 132, and outputs it from the terminal 134. The quantization result is also inversely quantized by the inverse quantizer 135, reproduced pixel values are obtained by the inverse DCT transformer 136, and these are stored via the adder 137 in either the target memory 138 or 139.
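
The replacement of background pixels by the average value m of the target pixels, as carried out by the average value calculator and the block former in paragraph [0038], can be sketched for a single-component image as follows. The mask convention (1 for target, 0 for background), the rounding of m to an integer, and the function names are assumptions of this illustration.

```python
def object_mean(pixels, mask):
    """Return the mean of the pixels whose mask entry marks them as target pixels."""
    total = count = 0
    for pixel_row, mask_row in zip(pixels, mask):
        for pixel, is_target in zip(pixel_row, mask_row):
            if is_target:
                total += pixel
                count += 1
    return total / count if count else 0


def pad_background(pixels, mask):
    """Replace every background pixel with the rounded mean m of the target pixels."""
    m = round(object_mean(pixels, mask))
    return [[pixel if is_target else m
             for pixel, is_target in zip(pixel_row, mask_row)]
            for pixel_row, mask_row in zip(pixels, mask)]
```

Filling the background with a value typical of the target likely keeps the blocks that straddle the target boundary smoother than leaving saturated blue in place, which would reduce the code generated for those blocks.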

[0039] When the frame mode setter 128 designates a P-frame or a B-frame, the motion compensator 140 is operated. It reads the image data needed for motion compensation from the target memories 138 and 139 as appropriate and determines whether to perform motion compensation. When motion compensation is performed, the motion compensation prediction value is output to the subtractor 129 and the adder 137, and the motion vector used for motion compensation is input to the encoder 132. When motion compensation is not performed, the motion compensation prediction value is set to 0.

【００４０】The encoded data produced in this way by the target encoding unit 105 is output to the communication line 108 via the transmitter 106.

【００４１】Meanwhile, the image captured by the TV camera 102 is encoded in MPEG-1 by the encoder 104, which has the same configuration as the moving image encoding apparatus 1002 shown in FIG. 19, and is output to the communication line 109 via the transmitter 107.

【００４２】The receivers 110 and 111 receive the encoded data via the communication lines 108 and 109 and send it to the moving image editing apparatus 112. The storage device 116 reads out an encoded sequence encoded in MPEG-4 and sends the encoded data to the moving image editing apparatus 112.

【００４３】Next, the detailed configuration of the moving image editing apparatus 112 of the first embodiment is described with reference to FIG. 6.

【００４４】FIG. 6 is a block diagram showing the detailed configuration of the moving image editing apparatus of the first embodiment of the present invention.

【００４５】Reference numerals 200, 201, and 202 denote terminals: terminal 200 receives encoded data from the receiver 110, terminal 201 from the receiver 111, and terminal 202 from the storage device 116. These encoded data are input to terminals 219, 220, and 221 of the target decoders 203 and 204 and the decoder 205, respectively. Terminals 206, 209, and 225 output image data in RGB format. Terminals 208, 211, and 212 output the color cast correction image information 222, 223, and 224 needed to calculate the color cast correction values. Terminals 207 and 210 each output mask information. Reference numeral 213 denotes a correction value calculator that calculates correction values from the color cast correction image information.

【００４６】Reference numerals 214, 215, and 216 denote correctors that correct the color cast of the image data using the correction values. Reference numeral 217 denotes an image synthesizer that composites image data on the basis of the image data and the mask information. Reference numeral 218 denotes a terminal that outputs the composited RGB image data to the encoder 113.

【００４７】Next, the detailed configuration of the target decoders 203 and 204 of the first embodiment is described with reference to FIG. 7. FIG. 7 is described as the detailed configuration of the target decoder 203; the target decoder 204 has the same configuration, so its description is omitted here.

【００４８】FIG. 7 is a block diagram showing the detailed configuration of the target decoder of the first embodiment of the present invention. Reference numeral 219 denotes a terminal to which the encoded data from the receiver 110 is input. Reference numeral 241 denotes a separator that separates the input encoded data into the encoded data of the mask information and the encoded data of the target image area. Reference numeral 242 denotes a mask decoder that decodes the mask information. Reference numeral 243 denotes a mask memory that stores the mask information; the mask information in the mask memory 243 is output from terminal 207. Reference numeral 244 denotes a code memory that stores the encoded data of the target image area, and 245 denotes a decoder that decodes it. Reference numerals 246 and 247 denote demultiplexers; 248, 255, and 262 denote inverse quantizers; and 249, 256, and 263 denote inverse DCT transformers.

【００４９】Reference numerals 250, 257, and 264 denote adders. Reference numerals 251, 252, and 253 denote target memories that store the luminance Y data of the reproduced target image area; 258, 259, and 260 denote target memories that store its color difference Cb data; and 265, 266, and 267 denote target memories that store its color difference Cr data. Reference numerals 254, 261, and 268 denote motion compensators. Reference numerals 269 and 273 denote color signal converters that convert image data from YCbCr format to RGB format. Reference numerals 270, 271, and 272 denote buffers. Terminal 206 outputs the image data in RGB format, terminal 207 outputs the mask information, and terminal 208 outputs the color cast correction image information.

【００５０】In the above configuration, the separator 241 separates the input encoded data into the encoded data of the mask information and the encoded data of the target image area, which are input to the mask decoder 242 and the code memory 244, respectively. The mask decoder 242 decodes the encoded data of the mask information to reproduce the mask information and stores it in the mask memory 243. The encoded data stored in the code memory 244 is decoded by the decoder 245 to reproduce the quantized values, which the demultiplexer 247 then demultiplexes into luminance Y data, color difference Cb data, and color difference Cr data. The luminance Y data is input to the inverse quantizer 248, the color difference Cb data to the inverse quantizer 255, and the color difference Cr data to the inverse quantizer 262.

【００５１】The luminance Y data is inverse-quantized by the inverse quantizer 248 and inverse-DCT-transformed by the inverse DCT transformer 249. For an I-frame, the motion compensator 254 does not operate and outputs 0. For a P-frame or B-frame, the motion compensator 254 operates and outputs a motion-compensated prediction value. The adder 250 adds the output of the inverse DCT transformer 249 and the output of the motion compensator 254, and the sum is stored in the target memory 251 and in either the target memory 252 or 253. In addition, for I-frames only, of the output of the inverse quantizer 248, only the DC component information, which represents the average value of the luminance Y data, is stored in the buffer 272.
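The per-block flow just described (add the prediction, and for I-frames also keep the DC term) might be sketched as below. This is an illustrative sketch, not the circuit of FIG. 7: the function name, the use of a Python list as the buffer, and the division by 8 are assumptions. The division relies on the 8 × 8 DCT normalization used by MPEG-1, under which the DC coefficient equals the sum of the 64 samples divided by 8, that is, 8 times the block mean.

```python
def reconstruct_block(frame_type, idct_block, prediction, dc_coeff, dc_buffer):
    """Add the motion-compensated prediction to the inverse DCT output.

    For I-frames the compensator outputs 0 and the block mean derived from
    the dequantized DC coefficient is appended to dc_buffer (hypothetical
    stand-in for the buffer that feeds the color cast statistics).
    """
    if frame_type == "I":
        prediction = [[0] * len(row) for row in idct_block]
        dc_buffer.append(dc_coeff / 8)  # assumed normalization: DC = 8 * mean
    return [[a + b for a, b in zip(row_a, row_b)]
            for row_a, row_b in zip(idct_block, prediction)]


dc_buffer = []
flat = [[5] * 8 for _ in range(8)]
i_block = reconstruct_block("I", flat, None, 40.0, dc_buffer)
p_block = reconstruct_block("P", flat, [[1] * 8 for _ in range(8)], 40.0, dc_buffer)
```

Because the mean is read straight from the dequantized coefficient, the statistics path needs no inverse DCT, which is the source of the speed advantage claimed for the first embodiment.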

【００５２】The color difference Cb data is inverse-quantized by the inverse quantizer 255 and inverse-DCT-transformed by the inverse DCT transformer 256. For an I-frame, the motion compensator 261 does not operate and outputs 0. For a P-frame or B-frame, the motion compensator 261 operates and outputs a motion-compensated prediction value. The adder 257 adds the output of the inverse DCT transformer 256 and the output of the motion compensator 261, and the sum is stored in the target memory 258 and in either the target memory 259 or 260. In addition, for I-frames only, of the output of the inverse quantizer 255, only the DC component information, which represents the average value of the color difference Cb data, is stored in the buffer 271.

【００５３】The color difference Cr data is inverse-quantized by the inverse quantizer 262 and inverse-DCT-transformed by the inverse DCT transformer 263. For an I-frame, the motion compensator 268 does not operate and outputs 0. For a P-frame or B-frame, the motion compensator 268 operates and outputs a motion-compensated prediction value. The adder 264 adds the output of the inverse DCT transformer 263 and the output of the motion compensator 268, and the sum is stored in the target memory 265 and in either the target memory 266 or 267. In addition, for I-frames only, of the output of the inverse quantizer 262, only the DC component information, which represents the average value of the color difference Cr data, is stored in the buffer 270.

【００５４】When the processing of a macroblock is finished, the luminance Y DC component information, the color difference Cb DC component information, and the color difference Cr DC component information are read from the buffers 272, 271, and 270, respectively, converted to RGB format by the color signal converter 273, and output from terminal 208 as the color cast correction image information.
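The conversion of the per-block DC values to RGB can be illustrated with the short Python sketch below. The specification does not state which conversion matrix the color signal converter uses; the full-range ITU-R BT.601 coefficients (as in JFIF) and the function name are assumptions made for illustration, and a studio-range MPEG-1 stream would need the corresponding offsets and scale factors instead.

```python
def ycbcr_to_rgb(y, cb, cr):
    """Convert one YCbCr triple to 8-bit RGB (assumed full-range BT.601)."""
    r = y + 1.402 * (cr - 128)
    g = y - 0.344136 * (cb - 128) - 0.714136 * (cr - 128)
    b = y + 1.772 * (cb - 128)

    def clamp(v):
        return max(0, min(255, round(v)))

    return clamp(r), clamp(g), clamp(b)


# Hypothetical per-macroblock DC means (Y, Cb, Cr) read from the three buffers.
dc_means = [(128, 128, 128), (255, 128, 128), (0, 128, 128)]
correction_samples = [ycbcr_to_rgb(*ycc) for ycc in dc_means]
```

Each resulting RGB triple is one low-resolution sample of the frame, so the statistics computed later operate on one value per block rather than one per pixel.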

【００５５】When YCbCr image data is read from the target memories 251, 258, and 265, it is converted to RGB image data by the color signal converter 269 and output from terminal 206.

【００５６】Next, the detailed configuration of the decoder 205 of the first embodiment is described with reference to FIG. 8.

【００５７】FIG. 8 is a block diagram showing the detailed configuration of the decoder of the first embodiment of the present invention.

【００５８】Reference numeral 221 denotes a terminal to which the encoded data from the storage device 116 is input. Reference numeral 301 denotes a code memory that stores the encoded data, and 302 denotes a decoder that decodes the encoded data. Reference numerals 303 and 304 denote demultiplexers; 305, 312, and 319 denote inverse quantizers; 306, 313, and 320 denote inverse DCT transformers; and 307, 314, and 321 denote adders. Reference numerals 308, 309, and 310 denote memories that store the luminance Y data of the image obtained by decoding the encoded data; 315, 316, and 317 denote memories that store its color difference Cb data; and 322, 323, and 324 denote memories that store its color difference Cr data. Reference numerals 311, 318, and 325 denote motion compensators. Reference numerals 326 and 330 denote color signal converters that convert image data from YCbCr format to RGB format. Reference numerals 327, 328, and 329 denote buffers. Terminal 225 outputs the image data in RGB format, and terminal 212 outputs the color cast correction image information.

【００５９】In the above configuration, the encoded data stored in the code memory 301 is decoded by the decoder 302 and then demultiplexed by the demultiplexer 303 into luminance Y data, color difference Cb data, and color difference Cr data. The luminance Y data is input to the inverse quantizer 305, the color difference Cb data to the inverse quantizer 312, and the color difference Cr data to the inverse quantizer 319.

【００６０】The luminance Y data is inverse-quantized by the inverse quantizer 305 and inverse-DCT-transformed by the inverse DCT transformer 306. For an I-frame, the motion compensator 311 does not operate and outputs 0. For a P-frame or B-frame, the motion compensator 311 operates and outputs a motion-compensated prediction value. The adder 307 adds the output of the inverse DCT transformer 306 and the output of the motion compensator 311, and the sum is stored in the memory 308 and in either the memory 309 or 310. In addition, for I-frames only, of the output of the inverse quantizer 305, only the DC component information, which represents the average value of the luminance Y data, is stored in the buffer 329.

【００６１】The color difference Cb data is inverse-quantized by the inverse quantizer 312 and inverse-DCT-transformed by the inverse DCT transformer 313. For an I-frame, the motion compensator 318 does not operate and outputs 0. For a P-frame or B-frame, the motion compensator 318 operates and outputs a motion-compensated prediction value. The adder 314 adds the output of the inverse DCT transformer 313 and the output of the motion compensator 318, and the sum is stored in the memory 315 and in either the memory 316 or 317. In addition, for I-frames only, of the output of the inverse quantizer 312, only the DC component information, which represents the average value of the color difference Cb data, is stored in the buffer 328.

【００６２】The color difference Cr data is inverse-quantized by the inverse quantizer 319 and inverse-DCT-transformed by the inverse DCT transformer 320. For an I-frame, the motion compensator 325 does not operate and outputs 0. For a P-frame or B-frame, the motion compensator 325 operates and outputs a motion-compensated prediction value. The adder 321 adds the output of the inverse DCT transformer 320 and the output of the motion compensator 325, and the sum is stored in the memory 322 and in either the memory 323 or 324. In addition, for I-frames only, of the output of the inverse quantizer 319, only the DC component information, which represents the average value of the color difference Cr data, is stored in the buffer 327.

【００６３】When the processing of a macroblock is finished, the luminance Y DC component information, the color difference Cb DC component information, and the color difference Cr DC component information are read from the buffers 329, 328, and 327, respectively, converted to RGB format by the color signal converter 330, and output from terminal 212 as the color cast correction image information.

【００６４】When YCbCr image data is read from the memories 308, 315, and 322, it is converted to RGB image data by the color signal converter 326 and output from terminal 225.
In the moving image editing apparatus 112 configured as described above, once decoding of one frame is complete and image data has been stored in the target memories 251, 258, and 265 of the target decoder 203, the target memories 251, 258, and 265 of the target decoder 204, and the memories 308, 315, and 322 of the decoder 205, the correction value calculator 213 uses the color cast correction image information to obtain the following correction formulas by the correction formula calculation algorithm described later: the R, G, and B pixel value correction formulas f1R(x), f1G(x), and f1B(x) for the corrector 214; f2R(x), f2G(x), and f2B(x) for the corrector 215; and f3R(x), f3G(x), and f3B(x) for the corrector 216.

【００６５】RGB pixel values are then read from the decoder 205 by raster scan, in pixel order along each scanning line, corrected by the corrector 216, and input to the image synthesizer 217. The corrector 216 applies the correction formulas f3R(x), f3G(x), and f3B(x) to the input R pixel value r, G pixel value g, and B pixel value b according to the following equation, and outputs the corrected R pixel value R, G pixel value G, and B pixel value B.

【００６６】R = f3R(r), G = f3G(g), B = f3B(b) ... (1)
Meanwhile, when the scan position reaches the position at which the target image data of the target decoder 203 is to be composited, the mask information and RGB pixel values are read from the target decoder 203, corrected by the corrector 214, and input to the image synthesizer 217. The corrector 214 applies the correction formulas f1R(x), f1G(x), and f1B(x) to the input R pixel value r, G pixel value g, and B pixel value b according to the following equation, and outputs the corrected R pixel value R, G pixel value G, and B pixel value B.

【００６７】R = f1R(r), G = f1G(g), B = f1B(b) ... (2)
Likewise, when the scan position reaches the position at which the target image data of the target decoder 204 is to be composited, the mask information and RGB pixel values are read from the target decoder 204, corrected by the corrector 215, and input to the image synthesizer 217. The corrector 215 applies the correction formulas f2R(x), f2G(x), and f2B(x) to the input R pixel value r, G pixel value g, and B pixel value b according to the following equation, and outputs the corrected R pixel value R, G pixel value G, and B pixel value B.

【００６８】R = f2R(r), G = f2G(g), B = f2B(b) ... (3)
The image synthesizer 217 composites the image by outputting the pixel value from the corrector 214 when the mask information indicates target image data of the target decoder 203, the pixel value from the corrector 215 when the mask information indicates target image data of the target decoder 204, and the pixel value from the corrector 216 in all other cases. The result is output from terminal 218 to the encoder 113. FIG. 9 shows the result of compositing the background 1060 and person 1061, obtained by correcting the background 1050 and person 1051, with the person 1062, obtained by correcting the person 1052, and the person 1063, obtained by correcting the person 1053. The encoder 113 encodes the output image in MPEG-1, and the encoded data is sent to the communication network 115 via the transmitter 114.
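The per-pixel selection performed by the image synthesizer, together with the per-channel corrections of equations (1) to (3), can be sketched as follows. This is a simplified illustration: the function names, the representation of each correction as a triple of callables, and the use of None for "scan position outside this target's area" are assumptions, not details given in the specification.

```python
def apply_correction(rgb, formulas):
    """Apply per-channel correction formulas (fR, fG, fB) to one RGB pixel."""
    f_r, f_g, f_b = formulas
    r, g, b = rgb
    return f_r(r), f_g(g), f_b(b)


def composite_pixel(background_rgb, target1, target2, f1, f2, f3):
    """Select and correct one output pixel.

    target1 and target2 are (mask_bit, rgb) pairs from the two target
    decoders, or None when the scan position lies outside that target's area.
    """
    if target1 is not None and target1[0]:
        return apply_correction(target1[1], f1)   # equation (2)
    if target2 is not None and target2[0]:
        return apply_correction(target2[1], f2)   # equation (3)
    return apply_correction(background_rgb, f3)   # equation (1)


def identity(x):
    return x


def double_green(g):
    return min(255, 2 * g)


f1 = f2 = (identity, identity, identity)
f3 = (identity, double_green, identity)
bg_only = composite_pixel((10, 20, 30), None, None, f1, f2, f3)
on_target = composite_pixel((10, 20, 30), (1, (1, 2, 3)), None, f1, f2, f3)
masked_out = composite_pixel((10, 20, 30), (0, (1, 2, 3)), None, f1, f2, f3)
```

Checking the first target before the second fixes an arbitrary priority for overlapping targets; the specification does not say how overlaps are resolved.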

【００６９】In the above operation, the correction formula calculation algorithm of the correction value calculator 213 works as follows.

【００７０】The correction formulas f3R(r), f3G(g), and f3B(b) for the corrector 216 are calculated as follows.

【００７１】The human eye is relatively insensitive to blue, so correcting blue has little visible effect. The formula f3B(b) for correcting the B pixel value is therefore set to
f3B(b) = b ... (4)

【００７２】Next, the maximum value RMax1, the mean RE1, and the variance RR1 of the R information in the color cast correction image information 224 from the decoder 205 are obtained.

【００７３】Next, the maximum value GMax1, the mean GE1, and the variance GR1 of the G information in the color cast correction image information 224 from the decoder 205 are obtained.

【００７４】Then, a two-dimensional histogram showing the joint distribution of the R information values and the G information values is calculated.
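The statistics gathered in the three steps above can be sketched in Python as follows. The function names and the sample layout (a list of RGB triples, one per block) are hypothetical, and the use of the population variance is an assumption, since the text says only "variance".

```python
from collections import Counter


def channel_stats(values):
    """Return (maximum, mean, population variance) of one color channel."""
    n = len(values)
    mean = sum(values) / n
    variance = sum((v - mean) ** 2 for v in values) / n
    return max(values), mean, variance


def rg_histogram(rgb_samples):
    """Two-dimensional histogram: number of samples at each (R, G) pair."""
    return Counter((r, g) for r, g, _ in rgb_samples)


samples = [(10, 20, 0), (30, 20, 0), (10, 20, 5)]
r_max, r_mean, r_var = channel_stats([r for r, _, _ in samples])
g_max, g_mean, g_var = channel_stats([g for _, g, _ in samples])
histogram = rg_histogram(samples)
```

A sparse `Counter` is used instead of a dense 256 × 256 array because only one sample per block is available, so most bins stay empty.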

【００７５】When |RE1 − GE1| is at or below a certain threshold and |RR1 − GR1| is at or below a certain threshold, the following cases apply. If RMax1 ≥ GMax1 and, within the square region of the two-dimensional histogram whose diagonal runs from (RMax1, RMax1) to (GMax1 − T, GMax1 − T), there is a significant bias toward the R-axis side, then
f3R(r) = r, f3G(g) = g × RMax1 / GMax1 ... (5)

【００７６】If GMax1 ≥ RMax1 and, within the square region of the two-dimensional histogram whose diagonal runs from (GMax1, GMax1) to (RMax1 − T, RMax1 − T), there is a significant bias toward the G-axis side, then
f3G(g) = g, f3R(r) = r × GMax1 / RMax1 ... (6)

【００７７】If neither of the above cases applies, then
f3R(r) = r, f3G(g) = g ... (7)
Here, T is a positive number.

【００７８】Otherwise, that is, when the threshold conditions on |RE1 − GE1| and |RR1 − GR1| are not satisfied, the formula f3R(r) for correcting the R pixel value and the formula f3G(g) for correcting the G pixel value are set to
f3R(r) = r, f3G(g) = g ... (8)

【００７９】This completes the calculation of the correction formulas f3R(r), f3G(g), and f3B(b).
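The decision procedure of equations (4) to (8) might be sketched as below. Several points are assumptions made only for illustration: the specification does not define "significant bias", so the sketch counts samples inside the square region on each side of the R = G diagonal and compares them with a hypothetical `bias_ratio`; the two thresholds are passed as separate parameters `mean_eps` and `var_eps`; and each correction formula is represented by a gain applied to the pixel value. The B channel is left unchanged, as in equation (4).

```python
def derive_rg_gains(samples, mean_eps, var_eps, T, bias_ratio=2.0):
    """Return (gain_r, gain_g) such that fR(r) = gain_r * r and fG(g) = gain_g * g."""
    rs = [r for r, _, _ in samples]
    gs = [g for _, g, _ in samples]
    n = len(samples)
    r_max, g_max = max(rs), max(gs)
    r_mean, g_mean = sum(rs) / n, sum(gs) / n
    r_var = sum((r - r_mean) ** 2 for r in rs) / n
    g_var = sum((g - g_mean) ** 2 for g in gs) / n

    # Equation (8): the channel statistics differ too much, so no correction.
    if abs(r_mean - g_mean) > mean_eps or abs(r_var - g_var) > var_eps:
        return 1.0, 1.0

    def biased(hi, lo, toward_r):
        # Samples inside the square [lo, hi] x [lo, hi] of the (R, G) histogram.
        inside = [(r, g) for r, g in zip(rs, gs)
                  if lo <= r <= hi and lo <= g <= hi and r != g]
        toward = sum(1 for r, g in inside if (r > g) == toward_r)
        away = len(inside) - toward
        return toward > 0 and toward >= bias_ratio * away  # assumed criterion

    if r_max >= g_max and g_max > 0 and biased(r_max, g_max - T, True):
        return 1.0, r_max / g_max   # equation (5)
    if g_max >= r_max and r_max > 0 and biased(g_max, r_max - T, False):
        return g_max / r_max, 1.0   # equation (6)
    return 1.0, 1.0                 # equation (7)


reddish = [(200, 180, 0), (190, 175, 0), (100, 110, 0), (50, 75, 0)]
gains_reddish = derive_rg_gains(reddish, mean_eps=5, var_eps=2000, T=10)
gains_neutral = derive_rg_gains([(100, 100, 0), (50, 50, 0)], 5, 2000, 10)
gains_mismatch = derive_rg_gains([(200, 10, 0)], 5, 2000, 10)
```

In the reddish example the brightest samples lie on the R side of the diagonal while the means agree, so the G channel is scaled up by RMax1 / GMax1, stretching G until its maximum matches the R maximum.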

【００８０】The correction formulas f1R(r), f1G(g), and f1B(b) for the corrector 214 and f2R(r), f2G(g), and f2B(b) for the corrector 215 are calculated in the same way.

【００８１】As described above, according to the first embodiment, when a background image and target images that were separated and encoded individually are composited, features of each set of image data are extracted and the pixel values of the target images to be composited are corrected, so that a natural-looking composite image is obtained. In addition, using the DC component of each block to calculate the correction values enables high-speed processing.

【００８２】Although the first embodiment was described using MPEG-4 to encode the target images and MPEG-1 for the other encoding, the invention is not limited to these; any scheme that provides equivalent functions may be used.

【００８３】The memory configuration is likewise not limited to the one described; processing may of course be performed with line memories or the like, or with other configurations.

【００８４】Some or all of the components may of course be implemented by software running on a CPU or the like.
<Embodiment 2>
The second embodiment modifies the target decoders 203 and 204, the decoder 205, and the correction value calculator 213 of the first embodiment. Parts that overlap with the first embodiment are therefore not described again; only the modified parts are described.

【００８５】The moving image transmission system uses the configuration of FIG. 1, as in the first embodiment. The moving image editing apparatus 112 uses the configuration of FIG. 6, as in the first embodiment.

【００８６】Next, the detailed configuration of the target decoders 203 and 204 of the second embodiment is described with reference to FIG. 10. FIG. 10 is described as the detailed configuration of the target decoder 203; the target decoder 204 has the same configuration, so its description is omitted here.

【００８７】FIG. 10 is a block diagram showing the detailed configuration of the target decoder of the second embodiment of the present invention.

【００８８】Reference numeral 219 denotes a terminal to which the encoded data from the receiver 110 is input. Reference numeral 401 denotes a separator that separates the encoded data into the encoded data of the mask information and the encoded data of the target image area. Reference numeral 402 denotes a mask decoder that decodes the mask information. Reference numeral 403 denotes a mask memory that stores the mask information; the mask information in the mask memory 403 is output from terminal 207. Reference numeral 404 denotes a code memory that stores the encoded data of the target image area, and 405 denotes a decoder that decodes it. Reference numerals 406 and 407 denote demultiplexers; 408, 415, and 422 denote inverse quantizers; and 409, 416, and 423 denote high-speed inverse DCT transformers.

【００８９】The detailed configuration of the high-speed inverse DCT transformers 409, 416, and 423 of the second embodiment is now described with reference to FIG. 11.

【００９０】FIG. 11 is a block diagram showing the detailed configuration of the high-speed inverse DCT transformer of the second embodiment of the present invention.

【００９１】In FIG. 11, in addition to the path of the ordinary radix butterfly computation, the outputs of the radix butterfly units 1101 to 1104 have a path by which the output of each stage is multiplexed by the multiplexer 1105 and output. Note that only the DC component is input to the multiplexer 1105 from before the first-stage radix butterfly unit 1101. The radix butterfly result for the 2 × 2 low-frequency components is input to the multiplexer 1105 from after the second-stage radix butterfly unit 1102, and the result for the 4 × 4 low-frequency components is input from after the third-stage radix butterfly unit 1103. The 8 × 8 inverse DCT result is input to the multiplexer 1105 from after the fourth-stage radix butterfly unit 1104.
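What the four tapped outputs represent can be illustrated with a direct computation. The sketch below is not the radix butterfly network of FIG. 11: it uses a plain separable inverse DCT, and it assumes that the 1 × 1, 2 × 2, and 4 × 4 outputs are reduced-resolution versions of the block reconstructed from the top-left low-frequency coefficients of an orthonormal 8 × 8 DCT. The function names and the k/8 rescaling are part of that assumption.

```python
import math


def idct_1d(coeffs):
    """Orthonormal inverse DCT (DCT-III) of a length-k sequence."""
    k = len(coeffs)
    out = []
    for n in range(k):
        s = coeffs[0] / math.sqrt(k)
        for u in range(1, k):
            s += (math.sqrt(2 / k) * coeffs[u]
                  * math.cos(math.pi * (2 * n + 1) * u / (2 * k)))
        out.append(s)
    return out


def low_frequency_image(coeffs8x8, k):
    """k x k approximation (k = 1, 2, 4 or 8) from the top-left k x k coefficients."""
    scale = k / 8  # keeps the mean level when the transform size shrinks
    sub = [[coeffs8x8[u][v] * scale for v in range(k)] for u in range(k)]
    rows = [idct_1d(row) for row in sub]                       # horizontal pass
    cols = [idct_1d([rows[u][n] for u in range(k)]) for n in range(k)]  # vertical pass
    return [[cols[n][m] for n in range(k)] for m in range(k)]


# A flat block of value 100 has DC = 8 * 100 and no other coefficients.
flat_coeffs = [[0.0] * 8 for _ in range(8)]
flat_coeffs[0][0] = 800.0
staged = {k: low_frequency_image(flat_coeffs, k) for k in (1, 2, 4, 8)}
```

For the flat block every stage returns the value 100, which shows why the earliest tap (the DC term alone) already yields the block mean used for the color cast statistics.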

【００９２】The description now returns to FIG. 10.

【００９３】４１０、４１７、４２４は加算器である。
４１１、４１２、４１３は再生した対象画像の領域の輝
度Ｙデータを格納する対象メモリである。４１８、４１
９、４２０は再生した対象の領域画像の色差Ｃｂデータ
を格納する対象メモリである。４２５、４２６、４２７
は再生した対象画像の領域の色差Ｃｒデータを格納する
対象メモリである。４１４、４２１、４２８は動き補償
器である。４２９、４３３はＹＣｂＣｒ形式の画像デー
タからＲＧＢ形式の画像データへの色信号変換を行う色
信号変換器である。４３０、４３１、４３２はバッファ
である。２０６は端子であり、ＲＧＢ形式の画像データ
が出力される。２０８は端子であり、色かぶり補正用画
像情報が出力される。Reference numerals 410, 417, and 424 denote adders. Reference numerals 411, 412, and 413 denote target memories for storing the luminance Y data of the reproduced target image area. Reference numerals 418, 419, and 420 denote target memories for storing the color difference Cb data of the reproduced target image area. Reference numerals 425, 426, and 427 denote target memories for storing the color difference Cr data of the reproduced target image area. Reference numerals 414, 421, and 428 denote motion compensators. Reference numerals 429 and 433 denote color signal converters that convert image data in YCbCr format into image data in RGB format. Reference numerals 430, 431, and 432 denote buffers. Reference numeral 206 denotes a terminal from which image data in RGB format is output. Reference numeral 208 denotes a terminal from which color fog correction image information is output.

【００９４】上記構成において、入力された符号データ
からマスク情報の符号化データと対象画像の領域の符号
化データを分離器４０１で分離し、それぞれマスク復号
器４０２と符号メモリ４０４に入力する。マスク復号器
４０２は符号化データを復号してマスク情報を再生し、
マスクメモリ４０３に格納する。符号メモリ４０４に格
納された符号化データは復号器４０５で復号され、量子
化された値が再生された後、デマルチプレクサ４０７で
輝度Ｙデータ、色差Ｃｂデータ、色差Ｃｒデータにデマ
ルチプレクスされる。輝度Ｙデータは逆量子化器４０８
に、色差Ｃｂデータは逆量子化器４１５に、色差Ｃｒデ
ータは逆量子化器４２２に入力される。In the above configuration, the separator 401 separates the input encoded data into the encoded data of the mask information and the encoded data of the target image area, and inputs them to the mask decoder 402 and the code memory 404, respectively. The mask decoder 402 decodes the encoded data to reproduce the mask information and stores it in the mask memory 403. The encoded data stored in the code memory 404 is decoded by the decoder 405 to reproduce the quantized values, which are then demultiplexed by the demultiplexer 407 into luminance Y data, color difference Cb data, and color difference Cr data. The luminance Y data is input to the inverse quantizer 408, the color difference Cb data to the inverse quantizer 415, and the color difference Cr data to the inverse quantizer 422.

【００９５】輝度Ｙデータは逆量子化器４０８で逆量子
化され、高速逆ＤＣＴ変換器４０９でラディックスバタ
フライ演算により逆ＤＣＴ変換される。Ｉ−フレームの
時は動き補償器４１４は動作せず、０を出力する。Ｐ−
フレームとＢ−フレームの時は動き補償器４１４は動作
し、動き補償予測値を出力する。加算器４１０は高速逆
ＤＣＴ変換器４０９の出力と動き補償器４１４の出力を
加算し、対象メモリ４１１および対象メモリ４１２また
は４１３に格納する。一方、Ｉ−フレームの時のみに限
り、高速逆ＤＣＴ変換器４０９からｎ段目のラディック
スバタフライ演算結果がマルチプレクサされたのち出力
され、輝度Ｙデータの低周波成分のみからなる画像デー
タがバッファ４３２に格納される。The luminance Y data is inversely quantized by the inverse quantizer 408 and inverse DCT-transformed by the high-speed inverse DCT converter 409 using the radix butterfly operation. For an I-frame, the motion compensator 414 does not operate and outputs 0. For a P-frame or B-frame, the motion compensator 414 operates and outputs a motion-compensated prediction value. The adder 410 adds the output of the high-speed inverse DCT converter 409 and the output of the motion compensator 414, and stores the sum in the target memory 411 and in the target memory 412 or 413. Separately, for I-frames only, the n-th stage radix butterfly result is output from the high-speed inverse DCT converter 409 through the multiplexer, and image data consisting only of the low-frequency components of the luminance Y data is stored in the buffer 432.
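The adder and motion compensator described above can be sketched as follows. This is a minimal Python illustration under stated assumptions: the prediction uses a single reference frame and an integer-pel motion vector, and the names `predict_block` and `reconstruct_block` are hypothetical. The text itself only states that the compensator outputs 0 for I-frames and a prediction value for P- and B-frames.

```python
def predict_block(frame_type, reference, top, left, motion_vector, size=8):
    """Return the motion-compensated prediction for one block.

    For I-frames the compensator is idle and contributes zeros.
    For P/B-frames this sketch copies a displaced block from one
    reference frame (assumed simplification: integer-pel, one reference).
    """
    if frame_type == "I":
        return [[0] * size for _ in range(size)]
    dy, dx = motion_vector
    return [row[left + dx:left + dx + size]
            for row in reference[top + dy:top + dy + size]]


def reconstruct_block(residual, prediction):
    """Adder: inverse DCT output plus prediction, element by element."""
    return [[r + p for r, p in zip(res_row, pred_row)]
            for res_row, pred_row in zip(residual, prediction)]
```

With an I-frame the reconstructed block equals the inverse DCT output, which matches the statement that the compensator outputs 0 in that case.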

【００９６】色差Ｃｂデータは逆量子化器４１５で逆量
子化され、高速逆ＤＣＴ変換器４１６でラディックスバ
タフライ演算により逆ＤＣＴ変換される。Ｉ−フレーム
の時は動き補償器４２１は動作せず、０を出力する。Ｐ
−フレームとＢ−フレームの時は動き補償器４２１は動
作し、動き補償予測値を出力する。加算器４１７は高速
逆ＤＣＴ変換器４１６の出力と動き補償器４２１の出力
を加算し、対象メモリ４１８および対象メモリ４１９ま
たは４２０に格納する。一方、Ｉ−フレームの時のみに
限り、高速逆ＤＣＴ変換器４１６からｎ段目のラディッ
クスバタフライ演算結果がマルチプレクサされたのち出
力され色差Ｃｂデータの低周波成分のみからなる画像デ
ータがバッファ４３１に格納される。The color difference Cb data is inversely quantized by the inverse quantizer 415 and inverse DCT-transformed by the high-speed inverse DCT converter 416 using the radix butterfly operation. For an I-frame, the motion compensator 421 does not operate and outputs 0. For a P-frame or B-frame, the motion compensator 421 operates and outputs a motion-compensated prediction value. The adder 417 adds the output of the high-speed inverse DCT converter 416 and the output of the motion compensator 421, and stores the sum in the target memory 418 and in the target memory 419 or 420. Separately, for I-frames only, the n-th stage radix butterfly result is output from the high-speed inverse DCT converter 416 through the multiplexer, and image data consisting only of the low-frequency components of the color difference Cb data is stored in the buffer 431.

【００９７】色差Ｃｒデータは逆量子化器４２２で逆量
子化され、高速逆ＤＣＴ変換器４２３でラディックスバ
タフライ演算により逆ＤＣＴ変換される。Ｉ−フレーム
の時は動き補償器４２８は動作せず、０を出力する。Ｐ
−フレームとＢ−フレームの時は動き補償器４２８は動
作し、動き補償予測値を出力する。加算器４２４は高速
逆ＤＣＴ変換器４２３の出力と動き補償器４２８の出力
を加算し、対象メモリ４２５および対象メモリ４２６ま
たは４２７に格納する。一方、Ｉ−フレームの時のみに
限り、高速逆ＤＣＴ変換器４２３からｎ段目のラディッ
クスバタフライ演算結果がマルチプレクサされたのち出
力され色差Ｃｒデータの低周波成分のみからなる画像デ
ータがバッファ４３０に格納される。The color difference Cr data is inversely quantized by the inverse quantizer 422 and inverse DCT-transformed by the high-speed inverse DCT converter 423 using the radix butterfly operation. For an I-frame, the motion compensator 428 does not operate and outputs 0. For a P-frame or B-frame, the motion compensator 428 operates and outputs a motion-compensated prediction value. The adder 424 adds the output of the high-speed inverse DCT converter 423 and the output of the motion compensator 428, and stores the sum in the target memory 425 and in the target memory 426 or 427. Separately, for I-frames only, the n-th stage radix butterfly result is output from the high-speed inverse DCT converter 423 through the multiplexer, and image data consisting only of the low-frequency components of the color difference Cr data is stored in the buffer 430.

【００９８】マクロブロックの処理が終了した時、バッ
ファ４３０、４３１、４３２より輝度Ｙデータ、色差Ｃ
ｂデータ、色差Ｃｒデータが読み出され、色信号変換器
２７３でＲＧＢ形式に変換された上で、色かぶり補正用
画像情報として端子２０８から出力される。When processing of a macroblock is completed, the luminance Y data, color difference Cb data, and color difference Cr data are read from the buffers 430, 431, and 432, converted into RGB format by the color signal converter 433, and output from the terminal 208 as color fog correction image information.

【００９９】また、対象メモリ４１１、４１８、４２５
よりＹＣｂＣｒ形式の画像データが読み出される際に
は、色信号変換器４２９でＲＧＢ形式の画像データに変
換された上で端子２０６より出力される。When image data in YCbCr format is read from the target memories 411, 418, and 425, it is converted into image data in RGB format by the color signal converter 429 and then output from the terminal 206.
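The text does not state which conversion matrix the color signal converters use. As an illustration only, the following Python sketch assumes 8-bit full-range YCbCr with the ITU-R BT.601 coefficients as used in JFIF; a studio-range (16 to 235) variant would need different offsets and scaling. The function name `ycbcr_to_rgb` is hypothetical.

```python
def ycbcr_to_rgb(y, cb, cr):
    """Convert one 8-bit YCbCr sample to 8-bit RGB.

    Assumption: full-range BT.601 (JFIF) coefficients; the source text
    does not specify the matrix used by the color signal converter.
    """
    r = y + 1.402 * (cr - 128)
    g = y - 0.344136 * (cb - 128) - 0.714136 * (cr - 128)
    b = y + 1.772 * (cb - 128)

    def clamp(v):
        # Round to the nearest integer and limit to the 8-bit range.
        return max(0, min(255, int(round(v))))

    return clamp(r), clamp(g), clamp(b)
```

Neutral chroma (Cb = Cr = 128) leaves the luminance unchanged in all three channels, so gray input stays gray.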

【０１００】次に、実施形態２の復号器２０５の詳細構
成について、図１２を用いて説明する。Next, the detailed configuration of the decoder 205 according to the second embodiment will be described with reference to FIG. 12.

【０１０１】図１２は本発明の実施形態２の復号器の詳
細構成を示すブロック図である。FIG. 12 is a block diagram showing a detailed configuration of the decoder according to the second embodiment of the present invention.

【０１０２】２０２は端子であり、記憶装置１１６から
の符号化データが入力される。４５２は符号化データを
格納する符号メモリである。４５３は符号化データを復
号する復号器である。４５４、４５５はデマルチプレク
サである。４５６、４６３、４７０は逆量子化器であ
る。４５７、４６４、４７１は高速逆ＤＣＴ変換器であ
る。尚、高速逆ＤＣＴ変換器４５７、４６４、４７１の
詳細構成は図１１と同様に構成される。４５８、４６
５、４７２は加算器である。４５９、４６０、４６１は
符号化データを復号して得られた画像の輝度Ｙデータを
格納するメモリである。４６６、４６７、４６８は符号
化データを復号して得られた画像の色差Ｃｂデータを格
納するメモリである。４７３、４７４、４７５は符号化
データを復号して得られた画像の色差Ｃｒデータを格納
するメモリである。４６２、４６９、４７６は動き補償
器である。４７７、４８１はＹＣｂＣｒ形式の画像デー
タからＲＧＢ形式の画像データへの色信号変換を行う色
信号変換器である。４７８、４７９、４８０はバッファ
である。２２５は端子であり、ＲＧＢ形式の画像データ
が出力される。２１２は端子であり、色かぶり補正用画
像情報が出力される。Reference numeral 202 denotes a terminal to which encoded data from the storage device 116 is input. Reference numeral 452 denotes a code memory for storing the encoded data. Reference numeral 453 denotes a decoder for decoding the encoded data. Reference numerals 454 and 455 denote demultiplexers. Reference numerals 456, 463, and 470 denote inverse quantizers. Reference numerals 457, 464, and 471 denote high-speed inverse DCT converters, whose detailed configuration is the same as that shown in FIG. 11. Reference numerals 458, 465, and 472 denote adders. Reference numerals 459, 460, and 461 denote memories for storing the luminance Y data of the image obtained by decoding the encoded data. Reference numerals 466, 467, and 468 denote memories for storing the color difference Cb data of the image obtained by decoding the encoded data. Reference numerals 473, 474, and 475 denote memories for storing the color difference Cr data of the image obtained by decoding the encoded data. Reference numerals 462, 469, and 476 denote motion compensators. Reference numerals 477 and 481 denote color signal converters that convert image data in YCbCr format into image data in RGB format. Reference numerals 478, 479, and 480 denote buffers. Reference numeral 225 denotes a terminal from which image data in RGB format is output. Reference numeral 212 denotes a terminal from which color fog correction image information is output.

【０１０３】上記構成において、符号メモリ４５２に格
納された符号化データは復号器４５３で復号され量子化
された値が再生された後、デマルチプレクサ４５５で輝
度Ｙデータ、色差Ｃｂデータ、色差Ｃｒデータにデマル
チプレクスされる。輝度Ｙデータは逆量子化器４５６
に、色差Ｃｂデータは逆量子化器４６３に、色差Ｃｒデ
ータは逆量子化器４７０に入力される。In the above configuration, the encoded data stored in the code memory 452 is decoded by the decoder 453 to reproduce the quantized values, which are then demultiplexed by the demultiplexer 455 into luminance Y data, color difference Cb data, and color difference Cr data. The luminance Y data is input to the inverse quantizer 456, the color difference Cb data to the inverse quantizer 463, and the color difference Cr data to the inverse quantizer 470.

【０１０４】輝度Ｙデータは逆量子化器４５６で逆量子
化され、高速逆ＤＣＴ変換器４５７でラディックスバタ
フライ演算により逆ＤＣＴ変換される。Ｉ−フレームの
時は動き補償器４６２は動作せず、０を出力する。Ｐ−
フレームとＢ−フレームの時は動き補償器４６２は動作
し、動き補償予測値を出力する。加算器４５８は高速逆
ＤＣＴ変換器４５７の出力と動き補償器４６２の出力を
加算し、メモリ４５９およびメモリ４６０または４６１
に格納する。一方、Ｉ−フレームの時のみに限り、高速
逆ＤＣＴ変換器４５７からｎ段目のラディックスバタフ
ライ演算結果がマルチプレクサされたのち出力され、輝
度Ｙデータの低周波成分のみからなる画像データがバッ
ファ４８０に格納される。The luminance Y data is inversely quantized by the inverse quantizer 456 and inverse DCT-transformed by the high-speed inverse DCT converter 457 using the radix butterfly operation. For an I-frame, the motion compensator 462 does not operate and outputs 0. For a P-frame or B-frame, the motion compensator 462 operates and outputs a motion-compensated prediction value. The adder 458 adds the output of the high-speed inverse DCT converter 457 and the output of the motion compensator 462, and stores the sum in the memory 459 and in the memory 460 or 461. Separately, for I-frames only, the n-th stage radix butterfly result is output from the high-speed inverse DCT converter 457 through the multiplexer, and image data consisting only of the low-frequency components of the luminance Y data is stored in the buffer 480.

【０１０５】色差Ｃｒデータは逆量子化器４６３で逆量
子化され、高速逆ＤＣＴ変換器４６４でラディックスバ
タフライ演算により逆ＤＣＴ変換される。Ｉ−フレーム
の時は動き補償器４６９は動作せず、０を出力する。Ｐ
−フレームとＢ−フレームの時は動き補償器４６９は動
作し、動き補償予測値を出力する。加算器４６５は高速
逆ＤＣＴ変換器４６４の出力と動き補償器４６９の出力
を加算し、メモリ４６６およびメモリ４６７または４６
８に格納する。一方、Ｉ−フレームの時のみに限り、高
速逆ＤＣＴ変換器４６４からｎ段目のラディックスバタ
フライ演算結果がマルチプレクサされたのち出力され、
色差Ｃｒデータの低周波成分のみからなる画像データが
バッファ４７９に格納される。The color difference Cb data is inversely quantized by the inverse quantizer 463 and inverse DCT-transformed by the high-speed inverse DCT converter 464 using the radix butterfly operation. For an I-frame, the motion compensator 469 does not operate and outputs 0. For a P-frame or B-frame, the motion compensator 469 operates and outputs a motion-compensated prediction value. The adder 465 adds the output of the high-speed inverse DCT converter 464 and the output of the motion compensator 469, and stores the sum in the memory 466 and in the memory 467 or 468. Separately, for I-frames only, the n-th stage radix butterfly result is output from the high-speed inverse DCT converter 464 through the multiplexer, and image data consisting only of the low-frequency components of the color difference Cb data is stored in the buffer 479.

【０１０６】色差Ｃｂデータは逆量子化器４７０で逆量
子化され、高速逆ＤＣＴ変換器４７１でラディックスバ
タフライ演算により逆ＤＣＴ変換される。Ｉ−フレーム
の時は動き補償器４７６は動作せず、０を出力する。Ｐ
−フレームとＢ−フレームの時は動き補償器４７６は動
作し、動き補償予測値を出力する。加算器４７２は高速
逆ＤＣＴ変換器４７１の出力と動き補償器４７６の出力
を加算し、メモリ４７３およびメモリ４７４または４７
５に格納する。一方、Ｉ−フレームの時のみに限り、高
速逆ＤＣＴ変換器４７１からｎ段目のラディックスバタ
フライ演算結果がマルチプレクサされたのち出力され、
色差Ｃｂデータの低周波成分のみからなる画像データが
バッファ４７８に格納される。The color difference Cr data is inversely quantized by the inverse quantizer 470 and inverse DCT-transformed by the high-speed inverse DCT converter 471 using the radix butterfly operation. For an I-frame, the motion compensator 476 does not operate and outputs 0. For a P-frame or B-frame, the motion compensator 476 operates and outputs a motion-compensated prediction value. The adder 472 adds the output of the high-speed inverse DCT converter 471 and the output of the motion compensator 476, and stores the sum in the memory 473 and in the memory 474 or 475. Separately, for I-frames only, the n-th stage radix butterfly result is output from the high-speed inverse DCT converter 471 through the multiplexer, and image data consisting only of the low-frequency components of the color difference Cr data is stored in the buffer 478.

【０１０７】マクロブロックの処理が終了した時、バッ
ファ４７８、４７９、４８０より輝度Ｙデータ、色差Ｃ
ｂデータ、色差Ｃｒデータが読み出され、色信号変換器
４８１でＲＧＢ形式に変換された上で、色かぶり補正用
画像情報として端子２１２から出力される。When processing of a macroblock is completed, the luminance Y data, color difference Cb data, and color difference Cr data are read from the buffers 478, 479, and 480, converted into RGB format by the color signal converter 481, and output from the terminal 212 as color fog correction image information.

【０１０８】また、メモリ４５９、４６６、４７３より
ＹＣｂＣｒ形式の画像データが読み出される際には、色
信号変換器４２９でＲＧＢ形式の画像データに変換され
た上で端子２２５より出力される。When image data in YCbCr format is read from the memories 459, 466, and 473, it is converted into image data in RGB format by the color signal converter 477 and then output from the terminal 225.

【０１０９】以上説明した動画像編集装置１１２の構成
において、１フレーム分の復号が終了し、対象符号器２
０３内の対象メモリ４１１、４１８、４２５と、対象符
号器２０４内の対象メモリ４１１、４１８、４２５と、
符号器２０５内のメモリ４５９、４６６、４７３に画像
データが格納されたら、補正値算出器２１３は、色かぶ
り補正用画像情報を用いて後述の補正式算出アルゴリズ
ムから、以下の補正式を求める。つまり、補正器２１４
用Ｒ画素値補正式ｆ1R(ｘ)、補正器２１４用Ｇ画素値補
正式ｆ1G(ｘ)、補正器２１４用Ｂ画素値補正式ｆ1B(ｘ)
と、補正器２１５用Ｒ画素値補正式ｆ2R(ｘ)、補正器２
１５用Ｇ画素値補正式ｆ2G(ｘ)、補正器２１５用Ｂ画素
値補正式ｆ2B(ｘ)と、補正器２１６用Ｒ画素値補正式ｆ
3R(ｘ)、補正器２１６用Ｇ画素値補正式ｆ3G(ｘ)、補正
器２１６用Ｂ画素値補正式ｆ3B(ｘ)とを求める。In the configuration of the moving picture editing apparatus 112 described above, when decoding of one frame is completed and image data has been stored in the target memories 411, 418, and 425 in the target decoder 203, the target memories 411, 418, and 425 in the target decoder 204, and the memories 459, 466, and 473 in the decoder 205, the correction value calculator 213 uses the color fog correction image information to obtain the following correction formulas by the correction formula calculation algorithm described later: the R, G, and B pixel value correction formulas f1R(x), f1G(x), and f1B(x) for the corrector 214; the R, G, and B pixel value correction formulas f2R(x), f2G(x), and f2B(x) for the corrector 215; and the R, G, and B pixel value correction formulas f3R(x), f3G(x), and f3B(x) for the corrector 216.

【０１１０】その後、復号器２０５から走査線の画素順
にラスタスキャンにてＲＧＢ画素値を読み出し、補正器
２１６で補正を行い、画像合成器２１７に入力する。補
正器２１６では入力されたＲ画素値ｒ、Ｇ画素値ｇ、Ｂ
画素値ｂに対して補正式ｆ3R(ｘ)，ｆ3G(ｘ)，ｆ3B(ｘ)
による補正を次式にしたがって行い、補正されたＲ画素
値Ｒ、Ｇ画素値Ｇ、Ｂ画素値Ｂを求め、出力する。After that, the RGB pixel values are read from the decoder 205 in raster-scan order along the scanning lines, corrected by the corrector 216, and input to the image synthesizer 217. The corrector 216 applies the correction formulas f3R(x), f3G(x), and f3B(x) to the input R pixel value r, G pixel value g, and B pixel value b according to the following equation, and obtains and outputs the corrected R pixel value R, G pixel value G, and B pixel value B.

【０１１１】Ｒ＝ｆ3R(ｒ)，Ｇ＝ｆ3G(ｇ)，Ｂ＝ｆ3b(ｂ) …（９）一方、スキャン位置が対象復号器２０３の対象画像デー
タを合成する位置に到達したら、対象復号器２０３から
マスク情報とＲＧＢ画素値を読み出し、補正器２１４で
補正を行い、画像合成器２１７に入力する。補正器２１
４では、入力されたＲ画素値ｒ、Ｇ画素値ｇ、Ｂ画素値
ｂに対して補正式ｆ1R(ｘ)、ｆ1G(ｘ)、ｆ1B(ｘ)による
補正を次式にしたがって行い、補正されたＲ画素値Ｒ、
Ｇ画素値Ｇ、Ｂ画素値Ｂを求め、出力する。
\[ R = f_{3R}(r), \quad G = f_{3G}(g), \quad B = f_{3B}(b) \tag{9} \]
On the other hand, when the scan position reaches the position at which the target image data of the target decoder 203 is to be composited, the mask information and the RGB pixel values are read from the target decoder 203, corrected by the corrector 214, and input to the image synthesizer 217. The corrector 214 applies the correction formulas f1R(x), f1G(x), and f1B(x) to the input R pixel value r, G pixel value g, and B pixel value b according to the following equation, and obtains and outputs the corrected R pixel value R, G pixel value G, and B pixel value B.

【０１１２】Ｒ＝ｆ1R(ｒ)，Ｇ＝ｆ1G(ｇ)，Ｂ＝ｆ1b(ｂ) …（１０）また、スキャン位置が対象復号器２０４の対象画像デー
タを合成する位置に到達したら、対象復号器２０４から
マスク情報とＲＧＢ画素値を読み出し、補正器２１５で
補正を行い、画像合成器２１７に入力する。補正器２１
４では、入力されたＲ画素値ｒ、Ｇ画素値ｇ、Ｂ画素値
ｂに対して補正式ｆ2R（ｘ）、ｆ2G(ｘ)、ｆ2B(ｘ)によ
る補正を次式にしたがって行い、補正されたＲ画素値
Ｒ、Ｇ画素値Ｇ、Ｂ画素値Ｂを求め、出力する。
\[ R = f_{1R}(r), \quad G = f_{1G}(g), \quad B = f_{1B}(b) \tag{10} \]
Likewise, when the scan position reaches the position at which the target image data of the target decoder 204 is to be composited, the mask information and the RGB pixel values are read from the target decoder 204, corrected by the corrector 215, and input to the image synthesizer 217. The corrector 215 applies the correction formulas f2R(x), f2G(x), and f2B(x) to the input R pixel value r, G pixel value g, and B pixel value b according to the following equation, and obtains and outputs the corrected R pixel value R, G pixel value G, and B pixel value B.

【０１１３】Ｒ＝ｆ2R(ｒ)，Ｇ＝ｆ2G(ｇ)，Ｂ＝ｆ2b(ｂ) …（１１）画像合成器２１７は、マスク情報が対象復号器２０３の
対象画像データを示している場合は補正器２１４からの
画素値を、マスク情報が対象復号器２０４の対象画像デ
ータを示している場合は補正器２１５からの画素値を、
これらのいずれにもあたらない場合は補正器２１６から
の画素値を出力することで、画像の合成を行い、端子２
１８から符号化器１１３に出力する。図９に、背景１０
５０と人１０５１とを補正して得られた画像である背景
１１６０と人１０６１と、人１０５２を補正して得られ
た画像である人１０６２と、人１０５３を補正して得ら
れた画像である人１０６３とを合成した様子を示す。符
号化器１１３は出力された画像をＭＰＥＧ−１の符号化
方式で符号化し、送信器１１４を介して通信網１１５に
送出される。
\[ R = f_{2R}(r), \quad G = f_{2G}(g), \quad B = f_{2B}(b) \tag{11} \]
The image synthesizer 217 composites the image by outputting the pixel value from the corrector 214 when the mask information indicates the target image data of the target decoder 203, the pixel value from the corrector 215 when the mask information indicates the target image data of the target decoder 204, and the pixel value from the corrector 216 in all other cases, and outputs the result from the terminal 218 to the encoder 113. FIG. 9 shows the result of compositing the background 1160 and the person 1061, which are images obtained by correcting the background 1050 and the person 1051, the person 1062, which is an image obtained by correcting the person 1052, and the person 1063, which is an image obtained by correcting the person 1053. The encoder 113 encodes the output image using the MPEG-1 coding scheme, and the encoded data is sent to the communication network 115 via the transmitter 114.
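The selection rule of the image synthesizer 217 can be sketched per scanline as follows. This Python sketch rests on assumptions: the two masks are merged into one label per pixel, pixels are (r, g, b) tuples, and each corrector is a triple of per-channel functions. The names `composite_scanline`, `apply_correction`, and the label constants are hypothetical.

```python
BACKGROUND, OBJECT_203, OBJECT_204 = 0, 1, 2  # hypothetical per-pixel labels


def apply_correction(pixel, f_r, f_g, f_b):
    """Apply per-channel correction formulas to one (r, g, b) pixel."""
    r, g, b = pixel
    return (f_r(r), f_g(g), f_b(b))


def composite_scanline(labels, bg, obj203, obj204, corr214, corr215, corr216):
    """Pick, for each pixel, the corrected value selected by the mask label.

    corr214 / corr215 / corr216 are (f_r, f_g, f_b) triples standing in
    for the correctors 214, 215, and 216.
    """
    out = []
    for i, label in enumerate(labels):
        if label == OBJECT_203:
            out.append(apply_correction(obj203[i], *corr214))
        elif label == OBJECT_204:
            out.append(apply_correction(obj204[i], *corr215))
        else:
            out.append(apply_correction(bg[i], *corr216))
    return out
```

In this sketch only the selected source is corrected for each pixel, which gives the same output as correcting all three streams and then selecting.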

【０１１４】上記動作において、補正値算出器２１３の
補正式算出アルゴリズムは下記に従って動作する。In the above operation, the correction formula calculation algorithm of the correction value calculator 213 operates as follows.

【０１１５】補正器２１６用補正式ｆ3R(ｒ)，ｆ3G
(ｇ)，ｆ3b(ｂ)は以下のように算出する。The correction formulas f3R(r), f3G(g), and f3B(b) for the corrector 216 are calculated as follows.

【０１１６】人間の目は青色に対しては比較的鈍く、補
正効果がさほど現れない。そこで、Ｂ画素値を補正する
ｆ3b(ｂ)については、ｆ3B(ｂ)＝ｂ …（１２）とする。The human eye is relatively insensitive to blue, so correcting blue has little visible effect. Therefore, the formula f3B(b) for correcting the B pixel value is set to
\[ f_{3B}(b) = b \tag{12} \]

【０１１７】次に、復号器２０５からの色かぶり補正用
画像情報２２４におけるＲ情報の最大値ＲＭａｘ1、平
均値ＲＥ1、分散ＲＲ1を求める。Next, the maximum value RMax1, average value RE1, and variance RR1 of the R information in the color fog correction image information 224 from the decoder 205 are obtained.

【０１１８】次に、復号器２０５からの色かぶり補正用
画像情報２２４におけるＧ情報の最大値ＧＭａｘ1、平
均値ＧＥ1、分散ＧＲ1を求める。Next, the maximum value GMax1, average value GE1, and variance GR1 of the G information in the color fog correction image information 224 from the decoder 205 are obtained.

【０１１９】そして、Ｒ情報の値とＧ情報の値との分布
を示す２次元ヒストグラムを算出する。Then, a two-dimensional histogram showing the distribution of the value of the R information and the value of the G information is calculated.
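The statistics gathered in the preceding steps (maximum, mean, and variance per channel, plus the joint R/G histogram) can be sketched as below. This is a minimal Python illustration assuming 8-bit integer samples held in flat lists and a population variance; the function names `channel_stats` and `rg_histogram` are hypothetical.

```python
def channel_stats(values):
    """Return (maximum, mean, variance) of one color channel.

    Assumption: population variance (divide by n); the text does not say
    which estimator is intended.
    """
    n = len(values)
    mean = sum(values) / n
    variance = sum((v - mean) ** 2 for v in values) / n
    return max(values), mean, variance


def rg_histogram(r_values, g_values, levels=256):
    """Two-dimensional histogram: hist[r][g] counts pixels with that pair."""
    hist = [[0] * levels for _ in range(levels)]
    for r, g in zip(r_values, g_values):
        hist[r][g] += 1
    return hist
```

Because the correction image information consists of low-frequency data, the number of samples per frame is small compared with the full image, which keeps these passes cheap.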

【０１２０】｜ＲＥ1−ＧＥ1｜がある閾値以下かつ｜Ｒ
Ｒ1−ＧＲ1｜がある閾値以下である場合ＲＭａｘ1≧ＧＭａｘ1であって、２次元ヒストグラム内
の対角線を（ＲＭａｘ1，ＲＭａｘ1）−（ＧＭａｘ1−
Ｔ，ＧＭａｘ1−Ｔ）とする正方形領域において、この
領域におけるヒストグラムの重心と分散とヒストグラム
の値が０である領域とから、Ｒ軸側への有意性のある偏
りが認められる場合、/ｆ3R(ｘ)と/ｆ3G(ｘ)とを /ｆ3R(ｘ)＝ｒ，/ｆ3G(ｇ)＝ｇ×ＲＭａｘ1／ＧＭａｘ1…（１３）と定める。When |RE1 − GE1| is at or below a certain threshold and |RR1 − GR1| is at or below a certain threshold, the following applies. If RMax1 ≥ GMax1 and, within the square region of the two-dimensional histogram whose diagonal runs from (RMax1, RMax1) to (GMax1 − T, GMax1 − T), the centroid and variance of the histogram and the area in which the histogram value is 0 indicate a significant bias toward the R axis, then \(\bar{f}_{3R}\) and \(\bar{f}_{3G}\) are defined as
\[ \bar{f}_{3R}(r) = r, \quad \bar{f}_{3G}(g) = g \times \frac{\mathit{RMax}_1}{\mathit{GMax}_1} \tag{13} \]

【０１２１】ＧＭａｘ1≧ＲＭａｘ1であって、２次元ヒ
ストグラム内の対角線を（ＧＭａｘ1，ＧＭａｘ1）−
（ＲＭａｘ1−Ｔ，ＲＭａｘ1−Ｔ）とする正方形領域に
おいて、この領域におけるヒストグラムの重心と分散と
ヒストグラムの値が０である領域とから、Ｇ軸側への有
意性のある偏りが認められる場合、/ｆ3R(ｘ)と/ｆ3G
(ｘ)とを /ｆ3G(ｇ)＝ｇ，/ｆ3R(ｒ)＝ｒ×ＧＭａｘ1／ＲＭａｘ1…（１４）と定める。If GMax1 ≥ RMax1 and, within the square region of the two-dimensional histogram whose diagonal runs from (GMax1, GMax1) to (RMax1 − T, RMax1 − T), the centroid and variance of the histogram and the area in which the histogram value is 0 indicate a significant bias toward the G axis, then \(\bar{f}_{3R}\) and \(\bar{f}_{3G}\) are defined as
\[ \bar{f}_{3G}(g) = g, \quad \bar{f}_{3R}(r) = r \times \frac{\mathit{GMax}_1}{\mathit{RMax}_1} \tag{14} \]

【０１２２】上記いずれの場合にもあてはまらなかった
場合、/ｆ3R(ｘ)と/ｆ3G(ｘ)とを /ｆ3R(ｒ)＝ｒ，/ｆ3G(ｇ)＝ｇ …（１５）と定める。但し、Ｔはある正数である。If neither of the above cases applies, \(\bar{f}_{3R}\) and \(\bar{f}_{3G}\) are defined as
\[ \bar{f}_{3R}(r) = r, \quad \bar{f}_{3G}(g) = g \tag{15} \]
Here, T is a positive number.

【０１２３】さもなければ/ｆ3R(ｘ)と/ｆ3G(ｘ)とを /ｆ3R(ｒ)＝ｒ，/ｆ3G(ｇ)＝ｇ …（１６）と定める。Otherwise, that is, when either threshold condition is not satisfied, \(\bar{f}_{3R}\) and \(\bar{f}_{3G}\) are defined as
\[ \bar{f}_{3R}(r) = r, \quad \bar{f}_{3G}(g) = g \tag{16} \]

【０１２４】以上が｜ＲＥ1−ＧＥ1｜と｜ＲＲ1−ＧＲ1
｜とによる場合分けである。The above is the case analysis based on |RE1 − GE1| and |RR1 − GR1|.
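The case analysis of equations (13) to (16) can be condensed into a small decision function. In the Python sketch below, each correction formula is represented by a single multiplicative gain, which is what equations (13) to (16) amount to. The histogram-based significance test is not specified in enough detail to reproduce, so its outcome is passed in as the flags `biased_to_r` and `biased_to_g`. These flags, the function name `candidate_gains`, the threshold parameters, and the guard against zero maxima are assumptions.

```python
def candidate_gains(r_max, r_mean, r_var, g_max, g_mean, g_var,
                    biased_to_r, biased_to_g,
                    mean_threshold, var_threshold):
    """Return (gain_r, gain_g) for the candidate R and G formulas.

    The candidate formulas are then r -> gain_r * r and g -> gain_g * g.
    """
    similar = (abs(r_mean - g_mean) <= mean_threshold
               and abs(r_var - g_var) <= var_threshold)
    if not similar:
        return 1.0, 1.0                  # equation (16): no correction
    if r_max >= g_max and biased_to_r and g_max > 0:
        return 1.0, r_max / g_max        # equation (13): lift G toward R
    if g_max >= r_max and biased_to_g and r_max > 0:
        return g_max / r_max, 1.0        # equation (14): lift R toward G
    return 1.0, 1.0                      # equation (15): no correction
```

A usage example: with RMax1 = 200, GMax1 = 160, similar means and variances, and a detected bias toward the R axis, the G gain becomes 200 / 160 = 1.25 while R is left unchanged.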

【０１２５】１フレーム前の補正式を基に、現在の補正
式を次のように定める。Based on the correction formula one frame before, the current correction formula is determined as follows.

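【０１２６】ｆ3R(ｘ)＝ｆ3R(ｘ)＋γ（/ｆ3R(ｘ)−ｆ3R(ｘ)）ｆ2G(ｘ)＝ｆ3G(ｘ)＋γ（/ｆ3G(ｘ)−ｆ3G(ｘ)） …（１７）ここで、γは、補正式の時変化追従用重み変数である。
\[ f_{3R}(x) = f_{3R}(x) + \gamma \left( \bar{f}_{3R}(x) - f_{3R}(x) \right), \quad f_{3G}(x) = f_{3G}(x) + \gamma \left( \bar{f}_{3G}(x) - f_{3G}(x) \right) \tag{17} \]
Here, f3R(x) and f3G(x) on the right-hand side are the correction formulas from one frame before, and γ is a weight that controls how quickly the correction formulas follow temporal change.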
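Equation (17) is a first-order recursive update: each frame moves the current formula a fraction γ of the way toward the newly computed one. The Python sketch below assumes the correction formula is a single multiplicative gain and that γ lies between 0 and 1; the function name `update_gain` is hypothetical.

```python
def update_gain(previous_gain, candidate_gain, gamma):
    """One step of equation (17) for a gain-type correction formula.

    gamma = 0 keeps the previous gain, gamma = 1 adopts the candidate
    immediately; values in between follow changes gradually.
    """
    return previous_gain + gamma * (candidate_gain - previous_gain)


def track(initial_gain, candidate_gains, gamma):
    """Apply the update frame by frame and return the gain history."""
    history = []
    gain = initial_gain
    for candidate in candidate_gains:
        gain = update_gain(gain, candidate, gamma)
        history.append(gain)
    return history
```

A small γ suppresses frame-to-frame flicker in the correction at the cost of a slower response when the lighting of the source actually changes.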

【０１２７】以上にて、補正式ｆ3R(ｒ)，ｆ3G(ｇ)，ｆ
3b(ｂ)の算出を終了する。This completes the calculation of the correction formulas f3R(r), f3G(g), and f3B(b).

【０１２８】同様にして、補正器２１４用補正式ｆ1R
(ｒ)，ｆ1G(ｇ)，ｆ1b(ｂ)、補正器２１５用補正式ｆ2R
(ｒ)，ｆ2G(ｇ)，ｆ2b(ｂ)を算出する。Similarly, the correction formulas f1R(r), f1G(g), and f1B(b) for the corrector 214 and the correction formulas f2R(r), f2G(g), and f2B(b) for the corrector 215 are calculated.

【０１２９】以上説明したように、実施形態２によれ
ば、背景画像と対象画像を分離し、それぞれについて符
号化したものを合成する際にそれぞれの画像データの特
徴量を抽出し、合成する対象画像の画素値を補正するこ
とで違和感のない画像合成が行える。また、対象画像の
サイズと演算速度との兼ね合いにおいて、直流成分、も
しくは２×２あるいは４×４の低周波成分の逆ＤＣＴ結
果、もしくは８×８の逆ＤＣＴ結果を選択的に、補正値
算出に用いることで柔軟で精度の高い処理が可能にな
る。さらに、色かぶり補正処理を時変化にゆるやかに追
従させることで、変化の激しい画像に対しても違和感の
無い画像合成が行える。As described above, according to the second embodiment, when a background image and target images that have been separated and individually encoded are composited, feature quantities are extracted from each set of image data and the pixel values of the target images to be composited are corrected, so that the images can be composited without looking unnatural. In addition, by selectively using the DC component, the inverse DCT result of the 2 × 2 or 4 × 4 low-frequency components, or the 8 × 8 inverse DCT result for calculating the correction values, according to the trade-off between the size of the target image and the computation speed, flexible and accurate processing becomes possible. Furthermore, by making the color fog correction follow temporal changes gradually, natural-looking compositing is possible even for rapidly changing images.

【０１３０】尚、実施形態２においては、対象画像の符
号化にＭＰＥＧ−４を、それ以外の符号化にＭＰＥＧ−
１を用いて説明をおこなったが、これに限定されず、こ
れらと同様の機能を果たすものであればなんでもかまわ
ない。In the second embodiment, MPEG-4 is used for encoding the target images and MPEG-1 for the other encoding, but the invention is not limited to these; any scheme that provides equivalent functions may be used.

【０１３１】また、メモリ構成はこれに限定されず、ラ
インメモリ等で処理を行ってももちろんかまわないし、
その他の構成であってもよい。The memory configuration is not limited to this, and it goes without saying that processing may be performed by a line memory or the like.
Other configurations may be used.

【０１３２】また、各構成要素の一部または全部をＣＰ
Ｕ等で動作するソフトウエアによって実現させてももち
ろんかまわない。＜実施形態３＞実施形態３は、実施形態１に動画像編集
装置１１２を変更したものである。そこで、実施形態１
と重複する部分についてはその説明を割愛し、変更部に
ついてのみ説明する。Of course, some or all of the components may be implemented by software running on a CPU or the like.
<Third Embodiment>
The third embodiment modifies the moving picture editing apparatus 112 of the first embodiment. Descriptions of the parts that overlap with the first embodiment are therefore omitted, and only the changed parts are described.

【０１３３】動画像送信システムは実施形態１と同様に
図１の構成を用いる。The moving image transmission system uses the configuration shown in FIG. 1 as in the first embodiment.

【０１３４】次に、実施形態３の動画像編集装置１１２
の詳細構成について、図１３を用いて説明する。Next, the detailed configuration of the moving picture editing apparatus 112 according to the third embodiment will be described with reference to FIG. 13.

【０１３５】図１３は本発明の実施形態３の動画像編集
装置の詳細構成を示すブロック図である。FIG. 13 is a block diagram showing a detailed configuration of the moving picture editing apparatus according to the third embodiment of the present invention.

【０１３６】１２００、１２０１、１２０２は端子であ
り、端子１２００は受信器１１０から、端子１２０１は
受信器１１１から、端子１２０２は記憶装置１１６から
の符号化データが入力される。これらの符号化データ
は、対象復号器１２０３、１２０４および復号器１２０
５に入力される。端子１２０６、１２０９、１２２５か
らは画像データが出力される。端子１２０７、１２１
０、１２１２からはコントラスト補正値を算出するため
に必要なコントラスト補正用画像情報１２２２、１２２
３、１２２４がそれぞれ出力される。端子１２０７、１
２１０からはマスク情報が出力される。１２１３はコン
トラスト補正用画像情報から補正値を算出する補正値算
出器である。１２１４、１２１５、１２１６は補正値か
ら画像データのコントラストを補正する補正器である。
１２１７は画像データとマスク情報とから画像データの
合成を行う画像合成器である。１２１８は端子であり、
合成された画像データを符号化器１１３に出力する。Reference numerals 1200, 1201, and 1202 denote terminals. Encoded data is input to the terminal 1200 from the receiver 110, to the terminal 1201 from the receiver 111, and to the terminal 1202 from the storage device 116. These encoded data are input to the target decoders 1203 and 1204 and the decoder 1205. Image data is output from the terminals 1206, 1209, and 1225. The contrast correction image information 1222, 1223, and 1224 needed to calculate the contrast correction values is output from the terminals 1207, 1210, and 1212, respectively. Mask information is output from the terminals 1207 and 1210. Reference numeral 1213 denotes a correction value calculator that calculates correction values from the contrast correction image information. Reference numerals 1214, 1215, and 1216 denote correctors that correct the contrast of the image data according to the correction values. Reference numeral 1217 denotes an image synthesizer that composites image data using the image data and the mask information. Reference numeral 1218 denotes a terminal that outputs the composited image data to the encoder 113.

【０１３７】次に、実施形態３の対象復号器１２０３、
１２０４の詳細構成について、図１４を用いて説明す
る。尚、図１４では、対象復号器１２０３の詳細構成と
して説明し、対象復号器１２０４の詳細構成は同様なの
で、ここでは省略する。Next, the detailed configuration of the target decoders 1203 and 1204 according to the third embodiment will be described with reference to FIG. 14. FIG. 14 is described as the detailed configuration of the target decoder 1203; the target decoder 1204 has the same configuration, so its description is omitted here.

【０１３８】図１４は本発明の実施形態３の対象復号器
の詳細構成を示すブロック図である。FIG. 14 is a block diagram showing a detailed configuration of a target decoder according to the third embodiment of the present invention.

【０１３９】２１９は端子であり、受信器１１０からの
符号化データが入力される。１２４１は分離器であり、
符号化データからマスク情報の符号化データと対象画像
の領域の符号化データを分離する。１２４２はマスク情
報を復号するマスク復号器である。１２４３はマスク情
報を格納するマスクメモリである。マスクメモリ１２４
３内のマスク情報は、端子１２０７より出力される。１
２４４は対象画像の領域の符号化データを格納する符号
メモリである。１２４５は対象画像の領域の符号化デー
タを復号する復号器である。１２４６は逆量子化器であ
る。逆量子化された画像データ内の直流情報は、コント
ラスト補正用画像情報として端子１２０８から出力され
る。１２４７は逆ＤＣＴ変換器である。１２４８は加算
器である。１２４９、１２５０、１２５１は再生した対
象画像の領域の画像データを格納する対象メモリであ
る。１２５２は動き補償器である。対象メモリ１２４９
内の画像データは、端子１２０６より出力される。Reference numeral 219 denotes a terminal to which encoded data from the receiver 110 is input. Reference numeral 1241 denotes a separator that separates the encoded data of the mask information and the encoded data of the target image area from the encoded data. Reference numeral 1242 denotes a mask decoder that decodes the mask information. Reference numeral 1243 denotes a mask memory for storing the mask information. The mask information in the mask memory 1243 is output from the terminal 1207. Reference numeral 1244 denotes a code memory for storing the encoded data of the target image area. Reference numeral 1245 denotes a decoder for decoding the encoded data of the target image area. Reference numeral 1246 denotes an inverse quantizer. The DC information in the inversely quantized image data is output from the terminal 1208 as contrast correction image information. Reference numeral 1247 denotes an inverse DCT converter. Reference numeral 1248 denotes an adder. Reference numerals 1249, 1250, and 1251 denote target memories for storing the image data of the reproduced target image area. Reference numeral 1252 denotes a motion compensator. The image data in the target memory 1249 is output from the terminal 1206.

【０１４０】上記構成において、入力された符号データ
からマスク情報の符号化データと対象画像の領域の符号
化データを分離器１２４１で分離し、それぞれマスク復
号器１２４２と符号メモリ１２４４に入力する。マスク
復号器１２４２は符号化データを復号してマスク情報を
再生し、マスクメモリ１２４３に格納する。符号メモリ
１２４４に格納された符号化データは復号器１２４５で
復号され、量子化された値を再生する。この値は、逆量
子化器１２４６で逆量子化され、逆ＤＣＴ変換器１２４
７で逆ＤＣＴ変換される。Ｉ−フレームの時は動き補償
器１２５２は動作せず、０を出力する。Ｐ−フレームと
Ｂ−フレームの時は動き補償器１２５２は動作し、動き
補償予測値を出力する。加算器１２４８は逆ＤＣＴ変換
器１２４７の出力と動き補償器１２５２の出力を加算
し、対象メモリ１２４９および対象メモリ１２５０また
は１２５１に格納する。一方、逆量子化器１２４６から
は、輝度データの平均値を表す直流成分が、コントラス
ト補正用画像情報として端子１２０８から出力される。In the above configuration, the separator 1241 separates the input encoded data into the encoded data of the mask information and the encoded data of the target image area, and inputs them to the mask decoder 1242 and the code memory 1244, respectively. The mask decoder 1242 decodes the encoded data to reproduce the mask information and stores it in the mask memory 1243. The encoded data stored in the code memory 1244 is decoded by the decoder 1245 to reproduce the quantized values. These values are inversely quantized by the inverse quantizer 1246 and inverse DCT-transformed by the inverse DCT converter 1247. For an I-frame, the motion compensator 1252 does not operate and outputs 0. For a P-frame or B-frame, the motion compensator 1252 operates and outputs a motion-compensated prediction value. The adder 1248 adds the output of the inverse DCT converter 1247 and the output of the motion compensator 1252, and stores the sum in the target memory 1249 and in the target memory 1250 or 1251. Separately, the DC component output by the inverse quantizer 1246, which represents the average value of the luminance data, is output from the terminal 1208 as contrast correction image information.

【０１４１】次に、実施形態３の復号器１２０５の詳細
構成について、図１５を用いて説明する。Next, the detailed configuration of the decoder 1205 according to the third embodiment will be described with reference to FIG. 15.

【０１４２】図１５は本発明の実施形態３の復号器の詳
細構成を示すブロック図である。FIG. 15 is a block diagram showing a detailed configuration of the decoder according to the third embodiment of the present invention.

【０１４３】１２２１は端子であり、記憶装置１１６か
らの符号化データが入力される。１２６１は符号化デー
タを格納する符号メモリである。１２６２は符号化デー
タを復号する復号器である。１２６３は逆量子化器であ
る。逆量子化された画像データ内の直流情報は、コント
ラスト補正用画像情報として端子１２１２から出力され
る。１２６４は逆ＤＣＴ変換器である。１２６５は加算
器である。１２６６、１２６７、１２６８は復号された
画像データを格納するメモリである。１２６９は動き補
償器である。メモリ１２６６内の画像データは、端子１
２２５より出力される。Reference numeral 1221 denotes a terminal to which encoded data from the storage device 116 is input. Reference numeral 1261 denotes a code memory for storing the encoded data. Reference numeral 1262 denotes a decoder for decoding the encoded data. Reference numeral 1263 denotes an inverse quantizer. The DC information in the inversely quantized image data is output from the terminal 1212 as contrast correction image information. Reference numeral 1264 denotes an inverse DCT converter. Reference numeral 1265 denotes an adder. Reference numerals 1266, 1267, and 1268 denote memories for storing the decoded image data. Reference numeral 1269 denotes a motion compensator. The image data in the memory 1266 is output from the terminal 1225.

【０１４４】In the above configuration, the encoded data stored in the code memory 1261 is decoded by the decoder 1262 to reproduce the quantized values. These values are inversely quantized by the inverse quantizer 1263 and inverse-DCT-transformed by the inverse DCT transformer 1264. For an I-frame, the motion compensator 1269 does not operate and outputs 0. For P-frames and B-frames, the motion compensator 1269 operates and outputs a motion-compensated prediction value. The adder 1265 adds the output of the inverse DCT transformer 1264 and the output of the motion compensator 1269, and stores the result in the memory 1266 and in the memory 1267 or 1268. Meanwhile, the inverse quantizer 1263 outputs the DC component, which represents the average value of the luminance data, from the terminal 1212 as image information for contrast correction.
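The link between a block's DC component and its average luminance can be illustrated with a short sketch. It assumes an orthonormal 8×8 DCT-II; the text does not fix a scaling convention, and real codecs scale the DC term differently. The names `dc_coefficient` and `block_mean_from_dc` are hypothetical.

```python
N = 8  # block size assumed for this sketch (8x8 blocks)

def dc_coefficient(block):
    """Return the (0, 0) coefficient of an orthonormal 2-D DCT-II of an N x N block."""
    total = sum(sum(row) for row in block)
    # For u = v = 0 every cosine factor is 1 and each 1-D scale factor is
    # sqrt(1/N), so the DC coefficient reduces to (sum of all pixels) / N.
    return total / N

def block_mean_from_dc(dc):
    """Recover the block's average pixel value from its DC coefficient."""
    return dc / N

# Example block holding the values 0..63 in raster order.
block = [[r * N + c for c in range(N)] for r in range(N)]
dc = dc_coefficient(block)
mean = block_mean_from_dc(dc)
```

Because one DC value summarizes 64 pixels, statistics gathered over DC values touch far fewer samples than statistics gathered over the fully decoded frame.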

【０１４５】In the configuration of the moving image editing apparatus 112 described above, once one frame has been decoded and image data has been stored in the target memory 1249 in the target decoder 1203, the target memory 1249 in the target decoder 1204, and the memory 1266 in the decoder 1205, the correction value calculator 1213 uses the image information for contrast correction to obtain the following correction formulas by the correction formula calculation algorithm described later: the correction formula $f_1(x)$ for the corrector 1214, the correction formula $f_2(x)$ for the corrector 1215, and the correction formula $f_3(x)$ for the corrector 1216.

【０１４６】Pixel values are then read from the memory 1266 in the decoder 1205 by raster scan, in scan-line pixel order, corrected by the corrector 1216, and input to the image synthesizer 1217. The corrector 1216 corrects each input pixel value $p$ with the correction formula $f_3(x)$ according to the following equation, and outputs the corrected pixel value $P$.

【０１４７】
$$P = f_3(p) \tag{1a}$$
When the scan position reaches the position at which the target image data of the target decoder 1203 is to be composited, the mask information and the image data are read from the mask memory 1243 and the target memory 1249 in the target decoder 1203, corrected by the corrector 1214, and input to the image synthesizer 1217. The corrector 1214 corrects each input pixel value $p$ with the correction formula $f_1(x)$ according to the following equation, and outputs the corrected pixel value $P$.

【０１４８】
$$P = f_1(p) \tag{2a}$$
Likewise, when the scan position reaches the position at which the target image data of the target decoder 1204 is to be composited, the mask information and the image data are read from the mask memory 1243 and the target memory 1249 in the target decoder 1204, corrected by the corrector 1215, and input to the image synthesizer 1217. The corrector 1215 corrects each input pixel value $p$ with the correction formula $f_2(x)$ according to the following equation, and outputs the corrected pixel value $P$.

【０１４９】
$$P = f_2(p) \tag{3a}$$
The image synthesizer 1217 composites the images by outputting the pixel value from the corrector 1214 when the mask information indicates the target image data of the target decoder 1203, the pixel value from the corrector 1215 when the mask information indicates the target image data of the target decoder 1204, and the pixel value from the corrector 1216 otherwise. It outputs the result from the terminal 1218 to the encoder 113. FIG. 9 shows the composite of the background 1060 and the person 1061, which are the images obtained by correcting the background 1050 and the person 1051; the person 1062, which is the image obtained by correcting the person 1052; and the person 1063, which is the image obtained by correcting the person 1053. The encoder 113 encodes the output image in MPEG-1, and the encoded data is sent to the communication network 115 via the transmitter 114.
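The selection rule applied by the image synthesizer can be sketched as follows. This is a simplified illustration, not the apparatus itself: it assumes the two masks have already been merged into one label per pixel (0 for background, 1 for the first target, 2 for the second) and that all inputs are aligned to the same scan line. The label encoding and the function name `composite_scanline` are assumptions.

```python
def composite_scanline(mask, obj1, obj2, background):
    """Pick one already-corrected pixel per position according to the mask label.

    mask[i] == 1 -> pixel of the first target (output of corrector 1214)
    mask[i] == 2 -> pixel of the second target (output of corrector 1215)
    otherwise    -> background pixel (output of corrector 1216)
    """
    out = []
    for label, p1, p2, bg in zip(mask, obj1, obj2, background):
        if label == 1:
            out.append(p1)
        elif label == 2:
            out.append(p2)
        else:
            out.append(bg)
    return out

line = composite_scanline(
    mask=[0, 1, 2, 0],
    obj1=[10, 11, 12, 13],
    obj2=[20, 21, 22, 23],
    background=[30, 31, 32, 33],
)
```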

【０１５０】In the above operation, the correction formula calculation algorithm of the correction value calculator 1213 operates as follows.

【０１５１】First, the maximum value $\mathrm{Max}_1$, the minimum value $\mathrm{Min}_1$, the average value $E_1$, and the variance $R_1$ of the contrast correction image information 1222 from the target decoder 1203 are obtained.

【０１５２】Next, the maximum value $\mathrm{Max}_2$, the minimum value $\mathrm{Min}_2$, the average value $E_2$, and the variance $R_2$ of the contrast correction image information 1223 from the target decoder 1204 are obtained.

【０１５３】Next, the maximum value $\mathrm{Max}_3$, the minimum value $\mathrm{Min}_3$, the average value $E_3$, and the variance $R_3$ of the contrast correction image information 1224 from the decoder 1205 are obtained.
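The four statistics gathered for each stream can be computed as in the sketch below. It assumes the contrast correction image information is available as a non-empty list of numbers (for example, per-block DC values) and uses the population variance; the text does not say which variance definition is intended. The name `contrast_stats` is hypothetical.

```python
def contrast_stats(values):
    """Return (max, min, mean, variance) of a non-empty sequence of numbers."""
    n = len(values)
    mean = sum(values) / n
    # Population variance (divide by n); an assumption, see the note above.
    variance = sum((v - mean) ** 2 for v in values) / n
    return max(values), min(values), mean, variance

max_v, min_v, mean_v, var_v = contrast_stats([0, 100, 200])
```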

【０１５４】Then, if at most one of the contrast correction image information 1222, 1223, and 1224 has a maximum value of 255 and a minimum value of 0, $f_1(x)$, $f_2(x)$, and $f_3(x)$ are determined as follows, where Max denotes the maximum value and Min denotes the minimum value.

【０１５５】
$$f_1(x) = \frac{\{\alpha(\mathrm{Max}-\mathrm{Max}_1)+\mathrm{Max}_1\}-\{\beta(\mathrm{Min}-\mathrm{Min}_1)+\mathrm{Min}_1\}}{\mathrm{Max}_1-\mathrm{Min}_1}\,(x-\mathrm{Min}_1)+\{\alpha(\mathrm{Max}-\mathrm{Max}_1)+\mathrm{Max}_1\} \tag{4a}$$
$$f_2(x) = \frac{\{\alpha(\mathrm{Max}-\mathrm{Max}_2)+\mathrm{Max}_2\}-\{\beta(\mathrm{Min}-\mathrm{Min}_2)+\mathrm{Min}_2\}}{\mathrm{Max}_2-\mathrm{Min}_2}\,(x-\mathrm{Min}_2)+\{\alpha(\mathrm{Max}-\mathrm{Max}_2)+\mathrm{Max}_2\} \tag{5a}$$
$$f_3(x) = \frac{\{\alpha(\mathrm{Max}-\mathrm{Max}_3)+\mathrm{Max}_3\}-\{\beta(\mathrm{Min}-\mathrm{Min}_3)+\mathrm{Min}_3\}}{\mathrm{Max}_3-\mathrm{Min}_3}\,(x-\mathrm{Min}_3)+\{\alpha(\mathrm{Max}-\mathrm{Max}_3)+\mathrm{Max}_3\} \tag{6a}$$
Here, $\alpha$ and $\beta$ are weighting variables or weighting functions.

【０１５６】Otherwise, if two of the contrast correction image information 1222, 1223, and 1224 have a maximum value of 255 and a minimum value of 0, and, for example, the contrast correction image information 1222 is the one whose maximum value is not 255 or whose minimum value is not 0, $f_1(x)$, $f_2(x)$, and $f_3(x)$ are determined as follows.

【０１５７】
$$f_1(x) = \frac{\{\alpha(255-\mathrm{Max}_1)+\mathrm{Max}_1\}-\{\beta(0-\mathrm{Min}_1)+\mathrm{Min}_1\}}{\mathrm{Max}_1-\mathrm{Min}_1}\,(x-\mathrm{Min}_1)+\{\alpha(255-\mathrm{Max}_1)+\mathrm{Max}_1\} \tag{7a}$$
$f_2(x)$ and $f_3(x)$ are defined as functions that reduce the variance difference $|R_2-R_3|$. As one way of defining them, the following example uses a cubic spline with three knots.

【０１５８】For example, when $R_2 > R_3$:
$$f_2(x) = x \tag{8a}$$
$$f_3(x) = \begin{cases} f_{31}(x), & x \le E_3 \\ f_{32}(x), & x > E_3 \end{cases} \tag{9a}$$
where the following conditions are satisfied: $f_{31}(0)=0$; $f_{31}(E_3)=E_3$; $f_{32}(255)=255$; $f_{32}(E_3)=E_3$; $f_{31}^{(2)}(E_3)=f_{32}^{(2)}(E_3)$; $f_{31}^{(1)}(E_3)=\varphi$; $f_{32}^{(1)}(E_3)=\psi$.

【０１５９】Here, $\alpha$, $\beta$, $\varphi$, and $\psi$ are weighting variables or weighting functions.

【０１６０】Otherwise, $f_1(x)$, $f_2(x)$, and $f_3(x)$ are defined as functions that reduce the variance differences $|R_1-R_2|$, $|R_1-R_3|$, and $|R_2-R_3|$.
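The three-way branching of the correction formula calculation can be summarized in a short sketch. It assumes that a stream "spans the full range" when its maximum is 255 and its minimum is 0, and that the second branch applies when exactly two streams do so. The function names and the returned labels are hypothetical and only name the branch taken.

```python
def spans_full_range(max_v, min_v):
    """True when the stream already uses the whole 0..255 range."""
    return max_v == 255 and min_v == 0

def select_branch(ranges):
    """ranges: list of three (max, min) pairs, one per stream.

    Returns which branch of the algorithm applies:
      "linear-all"  -> at most one stream spans the full range
      "linear-one"  -> exactly two streams span the full range
      "spline-all"  -> all three streams span the full range
    """
    full = sum(1 for max_v, min_v in ranges if spans_full_range(max_v, min_v))
    if full <= 1:
        return "linear-all"
    if full == 2:
        return "linear-one"
    return "spline-all"

branch = select_branch([(200, 10), (255, 0), (255, 0)])
```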

【０１６１】As one way of defining them, the following example uses a cubic spline with three knots.

【０１６２】For example, when $R_1 > R_2 > R_3$:
$$f_1(x) = x \tag{10a}$$
$$f_2(x) = \begin{cases} f_{21}(x), & x \le E_2 \\ f_{22}(x), & x > E_2 \end{cases} \tag{11a}$$
$$f_3(x) = \begin{cases} f_{31}(x), & x \le E_3 \\ f_{32}(x), & x > E_3 \end{cases} \tag{12a}$$
where the following conditions are satisfied: $f_{21}(0)=0$; $f_{21}(E_2)=E_2$; $f_{22}(255)=255$; $f_{22}(E_2)=E_2$; $f_{21}^{(2)}(E_2)=f_{22}^{(2)}(E_2)$; $f_{21}^{(1)}(E_2)=\varphi_2$; $f_{22}^{(1)}(E_2)=\psi_2$; and $f_{31}(0)=0$; $f_{31}(E_3)=E_3$; $f_{32}(255)=255$; $f_{32}(E_3)=E_3$; $f_{31}^{(2)}(E_3)=f_{32}^{(2)}(E_3)$; $f_{31}^{(1)}(E_3)=\varphi_3$; $f_{32}^{(1)}(E_3)=\psi_3$.

【０１６３】Here, $\varphi_2$, $\psi_2$, $\varphi_3$, and $\psi_3$ are weighting variables or weighting functions.

【０１６４】As described above, according to the third embodiment, when a background image and target images that have been separated and individually encoded are composited, the feature quantities of the respective image data are extracted and the pixel values of the target images to be composited are corrected, so that the images can be composited without looking unnatural. In addition, using the per-block DC components to calculate the correction values enables high-speed processing.

【０１６５】In the third embodiment, MPEG-4 is used to encode the targets and MPEG-1 is used for the other encoding, but the present invention is not limited to these; any scheme that provides equivalent functions may be used.

【０１６６】The memory configuration is likewise not limited to the one described; the processing may of course be performed with line memories or the like, or with some other configuration.

【０１６７】Part or all of the components may, of course, also be implemented by software running on a CPU or the like.

＜Fourth Embodiment＞

The fourth embodiment modifies the target decoders 1203 and 1204, the decoder 1205, and the correction value calculator 1213 of the third embodiment. Portions that are the same as in the third embodiment are therefore omitted, and only the modified portions are described.

【０１６８】The moving image transmission system uses the configuration of FIG. 1, as in the first embodiment. The detailed configuration of the moving image editing apparatus 112 is that of FIG. 13, as in the third embodiment.

【０１６９】Next, the detailed configuration of the target decoders 1203 and 1204 of the fourth embodiment is described with reference to FIG. 16. FIG. 16 is described as the detailed configuration of the target decoder 1203; the target decoder 1204 has the same configuration, and its detailed description is omitted here.

【０１７０】FIG. 16 is a block diagram showing the detailed configuration of the target decoder according to the fourth embodiment of the present invention.

【０１７１】Reference numeral 1219 denotes a terminal to which encoded data from the receiver 110 is input. Reference numeral 1302 denotes a separator, which separates the encoded data into the encoded data of the mask information and the encoded data of the image data of the target image area. Reference numeral 1303 denotes a mask decoder that decodes the mask information. Reference numeral 1304 denotes a mask memory that stores the mask information. The mask information in the mask memory 1304 is output from the terminal 1207. Reference numeral 1305 denotes a code memory that stores the encoded data of the target image area. Reference numeral 1306 denotes a decoder that decodes the image data of the target image area. Reference numeral 1307 denotes an inverse quantizer. Reference numeral 1308 denotes a fast inverse DCT transformer, whose detailed configuration is the same as that shown in FIG. 11. Reference numeral 1309 denotes an adder. Reference numerals 1310, 1311, and 1312 denote target memories that store the reproduced target image area. Reference numeral 1313 denotes a motion compensator. The image data in the target memory 1310 is output from the terminal 1206.

【０１７２】In the above configuration, the separator 1302 separates the input encoded data into the encoded data of the mask information and the encoded data of the target image area, and inputs them to the mask decoder 1303 and the code memory 1305, respectively. The mask decoder 1303 decodes the encoded data to reproduce the mask information and stores it in the mask memory 1304. The encoded data stored in the code memory 1305 is decoded by the decoder 1306 to reproduce the quantized values. These values are inversely quantized by the inverse quantizer 1307 and inverse-DCT-transformed by the fast inverse DCT transformer 1308 using radix butterfly operations. For an I-frame, the motion compensator 1313 does not operate and outputs 0. For P-frames and B-frames, the motion compensator 1313 operates and outputs a motion-compensated prediction value. The adder 1309 adds the output of the fast inverse DCT transformer 1308 and the output of the motion compensator 1313, and stores the result in the target memory 1310 and in the target memory 1311 or 1312. Meanwhile, the result of the n-th stage radix butterfly operation in the fast inverse DCT transformer 1308 is multiplexed and output from the terminal 1208 as image information for contrast correction.

【０１７３】Next, the detailed configuration of the decoder 1205 of the fourth embodiment is described with reference to FIG. 17.

【０１７４】FIG. 17 is a block diagram showing the detailed configuration of the decoder according to the fourth embodiment of the present invention.

【０１７５】Reference numeral 1221 denotes a terminal to which encoded data from the storage device 116 is input. Reference numeral 1322 denotes a code memory that stores the encoded data. Reference numeral 1323 denotes a decoder that decodes the encoded data. Reference numeral 1324 denotes an inverse quantizer. Reference numeral 1325 denotes a fast inverse DCT transformer, whose detailed configuration is the same as that shown in FIG. 11. Reference numeral 1326 denotes an adder. Reference numerals 1327, 1328, and 1329 denote memories that store the image data obtained by decoding the encoded data. Reference numeral 1330 denotes a motion compensator. The image data in the memory 1327 is output from the terminal 1225.

【０１７６】In the above configuration, the encoded data stored in the code memory 1322 is decoded by the decoder 1323 to reproduce the quantized values. These values are inversely quantized by the inverse quantizer 1324 and inverse-DCT-transformed by the fast inverse DCT transformer 1325. For an I-frame, the motion compensator 1330 does not operate and outputs 0. For P-frames and B-frames, the motion compensator 1330 operates and outputs a motion-compensated prediction value. The adder 1326 adds the output of the fast inverse DCT transformer 1325 and the output of the motion compensator 1330, and stores the result in the memory 1327 and in the memory 1328 or 1329. Meanwhile, either the DC component or the result of the n-th stage radix butterfly operation in the fast inverse DCT transformer 1325 is multiplexed and output from the terminal 1212 as image information for contrast correction.

【０１７７】In the configuration of the moving image editing apparatus 112 described above, once one frame has been decoded and image data has been stored in the target memory 1310 in the target decoder 1203, the target memory 1310 in the target decoder 1204, and the memory 1327 in the decoder 1205, the correction value calculator 1213 uses the image information for contrast correction to obtain the following correction formulas by the correction formula calculation algorithm described later: the correction formula $f_1(x)$ for the corrector 1214, the correction formula $f_2(x)$ for the corrector 1215, and the correction formula $f_3(x)$ for the corrector 1216.

【０１７８】Pixel values are then read from the memory 1327 in the decoder 1205 by raster scan, in scan-line pixel order, corrected by the corrector 1216, and input to the image synthesizer 1217. The corrector 1216 corrects each input pixel value $p$ with the correction formula $f_3(x)$ according to the following equation, and outputs the corrected pixel value $P$.

【０１７９】
$$P = f_3(p) \tag{13a}$$
When the scan position reaches the position at which the target image data of the target decoder 1203 is to be composited, the mask information and the image data are read from the mask memory 1304 and the target memory 1310 in the target decoder 1203, corrected by the corrector 1214, and input to the image synthesizer 1217. The corrector 1214 corrects each input pixel value $p$ with the correction formula $f_1(x)$ according to the following equation, and outputs the corrected pixel value $P$.

【０１８０】
$$P = f_1(p) \tag{14a}$$
Likewise, when the scan position reaches the position at which the target image data of the target decoder 1204 is to be composited, the mask information and the image data are read from the mask memory 1304 and the target memory 1310 in the target decoder 1204, corrected by the corrector 1215, and input to the image synthesizer 1217. The corrector 1215 corrects each input pixel value $p$ with the correction formula $f_2(x)$ according to the following equation, and outputs the corrected pixel value $P$.

【０１８１】
$$P = f_2(p) \tag{15a}$$
The image synthesizer 1217 composites the images by outputting the pixel value from the corrector 1214 when the mask information indicates the target image data of the target decoder 1203, the pixel value from the corrector 1215 when the mask information indicates the target image data of the target decoder 1204, and the pixel value from the corrector 1216 otherwise. It outputs the result from the terminal 1218 to the encoder 113. The composite of the background 1060 and the person 1061, which are the images obtained by correcting the background 1050 and the person 1051; the person 1062, which is the image obtained by correcting the person 1052; and the person 1063, which is the image obtained by correcting the person 1053, looks almost the same as FIG. 9 used for the third embodiment. Strictly speaking, however, the contrast differs from that of the third embodiment. The encoder 113 encodes the output image in MPEG-1, and the encoded data is sent to the communication network 115 via the transmitter 114.

【０１８２】In the above operation, the algorithm of the correction value calculator 1213 operates as follows.

【０１８３】First, the maximum value $\mathrm{Max}_1$, the minimum value $\mathrm{Min}_1$, the average value $E_1$, and the variance $R_1$ of the contrast correction image information 1222 from the target decoder 1203 are obtained.

【０１８４】Next, the maximum value $\mathrm{Max}_2$, the minimum value $\mathrm{Min}_2$, the average value $E_2$, and the variance $R_2$ of the contrast correction image information 1223 from the target decoder 1204 are obtained.

【０１８５】Next, the maximum value $\mathrm{Max}_3$, the minimum value $\mathrm{Min}_3$, the average value $E_3$, and the variance $R_3$ of the contrast correction image information 1224 from the decoder 1205 are obtained.

【０１８６】If at most one of the contrast correction image information 1222, 1223, and 1224 has a maximum value of 255 and a minimum value of 0, $\bar{f}_1(x)$, $\bar{f}_2(x)$, and $\bar{f}_3(x)$ are determined as follows, where Max denotes the maximum value and Min denotes the minimum value.

【０１８７】
$$\bar{f}_1(x) = \frac{\{\alpha(\mathrm{Max}-\mathrm{Max}_1)+\mathrm{Max}_1\}-\{\beta(\mathrm{Min}-\mathrm{Min}_1)+\mathrm{Min}_1\}}{\mathrm{Max}_1-\mathrm{Min}_1}\,(x-\mathrm{Min}_1)+\{\alpha(\mathrm{Max}-\mathrm{Max}_1)+\mathrm{Max}_1\} \tag{16a}$$
$$\bar{f}_2(x) = \frac{\{\alpha(\mathrm{Max}-\mathrm{Max}_2)+\mathrm{Max}_2\}-\{\beta(\mathrm{Min}-\mathrm{Min}_2)+\mathrm{Min}_2\}}{\mathrm{Max}_2-\mathrm{Min}_2}\,(x-\mathrm{Min}_2)+\{\alpha(\mathrm{Max}-\mathrm{Max}_2)+\mathrm{Max}_2\} \tag{17a}$$
$$\bar{f}_3(x) = \frac{\{\alpha(\mathrm{Max}-\mathrm{Max}_3)+\mathrm{Max}_3\}-\{\beta(\mathrm{Min}-\mathrm{Min}_3)+\mathrm{Min}_3\}}{\mathrm{Max}_3-\mathrm{Min}_3}\,(x-\mathrm{Min}_3)+\{\alpha(\mathrm{Max}-\mathrm{Max}_3)+\mathrm{Max}_3\} \tag{18a}$$
Here, $\alpha$ and $\beta$ are weighting variables or weighting functions.

【０１８８】Otherwise, if two of the contrast correction image information 1222, 1223, and 1224 have a maximum value of 255 and a minimum value of 0, and, for example, the contrast correction image information 1222 is the one whose maximum value is not 255 or whose minimum value is not 0, $\bar{f}_1(x)$, $\bar{f}_2(x)$, and $\bar{f}_3(x)$ are determined as follows.

【０１８９】
$$\bar{f}_1(x) = \frac{\{\alpha(255-\mathrm{Max}_1)+\mathrm{Max}_1\}-\{\beta(0-\mathrm{Min}_1)+\mathrm{Min}_1\}}{\mathrm{Max}_1-\mathrm{Min}_1}\,(x-\mathrm{Min}_1)+\{\alpha(255-\mathrm{Max}_1)+\mathrm{Max}_1\} \tag{19a}$$
$\bar{f}_2(x)$ and $\bar{f}_3(x)$ are defined as functions that reduce the variance difference $|R_2-R_3|$. As one way of defining them, the following example uses a cubic spline with three knots.

【０１９０】For example, when $R_2 > R_3$:
$$\bar{f}_2(x) = x \tag{20a}$$
$$\bar{f}_3(x) = \begin{cases} \bar{f}_{31}(x), & x \le E_3 \\ \bar{f}_{32}(x), & x > E_3 \end{cases} \tag{21a}$$
where the following conditions are satisfied: $\bar{f}_{31}(0)=0$; $\bar{f}_{31}(E_3)=E_3$; $\bar{f}_{32}(255)=255$; $\bar{f}_{32}(E_3)=E_3$; $\bar{f}_{31}^{(2)}(E_3)=\bar{f}_{32}^{(2)}(E_3)$; $\bar{f}_{31}^{(1)}(E_3)=\varphi$; $\bar{f}_{32}^{(1)}(E_3)=\psi$.

【０１９１】Here, $\alpha$, $\beta$, $\varphi$, and $\psi$ are weighting variables or weighting functions.

【０１９２】Otherwise, $\bar{f}_1(x)$, $\bar{f}_2(x)$, and $\bar{f}_3(x)$ are defined as functions that reduce the variance differences $|R_1-R_2|$, $|R_1-R_3|$, and $|R_2-R_3|$.

【０１９３】As one way of defining them, the following example uses a cubic spline with three knots.

【０１９４】For example, when $R_1 > R_2 > R_3$:
$$\bar{f}_1(x) = x \tag{22a}$$
$$\bar{f}_2(x) = \begin{cases} \bar{f}_{21}(x), & x \le E_2 \\ \bar{f}_{22}(x), & x > E_2 \end{cases} \tag{23a}$$
$$\bar{f}_3(x) = \begin{cases} \bar{f}_{31}(x), & x \le E_3 \\ \bar{f}_{32}(x), & x > E_3 \end{cases} \tag{24a}$$
where the following conditions are satisfied: $\bar{f}_{21}(0)=0$; $\bar{f}_{21}(E_2)=E_2$; $\bar{f}_{22}(255)=255$; $\bar{f}_{22}(E_2)=E_2$; $\bar{f}_{21}^{(2)}(E_2)=\bar{f}_{22}^{(2)}(E_2)$; $\bar{f}_{21}^{(1)}(E_2)=\varphi_2$; $\bar{f}_{22}^{(1)}(E_2)=\psi_2$; and $\bar{f}_{31}(0)=0$; $\bar{f}_{31}(E_3)=E_3$; $\bar{f}_{32}(255)=255$; $\bar{f}_{32}(E_3)=E_3$; $\bar{f}_{31}^{(2)}(E_3)=\bar{f}_{32}^{(2)}(E_3)$; $\bar{f}_{31}^{(1)}(E_3)=\varphi_3$; $\bar{f}_{32}^{(1)}(E_3)=\psi_3$.

【０１９５】Here, $\varphi_2$, $\psi_2$, $\varphi_3$, and $\psi_3$ are weighting variables or weighting functions.

【０１９６】Based on the correction formulas of the preceding frame, the current correction formulas are determined as follows.

【０１９７】
$$f_1(x) = f_1(x) + \gamma\,(\bar{f}_1(x) - f_1(x)) \tag{25a}$$
$$f_2(x) = f_2(x) + \gamma\,(\bar{f}_2(x) - f_2(x)) \tag{26a}$$
$$f_3(x) = f_3(x) + \gamma\,(\bar{f}_3(x) - f_3(x)) \tag{27a}$$
Here, $\gamma$ is a weighting variable that controls how closely the correction formulas follow changes over time. On the right-hand side, $f_i(x)$ is the correction formula of the preceding frame.
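The frame-to-frame update of equations (25a) to (27a) can be sketched as a blend between two lookup tables. Representing each correction formula as a table with one entry per input level is an assumption made for this illustration, as is the name `blend_correction`.

```python
def blend_correction(previous, target, gamma):
    """Apply f <- f + gamma * (target - f) entry by entry.

    previous: correction table used for the preceding frame
    target:   table newly computed from the current frame's statistics
    gamma:    tracking weight; 0 keeps the previous table, 1 adopts the target
    """
    return [f + gamma * (t - f) for f, t in zip(previous, target)]

# Two-entry tables keep the example short; a real table would have 256 entries.
updated = blend_correction([0.0, 100.0], [10.0, 200.0], 0.5)
```

With a small `gamma` the correction drifts slowly toward the per-frame target, which is the gradual tracking behaviour the update is meant to provide.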

【０１９８】As described above, according to the fourth embodiment, when a background image and target images that have been separated and individually encoded are composited, the feature quantities of the respective image data are extracted and the pixel values of the target images to be composited are corrected, so that the images can be composited without looking unnatural. In addition, depending on the trade-off between the size of the target image and the computation speed, the DC component, the inverse DCT result of the 2×2 or 4×4 low-frequency components, or the 8×8 inverse DCT result can be selectively used to calculate the correction values, which enables flexible and accurate processing. Furthermore, making the contrast correction follow changes over time gradually allows images to be composited without looking unnatural even when the images change rapidly.

【０１９９】In the fourth embodiment, MPEG-4 is used to encode the targets and MPEG-1 is used for the other encoding, but the present invention is not limited to these; any scheme that provides equivalent functions may be used.

【０２００】The memory configuration is likewise not limited to the one described; the processing may of course be performed with line memories or the like, or with some other configuration.

【０２０１】Part or all of the components may, of course, also be implemented by software running on a CPU or the like.

【０２０２】Finally, the flow of the processing executed in the first to fourth embodiments is described with reference to FIG. 18.

【０２０３】FIG. 18 is a flowchart showing the flow of the processing executed in the present invention. First, in step S101, the input encoded data is separated into the encoded data of the background image and the encoded data of the target image. In step S102, the background feature is extracted from the encoded data of the background image. In step S103, the target feature is extracted from the encoded data of the target image. In step S104, the encoded data of the background image is decoded to generate a background reproduced image. In step S105, the encoded data of the target image is decoded to generate a target reproduced image. In step S106, the target reproduced image is corrected on the basis of the extracted background feature and target feature. The details of this correction are as described in each embodiment. In step S107, the background reproduced image and the corrected target reproduced image are composited.
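The order of steps S101 to S107 can be sketched as a single function that receives each stage as a callable. Everything passed in below is a toy stand-in chosen for illustration: the "encoded data" is just a list of pixels, the feature is the mean, the correction shifts the target's mean onto the background's, and `None` marks positions the target does not cover. None of these stand-ins reproduces the codecs or the correction formulas of the embodiments.

```python
def compose_frame(coded, separate, extract_feature, decode, correct, composite):
    """Run steps S101 to S107 once; every callable is supplied by the caller."""
    bg_code, obj_code = separate(coded)            # S101: split the input
    bg_feat = extract_feature(bg_code)             # S102: background feature
    obj_feat = extract_feature(obj_code)           # S103: target feature
    bg_img = decode(bg_code)                       # S104: background image
    obj_img = decode(obj_code)                     # S105: target image
    obj_img = correct(obj_img, bg_feat, obj_feat)  # S106: correct the target
    return composite(bg_img, obj_img)              # S107: composite

def mean_of_covered(pixels):
    """Toy feature: mean over the positions that actually hold a pixel."""
    vals = [p for p in pixels if p is not None]
    return sum(vals) / len(vals)

result = compose_frame(
    {"bg": [100, 100, 100, 100], "obj": [None, 40, 60, None]},
    separate=lambda c: (c["bg"], c["obj"]),
    extract_feature=mean_of_covered,
    decode=lambda code: list(code),
    correct=lambda img, bg_f, obj_f: [
        None if p is None else p + (bg_f - obj_f) for p in img
    ],
    composite=lambda bg, obj: [b if o is None else o for b, o in zip(bg, obj)],
)
```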

【０２０４】尚、本発明は、複数の機器（例えばホスト
コンピュータ、インタフェース機器、リーダ、プリンタ
など）から構成されるシステムに適用しても、一つの機
器からなる装置（例えば、複写機、ファクシミリ装置な
ど）に適用してもよい。The present invention may be applied to a system composed of a plurality of devices (for example, a host computer, an interface device, a reader, and a printer) or to an apparatus consisting of a single device (for example, a copying machine or a facsimile machine).

【０２０５】また、本発明の目的は、前述した実施形態
の機能を実現するソフトウェアのプログラムコードを記
録した記憶媒体を、システムあるいは装置に供給し、そ
のシステムあるいは装置のコンピュータ（またはＣＰＵ
やＭＰＵ）が記憶媒体に格納されたプログラムコードを
読出し実行することによっても、達成されることは言う
までもない。It goes without saying that the object of the present invention is also achieved by supplying a storage medium storing program code of software that realizes the functions of the above-described embodiments to a system or apparatus, and having a computer (or CPU or MPU) of the system or apparatus read and execute the program code stored in the storage medium.

【０２０６】この場合、記憶媒体から読出されたプログ
ラムコード自体が前述した実施形態の機能を実現するこ
とになり、そのプログラムコードを記憶した記憶媒体は
本発明を構成することになる。In this case, the program code itself read from the storage medium realizes the functions of the above-described embodiment, and the storage medium storing the program code constitutes the present invention.

【０２０７】プログラムコードを供給するための記憶媒
体としては、例えば、フロッピディスク、ハードディス
ク、光ディスク、光磁気ディスク、ＣＤ−ＲＯＭ、ＣＤ
−Ｒ、磁気テープ、不揮発性のメモリカード、ＲＯＭな
どを用いることができる。As a storage medium for supplying the program code, for example, a floppy disk, a hard disk, an optical disk, a magneto-optical disk, a CD-ROM, a CD-R, a magnetic tape, a nonvolatile memory card, a ROM, or the like can be used.

【０２０８】また、コンピュータが読出したプログラム
コードを実行することにより、前述した実施形態の機能
が実現されるだけでなく、そのプログラムコードの指示
に基づき、コンピュータ上で稼働しているＯＳ（オペレ
ーティングシステム）などが実際の処理の一部または全
部を行い、その処理によって前述した実施形態の機能が
実現される場合も含まれることは言うまでもない。It goes without saying that the invention covers not only the case where the functions of the above-described embodiments are realized by the computer executing the program code it has read, but also the case where an OS (operating system) or the like running on the computer performs part or all of the actual processing based on the instructions of the program code, and the functions of the above-described embodiments are realized by that processing.

【０２０９】更に、記憶媒体から読出されたプログラム
コードが、コンピュータに挿入された機能拡張ボードや
コンピュータに接続された機能拡張ユニットに備わるメ
モリに書込まれた後、そのプログラムコードの指示に基
づき、その機能拡張ボードや機能拡張ユニットに備わる
ＣＰＵなどが実際の処理の一部または全部を行い、その
処理によって前述した実施形態の機能が実現される場合
も含まれることは言うまでもない。Further, it goes without saying that the invention also covers the case where, after the program code read from the storage medium is written into a memory provided in a function expansion board inserted into the computer or in a function expansion unit connected to the computer, a CPU or the like provided in the function expansion board or function expansion unit performs part or all of the actual processing based on the instructions of the program code, and the functions of the above-described embodiments are realized by that processing.

【０２１０】[0210]

【発明の効果】以上説明したように、本発明によれば、
複数の画像の合成を容易に実行でき、かつ画品位が良好
な合成画像を生成することができる画像処理装置及びそ
の方法、コンピュータ可読メモリを提供できる。As described above, according to the present invention, it is possible to provide an image processing apparatus, a method thereof, and a computer-readable memory capable of easily executing the synthesis of a plurality of images and generating a synthesized image with good image quality.

[Brief description of the drawings]

【図１】本発明の実施形態１の動画像送信システムの構
成を示すブロック図である。FIG. 1 is a block diagram illustrating a configuration of a moving image transmission system according to a first embodiment of the present invention.

【図２】本発明の実施形態１の対象画像の領域の一例を
示す図である。FIG. 2 is a diagram illustrating an example of a region of a target image according to the first embodiment of the present invention.

【図３】本発明の実施形態１のマスク情報の一例を示す
図である。FIG. 3 is a diagram illustrating an example of mask information according to the first embodiment of the present invention.

【図４】本発明の実施形態１の符号化画像の一例を示す
図である。FIG. 4 is a diagram illustrating an example of an encoded image according to the first embodiment of the present invention.

【図５】本発明の実施形態１の対象符号化部の詳細構成
を示すブロック図である。FIG. 5 is a block diagram illustrating a detailed configuration of a target encoding unit according to the first embodiment of the present invention.

【図６】本発明の実施形態１の動画像編集装置の詳細構
成を示すブロック図である。FIG. 6 is a block diagram illustrating a detailed configuration of the moving image editing apparatus according to the first embodiment of the present invention.

【図７】本発明の実施形態１の対象復号器の詳細構成を
示すブロック図である。FIG. 7 is a block diagram illustrating a detailed configuration of a target decoder according to the first embodiment of the present invention.

【図８】本発明の実施形態１の復号器の詳細構成を示す
ブロック図である。FIG. 8 is a block diagram illustrating a detailed configuration of a decoder according to the first embodiment of the present invention.

【図９】本発明の実施形態１の対象画像の合成の一例を
示す図である。FIG. 9 is a diagram illustrating an example of combining target images according to the first embodiment of the present invention.

【図１０】本発明の実施形態２の対象復号器の詳細構成
を示すブロック図である。FIG. 10 is a block diagram illustrating a detailed configuration of a target decoder according to Embodiment 2 of the present invention.

【図１１】本発明の実施形態２の高速逆ＤＣＴ変換器の
詳細構成を示すブロック図である。FIG. 11 is a block diagram illustrating a detailed configuration of a high-speed inverse DCT converter according to a second embodiment of the present invention.

【図１２】本発明の実施形態２の復号器の詳細構成を示
すブロック図である。FIG. 12 is a block diagram illustrating a detailed configuration of a decoder according to Embodiment 2 of the present invention.

【図１３】本発明の実施形態３の動画像編集装置の詳細
構成を示すブロック図である。FIG. 13 is a block diagram illustrating a detailed configuration of a moving image editing device according to a third embodiment of the present invention.

【図１４】本発明の実施形態３の対象復号器の詳細構成
を示すブロック図である。FIG. 14 is a block diagram illustrating a detailed configuration of a target decoder according to Embodiment 3 of the present invention.

【図１５】本発明の実施形態３の復号器の詳細構成を示
すブロック図である。FIG. 15 is a block diagram illustrating a detailed configuration of a decoder according to Embodiment 3 of the present invention.

【図１６】本発明の実施形態４の対象復号器の詳細構成
を示すブロック図である。FIG. 16 is a block diagram illustrating a detailed configuration of a target decoder according to Embodiment 4 of the present invention.

【図１７】本発明の実施形態４の復号器の詳細構成を示
すブロック図である。FIG. 17 is a block diagram illustrating a detailed configuration of a decoder according to Embodiment 4 of the present invention.

【図１８】本発明で実行される処理の処理フローを示す
フローチャートである。FIG. 18 is a flowchart showing a processing flow of processing executed in the present invention.

【図１９】従来の符号化システムの構成を示す図であ
る。FIG. 19 is a diagram illustrating a configuration of a conventional encoding system.

【図２０】本発明の実施形態１の画像の一例を示す図で
ある。FIG. 20 is a diagram illustrating an example of an image according to the first embodiment of the present invention.

【図２１】本発明の実施形態１の画像の一例を示す図で
ある。FIG. 21 is a diagram illustrating an example of an image according to the first embodiment of the present invention.

【図２２】本発明の実施形態１の符号化データの一例を
示す図である。FIG. 22 is a diagram illustrating an example of encoded data according to the first embodiment of the present invention.

[Explanation of symbols]

１０１、１０２ＴＶカメラ１０３対象抽出器１０４、１１３符号化器１０５対象符号化部１０６、１０７送信器１０８、１０９通信回線１１０、１１１受信器１１２動画像編集装置１１５通信網１１６記憶装置１２１、１２２端子１２３マスクメモリ１２４マスク符号器１２５、１３８、１３９対象メモリ１２６平均値算出器１２７ブロック形成器１２８フレームモード設定器１２９差分器１３０ＤＣＴ変換器１３１量子化器１３２符号器１３３合成器１３５逆量子化器１３６逆ＤＣＴ変換器１３７加算器１４０動き補償器２００、２０１、２０２、２０６、２０７、２０８、２
０９、２１０、２１１、２１２、２１８、２２５端子２０３、２０４対象復号器２０５復号器２１３補正値算出器２１４、２１５、２１６補正器２１７画像合成器２４１分離器２４２マスク復号器２４３マスクメモリ２４４、３０１符号メモリ２４５、３０２復号器２４６、２４７、３０３、３０４デマルチプレクサ２４８、２５５、２６２、３０５、３１２、３１９逆
量子化器２４９、２５６、２６３、３０６、３１３、３２０逆
ＤＣＴ変換器２５０、２５７、２６４、３０７、３１４、３２１加
算器２５１、２５２、２５３、２５８、２５９、２６０、２
６５、２６６、２６７対象メモリ２５４、２６１、２６８、３１１、３１８、３２５動
き補償器３０８、３０９、３１０、３１５、３１６、３１７、３
２２、３２３、３２４メモリ２７０、２７１、２７２、３２７、３２８、３２９バ
ッファ２６９、２７３、３２６、３３０色信号変換器４０１分離器４０２マスク復号器４０３マスクメモリ４０４、４５２符号メモリ４０５、４５３復号器４０６、４０７、４５４、４５５デマルチプレクサ４０８、４１５、４２２、４５６、４６３、４７０逆
量子化器４０９、４１６、４２３、４５７、４６４、４７１高
速逆ＤＣＴ変換器４１０、４１７、４２４、４５８、４６５、４７２加
算器４１１、４１２、４１３、４１８、４１９、４２０、４
２５、４２６、４２７対象メモリ４１４、４２１、４２８、４６２、４６９、４７６動
き補償器４５９、４６０、４６１、４６６、４６７、４６８、４
７３、４７４、４７５メモリ４３０、４３１、４３２、４７８、４７９、４８０バ
ッファ４２９、４３３、４７７、４８１色信号変換器１２００、１２０１、１２０２、１２０６、１２０７、
１２０８、１２０９、１２１０、１２１１、１２１２、
１２１８、１２２５端子１２０３、１２０４対象復号器１２０５復号器１２１３補正値算出器１２１４、１２１５、１２１６補正器１２１７画像合成器１２４１、１３０２分離器１２４２、１３０３マスク復号器１２４３、１３０４マスクメモリ１２４４、１２６１、１３０５、１３２２符号メモリ１２４５、１２６２、１３０６、１３２３復号器１２４６、１２６３、１３０７、１３２４逆量子化器１２４７、１２６４逆ＤＣＴ変換器１２４８、１２６５、１３０９、１３２６加算器１２４９、１２５０、１２５１、１３１０、１３１１、
１３１２対象メモリ１２５２、１２６９、１３１３、１３３０動き補償器１２６６、１２６７、１２６８、１３２７、１３２８、
１３２９メモリ１３０８、１３２５高速逆ＤＣＴ変換器
101, 102 TV camera
103 Target extractor
104, 113 Encoder
105 Target encoding unit
106, 107 Transmitter
108, 109 Communication line
110, 111 Receiver
112 Moving image editing apparatus
115 Communication network
116 Storage device
121, 122 Terminal
123 Mask memory
124 Mask encoder
125, 138, 139 Target memory
126 Average value calculator
127 Block former
128 Frame mode setter
129 Subtractor
130 DCT converter
131 Quantizer
132 Encoder
133 Synthesizer
135 Inverse quantizer
136 Inverse DCT converter
137 Adder
140 Motion compensator
200, 201, 202, 206, 207, 208, 209, 210, 211, 212, 218, 225 Terminal
203, 204 Target decoder
205 Decoder
213 Correction value calculator
214, 215, 216 Corrector
217 Image synthesizer
241 Separator
242 Mask decoder
243 Mask memory
244, 301 Code memory
245, 302 Decoder
246, 247, 303, 304 Demultiplexer
248, 255, 262, 305, 312, 319 Inverse quantizer
249, 256, 263, 306, 313, 320 Inverse DCT converter
250, 257, 264, 307, 314, 321 Adder
251, 252, 253, 258, 259, 260, 265, 266, 267 Target memory
254, 261, 268, 311, 318, 325 Motion compensator
308, 309, 310, 315, 316, 317, 322, 323, 324 Memory
270, 271, 272, 327, 328, 329 Buffer
269, 273, 326, 330 Color signal converter
401 Separator
402 Mask decoder
403 Mask memory
404, 452 Code memory
405, 453 Decoder
406, 407, 454, 455 Demultiplexer
408, 415, 422, 456, 463, 470 Inverse quantizer
409, 416, 423, 457, 464, 471 High-speed inverse DCT converter
410, 417, 424, 458, 465, 472 Adder
411, 412, 413, 418, 419, 420, 425, 426, 427 Target memory
414, 421, 428, 462, 469, 476 Motion compensator
459, 460, 461, 466, 467, 468, 473, 474, 475 Memory
430, 431, 432, 478, 479, 480 Buffer
429, 433, 477, 481 Color signal converter
1200, 1201, 1202, 1206, 1207, 1208, 1209, 1210, 1211, 1212, 1218, 1225 Terminal
1203, 1204 Target decoder
1205 Decoder
1213 Correction value calculator
1214, 1215, 1216 Corrector
1217 Image synthesizer
1241, 1302 Separator
1242, 1303 Mask decoder
1243, 1304 Mask memory
1244, 1261, 1305, 1322 Code memory
1245, 1262, 1306, 1323 Decoder
1246, 1263, 1307, 1324 Inverse quantizer
1247, 1264 Inverse DCT converter
1248, 1265, 1309, 1326 Adder
1249, 1250, 1251, 1310, 1311, 1312 Target memory
1252, 1269, 1313, 1330 Motion compensator
1266, 1267, 1268, 1327, 1328, 1329 Memory
1308, 1325 High-speed inverse DCT converter

Continued from the front page. F-terms (reference): 5C059 KK37 MA00 MA05 MA09 MA23 MB12 MB22 MC23 PP05 PP06 PP07 PP15 PP16 PP26 PP27 PP29 RC19 SS07 SS20 SS26 TA01 TA44 TB07 TC02 TC04 TC24 TD01 TD02 TD03 TD04 TD10 TD12 UA05 UA25 UA31 UA39; 5C066 AA13 CA09 CA17 EC06 ED04 EE01 GA01 GA02 GA05 GA22 GA32 HA01 JA02 KE09 KE16

Claims

[Claims]

1. An image processing apparatus for synthesizing a plurality of images, comprising: a background feature extracting unit that extracts a background feature from encoded data of at least one background image; a target feature extracting unit that extracts, from encoded data of at least one target image, a target feature including statistical information of image information; a background decoding unit that decodes the encoded data of the background image to generate a background reproduction image; a target decoding unit that decodes the encoded data of the target image to generate a target reproduction image; a correction unit that corrects the target reproduction image based on the background feature and the target feature; and a synthesizing unit that synthesizes the background reproduction image and the target reproduction image corrected by the correction unit.

2. The image processing apparatus according to claim 1, wherein the target feature extraction unit includes a calculation unit that calculates a histogram based on the statistical information of the image information, and the correction unit determines a correction method for the target image based on the histogram.

3. The image processing apparatus according to claim 1, wherein the target feature extracting unit extracts DC information of a block image included in the encoded data as statistical information of the image information.

4. The image processing apparatus according to claim 1, wherein the target feature extracting unit extracts low-frequency information of a block image included in the encoded data as statistical information of the image information.

5. The image processing apparatus according to claim 4, wherein one or both of the background decoding means and the target decoding means comprise: decoding means for decoding quantized data from the encoded data; inverse quantization means for calculating frequency domain data from the quantized data; and high-speed inverse discrete cosine transform means for calculating spatial domain data from the frequency domain data, wherein the high-speed inverse discrete cosine transform means includes output means for outputting a result of a radix butterfly operation of an arbitrary number of stages, and the target feature extracting means extracts the radix butterfly operation result of the arbitrary number of stages as low-frequency information of the image information.

6. The image processing apparatus according to claim 1, wherein the correction unit includes a time-series adaptation unit that gradually changes, over time, the input/output relationship between the signal input to the correction unit and the signal it outputs.

7. The image processing apparatus according to claim 1, wherein the target feature extracting means extracts, as statistical information of the image information, a maximum value and a minimum value of pixel values from DC information or low-frequency information of a block image included in the encoded data.

8. The image processing apparatus according to claim 1, wherein the target feature extracting means extracts, as statistical information of the image information, a variance and an average value of pixel values from DC information or low-frequency information of a block image included in the encoded data.

9. The image processing apparatus according to claim 1, wherein the correction unit performs a linear transformation on the target image.

10. The apparatus according to claim 1, wherein the correction unit performs a piecewise spline transformation on the target image.

11. The image processing apparatus according to any one of the above claims, comprising: detecting means for detecting the presence or absence of a significant color bias from the target feature extracted by the target feature extracting means; and color correction means for correcting the color bias based on a detection result of the detecting means.

12. The image processing apparatus according to claim 1, wherein the detecting means determines, based on statistical information included in the extracted target feature, whether a condition is satisfied that a difference absolute value of a variance from a difference absolute value of an average value between color signals is equal to or less than a threshold, and, when the condition is satisfied, detects whether there is a significant color bias in a specific area of the histogram based on the statistical information.

13. The image processing apparatus according to claim 11, wherein the color correction unit performs linear correction so that the maximum value of each color signal is equal.

14. The image processing apparatus according to claim 11, wherein the color correction unit does not correct a blue signal.

15. The image processing apparatus according to claim 1, further comprising: detecting means for detecting a significant contrast difference from the target feature extracted by the target feature extracting means and the background feature extracted by the background feature extracting means; and contrast correction means for correcting contrast based on a detection result of the detecting means.

16. The image processing apparatus according to claim 15, wherein the detecting means extracts a maximum value and a minimum value of pixel values obtained from each of the target feature and the background feature, and the contrast correcting means performs correction, between a target image and a background image that differ in the maximum value or the minimum value, so that the difference absolute value of the maximum pixel values and the difference absolute value of the minimum pixel values are reduced.

17. The image processing apparatus according to claim 15, wherein the detection unit extracts a maximum value and a minimum value of pixel values obtained from each of the target feature and the background feature, and the contrast correction unit performs correction, between a target image and a background image whose maximum values or minimum values are substantially equal, so that the difference absolute value of the variances is reduced.

18. An image processing method for synthesizing a plurality of images, comprising: a background feature extracting step of extracting a background feature from encoded data of at least one background image; a target feature extraction step of extracting, from encoded data of at least one target image, a target feature including statistical information of image information; a background decoding step of decoding the encoded data of the background image to generate a background reproduction image; a target decoding step of decoding the encoded data of the target image to generate a target reproduction image; a correction step of correcting the target reproduction image based on the background feature and the target feature; and a synthesizing step of synthesizing the background reproduction image and the target reproduction image corrected in the correction step.

19. The image processing method according to claim 18, wherein the target feature extracting step includes a calculating step of calculating a histogram based on the statistical information of the image information, and the correcting step determines a correction method for the target image based on the histogram.

20. The image processing method according to claim 18, wherein the target feature extracting step extracts DC information of a block image included in the encoded data as statistical information of the image information.

21. The image processing method according to claim 18, wherein the target feature extracting step extracts low-frequency information of a block image included in the encoded data as statistical information of the image information.

22. The image processing method according to claim 21, wherein one or both of the background decoding step and the target decoding step include: a decoding step of decoding quantized data from the encoded data; an inverse quantization step of calculating frequency domain data from the quantized data; and a high-speed inverse discrete cosine transform step of calculating spatial domain data from the frequency domain data, wherein the high-speed inverse discrete cosine transform step includes an output step of outputting a result of a radix butterfly operation of an arbitrary number of stages, and the target feature extracting step extracts the radix butterfly operation result of the arbitrary number of stages as low-frequency information of the image information.

23. The image processing method according to claim 18, wherein the correction step includes a time-series adaptation step of gradually changing, over time, the input/output relationship between the signal input to the correction step and the signal it outputs.

24. The image processing method according to claim 18, wherein the target feature extracting step extracts, as statistical information of the image information, a maximum value and a minimum value of pixel values from DC information or low-frequency information of a block image included in the encoded data.

25. The image processing method according to claim 18, wherein the target feature extracting step extracts, as statistical information of the image information, a variance and an average value of pixel values from DC information or low-frequency information of a block image included in the encoded data.

26. The image processing method according to claim 18, wherein the correcting step performs a linear transformation on the target image.

27. The image processing method according to claim 18, wherein the correcting step performs a piecewise spline transformation on the target image.

28. The image processing method according to claim 18, wherein the correcting step includes: a detecting step of detecting the presence or absence of a significant color bias from the target feature extracted in the target feature extracting step; and a color correction step of correcting the color bias based on a detection result of the detecting step.

29. The image processing method according to claim 28, wherein the detecting step determines, based on statistical information included in the extracted target feature, whether a condition is satisfied that a difference absolute value of a variance from a difference absolute value of an average value between color signals is equal to or less than a threshold, and, when the condition is satisfied, detects whether there is a significant color bias in a specific area of the histogram based on the statistical information.

30. The image processing method according to claim 28, wherein in the color correction step, linear correction is performed so that the maximum value of each color signal becomes equal.

31. The image processing method according to claim 28, wherein in the color correction step, no correction is performed on a blue signal.

32. The image processing method according to claim 18, further comprising: a detecting step of detecting a significant contrast difference from the target feature extracted in the target feature extracting step and the background feature extracted in the background feature extracting step; and a contrast correction step of correcting contrast based on a detection result of the detecting step.

33. The image processing method according to claim 32, wherein the detection step extracts a maximum value and a minimum value of pixel values obtained from each of the target feature and the background feature, and the contrast correction step performs correction, between a target image and a background image that differ in the maximum value or the minimum value, so that the difference absolute value of the maximum pixel values and the difference absolute value of the minimum pixel values are reduced.

34. The image processing method according to claim 32, wherein the detecting step extracts a maximum value and a minimum value of pixel values obtained from each of the target feature and the background feature, and the contrast correction step performs correction, between a target image and a background image whose maximum values or minimum values are substantially equal, so that the difference absolute value of the variances is reduced.

35. A computer-readable memory storing program code for image processing for synthesizing a plurality of images, comprising: program code for a background feature extracting step of extracting a background feature from encoded data of at least one background image; program code for a target feature extraction step of extracting, from encoded data of at least one target image, a target feature including statistical information of image information; program code for a background decoding step of decoding the encoded data of the background image to generate a background reproduction image; program code for a target decoding step of decoding the encoded data of the target image to generate a target reproduction image; program code for a correction step of correcting the target reproduction image based on the background feature and the target feature; and program code for a synthesizing step of synthesizing the background reproduction image and the target reproduction image corrected in the correction step.
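As an informal illustration of the statistics and linear transformation recited in the claims (maximum, minimum, average and variance taken from block DC information, followed by a linear correction of the target image), a minimal sketch is given below. It is not the claimed implementation: the function names, the assumption that each DC value has already been scaled to a block average in the range 0 to 255, and the specific rule of mapping the target range onto the background range are all introduced here for illustration only.

```python
from statistics import mean, pvariance


def dc_statistics(dc_values):
    # Statistics gathered from per-block DC values instead of decoded pixels.
    # Assumes each DC value is already scaled to the block's average level.
    return {
        "min": min(dc_values),
        "max": max(dc_values),
        "mean": mean(dc_values),
        "var": pvariance(dc_values),
    }


def linear_correction(target_stats, background_stats):
    # Hypothetical rule: choose gain and offset so that the target's
    # [min, max] range is mapped onto the background's [min, max] range.
    span = target_stats["max"] - target_stats["min"]
    if span == 0:
        # Flat target: only shift it toward the background mean.
        return 1.0, float(background_stats["mean"] - target_stats["mean"])
    gain = (background_stats["max"] - background_stats["min"]) / span
    offset = background_stats["min"] - gain * target_stats["min"]
    return gain, offset


def apply_linear(pixels, gain, offset):
    # Apply y = gain * x + offset and clip to the 8-bit range.
    return [min(255, max(0, round(gain * p + offset))) for p in pixels]
```

For example, with target DC values 40, 80, 120 and background DC values 20, 100, 180, this rule gives a gain of 2.0 and an offset of -60, so the target range 40 to 120 is stretched to 20 to 180. Working from DC values keeps the statistics cheap, because no full inverse DCT is needed to obtain them.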