JP2019087883A

JP2019087883A - Encoding device and program, and image processing system

Info

Publication number: JP2019087883A
Application number: JP2017214854A
Authority: JP
Inventors: 和仁迫水; Kazuhito Sakomizu
Original assignee: Oki Electric Industry Co Ltd
Current assignee: Oki Electric Industry Co Ltd
Priority date: 2017-11-07
Filing date: 2017-11-07
Publication date: 2019-06-06
Anticipated expiration: 2037-11-07
Also published as: JP7043797B2

Abstract

To provide an encoding device capable of alleviating image quality degradation due to interaction between pixels in a boundary region between a background region and a target region.SOLUTION: An encoding device that encodes an input image according to the present invention includes a target region setting unit that sets a target region corresponding to the input image and outputs target region information, a target region coordinate correction unit that corrects the target region information and outputs the corrected target region information, a pre-processing unit that identifies pixels belonging to a background region in the input image on the basis of the corrected target region information, executes filtering for limiting the dynamic range for pixels belonging to the identified background region, and output the pixels as a pre-encoded image, and an encoding unit that compresses the pre-encoded image by using a predetermined image encoding method and outputs the compressed encoded data.SELECTED DRAWING: Figure 1

Description

本発明は、符号化装置及びプログラム、並びに、画像処理システムに関し、例えば、画像又は映像を圧縮するシステムに適用可能である。 The present invention relates to an encoding apparatus and program, and an image processing system, and is applicable to, for example, a system for compressing an image or video.

近年、監視カメラの普及が進み、さらなる高解像度化、高フレームレート化、及び多視点化も望まれている。しかし、高解像度化、高フレームレート化、及び多視点化は、動画像の符号量の著しい増加を引き起こし、通信コストやストレージコストの増加を招く。 In recent years, surveillance cameras have become widespread, and it is also desired to further increase the resolution, increase the frame rate, and increase the number of viewpoints. However, higher resolution, higher frame rate, and multiple viewpoints cause a significant increase in the code amount of moving images, resulting in an increase in communication costs and storage costs.

そこで、この問題を緩和するため、動画像より注目領域を検出し、注目領域に多くのビット数を配分する方式が提案されている。注目領域とは、例えば、顔領域である。なお、以下では、注目領域のことをＲｅｇｉｏｎｏｆＩｎｔｅｒｅｓｔの略である「ＲＯＩ」と呼ぶものとする。また、注目領域に多くのビット数を配分する方式を「ＲＯＩ符号化」と呼ぶものとする。 Therefore, in order to alleviate this problem, a method has been proposed in which a region of interest is detected from a moving image and a large number of bits are allocated to the region of interest. The attention area is, for example, a face area. Hereinafter, the region of interest will be referred to as "ROI" which is an abbreviation of Region of Interest. Further, a method of allocating a large number of bits to a region of interest is called “ROI coding”.

例えば、特許文献１では、符号量と映像品質を制御するパラメータをブロックごとに与えられるというエンコーダの機能を利用して、エンコーダに与えるパラメータをブロックごとに制御することで、ＲＯＩ以外の領域である非注目領域の情報量をＲＯＩの情報量よりも削減するシステム構成を提案している。なお、以下、この明細書では、非注目領域のことを「背景領域」と呼ぶものとする。また、背景領域は必ずしも静止状態とは限られないものとする。 For example, in Patent Document 1, it is an area other than the ROI by controlling the parameters given to the encoder for each block using the function of the encoder that the code amount and the parameter for controlling the video quality are given for each block. We propose a system configuration that reduces the amount of information in non-interest areas more than the amount of information in ROI. Hereinafter, in this specification, the non-interesting area will be referred to as a “background area”. In addition, the background area is not necessarily limited to the stationary state.

このようにエンコーダが提供する機能を用いて情報量を削減すれば、効率よく確実に符号量を削減することが可能である。Ｈ．２６４／ＭＰＥＧ−４ＡＶＣ（ＡｄｖａｎｃｅｄＶｉｄｅｏＣｏｄｉｎｇ：以下、「ＡＶＣ」とも呼ぶ）やＨ．２６５／ＭＰＥＧ−ＨＨＥＶＣ（ＨｉｇｈＥｆｆｉｃｉｅｎｃｙＶｉｄｅｏＣｏｄｉｎｇ：以下、「ＨＥＶＣ」とも呼ぶ）のような主要な規格は、領域ごとに品質を制御する機能を備えているため、この機能を利用することは、ＲＯＩに多くのビット数を配分する代表的な方式と言える。 By reducing the amount of information using the function provided by the encoder in this manner, it is possible to reduce the amount of code efficiently and reliably. H. H.264 / MPEG-4 AVC (Advanced Video Coding: hereinafter also referred to as "AVC") or H.264. As major standards such as H.265 / MPEG-H HEVC (High Efficiency Video Coding: hereinafter referred to as "HEVC") have a function to control the quality for each area, it is necessary to use this function as follows. It can be said that this is a typical method for allocating a large number of bits to the ROI.

一方、例えば、コストや互換性などの諸事情により、映像品質制御パラメータをブロックごとに与える機能を有していないエンコーダの利用が不可避の場合、前述の方式を採用することはできない。このような場合でも、エンコーダに依存せず、領域ごとに映像品質を制御可能な技術として、例えば、特許文献２、又は非特許文献１に記載の技術が提案されている。 On the other hand, for example, due to various reasons such as cost and compatibility, when the use of an encoder not having a function of providing the video quality control parameter for each block is unavoidable, the above-described system can not be adopted. Even in such a case, for example, the technology described in Patent Document 2 or Non-Patent Document 1 is proposed as a technology capable of controlling the video quality for each area independently of the encoder.

特許文献２は、エンコーダに依存せずに背景領域の情報量を削減するために、背景領域に低域通過フィルタによる前処理を行い、高周波成分の情報を取り除くことで背景領域の情報量を抑圧することを提案するものである。 In order to reduce the amount of information in the background region without relying on an encoder, Patent Document 2 performs preprocessing on the background region with a low-pass filter to remove information in high-frequency components, thereby suppressing the amount of information in the background region. It is proposed to do.

一方、非特許文献１は、エンコーダに依存せずに背景領域の情報量を削減するために、背景領域の画素信号のダイナミックレンジを制限する前処理を行うことで背景領域の情報量を抑圧することを提案するものである。 On the other hand, in Non-Patent Document 1, in order to reduce the amount of information in the background region without depending on the encoder, the amount of information in the background region is suppressed by performing preprocessing for limiting the dynamic range of pixel signals in the background region. Suggest that.

図８は、非特許文献１に代表される従来の画像処理システムの構成例を示す図である。 FIG. 8 is a view showing a configuration example of a conventional image processing system represented by Non-Patent Document 1. As shown in FIG.

図８において、画像処理システムＺは、入力された入力画像の内、背景領域に属する画素をＲＯＩの情報よりも少ないビット数で圧縮し、ビットストリームとして出力する画像符号化装置Ｘ１と、入力されたビットストリームを、復号し、出力画像を出力する画像復号装置Ｘ２とから構成される。 In FIG. 8, the image processing system Z receives an image encoding device X1 that compresses pixels belonging to the background region of the input image by a smaller number of bits than the information of the ROI and outputs as a bit stream. And an image decoding apparatus X2 that decodes the bit stream and outputs an output image.

画像符号化装置Ｘ１は、入力画像に対応するＲＯＩを設定し、ＲＯＩ情報を出力するＲＯＩ設定部３０１と、ＲＯＩ情報に基づき背景領域に属する画素を特定し、入力画像のうちの背景領域に属する画素にダイナミックレンジを限定（抑圧）するためのフィルタリングを施し、符号化前画像として出力する前処理部Ｕ１と、ＡＶＣ等の画像符号化方式を用いて符号化前画像を圧縮し、ビットストリームを出力するエンコーダ部３０６とを有する。ここで、前処理とは、画素信号のダイナミックレンジを制限する処理で、画素信号を１２８などの固定値に限定してしまう場合等も含む。なお、画素信号を固定値にするとは、ダイナミックレンジを１にすることと等価である。 The image encoding device X1 sets an ROI corresponding to an input image, and an ROI setting unit 301 that outputs ROI information, identifies a pixel belonging to a background region based on the ROI information, and belongs to the background region among the input image The pre-processing unit U1 performs filtering for limiting (suppressing) the dynamic range to the pixels, and the pre-coding unit is compressed using an image coding method such as AVC or the like, and a bit stream is output. And an encoder unit 306 for outputting. Here, the preprocessing is a process of limiting the dynamic range of the pixel signal, and also includes the case where the pixel signal is limited to a fixed value such as 128. Note that setting the pixel signal to a fixed value is equivalent to setting the dynamic range to 1.

ＲＯＩ設定部３０１は、入力画像に対応するＲＯＩを設定する機能であるが、ＲＯＩの特定手段に関しては特に限定されない。例えば、ＲＯＩ補助情報として入力画像が与えられ、入力画像に対して顔検出アルゴリズムや人物検出アルゴリズム、ナンバープレート検出アルゴリズム、車体検出アルゴリズム等を適用することで入力画像の中からＲＯＩの位置と大きさを検出することで特定するという方法が考えられる。また、ＲＯＩ補助情報として、例えば予め手動で入力されたＲＯＩの位置と大きさによって特定するという方法も考えられるし、ユーザインタフェースを通して入力されたＲＯＩの位置と大きさによって特定するものでも良い。さらに、ＲＯＩ補助情報として、例えば入力画像に対応する赤外線カメラ画像や深度センサ画像を利用してＲＯＩを特定するという方法でも良い。 The ROI setting unit 301 is a function of setting an ROI corresponding to an input image, but the means for specifying the ROI is not particularly limited. For example, the input image is given as ROI auxiliary information, and by applying a face detection algorithm, a person detection algorithm, a license plate detection algorithm, a vehicle body detection algorithm, etc. to the input image, the position and size of the ROI from the input image There is a way to identify by detecting. Further, as auxiliary ROI information, for example, a method of specifying by the position and size of the ROI manually input in advance may be considered, or it may be specified by the position and size of the ROI input through the user interface. Furthermore, as the ROI auxiliary information, for example, an infrared camera image corresponding to the input image or a depth sensor image may be used to specify the ROI.

画像復号装置Ｘ２は、入力されたビットストリームデータをエンコーダ部３０６に対応する方式で復号し、復号画像（出力画像）として出力するデコーダ部３０７と、ＲＯＩ情報に基づき背景領域に属する画素を特定し、復号画像のうちの背景領域に属する画素にダイナミックレンジを回復するフィルタリング（信号の振幅の増幅に相当）を施し、出力画像として出力する後処理部Ｕ２とを有する。なお、前処理部Ｕ１と後処理部Ｕ２は、情報の多重化手段を用いてネットワーク等を介して通信するなど、任意の手段を用いて画像に対応するＲＯＩ情報を共有しているものとする。 The image decoding apparatus X2 decodes the input bit stream data according to a method corresponding to the encoder unit 306, and specifies a pixel belonging to the background area based on the ROI information and a decoder unit 307 that outputs the decoded image (output image). And a post-processing unit U2 that applies filtering (corresponding to amplification of the amplitude of the signal) for restoring the dynamic range to the pixels belonging to the background area in the decoded image and outputs the result as an output image. The preprocessing unit U1 and the post-processing unit U2 share ROI information corresponding to an image using any means such as communicating via a network using information multiplexing means .

特開２００９−４９９７９号公報JP, 2009-49979, A 特許第３０４６３７９号Patent No. 3046379

「ＦｉｌｔｅｒｉｎｇＳｃｈｅｍｅｆｏｒＲＯＩＣｏｄｉｎｇｂｙＤｙｎａｍｉｃＲａｎｇｅＣｏｍｐｒｅｓｓｉｏｎａｎｄＵｐｄａｔｉｎｇＳｏｕｒｃｅＰｉｃｔｕｒｅＦｉｌｔｅｒ，Ｐｒｏｃｅｅｄｉｎｇｓｏｆｔｈｅ４ｔｈＩＩＡＥＩｎｔｅｒｎａｔｉｏｎａｌＣｏｎｆｅｒｅｎｃｅｏｎＩｎｔｅｌｌｉｇｅｎｔＳｙｓｔｅｍｓａｎｄＩｍａｇｅＰｒｏｃｅｓｓｉｎｇ２０１６」“Filtering Scheme for ROI Coding by Dynamic Range Compression and Updating Source Picture Filter, Proceedings of the 4th II AE International Conference on Intelligent Systems and Image Processing 2016”

以上のように、前処理部Ｕ１を利用し、背景領域とＲＯＩのダイナミックレンジを変えることで、ＲＯＩに多くのビット数を配分することは可能である。 As described above, it is possible to allocate a large number of bits to the ROI by changing the dynamic range of the background region and the ROI using the preprocessing unit U1.

しかしながら、前処理部Ｕ１を利用して背景領域とＲＯＩのダイナミックレンジを変えると、符号化前画像において、背景領域とＲＯＩの質感は大きく異なることになる。例えば、入力画像の一部で画素値が「２００２１０２２０２４０」と並んでいたとして、２１０と２２０の間を境に背景領域とＲＯＩに分かれていたとする。このとき、ダイナミックレンジの圧縮率を０．５（ダイナミックレンジを半分にする）とし、画素値０を中心にダイナミックレンジを圧縮する、符号化前画像の画素値は「１００１０５２２０２４０」と並んでいることになる。２１０と２２０の差に比ベて、１０５と２２０の差は当然大きく、この差を質感が大きく異なると表現している。 However, when the dynamic range of the background region and the ROI is changed using the preprocessing unit U1, the texture of the background region and the ROI is largely different in the image before encoding. For example, it is assumed that the pixel value is aligned with “200 210 220 240” in a part of the input image, and is divided into a background area and an ROI at the boundary between 210 and 220. At this time, the compression ratio of the dynamic range is set to 0.5 (half the dynamic range), and the dynamic range is compressed around the pixel value 0. The pixel value of the image before encoding is aligned with “100 105 220 240”. It will be Compared to the difference between 210 and 220, the difference between 105 and 220 is naturally large, and this difference is expressed as that the texture is greatly different.

一方、近年の動画像符号化はＤｉｓｃｒｅｔｅＣｏｓｉｎｅＴｒａｎｓｆｏｒｍ等の変換と量子化に基づく符号化のように近隣の画素間が影響し合って圧縮するものが大半であり、背景領域とＲＯＩの境界領域で画素間の相互作用が起きる。ここで、背景領域とＲＯＩの境界領域で起きる画素間の相互作用とは、背景領域の画素値がＲＯＩの画素値に影響し、逆にＲＯＩの画素値が背景領域の画素値に影響する（画素値が領域外に染み出す）ことである。また、ＡＶＣが採用しているデブロッキングフィルタのようなインループフィルタもまた背景領域とＲＯＩの境界領域で画素間の相互作用が起きる原因になる。 On the other hand, most video encodings in recent years such as Discrete Cosine Transform and other encoding based on transformation and quantization affect each other and compress between neighboring pixels, and in the border region between background region and ROI Interaction between pixels occurs. Here, in the interaction between pixels occurring in the boundary region of the background region and the ROI, the pixel values of the background region affect the pixel values of the ROI, and conversely, the pixel values of the ROI affect the pixel values of the background region ( Pixel values leak out of the area). Also, an in-loop filter such as a deblocking filter adopted by AVC also causes an interaction between pixels in the boundary region between the background region and the ROI.

前処理部Ｕ１を利用して背景領域とＲＯＩのダイナミックレンジを変えるようなシステムにおいて、背景領域とＲＯＩの境界領域で画素間の相互作用が起きると、異なる質感の画素値が互いに染み出すことになる。特にＲＯＩは高い品質を保証したい領域であるにも関わらず、フィルタを通過した低品質な背景領域の画素値の影響を受けて画質が低下してしまうことになる。 In a system in which the dynamic range of the background region and the ROI is changed using the preprocessing unit U1, when the interaction between pixels occurs in the boundary region of the background region and the ROI, pixel values of different textures exude each other. Become. In particular, although the ROI is an area where high quality is desired to be guaranteed, the image quality is degraded due to the influence of the pixel values of the low quality background area passed through the filter.

そのため、背景領域とＲＯＩの境界領域で画素間の相互作用による画質低下を緩和できる符号化装置及びプログラム、並びに、画像処理システムが望まれている。 Therefore, there is a need for an encoding apparatus and program, and an image processing system capable of alleviating image quality degradation due to interaction between pixels in the border region between the background region and the ROI.

第１の本発明は、入力画像を符号化する符号化装置において、（１）前記入力画像に対応する注目領域を設定し、注目領域情報を出力する注目領域設定部と、（２）前記注目領域情報を補正し、補正済み注目領域情報を出力する注目領域座標補正部と、（３）前記補正済み注目領域情報に基づき、前記入力画像の内、背景領域に属する画素を特定し、特定した前記背景領域に属する画素にダイナミックレンジを限定するためのフィルタリングを施し、符号化前画像として出力する前処理部と、（４）所定の画像符号化方式を用いて、前記符号化前画像を圧縮し、圧縮した符号化データを出力する符号化部とを有することを特徴とする。 According to a first aspect of the present invention, in an encoding apparatus for encoding an input image, (1) an attention area setting unit for setting an attention area corresponding to the input image and outputting attention area information; and (2) the attention A region of interest is corrected, and a region of interest coordinate correction unit for outputting corrected region of interest information, and (3) based on the corrected region of interest region, the pixels belonging to the background region in the input image are identified and identified The pre-processing unit applies filtering for limiting the dynamic range to the pixels belonging to the background area, and outputs the pre-encoding image, and (4) compresses the pre-encoding image using a predetermined image encoding method. And an encoding unit for outputting compressed encoded data.

第２の本発明は、符号化装置と復号装置とが対向する画僧処理システムにおいて、前記符号化装置として、第１の本発明の符号化装置を適用したことを特徴とする。 A second present invention is characterized in that the coding device of the first present invention is applied as the coding device in an image processing system in which the coding device and the decoding device face each other.

第３の本発明は、符号化装置と復号装置とが対向する画僧処理システムにおいて、前記符号化装置として、第１の本発明の符号化装置を適用し、（１−１）前記符号化装置の前記入力画像は、前記背景領域に属する画素の内、前記注目領域を囲むＭ（Ｍは１以上の整数）重の画素の集合である境界領域と、前記背景領域に属する画素の内、前記境界領域以外の画素の集合である外部領域とを含み、前記符号化装置の前記前処理部は、（１−２）前記補正済み注目領域情報に基づき、前記入力画像の内、前記外部領域に属する画素を特定し、特定した前記外部領域に属する画素にダイナミックレンジを限定するフィルタリングを施す外部領域用前処理部と、（１−３）前記補正済み注目領域情報に基づき、前記入力画像の内、前記境界領域に属する画素を特定し、特定した前記境界領域に属する画素にダイナミックレンジを限定するフィルタリングを施す境界領域用前処理部とを有し、前記復号装置は、（２−１）入力された前記符号化データを、前記符号化部で使用した前記所定の画像符号化方式に対応する方式で復号し、復号画像として出力する復号部と、（２−２）前記補正済み注目領域情報に基づき、前記復号画像の内、前記外部領域に属する画素を特定し、特定した前記外部領域に属する画素にダイナミックレンジを回復するためのフィルタリングを施す外部領域用後処理部と、（２−３）前記補正済み注目領域情報に基づき、前記復号画像の内、前記境界領域に属する画素を特定し、特定した前記該境界領域に属する画素にダイナミックレンジを回復するためのフィルタリングを施す境界領域用後処理部とを有することを特徴とする。 According to a third aspect of the present invention, in the picture processing system in which the coding device and the decoding device face each other, the coding device of the first present invention is applied as the coding device, and (1-1) the coding Among the pixels belonging to the background area, the input image of the device is a boundary area which is a set of M (M is an integer of 1 or more) double pixels surrounding the area of interest, and of the pixels belonging to the background area The preprocessing unit of the encoding device includes (1-2) the external region of the input image based on the corrected attention region information. An external area preprocessing unit for specifying pixels belonging to the external area and performing filtering for limiting a dynamic range to the pixels belonging to the specified external area; (1-3) the input image based on the corrected attention area information Belongs to the boundary area And a preprocessing unit for boundary area that performs filtering to limit the dynamic range to the pixels that belong to the specified boundary area, and the decoding apparatus is configured to (2-1) input the encoded A decoding unit that decodes data according to a method corresponding to the predetermined image coding method used in the coding unit and outputs the decoded image as a decoded image; (2-2) the decoding based on the corrected attention area information An external area post-processing unit which identifies pixels belonging to the external area in the image and applies filtering for recovering the dynamic range to the pixels belonging to the identified external area; (2-3) the corrected attention A filter for identifying a pixel belonging to the boundary area among the decoded image based on area information, and restoring a dynamic range to the pixel belonging to the specified boundary area And having a for border region post-processing unit that performs ring.

第４の本発明の符号化プログラムは、入力画像を符号化する符号化装置に搭載されるコンピュータを、（１）前記入力画像に対応する注目領域を設定し、注目領域情報を出力する注目領域設定部と、（２）前記注目領域情報を補正し、補正済み注目領域情報を出力する注目領域座標補正部と、（３）前記補正済み注目領域情報に基づき、前記入力画像の内、背景領域に属する画素を特定し、特定した前記背景領域に属する画素にダイナミックレンジを限定するためのフィルタリングを施し、符号化前画像として出力する前処理部と、（４）所定の画像符号化方式を用いて、前記符号化前画像を圧縮し、圧縮した符号化データを出力する符号化部として機能させることを特徴とする。 A coding program according to a fourth aspect of the present invention sets a computer mounted on a coding apparatus for coding an input image, (1) sets a region of interest corresponding to the input image, and outputs a region of interest information A setting unit, (2) an attention area coordinate correction unit that corrects the attention area information and outputs the corrected attention area information, and (3) a background area of the input image based on the corrected attention area information. And a pre-processing unit that performs filtering to limit the dynamic range to the pixels belonging to the identified background area, and outputs the image as a pre-encoding image, and (4) using a predetermined image encoding method It is characterized in that it functions as an encoding unit that compresses the pre-encoding image and outputs compressed encoded data.

本発明によれば、背景領域とＲＯＩの境界領域で画素間の相互作用による画質低下を緩和できる。 According to the present invention, it is possible to alleviate the image quality deterioration due to the interaction between pixels in the boundary region between the background region and the ROI.

第１の実施形態に係る画像処理システムの構成について示すブロック図である。It is a block diagram shown about composition of an image processing system concerning a 1st embodiment. 第１の実施形態に係る画像処理システムの動作を示すフローチャートである。It is a flowchart which shows operation | movement of the image processing system which concerns on 1st Embodiment. 第１の実施形態に係るＲＯＩの大きさを拡大する処理の例を示す図である。It is a figure which shows the example of the process which expands the magnitude | size of ROI which concerns on 1st Embodiment. 第１の実施形態に係るブロックにおけるＲＯＩの正規化例を示す図である。It is a figure which shows the normalization example of ROI in the block which concerns on 1st Embodiment. 第２の実施形態に係る画像処理システムの構成について示すブロック図である。It is a block diagram shown about composition of an image processing system concerning a 2nd embodiment. 第２の実施形態に係る画像処理システムの動作を示すフローチャートである。It is a flowchart which shows operation | movement of the image processing system which concerns on 2nd Embodiment. 第２の実施形態に係る２重の境界領域を求めた場合の例を示す図である。It is a figure which shows the example at the time of calculating | requiring the double boundary area which concerns on 2nd Embodiment. 非特許文献１に代表される従来の画像処理システムの構成例を示す図である。FIG. 1 is a diagram showing a configuration example of a conventional image processing system represented by Non-Patent Document 1.

（Ａ）第１の実施形態
以下、本発明による符号化装置及びプログラム、並びに、画像処理システムの第１の実施形態を、図面を参照しながら詳述する。以下では、本発明の符号化装置を画像符号化装置に適用し、本発明の復号装置を画像復号装置に適用した例について説明する。 (A) First Embodiment Hereinafter, a first embodiment of an encoding device and program according to the present invention and an image processing system will be described in detail with reference to the drawings. Hereinafter, an example in which the encoding device of the present invention is applied to an image encoding device and the decoding device of the present invention is applied to an image decoding device will be described.

（Ａ−１）第１の実施形態の構成
図１は、第１の実施形態に係る画像処理システム１の構成について示すブロック図である。 (A-1) Configuration of First Embodiment FIG. 1 is a block diagram showing the configuration of an image processing system 1 according to the first embodiment.

画像処理システム１は、入力画像を符号化してストリーム（ビット列）を出力する画像符号化装置１０と、画像符号化装置１０で符号化されたストリーム（ビットストリーム）を復号して復号画像を出力する画像復号装置２０とを有している。 An image processing system 1 encodes an input image and outputs a stream (bit string), and an image encoding apparatus 10 decodes a stream (bit stream) encoded by the image encoding apparatus 10 and outputs a decoded image. And an image decoding apparatus 20.

画像符号化装置１０から出力されたストリームを画像復号装置２０に入力する媒体は限定されないものである。例えば、画像符号化装置１０から出力されたストリームを、通信（例えば、インターネット等）により画像復号装置２０に送信するようにしても良いし、画像符号化装置１０から出力されたストリームのデータをデータ記録媒体（例えば、ＤＶＤやＨＤＤ等の媒体）に記録してオフラインで画像復号装置２０に入力するようにしても良い。 The medium for inputting the stream output from the image encoding device 10 to the image decoding device 20 is not limited. For example, the stream output from the image encoding device 10 may be transmitted to the image decoding device 20 by communication (for example, the Internet or the like), or the stream data output from the image encoding device 10 may be data It may be recorded on a recording medium (for example, a medium such as a DVD or HDD) and input to the image decoding apparatus 20 offline.

この実施形態では、画像符号化装置１０は入力画像ごとの符号化を行うものとして説明するが、画像符号化装置１０に連続して複数の入力画像を処理させることで動画像の符号化処理に適用するようにしても良い。また、画像復号装置２０についても同様に、連続して複数の符号化データのストリームを復号処理させることにより、動画像の復号処理に適用するようにしても良い。 In this embodiment, the image encoding device 10 is described as performing encoding for each input image, but for the moving image encoding process by causing the image encoding device 10 to process a plurality of input images continuously. It may be applied. Similarly, the image decoding apparatus 20 may be applied to the decoding process of a moving image by decoding a stream of a plurality of encoded data successively.

次に、画像符号化装置１０の内部構成について説明する。図１は、第１の実施形態に係る画像符号化装置１０の機能的構成について示すブロック図でもある。画像符号化装置は、ハードウェア（例えば、専用の半導体チップ等）で構成するようにしても良いし、一部又は全部を、ソフトウェア的に構成するようにしても良い。 Next, the internal configuration of the image coding device 10 will be described. FIG. 1 is also a block diagram showing a functional configuration of the image coding device 10 according to the first embodiment. The image coding apparatus may be configured by hardware (for example, a dedicated semiconductor chip or the like), or some or all may be configured as software.

画像処理システム１では、図８の画像符号化装置Ｘ１が、画像符号化装置１０に置き換わっている点で従来の構成（画像処理システムＺ）と異なっている。以下では、第１の実施形態について従来との差異を中心に説明する。 The image processing system 1 is different from the conventional configuration (image processing system Z) in that the image coding device X1 of FIG. 8 is replaced with the image coding device 10. In the following, the first embodiment will be described focusing on differences from the prior art.

画像符号化装置１０は、入力画像に対応するＲＯＩを設定し、ＲＯＩ情報を出力するＲＯＩ設定部３０１、ＲＯＩ情報を補正し、補正済みＲＯＩ情報を出力するＲＯＩ座標補正部Ｕ３と、補正済みＲＯＩ情報に基づき背景領域に属する画素を特定し、入力画像のうちの背景領域に属する画素にダイナミックレンジを削減するフィルタリングを施し、符号化前画像として出力する前処理部Ｕ１、及びエンコーダ部３０６を有している。なお、エンコーダ部３０６の機能は前述の図８で説明したものと同様の機能を有する。 The image coding apparatus 10 sets an ROI corresponding to an input image, an ROI setting unit 301 that outputs ROI information, corrects ROI information, and an ROI coordinate correction unit U3 that outputs corrected ROI information; Based on the information, identify the pixels belonging to the background area, apply filtering to reduce the dynamic range to the pixels belonging to the background area of the input image, and output the pre-encoding image as the pre-processing unit U1 and the encoder unit 306 doing. The function of the encoder unit 306 has the same function as that described above with reference to FIG.

ＲＯＩ座標補正部Ｕ３は、ＲＯＩ情報に基づきＲＯＩの領域を拡大し、拡大ＲＯＩ情報として出力するＲＯＩ拡大部１０２と、拡大ＲＯＩ情報を正規化し、補正済みＲＯＩ情報として出力するＲＯＩ座標正規化部３０３とを有する。 The ROI coordinate correction unit U3 enlarges the region of the ROI based on the ROI information and outputs the ROI enlargement unit 102 that outputs it as enlarged ROI information, and the ROI coordinate normalization unit 303 that normalizes the enlarged ROI information and outputs it as corrected ROI information. And.

ＲＯＩ拡大部１０２やＲＯＩ座標正規化部３０３の機能や目的については動作の項で詳述する。 The functions and purposes of the ROI enlargement unit 102 and the ROI coordinate normalization unit 303 will be described in detail in the operation section.

また、後述のとおり、ＲＯＩ座標正規化部３０３の機能は、符号化効率を改善する目的の機能であるため、ＲＯＩ座標補正部Ｕ３がＲＯＩ拡大部１０２の機能のみを有し、ＲＯＩ拡大部１０２の出力が補正済みＲＯＩ情報となる構成であっても一定の効果は存在する。 Further, as described later, since the function of the ROI coordinate normalization unit 303 is a function to improve the coding efficiency, the ROI coordinate correction unit U3 has only the function of the ROI enlargement unit 102, and the ROI enlargement unit 102. Even if the output of is the corrected ROI information, a certain effect exists.

なお、前処理部Ｕ１と後処理部Ｕ２は、情報の多重化手段を用いてネットワーク等を介して通信するなど、任意の手段を用いて画像に対応する補正済みＲＯＩ情報を共有してことを前提とする。 The preprocessing unit U1 and the post-processing unit U2 share the corrected ROI information corresponding to the image using any means such as communicating via a network etc using the information multiplexing means. It assumes.

（Ａ−２）第１の実施形態の動作
次に、第１の実施形態に係る画像処理システム１の動作を、図面を参照しながら詳細に説明する。図２は、第１の実施形態に係る画像処理システムの動作を示すフローチャートである。以下では、画像符号化装置１０及び画像復号装置２０の動作を各々分けて説明する。 (A-2) Operation of First Embodiment Next, the operation of the image processing system 1 according to the first embodiment will be described in detail with reference to the drawings. FIG. 2 is a flowchart showing the operation of the image processing system according to the first embodiment. The operations of the image encoding device 10 and the image decoding device 20 will be separately described below.

（Ａ−２−１）画像符号化装置１０の動作
図２（Ａ）は、第１の実施形態に係る画像符号化装置の動作を示すフローチャートである。 (A-2-1) Operation of Image Encoding Device 10 FIG. 2A is a flowchart showing the operation of the image encoding device according to the first embodiment.

ＲＯＩ設定部３０１は、ＲＯＩ補助情報に基づき、ＲＯＩの初期値を決定し、ＲＯＩ情報として出力する（Ｓ１１）。 The ROI setting unit 301 determines an initial value of the ROI based on the ROI auxiliary information, and outputs it as ROI information (S11).

次に、ＲＯＩ拡大部１０２は、ＲＯＩの大きさをＮ画素だけ周辺方向に大きくし、拡大ＲＯＩ情報として出力する（Ｓ１２）。 Next, the ROI enlargement unit 102 enlarges the size of the ROI by N pixels in the peripheral direction, and outputs it as enlargement ROI information (S12).

図３は、第１の実施形態に係るＲＯＩの大きさを拡大する処理の例を示す図である。図３では、入力されたＲＯＩ部分（斜線部分）に対して、周辺方向に２画素分、ＲＯＩの大きさを拡大（膨張）させている例が示されている。膨張したＲＯＩの部分における各数字は、ＲＯＩ部分から拡大（膨張）した多重構造を示している（例えば、ＲＯＩ部分に直接接する領域は「１」と示されている）。拡大の方法には様々な方法があると考えられるが、典型的な例は、画像処理でしばしば使用される膨張（ｄｉｌａｔｉｏｎ）である。 FIG. 3 is a diagram showing an example of processing of enlarging the size of the ROI according to the first embodiment. FIG. 3 shows an example in which the size of the ROI is enlarged (expanded) by 2 pixels in the peripheral direction with respect to the input ROI portion (hatched portion). Each number in the dilated ROI portion indicates a multiple structure expanded (dilated) from the ROI portion (for example, a region directly in contact with the ROI portion is shown as “1”). There are various methods of dilation, but a typical example is dilation often used in image processing.

Ｎの値は任意であり、エンコーダ部３０６およびデコーダ部３０７において、何画素分の近隣の画素間が影響を与え合うかによって最適な値は変化する。 The value of N is arbitrary, and in the encoder unit 306 and the decoder unit 307, the optimal value changes depending on how many pixels in the vicinity affect and influence each other.

この実施形態においてＮの値は限定されないものであるが、多くのコーデックでは、４×４画素で形成されるブロックや８×８画素で形成されるブロックごとに圧縮を行うことが多いため、１≦Ｎ≦４か大きくても１≦Ｎ≦８とすることで良好な結果となることが期待できる。また、エンコーダ部３０６が十分に小さな量子化ステップ幅で圧縮する場合、変換および量子化に伴う相互作用は無視しても良い。前述のどおり、十分に小さな量子化ステップ幅で圧縮され、インループフィルタによる相互作用のみを考慮すれば良い場合で、なおかつ、ＡＶＣやＨＥＶＣを使用する場合は、これらの動画像符号化方式のインループフィルタは近隣２画素の画素値が染み出すので、Ｎ＝２またはその前後が好ましい値であると言える。 Although the value of N is not limited in this embodiment, many codecs perform compression for each block formed by 4 × 4 pixels and each block formed by 8 × 8 pixels. It can be expected that good results can be obtained by setting 1 ≦ N ≦ 8 even if ≦ N ≦ 4 or more. In addition, when the encoder unit 306 compresses with a sufficiently small quantization step width, the interaction associated with conversion and quantization may be ignored. As described above, in the case where compression is performed with a sufficiently small quantization step width and it is sufficient to consider only the interaction by the in-loop filter, and in the case of using AVC or HEVC, these video encoding schemes Since the loop filter exudes the pixel values of adjacent two pixels, it can be said that N = 2 or around is a preferable value.

Ｎが大きすぎると、むやみに符号量を多く割く領域を増やすことに繋がるため、使用するエンコーダ部３０６およびデコーダ部３０７に応じて適切な値とする必要がある。 If N is too large, it will lead to an increase in the area to divide the code amount by a large amount needlessly, so it is necessary to set an appropriate value according to the encoder unit 306 and decoder unit 307 to be used.

この処理の目的は、ＲＯＩの範囲を広げることで、フィルタを通過した低品質な背景領域の画素値の影響を受けてＲＯＩの画質が低下してしまうことを防ぐことである。ＲＯＩに多くの符号量を割く画像処理システム１において、ＲＯＩは特に高い品質を保証したい領域であることが多いため、ＲＯＩを膨張することでＲＯＩの品質を保証する。 The purpose of this process is to widen the range of the ROI to prevent the image quality of the ROI from being degraded due to the influence of the pixel values of the low quality background area passed through the filter. In the image processing system 1 in which a large amount of code is allocated to the ROI, the ROI is often an area where it is desired to guarantee particularly high quality, so the ROI is expanded to guarantee the quality of the ROI.

特にＲＯＩ設定部３０１において、ズームされた領域がＲＯＩとして設定されるようなアプリケーションの場合、ＲＯＩは拡大して表示されることになる。ここで、背景領域の染み出しによってＲＯＩが劣化していると、視聴者に顕著なアーティファクトを見せることになる。この実施形態の画像処理システム１を用いることで、このようなシーンでも品質の保証されたＲＯＩを視聴者に提供できる。 In particular, in the case of an application in which a zoomed area is set as an ROI in the ROI setting unit 301, the ROI is displayed in an enlarged manner. Here, if the ROI is degraded due to the exudation of the background area, the viewer will see a noticeable artifact. By using the image processing system 1 of this embodiment, it is possible to provide the viewer with an ROI with guaranteed quality even in such a scene.

次に、ＲＯＩ座標正規化部３０３は、背景領域とＲＯＩの境界がコーデックによる圧縮時のブロック境界に揃うように正規化し、補正済みＲＯＩ情報として出力する（Ｓ１３）。 Next, the ROI coordinate normalization unit 303 normalizes the boundary between the background area and the ROI so as to be aligned with the block boundary at the time of compression by the codec, and outputs it as corrected ROI information (S13).

正規化は、例えば、以下の（１）〜（３）の何れかの基準で行う。 The normalization is performed, for example, on the basis of any one of the following (1) to (3).

（１）予め定められた大きさのブロックに属する画素のうち１画素でもＲＯＩに含まれているならば、当該ブロックに含まれる画素はすべてＲＯＩに属するとする。 (1) If even one pixel among the pixels belonging to a block of a predetermined size is included in the ROI, it is assumed that all the pixels included in the block belong to the ROI.

（２）予め定められた大きさのブロックに属する画素のうち半数以上がＲＯＩに含まれているならば、当該ブロックに含まれる画素はすべてＲＯＩに属するとする。 (2) If half or more of the pixels belonging to a block of a predetermined size are included in the ROI, it is assumed that all the pixels included in the block belong to the ROI.

（３）予め定められた大きさのブロックに属するすべての画素がＲＯＩに含まれているときのみ、当該ブロックに含まれる画素はＲＯＩに属するとする。 (3) The pixels included in the block are considered to belong to the ROI only when all the pixels belonging to the block of the predetermined size are included in the ROI.

図４は、第１の実施形態に係るブロックにおけるＲＯＩの正規化例を示す図である。図４は、前述の基準例のうち（１）の基準で正規化する場合をイメージ化したものである。 FIG. 4 is a diagram showing an example of ROI normalization in the block according to the first embodiment. FIG. 4 is an image of the case where normalization is performed based on (1) of the reference examples described above.

図４では、各ブロック（Ｂ１〜Ｂ３）のサイズを幅４画素、高さ４画素としている。例えば、ブロックＢ２は正規化前にＲＯＩに含まれる画素が４であるため（つまり、１以上であるため）に、当該ブロックに含まれる画素はＲＯＩに属するとする。 In FIG. 4, the size of each block (B1 to B3) is 4 pixels wide and 4 pixels high. For example, it is assumed that the pixels included in the block B2 belong to the ROI because the number of pixels included in the ROI before normalization is 4 (that is, because it is 1 or more).

エンコーダ部３０６で採用されうる多くの画像符号化方式では、特定の大きさのブロックごとに予測したり変換したりすることで情報量を削減する。仮に、ブロック内に鋭いエッジが発生していると、多くのＡＣ成分が発生し、符号量増加の原因となる。つまり、ブロック内に不要なエッジを発生させないことが望ましく、そのために正規化によって背景領域とＲＯＩの境界を、画像符号化方式が採用するブロックの境界と一致させる。 In many image coding methods that can be adopted by the encoder unit 306, the amount of information is reduced by performing prediction or conversion for each block of a specific size. If a sharp edge is generated in the block, many AC components are generated, which causes an increase in the code amount. That is, it is desirable not to generate an unnecessary edge in the block, so that the boundary between the background region and the ROI is matched with the boundary of the block adopted by the image coding method by normalization.

前述のとおり、正規化には種々の方法が考えられ、全て一定の効果は備えるが、ＲＯＩ拡大部１０２の機能および目的と照らし合わせると、ＲＯＩに選ばれた画素が背景領域として処理されてしまうことは背景領域の画素値が本来のＲＯＩに染み出す可能性を生じさせるため好ましくなく、この観点では前述３つの基準のうち、（１）の基準が最も望ましい。 As described above, various methods can be considered for normalization, all of which have a certain effect, but the pixels selected as the ROI are treated as a background region in comparison with the function and purpose of the ROI enlargement unit 102 That is not preferable because it causes the possibility that the pixel values in the background area may leak into the original ROI, and in this aspect, the criterion (1) is most desirable among the above three criteria.

次に、前処理部Ｕ１は、補正済みＲＯＩ情報に基づき背景領域に属する画素を特定し、入力画像のうちの背景領域に属する画素にダイナミックレンジを限定するフィルタリングを施し、符号化前画像として出力する（Ｓ１４）。ダイナミックレンジを限定するフィルタリングは、例えば、非特許文献１に記載の技術を適用できる。ただし、この実施形態においては、ダイナミックレンジを限定する際の中心画素値については特に制約はなく、またダイナミックレンジを１にすることでもこの本発明は一定の効果を奏する。 Next, the preprocessing unit U1 specifies the pixels belonging to the background area based on the corrected ROI information, performs filtering for limiting the dynamic range to the pixels belonging to the background area in the input image, and outputs as an image before encoding To do (S14). For the filtering that limits the dynamic range, for example, the technology described in Non-Patent Document 1 can be applied. However, in this embodiment, the central pixel value at the time of limiting the dynamic range is not particularly limited, and even if the dynamic range is set to 1, the present invention exhibits certain effects.

次に、エンコーダ部３０６は、ＡＶＣ等の画像符号化方式を用いて、符号化前画像を圧縮し、ビットストリームを出力する（Ｓ１５）。 Next, the encoder unit 306 compresses the uncoded image using an image coding method such as AVC, and outputs a bit stream (S15).

（Ａ−２−２）画像復号装置Ｘ２の動作
図２（Ｂ）は、第１の実施形態に係る画像復号装置Ｘ２の動作を示すフローチャートである。 (A-2-2) Operation of Image Decoding Device X2 FIG. 2 (B) is a flowchart showing the operation of the image decoding device X2 according to the first embodiment.

まず、デコーダ部３０７は、画像符号化装置１０からビットストリームを受け取り、ＡＶＣ等の画像符号化方式を用いて復号し、復号画像を出力する（Ｓ２１）。 First, the decoder unit 307 receives a bit stream from the image coding apparatus 10, decodes it using an image coding method such as AVC, and outputs a decoded image (S21).

後処理部Ｕ２は、補正済みＲＯＩ情報に基づき背景領域に属する画素を特定し、復号画像のうちの背景領域に属する画素にダイナミックレンジを回復するフィルタリングを施し、出力画像として出力する（Ｓ２２）。なお、ダイナミックレンジを回復するフィルタリングは、前処理部Ｕ１と同様に非特許文献１に記載の技術を適用できるが、ダイナミックレンジを回復するフィルタであれば、非特許文献１の方法に限定されない。 The post-processing unit U2 specifies pixels belonging to the background area based on the corrected ROI information, performs filtering for recovering the dynamic range on pixels belonging to the background area in the decoded image, and outputs the result as an output image (S22). The filtering for recovering the dynamic range can be applied to the technique described in Non-Patent Document 1 as in the case of the preprocessing unit U1, but the filter is not limited to the method described in Non-Patent Document 1 as long as it is a filter for recovering the dynamic range.

（Ａ−３）第１の実施形態の効果
以上のように、第１の実施形態によれば、ＲＯＩ拡大部１０２において、ＲＯＩの大きさをＮ画素だけ周辺方向に大きく拡大していることで、フィルタを通過した低品質な背景領域の画素値の影響を受けて本来のＲＯＩの画質が低下してしまうことを防止することができる。 (A-3) Effects of the First Embodiment As described above, according to the first embodiment, in the ROI enlargement unit 102, the size of the ROI is greatly enlarged in the peripheral direction by N pixels. It is possible to prevent the image quality of the original ROI from being degraded due to the influence of the pixel values of the low quality background area passed through the filter.

（Ｂ）第２の実施形態
以下、本発明による符号化装置及びプログラム、並びに、画像処理システムの第２の実施形態を、図面を参照しながら詳述する。以下では、本発明の符号化装置を画像符号化装置に適用し、本発明の復号装置を画像復号装置に適用した例について説明する。 (B) Second Embodiment Hereinafter, a second embodiment of the encoding device and program according to the present invention and the image processing system will be described in detail with reference to the drawings. Hereinafter, an example in which the encoding device of the present invention is applied to an image encoding device and the decoding device of the present invention is applied to an image decoding device will be described.

（Ｂ−１）第２の実施形態の構成
図５は、第２の実施形態に係る画像処理システム１Ａの構成について示すブロック図である。 (B-1) Configuration of Second Embodiment FIG. 5 is a block diagram showing the configuration of an image processing system 1A according to the second embodiment.

画像処理システム１Ａでは、図１の画像符号化装置１０と画像復号装置Ｘ２が、画像符号化装置１０Ａと画像復号装置２０に置き換わっている点で第１の実施形態と異なっている。以下では、第２の実施形態について第１の実施形態との差異を中心に説明する。 The image processing system 1A is different from the first embodiment in that the image coding device 10 and the image decoding device X2 in FIG. 1 are replaced with the image coding device 10A and the image decoding device 20. Hereinafter, the second embodiment will be described focusing on differences from the first embodiment.

第２の実施形態の画像符号化装置１０Ａにおいて、前処理部Ｕ４は、補正済みＲＯＩ情報に基づき外部領域に属する画素を特定し、入力画像のうちの外部領域に属する画素にダイナミックレンジを限定するフィルタリングを施し、前処理中間画像として出力する外部領域用前処理部３０４と、補正済みＲＯＩ情報に基づき境界領域に属する画素を特定し、前処理中間画像のうちの境界領域に属する画素にダイナミックレンジを限定するフィルタリングを施し、符号化前画像として出力する境界領域用前処理部２０５とを有している。 In the image encoding device 10A of the second embodiment, the preprocessing unit U4 specifies pixels belonging to the external region based on the corrected ROI information, and limits the dynamic range to pixels belonging to the external region in the input image. An external area preprocessing unit 304 that performs filtering and outputs as a preprocessing intermediate image, and identifies pixels belonging to the boundary area based on the corrected ROI information, and dynamic range to pixels belonging to the boundary area of the preprocessing intermediate image And a boundary region pre-processing unit 205 that performs filtering to limit and outputs as a pre-coding image.

また、境界領域とは、背景領域に属する画素のうちのＲＯＩを囲むＭ重の画素の集合であり、外部領域とは、背景領域に属する画素のうちの境界領域以外の画素の集合である。なお、外部領域用前処理部３０４や境界領域用前処理部２０５の詳細については動作の項で述べる。 The boundary region is a set of M-fold pixels surrounding the ROI among the pixels belonging to the background region, and the outer region is a set of pixels other than the boundary region among the pixels belonging to the background region. The details of the external area preprocessing unit 304 and the boundary area preprocessing unit 205 will be described in the section of operation.

この実施形態では、外部領域用前処理部３０４が境界領域用前処理部２０５よりも先に適用されるような構成としているが、外部領域用前処理部３０４が処理対象とする画素と境界領域用前処理部２０５が処理対象とする画素は異なるため、境界領域用前処理部２０５が外部領域用前処理部３０４よりも先に適用されても良く、外部領域用前処理部３０４と境界領域用前処理部２０５が同時に適用されても良い。つまり、前処理部Ｕ４が外部領域用前処理部３０４と境界領域用前処理部２０５の機能により構成されていることに意味を有する。 In this embodiment, the external area preprocessing unit 304 is configured to be applied earlier than the boundary area preprocessing unit 205. However, the pixels and the boundary area to be processed by the external area preprocessing unit 304 are to be processed. Since the pixels to be processed by the pre-processing unit 205 are different, the boundary area pre-processing unit 205 may be applied prior to the external area pre-processing unit 304, and the external area pre-processing unit 304 and the boundary area may be applied. The pre-processing unit 205 may be applied simultaneously. That is, it has a meaning that the preprocessing unit U4 is configured by the functions of the external region preprocessing unit 304 and the boundary region preprocessing unit 205.

第２の実施形態の画像復号装置２０において、後処理部Ｕ５は、補正済みＲＯＩ情報に基づき外部領域に属する画素を特定し、復号画像のうちの外部領域に属する画素にダイナミックレンジを回復するためのフィルタリングを施し、後処理中間画像として出力する外部領域用後処理部３０８と、補正済みＲＯＩ情報に基づき境界領域に属する画素を特定し、後処理中間画像のうちの境界領域に属する画素にダイナミックレンジを回復するためのフィルタリングを施し、出力画像として出力する境界領域用後処理部２０９とを有している。なお、外部領域用後処理部３０８や境界領域用後処理部２０９の詳細については動作の項で述べる。 In the image decoding apparatus 20 according to the second embodiment, the post-processing unit U5 identifies pixels belonging to the external area based on the corrected ROI information, and restores the dynamic range to pixels belonging to the external area of the decoded image. And an external region post-processing unit 308 that outputs the post-processing intermediate image, and identifies pixels belonging to the boundary region based on the corrected ROI information, and dynamically applies pixels belonging to the boundary region of the post-processed intermediate image It has a boundary area post-processing unit 209 which performs filtering for recovering the range and outputs it as an output image. The details of the external area post-processing unit 308 and the boundary area post-processing unit 209 will be described in the section of operation.

この実施形態では、外部領域用後処理部３０８が境界領域用後処理部２０９よりも先に適用されるような構成としているが、外部領域用後処理部３０８が処理対象とする画素と境界領域用後処理部２０９が処理対象とする画素は異なるため、境界領域用後処理部２０９が外部領域用後処理部３０８よりも先に適用されても良く、外部領域用後処理部３０８と境界領域用後処理部２０９が同時に適用されても良い。つまり、後処理部Ｕ５が外部領域用後処理部３０８と境界領域用後処理部２０９の機能により構成されていることに意味を有する。 In this embodiment, the external area post-processing unit 308 is configured to be applied before the boundary area post-processing unit 209, but the pixels and the boundary area to be processed by the external area post-processing unit 308 Since the pixels to be processed by the post-processing unit 209 are different, the boundary region post-processing unit 209 may be applied before the external region post-processing unit 308, and the external region post-processing unit 308 and the boundary region The post-processing unit 209 may be applied simultaneously. That is, it has meaning that the post-processing unit U5 is configured by the functions of the external region post-processing unit 308 and the boundary region post-processing unit 209.

なお、前処理部Ｕ４と後処理部Ｕ５は、情報の多重化手段を用いてネットワーク等を介して通信するなど、任意の手段を用いて画像に対応する補正済みＲＯＩ情報を共有しているものとする。 The preprocessing unit U4 and the post-processing unit U5 share the corrected ROI information corresponding to the image using any means such as communicating via a network etc using the information multiplexing means I assume.

なお、後述する通り、外部領域用前処理部３０４と境界領域用前処理部２０５の組み合わせ及び外部領域用後処理部３０８と境界領域用後処理部２０９の組み合わせを導入する目的の一つは、ＲＯＩ座標正規化部３０３のより良い代替手段を提供することにある。そのため、ＲＯＩ座標補正部Ｕ３がＲＯＩ座標正規化部３０３の機能を備えず、拡大ＲＯＩ情報が補正済みＲＯＩ情報として出力する構成であっても、本発明は一定の効果を奏する。 As described later, one of the purposes for introducing the combination of the external area preprocessing unit 304 and the boundary area preprocessing unit 205 and the combination of the external area post processing unit 308 and the boundary area post processing unit 209 is It is to provide a better alternative to the ROI coordinate normalization unit 303. Therefore, even if the ROI coordinate correction unit U3 does not have the function of the ROI coordinate normalization unit 303 and outputs the enlarged ROI information as the corrected ROI information, the present invention has a certain effect.

また、外部領域用前処理部３０４と境界領域用前処理部２０５の組み合わせ及び外部領域用後処理部３０８と境界領域用後処理部２０９の組み合わせを導入する目的は、ＲＯＩ座標正規化部３０３のより良い代替手段を提供すること以外にもあるので、第１の実施形態のようにＲＯＩ座標補正部Ｕ３がＲＯＩ拡大部１０２の機能とＲＯＩ座標正規化部３０３の機能とを備える場合であっても、本発明は一定の効果を奏する。 Further, the purpose of introducing the combination of the external region preprocessing unit 304 and the boundary region preprocessing unit 205 and the combination of the external region post-processing unit 308 and the boundary region post-processing unit 209 is that the ROI coordinate normalization unit 303 In addition to providing a better alternative, in the case where the ROI coordinate correction unit U3 includes the function of the ROI enlargement unit 102 and the function of the ROI coordinate normalization unit 303 as in the first embodiment, Also, the present invention has certain effects.

また、後述する通り、外部領域用前処理部３０４と境界領域用前処理部２０５の組み合わせ及び外部領域用後処理部３０８と境界領域用後処理部２０９の組み合わせを導入することで、たとえＲＯＩ座標補正部Ｕ３がＲＯＩ拡大部１０２の機能や、ＲＯＩ拡大部１０２の機能とＲＯＩ座標正規化部３０３の両方の機能を備えなかったとしても、背景領域とＲＯＩの境界に鋭いエッジが発生しなくなるため（エッジの変化量が緩くなるため）、背景領域とＲＯＩの境界領域で発生する画素間の相互作用による染み出しによる背景領域やＲＯＩの品質劣化は抑えられる。そのため、ＲＯＩ座標補正部Ｕ３がＲＯＩ拡大部１０２の機能や、ＲＯＩ拡大部１０２の機能とＲＯＩ座標正規化部３０３の両方の機能を備えなかったとしても、本発明は一定の効果を発揮する。 Also, as described later, even if ROI coordinates are obtained by introducing the combination of the external region preprocessing unit 304 and the boundary region preprocessing unit 205 and the combination of the external region post processing unit 308 and the boundary region post processing unit 209. Even if the correction unit U3 does not have the function of the ROI enlargement unit 102 or the function of both the ROI enlargement unit 102 and the ROI coordinate normalization unit 303, sharp edges are not generated at the boundary between the background region and the ROI. (Because the amount of change in the edge becomes loose), deterioration of the quality of the background area and the ROI due to the exudation due to the interaction between the pixels occurring at the boundary area of the background area and the ROI can be suppressed. Therefore, even if the ROI coordinate correction unit U3 does not have the function of the ROI enlargement unit 102 or the function of both the ROI enlargement unit 102 and the ROI coordinate normalization unit 303, the present invention exerts a certain effect.

（Ｂ−２）第２の実施形態の動作
図６は、第２の実施形態に係る画像処理システムの動作を示すフローチャートである。図６と前述の図２の内、同一符号の処理は同様の処理であるので、以下では、第２の実施形態の画像処理システム１Ａの特有の処理のみついて述べる。 (B-2) Operation of Second Embodiment FIG. 6 is a flowchart showing an operation of the image processing system according to the second embodiment. In FIG. 6 and FIG. 2 described above, the processes with the same reference numerals are the same processes, so only the process specific to the image processing system 1A of the second embodiment will be described below.

（Ｂ−２−１）画像符号化装置１０Ａの動作
図６（Ａ）は、第２の実施形態に係る画像符号化装置１０Ａの動作を示すフローチャートである。 (B-2-1) Operation of Image Encoding Device 10A FIG. 6A is a flowchart showing the operation of the image encoding device 10A according to the second embodiment.

前述のステップＳ１３の後、外部領域用前処理部３０４は、補正済みＲＯＩ情報に基づき外部領域に属する画素を特定し、入力画像のうちの外部領域に属する画素にダイナミックレンジを限定するフィルタリングを施し、前処理中間画像として出力する（Ｓ３１）。このステップＳ３１の処理は、第１の実施形態における前処理部Ｕ１に相当する機能を担っている。外部領域用前処理部３０４は、外部領域に属する画素のダイナミックレンジを、外部領域に属する画素向けのダイナミックレンジ（以下、「Ｒ_０」で示す）に限定する。 After the above-described step S13, the external region preprocessing unit 304 identifies the pixels belonging to the external region based on the corrected ROI information, and applies filtering for limiting the dynamic range to the pixels belonging to the external region in the input image. , And output as a preprocessed intermediate image (S31). The process of step S31 has a function corresponding to the pre-processing unit U1 in the first embodiment. The external area preprocessing unit 304 limits the dynamic range of the pixels belonging to the external area to the dynamic range (hereinafter referred to as “R ₀ ”) for the pixels belonging to the external area.

次に、境界領域用前処理部２０５は、補正済みＲＯＩ情報に基づき境界領域に属する画素を特定し、前処理中間画像のうちの境界領域に属する画素にダイナミックレンジを限定するフィルタリングを施し、符号化前画像として出力する（Ｓ３２）。 Next, the boundary region pre-processing unit 205 identifies pixels belonging to the boundary region based on the corrected ROI information, performs filtering to limit the dynamic range to pixels belonging to the boundary region among the preprocessing intermediate images, and Output as an image before conversion (S32).

ここで境界領域とは前述のとおり、背景領域に属する画素のうちのＲＯＩを囲むＭ重の画素の集合である。ＲＯＩを囲む画素の集合は、例えば、画像処理でしばしば使用される膨張（ｄｉｌａｔｉｏｎ）を使用すると求められる。ＲＯＩに対して１度膨張を適用し、新たに広がった領域が１重目の境界領域である。さらにもう１度膨張を適用し、新たに広がった領域が２重目の境界領域である。図７は、第２の実施形態に係る２重の境界領域を求めた場合の例を示す図である。 Here, as described above, the boundary area is a set of M-fold pixels surrounding the ROI among the pixels belonging to the background area. The set of pixels surrounding the ROI is, for example, determined using dilation often used in image processing. The first dilation is applied to the ROI, and the newly expanded region is the first boundary region. The expansion is applied one more time, and the newly expanded area is the second boundary area. FIG. 7 is a diagram showing an example in the case where double boundary regions according to the second embodiment are obtained.

なお、境界領域用前処理部２０５は、境界領域に属する画素のダイナミックレンジを、外部領域に属する画素向けのダイナミックレンジＲ_０よりも広いダイナミックレンジに限定する（即ち、外部領域よりも高画質に圧縮する）ことを特徴とする。つまり、境界領域は、ＲＯＩと外部領域の質感のギャップの緩衝帯であると言える。境界領域が２重以上になっている場合、ｘ重目の境界領域とｘ＋１重目の境界領域では、それぞれダイナミックレンジが異なっていても良く、この場合、ＲＯＩ側に位置する境界領域のダイナミックレンジＲ_ｘは、その外側に位置する境界領域のダイナミックレンジＲ_ｘ＋１以上の大きさのダイナミックレンジとする。 Note that the boundary area preprocessing unit 205 limits the dynamic range of the pixels belonging to the boundary area to a dynamic range wider than the dynamic range R ₀ for pixels belonging to the external area (that is, higher image quality than the external area). Compressed). That is, it can be said that the boundary area is a buffer zone of the gap between the ROI and the texture of the external area. When the border area is double or more, the dynamic range may be different between the x-th border area and the x + 1-th border area. In this case, the dynamic range of the border area located on the ROI side It is assumed that R _x is a dynamic range of a size greater than or equal to the dynamic range R _{x + 1 of} the boundary region located outside of it.

つまり、ＲＯＩのダイナミックレンジをＲ_ｒとすると、下記の（１）式の関係を満たすようにする。
Ｒ_０＜．．．≦Ｒ_ｘ＋１≦Ｒ_ｘ≦．．．、≦Ｒ_１＜Ｒ_ｒ …（１） That is, assuming that the dynamic range of the ROI is R _r , the relationship of the following equation (1) is satisfied.
R ₀ <. . . ≦ R _{x + 1} ≦ R _x ≦. . . , R R ₁ <R _r (1)

図７において各画素内に記載している数値は、画素のダイナミックレンジの例であり、ＲＯＩのダイナミックレンジを最も大きくＲ_ｒ＝２５６に、１重目の境界領域のダイナミックレンジをＲ１＝１２８に、２重目の境界領域のダイナミックレンジをＲ_２＝６４に、外部領域のダイナミックレンジをＲ_０＝３２にしている場合を示している。なお、何重目にどれくらいのダイナミックレンジを与えるかは、線形な変化や非線形な変化など種々考えられるが、いずれにしても、例えば、予め決めておくものとし、前処理部Ｕ４と後処理部Ｕ５で共有できているものとする。 The numerical values described in each pixel in FIG. 7 are an example of the dynamic range of the pixel, and the dynamic range of the ROI is maximized to R _r = 256 and the dynamic range of the first boundary region to R 1 = 128. The dynamic range of the _second boundary region is R ₂ = 64, and the dynamic range of the outer region is R ₀ = 32. In addition, although it is variously considered how many dynamic ranges are given to what number of layers, such as a linear change and a non-linear change, in any case, for example, it shall be decided in advance, the pre-processing unit U4 and the post-processing unit It shall be shared with U5.

第２の実施形態において、境界領域を設けている理由は、主に以下の通りである。 In the second embodiment, the reason for providing the boundary area is mainly as follows.

第１の実施形態では、ブロック内で背景領域とＲＯＩの境界に起因するエッジが入ることによる符号量の増加を問題視し、その解決策としてＲＯＩ座標正規化部３０３を使用した。しかしながら、この解決策は、問題の緩和に寄与するものの、ＲＯＩ相当の符号量を割かなければならない画素の数を増やし、不必要な符号量を発生させている点では未だ効率向上の余地を残している。第２の実施形態では、外部領域とＲＯＩを、境界領域を介して滑らかに接続することで、ＲＯＩ相当の符号量を割かなければならない画素の増加を最小限に留めつつ、ブロック内で背景領域とＲＯＩの境界に起因するエッジが入ることによる符号量の増加も阻止する。 In the first embodiment, the increase in the code amount due to the entry of an edge due to the boundary between the background region and the ROI in the block is considered as a problem, and the ROI coordinate normalization unit 303 is used as a solution. However, although this solution contributes to the alleviation of the problem, it still leaves room for efficiency improvement in that it increases the number of pixels that must be allocated code amounts equivalent to the ROI and generates unnecessary code amounts. ing. In the second embodiment, the external area and the ROI are connected smoothly through the boundary area, thereby minimizing the increase in the number of pixels for which the code equivalent to the ROI must be allocated, and the background area in the block. It also prevents an increase in the code amount due to the edge entering due to the boundary between and and the ROI.

また、エンコーダ部３０６とデコーダ部３０７にデブロッキングフィルタ等のインループフィルタを使用するコーデックを用いる場合、たとえ背景領域とＲＯＩの境界をブロック境界に揃えたとしても、インループフィルタが背景領域とＲＯＩの境界領域で画素間の相互作用を発生させる。ＲＯＩ拡大部１０２が存在すれば、ＲＯＩの品質が損なわれることはないが、背景領域にはＲＯＩの画素値が染み出すことになる。背景領域の画素にとっては、質感の異なるＲＯＩの画素値はノイズになってしまう。さらに、後処理部Ｕ２を施すと、後処理部Ｕ２は振幅の増幅処理に相当するため、ＲＯＩの近隣の背景領域ではＲＯＩの画素値の染み出しによるノイズを増幅することになる。結果として、背景領域の品質が必要以上に損なわれることになる。インループフィルタは、隣接する画素値間のギャップが大きいほど強くかかるため、第２の実施形態では、外部領域とＲＯＩを、境界領域を介して滑らかに接続することで、本問題の緩和を行う。 When a codec using an in-loop filter such as a deblocking filter is used for the encoder unit 306 and the decoder unit 307, the in-loop filter is a background region and ROI even if the boundary between the background region and the ROI is aligned with the block boundary. Generate inter-pixel interactions in the border region of If the ROI enlargement unit 102 is present, the quality of the ROI is not impaired, but the pixel value of the ROI exudes to the background area. For pixels in the background area, pixel values of ROIs with different textures become noise. Further, when the post-processing unit U2 is applied, since the post-processing unit U2 corresponds to the amplification processing of the amplitude, the noise due to the exudation of the pixel value of the ROI is amplified in the background region near the ROI. As a result, the quality of the background area is lost more than necessary. Since the in-loop filter becomes stronger as the gap between adjacent pixel values becomes larger, in the second embodiment, this problem is alleviated by smoothly connecting the external region and the ROI via the boundary region. .

そして、外部領域とＲＯＩを、境界領域を介して滑らかに接続することで、背景領域と隣接しているＲＯＩのブロック、またはＲＯＩと隣接している背景領域のブロックがイントラブロックモード（ＡＶＣやＨＥＶＣ等で定義されているモード）で符号化されるときに、イントラ予測が効きやすくなり、符号化効率が改善する。 Then, by connecting the external area and the ROI smoothly through the boundary area, the block of the ROI adjacent to the background area or the block of the background area adjacent to the ROI is in the intra block mode (AVC or HEVC Etc. intra-prediction becomes more effective and coding efficiency improves.

なお、Ｍの大きさについては任意であるが、前処理部Ｕ４と後処理部Ｕ５では共有されているものとする。また、第１の実施形態におけるＮの値と同様に、エンコーダ部３０６およびデコーダ部３０７において、何画素分の近隣の画素間が影響を与え合うか（つまりインループフィルタのカーネルサイズ）によって最適な値は変化する。多くのコーデックでは、４×４画素で形成されるブロックや８×８画素で形成されるブロックごとに圧縮を行うことが多いため、１≦Ｍ≦４か大きくても１≦Ｍ≦８とすることで良好な結果となることが期待できる。また、ＨＡＶＣやＨＥＶＣのインループフィルタでは、近隣２画素の画素値が染み出すので、Ｍ＝２またはその前後が最も好ましい値であると言える。 Although the size of M is arbitrary, it is assumed that they are shared by the pre-processing unit U4 and the post-processing unit U5. Further, like the value of N in the first embodiment, in the encoder unit 306 and the decoder unit 307, it is optimal depending on how many pixels of adjacent pixels affect each other (that is, kernel size of in-loop filter). The value changes. In many codecs, compression is often performed for each block formed by 4 × 4 pixels and each block formed by 8 × 8 pixels, so 1 ≦ M ≦ 4 or at most 1 ≦ M ≦ 8. Good results can be expected. Further, in the in-loop filter of HAVC or HEVC, the pixel values of two neighboring pixels exude, so it can be said that M = 2 or around is the most preferable value.

Ｎと同様に、Ｍが大きすぎると、むやみに符号量を多く割く領域を増やすことに繋がるため、使用するエンコーダ部３０６およびデコーダ部３０７に応じて適切な値とする必要がある。 As in the case of N, if M is too large, it will lead to an increase in the area for dividing the code amount by much, so it is necessary to set an appropriate value according to the encoder unit 306 and decoder unit 307 used.

（Ｂ−２−２）画像復号装置２０の動作
図６（Ｂ）は、第２の実施形態に係る画像復号装置２０の動作を示すフローチャートである。 (B-2-2) Operation of Image Decoding Device 20 FIG. 6 (B) is a flowchart showing the operation of the image decoding device 20 according to the second embodiment.

前述のステップＳ２１の後、外部領域用後処理部３０８は、補正済みＲＯＩ情報に基づき外部領域に属する画素を特定し、復号画像のうちの外部領域に属する画素にダイナミックレンジを回復するフィルタリングを施し、後処理中間画像として出力する（Ｓ４１）。なお、このステップＳ４１の処理は、外部領域用前処理部３０４の逆演算に相当し、第１の実施形態における後処理部Ｕ２に相当する機能を担っている。 After the above-described step S21, the external region post-processing unit 308 identifies the pixels belonging to the external region based on the corrected ROI information, and applies filtering for recovering the dynamic range to the pixels belonging to the external region in the decoded image. , And output as a post-processing intermediate image (S41). The process of step S41 corresponds to the inverse operation of the external region preprocessing unit 304, and has a function corresponding to the post-processing unit U2 in the first embodiment.

境界領域用後処理部２０９は、補正済みＲＯＩ情報に基づき境界領域に属する画素を特定し、後処理中間画像のうちの境界領域に属する画素にダイナミックレンジを回復するフィルタリングを施し、出力画像として出力する（Ｓ４２）。なお、このステップＳ４２の処理は、境界領域用前処理部２０５の逆演算に相当する機能を担っている。 The boundary region post-processing unit 209 identifies pixels belonging to the boundary region based on the corrected ROI information, performs filtering to restore the dynamic range to pixels belonging to the boundary region of the post-processing intermediate image, and outputs as an output image (S42). The process of step S42 has a function equivalent to the inverse operation of the boundary area preprocessing unit 205.

（Ｂ−３）第２の実施形態の効果
以上のように、第２の実施形態によれば、前述の第１の実施形態の効果に加えて、以下の効果を奏する。 (B-3) Effects of the Second Embodiment As described above, according to the second embodiment, in addition to the effects of the first embodiment described above, the following effects can be obtained.

背景領域を外部領域と境界領域に分け、それぞれの信号を異なるダイナミックレンジに限定するフィルタを適用していることで、ＲＯＩ座標補正部Ｕ３でＲＯＩ座標正規化部３０３の機能を有効にする場合に比べて、ＲＯＩ相当の符号量を割かなければならない画素の増加を最小限に留めつつ、ブロック内で背景領域とＲＯＩの境界に起因するエッジが入ることによる符号量の増加も阻止できる。 When the function of the ROI coordinate normalization unit 303 is enabled in the ROI coordinate correction unit U3 by dividing the background region into the external region and the boundary region and applying a filter that limits each signal to different dynamic ranges In comparison, it is possible to prevent an increase in the code amount due to the edge due to the boundary between the background region and the ROI entering in the block while minimizing the increase in pixels that must be allocated the code amount equivalent to the ROI.

さらに、ＲＯＩと背景領域の境界が滑らかに変化するようになり、背景領域の品質が必要以上に損なわれるという問題が緩和されたり、背景領域と隣接しているＲＯＩのブロック、またはＲＯＩと隣接している背景領域のブロックがイントラブロックモードで符号化されるときに、イントラ予測が効きやすくなって符号化効率が改善できる。 Furthermore, the boundary between the ROI and the background area smoothly changes, and the problem that the quality of the background area is lost more than necessary is alleviated, or the block of the ROI adjacent to the background area or the ROI is adjacent When a block in the background region is encoded in the intra block mode, intra prediction is more effective and coding efficiency can be improved.

（Ｃ）他の実施形態
前述した第１及び第２の実施形態においても種々の変形実施形態を言及したが、本発明は、以下の変形実施形態にも適用できる。 (C) Other Embodiments Although various modified embodiments are mentioned in the first and second embodiments described above, the present invention can also be applied to the following modified embodiments.

（Ｃ−１）前述した第１及び第２の実施形態において、上記では説明を簡単にするために、画像全体がＲＯＩと背景領域の２種類に区別される例を示しているが、背景領域にも重要度の段階があっても良い。例えば、ＲＯＩの設定に歩行者検出アルゴリズムと顔検出アルゴリズムを使用し、顔検出アルゴリズムによって検出されたＲＯＩは顔領域として、歩行者検出アルゴリズムによって検出された歩行者領域は相対的に重要な背景領域とし、顔でも歩行者でもない領域は相対的に重要ではない背景領域としても良い。この場合、背景領域の重要度に応じて異なるダイナミックレンジに異なる値を割り振って良く、例えば、相対的に重要な背景領域には広いダイナミックレンジで圧縮を行い、相対的に重要ではない背景領域には狭いダイナミックレンジで圧縮を行うという構成でも良い。割り振りは、例えば予め設定あるいは作成しておいた背景領域の重要度とフィルタ強度のルックアップテーブルに基づいて行えば良い。また、上記では説明を簡単にするためにＲＯＩにはフィルタリングを適用しない例を示しているが、ＲＯＩに背景領域よりも弱いフィルタリングをかける構成であっても良い。 (C-1) In the first and second embodiments described above, an example is shown in which the entire image is divided into two types, ROI and background area, in order to simplify the description. There may also be a level of importance. For example, a pedestrian detection algorithm and a face detection algorithm are used for setting an ROI, and an ROI detected by the face detection algorithm is a face area, and a pedestrian area detected by the pedestrian detection algorithm is a relatively important background area An area that is neither a face nor a pedestrian may be a background area that is relatively unimportant. In this case, different values may be allocated to different dynamic ranges according to the importance of the background area, for example, relatively important background areas are compressed with a wide dynamic range, and relatively less important background areas are compressed. May be compressed in a narrow dynamic range. The allocation may be performed, for example, based on the background area importance and the filter strength look-up table set or created in advance. Further, although an example in which filtering is not applied to the ROI is described above for the sake of simplicity of the description, the ROI may be filtered more weakly than the background region.

（Ｃ−２）前述した第１及び第２の実施形態において、例えば、顔領域をＲＯＩとしたときの背景領域を劣化させデータ量を削減する例を示しているが、顔領域のようにＲＯＩとして選ばれる領域を劣化させても良い。この場合、本明細書における言葉の定義上は、非顔領域がＲＯＩであり、顔領域が背景領域である。非顔領域をＲＯＩとすることで、例えばプライバシーを守ることができるなどの効果も得られる。 (C-2) In the first and second embodiments described above, for example, there is shown an example in which the background area is degraded when the face area is set as the ROI to reduce the data amount, but the ROI like the face area The area selected as may be degraded. In this case, in the definition of words in the present specification, the non-face area is the ROI, and the face area is the background area. By setting the non-face area as the ROI, for example, the effect of being able to protect privacy can be obtained.

（Ｃ−３）前述した第１及び第２の実施形態において、上記では機能ブロック間のデータの受け渡しを画像単位でおこなっているように記述しているが、画素値信号のデータの意味を明らかにするためであり、実装等では画素単位にデータを受け渡ししても良い。 (C-3) In the first and second embodiments described above, it is described that data exchange between functional blocks is performed in image units, but the meaning of data of pixel value signals is clarified In implementation etc., data may be delivered in pixel units.

（Ｃ−４）前述した第１及び第２の実施形態において、ＲＯＩとは主に画像空間の一部の限定された領域のように記載しているが、画像全体がＲＯＩであったり画像全体が背景領域であったりする画像が存在しても良い。 (C-4) In the first and second embodiments described above, the ROI is mainly described as a limited area of a part of the image space, but the entire image is the ROI or the entire image There may be an image in which the background area is the background area.

（Ｃ−５）前述した第１及び第２の実施形態において、画像符号化装置１０（１０、１０Ａ）内の各機能や画像復号装置Ｘ２、２０内の各機能は単一の装置のなかに実装されているように記述しているが、これらは実装例の一つであり、各機能が別の装置で実装されていたとしても、各信号が各構成図（図１、図５）のように入出力がなされていれば本発明の効果は得られる。つまり、画像符号化装置１０（１０、１０Ａ）や画像復号装置Ｘ２、２０が１個以上の装置から構成され、それぞれの機能が分散して実装されていたとしても本発明の効果は存在する。 (C-5) In the first and second embodiments described above, each function in the image encoding device 10 (10, 10A) and each function in the image decoding device X2, 20 are included in a single device. Although described as being implemented, these are only one implementation example, and even if each function is implemented by another device, each signal is shown in each configuration diagram (FIG. 1, FIG. 5). If the input / output is performed as described above, the effect of the present invention is obtained. That is, even if the image encoding device 10 (10, 10A) and the image decoding devices X2 and 20 are configured of one or more devices and the respective functions are distributed and implemented, the effects of the present invention still exist.

（Ｃ−６）前述した第１及び第２の実施形態において、色成分ごとの処理の違いについては特別に触れていないが、各実施形態において、色成分ごとに同じダイナミックレンジを使用しても良いし、異なるダイナミックレンジを使用しても良い。 (C-6) In the first and second embodiments described above, differences in processing for each color component are not particularly mentioned, but in each embodiment, the same dynamic range may be used for each color component. You can use different dynamic ranges.

（Ｃ−７）前述した第１及び第２の実施形態において、画像処理システム１、１Ａは、画像復号装置Ｘ２、２０を備えている例を示しているが、画像処理システム１、１Ａが画像符号化装置１０（１０、１０Ａ）のみを備える場合であっても本発明は一定程度の効果を有する。 (C-7) In the first and second embodiments described above, the image processing systems 1 and 1A show an example in which the image decoding devices X2 and 20 are provided. Even when only the encoding device 10 (10, 10A) is provided, the present invention has a certain degree of effect.

非特許文献１でも述べられている通り、前処理によりダイナミックレンジを限定することでＲＯＩ符号化を行う、画像処理システム１、１Ａでは、画像符号化装置１０（１０、１０Ａ）で行う前処理に対応する後処理を画像復号装置Ｘ２、２０において実施することで、背景領域の画質を一定程度回復する。しかし、非特許文献１でも述べられている通り、後処理を伴わない通例の画像復号装置（画像符号化装置１０（１０、１０Ａ）で前処理によるＲＯＩ符号化を行っていることを知らない画像復号装置）で復号したとしても、色合いがＲＯＩと背景領域とで異なるだけで、一定の意味を人間が汲み取れる画像になる。そして、この構成でも、背景領域の画素値がＲＯＩに染み出したり、ＲＯＩの画素値が背景領域に染み出したりという本発明が対象としている課題は発生し、本発明を導入することで課題を解決することができる。そのため、画像処理システム１、１Ａが画像符号化装置１０（１０、１０Ａ）のみを備える場合であっても本発明は一定程度の効果を有する。 As described in Non-Patent Document 1, ROI encoding is performed by limiting the dynamic range by preprocessing. In the image processing systems 1 and 1A, preprocessing is performed by the image encoding device 10 (10, 10A). The image processing of the background area is restored to a certain extent by performing the corresponding post-processing in the image decoding device X2, 20. However, as also described in Non-Patent Document 1, an image that does not know that ROI coding is being performed by pre-processing in a conventional image decoding device (image coding device 10 (10, 10A) without post-processing is known Even when decoded by the decoding device), only the color tone differs between the ROI and the background area, and an image in which a human can decipher a certain meaning can be obtained. Then, even with this configuration, the problems targeted by the present invention such that the pixel values of the background region ooze out to the ROI and the pixel values of the ROI ooze out to the background region occur, and the problems are solved by introducing the present invention. It can be solved. Therefore, even when the image processing system 1 or 1A includes only the image encoding device 10 (10, 10A), the present invention has an effect to a certain extent.

１、１Ａ、Ｚ…画像処理システム、１０、１０Ａ、Ｘ１…画像符号化装置、２０、Ｘ２…画像復号装置、１０２…ＲＯＩ拡大部、２０５…境界領域用前処理部、２０９…境界領域用後処理部、３０１…ＲＯＩ設定部、３０３…ＲＯＩ座標正規化部、３０４…外部領域用前処理部、３０６…エンコーダ部、３０７…デコーダ部、３０８…外部領域用後処理部、Ｕ１…前処理部、Ｕ２…後処理部、Ｕ３…ＲＯＩ座標補正部、Ｕ４…前処理部、Ｕ５…後処理部。
1, 1A, Z: image processing system, 10, 10A, X1: image coding device, 20, X2: image decoding device, 102: ROI enlargement unit, 205: boundary region preprocessing unit, 209: boundary region for back Processing unit 301: ROI setting unit 303: ROI coordinate normalization unit 304: external region preprocessing unit 306: encoder unit 307: decoder unit 308: external region post-processing unit U1: preprocessing unit , U2 ... post-processing unit, U3 ... ROI coordinate correction unit, U4 ... pre-processing unit, U5 ... post-processing unit.

Claims

In a coding device for coding an input image,
An attention area setting unit which sets an attention area corresponding to the input image and outputs attention area information;
An attention area coordinate correction unit that corrects the attention area information and outputs the corrected attention area information;
Based on the corrected attention area information, the pixels belonging to the background area in the input image are identified, and the pixels belonging to the identified background area are subjected to filtering for limiting the dynamic range, and output as an image before encoding A pre-processing unit that
An encoding unit that compresses the image before encoding using a predetermined image encoding method and outputs encoded data that has been compressed.

The attention area coordinate correction unit includes an attention area enlargement unit that enlarges the attention area in the peripheral direction by N (N is an integer of 1 or more) pixels based on the attention area information.
The encoding device according to claim 1, wherein the attention area coordinate correction unit outputs enlarged attention area information expanded using the attention area expansion unit as the corrected attention area information.

The attention area coordinate correction unit has an attention area coordinate normalization unit that normalizes the enlarged attention area information so that the boundary between the background area and the attention area is aligned with a block boundary.
The encoding apparatus according to claim 2, wherein the attention area coordinate correction unit outputs the enlarged attention area information normalized using the attention area coordinate normalization unit as the corrected attention area information. .

The region-of-interest coordinate normalization unit considers that all pixels included in the block belong to the region of interest if at least one of the pixels belonging to the block of a predetermined size is included in the region of interest. The encoding apparatus according to claim 3, wherein normalization is performed.

The predetermined image coding method is as follows. H.264 / MPEG-4 AVC or H.264. 265 / MPEG-H HEVC,
The encoding apparatus according to any one of claims 2 to 4, wherein the N is 1 or 2 or 3.

The input image is a boundary area which is a set of M (M is an integer of 1 or more) double pixels surrounding the target area among the pixels belonging to the background area, and the boundary among the pixels belonging to the background area And an external area which is a set of pixels other than the area,
The pre-processing unit
An external area preprocessing unit for specifying pixels belonging to the external area among the input image based on the corrected attention area information and performing filtering for limiting a dynamic range to the pixels belonging to the specified external area;
A boundary area preprocessing unit for specifying pixels belonging to the boundary area among the input image based on the corrected attention area information and performing filtering for limiting the dynamic range to the pixels belonging to the specified boundary area; The coding apparatus according to any one of claims 1 to 5, wherein the coding apparatus is provided.

7. The encoding apparatus according to claim 6, wherein a dynamic range of pixels belonging to the boundary area is limited to a dynamic range wider than a dynamic range for pixels belonging to the external area.

When the boundary area is double or more, the dynamic range of the Y (Y is an integer of 1 or more) second boundary area is larger than the dynamic range of the Y + 1th boundary area. The encoding apparatus according to claim 6 or 7, characterized in that:

The predetermined image coding method is as follows. H.264 / MPEG-4 AVC or H.264. 265 / MPEG-H HEVC,
The encoding apparatus according to any one of claims 6 to 8, wherein the M is 1 or 2 or 3.

The said attention area coordinate correction | amendment part is not equipped with the said attention area expansion part, The said attention area information is input into the said attention area coordinate normalization part as said expansion attention area information. The encoding device according to any one.

10. The encoding apparatus according to any one of claims 6 to 9, wherein the encoding apparatus does not include the attention area coordinate correction unit, and the attention area information is directly input as the corrected attention area information to the preprocessing unit. The encoding device according to claim 1.

In the picture processing system in which the encoding device and the decoding device face each other,
A picture processing system characterized in that the coding device according to any one of claims 1 to 11 is applied as the coding device.

The decoding device is
A decoding unit that decodes the input coded data according to a method corresponding to the predetermined image coding method used by the coding unit, and outputs the decoded data as a decoded image;
A post-processing unit that identifies pixels belonging to the background region among the decoded image based on the region of interest information, applies filtering to restore the dynamic range to the pixels belonging to the identified background region, and outputs as an output image The picture processing system according to claim 12, characterized in that it comprises:

In the picture processing system in which the encoding device and the decoding device face each other,
The encoding device according to any one of claims 6 to 11 is applied as the encoding device,
The decoding device is
A decoding unit that decodes the input coded data according to a method corresponding to the predetermined image coding method used by the coding unit, and outputs the decoded data as a decoded image;
An external region post-processing unit that identifies pixels belonging to the external region among the decoded image based on the corrected attention region information, and performs filtering for restoring a dynamic range to the pixels belonging to the identified external region. When,
A post-process for boundary area which specifies a pixel belonging to the boundary area among the decoded image based on the corrected attention area information, and performs filtering for restoring a dynamic range to the pixel belonging to the specified boundary area An image processing system comprising:

A computer mounted on a coding device for coding an input image,
An attention area setting unit which sets an attention area corresponding to the input image and outputs attention area information;
An attention area coordinate correction unit that corrects the attention area information and outputs the corrected attention area information;
Based on the corrected attention area information, the pixels belonging to the background area in the input image are identified, and the pixels belonging to the identified background area are subjected to filtering for limiting the dynamic range, and output as an image before encoding A pre-processing unit that
An encoding program characterized in that the image before encoding is compressed using a predetermined image encoding method, and the encoding program functions as an encoding unit that outputs compressed encoded data.