JP2018129688A

JP2018129688A - Video encoding device, video encoding/decoding system, and video encoding program

Info

Publication number: JP2018129688A
Application number: JP2017021623A
Authority: JP
Inventors: 和仁迫水; Kazuhito Sakomizu; 中井　敏久; Toshihisa Nakai; 敏久中井
Original assignee: Oki Electric Industry Co Ltd
Current assignee: Oki Electric Industry Co Ltd
Priority date: 2017-02-08
Filing date: 2017-02-08
Publication date: 2018-08-16

Abstract

PROBLEM TO BE SOLVED: To provide a video encoding device, a video encoding/decoding system, and a video encoding program capable of preventing an increase in the amount of codes due to an intraframe code and reducing the amount of computation required for decoding an ROI.SOLUTION: A video encoding device X1 includes an ROI (Region Of Interest) setting unit that sets an ROI in a picture and generates ROI information representing the position and size of the ROI in the picture and an encoder unit U1 that executes compression coding processing for an input image signal on the basis of the ROI information in response to the input image signal and the ROI information to generate a bit stream signal. On the basis of the ROI information, the encoder unit executes intra coding as the compression coding processing for an image block included in the ROI, and selects and executes one of intra coding and inter coding as the compression coding processing for image blocks included in the non-ROI.SELECTED DRAWING: Figure 1

Description

本発明は、映像信号を符号化する映像符号化装置、映像信号を符号・復号する映像符号復号システム、及び映像符号化プログラムに関する。 The present invention relates to a video encoding device that encodes a video signal, a video encoding / decoding system that encodes / decodes a video signal, and a video encoding program.

近年、監視カメラの普及は進み、さらなる高解像度化、高フレームレート化、多視点化が望まれている。しかし、高解像度化、高フレームレート化、多視点化は、動画像のデータ量の著しい増加を引き起こし、通信コストやストレージコストの増加を招く。 In recent years, surveillance cameras have become popular, and further higher resolution, higher frame rate, and multi-viewpoint are desired. However, increasing the resolution, increasing the frame rate, and increasing the number of viewpoints cause a significant increase in the amount of moving image data, resulting in increased communication costs and storage costs.

この問題を緩和するため、動画像中から例えば「顔」が表示されている特定の領域をＲＯＩ（Region of Interest）として検出し、このＲＯＩに対して多くのビット数を割り当てて画像圧縮符号化処理を施すようにした画像処理装置が提案されている（例えば、特許文献１参照）。 In order to alleviate this problem, for example, a specific region in which “face” is displayed is detected as a ROI (Region of Interest) from a moving image, and a large number of bits are assigned to this ROI to perform image compression coding. An image processing apparatus that performs processing has been proposed (see, for example, Patent Document 1).

当該画像処理装置では、符号量と映像品質を制御するパラメータをブロック毎に設定するという圧縮部の機能を利用して、ＲＯＩ以外の領域、つまり背景領域の情報量をＲＯＩの情報量よりも削減するように、圧縮部に与えるパラメータを制御することにより、データ量を削減している。 In the image processing apparatus, the information amount of the area other than the ROI, that is, the background area is reduced from the information amount of the ROI by using the function of the compression unit that sets the parameter for controlling the code amount and the video quality for each block. As described above, the data amount is reduced by controlling the parameters given to the compression unit.

一方、例えば価格や互換性等の事情により、符号量と映像品質を制御するパラメータをブロック毎に設定するという機能を備えていない圧縮部を用いる場合には、上記したような方法で情報量の削減を図ることはできない。 On the other hand, when using a compression unit that does not have a function of setting a parameter for controlling the code amount and video quality for each block due to circumstances such as price and compatibility, the amount of information is It cannot be reduced.

そこで、圧縮部に依存せずに背景領域の情報量を削減する為に、背景領域に低域通過フィルタによるフィルタリング処理を施して高周波成分の情報を取り除くことで背景領域の情報量を削減するようにした画像符号化装置が提案されている（例えば、特許文献２参照）。この画像符号化装置では、低域通過フィルタによるフィルタリングにより、画像信号の情報量を削減しつつ、ブロック境界に発生するアーティファクトの発生を抑制している。 Therefore, in order to reduce the amount of information in the background area without depending on the compression unit, the amount of information in the background area is reduced by filtering the background area with a low-pass filter to remove high-frequency component information. An image coding apparatus has been proposed (see, for example, Patent Document 2). In this image encoding device, the generation of artifacts occurring at block boundaries is suppressed while reducing the amount of image signal information by filtering with a low-pass filter.

特開２００９−４９９７９号公報JP 2009-49979 A 特開平４−２１９０８９号公報JP-A-4-219089

ところで、上記したような従来の画像処理では、以下の問題が生じる。すなわち、復号に必要な演算量を少なくするために、他のピクチャを参照せずに各フレーム内で画像信号を圧縮する、いわゆるフレーム内符号化を実施した場合には、他のピクチャの参照を許容するフレーム間符号化に比べて符号量が著しく増加する。よって、大幅な符号量の削減を図ることが困難になる。 By the way, the following problems occur in the conventional image processing as described above. In other words, in order to reduce the amount of computation required for decoding, when so-called intraframe coding is performed in which an image signal is compressed within each frame without referring to other pictures, reference to other pictures is performed. Compared with the allowable interframe coding, the code amount is significantly increased. Therefore, it becomes difficult to reduce the code amount significantly.

一方、フレーム間符号を行った場合には、符号量の削減を図ることができるものの、復号時の演算量が大きくなるという問題が生じる。すなわち、フレーム間符号の施された符号化画像信号に対して、ＲＯＩを含む任意のピクチャを復号する際に、当該ピクチャが他のピクチャを参照している場合には、ＲＯＩに対する復号を行う前に、この他のピクチャを復号しておく必要がある。更に、当該他のピクチャが更に別のピクチャを参照している場合、この別のピクチャを先に復号しておく必要が生じる。このように、フレーム間符号を行った場合、ＲＯＩを含む任意のピクチャを復号する際に、１つ以上のピクチャを予め復号しておかなければならない為、ＲＯＩの復号に必要となる演算量が大きくなるという問題が生じる。 On the other hand, when inter-frame coding is performed, the amount of code can be reduced, but there is a problem that the amount of calculation at the time of decoding becomes large. That is, when an arbitrary picture including an ROI is decoded with respect to an encoded image signal subjected to inter-frame coding, if the picture refers to another picture, before decoding the ROI In addition, it is necessary to decode other pictures. Further, when the other picture refers to another picture, it is necessary to decode this other picture first. As described above, when inter-frame coding is performed, when decoding any picture including ROI, one or more pictures must be decoded in advance. Therefore, the amount of calculation required for ROI decoding is small. The problem of becoming larger arises.

そこで、本発明は、符号量の削減及び演算量の削減を両立させることが可能な映像符号化装置、映像符号復号システム、及び映像符号化プログラムを提供することを目的とする。 Therefore, an object of the present invention is to provide a video encoding device, a video encoding / decoding system, and a video encoding program capable of achieving both a reduction in code amount and a reduction in calculation amount.

本発明に係る映像符号化装置は、入力画像信号に圧縮符号化処理を施してビットストリーム信号を生成する映像符号化装置であって、ピクチャ内にＲＯＩ(Region Of Interest)を設定し、前記ピクチャ内での前記ＲＯＩの位置及びサイズを表すＲＯＩ情報を生成するＲＯＩ設定部と、前記入力画像信号及び前記ＲＯＩ情報を受け、前記ＲＯＩ情報に基づいて前記入力画像信号に圧縮符号化処理を施してビットストリーム信号を生成するエンコーダ部と、を有し、前記エンコーダ部は、前記ＲＯＩ情報に基づき、前記入力画像信号におけるピクチャ内の前記ＲＯＩに含まれる画像ブロックに対しては前記圧縮符号化処理としてイントラ符号化を施し、非ＲＯＩに含まれる画像ブロックに対しては前記圧縮符号化処理として前記イントラ符号化及びインター符号化のうちの一方を選択して施す。 A video encoding apparatus according to the present invention is a video encoding apparatus that generates a bitstream signal by performing compression encoding processing on an input image signal, and sets an ROI (Region Of Interest) in a picture, and the picture A ROI setting unit that generates ROI information representing the position and size of the ROI within the network, and receives the input image signal and the ROI information, and performs compression encoding processing on the input image signal based on the ROI information An encoder unit that generates a bitstream signal, and the encoder unit performs compression coding processing on an image block included in the ROI in a picture in the input image signal based on the ROI information. Intra coding is performed, and for the image blocks included in the non-ROI, the intra coding and intra coding are performed as the compression coding processing. Applied by selecting one of the over coding.

本発明に係る映像符号復号システムは、入力画像信号に圧縮符号化処理を施してビットストリーム信号を生成する映像符号化装置と、前記ビットストリーム信号に対して復号処理を施す映像復号装置と、を含む画像符号復号システムであって、前記映像符号化装置は、ピクチャ内にＲＯＩ(Region Of Interest)を設定し、前記ピクチャ内での前記ＲＯＩの位置及びサイズを表すＲＯＩ情報を生成するＲＯＩ設定部と、復号に必要となるパラメータセットを含む上位シンタックス情報を生成する上位シンタックス生成部と、前記入力画像信号及び前記ＲＯＩ情報を受け、前記ＲＯＩ情報に基づいて前記入力画像信号に圧縮符号化処理を施して得られたエレメンタリーストリーム信号と、前記上位シンタックス情報とを多重化したビットストリーム信号を出力するエンコーダ部と、を有し、前記エンコーダ部は、前記ＲＯＩ情報に基づき、前記入力画像信号におけるピクチャ内の前記ＲＯＩに含まれる画像ブロックに対しては前記圧縮符号化処理としてイントラ符号化を施し、非ＲＯＩに含まれる画像ブロックに対しては前記圧縮符号化処理として前記イントラ符号化及びインター符号化のうちの一方を選択して施し、前記映像復号装置は、前記ビットストリーム信号を受け、前記ビットストリーム信号から前記上位シンタックス情報に含まれる前記パラメータセット、及び前記エレメンタリーストリーム信号に対応したアクセスユニットを抽出する復号画像選択部と、前記アクセスユニットに対して前記パラメータセットに基づく復号処理を施して出力画像を生成する映像復号部と、を含む。 A video encoding / decoding system according to the present invention includes: a video encoding device that performs compression encoding processing on an input image signal to generate a bitstream signal; and a video decoding device that performs decoding processing on the bitstream signal. An image encoding / decoding system including: an ROI setting unit configured to set an ROI (Region Of Interest) in a picture and generate ROI information indicating a position and a size of the ROI in the picture An upper syntax generation unit that generates upper syntax information including a parameter set necessary for decoding, and receives the input image signal and the ROI information, and compresses and encodes the input image signal based on the ROI information A bit stream signal obtained by multiplexing an elementary stream signal obtained by processing and the upper syntax information An encoder unit for outputting, based on the ROI information, the encoder unit performs intra coding as the compression coding process on an image block included in the ROI in a picture in the input image signal. For the image blocks included in the non-ROI, one of the intra coding and the inter coding is selected as the compression coding processing, and the video decoding device receives the bit stream signal, A decoded image selection unit that extracts the parameter set included in the higher-level syntax information from the bit stream signal and an access unit corresponding to the elementary stream signal, and a decoding process based on the parameter set for the access unit And a video decoding unit that generates an output image.

本発明に係る映像符号化プログラムは、入力画像信号に圧縮符号化処理を施してビットストリーム信号を生成する情報処理装置の制御部が実行する映像符号化プログラムであって、ピクチャ内にＲＯＩ(Region Of Interest)を設定し、前記ピクチャ内での前記ＲＯＩの位置及びサイズを表すＲＯＩ情報を生成する第１のステップと、前記入力画像信号及び前記ＲＯＩ情報を受け、前記ＲＯＩ情報に基づいて前記入力画像信号に圧縮符号化処理を施してビットストリーム信号を生成する第２のステップと、を有し、前記第２のステップでは、前記ＲＯＩ情報に基づき、前記入力画像信号におけるピクチャ内の前記ＲＯＩに含まれる画像ブロックに対しては前記圧縮符号化処理としてイントラ符号化を施し、非ＲＯＩに含まれる画像ブロックに対しては前記圧縮符号化処理として前記イントラ符号化及びインター符号化のうちの一方を選択して施す。 A video encoding program according to the present invention is a video encoding program executed by a control unit of an information processing apparatus that generates a bitstream signal by performing compression encoding processing on an input image signal, and includes a ROI (Region A first step of generating ROI information representing the position and size of the ROI within the picture, receiving the input image signal and the ROI information, and receiving the input based on the ROI information A second step of generating a bitstream signal by performing compression coding processing on the image signal, and in the second step, based on the ROI information, the ROI in the picture in the input image signal Intra coding is performed as the compression coding process for the included image block, and for the image block included in the non-ROI Applied by selecting one of said intra-coded and inter-coded as serial compression coding processing.

本発明では、ピクチャ内においてＲＯＩを設定し、そのピクチャ内でのＲＯＩの位置及びサイズを表すＲＯＩ情報を生成し、当該ＲＯＩ情報Ｄ０１に基づいて入力画像信号に対して圧縮符号化処理を施すにあたり、以下の処理を行う。つまり、ＲＯＩ情報に基づき、ＲＯＩに含まれる画像ブロックには圧縮符号化処理としてイントラ符号化を施し、非ＲＯＩに含まれる画像ブロック群に対してはイントラ符号化及びインター符号化のうちの一方を選択して施す。 In the present invention, an ROI is set in a picture, ROI information representing the position and size of the ROI in the picture is generated, and an input image signal is compressed and encoded based on the ROI information D01. The following processing is performed. That is, based on the ROI information, the image block included in the ROI is subjected to intra coding as a compression coding process, and one of the intra coding and the inter coding is performed on the image block group included in the non-ROI. Select and apply.

これにより、ピクチャ内の全ての画像ブロックをイントラ符号化によって圧縮符号化する場合に比して、符号量を削減することが可能となる。また、ピクチャ内の全ての画像ブロックをインター符号化によって圧縮符号化する場合に比して演算量を削減することが可能となる。 As a result, it is possible to reduce the code amount as compared with a case where all image blocks in a picture are compression-encoded by intra-encoding. In addition, the amount of calculation can be reduced as compared with the case where all image blocks in a picture are compression-encoded by inter-encoding.

よって、本発明によれば、符号量の削減及び演算量の削減を両立させることが可能となる。 Therefore, according to the present invention, it is possible to achieve both a reduction in code amount and a reduction in calculation amount.

映像符号化装置Ｘ１の構成の一例を示すブロック図である。It is a block diagram which shows an example of a structure of the video encoding apparatus X1. 画像符号化部Ｕ１の映像符号化処理を示すフロー図である。It is a flowchart which shows the video coding process of the image coding part U1. 映像処理システムの一例としてのＲＯＩ識別システムＲＩＳの構成を示すブロック図である。It is a block diagram which shows the structure of ROI identification system RIS as an example of a video processing system. 復号画像選択装置Ｘ２が実施する復号画像選択処理の一例を示すフロー図である。It is a flowchart which shows an example of the decoding image selection process which the decoding image selection apparatus X2 implements. 復号画像選択装置Ｘ２が実施する復号画像選択処理の他の一例を示すフロー図である。It is a flowchart which shows another example of the decoding image selection process which the decoding image selection apparatus X2 implements. 映像符号化装置Ｘ１の構成の他の一例を示すブロック図である。It is a block diagram which shows another example of a structure of the video encoding apparatus X1. 図６に示す構成からなる映像符号化装置Ｘ１が実施する復号画像選択処理の一例を示すフロー図である。It is a flowchart which shows an example of the decoding image selection process which the video coding apparatus X1 which consists of a structure shown in FIG. 6 implements. 映像符号化処理をプログラムによって実行する情報処理装置の構成を示すブロック図である。It is a block diagram which shows the structure of the information processing apparatus which performs a video encoding process by a program.

以下、本発明の実施例を図面を参照しつつ詳細に説明する。 Hereinafter, embodiments of the present invention will be described in detail with reference to the drawings.

図１は、本発明に係る映像符号化装置Ｘ１の構成を示すブロック図である。図１に示すように、映像符号化装置Ｘ１は、ＲＯＩ設定部３０１及びエンコーダ部Ｕ１を含む。エンコーダ部Ｕ１は、イントラ予測モード指定部１０２、及び予測モード可変エンコーダ部Ｕ２を含む。予測モード可変エンコーダ部Ｕ２は、予測モード判定部１０３、エンコーダコア部Ｕ３、上位シンタックス生成部１０６及びビットストリーム生成部３０７を含む。 FIG. 1 is a block diagram showing a configuration of a video encoding device X1 according to the present invention. As shown in FIG. 1, the video encoding device X1 includes a ROI setting unit 301 and an encoder unit U1. The encoder unit U1 includes an intra prediction mode designating unit 102 and a prediction mode variable encoder unit U2. The prediction mode variable encoder unit U2 includes a prediction mode determination unit 103, an encoder core unit U3, an upper syntax generation unit 106, and a bit stream generation unit 307.

以下に、上記したＲＯＩ設定部３０１、イントラ予測モード指定部１０２、予測モード可変エンコーダ部Ｕ２、予測モード判定部１０３、エンコーダコア部Ｕ３、上位シンタックス生成部１０６及びビットストリーム生成部３０７の動作について、図２に示す映像符号化処理を示すフロー図に沿って説明する。 Hereinafter, the operations of the ROI setting unit 301, the intra prediction mode specifying unit 102, the prediction mode variable encoder unit U2, the prediction mode determining unit 103, the encoder core unit U3, the upper syntax generation unit 106, and the bit stream generation unit 307 will be described. 2 will be described with reference to the flowchart showing the video encoding process shown in FIG.

先ず、ＲＯＩ設定部３０１が、各画素の輝度レベルを表す画素値の系列を含む入力画像信号Ｅ１を受け、当該入力画像信号Ｅ１によって表されるピクチャに対してＲＯＩを設定し、設定したＲＯＩのピクチャ内での位置及びサイズを表すＲＯＩ情報Ｄ０１を生成する（ステップＳ１０１）。 First, the ROI setting unit 301 receives an input image signal E1 including a series of pixel values representing the luminance level of each pixel, sets the ROI for the picture represented by the input image signal E1, and sets the ROI thus set. ROI information D01 representing the position and size in the picture is generated (step S101).

例えば、ＲＯＩ設定部３０１は、特定のオブジェクトとして人の顔の検出を行う顔検出アルゴリズムを用いて、入力画像信号Ｅ１中から人の顔が表示される領域をＲＯＩとして検出する。そして、ＲＯＩ設定部３０１は、ピクチャ内でのＲＯＩの位置（座標）及びサイズを表すＲＯＩ情報Ｄ０１を設定する。尚、ＲＯＩ設定部３０１は、入力画像信号Ｅ１に依存せず、ユーザによって任意に指定されたＲＯＩを設定し、そのＲＯＩのピクチャ内での位置及びサイズを表すＲＯＩ情報Ｄ０１を生成しても良い。ＲＯＩ設定部３０１は、当該ＲＯＩ情報Ｄ０１をイントラ予測モード指定部１０２及びエンコーダコア部Ｕ３に供給する。 For example, the ROI setting unit 301 detects, as an ROI, an area in which a human face is displayed from the input image signal E1 using a face detection algorithm that detects a human face as a specific object. Then, the ROI setting unit 301 sets ROI information D01 representing the position (coordinates) and size of the ROI in the picture. The ROI setting unit 301 may set the ROI arbitrarily designated by the user without depending on the input image signal E1, and generate the ROI information D01 representing the position and size of the ROI in the picture. . The ROI setting unit 301 supplies the ROI information D01 to the intra prediction mode designating unit 102 and the encoder core unit U3.

次に、イントラ予測モード指定部１０２が、ＲＯＩ情報Ｄ０１に基づきＲＯＩに含まれる画像ブロックに対してはイントラ予測モードを指定するイントラ予測モード指定信号Ｄ０２を生成する（ステップＳ１０２）。例えば、エンコーダコア部Ｕ３が画像符号方式としてＨ．２６４／ＭＰＥＧ（Moving Picture Experts Group）−４ＡＶＣを採用している場合、イントラ予測モード指定信号Ｄ０２は、例えば１６画素×１６画素からなる画像ブロック（以下、イントラマクロブロック又はイントラＭＢと称する）単位でイントラ予測モードを指定する。イントラ予測モード指定部１０２は、イントラ予測モード指定信号Ｄ０２を予測モード判定部１０３に供給する。 Next, the intra prediction mode designating unit 102 generates an intra prediction mode designating signal D02 that designates the intra prediction mode for the image block included in the ROI based on the ROI information D01 (step S102). For example, the encoder core unit U3 uses H.264 as an image encoding method. When the H.264 / MPEG (Moving Picture Experts Group) -4 AVC is adopted, the intra prediction mode designation signal D02 is, for example, an image block (hereinafter referred to as an intra macroblock or an intra MB) composed of 16 pixels × 16 pixels. Specify the intra prediction mode with. The intra prediction mode designation unit 102 supplies the intra prediction mode designation signal D02 to the prediction mode determination unit 103.

次に、予測モード判定部１０３が、イントラ予測モード指定信号Ｄ０２の有無に応じて予測モードを判定し、判定した予測モードを示す予測モード情報Ｄ０３を生成する（ステップＳ１０３）。すなわち、予測モード判定部１０３は、イントラ予測モード指定信号Ｄ０２によってイントラ予測モードに指定された画像ブロックに対してはイントラ予測モードを示す予測モード情報Ｄ０３を生成する。 Next, the prediction mode determination unit 103 determines a prediction mode according to the presence or absence of the intra prediction mode designation signal D02, and generates prediction mode information D03 indicating the determined prediction mode (step S103). That is, the prediction mode determination unit 103 generates prediction mode information D03 indicating the intra prediction mode for the image block designated as the intra prediction mode by the intra prediction mode designation signal D02.

一方、イントラ予測モード指定信号Ｄ０２によってイントラ予測モードに指定されていない画像ブロックに対しては、予測モード判定部１０３は、各種の予測モードのうちから適切な予測モード（イントラ予測モード又はインター予測モード）を判定し、判定した予測モードを示す予測モード情報Ｄ０３を生成する。例えば、予測モード判定部１０３は、入力画像信号Ｅ１によって表されるピクチャに基づき、イントラ予測モード及びインター予測モードのうちの一方の予測モードを選択し、選択した方の予測モードを示す予測モード情報Ｄ０３を生成する。予測モード判定部１０３は、予測モード情報Ｄ０３をエンコーダコア部Ｕ３に供給する。 On the other hand, for an image block that is not designated as the intra prediction mode by the intra prediction mode designation signal D02, the prediction mode determination unit 103 selects an appropriate prediction mode (intra prediction mode or inter prediction mode) from among various prediction modes. ) And prediction mode information D03 indicating the determined prediction mode is generated. For example, the prediction mode determination unit 103 selects one of the intra prediction mode and the inter prediction mode based on the picture represented by the input image signal E1, and predicts the prediction mode information indicating the selected prediction mode. D03 is generated. The prediction mode determination unit 103 supplies the prediction mode information D03 to the encoder core unit U3.

次に、エンコーダコア部Ｕ３が、予測モード情報Ｄ０３に従って入力画像信号Ｅ１に圧縮符号化処理を施して得られたエレメンタリーストリーム信号Ｄ０５を出力する（ステップＳ１０５）。エンコーダコア部Ｕ３は、前処理部３０５Ａ及び圧縮部３０５Ｂを含む。前処理部３０５Ａは、入力画像信号Ｅ１及びＲＯＩ情報Ｄ０１を受け、入力画像信号Ｅ１に以下のような前処理を施すことにより、画像品質を制御する。 Next, the encoder core unit U3 outputs an elementary stream signal D05 obtained by subjecting the input image signal E1 to compression coding processing according to the prediction mode information D03 (step S105). The encoder core unit U3 includes a preprocessing unit 305A and a compression unit 305B. The preprocessing unit 305A receives the input image signal E1 and the ROI information D01, and controls the image quality by performing the following preprocessing on the input image signal E1.

すなわち、前処理部３０５Ａは、入力画像信号Ｅ１に含まれる画素値の系列に対して、ＲＯＩ情報Ｄ０１に基づきＲＯＩ以外の領域（以下、背景領域又は非ＲＯＩと称する）に該当すると判定された画素値には、ＲＯＩに該当すると判定された画素値に比べて少ない符号量となるような符号化処理を施す。かかる符号化処理は、例えばガウシアンフィルタ等を用いて入力画像信号Ｅ１を畳み込むことによって、入力画像信号Ｅ１の高周波成分を抑圧する処理である。 That is, the pre-processing unit 305A determines that the pixel value series included in the input image signal E1 corresponds to a region other than the ROI (hereinafter referred to as a background region or a non-ROI) based on the ROI information D01. The value is subjected to an encoding process such that the code amount is smaller than that of the pixel value determined to correspond to the ROI. Such an encoding process is a process of suppressing the high frequency component of the input image signal E1 by convolving the input image signal E1 using, for example, a Gaussian filter.

前処理部３０５Ａは、入力画像信号Ｅ１に上記のような前処理を施して得られた前処理画像信号を圧縮部３０５Ｂに供給する。すなわち、前処理部３０５Ａにより、ＲＯＩを含むピクチャに対しては画像品質を抑えて符号量の削減を図るという、画像品質の制御が為される。 The preprocessing unit 305A supplies a preprocessed image signal obtained by performing the above-described preprocessing on the input image signal E1 to the compression unit 305B. That is, the pre-processing unit 305A performs image quality control for reducing the code amount by suppressing the image quality for the picture including the ROI.

圧縮部３０５Ｂは、予測モード情報Ｄ０３及び上位シンタックスに基づき、前処理画像信号に上記した画像符号方式に従った圧縮符号化処理を施して得られたエレメンタリーストリーム信号Ｄ０５を、ビットストリーム生成部３０７に供給する。尚、この画像符号方式としては、復号側においてＲＯＩを含むピクチャのみを復号することでＲＯＩを抽出することが可能となる例えばＨ．２６４／ＭＰＥＧ−４ＡＶＣ（Advanced Video Coding）、或いはＨ．２６５／ＭＰＥＧ−ＨＨＥＶＣ（High Efficiency Video Coding）等を用いる。 Based on the prediction mode information D03 and the upper syntax, the compression unit 305B generates an elementary stream signal D05 obtained by subjecting the preprocessed image signal to compression encoding processing in accordance with the above-described image encoding method. 307. As this image encoding method, ROI can be extracted by decoding only pictures including ROI on the decoding side. H.264 / MPEG-4 AVC (Advanced Video Coding), or H.264. H.265 / MPEG-H HEVC (High Efficiency Video Coding) or the like is used.

かかる構成により、エンコーダコア部Ｕ３は、ＲＯＩ情報Ｄ０１、予測モード情報Ｄ０３、及び後述する制約付きイントラを示す上位シンタックスに基づき、入力画像信号Ｅ１に圧縮符号化処理を施す。すなわち、エンコーダコア部Ｕ３は、入力画像信号Ｅ１によって表されるピクチャ内のＲＯＩに該当するブロックに対しては、予測モード情報Ｄ０３によって示されるイントラ予測モードに従って、イントラ符号化処理を施す。この際、制約付きイントラを示す上位シンタックスに応じて、ＲＯＩに含まれる画像ブロックの周辺の画素値を用いてイントラ符号化を行う場合には、エンコーダコア部Ｕ３は、かかる周辺の画素値として、既にイントラ符号化された画素値又は所定値のみを用いる。 With this configuration, the encoder core unit U3 performs compression coding processing on the input image signal E1 based on the ROI information D01, the prediction mode information D03, and higher-order syntax indicating a restricted intra, which will be described later. That is, the encoder core unit U3 performs an intra coding process on the block corresponding to the ROI in the picture represented by the input image signal E1 according to the intra prediction mode indicated by the prediction mode information D03. At this time, in the case of performing intra coding using the pixel values around the image block included in the ROI according to the upper syntax indicating the restricted intra, the encoder core unit U3 uses the surrounding pixel values as the surrounding pixel values. Only pixel values that have already been intra-coded or predetermined values are used.

一方、当該ピクチャ内においてＲＯＩに該当しない領域、つまり非ＲＯＩ（背景領域）に該当するブロック群に対しては、エンコーダコア部Ｕ３は、予測モード情報Ｄ０３によって示されるイントラ予測モード又はインター予測モードに従った符号化処理を施す。つまり、予測モード情報Ｄ０３がイントラ予測モードを示す場合には、エンコーダコア部Ｕ３は、入力画像信号Ｅ１によって表されるピクチャ内の非ＲＯＩのブロックにイントラ符号化処理を施す。また、予測モード情報Ｄ０３がインター予測モードを示す場合には、エンコーダコア部Ｕ３は、入力画像信号Ｅ１によって表されるピクチャ内の非ＲＯＩのブロックにインター符号化処理を施す。 On the other hand, for an area that does not correspond to ROI in the picture, that is, a block group that corresponds to a non-ROI (background area), the encoder core unit U3 switches to the intra prediction mode or the inter prediction mode indicated by the prediction mode information D03. The encoding process is performed accordingly. That is, when the prediction mode information D03 indicates the intra prediction mode, the encoder core unit U3 performs an intra coding process on the non-ROI block in the picture represented by the input image signal E1. When the prediction mode information D03 indicates the inter prediction mode, the encoder core unit U3 performs inter coding processing on the non-ROI block in the picture represented by the input image signal E1.

エンコーダコア部Ｕ３は、上記した圧縮符号化処理によって得られたエレメンタリーストリーム信号Ｄ０５を、ビットストリーム生成部３０７に供給する。 The encoder core unit U3 supplies the elementary stream signal D05 obtained by the compression encoding process described above to the bit stream generation unit 307.

次に、上位シンタックス生成部１０６が、復号時に必要となる情報を含み且つ制約付きイントラを示す上位シンタックスを生成する（ステップＳ１０６）。例えば、エンコーダコア部Ｕ３の画像符号方式が上記したＨ．２６４／ＭＰＥＧ−４ＡＶＣ、或いはＨ．２６５／ＭＰＥＧ−ＨＨＥＶＣである場合、上位シンタックス生成部１０６は、ＳＰＳ（Sequence Parameter Set）及びＰＰＳ（Picture Parameter Set）を上位シンタックスとして生成する。ＳＰＳは、シーケンス全体の符号化に係る情報を示すパラメータセットであり、ＰＰＳは、ピクチャ全体の符号化に係る情報を示すパラメータセットである。尚、例えばエンコーダコア部Ｕ３がＨ．２６４／ＭＰＥＧ−４ＡＶＣ、或いはＨ．２６５／ＭＰＥＧ−ＨＨＥＶＣ等のような制約付きイントラの指定に対応した画像符号方式を採用している場合、上位シンタックス生成部１０６は、ＰＰＳにより、制約付きイントラの指定を行う。制約付きイントラでは、イントラＭＢ周辺の画素値がインター符号化されたものである場合には、それを使用せず、当該イントラＭＢ周辺のイントラ符号化された画素値のみを用いてイントラＭＢをイントラ符号化する。この際、イントラＭＢ周辺にイントラ符号化された画素値が存在しない場合には、制約付きイントラでは、所定の固定値を用いてイントラＭＢをイントラ符号化する。 Next, the higher-order syntax generation unit 106 generates higher-order syntax that includes information necessary for decoding and indicates a restricted intra (step S106). For example, the image coding method of the encoder core unit U3 is the above-described H.264. H.264 / MPEG-4 AVC or H.264 In the case of H.265 / MPEG-H HEVC, the upper syntax generation unit 106 generates SPS (Sequence Parameter Set) and PPS (Picture Parameter Set) as upper syntax. SPS is a parameter set indicating information related to encoding of the entire sequence, and PPS is a parameter set indicating information related to encoding of the entire picture. For example, the encoder core unit U3 is H.264. H.264 / MPEG-4 AVC or H.264 When the image coding method corresponding to the restricted intra designation such as H.265 / MPEG-H HEVC is adopted, the upper syntax generation unit 106 designates the restricted intra using the PPS. In the intra with restrictions, when the pixel values around the intra MB are inter-coded, the intra MB is not used but is used only with the intra-coded pixel values around the intra MB. Encode. At this time, if there is no intra-coded pixel value around the intra MB, the intra MB is intra-coded using a predetermined fixed value in the restricted intra.

上位シンタックス生成部１０６は、これらＳＰＳ及びＰＰＳ等のパラメータセットを上位シンタックスとして表す上位シンタックス情報Ｄ０６をビットストリーム生成部３０７に供給する。 The upper syntax generation unit 106 supplies the bit stream generation unit 307 with higher syntax information D06 representing the parameter set such as SPS and PPS as the upper syntax.

尚、エンコーダコア部Ｕ３が、複数の上位シンタックスのうちから、圧縮符号化処理で用いる上位シンタックスをピクチャ又はスライス単位で選択可能な画像符号方式を採用している場合、上位シンタックス生成部１０６は、以下のように上位シンタックス情報Ｄ０６をビットストリーム生成部３０７に供給する。 In addition, when the encoder core unit U3 employs an image coding method in which the upper syntax used in the compression encoding process can be selected in units of pictures or slices from among a plurality of upper syntaxes, the upper syntax generation unit 106 supplies the upper syntax information D06 to the bitstream generation unit 307 as follows.

つまり、上位シンタックス生成部１０６は、エンコーダコア部Ｕ３がピクチャ又はスライス毎に上位シンタックスを選択する度に、その選択された上位シンタックスを示す上位シンタックス情報Ｄ０６をビットストリーム生成部３０７に供給する。スライスは、１ピクチャ内の複数のマクロブロックからなる処理単位を表す。 That is, each time the encoder core unit U3 selects an upper syntax for each picture or slice, the upper syntax generation unit 106 transmits upper syntax information D06 indicating the selected upper syntax to the bitstream generation unit 307. Supply. A slice represents a processing unit composed of a plurality of macroblocks in one picture.

上位シンタックス生成部１０６は、エンコーダコア部Ｕ３が例えばＭＰＥＧ−２等のように、制約付きイントラが想定されていない画像符号方式を採用している場合には、その画像符号方式で規定されている通常の上位シンタックスを生成する。 If the encoder core unit U3 employs an image coding method that does not assume a restricted intra, such as MPEG-2, the higher-order syntax generation unit 106 is defined by the image coding method. Generate the normal high-level syntax.

尚、上位シンタックス生成部１０６、予測モード判定部１０３、及びエンコーダコア部Ｕ３は、共にワーキングメモリＭに接続されており、当該ワーキングメモリＭを介して各種の画像情報や符号化情報を共有している。 Note that the upper syntax generation unit 106, the prediction mode determination unit 103, and the encoder core unit U3 are all connected to the working memory M, and share various image information and encoding information via the working memory M. ing.

次に、ビットストリーム生成部３０７が、エンコーダコア部Ｕ３から供給されたエレメンタリーストリーム信号Ｄ０５、及び上位シンタックス情報Ｄ０６を多重化したビットストリーム信号Ｄ０７を生成する（ステップＳ１０７）。 Next, the bit stream generation unit 307 generates a bit stream signal D07 obtained by multiplexing the elementary stream signal D05 supplied from the encoder core unit U3 and the upper syntax information D06 (step S107).

以上のように、映像符号化装置Ｘ１では、ＲＯＩ設定部３０１がＲＯＩを設定し、ピクチャ内でのＲＯＩの位置及びサイズを表すＲＯＩ情報Ｄ０１を生成する。そして、エンコーダ部Ｕ１が、入力画像信号Ｅ１及びＲＯＩ情報Ｄ０１を受け、当該ＲＯＩ情報Ｄ０１に基づいて入力画像信号Ｅ１に対して圧縮符号化処理を施してビットストリーム信号Ｄ０７を生成する。この際、エンコーダ部Ｕ１は、ＲＯＩ情報Ｄ０１に基づき、入力画像信号Ｅ１によるピクチャ内のＲＯＩに含まれる画像ブロックには圧縮符号化処理としてイントラ符号化を施し、非ＲＯＩに含まれる画像ブロック群に対してはイントラ符号化及びインター符号化のうちの一方を選択して施すようにしている。 As described above, in the video encoding device X1, the ROI setting unit 301 sets the ROI, and generates the ROI information D01 indicating the position and size of the ROI in the picture. Then, the encoder unit U1 receives the input image signal E1 and the ROI information D01, performs compression coding processing on the input image signal E1 based on the ROI information D01, and generates a bit stream signal D07. At this time, based on the ROI information D01, the encoder unit U1 performs intra coding on the image blocks included in the ROI in the picture based on the input image signal E1 as compression coding processing, and applies the image blocks included in the non-ROI to the image block group. On the other hand, one of intra coding and inter coding is selected and applied.

よって、映像符号化装置Ｘ１によれば、符号量の削減及び演算量の削減を両立させることが可能となる。 Therefore, according to the video encoding device X1, it is possible to achieve both a reduction in code amount and a reduction in calculation amount.

以下に、上記した映像符号化装置Ｘ１を採用した映像処理システムについて説明する。 Hereinafter, a video processing system employing the video encoding device X1 will be described.

図３は、映像処理システムの一例としてのＲＯＩ識別システムＲＩＳの構成を示すブロック図である。 FIG. 3 is a block diagram showing a configuration of an ROI identification system RIS as an example of a video processing system.

ＲＯＩ識別システムＲＩＳは、例えばビデオカメラ等によって撮影して得られた入力画像信号Ｅ１を受け、当該入力画像信号Ｅ１によって表されているピクチャ中から特定のＲＯＩを識別するシステムである。尚、特定のＲＯＩとは、予め登録されている例えば人の顔を含むＲＯＩである。 The ROI identification system RIS is a system that receives an input image signal E1 obtained by photographing with a video camera or the like, for example, and identifies a specific ROI from the picture represented by the input image signal E1. The specific ROI is an ROI including a human face registered in advance, for example.

図３に示すように、ＲＯＩ識別システムＲＩＳは、映像符号復号システムＶＣＤと、ＲＯＩ認識装置Ｘ４と、を有する。 As shown in FIG. 3, the ROI identification system RIS includes a video code decoding system VCD and an ROI recognition device X4.

映像符号復号システムＶＣＤは、図１に示される映像符号化装置Ｘ１の他に、復号画像選択装置Ｘ２及び映像復号化装置Ｘ３を含む。 The video code decoding system VCD includes a decoded image selection device X2 and a video decoding device X3 in addition to the video encoding device X1 shown in FIG.

映像符号化装置Ｘ１は、上記した入力画像信号Ｅ１を受け、当該入力画像信号Ｅ１に前述した圧縮符号化処理を施してビットストリーム信号Ｄ０７を生成する。映像符号化装置Ｘ１は、このビットストリーム信号Ｄ０７を、ＬＡＮ（Local Area Network）、或いは公衆回線等の任意の通信路を介して復号画像選択装置Ｘ２に供給する。 The video encoding device X1 receives the above-described input image signal E1, and performs the above-described compression encoding processing on the input image signal E1 to generate the bit stream signal D07. The video encoding device X1 supplies the bit stream signal D07 to the decoded image selection device X2 via an arbitrary communication path such as a LAN (Local Area Network) or a public line.

復号画像選択装置Ｘ２は、かかるビットストリーム信号Ｄ０７、及び起動イベントＥ２を受け、以下の復号画像選択処理を行う。 The decoded image selection device X2 receives the bit stream signal D07 and the activation event E2, and performs the following decoded image selection processing.

図４は、かかる復号画像選択処理の一例を表すフロー図である。図４に示すように、先ず、復号画像選択装置Ｘ２は、ビットストリーム信号Ｄ０７から、上位シンタックス情報Ｄ０６としてのパラメータセット（ＳＰＳ、ＰＰＳ）を抽出し、保持する（ステップＳ５０１）。 FIG. 4 is a flowchart showing an example of such a decoded image selection process. As shown in FIG. 4, first, the decoded image selection device X2 extracts and holds a parameter set (SPS, PPS) as the upper syntax information D06 from the bit stream signal D07 (step S501).

次に、復号画像選択装置Ｘ２は、ビットストリーム信号Ｄ０７から、当該ビットストリーム信号Ｄ０７に含まれるエレメンタリーストリーム信号Ｄ０５を復号の処理単位毎に区切ったアクセスユニットの系列を得る（ステップＳ５０２１）。次に、復号画像選択装置Ｘ２は、アクセスユニット毎にそのアクセスユニットに対応したピクチャにＲＯＩが含まれているか否かを判定する（ステップＳ５０２２）。 Next, the decoded image selection device X2 obtains, from the bit stream signal D07, an access unit sequence obtained by dividing the elementary stream signal D05 included in the bit stream signal D07 for each decoding processing unit (step S5021). Next, the decoded image selection device X2 determines whether or not the ROI is included in the picture corresponding to the access unit for each access unit (step S5022).

なお、アクセスユニットに対応したピクチャにＲＯＩが含まれているか否かを判定する方法としては、ビットストリーム信号Ｄ０７を復号、あるいはアクセスユニットのみを復号し、その復号結果に基づきＲＯＩの検出を行ってＲＯＩが含まれているか否かの判定を行う。或いは、先ず、映像符号化装置Ｘ１のビットストリーム生成部３０７において、ＲＯＩ情報Ｄ０１を多重化したビットストリーム信号Ｄ０７を復号画像選択装置Ｘ２に供給する。 As a method for determining whether or not the picture corresponding to the access unit includes the ROI, the bit stream signal D07 is decoded or only the access unit is decoded and the ROI is detected based on the decoding result. It is determined whether or not the ROI is included. Alternatively, first, the bit stream generation unit 307 of the video encoding device X1 supplies the bit stream signal D07 obtained by multiplexing the ROI information D01 to the decoded image selection device X2.

この多重化の方法としては、例えばＨ．２６４／ＭＰＥＧ−４ＡＶＣ、或いはＨ．２６５／ＭＰＥＧ−ＨＨＥＶＣの場合、"User unregistered SEI message"を使用する。そして、復号画像選択装置Ｘ２側で当該ビットストリーム信号Ｄ０７からＲＯＩ情報Ｄ０１を抽出することにより、ＲＯＩが含まれているか否かの判定を行うのである。 As this multiplexing method, for example, H.264. H.264 / MPEG-4 AVC or H.264 In the case of H.265 / MPEG-H HEVC, “User unregistered SEI message” is used. Then, the ROI information D01 is extracted from the bit stream signal D07 on the decoded image selection device X2 side to determine whether or not the ROI is included.

ステップＳ５０２２において、アクセスユニットに対応するピクチャにＲＯＩが含まれていると判定された場合、復号画像選択装置Ｘ２は、上記したステップＳ５０１で保持したパラメータセットを表すパラメータセット情報Ｄ０８、及びアクセスユニットの系列を表すアクセスユニット系列Ｄ０９を映像復号装置Ｘ３に供給する（ステップＳ５０２３）。 If it is determined in step S5022 that the ROI is included in the picture corresponding to the access unit, the decoded image selection device X2 receives the parameter set information D08 representing the parameter set held in step S501 described above, and the access unit The access unit sequence D09 representing the sequence is supplied to the video decoding device X3 (step S5023).

そして、かかるステップＳ５０２３の終了後、復号画像選択装置Ｘ２は、図４に示す復号画像選択処理を終了する。 And after completion | finish of this step S5023, the decoding image selection apparatus X2 complete | finishes the decoding image selection process shown in FIG.

また、上記ステップＳ５０２２において、アクセスユニットに対応するピクチャにＲＯＩが含まれていないと判定されたときも、復号画像選択装置Ｘ２は、図４に示す復号画像選択処理を終了する。よって、この際、復号画像選択装置Ｘ２は、ビットストリーム信号Ｄ０７を受けたにも拘わらず、映像復号装置Ｘ３へのパラメータセット情報Ｄ０８及びアクセスユニット系列Ｄ０９の供給は行わない。 Also, when it is determined in step S5022 that the picture corresponding to the access unit does not include the ROI, the decoded image selection device X2 ends the decoded image selection process shown in FIG. Therefore, at this time, the decoded image selection device X2 does not supply the parameter set information D08 and the access unit sequence D09 to the video decoding device X3 even though it receives the bit stream signal D07.

映像復号装置Ｘ３は、アクセスユニット系列Ｄ０９に、パラメータセット情報Ｄ０８に基づく復号処理を施して得られた出力画像信号Ｅ３を、ＲＯＩ認識装置Ｘ４に供給する。尚、映像復号装置Ｘ３は、アクセスユニット系列Ｄ０９がフレーム間符号化されたものであっても、参照ピクチャの復号を行うことなくアクセスユニット系列Ｄ０９に復号処理を施す。 The video decoding device X3 supplies the output image signal E3 obtained by subjecting the access unit sequence D09 to decoding processing based on the parameter set information D08, to the ROI recognition device X4. Note that the video decoding device X3 performs a decoding process on the access unit sequence D09 without decoding the reference picture even if the access unit sequence D09 is inter-frame encoded.

つまり、映像復号装置Ｘ３は、参照ピクチャの復号を行う代わりに任意の仮想ピクチャを生成し、この仮想ピクチャを参照ピクチャとみなして復号を行う。尚、仮想ピクチャとは、例えば、全ての画素値を最低の輝度値「０」、又は中間の輝度値「１２８」、或いは最大の輝度値「２５５」等に固定したピクチャである。 That is, the video decoding device X3 generates an arbitrary virtual picture instead of decoding the reference picture, and performs decoding by regarding the virtual picture as a reference picture. Note that the virtual picture is, for example, a picture in which all pixel values are fixed to a minimum luminance value “0”, an intermediate luminance value “128”, a maximum luminance value “255”, or the like.

ＲＯＩ認識装置Ｘ４は、出力画像信号Ｅ３及び登録画像信号Ｅ４を受ける。尚、登録画像信号Ｅ４は、例えば予め登録されている一人又は複数の人の顔と、その顔を識別する為の識別情報（例えば氏名）とを表す画像信号である。ＲＯＩ認識装置Ｘ４は、例えば顔認識エンジンを利用して、出力画像信号Ｅ３中に含まれるＲＯＩに、登録画像信号Ｅ４にて表される顔が含まれているか否かを判定する。ＲＯＩ認識装置Ｘ４は、出力画像信号Ｅ３中に、登録画像信号Ｅ４にて表される顔が含まれていると判定した場合には、その顔に対応した識別情報を出力する。 The ROI recognition device X4 receives the output image signal E3 and the registered image signal E4. The registered image signal E4 is an image signal representing, for example, one or more faces registered in advance and identification information (for example, name) for identifying the faces. The ROI recognition device X4 determines whether the face represented by the registered image signal E4 is included in the ROI included in the output image signal E3 by using, for example, a face recognition engine. When the ROI recognition apparatus X4 determines that the face represented by the registered image signal E4 is included in the output image signal E3, the ROI recognition device X4 outputs identification information corresponding to the face.

以上のように、映像符号復号システムＶＣＤでは、復号画像選択装置Ｘ２により、アクセスユニットの系列中から、ＲＯＩを含むピクチャに対応したアクセスユニットのみを選択して映像復号装置Ｘ３に供給するようにている。よって、映像復号装置Ｘ３はＲＯＩを含むピクチャのみを復号するので、図３においてＲＯＩ認識装置Ｘ４を一例とした下流のシステムで必要となる、ＲＯＩを復号する為に必要となる演算量を抑えることが可能となる。 As described above, in the video code decoding system VCD, the decoded image selection device X2 selects only the access unit corresponding to the picture including the ROI from the access unit sequence and supplies it to the video decoding device X3. Yes. Therefore, since the video decoding device X3 decodes only the picture including the ROI, the amount of calculation required for decoding the ROI, which is necessary in the downstream system in FIG. 3 using the ROI recognition device X4 as an example, is suppressed. Is possible.

ところで、図４に示す復号画像選択処理では、アクセスユニットに対応したピクチャにＲＯＩが含まれている場合には、常に、当該ＲＯＩを含むピクチャに対応したアクセスユニットを映像復号装置Ｘ３に供給する。しかしながら、起動イベントＥ２を受けたときにのみ、ＲＯＩを含むピクチャに対応したアクセスユニットを映像復号装置Ｘ３に供給しても良い。 By the way, in the decoded image selection process shown in FIG. 4, when the ROI is included in the picture corresponding to the access unit, the access unit corresponding to the picture including the ROI is always supplied to the video decoding device X3. However, the access unit corresponding to the picture including the ROI may be supplied to the video decoding device X3 only when the activation event E2 is received.

図５は、かかる点に鑑みて為された復号画像選択処理の他の一例を示すフロー図である。図５において、復号画像選択装置Ｘ２は、ビットストリーム信号Ｄ０７をデータベースに蓄積する（ステップＳ６０１）。ここで、起動イベントＥ２を受けると、復号画像選択装置Ｘ２は、データベースからビットストリーム信号Ｄ０７に含まれるパラメータセット、及びＲＯＩを含むアクセスユニットのみを抽出する。そして、復号画像選択装置Ｘ２は、抽出したパラメータセットを表すパラメータセット情報Ｄ０８、及びＲＯＩを含むアクセスユニットの系列からなるアクセスユニット系列Ｄ０９を出力する。 FIG. 5 is a flowchart showing another example of the decoded image selection process performed in view of such points. In FIG. 5, the decoded image selection device X2 accumulates the bit stream signal D07 in the database (step S601). Here, when receiving the activation event E2, the decoded image selection device X2 extracts only the parameter unit included in the bit stream signal D07 and the access unit including the ROI from the database. Then, the decoded image selection device X2 outputs parameter set information D08 representing the extracted parameter set, and an access unit sequence D09 including an access unit sequence including the ROI.

復号画像選択装置Ｘ２は、図５に示される復号画像選択処理を行うことにより、起動イベントＥ２が発せられたときにのみ、ＲＯＩの含まれるピクチャに対応したアクセスユニットのみをデータベースから検索し、これを復号対象とする。これにより、図４に示される復号画像選択処理を採用した場合に比してＲＯＩの復号に必要な演算量が削減される。尚、図５に示される復号画像選択処理を採用した場合にも、図４に示される復号画像選択処理を採用した場合と同様に、ＲＯＩ認識装置Ｘ４においてＲＯＩを含むピクチャのみを認識対象とすることができる。よって、データベースに蓄積した過去のＲＯＩを低演算量で復号できるので、再認識処理の実用性を図ることが可能となる。 The decoded image selection device X2 searches the database for only the access unit corresponding to the picture including the ROI only when the activation event E2 is issued by performing the decoded image selection process shown in FIG. Is the decryption target. As a result, the amount of calculation required for ROI decoding is reduced as compared with the case where the decoded image selection processing shown in FIG. 4 is adopted. Even when the decoded image selection process shown in FIG. 5 is adopted, only the picture including the ROI is recognized in the ROI recognition device X4 as in the case where the decoded image selection process shown in FIG. 4 is adopted. be able to. Therefore, since past ROIs accumulated in the database can be decoded with a small amount of computation, it is possible to achieve practicality of re-recognition processing.

図６は、本発明に係る映像符号化装置Ｘ１の他の構成を示すブロック図である。尚、図６に示す構成では、上位シンタックス指定部２０４を新たに加え、上位シンタックス生成部１０６に代えて上位シンタックス生成部１０６ａを採用した点を除く他の構成は、図１に示されるものと同一である。よって、以下に、上位シンタックス指定部２０４及び上位シンタックス生成部１０６ａを中心に動作の説明を行う。 FIG. 6 is a block diagram showing another configuration of the video encoding device X1 according to the present invention. In the configuration shown in FIG. 6, other configurations except that a higher syntax specifying unit 204 is newly added and a higher syntax generation unit 106 a is employed instead of the higher syntax generation unit 106 are shown in FIG. 1. Is the same as Therefore, the operation will be described below with a focus on the upper syntax designation unit 204 and the upper syntax generation unit 106a.

上位シンタックス生成部１０６ａは、制約付きイントラを指定した上位シンタックス（制約付き上位シンタックスと称する）と共に、制約を受けない通常のイントラを指定した上位シンタックス（通常シンタックスと称する）を生成する。上位シンタックス生成部１０６ａは、制約付き上位シンタックス及び通常シンタックスを表す上位シンタックス情報Ｄ０６をビットストリーム生成部３０７に供給する。 The high-order syntax generation unit 106a generates high-order syntax (referred to as normal syntax) that specifies a normal intra that is not subject to restriction, together with high-order syntax (referred to as restricted high-order syntax) that specifies a restricted intra. To do. The upper syntax generation unit 106 a supplies the upper stream syntax information D06 indicating the upper syntax with restrictions and the normal syntax to the bitstream generation unit 307.

上位シンタックス指定部２０４は、ＲＯＩ情報Ｄ０１を受け、当該ＲＯＩ情報Ｄ０１に基づき、圧縮符号化処理に用いる上位シンタックス（通常シンタックス、制約付き上位シンタックス）を指定する上位シンタックス指定信号Ｄ０４を生成し、エンコーダコア部Ｕ３に供給する。 The upper syntax designating unit 204 receives the ROI information D01, and based on the ROI information D01, an upper syntax designating signal D04 that designates an upper syntax (normal syntax, constrained upper syntax) used for compression encoding processing. Is supplied to the encoder core unit U3.

エンコーダコア部Ｕ３は、ＲＯＩ情報Ｄ０１、予測モード情報Ｄ０３及び上位シンタックス指定信号Ｄ０４によって指定された上位シンタックスに基づき、入力画像信号Ｅ１に圧縮符号化処理を施す。エンコーダコア部Ｕ３は、かかる圧縮符号化処理によって得られたエレメンタリーストリーム信号Ｄ０５を、ビットストリーム生成部３０７に供給する。 The encoder core unit U3 performs compression coding processing on the input image signal E1 based on the upper syntax specified by the ROI information D01, the prediction mode information D03, and the higher syntax specification signal D04. The encoder core unit U3 supplies the elementary stream signal D05 obtained by the compression encoding process to the bit stream generation unit 307.

図７は、図６に示す構成を有する映像符号化装置Ｘ１で実施される映像符号化処理を示すフロー図である。尚、図７に示すフローでは、上記したステップＳ１０５及びＳ１０６に代えてＳ１０５ａ及びＳ１０６ａを採用し、更にステップＳ１０３の後にステップＳ１０４を追加した点を除く他のステップ（Ｓ１０１〜Ｓ１０３、Ｓ１０７）は、図２に示すフローと同一である。よって、ステップＳ１０４、Ｓ１０５ａ及びＳ１０６ａを中心に各動作を説明する。 FIG. 7 is a flowchart showing a video encoding process performed by the video encoding device X1 having the configuration shown in FIG. In the flow shown in FIG. 7, S105a and S106a are adopted instead of the above-described steps S105 and S106, and other steps (S101 to S103, S107) except that step S104 is added after step S103 are as follows. This is the same as the flow shown in FIG. Therefore, each operation will be described focusing on steps S104, S105a, and S106a.

すなわち、ステップＳ１０３の実行後、上位シンタックス指定部２０４が、ＲＯＩ情報Ｄ０１に基づき、ＲＯＩが含まれるピクチャについては制約付きイントラを指定し、ＲＯＩが含まれていないピクチャについては制約を受けない通常のイントラを指定する上位シンタックス指定信号Ｄ０４を生成する（ステップＳ１０４）。尚、ＲＯＩのみで構成されるピクチャに対しては、いずれの上位シンタックスを指定しても良い。 That is, after execution of step S103, the higher-level syntax specifying unit 204 specifies a restricted intra for a picture that includes an ROI based on the ROI information D01, and is not subject to a restriction for a picture that does not include an ROI. The higher-level syntax designation signal D04 that designates the intra is generated (step S104). It should be noted that any upper syntax may be specified for a picture composed only of ROI.

次に、エンコーダコア部Ｕ３が、予測モード情報Ｄ０３及び上位シンタックス指定信号Ｄ０４に基づく圧縮符号化処理を入力画像信号Ｅ１に施して得られたエレメンタリーストリーム信号Ｄ０５を出力する（ステップＳ１０５ａ）。ここで、上位シンタックス指定信号Ｄ０４が制約付きイントラを指定した上位シンタックスを示す場合、エンコーダコア部Ｕ３は、イントラ予測モードで符号化すると決定したブロックを、制約付きイントラで符号化する。そして、エンコーダコア部Ｕ３は、エレメンタリーストリーム信号Ｄ０５中の所定の位置に、制約付きイントラを指定した上位シンタックス示すインデックスを含ませる。例えば、Ｈ．２６４／ＭＰＥＧ−４ＡＶＣの場合、スライスヘッダに、使用する上位シンタックスを指定する領域があるので、その領域に制約付きイントラを指定した上位シンタックスを示すインデックスを含ませる。 Next, the encoder core unit U3 outputs an elementary stream signal D05 obtained by subjecting the input image signal E1 to compression encoding processing based on the prediction mode information D03 and the upper syntax designation signal D04 (step S105a). Here, when the higher-order syntax designation signal D04 indicates the higher-order syntax designating the restricted intra, the encoder core unit U3 codes the block determined to be coded in the intra prediction mode, with the restricted intra. Then, the encoder core unit U3 includes an index indicating the upper syntax specifying the restricted intra at a predetermined position in the elementary stream signal D05. For example, H.M. In the case of H.264 / MPEG-4 AVC, the slice header has an area for designating the upper syntax to be used. Therefore, an index indicating the upper syntax for designating the restricted intra is included in the area.

また、上位シンタックス指定信号Ｄ０４が通常の上位シンタックスを示す場合には、エンコーダコア部Ｕ３は、イントラ予測モードで符号化すると決定したブロックを、制約無しの通常のイントラで符号化する。そして、エンコーダコア部Ｕ３は、エレメンタリーストリーム信号Ｄ０５中の所定の位置に、通常の上位シンタックス示すインデックスを含ませる。 Further, when the upper syntax designation signal D04 indicates the normal higher syntax, the encoder core unit U3 encodes the block determined to be encoded in the intra prediction mode, with the normal intra without restriction. Then, the encoder core unit U3 includes an index indicating a normal higher-order syntax at a predetermined position in the elementary stream signal D05.

次に、上位シンタックス生成部１０６ａが、制約付きイントラを指定した上位シンタックス、及び通常の上位シンタックスを生成し、両者を示す上位シンタックス情報Ｄ０６を生成する（ステップＳ１０６ａ）。ここで、制約付きイントラを指定した上位シンタックスと、通常の上位シンタックスとの差は、制約付きイントラを指定しているか否かである。例えば、Ｈ．２６４／ＭＰＥＧ−４ＡＶＣの場合、ＰＰＳに含まれる"constrained_intra_pred_flag"によってその差が表される。 Next, the upper syntax generation unit 106a generates an upper syntax specifying a restricted intra and a normal upper syntax, and generates upper syntax information D06 indicating both (step S106a). Here, the difference between the upper syntax specifying the restricted intra and the normal higher syntax is whether or not the restricted intra is specified. For example, H.M. In the case of H.264 / MPEG-4 AVC, the difference is represented by “constrained_intra_pred_flag” included in the PPS.

尚、上位シンタックスとしては、ＰＰＳだけではなく、例えばＨ．２６４ではＳＰＳ等を含むことも考えられる。その場合、ＰＰＳ以外の上位シンタックスについては、制約付きイントラを指定した上位シンタックスと、通常の上位シンタックスとを個別に生成するのではなく、１つの上位シンタックスを、制約付きイントラを指定した上位シンタックスと、通常の上位シンタックスとで共通化しても良い。 Note that the upper syntax includes not only PPS but also H.264. H.264 may include SPS and the like. In this case, for higher-level syntax other than PPS, a higher-level syntax specifying a constrained intra and a normal higher-level syntax are not generated separately, but a single higher-level syntax is specified as a constrained intra. The higher-level syntax and the normal higher-level syntax may be shared.

また、制約付きイントラを指定した上位シンタックスで表される情報群内に、通常の上位シンタックスで表される情報群に含まれる情報片と同一の情報片が存在する場合には、この同一の情報片を、制約付きイントラを指定した上位シンタックスと、通常の上位シンタックスとで共有しても良い。すなわち、上位シンタックス生成部１０６ａは、先ず、制約付きイントラを指定した上位シンタックスを生成する。そして、上記同一の情報片を省いた通常の上位シンタックスを生成し、この生成した通常の上位シンタックスに上記同一の情報片を含ませたものを最終的な通常の上位シンタックスとする。これにより、上位シンタックス生成部１０６ａでの動作負荷が軽減される。 If the same information piece as the information piece included in the information group represented by the normal upper syntax exists in the information group represented by the upper syntax specifying the restricted intra, this same These pieces of information may be shared between the upper syntax specifying the restricted intra and the normal higher syntax. That is, the upper syntax generation unit 106a first generates an upper syntax specifying a restricted intra. Then, a normal upper syntax in which the same piece of information is omitted is generated, and a final normal upper syntax is obtained by including the same information piece in the generated normal upper syntax. As a result, the operation load on the upper syntax generation unit 106a is reduced.

以上のように、実施例２の構成では、非ＲＯＩのみで構成されているが故に制約付きイントラの必要性がないピクチャ又はスライスについては、通常のイントラＭＢとして符号化している。ここで、制約付きイントラよりも通常のイントラの方が符号効率が高い傾向にあるため、制約付きイントラの必要性が無いピクチャやスライスに対しては、通常のイントラＭＢとして符号化することにより符号量の削減が図られる。 As described above, in the configuration of the second embodiment, a picture or a slice that does not need a restricted intra because it is configured only by a non-ROI is encoded as a normal intra MB. Here, since a normal intra tends to have higher code efficiency than a constrained intra, a picture or a slice that does not require a constrained intra is encoded by encoding as a normal intra MB. The amount can be reduced.

尚、上記した実施例１及び２では、例えば前処理部３０５Ａを設けることにより、ＲＯＩを含むピクチャに対しては画像品質を抑えて符号量の削減を図る処理を施すようにしているが、必ずしも、このような画像品質を抑える品質制御を行う必要はない。この際、予測モード可変エンコーダ部Ｕ２はＲＯＩ情報Ｄ０１を受ける必要がなくなり、エンコーダコア部Ｕ３の前処理部３０５Ａが不要となる。 In the first and second embodiments described above, for example, by providing the preprocessing unit 305A, processing for reducing the code amount by suppressing the image quality is performed on the picture including the ROI. Therefore, it is not necessary to perform quality control to suppress such image quality. At this time, the prediction mode variable encoder unit U2 does not need to receive the ROI information D01, and the preprocessing unit 305A of the encoder core unit U3 becomes unnecessary.

また、互いに異なる基準で設定した２種類のＲＯＩのうちの一方を制約付きイントラでの符号化対象とし、他方を上記したような品質制御の対象として圧縮符号化処理を施すようにしても良い。この際、ＲＯＩ設定部３０１は、互いに異なる基準で設定した２系統のＲＯＩを表すＲＯＩ情報Ｄ０１を生成する。そして、ＲＯＩ情報Ｄ０１にて表されるＲＯＩのうちで有用度が最大となるＲＯＩを制約付きイントラの符号化対象とし、その他のＲＯＩを品質制御を優先とした符号化の対象とする。 Further, one of the two types of ROIs set based on different standards may be set as the encoding target in the restricted intra, and the other may be subjected to the compression encoding process as the quality control target as described above. At this time, the ROI setting unit 301 generates ROI information D01 representing two systems of ROIs set based on different standards. Then, among the ROIs represented by the ROI information D01, the ROI having the maximum usefulness is set as the encoding target of the restricted intra, and the other ROIs are set as the encoding targets with priority on the quality control.

尚、有用度が最大となるＲＯＩとは、例えばサイズが最大となるＲＯＩである。ここで、複数のＲＯＩのうちで、制約付きイントラで符号化するＲＯＩを限定する方法の一例を以下に説明する。すなわち、着目するＲＯＩが動画像の前後のピクチャに含まれるか否かを判定し、含まれる場合にはそれぞれの有用度（例えばＲＯＩのサイズ）を計算する。そして、その有用度が最大であるか否かを判定し、最大であると判定されたＲＯＩを制約付きイントラの符号化対象とする。 The ROI having the maximum usefulness is, for example, the ROI having the maximum size. Here, an example of a method for limiting the ROI to be encoded with the restricted intra from among the plurality of ROIs will be described below. That is, it is determined whether or not the ROI of interest is included in the pictures before and after the moving image, and when included, the usefulness (for example, the size of the ROI) is calculated. Then, it is determined whether or not the usefulness is the maximum, and the ROI determined to be the maximum is set as the encoding target of the restricted intra.

このように、例えば実施例１及び２の方法で、制約付きイントラで符号化するＲＯＩを、品質制御の対象とするＲＯＩよりも厳しい基準で設定したものとすることで、ＲＯＩの復号に必要な演算量を最小にしつつ、符号量の増加を最小限に抑えることが可能となる。 As described above, for example, by using the method of the first and second embodiments, the ROI to be encoded with the restricted intra is set based on a stricter standard than the ROI targeted for quality control, which is necessary for the ROI decoding. It is possible to minimize the increase in the code amount while minimizing the calculation amount.

また、実施例１及び２では、ＲＯＩ設定部３０１は、ユーザによって指定された領域をＲＯＩとして設定する、或いは入力画像信号Ｅ１から検出したＲＯＩをＲＯＩとして設定している。ユーザによって指定された領域をＲＯＩとして設定する場合には、入力画像信号Ｅ１をＲＯＩ設定部３０１に供給する必要は無い。 In the first and second embodiments, the ROI setting unit 301 sets the region specified by the user as the ROI, or sets the ROI detected from the input image signal E1 as the ROI. When the region designated by the user is set as the ROI, it is not necessary to supply the input image signal E1 to the ROI setting unit 301.

また、ＲＯＩを設定する方法としては、上記した方法に限定されない。例えば赤外線カメラで撮影した画像や、深度センサで撮影した画像を用いてＲＯＩの設定を行うようにしても良い。この際、赤外線カメラで撮影した画像や、深度センサで撮影した画像を表す信号を入力画像信号Ｅ１としてＲＯＩ設定部３０１に供給する。 Further, the method for setting the ROI is not limited to the method described above. For example, the ROI may be set using an image taken with an infrared camera or an image taken with a depth sensor. At this time, an image captured by an infrared camera or a signal representing an image captured by a depth sensor is supplied to the ROI setting unit 301 as an input image signal E1.

また、図１及び図６に示される映像符号化装置Ｘ１による映像符号化処理をハードウェアで実現しているが、当該映像符号化処理をソフトウェアで実現しても良い。 Moreover, although the video encoding process by the video encoding device X1 shown in FIGS. 1 and 6 is realized by hardware, the video encoding process may be realized by software.

図８は、この映像符号化処理をソフトウェアで実現する情報処理装置の構成の一例を示すブロック図である。当該情報処理装置は、制御部としてのＣＰＵ（Central Processing Unit）１００１、ＲＡＭ（Random Access Memory）１００２、ＲＯＭ（Read Only Memory）１００３及び入出力部１００４を含む。ＲＯＭ１００３には、図２又は図７に示される映像符号化処理に従ったプログラムが記憶されている。ＣＰＵ１００１は、ＲＯＭ１００３に記憶されているプログラムを実行することにより、入出力部１００４を介して入力された入力画像信号Ｅ１に基づきビットストリーム信号Ｄ０７を生成し、これを入出力部１００４を介して出力する。 FIG. 8 is a block diagram showing an example of the configuration of an information processing apparatus that implements this video encoding processing by software. The information processing apparatus includes a central processing unit (CPU) 1001, a random access memory (RAM) 1002, a read only memory (ROM) 1003, and an input / output unit 1004 as control units. The ROM 1003 stores a program according to the video encoding process shown in FIG. The CPU 1001 generates a bit stream signal D07 based on the input image signal E1 input via the input / output unit 1004 by executing a program stored in the ROM 1003, and outputs this through the input / output unit 1004. To do.

１０２イントラ予測モード指定部
１０３予測モード判定部
１０６上位シンタックス生成部
３０７ビットストリーム生成部
Ｕ１エンコーダ部
Ｕ２予測モード可変エンコーダ部
Ｕ３エンコーダコア部
Ｘ１映像符号化装置
Ｘ２復号画像選択装置
Ｘ３映像復号装置
Ｘ４ＲＯＩ認識装置
ＶＣＤ映像符号復号システム

102 Intra prediction mode designating unit 103 Prediction mode determining unit 106 Upper syntax generating unit 307 Bit stream generating unit U1 Encoder unit U2 Predictive mode variable encoder unit U3 Encoder core unit X1 Video encoding device X2 Decoded image selecting device X3 Video decoding device X4 ROI recognition device VCD video code decoding system

Claims

A video encoding device that generates a bitstream signal by performing compression encoding processing on an input image signal,
An ROI setting unit that sets ROI (Region Of Interest) in a picture and generates ROI information indicating the position and size of the ROI in the picture;
An encoder unit that receives the input image signal and the ROI information, and performs a compression encoding process on the input image signal based on the ROI information to generate a bitstream signal;
The encoder unit is
Based on the ROI information, the image block included in the ROI in the picture in the input image signal is subjected to intra encoding as the compression encoding process, and the image block included in the non-ROI is subjected to the compression. A video encoding apparatus, wherein one of the intra encoding and the inter encoding is selected and applied as an encoding process.

The encoder unit is
An intra prediction mode designating unit that generates an intra prediction mode designating signal for designating an intra prediction mode for an image block included in the ROI in a picture in the input image signal based on the ROI information;
When the intra prediction mode designation signal is generated, the intra prediction mode is indicated. When the intra prediction mode designation signal is not generated, the prediction mode information indicating the intra prediction mode or the inter prediction mode is generated. A mode determination unit;
When the prediction mode information indicates the intra prediction mode, the intra coding is performed on the input image signal, and when the prediction mode information indicates the inter prediction mode, the inter coding is performed on the input image signal. An encoder core unit for generating an elementary stream signal,
An upper syntax generation unit that generates upper syntax information representing information necessary for decoding as upper syntax;
The video encoding apparatus according to claim 1, further comprising: a bit stream generation unit that generates a signal obtained by multiplexing the elementary stream signal and the upper syntax information as the bit stream signal.

The video encoding apparatus according to claim 2, further comprising a working memory that shares information among the prediction mode determination unit, the encoder core unit, and the higher syntax generation unit.

The upper syntax generation unit generates the upper syntax information representing the upper syntax for designating a restricted intra;
The constrained intra uses only the intra-encoded pixel value or the predetermined value as the peripheral pixel value when performing the intra-encoding using the peripheral pixel value of the image block included in the ROI. 4. The video encoding apparatus according to claim 2, wherein the video encoding apparatus is urged to be used.

The encoder core unit selects the upper syntax used in the compression encoding process from a plurality of upper syntaxes in units of pictures or slices,
The high-order syntax generation unit generates the high-order syntax information indicating the selected high-order syntax every time the encoder core unit selects the high-order syntax. Video encoding device.

The encoder unit is
Based on the ROI information, the first upper syntax designating the restricted intra is specified for a picture including both the ROI and the non-ROI, and there is no restriction for a picture including only the non-ROI. An upper syntax specifying unit for generating an upper syntax specifying signal indicating a second upper syntax for specifying an intra;
The encoder core unit performs the compression encoding process on the input image signal according to the upper syntax specified by the upper syntax specifying signal,
6. The high-order syntax generation unit generates the high-order syntax information that represents the first high-order syntax and the second high-order syntax, respectively. 6. The video encoding device described.

The encoder core part is
When the higher syntax designation signal indicates the first higher syntax, the image block to be subjected to the intra coding is encoded according to the restricted intra and the first higher syntax. An index indicating a tax is included in the elementary stream signal,
When the higher-order syntax designation signal indicates the second higher-order syntax, the image block to be subjected to the intra coding is encoded according to the intra without restriction, and the first The video encoding apparatus according to claim 6, wherein the index indicating a higher-order syntax of 2 is included in the elementary stream signal.

When generating the first and second upper syntaxes, the upper syntax generation unit is represented by the second upper syntax in an information group represented by the first upper syntax. If there is an information piece identical to the information piece included in the information group,
The first high-order syntax is generated, and the high-order syntax in which the same piece of information is omitted is generated. The high-order syntax in which the same piece of information is omitted is included in the first high-order syntax. The video encoding apparatus according to claim 6 or 7, wherein the same information piece is included in the second upper syntax.

The encoder unit is
The video according to any one of claims 2 to 8, further comprising a processing unit that compresses the ROI and the non-ROI included in the input image signal with different video quality based on the ROI information. Encoding device.

The ROI setting unit sets a first ROI and a second ROI different from the first ROI, a position and a size in the picture of the first ROI, and a picture in the second ROI Generating information representing the position and size in the ROI information,
The intra prediction mode designating unit designates the intra prediction mode for an image block included in the first ROI based on the ROI information,
The video encoding device according to claim 9, wherein the processing unit performs the compression on a picture including the second ROI based on the ROI information.

The video encoding apparatus according to claim 10, wherein the size of the first ROI is larger than the size of the second ROI.

An image encoding / decoding system including: a video encoding device that performs compression encoding processing on an input image signal to generate a bitstream signal; and a video decoding device that performs decoding processing on the bitstream signal,
The video encoding device includes:
An ROI setting unit that sets ROI (Region Of Interest) in a picture and generates ROI information indicating the position and size of the ROI in the picture;
An upper syntax generation unit that generates upper syntax information including a parameter set necessary for decoding;
A bit stream obtained by multiplexing the elementary stream signal obtained by receiving the input image signal and the ROI information and subjecting the input image signal to compression encoding processing based on the ROI information, and the upper syntax information An encoder unit for outputting a signal,
The encoder unit performs intra coding as the compression coding process on an image block included in the ROI in the picture in the input image signal based on the ROI information, and converts the image block included in the non-ROI to the image block included in the non-ROI For the compression encoding process, one of the intra encoding and inter encoding is selected and applied,
The video decoding device includes:
A decoded image selection unit that receives the bit stream signal and extracts the parameter set included in the higher-level syntax information from the bit stream signal and an access unit corresponding to the elementary stream signal;
And a video decoding unit that generates an output image by performing a decoding process on the access unit based on the parameter set.

The image coding / decoding system according to claim 12, wherein the video decoding unit performs the decoding process on the access unit without decoding a reference picture.

When the access unit is the inter-coded one, the video decoding unit regards a predetermined virtual picture as the reference picture, and performs the decoding process on the access unit. The image encoding / decoding system according to claim 13.

15. The image code decoding according to claim 14, wherein the video decoding unit generates a picture having all pixel values as a minimum luminance value, an intermediate luminance value, or a maximum luminance value as the virtual picture. system.

The decoded image selection unit includes:
While holding the parameter set, it is determined whether or not the ROI is included in a picture corresponding to the access unit. If it is determined that the ROI is included, the held parameter and the access are determined. The image code decoding system according to claim 12, wherein a unit is supplied to the video decoding unit.

The decoded image selection unit includes a database for storing the bit stream signal,
16. The method according to claim 12, wherein when an activation event is given, only the access unit including the parameter set and the ROI is extracted from the database, and each is supplied to the video decoding unit. The image encoding / decoding system described.

A video encoding program executed by a control unit of an information processing apparatus that performs compression encoding processing on an input image signal to generate a bitstream signal,
A first step of setting an ROI (Region Of Interest) in a picture and generating ROI information representing the position and size of the ROI in the picture;
Receiving the input image signal and the ROI information, and performing a compression encoding process on the input image signal based on the ROI information to generate a bitstream signal, and
In the second step, based on the ROI information, an image block included in the ROI in a picture in the input image signal is subjected to intra encoding as the compression encoding process, and an image included in a non-ROI A video encoding program, wherein one of the intra encoding and inter encoding is selected and applied to a block as the compression encoding processing.