JP2021182728A

JP2021182728A - Image encoding device and image decoding device, method, and program

Info

Publication number: JP2021182728A
Application number: JP2020098885A
Authority: JP
Inventors: 浩司大川; Koji Okawa; 真悟志摩; Shingo Shima
Original assignee: Canon Inc
Current assignee: Canon Inc
Priority date: 2020-05-19
Filing date: 2020-06-05
Publication date: 2021-11-25

Abstract

To reduce the amount of information in parameter information more than before.SOLUTION: An image encoding device that encodes an image composed of N (N>1) rectangles so that each rectangle can be decoded independently includes a segmentation unit for dividing an image into N rectangles and an encoding unit that encodes position and size information of the rectangles. Here, the encoding unit encodes the position information of second to N-first rectangles and does not encode position information of first and Nth rectangles.SELECTED DRAWING: Figure 15

Description

本発明は、動画像の符号化技術に関するものである。 The present invention relates to a moving image coding technique.

デジタルビデオカメラ等に代表される撮像装置は、Ｈ．２６４（非特許文献１）、Ｈ．２６５（非特許文献２）等の圧縮符号化技術を利用して、撮像して得た動画像データを符号化する。この符号化では、１ピクチャを複数画素で構成されるブロックを単位に符号化して得た符号化データと、ブロックの再現に係るパラメータをＳＰＳ（Sequence Parameter Set）やＰＰＳ（Picture Parameter Set）に格納したデータとを多重化したビットストリームを生成する。 An image pickup device represented by a digital video camera or the like is described by H. 264 (Non-Patent Document 1), H. et al. The moving image data obtained by imaging is encoded by using a compression coding technique such as 265 (Non-Patent Document 2). In this coding, the coded data obtained by coding one picture in units of blocks composed of a plurality of pixels and the parameters related to the reproduction of the blocks are stored in SPS (Sequence Parameter Set) and PPS (Picture Parameter Set). Generate a bitstream that is multiplexed with the data.

近年、ＨＥＶＣの後継としてさらに高効率な符号化方式の国際標準化を行う活動が開始された。ＪＶＥＴ（ＪｏｉｎｔＶｉｄｅｏＥｘｐｅｒｔｓＴｅａｍ）がＩＳＯ／ＩＥＣとＩＴＵ−Ｔの間で設立され、ＶＶＣ（ＶｅｒｓａｔｉｌｅＶｉｄｅｏＣｏｄｉｎｇ）符号化方式（以下、ＶＶＣ）として標準化が進められている。ＶＶＣでは、１つ以上のスライスを包含するように構成され、矩形を形成するサブピクチャがある。このサブピクチャはピクチャを１つ以上に分割することができ、サブピクチャ毎に独立した符号データとして処理することが可能となっている。 In recent years, as a successor to HEVC, activities to carry out international standardization of more efficient coding methods have been started. JVET (Joint Video Experts Team) was established between ISO / IEC and ITU-T, and is being standardized as a VVC (Versatile Video Coding) coding method (hereinafter referred to as VVC). In VVC, there are subpictures that are configured to contain one or more slices and form a rectangle. This sub-picture can be divided into one or more, and each sub-picture can be processed as independent code data.

ITU-T Rec. H.264 Edition 8.0ITU-T Rec. H.264 Edition 8.0 ITU-T Rec. H.265 Edition4.0 (V4)ITU-T Rec. H.265 Edition4.0 (V4)

しかしながら、多重化されるパラメータ情報には冗長な部分があり、改善の余地がある。 However, the parameter information to be multiplexed has a redundant part, and there is room for improvement.

この課題を解決するため、例えば本発明の画像符号化装置は以下の構成を備える。すなわち、
Ｎ個（Ｎ＞１）の矩形により構成された画像を、各々の矩形が独立して復号できるように符号化する画像符号化装置であって、
画像をＮ個の前記矩形に分割する分割手段と、
前記矩形の位置及び大きさの情報を符号化する符号化手段とを有し、
前記符号化手段は、２番目からＮ−１番目の矩形の位置に関する情報を符号化し、１番目及びＮ番目の矩形の位置に関する情報を符号化しないことを特徴とする。 In order to solve this problem, for example, the image coding apparatus of the present invention has the following configuration. That is,
An image coding device that encodes an image composed of N (N> 1) rectangles so that each rectangle can be independently decoded.
A dividing means for dividing an image into N rectangles, and
It has a coding means for encoding information on the position and size of the rectangle.
The coding means encodes information about the positions of the second to N-1th rectangles and does not encode information about the positions of the first and Nth rectangles.

本発明によれば、パラメータ情報の情報量を、これまでよりも削減することが可能になる。 According to the present invention, the amount of parameter information can be reduced more than before.

ＣＴＵ、タイル、スライスの関係を示す図。The figure which shows the relationship between CTU, tile, and slice. ＣＴＵ、タイル、スライスの関係を示す図。The figure which shows the relationship between CTU, tile, and slice. ＣＴＵ、タイル、サブピクチャの関係を示す図。The figure which shows the relationship between a CTU, a tile, and a subpicture. 実施形態におけるＳＰＳのシンタックスを示す図。The figure which shows the syntax of SPS in an embodiment. 実施形態におけるＳＰＳのシンタックスを示す図。The figure which shows the syntax of SPS in an embodiment. 実施形態におけるＳＰＳのシンタックスを示す図。The figure which shows the syntax of SPS in an embodiment. 実施形態におけるＳＰＳのシンタックスを示す図。The figure which shows the syntax of SPS in an embodiment. 実施形態におけるＳＰＳのシンタックスを示す図。The figure which shows the syntax of SPS in an embodiment. 実施形態におけるＳＰＳのシンタックスを示す図。The figure which shows the syntax of SPS in an embodiment. 実施形態におけるＰＰＳのシンタックスを示す図。The figure which shows the syntax of PPS in an embodiment. 実施形態におけるＰＰＳのシンタックスを示す図。The figure which shows the syntax of PPS in an embodiment. 実施形態におけるＰＰＳのシンタックスを示す図。The figure which shows the syntax of PPS in an embodiment. 実施形態におけるスライスヘッダのシンタックスを示す図。The figure which shows the syntax of the slice header in an embodiment. 実施形態におけるＰＰＳのシンタックスを示す図。The figure which shows the syntax of PPS in an embodiment. 実施形態におけるＰＰＳのシンタックスを示す図。The figure which shows the syntax of PPS in an embodiment. 実施形態が適用する撮像装置（画像符号化装置）のブロック構成図。The block block diagram of the image pickup apparatus (image coding apparatus) to which an embodiment applies. 実施形態が適用する画像復号装置のブロック構成図。The block block diagram of the image decoding apparatus to which an embodiment applies. 実施形態における画像復号装置の構成を示すブロック図。The block diagram which shows the structure of the image decoding apparatus in embodiment. 実施形態における画像符号化装置における画像符号化処理を示すフローチャート。The flowchart which shows the image coding processing in the image coding apparatus in embodiment. 実施形態における画像復号装置における画像復号処理を示すフローチャート。The flowchart which shows the image decoding processing in the image decoding apparatus in embodiment. 実施形態で用いられるビットストリーム構成を示す図。The figure which shows the bit stream structure used in embodiment. 画像の分割の一例を示す図。The figure which shows an example of image division. 実施形態で用いられる画像の分割を示す図。The figure which shows the division of the image used in an embodiment. 実施形態における画像符号化装置の構成を示すブロック図。The block diagram which shows the structure of the image coding apparatus in embodiment.

以下、添付図面を参照して実施形態を詳しく説明する。尚、以下の実施形態は特許請求の範囲に係る発明を限定するものでない。実施形態には複数の特徴が記載されているが、これらの複数の特徴の全てが発明に必須のものとは限らず、また、複数の特徴は任意に組み合わせられてもよい。さらに、添付図面においては、同一若しくは同様の構成に同一の参照番号を付し、重複した説明は省略する。 Hereinafter, embodiments will be described in detail with reference to the accompanying drawings. The following embodiments do not limit the invention according to the claims. Although a plurality of features are described in the embodiment, not all of the plurality of features are essential for the invention, and the plurality of features may be arbitrarily combined. Further, in the attached drawings, the same or similar configurations are given the same reference numbers, and duplicate explanations are omitted.

図７は、実施形態が適用する撮像装置１００のブロック構成を示す図である。撮像装置１００は、ＣＰＵ１０１、メモリ１０２、不揮発性メモリ１０３、操作部１０４、撮像レンズ１１１、撮像部１１２、画像処理部１１３、符号化処理部１１４、表示制御部１１５、表示部１１６、通信制御部１１７、通信部１１８、記録媒体制御部１１９、記録媒体１２０、検出部１４０、及び内部バス１３０を有する。 FIG. 7 is a diagram showing a block configuration of the image pickup apparatus 100 to which the embodiment is applied. The image pickup device 100 includes a CPU 101, a memory 102, a non-volatile memory 103, an operation unit 104, an image pickup lens 111, an image pickup unit 112, an image processing unit 113, a coding processing unit 114, a display control unit 115, a display unit 116, and a communication control unit. It has 117, a communication unit 118, a recording medium control unit 119, a recording medium 120, a detection unit 140, and an internal bus 130.

ＣＰＵ１０１は、不揮発性メモリ１０３に記憶されているコンピュータプログラムを実行することによって、撮像装置１００の各部の動作を制御する。メモリ１０２は、書き換え可能な揮発性メモリ（ＲＡＭ）であり、一時的に撮像装置１００の各部の動作を制御するコンピュータプログラム、各部の動作に関するパラメータ等の情報、通信制御部１１７によって受信した情報等を記憶する。メモリ１０２は、撮像部１１２、画像処理部１１３、符号化処理部１１４等で処理した画像や情報を一時的に記憶するワークメモリとしても機能する。 The CPU 101 controls the operation of each part of the image pickup apparatus 100 by executing a computer program stored in the non-volatile memory 103. The memory 102 is a rewritable volatile memory (RAM), and is a computer program that temporarily controls the operation of each part of the image pickup apparatus 100, information such as parameters related to the operation of each part, information received by the communication control unit 117, and the like. Remember. The memory 102 also functions as a work memory for temporarily storing images and information processed by the image pickup unit 112, the image processing unit 113, the coding processing unit 114, and the like.

不揮発性メモリ１０３は、電気的に消去・記録可能なメモリであり、例えばＥＥＰＲＯＭやＳＤメモリカード等の記憶媒体が用いられる。不揮発性メモリ１０３は、撮像装置１００の各部の動作を制御するコンピュータプログラム及び各部の動作に関するパラメータ等の情報を記憶する。ここでいう、コンピュータプログラムとは、本実施形態にて後述する各種処理を実行するためのプログラムが含まれる。 The non-volatile memory 103 is a memory that can be electrically erased and recorded, and a storage medium such as an EEPROM or an SD memory card is used. The non-volatile memory 103 stores information such as a computer program that controls the operation of each part of the image pickup apparatus 100 and parameters related to the operation of each part. The computer program referred to here includes a program for executing various processes described later in the present embodiment.

操作部１０４は、撮像装置１００を操作するためのユーザインターフェースを提供する。操作部１０４は、撮像装置１００の電源ボタン及びメニューボタン、シャッターボタン、モード切り替えボタン等を有し、各ボタンはスイッチ、タッチパネル等により構成される。ＣＰＵ１０１は、操作部１０４を介して入力されたユーザの指示に従って撮像装置１００を制御する。なお、操作部１０４は、不図示のリモートコントローラから受信したリモコン信号や、不図示の携帯端末から通信制御部１１７を介して通知された要求に応じて撮像装置１００を制御してもよい。 The operation unit 104 provides a user interface for operating the image pickup apparatus 100. The operation unit 104 has a power button, a menu button, a shutter button, a mode switching button, and the like of the image pickup apparatus 100, and each button is composed of a switch, a touch panel, and the like. The CPU 101 controls the image pickup apparatus 100 according to a user's instruction input via the operation unit 104. The operation unit 104 may control the image pickup apparatus 100 in response to a remote control signal received from a remote controller (not shown) or a request notified from a mobile terminal (not shown) via the communication control unit 117.

撮像レンズ１１１は、ズームレンズ、フォーカスレンズを含むレンズ群、レンズ制御部、絞りなどにより構成される。撮像レンズ１１１は図示しないレンズ制御部を有し、ＣＰＵ１０１から送信される制御信号により、焦点の調整や絞り値（Ｆ値）を制御する。撮像部１１２は、被写体の光学像を電気信号に変換する撮像素子を備える。撮像素子は、例えばＣＣＤ（電荷結合素子）やＣＭＯＳ（相補型金属酸化膜半導体）素子等で構成されるエリアイメージセンサである。撮像部１１２は、例えば３０ＦＰＳのフレームレートで撮像し、この撮像で得た画像を画像処理部１１３またはメモリ１０２に出力する。 The image pickup lens 111 is composed of a zoom lens, a lens group including a focus lens, a lens control unit, an aperture, and the like. The image pickup lens 111 has a lens control unit (not shown), and controls focus adjustment and aperture value (F value) by a control signal transmitted from the CPU 101. The image pickup unit 112 includes an image pickup element that converts an optical image of a subject into an electric signal. The image pickup device is an area image sensor composed of, for example, a CCD (charge-coupled device), a CMOS (complementary metal oxide semiconductor) device, or the like. The image pickup unit 112 takes an image at a frame rate of, for example, 30 FPS, and outputs the image obtained by this image pickup to the image processing unit 113 or the memory 102.

画像処理部１１３は、撮像部１１２から出力されるデータ、又は、メモリ１０２から読み出されたデータに対し所定の画素補間、縮小といったリサイズ処理、アスペクト比を合わせるための画像の付与、色変換処理等を行う。また、画像処理部１１３では、撮像した画像データを用いて所定の演算処理が行われ、得られた演算結果に基づいてＣＰＵ１０１が露光制御、測距制御を行う。これにより、ＡＥ（自動露出）処理、ＡＷＢ（オートホワイトバランス）処理、ＡＦ（オートフォーカス）処理が行われる。 The image processing unit 113 performs resizing processing such as predetermined pixel interpolation and reduction with respect to the data output from the imaging unit 112 or the data read from the memory 102, image addition for matching the aspect ratio, and color conversion processing. And so on. Further, in the image processing unit 113, a predetermined calculation process is performed using the captured image data, and the CPU 101 performs exposure control and distance measurement control based on the obtained calculation result. As a result, AE (automatic exposure) processing, AWB (auto white balance) processing, and AF (autofocus) processing are performed.

また、画像処理部１１３では、画像データの位置合わせを行い、位置合わせした複数枚の画像を合成して、広角な合成画像を生成する広角合成を行う。本実施形態における広角合成の詳細は後述する。 Further, the image processing unit 113 aligns the image data, synthesizes a plurality of aligned images, and performs wide-angle composition to generate a wide-angle composite image. Details of wide-angle synthesis in this embodiment will be described later.

符号化処理部１１４は、入力された画像データを、ＣＰＵ１０１からの設定に従って複数の矩形領域に分割し、各領域の画像を圧縮符号化する。実施形態における符号化処理部１１４は、例えばＶＶＣ方式に従って圧縮処理を行うものとする。ただし、後述する点については、現段階において提案されているＶＶＣ方式とは異なる。 The coding processing unit 114 divides the input image data into a plurality of rectangular areas according to the settings from the CPU 101, and compresses and encodes the image in each area. The coding processing unit 114 in the embodiment shall perform compression processing according to, for example, the VVC method. However, the points to be described later are different from the VVC method proposed at this stage.

表示制御部１１５は、表示部１１６を制御するための制御部である。表示制御部１１５は表示部１１６で表示可能な画像になるようにリサイズ処理や色変換処理等を行い、画像信号を表示部１１６に出力する。 The display control unit 115 is a control unit for controlling the display unit 116. The display control unit 115 performs resizing processing, color conversion processing, and the like so that the image can be displayed on the display unit 116, and outputs the image signal to the display unit 116.

表示部１１６は、液晶ディスプレイや有機ＥＬ等で構成されており、表示制御部１１５からの画像データに基づく画像を表示する。なお、表示部１１６の前面にはタッチパネルが設けられ、ユーザからの操作を受け付ける操作部も兼ねる。 The display unit 116 is composed of a liquid crystal display, an organic EL, or the like, and displays an image based on the image data from the display control unit 115. A touch panel is provided on the front surface of the display unit 116, which also serves as an operation unit for receiving operations from the user.

通信制御部１１７は、ＣＰＵ１０１に制御され、ＩＥＥＥ８０２．１１等で予め定められた無線通信規格に適用した変調信号を生成して、通信部１１８へ出力する。また、通信制御部１１７は、通信部１１８より無線通信規格に適用した変調信号を受信して復号することでアナログ信号をデジタル信号変換してＣＰＵ１０１に通知する。また、通信制御部１１７は通信を設定するためのレジスタを持っており、ＣＰＵ１０１から制御されることで通信時の送受信感度を調整し、所定の変調方式で送受信をおこなうことができる。通信部１１８は、通信制御部１１７から送られてくる変調信号を外部へ出力、もしくは外部からの変調信号を受信するアンテナ及び、アナログ回路等により構成される。 The communication control unit 117 is controlled by the CPU 101, generates a modulation signal applied to a wireless communication standard predetermined by 802.11 or the like, and outputs the modulation signal to the communication unit 118. Further, the communication control unit 117 receives the modulation signal applied to the wireless communication standard from the communication unit 118 and decodes it, thereby converting the analog signal into a digital signal and notifying the CPU 101. Further, the communication control unit 117 has a register for setting communication, and by being controlled by the CPU 101, the transmission / reception sensitivity at the time of communication can be adjusted, and transmission / reception can be performed by a predetermined modulation method. The communication unit 118 is composed of an antenna that outputs a modulation signal sent from the communication control unit 117 to the outside or receives a modulation signal from the outside, an analog circuit, and the like.

記録媒体制御部１１９は、記録媒体１２０を制御するための制御部であり、ＣＰＵ１０１から要求を受けて、記録媒体１２０を制御するための制御信号を出力する。記録媒体１２０は、撮像され符号化された画像データを記録するための着脱式、または内蔵式の不揮発性メモリや磁気ディスク等から構成される。なお、ＣＰＵ１０１が記録媒体１２０に記録する場合には、記録媒体のファイルシステムに適応した形式でファイルデータとして保存する。 The recording medium control unit 119 is a control unit for controlling the recording medium 120, and outputs a control signal for controlling the recording medium 120 in response to a request from the CPU 101. The recording medium 120 is composed of a removable or built-in non-volatile memory, a magnetic disk, or the like for recording image data captured and encoded. When the CPU 101 records on the recording medium 120, it is saved as file data in a format suitable for the file system of the recording medium.

検出部１４０は、ＧＰＳセンサ、ジャイロセンサや加速度センサ等を含む。 The detection unit 140 includes a GPS sensor, a gyro sensor, an acceleration sensor, and the like.

内部バス１３０は、ＣＰＵ１０１とメモリ１０２に各処理部がアクセスするための内部バスである。 The internal bus 130 is an internal bus for each processing unit to access the CPU 101 and the memory 102.

上記構成においてＣＰＵ１０１は、符号化処理部１１４を制御し、ピクチャを複数の矩形領域に分割し、各矩形領域の符号化処理を行わせる。また、ＣＰＵ１０１は、各矩形領域を再現するための情報（後述するＳＰＳ等）を生成する。そして、ＣＰＵ１０１は、符号化処理部１１４が生成した画像の符号化データと、ＳＰＳ等の情報とを多重化して、ビットストリームを生成する。そして、ＣＰＵ１０１は、記録媒体制御部１１９を制御し、記録媒体１２０に動画像ファイルとして記録する。 In the above configuration, the CPU 101 controls the coding processing unit 114, divides the picture into a plurality of rectangular areas, and causes the coding processing of each rectangular area to be performed. Further, the CPU 101 generates information (SPS and the like described later) for reproducing each rectangular area. Then, the CPU 101 multiplexes the coded data of the image generated by the coding processing unit 114 and the information such as SPS to generate a bit stream. Then, the CPU 101 controls the recording medium control unit 119 and records the moving image file on the recording medium 120.

次に、実施形態における符号化処理部１１４による符号化処理と矩形領域の分割について説明する。符号化処理部１１４は、ＣＰＵ１０１の制御の下、符号化対象の１ピクチャ（フレーム画像）を１以上のタイル行、１以上のタイル列に分割し、符号化を行う。 Next, the coding process by the coding process unit 114 and the division of the rectangular area in the embodiment will be described. Under the control of the CPU 101, the coding processing unit 114 divides one picture (frame image) to be coded into one or more tile rows and one or more tile columns, and performs coding.

ここで１つのタイルは、複数画素で構成される矩形の領域をカバーするブロック（ＣＴＵ: Coding Tree Unit）の集合体である。また、以下の説明でのスライスとは、複数個のタイルで構成される領域、或いは、１つのタイルにおける１つ以上のＣＴＵ行の領域を言う。 Here, one tile is an aggregate of blocks (CTU: Coding Tree Unit) covering a rectangular area composed of a plurality of pixels. Further, the slice in the following description means an area composed of a plurality of tiles or an area of one or more CTU rows in one tile.

図１、図２は、ＣＴＵ，タイル(Tile)、スライス(Slice）の関係の例である。図１の例では、１ピクチャが４行６列のタイルに分割され、そのタイルを集合的にカバーする３×３の計９つのスライスが定義された例を示している。また、図２は、１ピクチャが２行２列の４個のタイルに分割され、右上のタイルが２つのスライスに分けられ、左側の垂直に隣り合う２つのタイルが１つのスライスを構成し、そして、右下のタイルが１つのスライスとなり、計４つのスライスが定義された例を示している。 1 and 2 are examples of the relationship between CTU, Tile, and Slice. In the example of FIG. 1, one picture is divided into tiles of 4 rows and 6 columns, and a total of 9 slices of 3 × 3 that collectively cover the tiles are defined. Further, in FIG. 2, one picture is divided into four tiles of 2 rows and 2 columns, the upper right tile is divided into two slices, and two vertically adjacent tiles on the left side form one slice. Then, the tile at the lower right becomes one slice, and an example in which a total of four slices are defined is shown.

また、実施形態では、フレーム画像内の矩形領域を集合的にカバーする１以上のスライスから構成され、独立して復号可能な領域をサブピクチャ(subpicture）とする。図３は、ＣＴＵ，タイル、サブピクチャの関係の例を示している。同図では、１ピクチャが３行６列の１８個のタイルに分割され、それぞれのタイルを集合的にカバーする計２４個のスライス及びサブピクチャが設定された例を示している。 Further, in the embodiment, a region composed of one or more slices collectively covering a rectangular region in a frame image and which can be independently decoded is referred to as a subpicture. FIG. 3 shows an example of the relationship between the CTU, tiles, and subpictures. The figure shows an example in which one picture is divided into 18 tiles in 3 rows and 6 columns, and a total of 24 slices and sub-pictures that collectively cover each tile are set.

なお、ピクチャをいくつの、そして、どのようなサイズにサブピクチャに分割するか等の設定は、ＣＰＵ１０１が行うものとする。この設定は、例えば、動画像を記録するに先立って撮像して映像から導出した色分布が、予め登録した色パターンのいずれに最も近いかに基づいて決定するものとする。 It should be noted that the CPU 101 is responsible for setting how many pictures are divided into sub-pictures and what size they are divided into sub-pictures. This setting is determined based on, for example, which of the color patterns registered in advance is the closest to the color distribution derived from the image taken prior to recording the moving image.

ＶＶＣでは、図１乃至図３に示すような、タイル、スライス、サブピクチャを用いたピクチャ分割を実現するためのパラメータ等の情報（復号側にとっては再現するための情報）を、画像の符号化で得たビットストリームのシーケンスのヘッダ部（ＳＰＳ：Sequence Parameter Set）や、ピクチャに対するヘッダ部（ＰＰＳ：Picture Parameter Set）に格納することになっている。 In VVC, information such as parameters for realizing picture division using tiles, slices, and sub-pictures (information for reproduction for the decoding side) as shown in FIGS. 1 to 3 is encoded in the image. It is supposed to be stored in the header part (SPS: Sequence Parameter Set) of the sequence of the bitstream obtained in 1 and the header part (PPS: Picture Parameter Set) for the picture.

例えば、ＳＰＳには、各サブピクチャの左上端を示すｘ、ｙ座標を表すｓｐｓ＿ｓｕｂｐｉｃ＿ｃｔｕ＿ｔｏｐ＿ｌｅｆｔ＿ｘ、ｙ座標ｓｐｓ＿ｓｕｂｐｉｃ＿ｃｔｕ＿ｔｏｐ＿ｌｅｆｔ＿ｙを格納する。しかしながら、例えばピクチャ内の右下端に位置する、最後のサブピクチャの左上の座標は、それ以前の各サブピクチャの座標から算出できる。つまり、最後のサブピクチャに係る位置を示す情報は冗長であると言える。 For example, the SPS stores sps_subpic_ctu_top_left_x and y-coordinates sps_subpic_ctu_top_left_y indicating the upper left end of each sub-picture. However, for example, the coordinates of the upper left of the last sub-picture located at the lower right of the picture can be calculated from the coordinates of each of the previous sub-pictures. That is, it can be said that the information indicating the position related to the last sub-picture is redundant.

そこで、本実施形態のＣＰＵ１０１は、最後のサブピクチャに係るパラメータ情報を省略したＳＰＳを生成し、そのＳＰＳの情報量をこれまでよりも削減する。 Therefore, the CPU 101 of the present embodiment generates an SPS in which the parameter information related to the last sub-picture is omitted, and the amount of information of the SPS is reduced more than before.

図４Ａ〜図４Ｆは、実施形態で生成されるＳＰＳの情報のシンタックスを示している。なお、図４Ａ〜図４Ｆはこの順番に連続させることで、ＳＰＳのシンタックスを表している点に注意されたい。 4A-4F show the syntax of the SPS information generated in the embodiment. It should be noted that FIGS. 4A to 4F represent the syntax of SPS by making them continuous in this order.

図示されている参照符号４０１、４０２の行の記述における“ｓｐｓ＿ｎｕｍ＿ｓｕｂｐｉｃｓ＿ｍｉｎｕｓ１”は、「１ピクチャに含まれるサブピクチャの個数−１」を表している。そして、条件「ｉ＜ｓｐｓ＿ｎｕｍ＿ｓｕｂｐｉｃｓ＿ｍｉｎｕｓ１」を新たに追加することで、１ピクチャに含まれるサブピクチャの個数をＮとしたとき、それよりも１つ少ないＮ−２個のサブピクチャの位置｛ｓｐｓ＿ｓｕｂｐｉｃ＿ｃｔｕ＿ｔｏｐ＿ｌｅｆｔ＿ｘ［ｉ］，ｓｐｓ＿ｓｕｂｐｉｃ＿ｃｔｕ＿ｔｏｐ＿ｌｅｆｔ＿ｙ［ｉ］｝と、サイズ｛ｓｐｓ＿ｓｕｂｐｉｃ＿ｗｉｄｔｈ＿ｍｉｎｕｓ１［ｉ］，ｓｐｓ＿ｓｕｂｐｉｃ＿ｈｉｇｈｔ＿ｍｉｎｕｓ１［ｉ］｝がＳＰＳに含まれるようにした。本実施形態では参照符号４０１及び４０２の行に条件式を追加する構成としたが、本発明はこれに限定されるものではない。例えば４０１の上の行のｆｏｒ文の条件式をｉ＜＝ｓｐｓ＿ｎｕｍ＿ｓｕｂｐｉｃｓ＿ｍｉｎｕｓ１から、ｉ＜ｓｐｓ＿ｎｕｍ＿ｓｕｂｐｉｃｓ＿ｍｉｎｕｓ１に変更してもよい。 "Sps_num_subpics_minus1" in the description of the lines of reference numerals 401 and 402 shown represents "the number of sub-pictures included in one picture-1". Then, by newly adding the condition "i <sps_num_subpics_minus1", when the number of sub-pictures included in one picture is N, the positions of N-2 sub-pictures, which is one less than that, {sps_subpic_ctu_top_left_x [i]. ], Sps_subpic_ctu_top_left_y [i]} and the size {sps_subpic_width_minus1 [i], sps_subpic_hight_minus1 [i]} are included in the SPS. In the present embodiment, the conditional expression is added to the lines of reference numerals 401 and 402, but the present invention is not limited thereto. For example, the conditional expression of the for statement in the line above 401 may be changed from i <= sps_num_subpics_minus1 to i <sps_num_subpics_minus1.

なお、復号側では、図４Ａ〜図４Ｆに示すＳＰＳのシンタックスに従って復号する過程で、最後（Ｎ個目）のサブピクチャの位置やサイズは、それ以前のサブピクチャの復号結果から自動的に求めることができるので、求めた位置に従ってサブピクチャの画像を再現できるので、問題は発生しない。 On the decoding side, in the process of decoding according to the SPS syntax shown in FIGS. 4A to 4F, the position and size of the last (Nth) sub-picture are automatically set from the decoding results of the previous sub-pictures. Since it can be obtained, the image of the sub-picture can be reproduced according to the obtained position, so that no problem occurs.

ＣＴＵの縦及び横の画素数であるＣｔｂＳｉｚｅＹは、１＜＜ＣｔｂＳｉｚｅ（ここで、ＣｔｂＳｉｚｅ＝ｓｐｓ＿ｌｏｇ２＿ｃｔｕ＿ｓｉｚｅ＿ｍｉｎｕｓ５＋５）で示される。画像の水平方向のＣＴＵ数であるＰｉｃＣｔｂＮｕｍＨは、（ｓｐｓ＿ｐｉｃ＿ｗｉｄｔｈ＿ｍａｘ＿ｉｎ＿ｌｕｍａ＿ｓａｍｐｌｅｓ＋ＣｔｂＳｉｚｅＹ−１）＞＞ＣｔｂＬｏｇ２ＳｉｚｅＹで求める事ができる。同様に垂直方向のＣＴＵ数であるＰｉｃＣｔｂＮｕｍＶは、（ｓｐｓ＿ｐｉｃ＿ｈｅｉｇｈｔ＿ｍａｘ＿ｉｎ＿ｌｕｍａ＿ｓａｍｐｌｅｓ＋ＣｔｂＳｉｚｅＹ−１）＞＞ＣｔｂＬｏｇ２ＳｉｚｅＹで求める事ができる。なお、基本ブロック（ＣＴＵ）の縦及び横の画素数であるＣｔｂＳｉｚｅＹは、１＜＜ＣｔｂＳｉｚｅ（ここで、ＣｔｂＬｏｇ２ＳｉｚｅＹ＝ｓｐｓ＿ｌｏｇ２＿ｃｔｕ＿ｓｉｚｅ＿ｍｉｎｕｓ５＋５）で示される。また、以下の説明において、基本ブロックとはＣＴＵであるものとして説明するが必ずしもそれに限られない。また、各実施形態で示す基本ブロックやＣＴＵのサイズについては一例である。各実施形態で示すタイルやスライス等の領域のサイズについても同様に一例である。 CtbSizeY, which is the number of vertical and horizontal pixels of the CTU, is represented by 1 << CtbSize (here, CtbSize = sps_log2_ctu_size_minus5 + 5). PicCtbNumH, which is the number of CTUs in the horizontal direction of the image, can be obtained by (sps_pic_width_max_in_luma_samples + CtbSizeY-1) >> CtbLog2SizeY. Similarly, PicCtbNumV, which is the number of CTUs in the vertical direction, can be obtained by (sps_pic_height_max_in_luma_samples + CtbSizeY-1) >> CtbLog2SizeY. The number of vertical and horizontal pixels of the basic block (CTU), CtbSizeY, is indicated by 1 << CtbSize (here, CtbLog2SizeY = sps_log2_ctu_size_minus5 + 5). Further, in the following description, the basic block will be described as being a CTU, but the basic block is not necessarily limited to that. Moreover, the size of the basic block and the CTU shown in each embodiment is an example. The size of the area such as tiles and slices shown in each embodiment is also an example.

ｓｐｓ＿ｎｕｍ＿ｓｕｂｐｉｃｓ＿ｍｉｎｕｓ１番目（最後）のサブピクチャの左上のＣＴＵの座標の導出に関して説明する。初めにＸ座標の求め方について説明する。まず、最後のサブピクチャの左に隣接するサブピクチャを求める。画像の下端のＣＴＵを含むサブピクチャの中で、サブピクチャの右端ＣＴＵが最大のＸ座標を持つサブピクチャが、最後のサブピクチャの左に隣接するサブピクチャである。まず下記の式が成り立つサブピクチャが下端のＣＴＵを含むサブピクチャである。
ＰｉｃＣｔｂＮｕｍＶ＝＝ｓｐｓ＿ｓｕｂｐｉｃ＿ｃｔｕ＿ｔｏｐ＿ｌｅｆｔ＿ｙ［ｉ］＋ｓｐｓ＿ｓｕｂｐｉｃ＿ｈｅｉｇｈｔ＿ｍｉｎｕｓ１［ｉ］＋１ sps_num_subpics_minus The derivation of the coordinates of the CTU on the upper left of the first (last) subpicture will be described. First, how to obtain the X coordinate will be described. First, the subpicture adjacent to the left of the last subpicture is obtained. Among the sub-pictures including the CTU at the lower end of the image, the sub-picture having the maximum X coordinate at the right end CTU of the sub-picture is the sub-picture adjacent to the left of the last sub-picture. First, the sub-picture for which the following equation holds is a sub-picture including the CTU at the lower end.
PicCtbNumV == sps_subpic_ctu_top_left_y [i] + sps_subpic_height_minus1 [i] + 1

画像内で上記条件を満たす全てのサブピクチャの中で、最も右に位置するサブピクチャ、即ちｓｐｓ＿ｓｕｂｐｉｃ＿ｃｔｕ＿ｔｏｐ＿ｌｅｆｔ＿ｘ［ｉ］が最も大きいサブピクチャが左に隣接するサブピクチャになる。従って、最後のサブピクチャの左に隣接するサブピクチャがＮ番目のサブピクチャだとすると、最後のサブピクチャの左上のＣＴＵのＸ座標は以下の式で求める事ができる。
ｓｐｓ＿ｓｕｂｐｉｃ＿ｃｔｕ＿ｔｏｐ＿ｌｅｆｔ＿ｘ［Ｎ］＋ｓｐｓ＿ｓｕｂｐｉｃ＿ｗｉｄｔｈ＿ｍｉｎｕｓ１［Ｎ］＋１ Among all the sub-pictures satisfying the above conditions in the image, the sub-picture located on the rightmost side, that is, the sub-picture having the largest sps_subpic_ctu_top_left_x [i] is the sub-picture adjacent to the left side. Therefore, assuming that the sub-picture adjacent to the left of the last sub-picture is the Nth sub-picture, the X coordinate of the CTU on the upper left of the last sub-picture can be obtained by the following equation.
sps_subpic_ctu_top_left_x [N] + sps_subpic_width_minus1 [N] + 1

次に、最後のサブピクチャの左上のＣＴＵのＹ座標の求め方について説明する。まず、最後のサブピクチャの上に隣接するサブピクチャを求める。画像の右端のＣＴＵを含むサブピクチャの中で、サブピクチャの下端のＣＴＵが最大のＹ座標を持つサブピクチャが、最後のサブピクチャの上に隣接するサブピクチャである。まず下記の式が成り立つサブピクチャが右端のＣＴＵを含むサブピクチャである。
ＰｉｃＣｔｂＮｕｍＨ＝＝ｓｐｓ＿ｓｕｂｐｉｃ＿ｃｔｕ＿ｔｏｐ＿ｌｅｆｔ＿ｘ［ｉ］＋ｓｐｓ＿ｓｕｂｐｉｃ＿ｗｉｄｔｈ＿ｍｉｎｕｓ１［ｉ］＋１ Next, how to obtain the Y coordinate of the CTU on the upper left of the last subpicture will be described. First, the sub-pictures adjacent to the last sub-picture are obtained. Among the sub-pictures including the CTU at the right end of the image, the sub-picture having the maximum Y coordinate of the CTU at the lower end of the sub-picture is the sub-picture adjacent to the last sub-picture. First, the sub-picture for which the following equation holds is a sub-picture including the CTU at the right end.
PicCtbNumH == sps_subpic_ctu_top_left_x [i] + sps_subpic_width_minus1 [i] + 1

画像内で上記条件を満たす全てのサブピクチャの中で、最も下に位置するサブピクチャ、即ちｓｐｓ＿ｓｕｂｐｉｃ＿ｃｔｕ＿ｔｏｐ＿ｌｅｆｔ＿ｙ［ｉ］が最も大きいサブピクチャが上に隣接するサブピクチャになる。従って、最後のサブピクチャの上に隣接するサブピクチャがＭ番目のサブピクチャだとすると、最後のサブピクチャの左上のＣＴＵのＹ座標は以下の式で求める事ができる。
ｓｐｓ＿ｓｕｂｐｉｃ＿ｃｔｕ＿ｔｏｐ＿ｌｅｆｔ＿ｙ［Ｍ］＋ｓｐｓ＿ｓｕｂｐｉｃ＿ｈｅｉｇｈｔ＿ｍｉｎｕｓ１［Ｍ］＋１ Among all the sub-pictures satisfying the above conditions in the image, the sub-picture located at the bottom, that is, the sub-picture having the largest sps_subpic_ctu_top_left_y [i] is the sub-picture adjacent to the top. Therefore, assuming that the sub-picture adjacent to the last sub-picture is the M-th sub-picture, the Y coordinate of the CTU on the upper left of the last sub-picture can be obtained by the following equation.
sps_subpic_ctu_top_left_y [M] + sps_subpic_height_minus1 [M] + 1

また、これまでの技術におけるＳＰＳ、ＰＰＳ、並びに、スライスヘッダは、１つのピクチャに、２以上のサブピクチャが存在することを許容する記述を採用している。換言すれば、これまでは、１つのピクチャ内に２つ以上のサブピクチャが存在している場合にのみ意味がある情報が含まれていることになる。 Further, the SPS, PPS, and slice header in the conventional techniques employ a description that allows two or more sub-pictures to exist in one picture. In other words, so far, information that is meaningful only when two or more sub-pictures exist in one picture is included.

しかしながら、１ピクチャ内に１つのサブピクチャしか含まれない場合もあり、このようなケースの場合には、ＳＰＳ、ＰＰＳ、並びに、スライスヘッダは冗長な情報を含むことになり、符号量の削減という観点からまだ改善の余地がある。 However, there are cases where only one sub-picture is included in one picture, and in such a case, the SPS, PPS, and slice header contain redundant information, which means that the amount of code is reduced. There is still room for improvement from the point of view.

そこで、本実施形態では、ＳＰＳのシンタックスを示す図４Ａ〜図４Ｆにおける、参照符号４０３、４０４の行は、１ピクチャ内に２以上のサブピクチャが存在する場合にのみ有効であるとして｛｝内のシンタクスが符号化され、１ピクチャ内に１つのサブピクチャのみが存在する場合には｛｝内のシンタクスは符号化されない。 Therefore, in the present embodiment, the lines of reference numerals 403 and 404 in FIGS. 4A to 4F showing the syntax of SPS are considered to be valid only when two or more subpictures are present in one picture {}. If the syntax in is encoded and there is only one subpicture in one picture, the syntax in {} is not encoded.

また、図５Ａの５０１の行の上の行に存在するｐｐｓ＿ｎｏ＿ｐｉｃ＿ｐａｒｔｉｔｉｏｎ＿ｆｌａｇは、画像がサブピクチャ、タイル、スライスに分割されていないか否かを示すフラグである。当該シンタクスが１の時は、画像が単一のサブピクチャ、タイル、スライスで構成されている事を示す。図５Ａ〜図５Ｃ（この順番にＰＰＳのシンタックスを表す）が示すＰＰＳ（Picture Parameter Set）のシンタックスにおける参照符号５０１が示す行、並びに、参照符号５０３が示す行中の条件「＆＆（ｐｐｓ＿ｎｕｍ＿ｓｕｂｐｉｃｓ＿ｍｉｎｕｓ１＞０」は、１ピクチャ内に２以上のサブピクチャが存在する場合にのみ有効であるとして｛｝内のシンタクスが符号化される。そして、１ピクチャ内に１つのサブピクチャのみが存在する場合には｛｝内のシンタクスが符号化されない。 Further, the pps_no_pic_partition_flag present in the line above the line 501 in FIG. 5A is a flag indicating whether or not the image is divided into subpictures, tiles, and slices. When the syntax is 1, it indicates that the image is composed of a single subpicture, tile, or slice. The line indicated by reference numeral 501 in the syntax of PPS (Picture Parameter Set) shown in FIGS. 5A to 5C (in this order representing the syntax of PPS), and the condition "&& (pps_num_subpics_minus1" in the line indicated by reference numeral 503). > 0 ”is encoded as valid only when there are two or more subpictures in one picture, and the syntax in {} is encoded, and when there is only one subpicture in one picture. The syntax in {} is not encoded in.

そして、図６Ａ〜図６Ｃ（この順番にスライスヘッダのシンタックスを表す）が示すスライスヘッダのシンタックスおける参照符号６０１が示す行中の条件「ｓｐｓ＿ｎｕｍ＿ｓｕｂｐｉｃｓ＿ｍｉｎｕｓ１＞０」は、１ピクチャ内に２以上のサブピクチャが存在する場合にのみ有効であるとして｛｝内のシンタクスが符号化される。そして、１ピクチャ内に１つのサブピクチャのみが存在する場合には｛｝内のシンタクスが符号化されない。 Then, the condition "sps_num_subpix_minus1> 0" in the line indicated by the reference code 601 in the syntax of the slice header shown in FIGS. 6A to 6C (in this order, the syntax of the slice header is represented) is two or more in one picture. The syntax in {} is encoded as valid only if a subpicture is present. Then, when only one sub-picture exists in one picture, the syntax in {} is not encoded.

上記の結果、ＣＰＵ１０１は、１ピクチャ内に２以上のサブピクチャが存在する場合のみ、サブピクチャを特定するインデックス番号を符号化する。そして、ＣＰＵ１０１は、１ピクチャ内に１つのサブピクチャしか存在しない場合には、サブピクチャのインデックス番号の符号を生成しなくなり、その分だけ符号量を削減できるようになる。 As a result of the above, the CPU 101 encodes the index number that identifies the sub-picture only when there are two or more sub-pictures in one picture. Then, when only one sub-picture exists in one picture, the CPU 101 does not generate the code of the index number of the sub-picture, and the code amount can be reduced by that amount.

上記の結果、特に、１ピクチャ内に含まれるサブピクチャの数が１つの場合に、ＳＰＳ，ＰＰＳ，スライスヘッダの情報量を削減できることになる。 As a result of the above, the amount of information in the SPS, PPS, and slice header can be reduced, especially when the number of sub-pictures contained in one picture is one.

なお、上記実施形態では、画像符号化装置として撮像装置に適用する例を説明したが、符号化対象の画像を入力もしくは受信し、符号化する装置であれば良いので、撮像装置のみに限定されるものではない。 In the above embodiment, an example of applying to an image pickup device as an image coding device has been described, but the device is limited to the image pickup device as long as it is a device that inputs or receives an image to be coded and encodes it. It's not something.

また、実施形態では、撮像装置が、図７の構成を有するものとして説明したが、同図の幾つかの構成、例えば画像処理部１１３、符号化処理部１１４等は、ＣＰＵ１０１がプログラムを実行することで実現しても構わない。 Further, in the embodiment, the image pickup apparatus has been described as having the configuration of FIG. 7, but the CPU 101 executes a program for some configurations of the figure, such as the image processing unit 113 and the coding processing unit 114. It doesn't matter if it is realized.

次に、図１０、図１２〜図１５を参照して、これまで説明してきた画像符号化装置について、更に詳細に説明する。なお、以下の説明では、撮像部１１２等で行う撮像処理については説明を省略し、主に符号化処理部１１４における処理として説明してきた処理の更なる詳細について説明する。 Next, the image coding apparatus described so far will be described in more detail with reference to FIGS. 10 and 12 to 15. In the following description, the image pickup process performed by the image pickup unit 112 and the like will be omitted, and further details of the process described mainly as the process in the coding process unit 114 will be described.

図１５は本実施形態の画像符号化装置を示すブロック図である。図１５は、符号化処理部１１４等を更に詳細に記した機能ブロック図である。 FIG. 15 is a block diagram showing an image coding apparatus of this embodiment. FIG. 15 is a functional block diagram showing the coding processing unit 114 and the like in more detail.

本装置は、画像分割部８０２、ブロック分割部８０３、予測部８０４、変換・量子化部８０５、逆量子化・逆変換部８０６、画像再生部８０７、フレームメモリ８０８、インループフィルタ部８０９、符号化部８１０、統合符号化部８１１を有する。 This device has an image division unit 802, a block division unit 803, a prediction unit 804, a conversion / quantization unit 805, an inverse quantization / inverse conversion unit 806, an image reproduction unit 807, a frame memory 808, an in-loop filter unit 809, and a code. It has a quantization unit 810 and an integrated coding unit 811.

画像分割部８０２は、入力端子８０１を介して入力した入力画像を一つもしくは複数のタイル行および一つもしくは複数のタイル列に分割する。タイルは画像内の矩形領域を覆う、連続する基本ブロック（ＣＴＵ：Coding Tree Unit）の集合である。画像分割部８０２はさらに、画像をスライスに分割する。スライスは、画像内の一つまたは複数のタイルを包含する形で構成されるか、あるいは一つのタイル内の一つ以上の基本ブロック行で構成される。スライスは符号化の基本単位であり、スライス毎にスライスの種類を示す情報等のヘッダ情報が付加される。画像分割部８０２はさらに、入力画像を一つもしくは複数のサブピクチャを定義する。サブピクチャは複数のスライスの集合（矩形状）で構成される。入力画像を４個のタイル、４個のサブピクチャ、１１個のスライスに分割する例を図１３に示す。左上のタイルは１個のスライス、左下のタイルは２個のスライス、右上のタイルは５個のスライス、右下のタイルは３個のスライスにそれぞれ分割されている。そして左のサブピクチャは３個のスライス、右上のサブピクチャは２個のスライス、右中央のサブピクチャは３個のスライス、右下のスライスは３個のスライスを包含するように構成されている。 The image dividing unit 802 divides the input image input via the input terminal 801 into one or a plurality of tile rows and one or a plurality of tile columns. A tile is a set of continuous basic blocks (CTUs: Coding Tree Units) that cover a rectangular area in an image. The image dividing unit 802 further divides the image into slices. A slice is composed of one or more tiles in an image, or is composed of one or more basic block lines in one tile. A slice is a basic unit of coding, and header information such as information indicating a slice type is added to each slice. The image dividing unit 802 further defines one or more sub-pictures of the input image. A subpicture is composed of a set of multiple slices (rectangular shape). FIG. 13 shows an example of dividing the input image into 4 tiles, 4 subpictures, and 11 slices. The upper left tile is divided into 1 slice, the lower left tile is divided into 2 slices, the upper right tile is divided into 5 slices, and the lower right tile is divided into 3 slices. The left subpicture is configured to contain 3 slices, the upper right subpicture is configured to contain 2 slices, the right center subpicture is configured to contain 3 slices, and the lower right slice is configured to contain 3 slices. ..

ブロック分割部８０３は、画像分割部８０２から出力された基本ブロックで構成される行画像を複数の基本ブロックに分割し、基本ブロック単位の画像を後段に出力する。 The block division unit 803 divides a line image composed of basic blocks output from the image division unit 802 into a plurality of basic blocks, and outputs an image in units of basic blocks to a subsequent stage.

予測部８０４は、基本ブロック単位の画像データに対し、サブブロック分割を決定し、サブブロック単位でフレーム内予測であるイントラ予測やフレーム間予測であるインター予測などを行い、予測画像データを生成する。ここで予測部８０４は、サブピクチャを跨いだイントラ予測、動きベクトルの予測は行われない。さらに、予測部８０４は、入力された画像データと予測画像データから予測誤差を算出し、出力する。また、予測部８０４は、予測に必要な情報、例えばサブブロック分割、予測モードや動きベクトル等の情報も予測誤差と併せて出力する。以下ではこの予測に必要な情報を予測情報と呼称する。 The prediction unit 804 determines sub-block division for the image data in the basic block unit, performs intra-frame prediction such as intra-frame prediction and inter-frame prediction in the sub-block unit, and generates prediction image data. .. Here, the prediction unit 804 does not perform intra-prediction or motion vector prediction across sub-pictures. Further, the prediction unit 804 calculates and outputs a prediction error from the input image data and the predicted image data. Further, the prediction unit 804 also outputs information necessary for prediction, for example, information such as sub-block division, prediction mode, motion vector, and the like together with the prediction error. Hereinafter, the information necessary for this prediction is referred to as prediction information.

変換・量子化部８０５は、予測誤差をサブブロック単位で直交変換して変換係数を得、さらに量子化を行い、量子化係数を生成する。 The conversion / quantization unit 805 orthogonally transforms the prediction error in subblock units to obtain a conversion coefficient, and further performs quantization to generate a quantization coefficient.

逆量子化・逆変換部８０６は、変換・量子化部８０５から出力された量子化係数を逆量子化して変換係数を再生し、さらに逆直交変換して予測誤差を再生する。 The inverse quantization / inverse conversion unit 806 inversely quantizes the quantization coefficient output from the conversion / quantization unit 805 to reproduce the conversion coefficient, and further performs inverse orthogonal conversion to reproduce the prediction error.

画像再生部８０７は、予測部８０４から出力された予測情報に基づいて、フレームメモリ８０８を適宜参照して予測画像データを生成し、これと入力された予測誤差から再生画像データを生成する。画像再生部８０７は、生成した再生画像データをフレームメモリ８０７に再格納する。 Based on the prediction information output from the prediction unit 804, the image reproduction unit 807 generates the prediction image data by appropriately referring to the frame memory 808, and generates the reproduction image data from the input prediction error. The image reproduction unit 807 re-stores the generated reproduced image data in the frame memory 807.

インループフィルタ部８０９は、フレームメモリ８０８に格納された再生画像データに対し、デブロッキングフィルタやサンプルアダプティブオフセットなどのインループフィルタ処理を行い、フィルタ処理後の画像データを、フレームメモリ８０８に再格納する。 The in-loop filter unit 809 performs in-loop filter processing such as a deblocking filter and a sample adaptive offset on the reproduced image data stored in the frame memory 808, and re-stores the filtered image data in the frame memory 808. do.

符号化部８１０は、変換・量子化部８０４から出力された量子化係数および予測部８０４から出力された予測情報を符号化して、符号データを生成し出力する。 The coding unit 810 encodes the quantization coefficient output from the conversion / quantization unit 804 and the prediction information output from the prediction unit 804, and generates and outputs code data.

統合符号化部８１１は、画像分割部８０２から分割情報を受け取り、ヘッダ符号データを生成する。さらに統合符号化部８１１は、符号化部８１０から出力された符号データと合わせて、ビットストリームを形成して、形成したビットストリームを出力端子８１２を介して出力する。 The integrated coding unit 811 receives the division information from the image division unit 802 and generates the header code data. Further, the integrated coding unit 811 forms a bit stream together with the code data output from the coding unit 810, and outputs the formed bit stream via the output terminal 812.

実施形態における画像符号化装置における画像の符号化動作を以下に説明する。本実施形態では動画像データをフレーム単位に入力する構成とするが、１フレーム分の静止画像データを入力する構成としても構わない。また、本実施形態では、説明を容易にするため、イントラ予測符号化の処理のみを説明するが、これに限定されずインター予測符号化の処理においても適用可能である。さらに本実施形態では説明のため、ブロック分割部８０３においては６４×６４画素の基本ブロックに分割するものとして説明するが、これはあくまで例示であり、これに限定されるものではない。 The image coding operation in the image coding device according to the embodiment will be described below. In the present embodiment, the moving image data is input in frame units, but the still image data for one frame may be input. Further, in the present embodiment, for the sake of facilitation of explanation, only the intra-predictive coding process will be described, but the present invention is not limited to this and can be applied to the inter-predictive coding process. Further, in the present embodiment, for the sake of explanation, the block division unit 803 will be described as being divided into basic blocks of 64 × 64 pixels, but this is merely an example and is not limited thereto.

本実施形態では、まず端子８０１から入力された１フレーム分の画像データが画像分割部８０２において、図１４（ａ），（ｂ）のようにタイルとスライスに分割され、サブピクチャが定義される。図１４（ａ）はタイルの分割とＩＤを示す図である。本実施形態では画像データは３８４×３８４画素の９つのタイルに分割される。タイルは左上からラスタ順にＩＤが付与され、左上のタイルＩＤは０、右下のタイルＩＤは８である。 In the present embodiment, first, one frame of image data input from the terminal 801 is divided into tiles and slices in the image dividing unit 802 as shown in FIGS. 14A and 14B, and a sub-picture is defined. .. FIG. 14A is a diagram showing tile division and ID. In this embodiment, the image data is divided into nine tiles of 384 × 384 pixels. The tiles are assigned IDs in raster order from the upper left, the upper left tile ID is 0, and the lower right tile ID is 8.

図１４（ｂ）はタイル、サブピクチャの定義及びスライスの分割例を示す。タイルＩＤ０及びタイルＩＤ７のタイルは更に３８４×１９２画素の２つのスライスに分割される。タイルＩＤ２のタイルは、３８４×１２８、３８４×２５６画素の２つのスライスに分割される。タイルＩＤ３のタイルは３８４×１２８の３つのスライスに分割される。それら以外のタイルは単一のスライスで構成される。各スライスにはラスタ順のタイルの中で上から順にＩＤが付与される。図１４（ｂ）に示すＳＩＤがスライスのＩＤである。サブピクチャは、夫々ＳＩＤ＝０〜２を含むサブピクチャ０、ＳＩＤ＝３を含むサブピクチャ１、ＳＩＤ＝４を含むスサブピクチャ２、ＳＩＤ＝５〜８及び１０〜１２を含むサブピクチャ３、ＳＩＤ＝９及び１３を含むサブピクチャ４と定義される。これらの、タイル、サブピクチャ、スライスの大きさに関する情報は、分割情報として、統合符号化部８１１に送られる。また、各スライスは基本ブロック行単位の画像データである基本ブロック行画像に分割され、ブロック分割部８０３に送られる。 FIG. 14B shows definitions of tiles and subpictures and an example of dividing slices. The tiles with tile ID 0 and tile ID 7 are further divided into two slices of 384 × 192 pixels. The tile with tile ID 2 is divided into two slices of 384 × 128 and 384 × 256 pixels. The tile with tile ID 3 is divided into three slices of 384 × 128. The other tiles consist of a single slice. IDs are assigned to each slice in order from the top in the tiles in raster order. The SID shown in FIG. 14B is the ID of the slice. The sub-pictures are sub-picture 0 including SID = 0 to 2, sub-picture 1 including SID = 3, sub-picture 2 including SID = 4, and sub-picture 3 including SID = 5-8 and 10-12, respectively. It is defined as a sub-picture 4 containing SID = 9 and 13. Information on the sizes of tiles, subpictures, and slices is sent to the integrated coding unit 811 as division information. Further, each slice is divided into basic block line images, which are image data for each basic block line, and sent to the block division unit 803.

ブロック分割部８０３は、入力された基本ブロックで構成される行画像を複数の基本ブロックに分割し、基本ブロック単位の画像を予測部８０４に出力する。本実施形態のブロック分割部８０３は、６４×６４画素の基本ブロック単位の画像を出力する。 The block division unit 803 divides the line image composed of the input basic blocks into a plurality of basic blocks, and outputs the image of each basic block unit to the prediction unit 804. The block division unit 803 of the present embodiment outputs an image in basic block units of 64 × 64 pixels.

予測部８０４は、ブロック分割部８０３から入力された基本ブロック単位の画像データに対し予測処理を実行する。具体的には、基本ブロックをさらに細かいサブブロックに分割するサブブロック分割を決定し、さらにサブブロック単位で水平予測や垂直予測などのイントラ予測モードを決定する。 The prediction unit 804 executes prediction processing on the image data of the basic block unit input from the block division unit 803. Specifically, the sub-block division that divides the basic block into smaller sub-blocks is determined, and the intra-prediction mode such as horizontal prediction or vertical prediction is determined for each sub-block.

予測部８０４は、決定したイントラ予測モードおよび符号化済の画素から予測画像データを生成し、さらに入力された画像データと前記予測画像データの差分である予測誤差を生成し、変換・量子化部８０５に出力する。また、予測部８０４は、サブブロック分割やイントラ予測モードなどの情報は予測情報として、符号化部８１０、画像再生部８０７に供給する。 The prediction unit 804 generates prediction image data from the determined intra prediction mode and encoded pixels, further generates a prediction error which is the difference between the input image data and the prediction image data, and converts / quantizes the prediction unit. Output to 805. Further, the prediction unit 804 supplies information such as sub-block division and intra prediction mode as prediction information to the coding unit 810 and the image reproduction unit 807.

変換・量子化部８０５は、入力された予測誤差に対して、直交変換、及び、量子化を行い、量子化係数を生成する。変換・量子化部８０５は、まずはサブブロックのサイズに対応した直交変換処理が施されて直交変換係数を生成する。次に変換・量子化部８０５は、直交変換係数を量子化し、量子化係数を生成する。そして、変換・量子化部８０５は、生成された量子化係数を符号化部８１０および逆量子化・逆変換部８０６に出力する。 The conversion / quantization unit 805 performs orthogonal transformation and quantization with respect to the input prediction error, and generates a quantization coefficient. The conversion / quantization unit 805 first performs orthogonal conversion processing corresponding to the size of the subblock to generate an orthogonal conversion coefficient. Next, the conversion / quantization unit 805 quantizes the orthogonal transformation coefficient and generates the quantization coefficient. Then, the conversion / quantization unit 805 outputs the generated quantization coefficient to the coding unit 810 and the inverse quantization / inverse conversion unit 806.

逆量子化・逆変換部８０６は、入力された量子化係数を逆量子化して変換係数を再生する。逆量子化・逆変換部８０６は、さらに、再生された変換係数を逆直交変換して予測誤差を再生する。そして、逆量子化・逆変換部８０６は、再生された予測誤差を画像再生部８０７に出力する。 The inverse quantization / inverse conversion unit 806 inversely quantizes the input quantization coefficient and reproduces the conversion coefficient. The inverse quantization / inverse transformation unit 806 further performs inverse orthogonal transformation of the reproduced conversion coefficient to reproduce the prediction error. Then, the inverse quantization / inverse transformation unit 806 outputs the reproduced prediction error to the image reproduction unit 807.

画像再生部８０７は、予測部８０４から入力される予測情報に基づいて、フレームメモリ８０８を適宜参照し、予測画像を再生する。そして、画像再生部８０７は、再生された予測画像と逆量子化・逆変換部８０６から入力した予測誤差から画像データを再生し、再生した画像をフレームメモリ８０８に格納する。 The image reproduction unit 807 appropriately refers to the frame memory 808 based on the prediction information input from the prediction unit 804, and reproduces the predicted image. Then, the image reproduction unit 807 reproduces the image data from the reproduced predicted image and the prediction error input from the inverse quantization / inverse conversion unit 806, and stores the reproduced image in the frame memory 808.

インループフィルタ部８０９は、フレームメモリ８０８から再生画像を読み出し、デブロッキングフィルタなどのインループフィルタ処理を行う。そして、インループフィルタ部８０９は、フィルタ処理された画像をフレームメモリ８０８に再格納する。 The in-loop filter unit 809 reads the reproduced image from the frame memory 808 and performs in-loop filter processing such as a deblocking filter. Then, the in-loop filter unit 809 re-stores the filtered image in the frame memory 808.

符号化部８１０は、ブロック単位で、変換・量子化部８０５で生成された量子化係数、予測部８０４から入力された予測情報をエントロピー符号化し、符号データを生成する。エントロピー符号化の種類は特に問わないが、ゴロム符号化、算術符号化、ハフマン符号化などを用いることができる。符号化部８１０は、生成した符号データを統合符号化部８１１に出力する。 The coding unit 810 entropy-codes the quantization coefficient generated by the conversion / quantization unit 805 and the prediction information input from the prediction unit 804 in block units to generate code data. The type of entropy coding is not particularly limited, but Golomb coding, arithmetic coding, Huffman coding, and the like can be used. The coding unit 810 outputs the generated code data to the integrated coding unit 811.

統合符号化部８１１は、画像分割部８０２より分割情報を受け取り、ヘッダの符号化データを生成する。そして、統合符号化部８１１は、ヘッダの符号化データと、符号化部８１０から受信した符号化データとを多重化してビットストリームを形成する。そして、統合符号化部８１１は、形成したビットストリームを、出力端子８１２を介して外部に出力する。 The integrated coding unit 811 receives the division information from the image division unit 802 and generates the coding data of the header. Then, the integrated coding unit 811 multiplexes the coded data of the header and the coded data received from the coding unit 810 to form a bit stream. Then, the integrated coding unit 811 outputs the formed bit stream to the outside via the output terminal 812.

本画像符号化装置が符号化するＶＶＣによる符号化データのフォーマットを図１２に示す。図１２の符号化データには、まずシーケンスの符号化に関わる情報が含まれたヘッダ情報であるシーケンス・パラメータ・セット（ＳＰＳ）が存在する。さらに、ピクチャの符号化に関わる情報が含まれたヘッダ情報であるピクチャ・パラメータ・セット（ＰＰＳ）、スライスの符号化に関わる情報が含まれたヘッダ情報であるスライスヘッダ及び各スライスの符号化データが続く。 FIG. 12 shows the format of the VVC-encoded data encoded by the image coding apparatus. In the coded data of FIG. 12, first, there is a sequence parameter set (SPS) which is header information including information related to the coding of the sequence. Further, a picture parameter set (PPS) which is header information including information related to picture coding, a slice header which is header information including information related to slice coding, and coding data of each slice. Followed by.

シーケンス・パラメータ・セット（ＳＰＳ）には、画像サイズ情報として、ｓｐｓ＿ｐｉｃ＿ｗｉｄｔｈ＿ｍａｘ＿ｉｎ＿ｌｕｍａ＿ｓａｍｐｌｅｓ及びｓｐｓ＿ｐｉｃ＿ｈｅｉｇｈｔ＿ｍａｘ＿ｉｎ＿ｌｕｍａ＿ｓａｍｐｌｅｓが存在する。これらはそれぞれ、画像の輝度の水平方向の画素数、垂直方向の画素数を表す。本実施形態では図１４の画像を符号化するため、ｓｐｓ＿ｐｉｃ＿ｗｉｄｔｈ＿ｍａｘ＿ｉｎ＿ｌｕｍａ＿ｓａｍｐｌｅｓは“１１５２”、ｓｐｓ＿ｐｉｃ＿ｈｅｉｇｈｔ＿ｍａｘ＿ｉｎ＿ｌｕｍａ＿ｓａｍｐｌｅｓは“１１５２”となる。また、基本ブロック分割情報として、基本ブロックの大きさを示すｓｐｓ＿ｌｏｇ２＿ｃｔｕ＿ｓｉｚｅ＿ｍｉｎｕｓ５が存在する。前述のように、基本ブロックの縦及び横の画素数であるＣｔｂＳｉｚｅＹは、１＜＜ＣｔｂＳｉｚｅ（ここで、ＣｔｂＬｏｇ２ＳｉｚｅＹ＝ｓｐｓ＿ｌｏｇ２＿ｃｔｕ＿ｓｉｚｅ＿ｍｉｎｕｓ５＋５）で示される。画像の水平方向の基本ブロック数であるＰｉｃＣｔｂＮｕｍＨは、（ｓｐｓ＿ｐｉｃ＿ｗｉｄｔｈ＿ｍａｘ＿ｉｎ＿ｌｕｍａ＿ｓａｍｐｌｅｓ＋ＣｔｂＳｉｚｅＹ−１）＞＞ＣｔｂＬｏｇ２ＳｉｚｅＹで求める事ができる。同様に垂直方向の基本ブロック数であるＰｉｃＣｔｂＮｕｍＶは、（ｓｐｓ＿ｐｉｃ＿ｈｅｉｇｈｔ＿ｍａｘ＿ｉｎ＿ｌｕｍａ＿ｓａｍｐｌｅｓ＋ＣｔｂＳｉｚｅＹ−１）＞＞ＣｔｂＬｏｇ２ＳｉｚｅＹで求める事ができる。本実施形態では基本ブロックは６４×６４画素であるため、ｓｐｓ＿ｌｏｇ２＿ｃｔｕ＿ｓｉｚｅ＿ｍｉｎｕｓ５の値は１となる。 In the sequence parameter set (SPS), sps_pic_width_max_in_luma_samples and sps_pic_height_max_in_luma_samples are present as image size information. These represent the number of pixels in the horizontal direction and the number of pixels in the vertical direction of the brightness of the image, respectively. In this embodiment, since the image of FIG. 14 is encoded, sps_pic_width_max_in_luma_samples is "1152" and sps_pic_height_max_in_luma_samples is "1152". Further, as basic block division information, sps_log2_ctu_size_minus5 indicating the size of the basic block exists. As described above, CtbSizeY, which is the number of vertical and horizontal pixels of the basic block, is indicated by 1 << CtbSize (here, CtbLog2SizeY = sps_log2_ctu_size_minus5 + 5). PicCtbNumH, which is the number of basic blocks in the horizontal direction of the image, can be obtained by (sps_pic_width_max_in_luma_samples + CtbSizeY-1) >> CtbLog2SizeY. Similarly, PicCtbNumV, which is the number of basic blocks in the vertical direction, can be obtained by (sps_pic_height_max_in_luma_samples + CtbSizeY-1) >> CtbLog2SizeY. In this embodiment, since the basic block has 64 × 64 pixels, the value of sps_log2_ctu_size_minus5 is 1.

さらにサブピクチャ定義情報Ｓ（ＳＰＳ）として、サブピクチャの定義に関する情報があるか否かを示すｓｐｓ＿ｓｕｂｐｉｃ＿ｉｎｆｏ＿ｐｒｅｓｅｎｔ＿ｆｌａｇが存在する。このｓｐｓ＿ｓｕｂｐｉｃ＿ｉｎｆｏ＿ｐｒｅｓｅｎｔ＿ｆｌａｇが１の時は、サブピクチャの定義に関する情報を統合符号化部８１１が符号化していることを示し、０の時は符号化していないことを示す。このｓｐｓ＿ｓｕｂｐｉｃ＿ｉｎｆｏ＿ｐｒｅｓｅｎｔ＿ｆｌａｇが１の時はサブピクチャ数−１を表すｓｐｓ＿ｎｕｍ＿ｓｕｂｐｉｃｓ＿ｍｉｎｕｓ１を統合符号化部８１１が符号化する。更に統合符号化部８１１は、ｉ番目のサブピクチャの左上の基本ブロックの座標を示すｓｐｓ＿ｓｕｂｐｉｃ＿ｃｔｕ＿ｔｏｐ＿ｌｅｆｔ＿ｘ［ｉ］（Ｘ座標）及びｓｐｓ＿ｓｕｂｐｉｃ＿ｃｔｕ＿ｔｏｐ＿ｌｅｆｔ＿y［ｉ］（Ｙ座標）を符号化する。ただし統合符号化部８１１は、０番目のサブピクチャの座標情報に関しては必ず画像の左上であり、座標が（０、０）である事が確定しているため符号化しない。また、ｓｐｓ＿ｎｕｍ＿ｓｕｂｐｉｃｓ＿ｍｉｎｕｓ１番目（最後＆右下）のサブピクチャの座標も後述の通り求められるため符号化しない。また統合符号化部８１１は、座標と同様にｉ番目のサブピクチャの水平及び垂直方向の基本ブロック数−１を示す、ｓｐｓ＿ｓｕｂｐｉｃ＿ｗｉｄｔｈ＿ｍｉｎｕｓ１［ｉ］及びｓｐｓ＿ｓｕｂｐｉｃ＿ｈｅｉｇｈｔ＿ｍｉｎｕｓ１［ｉ］も符号化する。統合符号化部８１１は、ｓｐｓ＿ｎｕｍ＿ｓｕｂｐｉｃｓ＿ｍｉｎｕｓ１番目のサブピクチャに関しては水平方向及び垂直方向の基本ブロック数は自明であるため符号化しない。 Further, as the sub-picture definition information S (SPS), there is sps_subpic_info_present_flag indicating whether or not there is information regarding the definition of the sub-picture. When sps_subpic_info_present_flag is 1, it indicates that the integrated coding unit 811 encodes the information regarding the definition of the sub-picture, and when it is 0, it indicates that it is not encoded. When the sps_subpic_info_present_flag is 1, the integrated coding unit 811 encodes sps_num_subpics_minus1 representing the number of sub-pictures-1. Further, the integrated coding unit 811 encodes sps_subpic_ctu_top_left_x [i] (X coordinate) and sps_subpic_ctu_top_left_y [i] (Y coordinate) indicating the coordinates of the upper left basic block of the i-th subpicture. However, the integrated coding unit 811 does not encode the coordinate information of the 0th subpicture because it is always in the upper left of the image and it is determined that the coordinates are (0, 0). Further, the coordinates of the first (last & lower right) sub-pictures of sps_num_subpics_minus are also obtained as described later, and are not encoded. The integrated coding unit 811 also encodes sps_subpic_width_minus1 [i] and sps_subpic_height_minus1 [i], which indicate the number of basic blocks-1 in the horizontal and vertical directions of the i-th subpicture as well as the coordinates. The integrated coding unit 811 does not encode the first subpicture of sps_num_subpics_minus because the number of basic blocks in the horizontal and vertical directions is obvious.

本実施形態では５個のサブピクチャを定義するので、ｓｐｓ＿ｓｕｂｐｉｃ＿ｉｎｆｏ＿ｐｒｅｓｅｎｔ＿ｆｌａｇは１になり、ｓｐｓ＿ｎｕｍ＿ｓｕｂｐｉｃｓ＿ｍｉｎｕｓ１は４となる。０番目のサブピクチャは基本ブロック数が１２×６なので、ｓｐｓ＿ｓｕｂｐｉｃ＿ｗｉｄｔｈ＿ｍｉｎｕｓ１［０］は１１、ｓｐｓ＿ｓｕｂｐｉｃ＿ｈｅｉｇｈｔ＿ｍｉｎｕｓ１［０］は５になる。１番目のサブピクチャの左上の基本ブロックの座標は（１２、０）なので、ｓｐｓ＿ｓｕｂｐｉｃ＿ｃｔｕ＿ｔｏｐ＿ｌｅｆｔ＿ｘ［１］は１２、ｓｐｓ＿ｓｕｂｐｉｃ＿ｃｔｕ＿ｔｏｐ＿ｌｅｆｔ＿ｙ［１］は０となる。１番目のサブピクチャは基本ブロック数が６×２なので、ｓｐｓ＿ｓｕｂｐｉｃ＿ｗｉｄｔｈ＿ｍｉｎｕｓ１［１］は５、ｓｐｓ＿ｓｕｂｐｉｃ＿ｈｅｉｇｈｔ＿ｍｉｎｕｓ１［１］は１になる。２番目のサブピクチャの左上の基本ブロックの座標は（１２、２）なので、ｓｐｓ＿ｓｕｂｐｉｃ＿ｃｔｕ＿ｔｏｐ＿ｌｅｆｔ＿ｘ［２］は１２、ｓｐｓ＿ｓｕｂｐｉｃ＿ｃｔｕ＿ｔｏｐ＿ｌｅｆｔ＿ｙ［２］は２となる。２番目のサブピクチャは基本ブロック数が６×４なので、ｓｐｓ＿ｓｕｂｐｉｃ＿ｗｉｄｔｈ＿ｍｉｎｕｓ１［２］は５、ｓｐｓ＿ｓｕｂｐｉｃ＿ｈｅｉｇｈｔ＿ｍｉｎｕｓ１［２］は３になる。３番目のサブピクチャの左上の基本ブロックの座標は（０、６）なので、ｓｐｓ＿ｓｕｂｐｉｃ＿ｃｔｕ＿ｔｏｐ＿ｌｅｆｔ＿ｘ［３］は０、ｓｐｓ＿ｓｕｂｐｉｃ＿ｃｔｕ＿ｔｏｐ＿ｌｅｆｔ＿ｙ［３］は６となる。３番目のサブピクチャは基本ブロック数が１２×１２なので、ｓｐｓ＿ｓｕｂｐｉｃ＿ｗｉｄｔｈ＿ｍｉｎｕｓ１［３］は１１、ｓｐｓ＿ｓｕｂｐｉｃ＿ｈｅｉｇｈｔ＿ｍｉｎｕｓ１［３］は１１になる。４番目のサブピクチャの左上の基本ブロックの座標は（１２、６）、基本ブロック数が６×１２だが、統合符号化部８１１はこれらの情報を符号化しない。 Since five subpictures are defined in this embodiment, sps_subpic_info_present_flag is 1, and sps_num_subpics_minus1 is 4. Since the number of basic blocks of the 0th subpicture is 12 × 6, sps_subpic_width_minus1 [0] is 11, and sps_subpic_height_minus1 [0] is 5. Since the coordinates of the upper left basic block of the first subpicture are (12, 0), sps_subpic_ctu_top_left_x [1] is 12, and sps_subpic_ctu_top_left_y [1] is 0. Since the number of basic blocks of the first subpicture is 6 × 2, sps_subpic_width_minus1 [1] is 5, and sps_subpic_height_minus1 [1] is 1. Since the coordinates of the upper left basic block of the second subpicture are (12, 2), sps_subpic_ctu_top_left_x [2] is 12, and sps_subpic_ctu_top_left_y [2] is 2. Since the number of basic blocks of the second subpicture is 6 × 4, sps_subpic_width_minus1 [2] is 5, and sps_subpic_height_minus1 [2] is 3. Since the coordinates of the upper left basic block of the third subpicture are (0, 6), sps_subpic_ctu_top_left_x [3] is 0, and sps_subpic_ctu_top_left_y [3] is 6. Since the number of basic blocks of the third subpicture is 12 × 12, sps_subpic_width_minus1 [3] is 11, and sps_subpic_height_minus1 [3] is 11. The coordinates of the upper left basic block of the fourth subpicture are (12, 6), and the number of basic blocks is 6 × 12, but the integrated coding unit 811 does not encode this information.

ｓｐｓ＿ｎｕｍ＿ｓｕｂｐｉｃｓ＿ｍｉｎｕｓ１番目（最後）のサブピクチャの左上の基本ブロックの座標の導出に関して説明する。初めにＸ座標の求め方について説明する。まず、最後のサブピクチャの左に隣接するサブピクチャを求める。画像の下端の基本ブロックを含むサブピクチャの中で、サブピクチャの右端基本ブロックが最大のＸ座標を持つサブピクチャが、最後のサブピクチャの左に隣接するサブピクチャである。まず下記の式が成り立つサブピクチャが下端の基本ブロックを含むサブピクチャである。
ＰｉｃＣｔｂＮｕｍＶ＝＝ｓｐｓ＿ｓｕｂｐｉｃ＿ｃｔｕ＿ｔｏｐ＿ｌｅｆｔ＿ｙ［ｉ］＋ｓｐｓ＿ｓｕｂｐｉｃ＿ｈｅｉｇｈｔ＿ｍｉｎｕｓ１［ｉ］＋１ sps_num_subpics_minus The derivation of the coordinates of the upper left basic block of the first (last) subpicture will be described. First, how to obtain the X coordinate will be described. First, the subpicture adjacent to the left of the last subpicture is obtained. Among the sub-pictures including the basic block at the lower end of the image, the sub-picture in which the right-most basic block of the sub-picture has the maximum X coordinate is the sub-picture adjacent to the left of the last sub-picture. First, the sub-picture for which the following formula holds is a sub-picture including the basic block at the lower end.
PicCtbNumV == sps_subpic_ctu_top_left_y [i] + sps_subpic_height_minus1 [i] + 1

画像内で上記条件を満たす全てのサブピクチャの中で、最も右に位置するサブピクチャ、即ちｓｐｓ＿ｓｕｂｐｉｃ＿ｃｔｕ＿ｔｏｐ＿ｌｅｆｔ＿ｘ［ｉ］が最も大きいサブピクチャが左に隣接するサブピクチャになる。従って、最後のサブピクチャの左に隣接するサブピクチャがＮ番目のサブピクチャだとすると、最後のサブピクチャの左上の基本ブロックのＸ座標は以下の式で求める事ができる。
ｓｐｓ＿ｓｕｂｐｉｃ＿ｃｔｕ＿ｔｏｐ＿ｌｅｆｔ＿ｘ［Ｎ］＋ｓｐｓ＿ｓｕｂｐｉｃ＿ｗｉｄｔｈ＿ｍｉｎｕｓ１［Ｎ］＋１ Among all the sub-pictures satisfying the above conditions in the image, the sub-picture located on the rightmost side, that is, the sub-picture having the largest sps_subpic_ctu_top_left_x [i] is the sub-picture adjacent to the left side. Therefore, assuming that the subpicture adjacent to the left of the last subpicture is the Nth subpicture, the X coordinate of the upper left basic block of the last subpicture can be obtained by the following equation.
sps_subpic_ctu_top_left_x [N] + sps_subpic_width_minus1 [N] + 1

本実施形態においては、ＰｉｃＣｔｂＮｕｍＶが１８であり、ｓｐｓ＿ｓｕｂｐｉｃ＿ｃｔｕ＿ｔｏｐ＿ｌｅｆｔ＿ｙ［ｉ］＋ｓｐｓ＿ｓｕｂｐｉｃ＿ｈｅｉｇｈｔ＿ｍｉｎｕｓ１［ｉ］＋１＝＝１８を満たすのは、サブピクチャ３である。ｓｐｓ＿ｓｕｂｐｉｃ＿ｃｔｕ＿ｔｏｐ＿ｌｅｆｔ＿ｘ［３］＋ｓｐｓ＿ｓｕｂｐｉｃ＿ｗｉｄｔｈ＿ｍｉｎｕｓ１［３］＋１＝１２であり、最後のサブピクチャの左上の基本ブロックのＸ座標は１２となる。 In the present embodiment, PicCtbNumV is 18, and it is subpicture 3 that satisfies sps_subpic_ctu_top_left_y [i] + sps_subpic_height_minus1 [i] +1 == 18. sps_subpic_ctu_top_left_x [3] + sps_subpic_width_minus1 [3] + 1 = 12, and the X coordinate of the upper left basic block of the last subpicture is 12.

次に、最後のサブピクチャの左上の基本ブロックのＹ座標の求め方について説明する。まず、最後のサブピクチャの上に隣接するサブピクチャを求める。画像の右端の基本ブロックを含むサブピクチャの中で、サブピクチャの下端の基本ブロックが最大のＹ座標を持つサブピクチャが、最後のサブピクチャの上に隣接するサブピクチャである。まず下記の式が成り立つサブピクチャが、右端の基本ブロックを含むサブピクチャである。
ＰｉｃＣｔｂＮｕｍＨ＝＝ｓｐｓ＿ｓｕｂｐｉｃ＿ｃｔｕ＿ｔｏｐ＿ｌｅｆｔ＿ｘ［ｉ］＋ｓｐｓ＿ｓｕｂｐｉｃ＿ｗｉｄｔｈ＿ｍｉｎｕｓ１［ｉ］＋１ Next, how to obtain the Y coordinate of the basic block on the upper left of the last subpicture will be described. First, the sub-pictures adjacent to the last sub-picture are obtained. Among the sub-pictures including the basic block at the right end of the image, the sub-picture in which the basic block at the lower end of the sub-picture has the maximum Y coordinate is the sub-picture adjacent to the last sub-picture. First, the sub-picture for which the following equation holds is a sub-picture including the rightmost basic block.
PicCtbNumH == sps_subpic_ctu_top_left_x [i] + sps_subpic_width_minus1 [i] + 1

画像内で上記条件を満たす全てのサブピクチャの中で、最も下に位置するサブピクチャ、即ちｓｐｓ＿ｓｕｂｐｉｃ＿ｃｔｕ＿ｔｏｐ＿ｌｅｆｔ＿ｙ［ｉ］が最も大きいサブピクチャが上に隣接するサブピクチャになる。従って、最後のサブピクチャの上に隣接するサブピクチャがＭ番目のサブピクチャだとすると、最後のサブピクチャの左上の基本ブロックのＹ座標は以下の式で求める事ができる。
ｓｐｓ＿ｓｕｂｐｉｃ＿ｃｔｕ＿ｔｏｐ＿ｌｅｆｔ＿ｙ［Ｍ］＋ｓｐｓ＿ｓｕｂｐｉｃ＿ｈｅｉｇｈｔ＿ｍｉｎｕｓ１［Ｍ］＋１ Among all the sub-pictures satisfying the above conditions in the image, the sub-picture located at the bottom, that is, the sub-picture having the largest sps_subpic_ctu_top_left_y [i] is the sub-picture adjacent to the top. Therefore, assuming that the sub-picture adjacent to the last sub-picture is the M-th sub-picture, the Y coordinate of the upper left basic block of the last sub-picture can be obtained by the following equation.
sps_subpic_ctu_top_left_y [M] + sps_subpic_height_minus1 [M] + 1

本実施形態においては、ＰｉｃＣｔｂＮｕｍＨが１８であり、ｓｐｓ＿ｓｕｂｐｉｃ＿ｃｔｕ＿ｔｏｐ＿ｌｅｆｔ＿ｘ［ｉ］＋ｓｐｓ＿ｓｕｂｐｉｃ＿ｗｉｄｔｈ＿ｍｉｎｕｓ１［ｉ］＋１＝＝１８を満たすのは、サブピクチャ１と２である。その中でｓｐｓ＿ｓｕｂｐｉｃ＿ｃｔｕ＿ｔｏｐ＿ｌｅｆｔ＿ｙ［ｉ］が最も大きいのはサブピクチャ２である。ｓｐｓ＿ｓｕｂｐｉｃ＿ｃｔｕ＿ｔｏｐ＿ｌｅｆｔ＿ｙ［２］＋ｓｐｓ＿ｓｕｂｐｉｃ＿ｈｅｉｇｈｔ＿ｍｉｎｕｓ１［２］＋１＝６であり、最後のサブピクチャの左上の基本ブロックのＹ座標は６となる。 In the present embodiment, PicCtbNumH is 18, and it is subpictures 1 and 2 that satisfy sps_subpic_ctu_top_left_x [i] + sps_subpic_width_minus1 [i] +1 == 18. Among them, subpicture 2 has the largest sps_subpic_ctu_top_left_y [i]. sps_subpic_ctu_top_left_y [2] + sps_subpic_height_minus1 [2] + 1 = 6, and the Y coordinate of the upper left basic block of the last subpicture is 6.

このようにして、従来手法では最後のサブピクチャの左上の基本ブロックの座標情報を符号化していたが、それを用いずに他のサブピクチャの情報から最後のサブピクチャの左上の基本ブロックの座標を求める事ができる。これによってビットストリームの符号量を削減する事が可能になる。 In this way, in the conventional method, the coordinate information of the upper left basic block of the last subpicture was encoded, but without using it, the coordinates of the upper left basic block of the last subpicture from the information of other subpictures. Can be asked. This makes it possible to reduce the amount of code in the bitstream.

図１２の符号化データのフォーマットの説明に戻る。統合符号化部８１１は、次にサブピクチャのＩＤに関する情報を符号化する。更に統合符号化部８１１は、ＳＰＳのサブピクチャＩＤのビット長−１を示すｓｐｓ＿ｓｕｂｐｉｃ＿ｉｄ＿ｌｅｎ＿ｍｉｎｕｓ１を符号化する。統合符号化部８１１は、次にサブピクチャのＩＤを明示的に示すか否かを示すｓｐｓ＿ｓｕｂｐｉｃ＿ｉｄ＿ｍａｐｐｉｎｇ＿ｅｘｐｌｉｃｉｔｌｙ＿ｓｉｇｎａｌｅｄ＿ｆｌａｇを符号化する。統合符号化部８１１は、ｓｐｓ＿ｓｕｂｐｉｃ＿ｉｄ＿ｍａｐｐｉｎｇ＿ｅｘｐｌｉｃｉｔｌｙ＿ｓｉｇｎａｌｅｄ＿ｆｌａｇが１の時のみ、ＳＰＳにＩＤを示すか否かを示すｓｐｓ＿ｓｕｂｐｉｃ＿ｉｄ＿ｍａｐｐｉｎｇ＿ｐｒｅｓｅｎｔ＿ｆｌａｇを符号化する。ｓｐｓ＿ｓｕｂｐｉｃ＿ｉｄ＿ｍａｐｐｉｎｇ＿ｐｒｅｓｅｎｔ＿ｆｌａｇが１の時のみ、統合符号化部８１１は、ｉ番目のサブピクチャのＩＤを示すｓｐｓ＿ｓｕｂｐｉｃ＿ｉｄ［ｉ］を符号化する。本実施形態では、統合符号化部８１１は、これらのサブピクチャのＩＤに関する情報を、ｓｐｓ＿ｎｕｍ＿ｓｕｂｐｉｃｓ＿ｍｉｎｕｓ１が１以上の時のみ（サブピクチャの数が２以上の時のみ）符号化する。従来はｓｐｓ＿ｓｕｂｐｉｃ＿ｉｎｆｏ＿ｐｒｅｓｅｎｔ＿ｆｌａｇが１ならばサブピクチャの数が１の時にも符号化していたが、本実施形態ではサブピクチャの数が１の時には、サブピクチャのＩＤを０とする事で、ＩＤに関する以下のシンタクスは符号化しない。
・ｓｐｓ＿ｓｕｂｐｉｃ＿ｉｄ＿ｌｅｎ＿ｍｉｎｕｓ１
・ｓｐｓ＿ｓｕｂｐｉｃ＿ｉｄ＿ｍａｐｐｉｎｇ＿ｅｘｐｌｉｃｉｔｌｙ＿ｓｉｇｎａｌｅｄ＿ｆｌａｇ
・ｓｐｓ＿ｓｕｂｐｉｃ＿ｉｄ＿ｍａｐｐｉｎｇ＿ｐｒｅｓｅｎｔ＿ｆｌａｇ
・ｓｐｓ＿ｓｕｂｐｉｃ＿ｉｄ Returning to the description of the coded data format of FIG. The integrated coding unit 811 then encodes the information regarding the ID of the sub-picture. Further, the integrated coding unit 811 encodes sps_subpic_id_len_minus1 indicating the bit length -1 of the sub-picture ID of the SPS. The integrated coding unit 811 then encodes sps_subpic_id_mapping_explicity_signaled_flag indicating whether or not to explicitly indicate the ID of the sub-picture. The integrated coding unit 811 encodes sps_subpic_id_mapping_present_flag indicating whether or not to indicate an ID to SPS only when sps_subpic_id_mapping_explicity_signed_flag is 1. Only when sps_subpic_id_mapping_present_flag is 1, the integrated coding unit 811 encodes sps_subpic_id [i] indicating the ID of the i-th subpicture. In the present embodiment, the integrated coding unit 811 encodes the information regarding the IDs of these sub-pictures only when sps_num_subpics_minus1 is 1 or more (only when the number of sub-pictures is 2 or more). Conventionally, if sps_subpic_info_present_flag is 1, encoding is performed even when the number of subpictures is 1, but in the present embodiment, when the number of subpictures is 1, the ID of the subpicture is set to 0, and the following regarding the ID The syntax is not encoded.
・ Sps_subpic_id_len_minus1
・ Sps_subpic_id_mapping_explicity_signaled_flag
・ Sps_subpic_id_mapping_present_flag
・ Sps_subpic_id

これによってサブピクチャの数が１の時に符号量を削減する事ができる。また、本実施形態では、サブピクチャの数が１の時にはサブピクチャのＩＤを０としたが、本発明はこれに限定されるものではない。例えば１でもよい。また統合符号化部８１１は、ｓｐｓ＿ｓｕｂｐｉｃ＿ｉｄ＿ｍａｐｐｉｎｇ＿ｅｘｐｌｉｃｉｔｌｙ＿ｓｉｇｎａｌｅｄ＿ｆｌａｇはサブピクチャの数に関係なく常に符号化しなくてもよい。 As a result, the amount of code can be reduced when the number of sub-pictures is 1. Further, in the present embodiment, when the number of sub-pictures is 1, the ID of the sub-pictures is set to 0, but the present invention is not limited to this. For example, it may be 1. Further, the integrated coding unit 811 does not have to always encode sps_subpic_id_mapping_explicity_signed_flag regardless of the number of subpictures.

本実施形態のサブピクチャは５個なので、ビット長は３ビットで表す事ができる。そのためｓｐｓ＿ｓｕｂｐｉｃ＿ｉｄ＿ｌｅｎ＿ｍｉｎｕｓ１は３から１を減じて２となる。本実施形態では、ｓｐｓ＿ｓｕｂｐｉｃ＿ｉｄ＿ｍａｐｐｉｎｇ＿ｅｘｐｌｉｃｉｔｌｙ＿ｓｉｇｎａｌｅｄ＿ｆｌａｇは０とする。 Since there are five sub-pictures in this embodiment, the bit length can be represented by 3 bits. Therefore, sps_subpic_id_len_minus1 is obtained by subtracting 1 from 3 to become 2. In this embodiment, sps_subpic_id_mapping_explicity_signaled_flag is set to 0.

統合符号化部８１１は、ピクチャ・パラメータ・セットにおいて、画像が分割されているか否かを示すｐｐｓ＿ｎｏ＿ｐｉｃ＿ｐａｒｔｉｔｉｏｎ＿ｆｌａｇを符号化する。ｐｐｓ＿ｎｏ＿ｐｉｃ＿ｐａｒｔｉｔｉｏｎ＿ｆｌａｇが１の時は、画像が複数のタイル、スライス、サブピクチャに分割されておらずそれぞれ単一のタイル、スライス、サブピクチャであることを示す。ｐｐｓ＿ｎｏ＿ｐｉｃ＿ｐａｒｔｉｔｉｏｎ＿ｆｌａｇが０の時のみ、タイル分割情報、サブピクチャ定義情報Ｐ（ＰＰＳ）、スライス分割情報Ｐ（ＰＰＳ）が含まれる。統合符号化部８１１は、サブピクチャ定義情報Ｐとして、まずＰＰＳにサブピクチャのＩＤを明示的に示すか否かを示すｐｐｓ＿ｓｕｂｐｉｃ＿ｉｄ＿ｍａｐｐｉｎｇ＿ｐｒｅｓｅｎｔ＿ｆｌａｇを符号化する。ｐｐｓ＿ｓｕｂｐｉｃ＿ｉｄ＿ｍａｐｐｉｎｇ＿ｐｒｅｓｅｎｔ＿ｆｌａｇが１の時のみ、統合符号化部８１１は以下の３種類のシンタクスを符号化する。
・サブピクチャの数−１を示すｐｐｓ＿ｎｕｍ＿ｓｕｂｐｉｃｓ＿ｍｉｎｕｓ１
・サブピクチャＩＤのビット長−１を示すｐｐｓ＿ｓｕｂｐｉｃ＿ｉｄ＿ｌｅｎ＿ｍｉｎｕｓ１
・ｉ番目のサブピクチャのＩＤを示すｐｐｓ＿ｓｕｂｐｉｃ＿ｉｄ［ｉ］ The integrated coding unit 811 encodes pps_no_pic_partition_flag indicating whether or not the image is divided in the picture parameter set. When pps_no_pic_partition_flag is 1, it means that the image is not divided into a plurality of tiles, slices, and subpictures, and is a single tile, slice, and subpicture, respectively. Only when pps_no_pic_partition_flag is 0, tile division information, sub-picture definition information P (PPS), and slice division information P (PPS) are included. As the sub-picture definition information P, the integrated coding unit 811 first encodes pps_subpic_id_mapping_present_flag indicating whether or not the ID of the sub-picture is explicitly indicated on the PPS. Only when pps_subpic_id_mapping_present_flag is 1, the integrated coding unit 811 encodes the following three types of syntax.
-Pps_num_subpics_minus1 indicating the number of sub-pictures-1
Pps_subpic_id_len_minus1 indicating the bit length -1 of the sub-picture ID.
-Pps_subpic_id [i] indicating the ID of the i-th subpicture.

従来手法ではｐｐｓ＿ｓｕｂｐｉｃ＿ｉｄ＿ｍａｐｐｉｎｇ＿ｐｒｅｓｅｎｔ＿ｆｌａｇは常に符号化されるが、本実施形態ではｐｐｓ＿ｎｏ＿ｐｉｃ＿ｐａｒｔｉｔｉｏｎ＿ｆｌａｇが０の時のみ符号化している。これにより画像が分割されていない際の符号量を削減する事ができる。 In the conventional method, pps_subpic_id_mapping_present_flag is always encoded, but in this embodiment, it is encoded only when pps_no_pic_partition_flag is 0. This makes it possible to reduce the amount of code when the image is not divided.

本実施形態では画像が分割されているためｐｐｓ＿ｎｏ＿ｐｉｃ＿ｐａｒｔｉｔｉｏｎ＿ｆｌａｇは０である。またｐｐｓ＿ｓｕｂｐｉｃ＿ｉｄ＿ｍａｐｐｉｎｇ＿ｐｒｅｓｅｎｔ＿ｆｌａｇは０とする。 In this embodiment, since the image is divided, pps_no_pic_partition_flag is 0. Further, pps_subpic_id_mapping_present_flag is set to 0.

図１２のフォーマットの説明に戻る。統合符号化部８１１は、スライス分割情報Ｐとして、ｐｐｓ＿ｎｕｍ＿ｓｕｂｐｉｃｓ＿ｍｉｎｕｓ１が存在しない、あるいは１以上の時のみ、ｐｐｓ＿ｓｉｎｇｌｅ＿ｓｌｉｃｅ＿ｐｅｒ＿ｓｕｂｐｉｃ＿ｆｌａｇを符号化する。ｐｐｓ＿ｓｉｎｇｌｅ＿ｓｌｉｃｅ＿ｐｅｒ＿ｓｕｂｐｉｃ＿ｆｌａｇは、画像内の全てのサブピクチャが単一のスライスで符号化されているか否かを示す。本実施形態では複数のスライスを包含するサブピクチャが存在するため、ｐｐｓ＿ｓｉｎｇｌｅ＿ｓｌｉｃｅ＿ｐｅｒ＿ｓｕｂｐｉｃ＿ｆｌａｇは０である。しかしｐｐｓ＿ｎｕｍ＿ｓｕｂｐｉｃｓ＿ｍｉｎｕｓ１が存在し、かつ１以上の時のみ統合符号化部８１１がｐｐｓ＿ｓｉｎｇｌｅ＿ｓｌｉｃｅ＿ｐｅｒ＿ｓｕｂｐｉｃ＿ｆｌａｇを符号化する構成となっている。そのため、ｐｐｓ＿ｎｕｍ＿ｓｕｂｐｉｃｓ＿ｍｉｎｕｓ１が存在し、かつ１以上の時には符号量の削減が可能になる。 Returning to the description of the format of FIG. The integrated coding unit 811 encodes pps_single_slice_per_subpic_flag only when pps_num_subpics_minus1 does not exist or is 1 or more as the slice division information P. pps_single_slice_per_subpic_flag indicates whether all subpictures in the image are encoded in a single slice. In this embodiment, since there is a subpicture containing a plurality of slices, pps_single_slice_per_subpic_flag is 0. However, pps_num_subpics_minus1 exists, and the integrated coding unit 811 encodes pps_single_slice_per_subpic_flag only when it is 1 or more. Therefore, when pps_num_subpics_minus1 exists and is 1 or more, the code amount can be reduced.

スライスヘッダにはスライス分割情報ＳＨなどが含まれ、その後に各スライスの符号化データが続く。統合符号化部８１１は、スライス分割情報ＳＨとして、サブピクチャの数が２以上の時のみ、当該スライスのサブピクチャＩＤを示すｓｈ＿ｓｕｂｐｉｃ＿ｉｄを符号化する。従来はｓｐｓ＿ｓｕｂｐｉｃ＿ｉｎｆｏ＿ｐｒｅｓｅｎｔ＿ｆｌａｇが１ならばサブピクチャの数が１の時にも符号化していたが、本実施形態ではサブピクチャの数が１の時には、サブピクチャのＩＤを０としているため符号化する必要がない。これによって符号量を削減する事ができる。 The slice header includes slice division information SH and the like, followed by the coded data of each slice. The integrated coding unit 811 encodes sh_subpic_id indicating the sub-picture ID of the slice only when the number of sub-pictures is 2 or more as the slice division information SH. Conventionally, if sps_subpic_info_present_flag is 1, encoding is performed even when the number of subpictures is 1, but in this embodiment, when the number of subpictures is 1, the ID of the subpicture is 0, so there is no need to encode. .. This makes it possible to reduce the amount of code.

本実施形態の場合、ＳＩＤ＝０〜２のスライスのｓｈ＿ｓｕｂｐｉｃ＿ｉｄは０である。ＳＩＤ＝３のスライスのｓｈ＿ｓｕｂｐｉｃ＿ｉｄは１である。ＳＩＤ＝４のスライスのｓｈ＿ｓｕｂｐｉｃ＿ｉｄは２である。ＳＩＤ＝５〜８及びＳＩＤ＝１０〜１２のスライスのｓｈ＿ｓｕｂｐｉｃ＿ｉｄは３である。ＳＩＤ＝９及び１３のスライスのｓｈ＿ｓｕｂｐｉｃ＿ｉｄは４である。 In the case of this embodiment, the sh_subpic_id of the slice with SID = 0 to 2 is 0. The sh_subpic_id of the slice with SID = 3 is 1. The sh_subpic_id of the slice with SID = 4 is 2. The sh_subpic_id of slices with SID = 5-8 and SID = 10-12 is 3. The sh_subpic_id of slices with SID = 9 and 13 is 4.

図１０は、実施形態に係る画像符号化装置における１フレームの画像に対する符号化処理を示すフローチャートである。 FIG. 10 is a flowchart showing a coding process for a one-frame image in the image coding apparatus according to the embodiment.

まず、Ｓ１００１にて、画像分割部８０２は、前述のように画像をタイル、スライスに分割し、サブピクチャを定義し、分割情報を統合符号化部８１１に送る。Ｓ１００９にて、統合符号化部８１１は、分割情報をヘッダ情報に変換した上で、ビットストリームに符号化する。また画像分割部８０２は、画像を基本ブロック行画像に分割してブロック分割部８０３に送る。 First, in S1001, the image division unit 802 divides the image into tiles and slices as described above, defines subpictures, and sends the division information to the integrated coding unit 811. In S1009, the integrated coding unit 811 converts the division information into header information and then encodes it into a bit stream. Further, the image dividing unit 802 divides the image into basic block line images and sends the image to the block dividing unit 803.

Ｓ１００２にて、ブロック分割部８０３は基本ブロックで構成される行画像を基本ブロック単位に分割する。 In S1002, the block division unit 803 divides a line image composed of basic blocks into basic block units.

Ｓ１００３にて、予測部８０４は、Ｓ１００２にて生成された基本ブロック単位の画像データに対して、予測処理を実行する。また、予測部８０４は、サブブロック分割情報やイントラ予測モードなどの予測情報および予測画像データを生成する。さらに予測部８０４は、入力された画像データと予測画像データから予測誤差を算出する。 In S1003, the prediction unit 804 executes a prediction process on the image data in basic block units generated in S1002. Further, the prediction unit 804 generates prediction information such as sub-block division information and intra prediction mode, and prediction image data. Further, the prediction unit 804 calculates a prediction error from the input image data and the prediction image data.

Ｓ１００４にて、変換・量子化部８０５は、Ｓ１００３で算出された予測誤差を直交変換して変換係数を生成する。そして、変換・量子化部８０５は、生成した変換係数に対して量子化を行い、量子化係数を生成する。 In S1004, the conversion / quantization unit 805 orthogonally converts the prediction error calculated in S1003 to generate a conversion coefficient. Then, the conversion / quantization unit 805 quantizes the generated conversion coefficient to generate the quantization coefficient.

Ｓ１００５にて、逆量子化・逆変換部８０６は、Ｓ１００４で生成された量子化係数を、逆量子化を行い、変換係数を再生する。逆量子化・逆変換部８０６は、更に再生した変換係数に対して逆直交変換し、予測誤差を再生する。 In S1005, the inverse quantization / inverse conversion unit 806 performs inverse quantization on the quantization coefficient generated in S1004 and reproduces the conversion coefficient. The inverse quantization / inverse transformation unit 806 further performs inverse orthogonal transformation with respect to the reproduced conversion coefficient, and reproduces the prediction error.

Ｓ１００６にて、画像再生部８０７は、Ｓ１００３で生成された予測情報に基づいて予測画像を再生する。画像再生部８０７は、さらに、再生された予測画像とＳ１００５で生成された予測誤差から画像データを再生する。 In S1006, the image reproduction unit 807 reproduces the predicted image based on the predicted information generated in S1003. The image reproduction unit 807 further reproduces the image data from the reproduced predicted image and the prediction error generated in S1005.

Ｓ１００７にて、符号化部８１０は、Ｓ１００３で生成された予測情報およびＳ１００４で生成された量子化係数を符号化し、符号データを生成する。 In S1007, the coding unit 810 encodes the prediction information generated in S1003 and the quantization coefficient generated in S1004 to generate code data.

Ｓ１００８にて、画像符号化装置は、スライス内の全ての基本ブロックの符号化が終了したか否かの判定を行い、終了していれば処理をＳ１００９に進め、そうでなければ次の基本ブロックを対象とするため、処理をＳ１００２に戻す。 In S1008, the image coding apparatus determines whether or not the coding of all the basic blocks in the slice is completed, and if it is completed, the process proceeds to S1009, and if not, the next basic block. The process is returned to S1002 in order to target.

Ｓ１００９にて、統合符号化部８１１は、画像分割部８０２から送られた分割情報に基づいて、ヘッダ情報を生成し、符号化する。 In S1009, the integrated coding unit 811 generates and encodes the header information based on the division information sent from the image division unit 802.

以下これらサブピクチャの情報を元にサブピクチャを構成するスライスの符号データを接合し、サブピクチャの符号データを作成する。 Hereinafter, based on the information of these sub-pictures, the code data of the slices constituting the sub-picture are joined to create the code data of the sub-picture.

Ｓ１０１０にて、画像符号化装置は、フレーム内の全ての基本ブロックの符号化が終了したか否かの判定を行い、終了していればＳ１０１１に処理を進め、そうでなければ次の基本ブロックを符号化するため、処理をＳ１００２に戻す。 In S1010, the image coding apparatus determines whether or not the coding of all the basic blocks in the frame is completed, and if it is completed, the process proceeds to S1011, otherwise the next basic block. Is returned to S1002 in order to encode.

Ｓ１０１１にて、インループフィルタ部８０９は、Ｓ１００６で再生された画像データに対し、インループフィルタ処理を行い、フィルタ処理された画像を生成し、処理を終了する。 In S1011, the in-loop filter unit 809 performs in-loop filter processing on the image data reproduced in S1006, generates a filtered image, and ends the processing.

以上の構成と動作により、特にＳ１００９において、複数のサブピクチャが定義された場合に、最後のサブピクチャの左上の座標を符号化しないことで、サブピクチャに関するシンタクスを効率よく符号化する事ができる。 With the above configuration and operation, especially in S1009, when a plurality of subpictures are defined, the syntax related to the subpictures can be efficiently encoded by not encoding the upper left coordinates of the last subpicture. ..

なお本実施形態では、画像を９個のタイル、５個のサブピクチャ、１４個のスライスで構成されるものとしたが、これに限定されない。画像は如何様に分割してもよいし、分割しなくてもよい。例えば複数のサブピクチャを定義せず、更に複数のタイル、スライスに分割せず、画像が単一のサブピクチャ、タイル、スライスで構成されてもよい。その場合は、ｓｐｓ＿ｓｕｂｐｉｃ＿ｉｎｆｏ＿ｐｒｅｓｅｎｔ＿ｆｌａｇが１であっても、ｓｐｓ＿ｎｕｍ＿ｓｕｂｐｉｃｓ＿ｍｉｎｕｓ１が０となり、以下のシーケンス・パラメータ・セット及びスライスヘッダのシンタクスを符号化する必要がない。
・ｓｐｓ＿ｓｕｂｐｉｃ＿ｉｄ＿ｌｅｎ＿ｍｉｎｕｓ１
・ｓｐｓ＿ｓｕｂｐｉｃ＿ｉｄ＿ｍａｐｐｉｎｇ＿ｅｘｐｌｉｃｉｔｌｙ＿ｓｉｇｎａｌｅｄ＿ｆｌａｇ
・ｓｐｓ＿ｓｕｂｐｉｃ＿ｉｄ＿ｍａｐｐｉｎｇ＿ｐｒｅｓｅｎｔ＿ｆｌａｇ
・ｓｐｓ＿ｓｕｂｐｉｃ＿ｉｄ
・ｓｈ＿ｓｕｂｐｉｃ＿ｉｄ In the present embodiment, the image is composed of 9 tiles, 5 subpictures, and 14 slices, but the image is not limited to this. The image may or may not be divided in any way. For example, an image may be composed of a single subpicture, tile, or slice without defining a plurality of subpictures and further dividing the image into a plurality of tiles or slices. In that case, even if sps_subpic_info_present_flag is 1, sps_num_subpics_minus1 becomes 0, and it is not necessary to encode the following sequence parameter set and slice header syntax.
・ Sps_subpic_id_len_minus1
・ Sps_subpic_id_mapping_explicity_signaled_flag
・ Sps_subpic_id_mapping_present_flag
・ Sps_subpic_id
・ Sh_subpic_id

同様に画像が分割されていないため、ｐｐｓ＿ｎｏ＿ｐｉｃ＿ｐａｒｔｉｔｉｏｎ＿ｆｌａｇが１となり、以下のピクチャ・パラメータ・セットのシンタクスを符号化する必要がない。
・ｐｐｓ＿ｓｕｂｐｉｃ＿ｉｄ＿ｍａｐｐｉｎｇ＿ｐｒｅｓｅｎｔ＿ｆｌａｇ
・ｐｐｓ＿ｎｕｍ＿ｓｕｂｐｉｃｓ＿ｍｉｎｕｓ１
・ｐｐｓ＿ｓｕｂｐｉｃ＿ｉｄ＿ｌｅｎ＿ｍｉｎｕｓ１
・ｐｐｓ＿ｓｕｂｐｉｃ＿ｉｄ
以上により不要なシンタクスを符号化しないため従来よりも符号量の削減が可能となる。 Similarly, since the image is not divided, pps_no_pic_partition_flag is 1, and it is not necessary to encode the syntax of the following picture parameter set.
・ Pps_subpic_id_mapping_present_flag
・ Pps_num_subpics_minus1
・ Pps_subpic_id_len_minus1
・ Pps_subpic_id
As a result, unnecessary syntax is not encoded, so that the amount of code can be reduced as compared with the conventional case.

次に、これまで説明してきた画像符号化装置（撮像装置１００）で生成されたビットストリームを取得して、それを復号する画像復号装置に２００ついて説明する。画像復号装置２００は、基本的に、これまで説明してきた画像符号化装置（撮像装置１００）の逆の動作を行うこととなる。 Next, 200 will be described for an image decoding device that acquires a bit stream generated by the image coding device (imaging device 100) described so far and decodes the bit stream. The image decoding device 200 basically performs the reverse operation of the image coding device (imaging device 100) described so far.

図８は、本実施形態に係る画像復号装置２００の構成を示すブロック図である。本実施形態における画像復号装置２００は、画像符号化装置（撮像装置１００）で符号化された符号化データ（ビットストリーム）を取得する取得部２１８と、取得されたビットストリームを復号する復号処理部２１４と、ＣＰＵ２０１と、メモリ２０２と、不揮発性メモリ２０３と、内部バス２３０とを有する。 FIG. 8 is a block diagram showing the configuration of the image decoding device 200 according to the present embodiment. The image decoding device 200 in the present embodiment has an acquisition unit 218 for acquiring coded data (bitstream) encoded by the image coding device (imaging device 100) and a decoding processing unit for decoding the acquired bitstream. It has 214, a CPU 201, a memory 202, a non-volatile memory 203, and an internal bus 230.

取得部２１８は、前述のように、画像符号化装置（撮像装置１００）で符号化された符号化データ（ビットストリーム）を取得する。取得部２１８は、例えば、ネットワークを介して符号化データを取得することもできる。 As described above, the acquisition unit 218 acquires the coded data (bit stream) encoded by the image coding device (imaging device 100). The acquisition unit 218 can also acquire the coded data via the network, for example.

ＣＰＵ２０１は、不揮発性メモリ２０３に記憶されているコンピュータプログラムを実行することによって、画像復号装置２００の各部の動作を制御する。メモリ２０２は、書き換え可能な揮発性メモリ（ＲＡＭ）であり、一時的に画像復号装置２００の各部の動作を制御するコンピュータプログラムや、各部の動作に関するパラメータ等の情報、取得部２１８によって取得したデータ等を記憶する。メモリ２０２は、各部で処理した画像や情報を一時的に記憶するワークメモリとしても機能する。 The CPU 201 controls the operation of each part of the image decoding device 200 by executing a computer program stored in the non-volatile memory 203. The memory 202 is a rewritable volatile memory (RAM), and is a computer program that temporarily controls the operation of each part of the image decoding device 200, information such as parameters related to the operation of each part, and data acquired by the acquisition unit 218. And so on. The memory 202 also functions as a work memory for temporarily storing images and information processed by each unit.

不揮発性メモリ２０３は、電気的に消去・記録可能なメモリであり、例えばＥＥＰＲＯＭやＳＤメモリカード等の記憶媒体が用いられる。不揮発性メモリ２０３は、画像復号装置２００の各部の動作を制御するコンピュータプログラム及び各部の動作に関するパラメータ等の情報を記憶する。ここでいう、コンピュータプログラムとは、本実施形態にて後述する各種処理を実行するためのプログラムが含まれる。 The non-volatile memory 203 is a memory that can be electrically erased and recorded, and a storage medium such as an EEPROM or an SD memory card is used. The non-volatile memory 203 stores information such as a computer program that controls the operation of each part of the image decoding device 200 and parameters related to the operation of each part. The computer program referred to here includes a program for executing various processes described later in the present embodiment.

内部バス２３０は、ＣＰＵ２０１とメモリ２０２に各処理部がアクセスするための内部バスである。 The internal bus 230 is an internal bus for each processing unit to access the CPU 201 and the memory 202.

前述のように、取得部２１８によって取得されるビットストリームは、ピクチャを複数の矩形領域に分割することによって符号化されている。ビットストリームには、各矩形領域を再現するための情報（後述するＳＰＳ等）が含まれている。復号処理部２１４は、符号化データから、画像自体の符号化データと、ＳＰＳ等の情報とを分離して、これらを復号する。そして復号したそれらのデータに基づいてピクチャを再生する。 As described above, the bitstream acquired by the acquisition unit 218 is encoded by dividing the picture into a plurality of rectangular areas. The bit stream contains information for reproducing each rectangular area (SPS and the like described later). The decoding processing unit 214 separates the coded data of the image itself and the information such as SPS from the coded data, and decodes them. Then, the picture is reproduced based on the decoded data.

復号処理部２１４は、例えばＶＶＣ方式に従って復号処理を行うものとする。ただし、後述する点については、現段階において提案されているＶＶＣ方式とは異なる。 The decoding processing unit 214 shall perform decoding processing according to, for example, the VVC method. However, the points to be described later are different from the VVC method proposed at this stage.

次に、実施形態における復号処理部２１４が復号するピクチャにおける矩形領域の分割について説明する。復号処理部２１４は、ＣＰＵ１０１の制御の下、復号対象の１ピクチャ（フレーム画像）を１以上のタイル行、１以上のタイル列ごとに復号処理を行う。 Next, the division of the rectangular region in the picture decoded by the decoding processing unit 214 in the embodiment will be described. Under the control of the CPU 101, the decoding processing unit 214 decodes one picture (frame image) to be decoded for each one or more tile rows and one or more tile columns.

また、本実施形態では、フレーム画像内の矩形領域を集合的にカバーする１以上のスライスから構成され、独立して復号可能な領域をサブピクチャ(subpicture）とする。図３は、ＣＴＵ，タイル、サブピクチャの関係の例を示している。同図では、１ピクチャが３行６列の１８個のタイルに分割されており、それぞれのタイルを集合的にカバーする計２４個のスライス及びサブピクチャが設定された例を示している。 Further, in the present embodiment, a region that is composed of one or more slices that collectively cover a rectangular region in a frame image and can be independently decoded is referred to as a subpicture. FIG. 3 shows an example of the relationship between the CTU, tiles, and subpictures. The figure shows an example in which one picture is divided into 18 tiles in 3 rows and 6 columns, and a total of 24 slices and sub-pictures that collectively cover each tile are set.

なお、ピクチャをいくつの、そして、どのようなサイズにサブピクチャに分割するか等の設定は、符号化装置１００において行われているものとする。この設定は、例えば、動画像を記録するに先立って撮像して映像から導出した色分布が、予め登録した色パターンのいずれに最も近いかに基づいて決定するものとする。 It is assumed that the setting of how many pictures are divided into sub-pictures and what size the pictures are divided into is performed in the coding apparatus 100. This setting is determined based on, for example, which of the color patterns registered in advance is the closest to the color distribution derived from the image taken prior to recording the moving image.

ＶＶＣでは、図１乃至図３に示すような、タイル、スライス、サブピクチャを用いたピクチャ分割を実現するためのパラメータ等の情報（復号側にとっては再現するための情報）を、画像の符号化で得たビットストリームのシーケンスのヘッダ部（ＳＰＳ：Sequence Parameter Set）や、ピクチャに対するヘッダ部（ＰＰＳ：Picture Parameter Set）に格納することになっている。これらのヘッダ部に格納されているパラメータ等の情報は復号処理部２１４によって復号される。 In VVC, information such as parameters for realizing picture division using tiles, slices, and sub-pictures (information for reproduction for the decoding side) as shown in FIGS. 1 to 3 is encoded in the image. It is supposed to be stored in the header part (SPS: Sequence Parameter Set) of the sequence of the bitstream obtained in 1 and the header part (PPS: Picture Parameter Set) for the picture. Information such as parameters stored in these header units is decoded by the decoding processing unit 214.

そこで、本実施形態の復号処理部２１４は、最後のサブピクチャに係るパラメータ情報を省略されたＳＰＳを復号する。よって、そのＳＰＳの情報量はこれまでよりも削減されている。 Therefore, the decoding processing unit 214 of the present embodiment decodes the SPS in which the parameter information related to the last sub-picture is omitted. Therefore, the amount of information of the SPS is reduced more than before.

図４Ａ〜図４Ｆは、画像符号化装置（撮像装置１００）で生成されたＳＰＳの情報のシンタックスを示している。なお、図４Ａ〜図４Ｆはこの順番に連続させることで、ＳＰＳのシンタックスを表している点に注意されたい。図４Ａ〜図４Ｆで記されたシンタックスは図４Ａ〜図４Ｆで記される順番で復号処理部２１４によって復号される。 4A to 4F show the syntax of the SPS information generated by the image coding device (imaging device 100). It should be noted that FIGS. 4A to 4F represent the syntax of SPS by making them continuous in this order. The syntaxes shown in FIGS. 4A to 4F are decoded by the decoding processing unit 214 in the order shown in FIGS. 4A to 4F.

図示されている参照符号４０１、４０２の行の記述における“ｓｐｓ＿ｎｕｍ＿ｓｉｂｐｉｃｓ＿ｍｉｎｕｓ１”は、「１ピクチャに含まれるサブピクチャの個数−１」を表している。そして、条件「ｉ＜ｓｐｓ＿ｎｕｍ＿ｓｉｂｐｉｃｓ＿ｍｉｎｕｓ１」を新たに追加することで、１ピクチャに含まれるサブピクチャの個数をＮとしたとき、それよりも１つ少ないＮ−１個のサブピクチャの位置｛ｓｐｓ＿ｓｕｂｐｉｃ＿ｃｔｕ＿ｔｏｐ＿ｌｅｆｔ＿ｘ［ｉ］，ｓｐｓ＿ｓｕｂｐｉｃ＿ｃｔｕ＿ｔｏｐ＿ｌｅｆｔ＿ｙ［ｉ］｝と、サイズ｛ｓｐｓ＿ｓｕｂｐｉｃ＿ｗｉｄｔｈ＿ｍｉｎｕｓ１［ｉ］，ｓｐｓ＿ｓｕｂｐｉｃ＿ｈｉｇｈｔ＿ｍｉｎｕｓ１［ｉ］｝がＳＰＳに含まれるようにしてある。本実施形態では参照符号４０１及び４０２の行に条件式を追加される構成としたが、本発明はこれに限定されるものではない。例えば４０１の上の行のｆｏｒ文の条件式をｉ＜＝ｓｐｓ＿ｎｕｍ＿ｓｕｂｐｉｃｓ＿ｍｉｎｕｓ１から、ｉ＜ｓｐｓ＿ｎｕｍ＿ｓｕｂｐｉｃｓ＿ｍｉｎｕｓ１に変更してもよい。 "Sps_num_sibics_minus1" in the description of the line of reference numerals 401 and 402 shown represents "the number of sub-pictures included in one picture-1". Then, by newly adding the condition "i <sps_num_sibpics_minus1", when the number of sub-pictures included in one picture is N, the position of N-1 sub-pictures, which is one less than that, {sps_subpic_ctu_top_left_x [i]. ], Sps_subpic_ctu_top_left_y [i]} and the size {sps_subpic_width_minus1 [i], sps_subpic_hight_minus1 [i]} are included in the SPS. In the present embodiment, the conditional expression is added to the lines of reference numerals 401 and 402, but the present invention is not limited thereto. For example, the conditional expression of the for statement in the line above 401 may be changed from i <= sps_num_subpics_minus1 to i <sps_num_subpics_minus1.

なお、復号処理部２１４は、図４Ａ〜図４Ｆに示すＳＰＳのシンタックスに従って復号処理を行う過程で、最後のサブピクチャの位置やサイズは、それ以前のサブピクチャの復号結果から求める。よって、求めた位置に従ってサブピクチャの画像を再現できるので、問題は発生しない。 The decoding processing unit 214 obtains the position and size of the last sub-picture from the decoding results of the previous sub-pictures in the process of performing the decoding process according to the syntax of SPS shown in FIGS. 4A to 4F. Therefore, since the sub-picture image can be reproduced according to the obtained position, no problem occurs.

ＣＴＵの縦及び横の画素数であるＣｔｂＳｉｚｅＹは、１＜＜ＣｔｂＳｉｚｅ（ここで、ＣｔｂＳｉｚｅ＝ｓｐｓ＿ｌｏｇ２＿ｃｔｕ＿ｓｉｚｅ＿ｍｉｎｕｓ５＋５）で示される。復号処理部２１４は、画像の水平方向のＣＴＵ数であるＰｉｃＣｔｂＮｕｍＨについては、（ｓｐｓ＿ｐｉｃ＿ｗｉｄｔｈ＿ｍａｘ＿ｉｎ＿ｌｕｍａ＿ｓａｍｐｌｅｓ＋ＣｔｂＳｉｚｅＹ−１）＞＞ＣｔｂＬｏｇ２ＳｉｚｅＹで求める。同様に、復号処理部２１４は、垂直方向のＣＴＵ数であるＰｉｃＣｔｂＮｕｍＶは、（ｓｐｓ＿ｐｉｃ＿ｈｅｉｇｈｔ＿ｍａｘ＿ｉｎ＿ｌｕｍａ＿ｓａｍｐｌｅｓ＋ＣｔｂＳｉｚｅＹ−１）＞＞ＣｔｂＬｏｇ２ＳｉｚｅＹで求める。 CtbSizeY, which is the number of vertical and horizontal pixels of the CTU, is represented by 1 << CtbSize (here, CtbSize = sps_log2_ctu_size_minus5 + 5). The decoding processing unit 214 obtains PicCtbNumH, which is the number of CTUs in the horizontal direction of the image, by (sps_pic_width_max_in_luma_samples + CtbSizeY-1) >> CtbLog2SizeY. Similarly, the decoding processing unit 214 obtains PicCtbNumV, which is the number of CTUs in the vertical direction, by (sps_pic_height_max_in_luma_samples + CtbSizeY-1) >> CtbLog2SizeY.

次に、ｓｐｓ＿ｎｕｍ＿ｓｕｂｐｉｃｓ＿ｍｉｎｕｓ１番目（最後）のサブピクチャの左上のＣＴＵの座標の導出に関して説明する。初めにＸ座標の求め方について説明する。まず、復号処理部２１４は、最後のサブピクチャの左に隣接するサブピクチャを求める。画像の下端のＣＴＵを含むサブピクチャの中で、サブピクチャの右端ＣＴＵが最大のＸ座標を持つサブピクチャが、最後のサブピクチャの左に隣接するサブピクチャであると判断する。下記の式が成り立つサブピクチャが下端のＣＴＵを含むサブピクチャである。
ＰｉｃＣｔｂＮｕｍＶ＝＝ｓｐｓ＿ｓｕｂｐｉｃ＿ｃｔｕ＿ｔｏｐ＿ｌｅｆｔ＿ｙ［ｉ］＋ｓｐｓ＿ｓｕｂｐｉｃ＿ｈｅｉｇｈｔ＿ｍｉｎｕｓ１［ｉ］＋１ Next, the derivation of the coordinates of the CTU on the upper left of the first (last) subpicture of sps_num_subpics_minus will be described. First, how to obtain the X coordinate will be described. First, the decoding processing unit 214 obtains a subpicture adjacent to the left of the last subpicture. Among the sub-pictures including the CTU at the lower end of the image, it is determined that the sub-picture having the maximum X coordinate at the right end CTU of the sub-picture is the sub-picture adjacent to the left of the last sub-picture. The sub-picture for which the following equation holds is a sub-picture including the CTU at the lower end.
PicCtbNumV == sps_subpic_ctu_top_left_y [i] + sps_subpic_height_minus1 [i] + 1

復号処理部２１４は、画像内で上記条件を満たす全てのサブピクチャの中で、最も右に位置するサブピクチャ、即ちｓｐｓ＿ｓｕｂｐｉｃ＿ｃｔｕ＿ｔｏｐ＿ｌｅｆｔ＿ｘ［ｉ］が最も大きいサブピクチャが左に隣接するサブピクチャであると判断する。従って、復号処理部２１４は、最後のサブピクチャの左に隣接するサブピクチャがＮ番目のサブピクチャだとすると、最後のサブピクチャの左上のＣＴＵのＸ座標は以下の式で求める事ができる。
ｓｐｓ＿ｓｕｂｐｉｃ＿ｃｔｕ＿ｔｏｐ＿ｌｅｆｔ＿ｘ［Ｎ］＋ｓｐｓ＿ｓｕｂｐｉｃ＿ｗｉｄｔｈ＿ｍｉｎｕｓ１［Ｎ］＋１ The decoding processing unit 214 determines that, among all the sub-pictures satisfying the above conditions in the image, the sub-picture located on the rightmost side, that is, the sub-picture having the largest sps_subpic_ctu_top_left_x [i] is the sub-picture adjacent to the left side. do. Therefore, assuming that the subpicture adjacent to the left of the last subpicture is the Nth subpicture, the decoding processing unit 214 can obtain the X coordinate of the CTU on the upper left of the last subpicture by the following equation.
sps_subpic_ctu_top_left_x [N] + sps_subpic_width_minus1 [N] + 1

次に、最後のサブピクチャの左上のＣＴＵのＹ座標の求め方について説明する。まず、復号処理部２１４は、最後のサブピクチャの上に隣接するサブピクチャを求める。復号処理部２１４は、画像の右端のＣＴＵを含むサブピクチャの中で、サブピクチャの下端のＣＴＵが最大のＹ座標を持つサブピクチャが、最後のサブピクチャの上に隣接するサブピクチャであると判断する。下記の式が成り立つサブピクチャが右端のＣＴＵを含むサブピクチャである。
ＰｉｃＣｔｂＮｕｍＨ＝＝ｓｐｓ＿ｓｕｂｐｉｃ＿ｃｔｕ＿ｔｏｐ＿ｌｅｆｔ＿ｘ［ｉ］＋ｓｐｓ＿ｓｕｂｐｉｃ＿ｗｉｄｔｈ＿ｍｉｎｕｓ１［ｉ］＋１ Next, how to obtain the Y coordinate of the CTU on the upper left of the last subpicture will be described. First, the decoding processing unit 214 obtains a subpicture adjacent to the last subpicture. The decoding processing unit 214 determines that among the sub-pictures including the CTU at the right end of the image, the sub-picture having the maximum Y coordinate of the CTU at the lower end of the sub-picture is a sub-picture adjacent to the last sub-picture. to decide. The sub-picture for which the following equation holds is a sub-picture including the CTU at the right end.
PicCtbNumH == sps_subpic_ctu_top_left_x [i] + sps_subpic_width_minus1 [i] + 1

復号処理部２１４は、画像内で上記条件を満たす全てのサブピクチャの中で、最も下に位置するサブピクチャ、即ち、復号処理部２１４は、ｓｐｓ＿ｓｕｂｐｉｃ＿ｃｔｕ＿ｔｏｐ＿ｌｅｆｔ＿ｙ［ｉ］が最も大きいサブピクチャが上に隣接するサブピクチャであると判断する。従って、最後のサブピクチャの上に隣接するサブピクチャがＭ番目のサブピクチャだとすると、復号処理部２１４は、最後のサブピクチャの左上のＣＴＵのＹ座標は以下の式で求める事ができる。
ｓｐｓ＿ｓｕｂｐｉｃ＿ｃｔｕ＿ｔｏｐ＿ｌｅｆｔ＿ｙ［Ｍ］＋ｓｐｓ＿ｓｕｂｐｉｃ＿ｈｅｉｇｈｔ＿ｍｉｎｕｓ１［Ｍ］＋１ The decoding processing unit 214 has the lowest sub-picture among all the sub-pictures satisfying the above conditions in the image, that is, the decoding processing unit 214 has the sub-picture having the largest sps_subpic_ctu_top_left_y [i] adjacent to the top. Judge that it is a sub-picture. Therefore, assuming that the sub-picture adjacent to the last sub-picture is the M-th sub-picture, the decoding processing unit 214 can obtain the Y coordinate of the CTU on the upper left of the last sub-picture by the following equation.
sps_subpic_ctu_top_left_y [M] + sps_subpic_height_minus1 [M] + 1

そこで、本実施形態では、ＳＰＳのシンタックスを示す図４Ａ〜図４Ｆにおける、参照符号４０３、４０４の行は、１ピクチャ内に２以上のサブピクチャが存在する場合にのみ有効であるとして｛｝内のシンタクスが符号化され、１ピクチャ内に１つのサブピクチャのみが存在する場合には｛｝内のシンタクスは符号化されていない。 Therefore, in the present embodiment, the lines of reference numerals 403 and 404 in FIGS. 4A to 4F showing the syntax of SPS are considered to be valid only when two or more subpictures are present in one picture {}. If the syntax in is encoded and there is only one subpicture in one picture, the syntax in {} is not encoded.

また、図５Ａの参照符号５０１の行の上の行に存在するｐｐｓ＿ｎｏ＿ｐｉｃ＿ｐａｒｔｉｔｉｏｎ＿ｆｌａｇは、画像がサブピクチャ、タイル、スライスに分割されていないか否かを示すフラグである。当該シンタクスが１の時は、画像が単一のサブピクチャ、タイル、スライスで構成されている事を示す。図５Ａ〜図５Ｃ（この順番にＰＰＳのシンタックスを表す）が示すＰＰＳ（Picture Parameter Set）のシンタックスにおける参照符号５０１が示す行、並びに、参照符号５０３が示す行中の条件「＆＆（ｐｐｓ＿ｎｕｍ＿ｓｕｂｐｉｃｓ＿ｍｉｎｕｓ１＞０」は、１ピクチャ内に２以上のサブピクチャが存在する場合にのみ有効であるとして｛｝内のシンタクスが符号化されている。そして、１ピクチャ内に１つのサブピクチャのみが存在する場合には｛｝内のシンタクスが符号化されていない。図５Ａ〜５Ｃで記されたシンタックスは、図５Ａ〜５Ｃで記された順番で復号処理部２１４によって復号される。 Further, the pps_no_pic_partition_flag existing in the line above the line of reference numeral 501 in FIG. 5A is a flag indicating whether or not the image is divided into subpictures, tiles, and slices. When the syntax is 1, it indicates that the image is composed of a single subpicture, tile, or slice. The line indicated by reference numeral 501 in the syntax of PPS (Picture Parameter Set) shown in FIGS. 5A to 5C (in this order representing the syntax of PPS), and the condition "&& (pps_num_subpics_minus1" in the line indicated by reference numeral 503). The syntax in {} is encoded as "> 0" is valid only when there are two or more subpictures in one picture, and there is only one subpicture in one picture. In some cases, the syntax in {} is not encoded. The syntaxes shown in FIGS. 5A-5C are decoded by the decoding processing unit 214 in the order shown in FIGS. 5A-5C.

そして、図６Ａ〜図６Ｃ（この順番にスライスヘッダのシンタックスを表す）が示すスライスヘッダのシンタックスおける参照符号６０１が示す行中の条件「ｓｐｓ＿ｎｕｍ＿ｓｕｂｐｉｃｓ＿ｍｉｎｕｓ１＞０」は、１ピクチャ内に２以上のサブピクチャが存在する場合にのみ有効であるとして｛｝内のシンタクスが符号化されている。そして、１ピクチャ内に１つのサブピクチャのみが存在する場合には｛｝内のシンタクスが符号化されていない。図６Ａ〜６Ｃで記されたシンタックスは、図６Ａ〜６Ｃで記された順番で復号処理部２１４によって復号される。 Then, the condition "sps_num_subpix_minus1> 0" in the line indicated by the reference code 601 in the syntax of the slice header shown in FIGS. 6A to 6C (in this order, the syntax of the slice header is represented) is two or more in one picture. The syntax in {} is encoded as valid only if a subpicture is present. When only one sub-picture exists in one picture, the syntax in {} is not encoded. The syntaxes shown in FIGS. 6A to 6C are decoded by the decoding processing unit 214 in the order shown in FIGS. 6A to 6C.

上記の結果、ＣＰＵ１０１は、１ピクチャ内に２以上のサブピクチャが存在する場合のみ、サブピクチャを特定するインデックス番号が復号される。そして、ＣＰＵ１０１は、１ピクチャ内に１つのサブピクチャしか存在しない場合には、サブピクチャのインデックス番号の符号を復号しなくなり、その分だけ符号量を削減できるようになる。 As a result of the above, the CPU 101 decodes the index number that identifies the sub-picture only when there are two or more sub-pictures in one picture. Then, when only one sub-picture exists in one picture, the CPU 101 does not decode the code of the index number of the sub-picture, and the code amount can be reduced by that amount.

上記の結果、特に、１ピクチャ内に含まれるサブピクチャの数が１つの場合に、ＳＰＳ，ＰＰＳ，スライスヘッダの情報量を削減できることになる。また、冗長な情報がない分、画像復号装置２００における処理速度も向上することができる。 As a result of the above, the amount of information in the SPS, PPS, and slice header can be reduced, especially when the number of sub-pictures contained in one picture is one. Further, since there is no redundant information, the processing speed of the image decoding apparatus 200 can be improved.

次に、図９、図１１〜図１４を用いて、これまで説明してきた画像復号装置について、更に詳細に説明する。なお、以下の説明では、主に復号処理部２１４における処理として説明してきた処理の更なる詳細について説明する。 Next, the image decoding apparatus described so far will be described in more detail with reference to FIGS. 9 and 11 to 14. In the following description, further details of the processing described mainly as the processing in the decoding processing unit 214 will be described.

最初に、図９を用いて画像復号装置について説明する。図９は、復号処理部２１４等を更に詳細に記した機能ブロック図である。 First, the image decoding apparatus will be described with reference to FIG. FIG. 9 is a functional block diagram showing the decoding processing unit 214 and the like in more detail.

画像復号装置（復号処理部２１４）は、分離復号部９０２、復号部９０３、逆量子化・逆変換部９０４、画像再生部９０５、フレームメモリ９０６、及び、インループフィルタ部９０７を有する。 The image decoding device (decoding processing unit 214) includes a separation decoding unit 902, a decoding unit 903, an inverse quantization / inverse conversion unit 904, an image reproduction unit 905, a frame memory 906, and an in-loop filter unit 907.

分離復号部９０２は、入力端子９０１を介して入力したビットストリームから、復号処理に関する情報や係数に関する符号データに分離し、復号部９０３へ送る。また分離復号部９０２は、ビットストリームのヘッダ部に存在する符号データを復号する。本実施形態お分離復号部９０２は、サブピクチャ、タイル、スライス、基本ブロックの大きさ等の画像の分割・定義に関するヘッダ情報を復号して分割情報を生成し、画像再生部９０５に出力する。つまり、分離復号部９０２は、図１５の統合符号化部８１１の逆の動作を行うことになる。 The separation / decoding unit 902 separates the bitstream input via the input terminal 901 into information related to the decoding process and code data related to the coefficient, and sends the information to the decoding unit 903. Further, the separation / decoding unit 902 decodes the code data existing in the header unit of the bit stream. The separation / decoding unit 902 of the present embodiment decodes header information related to image division / definition such as sub-pictures, tiles, slices, and basic block sizes, generates division information, and outputs the division information to the image reproduction unit 905. That is, the separation / decoding unit 902 performs the reverse operation of the integrated coding unit 811 of FIG.

復号部９０３は、分離復号部９０２から入力した符号データを復号し、量子化係数および予測情報を再生する。 The decoding unit 903 decodes the code data input from the separation decoding unit 902, and reproduces the quantization coefficient and the prediction information.

逆量子化・逆変換部９０４は、復号部９０３によって再生された量子化係数を逆量子化して変換係数を得、さらに変換係数を逆直交変換して予測誤差を再生する。 The inverse quantization / inverse conversion unit 904 inversely quantizes the quantization coefficient reproduced by the decoding unit 903 to obtain a conversion coefficient, and further performs inverse orthogonal conversion of the conversion coefficient to reproduce a prediction error.

画像再生部９０５は、分離復号部９０２から入力した予測情報に基づいて、フレームメモリ９０６を適宜参照して、予測画像データを生成する。そして、画像再生部９０５は、この予測画像データと、逆量子化・逆変換部９０４で再生された予測誤差から再生画像データを生成する。画像再生部９０５は、分離復号部９０２より入力された分割情報に基づいてサブピクチャ、タイル、スライスのフレーム中の位置を特定し、生成された再生画像データをフレームメモリ９０６に出力する。 The image reproduction unit 905 generates predicted image data by appropriately referring to the frame memory 906 based on the prediction information input from the separation / decoding unit 902. Then, the image reproduction unit 905 generates the reproduction image data from the predicted image data and the prediction error reproduced by the inverse quantization / inverse conversion unit 904. The image reproduction unit 905 identifies the positions of the sub-pictures, tiles, and slices in the frame based on the division information input from the separation / decoding unit 902, and outputs the generated reproduced image data to the frame memory 906.

インループフィルタ部９０７は、図１５におけるインループフィルタ部８０９と同様、再生画像に対し、デブロッキングフィルタなどのインループフィルタ処理を行い、フィルタ処理された画像を出力する。 Similar to the in-loop filter unit 809 in FIG. 15, the in-loop filter unit 907 performs in-loop filter processing such as a deblocking filter on the reproduced image, and outputs the filtered image.

フレームメモリ９０６に生成、格納された１フレーム分の画像データは、出力端子９０８を介して外部に出力される。 The image data for one frame generated and stored in the frame memory 906 is output to the outside via the output terminal 908.

画像復号装置における画像の復号動作を以下に説明する。本実施形態では、先に説明した画像符号化装置で生成されたビットストリームをフレーム単位で入力する構成となっているが、１フレーム分の静止画像ビットストリームを入力する構成としても構わない。また、本実施形態では説明を容易にするため、イントラ予測復号処理のみを説明するが、これに限定されずインター予測復号処理においても適用可能である。 The image decoding operation in the image decoding device will be described below. In the present embodiment, the bitstream generated by the image coding apparatus described above is input in frame units, but a still image bitstream for one frame may be input. Further, in the present embodiment, only the intra-predictive decoding process will be described for the sake of simplicity, but the present embodiment is not limited to this and can be applied to the inter-predictive decoding process.

図９において、入力端子９０１から入力された１フレーム分のビットストリームは分離復号部９０２に入力される。分離復号部９０２は、ビットストリームから復号処理に関する情報や係数に関する符号データに分離し、ビットストリームのヘッダ部に存在する符号データを復号する。より具体的には、図１２における画像サイズ情報、基本ブロック分割情報、サブピクチャ定義情報Ｓ、サブピクチャ定義情報Ｐ、タイル分割情報、スライス分割情報Ｐ、スライス分割情報ＳＨを復号し、分割情報を生成して画像再生部９０５へ送る。続いて、ピクチャデータの基本ブロック単位の符号データを再生し、復号部９０３に出力する。 In FIG. 9, one frame of bitstream input from the input terminal 901 is input to the separation / decoding unit 902. The separation / decoding unit 902 separates the bitstream into code data related to information related to the decoding process and coefficients, and decodes the code data existing in the header unit of the bitstream. More specifically, the image size information, the basic block division information, the sub-picture definition information S, the sub-picture definition information P, the tile division information, the slice division information P, and the slice division information SH in FIG. 12 are decoded, and the division information is obtained. Generate and send to the image reproduction unit 905. Subsequently, the code data in the basic block unit of the picture data is reproduced and output to the decoding unit 903.

復号部９０３は、符号データを復号し、量子化係数および予測情報を再生する。復号部９０３は、再生された量子化係数については逆量子化・逆変換部９０４に出力し、再生された予測情報については画像再生部９０５に出力する。 The decoding unit 903 decodes the code data and reproduces the quantization coefficient and the prediction information. The decoding unit 903 outputs the regenerated quantization coefficient to the inverse quantization / inverse conversion unit 904, and outputs the reproduced prediction information to the image reproduction unit 905.

逆量子化・逆変換部９０４は、復号部９０３から入力した量子化係数に対し、逆量子化を行って直交変換係数を生成する。そして、逆量子化・逆変換部９０４は、生成した直交変換係数に対して逆直交変換を施して予測誤差を再生する。逆量子化・逆変換部９０４は、再生された予測誤差を画像再生部９０５に出力する。 The inverse quantization / inverse conversion unit 904 performs inverse quantization on the quantization coefficient input from the decoding unit 903 to generate an orthogonal conversion coefficient. Then, the inverse quantization / inverse transformation unit 904 performs the inverse orthogonal transformation on the generated orthogonal transformation coefficient and reproduces the prediction error. The inverse quantization / inverse transformation unit 904 outputs the reproduced prediction error to the image reproduction unit 905.

画像再生部９０５は、復号部９０３から入力した予測情報に基づいて、フレームメモリ９０６を適宜参照し、予測画像を再生する。画像再生部９０５は、この予測画像と、逆量子化・逆変換部９０４から入力した予測誤差とから画像データを再生する。そして、画像再生部９０５は、生成された再生画像データを、分離復号部９０２より入力された分割情報に基づいて、例えば図５のような、タイル、スライス、サブピクチャの形状及びフレーム中の位置を特定した上で、フレームメモリ９０６における該当する位置に格納する。このフレームメモリ９０６に格納された画像データは、予測の際の参照に用いられることになる。 The image reproduction unit 905 appropriately refers to the frame memory 906 based on the prediction information input from the decoding unit 903, and reproduces the predicted image. The image reproduction unit 905 reproduces image data from this predicted image and the prediction error input from the inverse quantization / inverse conversion unit 904. Then, the image reproduction unit 905 uses the generated reproduction image data based on the division information input from the separation / decoding unit 902, for example, as shown in FIG. 5, in the shape of the tile, slice, and sub-picture, and the position in the frame. Is specified and then stored in the corresponding position in the frame memory 906. The image data stored in the frame memory 906 will be used as a reference at the time of prediction.

インループフィルタ部９０７は、図１５のインループフィルタ部８０９と同様、フレームメモリ９０６から再生画像を読み出し、デブロッキングフィルタなどのインループフィルタ処理を行う。そして、インループフィルタ部９０７は、フィルタ処理された画像を再びフレームメモリ９０６に格納する。 Similar to the in-loop filter unit 809 of FIG. 15, the in-loop filter unit 907 reads the reproduced image from the frame memory 906 and performs in-loop filter processing such as a deblocking filter. Then, the in-loop filter unit 907 stores the filtered image in the frame memory 906 again.

フレームメモリ９０６に格納された再生画像は、最終的には出力端子９０８から外部に出力される。 The reproduced image stored in the frame memory 906 is finally output to the outside from the output terminal 908.

図１１は、実施形態に係る画像復号装置における１フレーム分の画像の復号処理を示すフローチャートである。 FIG. 11 is a flowchart showing an image decoding process for one frame in the image decoding apparatus according to the embodiment.

まず、Ｓ１１０１にて、分離復号部９０２は、ビットストリームから復号処理に関する情報や係数に関する符号データに分離し、ヘッダ部分の符号データを復号する。分離復号部９０２は、図１２におけるサブピクチャ定義情報Ｓ、サブピクチャ定義情報Ｐ、タイル分割情報、スライス分割情報Ｐ、スライス分割情報ＳＨなどを復号し、復号のための情報を生成して画像再生部９０５へ送る。本実施形態におけるビットストリーム内に格納されている画像の分割は図１４の通りである。 First, in S1101, the separation / decoding unit 902 separates the bitstream into code data related to information related to the decoding process and coefficients, and decodes the code data of the header portion. The separation / decoding unit 902 decodes the sub-picture definition information S, the sub-picture definition information P, the tile division information, the slice division information P, the slice division information SH, etc. in FIG. 12, generates information for decoding, and reproduces the image. Send to section 905. The division of the image stored in the bit stream in the present embodiment is as shown in FIG.

まず画像サイズ情報のｓｐｓ＿ｐｉｃ＿ｗｉｄｔｈ＿ｍａｘ＿ｉｎ＿ｌｕｍａ＿ｓａｍｐｌｅｓ及びｓｐｓ＿ｐｉｃ＿ｈｅｉｇｈｔ＿ｍａｘ＿ｉｎ＿ｌｕｍａ＿ｓａｍｐｌｅｓの値から、画像が１１５２×１１５２画素であることが導かれる。 First, from the values of sps_pic_width_max_in_luma_samples and sps_pic_height_max_in_luma_samples of the image size information, it is derived that the image is 1152 × 1152 pixels.

次に基本ブロック分割情報のｓｐｓ＿ｌｏｇ２＿ｃｔｕ＿ｓｉｚｅ＿ｍｉｎｕｓ５の値が１であることから、ＣｔｂＬｏｇ２ＳｉｚｅＹ＝６である事がわかる。そして基本ブロックの大きさが、１＜＜ＣｔｂＬｏｇ２ＳｉｚｅＹより、６４×６４画素（ＣｔｂＳｉｚｅＹ＝６４）と導かれる。画像の水平方向の基本ブロック数であるＰｉｃＣｔｂＮｕｍＨは、（ｓｐｓ＿ｐｉｃ＿ｗｉｄｔｈ＿ｍａｘ＿ｉｎ＿ｌｕｍａ＿ｓａｍｐｌｅｓ＋ＣｔｂＳｉｚｅＹ−１）＞＞ＣｔｂＬｏｇ２ＳｉｚｅＹより１８と導かれる。垂直方向の基本ブロック数であるＰｉｃＣｔｂＮｕｍＶも同様に、１８と導かれる。 Next, since the value of sps_log2_ctu_size_minus5 of the basic block division information is 1, it can be seen that CtbLog2SizeY = 6. Then, the size of the basic block is derived from 1 << CtbLog2SizeY to be 64 × 64 pixels (CtbSizeY = 64). PicCtbNumH, which is the number of basic blocks in the horizontal direction of the image, is derived from (sps_pic_width_max_in_luma_samples + CtbSizeY-1) >> CtbLog2SizeY. Similarly, PicCtbNumV, which is the number of basic blocks in the vertical direction, is also derived as 18.

次にサブピクチャ定義情報Ｓを取得する。まずｓｐｓ＿ｓｕｂｐｉｃ＿ｉｎｆｏ＿ｐｒｅｓｅｎｔ＿ｆｌａｇが１であることからサブピクチャの定義に関する情報が存在することがわかる。続いてｓｐｓ＿ｎｕｍ＿ｓｕｂｐｉｃｓ＿ｍｉｎｕｓ１で示されるサブピクチャの数−１の情報を取得する。本実施形態では値４を取得し、この値に１を加えることで、サブピクチャの数は５であることが分かる。ｓｐｓ＿ｓｕｂｐｉｃ＿ｗｉｄｔｈ＿ｍｉｎｕｓ１［０］は１１、ｓｐｓ＿ｓｕｂｐｉｃ＿ｈｅｉｇｈｔ＿ｍｉｎｕｓ１［０］は５なので、画像の左上端のサブピクチャ０の水平方向の基本ブロック数は１２、垂直方向の基本ブロック数は６な事がわかる。ｓｐｓ＿ｓｕｂｐｉｃ＿ｃｔｕ＿ｔｏｐ＿ｌｅｆｔ＿ｘ［１］は１２、ｓｐｓ＿ｓｕｂｐｉｃ＿ｃｔｕ＿ｔｏｐ＿ｌｅｆｔ＿ｙ［１］は０なので、サブピクチャ１の左上の基本ブロックの座標は（１２、０）である事がわかる。ｓｐｓ＿ｓｕｂｐｉｃ＿ｗｉｄｔｈ＿ｍｉｎｕｓ１［１］は５、ｓｐｓ＿ｓｕｂｐｉｃ＿ｈｅｉｇｈｔ＿ｍｉｎｕｓ１［１］は１なので、サブピクチャ１の水平方向の基本ブロック数は６、垂直方向の基本ブロック数は２な事がわかる。ｓｐｓ＿ｓｕｂｐｉｃ＿ｃｔｕ＿ｔｏｐ＿ｌｅｆｔ＿ｘ［２］は１２、ｓｐｓ＿ｓｕｂｐｉｃ＿ｃｔｕ＿ｔｏｐ＿ｌｅｆｔ＿ｙ［２］は２なので、サブピクチャ２の左上の基本ブロックの座標は（１２、２）である事がわかる。ｓｐｓ＿ｓｕｂｐｉｃ＿ｗｉｄｔｈ＿ｍｉｎｕｓ１［２］は５、ｓｐｓ＿ｓｕｂｐｉｃ＿ｈｅｉｇｈｔ＿ｍｉｎｕｓ１［２］は３なので、サブピクチャ２の水平方向の基本ブロック数は６、垂直方向の基本ブロック数は４な事がわかる。ｓｐｓ＿ｓｕｂｐｉｃ＿ｃｔｕ＿ｔｏｐ＿ｌｅｆｔ＿ｘ［３］は０、ｓｐｓ＿ｓｕｂｐｉｃ＿ｃｔｕ＿ｔｏｐ＿ｌｅｆｔ＿ｙ［３］は６なので、サブピクチャ３の左上の基本ブロックの座標は（０、６）である事がわかる。ｓｐｓ＿ｓｕｂｐｉｃ＿ｗｉｄｔｈ＿ｍｉｎｕｓ１［３］は１１、ｓｐｓ＿ｓｕｂｐｉｃ＿ｈｅｉｇｈｔ＿ｍｉｎｕｓ１［３］は１１なので、サブピクチャ３の水平方向の基本ブロック数は１２、垂直方向の基本ブロック数は１２な事がわかる。 Next, the sub-picture definition information S is acquired. First, since sps_subpic_info_present_flag is 1, it can be seen that there is information regarding the definition of the sub-picture. Subsequently, the information of the number -1 of the sub-pictures represented by sps_num_subpics_minus1 is acquired. In the present embodiment, by acquiring the value 4 and adding 1 to this value, it can be seen that the number of sub-pictures is 5. Since sps_subpic_width_minus1 [0] is 11 and sps_subpic_height_minus1 [0] is 5, it can be seen that the number of horizontal basic blocks and the number of vertical basic blocks of subpicture 0 at the upper left corner of the image is 12. Since sps_subpic_ctu_top_left_x [1] is 12 and sps_subpic_ctu_top_left_y [1] is 0, it can be seen that the coordinates of the upper left basic block of subpicture 1 are (12, 0). Since sps_subpic_width_minus1 [1] is 5 and sps_subpic_height_minus1 [1] is 1, it can be seen that the number of basic blocks in the horizontal direction of subpicture 1 is 6 and the number of basic blocks in the vertical direction is 2. Since sps_subpic_ctu_top_left_x [2] is 12 and sps_subpic_ctu_top_left_y [2] is 2, it can be seen that the coordinates of the upper left basic block of subpicture 2 are (12, 2). Since sps_subpic_width_minus1 [2] is 5 and sps_subpic_height_minus1 [2] is 3, it can be seen that the number of basic blocks in the horizontal direction of subpicture 2 is 6 and the number of basic blocks in the vertical direction is 4. Since sps_subpic_ctu_top_left_x [3] is 0 and sps_subpic_ctu_top_left_y [3] is 6, it can be seen that the coordinates of the upper left basic block of the subpicture 3 are (0, 6). Since sps_subpic_width_minus1 [3] is 11 and sps_subpic_height_minus1 [3] is 11, it can be seen that the number of basic blocks in the horizontal direction of the sub-picture 3 is 12 and the number of basic blocks in the vertical direction is 12.

サブピクチャ４（最後のサブピクチャ）の左上の基本ブロックの座標を導出する。ＰｉｃＣｔｂＮｕｍＶが１８であり、ｓｐｓ＿ｓｕｂｐｉｃ＿ｃｔｕ＿ｔｏｐ＿ｌｅｆｔ＿ｙ［ｉ］＋ｓｐｓ＿ｓｕｂｐｉｃ＿ｈｅｉｇｈｔ＿ｍｉｎｕｓ１［ｉ］＋１＝＝１８を満たすのは、サブピクチャ３である。ｓｐｓ＿ｓｕｂｐｉｃ＿ｃｔｕ＿ｔｏｐ＿ｌｅｆｔ＿ｘ［３］＋ｓｐｓ＿ｓｕｂｐｉｃ＿ｗｉｄｔｈ＿ｍｉｎｕｓ１［３］＋１＝１２であり、最後のサブピクチャの左上の基本ブロックのＸ座標は１２となる。ＰｉｃＣｔｂＮｕｍＨが１８であり、ｓｐｓ＿ｓｕｂｐｉｃ＿ｃｔｕ＿ｔｏｐ＿ｌｅｆｔ＿ｘ［ｉ］＋ｓｐｓ＿ｓｕｂｐｉｃ＿ｗｉｄｔｈ＿ｍｉｎｕｓ１［ｉ］＋１＝＝１８を満たすのは、サブピクチャ１と２である。その中でｓｐｓ＿ｓｕｂｐｉｃ＿ｃｔｕ＿ｔｏｐ＿ｌｅｆｔ＿ｙ［ｉ］が最も大きいのはサブピクチャ２である。ｓｐｓ＿ｓｕｂｐｉｃ＿ｃｔｕ＿ｔｏｐ＿ｌｅｆｔ＿ｙ［２］＋ｓｐｓ＿ｓｕｂｐｉｃ＿ｈｅｉｇｈｔ＿ｍｉｎｕｓ１［２］＋１＝６であり、最後のサブピクチャの左上の基本ブロックのＹ座標は６となる。サブピクチャ４の水平方向の基本ブロック数は、ＰｉｃＣｔｂＮｕｍＨ−１２で６、垂直方向の基本ブロック数は、ＰｉｃＣｔｂＮｕｍＶ−６で１２となる。 The coordinates of the upper left basic block of subpicture 4 (last subpicture) are derived. It is subpicture 3 that PicCtbNumV is 18, and sps_subpic_ctu_top_left_y [i] + sps_subpic_height_minus1 [i] + 1 == 18 is satisfied. sps_subpic_ctu_top_left_x [3] + sps_subpic_width_minus1 [3] + 1 = 12, and the X coordinate of the upper left basic block of the last subpicture is 12. It is subpictures 1 and 2 that have a PicCtbNumH of 18 and satisfy sps_subpic_ctu_top_left_x [i] + sps_subpic_width_minus1 [i] + 1 == 18. Among them, subpicture 2 has the largest sps_subpic_ctu_top_left_y [i]. sps_subpic_ctu_top_left_y [2] + sps_subpic_height_minus1 [2] + 1 = 6, and the Y coordinate of the upper left basic block of the last subpicture is 6. The number of basic blocks in the horizontal direction of the sub-picture 4 is 6 for PicCtbNumH-12, and the number of basic blocks in the vertical direction is 12 for PicCtbNumV-6.

このようにして、従来手法では最後のサブピクチャの左上の基本ブロックの座標情報を符号化していたが、それを用いずに他のサブピクチャの情報から最後のサブピクチャの左上の基本ブロックの座標を求める事ができる。これによって符号量を削減したビットストリームを復号する事が可能になる。 In this way, in the conventional method, the coordinate information of the upper left basic block of the last subpicture was encoded, but without using it, the coordinates of the upper left basic block of the last subpicture from the information of other subpictures. Can be asked. This makes it possible to decode a bitstream with a reduced amount of code.

次に、ｓｐｓ＿ｓｕｂｐｉｃ＿ｉｄ＿ｌｅｎ＿ｍｉｎｕｓ１の値が２のため、サブピクチャのＩＤのビット長が３である事がわかる。また、本実施形態ではｓｐｓ＿ｓｕｂｐｉｃ＿ｉｄ＿ｍａｐｐｉｎｇ＿ｅｘｐｌｉｃｉｔｌｙ＿ｓｉｇｎａｌｅｄ＿ｆｌａｇが０のため、明示的なサブピクチャのＩＤのマッピングがない事がわかる。 Next, since the value of sps_subpic_id_len_minus1 is 2, it can be seen that the bit length of the ID of the subpicture is 3. Further, in the present embodiment, since sps_subpic_id_mapping_explicity_signed_flag is 0, it can be seen that there is no explicit mapping of sub-picture IDs.

従来はｓｐｓ＿ｓｕｂｐｉｃ＿ｉｎｆｏ＿ｐｒｅｓｅｎｔ＿ｆｌａｇが１ならばサブピクチャの数が１の時にも符号化していたが、本実施形態のフォーマットではサブピクチャの数が１の時には、サブピクチャのＩＤを０とする事でＩＤに関する以下のシンタクスを復号しない。
・ｓｐｓ＿ｓｕｂｐｉｃ＿ｉｄ＿ｌｅｎ＿ｍｉｎｕｓ１
・ｓｐｓ＿ｓｕｂｐｉｃ＿ｉｄ＿ｍａｐｐｉｎｇ＿ｅｘｐｌｉｃｉｔｌｙ＿ｓｉｇｎａｌｅｄ＿ｆｌａｇ
・ｓｐｓ＿ｓｕｂｐｉｃ＿ｉｄ＿ｍａｐｐｉｎｇ＿ｐｒｅｓｅｎｔ＿ｆｌａｇ
・ｓｐｓ＿ｓｕｂｐｉｃ＿ｉｄ Conventionally, if sps_subpic_info_present_flag is 1, encoding is performed even when the number of sub-pictures is 1, but in the format of this embodiment, when the number of sub-pictures is 1, the ID of the sub-pictures is set to 0 to describe the following. Do not decrypt the syntax of.
・ Sps_subpic_id_len_minus1
・ Sps_subpic_id_mapping_explicity_signaled_flag
・ Sps_subpic_id_mapping_present_flag
・ Sps_subpic_id

これによってサブピクチャの数が１の時に符号量を削減したビットストリームを復号する事ができる。また、本実施形態では、サブピクチャの数が１の時にはサブピクチャのＩＤを０としたが、本発明はこれに限定されるものではない。例えば１でもよい。また分離復号部９０２は、ｓｐｓ＿ｓｕｂｐｉｃ＿ｉｄ＿ｍａｐｐｉｎｇ＿ｅｘｐｌｉｃｉｔｌｙ＿ｓｉｇｎａｌｅｄ＿ｆｌａｇはサブピクチャの数に関係なく常に復号しなくてもよい。 As a result, when the number of sub-pictures is 1, it is possible to decode the bit stream in which the code amount is reduced. Further, in the present embodiment, when the number of sub-pictures is 1, the ID of the sub-pictures is set to 0, but the present invention is not limited to this. For example, it may be 1. Further, the separation / decoding unit 902 does not have to always decode sps_subpic_id_mapping_explicity_signed_flag regardless of the number of sub-pictures.

分離復号部９０２は次に、ピクチャ・パラメータ・セットにおいてｐｐｓ＿ｎｏ＿ｐｉｃ＿ｐａｒｔｉｔｉｏｎ＿ｆｌａｇを復号し、０の値を取得する。これによって、画像が単一のサブピクチャ、タイル、スライスで構成されているわけではない事がわかる。 The separation decoding unit 902 then decodes pps_no_pic_partition_flag in the picture parameter set and obtains a value of 0. This shows that the image is not composed of a single subpicture, tile, or slice.

ｐｐｓ＿ｎｏ＿ｐｉｃ＿ｐａｒｔｉｔｉｏｎ＿ｆｌａｇが０なので、分離復号部９０２は続いてタイル分割情報、サブピクチャ定義情報Ｐ（ＰＰＳ）、スライス分割情報Ｐ（ＰＰＳ）を復号する。本実施形態では、ｐｐｓ＿ｓｕｂｐｉｃ＿ｉｄ＿ｍａｐｐｉｎｇ＿ｐｒｅｓｅｎｔ＿ｆｌａｇの値が０のため、ＰＰＳにサブピクチャのＩＤに関するシンタクスは符号化されない。 Since pps_no_pic_partition_flag is 0, the separation / decoding unit 902 subsequently decodes the tile division information, the sub-picture definition information P (PPS), and the slice division information P (PPS). In this embodiment, since the value of pps_subpic_id_mapping_present_flag is 0, the syntax regarding the ID of the sub-picture is not encoded in PPS.

従来手法ではｐｐｓ＿ｓｕｂｐｉｃ＿ｉｄ＿ｍａｐｐｉｎｇ＿ｐｒｅｓｅｎｔ＿ｆｌａｇは常に符号化されるが、本実施形態のフォーマットではｐｐｓ＿ｎｏ＿ｐｉｃ＿ｐａｒｔｉｔｉｏｎ＿ｆｌａｇが０の時のみ復号する。これにより画像が分割されていない際の符号量を削減したビットストリームを復号する事ができる。 In the conventional method, pps_subpic_id_mapping_present_flag is always encoded, but in the format of this embodiment, it is decoded only when pps_no_pic_partition_flag is 0. As a result, it is possible to decode a bit stream in which the amount of code is reduced when the image is not divided.

分離復号部９０２は、スライス分割情報Ｐとしてｐｐｓ＿ｓｉｎｇｌｅ＿ｓｌｉｃｅ＿ｐｅｒ＿ｓｕｂｐｉｃ＿ｆｌａｇを復号し、０の値を取得する。これにより、複数のスライスを包含するサブピクチャが存在する可能性がある事がわかる。本実施形態のフォーマットでは、ｐｐｓ＿ｎｕｍ＿ｓｕｂｐｉｃｓ＿ｍｉｎｕｓ１が存在し、かつ１以上の時のみ分離復号部９０２がｐｐｓ＿ｓｉｎｇｌｅ＿ｓｌｉｃｅ＿ｐｅｒ＿ｓｕｂｐｉｃ＿ｆｌａｇを復号する構成となっているため条件に合致した際は符号量の削減が可能になる。 The separation / decoding unit 902 decodes pps_single_slice_per_subpic_flag as the slice division information P, and acquires a value of 0. From this, it can be seen that there may be a subpicture containing a plurality of slices. In the format of the present embodiment, since pps_num_subpics_minus1 exists and the separation / decoding unit 902 decodes the pps_single_slice_per_subpic_flag only when the number is 1 or more, the code amount can be reduced when the conditions are met.

分離復号部９０２は次に、スライスヘッダにおいてスライス分割情報ＳＨとしてｓｈ＿ｓｕｂｐｉｃ＿ｉｄを復号する。本実施形態ではｓｈ＿ｓｕｂｐｉｃ＿ｉｄが０のスライスがＳＩＤ＝０〜２のスライスであるため、サブピクチャ０を構成するスライスがＳＩＤ＝０〜２のスライスである事がわかる。ｓｈ＿ｓｕｂｐｉｃ＿ｉｄが１のスライスがＳＩＤ＝３のスライスであるため、サブピクチャ１を構成するスライスがＳＩＤ＝３のスライスである事がわかる。ｓｈ＿ｓｕｂｐｉｃ＿ｉｄが２のスライスがＳＩＤ＝４のスライスであるため、サブピクチャ２を構成するスライスがＳＩＤ＝４のスライスである事がわかる。ｓｈ＿ｓｕｂｐｉｃ＿ｉｄが３のスライスがＳＩＤ＝５〜８及び１０〜１２のスライスであるため、サブピクチャ３を構成するスライスである事がわかる。ｓｈ＿ｓｕｂｐｉｃ＿ｉｄが４のスライスがＳＩＤ＝９及び１３のスライスであるため、サブピクチャ４を構成するスライスがＳＩＤ＝９及び１３のスライスである事がわかる。 Next, the separation / decoding unit 902 decodes sh_subpic_id as the slice division information SH in the slice header. In the present embodiment, since the slice with sh_subpic_id 0 is the slice with SID = 0 to 2, it can be seen that the slice constituting the sub-picture 0 is the slice with SID = 0 to 2. Since the slice with sh_subpic_id of 1 is the slice with SID = 3, it can be seen that the slice constituting the sub-picture 1 is the slice with SID = 3. Since the slice with sh_subpic_id 2 is the slice with SID = 4, it can be seen that the slice constituting the sub-picture 2 is the slice with SID = 4. Since the slice with sh_subpic_id 3 is a slice with SID = 5-8 and 10-12, it can be seen that it is a slice constituting the sub-picture 3. Since the slice with sh_subpic_id 4 is the slice with SID = 9 and 13, it can be seen that the slices constituting the sub-picture 4 are the slices with SID = 9 and 13.

従来はｓｐｓ＿ｓｕｂｐｉｃ＿ｉｎｆｏ＿ｐｒｅｓｅｎｔ＿ｆｌａｇが１ならばサブピクチャの数が１の時にも符号化していたが、本実施形態ではサブピクチャの数が１の時には、サブピクチャのＩＤを０としているため復号する必要がない。これによって符号量を削減したビットストリームを復号する事ができる。 Conventionally, if sps_subpic_info_present_flag is 1, encoding is performed even when the number of subpictures is 1, but in this embodiment, when the number of subpictures is 1, the ID of the subpicture is 0, so there is no need to decode. This makes it possible to decode a bit stream with a reduced amount of code.

これらの復号に必要な情報を取得し分離復号部９０２より導出された分割情報は画像再生部９０５に送られ、Ｓ１１０４で処理しているデータの画像内の位置の特定に用いられる。 The divided information obtained from the information necessary for decoding and derived from the separation / decoding unit 902 is sent to the image reproduction unit 905 and used to specify the position of the data processed by S1104 in the image.

Ｓ１１０２にて、復号部９０３は、Ｓ１１０１で分離された符号データを復号し、量子化係数および予測情報を再生する。 In S1102, the decoding unit 903 decodes the code data separated in S1101 and reproduces the quantization coefficient and the prediction information.

Ｓ１１０３にて、逆量子化・逆変換部９０４は、復号部９０３より入力した量子化係数に対し逆量子化を行って変換係数を得、さらに変換係数に対して逆直交変換を行い、予測誤差を再生する。 In S1103, the inverse quantization / inverse conversion unit 904 performs inverse quantization on the quantization coefficient input from the decoding unit 903 to obtain a conversion coefficient, and further performs inverse orthogonal conversion on the conversion coefficient to predict an error. To play.

Ｓ１１０４にて、画像再生部９０５は、Ｓ１１０３で生成された予測情報や予測画像を再生する。さらに、画像再生部９０５は、再生された予測画像とＳ１１０４で生成された予測誤差から画像データを再生する。そして、画像再生部９０５は、再生された画像データは、Ｓ１１０１で生成された分割情報に基づいて、画像中の適切な位置に合成する。 In S1104, the image reproduction unit 905 reproduces the prediction information and the prediction image generated in S1103. Further, the image reproduction unit 905 reproduces the image data from the reproduced predicted image and the prediction error generated in S1104. Then, the image reproduction unit 905 synthesizes the reproduced image data at an appropriate position in the image based on the division information generated in S1101.

Ｓ１１０５にて、画像復号装置はフレーム内の全ての基本ブロックの復号が終了したか否かの判定を行い、終了していればＳ１１０６に処理を進め、そうでなければ次の基本ブロックの復号のために処理をＳ１１０２に戻す。 In S1105, the image decoding device determines whether or not the decoding of all the basic blocks in the frame is completed, and if it is completed, the process proceeds to S1106, and if not, the decoding of the next basic block is performed. Therefore, the process is returned to S1102.

Ｓ１１０６にて、インループフィルタ部９０７は、Ｓ１１０４で再生された画像データに対し、インループフィルタ処理を行い、フィルタ処理された画像を生成し、処理を終了する。 In S1106, the in-loop filter unit 907 performs in-loop filter processing on the image data reproduced in S1104, generates a filtered image, and ends the processing.

以上の構成と動作により、特にＳ１１０１において、複数のサブピクチャが定義された場合に、最後のサブピクチャの左上の座標を算出することで、サブピクチャに関するシンタクスを効率よく符号化したビットストリームを復号する事ができる。 With the above configuration and operation, especially in S1101, when a plurality of subpictures are defined, the coordinates of the upper left of the last subpicture are calculated to decode the bitstream in which the syntax related to the subpicture is efficiently encoded. Can be done.

なお本実施形態では、画像を９個のタイル、５個のサブピクチャ、１４個のスライスで構成されるものとしたが、これに限定されない。画像は如何様に分割してもよいし、分割しなくてもよい。例えば複数のサブピクチャを定義せず、更に複数のタイル、スライスに分割せず、画像が単一のサブピクチャ、タイル、スライスで構成されてもよい。その場合は、ｓｐｓ＿ｓｕｂｐｉｃ＿ｉｎｆｏ＿ｐｒｅｓｅｎｔ＿ｆｌａｇが１であっても、ｓｐｓ＿ｎｕｍ＿ｓｕｂｐｉｃｓ＿ｍｉｎｕｓ１が０となり、以下のシーケンス・パラメータ・セット及びスライスヘッダのシンタクスを復号する必要がない。
・ｓｐｓ＿ｓｕｂｐｉｃ＿ｉｄ＿ｌｅｎ＿ｍｉｎｕｓ１
・ｓｐｓ＿ｓｕｂｐｉｃ＿ｉｄ＿ｍａｐｐｉｎｇ＿ｅｘｐｌｉｃｉｔｌｙ＿ｓｉｇｎａｌｅｄ＿ｆｌａｇ
・ｓｐｓ＿ｓｕｂｐｉｃ＿ｉｄ＿ｍａｐｐｉｎｇ＿ｐｒｅｓｅｎｔ＿ｆｌａｇ
・ｓｐｓ＿ｓｕｂｐｉｃ＿ｉｄ
・ｓｈ＿ｓｕｂｐｉｃ＿ｉｄ In the present embodiment, the image is composed of 9 tiles, 5 subpictures, and 14 slices, but the image is not limited to this. The image may or may not be divided in any way. For example, an image may be composed of a single subpicture, tile, or slice without defining a plurality of subpictures and further dividing the image into a plurality of tiles or slices. In that case, even if sps_subpic_info_present_flag is 1, sps_num_subpics_minus1 becomes 0, and it is not necessary to decode the following sequence parameter set and slice header syntax.
・ Sps_subpic_id_len_minus1
・ Sps_subpic_id_mapping_explicity_signaled_flag
・ Sps_subpic_id_mapping_present_flag
・ Sps_subpic_id
・ Sh_subpic_id

同様に画像が分割されていないため、ｐｐｓ＿ｎｏ＿ｐｉｃ＿ｐａｒｔｉｔｉｏｎ＿ｆｌａｇが１となり、以下のピクチャ・パラメータ・セットのシンタクスを復号する必要がない。
・ｐｐｓ＿ｓｕｂｐｉｃ＿ｉｄ＿ｍａｐｐｉｎｇ＿ｐｒｅｓｅｎｔ＿ｆｌａｇ
・ｐｐｓ＿ｎｕｍ＿ｓｕｂｐｉｃｓ＿ｍｉｎｕｓ１
・ｐｐｓ＿ｓｕｂｐｉｃ＿ｉｄ＿ｌｅｎ＿ｍｉｎｕｓ１
・ｐｐｓ＿ｓｕｂｐｉｃ＿ｉｄ
以上により不要なシンタクスが符号化されていない、従来よりも符号量が削減されたビットストリームの復号が可能となる。 Similarly, since the image is not divided, pps_no_pic_partition_flag is 1, and it is not necessary to decode the syntax of the following picture parameter set.
・ Pps_subpic_id_mapping_present_flag
・ Pps_num_subpics_minus1
・ Pps_subpic_id_len_minus1
・ Pps_subpic_id
As described above, it is possible to decode a bit stream in which unnecessary syntax is not encoded and the amount of code is reduced as compared with the conventional case.

（その他の実施例）
本発明は、上述の実施形態の１以上の機能を実現するプログラムを、ネットワーク又は記憶媒体を介してシステム又は装置に供給し、そのシステム又は装置のコンピュータにおける１つ以上のプロセッサーがプログラムを読出し実行する処理でも実現可能である。また、１以上の機能を実現する回路（例えば、ＡＳＩＣ）によっても実現可能である。 (Other examples)
The present invention supplies a program that realizes one or more functions of the above-described embodiment to a system or device via a network or storage medium, and one or more processors in the computer of the system or device reads and executes the program. It can also be realized by the processing to be performed. It can also be realized by a circuit (for example, ASIC) that realizes one or more functions.

発明は上記実施形態に制限されるものではなく、発明の精神及び範囲から離脱することなく、様々な変更及び変形が可能である。従って、発明の範囲を公にするために請求項を添付する。 The invention is not limited to the above embodiment, and various modifications and modifications can be made without departing from the spirit and scope of the invention. Therefore, a claim is attached to publicize the scope of the invention.

１００…撮像装置、１０１…ＣＰＵ１０１、１０２…メモリ、１０３…不揮発性メモリ、１０４…操作部、１１１…撮像レンズ、１１２…撮像部、１１３…画像処理部、１１４…符号化処理部、１１５…表示制御部、１１６…表示部、１１７…通信制御部、１１８…通信部、１１９…記録媒体制御部、１２０…記録媒体、１４０…検出部、１３０…内部バス 100 ... image pickup device, 101 ... CPU 101, 102 ... memory, 103 ... non-volatile memory, 104 ... operation unit, 111 ... image pickup lens, 112 ... image pickup unit, 113 ... image processing unit, 114 ... coding processing unit, 115 ... display Control unit, 116 ... Display unit, 117 ... Communication control unit, 118 ... Communication unit, 119 ... Recording medium control unit, 120 ... Recording medium, 140 ... Detection unit, 130 ... Internal bus

Claims

An image coding device that encodes an image composed of N (N> 1) rectangles so that each rectangle can be independently decoded.
A dividing means for dividing an image into N rectangles, and
It has a coding means for encoding information on the position and size of the rectangle.
The coding means is an image coding device that encodes information about the positions of the second to N-1st rectangles and does not encode information about the positions of the first and Nth rectangles.

An image decoding device that decodes a bit stream encoded so that each rectangle can independently decode an image composed of N (N> 1) rectangles.
It has an acquisition means for acquiring the positions and sizes of the N rectangles in block units constituting the plurality of rectangles.
The acquisition means is characterized in that information regarding the positions of N-2 rectangles and the size of N-1 rectangles among the N rectangles is obtained by decoding the bitstream.
The acquisition means calculates information on the position and size of the last (Nth) rectangle among the N rectangles by using the information on the position and size of the N-1 rectangles. An image decoding device characterized by being acquired by.

An image coding method for encoding an image composed of N (N> 1) rectangles so that each rectangle can be independently decoded.
A division step of dividing an image into N rectangles, and
It has a coding step for coding information on the position and size of the rectangle.
The coding step is an image coding method comprising encoding information regarding the positions of the second to N-1th rectangles and not encoding information regarding the positions of the first and Nth rectangles.

An image decoding method for decoding an image composed of N (N> 1) rectangles by decoding a bit stream encoded so that each rectangle can be independently decoded.
It has an acquisition step of acquiring the positions and sizes of the N rectangles in block units constituting the plurality of rectangles.
The acquisition step is characterized in that the bit stream is decoded and acquired with information regarding the positions of N-2 rectangles and the size of N-1 rectangles among the N rectangles.
In the acquisition step, information on the position and size of the last (Nth) rectangle among the N rectangles is calculated using the information on the position and size of the N-1 rectangles. An image decoding method characterized by being acquired by.

A program for causing the computer to execute the process of the method according to claim 3 or 4, by reading and executing the computer.