JP7482536B2

JP7482536B2 - Shape-adaptive discrete cosine transform for geometric partitioning with an adaptive number of regions.

Info

Publication number: JP7482536B2
Application number: JP2021543476A
Authority: JP
Inventors: ボリヴォイェファート，; ハリカルバ，; ヴェリボールアジッチ，
Original assignee: オーピーソリューションズ，エルエルシー
Priority date: 2019-01-28
Filing date: 2020-01-28
Publication date: 2024-05-14
Anticipated expiration: 2040-01-28
Also published as: WO2020159982A1; US20210360246A1; SG11202107974YA; BR112021014671A2; CN113597757A; KR20210118166A; JP2022524916A; EP3918784A4; MX2021009030A; EP3918784A1

Description

本願は、２０１９年１月２８日に出願され「ＳＨＡＰＥＡＤＡＰＴＩＶＥＤＩＳＣＲＥＴＥＣＯＳＩＮＥＴＲＡＮＳＦＯＲＭＦＯＲＧＥＯＭＥＴＲＩＣＰＡＲＴＩＴＩＯＮＩＮＧＷＩＴＨＡＮＡＤＡＰＴＩＶＥＮＵＭＢＥＲＯＦＲＥＧＩＯＮＳ」と題された米国仮特許出願第６２／７９７，７９９号の優先権の利益を主張し、その出願は、参照することによってその全体として本明細書に援用される。 This application claims the benefit of priority to U.S. Provisional Patent Application No. 62/797,799, filed January 28, 2019, and entitled "SHAPE ADAPTIVE DISCRETE COSINE TRANSFORM FOR GEOMETRIC PARTITIONING WITH AN ADAPTIVE NUMBER OF REGIONS," which is incorporated herein by reference in its entirety.

本発明は、概して、ビデオ圧縮の分野に関する。具体的には、本発明は、適応的な数の領域を伴う幾何学的分割のための形状適応離散コサイン変換を対象とする。 The present invention relates generally to the field of video compression. In particular, the present invention is directed to a shape-adaptive discrete cosine transform for geometric partitioning with an adaptive number of regions.

ビデオコーデックは、デジタルビデオを圧縮または解凍する電子回路またはソフトウェアを含み得る。それは、圧縮されていないビデオを圧縮されたフォーマットに変換することができ、逆もまた同様である。ビデオ圧縮の文脈において、ビデオを圧縮する（および／またはそのうちのいくつかの機能を実施する）デバイスは、典型的には、エンコーダと呼ばれ得、ビデオを解凍する（および／またはそのうちのいくつかの機能を実施する）デバイスは、デコーダと呼ばれ得る。 A video codec may include electronic circuitry or software that compresses or decompresses digital video. It can convert uncompressed video to a compressed format, or vice versa. In the context of video compression, a device that compresses (and/or performs some of the functions of) the video may typically be called an encoder, and a device that decompresses (and/or performs some of the functions of) the video may be called a decoder.

圧縮されたデータのフォーマットは、標準的なビデオ圧縮仕様に適合することができる。圧縮は、圧縮されたビデオが元のビデオの中に存在するある情報を欠く点で非可逆的であり得る。この結果は、元のビデオを正確に再構築するために不十分な情報しか存在しないので、解凍されたビデオが元の圧縮されていないビデオより低い品質を有し得ることを含み得る。 The format of the compressed data can conform to standard video compression specifications. The compression can be lossy, in that the compressed video lacks certain information present in the original video. Consequences of this can include that the decompressed video may have lower quality than the original uncompressed video, because insufficient information exists to exactly reconstruct the original video.

ビデオ品質と、ビデオを表現するために使用される（例えば、ビットレートによって決定される）データ量と、エンコーディングアルゴリズムおよびデコーディングアルゴリズムの複雑性と、データ損失ならびに誤差に対する感度と、編集のし易さと、ランダムアクセスと、エンドツーエンド遅延（例えば、待機時間）と、同等物との間に、複雑な関係が存在し得る。 There can be a complex relationship between video quality, the amount of data used to represent the video (e.g., determined by the bit rate), the complexity of the encoding and decoding algorithms, sensitivity to data loss and errors, ease of editing, random access, end-to-end delay (e.g., latency), and the like.

ある側面では、デコーダは、回路を含み、回路は、ビットストリームを受信することと、幾何学的分割モードに従って、カレントブロックの第１の領域、第２の領域、および第３の領域を決定することと、第１の領域、第２の領域、および第３の領域毎に逆離散コサイン変換を使用して、カレントブロックをデコードすることとを行うように構成される。 In one aspect, the decoder includes a circuit configured to receive a bitstream, determine a first region, a second region, and a third region of a current block according to a geometric partitioning mode, and decode the current block using an inverse discrete cosine transform for each of the first region, the second region, and the third region.

別の側面では、デコーダは、回路を含み、回路は、ビットストリームを受信することと、幾何学的分割モードに従って、カレントブロックの第１の領域、第２の領域、および第３の領域を決定することと、ビットストリーム内に含有される信号から、第１の領域、第２の領域、および／または第３の領域の各々をデコードするためのコーディング変換則タイプを決定することであって、コーディング変換則タイプは、少なくとも逆ブロック離散コサイン変換および逆形状適応離散コサイン変換を特徴付ける、ことと、カレントブロックをデコードすることであって、カレントブロックのデコーディングは、第１の領域、第２の領域、および／または第３の領域毎に逆変換に関する決定された変換則タイプを使用することを含む、こととを行うように構成される。 In another aspect, a decoder includes a circuit configured to receive a bitstream; determine a first region, a second region, and a third region of a current block according to a geometric partitioning mode; determine a coding transform law type for decoding each of the first region, the second region, and/or the third region from a signal contained in the bitstream, the coding transform law type characterizing at least an inverse block discrete cosine transform and an inverse shape adaptive discrete cosine transform; and decode the current block, the decoding of the current block including using the determined transform law type for the inverse transform for each of the first region, the second region, and/or the third region.

別の側面では、方法は、デコーダが、ビットストリームを受信することと、幾何学的分割モードに従って、カレントブロックの第１の領域、第２の領域、および第３の領域を決定することと、ビットストリーム内に含有される信号から、第１の領域、第２の領域、および／または第３の領域をデコードするためのコーディング変換則タイプを決定することであって、コーディング変換則タイプは、少なくとも逆ブロック離散コサイン変換または逆形状適応離散コサイン変換を特徴付ける、ことと、カレントブロックをデコードすることであって、カレントブロックのデコーディングは、第１の領域、第２の領域、および／または第３の領域毎に逆変換に関する決定された変換則タイプを使用することを含む、こととを含む。 In another aspect, a method includes a decoder receiving a bitstream; determining a first region, a second region, and a third region of a current block according to a geometric partitioning mode; determining a coding transform law type for decoding the first region, the second region, and/or the third region from a signal contained in the bitstream, the coding transform law type characterizing at least an inverse block discrete cosine transform or an inverse shape adaptive discrete cosine transform; and decoding the current block, the decoding of the current block including using the determined transform law type for the inverse transform for each of the first region, the second region, and/or the third region.

本明細書に説明される主題の１つ以上の変形例の詳細が、付随の図面および下記の説明に記載される。本明細書に説明される主題の他の特徴および利点が、説明および図面から、ならびに請求項から明白となる。
本発明は、例えば、以下の項目を提供する。
（項目１）
デコーダであって、前記デコーダは、回路を備え、前記回路は、
ビットストリームを受信することと、
幾何学的分割モードに従って、カレントブロックの第１の領域、第２の領域、および第３の領域を決定することと、
前記第１の領域、前記第２の領域、および前記第３の領域毎に、逆離散コサイン変換を使用して、前記カレントブロックをデコードすることと
を行うように構成される、デコーダ。
（項目２）
前記カレントブロックは、１２８×１２８または６４×６４のサイズを有する、項目１に記載のデコーダ。
（項目３）
前記第１の領域、前記第２の領域、および／または前記第３の領域の逆変換に関する係数の数が、前記ビットストリーム内で信号伝達される、項目１に記載のデコーダ。
（項目４）
前記幾何学的分割モードが有効化されるかどうかを決定することと、
前記カレントブロックに関する第１の線セグメントを決定することと、
前記カレントブロックに関する第２の線セグメントを決定することと
を行うようにさらに構成され、
前記カレントブロックのデコーディングは、前記第１の線セグメントおよび前記第２の線セグメントを使用して、ピクセルデータを再構築することを含み、
前記第１の線セグメントおよび前記第２の線セグメントは、前記カレントブロックを前記第１の領域、前記第２の領域、および前記第３の領域に分割する、項目１に記載のデコーダ。
（項目５）
前記第１の線セグメントは、前記第１の領域を特徴付け、前記第２の線セグメントは、前記第２の領域および前記第３の領域を特徴付ける、項目４に記載のデコーダ。
（項目６）
ピクセルデータを再構築することは、前記ビットストリーム内に含有された関連付けられる動きベクトルを使用して、前記第１の領域に関する予測子を算出することを含む、項目４に記載のデコーダ。
（項目７）
前記ビットストリームを受信し、前記ビットストリームを量子化された係数にデコードするように構成されるエントロピーデコーダプロセッサと、
前記決定されたコーディング変換則タイプに従って、逆離散コサインを実施することを含め、前記量子化された係数を処理するように構成される逆量子化および逆変換プロセッサと、
デブロッキングフィルタと、
フレームバッファと、
イントラ予測プロセッサと
をさらに備える、項目１に記載のデコーダ。
（項目８）
前記ビットストリームは、幾何学的分割モードが前記カレントブロックに関して有効化されるかどうかを示すパラメータを含む、項目１に記載のデコーダ。
（項目９）
前記カレントブロックは、クアッドツリープラスバイナリディシジョンツリーの一部を形成する、項目１に記載のデコーダ。
（項目１０）
前記カレントブロックは、前記クアッドツリープラスバイナリディシジョンツリーの非リーフノードである、項目１に記載のデコーダ。
（項目１１）
前記カレントブロックは、コーディングツリーユニットまたはコーディングユニットである、項目１に記載のデコーダ。
（項目１２）
前記第１の領域は、コーディングユニットまたは予測ユニットである、項目１に記載のデコーダ。
（項目１３）
デコーダであって、前記デコーダは、回路を備え、前記回路は、
ビットストリームを受信することと、
幾何学的分割モードに従って、カレントブロックの第１の領域、第２の領域、および第３の領域を決定することと、
前記ビットストリーム内に含有される信号から、前記第１の領域、前記第２の領域、および／または前記第３の領域の各々をデコードするためのコーディング変換則タイプを決定することであって、前記コーディング変換則タイプは、少なくとも逆ブロック離散コサイン変換および逆形状適応離散コサイン変換を特徴付ける、ことと、
前記カレントブロックをデコードすることであって、前記カレントブロックのデコーディングは、前記第１の領域、前記第２の領域、および／または前記第３の領域毎に逆変換に関する前記決定された変換則タイプを使用することを含む、ことと
を行うように構成される、デコーダ。
（項目１４）
前記カレントブロックは、１２８×１２８または６４×６４のサイズを有する、項目１３に記載のデコーダ。
（項目１５）
前記第１の領域、前記第２の領域、および／または前記第３の領域の逆変換に関する係数の数が、前記ビットストリーム内で信号伝達される、項目１３に記載のデコーダ。
（項目１６）
前記幾何学的分割モードが有効化されるかどうかを決定することと、
前記カレントブロックに関する第１の線セグメントを決定することと、
前記カレントブロックに関する第２の線セグメントを決定することと
を行うようにさらに構成され、
前記カレントブロックのデコーディングは、前記第１の線セグメントおよび前記第２の線セグメントを使用して、ピクセルデータを再構築することを含み、
前記第１の線セグメントおよび前記第２の線セグメントは、前記カレントブロックを前記第１の領域、前記第２の領域、および前記第３の領域に分割する、項目１３に記載のデコーダ。
（項目１７）
前記第１の線セグメントは、前記第１の領域を特徴付け、前記第２の線セグメントは、前記第２の領域および前記第３の領域を特徴付ける、項目１６に記載のデコーダ。
（項目１８）
ピクセルデータを再構築することは、前記ビットストリーム内に含有された関連付けられる動きベクトルを使用して、前記第１の領域に関する予測子を算出することを含む、項目１６に記載のデコーダ。
（項目１９）
前記ビットストリームを受信し、前記ビットストリームを量子化された係数にデコードするように構成されるエントロピーデコーダプロセッサと、
前記決定されたコーディング変換則タイプに従って逆離散コサインを実施することを含め、前記量子化された係数を処理するように構成される逆量子化および逆変換プロセッサと、
デブロッキングフィルタと、
フレームバッファと、
イントラ予測プロセッサと
をさらに備える、項目１３に記載のデコーダ。
（項目２０）
前記ビットストリームは、幾何学的分割モードが前記カレントブロックに関して有効化されるかどうかを示すパラメータを含む、項目１３に記載のデコーダ。
（項目２１）
前記カレントブロックは、クアッドツリープラスバイナリディシジョンツリーの一部を形成する、項目１３に記載のデコーダ。
（項目２２）
前記カレントブロックは、前記クアッドツリープラスバイナリディシジョンツリーの非リーフノードである、項目１３に記載のデコーダ。
（項目２３）
前記カレントブロックは、コーディングツリーユニットまたはコーディングユニットである、項目１３に記載のデコーダ。
（項目２４）
前記第１の領域は、コーディングユニットまたは予測ユニットである、項目１３に記載のデコーダ。
（項目２５）
方法であって、前記方法は、
デコーダによって、ビットストリームを受信することと、
幾何学的分割モードに従って、カレントブロックの第１の領域、第２の領域、および第３の領域を決定することと、
前記ビットストリーム内に含有される信号から、前記第１の領域、前記第２の領域、および／または前記第３の領域をデコードするためのコーディング変換則タイプを決定することであって、前記コーディング変換則タイプは、少なくとも逆ブロック離散コサイン変換または逆形状適応離散コサイン変換を特徴付ける、ことと、
前記カレントブロックをデコードすることであって、前記カレントブロックのデコーディングは、前記第１の領域、前記第２の領域、および／または前記第３の領域毎に逆変換に関する前記決定された変換則タイプを使用することを含む、ことと
を含む、方法。
（項目２６）
前記カレントブロックは、１２８×１２８または６４×６４のサイズを有する、項目２５に記載の方法。
（項目２７）
前記第１の領域、前記第２の領域、および／または前記第３の領域の逆変換に関する係数の数が、前記ビットストリーム内で信号伝達される、項目２５に記載の方法。
（項目２８）
前記デコーダが、前記幾何学的分割モードが有効化されるかどうかを決定することと、
前記デコーダが、前記カレントブロックに関する第１の線セグメントを決定することと、
前記デコーダが、前記カレントブロックに関する第２の線セグメントを決定することと
をさらに含み、
前記カレントブロックのデコーディングは、前記第１の線セグメントおよび前記第２の線セグメントを使用して、ピクセルデータを再構築することを含み、
前記第１の線セグメントおよび前記第２の線セグメントは、前記カレントブロックを前記第１の領域、前記第２の領域、および前記第３の領域に分割する、項目２５に記載の方法。
（項目２９）
前記第１の線セグメントは、前記第１の領域を特徴付け、前記第２の線セグメントは、前記第２の領域および前記第３の領域を特徴付ける、項目２８に記載の方法。
（項目３０）
ピクセルデータを再構築することは、前記ビットストリーム内に含有された関連付けられる動きベクトルを使用して、前記第１の領域に関する予測子を算出することを含む、項目２８に記載の方法。
（項目３１）
前記デコーダは、
前記ビットストリームを受信し、前記ビットストリームを量子化された係数にデコードするように構成されるエントロピーデコーダプロセッサと、
前記決定されたコーディング変換則タイプに従って逆離散コサインを実施することを含め、前記量子化された係数を処理するように構成される逆量子化および逆変換プロセッサと、
デブロッキングフィルタと、
フレームバッファと、
イントラ予測プロセッサと
を備える、項目２５に記載の方法。
（項目３２）
前記ビットストリームは、ブロックレベル幾何学的分割モードが前記ブロックに関して有効化されるかどうかを示すパラメータを含む、項目２５に記載の方法。
（項目３３）
前記カレントブロックは、クアッドツリープラスバイナリディシジョンツリーの一部を形成する、項目２５に記載の方法。
（項目３４）
前記カレントブロックは、前記クアッドツリープラスバイナリディシジョンツリーの非リーフノードである、項目２５に記載の方法。
（項目３５）
前記カレントブロックは、コーディングツリーユニットまたはコーディングユニットである、項目２５に記載の方法。
（項目３６）
前記第１の領域は、コーディングユニットまたは予測ユニットである、項目２５に記載の方法。 The details of one or more variations of the subject matter described herein are set forth in the accompanying drawings and the description below. Other features and advantages of the subject matter described herein will be apparent from the description and drawings, and from the claims.
The present invention provides, for example, the following items.
(Item 1)
10. A decoder, the decoder comprising a circuit, the circuit comprising:
Receiving a bitstream;
determining a first region, a second region, and a third region of the current block according to a geometric partitioning mode;
decoding the current block using an inverse discrete cosine transform for each of the first region, the second region, and the third region;
A decoder configured to:
(Item 2)
2. The decoder of claim 1, wherein the current block has a size of 128x128 or 64x64.
(Item 3)
2. The decoder of claim 1, wherein a number of coefficients for an inverse transform of the first region, the second region, and/or the third region are signaled in the bitstream.
(Item 4)
determining whether the geometric partitioning mode is enabled; and
determining a first line segment for the current block;
determining a second line segment for the current block;
[0023] 20. The method according to claim 1, further comprising:
decoding the current block includes reconstructing pixel data using the first line segment and the second line segment;
2. The decoder of claim 1, wherein the first line segment and the second line segment divide the current block into the first region, the second region, and the third region.
(Item 5)
5. The decoder of claim 4, wherein the first line segment characterizes the first region and the second line segment characterizes the second region and the third region.
(Item 6)
5. The decoder of claim 4, wherein reconstructing pixel data includes calculating a predictor for the first region using an associated motion vector contained in the bitstream.
(Item 7)
an entropy decoder processor configured to receive the bitstream and decode the bitstream into quantized coefficients;
an inverse quantization and inverse transform processor configured to process the quantized coefficients, including performing an inverse discrete cosine transformation, in accordance with the determined coding transform law type;
A deblocking filter;
A frame buffer;
Intra prediction processor
2. The decoder of claim 1, further comprising:
(Item 8)
2. The decoder of claim 1, wherein the bitstream includes a parameter indicating whether a geometric partitioning mode is enabled for the current block.
(Item 9)
2. The decoder of claim 1, wherein the current block forms part of a quad tree plus a binary decision tree.
(Item 10)
2. The decoder of claim 1, wherein the current block is a non-leaf node of the quadtree plus binary decision tree.
(Item 11)
2. The decoder of claim 1, wherein the current block is a coding tree unit or a coding unit.
(Item 12)
2. The decoder of claim 1, wherein the first region is a coding unit or a prediction unit.
(Item 13)
10. A decoder, the decoder comprising a circuit, the circuit comprising:
Receiving a bitstream;
determining a first region, a second region, and a third region of the current block according to a geometric partitioning mode;
determining a coding transform law type for decoding each of the first region, the second region, and/or the third region from a signal contained within the bitstream, the coding transform law type characterizing at least an inverse block discrete cosine transform and an inverse shape adaptive discrete cosine transform;
decoding the current block, wherein decoding the current block includes using the determined transformation rule type for an inverse transformation for each of the first region, the second region, and/or the third region;
A decoder configured to:
(Item 14)
14. The decoder of claim 13, wherein the current block has a size of 128x128 or 64x64.
(Item 15)
14. The decoder of claim 13, wherein a number of coefficients for an inverse transform of the first region, the second region, and/or the third region are signaled in the bitstream.
(Item 16)
determining whether the geometric partitioning mode is enabled; and
determining a first line segment for the current block;
determining a second line segment for the current block;
[0023] 20. The method according to claim 1, further comprising:
decoding the current block includes reconstructing pixel data using the first line segment and the second line segment;
14. The decoder of claim 13, wherein the first line segment and the second line segment divide the current block into the first region, the second region, and the third region.
(Item 17)
17. The decoder of claim 16, wherein the first line segment characterizes the first region and the second line segment characterizes the second region and the third region.
(Item 18)
17. The decoder of claim 16, wherein reconstructing pixel data includes calculating a predictor for the first region using an associated motion vector contained in the bitstream.
(Item 19)
an entropy decoder processor configured to receive the bitstream and decode the bitstream into quantized coefficients;
an inverse quantization and inverse transform processor configured to process the quantized coefficients, including performing an inverse discrete cosine in accordance with the determined coding transform law type;
A deblocking filter;
A frame buffer;
Intra prediction processor
Item 14. The decoder of item 13, further comprising:
(Item 20)
14. The decoder of claim 13, wherein the bitstream includes a parameter indicating whether a geometric partitioning mode is enabled for the current block.
(Item 21)
14. The decoder of claim 13, wherein the current block forms part of a quad tree plus a binary decision tree.
(Item 22)
14. The decoder of claim 13, wherein the current block is a non-leaf node of the quad tree plus binary decision tree.
(Item 23)
Item 14. The decoder of item 13, wherein the current block is a coding tree unit or a coding unit.
(Item 24)
Item 14. The decoder of item 13, wherein the first region is a coding unit or a prediction unit.
(Item 25)
1. A method, comprising:
receiving, by a decoder, a bitstream;
determining a first region, a second region, and a third region of the current block according to a geometric partitioning mode;
determining a coding transform law type for decoding the first region, the second region, and/or the third region from a signal contained in the bitstream, the coding transform law type characterizing at least an inverse block discrete cosine transform or an inverse shape adaptive discrete cosine transform;
decoding the current block, wherein decoding the current block includes using the determined transformation rule type for an inverse transformation for each of the first region, the second region, and/or the third region;
A method comprising:
(Item 26)
26. The method of claim 25, wherein the current block has a size of 128x128 or 64x64.
(Item 27)
26. The method of claim 25, wherein a number of coefficients for an inverse transform of the first region, the second region, and/or the third region are signaled in the bitstream.
(Item 28)
determining, by the decoder, whether the geometric partitioning mode is enabled;
the decoder determining a first line segment for the current block;
the decoder determining a second line segment for the current block;
Further comprising:
decoding the current block includes reconstructing pixel data using the first line segment and the second line segment;
26. The method of claim 25, wherein the first line segment and the second line segment divide the current block into the first region, the second region, and the third region.
(Item 29)
29. The method of claim 28, wherein the first line segment characterizes the first region and the second line segment characterizes the second region and the third region.
(Item 30)
30. The method of claim 28, wherein reconstructing pixel data includes calculating a predictor for the first region using an associated motion vector contained in the bitstream.
(Item 31)
The decoder comprises:
an entropy decoder processor configured to receive the bitstream and decode the bitstream into quantized coefficients;
an inverse quantization and inverse transform processor configured to process the quantized coefficients, including performing an inverse discrete cosine in accordance with the determined coding transform law type;
A deblocking filter;
A frame buffer;
Intra prediction processor
26. The method of claim 25, comprising:
(Item 32)
26. The method of claim 25, wherein the bitstream includes a parameter indicating whether a block-level geometric partitioning mode is enabled for the block.
(Item 33)
26. The method of claim 25, wherein the current block forms part of a quad tree plus a binary decision tree.
(Item 34)
26. The method of claim 25, wherein the current block is a non-leaf node of the quad tree plus binary decision tree.
(Item 35)
26. The method of claim 25, wherein the current block is a coding tree unit or a coding unit.
(Item 36)
26. The method of claim 25, wherein the first region is a coding unit or a prediction unit.

本発明を例証する目的のために、図面は、本発明の１つ以上の実施形態の側面を示す。しかしながら、本発明が図面に示される精密な配列および手段に限定されないことを理解されたい。 For the purpose of illustrating the invention, the drawings show aspects of one or more embodiments of the invention. It should be understood, however, that the invention is not limited to the precise arrangements and instrumentalities shown in the drawings.

図１は、異なる予測誤差を有する３つのセグメントが存在する、指数関数的分割を用いた残差ブロック（例えば、カレントブロック）のある例を示す例証である。FIG. 1 is an illustration showing an example of a residual block (eg, a current block) with exponential partitioning, where there are three segments with different prediction errors.

図２は、ビデオエンコーディングおよびデコーディングに関する複雑性および処理性能を改良することができる、適応的な数の領域を伴う幾何学的分割のための形状適応離散コサイン変換（ＳＡ－ＤＣＴ）が可能な例示的ビデオエンコーダを例証するシステムブロック図である。FIG. 2 is a system block diagram illustrating an example video encoder capable of shape-adaptive discrete cosine transform (SA-DCT) for geometric partitioning with an adaptive number of regions, which can improve complexity and processing performance for video encoding and decoding.

図３は、適応的な数の領域を伴う幾何学的分割のためにＳＡ－ＤＣＴを用いてビデオをエンコードする例示的プロセスを例証する、プロセスフロー図である。FIG. 3 is a process flow diagram illustrating an exemplary process for encoding video using SA-DCT for geometric partitioning with an adaptive number of regions.

図４は、適応的な数の領域を伴う幾何学的分割のためにＳＡ－ＤＣＴを使用してビットストリームをデコードすることが可能なある例示的デコーダを例証するシステムブロック図である。FIG. 4 is a system block diagram illustrating an example decoder capable of decoding a bitstream using SA-DCT for a geometric partition with an adaptive number of regions.

図５は、適応的な数の領域を伴う幾何学的分割のためにＳＡ－ＤＣＴを使用してビットストリームをデコードするある例示的プロセスを例証するプロセスフロー図である。FIG. 5 is a process flow diagram illustrating one example process for decoding a bitstream using SA-DCT for a geometric partition with an adaptive number of regions.

図６は、本明細書に下位支持される方法のうちのいずれか１つ以上、およびそのいずれか１つ以上の部分を実装するために使用され得るコンピューティングシステムのブロック図である。FIG. 6 is a block diagram of a computing system that may be used to implement any one or more of the methods, or any one or more portions thereof, described herein.

図面は、必ずしも縮尺通りではなく、想像線、図式表現、および部分図によって例証され得る。ある事例では、実施形態の理解のためには必要ではないか、または他の詳細を知覚困難にする詳細が、省略されている場合がある。種々の図面内の同様の参照記号は、同様の要素を示す。 The drawings are not necessarily to scale and may be illustrated by phantom lines, schematic representations, and partial views. In some instances, details that are not necessary for an understanding of the embodiments or that make other details difficult to perceive may be omitted. Like reference symbols in the various drawings indicate like elements.

本開示に提示される実施形態は、全てのブロックが必ずしも長方形ではない幾何学的分割においてブロックをエンコードおよびデコードすることに関する。実施形態は、離散コサイン変換（ＤＣＴ）および／または逆ＤＣＴを使用して、エンコーディングおよび／またはデコーディングを実施することを含み、かつ／または実施するように構成され得る。本明細書に提示されるいくつかの実施形態では、ＤＣＴの選定は、幾何学的に分割されたブロック内の情報コンテンツの関数として行われる。いくつかの既存のビデオエンコーディングおよびデコーディングアプローチでは、全てのブロックは、長方形であり、残差は、長方形ブロック全体に関する通常のブロックＤＣＴ（Ｂ－ＤＣＴ）を使用してエンコードされる。しかしながら、ブロックが複数の非長方形領域に分割され得る幾何学的分割では、通常のＢ－ＤＣＴの使用は、いくつかのブロックに関する基礎的ピクセル情報を非効率的に表し得、実施するために不必要な演算資源を要求し得る。本主題のいくつかの実装では、幾何学的分割モードを使用するとき、エンコーダは、Ｂ－ＤＣＴの代替として、またはそれに加えて、形状適応ＤＣＴ（ＳＡ－ＤＣＴ）を使用し得る。いくつかの実施形態では、エンコーダは、幾何学的に分割されたブロック等のブロックの領域毎に、その領域の予測誤差のレベルに基づいて、Ｂ－ＤＣＴとＳＡ－ＤＣＴとの間で選択し得、選択は、デコーディングにおける使用のためにビットストリーム内で信号伝達され得る。Ｂ－ＤＣＴまたはＳＡ－ＤＣＴのいずれかを使用して非長方形領域をエンコードおよび／またはデコードし、そのような選択を信号伝達することによって、残差がより効率的に表現され得るので、ビットストリームにおける伝送のビットレートは、低減され得、処理を実施するために要求される演算資源は、結果として低減され得る。本主題は、例えば、１２８×１２８または６４×６４のサイズを有するブロック等の比較的大きいブロックに適用可能であり得る。いくつかの実装では、幾何学的分割は、カレントブロックを適応的な数の領域（所与のカレントブロックに対して３つ以上の領域等）に分割することを伴い得、ＤＣＴ変換タイプ（例えば、Ｂ－ＤＣＴまたはＳＡ－ＤＣＴ）は、領域毎に信号伝達され得る。 The embodiments presented in this disclosure relate to encoding and decoding blocks in a geometric partitioning where not all blocks are necessarily rectangular. The embodiments may include and/or be configured to perform the encoding and/or decoding using a discrete cosine transform (DCT) and/or an inverse DCT. In some embodiments presented herein, the choice of DCT is made as a function of the information content within the geometrically partitioned block. In some existing video encoding and decoding approaches, all blocks are rectangular and the residual is encoded using a regular block DCT (B-DCT) on the entire rectangular block. However, in a geometric partitioning where a block may be partitioned into multiple non-rectangular regions, the use of a regular B-DCT may represent the underlying pixel information for some blocks inefficiently and may require unnecessary computational resources to implement. In some implementations of the present subject matter, when using a geometric partitioning mode, the encoder may use a shape-adaptive DCT (SA-DCT) as an alternative to or in addition to the B-DCT. In some embodiments, the encoder may select between B-DCT and SA-DCT for each region of a block, such as a geometrically partitioned block, based on the level of prediction error for that region, and the selection may be signaled in the bitstream for use in decoding. By encoding and/or decoding a non-rectangular region using either B-DCT or SA-DCT and signaling such a selection, the bit rate of transmission in the bitstream may be reduced, since the residual may be represented more efficiently, and the computational resources required to perform the processing may be reduced as a result. The present subject matter may be applicable to relatively large blocks, such as blocks having a size of 128x128 or 64x64. In some implementations, the geometric partitioning may involve dividing the current block into an adaptive number of regions (such as three or more regions for a given current block), and the DCT transform type (e.g., B-DCT or SA-DCT) may be signaled for each region.

ある実施形態では、Ｂ－ＤＣＴは、限定ではないが、ピクセルの対応するＮ×Ｎアレイのクロマ値および／またはルマ値等の数値のＮ×Ｎブロックに対してＮ×Ｎ可逆行列を使用して実施されるＤＣＴであり得る。例えば、非限定的な例として、Ｎ×Ｎ行列Ｘが変換されるべきである場合、「ＤＣＴ－Ｉ」変換が、以下のように、変換される行列の各要素を算出し得る。

ここで、ｋ＝０，．．．，Ｎ－１である。さらなる非限定的な例として、「ＤＣＴ－ＩＩ」変換が、以下のように、変換される行列値を算出し得る。

ここで、ｋ＝０，．．．，Ｎ－１である。例証的な例として、ブロックが４×４ピクセルのブロックである場合、一般化された離散コサイン変換行列は、以下の形態をとる一般化された離散コサイン変換ＩＩ行列を含み得る。

式中、ａは、１／２であり、ｂは、

であり、ｃは、

である。 In an embodiment, the B-DCT may be a DCT performed using an N×N invertible matrix on an N×N block of values, such as, but not limited to, the chroma and/or luma values of a corresponding N×N array of pixels. For example, as a non-limiting example, if an N×N matrix X is to be transformed, a "DCT-I" transform may calculate each element of the matrix to be transformed as follows:

where k=0,...,N-1. As a further non-limiting example, a "DCT-II" transform may calculate transformed matrix values as follows:

where k=0,...,N-1. As an illustrative example, if the block is a 4x4 pixel block, the generalized discrete cosine transform matrix may include a generalized discrete cosine transform II matrix taking the form:

In the formula, a is 1/2 and b is

and c is

It is.

いくつかの実装では、効率的なハードウェアおよびソフトウェア実装のために使用され得る変換行列の整数近似が、利用され得る。例えば、ブロックが４×４ピクセルのブロックである場合、一般化された離散コサイン変換行列は、以下の形態をとる一般化された離散コサイン変換ＩＩ行列を含み得る。
In some implementations, integer approximations of the transform matrix may be utilized that may be used for efficient hardware and software implementations. For example, if the block is a 4×4 pixel block, the generalized discrete cosine transform matrix may include a generalized discrete cosine transform II matrix of the form:

逆Ｂ－ＤＣＴが、同一のＮ×Ｎ変換行列を使用して、第２の行列乗算によって算出され得、結果として生じる出力が、元の値を復元するために正規化され得る。例えば、逆ＤＣＴ－Ｉが、正規化のために

で乗算され得る。 The inverse B-DCT may be computed by a second matrix multiplication using the same N×N transform matrix, and the resulting output may be normalized to restore the original values. For example, the inverse DCT-I may be

It can be multiplied by

ＳＡ－ＤＣＴが、ピクセルの非長方形アレイに対して実施され得る。ある実施形態では、ＳＡ－ＤＣＴは、ＤＣＴ－Ｉ、ＤＣＴ－ＩＩ、または同等物等のＤＣＴの一次元バージョンを着目形状におけるピクセル値の垂直列を表すベクトルに対して実施することによって算出され、続いて、結果として生じる値が、水平ベクトルに群化され、２回目の一次元ＤＣＴを受け得、２回目のＤＣＴは、ピクセル値の完了された変換をもたらし得る。ＳＡ－ＤＣＴの変形例はさらに、上記の変換、上記の変換の出力の量子化、ならびに／または変換出力および／もしくは量子化された変換出力の反転によって導入される平均加重欠陥および／または非正規直交欠陥を補正するための係数によってスケーリングおよび／または正規化し得る。さらなる補正が、限定ではないが、潜在的に、変換、量子化、および／もしくは逆変換の前ならびに／または後に適用されるスケーリングプロセスのうちの一方または他方と組み合わせて、上記のＳＡ－ＤＣＴプロセスに先行して、各ピクセル値またはそのスケーリングされたバージョンから対象画像領域の個々の平均値を減算することによって実施され得る。当業者は、本開示の全体を精査することで、上記の説明に一貫して適用され得るＳＡ－ＤＣＴプロセスに対する種々の代替的または付加的な変形例を認識するであろう。 The SA-DCT may be performed on non-rectangular arrays of pixels. In an embodiment, the SA-DCT may be calculated by performing a one-dimensional version of the DCT, such as DCT-I, DCT-II, or equivalent, on vectors representing vertical columns of pixel values in the shape of interest, followed by grouping the resulting values into horizontal vectors and subjecting them to a second one-dimensional DCT, which may result in a completed transformation of the pixel values. Variations of the SA-DCT may further scale and/or normalize by a factor to correct mean-weighted and/or non-orthonormal defects introduced by the above transformation, quantization of the output of the above transformation, and/or inversion of the transformed output and/or the quantized transformed output. Further correction may be performed by subtracting the individual average value of the target image region from each pixel value, or a scaled version thereof, prior to the SA-DCT process described above, potentially in combination with one or other of the scaling processes applied before and/or after the transformation, quantization, and/or inverse transformation. Those skilled in the art will recognize, upon review of this disclosure in its entirety, various alternative or additional modifications to the SA-DCT process that may be applied consistently with the above description.

動き補償は、現在、以前、および／もしくは将来のフレームを含み、かつ／またはそれによって表されるビデオにおけるカメラおよび／またはオブジェクトの動きを考慮することによって、以前および／または将来のフレームを前提として、ビデオフレームまたはその一部を予測するためのアプローチを含み得る。動き補償は、ビデオ圧縮のためのビデオデータのエンコーディングおよびデコーディングにおいて、例えば、動画専門家集団（ＭＰＥＧ）－２（アドバンスドビデオコーディング（ＡＶＣ）とも称される）規格を使用するエンコーディングおよびデコーディングにおいて採用され得る。動き補償は、参照ピクチャのカレントピクチャへの変換の観点からピクチャを記述し得る。参照ピクチャは、カレントピクチャと比較したとき、時間的に以前のもの、または将来からのものであり得る。画像が、以前に伝送および／または記憶された画像から正確に合成されることができると、圧縮効率は、改良されることができる。 Motion compensation may include an approach for predicting a video frame or a portion thereof given previous and/or future frames by considering the motion of the camera and/or objects in the video including and/or represented by the current, previous, and/or future frames. Motion compensation may be employed in encoding and decoding video data for video compression, for example, in encoding and decoding using the Moving Picture Experts Group (MPEG)-2 (also referred to as Advanced Video Coding (AVC)) standard. Motion compensation may describe a picture in terms of the transformation of a reference picture into the current picture. The reference picture may be from a time earlier or future as compared to the current picture. Compression efficiency can be improved when pictures can be accurately synthesized from previously transmitted and/or stored pictures.

本開示に使用されるようなブロック分割は、同様の動きの領域を見出すためのビデオコーディングにおける方法を指し得る。ある形態のブロック分割が、ＭＰＥＧ－２、Ｈ．２６４（ＡＶＣまたはＭＰＥＧ－４Ｐａｒｔ１０とも称される）、およびＨ．２６５（高効率ビデオコーディング（ＨＥＶＣ）とも称される）を含む、ビデオコーデック規格において見出されることができる。例示的ブロック分割アプローチでは、同様の動きを有するピクセルを含有するブロック分割を見出すために、ビデオフレームの非重複ブロックが、長方形サブブロックに分割され得る。このアプローチは、ブロック分割の全てのピクセルが同様の動きを有するときに良好に機能し得る。ブロック内のピクセルの動きは、以前にコーディングされたフレームに対して決定され得る。 Block partitioning as used in this disclosure may refer to a method in video coding for finding regions of similar motion. Some forms of block partitioning can be found in video codec standards, including MPEG-2, H.264 (also referred to as AVC or MPEG-4 Part 10), and H.265 (also referred to as High Efficiency Video Coding (HEVC)). In an exemplary block partitioning approach, non-overlapping blocks of a video frame may be divided into rectangular sub-blocks to find block partitions containing pixels with similar motion. This approach may work well when all pixels of the block partition have similar motion. The motion of pixels within a block may be determined relative to a previously coded frame.

形状適応ＤＣＴおよび／またはＢ－ＤＣＴが、適応的な数の領域を有する幾何学的分割において効果的に使用され得る。図１は、異なる予測誤差を有する３つのセグメントＳ０、Ｓ１、およびＳ２が存在する幾何学的分割を用いた６４×６４または１２８×１２８のサイズの残差ブロック（例えば、カレントブロック）１００の非限定的な例を示す例証であるが、３つのセグメントは、例示目的のために図１に例証され、代替として、または加えて、より多いかまたはより少ない数のセグメントが採用され得る。カレントブロックは、２つの線セグメント（Ｐ１Ｐ２およびＰ３Ｐ４）に従って幾何学的に分割され得、これは、カレントブロックを３つの領域Ｓ０、Ｓ１、およびＳ２に分け得る。この例では、Ｓ０は、比較的高い予測誤差を有し得る一方、Ｓ１およびＳ２は、比較的低い予測誤差を有し得る。セグメントＳ０（領域とも称される）に関して、エンコーダは、残差コーディングのためにＢ－ＤＣＴを選択および使用し得る。低い予測誤差を有するセグメントＳ１およびＳ２のために、エンコーダは、ＳＡ－ＤＣＴを選択および使用し得る。残差エンコーディング変換の選択は、予測誤差（例えば、残差のサイズ）に基づくことができる。ＳＡ－ＤＣＴアルゴリズムは、複雑性の観点からは比較的単純であり、Ｂ－ＤＣＴほど多くの演算を要求しないので、より低い予測誤差の残差コーディングのためにＳＡ－ＤＣＴを利用することは、ビデオエンコーディングおよびデコーディングに関する複雑性ならびに処理性能を改良し得る。 Shape-adaptive DCT and/or B-DCT may be effectively used in a geometric partitioning with an adaptive number of regions. Although FIG. 1 is an illustration showing a non-limiting example of a residual block (e.g., a current block) 100 of size 64×64 or 128×128 with a geometric partitioning in which there are three segments S0, S1, and S2 with different prediction errors, three segments are illustrated in FIG. 1 for illustrative purposes, and alternatively or additionally, a greater or lesser number of segments may be employed. The current block may be geometrically partitioned according to two line segments (P1P2 and P3P4), which may divide the current block into three regions S0, S1, and S2. In this example, S0 may have a relatively high prediction error, while S1 and S2 may have a relatively low prediction error. For segment S0 (also referred to as a region), the encoder may select and use B-DCT for residual coding. For segments S1 and S2 with low prediction errors, the encoder may select and use SA-DCT. The selection of the residual encoding transform may be based on the prediction error (e.g., the size of the residual). Because the SA-DCT algorithm is relatively simple from a complexity standpoint and does not require as many operations as the B-DCT, utilizing SA-DCT for residual coding with lower prediction errors may improve the complexity and processing performance for video encoding and decoding.

故に、引き続き図１を参照すると、ＳＡ－ＤＣＴは、低い予測誤差を有するセグメントのための完全ブロックＤＣＴに対する付加的変換選択肢として信号伝達され得る。どれが低い誤差または高い誤差と見なされるかは、エンコーダにおいて設定されることができ、かつ適用に基づいて変動し得るパラメータであり得る。変換タイプの選定は、ビットストリーム内で信号伝達され得る。デコーダにおいて、ビットストリームが解析され得、所与のカレントブロックに関して、残差が、ビットストリーム内で信号伝達された変換タイプを使用してデコードされ得る。代替として、または加えて、いくつかの実装では、変換と関連付けられる係数の数が、ビットストリーム内で信号伝達され得る。 Thus, with continued reference to FIG. 1, SA-DCT may be signaled as an additional transform option to the full block DCT for segments with low prediction error. What is considered low error or high error may be a parameter that can be set at the encoder and can vary based on the application. The choice of transform type may be signaled in the bitstream. At the decoder, the bitstream may be analyzed and, for a given current block, the residual may be decoded using the transform type signaled in the bitstream. Alternatively or additionally, in some implementations, the number of coefficients associated with the transform may be signaled in the bitstream.

より詳細には、継続して図１を参照すると、適応的な数の領域を有する幾何学的分割は、長方形ブロックが非長方形であり得る２つ以上の領域にさらに分けられるビデオエンコーディングおよびデコーディングのための技法を含み得る。例えば、図１は、適応的な数の領域を有するピクセルレベルでの幾何学的分割の非限定的な例を例証する。例示的長方形ブロック１００（Ｍピクセルの幅とＮピクセルの高さとを有し、Ｍ×Ｎピクセルとして表され得る）が、線セグメントＰ１Ｐ２およびＰ３Ｐ４に沿って３つの領域（Ｓ０、Ｓ１、およびＳ２）に分けられ得る。Ｓ０内のピクセルが同様の動きを有するとき、動きベクトルが、その領域内の全てのピクセルの動きを記述し得、動きベクトルは、領域Ｓ０を圧縮するために使用されることができる。同様に、領域Ｓ１内のピクセルが同様の動きを有するとき、関連付けられる動きベクトルが、領域Ｓ１内のピクセルの動きを記述し得る。同様に、領域Ｓ２内のピクセルが同様の動きを有するとき、関連付けられる動きベクトルが、領域Ｓ２内のピクセルの動きを記述し得る。そのような幾何学的分割は、限定ではないが、例えば、極座標、デカルト座標、もしくは同等物等の座標、所定のテンプレートへのインデックス、またはビデオビットストリーム内の分割の他の特徴付けを使用して、位置Ｐ１、Ｐ２、Ｐ３、Ｐ４、および／またはこれらの位置の表現をエンコードすることによって、受信機（例えば、デコーダ）に信号伝達され得る。 More specifically, and continuing to refer to FIG. 1, geometric partitioning with an adaptive number of regions may include techniques for video encoding and decoding in which a rectangular block is further divided into two or more regions that may be non-rectangular. For example, FIG. 1 illustrates a non-limiting example of geometric partitioning at the pixel level with an adaptive number of regions. An example rectangular block 100 (having a width of M pixels and a height of N pixels and may be represented as M×N pixels) may be divided into three regions (S0, S1, and S2) along line segments P1P2 and P3P4. When pixels in S0 have similar motion, a motion vector may describe the motion of all pixels in that region, and the motion vector may be used to compress region S0. Similarly, when pixels in region S1 have similar motion, an associated motion vector may describe the motion of pixels in region S1. Similarly, when pixels in region S2 have similar motion, an associated motion vector may describe the motion of pixels in region S2. Such a geometric division may be signaled to a receiver (e.g., a decoder) by encoding the positions P1, P2, P3, P4, and/or a representation of these positions using, for example, but not limited to, coordinates such as polar coordinates, Cartesian coordinates, or the like, an index into a predefined template, or other characterization of the division within the video bitstream.

引き続き図１を参照すると、ピクセルレベルでの幾何学的分割を利用してビデオデータをエンコードするとき、線セグメントＰ１Ｐ２（またはより具体的には、点Ｐ１および点Ｐ２）が決定され得る。ピクセルレベルでの幾何学的分割を利用するときにブロックを最良に分ける線セグメントＰ１Ｐ２（または、より具体的には、点Ｐ１および点Ｐ２）を決定するために、点Ｐ１および点Ｐ２の可能な組み合わせは、ブロック幅および高さであるＭおよびＮに依存する。サイズＭ×Ｎのブロックに関して、（Ｍ－１）×（Ｎ－１）×３の可能な分割が存在する。したがって、正しい分割を識別することは、全ての可能な区分に関する動き推定を評価する演算的に高価なタスクになり得、これは、（例えば、ピクセルレベルでの幾何学的分割を伴わない）長方形分割を使用するエンコーディングと比較して、ビデオをエンコードするために要求される時間および／または処理能力の量を増加させ得る。最良または正しい分割を成すものは、メトリックに従って決定されることができ、実装毎に変化し得る。 Continuing with reference to FIG. 1, when encoding video data utilizing geometric partitioning at the pixel level, a line segment P1P2 (or, more specifically, points P1 and P2) may be determined. To determine the line segment P1P2 (or, more specifically, points P1 and P2) that best divides a block when utilizing geometric partitioning at the pixel level, the possible combinations of points P1 and P2 depend on M and N, which are the block width and height. For a block of size M×N, there are (M−1)×(N−1)×3 possible partitions. Thus, identifying the correct partition may be a computationally expensive task of evaluating motion estimates for all possible partitions, which may increase the amount of time and/or processing power required to encode the video compared to encoding using rectangular partitioning (e.g., without geometric partitioning at the pixel level). What constitutes the best or correct partition may be determined according to a metric and may vary from implementation to implementation.

いくつかの実装では、引き続き図１を参照すると、２つの領域を形成する第１の分割が決定され得（例えば、線Ｐ１Ｐ２および関連付けられる領域を決定する）、次いで、それらの領域のうちの１つがさらに分割される点で、分割が反復的に行われる。例えば、図１を参照して説明される分割は、ブロックを２つの領域に分割するために実施されることができる。それらの領域のうちの１つは、（例えば、新しい領域Ｓ１および領域Ｓ２を形成するために）さらに分割されることができる。このプロセスは、停止基準が達せられるまで、ブロックレベル幾何学的分割を実施し続けることができる。 In some implementations, with continued reference to FIG. 1, a first division may be determined that forms two regions (e.g., determining lines P1P2 and associated regions), and then the division is performed iteratively, in that one of the regions is further divided. For example, the division described with reference to FIG. 1 may be performed to divide a block into two regions. One of the regions may be further divided (e.g., to form new regions S1 and S2). This process may continue to perform block-level geometric divisions until a stopping criterion is reached.

図２は、ビデオエンコーディングおよびデコーディングに関する複雑性ならびに処理性能を改良することができる適応的な数の領域を伴う幾何学的分割に関するＳＡ－ＤＣＴおよび／またはＢ－ＤＣＴが可能な例示的ビデオエンコーダ２００を例証するシステムブロック図である。例示的ビデオエンコーダ２００は、入力ビデオ２０５を受信し、入力ビデオ２０５は、最初に、ツリー構造化マクロブロック分割スキーム（例えば、クアッドツリープラスバイナリツリー）等の処理スキームに従って、セグメント化され、または分けられることができる。ツリー構造化マクロブロック分割スキームの例は、ピクチャフレームをコーディングツリーユニット（ＣＴＵ）と呼ばれる大きいブロック要素に分割することを含み得る。いくつかの実装では、各ＣＴＵは、コーディングユニット（ＣＵ）と呼ばれるいくつかのサブブロックに１回以上さらに分割され得る。この分割の最終結果は、予測ユニット（ＰＵ）と呼ばれ得るサブブロックの群を含み得る。変換ユニット（ＴＵ）もまた、利用され得る。そのような分割スキームは、本主題のいくつかの側面に従って、適応的な数の領域を伴う幾何学的分割を実施することを含むことができる。 2 is a system block diagram illustrating an exemplary video encoder 200 capable of SA-DCT and/or B-DCT for geometric partitioning with an adaptive number of regions, which can improve the complexity and processing performance for video encoding and decoding. The exemplary video encoder 200 receives an input video 205, which can first be segmented or divided according to a processing scheme such as a tree-structured macroblock partitioning scheme (e.g., a quad tree plus a binary tree). An example of a tree-structured macroblock partitioning scheme may include partitioning a picture frame into large block elements called coding tree units (CTUs). In some implementations, each CTU may be further partitioned one or more times into several sub-blocks called coding units (CUs). The end result of this partitioning may include a group of sub-blocks that may be called prediction units (PUs). Transform units (TUs) may also be utilized. Such partitioning schemes may include implementing a geometric partitioning with an adaptive number of regions in accordance with some aspects of the present subject matter.

継続して図２を参照すると、例示的ビデオエンコーダ２００は、イントラ予測プロセッサ２１５と、適応的な数の領域を伴う幾何学的分割を支援することが可能な動き推定／補償プロセッサ２２０（インター予測プロセッサとも称される）と、変換／量子化プロセッサ２２５と、逆量子化／逆変換プロセッサ２３０と、ループ内フィルタ２３５と、デコード済ピクチャバッファ２４０と、エントロピーコーディングプロセッサ２４５とを含む。いくつかの実装では、動き推定／補償プロセッサ２２０は、幾何学的分割を実施することができる。幾何学的分割モードを信号伝達するビットストリームパラメータが、出力ビットストリーム２５０内での包含のために、エントロピーコーディングプロセッサ２４５に入力されることができる。 Continuing with reference to FIG. 2, the exemplary video encoder 200 includes an intra-prediction processor 215, a motion estimation/compensation processor 220 (also referred to as an inter-prediction processor) capable of supporting geometric partitioning with an adaptive number of regions, a transform/quantization processor 225, an inverse quantization/inverse transform processor 230, an in-loop filter 235, a decoded picture buffer 240, and an entropy coding processor 245. In some implementations, the motion estimation/compensation processor 220 can perform the geometric partitioning. A bitstream parameter signaling the geometric partitioning mode can be input to the entropy coding processor 245 for inclusion in the output bitstream 250.

動作時、継続して図２を参照すると、入力ビデオ２０５のフレームのブロック毎に、イントラピクチャ予測を介して、または動き推定／補償を使用して、ブロックを処理するかどうかが決定されることができる。ブロックは、イントラ予測プロセッサ２１０または動き推定／補償プロセッサ２２０に提供されることができる。ブロックがイントラ予測を介して処理されるべきである場合、イントラ予測プロセッサ２１０は、処理を実施し、予測子を出力することができる。ブロックが動き推定／補償を介して処理されるべきである場合、動き推定／補償プロセッサ２２０は、幾何学的分割の使用を含む処理を実施し、予測子を出力することができる。 In operation, and continuing to refer to FIG. 2, for each block of a frame of the input video 205, a decision can be made whether to process the block via intra-picture prediction or using motion estimation/compensation. The block can be provided to an intra-prediction processor 210 or a motion estimation/compensation processor 220. If the block is to be processed via intra-prediction, the intra-prediction processor 210 can perform processing and output a predictor. If the block is to be processed via motion estimation/compensation, the motion estimation/compensation processor 220 can perform processing, including the use of geometric partitioning, and output a predictor.

引き続き図２を参照すると、残差が、入力ビデオから予測子を減算することによって形成されることができる。残差は、変換／量子化プロセッサ２２５によって受信されることができ、これは、（例えば、残差のサイズまたは誤差メトリックを閾値と比較することによって）予測誤差（例えば、残差サイズ）が「高い」誤差と見なされるかまたは「低い」誤差と見なされるかを決定することができる。決定に基づいて、変換／量子化プロセッサ２２５は、Ｂ－ＤＣＴおよびＳＡ－ＤＣＴを含み得る変換タイプを選択することができる。いくつかの実装では、変換／量子化プロセッサ２２５は、残差が高い誤差を有すると見なされる場合、Ｂ－ＤＣＴの変換タイプを選択し、残差が低い誤差を有すると見なされる場合、ＳＡ－ＤＣＴの変換タイプを選択する。選択された変換タイプに基づいて、変換／量子化プロセッサ２２５は、変換処理（例えば、ＳＡ－ＤＣＴまたはＢ－ＤＣＴ）を実施し、係数を生成することができ、係数は、量子化されることができる。量子化された係数および任意の関連付けられる信号伝達情報（選択された変換タイプおよび／または使用される係数の数を含み得る）が、エントロピーエンコーディング、および出力ビットストリーム２５０内での包含のために、エントロピーコーディングプロセッサ２４５に提供されることができる。エントロピーエンコーディングプロセッサ２４５は、適応的な数の領域を伴う幾何学的分割のためのＳＡ－ＤＣＴに関連する信号伝達情報のエンコーディングを支援することができる。加えて、量子化された係数は、逆量子化／逆変換プロセッサ２３０に提供されることができ、逆量子化／逆変換プロセッサ２３０はピクセルを再現し得、ピクセルは、予測子と組み合わせられ、ループ内フィルタ２３５によって処理され得、その出力は、適応的な数の領域を伴う幾何学的分割を支援することが可能である動き推定／補償プロセッサ２２０による使用のために、デコード済ピクチャバッファ２４０内に記憶される。 Continuing to refer to FIG. 2, a residual may be formed by subtracting a predictor from the input video. The residual may be received by a transform/quantization processor 225, which may determine (e.g., by comparing the size of the residual or an error metric to a threshold) whether the prediction error (e.g., residual size) is considered to be a "high" error or a "low" error. Based on the determination, the transform/quantization processor 225 may select a transform type, which may include B-DCT and SA-DCT. In some implementations, the transform/quantization processor 225 selects a transform type of B-DCT if the residual is considered to have a high error and selects a transform type of SA-DCT if the residual is considered to have a low error. Based on the selected transform type, the transform/quantization processor 225 may perform a transform process (e.g., SA-DCT or B-DCT) to generate coefficients, which may be quantized. The quantized coefficients and any associated signaling information (which may include the selected transform type and/or the number of coefficients used) may be provided to an entropy coding processor 245 for entropy encoding and inclusion in the output bitstream 250. The entropy encoding processor 245 may support encoding of signaling information related to the SA-DCT for geometric partitioning with an adaptive number of regions. In addition, the quantized coefficients may be provided to an inverse quantization/inverse transform processor 230, which may reconstruct pixels that may be combined with a predictor and processed by an in-loop filter 235, the output of which is stored in a decoded picture buffer 240 for use by a motion estimation/compensation processor 220, which may support geometric partitioning with an adaptive number of regions.

ここで図３を参照すると、ビデオエンコーディングおよびデコーディングに関する複雑性および処理性能を改良することができる、適応的な数の領域を伴う幾何学的分割のためのＳＡ－ＤＣＴを用いてビデオをエンコードする例示的プロセス３００を例証するプロセスフロー図が、例証される。ステップ３１０において、ビデオフレームは、例えば、ピクチャフレームをＣＴＵおよびＣＵに分割することを含み得るツリー構造化マクロブロック分割スキームを使用して、初期ブロックセグメント化を受けて得る。３２０において、ブロックが、幾何学的分割のために選択され得る。選択は、ブロックが幾何学的分割モードに従って処理されるべきであることをメトリックルールに従って識別することを含み得る。ステップ３３０において、選択されたブロックが、幾何学的分割モードに従って、３つ以上の非長方形領域に分割され得る。 Now referring to FIG. 3, a process flow diagram illustrating an example process 300 for encoding video using SA-DCT for geometric partitioning with an adaptive number of regions, which can improve complexity and processing performance for video encoding and decoding, is illustrated. At step 310, a video frame may undergo initial block segmentation using, for example, a tree-structured macroblock partitioning scheme, which may include partitioning a picture frame into CTUs and CUs. At 320, a block may be selected for geometric partitioning. The selection may include identifying, according to a metric rule, that the block should be processed according to a geometric partitioning mode. At step 330, the selected block may be partitioned into three or more non-rectangular regions according to the geometric partitioning mode.

ステップ３４０において、引き続き図３を参照すると、幾何学的に分割された領域毎に、変換タイプ（変換則タイプとも称される）が決定され得る。これは、（例えば、残差のサイズまたは誤差メトリックを閾値と比較することによって）予測誤差（例えば、残差サイズ）が「高い」誤差と見なされるかまたは「低い」誤差と見なされるかを決定することを含み得る。決定に基づいて、変換タイプが、例えば、下記に説明されるようなクアッドツリープラスバイナリディシジョンツリープロセスを使用して選択され得、変換タイプは、限定ではないが、Ｂ－ＤＣＴまたはＳＡ－ＤＣＴを含み得る。いくつかの実装では、残差が高い誤差を有すると見なされる場合、Ｂ－ＤＣＴの変換タイプが選択され、残差が低い誤差を有すると見なされる場合、ＳＡ－ＤＣＴの変換タイプが選択される。選択された変換タイプに基づいて、変換処理（例えば、ＳＡ－ＤＣＴまたはＢ－ＤＣＴ）が、量子化され得る係数を生成するために実施され得る。 At step 340, and still referring to FIG. 3, for each geometrically divided region, a transform type (also referred to as a transform law type) may be determined. This may include determining whether the prediction error (e.g., residual size) is considered to be a "high" or "low" error (e.g., by comparing the size of the residual or an error metric to a threshold). Based on the determination, a transform type may be selected, for example, using a quad tree plus binary decision tree process as described below, which may include, but is not limited to, B-DCT or SA-DCT. In some implementations, if the residual is considered to have a high error, a B-DCT transform type is selected, and if the residual is considered to have a low error, a SA-DCT transform type is selected. Based on the selected transform type, a transform process (e.g., SA-DCT or B-DCT) may be performed to generate coefficients that may be quantized.

ステップ３５０において、継続して図３を参照すると、決定された変換タイプが、ビットストリーム内で信号伝送され得る。変換および量子化された残差が、ビットストリーム内に含まれることができる。いくつかの実装では、変換係数の数が、ビットストリーム内で信号伝送されることができる。 At step 350, and continuing to refer to FIG. 3, the determined transform type may be signaled in the bitstream. The transformed and quantized residual may be included in the bitstream. In some implementations, the number of transform coefficients may be signaled in the bitstream.

図４は、ビデオエンコーディングおよびデコーディングに関する複雑性および処理性能を改良することができる、適応的な数の領域を伴う幾何学的分割のためのＳＡ－ＤＣＴおよび／またはＢ－ＤＣＴを含む（しかしこれに限定されない）ＤＣＴを使用してビットストリーム４７０をデコードすることが可能なデコーダ４００の非限定的な例を例証するシステムブロック図である。デコーダ４００は、エントロピーデコーダプロセッサ４１０と、逆量子化および逆変換プロセッサ４２０と、デブロッキングフィルタ４３０と、フレームバッファ４４０と、動き補償プロセッサ４５０と、イントラ予測プロセッサ４６０とを含む。いくつかの実装では、ビットストリーム４７０は、幾何学的分割モードおよび変換則タイプを信号伝達するパラメータを含む。いくつかの実装では、ビットストリーム４７０は、変換係数の数を信号伝達するパラメータを含む。動き補償プロセッサ４５０は、本明細書に説明されるような幾何学的分割を使用してピクセル情報を再構築することができる。 FIG. 4 is a system block diagram illustrating a non-limiting example of a decoder 400 capable of decoding a bitstream 470 using DCT including, but not limited to, SA-DCT and/or B-DCT for geometric partitioning with an adaptive number of regions, which can improve complexity and processing performance for video encoding and decoding. The decoder 400 includes an entropy decoder processor 410, an inverse quantization and inverse transform processor 420, a deblocking filter 430, a frame buffer 440, a motion compensation processor 450, and an intra prediction processor 460. In some implementations, the bitstream 470 includes parameters signaling a geometric partitioning mode and a transform law type. In some implementations, the bitstream 470 includes parameters signaling a number of transform coefficients. The motion compensation processor 450 can reconstruct pixel information using the geometric partitioning as described herein.

動作時、引き続き図４を参照すると、ビットストリーム４７０が、デコーダ４００によって受信され、エントロピーデコーダプロセッサ４１０に入力され得、これは、ビットストリームを量子化された係数にエントロピーデコードし得る。量子化された係数は、逆量子化および逆変換プロセッサ４２０に提供され得、逆量子化および逆変換プロセッサ４２０は、コーディング変換則タイプ（例えば、Ｂ－ＤＣＴまたはＳＡ－ＤＣＴ）を決定し、決定されたコーディング変換則タイプに従って逆量子化および逆変換を実施し、残差信号を作成し得る。いくつかの実装では、逆量子化および逆変換プロセッサ４２０は、変換係数の数を決定し、変換係数の決定された数に従って、逆変換を実施し得る。 In operation, and still referring to FIG. 4, a bitstream 470 may be received by the decoder 400 and input to the entropy decoder processor 410, which may entropy decode the bitstream into quantized coefficients. The quantized coefficients may be provided to the inverse quantization and inverse transform processor 420, which may determine a coding transform law type (e.g., B-DCT or SA-DCT) and perform inverse quantization and inverse transform according to the determined coding transform law type to create a residual signal. In some implementations, the inverse quantization and inverse transform processor 420 may determine the number of transform coefficients and perform the inverse transform according to the determined number of transform coefficients.

引き続き図４を参照すると、残差信号が、処理モードに従って、動き補償プロセッサ４５０またはイントラ予測プロセッサ４６０の出力に追加され得る。動き補償プロセッサ４５０およびイントラ予測プロセッサ４６０の出力は、以前にデコードされたブロックに基づくブロック予測を含み得る。予測および残差の合計が、デブロッキングフィルタ４３０によって処理され、フレームバッファ４４０内に記憶され得る。所与のブロック（例えば、ＣＵまたはＰＵ）に関して、ビットストリーム４７０が、分割モードがブロックレベル幾何学的分割であることを信号伝達するとき、動き補償プロセッサ４５０は、本明細書に説明される幾何学的分割アプローチに基づいて、予測を構築し得る。 Continuing to refer to FIG. 4, the residual signal may be added to the output of the motion compensation processor 450 or the intra-prediction processor 460, depending on the processing mode. The output of the motion compensation processor 450 and the intra-prediction processor 460 may include a block prediction based on a previously decoded block. The sum of the prediction and the residual may be processed by the deblocking filter 430 and stored in the frame buffer 440. For a given block (e.g., CU or PU), when the bitstream 470 signals that the partition mode is block-level geometric partitioning, the motion compensation processor 450 may construct a prediction based on the geometric partitioning approach described herein.

図５は、ビデオエンコーディングおよびデコーディングに関する複雑性ならびに処理性能を改良することができる、適応的な数の領域を伴う幾何学的分割のためのＳＡ－ＤＣＴを使用してビットストリームをデコードする例示的プロセス５００を例証するプロセスフロー図である。ステップ５１０において、ビットストリームが受信され、これは、カレントブロック（例えば、ＣＴＵ、ＣＵ、ＰＵ）を含み得る。受信することは、ビットストリームからカレントブロックおよび関連付けられる信号伝達情報を抽出および／または解析することを含み得る。デコーダは、幾何学的分割を特徴付ける１つ以上のパラメータを抽出または決定し得る。これらのパラメータは、例えば線セグメントの始点および終点（例えば、Ｐ１、Ｐ２、Ｐ３、Ｐ４）のインデックスを含み得、抽出または決定は、ビットストリームからパラメータを識別し、読み出すこと（例えば、ビットストリームを解析すること）を含み得る。 FIG. 5 is a process flow diagram illustrating an example process 500 for decoding a bitstream using SA-DCT for geometric partitioning with an adaptive number of regions, which can improve complexity and processing performance for video encoding and decoding. At step 510, a bitstream is received, which may include a current block (e.g., CTU, CU, PU). Receiving may include extracting and/or parsing the current block and associated signaling information from the bitstream. The decoder may extract or determine one or more parameters that characterize the geometric partitioning. These parameters may include, for example, indices of start and end points of line segments (e.g., P1, P2, P3, P4), and extracting or determining may include identifying and retrieving the parameters from the bitstream (e.g., parsing the bitstream).

ステップ５２０において、引き続き図５を参照すると、カレントブロックの第１の領域、第２の領域、および第３の領域が、幾何学的分割モードに従って決定され得る。決定することは、幾何学的分割モードがカレントブロックに関して有効化される（例えば、真である）かどうかを決定することを含み得る。幾何学的分割モードが有効化されない（例えば、偽である）場合、デコーダは、代替分割モードを使用してカレントブロックを処理し得る。幾何学的分割モードが有効化される（例えば、真である）場合、３つ以上の領域が、決定および／または処理され得る。 At step 520, with continued reference to FIG. 5, a first region, a second region, and a third region of the current block may be determined according to a geometric partitioning mode. Determining may include determining whether the geometric partitioning mode is enabled (e.g., true) for the current block. If the geometric partitioning mode is not enabled (e.g., false), the decoder may process the current block using an alternate partitioning mode. If the geometric partitioning mode is enabled (e.g., true), more than two regions may be determined and/or processed.

随意のステップ５３０において、継続して図５を参照すると、コーディング変換則タイプが、決定され得る。コーディング変換則タイプが、ビットストリーム内で信号伝達され得る。例えば、ビットストリームは、Ｂ－ＤＣＴまたはＳＡ－ＤＣＴを規定し得るコーディング変換則タイプを決定するために解析され得る。決定されたコーディング変換則タイプは、第１の領域、第２の領域、および／または第３の領域をデコードするためのものであり得る。 At optional step 530, and continuing to refer to FIG. 5, a coding transform law type may be determined. The coding transform law type may be signaled within the bitstream. For example, the bitstream may be analyzed to determine a coding transform law type, which may specify B-DCT or SA-DCT. The determined coding transform law type may be for decoding the first region, the second region, and/or the third region.

５４０において、引き続き図５を参照すると、カレントブロックがデコードされ得る。カレントブロックのデコーディングは、第１の領域、第２の領域、および／または第３の領域の各々のための逆変換のために決定された変換タイプを使用することを含み得る。デコーディングは、領域毎に、幾何学的分割モードに従って関連付けられる動き情報を決定することを含み得る。 At 540, with continued reference to FIG. 5, the current block may be decoded. Decoding the current block may include using the determined transform type for an inverse transform for each of the first region, the second region, and/or the third region. The decoding may include, for each region, determining associated motion information according to the geometric partitioning mode.

少数の変形例が、上記に詳細に説明されたが、他の修正または追加も可能である。例えば、幾何学的分割は、エンコーダにおけるレート歪み決定に基づいて、ビットストリーム内で信号伝達されることができる。コーディングは、通常の所定の分割（例えば、テンプレート）、分割の時間的および空間的予測、ならびに付加的オフセットの組み合わせに基づくことができる。幾何学的に分割された領域の各々は、動き補償された予測またはイントラ予測を利用することができる。予測された領域の境界は、残差が追加される前に平滑化されることができる。 Although a few variations have been detailed above, other modifications or additions are possible. For example, the geometric partitioning can be signaled in the bitstream based on a rate-distortion decision at the encoder. The coding can be based on a combination of regular predefined partitioning (e.g., a template), temporal and spatial prediction of the partitioning, and an additive offset. Each of the geometrically partitioned regions can utilize motion compensated prediction or intra prediction. The boundaries of the predicted regions can be smoothed before the residual is added.

いくつかの実装では、クアッドツリープラスバイナリディシジョンツリー（ＱＴＢＴ）が、実装され得る。ＱＴＢＴでは、コーディングツリーユニットレベルにおいて、ＱＴＢＴの分割パラメータが、いかなるオーバーヘッドも伝送することなく、局所的特性に適合するように動的に導出される。続けて、コーディングユニットレベルにおいて、ジョイント分類器ディシジョンツリー構造が、不必要な反復を排除し、誤った予測のリスクを制御し得る。いくつかの実装では、適応的な数の領域を伴う幾何学的分割は、ＱＴＢＴの全てのリーフノードにおいて利用可能な付加的分割オプションとして利用可能であり得る。 In some implementations, a quad-tree plus binary decision tree (QTBT) may be implemented, where at the coding tree unit level, the splitting parameters of the QTBT are dynamically derived to adapt to local characteristics without transmitting any overhead. Subsequently, at the coding unit level, a joint classifier decision tree structure may eliminate unnecessary iterations and control the risk of erroneous predictions. In some implementations, a geometric split with an adaptive number of regions may be available as an additional splitting option available at all leaf nodes of the QTBT.

いくつかの実装では、デコーダが、カレントブロックに関する幾何学的分割を発生させ、従属プロセスに関する全ての分割関連情報を提供する分割プロセッサを含み得る。分割プロセッサは、これが、ブロックが幾何学的に分割される場合にセグメント毎に実施され得るので、動き補償に直接影響を及ぼし得る。さらに、分割プロセッサは、形状情報をイントラ予測プロセッサおよび変換コーディングプロセッサに提供し得る。 In some implementations, the decoder may include a partitioning processor that generates a geometric partitioning for the current block and provides all partition-related information for the subordinate processes. The partitioning processor may directly affect motion compensation, since this may be performed on a segment-by-segment basis if the block is geometrically partitioned. Additionally, the partitioning processor may provide shape information to the intra-prediction processor and the transform coding processor.

いくつかの実装では、付加的シンタックス要素が、ビットストリームの異なる階層レベルにおいて信号伝達され得る。シーケンス全体に関する適応的な数の領域を伴う幾何学的分割を有効化するために、有効化フラグが、シーケンスパラメータセット（ＳＰＳ）においてコーディングされ得る。さらに、任意のコーディングユニット（ＣＵ）が適応的な数の領域を伴う幾何学的分割を使用するかどうかを示すために、ＣＴＵフラグが、コーディングツリーユニット（ＣＴＵ）レベルにおいてコーディングされ得る。カレントコーディングユニットが適応的な数の領域を伴う幾何学的分割を利用するかどうかを示すために、ＣＵフラグが、コーディングされ得る。ブロック上の線セグメントを規定するパラメータが、コーディングされ得る。領域毎に、カレント領域がインター予測されるかまたはイントラ予測されるかを規定し得るフラグが、デコードされ得る。 In some implementations, additional syntax elements may be signaled at different hierarchical levels of the bitstream. An enable flag may be coded in the sequence parameter set (SPS) to enable geometric partitioning with an adaptive number of regions for the entire sequence. Additionally, a CTU flag may be coded at the coding tree unit (CTU) level to indicate whether a coding unit (CU) uses geometric partitioning with an adaptive number of regions. A CU flag may be coded to indicate whether the current coding unit utilizes geometric partitioning with an adaptive number of regions. Parameters that specify line segments on a block may be coded. For each region, a flag may be decoded that may specify whether the current region is inter-predicted or intra-predicted.

いくつかの実装では、最小領域サイズが規定され得る。 In some implementations, a minimum region size may be specified.

本明細書に説明される主題は、多くの技術的利点を提供する。例えば、本主題のいくつかの実装は、圧縮効率を増加させながら複雑性を低減させるブロックの分割を提供することができる。いくつかの実装では、オブジェクト境界におけるブロッキングアーチファクトが、低減させられることができる。 The subject matter described herein provides many technical advantages. For example, some implementations of the subject matter can provide block partitioning that reduces complexity while increasing compression efficiency. In some implementations, blocking artifacts at object boundaries can be reduced.

本明細書に説明される側面および実施形態のうちの任意の１つ以上のものが、コンピュータ技術分野の当業者に明白であるように、本明細書の教示に従ってプログラムされた１つ以上の機械（例えば、電子ドキュメントのためのユーザコンピューティングデバイスとして利用される１つ以上のコンピューティングデバイス、ドキュメントサーバ等の１つ以上のサーバデバイス等）において実現および／または実装されるデジタル電子回路、集積回路、専用に設計された特定用途向け集積回路（ＡＳＩＣ）、フィールドプログラマブルゲートアレイ（ＦＰＧＡ）コンピュータハードウェア、ファームウェア、ソフトウェア、および／またはそれらの組み合わせを使用して、便宜的に実装され得ることに留意されたい。これらの種々の側面または特徴は、少なくとも１つのプログラム可能なプロセッサを含むプログラム可能なシステム上で実行可能かつ／または解読可能である１つ以上のコンピュータプログラムおよび／またはソフトウェア内での実装を含み得、少なくとも１つのプログラム可能なプロセッサは、専用目的もしくは汎用目的であり得、専用目的もしくは汎用目的であり得、データおよび命令を、ストレージシステム、少なくとも１つの入力デバイス、および少なくとも１つの出力デバイスから受信し、データおよび命令をそれらに伝送するように結合される。適切なソフトウェアコーディングが、ソフトウェア技術分野の当業者に明白であるように、本開示の教示に基づいて、熟練のプログラマによって容易に準備され得る。ソフトウェアおよび／またはソフトウェアモジュールを採用する上記に議論される側面および実装もまた、ソフトウェアおよび／またはソフトウェアモジュールの機械実行可能命令の実装を補助するために適切なハードウェアを含み得る。 It should be noted that any one or more of the aspects and embodiments described herein may be conveniently implemented using digital electronic circuitry, integrated circuits, specially designed application specific integrated circuits (ASICs), field programmable gate array (FPGA) computer hardware, firmware, software, and/or combinations thereof realized and/or implemented in one or more machines (e.g., one or more computing devices utilized as user computing devices for electronic documents, one or more server devices such as document servers, etc.) programmed in accordance with the teachings herein, as would be apparent to one of ordinary skill in the computer arts. These various aspects or features may include implementation in one or more computer programs and/or software executable and/or readable on a programmable system including at least one programmable processor, which may be dedicated or general purpose, coupled to receive data and instructions from and transmit data and instructions to a storage system, at least one input device, and at least one output device. Appropriate software coding may be readily prepared by skilled programmers based on the teachings of the present disclosure, as would be apparent to one of ordinary skill in the software arts. The aspects and implementations discussed above that employ software and/or software modules may also include suitable hardware to assist in implementing the machine-executable instructions of the software and/or software modules.

そのようなソフトウェアは、機械可読記憶媒体を採用するコンピュータプログラム製品であり得る。機械可読記憶媒体は、機械（例えば、コンピューティングデバイス）による実行のための命令のシーケンスを記憶および／またはエンコードすることが可能であり、かつ機械に本明細書に説明される方法および／または実施形態の任意の１つを実施させる任意の媒体であり得る。機械可読記憶媒体の例は、限定ではないが、磁気ディスク、光ディスク（例えば、ＣＤ、ＣＤ－Ｒ、ＤＶＤ、ＤＶＤ－Ｒ等）、光磁気ディスク、読取専用メモリ「ＲＯＭ」デバイス、ランダムアクセスメモリ「ＲＡＭ」デバイス、磁気カード、光学カード、ソリッドステートメモリデバイス、ＥＰＲＯＭ、ＥＥＰＲＯＭ、プログラマブル論理デバイス（ＰＬＤ）、および／またはそれらの任意の組み合わせを含む。機械可読媒体は、本明細書で使用される場合、単一の媒体、ならびに、例えばコンピュータメモリとの組み合わされたコンパクトディスクもしくは１つ以上のハードディスクドライブの集合等の物理的に分離した媒体の集合を含むように意図されている。本明細書で使用される場合、機械可読記憶媒体は、信号伝送の一過性形態を含まない。 Such software may be a computer program product employing a machine-readable storage medium. A machine-readable storage medium may be any medium capable of storing and/or encoding a sequence of instructions for execution by a machine (e.g., a computing device) and causing the machine to perform any one of the methods and/or embodiments described herein. Examples of machine-readable storage media include, but are not limited to, magnetic disks, optical disks (e.g., CDs, CD-Rs, DVDs, DVD-Rs, etc.), magneto-optical disks, read-only memory "ROM" devices, random access memory "RAM" devices, magnetic cards, optical cards, solid-state memory devices, EPROMs, EEPROMs, programmable logic devices (PLDs), and/or any combination thereof. Machine-readable media, as used herein, is intended to include a single medium as well as a collection of physically separate media, such as, for example, a collection of compact discs or one or more hard disk drives in combination with a computer memory. As used herein, machine-readable storage media does not include a transitory form of signal transmission.

そのようなソフトウェアはまた、搬送波等のデータキャリア上のデータ信号として搬送される情報（例えば、データ）を含み得る。例えば、機械実行可能情報は、信号が機械（例えば、コンピューティングデバイス）による実行のために命令のシーケンスまたはその一部をエンコードするデータキャリアにおいて具現化されるデータ搬送信号、ならびに機械に本明細書に説明される方法および／または実施形態の任意の１つを実施させる任意の関連する情報（例えば、データ構造およびデータ）として含まれ得る。 Such software may also include information (e.g., data) carried as a data signal on a data carrier, such as a carrier wave. For example, machine-executable information may be included as a data carrying signal embodied in a data carrier, the signal encoding a sequence of instructions or portions thereof for execution by a machine (e.g., a computing device), as well as any associated information (e.g., data structures and data) that causes the machine to perform any one of the methods and/or embodiments described herein.

コンピューティングデバイスの例は、限定ではないが、電子書籍読書デバイス、コンピュータワークステーション、端末コンピュータ、サーバコンピュータ、ハンドヘルドデバイス（例えば、タブレット型コンピュータ、スマートフォン等）、ウェブ装置、ネットワークルータ、ネットワークスイッチ、ネットワークブリッジ、機械よってとられるべきアクションを規定する命令のシーケンスを実行することが可能である任意の機械、およびそれらの任意の組み合わせを含む。一例では、コンピューティングデバイスは、キオスクを含み、かつ／またはその中に含まれ得る。 Examples of computing devices include, but are not limited to, e-book reading devices, computer workstations, terminal computers, server computers, handheld devices (e.g., tablet computers, smartphones, etc.), web appliances, network routers, network switches, network bridges, any machine capable of executing a sequence of instructions that define actions to be taken by the machine, and any combination thereof. In one example, a computing device may include and/or be included within a kiosk.

図６は、コントロールシステムに本開示の側面および／または方法のうちの任意の１つ以上のものを実施させるための命令のセットが実行され得るコンピュータシステム６００の例示的形態としてのコンピューティングデバイスの一実施形態の図式表現を示す。複数のコンピューティングデバイスが、デバイスのうちの１つ以上に、本開示の側面および／または方法のうちの任意の１つ以上を実施させるために専用に構成された命令のセットを実装するために利用され得ることも、考えられる。コンピュータシステム６００は、プロセッサ６０４と、メモリ６０８とを含み、プロセッサ６０４およびメモリ６０８は、バス６１２を介して相互に、および他の構成要素と通信する。バス６１２は、限定ではないが、種々のバスアーキテクチャのうちのいずれかを使用するメモリバス、メモリコントローラ、周辺バス、ローカルバス、およびそれらの任意の組み合わせを含むいくつかのタイプのバス構造のうちのいずれかを含み得る。 6 illustrates a diagrammatic representation of one embodiment of a computing device as an exemplary form of computer system 600 on which a set of instructions for causing a control system to perform any one or more of the aspects and/or methods of the present disclosure may be executed. It is also contemplated that multiple computing devices may be utilized to implement a set of instructions specifically configured to cause one or more of the devices to perform any one or more of the aspects and/or methods of the present disclosure. Computer system 600 includes a processor 604 and a memory 608, which communicate with each other and with other components via a bus 612. Bus 612 may include any of several types of bus structures, including, but not limited to, a memory bus, a memory controller, a peripheral bus, a local bus, and any combination thereof, using any of a variety of bus architectures.

メモリ６０８は、限定ではないが、ランダムアクセスメモリ構成要素、読取専用構成要素、およびそれらの任意の組み合わせを含む種々の構成要素（例えば、機械可読媒体）を含み得る。一例では、起動中等にコンピュータシステム６００内の要素間で情報を転送することに役立つ基本ルーチンを含む基本入力／出力システム６１６（ＢＩＯＳ）が、メモリ６０８の中に記憶され得る。メモリ６０８はまた、本開示の側面および／または方法のうちの任意の１つ以上を具現化する命令（例えば、ソフトウェア）６２０を含み得る（例えば、１つ以上の機械可読媒体上に記憶されている）。別の例では、メモリ６０８はさらに、限定ではないが、オペレーティングシステム、１つ以上のアプリケーションプログラム、他のプログラムモジュール、プログラムデータ、およびそれらの任意の組み合わせを含む任意の数のプログラムモジュールを含み得る。 The memory 608 may include a variety of components (e.g., machine-readable media), including, but not limited to, random access memory components, read-only components, and any combination thereof. In one example, a basic input/output system 616 (BIOS), including basic routines that help to transfer information between elements within the computer system 600, such as during startup, may be stored in the memory 608. The memory 608 may also include (e.g., stored on one or more machine-readable media) instructions (e.g., software) 620 that embody any one or more of the aspects and/or methods of the present disclosure. In another example, the memory 608 may further include any number of program modules, including, but not limited to, an operating system, one or more application programs, other program modules, program data, and any combination thereof.

コンピュータシステム６００はまた、記憶デバイス６２４を含み得る。記憶デバイス（例えば、記憶デバイス６２４）の例は、限定ではないが、ハードディスクドライブ、磁気ディスクドライブ、光学媒体と組み合わせられた光ディスクドライブ、ソリッドステートメモリデバイス、およびそれらの任意の組み合わせを含む。記憶デバイス６２４は、適切なインターフェース（図示せず）によってバス６１２に接続され得る。例示的インターフェースは、限定ではないが、ＳＣＳＩ、アドバンスト・テクノロジー・アタッチメント（ＡＴＡ）、シリアルＡＴＡ、ユニバーサルシリアルバス（ＵＳＢ）、ＩＥＥＥ１３９４（ＦＩＲＥＷＩＲＥ（登録商標））、およびそれらの任意の組み合わせを含む。一例では、記憶デバイス６２４（または１つ以上のその構成要素）は、（例えば、外部ポートコネクタ（図示せず）を介して）コンピュータシステム６００と除去可能にインターフェース接続され得る。特に、記憶デバイス６２４および関連付けられた機械可読媒体６２８は、コンピュータシステム６００のための機械可読命令、データ構造、プログラムモジュール、ならびに／または、他のデータの不揮発性記憶装置および／または揮発性記憶装置を提供し得る。一例では、ソフトウェア６２０は、完全に、または部分的に、機械可読媒体６２８内に常駐し得る。別の例では、ソフトウェア６２０は、完全に、または部分的に、プロセッサ６０４内に常駐し得る。 Computer system 600 may also include a storage device 624. Examples of storage devices (e.g., storage device 624) include, but are not limited to, hard disk drives, magnetic disk drives, optical disk drives combined with optical media, solid-state memory devices, and any combination thereof. Storage device 624 may be connected to bus 612 by an appropriate interface (not shown). Exemplary interfaces include, but are not limited to, SCSI, Advanced Technology Attachment (ATA), Serial ATA, Universal Serial Bus (USB), IEEE 1394 (FIREWIRE®), and any combination thereof. In one example, storage device 624 (or one or more of its components) may be removably interfaced with computer system 600 (e.g., via an external port connector (not shown)). In particular, storage device 624 and associated machine-readable media 628 may provide non-volatile and/or volatile storage of machine-readable instructions, data structures, program modules, and/or other data for computer system 600. In one example, the software 620 may reside, completely or partially, within the machine-readable medium 628. In another example, the software 620 may reside, completely or partially, within the processor 604.

コンピュータシステム６００はまた、入力デバイス６３２を含み得る。一例では、コンピュータシステム６００のユーザは、入力デバイス６３２を介してコンピュータシステム６００内にコマンドおよび／または他の情報を打ち込み得る。入力デバイス６３２の例は、限定ではないが、英数字入力デバイス（例えば、キーボード）、ポインティングデバイス、ジョイスティック、ゲームパッド、オーディオ入力デバイス（例えば、マイクロホン、音声応答システム等）、カーソル制御デバイス（例えば、マウス）、タッチパッド、光学スキャナ、ビデオ捕捉デバイス（例えば、静止カメラ、ビデオカメラ）、タッチスクリーン、およびそれらの任意の組み合わせを含む。入力デバイス６３２は、限定ではないが、シリアルインターフェース、パラレルインターフェース、ゲームポート、ＵＳＢインターフェース、ＦＩＲＥＷＩＲＥ（登録商標）インターフェース、バス６１２への直接的インターフェース、およびそれらの任意の組み合わせを含む種々のインターフェース（図示せず）のうちのいずれかを介して、バス６１２にインターフェース接続され得る。入力デバイス６３２は、タッチスクリーンインターフェースを含み得、タッチスクリーンインターフェースは、さらに下記に議論されるディスプレイ６３６の一部であるか、またはそれと別個であり得る。入力デバイス６３２は、上記に説明されるようなグラフィカルインターフェースにおいて１つ以上のグラフィック表現を選択するためのユーザ選択デバイスとして利用され得る。 Computer system 600 may also include input devices 632. In one example, a user of computer system 600 may type commands and/or other information into computer system 600 via input devices 632. Examples of input devices 632 include, but are not limited to, alphanumeric input devices (e.g., keyboards), pointing devices, joysticks, gamepads, audio input devices (e.g., microphones, voice response systems, etc.), cursor control devices (e.g., mice), touchpads, optical scanners, video capture devices (e.g., still cameras, video cameras), touch screens, and any combination thereof. Input devices 632 may be interfaced to bus 612 via any of a variety of interfaces (not shown), including, but not limited to, a serial interface, a parallel interface, a game port, a USB interface, a FIREWIRE® interface, a direct interface to bus 612, and any combination thereof. Input devices 632 may include a touch screen interface, which may be part of or separate from display 636, discussed further below. The input device 632 may be utilized as a user selection device for selecting one or more graphical representations in a graphical interface as described above.

ユーザはまた、記憶デバイス６２４（例えば、リムーバブルディスクドライブ、フラッシュドライブ等）および／またはネットワークインターフェースデバイス６４０を介してコマンドおよび／または他の情報をコンピュータシステム６００に入力し得る。ネットワークインターフェースデバイス６４０等のネットワークインターフェースデバイスは、ネットワーク６４４等の種々のネットワークのうちの１つ以上、およびそれに接続される１つ以上の遠隔デバイス６４８にコンピュータシステム６００を接続するために利用され得る。ネットワークインターフェースデバイスの例は、限定ではないが、ネットワークインターフェースカード（例えば、モバイルネットワークインターフェースカード、ＬＡＮカード）、モデム、およびそれらの任意の組み合わせを含む。ネットワークの例は、限定ではないが、ワイドエリアネットワーク（例えば、インターネット、企業ネットワーク）、ローカルエリアネットワーク（例えば、オフィス、建物、キャンパス、または他の比較的小さい地理的空間に関連付けられたネットワーク）、電話ネットワーク、電話／音声プロバイダと関連付けられたデータネットワーク（例えば、モバイル通信プロバイダのデータおよび／または音声ネットワーク）、２つのコンピューティングデバイス間の直接的接続、ならびにそれらの任意の組み合わせを含む。ネットワーク６４４等のネットワークは、有線モードおよび／または無線のモードの通信を採用し得る。概して、任意のネットワークトポロジが使用され得る。情報（例えば、データ、ソフトウェア６２０等）が、ネットワークインターフェースデバイス６４０を介して、コンピュータシステム６００に、および／またはコンピュータシステム６００から通信され得る。 A user may also input commands and/or other information to computer system 600 via storage device 624 (e.g., removable disk drive, flash drive, etc.) and/or network interface device 640. A network interface device, such as network interface device 640, may be utilized to connect computer system 600 to one or more of a variety of networks, such as network 644, and one or more remote devices 648 connected thereto. Examples of network interface devices include, but are not limited to, network interface cards (e.g., mobile network interface cards, LAN cards), modems, and any combination thereof. Examples of networks include, but are not limited to, wide area networks (e.g., the Internet, enterprise networks), local area networks (e.g., networks associated with an office, building, campus, or other relatively small geographic space), telephone networks, data networks associated with a telephone/voice provider (e.g., a mobile communications provider's data and/or voice network), a direct connection between two computing devices, and any combination thereof. A network, such as network 644, may employ wired and/or wireless modes of communication. In general, any network topology may be used. Information (e.g., data, software 620, etc.) may be communicated to and/or from computer system 600 via network interface device 640.

コンピュータシステム６００はさらに、ディスプレイデバイス６３６等のディスプレイデバイスに表示可能な画像を通信するためのビデオディスプレイアダプタ６５２を含み得る。ディスプレイデバイスの例は、限定ではないが、液晶ディスプレイ（ＬＣＤ）、陰極線管（ＣＲＴ）、プラズマディスプレイ、発光ダイオード（ＬＥＤ）ディスプレイ、およびそれらの任意の組み合わせを含む。ディスプレイアダプタ６５２およびディスプレイデバイス６３６は、本開示の側面のグラフィック表現を提供するためにプロセッサ６０４と組み合わせて利用され得る。ディスプレイデバイスに加えて、コンピュータシステム６００は、限定ではないが、オーディオスピーカ、プリンタ、およびそれらの任意の組み合わせを含む１つ以上の他の周辺出力デバイスを含み得る。そのような周辺出力デバイスは、周辺インターフェース６５６を介してバス６１２に接続され得る。周辺インターフェースの例は、限定ではないが、シリアルポート、ＵＳＢ接続、ＦＩＲＥＷＩＲＥ（登録商標）接続、パラレル接続、およびそれらの任意の組み合わせを含む。 The computer system 600 may further include a video display adapter 652 for communicating images displayable on a display device, such as the display device 636. Examples of display devices include, but are not limited to, a liquid crystal display (LCD), a cathode ray tube (CRT), a plasma display, a light emitting diode (LED) display, and any combination thereof. The display adapter 652 and the display device 636 may be utilized in combination with the processor 604 to provide graphical representations of aspects of the present disclosure. In addition to a display device, the computer system 600 may include one or more other peripheral output devices, including, but not limited to, audio speakers, a printer, and any combination thereof. Such peripheral output devices may be connected to the bus 612 via a peripheral interface 656. Examples of peripheral interfaces include, but are not limited to, a serial port, a USB connection, a FIREWIRE® connection, a parallel connection, and any combination thereof.

前述は、本発明の例証的実施形態の詳細な説明である。種々の修正および追加が、本発明の精神および範囲から逸脱することなく成され得る。上記に説明される種々の実施形態の各々の特徴が、関連付けられた新しい実施形態において複数の特徴の組み合わせを提供するために、適宜、他の説明される実施形態の特徴と組み合わせられ得る。さらに、前述は、いくつかの別個の実施形態を説明するが、本明細書に説明されているものは、本発明の原理の適用を例証するにすぎない。加えて、本明細書における特定の方法は、具体的な順序で実施されるものとして例証および／または説明され得るが、順序は、本明細書に開示されるような実施形態を達成するために、通常の技術内で大いに変更可能である。故に、本説明は、例としてのみ捉えられることを意図されており、別様に本発明の範囲を限定するようには意図されていない。 The foregoing is a detailed description of illustrative embodiments of the present invention. Various modifications and additions may be made without departing from the spirit and scope of the present invention. Features of each of the various embodiments described above may be combined with features of other described embodiments, as appropriate, to provide a combination of features in related new embodiments. Moreover, while the foregoing describes several separate embodiments, what has been described herein is merely illustrative of the application of the principles of the present invention. In addition, although certain methods herein may be illustrated and/or described as being performed in a specific order, the order may be varied considerably within ordinary skill in the art to achieve the embodiments as disclosed herein. Thus, this description is intended to be taken only as an example, and is not intended to otherwise limit the scope of the present invention.

上記の説明において、および請求項において、「～のうちの少なくとも１つ」または「～のうちの１つ以上」等の語句が生じ、要素または特徴の接続的列挙が後に続き得る。用語「および／または」もまた、２つ以上の要素または特徴の列挙内に生じ得る。そのような語句が使用される文脈によって別様に暗示的または明示的に否定されない限り、これは、個々に列挙される要素もしくは特徴のいずれか、または他の記載される要素もしくは特徴のいずれかと組み合わせて記載される要素もしくは特徴のいずれかを意味することが意図されている。例えば、語句「ＡおよびＢのうちの少なくとも一方」、「ＡおよびＢのうちの１つ以上」、ならびに「Ａおよび／またはＢ」は、各々、「Ａのみ、Ｂのみ、またはＡおよびＢともに」を意味することが意図されている。同様の解釈が、３つ以上のアイテムを含む列挙に関しても意図されている。例えば、語句「Ａ、Ｂ、およびＣのうちの少なくとも１つ」、「Ａ、Ｂ、およびＣのうちの１つ以上」、ならびに「Ａ、Ｂ、および／またはＣ」は、各々、「Ａのみ、Ｂのみ、Ｃのみ、ＡおよびＢともに、ＡおよびＣともに、ＢおよびＣともに、またはＡおよびＢおよびＣともに」を意味することが意図されている。加えて、上記および請求項内での用語「～に基づいて」の使用は、記載されていない特徴または要素も許容可能であるように、「少なくとも、～に基づいて」を意味することが意図されている。 In the above description and in the claims, phrases such as "at least one of" or "one or more of" may occur followed by a conjunctive enumeration of elements or features. The term "and/or" may also occur within a enumeration of two or more elements or features. Unless otherwise implied or explicitly contradicted by the context in which such a phrase is used, this is intended to mean any of the elements or features listed individually or any of the elements or features listed in combination with any of the other listed elements or features. For example, the phrases "at least one of A and B," "one or more of A and B," and "A and/or B" are each intended to mean "A only, B only, or both A and B." A similar interpretation is intended with respect to enumerations containing more than two items. For example, the phrases "at least one of A, B, and C," "one or more of A, B, and C," and "A, B, and/or C" are each intended to mean "A only, B only, C only, both A and B, both A and C, both B and C, or both A, B, and C." Additionally, use of the term "based on" above and in the claims is intended to mean "based at least on," such that unrecited features or elements are also allowed.

本明細書に説明される主題は、所望の構成に応じて、システム、装置、方法、および／または物品として具現化されることができる。前述の説明に記載される実装は、本明細書に説明される主題と一貫した全実装を表すわけではない。代わりに、それらは、単に説明される主題に関連する側面と一貫するいくつかの例にすぎない。いくつかの変更が、上記で詳細に説明されているが、他の修正または追加も、可能である。特に、さらなる特徴および／または変更が、本明細書に記載されるものに加えて提供され得る。例えば、上記で説明される実装は、開示される特徴の種々の組み合わせおよび副次的組み合わせおよび／または上記に開示されるいくつかのさらなる特徴の組み合わせおよび副次的組み合わせを対象とし得る。加えて、付随の図に描写され、かつ／または本明細書に説明される論理フローは、望ましい結果を達成するために、必ずしも、示される特定の順序または連続的順序を要求しない。他の実装も、以下の請求項の範囲内にあり得る。 The subject matter described herein may be embodied as a system, an apparatus, a method, and/or an article, depending on the desired configuration. The implementations described in the foregoing description do not represent all implementations consistent with the subject matter described herein. Instead, they are merely some examples consistent with aspects related to the subject matter described. Although some variations have been described in detail above, other modifications or additions are possible. In particular, further features and/or variations may be provided in addition to those described herein. For example, the implementations described above may be directed to various combinations and subcombinations of the disclosed features and/or combinations and subcombinations of some further features disclosed above. In addition, the logic flow depicted in the accompanying figures and/or described herein does not necessarily require the particular order or sequential order shown to achieve the desired results. Other implementations may be within the scope of the following claims.

Claims

10. A decoder, the decoder comprising a circuit, the circuit comprising:
Receiving a bitstream;
determining a first region, a second region, and a third region of the current block according to a geometric partitioning mode;
decoding the current block using an inverse discrete cosine transform for each of the first region, the second region, and the third region;
determining whether the geometric partitioning mode is enabled; and
determining a first line segment for the current block;
determining a second line segment for the current block, wherein decoding the current block includes reconstructing pixel data using the first line segment and the second line segment;
The first line segment and the second line segment divide the current block into the first region, the second region, and the third region.

The decoder of claim 1, wherein the current block has a size of 128x128 or 64x64.

The decoder of claim 1, wherein the first line segment characterizes the first region and the second line segment characterizes the second region and the third region.

The decoder of claim 1, wherein reconstructing pixel data includes calculating a predictor for the first region using an associated motion vector contained in the bitstream.

The circuitry is further configured to determine a coding transform law type for decoding each of the first region, the second region, and/or the third region from a signal contained in the bitstream, the coding transform law type characterizing at least an inverse block discrete cosine transform and an inverse shape adaptive discrete cosine transform, the decoder comprising:
an entropy decoder processor configured to receive the bitstream and decode the bitstream into quantized coefficients;
an inverse quantization and inverse transform processor configured to process the quantized coefficients, including performing an inverse discrete cosine transformation, in accordance with the determined coding transform law type;
A deblocking filter;
A frame buffer;
The decoder of claim 1 further comprising: an intra-prediction processor;

The decoder of claim 1, wherein the bitstream includes a parameter indicating whether the geometric partitioning mode is enabled for the current block.

The decoder of claim 1, wherein the current block forms part of a quad tree plus a binary decision tree.

The decoder of claim 1, wherein the current block is a non-leaf node of a quad tree plus a binary decision tree.

The decoder of claim 1, wherein the current block is a coding tree unit or a coding unit.

The decoder of claim 1, wherein the first region is a coding unit or a prediction unit.

10. A decoder, the decoder comprising a circuit, the circuit comprising:
Receiving a bitstream;
determining a first region, a second region, and a third region of the current block according to a geometric partitioning mode;
determining a coding transform law type for decoding each of the first region, the second region, and/or the third region from a signal contained within the bitstream, the coding transform law type characterizing at least an inverse block discrete cosine transform and an inverse shape adaptive discrete cosine transform;
decoding the current block, wherein decoding the current block includes using the determined transformation rule type for an inverse transformation for each of the first region, the second region, and/or the third region;
determining whether the geometric partitioning mode is enabled; and
determining a first line segment for the current block;
determining a second line segment for the current block, wherein decoding the current block includes reconstructing pixel data using the first line segment and the second line segment;
The first line segment and the second line segment divide the current block into the first region, the second region, and the third region.

The decoder of claim 11 , wherein the current block has a size of 128×128 or 64×64.

12. The decoder of claim 11 , wherein the first line segment characterizes the first region and the second line segment characterizes the second region and the third region.

12. The decoder of claim 11 , wherein reconstructing pixel data includes calculating a predictor for the first region using an associated motion vector contained in the bitstream.

an entropy decoder processor configured to receive the bitstream and decode the bitstream into quantized coefficients;
an inverse quantization and inverse transform processor configured to process the quantized coefficients, including performing an inverse discrete cosine in accordance with the determined coding transform law type;
A deblocking filter;
A frame buffer;
The decoder of claim 11 further comprising: an intra-prediction processor;

The decoder of claim 11 , wherein the bitstream includes a parameter indicating whether the geometric partitioning mode is enabled for the current block.

12. The decoder of claim 11 , wherein the current block forms part of a quad-tree plus a binary decision tree.

The decoder of claim 11 , wherein the current block is a non-leaf node of a quadtree plus a binary decision tree.

The decoder of claim 11 , wherein the current block is a coding tree unit or a coding unit.

The decoder of claim 11 , wherein the first region is a coding unit or a prediction unit.

1. A method, comprising:
A decoder receives a bitstream;
determining a first region, a second region, and a third region of the current block according to a geometric partitioning mode;
determining a coding transform law type for decoding the first region, the second region, and/or the third region from a signal contained in the bitstream, the coding transform law type characterizing at least an inverse block discrete cosine transform or an inverse shape adaptive discrete cosine transform;
decoding the current block, wherein decoding the current block includes using the determined transformation rule type for an inverse transformation for each of the first region, the second region, and/or the third region;
determining, by the decoder, whether the geometric partitioning mode is enabled;
the decoder determining a first line segment for the current block;
the decoder determining a second line segment for the current block;
decoding the current block includes reconstructing pixel data using the first line segment and the second line segment;
The method of claim 1, wherein the first line segment and the second line segment divide the current block into the first region, the second region, and the third region.

The method of claim 21 , wherein the current block has a size of 128×128 or 64×64.

22. The method of claim 21 , wherein the first line segment characterizes the first region and the second line segment characterizes the second region and the third region.

22. The method of claim 21 , wherein reconstructing pixel data includes calculating a predictor for the first region using an associated motion vector contained in the bitstream.

The decoder comprises:
an entropy decoder processor configured to receive the bitstream and decode the bitstream into quantized coefficients;
an inverse quantization and inverse transform processor configured to process the quantized coefficients, including performing an inverse discrete cosine in accordance with the determined coding transform law type;
A deblocking filter;
A frame buffer;
and an intra-prediction processor.

22. The method of claim 21 , wherein the bitstream includes a parameter indicating whether a block level geometric partitioning mode is enabled for the current block .

22. The method of claim 21 , wherein the current block forms part of a quad tree plus a binary decision tree.

22. The method of claim 21 , wherein the current block is a non-leaf node of a quad tree plus a binary decision tree.

The method of claim 21 , wherein the current block is a coding tree unit or a coding unit.

The method of claim 21 , wherein the first region is a coding unit or a prediction unit.