JP2018536364A

JP2018536364A - Multi-region search range for block prediction mode for display stream compression (DSC)

Info

Publication number: JP2018536364A
Application number: JP2018529177A
Authority: JP
Inventors: ヤコブソン、ナタン・ハイム; ティルマライ、ビジャヤラガバン; ジョーシー、ラジャン・ラクスマン
Original assignee: Qualcomm Inc
Current assignee: Qualcomm Inc
Priority date: 2015-12-07
Filing date: 2016-12-06
Publication date: 2018-12-06
Anticipated expiration: 2036-12-06
Also published as: WO2017100206A1; TWI692244B; CN108293114A; US10368073B2; CN108293114B; JP7198665B2; BR112018011398B1; CA3004185C; TW201725909A; US20170163986A1; EP3387832B1; KR102102066B1; EP3387832A1; BR112018011398A2; HUE049810T2; CA3004185A1; KR20180091003A

Abstract

A method for coding a block of video data in a simplified block prediction mode of a constant bit rate video coding scheme for transmission over a display link is disclosed. In one aspect, the method includes determining a candidate block to be used for predicting a current block in a current slice, the candidate block being within a range of pixel positions that each correspond to a reconstructed pixel in the current slice. The range of pixel positions may comprise (i) a first region including one or more first pixel positions in a first line of pixels that overlaps the current block, and (ii) a second region including one or more second pixel positions in a second line of pixels that does not overlap the current block. The method may further comprise determining and signaling a prediction vector indicative of the pixel position of the candidate block.

Description

[0001] The present disclosure relates to the field of video coding and compression, and in particular to video compression for transmission over display links, such as display link video compression.

[0002] Digital video capabilities can be incorporated into a wide range of displays, including digital televisions, personal digital assistants (PDAs), laptop computers, desktop monitors, digital cameras, digital recording devices, digital media players, video gaming devices, video game consoles, cellular or satellite radio telephones, video teleconferencing devices, and the like. Display links are used to connect displays to appropriate source devices. The bandwidth requirement of a display link is proportional to the resolution of the display, so high-resolution displays require high-bandwidth display links. Some display links do not have the bandwidth to support high-resolution displays. Video compression can be used to reduce the bandwidth requirement so that lower-bandwidth display links can provide digital video to high-resolution displays.
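As a rough illustration of the proportionality described in [0002], the following Python sketch estimates the raw pixel-data rate of a display and the rate after compression to a fixed number of bits per pixel. The function names, the 3840×2160 at 60 Hz example, and the 8 bits-per-pixel target are assumptions chosen for illustration; they do not come from this disclosure, and blanking intervals and link-protocol overhead are ignored.

```python
def pixel_rate_gbps(width, height, frames_per_second, bits_per_pixel):
    """Pixel-data rate in Gbit/s, ignoring blanking and link overhead."""
    return width * height * frames_per_second * bits_per_pixel / 1e9


def compression_ratio(source_bpp, target_bpp):
    """Ratio between the uncompressed and compressed bits per pixel."""
    return source_bpp / target_bpp


# Hypothetical example: 24-bit RGB at 3840x2160, 60 frames per second.
raw = pixel_rate_gbps(3840, 2160, 60, 24)
# Same display with a hypothetical fixed target of 8 bits per pixel.
compressed = pixel_rate_gbps(3840, 2160, 60, 8)
print(f"raw: {raw:.2f} Gbit/s, compressed: {compressed:.2f} Gbit/s")
```

Because the rate scales linearly with the pixel count, doubling both the width and the height quadruples the required link bandwidth at the same frame rate and bit depth.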

[0003] Coding schemes that involve image compression of pixel data exist. However, such schemes are sometimes not visually lossless, or can be difficult and expensive to implement in conventional display devices.

[0004] The Video Electronics Standards Association (VESA) has developed Display Stream Compression (DSC) as a standard for display link video compression. A display link video compression technique, such as DSC, should provide, among other things, picture quality that is visually lossless (i.e., pictures having a level of quality such that users cannot tell that compression is active). The display link video compression technique should also provide a scheme that is easy and inexpensive to implement in real time with conventional hardware.

[0005] FIG. 1A is a block diagram illustrating an example video encoding and decoding system that may utilize techniques in accordance with aspects described in this disclosure.
[0006] FIG. 1B is a block diagram illustrating another example video encoding and decoding system that may perform techniques in accordance with aspects described in this disclosure.
[0007] FIG. 2A is a block diagram illustrating an example of a video encoder that may implement the techniques.
[0008] FIG. 2B is a block diagram illustrating an example of a video decoder that may implement the techniques.
[0009] FIG. 3 is a block diagram illustrating a search space for a non-first line for 1D blocks.
[0010] FIG. 4 is a block diagram illustrating a search space for a non-first line for 2D blocks.
[0011] FIG. 5 is a block diagram illustrating a search space for a first line for 1D blocks.
[0012] FIG. 6 is a block diagram illustrating a search space for a first line for 2D blocks.
[0013] FIG. 7 is a flowchart illustrating a method for predicting a block of video data in a block prediction mode.
[0014] FIG. 8 is a block diagram illustrating a block having partitions.
[0015] FIG. 9 is a block diagram illustrating a data flow for a block prediction mode with adaptive partition sizes.
[0016] FIG. 10 is a block diagram illustrating two different partition options for a 2×2 region within a block.
[0017] FIG. 11 is a block diagram illustrating entropy coding groups for a block prediction mode.
[0018] FIG. 12 is a block diagram illustrating a search space for a 2×8 block.
[0019] FIG. 13 is a block diagram illustrating different partition sizes being used for different regions of a block.
[0020] FIG. 14 is a flowchart illustrating a method for predicting a block of video data in a block prediction mode using variable partition sizes.
[0021] FIG. 15 is a block diagram illustrating an example block prediction search for a 2×2 partition with 4:2:0 chroma subsampling.
[0022] FIG. 16 is a block diagram illustrating an example block prediction search for a 1×2 partition with 4:2:0 chroma subsampling.
[0023] FIG. 17 is a block diagram illustrating an example block prediction search for a 2×2 partition with 4:2:2 chroma subsampling.
[0024] FIG. 18 is a block diagram illustrating an example block prediction search for a 1×2 partition with 4:2:2 chroma subsampling.
[0025] FIG. 19 is a block diagram illustrating a single search range for a block prediction mode.
[0026] FIG. 20 is a block diagram illustrating multiple search ranges for a block prediction mode.
[0027] FIG. 21 is a flowchart illustrating a method for predicting a block of video data in a block prediction mode using multiple search ranges.
[0028] FIG. 22 is a block diagram illustrating an example search region for a simplified block prediction mode.
[0029] FIG. 23 is a block diagram illustrating an example search region for a simplified block prediction mode.
[0030] FIG. 24 is a block diagram illustrating an example search region for a simplified block prediction mode.
[0031] FIG. 25 is a block diagram illustrating an example search region for a simplified block prediction mode.
[0032] FIG. 26 is a flowchart illustrating a method for predicting a block of video data in a simplified block prediction mode.

[0033] The DSC standard includes a number of coding modes in which each block of video data may be encoded by an encoder and, similarly, decoded by a decoder. In some implementations, the encoder and/or the decoder may predict the current block to be coded based on a previously coded block.

[0034] However, existing coding modes (e.g., transform coding, differential pulse-code modulation, etc.) do not provide a satisfactory way of compressing highly complex regions in video data. Often, for this type of data (i.e., highly compressed video data), the current block to be coded (or the constituent sub-blocks of the current block) is similar in content to previous blocks that the coder (e.g., encoder or decoder) has encountered. However, existing intra prediction may be too limited to provide a satisfactory prediction of such a current block (e.g., a prediction that is sufficiently similar to the current block and would therefore yield a sufficiently small residual). Thus, an improved method of coding blocks of video data is desired.

[0035] The systems, methods, and devices of this disclosure each have several innovative aspects, no single one of which is solely responsible for the desirable attributes disclosed herein.

[0036] In one aspect, a method for coding a block of video data in a simplified block prediction mode of a constant bit rate video coding scheme may include: determining a candidate block to be used for predicting a current block in a current slice, the candidate block being within a range of pixel positions that each correspond to a reconstructed pixel in the current slice, the range of pixel positions comprising at least (i) a first region including one or more first pixel positions in a first line of pixels in the current slice, the first line of pixels including at least one pixel in the current block and spanning the entire width of the current slice, and (ii) a second region including one or more second pixel positions in a second line of pixels in the current slice, the second line of pixels not including any pixel in the current block but spanning the entire width of the current slice; determining a prediction vector indicative of the pixel position of the candidate block within the range of pixel positions, the pixel position of the candidate block being in one of the first region or the second region; and coding the current block in the simplified block prediction mode at least in part via signaling the prediction vector.

[0037] In another aspect, an apparatus configured to code a block of video data in a simplified block prediction mode of a constant bit rate video coding scheme may include a memory configured to store one or more reconstructed pixels of a current slice of video data, and one or more processors in communication with the memory. The one or more processors may be configured to: determine a candidate block to be used for predicting a current block in the current slice, the candidate block being within a range of pixel positions that each correspond to a reconstructed pixel in the current slice, the range of pixel positions comprising at least (i) a first region including one or more first pixel positions in a first line of pixels in the current slice, the first line of pixels including at least one pixel in the current block and spanning the entire width of the current slice, and (ii) a second region including one or more second pixel positions in a second line of pixels in the current slice, the second line of pixels not including any pixel in the current block but spanning the entire width of the current slice; determine a prediction vector indicative of the pixel position of the candidate block within the range of pixel positions, the pixel position of the candidate block being in one of the first region or the second region; and code the current block in the simplified block prediction mode at least in part via signaling the prediction vector.

[0038] In another aspect, non-transitory physical computer storage may comprise code configured to code a block of video data in a simplified block prediction mode of a constant bit rate video coding scheme. The code, when executed, may cause an apparatus to: determine a candidate block to be used for predicting a current block in a current slice, the candidate block being within a range of pixel positions that each correspond to a reconstructed pixel in the current slice, the range of pixel positions comprising at least (i) a first region including one or more first pixel positions in a first line of pixels in the current slice, the first line of pixels including at least one pixel in the current block and spanning the entire width of the current slice, and (ii) a second region including one or more second pixel positions in a second line of pixels in the current slice, the second line of pixels not including any pixel in the current block but spanning the entire width of the current slice; determine a prediction vector indicative of the pixel position of the candidate block within the range of pixel positions, the pixel position of the candidate block being in one of the first region or the second region; and code the current block in the simplified block prediction mode at least in part via signaling the prediction vector.

[0039] In another aspect, a video coding device may be configured to code a block of video data in a simplified block prediction mode of a constant bit rate video coding scheme. The video coding device may comprise: means for determining a candidate block to be used for predicting a current block in a current slice, the candidate block being within a range of pixel positions that each correspond to a reconstructed pixel in the current slice, the range of pixel positions comprising at least (i) a first region including one or more first pixel positions in a first line of pixels in the current slice, the first line of pixels including at least one pixel in the current block and spanning the entire width of the current slice, and (ii) a second region including one or more second pixel positions in a second line of pixels in the current slice, the second line of pixels not including any pixel in the current block but spanning the entire width of the current slice; means for determining a prediction vector indicative of the pixel position of the candidate block within the range of pixel positions, the pixel position of the candidate block being in one of the first region or the second region; and means for coding the current block in the simplified block prediction mode at least in part via signaling the prediction vector.
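The two-region search recited in the aspects above can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the search defined by this disclosure or by the DSC standard: it assumes 1D blocks (one pixel high), a single color component, a sum-of-absolute-differences (SAD) match metric, the line above the current block as the second line, and hypothetical window sizes `range1` and `range2`. All function and parameter names are illustrative.

```python
def sad(block_a, block_b):
    """Sum of absolute differences between two equal-length pixel lists."""
    return sum(abs(p - q) for p, q in zip(block_a, block_b))


def search_two_regions(recon, cur, y, x, range1=8, range2=8):
    """Return (cost, (dy, dx)) for the best candidate, or None if none exists.

    recon: rows of reconstructed pixels of the current slice; row y is only
           trusted for columns < x, rows above y are fully reconstructed.
    cur:   original pixels of the current 1 x W block located at (y, x).
    """
    w = len(cur)
    best = None

    # Region 1 (assumed): the line containing the current block, restricted
    # to already reconstructed positions to the left of the block.
    for start in range(max(0, x - range1), x - w + 1):
        cost = sad(cur, recon[y][start:start + w])
        if best is None or cost < best[0]:
            best = (cost, (0, start - x))

    # Region 2 (assumed): the previous line, which contains no pixel of the
    # current block, searched in a window around column x.
    if y > 0:
        row = recon[y - 1]
        for start in range(max(0, x - range2), min(len(row) - w, x + range2) + 1):
            cost = sad(cur, row[start:start + w])
            if best is None or cost < best[0]:
                best = (cost, (-1, start - x))

    return best


recon = [[10, 20, 30, 40, 50, 60, 70, 80],
         [1, 2, 3, 4, 0, 0, 0, 0]]
print(search_two_regions(recon, [30, 40], y=1, x=4))
```

In this sketch the returned `(dy, dx)` pair plays the role of the prediction vector: `dy` identifies the region (0 for the current line, -1 for the previous line) and `dx` gives the horizontal offset of the candidate block relative to the current block.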

Detailed description

[0040] In general, this disclosure relates to methods of improving video compression techniques, such as those utilized in display link video compression. More specifically, the present disclosure relates to systems and methods for coding a block of video data in block prediction mode using adaptive search range selection.

[0041] While certain embodiments are described herein in the context of the DSC standard, which is an example of a display link video compression technique, one having ordinary skill in the art would appreciate that the systems and methods disclosed herein may be applicable to any suitable video coding standard. For example, embodiments disclosed herein may be applicable to one or more of the following standards: International Telecommunication Union (ITU) Telecommunication Standardization Sector (ITU-T) H.261, International Organization for Standardization/International Electrotechnical Commission (ISO/IEC) Moving Picture Experts Group-1 (MPEG-1) Visual, ITU-T H.262 or ISO/IEC MPEG-2 Visual, ITU-T H.263, ISO/IEC MPEG-4 Visual, ITU-T H.264 (also known as ISO/IEC MPEG-4 AVC), High Efficiency Video Coding (HEVC), and any extensions to such standards. Also, the techniques described in this disclosure may become part of standards developed in the future. In other words, the techniques described in this disclosure may be applicable to previously developed video coding standards, video coding standards currently under development, and forthcoming video coding standards.

[0042] The DSC standard includes a number of coding modes in which each block of video data may be encoded by an encoder and, similarly, decoded by a decoder. In some implementations, the encoder and/or the decoder may predict the current block to be coded based on a previously coded block.

[0043] However, existing coding modes (e.g., transform coding, differential pulse-code modulation, etc.) do not provide a satisfactory way of compressing highly complex regions in video data. Often, for this type of data (i.e., highly compressed video data), the current block to be coded (or the constituent sub-blocks of the current block) is similar in content to previous blocks that the coder (e.g., encoder or decoder) has encountered. However, existing intra prediction may be too limited to provide a satisfactory prediction of such a current block (e.g., a prediction that is sufficiently similar to the current block and would therefore yield a sufficiently small residual). Thus, an improved method of coding blocks of video data is desired.

[0044] In the present disclosure, an improved method of coding a block in block prediction mode is described. For example, when searching for a candidate block (or candidate region) to be used for predicting the current block (or a current region within the current block), a search range may be defined such that the encoder has access to potential candidates that may be a good match while minimizing the search cost. In another example, the encoder may determine, based on a rate-distortion (RD) analysis, which one of a plurality of search ranges to use for coding the current block. In yet another example, the encoder may determine which of the previously coded pixels are included in the search range used for coding the current block based on various factors such as the location of the current block, RD cost, and so on. By performing more operations on the encoder side (e.g., searching for a candidate block to be used for predicting the current block, calculating a vector identifying the location of the candidate block with respect to the current block, and comparing the costs associated with using different search ranges, all of which may consume computing resources and processing power), the method may reduce decoder complexity. Additionally, by allowing multiple and/or adaptable search ranges to be used for coding blocks in block prediction mode, the likelihood of locating a better candidate partition may be increased, thereby improving the coding efficiency and/or coding performance of the block prediction mode. Further, by allowing the encoder to adaptively select the search range used to code each block, the performance of the block prediction scheme may be further improved.
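The RD-based choice among search ranges mentioned in [0044] can be sketched as below. The disclosure does not give a cost formula at this point, so the sketch assumes the commonly used Lagrangian form, cost = distortion + λ × rate. The function names, the option labels, and the λ values are hypothetical and serve only as an illustration.

```python
def rd_cost(distortion, rate_bits, lam):
    """Assumed Lagrangian rate-distortion cost: D + lambda * R."""
    return distortion + lam * rate_bits


def pick_search_range(options, lam=1.0):
    """Return the label of the search range with the lowest RD cost.

    options: dict mapping a search-range label to a
             (distortion, rate_bits) pair measured by the encoder.
    """
    return min(options, key=lambda name: rd_cost(*options[name], lam))


# Hypothetical measurements for two search ranges on one block.
options = {"range_A": (10, 4), "range_B": (6, 12)}
print(pick_search_range(options, lam=1.0))   # rate weighted heavily
print(pick_search_range(options, lam=0.25))  # distortion weighted heavily
```

Only the encoder runs this comparison. The decoder merely follows the signaled choice, which is consistent with the stated aim of moving work to the encoder side to reduce decoder complexity.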

Video coding standards
[0045] A digital image, such as a video image, a TV image, a still image, or an image generated by a video recorder or a computer, may include pixels or samples arranged in horizontal and vertical lines. The number of pixels in a single image is typically in the tens of thousands. Each pixel typically contains luminance and chrominance information. Without compression, the sheer quantity of information to be conveyed from an image encoder to an image decoder would render real-time image transmission impractical. To reduce the amount of information to be transmitted, a number of different compression methods, such as the JPEG, MPEG, and H.263 standards, have been developed.

[0046] Video coding standards include ITU-T H.261, ISO/IEC MPEG-1 Visual, ITU-T H.262 or ISO/IEC MPEG-2 Visual, ITU-T H.263, ISO/IEC MPEG-4 Visual, ITU-T H.264 (also known as ISO/IEC MPEG-4 AVC), and HEVC, including extensions of such standards.

[0047] In addition, a video coding standard, namely DSC, has been developed by VESA. The DSC standard is a video compression standard that can compress video for transmission over display links. As the resolution of displays increases, the bandwidth of the video data required to drive the displays increases correspondingly. Some display links may not have the bandwidth to transmit all of the video data to the display at such resolutions. Accordingly, the DSC standard specifies a compression standard for interoperable, visually lossless compression over display links.

[0048] The DSC standard differs from other video coding standards, such as H.264 and HEVC. DSC includes intra-frame compression but does not include inter-frame compression, meaning that temporal information may not be used by the DSC standard in coding the video data. In contrast, other video coding standards may employ inter-frame compression in their video coding techniques.

Video coding system
[0049] Various aspects of the novel systems, apparatuses, and methods are described more fully hereinafter with reference to the accompanying drawings. This disclosure may, however, be embodied in many different forms and should not be construed as limited to any specific structure or function presented throughout this disclosure. Rather, these aspects are provided so that this disclosure will be thorough and complete, and will fully convey the scope of the disclosure to those skilled in the art. Based on the teachings herein, one skilled in the art should appreciate that the scope of the disclosure is intended to cover any aspect of the novel systems, apparatuses, and methods disclosed herein, whether implemented independently of, or combined with, any other aspect of the present disclosure. For example, an apparatus may be implemented or a method may be practiced using any number of the aspects set forth herein. In addition, the scope of the present disclosure is intended to cover such an apparatus or method that is practiced using other structure, functionality, or structure and functionality in addition to or other than the various aspects of the present disclosure set forth herein. It should be understood that any aspect disclosed herein may be embodied by one or more elements of a claim.

[0050] Although particular aspects are described herein, many variations and permutations of these aspects fall within the scope of the disclosure. Although some benefits and advantages of the preferred aspects are mentioned, the scope of the disclosure is not intended to be limited to particular benefits, uses, or objectives. Rather, aspects of the disclosure are intended to be broadly applicable to different wireless technologies, system configurations, networks, and transmission protocols, some of which are illustrated by way of example in the figures and in the following description of the preferred aspects. The detailed description and drawings are merely illustrative of the disclosure rather than limiting, the scope of the disclosure being defined by the appended claims and equivalents thereof.

[0051] The accompanying drawings illustrate examples. Elements indicated by reference numbers in the accompanying drawings correspond to elements indicated by like reference numbers in the following description. In this disclosure, elements having names that start with ordinal words (e.g., "first," "second," "third," and so on) do not necessarily imply that the elements have a particular order. Rather, such ordinal words are merely used to refer to different elements of the same or a similar type.

[0052] FIG. 1A is a block diagram that illustrates an example video coding system 10 that may utilize techniques in accordance with aspects described in this disclosure. As used and described herein, the term “video coder” or “coder” refers generically to both video encoders and video decoders. In this disclosure, the terms “video coding” or “coding” may refer generically to video encoding and video decoding. In addition to video encoders and video decoders, the aspects described in the present application may be extended to other related devices such as transcoders (e.g., devices that can decode a bitstream and re-encode another bitstream) and middleboxes (e.g., devices that can modify, transform, and/or otherwise manipulate a bitstream).

[0053] As shown in FIG. 1A, video coding system 10 includes a source device 12 (i.e., “video coding device 12” or “coding device 12”) that generates encoded video data to be decoded at a later time by a destination device 14 (i.e., “video coding device 14” or “coding device 14”). In the example of FIG. 1A, the source device 12 and the destination device 14 constitute separate devices. It is noted, however, that the source device 12 and the destination device 14 may be on or part of the same device, as shown in the example of FIG. 1B.

[0054] With reference once again to FIG. 1A, the source device 12 and the destination device 14 may each comprise any of a wide range of devices (also referred to as video coding devices), including desktop computers, notebook (e.g., laptop) computers, tablet computers, set-top boxes, telephone handsets such as so-called “smart” phones, so-called “smart” pads, televisions, cameras, display devices, digital media players, video game consoles, video streaming devices, or the like. In various embodiments, the source device 12 and the destination device 14 may be equipped for (i.e., configured to communicate via) wireless communication.

[0055] The video coding devices 12, 14 of the video coding system 10 may be configured to communicate via wireless networks and radio technologies, such as wireless wide area network (WWAN) (e.g., cellular) and/or wireless local area network (WLAN) carriers. The terms “network” and “system” are often used interchangeably. Each of the video coding devices 12, 14 may be a user equipment (UE), a wireless device, a terminal, a mobile station, a subscriber station, and so on.

[0056] The WWAN carriers may include, for example, wireless communication networks such as Code Division Multiple Access (CDMA), Time Division Multiple Access (TDMA), Frequency Division Multiple Access (FDMA), Orthogonal FDMA (OFDMA), Single-Carrier FDMA (SC-FDMA), and other networks. A CDMA network may implement a radio technology such as Universal Terrestrial Radio Access (UTRA), CDMA2000, and so on. UTRA includes Wideband CDMA (WCDMA®) and other variants of CDMA. CDMA2000 covers the IS-2000, IS-95, and IS-856 standards. A TDMA network may implement a radio technology such as Global System for Mobile Communications (GSM®). An OFDMA network may implement a radio technology such as Evolved UTRA (E-UTRA), Ultra Mobile Broadband (UMB), IEEE 802.11 (Wi-Fi), IEEE 802.16 (WiMAX), IEEE 802.20, Flash-OFDM®, and so on. UTRA and E-UTRA are part of the Universal Mobile Telecommunications System (UMTS). 3GPP® Long Term Evolution (LTE®) and LTE-Advanced (LTE-A) are the latest releases of UMTS that use E-UTRA. UTRA, E-UTRA, UMTS, LTE, LTE-A, and GSM are described in documents from an organization named “3rd Generation Partnership Project” (3GPP). CDMA2000 and UMB are described in documents from an organization named “3rd Generation Partnership Project 2” (3GPP2).

[0057] The video coding devices 12, 14 of the video coding system 10 may also communicate with each other over a WLAN base station according to one or more standards, such as the IEEE 802.11 standard, including, for example, the amendments 802.11a-1999 (commonly called “802.11a”), 802.11b-1999 (commonly called “802.11b”), and 802.11g-2003 (commonly called “802.11g”).

[0058] The destination device 14 may receive, via link 16, the encoded video data to be decoded. The link 16 may comprise any type of medium or device capable of moving the encoded video data from the source device 12 to the destination device 14. In the example of FIG. 1A, the link 16 may comprise a communication medium that enables the source device 12 to transmit encoded video data to the destination device 14 in real time. The encoded video data may be modulated according to a communication standard, such as a wireless communication protocol, and transmitted to the destination device 14. The communication medium may comprise any wireless or wired communication medium, such as a radio frequency (RF) spectrum or one or more physical transmission lines. The communication medium may form part of a packet-based network, such as a local area network, a wide area network, or a global network such as the Internet. The communication medium may include routers, switches, base stations, or any other equipment that may be useful to facilitate communication from the source device 12 to the destination device 14.

[0059] In the example of FIG. 1A, the source device 12 includes a video source 18, a video encoder 20 (also referred to simply as encoder 20), and an output interface 22. In some cases, the output interface 22 may include a modulator/demodulator (modem) and/or a transmitter. In the source device 12, the video source 18 may include a source such as a video capture device (e.g., a video camera), a video archive containing previously captured video, a video feed interface to receive video from a video content provider, and/or a computer graphics system for generating computer graphics data as the source video, or a combination of such sources. As one example, if the video source 18 is a video camera, the source device 12 and the destination device 14 may form a so-called “camera phone” or “video phone,” as illustrated in the example of FIG. 1B. However, the techniques described in this disclosure may be applicable to video coding in general, and may be applied to wireless and/or wired applications.

[0060] The captured, pre-captured, or computer-generated video may be encoded by the video encoder 20. The encoded video data may be transmitted to the destination device 14 via the output interface 22 of the source device 12. The encoded video data may also (or alternatively) be stored on the storage device 31 for later access by the destination device 14 or other devices, for decoding and/or playback. The video encoder 20 illustrated in FIGS. 1A and 1B may comprise the video encoder 20 illustrated in FIG. 2A or any other video encoder described herein.

[0061] In the example of FIG. 1A, the destination device 14 includes an input interface 28, a video decoder 30 (also referred to simply as decoder 30), and a display device 32. In some cases, the input interface 28 may include a receiver and/or a modem. The input interface 28 of the destination device 14 may receive the encoded video data over the link 16 and/or from the storage device 31. The encoded video data communicated over the link 16, or provided on the storage device 31, may include a variety of syntax elements generated by the video encoder 20 for use by a video decoder, such as the video decoder 30, in decoding the video data. Such syntax elements may be included with the encoded video data transmitted on a communication medium, stored on a storage medium, or stored on a file server. The video decoder 30 illustrated in FIGS. 1A and 1B may comprise the video decoder 30 illustrated in FIG. 2B or any other video decoder described herein.

[0062] The display device 32 may be integrated with, or external to, the destination device 14. In some examples, the destination device 14 may include an integrated display device and also be configured to interface with an external display device. In other examples, the destination device 14 may be a display device. In general, the display device 32 displays the decoded video data to a user, and may comprise any of a variety of display devices such as a liquid crystal display (LCD), a plasma display, an organic light emitting diode (OLED) display, or another type of display device.

[0063] In related aspects, FIG. 1B shows an example video coding system 10′ wherein the source device 12 and the destination device 14 are on or part of a device 11. The device 11 may be a telephone handset, such as a “smart” phone or the like. The device 11 may include a processor/controller device 13 (optionally present) in operative communication with the source device 12 and the destination device 14. The video coding system 10′ of FIG. 1B, and components thereof, are otherwise similar to the video coding system 10 of FIG. 1A, and components thereof.

[0064] The video encoder 20 and the video decoder 30 may operate according to a video compression standard, such as DSC. Alternatively, the video encoder 20 and the video decoder 30 may operate according to other proprietary or industry standards, such as the ITU-T H.264 standard (alternatively referred to as MPEG-4, Part 10, AVC) or HEVC, or extensions of such standards. The techniques of this disclosure, however, are not limited to any particular coding standard. Other examples of video compression standards include MPEG-2 and ITU-T H.263.

[0065] Although not shown in the examples of FIGS. 1A and 1B, the video encoder 20 and the video decoder 30 may each be integrated with an audio encoder and decoder, and may include appropriate MUX-DEMUX units, or other hardware and software, to handle encoding of both audio and video in a common data stream or in separate data streams. If applicable, in some examples, MUX-DEMUX units may conform to the ITU H.223 multiplexer protocol, or to other protocols such as the user datagram protocol (UDP).

[0066] The video encoder 20 and the video decoder 30 each may be implemented as any of a variety of suitable encoder circuitry, such as one or more microprocessors, digital signal processors (DSPs), application specific integrated circuits (ASICs), field programmable gate arrays (FPGAs), discrete logic, software, hardware, firmware, or any combinations thereof. When the techniques are implemented partially in software, a device may store instructions for the software in a suitable non-transitory computer-readable medium and execute the instructions in hardware using one or more processors to perform the techniques of this disclosure. Each of the video encoder 20 and the video decoder 30 may be included in one or more encoders or decoders, either of which may be integrated as part of a combined encoder/decoder in a respective device.

Video coding process
[0067] As mentioned briefly above, the video encoder 20 encodes video data. The video data may comprise one or more pictures. Each of the pictures is a still image forming part of a video. In some instances, a picture may be referred to as a video “frame.” When the video encoder 20 encodes the video data (e.g., video coding layer (VCL) data and/or non-VCL data), the video encoder 20 may generate a bitstream. The bitstream may include a sequence of bits that form a coded representation of the video data. The bitstream may include coded pictures and associated data. A coded picture is a coded representation of a picture. VCL data may include coded picture data (i.e., information associated with samples of one or more coded pictures), and non-VCL data may include control information (e.g., parameter sets and/or supplemental enhancement information) associated with the one or more coded pictures.

[0068] To generate the bitstream, the video encoder 20 may perform encoding operations on each picture in the video data. When the video encoder 20 performs encoding operations on the pictures, the video encoder 20 may generate a series of coded pictures and associated data. The associated data may include a set of coding parameters such as a quantization parameter (QP). To generate a coded picture, the video encoder 20 may partition a picture into equally sized video blocks. A video block may be a two-dimensional array of samples. The coding parameters may define a coding option (e.g., a coding mode) for every block of the video data. The coding option may be selected in order to achieve a desired rate-distortion (RD) performance.

[0069] In some examples, the video encoder 20 may partition a picture into a plurality of slices. Each of the slices may include a spatially distinct region in an image (e.g., a frame) that can be decoded independently without information from the rest of the regions in the image or frame. Each image or video frame may be encoded in a single slice, or each image or video frame may be encoded in several slices. In DSC, the number of bits allocated to encode each slice may be substantially constant. As part of performing an encoding operation on a picture, the video encoder 20 may perform encoding operations on each slice of the picture. When the video encoder 20 performs an encoding operation on a slice, the video encoder 20 may generate encoded data associated with the slice. The encoded data associated with the slice may be referred to as a “coded slice.”

DSC video encoder
[0070] FIG. 2A is a block diagram illustrating an example of the video encoder 20 that may implement techniques in accordance with aspects described in this disclosure. The video encoder 20 may be configured to perform some or all of the techniques of this disclosure. In some examples, the techniques described in this disclosure may be shared among the various components of the video encoder 20. In some examples, additionally or alternatively, a processor (not shown) may be configured to perform some or all of the techniques described in this disclosure.

[0071] For purposes of explanation, this disclosure describes the video encoder 20 in the context of DSC coding. However, the techniques of this disclosure may be applicable to other coding standards or methods.

[0072] In the example of FIG. 2A, the video encoder 20 includes a plurality of functional components. The functional components of the video encoder 20 include a color-space converter 105, a buffer 110, a flatness detector 115, a rate controller 120, a predictor, quantizer, and reconstructor component 125, a line buffer 130, an indexed color history 135, an entropy encoder 140, a substream multiplexer 145, and a rate buffer 150. In other examples, the video encoder 20 may include more, fewer, or different functional components.

[0073] The color-space converter 105 may convert an input color space to the color space used in the coding implementation. For example, in one exemplary embodiment, the color space of the input video data is the red, green, and blue (RGB) color space, and the coding is implemented in the luminance Y, chrominance green Cg, and chrominance orange Co (YCgCo) color space. The color-space conversion may be performed by one or more methods that include shifts and additions applied to the video data. It is noted that input video data in other color spaces may be processed, and conversions to other color spaces may also be performed.
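As an illustration of a conversion built only from shifts and additions, the following minimal sketch uses the well-known reversible lifting form of the YCoCg transform (often called YCoCg-R). Whether color-space converter 105 uses this exact variant is an assumption; the function names are hypothetical and chosen for illustration only.

```python
# Sketch only: reversible RGB <-> YCgCo lifting transform (YCoCg-R).
# Assumption: the disclosure does not state which shift-and-add variant is used.

def rgb_to_ycgco(r, g, b):
    """Forward transform using only additions, subtractions, and shifts."""
    co = r - b
    t = b + (co >> 1)   # arithmetic right shift (floor division by 2)
    cg = g - t
    y = t + (cg >> 1)
    return y, cg, co

def ycgco_to_rgb(y, cg, co):
    """Inverse transform; undoes the lifting steps in reverse order."""
    t = y - (cg >> 1)
    g = cg + t
    b = t - (co >> 1)
    r = b + co
    return r, g, b

if __name__ == "__main__":
    pixel = (200, 120, 30)
    y, cg, co = rgb_to_ycgco(*pixel)
    print((y, cg, co), ycgco_to_rgb(y, cg, co) == pixel)
```

Because each lifting step is undone exactly by its inverse, the round trip is lossless for integer inputs, at the cost of one extra bit of range for the chroma components.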

[0074] In related aspects, the video encoder 20 may include the buffer 110, the line buffer 130, and/or the rate buffer 150. For example, the buffer 110 may hold (e.g., store) the color-space converted video data prior to its use by other portions of the video encoder 20. In another example, the video data may be stored in the RGB color space and color-space conversion may be performed as needed, since the color-space converted data may require more bits.

[0075] The rate buffer 150 may function as part of the rate control mechanism in the video encoder 20, which is described in greater detail below in connection with the rate controller 120. The number of bits spent on encoding each block can vary substantially based on the nature of the block. The rate buffer 150 can smooth the rate variations in the compressed video. In some embodiments, a constant bit rate (CBR) buffer model is employed in which bits stored in the rate buffer (e.g., the rate buffer 150) are removed from the rate buffer at a constant bit rate. In the CBR buffer model, if the video encoder 20 adds too many bits to the bitstream, the rate buffer 150 may overflow. On the other hand, the video encoder 20 may need to add enough bits in order to prevent underflow of the rate buffer 150.
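The CBR buffer model can be illustrated with a small simulation. This is a sketch under stated assumptions: bits are added once per block and drained by a fixed amount per block time, and the function and parameter names are hypothetical rather than taken from the disclosure.

```python
# Sketch only: encoder-side CBR rate buffer model.
# Assumptions: one drain step of `drain_bits_per_block` bits per coded block;
# names are illustrative, not from the disclosure.

def simulate_rate_buffer(block_bits, drain_bits_per_block, buffer_max_size):
    """Return ("ok", final_fullness), ("overflow", i), or ("underflow", i)."""
    fullness = 0
    for i, bits in enumerate(block_bits):
        fullness += bits                   # encoder adds a variable number of bits
        if fullness > buffer_max_size:
            return ("overflow", i)         # too many bits were produced
        fullness -= drain_bits_per_block   # link removes bits at a constant rate
        if fullness < 0:
            return ("underflow", i)        # too few bits were produced
    return ("ok", fullness)

if __name__ == "__main__":
    print(simulate_rate_buffer([120, 80, 100], 100, 150))
    print(simulate_rate_buffer([200], 100, 150))
    print(simulate_rate_buffer([50], 100, 150))
```

The two failure cases correspond to the two constraints in the paragraph above: spending too many bits on a block risks overflow, and spending too few risks underflow.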

[0076] On the video decoder side, bits may be added to the rate buffer 155 of the video decoder 30 (see FIG. 2B, which is described in further detail below) at a constant bit rate, and the video decoder 30 may remove a variable number of bits for each block. To ensure proper decoding, the rate buffer 155 of the video decoder 30 should not “underflow” or “overflow” during the decoding of the compressed bitstream.

[0077] In some embodiments, the buffer fullness (BF) can be defined based on the value BufferCurrentSize, which represents the number of bits currently in the buffer, and BufferMaxSize, which represents the size of the rate buffer 150, i.e., the maximum number of bits that can be stored in the rate buffer 150 at any point in time. The BF may be calculated from these two values (the equation is omitted from this text).
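A plausible form of the BF calculation, assuming BF is expressed as a percentage of the buffer capacity (this is an assumption; the original equation is not available in this text), is:

```latex
\[
\mathrm{BF} = \frac{\mathrm{BufferCurrentSize} \times 100}{\mathrm{BufferMaxSize}}
\]
```

Under this reading, an empty buffer gives BF = 0 and a full buffer gives BF = 100.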

[0078] The flatness detector 115 can detect changes from complex (i.e., non-flat) areas in the video data to flat (i.e., simple or uniform) areas in the video data. The terms “complex” and “flat” are used herein to refer generally to the difficulty for the video encoder 20 to encode the respective regions of the video data. Thus, the term complex as used herein generally describes a region of the video data as being complex for the video encoder 20 to encode and may, for example, include textured video data, high spatial frequency, and/or other features that are complex to encode. The term flat as used herein generally describes a region of the video data as being simple for the video encoder 20 to encode and may, for example, include a smooth gradient in the video data, low spatial frequency, and/or other features that are simple to encode. The transitions between complex and flat regions may be used by the video encoder 20 to reduce quantization artifacts in the encoded video data. Specifically, the rate controller 120 and the predictor, quantizer, and reconstructor component 125 can reduce such quantization artifacts when the transitions from complex to flat regions are identified.

[0079] The rate controller 120 determines a set of coding parameters, e.g., a QP. The QP may be adjusted by the rate controller 120 based on the buffer fullness of the rate buffer 150 and the image activity of the video data in order to maximize picture quality for a target bit rate while ensuring that the rate buffer 150 does not overflow or underflow. The rate controller 120 also selects a particular coding option (e.g., a particular mode) for each block of the video data in order to achieve the optimal RD performance. The rate controller 120 minimizes the distortion of the reconstructed images subject to the bit-rate constraint, i.e., such that the overall actual coding rate fits within the target bit rate.

[0080] The predictor, quantizer, and reconstructor component 125 may perform at least three encoding operations of the video encoder 20. The predictor, quantizer, and reconstructor component 125 may perform prediction in a number of different modes. One example prediction mode is a modified version of median-adaptive prediction. Median-adaptive prediction may be implemented by the lossless JPEG standard (JPEG-LS). The modified version of median-adaptive prediction which may be performed by the predictor, quantizer, and reconstructor component 125 may allow for parallel prediction of three consecutive sample values. Another example prediction mode is block prediction. In block prediction, samples are predicted from previously reconstructed pixels in the line above or to the left in the same line. In some embodiments, the video encoder 20 and the video decoder 30 may both perform an identical search on reconstructed pixels to determine the block prediction usages, and thus no bits need to be sent in the block prediction mode. In other embodiments, the video encoder 20 may perform the search and signal block prediction vectors in the bitstream, such that the video decoder 30 need not perform a separate search. A midpoint prediction mode may also be implemented in which samples are predicted using the midpoint of the component range. The midpoint prediction mode may enable bounding of the number of bits required for the compressed video even in the worst-case sample. As further discussed below with reference to FIGS. 3-26, the predictor, quantizer, and reconstructor component 125 may be configured to code (e.g., encode or decode) a block of video data (or any other unit of prediction) based on one or more techniques described herein. For example, the predictor, quantizer, and reconstructor component 125 may be configured to perform the methods illustrated in FIGS. 3-26. In other embodiments, the predictor, quantizer, and reconstructor component 125 may be configured to perform one or more methods or techniques described herein together with one or more other components of the video encoder 20.
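The three prediction modes named in paragraph [0080] can be sketched as follows. This is illustrative only: `med_predict` is the standard (unmodified) JPEG-LS median predictor rather than the modified parallel version, `midpoint_predict` takes the midpoint to be half of the component range, and `best_block_vector` is a hypothetical one-dimensional search using the sum of absolute differences (SAD) as its cost. All names and the choice of cost are assumptions, not details from the disclosure.

```python
# Sketch only: simplified versions of the three prediction modes.
# Assumptions: standard JPEG-LS MED predictor, midpoint = 2^(bit_depth-1),
# and a 1-D SAD search; none of these details are taken from the disclosure.

def med_predict(left, above, above_left):
    """Standard JPEG-LS median (MED) predictor for one sample."""
    if above_left >= max(left, above):
        return min(left, above)
    if above_left <= min(left, above):
        return max(left, above)
    return left + above - above_left

def midpoint_predict(bit_depth):
    """Predict every sample as the midpoint of the component range."""
    return 1 << (bit_depth - 1)

def best_block_vector(reconstructed, current_block):
    """Return (position, sad) of the best candidate among reconstructed pixels,
    or None if no candidate of the required length exists."""
    n = len(current_block)
    best = None
    for pos in range(len(reconstructed) - n + 1):
        candidate = reconstructed[pos:pos + n]
        sad = sum(abs(p - c) for p, c in zip(candidate, current_block))
        if best is None or sad < best[1]:
            best = (pos, sad)
    return best

if __name__ == "__main__":
    print(med_predict(10, 20, 15))
    print(midpoint_predict(8))
    print(best_block_vector([10, 20, 30, 40, 50], [30, 40]))
```

When the encoder signals the chosen position as a block prediction vector, the decoder can fetch the candidate directly instead of repeating the search, which is the trade-off described in the paragraph above.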

[0081] The predictor, quantizer, and reconstructor component 125 also performs quantization. For example, quantization may be performed via a power-of-2 quantizer, which may be implemented using a shifter. It is noted that other quantization techniques may be implemented in lieu of the power-of-2 quantizer. The quantization performed by the predictor, quantizer, and reconstructor component 125 may be based on the QP determined by the rate controller 120. Finally, the predictor, quantizer, and reconstructor component 125 also performs reconstruction, which includes adding the inverse-quantized residual to the predicted value and ensuring that the result does not fall outside of the valid range of sample values.
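A minimal sketch of a shift-based power-of-2 quantizer and the clamped reconstruction step follows. The sign-magnitude handling, the absence of a rounding offset, and the function names are assumptions made for illustration; the disclosure does not specify them.

```python
# Sketch only: power-of-2 quantization with shifts, then clamped reconstruction.
# Assumptions: sign-magnitude truncation, no rounding offset, unsigned samples.

def quantize(residual, shift):
    """Divide the residual magnitude by 2**shift using a right shift."""
    magnitude = abs(residual) >> shift
    return -magnitude if residual < 0 else magnitude

def dequantize(level, shift):
    """Multiply the quantized magnitude by 2**shift using a left shift."""
    magnitude = abs(level) << shift
    return -magnitude if level < 0 else magnitude

def reconstruct(predicted, level, shift, bit_depth=8):
    """Add the inverse-quantized residual and clamp to the valid sample range."""
    value = predicted + dequantize(level, shift)
    return max(0, min(value, (1 << bit_depth) - 1))

if __name__ == "__main__":
    level = quantize(-13, 2)
    print(level, dequantize(level, 2), reconstruct(100, level, 2))
```

The clamp in `reconstruct` corresponds to ensuring that the result does not fall outside the valid range of sample values.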

[0082] Note that the above-described example approaches to prediction, quantization, and reconstruction performed by the predictor, quantizer, and reconstructor component 125 are merely illustrative, and that other approaches may be implemented. Note also that the predictor, quantizer, and reconstructor component 125 may include subcomponent(s) for performing the prediction, the quantization, and/or the reconstruction. Further, the prediction, the quantization, and/or the reconstruction may be performed by several separate encoder components in lieu of the predictor, quantizer, and reconstructor component 125.

[0083] The line buffer 130 holds (e.g., stores) the output from the predictor, quantizer, and reconstructor component 125 so that the predictor, quantizer, and reconstructor component 125 and the indexed color history 135 can use the buffered video data. The indexed color history 135 stores recently used pixel values. These recently used pixel values can be referenced directly by the video encoder 20 via a dedicated syntax.

[0084] The entropy encoder 140 encodes the prediction residuals and any other data (e.g., indices identified by the predictor, quantizer, and reconstructor component 125) received from the predictor, quantizer, and reconstructor component 125, based on the indexed color history 135 and the flatness transitions identified by the flatness detector 115. In some examples, the entropy encoder 140 may encode three samples per clock per substream encoder. The substream multiplexer 145 may multiplex the bitstream based on a headerless packet multiplexing scheme. This allows the video decoder 30 to run three entropy decoders in parallel, facilitating the decoding of three pixels per clock. The substream multiplexer 145 may optimize the packet order so that the packets can be efficiently decoded by the video decoder 30. Note that different approaches to entropy coding may be implemented, which may facilitate the decoding of a power-of-2 number of pixels per clock (e.g., 2 pixels/clock or 4 pixels/clock).

DSC video decoder
[0085] FIG. 2B is a block diagram illustrating an example of the video decoder 30 that may implement techniques in accordance with aspects described in this disclosure. The video decoder 30 may be configured to perform some or all of the techniques of this disclosure. In some examples, the techniques described in this disclosure may be shared among the various components of the video decoder 30. In some examples, additionally or alternatively, a processor (not shown) may be configured to perform some or all of the techniques described in this disclosure.

[0086] For purposes of explanation, this disclosure describes the video decoder 30 in the context of DSC coding. However, the techniques of this disclosure may be applicable to other coding standards or methods.

[0087] In the example of FIG. 2B, the video decoder 30 includes a plurality of functional components. The functional components of the video decoder 30 include a rate buffer 155, a substream demultiplexer 160, an entropy decoder 165, a rate controller 170, a predictor, quantizer, and reconstructor component 175, an indexed color history 180, a line buffer 185, and a color-space converter 190. The illustrated components of the video decoder 30 are analogous to the corresponding components described above in connection with the video encoder 20 in FIG. 2A. Accordingly, each of the components of the video decoder 30 may operate in a manner similar to the corresponding component of the video encoder 20 described above. In some embodiments, one or more components of the video encoder 20 and/or the video decoder 30 may be implemented by one or more hardware processors configured to execute software code that performs the tasks of such components. In other embodiments, one or more components of the video encoder 20 and/or the video decoder 30 may be implemented by hardware circuitry configured to perform the tasks of such components.

Slices in DSC
[0088] As noted above, a slice generally refers to a spatially distinct region in an image or a frame that can be decoded independently, without using information from the rest of the regions in the image or frame. Each image or video frame may be encoded in a single slice, or it may be encoded in several slices. In DSC, the target bits allocated to encode each slice may be substantially constant.

Block prediction mode
[0089] A single block of video data may contain a number of pixels, and each block of video data has a number of potential coding modes in which the block can be coded. One such coding mode is the block prediction mode. In the block prediction mode, the coder attempts to find a candidate block that is close (e.g., in pixel values) to the current block being coded, either in the previous reconstructed line (e.g., if the current block is not in the first line of the current slice) or in previous reconstructed blocks in the same line (e.g., if the current block is in the first line of the current slice). In some embodiments, closeness between pixel values is determined by the Sum of Absolute Differences (SAD) metric. The coder may attempt to find the candidate block in any portion of the previously reconstructed blocks defined by a search range (which may be, for example, a predetermined value known to both the encoder and the decoder). The search range is defined such that the encoder has potential candidates within the search range from which to find a good match while minimizing the search cost. The coding efficiency of the block prediction mode comes from the fact that, if a good candidate is found (i.e., a candidate within the search range determined to be close in pixel values to the current block being coded), the difference between the candidate block and the current block (known as the residual) will be small. A small residual takes fewer bits to signal than the actual pixel values of the current block, which results in a lower RD cost and increases the likelihood of being selected by the RD mechanism. For certain types of graphics content, the performance boost from enabling the block prediction mode is very significant.
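The SAD-based candidate search described above can be sketched as follows for the 1D case. This is a minimal illustration under stated assumptions: the function names, the list-of-integers sample representation, and the tie-breaking rule (first minimum wins) are hypothetical and are not specified by this disclosure.

```python
def sad(a, b):
    """Sum of Absolute Differences between two equal-length sample lists."""
    return sum(abs(x - y) for x, y in zip(a, b))


def find_best_candidate(reconstructed, current, start_positions):
    """Return (start_position, cost) of the lowest-SAD candidate block.

    reconstructed   -- previously reconstructed samples (one line, one component)
    current         -- samples of the block being coded
    start_positions -- candidate start indices allowed by the search range
    Positions whose block would fall outside `reconstructed` are skipped.
    """
    width = len(current)
    best_pos, best_cost = None, None
    for k in start_positions:
        if k < 0 or k + width > len(reconstructed):
            continue
        cost = sad(reconstructed[k:k + width], current)
        if best_cost is None or cost < best_cost:
            best_pos, best_cost = k, cost
    return best_pos, best_cost


def residual(current, candidate):
    """Residual = current block minus candidate block, sample by sample."""
    return [c - p for c, p in zip(current, candidate)]
```

With `reconstructed = [10, 10, 50, 52, 54, 56, 10, 10]` and `current = [51, 52, 55, 56]`, searching start positions 0 through 4 selects position 2 with a SAD of 2, and the residual `[1, 0, 1, 0]` is much cheaper to signal than the raw pixel values.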

Parameters in block prediction mode
[0090] The block prediction mode is designed to produce, given a specified search range, the candidate block that provides the minimum distortion from the current block to be encoded. In some embodiments, the minimum distortion is defined using SAD. In some implementations of the present disclosure, the block prediction method is defined by three parameters: the search range (SR), the skew (α), and the partition size (β). These three parameters affect the performance of the block prediction mode and may be tuned (i.e., modified or reconfigured) during implementation. These parameters may be known to both the encoder and the decoder.

Search space in block prediction mode
[0091] In some embodiments of the present disclosure, the search space (e.g., the spatial locations of pixels that the encoder may search in order to find a candidate block) may differ based on the characteristics of the current block. The search space may encompass all previously reconstructed blocks/pixels, but the encoder and/or the decoder may limit the search for a candidate block to a specified portion of the search space (e.g., a "search range" defined by one or more parameters that are either predefined or signaled in the bitstream), for example, to reduce computational complexity. Examples of the block prediction search space are shown in FIGS. 3-6. FIGS. 3 and 4 illustrate cases involving a current block (e.g., current blocks 308 and 408) that is not in the first line of the current slice. FIGS. 5 and 6 illustrate cases involving a current block (e.g., current blocks 506 and 606) that is in the first line of the current slice. These two cases are handled separately because the first line in a slice has no vertical neighbors. Therefore, reconstructed pixels from the current line may be used as the search range (e.g., search ranges 508 and 608). In this disclosure, the first line in the current slice may be referred to as the FLS, and any other line in the current slice may be referred to as the NFLS.

[0092] Further, the block prediction techniques described herein may be implemented either in a codec using a single line buffer (i.e., a 1D block size) or in a codec using multiple line buffers (i.e., a 2D block size). Examples of the search space for the 1D case are shown in FIGS. 3 and 5, and examples of the search space for the 2D case are shown in FIGS. 4 and 6. In the 2D case, the search range may include pixels from previous reconstructed lines (e.g., previous line 402) or reconstructed blocks from the same lines as those in the 2D block (e.g., previous 604 in the current line 602, immediately to the left of the current block 606). A 2D block may be partitioned horizontally, vertically, or both. In cases involving block partitions, a block prediction vector may be specified for each block partition.

Example implementation of block prediction mode
[0093] In some embodiments of the present disclosure, a distortion metric other than SAD may be used, for example, the sum of squared differences (SSD). Alternatively or additionally, the distortion may be modified by weighting. For example, if the YCoCg color space is being used, the cost may be calculated as follows:

[0094] The block prediction techniques described herein may be performed in either the RGB or the YCoCg color space. In addition, an alternative implementation may use both color spaces and signal a 1-bit flag to the decoder indicating which of the two color spaces is selected (e.g., whichever color space has the lowest cost in terms of rate and distortion).

[0095] In some embodiments of the present disclosure relating to the FLS, the direct previous reconstructed block or blocks may be excluded from the search range due to pipelining and timing constraints. For example, depending on the hardware implementation, the coder may not have completed processing of the direct previous reconstructed block by the time the current block is processed by the coder (e.g., the reconstructed pixels for the previous block may not be known when the coder starts processing the current block), resulting in delays or failures. In such an implementation, the pipelining problem described above may be solved by restricting the use of previous reconstructed blocks to those blocks for which reconstructed pixel values are known (e.g., by excluding the direct previous reconstructed block or blocks). In some embodiments of the present disclosure relating to the NFLS, the search range to the left of the current block may be from the same line rather than from the previous reconstructed line. In some of such embodiments, one or more previous reconstructed blocks may be excluded from the search range due to pipelining and timing constraints.

Example implementation of NFLS
[0096] As shown in FIG. 3, the block prediction method may search the search range 310 (SR) in the search space to find a candidate for the current block 308 (and similarly in the search space 400 of FIG. 4). If the x-coordinate position of the first pixel of the current block 308 to be encoded is j, the set of starting positions k of all candidate blocks within the search space may be given as follows:

[0097] In this example, the parameter α skews the x-coordinate position of the search range 310 relative to the current block to be encoded. A higher value of α shifts the search range 310 to the right, while a lower value of α shifts the search range 310 to the left. For example, (i) an SR of 32 and an α of 15 may place the search range 310 in the center of the previous line 302, (ii) an SR of 32 and an α of 0 may place the search range 310 on the left side of the previous line 302, and (iii) an SR of 32 and an α of 31 may place the search range 310 on the right side of the previous line 302.
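To make the effect of the skew concrete, the sketch below enumerates candidate start positions for a given j, SR, and α. The interval used, k in [j - SR + α + 1, j + α], is an assumption: the formula is not reproduced in this text, and this interval was chosen only because it contains exactly SR positions and matches the three SR = 32 examples above (centered for α = 15, to the left for α = 0, to the right for α = 31). The function name is hypothetical.

```python
def nfls_start_positions(j: int, sr: int, alpha: int) -> range:
    """Candidate start positions k for a block whose first pixel is at x = j.

    Assumed interval (not taken from the disclosure):
        j - sr + alpha + 1 <= k <= j + alpha
    Larger alpha moves the window right; smaller alpha moves it left.
    """
    first = j - sr + alpha + 1
    last = j + alpha
    return range(first, last + 1)
```

Under this assumption, `nfls_start_positions(100, 32, 15)` covers positions 84 through 115 (roughly centered on j = 100), `alpha=0` covers 69 through 100, and `alpha=31` covers 100 through 131. Positions that fall outside the slice would still need the default-value handling described for out-of-slice pixels.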

[0098] In some implementations of the present disclosure, a pixel that is within the search range but outside the slice boundary may be set to half the dynamic range for that pixel. For example, if the content is RGB888, a default value of 128 may be used for R, G, and B. If the content is in the YCoCg space, a default value of 128 may be used for Y, and a default value of 0 may be used for Co and Cg (e.g., Co and Cg are 9-bit values centered around 0).
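The default-value rule can be sketched as follows. The function names, the tuple-per-pixel layout, and the string labels for the color spaces are assumptions made for illustration; only the default values themselves (half the dynamic range for R, G, B, and Y, and 0 for Co and Cg) come from the text above.

```python
def default_pixel(color_space: str, bit_depth: int = 8):
    """Default pixel for positions outside the slice boundary."""
    half = 1 << (bit_depth - 1)  # half the dynamic range, e.g. 128 for 8 bits
    if color_space == "RGB":
        return (half, half, half)
    if color_space == "YCoCg":
        return (half, 0, 0)  # Co and Cg are centered around 0
    raise ValueError(f"unsupported color space: {color_space}")


def fetch_pixel(line, x: int, color_space: str, bit_depth: int = 8):
    """Return line[x] if x lies inside the slice, else the default pixel."""
    if 0 <= x < len(line):
        return line[x]
    return default_pixel(color_space, bit_depth)
```

A search routine could call `fetch_pixel` for every position in the search range, so that candidates partially outside the slice are still well defined on both the encoder and decoder sides.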

Example implementation of FLS
[0099] As shown in FIG. 5, the search range may be different for the FLS case. This is because vertical neighbors are not available, either because such vertical neighbors are outside the current frame or because such vertical neighbors are contained within a different slice. In some embodiments of the present disclosure relating to the FLS case, pixels in the current line may be used for block prediction. In one embodiment, any pixel in the current line to the left of the current block may be considered part of the search range. In another embodiment, one or more previously coded blocks (e.g., the previous block 504 immediately to the left of the current block) may be excluded from the search range due to pipelining and timing constraints.

[0100] In some implementations of the FLS, the available range for the first few blocks in the first line of the slice may be smaller than the search range generally expected for other blocks. This is because the valid positions for a candidate block start at the beginning of the line and end before the current block. For the first few blocks in the FLS, this valid range may be smaller than the desired range (e.g., 32 or 64 positions). Therefore, for these blocks, the search range may need to be adjusted so that each block partition of the candidate block is fully contained within the search range. For the NFLS, the search range may be shifted left or right so that the total number of search positions equals the defined search range (e.g., 32 or 64 pixel positions). Since j is the first pixel in the current block, the last pixel in the current block is j + blkWidth - 1. For this reason, the search range may need to be shifted (blkWidth - 1) pixels to the left.

[0101] In some implementations of the FLS, if the x-coordinate location of the first pixel of the current block to be encoded is referred to as j, the set of starting positions of all candidate blocks within the search range may be given as follows.

[0102] (i) If the most recent previous reconstructed block is part of the search range, e.g., α = 1,

[0103] (ii) If the n most recent previous reconstructed blocks are excluded from the search range,

[0104] Here, blkx is the block width. As described above in connection with the NFLS case, any pixel outside the slice boundary may be set to a default value. Note also that no skew parameter need be associated with the FLS case.

Example flowchart for coding in block prediction mode
[0105] With reference to FIG. 7, an example procedure for coding a block of video data in the block prediction mode will be described. The steps illustrated in FIG. 7 may be performed by a video encoder (e.g., the video encoder 20 in FIG. 2A), a video decoder (e.g., the video decoder 30 in FIG. 2B), or component(s) thereof. For convenience, method 700 is described as performed by a video coder (also referred to simply as a coder), which may be the video encoder 20, the video decoder 30, or another component.

[0106] The method 700 begins at block 701. At block 705, the coder determines a candidate block to be used for predicting a current block in a current slice. The candidate block may be within a range of locations (or pixel positions) defined by one or more block prediction parameters. For example, the block prediction parameters may include (i) a search range parameter defining the size of the range of locations, (ii) a skew parameter defining the relative location of the range of locations with respect to the current block, and (iii) a partition size parameter defining the size of each partition in the current block. In some embodiments of the present disclosure, each of the search range parameter, the skew parameter, and the partition size parameter defines the locations of the candidate block spatially rather than temporally.

[0107] At block 710, the coder determines a prediction vector based on the candidate block and the current block. The prediction vector may identify the location of the candidate block with respect to the current block. The prediction vector may include one or more coordinate values (e.g., a coordinate value indicating an offset in 1D space). At block 715, the coder codes the current block in the block prediction mode at least in part via signaling the prediction vector. In some embodiments, the coder may also signal the residual between the candidate block and the current block. Bit savings may be achieved by signaling the prediction vector identifying the location of the candidate block and the residual representing the difference between the current block and the candidate block, rather than having to signal the actual pixel values of the current block. The method 700 ends at block 720.

[0108] In the method 700, one or more of the blocks shown in FIG. 7 may be removed (e.g., not performed), and/or the order in which the method is performed may be switched. In some embodiments, additional blocks may be added to the method 700. The embodiments of the present disclosure are not limited to or by the example shown in FIG. 7, and other variations may be implemented without departing from the spirit of this disclosure.

After finding a candidate block
[0109] After the best candidate block has been determined, the pixel values of the candidate block are subtracted from the pixel values of the current block, resulting in the residual. The residual may be quantized based on a pre-selected QP associated with the block prediction mode. The quantized residual may be encoded using a codebook (which may be either fixed-length or variable-length) and signaled using a fixed-length code or a variable-length code. The selected codebook may be based on coding efficiency and hardware complexity requirements. For example, the selected codebook may be an Exp-Golomb codebook. In some embodiments of the present disclosure, an entropy coding scheme similar to the delta size unit variable length coding (DSU-VLC) of existing DSC implementations may be used. In some embodiments, the residual may be transformed (e.g., using a discrete cosine transform, a Hadamard transform, or other known transforms) before the quantization described above.

[0110] In some embodiments of the present disclosure, the samples in the residual of the current block may be partitioned into multiple groups (e.g., four samples per group for a block containing 16 samples). If all the coefficients in the block are zero, the residual of the block is coded using skip mode, i.e., a 1-bit flag per block (per component) is signaled to indicate whether the current component in the block is coded using skip mode. If at least one non-zero value is contained within the block, each group may be coded using DSU-VLC only if the group has a non-zero value. If a group (e.g., four of the 16 samples in the residual) does not contain any non-zero values, the group is coded using skip mode, i.e., a 1-bit flag per group is signaled to indicate whether the group is coded using skip mode. More specifically, for each group, a search may be performed to determine whether all the values in the group are zero. If all the values in the group are zero, a value of "1" may be signaled to the decoder; otherwise (if at least one value is non-zero), a value of "0" may be signaled to the decoder, followed by the DSU-VLC coding of the group. In an alternative example, a value of "0" may be signaled if all the values in the group are zero, and a value of "1" may be signaled if the group contains at least one non-zero value.
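The block-level and group-level skip signaling can be sketched as below. This is a simplified illustration: it only derives the flags for one component, uses the first convention above ("1" means skipped), and does not model the DSU-VLC coding of non-skipped groups. The function name and return layout are assumptions.

```python
def skip_flags(residual, group_size: int = 4):
    """Derive skip flags for one component of a residual block.

    Returns (block_skip, group_flags):
      block_skip  -- 1 if every sample in the block is zero, else 0
      group_flags -- per-group flags (1 = group is all zero and skipped,
                     0 = group would be followed by DSU-VLC coding);
                     empty when the whole block is skipped
    """
    if all(v == 0 for v in residual):
        return 1, []
    groups = [residual[i:i + group_size]
              for i in range(0, len(residual), group_size)]
    return 0, [1 if all(v == 0 for v in g) else 0 for g in groups]
```

For a 16-sample residual in which only the second and fourth groups contain non-zero values, this yields a block flag of 0 and group flags `[1, 0, 1, 0]`, so only two of the four groups would carry coded samples.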

[0111] In some embodiments of the present disclosure, the best candidate block is explicitly signaled to the decoder by transmitting a fixed-length code containing the best offset. The offset may be referred to as a "vector." The advantage of explicitly signaling the vector to the decoder is that the decoder does not need to perform the block search itself. Rather, the decoder explicitly receives the vector and adds the candidate block to the decoded, inverse-quantized residual values to determine the pixel values of the current block.
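The decoder-side step described above can be sketched as follows for the 1D case. The representation of the vector as a signed offset relative to the current block's first pixel position j, the function name, and the 8-bit clamp are assumptions for illustration; the residual is taken to be already decoded and inverse-quantized.

```python
def decode_block(reference, j: int, vector: int, residual, bit_depth: int = 8):
    """Rebuild the current block from a signaled vector and residual.

    reference -- reconstructed samples the vector points into
    j         -- x position of the current block's first pixel
    vector    -- assumed here to be a signed offset from j
    residual  -- decoded, inverse-quantized residual samples
    """
    start = j + vector
    if start < 0 or start + len(residual) > len(reference):
        raise ValueError("vector points outside the reference samples")
    candidate = reference[start:start + len(residual)]
    max_value = (1 << bit_depth) - 1
    # Add residual to candidate and clamp to the valid sample range.
    return [max(0, min(max_value, c + r)) for c, r in zip(candidate, residual)]
```

Note that no search loop appears on the decoder side: one slice of the reference line and one addition per sample are enough once the vector is known.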

Block partitioning
[0112] In some embodiments of the present disclosure, the current block to be coded may be partitioned, resulting in multiple candidate blocks and multiple vectors per block. In some of such embodiments, the vector(s) may be explicitly signaled using a fixed-length code. For example, the length of this fixed-length code may be log_2(SR). In another embodiment, the vector(s) may be explicitly signaled using a variable-length code, such as a code from the Exponential-Golomb or Golomb-Rice code families. This codebook may be selected based on the statistical distribution associated with the vector(s). In yet another embodiment, the vector(s) may be predicted based on the previously coded vector(s), and the residual of the vector(s) may be coded using some fixed-length or variable-length code. In yet another embodiment, the vector(s) may be predicted based on the previously coded vector(s), and a 1-bit flag may be used to signal whether the two vectors are identical. This flag may be referred to as SameFlag. If SameFlag = 1, the vector value itself need not be signaled to the decoder. If SameFlag = 0, the vector is signaled explicitly (e.g., using either a fixed-length or a variable-length code). An example block partitioning scheme is illustrated in FIG. 8.
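The SameFlag variant with a fixed-length vector code can be sketched as follows. The function names and the tuple layout of the emitted fields are hypothetical, and vectors are assumed to be indices in the range 0 to SR - 1; for a power-of-2 SR the fixed length equals log_2(SR).

```python
def fixed_length_bits(sr: int) -> int:
    """Bits needed to index sr positions; equals log2(sr) for powers of 2."""
    return (sr - 1).bit_length()


def signal_vectors(vectors, sr: int, previous=None):
    """List the (field, value, bits) entries signaled for a block's vectors.

    A SameFlag of 1 means the vector equals the previously coded vector,
    so the vector value itself is not sent.
    """
    bits = fixed_length_bits(sr)
    fields = []
    for v in vectors:
        same = int(previous is not None and v == previous)
        fields.append(("SameFlag", same, 1))
        if not same:
            fields.append(("Vector", v, bits))
        previous = v
    return fields
```

With SR = 32 and two partitions that both select vector 5 (and no earlier vector available), the first partition costs 1 + 5 bits and the second costs a single SameFlag bit, for 7 bits in total instead of 10 bits for two explicit fixed-length vectors.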

[0113] As shown in diagram 800 of FIG. 8, current block 802 contains a single partition. The information signaled for current block 802 comprises a mode header, a vector SameFlag, vector A, and a payload. Current block 804 contains two partitions, partition A and partition B. The information signaled for current block 804 comprises a mode header, a vector SameFlag, vector A, a vector SameFlag, vector B, and a payload. As described above, one or more of the items listed above may not be signaled. For example, if a vector SameFlag is equal to 1, the vector that follows it need not be signaled.

[0114] The partition size β may determine how the current block is partitioned into separate sub-blocks. In such cases, a separate block prediction may be performed for each sub-block. For example, if the block size is N = 16 and the partition size is β = 8, the search is performed for each of the 16/8 = 2 partitions. In another example, if β = N, block partitioning is disabled. If β < N, each vector may be explicitly signaled to the decoder. If vector prediction (e.g., using a previously signaled vector to define the current vector) is not employed, each vector is signaled using a fixed-length or variable-length code. If vector prediction is employed, the first vector may be predicted from the previously coded vector (e.g., one stored in memory), and for n > 0, vector n is predicted from vector n−1.

Variable partition size in block prediction mode
[0115] The examples above illustrate how a block having a size of 1×8 (e.g., a height of 1 pixel and a width of 8 pixels) or 2×8 (e.g., a height of 2 pixels and a width of 8 pixels) may be coded in block prediction mode. As shown in FIG. 8, a block may be partitioned into multiple regions, each region may be coded using a different partitioning scheme (e.g., using 1×2 partitions, using 2×2 partitions, etc.), and a block prediction vector may be specified for each partition (e.g., signaled in the bitstream together with the residual associated with each partition). For example, each block may be partitioned into multiple 1×2 partitions of two pixels each (or into partitions of another fixed size).

[0116] In other embodiments, the encoder may determine the most efficient block partition size for each block (or each sub-region within the block). Efficiency may be measured based on the rate and distortion associated with coding the block (or a sub-region of it) using a given block partition size. For example, when coding a block that contains four 2×2 regions, the encoder may determine that maximum coding efficiency is achieved by coding the first three 2×2 regions using a single partition each (e.g., one 2×2 partition per 2×2 region) and coding the fourth 2×2 region using two partitions (e.g., two 1×2 partitions). Allowing the encoder to select the partition size adaptively for each block may further improve the performance of the block prediction scheme. This is because large partitions may be used for smooth regions (e.g., regions whose pixel values do not change, or change by less than a threshold amount, across the region), thereby requiring fewer bits (e.g., relative to the size of the region) to signal block prediction vectors, while smaller partitions may be used for complex regions (where the reduction in distortion and/or entropy coding rate outweighs the additional signaling cost). For example, the encoder may determine whether a given region or block satisfies a smoothness threshold condition and, in response to determining that it does, encode the region or block in block prediction mode using a larger partition size (and otherwise encode it in block prediction mode using a smaller partition size). As another example, the encoder may determine whether a given region or block satisfies a complexity threshold condition and, in response to determining that it does, encode the region or block in block prediction mode using a smaller partition size (and otherwise encode it in block prediction mode using a larger partition size). The ability to select among different partition sizes adaptively may allow block prediction mode to be used across a wider range of content types (e.g., graphics content, natural images, test patterns, fine text rendering).

Exemplary data flow for coding in block prediction mode
[0117] FIG. 9 illustrates an exemplary data flow 900 for coding a block in block prediction mode using adaptive partition sizes. As illustrated in FIG. 9, the current block 902 to be predicted in block prediction mode includes a block partition 904. In one example, the block partition has a size of 1×2 or 2×2. A block prediction (BP) search 906 is performed to identify available, already-coded blocks or partitions for predicting the current block 902 (or block partition 904) in block prediction mode. As shown in FIG. 9, the BP search 906 may search within a search range that includes, for example, one or more previously reconstructed blocks 907A in a previous line (e.g., a line coded before the current line containing the current block, such as the immediately preceding line or another preceding line) and/or previously reconstructed blocks 907B from the current line (e.g., the line containing the current block).
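A BP search of this kind can be pictured as a scan over already-reconstructed pixels. The sketch below is a simplified 1-D version under stated assumptions: the search range is taken to be the most recent `search_range` start positions in a single list of reconstructed samples, the match metric is a plain sum of absolute differences, and the function name and return layout are hypothetical.

```python
def bp_search(current, reconstructed, search_range):
    """Return (offset, sad, candidate) for the best match, or None if none fits."""
    n = len(current)
    best = None
    for offset in range(search_range):
        # Offset 0 is the candidate ending at the most recent reconstructed pixel.
        start = len(reconstructed) - n - offset
        if start < 0:
            break
        candidate = reconstructed[start:start + n]
        sad = sum(abs(c - p) for c, p in zip(current, candidate))
        if best is None or sad < best[1]:
            best = (offset, sad, candidate)
    return best


print(bp_search([31, 41], [10, 20, 30, 40, 50, 60], search_range=4))
# prints (2, 2, [30, 40])
```

The returned offset plays the role of the block prediction vector for the partition.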

[0118] The encoder determines a block predictor 908 based on the candidate block or partition identified in the search range. At block 910, the block predictor 908 is subtracted from the current block 902 (or from the current block partition 904 within the current block 902), and at block 912 the resulting residual is quantized. The quantized residual is entropy coded by entropy coder 920. In addition, inverse quantization 914 is performed on the quantized residual, and at block 916 the result is added to the block predictor 908 to produce a reconstructed block 918. BP partition size selection 922 is performed based on the distortion performance (D) of the reconstructed block 918 and the rate performance (R) of the entropy-coded residual. A bitstream 924 is generated based on the selected BP partition size.
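The subtract, quantize, dequantize, and add steps of this data flow can be sketched for a single partition as follows. The uniform quantizer that rounds toward zero and the `step` parameter are assumptions chosen for illustration; the text does not specify the quantizer, and entropy coding is omitted.

```python
def code_partition(current, predictor, step):
    """Return (quantized residual, reconstructed pixels, absolute-error distortion)."""
    residual = [c - p for c, p in zip(current, predictor)]
    # Assumed quantizer: uniform step size, rounding toward zero.
    quantized = [int(r / step) for r in residual]
    dequantized = [q * step for q in quantized]
    # Reconstruction mirrors the decoder: predictor + dequantized residual.
    reconstructed = [p + d for p, d in zip(predictor, dequantized)]
    distortion = sum(abs(c - r) for c, r in zip(current, reconstructed))
    return quantized, reconstructed, distortion


print(code_partition([100, 104], [98, 99], step=2))
# prints ([1, 2], [100, 103], 1)
```

The distortion of the reconstruction, together with the bits needed for the quantized residual, is what a partition size selection stage would then weigh.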

[0119] For example, the BP partition size selection 922 may receive as input the rate (e.g., R) and distortion (e.g., D) of each partition region (e.g., 2×2) within the current block 902 and may determine, based on the RD tradeoff between the two options, whether the region should be predicted using a single block prediction vector (BPV) (e.g., one BPV in total for a single 2×2 partition) or partitioned and predicted using multiple BPVs (e.g., two BPVs in total, one for each of two 1×2 partitions). Although some examples discussed herein involve a 2×2 partition region size (and thus have 1×2, 2×1, and 2×2 as selectable partition sizes), the partition sizes selectable by the encoder are not limited to those used in such examples (e.g., 1×2 and 2×2) and may include other sizes (e.g., 2×1) based on the block size and/or region size.

[0120] In some embodiments, the partition size is fixed within the current partition region or block (e.g., 1×2, 2×2, or any other sub-combination of pixels). For example, a block may have a block size of 2×8 and may be divided into sub-blocks or regions of size 2×2. The 2×2 sub-blocks or regions within the 2×8 block may be further partitioned into partitions of size 1×2. In such an example, each 1×2 partition may be predicted using a single BPV, independently of the other partitions. In other embodiments, the partition size is variable, and the encoder may determine which partition size is used to code each block, sub-block, and/or region in block prediction based on the rate and distortion performance of each partitioning scheme. For example, for a 2×2 region in the current block (e.g., the current region), if predicting the current region by dividing it into two 1×2 partitions and predicting the two 1×2 partitions separately with two BPVs (e.g., each pointing to a previously coded 1×2 partition within a defined search range) yields better rate and/or distortion performance (e.g., compared with other partitioning schemes such as 2×2), the current region may be predicted using the 1×2 partitioning scheme. Conversely, if predicting the current region as a single 2×2 partition with one BPV (e.g., pointing to a previously coded 2×2 partition within a defined search range) yields better rate and/or distortion performance (e.g., compared with other partitioning schemes such as 1×2), the current region may be predicted using the 2×2 partitioning scheme. The process of determining the partitioning scheme used to code a block in block prediction mode is described in more detail below with reference to FIG. 14.

Block size and sub-block size
[0121] For a block size of M×N, some embodiments are described with reference to sub-blocks (also referred to herein as regions) of size M_sub × N_sub, where M_sub ≤ M and N_sub ≤ N. In some implementations, to simplify computation, both M_sub and N_sub are aligned to the entropy coding groups within the M×N block. Each M_sub × N_sub sub-block within the block may be either (i) predicted using a single BPV without further partitioning, or (ii) partitioned into multiple partitions (e.g., two 1×2 partitions), with one BPV used for each partition. The tradeoff between using a single BPV for the entire sub-block and partitioning the sub-block into partitions that each have their own BPV is that signaling more BPVs costs additional rate in the bitstream, whereas using more BPVs may reduce distortion and the entropy coding rate. In other words, spending more bits to signal additional BPVs may reduce the residual (the difference between the candidate block/region and the current block/region) that must be signaled, which in turn reduces the number of bits used for entropy coding. The encoder may compare the options (e.g., no partitioning versus multiple partitions) in terms of RD cost and, based on that comparison, select whether to partition each sub-block or region, or select from among multiple partitioning schemes the one that provides the best RD performance.

Exemplary partitioning schemes
[0122] FIG. 10 shows a diagram 1000 illustrating exemplary partitioning schemes. FIG. 10 shows two partition options for a 2×2 sub-block or region. In this example, block 1002 (e.g., containing pixels X₀ through X₁₅) has a size of 2×8, and the sub-block or region 1004 within the block (e.g., containing pixels X₀, X₁, X₈, and X₉) has a size of 2×2. Partition option 1006 illustrates an example in which sub-block or region 1004 is predicted using a single BPV, and partition option 1008 illustrates an example in which sub-block or region 1004 is predicted using two BPVs, one for each 1×2 partition within it. In some implementations, such as Advanced DSC (Adv-DSC), sub-blocks or regions of size 2×2 are used so that they align with the entropy coding group structure 1100 for block prediction mode shown in FIG. 11. In the example of FIG. 11, entropy coding groups 0, 1, 2, and 3 are shown, each corresponding to one of the four 2×2 sub-blocks or regions in the block. However, the techniques described herein are not limited to such embodiments and may be extended to any block size M×N and any sub-block size M_sub × N_sub. In the examples illustrated below, the parameters M = 2, N = 8, M_sub = 2, and N_sub = 2 are used. In some embodiments, the sub-blocks and/or partitioning scheme may be determined based on the entropy coding groups. For example, they may be determined such that each sub-block and/or partition is contained within a single entropy coding group.

Determining the partition size
[0123] Based on the minimum RD cost, the encoder may determine whether to (i) code each 2×2 region as a single 2×2 partition, or (ii) divide the region into two 1×2 partitions and code each 1×2 partition separately. The RD cost may be calculated as shown below.

[0124] In some implementations, each BPV is signaled using a fixed number of bits (BPV_bits) equal to log₂(SR), where SR is the search space (or search range) associated with block prediction mode. For example, if the search space consists of 64 positions, log₂(64) = 6 bits are used to signal each BPV.

[0125] The search space for block prediction with variable partition sizes may differ slightly from the search ranges discussed with reference to FIGS. 3-6. In particular, an M_sub × N_sub sub-block may use a search space of height M_sub. In such cases, implementing block prediction with variable partition sizes may require an additional line buffer compared with block prediction without variable partition sizes. An example of such a search space is shown in FIG. 12 for a 2×2 sub-block size. FIG. 12 shows a diagram 1200 illustrating an exemplary search range. As shown in FIG. 12, current line 1202 includes (i) a current block 1204 containing a current sub-block 1206 and (ii) a previous block 1208. In the example of FIG. 12, previous line 1210 includes a search range 1212 from which the encoder may select a candidate sub-block 1214 to predict the current sub-block 1206. The search range or space for a 1-D partition (e.g., 1×2) is similar to the search range described above with reference to FIG. 3 and may depend on a single previously reconstructed line.

[0126] In some embodiments, the distortions D_2×2 and D_1×2 may be calculated using a modified sum of absolute differences (SAD) in the YCoCg color space. For example, the SAD distortion between pixel A (e.g., in the current sub-block or partition) and pixel B (e.g., in the candidate sub-block or region) in the YCoCg color space may be calculated as follows.

[0127] If the current sub-block or partition has more than one pixel, the distortion for the entire current sub-block or partition may be calculated by summing the individual SADs calculated for each of its pixels. The pixel values of the current sub-block or partition may be actual pixel values or reconstructed pixel values (e.g., calculated from a candidate predictor and a residual). In some implementations, the lambda parameter may be fixed at a value of 2. In other implementations, this parameter may be tuned depending on the block size, bit rate, or other coding parameters.
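The per-pixel SAD and its summation over a partition can be sketched as below. The exact form of the "modified" SAD is not reproduced in this text, so the `chroma_weight` parameter is purely an assumption standing in for whatever modification applies; with its default of 1.0 the function is an ordinary SAD over the Y, Co, and Cg components.

```python
def pixel_sad(a, b, chroma_weight=1.0):
    """SAD between two (Y, Co, Cg) pixels; chroma_weight is a hypothetical knob."""
    dy = abs(a[0] - b[0])
    dco = abs(a[1] - b[1])
    dcg = abs(a[2] - b[2])
    return dy + chroma_weight * (dco + dcg)


def partition_distortion(current, candidate, chroma_weight=1.0):
    """Sum the per-pixel SADs over every pixel of the sub-block or partition."""
    return sum(pixel_sad(a, b, chroma_weight) for a, b in zip(current, candidate))


current = [(100, 3, -2), (90, 0, 1)]
candidate = [(98, 1, -2), (93, 0, 0)]
print(partition_distortion(current, candidate))                     # prints 8.0
print(partition_distortion(current, candidate, chroma_weight=0.0))  # prints 5.0
```

Setting `chroma_weight` to zero gives a luma-only distortion.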

[0128] An entropy coding cost EC_bits may be calculated for each 2×2 region. The four samples in each entropy coding group may come either from a 2×2 quantized residual predicted from a single BPV (e.g., a 2×2 partition) or from a 2×2 quantized residual that uses two vectors (e.g., two 1×2 partitions). For example, the entropy coding cost may represent the number of bits required to signal each entropy coding group in the bitstream (e.g., including the vector(s) and the residual). Based on the calculated entropy coding cost, the encoder may select the partitioning scheme with the lowest cost for each 2×2 region. Although some embodiments are discussed with reference to a 2×8 block with a 2×2 sub-block size, 2×2 entropy coding groups, and two partitioning schemes (1×2 and 2×2), the techniques described herein may be extended to other block sizes, sub-block sizes, entropy coding groups, and/or partitioning schemes.
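The quantities named in the preceding paragraphs (the distortions D_2×2 and D_1×2, BPV_bits, EC_bits, and lambda) can be combined into a per-region decision as sketched below. The cost expression is not reproduced in this text, so the conventional Lagrangian form D + lambda × R with R = (number of BPVs) × BPV_bits + EC_bits is an assumption, as are the function names; here EC_bits is taken to cover the residual only.

```python
def rd_cost(distortion, bpv_count, bpv_bits, ec_bits, lam=2):
    """Assumed cost: distortion plus lambda times the total signaled bits."""
    rate = bpv_count * bpv_bits + ec_bits
    return distortion + lam * rate


def choose_partition(d_2x2, ec_2x2, d_1x2, ec_1x2, bpv_bits=6, lam=2):
    """Pick the cheaper scheme for one 2x2 region; ties go to the single partition."""
    cost_2x2 = rd_cost(d_2x2, 1, bpv_bits, ec_2x2, lam)  # one BPV
    cost_1x2 = rd_cost(d_1x2, 2, bpv_bits, ec_1x2, lam)  # two BPVs
    return ("1x2", cost_1x2) if cost_1x2 < cost_2x2 else ("2x2", cost_2x2)


# Complex region: the second vector pays for itself.
print(choose_partition(d_2x2=40, ec_2x2=12, d_1x2=10, ec_1x2=8))  # prints ('1x2', 50)
# Smooth region: the extra vector is wasted rate.
print(choose_partition(d_2x2=2, ec_2x2=4, d_1x2=2, ec_1x2=4))     # prints ('2x2', 22)
```

The two calls mirror the tradeoff described earlier: extra BPV bits are worthwhile only when they buy a larger drop in distortion and residual bits.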

Signaling coding information in the bitstream
[0129] In the 2×8 block 1002 shown in FIG. 10, each of the four 2×2 regions may be partitioned based on the RD cost analysis described above. For example, each 2×2 region may be partitioned into either a single 2×2 partition or two 1×2 partitions. Four examples of such partitioning are illustrated in diagram 1300 of FIG. 13. As shown in FIG. 13, block 1302 has four sub-blocks predicted using the 2×2 partitioning scheme; block 1304 has three sub-blocks predicted using the 2×2 partitioning scheme and one sub-block predicted using the 1×2 partitioning scheme; block 1306 has four sub-blocks predicted using the 1×2 partitioning scheme; and block 1308 has one sub-block predicted using the 2×2 partitioning scheme and three sub-blocks predicted using the 1×2 partitioning scheme. In addition to signaling the BPVs to the decoder, the encoder may also send one bit for each 2×2 region so that the decoder can correctly infer the partitioning. In some implementations, such as the Adv-DSC implementation, a group of four bits indicating the partitioning scheme selected for each region in the block (e.g., each 2×2 region in a 2×8 block) is signaled in the bitstream. In such an implementation, the four bits "1011" may indicate that the first, third, and fourth regions (e.g., 2×2 sub-blocks) in the block are predicted or coded using a first partitioning scheme (e.g., 1×2 partitions), while the second region (e.g., 2×2 sub-block) is predicted or coded using a second partitioning scheme (e.g., a 2×2 partition). In some embodiments, following these four bits in the bitstream, the BPVs may be signaled using a fixed number of bits per BPV. In the preceding example (e.g., the bit sequence "1011"), seven BPVs may be signaled.
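The count of seven BPVs for "1011" follows directly from the bit meanings in the example: a 1 marks a region split into two 1×2 partitions (two BPVs) and a 0 marks a single 2×2 partition (one BPV). The sketch below reproduces that arithmetic; the function names and the 6-bit BPV width (a 64-position search space) are assumptions for illustration.

```python
def bpv_count(partition_bits):
    """Number of BPVs implied by a string of per-region partition bits."""
    per_region = {"1": 2, "0": 1}  # '1' -> two 1x2 partitions, '0' -> one 2x2
    return sum(per_region[bit] for bit in partition_bits)


def signaling_bits(partition_bits, bpv_bits=6):
    """Partition bits plus fixed-length BPVs (residual bits not included)."""
    return len(partition_bits) + bpv_count(partition_bits) * bpv_bits


print(bpv_count("1011"))       # prints 7
print(signaling_bits("1011"))  # prints 46
```

For the four layouts of FIG. 13 under the same bit convention, the counts range from four BPVs (all 2×2) to eight BPVs (all 1×2).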

Exemplary flowchart for coding in block prediction mode
[0130] With reference to FIG. 14, an exemplary procedure for coding a block of video data in block prediction mode is described. The steps illustrated in FIG. 14 may be performed by a video encoder (e.g., video encoder 20 of FIG. 2A) or by one or more of its components. For convenience, method 1400 is described as being performed by a video coder (also referred to simply as a coder), which may be video encoder 20 or another component.

[0131] Method 1400 begins at block 1401. At block 1405, the coder determines one or more first candidate regions to be used to predict a current region (e.g., within a block of video data coded in block prediction mode) using a first partitioning scheme. For example, the first candidate region may be one of the 2×2 regions in a 2×8 block. The first partitioning scheme may be a partitioning scheme in which the current region is partitioned into multiple partitions (e.g., two 1×2 partitions). In some embodiments, the one or more first candidate regions are within a first range of locations associated with the first partitioning scheme (e.g., a search range associated with the first partitioning scheme). The one or more first candidate regions may be stored in a memory of the video encoding device.

[0132] At block 1410, the coder determines one or more second candidate regions to be used to predict the current region using a second partitioning scheme. For example, the second partitioning scheme may be a partitioning scheme in which the current region is not partitioned into multiple partitions (e.g., the current region is coded as a single 2×2 partition). In another example, the second partitioning scheme may be a partitioning scheme in which the current region is partitioned into a number of partitions different from the number used for the first partitioning scheme. In some embodiments, the one or more second candidate regions are within a second range of locations associated with the second partitioning scheme (e.g., a search range associated with the second partitioning scheme). The one or more second candidate regions may be stored in a memory of the video encoding device.

[0133] At block 1415, the coder determines that a first cost associated with coding the current region using the first partitioning scheme is greater than a second cost associated with coding the current region using the second partitioning scheme. For example, the coder may calculate a cost based on the rate and distortion associated with coding the current region using the first partitioning scheme and a cost based on the rate and distortion associated with coding the current region using the second partitioning scheme, and compare the calculated costs.

[0134] At block 1420, the coder codes the current region using the second partitioning scheme, at least in part by signaling one or more prediction vectors that identify the locations of the one or more second candidate regions relative to the current region. Method 1400 ends at block 1425.

[0135] In method 1400, one or more of the blocks shown in FIG. 14 may be removed (e.g., not performed), and/or the order in which the blocks of the method are performed may be changed. In some embodiments, additional blocks may be added to method 1400. The embodiments of the present disclosure are not limited to or by the example shown in FIG. 14, and other variations may be implemented without departing from the spirit of the present disclosure.

Extension to 4:2:0 and 4:2:2 chroma subsampling formats
[0136] In some implementations, the block prediction techniques described in this disclosure (e.g., using variable partition sizes in block prediction mode) may be used only for the 4:4:4 chroma sampling format. This format is used mainly for graphics content. For example, the 4:4:4 chroma sampling format uses image or video data whose color components (e.g., luma and chroma components) have the same sampling rate (e.g., no chroma subsampling is used). However, the 4:4:4 chroma sampling format may be used less often in other video applications. Because of the significant compression that chroma subsampling can provide, both the 4:2:0 and 4:2:2 chroma subsampling formats are widely used in video applications. For example, some versions of DSC (e.g., DSCv1.x) may support 4:2:0 and 4:2:2. Support for such chroma subsampling formats may be used or required in future DSC implementations. Thus, in some embodiments, the block prediction techniques described in this disclosure (e.g., using variable partition sizes in block prediction mode) are extended to the 4:2:0 and/or 4:2:2 formats. Although the 4:2:0 and 4:2:2 chroma subsampling formats are used herein, the various techniques described in this application may be applied to other known sampling formats.

[0137] In some embodiments, the algorithm for block prediction with variable partition sizes works largely in the same way, independent of the chroma sampling format. In such embodiments, regardless of the format (e.g., 4:4:4, 4:2:2, 4:2:0, etc.), the determination of whether to use a single partition (e.g., 2×2) or multiple partitions (e.g., two separate 1×2 partitions), or the determination of the number of partitions (e.g., 1, 2, 3, 4, etc.) used to code the current sub-block or region, may be made for each sub-block or region (e.g., 2×2 block) of luma samples. However, the number of chroma samples in each partition or in each block may differ depending on the subsampling format. In addition, the encoder decision may need to be modified in the 4:2:2 and/or 4:2:0 chroma subsampling formats because alignment with the entropy coding groups may no longer be available for the chroma components. Therefore, the rate for each partition used in the encoder decision (e.g., the rate value associated with a single 2×2 partition or with two separate 1×2 partitions, when the encoder decides based on the minimum RD cost whether to code each 2×2 region as a single 2×2 partition or as two 1×2 partitions) may depend only on the luma samples for 4:2:2 and 4:2:0. For example, when calculating the SAD distortion, any terms associated with the chroma component(s) may be set to zero.
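The luma-only decision described in paragraph [0137] can be illustrated with a minimal Python sketch. The function names and the dictionary layout of a partition (keys "Y", "Co", "Cg") are assumptions made for illustration and do not come from the disclosure; the sketch only shows chroma terms being set to zero for the subsampled formats.

```python
def sad(a, b):
    """Sum of absolute differences between two equal-length sample lists."""
    return sum(abs(x - y) for x, y in zip(a, b))


def partition_distortion(current, candidate, chroma_format):
    """SAD distortion of one partition against one candidate partition.

    `current` and `candidate` are dicts with keys "Y", "Co", "Cg"
    (a hypothetical layout). For 4:2:2 and 4:2:0 the chroma terms are
    set to zero, so only luma samples drive the encoder decision.
    """
    distortion = sad(current["Y"], candidate["Y"])
    if chroma_format == "4:4:4":
        # Full-resolution chroma: include both chroma components.
        distortion += sad(current["Co"], candidate["Co"])
        distortion += sad(current["Cg"], candidate["Cg"])
    return distortion
```

With this sketch, the same pair of partitions yields a smaller distortion value under 4:2:0 than under 4:4:4 whenever the chroma samples differ, because the chroma contribution is simply dropped.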

BP search for 4:2:0 chroma subsampling format
[0138] For 2×2 partitions in 4:2:0 mode (the 4:2:0 chroma subsampling format), each partition may include a single chroma sample for each of the chroma components (e.g., Co and Cg, or Cb and Cr). In some embodiments, the chroma sample used (e.g., to calculate the RD cost and/or to predict the samples in the current region or block) is the one that intersects the partition. In other embodiments, the chroma sample used may be derived from a neighboring partition. An example 2×2 search 1500 for 4:2:0 mode is shown in FIG. 15. In FIG. 15, the chroma sites (e.g., sample/pixel locations having a chroma sample) are indicated using "X". For example, the top-left sample of partition A, the top-right sample of partition B, and the top-left sample of the current partition comprise the chroma sites that intersect the respective partitions. Such chroma sites may be used for all calculations performed for the respective partitions (e.g., for calculating difference values using the chroma sample values).

[0139] For 1×2 partitions in 4:2:0 mode, a distinction may need to be made between a 1×2 partition in the first line of the current block and a 1×2 partition in the second line of the current block, because there are no chroma sites in the second line of the current block. For example, for a partition in the first line of the current block, the calculation of the distortion value may include two luma samples and one chroma sample for each chroma component. For a partition in the second line of the current block, the calculation of the distortion value may include only luma samples (e.g., two luma samples). In the example 1600 of FIG. 16, the current 1×2 partition A is in the first line and includes a chroma site. Thus, the candidate partition selected to predict the current 1×2 partition A is candidate 1×2 partition A, which also includes a chroma site. Similarly, the current 1×2 partition B is in the second line and does not include a chroma site. Thus, the candidate partition selected to predict the current 1×2 partition B is candidate 1×2 partition B, which also does not include a chroma site.

BP search for 4:2:2 chroma subsampling format
[0140] For 2×2 partitions in 4:2:2 mode (the 4:2:2 chroma subsampling format), each partition may include four luma samples and two chroma samples for each of the chroma components (e.g., Co and Cg, or Cb and Cr). An example 2×2 search 1700 for 4:2:2 mode is shown in FIG. 17. In FIG. 17, the chroma sites (e.g., pixel locations having a chroma sample) are indicated using "X". For example, the two left samples of partition A, the two right samples of partition B, and the two left samples of the current partition comprise the chroma sites that intersect the respective partitions. Such chroma sites may be used for all calculations performed for the respective partitions (e.g., for calculating difference values using the chroma sample values).

[0141] For 1×2 partitions in 4:2:2 mode, each partition includes two luma samples and one chroma sample for each of the chroma components (e.g., Co and Cg, or Cb and Cr). Unlike in 4:2:0 mode, there may be no distinction in 4:2:2 mode between a partition in the first line of the current block and a partition in the second line of the current block. An example block prediction search 1800 for 1×2 partitions for 4:2:2 chroma subsampling is illustrated in FIG. 18. In the example of FIG. 18, the current 1×2 partition A is in the first line, the current 1×2 partition B is in the second line, and each of the current partitions A and B includes a chroma site. The current partition A is predicted based on candidate 1×2 partition A, which includes a chroma site in its first sample, and the current partition B is predicted based on candidate 1×2 partition B, which includes a chroma site in its second sample. Thus, regardless of where the chroma site is located within the candidate partition, its chroma sample may be used to predict the chroma sample in the current partition.
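The per-partition chroma sample counts stated in paragraphs [0138] through [0141] can be reproduced with a short Python sketch. The placement of chroma sites at even columns (and, for 4:2:0, at even rows) relative to the block is an assumption made for illustration, since FIGS. 15 through 18 are not reproduced here; the function name is likewise hypothetical.

```python
def chroma_samples_per_component(chroma_format, rows, cols):
    """Count the chroma sites covered by a partition.

    `rows` and `cols` are the row and column indices, relative to the
    block, that the partition covers. Assumed site layout: every sample
    for 4:4:4, even columns for 4:2:2, even rows and even columns for 4:2:0.
    """
    count = 0
    for r in rows:
        for c in cols:
            if chroma_format == "4:4:4":
                is_site = True
            elif chroma_format == "4:2:2":
                is_site = c % 2 == 0
            elif chroma_format == "4:2:0":
                is_site = c % 2 == 0 and r % 2 == 0
            else:
                raise ValueError(f"unknown chroma format: {chroma_format}")
            if is_site:
                count += 1
    return count
```

Under these assumptions a 2×2 partition covers one chroma site per component in 4:2:0 and two in 4:2:2, while a 1×2 partition in the second line covers none in 4:2:0 but one in 4:2:2, matching the counts given in the text.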

Encoder decision
[0142] In the 4:2:2 and 4:2:0 formats, there may be fewer than four entropy coding groups per block for each chroma component. For example, four entropy coding groups may be used for the luma component, two (or one) entropy coding groups may be used for the orange chroma component, and two (or one) entropy coding groups may be used for the green chroma component. The number of entropy coding groups used to code a given block may be determined based on the number of luma or chroma samples in the given block. In some embodiments, the entropy coding groups are determined by the encoder based on the coding mode in which the given block is coded. In other embodiments, the entropy coding groups are set by the applicable coding standard (e.g., based on the coding mode in which the given block is coded).

[0143] In some embodiments, the quantity EC_bits is not necessarily determined exactly by the encoder for chroma. In some of such embodiments, the encoder may decide whether to use 1×2 partitions or a 2×2 partition based on the entropy coding rate calculated using only the luma samples for the 4:2:2 and 4:2:0 formats. In other embodiments, the quantity EC_bits is determined by the encoder for chroma, and the encoder may decide whether to use 1×2 partitions or a 2×2 partition based on the entropy coding rate calculated using both the luma and chroma samples for the 4:2:2 and 4:2:0 formats.

Signaling
[0144] In some embodiments, the number of entropy coding groups transmitted from the encoder to the decoder for each block or for each color component may be changed depending on the chroma subsampling format. In some implementations, the number of entropy coding groups is changed to ensure that the codec throughput is sufficiently high. For example, in 4:4:4 mode, a 2×8 block may include four entropy coding groups, as illustrated in FIG. 11. In such an example, four entropy coding groups may be used (e.g., signaled by the encoder) for each color component (e.g., Y, Co, and Cg). Table 1 describes example changes to the number of entropy coding groups used for the 4:2:2 and 4:2:0 modes. The remainder of the signaling described above (e.g., signaling of the BPVs, signaling of the indication of the partitioning scheme, etc.) may be unchanged for the 4:2:2 and 4:2:0 modes (from the signaling described with respect to 4:4:4 mode). For example, in Table 1, component 0 may correspond to luma (Y), component 1 may correspond to orange chroma (Co), and component 2 may correspond to green chroma (Cg).

Advantages
[0145] One or more block prediction mode techniques described in this disclosure may be implemented using an asymmetric design. The asymmetric design allows the more expensive procedures to be performed on the encoder side, reducing the complexity of the decoder. For example, because the vector(s) are explicitly signaled to the decoder, the encoder does the majority of the work compared to the decoder. This is desirable because the encoder is often part of a system-on-chip (SoC) design that runs at a high frequency on a state-of-the-art process node (e.g., 20 nm or below). The decoder, on the other hand, is likely to be implemented in a display driver integrated circuit (DDIC) chip-on-glass (COG) solution with a limited clock speed and a much larger process size (e.g., 65 nm or above).

[0146] Further, the adaptive selection of the block partition size allows block prediction mode to be used for a wider range of content types. Since explicitly signaling the BPVs is expensive, variable partition sizes allow a reduced signaling cost for image regions that can be predicted well using 2×2 partitions. For highly complex regions, the 1×2 partition size may be selected if the entropy coding rate can be reduced enough to compensate for the higher signaling cost, or if the distortion can be reduced enough that the RD trade-off still favors 1×2. For example, the adaptive selection of the block partition size may increase performance across all content types, including natural images, test patterns, fine text rendering, and the like. In some embodiments, the adaptive partitioning techniques described herein may be extended by considering block partition sizes larger than 2×2 and/or block sizes larger than 2×8.

[0147] One or more techniques described herein may be implemented in a fixed-bit codec that uses a constant-bit-rate buffer model. In such a model, bits stored in the rate buffer are removed from the rate buffer at a constant bit rate. Thus, if the video encoder adds too many bits to the bitstream, the rate buffer may overflow. On the other hand, the video encoder may need to add enough bits to prevent underflow of the rate buffer. Further, on the video decoder side, bits may be added to the rate buffer at a constant bit rate, and the video decoder may remove a variable number of bits for each block. To ensure proper decoding, the rate buffer of the video decoder should not "underflow" or "overflow" during decoding of the compressed bitstream. One or more techniques described herein may ensure that such underflow or overflow does not occur during encoding and/or decoding. In some embodiments, the encoder may operate under a bit-budget constraint, where the encoder has a fixed number of bits with which to code a given region, slice, or frame. In such embodiments, it is critical for the encoder to be able to know exactly (without having to estimate) how many bits each one of multiple coding modes would need in order to code the given region, slice, or frame, so that the encoder can ensure that the bit budget or other bit/bandwidth-related constraints can be satisfied. For example, the encoder may code a given region, slice, or frame in a given coding mode without having to take any precautionary measures against the case in which coding the given region, slice, or frame requires more bits than estimated.
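The encoder-side behavior of the rate buffer described in paragraph [0147] can be sketched as a simple simulation. This is a deliberately simplified model under stated assumptions: the function name, the per-block drain granularity, and the choice to check overflow before draining are illustrative and are not taken from the disclosure or from any DSC specification.

```python
def simulate_rate_buffer(block_bits, drain_per_block, capacity, initial_fullness=0):
    """Track encoder rate-buffer fullness, one entry per coded block.

    Each block adds a variable number of bits; a fixed number of bits
    (`drain_per_block`, an assumed parameter) leaves the buffer per block
    time. Raises ValueError if the buffer would overflow or underflow.
    """
    fullness = initial_fullness
    trace = []
    for index, bits in enumerate(block_bits):
        fullness += bits  # encoder writes the coded block
        if fullness > capacity:
            raise ValueError(f"overflow at block {index}")
        fullness -= drain_per_block  # constant-rate removal
        if fullness < 0:
            raise ValueError(f"underflow at block {index}")
        trace.append(fullness)
    return trace
```

For instance, calling `simulate_rate_buffer([100, 90, 110], 96, 256, initial_fullness=96)` walks the fullness through three blocks without violating either bound, whereas a single 300-bit block against a 256-bit capacity is rejected as an overflow. Knowing the exact bit count of each coding mode in advance is what lets an encoder avoid both failure cases.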

[0148] Further, one or more techniques described herein overcome specific technical problems associated with video compression techniques in transmission over display links. By allowing a region to be coded based on multiple candidate regions (e.g., each partition in the region being predicted based on a corresponding one of the multiple candidate regions), the video encoder and decoder can provide customized prediction based on the nature of the region (e.g., smooth, complex, etc.), thereby improving video encoder and decoder (e.g., hardware and software codec) performance.

Multiple search ranges for block prediction mode
[0149] As described with reference to FIGS. 3-6, the search space (e.g., the spatial locations of pixels that the encoder may search to find a candidate block) may differ based on the characteristics of the current block. For example, the search space may potentially include all previously reconstructed blocks/pixels. In some embodiments, the encoder and/or decoder may limit the search for a candidate block to a specified portion within the search space (e.g., a "search range" defined by one or more parameters that are either predefined or signaled in the bitstream), for example, to reduce computational complexity. In some implementations, block prediction utilizes a single search range for each block coded in block prediction mode. In these implementations, the location of the search range relative to the current block may depend on whether the current block is in the FLS (first line of a slice) or the NFLS (non-first line of a slice). As shown in diagram 1900 of FIG. 19, if the current block 1910 is in the FLS, the search range may be to the left of the current block in the same block line (e.g., FLS search range 1920), and if the current block is in the NFLS, the search range may be in the block line immediately above the current block line (e.g., NFLS search range 1930). The term block line, in addition to having its ordinary meaning, may include all raster scan lines belonging to a block. For example, if the block size is 2×8 pixels (Advanced Display Stream Compression [ADSC] has a standard block size of 2×8 pixels), a block line would include two raster scan lines.

[0150] In contrast, in some embodiments of the present disclosure, the encoder and/or decoder may maintain multiple search ranges. Allowing multiple search ranges to be used for coding a block in block prediction mode may increase the likelihood of locating a better candidate partition (e.g., compared to prior implementations that consider only a single search range for each block coded in block prediction mode), thereby improving the coding efficiency and/or coding performance of block prediction mode. Further, the performance of the block prediction scheme may be improved further by allowing the encoder to adaptively select the search range used to code each block.

[0151] In some of such embodiments, although multiple search ranges may be considered for use in coding a given block in block prediction mode, only one of the search ranges may be usable at a time. For example, each block coded in block prediction mode may be associated with one of the multiple search ranges, but not both. In some embodiments, if a block coded in block prediction mode has multiple partitions, the coding of those partitions may be constrained such that each partition is coded using the same search range selected for the block. By limiting the number of search ranges used for a single block, the encoder can easily signal to the decoder which search range is used, using a single bit. In other embodiments, more than one search range may be used for a single block. For example, a first search range may be used to code a first partition in the single block, and a second search range different from the first search range may be used to code a second partition in the single block.

[0152] In some embodiments of the present disclosure, two search ranges (SR_0 and SR_1) are maintained by the encoder and/or decoder, as shown in diagram 2000 of FIG. 20. For blocks within the FLS, there may be no distinction between the two search ranges (or there may be no difference in the result or performance of block prediction), since using the current block line for reference is the only option. For example, if the current block 2010 is within the FLS, only the SR_1 search range 2020 may be used to code the current block 2010, and the SR_0 search range 2030 may not be available. On the other hand, if the current block 2010 is within the NFLS, the SR_1 search range 2020 and the SR_0 search range 2030 may both be available, and either of the search ranges 2020 and 2030 may be used to code the current block 2010 in block prediction mode. As illustrated in FIG. 20, the SR_0 search range 2030 includes data (e.g., pixels coded prior to the coding of the current block) from the previous reconstructed block line (e.g., the most recently reconstructed one or more block lines), and the SR_1 search range 2020 includes data (e.g., pixels coded prior to the coding of the current block) from the current block line (e.g., to the left of the current block). In some embodiments, one or more of the most recently reconstructed pixels or blocks may be excluded from the search range(s) for pipelining reasons. For example, one or more blocks (e.g., a threshold number of pixels or blocks) immediately to the left of the current block may be excluded from the search range SR_1. The number of pixels or blocks excluded from the search range(s) may depend on pipelining constraints.
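The availability rule and the pipelining exclusion in paragraph [0152] can be sketched as follows, taking SR_1 to be the range in the current block line and SR_0 the range in the previous reconstructed block line, as in the description of FIG. 20. The function names, the range width, and the pixel-based exclusion count are hypothetical parameters chosen for illustration; the disclosure does not fix their values.

```python
def available_search_ranges(in_first_line_of_slice):
    """Names of the search ranges usable for the current block.

    In the first line of a slice (FLS) no previous block line exists,
    so only the current-block-line range SR_1 is available.
    """
    if in_first_line_of_slice:
        return ["SR_1"]
    return ["SR_0", "SR_1"]


def sr1_span(block_x, range_width, excluded_pixels=0):
    """Horizontal span [start, end) of SR_1 in the current block line.

    `excluded_pixels` models the most recently reconstructed pixels
    immediately left of the current block that are skipped for
    pipelining reasons. Returns None if no positions remain.
    """
    end = block_x - excluded_pixels
    start = max(0, end - range_width)
    if end <= start:
        return None
    return (start, end)
```

Near the left edge of a slice the span shrinks, and it disappears entirely when the excluded pixels cover everything to the left of the current block.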

[0153] The encoder may perform the block prediction search for all partitions within the current block independently for each of the two search ranges. For example, if the current block has two partitions, the encoder may perform the block prediction search for the first partition in the first search range and then for the second partition in the first search range. Based on that search, the encoder may determine a first cost for coding the two partitions in the current block using blocks or block partitions in the first search range. The encoder may then perform the block prediction search for the first partition in the second search range and then for the second partition in the second search range. Based on this search, the encoder may determine a second cost for coding the two partitions in the current block using blocks or block partitions in the second search range. Blocks or block partitions in a search range may be selected (e.g., to predict all partitions within the current block) such that the rate and distortion costs are minimized for the entire current block.

[0154] Having determined the cost (e.g., rate and distortion estimates) for each search range, the encoder may select between the two options by minimizing the RD cost (e.g., D + λ·R) as discussed in this disclosure. The encoder may select the search range that yields the lowest RD cost and code the current block using the selected search range. An indication of the search range to be used to decode the current block is transmitted to the decoder, for example, by explicitly signaling a 1-bit flag for each block in the bitstream. Thus, the decoder-side changes required by the use of multiple search ranges are minimal. In essence, one search range is swapped for the other, and all other steps for block prediction may be performed as in implementations that do not use multiple search ranges.
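The per-range search and the selection by minimum RD cost described in paragraphs [0153] and [0154] can be sketched in Python. This is a minimal illustration under stated assumptions: partitions and candidates are flat lists of sample values, distortion is SAD, and the rate counts only a fixed number of bits per block prediction vector (`bits_per_vector` is a hypothetical parameter; residual entropy-coding bits are omitted). The 1-bit flag costs the same for either choice, so it is left out of the comparison.

```python
def sad(a, b):
    """Sum of absolute differences between two equal-length sample lists."""
    return sum(abs(x - y) for x, y in zip(a, b))


def range_cost(partitions, candidates, lam, bits_per_vector):
    """RD cost (D + lam * R) of coding every partition from one search range."""
    total = 0.0
    for part in partitions:
        # Best candidate for this partition within the given range.
        best_distortion = min(sad(part, cand) for cand in candidates)
        total += best_distortion + lam * bits_per_vector
    return total


def select_search_range(partitions, ranges, lam, bits_per_vector):
    """Return (flag, costs): index of the cheapest range and all costs.

    `ranges` holds one candidate list per search range; with two ranges
    the returned index is the value of the 1-bit flag sent to the decoder.
    """
    costs = [range_cost(partitions, cands, lam, bits_per_vector) for cands in ranges]
    flag = min(range(len(costs)), key=costs.__getitem__)
    return flag, costs
```

The decoder side needs only the flag to pick which range the signaled vectors refer to, which is why the disclosure describes the decoder change as minimal.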

[0155] In other embodiments, the 1-bit flag signaling the selected search range may be omitted. In such embodiments, each search range may be associated with a separate instance of block prediction mode, where the search range index may be signaled implicitly by the mode header. For example, if 3 bits are used to signal the coding mode associated with a block and only six coding modes are available to the encoder or decoder, the same 3-bit syntax element may be used to signal two additional coding modes (e.g., one being a block prediction mode that always uses, or uses by default, the first search range, and the other being a block prediction mode that always uses, or uses by default, the second search range). Thus, bit savings may be achieved by utilizing the existing syntax element used to signal the coding mode.
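The implicit signaling in paragraph [0155] can be illustrated with a small mode table. The mode names and the assignment of code values are hypothetical placeholders; the disclosure only states that a 3-bit mode header with six modes in use leaves room for two additional block prediction instances.

```python
MODE_BITS = 3

# Six placeholder modes (hypothetical names) plus two block prediction
# instances, one per search range, filling all 2**3 code values.
MODE_TABLE = [f"mode_{i}" for i in range(6)] + ["BP_SR0", "BP_SR1"]

# Search range implied by each block prediction instance.
IMPLIED_SEARCH_RANGE = {"BP_SR0": 0, "BP_SR1": 1}


def encode_mode(name):
    """Return the 3-bit mode header for a mode name as a bit string."""
    return format(MODE_TABLE.index(name), f"0{MODE_BITS}b")


def decode_mode(bits):
    """Return (mode name, implied search range index or None)."""
    name = MODE_TABLE[int(bits, 2)]
    return name, IMPLIED_SEARCH_RANGE.get(name)
```

Because the search range is recovered from the mode header itself, no separate per-block flag is spent on it in this arrangement.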

Coding in block prediction mode using multiple search ranges
[0156] With reference to FIG. 21, an example procedure for coding a block of video data in block prediction mode will be described. The steps illustrated in FIG. 21 may be performed by a video encoder (e.g., the video encoder 20 in FIG. 2A) or component(s) thereof. For convenience, method 2100 is described as being performed by a coder, which may be the video encoder 20 or another component.

[0157] The method 2100 begins at block 2101. At block 2105, the coder determines a first cost associated with coding a current block (e.g., the block of video data currently being coded) based on a first candidate region within a first range of locations corresponding to the current block. The first candidate region may have the same size (e.g., the same dimensions and/or the same number of pixels) as the current block. The first candidate region may include a block or a portion of a block that was previously coded and is now used to code the current block. In some embodiments, the first candidate region may be a collection of blocks or block partitions that are each used to code a different portion of the current block. For example, the current block may include four block partitions, and each of the four block partitions may be predicted or coded using a different block or block partition of the first candidate region within the first range of locations. In some implementations, multiple block partitions within the current block may be coded based on the same block or block partition of the first candidate region within the first range of locations. The first range of locations (e.g., a first search range) may be a search range specified by the encoder or by the applicable coding standard. The first range of locations may be similar to one of the example search ranges discussed in this disclosure. The first range of locations may comprise blocks or block partitions that have been reconstructed and are used to predict or code subsequent blocks and/or block partitions (e.g., in coding order or raster scanning order). The first range of locations may include a raster scan line that overlaps the current block. In other embodiments, the first range of locations does not include a raster scan line that overlaps the current block. Video data associated with the first candidate region may be stored in a memory of the video encoding device.

[0158] At block 2110, the coder determines a second cost associated with coding the current block based on a second candidate region within a second range of locations corresponding to the current block. The second candidate region may have the same size (e.g., the same dimensions and/or the same number of pixels) as the current block. The second candidate region may include a previously coded block, or a portion of a block, that is currently being used to code the current block. In some embodiments, the second candidate region may be a collection of blocks or block partitions that are each used to code a different portion of the current block. For example, the current block may include four block partitions, each of which may be predicted or coded using a different block or block partition of the first candidate region within the second range of locations. In some implementations, multiple block partitions within the current block may be coded based on the same block or block partition of the first candidate region within the second range of locations. The second range of locations may be a search range specified by the encoder or by an applicable coding standard. The second range of locations may be similar to the exemplary search ranges discussed in this disclosure. The second range of locations may comprise blocks or block partitions that are reconstructed and used to predict or code subsequent blocks and/or block partitions (e.g., in coding order or raster scan order). In some embodiments, the first range of locations and the second range of locations are mutually exclusive. Alternatively or additionally, the first range of locations and the second range of locations may occupy different raster scan lines. The second range of locations may include a raster scan line that overlaps the current block. In other embodiments, the second range of locations does not include a raster scan line that overlaps the current block. For example, as shown in FIG. 20, the two search ranges 2020 and 2030 do not overlap each other. Video data associated with the second candidate region may be stored in a memory of the video encoding device.

[0159] At block 2115, the coder determines whether the first cost, associated with coding the current block based on the first candidate region, is greater than the second cost, associated with coding the current block based on the second candidate region. For example, the coder may calculate a rate- and distortion-based cost associated with coding the current block using the first candidate region within the first range of locations (e.g., within the first search range) and a rate- and distortion-based cost associated with coding the current block using the second candidate region within the second range of locations (e.g., within the second search range), and then compare the calculated costs. In some embodiments, the current block may comprise multiple block partitions. In some of these embodiments, calculating the first and second costs may include (i) determining the block partitions within the associated search range (e.g., the first search range and the second search range, respectively) that are used to code the corresponding block partitions in the current block, (ii) determining individual costs for coding the individual block partitions within the current block based on the block partitions within the associated search range, and (iii) calculating the first and second costs based on the individual costs. For example, the first and second costs may be calculated by summing the individual costs. Alternatively, the first and second costs may be calculated by averaging the individual costs.
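
The cost aggregation and comparison described for block 2115 can be sketched as follows. This is a minimal illustration only: the function names `region_cost` and `first_cost_is_greater`, the `mode` parameter, and the use of plain numbers for the per-partition rate-distortion costs are assumptions made for the example and do not come from the disclosure.

```python
from typing import Sequence


def region_cost(partition_costs: Sequence[float], mode: str = "sum") -> float:
    """Combine per-partition costs into one cost for a candidate region.

    mode="sum" adds the individual costs; mode="mean" averages them.
    Both names are hypothetical labels for the two options in the text.
    """
    if not partition_costs:
        raise ValueError("at least one partition cost is required")
    total = float(sum(partition_costs))
    if mode == "sum":
        return total
    if mode == "mean":
        return total / len(partition_costs)
    raise ValueError(f"unknown mode: {mode}")


def first_cost_is_greater(first_costs: Sequence[float],
                          second_costs: Sequence[float],
                          mode: str = "sum") -> bool:
    """Return True when the first region's combined cost exceeds the second's."""
    return region_cost(first_costs, mode) > region_cost(second_costs, mode)
```

With four partitions per block, `first_cost_is_greater([3, 3, 3, 3], [1, 2, 3, 4])` compares a combined cost of 12 against 10.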

[0160] At block 2120, in response to determining that the first cost is greater than the second cost, the coder codes the current block based on the second candidate region within the second range of locations, at least in part by providing an indication associated with the second range. In some embodiments, the indication may be a 1-bit flag indicating whether the current block is coded (i) based on the first candidate region within the first range of locations or (ii) based on the second candidate region within the second range of locations. For example, a flag value equal to 0 may indicate that the current block is coded based on one or more blocks or block partitions in the first search range (e.g., based on the first candidate region within the first range of locations), and a flag value equal to 1 may indicate that the current block is coded based on one or more blocks or block partitions in the second search range (e.g., based on the second candidate region within the second range of locations). In other embodiments, the indication may be a multi-bit syntax element configured to indicate the coding mode associated with the current block. For example, the syntax element may indicate which one of a plurality of coding modes is to be used to code the current block. One of the coding modes may be a block prediction mode. In some embodiments, if the syntax element has one value (of multiple possible values), the current block is coded in a block prediction mode that uses only the first search range (or uses the first search range by default unless otherwise indicated), and if the syntax element has another value (of the multiple possible values), the current block is coded in a block prediction mode that uses only the second search range (or uses the second search range by default unless otherwise indicated). If the syntax element has yet another value (of the multiple possible values), the current block may be coded in a coding mode other than the block prediction mode. Method 2100 ends at block 2125.
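
The 1-bit flag variant of the indication can be sketched as below. The function name `search_range_flag` is hypothetical, and the handling of equal costs (falling back to the first search range) is an assumption, since the text only describes the case in which the first cost is greater.

```python
def search_range_flag(first_cost: float, second_cost: float) -> int:
    """Return the 1-bit flag: 0 selects the first search range, 1 the second.

    The second range is chosen only when the first cost is strictly greater.
    Treating a tie as a vote for the first range is an assumption.
    """
    return 1 if first_cost > second_cost else 0
```

A decoder reading this flag would then look up the candidate in the first search range for a value of 0 and in the second search range for a value of 1.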

[0161] In method 2100, one or more of the blocks shown in FIG. 21 may be removed (e.g., not performed), and/or the order in which the method is performed may be switched. For example, in some embodiments, one or more of blocks 2105, 2110, and 2115 may be omitted if the coder determines that the current block includes a raster scan line that has no preceding raster scan line in the same slice (e.g., the first line in the slice). In some embodiments, additional blocks may be added to method 2100. The embodiments of the present disclosure are not limited to or by the example shown in FIG. 21, and other variations may be implemented without departing from the spirit of the present disclosure.

Advantages of using multiple search ranges
[0162] The techniques for using multiple search ranges when coding a block in block prediction mode improve the coding efficiency associated with block prediction mode and thereby increase coding performance, particularly for graphics-type images and graphics content. Implementing one or more of these techniques may increase computational complexity on the encoder side. However, because the encoder is implemented at a smaller process node (20 nm or less), the encoder generally exhibits a greater degree of tolerance for increased computational complexity. Importantly, decoder complexity will remain largely the same even when multiple search ranges are used to code a block in block prediction mode. The decoder may generally be implemented at a much larger process size (60 nm or more) and may be subject to more stringent hardware requirements (e.g., the gate count must be minimized). Thus, the techniques of this disclosure for using multiple search ranges in block prediction mode improve coding performance with a relatively small increase in computational complexity.

Simplified block prediction mode
[0163] In some cases, the techniques described above for coding the current block in block prediction mode can be simplified further. For example, for a cost-constrained hardware implementation, one or more of the features described above may be removed or modified to reduce the computational complexity of the coder (on the encoder side, the decoder side, or both). In such cases, one or more of the following changes may be made to the method of coding a block in block prediction mode without significantly degrading performance: (i) the coder may use a single search range to predict the current block or partition instead of using multiple search ranges as described above; (ii) the search range may include pixels from both a previous reconstructed line (e.g., the line immediately preceding the current line) and the current line, where the samples in such lines have already been reconstructed (e.g., by the time the current block or partition is coded); and/or (iii) a single previous reconstructed line is used to predict the current block or partition instead of a previous reconstructed block line (which may include multiple lines).

[0164] Depending on a given implementation's desired trade-off between coding performance and hardware complexity, various versions and modifications of the techniques described herein for coding a block in block prediction mode (e.g., the standard block prediction mode, the block prediction mode using multiple ranges, the simplified block prediction mode, and so on) may be used. Some version of the block prediction mode may be selected for ADSC depending on the VESA task group's compromise between performance and hardware complexity.

[0165] As described above, in some embodiments the simplified block prediction mode may use a single search range. In some of these embodiments, the total number of possible block prediction vectors is set to 2^n for some n. For example, ADSC generally uses n = 6, in which case the total number of possible block prediction vectors would be 64 positions. Candidate pixels within the search range may come from any of three regions, referred to herein as region A, region B, and region C. An exemplary mapping of the BPV index to a search range (SR) and a position within that search range (SRpos) is shown in Table 2. For example, this mapping may be computed from the associated search range lengths SrLen_i, i ∈ {A, B, C}.

[0166] In some embodiments, the block prediction vector that the encoder explicitly signals to the decoder may be an integer in the range [0, 2^n − 1]. The mapping from this index to a search range may depend on SrLen_i. Table 2 shows an example in which SrLen_A = 26, SrLen_B = 8, and SrLen_C = 30.
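
One way such an index-to-region mapping could be computed from the SrLen values is sketched below. The ordering of the regions within the index space (A first, then B, then C) is purely an assumption for illustration, as is the function name `map_bpv_index`; the actual assignment is defined by Table 2 of the disclosure, which is not reproduced here.

```python
from typing import Dict, Sequence, Tuple


def map_bpv_index(index: int,
                  sr_len: Dict[str, int],
                  order: Sequence[str] = ("A", "B", "C")) -> Tuple[str, int]:
    """Map a block prediction vector index to (region, position in region).

    Indices are handed out region by region in the given order, which is
    an assumed layout and not taken from Table 2.
    """
    if index < 0:
        raise ValueError("index must be non-negative")
    offset = index
    for region in order:
        if offset < sr_len[region]:
            return region, offset
        offset -= sr_len[region]
    raise ValueError("index exceeds the total number of search positions")


# Region lengths from the example in the text (26 + 8 + 30 = 64 = 2**6).
EXAMPLE_SR_LEN = {"A": 26, "B": 8, "C": 30}
```

Under this assumed layout, indices 0 through 25 fall in region A, 26 through 33 in region B, and 34 through 63 in region C.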

[0167] Diagram 2200 of FIG. 22 illustrates an example in which simplified block prediction uses a single search range comprising pixels from different regions of the causally available image (e.g., previously reconstructed pixels). The number of candidates in each particular region may be adjusted depending on the parameters of the codec. In the example of FIG. 22, SR_A and SR_B are formed from the previous reconstructed line, whereas SR_C is formed from the current block line. For example, SR_A includes pixels that are either directly above the current block 2240 (e.g., vertically overlapping it) or to its right, as illustrated in FIG. 22, and SR_B includes pixels to the left of the current block 2240 (e.g., pixels that do not vertically overlap the current block 2240 and that have smaller x-coordinate values than the pixels in the current block 2240), as illustrated in FIG. 22. FIG. 22 illustrates SR_A 2220, SR_B 2210, SR_C 2230, and the current block 2240. As illustrated in FIG. 22, SR_A 2220 and SR_B 2210 are in the previous reconstructed line, and SR_C 2230 is in the current block line.

[0168] Diagram 2300 of FIG. 23 illustrates an example of the simplified block prediction mode with a variable partition size (2×2). The search within SR_C may be performed as described herein (e.g., by determining the cost associated with coding the current block using a 2×2 block in SR_C). For a search within SR_A or SR_B, a candidate partition may be extended or padded in the y direction to create a 2×2 candidate. FIG. 23 illustrates SR_A 2320, SR_B 2310, SR_C 2330, and the current block 2340. In FIG. 23, SR_A 2320 and SR_B 2310 are in the previous reconstructed line, and SR_C 2330 is in the current block line.

[0169] Diagram 2400 of FIG. 24 illustrates an example of the simplified block prediction mode with a variable partition size (1×2). The search within SR_A or SR_B may be performed as described herein (e.g., by determining the cost associated with coding the current block using a 1×2 block in SR_A or SR_B). For a search within SR_C (the current block line), a partition in line l of the current block is searched for in line l of SR_C. FIG. 24 illustrates SR_A 2420, SR_B 2410, SR_C 2430, and the current block 2440. In FIG. 24, SR_A 2420 and SR_B 2410 are in the previous reconstructed line, and SR_C 2430 is in the current block line.

[0170] For example, the number of search positions for a particular region (e.g., region A, B, or C) may be referred to herein as SrLen_i for region i. In such an example, the following constraint may be established: SrLen_A + SrLen_B + SrLen_C ≤ 2^n. For example, if block prediction is performed using a single search range, and the maximum number of positions in the single search range is defined to be 2^n, the sum of the positions in each of the regions would need to be no greater than that maximum. The values of SrLen_i may be adjusted depending on the needs of the codec. In addition, these values can easily be adjusted dynamically based on the location of the current block or partition within the current slice. For example, if the current block or partition is located in the FLS, the encoder and decoder may infer that SR_A and SR_B cannot be used to code the current block. Therefore, a larger number of positions may be allocated to SR_C (e.g., up to the maximum allotted to the single search range).
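
The constraint on the region lengths and the dynamic reallocation for the first line of a slice can be sketched as follows. The function name `allocate_search_positions` and its parameters are hypothetical, and giving region C the entire 2^n budget in the first line is only one possible choice consistent with "up to the maximum".

```python
from typing import Dict


def allocate_search_positions(n_bits: int,
                              default_len: Dict[str, int],
                              in_first_line_of_slice: bool) -> Dict[str, int]:
    """Return per-region position counts that satisfy the 2**n_bits budget.

    In the first line of a slice, regions A and B hold no reconstructed
    pixels, so (as an assumed policy) the whole budget is given to region C.
    """
    max_positions = 1 << n_bits
    if in_first_line_of_slice:
        lengths = {"A": 0, "B": 0, "C": max_positions}
    else:
        lengths = dict(default_len)
    # Enforce SrLen_A + SrLen_B + SrLen_C <= 2**n.
    if sum(lengths.values()) > max_positions:
        raise ValueError("region lengths exceed the search range budget")
    return lengths
```

Because both the encoder and the decoder can tell whether the current block lies in the first line of the slice, both sides could derive the same allocation without extra signaling.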

[0171] In addition to, or as an alternative to, using a single search range, the simplified block prediction mode may remove the requirement that the encoder and decoder store the previous reconstructed block line. Instead, only one previous reconstructed line may be stored. For example, for any block size P×Q, only one reconstructed line may be stored (and included in a search range such as SR_A of FIG. 24) instead of P lines.

[0172] If variable partition sizing is leveraged, the following logic changes may be implemented for the simplified block prediction mode.

[0173] In some implementations, if a 2×2 partition is used to code the current block, any candidate position from SR_A or SR_B may be extended or padded in the y direction to generate a 2×2 candidate, as described above with reference to FIG. 23. For example, as illustrated in FIG. 23, the 1×2 candidate 2350 may be extended or padded in the y direction by duplicating its sample values to generate the 2×2 candidate 2360. Similar techniques can be extended to blocks of any size. For example, a candidate may be extended or padded in the y direction to match the height of the current block or partition. The 2×2 candidate 2380, on the other hand, may be used as is, without being extended or padded. In other implementations, how a 2×2 partition within the current block is coded may depend on which search range (e.g., SR_A, SR_B, or SR_C in FIGS. 22 to 24) is used to code the 2×2 partition. Such a technique is described in more detail below with reference to FIG. 25.
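
The y-direction padding of a single-line candidate can be sketched as below. The function name `pad_candidate_in_y` and the list-of-rows representation of a candidate are assumptions made for the example.

```python
from typing import List, Sequence


def pad_candidate_in_y(row: Sequence[int], height: int) -> List[List[int]]:
    """Extend a one-line candidate to `height` lines by duplicating its samples.

    Each output row is an independent copy of the input row, so a 1x2
    candidate with height=2 becomes a 2x2 candidate.
    """
    if height < 1:
        raise ValueError("height must be at least 1")
    return [list(row) for _ in range(height)]
```

For instance, padding the samples `[10, 20]` to a height of 2 yields two identical rows, matching the way candidate 2350 is described as being turned into candidate 2360.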

[0174] If a 1×2 partition is used to code the current block, any candidate position from SR_C may be selected from the same line as the current 1×2 partition in the current block, as described above with reference to FIG. 24. For example, as illustrated in FIG. 24, the current partition 2450 is predicted based on the 1×2 candidate 2460 in the same line, and the current partition 2470 is predicted based on the 1×2 candidate 2480 in the same line. In such an example, to find a candidate for the current partition 2450, the coder compares the costs of coding the current partition 2450 based on the individual 1×2 blocks in search range 2430 that lie in the same line as the current partition 2450, and to find a candidate for the current partition 2470, the coder compares the costs of coding the current partition 2470 based on the individual 1×2 blocks in search range 2430 that lie in the same line as the current partition 2470.

Advantages of coding in simplified block prediction mode
[0175] The techniques for coding in the simplified block prediction mode provide a trade-off between performance and complexity on both the encoder side and the decoder side. This is desirable for any implementation that is constrained in hardware cost.

Further simplification of simplified block prediction mode
[0176] To reduce the area of an ADSC implementation for an ASIC/FPGA, further modifications may be made to the search range used in the simplified block prediction mode described above. A hardware implementation of the ADSC decoder may require fast random access to all positions within the search range. For example, such a hardware implementation may include an array of flip-flops whose size is proportional to the size of the search range (e.g., in the worst case). It may therefore be desirable to limit the maximum number of possible positions within each portion of the search range (e.g., the size of each region within the search range). In one example, the maximum number of possible positions within each region of the search range may be as follows: SR_A = 20, SR_B = 12, SR_C = 32. For example, the number of positions in each region may be limited to such a maximum regardless of how many positions are in the other regions. If the current block being processed is at position x = 128 in the first line of the slice (e.g., with 128 pixels preceding the current block in the same line), the number of positions for search range C may be limited to 32, despite the fact that search ranges A and B have no pixels available for coding the current block and that additional pixels could be included in search range C without exceeding the maximum size of the search range (e.g., if the maximum size of the search range is 64 pixels, 64 of the 128 previously coded pixels could be included in the search range). Such a limit may be imposed, at the expense of coding efficiency, to limit the amount of storage required in hardware. From the encoder's perspective, the other 32 search range positions (e.g., the first 20 pixels and the last 12 pixels of the 64-position search range) may be "invalid" for any current block in the first line of the slice. In some implementations, each portion of the search range is always allocated the same number of positions, and each position may be "valid" or "invalid" depending on whether a pixel exists at that position or is available to the encoder at the time the current block is coded. The block prediction search and all other operations (e.g., cost calculation and comparison) may be skipped for such invalid positions. The number of valid positions will increase toward the right edge of the first line of the slice (e.g., as illustrated by the second column of FIG. 25, where search range 2520 is extended into subsequent block lines). In other implementations, the sum of the numbers of positions in the respective portions of the search range may be limited to no more than a maximum number (e.g., 64 positions). In such implementations, if the current block being processed is at position x = 128 in the first line of the slice (e.g., with 128 pixels preceding the current block in the same line), the number of positions for search range C may be greater than 32 (e.g., up to 64 if the maximum number is 64), because the other search ranges (e.g., A and B) may be empty.
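
The difference between the two limiting policies for a block in the first line of a slice can be sketched as follows. The function name, the policy labels "fixed" and "shared", and the simplification that each previously coded pixel in the line contributes exactly one candidate position are all assumptions made for illustration.

```python
def region_c_positions_first_line(x: int,
                                  policy: str = "fixed",
                                  region_c_cap: int = 32,
                                  total_cap: int = 64) -> int:
    """Count valid region-C positions for a block at pixel offset x.

    Applies only to the first line of a slice, where regions A and B are
    empty. "fixed" caps region C at its own maximum; "shared" lets region C
    use the whole search range budget.
    """
    if x < 0:
        raise ValueError("x must be non-negative")
    if policy == "fixed":
        limit = region_c_cap
    elif policy == "shared":
        limit = total_cap
    else:
        raise ValueError(f"unknown policy: {policy}")
    # Only pixels already coded in this line (x of them) can be candidates.
    return min(x, limit)
```

For the x = 128 example in the text, the fixed policy leaves region C with 32 positions, while the shared policy allows 64.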

[0177] To limit the impact on coding efficiency, under certain circumstances a smaller number of bits may be used to signal the block prediction vector in the bitstream. For example, if the current block is within a certain range of positions (e.g., the first line of the slice), both the encoder and the decoder may infer that a smaller number of bits is used to signal the block prediction vector, and may correctly identify the candidate block or partition using a block prediction vector signaled with fewer bits than would be needed to identify each individual position within the search range (e.g., 6 bits if the search range has 64 positions). In the example above, in which 32 of the 64 positions are determined to be "invalid", 5 bits per block prediction vector may be used instead of 6 for most of the first line of the slice, because only 32 of the 64 search range positions are valid during that time.
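
The bit-count saving can be illustrated with a small helper. Deriving the width as the smallest number of bits that can index every valid position is an assumption about how an implementation might choose it; the function name `vector_bits` is hypothetical.

```python
def vector_bits(num_valid_positions: int) -> int:
    """Return the fewest bits that can index num_valid_positions positions."""
    if num_valid_positions < 1:
        raise ValueError("at least one valid position is required")
    # (k - 1).bit_length() equals ceil(log2(k)) for every integer k >= 1.
    return (num_valid_positions - 1).bit_length()
```

With all 64 positions valid this gives 6 bits, and with only 32 valid positions it gives 5 bits, matching the example in the text.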

[0178] In addition, with respect to block timing, the ability to fill the search range flip-flops at a constant rate may be advantageous for a hardware implementation of ADSC. This means that the search range should efficiently shift by one block width per block time. As a result, certain positions within search range C may technically be in the previous block line relative to the current block once the current block has advanced to the next line of the slice. An illustration of this feature is shown in diagram 2500 of FIG. 25. As the current block 2510 moves to the right edge of the slice and then on to the next block line, search range 2530 (e.g., the portion of the search range within the current block line) remains in the previous block line, as shown in the fourth and fifth rows of FIG. 25.

[0179] In some embodiments, search range B (e.g., the top line of search range 2540 in FIG. 25) may be used to generate 2×2 prediction candidates that span the previous line and the first line of the current block line. As shown in FIG. 25, the top line of search range 2540 is search range B, and the bottom line of search range 2540 is the portion of search range C (e.g., search range 2530) that is collocated with search range B. Thus, in some such embodiments, instead of extending or padding a 1×2 prediction candidate in search range B to generate a 2×2 prediction candidate, the coder may use a 2×2 prediction candidate that includes two pixels from the previous reconstructed line (e.g., two pixels from search range B) and two pixels from the first line of the current block line (e.g., two pixels from search range C that are collocated with the two pixels in search range B). This approach cannot be used for search range A (e.g., search range 2520), because the pixels in the current block line that are collocated with the pixels in search range A (e.g., directly below the pixels in search range A) are not causally available at the time the current block 2510 is coded.
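The two ways of forming a 2×2 prediction candidate can be contrasted in a short sketch. It assumes each line is a list of reconstructed pixel values indexed by horizontal position and that `num_reconstructed` counts the pixels of the current block line that are already reconstructed; these names, and padding by row replication, are assumptions made for illustration.

```python
def spanning_candidate(prev_line, cur_line, x, num_reconstructed):
    """Build a 2x2 candidate from the previous line and the current block line.

    Returns None when the collocated pixels in the current block line are
    not yet reconstructed (not causally available), as for search range A.
    """
    if x + 2 > num_reconstructed:
        return None
    return [prev_line[x:x + 2], cur_line[x:x + 2]]


def padded_candidate(prev_line, x):
    """Build a 2x2 candidate by replicating a 1x2 candidate (an assumption)."""
    row = prev_line[x:x + 2]
    return [row, list(row)]


prev_line = [10, 11, 12, 13]   # previous reconstructed line
cur_line = [20, 21, 22, 23]    # first line of the current block line
available = spanning_candidate(prev_line, cur_line, 1, num_reconstructed=3)
unavailable = spanning_candidate(prev_line, cur_line, 2, num_reconstructed=3)
fallback = padded_candidate(prev_line, 2)
```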

Coding in Block Prediction Mode Using a Simplified Search Range
[0180] With reference to FIG. 26, an example procedure for coding a block of video data in block prediction mode is described. The steps illustrated in FIG. 26 may be performed by a video encoder (e.g., the video encoder 20 in FIG. 2A) or by one or more of its components. For convenience, the method 2600 is described as being performed by a coder, which may be the video encoder 20 or another component.

[0181] The method 2600 begins at block 2601. At block 2605, the coder determines a candidate block to be used for predicting a current block in a current slice, where the candidate block is within a range of pixel positions (e.g., a search range) that each correspond to a reconstructed pixel in the current slice. For example, the coder determines, for each potential candidate block of a plurality of potential candidate blocks in the range of pixel positions, a cost associated with coding the current block based on that potential candidate block, and identifies the potential candidate block having the lowest cost as the candidate block. Each potential candidate block may correspond to one of the pixel positions in the range. The range of pixel positions may include a first region that includes one or more first pixel positions in a first line of pixels in the current slice, where the first line of pixels overlaps the current block. For example, the first line of pixels may span the entire width of the current slice and may include at least one pixel in the current block. The range of pixel positions further includes a second region that includes one or more second pixel positions in a second line of pixels in the current slice, where the second line of pixels does not overlap the current block. For example, the second line of pixels may span the entire width of the current slice and include no pixel in the current block. The second line of pixels may immediately precede the first line in the current slice. Each of the first and second lines may be a raster scan line within the current slice. In some embodiments, the first region and the second region occupy different raster scan lines. The first region may be within a raster scan line that overlaps the current block (e.g., the raster scan line and the current block have at least one pixel in common). The range of pixel positions may further include a third region that includes one or more third pixel positions in the second line (e.g., in the same line that contains the second region). For example, the one or more third pixel positions in the third region may include no pixel position in the second line that is collocated with (or vertically overlaps) a pixel position in the first line that is part of the current block, whereas at least one of the one or more second pixel positions in the second region may be collocated with (or vertically overlap) a pixel position in the first line that is part of the current block. As described herein, the regions may each include a different number of pixel positions. For example, the first region contains more pixel positions than the second region, which in turn contains more pixel positions than the third region. In some embodiments, the current block is a 1×2 partition within a 2×8 block predicted in the simplified block prediction mode. In other embodiments, the current block is a 2×2 partition within a 2×8 block predicted in the simplified block prediction mode. In still other embodiments, the current block is a 2×8 block predicted in the simplified block prediction mode. Each potential candidate block in the range of pixel positions may correspond to (e.g., include as its top-left pixel or as another reference pixel) any pixel position in the range (e.g., a pixel position in the first region, the second region, or the third region). Video data associated with the candidate block may be stored in a memory of the video encoding device.
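The lowest-cost selection at block 2605 can be sketched as an exhaustive search. The example below assumes 1×2 partitions and uses the sum of absolute differences (SAD) as the cost; the disclosure refers only to a cost, so SAD, the tie-breaking rule, and all names are assumptions made for illustration.

```python
def sad(block_a, block_b):
    """Sum of absolute differences between two equally sized pixel tuples."""
    return sum(abs(a - b) for a, b in zip(block_a, block_b))


def find_candidate(current, candidates):
    """Return (index, cost) of the lowest-cost potential candidate block.

    candidates holds one pixel tuple per search range position; the index
    of the winner is what a prediction vector would indicate. Ties go to
    the lowest index (an arbitrary choice for this sketch).
    """
    best_index, best_cost = None, None
    for index, candidate in enumerate(candidates):
        cost = sad(current, candidate)
        if best_cost is None or cost < best_cost:
            best_index, best_cost = index, cost
    return best_index, best_cost


current_partition = (100, 102)
potential_candidates = [(90, 95), (101, 102), (100, 110)]
best = find_candidate(current_partition, potential_candidates)
```

Here the three candidates have costs 17, 1, and 8, so the search selects position 1.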

[0182] At block 2610, the coder determines a prediction vector that indicates the pixel position of the candidate block within the range of pixel positions. For example, the pixel position of the candidate block may be in one of the first region or the second region.

[0183] At block 2615, the coder codes the current block in the simplified block prediction mode, at least in part by signaling the prediction vector. The coder may signal the prediction vector using a fixed number of bits (e.g., the minimum number of bits needed to uniquely identify each pixel position in the range of pixel positions). For example, if the range contains 64 pixel positions, 6 bits may be used to signal each prediction vector. In some embodiments, if the location of the current block within the current slice prevents the range of pixel positions from containing more than a certain number of pixel positions that is smaller than the maximum number, the coder may signal the prediction vector using fewer bits than are needed to uniquely identify the maximum number of pixel positions in the range. For example, if, because of the location of the current block within the current slice, the range cannot contain more than 32 pixel positions (e.g., the current line is the first line in the current slice and only 32 reconstructed blocks precede the current block in raster scan order), a reduced number of bits (e.g., 5 in this case) may be used to signal the prediction vector indicating the pixel position of the candidate block used to code the current block. The method 2600 ends at block 2620.
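Fixed-length signaling with a position-dependent width can be sketched as a pair of encode and decode helpers. The example assumes the prediction vector is a plain index into the currently usable positions and that both sides know how many positions are usable; the function names and the bit-string representation are hypothetical.

```python
def vector_width(num_positions):
    """Bits needed to uniquely identify one of num_positions positions."""
    if num_positions < 1:
        raise ValueError("at least one position is required")
    return (num_positions - 1).bit_length()


def encode_vector(index, num_positions):
    """Write the prediction vector index as a fixed-length bit string."""
    if not 0 <= index < num_positions:
        raise ValueError("index outside the usable search range")
    width = vector_width(num_positions)
    return f"{index:0{width}b}" if width else ""


def decode_vector(bits, num_positions):
    """Read back an index written by encode_vector."""
    if len(bits) != vector_width(num_positions):
        raise ValueError("unexpected number of bits")
    return int(bits, 2) if bits else 0


full_range_code = encode_vector(5, 64)     # 6-bit code
reduced_range_code = encode_vector(5, 32)  # 5-bit code
```

The decoder must derive `num_positions` from the block location in the same way as the encoder; otherwise the two sides would disagree on the field width.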

[0184] In the method 2600, one or more of the blocks shown in FIG. 26 may be removed (e.g., not performed), and/or the order in which the method is performed may be changed. In some embodiments, additional blocks may be added to the method 2600. For example, in some embodiments, the coder may determine that the current block includes at least one pixel in the first line of pixels in the current slice and at least one pixel in a third line of pixels, where the third line of pixels spans the entire width of the current slice, includes at least one pixel in the current block, and is different from the first line. Based on such a determination, the coder may (i) determine a cost associated with coding the current block based on a first block, where the first block includes at least one pixel in the first region and at least one pixel in the second region, and (ii) determine, based on that cost, that the first block is the candidate block to be used for predicting the current block. In another embodiment, the coder may likewise determine that the current block includes at least one pixel in the first line of pixels in the current slice and at least one pixel in a third line of pixels, where the third line of pixels spans the entire width of the current slice, includes at least one pixel in the current block, and is different from the first line. Based on such a determination, the coder may (i) determine a first cost associated with coding the current block based on a first block that has fewer pixels than the current block, where the first block includes one or more pixels that are each in the second region, (ii) determine a second cost associated with coding the current block based on a second block that has the same number of pixels as the current block, where the second block includes all of the one or more pixels in the first block and one or more additional pixels that are each in the first region, and (iii) based on a determination that the second cost is greater than the first cost, determine that the first block is the candidate block to be used for predicting the current block. Embodiments of the present disclosure are not limited to or by the example shown in FIG. 26, and other variations may be implemented without departing from the spirit of this disclosure.
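The comparison between the smaller first block and the full-size second block can be sketched as follows. The example assumes a 2×2 current block stored row by row as a 4-tuple, a SAD cost, and a first block that is padded by row replication before its cost is computed; the disclosure does not fix any of these choices, and the branch that selects the second block is likewise an assumption.

```python
def sad(block_a, block_b):
    """Sum of absolute differences between two equally sized pixel tuples."""
    return sum(abs(a - b) for a, b in zip(block_a, block_b))


def choose_candidate(current, first_block, additional_pixels):
    """Pick between a padded 1x2 first block and a 2x2 second block.

    first_block: two pixels from the second region (previous line).
    additional_pixels: two collocated pixels from the first region.
    """
    padded_first = first_block + first_block        # replicate the row
    second_block = first_block + additional_pixels  # spans both lines
    first_cost = sad(current, padded_first)
    second_cost = sad(current, second_block)
    if second_cost > first_cost:
        return "first", first_cost
    return "second", second_cost


flat_block = choose_candidate((10, 12, 10, 12), (10, 12), (50, 60))
edge_block = choose_candidate((10, 12, 50, 60), (10, 12), (50, 60))
```

In the first call the padded first block matches exactly, so it is selected; in the second call the block spanning both lines matches and is selected instead.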

Other Considerations
[0185] Information and signals disclosed herein may be represented using any of a variety of different technologies and techniques. For example, data, instructions, commands, information, signals, bits, symbols, and chips that may be referenced throughout the above description may be represented by voltages, currents, electromagnetic waves, magnetic fields or particles, optical fields or particles, or any combination thereof.

[0186] The various illustrative logical blocks and algorithm steps described in connection with the embodiments disclosed herein may be implemented as electronic hardware, computer software, or combinations of both. To clearly illustrate this interchangeability of hardware and software, various illustrative components, blocks, and steps have been described above generally in terms of their functionality. Whether such functionality is implemented as hardware or software depends upon the particular application and the design constraints imposed on the overall system. Skilled artisans may implement the described functionality in varying ways for each particular application, but such implementation decisions should not be interpreted as causing a departure from the scope of the present disclosure.

[0187] The techniques described herein may be implemented in hardware, software, firmware, or any combination thereof. Such techniques may be implemented in any of a variety of devices, such as general-purpose computers, wireless communication device handsets, or integrated circuit devices having multiple uses, including application in wireless communication device handsets and other devices. Any features described as devices or components may be implemented together in an integrated logic device or separately as discrete but interoperable logic devices. If implemented in software, the techniques may be realized at least in part by a computer-readable data storage medium comprising program code including instructions that, when executed, perform one or more of the methods described above. The computer-readable data storage medium may form part of a computer program product, which may include packaging materials. The computer-readable medium may comprise memory or data storage media, such as random access memory (RAM) such as synchronous dynamic random access memory (SDRAM), read-only memory (ROM), non-volatile random access memory (NVRAM), electrically erasable programmable read-only memory (EEPROM (registered trademark)), flash memory, and magnetic or optical data storage media. The techniques additionally, or alternatively, may be realized at least in part by a computer-readable communication medium, such as propagated signals or waves, that carries or communicates program code in the form of instructions or data structures and that can be accessed, read, and/or executed by a computer.

[0188] The program code may be executed by a processor, which may include one or more processors, such as one or more digital signal processors (DSPs), general-purpose microprocessors, application-specific integrated circuits (ASICs), field programmable logic arrays (FPGAs), or other equivalent integrated or discrete logic circuitry. Such a processor may be configured to perform any of the techniques described in this disclosure. A general-purpose processor may be a microprocessor; in the alternative, the processor may be any conventional processor, controller, microcontroller, or state machine. A processor may also be implemented as a combination of computing devices, e.g., a combination of a DSP and a microprocessor, a plurality of microprocessors, one or more microprocessors in conjunction with a DSP core, or any other such configuration. Accordingly, the term "processor," as used herein, may refer to any of the foregoing structures, any combination of the foregoing structures, or any other structure or apparatus suitable for implementing the techniques described herein. In addition, in some aspects, the functionality described herein may be provided within dedicated software or hardware configured for encoding and decoding, or incorporated in a combined video encoder-decoder (codec). Also, the techniques could be fully implemented in one or more circuits or logic elements.

[0189] The techniques of this disclosure may be implemented in a wide variety of devices or apparatuses, including a wireless handset, an integrated circuit (IC), or a set of ICs (e.g., a chipset). Various components or units are described in this disclosure to emphasize functional aspects of devices configured to perform the disclosed techniques, but they do not necessarily require realization by different hardware units. Rather, as described above, various units may be combined in a codec hardware unit or provided by a collection of interoperable hardware units, including one or more processors as described above, in conjunction with suitable software and/or firmware.

[0190] Although the foregoing has been described in connection with various different embodiments, features or elements from one embodiment may be combined with other embodiments without departing from the teachings of this disclosure. However, the combinations of features between the respective embodiments are not necessarily limited thereto. Various embodiments of the disclosure have been described. These and other embodiments are within the scope of the following claims.

以下に本願の出願当初の特許請求の範囲に記載された発明を付記する。
［Ｃ１］
固定ビットレートビデオコーディング方式の簡略化されたブロック予測モードでビデオデータのブロックをコーディングするための方法であって、前記方法は、
現在スライス中の現在ブロックを予測するために使用される候補ブロックを決定することと、前記候補ブロックは、前記現在スライス中の再構成されたピクセルに各々対応する複数のピクセル位置の範囲内にあり、複数のピクセル位置の前記範囲は、少なくとも（i）前記現在スライス中の複数のピクセルの第１のライン中の１つまたは複数の第１のピクセル位置を含む第１の領域と、ここで、複数のピクセルの前記第１のラインは、前記現在ブロック中の少なくとも１つのピクセルを含み、前記現在スライスの全体の幅にわたる、（ii）前記現在スライス中の複数のピクセルの第２のライン中の１つまたは複数の第２のピクセル位置を含む第２の領域と、ここで、複数のピクセルの前記第２のラインは、前記現在ブロック中のいずれのピクセルも含まないが、前記現在スライスの前記全体の幅にわたる、を備える、
複数のピクセル位置の前記範囲内の前記候補ブロックのピクセル位置を示す予測ベクトルを決定することと、前記候補ブロックの前記ピクセル位置は、前記第１の領域または前記第２の領域のうちの１つにある、
前記予測ベクトルをシグナリングすることを少なくとも部分的に介して、簡略化されたブロック予測モードで前記現在ブロックをコーディングすることと
を備える、方法。
［Ｃ２］
複数のピクセルの前記第１のラインおよび複数のピクセルの前記第２のラインは、前記現在スライスの２つの隣接するラスタスキャンラインを備える、Ｃ１に記載の方法。
［Ｃ３］
前記現在ブロックは、簡略化されたブロック予測モードで予測された２×８ブロック内の１×２区分である、Ｃ１に記載の方法。
［Ｃ４］
前記現在ブロックは、簡略化されたブロック予測モードで予測された２×８ブロック内の２×２区分である、Ｃ１に記載の方法。
［Ｃ５］
複数のピクセル位置の前記範囲は、複数のピクセルの前記第２のライン中の１つまたは複数の第３のピクセル位置を備える第３の領域をさらに含み、前記１つまたは複数の第３のピクセル位置は、前記現在ブロックの一部である前記第１のライン中のピクセル位置に関してコロケートされた前記第２のライン中のいずれのピクセル位置も含まない、Ｃ１に記載の方法。
［Ｃ６］
前記第２の領域および前記第３の領域は、同じラスタスキャンラインを占有する、Ｃ５に記載の方法。
［Ｃ７］
前記第１の領域は、第１の数のピクセル位置を含み、前記第２の領域は、第２の数のピクセル位置を含み、前記第３の領域は、第３の数のピクセル位置を含み、前記第１の数は、前記第２の数よりも大きく、かつ前記第３の数よりも大きい、Ｃ５に記載の方法。
［Ｃ８］
前記第１、第２、および第３の数は、互いに異なる、Ｃ７に記載の方法。
［Ｃ９］
複数の潜在的な候補ブロックの各潜在的な候補ブロックに基づいて、前記現在ブロックをコーディングすることに関連付けられたコストを決定することと、前記複数の潜在的な候補ブロックは、前記第１および第２の領域中の前記第１および第２のピクセル位置の１つに各々対応する、
最低コストを有する前記第１および第２の領域中の前記複数の潜在的な候補ブロックのうちの１つを、前記候補ブロックとして識別することと
をさらに備える、Ｃ１に記載の方法。
［Ｃ１０］
複数のピクセル位置の前記範囲中の各ピクセル位置を一意に識別するために必要とされるビットの数は、第１の数に等しく、前記方法は、
前記現在スライス内のあらかじめ定められた領域内に前記現在ブロックがあると決定することと、
ビットの前記第１の数よりも小さいものを使用して前記予測ベクトルをシグナリングすることと
をさらに備える、Ｃ１に記載の方法。
［Ｃ１１］
前記現在スライス中の複数のピクセルの前記第１のライン中の少なくとも１つのピクセルと、複数のピクセルの第３のライン中の少なくとも１つのピクセルとを前記現在ブロックが含むことを決定することと、複数のピクセルの前記第３のラインは、前記現在ブロック中の少なくとも１つのピクセルを含み、前記現在スライスの前記全体の幅にわたり、ここにおいて、前記第３のラインは、前記第１のラインとは異なる、
第１のブロックに基づいて前記現在ブロックをコーディングすることに関連付けられたコストを決定することと、前記第１のブロックは、前記第１の領域中の少なくとも１つのピクセルと前記第２の領域中の少なくとも１つのピクセルとを含む、
前記第１のブロックに基づいて前記現在ブロックをコーディングすることに関連付けられた前記コストに基づいて、前記現在ブロックを予測するために使用される、前記候補ブロックとなる前記第１のブロックを決定することと
をさらに備える、Ｃ１に記載の方法。
［Ｃ１２］
前記現在スライス中の複数のピクセルの前記第１のライン中の少なくとも１つのピクセルと、複数のピクセルの第３のライン中の少なくとも１つのピクセルとを前記現在ブロックが含むことを決定することと、複数のピクセルの前記第３のラインは、前記現在ブロック中の少なくとも１つのピクセルを含み、前記現在スライスの前記全体の幅にわたり、ここにおいて、前記第３のラインは、前記第１のラインとは異なる、
前記現在ブロックよりも少ない数のピクセルを有する第１のブロックに基づいて、前記現在ブロックをコーディングすることに関連付けられた第１のコストを決定することと、前記第１のブロックは、前記第２の領域中の各々にある１つまたは複数のピクセルを含む、
前記現在ブロックと同じ数のピクセルを有する第２のブロックに基づいて、前記現在ブロックをコーディングすることに関連付けられた第２のコストを決定することと、前記第２のブロックは、前記第１のブロック中の前記１つまたは複数のピクセルの全てと、前記第１の領域中の各々にある１つまたは複数の追加のピクセルとを含む、
前記第２のコストが前記第１のコストよりも大きいとの決定に基づいて、前記現在ブロックを予測するために使用される前記候補ブロックとなる前記第１のブロックを決定することと
をさらに備える、Ｃ１に記載の方法。
［Ｃ１３］
固定ビットレートビデオコーディング方式の簡略化されたブロック予測モードでビデオデータのブロックをコーディングするための装置であって、前記装置は、
ビデオデータの現在スライスの１つまたは複数の再構成されたピクセルを記憶するように構成されたメモリと、
前記メモリと通信状態にある１つまたは複数のプロセッサとを備え、前記１つまたは複数のプロセッサは、
前記現在スライス中の現在ブロックを予測するために使用される候補ブロックを決定することと、前記候補ブロックは、前記現在スライス中の再構成されたピクセルに各々対応する複数のピクセル位置の範囲内にあり、複数のピクセル位置の前記範囲は、少なくとも（i）前記現在スライス中の複数のピクセルの第１のライン中の１つまたは複数の第１のピクセル位置を含む第１の領域と、ここで、複数のピクセルの前記第１のラインは、前記現在ブロック中の少なくとも１つのピクセルを含み、前記現在スライスの全体の幅にわたる、（ii）前記現在スライス中の複数のピクセルの第２のライン中の１つまたは複数の第２のピクセル位置を含む第２の領域と、ここで、複数のピクセルの前記第２のラインは、前記現在ブロック中のいずれのピクセルも含まないが、前記現在スライスの前記全体の幅にわたる、を備える、
複数のピクセル位置の前記範囲内の前記候補ブロックのピクセル位置を示す予測ベクトルを決定することと、前記候補ブロックの前記ピクセル位置は、前記第１の領域または前記第２の領域のうちの１つにある、
前記予測ベクトルをシグナリングすることを少なくとも部分的に介して、簡略化されたブロック予測モードで前記現在ブロックをコーディングすることと
を行うように構成される、装置。
［Ｃ１４］
複数のピクセルの前記第１のラインおよび複数のピクセルの前記第２のラインは、前記現在スライスの２つの隣接するラスタスキャンラインを備える、Ｃ１３に記載の装置。
［Ｃ１５］
前記現在ブロックは、簡略化されたブロック予測モードで予測された２×８ブロック内の１×２区分である、Ｃ１３に記載の装置。
［Ｃ１６］
前記現在ブロックは、簡略化されたブロック予測モードで予測された２×８ブロック内の２×２区分である、Ｃ１３に記載の装置。
［Ｃ１７］
複数のピクセル位置の前記範囲は、複数のピクセルの前記第２のライン中の１つまたは複数の第３のピクセル位置を備える第３の領域をさらに含み、前記１つまたは複数の第３のピクセル位置は、前記現在ブロックの一部である前記第１のライン中のピクセル位置に関してコロケートされた前記第２のライン中のいずれのピクセル位置も含まない、Ｃ１３に記載の装置。
［Ｃ１８］
前記第２の領域および前記第３の領域は、同じラスタスキャンラインを占有する、Ｃ１７に記載の装置。
［Ｃ１９］
前記第１の領域は、第１の数のピクセル位置を含み、前記第２の領域は、第２の数のピクセル位置を含み、前記第３の領域は、第３の数のピクセル位置を含み、前記第１の数は、前記第２の数よりも大きく、かつ前記第３の数よりも大きい、Ｃ１７に記載の装置。
［Ｃ２０］
前記第１、第２、および第３の数は、互いに異なる、Ｃ１９に記載の装置。
［Ｃ２１］
前記１つまたは複数のプロセッサは、
複数の潜在的な候補ブロックの各潜在的な候補ブロックに基づいて、前記現在ブロックをコーディングすることに関連付けられたコストを決定することと、前記複数の潜在的な候補ブロックは、前記第１および第２の領域中の前記第１および第２のピクセル位置の１つに各々対応する、
最低コストを有する前記第１および第２の領域中の前記複数の潜在的な候補ブロックのうちの１つを、前記候補ブロックとして識別することと
を行うようにさらに構成される、Ｃ１３に記載の装置。
［Ｃ２２］
複数のピクセル位置の前記範囲中の各ピクセル位置を一意に識別するために必要とされるビットの数は、第１の数に等しく、前記１つまたは複数のプロセッサは、
前記現在スライス内のあらかじめ定められた領域内に前記現在ブロックがあると決定することと、
ビットの前記第１の数よりも小さいものを使用して前記予測ベクトルをシグナリングすることと
を行うようにさらに構成される、Ｃ１３に記載の装置。
［Ｃ２３］
前記１つまたは複数のプロセッサは、
前記現在スライス中の複数のピクセルの前記第１のライン中の少なくとも１つのピクセルと、複数のピクセルの第３のライン中の少なくとも１つのピクセルとを前記現在ブロックが含むことを決定することと、複数のピクセルの前記第３のラインは、前記現在ブロック中の少なくとも１つのピクセルを含み、前記現在スライスの前記全体の幅にわたり、ここにおいて、前記第３のラインは、前記第１のラインとは異なる、
第１のブロックに基づいて前記現在ブロックをコーディングすることに関連付けられたコストを決定することと、前記第１のブロックは、前記第１の領域中の少なくとも１つのピクセルと前記第２の領域中の少なくとも１つのピクセルとを含む、
前記第１のブロックに基づいて前記現在ブロックをコーディングすることに関連付けられた前記コストに基づいて、前記現在ブロックを予測するために使用される、前記候補ブロックとなる前記第１のブロックを決定することと
を行うようにさらに構成される、Ｃ１３に記載の装置。
［Ｃ２４］
前記１つまたは複数のプロセッサは、
前記現在スライス中の複数のピクセルの前記第１のライン中の少なくとも１つのピクセルと、複数のピクセルの第３のライン中の少なくとも１つのピクセルとを前記現在ブロックが含むことを決定することと、複数のピクセルの前記第３のラインは、前記現在ブロック中の少なくとも１つのピクセルを含み、前記現在スライスの前記全体の幅にわたり、ここにおいて、前記第３のラインは、前記第１のラインとは異なる、
前記現在ブロックよりも少ない数のピクセルを有する第１のブロックに基づいて、前記現在ブロックをコーディングすることに関連付けられた第１のコストを決定することと、前記第１のブロックは、前記第２の領域中の各々にある１つまたは複数のピクセルを含む、
前記現在ブロックと同じ数のピクセルを有する第２のブロックに基づいて、前記現在ブロックをコーディングすることに関連付けられた第２のコストを決定することと、前記第２のブロックは、前記第１のブロック中の前記１つまたは複数のピクセルの全てと、前記第１の領域中の各々にある１つまたは複数の追加のピクセルとを含む、
前記第２のコストが前記第１のコストよりも大きいとの決定に基づいて、前記現在ブロックを予測するために使用される前記候補ブロックとなる前記第１のブロックを決定することと
を行うようにさらに構成される、Ｃ１３に記載の装置。
［Ｃ２５］
固定ビットレートビデオコーディング方式の簡略化されたブロック予測モードでビデオデータのブロックをコーディングするように構成されたコードを備える非一時的物理的コンピュータストレージであって、前記コードは、実行されたとき、装置に、
現在スライス中の現在ブロックを予測するために使用される候補ブロックを決定することと、前記候補ブロックは、前記現在スライス中の再構成されたピクセルに各々対応する複数のピクセル位置の範囲内にあり、複数のピクセル位置の前記範囲は、少なくとも（i）前記現在スライス中の複数のピクセルの第１のライン中の１つまたは複数の第１のピクセル位置を含む第１の領域と、ここで、複数のピクセルの前記第１のラインは、前記現在ブロック中の少なくとも１つのピクセルを含み、前記現在スライスの全体の幅にわたる、（ii）前記現在スライス中の複数のピクセルの第２のライン中の１つまたは複数の第２のピクセル位置を含む第２の領域と、ここで、複数のピクセルの前記第２のラインは、前記現在ブロック中のいずれのピクセルも含まないが、前記現在スライスの前記全体の幅にわたる、を備える、
複数のピクセル位置の前記範囲内の前記候補ブロックのピクセル位置を示す予測ベクトルを決定することと、前記候補ブロックの前記ピクセル位置は、前記第１の領域または前記第２の領域のうちの１つにある、
前記予測ベクトルをシグナリングすることを少なくとも部分的に介して、簡略化されたブロック予測モードで前記現在ブロックをコーディングすることと
を行わせる、非一時的物理的コンピュータストレージ。
［Ｃ２６］
複数のピクセル位置の前記範囲は、複数のピクセルの前記第２のライン中の１つまたは複数の第３のピクセル位置を備える第３の領域をさらに含み、前記１つまたは複数の第３のピクセル位置は、前記現在ブロックの一部である前記第１のライン中のピクセル位置に関してコロケートされた前記第２のライン中のいずれのピクセル位置も含まない、Ｃ２５に記載の非一時的物理的コンピュータストレージ。
［Ｃ２７］
前記第１の領域は、第１の数のピクセル位置を含み、前記第２の領域は、第２の数のピクセル位置を含み、前記第３の領域は、第３の数のピクセル位置を含み、前記第１の数は、前記第２の数よりも大きく、かつ前記第３の数よりも大きい、Ｃ２６に記載の非一時的物理的コンピュータストレージ。
［Ｃ２８］
固定ビットレートビデオコーディング方式の簡略化されたブロック予測モードでビデオデータのブロックをコーディングするように構成されたビデオコーディングデバイスであって、前記ビデオコーディングデバイスは、
現在スライス中の現在ブロックを予測するために使用される候補ブロックを決定するための手段と、前記候補ブロックは、前記現在スライス中の再構成されたピクセルに各々対応する複数のピクセル位置の範囲内にあり、複数のピクセル位置の前記範囲は、少なくとも（i）前記現在スライス中の複数のピクセルの第１のライン中の１つまたは複数の第１のピクセル位置を含む第１の領域と、ここで、複数のピクセルの前記第１のラインは、前記現在ブロック中の少なくとも１つのピクセルを含み、前記現在スライスの全体の幅にわたる、（ii）前記現在スライス中の複数のピクセルの第２のライン中の１つまたは複数の第２のピクセル位置を含む第２の領域と、ここで、複数のピクセルの前記第２のラインは、前記現在ブロック中のいずれのピクセルも含まないが、前記現在スライスの前記全体の幅にわたる、を備える、
複数のピクセル位置の前記範囲内の前記候補ブロックのピクセル位置を示す予測ベクトルを決定するための手段と、前記候補ブロックの前記ピクセル位置は、前記第１の領域または前記第２の領域のうちの１つにある、
前記予測ベクトルをシグナリングすることを少なくとも部分的に介して、簡略化されたブロック予測モードで前記現在ブロックをコーディングするための手段と
を備える、ビデオコーディングデバイス。
［Ｃ２９］
複数のピクセル位置の前記範囲は、複数のピクセルの前記第２のライン中の１つまたは複数の第３のピクセル位置を備える第３の領域をさらに含み、前記１つまたは複数の第３のピクセル位置は、前記現在ブロックの一部である前記第１のライン中のピクセル位置に関してコロケートされた前記第２のライン中のいずれのピクセル位置も含まない、Ｃ２８に記載のビデオコーディングデバイス。
［Ｃ３０］
前記第１の領域は、第１の数のピクセル位置を含み、前記第２の領域は、第２の数のピクセル位置を含み、前記第３の領域は、第３の数のピクセル位置を含み、前記第１の数は、前記第２の数よりも大きく、かつ前記第３の数よりも大きい、Ｃ２９に記載のビデオコーディングデバイス。
The invention described in the scope of claims at the beginning of the application of the present application will be added below.
[C1]
A method for coding a block of video data in a simplified block prediction mode of a constant bit rate video coding scheme, the method comprising:
Determining a candidate block used to predict a current block in a current slice; and the candidate block is within a plurality of pixel locations, each corresponding to a reconstructed pixel in the current slice The range of pixel locations includes at least (i) a first region that includes one or more first pixel locations in a first line of pixels in the current slice; and The first line of pixels includes at least one pixel in the current block and spans the entire width of the current slice; (ii) in a second line of pixels in the current slice; A second region including one or more second pixel locations, wherein the second line of pixels is any pixel in the current block. Although not included, comprises, over the entire width of the current slice,
Determining a prediction vector indicative of a pixel position of the candidate block within the range of a plurality of pixel positions, wherein the pixel position of the candidate block is one of the first region or the second region; It is in,
Coding the current block in a simplified block prediction mode, at least in part through signaling the prediction vector;
A method comprising:
[C2]
The method of C1, wherein the first line of pixels and the second line of pixels comprise two adjacent raster scan lines of the current slice.
[C3]
The method of C1, wherein the current block is a 1x2 partition in a 2x8 block predicted in a simplified block prediction mode.
[C4]
The method of C1, wherein the current block is a 2x2 partition within a 2x8 block predicted in a simplified block prediction mode.
[C5]
The range of pixel locations further includes a third region comprising one or more third pixel locations in the second line of pixels, wherein the one or more third pixels The method of C1, wherein a position does not include any pixel position in the second line that is collocated with respect to a pixel position in the first line that is part of the current block.
[C6]
The method of C5, wherein the second region and the third region occupy the same raster scan line.
[C7]
The method of C5, wherein the first region includes a first number of pixel locations, the second region includes a second number of pixel locations, and the third region includes a third number of pixel locations, the first number being greater than the second number and greater than the third number.
[C8]
The method of C7, wherein the first, second, and third numbers are different from each other.
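As an illustration only, the region structure of C5 to C8 can be sketched as a function that enumerates candidate positions. The geometry used here (first region immediately to the left of the current block on its own line, second region on the previous line starting at the block's column, third region on the previous line to the left of the block) and every name and parameter are assumptions made for this sketch; the embodiments above do not fix them.

```python
from typing import List, Tuple

# Hypothetical representation: (line, column) of a candidate block's leftmost pixel.
Position = Tuple[int, int]


def build_search_range(line: int, x0: int, width: int,
                       n_first: int, n_second: int, n_third: int,
                       slice_width: int) -> List[Position]:
    """Enumerate candidate positions, ordered first, second, third region.

    Assumed layout (not taken from the source):
      first region  - same line as the current block, entirely to its left
      second region - previous line, starting at the block's own column
      third region  - previous line, columns to the left of the block
    """
    positions: List[Position] = []

    # First region: a candidate must end before the current block begins,
    # so that every pixel it covers is already reconstructed.
    last_start = x0 - width
    for x in range(last_start - n_first + 1, last_start + 1):
        if x >= 0:
            positions.append((line, x))

    if line == 0:
        return positions  # no previous line is available inside the slice

    # Second region: previous line, columns at and after the block's column.
    for x in range(x0, x0 + n_second):
        if x + width <= slice_width:
            positions.append((line - 1, x))

    # Third region: previous line, columns not collocated with the block.
    for x in range(x0 - n_third, x0):
        if x >= 0:
            positions.append((line - 1, x))

    return positions
```

With `build_search_range(1, 8, 2, 4, 2, 2, 16)` the first region contributes four positions and the second and third regions two each, which matches the ordering of counts described in C7.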
[C9]
The method of C1, further comprising:
Determining a cost associated with coding the current block based on each potential candidate block of a plurality of potential candidate blocks, each potential candidate block corresponding to one of the first and second pixel locations in the first and second regions; and
Identifying, as the candidate block, the one of the plurality of potential candidate blocks in the first and second regions that has the lowest cost.
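The lowest-cost selection in C9 can be sketched as follows. The embodiment only speaks of "a cost"; the sum of absolute differences (SAD) used here, the `recon` layout (a list of reconstructed pixel lines), and all function names are assumptions chosen for illustration.

```python
from typing import List, Sequence, Tuple

# Hypothetical representation: (line, column) of a candidate's leftmost pixel.
Position = Tuple[int, int]


def sad(a: Sequence[int], b: Sequence[int]) -> int:
    """Sum of absolute differences, used here as an assumed cost metric."""
    return sum(abs(x - y) for x, y in zip(a, b))


def best_candidate(recon: List[List[int]], current: Sequence[int],
                   positions: Sequence[Position]) -> Tuple[Position, int]:
    """Return the position with the lowest cost, and that cost."""
    best_pos = None
    best_cost = None
    for line, col in positions:
        candidate = recon[line][col:col + len(current)]
        if len(candidate) != len(current):
            continue  # candidate would run past the end of the line
        cost = sad(current, candidate)
        if best_cost is None or cost < best_cost:
            best_pos, best_cost = (line, col), cost
    if best_pos is None:
        raise ValueError("no usable candidate position in the search range")
    return best_pos, best_cost
```

Ties are resolved in favor of the earliest position in `positions`; that tie-break is also a choice made for this sketch.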
[C10]
The method of C1, wherein the number of bits required to uniquely identify each pixel location in the range of pixel locations is equal to a first number, the method further comprising:
Determining that the current block is within a predetermined region in the current slice; and
Signaling the prediction vector using fewer than the first number of bits.
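The bit saving described in C10 follows from fixed-length coding: a range of N positions needs ceil(log2(N)) bits, so a block for which only part of the range is usable can be signaled with fewer bits. The sketch below assumes a fixed-length binary index and hypothetical function names; the embodiment does not specify the code or which region is "predetermined".

```python
def bits_needed(num_positions: int) -> int:
    """Smallest fixed-length width that distinguishes num_positions indices."""
    if num_positions < 1:
        raise ValueError("need at least one position")
    return (num_positions - 1).bit_length()


def encode_vector(index: int, num_valid_positions: int) -> str:
    """Encode a position index as a fixed-length binary string."""
    if not 0 <= index < num_valid_positions:
        raise ValueError("index outside the valid search range")
    width = bits_needed(num_valid_positions)
    return format(index, "b").zfill(width) if width else ""
```

For example, if the full range held 8 positions (3 bits) but only 4 of them were usable for a block in the first line of a slice (an assumed example of a predetermined region), `encode_vector(2, 4)` would emit 2 bits instead of 3.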
[C11]
The method of C1, further comprising:
Determining that the current block includes at least one pixel in the first line of pixels in the current slice and at least one pixel in a third line of pixels, the third line of pixels including at least one pixel in the current block and spanning the entire width of the current slice, the third line being different from the first line;
Determining a cost associated with coding the current block based on a first block, the first block including at least one pixel in the first region and at least one pixel in the second region; and
Determining, based on the cost associated with coding the current block based on the first block, the first block to be the candidate block used to predict the current block.
[C12]
The method of C1, further comprising:
Determining that the current block includes at least one pixel in the first line of pixels in the current slice and at least one pixel in a third line of pixels, the third line of pixels including at least one pixel in the current block and spanning the entire width of the current slice, the third line being different from the first line;
Determining a first cost associated with coding the current block based on a first block having fewer pixels than the current block, the first block including one or more pixels in each of the first and second regions;
Determining a second cost associated with coding the current block based on a second block having the same number of pixels as the current block, the second block including all of the one or more pixels in the first block and one or more additional pixels in each of the first and second regions; and
Determining the first block to be the candidate block used to predict the current block based on a determination that the second cost is greater than the first cost.
[C13]
An apparatus for coding a block of video data in a simplified block prediction mode of a constant bit rate video coding scheme, the apparatus comprising:
A memory configured to store one or more reconstructed pixels of a current slice of video data;
One or more processors in communication with the memory, the one or more processors being configured to:
Determine a candidate block to be used to predict a current block in the current slice, the candidate block being within a range of a plurality of pixel locations that each correspond to a reconstructed pixel in the current slice, the range of pixel locations including at least (i) a first region that includes one or more first pixel locations in a first line of pixels in the current slice, the first line of pixels including at least one pixel in the current block and spanning the entire width of the current slice, and (ii) a second region that includes one or more second pixel locations in a second line of pixels in the current slice, the second line of pixels not including any pixel in the current block and spanning the entire width of the current slice;
Determine a prediction vector indicative of a pixel location of the candidate block within the range of pixel locations, the pixel location of the candidate block being in one of the first region or the second region; and
Code the current block in the simplified block prediction mode, at least in part by signaling the prediction vector.
[C14]
The apparatus of C13, wherein the first line of pixels and the second line of pixels comprise two adjacent raster scan lines of the current slice.
[C15]
The apparatus of C13, wherein the current block is a 1 × 2 partition in a 2 × 8 block predicted in a simplified block prediction mode.
[C16]
The apparatus of C13, wherein the current block is a 2 × 2 partition in a 2 × 8 block predicted in a simplified block prediction mode.
[C17]
The apparatus of C13, wherein the range of pixel locations further includes a third region comprising one or more third pixel locations in the second line of pixels, and wherein the one or more third pixel locations do not include any pixel location in the second line that is collocated with a pixel location in the first line that is part of the current block.
[C18]
The apparatus of C17, wherein the second region and the third region occupy the same raster scan line.
[C19]
The apparatus of C17, wherein the first region includes a first number of pixel locations, the second region includes a second number of pixel locations, and the third region includes a third number of pixel locations, the first number being greater than the second number and greater than the third number.
[C20]
The apparatus according to C19, wherein the first, second, and third numbers are different from each other.
[C21]
The apparatus of C13, wherein the one or more processors are further configured to:
Determine a cost associated with coding the current block based on each potential candidate block of a plurality of potential candidate blocks, each potential candidate block corresponding to one of the first and second pixel locations in the first and second regions; and
Identify, as the candidate block, the one of the plurality of potential candidate blocks in the first and second regions that has the lowest cost.
[C22]
The apparatus of C13, wherein the number of bits required to uniquely identify each pixel location in the range of pixel locations is equal to a first number, and wherein the one or more processors are further configured to:
Determine that the current block is within a predetermined region in the current slice; and
Signal the prediction vector using fewer than the first number of bits.
[C23]
The apparatus of C13, wherein the one or more processors are further configured to:
Determine that the current block includes at least one pixel in the first line of pixels in the current slice and at least one pixel in a third line of pixels, the third line of pixels including at least one pixel in the current block and spanning the entire width of the current slice, the third line being different from the first line;
Determine a cost associated with coding the current block based on a first block, the first block including at least one pixel in the first region and at least one pixel in the second region; and
Determine, based on the cost associated with coding the current block based on the first block, the first block to be the candidate block used to predict the current block.
[C24]
The apparatus of C13, wherein the one or more processors are further configured to:
Determine that the current block includes at least one pixel in the first line of pixels in the current slice and at least one pixel in a third line of pixels, the third line of pixels including at least one pixel in the current block and spanning the entire width of the current slice, the third line being different from the first line;
Determine a first cost associated with coding the current block based on a first block having fewer pixels than the current block, the first block including one or more pixels in each of the first and second regions;
Determine a second cost associated with coding the current block based on a second block having the same number of pixels as the current block, the second block including all of the one or more pixels in the first block and one or more additional pixels in each of the first and second regions; and
Determine the first block to be the candidate block used to predict the current block based on a determination that the second cost is greater than the first cost.
[C25]
Non-transitory physical computer storage comprising code configured to code a block of video data in a simplified block prediction mode of a constant bit rate video coding scheme, wherein the code, when executed, causes a device to:
Determine a candidate block to be used to predict a current block in a current slice, the candidate block being within a range of a plurality of pixel locations that each correspond to a reconstructed pixel in the current slice, the range of pixel locations including at least (i) a first region that includes one or more first pixel locations in a first line of pixels in the current slice, the first line of pixels including at least one pixel in the current block and spanning the entire width of the current slice, and (ii) a second region that includes one or more second pixel locations in a second line of pixels in the current slice, the second line of pixels not including any pixel in the current block and spanning the entire width of the current slice;
Determine a prediction vector indicative of a pixel location of the candidate block within the range of pixel locations, the pixel location of the candidate block being in one of the first region or the second region; and
Code the current block in the simplified block prediction mode, at least in part by signaling the prediction vector.
[C26]
The non-transitory physical computer storage of C25, wherein the range of pixel locations further includes a third region comprising one or more third pixel locations in the second line of pixels, and wherein the one or more third pixel locations do not include any pixel location in the second line that is collocated with a pixel location in the first line that is part of the current block.
[C27]
The non-transitory physical computer storage of C26, wherein the first region includes a first number of pixel locations, the second region includes a second number of pixel locations, and the third region includes a third number of pixel locations, the first number being greater than the second number and greater than the third number.
[C28]
A video coding device configured to code a block of video data in a simplified block prediction mode of a constant bit rate video coding scheme, the video coding device comprising:
Means for determining a candidate block to be used to predict a current block in a current slice, the candidate block being within a range of a plurality of pixel locations that each correspond to a reconstructed pixel in the current slice, the range of pixel locations including at least (i) a first region that includes one or more first pixel locations in a first line of pixels in the current slice, the first line of pixels including at least one pixel in the current block and spanning the entire width of the current slice, and (ii) a second region that includes one or more second pixel locations in a second line of pixels in the current slice, the second line of pixels not including any pixel in the current block and spanning the entire width of the current slice;
Means for determining a prediction vector indicative of a pixel location of the candidate block within the range of pixel locations, the pixel location of the candidate block being in one of the first region or the second region; and
Means for coding the current block in the simplified block prediction mode, at least in part by signaling the prediction vector.
[C29]
The video coding device of C28, wherein the range of pixel locations further includes a third region comprising one or more third pixel locations in the second line of pixels, and wherein the one or more third pixel locations do not include any pixel location in the second line that is collocated with a pixel location in the first line that is part of the current block.
[C30]
The video coding device of C29, wherein the first region includes a first number of pixel locations, the second region includes a second number of pixel locations, and the third region includes a third number of pixel locations, the first number being greater than the second number and greater than the third number.

Claims

A method for coding a block of video data in a simplified block prediction mode of a constant bit rate video coding scheme, the method comprising:
Determining a candidate block to be used to predict a current block in a current slice, the candidate block being within a range of a plurality of pixel locations that each correspond to a reconstructed pixel in the current slice, the range of pixel locations including at least (i) a first region that includes one or more first pixel locations in a first line of pixels in the current slice, the first line of pixels including at least one pixel in the current block and spanning the entire width of the current slice, and (ii) a second region that includes one or more second pixel locations in a second line of pixels in the current slice, the second line of pixels not including any pixel in the current block and spanning the entire width of the current slice;
Determining a prediction vector indicative of a pixel location of the candidate block within the range of pixel locations, the pixel location of the candidate block being in one of the first region or the second region; and
Coding the current block in the simplified block prediction mode, at least in part by signaling the prediction vector.

The method of claim 1, wherein the first line of pixels and the second line of pixels comprise two adjacent raster scan lines of the current slice.

The method of claim 1, wherein the current block is a 1 × 2 partition within a 2 × 8 block predicted in a simplified block prediction mode.

The method of claim 1, wherein the current block is a 2 × 2 partition within a 2 × 8 block predicted in a simplified block prediction mode.

The method of claim 1, wherein the range of pixel locations further includes a third region comprising one or more third pixel locations in the second line of pixels, and wherein the one or more third pixel locations do not include any pixel location in the second line that is collocated with a pixel location in the first line that is part of the current block.

The method of claim 5, wherein the second region and the third region occupy the same raster scan line.

The method of claim 5, wherein the first region includes a first number of pixel locations, the second region includes a second number of pixel locations, and the third region includes a third number of pixel locations, the first number being greater than the second number and greater than the third number.

The method of claim 7, wherein the first, second, and third numbers are different from each other.

The method of claim 1, further comprising:
Determining a cost associated with coding the current block based on each potential candidate block of a plurality of potential candidate blocks, each potential candidate block corresponding to one of the first and second pixel locations in the first and second regions; and
Identifying, as the candidate block, the one of the plurality of potential candidate blocks in the first and second regions that has the lowest cost.

The method of claim 1, wherein the number of bits required to uniquely identify each pixel location in the range of pixel locations is equal to a first number, the method further comprising:
Determining that the current block is within a predetermined region in the current slice; and
Signaling the prediction vector using fewer than the first number of bits.

The method of claim 1, further comprising:
Determining that the current block includes at least one pixel in the first line of pixels in the current slice and at least one pixel in a third line of pixels, the third line of pixels including at least one pixel in the current block and spanning the entire width of the current slice, the third line being different from the first line;
Determining a cost associated with coding the current block based on a first block, the first block including at least one pixel in the first region and at least one pixel in the second region; and
Determining, based on the cost associated with coding the current block based on the first block, the first block to be the candidate block used to predict the current block.

The method of claim 1, further comprising:
Determining that the current block includes at least one pixel in the first line of pixels in the current slice and at least one pixel in a third line of pixels, the third line of pixels including at least one pixel in the current block and spanning the entire width of the current slice, the third line being different from the first line;
Determining a first cost associated with coding the current block based on a first block having fewer pixels than the current block, the first block including one or more pixels in each of the first and second regions;
Determining a second cost associated with coding the current block based on a second block having the same number of pixels as the current block, the second block including all of the one or more pixels in the first block and one or more additional pixels in each of the first and second regions; and
Determining the first block to be the candidate block used to predict the current block based on a determination that the second cost is greater than the first cost.

An apparatus for coding a block of video data in a simplified block prediction mode of a constant bit rate video coding scheme, the apparatus comprising:
A memory configured to store one or more reconstructed pixels of a current slice of video data;
One or more processors in communication with the memory, the one or more processors being configured to:
Determine a candidate block to be used to predict a current block in the current slice, the candidate block being within a range of a plurality of pixel locations that each correspond to a reconstructed pixel in the current slice, the range of pixel locations including at least (i) a first region that includes one or more first pixel locations in a first line of pixels in the current slice, the first line of pixels including at least one pixel in the current block and spanning the entire width of the current slice, and (ii) a second region that includes one or more second pixel locations in a second line of pixels in the current slice, the second line of pixels not including any pixel in the current block and spanning the entire width of the current slice;
Determine a prediction vector indicative of a pixel location of the candidate block within the range of pixel locations, the pixel location of the candidate block being in one of the first region or the second region; and
Code the current block in the simplified block prediction mode, at least in part by signaling the prediction vector.

The apparatus of claim 13, wherein the first line of pixels and the second line of pixels comprise two adjacent raster scan lines of the current slice.

The apparatus of claim 13, wherein the current block is a 1 × 2 partition within a 2 × 8 block predicted in a simplified block prediction mode.

The apparatus of claim 13, wherein the current block is a 2 × 2 partition within a 2 × 8 block predicted in a simplified block prediction mode.

The apparatus of claim 13, wherein the range of pixel locations further includes a third region comprising one or more third pixel locations in the second line of pixels, and wherein the one or more third pixel locations do not include any pixel location in the second line that is collocated with a pixel location in the first line that is part of the current block.

The apparatus of claim 17, wherein the second region and the third region occupy the same raster scan line.

The apparatus of claim 17, wherein the first region includes a first number of pixel locations, the second region includes a second number of pixel locations, and the third region includes a third number of pixel locations, the first number being greater than the second number and greater than the third number.

The apparatus of claim 19, wherein the first, second, and third numbers are different from each other.

The apparatus of claim 13, wherein the one or more processors are further configured to:
Determine a cost associated with coding the current block based on each potential candidate block of a plurality of potential candidate blocks, each potential candidate block corresponding to one of the first and second pixel locations in the first and second regions; and
Identify, as the candidate block, the one of the plurality of potential candidate blocks in the first and second regions that has the lowest cost.

The apparatus of claim 13, wherein the number of bits required to uniquely identify each pixel location in the range of pixel locations is equal to a first number, and wherein the one or more processors are further configured to:
Determine that the current block is within a predetermined region in the current slice; and
Signal the prediction vector using fewer than the first number of bits.

The apparatus of claim 13, wherein the one or more processors are further configured to:
Determine that the current block includes at least one pixel in the first line of pixels in the current slice and at least one pixel in a third line of pixels, the third line of pixels including at least one pixel in the current block and spanning the entire width of the current slice, the third line being different from the first line;
Determine a cost associated with coding the current block based on a first block, the first block including at least one pixel in the first region and at least one pixel in the second region; and
Determine, based on the cost associated with coding the current block based on the first block, the first block to be the candidate block used to predict the current block.

The apparatus of claim 13, wherein the one or more processors are further configured to:
Determine that the current block includes at least one pixel in the first line of pixels in the current slice and at least one pixel in a third line of pixels, the third line of pixels including at least one pixel in the current block and spanning the entire width of the current slice, the third line being different from the first line;
Determine a first cost associated with coding the current block based on a first block having fewer pixels than the current block, the first block including one or more pixels in each of the first and second regions;
Determine a second cost associated with coding the current block based on a second block having the same number of pixels as the current block, the second block including all of the one or more pixels in the first block and one or more additional pixels in each of the first and second regions; and
Determine the first block to be the candidate block used to predict the current block based on a determination that the second cost is greater than the first cost.

Non-transitory physical computer storage comprising code configured to code a block of video data in a simplified block prediction mode of a constant bit rate video coding scheme, wherein the code, when executed, causes a device to:
Determine a candidate block to be used to predict a current block in a current slice, the candidate block being within a range of a plurality of pixel locations that each correspond to a reconstructed pixel in the current slice, the range of pixel locations including at least (i) a first region that includes one or more first pixel locations in a first line of pixels in the current slice, the first line of pixels including at least one pixel in the current block and spanning the entire width of the current slice, and (ii) a second region that includes one or more second pixel locations in a second line of pixels in the current slice, the second line of pixels not including any pixel in the current block and spanning the entire width of the current slice;
Determine a prediction vector indicative of a pixel location of the candidate block within the range of pixel locations, the pixel location of the candidate block being in one of the first region or the second region; and
Code the current block in the simplified block prediction mode, at least in part by signaling the prediction vector.

The non-transitory physical computer storage of claim 25, wherein the range of pixel locations further includes a third region comprising one or more third pixel locations in the second line of pixels, and wherein the one or more third pixel locations do not include any pixel location in the second line that is collocated with a pixel location in the first line that is part of the current block.

The non-transitory physical computer storage of claim 26, wherein the first region includes a first number of pixel locations, the second region includes a second number of pixel locations, and the third region includes a third number of pixel locations, the first number being greater than the second number and greater than the third number.

A video coding device configured to code a block of video data in a simplified block prediction mode of a constant bit rate video coding scheme, the video coding device comprising:
Means for determining a candidate block to be used to predict a current block in a current slice, the candidate block being within a range of a plurality of pixel locations that each correspond to a reconstructed pixel in the current slice, the range of pixel locations including at least (i) a first region that includes one or more first pixel locations in a first line of pixels in the current slice, the first line of pixels including at least one pixel in the current block and spanning the entire width of the current slice, and (ii) a second region that includes one or more second pixel locations in a second line of pixels in the current slice, the second line of pixels not including any pixel in the current block and spanning the entire width of the current slice;
Means for determining a prediction vector indicative of a pixel location of the candidate block within the range of pixel locations, the pixel location of the candidate block being in one of the first region or the second region; and
Means for coding the current block in the simplified block prediction mode, at least in part by signaling the prediction vector.

The video coding device of claim 28, wherein the range of pixel locations further includes a third region comprising one or more third pixel locations in the second line of pixels, and wherein the one or more third pixel locations do not include any pixel location in the second line that is collocated with a pixel location in the first line that is part of the current block.

The video coding device of claim 29, wherein the first region includes a first number of pixel locations, the second region includes a second number of pixel locations, and the third region includes a third number of pixel locations, the first number being greater than the second number and greater than the third number.