JP6952513B2

JP6952513B2 - Mode information encoder, mode information decoder, and program

Info

Publication number: JP6952513B2
Application number: JP2017126358A
Authority: JP
Inventors: 俊枝三須; 市ヶ谷　敦郎; 敦郎市ヶ谷; 境田　慎一; 慎一境田
Original assignee: Japan Broadcasting Corp
Current assignee: Japan Broadcasting Corp
Priority date: 2017-06-28
Filing date: 2017-06-28
Publication date: 2021-10-20
Anticipated expiration: 2037-06-28
Also published as: JP2019009727A

Description

本発明は、モード情報符号化装置、モード情報復号装置、およびプログラムに関する。 The present invention relates to a mode information encoding device, a mode information decoding device, and a program.

画像符号化や映像符号化のイントラスライスの処理においては、画面内のすでに符号化済み領域内の情報に基づき、これから符号化すべき対象領域の画素値列を予測する。そして、予測によって得られた対象領域の画素値列と、対象領域の実際の画素値列との差分をとって、エントロピー符号化する。これにより、前記差分が統計的に０付近の値に偏在する傾向を活用して符号化効率の向上を実現している。 In the intra-slice processing of image coding and video coding, the pixel value sequence of the target area to be encoded is predicted based on the information in the already coded area in the screen. Then, the difference between the pixel value sequence of the target region obtained by the prediction and the actual pixel value sequence of the target region is taken and entropy-coded. As a result, the coding efficiency is improved by utilizing the tendency that the difference is statistically unevenly distributed to the value near 0.

例えば、MPEG-H HEVC/H.265（以下、「ＨＥＶＣ」と呼ぶ）においては、方向予測モード（３３種類）と、ＤＣ予測と、平面予測の計３５種類の画面内予測モードが利用可能である。このうち、方向予測モードでは、符号化対象ブロック近傍の参照画素値列を所定方向へ外挿することにより予測ブロックを得る。また、ＤＣ予測では、予測ブロック内の全画素を参照画素値列の平均値とする。また、平面予測では、参照画素値列に近似的な双一次補間を適用することで予測ブロックを得る。 For example, in MPEG-H HEVC / H.265 (hereinafter referred to as "HEVC"), a total of 35 types of in-screen prediction modes are available: direction prediction mode (33 types), DC prediction, and plane prediction. be. Of these, in the direction prediction mode, the prediction block is obtained by extrapolating the reference pixel value sequence in the vicinity of the coded block in the predetermined direction. Further, in the DC prediction, all the pixels in the prediction block are set as the average value of the reference pixel value sequence. Further, in the plane prediction, a prediction block is obtained by applying approximate bilinear interpolation to the reference pixel value sequence.

ＨＥＶＣにおいて、画面内予測はＴＵ（Transform Unit，トランスフォームユニット）と称するブロック単位で実行される。このとき、あるＴＵにおいて適用した画面内予測モードの情報を、符号化器側から復号器側へ通知する必要がある。ＨＥＶＣにおいては、注目するＴＵ（以下、「対象ＴＵ」と呼ぶ）の左に隣接するＴＵ（左に隣接するＴＵが複数ある場合には、それらのうち最も上のＴＵ）、および対象ＴＵの上に隣接するＴＵ（上に隣接するＴＵが複数ある場合には、そのうち最も左のＴＵ）の画面内予測モード番号に応じて対象ＴＵの画面内予測モード番号を符号化する。これにより、画面内予測モードの空間的相関を利用して、伝達する画面内予測モードのエントロピー削減を図っている。 In HEVC, in-screen prediction is executed in block units called TU (Transform Unit). At this time, it is necessary to notify the in-screen prediction mode information applied in a certain TU from the encoder side to the decoder side. In HEVC, the TU adjacent to the left of the TU of interest (hereinafter referred to as "target TU") (if there are multiple TUs adjacent to the left, the top TU among them), and above the target TU. The in-screen prediction mode number of the target TU is encoded according to the in-screen prediction mode number of the TU adjacent to (if there are a plurality of TUs adjacent to the top, the leftmost TU). As a result, the entropy of the transmitted in-screen prediction mode is reduced by utilizing the spatial correlation of the in-screen prediction mode.

また、特許文献１には、対象ＴＵに隣接するＴＵの画素値のパターンに応じて、対象ＴＵの画面内予測モード（方向予測の方向）を予測する手法が開示されている。 Further, Patent Document 1 discloses a method of predicting an in-screen prediction mode (direction of direction prediction) of a target TU according to a pattern of pixel values of a TU adjacent to the target TU.

特許第５５１４１３０号公報Japanese Patent No. 5514130

ＨＥＶＣにおける画面内予測モードの符号化法では、対象ＴＵの画面内予測モードが、隣接ＴＵの画面内予測モードと一致する可能性が高いという傾向を利用する。このとき、左または上に隣接するＴＵの画面内予測モードを既定の場合分けルールに当てはめ、対象ＴＵの画面内予測モードに対する3つの候補を画一的に定める。そして、これら３候補内に対象ＴＵの実際の画面内予測モードが含まれればそれら３候補のうち一を特定するインデックスを通知し、含まれなければ当該３候補を除いて数えたときの画面内予測モード番号を通知するよう動作する。 The in-screen prediction mode coding method in HEVC utilizes the tendency that the in-screen prediction mode of the target TU is likely to match the in-screen prediction mode of the adjacent TU. At this time, the in-screen prediction mode of the TU adjacent to the left or above is applied to the default case classification rule, and three candidates for the in-screen prediction mode of the target TU are uniformly determined. Then, if the actual in-screen prediction mode of the target TU is included in these three candidates, the index that identifies one of the three candidates is notified, and if it is not included, the in-screen screen when counting excluding the three candidates. It works to notify the prediction mode number.

例えば、左隣接ブロックが画面内予測モード１６（左斜め上、勾配２１／３２（約３３．３度）の方向を参照する方向予測）であり、上隣接ブロックが画面内予測モード１８（左斜め上、勾配３２／３２（４５．０度）方向を参照する方向予測）であった場合、ＨＥＶＣでは、モード１６、モード１８、およびモード０（平面予測）が前記３候補として設定される。しかしながら、この場合、モード１６とモード１８の中間の方向を参照するモード１７（左斜め上、勾配２６／３２（約３９．１度）の方向を参照する方向予測）が出現する確率も高いと考えられるにもかかわらず、モード１７は上記の３候補には含まれない。つまり、モード情報の通知のしかたが非効率となる可能性がある。 For example, the left adjacent block is the in-screen prediction mode 16 (direction prediction referring to the direction of the slope 21/32 (about 33.3 degrees) diagonally upward to the left), and the upper adjacent block is the in-screen prediction mode 18 (left diagonally upward). In the case of (direction prediction referring to the gradient 32/32 (45.0 degrees) direction), mode 16, mode 18, and mode 0 (plane prediction) are set as the three candidates in HEVC. However, in this case, there is a high probability that mode 17 (direction prediction that refers to the direction of the gradient 26/32 (about 39.1 degrees) diagonally upward to the left) that refers to the intermediate direction between modes 16 and 18 will appear. Despite the possibility, mode 17 is not included in the above three candidates. That is, the method of notifying the mode information may be inefficient.

また、ＨＥＶＣにおいては、対象ＴＵの画面内予測モードが前記３候補に含まれれば大幅なビット数削減が期待できる。その反面、３候補から漏れたものについては相対的にビット数が増え、損を余儀なくされる。 Further, in HEVC, if the in-screen prediction mode of the target TU is included in the above three candidates, a significant reduction in the number of bits can be expected. On the other hand, the number of bits for those leaked from the three candidates increases relatively, and a loss is unavoidable.

特許文献１の手法では、隣接ＴＵの画素値パターンの空間周波数に応じて対象ＴＵのイントラ予測モードの候補を決定する。この手法では規則的なパターンを有する画像領域に対しては適切なイントラ予測モード候補を提示することができる。しかしながら、画像パターンの規則性が乏しい場合や画像パターンが平坦な場合には、たとえイントラ予測モードがブロック間で統計的な相関性が強い場合であっても、その相関性を考慮した情報量削減には寄与しないという問題もある。 In the method of Patent Document 1, candidates for the intra prediction mode of the target TU are determined according to the spatial frequency of the pixel value pattern of the adjacent TU. In this method, an appropriate intra-prediction mode candidate can be presented for an image region having a regular pattern. However, when the regularity of the image pattern is poor or the image pattern is flat, even if the intra prediction mode has a strong statistical correlation between blocks, the amount of information is reduced in consideration of the correlation. There is also the problem that it does not contribute to.

また、ＨＥＶＣおよび特許文献１のいずれの手法においても、対象ＴＵのイントラ予測モード候補を絞り込む手法は画一的である。つまり、符号化対象の映像に応じてモード候補を変えることは行われていない。つまり、時々刻々の映像の特性に合わせたモード候補の生成を行っていないことにより、未だモード情報の符号化における効率改善の余地がある。 Further, in both the methods of HEVC and Patent Document 1, the method of narrowing down the intra-prediction mode candidates of the target TU is uniform. That is, the mode candidates are not changed according to the image to be encoded. That is, there is still room for improving the efficiency in coding the mode information because the mode candidates are not generated according to the characteristics of the video every moment.

本発明は、上記のような事情を考慮して為されたものであり、画像・映像を符号化する際の符号化対象ブロックのモード情報を、効率よく符号化し、そして復号するためのモード情報符号化装置、モード情報復号装置、およびプログラムを提供しようとするものである。 The present invention has been made in consideration of the above circumstances, and mode information for efficiently encoding and decoding the mode information of the coded target block when encoding an image / video. It is intended to provide a coding device, a mode information decoding device, and a program.

［１］上記の課題を解決するため、本発明の一態様によるモード情報符号化装置は、符号化済みの領域である既符号化領域内の学習用対象領域のモード情報と、前記既符号化領域内の前記学習用対象領域に対応する学習用参照領域のモード情報との組の出現頻度を表す頻度情報を生成する頻度情報生成部と、符号化対象の領域である対象領域に対応する参照領域のモード情報に関して前記頻度情報が保持する出現頻度の少なくとも一部に基づいて、前記対象領域のモード情報を順序化する順序化部と、前記対象領域のモード情報を、前記順序化部が求めた順序の少なくとも一部に基づく符号として符号化する符号化部と、を具備することを特徴とする。 [1] In order to solve the above problem, the mode information coding apparatus according to one aspect of the present invention includes the mode information of the learning target area in the coded area, which is the coded area, and the coded area. A frequency information generator that generates frequency information indicating the frequency of appearance of a set with the mode information of the learning reference area corresponding to the learning target area in the area, and a reference corresponding to the target area that is the coding target area. The ordering unit obtains an ordering unit for ordering the mode information of the target area and the mode information of the target area based on at least a part of the appearance frequency held by the frequency information regarding the mode information of the area. It is characterized by comprising a coding unit for coding as a code based on at least a part of the order.

［２］また、本発明の一態様は、上記のモード情報符号化装置において、前記順序化部は、前記参照領域のモード情報に関して前記頻度情報が保持する出現頻度のすべてを順序付けることによって、前記対象領域のモード情報を順序化し、前記符号化部は、前記対象領域のモード情報を、前記順序化部が求めた順序のすべてに基づく符号として符号化する、ことを特徴とする。 [2] Further, in one aspect of the present invention, in the mode information coding apparatus, the ordering unit orders all the appearance frequencies held by the frequency information with respect to the mode information of the reference region. The mode information of the target area is ordered, and the coding unit encodes the mode information of the target area as a code based on all the orders obtained by the ordering unit.

［３］また、本発明の一態様によるモード情報復号装置は、復号済みの領域である既復号領域内の学習用対象領域のモード情報と、前記既復号領域内の前記学習用対象領域に対応する学習用参照領域のモード情報との組の出現頻度を表す頻度情報を生成する頻度情報生成部と、復号対象の領域である対象領域に対応する参照領域のモード情報に関して前記頻度情報が保持する出現頻度の少なくとも一部に基づいて、前記対象領域のモード情報を順序化する順序化部と、入力された符号値に対応する順序を基に、順序化された前記対象領域のモード情報から選ばれる特定のモード情報を復号する復号部と、を具備することを特徴とする。 [3] Further, the mode information decoding device according to one aspect of the present invention corresponds to the mode information of the learning target area in the already decoded area, which is the decoded area, and the learning target area in the already decoded area. The frequency information holds the frequency information generation unit that generates frequency information indicating the frequency of appearance of a set with the mode information of the learning reference area to be used, and the mode information of the reference area corresponding to the target area that is the decoding target area. Select from the ordering unit that orders the mode information of the target area based on at least a part of the appearance frequency, and the mode information of the target area that is ordered based on the order corresponding to the input code value. It is characterized by including a decoding unit that decodes specific mode information.

［４］また、本発明の一態様は、上記のモード情報復号装置において、前記順序化部は、前記参照領域のモード情報に関して前記頻度情報が保持する出現頻度のすべてを順序付けることによって、前記対象領域のモード情報を順序化する、ことを特徴とする。 [4] Further, in one aspect of the present invention, in the mode information decoding apparatus, the ordering unit orders all the appearance frequencies held by the frequency information with respect to the mode information of the reference region. It is characterized in that the mode information of the target area is ordered.

［５］また、本発明の一態様は、コンピューターを、符号化済みの領域である既符号化領域内の学習用対象領域のモード情報と、前記既符号化領域内の前記学習用対象領域に対応する学習用参照領域のモード情報との組の出現頻度を表す頻度情報を生成する頻度情報生成部と、符号化対象の領域である対象領域に対応する参照領域のモード情報に関して前記頻度情報が保持する出現頻度の少なくとも一部に基づいて、前記対象領域のモード情報を順序化する順序化部と、前記対象領域のモード情報を、前記順序化部が求めた順序に基づく符号として符号化する符号化部と、を具備するモード情報符号化装置として機能させるためのプログラムである。 [5] Further, in one aspect of the present invention, the computer is divided into the mode information of the learning target area in the coded area, which is the coded area, and the learning target area in the coded area. The frequency information is related to the frequency information generation unit that generates frequency information indicating the frequency of appearance of the pair with the mode information of the corresponding learning reference area, and the mode information of the reference area corresponding to the target area that is the area to be encoded. An ordering unit that orders the mode information of the target area based on at least a part of the frequency of occurrence to be held, and the mode information of the target area are encoded as codes based on the order obtained by the ordering unit. It is a program for functioning as a mode information coding device including a coding unit.

［６］また、本発明の一態様は、コンピューターを、復号済みの領域である既復号領域内の学習用対象領域のモード情報と、前記既復号領域内の前記学習用対象領域に対応する学習用参照領域のモード情報との組の出現頻度を表す頻度情報を生成する頻度情報生成部と、復号対象の領域である対象領域に対応する参照領域のモード情報に関して前記頻度情報が保持する出現頻度の少なくとも一部に基づいて、前記対象領域のモード情報を順序化する順序化部と、入力された符号値に対応する順序を基に、順序化された前記対象領域のモード情報から選ばれる特定のモード情報を復号する復号部と、を具備するモード情報復号装置として機能させるためのプログラムである。 [6] Further, in one aspect of the present invention, the computer is subjected to learning corresponding to the mode information of the learning target area in the already decoded area, which is the decoded area, and the learning target area in the already decoded area. Frequency information generation unit that generates frequency information indicating the frequency of appearance of a set with the mode information of the reference area, and the appearance frequency held by the frequency information regarding the mode information of the reference area corresponding to the target area that is the decoding target area. An ordering unit that orders the mode information of the target area based on at least a part of the above, and a specification selected from the mode information of the target area that is ordered based on the order corresponding to the input code value. It is a program for functioning as a mode information decoding device including a decoding unit for decoding the mode information of the above.

本発明によれば、モード情報符号化装置は、モード情報の符号化効率を向上させることができる。また、モード情報復号装置は、モード情報符号化装置によって符号化されたモード情報を復号することができる。また、モード情報復号装置は、頻度情報（ヒストグラムデータ）を外部から（例えば、モード情報符号化装置から）受け取ることなく復号を行えるため、符号化効率を低下させる要因となる情報伝達のオーバヘッドが生じない。 According to the present invention, the mode information coding apparatus can improve the coding efficiency of mode information. Further, the mode information decoding device can decode the mode information encoded by the mode information coding device. Further, since the mode information decoding device can perform decoding without receiving frequency information (histogram data) from the outside (for example, from the mode information coding device), an overhead of information transmission that causes a decrease in coding efficiency occurs. No.

本発明の一実施形態によるモード情報符号化装置およびモード情報復号装置の概略機能構成を示したブロック図、およびモード情報符号化装置およびモード情報復号装置が扱う情報（符号化ブロックのモード情報）を示す概略図である。A block diagram showing a schematic functional configuration of a mode information coding device and a mode information decoding device according to an embodiment of the present invention, and information handled by the mode information coding device and the mode information decoding device (mode information of the coding block) are shown. It is a schematic diagram which shows. 同実施形態によるヒストグラム集計部１０の処理の要点を説明するための概略図である。It is the schematic for demonstrating the main point of processing of the histogram totaling part 10 by the same embodiment. 同実施形態によるソート部１１の処理の要点を説明するための概略図である。It is the schematic for demonstrating the main point of processing of the sort part 11 by the same embodiment. 同実施形態による符号化部１２の処理の要点を説明するための概略図である。It is a schematic diagram for demonstrating the main point of processing of the coding unit 12 by the same embodiment. 同実施形態による復号部４２の処理の要点を説明するための概略図である。It is the schematic for demonstrating the main point of processing of the decoding unit 42 by the same embodiment.

［実施形態］
次に、本発明の実施形態について、図面を参照しながら説明する。
図１は、本実施形態によるモード情報符号化装置およびモード情報復号装置の概略機能構成を説明するためのブロック図である。また、図１は、本実施形態によるモード情報符号化装置およびモード情報復号装置が扱う情報（符号化ブロックのモード情報）をも示す。 [Embodiment]
Next, an embodiment of the present invention will be described with reference to the drawings.
FIG. 1 is a block diagram for explaining a schematic functional configuration of a mode information coding device and a mode information decoding device according to the present embodiment. In addition, FIG. 1 also shows information (mode information of the coding block) handled by the mode information coding device and the mode information decoding device according to the present embodiment.

モード情報符号化装置１は、対象ブロック近傍の１個以上の参照ブロックのモード値を参照しながら、対象ブロックのモード値を符号化する。本実施形態によるモード情報符号化装置１は、対象ブロックを符号化する時点より前に既に符号化されたモード情報に基づいて、参照ブロックのモード値と対象ブロックのモード値の間の関連性を学習する。対象ブロックを符号化する時点より前に既に符号化されたモード情報とは、例えば、符号化順序における前フレーム以前のフレームに含まれるブロックのモード情報である。あるいは、対象ブロックと同じフレームに含まれ、既に符号化されているブロック（通常は、対象ブロックよりも上に存在するか、対象ブロックと同じ行の対象ブロックより左にあるブロック）のモード情報である。モード情報符号化装置１は、このような学習に基づきシーンに適応したモード値の変換を実現する点において、ＨＥＶＣとは性質を異にする符号化を行う。 The mode information coding device 1 encodes the mode value of the target block while referring to the mode value of one or more reference blocks in the vicinity of the target block. The mode information coding apparatus 1 according to the present embodiment determines the relationship between the mode value of the reference block and the mode value of the target block based on the mode information already encoded before the time when the target block is encoded. learn. The mode information already encoded before the time when the target block is encoded is, for example, the mode information of the block included in the frame before the previous frame in the coding order. Alternatively, in the mode information of a block that is contained in the same frame as the target block and is already encoded (usually a block that is above the target block or to the left of the target block on the same line as the target block). be. The mode information coding device 1 performs coding different from that of HEVC in that it realizes conversion of mode values adapted to the scene based on such learning.

図１に示すように、モード情報符号化装置１は、ヒストグラム集計部１０（頻度情報生成部）、ソート部１１（順序化部）と、符号化部１２とを含んで構成される。
ヒストグラム集計部１０は、符号化済みの領域である既符号化領域内の学習用対象領域のモード情報と、前記既符号化領域内の前記学習用対象領域に対応する学習用参照領域のモード情報との組の出現頻度を表す頻度情報（ヒストグラムデータ）を集計し、生成する。
ソート部１１は、符号化対象の領域である対象領域に対応する参照領域のモード情報に関して前記頻度情報（ヒストグラムデータ）が保持する出現頻度の少なくとも一部に基づいて、前記対象領域のモード情報を順序化する。また、本実施形態によるソート部１１は、前記参照領域のモード情報に関して前記頻度情報が保持する出現頻度のすべてをソート処理によって順序付けることによって、前記対象領域のモード情報を順序化する。
符号化部１２は、前記対象領域のモード情報を、ソート部１１が求めた順序の少なくとも一部に基づく符号として符号化する。また、本実施形態による符号化部１２は、前記対象領域のモード情報を、ソート部１１が求めた順序のすべてに基づく符号として符号化する。 As shown in FIG. 1, the mode information coding device 1 includes a histogram totaling unit 10 (frequency information generation unit), a sorting unit 11 (ordering unit), and a coding unit 12.
The histogram aggregation unit 10 includes mode information of the learning target area in the coded area, which is a coded area, and mode information of the learning reference area corresponding to the learning target area in the coded area. Frequency information (diaphragm data) indicating the frequency of appearance of the pair with is aggregated and generated.
The sort unit 11 obtains the mode information of the target area based on at least a part of the appearance frequency held by the frequency information (histogram data) with respect to the mode information of the reference area corresponding to the target area which is the area to be encoded. Order. Further, the sort unit 11 according to the present embodiment orders the mode information of the target area by ordering all the appearance frequencies held by the frequency information with respect to the mode information of the reference area by the sort process.
The coding unit 12 encodes the mode information of the target area as a code based on at least a part of the order obtained by the sorting unit 11. Further, the coding unit 12 according to the present embodiment encodes the mode information of the target area as a code based on all the orders obtained by the sorting unit 11.

また、エントロピー符号化部２は、モード情報符号化装置１が出力するモード情報の符号値を、エントロピー符号化する。
また、エントロピー復号部３は、エントロピー符号化部２によって符号化された符号を、モード情報符号化装置１が出力した符号に復号する。 Further, the entropy coding unit 2 entropy-codes the code value of the mode information output by the mode information coding device 1.
Further, the entropy decoding unit 3 decodes the code encoded by the entropy coding unit 2 into the code output by the mode information coding device 1.

また、モード情報復号装置４は、ヒストグラム集計部４０（頻度情報生成部）と、ソート部４１（順序化部）と、復号部４２とを含んで構成される。
ヒストグラム集計部４０は、復号済みの領域である既復号領域内の学習用対象領域のモード情報と、前記既復号領域内の前記学習用対象領域に対応する学習用参照領域のモード情報との組の出現頻度を表す頻度情報（ヒストグラムデータ）を生成する。
ソート部４１は、復号対象の領域である対象領域に対応する参照領域のモード情報に関して前記頻度情報が保持する出現頻度の少なくとも一部に基づいて、前記対象領域のモード情報を順序化する。また、本実施形態のソート部４１は、前記参照領域のモード情報に関して前記頻度情報が保持する出現頻度のすべてをソート処理で順序付けることによって、前記対象領域のモード情報を順序化する。
復号部４２は、入力された符号値に対応する順序を基に、順序化された前記対象領域のモード情報から選ばれる特定のモード情報を復号する。 Further, the mode information decoding device 4 includes a histogram aggregation unit 40 (frequency information generation unit), a sort unit 41 (ordering unit), and a decoding unit 42.
The histogram aggregation unit 40 is a set of the mode information of the learning target area in the already decoded area, which is the decoded area, and the mode information of the learning reference area corresponding to the learning target area in the already decoded area. Generates frequency information (histogram data) that represents the frequency of appearance of.
The sort unit 41 orders the mode information of the target area based on at least a part of the appearance frequency held by the frequency information with respect to the mode information of the reference area corresponding to the target area which is the area to be decoded. Further, the sort unit 41 of the present embodiment orders the mode information of the target area by ordering all the appearance frequencies held by the frequency information with respect to the mode information of the reference area by the sort process.
The decoding unit 42 decodes specific mode information selected from the ordered mode information of the target area based on the order corresponding to the input code value.

なお、モード情報符号化装置１およびモード情報復号装置４を構成する各部の機能や、エントロピー符号化部２およびエントロピー復号部３の機能は、例えば、電子回路を用いて構成する。また、各部の機能において、情報を記憶するために、必要に応じて、磁気ハードディスク装置や半導体メモリなどといった記憶手段を用いる。また、これら各部の機能を、コンピューターとプログラムとで実現するようにしてもよい。 The functions of the respective parts constituting the mode information coding device 1 and the mode information decoding device 4 and the functions of the entropy coding unit 2 and the entropy decoding unit 3 are configured by using, for example, an electronic circuit. Further, in the function of each part, a storage means such as a magnetic hard disk device or a semiconductor memory is used as necessary to store information. Further, the functions of each of these parts may be realized by a computer and a program.

なお、モード情報符号化装置１およびモード情報復号装置４は、モード情報をそれぞれ符号化し復号する装置である。モード情報以外の情報（例えば、画像または映像の内容そのものを表す情報など）は、必要に応じて、別途、符号化され、伝送されあるいは記録媒体等に記録され、そして復号される。 The mode information coding device 1 and the mode information decoding device 4 are devices that encode and decode mode information, respectively. Information other than the mode information (for example, information representing the content of the image or the video itself) is separately encoded, transmitted, recorded on a recording medium, or the like, and then decoded, if necessary.

また、同図において、２１は対象領域である。対象領域２１は１個のブロック（対象ブロック）を含む。６は、参照領域である。図示する例では、参照領域６は２個のブロック（参照ブロック）を含む。なお、本実施形態では１個の対象ブロックに２個の参照ブロックが対応しており、それらの参照ブロックは対象ブロックの左および上に隣接する。しかしながら、参照ブロックの個数は２以外であってもよい。そして、ｘは、対象ブロックのモード（対象モード）である。また、ｙ_１およびｙ_２は、それぞれ、２個の参照ブロックのモード（参照モード）である。また、５は、学習用対象領域および学習用参照領域をあわせた領域である。第ｎ番目の学習用対象ブロックに、２個の学習用参照ブロック（学習用対象ブロックの左および上に隣接）が対応している。第ｎ番目の学習用対象ブロックのモードが、ｘ^（ｎ）である。また、この学習用対象ブロックに対応する２個の学習用参照ブロックのモードは、それぞれ、ｙ_１ ^（ｎ）およびｙ_２ ^（ｎ）である。 Further, in the figure, 21 is a target area. The target area 21 includes one block (target block). Reference numeral 6 is a reference area. In the illustrated example, the reference region 6 includes two blocks (reference blocks). In this embodiment, two reference blocks correspond to one target block, and these reference blocks are adjacent to the left and above the target block. However, the number of reference blocks may be other than 2. And x is the mode (target mode) of the target block. Further, y ₁ and y ₂ are modes (reference modes) of two reference blocks, respectively. Reference numeral 5 denotes an area in which the learning target area and the learning reference area are combined. Two learning reference blocks (adjacent to the left and above the learning target block) correspond to the nth learning target block. The mode of the nth learning target block is x ⁽ⁿ⁾ . The modes of the two learning reference blocks corresponding to the learning target block are y ₁ ⁽ⁿ⁾ and y ₂ ⁽ⁿ⁾ , respectively.

また、２２は、モード情報復号装置４側における対象領域である。また、８は、参照領域である。また、７は、学習用対象領域および学習用参照領域をあわせた領域である。対象領域２２は、上記の対象領域２１に対応する。参照領域８は、上記の参照領域６に対応する。学習用対象領域および学習用参照領域７は、学習用対象領域および学習用参照領域５に対応する。そして、モード情報復号装置４側におけるｘ，ｙ_１，ｙ_２，ｘ^（ｎ），ｙ_１ ^（ｎ），ｙ_２ ^（ｎ）（ただし、ｎ＝１，２，・・・）は、モード情報符号化装置１側におけるそれらと同様のものであり、ここでは説明を省略する。 Reference numeral 22 denotes a target area on the mode information decoding device 4 side. Reference numeral 8 is a reference area. Reference numeral 7 denotes an area in which the learning target area and the learning reference area are combined. The target area 22 corresponds to the above-mentioned target area 21. The reference area 8 corresponds to the above-mentioned reference area 6. The learning target area and the learning reference area 7 correspond to the learning target area and the learning reference area 5. _{Then, x, y 1} , y ₂ , x ⁽ⁿ⁾ , y ₁ ⁽ⁿ⁾ , y ₂ ⁽ⁿ⁾ (where n = 1, 2, ...) On the mode information decoding device 4 side are the mode information. It is the same as those on the encoding device 1 side, and description thereof will be omitted here.

一般化すると、ｋ番目（ｋは１以上、且つＫ以下の整数。Ｋは１個の対象ブロックに対応する参照ブロックの個数であり、１以上の整数。）の参照ブロックのモード値を参照モードｙ_ｋと呼ぶ。また、対象ブロックのモード値を対象モードｘと呼ぶ。即ち、一つの対象ブロックに対し、参照ブロックの個数Ｋは１以上である。また、対象ブロック、参照ブロックともに、モード値の値域は１以上且つＭ以下（Ｍは２以上の整数）である。 In generalization, the mode value of the k-th (k is an integer greater than or equal to 1 and less than or equal to K. K is the number of reference blocks corresponding to one target block and is an integer of 1 or more) is referred to as the reference mode. Called y _k. Further, the mode value of the target block is called the target mode x. That is, the number K of reference blocks is 1 or more for one target block. Further, in both the target block and the reference block, the range of the mode value is 1 or more and M or less (M is an integer of 2 or more).

なお、対象ブロックの位置に対する各参照ブロックの相対位置は、モード情報符号化装置１とモード情報復号装置４とで共通の相対位置関係にある。例えば、モード情報符号化装置１とモード情報復号装置４との両者において、第1の参照ブロックは対象ブロックの左に隣接し、第２の参照ブロックは対象ブロックの上に隣接する。 The relative position of each reference block with respect to the position of the target block has a common relative positional relationship between the mode information coding device 1 and the mode information decoding device 4. For example, in both the mode information coding device 1 and the mode information decoding device 4, the first reference block is adjacent to the left of the target block, and the second reference block is adjacent to the top of the target block.

また、対象ブロックと参照ブロックとの間でブロックサイズが異なっていてもよい。その場合、例えば、第１の参照ブロック（対象ブロックの左に隣接）の右上端画素が、対象ブロックの左上端画素の１画素左隣に位置するようにする。また、その場合、例えば、第２の参照ブロックの左下端画素が対象ブロックの左上端画素の1画素上隣に位置するようにする。 Further, the block size may be different between the target block and the reference block. In that case, for example, the upper right pixel of the first reference block (adjacent to the left of the target block) is located to the left of one pixel of the upper left pixel of the target block. Further, in that case, for example, the lower left pixel of the second reference block is located one pixel above the upper left pixel of the target block.

また、学習用対象ブロック１個あたりの学習用参照ブロックの個数は、対象ブロック１個あたりの参照ブロックの個数と同一とする。また、学習用対象ブロックと学習用参照ブロックの相対位置関係は、対象ブロックと参照ブロックの相対位置関係と同一とする。つまり、図１の、符号５および符号６で示す通りである。学習用対象ブロックおよび学習用参照ブロックは、いずれも、映像領域（または映像部分領域（例えば、イントラ予測が適用される映像領域のみ））のうちすでにモード情報の符号化を終了している部分領域内（つまり、図１においてハッチングして示している領域。以下において、「既符号化領域」と呼ぶ。）に設定する。学習用対象ブロックおよび学習用参照ブロックの組は、既符号化領域内においてＮ組（Ｎは１以上の整数。Ｎは定数であっても変数（例えば、処理対象のブロックが進むごとにＮが増加するなど）であっても構わない。）である。これらの学習用対象ブロックおよび学習用参照ブロックの組を、後述のヒストグラム集計部１０における学習処理に利用する。 Further, the number of learning reference blocks per learning target block is the same as the number of reference blocks per target block. Further, the relative positional relationship between the learning target block and the learning reference block is the same as the relative positional relationship between the target block and the reference block. That is, as shown by reference numerals 5 and 6 in FIG. Both the learning target block and the learning reference block are partial regions of the video region (or the video partial region (for example, only the video region to which the intra prediction is applied)) for which the mode information has already been encoded. It is set within (that is, the area shown by hatching in FIG. 1; hereinafter, referred to as “already coded area”). The set of the learning target block and the learning reference block is N sets (N is an integer of 1 or more. Even if N is a constant, N is a variable (for example, N is each time the block to be processed advances). It does not matter if it increases, etc.). The set of the learning target block and the learning reference block is used for the learning process in the histogram aggregation unit 10 described later.

次に、ヒストグラム集計部１０の処理について詳細に説明する。
図２は、ヒストグラム集計部１０の処理の要点を説明するための概略図である。同図（Ａ）は、ヒストグラム集計部１０が頻度値をカウントするために参照する学習用対象領域および学習用参照領域と、符号化対象である対象領域および関連付けられた参照領域の位置関係を示す概略図である。同図（Ｂ）は、時点（ｎ−１）におけるヒストグラムＨ^{（ｎ−１）}（ヒストグラム５０）を、３次元のマトリックスとして示す概略図である。同図（Ｃ）は、時点（ｎ）におけるヒストグラムＨ^（ｎ）（ヒストグラム５３）を、３次元のマトリックスとして示す概略図である。即ち、同図（Ｂ）の状態から同図（Ｃ）の状態に移る間に、時間が進んでいる。 Next, the processing of the histogram totaling unit 10 will be described in detail.
FIG. 2 is a schematic diagram for explaining the main points of processing of the histogram totaling unit 10. FIG. (A) shows the positional relationship between the learning target area and the learning reference area referred to by the histogram aggregation unit 10 for counting the frequency value, and the target area to be encoded and the associated reference area. It is a schematic diagram. FIG. (B) is ^{a schematic view showing the histogram H (n-1)} (histogram 50) at the time point (n-1) as a three-dimensional matrix. FIG. (C) is ^{a schematic diagram showing the histogram H (n)} (histogram 53) at the time point (n) as a three-dimensional matrix. That is, time is advancing while shifting from the state shown in FIG. (B) to the state shown in FIG.

なお、同図（Ａ）における符号５１は、学習用対象モード値および学習用参照モード値である。同図（Ｂ）における符号５２は、ヒストグラム５０に含まれる一つの頻度値であり、Ｈ^{（ｎ−１）}（１，２，３）である。即ち、時点（ｎ−１）における、ｘ＝１，ｙ_１＝２，ｙ_２＝３の組の頻度値である。同図（Ｃ）における符号５４は、ヒストグラム５３に含まれる一つの頻度値であり、Ｈ^（ｎ）（１，２，３）である。即ち、時点（ｎ）における、ｘ＝１，ｙ_１＝２，ｙ_２＝３の組の頻度値である。 Reference numeral 51 in FIG. 5A is a learning target mode value and a learning reference mode value. Reference numeral 52 in FIG. 5B is one frequency value included in the histogram 50, and is H ^(n-1) (1, 2, 3). That is, it is the frequency value of the set of _{x = 1, y 1} = 2, y ₂ = 3 at the time point (n-1). Reference numeral 54 in FIG. 5C is one frequency value included in the histogram 53, and is H ⁽ⁿ⁾ (1, 2, 3). That is, it is the frequency value of the set of x = 1, y ₁ = 2, y _{2 = 3 at the time point (n).}

ヒストグラム集計部１０は、学習用参照モード値および学習用対象モード値の組に対して、その頻度分布を計数するためのメモリを備える。ヒストグラム集計部１０は、このメモリを用いて、学習用参照モード値および学習用対象モード値の組の出現を順次カウントしていく。 The histogram aggregation unit 10 includes a memory for counting the frequency distribution of a set of a learning reference mode value and a learning target mode value. The histogram aggregation unit 10 uses this memory to sequentially count the appearance of a set of a learning reference mode value and a learning target mode value.

具体的には、ヒストグラム集計部１０は、学習用対象モードＸ^（ｎ）および学習用参照モードＹ_１ ^（ｎ），Ｙ_２ ^（ｎ），・・・，Ｙ_Ｋ ^（ｎ）の頻度値であるＨ^（ｎ）(Ｘ，Ｙ_１，Ｙ_２，・・・，Ｙ_Ｋ）を数え上げていく。ここで、右上の（ｎ）は、インデックス値を示す。インデックス値ｎは整数である。ｎ≧１の場合には、Ｈ^（ｎ）は、総計ｎ組のモード情報を用いて構築したヒストグラムであることを表す。また、ｎ＝０の場合には、Ｈ^（ｎ）は、ヒストグラムを構築する際の初期値を表す。 Specifically, the histogram aggregation unit 10 is a frequency value of the ^{learning target mode X (n)} and the learning reference mode Y ₁ ⁽ⁿ⁾ , Y ₂ ⁽ⁿ⁾ , ..., Y _K ^(n). H ⁽ⁿ⁾ (X, Y ₁ , Y ₂ , ..., Y _K ) is counted. Here, (n) in the upper right indicates an index value. The index value n is an integer. When n ≧ 1, H ⁽ⁿ⁾ represents a histogram constructed by using a total of n sets of mode information. When n = 0, H ⁽ⁿ⁾ represents an initial value when constructing a histogram.

この初期値（ｎ＝０の場合）としては所定の値を適宜与えるようにする。ｎ＝０の場合に、例えば、すべての(Ｘ，Ｙ_１，Ｙ_２，・・・，Ｙ_Ｋ）の組に対して、Ｈ^（ｎ）(Ｘ，Ｙ_１，Ｙ_２，・・・，Ｙ_Ｋ）＝０としてもよい。あるいは、ｎ＝０の場合に、(Ｘ，Ｙ_１，Ｙ_２，・・・，Ｙ_Ｋ）の組のそれぞれの頻度値として、０以外の適切な値を与えるようにしてもよい。例えば、予め様々な映像を用いて計測しておいた典型的な頻度値分布を、Ｈ^（ｎ）(Ｘ，Ｙ_１，Ｙ_２，・・・，Ｙ_Ｋ）の値として与えるようにしてもよい。非零値のヒストグラムを初期値として与える場合、モード情報の学習に要する時間（言い換えれば、学習データ量）を小さくすることも可能となる。 As this initial value (when n = 0), a predetermined value is appropriately given. When n = 0, for example, for all (X, Y ₁ , Y ₂ , ..., Y _K ) pairs, H ⁽ⁿ⁾ (X, Y ₁ , Y ₂ , ..., Y _K ) = 0 may be set. Alternatively, when n = 0, an appropriate value other than 0 may be given as the frequency value of each set of _{(X, Y 1} , Y ₂ , ..., Y _K). For example, a typical frequency value distribution measured in advance using various images may be given as a value of H ⁽ⁿ⁾ (X, Y ₁ , Y ₂ , ..., Y _K ). good. When a non-zero histogram is given as an initial value, it is possible to reduce the time required for learning the mode information (in other words, the amount of training data).

ヒストグラム集計部１０は、既符号化領域内の学習用対象モードおよび参照モードの組を１組以上用いてヒストグラムＨ^（ｎ）を生成する。
例えばある時点において、モード情報符号化装置１が対象モードの符号化処理を行う場合には、ヒストグラム集計部１０は、１ブロック前の処理時点における対象ブロックおよび参照ブロックの位置に、学習用対象ブロックおよび学習用参照ブロックを置く。そして、ヒストグラム集計部１０は、学習用対象ブロックおよび学習用参照ブロックの組の情報に基づいて、ヒストグラムＨ^（ｎ）を更新する。そして、対象モードの符号化処理の進捗とともに、学習用対象ブロックおよび学習用参照ブロックも移動させて、順次ヒストグラムＨ^（ｎ）を更新していく。 The histogram aggregation unit 10 generates the ^{histogram H (n)} by using one or more sets of the learning target mode and the reference mode in the coded region.
For example, when the mode information coding device 1 performs the coding process of the target mode at a certain time point, the histogram aggregation unit 10 sets the target block for learning at the positions of the target block and the reference block at the processing time point one block before. And put a reference block for learning. Then, the histogram totaling unit 10 updates the ^{histogram H (n)} based on the information of the set of the learning target block and the learning reference block. Then, as the coding process of the target mode progresses, the learning target block and the learning reference block are also moved, and the histogram H ⁽ⁿ⁾ is sequentially updated.

ヒストグラム集計部１０は、学習用対象モードｘ^（ｎ）、および学習用参照モード（ｙ_１ ^（ｎ），ｙ_２ ^（ｎ），・・・，ｙ_Ｋ ^（ｎ））の組に基づき、下の式（１）によってヒストグラムをＨ^{（ｎ−１）}からＨ^（ｎ）に更新する。なお、ｘ^（ｎ），ｙ_１ ^（ｎ），ｙ_２ ^（ｎ），・・・，ｙ_Ｋ ^（ｎ）のそれぞれにおける右上の（ｎ）は、それらのモード情報が第ｎ番目（ｎ≧１）のモード情報（それぞれ、学習用対象モード及び学習用参照モード）であることを表す。 The histogram aggregation unit 10 is based on the set of the learning target mode x ⁽ⁿ⁾ and the learning reference mode (y ₁ ⁽ⁿ⁾ , y ₂ ⁽ⁿ⁾ , ..., Y _K ⁽ⁿ⁾ ) below. The histogram is updated from H ⁽ⁿ ^-1) to H (n) by the equation (1). In the ^{upper right (n)} ^{of each of x (n)} , y ₁ ⁽ⁿ⁾ , y ₂ ⁽ⁿ⁾ , ..., Y _K (n), their mode information is the nth (n ≧ 1). ) Mode information (learning target mode and learning reference mode, respectively).

なお、ｘ^（ｎ），ｙ_１ ^（ｎ），ｙ_２ ^（ｎ），・・・，ｙ_Ｋ ^（ｎ）以外のモードの組については、ヒストグラムの値を更新しない。 The histogram value is not updated for a set of modes other than ^{x (n)} , y ₁ ⁽ⁿ⁾ , y ₂ ⁽ⁿ⁾ , ..., Y _K ^(n).

図２に示す具体例において、ヒストグラム集計部１０の動作は次の通りである。図示するように、ヒストグラムＨ^{（ｎ−１）}およびＨ^（ｎ）は、それぞれ、モード情報ｘ，ｙ_１，ｙ_２による３次元のマトリクスとして表される。一般に、学習用対象モード１種類と学習用参照モードＫ種類の組であるモード情報のヒストグラムは、（Ｋ＋１）次元のマトリクスとして表される。それらのマトリクスの個々の要素が、各々、学習用対象モードおよび学習用参照モードＫ種類の組の頻度値である。 In the specific example shown in FIG. 2, the operation of the histogram totaling unit 10 is as follows. As shown, the histogram ^{H (n-1)} and ^{H (n),} respectively, mode information _x, expressed as a three-dimensional matrix according to y 1, _{y 2.} Generally, a histogram of mode information, which is a set of one learning target mode and K learning reference mode, is represented as a (K + 1) dimensional matrix. Each element of the matrix is a frequency value of a set of learning target modes and learning reference modes K, respectively.

同図において、符号５０は、時点（ｎ−１）におけるヒストグラムＨ^{（ｎ−１）}を示す。また、符号５３は、時点（ｎ）におけるヒストグラムＨ^（ｎ）を示す。同図における各升目（マトリクスの要素）がヒストグラムのビンを表し、升目内の数字が頻度値を表す。例えば、図２（Ａ）の符号５１で示す領域は、に示すように、学習用対象モードｘ^（ｎ）が１であり、学習用参照モードがｙ_１ ^（ｎ）およびｙ_２ ^（ｎ）がそれぞれ２および３である。時点（ｎ−１）において当該対象モードおよび当該参照モードのビンに記録された頻度値は、図２（Ｂ）の符号５２で示すように「２」であった。したがって、時点（ｎ）において当該対象モードおよび当該参照モードのビンに記録される頻度値は、図２（Ｃ）の符号５４で示すように「３」に更新される。なお、当該ビン以外の頻度値は、ヒストグラム時点（ｎ−１）から時点（ｎ）の間には変わらない。 In the figure, reference numeral 50 indicates a histogram H ^(n-1) at the time point (n-1). Reference numeral 53 indicates a histogram H ⁽ⁿ⁾ at the time point (n). Each square (matrix element) in the figure represents a histogram bin, and the numbers in the square represent the frequency value. For example, in the region indicated by reference numeral 51 in FIG. 2A, the learning target mode x ⁽ⁿ⁾ is 1, and the learning reference modes y ₁ ⁽ⁿ⁾ and y ₂ ⁽ⁿ⁾ are as shown in. 2 and 3, respectively. The frequency value recorded in the bin of the target mode and the reference mode at the time point (n-1) was “2” as shown by reference numeral 52 in FIG. 2 (B). Therefore, the frequency values recorded in the bins of the target mode and the reference mode at the time point (n) are updated to "3" as indicated by reference numeral 54 in FIG. 2C. The frequency values other than the bin do not change between the histogram time point (n-1) and the time point (n).

次に、ソート部１１の処理について詳細に説明する。
図３は、ソート部１１の処理の要点を説明するための概略図である。同図（Ａ）は、ソート部１１がソートしようとする頻度値の選択条件（参照ブロックのモード値による条件）を示す概略図である。同図（Ｂ）は、時点（ｎ）におけるヒストグラムＨ^（ｎ）から、選択条件（参照ブロックの値が、ｙ_１＝３且つｙ_２＝４）にしたがって選択される頻度値の集合を示す概略図である。同図（Ｃ）は、同図（Ｂ）で選択された頻度値をソートした結果（頻度値のソート結果５８）を数式で示す概略図である。同図（Ｄ）は、ソート結果に基づいて定まった順位ｄに対応付けたモード値Ｘ_ｄ（対象モード値６０）を示す概略図である。 Next, the processing of the sort unit 11 will be described in detail.
FIG. 3 is a schematic view for explaining the main points of the processing of the sort unit 11. FIG. 3A is a schematic diagram showing a selection condition (condition according to the mode value of the reference block) of the frequency value that the sort unit 11 intends to sort. FIG. (B) is an outline showing a set of frequency values ^{selected from the histogram H (n)} at the time point (n) according to the selection condition (the value of the reference block is y ₁ = 3 and y _{2 = 4).} It is a figure. FIG. 3C is a schematic diagram showing a result of sorting the frequency values selected in FIG. 3B (frequency value sorting result 58) by a mathematical formula. _{FIG. 3D is a schematic diagram showing a mode value X d} (target mode value 60) associated with a rank d determined based on a sort result.

ソート部１１は、ヒストグラム集計部１０がヒストグラムを集計済みであることを前提として、以下の処理を行う。即ち、ソート部１１は、その時点における符号化対象である対象領域に対応付けられた参照領域の参照モード値列(ｙ_１，ｙ_２，・・・，ｙ_Ｋ)に基づき、ヒストグラムＨ^（ｎ）(Ｘ，Ｙ_１，Ｙ_２，・・・，Ｙ_Ｋ)に含まれる頻度値の一部をソートする。具体的には、ソート部１１は、ヒストグラムＨ^（ｎ）(Ｘ，Ｙ_１，Ｙ_２，・・・，Ｙ_Ｋ)に含まれる全ビンのうち、Ｙ_１＝ｙ_１，Ｙ_２＝ｙ_２，・・・，Ｙ_Ｋ＝ｙ_Ｋであるビンを、頻度値の降順にソートする。ソート部１１は、このソート処理により、対象モード値Ｘを順序付ける。すなわち、ソート部１１は、すべてのＸ∈{１，２，・・・，Ｍ}について、頻度値Ｈ^（ｎ）（Ｘ，ｙ_１，ｙ_２，・・・，ｙ_Ｋ)を比較し、頻度値の高い方から順にＸを並べる。
The sort unit 11 performs the following processing on the premise that the histogram aggregation unit 10 has already aggregated the histogram. That is, the sort unit 11 has a histogram H ⁽ⁿ _{) based on the reference mode value sequence (y 1} , y ₂ , ..., Y _K ) of the reference area associated with the target area to be encoded at that time. ⁾ Sort some of the frequency values included in (X, Y ₁ , Y ₂ , ..., Y _K). Specifically, the sort unit 11 has _{Y 1} = y ₁ , Y ₂ = y ₂ among all the bins included ^{in the histogram H (n)} (X, Y ₁ , Y ₂ , ..., Y _K). , ..., Sort the bins _{with Y K} = y _K in descending order of frequency value. The sort unit 11 orders the target mode value X by this sort process. That is, the sort unit 11 compares the frequency values H ⁽ⁿ⁾ (X, y ₁ , y ₂ , ..., Y _K ) for all X ∈ {1, 2, ..., M}. Xs are arranged in order from the one with the highest frequency value.

ソート部１１によるこのソート処理を数式で表すと、下の式（２）の通りである。 The sort process by the sort unit 11 is expressed by a mathematical formula as shown in the following formula (2).

つまり、ソート部１１は、ソート処理により、式（２）を満たす数列（Ｘ_ｍ）_{ｍ＝１，２，・・・，Ｍ}を求める。ただし、この数列は、同じ値の項を含まない。また、すべてのi∈{１，２，・・・，Ｍ}に対し、Ｘ_ｉ∈{１，２，・・・，Ｍ}である。 That is, the sort unit 11 obtains a _{sequence (X m} ) _{m = 1, 2, ..., M} satisfying the equation (2) by the sort process. However, this sequence does not contain terms with the same value. Also, for all _i ∈ {1, 2, ..., M}, X i ∈ {1, 2, ..., M}.

なお、Ｈ^（ｎ）(Ｘ，Ｙ_１，Ｙ_２，・・・，Ｙ_Ｋ)が同じになるＸが複数存在する場合があり得る。即ち、式（２）における不等号「≦」において、等号「＝」が成立する箇所が一箇所以上存在する場合があり得る。そのような場合には、ソート処理の結果として、数列（Ｘ_ｍ）_{ｍ＝１，２，・・・，Ｍ}が唯一に定まらない。その場合には、ソート部１１は、例えば、予め定めたルール等によりソート結果の順序を確定させる。そのようなルールの一例は、「頻度において同順位のモード値のうち、モード値がより小さいほうが（あるいはより大きいほうが）頻度が高かったものとみなす」といったルールである。勿論、他のルール等に依ってソート結果を確定させてもよい。 It should be noted that there may be a plurality of Xs having the same ^{H (n)} (X, Y ₁ , Y ₂ , ..., Y _K). That is, in the inequality sign “≦” in the equation (2), there may be one or more places where the equal sign “=” holds. In such a case, as a result of the sorting process, the sequence (X _m ) _{m = 1, 2, ..., M} is not uniquely determined. In that case, the sort unit 11 determines the order of the sort results according to, for example, a predetermined rule or the like. An example of such a rule is a rule such as "among the mode values having the same rank in terms of frequency, the smaller (or larger) mode value is regarded as having the higher frequency". Of course, the sort result may be determined according to other rules or the like.

つまり、ソート部１１は、式（２）および式（３）を満たす数列（Ｘ_ｍ）_{ｍ＝１，２，・・・，Ｍ}を求める。 That is, the sort unit 11 obtains a _{sequence (X m} ) _{m = 1, 2, ..., M} that satisfies the equations (2) and (3).

図３（Ａ）に示す例において、現時点でモード情報を符号化しようとしている対象領域５５に含まれるブロック（対象ブロック）のモード情報（対象モード値）はｘである。また、その対象ブロックに関連付けられる参照領域５６に含まれる２つのブロック（参照ブロック）のモード情報（参照モード値）は、それぞれ、ｙ_１＝３、ｙ_２＝４である。この参照モード値の組に関する各対象モード値ｘの頻度値は、図３（Ｂ）に示すヒストグラムＨ^（ｎ）における射影５７である。即ち、ｘ＝０，１，２，３，および４のそれぞれに対応する頻度値は、１，４，３，１，および５である。ソート部１１がこれらの頻度値を降順にソートした結果、図３（Ｃ）に示す頻度値のソート結果５８が得られる。即ち、ソート結果５８に含まれる頻度値は、次の通りである。
Ｈ^（ｎ）（４，３，４）＝５
Ｈ^（ｎ）（１，３，４）＝４
Ｈ^（ｎ）（２，３，４）＝３
Ｈ^（ｎ）（０，３，４）＝１
Ｈ^（ｎ）（３，３，４）＝１ In the example shown in FIG. 3A, the mode information (target mode value) of the block (target block) included in the target area 55 for which the mode information is to be encoded at the present time is x. Further, the mode information (reference mode value) of the two blocks (reference block) included in the reference area 56 associated with the target block is y ₁ = 3 and y ₂ = 4, respectively. The frequency value of each target mode value x with respect to this set of reference mode values is the projection 57 in the ^{histogram H (n) shown in FIG. 3 (B).} That is, the frequency values corresponding to x = 0, 1, 2, 3, and 4, respectively, are 1, 4, 3, 1, and 5. As a result of the sorting unit 11 sorting these frequency values in descending order, the sorting result 58 of the frequency values shown in FIG. 3C is obtained. That is, the frequency values included in the sort result 58 are as follows.
H ⁽ⁿ⁾ (4,3,4) = 5
H ⁽ⁿ⁾ (1,3,4) = 4
H ⁽ⁿ⁾ (2,3,4) = 3
H ⁽ⁿ⁾ (0,3,4) = 1
H ⁽ⁿ⁾ (3,3,4) = 1

即ち、これらの頻度値の相互の関係は、次の通りである。
Ｈ^（ｎ）（４，３，４）＞Ｈ^（ｎ）（１，３，４）＞Ｈ^（ｎ）（２，３，４）＞Ｈ^（ｎ）（０，３，４）＝Ｈ^（ｎ）（３，３，４） That is, the mutual relationship between these frequency values is as follows.
H ⁽ⁿ⁾ (4,3,4)> H ⁽ⁿ⁾ (1,3,4)> H ⁽ⁿ⁾ (2,3,4)> H ⁽ⁿ⁾ (0,3,4) = H ^{( n)} (3,3,4)

なお、頻度値が等しいビンを表す等号５９が表すように、ｘ＝０の場合とｘ＝３の場合の頻度値はそれぞれ１であり、同値である。ここで、上述した式（３）を適用すると、順位ｄ（１≦ｄ≦５）の対象モードＸ_ｄの値は、図３（Ｄ）に示す対象モード値６０の通りである。即ち、次の通りに一意に定まる。
ｘ_１＝４，ｘ_２＝１，ｘ_３＝２，ｘ_４＝０，ｘ_５＝３ As indicated by the equal sign 59 representing bins having the same frequency value, the frequency values in the case of x = 0 and the case of x = 3 are 1 respectively, which are the same values. Here, when the above equation (3) is applied, _{the value of the target mode X d} of the rank d (1 ≦ d ≦ 5) is as the target mode value 60 shown in FIG. 3 (D). That is, it is uniquely determined as follows.
x ₁ = 4, x ₂ = 1, x ₃ = 2, x ₄ = 0, x ₅ = 3

次に、符号化部１２の処理について詳細に説明する。
図４は、符号化部１２の処理の要点を説明するための概略図である。同図は、ソート部１１によって求められた数値列Ｘ_ｄ（１≦ｄ≦５）と、対象ブロックのモード値および参照ブロックのモード値とに基づいて、符号値ｃを求める一連の処理の例を示している。 Next, the processing of the coding unit 12 will be described in detail.
FIG. 4 is a schematic diagram for explaining the main points of the processing of the coding unit 12. _{The figure shows an example of a series of processes for obtaining the code value c based on the numerical sequence X d} (1 ≦ d ≦ 5) obtained by the sort unit 11 and the mode value of the target block and the mode value of the reference block. Is shown.

符号化部１２は、ソート部１１が求めた数列（Ｘ_ｍ）_{ｍ＝１，２，・・・，Ｍ}に基づき、対象モードxを符号化して、符号cを出力する。そのため、符号化部１２は、対象モードｘの頻度値が、Ｈ^（ｎ）（Ｘ＝１，ｙ_１，ｙ_２，・・・，ｙ_Ｋ），Ｈ^（ｎ）（Ｘ＝２，ｙ_１，ｙ_２，・・・，ｙ_Ｋ），・・・，Ｈ^（ｎ）（Ｘ＝Ｍ，ｙ_１，ｙ_２，・・・，ｙ_Ｋ）の中で何番目に高い頻度であったかを求める。つまり、符号化部１２は、対象モードｘの、頻度値における順位を求める。そして、符号化部１２は、この順位に基づいて符号ｃを定める。 The coding unit 12 encodes the target mode x based on _{the sequence (X m} ) _{m = 1, 2, ..., M} obtained by the sorting unit 11 and outputs the code c. Therefore, in the coding unit 12, the frequency value of the target mode x is H ⁽ⁿ⁾ (X = 1, y ₁ , y ₂ , ..., Y _K ), H ⁽ⁿ⁾ (X = 2, y _1). , Y ₂ , ..., y _K ), ..., H ⁽ⁿ⁾ (X = M, y ₁ , y ₂ , ..., y _K ) .. That is, the coding unit 12 obtains the rank of the target mode x in the frequency value. Then, the coding unit 12 determines the code c based on this order.

例えば、符号化部１２は、下の式（４）を満たす順位ｄに基づいて符号ｃを定める。 For example, the coding unit 12 determines the code c based on the order d that satisfies the following equation (4).

符号化部１２は、一例として、下の式（５）により符号ｃを定める。 As an example, the coding unit 12 determines the reference numeral c by the following equation (5).

式（５）においてｄから１を減じた値を求めているのは、符号値cが０から始まるように、順位ｄの値をオフセットしているためである。 In the equation (5), the value obtained by subtracting 1 from d is obtained because the value of the rank d is offset so that the sign value c starts from 0.

図４に示す例において、対象領域６１に含まれるブロック（対象ブロック）のモード値は４である。また、参照モード値がそれぞれｙ_１＝３、ｙ_２＝４である点は、図３における状況と同様である。ソート部１１が求めた順位ごとの対象モード値６２は図示する通りである。符号化部１２は、この対象モード値６２に照らして、対象モード値「４」の順位を求める。即ち、符号化部１２は、順位としてｄ＝１を得る。そして、式（５）で表した符号化を行う場合、符号化部１２は、ｄ＝１に対応させて、符号値ｃ＝０を得る。 In the example shown in FIG. 4, the mode value of the block (target block) included in the target area 61 is 4. Further, the point that the reference mode values are y ₁ = 3 and y ₂ = 4, respectively, is the same as the situation in FIG. The target mode value 62 for each rank obtained by the sort unit 11 is as shown in the figure. The coding unit 12 obtains the order of the target mode value “4” in light of the target mode value 62. That is, the coding unit 12 obtains d = 1 as the order. Then, when the coding represented by the equation (5) is performed, the coding unit 12 obtains the code value c = 0 in correspondence with d = 1.

なお、上記の式（５）を適用する場合、符号値ｃは常に非負整数である。また、符号値ｃが０寄りの小さな値であるほど、そのモードの発生確率は大きいという傾向がある。つまり、モード情報符号化装置１を用いことで、対象モードの発生確率が大きいほど、符号値cの発生確率分布は０寄りの小さな値になるという傾向が維持される。これは、ヒストグラム集計部１０がヒストグラムを集計した画像領域と、ソート部１１および符号化部１２が各処理を行った画像領域との間で、参照モード値と対象モード値の相関性が維持されているという前提を利用するものである。これにより、符号値ｃのエントロピーを削減することが可能となる。 When the above equation (5) is applied, the code value c is always a non-negative integer. Further, the smaller the code value c is closer to 0, the higher the probability of occurrence of the mode tends to be. That is, by using the mode information coding device 1, the tendency that the occurrence probability distribution of the code value c becomes a small value closer to 0 is maintained as the probability of occurrence of the target mode increases. This is because the correlation between the reference mode value and the target mode value is maintained between the image area in which the histogram totaling unit 10 aggregates the histogram and the image area in which the sorting unit 11 and the coding unit 12 perform each processing. It uses the premise that it is. This makes it possible to reduce the entropy of the code value c.

なお、図１に示したエントロピー符号化部２は、符号化部１２が出力する符号値ｃをエントロピー符号化する。これにより、モード情報のより一層の圧縮が可能となる。言い換えれば、エントロピー符号化部２からエントロピー復号部３に伝達されるモード情報を表現するビット数を削減することが可能となる。 The entropy coding unit 2 shown in FIG. 1 entropy-codes the code value c output by the coding unit 12. This makes it possible to further compress the mode information. In other words, it is possible to reduce the number of bits expressing the mode information transmitted from the entropy coding unit 2 to the entropy decoding unit 3.

エントロピー符号化部２が用いる符号化手法として、例えば、ハフマン符号や、算術符号や、これらの派生であるＣＡＶＬＣ（コンテキスト適応型可変長符号化方式）やＣＡＢＡＣ（コンテクスト適応型算術符号化方式）を用いることができる。また、前記のように符号値cの発生確率分布が０付近で高くなるようにする場合、ゴロム符号（Golomb coding）やその派生である指数ゴロム符号（Exponential-Golomb coding）との整合性も高く、エントロピー符号化部２がこれらを用いるようにしてもよい。
なお、エントロピー復号部３は、エントロピー符号化部２と対になる手法を用いて、符号化されたモード情報を復号する。 As the coding method used by the entropy coding unit 2, for example, Huffman code, arithmetic code, CAVLC (context-adaptive variable-length coding method) and CABAC (context-adaptive arithmetic coding method), which are derivatives of these, are used. Can be used. In addition, when the probability distribution of the code value c is set to be high near 0 as described above, the consistency with the Golomb coding and its derivative, the exponential-Golomb coding, is also high. , The entropy coding unit 2 may use these.
The entropy decoding unit 3 decodes the encoded mode information by using a method of pairing with the entropy coding unit 2.

エントロピー符号化部２による処理と、エントロピー復号部３による処理とを順次適用したとき、情報の損失がない（ロスレスである）ことが望まれる。 When the processing by the entropy coding unit 2 and the processing by the entropy decoding unit 3 are sequentially applied, it is desired that there is no loss of information (lossless).

次に、ヒストグラム集計部４０、ソート部４１、および復号部４２の処理について詳細に説明する。
図５は、復号部４２の処理の要点を説明するための概略図である。同図は、ソート部４１によって求められた数値列Ｘ_ｄ（１≦ｄ≦５）と、符号値ｃとに基づいて、対象ブロックのモード値（ｘ＝４）を求める一連の処理の例を示している。 Next, the processing of the histogram totaling unit 40, the sorting unit 41, and the decoding unit 42 will be described in detail.
FIG. 5 is a schematic diagram for explaining the main points of processing of the decoding unit 42. The figure shows an example of a series of processes for obtaining the mode value (x = 4) of the target block based on _{the numerical string X d} (1 ≦ d ≦ 5) obtained by the sort unit 41 and the code value c. Shown.

ヒストグラム集計部４０は、既復号領域内の学習用対象領域のモード値（学習用対象モード）および学習用参照領域のモード値（学習用参照モード）の組を１組以上用いてヒストグラムＨ^（ｎ）を生成する。なお、既復号領域とは、図１において、学習用対象領域および学習用参照領域７を含み、ハッチングで示している領域である。なお、ヒストグラム集計部１０とヒストグラム集計部４０との間で、学習用対象領域と学習用参照領域の位置を共通に設定する。即ち、ヒストグラム集計部４０は、ヒストグラム集計部１０が頻度値を求める対象である既符号化領域内の学習用対象領域および学習用参照領域５と、同フレーム且つ同位置の既復号領域内の学習用対象領域と学習用参照領域７を用いてヒストグラムＨ^（ｎ）を構築する。
なお、ヒストグラム集計部４０がヒストグラムＨ^（ｎ）を生成する算法は、ヒストグラム集計部１０がヒストグラムＨ^（ｎ）を生成する算法と同一であるので、ここでの説明を省略する。 The histogram aggregation unit 40 uses one or more sets of mode values of the learning target area (learning target mode) and mode values of the learning reference area (learning reference mode) in the already decoded area, and uses the histogram H ^{(n). )} Is generated. The already decoded area is an area shown by hatching in FIG. 1, including a learning target area and a learning reference area 7. The positions of the learning target area and the learning reference area are commonly set between the histogram totaling unit 10 and the histogram totaling unit 40. That is, the histogram aggregation unit 40 learns in the learning target area and the learning reference area 5 in the already coded region, which is the target for which the histogram aggregation unit 10 obtains the frequency value, and in the already decoded region in the same frame and at the same position. ^{A histogram H (n)} is constructed using the target area for use and the reference area for learning 7.
Since the calculation method in which the histogram totaling unit 40 ^{generates the histogram H (n)} is the same as the calculation method in which the histogram totaling unit 10 ^{generates the histogram H (n)} , the description thereof will be omitted here.

ソート部４１は、復号しようとしている対象領域に対応付けられた参照領域の参照モード値列（ｙ_１，ｙ_２，・・・，ｙ_Ｋ)に基づき、対象モード値Ｘを順序付けする。具体的には、ソート部４１は、ヒストグラム集計部４０が生成したヒストグラムＨ^（ｎ）（Ｘ，Ｙ_１，Ｙ_２，・・・，Ｙ_Ｋ)の全ビンのうち、Ｙ_１＝ｙ_１，Ｙ_２＝ｙ_２，・・・，Ｙ_Ｋ＝ｙ_Ｋビンについて、頻度値を降順にソートする。そして、ソート部４１は、このソート結果にしたがって対象モード値Ｘを順序付けする。なお、ソート部４１は、ソート部１１による処理と同様の算法により、数列（Ｘ_ｍ）_{ｍ＝１，２，・・・，Ｍ}を求める。 The sort unit 41 orders the target mode values X based on _{the reference mode value strings (y 1} , y ₂ , ..., Y _K ) of the reference area associated with the target area to be decoded. Specifically, the sort unit 41 has _{Y 1} = y ₁ , among all the bins of ^{the histogram H (n)} (X, Y ₁ , Y ₂ , ..., Y _{K) generated by the histogram aggregation unit 40.} For Y ₂ = y ₂ , ..., Y _K = y _K bins, the frequency values are sorted in descending order. Then, the sort unit 41 orders the target mode values X according to the sort result. The sort unit 41 obtains the _{sequence (X m} ) _{m = 1, 2, ..., M} by the same arithmetic method as the processing by the sort unit 11.

復号部４２は、ソート部４１が求めた数列（Ｘ_ｍ）_{ｍ＝１，２，・・・，Ｍ}に基づき、受け取った符号cを対象モード値ｘに復号して、対象モード値ｘを出力する。
具体的には、モード情報符号化装置１内の符号化部１２に対応する方法で、順位ｄを求める。符号化部１２が式（５）によって符号ｃを求めた場合には、復号部４２は、下の式（６）により順位ｄを求める。 The decoding unit 42 decodes the received code c into the target mode value x based on _{the sequence (X m} ) _{m = 1, 2, ..., M} obtained by the sort unit 41, and outputs the target mode value x. do.
Specifically, the rank d is obtained by a method corresponding to the coding unit 12 in the mode information coding device 1. When the coding unit 12 obtains the code c by the equation (5), the decoding unit 42 obtains the rank d by the following equation (6).

順位ｄが得られると、復号部４２は、続いて、式（４）と同一の関係を表す下の式（７）により、対象モード値ｘを求める。 When the rank d is obtained, the decoding unit 42 subsequently obtains the target mode value x by the following equation (7) expressing the same relationship as the equation (4).

図５に示す例に沿って、上述した、ヒストグラム集計部４０と、ソート部４１と、復号部４２の一連の処理の流れを説明する。
本例では、モード情報復号装置４が受け取る符号値ｃは０である。この符号値は、モード情報符号化装置１内の符号化部１２で求められたものである。ｃ＝０であるので、式（６）により、復号部４２は、ｄ＝１を得る。一方、ソート部４１は、頻度値の降順に並べた対象モードの数列を出力する。図示する例では、Ｘ_１＝４，Ｘ_２＝１，Ｘ_３＝２，Ｘ_４＝０，Ｘ_５＝４である。この数列は、参照モード値として、ｙ_１＝３，ｙ_２＝４を前提とするものである。そして、復号部４２は、ソート部４１から出力された数列に照らして、ｄ＝１の場合の対象モード値であるＸ_１＝４を得る。即ち、復号結果として対象モード値ｘ＝４を得る。 A series of processing flows of the histogram totaling unit 40, the sorting unit 41, and the decoding unit 42 described above will be described with reference to the example shown in FIG.
In this example, the code value c received by the mode information decoding device 4 is 0. This code value is obtained by the coding unit 12 in the mode information coding device 1. Since c = 0, the decoding unit 42 obtains d = 1 according to the equation (6). On the other hand, the sort unit 41 outputs a sequence of target modes arranged in descending order of frequency value. In the illustrated example, X ₁ = 4, X ₂ = 1, X ₃ = 2, X ₄ = 0, X ₅ = 4. This sequence is based on the assumption that _{y 1} = 3, y ₂ = 4 as the reference mode value. _{Then, the decoding unit 42 obtains X 1} = 4, which is the target mode value when d = 1, in light of the sequence of numbers output from the sort unit 41. That is, the target mode value x = 4 is obtained as the decoding result.

以上、モード情報符号化装置１、エントロピー符号化部２、エントロピー復号部３、モード情報復号装置４の機能および処理例等について説明した。なお、モード情報符号化装置１側のヒストグラム集計部１０と、モード情報復号装置４側のヒストグラム集計部４０とは、何らかの手段で互いに同期しながらヒストグラムのデータを更新する。言い換えれば、ソート部１１および符号化部１２による一連の処理と、ソート部４１および復号部４２による一連の処理とは、同一のヒストグラムデータに基づいて行われる。したがって、ヒストグラム集計部１０とヒストグラム集計部４０は、共通の認識に基づく所定のタイミングで、ヒストグラムのデータを所定の値にリセットする。一例として、ＧｏＰ（group of pictures）の区切りのタイミングで、ヒストグラムのデータをリセットする。 The functions and processing examples of the mode information coding device 1, the entropy coding unit 2, the entropy decoding unit 3, the mode information decoding device 4, and the like have been described above. The histogram aggregation unit 10 on the mode information coding device 1 side and the histogram aggregation unit 40 on the mode information decoding device 4 side update the histogram data while synchronizing with each other by some means. In other words, the series of processing by the sorting unit 11 and the coding unit 12 and the series of processing by the sorting unit 41 and the decoding unit 42 are performed based on the same histogram data. Therefore, the histogram aggregation unit 10 and the histogram aggregation unit 40 reset the histogram data to a predetermined value at a predetermined timing based on the common recognition. As an example, the histogram data is reset at the timing of the GoP (group of pictures) delimiter.

また、例えば、ヒストグラム集計部１０とヒストグラム集計部４０は、ヒストグラムのデータを初期値Ｈ^（０）にリセットする。前述の通り、ヒストグラムの初期値Ｈ^（０）は、すべて頻度値０としたものでもよく、また、所定の確率分布に基づき予め定めた非零値を用いたものでもよい。例えば、予めさまざまな映像において計測あるいはモデル化した頻度値の典型的な分布に従って、初期値Ｈ^（０）を定めておくようにする。 Further, for example, the histogram totaling unit 10 and the histogram totaling unit 40 reset the ^{histogram data to the initial value H (0).} As described above, the initial value H ⁽⁰⁾ of the histogram may be a frequency value of 0, or may use a non-zero value predetermined based on a predetermined probability distribution. ^{For example, the initial value H (0)} is set according to a typical distribution of frequency values measured or modeled in various images in advance.

なお、リセットする頻度が相対的に低いと、モード情報符号化装置１およびモード情報復号装置４は、充分な量の画像から得られる学習用対象領域のモード情報と学習用参照領域のモード情報を反映したヒストグラムを構築できる。つまり、学習効果をより多く得られる可能性が高い。逆に、リセットする頻度が相対的に高いと、モード情報符号化装置１側とモード情報復号装置４側とでヒストグラムデータが一致しない状態から処理を開始した場合にも、比較的早期に両者のヒストグラムを一致させて、モード情報を伝達できるようになる。なお、モード情報符号化装置１側とモード情報復号装置４側とでヒストグラムデータが一致しない状態は、例えば、モード情報復号装置４側で（映像等の）コンテンツの途中から復号を開始する場合などに生じ得る。 When the frequency of resetting is relatively low, the mode information coding device 1 and the mode information decoding device 4 obtain the mode information of the learning target area and the mode information of the learning reference area obtained from a sufficient amount of images. You can build a reflected histogram. That is, there is a high possibility that more learning effects can be obtained. On the contrary, if the frequency of resetting is relatively high, even if the processing is started from the state where the histogram data does not match between the mode information coding device 1 side and the mode information decoding device 4 side, both of them are performed relatively early. The histograms can be matched to convey mode information. The state in which the histogram data does not match between the mode information coding device 1 side and the mode information decoding device 4 side is, for example, when the mode information decoding device 4 side starts decoding from the middle of the content (such as a video). Can occur in.

また、ヒストグラムＨ^（ｎ）に含まれる頻度値が所定の状態になったタイミングで、モード情報符号化装置１側とモード情報復号装置４側とでヒストグラムをリセットするようにしてもよい。一例として、ヒストグラムＨ^（ｎ）に含まれる頻度値（非負値）のうちの最大値が所定の閾値を超えたときに、モード情報符号化装置１側とモード情報復号装置４側とでヒストグラムをリセットするようにしてもよい。これにより、各頻度値のメモリの領域のサイズが固定（例えば、１０進数表現における最大桁数固定、あるいは２進数表現におけるビット数固定など）の場合に、頻度値のオーバーフローを防止することができる。この場合、初期化方法の一例として、各頻度値を所定の定数（例えば、２、４など）で除算する（除算の結果の小数部分を切り捨てるか、切り上げるか、予め定める）ようにしてもよい。 Further, the histogram may be reset on the mode information coding device 1 side and the mode information decoding device 4 side at the timing when the frequency value included ^{in the histogram H (n) reaches a predetermined state.} As an example, when ^{the maximum value of the frequency values (non-negative values) included in the histogram H (n)} exceeds a predetermined threshold value, a histogram is created between the mode information encoding device 1 side and the mode information decoding device 4 side. You may want to reset it. This makes it possible to prevent the frequency value from overflowing when the size of the memory area of each frequency value is fixed (for example, the maximum number of digits in the decimal representation is fixed, or the number of bits in the binary representation is fixed). .. In this case, as an example of the initialization method, each frequency value may be divided by a predetermined constant (for example, 2, 4, etc.) (the fractional part of the division result is rounded down or rounded up, or determined in advance). ..

なお、上述した実施形態におけるモード情報符号化装置１、エントロピー符号化部２、エントロピー復号部３、およびモード情報復号装置４の各装置の、少なくとも一部の機能をコンピューターで実現するようにしても良い。その場合、この機能を実現するためのプログラムをコンピューター読み取り可能な記録媒体に記録して、この記録媒体に記録されたプログラムをコンピューターシステムに読み込ませ、実行することによって実現しても良い。なお、ここでいう「コンピューターシステム」とは、ＯＳや周辺機器等のハードウェアを含むものとする。また、「コンピューター読み取り可能な記録媒体」とは、フレキシブルディスク、光磁気ディスク、ＲＯＭ、ＣＤ−ＲＯＭ、ＤＶＤ−ＲＯＭ、ＵＳＢメモリ等の可搬媒体、コンピューターシステムに内蔵されるハードディスク等の記憶装置のことをいう。さらに「コンピューター読み取り可能な記録媒体」とは、インターネット等のネットワークや電話回線等の通信回線を介してプログラムを送信する場合の通信線のように、短時間の間、動的にプログラムを保持するもの、その場合のサーバーやクライアントとなるコンピューターシステム内部の揮発性メモリのように、一定時間プログラムを保持しているものも含んでも良い。また上記プログラムは、前述した機能の一部を実現するためのものであっても良く、さらに前述した機能をコンピューターシステムにすでに記録されているプログラムとの組み合わせで実現できるものであっても良い。 Even if at least a part of the functions of the mode information coding device 1, the entropy coding unit 2, the entropy decoding unit 3, and the mode information decoding device 4 in the above-described embodiment are realized by a computer. good. In that case, the program for realizing this function may be recorded on a computer-readable recording medium, and the program recorded on the recording medium may be read by the computer system and executed. The term "computer system" as used herein includes hardware such as an OS and peripheral devices. The "computer-readable recording medium" is a portable medium such as a flexible disk, a magneto-optical disk, a ROM, a CD-ROM, a DVD-ROM, or a USB memory, or a storage device such as a hard disk built in a computer system. Say that. Further, a "computer-readable recording medium" is a communication line for transmitting a program via a network such as the Internet or a communication line such as a telephone line, and dynamically holds the program for a short period of time. It may also include a program that holds a program for a certain period of time, such as a volatile memory inside a computer system that serves as a server or a client in that case. Further, the above-mentioned program may be a program for realizing a part of the above-mentioned functions, and may be a program for realizing the above-mentioned functions in combination with a program already recorded in the computer system.

以上、説明した実施形態のモード情報符号化装置によれば、対象ブロックよりも前に符号化済みの画像領域のモード情報に基づいて、モード値の空間的な相関が学習される。学習に用いた画像領域と対象ブロックを含む画像領域とで画像の統計的性質が類似するという前提に基づき、学習の結果を用いることで、出現頻度が高いモードほど小さな符号値（ｃ）にほとんどの場合で変換することができる。前記の統計的性質が類似するほど、出現頻度が高いモード値を小さな符号値に変換することが成功する。変換の結果得られる符号語の出現頻度は、小さな値の符号語ほど高頻度となる。小さな値にエネルギー集中した結果、符号語のエントロピーも減少し、モード符号化の符号化効率を改善することができる。 According to the mode information coding apparatus of the embodiment described above, the spatial correlation of the mode values is learned based on the mode information of the image region encoded before the target block. Based on the premise that the statistical properties of the image are similar between the image area used for learning and the image area including the target block, by using the learning results, the higher the frequency of appearance, the smaller the code value (c). Can be converted in the case of. The more similar the above statistical properties are, the more successful it is to convert a mode value with a high frequency of occurrence to a small code value. The frequency of appearance of codewords obtained as a result of conversion is higher for codewords with smaller values. As a result of concentrating energy on a small value, the entropy of the codeword is also reduced, and the coding efficiency of mode coding can be improved.

また、実施形態のモード情報復号装置によれば、符号化されたモード情報の復号に必要な出現頻度の順位情報を、モード情報符号化装置から受信せずとも、モード情報復号装置側ですでに復号済みのモード情報から前記順位情報を得ることができる。これにより、符号化効率を低下させるオーバヘッドなしでモード情報の復号が可能となる。 Further, according to the mode information decoding device of the embodiment, the mode information decoding device has already received the order information of the appearance frequency required for decoding the encoded mode information from the mode information decoding device. The ranking information can be obtained from the decoded mode information. This makes it possible to decode the mode information without an overhead that lowers the coding efficiency.

以上、実施形態を説明したが、本発明はさらに次のような変形例でも実施することが可能である。
［第１変形例］
上記の実施形態では、頻度値に基づく順位（順序）ｄと符号値ｃとの関係を、式（５）および式（６）で表すように定めた。しかしながら、必ずしもこれらの式にしたがって順序（順位）ｄと符号値ｃとの関係を決める必要はなく、順位ｄと符号値ｃとを他の対応関係としてもよい。ただし、順位ｄに基づき符号値ｃが一意に定まるように、且つ、符号値ｃに基づき順位ｄが一意に定まるように対応関係を定める。 Although the embodiments have been described above, the present invention can be further implemented in the following modifications.
[First modification]
In the above embodiment, the relationship between the rank (order) d based on the frequency value and the code value c is defined to be represented by the equations (5) and (6). However, it is not always necessary to determine the relationship between the order (rank) d and the code value c according to these equations, and the order d and the code value c may be used as other correspondence relationships. However, the correspondence is determined so that the code value c is uniquely determined based on the order d and the order d is uniquely determined based on the code value c.

一例として、順位ｄと符号値ｃとの関係を、下記の式により定める。なお、ここで、ｄ_ｍａｘは、順位ｄの最大値である。この式により、順位ｄと符号値ｃとを相互に変換可能である。
ｃ＝ｄ_ｍａｘ−ｄ As an example, the relationship between the rank d and the code value c is determined by the following formula. Here, d _max is the maximum value of the rank d. By this equation, the rank d and the code value c can be converted to each other.
c = d _max −d

また、頻度値の順位の階層ごとに、符号値ｃの決め方を変えてもよい。一例としてモードの種類が３５種類である場合、順位ｄ（１≦ｄ≦３５）に応じて下記のように階層化する。
第１層（１≦ｄ≦８）の場合、前記の式（５）によって符号値ｃを決定する。
第２層（９≦ｄ≦１６）の場合、順位ｄとは独立の別の所定ルールにしたがって、符号値ｃ（ただし、８≦ｃ≦１５）を、第２層に属するモード値候補の各々に付与する。
第３層（１７≦ｄ≦２４）の場合、順位ｄとは独立の別の所定ルールにしたがって、符号値ｃ（ただし、１６≦ｃ≦２３）を、第３層に属するモード値候補の各々に付与する。
第４層（２５≦ｄ≦３５）の場合、順位ｄとは独立の別の所定ルールにしたがって、符号値ｃ（ただし、２４≦ｃ≦３４）を、第４層に属するモード値候補の各々に付与する。
この場合、モード情報符号化装置１から出力される符号値ｃは、モード値候補の頻度値についての全順序を表さないが、上位（上記の例では第１層）の順序と、下位（上記の例では第２層から第４層まで）の半順序とを表す。つまり、この例による場合の符号値ｃは、モード情報のエントロピーを減少させる。
なお、モード情報復号装置４側では、上記の順序付けに対応する順序付けを行う。 Further, the method of determining the code value c may be changed for each hierarchy of the frequency value ranking. As an example, when there are 35 types of modes, the layers are layered as follows according to the order d (1 ≦ d ≦ 35).
In the case of the first layer (1 ≦ d ≦ 8), the code value c is determined by the above equation (5).
In the case of the second layer (9 ≦ d ≦ 16), the code value c (however, 8 ≦ c ≦ 15) is set to each of the mode value candidates belonging to the second layer according to another predetermined rule independent of the rank d. Give to.
In the case of the third layer (17 ≦ d ≦ 24), the code value c (however, 16 ≦ c ≦ 23) is set to each of the mode value candidates belonging to the third layer according to another predetermined rule independent of the rank d. Give to.
In the case of the fourth layer (25 ≦ d ≦ 35), the code value c (however, 24 ≦ c ≦ 34) is set to each of the mode value candidates belonging to the fourth layer according to another predetermined rule independent of the rank d. Give to.
In this case, the code value c output from the mode information encoding device 1 does not represent the total order of the frequency values of the mode value candidates, but the order of the upper order (first layer in the above example) and the lower order (first layer in the above example). In the above example, it represents a half order (from the second layer to the fourth layer). That is, the code value c in the case of this example reduces the entropy of the mode information.
The mode information decoding device 4 side performs the ordering corresponding to the above ordering.

つまり、モード情報符号化装置１側の符号化部１２は、前記対象領域のモード情報を、ソート部１１が求めた順序の少なくとも一部に基づく符号として符号化する。
また、モード情報復号装置４側の復号部４２は、符号化部１２に対応する方法を用いて、対象領域のモード情報を復号する。 That is, the coding unit 12 on the mode information coding device 1 side encodes the mode information of the target region as a code based on at least a part of the order obtained by the sorting unit 11.
Further, the decoding unit 42 on the mode information decoding device 4 side decodes the mode information of the target region by using the method corresponding to the coding unit 12.

なお、第１変形例において例示した符号値ｃの決め方に限らず、順位ｄと符号値ｃの関係は任意である。 The relationship between the rank d and the code value c is arbitrary, not limited to the method of determining the code value c illustrated in the first modification.

［第２変形例］
上記の実施形態では、モード情報符号化装置１側のソート部１１およびモード情報復号装置４側のソート部４１は、それぞれ、次の通り頻度値のソート処理を行った。即ち、ソート部１１およびソート部４１は、その時点における符号化または復号の対象である対象領域に対応付けられた参照領域の参照モード値列（ｙ_１，ｙ_２，・・・，ｙ_Ｋ）に関する、全モード値候補Ｘの頻度値を降順にソートした。一方、ここで説明する変形例においては、ソート部１１およびソート部４１は、他の方法でモード値候補Ｘを順序付ける。 [Second modification]
In the above embodiment, the sort unit 11 on the mode information coding device 1 side and the sort unit 41 on the mode information decoding device 4 side each perform the sorting process of the frequency value as follows. That is, the sort unit 11 and the sort unit 41 are reference mode value strings (y ₁ , y ₂ , ..., Y _K ) of the reference area associated with the target area to be encoded or decoded at that time. The frequency values of all mode value candidates X were sorted in descending order. On the other hand, in the modification described here, the sort unit 11 and the sort unit 41 order the mode value candidates X by other methods.

例えば、ソート部１１およびソート部４１は、頻度値の昇順にモード値候補Ｘをソートする。このように頻度値の昇順にソートした場合も、モード値Ｘの頻度の順序の情報を同様に得ることができ、モード情報を効率的に符号化することが可能である。
また、例えば、ソート部１１およびソート部４１は、全てのモード値候補Ｘに関して頻度値順（降順または昇順）のソートを行う代わりに、所定範囲に属するモード値候補Ｘに関してのみ頻度値順のソート処理を行う。一例として、ソート部１１およびソート部４１は、頻度値が所定の閾値よりも高いモード値候補Ｘについてのみソートを行い、他のモード値候補についてはソート処理を行わない。あるいは別の例として、ソート部１１およびソート部４１は、頻度値の上位Ｎ件（Ｎは正整数）のモード値候補Ｘについてのみソートを行い、他のモード値候補についてはソート処理を行わない。これらの場合、ソート部１１およびソート部４１は、ソート処理の対象とならないモード値候補Ｘについては、頻度値順以外の方法による順序を与える。これらの場合にも、頻度値が比較的上位のモード値候補Ｘについて、モード情報のエントロピーを減少させることが可能である。
なお、ここに示したいずれの場合も、ソート部１１およびソート部４１は、共通の処理を行うことによってモード値候補Ｘの順序付けを行う。 For example, the sort unit 11 and the sort unit 41 sort the mode value candidate X in ascending order of the frequency value. Even when the frequency values are sorted in ascending order in this way, the information on the frequency order of the mode value X can be obtained in the same manner, and the mode information can be efficiently encoded.
Further, for example, the sort unit 11 and the sort unit 41 sort in order of frequency value (descending or ascending order) for all mode value candidates X, but sort in order of frequency value only for mode value candidate X belonging to a predetermined range. Perform processing. As an example, the sort unit 11 and the sort unit 41 sort only the mode value candidate X whose frequency value is higher than a predetermined threshold value, and do not perform the sort process for the other mode value candidates. Alternatively, as another example, the sort unit 11 and the sort unit 41 sort only the mode value candidate X of the upper N frequency values (N is a positive integer), and do not perform the sort process for the other mode value candidates. .. In these cases, the sort unit 11 and the sort unit 41 give an order other than the frequency value order to the mode value candidate X that is not the target of the sort process. In these cases as well, it is possible to reduce the entropy of the mode information for the mode value candidate X having a relatively high frequency value.
In any of the cases shown here, the sort unit 11 and the sort unit 41 perform common processing to order the mode value candidates X.

つまり、ソート部１１およびソート部４１は、符号化対象の領域である対象領域に対応する参照領域のモード情報に関して前記ヒストグラムデータが保持する出現頻度の少なくとも一部に基づいて、前記対象領域のモード情報を順序化する。 That is, the sort unit 11 and the sort unit 41 use the mode of the target area based on at least a part of the appearance frequency held by the histogram data with respect to the mode information of the reference area corresponding to the target area which is the region to be encoded. Order the information.

以上、この発明の実施形態および変形例について図面を参照して詳述してきたが、具体的な構成はこの実施形態等に限られるものではなく、この発明の要旨を逸脱しない範囲の設計等も含まれる。 Although the embodiments and modifications of the present invention have been described in detail with reference to the drawings, the specific configuration is not limited to the embodiments and the like, and the design and the like within a range not deviating from the gist of the present invention are also included. included.

本発明は、静止画や動画を符号化して、記録媒体に記録したり、伝送媒体で伝送したりするためなどに利用可能である。例えば、放送事業や、通信事業や、コンテンツ配信事業や、コンテンツをパッケージングして販売する事業等で利用可能である。また、それらの技術を応用した各種事業において利用可能である。また、それらの符号化あるいは服後のための機器あるいはプログラムを製造したり、販売したりする事業において利用可能である。なお、本発明の利用可能性は、ここに例示列挙した事業（産業）に限定されない。 INDUSTRIAL APPLICABILITY The present invention can be used for encoding a still image or a moving image and recording it on a recording medium or transmitting it on a transmission medium. For example, it can be used in a broadcasting business, a communication business, a content distribution business, a business of packaging and selling content, and the like. It can also be used in various businesses that apply these technologies. It can also be used in the business of manufacturing and selling equipment or programs for their coding or after-wear. The availability of the present invention is not limited to the businesses (industries) exemplified here.

１モード情報符号化装置
２エントロピー符号化部
３エントロピー復号部
４モード情報復号装置
５学習用対象領域および学習用参照領域
６参照領域
７学習用対象領域および学習用参照領域
８参照領域
１０ヒストグラム集計部（頻度情報生成部）
１１ソート部（順序化部）
１２符号化部
２１，２２対象領域
４０ヒストグラム集計部（頻度情報生成部）
４１ソート部（順序化部）
４２復号部
５０ヒストグラム（Ｈ^{（ｎ−１）}）
５１学習用対象モード値および学習用参照モード値
５２頻度値（Ｈ^{（ｎ−１）}（１，２，３））
５３ヒストグラム（Ｈ^（ｎ））
５４頻度値（Ｈ^（ｎ）（１，２，３））
５５対象領域
５６参照領域
５７射影（参照モード値ｙ_１＝３且つｙ_２＝４の場合に対応する）
５８頻度値のソート結果
５９頻度値が等しいビンを表す等号
６０対象モード値
６１対象領域
６２対象モード値 1 Mode information coding device 2 Entropy coding unit 3 Entropy decoding unit 4 Mode information decoding device 5 Learning target area and learning reference area 6 Reference area 7 Learning target area and learning reference area 8 Reference area 10 histogram aggregation unit (Frequency information generator)
11 Sorting section (ordering section)
12 Coding unit 21 and 22 Target area 40 Histogram aggregation unit (frequency information generation unit)
41 Sorting section (ordering section)
42 Decoding unit 50 Histogram (H ^(n-1) )
51 Target mode value for learning and reference mode value for learning 52 Mode value (H ^(n-1) (1, 2, 3))
53 Histogram (H ⁽ⁿ⁾ )
54 Frequency value (H ⁽ⁿ⁾ (1, 2, 3))
55 Target area 56 Reference area 57 Projection (corresponds to the case where the _{reference mode values y 1} = 3 and y _{2 = 4)}
58 Mode value sort result 59 Equal sign representing bins with equal frequency value 60 Target mode value 61 Target area 62 Target mode value

Claims

Frequency of appearance of a set of mode information of a learning target area in a coded area which is a coded area and mode information of a learning reference area corresponding to the learning target area in the coded area. Frequency information generator that generates frequency information that represents
An ordering unit that orders the mode information of the target area based on a part of the appearance frequency held by the frequency information with respect to the mode information of the reference area corresponding to the target area which is the target area to be encoded.
A coding unit that encodes the mode information of the target region as a code based on at least a part of the order obtained by the ordering unit.
Equipped with
The ordering unit determines an order value in the order of the appearance frequency based on the frequency information, and divides the range of the order of the appearance frequency into a plurality of layers.
The coding unit assigns a code corresponding to the hierarchy to the mode information of the target area by offsetting a predetermined value to the sequence value with respect to the hierarchy to which the sequence value 1 belongs, and the sequence of the hierarchy to which the sequence value 1 does not belong. from said sequence value for each a shall be given a code in accordance with independent another predetermined rule mode information of the target region,
The learning reference area includes K learning reference blocks (where K is an integer of 2 or more).
The frequency information is represented as a (K + 1) dimensional matrix including the dimension of the mode information of the target area for learning and the dimension of the mode information for each reference block for learning.
The ordering unit orders the mode information of the target area by performing sorting processing according to the occurrence frequency value corresponding to the mode information for each learning reference block based on the (K + 1) dimensional matrix. do,
A mode information encoding device characterized in that.

Frequency representing the appearance frequency of a set of the mode information of the learning target area in the already decoded area, which is the decoded area, and the mode information of the learning reference area corresponding to the learning target area in the already decoded area. Frequency to generate information Information generation unit and
An ordering unit that orders the mode information of the target area based on a part of the appearance frequency held by the frequency information with respect to the mode information of the reference area corresponding to the target area that is the area to be decoded.
A decoding unit that decodes specific mode information selected from the ordered mode information of the target area based on the input code value, and
Equipped with
The ordering unit determines an order value in the order of the appearance frequency based on the frequency information, and divides the range of the order of the appearance frequency into a plurality of layers.
The decoding unit decodes the mode information of the target area corresponding to the order value obtained by offsetting a predetermined value to the code value with respect to the layer to which the order value 1 belongs, and the layer to which the order value 1 does not belong. are those for respectively for decoding the mode information of the target region corresponding to the code values according to independent another predetermined rule and the order value,
The learning reference area includes K learning reference blocks (where K is an integer of 2 or more).
The frequency information is represented as a (K + 1) dimensional matrix including the dimension of the mode information of the target area for learning and the dimension of the mode information for each reference block for learning.
The ordering unit orders the mode information of the target area by performing sorting processing according to the occurrence frequency value corresponding to the mode information for each learning reference block based on the (K + 1) dimensional matrix. do,
A mode information decoding device characterized by the above.

Computer,
The mode information encoding device according to claim 1.
A program to function as.

Computer,
The mode information decoding device according to claim 2.
A program to function as.