JP2018182393A

JP2018182393A - Video encoding apparatus

Info

Publication number: JP2018182393A
Application number: JP2017074619A
Authority: JP
Inventors: 忍工藤; Shinobu Kudo; 正樹北原; Masaki Kitahara; 清水　淳; Atsushi Shimizu; 淳清水
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 2017-04-04
Filing date: 2017-04-04
Publication date: 2018-11-15
Anticipated expiration: 2037-04-04
Also published as: JP6767299B2

Abstract

PROBLEM TO BE SOLVED: To obtain an encoding structure having high encoding efficiency using a low operation amount.SOLUTION: A video encoding apparatus generates a reference group including predetermined three or more continuous frames from a frame group acquired from input video information, calculates an evaluation value associated with an error between frames on the basis of an initial frame and a last frame of the reference group generated, and any one determination target frame between the initial frame and the last frame, determines whether or not to couple a reference relationship between a determination target frame, and the initial frame and the last frame on the basis of a relative value of other evaluation value using a reference of an evaluation value between the determination target frame and the initial frame among evaluation values calculated, or on the basis of the relative value and the evaluation values between the determination target frame and both of the initial frame and the last frame, and selects a picture type of the frame included in the frame group on the basis of the determination result.SELECTED DRAWING: Figure 2

Description

本発明は、映像符号化装置に関する。 The present invention relates to a video coding apparatus.

映像符号化の標準規格として、Ｈ．２６４／ＡＶＣ(Advanced Video Coding)やＨ．２６５／ＨＥＶＣ(High Efficiency Video Coding)と呼ばれる映像符号化方式が策定されている。ＨＥＶＣでは、イントラピクチャであるＩ(Intra)ピクチャと、インターピクチャであるＰ(Predictive)ピクチャと、Ｂ(Bi-directional predictive)ピクチャという複数のピクチャタイプが定義されている。各フレームには、いずれかのピクチャタイプが割り当てられる。 As a standard for video coding, H.3. H.264 / AVC (Advanced Video Coding) and H.264. A video coding scheme called H.265 / HEVC (High Efficiency Video Coding) has been formulated. In HEVC, a plurality of picture types are defined: I (Intra) picture which is an intra picture, P (Predictive) picture which is an inter picture, and B (Bi-directional predictive) picture. Each frame is assigned one of the picture types.

Ｉピクチャは、通常、数フレームごとに定められるピクチャタイプであり、符号化対象フレーム内の符号化済み画素から予測するイントラ予測モードのみが適用されるピクチャタイプである。Ｐピクチャ及びＢピクチャは、イントラ予測モードか、または、符号化済みフレームから予測するインター予測モードが選択的に適用されるピクチャタイプである。Ｐピクチャは、時系列の過去のフレームを参照フレームとするのに対して、Ｂピクチャは、過去と未来のフレームを参照フレームとする。 The I picture is usually a picture type defined every several frames, and is a picture type to which only the intra prediction mode predicted from encoded pixels in the encoding target frame is applied. P picture and B picture are picture types to which an intra prediction mode or an inter prediction mode of predicting from a coded frame is selectively applied. P pictures use past frames in time series as reference frames, while B pictures use past and future frames as reference frames.

Ｉピクチャから次のＩピクチャまでのフレーム群をＧＯＰ（Group Of Picture）といい、ＧＯＰごとに、Ｉピクチャの間隔や参照構造等の符号化構造を自由に設定することができる。代表的な符号化構造として、図１７に示す低遅延符号化の符号化構造と、図１８に示すランダムアクセス符号化の符号化構造があり、一般に、ランダムアクセス符号化の方が、符号化効率が高いと言われている。 A group of frames from I picture to the next I picture is called GOP (Group Of Picture), and coding structures such as I picture intervals and reference structures can be freely set for each GOP. As a typical coding structure, there are a coding structure for low delay coding shown in FIG. 17 and a coding structure for random access coding shown in FIG. 18, and in general, random access coding is better in coding efficiency. Is said to be high.

符号化構造を定める際、ＩピクチャまたはＰピクチャ間隔を符号化する映像に応じて適応的に定めることで、符号化効率を向上させることができる。例えば、図１９に示すような、映像の途中でシーンチェンジがある映像フレームのシーケンスに対して、固定されたＩピクチャまたはＰピクチャ間隔よりもシーンチェンジに合わせて適応的にＩピクチャまたはＰピクチャとなるフレームを選択していくことで符号化効率が向上する。 In determining the coding structure, coding efficiency can be improved by adaptively setting the I picture or P picture interval according to the video to be coded. For example, as shown in FIG. 19, for a sequence of video frames having a scene change in the middle of the video, the I picture or P picture is adaptively adapted to the scene change rather than the fixed I picture or P picture interval. The coding efficiency is improved by selecting the frame.

また、その他にも、図２０や図２１に示すような参照構造を階層的に設定する等、適応的に参照構造を変化させることで符号化効率の向上を図ることができる。例えば、図２０に示す適応構造１と図２１に示す適応構造２とを比較した場合、図２１に示す適応構造２の方が、最大階層が２階層になっていることから、参照フレーム間の距離が小さくなっており、符号化効率の向上が期待される。 In addition, coding efficiency can be improved by adaptively changing the reference structure, such as hierarchically setting reference structures as shown in FIG. 20 and FIG. For example, when adaptive structure 1 shown in FIG. 20 is compared with adaptive structure 2 shown in FIG. 21, since adaptive structure 2 shown in FIG. Since the distance is smaller, improvement in coding efficiency is expected.

更に高い符号化効率を有する符号化構造を選択する手法も提案されている（例えば、非特許文献１及び２参照）。以下に、提案されている代表的な２つの技術について説明する。非特許文献１に示す技術では、符号化構造は、以下の手順にしたがって定められる。原画像を、縦横それぞれ１／２サイズに縮小する（ステップＳｘ１）。最初のフレームをＩピクチャとして符号化し、基準フレームとする（ステップＳｘ２）。Ｎ枚先までの各フレームに対して、Ｐピクチャとした場合のコストを算出し、算出したコストが最小となったフレームをＰピクチャとする。ここで、コストとは、フレームの各符号化ブロックをイントラ予測モード及びインター予測モードにより符号化した際の小さい方の符号化コストの全ブロックの総和である（ステップＳｘ３）。 A method of selecting a coding structure having higher coding efficiency has also been proposed (see, for example, Non-Patent Documents 1 and 2). The following describes two representative proposed techniques. In the technique shown in Non-Patent Document 1, the coding structure is determined according to the following procedure. The original image is reduced to half size in the vertical and horizontal directions (step Sx1). The first frame is encoded as an I picture, and is set as a reference frame (step Sx2). For each frame up to N frames ahead, the cost for the case of P picture is calculated, and the frame for which the calculated cost is minimum is taken as a P picture. Here, the cost is the sum of all the blocks of the smaller encoding cost when encoding each encoding block of the frame according to the intra prediction mode and the inter prediction mode (step Sx3).

基準フレームと、ステップＳｘ３でＰピクチャとして選択したフレームとの間に、それらを参照する中間フレームを定義する。基準フレームと中間フレームとの間に位置するフレームに対して、これらのフレームを基準フレームと中間フレームとを参照するＢピクチャとした場合のコストを算出する。また、中間フレームとＰピクチャとの間に位置するフレームに対して、これらのフレームを中間フレームとＰピクチャとを参照するＢピクチャとした場合のコストを算出する。算出したコストが最小となる組み合わせを最終的な符号化構造として選択する（ステップＳｘ４）。ステップＳｘ３でＰピクチャとしたフレームを基準フレームとし、ステップＳｘ３及びＳｘ４を繰り返し実行して適応的に符号化構造を定めていく。 Between the reference frame and the frame selected as the P picture in step Sx3, an intermediate frame referencing them is defined. For a frame located between the reference frame and the intermediate frame, the cost is calculated in the case where these frames are B pictures that reference the reference frame and the intermediate frame. In addition, with respect to a frame located between the intermediate frame and the P picture, the cost is calculated in the case where these frames are B pictures that refer to the intermediate frame and the P picture. A combination that minimizes the calculated cost is selected as a final coding structure (step Sx4). Assuming that the frame used as the P picture in step Sx3 is a reference frame, steps Sx3 and Sx4 are repeatedly executed to adaptively determine the coding structure.

非特許文献２に記載の技術では、符号化構造は、以下の手順にしたがって定められる。１枚目のフレームをＩピクチャとして符号化する（ステップＳｙ１）。Ｉピクチャと次のフレームとの間で動き探索を行い、各符号化ブロックの動きベクトルをＳ_１（ｋ）（ただし、１≦ｋ≦Ｎ_ｂｌｋ：Ｎ_ｂｌｋは、フレーム内のブロック数）とする（ステップＳｙ２）。 In the technique described in Non-Patent Document 2, the coding structure is determined according to the following procedure. The first frame is encoded as an I picture (step Sy1). Motion search is performed between the I picture and the next frame, and the motion vector of each coding block is S ₁ (k) (where 1 ≦ k ≦ N _blk : N _blk is the number of blocks in the frame) (Step Sy2).

ｎ枚目（ｎ≧３）の各フレームに対して１枚目のフレームとの間で動き探索を行い、各符号化ブロックの動きベクトルをフレーム間距離で割ったものをＳ_ｎ（ｋ）とする（ステップＳｙ３）。動きベクトル差の平均（Ｅ＝Σ（Ｓ_１−Ｓ_ｎ）／Ｎ_ｂｌｋ）を算出する（ステップＳｙ４）。予め定められた閾値をｔｈとし、Ｅ＜ｔｈであれば、ｎ枚目のフレームをＢピクチャとし、そうでなければ、Ｐピクチャとする（ステップＳｙ５）。 A motion search is performed between the nth frame (n ≧ 3) and the first frame, and the motion vector of each coding block divided by the interframe distance is S _n (k) To do (step Sy3). The average of the motion vector difference (E = S (S ₁ −S _n ) / N _blk ) is calculated (step Sy 4). A predetermined threshold value is th, and if E <th, the nth frame is B picture, otherwise it is P picture (step Sy 5).

“VideoLAN, a project and a non-profit organization”、[online]、VideoLAN Organization、［平成２９年３月１７日検索］、インターネット（URL:https://www.videolan.org/developers/x265.html）“VideoLAN, a project and a non-profit organization”, [online], VideoLAN Organization, [search on March 17, 2017], Internet (URL: https://www.videolan.org/developers/x265.html ) Adriana Dumitras and Barry G. Haskell,“I/P/B FRAME TYPE DECISION BY COLLINEARITY OF DISPLACEMENTS”, 2004 International Conference on Image Processing(ICIP), p2769-p2772Adriana Dumitras and Barry G. Haskell, “I / P / B FRAME TYPE DECISION BY COLLINEARITY OF DISPLACEMENTS”, 2004 International Conference on Image Processing (ICIP), p2769-p2772

しかしながら、上記の手順に示すように、非特許文献１に記載の技術では、縮小画像を用いているとはいえ、実際の符号化に近い処理を複数の組み合わせに対して行っている。また、計算結果が無駄になる組み合わせも生じており、結果的に、演算量が多い手法になってしまっているという問題がある。また、非特許文献１に記載の技術では、Ｂピクチャの階層として２階層を割り当てることを前提としており、この技術に対して、符号化効率の良いより深い階層構造を適用しようとすると、組み合わせ数が増大してしまうという問題がある。 However, as shown in the above-described procedure, in the technique described in Non-Patent Document 1, although the reduced image is used, processing close to actual coding is performed on a plurality of combinations. In addition, there are combinations in which calculation results are wasted, and as a result, there is a problem that the method has a large amount of calculation. Further, in the technology described in Non-Patent Document 1, it is premised that two layers are allocated as the hierarchy of B pictures, and if it is intended to apply a deeper hierarchical structure with good coding efficiency to this technology, the number of combinations is There is a problem that the

非特許文献２に記載の技術においても、判定の際の計算結果が無駄になる組み合わせが存在するため、演算量が多いという問題がある。また、非特許文献２に記載の技術では、Ｂピクチャの階層として１階層分しか適用できないという問題がある。また、非特許文献２に記載の技術では、動きベクトルのみを判定に用いているため、複雑な映像に対して精度良く判定することができず、符号化効率が悪いという問題がある。 Even in the technique described in Non-Patent Document 2, there is a combination in which the calculation result in the determination is wasted, so there is a problem that the amount of calculation is large. Moreover, in the technique described in Non-Patent Document 2, there is a problem that only one layer can be applied as the layer of the B picture. Further, in the technique described in Non-Patent Document 2, only the motion vector is used for determination, so it is not possible to accurately determine a complex video, and there is a problem that the coding efficiency is poor.

上記事情に鑑み、本発明は、符号化効率の高い符号化構造を低演算量で求めることができる技術の提供を目的としている。 In view of the above circumstances, the present invention aims to provide a technique capable of obtaining a coding structure with high coding efficiency with a small amount of calculation.

本発明の一態様は、フレーム間予測符号化を行う映像符号化装置であって、入力映像情報から取得されるフレーム群から、予め定められる３枚以上の枚数の連続するフレームを含む参照グループを生成する参照グループ生成部と、前記参照グループ生成部が生成する前記参照グループの最初と最後の前記フレームと、前記最初と最後のフレームの間のいずれか１つの判定対象フレームとに基づいて、前記フレーム間の誤差に関する評価値を算出する評価値算出部と、前記評価値算出部が算出する前記評価値のうち前記判定対象フレームと前記最初のフレームとの間の前記評価値を基準とした他の前記評価値の相対値に基づいて、または前記相対値と前記判定対象フレームと前記最初と最後のフレーム双方との間の前記評価値とに基づいて、前記判定対象フレームと前記最初と最後のフレームとの参照関係を結合するか否かを判定する評価値判定部と、前記評価値判定部の判定結果に基づいて、前記フレーム群に含まれる前記フレームのピクチャタイプを選定するピクチャタイプ選定部と、を備える映像符号化装置である。 One aspect of the present invention is a video encoding apparatus for performing inter-frame predictive coding, which includes, from a frame group acquired from input video information, a reference group including three or more consecutive frames determined in advance. The reference group generation unit to generate, the first and last frames of the reference group generated by the reference group generation unit, and any one determination target frame between the first and last frames, Among the evaluation values calculated by the evaluation value calculation unit, an evaluation value calculation unit that calculates an evaluation value regarding an error between the frames, and the evaluation value between the determination target frame and the first frame among the evaluation values calculated Prior to the evaluation value based on the relative value of the evaluation value of the frame, or based on the relative value and the evaluation value between the determination target frame and both the first and last frames. An evaluation value determination unit that determines whether or not the reference relationship between the determination target frame and the first and last frames is combined, and the determination result of the evaluation value determination unit based on the determination results of the frames And a picture type selection unit that selects a picture type.

本発明の一態様は、上記の映像符号化装置であって、前記評価値算出部は、前記判定対象フレームと前記最初のフレームとの間の前記フレーム間の誤差に関する第１の片方向評価値と、前記判定対象フレームと前記最後のフレームとの間の前記フレーム間の誤差に関する第２の片方向評価値と、前記判定対象フレームと前記最初と最後のフレーム双方との間の前記フレーム間の誤差に関する双方向評価値とを算出し、前記評価値判定部は、前記第２の片方向評価値と前記双方向評価値のうち小さい方の評価値を選択し、選択した前記評価値の前記第１の片方向評価値に対する相対的な値の大きさとして前記相対値を算出し、算出した前記相対値と、予め定められる相対評価用閾値とに基づいて、前記判定対象フレームと前記最初と最後のフレームとの参照関係を結合するか否かを判定する。 One embodiment of the present invention is the video encoding device described above, wherein the evaluation value calculation unit is configured to calculate a first one-way evaluation value regarding an error between the frames between the determination target frame and the first frame. And a second one-way evaluation value regarding the inter-frame error between the determination target frame and the last frame, the inter-frame between the determination target frame and both the first and last frames. The evaluation value determination unit selects a smaller one of the second one-way evaluation value and the two-way evaluation value, and calculates the evaluation value of the selected evaluation value. The relative value is calculated as a relative value with respect to the first one-way evaluation value, and the determination target frame and the first are determined based on the calculated relative value and a predetermined relative evaluation threshold value. Last fray It determines whether to combine the reference relationship with the arm.

本発明の一態様は、上記の映像符号化装置であって、前記予め定められる相対評価用閾値は、第１の相対評価用閾値と、前記第１の相対評価用閾値よりも値の大きい第２の相対評価用閾値を含んでおり、前記評価値判定部は、前記相対値が、前記第１の相対評価用閾値未満であるか否か基づいて、前記判定対象フレームと前記最初と最後のフレームとの参照関係を結合するか否かを判定し、前記相対値が、前記第１の相対評価用閾値未満でなく、前記判定対象フレームの参照フレームを前記最初と最後のフレームにしないと判定する場合、前記相対値が、前記第１の相対評価用閾値と前記第２の相対評価用閾値とにより定められる範囲の値である場合、前記双方向評価値と、予め定められる絶対評価用閾値とに基づいて、前記判定対象フレームと前記最初と最後のフレームとの参照関係を結合するか否かを判定する。 One embodiment of the present invention is the above-described video encoding device, wherein the predetermined relative evaluation threshold is larger than a first relative evaluation threshold and a first relative evaluation threshold. And the evaluation value determination unit determines whether the relative value is less than the first relative evaluation threshold, the determination target frame and the first and last frames. It is determined whether or not the reference relationship with the frame is combined, and it is determined that the relative value is not less than the first relative evaluation threshold and the reference frame of the determination target frame is not the first and last frames. In the case where the relative value is a value within a range defined by the first relative evaluation threshold and the second relative evaluation threshold, the bidirectional evaluation value and the predetermined absolute evaluation threshold And the judgment target frame And it determines whether to combine the reference relationship between the first and last frame.

本発明の一態様は、上記の映像符号化装置であって、前記参照グループ生成部は、前記評価値判定部により結合すると判定された前記参照グループの最初の前記フレーム、または最後の前記フレームが前記判定対象フレームとなるように、当該参照グループを含み前記予め定められる３枚以上の枚数よりも多い枚数の連続するフレームを含む新たな前記参照グループを生成し、前記評価値算出部は、前記新たな参照グループの最初と最後の前記フレームと、前記判定対象フレームとに基づいて、前記フレーム間の誤差に関する前記評価値を算出し、前記評価値判定部は、前記評価値算出部が算出する前記評価値のうち前記新たな参照グループの前記判定対象フレームと前記最初のフレームとの間の前記評価値を基準とした他の前記評価値の相対値、または前記相対値と、前記新たな参照グループの前記判定対象フレームと前記最初と最後のフレーム双方との間の前記評価値とに基づいて、前記判定対象フレームと前記最初と最後のフレームとの参照関係を結合するか否かを判定する。 One embodiment of the present invention is the above-described video encoding apparatus, wherein the reference group generation unit determines whether the first frame or the last frame of the reference group determined to be combined by the evaluation value determination unit is A new reference group including the reference group and a number of consecutive frames including the reference group and greater than the predetermined number of three or more is generated so as to be the determination target frame, and the evaluation value calculation unit The evaluation value regarding the error between the frames is calculated based on the first and last frames of the new reference group and the determination target frame, and the evaluation value determination unit calculates the evaluation value calculation unit. Among the evaluation values, the phases of other evaluation values based on the evaluation value between the frame to be judged of the new reference group and the first frame The determination target frame and the first and last frames based on the value or the relative value and the evaluation value between the determination target frame of the new reference group and both the first and last frames It is determined whether or not to combine the reference relationships of.

本発明の一態様は、上記の映像符号化装置であって、前記参照グループ生成部は、前記参照グループに隣接する他の参照グループを生成する際、当該参照グループの最後の前記フレームが、前記隣接する他の参照グループの最初の前記フレームとなるように重複させて前記予め定められる３枚以上の枚数の連続するフレームを含む前記参照グループを生成しており、隣接する２つの前記参照グループの双方において前記評価値判定部が前記判定対象フレームと前記最初と最後のフレームとの参照関係を結合すると判定する場合、当該隣接する２つの前記参照グループを結合して前記新たな参照グループを生成し、重複する前記フレームを前記判定対象フレームとする。 One embodiment of the present invention is the video encoding device described above, wherein when the reference group generation unit generates another reference group adjacent to the reference group, the last frame of the reference group is the Generating the reference group including the predetermined number of consecutive frames of the predetermined number or more of the predetermined number of overlapping frames so as to become the first frame of another adjacent reference group; When the evaluation value determination unit determines that the reference relationship between the frame to be determined and the first and last frames is to be combined in both, the adjacent two reference groups are combined to generate the new reference group. The overlapping frames are taken as the determination target frames.

本発明の一態様は、上記の映像符号化装置であって、前記参照グループ生成部は、
予め定めらえる構築する符号化構造の階層数の値にしたがって、前記新たな参照グループを生成することを繰り返す。 One embodiment of the present invention is the video encoding device described above, wherein the reference group generation unit
The generation of the new reference group is repeated according to the value of the number of layers of the coding structure to be defined in advance.

本発明の一態様は、上記の映像符号化装置であって、前記ピクチャタイプ選定部は、前記入力映像情報から前記フレーム群を取得し、取得した前記フレーム群の最初のフレームをＩピクチャとして選定するＩピクチャ選定部を備え、前記ピクチャタイプ選定部は、前記結合判定部が、前記判定対象フレームと前記最初と最後のフレームとの参照関係を結合すると判定する場合、前記判定対象フレームを前記最初と最後のフレームを参照フレームとするＢピクチャとして選定し、前記Ｉピクチャ、または前記Ｂピクチャとして選定されていない前記フレームを前記Ｐピクチャとして選定し、前記フレーム群の表示順において、選定した前記フレームに最も近い過去の前記Ｐピクチャの前記フレームを、選定した前記フレームの前記参照フレームとし、選定した前記フレームに最も近い過去の前記Ｐピクチャの前記フレームがない場合、前記Ｉピクチャの前記フレームを、選定した前記フレームの前記参照フレームとする。 One embodiment of the present invention is the video encoding device described above, wherein the picture type selection unit acquires the frame group from the input video information, and selects the first frame of the acquired frame group as an I picture. The picture type selection unit, when it is determined that the combination determination unit combines the reference relationship between the determination target frame and the first and last frames, the determination target frame is the first And the last frame as a reference frame, and the frames not selected as the I picture or the B picture are selected as the P picture, and the selected frames in the display order of the frame group The frame of the previous P picture closest to the And, if there is no the frame of the most recent past of the P-picture to the frame selected, the frame of the I picture, and the reference frame of the frame that was selected.

本発明の一態様は、上記の映像符号化装置であって、前記予め定められる３枚以上の枚数は、３枚である。 One embodiment of the present invention is the above-described video encoding apparatus, wherein the predetermined number of three or more is three.

本発明により、符号化効率の高い符号化構造を低演算量で求めることが可能となる。 According to the present invention, it is possible to obtain a coding structure with high coding efficiency with a small amount of calculation.

本発明の第１の実施形態における映像符号化装置の構成を示すブロック図である。It is a block diagram which shows the structure of the video coding apparatus in the 1st Embodiment of this invention. 同実施形態の符号化構造構築部の構成を示すブロック図である。It is a block diagram which shows the structure of the encoding structure construction part of the embodiment. 同実施形態の符号化構造構築処理の流れを示すフローチャートである。It is a flow chart which shows a flow of coding structure construction processing of the embodiment. 同実施形態の結合判定処理の流れを示すフローチャートである。It is a flowchart which shows the flow of the joint determination process of the embodiment. 同実施形態の符号化構造構築処理の具体例を示す図（その１）である。It is a figure (the 1) which shows the specific example of the encoding structure construction process of the embodiment. 同実施形態の符号化構造構築処理の具体例を示す図（その２）である。It is a figure (the 2) which shows the specific example of the encoding structure construction process of the embodiment. 同実施形態の符号化構造構築処理の具体例を示す図（その３）である。It is a figure (the 3) which shows the specific example of the encoding structure construction process of the embodiment. 同実施形態の符号化構造構築処理の具体例を示す図（その４）である。It is a figure (the 4) which shows the specific example of the encoding structure construction process of the embodiment. 同実施形態の結合判定処理の具体例を示す図である。It is a figure which shows the specific example of the coupling determination process of the embodiment. 同実施形態の結合判定処理の具体例において算出した評価値を示す図である。It is a figure which shows the evaluation value calculated in the specific example of the joint determination process of the embodiment. 本発明の第２の実施形態の符号化構造構築部の構成を示すブロック図である。It is a block diagram which shows the structure of the encoding structure construction part of the 2nd Embodiment of this invention. 同実施形態の符号化構造構築処理の流れを示すフローチャート（その１）である。It is a flowchart (the 1) which shows the flow of the encoding structure construction process of the embodiment. 同実施形態の符号化構造構築処理の流れを示すフローチャート（その２）である。It is a flowchart (the 2) which shows the flow of the encoding structure construction process of the embodiment. 同実施形態の符号化構造構築処理の具体例を示す図（その１）である。It is a figure (the 1) which shows the specific example of the encoding structure construction process of the embodiment. 同実施形態の符号化構造構築処理の具体例を示す図（その２）である。It is a figure (the 2) which shows the specific example of the encoding structure construction process of the embodiment. 同実施形態の符号化構造構築処理の具体例を示す図（その３）である。It is a figure (the 3) which shows the specific example of the encoding structure construction process of the embodiment. インター予測符号化における低遅延符号化の符号化構造を示す図である。It is a figure which shows the coding structure of the low delay coding in inter prediction coding. インター予測符号化におけるランダムアクセス符号化の符号化構造を示す図である。It is a figure which shows the coding structure of the random access coding in inter prediction coding. インター予測符号化における固定構造と適応構造の違いを示す図である。It is a figure which shows the difference of the fixed structure and the adaptive structure in inter prediction coding. インター予測符号化における適応構造を説明するための図（その１）である。It is a figure (the 1) for demonstrating the adaptive structure in inter prediction coding. インター予測符号化における適応構造を説明するための図（その２）である。It is a figure (the 2) for demonstrating the adaptive structure in inter prediction coding.

（第１の実施形態）
以下、本発明の実施形態について図面を参照して説明する。図１は、本発明の第１の実施形態における映像符号化装置１の構成を示すブロック図である。映像符号化装置１は、符号化構造構築部１０、減算器１１、直交変換・量子化部１２、可変長符号化部１３、逆量子化・逆直交変換部１４、加算器１５、イントラ予測部１６、ループフィルタ部１７、復号ピクチャ記憶部１８、インター予測部１９、及びイントラ・インター切替スイッチ部２０を備える。 First Embodiment
Hereinafter, embodiments of the present invention will be described with reference to the drawings. FIG. 1 is a block diagram showing the configuration of a video encoding device 1 according to the first embodiment of the present invention. The video coding apparatus 1 includes a coding structure construction unit 10, a subtractor 11, an orthogonal transformation / quantization unit 12, a variable length encoding unit 13, an inverse quantization / inverse orthogonal transformation unit 14, an adder 15, an intra prediction unit And 16, a loop filter unit 17, a decoded picture storage unit 18, an inter prediction unit 19, and an intra / inter switching unit 20.

符号化構造構築部１０は、外部から入力映像情報を取り込み、取り込んだ入力映像情報から予め定められるＧＯＰのサイズにしたがう枚数のフレームを選択してフレーム群を生成する。また、符号化構造構築部１０は、生成したフレーム群の各フレームのピクチャタイプ、参照構造などを選定して符号化構造を構築する。また、符号化構造構築部１０は、構築した符号化構造が示す符号化順序にしたがってＣＵ（Coding Unit）ブロック単位で符号化対象フレームを出力する。 The encoding structure construction unit 10 fetches input video information from the outside, selects frames of the number according to the size of a predetermined GOP from the fetched input video information, and generates a frame group. Further, the coding structure construction unit 10 selects a picture type of each frame of the generated frame group, a reference structure, and the like, and constructs a coding structure. Also, the coding structure construction unit 10 outputs a coding target frame in units of CU (Coding Unit) blocks in accordance with the coding order indicated by the constructed coding structure.

減算器１１は、符号化構造構築部１０が出力する符号化対象フレームの画像情報と、イントラ・インター切替スイッチ部２０を介してイントラ予測部１６、またはインター予測部１９が出力する予測画像情報との差を算出する。また、減算器１１は、算出した差に基づく差分画像情報を直交変換・量子化部１２に出力する。 The subtractor 11 includes the image information of the encoding target frame output from the encoding structure construction unit 10, and the predicted image information output from the intra prediction unit 16 or the inter prediction unit 19 via the intra / inter switching unit 20. Calculate the difference of Further, the subtractor 11 outputs difference image information based on the calculated difference to the orthogonal transformation / quantization unit 12.

直交変換・量子化部１２は、減算器１１が出力する差分画像情報に対して、直交変換と量子化を行い、可変長符号化部１３と逆量子化・逆直交変換部１４に出力する。可変長符号化部１３は、直交変換・量子化部１２が出力する量子化係数を可変長符号化して符号化データを生成し、生成した符号化データを外部に出力する。 The orthogonal transformation / quantization unit 12 performs orthogonal transformation and quantization on the difference image information output from the subtractor 11, and outputs the result to the variable-length coding unit 13 and the inverse quantization / inverse orthogonal transformation unit 14. The variable-length coding unit 13 performs variable-length coding on the quantization coefficient output from the orthogonal transformation / quantization unit 12 to generate coded data, and outputs the generated coded data to the outside.

逆量子化・逆直交変換部１４は、直交変換・量子化部１２が出力する量子化係数に対して、逆量子化と逆直交変換を行って差分画像情報を復号して加算器１５に出力する。加算器１５は、逆量子化・逆直交変換部１４が出力する復号差分画像情報と、イントラ・インター切替スイッチ部２０を介してイントラ予測部１６、またはインター予測部１９が出力する予測画像情報との和を算出する。また、加算器１５は、算出した和に基づく復号画像情報をイントラ予測部１６とループフィルタ部１７に出力する。 The inverse quantization / inverse orthogonal transformation unit 14 performs inverse quantization and inverse orthogonal transformation on the quantization coefficient output from the orthogonal transformation / quantization unit 12 to decode difference image information, and outputs the decoded image information to the adder 15 Do. The adder 15 outputs the decoded difference image information output from the inverse quantization / inverse orthogonal transform unit 14 and the predicted image information output from the intra prediction unit 16 or the inter prediction unit 19 via the intra / inter switching unit 20. Calculate the sum of The adder 15 also outputs decoded image information based on the calculated sum to the intra prediction unit 16 and the loop filter unit 17.

イントラ予測部１６は、加算器１５が出力する復号画像情報を参照フレームとして、符号化対象フレームに含まれる符号化対象ブロックのイントラ予測画像情報を生成する。 The intra prediction unit 16 generates intra prediction image information of the encoding target block included in the encoding target frame, using the decoded image information output from the adder 15 as a reference frame.

ループフィルタ部１７は、加算器１５が出力する復号画像情報にループフィルタを適用し、ループフィルタを適用した復号画像情報を復号ピクチャ記憶部１８に書き込んで記憶させる。インター予測部１９は、復号ピクチャ記憶部１８に記憶されている復号画像情報を参照フレームとして、符号化構造構築部１０が出力する符号化対象フレームに含まれる符号化対象ブロックのインター予測画像情報を生成する。 The loop filter unit 17 applies a loop filter to the decoded image information output from the adder 15, and writes and stores the decoded image information to which the loop filter is applied in the decoded picture storage unit 18. The inter prediction unit 19 uses the decoded image information stored in the decoded picture storage unit 18 as a reference frame, and inter prediction image information of the coding target block included in the coding target frame output from the coding structure construction unit 10 Generate

イントラ・インター切替スイッチ部２０は、符号化対象ブロックの予測モードに応じて、イントラ予測部１６とインター予測部１９とを切り替えて、イントラ予測部１６が出力するイントラ予測画像情報、またはインター予測部１９が出力するインター予測画像情報を減算器１１と加算器１５に出力する。 The intra / inter switching unit 20 switches between the intra prediction unit 16 and the inter prediction unit 19 according to the prediction mode of the encoding target block, and outputs the intra prediction image information output by the intra prediction unit 16 or the inter prediction unit The inter-prediction image information output by 19 is output to the subtractor 11 and the adder 15.

上記の構成により、映像符号化装置１は、外部から取り込んだ入力映像情報からフレーム群を生成し、生成したフレーム群の符号化対象フレームごとにラスタースキャンしてＣＵブロックを生成する。映像符号化装置１は、生成した各ＣＵブロックを符号化対象ブロックとしてラスタースキャン順に繰り返し符号化を行い、符号化対象ブロックごとの符号化データを出力する。映像符号化装置１は、この処理を入力映像情報から生成する全てのフレーム群に含まれるフレームに対して繰り返し行うことにより、符号化対象入力映像情報を符号化する。 With the above-described configuration, the video encoding device 1 generates a frame group from input video information captured from the outside, and performs raster scan for each encoding target frame of the generated frame group to generate a CU block. The video encoding device 1 repeatedly encodes each of the generated CU blocks as an encoding target block in raster scan order, and outputs encoded data for each encoding target block. The video encoding device 1 encodes the encoding target input video information by repeatedly performing this process on frames included in all frame groups generated from the input video information.

符号化の過程において、インター予測部１９でＰピクチャのフレームの符号化を行うためには、Ｐピクチャのフレームに対する参照フレームが予め復号ピクチャ記憶部１８に記憶されている必要がある。また、Ｂピクチャのフレームの符号化を行うためには、Ｂピクチャのフレームに対する２つの参照フレームが予め復号ピクチャ記憶部１８に記憶されている必要がある。符号化構造構築部１０は、これらの符号化の順序の要求を満たすような符号化構造の構築を行う。 In the process of encoding, in order to encode the P picture frame in the inter prediction unit 19, it is necessary to store in advance in the decoded picture storage unit 18 a reference frame for the P picture frame. In addition, in order to encode a B picture frame, two reference frames for the B picture frame need to be stored in the decoded picture storage unit 18 in advance. The coding structure construction unit 10 constructs a coding structure that satisfies the requirements of the order of coding.

図２は、符号化構造構築部１０の構成を示すブロック図である。符号化構造構築部１０は、符号化対象フレーム選択部１０１、Ｉピクチャ選定部１０２、参照グループ生成部１０３、結合判定部１０４、ピクチャタイプ選定部１０５、ピクチャタイプ記憶部１０６、及びブロック出力部１０７を備える。 FIG. 2 is a block diagram showing the configuration of the coding structure construction unit 10. As shown in FIG. The coding structure construction unit 10 includes a coding target frame selection unit 101, an I picture selection unit 102, a reference group generation unit 103, a combination determination unit 104, a picture type selection unit 105, a picture type storage unit 106, and a block output unit 107. Equipped with

符号化対象フレーム選択部１０１は、入力映像情報から予め定められるＧＯＰのサイズにしたがう枚数のフレームを選択してフレーム群を生成してＩピクチャ選定部１０２とブロック出力部１０７に出力する。Ｉピクチャ選定部１０２は、フレーム群の最初のフレームをＩピクチャとして選定し、最初のフレームを起点フレームとしてフレーム群を参照グループ生成部１０３に出力する。 The encoding target frame selection unit 101 selects the number of frames according to the predetermined GOP size from the input video information, generates a frame group, and outputs the frame group to the I picture selection unit 102 and the block output unit 107. The I picture selection unit 102 selects the first frame of the frame group as the I picture, and outputs the frame group to the reference group generation unit 103 with the first frame as the start frame.

参照グループ生成部１０３は、フレーム群の先頭のフレームを起点として所定の枚数（例えば、３枚）の連続するフレームを選択し、選択した所定の枚数の連続するフレームの組み合わせを最初の参照グループとして生成する。また、参照グループ生成部１０３は、その後に起点となるフレームから所定の枚数の連続するフレームを選択し、選択した所定の枚数の連続するフレームの組み合わせを参照グループとして生成する。また、参照グループ生成部１０３は、隣接する２つの参照グループが共に結合判定部１０４によって結合すると判定されている場合、隣接する２つの参照グループを結合して新たな参照グループを生成する。 The reference group generation unit 103 selects a predetermined number (for example, three) of consecutive frames starting from the first frame of the frame group, and sets the selected combination of the predetermined number of consecutive frames as the first reference group. Generate Further, the reference group generation unit 103 selects a predetermined number of continuous frames from the frame as the starting point thereafter, and generates a combination of the selected predetermined number of continuous frames as a reference group. Further, when it is determined that the two adjacent reference groups are to be combined by the combination determination unit 104, the reference group generation unit 103 combines the two adjacent reference groups to generate a new reference group.

また、参照グループ生成部１０３は、生成した参照グループの情報、すなわち参照グループに含まれるフレームの画像情報、及び参照グループに含まれるフレームのフレームをフレーム群において特定するフレーム番号等の情報を結合判定部１０４とピクチャタイプ選定部１０５に出力する。 In addition, the reference group generation unit 103 performs combined determination of information of the generated reference group, that is, image information of frames included in the reference group, and information such as frame numbers identifying frames of frames included in the reference group Output to the unit 104 and the picture type selection unit 105.

結合判定部１０４は、参照グループ生成部１０３が生成して出力する参照グループに含まれるフレームを結合するか否かを判定する。ここで、参照グループに含まれるフレームを結合するとは、最初と最後のフレームの間のフレームである判定対象フレームを、最初と最後のフレームを参照フレームとするＢピクチャとすることである。結合判定部１０４は、評価値算出部１１０、評価値判定部１１１、及び判定結果記憶部１１２を備える。 The combination determination unit 104 determines whether or not to combine frames included in the reference group generated and output by the reference group generation unit 103. Here, combining the frames included in the reference group means making the determination target frame, which is a frame between the first and last frames, a B picture in which the first and last frames are reference frames. The combination determination unit 104 includes an evaluation value calculation unit 110, an evaluation value determination unit 111, and a determination result storage unit 112.

評価値算出部１１０は、参照グループ生成部１０３が生成した参照グループの評価値を算出する。ここで、評価値は、以下のようにして算出される３つのインターコストの値である。なお、インターコストは、例えば、フレーム間において動き探索を行った場合の誤差に関する評価値であり、例えば、絶対誤差和の最小値等が適用され、この評価値は、ＲＤ(Rate-Distortion)コストのような実際に符号化を行わないと得られない符号化コストとは異なる値である。ここで、絶対誤差和の最小値とは、動きベクトルを複数検出して、動きベクトルごとの絶対誤差和を算出して得られた値の中で最も小さい値を表す。 The evaluation value calculation unit 110 calculates the evaluation value of the reference group generated by the reference group generation unit 103. Here, the evaluation value is a value of three inter costs calculated as follows. The inter cost is, for example, an evaluation value regarding an error in the case of performing motion search between frames, and for example, the minimum value of the sum of absolute errors is applied, and this evaluation value is RD (Rate-Distortion) cost The coding cost is different from the coding cost which can not be obtained unless coding is actually performed. Here, the minimum value of the absolute error sum represents the smallest value among the values obtained by detecting a plurality of motion vectors and calculating the absolute error sum for each motion vector.

まず、最初のフレームと、最初と最後のフレームの間の判定対象フレームと、最後のフレームをそれぞれ、Ｓフレーム、Ｍフレーム、Ｅフレームとする。参照グループが、所定の枚数（例えば、３枚）の連続するフレームを含む場合、判定対象フレームは、中間のフレームとなり、後述するように２つの隣接する参照グループが結合される場合、判定対象フレームは、表示順で１つ前の参照グループの最後のフレームとなる。 First, the first frame, the determination target frame between the first and last frames, and the last frame are respectively S frame, M frame, and E frame. When the reference group includes a predetermined number (for example, three) of consecutive frames, the determination target frame is an intermediate frame, and when two adjacent reference groups are combined as described later, the determination target frame Is the last frame of the previous reference group in display order.

Ｍフレームに対して予め定められる所定のブロックサイズ単位でＳフレームを参照フレームとした場合の片方向インターコストの全ブロックの総和（以下、Ｌ０コストという）を算出する。次に、Ｍフレームに対して予め定められる所定のブロックサイズ単位でＥフレームを参照フレームとした場合の片方向インターコストの全ブロックの総和（以下、Ｌ１コストという）を算出する。更に、Ｍフレームに対して予め定められる所定のブロックサイズ単位でＳフレームとＥフレームを参照フレームとした場合の双方向インターコストの全ブロックの総和（以下、Ｂｉコストという）を算出する。 The sum (hereinafter, referred to as the L0 cost) of all the blocks in the unidirectional inter cost when the S frame is used as the reference frame in a predetermined block size unit predetermined for the M frame is calculated. Next, the sum (hereinafter, referred to as L1 cost) of all the blocks of the unidirectional inter cost when the E frame is used as the reference frame in a predetermined block size unit predetermined for M frames is calculated. Furthermore, the sum (hereinafter referred to as the Bi cost) of all blocks of the bidirectional inter cost when the S frame and the E frame are used as reference frames in predetermined block size units predetermined for M frames is calculated.

評価値判定部１１１は、評価値算出部１１０が算出したＬ０コスト、Ｌ１コスト、Ｂｉコストに基づいて、参照グループのフレームを結合すべきか否か、すなわちＭフレームを、ＳフレームとＥフレームを参照フレームとするＢピクチャとするか否かを判定する結合判定処理を行う。評価値判定部１１１は、判定結果を参照グループを示す情報に対応付けて判定結果記憶部１１２に書き込んで記憶させ、また、判定結果をピクチャタイプ選定部１０５に出力する。判定結果記憶部１１２は、参照グループごとに評価値判定部１１１が判定した判定結果を記憶する。 The evaluation value determination unit 111 determines whether to combine the frames of the reference group based on the L0 cost, the L1 cost, and the Bi cost calculated by the evaluation value calculation unit 110, that is, refers to the M frame and the S frame and the E frame. A combination determination process is performed to determine whether to use a B picture as a frame. The evaluation value determination unit 111 writes the determination result in the determination result storage unit 112 in association with the information indicating the reference group, and outputs the determination result to the picture type selection unit 105. The determination result storage unit 112 stores the determination result determined by the evaluation value determination unit 111 for each reference group.

ピクチャタイプ選定部１０５は、結合判定部１０４が出力する判定結果にしたがって、参照グループ生成部１０３が生成した参照グループに含まれるフレームのピクチャタイプを選定する。また、ピクチャタイプ選定部１０５は、参照グループ生成部１０３が、連続する３枚のフレームを選択できないと判定した場合、残りのフレームをＰピクチャとして選定する。また、ピクチャタイプ選定部１０５は、全てのフレームについて、ピクチャタイプを選定した後、Ｐピクチャとして選定したフレームに対して、当該フレームに表示順で最も近い過去のＰピクチャとして選定されたフレームを当該フレームの参照フレームとして選定する。 The picture type selection unit 105 selects a picture type of a frame included in the reference group generated by the reference group generation unit 103 according to the determination result output from the connection determination unit 104. Further, when the reference group generation unit 103 determines that the three consecutive frames can not be selected, the picture type selection unit 105 selects the remaining frames as P pictures. In addition, after selecting the picture types for all frames, the picture type selection unit 105 selects the frame selected as the P picture and the frame selected as the past P picture closest to the frame in the display order Select as the frame reference frame.

ここで、表示順で最も近い過去のＰピクチャとは、Ｉピクチャとなる最初のフレームから表示順に連続番号を付与していった場合、参照フレームを割り当てるフレームよりも若い番号で最も近いＰピクチャのことである。また、表示順で最も近い過去のＰピクチャがない場合、ピクチャタイプ選定部１０５は、Ｉピクチャを参照フレームとして選定する。 Here, the closest past P picture in display order is the closest P picture with a number smaller than the frame to which the reference frame is assigned, in the case where a sequential number is assigned in display order from the first frame to be I picture. It is. Also, if there is no closest past P picture in display order, the picture type selection unit 105 selects an I picture as a reference frame.

ピクチャタイプ記憶部１０６は、Ｉピクチャ選定部１０２がＩピクチャとして選定したフレームを示す情報に対応付けてＩピクチャのピクチャタイプを示す情報を記憶する。また、ピクチャタイプ記憶部１０６は、ピクチャタイプ選定部１０５が選定したＢピクチャまたはＰピクチャのピクチャタイプを示す情報を、参照フレームを示す情報とともに、各フレームを示す情報に対応付けて記憶する。 The picture type storage unit 106 stores the information indicating the picture type of the I picture in association with the information indicating the frame selected as the I picture by the I picture selecting unit 102. Further, the picture type storage unit 106 stores information indicating the picture type of the B picture or P picture selected by the picture type selection unit 105 in association with the information indicating each frame together with the information indicating the reference frame.

ブロック出力部１０７は、符号化対象フレーム選択部１０１が出力するフレーム群を受けて、一時的に内部の記憶領域に書き込んで記憶する。また、ブロック出力部１０７は、ピクチャタイプの選定が完了し符号化構造が定められると、定められた符号化構造に基づく順番で内部の記憶領域からフレームを読み出し、読み出したフレームの各々をラスタースキャンの順にＣＵブロックに分解して符号化対象ブロックとして出力する。 The block output unit 107 receives the frame group output from the encoding target frame selection unit 101, temporarily writes it in the internal storage area, and stores it. Further, when the selection of the picture type is completed and the coding structure is determined, the block output unit 107 reads the frames from the internal storage area in the order based on the defined coding structure, and raster scans each of the read frames It decomposes | disassembles into CU block in order of, and it outputs as an encoding object block.

（第１の実施形態の符号化構造構築処理）
次に、第１の実施形態の符号化構造構築部１０による処理について説明する。図３は、符号化構造構築部１０による符号化構造構築処理の流れを示すフローチャートである。また、図４は、符号化構造構築処理において呼び出される結合判定部１０４による結合判定処理のサブルーチンの流れを示すフローチャートである。 (Coding structure construction processing of the first embodiment)
Next, processing by the coding structure construction unit 10 of the first embodiment will be described. FIG. 3 is a flowchart showing a flow of coding structure construction processing by the coding structure construction unit 10. Further, FIG. 4 is a flowchart showing a flow of a subroutine of connection determination processing by the connection determination unit 104 called in the coding structure construction processing.

符号化対象フレーム選択部１０１は、入力映像情報から予め定められるＧＯＰのサイズにしたがう枚数のフレームを選択してフレーム群を生成し、生成したフレーム群をＩピクチャ選定部１０２とブロック出力部１０７に出力する。Ｉピクチャ選定部１０２は、フレーム群の最初のフレームのピクチャタイプをＩピクチャとして選定し、当該フレームをＩピクチャとして選定したことを示す情報をピクチャタイプ記憶部１０６に書き込んで記憶させる。また、Ｉピクチャ選定部１０２は、最初のフレームを起点フレームとし、フレーム群を参照グループ生成部１０３に出力する（ステップＳ１０１）。 The encoding target frame selection unit 101 selects a number of frames according to a predetermined GOP size from input video information to generate a frame group, and generates the generated frame group to the I picture selection unit 102 and the block output unit 107. Output. The I picture selection unit 102 selects the picture type of the first frame of the frame group as an I picture, and writes and stores information indicating that the frame is selected as an I picture in the picture type storage unit 106. Further, the I picture selection unit 102 sets the first frame as the start frame, and outputs the frame group to the reference group generation unit 103 (step S101).

参照グループ生成部１０３は、起点フレームを起点として、フレーム群から連続する３枚のフレームを選択することができるか否かを判定する（ステップＳ１０２）。参照グループ生成部１０３は、連続する３枚のフレームを選択することができると判定した場合（ステップＳ１０２−ＹＥＳ）、参照グループ生成部１０３は、連続する３枚のフレームを表示順にＡ，Ｂ，Ｃフレームとして含む参照グループを生成する。なお、当該参照グループにおいて判定対象フレームは、Ｂフレームとなる。 The reference group generation unit 103 determines whether or not three continuous frames can be selected from the frame group starting from the start point frame (step S102). If the reference group generation unit 103 determines that three consecutive frames can be selected (YES in step S102), the reference group generation unit 103 selects three consecutive frames in the display order A, B, and so on. Generate a reference group to be included as a C frame. The determination target frame in the reference group is a B frame.

参照グループ生成部１０３は、生成した参照グループを結合判定部１０４とピクチャタイプ選定部１０５に出力する（ステップＳ１０５）。結合判定部１０４は、参照グループ生成部１０３が出力する参照グループに含まれるＡ，Ｂ，ＣフレームをＳ，Ｍ，Ｅフレームとして結合判定処理を行う（ステップＳ１０６）。 The reference group generation unit 103 outputs the generated reference group to the combination determination unit 104 and the picture type selection unit 105 (step S105). The connection determination unit 104 performs connection determination processing with the A, B, and C frames included in the reference group output from the reference group generation unit 103 as S, M, and E frames (step S106).

（結合判定部による結合判定処理）
図４に示す結合判定処理のサブルーチンにおいて、評価値算出部１１０は、Ｓフレーム、Ｍフレーム、Ｅフレームに基づいて、Ｌ０コスト、Ｌ１コスト、Ｂｉコストの３つの評価値を算出する（ステップＳ２０１）。 (Coupling determination processing by the combining determination unit)
In the subroutine of the connection determination process shown in FIG. 4, the evaluation value calculation unit 110 calculates three evaluation values of L0 cost, L1 cost, and Bi cost based on the S frame, M frame, and E frame (step S201). .

評価値判定部１１１は、ＢｉコストとＬ１コストのうち値が小さい方を選択し（ステップＳ２０２）、選択した値をＬ０で除算して相対値を算出する（ステップＳ２０３）。なお、ＢｉコストとＬ１コストの値が同じ場合、いずれを選択してもよいため、いずれか一方を選択する。ステップＳ２０２とＳ２０３の処理を数式で示すと以下の式（１）となる。 The evaluation value determination unit 111 selects the smaller one of the Bi cost and the L1 cost (step S202), and divides the selected value by L0 to calculate a relative value (step S203). When the values of the Bi cost and the L1 cost are the same, either one may be selected, so either one is selected. The processes of steps S202 and S203 can be expressed by the following equation (1).

相対値＝ｍｉｎ（Ｂｉコスト，Ｌ１コスト）／Ｌ０コスト・・・（１） Relative value = min (Bi cost, L1 cost) / L0 cost (1)

評価値判定部１１１は、算出した相対値の大きさを予め定められる相対評価用閾値Ｔｈ＿ｍｉｎ及びＴｈ＿ｍａｘに基づいて判定する（ステップＳ２０４）。評価値判定部１１１は、算出した相対値が、Ｔｈ＿ｍｉｎの値未満であると判定した場合（ステップＳ２０４：＜Ｔｈ＿ｍｉｎ）、参照グループに含まれるフレームを結合することを示す「Ｔｒｕｅ」を出力する（ステップＳ２０５）。一方、評価値判定部１１１は、算出した相対値が、Ｔｈ＿ｍａｘの値を超過していると判定した場合（ステップＳ２０４：＞Ｔｈ＿ｍａｘ）、参照グループに含まれるフレームを結合しないことを示す「Ｆａｌｓｅ」を出力する（ステップＳ２０６）。 The evaluation value determination unit 111 determines the magnitude of the calculated relative value based on predetermined relative evaluation threshold values Th_min and Th_max (step S204). When it is determined that the calculated relative value is less than the value of Th_min (step S204: <Th_min), the evaluation value determination unit 111 outputs “True” indicating that the frames included in the reference group are combined (step S204: <Th_min) Step S205). On the other hand, when the evaluation value determination unit 111 determines that the calculated relative value exceeds the value of Th_max (step S 204:> Th_max), “False” indicating that frames included in the reference group are not combined. Are output (step S206).

一方、評価値判定部１１１は、算出した相対値が、Ｔｈ＿ｍｉｎの値以上で、かつＴｈｍａｘの値以下であると判定した場合（ステップＳ２０４：≧Ｔｈ＿ｍｉｎかつ≦Ｔｈ＿ｍａｘ）、Ｂｉコストの値の大きさを予め定められる絶対評価用閾値Ｔｈ＿ａｂｓに基づいて判定する（ステップＳ２０７）。評価値判定部１１１は、Ｂｉコストの値が、Ｔｈ＿ａｂｓの値未満であると判定した場合（ステップＳ２０７：＜Ｔｈ＿ａｂｓ）、参照グループに含まれるフレームを結合することを示す「Ｔｒｕｅ」を出力する（ステップＳ２０８）。一方、評価値判定部１１１は、Ｂｉコストの値が、Ｔｈ＿ａｂｓの値以上と判定した場合（ステップＳ２０７：≧Ｔｈ＿ａｂｓ）、参照グループに含まれるフレームを結合しないことを示す「Ｆａｌｓｅ」を出力する（ステップＳ２０６）。 On the other hand, when the evaluation value determination unit 111 determines that the calculated relative value is equal to or greater than the value of Th_min and equal to or less than the value of Thmax (step S204: ThTh_min and ≦ Th_max), the magnitude of the value of the Bi cost Is determined based on a predetermined absolute evaluation threshold Th_abs (step S207). When it is determined that the value of the Bi cost is less than the value of Th_abs (step S207: <Th_abs), the evaluation value determination unit 111 outputs “True” indicating that frames included in the reference group are combined (step S207: <Th_abs) Step S208). On the other hand, when it is determined that the value of Bi cost is equal to or greater than the value of Th_abs (step S207: ≧ Th_abs), the evaluation value determination unit 111 outputs “False” indicating that the frames included in the reference group are not combined ( Step S206).

評価値判定部１１１は、ステップＳ２０５，Ｓ２０６，Ｓ２０８の処理において、参照グループを示す情報に対応付けて判定結果を示す「Ｔｒｕｅ」または「Ｆａｌｓｅ」の情報を判定結果記憶部１１２に書き込んで記憶させて、結合判定処理のサブルーチンを終了する。 The evaluation value determination unit 111 writes information of “True” or “False” indicating the determination result in association with the information indicating the reference group in the process of steps S205, S206, and S208 and stores the information in the determination result storage unit 112. Then, the subroutine of connection determination processing is ended.

図３に戻り、ピクチャタイプ選定部１０５は、結合判定部１０４が出力する判定結果を判定する（ステップＳ１０７）。ピクチャタイプ選定部１０５は、判定結果が「Ｆａｌｓｅ」の場合（ステップＳ１０７−Ｆａｌｓｅ）、参照グループのＢフレームのピクチャタイプをＰピクチャとして選定し、ＢフレームをＰピクチャとして選定したことを示す情報をピクチャタイプ記憶部１０６に書き込んで記憶させる。ピクチャタイプ選定部１０５は、Ｂフレームを起点フレームとする情報を参照グループ生成部１０３に出力する（ステップＳ１０８）。参照グループ生成部１０３は、ピクチャタイプ選定部１０５が出力するＢフレームを起点フレームとする情報を受けた場合、Ｂフレームを起点フレームとしてステップＳ１０２の処理に進む。 Returning to FIG. 3, the picture type selection unit 105 determines the determination result output by the combination determination unit 104 (step S107). When the determination result is “False” (Step S107-False), the picture type selection unit 105 selects the picture type of the B frame of the reference group as a P picture and information indicating that the B frame is selected as a P picture. The picture type storage unit 106 is written and stored. The picture type selection unit 105 outputs, to the reference group generation unit 103, information in which the B frame is the start frame (step S108). When the reference group generation unit 103 receives the information that uses the B frame output from the picture type selection unit 105 as the start frame, the reference group generation unit 103 proceeds to the process of step S102 with the B frame as the start frame.

一方、ピクチャタイプ選定部１０５は、判定結果が「Ｔｒｕｅ」の場合（ステップＳ１０７−Ｔｒｕｅ）、ＢフレームのピクチャタイプをＡフレームとＣフレームを参照フレームとするＢピクチャとして選定し、ＢフレームをＢピクチャとして選定したことを示す情報、及びＡフレームとＣフレームが参照フレームであることを示す情報をピクチャタイプ記憶部１０６に書き込んで記憶させる。また、ピクチャタイプ選定部１０５は、ＣフレームのピクチャタイプをＰピクチャとして選定し、ＣフレームをＰピクチャとして選定したことを示す情報をピクチャタイプ記憶部１０６に書き込んで記憶させる。ピクチャタイプ選定部１０５は、判定結果が「Ｔｒｕｅ」の参照グループを示す情報を参照グループ生成部１０３に出力する（ステップＳ１０９）。 On the other hand, when the determination result is "True" (step S107-True), the picture type selection unit 105 selects the picture type of the B frame as the B picture with the A frame and the C frame as reference frames, Information indicating that it is selected as a picture, and information indicating that the A frame and the C frame are reference frames are written and stored in the picture type storage unit 106. Further, the picture type selection unit 105 selects the picture type of the C frame as a P picture, and writes and stores information indicating that the C frame is selected as a P picture in the picture type storage unit 106. The picture type selection unit 105 outputs the information indicating the reference group of which the determination result is “True” to the reference group generation unit 103 (step S109).

参照グループ生成部１０３は、ピクチャタイプ選定部１０５が出力する判定結果が「Ｔｒｕｅ」の参照グループを示す情報を受けた場合、結合判定部１０４の判定結果記憶部１１２を参照する。参照グループ生成部１０３は、判定結果記憶部１１２に記憶されている情報に基づいて、表示順において、処理中の参照グループの１つ前の参照グループの判定結果が、「Ｔｒｕｅ」であるか否かを判定する（ステップＳ１１０）。 The reference group generation unit 103 refers to the determination result storage unit 112 of the connection determination unit 104 when the information indicating the reference group whose determination result output by the picture type selection unit 105 is “True” is received. Based on the information stored in the determination result storage unit 112, the reference group generation unit 103 determines whether the determination result of the reference group immediately before the reference group being processed is "True" in the display order. It is determined (step S110).

参照グループ生成部１０３は、表示順において、処理中の参照グループの１つ前の参照グループの判定結果が、「Ｔｒｕｅ」でないと判定した場合（ステップＳ１１０−ＮＯ）、Ｃフレームを起点フレームとして（ステップＳ１１６）、ステップＳ１０２の処理に進む。 When the reference group generation unit 103 determines that the determination result of the reference group immediately before the reference group being processed is not “True” in display order (step S110-NO), the C frame is set as the start frame ((step S110) Step S116) The processing proceeds to step S102.

参照グループ生成部１０３は、表示順において、処理中の参照グループの１つ前の参照グループの判定結果が、「Ｔｒｕｅ」であると判定した場合（ステップＳ１１０−ＹＥＳ）、新たな参照グループを生成する。すなわち、参照グループ生成部１０３は、１つ前の参照グループの最初のフレームと最後のフレームをそれぞれ、αフレーム、βフレームとし、処理中の参照グループの最後のフレームをγフレームとし、α、β、γフレームを含む新たな参照グループを生成する。なお、βフレームは、処理中の参照グループの最初のＡフレームでもあり、当該参照グループにおける判定対象フレームとなる。参照グループ生成部１０３は、生成した新たな参照グループの情報を結合判定部１０４とピクチャタイプ選定部１０５に出力する（ステップＳ１１１）。 The reference group generation unit 103 generates a new reference group when it is determined that the determination result of the reference group immediately before the reference group being processed is "True" in the display order (step S110-YES). Do. That is, the reference group generation unit 103 sets the first frame and the last frame of the previous reference group as an α frame and a β frame, respectively, and the last frame of the reference group being processed as a γ frame. , Γ frame is generated. The β frame is also the first A frame of the reference group being processed, and is a determination target frame in the reference group. The reference group generation unit 103 outputs the generated information on the new reference group to the combination determination unit 104 and the picture type selection unit 105 (step S111).

結合判定部１０４は、参照グループ生成部１０３が出力する参照グループに含まれるα，β，γフレームに対して結合判定処理を行う。評価値算出部１１０は、α，β，γフレームのそれぞれをＳ，Ｍ，Ｅフレームとして図４に示す結合判定処理を行う（ステップＳ１１２）。 The combination determination unit 104 performs combination determination processing on the α, β, and γ frames included in the reference group output by the reference group generation unit 103. The evaluation value calculation unit 110 performs the connection determination process shown in FIG. 4 with each of the α, β and γ frames as an S, M and E frame (step S112).

ピクチャタイプ選定部１０５は、結合判定部１０４が出力する判定結果を判定する（ステップＳ１１３）。ピクチャタイプ選定部１０５は、判定結果が「Ｆａｌｓｅ」の場合（ステップＳ１１３−Ｆａｌｓｅ）、新たな参照グループの最後のフレームであるγフレーム、すなわち結合前の処理中の参照グループのＣフレームを起点フレームとすることを示す情報を参照グループ生成部１０３に出力する（ステップＳ１１５）。参照グループ生成部１０３は、ピクチャタイプ選定部１０５が出力するＣフレームを起点フレームとすることを示す情報を受けて、Ｃフレームを起点フレームとして（ステップＳ１１６）、ステップＳ１０２の処理に進む。 The picture type selection unit 105 determines the determination result output from the combination determination unit 104 (step S113). If the determination result is "False" (step S113-False), the picture type selection unit 105 starts the γ frame which is the last frame of the new reference group, that is, the C frame of the reference group in process before combining The information indicating the setting is output to the reference group generation unit 103 (step S115). The reference group generation unit 103 receives the information indicating that the C frame output from the picture type selection unit 105 is the start frame, sets the C frame as the start frame (step S116), and proceeds to the process of step S102.

一方、ピクチャタイプ選定部１０５は、判定結果が「Ｔｒｕｅ」の場合（ステップＳ１１３−Ｔｒｕｅ）、βフレームのピクチャタイプをαフレームとγフレームを参照フレームとするＢピクチャとして選定し、βフレームをＢピクチャとして選定したことを示す情報、及びαフレームとγフレームが参照フレームであることを示す情報をピクチャタイプ記憶部１０６に書き込んで記憶させる。また、ピクチャタイプ選定部１０５は、γフレームのピクチャタイプをＰピクチャとして選定し、γフレームをＰピクチャとして選定したことを示す情報をピクチャタイプ記憶部１０６に書き込んで記憶させる（ステップＳ１１４）。 On the other hand, when the determination result is "True" (step S113-True), the picture type selection unit 105 selects the picture type of the β frame as a B picture with the α frame and the γ frame as reference frames, Information indicating that it is selected as a picture, and information indicating that the α frame and the γ frame are reference frames are written and stored in the picture type storage unit 106. In addition, the picture type selection unit 105 selects the picture type of the γ frame as a P picture, and writes and stores information indicating that the γ frame is selected as a P picture in the picture type storage unit 106 (step S114).

ピクチャタイプ選定部１０５は、新たな参照グループの最後のフレームであるγフレーム、すなわち結合前の処理中の参照グループのＣフレームを起点フレームとすることを示す情報を参照グループ生成部１０３に出力する（ステップＳ１１５）。参照グループ生成部１０３は、ピクチャタイプ選定部１０５が出力するＣフレームを起点フレームとすることを示す情報を受けて、Ｃフレームを起点フレームとして（ステップＳ１１６）、ステップＳ１０２の処理に進む。 The picture type selection unit 105 outputs, to the reference group generation unit 103, information indicating that the γ frame which is the last frame of the new reference group, that is, the C frame of the reference group being processed before combining is used as the start frame. (Step S115). The reference group generation unit 103 receives the information indicating that the C frame output from the picture type selection unit 105 is the start frame, sets the C frame as the start frame (step S116), and proceeds to the process of step S102.

一方、ステップＳ１０２において、参照グループ生成部１０３は、連続する３枚のフレームを選択することができないと判定した場合（ステップＳ１０２−ＮＯ）、フレーム群の残りのフレームをＰピクチャとして選定させる指示情報をピクチャタイプ選定部１０５に出力する。 On the other hand, when it is determined in step S102 that the reference group generation unit 103 can not select three consecutive frames (step S102-NO), instruction information for selecting the remaining frames of the frame group as P pictures Is output to the picture type selection unit 105.

ピクチャタイプ選定部１０５は、参照グループ生成部１０３から当該指示情報を受けて、残りのフレームのピクチャタイプをＰピクチャとして選定し、残りのフレームをＰピクチャとして選定したことを示す情報をピクチャタイプ記憶部１０６に書き込んで記憶させる（ステップＳ１０３）。ピクチャタイプ選定部１０５は、Ｐピクチャとして選定したフレームの各々に対して、当該フレームに表示順で最も近い過去のＰピクチャのフレームを参照フレームとして選定する。表示順で最も近い過去のＰピクチャがない場合、Ｉピクチャを当該フレームの参照フレームとして選定する。ピクチャタイプ選定部１０５は、各Ｐピクチャに対して選定した参照フレームの情報をピクチャタイプ記憶部１０６に書き込んで記憶させる（ステップＳ１０４）。 The picture type selection unit 105 receives the instruction information from the reference group generation unit 103, selects the picture type of the remaining frame as a P picture, and stores information indicating that the remaining frame is selected as a P picture. The part 106 is written and stored (step S103). For each of the frames selected as a P picture, the picture type selection unit 105 selects, as a reference frame, a frame of the past P picture closest to the frame in display order. If there is no closest past P picture in display order, the I picture is selected as the reference frame of the frame. The picture type selection unit 105 writes and stores information on the reference frame selected for each P picture in the picture type storage unit 106 (step S104).

これにより、ピクチャタイプ記憶部１０６に、最終的に構築された符号化構造の情報が記憶されることになる。ブロック出力部１０７は、内部の記憶領域に一時的に符号化対象フレーム選択部１０１から受けたフレーム群を記憶しており、ピクチャタイプ記憶部１０６に記憶されている符号化構造に基づく順番で内部の記憶領域からフレームを読み出す。ブロック出力部１０７は、読み出したフレームの各々をラスタースキャンの順にＣＵブロックに分解して符号化対象ブロックとして出力し、符号化構造構築部１０は、１つのフレーム群についての処理を終了する。 As a result, the picture type storage unit 106 stores the information of the coding structure finally constructed. The block output unit 107 temporarily stores the frame group received from the encoding target frame selection unit 101 in an internal storage area, and internally stores the frames in the order based on the encoding structure stored in the picture type storage unit 106. Read the frame from the storage area of The block output unit 107 decomposes each of the read frames into CU blocks in raster scan order and outputs the CU block as an encoding target block, and the coding structure construction unit 10 ends the processing for one frame group.

（第１の実施形態の符号化構造処理の具体例）
次に、図５から図８を参照しつつ、図５（ａ）に示すフレーム群に、図３に示した第１の実施形態の符号化構造構築処理を適用した場合の例について説明する。符号化対象フレーム選択部１０１が、入力映像情報から図５（ａ）に示すフレーム群を生成する。当該フレーム群は、フレーム番号８とフレーム番号９のフレームにおいてシーンチェンジが含まれている。 (Specific Example of Encoding Structure Processing of First Embodiment)
Next, with reference to FIG. 5 to FIG. 8, an example in the case of applying the coding structure construction process of the first embodiment shown in FIG. 3 to the frame group shown in FIG. 5A will be described. The encoding target frame selection unit 101 generates a frame group shown in FIG. 5A from the input video information. The frame group includes a scene change in the frames of frame number 8 and frame number 9.

図５（ｂ）に示すように、Ｉピクチャ選定部１０２は、フレーム番号０のフレームのピクチャタイプをＩピクチャとして選定し、当該フレームをＩピクチャとして選定したことを示す情報をピクチャタイプ記憶部１０６に書き込んで記憶させる。また、Ｉピクチャ選定部１０２は、フレーム番号０のフレームを起点フレームとする（ステップＳ１０１）。 As shown in FIG. 5B, the I picture selection unit 102 selects the picture type of the frame with frame number 0 as an I picture, and information indicating that the frame is selected as an I picture is stored in the picture type storage unit 106. Write to and memorize. Also, the I picture selection unit 102 sets the frame of frame number 0 as the start frame (step S101).

参照グループ生成部１０３は、フレーム番号０のフレームを起点フレームとして、フレーム番号０，１，２の３枚の連続するフレームを選択する（ステップＳ１０２−ＹＥＳ）。参照グループ生成部１０３は、フレーム番号０，１，２のフレームをフレームＡ，Ｂ，Ｃとして含む参照グループ３０１を生成し、生成した参照グループ３０１の情報を結合判定部１０４とピクチャタイプ選定部１０５に出力する（ステップＳ１０５）。なお、参照グループ３０１において判定対象フレームは、フレーム番号１のフレームとなる。 The reference group generation unit 103 selects three consecutive frames of frame numbers 0, 1 and 2 with the frame of frame number 0 as the start frame (step S102-YES). The reference group generation unit 103 generates a reference group 301 including the frames of frame numbers 0, 1 and 2 as frames A, B and C, and the information of the generated reference group 301 is combined determination unit 104 and picture type selection unit 105. Output (step S105). The determination target frame in the reference group 301 is the frame of frame number 1.

結合判定部１０４は、参照グループ３０１に対して図４に示す結合判定処理を行い、ここでは、シーンチェンジがないため、評価値判定部１１１は、「Ｔｒｕｅ」を出力したとする（ステップＳ１０６）。ピクチャタイプ選定部１０５は、結合判定部１０４が出力する判定結果が、「Ｔｒｕｅ」であるため（ステップＳ１０７−Ｔｒｕｅ）、フレーム番号１のフレームのピクチャタイプをフレーム番号０とフレーム番号２のフレームを参照フレームとするＢピクチャとして選定する。ピクチャタイプ選定部１０５は、フレーム番号２のフレームをＢピクチャとして選定したことを示す情報、及びフレーム番号０とフレーム番号２のフレームが参照フレームであることを示す情報をピクチャタイプ記憶部１０６に書き込んで記憶させる。 The combination determination unit 104 performs the combination determination process shown in FIG. 4 on the reference group 301. Here, it is assumed that the evaluation value determination unit 111 outputs "True" because there is no scene change (step S106). . Since the determination result output from the connection determination unit 104 is “True” (step S 107 -True), the picture type selection unit 105 sets the frame types of frame number 1 to frame number 0 and frame number 2. It is selected as a B picture to be a reference frame. The picture type selection unit 105 writes in the picture type storage unit 106 information indicating that the frame of frame number 2 has been selected as a B picture, and information indicating that the frames of frame number 0 and frame number 2 are reference frames. Remember.

また、ピクチャタイプ選定部１０５は、フレーム番号２のフレームのピクチャタイプをＰピクチャとして選定し、フレーム番号２のフレームをＰピクチャとして選定したことを示す情報をピクチャタイプ記憶部１０６に書き込んで記憶させる。ピクチャタイプ選定部１０５は、判定結果が「Ｔｒｕｅ」の参照グループ３０１を示す情報を参照グループ生成部１０３に出力する（ステップＳ１０９）。 In addition, the picture type selection unit 105 selects the picture type of the frame of frame number 2 as a P picture, and writes information indicating that the frame of frame number 2 is selected as a P picture into the picture type storage unit 106 for storage. . The picture type selection unit 105 outputs information indicating the reference group 301 whose determination result is “True” to the reference group generation unit 103 (step S109).

図５（ｃ）に示すように、参照グループ生成部１０３は、判定結果記憶部１１２を参照し、参照グループ３０１に対して１つ前の参照グループが存在しないため（ステップＳ１１０−ＮＯ）、フレーム番号２のフレームを起点フレームとして（ステップＳ１１６）、フレーム番号２，３，４の３枚の連続するフレームを選択する（ステップＳ１０２−ＹＥＳ）。参照グループ生成部１０３は、フレーム番号２，３，４のフレームをフレームＡ，Ｂ，Ｃとして含む参照グループ３０２を生成し、生成した参照グループ３０２の情報を結合判定部１０４とピクチャタイプ選定部１０５に出力する（ステップＳ１０５）。なお、参照グループ３０２において判定対象フレームは、フレーム番号３のフレームとなる。 As shown in FIG. 5C, the reference group generation unit 103 refers to the determination result storage unit 112, and there is no reference group one before the reference group 301 (step S110-NO). The frame of No. 2 is set as the starting point frame (step S116), and three consecutive frames of frame numbers 2, 3 and 4 are selected (step S102--YES). The reference group generation unit 103 generates the reference group 302 including the frames of frame numbers 2, 3 and 4 as the frames A, B and C, and the generated information of the reference group 302 is combined determination unit 104 and picture type selection unit 105. Output (step S105). In the reference group 302, the determination target frame is a frame of frame number 3.

結合判定部１０４は、参照グループ３０２に対して図４に示す結合判定処理を行い、ここでも、シーンチェンジがないため、評価値判定部１１１は、「Ｔｒｕｅ」を出力したとする（ステップＳ１０６）。ピクチャタイプ選定部１０５は、結合判定部１０４が出力する判定結果が、「Ｔｒｕｅ」であるため（ステップＳ１０７−Ｔｒｕｅ）、フレーム番号３のフレームのピクチャタイプをフレーム番号２とフレーム番号４のフレームを参照フレームとするＢピクチャとして選定する。ピクチャタイプ選定部１０５は、フレーム番号３のフレームをＢピクチャとして選定したことを示す情報、及びフレーム番号２とフレーム番号４のフレームが参照フレームであることを示す情報をピクチャタイプ記憶部１０６に書き込んで記憶させる。 The connection determination unit 104 performs the connection determination process shown in FIG. 4 on the reference group 302. Also here, since there is no scene change, the evaluation value determination unit 111 outputs "True" (step S106). . Since the determination result output from the connection determination unit 104 is “True” (step S 107 -True), the picture type selection unit 105 sets the picture types of the frame of frame number 3 to the frames of frame numbers 2 and 4. It is selected as a B picture to be a reference frame. The picture type selection unit 105 writes in the picture type storage unit 106 information indicating that the frame of frame number 3 has been selected as a B picture, and information indicating that the frames of frame numbers 2 and 4 are reference frames. Remember.

また、ピクチャタイプ選定部１０５は、フレーム番号４のピクチャタイプをＰピクチャとして選定し、フレーム番号４のフレームをＰピクチャとして選定したことを示す情報をピクチャタイプ記憶部１０６に書き込んで記憶させる。ピクチャタイプ選定部１０５は、判定結果が「Ｔｒｕｅ」の参照グループ３０２を示す情報を参照グループ生成部１０３に出力する（ステップＳ１０９）。 Also, the picture type selection unit 105 selects the picture type of frame number 4 as a P picture, and writes and stores information indicating that the frame of frame number 4 is selected as a P picture in the picture type storage unit 106. The picture type selection unit 105 outputs the information indicating the reference group 302 whose determination result is “True” to the reference group generation unit 103 (step S109).

図６（ｄ）に示すように、参照グループ生成部１０３は、判定結果記憶部１１２を参照し、参照グループ３０２に対して１つ前の参照グループ３０１の判定結果が「Ｔｒｕｅ」であると判定する（ステップＳ１１０−ＹＥＳ）。参照グループ生成部１０３は、フレーム番号０，２，４のフレームをフレームα，β，γとして含む参照グループ３０３を生成する。なお、参照グループ３０３において判定対象フレームは、フレーム番号２のフレームとなる。参照グループ生成部１０３は、生成した参照グループ３０３の情報を結合判定部１０４とピクチャタイプ選定部１０５に出力する（ステップＳ１１１）。 As shown in FIG. 6D, the reference group generation unit 103 refers to the determination result storage unit 112 and determines that the determination result of the reference group 301 immediately before the reference group 302 is “True”. (Step S110-YES). The reference group generation unit 103 generates a reference group 303 including the frames of frame numbers 0, 2 and 4 as the frames α, β and γ. The determination target frame in the reference group 303 is the frame of frame number 2. The reference group generation unit 103 outputs the generated information of the reference group 303 to the combination determination unit 104 and the picture type selection unit 105 (step S111).

結合判定部１０４は、参照グループ３０３に対して図４に示す結合判定処理を行い、ここでも、シーンチェンジがないため、評価値判定部１１１は、「Ｔｒｕｅ」を出力したとする（ステップＳ１１２）。ピクチャタイプ選定部１０５は、判定結果が「Ｔｒｕｅ」であるため（ステップＳ１１３−Ｔｒｕｅ）、フレーム番号２のフレームのピクチャタイプをフレーム番号０とフレーム番号４のフレームを参照フレームとするＢピクチャとして選定する。 The connection determination unit 104 performs the connection determination process illustrated in FIG. 4 on the reference group 303. Also here, since there is no scene change, the evaluation value determination unit 111 outputs “True” (step S112). . Since the determination result is "True" (step S113-True), the picture type selection unit 105 selects the picture type of the frame of frame number 2 as a B picture with the frames of frame number 0 and frame number 4 as the reference frame. Do.

ピクチャタイプ記憶部１０６には、既に、フレーム番号２のフレームについて、Ｐピクチャであることを示す情報が書き込まれている。そのため、ピクチャタイプ選定部１０５は、フレーム番号２のフレームをＢピクチャとして選定したことを示す情報、及びフレーム番号０とフレーム番号４のフレームが参照フレームであることを示す情報をピクチャタイプ記憶部１０６に上書きして記憶させる（ステップＳ１１４）。ピクチャタイプ選定部１０５は、次の起点フレームとなるフレーム番号４を示す情報を参照グループ生成部１０３に出力する（ステップＳ１１５）。 In the picture type storage unit 106, information indicating that the frame of frame number 2 is a P picture has already been written. Therefore, the picture type selection unit 105 uses the picture type storage unit 106 as information indicating that the frame of frame number 2 has been selected as a B picture, and information indicating that the frames of frame numbers 0 and 4 are reference frames. Are overwritten and stored (step S114). The picture type selection unit 105 outputs, to the reference group generation unit 103, information indicating frame number 4 that is the next start frame (step S115).

図６（ｅ）に示すように、参照グループ生成部１０３は、フレーム番号４のフレームを起点フレームとして（ステップＳ１１６）、フレーム番号４，５，６の３枚の連続するフレームを選択する（ステップＳ１０２−ＹＥＳ）。参照グループ生成部１０３は、フレーム番号４，５，６のフレームをフレームＡ，Ｂ，Ｃとして含む参照グループ３０４を生成し、生成した参照グループ３０４の情報を結合判定部１０４とピクチャタイプ選定部１０５に出力する（ステップＳ１０５）。なお、参照グループ３０４において判定対象フレームは、フレーム番号５のフレームとなる。 As shown in FIG. 6E, the reference group generation unit 103 selects three consecutive frames of frame numbers 4, 5 and 6 with the frame of frame number 4 as the starting point frame (step S116). S102-YES). The reference group generation unit 103 generates the reference group 304 including the frames of frame numbers 4, 5 and 6 as the frames A, B and C, and the information of the generated reference group 304 is combined determination unit 104 and picture type selection unit 105. Output (step S105). The determination target frame in the reference group 304 is the frame of frame number 5.

結合判定部１０４は、参照グループ３０４に対して図４に示す結合判定処理を行い、ここでも、シーンチェンジがないため、評価値判定部１１１は、「Ｔｒｕｅ」を出力したとする（ステップＳ１０６）。ピクチャタイプ選定部１０５は、結合判定部１０４が出力する判定結果が、「Ｔｒｕｅ」であるため（ステップＳ１０７−Ｔｒｕｅ）、フレーム番号５のフレームのピクチャタイプをフレーム番号４とフレーム番号６のフレームを参照フレームとするＢピクチャとして選定する。ピクチャタイプ選定部１０５は、フレーム番号５のフレームをＢピクチャとして選定したことを示す情報、及びフレーム番号４とフレーム番号６のフレームが参照フレームであることを示す情報をピクチャタイプ記憶部１０６に書き込んで記憶させる。 The connection determination unit 104 performs the connection determination process shown in FIG. 4 on the reference group 304. Also here, since there is no scene change, the evaluation value determination unit 111 outputs "True" (step S106). . Since the determination result output from the connection determination unit 104 is “True” (step S 107 -True), the picture type selection unit 105 sets the frame types of frame number 5 and frame number 6 as the picture type of the frame of frame number 5. It is selected as a B picture to be a reference frame. The picture type selection unit 105 writes in the picture type storage unit 106 information indicating that the frame of frame number 5 has been selected as a B picture, and information indicating that the frames of frame numbers 4 and 6 are reference frames. Remember.

また、ピクチャタイプ選定部１０５は、フレーム番号６のピクチャタイプをＰピクチャとして選定し、フレーム番号６のフレームをＰピクチャとして選定したことを示す情報をピクチャタイプ記憶部１０６に書き込んで記憶させる。ピクチャタイプ選定部１０５は、判定結果が「Ｔｒｕｅ」の参照グループ３０４を示す情報を参照グループ生成部１０３に出力する（ステップＳ１０９）。 Also, the picture type selection unit 105 selects the picture type of frame number 6 as a P picture, and writes and stores information indicating that the frame of frame number 6 is selected as a P picture in the picture type storage unit 106. The picture type selection unit 105 outputs the information indicating the reference group 304 whose determination result is “True” to the reference group generation unit 103 (step S109).

図６（ｆ）に示すように、参照グループ生成部１０３は、判定結果記憶部１１２を参照し、参照グループ３０４に対して１つ前の参照グループ３０３の判定結果が「Ｔｒｕｅ」であると判定する（ステップＳ１１０−ＹＥＳ）。参照グループ生成部１０３は、フレーム番号０，４，６のフレームをフレームα，β，γとして含む参照グループ３０５を生成する。なお、参照グループ３０５において判定対象フレームは、フレーム番号４のフレームとなる。参照グループ生成部１０３は、生成した参照グループ３０５の情報を結合判定部１０４とピクチャタイプ選定部１０５に出力する（ステップＳ１１１）。 As shown in FIG. 6F, the reference group generation unit 103 refers to the determination result storage unit 112 and determines that the determination result of the reference group 303 immediately before the reference group 304 is “True”. (Step S110-YES). The reference group generation unit 103 generates a reference group 305 including the frames of frame numbers 0, 4 and 6 as the frames α, β and γ. The determination target frame in the reference group 305 is the frame of frame number 4. The reference group generation unit 103 outputs the generated information of the reference group 305 to the combination determination unit 104 and the picture type selection unit 105 (step S111).

結合判定部１０４は、参照グループ３０５に対して図４に示す結合判定処理を行い、ここでも、シーンチェンジがないため、評価値判定部１１１は、「Ｔｒｕｅ」を出力したとする（ステップＳ１１２）。ピクチャタイプ選定部１０５は、判定結果が「Ｔｒｕｅ」であるため（ステップＳ１１３−Ｔｒｕｅ）、フレーム番号４のフレームのピクチャタイプをフレーム番号０とフレーム番号６のフレームを参照フレームとするＢピクチャとして選定する。 The connection determination unit 104 performs the connection determination process shown in FIG. 4 on the reference group 305. Also here, since there is no scene change, the evaluation value determination unit 111 outputs "True" (step S112). . Since the determination result is “True” (step S 113 -True), the picture type selection unit 105 selects the picture type of the frame of frame number 4 as the B picture with the frames of frame number 0 and frame number 6 as reference frames. Do.

ピクチャタイプ記憶部１０６には、既に、フレーム番号４のフレームについて、Ｐピクチャであることを示す情報が書き込まれている。そのため、ピクチャタイプ選定部１０５は、フレーム番号４のフレームをＢピクチャとして選定したことを示す情報、及びフレーム番号０とフレーム番号６のフレームが参照フレームであることを示す情報をピクチャタイプ記憶部１０６に上書きして記憶させる（ステップＳ１１４）。ピクチャタイプ選定部１０５は、次の起点フレームとなるフレーム番号６を示す情報を参照グループ生成部１０３に出力する（ステップＳ１１５）。 Information indicating that the frame of frame number 4 is a P picture has already been written in the picture type storage unit 106. Therefore, the picture type selection unit 105 uses the picture type storage unit 106 as information indicating that the frame of frame number 4 is selected as a B picture, and information indicating that the frames of frame number 0 and frame number 6 are reference frames. Are overwritten and stored (step S114). The picture type selection unit 105 outputs, to the reference group generation unit 103, information indicating the frame number 6 that is the next starting point frame (step S115).

図７（ｇ）に示すように、参照グループ生成部１０３は、フレーム番号６のフレームを起点フレームとして（ステップＳ１１６）、フレーム番号６，７，８の３枚の連続するフレームを選択する（ステップＳ１０２−ＹＥＳ）。参照グループ生成部１０３は、フレーム番号６，７，８のフレームをフレームＡ，Ｂ，Ｃとして含む参照グループ３０６を生成し、生成した参照グループ３０６の情報を結合判定部１０４とピクチャタイプ選定部１０５に出力する（ステップＳ１０５）。なお、参照グループ３０６において判定対象フレームは、フレーム番号７のフレームとなる。 As shown in FIG. 7G, the reference group generation unit 103 selects three consecutive frames of frame numbers 6, 7 and 8 with the frame of frame number 6 as the starting point frame (step S116). S102-YES). The reference group generation unit 103 generates a reference group 306 including the frames of frame numbers 6, 7 and 8 as frames A, B and C, and the information of the generated reference group 306 is combined determination unit 104 and picture type selection unit 105. Output (step S105). In the reference group 306, the determination target frame is the frame of frame number 7.

結合判定部１０４は、参照グループ３０６に対して図４に示す結合判定処理を行い、ここでも、シーンチェンジがないため、評価値判定部１１１は、「Ｔｒｕｅ」を出力したとする（ステップＳ１０６）。ピクチャタイプ選定部１０５は、結合判定部１０４が出力する判定結果が、「Ｔｒｕｅ」であるため（ステップＳ１０７−Ｔｒｕｅ）、フレーム番号７のフレームのピクチャタイプをフレーム番号６とフレーム番号８のフレームを参照フレームとするＢピクチャとして選定する。ピクチャタイプ選定部１０５は、フレーム番号７のフレームをＢピクチャとして選定したことを示す情報、及びフレーム番号６とフレーム番号８のフレームが参照フレームであることを示す情報をピクチャタイプ記憶部１０６に書き込んで記憶させる。 The connection determination unit 104 performs the connection determination process shown in FIG. 4 on the reference group 306. Also here, since there is no scene change, the evaluation value determination unit 111 outputs "True" (step S106). . Since the determination result output from the connection determination unit 104 is “True” (step S 107 -True), the picture type selection unit 105 sets the picture types of the frame of frame number 7 as the frame numbers 6 and 8. It is selected as a B picture to be a reference frame. The picture type selection unit 105 writes in the picture type storage unit 106 information indicating that the frame of frame number 7 has been selected as a B picture, and information indicating that the frames of frame numbers 6 and 8 are reference frames. Remember.

また、ピクチャタイプ選定部１０５は、フレーム番号８のピクチャタイプをＰピクチャとして選定し、フレーム番号８のフレームをＰピクチャとして選定したことを示す情報をピクチャタイプ記憶部１０６に書き込んで記憶させる。ピクチャタイプ選定部１０５は、判定結果が「Ｔｒｕｅ」の参照グループ３０６を示す情報を参照グループ生成部１０３に出力する（ステップＳ１０９）。 In addition, the picture type selection unit 105 selects the picture type of frame number 8 as a P picture, and writes and stores information indicating that the frame of frame number 8 is selected as a P picture in the picture type storage unit 106. The picture type selection unit 105 outputs the information indicating the reference group 306 whose determination result is “True” to the reference group generation unit 103 (step S109).

図７（ｈ）に示すように、参照グループ生成部１０３は、判定結果記憶部１１２を参照し、参照グループ３０６に対して１つ前の参照グループ３０５の判定結果が「Ｔｒｕｅ」であると判定する（ステップＳ１１０−ＹＥＳ）。参照グループ生成部１０３は、フレーム番号０，６，８のフレームをフレームα，β，γとして含む参照グループ３０７を生成する。なお、参照グループ３０７において判定対象フレームは、フレーム番号６のフレームとなる。参照グループ生成部１０３は、生成した参照グループ３０７の情報を結合判定部１０４とピクチャタイプ選定部１０５に出力する（ステップＳ１１１）。 As shown in FIG. 7H, the reference group generation unit 103 refers to the determination result storage unit 112, and determines that the determination result of the reference group 305 immediately before the reference group 306 is "True". (Step S110-YES). The reference group generation unit 103 generates a reference group 307 including the frames of frame numbers 0, 6, and 8 as the frames α, β, and γ. In the reference group 307, the determination target frame is a frame of frame number 6. The reference group generation unit 103 outputs the generated information of the reference group 307 to the combination determination unit 104 and the picture type selection unit 105 (step S111).

結合判定部１０４は、参照グループ３０７に対して図４に示す結合判定処理を行い、ここでも、シーンチェンジがないため、評価値判定部１１１は、「Ｔｒｕｅ」を出力したとする（ステップＳ１１２）。ピクチャタイプ選定部１０５は、判定結果が「Ｔｒｕｅ」であるため（ステップＳ１１３−Ｔｒｕｅ）、フレーム番号６のフレームのピクチャタイプをフレーム番号０とフレーム番号８のフレームを参照フレームとするＢピクチャとして選定する。 The connection determination unit 104 performs the connection determination process shown in FIG. 4 on the reference group 307. Also here, since there is no scene change, the evaluation value determination unit 111 outputs "True" (step S112). . Since the determination result is “True” (step S 113 -True), the picture type selection unit 105 selects the picture type of the frame of frame number 6 as the B picture with the frames of frame number 0 and frame number 8 as the reference frame Do.

ピクチャタイプ記憶部１０６には、既に、フレーム番号６のフレームについて、Ｐピクチャであることを示す情報が書き込まれている。そのため、ピクチャタイプ選定部１０５は、フレーム番号６のフレームをＢピクチャとして選定したことを示す情報、及びフレーム番号０とフレーム番号８のフレームが参照フレームであることを示す情報をピクチャタイプ記憶部１０６に上書きして記憶させる（ステップＳ１１４）。ピクチャタイプ選定部１０５は、次の起点フレームとなるフレーム番号８を示す情報を参照グループ生成部１０３に出力する（ステップＳ１１５）。 In the picture type storage unit 106, information indicating that the frame of frame number 6 is a P picture has already been written. Therefore, the picture type selection unit 105 uses the picture type storage unit 106 as information indicating that the frame of frame number 6 has been selected as a B picture, and information indicating that the frames of frame number 0 and frame number 8 are reference frames. Are overwritten and stored (step S114). The picture type selection unit 105 outputs, to the reference group generation unit 103, information indicating the frame number 8 that is the next start point frame (step S115).

図７（ｉ）に示すように、参照グループ生成部１０３は、フレーム番号８のフレームを起点フレームとして（ステップＳ１１６）、フレーム番号８，９，１０の３枚の連続するフレームを選択する（ステップＳ１０２−ＹＥＳ）。参照グループ生成部１０３は、フレーム番号８，９，１０のフレームをフレームＡ，Ｂ，Ｃとして含む参照グループ３０８を生成し、生成した参照グループ３０８の情報を結合判定部１０４とピクチャタイプ選定部１０５に出力する（ステップＳ１０５）。なお、参照グループ３０８において判定対象フレームは、フレーム番号９のフレームとなる。 As shown in FIG. 7I, the reference group generation unit 103 selects three consecutive frames of frame numbers 8, 9, and 10, using the frame of frame number 8 as the starting point frame (step S116). S102-YES). The reference group generation unit 103 generates the reference group 308 including the frames of frame numbers 8, 9, and 10 as the frames A, B, and C, and the information of the generated reference group 308 is combined determination unit 104 and picture type selection unit 105. Output (step S105). In the reference group 308, the determination target frame is the frame of frame number 9.

結合判定部１０４は、参照グループ３０８に対して図４に示す結合判定処理を行う。図５（ａ）に示したように、フレーム番号８とフレーム番号９の間ではシーンチェンジがあるため、ここでは、評価値判定部１１１は、「Ｆａｌｓｅ」を出力したとする（ステップＳ１０６）。ピクチャタイプ選定部１０５は、結合判定部１０４が出力する判定結果が、「Ｆａｌｓｅ」であるため（ステップＳ１０７−Ｆａｌｓｅ）、フレーム番号９のフレームのピクチャタイプをＰピクチャとして選定する。ピクチャタイプ選定部１０５は、フレーム番号９のフレームをＰピクチャとして選定したことを示す情報をピクチャタイプ記憶部１０６に書き込んで記憶させる。ピクチャタイプ選定部１０５は、フレーム番号９のフレームを起点フレームとする（ステップＳ１０８）。 The combination determination unit 104 performs the combination determination process illustrated in FIG. 4 on the reference group 308. As shown in FIG. 5A, since there is a scene change between frame number 8 and frame number 9, it is assumed here that the evaluation value determination unit 111 outputs "False" (step S106). The picture type selection unit 105 selects the picture type of the frame with the frame number 9 as the P picture because the determination result output from the connection determination unit 104 is “False” (step S 107 -False). The picture type selection unit 105 writes and stores information indicating that the frame of frame number 9 is selected as a P picture in the picture type storage unit 106. The picture type selection unit 105 sets the frame of frame number 9 as the start frame (step S108).

図８（ｊ）に示すように、参照グループ生成部１０３は、フレーム番号９のフレームを起点として３枚の連続するフレームを選択することができない（ステップＳ１０２−ＮＯ）。参照グループ生成部１０３は、残りのフレームであるフレーム番号１０のフレームのピクチャタイプをＰピクチャとして選定する指示情報をピクチャタイプ選定部１０５に出力する。ピクチャタイプ選定部１０５は、当該指示情報を受けて、フレーム番号１０のフレームをＰピクチャとして選定し、フレーム番号１０のフレームをＰピクチャとして選定したことを示す情報をピクチャタイプ記憶部１０６に書き込んで記憶させる（ステップＳ１０３）。 As shown in FIG. 8 (j), the reference group generation unit 103 can not select three consecutive frames starting from the frame of frame number 9 (step S102-NO). The reference group generation unit 103 outputs, to the picture type selection unit 105, instruction information for selecting the picture type of the frame of frame number 10 which is the remaining frame as a P picture. The picture type selection unit 105 receives the instruction information, selects the frame of frame number 10 as a P picture, and writes information indicating that the frame of frame number 10 is selected as a P picture in the picture type storage unit 106. It memorizes (Step S103).

ピクチャタイプ選定部１０５は、ピクチャタイプ記憶部１０６に記憶されている情報を参照し、Ｐピクチャとして選定されている、フレーム番号８，９，１０のフレームについて、表示順で最も近い過去のＰピクチャのフレームを参照フレームとして選定する。表示順で最も近い過去のＰピクチャがない場合、Ｉピクチャのフレームを参照フレームとして選定する。ピクチャタイプ選定部１０５は、選定した参照フレームを示す情報をピクチャタイプ記憶部１０６に書き込んで記憶させる（ステップＳ１０４）。 The picture type selection unit 105 refers to the information stored in the picture type storage unit 106, and for the frame of frame numbers 8, 9, and 10 selected as a P picture, the closest past P picture in display order The frame of is selected as the reference frame. When there is no closest past P picture in display order, a frame of I picture is selected as a reference frame. The picture type selection unit 105 writes and stores information indicating the selected reference frame in the picture type storage unit 106 (step S104).

フレーム番号８のフレームについては、表示順で最も近い過去のＰピクチャがないため、フレーム番号０のＩピクチャのフレームが参照フレームとなる。フレーム番号９のフレームについては、表示順で最も近い過去のＰピクチャは、フレーム番号８のフレームとなる。フレーム番号１０のフレームについては、表示順で最も近い過去のＰピクチャは、フレーム番号９のフレームとなる。 As for the frame of frame number 8, there is no closest past P picture in display order, so the frame of I picture of frame number 0 is the reference frame. For the frame of frame number 9, the closest past P picture in display order is the frame of frame number 8. For the frame of frame number 10, the closest past P picture in display order is the frame of frame number 9.

ピクチャタイプ記憶部１０６に記憶されている情報に基づいて構築された符号化構造を階層化すると、図８（ｋ）に示すような、４階層のＢピクチャの階層構造を有する符号化構造となる。 When the coding structure constructed based on the information stored in the picture type storage unit 106 is hierarchized, the coding structure has a hierarchical structure of B pictures of four layers as shown in FIG. .

（結合判定処理の具体例）
次に、図９及び図１０を参照しつつ、結合判定部１０４による結合判定処理の例について説明する。図９は、結合判定処理を適用するＳ，Ｍ，Ｅフレームの例であり、図９（ｉ）は、Ｓ，Ｍ，Ｅフレームの３枚のフレームにおいてシーンチェンジがない例であり、（ｉｉ）は、ＭフレームとＥフレームの間でシーンチェンジがある例であり、（ｉｉｉ）は、ＳフレームとＭフレームの間でシーンチェンジがある例である。図９（ｉｖ）と（ｖ）は、ＳフレームからＥフレームにわたって部分的にシーンチェンジがある例である。 (Specific example of combination determination processing)
Next, an example of the connection determination process by the connection determination unit 104 will be described with reference to FIGS. 9 and 10. FIG. 9 shows an example of S, M and E frames to which the connection determination process is applied. FIG. 9 (i) is an example in which there are no scene changes in three frames of S, M and E frames. Is an example in which there is a scene change between M frame and E frame, and (iii) is an example in which there is a scene change between S frame and M frame. FIGS. 9 (iv) and (v) are examples in which there is a scene change in part from S frame to E frame.

この例において、相対評価用閾値Ｔｈ＿ｍｉｎ，Ｔｈ＿ｍａｘとして、それぞれ「０．８」，「１．２」が定められており、絶対評価用閾値Ｔｈ＿ａｂｓとして、「３５０」が定められているとする。 In this example, it is assumed that “0.8” and “1.2” are defined as the relative evaluation threshold Th_min and Th_max, and “350” is defined as the absolute evaluation threshold Th_abs.

図９（ｉ）の場合、結合判定部１０４の評価値算出部１１０は、Ｌ０，Ｌ１，Ｂｉコストの評価値として、図１０に示すように、それぞれ「１５０」、「１５０」、「１００」を算出する（ステップＳ２０１）。評価値判定部１１１により式（１）を適用すると、相対値＝ｍｉｎ（１００，１５０）／１５０＝１００／１５０＝２／３となる（ステップＳ２０２，Ｓ２０３）。２／３＜０．８であるため（ステップＳ２０４、＜Ｔｈ＿ｍｉｎ）、評価値判定部１１１は、「Ｔｒｕｅ」を出力する（ステップＳ２０５）。 In the case of FIG. 9I, the evaluation value calculation unit 110 of the connection determination unit 104 sets “150”, “150”, and “100” as evaluation values of L0, L1, and Bi costs, respectively, as shown in FIG. Is calculated (step S201). If Formula (1) is applied by the evaluation value determination part 111, it will become relative value = min (100,150) / 150 = 100/150 = 2/3 (step S202, S203). Since 2/3 <0.8 (step S204, <Th_min), the evaluation value determination unit 111 outputs "True" (step S205).

図９（ｉｉ）の場合、結合判定部１０４の評価値算出部１１０は、Ｌ０，Ｌ１，Ｂｉコストの評価値として、図１０に示すように、それぞれ「１５０」、「５００」、「４００」を算出する（ステップＳ２０１）。評価値判定部１１１により式（１）を適用すると、相対値＝ｍｉｎ（４００，５００）／１５０＝４００／１５０＝８／３となる（ステップＳ２０２，Ｓ２０３）。８／３＞１．２であるため（ステップＳ２０４、＞Ｔｈ＿ｍａｘ）、評価値判定部１１１は、「Ｆａｌｓｅ」を出力する（ステップＳ２０６）。 In the case of FIG. 9 (ii), the evaluation value calculation unit 110 of the connection determination unit 104 sets “150”, “500”, and “400” as evaluation values of L0, L1, and Bi costs, respectively, as shown in FIG. Is calculated (step S201). If Formula (1) is applied by the evaluation value determination part 111, it will become relative value = min (400,500) / 150 = 400/150 = 8/3 (step S202, S203). Since 8/3> 1.2 (step S204,> Th_max), the evaluation value determination unit 111 outputs "False" (step S206).

図９（ｉｉｉ）の場合、結合判定部１０４の評価値算出部１１０は、Ｌ０，Ｌ１，Ｂｉコストの評価値として、図１０に示すように、それぞれ「５００」、「１５０」、「４００」を算出する（ステップＳ２０１）。評価値判定部１１１により式（１）を適用すると、相対値＝ｍｉｎ（４００，１５０）／５００＝１５０／５００＝０．３となる（ステップＳ２０２，Ｓ２０３）。０．３＜０．８であるため（ステップＳ２０４、＜Ｔｈ＿ｍｉｎ）、評価値判定部１１１は、「Ｔｒｕｅ」を出力する（ステップＳ２０５）。 In the case of FIG. 9 (iii), the evaluation value calculation unit 110 of the connection determination unit 104 sets “500”, “150”, and “400” as evaluation values of L0, L1, and Bi costs, respectively, as shown in FIG. Is calculated (step S201). If Formula (1) is applied by the evaluation value determination part 111, it will become relative value = min (400,150) /500=150/500=0.3 (step S202, S203). Since 0.3 <0.8 (step S204, <Th_min), the evaluation value determination unit 111 outputs "True" (step S205).

図９（ｉｖ）の場合、結合判定部１０４の評価値算出部１１０は、Ｌ０，Ｌ１，Ｂｉコストの評価値として、図１０に示すように、それぞれ「３００」、「４００」、「３００」を算出する（ステップＳ２０１）。評価値判定部１１１により式（１）を適用すると、相対値＝ｍｉｎ（３００，４００）／３００＝３００／３００＝１となる（ステップＳ２０２，Ｓ２０３）。０．８≦１≦１．２であるため（ステップＳ２０４、≧Ｔｈ＿ｍｉｎかつ≦Ｔｈ＿ｍａｘ）、評価値判定部１１１は、Ｂｉコストの値の大きさを絶対評価用閾値に基づいて判定する（ステップＳ２０７）。Ｂｉコストの値「３００」＜Ｔｈ＿ａｂｓ「３５０」であるため（ステップＳ２０７、＜Ｔｈ＿ａｂｓ）、評価値判定部１１１は、「Ｔｒｕｅ」を出力する（ステップＳ２０８）。 In the case of FIG. 9 (iv), the evaluation value calculation unit 110 of the connection determination unit 104 sets “300”, “400”, and “300” as evaluation values of L0, L1, and Bi costs, respectively, as shown in FIG. Is calculated (step S201). If Formula (1) is applied by the evaluation value determination part 111, it will become relative value = min (300,400) / 300 = 300/300 = 1 (step S202, S203). Since 0.8 ≦ 1 ≦ 1.2 (step S204, ThTh_min and ≦ Th_max), the evaluation value determination unit 111 determines the value of the Bi cost based on the absolute evaluation threshold (step S207). ). Since the value of Bi cost is “300” <Th_abs “350” (step S207, <Th_abs), the evaluation value determination unit 111 outputs “True” (step S208).

図９（ｖ）の場合、結合判定部１０４の評価値算出部１１０は、Ｌ０，Ｌ１，Ｂｉコストの評価値として、図１０に示すように、それぞれ「４００」、「５００」、「４００」を算出する（ステップＳ２０１）。評価値判定部１１１により式（１）を適用すると、相対値＝ｍｉｎ（４００，５００）／４００＝４００／４００＝１となる（ステップＳ２０２，Ｓ２０３）。０．８≦１≦１．２であるため（ステップＳ２０４、≧Ｔｈ＿ｍｉｎかつ≦Ｔｈ＿ｍａｘ）、評価値判定部１１１は、Ｂｉコストの値の大きさを絶対評価用閾値に基づいて判定する（ステップＳ２０７）。Ｂｉコストの値「４００」≧Ｔｈ＿ａｂｓ「３５０」であるため（ステップＳ２０７、≧Ｔｈ＿ａｂｓ）、評価値判定部１１１は、「Ｆａｌｓｅ」を出力する（ステップＳ２０６）。 In the case of FIG. 9 (v), the evaluation value calculation unit 110 of the connection determination unit 104 sets “400”, “500”, and “400” as evaluation values of L0, L1, and Bi costs, respectively, as shown in FIG. Is calculated (step S201). When the expression (1) is applied by the evaluation value determination unit 111, the relative value = min (400, 500) / 400 = 400/400 = 1 is obtained (steps S202 and S203). Since 0.8 ≦ 1 ≦ 1.2 (step S204, ThTh_min and ≦ Th_max), the evaluation value determination unit 111 determines the value of the Bi cost based on the absolute evaluation threshold (step S207). ). Since the value of the Bi cost "400" ThTh_abs "350" (step S207, ThTh_abs), the evaluation value determination unit 111 outputs "False" (step S206).

上記の第１の実施形態の構成により、符号化構造構築部１０において、符号化対象フレーム選択部１０１が、入力映像情報から予め定められるＧＯＰのサイズにしたがう枚数のフレームを選択してフレーム群を生成する。Ｉピクチャ選定部１０２は、フレーム群の最初のフレームをＩピクチャとして選定し、起点フレームとする。参照グループ生成部１０３は、起点フレームを起点として３枚の連続するフレームを選択し、選択した３枚のフレームをＡ，Ｂ，Ｃフレームとして参照グループを生成する。結合判定部１０４の評価値算出部１１０は、参照グループに含まれるＡ，Ｂ，Ｃフレームに基づいて、Ｌ０コスト、Ｌ１コスト、Ｂｉコストという３つの評価値を算出する。評価値判定部１１１は、ＢｉコストとＬ１コストのうち値の小さい方を選択し、Ｌ０コストの値を基準とした相対値と、予め定められる相対評価用閾値Ｔｈ＿ｍｉｎ，Ｔｈ＿ｍａｘとに基づいて、判定対象フレームであるＢフレームと、ＡとＣフレームの参照関係を結合するか否かを判定する。また、評価値判定部１１１は、相対値が、Ｔｈ＿ｍｉｎの値以上であって、かつＴｈ＿ｍａｘの値以下の範囲に含まれる場合、更に、Ｂｉコストの値と、絶対評価用閾値Ｔｈ＿ａｂｓとに基づいて、判定対象フレームであるＢフレームと、ＡとＣフレームの参照関係を結合するか否かを判定する。ピクチャタイプ選定部１０５は、評価値判定部１１１が、Ｂフレームと、ＡとＣフレームの参照関係を結合すると判定した場合、Ｂフレームを、ＡとＣフレームを参照フレームとするＢピクチャとして選定する。すなわち、符号化構造構築部１０では、実際の符号化を行うような演算を行っておらず、３つのフレームに基づいて３つの評価値を算出し、算出した３つの評価値に基づいて、３つのフレームの参照関係を結合するか否かを判定している。このため、第１の実施形態の符号化構造構築部１０では、符号化効率の高い符号化構造を低演算量で求めることが可能となる。 According to the configuration of the first embodiment, in the coding structure construction unit 10, the coding target frame selection unit 101 selects a number of frames according to the size of the GOP determined in advance from the input video information, and sets the frame group. Generate The I picture selection unit 102 selects the first frame of the frame group as an I picture and uses it as a start frame. The reference group generation unit 103 selects three continuous frames starting from the start frame, and generates a reference group using the selected three frames as A, B, and C frames. The evaluation value calculation unit 110 of the combination determination unit 104 calculates three evaluation values of L0 cost, L1 cost, and Bi cost based on the A, B, and C frames included in the reference group. The evaluation value determination unit 111 selects the smaller one of the Bi cost and the L1 cost, and makes a determination based on the relative value based on the value of the L0 cost and the predetermined relative evaluation threshold values Th_min and Th_max. It is determined whether or not the reference relationship between the B frame, which is the target frame, and the A and C frames is combined. In addition, when the relative value is within the range not less than the value of Th_min and not more than the value of Th_max, the evaluation value determining unit 111 further determines the value of Bi cost based on the absolute evaluation threshold Th_abs. It is determined whether or not the reference relationship between the B frame, which is the determination target frame, and the A and C frames is combined. If the evaluation value determination unit 111 determines that the reference relationship between the B frame and the A and C frames is combined, the picture type selection unit 105 selects the B frame as a B picture having the A and C frames as reference frames. . That is, the encoding structure construction unit 10 does not perform an operation to perform actual coding, but calculates three evaluation values based on three frames, and calculates 3 based on the calculated 3 evaluation values. It is determined whether or not reference relationships of two frames are to be combined. For this reason, in the coding structure construction unit 10 of the first embodiment, it is possible to obtain a coding structure with high coding efficiency with a small amount of calculation.

また、参照グループ生成部１０３は、評価値判定部１１１が、判定対象フレームと最初と最後のフレームの参照関係を結合すると判定した隣接する２つの参照グループを結合して重複するフレームを判定対象フレームとする新たな参照グループを生成することを繰り返す。結合判定部１０４は、参照グループ生成部１０３が、新たに生成した参照グループのα，β，γフレームに基づいて、結合判定処理を行う。これにより、第１の実施形態の符号化構造構築部１０では、２階層以上の深い階層の符号化構造を構築することが可能となっており、符号化効率を向上させることが可能となっている。例えば、上述したように、図８（ｋ）の例では、４階層のＢピクチャの階層構造を有する符号化構造を構築することができることを示している。 Further, the reference group generation unit 103 combines two adjacent reference groups of which the evaluation value determination unit 111 determines that the reference relationship between the determination target frame and the first and last frames are combined Repeat creating a new reference group. In the connection determination unit 104, the reference group generation unit 103 performs connection determination processing based on the newly generated α, β, and γ frames of the reference group. As a result, in the coding structure construction unit 10 of the first embodiment, it is possible to construct a coding structure of two or more deep layers, and it becomes possible to improve the coding efficiency. There is. For example, as described above, the example of FIG. 8 (k) indicates that it is possible to construct a coding structure having a hierarchical structure of B pictures of four layers.

（第２の実施形態）
図１１は、本発明の第２の実施形態における符号化構造構築部１０ａの構成を示すブロック図である。符号化構造構築部１０ａは、第１の実施形態の映像符号化装置１の符号化構造構築部１０に置き換えて適用される機能部である。なお、第２の実施形態の符号化構造構築部１０ａが適用される場合、映像符号化装置の符号を「１」に替えて「１ａ」で表すものとする。第２の実施形態において、第１の実施形態と同一の構成については、同一の符号を付し、以下、第１の実施形態の符号化構造構築部１０と異なる構成について説明する。 Second Embodiment
FIG. 11 is a block diagram showing the configuration of the coding structure construction unit 10a in the second embodiment of the present invention. The coding structure construction unit 10 a is a functional unit to be applied replacing the coding structure construction unit 10 of the video coding device 1 according to the first embodiment. When the coding structure construction unit 10a of the second embodiment is applied, the code of the video coding apparatus is represented by "1a" instead of "1". In the second embodiment, the same components as those of the first embodiment are denoted by the same reference numerals, and components different from those of the coding structure construction unit 10 of the first embodiment will be described below.

符号化構造構築部１０ａは、符号化対象フレーム選択部１０１、Ｉピクチャ選定部１０２ａ、参照グループ生成部１０３ａ、結合判定部１０４、ピクチャタイプ選定部１０５ａ、ピクチャタイプ記憶部１０６、ブロック出力部１０７、及び参照グループ記憶部１０８を備える。 The coding structure construction unit 10a includes a coding target frame selection unit 101, an I picture selection unit 102a, a reference group generation unit 103a, a combination determination unit 104, a picture type selection unit 105a, a picture type storage unit 106, a block output unit 107, And a reference group storage unit 108.

符号化構造構築部１０ａにおいて、Ｉピクチャ選定部１０２ａは、フレーム群の最初のフレームをＩピクチャとして選定する。参照グループ生成部１０３ａは、初期値をｉ＝０、ｊ＝０とし、フレーム群からフレーム番号が「２ｊ」，「２ｊ＋２^ｉ」，「２ｊ＋２^{（ｉ＋１）}」の３枚のフレームを含む参照グループを生成することができなくなるまで、ｊに１を加算しつつ、ｊがＮ／２未満の間繰り返す。ここで、Ｎの値は、フレーム群に含まれるフレームに付与されるフレーム番号の最大値であり、フレーム群のフレーム数から「１」減算した値である。 In the coding structure construction unit 10a, the I picture selection unit 102a selects the first frame of the frame group as an I picture. The reference group generation unit 103a sets an initial value to i = 0 and j = 0, and a reference group including three frames of frame numbers “2j”, “2j + 2 ⁱ ”, and “2j + 2 ^{(i + 1)} ” from the frame group Repeat while j is less than N / 2, adding 1 to j, until it can not be generated. Here, the value of N is the maximum value of the frame numbers assigned to the frames included in the frame group, and is a value obtained by subtracting “1” from the number of frames in the frame group.

また、参照グループ生成部１０３ａは、選択したフレーム番号が「２ｊ」，「２ｊ＋２^ｉ」，「２ｊ＋２^{（ｉ＋１）}」のフレームの組み合わせの各々において、３つのフレームをＡ，Ｂ，Ｃフレームとし、Ａ，Ｂ，Ｃフレームを含む、ｉとｊのインデックスが付いた参照グループＲｅｆ［ｉ］［ｊ］を生成する（ただし、Ａ，Ｂ，Ｃフレームを含む参照グループを生成する場合、ｉ＝０に固定され、ｊのみが変数となる）。また、参照グループ生成部１０３ａは、生成した参照グループＲｅｆ［０］［ｊ］を参照グループ記憶部１０８に書き込んで記憶させる。 In addition, the reference group generation unit 103a sets three frames as A, B, and C frames in each of the combinations of the selected frame numbers “2 ^j ”, “2 j + 2 ⁱ ”, and “2 j + 2 ^{(i + 1)} ”. Create a reference group Ref [i] [j] indexed by i and j, including B, C, C frames (however, if generating a reference group including A, B, C frames, i = 0) Fixed, only j is a variable). Further, the reference group generation unit 103a writes the generated reference group Ref [0] [j] in the reference group storage unit 108 and stores the reference group Ref [0] [j].

また、参照グループ生成部１０３ａは、初期値をｉ＝１，ｊ＝０として、参照グループＲｅｆ［ｉ−１］［ｊ］とＲｅｆ［ｉ−１］［ｊ＋１］の両方において、結合判定部１０４が判定した判定結果が「Ｔｒｕｅ」となっている組み合わせの検出を、ｊに２を加算しつつ、ｊがＮ／２未満の間繰り返すことを、更に、ｉに１加算しつつ、ｉがｄｅｐｔｈ＿ｍａｘ未満の間繰り返し行う。ここで、ｄｅｐｔｈ＿ｍａｘは、予め定めらえる構築する符号化構造の階層数を示す値であり、３階層の場合、ｄｅｐｔｈ＿ｍａｘは、「３」となる。 Further, the reference group generation unit 103 a sets the initial value to i = 1, j = 0, and the connection determination unit 104 in both of the reference groups Ref [i−1] [j] and Ref [i−1] [j + 1]. Is repeated while j is less than N / 2 while 2 is added to j, while i is 1 and depth_max is added. Repeat for less than. Here, depth_max is a value indicating the number of layers of the coding structure to be constructed, which is predetermined, and in the case of three layers, depth_max is "3".

また、参照グループ生成部１０３ａは、検出した参照グループＲｅｆ［ｉ−１］［ｊ］とＲｅｆ［ｉ−１］［ｊ＋１］の組み合わせの各々において、Ｒｅｆ［ｉ−１］［ｊ］の最初と最後のフレームと、Ｒｅｆ［ｉ−１］［ｊ＋１］の最後のフレームの３つのフレームをα，β，γフレームとして選択する。また、参照グループ生成部１０３ａは、選択したα，β，γフレーム含む新たな参照グループを参照グループＲｅｆ［ｉ］「ｊ／２」として参照グループ記憶部１０８に書き込んで記憶させる。 In addition, the reference group generation unit 103a is configured to set the first of Ref [i-1] [j] and the first of Ref [i-1] [j] in each of the detected combinations of the reference groups Ref [i-1] [j] Three frames of the last frame and the last frame of Ref [i-1] [j + 1] are selected as α, β and γ frames. Further, the reference group generation unit 103a writes and stores a new reference group including the selected α, β, and γ frames as a reference group Ref [i] “j / 2” in the reference group storage unit 108.

ピクチャタイプ選定部１０５ａは、結合判定部１０４が出力する判定結果にしたがって、参照グループＲｅｆ［ｉ］［ｊ］に含まれるフレームのピクチャタイプを選定する。また、ピクチャタイプ選定部１０５ａは、参照グループ生成部１０３ａが、フレーム番号が「２ｊ」，「２ｊ＋２^ｉ」，「２ｊ＋２^{（ｉ＋１）}」のフレームを選択できないと判定した場合、残りのフレームをＰピクチャとして選定する。 The picture type selection unit 105a selects the picture type of the frame included in the reference group Ref [i] [j] according to the determination result output from the combination determination unit 104. If the picture type selection unit 105a determines that the reference group generation unit 103a can not select the frames with frame numbers “2j”, “2j + 2 ⁱ ”, and “2j + 2 ^{(i + 1)} ”, the remaining frames are P pictures. To be selected.

また、ピクチャタイプ選定部１０５ａは、全てのフレームについて、ピクチャタイプを選定した後、Ｐピクチャとして選定したフレームに対して、当該フレームに表示順で最も近い過去のＰピクチャとして選定されたフレームを当該フレームの参照フレームとして選定する。表示順で最も近い過去のＰピクチャがない場合、Ｉピクチャのフレームを当該フレームの参照フレームとして選定する。参照グループ記憶部１０８は、参照グループ生成部１０３ａが生成した参照グループＲｅｆ［ｉ］［ｊ］を記憶する。 Further, after selecting the picture types for all the frames, the picture type selection unit 105a selects the frame selected as the P picture, and the frame selected as the past P picture closest to the frame in the display order Select as the frame reference frame. When there is no closest past P picture in display order, a frame of I picture is selected as a reference frame of the frame. The reference group storage unit 108 stores the reference group Ref [i] [j] generated by the reference group generation unit 103a.

（第２の実施形態の符号化構造構築処理）
次に、第２の実施形態の符号化構造構築部１０ａによる処理について説明する。図１２及び図１３は、符号化構造構築部１０による符号化構造構築処理の流れを示すフローチャートである。図１２と図１３に示す処理は、符号Ａで示される箇所で処理が接続される連続した処理である。図１２に示す符号Ａで示される箇所に至るまでの処理は、ｉ＝０で示される１階層目の符号化構造を構築する処理であり、図１３に示す処理は、ｉ＝１，２で示される２階層目、３階層目の符号化構造を構築する処理である。 (Coding structure construction processing of the second embodiment)
Next, processing by the coding structure construction unit 10a of the second embodiment will be described. 12 and 13 are flowcharts showing the flow of coding structure construction processing by the coding structure construction unit 10. The processes shown in FIGS. 12 and 13 are continuous processes in which the processes are connected at a point indicated by a symbol A. The processing up to the point indicated by symbol A shown in FIG. 12 is the processing of constructing the coding structure of the first layer indicated by i = 0, and the processing shown in FIG. 13 is i = 1, 2 This is processing for constructing the coding structure of the second and third layers shown.

符号化対象フレーム選択部１０１は、入力映像情報から予め定められるＧＯＰのサイズにしたがう枚数のフレームを選択してフレーム群を生成する。符号化対象フレーム選択部１０１は、生成したフレーム群に対して表示順に「０」からフレーム番号を付与してＩピクチャ選定部１０２ａとブロック出力部１０７に出力する。Ｉピクチャ選定部１０２ａは、フレーム群の最初のフレーム、すなわちフレーム番号０のフレームのピクチャタイプをＩピクチャとして選定する。Ｉピクチャ選定部１０２ａは、フレーム番号０のフレームをＩピクチャとして選定したことを示す情報をピクチャタイプ記憶部１０６に書き込んで記憶させる（ステップＳ３０１）。 The encoding target frame selection unit 101 selects a number of frames according to a predetermined GOP size from input video information and generates a frame group. The encoding target frame selection unit 101 assigns a frame number from “0” to the generated frame group in display order, and outputs the frame group to the I picture selection unit 102 a and the block output unit 107. The I picture selection unit 102a selects the picture type of the first frame of the frame group, that is, the frame of frame number 0 as an I picture. The I picture selection unit 102a writes and stores information indicating that the frame of frame number 0 has been selected as an I picture in the picture type storage unit 106 (step S301).

参照グループ生成部１０３ａは、ｉ＝０，ｊ＝０とし（ステップＳ３０２）、フレーム群からフレーム番号が「２ｊ」，「２ｊ＋２^ｉ」，「２ｊ＋２^{（ｉ＋１）}」の３つのフレームを選択できるか否かを判定する（ステップＳ３０３）。 The reference group generation unit 103a sets i = 0 and j = 0 (step S302), and can select three frames of frame numbers “2j”, “2j + 2 ⁱ ”, and “2j + 2 ^{(i + 1)} ” from the frame group. It is determined (step S303).

参照グループ生成部１０３ａは、３枚のフレームを選択することができると判定した場合（ステップＳ３０３−ＹＥＳ）、参照グループ生成部１０３ａは、３枚のフレームを表示順にＡ，Ｂ，Ｃフレームとして含む参照グループＲｅｆ［０］［ｊ］を生成する。参照グループ生成部１０３ａは、生成した参照グループＲｅｆ［０］［ｊ］を参照グループ記憶部１０８に書き込んで記憶させる（ステップＳ３０５）。なお、当該参照グループにおいて判定対象フレームは、Ｂフレームとなる。 When the reference group generation unit 103a determines that three frames can be selected (YES in step S303), the reference group generation unit 103a includes three frames in the display order as A, B, and C frames. Generate reference group Ref [0] [j]. The reference group generation unit 103a writes and stores the generated reference group Ref [0] [j] in the reference group storage unit 108 (step S305). The determination target frame in the reference group is a B frame.

結合判定部１０４は、参照グループ生成部１０３ａによって参照グループ記憶部１０８に参照グループＲｅｆ［０］［ｊ］の情報が書き込まれると、書き込まれた参照グループＲｅｆ［０］［ｊ］に含まれるＡ，Ｂ，ＣフレームをＳ，Ｍ，Ｅフレームとして図４に示す結合判定処理を行う（ステップＳ３０６）。 When the information on the reference group Ref [0] [j] is written to the reference group storage unit 108 by the reference group generation unit 103a, the connection determination unit 104 causes the A included in the written reference group Ref [0] [j] to The combination determination process shown in FIG. 4 is performed with the B, C, and C frames as S, M, and E frames (step S306).

ピクチャタイプ選定部１０５ａは、結合判定部１０４が出力する判定結果を判定する（ステップＳ３０７）。ピクチャタイプ選定部１０５ａは、判定結果が「Ｆａｌｓｅ」の場合（ステップＳ３０７−Ｆａｌｓｅ）、参照グループＲｅｆ［０］［ｊ］のＢフレームとＣフレームのピクチャタイプをＰピクチャとして選定する。ピクチャタイプ選定部１０５ａは、参照グループＲｅｆ［０］［ｊ］のＢフレームとＣフレームをＰピクチャとして選定したことを示す情報をピクチャタイプ記憶部１０６に書き込んで記憶させる（ステップＳ３０８）。 The picture type selection unit 105a determines the determination result output from the combination determination unit 104 (step S307). When the determination result is “False” (Step S307—False), the picture type selection unit 105a selects the picture types of the B frame and the C frame of the reference group Ref [0] [j] as a P picture. The picture type selection unit 105a writes and stores information indicating that the B and C frames of the reference group Ref [0] [j] are selected as the P picture in the picture type storage unit 106 (step S308).

一方、ピクチャタイプ選定部１０５ａは、判定結果が「Ｔｒｕｅ」の場合（ステップＳ３０７−Ｔｒｕｅ）、参照グループＲｅｆ［０］［ｊ］のＢフレームのピクチャタイプをＡフレームとＣフレームを参照フレームとするＢピクチャとして選定する。ピクチャタイプ選定部１０５ａは、参照グループＲｅｆ［０］［ｊ］のＢフレームをＢピクチャとして選定したことを示す情報、及びＡフレームとＣフレームが参照フレームであることを示す情報をピクチャタイプ記憶部１０６に書き込んで記憶させる。また、ピクチャタイプ選定部１０５ａは、参照グループＲｅｆ［０］［ｊ］のＣフレームのピクチャタイプをＰピクチャとして選定する。ピクチャタイプ選定部１０５ａは、参照グループＲｅｆ［０］［ｊ］のＣフレームをＰピクチャとして選定したことを示す情報をピクチャタイプ記憶部１０６に書き込んで記憶させる（ステップＳ３０９）。 On the other hand, when the determination result is “True” (step S 307 -True), the picture type selection unit 105 a sets the picture types of B frames of the reference group Ref [0] [j] as A frame and C frame. Select as B picture. The picture type selection unit 105 a receives information indicating that the B frame of the reference group Ref [0] [j] has been selected as the B picture, and information indicating that the A frame and the C frame are reference frames. Write to 106 for storage. Also, the picture type selection unit 105a selects the picture type of the C frame of the reference group Ref [0] [j] as a P picture. The picture type selection unit 105a writes and stores information indicating that the C frame of the reference group Ref [0] [j] has been selected as a P picture in the picture type storage unit 106 (step S309).

ステップＳ３０８及びステップＳ３０９の処理の後、ピクチャタイプ選定部１０５ａは、処理を継続する指示情報を参照グループ生成部１０３ａに出力する。参照グループ生成部１０３ａは、ｊの値に「１」を加算し（ステップＳ３１０）、ｊの値がＮ／２未満か否かを判定する（ステップＳ３１１）。参照グループ生成部１０３ａは、ｊの値がＮ／２未満であると判定した場合（ステップＳ３１１−ＹＥＳ）、ステップＳ３０３からの処理を繰り返す。一方、参照グループ生成部１０３ａは、ｊの値がＮ／２未満でないと判定した場合（ステップＳ３１１−ＮＯ）、符号Ａで示される箇所を介して図１３に示す処理を開始する。 After the processes of steps S308 and S309, the picture type selection unit 105a outputs instruction information for continuing the process to the reference group generation unit 103a. The reference group generation unit 103a adds “1” to the value of j (step S310), and determines whether the value of j is less than N / 2 (step S311). When the reference group generation unit 103a determines that the value of j is less than N / 2 (YES in step S311), the processing from step S303 is repeated. On the other hand, when the reference group generation unit 103a determines that the value of j is not less than N / 2 (step S311-NO), the reference group generation unit 103a starts the processing illustrated in FIG.

一方、参照グループ生成部１０３ａは、３枚のフレームを選択することができないと判定した場合（ステップＳ３０３−ＮＯ）、フレーム群の残りのフレームをＰピクチャとして選定させる指示情報をピクチャタイプ選定部１０５ａに出力する。ピクチャタイプ選定部１０５ａは、参照グループ生成部１０３ａから当該指示情報を受けて、残りのフレームのピクチャタイプをＰピクチャとして選定し、残りのフレームをＰピクチャとして選定したことを示す情報をピクチャタイプ記憶部１０６に書き込んで記憶させる（ステップＳ３０４）。参照グループ生成部１０３ａは、当該指示情報をピクチャタイプ選定部１０５ａに出力し、その後、符号Ａで示される箇所を介して図１３に示す処理を開始する。 On the other hand, when the reference group generation unit 103a determines that three frames can not be selected (step S303-NO), the instruction information for selecting the remaining frames of the frame group as P pictures is the picture type selection unit 105a. Output to The picture type selection unit 105a receives the instruction information from the reference group generation unit 103a, selects the picture type of the remaining frame as a P picture, and stores information indicating that the remaining frame is selected as a P picture. The part 106 is written and stored (step S304). The reference group generation unit 103a outputs the instruction information to the picture type selection unit 105a, and thereafter, starts the processing illustrated in FIG. 13 via the portion indicated by the symbol A.

図１３に示すように、参照グループ生成部１０３ａは、ｉ＝１，ｊ＝０とし（ステップＳ３１２）、結合判定部１０４の判定結果記憶部１１２を参照し、参照グループＲｅｆ［ｉ−１］［ｊ］と、Ｒｅｆ［ｉ−１］［ｊ＋１］の判定結果が共に「Ｔｒｕｅ」であるか否かを判定する（ステップＳ３１３）。参照グループ生成部１０３ａは、参照グループＲｅｆ［ｉ−１］［ｊ］と、Ｒｅｆ［ｉ−１］［ｊ＋１］の判定結果が共に「Ｔｒｕｅ」でないと判定した場合（ステップＳ３１３−ＮＯ）、処理をステップＳ３１８に進める。 As shown in FIG. 13, the reference group generation unit 103 a sets i = 1, j = 0 (step S 312), refers to the determination result storage unit 112 of the connection determination unit 104, and selects the reference group Ref [i−1] [ It is determined whether both j] and the determination results of Ref [i-1] [j + 1] are "True" (step S313). When the reference group generation unit 103a determines that the determination results of both the reference group Ref [i-1] [j] and Ref [i-1] [j + 1] are not “True” (step S313-NO), the process is performed. Proceed to step S318.

一方、参照グループ生成部１０３ａは、参照グループＲｅｆ［ｉ−１］［ｊ］と、Ｒｅｆ［ｉ−１］［ｊ＋１］の判定結果が共に「Ｔｒｕｅ」であると判定した場合（ステップＳ３１３−ＹＥＳ）、参照グループＲｅｆ［ｉ−１］［ｊ］の最初と最後のフレームと、参照グループＲｅｆ［ｉ−１］［ｊ＋１］の最後のフレームの３つのフレームをα，β，γフレームとして選択する。参照グループ生成部１０３ａは、選択したα，β，γフレームを含む参照グループＲｅｆ［ｉ］［ｊ／２］を生成し、生成した参照グループＲｅｆ［ｉ］［ｊ／２］を参照グループ記憶部１０８に書き込んで記憶させる（ステップＳ３１４）。 On the other hand, when the reference group generation unit 103a determines that the determination results of both the reference group Ref [i-1] [j] and Ref [i-1] [j + 1] are “True” (step S313-YES) , Select the three frames of the first and last frame of reference group Ref [i-1] [j] and the last frame of reference group Ref [i-1] [j + 1] as α, β, γ frames . The reference group generation unit 103a generates a reference group Ref [i] [j / 2] including the selected α, β, and γ frames, and generates the reference group Ref [i] [j / 2] as a reference group storage unit. It writes in 108 and makes it memorize (Step S314).

結合判定部１０４は、参照グループ生成部１０３ａによって参照グループ記憶部１０８に参照グループＲｅｆ［ｉ］［ｊ／２］の情報が書き込まれると、書き込まれた参照グループＲｅｆ［ｉ］［ｊ／２］に含まれるα，β，γフレームをＳ，Ｍ，Ｅフレームとして図４に示す結合判定処理を行う（ステップＳ３１５）。 When the information on the reference group Ref [i] [j / 2] is written to the reference group storage unit 108 by the reference group generation unit 103a, the connection determination unit 104 writes the written reference group Ref [i] [j / 2]. The combination determination process shown in FIG. 4 is performed with the .alpha., .Beta., And .gamma. Frames included in the frame being S, M, and E frames (step S315).

ピクチャタイプ選定部１０５ａは、結合判定部１０４が出力する判定結果を判定する（ステップＳ３１６）。ピクチャタイプ選定部１０５ａは、判定結果が「Ｆａｌｓｅ」の場合（ステップＳ３１６−Ｆａｌｓｅ）、処理をステップＳ３１８に進める指示情報を参照グループ生成部１０３ａに出力する。 The picture type selection unit 105a determines the determination result output from the combination determination unit 104 (step S316). If the determination result is “False” (step S316-False), the picture type selection unit 105a outputs, to the reference group generation unit 103a, instruction information that causes the process to proceed to step S318.

一方、ピクチャタイプ選定部１０５ａは、判定結果が「Ｔｒｕｅ」の場合（ステップＳ３１６−Ｔｒｕｅ）、参照グループＲｅｆ［ｉ］［ｊ／２］のβフレームのピクチャタイプをαフレームとγフレームを参照フレームとするＢピクチャとして選定する。ピクチャタイプ選定部１０５ａは、参照グループＲｅｆ［ｉ］［ｊ／２］のβフレームをＢピクチャとして選定したことを示す情報、及びαフレームとγフレームが参照フレームであることを示す情報をピクチャタイプ記憶部１０６に書き込んで記憶させる。また、ピクチャタイプ選定部１０５ａは、参照グループＲｅｆ［ｉ］［ｊ／２］のγフレームのピクチャタイプをＰピクチャとして選定する。ピクチャタイプ選定部１０５ａは、参照グループＲｅｆ［ｉ］［ｊ／２］のγフレームをＰピクチャとして選定したことを示す情報をピクチャタイプ記憶部１０６に書き込んで記憶させる。ピクチャタイプ選定部１０５ａは、処理をステップＳ３１８に進める指示情報を参照グループ生成部１０３ａに出力する（ステップＳ３１７）。 On the other hand, when the determination result is "True" (step S316-True), the picture type selection unit 105a refers to the .alpha. Frame and the .gamma. Frame as the picture type of the .beta. Frame of the reference group Ref [i] [j / 2]. It is selected as a B picture. The picture type selection unit 105a uses the information indicating that the β frame of the reference group Ref [i] [j / 2] has been selected as the B picture, and the information indicating that the α frame and the γ frame are reference frames. The information is written in the storage unit 106 and stored. Also, the picture type selection unit 105a selects the picture type of the γ frame of the reference group Ref [i] [j / 2] as a P picture. The picture type selection unit 105 a writes and stores information indicating that the γ frame of the reference group Ref [i] [j / 2] is selected as the P picture in the picture type storage unit 106. The picture type selection unit 105a outputs instruction information for advancing the process to step S318 to the reference group generation unit 103a (step S317).

参照グループ生成部１０３ａは、ｊの値に「２」を加算し（ステップＳ３１８）、ｊの値がＮ／２未満であるか否かを判定する（ステップＳ３１９）。参照グループ生成部１０３ａは、ｊの値がＮ／２未満であると判定した場合（ステップＳ３１９−ＹＥＳ）、ステップＳ３１３からの処理を繰り返す。 The reference group generation unit 103a adds “2” to the value of j (step S318), and determines whether the value of j is less than N / 2 (step S319). If the reference group generation unit 103a determines that the value of j is less than N / 2 (YES in step S319), the processing from step S313 is repeated.

一方、参照グループ生成部１０３ａは、ｊの値がＮ／２未満でないと判定した場合（ステップＳ３１９−ＮＯ）、ｉの値に「１」を加算し、ｊ＝０とする（ステップＳ３２０）。
参照グループ生成部１０３ａは、ｉの値が、ｄｅｐｔｈ＿ｍａｘ（ここでは、ｄｅｐｔｈ＿ｍａｘ＝３）未満であるか否かを判定する（ステップＳ３２１）。参照グループ生成部１０３ａは、ｉの値が、ｄｅｐｔｈ＿ｍａｘ未満であると判定した場合（ステップＳ３２１−ＹＥＳ）、ステップＳ３１３からの処理を繰り返す。一方、参照グループ生成部１０３ａは、ｉの値が、ｄｅｐｔｈ＿ｍａｘ未満でないと判定した場合（ステップＳ３２１−ＮＯ）、ピクチャタイプ選定部１０５ａに、Ｐピクチャのフレームに対する参照フレームを選択させる指示情報を出力する。 On the other hand, when it is determined that the value of j is not less than N / 2 (step S319-NO), the reference group generation unit 103a adds “1” to the value of i and sets j = 0 (step S320).
The reference group generation unit 103a determines whether the value of i is less than depth_max (here, depth_max = 3) (step S321). If the reference group generation unit 103a determines that the value of i is less than depth_max (YES in step S321), the process from step S313 is repeated. On the other hand, when it is determined that the value of i is not less than depth_max (step S 321 -NO), the reference group generation unit 103 a outputs, to the picture type selection unit 105 a, instruction information for selecting the reference frame for the P picture frame. .

ピクチャタイプ選定部１０５ａは、当該指示情報を受けて、Ｐピクチャとして選定したフレームの各々に対して、当該フレームに表示順で最も近い過去のＰピクチャのフレームを当該フレームの参照フレームとして選定する。表示順で最も近い過去のＰピクチャがない場合、Ｉピクチャのフレームを当該フレームの参照フレームとして選定する。ピクチャタイプ選定部１０５ａは、各Ｐピクチャに対して選定した参照フレームの情報をピクチャタイプ記憶部１０６に書き込んで記憶させる（ステップＳ３２２）。 In response to the instruction information, the picture type selection unit 105a selects, for each of the frames selected as a P picture, a past P picture frame closest to the frame in display order as a reference frame of the frame. When there is no closest past P picture in display order, a frame of I picture is selected as a reference frame of the frame. The picture type selection unit 105a writes and stores information on the reference frame selected for each P picture in the picture type storage unit 106 (step S322).

これにより、ピクチャタイプ記憶部１０６に、最終的に構築された符号化構造の情報が記憶されることになる。ブロック出力部１０７は、内部の記憶領域に一時的に符号化対象フレーム選択部１０１から受けたフレーム群を記憶しており、ピクチャタイプ記憶部１０６に記憶されている符号化構造に基づく順番で内部の記憶領域からフレームを読み出す。ブロック出力部１０７は、読み出したフレームの各々をラスタースキャンの順にＣＵブロックに分解して符号化対象ブロックとして出力し、符号化構造構築部１０ａは、１つのフレーム群についての処理を終了する。 As a result, the picture type storage unit 106 stores the information of the coding structure finally constructed. The block output unit 107 temporarily stores the frame group received from the encoding target frame selection unit 101 in an internal storage area, and internally stores the frames in the order based on the encoding structure stored in the picture type storage unit 106. Read the frame from the storage area of The block output unit 107 decomposes each of the read frames into CU blocks in raster scan order and outputs the CU block as an encoding target block, and the coding structure construction unit 10a ends the processing for one frame group.

（第２の実施形態の符号化構造構築処理の具体例）
次に、図１４から図１６を参照しつつ、図１４（ａ）に示すフレーム群に、図１２及び図１３に示した第２の実施形態の符号化構造構築処理を適用した場合の例について説明する。符号化対象フレーム選択部１０１が、入力映像情報から図１４（ａ）に示すフレーム群を生成する。当該フレーム群は、フレーム番号４とフレーム番号５のフレームにおいてシーンチェンジが含まれている。 (Specific Example of Encoding Structure Construction Process of Second Embodiment)
Next, referring to FIG. 14 to FIG. 16, an example in the case of applying the coding structure construction process of the second embodiment shown in FIG. 12 and FIG. 13 to the frame group shown in FIG. explain. The encoding target frame selection unit 101 generates a frame group shown in FIG. 14A from the input video information. The frame group includes a scene change in the frames of frame number 4 and frame number 5.

図１４（ｂ）に示すように、Ｉピクチャ選定部１０２ａによってフレーム番号０のフレームがＩピクチャとして選定された後、参照グループ生成部１０３ａによって、フレーム番号（０，１，２）のフレームを含む参照グループＲｅｆ［０］［０］が参照グループ記憶部１０８に書き込まれる（ステップＳ３０３，Ｓ３０５）。結合判定部１０４は、参照グループ記憶部１０８に書き込まれた参照グループＲｅｆ［０］［０］に対して図４に示す結合判定処理を行い、ここでは、シーンチェンジがないため、結合判定部１０４の評価値判定部１１１が「Ｔｒｕｅ」を出力したとする（ステップＳ３１５）。 As shown in FIG. 14B, after the frame of frame number 0 is selected as an I picture by the I picture selection unit 102a, the reference group generation unit 103a includes the frame of frame number (0, 1, 2). The reference group Ref [0] [0] is written to the reference group storage unit 108 (steps S303 and S305). The connection determination unit 104 performs the connection determination process shown in FIG. 4 on the reference group Ref [0] [0] written in the reference group storage unit 108. Here, since there is no scene change, the connection determination unit 104. It is assumed that the evaluation value determination unit 111 of “No” outputs “True” (step S315).

ピクチャタイプ選定部１０５ａは、結合判定部１０４が出力する判定結果が、「Ｔｒｕｅ」であるため、フレーム番号１のフレームのピクチャタイプをフレーム番号０とフレーム番号２のフレームを参照フレームとするＢピクチャとして選定する。ピクチャタイプ選定部１０５は、フレーム番号２のフレームをＢピクチャとして選定したことを示す情報、及びフレーム番号０とフレーム番号２のフレームが参照フレームであることを示す情報をピクチャタイプ記憶部１０６に書き込んで記憶させる。 Since the determination result output from the connection determination unit 104 is “True”, the picture type selection unit 105 a uses the picture type of the frame of frame number 1 as the frame of frame number 0 and the frame of frame number 2 as the reference frame To be selected. The picture type selection unit 105 writes in the picture type storage unit 106 information indicating that the frame of frame number 2 has been selected as a B picture, and information indicating that the frames of frame number 0 and frame number 2 are reference frames. Remember.

また、ピクチャタイプ選定部１０５ａは、フレーム番号２のフレームのピクチャタイプをＰピクチャとして選定し、フレーム番号２のフレームをＰピクチャとして選定したことを示す情報をピクチャタイプ記憶部１０６に書き込んで記憶させる（ステップＳ３０９）。 Further, the picture type selection unit 105a selects the picture type of the frame of frame number 2 as a P picture and writes information indicating that the frame of frame number 2 is selected as a P picture into the picture type storage unit 106 and stores the information. (Step S309).

参照グループ生成部１０３ａは、ｊの値がＮ／２未満の間、ｊに１を加算し、フレーム番号（２，３，４）のフレームを含む参照グループＲｅｆ［０］［１］、フレーム番号（４，５，６）のフレームを含む参照グループＲｅｆ［０］［２］、フレーム番号（６，７，８）のフレームを含む参照グループＲｅｆ［０］［３］、フレーム番号（８，９，１０）のフレームを含む参照グループＲｅｆ［０］［４］を参照グループ記憶部１０８に書き込んで記憶させていく。結合判定部１０４は、新たな参照グループが参照グループ記憶部１０８に書き込まれるごとに、図４に示す結合判定処理を行う。その結果、図１４（ｂ）に示すように、シーンチェンジを含む参照グループＲｅｆ［０］［２］については、判定結果として「Ｆａｌｓｅ」を出力し、参照グループＲｅｆ［０］［２］以外については「Ｔｒｕｅ」を出力する。 The reference group generation unit 103a adds 1 to j while the value of j is less than N / 2, and the reference group Ref [0] [1] including the frame of the frame number (2, 3, 4), the frame number Reference group Ref [0] [2] including frames of (4, 5, 6), reference group Ref [0] [3] including frames of frame numbers (6, 7, 8), frame numbers (8, 9 , 10) are written in the reference group storage unit 108 and stored therein. The connection determination unit 104 performs the connection determination process illustrated in FIG. 4 each time a new reference group is written to the reference group storage unit 108. As a result, as shown in FIG. 14B, for the reference group Ref [0] [2] including the scene change, “False” is output as the determination result, and for all other than the reference group Ref [0] [2] Will output "True".

その結果、ピクチャタイプ選定部１０５ａは、フレーム番号１〜１０のフレームについて図１４（ｂ）に示すようなピクチャタイプを選定する。すなわち、結合判定部１０４によって、「Ｔｒｕｅ」と判定された参照グループＲｅｆ［０］［０］，Ｒｅｆ［０］［１］，Ｒｅｆ［０］［３］、Ｒｅｆ［０］［４］の各々の判定対象フレームのＢフレームは、各々のＡとＣのフレームを参照するＢピクチャとして選定され、Ｃフレームは、Ｐピクチャとして選定される。これに対して、結合判定部１０４によって、「Ｆａｌｓｅ」と判定された参照グループＲｅｆ［０］［２］のフレームは、ＢフレームとＣフレームは、Ｐピクチャとして選定される。 As a result, the picture type selection unit 105a selects a picture type as shown in FIG. 14B for the frames of frame numbers 1 to 10. That is, each of the reference groups Ref [0] [0], Ref [0] [1], Ref [0] [3], and Ref [0] [4] determined to be “True” by the connection determination unit 104. The B frame of the determination target frame is selected as a B picture referring to each of the A and C frames, and the C frame is selected as a P picture. On the other hand, the frames of the reference group Ref [0] [2] determined to be “False” by the combining determination unit 104, the B frame and the C frame are selected as P pictures.

参照グループ生成部１０３ａが、図１３に示すように、ｉ＝１，ｊ＝０として（ステップＳ３１２）、最初に、結合判定部１０４の判定結果記憶部１１２に記憶されている参照グループＲｅｆ［０］［０］と、参照グループＲｅｆ［０］［１］の判定結果を参照する。参照グループＲｅｆ［０］［０］と、参照グループＲｅｆ［０］［１］の判定結果は、共に「Ｔｒｕｅ」であるため（ステップＳ３１３−ＹＥＳ）、参照グループ生成部１０３ａは、参照グループＲｅｆ［０］［０］の最初と最後のフレーム、及び参照グループＲｅｆ［０］［１］の最後のフレーム、すなわちフレーム番号０，２，４のフレームをα、β，γフレームとして含む参照グループＲｅｆ［１］［０］を生成する。 As shown in FIG. 13, the reference group generation unit 103a sets i = 1, j = 0 (step S312), and firstly, the reference group Ref [0 stored in the determination result storage unit 112 of the connection determination unit 104. Reference is made to the judgment results of reference group Ref [0] [1] and reference group Ref [0] [1]. Since the determination results of both the reference group Ref [0] [0] and the reference group Ref [0] [1] are “True” (YES in step S313), the reference group generation unit 103a selects the reference group Ref [0]. Reference group Ref [0] including the first and last frames of [0] and the last frame of reference group Ref [0] [1], that is, frames of frame numbers 0, 2 and 4 as α, β, and γ frames. 1] Generate [0].

図１５（ｃ）に示すように、参照グループ生成部１０３ａは、生成した参照グループＲｅｆ［１］［０］を参照グループ記憶部１０８に書き込んで記憶させる（ステップＳ３１４）。結合判定部１０４は、参照グループ記憶部１０８に書き込まれた参照グループＲｅｆ［１］［０］に対して図４に示す結合判定処理を行い、ここでは、シーンチェンジがないため、評価値判定部１１１は、「Ｔｒｕｅ」を出力したとする（ステップＳ３１５）。ピクチャタイプ選定部１０５ａは、判定結果が「Ｔｒｕｅ」であるため（ステップＳ３１６−Ｔｒｕｅ）、参照グループＲｅｆ［１］［０］の判定対象フレームであるフレーム番号２のフレームのピクチャタイプをフレーム番号０とフレーム番号４のフレームを参照フレームとするＢピクチャとして選定する。 As shown in FIG. 15C, the reference group generation unit 103a writes and stores the generated reference group Ref [1] [0] in the reference group storage unit 108 (step S314). The connection determination unit 104 performs the connection determination process shown in FIG. 4 on the reference group Ref [1] [0] written in the reference group storage unit 108. Here, since there is no scene change, the evaluation value determination unit It is assumed that 111 outputs "True" (step S315). Since the determination result is “True” (step S316-True), the picture type selection unit 105a sets the picture type of the frame of frame number 2 which is the determination target frame of the reference group Ref [1] [0] to frame number 0 And a frame of frame number 4 is selected as a B picture as a reference frame.

ピクチャタイプ記憶部１０６には、既に、フレーム番号２のフレームについて、Ｐピクチャであることを示す情報が書き込まれている。そのため、ピクチャタイプ選定部１０５ａは、フレーム番号２のフレームをＢピクチャとして選定したことを示す情報、及びフレーム番号０とフレーム番号４のフレームが参照フレームであることを示す情報をピクチャタイプ記憶部１０６に上書きして記憶させる。 In the picture type storage unit 106, information indicating that the frame of frame number 2 is a P picture has already been written. Therefore, the picture type selection unit 105 a receives information indicating that the frame of frame number 2 has been selected as a B picture, and information indicating that the frames of frame numbers 0 and 4 are reference frames. Overwrite and store them.

また、ピクチャタイプ選定部１０５ａは、γフレームであるフレーム番号４のフレームのピクチャタイプをＰピクチャとして選定する。ピクチャタイプ記憶部１０６には、既に、フレーム番号４のフレームについて、Ｐピクチャであることを示す情報が書き込まれているが、ピクチャタイプ選定部１０５ａは、フレーム番号４のフレームをＰピクチャとして選定したことを示す情報をピクチャタイプ記憶部１０６に上書きして記憶させる（ステップＳ３１７）。 Further, the picture type selection unit 105a selects the picture type of the frame of frame number 4 that is a γ frame as a P picture. Although information indicating that the picture of frame number 4 is a P picture has already been written in the picture type storage unit 106, the picture type selection unit 105a has selected the frame of frame number 4 as a P picture. The information indicating that is written over the picture type storage unit 106 and stored (step S317).

参照グループ生成部１０３ａは、ｊの値に２を加算してｊ＝２とし（ステップＳ３１８）、ｊの値がＮ／２（Ｎ＝１０）未満であるため（ステップＳ３１９−ＹＥＳ）、結合判定部１０４の判定結果記憶部１１２に記憶されている参照グループＲｅｆ［０］［２］と、参照グループＲｅｆ［０］［３］の判定結果を参照する。参照グループＲｅｆ［０］［２］の判定結果は「Ｆａｌｓｅ」であるため、参照グループ生成部１０３ａは、参照グループＲｅｆ［０］［２］と、参照グループＲｅｆ［０］［３］の判定結果が共に「Ｔｒｕｅ」でないと判定する（ステップＳ３１３−ＮＯ）。したがって、図１５（ｃ）に、説明の便宜上示している参照グループＲｅｆ［１］［１］は、参照グループ生成部１０３ａによって生成されないことになる。 The reference group generation unit 103a adds 2 to the value of j to set j = 2 (step S318), and since the value of j is less than N / 2 (N = 10) (step S319-YES), the connection determination is performed. The determination results of the reference group Ref [0] [2] and the reference group Ref [0] [3] stored in the determination result storage unit 112 of the unit 104 are referred to. Since the determination result of the reference group Ref [0] [2] is “False”, the reference group generation unit 103a determines the determination results of the reference group Ref [0] [2] and the reference group Ref [0] [3]. It is determined that both are not "True" (step S313-NO). Therefore, the reference group Ref [1] [1] shown for convenience of explanation in FIG. 15C is not generated by the reference group generation unit 103a.

参照グループ生成部１０３ａは、ｊの値に２を加算してｊ＝４とし（ステップＳ３１８）、ｊの値がＮ／２（Ｎ＝１０）未満であるため（ステップＳ３１９−ＹＥＳ）、参照グループＲｅｆ［０］［４］と、参照グループＲｅｆ［０］［５］の判定結果を参照しようとする。しかし、参照グループＲｅｆ［０］［５］が存在しないため、参照グループ生成部１０３ａは、参照グループＲｅｆ［０］［４］と、参照グループＲｅｆ［０］［５］の判定結果が、共に「Ｔｒｕｅ」でないと判定する（ステップＳ３１３−ＮＯ）。参照グループ生成部１０３ａは、ｊの値に２を加算してｊ＝６とし（ステップＳ３１８）、ｊの値がＮ／２（Ｎ＝１０）未満ではなくなるため（ステップＳ３１９−ＮＯ）、ｉの値に「１」を加算してｉ＝１とし、ｊ＝０とする（ステップＳ３２０）。 The reference group generation unit 103a adds 2 to the value of j to set j = 4 (step S318), and since the value of j is less than N / 2 (N = 10) (step S319-YES), the reference group An attempt is made to refer to the determination results of Ref [0] [4] and the reference group Ref [0] [5]. However, since the reference group Ref [0] [5] does not exist, the reference group generation unit 103a determines that the determination results of both the reference group Ref [0] [4] and the reference group Ref [0] [5] It is determined that it is not "True" (step S313-NO). The reference group generation unit 103a adds 2 to the value of j to set j = 6 (step S318), and the value of j is not less than N / 2 (N = 10) (step S319-NO). “1” is added to the value to set i = 1 and j = 0 (step S320).

ｉの値は、ｄｅｐｔｈ＿ｍａｘ（ｄｅｐｔｈ＿ｍａｘ＝３）未満であるため（ステップＳ３２１−ＹＥＳ）、参照グループ生成部１０３ａは、結合判定部１０４の判定結果記憶部１１２に記憶されている参照グループＲｅｆ［１］［０］と、参照グループＲｅｆ［１］［１］の判定結果を参照しようとする。しかし、前述の通り、参照グループＲｅｆ［１］［１］は生成されていないため、参照グループ生成部１０３ａは、参照グループＲｅｆ［０］［４］と、参照グループＲｅｆ［０］［５］の判定結果が、共に「Ｔｒｕｅ」でないと判定する（ステップＳ３１３−ＮＯ）。したがって、図１５（ｄ）に、説明の便宜上示している参照グループＲｅｆ［２］［０］は、参照グループ生成部１０３ａによって生成されないことになる。 Since the value of i is less than depth_max (depth_max = 3) (step S 321 -YES), the reference group generation unit 103 a uses the reference group Ref [1] stored in the determination result storage unit 112 of the connection determination unit 104. It is attempted to refer to the determination results of [0] and the reference group Ref [1] [1]. However, as described above, since the reference group Ref [1] [1] is not generated, the reference group generation unit 103a is configured to compare the reference group Ref [0] [4] and the reference group Ref [0] [5]. It is determined that the determination results are not both "True" (step S313-NO). Therefore, the reference group Ref [2] [0] shown for convenience of description in FIG. 15D is not generated by the reference group generation unit 103a.

その後、ステップＳ３１８及びＳ３１９の処理が、ｊがＮ／２未満でなくなるまで繰り返され、更にステップＳ３２０においてｉの値に「１」が加えられｉ＝２，ｊ＝０となる。ｉの値は、ｄｅｐｔｈ＿ｍａｘ未満であるため（ステップＳ３２１−ＹＥＳ）、処理が継続されるが、前述の通り、図１５（ｄ）に示す参照グループＲｅｆ［２］［０］は、参照グループ生成部１０３ａによって生成されていないため、ステップＳ３１３の判定は、「ＮＯ」となる。再び、ステップＳ３１８及びＳ３１９の処理が、ｊが、Ｎ／２未満でなくなるまで繰り替えされ、ステップＳ３２０においてｉの値に「１」が加えられて、ｉ＝３，ｊ＝０となる。参照グループ生成部１０３ａは、ｉの値が、ｄｅｐｔｈ＿ｍａｘ未満でなくなるため（ステップＳ３２１−ＮＯ）、ピクチャタイプ選定部１０５ａに、Ｐピクチャのフレームに対する参照フレームを選択させる指示情報を出力する。 Thereafter, the processes of steps S318 and S319 are repeated until j is not less than N / 2, and further, "1" is added to the value of i in step S320, and i = 2, j = 0. Since the value of i is less than depth_max (step S 321 -YES), the process is continued, but as described above, the reference group Ref [2] [0] shown in FIG. Since it is not generated by 103a, the determination in step S313 is "NO". Again, the processing in steps S318 and S319 is repeated until j is not less than N / 2, and “1” is added to the value of i in step S320, i = 3, j = 0. Since the value of i is not less than depth_max (NO in step S 321), the reference group generation unit 103 a outputs, to the picture type selection unit 105 a, instruction information for selecting the reference frame for the P picture frame.

ピクチャタイプ選定部１０５ａは、当該指示情報を受けて、Ｐピクチャとして選定したフレームの各々に対して、当該フレームに表示順で最も近い過去のＰピクチャのフレームを当該フレームの参照フレームとして選定する。表示順で最も近い過去のＰピクチャがない場合、Ｉピクチャのフレームを当該フレームの参照フレームとして選定する。ピクチャタイプ選定部１０５ａは、各Ｐピクチャに対して選定した参照フレームの情報をピクチャタイプ記憶部１０６に書き込んで記憶させる（ステップＳ３２２）。これにより、ピクチャタイプ記憶部１０６に、図１６（ｅ）に示す、２階層のＢピクチャの階層構造を有する符号化構造が記憶されることになる。 In response to the instruction information, the picture type selection unit 105a selects, for each of the frames selected as a P picture, a past P picture frame closest to the frame in display order as a reference frame of the frame. When there is no closest past P picture in display order, a frame of I picture is selected as a reference frame of the frame. The picture type selection unit 105a writes and stores information on the reference frame selected for each P picture in the picture type storage unit 106 (step S322). As a result, the picture type storage unit 106 stores the coding structure having a hierarchical structure of B pictures of two layers, as shown in FIG.

上記の第２の実施形態の構成により、符号化構造構築部１０ａにおいて、符号化対象フレーム選択部１０１が、入力映像情報から予め定められるＧＯＰのサイズにしたがう枚数のフレームを選択してフレーム群を生成する。Ｉピクチャ選定部１０２ａは、フレーム群の最初のフレームをＩピクチャとして選定する。参照グループ生成部１０３ａは、ｉ＝０、ｊ＝０とし、フレーム群からフレーム番号が「２ｊ」，「２ｊ＋２^ｉ」，「２ｊ＋２^{（ｉ＋１）}」のフレームをｊに「１」を加算しつつ繰り返し選択し、選択した３枚のフレームをＡ，Ｂ，Ｃフレームとして含む参照グループＲｅｆ［０］［ｊ］を生成する。結合判定部１０４の評価値算出部１１０は、参照グループに含まれるＡ，Ｂ，Ｃフレームに基づいて、Ｌ０コスト、Ｌ１コスト、Ｂｉコストという３つの評価値を算出する。評価値判定部１１１は、ＢｉコストとＬ１コストのうち値の小さい方を選択し、Ｌ０コストの値を基準とした相対値と、予め定められる相対評価用閾値Ｔｈ＿ｍｉｎ，Ｔｈ＿ｍａｘとに基づいて、判定対象フレームであるＢフレームと、ＡとＣフレームの参照関係を結合するか否かを判定する。また、評価値判定部１１１は、相対値が、Ｔｈ＿ｍｉｎの値以上であって、かつＴｈ＿ｍａｘの値以下の範囲に含まれる場合、更に、Ｂｉコストの値と、絶対評価用閾値Ｔｈ＿ａｂｓとに基づいて、判定対象フレームであるＢフレームと、ＡとＣフレームの参照関係を結合するか否かを判定する。ピクチャタイプ選定部１０５ａは、評価値判定部１１１が、Ｂフレームと、ＡとＣフレームの参照関係を結合すると判定した場合、Ｂフレームを、ＡとＣフレームを参照フレームとするＢピクチャとして選定する。すなわち、符号化構造構築部１０ａでは、実際の符号化を行うような演算を行っておらず、３つのフレームに基づいて３つの評価値を算出し、算出した３つの評価値に基づいて、３つのフレームの参照関係を結合するか否かを判定している。このため、第２の実施形態の符号化構造構築部１０ａでは、符号化効率の高い符号化構造を低演算量で求めることが可能となる。 With the configuration of the second embodiment described above, in the coding structure construction unit 10a, the coding target frame selection unit 101 selects a number of frames according to the size of the GOP determined in advance from the input video information, and sets the frame group. Generate The I picture selection unit 102a selects the first frame of the frame group as an I picture. The reference group generation unit 103a sets i = 0 and j = 0, and repeatedly adds “1” to j from the frame group with the frame numbers “2j”, “2j + 2 ⁱ ”, and “2j + 2 ^{(i + 1)} ”. A reference group Ref [0] [j] is generated which includes the selected three frames as A, B and C frames. The evaluation value calculation unit 110 of the combination determination unit 104 calculates three evaluation values of L0 cost, L1 cost, and Bi cost based on the A, B, and C frames included in the reference group. The evaluation value determination unit 111 selects the smaller one of the Bi cost and the L1 cost, and makes a determination based on the relative value based on the value of the L0 cost and the predetermined relative evaluation threshold values Th_min and Th_max. It is determined whether or not the reference relationship between the B frame, which is the target frame, and the A and C frames is combined. In addition, when the relative value is within the range not less than the value of Th_min and not more than the value of Th_max, the evaluation value determining unit 111 further determines the value of Bi cost based on the absolute evaluation threshold Th_abs. It is determined whether or not the reference relationship between the B frame, which is the determination target frame, and the A and C frames is combined. When the evaluation value determination unit 111 determines that the reference relationship between the B frame and the A and C frames is combined, the picture type selection unit 105a selects the B frame as a B picture having the A and C frames as reference frames. . That is, the coding structure construction unit 10a does not carry out an operation to perform actual coding, but calculates three evaluation values based on three frames, and calculates 3 based on the calculated 3 evaluation values. It is determined whether or not reference relationships of two frames are to be combined. For this reason, in the coding structure construction unit 10a of the second embodiment, it is possible to obtain a coding structure with high coding efficiency with a small amount of calculation.

また、参照グループ生成部１０３ａは、評価値判定部１１１が、判定対象フレームと最初と最後のフレームの参照関係を結合すると判定した隣接する２つの参照グループを結合して重複するフレームを判定対象フレームとする新たな参照グループを生成することを繰り返す。結合判定部１０４は、参照グループ生成部１０３ａが、新たに生成した参照グループのα，β，γフレームに基づいて、結合判定処理を行う。これにより、第２の実施形態の符号化構造構築部１０ａでは、２階層以上の深い階層の符号化構造を構築することが可能となっており、符号化効率を向上させることが可能となっている。 Further, the reference group generation unit 103a combines two adjacent reference groups determined by the evaluation value determination unit 111 to combine the reference relationship between the determination target frame and the first and last frames, and determines an overlapping frame as a determination target frame. Repeat creating a new reference group. In the connection determination unit 104, the reference group generation unit 103a performs connection determination processing based on the newly generated α, β, and γ frames of the reference group. As a result, in the coding structure construction unit 10a of the second embodiment, it is possible to construct a coding structure of two or more deep layers, which makes it possible to improve the coding efficiency. There is.

また、第１及び第２の実施形態の符号化構造構築部１０，１０ａでは、評価値として、フレーム間において動き探索を行った場合の絶対誤差和の最小値を算出しており、動きベクトルのみではないため、複雑な入力映像に対しても精度よく結合判定を行うことが可能となっている。このため、第１及び第２の実施形態の符号化構造構築部１０，１０ａは、符号化効率の高い符号化構造を低演算量で求めることを可能としている。 In addition, in the coding structure construction units 10 and 10a according to the first and second embodiments, the minimum value of the absolute error sum in the case of performing the motion search between the frames is calculated as the evaluation value. Since it is not, it is possible to perform the joint determination with high accuracy even for a complex input video. Therefore, the coding structure construction units 10 and 10a according to the first and second embodiments can obtain a coding structure with high coding efficiency with a small amount of calculation.

第１及び第２の実施形態の符号化構造構築部１０，１０ａでは、図４を参照して説明したように、ステップＳ２０７の絶対評価用閾値を用いたＢｉコストの絶対評価を行う前に、ステップＳ２０４において、Ｌ０コストを基準とした相対評価を行っている。このため、ロバストな判定を実現することができている。これに対して、非特許文献１に記載の技術では、評価値として、符号化コストの最小値のみを用いた絶対評価を行うようになっている。そのため、非特許文献１に記載の技術の手法を、図９（ｉｉ）及び（ｉｉｉ）に示すフレームに適用した場合、インターコストが十分に大きいと仮定すると、どちらも同じ「１５０」の値が算出されることになる。したがって、非特許文献１に記載の技術では、第１及び第２の実施形態の評価値判定部１１１のように、図９（ｉｉ）の方を結合しないとし、図９（ｉｉｉ）の方を結合するとする異なる判定を行うことができない。 In the coding structure construction units 10 and 10a according to the first and second embodiments, as described with reference to FIG. 4, before performing the absolute evaluation of the Bi cost using the threshold for absolute evaluation in step S207, In step S204, relative evaluation is performed based on the L0 cost. Therefore, robust determination can be realized. On the other hand, in the technique described in Non-Patent Document 1, absolute evaluation is performed using only the minimum value of the coding cost as the evaluation value. Therefore, when the method of the technique described in Non-Patent Document 1 is applied to the frames shown in FIGS. 9 (ii) and (iii), assuming that the inter cost is sufficiently large, the same “150” value is obtained in both cases. It will be calculated. Therefore, in the technique described in Non-Patent Document 1, as in the evaluation value determination unit 111 of the first and second embodiments, it is assumed that the method of FIG. 9 (ii) is not combined, and the method of FIG. You can not make different decisions to combine.

図９（ｉｉ）及び（ｉｉｉ）の双方において結合するとした場合、図９（ｉｉｉ）では、Ｍフレームにおいて、同シーンであるＥフレームを参照するメリットがあるため結合することが有効と考えられる。これに対して、図９（ｉｉ）では、Ｍフレームと、Ｅフレームとは異なるシーンとなるため、誤った判定となる可能性が高い。結合しないとした場合、逆に、図９（ｉｉｉ）が誤った判定となる可能性が高くなる。上述したように、第１及び第２の実施形態における評価値判定部１１１では、図９（ｉｉ）の方を結合しないとし、図９（ｉｉｉ）の方を結合するとする異なる判定を行うことができるため、どちらのケースに対しても符号化効率を向上させることができる。これは、Ｍフレームに着目した際に、Ｅフレームとの相関が高い、すなわちＬ０コストよりも、Ｌ１コストまたはＢｉコストが十分小さい場合に結合するメリットが得られるということを反映した評価となっているためである。 When combining is performed in both of FIG. 9 (ii) and (iii), in FIG. 9 (iii), since there is a merit in referring to the E frame which is the same scene in M frame, combining is considered effective. On the other hand, in FIG. 9 (ii), since the M frame and the E frame are different scenes, there is a high possibility of an erroneous determination. On the contrary, in the case of not combining, there is a high possibility that the determination in FIG. As described above, in the evaluation value determination unit 111 in the first and second embodiments, it is assumed that the method of FIG. 9 (ii) is not combined and the determination of combining the method of FIG. 9 (iii) is performed. As it can, coding efficiency can be improved in either case. This is an evaluation that reflects the merit of combining when the correlation with the E frame is high, that is, when the L1 cost or the Bi cost is sufficiently smaller than the L0 cost, when focusing on the M frame. It is because

また、上記の第１及び第２の実施形態において、結合判定部１０４は、参照グループ生成部１０３，１０３ａが生成する参照グループの判定対象フレームと、最初と最後のフレームの３枚のフレームを単位として結合判定処理を行うことを前提としているが、本発明の構成は、当該実施の形態に限られない。例えば、非特許文献１に記載の技術に対して適用するようにしてもよい。上述したように、非特許文献１に記載の技術では、基準フレームと、選択したＰフレームとの間に、これらを参照する中間フレームを定義する。基準フレームと中間フレームとの間に位置するフレームに対して、これらのフレームを基準フレームと中間フレームとを参照するＢピクチャとした場合のコストを算出している。また、中間フレームとＰピクチャとの間に位置するフレームに対して、これらのフレームを基準フレームと中間フレームとを参照するＢピクチャとした場合のコストを算出している。そして、算出したコストが最小となる組み合わせを最終的な符号化構造として選択している。これらのコストに替えて、非特許文献１に技術に結合判定部１０４を適用し、結合判定部１０４が算出する３つの評価値を用いて符号化構造を構築するようにしてもよい。 In the first and second embodiments described above, the connection determination unit 104 sets three frames, which are the determination target frames of the reference group generated by the reference group generation unit 103 and 103a, and the first and last frames. Although it is premised that the connection determination processing is performed, the configuration of the present invention is not limited to the embodiment. For example, the present invention may be applied to the technology described in Non-Patent Document 1. As described above, in the technique described in Non-Patent Document 1, intermediate frames that reference these are defined between a reference frame and a selected P frame. For the frame located between the reference frame and the intermediate frame, the cost is calculated in the case where these frames are B pictures that reference the reference frame and the intermediate frame. Further, with respect to a frame located between the intermediate frame and the P picture, the cost is calculated in the case where these frames are B pictures that reference the reference frame and the intermediate frame. Then, a combination that minimizes the calculated cost is selected as the final coding structure. Instead of these costs, the joint determination unit 104 may be applied to the technique in Non-Patent Document 1, and the coding structure may be constructed using the three evaluation values calculated by the joint determination unit 104.

なお、上記の第１及び第２の実施形態では、結合判定の結果にしたがって、ＢフレームをＰピクチャとして選定したり、ＣフレームをＰピクチャとして選定したり、γフレームをＰピクチャとして選定したり、また、３つのフレームを選択することができない場合に、残りのフレームをＰピクチャとして選定したりしているが、本発明の構成は、当該実施の形態に限られない。例えば、Ｉピクチャから予め定められる間隔で、事前にＰピクチャとするフレームを選定しておいてもよいし、結合判定の結果、Ｂピクチャとするフレームを全て選定した後、ＩピクチャとＢピクチャのフレーム以外のフレームをＰピクチャとするようにしてもよい。 In the first and second embodiments described above, a B frame is selected as a P picture, a C frame is selected as a P picture, and a γ frame is selected as a P picture according to the result of connection determination. Also, when three frames can not be selected, the remaining frames are selected as P pictures, but the configuration of the present invention is not limited to this embodiment. For example, a frame to be a P picture may be selected in advance at an interval determined in advance from the I picture, or after all frames to be a B picture have been selected as a result of the combination determination, Frames other than the frame may be P pictures.

また、上記の第１及び第２の実施形態では、３枚の連続するフレームを参照グループとして生成するようにしているが、３枚以上であってもよい。 In the first and second embodiments described above, three consecutive frames are generated as a reference group, but three or more frames may be generated.

また、上記の第１及び第２の実施形態では、隣接する参照グループにおいて重複するフレームが１枚になるように、隣接する参照グループを生成するようにしているが、１枚以上のフレームが重複するように隣接する参照グループを生成してもよい。複数枚のフレームが重複する場合、重複しているフレームのいずれかから判定対象フレームが選択されることになる。 In the first and second embodiments described above, adjacent reference groups are generated so that one overlapping frame is included in adjacent reference groups, but one or more frames overlap. Adjacent reference groups may be generated. When a plurality of frames overlap, the determination target frame is selected from any of the overlapping frames.

また、上記の第１及び第２の実施形態において参照グループ生成部１０３，１０３ａは、隣接する参照グループの判定結果に基づいて、新たな参照グループを生成して２階層以上の符号化構造を構築するようになっているが、１階層のみを構築する処理を行うようにしてもよい。 Also, in the first and second embodiments described above, the reference group generation unit 103, 103a generates a new reference group based on the determination result of the adjacent reference groups, and constructs a coding structure of two or more layers. However, processing for constructing only one layer may be performed.

また、上記の第１及び第２の実施形態において参照グループ生成部１０３，１０３ａは、表示順で過去の隣接する参照グループを結合して新たな参照グループを生成するようにしているが、本発明の実施の形態は、当該実施の形態に限られない。例えば、表示順で未来の隣接する参照グループを結合してもよいし、過去と未来の双方で隣接する参照グループを結合してもよい。 Further, in the first and second embodiments described above, the reference group generation unit 103, 103a combines the past adjacent reference groups in display order to generate a new reference group, but the present invention The embodiment of the invention is not limited to the embodiment. For example, adjacent reference groups in the future may be combined in display order, or adjacent reference groups in both the past and the future may be combined.

また、上記の第１及び第２の実施形態において、評価値判定部１１１は、相対評価用閾値であるＴｈ＿ｍｉｎ及びＴｈ＿ｍａｘ、並びに絶対評価用閾値であるＴｈ＿ａｂｓに基づく判定を行っている。この判定において、評価値判定部１１１は、Ｔｈ＿ｍｉｎ未満であるか否か、Ｔｈ＿ｍａｘを超過しているか否か、Ｔｈ＿ａｂｓ未満であるか否かに基づく判定を行っているが、未満であるか否か、超過しているか否かという判定は、一例であり、Ｔｈ＿ｍｉｎ以下であるか否か、Ｔｈ＿ｍａｘ以上であるか否か、Ｔｈ＿ａｂｓ以下であるか否かに基づいて判定するようにしてもよい。 In the first and second embodiments described above, the evaluation value determination unit 111 performs determination based on Th_min and Th_max, which are relative evaluation thresholds, and Th_abs, which is an absolute evaluation threshold. In this determination, the evaluation value determination unit 111 makes a determination based on whether it is less than Th_min, whether it exceeds Th_max, and whether it is less than Th_abs. The determination as to whether it is exceeded is an example, and it may be determined based on whether it is Th_min or less, Th_max or more, or Th_abs or less.

上述した第１及び第２の実施形態における映像符号化装置１，１ａをコンピュータで実現するようにしてもよい。その場合、この機能を実現するためのプログラムをコンピュータ読み取り可能な記録媒体に記録して、この記録媒体に記録されたプログラムをコンピュータシステムに読み込ませ、実行することによって実現してもよい。なお、ここでいう「コンピュータシステム」とは、ＯＳや周辺機器等のハードウェアを含むものとする。また、「コンピュータ読み取り可能な記録媒体」とは、フレキシブルディスク、光磁気ディスク、ＲＯＭ、ＣＤ−ＲＯＭ等の可搬媒体、コンピュータシステムに内蔵されるハードディスク等の記憶装置のことをいう。さらに「コンピュータ読み取り可能な記録媒体」とは、インターネット等のネットワークや電話回線等の通信回線を介してプログラムを送信する場合の通信線のように、短時間の間、動的にプログラムを保持するもの、その場合のサーバやクライアントとなるコンピュータシステム内部の揮発性メモリのように、一定時間プログラムを保持しているものも含んでもよい。また上記プログラムは、前述した機能の一部を実現するためのものであってもよく、さらに前述した機能をコンピュータシステムにすでに記録されているプログラムとの組み合わせで実現できるものであってもよく、ＦＰＧＡ（Field Programmable Gate Array）等のプログラマブルロジックデバイスを用いて実現されるものであってもよい。 The video encoding devices 1 and 1a in the first and second embodiments described above may be realized by a computer. In that case, a program for realizing this function may be recorded in a computer readable recording medium, and the program recorded in the recording medium may be read and executed by a computer system. Here, the “computer system” includes an OS and hardware such as peripheral devices. The term "computer-readable recording medium" refers to a storage medium such as a flexible disk, a magneto-optical disk, a ROM, a portable medium such as a ROM or a CD-ROM, or a hard disk built in a computer system. Furthermore, “computer-readable recording medium” dynamically holds a program for a short time, like a communication line in the case of transmitting a program via a network such as the Internet or a communication line such as a telephone line. It may also include one that holds a program for a certain period of time, such as volatile memory in a computer system that becomes a server or a client in that case. Further, the program may be for realizing a part of the functions described above, or may be realized in combination with the program already recorded in the computer system. It may be realized using a programmable logic device such as an FPGA (Field Programmable Gate Array).

以上、この発明の実施形態について図面を参照して詳述してきたが、具体的な構成はこの実施形態に限られるものではなく、この発明の要旨を逸脱しない範囲の設計等も含まれる。 The embodiment of the present invention has been described in detail with reference to the drawings. However, the specific configuration is not limited to this embodiment, and includes design and the like within the scope of the present invention.

１０…符号化構造構築部，１０１…符号化対象フレーム選択部，１０２…Ｉピクチャ選定部，１０３…参照グループ生成部，１０４…結合判定部，１０５…ピクチャタイプ選定部，１０６…ピクチャタイプ記憶部，１０７…ブロック出力部，１１０…評価値算出部，１１１…評価値判定部，１１２…判定結果記憶部 10 Coding structure construction unit 101 Coding target frame selection unit 102 I picture selection unit 103 Reference group generation unit 104 Connection determination unit 105 Picture type selection unit 106 Picture type storage unit , 107 ... block output unit, 110 ... evaluation value calculation unit, 111 ... evaluation value determination unit, 112 ... determination result storage unit

Claims

A video coding apparatus that performs interframe prediction coding, comprising:
A reference group generation unit that generates a reference group including a predetermined number of continuous frames of three or more from a frame group acquired from input video information;
Based on the first and last frames of the reference group generated by the reference group generation unit and any one determination target frame between the first and last frames, an evaluation value regarding the error between the frames is calculated. An evaluation value calculation unit to calculate;
Among the evaluation values calculated by the evaluation value calculation unit, based on relative values of other evaluation values based on the evaluation value between the determination target frame and the first frame, or with the relative value An evaluation value that determines whether or not the reference relationship between the determination target frame and the first and last frames is to be combined based on the determination target frame and the evaluation value between both the first and last frames. A judgment unit,
A picture type selection unit that selects a picture type of the frame included in the frame group based on the determination result of the evaluation value determination unit;
A video coding apparatus comprising:

The evaluation value calculation unit
A first one-way evaluation value regarding the inter-frame error between the determination target frame and the first frame, and a second one regarding the error between the determination target frame and the last frame Calculating a one-way evaluation value and a two-way evaluation value regarding an error between the frames between the determination target frame and both the first and last frames;
The evaluation value determination unit
The smaller one of the second one-way evaluation value and the two-way evaluation value is selected, and the relative value of the selected one of the evaluation values relative to the first one-way evaluation value is selected. A value is calculated, and it is determined whether or not the reference relationship between the frame to be determined and the first and last frames is combined based on the calculated relative value and a predetermined relative evaluation threshold value. The video encoding device according to Item 1.

The predetermined relative evaluation threshold includes a first relative evaluation threshold and a second relative evaluation threshold larger in value than the first relative evaluation threshold.
The evaluation value determination unit
It is determined whether or not the reference relationship between the frame to be determined and the first and last frames is combined based on whether or not the relative value is less than the first relative evaluation threshold.
When it is determined that the relative value is not less than the first relative evaluation threshold and the reference frame of the determination target frame is not the first and last frames, the relative value is for the first relative evaluation. In the case of the value of the range defined by the threshold value and the second relative evaluation threshold value, the determination target frame and the first and last frames are determined based on the bidirectional evaluation value and a predetermined absolute evaluation threshold value. The video encoding apparatus according to claim 2, wherein it is determined whether or not to combine reference relationships with frames.

The reference group generation unit
From the three or more predetermined number of sheets including the reference group such that the first frame or the last frame of the reference group determined to be combined by the evaluation value determination unit becomes the determination target frame Generate a new reference group including a large number of consecutive frames,
The evaluation value calculation unit
The evaluation value regarding the error between the frames is calculated based on the first and last frames of the new reference group and the determination target frame,
The evaluation value determination unit
Among the evaluation values calculated by the evaluation value calculation unit, a relative value of another evaluation value based on the evaluation value between the determination target frame of the new reference group and the first frame or the relative value of the evaluation value The reference relationship between the determination target frame and the first and last frames is determined based on the relative value and the evaluation value between the determination target frame of the new reference group and both the first and last frames. The video coding apparatus according to any one of claims 1 to 3, which determines whether or not to combine.

The reference group generation unit
When generating another reference group adjacent to the reference group, the three predetermined frames are overlapped such that the last frame of the reference group is the first frame of the other adjacent reference group Generating the reference group including the above number of consecutive frames,
When it is determined that the evaluation value determination unit combines the reference relationship between the determination target frame and the first and last frames in both two adjacent reference groups, the adjacent two reference groups are combined to each other. 5. The video encoding apparatus according to claim 4, wherein the new reference group is generated, and the overlapping frame is set as the determination target frame.

The reference group generation unit
The video coding apparatus according to claim 5, wherein generation of the new reference group is repeated according to a value of the number of layers of the coding structure to be defined in advance.

The picture type selection unit
And an I picture selection unit that acquires the frame group from the input video information and selects the first frame of the acquired frame group as an I picture,
The picture type selection unit
When the combination determination unit determines that the reference relationship between the determination target frame and the first and last frames is to be combined, the determination target frame is selected as a B picture having the first and last frames as reference frames,
The frame not selected as the I picture or the B picture is selected as the P picture, and the frame of the past P picture closest to the selected frame is selected in the display order of the frame group The reference frame of the frame, when there is no previous frame of the P picture closest to the selected frame, the frame of the I picture is the reference frame of the selected frame. 6. The video encoding device according to any one of 6.

The video encoding apparatus according to any one of claims 1 to 7, wherein the number of the predetermined three or more sheets is three.