JPWO2008053557A1

JPWO2008053557A1 - Moving image re-encoding device, moving image re-encoding method, moving image re-encoding program, and recording medium storing moving image re-encoding program

Info

Publication number: JPWO2008053557A1
Application number: JP2008541969A
Authority: JP
Inventors: 賢二恒川
Original assignee: Pioneer Corp
Current assignee: Pioneer Corp
Priority date: 2006-11-02
Filing date: 2006-11-02
Publication date: 2010-02-25
Also published as: WO2008053557A1

Abstract

【課題】符号化した動画像を一旦復号して、再度符号化する動画像再符号化装置において、遅延時間やメモリが削減されるとともに適切な符号量を予測する。【解決手段】入力ビットストリームを復号部１０１で復号した際に得られるピクチャタイプから、符号化情報解析部１０２内のＧＯＰ構造予測部２００において、例えば８つのＧＯＰ内のＰピクチャ枚数をカウントし、最大値と最小値を持つＧＯＰを除いた６つのＧＯＰからＰピクチャ枚数とＢピクチャ枚数の平均値を算出し、それを予測したＧＯＰ構造とする。符号化部１０３では、予測したＧＯＰ構造を参照して再符号化を行う。【選択図】図１In a moving image re-encoding apparatus that once decodes an encoded moving image and re-encodes it, a delay time and memory are reduced and an appropriate code amount is predicted. A GOP structure prediction unit in a coding information analysis unit counts, for example, the number of P pictures in eight GOPs from a picture type obtained when an input bit stream is decoded by a decoding unit. An average value of the number of P pictures and the number of B pictures is calculated from six GOPs excluding the GOP having the maximum value and the minimum value, and a predicted GOP structure is obtained. The encoding unit 103 performs re-encoding with reference to the predicted GOP structure. [Selection] Figure 1

Description

本発明は、一度符号化された動画像を符号化方式やビットレートなどを変更して再度符号化する動画像再符号化装置、動画像再符号化方法、動画像再符号化プログラムおよび動画像再符号化プログラムを格納した記録媒体に関する。 The present invention relates to a moving image re-encoding device, a moving image re-encoding method, a moving image re-encoding program, and a moving image that re-encode a moving image that has been encoded once by changing an encoding method, a bit rate, or the like. The present invention relates to a recording medium storing a re-encoding program.

動画像をデジタル化して記録または放送されることが多くなっている。しかし動画像はデータ量が膨大であるために、通常、符号化によりデータ量を少なくして記録または放送されている。そのような動画像の符号化の方式としてＭＰＥＧ（Moving Picture Experts Group）−２などが用いられる。 In many cases, moving images are digitized and recorded or broadcast. However, since moving images have an enormous amount of data, they are usually recorded or broadcast with a reduced amount of data by encoding. MPEG (Moving Picture Experts Group) -2 or the like is used as such a moving image encoding method.

ＭＰＥＧ−２などで符号化された動画像を、記録する記録媒体の容量や伝送する通信回線の速度などに応じてビットレートを変換して再符号化したり、近年登場したＭＰＥＧ−４やＨ．２６４などのＭＰＥＧ−２よりも高効率な符号化方式で再符号化したりすることがある。 A moving picture encoded by MPEG-2 or the like is re-encoded by converting the bit rate according to the capacity of a recording medium to be recorded or the speed of a communication line to be transmitted, or recently introduced MPEG-4 or H.264. In some cases, re-encoding may be performed using a higher-efficiency encoding method than MPEG-2 such as H.264.

このような、一度符号化した動画像を一旦復号して再度符号化する装置として例えば特許文献１に記載のトランスコーダがある。特許文献１に記載のトランスコーダは、復号部１が入力された順に復号したＩフレーム、Ｐフレーム、Ｂフレームを、復号した順のまま再度ＩフレームはＩフレーム、ＰフレームはＰフレーム、ＢフレームはＢフレームとして符号化することで、遅延時間やメモリの削減を可能としている。
特開２０００−９２４９７号公報 For example, there is a transcoder described in Patent Document 1 as an apparatus for once decoding and re-encoding a once-encoded moving image. In the transcoder described in Patent Document 1, the I frame, the P frame, and the B frame that are decoded in the order in which the decoding unit 1 is input, the I frame is the I frame, the P frame is the P frame, and the B frame in the decoding order. Is encoded as a B frame, thereby reducing delay time and memory.
JP 2000-92497 A

上述した特許文献１に記載のトランスコーダは、復号した際に画像を表示順に並べ替える並べ替え器を有しないので表示順に並べ替えるための遅延時間やメモリが削減できる。しかしながら、ビットレートや符号化方式を変えて再符号化する場合は、復号前のビットストリームと再符号化時に割当てる符号量が異なることが多く、前記特許文献１には再符号化時に割当てる符号量に関する記載はないために、例えば適切な符号量が割当てられないことから画質が低下する可能性があった。 Since the transcoder described in Patent Document 1 described above does not have a rearranger that rearranges images in the display order when decoding, delay time and memory for rearranging the images in the display order can be reduced. However, when re-encoding is performed by changing the bit rate or the encoding method, the code amount allocated at the time of re-encoding is often different from the bit stream before decoding. For example, there is a possibility that the image quality deteriorates because an appropriate code amount is not assigned.

適切な符号量を割当てるためには、例えば１ＧＯＰ（Group Of Pictures）分などある程度まとまった画像をメモリ等に記憶することで、適切な符号量を割当てることができるが、このようにすると、多くのメモリが必要となることによるコスト上昇に加え、遅延時間が大きくなってしまう。 In order to assign an appropriate code amount, for example, an appropriate code amount can be assigned by storing a certain amount of images such as 1 GOP (Group Of Pictures) in a memory or the like. In addition to the increase in cost due to the need for memory, the delay time increases.

そこで、本発明は、例えば符号化した動画像をビットレートや符号化方式を変更して再符号化する際に、遅延時間やメモリが削減されるとともに適切な符号量を予測して符号化することができる動画像再符号化装置、動画像再符号化方法、動画像再符号化プログラムおよび動画像再符号化プログラムを格納した記録媒体を提供することを課題とする。 Therefore, for example, when re-encoding an encoded moving image by changing a bit rate or an encoding method, the present invention reduces the delay time and memory and encodes an appropriate code amount. An object is to provide a moving image re-encoding device, a moving image re-encoding method, a moving image re-encoding program, and a recording medium storing the moving image re-encoding program.

上記課題を解決するために、請求項１に記載の発明は、画面内符号化画像と、前方向予測符号化画像と、双方向予測符号化画像のうち少なくとも前記画面内符号化画像が１枚以上含まれる画像群から構成されるビットストリームが入力され該ビットストリームを復号し、画像データおよび符号化情報を得る復号手段と、前記復号手段が復号した前記画像データおよび前記符号化情報に基づいて再度前記画像データを符号化する符号化手段と、を有した動画像再符号化装置において、前記符号化情報に基づいて、複数の前記画像群各々における前記前方向予測符号化画像の数をカウントし、カウントされた複数の前記画像群各々における前記前方向予測符号化画像の数の最大値または最小値を持つ前記画像群のうち少なくともいずれか一方の前記画像群を一つ以上除いた複数の前記画像群における前記前方向予測符号化画像および前記双方向予測符号化画像の平均値を、予測した前記画像群の構造として前記符号化手段に出力する画像群構造予測手段を有することを特徴としている。 In order to solve the above-mentioned problem, the invention according to claim 1 is characterized in that at least one intra-coded image is selected from among an intra-coded image, a forward prediction coded image, and a bidirectional predictive coded image. Based on the decoding unit that receives the bit stream composed of the image group included above and decodes the bit stream to obtain the image data and the encoding information, and the image data and the encoding information decoded by the decoding unit And a coding means for coding the image data again, and counting the number of forward prediction coded images in each of the plurality of image groups based on the coding information. And the front of at least one of the image groups having the maximum value or the minimum value of the number of forward prediction encoded images in each of the counted plurality of image groups. An image that outputs an average value of the forward prediction encoded image and the bidirectional predictive encoded image in a plurality of the image groups excluding one or more image groups to the encoding unit as a predicted structure of the image group It has a group structure prediction means.

請求項８記載の発明は、画面内符号化画像と、前方向予測符号化画像と、双方向予測符号化画像のうち少なくとも前記画面内符号化画像が１枚以上含まれる画像群から構成されるビットストリームが入力され該ビットストリームを復号し、画像データおよび符号化情報を得て、復号した前記画像データおよび前記符号化情報に基づいて再度前記画像データを符号化する動画像再符号化方法において、前記符号化情報に基づいて、複数の前記画像群各々における前記前方向予測符号化画像の数をカウントし、カウントされた複数の前記画像群各々における前記前方向予測符号化画像の数の最大値または最小値を持つ前記画像群のうち少なくともいずれか一方の前記画像群を一つ以上除いた複数の前記画像群における前記前方向予測符号化画像および前記双方向予測符号化画像の平均値を、予測した前記画像群の構造として前記符号化時に用いることを特徴としている。 The invention according to claim 8 is constituted by an image group including at least one of the intra-coded images among the intra-coded image, the forward predictive coded image, and the bidirectional predictive coded image. In a moving image re-encoding method in which a bit stream is input, the bit stream is decoded, image data and encoding information are obtained, and the image data is encoded again based on the decoded image data and the encoding information The number of forward prediction encoded images in each of the plurality of image groups is counted based on the encoding information, and the maximum number of forward prediction encoded images in each of the plurality of counted image groups is counted. The forward predictive encoded image in a plurality of the image groups excluding at least one image group of at least one of the image groups having a value or a minimum value. The average value of the fine the bidirectional predictive coded picture, are characterized by using during the encoding the structure of the image group predicted.

請求項９に記載の発明は、画面内符号化画像と、前方向予測符号化画像と、双方向予測符号化画像のうち少なくとも前記画面内符号化画像が１枚以上含まれる画像群から構成されるビットストリームが入力され該ビットストリームを復号し、画像データおよび符号化情報を得る復号手段と、前記復号手段が復号した前記画像データおよび前記符号化情報に基づいて再度前記画像データを符号化する符号化手段としてコンピュータに機能させる動画像再符号化プログラムにおいて、前記符号化情報に基づいて、複数の前記画像群各々における前記前方向予測符号化画像の数をカウントし、カウントされた複数の前記画像群各々における前記前方向予測符号化画像の数の最大値または最小値を持つ前記画像群のうち少なくともいずれか一方の前記画像群を一つ以上除いた複数の前記画像群における前記前方向予測符号化画像および前記双方向予測符号化画像の平均値を、予測した前記画像群の構造として前記符号化手段に出力する画像群構造予測手段としてコンピュータに機能させることを特徴としている。 The invention according to claim 9 is configured by an image group including at least one of the intra-coded images among the intra-coded image, the forward predictive coded image, and the bidirectional predictive coded image. A decoding unit that receives the bit stream to be decoded and obtains image data and encoding information, and encodes the image data again based on the image data and the encoding information decoded by the decoding unit In a moving image re-encoding program that causes a computer to function as an encoding unit, the number of forward prediction encoded images in each of the plurality of image groups is counted based on the encoding information, and the plurality of counted At least one of the image groups having the maximum value or the minimum value of the number of forward prediction encoded images in each image group. An image that outputs an average value of the forward prediction encoded image and the bidirectional predictive encoded image in a plurality of the image groups excluding one or more image groups to the encoding unit as a predicted structure of the image group It is characterized by having a computer function as group structure prediction means.

本発明の一実施例にかかる動画像再符号化装置のブロック図である。It is a block diagram of the moving image re-encoding apparatus concerning one Example of this invention. 図１に示された動画像再符号化装置の符号化情報解析部のブロック図である。It is a block diagram of the encoding information analysis part of the moving image re-encoding apparatus shown by FIG. ＧＯＰの一例の説明図である。It is explanatory drawing of an example of GOP. ＧＯＰ構造予測部の動作を説明したフローチャートである。It is the flowchart explaining operation | movement of the GOP structure estimation part. 各ＧＯＰにおけるＰピクチャ枚数とＢピクチャ枚数のカウントの説明図である。It is explanatory drawing of the count of the number of P pictures and the number of B pictures in each GOP. 複数のＧＯＰからＰピクチャとＢピクチャの平均を求める際の説明図である。It is explanatory drawing at the time of calculating | requiring the average of P picture and B picture from several GOP. 符号化フレームレート予測部の動作を説明したフローチャートである。It is the flowchart explaining operation | movement of the encoding frame rate estimation part. 符号化ピクチャと表示ピクチャの説明図である。It is explanatory drawing of an encoding picture and a display picture. 各ＧＯＰにおける符号化ピクチャ枚数と表示ピクチャ枚数のカウントの説明図である。It is explanatory drawing of the count of the encoding picture number in each GOP and the number of display pictures. ＧＯＰ符号量予測部の動作を説明したフローチャートである。It is the flowchart explaining operation | movement of the GOP code amount prediction part. Ｐピクチャの最大値または最小値が複数ある場合のＰピクチャとＢピクチャの平均を求める際の説明図である。It is explanatory drawing at the time of calculating | requiring the average of P picture and B picture in case there exist two or more maximum values or minimum values of P picture. 動画像符号化プログラムの動作フローチャートである。It is an operation | movement flowchart of a moving image encoding program.

Explanation of symbols

１００動画像再符号化装置
１０１復号部（復号手段）
１０２符号化情報解析部
１０３符号化部（符号化手段）
２００ＧＯＰ構造予測部（画像群構造予測手段）
２０１符号化フレームレート予測部（フレームレート予測手段）
２０２ＧＯＰ符号量予測部（符号量予測手段）DESCRIPTION OF SYMBOLS 100 Moving image re-encoding apparatus 101 Decoding part (decoding means)
102 Encoding information analysis unit 103 Encoding unit (encoding means)
200 GOP structure prediction unit (image group structure prediction means)
201 Coding frame rate prediction unit (frame rate prediction means)
202 GOP code amount prediction unit (code amount prediction means)

以下、本発明の一実施形態にかかる動画像再符号化装置を説明する。本発明の一実施形態にかかる動画像再符号化装置は、画像群構造予測手段が、再符号化時に参照するための、画像群における前方向予測符号化画像と双方向予測符号化画像の枚数を画像群の構造として予測する際に、複数の画像群における前方向予測符号化画像の数の最大値を持つ画像群または最小値を持つ画像群のうち少なくともいずれか一方の画像群を一つ以上除いた複数の画像群の前方向予測符号化画像の平均値と双方向予測符号化画像の平均値を画像群の構造として予測し、符号化手段が、その画像群の構造に基づいて再符号化する。このようにすることにより、一時的に現れた極端な前方向予測符号化画像の枚数を持つ画像群に影響されることなく、より平均的な画像群の構造を予測することができるのでより適切な符号量を予測することが出来る。また、画像群の構造予測のために余分な画像を保持する必要が無いので遅延時間やメモリ量が削減される。 Hereinafter, a moving image re-encoding device according to an embodiment of the present invention will be described. The moving image re-encoding device according to an embodiment of the present invention includes the number of forward-predictive encoded images and bidirectional predictive-encoded images in an image group for reference by the image group structure predicting unit during re-encoding. Is predicted as the structure of the image group, at least one of the image group having the maximum value or the image group having the minimum value of the number of forward prediction encoded images in the plurality of image groups is one. The average value of the forward prediction encoded images and the average value of the bidirectional predictive encoded images of the plurality of image groups excluded as described above are predicted as the structure of the image group, and the encoding unit re-generates based on the structure of the image group. Encode. By doing so, it is possible to predict the structure of a more average image group without being affected by the image group having the extreme number of forward-predicted encoded images that appear temporarily. A large amount of code can be predicted. Further, since it is not necessary to hold an extra image for the structure prediction of the image group, the delay time and the memory amount are reduced.

また、画像群構造予測手段が、カウントされた複数の画像群各々における前方向予測符号化画像の数のうち最大値および最小値を持つ画像群をそれぞれ一つ以上除いた複数の画像群における前方向予測符号化画像および双方向予測符号化画像の平均値を画像群の構造として符号化手段に出力しても良い。このようにすることにより、最大値方向と最小値方向の両方に一時的に現れた極端な前方向予測符号化画像の枚数を持つ画像群に影響されることなく、より精度良く平均的な画像群の構造を予測することができる。 In addition, the image group structure prediction means includes a front of a plurality of image groups obtained by excluding one or more image groups each having a maximum value and a minimum value from the number of forward prediction encoded images in each of the plurality of counted image groups. The average value of the direction prediction encoded image and the bidirectional prediction encoded image may be output to the encoding means as the structure of the image group. By doing in this way, an average image can be obtained with higher accuracy without being affected by an image group having an extreme number of forward-predictive encoded images that temporarily appear in both the maximum value direction and the minimum value direction. The structure of the group can be predicted.

また、画像群構造予測手段が、カウントされた複数の画像群各々における前方向予測符号化画像の数のうち、大きい順に複数の画像群を除いた複数の画像群における前方向予測符号化画像および双方向予測符号化画像の平均値を画像群の構造として符号化手段に出力しても良い。このようにすることにより、最大値方向に一時的に現れた極端な前方向予測符号化画像の枚数を持つ画像群が複数存在してもそれらに影響されることなく、より精度良く平均的な画像群の構造を予測することができる。 Further, the image group structure prediction means includes a forward prediction encoded image in a plurality of image groups excluding the plurality of image groups in descending order among the counted number of forward prediction encoded images in each of the plurality of image groups, and You may output the average value of a bi-directional predictive coding image to a coding means as a structure of an image group. By doing in this way, even if there are a plurality of image groups having the number of extreme forward predictive encoded images that temporarily appear in the maximum value direction, it is not affected by them and the average can be obtained with higher accuracy. The structure of the image group can be predicted.

また、画像群構造予測手段が、カウントされた複数の画像群各々における前方向予測符号化画像の数のうち、小さい順に複数の画像群を除いた複数の画像群における前方向予測符号化画像および双方向予測符号化画像の平均値を画像群の構造として符号化手段に出力しても良い。このようにすることにより、最小値方向に一時的に現れた極端な前方向予測符号化画像の枚数を持つ画像群が複数存在してもそれらに影響されることなく、より精度良く平均的な画像群の構造を予測することができる。 Further, the image group structure prediction means includes a forward prediction encoded image in a plurality of image groups excluding the plurality of image groups in order from the smallest number of forward prediction encoded images in each of the plurality of counted image groups, and You may output the average value of a bi-directional predictive coding image to a coding means as a structure of an image group. By doing this, even if there are a plurality of image groups having the number of extreme forward predictive encoded images that temporarily appear in the minimum value direction, the average is more accurate without being affected by them. The structure of the image group can be predicted.

また、画像群構造予測手段が、複数の画像群各々における双方向予測符号化画像の数をカウントし、カウントされた複数の画像群各々における前方向予測符号化画像の数のうち、最大値または最小値を持つ画像群が夫々複数ある場合は、複数の最大値または最小値を持つ画像群のうち、カウントされた複数の画像群における双方向予測符号化画像の平均値から離れた値をもつ画像群を除いた複数の画像群における前方向予測符号化画像および双方向予測符号化画像の平均値を画像群の構造として符号化手段に出力しても良い。このようにすることにより、前方向予測符号化画像の最大値または最小値を持つ画像群が複数あっても、双方向予測符号化画像の極端な値を持つ方を除くことが出来るので、前方向予測符号化画像だけでなく双方向予測符号化画像においても、より精度良く平均的な画像群の構造を予測することができる。 Further, the image group structure prediction means counts the number of bidirectional prediction encoded images in each of the plurality of image groups, and the maximum value or the number of forward prediction encoded images in each of the plurality of image groups counted When there are a plurality of image groups each having a minimum value, among the plurality of image groups having the maximum value or the minimum value, the image group has a value far from the average value of the bi-directional predictive encoded images in the counted plurality of image groups. The average value of the forward prediction encoded image and the bidirectional predictive encoded image in a plurality of image groups excluding the image group may be output to the encoding means as the structure of the image group. By doing this, even if there are a plurality of image groups having the maximum value or the minimum value of the forward prediction encoded image, it is possible to exclude the one having the extreme value of the bidirectional predictive encoded image. The structure of the average image group can be predicted with higher accuracy not only in the direction prediction encoded image but also in the bidirectional prediction encoded image.

また、符号化情報に基づいて、複数の画像群各々における符号化された画像の数および複数の画像群各々における表示されるべき画像の数を夫々カウントし、複数の画像群各々における符号化された画像の数と複数の画像群各々における表示されるべき画像の数との比と、符号化情報から、符号化手段が１秒間に再符号化すべき画像枚数（符号化フレームレート）を予測するフレームレート予測手段を有しても良い。このようにすることにより、符号化された画像の数だけでなく表示されるべき画像の数も参照しているので、再表示情報（リピートピクチャ情報）を持つ画像が多い場合は再符号化する際に実質的な符号化フレームレートを精度良く予測することが出来るのでより適切な符号量を予測することが出来る。 In addition, based on the encoding information, the number of encoded images in each of the plurality of image groups and the number of images to be displayed in each of the plurality of image groups are counted, respectively. From the ratio between the number of images to be displayed and the number of images to be displayed in each of the plurality of image groups, and the encoding information, the number of images (encoding frame rate) to be re-encoded by the encoding unit per second is predicted. You may have a frame rate estimation means. In this way, since not only the number of encoded images but also the number of images to be displayed are referred to, if there are many images having redisplay information (repeat picture information), reencoding is performed. In this case, since a substantial encoding frame rate can be accurately predicted, a more appropriate code amount can be predicted.

また、画像群構造予測手段が予測した画像群の構造と、フレームレート予測手段が予測した符号化フレームレートと、予め設定された再符号化のビットレートと、に基づいて符号化手段が符号化する際の画像群当たりの符号量を予測する符号量予測手段を有しても良い。このようにすることにより、再符号化する際に画像群構造やリピートピクチャを考慮した実質的な画像群当たりの符号量を精度良く予測することが出来る。 In addition, the encoding unit encodes based on the structure of the image group predicted by the image group structure prediction unit, the encoded frame rate predicted by the frame rate prediction unit, and a preset re-encoding bit rate. Code amount prediction means for predicting the code amount per image group at the time of image processing may be provided. By doing so, it is possible to accurately predict the code amount per image group in consideration of the image group structure and the repeat picture when re-encoding.

また、本発明の一実施形態にかかる動画像再符号化方法は、再符号化時に参照するために画像群における前方向予測符号化画像と双方向予測符号化画像の枚数を画像群の構造として予測する際に、複数の画像群における前方向予測符号化画像の数の最大値を持つ画像群または最小値を持つ画像群のうち少なくともいずれか一方の画像群を一つ以上除いた複数の画像群の前方向予測符号化画像の平均値と双方向予測符号化画像の平均値を予測した画像群の構造として、その画像群の構造に基づいて再符号化する。このようにすることにより、一時的に現れた極端な前方向予測符号化画像の枚数を持つ画像群に影響されることなく、より平均的な画像群の構造を予測することができる。また、画像群予測のために余分な画像を保持する必要が無いので遅延時間やメモリ量が削減される。 In addition, the moving image re-encoding method according to the embodiment of the present invention uses the number of forward prediction encoded images and bidirectional predictive encoded images in the image group as a structure of the image group for reference during re-encoding. A plurality of images excluding at least one image group of at least one of an image group having a maximum value or an image group having a minimum value of the number of forward prediction encoded images in a plurality of image groups when performing prediction Based on the structure of the image group, re-encoding is performed as the structure of the image group in which the average value of the forward prediction encoded image of the group and the average value of the bidirectional predictive encoded image are predicted. By doing in this way, it is possible to predict a more average image group structure without being influenced by an image group having an extreme number of forward-predictive encoded images that appear temporarily. Further, since there is no need to hold an extra image for the image group prediction, the delay time and the memory amount are reduced.

また、本発明の一実施形態にかかる動画像再符号化プログラムは、画像群構造予測手段が、再符号化時に参照するための、画像群における前方向予測符号化画像と双方向予測符号化画像の枚数を画像群の構造として予測する際に、複数の画像群における前方向予測符号化画像の数の最大値を持つ画像群または最小値を持つ画像群のうち少なくともいずれか一方の画像群を一つ以上除いた複数の画像群の前方向予測符号化画像の平均値と双方向予測符号化画像の平均値を予測した画像群の構造とし、符号化手段が、その画像群の構造に基づいて再符号化するようにコンピュータに機能させる。このようにすることにより、一時的に現れた極端な前方向予測符号化画像の枚数を持つ画像群に影響されることなく、より平均的な画像群の構造を予測することができる。また、画像群予測のために余分な画像を保持する必要が無いので遅延時間やメモリ量が削減される。 In addition, the moving image re-encoding program according to the embodiment of the present invention includes a forward-predictive encoded image and a bi-directional predictive-encoded image in an image group that the image group structure prediction unit refers to when re-encoding. When predicting the number of images as the structure of the image group, at least one of the image group having the maximum value or the image group having the minimum value of the number of forward prediction encoded images in the plurality of image groups is selected. The average value of the forward prediction encoded images and the average value of the bidirectional predictive encoded images of a plurality of image groups excluding one or more are set as the predicted image group structure, and the encoding unit is based on the structure of the image group. And let the computer function to re-encode. By doing in this way, it is possible to predict a more average image group structure without being influenced by an image group having an extreme number of forward-predictive encoded images that appear temporarily. Further, since there is no need to hold an extra image for the image group prediction, the delay time and the memory amount are reduced.

また、請求項９に記載の動画像再符号化プログラムを記録媒体に格納してもよい。このようにすることにより、動画像再符号化プログラムを機器に組み込む以外に単体でも流通させることができる。 Further, the moving image re-encoding program according to claim 9 may be stored in a recording medium. In this way, the moving image re-encoding program can be distributed alone as well as being incorporated into the device.

本発明の一実施例にかかる動画像再符号化装置１００を図１乃至図１０を参照して説明する。動画像再符号化装置１００は、入力端子１１０から入力される動画像が符号化されたビットストリームをビットレートや符号化方式を変更して再符号化して出力端子１１５から出力する装置である。動画像再符号化装置１００は、図１に示すように復号部１０１と、符号化情報解析部１０２と、符号化部１０３と、入力端子１１０と設定データ端子１１１と、出力端子１１５とを備えている。 A moving image re-encoding device 100 according to an embodiment of the present invention will be described with reference to FIGS. The moving image re-encoding device 100 is a device that re-encodes a bit stream obtained by encoding a moving image input from the input terminal 110 while changing the bit rate and the encoding method and outputs the re-encoded image from the output terminal 115. As shown in FIG. 1, the moving image re-encoding apparatus 100 includes a decoding unit 101, an encoded information analysis unit 102, an encoding unit 103, an input terminal 110, a setting data terminal 111, and an output terminal 115. ing.

復号手段としての復号部１０１は、入力端子１１０から入力されるビットストリームを復号し、画像データ１１３を符号化部１０３へ出力するとともに、符号化情報１１２を符号化部１０３および符号化情報解析部１０２へ出力する。例えば、ＭＰＥＧ−２方式によって符号化されたビットストリームが入力された場合は、該ビットストリームを復号し、復号された画像データ１１３（輝度や色差データなど）を符号化部１０３へ出力する。入力ビットストリーム中のヘッダ情報などは符号化情報１１２として符号化情報解析部１０２および符号化部１０３へ出力する。 The decoding unit 101 as decoding means decodes the bit stream input from the input terminal 110, outputs the image data 113 to the encoding unit 103, and encodes the encoded information 112 into the encoding unit 103 and the encoded information analysis unit. To 102. For example, when a bit stream encoded by the MPEG-2 method is input, the bit stream is decoded, and decoded image data 113 (luminance, color difference data, etc.) is output to the encoding unit 103. Header information and the like in the input bitstream are output to the encoded information analysis unit 102 and the encoding unit 103 as encoded information 112.

符号化情報１１２は、例えば、ピクチャタイプ（Ｉピクチャか、Ｐピクチャか、Ｂピクチャか）やフィールド再表示情報またはフレーム再表示情報およびフレームレート情報（画面上に表示する際のフレームレートで、例えばＮＴＳＣであれば２９．９７フレーム／秒）などが出力される。Ｉピクチャ、Ｐピクチャ、ＢピクチャはＭＰＥＧなどで用いられる符号化された画像の分類であり、Ｉピクチャは画面内符号化が施された画像、Ｐピクチャは前方向予測符号化が施された画像、Ｂピクチャは双方向予測符号化が施された画像を示している。 The encoding information 112 is, for example, a picture type (I picture, P picture, or B picture), field redisplay information, frame redisplay information, and frame rate information (frame rate when displayed on the screen, for example, In the case of NTSC, 29.97 frames / second) is output. I picture, P picture, and B picture are classifications of encoded images used in MPEG, etc., I picture is an image that has been subjected to intra-picture encoding, and P picture is an image that has been subjected to forward predictive encoding , B pictures show images that have been subjected to bi-directional predictive coding.

符号化情報解析部１０２では、図２に示すようにＧＯＰ構造予測部２００と、符号化フレームレート予測部２０１と、ＧＯＰ符号量予測部２０２とを備えており、設定データ端子１１１から入力される設定データおよび符号化情報１１２が入力され、前記設定データおよび符号化情報１１２に基づいて符号化部１０３において再符号化する際の割当て符号量を算出し、解析結果情報１１４として符号化部１０３へ出力する。 The encoding information analysis unit 102 includes a GOP structure prediction unit 200, an encoding frame rate prediction unit 201, and a GOP code amount prediction unit 202 as shown in FIG. Setting data and encoding information 112 are input, and based on the setting data and encoding information 112, an allocation code amount when re-encoding is calculated in the encoding unit 103 is calculated, and analysis result information 114 is transmitted to the encoding unit 103. Output.

設定データ端子１１１は、動画像再符号化装置１００の初期設定時に設定されるパラメータ、例えば出力ビットストリームのビットレートなどが入力される。 The setting data terminal 111 receives parameters set when the moving image re-encoding device 100 is initially set, for example, the bit rate of the output bit stream.

解析結果情報１１４は、例えば、ＧＯＰ単位の割当て符号量やＧＯＰ内のＰピクチャ予測枚数およびＢピクチャ予測枚数や符号化フレームレートなどが出力される。画像群としてのＧＯＰとはGroup Of Picturesの略で、少なくとも１枚以上のＩピクチャから構成される単位である（通常、複数のピクチャから構成されることが多い）。ＧＯＰ構造とはＧＯＰ内のＩピクチャ、ＰピクチャおよびＢピクチャの枚数構成である。図３にＧＯＰ構造の例を示す。ＧＯＰはＩピクチャから始まり、次のＩピクチャの１枚前のピクチャまでが１ＧＯＰとなる。図３では１ＧＯＰは１２枚のピクチャで構成されており、ＧＯＰ内のＩピクチャは１枚、Ｐピクチャは３枚、Ｂピクチャは８枚である。 As the analysis result information 114, for example, the allocated code amount in GOP units, the predicted number of P pictures in the GOP, the predicted number of B pictures, the encoding frame rate, and the like are output. GOP as an image group is an abbreviation of Group Of Pictures, and is a unit composed of at least one I picture (usually composed of a plurality of pictures). The GOP structure is a configuration of the number of I pictures, P pictures, and B pictures in a GOP. FIG. 3 shows an example of the GOP structure. A GOP starts from an I picture, and a GOP is the one picture before the next I picture. In FIG. 3, 1 GOP is composed of 12 pictures. There are 1 I picture, 3 P pictures, and 8 B pictures in the GOP.

画像群構造予測手段としてのＧＯＰ構造予測部２００は、符号化情報１１２が入力され、それに基づいてＧＯＰ構造を予測し、ＧＯＰ構造情報２１０を出力する。 The GOP structure prediction unit 200 as an image group structure prediction unit receives the encoded information 112, predicts the GOP structure based on the encoded information 112, and outputs the GOP structure information 210.

ＧＯＰ構造予測部２００におけるＧＯＰ構造の予測方法を図４のフローチャートを参照して説明する。 A GOP structure prediction method in the GOP structure prediction unit 200 will be described with reference to the flowchart of FIG.

まず、ステップＳ１０１において、ＧＯＰ構造の初期値を設定し、ステップＳ１０２へ進む。ＧＯＰ構造の初期値とは、動画像が入力される前に設定されるＩピクチャ、Ｐピクチャ、Ｂピクチャの枚数である。これらは、予測の初期値であって実際に入力される動画像の各ピクチャの枚数と一致しなくともよい。初期値は予めＧＯＰ構造予測部２００内部に持っておいても良いし、設定データ端子１１１から設定するようにしても良い。 First, in step S101, an initial value of the GOP structure is set, and the process proceeds to step S102. The initial value of the GOP structure is the number of I pictures, P pictures, and B pictures set before moving pictures are input. These are initial values of prediction and do not need to match the number of pictures of the moving image actually input. The initial value may be previously stored in the GOP structure prediction unit 200 or may be set from the setting data terminal 111.

次に、ステップＳ１０２において、復号部１０１に入力された画像のピクチャタイプが符号化情報１１２を経由して入力されステップＳ１０３に進む。 Next, in step S102, the picture type of the image input to the decoding unit 101 is input via the encoding information 112, and the process proceeds to step S103.

次に、ステップＳ１０３において、復号部１０１に入力された画像のピクチャタイプがＩピクチャか否かを判断してＩピクチャである場合（ＹＥＳの場合）はステップＳ１０６に進み、そうでない場合（ＮＯの場合）はステップＳ１０４に進む。 Next, in step S103, it is determined whether or not the picture type of the image input to the decoding unit 101 is an I picture. If the picture type is an I picture (YES), the process proceeds to step S106; In case), the process proceeds to step S104.

次に、ステップＳ１０４において、復号部１０１に入力された画像のピクチャタイプがＰピクチャか否かを判断してＰピクチャである場合（ＹＥＳの場合）はステップＳ１０８に進み、そうでない場合（ＮＯの場合）はステップＳ１０５に進む。 Next, in step S104, it is determined whether the picture type of the image input to the decoding unit 101 is a P picture. If the picture type is a P picture (in the case of YES), the process proceeds to step S108; If), the process proceeds to step S105.

次に、ステップＳ１０５において、ＩピクチャでもなくＰピクチャでもないということは、復号部１０１に入力された画像のピクチャタイプはＢピクチャであると判断して現在のＧＯＰにおけるＢピクチャの枚数に１を加算してステップＳ１０２に戻る。 Next, in step S105, if it is neither an I picture nor a P picture, it is determined that the picture type of the image input to the decoding unit 101 is a B picture, and the number of B pictures in the current GOP is set to 1. Add and return to step S102.

ステップＳ１０６においては、Ｉピクチャが入力されたと判断する。Ｉピクチャから次のＩピクチャの前の画像までが１ＧＯＰの範囲なのでＩピクチャ検出でＧＯＰの先頭を検出したとして、直前の過去複数のＧＯＰにおけるＰピクチャ枚数の最大値と最小値を持つＧＯＰを１つずつ除いて、それらＰピクチャ枚数の最大値と最小値のＧＯＰを１つずつ除いた過去複数のＧＯＰのＰピクチャ枚数の平均値とＢピクチャ枚数の平均値を算出しステップＳ１０７に進む。すなわち、複数のＧＯＰにおけるＰピクチャの枚数のうち、最大値および最小値を持つＧＯＰをそれぞれ一つ以上除いた複数のＧＯＰにおけるＰピクチャおよびＢピクチャの平均値を算出している。つまり、最小値か最大値を持つＧＯＰのうち少なくともいずれか一方のＧＯＰを一つ以上除いた複数のＧＯＰにおけるＰピクチャおよびＢピクチャの平均値を算出している。 In step S106, it is determined that an I picture has been input. Since the range from the I picture to the previous image of the next I picture is in the range of 1 GOP, assuming that the top of the GOP is detected by I picture detection, the GOP having the maximum value and the minimum value of the number of P pictures in the previous plurality of GOPs is 1 The average value of the number of P pictures and the average number of B pictures of a plurality of past GOPs, excluding one by one and removing the maximum and minimum GOPs of the P pictures one by one, proceeds to step S107. That is, the average value of P pictures and B pictures in a plurality of GOPs is calculated by removing one or more GOPs having the maximum value and the minimum value from the number of P pictures in the plurality of GOPs. That is, the average value of the P picture and the B picture in a plurality of GOPs excluding at least one of the GOPs having the minimum value or the maximum value is calculated.

ステップＳ１０６を、図５と図６を参照して詳細に説明する。図５は、直前の過去複数のＧＯＰとして８ＧＯＰを保持した場合の例である。入力されたＩピクチャを含むＧＯＰを現在のＧＯＰとすると、現在のＧＯＰから時間的に近い順にＧＯＰ［０］、ＧＯＰ［１］、ＧＯＰ［２］、ＧＯＰ［３］、ＧＯＰ［４］、ＧＯＰ［５］、ＧＯＰ［６］、ＧＯＰ［７］とし、各ＧＯＰにおけるＰピクチャの枚数をＮｐ［０］、Ｎｐ［１］、Ｎｐ［２］、Ｎｐ［３］、Ｎｐ［４］、Ｎｐ［５］、Ｎｐ［６］、Ｎｐ［７］とし、各ＧＯＰにおけるＢピクチャの枚数をＮｂ［０］、Ｎｂ［１］、Ｎｂ［２］、Ｎｂ［３］、Ｎｂ［４］、Ｎｂ［５］、Ｎｂ［６］、Ｎｂ［７］とする。すなわち、常に直前のＧＯＰから数えて８ＧＯＰ分のＰピクチャとＢピクチャの枚数を保持する。そして、各ＧＯＰにおけるＰピクチャの枚数とＢピクチャの枚数が図６のようになっていた場合は、Ｐピクチャ枚数の最大値であるＮｐ［２］と最小値であるＮｐ［６］を除いてＰピクチャの平均値ＡＶＲ＿ＮｐとＢピクチャの平均値ＡＶＲ＿Ｎｂを求めると図６に示すようにそれぞれ４と１０という値が求まる。 Step S106 will be described in detail with reference to FIGS. FIG. 5 shows an example in which 8 GOPs are held as a plurality of previous GOPs. Assuming that the GOP including the input I picture is the current GOP, GOP [0], GOP [1], GOP [2], GOP [3], GOP [4], GOP are in order of time from the current GOP. [5], GOP [6], GOP [7], and the number of P pictures in each GOP is Np [0], Np [1], Np [2], Np [3], Np [4], Np [ 5], Np [6], Np [7], and the number of B pictures in each GOP is Nb [0], Nb [1], Nb [2], Nb [3], Nb [4], Nb [5 ], Nb [6], and Nb [7]. That is, the number of P pictures and B pictures for 8 GOPs from the immediately preceding GOP is always held. If the number of P pictures and the number of B pictures in each GOP are as shown in FIG. 6, except for Np [2], which is the maximum value of P pictures, and Np [6], which is the minimum value. When the average value AVR_Np of the P picture and the average value AVR_Nb of the B picture are obtained, values 4 and 10 are obtained as shown in FIG.

次に、ステップＳ１０７において、ステップＳ１０６で算出したＰピクチャとＢピクチャの平均値（ＡＶＲ＿Ｎｐ、ＡＶＲ＿Ｎｂ）をＧＯＰ構造情報２１０としてＧＯＰ符号量予測部２０２に出力しステップＳ１０２に戻る。 Next, in step S107, the average value (AVR_Np, AVR_Nb) of the P picture and B picture calculated in step S106 is output to the GOP code amount prediction unit 202 as GOP structure information 210, and the process returns to step S102.

ステップＳ１０８においては、復号部１０１に入力された画像のピクチャタイプはＰピクチャであることから現在のＧＯＰにおけるＰピクチャの枚数に１を加算しステップＳ１０２に戻る。 In step S108, since the picture type of the image input to the decoding unit 101 is P picture, 1 is added to the number of P pictures in the current GOP, and the process returns to step S102.

フレームレート予測手段としての符号化フレームレート予測部２０１は、符号化情報１１２が入力され、それを基づいて符号化フレームレートを算出し、符号化フレームレート情報２１１を出力する。 An encoding frame rate prediction unit 201 as a frame rate prediction unit receives the encoding information 112, calculates an encoding frame rate based on the encoding information 112, and outputs encoded frame rate information 211.

符号化フレームレート予測部２０１における符号化フレームレートの算出方法を図７のフローチャートを参照して説明する。 A method of calculating the encoding frame rate in the encoding frame rate prediction unit 201 will be described with reference to the flowchart of FIG.

まず、ステップＳ２０１において、表示ピクチャ枚数と符号化ピクチャ枚数の初期値を設定し、ステップＳ２０２へ進む。これらは、予測の初期値であって実際に入力される表示ピクチャ枚数や符号化ピクチャ枚数と一致しなくともよい。初期値は予め符号化フレームレート予測部２０１に持っておいても良いし、設定データ端子１１１から設定するようにしてもよい。 First, in step S201, initial values of the number of displayed pictures and the number of encoded pictures are set, and the process proceeds to step S202. These are initial values of prediction and do not need to match the number of display pictures or the number of encoded pictures that are actually input. The initial value may be stored in advance in the encoding frame rate prediction unit 201 or may be set from the setting data terminal 111.

表示ピクチャ枚数と、符号化ピクチャ枚数について図８を参照して説明する。表示されるべき画像の数としての表示ピクチャ枚数とは、復号して画面に表示する際のピクチャ枚数を示し、符号化された画像の数としての符号化ピクチャ枚数とは、実際に符号化されているピクチャ枚数である。復号部１０１で復号した際に、フィールド再表示情報またはフレーム再表示情報が検出されたピクチャは、符号化ピクチャ枚数は１枚であるが表示ピクチャ枚数としては１枚以上となる。図８の場合は、１秒間の表示ピクチャ枚数を矢印の間の１２枚とすると、フレームレートは１２フレーム／秒、符号化フレームレートは８フレーム／秒となる。また、符号化情報１１２に含まれるフレームレート情報は表示ピクチャのフレームレートを指している。ここで再表示情報を検出せずにフレームレート情報のみから符号化ピクチャ１２枚に対して１秒分の符号量を割当てた場合、１枚当たりの符号量は１秒分の１／１２となる。しかし、再表示情報を検出すれば、１秒あたり符号化ピクチャが８枚であることが分かるために、１枚当たりの割当て符号量が１秒分の１／８に増大する。したがって、再表示情報を用いることで再符号化時にリピートピクチャ枚数を除いた実際の符号化ピクチャ枚数を再符号化の対象として符号量を割当てることで、適切な符号の割当てが可能となる。 The number of displayed pictures and the number of encoded pictures will be described with reference to FIG. The number of displayed pictures as the number of images to be displayed indicates the number of pictures when decoded and displayed on the screen, and the number of encoded pictures as the number of encoded images is actually encoded. It is the number of pictures. The picture for which field redisplay information or frame redisplay information is detected when decoding is performed by the decoding unit 101 has one encoded picture, but the number of displayed pictures is one or more. In the case of FIG. 8, when the number of displayed pictures per second is 12 between the arrows, the frame rate is 12 frames / second and the encoding frame rate is 8 frames / second. The frame rate information included in the encoding information 112 indicates the frame rate of the display picture. Here, when the code amount for 1 second is assigned to 12 encoded pictures from only the frame rate information without detecting redisplay information, the code amount per frame is 1/12 for 1 second. . However, if the re-display information is detected, it can be seen that there are 8 encoded pictures per second, so the allocated code amount per frame increases to 1/8 of a second. Therefore, by using the redisplay information, it is possible to assign an appropriate code by allocating the code amount by using the actual number of encoded pictures excluding the number of repeat pictures at the time of re-encoding as a re-encoding target.

次に、ステップＳ２０２において、復号部１０１に入力された画像のピクチャタイプが符号化情報１１２を経由して入力されステップＳ２０３に進む。 Next, in step S202, the picture type of the image input to the decoding unit 101 is input via the encoding information 112, and the process proceeds to step S203.

次に、ステップＳ２０３において、復号部１０１に入力された画像のピクチャタイプがＩピクチャか否かを判断してＩピクチャである場合（ＹＥＳの場合）はステップＳ２０４に進み、そうでない場合（ＮＯの場合）はステップＳ２０６に進む。 Next, in step S203, it is determined whether or not the picture type of the image input to the decoding unit 101 is an I picture. If the picture type is an I picture (YES), the process proceeds to step S204; In case), the process proceeds to step S206.

ステップＳ２０４においては、Ｉピクチャが入力されたと判断する。Ｉピクチャから次のＩピクチャの前の画像までが１ＧＯＰの範囲なのでＩピクチャ検出でＧＯＰの先頭を検出したとして、直前の過去複数のＧＯＰにおける符号化ピクチャ枚数と表示ピクチャ枚数から符号化フレームレートを算出しステップＳ２０５に進む。 In step S204, it is determined that an I picture has been input. Since the range from the I picture to the previous image of the next I picture is in the range of 1 GOP, assuming that the top of the GOP is detected by I picture detection, the encoding frame rate is calculated from the number of encoded pictures and the number of displayed pictures in the previous plurality of GOPs. The calculation proceeds to step S205.

符号化フレームレートは、coded_frame_rateを符号化フレームレート、numをデータを保持しておくＧＯＰ数、i=0,1,…,(num-1)、Nc[i]を直前から数えて過去i番目のＧＯＰにおける符号化ピクチャ枚数、Nd[i]を過去i番目のＧＯＰにおける表示ピクチャ枚数、frame_rateを符号化情報１１２に含まれるフレームレート情報とすると、例えば数１に示す式で表される。 The encoded frame rate is the past i-th counted coded_frame_rate, num is the number of GOPs holding data, i = 0,1, ..., (num-1), Nc [i] is counted immediately before When the number of encoded pictures in the GOP, Nd [i] is the number of displayed pictures in the past i-th GOP, and frame_rate is the frame rate information included in the encoded information 112, for example, the following expression 1 is obtained.

すなわち、符号化ピクチャ枚数と表示ピクチャ枚数との比と、符号化情報１１２に含まれるフレームレート情報から予測した符号化フレームレートを算出している。 That is, a predicted encoded frame rate is calculated from the ratio between the number of encoded pictures and the number of displayed pictures and the frame rate information included in the encoded information 112.

ステップＳ２０４を、図９を参照して詳細に説明する。図９は、直前の過去複数のＧＯＰとして８ＧＯＰを保持した場合の例である。入力されたＩピクチャを含むＧＯＰを現在のＧＯＰとすると、現在のＧＯＰから時間的に近い順にＧＯＰ［０］、ＧＯＰ［１］、ＧＯＰ［２］、ＧＯＰ［３］、ＧＯＰ［４］、ＧＯＰ［５］、ＧＯＰ［６］、ＧＯＰ［７］とし、各ＧＯＰにおける符号化ピクチャ枚数をＮｃ［０］、Ｎｃ［１］、Ｎｃ［２］、Ｎｃ［３］、Ｎｃ［４］、Ｎｃ［５］、Ｎｃ［６］、Ｎｃ［７］とし、各ＧＯＰにおける表示ピクチャ枚数をＮｄ［０］、Ｎｄ［１］、Ｎｄ［２］、Ｎｄ［３］、Ｎｄ［４］、Ｎｄ［５］、Ｎｄ［６］、Ｎｄ［７］とする。すなわち、常に直前のＧＯＰから数えて８ＧＯＰ分の符号化ピクチャ枚数と表示ピクチャ枚数を保持する。これらを数１に当てはめると、Ｎｃ［０］〜Ｎｃ［７］まで総数を、Ｎｄ［０］〜Ｎｄ［７］まで総数で割った値にフレームレート情報を掛けることで予測した符号化フレームレートとして求めることができる。 Step S204 will be described in detail with reference to FIG. FIG. 9 shows an example when 8 GOPs are held as a plurality of previous GOPs. Assuming that the GOP including the input I picture is the current GOP, GOP [0], GOP [1], GOP [2], GOP [3], GOP [4], GOP are in order of time from the current GOP. [5], GOP [6], GOP [7], and the number of encoded pictures in each GOP is Nc [0], Nc [1], Nc [2], Nc [3], Nc [4], Nc [ 5], Nc [6], Nc [7], and the number of displayed pictures in each GOP is Nd [0], Nd [1], Nd [2], Nd [3], Nd [4], Nd [5] , Nd [6] and Nd [7]. That is, the number of encoded pictures and the number of displayed pictures are always held for 8 GOPs from the immediately preceding GOP. When these are applied to Equation 1, the encoded frame rate predicted by multiplying the total number from Nc [0] to Nc [7] by the total number from Nd [0] to Nd [7] and the frame rate information. Can be obtained as

次に、ステップＳ２０５において、ステップＳ２０４で算出した符号化フレームレート（coded_frame_rate）を符号化フレームレート情報２１１としてＧＯＰ符号量予測部２０２に出力しステップＳ２０６に進む。 Next, in step S205, the encoded frame rate (coded_frame_rate) calculated in step S204 is output to the GOP code amount prediction unit 202 as encoded frame rate information 211, and the process proceeds to step S206.

次に、ステップＳ２０６において、符号化情報１１２から現在のＧＯＰの符号化ピクチャ枚数と表示ピクチャ枚数にそれぞれ１または（１＋リピートピクチャ枚数）を加算してステップＳ２０２に戻る。すなわち、符号化ピクチャ枚数には常に１を加算し、表示ピクチャ枚数には（１＋リピートピクチャ枚数）を加算する。 Next, in step S206, 1 or (1 + the number of repeat pictures) is added to the number of encoded pictures and the number of displayed pictures of the current GOP from the encoding information 112, and the process returns to step S202. That is, 1 is always added to the number of encoded pictures, and (1 + number of repeat pictures) is added to the number of displayed pictures.

符号量予測手段としてのＧＯＰ符号量予測部２０２は、入力される符号化情報１１２、ＧＯＰ構造情報２１０、符号化フレームレート情報２１１および設定データ端子１１１より設定される出力目標ビットレートに基づいてＧＯＰの割当て符号量を算出し、ＧＯＰ構造情報２１０に含まれるＰピクチャ予測枚数、Ｂピクチャ予測枚数と、符号化フレームレート情報２１１に含まれる符号化フレームレートとともに解析結果情報１１４として符号化部１０３へ出力する。 The GOP code amount prediction unit 202 serving as a code amount prediction unit performs GOP based on the input encoding information 112, the GOP structure information 210, the encoding frame rate information 211, and the output target bit rate set from the setting data terminal 111. Is allocated to the encoding unit 103 as analysis result information 114 together with the predicted number of P pictures, the predicted number of B pictures included in the GOP structure information 210, and the encoded frame rate included in the encoded frame rate information 211. Output.

ＧＯＰ符号量予測部２０２の動作を図１０のフローチャートを参照して説明する。 The operation of the GOP code amount prediction unit 202 will be described with reference to the flowchart of FIG.

まず、ステップＳ３０１において、復号部１０１に入力された画像のピクチャタイプが符号化情報１１２を経由して入力されステップＳ３０２に進む。 First, in step S301, the picture type of the image input to the decoding unit 101 is input via the encoding information 112, and the process proceeds to step S302.

次に、ステップＳ３０２において、復号部１０１に入力された画像のピクチャタイプがＩピクチャか否かを判断してＩピクチャである場合（ＹＥＳの場合）はステップＳ３０３に進み、そうでない場合（ＮＯの場合）はステップＳ３０１に戻る。 Next, in step S302, it is determined whether or not the picture type of the image input to the decoding unit 101 is an I picture. If the picture type is an I picture (in the case of YES), the process proceeds to step S303; If), the process returns to step S301.

次に、ステップＳ３０３において、ＧＯＰ構造情報２１０が更新されたか否かを判断し、更新された場合（ＹＥＳの場合）はステップＳ３０４に進み、そうでない場合（ＮＯの場合）は更新されるまで待機する。 Next, in step S303, it is determined whether or not the GOP structure information 210 has been updated. If updated (in the case of YES), the process proceeds to step S304, and if not (in the case of NO), the process waits until it is updated. To do.

次に、ステップＳ３０４において、符号化フレームレート情報２１１が更新されたか否かを判定し、更新された場合（ＹＥＳの場合）はステップＳ３０５に進み、そうでない場合（ＮＯの場合）は更新されるまで待機する。 Next, in step S304, it is determined whether or not the encoded frame rate information 211 has been updated. If updated (in the case of YES), the process proceeds to step S305, and if not (in the case of NO), it is updated. Wait until.

次に、ステップＳ３０５において、符号化情報１１２、ＧＯＰ構造情報２１０、符号化フレームレート情報２１１および設定データ端子１１１より設定される出力目標ビットレートに基づいてＧＯＰ当たりの割当て符号量を算出しステップＳ３０６へ進む。 Next, in step S305, the allocated code amount per GOP is calculated based on the encoded information 112, the GOP structure information 210, the encoded frame rate information 211, and the output target bit rate set from the setting data terminal 111, and in step S306. Proceed to

ＧＯＰ当たりの割当て符号量GOP_bitsは、AVR_NpをＧＯＰ構造予測部２００で予測した平均Ｐピクチャ枚数、AVR_NbをＧＯＰ構造予測部２００で予測した平均Ｂピクチャ枚数、coded_frame_rateを符号化フレームレート予測部２０１で予測した符号化フレームレート、bitrateを設定データ端子１１１より予め設定される出力目標ビットレートとすると、例えば数２に示す式で表される。 The allocated code amount GOP_bits per GOP is the average number of P pictures predicted by the GOP structure prediction unit 200 for AVR_Np, the average number of B pictures predicted by the GOP structure prediction unit 200, and the coded_frame_rate is predicted by the encoded frame rate prediction unit 201. Assuming that the encoded frame rate and bitrate are output target bit rates set in advance from the setting data terminal 111, they are expressed by, for example, the equation (2).

数２に示した式中の１となっている部分はＩピクチャの枚数を意味しており、フィールド符号化時は、１フレームが２フィールドであることから、ＧＯＰ内の平均Ｉフィールド枚数の１／２を算出してそれに置き換えればよい。(この場合、AVR_NpおよびAVR_Nbも夫々フィールド枚数の１／２で算出される。) The portion of 1 in the equation shown in Equation 2 means the number of I pictures, and at the time of field coding, since one frame is 2 fields, 1 of the average number of I fields in the GOP. / 2 can be calculated and replaced. (In this case, AVR_Np and AVR_Nb are also calculated by half the number of fields.)

次に、ステップＳ３０６において、ステップＳ３０５で算出したＧＯＰ当たりの割当て符号量（GOP_bits）とＧＯＰ構造情報２１０と符号化フレームレート情報２１１とを解析結果情報１１４として符号化部１０３に出力しステップＳ３０１に戻る。 Next, in step S306, the allocated code amount (GOP_bits) per GOP calculated in step S305, the GOP structure information 210, and the encoded frame rate information 211 are output to the encoding unit 103 as analysis result information 114, and the process proceeds to step S301. Return.

符号化手段としての符号化部１０３は、符号化情報１１２、解析結果情報１１４および設定データ端子１１１から入力される初期設定に基づいて復号部１０１から入力される画像データ１１３を再符号化し、出力ビットストリームを出力端子１１５から出力する。すなわち、符号化情報解析部１０２で予測したＧＯＰ当たりの割当て符号量に基づいて符号量制御を行ったり、ＰピクチャとＢピクチャの予測枚数から各ピクチャへの符号量の割当てなどを行う。また、符号化情報１１２や符号化部１０３内部での情報などから符号量制御に変化を加えてもよい。例えば、解析結果情報１１４のＧＯＰ当たり割当て符号量などを得た上で、入力される画像データ１１３の複雑度を基にＧＯＰやピクチャの割当て符号量を増減してもよい。 The encoding unit 103 as an encoding unit re-encodes the image data 113 input from the decoding unit 101 based on the encoding information 112, the analysis result information 114, and the initial setting input from the setting data terminal 111, and outputs the result. The bit stream is output from the output terminal 115. That is, code amount control is performed based on the allocated code amount per GOP predicted by the encoded information analysis unit 102, and the code amount is assigned to each picture based on the predicted number of P pictures and B pictures. Also, the code amount control may be changed based on the encoded information 112, information in the encoding unit 103, and the like. For example, after obtaining the allocated code amount per GOP of the analysis result information 114, the allocated code amount of GOP or picture may be increased or decreased based on the complexity of the input image data 113.

本実施例によれば、ＧＯＰ構造予測部２００において、８つのＧＯＰ内のＰピクチャ枚数のうち、最大値と最小値を持つＧＯＰを除いた６つのＧＯＰからＰピクチャ枚数とＢピクチャ枚数の平均値を算出し、それを予測したＧＯＰ構造とする。符号化フレームレート予測部２０１において、符号化ピクチャ枚数と表示ピクチャ枚数と符号化情報１１２から得られたフレームレートから符号化フレームレートを算出し、それを予測した符号化フレームレートとする。そして、予測したＧＯＰ構造と予測した符号化フレームレートと設定データ端子１１１から設定したビットレートからＧＯＰ当たりの符号量を算出し、それを予測した符号量とする。符号化部１０３では、前記予測したＧＯＰ構造と、予測した符号化フレームレートと、予測した符号量を参照して再符号化を行う。このようにすることで、ＧＯＰ構造やフレームレートおよび符号量の予測精度が上がるとともに、ＧＯＰ構造予測やフレームレート予測および符号量予測のために複数の画像をメモリに記憶することが必要ないために遅延時間やメモリ量の削減が出来る。 According to the present embodiment, in the GOP structure prediction unit 200, the average value of the number of P pictures and the number of B pictures from six GOPs excluding the GOP having the maximum value and the minimum value among the number of P pictures in the eight GOPs. And the predicted GOP structure. The encoded frame rate prediction unit 201 calculates the encoded frame rate from the number of encoded pictures, the number of displayed pictures, and the frame rate obtained from the encoded information 112, and sets it as the predicted encoded frame rate. Then, a code amount per GOP is calculated from the predicted GOP structure, the predicted encoding frame rate, and the bit rate set from the setting data terminal 111, and is used as the predicted code amount. The encoding unit 103 performs re-encoding with reference to the predicted GOP structure, the predicted encoding frame rate, and the predicted code amount. By doing so, the prediction accuracy of the GOP structure, the frame rate, and the code amount is improved, and it is not necessary to store a plurality of images in the memory for the GOP structure prediction, the frame rate prediction, and the code amount prediction. Delay time and memory amount can be reduced.

なお、上述した実施例においてＧＯＰ構造予測部２００は、ＧＯＰ毎のＰピクチャの枚数のうち最大値を持つＧＯＰと最小値を持つＧＯＰを除いて、ＰピクチャとＢピクチャのＧＯＰの平均枚数を算出していたが、図１１に示すようにＰピクチャの最大値または最小値が複数あった場合は、Ｂピクチャの枚数を比較して、Ｂピクチャの平均値からの差異が大きいＧＯＰを除くようにしてもよい。図１１では、Ｐピクチャの最大値がＮｐ［２］とＮｐ［３］であって、最小値がＮｐ［６］とＮｐ［７］であるため、Ｎｂ［２］とＮｂ［３］およびＮｂ［６］とＮｂ［７］のＢピクチャの枚数を比較しＢピクチャの平均値（Ｎｂ［０］〜Ｎｂ［７］の平均値）からの差異が大きいＮｂ［２］を含むＧＯＰを最大値として、Ｎｂ［６］を含むＧＯＰを最小値として、それぞれ除いてＰピクチャの平均枚数ＡＶＲ＿ＮｐとＢピクチャの平均枚数ＡＶＲ＿Ｎｂを求める。また、このような方法をとらずに複数の最大値や複数の最小値をとるＧＯＰからそれぞれ任意に１つずつＧＯＰを選択して削除しても良い。 In the above-described embodiment, the GOP structure prediction unit 200 calculates the average number of GOPs of P and B pictures except for the GOP having the maximum value and the GOP having the minimum value among the number of P pictures for each GOP. However, as shown in FIG. 11, when there are a plurality of maximum values or minimum values of P pictures, the number of B pictures is compared to exclude GOPs that have a large difference from the average value of B pictures. May be. In FIG. 11, since the maximum values of the P picture are Np [2] and Np [3] and the minimum values are Np [6] and Np [7], Nb [2], Nb [3] and Nb The number of B pictures of [6] and Nb [7] is compared, and the GOP including Nb [2] having a large difference from the average value of B pictures (average value of Nb [0] to Nb [7]) is the maximum value. The average number AVR_Np of P pictures and the average number AVR_Nb of B pictures are obtained by removing GOPs including Nb [6] as minimum values. Further, GOPs may be arbitrarily selected and deleted one by one from GOPs having a plurality of maximum values and a plurality of minimum values without taking such a method.

また、ＧＯＰ構造を予測する際に除くＧＯＰは、最大値または最小値のうちいずれか一方のみでも良い。さらに、複数のＧＯＰにおける前方向予測符号化画像の枚数のうち大きい順に複数除いても良いし、小さい順に複数除いても良い。図１１で例えるとＮｐ［２］とＮｐ［３］の両方を除いてもよいし、Ｎｐ［６］とＮｐ［７］の両方を除いても良い。 Further, the GOP excluded when predicting the GOP structure may be only one of the maximum value and the minimum value. Furthermore, a plurality of forward-predicted encoded images in a plurality of GOPs may be removed in descending order, or a plurality may be removed in ascending order. For example, in FIG. 11, both Np [2] and Np [3] may be removed, or both Np [6] and Np [7] may be removed.

また、上述した実施例では動画像再符号化装置１００として説明したが、コンピュータ上で動作するプログラムとすることもできる。すなわち、動画像再符号化装置１００をＣＰＵとＲＡＭ、ＲＯＭに置き換えて、ＲＯＭに動画像再符号化プログラムを記録しておき、ＣＰＵに復号部１０１、符号化部１０３、符号化情報解析部１０２の機能を持たせる。図１２に動画像再符号化プログラムのフローチャートを示す。 In the above-described embodiment, the moving image re-encoding device 100 has been described. However, a program that runs on a computer may be used. That is, the moving image re-encoding device 100 is replaced with a CPU, RAM, and ROM, and a moving image re-encoding program is recorded in the ROM, and the decoding unit 101, the encoding unit 103, and the encoded information analysis unit 102 are stored in the CPU. Have the function of. FIG. 12 shows a flowchart of the moving image re-encoding program.

まず、復号手段としてのステップＳ４０１で入力ビットストリームを復号する。次に、画像群構造予測手段としてのステップＳ４０２でＧＯＰ構造予測を行う。ステップＳ４０２の内容は図４のフローチャートと同様である。次に、フレームレート予測手段としてのステップＳ４０３で符号化フレームレートを予測する。ステップＳ４０３の内容は図７のフローチャートと同様である。次に、符号量予測手段としてのステップＳ４０４でＧＯＰ当たりの割当て符号量を予測する。ステップＳ４０４の内容は図１０のフローチャートと同様である。そして、符号化手段としてのステップＳ４０５で再符号化を行う。以上のステップをピクチャ単位で繰り返すことで動画像再符号化プログラムとして機能することができる。 First, the input bit stream is decoded in step S401 as decoding means. Next, GOP structure prediction is performed in step S402 as image group structure prediction means. The content of step S402 is the same as that of the flowchart of FIG. Next, the encoding frame rate is predicted in step S403 as a frame rate prediction means. The content of step S403 is the same as that of the flowchart of FIG. Next, the allocated code amount per GOP is predicted in step S404 as the code amount predicting means. The content of step S404 is the same as that of the flowchart of FIG. Then, re-encoding is performed in step S405 as encoding means. By repeating the above steps for each picture, it is possible to function as a moving image re-encoding program.

また、上述した実施例ではピクチャをフレームとして説明したが、フィールドでも実施可能である。 In the above-described embodiment, the picture is described as a frame, but the present invention can also be implemented in the field.

また、本発明における復号手段と符号化手段は、ＭＰＥＧ−１，２，４やＨ．２６４など画面内符号化画像と、前方向予測符号化画像と、双方向予測符号化画像のうち少なくとも前記画面内符号化画像が１枚以上含まれる画像群から構成されるビットストリームとなる動画像の符号化方式であれば、いずれの組合せでもよい。勿論、同じ符号化方式間でビットレートの変換に用いても良い。 The decoding means and the encoding means in the present invention are MPEG-1, 2, 4 and H.264. H.264 or other intra-coded image, forward prediction encoded image, and moving image that is a bit stream composed of an image group including at least one intra-coded image among bidirectional predictive encoded images Any combination may be used as long as it is an encoding method. Of course, you may use for the bit rate conversion between the same encoding systems.

また、動画像を放送局で放送する際や通信回線に送信する際に本発明を適用すれば、放送波や通信回線の帯域に合わせた符号化方式やビットレートで低遅延かつ低コストに動画像を再符号化することができる。また、例えば地上デジタル放送など符号化された動画像を、ハードディスクレコーダやＤＶＤレコーダやＢｌｕ−ｒａｙレコーダおよびＨＤＤＶＤレコーダなどに記録する際に本発明を適用すれば、再符号化前よりも高効率な符号化方式に低遅延かつ低コストに記録することができる。 In addition, if the present invention is applied when broadcasting a moving image at a broadcasting station or transmitting it to a communication line, a moving image with a low delay and a low cost can be obtained with an encoding method or bit rate that matches the band of the broadcast wave or communication line. The image can be re-encoded. Further, for example, when the present invention is applied when recording an encoded moving image such as terrestrial digital broadcasting on a hard disk recorder, a DVD recorder, a Blu-ray recorder, an HD DVD recorder, or the like, it is more efficient than before re-encoding. Can be recorded with a low delay and a low cost.

前述した実施例によれば、以下の動画像再符号化装置、動画像再符号化方法および動画像再符号化プログラムが得られる。 According to the embodiment described above, the following moving image re-encoding device, moving image re-encoding method, and moving image re-encoding program can be obtained.

（付記１）Ｉピクチャと、Ｐピクチャと、Ｂピクチャのうち少なくともＩピクチャが１枚以上含まれるＧＯＰから構成されるビットストリームが入力され該ビットストリームを復号し、画像データ１１３および符号化情報１１２を得る復号部１０１と、
復号部１０１が復号した画像データ１１３および符号化情報１１２に基づいて再度画像データ１１３を符号化する符号化部１０３と、を有した動画像再符号化装置１００において、
符号化情報１１２に基づいて、複数のＧＯＰ各々におけるＰピクチャの数をカウントし、カウントされた複数のＧＯＰ各々におけるＰピクチャの数の最大値または最小値を持つＧＯＰのうち少なくともいずれか一方のＧＯＰを一つ以上除いた複数のＧＯＰにおけるＰピクチャおよびＢピクチャの平均値を予測したＧＯＰの構造として符号化部１０３に出力するＧＯＰ構造予測部２００を有することを特徴とする動画像再符号化装置１００。(Supplementary note 1) A bit stream composed of a GOP including at least one I picture among I pictures, P pictures, and B pictures is input, the bit stream is decoded, and image data 113 and encoding information 112 are decoded. A decoding unit 101 for obtaining
In the moving image re-encoding device 100 having the encoding unit 103 that encodes the image data 113 again based on the image data 113 and the encoding information 112 decoded by the decoding unit 101,
Based on the encoding information 112, the number of P pictures in each of the plurality of GOPs is counted, and at least one of the GOPs having the maximum value or the minimum value of the number of P pictures in each of the plurality of counted GOPs And a GOP structure prediction unit 200 that outputs to a coding unit 103 as a GOP structure in which an average value of P pictures and B pictures in a plurality of GOPs is excluded. 100.

この動画像再符号化装置１００によれば、一時的に現れた極端なＰピクチャの枚数を持つＧＯＰに影響されることなく、より平均的なＧＯＰの構造を予測することができる。また、ＧＯＰの構造予測のために余分な画像を保持する必要が無いので遅延時間やメモリ量が削減される。 According to the moving image re-encoding apparatus 100, it is possible to predict a more average GOP structure without being influenced by a GOP having an extremely large number of P pictures that appear temporarily. Further, since there is no need to hold an extra image for the GOP structure prediction, the delay time and the amount of memory are reduced.

（付記２）Ｉピクチャと、Ｐピクチャと、Ｂピクチャのうち少なくともＩピクチャが１枚以上含まれるＧＯＰから構成されるビットストリームが入力され該ビットストリームを復号し、画像データ１１３および符号化情報１１２を得て、復号した画像データ１１３および符号化情報１１２に基づいて再度画像データ１１３を符号化する動画像再符号化方法において、
符号化情報１１２に基づいて、複数のＧＯＰ各々におけるＰピクチャの数をカウントし、カウントされた複数のＧＯＰ各々におけるＰピクチャの数の最大値または最小値を持つＧＯＰのうち少なくともいずれか一方のＧＯＰを一つ以上除いた複数のＧＯＰにおけるＰピクチャおよびＢピクチャの平均値を予測したＧＯＰの構造として符号化時に用いることを特徴とする動画像再符号化方法。(Supplementary note 2) A bit stream composed of a GOP including at least one I picture among an I picture, a P picture, and a B picture is input, the bit stream is decoded, and image data 113 and coding information 112 are decoded. In the moving image re-encoding method for encoding the image data 113 again based on the decoded image data 113 and the encoded information 112,
Based on the encoding information 112, the number of P pictures in each of the plurality of GOPs is counted, and at least one of the GOPs having the maximum value or the minimum value of the number of P pictures in each of the plurality of counted GOPs A moving picture re-encoding method, wherein an average value of a P picture and a B picture in a plurality of GOPs excluding one or more is used as a GOP structure that is predicted.

この動画像再符号化方法によれば、一時的に現れた極端なＰピクチャの枚数を持つＧＯＰに影響されることなく、より平均的なＧＯＰの構造を予測することができる。また、ＧＯＰの構造予測のために余分な画像を保持する必要が無いので遅延時間やメモリ量が削減される。 According to this moving image re-encoding method, it is possible to predict a more average GOP structure without being influenced by a GOP having an extreme number of P pictures that appear temporarily. Further, since there is no need to hold an extra image for the GOP structure prediction, the delay time and the amount of memory are reduced.

（付記３）Ｉピクチャと、Ｐピクチャと、Ｂピクチャのうち少なくともＩピクチャが１枚以上含まれるＧＯＰから構成されるビットストリームが入力され該ビットストリームを復号し、画像データ１１３および符号化情報１１２を得るステップＳ４０１と、
ステップＳ４０１が復号した画像データ１１３および符号化情報１１２に基づいて再度画像データ１１３を符号化するステップＳ４０５と
してコンピュータに機能させる動画像再符号化プログラムにおいて、
符号化情報１１２に基づいて、複数のＧＯＰ各々におけるＰピクチャの数をカウントし、カウントされた複数のＧＯＰ各々におけるＰピクチャの数の最大値または最小値を持つＧＯＰのうち少なくともいずれか一方のＧＯＰを一つ以上除いた複数のＧＯＰにおけるＰピクチャおよびＢピクチャの平均値を予測したＧＯＰの構造としてステップＳ４０５に出力するステップＳ４０２としてコンピュータに機能させることを特徴とするコンピュータ読み取り可能な動画像再符号化プログラム。(Supplementary Note 3) A bit stream composed of a GOP including at least one I picture among an I picture, a P picture, and a B picture is input and decoded, and the image data 113 and the encoding information 112 are decoded. Obtaining step S401;
In the moving image re-encoding program that causes the computer to function as step S405 that re-encodes the image data 113 based on the image data 113 and the encoding information 112 decoded in step S401,
Based on the encoding information 112, the number of P pictures in each of the plurality of GOPs is counted, and at least one of the GOPs having the maximum value or the minimum value of the number of P pictures in each of the plurality of counted GOPs A computer-readable moving image re-encoding that causes a computer to function as step S402 that outputs to step S405 as a GOP structure in which an average value of P pictures and B pictures in a plurality of GOPs excluding one or more is excluded Program.

この動画像再符号化プログラムによれば、一時的に現れた極端なＰピクチャの枚数を持つＧＯＰに影響されることなく、より平均的なＧＯＰの構造を予測することができる。また、ＧＯＰの構造予測のために余分な画像を保持する必要が無いので遅延時間やメモリ量が削減される。 According to this moving image re-encoding program, it is possible to predict a more average GOP structure without being affected by a GOP having an extreme number of P pictures that appear temporarily. Further, since there is no need to hold an extra image for the GOP structure prediction, the delay time and the amount of memory are reduced.

なお、前述した実施例は本発明の代表的な形態を示したに過ぎず、本発明は、実施例に限定されるものではない。すなわち、本発明の骨子を逸脱しない範囲で種々変形して実施することができる。 In addition, the Example mentioned above only showed the typical form of this invention, and this invention is not limited to an Example. That is, various modifications can be made without departing from the scope of the present invention.

Claims

A bit stream composed of an image group including at least one of the intra-coded images among the intra-coded image, the forward predictive coded image, and the bidirectional predictive coded image is input to the bit stream. Decoding means for obtaining image data and encoded information;
In the moving image re-encoding device, the encoding unit that encodes the image data again based on the image data and the encoding information decoded by the decoding unit,
Based on the encoding information, the number of the forward prediction encoded images in each of the plurality of image groups is counted, and the maximum number of the forward prediction encoded images in each of the plurality of the image groups thus counted Or the average value of the forward prediction encoded image and the bidirectional predictive encoded image in a plurality of the image groups excluding at least one of the image groups of the image group having the minimum value, A moving image re-encoding apparatus comprising: an image group structure prediction unit that outputs the predicted structure of the image group to the encoding unit.

The image group structure prediction means excludes one or more of the image groups having the maximum value and the minimum value from the number of forward prediction encoded images in each of the counted plurality of images, respectively. The moving image re-encoding according to claim 1, wherein an average value of the forward prediction encoded image and the bidirectional predictive encoded image in a group is output to the encoding unit as a structure of the image group. apparatus.

The forward prediction in the plurality of image groups excluding the plurality of image groups in descending order from the number of the forward prediction encoded images in each of the plurality of counted image groups. The moving image re-encoding apparatus according to claim 1 or 2, wherein an average value of the encoded image and the bidirectional predictive encoded image is output to the encoding means as the structure of the image group.

The forward prediction code in the plurality of image groups excluding the plurality of image groups in ascending order of the number of forward prediction encoded images in each of the plurality of counted image groups. 4. The moving image re-encoding according to claim 1, wherein an average value of the encoded image and the bi-predictive encoded image is output to the encoding unit as the structure of the image group. 5. Device.

The image group structure prediction means counts the number of the bidirectional prediction encoded images in each of the plurality of image groups, and among the counted number of the forward prediction encoded images in each of the plurality of image groups counted, When there are a plurality of image groups each having a maximum value or a minimum value, an average of the bidirectional predictive encoded images in the plurality of image groups counted among the image groups having a plurality of maximum values or minimum values. An average value of the forward prediction encoded image and the bidirectional prediction encoded image in a plurality of the image groups excluding the image group having a value far from the value is output to the encoding unit as the structure of the image group The moving image re-encoding device according to any one of claims 1 to 4, wherein:

Based on the encoding information, the number of encoded images in each of the plurality of image groups and the number of images to be displayed in each of the plurality of image groups are counted, and the codes in each of the plurality of image groups are counted. Rate prediction for predicting a frame rate at which the encoding means encodes from the ratio between the number of converted images and the number of images to be displayed in each of the plurality of image groups and the encoding information The moving image re-encoding device according to any one of claims 1 to 5, further comprising: means.

Based on the structure of the image group predicted by the image group structure prediction unit, the frame rate predicted by the frame rate prediction unit, and the bit rate when the encoding unit is re-encoded in advance. The moving image re-encoding according to any one of claims 1 to 6, further comprising: a code amount predicting unit that predicts a code amount per group of images when the encoding unit performs encoding. Device.

A bit stream composed of an image group including at least one of the intra-coded images among the intra-coded image, the forward predictive coded image, and the bidirectional predictive coded image is input to the bit stream. In the moving image re-encoding method, the image data and the encoding information are obtained, and the image data is encoded again based on the decoded image data and the encoding information.
Based on the encoding information, the number of the forward prediction encoded images in each of the plurality of image groups is counted, and the maximum number of the forward prediction encoded images in each of the plurality of the image groups thus counted Or the average value of the forward prediction encoded image and the bidirectional predictive encoded image in a plurality of the image groups excluding at least one of the image groups of the image group having the minimum value, A moving image re-encoding method, which is used at the time of encoding as the predicted structure of the image group.

A bit stream composed of an image group including at least one of the intra-coded images among the intra-coded image, the forward predictive coded image, and the bidirectional predictive coded image is input to the bit stream. Decoding means for obtaining image data and encoded information;
In a moving image re-encoding program that causes a computer to function as an encoding unit that encodes the image data again based on the image data and the encoding information decoded by the decoding unit,
Based on the encoding information, the number of the forward prediction encoded images in each of the plurality of image groups is counted, and the maximum number of the forward prediction encoded images in each of the plurality of the image groups thus counted Or the average value of the forward prediction encoded image and the bidirectional predictive encoded image in a plurality of the image groups excluding at least one of the image groups of the image group having the minimum value, A computer-readable moving image re-encoding program that causes a computer to function as an image group structure prediction unit that outputs the predicted structure of the image group to the encoding unit.

A computer-readable recording medium storing the moving image re-encoding program according to claim 9.