JP2008098734A

JP2008098734A - Motion image encoding device, program, and motion image encoding method

Info

Publication number: JP2008098734A
Application number: JP2006275028A
Authority: JP
Inventors: Masanobu Yasugi; 将伸八杉
Original assignee: Sharp Corp
Current assignee: Sharp Corp
Priority date: 2006-10-06
Filing date: 2006-10-06
Publication date: 2008-04-24
Anticipated expiration: 2026-10-06
Also published as: JP4846504B2

Abstract

<P>PROBLEM TO BE SOLVED: To improve the encoding efficiency of a frame image that has a scene change. <P>SOLUTION: A motion image encoding device includes a frame-field converter which converts one frame image into two field images, a scene change detector which detects whether a scene change as a switching point of scenes is included between the two field images, and an image information encoder which encodes the two field images by using an in-picture prediction mode or an inter-picture prediction mode according to the detection result of the scene change detector. <P>COPYRIGHT: (C)2008,JPO&INPIT

Description

本発明は、動画像符号化装置、プログラム及び動画像符号化方法、特に、画面内予測モード又は画面間予測モードを用いて符号化を行なう動画像符号化装置、プログラム及び動画像符号化方法に関する。 The present invention relates to a moving image encoding device, a program, and a moving image encoding method, and more particularly, to a moving image encoding device, a program, and a moving image encoding method that perform encoding using an intra prediction mode or an inter prediction mode. .

現在、通信・放送・蓄積などの各種用途において、デジタル化された動画像情報を高効率に圧縮する動画像符号化方式が標準規格として普及している。非特許文献１や非特許文献２には、標準規格化された動画像符号化方式の例が記載されている。
非特許文献１や非特許文献２をはじめとする符号化方式では、符号化の単位となる画像はフレーム画像又はフィールド画像であり、フレーム画像は２枚のフィールド画像から構成されることがある。 Currently, in various applications such as communication, broadcasting, and storage, a moving picture encoding method that compresses digitized moving picture information with high efficiency is widely used as a standard. Non-Patent Document 1 and Non-Patent Document 2 describe examples of standardized moving image encoding methods.
In encoding methods including Non-Patent Document 1 and Non-Patent Document 2, an image that is a unit of encoding is a frame image or a field image, and the frame image may be composed of two field images.

図１３は、従来の動画像再符号化装置５００の構成を示すブロック図である。動画像再符号化装置５００は、ピクチャタイプ判別部５０１、符号化情報解析部５０２、符号化情報復号部５０３、間引き部５０４、ビデオメモリ５０５、画像情報符号化部５０６、動きベクトル合成部５０７、動きベクトル検出部５０８、情報バッファ５０９、コンプレキシティ算出部５１０、シーンチェンジ検出部５１１、ＧＯＰ構造決定部５１２を備えている。
動画像再符号化装置５００は、入力される第１の符号化動画像情報を、水平及び垂直方向の画素数の間引きとフレームレートの削減を施した上で、第２の符号化動画像情報に再符号化して出力する。 FIG. 13 is a block diagram showing a configuration of a conventional moving image re-encoding device 500. As shown in FIG. The moving image re-encoding device 500 includes a picture type determining unit 501, an encoded information analyzing unit 502, an encoded information decoding unit 503, a thinning unit 504, a video memory 505, an image information encoding unit 506, a motion vector synthesizing unit 507, A motion vector detection unit 508, an information buffer 509, a complexity calculation unit 510, a scene change detection unit 511, and a GOP structure determination unit 512 are provided.
The moving image re-encoding apparatus 500 performs the second encoded moving image information after thinning out the number of pixels in the horizontal and vertical directions and reducing the frame rate of the input first encoded moving image information. Re-encode and output.

ピクチャタイプ判別部５０１は、第１の符号化動画像情報についてピクチャタイプの判別とＢピクチャの破棄を行なう。符号化情報解析部５０２は、第１の符号化動画像を解析して、動きベクトル、量子化幅、符号量を取得し出力する。
符号化情報復号部５０３は、第１の符号化動画像情報を復号してフレーム画像を出力する。間引き部５０４は、フレーム画像の画素を間引いて縮小する。ビデオメモリ５０５は、入力された画像を蓄積する。
画像情報符号化部５０６は、第２の符号化方式を用いて画像を符号化する。符号量の制御には後述のコンプレキシティを用いる。動きベクトル合成部５０７は、第１の符号化動画像情報で用いられた動きベクトルから、再符号化に用いる動きベクトルを合成する。
動きベクトル検出部５０８は、与えられた動きベクトルに基づいてより高精度な動きベクトルを検出する。情報バッファ５０９は、量子化幅と符号量を蓄積する。コンプレキシティ算出部５１０は、符号量と量子化幅の積であるコンプレキシティの算出及び推定を行なう。
シーンチェンジ検出部５１１は、コンプレキシティの変動量によってフレーム画像間のシーンチェンジを検出する。ＧＯＰ構造決定部５１２は、フレーム毎にピクチャタイプを選択し、ＧＯＰの構造を決定する。
この動画像再符号化装置５００では、ピクチャタイプ判別部５０１に入力される第１の符号化動画像情報を符号化情報復号部５０３が復号化し、その第１の符号化動画像情報を構成する各フレーム画像を、画像情報符号化部５０６で第２の符号化動画像情報に再符号化する。 The picture type determination unit 501 determines the picture type and discards the B picture for the first encoded moving picture information. The encoded information analysis unit 502 analyzes the first encoded moving image, acquires a motion vector, a quantization width, and a code amount and outputs them.
The encoded information decoding unit 503 decodes the first encoded moving image information and outputs a frame image. The thinning unit 504 thins and reduces the pixels of the frame image. The video memory 505 stores the input image.
The image information encoding unit 506 encodes an image using the second encoding method. The complexity described later is used to control the code amount. The motion vector synthesis unit 507 synthesizes a motion vector used for re-encoding from the motion vector used in the first encoded video information.
The motion vector detection unit 508 detects a more accurate motion vector based on the given motion vector. The information buffer 509 stores the quantization width and the code amount. The complexity calculation unit 510 calculates and estimates the complexity that is the product of the code amount and the quantization width.
The scene change detection unit 511 detects a scene change between frame images based on a variation amount of complexity. The GOP structure determination unit 512 selects a picture type for each frame and determines the GOP structure.
In this moving image re-encoding device 500, the encoded information decoding unit 503 decodes the first encoded moving image information input to the picture type determining unit 501, and configures the first encoded moving image information. Each frame image is re-encoded by the image information encoding unit 506 into the second encoded moving image information.

動画像再符号化装置５００では、再符号化の際にＢピクチャを破棄することから、そのままでは再符号化後の動画像では相対的にＩピクチャの使用頻度が増えて符号化効率が低下してしまう。このためＧＯＰ構造決定部５１２は符号化効率の低下を防ぐ目的で、Ｉピクチャである一部のフレーム画像を、Ｉピクチャよりも符号化効率の良いＰピクチャに変更する。ただし、再符号化対象のフレーム画像の直前でシーンチェンジが検出された場合には、Ｐピクチャに変更しないことを決定し、画質劣化を軽減する。 Since the moving picture re-encoding apparatus 500 discards the B picture at the time of re-encoding, the use frequency of the I picture is relatively increased in the moving picture after re-encoding, and the encoding efficiency is lowered. End up. For this reason, the GOP structure determination unit 512 changes some frame images, which are I pictures, to P pictures with better encoding efficiency than the I pictures in order to prevent a decrease in encoding efficiency. However, when a scene change is detected immediately before the frame image to be re-encoded, it is determined not to change to a P picture, and image quality deterioration is reduced.

以上のようにして、特許文献１に記載の従来技術による画像再符号化装置５００は、フレーム画像間のシーンチェンジに起因する画質劣化を軽減しつつ、第１の符号化動画像情報から第２の符号化動画像情報へと再符号化を行なう。
また、特許文献２には、再符号化において再符号化対象のピクチャの近隣ピクチャにおける動きベクトルを用いて再符号化対象のピクチャの動きベクトルを推定する技術が開示されている。また、非特許文献３には、画像間差分の非類似度に基づくシーンチェンジの検出方法の技術が開示されている。
特開２００２−１５２７５９号公報特開２０００−１６５８８７号公報ＩＳＯ／ＩＥＣ１３８１８−２ＩＳＯ／ＩＥＣ１４４９６−１０特許庁標準技術集「映像ショット切り換え検出手法」（http://www.jpo.go.jp/shiryou/s_sonota/hyoujun_gijutsu/bidirectional_video/111_15.htm） As described above, the image re-encoding device 500 according to the conventional technique described in Patent Document 1 reduces the image quality deterioration caused by the scene change between the frame images, and reduces the second from the first encoded moving image information. Re-encoding into the encoded moving image information.
Patent Document 2 discloses a technique for estimating a motion vector of a picture to be recoded using a motion vector in a neighboring picture of the picture to be recoded in recoding. Non-Patent Document 3 discloses a technique of a scene change detection method based on a dissimilarity between image differences.
JP 2002-152759 A JP 2000-165887 A ISO / IEC 13818-2 ISO / IEC 14496-10 JPO Standard Technology Collection “Video Shot Switching Detection Method” (http://www.jpo.go.jp/shiryou/s_sonota/hyoujun_gijutsu/bidirectional_video/111_15.htm)

特許文献１に記載されている技術では、フレーム画像が２枚のフィールド画像から構成されている場合、フィールド画像の間には時刻の差があるためにシーンチェンジが起こり得る。 In the technique described in Patent Document 1, when a frame image is composed of two field images, a scene change may occur because there is a time difference between the field images.

図１４（ａ）及び図１４（ｂ）は、動画像にシーンチェンジが起こる場合について説明するための図である。
図１４（ａ）では、時間の経過とともに、フレーム画像が４１、４２、４３と変化している。フレーム画像４２は、フレーム画像４２の奇数ラインの画像からなるフィールド画像４２１と、フレーム画像４２の偶数ラインの画像からなるフィールド画像４２２とからなる。フレーム画像４１はシーンＡに属しており、フレーム画像４２、４３はシーンＢに属している。フレーム画像４１とフレーム画像４２との間で、シーンＡからシーンＢへ変化するシーンチェンジが起こっている。このようなシーンチェンジを、以降ではフレーム間シーンチェンジと呼ぶ。 FIG. 14A and FIG. 14B are diagrams for explaining a case where a scene change occurs in a moving image.
In FIG. 14A, the frame images change to 41, 42, and 43 with the passage of time. The frame image 42 includes a field image 421 composed of an odd line image of the frame image 42 and a field image 422 composed of an even line image of the frame image 42. The frame image 41 belongs to the scene A, and the frame images 42 and 43 belong to the scene B. A scene change that changes from the scene A to the scene B occurs between the frame image 41 and the frame image 42. Such a scene change is hereinafter referred to as an inter-frame scene change.

図１４（ｂ）では、時間の経過とともに、フレーム画像が４４、４５、４６と変化している。フレーム画像４５は、フレーム画像４５の奇数ラインの画像からなるフィールド画像４５１と、フレーム画像４５の偶数ラインの画像からなるフィールド画像４５２とからなる。フレーム画像４４はシーンＡに属しており、フレーム画像４５はシーンＡ＋Ｂに属しており、フレーム画像４６はシーンＢに属している。フレーム画像４５のシーンＡ＋Ｂで、シーンチェンジが起こっている。このようなシーンチェンジを、以降ではフレーム内シーンチェンジと呼ぶ。
図１４（ａ）及び図１４（ｂ）に示すフレーム画像は、第１の符号化動画像情報の一部であり、フレーム画像４２とフレーム画像４５がともにＩピクチャである。 In FIG. 14B, the frame images change to 44, 45, and 46 with the passage of time. The frame image 45 includes a field image 451 composed of an odd-numbered image of the frame image 45 and a field image 452 composed of an even-numbered image of the frame image 45. The frame image 44 belongs to the scene A, the frame image 45 belongs to the scene A + B, and the frame image 46 belongs to the scene B. A scene change has occurred in the scene A + B of the frame image 45. Such a scene change is hereinafter referred to as an intra-frame scene change.
The frame images shown in FIGS. 14A and 14B are a part of the first encoded moving image information, and both the frame image 42 and the frame image 45 are I pictures.

従来技術では、シーンチェンジを行なうフレーム画像４２（図１４（ａ）参照）及びフレーム画像４５（図１４（ｂ）参照）を再符号化する際に、当該フレーム画像全体をＩピクチャのフレーム画像として符号化している。このとき、フィールド画像単位でみれば、フィールド画像４２２は、フィールド画像４２１と相関が高いにも関わらず画面間予測モードを利用せず、画面内予測モードを用いている。
一方、フィールド画像４５１は、フレーム画像４４と相関が高いにもかかわらず画面間予測モードを利用せず、画面内予測モードを用いている。いずれの場合においても、従来技術では、シーンチェンジが存在する動画像を再符号化する際に符号化効率が悪くなるという問題があった。 In the prior art, when the frame image 42 (see FIG. 14A) and the frame image 45 (see FIG. 14B) for scene change are re-encoded, the entire frame image is used as an I picture frame image. Encoding. At this time, when viewed in units of field images, the field image 422 uses the intra-screen prediction mode without using the inter-screen prediction mode even though the correlation with the field image 421 is high.
On the other hand, the field image 451 uses the intra-screen prediction mode without using the inter-screen prediction mode even though the correlation with the frame image 44 is high. In either case, the conventional technique has a problem that the encoding efficiency deteriorates when re-encoding a moving image in which a scene change exists.

本発明は、上記事情に鑑みてなされたものであり、その目的は、シーンチェンジが含まれるフレーム画像の符号化効率を向上させることができる動画像符号化装置、プログラム及び動画像符号化方法を提供することにある。 The present invention has been made in view of the above circumstances, and an object of the present invention is to provide a moving image encoding apparatus, a program, and a moving image encoding method capable of improving the encoding efficiency of a frame image including a scene change. It is to provide.

本発明は、上記課題を解決するためになされたもので、本発明の動画像符号化装置は、１枚のフレーム画像を２枚のフィールド画像に変換するフレーム−フィールド変換部と、前記２枚のフィールド画像間にシーンの切り替わり点であるシーンチェンジが含まれているか否かを検出するシーンチェンジ検出部と、前記シーンチェンジ検出部の検出結果に基づいて前記２枚のフィールド画像のそれぞれを画面内予測モード又は画面間予測モードを用いて符号化する画像情報符号化部とを備える。
本発明では、１枚のフレーム画像を２枚のフィールド画像にフレーム−フィールド変換部が変換し、２枚のフィールド画像間にシーンの切り替わり点であるシーンチェンジが含まれているか否かをシーンチェンジ検出部が検出し、シーンチェンジ検出部の検出結果に基づいて２枚のフィールド画像のそれぞれを画面内予測モード又は画面間予測モードを用いて画像情報符号化部が符号化するようにしたので、２枚のフィールド画像のそれぞれに適した画面内予測モード又は画面間予測モードを用いて符号化することにより、情報量を１枚のフレーム画像よりも削減して符号化することができる。 The present invention has been made to solve the above problems, and the moving image encoding apparatus of the present invention includes a frame-field conversion unit that converts one frame image into two field images, and the two images. A scene change detection unit that detects whether or not a scene change that is a scene switching point is included between the field images, and each of the two field images is displayed on the screen based on the detection result of the scene change detection unit. And an image information encoding unit that performs encoding using the intra prediction mode or the inter-screen prediction mode.
In the present invention, the frame-field conversion unit converts one frame image into two field images, and determines whether or not a scene change that is a scene switching point is included between the two field images. Since the detection unit detects and the image information encoding unit encodes each of the two field images based on the detection result of the scene change detection unit using the intra prediction mode or the inter prediction mode, By encoding using the intra-screen prediction mode or the inter-screen prediction mode suitable for each of the two field images, the information amount can be reduced and encoded as compared with one frame image.

また、本発明の動画像符号化装置の前記画像情報符号化部は、前記シーンチェンジ検出部がシーンチェンジを検出した場合に前記フレーム画像を２枚のフィールド画像として符号化する。
本発明では、２枚のフィールド画像間に相関がある場合に、情報量を１枚のフレーム画像よりも削減して符号化することができる。 The image information encoding unit of the moving image encoding apparatus of the present invention encodes the frame image as two field images when the scene change detection unit detects a scene change.
In the present invention, when there is a correlation between two field images, the amount of information can be reduced and encoded compared to a single frame image.

また、本発明の動画像符号化装置は、前記シーンチェンジ検出部のシーンチェンジの検出結果に応じてフィールド画像毎に画面内予測モード又は画面間予測モードを選択してＧＯＰ構造を決定するＧＯＰ構造決定部を備え、前記画像情報符号化部は、前記ＧＯＰ構造決定部が選択した予測モードを用いて各フィールド画像を符号化する。
本発明では、ＧＯＰを構成するフィールド画像ごとに画面内予測モード又は画面間予測モードを選択して符号化するため、ＧＯＰを構成するフィールド画像間の相関を利用してＧＯＰの情報量を削減して符号化することができる。 Further, the moving picture encoding apparatus of the present invention has a GOP structure that determines a GOP structure by selecting an intra-screen prediction mode or an inter-screen prediction mode for each field image in accordance with a scene change detection result of the scene change detection unit. A determination unit, and the image information encoding unit encodes each field image using the prediction mode selected by the GOP structure determination unit.
In the present invention, since the intra-frame prediction mode or the inter-screen prediction mode is selected and encoded for each field image constituting the GOP, the information amount of the GOP is reduced by using the correlation between the field images constituting the GOP. Can be encoded.

また、本発明の動画像符号化装置の前記ＧＯＰ構造決定部は、前記シーンチェンジ検出部がシーンチェンジを検出した場合に、前記フレーム−フィールド変換部が変換するフレーム画像に用いられていた予測モードに応じて前記２枚のフィールド画像に用いる予測モードの組み合わせを選択する。
本発明では、フレーム画像に用いられていた予測モードに応じて、２枚のフィールド画像に用いる最適な予測モードの組み合わせを選択することにより、フレーム画像よりも情報量を削減して符号化することができる。 Further, the GOP structure determination unit of the video encoding device of the present invention is configured such that the prediction mode used for the frame image converted by the frame-field conversion unit when the scene change detection unit detects a scene change. The combination of prediction modes used for the two field images is selected according to the above.
In the present invention, encoding is performed by reducing the amount of information compared to a frame image by selecting an optimal combination of prediction modes used for two field images in accordance with the prediction mode used for the frame image. Can do.

また、本発明の動画像符号化装置の前記ＧＯＰ構造決定部は、前記シーンチェンジ検出部がシーンチェンジを検出した場合に、シーンチェンジ前のフィールド画像に用いられていた予測モードを画面間予測モードとする。
本発明では、シーンチェンジ前のフィールド画像に用いられていた予測モードを画面間予測モードとすることにより、そのフィールド画像を他のフィールド画像から予測することができ、シーンチェンジ前のフィールド画像の情報量を削減して符号化することができる。 Further, the GOP structure determination unit of the moving picture coding apparatus according to the present invention, when the scene change detection unit detects a scene change, changes the prediction mode used for the field image before the scene change to the inter-screen prediction mode. And
In the present invention, by setting the prediction mode used for the field image before the scene change to the inter-screen prediction mode, the field image can be predicted from the other field images, and the information on the field image before the scene change is obtained. The amount can be reduced and encoded.

また、本発明の動画像符号化装置の前記フレーム−フィールド変換部は、前記シーンチェンジ検出部がシーンチェンジを検出した場合に、前記２枚のフィールド画像のうち一方のフィールド画像を、もう一方のフィールド画像との差分が減少するように変換する。
本発明では、２枚のフィールド画像のうち一方のフィールド画像を、もう一方のフィールド画像との差分が減少するように変換するので、一方のフィールド画像をもう一方のフィールド画像から容易に予測することができるようになり、フレーム画像を符号化する際の情報量を削減することができる。 Further, the frame-field conversion unit of the moving image encoding device of the present invention, when the scene change detection unit detects a scene change, converts one field image of the two field images to the other. Conversion is performed so that the difference from the field image decreases.
In the present invention, one of the two field images is converted so as to reduce the difference from the other field image, so that one field image can be easily predicted from the other field image. Thus, it is possible to reduce the amount of information when encoding a frame image.

また、本発明の動画像符号化装置の前記フレーム−フィールド変換部は、前記シーンチェンジ検出部がシーンチェンジを検出した場合に、前記２枚のフィールド画像のうち一方のフィールド画像を、もう一方のフィールド画像と同一のフィールド画像に変換する。
本発明では、２枚のフィールド画像のうち一方のフィールド画像を、もう一方のフィールド画像と同一のフィールド画像に変換するので、一方のフィールド画像の情報量を０にすることができるようになり、フレーム画像を符号化する際の情報量を削減することができる。 Further, the frame-field conversion unit of the moving image encoding device of the present invention, when the scene change detection unit detects a scene change, converts one field image of the two field images to the other. Convert to the same field image as the field image.
In the present invention, one of the two field images is converted into the same field image as the other field image, so that the information amount of one field image can be reduced to zero. The amount of information when encoding a frame image can be reduced.

また、本発明の動画像符号化装置の前記画像情報符号化部は、前記シーンチェンジ検出部がシーンチェンジを検出した場合に、符号化対象のフレーム画像を構成しシーンチェンジ前のシーンに属するフィールド画像の符号化に用いられる平均量子化幅を、同一の予測モードで直前のフィールド画像の符号化に用いられる平均量子化幅よりも増加させる。
本発明では、符号化対象のフレーム画像を構成しシーンチェンジ前のシーンに属するフィールド画像の符号化に用いられる平均量子化幅を増加させて情報量を削減することにより、フレーム画像を符号化する際の情報量を削減することができる。 In addition, the image information encoding unit of the moving image encoding device of the present invention, when the scene change detection unit detects a scene change, configures a frame image to be encoded and belongs to a field before the scene change. The average quantization width used for image encoding is increased more than the average quantization width used for encoding the previous field image in the same prediction mode.
In the present invention, the frame image is encoded by reducing the amount of information by configuring the frame image to be encoded and increasing the average quantization width used for encoding the field image belonging to the scene before the scene change. The amount of information at the time can be reduced.

また、本発明の動画像符号化装置の前記画像情報符号化部は、前記シーンチェンジ検出部がシーンチェンジを検出した場合に、符号化対象のフレーム画像を構成しシーンチェンジ後のシーンに属するフィールド画像の符号化に用いられる平均量子化幅を、符号化対象のフレーム画像を構成しシーンチェンジ前のシーンに属するフィールド画像の符号化に用いられる平均量子化幅よりも小さくする。
本発明では、符号化対象のフレーム画像を構成しシーンチェンジ後のシーンに属するフィールド画像の符号化に用いられる平均量子化幅を小さくするようにしたので、シーンチェンジ前のシーンに属するフィールド画像に対して、シーンチェンジ後のシーンに属するフィールド画像の情報量を増加させることができる。 In addition, the image information encoding unit of the moving image encoding device of the present invention includes a field that constitutes a frame image to be encoded and belongs to a scene after a scene change when the scene change detection unit detects a scene change. The average quantization width used for encoding the image is made smaller than the average quantization width used for encoding the field image that forms the frame image to be encoded and belongs to the scene before the scene change.
In the present invention, since the frame image to be encoded is configured and the average quantization width used for encoding the field image belonging to the scene after the scene change is reduced, the field image belonging to the scene before the scene change On the other hand, the information amount of the field image belonging to the scene after the scene change can be increased.

また、本発明のプログラムは、コンピュータに、１枚のフレーム画像を２枚のフィールド画像に変換する第１のステップと、前記２枚のフィールド画像間にシーンの切り替わり点であるシーンチェンジが含まれているか否かを検出する第２のステップと、前記シーンチェンジ検出部の検出結果に基づいて前記２枚のフィールド画像のそれぞれを画面内予測モード又は画面間予測モードを用いて符号化する第３のステップとを実行させる。 In the program of the present invention, the computer includes a first step of converting one frame image into two field images, and a scene change that is a scene switching point between the two field images. A second step of detecting whether or not each of the two field images is encoded using an intra-screen prediction mode or an inter-screen prediction mode based on a detection result of the scene change detection unit; The steps are executed.

また、本発明の動画像符号化方法は、１枚のフレーム画像を２枚のフィールド画像に変換するフレーム−フィールド変換過程と、前記２枚のフィールド画像間にシーンの切り替わり点であるシーンチェンジが含まれているか否かを検出するシーンチェンジ検出過程と、前記シーンチェンジ検出部の検出結果に基づいて前記２枚のフィールド画像のそれぞれを画面内予測モード又は画面間予測モードを用いて符号化する画像情報符号化過程とを有する。 The moving image encoding method of the present invention includes a frame-field conversion process for converting one frame image into two field images, and a scene change that is a scene switching point between the two field images. Each of the two field images is encoded using an intra-screen prediction mode or an inter-screen prediction mode based on a scene change detection process for detecting whether or not it is included and a detection result of the scene change detection unit. And an image information encoding process.

本発明では、シーンチェンジが含まれるフレーム画像の符号化効率を向上させることができる。 In the present invention, it is possible to improve the encoding efficiency of a frame image including a scene change.

以下、図面を参照し、本発明の実施形態について説明する。
図１は、２枚のフィールド画像ｆｌｄ１及びｆｌｄ２が、１枚のフレーム画像ｆｒｍを構成する例を示した図である。フレーム画像ｆｒｍの高さｈの半分の高さｈ／２である２枚のフィールド画像ｆｌｄ１とフィールド画像ｆｌｄ２とが、飛び越し走査でそれぞれフレーム画像ｆｒｍの奇数ラインと偶数ラインを占めるように交互に格納されている。このような構造をとる場合、フィールド画像ｆｌｄ１とフィールド画像ｆｌｄ２の映像の時刻は異なる。なお、図１のフィールド画像ｆｌｄ１は、映像の時刻の順序及び符号化の順序において、フィールド画像ｆｌｄ２よりも先の画像である。 Hereinafter, embodiments of the present invention will be described with reference to the drawings.
FIG. 1 is a diagram showing an example in which two field images fld1 and fld2 constitute one frame image frm. Two field images fld1 and field image fld2 having a height h / 2 that is half the height h of the frame image frm are alternately stored so as to occupy the odd and even lines of the frame image frm by interlaced scanning. Has been. When taking such a structure, the time of the video of the field image fld1 and the field image fld2 is different. Note that the field image fld1 in FIG. 1 is an image ahead of the field image fld2 in the video time order and the coding order.

符号化の対象としてのフレーム画像とフィールド画像は総称してピクチャと呼ばれ、ピクチャはマクロブロックと呼ばれる小領域に分割して符号化される。符号化の際には、予測符号化の方法、すなわち予測モードと、画質及び符号量を調節する値である量子化幅とをマクロブロック毎に選択できるのが一般的である。予測モードには、画面内予測モードと画面間予測モードの２種類がある。
画面内予測モードは、画素値の空間方向における相関性を利用した予測モードである。また、画面間予測モードは、画素値の時間方向における相関性を利用した予測モードである。マクロブロック毎に選択可能な予測モードの組み合わせは、ピクチャ毎にピクチャタイプとして選択する。ピクチャタイプには基本的にＩ・Ｐ・Ｂの３種類がある。 Frame images and field images as encoding targets are collectively called pictures, and pictures are encoded by being divided into small areas called macroblocks. At the time of encoding, it is general that a prediction encoding method, that is, a prediction mode and a quantization width that is a value for adjusting image quality and code amount can be selected for each macroblock. There are two types of prediction modes: an intra-screen prediction mode and an inter-screen prediction mode.
The intra prediction mode is a prediction mode that uses the correlation of pixel values in the spatial direction. The inter-screen prediction mode is a prediction mode that uses the correlation of pixel values in the time direction. A combination of prediction modes that can be selected for each macroblock is selected as a picture type for each picture. There are basically three types of picture types: I, P, and B.

ピクチャタイプがＩであるピクチャはＩピクチャといい、画面内予測モードだけが用いられる。ピクチャタイプがＰであるピクチャはＰピクチャといい、Ｉピクチャと同じ画面内予測モードに加えて、予測元のデータとなる参照ピクチャが１枚の画面間予測モードを用いることができる。ピクチャタイプがＢであるピクチャはＢピクチャといい、Ｐピクチャと同じ予測モードに加えて、２枚の参照ピクチャによる画面間予測モードを用いることができる。
画面間予測モードが利用できるＰピクチャ及びＢピクチャは、過去に符号化済みのピクチャとの差分だけを符号化するため、符号化対象のピクチャと参照ピクチャとの相関が高いほど、効率の良い符号化が可能である。
後述する第１〜第３の実施形態では、画面間予測モードとしてＰピクチャを用い、画面内予測モードとしてＩピクチャを用いる場合について説明する。 A picture whose picture type is I is called an I picture, and only the intra prediction mode is used. A picture whose picture type is P is called a P picture, and in addition to the same intra prediction mode as that of an I picture, an inter prediction mode with one reference picture serving as prediction source data can be used. A picture whose picture type is B is called a B picture, and in addition to the same prediction mode as a P picture, an inter-screen prediction mode using two reference pictures can be used.
The P picture and the B picture that can use the inter-screen prediction mode encode only the difference from the previously coded picture. Therefore, the higher the correlation between the picture to be coded and the reference picture, the more efficient the picture. Is possible.
In the first to third embodiments described later, a case will be described in which a P picture is used as an inter-screen prediction mode and an I picture is used as an intra-screen prediction mode.

図２は、ピクチャ間の相関性の例を示す図である。図２は、時間の経過とともに、フレーム画像が、ｆｒｍ１、ｆｒｍ２、ｆｒｍ３、ｆｒｍ４と変化する場合を示している。
同一のシーンに属するピクチャ間であれば、シーンの映像が大きく変化しない限りは高い相関があるが、参照ピクチャと符号化対象のピクチャが異なるシーンに属する場合には相関が低い。具体的に説明すると、フレーム画像ｆｒｍ１とフレーム画像ｆｒｍ２とは、円のパターンｍ１がフレーム画像内を移動している点で類似しており、相関が高い。また、フレーム画像ｆｒｍ３とフレーム画像ｆｒｍ４とは、四角形のパターンｍ２がフレーム画像内を移動している点で類似しており、相関が高い。一方、フレーム画像ｆｒｍ２とフレーム画像ｆｒｍ３とは、フレーム画像内に含まれるパターンが異なっており、相関が低い。
あるシーンから異なるシーンへの切り替わりをシーンチェンジといい、シーンチェンジを挟んだピクチャ間での画面間予測モードの利用は一般に符号化効率が低く画質の劣化につながるため、画面内予測モードを利用するのが望ましい。図２では、フレーム画像ｆｒｍ２とフレーム画像ｆｒｍ３との間で、シーンチェンジが起こっている。 FIG. 2 is a diagram illustrating an example of correlation between pictures. FIG. 2 shows a case where the frame image changes to frm1, frm2, frm3, and frm4 with the passage of time.
Between pictures belonging to the same scene, there is a high correlation as long as the video of the scene does not change significantly, but when the reference picture and the picture to be encoded belong to different scenes, the correlation is low. More specifically, the frame image frm1 and the frame image frm2 are similar in that the circle pattern m1 moves within the frame image, and the correlation is high. Further, the frame image frm3 and the frame image frm4 are similar in that the square pattern m2 moves in the frame image, and the correlation is high. On the other hand, the frame image frm2 and the frame image frm3 have different patterns included in the frame image and have a low correlation.
Switching from one scene to another is called a scene change. Use of the inter-screen prediction mode between pictures with a scene change is generally low in coding efficiency and leads to deterioration of image quality, so use the intra-screen prediction mode. Is desirable. In FIG. 2, a scene change occurs between the frame image frm2 and the frame image frm3.

一方、画面内予測モードのみを利用するＩピクチャは、上記を理由としてシーンチェンジ直後のピクチャの符号化において用いられるほか、一連のピクチャの集まりであるＧＯＰ（Group Of Picture）へのランダムアクセスを実現するために、ＧＯＰの先頭ピクチャの符号化においても用いられる。なお、従来の符号化方式の場合、フィールド画像単位で符号化する際にも、ＧＯＰはフレーム画像単位で構成されるため、ランダムアクセスもフレーム画像単位となる。しかし、フレーム画像内で先に符号化されるフィールド画像がＩピクチャでなければＧＯＰの先頭ピクチャとはならず、後に符号化されるフィールド画像がＩピクチャとして符号化されていても、そのフレーム画像にはランダムアクセスが不可能である。 On the other hand, the I picture that uses only the intra prediction mode is used for coding the picture immediately after the scene change for the above reasons, and realizes random access to a group of pictures (GOP) that is a collection of a series of pictures. Therefore, it is also used in encoding the first picture of the GOP. In the case of the conventional encoding method, when encoding is performed in units of field images, since GOP is configured in units of frame images, random access is also performed in units of frame images. However, if the field image encoded earlier in the frame image is not an I picture, it does not become the first picture of the GOP. Even if the field image encoded later is encoded as an I picture, the frame image Random access is not possible.

ところで、所定の第１の符号化方式によって符号化された第１の動画像情報を、所定の第２の符号化方式によって符号化された第２の動画像情報に変換する処理を一般に、動画像の再符号化という。再符号化は、より符号化効率の高い符号化方式を用いて符号量を削減するほか、解像度の変換や、通信・放送・蓄積機器の仕様への適合などを目的として行われる。いずれの目的にしても、再符号化による画質劣化はできるだけ回避することが好ましい。 By the way, a process for converting the first moving image information encoded by the predetermined first encoding method into the second moving image information encoded by the predetermined second encoding method is generally a moving image. This is called image re-encoding. Re-encoding is performed for the purpose of reducing the amount of code by using an encoding method with higher encoding efficiency, as well as converting the resolution and conforming to the specifications of the communication / broadcast / storage device. For any purpose, it is preferable to avoid image quality degradation due to re-encoding as much as possible.

（第１の実施形態）
図３は、本発明の第１の実施形態における動画像再符号化装置１００（動画像符号化装置とも称する）の構成を示すブロック図である。この動画像再符号化装置１００は、動きベクトル合成部１０１、フレーム−フィールド変換部１０２、画像情報符号化部１０３、フレーム内シーンチェンジ検出部１０４、ＧＯＰ構造決定部１０５、ピクチャタイプ判別部２０１、符号化情報解析部２０２、符号化情報復号部２０３、ビデオメモリ２０５、動きベクトル検出部２０８、情報バッファ２０９、コンプレキシティ算出部２１０を備えている。 (First embodiment)
FIG. 3 is a block diagram illustrating a configuration of the moving image re-encoding device 100 (also referred to as a moving image encoding device) according to the first embodiment of the present invention. The moving image re-encoding device 100 includes a motion vector synthesis unit 101, a frame-field conversion unit 102, an image information encoding unit 103, an intra-frame scene change detection unit 104, a GOP structure determination unit 105, a picture type determination unit 201, The encoded information analysis unit 202, the encoded information decoding unit 203, the video memory 205, the motion vector detection unit 208, the information buffer 209, and the complexity calculation unit 210 are provided.

動画像再符号化装置１００は、第１の符号化方式によるフレーム構造の第１の符号化動画像情報を入力として受け取り、フレーム内シーンチェンジが含まれるフレーム画像に対応して適切な符号化パラメータを選択し、第２の符号化方式によるフィールド構造の第２の符号化動画像情報を出力する。
なお、第１の符号化方式とは、画面内符号化モードと画面間符号化モードがピクチャ毎に選択できる方式であり、例えば、ＩＳＯ／ＩＥＣ１３８１８−２に規定されている方式を用いることができる。
また、第２の符号化方式とは、フィールド画像単位での符号化ができる方式であり、例えば、ＩＳＯ／ＩＥＣ１４４９６−１０に規定されている方式を用いることができる。
なお、第１の符号化方式と第２の符号化方式とで同じ符号化方式を用いてもよい。 The moving image re-encoding device 100 receives, as an input, first encoded moving image information having a frame structure according to the first encoding method, and an appropriate encoding parameter corresponding to a frame image including an intra-frame scene change. And the second encoded moving picture information having the field structure by the second encoding method is output.
The first encoding method is a method in which the intra-screen encoding mode and the inter-screen encoding mode can be selected for each picture. For example, a method specified in ISO / IEC 13818-2 is used. it can.
The second encoding method is a method that enables encoding in units of field images, and for example, a method defined in ISO / IEC 14496-10 can be used.
Note that the same encoding method may be used for the first encoding method and the second encoding method.

動きベクトル合成部１０１は、符号化情報解析部２０２が出力する第１の符号化方式における動きベクトルから、第２の符号化方式の動きベクトルを合成し、動きベクトル検出部２０８に出力する。動きベクトルとは、符号化を行なったフレーム画像を参照することにより、次に符号化するフレーム画像を予測したときの空間的なずれをいう。
フレーム−フィールド変換部１０２は、ビデオメモリから出力されるフレーム構造の画像情報をフィールド構造に変換し、フレーム−フィールド変換部１０２に出力する。
画像情報符号化部１０３は、フレーム−フィールド変換部１０２から出力されるフィールド画像に対して、第２の符号化方式を用いて符号化し、動画像再符号化装置１００の外部に出力する。
フレーム内シーンチェンジ検出部１０４は、フレーム−フィールド変換部１０２から出力されるフィールド画像に基づいて、フレーム内シーンチェンジの有無を検出し、その検出結果であるシーンチェンジフラグをＧＯＰ構造決定部１０５に出力する。
ＧＯＰ構造決定部１０５は、フレーム内シーンチェンジ検出部１０４が出力するシーンチェンジフラグに基づいてピクチャタイプを選択してＧＯＰの構造を決定し、そのピクチャタイプを画像情報符号化部１０３に出力する。 The motion vector synthesis unit 101 synthesizes the motion vector of the second encoding method from the motion vector in the first encoding method output from the encoding information analysis unit 202 and outputs the synthesized motion vector to the motion vector detection unit 208. The motion vector refers to a spatial shift when a frame image to be encoded next is predicted by referring to the encoded frame image.
The frame-field conversion unit 102 converts frame-structured image information output from the video memory into a field structure, and outputs the field structure to the frame-field conversion unit 102.
The image information encoding unit 103 encodes the field image output from the frame-field conversion unit 102 using the second encoding method, and outputs the encoded image to the outside of the moving image re-encoding device 100.
The intra-frame scene change detection unit 104 detects the presence / absence of an intra-frame scene change based on the field image output from the frame-field conversion unit 102, and sends a scene change flag as a detection result to the GOP structure determination unit 105. Output.
The GOP structure determination unit 105 selects a picture type based on the scene change flag output from the intra-frame scene change detection unit 104, determines the GOP structure, and outputs the picture type to the image information encoding unit 103.

ピクチャタイプ判別部２０１は、動画像再符号化装置１００に入力される第１の符号化動画像情報に含まれるフレーム画像のピクチャタイプを判別し、Ｂピクチャを破棄する。ピクチャタイプ判別部２０１は、第１の符号化動画像情報を符号化情報解析部２０２に出力するとともに、判別したピクチャタイプをＧＯＰ構造決定部１０５に出力する。なお、ピクチャタイプ判別部２０１は、フレームレートの変換を行わない場合には、Ｂピクチャの破棄を行わない。この場合、符号化情報復号部２０３には、Ｂピクチャの復号機能が設けられる。 The picture type determination unit 201 determines the picture type of the frame image included in the first encoded moving image information input to the moving image re-encoding device 100, and discards the B picture. The picture type determination unit 201 outputs the first encoded moving image information to the encoding information analysis unit 202 and outputs the determined picture type to the GOP structure determination unit 105. Note that the picture type determination unit 201 does not discard the B picture when frame rate conversion is not performed. In this case, the encoded information decoding unit 203 is provided with a B picture decoding function.

動きベクトル合成部１０１は、符号化情報解析部２０２が出力する第１の符号化方式に対する動きベクトルを、第２の符号化方式における動きベクトルに適合するよう変換し、動きベクトル検出部２０８に出力する。また、動きベクトル合成部１０１は、フレーム画像に対する動きベクトルからフィールド画像に対する動きベクトルへの変換を行なう。
フレーム−フィールド変換部１０２は、フレーム画像とフィールド画像の関係に基づいてフレーム画像を飛び越し走査し、フレーム画像の奇数ラインと偶数ラインそれぞれのみから成る２枚のフィールド画像に変換し、画像情報符号化部１０３に出力する。 The motion vector synthesis unit 101 converts the motion vector for the first encoding method output from the encoding information analysis unit 202 so as to match the motion vector in the second encoding method, and outputs the motion vector to the motion vector detection unit 208. To do. The motion vector synthesis unit 101 converts a motion vector for a frame image into a motion vector for a field image.
The frame-field conversion unit 102 interlaces and scans the frame image based on the relationship between the frame image and the field image, converts the frame image into two field images including only odd lines and even lines of the frame image, and encodes image information. Output to the unit 103.

画像情報符号化部１０３は、第１の符号化動画像情報を用いて符合化された１枚のフレーム画像を、第２の符号化方式を用いて２枚のフィールド画像として符号化し、動画像再符合化装置１００の外部に出力する。２枚のフィールド画像に適用するピクチャタイプはそれぞれ、ＧＯＰ構造決定部１０５が選択するピクチャタイプｐｔ１及びピクチャタイプｐｔ２である。２枚のフィールド画像を符号化するために、符号量の制御はフィールド画像毎に行われる。符号量の制御には後述のコンプレキシティを用いる。 The image information encoding unit 103 encodes one frame image encoded using the first encoded moving image information as two field images using the second encoding method, and generates a moving image. Output to the outside of the re-encoding device 100. The picture types applied to the two field images are the picture type pt1 and the picture type pt2 selected by the GOP structure determination unit 105, respectively. In order to encode two field images, the amount of code is controlled for each field image. The complexity described later is used to control the code amount.

フレーム内シーンチェンジ検出部１０４は、画像間差分の非類似度に基づくシーンチェンジの検出方法を使用して、フレーム画像内の２枚のフィールド画像にシーンチェンジが含まれているか否かについて検出する。なお、フレーム内シーンチェンジ検出部１０４は、画像間差分の非類似度に基づくシーンチェンジの検出方法以外の方法を使用して、２枚のフィールド画像にシーンチェンジが含まれているか否かについて検出してもよい。
具体的には、フレーム内シーンチェンジ検出部１０４は、２枚のフィールド画像の非類似度が所定の閾値よりも大きい場合には２枚のフィールド画像間にシーンチェンジが存在すると判定し、真のシーンチェンジフラグｓｃをＧＯＰ構造決定部１０５に出力する。一方、２枚のフィールド画像の非類似度が所定の閾値以下である場合には２枚のフィールド画像間にシーンチェンジが存在しないと判定し、偽のシーンチェンジフラグｓｃをＧＯＰ構造決定部１０５に出力する。 The in-frame scene change detection unit 104 detects whether or not a scene change is included in the two field images in the frame image by using a scene change detection method based on the dissimilarity of the inter-image difference. . The in-frame scene change detection unit 104 detects whether a scene change is included in the two field images using a method other than the scene change detection method based on the dissimilarity of the difference between images. May be.
Specifically, the in-frame scene change detection unit 104 determines that there is a scene change between two field images when the dissimilarity between the two field images is greater than a predetermined threshold, and the true The scene change flag sc is output to the GOP structure determination unit 105. On the other hand, if the dissimilarity between the two field images is equal to or less than a predetermined threshold, it is determined that there is no scene change between the two field images, and a false scene change flag sc is sent to the GOP structure determination unit 105. Output.

ＧＯＰ構造決定部１０５は、ピクチャタイプ判別部２０１が出力するピクチャタイプｐｔと、フレーム内シーンチェンジ検出部１０４が出力するシーンチェンジフラグｓｃとに基づいて、２枚のフィールド画像のうち一方のフィールド画像に適用するピクチャタイプｐｔ１と、他方のフィールド画像に適用するピクチャタイプｐｔ２を選択し、画像情報符号化部１０３に出力する。 The GOP structure determination unit 105 selects one of the two field images based on the picture type pt output from the picture type determination unit 201 and the scene change flag sc output from the intra-frame scene change detection unit 104. The picture type pt1 to be applied to and the picture type pt2 to be applied to the other field image are selected and output to the image information encoding unit 103.

図４は、本発明の実施形態によるＧＯＰ構造決定部１０５の処理を示すフローチャートである。始めに、ＧＯＰ構造決定部１０５は、ピクチャタイプ判別部２０１（図３）が出力するピクチャタイプがＩピクチャであるか否か、つまり、ｐｔ＝Ｉであるか否かについて判定する（ステップＳ１０）。
ステップＳ１０でピクチャタイプがＩピクチャである場合（ｐｔ＝Ｉ）には、ＧＯＰ構造決定部１０５（図３）は、フレーム内シーンチェンジ検出部１０４が出力するシーンチェンジフラグが真であるか偽であるか否か、つまり、ｓｃ＝ｔｒｕｅであるか否かについて判定する（ステップＳ１１） FIG. 4 is a flowchart showing a process of the GOP structure determination unit 105 according to the embodiment of the present invention. First, the GOP structure determination unit 105 determines whether the picture type output by the picture type determination unit 201 (FIG. 3) is an I picture, that is, whether pt = I (step S10). .
When the picture type is I picture in step S10 (pt = I), the GOP structure determination unit 105 (FIG. 3) determines whether the scene change flag output from the in-frame scene change detection unit 104 is true or false. It is determined whether there is, that is, whether sc = true (step S11).

ＧＯＰ構造決定部１０５は、フレーム内シーンチェンジ検出部１０４がシーンチェンジを検出した場合（ｓｃ＝ｔｒｕｅ）に、フレーム−フィールド変換部１０２が変換するフレーム画像に用いられていたピクチャタイプに応じて２枚のフィールド画像に用いるピクチャタイプの組み合わせを選択する。具体的には、ＧＯＰ構造決定部１０５は、シーンチェンジ検出部１０２がシーンチェンジを検出した場合に、シーンチェンジ前のフィールド画像に用いられていたピクチャタイプをＰピクチャとする。
つまり、ステップＳ１１でシーンチェンジフラグが真である場合（ｓｃ＝ｔｒｕｅ）は、再符号化対象のフレーム画像ｆｒｍ内にシーンチェンジが含まれている場合であり、フレーム画像内に２つのシーンが存在している。この場合、フィールド画像ｆｌｄ１は、同じシーンに属する時間的に前方のピクチャを参照ピクチャとして画面間予測モードを用いると、符号化効率を高めることができる。これに対してシーンチェンジ後のシーンの最初の画像であるフィールド画像ｆｌｄ２は、時間的に前のピクチャから予測することが難しいために画面内予測モードを用いる。
つまり、ＧＯＰ構造決定部１０５は、フィールド画像ｆｌｄ１に適用するピクチャタイプとしてＰピクチャを選択し、フィールド画像ｆｌｄ２に適用するピクチャタイプとしてＩピクチャを選択（ｐｔ１＝Ｐ，ｐｔ２＝Ｉ）する（ステップＳ１２）。 When the intra-frame scene change detection unit 104 detects a scene change (sc = true), the GOP structure determination unit 105 sets 2 depending on the picture type used for the frame image converted by the frame-field conversion unit 102. A combination of picture types used for one field image is selected. Specifically, when the scene change detection unit 102 detects a scene change, the GOP structure determination unit 105 sets the picture type used for the field image before the scene change as a P picture.
That is, when the scene change flag is true in step S11 (sc = true), the scene image is included in the frame image frm to be re-encoded, and there are two scenes in the frame image. is doing. In this case, when the inter-picture prediction mode is used for the field image fld1 using the temporally forward picture belonging to the same scene as a reference picture, the encoding efficiency can be improved. On the other hand, the field image fld2, which is the first image of the scene after the scene change, uses the intra-screen prediction mode because it is difficult to predict from the temporally previous picture.
That is, the GOP structure determination unit 105 selects a P picture as a picture type to be applied to the field image fld1, and selects an I picture as a picture type to be applied to the field image fld2 (pt1 = P, pt2 = I) (step S12). ).

なお、ステップＳ１２において、フィールド画像ｆｌｄ２に適用するピクチャタイプとしてＰピクチャを選択し、各マクロブロックにおいて画面内予測モードを選択するようにしてもよい。
ここで、再符号化対象のフレーム画像ｆｒｍはＩピクチャであって動きベクトルを有しないため、フィールド画像ｆｌｄ１に対する新たな動きベクトルが必要となる。不足する動きベクトルに対しては、動きベクトル検出部２０８で動きベクトルを新規に検出してもよいし、他の方法を用いてもよい。すなわち、動きベクトルを他のピクチャの動きベクトルから推定するようにしてもよい。 In step S12, a P picture may be selected as a picture type to be applied to the field image fld2, and an intra prediction mode may be selected for each macroblock.
Here, since the frame image frm to be re-encoded is an I picture and does not have a motion vector, a new motion vector for the field image fld1 is required. For a motion vector that is insufficient, the motion vector detection unit 208 may newly detect a motion vector, or another method may be used. That is, the motion vector may be estimated from the motion vector of another picture.

ステップＳ１２の処理を行なった場合に、第１の符号化動画像情報においてランダムアクセスが可能であったフレーム画像ｆｒｍが、再符号化後にはランダムアクセスが不可能な画像となるときには、フィールド画像ｆｌｄ１に適用するピクチャタイプとしてＩピクチャを選択することにより、再符号化後もフレーム画像ｆｒｍへのランダムアクセスが可能となる。この場合には、フィールド画像ｆｌｄ１及びｆｌｄ２に適用するピクチャタイプを両方ともＩピクチャにすることによる符号化効率の低下を避けるため、後述の第３の実施形態を用いるとよい。 When the frame image frm that can be randomly accessed in the first encoded moving image information becomes an image that cannot be randomly accessed after re-encoding when the process of step S12 is performed, the field image fld1 By selecting the I picture as the picture type to be applied to, random access to the frame image frm is possible even after re-encoding. In this case, a third embodiment described later may be used in order to avoid a decrease in encoding efficiency due to the fact that both of the picture types applied to the field images fld1 and fld2 are I pictures.

ステップＳ１１でシーンチェンジフラグが偽である場合（ｓｃ≠ｔｒｕｅ）は、フィールド画像ｆｌｄ１とフィールド画像ｆｌｄ２との間にはシーンチェンジが含まれず、フィールド画像ｆｌｄ１とフィールド画像ｆｌｄ２との間に高い相関があるため、フィールド画像ｆｌｄ２をフィールド画像ｆｌｄ１から予測すれば符号化効率を高めることができる。
したがって、フィールド画像ｆｌｄ１はＩピクチャとし、フィールド画像ｆｌｄ２は第１の符号化動画像情報におけるピクチャタイプであるＩを継承せずにＰピクチャとする（ｐｔ１＝Ｉ，ｐｔ２＝Ｐ）。つまり、ＧＯＰ構造決定部１０５は、フィールド画像ｆｌｄ１に適用するピクチャタイプとしてＩピクチャを選択し、フィールド画像ｆｌｄ２に適用するピクチャタイプとしてＰピクチャを選択する（ステップＳ１３）。 If the scene change flag is false in step S11 (sc ≠ true), no scene change is included between the field image fld1 and the field image fld2, and a high correlation is present between the field image fld1 and the field image fld2. Therefore, if the field image fld2 is predicted from the field image fld1, the encoding efficiency can be increased.
Therefore, the field image fld1 is an I picture, and the field image fld2 is a P picture without inheriting the picture type I in the first encoded moving picture information (pt1 = I, pt2 = P). That is, the GOP structure determination unit 105 selects an I picture as a picture type to be applied to the field image fld1, and selects a P picture as a picture type to be applied to the field image fld2 (step S13).

ステップＳ１０でピクチャタイプがＩピクチャではない場合（ｐｔ≠Ｉ）には、ＧＯＰ構造決定部１０５は、フィールド画像ｆｌｄ１に適用するピクチャタイプとして第１の符号化動画像情報におけるピクチャタイプを継続して選択し、フィールド画像ｆｌｄ２に適用するピクチャタイプとして第１の符号化動画像情報におけるピクチャタイプを継続して選択（ｐｔ１＝ｐｔ，ｐｔ２＝ｐｔ）する（ステップＳ１４）。 If the picture type is not an I picture in step S10 (pt ≠ I), the GOP structure determination unit 105 continues the picture type in the first encoded moving picture information as the picture type applied to the field image fld1. The picture type in the first encoded moving picture information is continuously selected as the picture type to be applied to the field picture fld2 (pt1 = pt, pt2 = pt) (step S14).

次に、図５及び図６を参照して、ＧＯＰ構造決定部１０５の動作例を説明する。
図５は、本発明の第１の実施形態による第１の符号化動画像情報の模式図である。図５では、時間の経過とともに、フレーム画像がＩ０、Ｐ１、Ｐ２、Ｉ３、Ｐ４、Ｐ５、Ｉ６、Ｐ７、Ｐ８と変化している。フレーム画像Ｉ０、Ｉ３、Ｉ６には、ピクチャタイプとしてＩピクチャが適用されており、ランダムアクセスが可能である。また、フレーム画像Ｐ１、Ｐ２、Ｐ４、Ｐ５、Ｐ７、Ｐ８には、ピクチャタイプとしてＰピクチャが適用されている。
フレーム画像Ｉ０、Ｐ１、Ｐ２は、フレーム画像の集まりであるＧＯＰ・ｇ１を構成している。同様に、フレーム画像Ｉ３、Ｐ４、Ｐ５はＧＯＰ・ｇ２を構成し、フレーム画像Ｉ６、Ｐ７、Ｐ８はＧＯＰ・ｇ３を構成している。 Next, an operation example of the GOP structure determination unit 105 will be described with reference to FIGS. 5 and 6.
FIG. 5 is a schematic diagram of first encoded moving image information according to the first embodiment of the present invention. In FIG. 5, the frame images change as I0, P1, P2, I3, P4, P5, I6, P7, and P8 with the passage of time. The frame images I0, I3, and I6 have an I picture applied as a picture type, and can be randomly accessed. In addition, a P picture is applied as a picture type to the frame images P1, P2, P4, P5, P7, and P8.
The frame images I0, P1, and P2 constitute GOP · g1, which is a collection of frame images. Similarly, the frame images I3, P4, and P5 constitute GOP · g2, and the frame images I6, P7, and P8 constitute GOP · g3.

図６（ａ）〜図６（ｃ）は、本発明の第１の実施形態による第２の符号化動画像情報の模式図である。
図６（ａ）では、フィールド画像ｆｌｄ１に対応するピクチャを、Ｉ０ｔ、Ｐ１ｔ、Ｐ２ｔ、Ｉ３ｔ、Ｐ４ｔ、Ｐ５ｔ、Ｉ６ｔ、Ｐ７ｔ、Ｐ８ｔで示し、フィールド画像ｆｌｄ２に対応するピクチャを、Ｐ０ｂ、Ｐ１ｂ、Ｐ２ｂ、Ｐ３ｂ、Ｐ４ｂ、Ｐ５ｂ、Ｐ６ｂ、Ｐ７ｂ、Ｐ８ｂで示している。
また、図６（ｂ）では、フィールド画像ｆｌｄ１に対応するピクチャを、Ｉ０ｔ、Ｐ１ｔ、Ｐ２ｔ、Ｐ３ｔ、Ｐ４ｔ、Ｐ５ｔ、Ｉ６ｔ、Ｐ７ｔ、Ｐ８ｔで示し、フィールド画像ｆｌｄ２に対応するピクチャを、Ｐ０ｂ、Ｐ１ｂ、Ｐ２ｂ、Ｉ３ｂ、Ｐ４ｂ、Ｐ５ｂ、Ｐ６ｂ、Ｐ７ｂ、Ｐ８ｂで示している。
また、図６（ｃ）では、フィールド画像ｆｌｄ１に対応するピクチャを、Ｉ０ｔ、Ｐ１ｔ、Ｐ２ｔ、Ｉ３ｔ、Ｐ４ｔ、Ｐ５ｔ、Ｉ６ｔ、Ｐ７ｔ、Ｐ８ｔで示し、フィールド画像ｆｌｄ２に対応するピクチャを、Ｐ０ｂ、Ｐ１ｂ、Ｐ２ｂ、Ｉ３ｂ、Ｐ４ｂ、Ｐ５ｂ、Ｐ６ｂ、Ｐ７ｂ、Ｐ８ｂで示している。
図６（ａ）〜図６（ｃ）では、フィールド画像ｆｌｄ１とフィールド画像ｆｌｄ２とを合わせたフレーム画像を単位としてＧＯＰが構成されている。 FIGS. 6A to 6C are schematic diagrams of second encoded moving image information according to the first embodiment of the present invention.
In FIG. 6A, pictures corresponding to the field image fld1 are denoted by I0t, P1t, P2t, I3t, P4t, P5t, I6t, P7t, and P8t, and pictures corresponding to the field image fld2 are denoted by P0b, P1b, and P2b. , P3b, P4b, P5b, P6b, P7b, P8b.
In FIG. 6B, pictures corresponding to the field image fld1 are denoted by I0t, P1t, P2t, P3t, P4t, P5t, I6t, P7t, and P8t, and pictures corresponding to the field image fld2 are denoted by P0b and P1b. , P2b, I3b, P4b, P5b, P6b, P7b, P8b.
In FIG. 6C, pictures corresponding to the field image fld1 are indicated by I0t, P1t, P2t, I3t, P4t, P5t, I6t, P7t, and P8t, and pictures corresponding to the field image fld2 are indicated by P0b and P1b. , P2b, I3b, P4b, P5b, P6b, P7b, P8b.
In FIG. 6A to FIG. 6C, the GOP is configured in units of frame images obtained by combining the field image fld1 and the field image fld2.

図６（ａ）は、図５に示す動画像にフレーム内シーンチェンジが含まれるフレーム画像が存在しない場合の、再符号化後のＧＯＰ構造の例である。ｇ１ａ、ｇ２ａ、ｇ３ａは再符号化後のＧＯＰであり、それぞれ図５におけるＧＯＰ・ｇ１、ｇ２、ｇ３と対応している。第１の符号化動画像情報でＩピクチャであったフレーム画像は、再符号化後にはＩピクチャのフィールド画像とＰピクチャのフィールド画像となる。例えば、第１の符号化動画像情報のフレーム画像Ｉ０（図５）は、図６（ａ）に示すように２枚のフィールド画像（フィールド画像Ｉ０ｔとフィールド画像Ｐ０ｂ）として再符号化される。このとき、フィールド画像Ｐ０ｂの参照ピクチャは、フィールド画像Ｉ０ｔである。図５のフレーム画像Ｉ３やＩ６についても同様に、ＩピクチャとＰピクチャの２枚のフィールド画像に分離して再符号化される。Ｉピクチャ以外の画像については、再符号化前と同じピクチャタイプを再符号化時に適用する。 FIG. 6A shows an example of a GOP structure after re-encoding when there is no frame image including an intra-frame scene change in the moving image shown in FIG. g1a, g2a, and g3a are GOPs after re-encoding, and correspond to GOP · g1, g2, and g3 in FIG. 5, respectively. The frame image that was an I picture in the first encoded moving image information becomes a field image of an I picture and a field image of a P picture after re-encoding. For example, the frame image I0 (FIG. 5) of the first encoded moving image information is re-encoded as two field images (field image I0t and field image P0b) as shown in FIG. 6 (a). At this time, the reference picture of the field image P0b is the field image I0t. Similarly, the frame images I3 and I6 in FIG. 5 are separated and re-encoded into two field images of an I picture and a P picture. For images other than I pictures, the same picture type as before re-encoding is applied during re-encoding.

図６（ｂ）は、図５に示す第１の符号化動画像情報のフレーム画像Ｉ３がフレーム内シーンチェンジを含む場合の例である。フレーム画像Ｉ３の再符号化においてシーンチェンジフラグｓｃが真となるために、フレーム画像Ｉ３は、Ｐピクチャであるフィールド画像Ｐ３ｔとＩピクチャであるフィールド画像Ｉ３ｂとに分けて再符号化される。図６（ｂ）ではフィールド画像Ｐ３ｔの参照ピクチャはフィールド画像Ｐ２ｔであるが、フィールド画像Ｐ３ｔと同一シーンに属していればフィールド画像Ｐ２ｔ以外のピクチャでも構わない。いずれにしても、フィールド画像Ｐ３ｔがＩピクチャでない限りＧＯＰの先頭とはなりえず、フィールド画像Ｉ０ｔからフィールド画像Ｐ５ｂまでのピクチャが、１つのＧＯＰを構成する。図６（ｂ）のｇ１ｂは、図５におけるＧＯＰ・ｇ１とＧＯＰ・ｇ２とに対応するＧＯＰである。 FIG. 6B shows an example in which the frame image I3 of the first encoded moving image information shown in FIG. 5 includes an intra-frame scene change. Since the scene change flag sc becomes true in the re-encoding of the frame image I3, the frame image I3 is re-encoded separately into a field image P3t that is a P picture and a field image I3b that is an I picture. In FIG. 6B, the reference picture of the field image P3t is the field image P2t, but a picture other than the field image P2t may be used as long as it belongs to the same scene as the field image P3t. In any case, unless the field image P3t is an I picture, it cannot be the head of the GOP, and the pictures from the field image I0t to the field image P5b constitute one GOP. G1b in FIG. 6B is a GOP corresponding to GOP · g1 and GOP · g2 in FIG.

図６（ｃ）も図６（ｂ）と同じく、図５に示すフレーム画像Ｉ３がフレーム内シーンチェンジを含む場合の例である。ただし、フレーム画像Ｉ３は、再符号化後もランダムアクセスが可能となるように、図６（ｂ）とは違い、２枚のフィールド画像ともにＩピクチャとして再符号化される。ＧＯＰ・ｇ１ｃ、ＧＯＰ・ｇ２ｃ、ＧＯＰ・ｇ３ｃは、再符号化前（図５参照）のＧＯＰ・ｇ１、ＧＯＰ・ｇ２、ＧＯＰ・ｇ３に対応する再符号化後のＧＯＰである。 FIG. 6C is an example when the frame image I3 shown in FIG. 5 includes an intra-frame scene change, as in FIG. 6B. However, unlike FIG. 6B, the frame image I3 is re-encoded as an I picture so that random access is possible even after re-encoding. GOP · g1c, GOP · g2c, and GOP · g3c are GOPs after re-encoding corresponding to GOP · g1, GOP · g2, and GOP · g3 before re-encoding (see FIG. 5).

なお、図６（ａ）〜図６（ｃ）では、フレーム画像がＩピクチャとＰピクチャのみからなる例を示したが、第１の符号化動画像情報が、Ｉピクチャ、Ｐピクチャの他に、Ｂピクチャを含んでいてもよい。
以上のようにしてＧＯＰ構造決定部１０５は、フレーム画像毎にピクチャタイプｐｔ１及びピクチャタイプｐｔ２を選択してＧＯＰ構造を決定する。なお、ピクチャタイプｐｔ１及びピクチャタイプｐｔ２の選択方法は、フィールド画像ｆｌｄ１に対しては画面間予測モード、フィールド画像ｆｌｄ２に対しては画面内予測モードが利用できるようなピクチャタイプを選択する方法を用いることができる。
さらに、図３で示した構成に加えてフレーム間シーンチェンジを検出する機能を設け、フレーム内シーンチェンジの有無及びフレーム間シーンチェンジの有無に基づいてピクチャタイプを制御するようにしてもよい。例えば、フレーム間シーンチェンジとフレーム内シーンチェンジがともに存在しない場合には、ピクチャタイプｐｔ１及びピクチャタイプｐｔ２を、ｐｔ１＝Ｐ、ｐｔ２＝Ｐとしてもよい。 FIGS. 6A to 6C show examples in which the frame image is composed only of an I picture and a P picture. However, the first encoded moving picture information includes the I picture and the P picture. , B picture may be included.
As described above, the GOP structure determination unit 105 selects the picture type pt1 and the picture type pt2 for each frame image and determines the GOP structure. As a selection method of the picture type pt1 and the picture type pt2, a method of selecting a picture type that can use an inter-screen prediction mode for the field image fld1 and an intra-screen prediction mode for the field image fld2 is used. be able to.
Further, in addition to the configuration shown in FIG. 3, a function for detecting an inter-frame scene change may be provided, and the picture type may be controlled based on the presence / absence of an intra-frame scene change and the presence / absence of an inter-frame scene change. For example, when neither an inter-frame scene change nor an intra-frame scene change exists, the picture type pt1 and the picture type pt2 may be set to pt1 = P and pt2 = P.

次に、本発明の第１の実施形態による動画像再符号化装置１００が、再符号化を行なう際の処理について説明する。符号化情報復号部２０３は、ピクチャタイプ判別部２０１に入力された第１の符号化動画像情報であるフレーム画像を復号化する。
フレーム−フィールド変換部１０２は、符号化情報復号部２０３が復号化した１枚のフレーム画像を２枚のフィールド画像に変換する。
フレーム内シーンチェンジ検出部１０４は、フレーム−フィールド変換部１０２が変換した２枚のフィールド画像間にシーンの切り替わり点であるシーンチェンジが含まれているか否かを検出する。
ＧＯＰ構造決定部１０５は、フレーム内シーンチェンジ検出部１０４のシーンチェンジの検出結果に応じてフィールド画像毎にピクチャタイプ（Ｉピクチャ又はＰピクチャ）を選択してＧＯＰ構造を決定する。
画像情報符号化部１０３は、フレーム内シーンチェンジ検出部１０４の検出結果に基づいて、ＧＯＰ構造決定部１０５が選択するピクチャタイプを用いて２枚のフィールド画像のそれぞれを符号化する。 Next, processing when the moving image re-encoding device 100 according to the first embodiment of the present invention performs re-encoding will be described. The encoded information decoding unit 203 decodes the frame image that is the first encoded moving image information input to the picture type determining unit 201.
The frame-field conversion unit 102 converts one frame image decoded by the encoding information decoding unit 203 into two field images.
The in-frame scene change detection unit 104 detects whether or not a scene change that is a scene switching point is included between the two field images converted by the frame-field conversion unit 102.
The GOP structure determination unit 105 selects a picture type (I picture or P picture) for each field image in accordance with the scene change detection result of the in-frame scene change detection unit 104 and determines the GOP structure.
Based on the detection result of the in-frame scene change detection unit 104, the image information encoding unit 103 encodes each of the two field images using the picture type selected by the GOP structure determination unit 105.

以上説明したように、フレーム内シーンチェンジ検出部１０４は、第１の符号化動画像情報のうちＩピクチャであるフレーム画像についてフレーム内シーンチェンジの有無を検出し、ＧＯＰ構造決定部１０５は、その結果に基づいて再符号化時にフィールド画像に適用するピクチャタイプを制御する。これにより、動画像再符号化装置１００は、フレーム内シーンチェンジが存在しても、第１の符号化動画像情報を第２の符号化動画像情報へ良好な符号化効率で再符号化できる。 As described above, the intra-frame scene change detection unit 104 detects the presence or absence of an intra-frame scene change for the frame image that is an I picture in the first encoded moving image information, and the GOP structure determination unit 105 Based on the result, the picture type applied to the field image at the time of re-encoding is controlled. As a result, the moving image re-encoding device 100 can re-encode the first encoded moving image information into the second encoded moving image information with good encoding efficiency even if there is an intra-frame scene change. .

（第２の実施形態）
図７は、本発明の第２の実施形態における動画像再符号化装置１１０の構成を示すブロック図である。本実施形態による動画像再符号化装置１１０が、第１の実施形態による動画像再符号化装置１００（図３）と同じ構成を取る部分については、同一の符号を付してそれらの説明を省略する。 (Second Embodiment)
FIG. 7 is a block diagram illustrating a configuration of the moving image re-encoding device 110 according to the second embodiment of the present invention. Parts in which the moving image re-encoding device 110 according to the present embodiment has the same configuration as that of the moving image re-encoding device 100 (FIG. 3) according to the first embodiment are denoted by the same reference numerals and description thereof is provided. Omitted.

動画像再符号化装置１１０は、フレーム−フィールド変換部１１１、ＧＯＰ構造決定部１１２を備えている。
フレーム−フィールド変換部１１１は、ビデオメモリ２０５が出力するフレーム構造の画像情報をフィールド構造に変換し、画像情報符合化部１０３に出力する。
ＧＯＰ構造決定部１１２は、ピクチャタイプ判別部２０１が出力するピクチャタイプに基づいて、ＧＯＰの構造及びピクチャタイプを決定する。 The moving image re-encoding device 110 includes a frame-field conversion unit 111 and a GOP structure determination unit 112.
The frame-field conversion unit 111 converts the frame structure image information output from the video memory 205 into a field structure and outputs the field structure to the image information encoding unit 103.
The GOP structure determination unit 112 determines the GOP structure and picture type based on the picture type output from the picture type determination unit 201.

フレーム−フィールド変換部１１１は、シーンチェンジフラグｓｃが偽でフィールド画像ｆｌｄ１を出力する以外の場合にはフレーム−フィールド変換部１０２と同様の動作をする。シーンチェンジフラグｓｃが真の場合、シーンチェンジ前のシーンに属するフィールド画像ｆｌｄ１の代わりに、シーンチェンジ後のシーンに属するフィールド画像ｆｌｄ２と同一の画像を出力する。フィールド画像ｆｌｄ２の出力は、フレーム−フィールド変換部１０２と同様である。 The frame-field conversion unit 111 operates in the same manner as the frame-field conversion unit 102 except when the scene change flag sc is false and the field image fld1 is output. When the scene change flag sc is true, the same image as the field image fld2 belonging to the scene after the scene change is output instead of the field image fld1 belonging to the scene before the scene change. The output of the field image fld2 is the same as that of the frame-field conversion unit 102.

図８は、１枚のフレーム画像から２枚のフィールド画像への変換処理を説明するための図である。図８に示すように、フレーム−フィールド変換部１１１（図７）は、フレーム内シーンチェンジ検出部１０４がシーンチェンジを検出した場合に、２枚のフィールド画像のうち一方のフィールド画像ｆｌｄ１を、もう一方のフィールド画像ｆｌｄ２と同一のフィールド画像に変換する。逆に、フィールド画像ｆｌｄ２の画像をフィールド画像ｆｌｄ１の画像と同一にして出力することもできるが、ＧＯＰの先頭フレーム画像でそのようにすると、ＧＯＰの先頭フレーム画像だけがシーンチェンジ前のシーンに属する画像となってしまう。したがって、フィールド画像ｆｌｄ１としてフィールド画像ｆｌｄ２と同一の画像を出力することが望ましい。
ＧＯＰ構造決定部１１２は、ＧＯＰ構造決定部１０５と同様に、２枚のフィールド画像ｆｌｄ１及びフィールド画像ｆｌｄ２に対してそれぞれピクチャタイプｐｔ１とピクチャタイプｐｔ２を選択してＧＯＰの構造を決定する。 FIG. 8 is a diagram for explaining conversion processing from one frame image to two field images. As shown in FIG. 8, when the in-frame scene change detection unit 104 detects a scene change, the frame-field conversion unit 111 (FIG. 7) selects one of the two field images fld1. One field image fld2 is converted into the same field image. Conversely, the image of the field image fld2 can be output in the same manner as the image of the field image fld1, but if this is done with the first frame image of the GOP, only the first frame image of the GOP belongs to the scene before the scene change. It becomes an image. Therefore, it is desirable to output the same image as the field image fld2 as the field image fld1.
Similar to the GOP structure determining unit 105, the GOP structure determining unit 112 selects the picture type pt1 and the picture type pt2 for the two field images fld1 and field image fld2, respectively, and determines the GOP structure.

図９は、ＧＯＰ構造決定部１１２におけるピクチャタイプｐｔ１とピクチャタイプｐｔ２の選択処理の流れを示すフローチャートである。以下、図９のフローチャートに沿って、ＧＯＰ構造決定部１１２におけるピクチャタイプの選択処理の各ステップを説明する。
まず、ステップＳ２０では、再符号化対象のフレーム画像ｆｒｍのピクチャタイプｐｔがＩであるか否か、つまり、ｐｔ＝Ｉであるか否かについて判定する。
ステップＳ２１は、ステップＳ２０でピクチャタイプｐｔ＝Ｉの場合に行われる処理である。ここでフレーム内シーンチェンジが検出されていない場合には、フィールド画像ｆｌｄ１とフィールド画像ｆｌｄ２とは相関が高いため、フィールド画像ｆｌｄ２はフィールド画像ｆｌｄ１を参照ピクチャとする画面間予測によって符号化（ｐｔ１＝Ｉ，ｐｔ２＝Ｐ）すれば、符号化効率を向上できる。
一方、フレーム内シーンチェンジが検出されている場合には、フレーム−フィールド変換部１１１からは、フィールド画像ｆｌｄ１及びフィールド画像ｆｌｄ２として同一の画像が出力される。したがって、フィールド画像ｆｌｄ２は、フィールド画像ｆｌｄ１を参照ピクチャとする画面間予測によって、ほとんど符号量を消費せずに符号化が可能である。結局、フレーム内シーンチェンジが検出されているか否かに関わらず、ピクチャタイプｐｔ１とピクチャタイプｐｔ２は、ｐｔ１＝Ｉ、ｐｔ２＝Ｐとする。 FIG. 9 is a flowchart showing a flow of selection processing of the picture type pt1 and the picture type pt2 in the GOP structure determination unit 112. Hereinafter, the steps of the picture type selection process in the GOP structure determination unit 112 will be described with reference to the flowchart of FIG.
First, in step S20, it is determined whether or not the picture type pt of the frame image frm to be re-encoded is I, that is, whether or not pt = I.
Step S21 is a process performed when the picture type pt = I in step S20. Here, when the intra-frame scene change is not detected, the field image fld1 and the field image fld2 have a high correlation. Therefore, the field image fld2 is encoded by inter-screen prediction using the field image fld1 as a reference picture (pt1 = I, pt2 = P), the encoding efficiency can be improved.
On the other hand, when an intra-frame scene change is detected, the same image is output from the frame-field conversion unit 111 as the field image fld1 and the field image fld2. Therefore, the field image fld2 can be encoded with almost no code amount by inter-screen prediction using the field image fld1 as a reference picture. Eventually, regardless of whether or not an intra-frame scene change is detected, the picture type pt1 and the picture type pt2 are set to pt1 = I and pt2 = P.

一方、ステップＳ２２は、ピクチャタイプｐｔ＝Ｉでない場合に行われる処理である。ここでは、ピクチャタイプｐｔ１及びピクチャタイプｐｔ２は、第１の符号化動画像情報におけるピクチャタイプｐｔを継承する（ｐｔ１＝ｐｔ，ｐｔ２＝ｐｔ）。
ＧＯＰ構造決定部１１２の動作例を、図５と図１０を例に説明する。図１０は図６と同様に再符号化後の第２の符号化動画像情報の模式図であり、第１の符号化動画像情報のフレーム画像Ｉ３（図５）がフレーム内シーンチェンジを有した場合の例である。フレーム画像Ｉ３は、図９のステップＳ２１の処理によってフィールド画像Ｉ３ｔとフィールド画像Ｐ３ｂとに分けて再符号化される。このとき、フレーム−フィールド変換部１１１の処理により、フィールド画像Ｉ３ｔはフィールド画像Ｐ３ｂと同一の画像となっており、フィールド画像Ｐ３ｂの参照ピクチャはフィールド画像Ｉ３ｔである。フィールド画像Ｉ３ｔはＧＯＰの先頭ピクチャであるため、ランダムアクセスが可能である。 On the other hand, step S22 is a process performed when the picture type pt is not equal to I. Here, the picture type pt1 and the picture type pt2 inherit the picture type pt in the first encoded moving picture information (pt1 = pt, pt2 = pt).
An example of the operation of the GOP structure determination unit 112 will be described with reference to FIGS. FIG. 10 is a schematic diagram of the second encoded moving image information after re-encoding as in FIG. 6, and the frame image I3 (FIG. 5) of the first encoded moving image information has an intra-frame scene change. This is an example. The frame image I3 is re-encoded separately for the field image I3t and the field image P3b by the process of step S21 in FIG. At this time, the field image I3t is the same image as the field image P3b by the processing of the frame-field conversion unit 111, and the reference picture of the field image P3b is the field image I3t. Since the field image I3t is the first picture of the GOP, random access is possible.

以上のようにしてＧＯＰ構造決定部１１２は、ピクチャタイプｐｔ１及びピクチャタイプｐｔ２を選択してＧＯＰ構造を決定する。なお、ピクチャタイプｐｔ１及びピクチャタイプｐｔ２の選択方法は、フィールド画像ｆｌｄ１に対しては画面内予測モード、フィールド画像ｆｌｄ２に対しては画面間予測モードが利用できるようなピクチャタイプを選択する方法であればよい。
さらに、第１の実施形態と同様に、本実施形態の構成に加えてフレーム間シーンチェンジを検出する機能を設け、フレーム内シーンチェンジの有無及びフレーム間シーンチェンジの有無に基づいてピクチャタイプを制御するようにしてもよい。 As described above, the GOP structure determination unit 112 selects the picture type pt1 and the picture type pt2 and determines the GOP structure. Note that the selection method of the picture type pt1 and the picture type pt2 is a method of selecting a picture type that can use the intra prediction mode for the field image fld1 and the inter prediction mode for the field image fld2. That's fine.
Furthermore, as in the first embodiment, in addition to the configuration of the present embodiment, a function for detecting an inter-frame scene change is provided, and the picture type is controlled based on the presence / absence of an intra-frame scene change and the presence / absence of an inter-frame scene change. You may make it do.

以上説明したように、第１の符号化動画像情報においてフレーム内シーンチェンジ検出部１０４がフレーム内シーンチェンジを検出した際に、フレーム−フィールド変換部１１１は、フィールド画像ｆｌｄ１としてフィールド画像ｆｌｄ２と同一の画像を出力してフレーム画像ｆｒｍがフレーム内シーンチェンジを有しない状態にする。これにより、動画像再符号化装置１１０は、フレーム内シーンチェンジが存在しても、第１の符号化動画像情報を第２の符号化動画像情報へ良好な符号化効率で再符号化できる。 As described above, when the intra-frame scene change detection unit 104 detects the intra-frame scene change in the first encoded moving image information, the frame-field conversion unit 111 is the same as the field image fld2 as the field image fld1. Is output so that the frame image frm has no intra-frame scene change. Thereby, the moving image re-encoding device 110 can re-encode the first encoded moving image information into the second encoded moving image information with good encoding efficiency even if there is an intra-frame scene change. .

なお、第２の実施形態では、フレーム内シーンチェンジ検出部１０４がシーンチェンジを検出した場合に、フレーム−フィールド変換部１１１が、２枚のフィールド画像のうち一方のフィールド画像ｆｌｄ１を、もう一方のフィールド画像ｆｌｄ２と同一のフィールド画像に変換する場合について説明したが、これに限定されるものではない。例えば、フレーム−フィールド変換部１１１は、フレーム内シーンチェンジ検出部１０４がシーンチェンジを検出した場合に、２枚のフィールド画像ｆｌｄ１、ｆｌｄ２のうち一方のフィールド画像ｆｌｄ１をもう一方のフィールド画像ｆｌｄ２との差分が減少するように、つまり、フィールド画像ｆｌｄ１とフィールド画像ｆｌｄ２とが類似するように変換してもよい。このような処理を行なうことにより、一方のフィールド画像ｆｌｄ１をもう一方のフィールド画像ｆｌｄ２から容易に予測することができるようになり、フレーム画像ｆｒｍを符号化する際の情報量を削減することができる。 In the second embodiment, when the in-frame scene change detection unit 104 detects a scene change, the frame-field conversion unit 111 converts one field image fld1 of the two field images to the other field image. Although the case of converting to the same field image as the field image fld2 has been described, the present invention is not limited to this. For example, when the in-frame scene change detection unit 104 detects a scene change, the frame-field conversion unit 111 converts one field image fld1 of the two field images fld1 and fld2 with the other field image fld2. The conversion may be performed so that the difference decreases, that is, the field image fld1 and the field image fld2 are similar. By performing such processing, one field image fld1 can be easily predicted from the other field image fld2, and the amount of information when the frame image frm is encoded can be reduced. .

（第３の実施形態）
図１１は、本発明の第３の実施形態による動画像再符号化装置１２０の構成を示すブロック図である。本実施形態による動画像再符号化装置１２０が、第１の実施形態による動画像再符号化装置１００（図３）と同じ構成を取る部分については、同一の符号を付してそれらの説明を省略する。 (Third embodiment)
FIG. 11 is a block diagram showing a configuration of a moving image re-encoding device 120 according to the third embodiment of the present invention. Parts in which the moving image re-encoding device 120 according to the present embodiment has the same configuration as that of the moving image re-encoding device 100 (FIG. 3) according to the first embodiment are denoted by the same reference numerals and description thereof will be given. Omitted.

画像情報符合化部１２１は、フレーム−フィールド変換部１０２が出力するフィールド画像を、第２の符号化方式を用いて符号化し、動画像再符号化装置１２０の外部に出力する。
画像情報符号化部１２１は、第２の符号化方式を用いて画像情報を符号化する点については、画像情報符号化部１０３と同様である。画像情報符号化部１０３と異なるのは、ピクチャタイプについては、第１の符号化動画像情報におけるフレーム画像のピクチャタイプｐｔを継承して、フィールド画像ｆｌｄ１とフィールド画像ｆｌｄ２のピクチャタイプｐｔ１及びピクチャタイプｐｔ２とする点と、符号量制御の際にシーンチェンジフラグｓｃを参照して量子化幅を決定する点である。 The image information encoding unit 121 encodes the field image output from the frame-field conversion unit 102 using the second encoding method, and outputs the encoded image to the outside of the moving image re-encoding device 120.
The image information encoding unit 121 is the same as the image information encoding unit 103 in that the image information is encoded using the second encoding method. The difference from the image information encoding unit 103 is that the picture type inherits the picture type pt of the frame image in the first encoded moving image information, and the picture type pt1 and the picture type of the field image fld1 and the field image fld2 The point is pt2, and the point where the quantization width is determined with reference to the scene change flag sc during the code amount control.

再符号化対象のフレーム画像ｆｒｍがフレーム内シーンチェンジを有していた場合、シーンチェンジ前のフィールド画像ｆｌｄ１はあるシーンの最後の画像であって、時間的にこれ以降のピクチャから参照される可能性は低く、フィールド画像ｆｌｄ１に多くの符号量を割り当てることは符号化効率の低下につながる。逆に、シーンチェンジ後のフィールド画像ｆｌｄ２は別のシーンの最初の画像であって時間的にそれ以降のピクチャの画質に影響を与える可能性が高く、フィールド画像ｆｌｄ２に多くの符号量を割り当てることにより符号化効率が向上すると考えられる。 If the frame image frm to be re-encoded has an intra-frame scene change, the field image fld1 before the scene change is the last image of a scene and can be temporally referenced from the subsequent pictures. Therefore, assigning a large amount of code to the field image fld1 leads to a decrease in encoding efficiency. On the other hand, the field image fld2 after the scene change is the first image of another scene and has a high possibility of affecting the image quality of subsequent pictures in time, and a large amount of code is assigned to the field image fld2. Therefore, it is considered that the encoding efficiency is improved.

これを考慮し、画像情報符号化部１２１は、フィールド画像ｆｌｄ１及びフィールド画像ｆｌｄ２に対する量子化幅をＱ１及びＱ２を、各フィールドに対して算出した量子化幅を初期値Ｑ１’及びＱ２’と、各フィールド画像に対する量子化幅の補正量ΔＱ１及びΔＱ２を用いて、Ｑ１＝Ｑ１’＋ΔＱ１、Ｑ２＝Ｑ２’＋ΔＱ２と算出する。 Considering this, the image information encoding unit 121 sets the quantization widths for the field image fld1 and the field image fld2 as Q1 and Q2, and the quantization width calculated for each field as initial values Q1 ′ and Q2 ′. Using the quantization width correction amounts ΔQ1 and ΔQ2 for each field image, Q1 = Q1 ′ + ΔQ1 and Q2 = Q2 ′ + ΔQ2 are calculated.

図１２は、画像情報符号化部１２１において、量子化幅の補正量ΔＱ１及びΔＱ２を決定する処理の流れを示すフローチャートである。
まず、ステップＳ３０では、シーンチェンジフラグｓｃの値を判定する。
ステップＳ３１では、画像情報符号化部１２１は、フレーム内シーンチェンジ検出部１０４がシーンチェンジｓｃを検出した場合に、符号化対象のフレーム画像を構成しシーンチェンジ前のシーンに属するフィールド画像の符号化に用いられる平均量子化幅ΔＱ１を、同一の予測モードで直前のフィールド画像の符号化に用いられる平均量子化幅ΔＱ２よりも増加させる。具体的には、シーンチェンジ前のフィールド画像ｆｌｄ１における量子化幅Ｑ１がＱ１’よりも大きく、シーンチェンジ後のフィールド画像ｆｌｄ２における量子化幅Ｑ２がＱ２’よりも小さくなるようにΔＱ１及びΔＱ２を定める。本実施形態では、量子化幅の補正量を、ΔＱ１＝＋ａ，ΔＱ２＝−ａにより算出する。ここで、ａは変数である。なお、ΔＱ１、ΔＱ２の算出方法として他の方法を使用してもよい。
変数ａに正の値を与えればフィールド画像ｆｌｄ１の量子化幅Ｑ１がより大きくなり、フィールド画像ｆｌｄ２の量子化幅Ｑ２がより小さくなる。すなわち、シーンチェンジ前であるフィールド画像ｆｌｄ１の割当て符号量が減少し、シーンチェンジ後であるフィールド画像ｆｌｄ２の割当て符号量が増加することになる。用いる符号化方式に従ってａの絶対値を変えることによって、画質の変化の度合いを調節できる。 FIG. 12 is a flowchart showing a flow of processing for determining quantization width correction amounts ΔQ1 and ΔQ2 in the image information encoding unit 121.
First, in step S30, the value of the scene change flag sc is determined.
In step S31, when the intra-frame scene change detection unit 104 detects a scene change sc, the image information encoding unit 121 configures an encoding target frame image and encodes a field image belonging to the scene before the scene change. The average quantization width ΔQ1 used in the above is increased more than the average quantization width ΔQ2 used for encoding the immediately preceding field image in the same prediction mode. Specifically, ΔQ1 and ΔQ2 are determined so that the quantization width Q1 in the field image fld1 before the scene change is larger than Q1 ′ and the quantization width Q2 in the field image fld2 after the scene change is smaller than Q2 ′. . In the present embodiment, the correction amount of the quantization width is calculated by ΔQ1 = + a and ΔQ2 = −a. Here, a is a variable. In addition, you may use another method as a calculation method of (DELTA) Q1 and (DELTA) Q2.
If a positive value is given to the variable a, the quantization width Q1 of the field image fld1 becomes larger, and the quantization width Q2 of the field image fld2 becomes smaller. That is, the assigned code amount of the field image fld1 before the scene change decreases, and the assigned code amount of the field image fld2 after the scene change increases. The degree of change in image quality can be adjusted by changing the absolute value of a according to the encoding method used.

ステップＳ３２は、シーンチェンジフラグｓｃが偽である場合に行われる処理であり、ΔＱ１＝０，ΔＱ２＝０に設定する。
このようにして画像情報符号化部１２１では、フレーム画像ｆｒｍがフレーム内シーンチェンジを有する場合にも適切な符号量制御を行なう。なお、量子化幅の算出方法は、シーンチェンジ前のシーンに属するフィールドに割り当てる符号量を相対的に減らすか、シーンチェンジ後のシーンに属するフィールドに割当てる符号量を相対的に増やす方法であれば、本実施形態の方法に限定されるものではない。
例えば、画像情報符号化部１２１は、フレーム内シーンチェンジ検出部１０４がシーンチェンジｓｃを検出した場合に、符号化対象のフレーム画像を構成しシーンチェンジ後のシーンに属するフィールド画像の符号化に用いられる平均量子化幅を、符号化対象のフレーム画像を構成しシーンチェンジ前のシーンに属するフィールド画像の符号化に用いられる平均量子化幅よりも小さくするようにしてもよい。このような処理を行なうことにより、シーンチェンジ前のシーンに属するフィールド画像に対して、シーンチェンジ後のシーンに属するフィールド画像の情報量を増加させることができる。 Step S32 is a process performed when the scene change flag sc is false, and ΔQ1 = 0 and ΔQ2 = 0 are set.
In this way, the image information encoding unit 121 performs appropriate code amount control even when the frame image frm has an intra-frame scene change. Note that the quantization width can be calculated by relatively reducing the code amount assigned to the field belonging to the scene before the scene change or relatively increasing the code amount assigned to the field belonging to the scene after the scene change. The method is not limited to the method of this embodiment.
For example, when the intra-frame scene change detection unit 104 detects a scene change sc, the image information encoding unit 121 configures a frame image to be encoded and is used for encoding a field image belonging to the scene after the scene change. The average quantization width that is formed may be smaller than the average quantization width that is used to encode the field image that forms the frame image to be encoded and belongs to the scene before the scene change. By performing such processing, the information amount of the field image belonging to the scene after the scene change can be increased with respect to the field image belonging to the scene before the scene change.

なお、符号量の制御方法として、上述した説明では各フィールド画像に対する量子化幅は各マクロブロックに対し一律としたが、算出した量子化幅Ｑ１及びＱ２を基準としてマクロブロック毎に量子化幅を調整してもよい。 As a method for controlling the code amount, in the above description, the quantization width for each field image is uniform for each macroblock. However, the quantization width for each macroblock is set based on the calculated quantization widths Q1 and Q2. You may adjust.

以上説明したように、第１の符号化動画像情報においてフレーム内シーンチェンジ検出部１０４がフレーム内シーンチェンジを検出した際に、画像情報符号化部１２１は、シーンチェンジ前後のフィールド画像に適用する量子化幅を補正する。これにより、動画像再符号化装置１２０は、フレーム内シーンチェンジが存在しても、第１の符号化動画像情報を第２の符号化動画像情報へ良好な符号化効率で再符号化できる。さらに、本実施形態を第１の実施形態あるいは第２の実施形態と組み合わせて実施することも可能であり、そうすれば符号化効率はより向上する。 As described above, when the intra-frame scene change detection unit 104 detects the intra-frame scene change in the first encoded moving image information, the image information encoding unit 121 applies the field image before and after the scene change. Correct the quantization width. As a result, the moving image re-encoding device 120 can re-encode the first encoded moving image information into the second encoded moving image information with good encoding efficiency even if there is an intra-frame scene change. . Furthermore, the present embodiment can be implemented in combination with the first embodiment or the second embodiment, and the encoding efficiency is further improved.

なお、上述した第１〜第３の実施形態による動画像再符号化装置は、再符号化にはフィールド構造のみを使用しているが、１つの符号化動画像内でフレーム画像毎にフレーム構造とフィールド構造を適応的に選択するフレーム−フィールド適応符号化が可能な符号化方式であれば、それを用いてもよい。 The moving image re-encoding device according to the first to third embodiments described above uses only the field structure for re-encoding, but the frame structure for each frame image in one encoded moving image. As long as it is an encoding scheme capable of frame-field adaptive encoding that adaptively selects the field structure, it may be used.

本発明の実施形態に係る動画像再符号化装置は、第１の符号化動画像情報におけるフレーム内シーンチェンジの有無を検出し、その結果に基づいて再符号化処理を制御することにより、フレーム内シーンチェンジが存在しても、良好な符号化効率で第２の符号化動画像情報へと再符号化できる。フレーム内シーンチェンジの検出時に、第１の実施形態の場合にはシーンチェンジ前のシーンに属するフィールド画像をより符号化効率の良いＰピクチャとして符号化することで上記効果を得ることができる。
第２の実施形態の場合には、フレーム内シーンチェンジの直前のフィールド画像をシーンチェンジ直後のフィールド画像で代替してフレーム内シーンチェンジを解消することで上記効果を得ることができる。
第３の実施形態の場合には、フレーム内シーンチェンジが起こったフレーム画像を構成するフィールド画像の量子化幅を変化させることにより割り当てる符号量を調節し、符号化効率の低下を防ぐことができる。 The moving image re-encoding device according to the embodiment of the present invention detects the presence or absence of an intra-frame scene change in the first encoded moving image information, and controls the re-encoding process based on the result, thereby Even if there is an inner scene change, it can be re-encoded to the second encoded moving image information with good encoding efficiency. In the case of the first embodiment, when the in-frame scene change is detected, the above-described effect can be obtained by encoding the field image belonging to the scene before the scene change as a P picture with higher encoding efficiency.
In the case of the second embodiment, the above effect can be obtained by substituting the field image immediately before the scene change within the frame with the field image immediately after the scene change to eliminate the scene change within the frame.
In the case of the third embodiment, the amount of code to be assigned can be adjusted by changing the quantization width of the field image constituting the frame image in which the intra-frame scene change has occurred, thereby preventing a decrease in encoding efficiency. .

なお、以上説明した実施形態において、図３、図７、図１１の動きベクトル合成部１０１、フレーム−フィールド変換部１０２、画像情報符号化部１０３、フレーム内シーンチェンジ検出部１０４、ＧＯＰ構造決定部１０５、フレーム−フィールド変換部１１１、ＧＯＰ構造決定部１１２、画像情報符合化部１２１、ピクチャタイプ判別部２０１、符号化情報解析部２０２、符号化情報復号部２０３、動きベクトル検出部２０８、コンプレキシティ算出部２１０の機能又はこれらの機能の一部を実現するためのプログラムをコンピュータ読み取り可能な記録媒体に記録して、この記録媒体に記録されたプログラムをコンピュータシステムに読み込ませ、実行することにより動画像再符号化装置の制御を行なってもよい。なお、ここでいう「コンピュータシステム」とは、ＯＳや周辺機器等のハードウェアを含むものとする。 In the embodiment described above, the motion vector synthesis unit 101, the frame-field conversion unit 102, the image information encoding unit 103, the intra-frame scene change detection unit 104, and the GOP structure determination unit shown in FIGS. 105, frame-field conversion unit 111, GOP structure determination unit 112, image information encoding unit 121, picture type determination unit 201, encoded information analysis unit 202, encoded information decoding unit 203, motion vector detection unit 208, complex By recording the function of the city calculation unit 210 or a program for realizing a part of these functions on a computer-readable recording medium, and causing the computer system to read and execute the program recorded on the recording medium The moving image re-encoding device may be controlled. Here, the “computer system” includes an OS and hardware such as peripheral devices.

また、「コンピュータ読み取り可能な記録媒体」とは、フレキシブルディスク、光磁気ディスク、ＲＯＭ、ＣＤ−ＲＯＭ等の可搬媒体、コンピュータシステムに内蔵されるハードディスク等の記憶装置のことをいう。さらに「コンピュータ読み取り可能な記録媒体」とは、インターネット等のネットワークや電話回線等の通信回線を介してプログラムを送信する場合の通信線のように、短時刻の間、動的にプログラムを保持するもの、その場合のサーバやクライアントとなるコンピュータシステム内部の揮発性メモリのように、一定時刻プログラムを保持しているものも含むものとする。また上記プログラムは、前述した機能の一部を実現するためのものであっても良く、さらに前述した機能をコンピュータシステムにすでに記録されているプログラムとの組み合わせで実現できるものであっても良い。 The “computer-readable recording medium” refers to a storage device such as a flexible medium, a magneto-optical disk, a portable medium such as a ROM and a CD-ROM, and a hard disk incorporated in a computer system. Further, the “computer-readable recording medium” dynamically holds a program for a short time, like a communication line when transmitting a program via a network such as the Internet or a communication line such as a telephone line. In this case, it is also assumed that a server that holds a program for a certain time, such as a volatile memory inside a computer system that serves as a server or client. The program may be a program for realizing a part of the functions described above, and may be a program capable of realizing the functions described above in combination with a program already recorded in a computer system.

以上、この発明の実施形態について図面を参照して詳述してきたが、具体的な構成はこの実施形態に限られるものではなく、この発明の要旨を逸脱しない範囲の設計等も含まれる。 The embodiment of the present invention has been described in detail with reference to the drawings. However, the specific configuration is not limited to this embodiment, and includes designs and the like that do not depart from the gist of the present invention.

２枚のフィールド画像ｆｌｄ１及びｆｌｄ２が、１枚のフレーム画像ｆｒｍを構成する例を示した図である。FIG. 5 is a diagram illustrating an example in which two field images fld1 and fld2 constitute one frame image frm. ピクチャ間の相関性の例を示す図である。It is a figure which shows the example of the correlation between pictures. 本発明の第１の実施形態における動画像再符号化装置１００の構成を示すブロック図である。It is a block diagram which shows the structure of the moving image re-encoding apparatus 100 in the 1st Embodiment of this invention. 本発明の実施形態によるＧＯＰ構造決定部１０５の処理を示すフローチャートである。It is a flowchart which shows the process of the GOP structure determination part 105 by embodiment of this invention. 本発明の第１の実施形態による第１の符号化動画像情報の模式図である。It is a schematic diagram of the 1st encoding moving image information by the 1st Embodiment of this invention. 本発明の第１の実施形態による第２の符号化動画像情報の模式図である。It is a schematic diagram of the 2nd encoding moving image information by the 1st Embodiment of this invention. 本発明の第２の実施形態における動画像再符号化装置１１０の構成を示すブロック図である。It is a block diagram which shows the structure of the moving image re-encoding apparatus 110 in the 2nd Embodiment of this invention. １枚のフレーム画像から２枚のフィールド画像への変換処理を説明するための図である。It is a figure for demonstrating the conversion process from one frame image to two field images. ＧＯＰ構造決定部１１２におけるピクチャタイプｐｔ１とピクチャタイプｐｔ２の選択処理の流れを示すフローチャートである。It is a flowchart which shows the flow of the selection process of the picture type pt1 in the GOP structure determination part 112, and the picture type pt2. 本発明の第２の実施形態による第２の符号化動画像情報の模式図である。It is a schematic diagram of the 2nd encoding moving image information by the 2nd Embodiment of this invention. 本発明の第３の実施形態による動画像再符号化装置１２０の構成を示すブロック図である。It is a block diagram which shows the structure of the moving image re-encoding apparatus 120 by the 3rd Embodiment of this invention. 画像情報符号化部１２１において、量子化幅の補正量ΔＱ１及びΔＱ２を決定する処理の流れを示すフローチャートである。12 is a flowchart showing a flow of processing for determining quantization width correction amounts ΔQ1 and ΔQ2 in the image information encoding unit 121. 従来の動画像再符号化装置５００の構成を示すブロック図である。FIG. 10 is a block diagram showing a configuration of a conventional moving image re-encoding device 500. 動画像にシーンチェンジが起こる場合について説明するための図である。It is a figure for demonstrating the case where a scene change occurs in a moving image.

Explanation of symbols

１００・・・動画像再符号化装置、１０１・・・動きベクトル合成部、１０２・・・フレーム−フィールド変換部、１０３・・・画像情報符号化部、１０４・・・フレーム内シーンチェンジ検出部、１０５・・・ＧＯＰ構造決定部、１１０・・・動画像再符号化装置、１１１・・・フレーム−フィールド変換部、１１２・・・ＧＯＰ構造決定部、１２０・・・動画像再符号化装置、１２１・・・画像情報符合化部、２０１・・・ピクチャタイプ判別部、２０２・・・符号化情報解析部、２０３・・・符号化情報復号部、２０５・・・ビデオメモリ、２０８・・・動きベクトル検出部、２０９・・・情報バッファ、２１０・・・コンプレキシティ算出部 DESCRIPTION OF SYMBOLS 100 ... Moving image re-encoding apparatus, 101 ... Motion vector synthetic | combination part, 102 ... Frame-field conversion part, 103 ... Image information encoding part, 104 ... In-frame scene change detection part , 105... GOP structure determination unit, 110... Video re-encoding device, 111... Frame-field conversion unit, 112... GOP structure determination unit, 120. , 121 ... Image information encoding unit, 201 ... Picture type discrimination unit, 202 ... Encoding information analysis unit, 203 ... Encoding information decoding unit, 205 ... Video memory, 208 ...・ Motion vector detection unit, 209... Information buffer, 210... Complexity calculation unit

Claims

A frame-field conversion unit for converting one frame image into two field images;
A scene change detection unit for detecting whether or not a scene change that is a scene switching point is included between the two field images;
An image information encoding unit that encodes each of the two field images based on a detection result of the scene change detection unit using an intra prediction mode or an inter prediction mode;
A moving picture encoding apparatus comprising:

The moving image encoding apparatus according to claim 1, wherein the image information encoding unit encodes the frame image as two field images when the scene change detection unit detects a scene change. .

A GOP structure determining unit for selecting a GOP structure by selecting an intra-screen prediction mode or an inter-screen prediction mode for each field image according to a scene change detection result of the scene change detection unit;
The moving image encoding apparatus according to claim 2, wherein the image information encoding unit encodes each field image using the prediction mode selected by the GOP structure determination unit.

The GOP structure determination unit is used for the two field images according to the prediction mode used for the frame image converted by the frame-field conversion unit when the scene change detection unit detects a scene change. 4. The moving picture encoding apparatus according to claim 3, wherein a combination of prediction modes is selected.

5. The GOP structure determination unit according to claim 4, wherein, when the scene change detection unit detects a scene change, the prediction mode used for the field image before the scene change is set as an inter-screen prediction mode. The moving image encoding apparatus described.

The frame-field conversion unit converts one field image of the two field images so that a difference from the other field image is reduced when the scene change detection unit detects a scene change. The moving picture encoding apparatus according to claim 2, wherein:

The frame-field conversion unit converts one field image of the two field images into the same field image as the other field image when the scene change detection unit detects a scene change. The moving picture coding apparatus according to claim 6.

When the scene change detection unit detects a scene change, the image information encoding unit forms an encoding target frame image and is used for encoding a field image belonging to the scene before the scene change. The moving picture coding apparatus according to claim 2, characterized in that is increased more than an average quantization width used for coding the immediately preceding field picture in the same prediction mode.

When the scene change detection unit detects a scene change, the image information encoding unit forms an encoding target frame image and is used for encoding a field image belonging to the scene after the scene change. 3. The moving picture coding apparatus according to claim 2, wherein the video image coding apparatus is made smaller than an average quantization width used for coding a field image constituting a frame image to be coded and belonging to a scene before a scene change.

On the computer,
A first step of converting one frame image into two field images;
A second step of detecting whether a scene change that is a scene switching point is included between the two field images;
A third step of encoding each of the two field images based on a detection result of the scene change detection unit using an intra prediction mode or an inter prediction mode;
A program characterized in that is executed.

A frame-field conversion process for converting one frame image into two field images;
A scene change detection process for detecting whether a scene change that is a scene switching point is included between the two field images;
An image information encoding process for encoding each of the two field images based on a detection result of the scene change detection unit using an intra prediction mode or an inter prediction mode;
A moving picture encoding method comprising: