JP2008098980A

JP2008098980A - Moving image encoding apparatus

Info

Publication number: JP2008098980A
Application number: JP2006278475A
Authority: JP
Inventors: Hideki Takehara; 英樹竹原; Takashi Morishige; 孝森重; Shigeru Fukushima; 茂福島; Hiroyuki Kurashige; 宏之倉重
Original assignee: Victor Company of Japan Ltd
Current assignee: Victor Company of Japan Ltd
Priority date: 2006-10-12
Filing date: 2006-10-12
Publication date: 2008-04-24

Abstract

<P>PROBLEM TO BE SOLVED: To provide a picture signal encoding apparatus in which picture quality of a moving image is equalized as well as the processing amount is significantly reduced in comparison with a conventional apparatus. <P>SOLUTION: The moving image encoding apparatus extracts character information of the moving image in encoding an input moving image, and determines a GOP structure based thereon. The apparatus determines a first encoding parameter based on the character information of the moving image and the GOP structure, and calculates encoding complexity for each picture using the character information of the moving image and the GOP structure and the first encoding parameter (a first encoding section 20). Then, the apparatus stores the first encoding parameter and the encoding complexity in a recording medium 30. Next, the apparatus determines a second encoding parameter using the first encoding parameter and the encoding complexity stored in the recording medium 30 in encoding the input moving image of the same content as a previous image again, and encodes the input moving image using the second encoding parameter (a second encoding section 40). <P>COPYRIGHT: (C)2008,JPO&INPIT

Description

本発明は、動き補償予測を用いて符号化を行う動画像符号化装置に関する。 The present invention relates to a moving picture coding apparatus that performs coding using motion compensation prediction.

映像信号を第１のパスと第２のパスに分けて符号化する２パス型可変ビットレート映像信号符号化装置が知られている（例えば、特開平７-２８４０９７号公報、特開平８-２６５７４７号公報）。これらの従来の映像信号符号化装置では、例えば、入力された２時間の映像信号を、第１のパスの符号化において、発生データ量の大小にかかわらず一定の量子化幅で量子化した後、可変長符号化し、得られた符号化データ量の情報を記憶回路に所定単位毎に記録する。符号化方式としてＭＰＥＧによる圧縮符号化方式を採用した場合は、映像信号は全ビデオシーケンスに対してピクチャ単位に発生符号量を情報として記憶回路に記録される。なお、特開平８-２６５７４７号公報では第１のパスの前段にＧＯＰ構造を決める回路が付加されている。ＭＰＥＧに代表されるような可変長符号化を行っている符号化方式は、量子化幅を固定にして第１のパスで符号化を行うと、符号化画像の複雑さや動き補償残差成分量に応じて発生符号量が多くなる。この性質を利用して、発生符号量の分布状態を予め調査しておく。 2. Description of the Related Art There are known two-pass variable bit rate video signal encoding apparatuses that encode a video signal by dividing it into a first pass and a second pass (for example, JP-A-7-284097 and JP-A-8-265747). Issue gazette). In these conventional video signal encoding devices, for example, after the input two-hour video signal is quantized with a constant quantization width regardless of the amount of generated data in the first pass encoding. Then, variable length coding is performed, and information of the obtained encoded data amount is recorded in the storage circuit for each predetermined unit. When a compression encoding method based on MPEG is adopted as the encoding method, the video signal is recorded in the storage circuit as information on the generated code amount for each video sequence as a picture unit. In JP-A-8-265747, a circuit for determining the GOP structure is added before the first pass. When encoding is performed in the first pass with a fixed quantization width, encoding schemes such as MPEG, which perform variable length encoding, the complexity of the encoded image and the amount of motion compensation residual component are as follows. Accordingly, the amount of generated code increases. Using this property, the distribution state of the generated code amount is investigated in advance.

そして、第２のパスの符号化で、その分布に極力近くなるように符号量の配分を行うことで、全ビデオシーケンスの画質を均一にすることが可能となる。
この場合、第１のパスでは一般的に量子化幅を小さめにして符号化を行い、第２のパスで出力される最終的な符号量より多くの符号量を発生させるのが一般的である。量子化幅を小さくすることで画像の高周波成分まで細かく情報を検出して、その画像の特性を検出する必要があるからである。その発生符号量と量子化幅の示すほぼ反比例の関係を用いて、第２のパスで目標とするピクチャごとの符号量を算出し、その値に制御するようにしている。 Then, in the second pass encoding, the code amount is distributed so as to be as close as possible to the distribution, so that the image quality of all video sequences can be made uniform.
In this case, in general, encoding is performed with a smaller quantization width in the first pass, and a larger code amount than the final code amount output in the second pass is generally generated. . This is because by reducing the quantization width, it is necessary to detect fine information up to the high frequency component of the image and to detect the characteristics of the image. Using the almost inversely proportional relationship between the generated code amount and the quantization width, the code amount for each target picture is calculated in the second pass and controlled to that value.

特開平１１-２７５５７７号公報の２パス型可変ビットレート映像信号符号化装置は、上記従来例がピクチャ単位で符号量の見積もりを行っており、ＭＢ単位の符号量変動を再現できないという課題があるのを解決するためのものである。第１のパスの符号化においてＭＢ単位の発生符号量を情報として記憶回路に記録し、第２のパスの符号化において、ＭＢ単位で符号量制御を行うことで精度を向上させている。 The two-pass variable bit rate video signal encoding apparatus disclosed in Japanese Patent Application Laid-Open No. 11-275577 has a problem that the above-described conventional example estimates the code amount in units of pictures and cannot reproduce code amount fluctuations in units of MB. It is for solving the problem. In the first pass encoding, the generated code amount in MB units is recorded in the storage circuit as information, and in the second pass encoding, the code amount control is performed in MB units to improve accuracy.

一方、２パス型可変ビットレート映像信号符号化装置に近い符号化効率を得るための１パス型可変ビットレート映像信号符号化装置が知られている（例えば、特開平１０−６６０６７号公報、特開２００２−１９９３９８号公報）。この映像信号符号化装置では、２パス型可変ビットレート映像信号符号化装置が、第１のパスの符号化で得ている発生符号量の代わりに、符号化の前段において映像信号が持つ特徴情報から符号化難易度を算出し、符号化難易度に応じて目標符号量を割り当てることで、可変ビットレート制御を可能としている。このような１パス型可変ビットレート映像信号符号化装置では、ビデオシーケンス全体について画質を均一化することは理論上困難であるが、ビデオシーケンスの画質を周期的或いはシーンの区切り毎に均一にすることが可能である。
特開平７-２８４０９７号公報特開平８-２６５７４７号公報特開平１１-２７５５７７号公報特開平１０−６６０６７号公報特開２００２−１９９３９８号公報 On the other hand, a one-pass variable bit rate video signal encoding device for obtaining encoding efficiency close to that of a two-pass variable bit rate video signal encoding device is known (for example, Japanese Patent Laid-Open No. 10-66067, No. 2002-199398). In this video signal encoding device, the two-pass variable bit rate video signal encoding device uses the characteristic information of the video signal in the previous stage of encoding instead of the generated code amount obtained by the first pass encoding. Thus, it is possible to perform variable bit rate control by calculating the encoding difficulty level from the above and assigning a target code amount according to the encoding difficulty level. In such a one-pass variable bit rate video signal encoding apparatus, it is theoretically difficult to make the image quality uniform for the entire video sequence, but the image quality of the video sequence is made uniform periodically or every scene break. It is possible.
JP-A-7-284097 JP-A-8-265747 Japanese Patent Application Laid-Open No. 11-275577 Japanese Patent Laid-Open No. 10-66067 JP 2002-199398 A

ＭＰＥＧ−２よりも新しい符号化方式としてＭＰＥＧ−４ＡＶＣ（ＩＳＯ／ＩＥＣ１４４９６−１０）がある。ＭＰＥＧ−４ＡＶＣでは、選択可能な符号化パラメータをＭＰＥＧ−２の数百倍用意し、その中から最適なパラメータを選択することで、ＭＰＥＧ−２に対して２倍以上の符号化効率の向上が可能となる。符号化パラメータとは、例えば、ＭＢ構造（フィールド／フレーム）、ＭＢ内ブロックサイズ、ＭＢ予測方式、動きベクトルなどである。 There is MPEG-4AVC (ISO / IEC14496-10) as a newer encoding method than MPEG-2. In MPEG-4AVC, selectable encoding parameters are prepared several hundred times that of MPEG-2, and by selecting the optimum parameters from among them, the encoding efficiency can be improved more than twice that of MPEG-2. It becomes possible. The encoding parameter is, for example, an MB structure (field / frame), an intra-MB block size, an MB prediction method, a motion vector, or the like.

そのため、ＭＰＥＧ−４ＡＶＣでは、多数の符号化パラメータ候補の中から最適な符号化パラメータを選択できるか否かが符号化効率向上のために重要である。注意しなければならないことは、最適な符号化パラメータはビットレート（主に量子化幅で制御される）によって変動することである。例えば、一般的には高ビットレートであれば、フレーム内符号化で用いるマクロブック（以下イントラＭＢと呼ぶ）の選択率を上げてもよいが、低ビットレートではイントラＭＢの選択率を下げる必要がある。また、ＭＰＥＧ−４ＡＶＣでは隣接ＭＢ間の符号化パラメータの相関性を利用して圧縮効率を向上させているため、最適な符号化パラメータは隣接するＭＢによっても変化する。 Therefore, in MPEG-4 AVC, whether or not an optimal encoding parameter can be selected from a large number of encoding parameter candidates is important for improving the encoding efficiency. It should be noted that the optimal encoding parameter varies with the bit rate (mainly controlled by the quantization width). For example, in general, if the bit rate is high, the selection rate of the macrobook (hereinafter referred to as intra MB) used for intra-frame coding may be increased, but the selection rate of intra MB needs to be decreased at a low bit rate. There is. In MPEG-4 AVC, since the compression efficiency is improved by utilizing the correlation of coding parameters between adjacent MBs, the optimum coding parameter varies depending on the adjacent MB.

ここで、上記の従来の２パス型可変ビットレート映像信号符号化装置に、ＭＰＥＧ−４ＡＶＣを適用した場合について考える。
従来の２パス型可変ビットレート映像信号符号化装置では、符号化パラメータの選択に関する符号量（主に量子化幅に依存する）について考慮されておらず、第１のパスと第２のパスとで異なる量子化幅を用いている。第1のパスと第２のパスの量子化幅が近いピクチャやＭＢであれば、あまり問題とはならないが、第1のパスと第２のパスとの量子化幅が大きく異なるようなピクチャやＭＢでは、第１のパスで見積もった符号量が第２のパスで発生する実際の符号量と大きく異なったものとなり、第１のパスと第２のパスとで最適な符号化パラメータが変化する。このように、従来例では、符号化パラメータの量子化幅による変動を考慮していないために、第１のパスにおける見積もり符号量が有効に作用せず、ＭＰＥＧ−４ＡＶＣのように多数の符号化パラメータを有する符号化方式においては、見積もり符号量の誤差が大きくなってしまう。一般的に、平均的な映像では第１のパスと第２のパスの量子化幅が近くなるが、難しい映像や簡単な映像では第1のパスと第２のパスの量子化幅が大きく異なる。可変ビットレート技術は難しい映像や簡単な映像に対する符号化効率を向上させることで、映像全体の画質を均一化されるための技術であり、本来の可変ビットレートの特徴を活かしきれないという課題がある。 Here, consider a case where MPEG-4 AVC is applied to the above-described conventional two-pass variable bit rate video signal encoding apparatus.
In the conventional two-pass variable bit rate video signal encoding apparatus, the code amount (mainly depending on the quantization width) related to the selection of the encoding parameter is not considered, and the first pass, the second pass, Different quantization widths are used. If the first pass and the second pass have similar quantization widths of pictures and MBs, this is not a problem, but the pictures and MBs whose quantization widths of the first pass and the second pass are significantly different In the MB, the code amount estimated in the first pass is significantly different from the actual code amount generated in the second pass, and the optimal encoding parameter changes between the first pass and the second pass. . As described above, in the conventional example, since the variation due to the quantization width of the encoding parameter is not considered, the estimated code amount in the first pass does not work effectively, and a large number of encodings are performed as in MPEG-4 AVC. In an encoding method having parameters, an error in the estimated code amount becomes large. In general, the quantization widths of the first pass and the second pass are close to each other in an average image, but the quantization widths of the first pass and the second pass are greatly different in difficult or simple images. . The variable bit rate technology is a technology for making the image quality of the entire video uniform by improving the coding efficiency for difficult video and simple video, and there is a problem that the characteristics of the original variable bit rate cannot be fully utilized. is there.

また、従来の２パス型可変ビットレート映像信号符号化装置には、隣接ＭＢ間の相関性を考慮した符号化についてはなんら記述されていない。
このように従来の２パス型可変ビットレート映像信号符号化装置にＭＰＥＧ−４ＡＶＣのように多数の符号化パラメータを有する符号化方式を適用した場合、難しい映像や簡単な映像が混在しているような動画像に対して、第１のパスの符号化で見積もった符号量と、第２のパスの符号化で実際に発生する符号量との誤差が大きくなり、最適な符号量制御が行われないために、符号化した動画像の映像画質を均一化できない問題が起きる。 Further, the conventional two-pass variable bit rate video signal encoding device does not describe any encoding considering the correlation between adjacent MBs.
As described above, when an encoding method having a large number of encoding parameters such as MPEG-4AVC is applied to the conventional two-pass variable bit rate video signal encoding apparatus, it seems that difficult video and simple video are mixed. Therefore, an error between the code amount estimated by the first pass encoding and the code amount actually generated by the second pass encoding becomes large, and optimal code amount control is performed. Therefore, there is a problem that the image quality of the encoded moving image cannot be made uniform.

また、ＭＰＥＧ−４ＡＶＣでは片側予測において複数の参照画像を持つことができるため、ＭＰＥＧ−２と比較して動きベクトル検出に要する処理量が大幅に大きくなる。多数の符号化パラメータの中から最適なモードを選択する処理には莫大な処理量を要するのが一般的であり、高画質と処理量削減を両立するのが困難である課題もある。例えば、ＭＰＥＧ−４ＡＶＣの参照ソフトウェア（ＩＳＯ／ＩＥＣ１４４９６−５）で紹介されているレート歪最適化方法では、全てのモードについて符号化と復号化を行ってから評価をしているため非常に処理量が大きくなってしまう。 In addition, since MPEG-4 AVC can have a plurality of reference images in one-sided prediction, the processing amount required for motion vector detection is significantly increased compared to MPEG-2. Processing for selecting an optimal mode from a large number of encoding parameters generally requires a huge amount of processing, and there is a problem that it is difficult to achieve both high image quality and reduction in processing amount. For example, in the rate distortion optimization method introduced in the MPEG-4 AVC reference software (ISO / IEC14496-5), since all modes are evaluated after encoding and decoding, the amount of processing is extremely high. Will become bigger.

従来の２パス型可変ビットレート映像信号符号化装置のように、処理量の莫大な符号化パラメータの選択を第1のパスと第２のパスとの両方で行うことは処理量の点から非効率であるといえる。 As in the conventional two-pass variable bit rate video signal encoding apparatus, selection of encoding parameters with a large amount of processing in both the first pass and the second pass is not from the viewpoint of processing amount. It can be said that it is efficient.

一方、従来の１パス型可変ビットレート映像信号符号化装置では、動画像が持つ特徴から符号化難易度を算出しているため、第１のパスと第２のパス間で符号量の見積もりの誤差による画質劣化は起きない。しかし、数百の符号化パラメータの中から最適な符号化パラメータを選択する精度に関して課題がある。また、ビデオシーケンス全体について画質を均一化することにも課題がある。更に、第１のパスと第２のパスとの間で処理の削減についての工夫がなされていない。 On the other hand, in the conventional one-pass variable bit rate video signal encoding device, since the encoding difficulty level is calculated from the characteristics of the moving image, the code amount is estimated between the first pass and the second pass. Image quality degradation due to errors does not occur. However, there is a problem regarding the accuracy of selecting an optimal encoding parameter from among hundreds of encoding parameters. There is also a problem in making the image quality uniform for the entire video sequence. Furthermore, no contrivance has been made for reducing processing between the first pass and the second pass.

映像符号化より高度な符号化を行なうには処理量が多く必要となるため、従来の映像信号符号化装置は、専用ハードウェアによって実現されていた。ところが、最近はＣＰＵの高速化とマルチコア化が進み、ソフトウェアを用いた分散処理による映像信号符号化装置の実現が可能になりつつある。 Since a higher processing amount is required to perform higher-level encoding than video encoding, the conventional video signal encoding apparatus has been realized by dedicated hardware. However, recently, CPUs have been increased in speed and multi-core, and it has become possible to realize a video signal encoding device by distributed processing using software.

本願発明は、上記のような課題を鑑みてなされたものである。
量子化幅の影響が小さな符号化パラメータや隣接ＭＢとの相関性の低い符号化パラメータについては第１のパスの符号化時に決定し、量子化幅の影響が大きな符号化パラメータや隣接ＭＢとの相関性の高い符号化パラメータについては第２のパスの符号化時に決定することで、符号化パラメータの選択における量子化幅の依存性に関する課題を解決し、符号化後の動画像の映像画質の均一化を実現することが可能な２パス型可変ビットレート映像信号符号化装置を提供することを目的とする。 The present invention has been made in view of the above problems.
An encoding parameter having a small influence on the quantization width or an encoding parameter having a low correlation with the adjacent MB is determined at the time of encoding in the first pass, and the encoding parameter having a large influence on the quantization width or an adjacent MB is determined. An encoding parameter with high correlation is determined at the time of encoding in the second pass, thereby solving the problem relating to the dependency of the quantization width in the selection of the encoding parameter, and improving the video quality of the encoded moving image. An object of the present invention is to provide a two-pass variable bit rate video signal encoding device capable of realizing equalization.

また、第１のパスと第２のパスとで符号化に要する処理量を分散することで、従来の２パス型可変ビットレート映像信号符号化装置に比べて大幅に処理量の削減が可能な２パス型可変ビットレート映像信号符号化装置を提供することを目的とする。 Also, by distributing the processing amount required for encoding in the first pass and the second pass, the processing amount can be significantly reduced as compared with the conventional two-pass variable bit rate video signal encoding device. An object of the present invention is to provide a two-pass variable bit rate video signal encoding device.

そこで上記課題を解決するために本発明は、下記の装置を提供するものである。
（１）入力動画像を符号化する際に、シーンの切り換わり位置を示すシーンチェンジ情報、シーンがフェードイン及びフェードアウトする位置を示すフェード情報、画像間の動きの複雑さを示すアクティビティ情報のうち少なくともいずれかひとつを含む動画像の特徴情報を抽出する抽出部と、
前記動画像の特徴情報を基にＧＯＰ構造を決定する第１の決定部と、
前記動画像の特徴情報と前記ＧＯＰ構造とを基に第１の符号化パラメータを決定する第２の決定部と、
前記動画像の特徴情報と前記ＧＯＰ構造と前記第１の符号化パラメータとを用いてピクチャ毎の符号化複雑度を算出する算出部と、
前記第１の符号化パラメータと前記符号化複雑度とを記憶する記憶部と、
を有する第１の符号化手段と、
前記第１の符号化手段で符号化した入力動画像を再度符号化する際に、前記第１の符号化手段が前記入力動画像を符号化した際に前記記憶部に記憶した前記第１の符号化パラメータと前記符号化複雑度とを用いて、第２の符号化パラメータを決定する第３の決定部と、
前記第２の符号化パラメータを用いて、前記入力動画像を符号化する第２の符号化手段と、
を有することを特徴とする動画像符号化装置。
（２）前記第１の符号化パラメータは、画像毎の符号化タイプとピクチャ構造とであることを特徴とする上記（１）記載の動画像符号化装置。
（３）前記第１の符号化パラメータは、更に、ＭＢ構造と動きベクトルの候補値とを用いることを特徴とする上記（２）記載の動画像符号化装置。 Therefore, in order to solve the above problems, the present invention provides the following apparatus.
(1) When encoding an input moving image, among scene change information indicating a scene switching position, fade information indicating a position where a scene fades in and out, and activity information indicating complexity of movement between images An extraction unit for extracting feature information of a moving image including at least one of them,
A first determination unit that determines a GOP structure based on feature information of the moving image;
A second determination unit that determines a first encoding parameter based on the feature information of the moving image and the GOP structure;
A calculation unit that calculates coding complexity for each picture using the feature information of the moving image, the GOP structure, and the first coding parameter;
A storage unit for storing the first encoding parameter and the encoding complexity;
First encoding means comprising:
When the input moving image encoded by the first encoding unit is encoded again, the first encoding unit stores the first moving image stored in the storage unit when the first encoding unit encodes the input moving image. A third determination unit for determining a second encoding parameter using the encoding parameter and the encoding complexity;
Second encoding means for encoding the input moving image using the second encoding parameter;
A moving picture encoding apparatus comprising:
(2) The moving picture coding apparatus according to (1), wherein the first coding parameters are a coding type and a picture structure for each picture.
(3) The moving picture coding apparatus according to (2), wherein an MB structure and a motion vector candidate value are further used as the first coding parameter.

本願発明によれば、以下の効果を得ることが可能である。
第１のパスの符号化では、動画像の特徴情報から符号化複雑度を算出して、量子化幅の影響の小さな符号化パラメータや隣接ＭＢとの相関性の低い符号化パラメータを決定する。そして、第２のパスの符号化では、量子化幅の影響の大きな符号化パラメータや隣接ＭＢとの相関性の高い符号化パラメータを決定して符号化する。これにより、符号化パラメータの選択に関する量子化幅への依存性を排除することができるので、符号化後の動画像全体で均一な映像画質を実現することが可能となる。 According to the present invention, the following effects can be obtained.
In the first pass encoding, the encoding complexity is calculated from the feature information of the moving image, and the encoding parameter having a small influence of the quantization width and the encoding parameter having a low correlation with the adjacent MB are determined. In the second pass encoding, encoding parameters having a large influence on the quantization width and encoding parameters having high correlation with adjacent MBs are determined and encoded. As a result, it is possible to eliminate the dependency on the quantization width related to the selection of the encoding parameter, so that it is possible to realize a uniform image quality in the entire moving image after encoding.

また、最適な符号化パラメータを選択するのに要する処理を、第１のパスの符号化と第２のパスの符号化とで分散して行なうことで、従来の２パス型可変ビットレート映像信号符号化装置に比べて大幅に符号化の処理量の削減が可能となる。 In addition, the processing required to select the optimal encoding parameter is performed in a distributed manner between the first pass encoding and the second pass encoding, so that a conventional two-pass variable bit rate video signal is obtained. Compared to the encoding device, the amount of encoding processing can be greatly reduced.

以下、図面を参照しながら本発明の実施の形態を説明する。
図１は、本発明の実施の形態の構成を示す全体ブロック図である。また、図８は、この図１の構成における全体の処理の流れを説明するフローチャートである。 Hereinafter, embodiments of the present invention will be described with reference to the drawings.
FIG. 1 is an overall block diagram showing a configuration of an embodiment of the present invention. FIG. 8 is a flowchart for explaining the overall processing flow in the configuration of FIG.

第１符号化部２０は、ＶＴＲ１０などの動画像再生装置から入力される動画像信号を、符号化するとともに、この動画像信号における動画像の特徴を解析し、各種の符号化パラメータを生成して記録媒体３０に記録する（ステップＳ１０）。 The first encoding unit 20 encodes a moving image signal input from a moving image reproducing device such as the VTR 10 and analyzes the characteristics of the moving image in the moving image signal to generate various encoding parameters. To be recorded on the recording medium 30 (step S10).

次に、第２符号化部４０は、ＶＴＲ１０などの動画像再生装置から入力される、上記した第１符号化部２０で符号化したものと同じ動画像信号を、記録媒体３０に記録されているこの動画像信号に対応する符号化パラメータを読み出し、この符号化パラメータを用いて符号化を行って、得られたビットストリームを蓄積媒体５０に記録する（ステップＳ２０）。
［第１実施例］
次に、上記した実施の形態を用いた第１実施例について図面を用いて詳細に説明する。
＜第１パス＞
図２は、第１実施例における第１のパスの符号化を実現するブロックの内部構成を示す図である。また、図９はこの第１のパスの符号化の処理の流れを示すフローチャートである。 Next, the second encoding unit 40 records the same moving image signal input from the moving image reproducing device such as the VTR 10 as the one encoded by the first encoding unit 20 on the recording medium 30. The encoding parameter corresponding to the moving image signal is read out, encoding is performed using the encoding parameter, and the obtained bit stream is recorded in the storage medium 50 (step S20).
[First embodiment]
Next, a first example using the above-described embodiment will be described in detail with reference to the drawings.
<First pass>
FIG. 2 is a diagram illustrating an internal configuration of a block that realizes encoding of the first pass in the first embodiment. FIG. 9 is a flowchart showing the flow of the first pass encoding process.

入力画像蓄積部１００は、入力される動画像信号（入力画像信号）を一時的に蓄積する（ステップＳ１００）。
ここで、入力画像蓄積部１００が蓄積する画像の枚数をＮＦとする。ＮＦは、取り得る最大ＧＯＰサイズＮmaxであるとする。 The input image storage unit 100 temporarily stores the input moving image signal (input image signal) (step S100).
Here, the number of images stored in the input image storage unit 100 is NF. Let NF be the maximum possible GOP size Nmax.

入力画像蓄積部１００は、記録部６００から次ＧＯＰ処理開始信号を受け取ると、ＧＯＰ構造決定部３００から得たＮ枚の画像を消去し、新たにＮ枚の画像を蓄積する。入力画像蓄積部１００が蓄積する画像の枚数ＮＦは、動画像特徴抽出部２００の構成によっては、２枚とすることも可能である。これらの処理の詳細は後述する。 When receiving the next GOP processing start signal from the recording unit 600, the input image storage unit 100 deletes the N images obtained from the GOP structure determination unit 300, and newly stores N images. Depending on the configuration of the moving image feature extraction unit 200, the number of images NF stored in the input image storage unit 100 may be two. Details of these processes will be described later.

動画像特徴抽出部２００は、入力画像蓄積部１００から供給される動画像信号における動画像の特徴を示す情報を抽出する（ステップＳ２００）。
図４は、この動画像特徴抽出部２００の具体的な構成例を示す図である。以下、この図４を用いて動画像特徴抽出部２００の動作を説明する。 The moving image feature extraction unit 200 extracts information indicating the feature of the moving image in the moving image signal supplied from the input image storage unit 100 (step S200).
FIG. 4 is a diagram illustrating a specific configuration example of the moving image feature extraction unit 200. Hereinafter, the operation of the moving image feature extraction unit 200 will be described with reference to FIG.

動画像特徴抽出部２００は、動画像のシーンの変化点を検出するシーンチェンジ検出部２１０と、動画像の信号レベルが徐々に変化するシーンを検出するフェード検出部２２０と、動画像を構成する各画像間の動きの複雑さを算出する動き複雑度算出部２３０と、各画像のアクティビティを算出するアクティビティ算出部２４０とで構成する。 The moving image feature extraction unit 200 configures a moving image with a scene change detection unit 210 that detects a scene change point of the moving image, a fade detection unit 220 that detects a scene where the signal level of the moving image gradually changes, and the like. It comprises a motion complexity calculator 230 that calculates the complexity of motion between images, and an activity calculator 240 that calculates the activity of each image.

シーンチェンジ検出部２１０は、入力画像蓄積部１００に蓄積されている動画像信号における画像の切り替わり目であるシーンチェンジを検出する。各隣接画像の画素間における輝度情報の絶対差分和や、二乗平均誤差和の微分値などを算出し、この算出した値の大きな画像間をシーンチェンジとして検出し、シーンチェンジ情報ａを１とする。シーンチェンジを検出しない時はシーンチェンジ情報ａを０とする。 The scene change detection unit 210 detects a scene change that is an image switching point in the moving image signal stored in the input image storage unit 100. The absolute difference sum of luminance information between the pixels of each adjacent image, the differential value of the mean square error sum, and the like are calculated. Between the images having the large calculated values is detected as a scene change, and the scene change information a is set to 1. . When no scene change is detected, the scene change information a is set to 0.

なお、シーンチェンジの検出は、輝度情報のみでなく、輝度情報に加えて色差情報を用いて行なっても良い。また各画像を変換、縮小等の処理をした後にシーンチェンジを検出したり、各画像内の特定の一部の領域のみを抽出して、この領域内の情報を用いてシーンチェンジを検出したりしても良い。 The scene change may be detected using not only the luminance information but also the color difference information in addition to the luminance information. Also, scene changes can be detected after each image has been converted, reduced, etc., or only a specific area within each image can be extracted, and scene changes can be detected using information in these areas. You may do it.

フェード検出部２２０は、入力画像蓄積部１００に蓄積されている動画像信号における画像全体の信号レベルが一定方向に変動するシーンを検出する。各隣接画像の画素間における輝度情報の絶対差分和や、二乗平均誤差和などを算出し、この算出した値がある一定以上であるシーンをフェードとして検出し、フェード情報ｂを１とする。フェードを検出しない時はフェード情報ｂを０とする。 The fade detection unit 220 detects a scene in which the signal level of the entire image in the moving image signal stored in the input image storage unit 100 varies in a certain direction. The absolute difference sum of the luminance information between the pixels of each adjacent image, the root mean square error sum, and the like are calculated. A scene where the calculated value is a certain value or more is detected as a fade, and the fade information b is set to 1. When no fade is detected, the fade information b is set to 0.

なお、フェードの検出は、輝度情報のみでなく、輝度情報に加えて色差情報を用いて行なっても良い。また各画像を変換、縮小等の処理をした後にフェードを検出したり、各画像内の特定の一部の領域のみを抽出して、この領域内の情報を用いてフェードを検出したりしても良い。 Note that the fade may be detected using not only the luminance information but also the color difference information in addition to the luminance information. Also, each image can be converted, reduced, etc., to detect fades, or only a specific part of each image can be extracted and fades can be detected using information in these regions. Also good.

動き複雑度算出部２３０は、入力画像蓄積部１００に蓄積されている動画像信号における各画像間の動き、すなわち動き複雑度ｃを算出する。この動き複雑度ｃの算出には、時間的に隣接する２枚の画像の間、またはあらかじめ指定された時間差を持つ２枚の画像の間で行なう。本第１実施例での動き複雑度ｃの算出は、まず算出対象の２枚の画像のうち、一方の画像を１６×１６画素の小ブロック単位（ＭＢと呼ぶ）に分割し、この各ＭＢ内の画素の輝度情報を用いて、ＭＢ毎に他方の画像と比較して動きベクトル探索を行い、動きベクトルＭＶ（ｉ）を算出する（ｉはＭＢ番号）。結果として得られた各ブロックの動きベクトルの分散の度合いを動き複雑度ｃとする。これらの値が大きい場合は、動きが複雑であることを示し、一方これらの値が小さい場合は、動きが簡易であることを示す。 The motion complexity calculation unit 230 calculates the motion between the images in the moving image signal stored in the input image storage unit 100, that is, the motion complexity c. The calculation of the motion complexity c is performed between two images that are temporally adjacent or between two images that have a predetermined time difference. In the calculation of the motion complexity c in the first embodiment, first, one of the two images to be calculated is divided into small blocks of 16 × 16 pixels (referred to as MB), and each MB is divided. Using the luminance information of the pixels in the image, a motion vector search is performed for each MB in comparison with the other image, and a motion vector MV (i) is calculated (i is an MB number). The degree of dispersion of the motion vector of each block obtained as a result is defined as motion complexity c. When these values are large, it indicates that the movement is complicated, while when these values are small, it indicates that the movement is simple.

なお、この動き複雑度ｃの算出には、上記したようにＭＢ毎に算出する動きベクトルを用いるのでは無く、画面全体の動きを示すグローバル動きベクトルの大きさや、この時間的変化量を利用することも可能である。また、小ブロックとして１６×８、８×１６、８×８、８×４、４×８、４ｘ４のように１６×１６のＭＢよりも小さな画素単位で動きベクトルを算出し、この動きベクトルを用いても良い。また、動き複雑度ｃの算出は、輝度情報のみでなく、輝度情報に加えて色差情報を用いて行なっても良い。また各画像を変換、縮小等の処理をした後に動き複雑度ｃを算出したり、各画像内の特定の一部の領域のみを抽出して、この領域内の情報を用いて動き複雑度ｃを算出したりしても良い。 The calculation of the motion complexity c does not use the motion vector calculated for each MB as described above, but uses the size of the global motion vector indicating the motion of the entire screen and the amount of temporal change. It is also possible. In addition, a motion vector is calculated as a small block in units of pixels smaller than 16 × 16 MB, such as 16 × 8, 8 × 16, 8 × 8, 8 × 4, 4 × 8, and 4 × 4. It may be used. Further, the calculation of the motion complexity c may be performed using not only luminance information but also color difference information in addition to luminance information. In addition, the motion complexity c is calculated after processing such as conversion and reduction of each image, or only a specific partial area in each image is extracted, and the motion complexity c is used using information in the area. Or may be calculated.

アクティビティ算出部２４０は、入力画像蓄積部１００に蓄積されている動画像信号における各画像のアクティビティｄと垂直アクティビティｅを算出する。アクティビティｄは下記式１のように、各ＭＢアクティビティの画面内和として求める。ここで、ＮＭＢは画面内に含まれるＭＢ数である。ｄＭＢ（ｉ）はｉ番目のＭＢのＭＢアクティビティであり、下記式２で求められる。式１、式２では絶対差分和を用いているが、二乗誤差和を用いても良い。 The activity calculation unit 240 calculates the activity d and the vertical activity e of each image in the moving image signal stored in the input image storage unit 100. The activity d is obtained as a screen internal sum of each MB activity as shown in the following formula 1. Here, NMB is the number of MBs included in the screen. dMB (i) is the MB activity of the i-th MB and is obtained by the following equation 2. Although the absolute difference sum is used in Equations 1 and 2, a square error sum may be used.

・・・式１

... Formula 1

・・・式２

垂直アクティビティｅは、、、、図６及び下記式３に示すように、各ＭＢ内の垂直方向に隣接する各画素間の絶対差分和として求めたＭＢ垂直アクティビティの画面内和として求める。ここで、ｅＭＢ（ｉ）はｉ番目のＭＢのＭＢ垂直アクティビティであり、下記式４で求められる。式３、式４では絶対差分和を用いているが、二乗誤差和を用いても良い。

... Formula 2

The vertical activity e is obtained as an in-screen sum of MB vertical activities obtained as a sum of absolute differences between pixels adjacent in the vertical direction in each MB, as shown in FIG. Here, eMB (i) is the MB vertical activity of the i-th MB and is obtained by the following equation 4. Although the absolute difference sum is used in Equations 3 and 4, a square error sum may be used.

・・・式３

... Formula 3

・・・式４

アクティビティｄ、垂直アクティビティｅの算出は、輝度情報のみでなく、輝度情報に加えて色差情報を用いて行なっても良い。また各画像を変換、縮小等の処理をした後にアクティビティｄ、垂直アクティビティｅを算出したり、各画像内の特定の一部の領域のみを抽出して、この領域内の情報を用いてアクティビティｄ、垂直アクティビティｅを算出したりしても良い。

... Formula 4

The calculation of the activity d and the vertical activity e may be performed using not only luminance information but also color difference information in addition to luminance information. Further, after converting and reducing each image, the activity d and the vertical activity e are calculated, or only a specific partial area in each image is extracted, and the activity d is used by using information in this area. The vertical activity e may be calculated.

図２及び図９の説明に戻る。
動画像特徴抽出部２００は、上記のようにして検出した各画像のシーンチェンジ情報ａ、フェード情報ｂ、動き複雑度ｃをＧＯＰ構造決定部３００に送る。また、垂直アクティビティｅ、各ＭＢのＭＢ垂直アクティビティｅＭＢをピクチャ構造決定部４００に送る。さらに、動き複雑度ｃ、アクティビティｄを符号化複雑度計測部５００に送る。 Returning to FIG. 2 and FIG.
The moving image feature extraction unit 200 sends the scene change information a, the fade information b, and the motion complexity c of each image detected as described above to the GOP structure determination unit 300. Also, the vertical activity e and the MB vertical activity eMB of each MB are sent to the picture structure determining unit 400. Further, the motion complexity c and activity d are sent to the encoding complexity measuring unit 500.

ＧＯＰ構造決定部３００は、ＧＯＰを構成する各画像の符号化構造を決定する（ステップＳ３００）。
図５は、このＧＯＰ構造決定部３００の具体的な構成例を示す図である。以下、この図５を用いてＧＯＰ構造決定部３００の動作を説明する。 The GOP structure determination unit 300 determines the coding structure of each image constituting the GOP (step S300).
FIG. 5 is a diagram illustrating a specific configuration example of the GOP structure determination unit 300. Hereinafter, the operation of the GOP structure determination unit 300 will be described with reference to FIG.

ＧＯＰ構造決定部３００は、ＧＯＰサイズＮを決定するＧＯＰサイズ決定部３１０と、予測画像の間隔Ｍを決定する予測画像間隔決定部３２０とで構成する。
ＧＯＰサイズ決定部３１０は、シーンチェンジ情報ａとＮmaxからＧＯＰサイズＮを決定する。入力画像蓄積部１００に蓄積されている最も古い画像からシーンチェンジ情報ａが１になるまでか、またはＮmaxになるまでの画像を１つのＧＯＰとする。決定されたＧＯＰに含まれる画像の枚数をＧＯＰサイズＮとして出力する。 The GOP structure determining unit 300 includes a GOP size determining unit 310 that determines the GOP size N and a predicted image interval determining unit 320 that determines the interval M between predicted images.
GOP size determining section 310 determines GOP size N from scene change information a and Nmax. An image from the oldest image stored in the input image storage unit 100 until the scene change information a becomes 1 or Nmax is defined as one GOP. The number of images included in the determined GOP is output as the GOP size N.

予測画像間隔決定部３２０は、ＧＯＰサイズ決定部３１０から得たＧＯＰサイズＮ、および画像特徴抽出部３００から得たＮ枚の画像の動き複雑度ｃから、対象ＧＯＰの符号化時の予測画像間隔Ｍを決定する。動き複雑度ｃが大きい場合には予測画像間隔Ｍを小さい値に設定し、動き複雑度ｃが小さい場合には予測画像間隔Ｍを大きい値に設定する。また、動き複雑度ｃがあらかじめ指定された値よりも小さい場合には、予測画像間隔Ｍを既定値とすることも可能である。なお、フェード情報ｂが１である場合には予測画像間隔Ｍは２とする。 The predicted image interval determination unit 320 uses the GOP size N obtained from the GOP size determination unit 310 and the motion complexity c of the N images obtained from the image feature extraction unit 300 to predict the predicted image interval at the time of encoding the target GOP. Determine M. When the motion complexity c is large, the predicted image interval M is set to a small value, and when the motion complexity c is small, the predicted image interval M is set to a large value. Further, when the motion complexity c is smaller than a value designated in advance, the predicted image interval M can be set as a default value. When the fade information b is 1, the predicted image interval M is 2.

図２及び図９の説明に戻る。
ＧＯＰ構造決定部３００は、上記のようにして決定したＧＯＰサイズＮ、予測画像間隔Ｍをピクチャ構造決定部４００と記憶部６００とに送る。また、ＧＯＰサイズＮを入力像蓄積部１００に送る。 Returning to FIG. 2 and FIG.
The GOP structure determination unit 300 sends the GOP size N and the predicted image interval M determined as described above to the picture structure determination unit 400 and the storage unit 600. Also, the GOP size N is sent to the input image storage unit 100.

ピクチャ構造決定部４００は、図３に示すＭＢ符号化パラメータ決定部１０００が行う符号化パラメータ決定の一部を担当する。本第１実施例では、符号化パラメータの中でも動画像が持つ本来の特性であって、量子化幅への依存度の低いパラメータであるフレーム／フィールド構造の決定を行う。ピクチャ構造決定部４００は、ＧＯＰ構造決定部３００から得たＧＯＰサイズＮと予測画像間隔Ｍとから各画像の符号化タイプＣＴ（Ｉ画像、Ｐ画像、Ｂ画像）を判定する（ステップＳ４１０）。また、画像特徴抽出部３００から得た垂直アクティビティｅから各画像を符号化するためのピクチャ構造情報ＰＳを決定する（ステップＳ４２０）。ピクチャ構造情報ＰＳがフレームである場合には画像を１つのフレームで符号化し、ピクチャ構造情報ＰＳがフィールドである場合には画像を２つのフィールドで符号化する。垂直アクティビティｅが小さい場合にはフレームとして符号化し、垂直アクティビティｅが大きい場合にはフィールドとして符号化する。また、画像特徴抽出部３００から得たＭＢ垂直アクティビティｅＭＢから各ＭＢのＭＢ構造情報ＭＢＳを決定する（ステップＳ４３０）。ＭＢ構造情報ＭＢＳがフレームである場合にはＭＢを１つのフレームで符号化し、ＭＢ構造情報ＭＢＳがフィールドである場合にはＭＢを２つのフィールドで符号化する。上記のようにして決定した符号化タイプＣＴとピクチャ構造情報ＰＳとＭＢ構造情報ＭＢＳとをまとめてピクチャタイプ情報ＰＴとする。そして、このピクチャタイプ情報ＰＴを、符号化複雑度計測部５００、及び記録部６００へ送る。 The picture structure determination unit 400 is responsible for part of the encoding parameter determination performed by the MB encoding parameter determination unit 1000 shown in FIG. In the first embodiment, the frame / field structure, which is an inherent characteristic of a moving image among the encoding parameters and has a low dependency on the quantization width, is determined. The picture structure determination unit 400 determines the coding type CT (I image, P image, B image) of each image from the GOP size N obtained from the GOP structure determination unit 300 and the predicted image interval M (step S410). Also, picture structure information PS for encoding each image is determined from the vertical activity e obtained from the image feature extraction unit 300 (step S420). When the picture structure information PS is a frame, the image is encoded with one frame, and when the picture structure information PS is a field, the image is encoded with two fields. When the vertical activity e is small, it is encoded as a frame, and when the vertical activity e is large, it is encoded as a field. Also, the MB structure information MBS of each MB is determined from the MB vertical activity eMB obtained from the image feature extraction unit 300 (step S430). When the MB structure information MBS is a frame, the MB is encoded with one frame, and when the MB structure information MBS is a field, the MB is encoded with two fields. The coding type CT, the picture structure information PS, and the MB structure information MBS determined as described above are collectively used as picture type information PT. Then, this picture type information PT is sent to the encoding complexity measuring unit 500 and the recording unit 600.

符号化複雑度計測部５００は、画像特徴抽出部３００から得たＮ枚の画像の動き複雑度ｃとアクティビティｄ、およびピクチャ構造決定部４００から得たピクチャタイプ情報ＰＴから、下記式５の関数ｆによって、ピクチャ毎に符号化複雑度Ｃを算出する（ステップＳ５００）。下記式５の関数ｆは例えば、図７のような特性を持つとする。符号化画像がＩピクチャの時は、同図（Ａ）に示すように、符号化複雑度Ｃはアクティビティｄに比例する。また、符号化画像がＰピクチャ及びＢピクチャの時は、同図（Ｂ）に示すように、符号化複雑度Ｃは動き複雑度ｃに比例する。なお、関数ｆは、Ｐピクチャ及びＢピクチャの場合は、動き複雑度ｃのみではなくアクティビティｄを考慮することもできる。 The encoding complexity measurement unit 500 calculates the function of Equation 5 below from the motion complexity c and activity d of N images obtained from the image feature extraction unit 300 and the picture type information PT obtained from the picture structure determination unit 400. Based on f, the encoding complexity C is calculated for each picture (step S500). It is assumed that the function f of the following formula 5 has characteristics as shown in FIG. When the encoded image is an I picture, the encoding complexity C is proportional to the activity d as shown in FIG. When the encoded image is a P picture and a B picture, the encoding complexity C is proportional to the motion complexity c as shown in FIG. In the case of the P picture and the B picture, the function f can consider not only the motion complexity c but also the activity d.

・・・式５

符号化複雑度計測部５００は、上記式５で算出した符号化複雑度Ｃを記録部６００へ送る。

... Formula 5

The encoding complexity measuring unit 500 sends the encoding complexity C calculated by the above equation 5 to the recording unit 600.

記録部６００は、ＧＯＰ構造決定部３００から得たＧＯＰサイズＮとピクチャ構造決定部４００から得たピクチャタイプ情報ＰＴと符号化複雑度計測部５００から得た符号化複雑度Ｃとを記録媒体７００に記録する（ステップＳ６００）。そして、次ＧＯＰ処理開始信号を入力画像蓄積部１００に送る。
＜第２パス＞
図３は、第１実施例における第２のパスの符号化を実現するブロックの内部構成を示す図である。また、図１０はこの第２のパスの符号化の処理の流れを示すフローチャートである。 The recording unit 600 records the GOP size N obtained from the GOP structure determining unit 300, the picture type information PT obtained from the picture structure determining unit 400, and the coding complexity C obtained from the coding complexity measuring unit 500. (Step S600). Then, a next GOP processing start signal is sent to the input image storage unit 100.
<Second pass>
FIG. 3 is a diagram illustrating an internal configuration of a block that realizes encoding of the second pass in the first embodiment. FIG. 10 is a flowchart showing the flow of the second pass encoding process.

ＧＯＰ符号量割当部８００は、記録媒体７００から得たＧＯＰサイズＮとＧＯＰ分の符号化複雑度からＧＯＰの目標符号量ＴＧＢを求める（ステップＳ７００）。
符号量制御部９００は、ＧＯＰ符号量割当部８００から得た目標符号量ＴＧＢと符号部１１００から出力されたビットストリームの符号量Ｂから、例えば、ＭＰＥＧの代表的な圧縮アルゴリズムであるＴＭ５（Test Model 5; ISO/IEC JTC/SC29 1993）において用いられている符号量制御方式によって符号量制御を行い、ピクチャ毎の目標符号量ＴＢを決定する（ステップＳ８００）。そして、目標符号量ＴＢを符号部１１００に送る。 The GOP code amount allocating unit 800 obtains the GOP target code amount TGB from the GOP size N obtained from the recording medium 700 and the GOP encoding complexity (step S700).
The code amount control unit 900 uses the target code amount TGB obtained from the GOP code amount allocating unit 800 and the bit amount code amount B output from the encoding unit 1100, for example, TM5 (Test Code amount control is performed by a code amount control method used in Model 5; ISO / IEC JTC / SC29 1993), and a target code amount TB for each picture is determined (step S800). Then, the target code amount TB is sent to the encoding unit 1100.

ＭＢ符号化パラメータ決定部１０００は、ＭＢ構造情報ＭＢＳ以外のＭＢの符号化に必要なＭＢ符号化パラメータを決定し、符号部１１００に送る。
符号部１１００は、符号量制御部９００から得た目標符号量ＴＢと、記録媒体７００から得たピクチャタイプ情報ＰＴと、ＭＢ符号化パラメータ決定部１０００から送られたＭＢ符号化パラメータとにしたがって、前述の第１のパスで符号化したものと同じ内容の入力動画像信号（入力画像信号）を符号化する（ステップＳ９００）。そして得られた符号化データ（ビットストリーム）を外部へ出力する。
［第２実施例］
次に、前述した実施の形態を用いた第２実施例について図面を用いて詳細に説明する。
＜第１パス＞
図１１は、第２実施例における第１のパスの符号化を実現するブロックの内部構成を示す図である。 The MB encoding parameter determination unit 1000 determines MB encoding parameters necessary for encoding MBs other than the MB structure information MBS, and sends the MB encoding parameters to the encoding unit 1100.
The encoding unit 1100 follows the target code amount TB obtained from the code amount control unit 900, the picture type information PT obtained from the recording medium 700, and the MB coding parameter sent from the MB coding parameter determination unit 1000. An input moving image signal (input image signal) having the same content as that encoded in the first pass is encoded (step S900). The obtained encoded data (bit stream) is output to the outside.
[Second Embodiment]
Next, a second example using the above-described embodiment will be described in detail with reference to the drawings.
<First pass>
FIG. 11 is a diagram illustrating an internal configuration of a block that realizes the first pass encoding in the second embodiment.

入力画像蓄積部１００、ＧＯＰ構造決定部３００、ピクチャ構造決定部４００、及び符号化複雑度計測部５００、記録媒体７００は、それぞれ第１実施例と同じ機能を有しており、図２と同じ符号を付している。 The input image storage unit 100, the GOP structure determination unit 300, the picture structure determination unit 400, the encoding complexity measurement unit 500, and the recording medium 700 have the same functions as those in the first embodiment, and are the same as those in FIG. The code is attached.

動画像特徴抽出部２０１は、第１実施例の機能に加えて、図４に示す動き複雑度算出部２３０で求めた動きベクトル情報ｇを記録部６００に送る。
記録部６０１は、第１実施例の機能に加えて、動画像特徴抽出部２０１から得た動きベクトル情報ｇを記録媒体７００に記録する。
＜第２パス＞
図１２は、第２実施例における第２のパスの符号化を実現するブロックの内部構成を示す図である。また、図１０はこの第２のパスの符号化の処理の流れを示すフローチャートである。 In addition to the functions of the first embodiment, the moving image feature extraction unit 201 sends the motion vector information g obtained by the motion complexity calculation unit 230 shown in FIG. 4 to the recording unit 600.
The recording unit 601 records the motion vector information g obtained from the moving image feature extraction unit 201 on the recording medium 700 in addition to the functions of the first embodiment.
<Second pass>
FIG. 12 is a diagram illustrating an internal configuration of a block that realizes encoding of the second pass in the second embodiment. FIG. 10 is a flowchart showing the flow of the second pass encoding process.

記録媒体７００、ＧＯＰ符号量割当部８００、及び符号量制御部９００は、それぞれ第１実施例と同じ機能を有しており、図３と同じ符号を付している。
ＭＢ符号化パラメータ決定部１００１は、記録媒体７００から得た動きベクトル情報ｇを、動きベクトルの１つの候補として利用して、符号化する動きベクトル情報を求め、更にＭＢ構造情報ＭＢＳ以外のＭＢの符号化に必要なＭＢ符号化パラメータを決定し、符号部１１０１に送る。 The recording medium 700, the GOP code amount assigning unit 800, and the code amount control unit 900 have the same functions as those in the first embodiment, and are given the same reference numerals as those in FIG.
The MB encoding parameter determination unit 1001 obtains motion vector information to be encoded by using the motion vector information g obtained from the recording medium 700 as one motion vector candidate, and further determines MBs other than the MB structure information MBS. MB encoding parameters necessary for encoding are determined and sent to the encoding unit 1101.

符号部１１０１は、符号量制御部９００から得た目標符号量ＴＢと記録媒体７００から得たピクチャタイプ情報ＰＴとＭＢ符号化パラメータ決定部１００１から送られたＭＢ符号化パラメータとにしたがって、前述の第１のパスで符号化したものと同じ内容の入力動画像信号（入力画像信号）を符号化する。そして得られた符号化データ（ビットストリーム）を外部へ出力する。 The encoding unit 1101 is based on the target code amount TB obtained from the code amount control unit 900, the picture type information PT obtained from the recording medium 700, and the MB coding parameter sent from the MB coding parameter determination unit 1001. An input moving image signal (input image signal) having the same content as that encoded in the first pass is encoded. The obtained encoded data (bit stream) is output to the outside.

この第２実施例のように、第１のパスの符号化で動きベクトルの１つの候補を求めておき、第２のパスの符号化時にその動きベクトルを利用することで、符号化処理に置いて大きなウエートを占める動きベクトル検出の処理を行う符号部１１０１の処理量を大幅に削減することが可能となる。 As in the second embodiment, one candidate for a motion vector is obtained by encoding the first pass, and the motion vector is used at the time of encoding the second pass. Therefore, it is possible to greatly reduce the processing amount of the encoding unit 1101 that performs the motion vector detection process that occupies a large weight.

なお、本発明は上記した動画像符号化装置の機能をコンピュータに実現させるためのプログラムも含むものである。これらのプログラムは、記録媒体から読み取られてコンピュータに取り込まれてもよいし、通信ネットワークを介して伝送されてコンピュータに取り込まれてもよい。 The present invention also includes a program for causing a computer to realize the functions of the above-described moving picture coding apparatus. These programs may be read from a recording medium and loaded into a computer, or may be transmitted via a communication network and loaded into a computer.

本発明の実施の形態を示す全体ブロック図である。It is a whole block diagram which shows embodiment of this invention. 本発明の第１実施例における第１符号部を示す構成図である。It is a block diagram which shows the 1st code | symbol part in 1st Example of this invention. 本発明の第１実施例における第２符号部を示す構成図である。It is a block diagram which shows the 2nd code | symbol part in 1st Example of this invention. 本発明の実施例における動画像特徴抽出部を示す構成図である。It is a block diagram which shows the moving image feature extraction part in the Example of this invention. 本発明の実施例におけるＧＯＰサイズ決定部を示す構成図である。It is a block diagram which shows the GOP size determination part in the Example of this invention. 本発明の実施例における垂直アクティビティを説明する図である。It is a figure explaining the vertical activity in the Example of this invention. 本発明の実施例における複雑度の算出方法を説明する図である。It is a figure explaining the calculation method of the complexity in the Example of this invention. 本発明の実施の形態における全体処理を示すフローチャートである。It is a flowchart which shows the whole process in embodiment of this invention. 本発明の実施例における第１符号部を示すフローチャートである。It is a flowchart which shows the 1st code | symbol part in the Example of this invention. 本発明の実施例における第２符号部を示すフローチャートである。It is a flowchart which shows the 2nd code | symbol part in the Example of this invention. 本発明の第２実施例における第１符号部を示す構成図である。It is a block diagram which shows the 1st code | symbol part in 2nd Example of this invention. 本発明の第２実施例における第２符号部を示す構成図である。It is a block diagram which shows the 2nd code | symbol part in 2nd Example of this invention.

Explanation of symbols

１０ＶＴＲ
２０第１符号部
３０記録媒体
４０第２符号部
５０記録媒体
１００入力画像蓄積部
２００動画像特徴抽出部
３００ＧＯＰ構造決定部
３１０ＧＯＰサイズ決定部
３２０予測画像間隔決定部
４００ピクチャ構造決定部
５００符号化複雑度計測部
６００記録部
７００記録媒体
８００ＧＯＰ符号量割当部
９００符号量制御部
１０００ＭＢ符号化パラメータ決定部
１１００符号部
２０１動画像特徴抽出部（第２実施例）
６０１記録部（第２実施例）
１００１ＭＢ符号化パラメータ決定部（第２実施例）
１１０１符号部（第２実施例） 10 VTR
20 first encoding unit 30 recording medium 40 second encoding unit 50 recording medium 100 input image storage unit 200 moving image feature extraction unit 300 GOP structure determination unit 310 GOP size determination unit 320 predicted image interval determination unit 400 picture structure determination unit 500 Complexity measurement unit 600 recording unit 700 recording medium 800 GOP code amount allocation unit 900 code amount control unit 1000 MB coding parameter determination unit 1100 encoding unit 201 moving image feature extraction unit (second embodiment)
601 Recording unit (second embodiment)
1001 MB coding parameter determination unit (second embodiment)
1101 Code part (second embodiment)

Claims

When encoding an input video, at least one of scene change information indicating the scene switching position, fade information indicating the position where the scene fades in and out, and activity information indicating the complexity of movement between images An extraction unit for extracting feature information of a moving image including one;
A first determination unit that determines a GOP structure based on feature information of the moving image;
A second determination unit that determines a first encoding parameter based on the feature information of the moving image and the GOP structure;
A calculation unit that calculates coding complexity for each picture using the feature information of the moving image, the GOP structure, and the first coding parameter;
A storage unit for storing the first encoding parameter and the encoding complexity;
First encoding means comprising:
When the input moving image encoded by the first encoding unit is encoded again, the first encoding unit stores the first moving image stored in the storage unit when the first encoding unit encodes the input moving image. A third determination unit for determining a second encoding parameter using the encoding parameter and the encoding complexity;
Second encoding means for encoding the input moving image using the second encoding parameter;
A moving picture encoding apparatus comprising:

The moving image encoding apparatus according to claim 1, wherein the second determination unit is a unit that determines an encoding type and a picture structure for each image as the first encoding parameter.

3. The moving picture encoding apparatus according to claim 2, wherein the second determination unit is means for further determining an MB structure and a motion vector candidate value as the first encoding parameter.