JP6470191B2

JP6470191B2 - Video encoding method, video encoding apparatus, and video encoding program

Info

Publication number: JP6470191B2
Application number: JP2016001281A
Authority: JP
Inventors: 隆一谷田; 和也早瀬; 正樹北原
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 2016-01-06
Filing date: 2016-01-06
Publication date: 2019-02-13
Anticipated expiration: 2036-01-06
Also published as: JP2017123545A

Description

The present invention relates to a video encoding method, a video encoding apparatus, and a video encoding program that appropriately calculate the number of parallel processes when video encoding is performed in parallel.

One way to encode video at high speed is to encode a plurality of pictures in parallel. FIG. 4 shows, in display order, the relationship between each picture and the other pictures it references when decoded (hereinafter referred to as the "reference structure"). A picture is one frame (or one screen) of the video to be encoded. The figure shows the reference structure called M = 3, in which there are two bidirectionally predicted pictures (hereinafter referred to as "B pictures") for each unidirectionally predicted picture (hereinafter referred to as a "P picture"). The arrows drawn above or below each picture point to the pictures that the picture references.

In this figure, the encoding order is I_1 → P_1 → B_1 → B_2 → P_2 → B_3 → B_4, as indicated by the numbers below each picture. According to the reference structure, however, once the encoding of P_1 is finished, not only B_1 but also B_2 and P_2 can be encoded. A system that can perform encoding in parallel can therefore encode these three pictures simultaneously.
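
The observation above can be sketched as a dependency check. This is a minimal illustration, not part of the described apparatus: the dictionary REFS and the function encodable are hypothetical names, and the reference lists are assumed from the description of the M = 3 structure, since FIG. 4 itself is not reproduced here.

```python
# Assumed reference structure for M = 3 (each picture -> pictures it references).
REFS = {
    "I1": [],
    "P1": ["I1"],
    "B1": ["I1", "P1"],
    "B2": ["I1", "P1"],
    "P2": ["P1"],
    "B3": ["P1", "P2"],
    "B4": ["P1", "P2"],
}


def encodable(done):
    """Return the not-yet-encoded pictures whose references are all encoded."""
    return [pic for pic, refs in REFS.items()
            if pic not in done and all(r in done for r in refs)]


if __name__ == "__main__":
    # After I1 and P1 are encoded, three pictures are ready at once.
    print(encodable({"I1", "P1"}))  # ['B1', 'B2', 'P2']
```

With I_1 and P_1 encoded, the sketch reports B_1, B_2, and P_2 as ready, which matches the three pictures named above.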

Next, the configuration of an encoding apparatus for general video coding standards such as HEVC and H.264 is described. FIG. 5 is a block diagram showing this configuration. The apparatus shown in FIG. 5 includes an original image buffer 1, a decoded image buffer 10, and an encoding unit 12.

The encoding unit 12 includes a subtractor 2, an adder 3, a DCT (discrete cosine transform) calculator 4 ("DCT" in the figure), a quantizer 5 ("Q" in the figure), an inverse quantizer 6 ("IQ" in the figure), an IDCT (inverse discrete cosine transform) calculator 7 ("IDCT" in the figure), a prediction mode selection unit 8, a loop filter 9, and an entropy encoding unit 11.

The original image buffer 1 holds the original images input in display order, rearranges them into encoding order, and outputs them one by one. The subtractor 2 calculates a prediction residual signal by taking the difference between the original image sent from the original image buffer 1 and the predicted image sent from the prediction mode selection unit 8, and outputs the prediction residual signal to the DCT calculator 4.

The adder 3 calculates the sum of the predicted image output from the prediction mode selection unit 8 and the quantized prediction residual signal sent from the IDCT calculator 7, and sends the sum to the loop filter 9 as the pre-filter decoded image. The DCT calculator 4 applies a discrete cosine transform to the prediction residual signal and sends the resulting DCT coefficients to the quantizer 5.

The quantizer 5 quantizes the DCT coefficients sent from the DCT calculator 4 using an externally supplied quantization parameter QP, and sends the result, as the quantized values of the DCT coefficients, to the entropy encoding unit 11 and the inverse quantizer 6. The inverse quantizer 6 inversely quantizes the quantized values obtained by the quantizer 5 using the externally supplied quantization parameter QP, and sends the result to the IDCT calculator 7 as the post-quantization DCT coefficients.

The IDCT calculator 7 applies an inverse discrete cosine transform to the post-quantization DCT coefficients sent from the inverse quantizer 6 to obtain the quantized prediction residual signal. The prediction mode selection unit 8 creates, from the input reference image, the predicted image closest to the input original image, outputs it, and sends the corresponding prediction mode information to the entropy encoding unit 11.

The loop filter 9 filters the pre-filter decoded image sent from the adder 3 and sends the result to the decoded image buffer 10 as the decoded image. The decoded image buffer 10 stores the decoded image output from the loop filter 9 and outputs it to the prediction mode selection unit 8 as a reference image.

The entropy encoding unit 11 applies variable-length coding to the quantized values of the DCT coefficients sent from the quantizer 5 and the prediction mode information sent from the prediction mode selection unit 8, and outputs the result as an encoded stream.

Next, the processing operation of the apparatus shown in FIG. 5 is described with reference to FIG. 6. FIG. 6 is a flowchart showing the operation of the encoding apparatus shown in FIG. 5 for general video coding standards such as HEVC and H.264. When processing starts, the original image buffer 1 first rearranges the input pictures into encoding order (step S21).

Thereafter, the encoding unit 12 processes the pictures in encoding order. Each picture is divided into rectangular blocks. The prediction mode selection unit 8 determines a prediction mode for each block (step S22). The subtractor 2 takes the difference between the original image and the predicted image corresponding to the prediction mode, and outputs a prediction residual signal based on that difference (step S23).

Next, the DCT calculator 4 applies a DCT to the prediction residual signal (step S24). The quantizer 5 quantizes the DCT coefficients (step S25). The inverse quantizer 6 performs inverse quantization (step S26). The IDCT calculator 7 computes the IDCT (step S27).

Next, the adder 3 adds the predicted image and the quantized prediction residual signal to generate a decoded image (step S28). The loop filter 9 applies the loop filter to the generated decoded image (step S29) and stores the result in the decoded image buffer 10 as the decoded image (step S30). The stored decoded image is used to generate subsequent predicted images.

Meanwhile, the entropy encoding unit 11 applies variable-length coding to the DCT coefficients that have undergone the DCT and quantization, together with the corresponding prediction mode information, and outputs the result as an encoded stream (step S31).
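
Steps S23 to S28 can be sketched on a single one-dimensional block as follows. This is a simplified illustration under stated assumptions: the prediction (step S22) is taken as given, quantization is a plain division by a step size qstep, the loop filter (S29) and entropy coding (S31) are omitted, and all function names are hypothetical.

```python
import math


def _scale(k, n):
    """Orthonormal DCT scaling factor for coefficient index k."""
    return math.sqrt(1.0 / n) if k == 0 else math.sqrt(2.0 / n)


def dct(x):
    """Orthonormal 1-D DCT-II."""
    n = len(x)
    return [_scale(k, n) * sum(x[i] * math.cos(math.pi * (i + 0.5) * k / n)
                               for i in range(n))
            for k in range(n)]


def idct(c):
    """Inverse of dct() above."""
    n = len(c)
    return [sum(_scale(k, n) * c[k] * math.cos(math.pi * (i + 0.5) * k / n)
                for k in range(n))
            for i in range(n)]


def encode_block(original, predicted, qstep):
    residual = [o - p for o, p in zip(original, predicted)]      # S23
    coeffs = dct(residual)                                       # S24
    levels = [round(c / qstep) for c in coeffs]                  # S25
    dequantized = [lv * qstep for lv in levels]                  # S26
    rec_residual = idct(dequantized)                             # S27
    decoded = [p + r for p, r in zip(predicted, rec_residual)]   # S28
    return levels, decoded


if __name__ == "__main__":
    levels, decoded = encode_block([52, 55, 61, 66], [50, 50, 60, 60], qstep=2.0)
    print(levels, [round(d, 2) for d in decoded])
```

The integer levels are what an entropy coder would receive, and the decoded block is what would be stored as a reference. A larger qstep gives smaller levels and a less accurate decoded block.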

To perform the processing shown in FIG. 6 (steps S22 to S31), a quantization parameter QP must be given for each picture. The quantization parameter QP represents the coarseness of quantization. The larger the QP, the coarser the quantization and the greater the coding noise. At the same time, a larger QP gives a higher compression ratio and therefore a smaller generated code amount. The QP is calculated from a predetermined bit rate and buffer size so that the decoder's buffer model is respected.

The buffer model of the decoder is now described. A video decoder has a reception buffer in which the encoded stream received from outside is accumulated. During decoding, the encoded stream is removed from the reception buffer one frame at a time and decoded.

Because the size of this reception buffer is finite, the rate at which the received encoded stream accumulates in the reception buffer must be balanced against the rate at which the accumulated encoded stream is removed from it.

For example, if the former rate is higher, the reception buffer becomes full and can no longer receive the encoded stream, which is a "buffer overflow" (hereinafter referred to as "overflow"). Conversely, if the latter rate is higher, the reception buffer becomes empty and picture decoding stalls, which is a "buffer underflow" (hereinafter referred to as "underflow").

A general video encoder therefore incorporates a mechanism called a rate control unit, which appropriately controls the generated code amount of each picture, and hence the quantization parameter QP, so that the reception buffer neither overflows nor underflows.

Let t(n) denote the decoding time of the n-th picture in decoding order, and let Bt_after(t(n)) denote the amount of data in the reception buffer immediately after the n-th picture is decoded. Using the frame rate FPS [frames/second], the time t(n+1) at which the next, (n+1)-th picture is decoded can be written as t(n+1) = t(n) + 1/FPS.

Consequently, under the general CBR (constant bit rate) model, the amount of data Bt_before(t(n+1)) in the reception buffer immediately before the (n+1)-th picture is decoded can be expressed, using the bit rate b [bits/second], as
Bt_before(t(n+1)) = Bt_after(t(n)) + b/FPS

If the generated code amount of the (n+1)-th picture is G(n+1), the amount of data Bt_after(t(n+1)) in the reception buffer immediately after the (n+1)-th picture is decoded can be expressed as
Bt_after(t(n+1)) = Bt_before(t(n+1)) − G(n+1)

If G(n+1) > Bt_before(t(n+1)), then Bt_after(t(n+1)) < 0 and a buffer underflow occurs. Conversely, with S denoting the reception buffer size, if G(n+1) < Bt_before(t(n+1)) + b/FPS − S, then Bt_after(t(n+1)) + b/FPS > S, that is, the buffer level exceeds S before the next picture is decoded, and a buffer overflow occurs.
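
The buffer model above can be traced picture by picture. The following is a minimal sketch, assuming the CBR recurrences given above; the function name simulate_buffer and its return convention are hypothetical and chosen only for illustration.

```python
def simulate_buffer(bt_after_0, code_amounts, bitrate, fps, size):
    """Trace the reception buffer and report the first failure, if any.

    bt_after_0   -- buffer level just after the previous picture was decoded
    code_amounts -- generated code amounts G of the following pictures
    Returns ("overflow", n), ("underflow", n) or ("ok", final_level).
    """
    bt_after = bt_after_0
    for n, g in enumerate(code_amounts, start=1):
        # Bt_before = Bt_after + b/FPS: bits arriving during one frame interval.
        bt_before = bt_after + bitrate / fps
        if bt_before > size:      # buffer is full before the picture is removed
            return ("overflow", n)
        if g > bt_before:         # picture is not fully received in time
            return ("underflow", n)
        bt_after = bt_before - g  # Bt_after = Bt_before - G
    return ("ok", bt_after)


if __name__ == "__main__":
    # 3000 bit/s at 30 frames/s delivers 100 bits per frame interval.
    print(simulate_buffer(200, [100, 100, 100], 3000, 30, 500))
    print(simulate_buffer(200, [100, 400], 3000, 30, 500))
    print(simulate_buffer(200, [0, 0, 0, 0], 3000, 30, 500))
```

Pictures that are too large drain the buffer (underflow), while pictures that are too small let it fill up (overflow), which is why the generated code amount has to be bounded on both sides.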

The rate control unit described above is therefore a mechanism that calculates an appropriate generated code amount G for each picture so that neither buffer underflow nor overflow occurs, and determines the QP that yields that generated code amount.

Regarding the relationship between the generated code amount G and the quantization parameter QP: if Qstep(QP) denotes the quantization step size corresponding to a given QP, then Qstep(QP) and the resulting generated code amount G are roughly inversely proportional. Their product is called the complexity index of the picture, X = G × Qstep(QP). Because the complexity index X is close to that of the picture encoded immediately before, X is often obtained from the product measured for the previously encoded picture and used to calculate G and QP for the next picture.

The TM5 model of MPEG-2 is often used to calculate an appropriate generated code amount G for each picture. It sets a usable code amount for each GOP (Group of Pictures), a series of pictures beginning with an I picture, and distributes that code amount among the pictures in the GOP according to the complexity indices Xi, Xp, and Xb of the picture types I, P, and B.

Each time a picture is encoded, the error between the allocated code amount and the actual generated code amount is fed back, and the code amount allocation for each picture is determined so that the target bit rate b is achieved while buffer underflow and overflow are suppressed.

An example of the processing of the rate control unit follows. The processing is broadly divided into two parts, pre-processing and post-processing. Pre-processing is described first. When processing starts, the buffer position Bt_before is initialized to a predetermined initial value, and the complexity indices Xi, Xp, and Xb for the picture types are initialized to predetermined constants.

Next, the initial value of the code amount R allocated to one GOP is set. For example, if N is the number of pictures in one GOP, the initial value is calculated as R = b × N / FPS or the like. The numbers of pictures of each type in one GOP are assigned to Ni (the number of I pictures, which is 1), Np (the number of P pictures), and Nb (the number of B pictures).

Next, the allocated code amount T is calculated for each picture to be encoded as follows. If the picture is an I picture, T = (Xi × R) / (Xi × Ni + Xp × Np + Xb × Nb), and Ni is then decremented by 1. Similarly, if the picture is a P picture, T = (Xp × R) / (Xi × Ni + Xp × Np + Xb × Nb), and Np is then decremented by 1. If it is a B picture, T = (Xb × R) / (Xi × Ni + Xp × Np + Xb × Nb), and Nb is then decremented by 1.
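
The three allocation formulas share one denominator, so they can be sketched as a single function. This is an illustrative sketch only: the function name allocate and the use of dictionaries keyed by "I", "P", and "B" for Xi/Xp/Xb and Ni/Np/Nb are assumptions made for compactness.

```python
def allocate(pic_type, R, X, N):
    """Return the allocated code amount T for a picture of type pic_type.

    R -- code amount remaining for the current GOP
    X -- complexity indices, e.g. {"I": Xi, "P": Xp, "B": Xb}
    N -- remaining picture counts per type; decremented in place
    """
    denom = sum(X[t] * N[t] for t in ("I", "P", "B"))
    T = X[pic_type] * R / denom
    N[pic_type] -= 1  # one fewer picture of this type left in the GOP
    return T


if __name__ == "__main__":
    X = {"I": 4.0, "P": 2.0, "B": 1.0}
    N = {"I": 1, "P": 4, "B": 10}
    print(allocate("I", 2200.0, X, N), N)
```

Note that R is not reduced here; as described later in the text, R is updated in the post-processing once the actual generated code amount is known.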

For the code amount T obtained above, the estimated buffer position after encoding, Bt_after, is calculated as
Bt_after = Bt_before − T
To prevent Bt_after < 0 (buffer underflow) or, with S denoting the buffer size, Bt_after + b/FPS > S (buffer overflow), T is clipped to the range Bt_before + b/FPS − S ≤ T ≤ Bt_before.
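
The clipping step can be written directly from the stated range. The sketch below assumes only that range; the function name clip_allocation is hypothetical.

```python
def clip_allocation(T, bt_before, bitrate, fps, size):
    """Clip T to Bt_before + b/FPS - S <= T <= Bt_before."""
    lower = bt_before + bitrate / fps - size  # smaller T would overflow the buffer
    upper = bt_before                         # larger T would underflow the buffer
    return max(lower, min(T, upper))


if __name__ == "__main__":
    # 3000 bit/s at 30 frames/s delivers 100 bits per frame interval; S = 500.
    print(clip_allocation(400, 300, 3000, 30, 500))  # limited by the upper bound
    print(clip_allocation(10, 480, 3000, 30, 500))   # limited by the lower bound
```

When the buffer is far from both limits, the lower bound is negative and the allocation passes through unchanged.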

Next, the quantization parameter QP corresponding to this allocated code amount T is calculated. This QP is called the target QP. As described above, the following relationship approximately holds between the complexity index X and the generated code amount G.
X = G × Qstep(QP)

Therefore, for an I picture, for example, the target QP is the QP for which the product of the code amount T and Qstep(QP) is closest to Xi. Similarly, for a P picture the target QP is the QP for which the product is closest to Xp, and for a B picture the QP for which it is closest to Xb.
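
Selecting the target QP amounts to a small search over the candidate QPs. The sketch below is illustrative: the text does not specify Qstep(QP), so the mapping Qstep(QP) = 2^((QP − 4)/6) and the range 0 to 51 are assumptions modelled on the approximate behaviour of H.264, where the step size doubles every 6 QP. The function names are hypothetical.

```python
def qstep(qp):
    """Assumed QP-to-step-size mapping: doubles every 6 QP, equals 1 at QP 4."""
    return 2.0 ** ((qp - 4) / 6.0)


def target_qp(T, X, qp_range=range(0, 52)):
    """Return the QP for which T * Qstep(QP) is closest to the complexity index X."""
    return min(qp_range, key=lambda qp: abs(T * qstep(qp) - X))


if __name__ == "__main__":
    # With X = 8000 and T = 1000 the ideal step size is 8, reached at QP 22
    # under the assumed mapping.
    print(target_qp(1000.0, 8000.0))
```

A smaller allocation T for the same complexity X pushes the search towards a larger step size and hence a larger QP.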

This completes the pre-processing. After encoding is actually performed using the target QP obtained in this way, the rate control unit performs post-processing, as follows. First, the complexity index is updated from the encoding result. If G is the generated code amount of the encoded picture and Qstep(target QP) is the quantization step size corresponding to the target QP used, the complexity index is the product of the two.

Accordingly, Xi is updated to G × Qstep(target QP) if the encoded picture is an I picture, Xp if it is a P picture, and Xb if it is a B picture. The code amount R is also updated: the generated code amount G is subtracted from R, and the result becomes the new R.

Next, the buffer position Bt_before is updated using the generated code amount G, as
Bt_before = Bt_before − G + b/FPS
This completes the post-processing of the rate control unit.

The process then returns to the pre-processing of the rate control unit and calculates the target QP for the next picture to be encoded, and this is repeated. When one GOP has been encoded, R is updated and the same processing is repeated for the next GOP. Specifically, b × N / FPS is added to the value of R held at that point, and Ni, Np, and Nb are reset to the numbers of pictures of each type in one GOP.
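
The post-processing and the per-GOP update can be gathered into one small state object. This is a sketch under stated assumptions, not the described implementation: the class RateState and its field and method names are hypothetical, and R is assumed to start at 0 so that the first call to start_gop yields the initial value b × N / FPS.

```python
from dataclasses import dataclass, field


@dataclass
class RateState:
    bitrate: float     # b [bits/second]
    fps: float         # FPS [frames/second]
    gop_counts: dict   # pictures of each type per GOP, e.g. {"I": 1, "P": 4, "B": 10}
    bt_before: float   # buffer position Bt_before
    X: dict            # complexity indices per picture type
    R: float = 0.0     # code amount remaining for the current GOP
    N: dict = field(default_factory=dict)  # remaining picture counts per type

    def start_gop(self):
        """Add b * N / FPS to R and reset Ni, Np, Nb."""
        n_total = sum(self.gop_counts.values())
        self.R += self.bitrate * n_total / self.fps
        self.N = dict(self.gop_counts)

    def post_process(self, pic_type, G, qstep_used):
        """Update X, R and Bt_before from the actual generated code amount G."""
        self.X[pic_type] = G * qstep_used               # X = G * Qstep(target QP)
        self.R -= G                                     # R = R - G
        self.bt_before += self.bitrate / self.fps - G   # Bt_before - G + b/FPS


if __name__ == "__main__":
    s = RateState(3000, 30, {"I": 1, "P": 1, "B": 2}, 300.0,
                  {"I": 4.0, "P": 2.0, "B": 1.0})
    s.start_gop()
    s.post_process("I", 150, 8.0)
    print(s.R, s.bt_before, s.X)
```

Because start_gop adds to R rather than overwriting it, any surplus or deficit left over from one GOP carries into the next, which is the feedback the text describes.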

This processing makes it possible to generate a bitstream that follows the target bit rate without causing the buffer to fail (see, for example, Non-Patent Document 1).

Next, the configuration of an apparatus for simultaneously encoding a plurality of pictures as described above is described. FIG. 7 is a block diagram showing this configuration. The apparatus includes an original image buffer 1, a decoded image buffer 10, N encoding units 12-1 to 12-N (N is a natural number), a stream buffer 13, a parallel processing allocation unit 14, a buffer calculation unit 15, an allocated code amount calculation unit 16, a QP calculation unit 17, and a complexity calculation unit 18. The buffer calculation unit 15, the allocated code amount calculation unit 16, the QP calculation unit 17, and the complexity calculation unit 18 together constitute a rate control unit 19.

As before, the original image buffer 1 rearranges the original images into encoding order and sends them to the encoding units 12-1 to 12-N. In this configuration, however, there are multiple encoding units, so the buffer sends N pictures in encoding order, one to each of the N encoding units 12-1 to 12-N, based on the original image allocation information sent from the parallel processing allocation unit 14.

As before, the decoded image buffer 10 stores the decoded images sent from the encoding units 12-1 to 12-N and sends them to the encoding units 12-1 to 12-N as reference images when needed. Because there are multiple encoding units in this configuration, this buffer also supports storing multiple decoded images simultaneously and sending multiple reference images simultaneously.

Each of the encoding units 12-1 to 12-N has the same function as the encoding unit 12 enclosed by the dotted line in FIG. 5; in this configuration example, N of them are provided in parallel. The stream buffer 13 stores the encoded streams sent from the encoding units 12-1 to 12-N and sends their data sizes, as the generated code amounts, to the buffer calculation unit 15, the allocated code amount calculation unit 16, and the complexity calculation unit 18.

Based on the input number of parallel processes, the parallel processing allocation unit 14 determines which picture in the original image buffer 1 is sent to which of the N encoding units 12-1 to 12-N, and sends this to the original image buffer 1 as allocation information.

When processing starts, the buffer calculation unit 15 calculates and holds the initial buffer position Bt_before(0) based on the rate control setting information. Each time the generated code amount of a picture is obtained from the stream buffer 13, it calculates the value of Bt_before and sends it to the allocated code amount calculation unit 16.

Based on the externally supplied rate control settings and number of parallel processes, the buffer position, and the generated code amount of each picture, the allocated code amount calculation unit 16 obtains the allocated code amounts T(1) to T(N) for the next N pictures and sends them to the QP calculation unit 17. The QP calculation unit 17 calculates the target QPs for the N pictures from the complexity index of each picture type sent from the complexity calculation unit 18 and the allocated code amount of each picture, and sends them to the corresponding N encoding units 12-1 to 12-N.

The complexity calculation unit 18 calculates the complexity index for each picture type from the quantization parameter QP of each picture sent from the QP calculation unit 17 and the generated code amount of that picture sent from the stream buffer 13, and sends it to the allocated code amount calculation unit 16 and the QP calculation unit 17.

Next, the processing operation of the apparatus shown in FIG. 7 is described with reference to FIG. 8. FIG. 8 is a flowchart showing the processing performed by the apparatus that simultaneously encodes a plurality of pictures. When encoding starts, the original image buffer 1 first rearranges the pictures into encoding order (step S41), as in the flow of FIG. 6.

Next, following the processing of the rate control unit 19 described above, the allocated code amount calculation unit 16 calculates the allocated code amounts for the next N pictures (step S42). For the first of the N pictures, the allocated code amount can be calculated as described above. For the second and subsequent pictures, however, it cannot be calculated with the above formulas as they stand.

This is because calculating the allocation for the second picture, for example, requires updating the buffer position Bt_before, the code amount R, and the complexity index of the corresponding picture type using the generated code amount obtained from encoding the first picture.

Similarly, the third picture requires the encoding results of the first and second pictures, and the fourth picture requires the generated code amounts of the first to third pictures. Because the generated code amounts of these pictures are not yet available at this point, the allocated code amounts are used in their place.

For the second picture, therefore, the complexity indices are the same as those used for the first picture, and the buffer position Bt_before(2) is obtained from the buffer position Bt_before(1) for the first picture and the allocated code amount T(1) of the first picture as
Bt_before(2) = Bt_before(1) − T(1) + b/FPS

The code amount R(2) is obtained by subtracting T(1) from the code amount R(1) used when calculating the first picture:
R(2) = R(1) − T(1)
The allocated code amount T(2) for the second picture is calculated from these values. Similarly, for the third picture the complexity indices are the same as those used for the first picture, and the buffer position Bt_before(3) is
Bt_before(3) = Bt_before(2) − T(2) + b/FPS
= Bt_before(1) − (T(1) + T(2)) + 2 × b/FPS

The code amount R(3) can be expressed as
R(3) = R(2) − T(2) = R(1) − (T(1) + T(2))
and the allocated code amount T(3) is calculated from these values.
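
The provisional calculation for N parallel pictures can be sketched as a loop in which each allocation T(k) stands in for the still unknown G(k). This is an illustration under stated assumptions: the function name allocate_parallel and the dictionary layout are hypothetical, and applying the clipping step to every picture, not just the first, is an assumption of this sketch.

```python
def allocate_parallel(pic_types, R, bt_before, X, N, bitrate, fps, size):
    """Return provisional allocations T(1)..T(N) for pictures encoded in parallel.

    pic_types -- picture types in encoding order, e.g. ["P", "B", "B"]
    R, bt_before -- accurate values for the first picture: R(1), Bt_before(1)
    X -- complexity indices, shared by all pictures of the batch
    N -- remaining picture counts per type (not modified)
    """
    counts = dict(N)
    allocations = []
    for t in pic_types:
        denom = sum(X[k] * counts[k] for k in ("I", "P", "B"))
        T = X[t] * R / denom
        # Clip to Bt_before + b/FPS - S <= T <= Bt_before.
        T = max(bt_before + bitrate / fps - size, min(T, bt_before))
        counts[t] -= 1
        allocations.append(T)
        R -= T                             # R(k+1) = R(k) - T(k)
        bt_before += bitrate / fps - T     # Bt_before(k+1) = Bt_before(k) - T(k) + b/FPS
    return allocations


if __name__ == "__main__":
    X = {"I": 4.0, "P": 2.0, "B": 1.0}
    N = {"I": 0, "P": 1, "B": 2}
    print(allocate_parallel(["P", "B", "B"], 400.0, 1000.0, X, N, 3000, 30, 2000))
```

Only the first allocation rests on measured values; every later one inherits whatever error the earlier provisional values contain.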

After the allocated code amounts T(1) to T(N) for the N pictures have been calculated as above, the QP calculation unit 17 calculates the quantization parameter QP of each picture corresponding to these allocated code amounts (step S43). For each picture, this is the QP whose quantization step size is closest to the value obtained by dividing the complexity index Xi, Xp, or Xb for the picture's type by the picture's allocated code amount. For example, if the first picture is an I picture, the quantization step size is Xi / T(1), and the QP whose step size is closest to this value becomes the target QP of the first picture.

Once the target QPs for all N pictures have been calculated in the same way, the encoding units 12-1 to 12-N encode these N pictures simultaneously in parallel (step S44). Specifically, each picture is encoded simultaneously according to the "core flow of the encoding process" enclosed by the broken line in the flowchart of FIG. 6.

When the N pictures have been encoded, the generated code amounts G(1) to G(N) of the pictures are known. Based on these results, the complexity calculation unit 18 updates the complexity indices (step S45) and the buffer calculation unit 15 updates the buffer position (step S46). For the complexity indices, the complexity index of each picture is the product of its generated code amount and the Qstep corresponding to its target QP; these can, for example, be grouped by picture type and averaged to give the new complexity index for each picture type.

The buffer position, on the other hand, is updated by obtaining the buffer position Bt_before(N+1) from the generated code amounts G(1) to G(N) of the N pictures as follows.
Bt_before(N+1) = Bt_before(1) − (T(1) + T(2) + … + T(N)) + N × b / FPS
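The buffer update can be written as a one-line function. The formula is stated in terms of T(1) to T(N), while the accompanying sentence refers to the generated code amounts G(1) to G(N); the sketch below therefore takes the series as a parameter and leaves that choice to the caller. The function and parameter names are hypothetical.

```python
def next_buffer_position(bt_before_1: float, code_amounts, bitrate: float,
                         fps: float) -> float:
    """Buffer position before picture N+1, given the position before picture 1.

    code_amounts : per-picture code amounts of the N pictures just processed
    bitrate      : b, bits per second flowing into the buffer
    fps          : FPS, pictures per second
    """
    n = len(code_amounts)
    # Each picture removes its bits; each picture interval adds b / FPS bits.
    return bt_before_1 - sum(code_amounts) + n * bitrate / fps
```

With Bt_before(1) = 1,000,000, three pictures totalling 200,000 bits, b = 3,000,000 and FPS = 30, the result is 1,000,000 − 200,000 + 3 × 100,000 = 1,100,000.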

By performing the above processing on all pictures (step S47), multiple pictures can be encoded in parallel.

MPEG-2, Test Model 5 (TM5), Doc. ISO/IEC JTC1/SC29/WG11/N0400, Test Model Editing Committee, Apr. 1993

In the above method of encoding multiple pictures simultaneously, however, the allocated code amount of each picture is not necessarily calculated from accurate information. Specifically, the code amount T(1) can be calculated from the accurate buffer position Bt_before(1) and code amount R(1), whereas for the code amount T(2), the buffer position Bt_before(2) and R(2) on which the calculation is based are only provisional values.

In video encoding, a discrepancy generally arises between the allocated code amount T and the actually generated code amount G. Therefore, even if no buffer failure is predicted when the allocated code amounts are calculated, a buffer failure may have occurred by the time the N pictures have actually been encoded.

In particular, the larger the parallel number N, the greater the influence of the discrepancy between the allocated code amount T and the actually generated code amount G of each picture, and hence the higher the likelihood of a buffer failure. Simultaneous parallel processing of multiple pictures thus shortens the processing time as the number of parallel processes increases, but the risk of buffer failure rises along with it.

Of the two kinds of buffer failure, buffer overflow can be avoided by inserting empty data (filler data). For buffer underflow, in which the buffer position falls below 0, no such avoidance measure exists. Avoiding buffer underflow is therefore a particularly serious problem in parallel encoding of multiple pictures.

The present invention has been made in view of these circumstances. Its object is to provide a video encoding method, a video encoding device, and a video encoding program capable of calculating a number of parallel processes that enables high-speed encoding by making the number of parallel processes as large as possible while keeping the risk of buffer underflow low.

One aspect of the present invention is a video encoding method performed by a video encoding device that encodes video and that includes an encoding unit that encodes up to N pictures (N is a natural number of 2 or more) in parallel, an allocated code amount calculation unit that calculates an allocated code amount for each of the N pictures, and a QP calculation unit that calculates a target QP, which is a quantization parameter corresponding to the allocated code amount. The method includes: an allocated code amount recalculation step of recalculating the allocated code amount based on an error in the allocated code amount for each of the N pictures; a buffer position estimate calculation step of calculating, from the allocated code amount for each of the N pictures, a buffer position estimate indicating the estimated amount of data accumulated in a buffer after each picture is encoded; a comparison step of comparing the calculated post-encoding buffer position estimate of each picture with a predetermined threshold; a picture count calculation step of obtaining, based on the result of the comparison, the number of pictures for which the estimate is equal to or greater than the threshold; and a parallel number calculation step of calculating, based on that number of pictures, a parallel number that is the number of pictures to be encoded in parallel by the encoding unit.

One aspect of the present invention is the above video encoding method, wherein in the allocated code amount recalculation step, the allocated code amount of each picture is multiplied by a predetermined coefficient of 1 or greater.

One aspect of the present invention is the above video encoding method, wherein in the allocated code amount recalculation step, a predetermined fixed value is added to the allocated code amount of each picture.

One aspect of the present invention is the above video encoding method, wherein in the allocated code amount recalculation step, the allocated code amount based on the error is obtained by using a value calculated by substituting the allocated code amount of each picture into a predetermined equation composed of the four arithmetic operations.

One aspect of the present invention is the above video encoding method, wherein in the buffer position estimate calculation step, starting from an initial buffer position input as the buffer position of the buffer, the post-encoding buffer position estimates of the buffer are calculated, for the number of pictures defined by the parallel number, from the allocated code amount based on the error for each picture.

One aspect of the present invention is a video encoding device that encodes video, including: an encoding unit that encodes up to N pictures (N is a natural number of 2 or more) in parallel; an allocated code amount calculation unit that calculates an allocated code amount for each of the N pictures; a QP calculation unit that calculates a target QP, which is a quantization parameter corresponding to the allocated code amount; an allocated code amount recalculation unit that recalculates the allocated code amount based on an error in the allocated code amount for each of the N pictures; a buffer position estimate calculation unit that calculates, from the allocated code amount for each of the N pictures, a buffer position estimate indicating the estimated amount of data accumulated in a buffer after each picture is encoded; a comparison unit that compares the calculated post-encoding buffer position estimate of each picture with a predetermined threshold; a picture count calculation unit that obtains, based on the result of the comparison, the number of pictures for which the estimate is equal to or greater than the threshold; and a parallel number calculation unit that calculates, based on that number of pictures, a parallel number that is the number of pictures to be encoded in parallel by the encoding unit.

One aspect of the present invention is a video encoding program for causing a computer to execute the above video encoding method.

According to the present invention, it is possible to calculate a number of parallel processes that enables high-speed encoding by making the number of parallel processes as large as possible while keeping the risk of buffer underflow low.

FIG. 1 is a block diagram showing the configuration of a video encoding device according to an embodiment of the present invention.
FIG. 2 is a block diagram showing the configuration of the parallel number calculation unit 20 shown in FIG. 1.
FIG. 3 is a flowchart showing the operation of the video encoding device shown in FIG. 1.
FIG. 4 is a diagram showing the reference structure of pictures in display order.
FIG. 5 is a block diagram showing the device configuration of an encoding unit in general video encoding standards such as HEVC and H.264.
FIG. 6 is a flowchart showing the processing operation of the encoding unit shown in FIG. 5 in general video encoding standards such as HEVC and H.264.
FIG. 7 is a block diagram showing the configuration of a device for encoding multiple pictures simultaneously.
FIG. 8 is a flowchart showing the processing operation performed by a device that encodes multiple pictures simultaneously.

Hereinafter, a video encoding device according to an embodiment of the present invention will be described with reference to the drawings. FIG. 1 is a block diagram showing the configuration of the embodiment. The device shown in FIG. 1 differs from the conventional device shown in FIG. 7 in that a parallel number calculation unit 20 is newly provided.

As for the input to the parallel processing allocation unit 14, in the configuration shown in FIG. 7 an externally supplied parallel number is given as a fixed value, whereas in the configuration shown in FIG. 1 the input is the number of parallel processes m output from the parallel number calculation unit 20.

Like the device shown in FIG. 7, the device shown in FIG. 1 includes an original image buffer 1, a decoded image buffer 10, N encoding units 12-1 to 12-N (N is a natural number), a stream buffer 13, a parallel processing allocation unit 14, a buffer calculation unit 15, an allocated code amount calculation unit 16, a QP calculation unit 17, and a complexity calculation unit 18. The buffer calculation unit 15, the allocated code amount calculation unit 16, the QP calculation unit 17, and the complexity calculation unit 18 together constitute a rate control unit 19. In addition, the newly provided parallel number calculation unit 20 is included. Because the configuration of the device shown in FIG. 1 is the same as that shown in FIG. 7 except for the parallel number calculation unit 20, it is described only briefly here.

The original image buffer 1 rearranges the original images into encoding order and sends them to the encoding units 12-1 to 12-N. Because this configuration has multiple encoding units 12-1 to 12-N, however, the original image buffer 1 sends m pictures in encoding order, one each to any m of the N encoding units 12-1 to 12-N, based on the original image allocation information sent from the parallel processing allocation unit 14.

The decoded image buffer 10 stores the decoded images sent from the encoding units 12-1 to 12-N and, as needed, sends them to the encoding units 12-1 to 12-N as reference images. Because this configuration has multiple encoding units 12-1 to 12-N, the decoded image buffer 10 also supports storing multiple decoded images simultaneously and sending out multiple reference images simultaneously.

The encoding units 12-1 to 12-N each have the same function as the encoding unit 12 shown in FIG. 5; in this configuration example, N such encoding units are provided in parallel. The stream buffer 13 stores the encoded streams sent from the encoding units 12-1 to 12-N and sends their data sizes, as generated code amounts, to the buffer calculation unit 15, the allocated code amount calculation unit 16, and the complexity calculation unit 18.

Based on the input number of parallel processes m, the parallel processing allocation unit 14 determines which picture in the original image buffer 1 is to be sent to which of the N encoding units 12-1 to 12-N, and sends the result to the original image buffer 1 as allocation information. As a method of allocating the pictures in the original image buffer 1 to the encoding units 12-1 to 12-N, the m pictures may, for example, be assigned to the encoding units in order, starting from the first unit.

When processing starts, the buffer calculation unit 15 calculates and holds the initial value Bt_before(0) of the buffer position based on the rate control setting information. Each time the generated code amount of a picture is obtained from the stream buffer 13, it calculates the value of Bt_before and sends it to the allocated code amount calculation unit 16 and the parallel number calculation unit 20.

The allocated code amount calculation unit 16 obtains the allocated code amounts T(1) to T(N) of the next N pictures based on the externally supplied rate control settings and parallel number, the buffer position, and the generated code amount of each picture, and sends them to the QP calculation unit 17 and the parallel number calculation unit 20. The QP calculation unit 17 calculates the target QPs for the N pictures based on the complexity index of each picture type sent from the complexity calculation unit 18 and the allocated code amount of each picture, and sends them to the corresponding N encoding units 12-1 to 12-N.

The parallel number calculation unit 20 outputs the number of parallel processes m for the encoding units 12-1 to 12-N, based on the allocated code amount of each picture output by the allocated code amount calculation unit 16, the buffer position output by the buffer calculation unit 15, and the externally specified parallel number. The output number of parallel processes m must satisfy N ≥ m.

Next, the configuration of the parallel number calculation unit 20 shown in FIG. 1 will be described with reference to FIG. 2. FIG. 2 is a block diagram showing the configuration of the parallel number calculation unit 20 shown in FIG. 1. The parallel number calculation unit 20 includes an error-considered code amount calculation unit 21, a buffer transition estimation unit 22, a threshold determination unit 23, and a parallel processing number determination unit 24. The allocated code amount of each picture, the parallel number, and the buffer position are input to the parallel number calculation unit 20.

The error-considered code amount calculation unit 21 receives the allocated code amounts for the number of pictures specified by the input parallel number, multiplies each of them by a predetermined error coefficient k = 1.2, and sends the results to the buffer transition estimation unit 22 as the error-considered code amount of each picture. Although k = 1.2 is used here, the error coefficient is not limited to this value, and any value may be used.

The buffer transition estimation unit 22 takes the initial buffer position input as the buffer position as its starting point, calculates from the error-considered allocated code amount of each picture the post-encoding buffer position (Bt_after, the amount of data accumulated in the receive buffer immediately after the n-th picture is decoded) for the number of pictures defined by the parallel number, and sends the results to the threshold determination unit 23.

The threshold determination unit 23 compares each of the post-encoding buffer positions (Bt_after of each picture), for the number of pictures defined by the parallel number, with a predetermined threshold Th and sends the results to the parallel processing number determination unit 24.

The parallel processing number determination unit 24 receives the output of the threshold determination unit 23 and, within the range up to the value defined by the parallel number, finds the largest picture number up to which every threshold determination result is equal to or greater than the threshold Th, and outputs it as the number of parallel processes m.

Next, the operation of the video encoding device shown in FIG. 1, which includes the parallel number calculation unit 20 shown in FIG. 2, will be described with reference to FIG. 3. FIG. 3 is a flowchart showing the operation of the video encoding device shown in FIG. 1.

When encoding starts, the original image buffer 1 first rearranges the input images into encoding order, as in the conventional method (step S1). The allocated code amount calculation unit 16 then calculates the allocated code amounts for the next N pictures (N is a natural number) (step S2). The calculation method is as described above.

Next, the error-considered code amount calculation unit 21 takes the error into account in the calculated allocated code amounts T(1) to T(N) of the pictures (step S3). Specifically, it multiplies T(1) to T(N) by the error coefficient k = 1.2 to obtain the error-considered code amounts T'(1) to T'(N).
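Step S3 amounts to a scalar multiplication of each allocated code amount. A minimal sketch, with a hypothetical function name and the example coefficient k = 1.2 as the default:

```python
def with_error_margin(allocated, k: float = 1.2):
    """Return the error-considered code amounts T'(n) = k * T(n).

    allocated : allocated code amounts T(1)..T(N)
    k         : error coefficient; 1.2 is the example value from the text
    """
    return [k * t for t in allocated]
```

The original amounts T(n) are left untouched, since the QP calculation in step S7 uses the values before multiplication by k.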

Next, the buffer transition estimation unit 22 estimates the buffer transition (step S4). Specifically, the estimated buffer positions Bt_after(1) to Bt_after(N) at the end of encoding of each picture are obtained as follows.
Bt_after(1) = Bt_before(1) − T'(1)
Bt_after(2) = Bt_before(2) − T'(2)
            = Bt_before(1) − (T'(1) + T'(2)) + b / FPS
…
Bt_after(N) = Bt_before(N) − T'(N)
            = Bt_before(1) − (T'(1) + T'(2) + … + T'(N)) + (N − 1) × b / FPS
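The recurrence above can be evaluated with a single running level: subtract T'(n) to get Bt_after(n), then add b / FPS to get Bt_before(n+1). The following sketch uses hypothetical function and parameter names.

```python
def estimate_buffer_after(bt_before_1: float, margined, bitrate: float,
                          fps: float):
    """Return [Bt_after(1), ..., Bt_after(N)] for the error-considered amounts.

    bt_before_1 : buffer position just before the first picture
    margined    : error-considered code amounts T'(1)..T'(N)
    bitrate     : b, bits per second flowing into the buffer
    fps         : FPS, pictures per second
    """
    positions = []
    level = bt_before_1
    for t_margined in margined:
        level -= t_margined        # picture leaves the buffer: Bt_after(n)
        positions.append(level)
        level += bitrate / fps     # one picture interval of refill: Bt_before(n+1)
    return positions
```

With Bt_before(1) = 500,000, four pictures of T' = 120,000 each, b = 3,000,000 and FPS = 30, the estimates are 380,000, 360,000, 340,000 and 320,000, which matches the closed form Bt_before(1) − ΣT' + (n − 1) × b / FPS.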

The threshold determination unit 23 then compares Bt_after(1) to Bt_after(N) with the predetermined threshold Th (step S5). Based on the comparison results, the parallel processing number determination unit 24 finds the first picture number, in encoding order, whose estimate falls below the threshold Th, and outputs the picture number immediately before it as the number of parallel processes m (step S6). For example, if Bt_after(1) and Bt_after(2) are above the threshold Th and Bt_after(3) is below it, the number of parallel processes m is 2.
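Steps S5 and S6 together count the leading run of estimates that stay at or above Th. A minimal sketch with a hypothetical function name follows; the text does not say what should happen when even Bt_after(1) is below Th, so the sketch simply returns 0 in that case and leaves the handling to the caller.

```python
def parallel_count(bt_after, threshold: float) -> int:
    """Return m, the number of leading pictures whose estimate is >= threshold.

    bt_after  : estimated buffer positions Bt_after(1)..Bt_after(N)
    threshold : Th
    """
    m = 0
    for level in bt_after:
        if level < threshold:
            break          # first picture below Th: stop just before it
        m += 1
    return m
```

Note that a later estimate rising back above Th does not extend the run: only pictures before the first violation are counted.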

After that, the QP calculation unit 17 calculates the quantization parameter QP for the next m pictures, where m is the number of parallel processes (step S7). Here, the quantization parameter QP is calculated from the code amounts T(1) to T(m) before multiplication by the error coefficient k (for example, 1.2).

Once the target QPs for the next m pictures have been obtained, the encoding units 12-1 to 12-N encode the m pictures in parallel, as in the conventional method (step S8). Not all of the encoding units 12-1 to 12-N are necessarily used to encode the m pictures; only m of them are used. The complexity calculation unit 18 then updates the complexity indices based on the resulting generated code amounts (step S9).

The buffer calculation unit 15 also updates the buffer position based on the resulting generated code amounts (step S10). If pictures remain to be encoded, processing returns to the calculation of the allocated code amounts for the next N pictures, and the encoding process is repeated until the end (step S11).
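The overall loop of steps S2 to S11 can be sketched as a small driver. This is an illustration under several assumptions, none of which come from the text: `allocate` and `encode_batch` are hypothetical callbacks standing in for the allocated code amount calculation and for QP calculation plus encoding, the complexity update (step S9) is hidden inside `encode_batch`, the buffer update uses the generated code amounts, and m is clamped to at least 1 so that the loop always makes progress.

```python
def encode_sequence(num_pictures, n_max, bt_initial, bitrate, fps,
                    threshold, allocate, encode_batch, k=1.2):
    """Return the list of batch sizes m used to encode num_pictures pictures."""
    refill = bitrate / fps                 # bits added per picture interval
    bt_before = bt_initial
    done = 0
    batch_sizes = []
    while done < num_pictures:             # step S11: repeat until the end
        n = min(n_max, num_pictures - done)
        allocated = allocate(bt_before, n)           # step S2
        level, m = bt_before, 0
        for t in allocated:                          # steps S3 to S6
            level -= k * t                           # Bt_after with margin
            if level < threshold:
                break
            m += 1
            level += refill
        m = max(m, 1)   # assumption: always encode at least one picture
        generated = encode_batch(allocated[:m])      # steps S7 to S9
        bt_before = bt_before - sum(generated) + m * refill   # step S10
        done += m
        batch_sizes.append(m)
    return batch_sizes
```

As a usage example, with a constant allocation of 100,000 bits per picture, an encoder that overshoots every picture by 10,000 bits, b / FPS = 100,000, an initial buffer position of 400,000, Th = 210,000, k = 1.25 and at most 4 parallel pictures, the batch sizes shrink as the buffer drains.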

In the above description, multiplying by the error coefficient k = 1.2 was given as an example of how to take the error into account in the allocated code amount, but the method is not limited to multiplication by the error coefficient k. For example, a fixed value may be added. Nor is the method limited to multiplication or addition: the allocated code amount may be based on a value obtained by substituting the allocated code amount of each picture into a predetermined equation, composed of the four arithmetic operations, for calculating the error coefficient k.
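The two alternatives mentioned here can be sketched next to each other. The function names are hypothetical, and the affine form `gain * T + offset` is only one possible example of an equation built from the four arithmetic operations; the text does not specify a particular equation.

```python
def add_fixed_margin(allocated, offset: float):
    """Error-considered amounts obtained by adding a fixed value to each T(n)."""
    return [t + offset for t in allocated]


def affine_margin(allocated, gain: float, offset: float):
    """One example of an arithmetic equation: T'(n) = gain * T(n) + offset."""
    return [gain * t + offset for t in allocated]
```

An additive margin weighs more heavily on small pictures (typically B pictures) than a multiplicative one, which may or may not be desirable depending on where the allocation error tends to occur.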

The purpose of multiplying by the error coefficient k is to set the allocated code amount on the safe side by multiplying it by a predetermined coefficient of 1 or greater, so any coefficient of 1 or greater may be used. In particular, in this embodiment a discrepancy arises between the allocated code amount T(N) and the actually generated code amount G(N), and an error coefficient of 1 or greater is applied so that the compensation for this discrepancy errs on the safe side. For this reason, the "coefficient of 1 or greater" may be determined, for example, by performing encoding in advance without parallelization, calculating the allocated code amount T(N) and the actually generated code amount G(N), and taking their ratio.

For example, the error coefficient k can be set by encoding a large number of videos of all kinds in advance, obtaining the average of the allocated code amount T(N) and the average of the actually generated code amount G(N), and using their ratio as the "coefficient of 1 or greater".

The error coefficient k also need not be a fixed value. As encoding proceeds, the allocated code amount T(N) and the actually generated code amount G(N) of each frame become available in turn, so k may be updated successively using the averages over the most recent M frames, or using the averages of the values accumulated since the start of encoding.
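An adaptive error coefficient over the most recent M frames can be sketched with a bounded queue. The class name, the fallback to an initial k before any frame has been observed, and the clamp to a minimum of 1.0 (following the "coefficient of 1 or greater" requirement) are assumptions of this sketch.

```python
from collections import deque


class ErrorCoefficient:
    """Sliding-window estimate of k = mean(G) / mean(T), kept at 1.0 or above."""

    def __init__(self, window: int, initial_k: float = 1.2):
        self._pairs = deque(maxlen=window)   # keeps only the latest M frames
        self._initial_k = initial_k

    def observe(self, allocated_bits: float, generated_bits: float) -> None:
        """Record T(n) and G(n) of a frame that has just been encoded."""
        self._pairs.append((allocated_bits, generated_bits))

    @property
    def k(self) -> float:
        if not self._pairs:
            return self._initial_k
        total_t = sum(t for t, _ in self._pairs)
        total_g = sum(g for _, g in self._pairs)
        # Ratio of means equals ratio of sums over the same window.
        return max(1.0, total_g / total_t)
```

Passing `window=None` to `deque` would remove the bound, which corresponds to the variant that averages everything accumulated since the start of encoding.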

As described above, when encoding multiple pictures simultaneously, the video encoding device of the embodiment estimates the buffer transition after taking the error into account in the allocated code amounts, and then sets an appropriate number of pictures to encode simultaneously. With this configuration, the occurrence of buffer overflow and buffer underflow at decoding time can be kept low, and the encoding speed can be increased.

In particular, when the buffer position is low, the parallel number calculation unit 20 reduces the number of parallel processes, which lowers the risk of buffer underflow at the cost of slower processing. Conversely, when the risk of buffer underflow is low, for example when the buffer position is high, the parallel number calculation unit 20 increases the number of parallel processes, which raises the processing speed. As a result, high-speed encoding is possible by making the number of parallel processes as large as possible while keeping the risk of buffer underflow low.

All or part of the video encoding device in the embodiment described above may be implemented by a computer. In that case, a program for implementing these functions may be recorded on a computer-readable recording medium, and the program recorded on the recording medium may be read into and executed by a computer system. The "computer system" here includes an OS and hardware such as peripheral devices. The "computer-readable recording medium" refers to a portable medium such as a flexible disk, a magneto-optical disk, a ROM, or a CD-ROM, or to a storage device such as a hard disk built into a computer system. The "computer-readable recording medium" may further include a medium that holds the program dynamically for a short time, such as a communication line used when the program is transmitted over a network such as the Internet or a communication line such as a telephone line, and a medium that holds the program for a certain period of time, such as the volatile memory inside a computer system serving as the server or client in that case. The program may implement only part of the functions described above, may implement them in combination with a program already recorded in the computer system, or may be implemented using hardware such as a PLD (Programmable Logic Device) or an FPGA (Field Programmable Gate Array).

Although an embodiment of the present invention has been described above with reference to the drawings, the embodiment is merely an example of the present invention, and the present invention is clearly not limited to it. Components may therefore be added, omitted, replaced, or otherwise changed without departing from the technical idea and scope of the present invention.

The present invention is applicable to uses in which it is essential to encode at high speed by making the number of parallel processes as large as possible while keeping the risk of buffer underflow low.

1 ... original image buffer, 10 ... decoded image buffer, 12-1 to 12-N ... encoding units, 13 ... stream buffer, 14 ... parallel processing allocation unit, 15 ... buffer calculation unit, 16 ... allocated code amount calculation unit, 17 ... QP calculation unit, 18 ... complexity calculation unit, 19 ... rate control unit, 20 ... parallel number calculation unit

Claims

1. A video encoding method performed by a video encoding device that encodes video, the device comprising an encoding unit that encodes up to N pictures (N being a natural number of 2 or more) in parallel, an allocation code amount calculation unit that calculates an allocation code amount for each of the N pictures, and a QP calculation unit that calculates a target QP, which is a quantization parameter corresponding to the allocation code amount, the method comprising:
an allocation code amount recalculation step of recalculating the allocation code amount for each of the N pictures based on an error of the allocation code amount;
a buffer position estimated value calculation step of calculating, from the allocation code amount of each of the N pictures, a buffer position estimated value indicating the estimated amount of data accumulated in a buffer after each picture is encoded;
a comparison step of comparing the calculated buffer position estimated value of each picture with a predetermined threshold;
a picture count calculation step of determining, based on the result of the comparison, the number of pictures whose buffer position estimated value is equal to or greater than the threshold; and
a parallel number calculation step of calculating, based on that number of pictures, a parallel number that is the number of pictures to be encoded in parallel by the encoding unit.
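The steps recited above can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the claimed implementation: it assumes the error is covered by multiplying each allocation by a single coefficient, that the buffer gains a fixed number of bits per picture interval and loses the (padded) allocation when a picture is encoded, and that the parallel number is simply the count of pictures at or above the threshold, clamped to at least 1. All function names, parameter names, and numeric values are hypothetical.

```python
from typing import List, Sequence


def recalc_allocations(allocated: Sequence[float], coeff: float) -> List[float]:
    """Pad each allocation to allow for allocation error (assumed: one coefficient)."""
    return [a * coeff for a in allocated]


def estimate_buffer_positions(
    initial_position: float,
    allocated: Sequence[float],
    bits_in_per_picture: float,
) -> List[float]:
    """Estimated buffer occupancy after each picture is encoded.

    Assumed model: occupancy rises by bits_in_per_picture for every picture
    interval and falls by that picture's allocation.
    """
    positions = []
    position = initial_position
    for amount in allocated:
        position = position + bits_in_per_picture - amount
        positions.append(position)
    return positions


def parallel_number(
    initial_position: float,
    allocated: Sequence[float],
    bits_in_per_picture: float,
    threshold: float,
    coeff: float,
) -> int:
    """Count the pictures whose estimate stays at or above the threshold."""
    padded = recalc_allocations(allocated, coeff)
    positions = estimate_buffer_positions(initial_position, padded, bits_in_per_picture)
    count = sum(1 for p in positions if p >= threshold)
    # Assumption: at least one picture is always encoded.
    return max(1, count)


if __name__ == "__main__":
    n = parallel_number(
        initial_position=1000.0,
        allocated=[400.0, 100.0, 100.0],
        bits_in_per_picture=100.0,
        threshold=500.0,
        coeff=1.5,
    )
    print(n)
```

A larger coefficient pads the allocations more heavily, so fewer pictures clear the threshold and the parallel number falls. That is the trade-off between underflow risk and encoding speed that the method is meant to balance.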

2. The video encoding method according to claim 1, wherein in the allocation code amount recalculation step, the allocation code amount of each picture is multiplied by one or more predetermined coefficients.

3. The video encoding method according to claim 1, wherein, in the allocation code amount recalculation step, a predetermined fixed value is added to the allocation code amount of each picture.

4. The video encoding method according to claim 1, wherein, in the allocation code amount recalculation step, the allocation code amount based on the error is determined using a value calculated by substituting the allocation code amount of each picture into a predetermined expression composed of the four basic arithmetic operations.
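The three recalculation variants recited in the dependent claims above (multiplying by a coefficient, adding a fixed value, and substituting into an arithmetic expression) might look like the following. This is only a sketch: the function names and the numeric values are hypothetical, and the text here does not say which coefficients, margins, or expressions are actually used.

```python
from typing import Callable, List, Sequence


def scale_by_coefficient(allocated: Sequence[float], coeff: float) -> List[float]:
    """Variant 1: multiply each picture's allocation by a predetermined coefficient."""
    return [a * coeff for a in allocated]


def add_fixed_margin(allocated: Sequence[float], margin: float) -> List[float]:
    """Variant 2: add a predetermined fixed value to each picture's allocation."""
    return [a + margin for a in allocated]


def apply_expression(
    allocated: Sequence[float], expr: Callable[[float], float]
) -> List[float]:
    """Variant 3: substitute each allocation into a predetermined arithmetic expression."""
    return [expr(a) for a in allocated]


if __name__ == "__main__":
    allocations = [100.0, 200.0]  # hypothetical allocation code amounts in bits
    print(scale_by_coefficient(allocations, 1.5))
    print(add_fixed_margin(allocations, 50.0))
    # Hypothetical expression combining a multiplication and an addition.
    print(apply_expression(allocations, lambda a: a * 1.25 + 10.0))
```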

5. The video encoding method according to claim 1, wherein, in the buffer position estimated value calculation step, the buffer position estimated value of the buffer after encoding of the number of pictures defined by the parallel number is calculated, starting from an initial buffer position input as the buffer position of the buffer, from the allocation code amount based on the error for each picture.

6. A video encoding device that encodes video, comprising:
an encoding unit that encodes up to N pictures (N being a natural number of 2 or more) in parallel;
an allocation code amount calculation unit that calculates an allocation code amount for each of the N pictures;
a QP calculation unit that calculates a target QP, which is a quantization parameter corresponding to the allocation code amount;
an allocation code amount recalculation unit that recalculates the allocation code amount for each of the N pictures based on an error of the allocation code amount;
a buffer position estimated value calculation unit that calculates, from the allocation code amount of each of the N pictures, a buffer position estimated value indicating the estimated amount of data accumulated in a buffer after each picture is encoded;
a comparison unit that compares the calculated buffer position estimated value of each picture after encoding with a predetermined threshold;
a picture count calculation unit that determines, based on the result of the comparison, the number of pictures whose buffer position estimated value is equal to or greater than the threshold; and
a parallel number calculation unit that calculates, based on that number of pictures, a parallel number that is the number of pictures to be encoded in parallel by the encoding unit.

7. A video encoding program for causing a computer to execute the video encoding method according to any one of claims 1 to 5.