JP2002171529A

JP2002171529A - Video encoder and method, recording medium, and decoder

Info

Publication number: JP2002171529A
Application number: JP2000364200A
Authority: JP
Inventors: Toshihiro Tanaka; 俊啓田中; Yasuhiko Yamane; 靖彦山根
Original assignee: Matsushita Electric Industrial Co Ltd
Current assignee: Panasonic Holdings Corp
Priority date: 2000-11-30
Filing date: 2000-11-30
Publication date: 2002-06-14

Abstract

PROBLEM TO BE SOLVED: To provide a video encoder and method that attains multi-angle broadcasting for a frequency band by one channel in digital TV broadcasting. SOLUTION: A bit allocator 41 allocates bits on the basis of multi-angle information 12 and outputs bit allocation information 43. A 1st encoder 201 and a 2nd encoder 202 respectively encode a main video image 101 and audio 102 and a multi-angle video image 111 and audio 112 on the basis of the multi- angle information 12 and the bit allocation information 43. A multiplexer 51 multiplexes transport streams outputted from the 1st encoder 201 and the 2nd encoder 202 into one broadcast transport stream 8 on the basis of the multi- angle information 12 and the bit allocation information 43.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、デジタルＴＶ放送
において１チャンネル分の帯域でマルチアングル放送を
可能とする映像符号化方法に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a video encoding method that enables multi-angle broadcasting in a band corresponding to one channel in digital TV broadcasting.

【０００２】[0002]

【従来の技術】デジタルＴＶ放送では、デジタル化した
映像を情報圧縮し、適切な伝送レートで符号化すること
で、限られた伝送帯域幅を最大限に利用し、多くの放送
局の番組を同時に放送することを可能にしている。これ
を可能にした映像圧縮技術は、ＭＰＥＧ２（Moving Pic
ture Expert Group）規格（ＩＳＯ／ＩＥＣ１３８１８
−２）である。2. Description of the Related Art In digital TV broadcasting, digital video data is compressed and encoded at an appropriate transmission rate, thereby making the most of a limited transmission bandwidth and allowing programs of many broadcasting stations to be broadcast. It is possible to broadcast at the same time. The video compression technology that made this possible is MPEG2 (Moving Pic
Environment Expert Group) standard (ISO / IEC13818)
-2).

【０００３】最初に、ＭＰＥＧ２規格に基づく映像符号
化について簡単に説明する。図１８にＭＰＥＧ２に基づ
いたビデオエンコーダのブロック図を示す。入力映像系
列２１１は、フレーム並べ替え回路３２において、フレ
ーム毎にフレーム内符号化フレーム（以下、Ｉピクチャ
と称す）、フレーム間符号化フレーム（以下、Ｐピクチ
ャと称す）およびフレーム内挿符号化フレーム（以下、
Ｂピクチャと称す）の符号化タイプに応じて符号化順に
フレームの並べ替えが行われ、動き推定回路３３１にお
いて動きベクトルが算出される。動き補償回路３３２は
加算器３９１より出力されたデータと動き推定回路３３
１より出力された動きベクトルより動き補償したデータ
を生成し、差分器３９２に出力する。動き補償したデー
タは、次に符号化するピクチャに対する前回の動き補償
分として逆量子化回路３５２、逆ＤＣＴ回路３４２で復
号され、加算器３９１に出力される。差分器３９２は動
き推定回路３３１より出力されたデータと動き補償回路
３３２より出力されたデータの差分を取り、ＤＣＴ回路
３４１は差分器３９２より出力された映像信号に離散コ
サイン変換（ＤＣＴ:Discrete Cosine Transform）処理
を行う。量子化回路３５１はＤＣＴ回路２４１より出力
された変換係数に量子化制御回路３８より与えられる量
子化幅による量子化処理を行い、可変長符号化回路は、
量子化回路３５１より出力された変換係数を符号化し、
バッファ３７に出力する。バッファ３７は復号時のバッ
ファ状態をシミュレートするためのＶＢＶ（Video Buff
eringVerifier）と呼ばれる仮想的なバッファであり、
量子化制御回路３８はバッファ３７より出力される発生
符号量、ビットレート、バッファ残量、およびその他の
条件などから算出した目標符号量を求め、これよりマク
ロブロックに与える量子化幅を計算し、量子化回路３５
１に出力する。バッファ３７に出力されたデータは、ビ
デオストリーム２２１として出力される。[0003] First, video encoding based on the MPEG2 standard will be briefly described. FIG. 18 shows a block diagram of a video encoder based on MPEG2. In the frame rearranging circuit 32, the input video sequence 211 is converted into an intra-coded frame (hereinafter, referred to as an I-picture), an inter-coded frame (hereinafter, referred to as a P-picture), and an interpolated encoded frame for each frame. (Less than,
The frames are rearranged in the coding order according to the coding type of the B picture), and the motion estimation circuit 331 calculates a motion vector. The motion compensation circuit 332 calculates the data output from the adder 391 and the motion estimation circuit 33
The motion-compensated data is generated from the motion vector output from 1 and output to the differentiator 392. The motion-compensated data is decoded by the inverse quantization circuit 352 and the inverse DCT circuit 342 as the previous motion compensation for the next picture to be encoded, and output to the adder 391. The differentiator 392 calculates the difference between the data output from the motion estimator 331 and the data output from the motion compensator 332, and the DCT circuit 341 converts the video signal output from the differentiator 392 into a discrete cosine transform (DCT). Transform) processing. The quantization circuit 351 performs a quantization process on the transform coefficient output from the DCT circuit 241 using a quantization width given by the quantization control circuit 38.
The transform coefficient output from the quantization circuit 351 is encoded,
Output to the buffer 37. The buffer 37 is a VBV (Video Buffer) for simulating a buffer state at the time of decoding.
eringVerifier)
The quantization control circuit 38 obtains a target code amount calculated from the generated code amount output from the buffer 37, the bit rate, the remaining buffer amount, and other conditions, and calculates a quantization width to be given to the macroblock based on the target code amount. Quantization circuit 35
Output to 1. The data output to the buffer 37 is output as a video stream 221.

【０００４】放送のように予め決められた帯域幅で送信
する場合、ビデオの符号化ビットレートを固定して符号
化を行うＣＢＲ（Constant Bit Rate）の符号化が適用
できる。しかし実際の映像には、動きが激しく圧縮が難
しいシーンや、動きが少なく圧縮が容易なシーンなど様
々なシーンが含まれており、ＣＢＲのようにビットレー
トを一定にすると、動きが激しいシーンなどでの画質劣
化が目立つようになる。そこで、画質を改善するため
に、圧縮の容易なシーンには少ないビットレートを割り
当てて、不要なビット量を削減し、圧縮の難しいシーン
には多くのビットレートを割り当てるＶＢＲ（Variable
Bit Rate）の符号化がある。ＶＢＲの符号化方法に
は、大きく分けて２種類の方法があり、一つは、符号化
を１回のみ行う符号化方法であり、もう一つは、一般的
に知られる２パスエンコードに代表される、符号化を複
数回行う方法である。[0004] In the case of transmission with a predetermined bandwidth such as broadcast, CBR (Constant Bit Rate) coding for performing coding by fixing the video coding bit rate can be applied. However, actual video contains a variety of scenes, such as scenes that are hard to compress due to severe movement, and scenes that are easy to compress with little movement. If the bit rate is fixed as in CBR, scenes with severe movement, etc. Image quality degradation becomes noticeable. Therefore, in order to improve the image quality, a VBR (Variable) that allocates a small bit rate to a scene that is easily compressed to reduce an unnecessary bit amount and allocates a large bit rate to a scene that is difficult to compress.
Bit Rate) coding. There are roughly two types of VBR encoding methods, one is an encoding method in which encoding is performed only once, and the other is representative of a generally known two-pass encoding. The encoding is performed a plurality of times.

【０００５】ここで、符号化を複数回行う方法の一例と
して、代表的な２パスエンコードについて説明する。２
パス用ビデオエンコーダの一例を、図１９に示す。入力
映像信号１は、選択器１４１に入力される。第１回目の
符号化では選択器１４１は端子２１１１に接続され、映
像信号１は１パスビデオエンコーダ３０２に入力され
る。１パスビデオエンコーダ３０２では、図１８で説明
したＭＰＥＧ２のビデオエンコーダ３０と同様の符号化
が行われ、符号データもしくは発生符号量などの符号化
難易度情報４６が、第２回目の符号化における目標符号
量を割り当てるための２パスビットアロケータ４５に入
力される。ここでデータ量が多い時などでは、ディスク
７２に得られたデータを記録することもできる。２パス
ビットアロケータ４５は得られたデータと第２回目のビ
ットレートなどの符号化条件から、第２回目の目標符号
量を算出する。次に、第２回目の符号化が実行される。
この場合、選択器１４１は端子２１１２に接続されてお
り、第１回目と全く同じ映像信号１が、２パスビデオエ
ンコーダ３０３に入力され、２パスビットアロケータ４
５において計算された目標符号量に基づいて、第２回目
の符号化を行い、ビデオストリーム２２１を出力する。Here, a typical two-pass encoding will be described as an example of a method of performing encoding a plurality of times. 2
FIG. 19 shows an example of the pass video encoder. The input video signal 1 is input to the selector 141. In the first encoding, the selector 141 is connected to the terminal 2111, and the video signal 1 is input to the one-pass video encoder 302. In the one-pass video encoder 302, encoding similar to that of the MPEG2 video encoder 30 described with reference to FIG. 18 is performed, and encoding difficulty information 46 such as encoded data or generated code amount is set as a target in the second encoding. It is input to a two-pass bit allocator 45 for assigning a code amount. Here, when the data amount is large, the data obtained on the disk 72 can be recorded. The two-pass bit allocator 45 calculates a second target code amount from the obtained data and coding conditions such as the second bit rate. Next, the second encoding is performed.
In this case, the selector 141 is connected to the terminal 2112, and the video signal 1 exactly the same as the first time is input to the two-pass video encoder 303, and the two-pass bit allocator 4
The second encoding is performed based on the target code amount calculated in step 5, and a video stream 221 is output.

【０００６】放送では、ビデオとオーディオを多重して
生成するシステムストリームは、ＭＰＥＧ２のシステム
規格（ＩＳＯ／ＩＥＣ１３８１８−１）にて規定されて
いるトランスポートストリーム（ＴＳ：Transport Stre
am）が用いられる。トランスポートストリームは、固定
長のパケットの列であり、ＣＢＲであるが、複数のプロ
グラムを多重できる構成になっている。受信側では、各
プログラムのパケットを特定するためにつけられた固有
のＩＤであるＰＩＤ（Packet IDentifier ）を読み取る
ことで、現在受信しているプログラムに必要なパケット
のみを抽出することができる。放送でＶＢＲを使用する
場合、最高ビットレートが帯域幅で制限されるため、一
旦最高ビットレートで送信できる帯域幅を確保すると、
常に最高ビットレートで符号化しても、平均ビットレー
トを低くして符号化しても、使用する帯域幅は同じとな
る。このような欠点を補う一つの方法として、複数のプ
ログラム間で帯域を共有し、各プログラム間の符号化難
易度を測定しながら動的に帯域を割り当てる、統計多重
方式がある。統計多重方式は、多くのプログラムを放送
できるような広い帯域幅を有する放送局が、画質を重視
した放送を行いたい時などに適している。In broadcasting, a system stream generated by multiplexing video and audio is a transport stream (TS: Transport Stream) defined by the MPEG2 system standard (ISO / IEC13818-1).
am) is used. The transport stream is a sequence of fixed-length packets and is a CBR, but has a configuration in which a plurality of programs can be multiplexed. On the receiving side, by reading a PID (Packet IDentifier), which is a unique ID assigned to identify a packet of each program, it is possible to extract only packets necessary for the program currently being received. When using VBR in broadcasting, the maximum bit rate is limited by the bandwidth, so once the bandwidth that can be transmitted at the maximum bit rate is secured,
Even if coding is always performed at the highest bit rate or when coding is performed at a lower average bit rate, the same bandwidth is used. One method of compensating for such a drawback is a statistical multiplexing method in which a band is shared among a plurality of programs and a band is dynamically allocated while measuring the degree of difficulty of encoding between the programs. The statistical multiplexing method is suitable when a broadcasting station having a wide bandwidth capable of broadcasting many programs wants to perform a broadcast with emphasis on image quality.

【０００７】ところで、１つのコンテンツのあるシーン
において、異なるアングルからの映像を複数用意してお
き、再生時にどれを再生するか選択できるマルチアング
ルという機能がある。マルチアングルを放送で実現する
場合、例えば特開平８−３２８８７のように、受信機が
本放送と異なる映像シーンを１つ選択できるようにする
と、使用する帯域が２倍になってしまうという欠点があ
った。統計多重方式を利用すると、確かにこの場合では
使用する帯域を２倍よりは少なくできる。しかし、長時
間の映像のうち一部のシーンだけが異なるようなマルチ
アングル映像を想定する場合、冗長なデータを長時間送
信することになるため、帯域の利用効率が良いとは言え
なかった。以上のように、放送でマルチアングル映像を
流す方法は、プログラム数や利用する帯域を増やすしか
なかった。By the way, there is a function called a multi-angle in which a plurality of videos from different angles are prepared in a certain scene of one content, and which one can be selected at the time of reproduction. In the case of realizing multi-angle by broadcasting, if the receiver can select one video scene different from the main broadcasting as in, for example, Japanese Patent Application Laid-Open No. 8-32887, there is a disadvantage that the band used is doubled. there were. If the statistical multiplexing method is used, in this case, the band to be used can be less than twice. However, in the case of assuming a multi-angle video in which only some scenes are different from a long video, redundant data is transmitted for a long time, so that it cannot be said that band use efficiency is high. As described above, the only way to broadcast multi-angle video in broadcasting is to increase the number of programs and the band to be used.

【０００８】[0008]

【発明が解決しようとする課題】しかしながら、ある特
定のシーンのみを異なるアングルや内容で視聴したい場
合、または一部のシーンに視聴制限であるレイティング
（Rating）を施したい場合など、一部が異なるコンテン
ツを放送したいが、利用する帯域を増加させたくない場
合がある。このような帯域を増加させないマルチアング
ル放送の実現の可能性のある従来方法として、特表２０
００−５００６３２がある。この方式では、ストリーム
上の充填パケットの代わりに補助データを挿入すること
ができる。しかし、補助データの多重を主眼においてい
るため、映像データを付加した場合、それが再生される
べき時刻に間に合うように伝送するための機構は考慮さ
れていなかった。However, there are some differences, such as when it is desired to view only a certain specific scene at different angles and contents, or when it is desired to perform rating (Rating) as a viewing restriction on some scenes. There are cases where a user wants to broadcast content but does not want to increase the band used. As a conventional method capable of realizing multi-angle broadcasting without increasing the band, Japanese Patent Application Laid-open No.
00-500632. In this scheme, auxiliary data can be inserted instead of the filling packet on the stream. However, since the main purpose is multiplexing of auxiliary data, when video data is added, a mechanism for transmitting the video data in time to reproduce it has not been considered.

【０００９】本発明は上記従来の問題点を解決するもの
で、デジタルＴＶ放送において、予め制作されたマルチ
アングル映像を含むコンテンツを放送する場合に、１チ
ャンネル分の帯域を利用してマルチアングル放送を提供
することを目的とする。[0009] The present invention solves the above-mentioned conventional problems. In digital TV broadcasting, when broadcasting content including multi-angle video produced in advance, multi-angle broadcasting is performed using the bandwidth of one channel. The purpose is to provide.

【００１０】[0010]

【課題を解決するための手段】上記目的を達成するため
に、本発明は、少なくともマルチアングル映像の開始時
間および終了時間を含むマルチアングル情報に基づい
て、主映像とマルチアングル映像の符号化時のビット割
当てを行うビットアロケータと、マルチアングル情報お
よびビットアロケータの出力したビット割当て情報に基
づいて、主映像をＶＢＲ（Variable Bit Rate）で符号
化する第一のエンコーダと、マルチアングル映像を符号
化する第二のエンコーダと、符号化されたストリームを
１つの放送用のトランスポートストリームに多重化する
マルチプレクサを備えたものである。To achieve the above object, the present invention provides a method for encoding a main image and a multi-angle image based on multi-angle information including at least a start time and an end time of the multi-angle image. A first coder that encodes a main video at a variable bit rate (VBR) based on multi-angle information and bit allocation information output by the bit allocator, and a multi-angle video And a multiplexer for multiplexing the encoded stream into one broadcast transport stream.

【００１１】この構成によって、本発明のマルチアング
ル映像符号化多重方法および装置は、マルチアングル映
像の開始時間および終了時間を含む時間情報とビット割
当て情報に基づいて、マルチアングル映像を前もって送
信できるだけの帯域を、マルチアングル映像が開始され
る以前の時間帯に確保し、確保した帯域分のレートを除
いた平均ビットレートで主映像を符号化し、別途符号化
したマルチアングル映像と共に、トランスポートストリ
ーム上に多重化することにより、帯域を増加させずにマ
ルチアングル映像を送信することができる。With this configuration, the multi-angle video coding and multiplexing method and apparatus of the present invention can transmit a multi-angle video in advance based on time information including a start time and an end time of the multi-angle video and bit allocation information. The band is reserved in the time zone before the start of the multi-angle video, the main video is encoded at the average bit rate excluding the secured bandwidth rate, and the separately encoded multi-angle video is encoded on the transport stream. , It is possible to transmit a multi-angle video without increasing the bandwidth.

【００１２】[0012]

【発明の実施の形態】本発明の第１の発明は、主映像と
前記主映像に付随した副映像を所定のビットレートの符
号化ストリームとして符号化する映像符号化装置であっ
て、少なくとも前記副映像の開始時間及び終了時間を含
む時間情報に基づいて、前記符号化ストリームにおける
前記主映像および前記副映像のビット割当てを示すビッ
ト割り当て情報を出力するビットアロケータと、前記主
映像を符号化したストリームと前記副映像を符号化した
ストリームを前記ビット割当て情報に基づいて多重化
し、前記符号化ストリームを生成するマルチプレクサと
を有することを特徴とする映像符号化装置であり、副映
像を前もって送信できるだけの帯域を、副映像が開始さ
れる以前の時間帯に確保し、確保した帯域分のレートを
除いた平均ビットレートで主映像を符号化し、別途符号
化した副映像と共に、トランスポートストリーム上に多
重化することにより、帯域を増加させずにマルチアング
ル映像を送信することができる。また、映像のコンテン
ツは予め準備しておく必要はあるが、主映像と副映像を
少ない遅延でほぼリアルタイムに符号化して送信するこ
とが可能である。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS A first invention of the present invention is a video coding apparatus for coding a main video and a sub-video accompanying the main video as a coded stream of a predetermined bit rate. A bit allocator that outputs bit allocation information indicating bit allocation of the main video and the sub video in the coded stream based on time information including a start time and an end time of a sub video, and the main video is coded. A video encoding apparatus, comprising: a stream that multiplexes a stream and a stream obtained by encoding the sub-image based on the bit allocation information, and a multiplexer that generates the encoded stream. Bandwidth before the start of the sub-picture, and the average bit rate excluding the rate of the secured bandwidth The main video coded by preparative, with sub-picture which is separately encoded, can be transmitted by multiplexing on a transport stream, the multi-angle video without increasing the bandwidth. Although it is necessary to prepare video contents in advance, it is possible to encode and transmit the main video and the sub-video almost in real time with a small delay.

【００１３】本発明の第２の発明は、第１の発明におい
て、時間情報及びビットアロケータが出力するビット割
当て情報に基づいて、主映像及び副映像を符号化するエ
ンコーダと、符号化されたストリームを一時蓄積してお
くための記録媒体とを有することを特徴とする映像符号
化装置であり、主映像と副映像を、エンコーダを用いて
直列に符号化して蓄積しておき、後に多重化して送信す
ることができる。この方式の利点は、エンコーダの個数
が１個で済む点と、符号化された映像の画質を送信前に
確認して、画質が充分でなければ符号化をやり直すとい
った処理を行えることである。According to a second aspect of the present invention, in the first aspect, an encoder for encoding a main video and a sub-video based on time information and bit allocation information output by a bit allocator, and an encoded stream Is a video encoding device characterized by having a recording medium for temporarily storing a main video and a sub video, which are encoded and stored serially using an encoder, and multiplexed later. Can be sent. The advantage of this method is that only one encoder is required, and that the quality of the coded video is checked before transmission, and if the quality is not sufficient, processing such as re-encoding can be performed.

【００１４】本発明の第３の発明は、第１の発明におい
て、ビットアロケータは、少なくとも、映像を符号化し
た際に発生する符号量を含む符号化難易度情報に基づい
て、前記主映像及び副映像の符号化ストリームにおける
ビット割当てを行い、符号化難易度情報を得るために主
映像及び副映像を符号化する１パスエンコーダと、時間
情報及びビットアロケータの出力するビット割当て情報
に基づいて、前記主映像及び前記副映像を符号化する２
パスエンコーダと、前記１パスエンコーダ及び前記２パ
スエンコーダにより符号化されたストリームを一時蓄積
しておくための記録媒体とを有することを特徴とする映
像符号化装置であり、主映像とマルチアングル映像を２
パスエンコードし、多重化して送信することができる。
この方式の利点は、１パス符号化時に映像の符号化難易
度情報を抽出し、抽出した符号化難易度情報に基づいて
主映像およびマルチアングル映像の２パス符号化時のビ
ット割当てを行う２パスビットアロケータを有すること
により、主映像およびマルチアングル映像を含めた全て
の映像シーンの画質を均質にでき、さらに視覚的性質に
応じたビット割当て等を可能にする点である。[0014] In a third aspect of the present invention based on the first aspect, the bit allocator is configured to determine whether the main video and the main video are based on at least coding difficulty information including a coding amount generated when the video is coded. Based on the bit allocation in the coded stream of the sub-picture, the 1-pass encoder that encodes the main picture and the sub-picture in order to obtain the encoding difficulty information, and based on the bit allocation information output by the time information and the bit allocator, Encoding the main image and the sub-image 2
A video encoding apparatus comprising: a pass encoder; and a recording medium for temporarily storing a stream encoded by the one-pass encoder and the two-pass encoder. 2
It can be path encoded, multiplexed and transmitted.
The advantage of this method is that the video encoding difficulty information is extracted at the time of one-pass encoding, and the bits are assigned at the time of two-pass encoding of the main video and the multi-angle video based on the extracted encoding difficulty information. By having the pass bit allocator, the image quality of all video scenes including the main video and the multi-angle video can be made uniform, and bit allocation and the like according to the visual characteristics can be performed.

【００１５】本発明の第４の発明は、第１から第３映像
符号化装置で生成したストリームを復号することを目的
とした復号化装置であって、副映像を符号化したストリ
ーム、または、前記副映像及び前記副映像に付随した音
声を符号化したストリームを一時蓄積するための記録媒
体と、副映像を選択するか否かの設定を外部より行う外
部入力手段と、符号化ストリームから前記副映像の開始
時間及び終了時間の時間情報を読取って、外部入力手段
により選択された映像が副映像である場合には、前記時
間情報に応じた出力映像及び出力音声への切替え信号を
生成する副映像設定器とを有することを特徴とする復号
化装置であり、復号化の際には、マルチアングル映像お
よび音声を記録するためのディスクと、ストリームから
読み取ったマルチアングル映像の開始時間および終了時
間の情報に応じて出力映像および音声の切替え信号を生
成するマルチアングル設定器とを用いることで、受信側
で映像を選択できる復号化装置を構成することが可能と
なる。[0015] A fourth invention of the present invention is a decoding device for decoding a stream generated by the first to third video coding devices, wherein a stream in which a sub-video is coded, or A recording medium for temporarily storing a stream obtained by encoding the sub-picture and audio accompanying the sub-picture, external input means for externally setting whether or not to select the sub-picture; and The time information of the start time and the end time of the sub-video is read, and when the video selected by the external input means is the sub-video, a switching signal to the output video and the output audio according to the time information is generated. A decoding device, comprising: a sub-picture setting device; a disc for recording multi-angle video and audio; By using a multi-angle setting device that generates an output video and audio switching signal according to information on the start time and end time of the single video, it is possible to configure a decoding device that can select a video on the receiving side. Become.

【００１６】本発明における映像と音声を多重して生成
するシステムストリームは、ＭＰＥＧ２のシステム規格
（ＩＳＯ／ＩＥＣ１３８１８−１）にて規定されている
トランスポートストリーム（ＴＳ：Transport Stream）
を用いて説明する。A system stream generated by multiplexing video and audio according to the present invention is a transport stream (TS) defined by the MPEG2 system standard (ISO / IEC13818-1).
This will be described with reference to FIG.

【００１７】以下、本発明の実施の形態について、図面
を参照しながら説明する。Hereinafter, embodiments of the present invention will be described with reference to the drawings.

【００１８】（実施の形態１）本実施の形態では、予め
準備されているマルチアングル映像を含む映像コンテン
ツを符号化するに際し、主映像とマルチアングル映像を
リアルタイムで並列に符号化し、少ない遅延でトランス
ポートストリームに多重化し、１本の放送用トランスポ
ートストリームを出力する映像符号化装置を説明する。(Embodiment 1) In the present embodiment, when encoding video content including a multi-angle video prepared in advance, the main video and the multi-angle video are encoded in parallel in real time, and with a small delay. A video encoding device that multiplexes a transport stream and outputs one broadcast transport stream will be described.

【００１９】図１に、本実施の形態における映像符号化
装置のブロック図を示す。FIG. 1 shows a block diagram of a video encoding apparatus according to the present embodiment.

【００２０】図１において、６は第一のエンコーダ２０
１、第二のエンコーダ２０２、およびマルチプレクサ５
１を外部より制御するための外部コントローラ４１、４
１はマルチアングル情報１２に基づいてビット割当てを
行い、ビット割当て情報４３を出力するビットアロケー
タ、５１は第一のエンコーダ２０１および第二のエンコ
ーダ２０２の出力したトランスポートストリームを多重
化して出力ストリーム８を出力するマルチプレクサ、１
０１は主映像、１０２は主映像に付随した音声、１１１
はマルチアングル映像、１１２はマルチアングル映像に
付随した音声、２０１はビットアロケータ４１の出力す
るビット割当て情報４３およびマルチアングル情報１２
に基づいて主映像１０１および音声１０２を符号化する
第一のエンコーダ、２０２はビットアロケータ４１の出
力するビット割当て情報４３およびマルチアングル情報
１２に基づいてマルチアングル映像１１１および音声１
１２を符号化する第二のエンコーダである。ここで、以
上のように構成された映像符号化装置について、以下そ
の動作について説明する。In FIG. 1, reference numeral 6 denotes a first encoder 20.
1, second encoder 202, and multiplexer 5
External controllers 41 and 4 for externally controlling 1
Reference numeral 1 denotes a bit allocator that performs bit allocation based on the multi-angle information 12 and outputs bit allocation information 43. Reference numeral 51 denotes a multiplexed transport stream output from the first encoder 201 and the second encoder 202 and an output stream 8 Multiplexer that outputs
01 is the main video, 102 is the audio accompanying the main video, 111
Is a multi-angle video, 112 is audio accompanying the multi-angle video, 201 is bit allocation information 43 and multi-angle information 12 output from the bit allocator 41.
The first encoder 202 encodes the main video 101 and the audio 102 based on the multi-angle video 111 and the audio 1 based on the bit allocation information 43 and the multi-angle information 12 output from the bit allocator 41.
12 is a second encoder that encodes the H.12. Here, the operation of the video encoding device configured as described above will be described below.

【００２１】主映像１０１および音声１０２は、ＶＴＲ
等の記録媒体に記録されている。マルチアングル映像１
１１および音声１１２は、主映像と異なるＶＴＲといっ
た主映像と同時に再生できる記録媒体に記録されてい
る。The main video 101 and audio 102 are
And the like. Multi-angle video 1
11 and the audio 112 are recorded on a recording medium that can be reproduced simultaneously with the main video such as a VTR different from the main video.

【００２２】マルチアングル情報１２を、図２に示す例
を用いて説明する。図２において、主映像は時刻０から
時刻Ｔの間、マルチアングル映像は時刻ｔ１からｔ２の
間と、時刻ｔ３からｔ４の間のみ存在する。この場合の
最低限のマルチアングル情報は、（表１）のように、主
映像およびマルチアングル映像の開始時間および終了時
間を記述したテーブルとして与えることができる。The multi-angle information 12 will be described with reference to an example shown in FIG. In FIG. 2, the main video exists only from time 0 to time T, and the multi-angle video exists only from time t1 to t2 and only from time t3 to t4. The minimum multi-angle information in this case can be provided as a table describing the start time and end time of the main video and the multi-angle video as shown in (Table 1).

【００２３】[0023]

【表１】 [Table 1]

【００２４】（表１）において、Ｐｒｉｍａｒｙは主映
像、Ｍｕｌｔｉ−ａｎｇｌｅはマルチアングル映像を表
す。なお、マルチアングル情報は（表１）の形式に制限
されるものではなく、ビットレート情報など映像に関す
る様々な情報を記述することが可能である。ここで、Ｃ
ＢＲ符号化を用いた場合のビット割当ての例を図３に示
す。図３において、ＲＭＡＸは、最終的な出力ストリー
ム８において映像のみで占有することを許されたビット
レートを示す。図３のように符号化を行って単純に多重
化した場合、帯域は２倍の２×ＲＭＡＸを用意しなけれ
ばならないことが確認できる。In Table 1, "Primary" indicates a main image, and "Multi-angle" indicates a multi-angle image. Note that the multi-angle information is not limited to the format shown in (Table 1), but can describe various information related to video such as bit rate information. Where C
FIG. 3 shows an example of bit allocation when BR coding is used. In FIG. 3, RMAX indicates the bit rate allowed to be occupied by video only in the final output stream 8. As shown in FIG. 3, when coding is performed and simple multiplexing is performed, it can be confirmed that the bandwidth must be twice as large as 2 × RMAX.

【００２５】以下、マルチアングル情報１２を用いて、
ビット割当てを行うビットアロケータ４１の動作につい
て説明する。Hereinafter, using the multi-angle information 12,
The operation of the bit allocator 41 for allocating bits will be described.

【００２６】図４にビットアロケータ４１を用いたビッ
ト割当ての一例を示す。まず区間［ｔ１，ｔ２］のマル
チアングル映像に割り当てる総ビット量Ｂ１２を決定す
る。次に、区間［０，ｔ１］の帯域において送信可能な
総ビット量ＲＭＡＸ×ｔ１からＢ１２を減じ、残ったビ
ット数を主映像に割り当てる。同様にして、区間［ｔ
３，ｔ４］のマルチアングル映像に割り当てる総ビット
量Ｂ３４を決定し、区間［ｔ１，ｔ３］の帯域において
送信可能な総ビット量ＲＭＡＸ×（ｔ３−ｔ１）からＢ
３４を減じ、残ったビット数を主映像に割り当てて、図
４の網掛け部で示すような、ビット割当て情報の結果を
得る。このようにして得られたビット割当てに基づいて
符号化されたストリームを多重化すると、図５に示すよ
うなデータ配置となる。図５から、出力ストリーム８の
条件として求められる映像のビットレート、ＲＭＡＸを
満たしていることが確認できる。FIG. 4 shows an example of bit allocation using the bit allocator 41. First, the total bit amount B12 allocated to the multi-angle video in the section [t1, t2] is determined. Next, B12 is subtracted from the total bit amount RMAX × t1 that can be transmitted in the band of the section [0, t1], and the number of remaining bits is allocated to the main video. Similarly, the interval [t
[T1, t3], and the total bit amount RMAX × (t3-t1) that can be transmitted in the band of the section [t1, t3] is calculated as B
By subtracting 34 and allocating the remaining number of bits to the main video, the result of the bit allocation information is obtained as shown by the shaded portion in FIG. When a stream coded based on the bit allocation obtained in this way is multiplexed, the data arrangement is as shown in FIG. From FIG. 5, it can be confirmed that the video bit rate and RMAX required as the conditions of the output stream 8 are satisfied.

【００２７】図４および図５におけるビット割当て情報
４３の例を、（表２）に示す。An example of the bit allocation information 43 in FIGS. 4 and 5 is shown in (Table 2).

【００２８】[0028]

【表２】 [Table 2]

【００２９】（表２）において、Ｐｒｉｍａｒｙは主映
像、Ｐｌａｙは再生時のマルチアングル映像、Ｔｒａｎ
ｓｐｏｒｔはストリームに多重化時のマルチアングル映
像を示す。ここでは、ビットレートＲＭＡＸ＝９Ｍｂｐ
ｓと仮定している。（表２）によると、時刻０に主映像
が６Ｍｂｐｓで、マルチアングル映像のシーン１が３Ｍ
ｂｐｓでそれぞれ符号化され、多重化される。次に、時
刻ｔ１に主映像が７．２Ｍｂｐｓで、マルチアングル映
像のシーン２が１．８Ｍｂｐｓで符号化され、多重化さ
れる。また、主映像およびマルチアングル映像のビット
レートの合計が上限の９Ｍｂｐｓであることも確認でき
る。一方、再生側では、時刻ｔ１にマルチアングル映像
のシーン１が６．６Ｍｂｐｓで再生される。以下同様に
ビット割当て情報を解釈することができる。なお、（表
２）はビット割当て情報の一例であり、マルチアングル
映像の開始・終了時刻および符号化ビットレート情報を
記述できるものであれば、記述形式はこれに限定される
ものではない。また、ビット割当て時の時刻として符号
化順の時間でなくディスプレイなどの表示順の時間を用
いたが、これは説明を簡単にするためのものであり、発
明の実施に本質的な影響は与えないことに注意された
い。In Table 2, Primary is the main video, Play is the multi-angle video at the time of reproduction, Tran is
“sport” indicates a multi-angle image when multiplexing the stream. Here, the bit rate RMAX = 9 Mbp
s. According to (Table 2), at time 0, the main video is 6 Mbps, and the scene 1 of the multi-angle video is 3M.
Each is encoded and multiplexed at bps. Next, at time t1, the main video is encoded at 7.2 Mbps and the scene 2 of the multi-angle video is encoded at 1.8 Mbps and multiplexed. Also, it can be confirmed that the total bit rate of the main video and the multi-angle video is the upper limit of 9 Mbps. On the playback side, on the other hand, at time t1, scene 1 of the multi-angle video is played at 6.6 Mbps. Hereinafter, the bit allocation information can be similarly interpreted. Table 2 is an example of bit allocation information, and the description format is not limited to this as long as it can describe the start / end time of multi-angle video and coding bit rate information. Also, the time in the display order of the display etc. was used as the time at the time of bit allocation instead of the time in the encoding order, but this is for the sake of simplicity of explanation and has an essential effect on the implementation of the invention. Note that there is no.

【００３０】次に、全体構成を示す図１において、ビッ
トアロケータ４１より得られたビット割当て情報４３
と、マルチアングル情報１２とを用いて、第一のエンコ
ーダ２０１で主映像および音声の符号化を、第二のエン
コーダ２０２でマルチアングル映像および音声の符号化
を行う。エンコーダ２０１および２０２の動作の説明
は、図６に示したエンコーダ内部のブロック図にて行
う。Next, in FIG. 1 showing the overall configuration, bit allocation information 43 obtained from a bit allocator 41 is shown.
And the multi-angle information 12, the first encoder 201 encodes main video and audio, and the second encoder 202 encodes multi-angle video and audio. The operation of the encoders 201 and 202 will be described with reference to the block diagram inside the encoder shown in FIG.

【００３１】図６において、２０はエンコーダ、２３は
符号化情報設定器２５より出力されるオーディオ符号化
情報２５２を受け取って入力映像に付随した入力音声２
１２を符号化してオーディオストリームを出力するオー
ディオエンコーダ、２４は符号化情報設定器２５より出
力されるシステム符号化情報２５３よりビデオストリー
ム２２１およびオーディオストリーム２２２を多重化し
てトランスポートストリーム２９を生成するシステムエ
ンコーダ、２５はビット割当て情報２１４とマルチアン
グル情報１２、外部設定手段２１６より入力される符号
化情報より、ビデオ符号化情報４３、オーディオ符号化
情報２５２、システム符号化情報２５３を生成する符号
化情報設定器、２６は外部からの制御信号を処理して符
号化情報設定器２５１、ビデオエンコーダ３０、オーデ
ィオエンコーダ２３、システムエンコーダ２４の同期運
転を可能符号化コントローラ、３０はビデオ符号化情報
２５１と符号化コントローラ２６より入力される制御信
号より入力映像２１２を符号化してビデオストリーム２
２１を出力するビデオエンコーダ３０、２１６は符号化
情報を外部で設定する外部設定手段である。ビデオエン
コーダ３０、オーディオエンコーダ２３、およびシステ
ムエンコーダ２４の符号化コントローラ、２５１はビデ
オ符号化情報を示す。ここで、図６において、図１と同
じ情報に対しては同じ符号を割当てている。In FIG. 6, reference numeral 20 denotes an encoder, and 23 receives an audio coded information 252 output from a coded information setting unit 25 and receives an input audio 2 accompanying an input video.
An audio encoder that encodes 12 and outputs an audio stream. A system 24 generates a transport stream 29 by multiplexing the video stream 221 and the audio stream 222 from system encoding information 253 output from an encoding information setting unit 25. The encoder 25 generates video coding information 43, audio coding information 252, and coding information for generating system coding information 253 from the bit allocation information 214, the multi-angle information 12, and the coding information input from the external setting means 216. The setting unit 26 processes an external control signal to enable the synchronous operation of the coding information setting unit 251, the video encoder 30, the audio encoder 23, and the system encoder 24. The coding controller 30 includes a video coding information 251 and a code. Chemical Controller 26 an input image 212 from the control signal inputted by coding from a video stream 2
The video encoders 30 and 216 which output 21 are external setting means for setting encoding information externally. An encoding controller 251 of the video encoder 30, the audio encoder 23, and the system encoder 24 indicates video encoding information. Here, in FIG. 6, the same reference numerals are assigned to the same information as in FIG.

【００３２】まず、第一のエンコーダ２０１に対して
は、入力映像２１１が図１の主映像１０１、入力音声２
１２が図１の主映像に付随した音声１０２に相当する。
マルチアングル情報１２として主映像２１１の開始時間
および終了時間を、ビット割当て情報４３としてビット
アロケータ４１で算出したビット割当て情報４３を与え
る。符号化情報設定器２５において、ビット割当て情報
４３およびマルチアングル情報１２よりビデオ符号化情
報２５１、オーディオ符号化情報２５２およびシステム
符号化情報２５３が生成される。ビデオ符号化情報２５
１としては、例えばＧＯＰ単位の目標符号量などのＶＢ
Ｒ符号化用の情報を与える。オーディオ符号化情報２５
２としては、オーディオ用に確保されたビットレートを
与える。システム符号化情報２５３としては、マルチア
ングル情報１２およびビット割当て情報４３を適切な形
式で与える。また、外部設定手段２１６で設定した情報
を各符号化情報に与えることも可能である。例えば、シ
ステム符号化情報２５３として、ＰＩＤ（Packet Ident
ifier）を与えることができる。First, for the first encoder 201, the input video 211 is composed of the main video 101 and the input audio 2 shown in FIG.
Reference numeral 12 corresponds to the audio 102 attached to the main video in FIG.
The start time and the end time of the main video 211 are given as the multi-angle information 12, and the bit allocation information 43 calculated by the bit allocator 41 is given as the bit allocation information 43. In the coding information setting unit 25, video coding information 251, audio coding information 252, and system coding information 253 are generated from the bit allocation information 43 and the multi-angle information 12. Video coding information 25
For example, VB such as a target code amount in GOP units
Gives information for R encoding. Audio encoding information 25
As 2, a bit rate reserved for audio is given. As the system encoding information 253, the multi-angle information 12 and the bit allocation information 43 are given in an appropriate format. Further, information set by the external setting means 216 can be given to each piece of encoded information. For example, as the system encoding information 253, a PID (Packet Ident
ifier).

【００３３】ビデオエンコーダ３０はビデオ符号化情報
２５１に基づいて、主映像２１１をリアルタイムでＶＢ
Ｒ符号化し、ビデオストリーム２２１を得る。The video encoder 30 converts the main image 211 into a VB based on the video encoding information 251 in real time.
R encoding is performed to obtain a video stream 221.

【００３４】図７にビデオエンコーダ３０の構成を示す
ブロック図を示す。図７に示すようにビデオエンコーダ
３０は従来の技術で説明したＭＰＥＧ２規格に基づいた
構成をとり、ビデオ符号化情報２５１が量子化制御回路
３８に入力されている。ここで、主映像を符号化する際
に重要な点は、符号化時のＶＢＶバッファ占有度であ
る。既に説明したように、ＶＢＶバッファは図７のバッ
ファ３７に相当する。FIG. 7 is a block diagram showing the configuration of the video encoder 30. As shown in FIG. 7, the video encoder 30 has a configuration based on the MPEG2 standard described in the related art, and video encoding information 251 is input to the quantization control circuit 38. Here, an important point when encoding the main video is the occupancy of the VBV buffer at the time of encoding. As described above, the VBV buffer corresponds to the buffer 37 in FIG.

【００３５】図８に符号化時のＶＢＶバッファ占有度を
示す。図８に示すように、主映像からマルチアングル映
像に切り替わる時およびマルチアングル映像から主映像
に切り替わる時（同図における時刻ｔ１、ｔ２、ｔ３、
ｔ４の時点）には、量子化制御回路３８において、デコ
ーダでのバッファ占有度を一致させるような符号化制御
を行う。すなわち、（数１）を成立させるように符号化
制御を行う。FIG. 8 shows the occupancy of the VBV buffer during encoding. As shown in FIG. 8, when switching from the main video to the multi-angle video and when switching from the multi-angle video to the main video (time t1, t2, t3,
At time t4), the quantization control circuit 38 performs encoding control to match the buffer occupancy in the decoder. That is, encoding control is performed so as to satisfy (Equation 1).

【００３６】[0036]

【数１】 (Equation 1)

【００３７】このように、バッファ占有度の制限を与え
ることにより、デコーダにおいてマルチアングル映像が
選択された場合でも、符号化時に行われるＶＢＶバッフ
ァシミュレーションでデコーダのバッファ占有度を保証
することができる。このバッファ占有度の保証を実現す
る手段はいくつか存在するが、最も簡単な方法は、予め
バッファ占有度の値を、バッファ位置を一致させる必要
がある全ての時間で固定値として与えておくことであ
る。その他にも、主映像またはマルチアングル映像のう
ち先に符号化が終了した方のバッファ占有度に他方が従
う方法もあるが、この方法は主映像とマルチアングル映
像を同時に符号化する本実施の形態に適用するには、複
雑である。As described above, by limiting the buffer occupancy, even when a multi-angle video is selected in the decoder, the buffer occupancy of the decoder can be guaranteed by the VBV buffer simulation performed at the time of encoding. There are several ways to guarantee this buffer occupancy, but the simplest method is to provide the buffer occupancy value in advance as a fixed value at all times when the buffer position needs to be matched. It is. In addition, there is a method in which the other follows the buffer occupancy of the main video or the multi-angle video that has been encoded first, but this method is used in the present embodiment for simultaneously encoding the main video and the multi-angle video. It is complicated to apply to the form.

【００３８】以上のようにしてビデオエンコーダ３０
は、バッファ占有度を保証した符号化を実行する。同時
にオーディオエンコーダ２３は、オーディオ符号化情報
２５２に基づいて、主映像に付随した音声２１１をリア
ルタイムで符号化し、オーディオストリーム２２１を得
る。As described above, the video encoder 30
Performs encoding that guarantees buffer occupancy. At the same time, the audio encoder 23 encodes the audio 211 attached to the main video in real time based on the audio encoding information 252, and obtains an audio stream 221.

【００３９】システムエンコーダ２４は、第一のエンコ
ーダでは図９に示すようなシステムエンコーダ２４１で
あるとする。システムエンコーダ２４１の動作を、図９
を用いて説明する。The system encoder 24 is assumed to be a system encoder 241 as shown in FIG. 9 in the first encoder. The operation of the system encoder 241 is shown in FIG.
This will be described with reference to FIG.

【００４０】図９において、２４３は入力ビデオストリ
ーム２２１からビデオパケットを生成するためのビデオ
パケタイザ、２４４は入力オーディオストリーム２２２
からオーディオパケットを生成するためのオーディオパ
ケタイザ、２４５はシステム符号化情報２５３に基づい
て仮パケットを発生するための仮パケット発生器、２４
６はビデオパケタイザ２４３より出力されるビデオパケ
ット、オーディオパケタイザより出力されるオーディオ
パケットおよび仮パケット発生器２４５より出力される
仮パケットを多重化してトランスポートストリーム２９
１を生成するシステムマルチプレクサ、２４７はビデオ
パケタイザ２４３、オーディオパケタイザ２４４、仮パ
ケット発生器２４５およびシステムマルチプレクサ２４
６の同期を取るためのクロック発生器、２４９は図６の
符号化コントローラ２６からの制御信号である。In FIG. 9, reference numeral 243 denotes a video packetizer for generating video packets from the input video stream 221;
An audio packetizer 245 for generating an audio packet from a temporary packet generator 245 for generating a temporary packet based on the system coding information 253,
6 multiplexes the video packet output from the video packetizer 243, the audio packet output from the audio packetizer, and the temporary packet output from the temporary packet generator 245, and multiplexes the transport stream 29.
1, a system multiplexer 247, a video packetizer 243, an audio packetizer 244, a temporary packet generator 245, and a system multiplexer 24.
A clock generator 249 for synchronizing 6 is a control signal from the encoding controller 26 in FIG.

【００４１】まず、ビデオパケタイザ２４３は、システ
ム符号化情報２５３からＰＩＤなどの情報を得て、ビデ
オパケットを生成する。同時にオーディオパケタイザ２
４４も、オーディオパケットを生成する。仮パケット発
生器２４５は、システム符号化情報２５３に含まれるビ
ット割当て情報とＰＩＤに基づいて仮パケットを生成す
る。ここで仮パケットとは、後にマルチアングルのスト
リームを多重化するために、マルチアングルのストリー
ムが挿入される位置に挿入するパケットである。システ
ムマルチプレクサ２４６は、生成されたビデオパケッ
ト、オーディオパケットおよび仮パケットを多重化し、
トランスポートストリーム２９１を生成する。以上の動
作はクロック発生器２４７が発生するクロックに同期し
て行われ、第一のエンコーダにおいて、主映像と音声、
仮パケットを多重化したトランスポートストリーム２９
１を得る。First, the video packetizer 243 obtains information such as a PID from the system coding information 253, and generates a video packet. Audio packetizer 2 at the same time
44 also generates audio packets. The temporary packet generator 245 generates a temporary packet based on the bit allocation information and the PID included in the system encoding information 253. Here, the temporary packet is a packet to be inserted at a position where a multi-angle stream is inserted in order to multiplex the multi-angle stream later. The system multiplexer 246 multiplexes the generated video packets, audio packets, and provisional packets,
Generate the transport stream 291. The above operation is performed in synchronization with the clock generated by the clock generator 247. In the first encoder, the main video, audio,
Transport stream 29 in which temporary packets are multiplexed
Get 1.

【００４２】一方、全体構造を示す図１において、第二
のエンコーダ２０２に対しては、入力映像２１１がマル
チアングル映像１１１、入力音声２１２がマルチアング
ル映像に付随した音声１１２に相当する。マルチアング
ル情報２１３としてマルチアングル映像の開始時間およ
び終了時間を、ビット割当て情報２１４としてビットア
ロケータ４１で導出したビット割当て情報を与える。エ
ンコーダ内部を示す図６において、第二のエンコーダ２
０２に対して、符号化情報設定器２５が生成するビデオ
符号化情報２５１、オーディオ符号化情報２５２および
システム符号化情報２５３は、第一のエンコーダ２０１
の場合と同様である。On the other hand, in FIG. 1 showing the overall structure, for the second encoder 202, the input video 211 corresponds to the multi-angle video 111 and the input audio 212 corresponds to the audio 112 attached to the multi-angle video. The start time and end time of the multi-angle video are given as the multi-angle information 213, and the bit allocation information derived by the bit allocator 41 is given as the bit allocation information 214. In FIG. 6 showing the inside of the encoder, the second encoder 2
02, the video encoding information 251, the audio encoding information 252, and the system encoding information 253 generated by the encoding information setting unit 25 correspond to the first encoder 201.
Is the same as

【００４３】ビデオエンコーダ３０は、ビデオ符号化情
報２５１に基づいて、マルチアングル映像２１１をＶＢ
Ｒ符号化し、ビデオストリーム２２１を得る。同時にオ
ーディオエンコーダ２３は、オーディオ符号化情報２５
２に基づいて、マルチアングル映像に付随した音声２１
１を符号化し、オーディオストリーム２２２を得る。シ
ステムエンコーダ２４は、第二のエンコーダでは図１０
に示すようなシステムエンコーダ２４２であるとする。
システムエンコーダ２４２の動作を、図１０を用いて説
明する。The video encoder 30 converts the multi-angle image 211 into a VB based on the video encoding information 251.
R encoding is performed to obtain a video stream 221. At the same time, the audio encoder 23 outputs the audio encoded information 25
2 based on the audio 21 attached to the multi-angle video
1 to obtain an audio stream 222. The system encoder 24 is the second encoder shown in FIG.
Is assumed to be the system encoder 242 shown in FIG.
The operation of the system encoder 242 will be described with reference to FIG.

【００４４】図１０において、２４９は符号化コントロ
ーラ２６からの制御信号、２５３は符号化情報設定器２
５からのシステム符号化情報、２４３は入力ビデオスト
リーム２２１からビデオパケットを生成するためのビデ
オパケタイザ、２４４は入力オーディオストリーム２２
２からオーディオパケットを生成するためのオーディオ
パケタイザ、２４６はビデオパケタイザ２４３より出力
されるビデオパケットおよびオーディオパケタイザ２４
４より出力されるオーディオパケットを多重化しトラン
スポートストリーム２９２を生成するシステムマルチプ
レクサ、２４７はビデオパケタイザ２４３、オーディオ
パケタイザ２４４およびシステムマルチプレクサ２４６
の同期を取るためのクロック発生器である。ここで、図
１０において、図６と同じ構成要素のものは同じ番号を
割りつけている。In FIG. 10, reference numeral 249 denotes a control signal from the encoding controller 26, and 253 denotes an encoding information setting unit 2.
5, 243 is a video packetizer for generating video packets from the input video stream 221, and 244 is an input audio stream 22.
An audio packetizer 246 for generating an audio packet from the video packetizer 2 and the video packetizer 246 output from the video packetizer 243;
4 is a system multiplexer for multiplexing the audio packets output from 4 and generating a transport stream 292, and 247 is a video packetizer 243, audio packetizer 244, and system multiplexer 246.
This is a clock generator for synchronizing. Here, in FIG. 10, the same components as those in FIG. 6 are assigned the same numbers.

【００４５】まず、ビデオパケタイザ２４３は、システ
ム符号化情報２５３からＰＩＤなどの情報を得て、ビデ
オパケットを生成する。同時にオーディオパケタイザ２
４４も、オーディオパケットを生成する。システムマル
チプレクサ２４６は、生成されたビデオパケットおよび
オーディオパケットを多重化し、トランスポートストリ
ーム２９１を生成する。ここで注意すべき点は、マルチ
アングル映像を符号化する際は、マルチアングルの開始
時刻と終了時刻に基づいて符号化を行わず、開始時間を
早くして符号化を行っても良い点である。First, the video packetizer 243 obtains information such as PID from the system encoding information 253, and generates a video packet. Audio packetizer 2 at the same time
44 also generates audio packets. The system multiplexer 246 multiplexes the generated video packets and audio packets, and generates a transport stream 291. Here, it should be noted that when encoding a multi-angle video, the encoding may be performed with a shorter start time without performing the encoding based on the start and end times of the multi-angle. is there.

【００４６】符号化時刻変更の例を図１１に示す。図１
１において、上段の図がマルチアングル映像が本来再生
されるべき時刻を示している。この再生時刻に符号化を
行うと、符号化データが多重化されるべき時刻は既に過
ぎているため、主映像のストリームをバッファに蓄積さ
せてマルチアングル映像のストリームが到着するまで待
機させる必要が生じる。そこで、図１１の下段の図のよ
うに、符号化の時刻を早め、空いている時間に行えば、
主映像の遅延を小さくすることができる。この場合は、
図１０において、システム符号化情報２５３を利用して
各パケットに記述する時刻情報を、マルチアングル映像
および音声が再生される時刻にシフトして書き込む必要
がある。この場合、時刻ｔ１から時刻ｔ２間に符号化さ
れるマルチアングル映像に対しては、時刻０から時刻ｔ
２−ｔ１、時刻ｔ３から時刻ｔ４に符号化されるマルチ
アングル映像に対しては、時刻Ｔａから時刻Ｔａ＋ｔ４
−ｔ３を記述する。このように符号化時刻を変更するこ
とで、最終的に主映像とマルチアングル映像を多重化す
る際に、バッファによる遅延が小さくなるため、システ
ム全体として遅延の少ない符号化を実現することが可能
となる。以上のようにして、第二のエンコーダの場合に
おいて、マルチアングル映像と音声を多重化したトラン
スポートストリーム２９を得る。FIG. 11 shows an example of changing the encoding time. FIG.
In FIG. 1, the upper diagram shows the time at which the multi-angle video should be originally reproduced. If encoding is performed at this playback time, since the time at which the encoded data is to be multiplexed has already passed, it is necessary to accumulate the main video stream in a buffer and wait until the multi-angle video stream arrives. Occurs. Therefore, as shown in the lower part of FIG. 11, if the encoding time is advanced and the coding is performed in a vacant time,
The delay of the main image can be reduced. in this case,
In FIG. 10, it is necessary to shift the time information described in each packet using the system encoding information 253 to the time at which the multi-angle video and audio are reproduced, and write the time information. In this case, for a multi-angle video encoded between time t1 and time t2, time 0 to time t
2-t1, for a multi-angle video encoded from time t3 to time t4, from time Ta to time Ta + t4
Describe -t3. By changing the encoding time in this way, when the main video and the multi-angle video are finally multiplexed, the delay due to the buffer is reduced, so that encoding with less delay can be realized as a whole system. Becomes As described above, in the case of the second encoder, a transport stream 29 in which multi-angle video and audio are multiplexed is obtained.

【００４７】全体構造を示す図１において、マルチプレ
クサ５１は、上記説明により得られた第一のエンコーダ
２０１の出力トランスポートストリームと、第二のエン
コーダ２０２の出力トランスポートストリームを多重化
する。In FIG. 1 showing the overall structure, a multiplexer 51 multiplexes the output transport stream of the first encoder 201 and the output transport stream of the second encoder 202 obtained as described above.

【００４８】以下、マルチプレクサ５１の動作を、図１
２を用いて説明する。The operation of the multiplexer 51 will now be described with reference to FIG.
2 will be described.

【００４９】図１２において、５１１は第一のエンコー
ダ２０１より出力される主映像および音声の主映像・音
声のトランスポートストリーム、５１２は第二のエンコ
ーダ２０２より出力されるマルチアングル映像および音
声のマルチアングル映像・音声のトランスポートストリ
ーム、５１３は主映像・音声のトランスポートストリー
ム５１１を蓄積する第一のバッファである主映像・音声
バッファ、５１４はマルチアングル映像・音声のトラン
スポートストリーム５１２を第二のバッファである蓄積
するマルチアングル映像・音声バッファ、５１５は主映
像・音声のトランスポートストリーム５１１を監視し、
マルチアングル映像・音声のトランスポートストリーム
５１２を充填すべき仮パケットのＰＩＤを検出するため
のデータ書き換え手段であるＰＩＤ判定器、８は多重化
された出力トランスポートストリームである。In FIG. 12, reference numeral 511 denotes a main video / audio transport stream of the main video and audio output from the first encoder 201, and 512 denotes a multi-angle video and audio multi-stream output from the second encoder 202. An angle video / audio transport stream 513 is a main video / audio buffer which is a first buffer for storing a main video / audio transport stream 511, and a multi-angle video / audio transport stream 512 is a second buffer 512. The multi-angle video / audio buffer 515 to be stored, which is a buffer of 515, monitors the transport stream 511 of the main video / audio,
A PID determiner 8 is a data rewriting means for detecting a PID of a provisional packet to be filled with the multi-angle video / audio transport stream 512, and 8 is a multiplexed output transport stream.

【００５０】リアルタイムで入力される主映像・音声の
トランスポートストリーム５１１は、主映像・音声バッ
ファ５１３に蓄積される。一方、マルチアングル映像・
音声のトランスポートストリーム５１２は、マルチアン
グル映像・音声バッファ５１４に蓄積される。ＰＩＤ判
定器５１５は、主映像・音声のトランスポートストリー
ム５１１において仮パケットのＰＩＤを検出し、仮パケ
ットであるならスイッチ５１６を通して、マルチアング
ル映像・音声バッファにより該当パケットの置換を行
う。以上により、出力トランスポートストリーム８を得
る。The main video / audio transport stream 511 input in real time is stored in the main video / audio buffer 513. On the other hand, multi-angle video
The audio transport stream 512 is stored in the multi-angle video / audio buffer 514. The PID determiner 515 detects the PID of the temporary packet in the main video / audio transport stream 511, and if the packet is a temporary packet, passes the switch 516 to replace the packet with a multi-angle video / audio buffer. As described above, the output transport stream 8 is obtained.

【００５１】ここで、パケット置換の概念を、図１３を
用いて説明する。Here, the concept of packet replacement will be described with reference to FIG.

【００５２】図１３において、Ｖは映像のパケット、Ａ
は音声のパケット、Ｔは仮パケットを示す。上段のパケ
ット列が主映像・音声のトランスポートストリームであ
り、下段のパケット列がマルチアングル映像・音声のト
ランスポートストリームである。ＰＩＤ判定器５１５は
主映像・音声のトランスポートストリーム上で仮パケッ
トＴを検出すると、スイッチ５１６を主映像・音声バッ
ファに接続し、仮パケットをマルチアングル映像・音声
のトランスポートストリームのパケットに置き換える。
このように操作することで、主映像・音声のトランスポ
ートストリームのパケット数は置換前と置換後で変化し
ない。既に述べたように、仮パケットはビット割当て情
報に基づいて挿入されているため、図１３のように順に
パケットを置換するだけで多重化を行うことが可能であ
る。In FIG. 13, V is a video packet, A
Indicates a voice packet, and T indicates a temporary packet. The upper packet sequence is a main video / audio transport stream, and the lower packet sequence is a multi-angle video / audio transport stream. When detecting the provisional packet T on the main video / audio transport stream, the PID determination unit 515 connects the switch 516 to the main video / audio buffer and replaces the provisional packet with a multi-angle video / audio transport stream packet. .
By performing such operations, the number of packets of the main video / audio transport stream does not change before and after the replacement. As described above, since the temporary packet is inserted based on the bit allocation information, multiplexing can be performed only by replacing the packets in order as shown in FIG.

【００５３】図１４に本実施の形態で生成されたトラン
スポートストリームを受信するデコーダを示す。FIG. 14 shows a decoder for receiving the transport stream generated in the present embodiment.

【００５４】図１４において、８は受信したトランスポ
ートストリーム、７１はディスク７２に読み書きを行う
ためのディスクコントローラ、７２はマルチアングル映
像および音声を記録しておくためのディスク、９０はデ
コーダ、９５はディスク７２に蓄積されたトランスポー
トストリームからマルチアングル情報を得てビデオとオ
ーディオの切替え情報を与える副映像設定器であるマル
チアングル設定器、９６はマルチアングル設定器９５よ
り出力されるビデオとオーディオの切替え情報に基づい
て主映像および音声と、マルチアングルおよび音声の切
替えを行う切替え器、５１５はトランスポートストリー
ムのＰＩＤを判定するＰＩＤ判定器、５１６はＰＩＤ判
定器５１５の判定結果により動作するスイッチ、９１１
はデコーダ９０より出力される映像、９１２はデコーダ
９０より出力される音声９１３はマルチアングルの設定
情報をマルチアングル設定器９５に与えるための外部入
力手段、９１３はマルチアングル設定器にマルチアング
ル設定情報を入力する外部入力手段、９２１はビデオス
トリームを格納するビデオバッファ、９２２はオーディ
オストリームを格納するオーディオバッファ、９３１お
よび９３２はトランスポートストリームをビデオおよび
オーディオストリームに分解するデマルチプレクサ、９
４１はビデオバッファ９２１に格納されているビデオス
トリームをデコードするビデオデコーダ、９４２はオー
ディオバッファ９２２に格納されているオーディオスト
リームをデコードするオーディオデコーダ、９２０１は
主映像および音声のトランスポートストリームを蓄積す
るためのバッファ、９２０２はマルチアングル映像およ
び音声のトランスポートストリームを蓄積するためのバ
ッファを示す。In FIG. 14, reference numeral 8 denotes a received transport stream; 71, a disk controller for reading and writing to a disk 72; 72, a disk for recording multi-angle video and audio; 90, a decoder; A multi-angle setting unit 96 is a sub-picture setting unit that obtains multi-angle information from the transport stream stored on the disc 72 and provides switching information between video and audio. Reference numeral 96 denotes a video and audio output from the multi-angle setting unit 95. A switch 515 for switching between main video and audio, multi-angle and audio based on the switching information, 515 a PID determiner for determining the PID of the transport stream, 516 a switch that operates based on the determination result of the PID determiner 515, 911
Is a video output from the decoder 90, 912 is an audio output from the decoder 90 913 is an external input means for providing multi-angle setting information to the multi-angle setting device 95, 913 is a multi-angle setting information to the multi-angle setting device 921 is a video buffer for storing a video stream, 922 is an audio buffer for storing an audio stream, 931 and 932 are demultiplexers for decomposing a transport stream into video and audio streams, 9
41 is a video decoder for decoding a video stream stored in the video buffer 921, 942 is an audio decoder for decoding an audio stream stored in the audio buffer 922, and 9201 is for storing transport streams of main video and audio. Reference numeral 9202 denotes a buffer for storing multi-angle video and audio transport streams.

【００５５】デコーダ９０が受信したトランスポートス
トリーム８において、ＰＩＤ判定器５１５が、プログラ
ムマップテーブル等から知ることができるＰＩＤをマル
チアングルのものかどうか判定する。判定結果に基づき
スイッチ５１６を動作させ、主映像および音声のトラン
スポートストリームはバッファ９２０１に、マルチアン
グル映像および音声のトランスポートストリームはバッ
ファ９２０２にそれぞれ転送する。バッファ９２０２で
は、転送されてきたストリームをディスクコントローラ
７１を通してディスク７２に蓄積する。主映像および音
声のトランスポートストリームは、デマルチプレクサ９
３１によりビデオとオーディオのストリームに分解さ
れ、それぞれビデオバッファ９２１とオーディオバッフ
ァ９２２に転送される。ビデオデコーダ９４１、オーデ
ィオデコーダ９４２はデマルチプレクサ９３１で得たタ
イムスタンプ情報に基づきそれぞれビデオストリーム、
オーディオストリームのデコードを行う。ディスク７２
に格納されたマルチアングル映像および音声のトランス
ポートストリームはディスクコントローラ７１の制御に
よりデマルチプレクサ９３２に出力され、デマルチプレ
クサ９３２に映像と音声のストリームに分解され、それ
ぞれビデオバッファ９２１、オーディオバッファ９２２
に出力され、ビデオデコーダ９４１、オーディオデコー
ダ９４２でデコードされる。In the transport stream 8 received by the decoder 90, the PID determiner 515 determines whether the PID that can be known from the program map table or the like is a multi-angle PID. The switch 516 is operated based on the determination result, and the main video and audio transport streams are transferred to the buffer 9201, and the multi-angle video and audio transport streams are transferred to the buffer 9202. In the buffer 9202, the transferred stream is accumulated on the disk 72 through the disk controller 71. The main video and audio transport streams are transmitted to the demultiplexer 9.
The video stream is decomposed into video and audio streams by 31 and transferred to a video buffer 921 and an audio buffer 922, respectively. The video decoder 941 and the audio decoder 942 respectively determine a video stream and a video stream based on the time stamp information obtained by the demultiplexer 931.
Decode the audio stream. Disk 72
Are transmitted to the demultiplexer 932 under the control of the disk controller 71, and are decomposed into video and audio streams by the demultiplexer 932, and the video buffer 921 and the audio buffer 922, respectively.
And decoded by the video decoder 941 and the audio decoder 942.

【００５６】一方、外部入力手段９１３からマルチアン
グル設定情報を入力されたマルチアングル設定器９５
は、ディスクコントローラ７１を介して、ディスク７２
に蓄積されたトランスポートストリームのタイムスタン
プよりマルチアングル映像および音声の開始時刻および
終了時刻を確認する。外部入力手段９１３より主映像お
よび音声に設定されている場合は、常にスイッチ９６を
主映像および音声側にし、マルチアングル映像および音
声に設定されている場合は、ディスクより得た開始時刻
にスイッチ９６をマルチアングル映像および音声側に
し、終了時刻に主映像および音声側に戻す。On the other hand, the multi-angle setting device 95 to which the multi-angle setting information is inputted from the external input means 913
Is connected to the disk 72 via the disk controller 71.
The start time and the end time of multi-angle video and audio are confirmed from the time stamp of the transport stream stored in the. When the main input is set to the main video and audio from the external input means 913, the switch 96 is always set to the main video and audio side. When the multi input is set to the multi-angle video and audio, the switch 96 is set to the start time obtained from the disc. To the multi-angle video and audio side, and return to the main video and audio side at the end time.

【００５７】以上のようにして、デコーダにて再生出力
を切替えることが可能である。本実施の形態では、スト
リームを蓄積するのにディスクおよびディスクコントロ
ーラを用いたが、ストリームを記憶するという目的に合
致するものであれば、これに限定するものではない。As described above, the reproduction output can be switched by the decoder. In the present embodiment, a disk and a disk controller are used to store a stream. However, the present invention is not limited to this, as long as it meets the purpose of storing a stream.

【００５８】本実施の形態ではマルチアングル映像が１
つの場合を説明したが、マルチアングル映像を並列に２
つ以上符号化する場合も同様の方法を用いて実現できる
ため、詳細は記述しない。In this embodiment, the multi-angle image is 1
The two cases have been described.
Since the same method can be used to encode one or more codes, the details will not be described.

【００５９】また、本実施の形態では、主映像とマルチ
アングル映像を符号化する際に異なるシステムエンコー
ダを用いて説明したが、これは説明を容易にするためで
ある。実際には２つのシステムエンコーダは仮パケット
発生器の有無の違いだけであるため、図９の仮パケット
発生器のあるシステムエンコーダを用いて、第二のエン
コーダ内のシステムエンコーダでは仮パケット発生器を
動作させずに使用すれば、第一のエンコーダと第二のエ
ンコーダは全く同じ構成のものを用いることができる。Further, in the present embodiment, the description has been made using different system encoders when encoding the main video and the multi-angle video, but this is for the purpose of simplifying the description. Actually, the two system encoders differ only in the presence / absence of the provisional packet generator. Therefore, using the system encoder having the provisional packet generator in FIG. 9, the provisional packet generator is used in the system encoder in the second encoder. If used without operation, the first encoder and the second encoder can have exactly the same configuration.

【００６０】また、本実施の形態では、ビット割当てに
関して、図２のマルチアングル映像を図５のように符号
化する例を示した。この例では、マルチアングル映像を
再生する時間帯と符号化する時間帯は、時間的に全く重
複していない。しかし、多重化されたデータをデコーダ
のディスクに蓄積してから再生するまでに生じる全体の
遅延時間を計算して、その遅延時間を最低限保証すれ
ば、マルチアングル映像の再生時間帯と符号化時間帯は
重複しても構わない。Also, in the present embodiment, an example has been shown in which the multi-angle video of FIG. 2 is encoded as shown in FIG. 5 with respect to bit allocation. In this example, the time zone for reproducing the multi-angle video and the time zone for encoding do not temporally overlap at all. However, if the total delay time that occurs between the time when the multiplexed data is stored on the decoder disk and the time when it is played back is calculated, and the minimum delay time is guaranteed, the playback time zone of the multi-angle video and the encoding time Time zones may overlap.

【００６１】（実施の形態２）本発明の実施の形態２で
は、予め準備されているマルチアングル映像を含む映像
コンテンツを符号化するに際し、１個のエンコーダを用
いて主映像とマルチアングル映像を直列に符号化してデ
ィスクに蓄積し、後にディスクから読み出して多重化し
た１本の放送用トランスポートストリームを出力する映
像符号化装置を説明する。(Embodiment 2) In Embodiment 2 of the present invention, when encoding video content including a multi-angle video prepared in advance, the main video and the multi-angle video are encoded using one encoder. A description will be given of a video encoding apparatus that serially encodes and accumulates the information on a disk, and subsequently outputs one multiplexed broadcast transport stream read from the disk and multiplexed.

【００６２】図１５に、本実施の形態における映像符号
化装置のブロック図を示す。FIG. 15 is a block diagram of a video encoding apparatus according to the present embodiment.

【００６３】図１５において、６はビットアロケータ４
１、エンコーダ２０、ディスクコントローラ７１、およ
びマルチプレクサ５２を外部より制御するための外部コ
ントローラ、８はマルチプレクサ５２より出力されるト
ランスポートストリーム、４１はマルチアングル情報１
２に基づいてビット割当てを行うビットアロケータ、２
０はビットアロケータ４１の出力するビット割当て情報
４３およびマルチアングル情報１２に基づいて映像１３
１および音声１３２を符号化するエンコーダ、５２はデ
ィスク７２から読み出した映像および音声の各ストリー
ムをトランスポートストリーム８に多重化するマルチプ
レクサ、７１はディスク７２に読み書きを行うためのデ
ィスクコントローラ、７２はディスクコントローラ７１
によりエンコーダ２０の出力したストリームを記録して
おくためのディスク、１３１は主映像およびマルチアン
グル映像を含む全ての映像、１３２は映像に付随した音
声である。図１５において、実施の形態１の映像符号化
装置と同じ動作を行う構成要素のものには同符号を割当
てている。In FIG. 15, reference numeral 6 denotes a bit allocator 4
1, an external controller for externally controlling the encoder 20, the disk controller 71, and the multiplexer 52; 8, a transport stream output from the multiplexer 52; 41, multi-angle information 1
2, a bit allocator that performs bit allocation based on
0 is a video 13 based on the bit allocation information 43 output from the bit allocator 41 and the multi-angle information 12.
1 and an encoder for encoding the audio 132; 52, a multiplexer for multiplexing the video and audio streams read from the disk 72 into the transport stream 8; 71, a disk controller for reading from and writing to the disk 72; Controller 71
, A disk for recording the stream output from the encoder 20 according to the above, 131 is all video including the main video and multi-angle video, and 132 is audio accompanying the video. In FIG. 15, components having the same operations as those of the video encoding device according to the first embodiment are assigned the same reference numerals.

【００６４】以上のように構成された映像符号化装置に
ついて、以下その動作について説明する。The operation of the video encoding apparatus configured as described above will be described below.

【００６５】主映像とマルチアングル映像を含む全ての
映像１３１および映像に付随した音声１３２は、ＶＴＲ
等の記録媒体に記録されている。一例として、図２に示
したマルチアングル映像を、図１６に示すような順序で
記録する例を説明する。主映像とマルチアングル映像
は、再生できる程度であればどのように分割して記録さ
れていても良い。マルチアングル情報１２は、実施の形
態１で説明したものと同様である。但し、ＶＴＲ等の記
録媒体に映像を分割記録しておく場合は、映像との対応
が取れる形式にしておかなければならない。マルチアン
グル情報１２を用いてビット割当てを行うビットアロケ
ータ４１の動作は、実施の形態１で説明したものと同様
である。All the video 131 including the main video and the multi-angle video and the audio 132 accompanying the video are
And the like. As an example, an example in which the multi-angle video shown in FIG. 2 is recorded in the order shown in FIG. 16 will be described. The main video and the multi-angle video may be recorded in any way as long as they can be reproduced. The multi-angle information 12 is the same as that described in the first embodiment. However, when the video is divided and recorded on a recording medium such as a VTR, the format must be compatible with the video. The operation of bit allocator 41 for performing bit allocation using multi-angle information 12 is the same as that described in the first embodiment.

【００６６】次に、ビットアロケータ４１より得られた
ビット割当て情報４３と、マルチアングル情報１２とを
用いて、エンコーダ２０において全ての映像および音声
の符号化を行う。エンコーダ内部の構成は、実施の形態
１において図６を用いて説明したものと同様である。実
施の形態１で説明したように、主映像のエンコーダとマ
ルチアングル映像のエンコーダは全く同じ構成のものを
用いることができる。エンコーダ２０の出力するストリ
ームについて、図１６を用いて説明する。図２で示した
ようなマルチアングルの映像ソースを、図１６に示した
順序で符号化する。この場合、［０，Ｔ）の時間では主
映像をエンコードし、トランスポートストリーム１を得
る。次に［Ｔ，Ｔ＋Ｔ１）の時間でマルチアングルのシ
ーン１を符号化し、トランスポートストリーム２を得
る。最後に［Ｔ＋Ｔ１，Ｔ＋Ｔ１＋Ｔ２）の時間でマル
チアングルのシーン２を符号化し、トランスポートスト
リーム３を得る。このように、少なくとも、主映像とマ
ルチアングル映像を別々のストリームにする必要があ
る。Next, using the bit allocation information 43 obtained from the bit allocator 41 and the multi-angle information 12, the encoder 20 encodes all video and audio. The configuration inside the encoder is the same as that described in Embodiment 1 with reference to FIG. As described in the first embodiment, the encoder for the main video and the encoder for the multi-angle video can have exactly the same configuration. The stream output from the encoder 20 will be described with reference to FIG. A multi-angle video source as shown in FIG. 2 is encoded in the order shown in FIG. In this case, at the time [0, T), the main video is encoded, and the transport stream 1 is obtained. Next, the multi-angle scene 1 is encoded at the time [T, T + T1), and the transport stream 2 is obtained. Finally, a multi-angle scene 2 is encoded at a time of [T + T1, T + T1 + T2), and a transport stream 3 is obtained. As described above, at least the main video and the multi-angle video need to be separated into separate streams.

【００６７】ディスクコントローラ７１は、エンコーダ
の出力したトランスポートストリームをディスク７２に
格納する。マルチプレクサ５２は、ディスク７２に格納
されている複数のトランスポートストリームを多重化す
る。マルチプレクサ５２の動作は、ディスクから逐次読
み出して多重化する点が実施の形態１で説明したマルチ
プレクサ５１と異なるが、基本的には同様の構成で実施
可能なため、詳細な説明は省略する。デコーダの動作
は、実施の形態１で述べた通りである。The disk controller 71 stores the transport stream output from the encoder on the disk 72. The multiplexer 52 multiplexes a plurality of transport streams stored on the disk 72. The operation of the multiplexer 52 is different from that of the multiplexer 51 described in the first embodiment in that the operation is sequentially read from the disk and multiplexed. However, since the operation can be basically performed with the same configuration, detailed description is omitted. The operation of the decoder is as described in the first embodiment.

【００６８】なお、本実施の形態では、マルチアングル
映像が１個の場合を説明したが、マルチアングル映像を
並列に２個以上符号化する場合も同様の方法を用いて実
現できるため、詳細は記述しない。Although the present embodiment has been described with reference to a case where there is one multi-angle image, a case where two or more multi-angle images are encoded in parallel can be realized using the same method. Do not write.

【００６９】また、本実施の形態では、ディスクおよび
ディスクコントローラを利用する例を示したが、ストリ
ームを記憶して読み出しできる媒体であればディスクに
特定するものではない。In this embodiment, an example is shown in which a disk and a disk controller are used. However, a medium that can store and read a stream is not limited to a disk.

【００７０】また、本実施の形態ではエンコーダ２０は
トランスポートストリームを生成するとしたが、エンコ
ーダ２０がビデオおよびオーディオのエレメンタリース
トリームをそれぞれ出力しても良い。この場合、各エレ
メンタリーストリームを記憶媒体に蓄積し、マルチプレ
クサ５２で多重化する際にマルチプレクサ内に用意した
システムエンコーダによりトランスポートストリームに
変換するといった構成を取ることが可能である。In the present embodiment, the encoder 20 generates the transport stream. However, the encoder 20 may output the video and audio elementary streams. In this case, it is possible to adopt a configuration in which each elementary stream is stored in a storage medium and converted into a transport stream by a system encoder prepared in the multiplexer when the multiplexer 52 multiplexes the elementary streams.

【００７１】（実施の形態３）本発明の実施の形態３で
は、予め準備されているマルチアングル映像を含む映像
コンテンツを符号化するに際し、主映像とマルチアング
ル映像を１回符号化して映像の符号化難易度情報を測定
し、測定した符号化難易度情報に基づいてビット割当て
を行い、得られたビット割当て情報に基づいて、主映像
とマルチアングル映像を直列に符号化してディスクに蓄
積し、後にディスクから読み出して多重化した１本の放
送用トランスポートストリームを出力する映像符号化装
置を説明する。(Embodiment 3) In Embodiment 3 of the present invention, when encoding video content including a multi-angle video prepared in advance, the main video and the multi-angle video are encoded once to encode the video. Measure the encoding difficulty information, perform bit allocation based on the measured encoding difficulty information, encode the main video and the multi-angle video in series based on the obtained bit allocation information, and accumulate them on the disc. A video encoding apparatus that outputs one broadcast transport stream read from a disk and multiplexed later will be described.

【００７２】図１７に、本実施の形態における映像符号
化装置のブロック図を示す。FIG. 17 is a block diagram of a video encoding apparatus according to the present embodiment.

【００７３】図１７において、６は２パスビットアロケ
ータ４２、１パスエンコーダ２０３、２パスエンコーダ
２０４、ディスクコントローラ７１、およびマルチプレ
クサ５２を外部より制御するための外部コントローラ、
４２はマルチアングル情報１２および符号化難易度情報
４６に基づいてビット割当てを行う２パスビットアロケ
ータ、４４は２パスビットアロケータの出力するビット
割当て情報、４６は１パスエンコーダ２０３より出力さ
れた符号化難易度情報、５２はディスクから読み出した
映像および音声の各ストリームをトランスポートストリ
ーム８に多重化するマルチプレクサ７１はディスクに読
み書きを行うためのディスクコントローラ、７２はエン
コーダ２０の出力したストリームを記録しておくための
ディスク、１４１は主映像およびマルチアングル映像を
含む全ての映像、１４２は全ての映像に付随した音声、
１５１は１パスエンコーダ２０３の入力映像端子、１５
２は２パスエンコーダ２０４の入力映像端子、１６１は
１パスエンコーダ２０３の入力音声端子、１６２は２パ
スエンコーダ２０４の入力音声端子、２０３はマルチア
ングル情報１２に基づいて映像１４１および音声１４２
の第１回目の符号化を行う１パスエンコーダ、２０４は
２パスビットアロケータ４２の出力するビット割当て情
報４４およびマルチアングル情報１２に基づいて映像１
４１および音声１４２の第２回目の符号化を行う２パス
エンコーダである。ここで、図１７において、実施の形
態１、実施の形態２の映像符号化装置と同様の動作を行
う構成要素には同じ番号を割り振っている。In FIG. 17, reference numeral 6 denotes an external controller for externally controlling the 2-pass bit allocator 42, the 1-pass encoder 203, the 2-pass encoder 204, the disk controller 71, and the multiplexer 52.
42 is a two-pass bit allocator that performs bit allocation based on the multi-angle information 12 and the encoding difficulty information 46, 44 is the bit allocation information output from the two-pass bit allocator, and 46 is the encoding output from the one-pass encoder 203. Difficulty level information, 52 is a multiplexer for multiplexing the video and audio streams read from the disc into the transport stream 8, a multiplexer 71 is a disk controller for reading and writing to the disc, and 72 is a stream that records the stream output from the encoder 20. Disc 141, all video including the main video and multi-angle video, 142 is audio accompanying all the video,
151 is an input video terminal of the 1-pass encoder 203;
2 is an input video terminal of the two-pass encoder 204, 161 is an input audio terminal of the one-pass encoder 203, 162 is an input audio terminal of the two-pass encoder 204, and 203 is a video 141 and an audio 142 based on the multi-angle information 12.
The first pass encoder 204 performs the first encoding of the video 1 based on the bit allocation information 44 and the multi-angle information 12 output from the two-pass bit allocator 42.
This is a two-pass encoder that performs a second encoding of the audio 41 and the audio 142. Here, in FIG. 17, the same numbers are assigned to components that perform the same operations as those of the video encoding devices according to the first and second embodiments.

【００７４】以上のように構成された映像符号化装置に
ついて、以下その動作について説明する。The operation of the video encoding apparatus configured as described above will be described below.

【００７５】主映像とマルチアングル映像を含む全ての
映像１４１および全ての映像に付随した音声１４２は、
ＶＴＲ等の記録媒体に記録されている。記録方法は、実
施の形態２で説明した方法と同様である。マルチアング
ル情報１２は、実施の形態１で説明したものと同様であ
る。但し、ＶＴＲ等の記録媒体に映像を分割記録してお
く場合は、映像との対応が取れる形式にしておかなけれ
ばならない。All the images 141 including the main image and the multi-angle image and the sound 142 attached to all the images are
It is recorded on a recording medium such as a VTR. The recording method is the same as the method described in the second embodiment. The multi-angle information 12 is the same as that described in the first embodiment. However, when the video is divided and recorded on a recording medium such as a VTR, the format must be compatible with the video.

【００７６】１パスエンコードの実行時は、映像１４１
は端子１５１に、音声１４２は端子１６１に接続され
る。１パスエンコーダ２０３は、マルチアングル情報１
２を用いて、主映像およびマルチアングル映像を含む全
ての映像および音声の符号化を行う。１パスエンコード
の詳細は、従来の技術における図１９を用いて説明した
通りである。１パスエンコードにより得られる符号化難
易度情報４６は、一旦ディスクコントローラ７１を通し
てディスク７２に蓄積される。ここで、符号化難易度情
報４６は、１パスエンコーダ２０３から出力される発生
符号量などを仮定しているが、１パスエンコーダ２０３
から出力される情報がディスク７２に蓄積された後、加
工・編集したものでも良い。When one-pass encoding is performed, the image 141
Is connected to the terminal 151, and the audio 142 is connected to the terminal 161. The one-pass encoder 203 outputs the multi-angle information 1
2, encoding of all video and audio including the main video and the multi-angle video is performed. The details of the one-pass encoding are as described with reference to FIG. 19 in the related art. The encoding difficulty information 46 obtained by one-pass encoding is temporarily stored in the disk 72 through the disk controller 71. Here, the encoding difficulty information 46 assumes the amount of generated code output from the one-pass encoder 203, but the
May be processed and edited after the information output from is accumulated on the disk 72.

【００７７】２パスビットアロケータ４２は、マルチア
ングル情報１２およびディスク７２から読み出した符号
化難易度情報４６を用いて、２パスエンコーダ２０４に
入力するためのビット割当て情報４４を算出する。これ
により、マルチアングル映像も含めた実際の映像の難易
度に応じたビット割当てが可能となる。ビット割当てを
行う２パスビットアロケータ４２の動作は、実施の形態
１で説明したものと基本的には同様であるが、符号化難
易度情報４６を用いることで、より画質を向上させるビ
ット割当てを行うことが可能である。２パスエンコード
の実行時は、映像１４１は端子１５２に、音声１４２は
端子１６２に接続される。２パスエンコーダ２０４は、
２パスビットアロケータ４２より得られたビット割当て
情報４４と、マルチアングル情報１２とを用いて、全て
の映像１４１および音声１４２の符号化を行う。２パス
エンコーダ２０４内部の構成は、実施の形態１で説明し
たものと同様である。２パスエンコーダ２０４の出力す
るストリームは、実施の形態２で図１６を用いて説明し
た通りである。また、主映像をシステム符号化してトラ
ンスポートストリームを生成する際には、実施の形態１
で説明した仮パケットの挿入を行う。ディスクコントロ
ーラ７１は、エンコーダの出力したトランスポートスト
リームをディスク７２に格納する。The 2-pass bit allocator 42 uses the multi-angle information 12 and the encoding difficulty information 46 read from the disk 72 to calculate bit allocation information 44 to be input to the 2-pass encoder 204. As a result, it is possible to assign bits according to the degree of difficulty of an actual video including a multi-angle video. The operation of the 2-pass bit allocator 42 that performs bit allocation is basically the same as that described in the first embodiment, but by using the encoding difficulty information 46, the bit allocation for further improving the image quality can be performed. It is possible to do. When the two-pass encoding is performed, the video 141 is connected to the terminal 152 and the audio 142 is connected to the terminal 162. The two-pass encoder 204
Using the bit allocation information 44 obtained from the two-pass bit allocator 42 and the multi-angle information 12, all the video 141 and audio 142 are encoded. The configuration inside two-pass encoder 204 is the same as that described in the first embodiment. The stream output from the 2-pass encoder 204 is as described in the second embodiment with reference to FIG. In addition, when a transport stream is generated by performing system encoding on a main video, the first embodiment is used.
The insertion of the temporary packet described in (1) is performed. The disk controller 71 stores the transport stream output from the encoder on the disk 72.

【００７８】マルチプレクサ５２は、実施の形態２で説
明したものと同様の動作を行う。デコーダの動作は、実
施の形態１で述べた通りである。The multiplexer 52 performs the same operation as that described in the second embodiment. The operation of the decoder is as described in the first embodiment.

【００７９】なお、本実施の形態では、マルチアングル
映像が１つの場合を説明したが、マルチアングル映像を
並列に２つ以上符号化する場合も同様の方法を用いて実
現できるため、詳細は記述しない。Although the present embodiment has been described with reference to a case where there is one multi-angle image, a case where two or more multi-angle images are encoded in parallel can be realized by using the same method. do not do.

【００８０】また、本実施の形態では、ディスクおよび
ディスクコントローラを利用する例を示したが、ストリ
ームを記憶して読み出しできる媒体であればディスクに
特定するものではない。In the present embodiment, an example is shown in which a disk and a disk controller are used. However, any medium that can store and read a stream is not limited to a disk.

【００８１】また、本実施の形態では２パスエンコーダ
２０４はトランスポートストリームを生成するとした
が、２パスエンコーダ２０４がビデオおよびオーディオ
のエレメンタリーストリームをそれぞれ出力しても良
い。この場合、各エレメンタリーストリームを記憶媒体
に蓄積し、マルチプレクサ５２で多重化する際にシステ
ムエンコーダによりトランスポートストリームに変換す
るといった構成を取ることが可能である。In this embodiment, the two-pass encoder 204 generates a transport stream. However, the two-pass encoder 204 may output a video and an audio elementary stream. In this case, it is possible to adopt a configuration in which each elementary stream is stored in a storage medium and converted into a transport stream by a system encoder when multiplexed by the multiplexer 52.

【００８２】また、本発明では２パスエンコードを例に
とって説明したが、３回以上のエンコードを行う場合で
も、本質的に処理手段に変化はなく、容易に実行するこ
とができる。さらに、本実施の形態では説明を容易にす
るためにエンコーダの数を２個としたが、１個のエンコ
ーダを用いて主映像およびマルチアングル映像を含む全
ての映像をそれぞれ複数回符号化するといった構成も、
同様に実現することができる。In the present invention, two-pass encoding has been described as an example. However, even when encoding is performed three or more times, there is essentially no change in the processing means, and the encoding can be easily performed. Further, in the present embodiment, the number of encoders is set to two for ease of explanation, but all the images including the main image and the multi-angle image are encoded a plurality of times using one encoder. The configuration is also
It can be realized similarly.

【００８３】[0083]

【発明の効果】以上の説明から明らかなように、本発明
によれば、デジタルＴＶ放送において、予め制作された
マルチアングル映像を含むコンテンツを放送する場合
に、マルチアングル映像の存在する時間情報とビット割
当て情報に基づいて、マルチアングル映像を前もって送
信できるだけの帯域を、当該マルチアングル映像が開始
される以前の時間帯に確保し、確保した帯域分を除いた
平均ビットレートで主映像をＶＢＲ符号化し、各々のト
ランスポートストリームを多重化することにより、帯域
を増加させずにマルチアングル映像を送信することがで
きる。As is apparent from the above description, according to the present invention, in the case of broadcasting a content including a multi-angle video that has been produced in advance in digital TV broadcasting, the time information on the presence of the multi-angle video and Based on the bit allocation information, a band capable of transmitting the multi-angle video in advance is secured in a time period before the start of the multi-angle video, and the main video is VBR-coded at an average bit rate excluding the secured bandwidth. By multiplexing each transport stream, multi-angle video can be transmitted without increasing the bandwidth.

【００８４】また、複数回の符号化を実行する場合に
は、映像の符号化難易度情報を用いて、主映像およびマ
ルチアングル映像を含む全ての映像のビット割当てを行
うことができるため、映像全体の画質を均質にし、全体
として画質を改善した映像符号化装置を構成することが
可能となる。In the case where encoding is performed a plurality of times, the bit assignment of all the images including the main image and the multi-angle image can be performed using the encoding difficulty information of the image. This makes it possible to configure a video encoding device in which the overall image quality is made uniform and the overall image quality is improved.

【００８５】さらに、復号化の際には、マルチアングル
映像および音声を記録するためのディスクと、ストリー
ムから読み取ったマルチアングル映像の開始および終了
時刻の情報に応じて出力映像および音声の切替え信号を
生成するマルチアングル設定器とを用いることで、受信
側で映像を選択できる復号化装置を構成することが可能
となる。Further, at the time of decoding, a disk for recording multi-angle video and audio, and a switching signal for output video and audio in accordance with information on the start and end times of the multi-angle video read from the stream. By using the generated multi-angle setting device, it is possible to configure a decoding device capable of selecting a video on the receiving side.

[Brief description of the drawings]

【図１】本発明の実施の形態１によおける映像符号化装
置の構成を示すブロック図FIG. 1 is a block diagram showing a configuration of a video encoding device according to a first embodiment of the present invention.

【図２】マルチアングル映像の例を示す図FIG. 2 is a diagram showing an example of a multi-angle video.

【図３】マルチアングル映像のＣＢＲ符号化の例を示す
図FIG. 3 is a diagram illustrating an example of CBR encoding of a multi-angle video.

【図４】同ビット割当ての例を示す図FIG. 4 is a diagram showing an example of the bit allocation.

【図５】同ストリーム上のデータ配置の例を示す図FIG. 5 is a diagram showing an example of data arrangement on the stream.

【図６】同エンコーダの構成を示すブロック図FIG. 6 is a block diagram showing a configuration of the encoder.

【図７】同ビデオエンコーダの構成を示すブロック図FIG. 7 is a block diagram showing a configuration of the video encoder.

【図８】同ＶＢＶバッファの占有度の調整を示す図FIG. 8 is a diagram showing adjustment of the occupancy of the VBV buffer;

【図９】同第一のシステムエンコーダの構成を示すブロ
ック図FIG. 9 is a block diagram showing a configuration of the first system encoder.

【図１０】同第二のシステムエンコーダの構成を示すブ
ロック図FIG. 10 is a block diagram showing the configuration of the second system encoder;

【図１１】同マルチアングル映像の符号化時刻の変更の
例を示す図FIG. 11 is a diagram showing an example of changing the encoding time of the multi-angle video.

【図１２】同マルチプレクサの構成を示すブロック図FIG. 12 is a block diagram showing a configuration of the multiplexer.

【図１３】同マルチプレクサにおけるパケット置換方法
を示す概念図FIG. 13 is a conceptual diagram showing a packet replacement method in the multiplexer.

【図１４】同によるデコーダの構成を示すブロック図FIG. 14 is a block diagram showing a configuration of a decoder according to the embodiment;

【図１５】本発明の実施の形態２による映像符号化装置
の構成を示すブロック図FIG. 15 is a block diagram showing a configuration of a video encoding device according to a second embodiment of the present invention.

【図１６】同主映像とマルチアングル映像の符号化順を
示す図FIG. 16 is a diagram showing an encoding order of the main video and the multi-angle video.

【図１７】本発明の実施の形態３による映像符号化装置
の構成を示すブロック図FIG. 17 is a block diagram showing a configuration of a video encoding device according to a third embodiment of the present invention.

【図１８】ＭＰＥＧ２に基づくビデオエンコーダの構成
を示すブロック図FIG. 18 is a block diagram showing a configuration of a video encoder based on MPEG2.

【図１９】２パスエンコードを行う装置の構成を示すブ
ロック図FIG. 19 is a block diagram showing a configuration of an apparatus for performing two-pass encoding.

[Explanation of symbols]

６外部コントローラ８出力トランスポートストリーム１２マルチアングル情報２０エンコーダ２４システムエンコーダ４１ビットアロケータ４２２パスビットアロケータ４３、４４ビット割当て情報４６符号化難易度情報５１、５２マルチプレクサ７１ディスクコントローラ７２ディスク９５マルチアングル設定器１０１主映像１０２主映像に付随した音声１１１マルチアングル映像１１２マルチアングル映像に付随した音声１３１、１４１映像１３２、１４２映像に付随した音声２０１第一のエンコーダ２０２第二のエンコーダ２０３１パスエンコーダ２０４２パスエンコーダ２４５仮パケット発生器５１１主映像のトランスポートストリーム５１２マルチアングル映像のトランスポートストリー
ム５１３主映像・音声バッファ５１４マルチアングル映像・音声バッファ５１５ＰＩＤ判定器９１３外部入力手段6 External controller 8 Output transport stream 12 Multi-angle information 20 Encoder 24 System encoder 41 Bit allocator 42 2-pass bit allocator 43, 44 Bit allocation information 46 Encoding difficulty information 51, 52 Multiplexer 71 Disk controller 72 Disk 95 Multi-angle setting Device 101 main video 102 audio attached to main video 111 multi-angle video 112 audio attached to multi-angle video 131, 141 video 132, 142 audio attached to video 201 first encoder 202 second encoder 203 one-pass encoder 204 2-pass encoder 245 Temporary packet generator 511 Transport stream of main video 512 Transport stream of multi-angle video 5 13 Main video / audio buffer 514 Multi-angle video / audio buffer 515 PID decision unit 913 External input means

フロントページの続きＦターム(参考） 5C053 FA20 FA22 FA23 GB02 GB06 GB08 GB11 GB12 GB18 GB19 GB22 GB26 GB32 GB38 GB40 JA22 KA08 KA18 KA24 5C059 KK23 MA00 MA05 MA23 MC11 RB01 RB10 RC04 RC12 RC32 SS02 SS11 SS20 TA60 TB02 TC10 TC16 TC24 TC37 TC38 UA02 UA05 UA08 5C063 AB03 AB07 AC01 AC05 AC10 CA11 CA12 DA01 Continued on front page F-term (reference) 5C053 FA20 FA22 FA23 GB02 GB06 GB08 GB11 GB12 GB18 GB19 GB22 GB26 GB32 GB38 GB40 JA22 KA08 KA18 KA24 5C059 KK23 MA00 MA05 MA23 MC11 RB01 RB10 RC04 RC12 RC32 SS02 SS11 SS20 TA60 TB02 TC37 TC16 TC16 TC16 TC24 UA02 UA05 UA08 5C063 AB03 AB07 AC01 AC05 AC10 CA11 CA12 DA01

Claims

[Claims]

1. A video encoding apparatus for encoding a main video and a sub-video accompanying the main video as a coded stream of a predetermined bit rate, comprising a time including at least a start time and an end time of the sub-video. Based on the information, a bit allocator that outputs bit allocation information indicating the bit allocation of the main video and the sub-video in the coded stream, a stream that encodes the main video and a stream that encodes the sub-video A video encoding device, comprising: a multiplexer that multiplexes based on the bit allocation information and generates the encoded stream.

2. A method according to claim 1, wherein the bit allocator allocates bits in the coded stream of the main video and the sub video based on at least coding difficulty information including a code amount generated when the video is coded. The video encoding device according to claim 1, wherein:

3. A first encoder for encoding a main image based on time information and bit allocation information output by a bit allocator, and a second encoder for encoding a sub-image based on the time information and the bit allocation information. The video encoding device according to claim 1 or 2, further comprising:

4. An encoder for encoding a main video and a sub video based on time information and bit allocation information output by a bit allocator, and a recording medium for temporarily storing the encoded stream. 3. The video encoding device according to claim 1, wherein:

5. A one-pass encoder for encoding a main image and a sub-image in order to obtain encoding difficulty information, and the main image and the sub-image based on time information and bit allocation information output from a bit allocator. And a recording medium for temporarily storing a stream encoded by the one-pass encoder and the two-pass encoder.
The video encoding device according to Claim 1.

6. A first encoder, encoder or 2
The path encoder, when encoding the main video and the sub video, at all the start time and all the end time of the sub video, the buffer occupancy at the time of decoding the stream obtained by encoding the main video and the sub video 3. An encoding control circuit for performing control for matching
The video encoding device according to claim 4.

7. An apparatus according to claim 3, further comprising an audio encoder for simultaneously encoding audio accompanying the main video or the sub video when encoding the main video or the sub video. The video encoding device according to claim 4, claim 5, or claim 6.

8. A system encoder according to claim 3, further comprising a system encoder for packetizing the encoded main video data and multiplexing the data into a transport stream for transmission. Item 5. The video encoding device according to claim 6.

9. A system encoder for packetizing encoded main video data and audio data encoded along with said main video and multiplexing them into a transport stream for transmission. The video encoding device according to claim 7, further comprising:

10. A tentative packet for generating a tentative packet at a position where a multiplexed stream of a sub-picture is inserted by a multiplexer based on bit allocation information output from a bit allocator in a coded stream of a main picture. The video encoding device according to claim 8, further comprising a generator.

11. A multiplexer, comprising: a first buffer for temporarily storing a transport stream; and a stream encoding a sub-picture or a stream encoding the sub-picture and audio accompanying the sub-picture. A second buffer for temporarily storing data, and data rewriting means for replacing a temporary packet existing in the first buffer with a stream existing in the second buffer. The video encoding device according to claim 10.

12. A stream obtained by encoding a sub-picture or a stream obtained by encoding the sub-picture and audio accompanying the sub-picture is data obtained by encoding the sub-picture,
12. The video encoding device according to claim 11, wherein data obtained by encoding the sub-video and audio accompanying the sub-video are packetized.

13. A video encoding method for encoding a main video and a sub video accompanying the main video with a coded stream of a predetermined bit rate, wherein the time includes at least a start time and an end time of the sub video. A bit allocation step of performing bit allocation of the main video and the sub video in the coded stream based on the information, and multiplexing the main video coded stream and the sub video coded stream. A video multiplexing step.

14. The bit allocating step performs bit allocation in the coded stream of the main video and the sub-video based on at least coding difficulty information including a code amount generated when the video is coded. 14. The video encoding method according to claim 13, wherein:

15. A first encoding step for encoding a main video based on time information and bit allocation information output from the bit allocation step, and encoding a sub-video based on the time information and the bit allocation information. 15. The video encoding method according to claim 13, further comprising a second encoding step.

16. The video code according to claim 13, further comprising an encoding step of encoding the main image and the sub-image based on the time information and the bit allocation information output by the bit allocation step. Device

17. The main video based on a one-pass encoding step of encoding a main video or a sub-video to obtain encoding difficulty information, and time information and bit allocation information output by a bit allocation step. The video encoding method according to claim 14, further comprising a two-pass encoding step of encoding the sub-image.

18. The first encoding step, the encoding step or the two-pass encoding step, when encoding a main video and a sub-video, at all start times and all end times of the sub-video. 18. The coding method according to claim 15, further comprising an encoding control step of performing control for matching the buffer occupancy at the time of decoding of the stream obtained by encoding the main image and the sub-image. Video encoding method.

19. The method according to claim 15, further comprising an audio encoding step for simultaneously encoding audio accompanying the main video or the sub video when encoding the main video or the sub video. Claim 16, claim 17,
The video encoding method according to claim 18.

20. The system according to claim 15, further comprising a system encoding step for packetizing the encoded main video data and multiplexing the data into a transport stream for transmission. , Claim 17,
The video encoding method according to claim 18.

21. System encoding for packetizing encoded main video data and audio data encoded along with the main video and multiplexing them into a transport stream for transmission. The video encoding method according to claim 19, further comprising a step.

22. A system encoding step includes, in a coded stream of a main video, a provisional packet at a position where a coded stream of a sub-video is inserted by a multiplexing step, based on bit allocation information output by the bit allocation step. 22. The video encoding method according to claim 20, further comprising a provisional packet generating step of generating a temporary packet.

23. The multiplexing step includes a data rewriting step for replacing a temporary packet with a stream obtained by encoding a sub-picture or a stream obtained by encoding the sub-picture and audio accompanying the sub-picture. 23. The video encoding method according to claim 22, wherein:

24. A stream obtained by encoding a sub-picture or a stream obtained by encoding the sub-picture and audio accompanying the sub-picture is composed of data obtained by encoding the sub-picture or the sub-picture and the sub-picture. 24. The video encoding method according to claim 23, wherein the data obtained by encoding the attached audio is packetized.

25. A recording medium on which a stream generated by using the video encoding device and method according to claim 1 is recorded.

26. A decoding device for decoding a stream generated by using the video encoding device and method according to claim 1, wherein the stream is obtained by encoding a sub-video. Or a recording device for temporarily storing a stream obtained by encoding the sub-video and audio accompanying the sub-video.

27. An external input means for externally setting whether or not to select a sub-picture, and reading time information of a start time and an end time of the sub-picture from an encoded stream and selecting the sub-picture by the external input means. 27. The decoding apparatus according to claim 26, further comprising: a sub-picture setting device that generates a switching signal for switching to an output picture and an output sound according to the time information when the picture is a sub-picture.