JP2003235011A

JP2003235011A - Program stream production apparatus and recording and reproducing apparatus employing the same

Info

Publication number: JP2003235011A
Application number: JP2002034806A
Authority: JP
Inventors: Keisuke Inada; 圭介稲田; Susumu Takahashi; 将高橋
Original assignee: Hitachi Ltd
Current assignee: Hitachi Ltd
Priority date: 2002-02-13
Filing date: 2002-02-13
Publication date: 2003-08-22

Abstract

<P>PROBLEM TO BE SOLVED: To provide a program stream generating apparatus capable of efficiently producing a program stream from a compressed video stream adopting a variable bit rate method in a field with high real time performance such as a DVD-RAM. <P>SOLUTION: A video bit rate in a VOBU (video object unit) is obtained from a video stream subjected to variable bit rate compression for each VOBU, and reference time information (SCR) to be captured in an input buffer and a decoding time stamp/presentation time stamp (DTS/PTS) are calculated on the basis of the bit rate. Audio and video packs are interleaved according to them to produce the program stream. As a result, the audio packs and the video packs are arranged in average and audio and video data for a longer time can be stored in a medium with a limited capacity. <P>COPYRIGHT: (C)2003,JPO

Description

Detailed Description of the Invention

【０００１】[0001]

【発明の属する技術分野】本発明は、ＤＶＤレコーダ等
で実時間エンコード及び記録を行なう分野において、圧
縮された映像ストリーム及び音声ストリームをもとに、
リアルタイムにプログラムストリームを生成し、または
これを記録再生する装置に係る。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a field for performing real-time encoding and recording on a DVD recorder or the like, based on a compressed video stream and audio stream.
The present invention relates to an apparatus for generating a program stream in real time or recording / reproducing this.

【０００２】[0002]

【従来の技術】動画像圧縮規格の一つに、ＭＰＥＧ(Mov
ing Picture coding Experts Group)がある。これは、
動画像を効率的に圧縮することに関するＩＳＯ(Interna
tionalStandard Organization)のＷＧ８(Working Group
8)のある委員会、またはそれに関する圧縮規格(CCITT
H261)を意味する。この中で、ＭＰＥＧ−２圧縮方式で
は、固定ビットレート符号化方式、または可変ビットレ
ート符号化方式が使用されている。2. Description of the Related Art MPEG (Mov
ing Picture coding Experts Group). this is,
ISO (Interna) regarding efficient compression of moving images
WG8 (Working Group) of the National Standard Organization
8) or a compression standard (CCITT
H261) is meant. Among them, the MPEG-2 compression method uses a fixed bit rate coding method or a variable bit rate coding method.

【０００３】固定ビットレート符号化方式を用いる場
合、ある画像グループ単位、例えばＧＯＰ（Group Of P
icture）単位（再生専用ＤＶＤでは１５フレーム程度）
で一定の符号化発生情報量となるように符号化制御が行
われる。すなわち、１画面の目標情報量と発生情報量と
の差分値と、仮想的なバッファ占有量の初期値とから、
直交変換係数の量子化ステップを決定することによって
符号化制御を行う。このような固定ビットレート符号化
方式は、例えば特開平８−３４０５３３号公報に開示さ
れる。When the fixed bit rate coding method is used, a certain image group unit, for example, GOP (Group Of P
icture) unit (about 15 frames for playback-only DVD)
The encoding control is performed so that the amount of generated encoding information is constant. That is, from the difference value between the target information amount and the generated information amount for one screen and the initial value of the virtual buffer occupation amount,
Coding control is performed by determining the quantization step of the orthogonal transform coefficient. Such a constant bit rate encoding method is disclosed in, for example, Japanese Patent Laid-Open No. 8-340533.

【０００４】可変ビットレート符号化方式を用いる場
合、一度仮符号化を行い、ピクチャやＧＯＰ単位等での
動画像の符号化難易度を抽出する。その後、抽出された
符号化難易度に従ってピクチャやＧＯＰ単位でビット量
の割り当てを行いながら再度実符号化を行う。ここでビ
ット量を割り当てる際には、所定の目標平均発生情報
量、伝送媒体または記憶媒体の最大許容発生情報量、お
よびデコーダバッファのサイズを考慮する必要がある。
以上のようにして、符号化難易度の低い画像には平均よ
りも少ないビット量を割り当て、逆に符号化難易度の高
い画像には平均よりも多いビット量を割り当てることに
よって、所定の目標平均発生情報量によって限定された
総ビット量内で、動画像の全体的な画質向上を達成する
ことが可能となる。このような可変ビットレート符号化
方式は、再生専用ＤＶＤのオーサリング等、主にパッケ
ージメディアで用いられている。ビットレート符号化方
式は、例えば、特開平８−２８９２９４号公報に開示さ
れる。When the variable bit rate coding method is used, temporary coding is performed once to extract the coding difficulty of a moving image in units of pictures or GOPs. After that, actual encoding is performed again while allocating the bit amount for each picture or GOP according to the extracted encoding difficulty level. Here, when allocating the bit amount, it is necessary to consider the predetermined target average generated information amount, the maximum allowable generated information amount of the transmission medium or the storage medium, and the size of the decoder buffer.
As described above, by assigning a bit amount smaller than the average to an image with low coding difficulty and conversely assigning a bit amount larger than the average to an image with high coding difficulty, a predetermined target average is obtained. It is possible to improve the overall image quality of a moving image within the total bit amount limited by the generated information amount. Such a variable bit rate coding method is mainly used in package media such as authoring of a read-only DVD. The bit rate coding method is disclosed in, for example, Japanese Patent Laid-Open No. 8-289294.

【０００５】[0005]

【発明が解決しようとする課題】従来の固定ビットレー
ト方式で圧縮された映像ストリーム及び音声ストリーム
をもとにプログラムストリームを生成する場合、各パッ
クヘッダに記述する基準時刻値ＳＣＲの間隔は一定であ
る。しかしながら、従来の固定ビットレート方式では、
画像の符号化難易度に関わらずある画像グループ単位で
同一ビット量を割り当てることになる。そのため、符号
化難易度の低い画像では必要以上のビット量が割り当て
られるため、無駄なビット量を消費することになる。一
方、符号化難易度の高い画像では少ないビット量しか割
り当てられないため、画質劣化が生じ易い。When a program stream is generated based on a video stream and an audio stream compressed by the conventional fixed bit rate method, the intervals of the reference time value SCR described in each pack header are constant. is there. However, in the conventional constant bit rate system,
The same bit amount is assigned to each image group regardless of the image coding difficulty. Therefore, an unnecessary bit amount is consumed because an excessive bit amount is allocated to an image having a low coding difficulty. On the other hand, in an image with a high degree of difficulty in encoding, only a small bit amount is assigned, so that image quality deterioration easily occurs.

【０００６】本発明の目的は、上記の問題点を解決し、
ＤＶＤ−ＲＡＭ等の限られた容量の媒体に音声映像情報
を記録する分野で、より長い時間の記録を可能とする可
変ビットレート方式を用いたプログラムストリーム生成
装置および記録再生装置を提供することにある。The object of the present invention is to solve the above problems,
To provide a program stream generation apparatus and a recording / reproducing apparatus using a variable bit rate method that enables recording for a longer time in the field of recording audiovisual information on a medium having a limited capacity such as a DVD-RAM. is there.

【０００７】[0007]

【課題を解決するための手段】上記目的を達成するた
め、本発明のプログラムストリーム生成装置は、映像ス
トリームから映像パック生成に必要なペイロード分の映
像データを取り出して転送する映像データ転送部と、転
送された映像ストリームから固定長の映像パックを生成
する映像パック生成部と、映像パックに含まれるピクチ
ャデータを復号装置の映像入力バッファに取り込むべき
映像基準時刻値と、これを復号開始及び表示開始する時
刻値とを算出する映像時刻情報算出部と、算出された映
像時刻情報を前記映像パックのヘッダに記述する映像パ
ックヘッダ生成部と、音声ストリームから音声パック生
成に必要なペイロード分の音声データを取り出して転送
する音声データ転送部と、転送された音声ストリームか
ら固定長の音声パックを生成する音声パック生成部と、
音声パックに含まれる音声フレームデータを復号装置の
音声入力バッファに取り込むべき音声基準時刻値と、こ
れを復号開始及び表示開始する時刻値とを算出する音声
時刻情報算出部と、算出された音声時刻情報を音声パッ
クのヘッダに記述する音声パックヘッダ生成部と、映像
時刻情報の初期値及び増分値と、音声時刻情報の初期値
及び増分値と、先頭音声パック挿入位置を算出し、さら
にピクチャデータ量をもとに、音声映像パック群におけ
る映像パックと音声パックのパック挿入比率を算出する
初期情報算出部と、先頭音声パック挿入位置及びパック
挿入比率に応じて、映像パックと音声パックの生成要求
を映像パック生成部および音声パック生成部に発行し、
音声映像パック群の生成完了時に初期情報算出部に通知
する制御部と、を備える。In order to achieve the above object, a program stream generation device of the present invention comprises a video data transfer unit for extracting and transferring video data for a payload required for video pack generation from a video stream, A video pack generation unit that generates a fixed-length video pack from the transferred video stream, a video reference time value at which the picture data included in the video pack should be captured in the video input buffer of the decoding device, and decoding start and display start A video time information calculation unit that calculates a time value to be performed, a video pack header generation unit that describes the calculated video time information in the header of the video pack, and audio data for a payload required to generate an audio pack from an audio stream. Audio data transfer part that extracts and transfers the audio data, and a fixed-length audio packet from the transferred audio stream. And audio pack generation unit for generating a,
A voice time information calculation unit that calculates a voice reference time value that should take the voice frame data included in the voice pack into the voice input buffer of the decoding device, and a time value that starts decoding and displaying the voice reference time value, and the calculated voice time. An audio pack header generation unit that describes information in the header of an audio pack, an initial value and an increment value of video time information, an initial value and an increment value of audio time information, a head audio pack insertion position, and further calculates picture data. An initial information calculation unit that calculates the pack insertion ratio of the video pack and the audio pack in the audio-video pack group based on the amount, and a request to generate the video pack and the audio pack according to the leading audio pack insertion position and the pack insertion ratio. To the video pack generator and audio pack generator,
And a control unit that notifies the initial information calculation unit when the generation of the audio / video pack group is completed.

【０００８】また本発明のプログラムストリーム記録再
生装置は、入力されるアナログ映像および音声信号をそ
れぞれデジタル映像および音声信号に変換する映像およ
び音声入力部と、デジタル映像および音声信号を圧縮符
号化し映像および音声ストリームを生成する映像および
音声エンコーダ部と、映像および音声ストリームからプ
ログラムストリームを生成する前記記載のプログラムス
トリーム生成装置と、生成されたプログラムストリーム
を蓄積媒体に記録再生する蓄積部と、蓄積媒体から再生
したプログラムストリームを、映像および音声ストリー
ムに分離するプログラムストリーム分離部と、映像およ
び音声ストリームの復号化を行なう映像および音声デコ
ーダ部と、復号化されたデジタル映像および音声信号を
アナログ映像および音声信号に変換する映像および音声
出力部と、を備える。The program stream recording / reproducing apparatus of the present invention further includes a video and audio input section for converting an input analog video and audio signal into a digital video and audio signal, and a video and audio signal obtained by compressing and encoding the digital video and audio signal. A video and audio encoder section for generating an audio stream, a program stream generating apparatus described above for generating a program stream from the video and audio streams, a storage section for recording and reproducing the generated program stream in a storage medium, and a storage medium A program stream separation unit that separates the reproduced program stream into video and audio streams, a video and audio decoder unit that decodes the video and audio streams, and the decoded digital video and audio signals into analog video and audio signals. It includes a video and audio output unit converts the voice signal.

【０００９】[0009]

【発明の実施の形態】以下、本発明の実施の形態を、図
面を用いて説明する。はじめに、本発明で扱う信号の例
として、ＤＶＤ−ＲＡＭ規格におけるプログラムストリ
ームを説明する。図４は、ＤＶＤ−ＲＡＭ規格におけ
る、プログラムストリームの構成を示したものである。
ＤＶＤ−ＲＡＭ規格で用いられるプログラムストリーム
はＶＯＢ（ＶｉｄｅｏＯｂｊｅｃｔ）と呼ばれる。Ｖ
ＯＢは、複数のＶＯＢＵ（ＶｉｄｅｏＯｂｊｅｃｔ
Ｕｎｉｔ）で構成される。ＶＯＢＵは、ＭＰＥＧ映像ス
トリームの少なくとも１つのＧＯＰと、それに付随する
音声ストリームとをインタリーブした構成となってい
る。また、１つのＧＯＰデータは複数のＶＯＢＵにまた
がってはならない。ＶＯＢＵは、それに含まれるＧＯＰ
を構成する複数の映像パックと、それに付随する複数の
音声パックとで、音声映像パック群の形で構成される。BEST MODE FOR CARRYING OUT THE INVENTION Embodiments of the present invention will be described below with reference to the drawings. First, a program stream in the DVD-RAM standard will be described as an example of signals handled by the present invention. FIG. 4 shows the structure of a program stream in the DVD-RAM standard.
The program stream used in the DVD-RAM standard is called VOB (Video Object). V
OB is a plurality of VOBUs (Video Objects).
Unit). The VOBU has a configuration in which at least one GOP of an MPEG video stream and an audio stream accompanying it are interleaved. Also, one GOP data should not span multiple VOBUs. VOBU is a GOP included in it
A plurality of video packs that compose the above and a plurality of audio packs that accompany it are configured in the form of an audio / video pack group.

【００１０】映像パック（Ｖ＿ＰＣＫ）は、パックに含
まれるピクチャデータを復号装置の映像入力バッファに
取り込むべき基準時刻値（ＳＣＲ）を記述するパックヘ
ッダ４００と、映像復号開始時刻（ＤＴＳ）及び表示開
始時刻（ＰＴＳ）を記述する映像パケットヘッダ４０１
と、圧縮された映像ストリームが格納されるペイロード
４０２とで構成される。音声パック（Ａ＿ＰＣＫ）は、
パックに含まれる音声フレームを復号装置の音声入力バ
ッファに取り込むべき基準時刻値（ＳＣＲ）を記述する
パックヘッダ４００と、音声表示及び復号開始時刻（Ｐ
ＴＳ）を記述する音声パケットヘッダ４０１と、圧縮さ
れた音声ストリームが格納されるペイロード４０２とで
構成される。The video pack (V_PCK) includes a pack header 400 that describes a reference time value (SCR) for loading the picture data included in the pack into the video input buffer of the decoding device, a video decoding start time (DTS), and a display start. Video packet header 401 describing time (PTS)
And a payload 402 in which the compressed video stream is stored. The audio pack (A_PCK) is
A pack header 400 that describes a reference time value (SCR) that should capture the audio frame included in the pack into the audio input buffer of the decoding device, and the audio display and decoding start time (P
A voice packet header 401 describing a TS) and a payload 402 storing a compressed voice stream.

【００１１】本規格では、ＤＶＤ−ＲＡＭメディアの物
理的なセクタの大きさが２０４８Ｂｙｔｅに規定されて
いる。そのため、生成する映像パック及び音声パック
は、２０４８Ｂｙｔｅ固定長パックとする必要がある。In this standard, the size of the physical sector of the DVD-RAM medium is specified to 2048 bytes. Therefore, the generated video pack and audio pack must be 2048 Byte fixed length packs.

【００１２】図１は本発明によるプログラムストリーム
生成装置の一実施形態である。プログラムストリーム生
成装置１は、圧縮された映像ストリーム１００と、音声
ストリーム１０１と、映像ストリームに含まれるピクチ
ャデータ量１０２を入力とし、プログラムストリーム１
０３を生成して出力する装置である。本実施例では、Ｖ
ＯＢＵ毎にビットレートを可変にした可変ビットレート
方式で圧縮された映像ストリーム１００を用いた場合の
プログラムストリーム生成装置を示す。FIG. 1 shows an embodiment of a program stream generating device according to the present invention. The program stream generation device 1 receives the compressed video stream 100, the audio stream 101, and the picture data amount 102 included in the video stream as input, and
Is a device for generating and outputting 03. In this embodiment, V
1 shows a program stream generation device when a video stream 100 compressed by a variable bit rate method in which a bit rate is variable for each OBU is used.

【００１３】初期情報算出部１１は、ＶＯＢＵ毎に、ピ
クチャデータ量をもとに、以下の時刻情報の初期値を算
出するブロックである。プログラムストリーム生成開始
時、またはＶＯＢＵ生成完了時に制御部１０から供給さ
れるＶＯＢＵ生成完了通知１５１を受けた場合、次のＶ
ＯＢＵに対する初期値算出を開始する。算出する初期時
刻情報は、映像復号開始時刻値（ＤＴＳ）１１０と、音
声復号及び表示開始時刻値（ＰＴＳ）１１１と、先頭音
声パックに含まれる音声フレームデータを復号装置の音
声入力バッファに取り込むべき時刻を示す音声基準時刻
初期値（ＳＣＲａ[０]）１１６と、各パックヘッダに記
述する基準時刻値ＳＣＲの差分を示す基準時刻増分値
（ΔＳＣＲ）１１３と、各パックにＳＣＲ値を配分した
時に生じる端数時刻値、すなわち基準時刻端数値（残Ｓ
ＣＲ）１１４である。また、ＶＯＢＵにおける先頭音声
パック挿入位置（ａ＿ｐａｃｋ＿ｓｔａｒｔ）１１２
と、ＶＯＢＵを構成する映像パックと音声パックのパッ
ク挿入比率１１５を算出する。The initial information calculation unit 11 is a block for calculating the initial value of the following time information for each VOBU based on the picture data amount. When the VOBU generation completion notification 151 supplied from the control unit 10 is received when the program stream generation is started or when the VOBU generation is completed, the next V
The initial value calculation for OBU is started. As the initial time information to be calculated, the video decoding start time value (DTS) 110, the audio decoding and display start time value (PTS) 111, and the audio frame data included in the first audio pack should be taken into the audio input buffer of the decoding device. When the audio reference time initial value (SCRa [0]) 116 indicating the time, the reference time increment value (ΔSCR) 113 indicating the difference between the reference time values SCR described in each pack header, and the SCR value allocated to each pack The resulting fractional time value, that is, the reference time fractional value (remaining S
CR) 114. Also, the head audio pack insertion position (a_pack_start) 112 in VOBU 112
Then, the pack insertion ratio 115 of the video pack and the audio pack forming the VOBU is calculated.

【００１４】時刻情報算出部１４は、初期情報算出部１
１から供給される映像復号開始時刻値（ＤＴＳ）１１０
と、音声復号及び表示開始時刻値（ＰＴＳ）１１１と、
音声基準時刻初期値（ＳＣＲａ[０]）１１６と、基準時
刻増分値（ΔＳＣＲ）１１３と、基準時刻端数値（残Ｓ
ＣＲ）１１４を入力する。そして、映像パックヘッダ生
成部１３に対して、映像パックヘッダ生成時に、パック
に応じた映像復号及び表示開始時刻値（ＤＴＳ及びＰＴ
Ｓ）１２０を供給する。また、音声パックヘッダ生成部
１７に対して、音声パックヘッダ生成時に、パックに応
じた音声復号及び表示開始時刻値（ＰＴＳ）１２１を供
給する。さらに、映像パックヘッダ生成部１３及び、音
声パックヘッダ生成部１７に対して、各パックに付加す
べき基準時刻値（ＳＣＲ）１２３を供給する。The time information calculation unit 14 is the initial information calculation unit 1.
1 video decoding start time value (DTS) 110
And voice decoding and display start time value (PTS) 111,
Voice reference time initial value (SCRa [0]) 116, reference time increment value (ΔSCR) 113, reference time fractional value (remaining S
(CR) 114 is input. Then, when the video pack header is generated, the video pack header generation unit 13 receives the video decoding and display start time values (DTS and PT) according to the pack.
S) 120 is supplied. Further, the audio decoding and display start time value (PTS) 121 corresponding to the pack is supplied to the audio pack header generation unit 17 when the audio pack header is generated. Further, the reference time value (SCR) 123 to be added to each pack is supplied to the video pack header generation unit 13 and the audio pack header generation unit 17.

【００１５】制御部１０は、初期情報算出部１１から供
給される先頭音声パック挿入位置１１２とパック挿入比
率１１５をもとに、映像パック生成部１２と、音声パッ
ク生成部１６に、パック生成要求１５０を発行し、生成
された映像パック１３４及び音声パック１４４をまとめ
て、プログラムストリーム１０３として出力するブロッ
クである。また、ＶＯＢＵ生成完了時に、初期情報算出
部１１に対してＶＯＢＵ生成完了通知１５１を発行す
る。Based on the head audio pack insertion position 112 and the pack insertion ratio 115 supplied from the initial information calculation unit 11, the control unit 10 requests the video pack generation unit 12 and the audio pack generation unit 16 to generate a pack. This is a block that issues 150, collects the generated video pack 134 and audio pack 144, and outputs them as the program stream 103. Further, when the VOBU generation is completed, the VOBU generation completion notification 151 is issued to the initial information calculation unit 11.

【００１６】映像パック生成部１２は、パック生成要求
１５０が映像パック生成を要求していた場合に、映像パ
ックヘッダ生成部１３に対して映像パックヘッダ生成要
求１３０を発行することで映像ヘッダデータ１３１を取
得し、映像データ転送部１５に映像データ要求１３２を
発行することでパック生成に必要な映像データ１３３を
取得して、映像パックを生成するブロックである。The video pack generation unit 12 issues a video pack header generation request 130 to the video pack header generation unit 13 when the pack generation request 150 requests the video pack generation, and thereby the video pack header data 131. Is acquired, and the video data request 132 is issued to the video data transfer unit 15 to acquire the video data 133 necessary for pack generation and generate a video pack.

【００１７】音声パック生成部１６は、パック生成要求
１５０が音声パック生成を要求していた場合に、音声パ
ックヘッダ生成部１７に対して音声パックヘッダ生成要
求１４０を発行することで音声ヘッダデータ１４１を取
得し、音声データ転送部１９に音声データ要求１４２を
発行することでパック生成に必要な音声データ１４３を
取得して、音声パックを生成するブロックである。The audio pack generation unit 16 issues an audio pack header generation request 140 to the audio pack header generation unit 17 when the pack generation request 150 requests the audio pack generation, and thereby the audio header data 141 is issued. Is acquired and the audio data request 142 is issued to the audio data transfer unit 19 to acquire the audio data 143 necessary for the pack generation, and the audio pack is generated.

【００１８】映像パックヘッダ生成部１３は、映像パッ
ク生成部１２からの映像ヘッダ要求１３０を受け付けた
場合に、時刻情報算出部１４から基準時刻値ＳＣＲ１２
３を取得してパックヘッダを生成し、さらに時刻情報算
出部１４から映像時刻情報（ＤＴＳ／ＰＴＳ）１２０を
取得してパケットヘッダを生成し、生成したヘッダデー
タを映像パック生成部１２に供給するブロックである。When the video pack header generation unit 13 receives the video header request 130 from the video pack generation unit 12, the time information calculation unit 14 outputs the reference time value SCR12.
3 is acquired to generate a pack header, the video time information (DTS / PTS) 120 is further acquired from the time information calculation unit 14, a packet header is generated, and the generated header data is supplied to the video pack generation unit 12. It is a block.

【００１９】映像データ転送部１５は、映像パック生成
部１２からの映像データ要求１３２を受け付けた場合
に、映像ストリーム１００から映像パック生成に必要な
ペイロード分の映像データ１３３を取り出して、映像パ
ック生成部１２に供給するブロックである。When the video data transfer unit 15 receives the video data request 132 from the video pack generation unit 12, the video data transfer unit 15 takes out the video data 133 of the payload required for the video pack generation from the video stream 100 to generate the video pack. This block is supplied to the unit 12.

【００２０】音声パックヘッダ生成部１７は、音声パッ
ク生成部１６からの音声ヘッダ要求１４０を受け付けた
場合に、時刻情報算出部１４から基準時刻値ＳＣＲ１２
３を取得してパックヘッダを生成し、さらに時刻情報算
出部１４から音声時刻情報（ＰＴＳ）１２１を取得して
パケットヘッダを生成し、生成したヘッダデータを音声
パック生成部１６に供給するブロックである。When the audio pack header generation unit 17 receives the audio header request 140 from the audio pack generation unit 16, the audio pack header generation unit 17 receives the reference time value SCR12 from the time information calculation unit 14.
3 to obtain a pack header, further obtain the audio time information (PTS) 121 from the time information calculation unit 14 to generate a packet header, and supply the generated header data to the audio pack generation unit 16. is there.

【００２１】音声データ転送部１９は、音声パック生成
部１６からの音声データ要求１４２を受け付けた場合
に、音声ストリーム１０１から音声パック生成に必要な
ペイロード分の音声データ１４３を取り出して、音声パ
ック生成部１６に供給するブロックである。When the audio data transfer unit 19 receives the audio data request 142 from the audio pack generation unit 16, the audio data transfer unit 19 extracts the audio data 143 corresponding to the payload required for the audio pack generation from the audio stream 101 to generate the audio pack. This is a block supplied to the unit 16.

【００２２】本プログラムストリーム生成装置は、ＶＯ
ＢＵ毎にピクチャデータ量を取得し、これをもとに基準
時刻増分値（ΔＳＣＲ）１１３および、音声パックと映
像パックの挿入比率を算出し、ＶＯＢＵ毎に音声パック
と映像パックを平均的に生成することができる。This program stream generating device is a VO
The picture data amount is acquired for each BU, and the reference time increment value (ΔSCR) 113 and the insertion ratio of the audio pack and the video pack are calculated based on this, and the audio pack and the video pack are generated on average for each VOBU. can do.

【００２３】図２は、図１に示すプログラムストリーム
生成装置を構成する初期情報算出部１１の詳細図であ
る。映像ビットレート算出部２０は、ＶＯＢＵ毎に映像
ビットレート２００を算出するブロックである。ＶＯＢ
Ｕの再生時間長（Ｔgop[秒]）と、ＶＯＢＵに含まれる
ピクチャデータ量１０２の総和（Ｄv[ビット]）を用い
ると、映像ビットレート（ＢＲv[bps]）２００は、Ｄv
／Ｔgop[bps]で表すことができる。FIG. 2 is a detailed diagram of the initial information calculating section 11 which constitutes the program stream generating apparatus shown in FIG. The video bit rate calculation unit 20 is a block that calculates the video bit rate 200 for each VOBU. VOB
Using the reproduction time length of U (Tgop [seconds]) and the total sum (Dv [bits]) of the picture data amount 102 included in VOBU, the video bit rate (BRv [bps]) 200 is Dv.
It can be represented by / Tgop [bps].

【００２４】映像パック算出部２３は、ピクチャデータ
量１０２から、ＶＯＢＵ内に含まれる映像パック数（Ｎ
vpck）２０１を算出するブロックである。音声パック算
出部２４は、ＶＯＢＵ内に含まれる音声フレーム数をも
とに音声データ量を算出し、このデータ量をもとに、Ｖ
ＯＢＵに含まれる音声パック数（Ｎapck）２０２を算出
するブロックである。From the picture data amount 102, the video pack calculation unit 23 determines the number of video packs (N included in the VOBU).
This is a block for calculating vpck) 201. The audio pack calculator 24 calculates the audio data amount based on the number of audio frames included in the VOBU, and based on this data amount, V
This is a block for calculating the number of audio packs (Napck) 202 included in the OBU.

【００２５】初期映像時刻情報算出部２１は、映像ビッ
トレート（ＢＲv[bps]）２００の速度で、映像入力バッ
ファに所定量Ｎ0蓄積されるまでの時刻値を、先頭ピク
チャの復号開始時刻すなわち映像復号開始時刻（ＤＴ
Ｓ）１１０として出力するブロックである。ＤＶＤ−Ｒ
ＡＭ規格で規定されている映像入力バッファの大きさ
は、ＭＰＥＧ２を用いた場合に、２３２ＫＢｙｔｅであ
る。Ｎ0は、２３２ＫＢｙｔｅを超えてはならない。初期
音声時刻情報算出部２２は、音声復号及び表示開始時刻
（ＰＴＳ）１１１と、先頭音声パックに含まれる音声フ
レームデータを復号装置の音声入力バッファに取り込む
べき時刻を示す音声基準時刻初期値（ＳＣＲａ[０]）１
１６を算出するブロックである。The initial video time information calculation unit 21 calculates the time value until a predetermined amount N0 is accumulated in the video input buffer at a video bit rate (BRv [bps]) of 200, which is the decoding start time of the first picture, that is, the video. Decoding start time (DT
S) 110 is a block to be output. DVD-R
The size of the video input buffer defined by the AM standard is 232 KBytes when MPEG2 is used. N0 must not exceed 232 KBytes. The initial audio time information calculation unit 22 includes an audio decoding and display start time (PTS) 111 and an audio reference time initial value (SCRa) indicating the time at which the audio frame data included in the first audio pack should be taken into the audio input buffer of the decoding device. [0]) 1
This is a block for calculating 16.

【００２６】音声復号及び表示開始時刻（ＰＴＳ）１１
１は、映像復号開始時刻（ＤＴＳ）１１０と同一とした
場合、音声基準時刻初期値（ＳＣＲａ[０]）１１６は、
映像表示開始時刻（ＰＴＳ）１１０の時点で１音声フレ
ーム以上蓄積されている状態で、かつ、音声入力バッフ
ァ以下の状態となる条件で算出する。Speech decoding and display start time (PTS) 11
1 is the same as the video decoding start time (DTS) 110, the audio reference time initial value (SCRa [0]) 116 is
The calculation is performed under the condition that one audio frame or more is accumulated at the time of the video display start time (PTS) 110 and that the audio input buffer is below the audio input buffer.

【００２７】先頭音声パック挿入位置算出部２５は、音
声基準時刻初期値（ＳＣＲａ[０]）１１６までの時刻に
送出される映像パック数を算出する。なお、算出にあた
っては、ＶＯＢＵ内のパック基準時刻間隔を示す基準時
刻増分（ΔＳＣＲ）１１３を基準に、ＳＣＲa[０]内に
含まれる映像パック数を算出する。得られた映像パック
数の映像パックを送出した次のパック位置にＶＯＢＵ先
頭の音声パックを挿入する。また、ＶＯＢが複数のＶＯ
ＢＵで構成される場合、ＶＯＢ内の２番目以降のＶＯＢ
Ｕに関しては、先頭音声挿入位置１１２は１とする。２
番目以降のＶＯＢＵの先頭音声パックをＶＯＢＵ先頭に
持ってくることで、直前のＶＯＢＵの末尾音声パックと
の間隔が広がること避け、ＶＯＢＵ境界における音声入
力バッファが空になることを防ぐことを可能とする。The head audio pack insertion position calculation unit 25 calculates the number of video packs to be transmitted at times up to the audio reference time initial value (SCRa [0]) 116. In the calculation, the number of video packs included in SCRa [0] is calculated based on the reference time increment (ΔSCR) 113 indicating the pack reference time interval in VOBU. The audio pack at the head of the VOBU is inserted at the next pack position after the video packs of the obtained number of video packs have been transmitted. In addition, VOB is a plurality of VO
When configured with BU, second and subsequent VOBs in VOB
For U, the head voice insertion position 112 is 1. Two
By bringing the first audio pack of the second and subsequent VOBUs to the beginning of the VOBU, it is possible to prevent the interval with the last audio pack of the immediately preceding VOBU from increasing and prevent the audio input buffer from becoming empty at the VOBU boundary. To do.

【００２８】基準時刻増分算出部２６は、ＶＯＢＵの再
生時間長（Ｔgop[秒]）を、ＶＯＢＵに含まれる映像パ
ック数（Ｎvpck）２０１及び音声パック数（Ｎapck）２
０２の総和で均等配分することで得られる時刻値を各パ
ック間の基準時刻増分（ΔＳＣＲ）１１３として、供給
するブロックである。また、配分時に生じる基準時刻の
端数分、すなわち基準時刻端数値１１４を出力するブロ
ックである。The reference time increment calculation unit 26 determines the reproduction time length (Tgop [seconds]) of the VOBU as the number of video packs (Nvpck) 201 and the number of audio packs (Napck) 2 included in the VOBU.
In this block, the time value obtained by evenly distributing the sum of 02 as the reference time increment (ΔSCR) 113 between the packs is supplied. In addition, it is a block that outputs a fraction of the reference time that occurs during distribution, that is, a reference time fractional value 114.

【００２９】パック挿入比率算出部２７は、映像パック
数（Ｎvpck）２０１と音声パック数（Ｎapck）２０２を
用いて、映像パック群に、音声パックを均等に配分挿入
した時の音声パックと映像パックの比率を算出し、パッ
ク挿入比率１１５として、制御部１０に供給するブロッ
クである。The pack insertion ratio calculation unit 27 uses the number of video packs (Nvpck) 201 and the number of audio packs (Napck) 202 to divide the audio packs into the video pack group evenly and insert the audio packs and the video packs. Is a block for calculating the ratio of the pack insertion ratio and supplying it as the pack insertion ratio 115 to the control unit 10.

【００３０】図６及び図７は、図１、図２で示す初期情
報算出部を用いて生成されるプログラムストリームの模
式図を示したものである。図６は、ＶＯＢ先頭のＶＯＢ
Ｕの構成を示した図である。映像入力バッファが所定量
Ｎ0蓄積されるまでの期間、映像パックのみで構成した
後、先頭音声パック６０３を挿入する。また、パック挿
入比率１１４は、図中の音声パック間隔とする。図７
は、ＶＯＢ内の２番目以降のＶＯＢＵの構成を示した図
である。先頭音声パック挿入位置１１２は１とする。FIGS. 6 and 7 are schematic diagrams of a program stream generated by using the initial information calculating section shown in FIGS. 1 and 2. FIG. 6 shows the VOB at the beginning of the VOB.
It is the figure which showed the structure of U. The head audio pack 603 is inserted after the video input buffer is composed of only the video pack until the predetermined amount N0 is accumulated. The pack insertion ratio 114 is the audio pack interval in the figure. Figure 7
FIG. 6 is a diagram showing a configuration of second and subsequent VOBUs in a VOB. The head audio pack insertion position 112 is 1.

【００３１】本実施例のプログラムストリーム生成装置
では、ＤＶＤ−ＲＡＭ等に用いられる固定長パックを用
いたプログラムストリームを生成する場合に、ＶＯＢＵ
内に平均的に音声パック及び映像パックを配置すること
を可能とする。そのために、ＶＯＢＵ毎に、得られるピ
クチャデータ量から、あらかじめ音声パック数及び映像
パック数を算出し、前記基準時刻増分（ΔＳＣＲ）１１
３及び、パック挿入比率１１５を算出することを特徴と
する。In the program stream generating apparatus of this embodiment, VOBU is used when generating a program stream using a fixed-length pack used in a DVD-RAM or the like.
It is possible to arrange the audio pack and the video pack on average in the inside. Therefore, for each VOBU, the number of audio packs and the number of video packs are calculated in advance from the obtained picture data amount, and the reference time increment (ΔSCR) 11
3 and the pack insertion ratio 115 are calculated.

【００３２】図３は、図１に示すプログラムストリーム
生成装置を構成する時刻情報算出部１４の詳細図であ
る。基準時刻補正部３４は、基準時刻端数値１１４を最
小時刻単位に分割し、ＶＯＢＵ先頭パックから順に加算
するブロックである。すなわち、ＶＯＢＵ先頭からパッ
ク生成毎に基準時刻補正値３１０として１を発行する。
基準時刻端数値１１４残量が空になった時点で、基準時
刻補正値３１０として０を発行する。FIG. 3 is a detailed diagram of the time information calculation unit 14 which constitutes the program stream generation apparatus shown in FIG. The reference time correction unit 34 is a block that divides the reference time fractional value 114 into minimum time units and adds them sequentially from the VOBU head pack. That is, 1 is issued as the reference time correction value 310 from the beginning of the VOBU each time a pack is generated.
When the remaining amount of the reference time fractional value 114 becomes empty, 0 is issued as the reference time correction value 310.

【００３３】基準時刻算出部３３は、パック生成毎に、
直前のパックが有する基準時刻値（ＳＣＲ）１２３に基
準時刻増分値（ΔＳＣＲ）１１３を加算し、さらに基準
時刻補正部３４から供給される基準時刻補正値３１０を
加算することで、基準時刻値（ＳＣＲ）１２３を生成す
るブロックである。The reference time calculation unit 33, for each pack generation,
By adding the reference time increment value (ΔSCR) 113 to the reference time value (SCR) 123 of the immediately preceding pack, and further adding the reference time correction value 310 supplied from the reference time correction unit 34, the reference time value ( SCR) 123 is generated.

【００３４】映像時刻情報算出部３０は、生成対象とな
る映像パックにＭＰＥＧアクセスユニットが含まれる場
合、直前のアクセスユニットを含む映像パックが有する
映像時刻情報（ＤＴＳ）に、直前のアクセスユニットを
含む映像パックからのアクセスユニット分の時刻間隔を
付加することで、映像時刻情報（ＤＴＳ／ＰＴＳ）１２
０を算出するブロックである。なお、アクセスユニット
は、ＭＰＥＧ規格（ISO/IEC 13818-1 2.1.1 access uni
t）で定義されるものである。When the video pack to be generated includes an MPEG access unit, the video time information calculation unit 30 includes the immediately preceding access unit in the video time information (DTS) of the video pack including the immediately preceding access unit. By adding the time interval for the access unit from the video pack, the video time information (DTS / PTS) 12
This is a block for calculating 0. The access unit is an MPEG standard (ISO / IEC 13818-1 2.1.1 access uni
defined in t).

【００３５】音声時刻情報算出部３１は、生成対象とな
る音声パックに音声フレームヘッダが含まれる場合、直
前の音声フレームヘッダを含む音声パックが有する音声
時刻情報（ＰＴＳ）に、直前の音声フレームを含む音声
パックからの音声フレーム分の時刻間隔を付加すること
で、音声時刻情報（ＰＴＳ）１２１を算出するブロック
である。When the audio pack to be generated includes an audio frame header, the audio time information calculation unit 31 adds the immediately preceding audio frame to the audio time information (PTS) included in the audio pack including the immediately preceding audio frame header. This is a block for calculating the audio time information (PTS) 121 by adding the time interval for the audio frame from the included audio pack.

【００３６】図５は、本発明のプログラムストリーム生
成装置を記録再生装置に適用した場合の実施例の一例で
ある。本記録再生装置は、ＣＣＤカメラ・マイク等で得
られた映像音声信号を、プログラムストリーム信号に変
換してＤＶＤ等の記録媒体に記録再生するものである。FIG. 5 is an example of an embodiment in which the program stream generating apparatus of the present invention is applied to a recording / reproducing apparatus. The recording / reproducing apparatus converts a video / audio signal obtained by a CCD camera, a microphone or the like into a program stream signal, and records / reproduces it on / from a recording medium such as a DVD.

【００３７】映像入力手段９０はアナログ映像原信号８
００を取り込み、映像入力部８０はデジタル映像原信号
に変換し、映像エンコーダ部８１は符号化処理を行な
い、ピクチャデータ量１０２と映像ストリーム１００を
出力する。音声入力手段９１はアナログ音声原信号８０
２を取り込み、音声入力部８２はデジタル音声原信号８
０３に変換し、音声エンコーダ部８３は符号化処理を行
ない、音声ストリーム１０１を出力する。これら映像ス
トリーム１００、音声ストリーム１０１、ピクチャデー
タ量１０２をもとに、前記説明した本発明に係るプログ
ラムストリーム生成装置９４はプログラムストリームを
作成する。蓄積部８９は、ＤＶＤ―ＲＡＭ等の蓄積メデ
ィアにプログラムストリームの記録及び再生を行なう。The image input means 90 uses the analog image original signal 8
00, the video input unit 80 converts it into a digital video original signal, the video encoder unit 81 performs the encoding process, and outputs the picture data amount 102 and the video stream 100. The voice input means 91 is an analog voice original signal 80.
2 is input, and the audio input unit 82 outputs the original digital audio signal 8
03, the audio encoder unit 83 performs encoding processing, and outputs the audio stream 101. Based on the video stream 100, the audio stream 101, and the picture data amount 102, the above-described program stream generation device 94 according to the present invention creates a program stream. The storage unit 89 records and reproduces the program stream on a storage medium such as a DVD-RAM.

【００３８】プログラムストリーム分離部８８は、蓄積
部８９から読み出したプログラムストリーム８１１を、
映像ストリーム８０５と音声ストリーム８０８に分離す
る。映像デコーダ部８５は、映像デジタル復号化信号８
０６を生成し、映像出力部８４は、映像アナログ復号化
信号８０７に変換してモニタ等の映像出力手段９２に出
力する。音声デコーダ部８７は、音声デジタル復号化信
号８０９を生成し、音声出力部８６は、音声アナログ復
号化信号８１０に変換してスピーカ等の音声出力手段９
３に出力する。The program stream separation unit 88 stores the program stream 811 read from the storage unit 89,
It is separated into a video stream 805 and an audio stream 808. The video decoder unit 85 uses the video digital decoded signal 8
06, and the video output unit 84 converts the video analog decoded signal 807 into a video output means 92 such as a monitor. The audio decoder unit 87 generates an audio digital decoded signal 809, and the audio output unit 86 converts the audio analog decoded signal 810 into audio output means 9 such as a speaker.
Output to 3.

【００３９】プログラムストリーム８１１は、プログラ
ムストリーム生成部９４で付加された時刻情報（ＳＣ
Ｒ）に従って、蓄積部８９からプログラムストリーム分
離部８８に転送される。また、分離された映像ストリー
ム８０５は、プログラムストリーム生成部９４において
付加された映像時刻情報（ＤＴＳ/ＰＴＳ）に従って、
映像デコーダ部８５で復号化される。分離された音声ス
トリーム８０８は、プログラムストリーム生成部９４に
おいて付加された音声時刻情報（ＰＴＳ）に従って音声
デコーダ部８７で復号化される。The program stream 811 is the time information (SC
According to R), the data is transferred from the storage unit 89 to the program stream separation unit 88. In addition, the separated video stream 805 is, in accordance with the video time information (DTS / PTS) added by the program stream generation unit 94,
It is decoded by the video decoder unit 85. The separated audio stream 808 is decoded by the audio decoder unit 87 according to the audio time information (PTS) added by the program stream generation unit 94.

【００４０】本記録再生装置は、カメラ一体型の記録再
生装置としても用いられるし、蓄積メディアは、ＤＶＤ
―ＲＡＭ等以外に磁気媒体・半導体メモリなども可能で
ある。This recording / reproducing apparatus is also used as a recording / reproducing apparatus integrated with a camera, and the storage medium is a DVD.
-In addition to RAM and the like, magnetic media and semiconductor memory are also possible.

【００４１】[0041]

【発明の効果】本発明によれば、固定長パックを用いた
プログラムストリームを生成する場合に、音声映像パッ
ク群（ＶＯＢＵ）毎に、そのピクチャデータ量からあら
かじめ音声パック数及び映像パック数を算出し、基準時
刻増分（ΔＳＣＲ）及びパック挿入比率を算出すること
により、ＶＯＢＵ内に平均的に音声パック及び映像パッ
クを配置することができる。これより、限られた容量の
メディアに対してより長時間の音声映像データを格納可
能なプログラムストリーム生成装置が実現できる。According to the present invention, when a program stream using a fixed-length pack is generated, the number of audio packs and the number of video packs are calculated in advance from the picture data amount of each audio / video pack group (VOBU). Then, by calculating the reference time increment (ΔSCR) and the pack insertion ratio, the audio packs and the video packs can be arranged on average in the VOBU. As a result, it is possible to realize a program stream generation device capable of storing audio / video data for a longer time in a medium having a limited capacity.

[Brief description of drawings]

【図１】本発明に係るプログラムストリーム生成装置
の一実施形態を示すブロック図である。FIG. 1 is a block diagram showing an embodiment of a program stream generation device according to the present invention.

【図２】図１に示す初期情報算出部１１のブロック図
の詳細な一例である。FIG. 2 is a detailed example of a block diagram of an initial information calculation unit 11 shown in FIG.

【図３】図１に示す時刻情報算出部１４のブロック図
の詳細な一例である。FIG. 3 is a detailed example of a block diagram of a time information calculation unit 14 shown in FIG.

【図４】ＤＶＤ−ＲＡＭ規格におけるプログラムスト
リームの構成概略である。FIG. 4 is a schematic structure of a program stream in the DVD-RAM standard.

【図５】プログラムストリーム生成装置を用いた記録
再生装置の一例である。FIG. 5 is an example of a recording / reproducing apparatus using a program stream generating apparatus.

【図６】ＶＯＢ内の先頭ＶＯＢＵの構成例である。FIG. 6 is a configuration example of a head VOBU in a VOB.

【図７】ＶＯＢ内の２番目以降のＶＯＢＵの構成例で
ある。FIG. 7 is a configuration example of second and subsequent VOBUs in a VOB.

[Explanation of symbols]

１０・・・制御部１１・・・初期情報算出部１２・・・映像パック生成部１３・・・映像パックヘッダ生成部１４・・・時刻情報算出部１５・・・映像データ転送部１６・・・音声パック生成部１７・・・音声パックヘッダ生成部１９・・・音声データ転送部２０・・・映像ビットレート算出部２１・・・初期映像時刻情報算出部２２・・・初期音声時刻情報算出部２３・・・映像パック数算出部２４・・・音声パック数算出部２５・・・先頭音声パック挿入位置算出部２６・・・基準時刻増分値算出部２７・・・パック挿入比率算出部８１・・・映像エンコーダ部８３・・・音声エンコーダ部８５・・・音声デコーダ部８７・・・音声デコーダ部８８・・・プログラムストリーム分離部８９・・・蓄積部９４・・・プログラムストリーム生成部１２０・・・映像復号及び表示開始時刻値（ＤＴＳ／Ｐ
ＴＳ）１２１・・・音声復号及び表示開始時刻値（ＰＴＳ）１２３・・・基準時刻値（ＳＣＲ）10 ... Control unit 11 ... Initial information calculation unit 12 ... Video pack generation unit 13 ... Video pack header generation unit 14 ... Time information calculation unit 15 ... Video data transfer unit 16 ... Audio pack generation unit 17 ... Audio pack header generation unit 19 ... Audio data transfer unit 20 ... Video bit rate calculation unit 21 ... Initial video time information calculation unit 22 ... Initial audio time information calculation Part 23 ... Video pack number calculation unit 24 ... Audio pack number calculation unit 25 ... Leading audio pack insertion position calculation unit 26 ... Reference time increment value calculation unit 27 ... Pack insertion ratio calculation unit 81 ... video encoder section 83 ... audio encoder section 85 ... audio decoder section 87 ... audio decoder section 88 ... program stream separation section 89 ... storage section 94 ... program stream raw Part 120 ... video decoding and display start time value (DTS / P
TS) 121 ... Voice decoding and display start time value (PTS) 123 ... Reference time value (SCR)

───────────────────────────────────────────────────── フロントページの続き (51)Int.Cl.⁷ 識別記号ＦＩテーマコート゛(参考）Ｈ０４Ｎ 5/85 Ｈ０４Ｎ 5/92 Ｈ 7/24 7/13 Ｚ (72)発明者高橋将神奈川県横浜市戸塚区吉田町292番地株式会社日立製作所デジタルメディア開発本部内Ｆターム(参考） 5C052 AA04 AB08 5C053 FA24 GB26 GB38 JA22 KA24 5C059 KK22 MA00 RB09 RC04 SS13 TA00 TB00 TC18 UA32 5D044 AB05 AB07 BC06 CC06 DE02 DE03 DE12 DE14 DE25 DE96 GK08 GK12 5D110 AA17 AA27 AA29 BB27 DA11 DB02 ─────────────────────────────────────────────────── ─── Continuation of the front page (51) Int.Cl. ⁷ Identification code FI theme code (reference) H04N 5/85 H04N 5/92 H 7/24 7/13 Z (72) Inventor Masa Takahashi Yokohama, Kanagawa F-Term (Reference) 5C052 AA04 AB08 5C053 FA24 GB26 GB38 JA22 KA24 5C059 KK22 MA00 RB09 RC04 SS13 TA00 TB00 TC18 UA32 5D044 AB05 AB07 BC06 CC06 DE02 DE03 DE12 DE96 GK08 GK12 5D110 AA17 AA27 AA29 BB27 DA11 DB02

Claims

[Claims]

1. A program stream including an audio / video pack group based on a video stream including a picture data group compressed at a variable bit rate and an audio stream including a compressed audio frame group. In a program stream generation device that generates and outputs to a decoding device, a video data transfer unit that extracts and transfers video data of a payload required for video pack generation from the video stream, and a fixed-length video data transferred from the transferred video stream. A video pack generation unit for generating a video pack, a video reference time value at which the picture data included in the video pack should be captured in the video input buffer of the decoding device,
A video time information calculation unit that calculates a time value for starting decoding and displaying the video time, a video pack header generation unit that describes the calculated video time information in the header of the video pack, and an audio pack from the audio stream. An audio data transfer unit that extracts and transfers audio data for a payload required for generation, an audio pack generation unit that generates a fixed-length audio pack from the transferred audio stream, and audio frame data included in the audio pack Of the audio reference time value to be loaded into the audio input buffer of the decoding device, and an audio time information calculation unit for calculating a time value for starting decoding and displaying the audio reference time value, and the calculated audio time information for the audio pack. An audio pack header generation unit described in the header, an initial value and an increment value of the video time information, and an initial value of the audio time information And an increment value and a leading audio pack insertion position are calculated, and an initial information calculation unit that calculates a pack insertion ratio of a video pack and an audio pack in the audio / video pack group based on the amount of picture data, and the leading audio Issue a video pack and audio pack generation request to the video pack generation unit and the audio pack generation unit according to the pack insertion position and the pack insertion ratio, and notify the initial information calculation unit when the generation of the audio / video pack group is completed. A program stream generation device, comprising:

2. The program stream generation device according to claim 1, wherein the initial information calculation unit is based on the amount of picture data included in the audio / video pack group to be generated, and the video bits in the audio / video pack group. A video bit rate calculation unit for calculating a rate; a video pack number calculation unit for calculating the number of fixed-length video packs included in the audio / video pack group based on the picture data amount; An audio pack number calculation unit that calculates the number of fixed-length audio packs included, and calculates the time at which a predetermined amount of data is accumulated in the video input buffer of the decoding device based on the video bit rate, and performs video decoding. An initial video time information calculation unit that outputs the initial value of the start time, and based on the video decoding start time initial value, the initial value of the audio decoding start time and the start audio packet. An initial audio time information calculation unit that calculates and outputs an initial value of the audio reference time to be taken into the audio input buffer of the decoding device; and a video input buffer in the video input buffer during a period up to the time indicated by the initial value of the audio reference time. Calculate the number of captured video packs,
A head audio pack insertion position calculation unit having the next pack as a head audio pack insertion position, and the time intervals of the audio / video pack group are equally distributed by the sum of the video pack number and the audio pack number. A reference time increment calculation unit that outputs a time interval between packs as a reference time increment value, and an audio pack interval when a video pack and an audio pack are uniformly arranged in the audio / video pack group, as a pack insertion ratio. A program stream generation device comprising: a pack insertion ratio calculation unit.

3. The program stream generation device according to claim 2, wherein the leading audio pack insertion position calculation unit calculates an initial value of the audio reference time from a video reference time at which a leading video pack is loaded into a video input buffer of a decoding device. , The number of divisions obtained by dividing the period up to the time indicated by the reference time increment value is calculated as the number of video packs to be taken in by the time indicated by the audio reference time initial value, from which the first audio pack is inserted. A program stream generation device characterized by obtaining a position.

4. The program stream generation device according to claim 2, wherein the reference time increment calculation unit equalizes the time intervals of the audio / video pack group with the sum of the number of video packs and the number of audio packs. A program stream generation device, which further outputs a fractional value of a reference time generated when a time interval between packs is calculated, when distributed.

5. The program stream generation device according to claim 1, wherein the time information calculation unit adds the reference time increment value to the reference time value of the immediately preceding generation pack. Then, a reference time calculation unit that calculates a reference time value to be added to the generated pack, and 0 or a predetermined time value as the reference time value for the first video pack, and for the second and subsequent video packs including the access unit. Then, the video time information calculation unit that outputs the time value obtained by adding the time values indicated by the access unit intervals as the reference time value, and the audio decoding start time initial value for the first audio pack as the reference time value. Also, for the second and subsequent audio packs that include audio frame headers, the number of audio frames from the audio pack that includes the immediately preceding audio frame header is supported. A program stream generation device comprising: a voice time information calculation unit that outputs a time value obtained by adding the time values as a reference time value.

6. The program stream generation device according to claim 4, wherein the time information calculation unit adds the reference time increment value to the reference time value of the immediately previous generation pack, and further calculates the reference time fraction value. A reference time calculation unit that outputs a time value obtained by adding unit amounts from the first pack of the audio / video pack group as a reference time value; and 0 or a predetermined time value for the first video pack as a reference time value. Also, for the second and subsequent video packs including the access unit, the video time information calculation unit that outputs the time value obtained by adding the time value indicated by the access unit interval as the reference time value, and the first audio pack On the other hand, using the initial value of the audio decoding start time as the reference time value, and for the second and subsequent audio packs including the audio frame header, the sound immediately before that is added. A program stream generation, comprising: an audio time information calculation unit that outputs a time value obtained by adding time values corresponding to the number of audio frames from an audio pack including a voice frame header as a reference time value. apparatus.

7. The program stream generating device according to claim 1, wherein the audio / video pack group includes at least one MPEG-compressed GOP and an audio frame associated therewith. A characteristic program stream generation device.

8. The program stream generation device according to claim 7, wherein the video data transfer unit has a buffer for accumulating a video stream of GOPs constituting the audio / video pack group, and the data of the GOPs. The program stream generation device, wherein the initial information calculation unit starts a calculation process and a program generation process at the time when is accumulated.

9. A program stream generation device for generating a program stream composed of an audio / video pack group based on a video stream compressed at a variable bit rate and a compressed audio stream, wherein each audio / video pack group is Calculate the video bit rate,
Based on this, the time information calculation unit that calculates the reference time information, the video decoding start time information, and the video decoding display time information that should be captured in the input buffer, and the video pack and the audio pack based on the video bit rate. An audio / video pack generation unit that calculates an insertion ratio and generates an audio / video pack group in which audio and video packs are interleaved, and a program stream generation device.

10. A program stream recording / reproducing apparatus for converting an input analog video and audio signal into a program stream of a compressed digital signal and recording and reproducing the program stream on a storage medium. And a video and audio input unit for converting the video and audio signals, a video and audio encoder unit for compressing and encoding the digital video and audio signals to generate video and audio streams, and generating a program stream from the video and audio streams. 1 or 9, a program stream generation apparatus, a storage unit that records and reproduces the program stream in the storage medium, and a program stream that separates the program stream reproduced from the storage medium into a video stream and an audio stream. A video separation unit, a video and audio decoder unit for decoding the video and audio streams, and a video and audio output unit for converting the decoded digital video and audio signals into analog video and audio signals. A program stream recording / reproducing apparatus characterized by: