JP2004248104A

JP2004248104A - Information processor and information processing method

Info

Publication number: JP2004248104A
Application number: JP2003037371A
Authority: JP
Inventors: Keita Nakamatsu; 慶太仲松; Takuji Himeno; 卓治姫野; Fumiyoshi Abe; 文善阿部
Original assignee: Sony Corp
Current assignee: Sony Corp
Priority date: 2003-02-14
Filing date: 2003-02-14
Publication date: 2004-09-02

Abstract

PROBLEM TO BE SOLVED: To minimize the lip-sync deviation that arises from “join recording”. SOLUTION: An overflow time Te is calculated as the difference between the time information at connection point X of the video data already recorded on the recording medium 50 and the time information at connection point Y of the audio data. The compression-encoding timing of the audio data to be newly recorded on the recording medium 50 is shifted relative to that of the video data by the overflow time Te, and the compression-encoded video data and audio data are recorded on the recording medium 50. COPYRIGHT: (C)2004,JPO&NCIPI

Description

【０００１】
【発明の属する技術分野】
本発明は、ＭＰＥＧ方式で符号化されたビデオデータ、及びオーディオデータを記録媒体に記録する情報処理装置及び情報処理方法に関する。
【０００２】
【従来の技術】
ＭＰＥＧ（ＭｏｖｉｎｇＰｉｃｔｕｒｅＥｘｐｅｒｔＧｒｏｕｐ）方式、例えば、ＭＰＥＧ２方式などの動画像符号化方式によって圧縮符号化されたビデオデータ、及びオーディオデータを磁気記録媒体に記録させる記録装置が考案されている（例えば、特許文献１参照。）。
【０００３】
【特許文献１】
特開２００１−３０９３０５号公報
【０００４】
【発明が解決しようとする課題】
ところで、ＭＰＥＧ方式で圧縮符号化されたビデオデータ、及び、オーディオデータをデコードして再生する場合、口の動きと、音声のタイミングとを合わせるリップシンクを行う必要がある。
【０００５】
ＭＰＥＧ方式では、ビデオデータと、オーディオデータとの同期をとるために、ビデオＰＥＳ（ＰａｃｋｅｔｉｚｅｄＥｌｅｍｅｎｔａｒｙＳｔｒｅａｍ）パケット、及びオーディオＰＥＳパケット、それぞれのＰＥＳヘッダの中に再生時刻情報であるＰＴＳ（ＰｒｅｓｅｎｔａｔｉｏｎＴｉｍｅＳｔａｍｐ）や、デコードのタイミングを指定するＤＴＳ（ＤｅｃｏｄｉｎｇＴｉｍｅＳｔａｍｐ）といった制御情報を記述している。そして、デコード時、再生時には、これらの制御情報を抽出し、抽出した制御情報に基づいて、ビデオデータと、オーディオデータとの同期をとることで、リップシンクずれが生じないようにしている。
【０００６】
ビデオデータには、通常１／３０秒又は、１／２５秒といった一定の長さ（時間）のビデオフレームという画の区切りがある。
【０００７】
これに対して、オーディオデータは、連続的であるためサンプル点以外の区切りを持っていないが、ＭＰＥＧ方式でオーディオデータを圧縮符号化する際の計算範囲を決めるために固有の長さのオーディオフレームが形成される。
【０００８】
このビデオフレームと、オーディオフレームとのフレーム長（時間軸長）は、等しくなることは殆どない。
【０００９】
したがって、磁気記録媒体に、あらかじめ記録されている、ビデオデータ、及びオーディオデータの任意の箇所から、圧縮符号化された新たなビデオデータ、及びオーディオデータを“つなぎ録り”した場合、オーディオデータが時間的に不連続に記録されてしまうことになる。
【００１０】
このように、“つなぎ録り”をすると、デコードのタイミングや、出力のタイミングがＤＴＳ、ＰＴＳで制御可能な範囲を超えて磁気記録媒体に記録されてしまい実用に耐えられない程のリップシンクずれが生じてしまうといった問題がある。
【００１１】
さらに、時間的に不連続となって磁気記録媒体にオーディオデータが記録されることで、再生時におけるデコード処理が正常に実行されない可能性があるといった問題もある。特に、上述のようにしてビデオデータ、及びオーディオデータが“つなぎ録り”された磁気記録媒体を、例えば、外部装置によって再生させる場合、上記外部装置は、上記磁気記録媒体に時間的に不連続にオーディオデータが記録されていることを知ることができないため、正常なデコード処理を期待できないといった問題がある。
【００１２】
そこで、本発明は上述したような問題を解決するために案出されたものであり、ビデオデータ、及びオーディオデータを圧縮符号化して“つなぎ録り”を実行した場合でも、リップシンクずれが最小となって再生されるように、ビデオデータ、及びオーディオデータを記録媒体に記録すると共に、外部装置において、正常にデコード処理されるようにビデオデータ、及びオーディオデータを記録媒体に記録する情報処理装置及び情報処理方法を提供することを目的とする。
【００１３】
【課題を解決するための手段】
上述の目的を達成するために、本発明に係る情報処理装置は、ビデオデータ、及びオーディオデータをＭＰＥＧ方式で圧縮符号化する圧縮符号化手段と、上記圧縮符号化されたビデオデータ、及びオーディオデータを記録媒体に記録させる記録手段とを備えた情報処理装置であって、ビデオデータ、及びオーディオデータがあらかじめ記録されている上記記録媒体上で指定された任意の位置から、新たにビデオデータ、及びオーディオデータをつなぎ録りする際、上記指定された任意の位置近傍に記録されたビデオデータ、及びオーディオデータのヘッダ情報である第１のヘッダ情報をそれぞれ検出し、さらに、上記検出された第１のヘッダ情報の直前に記録されているビデオデータ、及びオーディオデータのヘッダ情報である第２のヘッダ情報をそれぞれ検出するヘッダ情報検出手段と、上記ヘッダ情報検出手段によって検出された第２のヘッダ情報をヘッダ情報とするビデオデータの再生終了時刻情報と、オーディオデータの再生終了時刻情報とを算出し、さらに、上記ビデオデータの再生終了時刻情報と、上記オーディオデータの再生終了時刻情報との差分であるはみ出し時間情報を算出するはみ出し時間情報算出手段と、上記記録媒体に新たに記録するビデオデータの圧縮符号化を開始させるビデオデータ圧縮符号化開始命令を生成して上記圧縮符号化手段に出力し、上記ビデオデータ圧縮符号化開始命令が、上記圧縮符号化手段に出力されてから、上記はみ出し時間情報算出手段によって算出された上記はみ出し時間経過後に、上記記録媒体に新たに記録するオーディオデータの圧縮符号化を開始させるオーディオデータ圧縮符号化開始命令を生成して上記圧縮符号化手段に出力する圧縮符号化命令生成手段と、上記圧縮符号化手段によって圧縮符号化されたビデオデータ、及びオーディオデータを含む記録データストリームを生成する記録データストリーム生成手段と、上記記録データストリーム生成手段によって生成された記録データストリームのヘッダ情報に含まれるＰＴＳ（ＰｒｅｓｅｎｔａｔｉｏｎＴｉｍｅＳｔａｍｐ）、及びＤＴＳ（ＤｅｃｏｄｉｎｇＴｉｍｅＳｔａｍｐ）を、上記第２のヘッダ情報に含まれるＰＴＳ、及びＤＴＳにそれぞれ連続となるよう更新するヘッダ情報更新手段と、上記ヘッダ情報更新手段によって、上記ヘッダ情報が更新された記録データストリームが、上記ヘッダ情報検出手段によって検出された上記第１のヘッダ情報の位置から記録されるよう上記記録手段を制御する制御手段とを備えることを特徴とする。
【００１４】
また、上述の目的を達成するために、本発明に係る情報処理方法は、ビデオデータ、及びオーディオデータをＭＰＥＧ方式で圧縮符号化する圧縮符号化手段と、上記圧縮符号化されたビデオデータ、及びオーディオデータを記録媒体に記録させる記録手段とを備えた情報処理装置の情報処理方法であって、ビデオデータ、及びオーディオデータがあらかじめ記録されている上記記録媒体上で指定された任意の位置から、新たにビデオデータ、及びオーディオデータをつなぎ録りする際、上記指定された任意の位置近傍に記録されたビデオデータ、及びオーディオデータのヘッダ情報である第１のヘッダ情報をそれぞれ検出し、さらに、上記検出された第１のヘッダ情報の直前に記録されているビデオデータ、及びオーディオデータのヘッダ情報である第２のヘッダ情報をそれぞれ検出するヘッダ情報検出工程と、上記ヘッダ情報検出工程によって検出された第２のヘッダ情報をヘッダ情報とするビデオデータの再生終了時刻情報と、オーディオデータの再生終了時刻情報とを算出し、さらに、上記ビデオデータの再生終了時刻情報と、上記オーディオデータの再生終了時刻情報との差分であるはみ出し時間情報を算出するはみ出し時間情報算出工程と、上記記録媒体に新たに記録するビデオデータの圧縮符号化を開始させるビデオデータ圧縮符号化開始命令を生成して上記圧縮符号化手段に出力し、上記ビデオデータ圧縮符号化開始命令が、上記圧縮符号化手段に出力されてから、上記はみ出し時間情報算出工程によって算出された上記はみ出し時間経過後に、上記記録媒体に新たに記録するオーディオデータの圧縮符号化を開始させるオーディオデータ圧縮符号化開始命令を生成して上記圧縮符号化手段に出力する圧縮符号化命令生成工程と、上記圧縮符号化手段によって圧縮符号化されたビデオデータ、及びオーディオデータを含む記録データストリームを生成する記録データストリーム生成工程と、上記記録データストリーム生成工程によって生成された記録データストリームのヘッダ情報に含まれるＰＴＳ（ＰｒｅｓｅｎｔａｔｉｏｎＴｉｍｅＳｔａｍｐ）、及びＤＴＳ（ＤｅｃｏｄｉｎｇＴｉｍｅＳｔａｍｐ）を、上記第２のヘッダ情報に含まれるＰＴＳ、及びＤＴＳにそれぞれ連続となるよう更新するヘッダ情報更新工程と、上記ヘッダ情報更新工程によって、上記ヘッダ情報が更新された記録データストリームが、上記ヘッダ情報検出工程によって検出された上記第１のヘッダ情報の位置から記録されるよう上記記録手段を制御する制御工程とを備えることを特徴とする。
【００１５】
【発明の実施の形態】
以下、本発明に係る情報処理装置及び情報処理方法の実施の形態を図面を参照にして詳細に説明する。
【００１６】
まず、図１を用いて本発明の実施の形態として示す情報処理装置の構成について説明をする。
【００１７】
情報処理装置１は、外部入力部１１と、ＭＰＥＧエンコーダ１２と、ヘッダ情報抽出部１３と、ストリーム記録処理部１４と、ＭＰＥＧエンコーダ制御部１５と、ＰＴＳ／ＤＴＳ更新処理部１６と、擬似データ生成部１７と、ＥＣＣ処理部１８と、記録回路１９と、再生回路２０と、ストリーム再生処理部２１と、外部出力部２２と、ＭＰＥＧデコーダ２３と、制御部２４とを備えている。
【００１８】
情報処理装置１は、当該情報処理装置１に着脱可能なテープ状磁気記録媒体５０にＭＰＥＧ方式で圧縮符号化されたビデオデータ、及びオーディオデータを記録し、記録したビデオデータ、及びオーディオデータを再生可能な装置である。
【００１９】
また、情報処理装置１は、あらかじめビデオデータ、及びオーディオデータが記録されているテープ状磁気記録媒体５０の任意の箇所に、新たなビデオデータ、及びオーディオデータを上書きする“つなぎ録り”を自在にすることができる。
【００２０】
外部入力部１１は、外部からＴＳ（ＴｒａｎｓｐｏｒｔＳｔｒｅａｍ）に多重化されて送信されるＭＰＥＧ圧縮符号化されたビデオデータ及びオーディオデータを、ビデオＰＥＳパケット、オーディオＰＥＳパケットに再編し、ストリーム記録処理部１４に供給する。
【００２１】
ＭＰＥＧエンコーダ１２は、デジタルビデオデータ及びデジタルオーディオデータを、ＭＰＥＧ方式にて圧縮符号化（エンコード）してビデオＥＳ（ＥｌｅｍｅｎｔａｒｙＳｔｒｅａｍ）、オーディオＥＳを生成する。また、ＭＰＥＧエンコーダ１２は、エンコードしたビデオＥＳ、オーディオＥＳをビデオＰＥＳパケット、オーディオＰＥＳパケットに分割し、ストリーム記録処理部１４に供給する。
【００２２】
ＭＰＥＧエンコーダ１２は、検出されるＶＢＶ（ＶｉｄｅｏＢｕｆｆｅｒｉｎｇＶｅｒｉｆｉｅｒ）バッファのデータ占有量に基づいて、ピクチャの発生符号量を調節しながらエンコード処理をする。
【００２３】
ＶＢＶバッファは、デコーダの前段に仮想的に設けられるバッファであり、当該ＶＢＶバッファの占有量を参照することでエンコードする際の符号化量を調節する際の指針となる。
【００２４】
例えば、デコード処理においては、デコードされるピクチャが、一旦、一定時間ＶＢＶバッファに蓄えられ、デコード時刻がくると１ピクチャ分のデータがＶＢＶバッファから読み出されて、デコード処理がなされる、というモデルが想定される。
【００２５】
このとき、ピクチャの符号化量が膨大であると、ＶＢＶバッファがアンダーフローしてしまい表示する際に遅延が生じてしまうことになる。また、ピクチャの発生符号化量が少ないとＶＢＶバッファがオーバーフローしてしまい、ＶＢＶバッファが破綻してしまう。
【００２６】
このように、ＶＢＶバッファのアンダーフローや、オーバーフローといった破綻を生じさせないために、ＭＰＥＧエンコーダ１２でエンコードするピクチャの発生符号量を、例えば、量子化パラメータを操作するなどして、各ピクチャ毎に目標符号量となるよう調整する。
【００２７】
量子化パラメータの操作でも十分でない場合は、デコードするデータストリーム上の時間的に前にあるピクチャをコピーした符号量の少ないコピーピクチャ（ＣｏｐｙＰｉｃｔｕｒｅ）や、特に意味をなさないデータであるスタッフィングバイトなどを挿入することで復号タイミングや、データ量の調整を行う。ＶＢＶバッファがオーバーフローするような場合には、スタッフィングバイトを挿入し、アンダーフローするような場合には、コピーピクチャを挿入する。
【００２８】
ヘッダ情報抽出部１３は、再生回路２０、ＥＣＣ処理部１８、ストリーム再生処理部２１を介して、テープ状磁気記録媒体５０から再生されたビデオデータ、オーディオデータのＰＴＳ、ＤＴＳ、フレーム長情報を含んだヘッダ情報を抽出する。ＰＴＳ、及びＤＴＳは、ビデオデータ、オーディオデータのアクセスユニット単位、つまり１フレーム単位に付与されるヘッダ情報である。ＰＴＳは、テープ状磁気記録媒体５０に記録されたビデオデータ、オーディオデータをそれぞれ再生出力する時刻情報であり、ＤＴＳは、ビデオデータ、オーディオデータをそれぞれデコードする時刻情報である。
【００２９】
テープ状磁気記録媒体５０の任意の位置で、“つなぎ録り”をする指示が出されると、ヘッダ情報抽出部１３は、制御部２４の制御によって上記指定された任意の位置に最も近くに記録されているヘッダ情報を読み出す。このヘッダ情報の位置が、“つなぎ録り”を開始するつなぎ点となる。
【００３０】
また、ヘッダ情報抽出部１３は、制御部２４の制御によって、つなぎ点の直前にあるビデオデータ（以下、最終ビデオデータとも呼ぶ。）と、オーディオデータ（以下、最終オーディオデータとも呼ぶ。）のヘッダ情報をそれぞれ抽出する。ヘッダ情報抽出部１３で抽出されたヘッダ情報は、ストリーム記録処理部１４に送出される。
【００３１】
ストリーム記録処理部１４は、外部入力部１１、又はＭＰＥＧエンコーダ１２から供給されるビデオＰＥＳパケット、オーディオＰＥＳパケットを多重化して、テープ状磁気記録媒体５０に記録するデータストリームを生成する。
【００３２】
また、ストリーム記録処理部１４は、ヘッダ情報抽出部１３から送出されたヘッダ情報を用いて、はみ出し時間を算出する。
【００３３】
ここで、はみ出し時間について説明をする。はみ出し時間とは、あらかじめテープ状磁気記録媒体５０に記録されているビデオデータと、上記ビデオデータに対応して記録されたオーディオデータとの表示時間のずれ量を示したものである。
【００３４】
具体的には、図２（ａ）に示すテープ状磁気記録媒体５０に記録されたビデオデータに新たなビデオデータを“つなぎ録り”する際のつなぎ点Ｘと、上記ビデオデータに対応してテープ状磁気記録媒体５０に記録された、図２（ｂ）に示すオーディオデータに新たなオーディオデータを“つなぎ録り”する際のつなぎ点Ｙとのずれ量のことをはみ出し時間という。
【００３５】
上述したようにＭＰＥＧ方式で圧縮符合化されたビデオデータのフレーム長と、オーディオデータのフレーム長は、それぞれ異なっているため、オーディオデータのＰＴＳが、ビデオデータのＰＴＳに対して遅れて、テープ状磁気記録媒体５０に記録されていることが極めて多くなっている。
【００３６】
つまり、上述した図２（ａ），（ｂ）に示したように、テープ状磁気記録媒体５０に記録されたオーディオデータは、ビデオデータに対してリップシンクずれを生じており、ビデオデータの再生が終了しても、当該オーディオデータの再生が終了しない状態となっている。
【００３７】
ストリーム記録処理部１４は、図２（ａ），（ｂ）を用いて説明したはみ出し時間を、ヘッダ情報抽出部１３で抽出された最終ビデオデータ、及び最終オーディオデータのヘッダ情報を用いて算出する。
【００３８】
最終ビデオデータのヘッダ情報に含まれるＰＴＳをＴｖｅ、フレーム長をＶＦとし、最終オーディオデータのヘッダ情報に含まれるＰＴＳをＴａｅ、フレーム長をＡＦとすると、はみ出し時間Ｔｅは、（１）式に示すようにして算出することができる。
【００３９】
Ｔｅ＝（Ｔａｅ＋ＡＦ）―（Ｔｖｅ＋ＶＦ）・・・（１）
【００４０】
この（１）式における、Ｔａｅ＋ＡＦによって算出される値は、最終オーディオデータの再生が終了する時刻情報、つまり、オーディオデータのつなぎ点Ｙにおける時刻情報である。また、Ｔｖｅ＋ＶＦによって算出される値は、最終ビデオデータの再生が終了する時刻情報、つまり、ビデオデータのつなぎ点Ｘにおける時刻情報である。
【００４１】
ストリーム記録処理部１４で算出されたはみ出し時間Ｔｅの値は、ＭＰＥＧエンコーダ制御部１５に送出される。
【００４２】
上述した（１）式のようにして算出されるはみ出し時間Ｔｅは、後述する擬似データ生成部１７で、ビデオデータに対して擬似データが生成された場合は、生成された擬似データのフレーム長を考慮して算出される。
【００４３】
つまり、擬似データのフレーム長をＧＦとすると、はみ出し時間Ｔｅは、（２）式に示すようにして算出される。
【００４４】
Ｔｅ＝（Ｔａｅ＋ＡＦ）―（Ｔｖｅ＋ＶＦ＋ＧＦ）・・・（２）
【００４５】
ＭＰＥＧエンコーダ制御部１５は、ＭＰＥＧエンコーダ１２におけるビデオデータ、及びオーディオデータのエンコード開始を指示するエンコード開始コマンドを生成し、ＭＰＥＧエンコーダ１２に出力する。ＭＰＥＧエンコーダ制御部１５は、ビデオデータ、オーディオデータそれぞれに対して、エンコード開始コマンドを生成する。
【００４６】
まず、ＭＰＥＧエンコーダ制御部１５は、制御部２４から送出される“つなぎ録り”開始命令に応じてビデオデータに対するエンコード開始コマンドを生成し、ビデオエンコーダ１２ａに送出する。
【００４７】
続いて、ＭＰＥＧエンコーダ制御部１５は、ビデオデータに対するエンコード開始コマンドを送出してから、ストリーム記録処理部１４で生成されたはみ出し時間Ｔｅ後に、オーディオデータに対するエンコード開始コマンドを生成し、オーディオエンコーダ１２ｂに送出する。
【００４８】
このように、オーディオエンコーダ１２ｂによるエンコードの開始時刻は、以下の理由によって、はみ出し時間Ｔｅ分だけ遅延させる。
【００４９】
例えば、図３（ａ），（ｂ）示すようなビデオデータ、及びオーディオデータをテープ状磁気記録媒体５０に新たに記録させることを考える。
【００５０】
図３（ａ）に示すビデオデータのうち、先頭フレームのビデオデータ（以下、先頭ビデオデータとも呼ぶ。）のＰＴＳをＴｖｓとし、図３（ｂ）に示すオーディオデータのうち、先頭フレームのオーディオデータ（以下、先頭オーディオデータとも呼ぶ。）のＰＴＳをＴａｓとする。
【００５１】
図３（ｂ）に示すオーディオデータは、図３（ａ）に示すビデオデータに対して、遅延時間Ｔｓだけ遅延している。この遅延時間Ｔｓは、先頭ビデオデータのＰＴＳであるＴｖｓと、先頭オーディオデータのＰＴＳであるＴａｓを用いて、（３）式のように表すことができる。
【００５２】
Ｔｓ＝Ｔａｓ―Ｔｖｓ・・・（３）
【００５３】
この遅延時間Ｔｓが、上述したはみ出し時間Ｔｅと、Ｔｅ＞Ｔｓという関係であったり、Ｔｅ＜Ｔｓという関係であったりすると、新たに記録するオーディオデータは、テープ状磁気記録媒体５０にあらかじめ記録されているオーディオデータに対して、時間的に不連続に記録されてしまうことになる。このように、オーディオデータが、テープ状磁気記録媒体５０に、時間的に不連続となって記録されると、正常なデコード処理ができないといった原因となる。
【００５４】
遅延時間Ｔｓと、はみ出し時間Ｔｅとが、同じ時間となれば、つまりＴｓ＝Ｔｅという関係が満たされれば、新たにテープ状磁気記録媒体５０に記録するオーディオデータが、時間的に連続となって記録されることになる。
【００５５】
そこで、ＭＰＥＧエンコーダ制御部１５は、“つなぎ録り”するオーディオデータが、はみ出し時間Ｔｅだけビデオデータのエンコード処理より遅延するように、エンコード開始コマンドをはみ出し時間Ｔｅだけ遅延させて、オーディオエンコーダ１２ｂに送出する。
【００５６】
ＰＴＳ／ＤＴＳ更新処理部１６は、ストリーム記録処理部１４で生成されるデータストリームのビデオＰＥＳパケット、オーディオＰＥＳパケットのそれぞれのＰＴＳ、及びＤＴＳを、テープ磁気記録媒体５０に記録されている“つなぎ録り”する下地データのＰＴＳ、ＤＴＳに対応して時間的に連続となるように更新する。
【００５７】
擬似データ生成部１７は、上述したＶＢＶバッファの破綻を防止する際に、つなぎ点以降のビデオデータに挿入する、コピーピクチャや、スタッフィングバイトといった擬似データを生成する。上述したようなＶＢＶバッファの破綻は、発生符号量が極端に変化する場合に生ずるため“つなぎ録り”のつなぎ点では、コピーピクチャや、スタッフィングバイトが挿入されることが多い。
【００５８】
ＥＣＣ処理部１８は、ストリーム記録処理部１４から送出されるデータストリームに誤り訂正符号化処理、インタリーブ処理を実行し、記録回路１９に送出する。また、ＥＣＣ処理部１８は、再生回路２０から送出されるデータストリームにデインタリーブ処理、誤り訂正復号処理を実行し、ストリーム再生処理部２１に送出する。
【００５９】
具体的には、ＥＣＣ処理部１８は、図示しないＥＣＣＢａｎｋメモリを有しており、テープ状磁気記録媒体５０へ記録するデータストリーム、又は再生するデータストリームを一旦バッファリングして上記処理を実行する。
【００６０】
記録回路１９は、ＥＣＣ処理部１８から送出されるデータストリームをテープ状磁気記録媒体５０に記録させる記録処理をする。例えば、記録回路１９は、上記データストリームをシリアルデータに変換した上で増幅し、図示しない回転ドラムによって回転させられたテープ状磁気記録媒体５０に、図示しない磁気ヘッドを介して記録する。
【００６１】
再生回路２０は、テープ状磁気記録媒体５０に記録されたデータストリームを再生する再生処理をする。再生回路２０で再生されたデータストリームは、ＥＣＣ処理部１８に送出される。
【００６２】
ストリーム再生処理部２１は、ＥＣＣ処理部１８から供給されたデータストリームをデマルチプレクスして、ビデオＰＥＳパケット、及びオーディオＰＥＳパケットに分離する。
【００６３】
ＰＴＳ抽出部２２は、ストリーム再生処理部２１で、デマルチプレクスされたビデオＰＥＳパケット、及びオーディオＰＥＳパケットからＰＴＳを抽出しストリーム記録部１４に送出する。
【００６４】
外部出力部２２は、ストリーム再生処理部２１から供給されるビデオＰＥＳパケット、オーディオＰＥＳパケットを分割してＴＳパケットを生成し、ＴＳパケットを多重化してＴＳを生成する。
【００６５】
ＭＰＥＧデコーダ２３は、ストリーム再生処理部２１から供給されるビデオＰＥＳパケット、オーディオＰＥＳパケットからビデオＥＳ、オーディオＥＳをそれぞれ生成し、このＭＰＥＧデータからデジタルビデオデータ、デジタルオーディオデータにそれぞれ伸長復号処理をする。
【００６６】
制御部２４は、当該情報処理装置１の各部を統括的に制御する。制御部２４は、例えば、上述した“つなぎ録り”を実行する際に各部の動作を制御して、テープ状磁気記録媒体５０に、ＭＰＥＧ方式で圧縮符号化したビデオデータ、及びオーディオデータを記録させる。
【００６７】
続いて、図４に示すフローチャートを用いて、本発明の実施の形態として示す情報処理装置１において、新たなビデオデータ及び、オーディオデータをテープ状磁気記録媒体５０へ記録させる“つなぎ録り”時の動作について説明をする。
【００６８】
まず、ステップＳ１において、ユーザによって、テープ状磁気記録媒体５０の任意の位置で“つなぎ録り”をする指示が出されると、制御部２４は、再生回路２０、ＥＣＣ処理部１８、ストリーム再生処理部２１を制御して、テープ状磁気記録媒体５０の上記任意の位置近傍を再生させるよう制御する。
【００６９】
ステップＳ２において、制御部２４は、ヘッダ情報抽出部１３を制御して、上記指定された任意の位置に最も近くに記録されているヘッダ情報を読み出し、“つなぎ録り”を開始するつなぎ点を決定する。
【００７０】
ステップＳ３において、制御部２４は、ヘッダ情報抽出部１３を制御して、つなぎ点の直前のビデオデータ、及びオーディオデータである、最終ビデオデータ、及び最終オーディオデータのヘッダ情報を抽出し、ストリーム記録処理部１４に送出する。
【００７１】
ステップＳ４において、ストリーム記録処理部１４は、抽出したヘッダ情報の中から、ビデオデータのＰＴＳ及びフレーム長情報、オーディオデータのＰＴＳ及びフレーム長情報を用いて、上述した（１）式から、はみ出し時間Ｔｅを生成し、ＭＰＥＧエンコーダ制御部１５に送出する。
【００７２】
ストリーム記録処理部は、擬似データ生成部１７で、ビデオデータに対する擬似データが生成された場合には、擬似データのフレーム長も考慮した（２）式から、はみ出し時間Ｔｅを生成し、ＭＰＥＧエンコーダ制御部１５に送出する。
【００７３】
ステップＳ５において、制御部２４は、“つなぎ録り”を開始させる命令である、つなぎ記録開始コマンドを生成し、ＭＰＥＧエンコーダ制御部１５に送出する。
【００７４】
ステップＳ６において、ＭＰＥＧエンコーダ制御部１５は、制御部２４から送出された、つなぎ記録開始コマンドに応じて、“つなぎ録り”するビデオデータのエンコード処理を開始させる命令であるエンコード開始コマンドを生成し、ビデオエンコーダ１２ａに送出する。
【００７５】
ステップＳ７において、ビデオエンコーダ１２ａは、新たにテープ状磁気記録媒体５０に記録するビデオデータをＭＰＥＧ方式で圧縮符号化してストリーム記録処理部１４に送出する。
【００７６】
ステップＳ８において、ＭＰＥＧエンコーダ制御部１５は、ビデオデータのエンコード処理を開始させる命令であるエンコード開始コマンドを生成してから、はみ出し時間Ｔｅが経過したかどうかを判断する。はみ出し時間Ｔｅ経過した場合は、工程をステップＳ９に進め、そうでない場合は、はみ出し時間Ｔｅとなるまで待機する。
【００７７】
ステップＳ９において、ＭＰＥＧエンコーダ制御部１５は、ビデオエンコーダ１２ａにエンコード開始コマンドを送出してから、はみ出し時間Ｔｅだけ経過したことに応じて、“つなぎ録り”するオーディオデータのエンコード処理を開始させる命令であるエンコード開始コマンドを生成し、オーディオエンコーダ１２ｂに送出する。このとき、ＭＰＥＧエンコーダ制御部１５は、オーディオエンコーダ１２ｂにおいて、コマンドを受け取ってから処理が開始されるまでの遅延時間分を考慮してエンコード開始コマンドを送出する。
【００７８】
ステップＳ１０において、オーディオエンコーダ１２ｂは、新たにテープ状磁気記録媒体５０に記録するオーディオデータをＭＰＥＧ方式で圧縮符号化してストリーム記録処理部１４に送出する。
【００７９】
ステップＳ１１において、ストリーム記録処理部１４は、ビデオエンコーダ１２ａ、及びオーディオエンコーダ１２ｂで圧縮符号化されたビデオデータ、及びオーディオデータをテープ状磁気記録媒体５０に記録させるデータストリームに多重化する。
【００８０】
ステップＳ１２において、ＰＴＳ／ＤＴＳ更新処理部１６は、ストリーム記録処理部１４で多重化されたデータストリームのビデオデータ、オーディオデータ、それぞれのフレーム毎に付与されているヘッダ情報のうち、ＰＴＳ、及びＤＴＳを、“つなぎ録り”するテープ状磁気記録媒体５０に記録されている下地のビデオデータ、及び、オーディオデータのＰＴＳ、及びＤＴＳと連続的になるように更新する。
【００８１】
ステップＳ１３において、制御部２４は、ストリーム記録処理部１４で生成されたデータストリームをＥＣＣ処理部１８に送出し、さらに、記録回路１９を介して、テープ状磁気記録媒体５０に記録させる。
【００８２】
このようにして、オーディオエンコーダ１２ｂでのエンコード処理の開始時刻を、ビデオエンコーダ１２ａでのエンコード開始時刻より、ストリーム記録処理部１４によって（１）式、又は（２）式を用いて生成されるはみ出し時間Ｔｅ分だけ遅延させることで、テープ状磁気記録媒体５０に記録するオーディオデータのＰＴＳを時間的に連続して記録させることができる。
【００８３】
このように、情報処理装置１は、“つなぎ録り”を実行する際に、テープ状磁気記録媒体５０にあらかじめ記録されているビデオデータ、及びオーディオデータの最終ビデオデータ、及び最終オーディオデータのヘッダ情報から、はみだし時間Ｔｅを算出し、算出したはみ出し時間Ｔｅ分だけオーディオデータのエンコード開始時刻を遅らせることで、時間的に連続してオーディオデータをテープ状磁気記録媒体５０に記録させることができる。
【００８４】
これにより、再生時のリップシンクずれを最小限に抑制することができると共に、テープ状磁気記録媒体５０に“つなぎ録り”されたビデオデータ、及びオーディオデータを、外部装置などで再生した場合でも、正常なデコード処理を期待することができる。
【００８５】
【発明の効果】
以上の説明からも明らかなように、本発明は、記録媒体への“つなぎ録り”時において、新たなビデオデータ、及びオーディオデータを記録させるつなぎ点、直前に記録されているビデオデータ、及びオーディオデータのヘッダ情報からはみ出し時間情報を算出し、オーディオデータの圧縮符号化処理をビデオデータの圧縮符号化処理に対してはみ出し時間分遅延させ、上記新たなビデオデータ、及びオーディオデータを記録媒体に記録させる。
【００８６】
したがって、“つなぎ録り”されたビデオデータ、及びオーディオデータは、ＰＴＳ間隔が連続して記録媒体に記録されているため、再生時において、外部装置のＭＰＥＧデコーダを使用した場合でも正常にデコード処理させることが可能となる。
【００８７】
また、“つなぎ録り”をしても新たなリップシンクずれが発生することを防止できるため、“つなぎ録り”が複数回繰り返された場合でも、リップシンクずれが累積されることを防止することを可能とする。
【図面の簡単な説明】
【図１】本発明の実施の形態として示す情報処理装置の構成について説明するための図である。
【図２】はみ出し時間について説明するための図である。
【図３】本発明の実施の形態として示す情報処理装置において、テープ状磁気記録媒体に新たに記録するオーディオデータと、ビデオデータの一例を示した図である。
【図４】同情報処理装置において、テープ状磁気記録媒体に“つなぎ録り”する際の動作について説明するためのフローチャートである。
【符号の説明】
１情報処理装置、１１外部入力部、１２ＭＰＥＧエンコーダ、１２ａビデオエンコーダ、１２ｂオーディオエンコーダ、１３ヘッダ情報抽出部、１４ストリーム記録処理部、１５ＭＰＥＧエンコーダ制御部、１６ＰＴＳ／ＤＴＳ更新処理部、１７擬似データ生成部、１８、ＥＣＣ処理部、１９記録回路、２１ストリーム再生処理部、５０テープ状磁気記録媒体
[0001]
[Technical field of the invention]
The present invention relates to an information processing apparatus and an information processing method for recording video data and audio data encoded by the MPEG method on a recording medium.
[0002]
[Prior art]
A recording device has been devised that records, on a magnetic recording medium, video data and audio data compression-encoded by an MPEG (Moving Picture Experts Group) moving-picture coding method such as MPEG-2 (see, for example, Patent Document 1).
[0003]
[Patent Document 1]
JP 2001-309305 A
[0004]
[Problems to be solved by the invention]
When video data and audio data compression-encoded by the MPEG method are decoded and reproduced, lip sync must be performed so that the movement of the mouth matches the timing of the audio.
[0005]
In the MPEG system, to synchronize video data and audio data, control information is written in the PES header of each video PES (Packetized Elementary Stream) packet and each audio PES packet: a PTS (Presentation Time Stamp), which is playback time information, and a DTS (Decoding Time Stamp), which specifies the decoding timing. At decoding and playback, this control information is extracted, and the video data and audio data are synchronized based on it so that no lip-sync deviation occurs.
[0006]
Video data is normally divided into video frames, picture units of a fixed duration such as 1/30 second or 1/25 second.
[0007]
Audio data, by contrast, is continuous and has no natural boundaries other than its sample points. However, audio frames of a format-specific length are formed to define the calculation range used when the audio data is compression-encoded by the MPEG method.
[0008]
The frame lengths (durations on the time axis) of the video frame and the audio frame are almost never equal.
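As a rough illustration of why the two frame grids rarely line up, the sketch below counts frame boundaries on the 90 kHz clock that MPEG uses for PTS and DTS values. The 1/30-second video frame comes from the text. The 24 ms audio frame (1152 samples at 48 kHz, as in MPEG-1 Audio Layer II) is an assumption, since the document does not name the audio format, and `audio_overhang` is a hypothetical helper name.

```python
from math import lcm  # available in Python 3.9 and later

CLOCK_HZ = 90_000                          # MPEG PTS/DTS tick rate
VIDEO_FRAME = CLOCK_HZ // 30               # 3000 ticks per 1/30 s video frame
AUDIO_FRAME = 1152 * CLOCK_HZ // 48_000    # 2160 ticks (assumed: 1152 samples at 48 kHz)

def audio_overhang(video_frames: int) -> int:
    """Ticks by which whole audio frames run past the end of `video_frames` video frames.

    Assumes the audio must cover the video completely, so the number of
    audio frames is rounded up.
    """
    video_end = video_frames * VIDEO_FRAME
    audio_frames = -(-video_end // AUDIO_FRAME)   # ceiling division
    return audio_frames * AUDIO_FRAME - video_end

# The two frame grids coincide only once every lcm(VIDEO_FRAME, AUDIO_FRAME) ticks.
ALIGN_PERIOD = lcm(VIDEO_FRAME, AUDIO_FRAME)
```

Under these assumed frame lengths the boundaries coincide only once every 18 video frames, so a connection point chosen on a video frame boundary usually leaves part of an audio frame extending past it.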
[0009]
Consequently, when new compression-encoded video data and audio data are “join-recorded” starting from an arbitrary point in video data and audio data already recorded on a magnetic recording medium, the audio data ends up recorded discontinuously in time.
[0010]
When “join recording” is performed in this way, the data is recorded on the magnetic recording medium with decoding and output timings that fall outside the range controllable by the DTS and PTS, which causes lip-sync deviation too severe for practical use.
[0011]
Furthermore, because the audio data is recorded on the magnetic recording medium discontinuously in time, decoding during reproduction may not be performed normally. In particular, when a magnetic recording medium on which video data and audio data have been “join-recorded” as described above is played back by, for example, an external device, that device has no way of knowing that the audio data on the medium is discontinuous in time, so normal decoding cannot be expected.
[0012]
The present invention has been devised to solve the problems described above. Its object is to provide an information processing apparatus and an information processing method that, even when video data and audio data are compression-encoded and “join recording” is performed, record the video data and audio data on a recording medium so that they are reproduced with minimal lip-sync deviation and are decoded normally by an external device.
[0013]
[Means for Solving the Problems]
To achieve the above object, an information processing apparatus according to the present invention includes compression encoding means for compression-encoding video data and audio data by the MPEG method, and recording means for recording the compression-encoded video data and audio data on a recording medium, and is characterized by comprising: header information detecting means for, when new video data and audio data are join-recorded from an arbitrary position designated on the recording medium on which video data and audio data are recorded in advance, detecting first header information, which is the header information of the video data and of the audio data recorded near the designated position, and further detecting second header information, which is the header information of the video data and of the audio data recorded immediately before the detected first header information; overflow time information calculating means for calculating playback end time information of the video data and playback end time information of the audio data whose header information is the second header information detected by the header information detecting means, and further calculating overflow time information, which is the difference between the playback end time information of the video data and that of the audio data; compression encoding instruction generating means for generating a video data compression encoding start instruction, which starts compression encoding of the video data to be newly recorded on the recording medium, and outputting it to the compression encoding means, and, once the overflow time calculated by the overflow time information calculating means has elapsed after the video data compression encoding start instruction was output to the compression encoding means, generating an audio data compression encoding start instruction, which starts compression encoding of the audio data to be newly recorded on the recording medium, and outputting it to the compression encoding means; recording data stream generating means for generating a recording data stream including the video data and audio data compression-encoded by the compression encoding means; header information updating means for updating the PTS (Presentation Time Stamp) and DTS (Decoding Time Stamp) included in the header information of the recording data stream generated by the recording data stream generating means so that they are continuous with the PTS and DTS, respectively, included in the second header information; and control means for controlling the recording means so that the recording data stream whose header information has been updated by the header information updating means is recorded from the position of the first header information detected by the header information detecting means.
[0014]
Further, to achieve the above object, an information processing method according to the present invention is a method for an information processing apparatus that includes compression encoding means for compression-encoding video data and audio data by the MPEG method, and recording means for recording the compression-encoded video data and audio data on a recording medium, and is characterized by comprising: a header information detecting step of, when new video data and audio data are join-recorded from an arbitrary position designated on the recording medium on which video data and audio data are recorded in advance, detecting first header information, which is the header information of the video data and of the audio data recorded near the designated position, and further detecting second header information, which is the header information of the video data and of the audio data recorded immediately before the detected first header information; an overflow time information calculating step of calculating playback end time information of the video data and playback end time information of the audio data whose header information is the second header information detected in the header information detecting step, and further calculating overflow time information, which is the difference between the playback end time information of the video data and that of the audio data; a compression encoding instruction generating step of generating a video data compression encoding start instruction, which starts compression encoding of the video data to be newly recorded on the recording medium, and outputting it to the compression encoding means, and, once the overflow time calculated in the overflow time information calculating step has elapsed after the video data compression encoding start instruction was output to the compression encoding means, generating an audio data compression encoding start instruction, which starts compression encoding of the audio data to be newly recorded on the recording medium, and outputting it to the compression encoding means; a recording data stream generating step of generating a recording data stream including the video data and audio data compression-encoded by the compression encoding means; a header information updating step of updating the PTS (Presentation Time Stamp) and DTS (Decoding Time Stamp) included in the header information of the recording data stream generated in the recording data stream generating step so that they are continuous with the PTS and DTS, respectively, included in the second header information; and a control step of controlling the recording means so that the recording data stream whose header information has been updated in the header information updating step is recorded from the position of the first header information detected in the header information detecting step.
[0015]
[Embodiments of the invention]
Hereinafter, embodiments of an information processing apparatus and an information processing method according to the present invention will be described in detail with reference to the drawings.
[0016]
First, a configuration of an information processing apparatus shown as an embodiment of the present invention will be described with reference to FIG.
[0017]
The information processing apparatus 1 includes an external input unit 11, an MPEG encoder 12, a header information extraction unit 13, a stream recording processing unit 14, an MPEG encoder control unit 15, a PTS/DTS update processing unit 16, a pseudo data generation unit 17, an ECC processing unit 18, a recording circuit 19, a reproduction circuit 20, a stream reproduction processing unit 21, an external output unit 22, an MPEG decoder 23, and a control unit 24.
[0018]
The information processing apparatus 1 records MPEG compression-encoded video data and audio data on a tape-shaped magnetic recording medium 50 that is detachable from the apparatus, and can reproduce the recorded video data and audio data.
[0019]
The information processing apparatus 1 can also freely perform “join recording”, in which new video data and audio data are written over an arbitrary portion of a tape-shaped magnetic recording medium 50 on which video data and audio data are already recorded.
[0020]
The external input unit 11 reorganizes MPEG compression-encoded video data and audio data, received from outside multiplexed in a TS (Transport Stream), into video PES packets and audio PES packets, and supplies them to the stream recording processing unit 14.
[0021]
The MPEG encoder 12 compression-encodes (encodes) digital video data and digital audio data according to the MPEG system to generate a video ES (Elementary Stream) and an audio ES. The MPEG encoder 12 divides the encoded video ES and audio ES into video PES packets and audio PES packets, and supplies the video PES packets and audio PES packets to the stream recording processing unit 14.
[0022]
The MPEG encoder 12 performs the encoding process while adjusting the generated code amount of the picture based on the detected data occupation amount of a VBV (Video Buffering Verifier) buffer.
[0023]
The VBV buffer is a virtual buffer assumed at the input of the decoder. Its occupancy serves as a guide for adjusting the amount of code generated during encoding.
[0024]
For example, the decoding process is assumed to follow this model: each picture to be decoded is first held in the VBV buffer for a certain time, and when its decoding time arrives, one picture's worth of data is read from the VBV buffer and decoded.
[0025]
In this model, if the amount of code for a picture is very large, the VBV buffer underflows and display is delayed. Conversely, if the amount of code generated for the pictures is small, the VBV buffer overflows and the buffer model breaks down.
[0026]
To prevent VBV buffer failures such as underflow or overflow, the amount of code generated for each picture encoded by the MPEG encoder 12 is adjusted toward a per-picture target code amount, for example by manipulating the quantization parameter.
[0027]
If manipulating the quantization parameter is not sufficient, the decoding timing and the data amount are adjusted by inserting a copy picture (Copy Picture), a picture with a small code amount that copies a temporally preceding picture in the data stream, or stuffing bytes, which are data carrying no particular meaning. Stuffing bytes are inserted when the VBV buffer would overflow, and a copy picture is inserted when it would underflow.
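The buffer model described above can be sketched as a short simulation. This is a simplified constant-bit-rate model written for illustration, not the encoder's actual rate control: `simulate_vbv` and its parameters are hypothetical names, and real VBV behavior is defined by the MPEG-2 video specification.

```python
def simulate_vbv(picture_bits, bitrate, frame_rate, buffer_size, initial_fill):
    """Return (picture_index, event) pairs for a simplified constant-bit-rate VBV model.

    Bits arrive at `bitrate`; at each decode time one whole picture is removed.
    """
    events = []
    fill = float(initial_fill)
    bits_per_interval = bitrate / frame_rate      # bits delivered between decode times
    for index, bits in enumerate(picture_bits):
        if bits > fill:
            # Picture too large: it has not fully arrived by its decode time.
            # The text handles this case by inserting a copy picture.
            events.append((index, "underflow"))
            fill = 0.0                            # simplification: drain the buffer
        else:
            fill -= bits
        fill += bits_per_interval
        if fill > buffer_size:
            # Pictures too small: the buffer fills faster than it drains.
            # The text handles this case by inserting stuffing bytes.
            events.append((index, "overflow"))
            fill = float(buffer_size)
    return events
```

For instance, with 100 bits arriving per frame interval, a run of 10-bit pictures eventually reports an overflow, while a single 400-bit picture against a 300-bit initial fill reports an underflow.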
[0028]
The header information extraction unit 13 extracts header information, including the PTS, DTS, and frame length information, of the video data and audio data reproduced from the tape-shaped magnetic recording medium 50 via the reproduction circuit 20, the ECC processing unit 18, and the stream reproduction processing unit 21. The PTS and the DTS are header information attached to each access unit of the video data and audio data, that is, to each frame. The PTS is time information for reproducing and outputting the video data and audio data recorded on the tape-shaped magnetic recording medium 50, and the DTS is time information for decoding the video data and audio data.
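For readers unfamiliar with how a PTS is carried, the sketch below packs and unpacks the 33-bit time stamp in the 5-byte field layout that MPEG-2 Systems uses in the PES header (three marker bits interleaved with the value). The document itself only states that the PTS and DTS are in the PES header; the function names are hypothetical, and the fixed `0b0010` prefix corresponds to a header that carries a PTS only.

```python
def encode_pts(pts: int, prefix: int = 0b0010) -> bytes:
    """Pack a 33-bit time stamp into the 5-byte PES header layout."""
    pts &= (1 << 33) - 1
    return bytes([
        (prefix << 4) | (((pts >> 30) & 0x07) << 1) | 1,  # bits 32..30 + marker bit
        (pts >> 22) & 0xFF,                               # bits 29..22
        (((pts >> 15) & 0x7F) << 1) | 1,                  # bits 21..15 + marker bit
        (pts >> 7) & 0xFF,                                # bits 14..7
        ((pts & 0x7F) << 1) | 1,                          # bits 6..0 + marker bit
    ])

def decode_pts(field: bytes) -> int:
    """Recover the 33-bit time stamp from the 5-byte field."""
    if len(field) != 5:
        raise ValueError("a PTS/DTS field is exactly 5 bytes")
    return (
        ((field[0] >> 1) & 0x07) << 30
        | field[1] << 22
        | (field[2] >> 1) << 15
        | field[3] << 7
        | field[4] >> 1
    )
```

A value of 90000 ticks corresponds to one second on the 90 kHz clock, and `decode_pts(encode_pts(90000))` returns the original value.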
[0029]
When an instruction to perform “join recording” at an arbitrary position on the tape-shaped magnetic recording medium 50 is issued, the header information extraction unit 13, under the control of the control unit 24, reads out the header information recorded nearest to the specified position. The position of this header information becomes the connection point at which “join recording” starts.
[0030]
Further, under the control of the control unit 24, the header information extraction unit 13 extracts the header information of the video data immediately before the connection point (hereinafter also called the final video data) and of the audio data immediately before the connection point (hereinafter also called the final audio data). The header information extracted by the header information extraction unit 13 is sent to the stream recording processing unit 14.
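The selection of the connection point and of the final frame before it can be sketched as a nearest-neighbor lookup over recorded headers. This is a minimal sketch under stated assumptions: `FrameHeader`, its `position` field, and `find_connection` are hypothetical, and the document does not describe how positions on the tape are represented. The same lookup would be run once for the video headers and once for the audio headers.

```python
from bisect import bisect_left
from dataclasses import dataclass

@dataclass
class FrameHeader:
    position: int    # hypothetical position of the header on the medium
    pts: int         # presentation time stamp of the frame
    frame_len: int   # frame length, in the same units as pts

def find_connection(headers, requested_pos):
    """Return (index of the connection point, header of the final frame before it).

    `headers` must be sorted by position. The connection point is the header
    nearest to `requested_pos`; the final frame is the one just before it.
    """
    if not headers:
        raise ValueError("no recorded headers")
    positions = [h.position for h in headers]
    i = bisect_left(positions, requested_pos)
    if i == len(positions):
        i -= 1                                   # past the end: use the last header
    elif i > 0 and requested_pos - positions[i - 1] <= positions[i] - requested_pos:
        i -= 1                                   # the earlier header is at least as close
    if i == 0:
        raise ValueError("no frame precedes the connection point")
    return i, headers[i - 1]
```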
[0031]
The stream recording processing unit 14 multiplexes video PES packets and audio PES packets supplied from the external input unit 11 or the MPEG encoder 12 to generate a data stream to be recorded on the tape-shaped magnetic recording medium 50.
[0032]
Further, the stream recording processing unit 14 calculates the overflow time using the header information sent from the header information extracting unit 13.
[0033]
The overflow time is defined as follows. It is the offset in presentation time between the video data already recorded on the tape-shaped magnetic recording medium 50 and the audio data recorded in correspondence with that video data.
[0034]
Specifically, the overflow time is the offset between connection point X, where new video data is “join-recorded” onto the video data recorded on the tape-shaped magnetic recording medium 50 shown in FIG. 2A, and connection point Y, where new audio data is “join-recorded” onto the corresponding audio data recorded on the medium, shown in FIG. 2B.
[0035]
As described above, the frame length of MPEG compression-encoded video data differs from that of audio data, so the audio data is very often recorded on the tape-shaped magnetic recording medium 50 with its PTS lagging the PTS of the video data.
[0036]
That is, as shown in FIGS. 2A and 2B, the audio data recorded on the tape-shaped magnetic recording medium 50 is out of lip sync with the video data: reproduction of the audio data has not yet finished when reproduction of the video data ends.
[0037]
The stream recording processing unit 14 calculates the overflow time described with reference to FIGS. 2A and 2B using the header information of the final video data and the final audio data extracted by the header information extraction unit 13.
[0038]
Let Tve be the PTS included in the header information of the final video data and VF its frame length, and let Tae be the PTS included in the header information of the final audio data and AF its frame length. The overflow time Te can then be calculated as shown in equation (1).
[0039]
Te = (Tae + AF) - (Tve + VF) ... (1)
[0040]
The value calculated by Tae + AF in the expression (1) is time information at which the reproduction of the final audio data ends, that is, time information at a connection point Y of the audio data. The value calculated by Tve + VF is the time information at which the reproduction of the final video data ends, that is, the time information at the connection point X of the video data.
[0041]
The value of the overflow time Te calculated by the stream recording processing unit 14 is sent to the MPEG encoder control unit 15.
[0042]
When the pseudo data generation unit 17, described later, generates pseudo data for the video data, the overflow time Te given by equation (1) is instead calculated taking the frame length of the generated pseudo data into account.
[0043]
That is, assuming that the frame length of the pseudo data is GF, the overflow time Te is calculated as shown in Expression (2).
[0044]
Te = (Tae + AF) - (Tve + VF + GF)   (2)
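Expression (2) can be sketched in the same way. The Python function below is only an illustration: its name and the tick values are hypothetical, and times are assumed to be in 90 kHz ticks. Setting GF to 0 reduces it to expression (1).

```python
def protruding_time_with_pseudo(tae: int, af: int, tve: int, vf: int, gf: int = 0) -> int:
    """Expression (2): Te = (Tae + AF) - (Tve + VF + GF), all in 90 kHz ticks."""
    audio_end = tae + af          # end of the final audio frame (connection point Y)
    video_end = tve + vf + gf     # end of the final video frame plus the pseudo data
    return audio_end - video_end


# Hypothetical values: one copy picture of 3003 ticks inserted after the final video frame.
te_with_pseudo = protruding_time_with_pseudo(306000, 2160, 300300, 3003, gf=3003)
te_without = protruding_time_with_pseudo(306000, 2160, 300300, 3003)
print(te_with_pseudo, te_without)
```

The inserted pseudo data moves the video connection point later by GF, so Te shrinks by exactly GF compared with expression (1).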
[0045]
The MPEG encoder control unit 15 generates an encoding start command for instructing the MPEG encoder 12 to start encoding video data and audio data, and outputs the command to the MPEG encoder 12. The MPEG encoder control unit 15 generates an encoding start command for each of the video data and the audio data.
[0046]
First, the MPEG encoder control unit 15 generates an encoding start command for video data in response to the “join recording” start instruction sent from the control unit 24, and sends it to the video encoder 12a.
[0047]
Subsequently, after sending the encoding start command for the video data, the MPEG encoder control unit 15 waits for the protruding time Te calculated by the stream recording processing unit 14 to elapse, then generates an encoding start command for the audio data and sends it to the audio encoder 12b.
[0048]
As described above, the start time of the encoding by the audio encoder 12b is delayed by the protruding time Te for the following reason.
[0049]
For example, suppose that video data and audio data as shown in FIGS. 3A and 3B are newly recorded on the tape-shaped magnetic recording medium 50.
[0050]
In the video data shown in FIG. 3A, the PTS of the video data of the head frame (hereinafter also referred to as head video data) is Tvs, and in the audio data shown in FIG. 3B, the PTS of the audio data of the head frame (hereinafter also referred to as head audio data) is Tas.
[0051]
The audio data shown in FIG. 3B is delayed from the video data shown in FIG. 3A by a delay time Ts. The delay time Ts can be expressed by Expression (3) using Tvs, which is the PTS of the head video data, and Tas, which is the PTS of the head audio data.
[0052]
Ts = Tas - Tvs   (3)
[0053]
If the delay time Ts and the protruding time Te satisfy Te > Ts or Te < Ts, the audio data to be newly recorded is recorded so as to be temporally discontinuous with the audio data recorded in advance on the tape-shaped magnetic recording medium 50. If the audio data is recorded on the tape-shaped magnetic recording medium 50 so as to be discontinuous in time in this way, normal decoding cannot be performed.
[0054]
If the delay time Ts and the protruding time Te are equal, that is, if Ts = Te is satisfied, the audio data newly recorded on the tape-shaped magnetic recording medium 50 is recorded so as to be temporally continuous.
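The continuity condition can be checked with a few lines of Python. This is a minimal sketch under stated assumptions: the function names are hypothetical, times are in 90 kHz ticks, and the "gap" and "overlap" labels are one interpretation of the two discontinuous cases (the text itself only says that the audio becomes discontinuous).

```python
def delay_time(tas: int, tvs: int) -> int:
    """Expression (3): Ts = Tas - Tvs."""
    return tas - tvs


def audio_join_state(tas: int, tvs: int, te: int) -> str:
    """Compare the delay time Ts of the new data with the protruding time Te."""
    ts = delay_time(tas, tvs)
    if ts == te:
        return "continuous"       # Ts = Te: new audio starts where the old audio ends
    # Interpretation (assumption): Ts > Te leaves a gap, Ts < Te overlaps old audio.
    return "gap" if ts > te else "overlap"


print(audio_join_state(tas=1257, tvs=0, te=1257))
print(audio_join_state(tas=2000, tvs=0, te=1257))
print(audio_join_state(tas=0, tvs=0, te=1257))
```

Only the first case, where the new audio is delayed by exactly Te, satisfies the condition Ts = Te.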
[0055]
Therefore, the MPEG encoder control unit 15 delays the encoding start command for the audio data by the protruding time Te, so that the encoding of the audio data to be “join-recorded” starts later than the encoding of the video data by the protruding time Te, and sends the command to the audio encoder 12b.
[0056]
The PTS / DTS update processing unit 16 updates the PTS and DTS of the video PES packets and audio PES packets of the data stream generated by the stream recording processing unit 14 so that they are temporally continuous with the PTS and DTS of the underlying data recorded on the tape-shaped magnetic recording medium 50 onto which “join recording” is performed.
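One way to picture this update is as adding a constant offset to every time stamp of the new stream. The Python sketch below is an assumption about how such a rebase could be done, not a description of the actual unit 16: the function name and sample values are hypothetical, and the 33-bit wraparound reflects the width of PTS and DTS fields in MPEG-2 systems.

```python
PTS_MODULUS = 1 << 33  # PTS and DTS are 33-bit counters of a 90 kHz clock


def rebase_timestamps(stamps, first_new_video_pts, connection_point_x):
    """Shift time stamps so the new head video frame starts at connection point X."""
    offset = (connection_point_x - first_new_video_pts) % PTS_MODULUS
    return [(t + offset) % PTS_MODULUS for t in stamps]


# Hypothetical new streams: video starts at 0, audio is delayed by Ts = 1257 ticks.
new_video = [0, 3003, 6006]
new_audio = [1257, 3417]
x = 303303  # end of the final recorded video frame (Tve + VF)

print(rebase_timestamps(new_video, new_video[0], x))
print(rebase_timestamps(new_audio, new_video[0], x))
```

Using the same offset for both streams keeps the delay Ts between the new video and audio unchanged, so when Ts = Te the first new audio frame begins exactly where the recorded audio ends.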
[0057]
The pseudo data generation unit 17 generates pseudo data, such as copy pictures and stuffing bytes, to be inserted into the video data after the connection point in order to prevent the above-described failure of the VBV buffer. Because such a failure of the VBV buffer occurs when the generated code amount changes drastically, copy pictures or stuffing bytes are often inserted at the connection point of “join recording”.
[0058]
The ECC processing unit 18 performs an error correction coding process and an interleave process on the data stream transmitted from the stream recording processing unit 14, and transmits the data stream to the recording circuit 19. Further, the ECC processing unit 18 performs a deinterleave process and an error correction decoding process on the data stream transmitted from the reproduction circuit 20, and transmits the data stream to the stream reproduction processing unit 21.
[0059]
Specifically, the ECC processing unit 18 has an ECC Bank memory (not shown), and temporarily buffers a data stream to be recorded on the tape-shaped magnetic recording medium 50 or a data stream to be reproduced, and executes the above-described processing.
[0060]
The recording circuit 19 performs a recording process of recording the data stream transmitted from the ECC processing unit 18 on the tape-shaped magnetic recording medium 50. For example, the recording circuit 19 converts the data stream into serial data, amplifies the data stream, and records the amplified data on a tape-shaped magnetic recording medium 50 rotated by a rotating drum (not shown) via a magnetic head (not shown).
[0061]
The reproducing circuit 20 performs a reproducing process for reproducing the data stream recorded on the tape-shaped magnetic recording medium 50. The data stream reproduced by the reproduction circuit 20 is sent to the ECC processing unit 18.
[0062]
The stream reproduction processing unit 21 demultiplexes the data stream supplied from the ECC processing unit 18 and separates the data stream into video PES packets and audio PES packets.
[0063]
The PTS extraction unit 22 extracts the PTS from the video PES packets and audio PES packets demultiplexed by the stream reproduction processing unit 21 and sends the PTS to the stream recording processing unit 14.
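For readers unfamiliar with the PES header layout, the following Python sketch shows how a PTS can be read from a PES packet that follows the standard MPEG-2 systems syntax. It is a simplified illustration, not the implementation of the PTS extraction unit 22: the function names and the sample packet are hypothetical, and it assumes the optional PES header is present, as it is for ordinary video and audio streams.

```python
def encode_pts(pts: int, prefix: int = 0b0010) -> bytes:
    """Pack a 33-bit PTS into the 5-byte PES header field, with marker bits set."""
    return bytes([
        (prefix << 4) | (((pts >> 30) & 0x07) << 1) | 1,
        (pts >> 22) & 0xFF,
        (((pts >> 15) & 0x7F) << 1) | 1,
        (pts >> 7) & 0xFF,
        ((pts & 0x7F) << 1) | 1,
    ])


def extract_pts(pes: bytes):
    """Return the PTS of a PES packet, or None if the header carries no PTS."""
    if pes[0:3] != b"\x00\x00\x01":
        raise ValueError("missing PES start code prefix")
    if not (pes[7] & 0x80):      # top bit of PTS_DTS_flags set means a PTS follows
        return None
    b = pes[9:14]                # the PTS field starts right after the header length byte
    return (((b[0] >> 1) & 0x07) << 30 | b[1] << 22 | (b[2] >> 1) << 15
            | b[3] << 7 | b[4] >> 1)


# Hypothetical video PES header: stream_id 0xE0, PTS only, header data length 5.
packet = b"\x00\x00\x01\xe0\x00\x00\x80\x80\x05" + encode_pts(303303)
print(extract_pts(packet))
```

The encoder is included only so that the example can build its own test packet; a real extractor would also need to handle DTS and the remaining header flags.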
[0064]
The external output unit 22 divides a video PES packet and an audio PES packet supplied from the stream reproduction processing unit 21 to generate a TS packet, and multiplexes the TS packet to generate a TS.
[0065]
The MPEG decoder 23 generates a video ES and an audio ES from the video PES packets and the audio PES packets supplied from the stream reproduction processing unit 21, respectively, and performs decompression decoding on them to obtain digital video data and digital audio data.
[0066]
The control unit 24 controls each unit of the information processing apparatus 1 in an integrated manner. For example, when performing the above-described “join recording”, the control unit 24 controls the operation of each unit so that video data and audio data compression-encoded by the MPEG method are recorded on the tape-shaped magnetic recording medium 50.
[0067]
Next, the operation of the information processing apparatus 1 according to the embodiment of the present invention when performing “join recording”, in which new video data and audio data are recorded on the tape-shaped magnetic recording medium 50, will be described with reference to the flowchart shown in FIG. 4.
[0068]
First, in step S1, when the user issues an instruction to perform “join recording” at an arbitrary position on the tape-shaped magnetic recording medium 50, the control unit 24 controls the reproduction circuit 20, the ECC processing unit 18, and the stream reproduction processing unit 21 so as to reproduce the vicinity of the arbitrary position on the tape-shaped magnetic recording medium 50.
[0069]
In step S2, the control unit 24 controls the header information extraction unit 13 to read out the header information recorded closest to the specified position, and determines the connection point at which “join recording” starts.
[0070]
In step S3, the control unit 24 controls the header information extracting unit 13 to extract the header information of the final video data and the final audio data, which are the video data and the audio data immediately before the connection point, and to send it to the stream recording processing unit 14.
[0071]
In step S4, the stream recording processing unit 14 calculates the protruding time Te from expression (1) described above, using the PTS and frame length information of the video data and the PTS and frame length information of the audio data in the extracted header information, and sends it to the MPEG encoder control unit 15.
[0072]
When the pseudo data generation unit 17 generates pseudo data for the video data, the stream recording processing unit 14 calculates the protruding time Te from expression (2), which also takes the frame length of the pseudo data into account, and sends it to the MPEG encoder control unit 15.
[0073]
In step S5, the control unit 24 generates a connection recording start command, which is an instruction to start “connection recording”, and sends it to the MPEG encoder control unit 15.
[0074]
In step S6, in response to the join recording start command sent from the control unit 24, the MPEG encoder control unit 15 generates an encode start command, which is an instruction to start the encoding process of the video data to be “join-recorded”, and sends it to the video encoder 12a.
[0075]
In step S7, the video encoder 12a compresses and encodes video data to be newly recorded on the tape-shaped magnetic recording medium 50 by the MPEG method, and sends the encoded data to the stream recording processing unit 14.
[0076]
In step S8, the MPEG encoder control unit 15 determines whether the protruding time Te has elapsed since it generated the encode start command for starting the encoding process of the video data. If the protruding time Te has elapsed, the process proceeds to step S9; otherwise, the process waits until the protruding time Te has elapsed.
[0077]
In step S9, once the protruding time Te has elapsed since the encode start command was sent to the video encoder 12a, the MPEG encoder control unit 15 generates an encode start command, which is an instruction to start the encoding process of the audio data to be “join-recorded”, and sends it to the audio encoder 12b. At this time, the MPEG encoder control unit 15 sends the encode start command taking into account the delay in the audio encoder 12b between receiving the command and starting the processing.
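The timing of steps S6 to S9 can be summarized in a small Python sketch. It is a hedged illustration only: the function name, the event labels, and the way the audio encoder latency is subtracted are assumptions about how the compensation might be applied, and times are in 90 kHz ticks measured from the video command.

```python
def encode_command_times(te: int, audio_latency: int = 0, video_cmd_time: int = 0):
    """Return (time, command) pairs for the video and audio encode start commands."""
    # Audio encoding should begin Te after video encoding begins, so the command
    # is issued earlier by the encoder's own start-up latency (assumption).
    audio_cmd_time = video_cmd_time + te - audio_latency
    return [
        (video_cmd_time, "encode start -> video encoder 12a"),
        (audio_cmd_time, "encode start -> audio encoder 12b"),
    ]


# Hypothetical values: Te = 1257 ticks, audio encoder start-up latency of 90 ticks.
for when, command in encode_command_times(te=1257, audio_latency=90):
    print(when, command)
```

If the latency were larger than Te, this simple calculation would place the audio command before the video command; how such a case is handled is not stated in the text.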
[0078]
In step S10, the audio encoder 12b compresses and encodes the audio data to be newly recorded on the tape-shaped magnetic recording medium 50 by the MPEG method, and sends it to the stream recording processing unit 14.
[0079]
In step S11, the stream recording processing unit 14 multiplexes the video data and the audio data compressed and encoded by the video encoder 12a and the audio encoder 12b into a data stream to be recorded on the tape-shaped magnetic recording medium 50.
[0080]
In step S12, the PTS / DTS update processing unit 16 updates the PTS and DTS of the video data and audio data in the data stream multiplexed by the stream recording processing unit 14, which are contained in the header information added to each frame, so that they are continuous with the PTS and DTS of the underlying video data and audio data recorded on the tape-shaped magnetic recording medium 50 onto which “join recording” is performed.
[0081]
In step S13, the control unit 24 sends the data stream generated by the stream recording processing unit 14 to the ECC processing unit 18 and records the data stream on the tape-shaped magnetic recording medium 50 via the recording circuit 19.
[0082]
In this way, by delaying the start of the encoding process in the audio encoder 12b relative to the encoding start time in the video encoder 12a by the protruding time Te, which the stream recording processing unit 14 calculates using expression (1) or (2), the audio data can be recorded on the tape-shaped magnetic recording medium 50 so that its PTS is temporally continuous.
[0083]
As described above, when performing “join recording”, the information processing apparatus 1 calculates the protruding time Te from the header information of the final video data and the final audio data among the video data and audio data previously recorded on the tape-shaped magnetic recording medium 50, and delays the encoding start time of the audio data by the calculated protruding time Te, so that the audio data can be recorded on the tape-shaped magnetic recording medium 50 continuously in time.
[0084]
This makes it possible to minimize the lip-sync deviation at the time of reproduction, and normal decoding can be expected even when the video data and audio data “join-recorded” on the tape-shaped magnetic recording medium 50 are reproduced by an external device or the like.
[0085]
[Effects of the Invention]
As is clear from the above description, according to the present invention, at the time of “join recording” on a recording medium, protruding time information is calculated from the header information of the video data and audio data recorded immediately before the connection point at which new video data and audio data are to be recorded, the compression encoding of the audio data is delayed by the protruding time relative to the compression encoding of the video data, and the new video data and audio data are recorded on the recording medium.
[0086]
Therefore, the video data and the audio data recorded by “join recording” are recorded on the recording medium with continuous PTS intervals, so that decoding can be performed normally even when the MPEG decoder of an external device is used for reproduction.
[0087]
In addition, since no new lip-sync deviation is generated even when “join recording” is performed, lip-sync deviation can be prevented from accumulating even when “join recording” is repeated a plurality of times.
[Brief description of the drawings]
FIG. 1 is a diagram for describing a configuration of an information processing apparatus shown as an embodiment of the present invention.
FIG. 2 is a diagram for explaining the protruding time.
FIG. 3 is a diagram showing an example of audio data newly recorded on a tape-shaped magnetic recording medium and video data in the information processing apparatus shown as an embodiment of the present invention.
FIG. 4 is a flowchart for explaining an operation at the time of “join recording” on a tape-shaped magnetic recording medium in the information processing apparatus.
[Explanation of symbols]
Reference Signs List 1 information processing device, 11 external input unit, 12 MPEG encoder, 12a video encoder, 12b audio encoder, 13 header information extraction unit, 14 stream recording processing unit, 15 MPEG encoder control unit, 16 PTS / DTS update processing unit, 17 pseudo data generation unit, 18 ECC processing unit, 19 recording circuit, 21 stream reproduction processing unit, 50 tape-shaped magnetic recording medium

Claims

1. An information processing apparatus having compression encoding means for compressing and encoding video data and audio data in the MPEG system, and recording means for recording the compression-encoded video data and audio data on a recording medium, the information processing apparatus comprising:
header information detecting means for detecting, when new video data and audio data are to be join-recorded from a designated arbitrary position on the recording medium on which video data and audio data have been recorded in advance, first header information, which is the header information of each of the video data and the audio data recorded near the designated arbitrary position, and second header information, which is the header information of each of the video data and the audio data recorded immediately before the detected first header information;
protruding time information calculating means for calculating reproduction end time information of the video data and reproduction end time information of the audio data using the second header information detected by the header information detecting means, and for calculating protruding time information, which is the difference between the reproduction end time information of the video data and the reproduction end time information of the audio data;
compression encoding instruction generating means for generating a video data compression encoding start command for starting compression encoding of the video data to be newly recorded on the recording medium and outputting the command to the compression encoding means, and for generating, once the protruding time calculated by the protruding time information calculating means has elapsed after the video data compression encoding start command was output to the compression encoding means, an audio data compression encoding start command for starting compression encoding of the audio data to be newly recorded on the recording medium and outputting the command to the compression encoding means;
recording data stream generating means for generating a recording data stream including the video data and audio data compressed and encoded by the compression encoding means;
header information updating means for updating the PTS (Presentation Time Stamp) and DTS (Decoding Time Stamp) included in the header information of the recording data stream generated by the recording data stream generating means so that they are continuous with the PTS and DTS, respectively, included in the second header information; and
control means for controlling the recording means such that the recording data stream whose header information has been updated by the header information updating means is recorded from the position of the first header information detected by the header information detecting means.

2. The information processing apparatus according to claim 1, wherein
the protruding time information calculating means calculates the reproduction end time information of the video data by adding the PTS of the video data included in the second header information detected by the header information detecting means and the frame length information of the video data, and
calculates the reproduction end time information of the audio data by adding the PTS of the audio data included in the header information detected by the header information detecting means and the frame length information of the audio data.

3. The information processing apparatus according to claim 2, further comprising pseudo video data generating means for generating pseudo video data that is generated in accordance with the generated code amount of the video data to be join-recorded and is inserted at the position to be join-recorded, wherein
the protruding time information calculating means calculates the reproduction end time information of the video data by adding the PTS of the video data included in the second header information detected by the header information detecting means and the frame length information of the video data, and further adding the frame length information of the pseudo video data generated by the pseudo video data generating means.

4. An information processing method for an information processing apparatus having compression encoding means for compressing and encoding video data and audio data in the MPEG system, and recording means for recording the compression-encoded video data and audio data on a recording medium, the information processing method comprising:
a header information detecting step of detecting, when new video data and audio data are to be join-recorded from a designated arbitrary position on the recording medium on which video data and audio data have been recorded in advance, first header information, which is the header information of each of the video data and the audio data recorded near the designated arbitrary position, and second header information, which is the header information of each of the video data and the audio data recorded immediately before the detected first header information;
a protruding time information calculating step of calculating reproduction end time information of the video data and reproduction end time information of the audio data using the second header information detected in the header information detecting step, and of calculating protruding time information, which is the difference between the reproduction end time information of the video data and the reproduction end time information of the audio data;
a compression encoding instruction generating step of generating a video data compression encoding start command for starting compression encoding of the video data to be newly recorded on the recording medium and outputting the command to the compression encoding means, and of generating, once the protruding time calculated in the protruding time information calculating step has elapsed after the video data compression encoding start command was output to the compression encoding means, an audio data compression encoding start command for starting compression encoding of the audio data to be newly recorded on the recording medium and outputting the command to the compression encoding means;
a recording data stream generating step of generating a recording data stream including the video data and audio data compressed and encoded by the compression encoding means;
a header information updating step of updating the PTS (Presentation Time Stamp) and DTS (Decoding Time Stamp) included in the header information of the recording data stream generated in the recording data stream generating step so that they are continuous with the PTS and DTS, respectively, included in the second header information; and
a control step of controlling the recording means such that the recording data stream whose header information has been updated in the header information updating step is recorded from the position of the first header information detected in the header information detecting step.

5. The information processing method according to claim 4, wherein
the protruding time information calculating step calculates the reproduction end time information of the video data by adding the PTS of the video data included in the second header information detected in the header information detecting step and the frame length information of the video data, and
calculates the reproduction end time information of the audio data by adding the PTS of the audio data included in the header information detected in the header information detecting step and the frame length information of the audio data.

6. The information processing method according to claim 5, further comprising a pseudo video data generating step of generating pseudo video data that is generated in accordance with the generated code amount of the video data to be join-recorded and is inserted at the position to be join-recorded, wherein
the protruding time information calculating step calculates the reproduction end time information of the video data by adding the PTS of the video data included in the second header information detected in the header information detecting step and the frame length information of the video data, and further adding the frame length information of the pseudo video data generated in the pseudo video data generating step.