JP2012109655A

JP2012109655A - Media receiving device

Info

Publication number: JP2012109655A
Application number: JP2010254735A
Authority: JP
Inventors: Yukinari Matsuda; 幸成松田
Original assignee: Mitsubishi Electric Corp
Current assignee: Mitsubishi Electric Corp
Priority date: 2010-11-15
Filing date: 2010-11-15
Publication date: 2012-06-07

Abstract

PROBLEM TO BE SOLVED: To provide a media receiving device capable of reducing time from receiving a transmission packet to video output of encoded video media information through extracting and decoding the information.SOLUTION: A media receiving device comprises: a transmission packet analysis separation unit 11 for analyzing a transmission packet 7 to output video time information 13 including decoding time information and display time information; an encoded video media analysis unit 17 for detecting the beginning of a video frame to notify the detection; an encoded video media decoding unit 20 for notifying decoding completion of the video frame; a reference time generating unit 8 for outputting reference time information of one's own device; and a video decoding time calculation unit 23 for calculating video output timing having the shortest elapsed time from the detection of the beginning of the video frame to video output by comparing a time of the detection of the beginning of the video frame with a time of the decoding completion and the display time information of the video frame.

Description

この発明は、伝送パケットを受信して符号化映像メディア情報を復号するメディア受信装置に関し、特に、映像信号を出力するまでの処理遅延時間を短縮するメディア受信装置に関するものである。 The present invention relates to a media receiver that receives a transmission packet and decodes encoded video media information, and more particularly to a media receiver that shortens a processing delay time until a video signal is output.

MPEG-2 Systems規格に準拠したＴＳパケットから符号化映像メディア情報の分離・復号・表示までの処理において、ＰＥＳヘッダ中に復号時刻情報を示すＤＴＳ（Decoding Time-Stamp）、表示（出力）時刻情報を示すＰＴＳ（Presentation Time-Stamp）と呼ばれる時刻情報が必要に応じて格納されている。
その際、時刻情報に対する時間軸は、ＴＳヘッダのアダプテーション領域中に格納された送信装置側で取得した時点での基準時刻を示すＰＣＲ（Program Clock Reference）と呼ばれる時刻再生情報で示される。この時刻再生情報は一定の頻度で格納されており、このＰＣＲを用いて受信装置側で再現した時間軸がＳＴＣ（System Time Clock）と呼ばれる。 DTS (Decoding Time-Stamp) indicating decoding time information in the PES header and display (output) time information in processing from TS packet compliant with MPEG-2 Systems standard to separation / decoding / display of encoded video media information Time information called PTS (Presentation Time-Stamp) is stored as required.
At this time, the time axis for the time information is indicated by time reproduction information called PCR (Program Clock Reference) indicating the reference time at the time of acquisition on the transmission device side stored in the adaptation area of the TS header. This time reproduction information is stored at a constant frequency, and the time axis reproduced on the receiving device side using this PCR is called STC (System Time Clock).

つまり、受信装置側では、送信装置側から伝送された時刻再生情報ＰＣＲから再現した時間軸ＳＴＣを基準にして、復号時刻情報ＤＴＳが示す時刻になったら受信した符号化データ列の復号を開始し、表示時刻情報ＰＴＳが示す時刻になったら復号済メディア情報を出力するというタイミング制御を規定している。その際、復号対象となる符号化メディア情報はＤＴＳ時刻までに到着、ＤＴＳの瞬間に処理時間０で復号処理が完了し、符号化方式に伴うデータ列の順序入れ替えに必要な時間を考慮した遅延をもったＰＴＳ時刻に出力されるという理想の装置モデルで規定されている。 That is, the receiving device side starts decoding the received encoded data sequence when the time indicated by the decoding time information DTS is reached with reference to the time axis STC reproduced from the time reproduction information PCR transmitted from the transmitting device side. In addition, the timing control is defined such that the decrypted media information is output when the time indicated by the display time information PTS is reached. At that time, the encoded media information to be decoded arrives by the DTS time, the decoding process is completed at the processing time 0 at the DTS instant, and the delay in consideration of the time required to change the order of the data sequence according to the encoding method It is defined by an ideal device model that is output at the PTS time having

実際には映像メディアであれば、映像符号化方式に従った復号処理、ノイズや歪みなどの画質劣化を補正する映像補正処理などの処理時間や、映像信号そのものの映像画素転送の時間が必要となる。そして、モニタ装置などの映像表示装置に映像を表示するにはＳＭＰＴＥ等で規定する映像信号形式に従って一定周期に出力が必要になる。
そのため、受信装置では映像表示装置への映像出力間隔を一定にするため、受信した符号化メディア情報の復号タイミングとは独立なタイミング制御で、映像表示装置への出力タイミングを一定周期化し、その出力タイミングに合うように各処理を制御して最終的な映像画素転送タイミングを合わせた実装方式が多くみられる。
その結果、ＴＳパケットの受信から最終的な映像表示までの間に非同期な部分があって、そのタイミング合わせの待ち時間が発生して、映像表示遅延時間の増加と映像表示動作開始毎に１フレーム時間内のブレが生じる課題があった。 Actually, if it is video media, it takes time to perform decoding processing according to the video encoding method, video correction processing to correct image quality degradation such as noise and distortion, and video pixel transfer time of the video signal itself. Become. In order to display a video on a video display device such as a monitor device, it is necessary to output at a constant cycle in accordance with a video signal format defined by SMPTE or the like.
Therefore, in order to make the video output interval to the video display device constant in the receiving device, the output timing to the video display device is set to a constant cycle by timing control independent of the decoding timing of the received encoded media information, and the output There are many mounting methods in which each process is controlled to match the timing and the final video pixel transfer timing is matched.
As a result, there is an asynchronous part between the reception of the TS packet and the final video display, a waiting time for timing adjustment occurs, and one frame is generated every time the video display delay time increases and the video display operation starts. There was a problem that caused blurring in time.

そこで、従来技術では表示時刻情報ＰＴＳを用いて映像信号を出力する間隔を制御する仕組みや、表示時刻情報ＰＴＳが全ての映像フレームに付与されず離散的に付与されている場合に符号化映像メディア情報中から映像フレーム間隔を抽出して表示間隔を制御する仕組みが開示されている。
しかしながら、映像メディア情報の映像符号化方式や受信装置内部での処理構成によって遅延時間が異なるため、一定の遅延時間を見越したオフセットが必要であり、伝送パケットを受信してから映像信号を出力するまでの遅延時間が長くなる課題が生じてしまう。 Therefore, in the prior art, a mechanism for controlling the interval at which a video signal is output using the display time information PTS, or an encoded video medium when the display time information PTS is given discretely instead of being given to all video frames. A mechanism for extracting a video frame interval from information and controlling a display interval is disclosed.
However, since the delay time differs depending on the video encoding method of the video media information and the processing configuration inside the receiving device, an offset that allows for a certain delay time is necessary, and the video signal is output after receiving the transmission packet. The problem that the delay time until becomes longer occurs.

例えば、特許文献１では、ＴＳパケット中の時刻再生情報ＰＣＲを用いて再現した時間軸ＳＴＣとＰＥＳヘッダ中にある表示時刻情報ＰＴＳが一致したときに当該映像フレームに対する映像出力タイミングを制御するカウンタ回路を初期化することによって、受信装置内の自走タイミングではなくて送信装置から通知された表示時刻情報に従った出力タイミングで映像出力することが開示されている。
また、特許文献２では、特許文献１のように表示時刻情報ＰＴＳで映像出力タイミングを初期化する制御の場合、表示時刻情報ＰＴＳとして通知される時刻の精度に出力タイミングが左右されるため、一定の範囲で補正することを開示している。通常の映像表示装置の場合、映像を表示する間隔は正確に一定である必要があり、大きな揺らぎは許容されていないため、このような措置が必要となってくる。
さらに、特許文献３では、ＴＳパケット中の全ての映像フレームに表示時刻情報ＰＴＳが格納されていない場合にも符号化映像フレーム情報のパラメータとして付与されている映像フレーム間隔に関する情報から不足した表示時刻情報ＰＴＳを算出して出力タイミングの制御に適用する仕組みが開示されている。 For example, in Patent Document 1, when the time axis STC reproduced using the time reproduction information PCR in the TS packet matches the display time information PTS in the PES header, the counter circuit controls the video output timing for the video frame. It is disclosed that the video is output at the output timing according to the display time information notified from the transmitting device instead of the self-running timing in the receiving device by initializing the.
Further, in Patent Document 2, in the case of control for initializing video output timing with display time information PTS as in Patent Document 1, the output timing depends on the accuracy of the time notified as display time information PTS. It is disclosed that the correction is made within the range. In the case of a normal video display device, the interval at which video is displayed needs to be exactly constant, and large fluctuations are not allowed, so such measures are necessary.
Further, in Patent Document 3, even when the display time information PTS is not stored in all the video frames in the TS packet, the display time which is insufficient from the information regarding the video frame interval given as the parameter of the encoded video frame information. A mechanism for calculating information PTS and applying it to output timing control is disclosed.

ワンセグ放送などの低レートでの伝送では良好な映像品質を確保するために符号化映像メディア情報にできるだけ多くの伝送帯域を割り当てて、制御情報は必要最低限に抑制することが望ましい。映像フレームのそれぞれにＰＥＳヘッダを設定すると伝送帯域を消費するため、伝送エラー時からの復帰などで必要となる主要な映像フレームにのみＰＥＳヘッダを付与する手法が取られている。また、送信装置側で映像フレーム信号を符号化する際に全ての映像フレーム信号を符号化して符号化映像メディア情報として伝送しているとは限らず処理状況に応じて映像フレーム信号を間引いて符号化している場合もある。そのような場合に、表示時刻情報ＰＴＳが付与されなかった映像フレームに対して受信側の自走タイミングで映像出力タイミングを制御すると映像表示間隔が不自然になるため、このような装置が必要となる。 In transmission at a low rate such as one-segment broadcasting, it is desirable to allocate as much transmission bandwidth as possible to encoded video media information and to suppress control information to the minimum necessary to ensure good video quality. When a PES header is set for each video frame, a transmission band is consumed. Therefore, a method of adding a PES header only to a main video frame necessary for recovery from a transmission error or the like is employed. Also, when video frame signals are encoded on the transmission device side, not all video frame signals are encoded and transmitted as encoded video media information, and video frame signals are thinned out according to the processing status. There is also a case. In such a case, since the video display interval becomes unnatural if the video output timing is controlled at the self-running timing on the receiving side with respect to the video frame to which the display time information PTS is not given, such a device is necessary. Become.

特開２００４−１２８８６９号公報JP 2004-128869 A 特開２００４−１２８８７０号公報JP 2004-128870 A 特開２００７−２５１５４２号公報JP 2007-251542 A

伝送パケットを受信し、符号化メディア情報を抽出、復号してから表示出力するまでの実際の処理時間は送信装置から通知された表示時刻情報ＰＴＳを基準とする表示装置への出力タイミングとは一致していない上に、装置の実装方法や符号化方式に依存している。 The actual processing time from reception of a transmission packet, extraction and decoding of encoded media information to display output is the same as the output timing to the display device based on the display time information PTS notified from the transmission device. In addition, it depends on the device mounting method and coding method.

先に述べた特許文献のように表示時刻情報ＰＴＳで出力タイミングを初期化する場合や符号化映像メディア情報から不足した表示時刻情報ＰＴＳを算出する場合だけでは、このような理想の処理タイミングと現実の処理タイミングとの差異があっても映像表示遅延を短縮することはできないし、その処理時間待ち分だけ映像フレーム信号を保持するバッファも必要となる課題があった。 Only when initializing the output timing with the display time information PTS or calculating the lack of display time information PTS from the encoded video media information as in the above-mentioned patent document, such ideal processing timing and actual However, there is a problem that the video display delay cannot be shortened even if there is a difference from the processing timing, and a buffer for holding the video frame signal for the processing time is required.

本発明は上記のような課題を解決するためになされたもので、伝送パケットの受信から符号化映像メディア情報の抽出、復号、映像出力までの時間を短縮することのできるメディア受信装置を得ることを目的とする。 The present invention has been made to solve the above-described problems, and provides a media receiver capable of shortening the time from reception of a transmission packet to extraction, decoding, and video output of encoded video media information. With the goal.

この発明に係るメディア受信装置は、映像信号を符号化した符号化映像メディア情報を伝送パケットに格納して出力するメディア送信装置内における動作の基準時刻を示す時刻再生情報と、符号化映像メディア情報を復号開始すべきタイミングを示す復号時刻情報および映像復号した映像信号を出力すべきタイミングを示す表示時刻情報を含む映像時刻情報とが、符号化映像メディア情報と共に付与された伝送パケットを受信するメディア受信装置であって、受信した伝送パケットを解析して時刻再生情報と符号化映像メディア情報と映像時刻情報とを抽出する伝送パケット解析分離手段と、伝送パケットを受信した時点における自装置での動作の基準時刻情報と伝送パケット解析分離手段で抽出された時刻再生情報とを比較して自装置内の基準時刻情報が時刻再生情報に同期化するように基準時刻制御情報を生成する基準時刻再生手段と、基準時刻制御情報に基づいて自装置内の基準時刻の基準となる基準信号の発振周波数を調整して基準時刻の進みを制御すると共に、制御した基準時刻を示す基準時刻情報と調整した発振周波数を示す基準周波数信号を自装置内に分配する基準時刻生成手段と、符号化映像メディア情報を解析し、映像フレーム先頭を検出して通知すると共に、符号化映像フレーム情報として出力する符号化映像メディア解析手段と、符号化映像フレーム情報を復号して映像信号を復元すると共に、映像フレームの復号が完了した際に復号完了を通知する符号化映像メディア復号手段と、映像フレーム先頭の検知が通知された時点での基準時刻が示す映像フレーム先頭検知時刻、映像フレームの復号完了が通知された時点での基準時刻が示す復号完了時刻、および映像フレームに対する表示時刻情報を比較して映像フレーム先頭検知から映像出力までの経過時間を最短とする映像出力タイミングを算出する映像復号時間算出手段とを備えたものである。 The media receiver according to the present invention includes time reproduction information indicating a reference time of operation in a media transmitter that stores and outputs encoded video media information obtained by encoding a video signal in a transmission packet, and encoded video media information For receiving a transmission packet in which decoding time information indicating timing to start decoding and video time information including display time information indicating timing to output a video decoded video signal are added together with encoded video media information A receiving device that analyzes a received transmission packet to extract time reproduction information, encoded video media information, and video time information; and operation of the device at the time when the transmission packet is received The reference time information of the device and the time reproduction information extracted by the transmission packet analysis and separation means Reference time reproduction means for generating reference time control information so that the quasi-time information is synchronized with the time reproduction information, and the oscillation frequency of the reference signal serving as a reference for the reference time in the own device is adjusted based on the reference time control information The reference time generation means for controlling the advance of the reference time, the reference time information indicating the controlled reference time, and the reference frequency signal indicating the adjusted oscillation frequency are distributed within the apparatus, and the encoded video media information is analyzed. And detecting and notifying the head of the video frame and outputting the encoded video frame information as encoded video frame information; decoding the encoded video frame information to restore the video signal; and decoding the video frame Encoded video media decoding means for notifying completion of decoding when it is completed, and a video frame indicated by the reference time when the detection of the start of the video frame is notified The elapsed time from the detection of the start of the video frame to the output of the video is minimized by comparing the start detection time, the decoding completion time indicated by the reference time when the completion of decoding of the video frame is notified, and the display time information for the video frame. Video decoding time calculation means for calculating video output timing is provided.

この発明のメディア受信装置は、映像フレーム先頭の検知が通知された時点での基準時刻が示す映像フレーム先頭検知時刻、映像フレームの復号完了が通知された時点での基準時刻が示す復号完了時刻、および映像フレームに対する表示時刻情報を比較して映像フレーム先頭検知から映像出力までの経過時間を最短とする映像出力タイミングを算出するようにしたので、伝送パケットの受信から符号化映像メディア情報の抽出、復号、映像出力までの時間を短縮することができる。 The media receiving device of the present invention is a video frame head detection time indicated by a reference time when a detection of the start of a video frame is notified, a decoding completion time indicated by a reference time when a video frame decoding completion is notified, And the video output timing that minimizes the elapsed time from video frame head detection to video output by comparing the display time information for the video frame and extracting the encoded video media information from the reception of the transmission packet, Time until decoding and video output can be shortened.

この発明の実施の形態１によるメディア受信装置を示す構成図である。It is a block diagram which shows the media receiver by Embodiment 1 of this invention. この発明の実施の形態１で扱う伝送パケットと符号化映像メディア情報との関係の一例を示す構成図である。It is a block diagram which shows an example of the relationship between the transmission packet handled by Embodiment 1 of this invention, and encoding video media information. この発明の実施の形態１で扱う伝送パケットにおいて符号化映像メディア情報を多重するパケット構造の一例を示す説明図である。It is explanatory drawing which shows an example of the packet structure which multiplexes encoding video media information in the transmission packet handled in Embodiment 1 of this invention. この発明の実施の形態１で扱う伝送パケットにおいて符号化映像メディア情報を多重するパケット構造の一例を示す説明図である。It is explanatory drawing which shows an example of the packet structure which multiplexes encoding video media information in the transmission packet handled in Embodiment 1 of this invention. この発明の実施の形態２によるメディア受信装置を示す構成図である。It is a block diagram which shows the media receiver by Embodiment 2 of this invention. この発明の実施の形態２で扱う伝送パケットにおいて符号化映像メディア情報を多重するパケット構造の一例を示す説明図である。It is explanatory drawing which shows an example of the packet structure which multiplexes encoding video media information in the transmission packet handled in Embodiment 2 of this invention. この発明の実施の形態３によるメディア受信装置での映像フレーム後の映像フレーム先頭検知時刻から映像復号完了時刻までの経過時間の遷移の一例を示す説明図である。It is explanatory drawing which shows an example of transition of the elapsed time from the video frame head detection time after a video frame to video decoding completion time in the media receiver by Embodiment 3 of this invention. この発明の実施の形態４によるメディア受信装置での映像フレーム毎の画面符号化方式の遷移の一例を示す説明図である。It is explanatory drawing which shows an example of the transition of the screen coding system for every video frame in the media receiver by Embodiment 4 of this invention. この発明の実施の形態５で扱う伝送パケットにおいて制御情報を含むパケット構造の説明図である。It is explanatory drawing of the packet structure containing control information in the transmission packet handled in Embodiment 5 of this invention.

実施の形態１．
図１は、この発明の実施の形態１によるメディア受信装置を示す構成図である。
図１において、メディア送信装置１は、映像、音声およびデータ等のメディア情報を符号化、メディアパケット化し、さらに複数のメディアパケットを所定の構成で伝送パケット（ＴＳパケット）２に多重して伝送して伝送路に出力する送信装置である。メディア受信装置３は、メディア送信装置１が出力した伝送パケット２を受信し、伝送パケット２から映像、音声およびデータ等の符号化メディア情報を分離、復号して出力して映像表示装置４に出力する受信装置であり、伝送パケット受信部５、基準時刻生成部８、伝送パケット解析分離部１１、基準時刻再生部１５、符号化映像メディア解析部１７、符号化映像メディア復号部２０、映像復号時間算出部２３、映像出力タイミング生成部２５、映像信号後処理部２７を備えている。また、映像表示装置４はメディア受信装置３にて復号された映像信号を表示する表示装置である。
なお、音声、データなどの映像以外のメディア情報に関する構成は既知であるためこれらの図示及び説明は省略する。 Embodiment 1 FIG.
1 is a block diagram showing a media receiving apparatus according to Embodiment 1 of the present invention.
In FIG. 1, a media transmitting apparatus 1 encodes media information such as video, audio and data into media packets, and further multiplexes a plurality of media packets into a transmission packet (TS packet) 2 with a predetermined configuration and transmits the media packets. And transmitting to the transmission line. The media reception device 3 receives the transmission packet 2 output from the media transmission device 1, separates encoded media information such as video, audio, and data from the transmission packet 2, decodes and outputs them, and outputs them to the video display device 4. A transmission packet reception unit 5, a reference time generation unit 8, a transmission packet analysis / separation unit 11, a reference time reproduction unit 15, an encoded video media analysis unit 17, an encoded video media decoding unit 20, and a video decoding time. A calculation unit 23, a video output timing generation unit 25, and a video signal post-processing unit 27 are provided. The video display device 4 is a display device that displays the video signal decoded by the media receiving device 3.
In addition, since the structure regarding media information other than video, such as an audio | voice and data, is known, these illustration and description are abbreviate | omitted.

伝送パケット受信部５は、メディア送信装置１が出力した伝送パケット２を受信し、受信したタイミングを通知するための伝送パケット到着通知６を基準時刻再生部１５に出力すると共に、伝送パケット受信部５で取り込んだ伝送パケット７を伝送パケット解析分離部１１に出力する機能部である。基準時刻生成部８は、メディア受信装置３内の各機能部へ動作の基準となる基準時刻情報９と基準時刻周波数信号１０を分配する基準時刻生成手段である。伝送パケット解析分離部１１は、伝送パケット７の構造を解析して、メディア送信装置１において当該伝送パケットを出力した時点での基準時刻を示す時刻再生情報１２を基準時刻再生部１５に出力すると共に、基準時刻上での符号化映像メディア情報の復号タイミングを示す復号時刻情報１３ａおよび映像信号の出力タイミングを示す表示時刻情報１３ｂを抽出し、これを映像時刻情報１３として映像復号時間算出部２３に出力し、また、伝送パケット７内に格納された符号化映像メディア情報１４を抽出し、この符号化映像メディア情報１４を符号化映像メディア解析部１７に出力する伝送パケット解析分離手段である。なお、映像時刻情報１３は、図示はしていないが、個々に示す場合には復号時刻情報１３ａ、表示時刻情報１３ｂとする。 The transmission packet receiving unit 5 receives the transmission packet 2 output from the media transmission device 1 and outputs a transmission packet arrival notification 6 for notifying the reception timing to the reference time reproducing unit 15 and also transmits the transmission packet receiving unit 5. This is a functional unit that outputs the transmission packet 7 captured in step 1 to the transmission packet analysis / separation unit 11. The reference time generation unit 8 is a reference time generation unit that distributes the reference time information 9 and the reference time frequency signal 10 serving as a reference for operation to each functional unit in the media reception device 3. The transmission packet analysis / separation unit 11 analyzes the structure of the transmission packet 7 and outputs time reproduction information 12 indicating the reference time when the transmission packet is output in the media transmission device 1 to the reference time reproduction unit 15. Then, the decoding time information 13a indicating the decoding timing of the encoded video media information on the reference time and the display time information 13b indicating the output timing of the video signal are extracted, and these are extracted as the video time information 13 to the video decoding time calculation unit 23. The transmission packet analysis / separation means outputs the encoded video media information 14 stored in the transmission packet 7 and outputs the encoded video media information 14 to the encoded video media analysis unit 17. The video time information 13 is not shown, but when it is shown individually, it is set as the decoding time information 13a and the display time information 13b.

基準時刻再生部１５は、伝送パケット受信部５が通知した伝送パケット到着通知６のタイミングにおける基準時刻生成部８が出力する基準時刻情報９で得られる伝送パケット２の到着時刻と、伝送パケット解析分離部１１が抽出した時刻再生情報１２とを比較して、その値が一致するように基準時刻生成部８の動作を制御する基準時刻制御情報１６を生成し、これを基準時刻生成部８に出力する基準時刻再生手段である。符号化映像メディア解析部１７は、伝送パケット解析分離部１１から通知された符号化映像メディア情報１４における映像フレームの区切れ目を示す識別子を検索して映像フレーム先頭を検知し、これを映像フレーム先頭検知通知１８として映像復号時間算出部２３に出力すると共に、符号化映像フレーム情報１９を符号化映像メディア復号部２０に出力する符号化映像メディア解析手段である。符号化映像メディア復号部２０は、符号化映像フレーム情報１９を復号処理して映像信号を復元し、これを映像信号２２として映像信号後処理部２７に出力すると共に、当該映像フレームの復号完了を映像フレーム復号完了通知２１として映像復号時間算出部２３に通知する符号化映像メディア復号手段である。 The reference time reproduction unit 15 receives the transmission packet 2 arrival time obtained from the reference time information 9 output by the reference time generation unit 8 at the timing of the transmission packet arrival notification 6 notified by the transmission packet reception unit 5 and the transmission packet analysis and separation. The time reproduction information 12 extracted by the unit 11 is compared, the reference time control information 16 for controlling the operation of the reference time generation unit 8 is generated so that the values match, and this is output to the reference time generation unit 8 The reference time reproducing means. The encoded video media analysis unit 17 searches for an identifier indicating the breakpoint of the video frame in the encoded video media information 14 notified from the transmission packet analysis / separation unit 11, detects the start of the video frame, and uses this to detect the start of the video frame. It is an encoded video media analyzing means that outputs the detection notification 18 to the video decoding time calculation unit 23 and outputs the encoded video frame information 19 to the encoded video media decoding unit 20. The encoded video media decoding unit 20 decodes the encoded video frame information 19 to restore the video signal, outputs it as a video signal 22 to the video signal post-processing unit 27, and completes the decoding of the video frame. The encoded video media decoding means notifies the video decoding time calculation unit 23 as the video frame decoding completion notification 21.

映像復号時間算出部２３は、映像フレーム先頭検知通知１８が通知されたタイミングにおける基準時刻情報９で得られる映像フレーム検知時刻と当該映像フレームに対する映像フレーム復号完了通知２１が通知されたタイミングにおける基準時刻情報９で得られる映像フレーム復号完了時刻とから当該映像フレームの復号処理に掛かった経過時間を算出すると共に、当該映像フレームに対する表示時刻情報１３ｂとの関係から、映像フレーム先頭検知から映像出力までの経過時間を最短とする映像出力タイミングを決定して映像出力制御情報２４を生成し、この映像出力制御情報２４を映像出力タイミング生成部２５に出力する映像復号時間算出手段である。映像出力タイミング生成部２５は、映像出力制御情報２４に基づいて基準時刻情報９と基準時刻周波数信号１０で示される基準時刻における出力間隔を映像フレーム間隔に応じて制御して映像出力タイミング通知２６を生成し、これを映像信号後処理部２７に出力する機能部である。 The video decoding time calculation unit 23 receives the video frame detection time obtained from the reference time information 9 at the timing when the video frame head detection notification 18 is notified and the reference time at the timing when the video frame decoding completion notification 21 for the video frame is notified. The elapsed time required for the decoding process of the video frame is calculated from the video frame decoding completion time obtained from the information 9, and from the relationship with the display time information 13b for the video frame, from the detection of the video frame head to the video output. This is a video decoding time calculation means for determining the video output timing with the shortest elapsed time, generating the video output control information 24, and outputting the video output control information 24 to the video output timing generation unit 25. The video output timing generation unit 25 controls the output interval at the reference time indicated by the reference time information 9 and the reference time frequency signal 10 based on the video output control information 24 according to the video frame interval, and outputs a video output timing notification 26. This is a functional unit that generates and outputs this to the video signal post-processing unit 27.

映像信号後処理部２７は、復元された映像信号２２に対してメディア送信装置１で取り込まれた本来の映像信号と同等となるようにノイズ除去や歪み低減などの信号処理や映像フレームの順序を入れ替えるなどの映像符号化方式に伴って発生する処理を実施すると共に、映像信号を一時的に保持し、映像出力タイミング通知２６に基づいて映像信号を出力する機能部であり、最終的な映像出力タイミングを調整された映像出力信号２８を出力する。 The video signal post-processing unit 27 performs signal processing such as noise removal and distortion reduction and video frame order so that the restored video signal 22 is equivalent to the original video signal captured by the media transmission device 1. This is a functional unit that performs processing that accompanies the video encoding method such as switching, temporarily holds the video signal, and outputs the video signal based on the video output timing notification 26, and finally outputs the video signal. The video output signal 28 with the adjusted timing is output.

図２は、この発明の実施の形態１のメディア受信装置３で扱う伝送パケット７と符号化映像メディア情報１４との関係の一例を示す説明図である。また、図３および図４は、伝送パケット２において符号化映像メディア情報１４を多重するパケット構造の一例を示す説明図である。 FIG. 2 is an explanatory diagram showing an example of the relationship between the transmission packet 7 and the encoded video media information 14 handled by the media receiver 3 according to the first embodiment of the present invention. 3 and 4 are explanatory diagrams showing an example of a packet structure in which the encoded video media information 14 is multiplexed in the transmission packet 2. FIG.

伝送パケットの構造は実施の形態１における特徴的な動作には直接関係しないが説明の便宜上、一例を挙げて示す。
図２において、伝送パケット列１００は、メディア送信装置１とメディア受信装置３間で伝送ネットワークなどの伝送経路を介して伝達される伝送パケット２を示しており、伝送ヘッダ１０１は、伝送パケットを識別するＩＤなどの付加情報が格納されたヘッダであり、メディアヘッダ１０２は、映像、音声、データなどの各種メディア情報を格納する際にメディア情報毎に固有な識別をするＩＤや表示時刻情報、復号開始情報などの付加情報が格納されたヘッダである。また、符号化映像メディア情報１０３は、伝送パケット列１００に格納されるデータである。 Although the structure of the transmission packet is not directly related to the characteristic operation in the first embodiment, an example is given for convenience of explanation.
In FIG. 2, a transmission packet sequence 100 indicates a transmission packet 2 that is transmitted between the media transmission device 1 and the media reception device 3 via a transmission path such as a transmission network, and a transmission header 101 identifies the transmission packet. Additional information such as ID to be stored, and the media header 102 includes ID, display time information, and decoding that uniquely identify each media information when storing various media information such as video, audio, and data. This is a header in which additional information such as start information is stored. The encoded video media information 103 is data stored in the transmission packet sequence 100.

符号化映像メディア情報１０３のデータ列の方が伝送パケットよりも大きいサイズであるため、符号化映像メディア情報１０３を伝送パケット列１００におけるそれぞれの伝送パケットに収まるサイズに分割して逐次格納していくことになる。図２の例では符号化映像メディア情報１０３を三つに分割して伝送パケットに格納するケースを示している。ここで、伝送パケット１０４は、符号化映像メディア情報１０３の先頭部分を含む伝送パケット、伝送パケット１０５は、符号化映像メディア情報１０３の中間部分を含む伝送パケット、伝送パケット１０６は、符号化映像メディア情報１０３の最終部分を含む伝送パケットを示している。また、付加領域１０７は、伝送パケット中に格納するデータ列が過少で空き領域が発生した時にスタッフィングする目的やあるいは伝送パケットに対して時刻再生情報などの付加情報を格納するための領域である。この付加領域は適用する伝送フォーマットの多重規格によって伝送ヘッダ１０１の一部として存在する多重規格もあるし、独立に存在する多重規格もあるが、いずれの格納方法でも本質的な差異はない。 Since the data sequence of the encoded video media information 103 is larger in size than the transmission packet, the encoded video media information 103 is divided into sizes that can be accommodated in each transmission packet in the transmission packet sequence 100 and sequentially stored. It will be. The example of FIG. 2 shows a case where the encoded video media information 103 is divided into three and stored in a transmission packet. Here, the transmission packet 104 is a transmission packet including the head portion of the encoded video media information 103, the transmission packet 105 is a transmission packet including an intermediate portion of the encoded video media information 103, and the transmission packet 106 is an encoded video media. A transmission packet including the final part of the information 103 is shown. The additional area 107 is an area for storing additional information such as time reproduction information for the purpose of stuffing when a data string stored in a transmission packet is insufficient and a free area is generated, or for a transmission packet. This additional area includes a multiplex standard that exists as a part of the transmission header 101 depending on the multiplex standard of the transmission format to be applied, and a multiplex standard that exists independently, but there is no essential difference in any storage method.

図２では符号化フレーム情報１０３の最終部分を含む伝送パケット１０６にのみ付加領域１０７を格納しているが、付加領域１０７は伝送パケットに対する付加的な情報を付与する目的もあり、任意の伝送パケットに付与することが可能である。伝送ヘッダ１０１、付加領域１０７、メディアヘッダ１０２を必要に応じて順次格納した後に残った領域に各種データ列、例えば符号化映像メディア情報１０３を適切なサイズに分割して格納して伝送パケット列１００を構成する運用となる。 In FIG. 2, the additional area 107 is stored only in the transmission packet 106 including the final part of the encoded frame information 103. However, the additional area 107 has an object of adding additional information to the transmission packet. It is possible to give to. The transmission header 101, the additional area 107, and the media header 102 are sequentially stored as necessary, and various data strings, for example, the encoded video media information 103, are divided into appropriate sizes and stored in the remaining area, and the transmission packet string 100 is stored. It becomes the operation which constitutes.

次に、実施の形態１の動作について説明する。
メディア送信装置１は入力された映像メディア信号、音声メディア信号、データ信号などのメディア情報をそれぞれのメディアの特性に応じた符号化方式によって映像符号化、音声符号化、データ符号化して符号化メディア情報を生成する。映像符号化方式の一例ではMPEG-2 Video、MPEG−４ Video、MPEG-4 AVC/ITU-T H.264、VC-1などの映像符号化規格に準拠して符号化されたデータ列が該当とする。音声符号化方式の一例では、MPEG-1 Audio、MPEG-2 Audio、MPEG-2/4 AAC、Dolby AC-3などの音声符号化規格に準拠して符号化されたデータ列が該当する。 Next, the operation of the first embodiment will be described.
The media transmission device 1 encodes media information such as an input video media signal, audio media signal, and data signal by encoding video data, audio encoding, and data encoding according to an encoding method according to the characteristics of each media. Generate information. An example of a video encoding method is a data sequence encoded in accordance with video encoding standards such as MPEG-2 Video, MPEG-4 Video, MPEG-4 AVC / ITU-T H.264, and VC-1. And An example of an audio encoding method corresponds to a data string encoded in accordance with an audio encoding standard such as MPEG-1 Audio, MPEG-2 Audio, MPEG-2 / 4 AAC, Dolby AC-3, or the like.

そして、複数種類の符号化メディア情報を一つの伝送データに集約して伝送路に出力し易くするため、図２で示すようなパケット構造で伝送用のデータ列を形成してパケット化し、メディア送信装置１とメディア受信装置３間で伝達される。このとき、どのようなメディア情報が格納されているのかを識別するために伝送ヘッダ１０１に符号化メディア種別やパケット属性などの付加情報を付与してパケット化して伝送路へ出力される。 Then, in order to aggregate a plurality of types of encoded media information into one transmission data and easily output it to the transmission path, a data sequence for transmission is formed in a packet structure as shown in FIG. It is transmitted between the device 1 and the media receiving device 3. At this time, in order to identify what kind of media information is stored, additional information such as an encoded media type and a packet attribute is added to the transmission header 101 to form a packet, which is output to the transmission path.

映像メディア情報を伝送する場合についてさらに説明すると、映像メディア情報は何らかの単位、例えば映像信号を構成するプログレッシブ方式での映像フレーム構造やインタレース方式での映像フィールド構造などの有意な単位で区切れる構造で符号化されていることを利用して、それらの区切れ目に合わせて符号化映像メディア情報のデータ列を分割し、伝送パケットに格納して送付する手法が取られることが多い。リアルタイムな伝送方法において、任意のタイミングから受信して復号処理をする際に、復号処理の実行が可能な意味のある箇所から速やかに処理を開始できるようにするためである。デジタル放送やBluray Discなどのアプリケーションでは、映像フレームまたは映像フィールドを構成する符号化映像メディア情報のデータ列の方が伝送パケットよりも大きいサイズであるため、逐次分割して伝送パケットに格納していくことになる。 The case where video media information is transmitted will be further described. The video media information is divided in some unit, for example, a structure in which a significant unit such as a video frame structure in a progressive method or a video field structure in an interlace method is used. In many cases, the data sequence of the encoded video media information is divided in accordance with the breaks, stored in a transmission packet, and sent. This is because in the real-time transmission method, when receiving and decoding from an arbitrary timing, the processing can be started promptly from a meaningful place where the decoding can be executed. In applications such as digital broadcasting and Bluray Disc, the data sequence of encoded video media information that constitutes a video frame or video field is larger than the transmission packet, so it is sequentially divided and stored in the transmission packet. It will be.

図２で示す例では符号化映像メディア情報１０３を三つに分割して伝送パケット列１００に格納しており、先頭部分を含む伝送パケット１０４では伝送ヘッダ１０１において先頭部分があることを意味するために、伝送ヘッダ中の先頭有無フラグを「有り」に設定し、それ以外の伝送パケット１０５、１０６の伝送パケット中の先頭有無フラグは「無し」に設定される。 In the example shown in FIG. 2, the encoded video media information 103 is divided into three parts and stored in the transmission packet sequence 100, which means that the transmission packet 101 including the leading part has the leading part in the transmission header 101. In addition, the head presence / absence flag in the transmission header is set to “present”, and the head presence / absence flags in the other transmission packets 105 and 106 are set to “none”.

さらに先頭部分を含む伝送パケット１０４では、メディアヘッダ１０２が存在して、当該映像メディア情報に関わる付加的な情報が付与されることになる。このメディアヘッダ１０２にはメディア送信装置１において付与されたメディア送信装置１側の基準時刻上での当該映像フレームの復号タイミングを示す復号時刻情報１３ａ、表示タイミングを示す表示時刻情報１３ｂからなる映像時刻情報１３が格納されている。復号時刻情報１３ａと表示時刻情報１３ｂが同一値のときは伝送効率向上のため一方だけ格納されることもあるが、その場合は同一値として省略された時刻情報を復元することになる。最終部分を含む伝送パケット１０６では符号化映像メディア情報１０３を分割して逐次格納していった結果、伝送パケットのサイズに対して過少となる場合には付加領域１０７によってスタッフィングデータ列を挿入して補填する。この付加領域１０７には伝送パケットに対して各種データ列を格納するための付加的な情報を付与することもできるし、メディア送信装置１側での基準時刻を示す時刻再生情報１２を格納することもできる。 Further, in the transmission packet 104 including the head portion, the media header 102 exists, and additional information related to the video media information is given. The media header 102 includes a video time including decoding time information 13a indicating the decoding timing of the video frame on the reference time on the media transmitting device 1 side given in the media transmitting device 1, and display time information 13b indicating the display timing. Information 13 is stored. When the decoding time information 13a and the display time information 13b have the same value, only one of them may be stored for improving transmission efficiency. In this case, the time information omitted as the same value is restored. In the transmission packet 106 including the final portion, the encoded video media information 103 is divided and sequentially stored. As a result, if the transmission packet 106 becomes too small with respect to the size of the transmission packet, a stuffing data string is inserted by the additional area 107. To compensate. In this additional area 107, additional information for storing various data strings can be added to the transmission packet, and time reproduction information 12 indicating the reference time on the media transmitting apparatus 1 side is stored. You can also.

例えば、図３は、伝送パケットの一例としてMPEG-2 Systems規格で定義するＴＳパケット構造において、実施の形態１の説明に必要なパラメータをビット配置で示した一例を示している。
図３において、各パラメータの意味は実施の形態１の範囲外なので詳細な説明は省略するが、ＴＳパケットは１８８バイトの固定長で構成され、図３ではその先頭バイトの位置を１、最終バイトの位置を１８８で示し、バイト内のＭＳＢ（Most Significant Bit）をｂ７、ＬＳＢ（Least Significant Bit）をｂ０、ＭＳＢからＬＳＢまでの間の各ビットをそれぞれｂ６からｂ１までで示している。 For example, FIG. 3 shows an example in which parameters necessary for the description of the first embodiment are shown in bit arrangement in a TS packet structure defined by the MPEG-2 Systems standard as an example of a transmission packet.
In FIG. 3, the meaning of each parameter is out of the scope of the first embodiment, so detailed description is omitted. However, the TS packet is composed of a fixed length of 188 bytes, and in FIG. 188, the MSB (Most Significant Bit) in the byte is b7, the LSB (Least Significant Bit) is b0, and the bits between the MSB and the LSB are b6 to b1, respectively.

図３において、バイト位置１からバイト位置４までが伝送ヘッダ１０１に相当し、バイト位置５からバイト位置Ｍ−１までが付加領域１０７に相当する。ＴＳパケット中においてバイト位置４にあるフラグ類の一つadaptation_field_controlによって付加領域有りが指定された場合、付加領域１０７が存在し、バイト位置６にあるフラグ類のうち、PCR_flag＝１である場合、バイト位置７からバイト位置１２までにあるPCR_Base、PCR_Extensionで示される時刻情報が存在している。両パラメータを一組にしてPCR（Program_Clock_Reference）と呼ばれる時刻情報で時刻再生情報１２に相当し、PCR_Base部は９０ＫＨｚ精度のカウンタ値、PCR_Extension部は２７ＭＨｚ精度のカウンタ値で構成されている。MPEG-2 Systems規格に準拠した装置が送受信間で動作周波数を一致させるため、ＰＣＲは一定の頻度で通知する必要があり、MPEG-2 Systems規格では１００ｍｓｅｃ以内の伝送間隔でパラメータ設定して通知することとなっている。メディア受信装置３側でこのＰＣＲのカウンタ値の遷移と自装置内のカウンタ値の遷移の差異を検知することによって送受信間での動作周波数差と基準時刻差を算出し、自装置の動作周波数と基準時刻を調整することで動作周波数と基準時刻を送受信間で一致させることができる仕組みとなっている。 In FIG. 3, byte position 1 to byte position 4 correspond to the transmission header 101, and byte position 5 to byte position M−1 correspond to the additional area 107. In the TS packet, when the presence of an additional area is specified by one of the flags at byte position 4 in adaptation_field_control, an additional area 107 exists, and among the flags at byte position 6, PCR_flag = 1, There is time information indicated by PCR_Base and PCR_Extension from position 7 to byte position 12. A set of both parameters is time information called PCR (Program_Clock_Reference), which corresponds to time reproduction information 12. The PCR_Base part is composed of a counter value with 90 KHz accuracy, and the PCR_Extension part is composed of a counter value with 27 MHz precision. Since a device conforming to the MPEG-2 Systems standard matches the operating frequency between transmission and reception, it is necessary to notify the PCR at a certain frequency. In the MPEG-2 Systems standard, parameters are set and notified at a transmission interval of 100 msec or less. It is supposed to be. By detecting the difference between the counter value transition of the PCR and the counter value transition in the own device on the media receiving device 3 side, the operating frequency difference between the transmission and reception and the reference time difference are calculated. By adjusting the reference time, the operating frequency and the reference time can be matched between transmission and reception.

さらに、図４はメディア情報を格納するＴＳパケットにおいてＴＳヘッダにて先頭有りが指定された場合に存在するメディア情報に関わる付加的なヘッダに対するMPEG-2 Systems規格で定義するＰＥＳヘッダ構造の一例を示している。
ＴＳパケット構造と同様、実施の形態１の説明に必要なパラメータをビット配置で示した一例である。各パラメータの意味は実施の形態１の範囲外なので詳細な説明は省略するが、ＰＥＳヘッダはメディアヘッダ１０２に相当する。ＰＥＳヘッダ中においてバイト位置８にあるフラグ類のひとつPTS_DTS_flagが１０ｂまたは１１ｂ（２進表記）である場合、バイト位置１０からバイト位置１４までにあるＰＴＳ（Presentation_Time_Stamp）と呼ばれる時刻情報が存在し、表示時刻情報１３ｂに相当する。PTS_DTS_flagが１１ｂ（２進表記）である場合、バイト位置１６からバイト位置１９までにあるＤＴＳ（Decoding_Time_Stamp）と呼ばれる時刻情報が存在し、復号時刻情報１３ａに相当する。これらの時刻情報は、PCR_Baseと同様に９０ＫＨｚ精度のカウンタ値で構成される。別途通知されるＰＣＲ値で示される基準時刻情報の時間軸上で、当該映像フレームの復号を開始すべき時刻と表示を開始すべき時刻を指し示すものである。 Further, FIG. 4 shows an example of a PES header structure defined in the MPEG-2 Systems standard for an additional header related to media information existing when the head is specified in the TS header in a TS packet storing media information. Show.
Similar to the TS packet structure, this is an example in which parameters necessary for the description of the first embodiment are shown in bit arrangement. Since the meaning of each parameter is out of the range of the first embodiment, the detailed description is omitted, but the PES header corresponds to the media header 102. When one of the flags at byte position 8 in the PES header, PTS_DTS_flag is 10b or 11b (binary notation), time information called PTS (Presentation_Time_Stamp) from byte position 10 to byte position 14 exists and is displayed. It corresponds to the time information 13b. When PTS_DTS_flag is 11b (binary notation), there is time information called DTS (Decoding_Time_Stamp) from byte position 16 to byte position 19, which corresponds to decoding time information 13a. The time information is composed of a counter value with 90 KHz accuracy as in PCR_Base. On the time axis of the reference time information indicated by the separately notified PCR value, it indicates the time to start decoding the video frame and the time to start display.

このような各種符号化・伝送規格に従って複数種類のメディア情報にその復元に必要となる時刻情報などを理想の動作モデルを前提にした仮想装置での動作を想定して付与した伝送パケット列１００を構成して伝送している。実際の装置では各種動作にかかる時間を加味して時刻情報を補正して適用していくこととなる。 In accordance with such various encoding / transmission standards, a transmission packet sequence 100 provided with a plurality of types of media information and time information necessary for the restoration is assumed assuming an operation in a virtual device assuming an ideal operation model. Configured and transmitted. In an actual apparatus, the time information is corrected and applied in consideration of the time required for various operations.

メディア受信装置３では、メディア送信装置１から伝送経路を介して伝達された伝送パケット２を、伝送パケット受信部５において物理的な伝送信号からデータ列を抽出して伝送パケット７として通知すると共に、伝送パケット２を受信したことを伝送パケット到着通知６によって基準時刻再生部１５へ通知する。
伝送パケット解析分離部１１では、伝送パケット７のパケット構造を解析して、先に述べた時刻再生情報を検知した場合、時刻再生情報１２として基準時刻再生部１５へその内容を通知する。同様に映像フレームに対する復号時刻情報１３ａまたは表示時刻情報１３ｂを検知した場合には映像時刻情報１３として映像復号時間算出部２３へ通知し、また伝送パケット７中に符号化映像メディア情報を検知した場合には、符号化映像メディア情報１４を抽出して符号化映像メディア解析部１７へ通知する。 In the media reception device 3, the transmission packet 2 transmitted from the media transmission device 1 via the transmission path is extracted as a transmission packet 7 by extracting a data string from a physical transmission signal in the transmission packet reception unit 5, and The reference time reproduction unit 15 is notified by the transmission packet arrival notification 6 that the transmission packet 2 has been received.
When the transmission packet analysis / separation unit 11 analyzes the packet structure of the transmission packet 7 and detects the time reproduction information described above, the transmission packet analysis / separation unit 11 notifies the reference time reproduction unit 15 of the contents as time reproduction information 12. Similarly, when decoding time information 13a or display time information 13b for a video frame is detected, it is notified to the video decoding time calculation unit 23 as video time information 13, and when encoded video media information is detected in the transmission packet 7. The encoded video media information 14 is extracted and notified to the encoded video media analysis unit 17.

基準時刻再生部１５では伝送パケット到着通知６のタイミングでその時点での基準時刻情報９を取り込み、当該伝送パケット２に対するパケット到着時刻を保持する。伝送パケット解析分離部１１から時刻再生情報１２が通知されたとき、時刻再生情報１２はメディア送信装置１における基準時刻を示し、保持したパケット到着時刻はメディア受信装置３における基準時刻を示し、両基準時刻が一致するように基準時刻生成部８における動作周波数と基準時刻を制御する基準時刻制御情報１６を算出して基準時刻生成部８へ通知する。基準時刻生成部８では基準時刻制御情報１６に基づいてメディア受信装置３内に関係する動作周波数および基準時刻絶対値を調整しつつ、基準時刻情報９と基準時刻周波数信号１０を各機能部へ分配する。なお、メディア送信装置１とメディア受信装置３で基準時刻を一致させる手法については公知の手法と同様であるため、ここでは簡単な説明に止めることとする。 The reference time reproducing unit 15 takes in the reference time information 9 at that time at the timing of the transmission packet arrival notification 6 and holds the packet arrival time for the transmission packet 2. When the time reproduction information 12 is notified from the transmission packet analysis / separation unit 11, the time reproduction information 12 indicates the reference time in the media transmission device 1, the held packet arrival time indicates the reference time in the media reception device 3, and both references The reference time control information 16 for controlling the operating frequency and the reference time in the reference time generation unit 8 is calculated so as to coincide with each other, and is notified to the reference time generation unit 8. The reference time generation unit 8 distributes the reference time information 9 and the reference time frequency signal 10 to each functional unit while adjusting the operating frequency and the reference time absolute value related to the media receiver 3 based on the reference time control information 16. To do. Note that the method of matching the reference time between the media transmission device 1 and the media reception device 3 is the same as a known method, and therefore, only a brief description will be given here.

一般的な手法によれば、メディア受信装置３では、受信動作開始後に最初に到着して抽出した時刻再生情報の値でまず基準時刻生成部８にある基準時刻情報を初期化する。そして一定の頻度でメディア送信装置１から伝送される時刻再生情報と自装置の基準時刻情報とを比較して、時刻再生情報の方が進んでいれば自装置の基準信号の動作周波数がメディア送信装置１の動作周波数よりも低いと判断し、逆に自装置の基準時刻情報の方が進んでいれば自装置の基準信号の動作周波数がメディア送信装置１の動作周波数よりも高いと判断される。その際、メディア送信装置１からの時刻再生情報と自装置の基準時刻情報との差分値の大きさとその前回時刻再生情報を受信した時からの変化量から、メディア送信装置１と自装置との基準動作周波数の差異と基準時刻情報の差異を算出して基準動作周波数と基準時刻情報の両方が一致するように基準時刻制御情報１６を導出する。そして基準時刻制御情報１６を基準時刻生成部８に通知して、基準時刻生成部８における基準動作周波数を上下させてメディア送信装置１と自装置の基準動作周波数と基準時刻情報の両方を一致させる。これによりメディア送信装置１が伝送パケット２を生成時に想定した理想の動作時における伝送パケット受信、符号化メディア情報の復号、出力までの時刻合わせが可能となる。 According to a general method, the media receiving device 3 first initializes the reference time information in the reference time generation unit 8 with the value of the time reproduction information that arrives first after the reception operation starts and is extracted. Then, the time reproduction information transmitted from the media transmission device 1 with a certain frequency is compared with the reference time information of the own device, and if the time reproduction information is advanced, the operating frequency of the reference signal of the own device is the media transmission. It is determined that the operating frequency is lower than the operating frequency of the device 1, and conversely, if the reference time information of the own device is advanced, it is determined that the operating frequency of the reference signal of the own device is higher than the operating frequency of the media transmitting device 1. . At that time, based on the magnitude of the difference between the time reproduction information from the media transmission device 1 and the reference time information of the own device and the amount of change from when the previous time reproduction information was received, the media transmission device 1 and the own device. The difference between the reference operating frequency and the reference time information is calculated, and the reference time control information 16 is derived so that both the reference operating frequency and the reference time information match. Then, the reference time control information 16 is notified to the reference time generation unit 8, and the reference operating frequency in the reference time generation unit 8 is raised or lowered to match both the reference operating frequency and the reference time information of the media transmission device 1 and its own device. . As a result, it is possible to adjust the time until reception of the transmission packet, decoding of the encoded media information, and output in the ideal operation assumed when the media transmission device 1 generates the transmission packet 2.

符号化映像メディア情報１４は先に述べたようなMPEG-2 Video、MPEG-4 Video、MPEG-4 AVC／ITU-T H.264などの映像符号化方式に従った規則に基づいてデータ化されており、符号化効率向上のため、復号処理に共通的に必要なパラメータは一定の頻度でまばらに設定したり、一定の順序でパラメータが存在することを前提に特に識別子もなく詰めて設定するなどの工夫がされている。そのため、符号化映像メディア情報１４の途中から映像復号処理を開始しても正常な映像復号処理ができないという問題がある。そこで、映像復号開始の初期段階や映像復号途中での異常状態からの復帰において早期に正常な映像信号の出力を実施するため、映像信号を構成する映像フレームの境目など、映像復号処理において意味のある箇所には区切れ目を意味するコードが設定される規定になっている。この区切れ目を探索することで正常に映像復号可能な開始位置を検出して、早期に映像出力を復帰できるように工夫されている。例えば、MPEG-2 Videoであれば、start_codeと呼ばれる“０”（２進表記）が２３ｂｉｔ連続した後に“１”（２進表記）が存在するデータパターンが区切れ目に該当する。MPEG-4 AVC／ITU-T H.264でも符号化条件設定で異なるが同様に“０”（２進表記）が所定のビット数連続したのちに“１”（２進表記）が存在するデータパターンが区切れ目に該当する。そして、このstart_codeに続く識別子によってどのような符号化データの種別であるかを識別することができる。 The encoded video media information 14 is converted into data based on the rules according to the video encoding scheme such as MPEG-2 Video, MPEG-4 Video, MPEG-4 AVC / ITU-T H.264 as described above. In order to improve coding efficiency, parameters commonly required for decoding processing are set sparsely at a certain frequency or set without any identifiers on the assumption that the parameters exist in a certain order Etc. have been devised. Therefore, there is a problem that normal video decoding processing cannot be performed even if video decoding processing is started in the middle of the encoded video media information 14. Therefore, in order to perform normal video signal output early in the initial stage of video decoding start or recovery from an abnormal state during video decoding, it is meaningful in video decoding processing such as the boundary of video frames constituting the video signal. There is a rule that a code that means a break is set at a certain point. By searching for this break, it is devised so that a start position where video decoding can be normally performed is detected, and video output can be restored early. For example, in the case of MPEG-2 Video, a data pattern in which “1” (binary notation) exists after 23 bits of “0” (binary notation) called “start_code” corresponds to the break. Even in MPEG-4 AVC / ITU-T H.264, the data differs depending on the encoding condition setting. Similarly, data with “1” (binary notation) after “0” (binary notation) continues for a predetermined number of bits. Patterns are separated and correspond to eyes. The type of encoded data can be identified by the identifier following the start_code.

符号化映像メディア解析部１７ではこのようなstart_codeを符号化映像メディア情報１４とビットパターン比較しながら探索し、start_codeに続く識別子を確認することで映像フレームの区切れ目を検出する。そして、映像フレームの区切れ目を検出したとき、映像フレーム先頭検知通知１８を通知する。start_codeの探索と共に映像符号化メディア情報を映像復号するために、符号化映像フレーム情報１９として符号化映像メディア復号部２０へ伝達する。これは映像フレーム区切れ目の検出タイミングと映像フレームの復号タイミングが処理順序によって前後しないように順序立てて処理するためである。 The encoded video media analysis unit 17 searches for such start_code while comparing the encoded video media information 14 with the bit pattern, and detects the break of the video frame by checking the identifier following the start_code. When a video frame break is detected, a video frame head detection notification 18 is notified. The video encoded media information is transmitted to the encoded video media decoding unit 20 as encoded video frame information 19 for video decoding along with the search for start_code. This is because the detection timing of the video frame break and the decoding timing of the video frame are processed in order so that they do not fluctuate depending on the processing order.

符号化映像メディア復号部２０では符号化映像フレーム情報１９を映像符号化方式によって規定された所定の規則に従って映像復号処理を実施して映像符号化前の映像信号２２を復元して映像信号後処理部２７へ伝達する。映像復号処理の詳細は実施の形態１の動作には直接関係しないため、ここでの詳細な説明は行わないが、符号化映像フレーム情報１９を映像復号する過程において映像フレーム末端までの映像復号処理が完了したとき、映像フレーム復号完了通知２１を通知する。 The encoded video media decoding unit 20 performs video decoding processing on the encoded video frame information 19 in accordance with a predetermined rule defined by the video encoding method, restores the video signal 22 before video encoding, and performs video signal post-processing. To the unit 27. The details of the video decoding process are not directly related to the operation of the first embodiment, and thus will not be described in detail here. However, the video decoding process up to the end of the video frame in the process of decoding the encoded video frame information 19 is performed. Is completed, the video frame decoding completion notification 21 is notified.

映像復号時間算出部２３では符号化映像メディア解析部１７から映像フレーム先頭検知通知１８を受けたタイミングでの基準時刻情報９を取り込んで当該映像フレームに対する先頭検知時刻、つまり実質的に復号処理を開始した復号開始時刻を保持する。同様に符号化映像メディア復号部２０から映像フレーム復号完了通知２１を通知されたタイミングでの基準時刻情報９を取り込んで当該映像フレームに対する復号処理が完了した復号完了時刻を保持する。別途、伝送パケット解析分離部１１から通知された映像時刻情報１３も合わせて保持する。これにより映像復号時間算出部２３では当該映像フレームに対する復号開始時刻、復号完了時刻、メディア送信装置１にて定義した復号時刻情報、表示時刻情報を保持することができる。このとき、各処理部から通知する際に映像フレームに対して一意なＩＤなどの番号を付与して、これらの時刻情報を関連付けられるようにしてもよい。そして、これらの時刻情報を比較することによって映像出力タイミングを決定する。 The video decoding time calculation unit 23 takes in the reference time information 9 at the timing when the video frame head detection notification 18 is received from the encoded video media analysis unit 17 and starts the head detection time for the video frame, that is, substantially starts the decoding process. Hold the decoding start time. Similarly, the reference time information 9 at the timing when the encoded video media decoding unit 20 is notified of the video frame decoding completion notification 21 is fetched, and the decoding completion time when the decoding process for the video frame is completed is held. Separately, the video time information 13 notified from the transmission packet analysis / separation unit 11 is also held. Thereby, the video decoding time calculation unit 23 can hold the decoding start time, the decoding completion time, the decoding time information defined by the media transmission device 1 and the display time information for the video frame. At this time, when notifying from each processing unit, a number such as a unique ID may be assigned to the video frame so that the time information can be associated. The video output timing is determined by comparing the time information.

映像復号時間算出部２３では、表示時刻情報の遷移、および必要ならば映像符号化メディア情報中のパラメータから当該映像フレームを表示する順序を決定し、そして決定された映像フレームの順番で探索したとき各映像フレームの復号完了時刻よりも先行しないように映像出力タイミングを算出して映像出力制御情報２４を映像出力タイミング生成部２５に通知する。
映像出力タイミング生成部２５では映像出力制御情報２４と基準時刻情報９とが一致するタイミングで映像出力タイミング通知２６を出力する。その際、基準時刻周波数信号１０に同期した映像信号に固有な周波数の信号に従って出力される。
映像信号後処理部２７では映像出力タイミング通知２６が通知されたタイミングに合わせて符号化映像メディア復号部２０が出力した映像信号２２の出力時間を調整した映像出力信号２８として映像表示装置４へ出力する。
このとき映像信号後処理部２７では映像信号２２に対して、画質向上させるために輪郭強調、コントラスト調整、色相調整、ノイズ除去などの各種信号処理をして映像出力信号２８を出力してもよい。この場合、これらの信号処理に要する時間分だけ固定的に映像出力タイミングを遅延させて実装することとなるが、実施の形態１の動作で説明する映像表示までの時間の一定化、最短化を図るための復号完了時刻から映像出力までの経過時間を最短とする映像出力タイミングの算出方法には影響を与えず、映像出力において固定的に必要となる出力遅延とみなせる。 When the video decoding time calculation unit 23 determines the display order of the video frames from the transition of the display time information and, if necessary, the parameters in the video encoded media information, and searches in the determined video frame order. The video output timing is calculated so as not to precede the decoding completion time of each video frame, and the video output control information 24 is notified to the video output timing generation unit 25.
The video output timing generation unit 25 outputs a video output timing notification 26 at a timing when the video output control information 24 and the reference time information 9 match. At that time, the video signal is output in accordance with a signal having a frequency unique to the video signal synchronized with the reference time frequency signal 10.
The video signal post-processing unit 27 outputs to the video display device 4 as a video output signal 28 in which the output time of the video signal 22 output from the encoded video media decoding unit 20 is adjusted in accordance with the timing when the video output timing notification 26 is notified. To do.
At this time, the video signal post-processing unit 27 may perform various signal processing such as edge enhancement, contrast adjustment, hue adjustment, and noise removal on the video signal 22 to output the video output signal 28 in order to improve the image quality. . In this case, the video output timing is fixedly delayed by the time required for the signal processing, but the time until video display described in the operation of the first embodiment is made constant and minimized. For this purpose, the calculation method of the video output timing that minimizes the elapsed time from the decoding completion time to the video output is not affected, and can be regarded as an output delay that is fixedly required in the video output.

以上のように、実施の形態１のメディア受信装置によれば、映像信号を符号化した符号化映像メディア情報を伝送パケットに格納して出力するメディア送信装置内における動作の基準時刻を示す時刻再生情報と、符号化映像メディア情報を復号開始すべきタイミングを示す復号時刻情報および映像復号した映像信号を出力すべきタイミングを示す表示時刻情報を含む映像時刻情報とが、符号化映像メディア情報と共に付与された伝送パケットを受信するメディア受信装置であって、受信した伝送パケットを解析して時刻再生情報と符号化映像メディア情報と映像時刻情報とを抽出する伝送パケット解析分離手段と、伝送パケットを受信した時点における自装置での動作の基準時刻情報と伝送パケット解析分離手段で抽出された時刻再生情報とを比較して自装置内の基準時刻情報が時刻再生情報に同期化するように基準時刻制御情報を生成する基準時刻再生手段と、基準時刻制御情報に基づいて自装置内の基準時刻の基準となる基準信号の発振周波数を調整して基準時刻の進みを制御すると共に、制御した基準時刻を示す基準時刻情報と調整した発振周波数を示す基準周波数信号を自装置内に分配する基準時刻生成手段と、符号化映像メディア情報を解析し、映像フレーム先頭を検出して通知すると共に、符号化映像フレーム情報として出力する符号化映像メディア解析手段と、符号化映像フレーム情報を復号して映像信号を復元すると共に、映像フレームの復号が完了した際に復号完了を通知する符号化映像メディア復号手段と、映像フレーム先頭の検知が通知された時点での基準時刻が示す映像フレーム先頭検知時刻、映像フレームの復号完了が通知された時点での基準時刻が示す復号完了時刻、および映像フレームに対する表示時刻情報を比較して映像フレーム先頭検知から映像出力までの経過時間を最短とする映像出力タイミングを算出する映像復号時間算出手段とを備えたので、伝送パケット受信から映像復号、映像表示までの時間を短縮して映像表示遅延時間を少なくし、かつ同一特性の符号化映像メディア情報の受信に対して映像表示タイミングを同一にした映像表示を実施するメディア受信装置を実現することができる。 As described above, according to the media receiver of the first embodiment, time reproduction indicating the reference time of operation in the media transmitter that stores and outputs encoded video media information obtained by encoding a video signal in a transmission packet. Information and video time information including decoding time information indicating a timing at which decoding of encoded video media information should be started and display time information indicating timing at which video decoded video signals should be output are provided together with the encoded video media information A receiving device for receiving a transmission packet, a transmission packet analyzing / separating means for analyzing the received transmission packet and extracting time reproduction information, encoded video media information and video time information; The reference time information of the operation of the device at the time of the transmission and the time reproduction information extracted by the transmission packet analysis and separation means, In comparison, the reference time reproduction means for generating the reference time control information so that the reference time information in the own device is synchronized with the time reproduction information, and the reference time reference in the own device based on the reference time control information Reference time generating means for controlling the advance of the reference time by adjusting the oscillation frequency of the reference signal, and distributing the reference time information indicating the controlled reference time and the reference frequency signal indicating the adjusted oscillation frequency in the own device, Analyzing the encoded video media information, detecting and notifying the beginning of the video frame, and outputting the encoded video frame information by decoding the encoded video frame information, and restoring the video signal In addition, an encoded video media decoding means for notifying the completion of decoding when decoding of the video frame is completed, and a reference when the detection of the start of the video frame is notified The video frame start detection time indicated by the time, the decoding completion time indicated by the reference time when the completion of decoding of the video frame is notified, and the display time information for the video frame are compared, and the process from video frame start detection to video output Video decoding time calculation means for calculating the video output timing that minimizes the time, so the time from transmission packet reception to video decoding and video display is shortened to reduce the video display delay time, and the same characteristics It is possible to realize a media receiving apparatus that performs video display with the same video display timing with respect to reception of encoded video media information.

実施の形態２．
図５は、この発明の実施の形態２によるメディア受信装置３ａを示す構成図である。
実施の形態２のメディア受信装置３ａが実施の形態１のメディア受信装置３と異なる部分は、実施の形態１における映像復号時間算出部２３の代わりに、映像復号時刻蓄積部２００と映像表示時刻判定部２０２を備えた点である。映像復号時刻蓄積部２００は、復号時刻情報１３ａおよび表示時刻情報１３ｂを含む映像時刻情報１３を保持し、映像フレーム先頭検知通知１８または映像フレーム復号完了通知２１を受けてその時点での基準時刻情報９を保持する記憶部であり、蓄積した各時刻情報を含む蓄積映像時刻情報２０１を映像表示時刻判定部２０２に出力する。映像表示時刻判定部２０２は、受信した蓄積映像時刻情報２０１から映像表示タイミングを算出して映像出力制御情報２４を出力する機能部である。その他の構成については実施の形態１の構成と対応する部分には同一符号を付してその説明を省略する。 Embodiment 2. FIG.
FIG. 5 is a block diagram showing a media receiver 3a according to Embodiment 2 of the present invention.
The difference between the media receiving device 3a of the second embodiment and the media receiving device 3 of the first embodiment is that the video decoding time accumulating unit 200 and the video display time determination are used instead of the video decoding time calculating unit 23 in the first embodiment. This is a point provided with a section 202. The video decoding time accumulation unit 200 holds the video time information 13 including the decoding time information 13a and the display time information 13b, receives the video frame head detection notification 18 or the video frame decoding completion notification 21, and the reference time information at that time 9, and outputs accumulated video time information 201 including each accumulated time information to the video display time determination unit 202. The video display time determination unit 202 is a functional unit that calculates video display timing from the received accumulated video time information 201 and outputs the video output control information 24. For other configurations, parts corresponding to those of the first embodiment are denoted by the same reference numerals, and description thereof is omitted.

図６は、この発明の実施の形態２における映像復号処理と映像表示処理に関する時間経過を示す説明図である。
図６において、当該映像フレームに対する映像フレーム先頭検知から映像復号完了までの期間を復号処理[0]、復号処理[1]、復号処理[2]・・・と記載している。同じく当該映像フレームに対して、映像信号を出力している期間を表示処理[0]、表示処理[1]、表示処理[2]・・・と記載している。このとき伝送パケット中から抽出した表示時刻情報とは異なる時間タイミングであり、詳細については後述する。 FIG. 6 is an explanatory diagram showing the passage of time regarding the video decoding process and the video display process according to the second embodiment of the present invention.
In FIG. 6, the period from the detection of the start of the video frame to the completion of video decoding for the video frame is described as decoding processing [0], decoding processing [1], decoding processing [2]. Similarly, a period during which a video signal is output for the video frame is described as display processing [0], display processing [1], display processing [2]. At this time, the timing is different from the display time information extracted from the transmission packet, and details will be described later.

次に、実施の形態２の動作について説明する。メディア受信装置３ａでの基本的な動作は実施の形態１と同様であり、相違点となる動作を述べると、伝送パケット解析分離部１１にて伝送パケット７のメディアヘッダ中にある復号時刻情報、表示時刻情報を抽出したとき、当該映像フレームを識別するためのＩＤを付与して映像時刻情報１３を生成して映像復号時刻蓄積部２００へ通知する。また、伝送パケット解析分離部１１は、伝送パケット７から符号化映像メディア情報１４を抽出して符号化映像メディア解析部１７に通知する際にも映像フレーム識別ＩＤを含めて通知する。 Next, the operation of the second embodiment will be described. The basic operation in the media receiving device 3a is the same as that in the first embodiment, and the operation that is different is described. Decoding time information in the media header of the transmission packet 7 in the transmission packet analyzing / separating unit 11; When the display time information is extracted, an ID for identifying the video frame is assigned, and video time information 13 is generated and notified to the video decoding time storage unit 200. The transmission packet analysis / separation unit 11 also notifies the encoded video media analysis unit 17 including the video frame identification ID when the encoded video media information 14 is extracted from the transmission packet 7 and notified to the encoded video media analysis unit 17.

符号化映像メディア解析部１７では符号化映像メディア情報１４から映像フレームの区切れ目を探索し、映像フレームの区切れ目を検出したときには映像フレーム識別ＩＤと共に映像フレーム先頭検知通知１８を映像復号時刻蓄積部２００へ通知する、そして、映像フレームの区切れ目を明示した形式で符号化映像フレーム情報１９に映像フレーム識別ＩＤを含めて符号化映像メディア復号部２０へ通知する。 The encoded video media analysis unit 17 searches the encoded video media information 14 for a video frame break, and when a video frame break is detected, the video frame head detection notification 18 is sent together with the video frame identification ID. 200, and the encoded video frame information 19 including the video frame identification ID is notified to the encoded video media decoding unit 20 in a format in which the video frame breaks are clearly indicated.

符号化映像メディア復号部２０では符号化映像フレーム情報１９を映像復号処理し、映像フレームの映像復号が完了したときには映像フレーム識別ＩＤと共に映像フレーム復号完了通知２１を映像復号時刻蓄積部２００へ通知する。映像復号時刻蓄積部２００では、映像時刻情報１３と、映像フレーム先頭検知通知１８および映像フレーム復号完了通知２１を通知されたタイミングでそれぞれ基準時刻情報９を取り込んだ映像フレーム先頭検知時刻および映像フレーム復号完了時刻について映像フレーム識別ＩＤを用いて関連付けて保持する。そして各時刻情報を取り纏めて蓄積映像時刻情報２０１として映像表示時刻判定部２０２へ通知する。 The encoded video media decoding unit 20 performs video decoding processing on the encoded video frame information 19. When the video decoding of the video frame is completed, a video frame decoding completion notification 21 is notified to the video decoding time storage unit 200 together with the video frame identification ID. . The video decoding time accumulating unit 200 receives the video time information 13, the video frame head detection notification 18 and the video frame decoding completion notification 21 at the timing of the notification, and the video frame head detection time and the video frame decoding, respectively, incorporating the reference time information 9. The completion time is stored in association with the video frame identification ID. Each time information is collected and notified to the video display time determination unit 202 as accumulated video time information 201.

映像表示時刻判定部２０２では、蓄積映像時刻情報２０１を用いて映像出力制御情報２４を生成する。その生成方法を以下詳細に説明する。
まず、各映像フレームに対する表示時刻情報１３ｂおよび必要ならば符号化映像メディア情報中のパラメータに基づいて映像フレームの表示順序を決定し、その表示順序に従って映像フレーム復号完了時刻を参照していく。図６に示す例では説明を分かり易くするため、このような表示順序に従った並び替えが発生しないケースで図示しているが、並び替えが発生した場合でも復号完了時刻と表示開始時刻の関係は変わらず矢印線で示す対応関係が交差するだけとなる。なお、表示開始時刻はメディア送信装置１が伝達した理想モデルでの表示時刻情報ではなく、実施の形態１で説明したメディア受信装置３で実際に映像信号の出力を開始する時間を意味している。 The video display time determination unit 202 generates video output control information 24 using the stored video time information 201. The generation method will be described in detail below.
First, the display order of video frames is determined based on the display time information 13b for each video frame and, if necessary, the parameters in the encoded video media information, and the video frame decoding completion time is referenced according to the display order. In the example shown in FIG. 6, for ease of explanation, the case where such rearrangement according to the display order does not occur is illustrated, but the relationship between the decoding completion time and the display start time even when rearrangement occurs. Does not change, and the correspondence indicated by the arrow lines intersects. Note that the display start time is not the display time information in the ideal model transmitted by the media transmission device 1, but means the time when the media reception device 3 described in the first embodiment actually starts outputting the video signal. .

図６において、符号化メディア映像情報を構成するｎ番目の映像フレームの復号完了時刻をtd[n]、同じくｎ番目の映像フレームの表示開始時刻をtp[n]とすると、復号完了時刻から復号開始時刻までの時間差は、
Δt[n]=tp[n]-td[n]
で表される。 In FIG. 6, when the decoding completion time of the nth video frame constituting the encoded media video information is td [n] and the display start time of the nth video frame is tp [n], decoding is performed from the decoding completion time. The time difference to the start time is
Δt [n] = tp [n] -td [n]
It is represented by

ここで各映像フレームに対する表示開始時刻tp[n]は映像フレーム周期の刻みで増加する時刻となるため、任意の基準点からの相対時間で記述することが可能であり、例えば最初の映像フレームの復号完了時刻td[0]を基準にして映像フレーム周期時間T、オフセット時間αとすると、
tp[n]=td[0]+nT+α
と表現できる。 Here, since the display start time tp [n] for each video frame is a time that increases in increments of the video frame period, it can be described in a relative time from an arbitrary reference point, for example, the first video frame. When the video frame cycle time T and the offset time α are based on the decoding completion time td [0],
tp [n] = td [0] + nT + α
Can be expressed.

映像表示開始タイミングは映像フレーム周期の刻みである必要があるため任意のタイミングにはできず、また映像復号完了タイミングよりも映像表示開始タイミングが早くなってしまうと表示すべき映像フレームが存在せず、正しい映像フレームが表示できないこととなる。そのため、必ず当該映像フレームに対する映像復号完了タイミングよりも映像表示開始タイミングが後になるように映像表示開始タイミングが発生するように制御する必要がある。
従って、映像フレームの復号完了時刻から映像フレームの表示開始時刻までの遅延時間を最小にするためには、
min[Δt[n]]＞０なお、min[ ]はΔt[n]の集合の最小値を意味する。
を満たす必要がある。 The video display start timing must be in increments of the video frame period, so it cannot be set to any timing, and if the video display start timing is earlier than the video decoding completion timing, there is no video frame to be displayed. Therefore, the correct video frame cannot be displayed. Therefore, it is necessary to perform control so that the video display start timing is generated so that the video display start timing is later than the video decoding completion timing for the video frame.
Therefore, in order to minimize the delay time from the video frame decoding completion time to the video frame display start time,
min [Δt [n]]> 0 Note that min [] means the minimum value of the set of Δt [n].
It is necessary to satisfy.

予め設定した探索期間、あるいは一般的には符号化映像フレームは約０．５秒周期や１秒周期といった固定時間周期で映像符号化制御を変更して伝送されるので、それを上回る時間で決めた探索期間において、上記条件を満たすときをn＝kとすると、
Δt[N]=tp[k]-td[k]
=(td[0]+kT+α)-td[k]
このとき、遅延時間の最小は０に近いことが理想であるので、
Δt[k]≒0
とすると、オフセット時間αは
(td[0]+kT+α)-td[k]=0
α=(td[k]-td[0])-kT
と算出できる。 Since the encoded video frame is transmitted by changing the video encoding control in a fixed search period such as about 0.5 second period or 1 second period in general, it is determined by the time exceeding it. In the search period, when n = k is satisfied when the above condition is satisfied,
Δt [N] = tp [k] -td [k]
= (td [0] + kT + α) -td [k]
At this time, since the minimum delay time is ideally close to 0,
Δt [k] ≒ 0
Then, the offset time α is
(td [0] + kT + α) -td [k] = 0
α = (td [k] -td [0])-kT
And can be calculated.

従って、各映像フレームの表示開始時間は、
tp[n]=td[0]+nT+α
=(td[0]+nT)+(td[k]-td[0])-kT
=(n-k)×T+td[k]
で決定される。
映像出力タイミング生成部２５では、このようにして映像表示開始タイミングを決定した後、探索期間の最終映像フレームをｍとしたとき、
tp[m]=(m-k)×T+td[k]
で算出された表示開始時刻と、メディア受信装置３ａ内の基準時刻情報９で分配される現在の基準時刻と比較して一致したタイミングから映像表示を開始して、映像フレーム周期Ｔ毎に映像出力タイミング通知２６を生成する。 Therefore, the display start time of each video frame is
tp [n] = td [0] + nT + α
= (td [0] + nT) + (td [k] -td [0])-kT
= (nk) × T + td [k]
Determined by
In the video output timing generation unit 25, after determining the video display start timing in this way, when the final video frame in the search period is m,
tp [m] = (mk) × T + td [k]
The video display is started at a timing coincident with the display start time calculated in step 1 and the current reference time distributed by the reference time information 9 in the media receiver 3a, and the video output is performed every video frame period T. A timing notification 26 is generated.

また、映像復号完了タイミングに依存して映像表示開始タイミングが決定されるため、メディア送信装置１から伝送された符号化映像メディア情報が同一特性を有していれば、映像復号処理、映像表示処理までの遅延時間制御も同一傾向の結果となる。従って、映像復号完了タイミングに対して映像表示開始タイミングが前後してメディア送信装置１とメディア受信装置３間での映像入力から映像出力での遅延時間についても、電源Ｏｎ／Ｏｆｆ、あるいは伝送ケーブルの抜き差しなどの操作毎にブレが発生せず安定した動作が実現できる。
このとき、表示時刻として適用する値として表示時刻情報１３ｂをそのまま適用しても相対的な時間の差異があるだけで、同様の算出方法によって復号完了時刻から映像出力までの経過時間を最短とする映像出力表示タイミングを決定することができる。 Since the video display start timing is determined depending on the video decoding completion timing, if the encoded video media information transmitted from the media transmission device 1 has the same characteristics, the video decoding process and the video display process are performed. The delay time control until is also the result of the same tendency. Therefore, the delay time from the video input to the video output between the media transmitting device 1 and the media receiving device 3 before and after the video decoding start timing is also about the power On / Off or the transmission cable. Stable operation can be realized without blurring for each operation such as insertion and removal.
At this time, even if the display time information 13b is directly applied as a value to be applied as the display time, there is only a relative time difference, and the elapsed time from the decoding completion time to the video output is minimized by the same calculation method. The video output display timing can be determined.

以上のように、実施の形態２のメディア受信装置によれば、映像復号時間算出手段は、伝送パケット解析分離手段で抽出した映像フレームに対する表示時刻情報が示す表示時刻と、符号化映像メディア復号手段で映像フレームに対する映像フレーム復号を完了した復号完了時刻とを蓄積する映像復号時刻蓄積部と、映像フレームにおける復号完了時刻から表示時刻までの差分時間を算出し、差分時間の算出を所定期間複数の映像フレームで行い、所定期間内の差分時間に基づいて、復号完了から映像出力までの経過時間を最短とする映像出力タイミングを決定する映像表示時刻判定部とを備えたので、一定期間における復号完了時刻から表示時刻までの差分値の遷移によって伝送パケットを受信してから符号化映像メディア情報の抽出、復号、そして映像出力までにかかる処理遅延時間を一定かつ縮小する映像出力タイミングを決定することができる。 As described above, according to the media receiver of the second embodiment, the video decoding time calculation means includes the display time indicated by the display time information for the video frame extracted by the transmission packet analysis and separation means, and the encoded video media decoding means. The video decoding time storage unit for storing the decoding completion time when the video frame decoding for the video frame is completed, and the difference time from the decoding completion time to the display time in the video frame are calculated. A video display time determination unit that determines the video output timing that minimizes the elapsed time from the completion of decoding to video output based on the difference time within a predetermined period based on the video frame, so that decoding is completed in a certain period Extraction and decoding of encoded video media information after receiving transmission packet by transition of difference value from time to display time And it is possible to determine the video output timing of constant and reduce such processing delay time until image output.

また、実施の形態２のメディア受信装置によれば、符号化映像メディア復号手段は、符号化映像フレーム情報から映像フレーム間隔を抽出して映像フレームに対する表示開始予定時刻を算出すると共に、映像復号時間算出手段は、伝送パケット解析分離手段で抽出した映像フレームに対する表示時刻情報が示す表示時刻と、符号化映像メディア復号手段から通知された映像フレームに対する復号完了時刻とを蓄積する映像復号時刻蓄積部と、映像フレームに対する復号完了時刻から、映像フレームに対する表示開始予定時刻までの差分時間を所定期間複数の映像フレームで行い、所定期間内の差分時間に基づいて、復号完了から映像出力までの経過時間を最短とする映像出力タイミングを決定する映像表示時刻判定部とを備えたので、一定期間における復号完了時刻から表示開始予定時刻までの差分値の遷移によって伝送パケットを受信してから符号化映像メディア情報の抽出、復号、そして映像出力までにかかる処理遅延時間を一定かつ縮小する映像出力タイミングを決定することができる。 In addition, according to the media receiver of the second embodiment, the encoded video media decoding means calculates the display start scheduled time for the video frame by extracting the video frame interval from the encoded video frame information, and the video decoding time. A calculating means for storing a display time indicated by display time information for the video frame extracted by the transmission packet analyzing and separating means and a decoding completion time for the video frame notified from the encoded video media decoding means; The difference time from the decoding completion time for the video frame to the scheduled display start time for the video frame is performed for a plurality of video frames for a predetermined period, and the elapsed time from the decoding completion to the video output is determined based on the difference time within the predetermined period. It has a video display time determination unit that determines the shortest video output timing. Video output timing for constant and reducing the processing delay time from the reception of the transmission packet by the transition of the difference value from the decoding completion time to the display start scheduled time in the video to the extraction, decoding and video output of the encoded video media information Can be determined.

実施の形態３．
実施の形態３におけるメディア受信装置３ａの図面上の構成は実施の形態２と同様であるため、図５を用いて説明する。
実施の形態３の映像復号時刻蓄積部２００は、符号化映像メディア解析部１７から通知された映像フレーム先頭検知時刻と、符号化映像メディア復号部２０から通知された復号完了時刻とを蓄積する。また、映像表示時刻判定部２０２は、伝送パケット解析分離部１１で抽出した復号時刻情報１３ａと、映像フレーム先頭検知時刻との差分値を算出して、この差分値が極大となるピークを探索し、ピークの検知位置と検知回数に基づいて決定した期間内で、復号完了時刻から映像出力までの経過時間を最短とする映像出力タイミングを決定するよう構成されている。その他の構成は実施の形態２と同様である。 Embodiment 3 FIG.
Since the configuration of the media receiving device 3a in the third embodiment in the drawing is the same as that in the second embodiment, it will be described with reference to FIG.
The video decoding time accumulation unit 200 according to the third embodiment accumulates the video frame head detection time notified from the encoded video media analysis unit 17 and the decoding completion time notified from the encoded video media decoding unit 20. In addition, the video display time determination unit 202 calculates a difference value between the decoding time information 13a extracted by the transmission packet analysis / separation unit 11 and the video frame head detection time, and searches for a peak where the difference value becomes maximum. The video output timing that minimizes the elapsed time from the decoding completion time to the video output is determined within a period determined based on the peak detection position and the number of detections. Other configurations are the same as those of the second embodiment.

図７は、この発明の実施の形態３によるメディア受信装置３ａでの映像フレーム後の映像フレーム先頭検知時刻から映像復号完了時刻までの経過時間の遷移の一例を示す図であり、この図を用いて実施の形態３の動作について説明する。
メディア受信装置３ａでの基本的な動作は実施の形態２と同様であり、相違点となる映像表示時刻蓄積部２００における動作を説明する。
先に述べたような映像符号化方式では、映像符号化効率の向上と映像復号異常からの早期復帰を両立させるため、一定の周期で画面内符号化方式による符号化映像フレーム、つまり当該映像フレームに対する符号化映像メディア情報だけで映像復号処理して映像フレームを復元できるようにし、他の映像フレームでは画面間符号化方式による符号化映像フレーム、つまり隣接する映像フレーム間は類似性が高いことを利用してその差分だけを符号化して情報量を縮減する代わりに複数の符号化映像メディア情報を用いた映像復号処理をして初めて映像フレームを復元できるようにしている。 FIG. 7 is a diagram showing an example of the transition of the elapsed time from the video frame head detection time after the video frame to the video decoding completion time in the media receiver 3a according to Embodiment 3 of the present invention. The operation of the third embodiment will be described.
The basic operation in the media receiving device 3a is the same as that in the second embodiment, and the operation in the video display time accumulating unit 200 as a difference will be described.
In the video encoding system as described above, in order to achieve both improvement in video encoding efficiency and early recovery from video decoding abnormality, a video frame encoded by the intra-screen encoding system at a certain period, that is, the video frame It is possible to restore the video frame by decoding the video only with the encoded video media information for the other video frames, and the other video frames have high similarity between the encoded video frames by the inter-screen coding method, that is, the adjacent video frames. Instead of encoding only the difference and reducing the amount of information, the video frame can be restored only after video decoding processing using a plurality of encoded video media information.

その結果、画面内符号化方式による符号化映像フレームは比較的データ量が多く、その伝送時間も長くなるため、当該映像フレームの伝送パケット受信から映像復号完了までに多くの時間を必要とする傾向がある。逆に、画面間符号化方式では比較的データ量が少なく済み、その伝送時間も短くなるため、当該映像フレームの伝送パケット受信から映像復号完了までが少ない時間で済む傾向がある。 As a result, encoded video frames based on the intra-picture encoding method have a relatively large amount of data and a long transmission time, and therefore, a tendency to require a long time from reception of a transmission packet of the video frame to completion of video decoding. There is. On the other hand, since the inter-picture encoding method requires a relatively small amount of data and the transmission time thereof is shortened, there is a tendency that less time is required from the transmission packet reception of the video frame to the completion of video decoding.

このような特性の異なる符号化方式を一定の周期で組み合わせて映像符号化処理がなされているため、図７で示すように映像フレームに対する映像フレーム先頭検知から復号完了までの経過時間にはその一定周期に応じた粗密が発生することになる。そこで、当該符号化映像フレームに対する映像フレーム先頭検知時刻と復号完了時刻との差分値の遷移を統計処理して平均的な範囲を算出して、その遷移が平均的な範囲を超えて差分値が大きくなるピークを検知して、その検知回数で決定する。周期性が一巡以上する必要があるため最低２回の検知、探索期間が長い方が映像表示時刻判定部２０２での判定精度が向上するため、判定までの時間が問題にならない範囲でより多くの検知回数を設定することが望ましい。 Since the video encoding process is performed by combining such encoding methods having different characteristics at a constant period, the elapsed time from the detection of the start of the video frame to the completion of the decoding is fixed as shown in FIG. Roughness corresponding to the cycle occurs. Therefore, the transition of the difference value between the video frame head detection time and the decoding completion time for the encoded video frame is statistically processed to calculate an average range, and the transition value exceeds the average range and the difference value is A peak that becomes large is detected and determined by the number of detections. Since the periodicity needs to be more than one round, the detection accuracy at the video display time determination unit 202 is improved when the detection period is longer at least twice and the search period is longer. It is desirable to set the number of detections.

映像復号時刻蓄積部２００は、このようにして決定した探索期間に対する蓄積映像時刻情報２０１を生成して映像表示時刻判定部２０２へ通知する。映像表示時刻判定部２０２では通知された映像時刻情報２０１内の時刻情報から、映像フレームの復号完了時刻から映像フレームの表示開始時刻までの遅延時間を探索して、その遅延時間を最小とする制御を実施する映像出力タイミングを決定して映像出力制御情報２４を生成する。これにより伝送パケット２から抽出した符号化メディア情報から自動的に映像出力タイミングを生成するための探索期間を決定することができるようになる。 The video decoding time storage unit 200 generates the stored video time information 201 for the search period determined in this way, and notifies the video display time determination unit 202 of it. The video display time determination unit 202 searches the time information in the notified video time information 201 for a delay time from the video frame decoding completion time to the video frame display start time, and controls to minimize the delay time The video output control information 24 is generated by determining the video output timing for performing the above. As a result, the search period for automatically generating the video output timing can be determined from the encoded media information extracted from the transmission packet 2.

以上のように、実施の形態３のメディア受信装置によれば、映像復号時刻蓄積部は、符号化映像メディア解析手段から通知された映像フレーム先頭検知時刻と、符号化映像メディア復号手段から通知された復号完了時刻とを蓄積し、映像表示時刻判定部は、伝送パケット解析分離手段で抽出した復号時刻情報と、映像フレーム先頭検知時刻との差分値を算出して、差分値が極大となるピークを探索し、ピークの検知位置と検知回数に基づいて決定した期間内で、復号完了時刻から映像出力までの経過時間を最短とする映像出力タイミングを決定するようにしたので、伝送パケット受信から映像復号、映像表示までの時間を短縮して映像表示遅延時間を少なくし、かつ同一特性の符号化映像メディア情報の受信に対して映像表示タイミングを同一にした映像表示を実施するメディア受信装置を実現することができる。 As described above, according to the media receiver of the third embodiment, the video decoding time storage unit is notified from the video frame head detection time notified from the encoded video media analyzing unit and from the encoded video media decoding unit. The video display time determination unit calculates the difference value between the decoding time information extracted by the transmission packet analysis and separation means and the video frame head detection time, and the peak value at which the difference value is maximized is stored. The video output timing that minimizes the elapsed time from the decoding completion time to the video output is determined within the period determined based on the peak detection position and the number of detections. The time until decoding and video display is shortened to reduce the video display delay time, and the video display timing is the same for reception of encoded video media information having the same characteristics. It is possible to realize a media receiving apparatus implementing the video display you.

実施の形態４．
実施の形態４におけるメディア受信装置３（３ａ）の図面上の構成は実施の形態１または実施の形態２と同様であるため、実施の形態２の図５を用いて説明する。
実施の形態３の符号化映像メディア復号部２０は、映像復号処理において、符号化映像フレームが画面内符号化方式で構築されているか否かを検知する。また、映像復号時刻蓄積部２００および映像表示時刻判定部２０２では、符号化映像メディア復号部２０において、符号化映像フレームが画面内符号化方式で構築されていることを検知した場合、その検知位置と検知回数に基づいて決定した期間内で、復号完了時刻から映像出力までの経過時間を最短とする映像出力タイミングを決定するよう構成されている。その他の構成は実施の形態１または実施の形態２と同様である。 Embodiment 4 FIG.
Since the configuration of the media receiver 3 (3a) in the fourth embodiment in the drawing is the same as that in the first or second embodiment, it will be described with reference to FIG. 5 of the second embodiment.
The encoded video media decoding unit 20 according to Embodiment 3 detects whether or not the encoded video frame is constructed by the intra-screen encoding method in the video decoding process. Further, in the video decoding time storage unit 200 and the video display time determination unit 202, when the encoded video media decoding unit 20 detects that the encoded video frame is constructed by the intra-screen encoding method, the detection position And the video output timing that minimizes the elapsed time from the decoding completion time to the video output within a period determined based on the number of times of detection. Other configurations are the same as those in the first or second embodiment.

図８は、この発明の実施の形態４によるメディア受信装置３での映像フレーム毎の画面符号化方式の遷移の一例を示す図である。
実施の形態４の動作について説明する。メディア受信装置３での基本的な動作は実施の形態１または実施の形態２と同様であり、相違点となる動作を説明する。
復号化映像メディア復号部２０において符号化映像フレーム情報１９の映像復号処理を実施する際に、その過程で当該映像フレームの符号化方法が先に述べた画面内符号化方式によるものか画面間符号化方式によるものかを識別してそれぞれの方式に応じた処理を実施する。例えば、図８のようにＩピクチャと呼ばれる当該映像フレームだけの情報で符号化処理がなされる画像内符号化方式、Ｐピクチャと呼ばれる過去の映像フレームとの差分情報を用いて符号化処理がなされる片方向予測による画像間符号化方式、Ｂピクチャと呼ばれる過去の映像フレームと未来の映像フレームの両方に対する差分情報を用いて符号化処理がなされる双方向予測による画像間符号化方式を組み合わせて映像符号化処理が実施される。当該映像フレームの情報を直接符号化するよりも差分情報を符号化する方がデータ量を少なくできる一方で、差分を算出する対象となった映像フレームが必要となって伝送エラーなどの復号異常が発生時には正常に復号処理ができなくなる場合も発生する弊害もある。そこで、一定の周期でＩピクチャが存在するように符号化処理がなされる。このような画面内符号化方式か画面間符号化方式かの符号化方式種別を含めて映像フレーム復号完了通知２１を通知する。 FIG. 8 is a diagram showing an example of the transition of the screen encoding method for each video frame in the media receiver 3 according to Embodiment 4 of the present invention.
The operation of the fourth embodiment will be described. The basic operation in the media receiver 3 is the same as that in the first embodiment or the second embodiment, and the operation that is different will be described.
When video decoding processing of the encoded video frame information 19 is performed in the decoded video media decoding unit 20, whether the encoding method of the video frame is in accordance with the above-described intra-screen encoding method in the process or inter-frame code Identify whether the system is based on the conversion method, and perform processing according to each method. For example, as shown in FIG. 8, an intra-picture encoding method in which encoding processing is performed with only information of the video frame called I picture, and encoding processing is performed using difference information from a past video frame called P picture. Combining the inter-picture coding method based on unidirectional prediction, the inter-picture coding method based on bi-prediction in which coding processing is performed using difference information for both past video frames and future video frames called B pictures. A video encoding process is performed. While encoding the difference information can reduce the amount of data rather than directly encoding the information of the video frame, the video frame for which the difference is to be calculated is required and a decoding error such as a transmission error occurs. There is also a problem that the decoding process may not be normally performed at the time of occurrence. Therefore, encoding processing is performed so that I pictures exist at a fixed period. The video frame decoding completion notification 21 is notified including the coding method type, such as the intra-screen coding method or the inter-screen coding method.

映像復号時刻蓄積部２００では各種時刻情報と共に当該映像フレームの符号化方式種別を含めて保持しておき、画面内符号化方式による符号化映像フレームの検知回数で探索時間を決定する。周期性が一巡以上する必要があるため最低２回の検知、探索期間が長い方が映像表示時刻判定部２０２での判定精度が向上するため、判定までの時間が問題にならない範囲でより多くの検知回数を設定することが望ましい。 The video decoding time storage unit 200 holds various types of time information together with the encoding method type of the video frame, and determines the search time based on the number of detections of the encoded video frame by the intra-screen encoding method. Since the periodicity needs to be more than one round, the detection accuracy at the video display time determination unit 202 is improved when the detection period is longer at least twice and the search period is longer. It is desirable to set the number of detections.

映像復号時刻蓄積部２００では、このようにして決定した探索期間に対する蓄積映像時刻情報２０１を生成して映像表示時刻判定部２０２へ通知する。映像表示時刻判定部２０２では通知された映像時刻情報２０１内の時刻情報から、映像フレームの復号完了時刻から映像フレームの表示開始時刻までの遅延時間を探索して、その遅延時間を最小とする制御を実施する映像出力タイミングを決定して映像出力制御情報２４を生成する。これにより伝送パケット２から抽出した符号化メディア情報から自動的に映像出力タイミングを生成するための探索期間を決定することができるようになる。 The video decoding time accumulating unit 200 generates accumulated video time information 201 for the search period determined as described above and notifies the video display time determining unit 202 of the accumulated video time information 201. The video display time determination unit 202 searches the time information in the notified video time information 201 for a delay time from the video frame decoding completion time to the video frame display start time, and controls to minimize the delay time The video output control information 24 is generated by determining the video output timing for performing the above. As a result, the search period for automatically generating the video output timing can be determined from the encoded media information extracted from the transmission packet 2.

以上のように、実施の形態４のメディア受信装置によれば、符号化映像メディア復号手段での映像復号処理において、符号化映像フレームが画面内符号化方式で構築されていることを検知した場合、映像復号時間算出手段は、その検知位置と検知回数に基づいて決定した期間内で、復号完了時刻から映像出力までの経過時間を最短とする映像出力タイミングを決定するようにしたので、伝送パケット受信から映像復号、映像表示までの時間を短縮して映像表示遅延時間を少なくし、かつ同一特性の符号化映像メディア情報の受信に対して映像表示タイミングを同一にした映像表示を実施するメディア受信装置を実現することができる。 As described above, according to the media receiver of the fourth embodiment, when it is detected in the video decoding process by the encoded video media decoding means that the encoded video frame is constructed by the intra-screen encoding method. The video decoding time calculation means determines the video output timing that minimizes the elapsed time from the decoding completion time to the video output within the period determined based on the detection position and the number of detections. Media reception that shortens the time from reception to video decoding and video display to reduce video display delay time, and performs video display with the same video display timing for reception of encoded video media information with the same characteristics An apparatus can be realized.

実施の形態５．
実施の形態５における図面上の構成は実施の形態１または実施の形態２と同様であるため、実施の形態１の図１を用いて説明する。
図９は、この発明の実施の形態５で扱う伝送パケット２において制御情報を含むパケット構造の説明図である。
図９は、メディア送信装置１から送信される伝送パケット２中に制御情報を格納して伝送する場合の一例としてMPEG-2 Systems規格で定義するＴＳパケット構造における映像、音声などのメディア情報に関する制御情報であるProgram_Map_Table（ＰＭＴ）に関して実施の形態５の説明に必要なパラメータをビット配置で示したものである。各パラメータの意味は実施の形態５の特徴点に直接関係しないため詳細な説明は省略するが、図９中において太枠線で囲んだ領域が映像、音声などのメディア情報に対する個別の定義を格納する領域であり、メディア情報の種別数分だけ本領域が列挙されて存在している。 Embodiment 5 FIG.
Since the configuration of the fifth embodiment on the drawing is the same as that of the first or second embodiment, a description will be given with reference to FIG. 1 of the first embodiment.
FIG. 9 is an explanatory diagram of a packet structure including control information in the transmission packet 2 handled in the fifth embodiment of the present invention.
FIG. 9 shows control related to media information such as video and audio in the TS packet structure defined by the MPEG-2 Systems standard as an example in the case of storing control information in a transmission packet 2 transmitted from the media transmission device 1 and transmitting it. The parameters necessary for the explanation of the fifth embodiment regarding the Program_Map_Table (PMT) which is information are shown in bit arrangement. Although the meaning of each parameter is not directly related to the feature points of the fifth embodiment, detailed description thereof will be omitted. In FIG. 9, the area surrounded by a thick frame line stores individual definitions for media information such as video and audio. This area is enumerated as many as the number of types of media information.

メディア種別はstream_typeによって指定されて、そのメディアが格納される伝送パケットを識別するためのＩＤがメディア_PIDによって指定されて、そのメディアに対する付加情報はES_Info領域に格納される。ES_info領域にはディスクリプタ形式と呼ばれる識別子、情報長、そして情報データ列からなる構造で繰り返し各種情報を埋め込むことが可能である。このES_info領域に、伝送パケットに格納する符号化映像メディア情報の最大経過時間情報を埋め込んで制御情報パケットを生成する。 The media type is specified by stream_type, the ID for identifying the transmission packet storing the media is specified by media_PID, and additional information for the media is stored in the ES_Info area. In the ES_info area, various information can be repeatedly embedded with a structure including an identifier called a descriptor format, an information length, and an information data string. A control information packet is generated by embedding the maximum elapsed time information of the encoded video media information stored in the transmission packet in this ES_info area.

伝送パケット解析分析部１１において、伝送パケット７を解析する際に上記制御情報であるＰＭＴを検出し、ＰＭＴ中にあるES_info領域から符号化映像メディア情報に対する映像付加情報を取り出す。そして、表示時刻情報１３ｂを通知する際に符号化映像メディア付加情報を含めて映像復号時間算出部２３へ通知しても良いし、映像付加情報を検出したときに別途通知して良い。 The transmission packet analysis / analysis unit 11 detects the PMT that is the control information when analyzing the transmission packet 7 and extracts video additional information for the encoded video media information from the ES_info area in the PMT. Then, when notifying the display time information 13b, it may be notified to the video decoding time calculation unit 23 including the encoded video media additional information, or may be separately notified when the video additional information is detected.

映像復号時間算出部２３では、符号化映像メディア付加情報が通知されたタイミングに該当する符号化映像フレームが、通知された最大経過時間に対応する映像フレームと特定し、その最大経過時間情報に基づいて、復号完了時刻から映像出力までの経過時間を最短とする映像出力タイミングを決定して映像出力制御情報２４を生成する。 In the video decoding time calculation unit 23, the encoded video frame corresponding to the timing at which the encoded video media additional information is notified is identified as the video frame corresponding to the notified maximum elapsed time, and based on the maximum elapsed time information. Thus, the video output control information 24 is generated by determining the video output timing that minimizes the elapsed time from the decoding completion time to the video output.

以上のように、実施の形態５のメディア受信装置によれば、伝送パケット解析分離手段は、メディア送信装置から出力される伝送パケットの制御情報パケットに、映像フレームの復号開始から復号完了までにかかる最大経過時間情報が含まれている場合、制御情報パケットを解析して最大経過時間情報を抽出し、映像復号時間算出手段は、伝送パケット解析分離手段における抽出タイミングに基づいて最大経過時間情報に該当する映像フレームを特定し、符号化映像メディア解析手段で映像フレーム先頭を検知した時点での基準時刻が示す映像フレーム先頭検知時刻と制御情報パケットで通知された最大経過時間情報によって映像出力タイミングを決定するようにしたので、伝送パケット受信から映像復号、映像表示までの時間を短縮して映像表示遅延時間を少なくし、かつ同一特性の符号化映像メディア情報の受信に対して映像表示タイミングを同一にした映像表示を実施するメディア受信装置を実現することができる。 As described above, according to the media receiving apparatus of the fifth embodiment, the transmission packet analyzing / separating means is applied to the control information packet of the transmission packet output from the media transmitting apparatus from the start of decoding of the video frame to the completion of decoding. If the maximum elapsed time information is included, the control information packet is analyzed to extract the maximum elapsed time information, and the video decoding time calculation means corresponds to the maximum elapsed time information based on the extraction timing in the transmission packet analysis and separation means The video output timing is determined based on the video frame head detection time indicated by the reference time when the encoded video media analysis means detects the video frame head and the maximum elapsed time information notified in the control information packet. As a result, the time from receiving the transmission packet to video decoding and video display is shortened. It is possible to reduce the display delay time, and to realize the media receiving apparatus implementing the video display in which the image display timing in the same for the reception of coded video media information of the same characteristics.

なお、本願発明はその発明の範囲内において、各実施の形態の自由な組み合わせ、あるいは各実施の形態の任意の構成要素の変形、もしくは各実施の形態において任意の構成要素の省略が可能である。 In the present invention, within the scope of the invention, any combination of the embodiments, or any modification of any component in each embodiment, or omission of any component in each embodiment is possible. .

１メディア送信装置、２伝送パケット、３，３ａメディア受信装置、４映像表示装置、５伝送パケット受信部、６伝送パケット到着通知、７伝送パケット、８基準時刻生成部、９基準時刻情報、１０基準時刻周波数信号、１１伝送パケット解析分離部、１２時刻再生情報、１３映像時刻情報、１４符号化映像メディア情報、１５基準時刻再生部、１６基準時刻情報、１７符号化映像メディア解析部、１８映像フレーム先頭検知通知、１９符号化映像フレーム情報、２０符号化映像メディア復号部、２１映像フレーム復号完了通知、２２映像信号、２３映像復号時間算出部、２４映像出力制御情報、２５映像出力タイミング生成部、２６映像出力タイミング通知、２７映像信号後処理部、２８映像出力信号、２００映像復号時刻蓄積部、２０１蓄積映像時刻情報、２０２映像表示時刻判定部。 DESCRIPTION OF SYMBOLS 1 Media transmission device, 2 Transmission packet, 3, 3a Media reception device, 4 Video display device, 5 Transmission packet reception part, 6 Transmission packet arrival notification, 7 Transmission packet, 8 Reference time generation part, 9 Reference time information, 10 Reference Time frequency signal, 11 Transmission packet analysis / separation unit, 12 Time playback information, 13 Video time information, 14 Encoded video media information, 15 Reference time playback unit, 16 Reference time information, 17 Encoded video media analysis unit, 18 Video frame Start detection notification, 19 encoded video frame information, 20 encoded video media decoding unit, 21 video frame decoding completion notification, 22 video signal, 23 video decoding time calculation unit, 24 video output control information, 25 video output timing generation unit, 26 Video output timing notification, 27 Video signal post-processing unit, 28 Video output No., 200 video decoding time storage unit, 201 stored video time information, 202 a video display time determination unit.

Claims

Time reproduction information indicating a reference time of operation in a media transmitting apparatus that stores and outputs encoded video media information obtained by encoding a video signal in a transmission packet, and indicates a timing to start decoding the encoded video media information Video time information including decoding time information and display time information indicating a timing at which a video signal after video decoding is to be output is a media reception device that receives a transmission packet attached together with the encoded video media information,
A transmission packet analyzing / separating means for analyzing the received transmission packet and extracting the time reproduction information, the encoded video media information, and the video time information;
The reference time information in the own device is synchronized with the time reproduction information by comparing the reference time information of the operation in the own device at the time when the transmission packet is received and the time reproduction information extracted by the transmission packet analyzing / separating means. Reference time reproduction means for generating reference time control information to
Based on the reference time control information, the oscillation frequency of the reference signal serving as the reference time in the device itself is adjusted to control the advance of the reference time, and the reference time information indicating the controlled reference time and the adjusted time are adjusted. A reference time generating means for distributing a reference frequency signal indicating an oscillation frequency within the apparatus;
Analyzing the encoded video media information, detecting and notifying the beginning of the video frame, and encoding video media analysis means for outputting as encoded video frame information;
Decoding the encoded video frame information to restore the video signal, and an encoded video media decoding means for notifying the completion of decoding when decoding of the video frame is completed;
Video frame head detection time indicated by the reference time when the detection of the start of the video frame is notified, decoding completion time indicated by the reference time when the completion of decoding of the video frame is notified, and the display for the video frame A media receiving apparatus comprising: a video decoding time calculating unit that compares time information and calculates a video output timing that minimizes an elapsed time from detection of a video frame head to video output.

The video decoding time calculation means
A video decoding time storage unit for storing the display time indicated by the display time information for the video frame extracted by the transmission packet analyzing and separating unit and the decoding completion time when the video frame decoding for the video frame is completed by the encoded video media decoding unit; ,
A difference time from the decoding completion time to the display time in the video frame is calculated, the difference time is calculated for a plurality of video frames for a predetermined period, and based on the difference time within the predetermined period, the video from the decoding completion to the video time is calculated. The media receiving apparatus according to claim 1, further comprising: a video display time determination unit that determines a video output timing that minimizes an elapsed time until output.

The encoded video media decoding means calculates the display start scheduled time for the video frame by extracting the video frame interval from the encoded video frame information,
The video decoding time calculation means
A video decoding time storage unit that stores a display time indicated by display time information for the video frame extracted by the transmission packet analysis and separation unit, and a decoding completion time for the video frame notified from the encoded video media decoding unit;
The difference time from the decoding completion time for the video frame to the scheduled display start time for the video frame is performed for a plurality of video frames for a predetermined period, and the elapsed time from the completion of decoding to video output based on the difference time within the predetermined period The media receiving apparatus according to claim 1, further comprising a video display time determination unit that determines a video output timing that minimizes time.

The video decoding time storage unit stores the video frame head detection time notified from the encoded video media analyzing unit and the decoding completion time notified from the encoded video media decoding unit,
The video display time determination unit calculates a difference value between the decoding time information extracted by the transmission packet analysis and separation unit and the video frame head detection time, searches for a peak where the difference value is maximum, 4. The media reception according to claim 2, wherein a video output timing that minimizes an elapsed time from decoding completion time to video output is determined within a period determined based on the detection position and the number of detections. apparatus.

In the video decoding process in the encoded video media decoding means, when it is detected that the encoded video frame is constructed by the intra-screen encoding method, the video decoding time calculating means is based on the detection position and the number of detections. 5. The media receiving apparatus according to claim 1, wherein a video output timing that minimizes an elapsed time from decoding completion time to video output is determined within the determined period. 6. .

The transmission packet analysis / separation means analyzes the control information packet when the control information packet of the transmission packet output from the media transmission device includes the maximum elapsed time information from the start of decoding of the video frame to the completion of decoding. To extract the maximum elapsed time information,
The video decoding time calculating means specifies a video frame corresponding to the maximum elapsed time information based on the extraction timing in the transmission packet analyzing and separating means, and a reference when the encoded video media analyzing means detects the head of the video frame. 6. The media reception according to claim 1, wherein the video output timing is determined based on the video frame head detection time indicated by the time and the maximum elapsed time information notified by the control information packet. apparatus.