JP7247707B2

JP7247707B2 - Transmission node, broadcasting station system, control node and transmission control method

Info

Publication number: JP7247707B2
Application number: JP2019062658A
Authority: JP
Inventors: 俊喜佐藤; 正幸菅原
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 2019-03-28
Filing date: 2019-03-28
Publication date: 2023-03-29
Anticipated expiration: 2039-03-28
Also published as: JP2020162090A

Description

本開示は、送信ノード、放送局システム、制御ノード及び送信制御方法に関する。 The present disclosure relates to a transmission node, a broadcasting station system, a control node, and a transmission control method.

既存の放送局の多くは、映像、音声及び補助（ancillary）データといった放送素材を局内で伝送するための専用ネットワークを有している。専用ネットワークには、例えば、カメラ及びマイクロフォンといったキャプチャデバイス、コンテンツデータを蓄積するストレージサーバ、コンテンツを再生する再生デバイス、及び伝送される信号の形式を変換するゲートウェイデバイスといった、様々な装置が接続される。専用ネットワーク上での放送素材の伝送のための信号形式として、ＳＤＩ（Serial Digital Interface）がこれまで広く利用されている。 Many existing broadcast stations have dedicated networks for transmitting broadcast material such as video, audio and ancillary data within the station. Various devices are connected to the dedicated network, such as capture devices such as cameras and microphones, storage servers that accumulate content data, playback devices that play back content, and gateway devices that convert the format of transmitted signals. . SDI (Serial Digital Interface) has hitherto been widely used as a signal format for transmission of broadcast material on dedicated networks.

しかし、近年のＩＰ（Internet Protocol）技術の目覚ましい性能の向上の結果、放送事業者は、より汎用性の高いＩＰ技術を活用することにメリットを見出し、局内のネットワークをＩＰネットワークへ更新する取り組みを開始した。ＩＰベースのネットワークアーキテクチャを採用すれば、例えば、汎用のルータ及びスイッチといったネットワーク装置を用いて高速かつ大容量のネットワークを低コストで構築することが可能となる。 However, as a result of the remarkable improvement in performance of IP (Internet Protocol) technology in recent years, broadcasters have discovered the merits of using IP technology with greater versatility, and are making efforts to upgrade their intra-office networks to IP networks. started. Adopting an IP-based network architecture makes it possible to build a high-speed, large-capacity network at low cost using network devices such as general-purpose routers and switches.

放送素材をＩＰ技術を活用して伝送するためのプロトコルの標準化も、現在進められている。例えば、ＳＭＰＴＥ（Society of Motion Picture and Television Engineers）は、映像、音声及び補助データの混成であるＳＤＩイメージをＩＰパケットで伝送するためのプロトコルであるＳＭＰＴＥＳＴ２０２２－６を規格化済みである（非特許文献１参照）。さらに、ＳＭＰＴＥは、映像、音声及び補助データを別々のストリームで伝送するためのプロトコル群であるＳＭＰＴＥＳＴ２１１０シリーズの規格化も進めている（非特許文献２参照）。中でも、ＳＭＰＴＥＳＴ２１１０－１０によれば、放送局システムに参加するノードがＰＴＰ（Precision Time Protocol）の仕組みに基づいて相互に高い精度で同期する。この高精度の同期によって、ＩＰネットワーク上で異なる経路に沿って伝送される異なるストリームを受信側で適切に時間合わせすることが可能となる。 Standardization of protocols for transmitting broadcast materials using IP technology is also currently underway. For example, SMPTE (Society of Motion Picture and Television Engineers) has standardized SMPTE ST2022-6, which is a protocol for transmitting SDI images, which are a mixture of video, audio, and ancillary data, in IP packets (non-patent Reference 1). Furthermore, SMPTE is also standardizing the SMPTE ST2110 series, which is a set of protocols for transmitting video, audio, and auxiliary data in separate streams (see Non-Patent Document 2). Among others, according to SMPTE ST2110-10, nodes participating in a broadcasting station system synchronize with each other with high accuracy based on the mechanism of PTP (Precision Time Protocol). This high-precision synchronization allows the receiver to properly time-align different streams transmitted along different paths over an IP network.

日本国内の標準化団体であるＡＲＩＢ（Association of Radio Industries and Businesses）は、映像、音声及び補助データを単一のストリームで伝送するためのデータ構造の規定として、ＡＲＩＢＳＴＤ－Ｂ７３を規格化済みである（非特許文献３参照）。 ARIB (Association of Radio Industries and Businesses), a standardization body in Japan, has standardized ARIB STD-B73 as a data structure specification for transmitting video, audio and auxiliary data in a single stream. (See Non-Patent Document 3).

さらに、多様な装置の間の相互運用性を確保するために、ＡＭＷＡ（Advanced Media Workflow Association）は、装置間のストリームの伝送を管理し及び制御するための制御インタフェース規格の集合であるＮＭＯＳ（Networked Media Open Specifications）規格の策定を進めている。例えば、ＮＭＯＳＩＳ－０４は、ネットワークリソースの発見及び登録（Discovery and Registration）のための制御インタフェース規格である（非特許文献４参照）。ＮＭＯＳＩＳ－０５は、デバイスの接続管理（Device Connection Management）のための制御インタフェース規格である（非特許文献５参照）。ＮＭＯＳ規格が利用される場合、送信ノードは、自ノードにより送信可能な放送信号ストリームの属性を記述するＳＤＰ（Session Description Protocol）オブジェクトを制御ノードへ提供し、制御ノードは、そのＳＤＰオブジェクトの記述に基づいてストリームの伝送をセットアップする。 Furthermore, in order to ensure interoperability between various devices, AMWA (Advanced Media Workflow Association) has established NMOS (Networked Streaming Interface), a set of control interface standards for managing and controlling stream transmission between devices. Media Open Specifications) standards are being formulated. For example, NMOS IS-04 is a control interface standard for discovery and registration of network resources (see Non-Patent Document 4). NMOS IS-05 is a control interface standard for device connection management (see Non-Patent Document 5). When the NMOS standard is used, the transmitting node provides the control node with an SDP (Session Description Protocol) object describing the attributes of a broadcast signal stream that can be transmitted by the own node, and the control node uses the description of the SDP object. set up the transmission of the stream based on

特許文献１及び２は、放送コンテンツのストリーミングの際に利用され得るＳＤＰオブジェクトの例を開示している。 Patent Documents 1 and 2 disclose examples of SDP objects that can be used when streaming broadcast content.

Thomas Edwards， “SMPTE ST 2022: Moving Serial Interfaces (ASI & SDI) to IP”，［online］， 2017年8月17日， SMPTE Standards Webcast Series，［平成31年2月15日検索］，インターネット＜URL：https://www.smpte.org/sites/default/files/2017-08-17-ST-2022-Edwards-V4-Handout.pdf＞Thomas Edwards, "SMPTE ST 2022: Moving Serial Interfaces (ASI & SDI) to IP", [online], August 17, 2017, SMPTE Standards Webcast Series, [searched on February 15, 2019], Internet < URL : https://www.smpte.org/sites/default/files/2017-08-17-ST-2022-Edwards-V4-Handout.pdf> SMPTE， “SMPTE ST 2110 FAQ”，［online］，［平成31年2月15日検索］，インターネット＜URL：https://www.smpte.org/st-2110＞SMPTE, “SMPTE ST 2110 FAQ”, [online], [searched on February 15, 2019], Internet <URL: https://www.smpte.org/st-2110> ARIB， “制作用ＩＰインタフェースにおけるエッセンス独立単一ストリームのＲＴＰデータグラムのデータ構造標準規格（ARIB STD-B73 1.0版）”，［online］， 2018年7月26日，［平成31年2月15日］，インターネット＜URL：https://www.arib.or.jp/kikaku/kikaku_hoso/desc/std-b73.html＞ARIB, “Data Structure Standard for Essence-Independent Single Stream RTP Datagram in IP Interface for Production (ARIB STD-B73 Version 1.0)”, [online], July 26, 2018, [February 15, 2019 Japan], Internet <URL: https://www.arib.or.jp/kikaku/kikaku_hoso/desc/std-b73.html> AMWA， “AMWA IS-04 NMOS Discovery and Registration Specification (Stable)”，［online］，［平成31年2月15日］，インターネット＜URL：https://amwa-tv.github.io/nmos-discovery-registration/＞AMWA, “AMWA IS-04 NMOS Discovery and Registration Specification (Stable)”, [online], [February 15, 2019], Internet <URL: https://amwa-tv.github.io/nmos-discovery -registration/> AMWA， “AMWA IS-05 NMOS Device Connection Management Specification”，［online］，［平成31年2月15日］，インターネット＜URL：https://amwa-tv.github.io/nmos-device-connection-management/＞AMWA, “AMWA IS-05 NMOS Device Connection Management Specification”, [online], [February 15, 2019], Internet <URL: https://amwa-tv.github.io/nmos-device-connection- management/＞

特開２０１５－７３１９７号公報JP 2015-73197 A 特表２００７－５３１３６８号公報Japanese Patent Publication No. 2007-531368

相互に関連する映像、音声及び補助データをＳＭＰＴＥＳＴ２１１０のように別々のストリームで伝送すると、トランスポートレイヤでそれらストリームのポート番号が相違することから、受信側でもとのコンテンツを再構築する処理が複雑化する。ポート番号が相違すれば、ＩＰネットワーク上での経路制御の結果として、個々のストリームが辿る伝送経路の違いからストリームごとに異なる遅延を受ける可能性もある。これに対し、ＡＲＩＢＳＴＤ－Ｂ７３の映像、音声及び補助データのストリームは、トランスポートレイヤで同一のポート番号を有するために、上述したＳＭＰＴＥＳＴ２１１０に固有の問題点を有しない。しかし、ＡＲＩＢＳＴＤ－Ｂ７３は、放送局システムに参加するノード間でどのように協調的に動作してストリームを処理すべきかを規定していない。 When video, audio and ancillary data that are related to each other are transmitted in separate streams like SMPTE ST2110, the port numbers of these streams are different in the transport layer, so processing to reconstruct the original contents on the receiving side is required. Complicated. If the port numbers are different, each stream may receive a different delay due to the difference in the transmission path followed by each stream as a result of path control on the IP network. On the other hand, ARIB STD-B73 video, audio and auxiliary data streams have the same port number in the transport layer, so they do not have the above-mentioned problem specific to SMPTEST ST2110. However, ARIB STD-B73 does not specify how the nodes participating in the broadcasting station system should operate cooperatively to process streams.

例えば、ストリームの伝送に関与するノード間で高精度の同期が確立されておらず、パケット間の時間合わせのために使用すべき時刻情報用のフィールドに合意が無ければ、放送局システム内でやり取りされる多様なエッセンスを柔軟に組み合わせて放送コンテンツを構成することができない。 For example, if high-precision synchronization is not established between nodes involved in stream transmission, and if there is no consensus on the field for time information that should be used for time alignment between packets, communication within the broadcasting station system Broadcast contents cannot be configured by flexibly combining various essences.

また、ＳＭＰＴＥＳＴ２１１０ストリームのために通常利用されるＳＤＰオブジェクトのフォーマットは、フォーマット構造においても、記述される情報の内容においても、ＡＲＩＢＳＴＤ－Ｂ７３ストリームの特性に必ずしも適していない。 Also, the format of the SDP objects normally used for SMPTE ST2110 streams is not necessarily suitable for the characteristics of ARIB STD-B73 streams, either in format structure or in the information content described.

本開示に係る技術は、上述した課題のうちの少なくとも１つを解決することを目的とする。 An object of the technology according to the present disclosure is to solve at least one of the above-described problems.

ある観点によれば、異なるタイプのエッセンスデータを単一のポート番号で伝送する放送信号ストリームを、放送局のＩＰネットワークへ送信する送信部と、上記放送信号ストリームの属性を記述するＳＤＰ（Session Description Protocol）オブジェクトを他のノードへ提供する制御部と、を備え、上記ＳＤＰオブジェクトは、映像エッセンスデータが圧縮されるかを示す圧縮関連情報、音声エッセンスデータに関連する音声チャンネル数情報、及びエッセンスデータへ適用される誤り訂正方式を示す誤り訂正情報、のうちの１つ以上を、上記放送信号ストリームのフォーマット固有のパラメータを記述する属性フィールド内に含む、送信ノードが提供される。 According to one aspect, a transmitter for transmitting a broadcast signal stream that transmits different types of essence data with a single port number to an IP network of a broadcast station, and an SDP (Session Description) that describes attributes of the broadcast signal stream a control unit for providing a protocol object to other nodes, wherein the SDP object includes compression-related information indicating whether video essence data is compressed, audio channel number information related to audio essence data, and essence data. error correction information indicating an error correction scheme to be applied to the broadcast signal stream in attribute fields describing format-specific parameters of said broadcast signal stream.

また別の観点によれば、上記送信ノードと、上記ＳＤＰオブジェクトの記述に従ってセットアップされる上記放送信号ストリームを受信する受信ノードと、を含む放送局システムが提供される。 According to yet another aspect, there is provided a broadcast station system including said transmitting node and a receiving node for receiving said broadcast signal stream set up according to said SDP object description.

また別の観点によれば、放送局のＩＰネットワークにおける、異なるタイプのエッセンスデータを単一のポート番号で伝送する放送信号ストリームの送信ノードから受信ノードへの送信を制御する制御ノードであって、上記放送信号ストリームの属性を記述するＳＤＰ（Session Description Protocol）オブジェクトを上記送信ノードから取得する制御部、を備え、上記ＳＤＰオブジェクトは、映像エッセンスデータが圧縮されるかを示す圧縮関連情報、音声エッセンスデータに関連する音声チャンネル数情報、及びエッセンスデータへ適用される誤り訂正方式を示す誤り訂正情報、のうちの１つ以上を、上記放送信号ストリームのフォーマット固有のパラメータを記述する属性フィールド内に含む、制御ノードが提供される。 According to another aspect, a control node for controlling transmission from a transmitting node to a receiving node of a broadcast signal stream carrying different types of essence data on a single port number in an IP network of a broadcast station, comprising: a control unit that acquires an SDP (Session Description Protocol) object describing attributes of the broadcast signal stream from the transmission node, the SDP object including compression-related information indicating whether video essence data is to be compressed, audio essence including one or more of audio channel number information associated with the data and error correction information indicating an error correction scheme applied to the essence data in an attribute field describing format specific parameters of the broadcast signal stream. , a control node is provided.

また別の観点によれば、異なるタイプのエッセンスデータを単一のポート番号で伝送する放送信号ストリームの属性を記述するＳＤＰ（Session Description Protocol）オブジェクトを他のノードへ提供することと、上記放送信号ストリームを、放送局のＩＰネットワークへ送信することと、を含み、上記ＳＤＰオブジェクトは、映像エッセンスデータが圧縮されるかを示す圧縮関連情報、音声エッセンスデータに関連する音声チャンネル数情報、及びエッセンスデータへ適用される誤り訂正方式を示す誤り訂正情報、のうちの１つ以上を、上記放送信号ストリームのフォーマット固有のパラメータを記述する属性フィールド内に含む、送信制御方法が提供される。当該送信制御方法をプロセッサに実行させるコンピュータプログラムが提供されてもよい。上記コンピュータプログラムを記憶した非一時的なコンピュータ読取可能な記憶媒体が提供されてもよい。 According to another aspect, providing other nodes with an SDP (Session Description Protocol) object describing attributes of a broadcast signal stream that transmits different types of essence data with a single port number; sending the stream to the broadcast station's IP network, wherein the SDP object includes compression related information indicating whether video essence data is compressed, audio channel number information related to audio essence data, and essence data. error correction information indicating an error correction scheme to be applied to the broadcast signal stream in an attribute field describing format-specific parameters of said broadcast signal stream. A computer program may be provided that causes a processor to execute the transmission control method. A non-transitory computer-readable storage medium storing the computer program may be provided.

また別の観点によれば、放送局のＩＰネットワークにおける、異なるタイプのエッセンスデータを単一のポート番号で伝送する放送信号ストリームの送信ノードから受信ノードへの送信を制御するための送信制御方法であって、上記放送信号ストリームの属性を記述するＳＤＰ（Session Description Protocol）オブジェクトを上記送信ノードから取得すること、を含み、上記ＳＤＰオブジェクトは、映像エッセンスデータが圧縮されるかを示す圧縮関連情報、音声エッセンスデータに関連する音声チャンネル数情報、及びエッセンスデータへ適用される誤り訂正方式を示す誤り訂正情報、のうちの１つ以上を、上記放送信号ストリームのフォーマット固有のパラメータを記述する属性フィールド内に含む、送信制御方法が提供される。当該送信制御方法をプロセッサに実行させるコンピュータプログラムが提供されてもよい。上記コンピュータプログラムを記憶した非一時的なコンピュータ読取可能な記憶媒体が提供されてもよい。 According to another aspect, a transmission control method for controlling transmission from a transmission node to a reception node of a broadcast signal stream that transmits different types of essence data with a single port number in an IP network of a broadcast station obtaining from the transmitting node an SDP (Session Description Protocol) object describing attributes of the broadcast signal stream, wherein the SDP object is compression-related information indicating whether video essence data is to be compressed; One or more of audio channel number information associated with the audio essence data and error correction information indicating an error correction method applied to the essence data in the attribute field describing the format-specific parameters of the broadcast signal stream. A transmission control method is provided, comprising: A computer program may be provided that causes a processor to execute the transmission control method. A non-transitory computer-readable storage medium storing the computer program may be provided.

本開示に係る技術によれば、放送局のＩＰネットワーク上で映像データ、音声データ又は補助データといったエッセンスデータを単一のストリームで伝送するプロトコルを利用する際に、当該ストリームの伝送を適切にセットアップすることが可能となる。なお、本開示に係る技術により、当該効果の代わりに、又は当該効果とともに、他の効果が奏されてもよい。 According to the technology according to the present disclosure, when using a protocol for transmitting essence data such as video data, audio data, or auxiliary data in a single stream on an IP network of a broadcasting station, the transmission of the stream is appropriately set up. It becomes possible to It should be noted that the technology according to the present disclosure may produce other effects instead of or in addition to the above effects.

本開示の実施形態に係る放送局システムの構成の一例を示す概略図である。1 is a schematic diagram showing an example of a configuration of a broadcasting station system according to an embodiment of the present disclosure; FIG. 本開示の実施形態に係る放送局システムのＩＰドメインの論理的な構成の一例について説明するための説明図である。FIG. 2 is an explanatory diagram for describing an example of a logical configuration of IP domains of a broadcasting station system according to an embodiment of the present disclosure; FIG. ＳＭＰＴＥＳＴ２１１０－１０により規定されたシステムタイミングモデルについて説明するための説明図である。1 is an explanatory diagram for explaining a system timing model defined by SMPTE ST2110-10; FIG. 図３を用いて説明したシステムタイミングモデルに基づく映像エッセンスと音声エッセンスとの間の時間合わせについて説明するための説明図である。FIG. 4 is an explanatory diagram for explaining time alignment between a video essence and an audio essence based on the system timing model described with reference to FIG. 3; ＡＲＩＢＳＴＤ－Ｂ７３におけるデータグラムの構成について説明するための説明図である。FIG. 4 is an explanatory diagram for explaining the structure of a datagram in ARIB STD-B73; ＡＲＩＢＳＴＤ－Ｂ７３に従った単一ストリームでの映像エッセンス及び音声エッセンスの伝送について説明するための説明図である。FIG. 4 is an explanatory diagram for explaining transmission of video essence and audio essence in a single stream according to ARIB STD-B73; 第１の実施形態に係る放送信号処理ノードの構成の一例を示すブロック図である。2 is a block diagram showing an example of the configuration of a broadcast signal processing node according to the first embodiment; FIG. 図７に示した受信ストリーム処理部の詳細な構成の一例を示すブロック図である。8 is a block diagram showing an example of a detailed configuration of a reception stream processing unit shown in FIG. 7; FIG. ２つのＲＴＰパケットの間のエッセンスデータの時間合わせのための手法の第１の例について説明するための説明図である。FIG. 4 is an explanatory diagram for describing a first example of a technique for time alignment of essence data between two RTP packets; ２つのＲＴＰパケットの間のエッセンスデータの時間合わせのための手法の第２の例について説明するための説明図である。FIG. 10 is an explanatory diagram for explaining a second example of a technique for time alignment of essence data between two RTP packets; ２つのＲＴＰパケットの間のエッセンスデータの時間合わせのための手法の第３の例について説明するための説明図である。FIG. 11 is an explanatory diagram for explaining a third example of a technique for time alignment of essence data between two RTP packets; ２つのＲＴＰパケットの間のエッセンスデータの時間合わせのための手法の第４の例について説明するための説明図である。FIG. 11 is an explanatory diagram for explaining a fourth example of a technique for time alignment of essence data between two RTP packets; 第１の実施形態に係るストリーム送信処理の流れの一例を示すフローチャートである。6 is a flow chart showing an example of the flow of stream transmission processing according to the first embodiment; 第１の実施形態に係るストリーム受信処理の流れの一例を示すフローチャートである。6 is a flow chart showing an example of the flow of stream reception processing according to the first embodiment; 第２の実施形態に係る放送信号処理ノードの構成の一例を示すブロック図である。FIG. 9 is a block diagram showing an example of the configuration of a broadcast signal processing node according to the second embodiment; FIG. 第２の実施形態の第１の変形例に係る放送信号処理ノードの構成の一例を示すブロック図である。FIG. 11 is a block diagram showing an example of the configuration of a broadcast signal processing node according to the first modified example of the second embodiment; FIG. 第２の実施形態の第２の変形例に係る放送信号処理ノードの構成の一例を示すブロック図である。FIG. 12 is a block diagram showing an example of the configuration of a broadcast signal processing node according to a second modified example of the second embodiment; FIG. エッセンス分離型ストリーム用のＳＤＰオブジェクトの典型的なフォーマット構造について説明するための説明図である。FIG. 4 is an explanatory diagram for describing a typical format structure of an SDP object for an essence-separated stream; 図１５を用いて説明したフォーマット構造を有する、ＳＭＰＴＥＳＴ２１１０ストリームの属性を記述したＳＤＰオブジェクトの一例を示す説明図である。FIG. 16 is an explanatory diagram showing an example of an SDP object describing attributes of an SMPTEST ST2110 stream having the format structure explained with reference to FIG. 15; 第３の実施形態においてエッセンス混在型ストリームのために定義されるＳＤＰオブジェクトのフォーマット構造について説明するための説明図である。FIG. 11 is an explanatory diagram for explaining the format structure of an SDP object defined for an essence-mixed stream in the third embodiment; 図１７を用いて説明したフォーマット構造を有する、ＡＲＩＢＳＴＤ－Ｂ７３ストリームの属性を記述したＳＤＰオブジェクトの一例を示す説明図である。FIG. 18 is an explanatory diagram showing an example of an SDP object describing attributes of an ARIB STD-B73 stream having the format structure explained with reference to FIG. 17; 第３の実施形態に係る放送局システムの概略的な構成の一例を示すブロック図である。FIG. 11 is a block diagram showing an example of a schematic configuration of a broadcasting station system according to a third embodiment; FIG. 第３の実施形態に係る送信ノードの構成の一例を示すブロック図である。FIG. 12 is a block diagram showing an example of the configuration of a transmission node according to the third embodiment; FIG. 図２０に示した送信ノードにより実行され得る送信制御処理の流れの第１の例を示すフローチャートである。21 is a flow chart showing a first example of the flow of transmission control processing that can be executed by the transmission node shown in FIG. 20; 図２０に示した送信ノードにより実行され得る送信制御処理の流れの第２の例を示すフローチャートである。21 is a flow chart showing a second example of the flow of transmission control processing that can be executed by the transmission node shown in FIG. 20; 第３の実施形態に係る制御ノードの構成の一例を示すブロック図である。FIG. 12 is a block diagram showing an example of the configuration of a control node according to the third embodiment; FIG. 図２３に示した制御ノードにより実行され得る送信制御処理の流れの第１の例を示すフローチャートである。FIG. 24 is a flow chart showing a first example of the flow of transmission control processing that can be executed by the control node shown in FIG. 23; FIG. 図２３に示した制御ノードにより実行され得る送信制御処理の流れの第２の例を示すフローチャートである。24 is a flow chart showing a second example of the flow of transmission control processing that can be executed by the control node shown in FIG. 23; 第４の実施形態に係る送信ノードの構成の一例を示すブロック図である。FIG. 12 is a block diagram showing an example of the configuration of a transmission node according to the fourth embodiment; FIG. 第４の実施形態に係る制御ノードの構成の一例を示すブロック図である。FIG. 14 is a block diagram showing an example of the configuration of a control node according to the fourth embodiment; FIG.

以下、添付の図面を参照して本開示に係る技術の実施形態を詳細に説明する。なお、本明細書及び図面において、同様に説明されることが可能な要素については、同一の符号を付することにより重複説明が省略され得る。 Hereinafter, embodiments of the technology according to the present disclosure will be described in detail with reference to the accompanying drawings. In addition, in the present specification and drawings, elements that can be described in the same manner can be omitted from redundant description by assigning the same reference numerals.

説明は、以下の順序で行われる。
１．概要
２．第１の実施形態
２－１．放送信号処理ノードの構成例
２－２．受信ストリーム処理部の詳細な構成例
２－３．処理の流れ
３．第２の実施形態
３－１．放送信号処理ノードの構成例
３－２．変形例
４．第１の実施形態及び第２の実施形態のまとめ
５．第３の実施形態
５－１．既存のＳＤＰオブジェクトの例
５－２．ＳＤＰオブジェクトの新たなフォーマット
５－３．放送局システムの構成例
５－４．送信ノードの構成例
５－５．制御ノードの構成例
６．第４の実施形態
６－１．送信ノードの構成例
６－２．制御ノードの構成例
７．第３の実施形態及び第４の実施形態のまとめ The description is given in the following order.
1. Overview 2. First Embodiment 2-1. Configuration example of broadcast signal processing node 2-2. Detailed configuration example of reception stream processing unit 2-3. Flow of processing3. Second Embodiment 3-1. Configuration example of broadcast signal processing node 3-2. Modification 4. Summary of the first embodiment and the second embodiment5. Third Embodiment 5-1. Example of existing SDP object 5-2. New format of SDP object 5-3. Configuration example of broadcasting station system 5-4. Configuration example of transmission node 5-5. Configuration example of control node6. Fourth Embodiment 6-1. Configuration example of transmission node 6-2. Configuration example of control node7. Summary of the third embodiment and the fourth embodiment

＜＜１．概要＞＞
まず、図１を用いて、本開示のいくつかの実施形態が適用され得る放送局システムの概要について説明する。図１は、本開示の実施形態に係る放送局システム１の構成の一例を示す概略図である。図１を参照すると、放送局システム１は、１つ以上のネットワーク装置１２、カメラ２０ａ、モニタ２０ｂ、ＩＰゲートウェイ２０ｃ、ＩＰゲートウェイ２０ｄ、カメラ２２、マイクロフォン２４、データサーバ２６、統合プレイアウト（Integrated Playout）３２、モニタ３４、ＡＰＳ（Automatic Program control System）４０及び制御端末５０を含む。ネットワーク装置１２、カメラ２０ａ、モニタ２０ｂ、ＩＰゲートウェイ２０ｃ、ＩＰゲートウェイ２０ｄ、ＡＰＳ４０、及び制御端末５０は、ＩＰドメイン１０に属する。 <<1. Overview＞＞
First, using FIG. 1, an outline of a broadcasting station system to which some embodiments of the present disclosure can be applied will be described. FIG. 1 is a schematic diagram showing an example configuration of a broadcasting station system 1 according to an embodiment of the present disclosure. Referring to FIG. 1, the broadcasting station system 1 includes one or more network devices 12, a camera 20a, a monitor 20b, an IP gateway 20c, an IP gateway 20d, a camera 22, a microphone 24, a data server 26, an integrated playout ) 32 , a monitor 34 , an APS (Automatic Program Control System) 40 and a control terminal 50 . Network device 12 , camera 20 a , monitor 20 b , IP gateway 20 c , IP gateway 20 d , APS 40 and control terminal 50 belong to IP domain 10 .

（１）様々な装置の説明
ネットワーク装置１２は、ＩＰネットワークにおけるストリームの転送を担当する装置である。ネットワーク装置１２の各々は、例えばルータ、スイッチ、ブリッジ又はリピータなど、いかなる種類のネットワーク装置であってもよい。ネットワーク装置１２の各々は、低コストで導入可能な汎用品（ＣＯＴＳ（Commercial Off-The-Shelf）ともいう）であってもよい。図１には６つのネットワーク装置１２が示されているが、かかる例に限定されず、放送局システム１はいくつのネットワーク装置１２を含んでもよい。ＩＰドメイン１０は、単一のネットワークで構成されてもよく、又は複数のサブネットワークを含んでもよい。 (1) Description of Various Devices The network device 12 is a device in charge of transferring streams in the IP network. Each of network devices 12 may be any type of network device, such as a router, switch, bridge or repeater. Each of the network devices 12 may be a general-purpose product (also called COTS (Commercial Off-The-Shelf)) that can be introduced at low cost. Although six network devices 12 are shown in FIG. 1 , the broadcasting station system 1 may include any number of network devices 12 without being limited to such an example. IP domain 10 may consist of a single network or may include multiple sub-networks.

カメラ２０ａは、放送素材を生成するキャプチャデバイスの一種である。例えば、カメラ２０ａは、何らかの対象を撮影して、映像データを生成する。カメラ２０ａは、内蔵されるマイクロフォンを通じて音声を取得して、音声データを生成してもよい。カメラ２０ａは、ＩＰドメイン１０に属し、映像データ及び音声データのデータストリームを一連のＩＰパケットへパケット化してＩＰネットワークへ送信することができる。 Camera 20a is a type of capture device that generates broadcast material. For example, the camera 20a shoots some object and generates video data. The camera 20a may acquire audio through a built-in microphone and generate audio data. The camera 20a belongs to the IP domain 10 and is capable of packetizing a data stream of video and audio data into a series of IP packets for transmission over the IP network.

モニタ２０ｂは、放送素材を受信してコンテンツを再生する再生デバイスの一種である。例えば、モニタ２０ｂは、映像データを受信して映像を再生する。モニタ２０ｂは、音声データを受信して音声を再生してもよい。モニタ２０ｂは、追加的に伝送される補助データを受信して、補助データを処理（例えば、字幕を再生）してもよい。モニタ２０ｂは、コンテンツを編集するための編集機能をユーザへ提供してもよい。モニタ２０ｂは、ＩＰドメイン１０へ属し、ＩＰネットワーク上で転送されて来る一連のＩＰパケットを受信することができる。 The monitor 20b is a type of reproduction device that receives broadcast material and reproduces content. For example, the monitor 20b receives video data and reproduces the video. The monitor 20b may receive audio data and reproduce audio. The monitor 20b may receive additionally transmitted auxiliary data and process the auxiliary data (eg, play closed captions). The monitor 20b may provide the user with editing functions for editing content. The monitor 20b belongs to the IP domain 10 and can receive a series of IP packets transferred over the IP network.

ＩＰゲートウェイ２０ｃは、ＩＰドメイン１０と他の信号ドメインとの境界に位置するゲートウェイデバイスである。ＩＰゲートウェイ２０ｃは、１つ又は複数のネットワーク装置１２へ接続する。図１の例において、ＩＰゲートウェイ２０ｃには、カメラ２２、マイクロフォン２４及びデータサーバ２６がさらに接続されている。例えば、ＩＰゲートウェイ２０ｃは、映像データを搬送するＳＤＩ信号をカメラ２２から受信し得る。また、ＩＰゲートウェイ２０ｃは、音声データを搬送するＳＤＩ信号をマイクロフォン２４から受信し得る。また、ＩＰゲートウェイ２０ｃは、映像データ、音声データ及び補助データのうちの１つ以上を搬送するＳＤＩ信号をデータサーバ２６から受信し得る。なお、ＩＰドメイン１０の外部で伝送される信号の信号形式は、例えばＳＤ－ＳＤＩ、ＨＤ－ＳＤＩ、３Ｇ－ＳＤＩ、６Ｇ－ＳＤＩ若しくは１２Ｇ－ＳＤＩといったＳＤＩの任意の派生であってもよく、又は、ＳＤＩ以外の信号形式であってもよい。ＩＰゲートウェイ２０ｃは、上述したようにカメラ２２、マイクロフォン２４及びデータサーバ２６から受信される放送素材を搬送する信号を、必要に応じて多重化し又は逆多重化した後、一連のＩＰパケットへパケット化してＩＰネットワークへ送信することができる。 IP gateway 20c is a gateway device located at the boundary between IP domain 10 and other signaling domains. IP gateway 20 c connects to one or more network devices 12 . In the example of FIG. 1, a camera 22, a microphone 24 and a data server 26 are also connected to the IP gateway 20c. For example, IP gateway 20c may receive SDI signals carrying video data from camera 22 . IP gateway 20 c may also receive SDI signals carrying audio data from microphone 24 . IP gateway 20c may also receive SDI signals from data server 26 that carry one or more of video data, audio data and ancillary data. It should be noted that the signal format of signals transmitted outside the IP domain 10 may be any derivative of SDI, such as SD-SDI, HD-SDI, 3G-SDI, 6G-SDI or 12G-SDI, or , SDI may be used. IP gateway 20c packetizes signals carrying broadcast material received from cameras 22, microphones 24 and data server 26 as described above, into a series of IP packets after multiplexing or demultiplexing as necessary. can be sent to the IP network via

ＩＰゲートウェイ２０ｄもまた、ＩＰドメイン１０と他の信号ドメインとの境界に位置するゲートウェイデバイスである。ＩＰゲートウェイ２０ｄは、１つ又は複数のネットワーク装置１２へ接続する。図１の例では、ＩＰゲートウェイ２０ｄには、統合プレイアウト３２及びモニタ３４がさらに接続されている。例えば、ＩＰゲートウェイ２０ｄは、ＩＰネットワーク上で転送されて来るＩＰパケットを受信し、それらＩＰパケットをＳＤＩ信号（又は他の信号形式の信号）へ変換して、統合プレイアウト３２及びモニタ３４の一方又は双方へ送信することができる。なお、当然ながら、ＩＰゲートウェイ２０ｃがＩＰゲートウェイ２０ｄと同様にＩＰパケットをＳＤＩ信号へ変換する機能を有していてもよい。また、ＩＰゲートウェイ２０ｄがＩＰゲートウェイ２０ｃと同様にＳＤＩ信号をＩＰパケットへ変換する機能を有していてもよい。 IP gateway 20d is also a gateway device located at the boundary between IP domain 10 and other signaling domains. IP gateway 20 d connects to one or more network devices 12 . In the example of FIG. 1, an integrated playout 32 and a monitor 34 are further connected to the IP gateway 20d. For example, IP gateway 20d receives IP packets transferred over an IP network, converts the IP packets to SDI signals (or signals in other signal formats), and sends them to one of integrated playout 32 and monitor 34. Or you can send to both. Of course, the IP gateway 20c may have the function of converting IP packets into SDI signals, like the IP gateway 20d. Also, the IP gateway 20d may have a function of converting an SDI signal into an IP packet like the IP gateway 20c.

ＡＰＳ４０は、予め決定されるスケジュールに従って、テレビジョン番組の放送を進行させるシステムである。例えば、ＡＰＳ４０は、ＩＰネットワークへ制御メッセージを送信して、所定の時刻に所与の送信元（例えば、カメラ及びマイクロフォン、又はデータサーバ）から出力されるデータストリームを統合プレイアウト３２へ伝送させる。データストリームを受信した統合プレイアウト３２は、放送局の回線を通じてアンテナへテレビジョン番組の放送信号を送出する。データストリームは、例えばモニタ２０ｂ又はモニタ３４にも配信され、放送局内でも放送コンテンツが再生され得る。 The APS 40 is a system that advances broadcasting of television programs according to a predetermined schedule. For example, APS 40 sends control messages to the IP network to cause transmission of data streams output from given sources (eg, cameras and microphones, or data servers) at predetermined times to integrated playout 32 . The integrated playout 32, which has received the data stream, transmits a broadcast signal of the television program to the antenna through the line of the broadcasting station. The data stream may also be delivered to monitor 20b or monitor 34, for example, to reproduce the broadcast content within the broadcast station.

制御端末５０は、放送局システム１に含まれるノードの管理及び制御に関連するユーザインタフェースをユーザへ提供する端末装置である。制御端末５０は、放送局システム１に専用のユーザ端末であってもよく、又はＰＣ（Personal Computer）若しくはスマートフォンといった汎用的な端末であってもよい。制御端末５０は、例えば、放送局内のネットワーク上でのストリームの伝送に関連する設定情報を、ユーザインタフェースを介して取得してシステム内のデータベースへ登録する。また、制御端末５０は、例えば、ユーザインタフェースを介して所与のノード間のストリームの伝送を求めるリクエストを受け付ける。 The control terminal 50 is a terminal device that provides a user with a user interface related to management and control of nodes included in the broadcasting station system 1 . The control terminal 50 may be a user terminal dedicated to the broadcasting station system 1, or may be a general-purpose terminal such as a PC (Personal Computer) or a smart phone. The control terminal 50 acquires, for example, setting information related to stream transmission on the network within the broadcasting station via the user interface and registers it in the database within the system. The control terminal 50 also accepts requests for transmission of streams between given nodes via, for example, a user interface.

（２）ＩＰマルチキャスト
放送局システム１のＩＰドメイン１０内の放送信号ストリームの伝送は、典型的には、マルチキャストで行われる。マルチキャストされるパケットの送信元ＩＰアドレスは送信ノードのＩＰアドレスであり、宛て先ＩＰアドレスは特定のアドレス範囲に属するマルチキャストアドレスである。個々のマルチキャストアドレスを宛て先とするマルチキャストパケットを受信するノードの集合をマルチキャストグループといい、マルチキャストアドレスをグループアドレスともいう。あるストリームを受信することを意図する受信ノードは、そのストリームに対応するマルチキャストグループへの加入（join）を通知するメッセージ（例えば、ＩＧＭＰ（Internet Group Management Protocol）ＪＯＩＮ）を近傍のルータへ送信する。すると、ルータ間でマルチキャストツリーを更新するためのメッセージ交換が行われ、特定の送信ノードから送信されるストリームがＩＰネットワークを介して受信ノードへ配信されるようになる。受信ノードは、マルチキャストストリームの受信を終了する際には、マルチキャストグループからの離脱（leave）を通知するメッセージを近傍のルータへ送信する。なお、上述した例に限定されず、本開示に係る技術は、ストリームがユニキャストで伝送されるケースにも適用可能である。 (2) IP Multicast Transmission of broadcast signal streams within the IP domain 10 of the broadcasting station system 1 is typically performed by multicast. The source IP address of the multicast packet is the IP address of the sending node, and the destination IP address is a multicast address belonging to a specific address range. A set of nodes that receive multicast packets destined for individual multicast addresses is called a multicast group, and a multicast address is also called a group address. A receiving node that intends to receive a stream sends a message (eg, Internet Group Management Protocol (IGMP) JOIN) to its neighboring routers to join the multicast group corresponding to that stream. Messages are then exchanged between the routers to update the multicast tree, and a stream transmitted from a specific sending node is delivered to the receiving node via the IP network. When the receiving node finishes receiving the multicast stream, the receiving node sends a message notifying of leaving the multicast group to the neighboring routers. It should be noted that the technology according to the present disclosure is not limited to the example described above, and can also be applied to cases where streams are transmitted by unicast.

（３）ＩＰドメインの論理的構成例
図２は、図１に示した放送局システム１のＩＰドメイン１０の論理的な構成の一例を示している（簡明さのために、ここではＡＰＳ４０及び制御端末５０は省略されている）。図２を参照すると、カメラ２０ａに相当する第１ノードは、センダ（sender）６０ａを含む。「センダ」とは、ストリームを送信する能力を有する機能エンティティである。ＩＰゲートウェイ２０ｃに相当する第２ノードは、センダ６０ｂ、６０ｃ及び６０ｄを含む。センダ６０ｂ、６０ｃ及び６０ｄは、ＩＰゲートウェイ２０ｃにより収容される個々のストリームの送信元の装置に相当し得る。モニタ２０ｂに相当する第３ノードは、レシーバ６５ａを含む。「レシーバ」とは、ストリームを受信する能力を有する機能エンティティである。ＩＰゲートウェイ２０ｄに相当する第４ノードは、レシーバ６５ｂ及び６５ｃを含む。レシーバ６５ｂ及び６５ｃは、ＩＰゲートウェイ２０ｄにより収容される個々のストリームの受信先の装置に相当し得る。 (3) Logical Configuration Example of IP Domain FIG. 2 shows an example of a logical configuration of the IP domain 10 of the broadcasting station system 1 shown in FIG. terminal 50 is omitted). Referring to FIG. 2, the first node corresponding to camera 20a includes sender 60a. A "sender" is a functional entity capable of sending a stream. A second node corresponding to IP gateway 20c includes senders 60b, 60c and 60d. Senders 60b, 60c and 60d may correspond to devices from which individual streams are served by IP gateway 20c. A third node, corresponding to monitor 20b, includes a receiver 65a. A "receiver" is a functional entity capable of receiving a stream. A fourth node corresponding to IP gateway 20d includes receivers 65b and 65c. Receivers 65b and 65c may correspond to devices to which individual streams are received by IP gateway 20d.

上の説明から理解されるように、１つのノード（「カード」と呼ばれてもよい）は、１つの物理エンティティを表現する。図１の例に限定されず、１つのノードは、機能エンティティとして、任意の数のセンダ及び／又は任意の数のレシーバを含んでよい。また、１つのノード内で複数の機能エンティティを包含する論理的な単位（例えば、図中の破線枠参照）が定義されてもよい（例えば、１つのＩＰゲートウェイに収容される１つのデバイスが複数のストリームを送信し又は受信するケース）。例えば、ＡＭＷＡにより検討されているＮＭＯＳは、こうした論理的なシステムモデルを前提として、ＩＰドメインでのストリームの伝送を管理し及び制御するためのアプリケーションプロトコルインタフェース（ＡＰＩ）の仕様を規定している。 As understood from the above description, one node (which may be called a "card") represents one physical entity. Without being limited to the example of FIG. 1, a node may include any number of senders and/or any number of receivers as functional entities. Also, a logical unit (for example, see the dashed frame in the figure) that includes multiple functional entities within one node may be defined (for example, one device accommodated in one IP gateway may be (case of sending or receiving a stream of For example, NMOS, which is being considered by AMWA, presupposes such a logical system model and specifies an application protocol interface (API) specification for managing and controlling the transmission of streams in the IP domain.

なお、本明細書において、ノード２０ａ、２０ｂ、２０ｃ及び２０ｄを互いに区別する必要が無い場合には、符号の末尾のアルファベットを省略することによりこれらをノード２０と総称する。センダ６０ａ、６０ｂ、６０ｃ、６０ｄ（センダ６０）及びレシーバ６５ａ、６５ｂ、６５ｃ（レシーバ６５）、並びに他の構成要素の符号についても同様である。 In this specification, when there is no need to distinguish between the nodes 20a, 20b, 20c, and 20d, they are collectively referred to as nodes 20 by omitting the alphabet at the end of the reference numerals. The same applies to senders 60a, 60b, 60c, 60d (sender 60) and receivers 65a, 65b, 65c (receiver 65), as well as other component numbers.

（４）エッセンスとストリーム
上述したように、テレビジョン番組のコンテンツは、概して、映像データ、音声データ及び補助データという３種類の放送素材のデータから構成される。本明細書では、放送素材の種類を区別してこれらコンテンツの構成要素へ言及するために、「エッセンス」との語を用いる。言い換えると、エッセンスは、データとして表現された放送素材である。エッセンスをＩＰネットワーク上で伝送しようとする場合、エッセンスは、あるＩＰベースのプロトコルに従って、デジタル信号へ変換されパケット化される。エッセンスを搬送する一連のＩＰパケットには、ストリーム単位で共通するポート番号が付与される。即ち、ストリームは、ＩＰアドレス及びポート番号が共通するＩＰパケットのシーケンスであり得る。ＩＰベースのストリーム伝送プロトコルとして、後述するどのプロトコルが使用される場合にも、通常、パケットは、アプリケーションレイヤではＲＴＰ（Real-time Transport Protocol）、トランスポートレイヤではＵＤＰ（User Datagram Protocol）に従って伝送される。 (4) Essence and Stream As described above, television program content generally consists of three types of broadcast material data: video data, audio data, and auxiliary data. The term "essence" is used herein to distinguish between types of broadcast material and to refer to these content components. In other words, the essence is broadcast material represented as data. When an essence is to be transmitted over an IP network, it is converted into digital signals and packetized according to some IP-based protocol. A series of IP packets carrying the essence is assigned a common port number for each stream. That is, a stream can be a sequence of IP packets with a common IP address and port number. No matter which protocol is used as the IP-based stream transmission protocol, packets are normally transmitted according to RTP (Real-time Transport Protocol) in the application layer and UDP (User Datagram Protocol) in the transport layer. be.

（５）ＩＰベースのストリーム伝送プロトコル
ＩＰベースのストリーム伝送のための代表的なプロトコルの１つは、ＳＭＰＴＥＳＴ２０２２－６である。ＳＭＰＴＥＳＴ２０２２－６は、ＳＤＩ信号をそのままＩＰパケットへマッピングする。そのため、単一のＳＴ２０２２－６ストリームが、異なる種類のエッセンスとブランキング期間に相当するデータとを含む。ＳＴ２０２２－６ストリームは、ＩＰドメイン及びＳＤＩドメインが混在する過渡期の放送局システムにおいて好適であり得る。 (5) IP-based stream transmission protocol One of the representative protocols for IP-based stream transmission is SMPTE ST2022-6. SMPTE ST 2022-6 maps the SDI signal as-is to the IP packet. Therefore, a single ST2022-6 stream contains different types of essence and data corresponding to blanking periods. The ST2022-6 stream may be suitable for transitional broadcast station systems with mixed IP and SDI domains.

ＩＰベースのストリーム伝送のための代表的なプロトコルの他の１つは、ＳＭＰＴＥＳＴ２１１０である。ＳＭＰＴＥＳＴ２１１０は、異なる種類のエッセンスをそれぞれ異なるストリームへマッピングする。そのため、単一のストリームは単一の種類のエッセンスのみを含み、どのストリームもブランキング期間に相当するデータを含まない。ＳＭＰＴＥＳＴ２１１０－１０は、ＰＴＰの仕組みに基づく、放送局システムに参加するノード間の高精度の同期のための手法を規定している。ＳＭＰＴＥＳＴ２１１０－２０は、非圧縮の映像エッセンスの伝送フォーマットを規定している。ＳＭＰＴＥＳＴ２１１０－２１は、映像エッセンスのトラフィックシェーピングのための手法を規定している。ＳＭＰＴＥＳＴ２１１０－３０は、非圧縮のＰＣＭ音声エッセンスの伝送フォーマットを規定している。ＳＭＰＴＥＳＴ２１１０－３１は、ＡＥＳ３音声エッセンスの伝送フォーマットを規定している。ＳＭＰＴＥＳＴ２１１０－４０は、補助データエッセンスの伝送フォーマットを規定している。なお、ＳＭＰＴＥＳＴ２１１０シリーズでは、映像エッセンスの圧縮はサポートされておらず、誤り訂正符号化／復号も行われない。 Another typical protocol for IP-based stream transmission is SMPTE ST2110. SMPTE ST 2110 maps different types of essences to different streams. Therefore, a single stream contains only a single type of essence, and no stream contains data corresponding to blanking periods. SMPTE ST2110-10 defines a technique for highly accurate synchronization between nodes participating in a broadcasting station system based on the PTP mechanism. SMPTE ST2110-20 defines a transmission format for uncompressed video essence. SMPTE ST2110-21 defines a technique for traffic shaping of video essence. SMPTE ST2110-30 defines a transmission format for uncompressed PCM audio essence. SMPTE ST2110-31 defines a transmission format for AES3 voice essence. SMPTE ST2110-40 defines the transmission format of the auxiliary data essence. It should be noted that the SMPTE ST2110 series does not support compression of video essence, nor does error correction encoding/decoding.

（６）ＳＭＰＴＥＳＴ２１１０－１０での時刻同期
テレビジョン番組を正確なスケジュールに従って放送するためには、システム内のノードが正確な時刻を認識していることを要する。また、異なる複数のノードから受信されるストリームを多重化し、複数の映像を合成し、又は映像と音声とを同時に再生する場合にも、ノード間で時刻の精細な同期が確立されていることを要する。図３は、こうした時刻同期の目的のための、ＳＭＰＴＥＳＴ２１１０－１０により規定されたシステムタイミングモデルについて説明するための説明図である。 (6) Time Synchronization in SMPTE ST2110-10 In order to broadcast television programs according to an accurate schedule, nodes in the system must know the exact time. Also, when multiplexing streams received from different nodes, synthesizing multiple videos, or reproducing video and audio simultaneously, it is necessary to ensure that fine time synchronization is established between nodes. need. FIG. 3 is an explanatory diagram for explaining the system timing model defined by SMPTE ST2110-10 for the purpose of such time synchronization.

図３には、一例として、放送局システム１のノード２０ａ及びノード２０ｂ、並びに共通基準クロック７０が示されている。共通基準クロック７０は、高精度の時刻源（例えば、ＧＰＳ（Global Positioning System）衛星などのＧＮＳＳ（Global Navigation Satellite System）衛星）に同期し、いわゆるＰＴＰグランドマスタとして動作する。共通基準クロック７０は、ＰＴＰの仕組みに従って、システム内のスレーブノードへ同期メッセージを配信する。 FIG. 3 shows the nodes 20a and 20b of the broadcasting station system 1 and the common reference clock 70 as an example. The common reference clock 70 synchronizes with a highly accurate time source (for example, a GNSS (Global Navigation Satellite System) satellite such as a GPS (Global Positioning System) satellite) and operates as a so-called PTP grandmaster. Common reference clock 70 distributes synchronization messages to slave nodes in the system according to the PTP mechanism.

ノード２０ａは、ＰＴＰのスレーブノードである。ノード２０ａは、自身のデバイス内部クロック７２ａを共通基準クロック７０に同期させ、共通基準クロック７０から受信される同期メッセージに基づいてその同期を維持（例えば、遅延を調整）する。ノード２０ａの映像用メディアクロック７３ａは、デバイス内部クロック７２ａにロックされ、映像固有の周波数で進行する。音声用メディアクロック７６ａは、デバイス内部クロック７２ａにロックされ、音声固有の周波数で進行する。ＳＭＰＴＥＳＴ２１１０－２０により規定された映像固有の周波数は９０ｋＨｚであり、ＳＭＰＴＥＳＴ２１１０－３０により規定された音声固有の周波数は４８ｋＨｚである。 The node 20a is a PTP slave node. Node 20 a synchronizes its device internal clock 72 a to common reference clock 70 and maintains that synchronization (eg, adjusts delay) based on synchronization messages received from common reference clock 70 . The video media clock 73a of the node 20a is locked to the device internal clock 72a and runs at a video-specific frequency. Audio media clock 76a is locked to device internal clock 72a and runs at audio-specific frequencies. The video specific frequency specified by SMPTE ST2110-20 is 90 kHz and the audio specific frequency specified by SMPTE ST2110-30 is 48 kHz.

通常、ＲＴＰパケットにはメディアクロックに対してランダムに生成されるオフセットを有するタイムスタンプが付与されるが、ＳＭＰＴＥＳＴ２１１０－１０ではオフセットはゼロとされる。それにより、何らかの障害に起因してセンダがリスタートした場合のストリーム伝送の迅速な復旧が可能とされる（なぜなら、ランダムオフセットの決定のための処理が省略されるためである）。即ち、ノード２０ａの映像用ＲＴＰクロック７４ａは、映像用メディアクロック７３ａに対してオフセットを有さず、映像用メディアクロック７３ａと同一の時刻を示す。そして、ノード２０ａが送信する映像エッセンスのＲＴＰストリームの各パケットのＲＴＰヘッダには、映像用ＲＴＰクロック７４ａに従ってＲＴＰタイムスタンプが付与される。同様に、ノード２０ａの音声用ＲＴＰクロック７７ａは、音声用メディアクロック７６ａに対してオフセットを有さず、音声用メディアクロック７６ａと同一の時刻を示す。そして、ノード２０ａが送信する音声エッセンスのＲＴＰストリームの各パケットのＲＴＰヘッダには、音声用ＲＴＰクロック７７ａに従ってＲＴＰタイムスタンプが付与される。 Normally, RTP packets are timestamped with a randomly generated offset relative to the media clock, but in SMPTE ST2110-10 the offset is zero. This allows rapid restoration of stream transmission when the sender is restarted due to some failure (because the process for determining the random offset is omitted). That is, the video RTP clock 74a of the node 20a has no offset with respect to the video media clock 73a and indicates the same time as the video media clock 73a. An RTP time stamp is added to the RTP header of each packet of the RTP stream of the video essence transmitted by the node 20a according to the video RTP clock 74a. Similarly, the audio RTP clock 77a of the node 20a has no offset with respect to the audio media clock 76a and indicates the same time as the audio media clock 76a. An RTP time stamp is added to the RTP header of each packet of the RTP stream of the audio essence transmitted by the node 20a according to the audio RTP clock 77a.

概していうと、ＳＭＰＴＥＳＴ２１１０－１０において、映像又は音声をキャプチャするデバイスは、キャプチャ時刻をＲＴＰタイムスタンプとして各パケットに付与する。コンテンツをストレージからプレイバックするデバイスは、通信インタフェースからのエッセンスの出力時刻をＲＴＰタイムスタンプとして各パケットに付与する。ＳＤＩ信号をＩＰパケットへ変換するデバイスは、アラインメント時点のＳＤＩ信号のクロック値をＲＴＰタイムスタンプとして各パケットに付与する。 Generally speaking, in SMPTE ST2110-10, a device that captures video or audio attaches the capture time as an RTP timestamp to each packet. A device that plays back content from a storage attaches the essence output time from the communication interface to each packet as an RTP time stamp. A device that converts an SDI signal into IP packets attaches the clock value of the SDI signal at the time of alignment to each packet as an RTP timestamp.

ノード２０ｂもまた、ＰＴＰのスレーブノードである。ノード２０ｂは、自身のデバイス内部クロック７２ｂを共通基準クロック７０に同期させ、共通基準クロック７０から受信される同期メッセージに基づいてその同期を維持（例えば、遅延を調整）する。図には示していないものの、ノード２０ａと同様に、ノード２０ｂも、デバイス内部クロック７２ｂにロックされたメディアクロックと、メディアクロックに対しオフセットを有しないＲＴＰクロックとを有する。ノード２０ｂは、例えばノード２０ａから送信される映像エッセンスのＲＴＰストリーム及び音声エッセンスのＲＴＰストリームをそれぞれ異なるポート番号で受信する。そして、ノード２０ｂは、自身のＲＴＰクロックと、受信したＲＴＰストリームの各パケットのＲＴＰヘッダに付与されたＲＴＰタイムスタンプとに基づいて、パケット間の時間合わせ（time alignment）を行う。ＰＴＰを活用するこうした仕組みにより、放送局のＩＰネットワークへ参加するノード間で、誤差１０μ秒以下という高精度の時間同期が可能とされる。 Node 20b is also a PTP slave node. Node 20 b synchronizes its device internal clock 72 b to common reference clock 70 and maintains that synchronization (eg, adjusts delay) based on synchronization messages received from common reference clock 70 . Although not shown, like node 20a, node 20b also has a media clock locked to device internal clock 72b and an RTP clock that has no offset relative to the media clock. The node 20b receives, for example, the RTP stream of the video essence and the RTP stream of the audio essence transmitted from the node 20a with different port numbers. Then, the node 20b performs time alignment between packets based on its own RTP clock and the RTP timestamp added to the RTP header of each packet of the received RTP stream. Such a scheme that utilizes PTP enables high-precision time synchronization with an error of 10 μs or less between nodes participating in the IP network of a broadcasting station.

図４は、図３を用いて説明したＳＭＰＴＥＳＴ２１１０－１０のシステムタイミングモデルに基づく映像エッセンスと音声エッセンスとの間の時間合わせについて説明するための説明図である。図４を参照すると、ノード２０ａは、映像エッセンスのＲＴＰストリーム８１及び音声エッセンスのＲＴＰストリーム８２を並列的にネットワークへ送出する。ＲＴＰストリーム８１に含まれるパケット（Ｖ_１、Ｖ_２、Ｖ_３、…）の各々は、映像用ＲＴＰクロックに従って付与されたＲＴＰタイムスタンプとシーケンス番号とをＲＴＰヘッダ内に有する。ＲＴＰストリーム８２に含まれるパケット（Ａ_１、Ａ_２、Ａ_３、…）の各々は、音声用ＲＴＰクロックに従って付与されたＲＴＰタイムスタンプとシーケンス番号とをＲＴＰヘッダ内に有する。ノード２０ｂは、ＵＤＰポートＰ_Ｖにおいて、ＲＴＰストリーム８１の一連のＲＴＰパケットを受信し、受信したＲＴＰパケットをシーケンス番号順に処理する。また、ノード２０ｂは、ＵＤＰポートＰ_Ａにおいて、ＲＴＰストリーム８２の一連のＲＴＰパケットを受信し、受信したＲＴＰパケットをシーケンス番号順に処理する。例えば、ノード２０ｂは、映像及び音声を同期的に再生しようする場合、ＲＴＰストリーム８１の各パケットのＲＴＰタイムスタンプとＲＴＰストリーム８２の各パケットのＲＴＰタイムスタンプとを比較して、映像及び音声の再生時刻（例えば、フレームタイミング）を互いに同期させる。なお、複数の映像エッセンスの間、複数の音声エッセンスの間、及び映像エッセンス又は音声エッセンスと補助データエッセンスとの間の時間合わせも同様に行われ得る。 FIG. 4 is an explanatory diagram for explaining time alignment between a video essence and an audio essence based on the SMPTE ST2110-10 system timing model described with reference to FIG. Referring to FIG. 4, the node 20a transmits an RTP stream 81 of video essence and an RTP stream 82 of audio essence in parallel to the network. Each packet (V ₁ , V ₂ , V ₃ , . . . ) included in the RTP stream 81 has an RTP time stamp and a sequence number assigned according to the video RTP clock in the RTP header. Each of the packets (A ₁ , A ₂ , A ₃ , . . . ) included in the RTP stream 82 has an RTP timestamp and a sequence number assigned according to the audio RTP clock in the RTP header. Node 20b receives a series of RTP packets of RTP stream 81 at UDP port _PV and processes the received RTP packets in sequence number order. Node 20b also receives a series of RTP packets of RTP stream 82 at UDP port _PA and processes the received RTP packets in sequence number order. For example, when reproducing video and audio synchronously, the node 20b compares the RTP time stamp of each packet of the RTP stream 81 with the RTP time stamp of each packet of the RTP stream 82, and reproduces video and audio. Synchronize times (eg, frame timing) with each other. It should be noted that time alignment between video essences, between audio essences, and between video or audio essences and auxiliary data essences can be performed as well.

（７）ＡＲＩＢＳＴＤ－Ｂ７３
相互に関連する映像、音声及び補助データをＳＭＰＴＥＳＴ２１１０のように別々のストリームで伝送すると、トランスポートレイヤでそれらストリームのポート番号が相違することから、受信側でもとのコンテンツを再構築する処理が複雑化する。ポート番号が相違すれば、ＩＰネットワーク上での経路制御の結果として、個々のストリームが辿る伝送経路の違いからストリームごとに異なる遅延を受ける可能性もある。こうした不都合を解消するために、日本国内の標準化団体であるＡＲＩＢは、映像、音声及び補助データを単一のストリームで伝送するためのデータ構造の規定として、ＡＲＩＢＳＴＤ－Ｂ７３を規格化済みである。 (7) ARIB STD-B73
When video, audio and ancillary data that are related to each other are transmitted in separate streams like SMPTE ST2110, the port numbers of these streams are different in the transport layer, so processing to reconstruct the original contents on the receiving side is required. Complicated. If the port numbers are different, each stream may receive a different delay due to the difference in the transmission path followed by each stream as a result of path control on the IP network. In order to eliminate these inconveniences, ARIB, a standardization organization in Japan, has standardized ARIB STD-B73 as a data structure specification for transmitting video, audio and auxiliary data in a single stream. .

図５は、ＡＲＩＢＳＴＤ－Ｂ７３１．０版の第２章に記述されているデータグラムの構成について説明するための説明図である。図５に太線の枠で示したデータグラムは、先行するネットワークヘッダ及び後続するＦＣＳ（Frame Check Sequence）と共に、１つのパケットを構成する。ネットワークヘッダは、例えば、ＭＡＣ（Medium Access Control）ヘッダ（例えば、イーサネットヘッダ）、ＩＰヘッダ及びＵＤＰヘッダを含む。エッセンスデータのためのＲＴＰデータグラムは、ＲＴＰヘッダ及び共通ヘッダを含むトランスポートヘッダと、エッセンスヘッダと、エッセンスペイロードとを含む。エッセンスデータグラムは、エッセンスヘッダと、エッセンスペイロードとを含む。共通ヘッダ、エッセンスヘッダ及びエッセンスペイロードを含む部分をＲＴＰペイロードともいう。一方、図には示していないものの、ＦＥＣデータのためのＲＴＰデータグラムは、エッセンスヘッダ及びエッセンスペイロードの代わりにＦＥＣペイロードを含む。なお、本明細書において、パケット及びデータグラムという用語は、互換可能に使用される。これら用語の意味は、当業者により容易に理解されるであろう。 FIG. 5 is an explanatory diagram for explaining the structure of a datagram described in Chapter 2 of ARIB STD-B73 Version 1.0. A datagram indicated by a thick frame in FIG. 5 constitutes one packet together with a preceding network header and a subsequent FCS (Frame Check Sequence). Network headers include, for example, MAC (Medium Access Control) headers (eg, Ethernet headers), IP headers, and UDP headers. An RTP datagram for essence data includes a transport header including an RTP header and a common header, an essence header, and an essence payload. An essence datagram includes an essence header and an essence payload. The portion including the common header, essence header and essence payload is also called RTP payload. On the other hand, although not shown, the RTP datagram for FEC data contains the FEC payload instead of the essence header and essence payload. Note that the terms packet and datagram are used interchangeably herein. The meaning of these terms will be readily understood by those skilled in the art.

ＲＴＰヘッダは、例えば、データ項目として「ペイロードタイプ」、「シーケンス番号」及び「タイムスタンプ」を含む。「ペイロードタイプ」には固定的な値（例えば、１１０）が設定される。「シーケンス番号」の値は各伝送で１インクリメントされる。その初期値はランダムに設定され得る。「タイムスタンプ」の値は単調に線形的にインクリメントされるものの、ＡＲＩＢＳＴＤ－Ｂ７３１．０版は本フィールドを同期に使用しないことを規定している。 The RTP header includes, for example, "payload type", "sequence number" and "time stamp" as data items. A fixed value (for example, 110) is set in the “payload type”. The "sequence number" value is incremented by one with each transmission. Its initial value can be set randomly. Although the "timestamp" value is monotonically linearly incremented, ARIB STD-B73 Version 1.0 specifies that this field is not used for synchronization.

共通ヘッダは、例えば、データ項目として「フレームカウント」、「データグラムタイプ」及び「シーケンス番号」を含む。共通ヘッダの「フレームカウント」には、対応するエッセンスヘッダの「フレームカウント」と同じ値が設定される。「データグラムタイプ」には、対応するデータグラムがエッセンスデータグラムであるか又はＦＥＣ（Forward Error Correction）データグラムであるかを示す値が設定される。「シーケンス番号」の値は、映像エッセンス、音声エッセンス及び補助データエッセンス（並びにＦＥＣ）で別々に、伝送の都度１インクリメントされる。その初期値はランダムに設定され得る。 The common header includes, for example, "frame count", "datagram type" and "sequence number" as data items. The "frame count" of the common header is set to the same value as the "frame count" of the corresponding essence header. A value indicating whether the corresponding datagram is an essence datagram or an FEC (Forward Error Correction) datagram is set in the "datagram type". The "sequence number" value is incremented by one for each transmission separately for video essence, audio essence and auxiliary data essence (and FEC). Its initial value can be set randomly.

エッセンスヘッダは、例えば、データ項目として「ペイロードタイプ」及び「フレームカウント」を含む。「ペイロードタイプ」には、後続するエッセンスペイロードに含まれるエッセンスのタイプを示す値が設定される（０：映像、１：音声、２：補助データ、など）。エッセンスヘッダの「フレームカウント」には、受信側で各エッセンスのデータグラムの同期を取るための値が設定される。ＳＭＰＴＥＳＴ２０５９－１で定義されているエポックの時刻が「フレームカウント」の初期値ゼロとして使用される。「フレームカウント」の値は新しい映像フレームの開始の都度１インクリメントされ、同じ映像フレームに属する全てのエッセンスデータグラムに同じ値が設定される。 The essence header includes, for example, "payload type" and "frame count" as data items. "Payload type" is set with a value indicating the type of essence contained in the subsequent essence payload (0: video, 1: audio, 2: auxiliary data, etc.). A value for synchronizing datagrams of each essence on the receiving side is set in the "frame count" of the essence header. The epoch time defined in SMPTE ST2059-1 is used as the initial value of zero for the "frame count". The "frame count" value is incremented by one at the start of each new video frame, and all essence datagrams belonging to the same video frame are set to the same value.

図６は、ＡＲＩＢＳＴＤ－Ｂ７３に従った単一ストリームでの映像エッセンス及び音声エッセンスの伝送について説明するための説明図である。図６を参照すると、ノード２０ｃは、映像エッセンスのためのパケット及び音声エッセンスのためのパケットの双方を含むＲＴＰストリーム８３をネットワークへ送出する。ＲＴＰストリーム８３に含まれるパケット（Ｖ_１、Ａ_１、Ｖ_２、Ａ_２、…）の各々は、フレームカウント値をエッセンスヘッダ内に有する。ノード２０ｄは、単一のポートＰ_Ｍにおいて、ＲＴＰストリーム８３の一連のＲＴＰパケットを受信し、受信したＲＴＰパケットをシーケンス番号順に処理する。上述したように、ＡＲＩＢＳＴＤ－Ｂ７３によれば、ＲＴＰヘッダ内のシーケンス番号の順で前後に並ぶ一群のＲＴＰパケットが同じ映像フレームに属し、同じフレームカウント値を有する。そのため、ノード２０ｄは、映像エッセンス及び音声エッセンスを含む当該一群のＲＴＰパケットを同期的に処理することができる。 FIG. 6 is an explanatory diagram for explaining transmission of video essence and audio essence in a single stream according to ARIB STD-B73. Referring to FIG. 6, node 20c sends an RTP stream 83 containing both packets for video essence and packets for audio essence to the network. Each of the packets (V ₁ , A ₁ , V ₂ , A ₂ , . . . ) contained in RTP stream 83 has a frame count value in its essence header. Node 20d receives a series of RTP packets of RTP stream 83 at a single port _PM and processes the received RTP packets in sequence number order. As described above, according to ARIB STD-B73, a group of RTP packets that are arranged one behind the other in the order of sequence numbers in the RTP header belong to the same video frame and have the same frame count value. Therefore, the node 20d can synchronously process the group of RTP packets containing video essence and audio essence.

しかし、ＡＲＩＢＳＴＤ－Ｂ７３は、放送局システムに参加するノード間でどのように協調的に動作してストリームを処理すべきかを規定していない。 However, ARIB STD-B73 does not specify how the nodes participating in the broadcasting station system should operate cooperatively to process streams.

例えば、ストリームの伝送に関与するノード間で高精度の同期が確立されておらず、パケット間の時間合わせのために使用すべき時刻情報用のフィールドに合意が無ければ、放送局システム内でやり取りされる多様なエッセンスを柔軟に組み合わせて放送コンテンツを構成することができない。そこで、後述する第１の実施形態及び第２の実施形態において、放送局のＩＰネットワーク上で映像データ、音声データ又は補助データといったエッセンスデータを単一のストリームで伝送するプロトコル（例えば、ＡＲＩＢＳＴＤ－Ｂ７３）を利用する際に、エッセンスデータの適切な時間合わせを行う仕組みを提案する。 For example, if high-precision synchronization is not established between nodes involved in stream transmission, and if there is no consensus on the field for time information that should be used for time alignment between packets, communication within the broadcasting station system Broadcast contents cannot be configured by flexibly combining various essences. Therefore, in a first embodiment and a second embodiment to be described later, a protocol (for example, ARIB STD- B73), we propose a mechanism for appropriately adjusting the time of essence data.

加えて、ＳＭＰＴＥＳＴ２１１０ストリームのために通常利用されるＳＤＰオブジェクトのフォーマットは、フォーマット構造においても、記述される情報の内容においても、ＡＲＩＢＳＴＤ－Ｂ７３ストリームの特性に必ずしも適していない。そこで、後述する第３の実施形態及び第４の実施形態において、放送局のＩＰネットワーク上で映像データ、音声データ又は補助データといったエッセンスデータを単一のストリームで伝送するプロトコルを利用する際に、当該ストリームの伝送を適切にセットアップすることを可能にする手法を説明する。 In addition, the format of the SDP objects normally used for SMPTE ST2110 streams is not necessarily suitable for the characteristics of ARIB STD-B73 streams, either in format structure or in the information content described. Therefore, in the third and fourth embodiments described later, when using a protocol for transmitting essence data such as video data, audio data, or auxiliary data in a single stream on an IP network of a broadcasting station, Techniques are described that allow the transmission of the stream to be properly set up.

なお、本明細書において、異なる種類のエッセンスデータが単一のストリームで伝送されるようなストリームを、エッセンス混在型ストリームともいう。エッセンス混在型ストリームの例は、ＡＲＩＢＳＴＤ－Ｂ７３ストリーム及びＳＭＰＴＥＳＴ２０２２－６ストリームを含む。また、本明細書において、異なる種類のエッセンスデータがそれぞれ異なるストリームで伝送されるようなストリームを、エッセンス分離型ストリームともいう。エッセンス分離型ストリームの例は、ＳＭＰＴＥＳＴ２１１０ストリームを含む。 In this specification, a stream in which different types of essence data are transmitted in a single stream is also called an essence-mixed stream. Examples of mixed-essence streams include ARIB STD-B73 streams and SMPTE ST2022-6 streams. Also, in this specification, a stream in which different types of essence data are transmitted in different streams is also referred to as an essence separation type stream. Examples of essence-separated streams include SMPTE ST2110 streams.

＜＜２．第１の実施形態＞＞
本章で説明する放送信号処理ノード１００は、上述したセンダ６０及びレシーバ６５の双方の機能性を含む。しかしながら、当業者にとって明らかなように、放送信号処理ノード１００において、センダ６０及びレシーバ６５のうちの一方の機能性が省略されてもよい。放送信号処理ノード１００は、放送局システム１において、例えば、図１及び図２を用いて説明したノード２０ａ～２０ｄのいずれかに相当し得る。 <<2. First Embodiment>>
The broadcast signal processing node 100 described in this section includes the functionality of both the sender 60 and receiver 65 described above. However, as will be apparent to those skilled in the art, in broadcast signal processing node 100 the functionality of one of sender 60 and receiver 65 may be omitted. The broadcast signal processing node 100 can correspond to any one of the nodes 20a to 20d described with reference to FIGS. 1 and 2 in the broadcasting station system 1, for example.

＜２－１．放送信号処理ノードの構成例＞
図７は、第１の実施形態に係る放送信号処理ノード１００の構成の一例を示すブロック図である。図７を参照すると、放送信号処理ノード１００は、デバイス内部クロック１１０、メディアクロック１１２、ＲＴＰクロック１１４、ＰＴＰ処理部１１６、通信部１２０、送信ストリーム処理部１３０、受信ストリーム処理部１４０、データ処理部１８０及び制御部１９０を備える。 <2-1. Configuration example of broadcast signal processing node>
FIG. 7 is a block diagram showing an example of the configuration of the broadcast signal processing node 100 according to the first embodiment. Referring to FIG. 7, broadcast signal processing node 100 includes device internal clock 110, media clock 112, RTP clock 114, PTP processing unit 116, communication unit 120, transmission stream processing unit 130, reception stream processing unit 140, data processing unit 180 and a control unit 190 .

（１）デバイス内部クロック
デバイス内部クロック（機器内部クロックともいう）１１０は、放送信号処理ノード１００が保持する固有の内部的なクロックである。本実施形態において、デバイス内部クロック１１０は、ＰＴＰの時刻源に直接的に又は間接的に同期する。典型的には、放送信号処理ノード１００がＰＴＰスレーブである場合、デバイス内部クロック１１０は、ＰＴＰグランドマスタを介してＰＴＰの時刻源に間接的に同期する。一方、放送信号処理ノード１００がＰＴＰマスタである場合、デバイス内部クロック１１０は、ＰＴＰの時刻源に直接的に同期する。例えば、後述する制御部１９０は、放送信号処理ノード１００がＰＴＰマスタにならないように自身を設定してもよい。 (1) Device Internal Clock The device internal clock (also called device internal clock) 110 is a unique internal clock held by the broadcast signal processing node 100 . In this embodiment, the device internal clock 110 is directly or indirectly synchronized to the PTP's time source. Typically, when the broadcast signal processing node 100 is a PTP slave, the device internal clock 110 is indirectly synchronized to the PTP's time source via the PTP grandmaster. On the other hand, if the broadcast signal processing node 100 is the PTP master, the device internal clock 110 is directly synchronized to the PTP's time source. For example, the control unit 190, which will be described later, may set itself so that the broadcast signal processing node 100 does not become the PTP master.

（２）メディアクロック
メディアクロック１１２は、デジタルメディア信号の処理（例えば、サンプリング及び再構成）のために使用されるクロックである。本実施形態において、メディアクロック１１２は、ＳＭＰＴＥＳＴ２０５９－１で定義されているエポックの時刻を初期値ゼロとして使用し、デバイス内部クロック１１０に周波数ロックされて、正確なレートで進行する。放送信号処理ノード１００が共通基準クロックを取得できず、ローカルタイムベース上で動作している場合には、ＳＭＰＴＥＳＴ２０５９－１で定義されている上記エポックを前提として、利用可能な時刻源のうち最良のものがメディアクロック１１２及び後述するＲＴＰクロック１１４のために用いられ得る。 (2) Media Clock Media clock 112 is a clock used for processing (eg, sampling and reconstruction) of digital media signals. In this embodiment, the media clock 112 uses the epoch time defined in SMPTE ST2059-1 as an initial value of zero and is frequency locked to the device internal clock 110 to run at the correct rate. If the broadcast signal processing node 100 cannot obtain a common reference clock and is operating on a local timebase, it will use the best available time source given the above epoch defined in SMPTEST ST2059-1. can be used for the media clock 112 and the RTP clock 114 described below.

本実施形態において、放送信号処理ノード１００は、異なる種類のエッセンスデータを単一のストリームで伝送するための伝送プロトコルであるＡＲＩＢＳＴＤ－Ｂ７３をサポートする。ＡＲＩＢＳＴＤ－Ｂ７３では、映像エッセンス、音声エッセンス及び補助データエッセンスの各々のタイミングは、映像フレームに関連付けられる。この場合、ＲＴＰタイムスタンプの基礎となるメディアクロックはエッセンスデータの種類によらず単一であってよい。後述するＲＴＰクロック１１４も同様である。 In this embodiment, the broadcast signal processing node 100 supports ARIB STD-B73, which is a transmission protocol for transmitting different types of essence data in a single stream. In ARIB STD-B73, the timing of each video essence, audio essence and ancillary data essence is associated with a video frame. In this case, the media clock on which the RTP timestamp is based may be a single one regardless of the type of essence data. The same applies to the RTP clock 114, which will be described later.

ＳＭＰＴＥＳＴ２１１０－２０では、映像エッセンスのためのメディアクロック周波数は、９０ｋＨｚと規定されている。一方、既存の多くの放送用機器は、２７．０ＭＨｚのクロックを有する。ＳＤＩイメージを伝送するＳＭＰＴＥＳＴ２０２２－６ストリームをＳＭＰＴＥＳＴ２１１０－１０システムへ統合するために検討されているＳＭＰＴＥＳＴ２０２２－８では、２７．０ＭＨｚのメディアクロック周波数が規定されている。メディアクロック周波数を９０ｋＨｚとした場合、当該周波数は、映像フレームの周波数として利用されることの多い６０／１．００１Ｈｚの整数倍にならない。一方、メディアクロック周波数を２７．０ＭＨｚとした場合、当該周波数は６０／１．００１Ｈｚの整数倍となる。この点を考慮し、本実施形態では、フレーム期間を正確に一定にして、クロック値を単調増加させる点において取り扱い上有利な２７．０ＭＨｚを、メディアクロック１１２のクロック周波数として利用する。 SMPTE ST2110-20 specifies that the media clock frequency for video essence is 90 kHz. On the other hand, many existing broadcasting equipment have a clock of 27.0 MHz. SMPTE ST2022-8, which is under consideration for integrating SMPTE ST2022-6 streams carrying SDI images into the SMPTE ST2110-10 system, specifies a media clock frequency of 27.0 MHz. When the media clock frequency is 90 kHz, the frequency is not an integer multiple of 60/1.001 Hz, which is often used as the frequency of video frames. On the other hand, when the media clock frequency is 27.0 MHz, the frequency is an integer multiple of 60/1.001 Hz. Considering this point, in the present embodiment, 27.0 MHz is used as the clock frequency of the media clock 112, which is advantageous in terms of handling in that the frame period is accurately kept constant and the clock value is monotonously increased.

（３）ＲＴＰクロック
ＲＴＰクロック１１４は、ＲＴＰパケットのＲＴＰヘッダへ付与されるタイムスタンプの基礎となるクロックである。本実施形態において、ＲＴＰクロック１１４は、ＳＭＰＴＥＳＴ２１１０－１０の規定に従い、メディアクロック１１２に対しオフセットを有しないものとする。即ち、ＲＴＰクロック１１４は、関連付けられるメディアクロック１１２と同一の値を有する。したがって、本実施形態において、ＲＴＰクロック１１４のクロック周波数は、メディアクロック１１２のクロック周波数と同様に２７．０ＭＨｚに等しい。 (3) RTP Clock The RTP clock 114 is a clock that forms the basis of time stamps added to the RTP headers of RTP packets. In this embodiment, the RTP clock 114 shall have no offset with respect to the media clock 112 as specified in SMPTE ST2110-10. That is, the RTP clock 114 has the same value as the associated media clock 112 . Therefore, in this embodiment, the clock frequency of RTP clock 114 is equal to 27.0 MHz, as is the clock frequency of media clock 112 .

（４）ＰＴＰ処理部
本実施形態において、放送信号処理ノード１００は、例えばＳＭＰＴＥＳＴ２０５９－２のＰＴＰプロファイルをサポートする。そして、ＰＴＰ処理部１１６は、ＰＴＰマスタ（例えば、図３を用いて説明した共通基準クロック７０）との間で通信部１２０を介して同期メッセージを交換することにより、デバイス内部クロック１１０のＰＴＰ時刻源との同期を維持する。それにより、放送信号処理ノード１００は、同じくＰＴＰ時刻源と同期したクロックを有する他のノードとの間で、高精度で同期的に動作することが可能となる。 (4) PTP Processing Unit In this embodiment, the broadcast signal processing node 100 supports the PTP profile of SMPTE ST2059-2, for example. Then, the PTP processing unit 116 exchanges synchronization messages with the PTP master (for example, the common reference clock 70 described using FIG. 3) via the communication unit 120 to obtain the PTP time of the device internal clock 110. Stay synchronized with the source. Thereby, the broadcast signal processing node 100 can operate synchronously with high precision with other nodes having clocks synchronized with the PTP time source as well.

（５）通信部
通信部１２０は、放送信号処理ノード１００による他のノードとの通信を仲介するインタフェースである。通信部１２０は、有線通信のための接続端子及び接続回路を含んでもよく、又は無線通信のためのアンテナ、ＲＦ（Radio Frequency）回路及びベースバンド回路を含んでもよい。本実施形態において、通信部１２０は、送信部１２２及び受信部１２４を含む。 (5) Communication Unit The communication unit 120 is an interface that mediates communication between the broadcast signal processing node 100 and other nodes. The communication unit 120 may include connection terminals and connection circuits for wired communication, or may include an antenna, an RF (Radio Frequency) circuit, and a baseband circuit for wireless communication. In this embodiment, the communication unit 120 includes a transmitter 122 and a receiver 124 .

送信部１２２には、後述する送信ストリーム処理部１３０により生成される、放送信号ストリームのための一連のＲＴＰパケットが入力される。各ＲＴＰパケットは、映像データ、音声データ及び補助データのうちのいずれかに相当するエッセンスデータをＲＴＰペイロードに含む。各ＲＴＰパケットのＲＴＰヘッダには、デバイス内部クロック１１０にロックされたメディアクロック１１２に対しオフセットを有しないＲＴＰクロック１１４に従ってＲＴＰタイムスタンプが付与されている。送信部１２２は、入力される各ＲＴＰパケットにネットワークヘッダを追加して、各パケットを放送局システム１に参加する他のノードへ送信する。上記ストリームがＡＲＩＢＳＴＤ－Ｂ７３ストリームである場合、ネットワークヘッダ内に記述されるポート番号は、映像データを搬送するパケット、音声データを搬送するパケット及び補助データを搬送するパケットで共通的である。 A series of RTP packets for a broadcast signal stream generated by a transmission stream processing unit 130 (to be described later) are input to the transmission unit 122 . Each RTP packet contains essence data corresponding to any one of video data, audio data and ancillary data in the RTP payload. The RTP header of each RTP packet is given an RTP timestamp according to the RTP clock 114 which has no offset with respect to the media clock 112 locked to the device internal clock 110 . The transmitting unit 122 adds a network header to each input RTP packet and transmits each packet to other nodes participating in the broadcasting station system 1 . If the stream is an ARIB STD-B73 stream, the port number described in the network header is common to packets carrying video data, packets carrying audio data, and packets carrying auxiliary data.

受信部１２４は、一連のＲＴＰパケットからなる放送信号ストリームを、放送信号処理ノード１００の送信ストリーム処理部１３０及び送信部１２２と同様のセンダ機能を有する他のノードから受信する。各ＲＴＰパケットは、映像データ、音声データ及び補助データのうちのいずれかに相当するエッセンスデータをＲＴＰペイロードに含む。上記ストリームがＡＲＩＢＳＴＤ－Ｂ７３ストリームである場合、一連のＲＴＰパケットは、共通的なポート番号を介して、単一のストリームとして受信される。各ＲＴＰパケットのＲＴＰヘッダは、送信側のノードのデバイス内部クロックにロックされたメディアクロックに対しオフセットを有しないＲＴＰクロックに従って付与されたＲＴＰタイムスタンプを含む。各ＲＴＰパケットのＲＴＰペイロード内のヘッダは、同じＲＴＰクロックに基づくフレームカウント情報を含む。放送信号処理ノード１００及び送信側のノードは、上述したようにＰＴＰの時刻源に直接的に又は間接的に同期している。したがって、それらノード間のクロックの誤差は、例えば１０μ秒以下であり得る。 The receiver 124 receives a broadcast signal stream consisting of a series of RTP packets from another node having the same sender function as the transmission stream processor 130 and the transmitter 122 of the broadcast signal processing node 100 . Each RTP packet contains essence data corresponding to any one of video data, audio data and ancillary data in the RTP payload. If the stream is an ARIB STD-B73 stream, a series of RTP packets are received as a single stream through a common port number. The RTP header of each RTP packet contains an RTP timestamp appended according to the RTP clock with no offset relative to the media clock locked to the device internal clock of the sending node. The header in the RTP payload of each RTP packet contains frame count information based on the same RTP clock. The broadcast signal processing node 100 and the node on the transmitting side are directly or indirectly synchronized with the PTP time source as described above. Therefore, the clock error between those nodes can be, for example, 10 μs or less.

受信部１２４は、ＡＲＩＢＳＴＤ－Ｂ７３ストリーム以外の放送信号ストリームを他のノードから受信してもよい。例えば、受信部１２４は、ＡＲＩＢＳＴＤ－Ｂ７３ストリームと同様のエッセンス混在型ストリーム（例えば、ＳＭＰＴＥＳＴ２０２２－６ストリーム）を受信してもよい。また、受信部１２４は、ＳＭＰＴＥＳＴ２１１０ストリームのようなエッセンス分離型ストリームを受信してもよい。 The receiving unit 124 may receive broadcast signal streams other than the ARIB STD-B73 stream from other nodes. For example, the receiver 124 may receive a mixed-essence stream similar to the ARIB STD-B73 stream (eg, SMPTE ST2022-6 stream). The receiver 124 may also receive an essence-separated stream such as an SMPTE ST2110 stream.

受信部１２４は、受信した放送信号ストリームの各パケットからネットワークヘッダを除去して、ＲＴＰヘッダ及びＲＴＰペイロードからなるＲＴＰパケットを受信ストリーム処理部１４０へ出力する。 The receiving unit 124 removes the network header from each packet of the received broadcast signal stream and outputs RTP packets composed of the RTP header and RTP payload to the received stream processing unit 140 .

（６）送信ストリーム処理部
送信ストリーム処理部１３０は、データ処理部１８０から入力される映像データ、音声データ又は補助データのシーケンスを処理して、放送信号ストリームのための一連のＲＴＰパケットを生成する。送信ストリーム処理部１３０は、例えば、入力されるデータシーケンスを１つ以上のエッセンスペイロードへセグメント化し、各ペイロードにエッセンスヘッダを追加する。エッセンスヘッダ内のペイロードタイプは、対応するエッセンスのタイプを示す値に設定され、フレームカウントの値は新しい映像フレームの開始の都度１インクリメントされる。送信ストリーム処理部１３０は、さらに、各パケットにトランスポートヘッダ（共通ヘッダ及びＲＴＰヘッダ）を追加する。共通ヘッダ内のシーケンス番号の値は、映像エッセンス、音声エッセンス及び補助データエッセンスで別々に、伝送の都度１インクリメントされる。ＲＴＰヘッダ内のペイロードタイプは固定的な値に設定され、シーケンス番号の値はペイロードタイプによらず各伝送で１インクリメントされる。送信ストリーム処理部１３０は、さらに、ＲＴＰクロック１１４に従ってＲＴＰヘッダへＲＴＰタイムスタンプを付与する。送信ストリーム処理部１３０は、このように生成される一連のＲＴＰパケットを送信部１２２へ出力する。 (6) Transmission Stream Processing Unit The transmission stream processing unit 130 processes a sequence of video data, audio data or auxiliary data input from the data processing unit 180 to generate a series of RTP packets for the broadcast signal stream. . The transmission stream processor 130 may, for example, segment an incoming data sequence into one or more essence payloads and add an essence header to each payload. The payload type in the essence header is set to a value that indicates the type of the corresponding essence, and the frame count value is incremented by one at the start of each new video frame. The transmission stream processing unit 130 further adds a transport header (common header and RTP header) to each packet. The value of the sequence number in the common header is incremented by 1 for each transmission separately for video essence, audio essence and ancillary data essence. The payload type in the RTP header is set to a fixed value and the sequence number value is incremented by 1 for each transmission regardless of the payload type. The transmission stream processing unit 130 further adds an RTP timestamp to the RTP header according to the RTP clock 114 . The transmission stream processing unit 130 outputs a series of RTP packets generated in this way to the transmission unit 122 .

（７）受信ストリーム処理部
受信ストリーム処理部１４０は、他のノードから受信部１２４を介して受信される放送信号ストリームの一連のＲＴＰパケットを処理して、映像データ、音声データ又は補助データを復元する。そして、受信ストリーム処理部１４０は、復元したデータのシーケンスをデータ処理部１８０へ出力する。とりわけ、本実施形態において、受信ストリーム処理部１４０は、受信されるＲＴＰパケットのＲＴＰヘッダから取得されるＲＴＰタイムスタンプ、又は、受信されるＲＴＰパケットのＲＴＰペイロード内のヘッダ（例えば、エッセンスヘッダ又は共通ヘッダ）から取得されるフレームカウント情報に基づいて、当該ＲＴＰパケットと他のＲＴＰパケットとの間でエッセンスデータの時間合わせを行う。こうした時間合わせのための受信ストリーム処理部１４０のより詳細な構成の一例について、後にさらに説明する。 (7) Received Stream Processing Unit The received stream processing unit 140 processes a series of RTP packets of the broadcast signal stream received from other nodes via the receiving unit 124 to restore video data, audio data or auxiliary data. do. Then, reception stream processing section 140 outputs the restored data sequence to data processing section 180 . In particular, in this embodiment, the received stream processing unit 140 uses the RTP timestamp obtained from the RTP header of the received RTP packet, or the header (e.g., essence header or common Based on the frame count information obtained from the header), time alignment of essence data is performed between the RTP packet and other RTP packets. An example of a more detailed configuration of the received stream processing unit 140 for such time adjustment will be further described later.

（８）データ処理部
データ処理部１８０は、映像データ、音声データ又は補助データを生成して、生成したデータのシーケンスを送信ストリーム処理部１３０へ出力する。データ処理部１８０は、例えば、図示しないデータソースから入力される映像データを圧縮して、圧縮済みの映像データを生成してもよい。また、データ処理部１８０は、受信ストリーム処理部１４０により復元される映像データ、音声データ又は補助データを処理する。データ処理部１８０は、例えば、受信ストリーム処理部１４０により復元され、互いに時間合わせされたエッセンスデータに基づいて、複数のエッセンス（例えば、映像エッセンス、音声エッセンス及び補助データエッセンスのうちの２つ以上、又は複数の映像エッセンスなど）を同期的に再生してもよい。また、データ処理部１８０は、受信ストリーム処理部１４０により復元される映像データ、音声データ又は補助データを所定のファイルフォーマットで記録媒体（図示せず）に記録してもよい。また、データ処理部１８０は、受信ストリーム処理部１４０から入力される映像データが圧縮済みの映像データである場合には、当該圧縮済みの映像データを逆圧縮して、もとの映像データを復元してもよい。 (8) Data Processing Unit The data processing unit 180 generates video data, audio data, or auxiliary data, and outputs the generated data sequence to the transmission stream processing unit 130 . The data processing unit 180 may, for example, compress video data input from a data source (not shown) to generate compressed video data. Also, the data processing unit 180 processes video data, audio data, or auxiliary data restored by the reception stream processing unit 140 . The data processing unit 180 extracts a plurality of essences (for example, two or more of a video essence, an audio essence and an auxiliary data essence, or multiple video essences) may be played back synchronously. Also, the data processing unit 180 may record the video data, audio data, or auxiliary data restored by the reception stream processing unit 140 in a recording medium (not shown) in a predetermined file format. Further, when the video data input from the reception stream processing unit 140 is compressed video data, the data processing unit 180 decompresses the compressed video data to restore the original video data. You may

（９）制御部
制御部１９０は、放送信号処理ノード１００の上述した動作の全般を制御する。制御部１９０は、例えば、ＡＰＳ４０又は制御端末５０といった外部装置から受信される指示に応じて、放送信号ストリームを送信部１２２から送信させ、又は放送信号ストリームを受信部１２４により受信させる。放送信号処理ノード１００が複数の伝送プロトコルをサポートする場合には、制御部１９０は、例えば外部装置からの指示において選択される伝送プロトコルのための動作を、各処理部に実行させる。制御部１９０は、通信部１２０を介して他のノードへ、放送信号処理ノード１００の機能性（例えば、サポートするプロトコル、送信／受信可能なエッセンスのタイプ、クロック周波数及びその他の属性）を通知してもよい。 (9) Control Unit The control unit 190 controls the overall operations of the broadcast signal processing node 100 described above. The control unit 190 causes the transmission unit 122 to transmit the broadcast signal stream or causes the reception unit 124 to receive the broadcast signal stream in accordance with instructions received from an external device such as the APS 40 or the control terminal 50 . If the broadcast signal processing node 100 supports a plurality of transmission protocols, the control unit 190 causes each processing unit to perform operations for the transmission protocol selected by instructions from the external device, for example. The control unit 190 notifies other nodes via the communication unit 120 of the functionality of the broadcast signal processing node 100 (for example, the supported protocol, the type of essence that can be transmitted/received, the clock frequency and other attributes). may

本明細書で説明するいくつかの実施形態において、ＲＴＰストリームである放送信号ストリームの属性は、ＳＤＰオブジェクトに記述される。ＳＭＰＴＥＳＴ２１１０－１０によれば、１つ以上のセンダを含むデバイスは、ＲＴＰストリームごとに１つのＳＤＰオブジェクトを構築するものとされている。本実施形態においても、センダとしての役割を有する送信ストリーム処理部１３０及び送信部１２２により送信可能なストリームの属性を記述したＳＤＰオブジェクトが、放送信号処理ノード１００の記憶部（図示せず）に予め記憶され得る。そして、制御部１９０は、ストリームの送信の開始に先立って、予め定義される管理用のアプリケーションプロトコルインタフェース（例えば、ＮＭＯＳＡＰＩ）を介して当該ＳＤＰオブジェクトを外部装置へ提供する。一方、制御部１９０は、受信部１２４及び受信ストリーム処理部１４０に他の送信ノードから放送信号ストリームを受信させる場合、当該送信ノードにより提供されるＳＤＰオブジェクトの記述に従って、受信される放送信号ストリームを正しく処理できるように受信側の処理を構成する。なお、ＡＲＩＢＳＴＤ－Ｂ７３ストリームの特性に適したＳＤＰオブジェクトのフォーマットについて、後述する第３の実施形態及び第４の実施形態で詳しく説明する。 In some embodiments described herein, attributes of broadcast signal streams that are RTP streams are described in SDP objects. According to SMPTE ST2110-10, a device containing one or more senders shall construct one SDP object per RTP stream. Also in this embodiment, an SDP object describing attributes of a stream that can be transmitted by the transmission stream processing unit 130 and the transmission unit 122, which serve as senders, is stored in advance in the storage unit (not shown) of the broadcast signal processing node 100. can be stored. Then, prior to starting stream transmission, the control unit 190 provides the SDP object to the external device via a predefined management application protocol interface (for example, NMOS API). On the other hand, when the control unit 190 causes the reception unit 124 and the reception stream processing unit 140 to receive a broadcast signal stream from another transmission node, the control unit 190 processes the received broadcast signal stream according to the description of the SDP object provided by the transmission node. Configure receiver processing for correct processing. The format of the SDP object suitable for the characteristics of the ARIB STD-B73 stream will be described in detail in third and fourth embodiments, which will be described later.

＜２－２．受信ストリーム処理部の詳細な構成例＞
図８は、図７に示した受信ストリーム処理部１４０の詳細な構成の一例を示すブロック図である。図８を参照すると、受信ストリーム処理部１４０は、トランスポート処理部１４２、エッセンスデータグラム処理部１５０及びＦＥＣデータグラム処理部１７０を含む。 <2-2. Detailed Configuration Example of Reception Stream Processing Unit>
FIG. 8 is a block diagram showing an example of a detailed configuration of reception stream processing section 140 shown in FIG. Referring to FIG. 8 , the received stream processor 140 includes a transport processor 142 , an essence datagram processor 150 and an FEC datagram processor 170 .

（１）トランスポート処理部
トランスポート処理部１４２は、トランスポートヘッダ（ＴＨ）除去部１４４を含む。ＴＨ除去部１４４は、受信部１２４から入力される放送信号ストリームのＲＴＰパケットの先頭のＲＴＰヘッダ及び共通ヘッダ（即ち、トランスポートヘッダ）を除去する。そして、ＴＨ除去部１４４は、共通ヘッダ内の「データグラムタイプ」の値に依存して、当該ＲＴＰパケットのデータグラムを、エッセンスデータグラム処理部１５０又はＦＥＣデータグラム処理部１７０へ分配する。例えば、「データグラムタイプ」の値がエッセンスデータグラムを示す場合には、当該エッセンスデータグラムがエッセンスデータグラム処理部１５０へ出力される。一方、「データグラムタイプ」の値がＦＥＣデータグラムを示す場合には、当該ＦＥＣデータグラムがＦＥＣデータグラム処理部１７０へ出力される。 (1) Transport Processing Section The transport processing section 142 includes a transport header (TH) removing section 144 . The TH removing unit 144 removes the RTP header and the common header (that is, transport header) at the beginning of the RTP packets of the broadcast signal stream input from the receiving unit 124 . Then, the TH removal unit 144 distributes the datagram of the RTP packet to the essence datagram processing unit 150 or the FEC datagram processing unit 170 depending on the value of "datagram type" in the common header. For example, when the value of “datagram type” indicates an essence datagram, the essence datagram is output to the essence datagram processing unit 150 . On the other hand, when the value of “datagram type” indicates an FEC datagram, the FEC datagram is output to the FEC datagram processing unit 170 .

トランスポート処理部１４２からエッセンスデータグラム処理部１５０又はＦＥＣデータグラム処理部１７０へのデータグラムの出力は、ＲＴＰヘッダ内の「シーケンス番号」の順に行われ得る。欠落したシーケンス番号に対応するデータグラムの処理は、スキップされてよい。トランスポート処理部１４２は、エッセンスデータグラムに対応するＲＴＰヘッダ内のＲＴＰタイムスタンプの値を、アラインメント部１６０へ出力する。 Datagrams are output from the transport processing unit 142 to the essence datagram processing unit 150 or the FEC datagram processing unit 170 in the order of the "sequence number" in the RTP header. Processing of datagrams corresponding to missing sequence numbers may be skipped. The transport processing unit 142 outputs the value of the RTP timestamp in the RTP header corresponding to the essence datagram to the alignment unit 160 .

（２）エッセンスデータグラム処理部
エッセンスデータグラム処理部１５０は、エッセンスヘッダ（ＥＨ）除去部１５２、映像エッセンス処理部１５４、音声エッセンス処理部１５６、補助データエッセンス処理部１５８、及びアラインメント部１６０を含む。 (2) Essence Datagram Processing Unit The essence datagram processing unit 150 includes an essence header (EH) removal unit 152, a video essence processing unit 154, an audio essence processing unit 156, an auxiliary data essence processing unit 158, and an alignment unit 160. .

ＥＨ除去部１５２は、トランスポート処理部１４２から入力されるエッセンスデータグラムの先頭のエッセンスヘッダを除去する。そして、ＥＨ除去部１５２は、エッセンスヘッダ内の「ペイロードタイプ」の値に依存して、当該エッセンスデータグラムのエッセンスペイロードを、映像エッセンス処理部１５４、音声エッセンス処理部１５６、又は補助データエッセンス処理部１５８へ分配する。例えば、「ペイロードタイプ」の値が映像エッセンスを示す場合には、当該映像エッセンスが映像エッセンス処理部１５４へ出力される。「ペイロードタイプ」の値が音声エッセンスを示す場合には、当該音声エッセンスが音声エッセンス処理部１５６へ出力される。「ペイロードタイプ」の値が補助データエッセンスを示す場合には、当該補助データエッセンスが補助データエッセンス処理部１５８へ出力される。また、ＥＨ除去部１５２は、エッセンスヘッダ内の「フレームカウント」の値をアラインメント部１６０へ出力する。 The EH remover 152 removes the essence header at the beginning of the essence datagram input from the transport processor 142 . Then, the EH removal unit 152 converts the essence payload of the essence datagram into a video essence processing unit 154, an audio essence processing unit 156, or an ancillary data essence processing unit, depending on the value of "payload type" in the essence header. 158. For example, when the value of “payload type” indicates video essence, the video essence is output to video essence processing section 154 . When the value of “payload type” indicates voice essence, the voice essence is output to voice essence processing section 156 . When the value of “payload type” indicates an auxiliary data essence, the auxiliary data essence is output to auxiliary data essence processing section 158 . The EH removal section 152 also outputs the value of “frame count” in the essence header to the alignment section 160 .

映像エッセンス処理部１５４は、ＥＨ除去部１５２から順次入力される映像エッセンスを処理して、映像データを復元する。例えば、映像エッセンス処理部１５４は、ＡＲＩＢＳＴＤ－Ｂ７３ストリームが受信された場合には、ＡＲＩＢＳＴＤ－Ｂ７３により規定された映像ペイロードのパッキング形式に従って映像エッセンスを逆パッキングして、映像データを復元し得る。映像エッセンスが圧縮済みの映像データを含む場合には、映像エッセンス処理部１５４は、映像エッセンスを逆圧縮して映像データを復元してもよい（逆圧縮は、上述したようにデータ処理部１８０により行われてもよい）。 The image essence processor 154 processes the image essence sequentially input from the EH remover 152 to restore image data. For example, when an ARIB STD-B73 stream is received, the video essence processing unit 154 can restore the video data by depacking the video essence according to the video payload packing format defined by ARIB STD-B73. . If the video essence includes compressed video data, the video essence processing unit 154 may decompress the video essence to restore the video data (the decompression is performed by the data processing unit 180 as described above). may be done).

音声エッセンス処理部１５６は、ＥＨ除去部１５２から順次入力される音声エッセンスを処理して、音声データを復元する。例えば、音声エッセンス処理部１５６は、ＡＲＩＢＳＴＤ－Ｂ７３ストリームが受信された場合には、ＡＲＩＢＳＴＤ－Ｂ７３により規定された音声ペイロードのパッキング形式に従って音声エッセンスを逆パッキングして、音声データを復元し得る。 The voice essence processing unit 156 processes the voice essence sequentially input from the EH removal unit 152 to restore voice data. For example, when an ARIB STD-B73 stream is received, the audio essence processing unit 156 can depack the audio essence according to the audio payload packing format defined by ARIB STD-B73 to restore the audio data. .

補助データエッセンス処理部１５８は、ＥＨ除去部１５２から順次入力される補助データエッセンスを処理して、補助データを復元する。例えば、補助データエッセンス処理部１５８は、ＡＲＩＢＳＴＤ－Ｂ７３ストリームが受信された場合には、ＡＲＩＢＳＴＤ－Ｂ７３により規定された補助データペイロードのパッキング形式に従って補助データエッセンスを逆パッキングして、補助データを復元し得る。 The auxiliary data essence processor 158 processes the auxiliary data essence sequentially input from the EH remover 152 to restore the auxiliary data. For example, when the ARIB STD-B73 stream is received, the auxiliary data essence processing unit 158 reverse-packs the auxiliary data essence according to the auxiliary data payload packing format defined by ARIB STD-B73, and converts the auxiliary data to can be restored.

アラインメント部１６０は、各ＲＴＰパケットのＲＴＰヘッダからトランスポート処理部１４２において取得されるＲＴＰタイムスタンプ、又は、各ＲＴＰパケットのＲＴＰペイロード内のヘッダから取得されるフレームカウント情報に基づいて、エッセンスデータ間の時間合わせを行う。 The alignment unit 160 aligns the essence data based on the RTP timestamp acquired by the transport processing unit 142 from the RTP header of each RTP packet or the frame count information acquired from the header in the RTP payload of each RTP packet. time adjustment.

本実施形態において、少なくとも第１のＲＴＰパケットは、エッセンス混在型ストリームのためのプロトコルに従って生成されるパケットである。エッセンス混在型ストリームのためのプロトコルは、例えば、ＡＲＩＢＳＴＤ－Ｂ７３であってよい。ＡＲＩＢＳＴＤ－Ｂ７３が利用される場合、エッセンス混在型ストリームのＲＴＰパケットの各々は、ブランキング期間に相当するデータを含まず、映像データ、音声データ及び補助データのうちのいずれかに相当するエッセンスデータをＲＴＰペイロードに含む。 In this embodiment, at least the first RTP packets are packets generated according to the protocol for mixed-essence streams. The protocol for mixed-essence streams may be, for example, ARIB STD-B73. When ARIB STD-B73 is used, each RTP packet of the essence-mixed stream does not contain data corresponding to the blanking period, and essence data corresponding to any one of video data, audio data and ancillary data. in the RTP payload.

図９Ａは、２つのＲＴＰパケットの間のエッセンスデータの時間合わせのための手法の第１の例について説明するための説明図である。図９Ａには、第１のＲＴＰパケット１６１ａ及び第２のＲＴＰパケット１６６ａが示されている。第１のＲＴＰパケット１６１ａ及び第２のＲＴＰパケット１６６ａは、共にＡＲＩＢＳＴＤ－Ｂ７３に従って生成されたものとする。第１のＲＴＰパケット１６１ａ及び第２のＲＴＰパケット１６６ａは、単一のポートＰ_Ｍを介して受信される。 FIG. 9A is an explanatory diagram for explaining a first example of a technique for time alignment of essence data between two RTP packets. FIG. 9A shows a first RTP packet 161a and a second RTP packet 166a. It is assumed that both the first RTP packet 161a and the second RTP packet 166a are generated according to ARIB STD-B73. A first RTP packet 161a and a second RTP packet 166a are received via a single port _PM .

第１のＲＴＰパケット１６１ａは、ＲＴＰヘッダ１６２ａ内に「タイムスタンプ」を含み、エッセンスヘッダ１６３ａ内に「ペイロードタイプ」及び「フレームカウント」を含む。エッセンスヘッダ１６３ａの「ペイロードタイプ」の値は対応するエッセンスペイロードが映像エッセンスを含むことを示し、「フレームカウント」は値Ｆ１を示す。値Ｆ１は、例えば、ＳＭＰＴＥＳＴ２０５９－１で定義されているエポックの時刻をゼロとした場合の、映像エッセンスが属する映像フレームのキャプチャ時刻に対応する。 The first RTP packet 161a contains a "timestamp" in the RTP header 162a and a "payload type" and "frame count" in the essence header 163a. The "payload type" value of the essence header 163a indicates that the corresponding essence payload contains video essence, and the "frame count" indicates the value F1. The value F1 corresponds to the capture time of the video frame to which the video essence belongs, for example, when the epoch time defined in SMPTE ST2059-1 is zero.

第２のＲＴＰパケット１６６ａは、ＲＴＰヘッダ１６７ａ内に「タイムスタンプ」を含み、エッセンスヘッダ１６８ａ内に「ペイロードタイプ」及び「フレームカウント」を含む。エッセンスヘッダ１６８ａの「ペイロードタイプ」の値は対応するエッセンスペイロードが音声エッセンスを含むことを示し、「フレームカウント」は値Ｆ１を示す。値Ｆ１は、例えば、ＳＭＰＴＥＳＴ２０５９－１で定義されているエポックの時刻をゼロとした場合の、音声エッセンスが属する映像フレームのキャプチャ時刻に対応する。 The second RTP packet 166a includes a "timestamp" in the RTP header 167a and a "payload type" and "frame count" in the essence header 168a. The "payload type" value of the essence header 168a indicates that the corresponding essence payload contains audio essence, and the "frame count" indicates the value F1. The value F1 corresponds to the capture time of the video frame to which the audio essence belongs, for example, when the epoch time defined by SMPTE ST2059-1 is zero.

図９Ａに示した第１の例において、アラインメント部１６０は、これらＲＴＰパケット１６１ａ、１６６ａのエッセンスヘッダ１６３ａ、１６８ａから取得されるフレームカウント情報に基づいて、ＲＴＰパケット１６１ａとＲＴＰパケット１６６ａとの間でエッセンスデータの時間合わせを行う。このようにエッセンスヘッダ内のフレームカウント情報のみに基づいて時間合わせが行われる場合、トランスポート処理部１４２からアラインメント部１６０へタイムスタンプ情報を出力することが不要となり、プロトコルレイヤをまたいだ情報の参照が抑制されることから、受信ストリーム処理部１４０の構成を簡略化することができる。 In the first example shown in FIG. 9A, the alignment unit 160 aligns the RTP packets 161a and 166a between the RTP packets 161a and 166a based on the frame count information obtained from the essence headers 163a and 168a of these RTP packets 161a and 166a. Adjust the time of the essence data. In this way, when time alignment is performed based only on the frame count information in the essence header, it becomes unnecessary to output time stamp information from the transport processing unit 142 to the alignment unit 160, and reference to information across protocol layers becomes unnecessary. is suppressed, the configuration of the reception stream processing unit 140 can be simplified.

図９Ｂは、２つのＲＴＰパケットの間のエッセンスデータの時間合わせのための手法の第２の例について説明するための説明図である。図９Ｂには、第３のＲＴＰパケット１６１ｂ及び第４のＲＴＰパケット１６６ｂが示されている。第３のＲＴＰパケット１６１ｂ及び第４のＲＴＰパケット１６６ｂは、共にＡＲＩＢＳＴＤ－Ｂ７３に従って生成されたものとする。第３のＲＴＰパケット１６１ｂ及び第４のＲＴＰパケット１６６ｂは、単一のポートＰ_Ｍを介して受信される。 FIG. 9B is an explanatory diagram for explaining a second example of a technique for time alignment of essence data between two RTP packets. FIG. 9B shows a third RTP packet 161b and a fourth RTP packet 166b. It is assumed that both the third RTP packet 161b and the fourth RTP packet 166b are generated according to ARIB STD-B73. A third RTP packet 161b and a fourth RTP packet 166b are received via a single port _PM .

第３のＲＴＰパケット１６１ｂは、ＲＴＰヘッダ１６２ｂ内に「タイムスタンプ」を含み、エッセンスヘッダ１６３ｂ内に「ペイロードタイプ」及び「フレームカウント」を含む。ＲＴＰヘッダ１６２ｂの「タイムスタンプ」は、値Ｔ２を示す。エッセンスヘッダ１６３ｂの「ペイロードタイプ」の値は、対応するエッセンスペイロードが映像エッセンスを含むことを示す。値Ｔ２は、例えば、映像エッセンスが属する映像フレームのキャプチャ時刻に対応する。 The third RTP packet 161b contains a "timestamp" in the RTP header 162b and a "payload type" and "frame count" in the essence header 163b. The "timestamp" of the RTP header 162b indicates the value T2. The "payload type" value of the essence header 163b indicates that the corresponding essence payload contains video essence. The value T2 corresponds, for example, to the capture time of the video frame to which the video essence belongs.

第４のＲＴＰパケット１６６ｂは、ＲＴＰヘッダ１６７ｂ内に「タイムスタンプ」を含み、エッセンスヘッダ１６８ｂ内に「ペイロードタイプ」及び「フレームカウント」を含む。ＲＴＰヘッダ１６７ｂの「タイムスタンプ」は、値Ｔ２を示す。エッセンスヘッダ１６８ｂの「ペイロードタイプ」の値は、対応するエッセンスペイロードが音声エッセンスを含むことを示す。値Ｔ２は、例えば、音声エッセンスが属する映像フレームのキャプチャ時刻に対応する。 The fourth RTP packet 166b includes a "timestamp" in the RTP header 167b and a "payload type" and "frame count" in the essence header 168b. The "timestamp" of the RTP header 167b indicates the value T2. The "payload type" value of essence header 168b indicates that the corresponding essence payload contains audio essence. The value T2 corresponds, for example, to the capture time of the video frame to which the audio essence belongs.

図９Ｂに示した第２の例において、アラインメント部１６０は、これらＲＴＰパケット１６１ｂ、１６６ｂのＲＴＰヘッダ１６２ｂ、１６７ｂから取得されるＲＴＰタイムスタンプに基づいて、ＲＴＰパケット１６１ｂとＲＴＰパケット１６６ｂとの間でエッセンスデータの時間合わせを行う。このようにＲＴＰタイムスタンプに基づいて時間合わせが行われる場合、同じくＲＴＰタイムスタンプを利用するＳＭＰＴＥＳＴ２１１０－１０のシステムタイミングモデルのための実装を再利用して、アラインメント部１６０を簡易に構成することができる。 In the second example shown in FIG. 9B, the alignment unit 160 aligns the RTP packets 161b and 166b between the RTP packets 161b and 166b based on the RTP timestamps obtained from the RTP headers 162b and 167b of these RTP packets 161b and 166b. Adjust the time of the essence data. When time alignment is performed based on RTP timestamps in this way, the alignment unit 160 can be configured simply by reusing the implementation for the system timing model of SMPTE ST2110-10, which also uses RTP timestamps. can be done.

図９Ｃは、２つのＲＴＰパケットの間のエッセンスデータの時間合わせのための手法の第３の例について説明するための説明図である。図９Ｃには、第５のＲＴＰパケット１６１ｃ及び第６のＲＴＰパケット１６６ｃが示されている。第５のＲＴＰパケット１６１ｃはＡＲＩＢＳＴＤ－Ｂ７３に従って生成され、第６のＲＴＰパケット１６６ｃはＳＭＰＴＥＳＴ２１１０－３０に従って生成されたものとする。例えば、第５のＲＴＰパケット１６１ｃはポートＰ_Ｍを介して受信され、第６のＲＴＰパケット１６６ｃは音声エッセンスを含むエッセンス分離型ストリームのためのポートＰ_Ａを介して受信される。 FIG. 9C is an explanatory diagram for explaining a third example of a technique for time alignment of essence data between two RTP packets. FIG. 9C shows a fifth RTP packet 161c and a sixth RTP packet 166c. Assume that the fifth RTP packet 161c is generated according to ARIB STD-B73 and the sixth RTP packet 166c is generated according to SMPTE ST2110-30. For example, the fifth RTP packet 161c is received via port _PM and the sixth RTP packet 166c is received via port _PA for essence separated streams containing audio essence.

第５のＲＴＰパケット１６１ｃは、ＲＴＰヘッダ１６２ｃ内に「タイムスタンプ」を含み、エッセンスヘッダ１６３ｃ内に「ペイロードタイプ」及び「フレームカウント」を含む。エッセンスヘッダ１６３ｃの「ペイロードタイプ」の値は対応するエッセンスペイロードが映像エッセンスを含むことを示し、「フレームカウント」は値Ｆ３を示す。値Ｆ３は、映像エッセンスが属する映像フレームのキャプチャ時刻に対応する。 The fifth RTP packet 161c contains a 'timestamp' in the RTP header 162c and a 'payload type' and a 'frame count' in the essence header 163c. The "payload type" value of the essence header 163c indicates that the corresponding essence payload contains video essence, and the "frame count" indicates the value F3. The value F3 corresponds to the capture time of the video frame to which the video essence belongs.

第６のＲＴＰパケット１６６ｃは、ＲＴＰヘッダ１６７ｃ内に「タイムスタンプ」を含む。ＲＴＰヘッダ１６７ｃの「タイムスタンプ」は、値Ｔ３を示す。値Ｔ３は、例えば、音声エッセンス内の音声データの最も早いサンプリング時刻に対応する。 The sixth RTP packet 166c includes a "timestamp" in the RTP header 167c. The "timestamp" of the RTP header 167c indicates the value T3. The value T3 corresponds, for example, to the earliest sampling time of the audio data within the audio essence.

図９Ｃに示した第３の例において、アラインメント部１６０は、第５のＲＴＰパケット１６１ｃのエッセンスヘッダ１６３ｃから取得されるフレームカウント情報、及び、第６のＲＴＰパケット１６６ｃのＲＴＰヘッダ１６７ｃから取得されるＲＴＰタイムスタンプに基づいて、ＲＴＰパケット１６１ｃとＲＴＰパケット１６６ｃとの間でエッセンスデータの時間合わせを行う。例えば、アラインメント部１６０は、値Ｔ３に対応するサンプリング時刻が値Ｆ３に対応する映像フレームのフレーム期間に含まれる場合には、第６のＲＴＰパケット１６６ｃから復元される音声データが第５のＲＴＰパケット１６１ｃから復元される映像データと同じ映像フレームへ同期されるべきであると判定し得る。このような手法によれば、ＡＲＩＢＳＴＤ－Ｂ７３ストリームのＲＴＰパケットのＲＴＰタイムスタンプに実効的なタイムスタンプ値が設定されない場合にも、ＡＲＩＢＳＴＤ－Ｂ７３ストリームとＳＭＰＴＥＳＴ２１１０ストリームとの間で適切に時間合わせを行うことができる。 In the third example shown in FIG. 9C, the alignment unit 160 uses the frame count information obtained from the essence header 163c of the fifth RTP packet 161c and the RTP header 167c of the sixth RTP packet 166c. Based on the RTP timestamp, time alignment of essence data is performed between the RTP packet 161c and the RTP packet 166c. For example, when the sampling time corresponding to the value T3 is included in the frame period of the video frame corresponding to the value F3, the alignment unit 160 transfers the audio data restored from the sixth RTP packet 166c to the fifth RTP packet. 161c should be synchronized to the same video frame as the video data recovered from 161c. According to such a method, even if an effective timestamp value is not set for the RTP timestamp of the RTP packet of the ARIB STD-B73 stream, the time can be properly set between the ARIB STD-B73 stream and the SMPTE ST2110 stream. Alignment can be done.

図９Ｄは、２つのＲＴＰパケットの間のエッセンスデータの時間合わせのための手法の第４の例について説明するための説明図である。図９Ｄには、第７のＲＴＰパケット１６１ｄ及び第８のＲＴＰパケット１６６ｄが示されている。第７のＲＴＰパケット１６１ｄはＡＲＩＢＳＴＤ－Ｂ７３に従って生成され、第８のＲＴＰパケット１６６ｄはＳＭＰＴＥＳＴ２１１０－２０に従って生成されたものとする。例えば、第７のＲＴＰパケット１６１ｄはポートＰ_Ｍを介して受信され、第８のＲＴＰパケット１６６ｄは映像エッセンスを含むエッセンス分離型ストリームのためのポートＰ_Ｖを介して受信される。 FIG. 9D is an explanatory diagram for explaining a fourth example of a technique for time alignment of essence data between two RTP packets. FIG. 9D shows a seventh RTP packet 161d and an eighth RTP packet 166d. It is assumed that the seventh RTP packet 161d is generated according to ARIB STD-B73 and the eighth RTP packet 166d is generated according to SMPTE ST2110-20. For example, the seventh RTP packet 161d is received via port _PM and the eighth RTP packet 166d is received via port _PV for essence separated streams containing video essence.

第７のＲＴＰパケット１６１ｄは、ＲＴＰヘッダ１６２ｄ内に「タイムスタンプ」を含み、エッセンスヘッダ１６３ｄ内に「ペイロードタイプ」及び「フレームカウント」を含む。ＲＴＰヘッダ１６２ｄの「タイムスタンプ」は、値Ｔ４を示す。エッセンスヘッダ１６３ｄの「ペイロードタイプ」の値は、対応するエッセンスペイロードが音声エッセンスを含むことを示す。値Ｔ４は、例えば、音声エッセンスが属する映像フレームのキャプチャ時刻に対応する。 The seventh RTP packet 161d contains a "timestamp" in the RTP header 162d and a "payload type" and "frame count" in the essence header 163d. The "timestamp" of the RTP header 162d indicates the value T4. The "payload type" value of essence header 163d indicates that the corresponding essence payload contains audio essence. The value T4 corresponds, for example, to the capture time of the video frame to which the audio essence belongs.

第８のＲＴＰパケット１６６ｄは、ＲＴＰヘッダ１６７ｄ内に「タイムスタンプ」を含む。ＲＴＰヘッダ１６７ｄの「タイムスタンプ」は、値Ｔ４´を示す。値Ｔ４´は、例えば、映像エッセンスが属する映像フレームのキャプチャ時刻（インターレース方式の第２フィールドの場合には、フレーム期間の半分だけオフセットされた時刻）に対応する。 The eighth RTP packet 166d includes a "timestamp" in the RTP header 167d. The "timestamp" of the RTP header 167d indicates the value T4'. The value T4' corresponds, for example, to the capture time of the video frame to which the video essence belongs (in the case of an interlaced second field, the time offset by half the frame period).

図９Ｄに示した第７の例において、アラインメント部１６０は、第７のＲＴＰパケット１６１ｄのＲＴＰヘッダ１６２ｄから取得されるＲＴＰタイムスタンプ、及び、第８のＲＴＰパケット１６６ｄのＲＴＰヘッダ１６７ｄから取得されるＲＴＰタイムスタンプに基づいて、ＲＴＰパケット１６１ｄとＲＴＰパケット１６６ｄとの間でエッセンスデータの時間合わせを行う。このような手法によれば、ＲＴＰタイムスタンプを利用するＳＭＰＴＥＳＴ２１１０－１０のシステムタイミングモデルのための実装をＡＲＩＢＳＴＤ－Ｂ７３ストリームの処理のために再利用して、アラインメント部１６０を簡易に構成することができる。 In the seventh example shown in FIG. 9D, the alignment unit 160 uses the RTP timestamp obtained from the RTP header 162d of the seventh RTP packet 161d and the RTP timestamp obtained from the RTP header 167d of the eighth RTP packet 166d. Based on the RTP timestamp, the essence data is time aligned between the RTP packet 161d and the RTP packet 166d. According to such an approach, the implementation for the system timing model of SMPTE ST2110-10, which utilizes RTP timestamps, is reused for the processing of ARIB STD-B73 streams to simplify the configuration of alignment section 160. be able to.

（３）ＦＥＣデータグラム処理部
上述したように、ＳＭＰＴＥＳＴ２１１０シリーズでは、誤り訂正処理は行われない。一方、ＡＲＩＢＳＴＤ－Ｂ７３は、ＲＳ（Reed-Solomon）ベースの手法又はＸＯＲベースの手法でのＦＥＣをサポートしている。ＦＥＣデータグラム処理部１７０は、この誤り訂正処理を担当する。即ち、ＦＥＣデータグラム処理部１７０は、トランスポート処理部１４２から入力されるＦＥＣデータグラムについて誤り訂正処理を実行する。 (3) FEC Datagram Processing Unit As described above, the SMPTEST ST2110 series does not perform error correction processing. On the other hand, ARIB STD-B73 supports FEC with RS (Reed-Solomon)-based technique or XOR-based technique. The FEC datagram processing unit 170 takes charge of this error correction processing. That is, the FEC datagram processing unit 170 performs error correction processing on the FEC datagrams input from the transport processing unit 142 .

例えば、ＦＥＣデータグラム処理部１７０は、誤り訂正方式としてＲＳベースの手法が選択される場合には、制御部１９０により設定されるデータグラム数（ｎ，ｋ）をＲＳ復号の処理単位として、ＲＳ復号を実行する。ここで、ｎは、ＦＥＣ演算の対象とされるエッセンスデータグラム数と、対応するＦＥＣデータグラム数との合計を表す。ｋは、当該エッセンスデータグラム数を表す。また、ＦＥＣデータグラム処理部１７０は、誤り訂正方式としてＸＯＲベースの手法が選択される場合には、制御部１９０により設定されるサイズ（Ｌ，Ｄ）のＦＥＣブロックを処理単位として、ＸＯＲベースの復号を実行する。ここで、Ｌは、行方向のエッセンスデータグラム数を表し、Ｄは列方向のエッセンスデータグラム数を表す。 For example, when the RS-based method is selected as the error correction method, the FEC datagram processing unit 170 uses the number of datagrams (n, k) set by the control unit 190 as the processing unit of RS decoding. Perform decryption. Here, n represents the sum of the number of essence datagrams to be subjected to FEC calculation and the number of corresponding FEC datagrams. k represents the number of essence datagrams concerned. In addition, when the XOR-based method is selected as the error correction method, the FEC datagram processing unit 170 performs XOR-based processing on an FEC block of size (L, D) set by the control unit 190 as a processing unit. Perform decryption. Here, L represents the number of essence datagrams in the row direction and D represents the number of essence datagrams in the column direction.

＜２－３．処理の流れ＞
次に、図１０及び図１１を用いて、第１の実施形態において実行され得る主な処理の流れについて説明する。 <2-3. Process Flow>
Next, the flow of main processing that can be executed in the first embodiment will be described with reference to FIGS. 10 and 11. FIG.

（１）ストリーム送信処理
図１０は、第１の実施形態に係るストリーム送信処理の流れの一例を示すフローチャートである。ここでは、ストリーム送信処理が放送信号処理ノード１００により実行されるものとして説明するが、ストリーム送信処理は、上述したセンダ６０の機能を有するいかなるノードにより実行されてもよい。 (1) Stream Transmission Processing FIG. 10 is a flowchart showing an example of the flow of stream transmission processing according to the first embodiment. Here, the stream transmission processing is described as being performed by the broadcast signal processing node 100, but the stream transmission processing may be performed by any node having the functions of the sender 60 described above.

まず、ＰＴＰ処理部１１６は、デバイス内部クロック１１０をＰＴＰの時刻源に直接的に又は間接的に同期させる（ステップＳ１０１）。ＰＴＰの時刻源とのデバイス内部クロック１１０の同期は、この後も継続的に維持される。 First, the PTP processing unit 116 directly or indirectly synchronizes the device internal clock 110 with the PTP time source (step S101). Synchronization of the device internal clock 110 with the PTP's time source continues to be maintained thereafter.

送信ストリーム処理部１３０は、例えば制御部１９０による制御の下で、データ処理部１８０から入力されるエッセンスデータからエッセンスペイロードを生成する（ステップＳ１０３）。エッセンスデータは、映像データ、音声データ又は補助データのいずれかを含む。エッセンスペイロードは、例えば、ＡＲＩＢＳＴＤ－Ｂ７３により規定された映像データ、音声データ又は補助データのパッキング形式に従ってエッセンスデータをパッキングすることにより生成され得る。 The transmission stream processing unit 130 generates an essence payload from the essence data input from the data processing unit 180 under the control of the control unit 190, for example (step S103). Essence data includes either video data, audio data or auxiliary data. The essence payload can be generated by packing essence data according to the packing format for video data, audio data or auxiliary data defined by ARIB STD-B73, for example.

次いで、送信ストリーム処理部１３０は、デバイス内部クロック１１０にロックされたメディアクロック１１２に対しオフセットを有しないＲＴＰクロック１１４に従って、フレームカウント値（及び／又はＲＴＰタイムスタンプ）を算出する（ステップＳ１０５）。フレームカウント値は、同じ映像フレームに属するエッセンスデータが処理されている間は、同じ値に維持される。 Next, the transmission stream processing unit 130 calculates a frame count value (and/or RTP timestamp) according to the RTP clock 114 that has no offset with respect to the media clock 112 locked to the device internal clock 110 (step S105). The frame count value remains the same while essence data belonging to the same video frame is being processed.

次いで、送信ストリーム処理部１３０は、フレームカウント値を含むエッセンスヘッダをエッセンスペイロードの先頭に追加して、エッセンスデータグラムを生成する（ステップＳ１０７）。エッセンスヘッダは、エッセンスペイロードに含まれるエッセンスのタイプを示す「ペイロードタイプ」をも含む。 Next, the transmission stream processing unit 130 adds an essence header including the frame count value to the beginning of the essence payload to generate an essence datagram (step S107). The essence header also contains a "payload type" that indicates the type of essence contained in the essence payload.

次いで、送信ストリーム処理部１３０は、ＲＴＰクロック１１４に従った値を有するＲＴＰタイムスタンプを含むトランスポートヘッダをエッセンスデータグラムに追加して、ＲＴＰパケットを生成する（ステップＳ１０９）。送信ストリーム処理部１３０は、さらに誤り訂正符号化処理を実行して、ＦＥＣデータグラムを含むＲＴＰパケットを生成してもよい。 Next, the transmission stream processing unit 130 adds a transport header including an RTP timestamp having a value according to the RTP clock 114 to the essence datagram to generate an RTP packet (step S109). The transmission stream processing unit 130 may further perform error correction coding processing to generate RTP packets containing FEC datagrams.

次いで、送信部１２２は、送信ストリーム処理部１３０により生成されたＲＴＰパケットにネットワーク（ＮＷ）ヘッダを追加して、ＩＰパケットを生成する（ステップＳ１１１）。ＩＰパケットのＵＤＰヘッダ内のポート番号は、包含されるエッセンスのタイプに関わらず、ＡＲＩＢＳＴＤ－Ｂ７３ストリームに割り当てられる共通的な値を示す。ＩＰヘッダ内の宛て先ＩＰアドレスは、例えば、当該ストリームに割り当てられるマルチキャストアドレスを示す。 Next, the transmission unit 122 adds a network (NW) header to the RTP packet generated by the transmission stream processing unit 130 to generate an IP packet (step S111). The port number in the UDP header of an IP packet indicates a common value assigned to ARIB STD-B73 streams, regardless of the type of essence contained. A destination IP address in the IP header indicates, for example, a multicast address assigned to the stream.

次いで、送信部１２２は、生成したＩＰパケットをネットワークへ送信する（ステップＳ１１３）。送信されたＩＰパケットは、対応するマルチキャストグループへ加入した受信側のノードにより受信される。 Next, the transmission unit 122 transmits the generated IP packet to the network (step S113). A transmitted IP packet is received by a receiving node that has subscribed to the corresponding multicast group.

上述したステップＳ１０３～Ｓ１１３の処理は、未処理のエッセンスデータが残っている間、反復的に実行され得る（ステップＳ１１５）。全てのエッセンスデータが処理されると、図１０に示したストリーム送信処理は終了する。 The processing of steps S103 to S113 described above can be repeatedly performed while unprocessed essence data remains (step S115). When all the essence data have been processed, the stream transmission process shown in FIG. 10 ends.

（２）ストリーム受信処理
図１１は、第１の実施形態に係るストリーム受信処理の流れの一例を示すフローチャートである。 (2) Stream Reception Processing FIG. 11 is a flowchart showing an example of the flow of stream reception processing according to the first embodiment.

まず、放送信号処理ノード１００のＰＴＰ処理部１１６は、デバイス内部クロック１１０をＰＴＰの時刻源に直接的に又は間接的に同期させる（ステップＳ１５１）。ＰＴＰの時刻源とのデバイス内部クロック１１０の同期は、この後も継続的に維持される。 First, the PTP processing unit 116 of the broadcast signal processing node 100 directly or indirectly synchronizes the device internal clock 110 with the PTP time source (step S151). Synchronization of the device internal clock 110 with the PTP's time source continues to be maintained thereafter.

次いで、受信部１２４は、図１０を用いて説明したストリーム送信処理と同様の処理を通じて他のノードから送信される放送信号ストリームのＩＰパケットを受信する（ステップＳ１５３）。放送信号処理ノード１００のデバイス内部クロック１１０がＰＴＰの時刻源と同期していることから、放送信号処理ノード１００は、ＩＰパケットの送信元の上記他のノードとも高い精度で同期している。 Next, the receiving unit 124 receives IP packets of broadcast signal streams transmitted from other nodes through the same process as the stream transmission process described using FIG. 10 (step S153). Since the device internal clock 110 of the broadcast signal processing node 100 is synchronized with the PTP time source, the broadcast signal processing node 100 is also highly accurately synchronized with the above-mentioned other node that is the transmission source of the IP packet.

次いで、受信部１２４は、受信したＩＰパケットのネットワークヘッダを除去することにより、ＲＴＰデータグラムを抽出する（ステップＳ１５３）。そして、受信部１２４は、抽出したＲＴＰデータグラムを受信ストリーム処理部１４０へ出力する。 Next, the receiving unit 124 extracts the RTP datagram by removing the network header of the received IP packet (step S153). The receiving unit 124 then outputs the extracted RTP datagram to the reception stream processing unit 140 .

次いで、受信ストリーム処理部１４０は、ＲＴＰデータグラムのトランスポートヘッダ及びエッセンスヘッダを除去することにより、エッセンスペイロードを抽出する（ステップＳ１５５）。 Next, the reception stream processing unit 140 extracts the essence payload by removing the transport header and essence header of the RTP datagram (step S155).

次いで、受信ストリーム処理部１４０は、エッセンスヘッダ内のペイロードタイプの値に応じて、ＡＲＩＢＳＴＤ－Ｂ７３により規定された映像データ、音声データ又は補助データのパッキング形式に従ってエッセンスペイロードを逆パッキングすることにより、エッセンスデータを復元する（ステップＳ１５７）。なお、図１１には示していないものの、受信ストリーム処理部１４０は、エッセンスデータグラムと共にＦＥＣデータグラムが受信される場合には、ＦＥＣデータグラムに基づいて誤り訂正復号を実行して、受信データに含まれるビット誤りを検出してもよい。 Next, according to the value of the payload type in the essence header, the received stream processing unit 140 reverse-packs the essence payload according to the packing format of video data, audio data or auxiliary data defined by ARIB STD-B73. The essence data is restored (step S157). Although not shown in FIG. 11, when an FEC datagram is received together with an essence datagram, the reception stream processing unit 140 performs error correction decoding based on the FEC datagram to convert the reception data to Bit errors may be detected.

次いで、受信ストリーム処理部１４０のアラインメント部１６０は、ＲＴＰヘッダ内のＲＴＰタイムスタンプ、又はＲＴＰペイロード内のフレームカウント情報に基づいて、異なるＲＴＰパケットに由来するエッセンスデータ間の時間合わせを行う（ステップＳ１５９）。 Next, the alignment unit 160 of the reception stream processing unit 140 performs time alignment between essence data derived from different RTP packets based on the RTP timestamp in the RTP header or the frame count information in the RTP payload (step S159). ).

次いで、データ処理部１８０は、アラインメント部１６０により時間合わせされたエッセンスデータに基づく処理を実行する（ステップＳ１６１）。ここで実行される処理は、例えば、複数のエッセンスの同期的な再生、又はエッセンスデータのＳＤＩ信号への変換などであってよい。 Next, the data processing unit 180 executes processing based on the essence data time-aligned by the alignment unit 160 (step S161). The processing performed here may be, for example, the synchronous reproduction of multiple essences, or the conversion of essence data into SDI signals.

上述したステップＳ１５３～Ｓ１６１の処理は、同一のストリームのパケットが継続的に受信されている間、反復的に実行され得る（ステップＳ１６３）。パケットの受信が停止すると、図１１に示したストリーム受信処理は終了する。 The processing of steps S153 to S161 described above can be repeatedly performed while packets of the same stream are continuously received (step S163). When packet reception stops, the stream reception processing shown in FIG. 11 ends.

＜＜３．第２の実施形態＞＞
次いで、図１２を用いて、第２の実施形態について説明する。上述した第１の実施形態は具体的な実施形態であり、一方で第２の実施形態はより一般化された実施形態である。 <<3. Second Embodiment>>
Next, a second embodiment will be described with reference to FIG. 12 . The first embodiment described above is a specific embodiment, while the second embodiment is a more general embodiment.

＜３－１．放送信号処理ノードの構成例＞
図１２は、第２の実施形態に係る放送信号処理ノード２００の構成の一例を示すブロック図である。放送信号処理ノード２００は、放送信号ストリームを１つ以上の他のノードから受信するノードである。放送信号処理ノード２００により受信されるストリームは、限定ではないものの、ＡＲＩＢＳＴＤ－Ｂ７３ストリームなどのエッセンス混在型ストリームを含む。図１２を参照すると、放送信号処理ノード２００は、通信部２１０及びアラインメント部２２０を備える。 <3-1. Configuration example of broadcast signal processing node>
FIG. 12 is a block diagram showing an example configuration of a broadcast signal processing node 200 according to the second embodiment. Broadcast signal processing node 200 is a node that receives broadcast signal streams from one or more other nodes. Streams received by broadcast signal processing node 200 include, but are not limited to, mixed-essence streams such as ARIB STD-B73 streams. Referring to FIG. 12, the broadcast signal processing node 200 includes a communication section 210 and an alignment section 220.

通信部２１０は、ＰＴＰの時刻源に直接的に又は間接的に同期したデバイス内部クロックにロックされたメディアクロックに対しオフセットを有しないＲＴＰクロックに従ってＲＴＰヘッダへタイムスタンプを付与されたＲＴＰパケットであって、映像データ、音声データ及び補助データのうちのいずれかに相当するエッセンスデータをＲＴＰペイロードに含む当該ＲＴＰパケット（第１のＲＴＰパケット）を受信する。通信部２１０は、さらに他のＲＴＰパケット（第２のＲＴＰパケット）をも受信する。第１のＲＴＰパケットと第２のＲＴＰパケットとは、同一の放送信号ストリームに含まれるパケットであってもよく、又は互いに異なる放送信号ストリームに含まれるパケットであってもよい。 The communication unit 210 is an RTP packet whose RTP header is time-stamped according to the RTP clock, which has no offset to the media clock locked to the device internal clock that is directly or indirectly synchronized to the PTP time source. Then, the RTP packet (first RTP packet) containing the essence data corresponding to any one of the video data, the audio data and the auxiliary data in the RTP payload is received. The communication unit 210 also receives another RTP packet (second RTP packet). The first RTP packet and the second RTP packet may be packets included in the same broadcast signal stream, or may be packets included in different broadcast signal streams.

アラインメント部２２０は、受信される上記ＲＴＰパケットの上記ＲＴＰヘッダから取得される上記タイムスタンプ、又は、受信される上記ＲＴＰパケットの上記ＲＴＰペイロード内のヘッダから取得されるフレームカウント情報に基づいて、上記ＲＴＰパケットと上記他のＲＴＰパケットとの間でエッセンスデータの時間合わせを行う。 The alignment unit 220 , based on the time stamp obtained from the RTP header of the received RTP packet or the frame count information obtained from the header in the RTP payload of the received RTP packet, Time alignment of essence data is performed between the RTP packet and the other RTP packet.

エッセンスデータの時間合わせのための放送信号処理が、放送信号処理ノード２００の上記動作ステップを含んでもよい。また、それら動作ステップをプロセッサに実行させるコンピュータプログラムが提供されてもよい。また、それら動作ステップをプロセッサに実行させるコンピュータプログラムを記憶した非一時的なコンピュータ読取可能な記憶媒体が提供されてもよい。加えて、第１の実施形態において説明した任意の機能又は処理が本実施形態に適用されてよい。 Broadcast signal processing for time alignment of essence data may include the above operational steps of the broadcast signal processing node 200 . A computer program may also be provided that causes a processor to perform those operational steps. A non-transitory computer-readable storage medium storing a computer program that causes a processor to perform those operational steps may also be provided. Additionally, any function or process described in the first embodiment may be applied to this embodiment.

＜３－２．変形例＞
図１３は、第２の実施形態の第１の変形例に係る放送信号処理ノード２００の構成の一例を示すブロック図である。図１３を参照すると、放送信号処理ノード２００は、図１２を用いて説明した通信部２１０及びアラインメント部２２０に加えて、再生部２３０を備える。 <3-2. Variation>
FIG. 13 is a block diagram showing an example configuration of a broadcast signal processing node 200 according to the first modification of the second embodiment. Referring to FIG. 13, broadcast signal processing node 200 includes reproducing section 230 in addition to communication section 210 and alignment section 220 described using FIG.

再生部２３０は、アラインメント部２２０により時間合わせされた上記ＲＴＰパケット（第１のＲＴＰパケット）及び上記他のＲＴＰパケット（第２のＲＴＰパケット）の上記エッセンスデータに基づいて、エッセンスを同期的に再生する。例えば、複数のカメラから受信される映像が同時に再生されてもよく、映像及び音声が同期的に再生されてもよく、又は、映像及び／若しくは音声と共に補助データが再生されてもよい。 The reproduction unit 230 synchronously reproduces the essence based on the essence data of the RTP packet (first RTP packet) and the other RTP packet (second RTP packet) time-aligned by the alignment unit 220. do. For example, video received from multiple cameras may be played simultaneously, video and audio may be played synchronously, or auxiliary data may be played along with video and/or audio.

図１４は、第２の実施形態の第２の変形例に係る放送信号処理ノード２００の構成の一例を示すブロック図である。図１４を参照すると、放送信号処理ノード２００は、第１通信部２１５、アラインメント部２２０、変換部２４０及び第２通信部２５０を備える。 FIG. 14 is a block diagram showing an example configuration of a broadcast signal processing node 200 according to the second modification of the second embodiment. Referring to FIG. 14, broadcast signal processing node 200 includes first communication unit 215 , alignment unit 220 , conversion unit 240 and second communication unit 250 .

第１通信部２１５は、ＰＴＰの時刻源に直接的に又は間接的に同期したデバイス内部クロックにロックされたメディアクロックに対しオフセットを有しないＲＴＰクロックに従ってＲＴＰヘッダへタイムスタンプを付与されたＲＴＰパケットであって、映像データ、音声データ及び補助データのうちのいずれかに相当するエッセンスデータをＲＴＰペイロードに含む当該ＲＴＰパケット（第１のＲＴＰパケット）を受信する。第１通信部２１５は、さらに他のＲＴＰパケット（第２のＲＴＰパケット）をも受信する。第１のＲＴＰパケットと第２のＲＴＰパケットとは、同一の放送信号ストリームに含まれるパケットであってもよく、又は互いに異なる放送信号ストリームに含まれるパケットであってもよい。 The first communication unit 215 generates RTP packets whose RTP headers are time-stamped according to the RTP clock that has no offset with respect to the media clock locked to the device internal clock that is directly or indirectly synchronized with the PTP time source. and receives an RTP packet (first RTP packet) containing essence data corresponding to any one of video data, audio data, and auxiliary data in the RTP payload. The first communication unit 215 also receives another RTP packet (second RTP packet). The first RTP packet and the second RTP packet may be packets included in the same broadcast signal stream, or may be packets included in different broadcast signal streams.

変換部２４０は、アラインメント部２２０による上記ＲＴＰパケット（第１のＲＴＰパケット）と上記他のＲＴＰパケット（第２のＲＴＰパケット）との間のエッセンスデータの時間合わせに基づいて、それらＲＴＰパケットに由来するエッセンスデータをＳＤＩ信号へ変換する。変換部２４０により生成されるＳＤＩ信号の信号形式は、例えばＳＤ－ＳＤＩ、ＨＤ－ＳＤＩ、３Ｇ－ＳＤＩ、６Ｇ－ＳＤＩ又は１２Ｇ－ＳＤＩといった、ＳＤＩの任意の派生であってよい。 Based on the time alignment of the essence data between the RTP packet (first RTP packet) and the other RTP packet (second RTP packet) by the alignment unit 220, the conversion unit 240 extracts data derived from these RTP packets. The essence data to be converted into an SDI signal. The signal format of the SDI signal produced by converter 240 may be any derivative of SDI, eg SD-SDI, HD-SDI, 3G-SDI, 6G-SDI or 12G-SDI.

第２通信部２５０は、上述した時間合わせに基づいて変換部２４０により生成されるＳＤＩ信号を、ＳＤＩドメインに属する他のノードへ送信する。 The second communication unit 250 transmits the SDI signal generated by the conversion unit 240 based on the time adjustment described above to other nodes belonging to the SDI domain.

＜＜４．第１の実施形態及び第２の実施形態のまとめ＞＞
ここまで、本開示の第１の実施形態及び第２の実施形態について詳細に説明した。上述した実施形態では、映像データ、音声データ及び補助データのうちのいずれかに相当するエッセンスデータをＲＴＰペイロードに含むＲＴＰパケットと、他のＲＴＰパケットとの間のエッセンスデータの時間合わせが、ＲＴＰヘッダ内のＲＴＰタイムスタンプ、又は、ＲＴＰペイロード内のヘッダから取得されるフレームカウント情報に基づいて行われる。ＲＴＰタイムスタンプは、ＰＴＰの時刻源に直接的に又は間接的に同期したデバイス内部クロックにロックされたメディアクロックに対しオフセットを有しないＲＴＰクロックに従って付与される。かかる構成によれば、放送局のＩＰネットワーク上で映像データ、音声データ又は補助データといったエッセンスデータを単一のストリームで伝送するプロトコルを利用する際に、エッセンスデータの適切な時間合わせを行うことが可能である。 <<4. Summary of First Embodiment and Second Embodiment>>
So far, the first and second embodiments of the present disclosure have been described in detail. In the above-described embodiment, time alignment of essence data between an RTP packet containing essence data corresponding to any one of video data, audio data, and auxiliary data in the RTP payload and other RTP packets is performed in the RTP header. or based on the frame count information obtained from the header in the RTP payload. RTP timestamps are applied according to the RTP clock, which has no offset to the media clock locked to the device internal clock that is directly or indirectly synchronized to the PTP time source. According to this configuration, when using a protocol for transmitting essence data such as video data, audio data, or auxiliary data in a single stream on the IP network of a broadcasting station, it is possible to perform appropriate time adjustment of the essence data. It is possible.

ある例において、上記ＲＴＰパケットは、エッセンス混在型ストリームのためのプロトコルに従って生成されるパケットである。上記エッセンス混在型ストリームは、例えば、ＡＲＩＢＳＴＤ－Ｂ７３ストリームであってよい。ＡＲＩＢＳＴＤ－Ｂ７３ストリームは、ブランキング期間に相当するデータを含まない。こうした例によれば、受信側でもとのコンテンツを再構築する処理を複雑にすることなく、かつエッセンスタイプごとにネットワーク上で異なる遅延を受ける可能性を回避しつつ、エッセンスデータを容易に時間合わせすることができる。 In one example, the RTP packets are packets generated according to a protocol for mixed-essence streams. The mixed-essence stream may be, for example, an ARIB STD-B73 stream. The ARIB STD-B73 stream does not contain data corresponding to blanking periods. These examples make it easy to time-align essence data without complicating the process of reconstructing the original content at the receiving end, and avoiding the potential for different delays over the network for each essence type. can do.

ある例において、上記メディアクロックのクロック周波数は、２７．０ＭＨｚに等しい。かかる構成によれば、メディアクロックのクロック周波数を映像フレーム周波数の整数倍にしてフレーム期間を正確に一定にしつつ、映像データ、音声データ及び補助データを含み得る単一のエッセンス混在型ストリームに、そのメディアクロックのクロック周波数に基づいて共通的なやり方で時間合わせのための情報を付与することができる。 In one example, the clock frequency of the media clock is equal to 27.0 MHz. According to such a configuration, the clock frequency of the media clock is set to an integer multiple of the video frame frequency to accurately keep the frame period constant, and a single mixed-essence stream that can include video data, audio data, and ancillary data is generated. Information for time alignment can be provided in a common manner based on the clock frequency of the media clock.

＜＜５．第３の実施形態＞＞
上で既に述べたように、ＳＭＰＴＥＳＴ２１１０ストリームのために通常利用されるＳＤＰオブジェクトのフォーマットは、フォーマット構造においても、記述される情報の内容においても、ＡＲＩＢＳＴＤ－Ｂ７３ストリームの特性に必ずしも適していない。ＡＲＩＢＳＴＤ－Ｂ７３ストリームの伝送を適切にセットアップするためには、当該ストリームの特性に適したＳＤＰオブジェクトのフォーマットを定義する必要がある。 <<5. Third Embodiment>>
As already mentioned above, the format of SDP objects normally used for SMPTE ST2110 streams is not necessarily suitable for the characteristics of ARIB STD-B73 streams, neither in format structure nor in the information content described. . In order to properly set up the transmission of an ARIB STD-B73 stream, it is necessary to define the format of the SDP objects suitable for the characteristics of the stream.

＜５－１．既存のＳＤＰオブジェクトの例＞
ＳＤＰは、ストリーミング（又はその他の何らかのセッション）に使用されるパラメータセットをテキスト形式で記述する際のフォーマットを規定するプロトコルである、典型的には、ストリーミングの開始に先立って、ストリームの送信側と受信側との間でＳＤＰオブジェクト（例えば、ＳＤＰ形式で記述されたデータファイル）を送受信することにより、どういったパラメータ値を実際に使用すべきかに関する交渉が行われ、合意が結ばれる。あるいは、ストリームのセットアップに制御ノードが一元的に関与する場合には、ＳＤＰオブジェクトは、当該制御ノードへ提供され得る。 <5-1. Example of an existing SDP object>
SDP is a protocol that defines a format for textually describing parameter sets used in streaming (or any other session). By transmitting and receiving an SDP object (for example, a data file described in SDP format) to and from the receiving side, negotiations are made regarding what parameter values should actually be used, and an agreement is reached. Alternatively, if the control node is centrally involved in setting up the stream, the SDP object may be provided to the control node.

図１５は、エッセンス分離型ストリーム用のＳＤＰオブジェクトの典型的なフォーマット構造について説明するための説明図である。図１５に示したように、ＳＤＰオブジェクトは、セッションレベルの（セッションに属する個々のメディアよりもむしろセッションにとって固有の）属性を記述するためのセッションレベルセクションと、１つ以上のメディアのそれぞれの属性を記述するための１つ以上のメディアレベルセクションとを含む。図１５のＳＤＰオブジェクトは、映像エッセンス用、音声エッセンス用及び補助データエッセンス用のそれぞれのメディアレベルセクションを含んでいる。オプションとして、可用性向上のために映像エッセンスに冗長ＲＴＰストリーム方式が適用される場合には、プライマリの映像エッセンス用のメディアレベルセクションに加えて、セカンダリの映像エッセンス用のメディアレベルセクションがＳＤＰオブジェクトに含められ得る。なお、かかる例に限定されず、冗長ＲＴＰストリーム方式は、映像エッセンス、音声エッセンス及び補助データエッセンスのうちの任意の１つ以上に適用されてよい。各メディアレベルセクションは、対応するエッセンスタイプのストリームの属性を記述する属性フィールド群を含む。即ち、映像エッセンス用のメディアレベルセクションは、映像関連属性フィールド群を、音声エッセンス用のメディアレベルセクションは、音声関連属性フィールド群を、補助データエッセンス用のメディアレベルセクションは、補助データ関連属性フィールド群を含み得る。 FIG. 15 is an explanatory diagram for explaining a typical format structure of an SDP object for an essence-separated stream. As shown in Figure 15, an SDP object consists of a session-level section for describing session-level attributes (specific to a session rather than individual media belonging to the session) and attributes for each of one or more media. and one or more media-level sections for describing The SDP object of FIG. 15 contains media level sections for video essence, audio essence and ancillary data essence respectively. Optionally, in addition to the media level section for the primary video essence, the media level section for the secondary video essence is included in the SDP object if a redundant RTP stream scheme is applied to the video essence for increased availability. can be Note that the redundant RTP stream scheme may be applied to any one or more of video essence, audio essence, and ancillary data essence without being limited to such an example. Each media level section contains attribute fields that describe the attributes of the stream of the corresponding essence type. That is, the media level section for video essence contains video-related attribute fields, the media-level section for audio essence contains audio-related attribute fields, and the media-level section for auxiliary data essence contains auxiliary data-related attribute fields. can include

図１６は、図１５を用いて説明したフォーマット構造を有する、ＳＭＰＴＥＳＴ２１１０ストリームの属性を記述したＳＤＰオブジェクトの一例を示している。 FIG. 16 shows an example of an SDP object describing attributes of an SMPTE ST2110 stream having the format structure described with reference to FIG.

図１６を参照すると、ＳＤＰオブジェクト３０１は、セッションレベルセクション３０２と、４つのメディアレベルセクション３０３、３０４、３０５及び３０６とを含む。セッションレベルセクション３０２は、プロトコルバージョンフィールド（“v=…”）、送信元フィールド（“o=…”）及びセッション名フィールド（“s=…”）など、セッションに固有の情報を記述するためのフィールド群を有する。 Referring to FIG. 16, SDP object 301 includes session level section 302 and four media level sections 303 , 304 , 305 and 306 . The session level section 302 is for describing session-specific information such as a protocol version field ("v=..."), a source field ("o=..."), and a session name field ("s=..."). It has a group of fields.

個々のメディアレベルセクションの開始は、メディア記述フィールド（“m=…”）により識別される。メディアレベルセクション３０３及び３０４は、映像エッセンスのストリームのためのセクションである。図１６の例では、メディアレベルセクション３０３にプライマリの映像エッセンスのストリームの属性が記述されており、メディアレベルセクション３０４にセカンダリの映像エッセンスのストリームの属性が記述されている。メディアレベルセクション３０５には、音声エッセンスのストリームの属性が記述されている。メディアレベルセクション３０６には、補助データエッセンスのストリームの属性が記述されている。 The start of each media level section is identified by a media description field ("m=..."). Media level sections 303 and 304 are for streams of video essence. In the example of FIG. 16, the media level section 303 describes the attributes of the primary video essence stream, and the media level section 304 describes the attributes of the secondary video essence stream. The media level section 305 describes the attributes of the audio essence stream. The media level section 306 describes attributes of the stream of auxiliary data essence.

映像用のメディアレベルセクション３０３の冒頭のメディア記述フィールド（“m=…”）には、メディアタイプ（“video”）、送信ポート番号（“50000”）、トランスポートプロトコル（“RTP/AVP”）、及びフォーマット形式番号（“112”）が記述されている。このメディア記述フィールドに加えて、メディアレベルセクション３０３は、映像関連属性フィールド群３０７を含む。映像関連属性フィールド群３０７は、ソースフィルタ属性フィールド（“a=source-filter:…”）、ＲＴＰマップ属性フィールド（“a=rtpmap:…”）、フォーマット固有パラメータ属性フィールド（“a=fmtp:…”）、基準クロック属性フィールド（“a=ts-refclk:…”）、メディアクロック属性フィールド（“a=mediaclk:…”）、及びマップＩＤ属性フィールド（“a=mid:…”）を含む。これらのうち、ＲＴＰマップ属性フィールド（“a=rtpmap:…”）は、対応するメディア記述フィールドのフォーマット形式番号と同じ値を有するペイロードタイプ番号（“112”）、サブタイプ名（“raw”）及びメディアクロック周波数（“90000”）を示す。フォーマット固有パラメータ属性フィールド（“a=fmtp:…”）には、フォーマット形式番号（“112”）に続いて、フォーマット固有の１つ以上のパラメータのパラメータ名とパラメータ値のペアが列挙される。マップＩＤ属性フィールド（“a=mid:…”）は、冗長ＲＴＰストリームが使用される場合にプライマリストリームとセカンダリストリームとを区別するために使用される。メディアレベルセクション３０４の内容は、メディア記述フィールドの送信ポート番号及びマップＩＤ属性フィールドを除いてメディアレベルセクション３０３と同様であってよいため、図１６では省略されている。 The media description field ("m=...") at the beginning of the media level section 303 for video contains the media type ("video"), transmission port number ("50000"), transport protocol ("RTP/AVP"). , and the format type number (“112”). In addition to this media description field, media level section 303 includes video related attribute fields 307 . The video-related attribute field group 307 includes a source filter attribute field ("a=source-filter:..."), an RTP map attribute field ("a=rtpmap:..."), and a format-specific parameter attribute field ("a=fmtp:..."). ”), a reference clock attribute field (“a=ts-refclk: …”), a media clock attribute field (“a=mediaclk: …”), and a map ID attribute field (“a=mid: …”). Of these, the RTP map attribute field ("a=rtpmap:...") has the same value as the format type number of the corresponding media description field, payload type number ("112"), subtype name ("raw"). and the media clock frequency (“90000”). The format-specific parameter attribute field ("a=fmtp:...") lists the format type number ("112") followed by parameter name and parameter value pairs for one or more format-specific parameters. The Map ID attribute field ("a=mid:...") is used to distinguish between primary and secondary streams when redundant RTP streams are used. The contents of media level section 304 are omitted in FIG. 16 because they may be similar to media level section 303 except for the transmission port number and map ID attribute fields in the media description field.

音声用のメディアレベルセクション３０５の冒頭のメディア記述フィールド（“m=…”）には、メディアタイプ（“audio”）、送信ポート番号（“51200”）、トランスポートプロトコル（“RTP/AVP”）、及びフォーマット形式番号（“97”）が記述されている。このメディア記述フィールドに加えて、メディアレベルセクション３０５は、音声関連属性フィールド群３０８を含む。音声関連属性フィールド群３０８は、ＲＴＰマップ属性フィールド（“a=rtpmap:…”）、パケット時間属性フィールド（“a=ptime:…”）、基準クロック属性フィールド（“a=ts-refclk:…”）、メディアクロック属性フィールド（“a=mediaclk:…”）、フォーマット固有パラメータ属性フィールド（“a=fmtp:…”）、及びマップＩＤ属性フィールド（“a=mid:…”）を含む。これらのうち、ＲＴＰマップ属性フィールド（“a=rtpmap:…”）は、対応するメディア記述フィールドのフォーマット形式番号と同じ値を有するペイロードタイプ番号（“97”）、サブタイプ名（“L24”）、メディアクロック周波数（“48000”）及び符号化パラメータ（“6”）を示す。なお、サブタイプ名“L24”は、音声エッセンスが２４ビットのリニアエンコーディングで符号化されることを示す。符号化パラメータ“6”は、音声チャンネル数が６であることを示す。フォーマット固有パラメータ属性フィールド（“a=fmtp:…”）には、フォーマット形式番号（“97”）に続いて、フォーマット固有の１つ以上のパラメータのパラメータ名とパラメータ値のペアが列挙される。 The media description field ("m=...") at the beginning of the media level section 305 for audio contains the media type ("audio"), transmission port number ("51200"), transport protocol ("RTP/AVP"). , and the format type number (“97”). In addition to this media description field, media level section 305 includes audio-related attribute fields 308 . Audio-related attribute fields 308 include an RTP map attribute field (“a=rtpmap: …”), a packet time attribute field (“a=ptime: …”), a reference clock attribute field (“a=ts-refclk: …”). ), a media clock attribute field ("a=mediaclk:..."), a format specific parameter attribute field ("a=fmtp:..."), and a map ID attribute field ("a=mid:..."). Of these, the RTP map attribute field ("a=rtpmap:...") has a payload type number ("97"), subtype name ("L24") that has the same value as the format type number of the corresponding media description field. , indicates the media clock frequency (“48000”) and the coding parameter (“6”). Note that the subtype name "L24" indicates that the voice essence is encoded by 24-bit linear encoding. The encoding parameter "6" indicates that the number of audio channels is six. The format-specific parameter attribute field ("a=fmtp:...") lists the format type number ("97") followed by parameter name and parameter value pairs for one or more format-specific parameters.

補助データ用のメディアレベルセクション３０６の冒頭のメディア記述フィールド（“m=…”）には、メディアタイプ（“video”）、送信ポート番号（“51300”）、トランスポートプロトコル（“RTP/AVP”）、及びフォーマット形式番号（“98”）が記述されている。このメディア記述フィールドに加えて、メディアレベルセクション３０６は、補助データ関連属性フィールド群３０９を含む。補助データ関連属性フィールド群３０９は、ＲＴＰマップ属性フィールド（“a=rtpmap:…”）、基準クロック属性フィールド（“a=ts-refclk:…”）、メディアクロック属性フィールド（“a=mediaclk:…”）、及びマップＩＤ属性フィールド（“a=mid:…”）を含む。これらのうち、ＲＴＰマップ属性フィールド（“a=rtpmap:…”）は、対応するメディア記述フィールドのフォーマット形式番号と同じ値を有するペイロードタイプ番号（“98”）、サブタイプ名（“smpte291”）及びメディアクロック周波数（“90000”）を示す。 The media description field ("m=...") at the beginning of the media level section 306 for ancillary data includes media type ("video"), transmission port number ("51300"), transport protocol ("RTP/AVP"), ), and the format type number (“98”). In addition to this media description field, media level section 306 includes auxiliary data related attribute fields 309 . The auxiliary data-related attribute field group 309 includes an RTP map attribute field ("a=rtpmap:..."), a reference clock attribute field ("a=ts-refclk:..."), and a media clock attribute field ("a=mediaclk:..."). ”), and a map ID attribute field (“a=mid: …”). Of these, the RTP map attribute field ("a=rtpmap:...") has a payload type number ("98"), subtype name ("smpte291") that has the same value as the format type number of the corresponding media description field. and the media clock frequency (“90000”).

図１６から理解されるように、ＳＭＰＴＥＳＴ２１１０ストリーム向けのＳＤＰオブジェクトは、複数のエッセンスタイプのそれぞれのメディアレベルセクションを含み、それらメディアレベルセクションが、対応するエッセンスタイプに依存して異なるフィールドのセットを有する。こうしたフォーマット構造は、ＳＭＰＴＥＳＴ２１１０ストリームと同様のエッセンス分離型ストリームには再利用可能であるが、ＡＲＩＢＳＴＤ－Ｂ７３ストリームのようなエッセンス混在型ストリームの属性を記述するためには適しない。なぜなら、エッセンス混在型ストリームは、単一のストリーム内に複数のエッセンスタイプのデータを含むためである。さらに、映像エッセンスの圧縮をサポートし、かつ誤り訂正符号化／復号も可能なＡＲＩＢＳＴＤ－Ｂ７３ストリームをセットアップするために要する情報項目が、図１６に例示したＳＤＰオブジェクトのフォーマットには不足している。 As can be seen from Figure 16, the SDP object for an SMPTE ST2110 stream contains media level sections for each of multiple essence types, which have different sets of fields depending on the corresponding essence type. have. Such format structures are reusable for essence-separated streams like SMPTE ST2110 streams, but are not suitable for describing attributes of mixed-essence streams like ARIB STD-B73 streams. This is because the mixed-essence stream contains multiple essence-type data in a single stream. Furthermore, the SDP object format illustrated in FIG. 16 lacks the information items required to set up an ARIB STD-B73 stream that supports compression of video essence and is also capable of error correction encoding/decoding. .

＜５－２．ＳＤＰオブジェクトの新たなフォーマット＞
図１７は、本実施形態においてエッセンス混在型ストリームのために定義されるＳＤＰオブジェクトのフォーマット構造について説明するための説明図である。 <5-2. New Format of SDP Object>
FIG. 17 is an explanatory diagram for explaining the format structure of the SDP object defined for the mixed-essence stream in this embodiment.

図１７に示したように、新たなフォーマットにおいて、ＳＤＰオブジェクトは、セッションレベルの属性を記述するセッションレベルセクションと、複数のエッセンスタイプにとって共通のメディアレベルセクションとを含む。そして、１つの共通的なメディアレベルセクションが、映像エッセンスに関連する属性、音声エッセンスに関連する属性、及び補助データエッセンスに関連する属性を記述するための属性フィールド群を含む。とりわけ、本実施形態では、ＳＤＰの規格において独立した属性フィールドを割り当てられていないパラメータは、フォーマット固有パラメータ属性フィールド内に記述される。このようにして、エッセンス混在型ストリーム（あるいはＡＲＩＢＳＴＤ－Ｂ７３ストリーム）の特性にＳＤＰオブジェクトの構造を適合させることで、ストリームに固有の情報を曖昧性無くＳＤＰオブジェクトに記述することができる。 As shown in Figure 17, in the new format, the SDP object contains a session-level section that describes session-level attributes and a media-level section that is common to multiple essence types. And one common media level section contains attribute fields for describing attributes related to video essence, attributes related to audio essence, and attributes related to auxiliary data essence. Notably, in this embodiment, parameters that are not assigned separate attribute fields in the SDP standard are described in format-specific parameter attribute fields. In this way, by adapting the structure of the SDP object to the characteristics of the mixed-essence stream (or ARIB STD-B73 stream), stream-specific information can be described in the SDP object without ambiguity.

オプションとして、可用性向上のために冗長ＲＴＰストリーム方式が適用される場合には、ＳＤＰオブジェクトは、プライマリのエッセンス共通のメディアレベルセクションに加えて、セカンダリのエッセンス共通のメディアレベルセクションを含み得る。 Optionally, in addition to the primary essence-common media-level section, the SDP object may contain a secondary essence-common media-level section if a redundant RTP stream scheme is applied for increased availability.

図１８は、図１７を用いて説明したフォーマット構造を有する、ＡＲＩＢＳＴＤ－Ｂ７３ストリームの属性を記述したＳＤＰオブジェクトの一例を示している。 FIG. 18 shows an example of an SDP object describing the attributes of the ARIB STD-B73 stream having the format structure described with reference to FIG.

図１８を参照すると、ＳＤＰオブジェクト３１１は、セッションレベルセクション３１２と、２つのメディアレベルセクション３１３及び３１８とを含む。セッションレベルセクション３１２は、プロトコルバージョンフィールド（“v=…”）、送信元フィールド（“o=…”）及びセッション名フィールド（“s=…”）など、セッションに固有の情報を記述するためのフィールド群を有する。 Referring to FIG. 18, SDP object 311 contains session level section 312 and two media level sections 313 and 318 . The session level section 312 is for describing session specific information such as a protocol version field ("v=..."), a source field ("o=...") and a session name field ("s=..."). It has a group of fields.

メディアレベルセクション３１３及び３１８の開始は、それぞれのメディア記述フィールド（“m=…”）により識別される。メディアレベルセクション３１３及び３１８は共に、複数のエッセンスタイプにとって共通のセクションである。図１８の例では、メディアレベルセクション３１３にプライマリのエッセンス混在型ストリームの属性が記述されており、メディアレベルセクション３１８にセカンダリのエッセンス混在型ストリームの属性が記述されている。 The beginnings of media level sections 313 and 318 are identified by their respective media description fields ("m=..."). Both media level sections 313 and 318 are sections common to multiple essence types. In the example of FIG. 18, the media level section 313 describes the attributes of the primary mixed-essence stream, and the media-level section 318 describes the attributes of the secondary mixed-essence stream.

メディアレベルセクション３１３の冒頭のメディア記述フィールド（“m=…”）には、メディアタイプ（“video”）、送信ポート番号（“50000”）、トランスポートプロトコル（“RTP/AVP”）、及びフォーマット形式番号（“110”）が記述される。メディアレベルセクション３１３は、上述したように複数のエッセンスタイプにとって共通のセクションであるものの、本開示に係る技術において、各エッセンスのタイミングは映像フレームに紐付けられることから、ここではメディアタイプの文字列として“video”が選択され得る。送信ポート番号は、例えば、プライベートポート番号の範囲から任意に選択されるＵＤＰポート番号であってよい。フォーマット形式番号は、他の値（例えば、“111”）であってもよい。本実施形態において、フォーマット形式番号は、複数のエッセンスタイプに共通的に付与されるフォーマット識別値としての役割を有する。このメディア記述フィールドに加えて、メディアレベルセクション３１３は、属性フィールド群３１４を含む。とりわけ、図１８の例において、属性フィールド群３１４は、ＲＴＰマップ属性フィールド（“a=rtpmap:…”）３１５、フォーマット固有パラメータ属性フィールド（“a=fmtp:…”）３１６及びパケット時間属性フィールド（“a=ptime:…”）３１７を含む。 The media description field ("m=...") at the beginning of the media level section 313 contains the media type ("video"), transmission port number ("50000"), transport protocol ("RTP/AVP"), and format. A format number (“110”) is described. The media level section 313 is a section common to multiple essence types as described above, but in the technology according to the present disclosure, the timing of each essence is associated with a video frame. "video" can be selected as the The sending port number may be, for example, a UDP port number arbitrarily selected from a range of private port numbers. The format type number may be another value (eg, "111"). In this embodiment, the format type number has a role as a format identification value commonly given to a plurality of essence types. In addition to this media description field, media level section 313 includes attribute fields 314 . Specifically, in the example of FIG. 18, attribute fields 314 include an RTP map attribute field ("a=rtpmap:...") 315, a format specific parameter attribute field ("a=fmtp:...") 316, and a packet time attribute field ( "a=ptime:...") 317.

ＲＴＰマップ属性フィールド３１５は、フォーマット形式番号と同じ値を有するペイロードタイプ番号（“110”）によって、メディアレベルセクション３１３の冒頭のメディア記述フィールドに関連付けられる。ＲＴＰマップ属性フィールド３１５のサブタイプ名には、例えばプロトコル名称“ARIB_STD-B73”が記述され、この名称から、ＳＤＰオブジェクト３１１を提供するセンダにより送信される放送信号ストリームがＡＲＩＢＳＴＤ－Ｂ７３ストリームであることが特定される。ＲＴＰマップ属性フィールド３１５により示されるメディアクロック周波数は、ここでは２７ＭＨｚである。 The RTP map attribute field 315 is related to the media description field at the beginning of the media level section 313 by the payload type number (“110”) having the same value as the format type number. The subtype name of the RTP map attribute field 315 describes, for example, the protocol name “ARIB_STD-B73”, which indicates that the broadcast signal stream transmitted by the sender providing the SDP object 311 is an ARIB STD-B73 stream. is specified. The media clock frequency indicated by the RTP map attribute field 315 is now 27 MHz.

フォーマット固有パラメータ属性フィールド３１６は、フォーマット形式番号（“110”）によって、メディアレベルセクション３１３の冒頭のメディア記述フィールドに関連付けられる。フォーマット固有パラメータ属性フィールド３１６は、少なくとも、第１のエッセンスタイプのエッセンスデータに関連する第１の属性情報及び第２のエッセンスタイプのエッセンスデータに関連する第２の属性情報を含む。図１８の例においては、フォーマット固有パラメータ属性フィールド３１６は、以下のように分類される情報を含む：
・映像関連属性－映像エッセンスのエッセンスデータに関連する属性情報
・音声関連属性－音声エッセンスのエッセンスデータに関連する属性情報
・補助データ関連属性－補助データエッセンスのエッセンスデータに関連する属性情報
・ＦＥＣ関連属性－誤り訂正方式に関連する属性情報
・トラフィックシェーピング関連属性－トラフィックシェーピングに関連する属性情報 The format specific parameter attribute field 316 is related to the media description field at the beginning of the media level section 313 by the format type number (“110”). Format specific parameter attribute field 316 includes at least first attribute information associated with essence data of a first essence type and second attribute information associated with essence data of a second essence type. In the example of FIG. 18, format-specific parameter attribute field 316 contains information categorized as follows:
- Video-related attribute - attribute information related to essence data of video essence - Audio-related attribute - attribute information related to essence data of audio essence - Auxiliary data-related attribute - attribute information related to essence data of auxiliary data essence - FEC-related Attribute - Attribute information related to error correction scheme Traffic shaping related attribute - Attribute information related to traffic shaping

次の表１～表５は、本実施形態においてフォーマット固有パラメータ属性フィールド３１６に記述され得る映像関連属性、音声関連属性、補助データ関連属性、ＦＥＣ関連属性及びトラフィックシェーピング関連属性のパラメータの一覧をそれぞれ示している。 The following Tables 1-5 respectively list parameters for video-related attributes, audio-related attributes, auxiliary data-related attributes, FEC-related attributes, and traffic shaping-related attributes that may be described in the format-specific parameter attribute field 316 in this embodiment. showing.

本実施形態では、映像エッセンスデータが圧縮されるかを示す圧縮関連情報が新たに定義される。この圧縮関連情報を含む映像関連属性は、ＳＤＰオブジェクト３１１のメディアレベルセクション３１３のフォーマット固有パラメータ属性フィールド３１６に記述され得る。具体的には、例えば、表１に示したように、圧縮関連情報は、映像エッセンスデータが圧縮されるかを示す圧縮パラメータ“compression”を含む。さらに、映像エッセンスデータが圧縮されることを当該圧縮パラメータが示す場合には、圧縮関連情報は、映像エッセンスデータを圧縮する際に利用されるコーデック（圧縮方式）を示すコーデックパラメータ“codec”を含む。 In this embodiment, compression-related information is newly defined to indicate whether video essence data is compressed. Video-related attributes containing this compression-related information can be described in the format-specific parameter attributes field 316 of the media level section 313 of the SDP object 311 . Specifically, for example, as shown in Table 1, the compression-related information includes a compression parameter "compression" indicating whether video essence data is compressed. Further, when the compression parameter indicates that the video essence data is compressed, the compression-related information includes a codec parameter "codec" indicating the codec (compression method) used when compressing the video essence data. .

既存のＳＤＰのフォーマットによれば、音声エッセンスデータに関連する音声チャンネル数情報は、図１６に例示したように、音声用のメディアレベルセクションのＲＴＰマップ属性フィールド（“a=rtpmap:…”）に記述される。しかし、本実施形態のように、複数のエッセンスタイプにとって共通のメディアレベルセクションのみを設け、メディアタイプの文字列として“video”を選択した場合、ＲＴＰマップ属性フィールドに音声チャンネル数情報を記述することはＳＤＰの規格に反する。そこで、本実施形態では、表２に示したように、フォーマット固有の音声関連属性として、音声チャンネル数情報を定義する。この音声チャンネル数情報を含む音声関連属性は、上述した映像関連属性と共に、ＳＤＰオブジェクト３１１のメディアレベルセクション３１３のフォーマット固有パラメータ属性フィールド３１６に記述され得る。具体的には、例えば、音声チャンネル数情報は、音声チャンネル数パラメータ“channel-number”を含む。ＳＭＰＴＥＳＴ２１１０－３０では音声チャンネル数は１からレベルに依存して異なる上限値までの範囲内の任意の整数であり得るが、ＡＲＩＢＳＴＤ－Ｂ７３では音声チャンネル数は４、８、１２又は１６のいずれかに制約される。 According to the existing SDP format, audio channel number information related to audio essence data is stored in the RTP map attribute field ("a=rtpmap:...") of the media level section for audio, as illustrated in FIG. Described. However, as in this embodiment, when only a common media level section is provided for multiple essence types and "video" is selected as the media type character string, the number of audio channels cannot be described in the RTP map attribute field. is against the SDP standard. Therefore, in this embodiment, as shown in Table 2, audio channel number information is defined as format-specific audio-related attributes. Audio-related attributes, including this audio channel number information, can be described in the format-specific parameter attributes field 316 of the media level section 313 of the SDP object 311, along with the video-related attributes described above. Specifically, for example, the audio channel number information includes an audio channel number parameter “channel-number”. Whereas in SMPTE ST2110-30 the number of audio channels can be any integer ranging from 1 to different upper limits depending on the level, in ARIB STD-B73 the number of audio channels can be 4, 8, 12 or 16. constrained by

本実施形態では、補助データ関連属性は、映像関連属性（及び音声関連属性）と共に、ＳＤＰオブジェクト３１１のメディアレベルセクション３１３のフォーマット固有パラメータ属性フィールド３１６に記述され得る。具体的には、例えば、表３に示したように、補助データ関連属性は、データＩＤ及びセカンダリデータＩＤ“DID_SDID”並びに映像ペイロードＩＤコード“VPID_Code”を含む。 In this embodiment, auxiliary data-related attributes may be described in format-specific parameter attribute field 316 of media level section 313 of SDP object 311, along with video-related attributes (and audio-related attributes). Specifically, for example, as shown in Table 3, the auxiliary data-related attributes include data ID and secondary data ID "DID_SDID" and video payload ID code "VPID_Code".

本実施形態では、エッセンスデータへ適用される誤り訂正方式を示す誤り訂正情報が新たに定義される。この誤り訂正情報は、上述した他の属性と共に、ＳＤＰオブジェクト３１１のメディアレベルセクション３１３のフォーマット固有パラメータ属性フィールド３１６に記述され得る。具体的には、例えば、表４に示したように、誤り訂正情報は、エッセンスデータへ適用される誤り訂正方式を示すタイプパラメータ“FECtype”と、当該タイプパラメータにより示される誤り訂正方式の設定値を示す設定パラメータと、を含む。タイプパラメータがＸＯＲ符号化（“XOR”）を示す場合には、設定パラメータは、誤り訂正ブロックサイズを示すサイズパラメータ“XORsize”を含む。一方、タイプパラメータがリードソロモン符号化（“RS”）を示す場合には、設定パラメータは、リードソロモン符号化の処理単位に相当するデータグラム数を示すデータグラム数パラメータ“RSnum”を含む。 In this embodiment, error correction information indicating an error correction method applied to essence data is newly defined. This error correction information can be described in the format specific parameter attribute field 316 of the media level section 313 of the SDP object 311 along with the other attributes described above. Specifically, for example, as shown in Table 4, the error correction information includes a type parameter "FECtype" indicating the error correction method applied to the essence data, and the set value of the error correction method indicated by the type parameter. and a configuration parameter indicating If the type parameter indicates XOR encoding (“XOR”), the configuration parameters include a size parameter “XORsize” indicating the error correction block size. On the other hand, when the type parameter indicates Reed-Solomon encoding (“RS”), the setting parameter includes a datagram number parameter “RSnum” indicating the number of datagrams corresponding to the processing unit of Reed-Solomon encoding.

本実施形態では、トラフィックシェーピング関連属性は、上述した他の属性と共に、ＳＤＰオブジェクト３１１のメディアレベルセクション３１３のフォーマット固有パラメータ属性フィールド３１６に記述され得る。具体的には、例えば、表５に示したように、トラフィックシェーピング関連属性は、センダのバッファのタイプを示すバッファタイプパラメータ“TP”を含む。 In this embodiment, the traffic shaping related attributes may be described in the format specific parameter attributes field 316 of the media level section 313 of the SDP object 311 along with the other attributes described above. Specifically, for example, as shown in Table 5, the traffic shaping related attributes include a buffer type parameter "TP" that indicates the type of the sender's buffer.

本実施形態では、パケット時間属性フィールド３１７は、フォーマット固有パラメータ属性フィールド３１６と同一のメディアレベルセクション３１３内に含まれる。これは、図１６の例においてパケット時間属性フィールド（“a=ptime:…”）が映像用のメディアレベルセクション３０３とは異なる音声用のメディアレベルセクション３０５内に含まれていたこととは対照的である。パケット時間属性フィールド３１７は、パケット内のメディアの時間長をミリ秒単位で示す。このフィールドにより示される時間長は、機器のバッファサイズを左右する。本実施形態では、パケット時間属性フィールド３１７により示される時間長は、音声エッセンスのみに適用され得る。 In this embodiment, packet time attribute field 317 is contained within the same media level section 313 as format specific parameter attribute field 316 . This is in contrast to the example of Figure 16 where the packet time attribute field ("a=ptime:...") was included in a different media level section 305 for audio than media level section 303 for video. is. Packet time attribute field 317 indicates the length of time of media in the packet in milliseconds. The length of time indicated by this field governs the buffer size of the device. In this embodiment, the length of time indicated by packet time attribute field 317 may apply only to audio essences.

なお、表１～表５に列挙したフォーマット固有パラメータは、一例に過ぎない。複数のエッセンスタイプにとって共通のメディアレベルセクションは、他の追加的なパラメータを含んでもよく、又は表に示したパラメータのうちの１つ以上が省略されてもよい。 Note that the format-specific parameters listed in Tables 1-5 are only examples. Media level sections common to multiple essence types may include other additional parameters, or may omit one or more of the parameters shown in the table.

メディアレベルセクション３１８の内容は、マップＩＤ属性フィールドなど一部を除いて、メディアレベルセクション３１３と同様であってよい。冗長ＲＴＰストリーム方式が適用されない場合には、メディアレベルセクション３１８はＳＤＰオブジェクト３１１に含まれない。 The contents of the media level section 318 may be similar to the media level section 313 except for some parts such as the map ID attribute field. The media level section 318 is not included in the SDP object 311 if the redundant RTP stream scheme is not applied.

本項で説明したＳＤＰオブジェクトのフォーマット構造を用いることで、ＡＲＩＢＳＴＤ－Ｂ７３ストリームのようなエッセンス混在型ストリームの属性を、ストリームの構造に即して適切に記述することが可能となる。さらに、映像エッセンスの圧縮をサポートし、かつ誤り訂正符号化／復号も可能なＡＲＩＢＳＴＤ－Ｂ７３ストリームを、当該ストリームの属性に関する情報をＳＤＰオブジェクトの提供を通じて不足なく交換することによりセットアップすることが可能となる。 By using the format structure of the SDP object described in this section, it is possible to appropriately describe the attributes of a mixed-essence stream such as the ARIB STD-B73 stream in line with the structure of the stream. Furthermore, it is possible to set up an ARIB STD-B73 stream that supports compression of video essence and is also capable of error correction encoding/decoding by exchanging information on the attributes of the stream without shortage through the provision of SDP objects. becomes.

＜５－３．放送局システムの構成例＞
図１９は、第３の実施形態に係る放送局システム３の概略的な構成の一例を示すブロック図である。図１９を参照すると、放送局システム３は、１つ以上の送信ノード３００ａ～３００ｎと、制御ノード４００と、１つ以上の受信ノード４５０ａ～４５０ｎとを含む。 <5-3. Configuration example of broadcasting station system>
FIG. 19 is a block diagram showing an example of a schematic configuration of a broadcasting station system 3 according to the third embodiment. Referring to FIG. 19, broadcasting station system 3 includes one or more transmitting nodes 300a-300n, a control node 400, and one or more receiving nodes 450a-450n.

送信ノード３００ａ～３００ｎの各々（以下、送信ノード３００という）は、放送局システム３において放送信号ストリームを送信可能な放送信号処理ノードである。各送信ノード３００は、少なくとも１つのセンダ６０を有する。送信ノード３００は、自らが送信可能なストリームの属性を記述したＳＤＰオブジェクトを制御ノード４００へ提供する。送信ノード３００により提供されるＳＤＰオブジェクトは、図１７及び図１８を用いて説明した、エッセンス混在型ストリームのための新たなフォーマットに従って記述され得る。 Each of transmission nodes 300a to 300n (hereinafter referred to as transmission node 300) is a broadcast signal processing node capable of transmitting broadcast signal streams in broadcasting station system 3. FIG. Each sending node 300 has at least one sender 60 . The sending node 300 provides the control node 400 with an SDP object describing the attributes of the streams that it can send. The SDP object provided by the sending node 300 can be described according to the new format for the mixed-essence stream described with reference to FIGS. 17 and 18. FIG.

制御ノード４００は、放送局のＩＰネットワークにおける、放送信号ストリームの送信ノードから受信ノードへの送信を制御するノードである。放送局システム３において送信される放送信号ストリームは、異なるタイプのエッセンスデータを単一のポート番号で伝送するエッセンス混在型ストリームを含む。制御ノード４００は、例えば、図１に例示したＡＰＳ４０又は制御端末５０のような外部装置からのリクエストの受信に応じて、指定されるノード間のストリームの伝送をセットアップし、送信ノード３００へ伝送開始を指示する。エッセンス混在型ストリームの伝送のセットアップは、送信元の送信ノード３００から提供される上述したＳＤＰオブジェクトの記述に従って行われ得る。 The control node 400 is a node that controls transmission of a broadcast signal stream from a transmission node to a reception node in the IP network of the broadcast station. Broadcast signal streams transmitted in the broadcasting station system 3 include mixed essence streams that transmit different types of essence data with a single port number. Control node 400 sets up transmission of a stream between designated nodes in response to receiving a request from an external device such as APS 40 or control terminal 50 illustrated in FIG. to direct. Transmission setup of the mixed-essence stream can be performed according to the above-described SDP object description provided by the transmission node 300 of the transmission source.

受信ノード４５０ａ～４５０ｎの各々（以下、受信ノード４５０という）は、放送局システム３において放送信号ストリームを受信可能な放送信号処理ノードである。各受信ノード４５０は、少なくとも１つのレシーバ６５を有する。受信ノード４５０は、制御ノード４００による制御の下で、放送信号ストリームを受信するための受信処理をＳＤＰオブジェクトの記述に従って構成し、放送信号ストリームを送信ノード３００から受信する。受信対象の放送信号ストリームがエッセンス混在型ストリームである場合には、受信ノード４５０は、異なるタイプのエッセンスデータを単一のポート番号で（即ち、単一のストリーム内で）受信する。受信ノード４５０の構成は、第１の実施形態において説明した放送信号処理ノード１００、又は第２の実施形態において説明した放送信号処理ノード２００の構成と同様であってよい。 Each of the receiving nodes 450 a to 450 n (hereinafter referred to as receiving node 450 ) is a broadcast signal processing node capable of receiving broadcast signal streams in the broadcasting station system 3 . Each receiving node 450 has at least one receiver 65 . The receiving node 450 , under the control of the control node 400 , configures reception processing for receiving the broadcast signal stream according to the description of the SDP object and receives the broadcast signal stream from the transmitting node 300 . If the broadcast signal stream to be received is a mixed-essence stream, the receiving node 450 receives different types of essence data on a single port number (ie, within a single stream). The configuration of the receiving node 450 may be the same as the configuration of the broadcast signal processing node 100 described in the first embodiment or the configuration of the broadcast signal processing node 200 described in the second embodiment.

＜５－４．送信ノードの構成例＞
図２０は、本実施形態に係る送信ノード３００の構成の一例を示すブロック図である。図２０を参照すると、送信ノード３００は、デバイス内部クロック１１０、メディアクロック１１２、ＲＴＰクロック１１４、ＰＴＰ処理部１１６、通信部３２０、送信ストリーム処理部１３０、受信ストリーム処理部１４０、データ処理部１８０、制御部３９０及び記憶部３９５を備える。 <5-4. Configuration example of transmission node>
FIG. 20 is a block diagram showing an example of the configuration of the transmission node 300 according to this embodiment. Referring to FIG. 20, the transmission node 300 includes a device internal clock 110, a media clock 112, an RTP clock 114, a PTP processing unit 116, a communication unit 320, a transmission stream processing unit 130, a reception stream processing unit 140, a data processing unit 180, A control unit 390 and a storage unit 395 are provided.

（１）通信部
通信部３２０は、送信ノード３００による他のノードとの通信を仲介するインタフェースである。通信部３２０は、有線通信のための接続端子及び接続回路を含んでもよく、又は無線通信のためのアンテナ、ＲＦ回路及びベースバンド回路を含んでもよい。本実施形態において、通信部３２０は、送信部３２２及び受信部３２４を含む。 (1) Communication Unit The communication unit 320 is an interface that mediates communication between the transmission node 300 and other nodes. The communication unit 320 may include connection terminals and connection circuitry for wired communication, or may include an antenna, RF circuitry, and baseband circuitry for wireless communication. In this embodiment, the communication unit 320 includes a transmitter 322 and a receiver 324 .

送信部３２２は、センダ６０としての役割を有し、エッセンス混在型ストリームを放送局システム３のＩＰネットワークへ送信する。当該放送信号ストリームは、例えば、ＡＲＩＢＳＴＤ－Ｂ７３ストリームであってよい。具体的には、送信ストリーム処理部１３０により生成される、放送信号ストリームのための一連のＲＴＰパケットが送信部３２２へ入力される。各ＲＴＰパケットは、第１の実施形態及び第２の実施形態と同様の時刻情報（ＲＴＰタイムスタンプ及び／又はフレームカウント情報）を含む。送信部３２２は、入力される各ＲＴＰパケットにネットワークヘッダを追加して、各パケットを他のノードへ送信する。 The transmission unit 322 has a role as the sender 60 and transmits the essence-mixed stream to the IP network of the broadcasting station system 3 . The broadcast signal stream may be, for example, an ARIB STD-B73 stream. Specifically, a series of RTP packets for a broadcast signal stream generated by transmission stream processing section 130 are input to transmission section 322 . Each RTP packet includes time information (RTP timestamp and/or frame count information) similar to the first and second embodiments. The transmitting unit 322 adds a network header to each input RTP packet and transmits each packet to another node.

送信部３２２及び受信部３２４は、ストリームの伝送に関連する制御通信にも関与する。具体的には、例えば、受信部３２４は、制御ノード４００（又は受信ノード４５０などの他のノード）からストリームの伝送に関連する様々な制御メッセージを受信する。送信部３２２は、制御ノード４００（又は受信ノード４５０などの他のノード）へ制御メッセージに対する応答メッセージを送信する。 Transmitter 322 and receiver 324 are also responsible for control communications associated with the transmission of streams. Specifically, for example, receiver 324 receives various control messages related to transmission of the stream from control node 400 (or other nodes such as receiver node 450). The transmitter 322 transmits a response message to the control message to the control node 400 (or another node such as the receiving node 450).

本実施形態において、受信部３２４は、レシーバ６５としての役割を有していてもいなくてもよい。 In this embodiment, the receiver 324 may or may not function as the receiver 65 .

（２）制御部
制御部３９０は、例えば、ＣＰＵ（Central Processing Unit）、ＭＰＵ（Micro Processing Unit）又はマイクロコントローラといった１つ以上のプロセッサを含む。制御部３９０は、記憶部３９５により記憶されるコンピュータプログラムを実行することにより、送信ノード３００の動作の全般を制御する。 (2) Controller The controller 390 includes, for example, one or more processors such as a CPU (Central Processing Unit), MPU (Micro Processing Unit), or microcontroller. Control unit 390 controls overall operations of transmission node 300 by executing computer programs stored in storage unit 395 .

例えば、制御部３９０は、送信ノード３００が放送局システム３のＩＰネットワークへ接続されると、ｍＤＮＳ（multicast Domain Name System）クエリの発行とその応答の受信を通じて、制御ノード４００を発見する。さらに、制御部３９０は、例えば、受信部３２４を介して、制御ノード４００からＳＤＰオブジェクトの提供を求める制御メッセージを受信する。当該制御メッセージの受信に応じて、制御部３９０は、エッセンス混在型ストリームのための上述した新たなフォーマットに従って放送信号ストリームの属性を記述したＳＤＰオブジェクトを、記憶部３９５から取得する。そして、制御部３９０は、取得したＳＤＰオブジェクトを送信部３２２を介して制御ノード４００へ送信する。 For example, when the transmission node 300 is connected to the IP network of the broadcasting station system 3, the control unit 390 discovers the control node 400 by issuing an mDNS (multicast Domain Name System) query and receiving the response. Further, the control unit 390 receives a control message requesting provision of an SDP object from the control node 400 via the receiving unit 324, for example. In response to receiving the control message, the control unit 390 acquires from the storage unit 395 an SDP object describing attributes of the broadcast signal stream according to the above-described new format for the mixed-essence stream. The control unit 390 then transmits the acquired SDP object to the control node 400 via the transmission unit 322 .

送信ノード３００により提供されるＳＤＰオブジェクトに基づいてストリームの伝送がセットアップされた後、制御部３９０は、制御ノード４００から、ストリームの送信の開始を指示する制御メッセージを受信部３２４を介して受信する。当該制御メッセージの受信に応じて、制御部３９０は、送信部３２２からの放送信号ストリームの送信を開始する。 After the transmission of the stream is set up based on the SDP object provided by the transmitting node 300, the control unit 390 receives a control message from the control node 400 via the receiving unit 324 instructing the start of transmission of the stream. . In response to receiving the control message, controller 390 initiates transmission of the broadcast signal stream from transmitter 322 .

例えば、送信ノード３００は、ストリームの伝送を管理し及び制御するための制御インタフェース規格の集合であるＮＭＯＳをサポートしてもよい。この場合、上述した制御メッセージの交換は、例えば、ＮＭＯＳＩＳ－０４及びＮＭＯＳＩＳ－０５において予め定義されるＨＴＴＰ（Hypertext Transfer Protocol）ベースの制御ＡＰＩを介して行われ得る。 For example, the sending node 300 may support NMOS, a set of control interface standards for managing and controlling the transmission of streams. In this case, the above-mentioned exchange of control messages may be performed via an HTTP (Hypertext Transfer Protocol)-based control API, which is predefined in NMOS IS-04 and NMOS IS-05, for example.

（３）記憶部
記憶部３９５は、一時的な及び非一時的なコンピュータ読取可能なメモリを含む。一時的なメモリは、例えばＲＡＭ（Random Access Memory）を含み得る。非一時的なメモリは、例えばＲＯＭ（Read Only Memory）、ＨＤＤ（Hard Disk Drive）又はＳＳＤ（Solid State Drive）のうちの１つ以上を含み得る。記憶部３９５は、送信ノード３００の機能性を実現するためのコンピュータプログラムを記憶する。さらに、本実施形態において、記憶部３９５は、送信ノード３００により送信可能な放送信号ストリームの属性を記述したＳＤＰオブジェクトを記憶する。 (3) Storage Unit Storage unit 395 includes temporary and non-transitory computer readable memory. Temporary memory may include, for example, RAM (Random Access Memory). The non-transitory memory may include, for example, one or more of ROM (Read Only Memory), HDD (Hard Disk Drive), or SSD (Solid State Drive). The storage unit 395 stores computer programs for implementing the functionality of the sending node 300 . Furthermore, in this embodiment, the storage unit 395 stores SDP objects describing attributes of broadcast signal streams that can be transmitted by the transmission node 300 .

ある観点において、上記ＳＤＰオブジェクトは、複数のエッセンスタイプにとって共通のメディア記述フィールドに関連付けられる属性フィールド内に、第１のエッセンスタイプのエッセンスデータに関連する第１の属性情報及び第２のエッセンスタイプのエッセンスデータに関連する第２の属性情報を含む。一例として、第１のエッセンスタイプは、映像エッセンスであってよく、第２のエッセンスタイプは、音声エッセンスであってよい。この場合、ＳＤＰオブジェクトは、複数のエッセンスタイプに共通的な属性フィールド内に、表１に例示したような映像関連属性を第１の属性情報として、表２に例示したような音声関連属性を第２の属性情報として含み得る。他の例として、第１のエッセンスタイプは、映像エッセンスであってよく、第２のエッセンスタイプは、補助データエッセンスであってよい。この場合、ＳＤＰオブジェクトは、複数のエッセンスタイプに共通的な属性フィールド内に、表１に例示したような映像関連属性を第１の属性情報として、表３に例示したような補助データ関連属性を第２の属性情報として含み得る。なお、これらの例に限定されず、送信ノード３００により提供されるＳＤＰオブジェクトは、表１～表５に例示した情報のいかなる組合せを含んでもよい。 In one aspect, the SDP object includes, in an attribute field associated with a media description field common to multiple essence types, first attribute information associated with essence data of a first essence type and of a second essence type. It includes second attribute information associated with the essence data. As an example, the first essence type may be a video essence and the second essence type may be an audio essence. In this case, the SDP object contains video-related attributes as exemplified in Table 1 as first attribute information and audio-related attributes as exemplified in Table 2 as second attribute information in an attribute field common to a plurality of essence types. 2 attribute information. As another example, the first essence type may be a video essence and the second essence type may be an auxiliary data essence. In this case, the SDP object has video-related attributes as shown in Table 1 as first attribute information and auxiliary data-related attributes as shown in Table 3 in the attribute field common to multiple essence types. It can be included as second attribute information. However, without being limited to these examples, the SDP object provided by the sending node 300 may include any combination of the information exemplified in Tables 1-5.

他の観点において、上記ＳＤＰオブジェクトは、映像エッセンスデータが圧縮されるかを示す圧縮関連情報、音声エッセンスデータに関連する音声チャンネル数情報、及びエッセンスデータへ適用される誤り訂正方式を示す誤り訂正情報、のうちの１つ以上を、放送信号ストリームのフォーマット固有のパラメータを記述する属性フィールド内に含む。表１を用いて説明したように、上記圧縮関連情報は、映像エッセンスデータが圧縮されるかを示す圧縮パラメータと、映像エッセンスデータが圧縮されることを当該圧縮パラメータが示す場合に、映像エッセンスデータを圧縮する際に利用されるコーデックを示すコーデックパラメータと、を含み得る。また、表４を用いて説明したように、上記誤り訂正情報は、エッセンスデータへ適用される誤り訂正方式を示すタイプパラメータと、当該タイプパラメータにより示される誤り訂正方式の設定値を示す設定パラメータとを含み得る。上記設定パラメータは、例えば、ＸＯＲ符号化のための誤り訂正ブロックサイズを示すサイズパラメータ、又は、リードソロモン符号化のための処理単位に相当するデータグラム数を示すデータグラム数パラメータであり得る。 In another aspect, the SDP object includes compression-related information indicating whether the video essence data is compressed, audio channel number information associated with the audio essence data, and error correction information indicating the error correction scheme applied to the essence data. , in the attribute field that describes the format-specific parameters of the broadcast signal stream. As described using Table 1, the compression-related information includes a compression parameter indicating whether the video essence data is to be compressed, and if the compression parameter indicates that the video essence data is to be compressed, the video essence data is compressed. and a codec parameter indicating the codec to be used in compressing the . Further, as described using Table 4, the error correction information includes a type parameter indicating the error correction method applied to the essence data, and a setting parameter indicating the setting value of the error correction method indicated by the type parameter. can include The setting parameter may be, for example, a size parameter indicating an error correction block size for XOR encoding, or a datagram number parameter indicating the number of datagrams corresponding to a processing unit for Reed-Solomon encoding.

（４）送信制御処理の流れ－第１の例
図２１は、送信ノード３００により実行され得る送信制御処理の流れの第１の例を示すフローチャートである。 (4) Flow of Transmission Control Processing--First Example FIG.

まず、制御部３９０は、送信ノード３００のＩＰネットワークへの接続に応じて、制御ノード４００を発見する（ステップＳ３０１）。 First, the control unit 390 discovers the control node 400 according to the connection of the transmission node 300 to the IP network (step S301).

次いで、制御部３９０は、送信ノード３００により送信可能な放送信号ストリームの属性を記述したＳＤＰオブジェクトを記憶部３９５から取得し、取得したＳＤＰオブジェクトを制御ノード４００へ提供する（ステップＳ３０３）。ここで提供されるＳＤＰオブジェクトは、複数のエッセンスタイプにとって共通のメディア記述フィールドに関連付けられる属性フィールド内に、第１及び第２のエッセンスタイプのエッセンスデータに関連する属性情報を含む。 Next, the control unit 390 acquires from the storage unit 395 an SDP object describing attributes of broadcast signal streams that can be transmitted by the transmission node 300, and provides the acquired SDP object to the control node 400 (step S303). The SDP object provided herein contains attribute information related to essence data of first and second essence types in attribute fields associated with media description fields common to multiple essence types.

その後、送信ノード３００は、ストリームの送信の開始の指示を待ち受ける（ステップＳ３０５）。そして、制御ノード４００（又は他のノード）からストリームの送信の開始を指示する制御メッセージが受信されると、送信部３２２は、制御部３９０による制御の下で、上記ＳＤＰオブジェクトにより示される属性を有する放送信号ストリームの送信を開始する（ステップＳ３０７）。 After that, the transmitting node 300 waits for an instruction to start transmitting the stream (step S305). Then, when a control message instructing the start of stream transmission is received from the control node 400 (or another node), the transmission unit 322, under the control of the control unit 390, converts the attributes indicated by the SDP object to start transmitting the broadcast signal stream it has (step S307).

（５）送信制御処理の流れ－第２の例
図２２は、送信ノード３００により実行され得る送信制御処理の流れの第２の例を示すフローチャートである。 (5) Flow of Transmission Control Processing--Second Example FIG.

次いで、制御部３９０は、送信ノード３００により送信可能な放送信号ストリームの属性を記述したＳＤＰオブジェクトを記憶部３９５から取得し、取得したＳＤＰオブジェクトを制御ノード４００へ提供する（ステップＳ３０４）。ここで提供されるＳＤＰオブジェクトは、圧縮関連情報、音声チャンネル数情報、及び誤り訂正情報のうちの１つ以上を、放送信号ストリームのフォーマット固有のパラメータを記述する属性フィールド内に含む。 Next, the control unit 390 acquires from the storage unit 395 an SDP object describing attributes of broadcast signal streams that can be transmitted by the transmission node 300, and provides the acquired SDP object to the control node 400 (step S304). The SDP objects provided herein contain one or more of compression-related information, audio channel number information, and error correction information in attribute fields that describe format-specific parameters of the broadcast signal stream.

＜５－５．制御ノードの構成例＞
図２３は、本実施形態に係る制御ノード４００の構成の一例を示すブロック図である。図２３を参照すると、制御ノード４００は、通信部４１０、制御部４２０及び記憶部４３０を備える。 <5-5. Configuration example of control node>
FIG. 23 is a block diagram showing an example of the configuration of the control node 400 according to this embodiment. Referring to FIG. 23, the control node 400 includes a communication unit 410, a control unit 420 and a storage unit 430.

（１）通信部
通信部４１０は、制御ノード４００による他のノードとの通信を仲介するインタフェースである。通信部４１０は、有線通信のための接続端子及び接続回路を含んでもよく、又は無線通信のためのアンテナ、ＲＦ回路及びベースバンド回路を含んでもよい。本実施形態において、通信部４１０は、送信部４１２及び受信部４１４を含む。 (1) Communication Unit The communication unit 410 is an interface that mediates communication between the control node 400 and other nodes. The communication unit 410 may include connection terminals and connection circuitry for wired communication, or may include an antenna, RF circuitry, and baseband circuitry for wireless communication. In this embodiment, the communication unit 410 includes a transmitter 412 and a receiver 414 .

送信部４１２及び受信部４１４は、放送局システム３内のＩＰネットワーク上でのストリームの伝送に関連する制御通信に関与する。具体的には、例えば、送信部４１２は、送信ノード３００へ、ストリームの伝送のセットアップ、送信の開始又は終了のための制御メッセージを送信し得る。同様に、送信部４１２は、受信ノード４５０へ、ストリームの伝送のセットアップ、受信の開始又は終了のための制御メッセージを送信し得る。受信部４１４は、送信ノード３００及び受信ノード４５０から、制御メッセージに対する応答メッセージを受信し得る。 The transmitter 412 and receiver 414 are involved in control communications related to the transmission of streams over the IP network within the broadcast station system 3 . Specifically, for example, the transmitting unit 412 may transmit control messages to the transmitting node 300 for setting up transmission of a stream, starting or ending transmission. Similarly, transmitter 412 may send control messages to receiver node 450 to set up transmission of the stream, start or end reception. The receiving unit 414 can receive response messages to control messages from the transmitting node 300 and the receiving node 450 .

（２）制御部
制御部４２０は、例えば、ＣＰＵ、ＭＰＵ又はマイクロコントローラといった１つ以上のプロセッサを含む。制御部４２０は、記憶部４３０により記憶されるコンピュータプログラムを実行することにより、制御ノード４００の動作の全般を制御する。本実施形態において、制御部４２０は、情報管理部４２２及びストリーム制御部４２４を含む。 (2) Controller The controller 420 includes one or more processors such as a CPU, MPU, or microcontroller. The control unit 420 controls overall operations of the control node 400 by executing computer programs stored in the storage unit 430 . In this embodiment, the controller 420 includes an information manager 422 and a stream controller 424 .

情報管理部４２２は、放送局システム３内の放送信号処理ノードに関する情報のデータベースへの登録及び管理を行う。例えば、情報管理部４２２は、ＩＰネットワークへ接続した送信ノード３００を発見すると、発見した送信ノード３００へ、ＳＤＰオブジェクトの提供を求める制御メッセージを送信部４１２を介して送信する。そして、情報管理部４２２は、送信ノード３００からＳＤＰオブジェクトを受信部４１４を介して受信する。送信ノード３００から受信されるＳＤＰオブジェクトは、送信ノード３００により送信可能な放送信号ストリームの属性を記述している。情報管理部４２２は、受信されるＳＤＰオブジェクトに記述されている情報を記憶部４３０のデータベースへ登録する。情報管理部４２２は、受信ノード４５０からも同様にＳＤＰオブジェクトを取得し、受信ノード４５０により受信可能なストリームの属性などの情報を記憶部４３０のデータベースへ登録し得る。 The information management unit 422 registers and manages information about broadcast signal processing nodes in the broadcasting station system 3 in the database. For example, when the information management unit 422 discovers the transmission node 300 connected to the IP network, the information management unit 422 transmits a control message requesting provision of the SDP object to the discovered transmission node 300 via the transmission unit 412 . The information management unit 422 then receives the SDP object from the transmission node 300 via the reception unit 414 . SDP objects received from transmitting node 300 describe attributes of broadcast signal streams that can be transmitted by transmitting node 300 . The information management unit 422 registers information described in the received SDP object in the database of the storage unit 430 . The information management unit 422 can similarly acquire an SDP object from the receiving node 450 and register information such as attributes of streams receivable by the receiving node 450 in the database of the storage unit 430 .

ストリーム制御部４２４は、放送局システム３内の送信ノード３００から受信ノード４５０へのストリームの伝送を制御する。例えば、ストリーム制御部４２４は、ストリームの伝送を求めるリクエストがＡＰＳ４０又は制御端末５０から受信された場合に、指定された送信ノード３００からの放送信号ストリームの受信をセットアップするように受信ノード４５０に指示する。ストリーム制御部４２４は、例えば、記憶部４３０のデータベースに登録されているストリームの属性を参照することにより、受信処理がどのようにセットアップされるべきかを決定し得る。例えば、次のうちの１つ以上が、ストリームの属性に依存して決定され得る：
・受信される映像エッセンスデータについて逆圧縮を実行すべきか
・逆圧縮を実行する際に利用すべきコーデック
・いくつの音声チャンネルが音声エッセンスデータに含まれるか
・誤り訂正方式としてどの方式を使用すべきか（ＸＯＲ又はＲＳ）
・ＸＯＲ復号により誤り訂正を実行する際のＦＥＣブロックサイズ
・ＲＳ復号により誤り訂正を実行する際の処理単位となるデータグラム数 The stream control unit 424 controls transmission of streams from the transmission node 300 in the broadcasting station system 3 to the reception node 450 . For example, stream controller 424 instructs receiving node 450 to set up reception of a broadcast signal stream from designated transmitting node 300 when a request for transmission of the stream is received from APS 40 or control terminal 50. do. The stream control unit 424 can determine how the reception process should be set up, for example, by referring to the stream attributes registered in the database of the storage unit 430 . For example, one or more of the following may be determined depending on attributes of the stream:
・Should decompression be performed on the received video essence data? ・Codecs to be used when performing decompression ・How many audio channels are included in the audio essence data ・Which method should be used as an error correction method? (XOR or RS)
・FEC block size when executing error correction by XOR decoding ・Number of datagrams used as a processing unit when executing error correction by RS decoding

受信ノード４５０は、ストリーム制御部４２４から受信される上記指示に従って、受信処理を構成し及び対応するマルチキャストグループへ加入することにより、放送信号ストリームの受信をセットアップする。そして、ストリーム制御部４２４は、送信ノード３００へ放送信号ストリームの送信の開始を指示する。それに応じて、送信ノード３００は、放送信号ストリームの送信を開始する。 The receiving node 450 sets up reception of the broadcast signal stream by configuring the reception process and joining the corresponding multicast group according to the above instructions received from the stream controller 424 . The stream control unit 424 then instructs the transmission node 300 to start transmitting the broadcast signal stream. In response, transmitting node 300 begins transmitting the broadcast signal stream.

（３）記憶部
記憶部４３０は、一時的な及び非一時的なコンピュータ読取可能なメモリを含む。一時的なメモリは、例えばＲＡＭを含み得る。非一時的なメモリは、例えばＲＯＭ、ＨＤＤ又はＳＳＤのうちの１つ以上を含み得る。記憶部４３０は、制御ノード４００の機能性を実現するためのコンピュータプログラムを記憶する。さらに、本実施形態において、記憶部４３０は、情報管理部４２２により放送局システム３内のノードからそれぞれ収集されるＳＤＰオブジェクトに記述されている情報をデータベース内に記憶する。 (3) Storage Unit Storage unit 430 includes temporary and non-transitory computer-readable memory. Temporary memory may include, for example, RAM. Non-transitory memory may include, for example, one or more of ROM, HDD or SSD. The storage unit 430 stores computer programs for implementing the functionality of the control node 400 . Furthermore, in this embodiment, the storage unit 430 stores information described in the SDP objects collected from each node in the broadcasting station system 3 by the information management unit 422 in the database.

（４）送信制御処理の流れ－第１の例
図２４は、制御ノード４００により実行され得る送信制御処理の流れの第１の例を示すフローチャートである。 (4) Flow of Transmission Control Processing--First Example FIG.

まず、情報管理部４２２は、ＩＰネットワークへ接続した送信ノード３００を発見する（ステップＳ４０１）。 First, the information management unit 422 discovers the transmission node 300 connected to the IP network (step S401).

次いで、情報管理部４２２は、発見した送信ノード３００により送信可能な放送信号ストリームの属性を記述したＳＤＰオブジェクトを、当該送信ノード３００から受信することにより取得する（ステップＳ４０３）。ここで取得されるＳＤＰオブジェクトは、複数のエッセンスタイプにとって共通のメディア記述フィールドに関連付けられる属性フィールド内に、第１及び第２のエッセンスタイプのエッセンスデータに関連する属性情報を含む。 Next, the information management unit 422 acquires an SDP object describing attributes of broadcast signal streams that can be transmitted by the found transmission node 300 by receiving it from the transmission node 300 (step S403). The SDP object obtained here contains attribute information associated with the essence data of the first and second essence types in attribute fields associated with media description fields common to multiple essence types.

ストリーム制御部４２４は、ストリームの伝送のセットアップを求めるリクエストを待ち受ける（ステップＳ４０５）。ストリームの伝送のセットアップを求めるリクエストは、例えば、ＡＰＳ４０又は制御端末５０といった外部装置から受信され得る。 The stream control unit 424 waits for a request for setting up transmission of the stream (step S405). A request to set up the transmission of a stream may be received from an external device, eg APS 40 or control terminal 50 .

ストリーム制御部４２４は、上記リクエストが受信されると、指定された受信ノード４５０に、上記ＳＤＰオブジェクトにより示される属性に従って受信処理を構成し及び対応するマルチキャストグループに加入するように指示する（ステップＳ４０７）。そして、ストリーム制御部４２４は、指定された送信ノード３００に、放送信号ストリームの送信を開始するように指示する（ステップＳ４０９）。 When the request is received, the stream control unit 424 instructs the specified receiving node 450 to configure the receiving process according to the attributes indicated by the SDP object and join the corresponding multicast group (step S407). ). The stream control unit 424 then instructs the designated transmission node 300 to start transmitting the broadcast signal stream (step S409).

（５）送信制御処理の流れ－第２の例
図２５は、制御ノード４００により実行され得る送信制御処理の流れの第２の例を示すフローチャートである。 (5) Flow of Transmission Control Processing—Second Example FIG. 25 is a flow chart showing a second example of the flow of transmission control processing that can be executed by the control node 400. FIG.

次いで、情報管理部４２２は、発見した送信ノード３００により送信可能な放送信号ストリームの属性を記述したＳＤＰオブジェクトを、当該送信ノード３００から受信することにより取得する（ステップＳ４０４）。ここで取得されるＳＤＰオブジェクトは、圧縮関連情報、音声チャンネル数情報、及び誤り訂正情報のうちの１つ以上を、放送信号ストリームのフォーマット固有のパラメータを記述する属性フィールド内に含む。 Next, the information management unit 422 obtains an SDP object describing attributes of broadcast signal streams that can be transmitted by the found transmission node 300 by receiving it from the transmission node 300 (step S404). The SDP object obtained here contains one or more of compression-related information, audio channel number information, and error correction information in attribute fields that describe format-specific parameters of the broadcast signal stream.

＜＜６．第４の実施形態＞＞
次いで、図２６及び図２７を用いて、第４の実施形態について説明する。上述した第３の実施形態は具体的な実施形態であり、一方で第４の実施形態はより一般化された実施形態である。 <<6. Fourth Embodiment>>
Next, a fourth embodiment will be described with reference to FIGS. 26 and 27. FIG. The third embodiment described above is a specific embodiment, while the fourth embodiment is a more generalized embodiment.

＜６－１．送信ノードの構成例＞
図２６は、第４の実施形態に係る送信ノード５００の構成の一例を示すブロック図である。図２６を参照すると、送信ノード５００は、送信部５１０及び制御部５２０を備える。 <6-1. Configuration example of transmission node>
FIG. 26 is a block diagram showing an example configuration of a transmission node 500 according to the fourth embodiment. Referring to FIG. 26, transmission node 500 includes transmission section 510 and control section 520 .

送信部５１０は、異なるタイプのエッセンスデータを単一のポート番号で伝送する放送信号ストリームを、放送局のＩＰネットワークへ送信する。 The transmitter 510 transmits a broadcast signal stream carrying different types of essence data on a single port number to the broadcaster's IP network.

制御部５２０は、放送信号ストリームの属性を記述するＳＤＰオブジェクトを他のノードへ提供する。 Controller 520 provides SDP objects describing attributes of the broadcast signal stream to other nodes.

ある観点において、上記ＳＤＰオブジェクトは、複数のエッセンスタイプにとって共通のメディア記述フィールドに関連付けられる属性フィールド内に、第１のエッセンスタイプのエッセンスデータに関連する第１の属性情報及び第２のエッセンスタイプのエッセンスデータに関連する第２の属性情報を含む。 In one aspect, the SDP object includes, in an attribute field associated with a media description field common to multiple essence types, first attribute information associated with essence data of a first essence type and of a second essence type. It includes second attribute information associated with the essence data.

他の観点において、上記ＳＤＰオブジェクトは、映像エッセンスデータが圧縮されるかを示す圧縮関連情報、音声エッセンスデータに関連する音声チャンネル数情報、及びエッセンスデータへ適用される誤り訂正方式を示す誤り訂正情報、のうちの１つ以上を、上記放送信号ストリームのフォーマット固有のパラメータを記述する属性フィールド内に含む。 In another aspect, the SDP object includes compression-related information indicating whether the video essence data is compressed, audio channel number information associated with the audio essence data, and error correction information indicating the error correction scheme applied to the essence data. , in an attribute field describing format-specific parameters of said broadcast signal stream.

ストリームの伝送のセットアップのために実行される送信制御処理が、送信ノード５００の上記動作ステップを含んでもよい。また、それら動作ステップをプロセッサに実行させるコンピュータプログラムが提供されてもよい。また、それら動作ステップをプロセッサに実行させるコンピュータプログラムを記憶した非一時的なコンピュータ読取可能な記憶媒体が提供されてもよい。加えて、第３の実施形態において説明した送信ノード３００の任意の機能又は処理が本実施形態に適用されてよい。 A transmission control process performed to set up transmission of a stream may include the above operational steps of the transmitting node 500 . A computer program may also be provided that causes a processor to perform those operational steps. A non-transitory computer-readable storage medium storing a computer program that causes a processor to perform those operational steps may also be provided. Additionally, any function or process of the sending node 300 described in the third embodiment may be applied to this embodiment.

＜６－２．制御ノードの構成例＞
図２７は、第４の実施形態に係る制御ノード６００の構成の一例を示すブロック図である。図２７を参照すると、制御ノード６００は、制御部６１０を備える。 <6-2. Configuration example of control node>
FIG. 27 is a block diagram showing an example of the configuration of the control node 600 according to the fourth embodiment. Referring to FIG. 27 , the control node 600 has a control section 610 .

制御ノード６００は、放送局のＩＰネットワークにおける、異なるタイプのエッセンスデータを単一のポート番号で伝送する放送信号ストリームの送信ノードから受信ノードへの送信を制御するノードである。制御部６１０は、上記放送信号ストリームの属性を記述するＳＤＰオブジェクトを上記送信ノードから取得する。そして、制御部６１０は、取得したＳＤＰオブジェクトに従って、上記放送信号ストリームの伝送をセットアップする。 The control node 600 is a node that controls the transmission of broadcast signal streams carrying different types of essence data on a single port number from a transmitting node to a receiving node in the broadcaster's IP network. The control unit 610 obtains an SDP object describing attributes of the broadcast signal stream from the transmitting node. The control unit 610 then sets up transmission of the broadcast signal stream according to the obtained SDP object.

ストリームの伝送のセットアップのために実行される送信制御処理が、制御ノード６００の上記動作ステップを含んでもよい。また、それら動作ステップをプロセッサに実行させるコンピュータプログラムが提供されてもよい。また、それら動作ステップをプロセッサに実行させるコンピュータプログラムを記憶した非一時的なコンピュータ読取可能な記憶媒体が提供されてもよい。加えて、第３の実施形態において説明した制御ノード４００の任意の機能又は処理が本実施形態に適用されてよい。 A transmission control process performed to set up transmission of a stream may include the above operational steps of the control node 600 . A computer program may also be provided that causes a processor to perform those operational steps. A non-transitory computer-readable storage medium storing a computer program that causes a processor to perform those operational steps may also be provided. Additionally, any function or process of the control node 400 described in the third embodiment may be applied to this embodiment.

＜＜７．第３の実施形態及び第４の実施形態のまとめ＞＞
ここまで、本開示の第３の実施形態及び第４の実施形態について詳細に説明した。これら実施形態では、異なるタイプのエッセンスデータを単一のポート番号で伝送するいわゆるエッセンス混在型ストリームの属性を記述するＳＤＰオブジェクトが、複数のエッセンスタイプにとって共通のメディア記述フィールドに関連付けられる属性フィールド内に、第１のエッセンスタイプのエッセンスデータに関連する第１の属性情報及び第２のエッセンスタイプのエッセンスデータに関連する第２の属性情報を含む。したがって、例えばＡＲＩＢＳＴＤ－Ｂ７３ストリームのようなエッセンス混在型ストリームの属性を、ストリームの構造に即してＳＤＰオブジェクト内に適切に記述することができる。その結果、ＳＤＰベースの管理用ＡＰＩを活用してＳＤＰオブジェクトを交換することにより、エッセンス混在型ストリームの伝送を簡易な手続でセットアップすることが可能となる。 <<7. Summary of Third Embodiment and Fourth Embodiment>>
So far, the third embodiment and the fourth embodiment of the present disclosure have been described in detail. In these embodiments, an SDP object describing the attributes of a so-called mixed-essence stream that carries different types of essence data on a single port number is placed in an attribute field associated with a media description field common to multiple essence types. , first attribute information associated with essence data of a first essence type and second attribute information associated with essence data of a second essence type. Therefore, attributes of a mixed-essence stream such as an ARIB STD-B73 stream can be appropriately described in the SDP object according to the structure of the stream. As a result, by exchanging SDP objects using the SDP-based management API, it becomes possible to set up the transmission of mixed-essence streams with a simple procedure.

例えば、図１５を用いて説明したように、ＳＭＰＴＥＳＴ２１１０ストリームのために通常利用される既存のＳＤＰオブジェクトのフォーマットは、映像関連属性が記述されるメディアレベルセクションとは別に、音声関連属性が記述されるメディアレベルセクション及び補助データ関連属性が記述されるメディアレベルセクションを有する。対照的に、第３の実施形態及び第４の実施形態のように、２つ以上のエッセンスタイプにとって共通のメディアレベルセクションにそれらエッセンスタイプのエッセンスデータに関連する属性を記述することで、１つのメディア（ストリーム）に対し１つのメディアレベルセクションというＳＤＰの解釈の一貫性を保つことができる。 For example, as described with FIG. 15, the existing SDP object format typically used for SMPTEST ST2110 streams describes audio-related attributes separately from the media-level section, which describes video-related attributes. It has a media level section in which attributes related to auxiliary data are described. In contrast, as in the third and fourth embodiments, describing the attributes associated with the essence data of two or more essence types in a media level section common to those essence types allows one The SDP interpretation of one media level section per media (stream) can be kept consistent.

追加的に又は代替的に、エッセンス混在型ストリームの属性を記述する上記ＳＤＰオブジェクトが、映像エッセンスデータが圧縮されるかを示す圧縮関連情報、音声エッセンスデータに関連する音声チャンネル数情報、及びエッセンスデータへ適用される誤り訂正方式を示す誤り訂正情報、のうちの１つ以上をフォーマット固有のパラメータを記述する属性フィールド内に含む。したがって、例えばＡＲＩＢＳＴＤ－Ｂ７３ストリームのような独自の仕様を有する放送信号ストリームの属性を、ＳＤＰオブジェクトに不足なく記述することができる。その結果、ＳＤＰベースの管理用ＡＰＩを活用してＳＤＰオブジェクトを交換することにより、ＡＲＩＢＳＴＤ－Ｂ７３ストリームの伝送を簡易な手続でセットアップすることが可能となる。 Additionally or alternatively, the SDP object describing attributes of the mixed-essence stream includes compression-related information indicating whether video essence data is compressed, audio channel number information associated with audio essence data, and essence data. error correction information indicating the error correction scheme to be applied to the format-specific parameters. Therefore, the attributes of broadcast signal streams having unique specifications, such as ARIB STD-B73 streams, can be fully described in SDP objects. As a result, it becomes possible to set up the transmission of ARIB STD-B73 streams with a simple procedure by exchanging SDP objects by utilizing the SDP-based management API.

例えば、図１６を用いて説明したように、ＳＭＰＴＥＳＴ２１１０ストリームのために通常利用される既存のＳＤＰオブジェクトのフォーマットは、フォーマット固有パラメータ属性フィールド内に、圧縮関連情報、音声チャンネル数情報及び誤り訂正情報のいずれも有しない。対照的に、第３の実施形態及び第４の実施形態のように、フォーマット固有パラメータ属性フィールド内に、圧縮関連情報、音声チャンネル数情報及び／又は誤り訂正情報を含めることで、ＡＲＩＢＳＴＤ－Ｂ７３ストリームの受信側で、ＳＤＰオブジェクトの記述に従って受信処理を適切に構成することができる。 For example, as described with FIG. 16, the existing SDP object format typically used for SMPTEST ST2110 streams contains compression-related information, audio channel number information and error correction information in format-specific parameter attribute fields. does not have either In contrast, by including compression-related information, audio channel number information and/or error correction information in the format-specific parameter attribute field as in the third and fourth embodiments, ARIB STD-B73 At the receiving end of the stream, the receiving process can be configured appropriately according to the SDP object description.

なお、本開示に係る技術は、上述した実施形態に限定されるものではない。これらの実施形態は例示にすぎないということ、並びに、本開示のスコープ及び精神から逸脱することなく様々な変形が可能であるということが、当業者に理解されるであろう。 Note that the technology according to the present disclosure is not limited to the above-described embodiments. Those skilled in the art will appreciate that these embodiments are illustrative only and that various modifications are possible without departing from the scope and spirit of the disclosure.

例えば、フローチャートに示した処理ステップは、必ずしも図示した順序通りに実行されなくてもよい。例えば、処理ステップは図示した順序とは異なる順序で実行されてもよく、２つ以上の処理ステップが並列的に実行されてもよい。また、一部の処理ステップが削除されてもよく、さらなる処理ステップが追加されてもよい。 For example, the process steps shown in the flowcharts do not necessarily have to be performed in the order shown. For example, the processing steps may be performed in a different order than shown, and two or more processing steps may be performed in parallel. Also, some processing steps may be deleted and further processing steps may be added.

また、本明細書において説明したノードの機能は、ソフトウェア、ハードウェア、及びソフトウェアとハードウェアとの組み合わせのいずれで実現されてもよい。ソフトウェアを構成するコンピュータプログラムのプログラム命令は、例えば、ノードの内部又は外部の非一時的なコンピュータ読取可能な記憶媒体において記憶され、実行時にメモリへ読み込まれてプロセッサにより実行される。 Also, the node functions described herein may be implemented in software, hardware, or a combination of software and hardware. Program instructions of a computer program that constitutes software are stored, for example, in a non-transitory computer-readable storage medium inside or outside the node, read into memory at run time, and executed by a processor.

また、本明細書において単一の装置又は単一のノードにより実現されるものとして説明した技術が、複数の装置又は複数のノードが相互に連携することによりシステムとして実現されてもよい。 Also, the technology described herein as implemented by a single device or a single node may be implemented as a system by multiple devices or multiple nodes cooperating with each other.

上記実施形態の一部又は全部は、以下の付記のようにも記載され得るが、以下には限られない。 Some or all of the above embodiments may also be described in the following additional remarks, but are not limited to the following.

（付記１）
異なるタイプのエッセンスデータを単一のポート番号で伝送する放送信号ストリームを、放送局のＩＰネットワークへ送信する送信部と、
前記放送信号ストリームの属性を記述するＳＤＰ（Session Description Protocol）オブジェクトを他のノードへ提供する制御部と、
を備え、
前記ＳＤＰオブジェクトは、映像エッセンスデータが圧縮されるかを示す圧縮関連情報、音声エッセンスデータに関連する音声チャンネル数情報、及びエッセンスデータへ適用される誤り訂正方式を示す誤り訂正情報、のうちの１つ以上を、前記放送信号ストリームのフォーマット固有のパラメータを記述する属性フィールド内に含む、
送信ノード。 (Appendix 1)
a transmitter for transmitting a broadcast signal stream carrying different types of essence data on a single port number to an IP network of a broadcaster;
a control unit that provides an SDP (Session Description Protocol) object describing attributes of the broadcast signal stream to other nodes;
with
The SDP object is one of compression-related information indicating whether video essence data is compressed, audio channel number information associated with audio essence data, and error correction information indicating an error correction method applied to essence data. one or more within an attribute field describing format-specific parameters of said broadcast signal stream;
sending node.

（付記２）
前記圧縮関連情報は、
前記映像エッセンスデータが圧縮されるかを示す圧縮パラメータを含み、
前記映像エッセンスデータが圧縮されることを前記圧縮パラメータが示す場合に、前記映像エッセンスデータを圧縮する際に利用されるコーデックを示すコーデックパラメータをさらに含む、
付記１に記載の送信ノード。 (Appendix 2)
The compression-related information is
including a compression parameter indicating whether the video essence data is compressed;
further comprising a codec parameter indicating a codec to be used in compressing the video essence data when the compression parameter indicates that the video essence data is to be compressed;
The sending node of claim 1.

（付記３）
前記誤り訂正情報は、
前記エッセンスデータへ適用される前記誤り訂正方式を示すタイプパラメータと、
前記タイプパラメータにより示される前記誤り訂正方式の設定値を示す設定パラメータと、
を含む、付記１又は付記２に記載の送信ノード。 (Appendix 3)
The error correction information is
a type parameter indicating the error correction scheme applied to the essence data;
a setting parameter indicating a setting value of the error correction scheme indicated by the type parameter;
A transmitting node according to clause 1 or clause 2, comprising:

（付記４）
前記タイプパラメータがＸＯＲ符号化を示す場合に、前記設定パラメータは、誤り訂正ブロックサイズを示すサイズパラメータを含む、付記３に記載の送信ノード。 (Appendix 4)
4. The transmitting node of clause 3, wherein if the type parameter indicates XOR encoding, the configuration parameters include a size parameter indicating an error correction block size.

（付記５）
前記タイプパラメータがリードソロモン符号化を示す場合に、前記設定パラメータは、リードソロモン符号化の処理単位に相当するデータグラム数を示すデータグラム数パラメータを含む、付記３又は付記４に記載の送信ノード。 (Appendix 5)
The transmitting node according to appendix 3 or appendix 4, wherein when the type parameter indicates Reed-Solomon encoding, the setting parameter includes a datagram number parameter indicating the number of datagrams corresponding to a processing unit of Reed-Solomon encoding. .

（付記６）
前記制御部は、予め定義される管理用のアプリケーションプロトコルインタフェースを介して、前記ＳＤＰオブジェクトを前記他のノードへ提供する、付記１～５のいずれか１項に記載の送信ノード。 (Appendix 6)
6. The transmitting node according to any one of appendices 1 to 5, wherein the control unit provides the SDP object to the other node via a predefined management application protocol interface.

（付記７）
前記放送信号ストリームは、ＡＲＩＢＳＴＤ－Ｂ７３ストリームである、付記１～６のいずれか１項に記載の送信ノード。 (Appendix 7)
7. The transmitting node according to any one of the clauses 1-6, wherein the broadcast signal stream is an ARIB STD-B73 stream.

（付記８）
付記１～７のいずれか１項に記載の送信ノードと、
前記ＳＤＰオブジェクトの記述に従ってセットアップされる前記放送信号ストリームを受信する受信ノードと、
を含む放送局システム。 (Appendix 8)
a transmission node according to any one of Appendices 1 to 7;
a receiving node that receives the broadcast signal stream set up according to the SDP object description;
Broadcasting station system including.

（付記９）
放送局のＩＰネットワークにおける、異なるタイプのエッセンスデータを単一のポート番号で伝送する放送信号ストリームの送信ノードから受信ノードへの送信を制御する制御ノードであって、
前記放送信号ストリームの属性を記述するＳＤＰ（Session Description Protocol）オブジェクトを前記送信ノードから取得する制御部、を備え、
前記ＳＤＰオブジェクトは、映像エッセンスデータが圧縮されるかを示す圧縮関連情報、音声エッセンスデータに関連する音声チャンネル数情報、及びエッセンスデータへ適用される誤り訂正方式を示す誤り訂正情報、のうちの１つ以上を、前記放送信号ストリームのフォーマット固有のパラメータを記述する属性フィールド内に含む、
制御ノード。 (Appendix 9)
A control node for controlling transmission from a transmitting node to a receiving node of a broadcast signal stream carrying different types of essence data on a single port number in a broadcast station IP network,
a control unit that acquires an SDP (Session Description Protocol) object describing attributes of the broadcast signal stream from the transmission node;
The SDP object is one of compression-related information indicating whether video essence data is compressed, audio channel number information associated with audio essence data, and error correction information indicating an error correction method applied to essence data. one or more within an attribute field describing format-specific parameters of said broadcast signal stream;
control node.

（付記１０）
異なるタイプのエッセンスデータを単一のポート番号で伝送する放送信号ストリームの属性を記述するＳＤＰ（Session Description Protocol）オブジェクトを他のノードへ提供することと、
前記放送信号ストリームを、放送局のＩＰネットワークへ送信することと、
を含み、
前記ＳＤＰオブジェクトは、映像エッセンスデータが圧縮されるかを示す圧縮関連情報、音声エッセンスデータに関連する音声チャンネル数情報、及びエッセンスデータへ適用される誤り訂正方式を示す誤り訂正情報、のうちの１つ以上を、前記放送信号ストリームのフォーマット固有のパラメータを記述する属性フィールド内に含む、
送信制御方法。 (Appendix 10)
providing other nodes with an SDP (Session Description Protocol) object that describes attributes of a broadcast signal stream that carries different types of essence data on a single port number;
transmitting the broadcast signal stream to a broadcast station's IP network;
including
The SDP object is one of compression-related information indicating whether video essence data is compressed, audio channel number information associated with audio essence data, and error correction information indicating an error correction method applied to essence data. one or more within an attribute field describing format-specific parameters of said broadcast signal stream;
Transmission control method.

（付記１１）
異なるタイプのエッセンスデータを単一のポート番号で伝送する放送信号ストリームを、放送局のＩＰネットワークへ送信する送信ノードのプロセッサに、
前記放送信号ストリームの属性を記述するＳＤＰ（Session Description Protocol）オブジェクトを他のノードへ提供すること、
を実行させ、
前記ＳＤＰオブジェクトは、映像エッセンスデータが圧縮されるかを示す圧縮関連情報、音声エッセンスデータに関連する音声チャンネル数情報、及びエッセンスデータへ適用される誤り訂正方式を示す誤り訂正情報、のうちの１つ以上を、前記放送信号ストリームのフォーマット固有のパラメータを記述する属性フィールド内に含む、
コンピュータプログラム。 (Appendix 11)
to a processor at a transmitting node that transmits a broadcast signal stream carrying different types of essence data on a single port number to the IP network of the broadcaster;
providing an SDP (Session Description Protocol) object describing attributes of the broadcast signal stream to other nodes;
and
The SDP object is one of compression-related information indicating whether video essence data is compressed, audio channel number information associated with audio essence data, and error correction information indicating an error correction method applied to essence data. one or more within an attribute field describing format-specific parameters of said broadcast signal stream;
computer program.

（付記１２）
異なるタイプのエッセンスデータを単一のポート番号で伝送する放送信号ストリームを、放送局のＩＰネットワークへ送信する送信ノードのプロセッサに、
前記放送信号ストリームの属性を記述するＳＤＰ（Session Description Protocol）オブジェクトを他のノードへ提供すること、
を実行させ、
前記ＳＤＰオブジェクトは、映像エッセンスデータが圧縮されるかを示す圧縮関連情報、音声エッセンスデータに関連する音声チャンネル数情報、及びエッセンスデータへ適用される誤り訂正方式を示す誤り訂正情報、のうちの１つ以上を、前記放送信号ストリームのフォーマット固有のパラメータを記述する属性フィールド内に含む、
コンピュータプログラム、を記憶した非一時的なコンピュータ読取可能な記憶媒体。 (Appendix 12)
to a processor at a transmitting node that transmits a broadcast signal stream carrying different types of essence data on a single port number to the IP network of the broadcaster;
providing an SDP (Session Description Protocol) object describing attributes of the broadcast signal stream to other nodes;
and
The SDP object is one of compression-related information indicating whether video essence data is compressed, audio channel number information associated with audio essence data, and error correction information indicating an error correction method applied to essence data. one or more within an attribute field describing format-specific parameters of said broadcast signal stream;
A non-transitory computer-readable storage medium storing a computer program.

（付記１３）
放送局のＩＰネットワークにおける、異なるタイプのエッセンスデータを単一のポート番号で伝送する放送信号ストリームの送信ノードから受信ノードへの送信を制御するための送信制御方法であって、
前記放送信号ストリームの属性を記述するＳＤＰ（Session Description Protocol）オブジェクトを前記送信ノードから取得すること、
を含み、
前記ＳＤＰオブジェクトは、映像エッセンスデータが圧縮されるかを示す圧縮関連情報、音声エッセンスデータに関連する音声チャンネル数情報、及びエッセンスデータへ適用される誤り訂正方式を示す誤り訂正情報、のうちの１つ以上を、前記放送信号ストリームのフォーマット固有のパラメータを記述する属性フィールド内に含む、
送信制御方法。 (Appendix 13)
A transmission control method for controlling transmission from a transmission node to a reception node of a broadcast signal stream that transmits different types of essence data with a single port number in an IP network of a broadcast station, comprising:
obtaining an SDP (Session Description Protocol) object describing attributes of the broadcast signal stream from the transmitting node;
including
The SDP object is one of compression-related information indicating whether video essence data is compressed, audio channel number information associated with audio essence data, and error correction information indicating an error correction method applied to essence data. one or more within an attribute field describing format-specific parameters of said broadcast signal stream;
Transmission control method.

（付記１４）
放送局のＩＰネットワークにおける、異なるタイプのエッセンスデータを単一のポート番号で伝送する放送信号ストリームの送信ノードから受信ノードへの送信を制御する制御ノードのプロセッサに、
前記放送信号ストリームの属性を記述するＳＤＰ（Session Description Protocol）オブジェクトを前記送信ノードから取得すること、
を実行させ、
前記ＳＤＰオブジェクトは、映像エッセンスデータが圧縮されるかを示す圧縮関連情報、音声エッセンスデータに関連する音声チャンネル数情報、及びエッセンスデータへ適用される誤り訂正方式を示す誤り訂正情報、のうちの１つ以上を、前記放送信号ストリームのフォーマット固有のパラメータを記述する属性フィールド内に含む、
コンピュータプログラム。 (Appendix 14)
to a processor in a control node that controls transmission from a transmitting node to a receiving node of a broadcast signal stream carrying different types of essence data on a single port number in a broadcaster's IP network;
obtaining an SDP (Session Description Protocol) object describing attributes of the broadcast signal stream from the transmitting node;
and
The SDP object is one of compression-related information indicating whether video essence data is compressed, audio channel number information associated with audio essence data, and error correction information indicating an error correction method applied to essence data. one or more within an attribute field describing format-specific parameters of said broadcast signal stream;
computer program.

（付記１５）
放送局のＩＰネットワークにおける、異なるタイプのエッセンスデータを単一のポート番号で伝送する放送信号ストリームの送信ノードから受信ノードへの送信を制御する制御ノードのプロセッサに、
前記放送信号ストリームの属性を記述するＳＤＰ（Session Description Protocol）オブジェクトを前記送信ノードから取得すること、
を実行させ、
前記ＳＤＰオブジェクトは、映像エッセンスデータが圧縮されるかを示す圧縮関連情報、音声エッセンスデータに関連する音声チャンネル数情報、及びエッセンスデータへ適用される誤り訂正方式を示す誤り訂正情報、のうちの１つ以上を、前記放送信号ストリームのフォーマット固有のパラメータを記述する属性フィールド内に含む、
コンピュータプログラム、を記憶した非一時的なコンピュータ読取可能な記憶媒体。 (Appendix 15)
to a processor in a control node that controls transmission from a transmitting node to a receiving node of a broadcast signal stream carrying different types of essence data on a single port number in a broadcaster's IP network;
obtaining an SDP (Session Description Protocol) object describing attributes of the broadcast signal stream from the transmitting node;
and
The SDP object is one of compression-related information indicating whether video essence data is compressed, audio channel number information associated with audio essence data, and error correction information indicating an error correction method applied to essence data. one or more within an attribute field describing format-specific parameters of said broadcast signal stream;
A non-transitory computer-readable storage medium storing a computer program.

本開示に係る技術は、限定ではないものの、放送信号を処理するシステムにおいて利用可能である。 Techniques of the present disclosure can be used, without limitation, in systems that process broadcast signals.

１，３放送局システム
１０ＩＰドメイン
２０（２０ａ～ｄ）放送信号処理ノード
６０（６０ａ～ｄ）センダ
６５（６５ａ～ｃ）レシーバ
７０共通基準クロック
７２ａデバイス内部クロック
７３ａ，７６ａメディアクロック
７４ａ，７７ａＲＴＰクロック
８１，８２エッセンス分離型ストリーム
８３エッセンス混在型ストリーム
１００，２００放送信号処理ノード
１１０デバイス内部クロック
１１２メディアクロック
１１４ＲＴＰクロック
１１６ＰＴＰ処理部
１２０，２１０，２１５通信部（第１通信部）
１２２送信部
１２４受信部
１３０送信ストリーム処理部
１４０受信ストリーム処理部
１４２トランスポート処理部
１４４トランスポートヘッダ除去部
１５０エッセンスデータグラム処理部
１５２エッセンスヘッダ除去部
１５４映像エッセンス処理部
１５６音声エッセンス処理部
１５８補助データエッセンス処理部
１６０，２２０アラインメント部
１６１ａ～ｄ，１６６ａ～ｄＲＴＰパケット
１６２ａ～ｄ，１６７ａ～ｄＲＴＰヘッダ
１６３ａ～ｄ，１６８ａ～ｂエッセンスヘッダ
１７０ＦＥＣデータグラム処理部
１８０データ処理部
１９０制御部
２３０再生部
２４０変換部
２５０第２通信部
３００，５００送信ノード
３１１ＳＤＰオブジェクト
３１２セッションレベルセクション
３１３，３１８メディアレベルセクション
３１５ＲＴＰマップ属性フィールド
３１６フォーマット固有パラメータ属性フィールド
３１７パケット時間属性フィールド
３２０通信部
３２２，５１０送信部
３２４受信部
３９０，５２０制御部
３９５記憶部
４００，６００制御ノード
４１０通信部
４１２送信部
４１４受信部
４２０，６１０制御部
４２２情報管理部
４２４ストリーム制御部
４３０記憶部
４５０受信ノード
1, 3 Broadcast station system 10 IP domain 20 (20a-d) Broadcast signal processing node 60 (60a-d) Sender 65 (65a-c) Receiver 70 Common reference clock 72a Device internal clock 73a, 76a Media clock 74a, 77a RTP Clocks 81, 82 Essence-separated stream 83 Essence-mixed stream 100, 200 Broadcast signal processing node 110 Device internal clock 112 Media clock 114 RTP clock 116 PTP processing unit 120, 210, 215 Communication unit (first communication unit)
122 transmitting unit 124 receiving unit 130 transmission stream processing unit 140 reception stream processing unit 142 transport processing unit 144 transport header removal unit 150 essence datagram processing unit 152 essence header removal unit 154 video essence processing unit 156 audio essence processing unit 158 auxiliary data essence processing unit 160, 220 alignment unit 161a-d, 166a-d RTP packet 162a-d, 167a-d RTP header 163a-d, 168a-b essence header 170 FEC datagram processing unit 180 data processing unit 190 control unit 230 Reproduction unit 240 Conversion unit 250 Second communication unit 300,500 Transmission node 311 SDP object 312 Session level section 313,318 Media level section 315 RTP map attribute field 316 Format specific parameter attribute field 317 Packet time attribute field 320 Communication unit 322,510 Transmission section 324 Reception section 390, 520 Control section 395 Storage section 400, 600 Control node 410 Communication section 412 Transmission section 414 Reception section 420, 610 Control section 422 Information management section 424 Stream control section 430 Storage section 450 Reception node

Claims

a transmitter for transmitting a broadcast signal stream carrying different types of essence data on a single port number to an IP network of a broadcaster;
a control unit that provides an SDP (Session Description Protocol) object describing attributes of the broadcast signal stream to other nodes;
with
The SDP object is one of compression-related information indicating whether video essence data is compressed, audio channel number information associated with audio essence data, and error correction information indicating an error correction method applied to essence data. one or more within an attribute field describing format-specific parameters of said broadcast signal stream;
sending node.

The compression-related information is
including a compression parameter indicating whether the video essence data is compressed;
further comprising a codec parameter indicating a codec to be used in compressing the video essence data when the compression parameter indicates that the video essence data is to be compressed;
A transmitting node according to claim 1.

The error correction information is
a type parameter indicating the error correction scheme applied to the essence data;
a setting parameter indicating a setting value of the error correction scheme indicated by the type parameter;
A transmitting node according to claim 1 or claim 2, comprising:

4. The transmitting node of claim 3, wherein if the type parameter indicates XOR encoding, the configuration parameters include a size parameter indicating an error correction block size.

5. The setting parameter according to claim 3, wherein when the type parameter indicates Reed-Solomon encoding, the setting parameter includes a datagram number parameter indicating the number of datagrams corresponding to a processing unit of Reed-Solomon encoding. sending node.

The sending node according to any one of claims 1 to 5, wherein said control unit provides said SDP object to said other node via a predefined management application protocol interface.

A transmitting node according to any preceding claim, wherein said broadcast signal stream is an ARIB STD-B73 stream.

A transmitting node according to any one of claims 1 to 7;
a receiving node that receives the broadcast signal stream set up according to the SDP object description;
Broadcasting station system including.

A control node for controlling transmission from a transmitting node to a receiving node of a broadcast signal stream carrying different types of essence data on a single port number in a broadcast station IP network,
a control unit that acquires an SDP (Session Description Protocol) object describing attributes of the broadcast signal stream from the transmission node;
The SDP object is one of compression-related information indicating whether video essence data is compressed, audio channel number information associated with audio essence data, and error correction information indicating an error correction method applied to essence data. one or more within an attribute field describing format-specific parameters of said broadcast signal stream;
control node.

providing other nodes with an SDP (Session Description Protocol) object that describes attributes of a broadcast signal stream that carries different types of essence data on a single port number;
transmitting the broadcast signal stream to a broadcast station's IP network;
including
The SDP object is one of compression-related information indicating whether video essence data is compressed, audio channel number information associated with audio essence data, and error correction information indicating an error correction method applied to essence data. one or more within an attribute field describing format-specific parameters of said broadcast signal stream;
Transmission control method.