JP7361287B2

JP7361287B2 - Transmission method and transmission device

Info

Publication number: JP7361287B2
Application number: JP2022032489A
Authority: JP
Inventors: 賀敬井口; 正真遠間
Original assignee: Panasonic Intellectual Property Management Co Ltd
Current assignee: Panasonic Intellectual Property Management Co Ltd
Priority date: 2015-08-03
Filing date: 2022-03-03
Publication date: 2023-10-16
Anticipated expiration: 2036-07-12
Also published as: WO2017022211A1; JP7054780B2; JP2021101581A; JP2022075740A; JP2023168403A

Description

本発明は、送信方法、受信方法、送信装置及び受信装置に関する。 The present invention relates to a transmitting method, a receiving method, a transmitting device, and a receiving device.

放送及び通信サービスの高度化に伴い、８Ｋ（７６８０×４３２０ピクセル：以下では８Ｋ４Ｋとも呼ぶ）及び４Ｋ（３８４０×２１６０ピクセル：以下では４Ｋ２Ｋとも呼ぶ）などの超高精細な動画像コンテンツの導入が検討されている。受信装置は、受信した超高精細な動画像の符号化データを実時間で復号して表示する必要があるが、特に８Ｋなどの解像度の動画像は復号時の処理負荷が大きく、このような動画像を１つの復号器で、実時間で復号することは困難である。従って、複数の復号器を用いて復号処理を並列化することで、１つの復号器あたりの処理負荷を低減し、実時間処理を達成する方法が検討されている。 As broadcasting and communication services become more sophisticated, the introduction of ultra-high-definition video content such as 8K (7680 x 4320 pixels: hereinafter also referred to as 8K4K) and 4K (3840 x 2160 pixels: hereinafter also referred to as 4K2K) is being considered. has been done. The receiving device needs to decode and display the encoded data of the received ultra-high-definition video in real time, but video with resolutions such as 8K requires a large processing load during decoding. It is difficult to decode moving images in real time with one decoder. Therefore, methods are being considered to reduce the processing load per decoder and achieve real-time processing by parallelizing the decoding process using a plurality of decoders.

また、符号化データはＭＰＥＧ－２ＴＳ（ＴｒａｎｓｐｏｒｔＳｔｒｅａｍ）又はＭＭＴ（ＭＰＥＧＭｅｄｉａＴｒａｎｓｐｏｒｔ）などの多重化方式に基づいて多重化されたうえで送信される。例えば、非特許文献１には、ＭＭＴに従って、符号化されたメディアデータをパケット毎に送信する技術が開示されている。 Further, the encoded data is multiplexed based on a multiplexing method such as MPEG-2 TS (Transport Stream) or MMT (MPEG Media Transport) before being transmitted. For example, Non-Patent Document 1 discloses a technique for transmitting encoded media data packet by packet according to MMT.

Ｉｎｆｏｒｍａｔｉｏｎｔｅｃｈｎｏｌｏｇｙ－Ｈｉｇｈｅｆｆｉｃｉｅｎｃｙｃｏｄｉｎｇａｎｄｍｅｄｉａｄｅｌｉｖｅｒｙｉｎｈｅｔｅｒｏｇｅｎｅｏｕｓｅｎｖｉｒｏｎｍｅｎｔ－Ｐａｒｔ１：ＭＰＥＧｍｅｄｉａｔｒａｎｓｐｏｒｔ（ＭＭＴ）、ＩＳＯ／ＩＥＣＤＩＳ２３００８－１Information technology - High efficiency coding and media delivery in heterogeneous environment - Part1: MPEG media transp ort(MMT), ISO/IEC DIS 23008-1

ところで、放送や通信サービスの高度化に伴い、８Ｋや４Ｋ（３８４０ｘ２１６０ピクセル）などの超高精細な動画コンテンツの導入が検討されている。ＭＭＴ（ＭＰＥＧＭｅｄｉａＴｒａｎｓｐｏｒｔ）や、ＭＰＥＧ－ＤＡＳＨなどの多重化方式では、映像や音声、ファイルなどのメディアデータは、ＩＳＯＢＭＦＦ（ＩＳＯｂａｓｅｄｍｅｄｉａｆｉｌｅｆｏｒｍａｔ）（ＭＰ４形式）のファイルに多重化し、ＭＭＴパケット化後に、放送伝送路や通信伝送路を用いて伝送される。 By the way, as broadcasting and communication services become more sophisticated, the introduction of ultra-high definition video content such as 8K and 4K (3840 x 2160 pixels) is being considered. In multiplexing methods such as MMT (MPEG Media Transport) and MPEG-DASH, media data such as video, audio, and files are multiplexed into ISOBMFF (ISO based media file format) (MP4 format) files and converted into MMT packets. Later, it is transmitted using a broadcast transmission path or a communication transmission path.

しかしながら、ＭＭＴの伝送では、受信装置のバッファ動作を保証できない可能性がある。 However, in MMT transmission, there is a possibility that the buffer operation of the receiving device cannot be guaranteed.

本発明は、ＭＭＴのような方式を用いてデータ伝送する場合に、受信装置のバッファ動作を保証できる送信方法などを提供する。 The present invention provides a transmission method that can guarantee buffer operation of a receiving device when transmitting data using a method such as MMT.

上記目的を達成するために、本発明の一態様に係る送信方法は、複数の伝送パケットを送信する送信方法であって、前記伝送パケットは、可変長のパケットヘッダおよび可変長のペイロードで構成され、送信される前記複数の伝送パケットを受信する受信バッファモデルは、前記伝送パケットを受信し、受信した前記伝送パケットに格納されている可変長のパケットヘッダおよび可変長のペイロードで構成される第１パケットを、ヘッダ伸張された固定長のパケットヘッダを有する第２パケットに変換し、変換することにより得られた前記第２パケットを所定の引き抜きレートで出力するバッファを含み、所定のパケット数以下の前記複数の伝送パケットを所定のデータ単位に格納し、互いに重ならない単位時間の期間であって、連続した複数の単位伝送期間のそれぞれについて、当該単位伝送期間において前記所定のデータ単位を送信することで、前記単位時間あたりに前記所定のパケット数以下の前記複数の伝送パケットを送信し、前記複数の伝送パケットは、無効なパケットとは異なる実パケットを格納している第１伝送パケットと、前記無効なパケットを格納している第２伝送パケットとを含み、前記格納では、前記第１伝送パケットを前記所定のデータ単位に格納する場合、前記第１伝送パケットの数を第１パケット数としてカウントし、前記第２伝送パケットを前記所定のデータ単位に格納する場合、当該第２伝送パケットのパケットサイズを前記第１伝送パケットのパケットサイズの最大値で除すことにより得られた商に１を加算した整数値を第２パケット数としてカウントし、カウントすることにより得られた第１パケット数及び第２パケット数の総和が前記所定のパケット数以下となるように、前記複数の伝送パケットを前記所定のデータ単位に格納し、前記第１伝送パケットのパケットサイズの前記最大値は、１５００Ｂである。 In order to achieve the above object, a transmission method according to one aspect of the present invention is a transmission method that transmits a plurality of transmission packets, wherein the transmission packet is composed of a variable-length packet header and a variable-length payload. , a reception buffer model for receiving the plurality of transmission packets to be transmitted includes a first reception buffer model that receives the transmission packets and includes a variable-length packet header and a variable-length payload stored in the received transmission packets. The buffer includes a buffer that converts a packet into a second packet having a header-expanded fixed-length packet header and outputs the second packet obtained by the conversion at a predetermined extraction rate, and Storing the plurality of transmission packets in a predetermined data unit, and transmitting the predetermined data unit in each of a plurality of continuous unit transmission periods, which are periods of unit time that do not overlap with each other. The plurality of transmission packets, the number of which is equal to or less than the predetermined number of packets, are transmitted per unit time, and the plurality of transmission packets include a first transmission packet storing a real packet different from an invalid packet; and a second transmission packet storing an invalid packet, and when storing the first transmission packet in the predetermined data unit, the number of the first transmission packet is counted as the number of first packets. However, when storing the second transmission packet in the predetermined data unit, 1 is added to the quotient obtained by dividing the packet size of the second transmission packet by the maximum value of the packet size of the first transmission packet. The added integer value is counted as the second number of packets, and the plurality of transmission packets are transferred to The first transmission packet is stored in a predetermined data unit, and the maximum value of the packet size of the first transmission packet is 1500B.

また、本発明の一態様に係る受信方法は、バッファを有する受信装置における受信方法であって、それぞれが可変長のパケットヘッダおよび可変長のペイロードで構成される複数の伝送パケットを受信し、受信された前記複数の伝送パケットに格納されている可変長のパケットヘッダおよび可変長のペイロードで構成される第１パケットを、ヘッダ伸張された固定長のパケットヘッダを有する第２パケットに前記バッファを用いて変換し、変換することにより得られた前記第２パケットを所定の引き抜きレートで前記バッファから出力し、前記バッファのバッファサイズは、前記複数の伝送パケットの伝送レート、前記伝送パケットの平均パケット長、及び前記引き抜きレートを用いて求められる最大バッファ占有量より大きく、前記複数の伝送パケットは、互いに重ならない単位時間の期間であって、連続した複数の単位伝送期間のそれぞれについて、当該単位伝送期間において所定のデータ単位に格納されて送信されることで、前記単位時間あたりに送信された所定のパケット数以下の伝送パケットであり、無効なパケットとは異なる実パケットを格納している第１伝送パケットと、前記無効なパケットを格納している第２伝送パケットとを含み、前記所定のデータ単位に格納されている前記複数の伝送パケットは、前記第１伝送パケットが前記所定のデータ単位に格納されている場合、前記第１伝送パケットの数が第１パケット数としてカウントされ、前記第２伝送パケットが前記所定のデータ単位に格納されている場合、当該第２伝送パケットのパケットサイズを前記第１伝送パケットのパケットサイズの最大値で除すことにより得られた商に１を加算した整数値が第２パケット数としてカウントされ、カウントすることにより得られた第１パケット数及び第２パケット数の総和が前記所定のパケット数以下となるように、前記所定のデータ単位に格納されたパケットであり、前記第１伝送パケットのパケットサイズの前記最大値は、１５００Ｂである。 Further, a receiving method according to one aspect of the present invention is a receiving method in a receiving device having a buffer, in which a plurality of transmission packets each including a variable-length packet header and a variable-length payload are received, A first packet consisting of a variable length packet header and a variable length payload stored in the plurality of transmitted packets is transferred to a second packet having a header-expanded fixed length packet header using the buffer. The second packet obtained by the conversion is outputted from the buffer at a predetermined extraction rate, and the buffer size of the buffer is equal to the transmission rate of the plurality of transmission packets, the average packet length of the transmission packets , and the plurality of transmission packets are in unit time periods that do not overlap with each other, and each of the plurality of consecutive unit transmission periods is larger than the maximum buffer occupancy calculated using the extraction rate. The first transmission is a transmission packet that is stored in a predetermined data unit and transmitted in a predetermined data unit, and is a transmission packet that is less than or equal to the predetermined number of packets transmitted per unit time, and stores real packets that are different from invalid packets. packet and a second transmission packet storing the invalid packet, the plurality of transmission packets stored in the predetermined data unit are such that the first transmission packet is stored in the predetermined data unit. If the number of the first transmission packets is counted as the first packet number, and the second transmission packet is stored in the predetermined data unit, the packet size of the second transmission packet is counted as the number of first transmission packets. The integer value obtained by adding 1 to the quotient obtained by dividing by the maximum packet size of one transmission packet is counted as the second packet number, and the first and second packet numbers obtained by counting The packets are stored in the predetermined data unit such that the sum total of the packets is equal to or less than the predetermined number of packets, and the maximum value of the packet size of the first transmission packet is 1500B.

上記目的を達成するために、本発明の一態様に係る送信方法は、複数の伝送パケットを送信する送信方法であって、前記伝送パケットは、可変長のパケットヘッダおよび可変長のペイロードで構成され、送信される前記複数の伝送パケットを受信する受信バッファモデルは、前記伝送パケットを受信し、受信した前記伝送パケットに格納されている可変長のパケットヘッダおよび可変長のペイロードで構成される第１パケットを、ヘッダ伸張された固定長のパケットヘッダを有する第２パケットに変換し、変換することにより得られた前記第２パケットを所定の引き抜きレートで出力するバッファを含み、所定のパケット数以下の前記複数の伝送パケットを所定のデータ単位に格納し、互いに重ならない単位時間の期間であって、連続した複数の単位伝送期間のそれぞれについて、当該単位伝送期間において前記所定のデータ単位を送信することで、前記単位時間あたりに前記所定のパケット数以下の前記複数の伝送パケットを送信し、前記複数の伝送パケットは、無効なパケットとは異なる実パケットを格納している第１伝送パケットと、前記無効なパケットを格納している第２伝送パケットとを含み、前記格納では、前記第１伝送パケットを前記所定のデータ単位に格納する場合、前記第１伝送パケットの数を第１パケット数としてカウントし、前記第２伝送パケットを前記所定のデータ単位に格納する場合、当該第２伝送パケットのパケットサイズを前記第１伝送パケットのパケットサイズの最大値で除すことにより得られた商に１を加算した整数値を第２パケット数としてカウントし、カウントすることにより得られた第１パケット数及び第２パケット数の総和が前記所定のパケット数以下となるように、前記複数の伝送パケットを前記所定のデータ単位に格納し、前記無効なパケットは、ＮＵＬＬパケットである。 In order to achieve the above object, a transmission method according to one aspect of the present invention is a transmission method that transmits a plurality of transmission packets, wherein the transmission packet is composed of a variable-length packet header and a variable-length payload. , a reception buffer model for receiving the plurality of transmission packets to be transmitted includes a first reception buffer model that receives the transmission packets and includes a variable-length packet header and a variable-length payload stored in the received transmission packets. The buffer includes a buffer that converts a packet into a second packet having a header-expanded fixed-length packet header and outputs the second packet obtained by the conversion at a predetermined extraction rate, and Storing the plurality of transmission packets in a predetermined data unit, and transmitting the predetermined data unit in each of a plurality of continuous unit transmission periods, which are periods of unit time that do not overlap with each other. The plurality of transmission packets, the number of which is equal to or less than the predetermined number of packets, are transmitted per unit time, and the plurality of transmission packets include a first transmission packet storing a real packet different from an invalid packet; and a second transmission packet storing an invalid packet, and when storing the first transmission packet in the predetermined data unit, the number of the first transmission packet is counted as the number of first packets. However, when storing the second transmission packet in the predetermined data unit, 1 is added to the quotient obtained by dividing the packet size of the second transmission packet by the maximum value of the packet size of the first transmission packet. The added integer value is counted as the second number of packets, and the plurality of transmission packets are transferred to The invalid packet is stored in a predetermined data unit and is a NULL packet.

また、本発明の一態様に係る受信方法は、バッファを有する受信装置における受信方法であって、それぞれが可変長のパケットヘッダおよび可変長のペイロードで構成される複数の伝送パケットを受信し、受信された前記複数の伝送パケットに格納されている可変長のパケットヘッダおよび可変長のペイロードで構成される第１パケットを、ヘッダ伸張された固定長のパケットヘッダを有する第２パケットに前記バッファを用いて変換し、変換することにより得られた前記第２パケットを所定の引き抜きレートで前記バッファから出力し、前記バッファのバッファサイズは、前記複数の伝送パケットの伝送レート、前記伝送パケットの平均パケット長、及び前記引き抜きレートを用いて求められる最大バッファ占有量よりも大きく、前記複数の伝送パケットは、互いに重ならない単位時間の期間であって、連続した複数の単位伝送期間のそれぞれについて、当該単位伝送期間において所定のデータ単位に格納されて送信されることで、前記単位時間あたりに送信された所定のパケット数以下の伝送パケットであり、無効なパケットとは異なる実パケットを格納している第１伝送パケットと、前記無効なパケットを格納している第２伝送パケットとを含み、前記所定のデータ単位に格納されている前記複数の伝送パケットは、前記第１伝送パケットが前記所定のデータ単位に格納されている場合、前記第１伝送パケットの数が第１パケット数としてカウントされ、前記第２伝送パケットが前記所定のデータ単位に格納されている場合、当該第２伝送パケットのパケットサイズを前記第１伝送パケットのパケットサイズの最大値で除すことにより得られた商に１を加算した整数値が第２パケット数としてカウントされ、カウントすることにより得られた第１パケット数及び第２パケット数の総和が前記所定のパケット数以下となるように、前記所定のデータ単位に格納されたパケットであり、前記無効なパケットは、ＮＵＬＬパケットである。 Further, a receiving method according to one aspect of the present invention is a receiving method in a receiving device having a buffer, in which a plurality of transmission packets each including a variable-length packet header and a variable-length payload are received, A first packet consisting of a variable length packet header and a variable length payload stored in the plurality of transmitted packets is transferred to a second packet having a header-expanded fixed length packet header using the buffer. The second packet obtained by the conversion is outputted from the buffer at a predetermined extraction rate, and the buffer size of the buffer is equal to the transmission rate of the plurality of transmission packets, the average packet length of the transmission packets , and the plurality of transmission packets are in a unit time period that does not overlap with each other, and the unit transmission is larger than the maximum buffer occupancy calculated using the extraction rate, and the plurality of transmission packets are in a unit time period that does not overlap with each other, A first transmission packet that is stored and transmitted in a predetermined data unit in a period, and is a transmission packet that is less than or equal to the predetermined number of packets transmitted per unit time, and that stores real packets that are different from invalid packets. The plurality of transmission packets, which include a transmission packet and a second transmission packet storing the invalid packet, and which are stored in the predetermined data unit are such that the first transmission packet is stored in the predetermined data unit. If stored, the number of the first transmission packets is counted as the number of first packets, and if the second transmission packet is stored in the predetermined data unit, the packet size of the second transmission packet is counted as the number of first transmission packets. The integer value obtained by adding 1 to the quotient obtained by dividing by the maximum packet size of the first transmission packet is counted as the number of second packets, and the number of first packets and second packets obtained by counting The packets are stored in the predetermined data unit such that the sum of the numbers is equal to or less than the predetermined number of packets, and the invalid packet is a NULL packet.

上記目的を達成するために、本発明の一態様に係る送信方法は、受信装置のバッファ動作を保証するために予め定められた受信バッファモデルによる規定を満たした状態で複数の伝送パケットを送信する送信方法であって、前記伝送パケットは、可変長のパケットヘッダおよび可変長のペイロードで構成され、前記受信バッファモデルは、前記伝送パケットを受信し、受信した前記伝送パケットに格納されている可変長のパケットヘッダおよび可変長のペイロードで構成される第１パケットを、ヘッダ伸張された固定長のパケットヘッダを有する第２パケットに変換し、変換することにより得られた前記第２パケットを所定の引き抜きレートで出力するバッファを含み、単位時間あたりに所定のパケット数以下の前記複数の伝送パケットを送信する。 In order to achieve the above object, a transmission method according to one aspect of the present invention transmits a plurality of transmission packets while satisfying regulations according to a predetermined reception buffer model in order to guarantee buffer operation of a reception device. In the transmission method, the transmission packet includes a variable-length packet header and a variable-length payload, and the reception buffer model receives the transmission packet and generates a variable-length data stored in the received transmission packet. A first packet consisting of a packet header and a variable length payload is converted into a second packet having a fixed length packet header with the header expanded, and the second packet obtained by the conversion is extracted in a predetermined manner. It includes a buffer that outputs at a rate, and transmits the plurality of transmission packets in a number not more than a predetermined number of packets per unit time.

また、本発明の一態様に係る受信方法は、バッファを有する受信装置における受信方法であって、それぞれが可変長のパケットヘッダおよび可変長のペイロードで構成される複数の伝送パケットを受信し、受信された前記複数の伝送パケットに格納されている可変長のパケットヘッダおよび可変長のペイロードで構成される第１パケットを、ヘッダ伸張された固定長のパケットヘッダを有する第２パケットに前記バッファを用いて変換し、変換することにより得られた前記第２パケットを所定の引き抜きレートで前記バッファから出力し、前記バッファのバッファサイズは、前記複数の伝送パケットの伝送レート、前記伝送パケットの平均パケット長、及び前記引き抜きレートを用いて求められる最大バッファ占有量よりも大きい。 Further, a receiving method according to one aspect of the present invention is a receiving method in a receiving device having a buffer, in which a plurality of transmission packets each including a variable-length packet header and a variable-length payload are received, A first packet consisting of a variable length packet header and a variable length payload stored in the plurality of transmitted packets is transferred to a second packet having a header-expanded fixed length packet header using the buffer. The second packet obtained by the conversion is outputted from the buffer at a predetermined extraction rate, and the buffer size of the buffer is equal to the transmission rate of the plurality of transmission packets, the average packet length of the transmission packets , and the maximum buffer occupancy determined using the extraction rate.

なお、これらの全般的または具体的な態様は、システム、装置、集積回路、コンピュータプログラムまたはコンピュータ読み取り可能なＣＤ－ＲＯＭなどの記録媒体で実現されてもよく、システム、装置、集積回路、コンピュータプログラム及び記録媒体の任意な組み合わせで実現されてもよい。 Note that these general or specific aspects may be realized by a system, a device, an integrated circuit, a computer program, or a computer-readable recording medium such as a CD-ROM. and a recording medium may be used in any combination.

本発明は、ＭＭＴのような方式を用いてデータ伝送する場合に、受信装置のバッファ動作を保証できる。 The present invention can guarantee buffer operation of a receiving device when transmitting data using a method such as MMT.

図１は、ピクチャをスライスセグメントに分割する例を示す図である。FIG. 1 is a diagram showing an example of dividing a picture into slice segments. 図２は、ピクチャのデータが格納されたＰＥＳパケット列の一例を示す図である。FIG. 2 is a diagram showing an example of a PES packet string in which picture data is stored. 図３は、実施の形態１に係るピクチャの分割例を示す図である。FIG. 3 is a diagram illustrating an example of dividing a picture according to the first embodiment. 図４は、実施の形態１の比較例に係るピクチャの分割例を示す図である。FIG. 4 is a diagram illustrating an example of picture division according to a comparative example of the first embodiment. 図５は、実施の形態１に係るアクセスユニットのデータの一例を示す図である。FIG. 5 is a diagram illustrating an example of data of an access unit according to the first embodiment. 図６は、実施の形態１に係る送信装置のブロック図である。FIG. 6 is a block diagram of a transmitting device according to the first embodiment. 図７は、実施の形態１に係る受信装置のブロック図である。FIG. 7 is a block diagram of the receiving device according to the first embodiment. 図８は、実施の形態１に係るＭＭＴパケットの一例を示す図である。FIG. 8 is a diagram illustrating an example of an MMT packet according to the first embodiment. 図９は、実施の形態１に係るＭＭＴパケットの別の例を示す図である。FIG. 9 is a diagram showing another example of the MMT packet according to the first embodiment. 図１０は、実施の形態１に係る各復号部に入力されるデータの一例を示す図である。FIG. 10 is a diagram illustrating an example of data input to each decoding unit according to the first embodiment. 図１１は、実施の形態１に係るＭＭＴパケット及びヘッダ情報の一例を示す図である。FIG. 11 is a diagram illustrating an example of an MMT packet and header information according to the first embodiment. 図１２は、実施の形態１に係る各復号部に入力されるデータの別の例を示す図である。FIG. 12 is a diagram illustrating another example of data input to each decoding unit according to the first embodiment. 図１３は、実施の形態１に係るピクチャの分割例を示す図である。FIG. 13 is a diagram illustrating an example of dividing a picture according to the first embodiment. 図１４は、実施の形態１に係る送信方法のフローチャートである。FIG. 14 is a flowchart of the transmission method according to the first embodiment. 図１５は、実施の形態１に係る受信装置のブロック図である。FIG. 15 is a block diagram of a receiving device according to the first embodiment. 図１６は、実施の形態１に係る受信方法のフローチャートである。FIG. 16 is a flowchart of the receiving method according to the first embodiment. 図１７は、実施の形態１に係るＭＭＴパケット及びヘッダ情報の一例を示す図である。FIG. 17 is a diagram illustrating an example of an MMT packet and header information according to the first embodiment. 図１８は、実施の形態１に係るＭＭＴパケット及びヘッダ情報の一例を示す図である。FIG. 18 is a diagram illustrating an example of an MMT packet and header information according to the first embodiment. 図１９は、ＭＰＵの構成を示す図である。FIG. 19 is a diagram showing the configuration of the MPU. 図２０は、ＭＦメタデータの構成を示す図である。FIG. 20 is a diagram showing the structure of MF metadata. 図２１は、データの送信順序を説明するための図である。FIG. 21 is a diagram for explaining the data transmission order. 図２２は、ヘッダ情報を用いずに復号を行う方法の例を示す図である。FIG. 22 is a diagram illustrating an example of a method for decoding without using header information. 図２３は、実施の形態２に係る送信装置のブロック図である。FIG. 23 is a block diagram of a transmitting device according to the second embodiment. 図２４は、実施の形態２に係る送信方法のフローチャートである。FIG. 24 is a flowchart of the transmission method according to the second embodiment. 図２５は、実施の形態２に係る受信装置のブロック図である。FIG. 25 is a block diagram of a receiving device according to Embodiment 2. 図２６は、ＭＰＵ先頭位置及びＮＡＬユニット位置を特定するための動作のフローチャートである。FIG. 26 is a flowchart of the operation for specifying the MPU head position and NAL unit position. 図２７は、送信順序タイプに基づいて初期化情報を取得し、初期化情報に基づいてメディアデータを復号する動作のフローチャートである。FIG. 27 is a flowchart of operations for obtaining initialization information based on the transmission order type and decoding media data based on the initialization information. 図２８は、低遅延提示モードが設けられた場合の受信装置の動作のフローチャートである。FIG. 28 is a flowchart of the operation of the receiving device when the low delay presentation mode is provided. 図２９は、補助データが送信される場合のＭＭＴパケットの送信順序の一例を示す図である。FIG. 29 is a diagram illustrating an example of the transmission order of MMT packets when auxiliary data is transmitted. 図３０は、送信装置がｍｏｏｆの構成に基づいて補助データを生成する例を説明するための図である。FIG. 30 is a diagram for explaining an example in which the transmitting device generates auxiliary data based on the configuration of moof. 図３１は、補助データの受信を説明するための図である。FIG. 31 is a diagram for explaining reception of auxiliary data. 図３２は、補助データを用いた受信動作のフローチャートである。FIG. 32 is a flowchart of a receiving operation using auxiliary data. 図３３は、複数のムービーフラグメントで構成されるＭＰＵの構成を示す図である。FIG. 33 is a diagram showing the configuration of an MPU composed of a plurality of movie fragments. 図３４は、図３３の構成のＭＰＵが伝送される場合のＭＭＴパケットの送信順序を説明するための図である。FIG. 34 is a diagram for explaining the transmission order of MMT packets when the MPU having the configuration shown in FIG. 33 is transmitted. 図３５は、１つのＭＰＵが複数のムービーフラグメントで構成される場合の受信装置の動作例を説明するための第１の図である。FIG. 35 is a first diagram for explaining an example of the operation of the receiving device when one MPU is composed of a plurality of movie fragments. 図３６は、１つのＭＰＵが複数のムービーフラグメントで構成される場合の受信装置の動作例を説明するための第２の図である。FIG. 36 is a second diagram for explaining an example of the operation of the receiving device when one MPU is composed of a plurality of movie fragments. 図３７は、図３５及び図３６で説明した受信方法の動作のフローチャートである。FIG. 37 is a flowchart of the operation of the reception method described in FIGS. 35 and 36. 図３８は、非ＶＣＬＮＡＬユニットを、個別にデータユニットとし、アグリゲーションする場合を示す図である。FIG. 38 is a diagram showing a case where non-VCL NAL units are individually treated as data units and aggregated. 図３９は、非ＶＣＬＮＡＬユニットを、まとめてデータユニットとする場合を示す図である。FIG. 39 is a diagram showing a case in which non-VCL NAL units are combined into a data unit. 図４０は、パケットロスが発生した場合の受信装置の動作のフローチャートである。FIG. 40 is a flowchart of the operation of the receiving device when packet loss occurs. 図４１は、ＭＰＵが複数のムービーフラグメントに分割されている場合の受信動作のフローチャートである。FIG. 41 is a flowchart of the reception operation when the MPU is divided into a plurality of movie fragments. 図４２は、時間スケーラビリティを実現する際の各ＴｅｍｐｏｒａｌＩｄにおけるピクチャの予測構造の例を示す図である。FIG. 42 is a diagram illustrating an example of a predicted structure of a picture in each TemporalId when achieving temporal scalability. 図４３は、図４２の各ピクチャにおける復号時刻（ＤＴＳ）と表示時刻（ＰＴＳ）との関係を示す図である。FIG. 43 is a diagram showing the relationship between decoding time (DTS) and display time (PTS) in each picture in FIG. 42. 図４４は、ピクチャの遅延処理、及び、リオーダ処理が必要となるピクチャの予測構造の一例を示す図である。FIG. 44 is a diagram illustrating an example of a predicted structure of a picture that requires picture delay processing and reorder processing. 図４５は、ＭＰ４形式で構成されるＭＰＵが複数のムービーフラグメントに分割されて、ＭＭＴＰペイロード、ＭＭＴＰパケットに格納される例を示す図である。FIG. 45 is a diagram showing an example in which an MPU configured in MP4 format is divided into a plurality of movie fragments and stored in MMTP payloads and MMTP packets. 図４６は、ＰＴＳ及びＤＴＳの算出方法と課題とを説明するための図である。FIG. 46 is a diagram for explaining the calculation method and problems of PTS and DTS. 図４７は、ＤＴＳ算出用の情報を用いてＤＴＳが算出される場合の受信動作のフローチャートである。FIG. 47 is a flowchart of the reception operation when DTS is calculated using the information for DTS calculation. 図４８は、ＭＭＴにおけるデータユニットのペイロードへの格納方法を説明するための図である。FIG. 48 is a diagram for explaining a method of storing a data unit in a payload in MMT. 図４９は、実施の形態３に係る送信装置の動作フローである。FIG. 49 is an operation flow of the transmitting device according to the third embodiment. 図５０は、実施の形態３に係る受信装置の動作フローである。FIG. 50 is an operation flow of the receiving device according to the third embodiment. 図５１は、実施の形態３に係る送信装置の具体的構成の例を示す図である。FIG. 51 is a diagram illustrating an example of a specific configuration of a transmitting device according to Embodiment 3. 図５２は、実施の形態３に係る受信装置の具体的構成の例を示す図である。FIG. 52 is a diagram illustrating an example of a specific configuration of a receiving device according to Embodiment 3. 図５３は、ｎｏｎ－ｔｉｍｅｄメディアのＭＰＵへの格納方法、及び、ＭＭＴＰパケットでの伝送方法を示す図である。FIG. 53 is a diagram showing a method of storing non-timed media in the MPU and a method of transmitting it using MMTP packets. 図５４は、ファイルが分割されることにより得られた複数の分割データ毎にパケット化して伝送する例を示す図である。FIG. 54 is a diagram showing an example in which a plurality of pieces of divided data obtained by dividing a file are packetized and transmitted. 図５５は、ファイルが分割されることにより得られた複数の分割データ毎にパケット化して伝送する他の例を示す図である。FIG. 55 is a diagram showing another example in which a plurality of pieces of divided data obtained by dividing a file are packetized and transmitted. 図５６は、アセット管理テーブルにおけるファイル毎のループのシンタックスを示す図である。FIG. 56 is a diagram showing the syntax of a loop for each file in the asset management table. 図５７は、受信装置における分割データ番号を特定する動作フローである。FIG. 57 is an operation flow for specifying a divided data number in a receiving device. 図５８は、受信装置における分割データ数を特定する動作フローである。FIG. 58 is an operation flow for specifying the number of divided data in the receiving device. 図５９は、送信装置においてフラグメントカウンタを運用するかどうかを決定するための動作フローである。FIG. 59 is an operational flow for determining whether to operate a fragment counter in a transmitting device. 図６０は、分割データ数、分割データ番号を特定する方法（フラグメントカウンタを活用する場合）について説明するための図である。FIG. 60 is a diagram for explaining a method for specifying the number of divided data and the divided data number (when using a fragment counter). 図６１は、フラグメントカウンタを活用する場合の送信装置の動作フローである。FIG. 61 is an operation flow of the transmitting device when utilizing a fragment counter. 図６２は、フラグメントカウンタを活用する場合の受信装置の動作フローである。FIG. 62 is an operation flow of the receiving device when utilizing a fragment counter. 図６３は、同一番組を複数のＩＰデータフローで送信する場合のサービス構成を示す図である。FIG. 63 is a diagram showing a service configuration when the same program is transmitted using multiple IP data flows. 図６４は、送信装置の具体的構成の例を示す図である。FIG. 64 is a diagram showing an example of a specific configuration of a transmitting device. 図６５は、受信装置の具体的構成の例を示す図である。FIG. 65 is a diagram showing an example of a specific configuration of a receiving device. 図６６は、送信装置による動作フローである。FIG. 66 is an operation flow of the transmitting device. 図６７は、受信装置による動作フローである。FIG. 67 is an operation flow of the receiving device. 図６８は、ＡＲＩＢＳＴＤＢ－６０に規定されている受信バッファモデルに基づいて、特に放送伝送路のみを用いた場合の受信バッファモデルを示す。FIG. 68 shows a receive buffer model based on the receive buffer model specified in ARIB STD B-60, particularly when only a broadcast transmission path is used. 図６９は、複数のデータユニットをアグリゲーションして一つのペイロードに格納する例を示す図である。FIG. 69 is a diagram illustrating an example of aggregating a plurality of data units and storing them in one payload. 図７０は、複数のデータユニットをアグリゲーションして一つのペイロードに格納する例であって、ＮＡＬサイズフォーマットの映像信号を一つのデータユニットとした場合の例を示す図である。FIG. 70 is a diagram showing an example in which a plurality of data units are aggregated and stored in one payload, and a video signal in the NAL size format is used as one data unit. 図７１は、データユニット長が示されないＭＭＴＰパケットのペイロードの構成を示す図である。FIG. 71 is a diagram showing the structure of the payload of an MMTP packet in which the data unit length is not indicated. 図７２は、パケット単位に付与されるｅｘｔｅｎｄ領域に示す例である。FIG. 72 is an example of an extend area added to each packet. 図７３は、受信装置による動作フローを示す。FIG. 73 shows an operation flow by the receiving device. 図７４は、送信装置の具体的構成の例を示す図である。FIG. 74 is a diagram illustrating an example of a specific configuration of a transmitting device. 図７５は、受信装置の具体的構成の例を示す図である。FIG. 75 is a diagram showing an example of a specific configuration of a receiving device. 図７６は、送信装置による動作フローである。FIG. 76 is an operational flowchart of the transmitting device. 図７７は、受信装置による動作フローである。FIG. 77 is an operation flow of the receiving device. 図７８は、ＡＲＩＢＳＴＤ－Ｂ６０に規定されるＭＭＴ／ＴＬＶ方式のプロトコルスタックを示す図である。FIG. 78 is a diagram showing a protocol stack of the MMT/TLV method defined in ARIB STD-B60. 図７９は、ＴＬＶパケットの構成を示す図である。FIG. 79 is a diagram showing the structure of a TLV packet. 図８０は、受信装置のブロック図の一例を示す図である。FIG. 80 is a diagram illustrating an example of a block diagram of a receiving device. 図８１は、伝送スロットの構成を示す図である。FIG. 81 is a diagram showing the configuration of a transmission slot. 図８２は、伝送フレームと、伝送フレームに格納される特定のｔｌｖ＿ｓｔｒｅａｍ＿ｉｄを有する１つのＴＬＶストリーム（ＴＬＶパケット列）とを示す図である。FIG. 82 is a diagram showing a transmission frame and one TLV stream (TLV packet sequence) having a specific tlv_stream_id stored in the transmission frame. 図８３は、ＴＬＶパケットバッファのバッファサイズ、および、伝送フレーム単位での最大パケット数を規定した場合の送出方法のフローチャートである。FIG. 83 is a flowchart of a sending method when the buffer size of the TLV packet buffer and the maximum number of packets per transmission frame are defined. 図８４は、送信装置の具体的構成の例を示す図である。FIG. 84 is a diagram showing an example of a specific configuration of a transmitting device. 図８５は、受信装置の具体的構成の例を示す図である。FIG. 85 is a diagram showing an example of a specific configuration of a receiving device. 図８６は、送信装置による動作フローである。FIG. 86 is an operation flow of the transmitting device. 図８７は、受信装置による動作フローである。FIG. 87 is an operation flow of the receiving device.

本発明の一態様に係る送信方法は、受信装置のバッファ動作を保証するために予め定められた受信バッファモデルによる規定を満たした状態で複数の伝送パケットを送信する送信方法であって、前記伝送パケットは、可変長のパケットヘッダおよび可変長のペイロードで構成され、前記受信バッファモデルは、前記伝送パケットを受信し、受信した前記伝送パケットに格納されている可変長のパケットヘッダおよび可変長のペイロードで構成される第１パケットを、ヘッダ伸張された固定長のパケットヘッダを有する第２パケットに変換し、変換することにより得られた前記第２パケットを所定の引き抜きレートで出力するバッファを含み、単位時間あたりに所定のパケット数以下の前記複数の伝送パケットを送信する。 A transmission method according to one aspect of the present invention is a transmission method for transmitting a plurality of transmission packets while satisfying regulations according to a predetermined reception buffer model in order to guarantee buffer operation of a reception device, the transmission method comprising: A packet is composed of a variable-length packet header and a variable-length payload, and the reception buffer model receives the transmission packet and processes the variable-length packet header and variable-length payload stored in the received transmission packet. a buffer that converts a first packet consisting of the following into a second packet having a header-expanded fixed-length packet header, and outputs the second packet obtained by the conversion at a predetermined extraction rate; The plurality of transmission packets, the number of which is equal to or less than a predetermined number of packets, are transmitted per unit time.

このような送信装置は、ＭＭＴのような方式を用いてデータ伝送する場合に、受信装置のバッファ動作を保証できる。 Such a transmitting device can guarantee buffer operation of the receiving device when transmitting data using a method such as MMT.

例えば、さらに、前記所定のパケット数以下の前記複数の伝送パケットを所定のデータ単位に格納し、前記送信では、互いに重ならない前記単位時間の期間であって、連続した複数の単位伝送期間のそれぞれについて、当該単位伝送期間において前記所定のデータ単位を送信することで、前記単位時間あたりに前記所定のパケット数以下の前記複数の伝送パケットを送信してもよい。 For example, further, the plurality of transmission packets having a number equal to or less than the predetermined number of packets are stored in a predetermined data unit, and in the transmission, each of the plurality of consecutive unit transmission periods is a period of the unit time that does not overlap with each other. In this case, by transmitting the predetermined data unit in the unit transmission period, the plurality of transmission packets equal to or less than the predetermined number of packets may be transmitted per unit time.

例えば、前記所定のパケット数は、前記所定のデータ単位の伝送レートに応じて設定された、互いに異なる値である複数のパケット数を含み、前記格納では、前記所定のデータ単位の前記伝送レートに対応するパケット数以下の前記複数の伝送パケットを前記所定のデータ単位に格納してもよい。 For example, the predetermined number of packets includes a plurality of packet numbers having different values set according to the transmission rate of the predetermined data unit; The plurality of transmission packets, the number of which is equal to or less than the corresponding number of packets, may be stored in the predetermined data unit.

例えば、前記複数の伝送パケットは、無効なパケットとは異なる実パケットを格納している第１伝送パケットと、前記無効なパケットを格納している第２伝送パケットとを含み、前記格納では、前記第１伝送パケットを前記所定のデータ単位に格納する場合、前記第１伝送パケットの数を第１パケット数としてカウントし、前記第２伝送パケットを前記所定のデータ単位に格納する場合、当該第２伝送パケットのパケットサイズを前記第１伝送パケットのパケットサイズの最大値で除すことにより得られた整数値を第２パケット数としてカウントし、カウントすることにより得られた第１パケット数及び第２パケット数の総和が前記所定のパケット数以下となるように、前記複数の伝送パケットを前記所定のデータ単位に格納してもよい。 For example, the plurality of transmission packets include a first transmission packet storing a real packet different from an invalid packet, and a second transmission packet storing the invalid packet, and in the storage, the When storing the first transmission packet in the predetermined data unit, the number of the first transmission packet is counted as the first packet number, and when storing the second transmission packet in the predetermined data unit, the second transmission packet is counted as the first packet number. The integer value obtained by dividing the packet size of the transmission packet by the maximum value of the packet size of the first transmission packet is counted as the second packet number, and the first packet number and the second packet number obtained by counting are counted. The plurality of transmission packets may be stored in the predetermined data unit so that the total number of packets is equal to or less than the predetermined number of packets.

例えば、前記無効なパケットは、ＮＵＬＬパケットであってもよい。 For example, the invalid packet may be a NULL packet.

例えば、前記第１伝送パケットのパケットサイズの前記最大値は、１５００Ｂであってもよい。 For example, the maximum value of the packet size of the first transmission packet may be 1500B.

例えば、前記所定のデータ単位は、伝送フレームであり、前記単位伝送期間は、前記伝送フレームに対応する伝送フレーム長であってもよい。 For example, the predetermined data unit may be a transmission frame, and the unit transmission period may be a transmission frame length corresponding to the transmission frame.

例えば、前記所定のパケット数は、最小パケットサイズの前記伝送パケットが連続しないように設定された値であってもよい。 For example, the predetermined number of packets may be a value set so that the transmission packets of the minimum packet size are not consecutive.

本発明の一態様に係る受信方法は、バッファを有する受信装置における受信方法であって、それぞれが可変長のパケットヘッダおよび可変長のペイロードで構成される複数の伝送パケットを受信し、受信された前記複数の伝送パケットに格納されている可変長のパケットヘッダおよび可変長のペイロードで構成される第１パケットを、ヘッダ伸張された固定長のパケットヘッダを有する第２パケットに前記バッファを用いて変換し、変換することにより得られた前記第２パケットを所定の引き抜きレートで前記バッファから出力し、前記バッファのバッファサイズは、前記複数の伝送パケットの伝送レート、前記伝送パケットの平均パケット長、及び前記引き抜きレートを用いて求められる最大バッファ占有量よりも大きい。 A receiving method according to one aspect of the present invention is a receiving method in a receiving device having a buffer, in which a plurality of transmission packets each including a variable-length packet header and a variable-length payload are received, and the received converting a first packet consisting of a variable-length packet header and a variable-length payload stored in the plurality of transmission packets into a second packet having a header-expanded fixed-length packet header using the buffer; and outputs the second packet obtained by the conversion from the buffer at a predetermined extraction rate, and the buffer size of the buffer is determined by the transmission rate of the plurality of transmission packets, the average packet length of the transmission packets, and This is larger than the maximum buffer occupancy determined using the extraction rate.

なお、これらの包括的または具体的な態様は、システム、装置、集積回路、コンピュータプログラムまたはコンピュータ読み取り可能なＣＤ－ＲＯＭなどの記録媒体記録媒体で実現されてもよく、システム、装置、集積回路、コンピュータプログラムまたは記録媒体の任意な組み合わせで実現されてもよい。 Note that these comprehensive or specific aspects may be realized by a system, a device, an integrated circuit, a computer program, or a recording medium such as a computer-readable CD-ROM, and the system, device, integrated circuit, It may be realized by any combination of computer programs or recording media.

以下、実施の形態について、図面を参照しながら具体的に説明する。 Hereinafter, embodiments will be specifically described with reference to the drawings.

なお、以下で説明する実施の形態は、いずれも包括的または具体的な例を示すものである。以下の実施の形態で示される数値、形状、材料、構成要素、構成要素の配置位置及び接続形態、ステップ、ステップの順序などは、一例であり、本発明を限定する主旨ではない。また、以下の実施の形態における構成要素のうち、最上位概念を示す独立請求項に記載されていない構成要素については、任意の構成要素として説明される。 Note that the embodiments described below are all inclusive or specific examples. The numerical values, shapes, materials, components, arrangement positions and connection forms of the components, steps, order of steps, etc. shown in the following embodiments are merely examples, and do not limit the present invention. Further, among the constituent elements in the following embodiments, constituent elements that are not described in the independent claims indicating the most significant concept will be described as arbitrary constituent elements.

（本発明の基礎となった知見）
近年、ＴＶ、スマートフォン、又はタブレット端末などのディスプレイの高解像度化が進んでいる。特に日本国内の放送においては２０２０年に８Ｋ４Ｋ（解像度が８Ｋ×４Ｋ）のサービスが予定されている。８Ｋ４Ｋなどの超高解像度の動画像においては、単一の復号器では実時間での復号が困難であるため、複数の復号器を用いて並列に復号処理を行う手法が検討されている。 (Findings that formed the basis of the present invention)
In recent years, the resolution of displays such as TVs, smartphones, and tablet terminals has been increasing. In particular, 8K4K (8K x 4K resolution) services are planned for broadcasting in Japan in 2020. Since it is difficult to decode ultra-high resolution moving images such as 8K4K in real time with a single decoder, methods of performing decoding processing in parallel using multiple decoders are being considered.

符号化データはＭＰＥＧ－２ＴＳやＭＭＴなどの多重化方式に基づいて多重化して送信されるため、受信装置は、復号に先立って、多重化データから動画の符号化データを分離する必要がある。以下、多重化データから符号化データを分離する処理を逆多重化と呼ぶ。 Since encoded data is multiplexed and transmitted based on a multiplexing method such as MPEG-2 TS or MMT, the receiving device needs to separate the video encoded data from the multiplexed data before decoding. . Hereinafter, the process of separating encoded data from multiplexed data will be referred to as demultiplexing.

復号処理を並列化する際には、各復号器のそれぞれに対して、復号対象となる符号化データを振り分ける必要がある。符号化データを振り分ける際には、符号化データそのものを解析する必要があり、特に８Ｋなどのコンテンツにおいてはビットレートが非常に高いことから、解析に係る処理負荷が大きい。したがって、逆多重化の部分がボトルネックとなり実時間での再生が行えないという課題があった。 When parallelizing decoding processing, it is necessary to distribute encoded data to be decoded to each decoder. When distributing encoded data, it is necessary to analyze the encoded data itself, and since the bit rate is particularly high in content such as 8K, the processing load associated with analysis is large. Therefore, there was a problem that the demultiplexing part became a bottleneck and reproduction could not be performed in real time.

ところで、ＭＰＥＧとＩＴＵにより規格化されたＨ．２６４及びＨ．２６５などの動画像符号化方式においては、送信装置は、ピクチャをスライス又はスライスセグメントと呼ばれる複数の領域に分割し、分割したそれぞれの領域を独立に復号できるように符号化することができる。従って、例えば、Ｈ．２６５の場合には、放送を受信する受信装置は、受信データからスライスセグメント毎のデータを分離し、各スライスセグメントのデータを別々の復号器に出力することで、復号処理の並列化を実現できる。 By the way, H. 264 and H. In a video encoding system such as H.265, a transmitting device can divide a picture into a plurality of regions called slices or slice segments, and can encode each divided region so that it can be independently decoded. Thus, for example, H. In the case of H.265, a receiving device that receives broadcasting can realize parallel decoding processing by separating the data for each slice segment from the received data and outputting the data for each slice segment to separate decoders. .

図１は、ＨＥＶＣにおいて、１つのピクチャを４つのスライスセグメントに分割する例を示す図である。例えば、受信装置は４つの復号器を備え、各復号器が４つのスライスセグメントのうちいずれかを復号する。 FIG. 1 is a diagram showing an example of dividing one picture into four slice segments in HEVC. For example, the receiving device includes four decoders, each decoder decoding one of the four slice segments.

従来の放送においては、送信装置は、１枚のピクチャ（ＭＰＥＧシステム規格におけるアクセスユニット）を１つのＰＥＳパケットに格納し、ＰＥＳパケットをＴＳパケット列に多重化する。このため、受信装置は、ＰＥＳパケットのペイロードを分離したうえで、ペイロードに格納されたアクセスユニットのデータを解析することで、各スライスセグメントを分離し、分離された各スライスセグメントのデータを復号器に出力する必要があった。 In conventional broadcasting, a transmitting device stores one picture (an access unit in the MPEG system standard) in one PES packet, and multiplexes the PES packet into a TS packet string. Therefore, the receiving device separates the payload of the PES packet, analyzes the access unit data stored in the payload, separates each slice segment, and decodes the data of each separated slice segment. It was necessary to output to .

しかしながら、アクセスユニットのデータを解析してスライスセグメントを分離する際の処理量が大きいため、この処理を実時間で行うことが困難であるという課題があることを本発明者は見出した。 However, the inventors have found that there is a problem in that it is difficult to perform this processing in real time because the amount of processing required to analyze access unit data and separate slice segments is large.

図２は、スライスセグメントに分割されたピクチャのデータが、ＰＥＳパケットのペイロードに格納される例を示す図である。 FIG. 2 is a diagram illustrating an example in which data of a picture divided into slice segments is stored in the payload of a PES packet.

図２に示すように、例えば、複数のスライスセグメント（スライスセグメント１～４）のデータが１つのＰＥＳパケットのペイロードに格納される。また、ＰＥＳパケットはＴＳパケット列に多重化される。 As shown in FIG. 2, for example, data of a plurality of slice segments (slice segments 1 to 4) are stored in the payload of one PES packet. Furthermore, the PES packets are multiplexed into a TS packet sequence.

（実施の形態１）
以下では、動画像の符号化方式としてＨ．２６５を用いる場合を例に説明するが、Ｈ．２６４など他の符号化方式を用いる場合にも本実施の形態を適用できる。 (Embodiment 1)
In the following, H. The following will explain the case using H.265 as an example. This embodiment can also be applied when other encoding methods such as H.264 are used.

図３は、本実施の形態におけるアクセスユニット（ピクチャ）を分割単位に分割した例を示す図である。アクセスユニットは、Ｈ．２６５によって導入されたタイルと呼ばれる機能により、水平及び垂直方向にそれぞれ２等分され、合計４つのタイルに分割される。また、スライスセグメントとタイルは１対１に対応付けられる。 FIG. 3 is a diagram showing an example in which the access unit (picture) in this embodiment is divided into division units. The access unit is H. A function called tile introduced by H.265 divides the screen into two equal parts horizontally and vertically, for a total of four tiles. Furthermore, slice segments and tiles have a one-to-one correspondence.

このように水平及び垂直方向に２等分する理由について説明する。まず、復号時には、一般的に水平１ラインのデータを格納するラインメモリが必要となるが、８Ｋ４Ｋなどの超高解像度になると、水平方向のサイズが大きくなるためラインメモリのサイズが増加する。受信装置の実装においては、ラインメモリのサイズを低減できることが望ましい。ラインメモリのサイズを低減するためには垂直方向の分割が必要となる。垂直方向の分割にはタイルというデータ構造が必要である。これらの理由により、タイルが用いられる。 The reason why it is divided into two in the horizontal and vertical directions will be explained. First, during decoding, a line memory that stores one horizontal line of data is generally required, but when it comes to ultra-high resolution such as 8K4K, the size of the line memory increases because the horizontal size increases. In implementing a receiving device, it is desirable to be able to reduce the size of line memory. Vertical partitioning is required to reduce the size of the line memory. Vertical partitioning requires data structures called tiles. For these reasons, tiles are used.

一方で、画像は一般的に水平方向の相関が高いため、水平方向に広い範囲を参照できるほうが符号化効率は向上する。従って、符号化効率の観点ではアクセスユニットが水平方向に分割されることが望ましい。 On the other hand, since images generally have a high correlation in the horizontal direction, encoding efficiency is improved if a wider range can be referenced in the horizontal direction. Therefore, from the viewpoint of coding efficiency, it is desirable that the access unit be divided horizontally.

アクセスユニットが水平及び垂直方向に２等分されることで、これら２つの特性を両立させ、実装面、及び符号化効率の両面を考慮できる。単一の復号器が４Ｋ２Ｋの動画像を実時間での復号が可能の場合には、８Ｋ４Ｋの画像が４等分され、各々のスライスセグメントが４Ｋ２Ｋとなるように分割されることで、受信装置は、８Ｋ４Ｋの画像を実時間で復号できる。 By dividing the access unit into two in the horizontal and vertical directions, these two characteristics can be achieved simultaneously, and both aspects of implementation and encoding efficiency can be considered. If a single decoder is capable of decoding 4K2K video images in real time, the 8K4K image is divided into four equal slices, and each slice segment is divided into 4K2K segments. can decode 8K4K images in real time.

次に、アクセスユニットが水平及び垂直方向に分割されることで得られたタイルとスライスセグメントとを１対１に対応付ける理由について説明する。Ｈ．２６５においては、アクセスユニットは複数のＮＡＬ（ＮｅｔｗｏｒｋＡｄａｐｔａｔｉｏｎＬａｙｅｒ）ユニットと呼ばれる単位から構成される。 Next, the reason why the tiles obtained by dividing the access unit in the horizontal and vertical directions and the slice segments are associated one-to-one will be explained. H. In H.265, an access unit is composed of a plurality of units called NAL (Network Adaptation Layer) units.

ＮＡＬユニットのペイロードは、アクセスユニットの開始位置を示すアクセスユニットデリミタ、シーケンス単位で共通に用いられる復号時の初期化情報であるＳＰＳ（ＳｅｑｕｅｎｃｅＰａｒａｍｅｔｅｒＳｅｔ）、ピクチャ内で共通に用いられる復号時の初期化情報であるＰＰＳ（ＰｉｃｔｕｒｅＰａｒａｍｅｔｅｒＳｅｔ）、復号処理自体には不要であるが復号結果の処理及び表示などにおいて必要となるＳＥＩ（ＳｕｐｐｌｅｍｅｎｔａｌＥｎｈａｎｃｅｍｅｎｔＩｎｆｏｒｍａｔｉｏｎ）、並びに、スライスセグメントの符号化データなどのいずれかを格納する。ＮＡＬユニットのヘッダは、ペイロードに格納されるデータを識別するためのタイプ情報を含む。 The payload of the NAL unit includes an access unit delimiter that indicates the start position of the access unit, an SPS (Sequence Parameter Set) that is initialization information at the time of decoding that is commonly used in sequence units, and an initialization information at the time of decoding that is commonly used within the picture. PPS (Picture Parameter Set), which is encoding information, SEI (Supplemental Enhancement Information), which is not necessary for the decoding process itself but is necessary for processing and displaying the decoding results, and encoded data of slice segments, etc. Store. The header of a NAL unit includes type information to identify the data stored in the payload.

ここで、送信装置は、符号化データをＭＰＥＧ－２ＴＳ、ＭＭＴ（ＭＰＥＧＭｅｄｉａＴｒａｎｓｐｏｒｔ）、ＭＰＥＧＤＡＳＨ（ＤｙｎａｍｉｃＡｄａｐｔｉｖｅ
ＳｔｒｅａｍｉｎｇｏｖｅｒＨＴＴＰ）、又は、ＲＴＰ（Ｒｅａｌ－ｔｉｍｅＴｒａｎｓｐｏｒｔＰｒｏｔｏｃｏｌ）などの多重化フォーマットによって多重化する際には、基本単位をＮＡＬユニットに設定できる。１つのスライスセグメントを１つのＮＡＬユニットに格納するためには、アクセスユニットを領域に分割する際に、スライスセグメント単位に分割することが望ましい。このような理由から、送信装置は、タイルとスライスセグメントとを１対１に対応付ける。 Here, the transmitting device transmits the encoded data to MPEG-2 TS, MMT (MPEG Media Transport), MPEG DASH (Dynamic Adaptive
When multiplexing using a multiplexing format such as Streaming over HTTP (Streaming over HTTP) or RTP (Real-time Transport Protocol), the basic unit can be set to a NAL unit. In order to store one slice segment in one NAL unit, when dividing the access unit into areas, it is desirable to divide the access unit into slice segments. For this reason, the transmitting device makes a one-to-one correspondence between tiles and slice segments.

なお、図４に示すように、送信装置は、タイル１からタイル４までをまとめて１つのスライスセグメントに設定することも可能である。しかし、この場合には、１つのＮＡＬユニットに全てのタイルが格納されることになり、受信装置が、多重化レイヤにおいてタイルを分離することが困難である。 Note that, as shown in FIG. 4, the transmitting device can also set tiles 1 to 4 as one slice segment. However, in this case, all tiles are stored in one NAL unit, making it difficult for the receiving device to separate the tiles in the multiplexing layer.

なお、スライスセグメントには独立に復号可能な独立スライスセグメントと、独立スライスセグメントを参照する参照スライスセグメントとが存在するが、ここでは独立スライスセグメントが用いられる場合を説明する。 Note that the slice segments include independent slice segments that can be decoded independently and reference slice segments that refer to the independent slice segments. Here, a case where independent slice segments are used will be described.

図５は、図３に示すようにタイルとスライスセグメントとの境界が一致するように分割されたアクセスユニットのデータの例を示す図である。アクセスユニットのデータは、先頭に配置されたアクセスユニットデリミタが格納されるＮＡＬユニットと、その後に配置されるＳＰＳ、ＰＰＳ、及びＳＥＩのＮＡＬユニットと、その後に配置されるタイル１からタイル４までのデータが格納されたスライスセグメントのデータとを含む。なお、アクセスユニットのデータは、ＳＰＳ、ＰＰＳ及びＳＥＩのＮＡＬユニットの一部又は全てを含まなくてもよい。 FIG. 5 is a diagram illustrating an example of data of access units divided so that the boundaries between tiles and slice segments coincide as shown in FIG. 3. The access unit data consists of the NAL unit in which the access unit delimiter placed at the beginning is stored, the SPS, PPS, and SEI NAL units placed after that, and the tiles 1 to 4 placed after that. and the data of the slice segment in which the data is stored. Note that the access unit data does not need to include some or all of the SPS, PPS, and SEI NAL units.

次に、本実施の形態に係る送信装置１００の構成を説明する。図６は、本実施の形態に係る送信装置１００の構成例を示すブロック図である。この送信装置１００は、符号化部１０１と、多重化部１０２と、変調部１０３と、送信部１０４とを備える。 Next, the configuration of transmitting device 100 according to this embodiment will be described. FIG. 6 is a block diagram showing a configuration example of transmitting device 100 according to this embodiment. This transmitting device 100 includes an encoding section 101, a multiplexing section 102, a modulating section 103, and a transmitting section 104.

符号化部１０１は、入力画像を、例えば、Ｈ．２６５に従い符号化することで符号化データを生成する。また、符号化部１０１は、例えば、図３に示すように、アクセスユニットを４つのスライスセグメント（タイル）に分割し、各スライスセグメントを符号化する。 The encoding unit 101 converts the input image into, for example, H. Encoded data is generated by encoding according to H.265. Further, the encoding unit 101 divides the access unit into four slice segments (tiles) and encodes each slice segment, for example, as shown in FIG. 3.

多重化部１０２は、符号化部１０１により生成された符号化データを多重化する。変調部１０３は、多重化により得られたデータを変調する。送信部１０４は、変調後のデータを放送信号として送信する。 Multiplexing section 102 multiplexes the encoded data generated by encoding section 101. Modulating section 103 modulates data obtained by multiplexing. The transmitter 104 transmits the modulated data as a broadcast signal.

次に、本実施の形態に係る受信装置２００の構成を説明する。図７は、本実施の形態に係る受信装置２００の構成例を示すブロック図である。この受信装置２００は、チューナー２０１と、復調部２０２と、逆多重化部２０３と、複数の復号部２０４Ａ～２０４Ｄと、表示部２０５とを備える。 Next, the configuration of receiving device 200 according to this embodiment will be explained. FIG. 7 is a block diagram showing a configuration example of receiving device 200 according to this embodiment. This receiving device 200 includes a tuner 201, a demodulating section 202, a demultiplexing section 203, a plurality of decoding sections 204A to 204D, and a display section 205.

チューナー２０１は、放送信号を受信する。復調部２０２は、受信された放送信号を復調する。復調後のデータは逆多重化部２０３に入力される。 Tuner 201 receives broadcast signals. Demodulation section 202 demodulates the received broadcast signal. The demodulated data is input to demultiplexer 203.

逆多重化部２０３は、復調後のデータを分割単位に分離し、分割単位毎のデータを復号部２０４Ａ～２０４Ｄに出力する。ここで、分割単位とは、アクセスユニットが分割されることで得られた分割領域であり、例えば、Ｈ．２６５におけるスライスセグメントである。また、ここでは、８Ｋ４Ｋの画像が４つの４Ｋ２Ｋの画像に分割される。よって、４つの復号部２０４Ａ～２０４Ｄが存在する。 Demultiplexing section 203 separates the demodulated data into division units, and outputs the data for each division unit to decoding sections 204A to 204D. Here, the division unit is a division area obtained by dividing an access unit, and is, for example, a division area obtained by dividing an access unit. This is a slice segment in H.265. Further, here, an 8K4K image is divided into four 4K2K images. Therefore, there are four decoding units 204A to 204D.

複数の復号部２０４Ａ～２０４Ｄは、所定の基準クロックに基づいて互いに同期して動作する。各復号部は、アクセスユニットのＤＴＳ（ＤｅｃｏｄｉｎｇＴｉｍｅＳｔａｍｐ）に従って分割単位の符号化データを復号し、復号結果を表示部２０５に出力する。 The plurality of decoders 204A to 204D operate in synchronization with each other based on a predetermined reference clock. Each decoding section decodes the encoded data in units of division according to the DTS (Decoding Time Stamp) of the access unit, and outputs the decoding result to the display section 205.

表示部２０５は、複数の復号部２０４Ａ～２０４Ｄから出力された複数の復号結果を統合することで８Ｋ４Ｋの出力画像を生成する。表示部２０５は、別途取得したアクセスユニットのＰＴＳ（ＰｒｅｓｅｎｔａｔｉｏｎＴｉｍｅＳｔａｍｐ）に従って、生成された出力画像を表示する。なお、表示部２０５は、復号結果を統合する際に、タイルの境界など、互いに隣接する分割単位の境界領域において、当該境界が視覚的に目立たなくなるようにデブロックフィルタなどのフィルタ処理を行ってもよい。 The display unit 205 generates an 8K4K output image by integrating the multiple decoding results output from the multiple decoding units 204A to 204D. The display unit 205 displays the generated output image according to the PTS (Presentation Time Stamp) of the access unit that is obtained separately. Note that when integrating the decoding results, the display unit 205 performs filter processing such as a deblock filter on the boundary areas of mutually adjacent division units, such as the boundaries of tiles, so that the boundaries become visually inconspicuous. Good too.

なお、上記では、放送の送信又は受信を行う送信装置１００及び受信装置２００を例に説明したが、コンテンツは通信ネットワーク経由で送信及び受信されてもよい。受信装置２００が、通信ネットワーク経由でコンテンツを受信する場合には、受信装置２００は、イーサーネットなどのネットワークにより受信したＩＰパケットから多重化データを分離する。 Note that although the above description has been made using the transmitting device 100 and the receiving device 200 that transmit or receive broadcasts as an example, content may be transmitted and received via a communication network. When the receiving device 200 receives content via a communication network, the receiving device 200 separates multiplexed data from IP packets received via a network such as Ethernet.

放送においては、放送信号が送信されてから受信装置２００に届くまでの間の伝送路遅延は一定である。一方、インターネットなどの通信ネットワークにおいては輻輳の影響により、サーバーから送信されたデータが受信装置２００に届くまでの伝送路遅延は一定でない。従って、受信装置２００は、放送のＭＰＥＧ－２ＴＳにおけるＰＣＲのような基準クロックに基づいた厳密な同期再生を行わないことが多い。そのため、受信装置２００は、各復号部を厳密に同期させることはせずに、表示部において８Ｋ４Ｋの出力画像をＰＴＳに従って表示してもよい。 In broadcasting, the transmission path delay from when a broadcast signal is transmitted until it reaches receiving device 200 is constant. On the other hand, in a communication network such as the Internet, the transmission path delay until data transmitted from the server reaches the receiving device 200 is not constant due to the influence of congestion. Therefore, the receiving device 200 often does not perform strictly synchronized reproduction based on a reference clock such as PCR in MPEG-2 TS for broadcasting. Therefore, the receiving device 200 may display the 8K4K output image on the display unit according to the PTS without strictly synchronizing each decoding unit.

また、通信ネットワークの輻輳などにより、全ての分割単位の復号処理がアクセスユニットのＰＴＳで示される時刻において完了していない場合がある。この場合には、受信装置２００は、アクセスユニットの表示をスキップする、又は、少なくとも４つの分割単位の復号が終了し、８Ｋ４Ｋの画像の生成が完了するまで表示を遅延させる。 Furthermore, due to communication network congestion, decoding processing for all division units may not be completed at the time indicated by the PTS of the access unit. In this case, the receiving device 200 skips the display of the access unit or delays the display until the decoding of at least four division units is completed and the generation of the 8K4K image is completed.

なお、放送と通信とを併用してコンテンツが送信及び受信されてもよい。また、ハードディスク又はメモリなどの記録媒体に格納された多重化データを再生する際にも本手法を適用可能である。 Note that content may be transmitted and received using both broadcasting and communication. Further, this method can also be applied when reproducing multiplexed data stored in a recording medium such as a hard disk or a memory.

次に、多重化方式としてＭＭＴが用いられる場合の、スライスセグメントに分割されたアクセスユニットの多重化方法にについて説明する。 Next, a method for multiplexing access units divided into slice segments when MMT is used as the multiplexing method will be described.

図８は、ＨＥＶＣのアクセスユニットのデータを、ＭＭＴにパケット化する際の例を示す図である。ＳＰＳ、ＰＰＳ及びＳＥＩなどはアクセスユニットに必ずしも含まれる必要はないが、ここでは存在する場合について例示する。 FIG. 8 is a diagram illustrating an example of packetizing HEVC access unit data into MMT. Although SPS, PPS, SEI, etc. do not necessarily need to be included in the access unit, a case where they are present will be exemplified here.

アクセスユニットデリミタ、ＳＰＳ、ＰＰＳ、及びＳＥＩなどのアクセスユニット内で先頭のスライスセグメントよりも前に配置されるＮＡＬユニットは一纏めにしてＭＭＴパケット＃１に格納される。後続のスライスセグメントは、スライスセグメント毎に別々のＭＭＴパケットに格納される。 NAL units arranged before the first slice segment in the access unit, such as the access unit delimiter, SPS, PPS, and SEI, are collectively stored in MMT packet #1. Subsequent slice segments are stored in separate MMT packets for each slice segment.

なお、図９に示すように、アクセスユニット内で先頭のスライスセグメントよりも前に配置されるＮＡＬユニットが、先頭のスライスセグメントと同一のＭＭＴパケットに格納されてもよい。 Note that, as shown in FIG. 9, a NAL unit placed before the first slice segment in the access unit may be stored in the same MMT packet as the first slice segment.

また、シーケンス又はストリームの終端を示す、Ｅｎｄ－ｏｆ－Ｓｅｑｕｅｎｃｅ又はＥｎｄ－ｏｆ－ＢｉｔｓｔｒｅａｍなどのＮＡＬユニットが最終スライスセグメントの後に付加される場合には、これらは、最終スライスセグメントと同一のＭＭＴパケットに格納される。ただし、Ｅｎｄ－ｏｆ－Ｓｅｑｕｅｎｃｅ又はＥｎｄ－ｏｆ－ＢｉｔｓｔｒｅａｍなどのＮＡＬユニットは、復号処理の終了ポイント、又は２本のストリームの接続ポイントなどに挿入されるため、受信装置２００が、これらのＮＡＬユニットを、多重化レイヤにおいて容易に取得できることが望ましい場合がある。この場合には、これらのＮＡＬユニットは、スライスセグメントとは別のＭＭＴパケットに格納されてもよい。これにより、受信装置２００は、多重化レイヤにおいてこれらのＮＡＬユニットを容易に分離できる。 Additionally, if a NAL unit such as End-of-Sequence or End-of-Bitstream indicating the end of a sequence or stream is added after the final slice segment, these units are included in the same MMT packet as the final slice segment. Stored. However, since NAL units such as End-of-Sequence or End-of-Bitstream are inserted at the end point of decoding processing or the connection point of two streams, the receiving device 200 , may be desirable to be readily available at the multiplexing layer. In this case, these NAL units may be stored in a separate MMT packet from the slice segment. Thereby, receiving device 200 can easily separate these NAL units in the multiplexing layer.

なお、多重化方式として、ＴＳ、ＤＡＳＨ又はＲＴＰなどが用いられてもよい。これらの方式においても、送信装置１００は、異なるスライスセグメントをそれぞれ異なるパケットに格納する。これにより、受信装置２００が多重化レイヤにおいてスライスセグメントを分離できることを保証できる。 Note that TS, DASH, RTP, or the like may be used as the multiplexing method. In these methods as well, the transmitting device 100 stores different slice segments in different packets. This ensures that the receiving device 200 can separate slice segments in the multiplex layer.

例えば、ＴＳが用いられる場合、スライスセグメント単位で符号化データがＰＥＳパケット化される。ＲＴＰが用いられる場合、スライスセグメント単位で符号化データがＲＴＰパケット化される。これらの場合においても、図８に示すＭＭＴパケット＃１のように、スライスセグメントよりも前に配置されるＮＡＬユニットとスライスセグメントとが別々にパケット化されてもよい。 For example, when TS is used, encoded data is converted into PES packets in units of slice segments. When RTP is used, encoded data is converted into RTP packets in units of slice segments. Even in these cases, the NAL unit placed before the slice segment and the slice segment may be packetized separately, as in MMT packet #1 shown in FIG. 8.

ＴＳが用いられる場合、送信装置１００は、ｄａｔａａｌｉｇｎｍｅｎｔ記述子を用いることなどにより、ＰＥＳパケットに格納されるデータの単位を示す。また、ＤＡＳＨはセグメントと呼ばれるＭＰ４形式のデータ単位をＨＴＴＰなどによりダウンロードする方式であるため、送信装置１００は、送信にあたって符号化データのパケット化は行わない。このため、送信装置１００は、受信装置２００がＭＰ４において多重化レイヤでスライスセグメントを検出できるように、スライスセグメント単位でサブサンプルを作成し、サブサンプルの格納位置を示す情報をＭＰ４のヘッダに格納してもよい。 When a TS is used, the transmitting device 100 indicates the unit of data stored in the PES packet, such as by using a data alignment descriptor. Furthermore, since DASH is a method of downloading MP4 format data units called segments using HTTP or the like, the transmitting device 100 does not packetize encoded data during transmission. Therefore, the transmitting device 100 creates sub-samples for each slice segment and stores information indicating the storage location of the sub-samples in the MP4 header so that the receiving device 200 can detect the slice segments in the multiplex layer in MP4. You may.

以下、スライスセグメントのＭＭＴパケット化について、詳細に説明する。 MMT packetization of slice segments will be described in detail below.

図８に示すように、符号化データがパケット化されることで、ＳＰＳ及びＰＰＳなどのアクセスユニット内の全スライスセグメントの復号時に共通に参照されるデータがＭＭＴパケット＃１に格納される。この場合、受信装置２００は、ＭＭＴパケット＃１のペイロードデータと各スライスセグメントのデータとを連結し、得られたデータを復号部に出力する。このように、受信装置２００は、複数のＭＭＴパケットのペイロードを連結することで、復号部への入力データを容易に生成できる。 As shown in FIG. 8, by packetizing encoded data, data that is commonly referred to when decoding all slice segments in an access unit such as SPS and PPS is stored in MMT packet #1. In this case, receiving device 200 concatenates the payload data of MMT packet #1 and the data of each slice segment, and outputs the obtained data to the decoding unit. In this way, the receiving device 200 can easily generate input data to the decoding unit by concatenating payloads of multiple MMT packets.

図１０は、図８に示すＭＭＴパケットから復号部２０４Ａ～２０４Ｄへの入力データが生成される例を示す図である。逆多重化部２０３は、ＭＭＴパケット＃１とＭＭＴパケット＃２とのペイロードデータを連結させることで、復号部２０４Ａが、スライスセグメント１を復号するために必要なデータを生成する。逆多重化部２０３は、復号部２０４Ｂから復号部２０４Ｄについても、同様に入力データを生成する。つまり、逆多重化部２０３は、ＭＭＴパケット＃１とＭＭＴパケット＃３とのペイロードデータを連結させることで、復号部２０４Ｂの入力データを生成する。逆多重化部２０３は、ＭＭＴパケット＃１とＭＭＴパケット＃４とのペイロードデータを連結させることで、復号部２０４Ｃの入力データを生成する。逆多重化部２０３は、ＭＭＴパケット＃１とＭＭＴパケット＃５とのペイロードデータを連結させることで、復号部２０４Ｄの入力データを生成する。 FIG. 10 is a diagram showing an example in which input data to the decoding units 204A to 204D is generated from the MMT packet shown in FIG. 8. The demultiplexing unit 203 generates data necessary for the decoding unit 204A to decode slice segment 1 by concatenating the payload data of MMT packet #1 and MMT packet #2. Demultiplexing section 203 similarly generates input data for decoding sections 204B to 204D. That is, the demultiplexer 203 generates input data for the decoder 204B by concatenating the payload data of MMT packet #1 and MMT packet #3. The demultiplexing unit 203 generates input data for the decoding unit 204C by concatenating the payload data of MMT packet #1 and MMT packet #4. Demultiplexing section 203 generates input data for decoding section 204D by concatenating payload data of MMT packet #1 and MMT packet #5.

なお、逆多重化部２０３は、アクセスユニットデリミタ及びＳＥＩなど、復号処理に必要ではないＮＡＬユニットを、ＭＭＴパケット＃１のペイロードデータから除去し、復号処理に必要であるＳＰＳ及びＰＰＳのＮＡＬユニットのみを分離してスライスセグメントのデータに付加してもよい。 Note that the demultiplexing unit 203 removes NAL units that are not necessary for decoding processing, such as the access unit delimiter and SEI, from the payload data of MMT packet #1, and extracts only the SPS and PPS NAL units that are necessary for decoding processing. may be separated and added to the slice segment data.

図９に示すように符号化データがパケット化される場合には、逆多重化部２０３は、多重化レイヤにおいてアクセスユニットの先頭データを含むＭＭＴパケット＃１を１番目の復号部２０４Ａに出力する。また、逆多重化部２０３は、多重化レイヤにおいてアクセスユニットの先頭データを含むＭＭＴパケットを解析し、ＳＰＳ及びＰＰＳのＮＡＬユニットを分離し、分離したＳＰＳ及びＰＰＳのＮＡＬユニットを２番目以降のスライスセグメントのデータの各々に付加することで２番目以降の復号部の各々に対する入力データを生成する。 When the encoded data is packetized as shown in FIG. 9, the demultiplexing section 203 outputs MMT packet #1 containing the first data of the access unit in the multiplexing layer to the first decoding section 204A. . In addition, the demultiplexer 203 analyzes the MMT packet including the first data of the access unit in the multiplexing layer, separates the SPS and PPS NAL units, and slices the separated SPS and PPS NAL units into the second and subsequent slices. By adding it to each piece of data in the segment, input data for each of the second and subsequent decoding units is generated.

さらに、受信装置２００が、ＭＭＴパケットのヘッダに含まれる情報を用いて、ＭＭＴペイロードに格納されるデータのタイプ、及び、ペイロードにスライスセグメントが格納されている場合のアクセスユニット内における当該スライスセグメントのインデックス番号を識別できることが望ましい。ここで、データのタイプとは、スライスセグメント前データ（アクセスユニット内で先頭スライスセグメントよりも前に配置されるＮＡＬユニットをまとめて、このように呼ぶことにする）、及び、スライスセグメントのデータのいずれである。ＭＭＴパケットに、スライスセグメントなどのＭＰＵをフラグメント化した単位を格納する場合には、ＭＦＵ（ＭｅｄｉａＦｒａｇｍｅｎｔＵｎｉｔ）を格納するためのモードが用いられる。送信装置１００は、本モードを用いる場合には、例えば、ＭＦＵにおけるデータの基本単位であるＤａｔａＵｎｉｔを、サンプル（ＭＭＴにおけるデータ単位であり、アクセスユニットに相当する）、又は、サブサンプル（サンプルを分割した単位）に設定できる。 Further, the receiving device 200 uses the information included in the header of the MMT packet to determine the type of data stored in the MMT payload and, if a slice segment is stored in the payload, the slice segment in the access unit. It is desirable to be able to identify the index number. Here, the data type refers to pre-slice segment data (NAL units placed before the first slice segment in an access unit are collectively called this) and slice segment data. Either way. When storing a unit in which an MPU is fragmented, such as a slice segment, in an MMT packet, a mode for storing an MFU (Media Fragment Unit) is used. When using this mode, the transmitting device 100 converts the Data Unit, which is the basic unit of data in the MFU, into a sample (which is a data unit in MMT and corresponds to an access unit) or a subsample (which is a data unit in MMT and corresponds to an access unit). (divided units).

このとき、ＭＭＴパケットのヘッダは、Ｆｒａｇｍｅｎｔａｔｉｏｎｉｎｄｉｃａｔｏｒと呼ばれるフィールドと、Ｆｒａｇｍｅｎｔｃｏｕｎｔｅｒと呼ばれるフィールドとを含む。 At this time, the header of the MMT packet includes a field called a Fragmentation indicator and a field called a Fragment counter.

Ｆｒａｇｍｅｎｔａｔｉｏｎｉｎｄｉｃａｔｏｒは、ＭＭＴパケットのペイロードに格納されるデータが、Ｄａｔａｕｎｉｔをフラグメント化したものであるかどうか、フラグメント化したものである場合には、当該フラグメントがＤａｔａｕｎｉｔにおける先頭或いは最終のフラグメント、又は、先頭と最終とのどちらでもないフラグメントであるかを示す。言い換えると、あるパケットのヘッダに含まれるＦｒａｇｍｅｎｔａｔｉｏｎｉｎｄｉｃａｔｏｒは、（１）基本データ単位であるＤａｔａｕｎｉｔに当該パケットのみが含まれる、（２）Ｄａｔａｕｎｉｔが複数のパケットに分割して格納され、かつ、当該パケットがＤａｔａｕｎｉｔの先頭のパケットである、（３）Ｄａｔａｕｎｉｔが複数のパケットに分割して格納され、かつ、当該パケットがＤａｔａｕｎｉｔの先頭及び最後以外のパケットである、及び、（４）Ｄａｔａｕｎｉｔが複数のパケットに分割して格納され、かつ、当該パケットがＤａｔａｕｎｉｔの最後のパケットである、のいずれであるかを示す識別情報である。 The Fragmentation indicator indicates whether the data stored in the payload of the MMT packet is a fragmented Data unit, and if so, whether the fragment is the first or last fragment in the Data unit, or , indicates whether the fragment is neither the beginning nor the end. In other words, the Fragmentation indicator included in the header of a certain packet indicates that (1) only the packet is included in the Data unit, which is a basic data unit, (2) the Data unit is stored divided into multiple packets, and The packet is the first packet of the Data unit, (3) the Data unit is stored divided into multiple packets, and the packet is a packet other than the first and last packet of the Data unit, and (4) This is identification information indicating whether a Data unit is stored divided into a plurality of packets and whether the packet is the last packet of the Data unit.

Ｆｒａｇｍｅｎｔｃｏｕｎｔｅｒは、ＭＭＴパケットに格納されるデータが、Ｄａｔａｕｎｉｔにおいて何番目のフラグメントに相当するかを示すインデックス番号である。 The Fragment counter is an index number indicating which fragment in the Data unit the data stored in the MMT packet corresponds to.

従って、送信装置１００が、ＭＭＴにおけるサンプルをＤａｔａｕｎｉｔに設定し、スライスセグメント前データ、及び、各スライスセグメントを、それぞれＤａｔａｕｎｉｔのフラグメント単位に設定することで、受信装置２００は、ＭＭＴパケットのヘッダに含まれる情報を用いて、ペイロードに格納されるデータのタイプが識別できる。つまり、逆多重化部２０３は、ＭＭＴパケットのヘッダを参照して、各復号部２０４Ａ～２０４Ｄへの入力データを生成できる。 Therefore, by the transmitting device 100 setting samples in MMT in the Data unit, and setting the pre-slice segment data and each slice segment in fragment units of the Data unit, the receiving device 200 can read the header of the MMT packet. The information contained in the payload can be used to identify the type of data stored in the payload. In other words, the demultiplexing section 203 can generate input data to each of the decoding sections 204A to 204D by referring to the header of the MMT packet.

図１１は、サンプルがＤａｔａｕｎｉｔに設定され、スライスセグメント前データ、及び、スライスセグメントがＤａｔａｕｎｉｔのフラグメントとしてパケット化される場合の例を示す図である。 FIG. 11 is a diagram illustrating an example in which a sample is set in a Data unit, and pre-slice segment data and slice segments are packetized as a fragment of the Data unit.

スライスセグメント前データ、及びスライスセグメントは、フラグメント＃１からフラグメント＃５までの５つのフラグメントに分割される。各フラグメントは個別のＭＭＴパケットに格納される。このとき、ＭＭＴパケットのヘッダに含まれるＦｒａｇｍｅｎｔａｔｉｏｎｉｎｄｉｃａｔｏｒ及びＦｒａｇｍｅｎｔｃｏｕｎｔｅｒの値は図示する通りである。 The pre-slice segment data and the slice segment are divided into five fragments from fragment #1 to fragment #5. Each fragment is stored in a separate MMT packet. At this time, the values of the Fragmentation indicator and Fragment counter included in the header of the MMT packet are as shown in the figure.

例えば、Ｆｒａｇｍｅｎｔｉｎｄｉｃａｔｏｒは、２進数の２ビット値である。Ｄａｔａｕｎｉｔの先頭であるＭＭＴパケット＃１のＦｒａｇｍｅｎｔｉｎｄｉｃａｔｏｒ、最終であるＭＭＴパケット＃５のＦｒａｇｍｅｎｔｉｎｄｉｃａｔｏｒ、及び、その間のパケットであるＭＭＴパケット＃２からＭＭＴパケット＃４までのＦｒａｇｍｅｎｔｉｎｄｉｃａｔｏｒは、それぞれ別の値に設定される。具体的には、Ｄａｔａｕｎｉｔの先頭であるＭＭＴパケット＃１のＦｒａｇｍｅｎｔｉｎｄｉｃａｔｏｒは０１に設定され、最終であるＭＭＴパケット＃５のＦｒａｇｍｅｎｔｉｎｄｉｃａｔｏｒは１１に設定され、その間のパケットであるＭＭＴパケット＃２からＭＭＴパケット＃４までのＦｒａｇｍｅｎｔｉｎｄｉｃａｔｏｒは１０に設定される。なお、Ｄａｔａｕｎｉｔに一つのＭＭＴパケットのみが含まれる場合には、Ｆｒａｇｍｅｎｔｉｎｄｉｃａｔｏｒは００に設定される。 For example, the Fragment indicator is a 2-bit binary value. The Fragment indicator of MMT packet #1 which is the beginning of the Data unit, the Fragment indicator of MMT packet #5 which is the last one, and the Fragment indicators of MMT packet #2 to MMT packet #4 which are the packets between them are each different. set to the value. Specifically, the Fragment indicator of MMT packet #1, which is the beginning of the Data unit, is set to 01, the Fragment indicator of MMT packet #5, which is the last, is set to 11, and the Fragment indicator of MMT packet #1, which is the beginning of the data unit, is set to 11. Fragment indicators up to MMT packet #4 are set to 10. Note that when the Data unit includes only one MMT packet, the Fragment indicator is set to 00.

また、Ｆｒａｇｍｅｎｔｃｏｕｎｔｅｒは、ＭＭＴパケット＃１においてはフラグメントの総数である５から１を減算した値である４であり、後続パケットにおいては順に１ずつ減少し、最後のＭＭＴパケット＃５においては０である。 In addition, the Fragment counter is 4, which is the value obtained by subtracting 1 from the total number of fragments, 5, in MMT packet #1, and decreases by 1 in subsequent packets, and is 0 in the final MMT packet #5. be.

従って、受信装置２００は、スライスセグメント前データを格納するＭＭＴパケットを、Ｆｒａｇｍｅｎｔｉｎｄｉｃａｔｏｒ、及び、Ｆｒａｇｍｅｎｔｃｏｕｎｔｅｒのいずれかを用いて識別できる。また、受信装置２００は、Ｎ番目のスライスセグメントを格納するＭＭＴパケットを、Ｆｒａｇｍｅｎｔｃｏｕｎｔｅｒを参照することにより識別できる。 Therefore, the receiving device 200 can identify the MMT packet storing the pre-slice segment data using either the Fragment indicator or the Fragment counter. Further, the receiving device 200 can identify the MMT packet storing the Nth slice segment by referring to the Fragment counter.

ＭＭＴパケットのヘッダは、別途、Ｄａｔａｕｎｉｔが属するＭｏｖｉｅＦｒａｇｍｅｎｔのＭＰＵ内でのシーケンス番号と、ＭＰＵ自体のシーケンス番号と、Ｄａｔａｕｎｉｔが属するサンプルのＭｏｖｉｅＦｒａｇｍｅｎｔ内におけるシーケンス番号とを含む。逆多重化部２０３は、これらを参照することで、Ｄａｔａｕｎｉｔが属するサンプルを一意に決定できる。 The header of the MMT packet separately includes a sequence number within the MPU of the Movie Fragment to which the Data unit belongs, a sequence number of the MPU itself, and a sequence number within the Movie Fragment of the sample to which the Data unit belongs. By referring to these, the demultiplexer 203 can uniquely determine the sample to which the Data unit belongs.

更に、逆多重化部２０３は、Ｄａｔａｕｎｉｔ内におけるフラグメントのインデックス番号をＦｒａｇｍｅｎｔｃｏｕｎｔｅｒなどから決定できるため、パケットロスが発生した場合にも、フラグメントに格納されるスライスセグメントを一意に特定できる。例えば、逆多重化部２０３は、図１１に示すフラグメント＃４がパケットロスにより取得できなかった場合でも、フラグメント＃３の次に受信したフラグメントがフラグメント＃５であることが分かるため、フラグメント＃５に格納されるスライスセグメント４を、復号部２０４Ｃではなく復号部２０４Ｄに正しく出力することができる。 Further, since the demultiplexer 203 can determine the index number of a fragment in the data unit from the Fragment counter, it can uniquely identify the slice segment stored in the fragment even if a packet loss occurs. For example, even if fragment #4 shown in FIG. 11 cannot be acquired due to packet loss, the demultiplexer 203 knows that the fragment received next to fragment #3 is fragment #5, so It is possible to correctly output slice segment 4 stored in the decoding section 204D to the decoding section 204D instead of the decoding section 204C.

なお、パケットロスが発生しないことが保証される伝送路が使用される場合には、逆多重化部２０３は、ＭＭＴパケットのヘッダを参照してＭＭＴパケットに格納されるデータのタイプ、又はスライスセグメントのインデックス番号を決定せずに、到着したパケットを周期的に処理すればよい。例えば、アクセスユニットが、スライス前データ、及び、４つのスライスセグメントの計５つのＭＭＴパケットにより送信される場合には、受信装置２００は、復号を開始するアクセスユニットのスライス前データを決定した後は、受信したＭＭＴパケットを順に処理することで、スライス前データ、及び、４つのスライスセグメントのデータを順に取得できる。 Note that when a transmission path that is guaranteed not to cause packet loss is used, the demultiplexer 203 refers to the header of the MMT packet to determine the type of data stored in the MMT packet or the slice segment. Arrived packets may be processed periodically without determining the index number of the packet. For example, when an access unit is transmitted using a total of five MMT packets including pre-slice data and four slice segments, the receiving device 200 determines the pre-slice data of the access unit to start decoding. , by sequentially processing the received MMT packets, the pre-slice data and the data of the four slice segments can be sequentially acquired.

以下、パケット化の変形例について説明する。 Modified examples of packetization will be described below.

スライスセグメントは、必ずしもアクセスユニットの面内を水平方向と垂直方向との両方に分割されたものである必要はなく、図１に示すように、アクセスユニットを水平方向のみに分割されたものでもよいし、垂直方向のみに分割されたものでもよい。 The slice segment does not necessarily have to be one in which the plane of the access unit is divided both horizontally and vertically, and may be one in which the access unit is divided only in the horizontal direction, as shown in FIG. However, it may be divided only in the vertical direction.

また、水平方向のみにアクセスユニットが分割される場合には、タイルが用いられる必要はない。 Furthermore, if the access unit is divided only in the horizontal direction, there is no need to use tiles.

また、アクセスユニットにおける面内の分割数は任意であり、４つに限定されるものではない。但し、スライスセグメント及びタイルの領域サイズはＨ．２６５などの符号化規格の下限以上である必要がある。 Furthermore, the number of in-plane divisions in the access unit is arbitrary and is not limited to four. However, the area size of slice segments and tiles is H. The value must be at least the lower limit of the encoding standard such as H.265.

送信装置１００は、アクセスユニットにおける面内の分割方法を示す識別情報を、ＭＭＴメッセージ、又はＴＳのデスクリプタなどに格納してもよい。例えば、面内における水平方向と垂直方向との分割数とをそれぞれ示す情報が格納されてもよい。または、図３に示すように水平方向及び垂直方向にそれぞれ２等分されている、又は、図１に示すように水平方向に４等分されているなど、分割方法に対して固有の識別情報が割り当てられてもよい。例えば、図３に示すようにアクセスユニットが分割されている場合は、識別情報はモード１を示し、図１に示すようにアクセスユニットが分割されている場合には、識別情報はモード１を示す。 The transmitting device 100 may store identification information indicating an in-plane division method in an access unit in an MMT message, a TS descriptor, or the like. For example, information indicating the number of divisions in the horizontal direction and vertical direction within the plane may be stored. Or identification information unique to the division method, such as divided into two equal parts horizontally and vertically as shown in Figure 3, or divided into four equal parts horizontally as shown in Figure 1. may be assigned. For example, when the access unit is divided as shown in FIG. 3, the identification information indicates mode 1, and when the access unit is divided as shown in FIG. 1, the identification information indicates mode 1. .

また、面内の分割方法に関連する符号化条件の制約を示す情報が、多重化レイヤに含まれてもよい。例えば、１つのスライスセグメントが１つのタイルから構成されること示す情報が用いられてもよい。または、スライスセグメント或いはタイルの復号時に動き補償を行う場合の参照ブロックが、画面内の同一位置のスライスセグメント或いはタイルに制限される、又は、隣接スライスセグメントにおける所定の範囲内のブロックに限定されることなどを示す情報が用いられてもよい。 Further, information indicating constraints on encoding conditions related to the intra-plane division method may be included in the multiplexing layer. For example, information indicating that one slice segment is composed of one tile may be used. Or, reference blocks when performing motion compensation when decoding slice segments or tiles are limited to slice segments or tiles at the same position in the screen, or to blocks within a predetermined range in adjacent slice segments. Information indicating such things may also be used.

また、送信装置１００は、動画像の解像度に応じて、アクセスユニットを複数のスライスセグメントに分割するかどうかを切替えてもよい。例えば、送信装置１００は、処理対象の動画像が４Ｋ２Ｋの解像度の場合には面内の分割を行わずに、処理対象の動画像が８Ｋ４Ｋの場合にはアクセスユニットを４つに分割してもよい。８Ｋ４Ｋの動画像の場合の分割方法を予め規定しておくことにより、受信装置２００は、受信する動画像の解像度を取得することで、面内の分割の有無、及び分割方法を決定し、復号動作を切替えることができる。 Further, the transmitting device 100 may switch whether to divide the access unit into a plurality of slice segments depending on the resolution of the moving image. For example, if the moving image to be processed has a resolution of 4K2K, the transmitting device 100 may not perform in-plane division, but if the moving image to be processed is 8K4K, it may divide the access unit into four. good. By predefining the division method for 8K4K video images, the receiving device 200 determines whether or not to divide within the screen and the division method by acquiring the resolution of the received video image, and performs decoding. You can switch the operation.

また、受信装置２００は、面内の分割の有無を、ＭＭＴパケットのヘッダを参照することにより検出できる。例えば、アクセスユニットが分割されない場合には、ＭＭＴのＤａｔａｕｎｉｔがサンプルに設定されていれば、Ｄａｔａｕｎｉｔのフラグメントは行われない。従って、受信装置２００は、ＭＭＴパケットのヘッダに含まれるＦｒａｇｍｅｎｔｃｏｕｎｔｅｒの値が常にゼロの場合には、アクセスユニットは分割されないと判定できる。または、受信装置２００は、Ｆｒａｇｍｅｎｔａｔｉｏｎｉｎｄｉｃａｔｏｒの値が常に０１であるかどうかを検出してもよい。受信装置２００は、Ｆｒａｇｍｅｎｔａｔｉｏｎｉｎｄｉｃａｔｏｒの値が常に０１の場合もアクセスユニットは分割されないと判定できる。 Furthermore, the receiving device 200 can detect whether or not there is division within a plane by referring to the header of the MMT packet. For example, if the access unit is not divided and the MMT Data unit is set to sample, the Data unit will not be fragmented. Therefore, the receiving device 200 can determine that the access unit is not divided if the value of the Fragment counter included in the header of the MMT packet is always zero. Alternatively, the receiving device 200 may detect whether the value of the Fragmentation indicator is always 01. The receiving device 200 can determine that the access unit is not divided even if the value of the Fragmentation indicator is always 01.

また、受信装置２００は、アクセスユニットにおける面内の分割数と復号部の数とが一致しない場合にも対応できる。例えば、受信装置２００が、８Ｋ２Ｋの符号化データを実時間で復号できる２つの復号部２０４Ａ及び２０４Ｂを備える場合には、逆多重化部２０３は、復号部２０４Ａに対して、８Ｋ４Ｋの符号化データを構成する４つのスライスセグメントのうちの２つを出力する。 Furthermore, the receiving device 200 can also handle a case where the number of in-plane divisions in an access unit and the number of decoding units do not match. For example, when receiving device 200 includes two decoding sections 204A and 204B that can decode 8K2K encoded data in real time, demultiplexing section 203 sends 8K4K encoded data to decoding section 204A. Outputs two of the four slice segments that make up the segment.

図１２は、図８に示すようにＭＭＴパケット化されたデータが、２つの復号部２０４Ａ及び２０４Ｂに入力される場合の動作例を示す図である。ここで、受信装置２００は、復号部２０４Ａ及び２０４Ｂにおける復号結果を、そのまま統合して出力できることが望ましい。よって、逆多重化部２０３は、復号部２０４Ａ及び２０４Ｂの各々の復号結果が空間的に連続するように、復号部２０４Ａ及び２０４Ｂの各々に出力するスライスセグメントを選択する。 FIG. 12 is a diagram illustrating an example of the operation when the MMT packetized data as shown in FIG. 8 is input to two decoding units 204A and 204B. Here, it is desirable that the receiving device 200 be able to integrate and output the decoding results in the decoding sections 204A and 204B as they are. Therefore, demultiplexing section 203 selects slice segments to be output to each of decoding sections 204A and 204B so that the decoding results of each of decoding sections 204A and 204B are spatially continuous.

また、逆多重化部２０３は、動画像の符号化データの解像度又はフレームレートなどに応じて、使用する復号部を選択してもよい。例えば、受信装置２００が４Ｋ２Ｋの復号部を４つ備える場合には、入力画像の解像度が８Ｋ４Ｋであれば、受信装置２００は、４つ全ての復号部を用いて復号処理を行う。また、受信装置２００は、入力画像の解像度が４Ｋ２Ｋであれば１つの復号部のみを用いて復号処理を行う。または、逆多重化部２０３は、面内が４つに分割されていても、８Ｋ４Ｋを単一の復号部により実時間で復号できる場合には、全ての分割単位を統合して一つの復号部に出力する。 Further, the demultiplexing unit 203 may select a decoding unit to be used depending on the resolution or frame rate of the encoded data of the moving image. For example, in the case where the receiving device 200 includes four 4K2K decoding units, if the resolution of the input image is 8K4K, the receiving device 200 performs the decoding process using all four decoding units. Furthermore, if the resolution of the input image is 4K2K, the receiving device 200 performs the decoding process using only one decoding unit. Alternatively, even if the frame is divided into four, if 8K4K can be decoded in real time by a single decoding unit, the demultiplexing unit 203 integrates all division units into one decoding unit. Output to.

さらに、受信装置２００は、フレームレートを考慮して使用する復号部を決定してもよい。例えば、受信装置２００が、解像度が８Ｋ４Ｋである場合に実時間で復号可能なフレームレートの上限が６０ｆｐｓである復号部を２台備える場合に、８Ｋ４Ｋで１２０ｆｐｓの符号化データが入力されるケースがある。このとき、面内が４つの分割単位から構成されるとすると、図１２の例と同様に、スライスセグメント１とスライスセグメント２とが復号部２０４Ａに入力され、スライスセグメント３とスライスセグメント４とが復号部２０４Ｂに入力される。各々の復号部２０４Ａ及び２０４Ｂは、８Ｋ２Ｋ（解像度が８Ｋ４Ｋの半分）であれば１２０ｆｐｓまで実時間で復号できるため、これら２台の復号部２０４Ａ及び２０４Ｂにより復号処理が行われる。 Furthermore, receiving device 200 may decide which decoder to use in consideration of the frame rate. For example, if the receiving device 200 is equipped with two decoders whose upper limit of the frame rate that can be decoded in real time is 60 fps when the resolution is 8K4K, there is a case where encoded data of 120 fps at 8K4K is input. be. At this time, assuming that the in-plane is composed of four division units, slice segment 1 and slice segment 2 are input to the decoding unit 204A, and slice segment 3 and slice segment 4 are input to the decoding unit 204A, as in the example of FIG. The data is input to the decoding unit 204B. Since each of the decoding units 204A and 204B can decode in real time up to 120 fps for 8K2K (resolution is half of 8K4K), decoding processing is performed by these two decoding units 204A and 204B.

また、解像度及びフレームレートが同一であっても、符号化方式におけるプロファイル、或いはレベル、又は、Ｈ．２６４或いはＨ．２６５など符号化方式自体が異なると処理量が異なる。よって、受信装置２００は、これらの情報に基づいて使用する復号部を選択してもよい。なお、受信装置２００は、放送又は通信により受信した符号化データを全て復号することができない場合、又は、ユーザーが選択した領域を構成する全てのスライスセグメント又はタイルが復号できない場合には、復号部の処理範囲内で復号可能なスライスセグメント又はタイルを自動的に決定してもよい。または、受信装置２００は、ユーザーが復号する領域を選択するためのユーザインタフェースを提供してもよい。このとき、受信装置２００は、全て領域を復号できないことを示す警告メッセージを表示してもよいし、復号可能な領域、スライスセグメント又はタイルの個数を示す情報を表示してもよい。 Furthermore, even if the resolution and frame rate are the same, the profile or level of the encoding method, or the H. 264 or H. If the encoding method itself is different, such as H.265, the processing amount will be different. Therefore, receiving device 200 may select a decoding unit to use based on this information. Note that if the receiving device 200 cannot decode all the encoded data received through broadcasting or communication, or if it cannot decode all the slice segments or tiles that make up the area selected by the user, the receiving device 200 may Slice segments or tiles that can be decoded within the processing range of the decoder may be automatically determined. Alternatively, the receiving device 200 may provide a user interface for the user to select an area to decode. At this time, the receiving device 200 may display a warning message indicating that all regions cannot be decoded, or may display information indicating the number of decodable regions, slice segments, or tiles.

また、上記方法は、同一符号化データのスライスセグメントを格納するＭＭＴパケットが、放送及び通信など複数の伝送路を用いて送信及び受信される場合にも適用できる。 Furthermore, the above method can also be applied when MMT packets storing slice segments of the same encoded data are transmitted and received using multiple transmission paths such as broadcasting and communication.

また、送信装置１００は、分割単位の境界を目立たなくするために、各スライスセグメントの領域がオーバーラップするように符号化を行ってもよい。図１３に示す例では、８Ｋ４Ｋのピクチャが４つのスライスセグメント１～４に分割される。スライスセグメント１～３の各々は、例えば、８Ｋ×１．１Ｋであり、スライスセグメント４は８Ｋ×１Ｋである。また、隣接するスライスセグメントは互いにオーバーラップする。こうすることで、点線で示す４分割した場合の境界においては、符号化時の動き補償が効率的に実行できるため、境界部分の画質が向上する。このように、境界部分の画質劣化が低減される。 Further, the transmitting device 100 may perform encoding so that the regions of each slice segment overlap in order to make the boundaries between division units less noticeable. In the example shown in FIG. 13, an 8K4K picture is divided into four slice segments 1 to 4. Each of slice segments 1 to 3 is, for example, 8K×1.1K, and slice segment 4 is 8K×1K. Also, adjacent slice segments overlap each other. By doing so, motion compensation during encoding can be efficiently executed at the boundaries between the four divisions shown by the dotted lines, so that the image quality at the boundaries is improved. In this way, image quality deterioration at the boundary portion is reduced.

この場合、表示部２０５は、８Ｋ×１．１Ｋの領域から、８Ｋ×１Ｋの領域を切り出し、得られた領域を統合する。なお、送信装置１００は、スライスセグメントがオーバーラップして符号化されているかどうか、及び、オーバーラップの範囲を示す情報を、多重化レイヤ、又は、符号化データ内に含めて、別途送信してもよい。 In this case, the display unit 205 cuts out an 8K×1K area from the 8K×1.1K area and integrates the obtained areas. Note that the transmitting device 100 includes information indicating whether the slice segments are encoded in an overlapping manner and the range of overlap in the multiplex layer or encoded data, and separately transmits the information. Good too.

なお、タイルが使用される場合にも、同様の手法を適用可能である。 Note that a similar method can be applied even when tiles are used.

以下、送信装置１００の動作の流れを説明する。図１４は、送信装置１００の動作例を示すフローチャートである。 The flow of the operation of the transmitting device 100 will be described below. FIG. 14 is a flowchart illustrating an example of the operation of the transmitting device 100.

まず、符号化部１０１は、ピクチャ（アクセスユニット）を複数の領域である複数のスライスセグメント（タイル）に分割する（Ｓ１０１）。次に、符号化部１０１は、複数のスライスセグメントの各々を独立して復号が可能なように符号化することで、複数のスライスセグメントの各々に対応する符号化データを生成する（Ｓ１０２）。なお、符号化部１０１は、複数のスライスセグメントを単一の符号化部で符号化してもよし、複数の符号化部で並列処理してもよい。 First, the encoding unit 101 divides a picture (access unit) into a plurality of slice segments (tiles) that are a plurality of regions (S101). Next, the encoding unit 101 generates encoded data corresponding to each of the plurality of slice segments by encoding each of the plurality of slice segments so that each of the plurality of slice segments can be decoded independently (S102). Note that the encoding unit 101 may encode a plurality of slice segments using a single encoding unit, or may perform parallel processing using a plurality of encoding units.

次に、多重化部１０２は、符号化部１０１で生成された複数の符号化データを、複数のＭＭＴパケットに格納することで、複数の符号化データを多重化する（Ｓ１０３）。具体的には、図８及び図９に示すように、多重化部１０２は、一つのＭＭＴパケットに、異なるスライスセグメントに対応する符号化データが格納されないように、複数の符号化データを複数のＭＭＴパケットに格納する。また、多重化部１０２は、図８に示すように、ピクチャ内の全ての復号単位に対して共通に用いられる制御情報を、複数の符号化データが格納される複数のＭＭＴパケット＃２～＃５とは異なるＭＭＴパケット＃１に格納する。ここで制御情報は、アクセスユニットデリミタ、ＳＰＳ，ＰＰＳ及びＳＥＩのうち少なくとも一つを含む。 Next, the multiplexing unit 102 multiplexes the plurality of encoded data by storing the plurality of encoded data generated by the encoding unit 101 in a plurality of MMT packets (S103). Specifically, as shown in FIGS. 8 and 9, the multiplexing unit 102 combines a plurality of encoded data into a plurality of encoded data so that encoded data corresponding to different slice segments is not stored in one MMT packet. Store in MMT packet. Furthermore, as shown in FIG. 8, the multiplexing unit 102 transfers control information commonly used for all decoding units in a picture to multiple MMT packets #2 to ## in which multiple encoded data are stored. 5 is stored in MMT packet #1 different from MMT packet #5. Here, the control information includes at least one of an access unit delimiter, SPS, PPS, and SEI.

なお、多重化部１０２は、制御情報を、複数の符号化データが格納される複数のＭＭＴパケットのいずれかと同じＭＭＴパケットに格納してもよい。例えば、図９に示すように、多重化部１０２は、制御情報を、複数の符号化データが格納される複数のＭＭＴパケットのうちの先頭のＭＭＴパケット（図９のＭＭＴパケット＃１）に格納してもよい。 Note that the multiplexing unit 102 may store the control information in the same MMT packet as any of the multiple MMT packets in which multiple pieces of encoded data are stored. For example, as shown in FIG. 9, the multiplexing unit 102 stores the control information in the first MMT packet (MMT packet #1 in FIG. 9) among the multiple MMT packets in which multiple pieces of encoded data are stored. You may.

最後に、送信装置１００は、複数のＭＭＴパケットを送信する。具体的には、変調部１０３は、多重化により得られたデータを変調し、送信部１０４は、変調後のデータを送信する（Ｓ１０４）。 Finally, transmitting device 100 transmits a plurality of MMT packets. Specifically, modulating section 103 modulates the data obtained by multiplexing, and transmitting section 104 transmits the modulated data (S104).

図１５は、受信装置２００の構成例を示すブロック図であり、図７に示す逆多重化部２０３及びその後段の構成を詳細に示す図である。図１５に示すように、受信装置２００は、さらに、復号命令部２０６を備える。また、逆多重化部２０３は、タイプ判別部２１１と、制御情報取得部２１２と、スライス情報取得部２１３と、復号データ生成部２１４とを備える。 FIG. 15 is a block diagram showing an example of the configuration of the receiving device 200, and is a diagram showing in detail the configuration of the demultiplexer 203 shown in FIG. 7 and the subsequent stage. As shown in FIG. 15, receiving device 200 further includes a decoding command section 206. Further, the demultiplexing section 203 includes a type determining section 211 , a control information acquiring section 212 , a slice information acquiring section 213 , and a decoded data generating section 214 .

以下、受信装置２００の動作の流れを説明する。図１６は、受信装置２００の動作例を示すフローチャートである。ここでは、１つのアクセスユニットに対する動作を示す。複数のアクセスユニットの復号処理が実行される場合には、本フローチャートの処理が繰り返される。 The flow of the operation of the receiving device 200 will be described below. FIG. 16 is a flowchart illustrating an example of the operation of the receiving device 200. Here, the operation for one access unit is shown. When decoding processing for multiple access units is executed, the processing in this flowchart is repeated.

まず、受信装置２００は、は、例えば、送信装置１００により生成された複数のパケット（ＭＭＴパケット）を受信する（Ｓ２０１）。 First, the receiving device 200 receives, for example, a plurality of packets (MMT packets) generated by the transmitting device 100 (S201).

次に、タイプ判別部２１１は、受信パケットのヘッダを解析することで、受信パケットに格納されている符号化データのタイプを取得する（Ｓ２０２）。 Next, the type determination unit 211 acquires the type of encoded data stored in the received packet by analyzing the header of the received packet (S202).

次に、タイプ判別部２１１は、取得された符号化データのタイプに基づき、受信パケットに格納されているデータがスライスセグメント前データであるか、スライスセグメントのデータであるかを判定する（Ｓ２０３）。 Next, the type determination unit 211 determines whether the data stored in the received packet is pre-slice segment data or slice segment data based on the type of the acquired encoded data (S203). .

受信パケットに格納されているデータがスライスセグメント前データである場合（Ｓ２０３でＹｅｓ）、制御情報取得部２１２は、受信パケットのペイロードから処理対象のアクセスユニットのスライスセグメント前データを取得し、当該スライスセグメント前データをメモリに格納する（Ｓ２０４）。 If the data stored in the received packet is pre-slice segment data (Yes in S203), the control information acquisition unit 212 obtains the pre-slice segment data of the access unit to be processed from the payload of the received packet, and The pre-segment data is stored in memory (S204).

一方、受信パケットに格納されているデータがスライスセグメントのデータである場合（Ｓ２０３でＮｏ）、受信装置２００は、受信パケットのヘッダ情報を用いて、当該受信パケットに格納されているデータが、複数の領域のうちいずれの領域の符号化データであるかを判定する。具体的には、スライス情報取得部２１３は、受信パケットのヘッダを解析することで、受信パケットに格納されているスライスセグメントのインデックス番号Ｉｄｘを取得する（Ｓ２０５）。具体的には、インデックス番号Ｉｄｘは、アクセスユニット（ＭＭＴにおけるサンプル）のＭｏｖｉｅＦｒａｇｍｅｎｔ内におけるインデックス番号である。 On the other hand, if the data stored in the received packet is slice segment data (No in S203), the receiving device 200 uses the header information of the received packet to determine whether the data stored in the received packet is multiple segment data. It is determined which region of the regions the encoded data belongs to. Specifically, the slice information acquisition unit 213 acquires the index number Idx of the slice segment stored in the received packet by analyzing the header of the received packet (S205). Specifically, the index number Idx is an index number within the Movie Fragment of the access unit (sample in MMT).

なお、このステップＳ２０５の処理は、ステップＳ２０２においてまとめて行われてもよい。 Note that the processing in step S205 may be performed all at once in step S202.

次に、復号データ生成部２１４は、当該スライスセグメントを復号する復号部を決定する（Ｓ２０６）。具体的には、インデックス番号Ｉｄｘと複数の復号部とは予め対応付けられており、復号データ生成部２１４は、ステップＳ２０５で取得されたインデックス番号Ｉｄｘに対応する復号部を、当該スライスセグメントを復号する復号部を決定する。 Next, the decoded data generation unit 214 determines a decoding unit that decodes the slice segment (S206). Specifically, the index number Idx and a plurality of decoding units are associated in advance, and the decoded data generation unit 214 causes the decoding unit corresponding to the index number Idx acquired in step S205 to decode the slice segment. Decide which decoder to use.

なお、復号データ生成部２１４は、図１２の例において説明したように、アクセスユニット（ピクチャ）の解像度、アクセスユニットの複数のスライスセグメント（タイル）への分割方法、及び受信装置２００が備える複数の復号部の処理能力の少なくとも一つに基づき、当該スライスセグメントを復号する復号部を決定してもよい。例えば、復号データ生成部２１４は、アクセスユニットの分割方法を、ＭＭＴのメッセージ、又はＴＳのセクションなどのデスクリプタにおける識別情報に基づいて判別する。 Note that, as explained in the example of FIG. The decoding unit that decodes the slice segment may be determined based on at least one of the processing capabilities of the decoding unit. For example, the decoded data generation unit 214 determines the access unit division method based on identification information in a descriptor such as an MMT message or a TS section.

次に、復号データ生成部２１４は、複数のパケットのいずれかに含まれる、ピクチャ内の全ての復号単位に対して共通に用いられる制御情報と、複数のスライスセグメントの複数の符号化データの各々とを結合することで、複数の復号部へ入力される複数の入力データ（結合データ）を生成する。具体的には、復号データ生成部２１４は、受信パケットのペイロードからスライスセグメントのデータを取得する。復号データ生成部２１４は、ステップＳ２０４でメモリに格納されたスライスセグメント前データと、取得されたスライスセグメントのデータとを結合することで、ステップＳ２０６で決定された復号部への入力データを生成する（Ｓ２０７）。 Next, the decoded data generation unit 214 generates control information commonly used for all decoding units in a picture, which is included in any of the plurality of packets, and each of the plurality of encoded data of the plurality of slice segments. By combining these, a plurality of input data (combined data) to be input to a plurality of decoders is generated. Specifically, the decoded data generation unit 214 obtains slice segment data from the payload of the received packet. The decoded data generation unit 214 generates the input data to the decoding unit determined in step S206 by combining the pre-slice segment data stored in the memory in step S204 and the acquired slice segment data. (S207).

ステップＳ２０４又はＳ２０７の後、受信パケットのデータがアクセスユニットの最終データでない場合（Ｓ２０８でＮｏ）、ステップＳ２０１以降の処理が再度行われる。つまり、アクセスユニットに含まれる全てのスライスセグメントに対応する、複数の復号部２０４Ａ～２０４Ｄへの入力データが生成されるまで、上記処理が繰り返される。 After step S204 or S207, if the data of the received packet is not the final data of the access unit (No in S208), the processes from step S201 onwards are performed again. That is, the above process is repeated until input data to the plurality of decoding units 204A to 204D corresponding to all slice segments included in the access unit is generated.

なお、パケットが受信されるタイミングは、図１６に示すタイミングに限らず、予め又は順次複数のパケットが受信され、メモリ等に格納されてもよい。 Note that the timing at which the packets are received is not limited to the timing shown in FIG. 16, and a plurality of packets may be received in advance or sequentially and stored in a memory or the like.

一方、受信パケットのデータがアクセスユニットの最終データである場合（Ｓ２０８でＹｅｓ）、復号命令部２０６は、ステップＳ２０７で生成された、複数の入力データを、対応する復号部２０４Ａ～２０４Ｄへ出力する（Ｓ２０９）。 On the other hand, if the data of the received packet is the final data of the access unit (Yes in S208), the decoding instruction unit 206 outputs the plurality of input data generated in step S207 to the corresponding decoding units 204A to 204D. (S209).

次に、複数の復号部２０４Ａ～２０４Ｄは、アクセスユニットのＤＴＳに従い、複数の入力データを並列に復号することで、複数の復号画像を生成する（Ｓ２１０）。 Next, the plurality of decoding units 204A to 204D generate a plurality of decoded images by decoding the plurality of input data in parallel according to the DTS of the access unit (S210).

最後に、表示部２０５は、複数の復号部２０４Ａ～２０４Ｄで生成された複数の復号画像を結合することで表示画像を生成し、アクセスユニットのＰＴＳに従い当該表示画像を表示する（Ｓ２１１）。 Finally, the display unit 205 generates a display image by combining the plurality of decoded images generated by the plurality of decoding units 204A to 204D, and displays the display image according to the PTS of the access unit (S211).

なお、受信装置２００は、アクセスユニットのＤＴＳ及びＰＴＳを、ＭＰＵのヘッダ情報、又は、ＭｏｖｉｅＦｒａｇｍｅｎｔのヘッダ情報を格納するＭＭＴパケットのペイロードデータを解析することにより取得する。また、受信装置２００は、多重化方式としてＴＳが使用されている場合にはＰＥＳパケットのヘッダからアクセスユニットのＤＴＳ及びＰＴＳを取得する。受信装置２００は、多重化方式としてＲＴＰが使用されている場合にはＲＴＰパケットのヘッダからアクセスユニットのＤＴＳ及びＰＴＳを取得する。 Note that the receiving device 200 obtains the DTS and PTS of the access unit by analyzing payload data of an MMT packet that stores MPU header information or Movie Fragment header information. Furthermore, when TS is used as the multiplexing method, the receiving device 200 acquires the DTS and PTS of the access unit from the header of the PES packet. If RTP is used as the multiplexing method, the receiving device 200 obtains the DTS and PTS of the access unit from the header of the RTP packet.

また、表示部２０５は、複数の復号部の復号結果を統合する際に、隣接する分割単位の境界においてデブロックフィルタなどのフィルタ処理を行ってもよい。なお、単一の復号部の復号結果を表示する場合にはフィルタ処理は不要であるため、表示部２０５は、複数の復号部の復号結果の境界にフィルタ処理を行うかどうかに応じて処理を切替えてもよい。フィルタ処理が必要かどうかは、分割の有無などに応じて予め規定されていてもよい。または、フィルタ処理が必要かどうかを示す情報が、多重化レイヤに別途格納されてもよい。また、フィルタ係数などフィルタ処理に必要な情報は、ＳＰＳ、ＰＰＳ、ＳＥＩ、又はスライスセグメント内に格納される場合がある。復号部２０４Ａ～２０４Ｄ、又は逆多重化部２０３がＳＥＩを解析することによりこれらの情報を取得し、取得された情報を表示部２０５に出力する。表示部２０５は、これらの情報を用いてフィルタ処理を行う。なお、これらの情報がスライスセグメント内に格納される場合には、復号部２０４Ａ～２０４Ｄがこれらの情報を取得することが望ましい。 Further, when integrating the decoding results of the plurality of decoding units, the display unit 205 may perform filter processing such as a deblocking filter on the boundary between adjacent division units. Note that filtering is not necessary when displaying the decoding results of a single decoding unit, so the display unit 205 performs processing depending on whether or not to perform filtering on the boundaries of the decoding results of multiple decoding units. You may switch. Whether filter processing is necessary may be determined in advance depending on whether or not there is division. Alternatively, information indicating whether filter processing is necessary may be stored separately in the multiplexing layer. Additionally, information necessary for filter processing, such as filter coefficients, may be stored in the SPS, PPS, SEI, or slice segment. The decoding units 204A to 204D or the demultiplexing unit 203 acquire this information by analyzing the SEI, and output the acquired information to the display unit 205. The display unit 205 performs filter processing using this information. Note that when these pieces of information are stored in slice segments, it is desirable that the decoding units 204A to 204D acquire these pieces of information.

なお、上記説明では、フラグメントに格納されるデータの種類がスライスセグメント前データとスライスセグメントとの２種類である場合の例を示したが、データの種類は３種類以上であってもよい。この場合には、ステップＳ２０３においてタイプに応じた場合分けが行われる。 Note that in the above description, an example was given in which there are two types of data stored in a fragment: pre-slice segment data and slice segment data, but there may be three or more types of data. In this case, the cases are divided according to the type in step S203.

また、送信装置１００は、スライスセグメントのデータサイズが大きい場合にスライスセグメントをフラグメント化してＭＭＴパケットに格納してもよい。つまり、送信装置１００は、スライスセグメント前データ及びスライスセグメントをフラグメント化してもよい。この場合に、図１１に示したパケット化の例のようにアクセスユニットとＤａｔａｕｎｉｔとを等しく設定すると以下の問題が生じる。 Furthermore, when the data size of the slice segment is large, the transmitting device 100 may fragment the slice segment and store it in an MMT packet. That is, the transmitting device 100 may fragment the pre-slice segment data and the slice segment. In this case, if the access unit and data unit are set equally as in the packetization example shown in FIG. 11, the following problem will occur.

例えばスライスセグメント１が３つのフラグメントに分割される場合、スライスセグメント１がＦｒａｇｍｅｎｔｃｏｕｎｔｅｒ値が１から３の３つのパケットに分割して送信される。また、スライスセグメント２以降では、Ｆｒａｇｍｅｎｔｃｏｕｎｔｅｒ値が４以上となり、Ｆｒａｇｍｅｎｔｃｏｕｎｔｅｒの値とペイロードに格納されるデータとの対応付けが取れなくなる。従って、受信装置２００は、ＭＭＴパケットのヘッダの情報から、スライスセグメントの先頭データを格納するパケットを特定できない。 For example, when slice segment 1 is divided into three fragments, slice segment 1 is divided into three packets with Fragment counter values of 1 to 3 and transmitted. In addition, after the slice segment 2, the Fragment counter value becomes 4 or more, and the Fragment counter value cannot be associated with the data stored in the payload. Therefore, the receiving device 200 cannot identify the packet that stores the first data of the slice segment from the header information of the MMT packet.

このような場合には、受信装置２００は、ＭＭＴパケットのペイロードのデータを解析して、スライスセグメントの開始位置を特定してもよい。ここで、Ｈ．２６４又はＨ．２６５においてＮＡＬユニットを多重化レイヤに格納する形式として、ＮＡＬユニットヘッダの直前に特定のビット列からなるスタートコードが付加されるバイトストリームフォーマットと呼ばれる形式と、ＮＡＬユニットのサイズを示すフィールドが付加されるＮＡＬサイズフォーマットと呼ばれる形式との２種類がある。 In such a case, the receiving device 200 may identify the start position of the slice segment by analyzing the payload data of the MMT packet. Here, H. 264 or H. In H.265, the format for storing NAL units in the multiplex layer is a format called byte stream format in which a start code consisting of a specific bit string is added immediately before the NAL unit header, and a field indicating the size of the NAL unit is added. There are two types: a format called a NAL size format.

バイトストリームフォーマットは、ＭＰＥＧ－２システム及びＲＴＰなどにおいて用いられる。ＮＡＬサイズフォーマットは、ＭＰ４、並びにＭＰ４を使用するＤＡＳＨ及びＭＭＴなどにおいて用いられる。 The byte stream format is used in MPEG-2 systems, RTP, etc. The NAL size format is used in MP4, DASH and MMT, etc. that use MP4.

バイトストリームフォーマットが用いられる場合、受信装置２００は、パケットの先頭データがスタートコードと一致するかどうかを解析する。受信装置２００は、パケットの先頭データがスタートコードと一致していれば、その後に続くＮＡＬユニットヘッダからＮＡＬユニットのタイプを取得することで、当該パケットに含まれるデータがスライスセグメントのデータであるかどうかを検出できる。 When the byte stream format is used, the receiving device 200 analyzes whether the leading data of the packet matches the start code. If the first data of the packet matches the start code, the receiving device 200 determines whether the data included in the packet is slice segment data by acquiring the NAL unit type from the NAL unit header that follows. can detect whether

一方、ＮＡＬサイズフォーマットの場合には、受信装置２００は、ビット列に基づいてＮＡＬユニットの開始位置を検出できない。従って、受信装置２００は、ＮＡＬユニットの開始位置を取得するために、アクセスユニットの先頭ＮＡＬユニットから順に、ＮＡＬユニットのサイズ分だけデータの読出すことでポインタをシフトさせていく必要がある。 On the other hand, in the case of the NAL size format, the receiving device 200 cannot detect the start position of the NAL unit based on the bit string. Therefore, in order to obtain the starting position of the NAL unit, the receiving device 200 needs to shift the pointer by reading data by the size of the NAL unit in order from the first NAL unit of the access unit.

但し、ＭＭＴにおけるＭＰＵ又はＭｏｖｉｅＦｒａｇｍｅｎｔのヘッダにおいて、サブサンプル単位のサイズが示され、サブサンプルがスライス前データ又はスライスセグメントに対応する場合には、受信装置２００は、サブサンプルのサイズ情報に基づいて各ＮＡＬユニットの開始位置を特定できる。そのため、送信装置１００は、サブサンプル単位の情報がＭＰＵ又はＭｏｖｉｅＦｒａｇｍｅｎｔに存在するかどうかを示す情報を、ＭＭＴにおけるＭＰＴなどの、受信装置２００がデータの受信開始時に取得する情報に含めてもよい。 However, in the header of the MPU or Movie Fragment in MMT, the size of the subsample unit is indicated, and if the subsample corresponds to pre-slice data or a slice segment, the receiving device 200 uses the subsample size information based on the size information of the subsample. The starting position of each NAL unit can be specified. Therefore, the transmitting device 100 may include information indicating whether subsample unit information exists in the MPU or Movie Fragment in the information that the receiving device 200 acquires when starting data reception, such as MPT in MMT. .

なお、ＭＰＵのデータはＭＰ４フォーマットをベースに拡張したものである。ＭＰ４においては、Ｈ．２６４又はＨ．２６５のＳＰＳ及びＰＰＳなどのパラメータセットをサンプルデータとして格納可能なモードと、格納できないモードとがある。また、このモードを特定するための情報がＳａｍｐｌｅＥｎｔｒｙのエントリ名として示される。格納可能なモードが用いられており、パラメータセットがサンプルに含まれる場合には、受信装置２００は、上述した方法によりパラメータセットを取得する。 Note that the MPU data is based on and expanded from the MP4 format. In MP4, H. 264 or H. There are two modes: a mode in which parameter sets such as H.265 SPS and PPS can be stored as sample data, and a mode in which they cannot be stored. Further, information for specifying this mode is shown as an entry name of SampleEntry. If the storable mode is used and the parameter set is included in the sample, the receiving device 200 obtains the parameter set using the method described above.

一方、格納できないモードが用いられている場合には、パラメータセットは、ＳａｍｐｌｅＥｎｔｒｙ内のＤｅｃｏｄｅｒＳｐｅｃｉｆｉｃＩｎｆｏｒｍａｔｉｏｎとして格納される、又は、パラメータセット用のストリームを用いて格納される。ここで、パラメータセット用のストリームは一般的には使用されていないので、送信装置１００は、ＤｅｃｏｄｅｒＳｐｅｃｉｆｉｃＩｎｆｏｒｍａｔｉｏｎにパラメータセットを格納することが望ましい。この場合には、受信装置２００は、ＭＭＴパケットにおいてＭＰＵのメタデータ、又は、ＭｏｖｉｅＦｒａｇｍｅｎｔのメタデータとしてとして送信されるＳａｍｐｌｅＥｎｔｒｙを解析して、アクセスユニットが参照するパラメータセットを取得する。 On the other hand, when the non-storable mode is used, the parameter set is stored as Decoder Specific Information in SampleEntry or using a stream for parameter sets. Here, since streams for parameter sets are not generally used, it is desirable for the transmitting device 100 to store the parameter sets in Decoder Specific Information. In this case, the receiving device 200 analyzes the SampleEntry transmitted as MPU metadata or Movie Fragment metadata in the MMT packet, and obtains the parameter set referenced by the access unit.

パラメータセットがサンプルデータとして格納される場合には、受信装置２００は、ＳａｍｐｌｅＥｎｔｒｙを参照せずにサンプルデータのみを参照すれば復号に必要なパラメータセットが取得できる。このとき、送信装置１００は、ＳａｍｐｌｅＥｎｔｒｙにパラメータセットを格納しなくてもよい。こうすることで、送信装置１００は、異なるＭＰＵにおいて同一のＳａｍｐｌｅＥｎｔｒｙを用いることができるので、ＭＰＵ生成時の送信装置１００の処理負荷を低減できる。さらに、受信装置２００がＳａｍｐｌｅＥｎｔｒｙ内のパラメータセットを参照する必要がなくなるというメリットがある。 When the parameter set is stored as sample data, the receiving device 200 can obtain the parameter set necessary for decoding by referring only to the sample data without referring to SampleEntry. At this time, the transmitting device 100 does not need to store the parameter set in SampleEntry. By doing so, the transmitting device 100 can use the same SampleEntry in different MPUs, thereby reducing the processing load on the transmitting device 100 when generating the MPU. Furthermore, there is an advantage that the receiving device 200 does not need to refer to the parameter set in SampleEntry.

または、送信装置１００は、ＳａｍｐｌｅＥｎｔｒｙにデフォルトのパラメータセットを１つ格納し、アクセスユニットが参照するパラメータセットをサンプルデータに格納してもよい。従来のＭＰ４においては、ＳａｍｐｌｅＥｎｔｒｙにパラメータセットを格納するのが一般的であったため、ＳａｍｐｌｅＥｎｔｒｙにパラメータセットが存在しない場合、再生を停止する受信装置が存在する可能性がある。上記の方法を用いることで、この問題を解決できる。 Alternatively, the transmitting device 100 may store one default parameter set in SampleEntry and store the parameter set referred to by the access unit in sample data. In conventional MP4, it is common to store parameter sets in SampleEntry, so if a parameter set does not exist in SampleEntry, there is a possibility that there is a receiving device that stops playback. This problem can be solved by using the above method.

または、送信装置１００は、デフォルトのパラメータセットとは異なるパラメータセットが使用される場合にのみ、サンプルデータにパラメータセットを格納してもよい。 Alternatively, the transmitting device 100 may store a parameter set in the sample data only when a parameter set different from the default parameter set is used.

なお、両モード共に、パラメータセットをＳａｍｐｌｅＥｎｔｒｙに格納することは可能であるため、送信装置１００は、パラメータセットを常にＶｉｓｕａｌＳａｍｐｌｅＥｎｔｒｙに格納し、受信装置２００は常にＶｉｓｕａｌＳａｍｐｌｅＥｎｔｒｙからパラメータセットを取得してもよい。 Note that in both modes, it is possible to store the parameter set in SampleEntry, so the transmitting device 100 may always store the parameter set in VisualSampleEntry, and the receiving device 200 may always obtain the parameter set from VisualSampleEntry.

なお、ＭＭＴ規格においては、Ｍｏｏｖ及びＭｏｏｆなどＭＰ４のヘッダ情報は、ＭＰＵメタデータ、或いはムービーフラグメントメタデータとして伝送されるが、送信装置１００は、ＭＰＵメタデータ、および、ムービーフラグメントメタデータを必ずしも送信しなくてもよい。さらに、受信装置２００は、ＡＲＩＢ（ＡｓｓｏｃｉａｔｉｏｎｏｆＲａｄｉｏＩｎｄｕｓｔｒｉｅｓａｎｄＢｕｓｉｎｅｓｓｅｓ）規格のサービス、アセットのタイプ、又は、ＭＰＵメタの伝送有無などに基づいて、サンプルデータ内にＳＰＳ及びＰＰＳが格納されるかどうかを判定することも可能である。 Note that in the MMT standard, MP4 header information such as Moov and Moof is transmitted as MPU metadata or movie fragment metadata, but the transmitting device 100 does not necessarily transmit MPU metadata and movie fragment metadata. You don't have to. Furthermore, the receiving device 200 determines whether SPS and PPS are stored in the sample data based on the ARIB (Association of Radio Industries and Businesses) standard service, asset type, or whether or not MPU meta is transmitted. It is also possible to judge.

図１７は、スライスセグメント前データ及び各スライスセグメントが、それぞれ異なるＤａｔａｕｎｉｔに設定される場合の例を示す図である。 FIG. 17 is a diagram illustrating an example in which pre-slice segment data and each slice segment are set to different data units.

図１７に示す例では、スライスセグメント前データ、及びスライスセグメント１からスライスセグメント４までのデータサイズは、それぞれＬｅｎｇｔｈ＃１からＬｅｎｇｔｈ＃５である。ＭＭＴパケットのヘッダに含まれるＦｒａｇｍｅｎｔａｔｉｏｎｉｎｄｉｃａｔｏｒ、Ｆｒａｇｍｅｎｔｃｏｕｎｔｅｒ、及び、Ｏｆｆｓｅｔの各フィールド値は図中に示す通りである。 In the example shown in FIG. 17, the data sizes of the pre-slice segment data and slice segments 1 to 4 are Length #1 to Length #5, respectively. The field values of Fragmentation indicator, Fragment counter, and Offset included in the header of the MMT packet are as shown in the figure.

ここで、Ｏｆｆｓｅｔは、ペイロードデータが属するサンプル（アクセスユニット又はピクチャ）の符号化データの先頭から、当該ＭＭＴパケットに含まれるペイロードデータ（符号化データ）の先頭バイトまでのビット長（オフセット）を示すオフセット情報である。なお、Ｆｒａｇｍｅｎｔｃｏｕｎｔｅｒの値はフラグメントの総数から１を減算した値から開始するとして説明するが、他の値から開始してもよい。 Here, Offset indicates the bit length (offset) from the beginning of the encoded data of the sample (access unit or picture) to which the payload data belongs to the first byte of the payload data (encoded data) included in the MMT packet. This is offset information. Note that although the value of the Fragment counter will be explained as starting from the value obtained by subtracting 1 from the total number of fragments, it may start from another value.

図１８は、Ｄａｔａｕｎｉｔがフラグメント化される場合の例を示す図である。図１８に示す例では、スライスセグメント１が３つのフラグメントに分割され、それぞれＭＭＴパケット＃２からＭＭＴパケット＃４に格納される。このときも、各フラグメントのデータサイズを、それぞれＬｅｎｇｔｈ＃２＿１からＬｅｎｇｔｈ＃２＿３とすると、各フィールドの値は図中に示す通りである。 FIG. 18 is a diagram illustrating an example in which a Data unit is fragmented. In the example shown in FIG. 18, slice segment 1 is divided into three fragments, and each fragment is stored in MMT packet #2 to MMT packet #4. Also at this time, assuming that the data size of each fragment is Length #2_1 to Length #2_3, the values of each field are as shown in the figure.

このように、スライスセグメントなどのデータ単位がＤａｔａｕｎｉｔに設定される場合、アクセスユニットの先頭、及びスライスセグメントの先頭は、ＭＭＴパケットヘッダのフィールド値に基づいて以下のように決定できる。 In this way, when a data unit such as a slice segment is set to Data unit, the beginning of the access unit and the beginning of the slice segment can be determined as follows based on the field value of the MMT packet header.

Ｏｆｆｓｅｔの値が０であるパケットにおけるペイロードの先頭は、アクセスユニットの先頭である。 The head of the payload in a packet whose Offset value is 0 is the head of the access unit.

Ｏｆｆｓｅｔの値が０とは異なる値であり、かつ、Ｆｒａｇｍｅｎｔａｔｉｏｎｉｎｄｃａｔｏｒｎｏ値が００又は０１であるパケットのペイロードの先頭が、スライスセグメントの先頭である。 The beginning of the payload of a packet whose Offset value is different from 0 and whose Fragmentation index value is 00 or 01 is the beginning of a slice segment.

また、Ｄａｔａｕｎｉｔのフラグメント化が発生せず、パケットロスも発生しない場合には、受信装置２００は、アクセスユニットの先頭を検出した後に取得したスライスセグメントの数に基づいて、ＭＭＴパケットに格納されるスライスセグメントのインデックス番号を特定できる。 Furthermore, when fragmentation of the Data unit does not occur and no packet loss occurs, the receiving device 200 determines the number of slice segments that are stored in the MMT packet based on the number of slice segments acquired after detecting the beginning of the access unit. The index number of a slice segment can be determined.

また、スライスセグメント前データのＤａｔａｕｎｉｔがフラグメント化される場合においても、同様に、受信装置２００は、アクセスユニット、及びスライスセグメントの先頭を検出できる。 Further, even when the Data unit of the pre-slice segment data is fragmented, the receiving device 200 can similarly detect the access unit and the beginning of the slice segment.

また、パケットロスが発生した場合、又は、スライスセグメント前データに含まれるＳＰＳ、ＰＰＳ及びＳＥＩが別々のＤａｔａｕｎｉｔに設定された場合においても、受信装置２００は、ＭＭＴヘッダの解析結果に基づいてスライスセグメントの先頭データを格納したＭＭＴパケットを特定し、その後、スライスセグメントのヘッダを解析することで、ピクチャ（アクセスユニット）内におけるスライスセグメント又はタイルの開始位置を特定できる。スライスヘッダの解析に係る処理量は小さく、処理負荷は問題とならない。 Furthermore, even if a packet loss occurs or if the SPS, PPS, and SEI included in the pre-slice segment data are set to separate data units, the receiving device 200 performs slice processing based on the analysis result of the MMT header. The starting position of a slice segment or tile within a picture (access unit) can be identified by identifying the MMT packet that stores the start data of the segment and then analyzing the header of the slice segment. The amount of processing involved in analyzing the slice header is small, and the processing load is not a problem.

このように、複数のスライスセグメントの複数の符号化データの各々は、１以上のパケットに格納されるデータの単位である基本データ単位（Ｄａｔａｕｎｉｔ）と一対一で対応付けられている。また、複数の符号化データの各々は、１以上のＭＭＴパケットに格納される。 In this way, each of the plurality of encoded data of the plurality of slice segments is in one-to-one correspondence with a basic data unit (Data unit) that is a unit of data stored in one or more packets. Moreover, each of the plurality of encoded data is stored in one or more MMT packets.

各ＭＭＴパケットのヘッダ情報は、Ｆｒａｇｍｅｎｔａｔｉｏｎｉｎｄｉｃａｔｏｒ（識別情報）及びＯｆｆｓｅｔ（オフセット情報）を含む。 The header information of each MMT packet includes Fragmentation indicator (identification information) and Offset (offset information).

受信装置２００は、受信装置２００は、値が００又は０１であるＦｒａｇｍｅｎｔａｔｉｏｎｉｎｄｉｃａｔｏｒが含まれるヘッダ情報を有するパケットに含まれるペイロードデータの先頭を、各スライスセグメントの符号化データの先頭であると判定する。具体的には、値が０でないＯｆｆｓｅｔと、値が００又は０１であるＦｒａｇｍｅｎｔａｔｉｏｎｉｎｄｉｃａｔｏｒとが含まれるヘッダ情報を有するパケットに含まれるペイロードデータの先頭を、各スライスセグメントの符号化データの先頭であると判定する。 The receiving device 200 determines that the beginning of the payload data included in the packet having the header information including the Fragmentation indicator whose value is 00 or 01 is the beginning of the encoded data of each slice segment. . Specifically, the beginning of the payload data included in a packet having header information including an Offset whose value is not 0 and a Fragmentation indicator whose value is 00 or 01 is the beginning of the encoded data of each slice segment. It is determined that

また、図１７の例では、Ｄａｔａｕｎｉｔの先頭は、アクセスユニットの先頭、又は、スライスセグメントの先頭のいずれかであり、Ｆｒａｇｍｅｎｔａｔｉｏｎｉｎｄｉｃａｔｏｒの値は００又は０１である。さらに、受信装置２００は、ＮＡＬユニットのタイプを参照して、ＤａｔａＵｎｉｔの先頭がアクセスユニットデリミタ、又は、スライスセグメントのどちらであるかを判定することで、Ｏｆｆｓｅｔを参照せずに、アクセスユニットの先頭、又は、スライスセグメントの先頭を検出することも可能である。 Further, in the example of FIG. 17, the beginning of the Data unit is either the beginning of the access unit or the beginning of the slice segment, and the value of the Fragmentation indicator is 00 or 01. Furthermore, the receiving device 200 refers to the type of the NAL unit and determines whether the beginning of the Data Unit is an access unit delimiter or a slice segment, thereby determining whether the access unit is It is also possible to detect the beginning or the beginning of a slice segment.

このように、送信装置１００が、ＮＡＬユニットの先頭が必ずＭＭＴパケットのペイロードの先頭から開始されるようにパケット化を行うことで、スライスセグメント前データが複数のＤａｔａｕｎｉｔに分割される場合も含めて、受信装置２００は、Ｆｒａｇｍｅｎｔａｔｉｏｎｉｎｄｉｃａｔｏｒ及びＮＡＬユニットヘッダを解析することにより、アクセスユニット、又は、スライスセグメントの先頭を検出できる。ＮＡＬユニットのタイプは、ＮＡＬユニットヘッダの先頭バイトに存在する。従って、受信装置２００は、ＭＭＴパケットのヘッダ部を解析する際に、追加で１バイト分のデータを解析することによりＮＡＬユニットのタイプが取得できる。オーディオの場合には、受信装置２００は、アクセスユニットの先頭が検出できればよく、Ｆｒａｇｍｅｎｔａｔｉｏｎｉｎｄｉｃａｔｏｒの値が００又は０１であるかどうかに基づいて判定すればよい。 In this way, the transmitting device 100 performs packetization so that the beginning of the NAL unit always starts from the beginning of the payload of the MMT packet, even when the pre-slice segment data is divided into multiple Data units. The receiving device 200 can detect the beginning of an access unit or a slice segment by analyzing the Fragmentation indicator and the NAL unit header. The NAL unit type is present in the first byte of the NAL unit header. Therefore, when analyzing the header portion of the MMT packet, the receiving device 200 can acquire the type of the NAL unit by additionally analyzing 1 byte of data. In the case of audio, the receiving device 200 only needs to be able to detect the beginning of the access unit, and may make a determination based on whether the value of the Fragmentation indicator is 00 or 01.

また、上述したように、分割復号ができるように符号化された符号化データをＭＰＥＧ－２ＴＳのＰＥＳパケットに格納する場合には、送信装置１００は、ｄａｔａａｌｉｇｎｍｅｎｔ記述子を用いることが可能である。以下、符号化データのＰＥＳパケットへの格納方法の例について詳細に説明する。 Furthermore, as described above, when storing encoded data that is encoded so that it can be dividedly decoded in a PES packet of an MPEG-2 TS, the transmitting device 100 can use a data alignment descriptor. be. Hereinafter, an example of a method of storing encoded data in a PES packet will be described in detail.

例えば、ＨＥＶＣにおいては、送信装置１００は、ｄａｔａａｌｉｇｎｍｅｎｔ記述子を用いることにより、ＰＥＳパケットに格納されるデータがアクセスユニット、スライスセグメント、及び、タイルのいずれであるかを示すことができる。ＨＥＶＣにおけるアラインメントのタイプは、次のように規定されている。 For example, in HEVC, the transmitting device 100 can indicate whether the data stored in the PES packet is an access unit, a slice segment, or a tile by using a data alignment descriptor. The alignment types in HEVC are defined as follows.

アラインメントのタイプ＝８は、ＨＥＶＣのスライスセグメントを示す。アラインメントのタイプ＝９は、ＨＥＶＣのスライスセグメント又はアクセスユニットを示す。アラインメントのタイプ＝１２は、ＨＥＶＣのスライスセグメント又はタイルを示す。 Alignment type=8 indicates a HEVC slice segment. Alignment type=9 indicates HEVC slice segment or access unit. Alignment type=12 indicates HEVC slice segment or tile.

よって、送信装置１００は、例えば、タイプ９を用いることで、ＰＥＳパケットのデータがスライスセグメント又はスライスセグメント前データのいずれかであることを示すことができる。スライスセグメントではなく、スライスを示すタイプも別途規定されているため、送信装置１００は、スライスセグメントではなくスライスを示すタイプを使用してもよい。 Therefore, for example, by using type 9, the transmitting device 100 can indicate that the data of the PES packet is either a slice segment or pre-slice segment data. Since a type indicating a slice instead of a slice segment is also separately defined, the transmitting device 100 may use a type indicating a slice instead of a slice segment.

また、ＰＥＳパケットのヘッダに含まれるＤＴＳ及びＰＴＳは、アクセスユニットの先頭データを含むＰＥＳパケットにおいてのみ設定される。従って、受信装置２００は、タイプが９であり、かつ、ＰＥＳパケットにＤＴＳ又はＰＴＳのフィールドが存在すれば、ＰＥＳパケットにはアクセスユニット全体、又は、アクセスユニットにおける先頭の分割単位が格納されると判定できる。 Furthermore, the DTS and PTS included in the header of the PES packet are set only in the PES packet that includes the first data of the access unit. Therefore, if the type is 9 and the DTS or PTS field exists in the PES packet, the receiving device 200 assumes that the entire access unit or the first division unit in the access unit is stored in the PES packet. Can be judged.

また、送信装置１００は、アクセスユニットの先頭データを含むＰＥＳパケットを格納するＴＳパケットの優先度を示すｔｒａｎｓｐｏｒｔ＿ｐｒｉｏｒｉｔｙなどのフィールドを用いて、受信装置２００がパケットに含まれるデータを区別できるようにしてもよい。また、受信装置２００は、ＰＥＳパケットのペイロードがアクセスユニットデリミタであるかどうかを解析することでパケットに含まれるデータを判定してもよい。また、ＰＥＳパケットヘッダのｄａｔａ＿ａｌｉｇｎｍｅｎｔ＿ｉｎｄｉｃａｔｏｒは、これらのタイプに従ってＰＥＳパケットにデータが格納されているかどうかを示す。このフラグ（ｄａｔａ＿ａｌｉｇｎｍｅｎｔ＿ｉｎｄｉｃａｔｏｒ）が１にセットされていれば、ＰＥＳパケットに格納されているデータはｄａｔａａｌｉｇｎｍｅｎｔ記述子に示されるタイプに従うことが保証される。 Further, the transmitting device 100 may enable the receiving device 200 to distinguish the data included in the packet by using a field such as transport_priority that indicates the priority of the TS packet that stores the PES packet containing the first data of the access unit. good. Further, the receiving device 200 may determine the data included in the packet by analyzing whether the payload of the PES packet is an access unit delimiter. Additionally, data_alignment_indicator in the PES packet header indicates whether data is stored in the PES packet according to these types. If this flag (data_alignment_indicator) is set to 1, it is guaranteed that the data stored in the PES packet follows the type indicated in the data alignment descriptor.

また、送信装置１００は、スライスセグメントなどの分割復号可能な単位でＰＥＳパケット化する場合にのみｄａｔａａｌｉｇｎｍｅｎｔ記述子を使用してもよい。これにより、受信装置２００は、ｄａｔａａｌｉｇｎｍｅｎｔ記述子が存在する場合には、符号化データが分割復号可能な単位でＰＥＳパケット化されていると判断でき、ｄａｔａａｌｉｇｎｍｅｎｔ記述子が存在しなければ、符号化データがアクセスユニット単位でＰＥＳパケット化されていると判断できる。なお、ｄａｔａ＿ａｌｉｇｎｍｅｎｔ＿ｉｎｄｉｃａｔｏｒが１にセットされており、ｄａｔａａｌｉｇｎｍｅｎｔ記述子が存在しない場合には、ＰＥＳパケット化の単位がアクセスユニットであることはＭＰＥＧ－２ＴＳ規格において規定されている。 Further, the transmitting device 100 may use the data alignment descriptor only when converting into PES packets in division-decodable units such as slice segments. Thereby, if the data alignment descriptor exists, the receiving device 200 can determine that the encoded data has been made into PES packets in units that can be divided and decodable, and if the data alignment descriptor does not exist, the receiving device 200 can determine that the encoded data is divided into PES packets. It can be determined that the encoded data is converted into PES packets in units of access units. Note that when the data_alignment_indicator is set to 1 and the data alignment descriptor does not exist, it is specified in the MPEG-2 TS standard that the unit of PES packetization is an access unit.

受信装置２００は、ＰＭＴ内にｄａｔａａｌｉｇｎｍｅｎｔ記述子が含まれていれば、分割復号可能な単位でＰＥＳパケット化されていると判定し、パケット化された単位に基づいて、各復号部への入力データを生成することができる。また、受信装置２００は、ＰＭＴ内にｄａｔａａｌｉｇｎｍｅｎｔ記述子が含まれておらず、番組情報、又はその他の記述子の情報に基づいて、符号化データの並列復号が必要と判定される場合には、スライスセグメントのスライスヘッダなどを解析することにより、各復号部への入力データを生成する。また、符号化データを単一の復号部により復号可能である場合には、受信装置２００は、アクセスユニット全体のデータを当該の復号部で復号する。なお、符号化データがスライスセグメント又はタイルなどの分割復号可能な単位から構成されるかどうかを示す情報が、ＰＭＴの記述子などにより別途示されている場合、受信装置２００は、当該記述子の解析結果に基づいて符号化データを並列復号できるかどうかを判定してもよい。 If the data alignment descriptor is included in the PMT, the receiving device 200 determines that the PES packet has been made into units that can be divided and decoded, and inputs the data to each decoding unit based on the packetized unit. Data can be generated. Furthermore, if the PMT does not include a data alignment descriptor and it is determined that parallel decoding of encoded data is necessary based on program information or other descriptor information, the receiving device 200 , the slice header of the slice segment, etc. to generate input data to each decoding unit. Furthermore, if the encoded data can be decoded by a single decoding unit, the receiving device 200 decodes the data of the entire access unit using the decoding unit. Note that if information indicating whether the encoded data is composed of units that can be divided and decoded such as slice segments or tiles is separately indicated by a PMT descriptor, the receiving device 200 It may be determined whether the encoded data can be decoded in parallel based on the analysis result.

また、ＰＥＳパケットのヘッダに含まれるＤＴＳ及びＰＴＳは、アクセスユニットの先頭データを含むＰＥＳパケットにおいてのみ設定されるため、アクセスユニットが分割されてＰＥＳパケット化される場合には、２番目以降のＰＥＳパケットにはアクセスユニットのＤＴＳ及びＰＴＳを示す情報は含まれない。従って、復号処理を並列に行う場合、各復号部２０４Ａ～２０４Ｄ及び表示部２０５は、アクセスユニットの先頭データを含むＰＥＳパケットのヘッダに格納されるＤＴＳ及びＰＴＳを使用する。 Furthermore, since the DTS and PTS included in the header of the PES packet are set only in the PES packet that includes the first data of the access unit, when the access unit is divided into PES packets, the second and subsequent PES The packet does not include information indicating the DTS and PTS of the access unit. Therefore, when performing decoding processing in parallel, each decoding section 204A to 204D and display section 205 use the DTS and PTS stored in the header of the PES packet containing the first data of the access unit.

（実施の形態２）
実施の形態２では、ＭＭＴにおいて、ＮＡＬサイズフォーマットのデータをＭＰ４フォーマットベースのＭＰＵに格納する方法について説明する。なお、以下では、一例として、ＭＭＴに用いられるＭＰＵへの格納方法について説明するが、このような格納方法は、同じＭＰ４フォーマットベースであるＤＡＳＨにも適用可能である。 (Embodiment 2)
In the second embodiment, a method of storing NAL size format data in an MP4 format-based MPU in MMT will be described. Note that, although a storage method in the MPU used for MMT will be described below as an example, such a storage method can also be applied to DASH, which is based on the same MP4 format.

［ＭＰＵへの格納方法］
ＭＰ４フォーマットでは、複数のアクセスユニットをまとめて、一つのＭＰ４ファイルに格納する。ＭＭＴに用いられるＭＰＵは、メディア毎のデータが一つのＭＰ４ファイルに格納され、データには任意の数のアクセスユニットを含むことができる。ＭＰＵは、単体で復号可能な単位であるため、例えば、ＭＰＵにはＧＯＰ単位のアクセスユニットが格納される。 [How to store in MPU]
In the MP4 format, multiple access units are stored together in one MP4 file. In the MPU used for MMT, data for each medium is stored in one MP4 file, and the data can include an arbitrary number of access units. Since the MPU is a unit that can be decoded by itself, for example, the MPU stores access units in units of GOP.

図１９は、ＭＰＵの構成を示す図である。ＭＰＵの先頭は、ｆｔｙｐ、ｍｍｐｕ、及びｍｏｏｖであり、これらは、まとめてＭＰＵメタデータと定義される。ｍｏｏｖには、ファイルに共通の初期化情報、及びＭＭＴヒントトラックが格納される。 FIG. 19 is a diagram showing the configuration of the MPU. The beginning of the MPU is ftyp, mmpu, and moov, which are collectively defined as MPU metadata. moov stores initialization information common to files and an MMT hint track.

また、ｍｏｏｆには、サンプルやサブサンプル毎の初期化情報及びサイズ、提示時刻（ＰＴＳ）及び復号時刻（ＤＴＳ）を特定できる情報（ｓａｍｐｌｅ＿ｄｕｒａｔｉｏｎ、ｓａｍｐｌｅ＿ｓｉｚｅ、ｓａｍｐｌｅ＿ｃｏｍｐｏｓｉｔｉｏｎ＿ｔｉｍｅ＿ｏｆｆｓｅｔ）、並びにデータの位置を示すｄａｔａ＿ｏｆｆｓｅｔなどが格納される。 Moof also includes initialization information and size for each sample or subsample, information that can specify presentation time (PTS) and decoding time (DTS) (sample_duration, sample_size, sample_composition_time_offset), data_offset that indicates the position of data, etc. But Stored.

また、複数のアクセスユニットは、それぞれサンプルとしてｍｄａｔ（ｍｄａｔｂｏｘ）に格納される。ｍｏｏｆ及びｍｄａｔのうちサンプルを除くデータは、ムービーフラグメントメタデータ（以降では、ＭＦメタデータと記載する。）と定義され、ｍｄａｔのサンプルデータは、メディアデータと定義される。 Further, each of the plurality of access units is stored as a sample in mdat (mdat box). Data excluding samples among moof and mdat is defined as movie fragment metadata (hereinafter referred to as MF metadata), and sample data of mdat is defined as media data.

図２０は、ＭＦメタデータの構成を示す図である。図２０に示されるように、ＭＦメタデータは、より詳細には、ｍｏｏｆｂｏｘ（ｍｏｏｆ）の、ｔｙｐｅ、ｌｅｎｇｔｈ、及びｄａｔａと、ｍｄａｔｂｏｘ（ｍｄａｔ）のｔｙｐｅ及びｌｅｎｇｔｈとからなる。 FIG. 20 is a diagram showing the structure of MF metadata. As shown in FIG. 20, the MF metadata includes, in more detail, the type, length, and data of the moof box (moof), and the type, length, and length of the mdat box (mdat).

アクセスユニットをＭＰ４データに格納する際には、Ｈ．２６４やＨ．２６５のＳＰＳ、及び、ＰＰＳなどのパラメータセットをサンプルデータとして格納可能なモードと、格納できないモードがある。 When storing access units in MP4 data, H. 264 and H. There are two modes: a mode in which parameter sets such as H.265 SPS and PPS can be stored as sample data, and a mode in which they cannot be stored.

ここで、上記格納できないモードにおいては、パラメータセットは、ｍｏｏｖにおけるＳａｍｐｌｅＥｎｔｒｙのＤｅｃｏｄｅｒＳｐｅｃｉｆｉｃＩｎｆｏｒｍａｔｉｏｎに格納される。また、上記格納できるモードにおいては、パラメータセットは、サンプル内に含められる。 Here, in the above storage-unable mode, the parameter set is stored in Decoder Specific Information of SampleEntry in moov. Also, in the storable mode, the parameter set is included within the sample.

ＭＰＵメタデータ、ＭＦメタデータ、及びメディアデータは、それぞれＭＭＴペイロードに格納され、これらのデータを識別可能な識別子として、ＭＭＴペイロードのヘッダには、フラグメントタイプ（ＦＴ）が格納される。ＦＴ＝０は、ＭＰＵメタデータであることを示し、ＦＴ＝１は、ＭＦメタデータであることを示し、ＦＴ＝２はメディアデータであることを示す。 MPU metadata, MF metadata, and media data are each stored in an MMT payload, and a fragment type (FT) is stored in the header of the MMT payload as an identifier that can identify these data. FT=0 indicates MPU metadata, FT=1 indicates MF metadata, and FT=2 indicates media data.

なお、図１９では、ＭＰＵメタデータ単位及びＭＦメタデータ単位がデータユニットとしてＭＭＴペイロードに格納される例が図示されているが、ｆｔｙｐ、ｍｍｐｕ、ｍｏｏｖ、及びｍｏｏｆなどの単位がデータユニットとして、データユニット単位でＭＭＴペイロードに格納されてもよい。同様に、図１９では、サンプル単位がデータユニットとしてＭＭＴペイロードに格納される例が図示されている。しかしながら、サンプル単位やＮＡＬユニット単位でデータユニットが構成され、このようなデータユニットがデータユニット単位でＭＭＴペイロードに格納されてもよい。このようなデータユニットがさらにフラグメントされた単位でＭＭＴペイロードに格納されてもよい。 Note that although FIG. 19 shows an example in which MPU metadata units and MF metadata units are stored as data units in the MMT payload, units such as ftyp, mmpu, moov, and moof are stored as data units. It may be stored in the MMT payload in unit units. Similarly, FIG. 19 illustrates an example in which sample units are stored as data units in the MMT payload. However, a data unit may be configured in units of samples or in units of NAL units, and such data units may be stored in the MMT payload in units of data units. Such data units may be further stored in fragmented units in the MMT payload.

［従来の送信方法と課題］
従来、複数のアクセスユニットをＭＰ４フォーマットにカプセル化する際、ＭＰ４に格納されるサンプルがすべて揃った時点でｍｏｏｖ及びｍｏｏｆが作成されていた。 [Conventional transmission methods and issues]
Conventionally, when encapsulating multiple access units into the MP4 format, moovs and moofs were created when all the samples to be stored in the MP4 were completed.

ＭＰ４フォーマットを放送などを用いてリアルタイムに伝送する場合、例えば１つのＭＰ４ファイルに格納するサンプルがＧＯＰ単位であるとすると、ＧＯＰ単位の時間サンプルが蓄積された後にｍｏｏｖ及びｍｏｏｆが作成されるため、カプセル化に伴う遅延が発生する。このような送信側におけるカプセル化により、Ｅｎｄ－ｔｏ－Ｅｎｄ遅延が常にＧＯＰ単位時間分長くなる。これにより、リアルタイムにサービスの提供を行うことが困難となり、特に、ライブコンテンツが伝送される場合には視聴者に対するサービスの劣化につながる。 When transmitting MP4 format in real time using broadcasting etc., for example, if the samples stored in one MP4 file are in units of GOP, moov and moof are created after time samples in units of GOP are accumulated. There is a delay associated with encapsulation. Such encapsulation at the transmitter always increases the end-to-end delay by a GOP unit of time. This makes it difficult to provide services in real time, leading to deterioration of services for viewers, especially when live content is transmitted.

図２１は、データの送信順序を説明するための図である。ＭＭＴを放送に適用する場合、図２１の（ａ）に示されるように、ＭＰＵの構成順にＭＭＴパケットに載せて送信（ＭＭＴパケット＃１、＃２、＃３、＃４、＃５、＃６の順に送信）すると、ＭＭＴパケットの送信にはカプセル化による遅延が生じる。 FIG. 21 is a diagram for explaining the data transmission order. When applying MMT to broadcasting, as shown in FIG. ), there will be a delay in the transmission of the MMT packet due to encapsulation.

このカプセル化による遅延を防ぐために、図２１の（ｂ）に示されるように、ＭＰＵメタデータ及びＭＦメタデータなどのＭＰＵヘッダ情報を送らない（パケット＃１及び＃２を送信せず、パケット＃３－＃６をこの順に送信する）方法が提案されている。また、図２０の（ｃ）に示されるように、ＭＰＵヘッダ情報の作成を待たずにメディアデータを先に送信し、メディアデータの送信後にＭＰＵヘッダ情報を送信する（＃３－＃６、＃１、＃２の順で送信する）方法が考えられる。 In order to prevent the delay caused by this encapsulation, as shown in FIG. 3-#6 in this order) has been proposed. Also, as shown in FIG. 20(c), the media data is transmitted first without waiting for the creation of the MPU header information, and the MPU header information is transmitted after the media data is transmitted (#3-#6, # A possible method is to transmit the data in the order of #1 and #2.

受信装置は、ＭＰＵヘッダ情報が送信されていない場合、ＭＰＵヘッダ情報を用いずに復号する、また、受信装置は、ＭＰＵヘッダ情報がメディアデータに対して後送りされている場合には、ＭＰＵヘッダ情報の取得を待ってから復号する。 If the MPU header information is not transmitted, the receiving device decodes it without using the MPU header information, and if the MPU header information is sent later than the media data, the receiving device decodes the MPU header information. Wait for information to be obtained before decrypting.

しかしながら、従来のＭＰ４準拠の受信装置では、ＭＰＵヘッダ情報を用いずに復号することが保証されていない。また、受信装置が特別な処理によりＭＰＵヘッダを用いずに復号を行う場合に従来の送信方法を用いると復号処理が煩雑となり、実時間の復号が困難となる可能性が高い。また、受信装置がＭＰＵヘッダ情報の取得を待ってから復号を行う場合には、受信装置がヘッダ情報を取得するまでの間メディアデータのバッファリングが必要であるが、バッファモデルが規定されておらず、復号が保証されていなかった。 However, with conventional MP4-compliant receiving devices, decoding without using MPU header information is not guaranteed. Furthermore, if a conventional transmission method is used when the receiving device performs decoding without using an MPU header through special processing, the decoding process becomes complicated and there is a high possibility that real-time decoding becomes difficult. Additionally, if the receiving device performs decoding after waiting to obtain MPU header information, it is necessary to buffer the media data until the receiving device obtains the header information, but there is no defined buffer model. decryption was not guaranteed.

そこで、実施の形態２に係る送信装置は、図２０の（ｄ）に示されるように、ＭＰＵメタデータに共通の情報のみを格納することで、ＭＰＵメタデータをメディアデータより先に送信する。そして、実施の形態２に係る送信装置は、生成に遅延が発生するＭＦメタデータをメディアデータより後に送信する。これにより、メディアデータの復号を保証できる送信方法或いは受信方法を提供する。 Therefore, the transmitting device according to the second embodiment transmits the MPU metadata before the media data by storing only common information in the MPU metadata, as shown in FIG. 20(d). Then, the transmitting device according to the second embodiment transmits the MF metadata whose generation is delayed after the media data. This provides a transmission method or a reception method that can guarantee decoding of media data.

以下、図２１の（ａ）－（ｄ）の各送信方法を用いた場合の受信方法について説明する。 The reception method using each of the transmission methods shown in FIGS. 21(a) to 21(d) will be described below.

図２１に示される各送信方法では、まず、ＭＰＵメタデータ、ＭＦＵメタデータ、メディアデータの順にＭＰＵデータを構成する。 In each transmission method shown in FIG. 21, first, MPU data is configured in the order of MPU metadata, MFU metadata, and media data.

ＭＰＵデータを構成した後、送信装置が図２１の（ａ）に示されるように、ＭＰＵメタデータ、ＭＦメタデータ、メディアデータの順にデータを送信する場合、受信装置は、下記の（Ａ－１）及び（Ａ－２）のいずれかの方法で復号を行うことができる。 After configuring the MPU data, when the transmitting device transmits the data in the order of MPU metadata, MF metadata, and media data as shown in (a) of FIG. ) and (A-2).

（Ａ－１）受信装置は、ＭＰＵヘッダ情報（ＭＰＵメタデータ及びＭＦメタデータ）を取得後、ＭＰＵヘッダ情報を用いてメディアデータを復号する。 (A-1) After acquiring the MPU header information (MPU metadata and MF metadata), the receiving device decodes the media data using the MPU header information.

（Ａ－２）受信装置は、ＭＰＵヘッダ情報を用いずに、メディアデータを復号する。 (A-2) The receiving device decodes media data without using MPU header information.

このような方法はいずれも、送信側でカプセル化による遅延が発生するが、受信装置において、ＭＰＵヘッダ取得のためにメディアデータをバッファリングする必要がない利点がある。バッファリングをしない場合、バッファリングのためのメモリの搭載の必要はなく、さらにバッファリング遅延は発生しない。また、（Ａ－１）の方法は、ＭＰＵヘッダ情報を用いて復号を行うため、従来の受信装置にも適用可能ある。 In all of these methods, a delay occurs due to encapsulation on the transmitting side, but there is an advantage that the receiving device does not need to buffer the media data in order to obtain the MPU header. If buffering is not performed, there is no need to install memory for buffering, and no buffering delay occurs. Furthermore, since the method (A-1) performs decoding using MPU header information, it is also applicable to conventional receiving devices.

送信装置が図２１の（ｂ）に示されるように、メディアデータのみを送信する場合、受信装置は下記の（Ｂ－１）の方法で復号を行うことができる。 When the transmitting device transmits only media data as shown in FIG. 21(b), the receiving device can perform decoding using the method (B-1) below.

（Ｂ－１）受信装置は、ＭＰＵヘッダ情報を用いずに、メディアデータを復号する。 (B-1) The receiving device decodes media data without using MPU header information.

また、図示しないが、図２１の（ｂ）のメディアデータの送信よりも先にＭＰＵメタデータが送信されている場合、下記の（Ｂ－２）の方法で復号を行うことができる。 Furthermore, although not shown, if MPU metadata is transmitted before the media data is transmitted in FIG. 21(b), decoding can be performed using the method (B-2) below.

（Ｂ－２）受信装置は、ＭＰＵメタデータを用いてメディアデータを復号する。 (B-2) The receiving device decodes the media data using the MPU metadata.

上記（Ｂ－１）及び（Ｂ－２）の方法はいずれも、送信側でカプセル化による遅延が発生せず、かつ、ＭＰＵヘッダ取得のためにメディアデータをバッファリングする必要がない点が利点である。しかしながら、（Ｂ－１）及び（Ｂ－２）の方法はいずれも、ＭＰＵヘッダ情報を用いた復号を行わないため、復号に特別な処理が必要となる可能性がある。 The advantages of both methods (B-1) and (B-2) above are that there is no delay due to encapsulation on the sending side, and there is no need to buffer the media data to obtain the MPU header. It is. However, since both methods (B-1) and (B-2) do not perform decoding using MPU header information, special processing may be required for decoding.

送信装置が図２１の（ｃ）に示されるように、メディアデータ、ＭＰＵメタデータ、ＭＦメタデータの順にデータを送信する場合、受信装置は下記の（Ｃ－１）及び（Ｃ－２）のいずれかの方法で復号を行うことができる。 When the transmitting device transmits data in the order of media data, MPU metadata, and MF metadata as shown in (c) of FIG. 21, the receiving device performs the following (C-1) and (C-2). Decoding can be done using either method.

（Ｃ－１）受信装置は、ＭＰＵヘッダ情報（ＭＰＵメタデータ及びＭＦメタデータ）を取得後、メディアデータを復号する。 (C-1) After acquiring the MPU header information (MPU metadata and MF metadata), the receiving device decodes the media data.

（Ｃ－２）受信装置は、ＭＰＵヘッダ情報を用いずに、メディアデータを復号する。 (C-2) The receiving device decodes the media data without using the MPU header information.

上記（Ｃ－１）の方法が用いられる場合は、ＭＰＵヘッダ情報の取得のためにメディアデータをバッファリングする必要がある。これに対し、上記（Ｃ－２）の方法が用いられる場合は、ＭＰＵヘッダ情報の取得のためのバッファリングを行う必要はない。 When the method (C-1) above is used, it is necessary to buffer the media data in order to obtain the MPU header information. On the other hand, when the above method (C-2) is used, there is no need to perform buffering to obtain MPU header information.

また、上記（Ｃ－１）及び（Ｃ－２）のいずれの方法も、送信側においてカプセル化による遅延は発生しない。また、（Ｃ－２）の方法は、ＭＰＵヘッダ情報を用いないため、特別な処理が必要となる可能性がある。 Furthermore, in both methods (C-1) and (C-2), no delay due to encapsulation occurs on the transmitting side. Furthermore, since the method (C-2) does not use MPU header information, special processing may be required.

送信装置が、図２１の（ｄ）に示されるように、ＭＰＵメタデータ、メディアデータ、ＭＦメタデータの順にデータを送信する場合、受信装置は、下記の（Ｄ－１）及び（Ｄ－２）のいずれかの方法で復号を行うことができる。 When the transmitting device transmits data in the order of MPU metadata, media data, and MF metadata as shown in (d) of FIG. 21, the receiving device transmits data in the following order (D-1) and (D-2). ) can be used to decrypt the file.

（Ｄ－１）受信装置は、ＭＰＵメタデータを取得後、さらにＭＦメタデータを取得し、その後、メディアデータを復号する。 (D-1) After acquiring the MPU metadata, the receiving device further acquires the MF metadata, and then decodes the media data.

（Ｄ－２）受信装置は、ＭＰＵメタデータを取得後、ＭＦメタデータを用いずにメディアデータを復号する。 (D-2) After acquiring the MPU metadata, the receiving device decodes the media data without using the MF metadata.

上記（Ｄ－１）の方法が用いられる場合は、ＭＦメタデータ取得のためにメディアデータをバッファリングする必要があるが、上記（Ｄ－２）の方法の場合は、ＭＦメタデータ取得のためのバッファリングを行う必要はない。 When the method (D-1) above is used, it is necessary to buffer the media data in order to obtain MF metadata, but in the case of method (D-2) above, it is necessary to buffer the media data in order to obtain MF metadata. There is no need to buffer.

上記（Ｄ－２）の方法は、ＭＦメタデータを用いた復号を行わないため、特別な処理が必要となる可能性がある。 Since the above method (D-2) does not perform decoding using MF metadata, special processing may be required.

以上説明したように、ＭＰＵメタデータ及びＭＦメタデータを用いて復号できる場合は、従来のＭＰ４受信装置でも復号できるというメリットがある。 As explained above, if decoding is possible using MPU metadata and MF metadata, there is an advantage that it can be decoded by a conventional MP4 receiving device.

なお、図２１では、ＭＰＵデータは、ＭＰＵメタデータ、ＭＦＵメタデータ、メディアデータの順に構成されており、ｍｏｏｆにおいては、この構成に基づいてサンプルやサブサンプル毎の位置情報（オフセット）が定められている。また、ＭＦメタデータには、ｍｄａｔｂｏｘにおけるメディアデータ以外のデータ（ｂｏｘのサイズやタイプ）も含まれている。 Note that in FIG. 21, the MPU data is configured in the order of MPU metadata, MFU metadata, and media data, and in moof, position information (offset) for each sample or subsample is determined based on this configuration. ing. The MF metadata also includes data other than media data in the mdat box (box size and type).

このため、受信装置がＭＦメタデータに基づいてメディアデータを特定する場合には、受信装置は、データが送信された順番にかかわらず、ＭＰＵデータを構成した際の順番にデータを再構成した後、ＭＰＵメタデータのｍｏｏｖ或いはＭＦメタデータのｍｏｏｆを用いて復号を行う。 Therefore, when the receiving device identifies media data based on MF metadata, the receiving device reconstructs the data in the order in which the MPU data was configured, regardless of the order in which the data was transmitted. , decoding is performed using moov of MPU metadata or moof of MF metadata.

なお、図２１では、ＭＰＵデータは、ＭＰＵメタデータ、ＭＦＵメタデータ、メディアデータの順に構成されるが、図２１とは異なる順番でＭＰＵデータが構成され、位置情報（オフセット）が定められてもよい。 In addition, in FIG. 21, the MPU data is configured in the order of MPU metadata, MFU metadata, and media data, but even if the MPU data is configured in a different order from that in FIG. 21 and the position information (offset) is determined, good.

例えば、ＭＰＵデータがＭＰＵメタデータ、メディアデータ、ＭＦメタデータの順に構成され、ＭＦメタデータにおいて負の位置情報（オフセット）が示されてもよい。この場合も、データが送信される順番にかかわらず、受信装置は、送信側においてＭＰＵデータが構成された際の順番にデータを再構成した後、ｍｏｏｖ或いはｍｏｏｆを用いて復号を行う。 For example, the MPU data may be configured in the order of MPU metadata, media data, and MF metadata, and negative position information (offset) may be indicated in the MF metadata. In this case as well, regardless of the order in which the data is transmitted, the receiving device reconfigures the data in the order in which the MPU data was configured on the transmitting side, and then performs decoding using MOOV or MOof.

なお、送信装置は、ＭＰＵデータを構成する際の順番を示す情報をシグナリングし、受信装置は、シグナリングされた情報に基づいてデータを再構成してもよい。 Note that the transmitting device may signal information indicating the order in which MPU data is configured, and the receiving device may reconfigure the data based on the signaled information.

以上説明したように、受信装置は、図２１の（ｄ）に示されるように、パケット化されたＭＰＵメタデータ、パケット化されたメディアデータ（サンプルデータ）、パケット化されたＭＦメタデータをこの順に受信する。ここで、ＭＰＵメタデータは、第１のメタデータの一例であり、ＭＦメタデータは、第２のメタデータの一例である。 As explained above, the receiving device receives packetized MPU metadata, packetized media data (sample data), and packetized MF metadata as shown in FIG. 21(d). Receive in order. Here, the MPU metadata is an example of first metadata, and the MF metadata is an example of second metadata.

次に、受信装置は、受信されたＭＰＵメタデータ、受信されたＭＦメタデータ、及び受信されたサンプルデータを含むＭＰＵデータ（ＭＰ４フォーマットのファイル）を再構成する。そして、再構成されたＭＰＵデータに含まれるサンプルデータを、ＭＰＵメタデータ及びＭＦメタデータを用いて復号する。ＭＦメタデータは、送信側においてサンプルデータの生成後にのみ生成可能なデータ（例えば、ｍｂｏｘに格納されるｌｅｎｇｔｈ）を含むメタデータである。 Next, the receiving device reconstructs the MPU data (MP4 format file) including the received MPU metadata, the received MF metadata, and the received sample data. Then, sample data included in the reconstructed MPU data is decoded using the MPU metadata and MF metadata. MF metadata is metadata that includes data (for example, length stored in mbox) that can be generated only after sample data is generated on the transmitting side.

なお、上記受信装置の動作は、より詳細には、受信装置を構成する各構成要素によって行われる。例えば、受信装置は、上記データの受信を行う受信部と、上記ＭＰＵデータの再構成を行う再構成部と、上記ＭＰＵデータの復号を行う復号部とを備える。なお、受信部、生成部、及び復号部のそれぞれは、マイクロコンピュータ、プロセッサ、専用回路などによって実現される。 In addition, the operation of the above-mentioned receiving device is performed by each component that constitutes the receiving device in more detail. For example, the receiving device includes a receiving section that receives the data, a reconfiguration section that reconstructs the MPU data, and a decoding section that decodes the MPU data. Note that each of the receiving section, generating section, and decoding section is realized by a microcomputer, a processor, a dedicated circuit, or the like.

［ヘッダ情報を用いずに復号を行う方法］
次に、ヘッダ情報を用いずに復号を行う方法について説明する。ここでは、送信側でヘッダ情報を送るか送らないかにかかわらず、受信装置においてヘッダ情報を用いずに復号する方法を説明する。すなわち、この方法は、図２１を用いて説明したいずれの送信方法を用いた場合においても適用可能である。ただし、一部の復号方法は、特定の送信方法の場合にのみ適用可能な復号方法である。 [How to decrypt without using header information]
Next, a method of decoding without using header information will be described. Here, a method for decoding without using header information at the receiving device will be described, regardless of whether header information is sent on the transmitting side or not. That is, this method is applicable to any of the transmission methods described using FIG. 21. However, some decoding methods are applicable only to specific transmission methods.

図２２は、ヘッダ情報を用いずに復号を行う方法の例を示す図である。図２２では、メディアデータのみが含まれるＭＭＴペイロード及びＭＭＴパケットのみが図示されており、ＭＰＵメタデータやＭＦメタデータが含まれるＭＭＴペイロード及びＭＭＴパケットは図示されていない。また、以下の図２２の説明においては、同じＭＰＵに属するメディアデータは連続して伝送されるものとする。また、メディアデータとしてペイロードにサンプルが格納されている場合を例に説明するが、以下の図２２の説明においては、当然ＮＡＬユニットが格納されていてもよいし、フラグメントされたＮＡＬユニットが格納されていてもよい。 FIG. 22 is a diagram illustrating an example of a method for decoding without using header information. In FIG. 22, only the MMT payload and MMT packet containing only media data are illustrated, and the MMT payload and MMT packet containing MPU metadata and MF metadata are not illustrated. Furthermore, in the following description of FIG. 22, it is assumed that media data belonging to the same MPU is transmitted continuously. Furthermore, an example will be explained in which a sample is stored in the payload as media data, but in the following explanation of FIG. You can leave it there.

メディアデータを復号するためには、受信装置は、まず、復号に必要な初期化情報を取得しなければならない。また、メディアがビデオであれば、受信装置は、サンプル毎の初期化情報を取得したり、ランダムアクセス単位であるＭＰＵの開始位置を特定し、サンプル及びＮＡＬユニットの開始位置を取得しなければならない。また、受信装置は、それぞれサンプルの復号時刻（ＤＴＳ）や提示時刻（ＰＴＳ）を特定する必要がある。 In order to decode media data, a receiving device must first obtain initialization information necessary for decoding. Furthermore, if the media is a video, the receiving device must obtain initialization information for each sample, specify the starting position of the MPU, which is a random access unit, and obtain the starting positions of samples and NAL units. . Further, the receiving device needs to specify the decoding time (DTS) and presentation time (PTS) of each sample.

そこで、受信装置は、例えば、下記の方法を用いてヘッダ情報を用いずに復号を行うことができる。なお、ペイロードにＮＡＬユニット単位またはＮＡＬユニットをフラグメントした単位が格納される場合は、下記説明において「サンプル」を、「サンプルにおけるＮＡＬユニット」に読み替えればよい。 Therefore, the receiving device can perform decoding without using header information using, for example, the following method. Note that if the payload stores a unit of NAL unit or a unit of fragmented NAL unit, "sample" in the following description may be replaced with "NAL unit in sample."

＜ランダムアクセス（＝ＭＰＵの先頭サンプルを特定）＞
ヘッダ情報が送信されない場合に、受信装置がＭＰＵの先頭サンプルを特定するには、下記方法１と方法２がある。なお、ヘッダ情報が送信される場合には、方法３を用いることができる。 <Random access (=identifying the first sample of MPU)>
When header information is not transmitted, there are methods 1 and 2 below for the receiving device to identify the leading sample of the MPU. Note that when header information is transmitted, method 3 can be used.

［方法１］受信装置は、ＭＭＴパケットヘッダにおいて、’ＲＡＰ＿ｆｌａｇ＝１’であるＭＭＴパケットに含まれるサンプルを取得する。 [Method 1] The receiving device obtains a sample included in the MMT packet in which 'RAP_flag=1' in the MMT packet header.

［方法２］受信装置は、ＭＭＴペイロードヘッダにおいて、’ｓａｍｐｌｅｎｕｍｂｅｒ＝０’であるサンプルを取得する。 [Method 2] The receiving device obtains a sample with 'sample number=0' in the MMT payload header.

［方法３］受信装置は、メディアデータの前及び後ろの少なくともどちらか一方に、ＭＰＵメタデータ及びＭＦメタデータの少なくともどちらか一方が送信されている場合、受信装置は、ＭＭＴペイロードヘッダにおけるフラグメントタイプ（ＦＴ）がメディアデータへ切り替わったＭＭＴペイロードに含まれるサンプルを取得する。 [Method 3] If at least one of MPU metadata and MF metadata is transmitted before and after media data, the receiving device detects the fragment type in the MMT payload header. (FT) acquires a sample included in the MMT payload that has been switched to media data.

なお、方法１及び方法２において、１つのペイロードに異なるＭＰＵに属する複数のサンプルが混在する場合、どのＮＡＬユニットがランダムアクセスポイント（ＲＡＰ＿ｆｌａｇ＝１或いはｓａｍｐｌｅｎｕｍｂｅｒ＝０）であるか判定不能である。このため、１つのペイロードに異なるＭＰＵのサンプルを混在させないといった制約、または、１つのペイロードに異なるＭＰＵのサンプルが混在する場合は、最後（或いは最初）のサンプルがランダムアクセスポイントである場合に、ＲＡＰ＿ｆｌａｇを１とするといった制約などが必要である。 Note that in method 1 and method 2, when a plurality of samples belonging to different MPUs are mixed in one payload, it is impossible to determine which NAL unit is a random access point (RAP_flag=1 or sample number=0). For this reason, there are restrictions such as not mixing samples of different MPUs in one payload, or when samples of different MPUs are mixed in one payload, if the last (or first) sample is a random access point, RAP_flag Restrictions such as setting 1 to 1 are necessary.

また、受信装置がＮＡＬユニットの開始位置を取得するためには、サンプルの先頭ＮＡＬユニットから順に、ＮＡＬユニットのサイズ分だけデータの読出しポインタをシフトさせていく必要がある。 Furthermore, in order for the receiving device to obtain the starting position of the NAL unit, it is necessary to shift the data read pointer by the size of the NAL unit sequentially from the first NAL unit of the sample.

データがフラグメントされている場合は、受信装置は、ｆｒａｇｍｅｎｔ＿ｉｎｄｉｃａｔｏｒやｆｒａｇｍｅｎｔ＿ｎｕｍｂｅｒを参照することで、データユニットを特定できる。 If the data is fragmented, the receiving device can identify the data unit by referring to fragment_indicator and fragment_number.

＜サンプルのＤＴＳの決定＞
サンプルのＤＴＳの決定方法には、下記方法１と方法２がある。 <Determination of sample DTS>
Methods for determining the DTS of a sample include Method 1 and Method 2 below.

［方法１］受信装置は、予測構造に基づいて先頭サンプルのＤＴＳを決定する。ただし、この方法には符号化データの解析が必要であり、実時間での復号が困難である可能性があるため、次の方法２が望ましい。 [Method 1] The receiving device determines the DTS of the first sample based on the prediction structure. However, this method requires analysis of encoded data and may be difficult to decode in real time, so the following method 2 is preferable.

［方法２］受信装置は、先頭サンプルのＤＴＳを別途送信し、送信された先頭サンプルのＤＴＳを取得する。先頭サンプルのＤＴＳの送信方法は、例えば、ＭＰＵ先頭サンプルのＤＴＳを、ＭＭＴ－ＳＩを用いて送信する方法や、サンプル毎のＤＴＳをＭＭＴパケットヘッダ拡張領域を用いて送信する方法などがある。なお、ＤＴＳは、絶対値でもよいし、ＰＴＳに対する相対値であってもよい。また、送信側において先頭サンプルのＤＴＳが含まれているかどうかをシグナリングしてもよい。 [Method 2] The receiving device separately transmits the DTS of the first sample and acquires the transmitted DTS of the first sample. Examples of methods for transmitting the DTS of the first sample include a method of transmitting the DTS of the MPU first sample using MMT-SI, and a method of transmitting the DTS of each sample using the MMT packet header extension area. Note that DTS may be an absolute value or may be a relative value with respect to PTS. Alternatively, the transmitting side may signal whether or not the DTS of the first sample is included.

なお、方法１、方法２ともに、以降のサンプルのＤＴＳは、固定フレームレートであるとして算出する。 Note that in both method 1 and method 2, the DTS of subsequent samples is calculated assuming a fixed frame rate.

サンプル毎のＤＴＳをパケットヘッダに格納する方法として、拡張領域を用いる以外に、ＭＭＴパケットヘッダにおける３２ｂｉｔのＮＴＰタイムスタンプフィールドに、当該ＭＭＴパケットに含まれるサンプルのＤＴＳを格納する方法がある。１つのパケットヘッダのビット数（３２ｂｉｔ）でＤＴＳを表現できない場合は、ＤＴＳは、複数のパケットヘッダを用いて表現されてもよい。また、ＤＴＳは、パケットヘッダのＮＴＰタイムスタンプフィールドと拡張領域とを組み合わせて表現されてもよい。ＤＴＳ情報が含まれない場合は既知の値（例えばＡＬＬ０）とされる。 As a method for storing the DTS of each sample in the packet header, in addition to using an extension area, there is a method of storing the DTS of the sample included in the MMT packet in the 32-bit NTP timestamp field in the MMT packet header. If the DTS cannot be expressed using the number of bits (32 bits) of one packet header, the DTS may be expressed using a plurality of packet headers. Further, the DTS may be expressed by combining the NTP timestamp field of the packet header and an extension field. If DTS information is not included, a known value (for example, ALL0) is assumed.

＜サンプルのＰＴＳの決定＞
受信装置は、先頭サンプルのＰＴＳを、ＭＰＵに含まれるアセット毎のＭＰＵタイムスタンプ記述子から取得する。受信装置は、以降のサンプルＰＴＳについては、固定フレームレートであるものとして、ＰＯＣ等のサンプルの表示順を示すパラメータなどから算出する。このように、ヘッダ情報を用いずにＤＴＳ、及びＰＴＳを算出するためには、固定フレームレートによる送信が必須となる。 <Determination of sample PTS>
The receiving device obtains the PTS of the first sample from the MPU timestamp descriptor for each asset included in the MPU. The receiving device calculates the subsequent sample PTS based on a parameter indicating the display order of samples such as POC, assuming that the frame rate is fixed. In this way, in order to calculate DTS and PTS without using header information, transmission at a fixed frame rate is essential.

また、ＭＦメタデータが送信されている場合、受信装置は、ＭＦメタデータに示される先頭サンプルからのＤＴＳやＰＴＳの相対時刻情報と、ＭＰＵタイムスタンプ記述子に示されるＭＰＵ先頭サンプルのタイムスタンプの絶対値とからＤＴＳ及びＰＴＳの絶対値を算出できる。 In addition, when MF metadata is transmitted, the receiving device receives the relative time information of the DTS and PTS from the first sample indicated in the MF metadata, and the timestamp of the MPU first sample indicated in the MPU timestamp descriptor. The absolute values of DTS and PTS can be calculated from the absolute values.

なお、符号化データ解析してＤＴＳ及びＰＴＳを算出する際には、受信装置は、アクセスユニットに含まれるＳＥＩ情報を用いて算出してもよい。 Note that when analyzing the encoded data and calculating the DTS and PTS, the receiving device may use SEI information included in the access unit to calculate the DTS and PTS.

＜初期化情報（パラメータセット）＞
［ビデオの場合］
ビデオの場合、パラメータセットは、サンプルデータに格納される。また、ＭＰＵメタデータ及びＭＦメタデータが送信されない場合は、サンプルデータのみを参照することにより復号に必要なパラメータセットを取得できることを保証する。 <Initialization information (parameter set)>
[For video]
For video, the parameter set is stored in the sample data. Furthermore, if MPU metadata and MF metadata are not transmitted, it is guaranteed that the parameter set necessary for decoding can be obtained by referring only to sample data.

また、図２１の（ａ）及び（ｄ）のように、ＭＰＵメタデータがメディアデータよりも先に送信される場合、ＳａｍｐｌｅＥｎｔｒｙにはパラメータセットは格納しないと規定されてもよい。この場合、受信装置は、ＳａｍｐｌｅＥｎｔｒｙのパラメータセットは参照せずにサンプル内のパラメータセットのみを参照する。 Furthermore, as shown in FIGS. 21(a) and 21(d), when MPU metadata is transmitted before media data, it may be specified that no parameter set is stored in SampleEntry. In this case, the receiving device does not refer to the parameter set of SampleEntry, but only refers to the parameter set within the sample.

また、ＭＰＵメタデータがメディアデータよりも先に送信される場合、ＳａｍｐｌｅＥｎｔｒｙにはＭＰＵに共通のパラメータセットやデフォルトのパラメータセットが格納され、受信装置は、ＳａｍｐｌｅＥｎｔｒｙのパラメータセット及びサンプル内のパラメータセットを参照してもよい。ＳａｍｐｌｅＥｎｔｒｙにパラメータセットが格納されることにより、ＳａｍｐｌｅＥｎｔｒｙにパラメータセットが存在しないと再生できない従来の受信装置でも復号を行うことが可能となる。 Furthermore, when MPU metadata is transmitted before media data, SampleEntry stores a parameter set common to the MPU and a default parameter set, and the receiving device stores the parameter set of SampleEntry and the parameter set in the sample. You may refer to it. By storing the parameter set in the SampleEntry, it becomes possible to perform decoding even with a conventional receiving device that cannot reproduce data unless the parameter set exists in the SampleEntry.

［オーディオの場合］
オーディオの場合、復号にはＬＡＴＭヘッダが必要であり、ＭＰ４では、ＬＡＴＭヘッダがサンプルエントリに含められることが必須である。しかし、ヘッダ情報が送信されない場合は、受信装置がＬＡＴＭヘッダを取得することは困難であるため、別途ＳＩなどの制御情報にＬＡＴＭヘッダが含められる。なお、ＬＡＴＭヘッダは、メッセージ、テーブル、または記述子に含められてもよい。なお、ＬＡＴＭヘッダはサンプル内に含められることもある。 [For audio]
For audio, a LATM header is required for decoding, and for MP4, it is mandatory that the LATM header be included in the sample entry. However, if the header information is not transmitted, it is difficult for the receiving device to acquire the LATM header, so the LATM header is separately included in control information such as SI. Note that the LATM header may be included in a message, table, or descriptor. Note that the LATM header may be included within the sample.

受信装置は、復号開始前にＳＩなどからＬＡＴＭヘッダを取得し、オーディオの復号を開始する。或いは、図２１の（ａ）及び図２１の（ｄ）に示されるように、ＭＰＵメタデータがメディアデータよりも先に送信される場合は、受信装置は、ＬＡＴＭヘッダをメディアデータより先に受信可能である。したがって、ＭＰＵメタデータがメディアデータよりも先に送信される場合は、従来の受信装置を用いても復号を行うことが可能となる。 The receiving device obtains the LATM header from the SI or the like before starting decoding, and starts decoding the audio. Alternatively, as shown in FIGS. 21(a) and 21(d), if the MPU metadata is transmitted before the media data, the receiving device receives the LATM header before the media data. It is possible. Therefore, if MPU metadata is transmitted before media data, it is possible to decode it even using a conventional receiving device.

＜その他＞
送信順序や送信順序のタイプは、ＭＭＴパケットヘッダやペイロードヘッダ、或いは、ＭＰＴやその他のテーブル、メッセージ、記述子などの制御情報として通知されてもよい。なお、ここでの送信順序のタイプとは、例えば、図２１の（ａ）～（ｄ）の４つのタイプの送信順序であり、それぞれのタイプを識別するための識別子が復号開始前に取得できる場所に格納されればよい。 <Others>
The transmission order and the type of transmission order may be notified as control information such as an MMT packet header, payload header, MPT or other table, message, or descriptor. Note that the types of transmission order here are, for example, the four types of transmission orders shown in (a) to (d) in FIG. 21, and an identifier for identifying each type can be obtained before starting decoding. It should be stored somewhere.

また、送信順序のタイプは、オーディオとビデオとで異なるタイプが用いられてもよいし、オーディオとビデオとで共通のタイプが用いられてもよい。具体的には、例えば、オーディオは、図２１の（ａ）に示されるように、ＭＰＵメタデータ、ＭＦメタデータ、メディアデータの順番で送信され、ビデオは、図２１の（ｄ）に示されるように、ＭＰＵメタデータ、メディアデータ、ＭＦメタデータの順番で送信されてもよい。 Furthermore, different types of transmission order may be used for audio and video, or a common type may be used for audio and video. Specifically, for example, audio is transmitted in the order of MPU metadata, MF metadata, and media data as shown in FIG. 21(a), and video is transmitted in the order of MPU metadata, MF metadata, and media data as shown in FIG. 21(d). , MPU metadata, media data, and MF metadata may be transmitted in this order.

以上説明したような方法により、受信装置は、ヘッダ情報を用いずに復号を行うことが可能である。また、ＭＰＵメタデータがメディアデータよりも先に送信されている場合（図２１の（ａ）及び図２１の（ｄ））は、従来の受信装置でも復号を行うことが可能になる。 With the method described above, the receiving device can perform decoding without using header information. Furthermore, if the MPU metadata is transmitted before the media data (FIG. 21(a) and FIG. 21(d)), it becomes possible to decode it even with a conventional receiving device.

特に、ＭＦメタデータがメディアデータより後に送信されること（図２１の（ｄ））により、カプセル化による遅延を発生させず、かつ従来の受信装置でも復号を行うことが可能となる。 In particular, by transmitting the MF metadata after the media data ((d) in FIG. 21), a delay due to encapsulation does not occur, and even a conventional receiving device can perform decoding.

［送信装置の構成及び動作］
次に、送信装置の構成及び動作について説明する。図２３は、実施の形態２に係る送信装置のブロック図であり、図２４は、実施の形態２に係る送信方法のフローチャートである。 [Configuration and operation of transmitter]
Next, the configuration and operation of the transmitter will be explained. FIG. 23 is a block diagram of a transmitting device according to the second embodiment, and FIG. 24 is a flowchart of a transmitting method according to the second embodiment.

図２３に示されるように、送信装置１５は、符号化部１６と、多重化部１７と、送信部１８とを備える。 As shown in FIG. 23, the transmitting device 15 includes an encoding section 16, a multiplexing section 17, and a transmitting section 18.

符号化部１６は、符号化対象のビデオまたはオーディオを、例えば、Ｈ．２６５に従い符号化することで符号化データを生成する（Ｓ１０）。 The encoding unit 16 converts video or audio to be encoded into, for example, H. Encoded data is generated by encoding according to H.265 (S10).

多重化部１７は、符号化部１６により生成された符号化データを多重化（パケット化）する（Ｓ１１）。具体的には、多重化部１７は、ＭＰ４フォーマットのファイルを構成する、サンプルデータ、ＭＰＵメタデータ、及び、ＭＦメタデータ、のそれぞれをパケット化する。サンプルデータは、映像信号または音声信号が符号化されたデータであり、ＭＰＵメタデータは、第１のメタデータの一例であり、ＭＦメタデータは、第２のメタデータの一例である。第１のメタデータと第２のメタデータとは、いずれもサンプルデータの復号に用いられるメタデータであるが、これらの違いは、第２のメタデータがサンプルデータの生成後にのみ生成可能なデータを含むことである。 The multiplexing unit 17 multiplexes (packets) the encoded data generated by the encoding unit 16 (S11). Specifically, the multiplexing unit 17 packetizes each of sample data, MPU metadata, and MF metadata that constitute an MP4 format file. The sample data is data in which a video signal or an audio signal is encoded, MPU metadata is an example of first metadata, and MF metadata is an example of second metadata. The first metadata and the second metadata are both metadata used to decrypt sample data, but the difference between them is that the second metadata is data that can only be generated after the sample data is generated. It is to include.

ここで、サンプルデータの生成後にのみ生成可能なデータは、例えば、ＭＰ４フォーマットにおけるｍｄａｔに格納されるサンプルデータ以外のデータ（ｍｄａｔのヘッダ内のデータ。つまり、図２０に図示されるｔｙｐｅ及びｌｅｎｇｔｈ。）である。ここで、第２のメタデータには、このデータのうち少なくとも一部であるｌｅｎｇｔｈが含まれればよい。 Here, data that can be generated only after generation of sample data is, for example, data other than sample data stored in mdat in MP4 format (data in the header of mdat, that is, type and length shown in FIG. 20). ). Here, the second metadata only needs to include length, which is at least a part of this data.

送信部１８は、パケット化したＭＰ４フォーマットのファイルを送信する（Ｓ１２）。送信部１８は、例えば、図２１の（ｄ）に示される方法でＭＰ４フォーマットのファイルを送信する。つまり、パケット化されたＭＰＵメタデータ、パケット化されたサンプルデータ、パケット化されたＭＦメタデータをこの順に送信する。 The transmitter 18 transmits the packetized MP4 format file (S12). The transmitter 18 transmits the MP4 format file, for example, by the method shown in FIG. 21(d). That is, packetized MPU metadata, packetized sample data, and packetized MF metadata are transmitted in this order.

なお、符号化部１６、多重化部１７、及び送信部１８のそれぞれは、マイクロコンピュータ、プロセッサ、または専用回路などによって実現される。 Note that each of the encoding section 16, the multiplexing section 17, and the transmitting section 18 is realized by a microcomputer, a processor, a dedicated circuit, or the like.

［受信装置の構成］
次に、受信装置の構成及び動作について説明する。図２５は、実施の形態２に係る受信装置のブロック図である。 [Receiving device configuration]
Next, the configuration and operation of the receiving device will be explained. FIG. 25 is a block diagram of a receiving device according to Embodiment 2.

図２５に示されるように、受信装置２０は、パケットフィルタリング部２１と、送信順序タイプ判別部２２と、ランダムアクセス部２３と、制御情報取得部２４と、データ取得部２５と、ＰＴＳ、ＤＴＳ算出部２６と、初期化情報取得部２７と、復号命令部２８と、復号部２９と、提示部３０とを備える。 As shown in FIG. 25, the receiving device 20 includes a packet filtering unit 21, a transmission order type determining unit 22, a random access unit 23, a control information acquisition unit 24, a data acquisition unit 25, and a PTS and DTS calculation unit. unit 26 , initialization information acquisition unit 27 , decoding command unit 28 , decoding unit 29 , and presentation unit 30 .

［受信装置の動作１］
まず、メディアがビデオである場合に、受信装置２０が、ＭＰＵ先頭位置及びＮＡＬユニット位置を特定するための動作について説明する。図２６は、受信装置２０のこのような動作のフローチャートである。なお、ここでは、ＭＰＵデータの送信順序タイプは、送信装置１５（多重化部１７）によってＳＩ情報に格納されているとする。 [Receiving device operation 1]
First, when the media is a video, the operation of the receiving device 20 to specify the MPU start position and the NAL unit position will be described. FIG. 26 is a flowchart of such an operation of the receiving device 20. Note that here, it is assumed that the transmission order type of MPU data is stored in the SI information by the transmitting device 15 (multiplexing unit 17).

まず、パケットフィルタリング部２１は、受信したファイルに対してパケットフィルタリングを行う。送信順序タイプ判別部２２は、パケットフィルタリングによって得られるＳＩ情報を解析して、ＭＰＵデータの送信順序タイプを取得する（Ｓ２１）。 First, the packet filtering unit 21 performs packet filtering on the received file. The transmission order type determination unit 22 analyzes the SI information obtained by packet filtering and acquires the transmission order type of MPU data (S21).

次に、送信順序タイプ判別部２２は、パケットフィルタリング後のデータにＭＰＵヘッダ情報（ＭＰＵメタデータ或いはＭＦメタデータの少なくとも一方）が含まれているか否かを判定（判別）する（Ｓ２２）。ＭＰＵヘッダ情報（が含まれている場合（Ｓ２２でＹｅｓ）には、ランダムアクセス部２３は、ＭＭＴペイロードヘッダのフラグメントタイプがメディアデータへ切り替わることを検出することで、ＭＰＵ先頭サンプルを特定する（Ｓ２３）。 Next, the transmission order type determining unit 22 determines (determines) whether the data after packet filtering includes MPU header information (at least one of MPU metadata or MF metadata) (S22). If the MPU header information (is included (Yes in S22), the random access unit 23 identifies the MPU leading sample by detecting that the fragment type of the MMT payload header is switched to media data (S23). ).

一方、ＭＰＵヘッダ情報が含まれていない場合（Ｓ２２でＮｏ）には、ランダムアクセス部２３は、ＭＭＴパケットヘッダのＲＡＰ＿ｆｌａｇ或いはＭＭＴペイロードヘッダのｓａｍｐｌｅｎｕｍｂｅｒに基づいてＭＰＵ先頭サンプルを特定する（Ｓ２４）。 On the other hand, if the MPU header information is not included (No in S22), the random access unit 23 identifies the MPU leading sample based on the RAP_flag of the MMT packet header or the sample number of the MMT payload header (S24).

また、送信順序タイプ判別部２２は、パケットフィルタリングされたデータに、ＭＦメタデータが含まれているか否かを判定する（Ｓ２５）。ＭＦメタデータが含まれていると判定された場合（Ｓ２５でＹｅｓ）には、データ取得部２５は、ＭＦメタデータに含まれるサンプル、サブサンプルのオフセット、及びサイズ情報に基づいてＮＡＬユニットを読み出すことによりＮＡＬユニットを取得する（Ｓ２６）。一方、ＭＦメタデータが含まれていないと判定された場合（Ｓ２５でＮｏ）には、データ取得部２５は、サンプルの先頭ＮＡＬユニットから順に、ＮＡＬユニットのサイズのデータを読み出すことでＮＡＬユニットを取得する（Ｓ２７）。 Furthermore, the transmission order type determining unit 22 determines whether or not the packet-filtered data includes MF metadata (S25). If it is determined that MF metadata is included (Yes in S25), the data acquisition unit 25 reads the NAL unit based on the sample, subsample offset, and size information included in the MF metadata. A NAL unit is thereby acquired (S26). On the other hand, if it is determined that MF metadata is not included (No in S25), the data acquisition unit 25 reads data of the size of the NAL unit in order from the first NAL unit of the sample to Acquire (S27).

なお、受信装置２０は、ステップＳ２２において、ＭＰＵヘッダ情報が含まれていると判別された場合でも、ステップＳ２３ではなくステップＳ２４の処理を用いてＭＰＵ先頭サンプルを特定してもよい。また、ＭＰＵヘッダ情報が含まれていると判別された場合に、ステップＳ２３の処理とステップＳ２４の処理とが併用されてもよい。 Note that even if it is determined in step S22 that MPU header information is included, the receiving device 20 may identify the MPU first sample using the process in step S24 instead of step S23. Moreover, when it is determined that MPU header information is included, the process of step S23 and the process of step S24 may be used together.

また、受信装置２０は、ステップＳ２５において、ＭＦメタデータが含まれていると判定された場合でも、ステップＳ２６の処理を用いずにステップＳ２７の処理を用いてＮＡＬユニットを取得してもよい。また、ＭＦメタデータが含まれていると判定された場合に、ステップＳ２３の処理とステップＳ２４の処理とが併用されてもよい。 Furthermore, even if it is determined in step S25 that MF metadata is included, the receiving device 20 may acquire the NAL unit using the process in step S27 without using the process in step S26. Further, when it is determined that MF metadata is included, the processing of step S23 and the processing of step S24 may be used together.

また、ステップＳ２５においてＭＦメタデータが含まれていると判定された場合であって、ＭＦデータがメディアデータより後に送信されている場合が想定される。この場合、受信装置２０は、メディアデータをバッファリングし、ＭＦメタデータを取得するまで待ってからステップＳ２６の処理を行ってもよいし、受信装置２０は、ＭＦメタデータの取得を待たずにステップＳ２７の処理を行うか否かを判定してもよい。 Furthermore, a case is assumed in which it is determined in step S25 that MF metadata is included, and the MF data is transmitted after the media data. In this case, the receiving device 20 may perform the process of step S26 after buffering the media data and waiting until the MF metadata is obtained, or the receiving device 20 may buffer the media data and wait until the MF metadata is obtained before performing the process of step S26. It may be determined whether or not to perform the process of step S27.

例えば、受信装置２０は、メディアデータをバッファリングすることが可能なバッファサイズのバッファを保有しているかどうかに基づいてＭＦメタデータの取得を待つか否かを判定してもよい。また、受信装置２０は、Ｅｎｄ－ｔｏ－Ｅｎｄ遅延が小さくなるかどうかに基づいて、ＭＦメタデータの取得を待つか否かを判定してもよい。また、受信装置２０は、主としてステップＳ２６の処理を用いて復号処理を実施し、パケットロスなどが発生したときの処理モードの場合にステップＳ２７の処理を用いてもよい。 For example, the receiving device 20 may determine whether to wait for acquisition of MF metadata based on whether it has a buffer with a buffer size that can buffer media data. Further, the receiving device 20 may determine whether to wait for acquisition of MF metadata based on whether the end-to-end delay becomes small. Further, the receiving device 20 may perform the decoding process mainly using the process of step S26, and may use the process of step S27 in a processing mode when a packet loss or the like occurs.

なお、送信順序タイプがあらかじめ定められている場合は、ステップＳ２２及びステップＳ２６は省略されてもよいし、この場合、受信装置２０は、バッファサイズやＥｎｄ－ｔｏ－Ｅｎｄ遅延を考慮して、ＭＰＵ先頭サンプルの特定方法、及び、ＮＡＬユニットの特定方法を決定してもよい。 Note that if the transmission order type is predetermined, steps S22 and S26 may be omitted, and in this case, the receiving device 20 selects the MPU in consideration of the buffer size and End-to-End delay. The method for specifying the leading sample and the method for specifying the NAL unit may be determined.

なお、あらかじめ送信順序タイプが既知である場合は、受信装置２０において送信順序タイプ判別部２２は、不要である。 Note that if the transmission order type is known in advance, the transmission order type determination unit 22 in the receiving device 20 is not necessary.

また、上記図２６においては説明されないが、復号命令部２８は、ＰＴＳ、ＤＴＳ算出部２６において算出されたＰＴＳ及びＤＴＳ、初期化情報取得部２７において取得された初期化情報に基づいて、データ取得部において取得されたデータを復号部２９に出力する。復号部２９は、データを復号し、提示部３０は、復号後のデータを提示する。 Furthermore, although not explained in FIG. The data acquired in the section is output to the decoding section 29. The decoding unit 29 decodes the data, and the presentation unit 30 presents the decoded data.

［受信装置の動作２］
次に、受信装置２０が、送信順序タイプに基づいて初期化情報を取得し、初期化情報に基づいてメディアデータを復号する動作について説明する。図２７は、このような動作のフローチャートである。 [Receiving device operation 2]
Next, an operation in which the receiving device 20 acquires initialization information based on the transmission order type and decodes media data based on the initialization information will be described. FIG. 27 is a flowchart of such an operation.

まず、パケットフィルタリング部２１は、受信したファイルに対してパケットフィルタリングを行う。送信順序タイプ判別部２２は、パケットフィルタリングによって得られるＳＩ情報を解析し、送信順序タイプを取得する（Ｓ３０１）。 First, the packet filtering unit 21 performs packet filtering on the received file. The transmission order type determination unit 22 analyzes the SI information obtained by packet filtering and acquires the transmission order type (S301).

次に、送信順序タイプ判別部２２は、ＭＰＵメタデータが送信されているか否かを判定する（Ｓ３０２）。ＭＰＵメタデータが送信されていると判定された場合（Ｓ３０２でＹｅｓ）、送信順序タイプ判別部２２は、ステップＳ３０１の解析の結果、ＭＰＵメタデータがメディアデータより先に送信されているかどうかを判定する（Ｓ３０３）。ＭＰＵメタデータがメディアデータより先に送信されている場合（Ｓ３０３でＹｅｓ）、初期化情報取得部２７は、ＭＰＵメタデータに含まれる共通な初期化情報、及び、サンプルデータの初期化情報に基づいてメディアデータを復号する（Ｓ３０４）。 Next, the transmission order type determination unit 22 determines whether MPU metadata is being transmitted (S302). If it is determined that the MPU metadata has been transmitted (Yes in S302), the transmission order type determination unit 22 determines whether the MPU metadata is transmitted before the media data as a result of the analysis in step S301. (S303). If the MPU metadata is transmitted before the media data (Yes in S303), the initialization information acquisition unit 27 performs initialization based on the common initialization information included in the MPU metadata and the initialization information of the sample data. and decrypts the media data (S304).

一方、ＭＰＵメタデータがメディアデータより後に送信されていると判定された場合（Ｓ３０３でＮｏ）には、データ取得部２５は、ＭＰＵメタデータが取得されるまでメディアデータをバッファリングし（Ｓ３０５）、ＭＰＵメタデータが取得された後にステップＳ３０４の処理を実施する。 On the other hand, if it is determined that the MPU metadata is transmitted after the media data (No in S303), the data acquisition unit 25 buffers the media data until the MPU metadata is acquired (S305). , the process of step S304 is performed after the MPU metadata is acquired.

また、ステップＳ３０２において、ＭＰＵメタデータが送信されていないと判定された場合（Ｓ３０２でＮｏ）には、初期化情報取得部２７は、サンプルデータの初期化情報のみに基づいてメディアデータを復号する（Ｓ３０６）。 Further, if it is determined in step S302 that the MPU metadata has not been transmitted (No in S302), the initialization information acquisition unit 27 decodes the media data based only on the initialization information of the sample data. (S306).

なお、送信側においてサンプルデータの初期化情報に基づく場合のみメディアデータの復号が保証されている場合は、ステップＳ３０２、及びステップＳ３０３の判定に基づく処理を行わず、ステップＳ３０６の処理が用いられる。 Note that if decoding of the media data is guaranteed only based on the initialization information of the sample data on the transmitting side, the process based on the determinations in steps S302 and S303 is not performed, and the process in step S306 is used.

また、受信装置２０は、ステップＳ３０５の前に、メディアデータをバッファリングするか否かの判定を行ってもよい。この場合、受信装置２０は、メディアデータをバッファリングすると判定した場合にはステップＳ３０５の処理へ移行し、メディアデータをバッファリングしないと判定した場合には、ステップＳ３０６の処理へ移行する。メディアデータをバッファリングするか否かの判定は、受信装置２０のバッファサイズ、占有量に基づいて行われてもよいし、例えば、Ｅｎｄ－ｔｏ－Ｅｎｄ遅延の小さい方が選択されるなど、Ｅｎｄ－ｔｏ－Ｅｎｄ遅延を考慮して判定が行われてもよい。 Further, the receiving device 20 may determine whether or not to buffer the media data before step S305. In this case, if the receiving device 20 determines to buffer the media data, the process proceeds to step S305, and if it determines not to buffer the media data, the process proceeds to step S306. The determination as to whether or not to buffer media data may be made based on the buffer size and occupancy of the receiving device 20, or for example, the one with the smaller End-to-End delay may be selected. The determination may be made taking into account the -to-End delay.

［受信装置の動作３］
ここでは、ＭＦメタデータがメディアデータよりも後に送信される場合（図２１の（ｃ）、及び図２１の（ｄ））における送信方法や受信方法の詳細について説明する。以下では、図２１の（ｄ）の場合を例に説明する。なお、送信においては、図２１の（ｄ）の方法のみが用いられ、送信順序タイプのシグナリングは行われないものとする。 [Receiving device operation 3]
Here, details of the transmission method and reception method in the case where the MF metadata is transmitted after the media data (FIGS. 21(c) and 21(d)) will be described. Below, the case of FIG. 21(d) will be explained as an example. Note that in transmission, only the method shown in FIG. 21(d) is used, and transmission order type signaling is not performed.

先述のとおり、図２１の（ｄ）に示されるように、ＭＰＵメタデータ、メディアデータ、ＭＦメタデータの順でデータを送信する場合、
（Ｄ－１）受信装置２０は、ＭＰＵメタデータを取得した後、さらにＭＦメタデータを取得した後にメディアデータを復号する。 As mentioned above, when transmitting data in the order of MPU metadata, media data, and MF metadata as shown in FIG. 21(d),
(D-1) The receiving device 20 decodes the media data after acquiring the MPU metadata and further acquiring the MF metadata.

（Ｄ－２）受信装置２０は、ＭＰＵメタデータを取得した後、ＭＦメタデータを用いずにメディアデータを復号する。
の２通りの復号方法が可能である。 (D-2) After acquiring the MPU metadata, the receiving device 20 decodes the media data without using the MF metadata.
Two decoding methods are possible.

ここで、Ｄ－１は、ＭＦメタデータ取得のためのメディアデータのバッファリングが必要となるが、ＭＰＵヘッダ情報を用いて復号を行うことができるため、従来のＭＰ４準拠の受信装置で復号可能となる。また、Ｄ－２は、ＭＦメタデータ取得のためのメディアデータのバッファリングを必要としないが、ＭＦメタデータを用いて復号できないため、復号に特別な処理が必要となる。 Here, D-1 requires buffering of media data to obtain MF metadata, but it can be decoded using MPU header information, so it can be decoded by a conventional MP4-compliant receiving device. becomes. Further, D-2 does not require buffering of media data to obtain MF metadata, but requires special processing for decoding because it cannot be decoded using MF metadata.

また、図２１の（ｄ）の方法は、ＭＦメタデータは、メディアデータより後で送信されるため、カプセル化による遅延は発生せず、Ｅｎｄ－ｔｏ－Ｅｎｄ遅延を低減できるという利点を有する。 Furthermore, the method shown in FIG. 21(d) has the advantage that, since the MF metadata is transmitted after the media data, a delay due to encapsulation does not occur, and the end-to-end delay can be reduced.

受信装置２０は、受信装置２０の能力や、受信装置２０が提供するサービス品質に応じて、上記２通りの復号方法を選択することができる。 The receiving device 20 can select one of the above two decoding methods depending on the capability of the receiving device 20 and the quality of service provided by the receiving device 20.

送信装置１５は、受信装置２０における復号動作において、バッファのオーバーフローやアンダーフローの発生を低減して復号できることを保証しなければならない。Ｄ－１の方法を用いて復号する場合のデコーダモデルを規定するための要素としては、例えば下記のパラメータを用いることができる。 The transmitting device 15 must ensure that decoding can be performed while reducing the occurrence of buffer overflow or underflow in the decoding operation in the receiving device 20. For example, the following parameters can be used as elements for defining a decoder model when decoding is performed using method D-1.

・ＭＰＵを再構成するためのバッファサイズ（ＭＰＵバッファ）
例えば、バッファサイズ＝最大レート×最大ＭＰＵ時間×αであり、最大レートとは、符号化データのプロファイル、レベルの上限レート＋ＭＰＵヘッダのオーバーヘッドである。また、最大ＭＰＵ時間は、１ＭＰＵ＝１ＧＯＰ（ビデオ）とした場合のＧＯＰの最大時間長である。・Buffer size for reconfiguring MPU (MPU buffer)
For example, buffer size=maximum rate×maximum MPU time×α, where the maximum rate is the upper limit rate of the encoded data profile and level+the overhead of the MPU header. Further, the maximum MPU time is the maximum time length of a GOP when 1 MPU = 1 GOP (video).

ここで、オーディオは、上記ビデオに共通のＧＯＰ単位としてもよいし、別の単位でもよい。αは、オーバーフローを起こさないためのマージンであり、最大レート×最大ＭＰＵ時間に対して、乗算されてもよいし、加算されてもよい。乗算される場合は、α≧１であり、加算される場合は、α≧０である。 Here, the audio may be a GOP unit common to the video, or may be a separate unit. α is a margin to prevent overflow, and may be multiplied or added to maximum rate×maximum MPU time. When multiplied, α≧1, and when added, α≧0.

・ＭＰＵバッファへデータが入力されてから復号されるまでの復号遅延時間の上限。（ＭＰＥＧ－ＴＳのＳＴＤにおけるＴＳＴＤ＿ｄｅｌａｙ）
例えば、送信時には、最大ＭＰＵ時間、及び、復号遅延時間の上限値を考慮して、受信機におけるＭＰＵデータの取得完了時刻＜＝ＤＴＳとなるようにＤＴＳが設定される。 - Upper limit of decoding delay time from when data is input to the MPU buffer until it is decoded. (TSTD_delay in MPEG-TS STD)
For example, at the time of transmission, the maximum MPU time and the upper limit of the decoding delay time are taken into consideration, and the DTS is set so that the acquisition completion time of the MPU data in the receiver<=DTS.

また、送信装置１５は、Ｄ－１の方法を用いて復号する場合のデコーダモデルに従い、ＤＴＳ及びＰＴＳを付与してもよい。これにより、送信装置１５は、Ｄ－１の方法を用いて復号を行う受信装置の当該復号を保証すると同時に、Ｄ－２の方法を用いて復号が行われる場合に必要な補助情報を送信してもよい。 Further, the transmitting device 15 may add DTS and PTS according to the decoder model when decoding using method D-1. Thereby, the transmitting device 15 guarantees the decoding of the receiving device that performs decoding using method D-1, and at the same time transmits the auxiliary information necessary when decoding is performed using method D-2. It's okay.

例えば、送信装置１５は、Ｄ－２の方法を用いて復号する場合のデコーダバッファにおけるプリバッファリング時間をシグナリングすることにより、Ｄ－２の方法を用いて復号する受信装置の動作を保証できる。 For example, the transmitting device 15 can guarantee the operation of the receiving device that decodes using method D-2 by signaling the pre-buffering time in the decoder buffer when decoding using method D-2.

プリバッファリング時間は、メッセージ、テーブル、記述子などのＳＩ制御情報に含められてもよいし、ＭＭＴパケット、ＭＭＴペイロードのヘッダに含められてもよい。また、符号化データ内のＳＥＩが上書きされてもよい。Ｄ－１の方法を用いて復号するためのＤＴＳ及びＰＴＳは、ＭＰＵタイムスタンプ記述子、ＳａｍｐｌｌｅＥｎｔｒｙに格納され、Ｄ－２の方法を用いて復号するためのＤＴＳ及びＰＴＳ、またはプリバッファリング時間がＳＥＩにおいて記述されてもよい。 The pre-buffering time may be included in SI control information such as a message, table, or descriptor, or may be included in the header of an MMT packet or MMT payload. Furthermore, the SEI within the encoded data may be overwritten. DTS and PTS for decoding using method D-1 are stored in the MPU timestamp descriptor and SampleEntry, and DTS and PTS for decoding using method D-2 or pre-buffering time are stored in the MPU timestamp descriptor and SampleEntry. It may be described in SEI.

受信装置２０は、当該受信装置２０がＭＰＵヘッダを用いたＭＰ４準拠の復号動作のみに対応している場合は、復号方法Ｄ－１を選択し、Ｄ－１およびＤ－２の両方に対応している場合は、どちらか一方を選択してもよい。 If the receiving device 20 supports only MP4-compliant decoding using an MPU header, the receiving device 20 selects decoding method D-1 and supports both D-1 and D-2. If so, you may choose one or the other.

送信装置１５は、一方（本説明では、Ｄ－１）の復号動作を保証できるようにＤＴＳ、及びＰＴＳを付与し、さらに一方の復号動作を補助するための補助情報を送信してもよい。 The transmitting device 15 may provide DTS and PTS to ensure the decoding operation of one side (D-1 in this description), and may also transmit supplementary information to assist the decoding operation of one side.

また、Ｄ－２の方法が用いられる場合、Ｄ－１の方法が用いられる場合と比較して、ＭＦメタデータのプリバッファリングに起因する遅延により、Ｅｎｄ－ｔｏ－Ｅｎｄ遅延が大きくなる可能性が高い。したがって、受信装置２０は、Ｅｎｄ－ｔｏ－Ｅｎｄ遅延を小さくしたいときは、Ｄ－２の方法を選択して復号してもよい。例えば、受信装置２０は、常にＥｎｄ－ｔｏ－Ｅｎｄ遅延を削減したい場合に、常にＤ－２の方法を用いてもよい。また、受信装置２０は、ライブコンテンツや、選局、ザッピング動作など、低遅延で提示したい、低遅延提示モードで動作している場合のみＤ－２の方法を用いてもよい。 Also, when method D-2 is used, there is a possibility that the End-to-End delay will be larger due to the delay caused by pre-buffering of MF metadata compared to when method D-1 is used. is high. Therefore, when the receiving device 20 wants to reduce the end-to-end delay, it may select method D-2 for decoding. For example, when the receiving device 20 always wants to reduce the End-to-End delay, it may always use method D-2. Further, the receiving device 20 may use method D-2 only when operating in a low-delay presentation mode where it wants to present live content, channel selection, zapping operation, etc. with low delay.

図２８は、このような受信方法のフローチャートである。 FIG. 28 is a flowchart of such a receiving method.

まず、受信装置２０は、ＭＭＴパケットを受信し、ＭＰＵデータを取得する（Ｓ４０１）。そして、受信装置２０（送信順序タイプ判別部２２）は、当該プログラムを低遅延提示モードで提示するかどうかの判定を行う（Ｓ４０２）。 First, the receiving device 20 receives an MMT packet and obtains MPU data (S401). Then, the receiving device 20 (transmission order type determining unit 22) determines whether or not to present the program in the low-delay presentation mode (S402).

プログラムを低遅延提示モードで提示しない場合（Ｓ４０２でＮｏ）、受信装置２０（ランダムアクセス部２３及び初期化情報取得部２７）は、ヘッダ情報を用いてランダムアクセス、初期化情報を取得する（Ｓ４０５）。また、受信装置２０（ＰＴＳ、ＤＴＳ算出部２６、復号命令部２８、復号部２９、提示部３０）は、送信側で付与されたＰＴＳ、ＤＴＳに基づいてデコード及び提示処理を行う（Ｓ４０６）。 If the program is not presented in the low-delay presentation mode (No in S402), the receiving device 20 (random access unit 23 and initialization information acquisition unit 27) uses the header information to acquire random access and initialization information (S405). ). Further, the receiving device 20 (PTS, DTS calculating unit 26, decoding command unit 28, decoding unit 29, presentation unit 30) performs decoding and presentation processing based on the PTS and DTS given by the transmitting side (S406).

一方、プログラムを低遅延提示モードで提示する場合（Ｓ４０２でＹｅｓ）、受信装置２０（ランダムアクセス部２３及び初期化情報取得部２７）は、ヘッダ情報を用いない復号方法を用いて、ランダムアクセス、初期化情報を取得する（Ｓ４０３）。また、受信装置２０は、送信側で付与されたＰＴＳ、ＤＴＳ及びヘッダ情報を用いずに復号するための補助情報に基づいてデコード及び提示処理を行う（Ｓ４０４）。なお、ステップＳ４０３、及びステップＳ４０４において、ＭＰＵメタデータを用いて処理が行われてもよい。 On the other hand, when presenting the program in the low-delay presentation mode (Yes in S402), the receiving device 20 (random access unit 23 and initialization information acquisition unit 27) uses a decoding method that does not use header information to perform random access, Initialization information is acquired (S403). Further, the receiving device 20 performs decoding and presentation processing based on the PTS, DTS, and auxiliary information for decoding without using the header information provided on the transmitting side (S404). Note that in steps S403 and S404, processing may be performed using MPU metadata.

［補助データを用いた送受信方法］
以上、ＭＦメタデータがメディアデータより後に送信される場合（図２１の（ｃ）、及び図２１の（ｄ）の場合）における送受信動作について説明した。次に、送信装置１５がＭＦメタデータの一部の機能を有する補助データを送信することにより、より早く復号を開始でき、Ｅｎｄ－ｔｏ－Ｅｎｄ遅延を削減できる方法について説明する。ここでは、図２１の（ｄ）に示される送信方法に基づいて補助データがさらに送信される例について説明されるが、補助データを用いる方法は、図２１の（ａ）～（ｃ）に示される送信方法においても適用可能である。 [Transmission/reception method using auxiliary data]
The transmission and reception operations in the case where the MF metadata is transmitted after the media data (cases of (c) and (d) in FIG. 21) have been described above. Next, a method will be described in which the transmitting device 15 can start decoding earlier and reduce End-to-End delay by transmitting auxiliary data having some functions of MF metadata. Here, an example will be described in which auxiliary data is further transmitted based on the transmission method shown in FIG. 21(d), but methods using auxiliary data are shown in FIGS. 21(a) to (c). It is also applicable to other transmission methods.

図２９の（ａ）は、図２１の（ｄ）に示される方法を用いて送信されたＭＭＴパケットを示す図である。つまり、データは、ＭＰＵメタデータ、メディアデータ、ＭＦメタデータの順で送信される。 FIG. 29(a) is a diagram showing an MMT packet transmitted using the method shown in FIG. 21(d). That is, data is transmitted in the order of MPU metadata, media data, and MF metadata.

ここで、サンプル＃１、サンプル＃２、サンプル＃３、サンプル＃４はメディアデータに含まれるサンプルである。なお、ここではメディアデータは、サンプル単位でＭＭＴパケットに格納される例について説明されるが、メディアデータは、ＮＡＬユニット単位でＭＭＴパケットに格納されてもよいし、ＮＡＬユニットを分割した単位で格納されてもよい。なお、複数のＮＡＬユニットがアグリゲーションされてＭＭＴパケットに格納される場合もある。 Here, sample #1, sample #2, sample #3, and sample #4 are samples included in the media data. Note that although an example in which media data is stored in an MMT packet in units of samples will be explained here, media data may be stored in an MMT packet in units of NAL units, or in units of divided NAL units. may be done. Note that a plurality of NAL units may be aggregated and stored in an MMT packet.

先述のＤ－１で説明したように、図２１の（ｄ）に示される方法の場合、つまり、ＭＰＵメタデータ、メディアデータ、ＭＦメタデータの順でデータが送信される場合、ＭＰＵメタデータを取得後、さらにＭＦメタデータを取得し、その後、メディアデータを復号する方法がある。このようなＤ－１の方法では、ＭＦメタデータ取得のためのメディアデータのバッファリングが必要となるが、ＭＰＵヘッダ情報を用いて復号が行われるため、従来のＭＰ４準拠の受信装置にもＤ－１の方法は適用可能である利点がある。一方で、受信装置２０は、ＭＦメタデータ取得まで、復号開始を待たなければならない欠点がある。 As explained in D-1 above, in the case of the method shown in FIG. 21(d), that is, when data is transmitted in the order of MPU metadata, media data, and MF metadata, MPU metadata After the acquisition, there is a method of further acquiring MF metadata and then decoding the media data. In method D-1, buffering of media data is required to obtain MF metadata, but since decoding is performed using MPU header information, conventional MP4-compliant receiving devices can also use D-1. Method -1 has the advantage of being applicable. On the other hand, the receiving device 20 has the disadvantage that it has to wait to start decoding until it obtains the MF metadata.

これに対し、図２９の（ｂ）に示されるように、補助データを用いる手法においては、ＭＦメタデータより先に、補助データが送信される。 On the other hand, as shown in FIG. 29(b), in the method using auxiliary data, the auxiliary data is transmitted before the MF metadata.

ＭＦメタデータには、ムービーフラグメントに含まれる全てのサンプルのＤＴＳやＰＴＳ、オフセットやサイズを示す情報が含まれている。これに対し、補助データには、ムービーフラグメントに含まれるサンプルのうち、一部のサンプルのＤＴＳやＰＴＳ、オフセットやサイズを示す情報が含まれる。 The MF metadata includes information indicating the DTS, PTS, offset, and size of all samples included in the movie fragment. On the other hand, the auxiliary data includes information indicating the DTS, PTS, offset, and size of some of the samples included in the movie fragment.

例えば、ＭＦメタデータには、すべてのサンプル（サンプル＃１－サンプル＃４）の情報が含まれるのに対し、補助データには一部のサンプル（サンプル＃１－＃２）の情報が含まれる。 For example, MF metadata includes information about all samples (sample #1-sample #4), whereas auxiliary data includes information about some samples (sample #1-#2). .

図２９の（ｂ）に示される場合は、補助データが用いられることでサンプル＃１、及びサンプル＃２の復号が可能となるため、Ｄ－１の送信方法に対して、Ｅｎｄ－ｔｏ－Ｅｎｔ遅延が小さくなる。なお、補助データには、どのようにサンプルの情報が組み合わされて含められてもよいし、補助データは、繰り返し送信されてもよい。 In the case shown in FIG. 29(b), sample #1 and sample #2 can be decoded by using auxiliary data, so End-to-Ent Delay is reduced. Note that the auxiliary data may include any combination of sample information, and the auxiliary data may be transmitted repeatedly.

例えば、図２９の（ｃ）において、Ａのタイミングで補助情報を送信する場合は、送信装置１５は、補助情報にサンプル＃１の情報を含め、Ｂのタイミングで補助情報を送信する場合は、補助情報にサンプル＃１及びサンプル＃２の情報を含める。送信装置１５は、Ｃのタイミングで補助情報を送信する場合は、補助情報にはサンプル＃１、サンプル＃２、及びサンプル＃３の情報を含める。 For example, in (c) of FIG. 29, when transmitting the auxiliary information at timing A, the transmitting device 15 includes the information of sample #1 in the auxiliary information, and when transmitting the auxiliary information at timing B, Include information on sample #1 and sample #2 in the auxiliary information. When transmitting the auxiliary information at timing C, the transmitting device 15 includes information on sample #1, sample #2, and sample #3 in the auxiliary information.

なお、ＭＦメタデータには、サンプル＃１、サンプル＃２、サンプル＃３、及び、サンプル＃４の情報（ムービーフラグメントの中の全サンプルの情報）が含まれる。 Note that the MF metadata includes information on sample #1, sample #2, sample #3, and sample #4 (information on all samples in the movie fragment).

補助データは、必ずしも生成後、ただちに送信される必要はない。 Ancillary data does not necessarily need to be transmitted immediately after it is generated.

なお、ＭＭＴパケットやＭＭＴペイロードのヘッダにおいては、補助データが格納されていることを示すタイプが指定される。 Note that in the header of the MMT packet or MMT payload, a type indicating that auxiliary data is stored is specified.

例えば、補助データがＭＭＴペイロードにＭＰＵモードを用いて格納される場合は、ｆｒａｇｍｅｎｔ＿ｔｙｐｅフィールド値（例えば、ＦＴ＝３）として、補助データであることを示すデータタイプが指定される。補助データは、ｍｏｏｆの構成に基づくデータであってもよいし、その他の構成であってもよい。 For example, when auxiliary data is stored in the MMT payload using MPU mode, a data type indicating that it is auxiliary data is specified as the fragment_type field value (for example, FT=3). The auxiliary data may be data based on the moof configuration, or may have other configurations.

補助データが、ＭＭＴペイロードに制御信号（記述子、テーブル、メッセージ）として格納される場合は、補助データであることを示す記述子タグ、テーブルＩＤ、及びメッセージＩＤなどが指定される。 When auxiliary data is stored as a control signal (descriptor, table, message) in the MMT payload, a descriptor tag, table ID, message ID, etc. indicating that the auxiliary data is auxiliary data are specified.

また、ＭＭＴパケットやＭＭＴペイロードのヘッダにＰＴＳまたはＤＴＳが格納されてもよい。 Furthermore, PTS or DTS may be stored in the header of the MMT packet or MMT payload.

［補助データの生成例］
以下、送信装置がｍｏｏｆの構成に基づいて補助データを生成する例について説明する。図３０は、送信装置がｍｏｏｆの構成に基づいて補助データを生成する例を説明するための図である。 [Example of generation of auxiliary data]
An example in which the transmitting device generates auxiliary data based on the configuration of moof will be described below. FIG. 30 is a diagram for explaining an example in which the transmitting device generates auxiliary data based on the configuration of moof.

通常のＭＰ４では、図２０に示されるように、ムービーフラグメントに対してｍｏｏｆが作成される。ｍｏｏｆには、ムービーフラグメントに含まれるサンプルのＤＴＳやＰＴＳ、オフセットやサイズを示す情報が含まれている。 In normal MP4, a moof is created for a movie fragment, as shown in FIG. The moof contains information indicating the DTS, PTS, offset, and size of the sample included in the movie fragment.

ここでは、送信装置１５は、ＭＰＵを構成するサンプルデータの中で、一部のサンプルデータのみを用いてＭＰ４（ＭＰ４ファイル）を構成し、補助データを生成する。 Here, the transmitting device 15 configures MP4 (MP4 file) using only some sample data among the sample data that configures the MPU, and generates auxiliary data.

例えば、図３０の（ａ）に示されるように、送信装置１５は、ＭＰＵを構成するサンプル＃１－＃４のうち、サンプル＃１のみを用いてＭＰ４を生成し、そのうち、ｍｏｏｆ＋ｍｄａｔのヘッダを補助データとする。 For example, as shown in FIG. 30(a), the transmitting device 15 generates MP4 using only sample #1 among samples #1 to #4 configuring the MPU, and includes the header of moof+mdat. Use as auxiliary data.

次に、図３０の（ｂ）に示されるように、送信装置１５は、ＭＰＵを構成するサンプル＃１－＃４のうち、サンプル＃１及びサンプル＃２を用いてＭＰ４を生成し、そのうち、ｍｏｏｆ＋ｍｄａｔのヘッダを次の補助データとする。 Next, as shown in FIG. 30(b), the transmitting device 15 generates MP4 using sample #1 and sample #2 among samples #1 to #4 configuring the MPU, and among them, The header of moof+mdat is used as the next auxiliary data.

次に、図３０の（ｃ）に示されるように、送信装置１５は、ＭＰＵを構成するサンプル＃１－＃４のうち、サンプル＃１、サンプル＃２、及びサンプル＃３を用いてＭＰ４を生成し、そのうち、ｍｏｏｆ＋ｍｄａｔのヘッダを次の補助データとする。 Next, as shown in FIG. 30(c), the transmitting device 15 transmits MP4 using sample #1, sample #2, and sample #3 among samples #1 to #4 that constitute the MPU. The header of moof+mdat is used as the next auxiliary data.

次に、図３０の（ｄ）に示されるように、送信装置１５は、ＭＰＵを構成するサンプル＃１－＃４のうち、すべてのＭＰ４を生成し、そのうち、ｍｏｏｆ＋ｍｄａｔのヘッダがムービーフラグメントメタデータとなる。 Next, as shown in FIG. 30(d), the transmitting device 15 generates all MP4s out of samples #1 to #4 configuring the MPU, and among them, the header of moof+mdat is movie fragment metadata. becomes.

なお、ここでは、送信装置１５は、１サンプル毎に補助データを生成したが、Ｎサンプル毎に補助データを生成してもよい。Ｎの値は任意の数字であり、例えば、一つのＭＰＵを送信するときに補助データをＭ回送信する場合、Ｎ＝全サンプル／Ｍとされてもよい。 Although the transmitting device 15 generates auxiliary data for every sample here, it may also generate auxiliary data for every N samples. The value of N is an arbitrary number. For example, when transmitting auxiliary data M times when transmitting one MPU, N=total samples/M may be used.

なお、ｍｏｏｆにおけるサンプルのオフセットを示す情報は、後続のサンプル数のサンプルエントリ領域がＮＵＬＬ領域として確保された後のオフセット値であってもよい。 Note that the information indicating the sample offset in moof may be an offset value after the sample entry area for the number of subsequent samples is secured as a NULL area.

なお、ＭＦメタデータをフラグメントする構成となるように補助データが生成されてもよい。 Note that the auxiliary data may be generated in such a manner that the MF metadata is fragmented.

［補助データを用いた受信動作例］
図３０で説明したように生成された補助データの受信について説明する。図３１は、補助データの受信を説明するための図である。なお、図３１の（ａ）では、ＭＰＵを構成するサンプル数は３０であり、１０サンプル毎に補助データが生成され、送信されるものとする。 [Example of reception operation using auxiliary data]
The reception of auxiliary data generated as described in FIG. 30 will be explained. FIG. 31 is a diagram for explaining reception of auxiliary data. In FIG. 31(a), it is assumed that the number of samples forming the MPU is 30, and auxiliary data is generated and transmitted every 10 samples.

図３０の（ａ）において、補助データ＃１には、サンプル＃１－＃１０、補助データ＃２には、サンプル＃１－＃２０、ＭＦメタデータには、サンプル＃１－＃３０のサンプル情報がそれぞれ含まれる。 In (a) of FIG. 30, auxiliary data #1 contains samples #1 to #10, auxiliary data #2 contains samples #1 to #20, and MF metadata contains samples #1 to #30. Information is included for each.

なお、サンプル＃１－＃１０、サンプル＃１１－＃２０、及びサンプル＃２１－＃３０は、一つのＭＭＴペイロードに格納されているが、サンプル単位やＮＡＬ単位で格納されてもよいし、フラグメントやアグリゲーションした単位で格納されてもよい。 Although samples #1 to #10, samples #11 to #20, and samples #21 to #30 are stored in one MMT payload, they may be stored in sample units or NAL units, or in fragments. It may also be stored in an aggregated unit.

受信装置２０は、ＭＰＵメタ、サンプル、ＭＦメタ、及び補助データのパケットをそれぞれ受信する。 The receiving device 20 receives each packet of MPU meta, sample, MF meta, and auxiliary data.

受信装置２０は、サンプルデータを受信順に（後ろに）連結し、最新の補助データを受信した後に、これまでの補助データを更新する。また、受信装置２０は、最後に補助データをＭＦメタデータに置き換えることにより、完全なＭＰＵを構成できる。 The receiving device 20 concatenates the sample data in the order of reception (backwards), and updates the previous auxiliary data after receiving the latest auxiliary data. Further, the receiving device 20 can configure a complete MPU by finally replacing the auxiliary data with MF metadata.

受信装置２０は、補助データ＃１を受信した時点では、図３１の（ｂ）の上段のようにデータを連結し、ＭＰ４を構成する。これにより、受信装置２０は、ＭＰＵメタデータ、及び補助データ＃１の情報を用いてサンプル＃１－＃１０をパースすることができ、補助データに含まれるＰＴＳ、ＤＴＳ、オフセット、及びサイズの情報に基づいて復号を行うことができる。 When the receiving device 20 receives the auxiliary data #1, it concatenates the data as shown in the upper part of FIG. 31(b) to configure MP4. Thereby, the receiving device 20 can parse samples #1 to #10 using the MPU metadata and the information of the auxiliary data #1, and the information on the PTS, DTS, offset, and size included in the auxiliary data. Decoding can be performed based on

また、受信装置２０は、補助データ＃２を受信した時点では、図３１の（ｂ）の中段のようにデータを連結し、ＭＰ４を構成する。これにより、受信装置２０は、ＭＰＵメタデータ、及び補助データ＃２の情報を用いてサンプル＃１－＃２０をパースすることができ、補助データに含まれるＰＴＳ、ＤＴＳ、オフセット、サイズの情報に基づいて復号を行うことができる。 Furthermore, upon receiving the auxiliary data #2, the receiving device 20 concatenates the data as shown in the middle row of FIG. 31(b) to configure MP4. Thereby, the receiving device 20 can parse samples #1 to #20 using the MPU metadata and the information in the auxiliary data #2, and the information on the PTS, DTS, offset, and size included in the auxiliary data. decoding can be performed based on the

また、受信装置２０は、ＭＦメタデータを受信した時点では、図３１の（ｂ）の下段のようにデータを連結し、ＭＰ４を構成する。これにより、受信装置２０は、ＭＰＵメタデータ、及びＭＦメタデータを用いてサンプル＃１－＃３０をパースすることができ、ＭＦメタデータに含まれるＰＴＳ、ＤＴＳ、オフセット、及びサイズの情報に基づいて復号を行うことができる。 Furthermore, upon receiving the MF metadata, the receiving device 20 concatenates the data as shown in the lower part of FIG. 31(b) to configure MP4. Thereby, the receiving device 20 can parse samples #1 to #30 using the MPU metadata and MF metadata, and based on the PTS, DTS, offset, and size information included in the MF metadata. can be decrypted using

補助データが無い場合は、受信装置２０は、ＭＦメタデータの受信後にはじめてサンプルの情報を取得できるため、ＭＦメタデータの受信後に復号を開始する必要があった。しかしながら、送信装置１５が補助データを生成し、送信することにより、受信装置２０は、ＭＦメタデータの受信を待たずに、補助データを用いてサンプルの情報を取得できるため、復号開始時間を早めることができる。さらに、送信装置１５が図３０を用いて説明したｍｏｏｆに基づく補助データを生成することにより、受信装置２０は、従来のＭＰ４のパーサーをそのまま利用し、パースすることが可能である。 If there is no auxiliary data, the receiving device 20 can acquire sample information only after receiving the MF metadata, so it was necessary to start decoding after receiving the MF metadata. However, since the transmitting device 15 generates and transmits the auxiliary data, the receiving device 20 can acquire sample information using the auxiliary data without waiting for the reception of MF metadata, thereby speeding up the decoding start time. be able to. Further, since the transmitting device 15 generates the auxiliary data based on the moof described using FIG. 30, the receiving device 20 can use the conventional MP4 parser as is to perform parsing.

また、新たに生成する補助データやＭＦメタデータは、過去に送信した補助データと重複するサンプルの情報を含む。このため、パケットロスなどにより過去の補助データを取得できなかった場合でも、新たに取得する補助データやＭＦメタデータを用いることで、ＭＰ４を再構成し、サンプルの情報（ＰＴＳ、ＤＴＳ、サイズ、及びオフセット）を取得することが可能である。 Furthermore, newly generated auxiliary data and MF metadata include information on samples that overlap with previously transmitted auxiliary data. Therefore, even if past auxiliary data cannot be acquired due to packet loss, etc., MP4 can be reconstructed using newly acquired auxiliary data and MF metadata, and sample information (PTS, DTS, size, and offset).

なお、補助データは、必ずしも過去のサンプルデータの情報を含む必要はない。たとえば、補助データ＃１は、サンプルデータ＃１－＃１０に対応し、補助データ＃２は、サンプルデータ＃１１－＃２０に対応してもよい。例えば、図３１の（ｃ）に示されるように、送信装置１５は、完全なＭＦメタデータをデータユニットとして、データユニットをフラグメントした単位を補助データとして順次送出してもよい。 Note that the auxiliary data does not necessarily need to include information on past sample data. For example, auxiliary data #1 may correspond to sample data #1-#10, and auxiliary data #2 may correspond to sample data #11-#20. For example, as shown in FIG. 31C, the transmitting device 15 may sequentially transmit complete MF metadata as a data unit and fragments of the data unit as auxiliary data.

また、送信装置１５は、パケットロス対策のために、補助データを繰り返し伝送してもよいし、ＭＦメタデータを繰り返し伝送してもよい。 Furthermore, the transmitting device 15 may repeatedly transmit auxiliary data or MF metadata in order to prevent packet loss.

なお、補助データが格納されるＭＭＴパケット及びＭＭＴペイロードには、ＭＰＵメタデータ、ＭＦメタデータ、及びサンプルデータと同様に、ＭＰＵシーケンス番号、及びアセットＩＤが含まれる。 Note that the MMT packet and MMT payload in which auxiliary data are stored include an MPU sequence number and an asset ID, similar to the MPU metadata, MF metadata, and sample data.

以上のような補助データを用いた受信動作について図３２のフローチャートを用いて説明する。図３２は、補助データを用いた受信動作のフローチャートである。 The reception operation using the above-mentioned auxiliary data will be explained using the flowchart of FIG. 32. FIG. 32 is a flowchart of a receiving operation using auxiliary data.

まず、受信装置２０は、ＭＭＴパケットを受信し、パケットヘッダやペイロードヘッダを解析する（Ｓ５０１）。次に、受信装置２０は、フラグメントタイプが補助データか、ＭＦメタデータかを解析し（Ｓ５０２）、フラグメントタイプが補助データである場合には、過去の補助データを上書きして更新する（Ｓ５０３）。このとき、同一ＭＰＵの過去の補助データがない場合には、受信装置２０は、受信した補助データをそのまま新規の補助データとする。そして、受信装置２０は、ＭＰＵメタデータ、補助データ、及びサンプルデータに基づき、サンプルを取得し、復号を行う（Ｓ５０７）。 First, the receiving device 20 receives an MMT packet and analyzes the packet header and payload header (S501). Next, the receiving device 20 analyzes whether the fragment type is auxiliary data or MF metadata (S502), and if the fragment type is auxiliary data, it updates the previous auxiliary data by overwriting it (S503). . At this time, if there is no past auxiliary data of the same MPU, the receiving device 20 uses the received auxiliary data as new auxiliary data. Then, the receiving device 20 acquires a sample and decodes it based on the MPU metadata, auxiliary data, and sample data (S507).

一方、フラグメントタイプがＭＦメタデータである場合には、受信装置２０は、ステップＳ５０５において、過去の補助データをＭＦメタデータで上書きする（Ｓ５０５）。そして、受信装置２０は、ＭＰＵメタデータ、ＭＦメタデータ、及びサンプルデータに基づきサンプルを完全なＭＰＵの形で取得し、復号を行う（Ｓ５０６）。 On the other hand, if the fragment type is MF metadata, the receiving device 20 overwrites the past auxiliary data with MF metadata in step S505 (S505). Then, the receiving device 20 acquires a sample in the form of a complete MPU based on the MPU metadata, MF metadata, and sample data, and decodes it (S506).

なお、図３２において図示されないが、ステップＳ５０２において、受信装置２０は、フラグメントタイプがＭＰＵメタデータである場合には、データをバッファに格納し、サンプルデータである場合には、サンプル毎に後ろに連結したデータをバッファに格納する。 Although not shown in FIG. 32, in step S502, the receiving device 20 stores the data in a buffer when the fragment type is MPU metadata, and stores the data in the buffer after each sample when the fragment type is sample data. Store the concatenated data in a buffer.

パケットロスにより補助データが取得できなかった場合は、受信装置２０は、最新の補助データにより上書きを行うか、あるいは過去の補助データを用いることによりサンプルを復号することができる。 If the auxiliary data cannot be acquired due to packet loss, the receiving device 20 can decode the sample by overwriting with the latest auxiliary data or by using past auxiliary data.

なお、補助データの送出周期及び送出回数はあらかじめ定められた値であってもよい。送出周期や回数（カウント、カウンドダウン）の情報は、データと一緒に送信されてもよい。例えば、データユニットヘッダに、送出周期、送出回数、及びｉｎｉｔｉａｌ＿ｃｐｂ＿ｒｅｍｏｖａｌ＿ｄｅｌａｙなどのタイムスタンプが格納されてもよい。 Note that the sending cycle and the number of sending times of the auxiliary data may be predetermined values. Information on the transmission cycle and number of times (count, countdown) may be transmitted together with the data. For example, a transmission cycle, the number of transmissions, and a timestamp such as initial_cpb_removal_delay may be stored in the data unit header.

ＭＰＵの初めのサンプルの情報を含む補助データをｉｎｉｔｉａｌ＿ｃｐｂ＿ｒｅｍｏｖａｌ＿ｄｅｌａｙより先に１回以上送信することにより、ＣＰＢバッファモデルに従うことが可能となる。このとき、ＭＰＵタイムスタンプ記述子には、ｐｉｃｔｕｒｅｔｉｍｉｎｇＳＥＩに基づいた値を格納される。 By transmitting auxiliary data containing information of the first sample of the MPU one or more times before initial_cpb_removal_delay, it is possible to follow the CPB buffer model. At this time, a value based on the picture timing SEI is stored in the MPU timestamp descriptor.

なお、このような補助データが使用される受信動作における伝送方式は、ＭＭＴ方式に限定されず、ＭＰＥＧ－ＤＡＳＨなど、ＩＳＯＢＭＦＦファイルフォーマットで構成されるパケットをストリーミング伝送する場合などに適用可能である。 Note that the transmission method in the reception operation using such auxiliary data is not limited to the MMT method, and can be applied to streaming transmission of packets configured in the ISOBMFF file format, such as MPEG-DASH.

［１つのＭＰＵが複数のムービーフラグメントで構成される場合の送信方法］
上記図１９以降の説明においては、１つのＭＰＵが、１つのムービーフラグメントで構成されたが、ここでは、１つのＭＰＵが複数のムービーフラグメントで構成される場合について説明する。図３３は、複数のムービーフラグメントで構成されるＭＰＵの構成を示す図である。 [Transmission method when one MPU is composed of multiple movie fragments]
In the explanation from FIG. 19 onward, one MPU is composed of one movie fragment, but here, a case will be described where one MPU is composed of a plurality of movie fragments. FIG. 33 is a diagram showing the configuration of an MPU composed of a plurality of movie fragments.

図３３では、１つのＭＰＵに格納されるサンプル（＃１－＃６）は、２つのムービーフラグメントに分けて格納される。第１のムービーフラグメントは、サンプル＃１－＃３に基づいて生成され、対応するｍｏｏｆボックスが生成される。第２のムービーフラグメントは、サンプル＃４－＃６に基づいて生成され、対応するｍｏｏｆボックスが生成される。 In FIG. 33, samples (#1-#6) stored in one MPU are stored separately into two movie fragments. A first movie fragment is generated based on samples #1-#3 and a corresponding moof box is generated. A second movie fragment is generated based on samples #4-#6 and a corresponding moof box is generated.

第１のムービーフラグメントにおけるｍｏｏｆボックス及びｍｄａｔボックスのヘッダは、ムービーフラグメントメタデータ＃１としてＭＭＴペイロード及びＭＭＴパケットに格納される。一方、第２のムービーフラグメントにおけるｍｏｏｆボックス及びｍｄａｔボックスのヘッダは、ムービーフラグメントメタデータ＃２としてＭＭＴペイロード及びＭＭＴパケットに格納される。なお、図３３において、ムービーフラグメントメタデータが格納されたＭＭＴペイロードは、ハッチングされている。 The moof box and mdat box headers in the first movie fragment are stored in the MMT payload and MMT packet as movie fragment metadata #1. On the other hand, the headers of the moof box and mdat box in the second movie fragment are stored in the MMT payload and MMT packet as movie fragment metadata #2. Note that in FIG. 33, the MMT payload in which movie fragment metadata is stored is hatched.

なお、ＭＰＵを構成するサンプル数や、ムービーフラグメントを構成するサンプル数は任意である。例えば、ＭＰＵを構成するサンプル数をＧＯＰ単位のサンプル数とし、ＧＯＰ単位の２分の１のサンプル数をムービーフラグメントとして、２つのムービーフラグメントが構成されてもよい。 Note that the number of samples configuring an MPU and the number of samples configuring a movie fragment are arbitrary. For example, two movie fragments may be configured, with the number of samples configuring the MPU being the number of samples per GOP, and the number of samples being half the number of samples per GOP being the movie fragment.

なお、ここでは、一つのＭＰＵに２つのムービーフラグメント（ｍｏｏｆボックス及びｍｄａｔボックス）を含む例を示すが、１つのＭＰＵに含むムービーフラグメントは２つでなくとも、３つ以上であってもよい。また、ムービーフラグメントに格納するサンプルは等分したサンプル数でなく、任意のサンプル数に分割してもよい。 Note that although an example in which one MPU includes two movie fragments (moof box and mdat box) is shown here, the number of movie fragments included in one MPU does not have to be two, but may be three or more. Furthermore, the samples stored in a movie fragment may not be divided into equal numbers of samples, but may be divided into an arbitrary number of samples.

なお、図３３では、ＭＰＵメタデータ単位及びＭＦメタデータ単位がそれぞれデータユニットとしてＭＭＴペイロードに格納されている。しかしながら、送信装置１５は、ｆｔｙｐ、ｍｍｐｕ、ｍｏｏｖ、及びｍｏｏｆなどの単位をデータユニットとして、データユニット単位でＭＭＴペイロードに格納してもよいし、データユニットをフラグメントした単位でＭＭＴペイロードに格納してもよい。また、送信装置１５は、データユニットをアグリゲーションした単位でＭＭＴペイロードに格納してもよい。 Note that in FIG. 33, the MPU metadata unit and the MF metadata unit are each stored in the MMT payload as a data unit. However, the transmitting device 15 may store units such as ftyp, mmpu, moov, and moof as data units in the MMT payload in data unit units, or may store data units in fragmented units in the MMT payload. Good too. Further, the transmitting device 15 may store data units in the MMT payload in units of aggregated data units.

また、図３３では、サンプルは、サンプル単位でＭＭＴペイロードに格納されている。しかしながら、送信装置１５は、サンプル単位でなくともＮＡＬユニット単位または複数のＮＡＬユニットをまとめた単位でデータユニットを構成し、データユニット単位でＭＭＴペイロードに格納してもよい。また、送信装置１５は、データユニットをフラグメントした単位でＭＭＴペイロードに格納してもよいし、データユニットをアグリゲーションした単位でＭＭＴペイロードに格納してもよい。 Further, in FIG. 33, samples are stored in the MMT payload in units of samples. However, the transmitting device 15 may configure a data unit not only in units of samples but also in units of NAL units or units of a plurality of NAL units, and store the data units in the MMT payload in units of data units. Further, the transmitting device 15 may store the fragmented data unit in the MMT payload, or may store the data unit in the MMT payload in the aggregated unit.

なお、図３３では、ｍｏｏｆ＃１、ｍｄａｔ＃１、ｍｏｏｆ＃２、ｍｄａｔ＃２の順にＭＰＵが構成され、ｍｏｏｆ＃１には、対応するｍｄａｔ＃１が後ろについているものとしてｏｆｆｓｅｔが付与されている。しかしながら、ｍｄａｔ＃１がｍｏｏｆ＃１より前についているものしてｏｆｆｓｅｔが付与されてもよい。ただし、この場合、ｍｏｏｆ＋ｍｄａｔの形でムービーフラグメントメタデータを生成することはできず、ｍｏｏｆ及びｍｄａｔのヘッダはそれぞれ別々に伝送される。 In addition, in FIG. 33, the MPU is configured in the order of moof#1, mdat#1, moof#2, and mdat#2, and an offset is given to moof#1 assuming that the corresponding mdat#1 follows. There is. However, if mdat #1 comes before moof #1, offset may be assigned. However, in this case, movie fragment metadata cannot be generated in the form of moof+mdat, and the moof and mdat headers are transmitted separately.

次に、図３３で説明した構成のＭＰＵが伝送される場合のＭＭＴパケットの送信順序について説明する。図３４は、ＭＭＴパケットの送信順序を説明するための図である。 Next, the transmission order of MMT packets when the MPU having the configuration described in FIG. 33 is transmitted will be described. FIG. 34 is a diagram for explaining the transmission order of MMT packets.

図３４の（ａ）は、図３３に示されるＭＰＵの構成順序でＭＭＴパケットを送信する場合の送信順序を示している。図３４の（ａ）は、具体的には、ＭＰＵメタ、ＭＦメタ＃１、メディアデータ＃１（サンプル＃１－＃３）、ＭＦメタ＃２、メディアデータ＃２（サンプル＃４－＃６）の順に送信する例を示す。 FIG. 34(a) shows the transmission order when MMT packets are transmitted in the MPU configuration order shown in FIG. 33. Specifically, (a) in FIG. 34 shows MPU meta, MF meta #1, media data #1 (samples #1-#3), MF meta #2, media data #2 (samples #4-#6). ) is shown below.

図３４の（ｂ）は、ＭＰＵメタ、メディアデータ＃１（サンプル＃１－＃３）、ＭＦメタ＃１、メディアデータ＃２（サンプル＃４－＃６）、ＭＦメタ＃２の順に送信する例を示す。 In (b) of FIG. 34, MPU meta, media data #1 (samples #1 to #3), MF meta #1, media data #2 (samples #4 to #6), and MF meta #2 are transmitted in this order. Give an example.

図３４の（ｃ）は、メディアデータ＃１（サンプル＃１－＃３）、ＭＰＵメタ、ＭＦメタ＃１、メディアデータ＃２（サンプル＃４－＃６）、ＭＦメタ＃２の順に送信する例を示す。 In (c) of FIG. 34, media data #1 (samples #1 to #3), MPU meta, MF meta #1, media data #2 (samples #4 to #6), and MF meta #2 are transmitted in this order. Give an example.

ＭＦメタ＃１は、サンプル＃１－＃３を用いて生成され、ＭＦメタ＃２はサンプル＃４－＃６を用いて生成される。このため、図３４の（ａ）の送信方法が用いられる場合には、サンプルデータの送信にはカプセル化による遅延が発生する。 MF meta #1 is generated using samples #1 to #3, and MF meta #2 is generated using samples #4 to #6. Therefore, when the transmission method shown in FIG. 34A is used, a delay occurs in the transmission of sample data due to encapsulation.

これに対し、図３４の（ｂ）及び図３４の（ｃ）の送信方法が用いられる場合には、ＭＦメタを生成するのを待たずにサンプルを送信可能であるため、カプセル化による遅延は発生せず、Ｅｎｄ－ｔｏ－Ｅｎｄ遅延を低減できる。 On the other hand, when the transmission methods shown in FIGS. 34(b) and 34(c) are used, samples can be transmitted without waiting for MF meta to be generated, so the delay due to encapsulation is reduced. This does not occur, and the end-to-end delay can be reduced.

また、図３４の（ａ）送信順序においても、１つのＭＰＵが複数のムービーフラグメントに分割され、ＭＦメタに格納されるサンプル数が図１９の場合に対して小さくなっているため、図１９の場合よりもカプセル化による遅延量を小さくすることができる。 Also, in the transmission order (a) in FIG. 34, one MPU is divided into multiple movie fragments, and the number of samples stored in the MF meta is smaller than in the case of FIG. The amount of delay due to encapsulation can be made smaller than in the case of encapsulation.

なお、ここで示した方法以外に、例えば、送信装置１５は、ＭＦメタ＃１及びＭＦメタ＃２を連結し、ＭＰＵの最後にまとめて送信してもよい。この場合、異なるムービーフラグメントのＭＦメタがアグリゲーションされて、一つのＭＭＴペイロードに格納されてもよい。また、異なるＭＰＵのＭＦメタがまとめてアグリゲーションされてＭＭＴペイロードに格納されてもよい。 Note that, in addition to the method shown here, for example, the transmitting device 15 may connect MF meta #1 and MF meta #2 and transmit them together at the end of the MPU. In this case, MF metas of different movie fragments may be aggregated and stored in one MMT payload. Further, MF metas of different MPUs may be aggregated together and stored in the MMT payload.

［１つのＭＰＵが複数のムービーフラグメントで構成される場合の受信方法］
ここでは、図３４の（ｂ）で説明した送信順序で送信されたＭＭＴパケットを受信して復号する受信装置２０の動作例について説明する。図３５及び図３６は、このような動作例を説明するための図である。 [Reception method when one MPU is composed of multiple movie fragments]
Here, an example of the operation of the receiving device 20 that receives and decodes MMT packets transmitted in the transmission order described in FIG. 34(b) will be described. 35 and 36 are diagrams for explaining such an example of operation.

受信装置２０は、図３５に示されるような送信順序で送信された、ＭＰＵメタ、サンプル、及びＭＦメタを含むＭＭＴパケットをそれぞれ受信する。サンプルデータは、受信順に連結される。 The receiving device 20 receives MMT packets including an MPU meta, a sample, and an MF meta, which are transmitted in the transmission order shown in FIG. 35. Sample data is concatenated in the order received.

受信装置２０は、ＭＦメタ＃１を受信した時刻であるＴ１に、図３６の（１）に示されるようにデータを連結し、ＭＰ４を構成する。これにより、受信装置２０は、ＭＰＵメタデータ、及びＭＦメタ＃１の情報に基づいてサンプル＃１－＃３を取得することができ、ＭＦメタに含まれるＰＴＳ、ＤＴＳ、オフセット、及びサイズの情報に基づいて復号を行うことができる。 The receiving device 20 concatenates data as shown in (1) of FIG. 36 at T1, which is the time when MF meta #1 is received, and forms MP4. Thereby, the receiving device 20 can acquire samples #1 to #3 based on the MPU metadata and the information in the MF meta #1, and the information on the PTS, DTS, offset, and size included in the MF meta. Decoding can be performed based on

また、受信装置２０は、ＭＦメタ＃２を受信した時刻であるＴ２に、図３６の（２）に示されるようにデータを連結し、ＭＰ４を構成する。これにより、受信装置２０は、ＭＰＵメタデータ、及びＭＦメタ＃２の情報を基づいてサンプル＃４－＃６を取得することができ、ＭＦメタのＰＴＳ、ＤＴＳ、オフセット、及びサイズの情報に基づいて復号を行うことができる。また、受信装置２０は、図３６の（３）に示されるようにデータを連結し、ＭＰ４を構成することでＭＦメタ＃１及びＭＦメタ＃２の情報に基づいてサンプル＃１－＃６を取得してもよい。 Further, the receiving device 20 concatenates data at T2, which is the time when MF meta #2 is received, as shown in (2) of FIG. 36, and configures MP4. Thereby, the receiving device 20 can acquire samples #4 to #6 based on the MPU metadata and the information on the MF meta #2, and can obtain samples #4 to #6 based on the information on the PTS, DTS, offset, and size of the MF meta. can be decrypted using In addition, the receiving device 20 concatenates the data as shown in (3) of FIG. 36 to configure MP4, thereby generating samples #1 to #6 based on the information of MF meta #1 and MF meta #2. You may obtain it.

１つのＭＰＵが複数のムービーフラグメントに分割することで、ＭＰＵの中で初めのＭＦメタを取得するまでの時間が短縮されるため、復号開始時間を早めることができる。また、復号前のサンプルを蓄積するためのバッファサイズを小さくすることができる。 By dividing one MPU into a plurality of movie fragments, the time it takes for the MPU to acquire the first MF meta is shortened, so the decoding start time can be accelerated. Furthermore, the buffer size for storing samples before decoding can be reduced.

なお、送信装置１５は、ムービーフラグメントにおける初めのサンプルを送信（或いは受信）してからムービーフラグメントに対応するＭＦメタを送信（或いは受信）するまでの時間が、エンコーダで指定されるｉｎｉｔｉａｌ＿ｃｐｂ＿ｒｅｍｏｖａｌ＿ｄｅｌａｙより短い時間となるようにムービーフラグメントの分割単位を設定してもよい。このように設定することにより、受信バッファはｃｐｂバッファに従うことができ、低遅延の復号を実現できる。この場合、ＰＴＳ及びＤＴＳにはｉｎｉｔｉａｌ＿ｃｐｂ＿ｒｅｍｏｖａｌ＿ｄｅｌａｙに基づいた絶対時刻を用いることができる。 Note that the transmitting device 15 determines that the time from transmitting (or receiving) the first sample in the movie fragment to transmitting (or receiving) the MF meta corresponding to the movie fragment is shorter than the initial_cpb_removal_delay specified by the encoder. The division unit of the movie fragment may be set so that By setting in this way, the reception buffer can follow the cpb buffer, and low-delay decoding can be achieved. In this case, absolute time based on initial_cpb_removal_delay can be used for PTS and DTS.

また、送信装置１５は、ムービーフラグメントの分割を等間隔、或いは、後続のムービーフラグメントを前のムービーフラグメントより短い間隔で分割してもよい。これにより、受信装置２０は、サンプルの復号前に必ず当該サンプルの情報を含むＭＦメタを受信することができ、連続した復号が可能となる。 Further, the transmitting device 15 may divide the movie fragments at equal intervals, or divide subsequent movie fragments at shorter intervals than the previous movie fragment. Thereby, the receiving device 20 can always receive the MF meta including information about the sample before decoding the sample, and continuous decoding becomes possible.

ＰＴＳ、及びＤＴＳの絶対時刻の算出方法は、下記の２通りの方法を用いることができる。 The following two methods can be used to calculate the absolute time of PTS and DTS.

（１）ＰＴＳ及びＤＴＳの絶対時刻は、ＭＦメタ＃１やＭＦメタ＃２の受信時刻（Ｔ１或いはＴ２）、及びＭＦメタに含まれるＰＴＳ及びＤＴＳの相対時刻に基づいて決定される。 (1) The absolute time of PTS and DTS is determined based on the reception time (T1 or T2) of MF meta #1 and MF meta #2 and the relative time of PTS and DTS included in the MF meta.

（２）ＰＴＳ及びＤＴＳの絶対時刻は、ＭＰＵタイムスタンプ記述子等、送信側からシグナリングされる絶対時刻、及びＭＦメタに含まれるＰＴＳ及びＤＴＳの相対時刻に基づいて決定される。 (2) The absolute time of the PTS and DTS is determined based on the absolute time signaled from the transmitting side, such as the MPU time stamp descriptor, and the relative time of the PTS and DTS included in the MF meta.

また、（２－Ａ）送信装置１５がシグナリングする絶対時刻は、エンコーダから指定されるｉｎｉｔｉａｌ＿ｃｐｂ＿ｒｅｍｏｖａｌ＿ｄｅｌａｙに基づいて算出された絶対時刻であってもよい。 Furthermore, (2-A) the absolute time signaled by the transmitting device 15 may be an absolute time calculated based on initial_cpb_removal_delay specified by the encoder.

また、（２－Ｂ）送信装置１５がシグナリングする絶対時刻は、ＭＦメタの受信時刻の予測値に基づいて算出された絶対時刻であってもよい。 Furthermore, (2-B) the absolute time signaled by the transmitting device 15 may be an absolute time calculated based on a predicted value of the MF meta reception time.

なお、ＭＦメタ＃１及びＭＦメタ＃２は、繰り返し伝送されてもよい。ＭＦメタ＃１及びＭＦメタ＃２が繰り返し伝送されることにより、受信装置２０は、ＭＦメタをパケットロス等により取得できなかった場合でも、もう一度取得することができる。 Note that MF meta #1 and MF meta #2 may be transmitted repeatedly. By repeatedly transmitting MF meta #1 and MF meta #2, the receiving device 20 can obtain the MF meta again even if it is unable to obtain the MF meta due to packet loss or the like.

ムービーフラグメントを構成するサンプルを含むＭＦＵのペイロードヘッダには、ムービーフラグメントの順番を示す識別子を格納することができる。一方、ムービーフラグメントを構成するＭＦメタの順番を示す識別子はＭＭＴペイロードには含まれない。このため、受信装置２０は、ｐａｃｋｅｔ＿ｓｅｑｕｅｎｃｅ＿ｎｕｍｂｅｒでＭＦメタの順番を識別する。或いは、送信装置１５は、ＭＦメタが何番目のムービーフラグメントに属するかを示す識別子を、制御情報（メッセージ、テーブル、記述子）、ＭＭＴヘッダ、ＭＭＴペイロードヘッダ、またはデータユニットヘッダに格納してシグナリングしてもよい。 An identifier indicating the order of the movie fragments can be stored in the payload header of the MFU containing the samples making up the movie fragments. On the other hand, the MMT payload does not include an identifier indicating the order of MF meta that constitutes a movie fragment. Therefore, the receiving device 20 identifies the order of the MF meta using packet_sequence_number. Alternatively, the transmitting device 15 stores an identifier indicating which movie fragment the MF meta belongs to in control information (message, table, descriptor), an MMT header, an MMT payload header, or a data unit header, and sends the signal. You may.

なお、送信装置１５は、ＭＰＵメタ、ＭＦメタ、及びサンプルを、あらかじめ定められた所定の送信順序で送信し、受信装置２０は、あらかじめ定められた所定の送信順序に基づいて受信処理を実施してもよい。また、送信装置１５は、送信順序をシグナリングし、シグナリング情報に基づいて受信装置２０が受信処理を選択（判断）してもよい。 Note that the transmitting device 15 transmits the MPU meta, the MF meta, and the sample in a predetermined transmission order, and the receiving device 20 performs reception processing based on the predetermined transmission order. It's okay. Alternatively, the transmitting device 15 may signal the transmission order, and the receiving device 20 may select (judge) the receiving process based on the signaling information.

上記のような受信方法について、図３７を用いて説明する。図３７は、図３５及び図３６で説明した受信方法の動作のフローチャートである。 The above reception method will be explained using FIG. 37. FIG. 37 is a flowchart of the operation of the reception method described in FIGS. 35 and 36.

まず、受信装置２０は、ＭＭＴペイロードに示されるフラグメントタイプにより、ペイロードに含まれるデータが、ＭＰＵメタデータ、ＭＦメタデータであるか、サンプルデータ（ＭＦＵ）であるかを判別（識別）する（Ｓ６０１、Ｓ６０２）。データがサンプルデータである場合には、受信装置２０は、サンプルをバッファリングし、当該サンプルに対応するＭＦメタデータの受信、及び復号開始を待つ（Ｓ６０３）。 First, the receiving device 20 determines (identifies) whether the data included in the payload is MPU metadata, MF metadata, or sample data (MFU) based on the fragment type indicated in the MMT payload (S601 , S602). If the data is sample data, the receiving device 20 buffers the sample and waits for reception of MF metadata corresponding to the sample and start of decoding (S603).

一方、ステップＳ６０２において、データがＭＦメタデータである場合には、受信装置２０は、ＭＦメタデータよりサンプルの情報（ＰＴＳ、ＤＴＳ、位置情報、及びサイズ）を取得し、取得したサンプルの情報に基づいてサンプルを取得し、ＰＴＳ及びＤＴＳに基づいてサンプルを復号、提示する（Ｓ６０４）。 On the other hand, in step S602, if the data is MF metadata, the receiving device 20 acquires sample information (PTS, DTS, position information, and size) from the MF metadata, and applies the acquired sample information to the sample information. A sample is obtained based on the PTS and the DTS, and the sample is decoded and presented based on the PTS and DTS (S604).

なお、図示されないが、データがＭＰＵメタデータである場合、ＭＰＵメタデータには、復号に必要な初期化情報が含まれている。このため、受信装置２０はこれを蓄積し、ステップＳ６０４においてサンプルデータの復号に用いる。 Although not shown, if the data is MPU metadata, the MPU metadata includes initialization information necessary for decoding. Therefore, the receiving device 20 accumulates this and uses it to decode the sample data in step S604.

なお、受信装置２０は、受信したＭＰＵのデータ（ＭＰＵメタデータ、ＭＦメタデータ、及びサンプルデータ）を蓄積装置に蓄積する場合には、図１９または図３３で説明した、ＭＰＵの構成に並び替えた後に、蓄積する。 Note that when the receiving device 20 stores the received MPU data (MPU metadata, MF metadata, and sample data) in the storage device, it rearranges the received MPU data into the MPU configuration described in FIG. 19 or FIG. 33. After that, it accumulates.

なお、送信側においては、ＭＭＴパケットには、同一のパケットＩＤを持つパケットに対して、パケットシーケンス番号を付与する。このとき、ＭＰＵメタデータ、ＭＦメタデータ、サンプルデータを含むＭＭＴパケットが送信順序に並び替えられた後にパケットシーケンス番号が付与されてもよいし、並び替える前の順序でパケットシーケンス番号が付与されてもよい。 Note that on the transmitting side, packet sequence numbers are assigned to MMT packets with respect to packets having the same packet ID. At this time, packet sequence numbers may be assigned after the MMT packets including MPU metadata, MF metadata, and sample data are rearranged in the transmission order, or packet sequence numbers may be assigned in the order before rearranging. Good too.

並び替える前の順序でパケットシーケンス番号が付与される場合には、受信装置２０において、パケットシーケンス番号に基づいて、データをＭＰＵの構成順序に並び替えることができ、蓄積が容易となる。 When packet sequence numbers are assigned in the order before rearrangement, the receiving device 20 can rearrange the data in the configuration order of the MPU based on the packet sequence numbers, making storage easier.

［アクセスユニットの先頭及びスライスセグメントの先頭を検出する方法］
ＭＭＴパケットヘッダ、及びＭＭＴペイロードヘッダの情報に基づき、アクセスユニットの先頭やスライスセグメントの先頭を検出する方法について説明する。 [Method of detecting the beginning of an access unit and the beginning of a slice segment]
A method for detecting the beginning of an access unit or the beginning of a slice segment based on the information of the MMT packet header and the MMT payload header will be described.

ここでは、非ＶＣＬＮＡＬユニット（アクセスユニットデリミタ、ＶＰＳ、ＳＰＳ、ＰＰＳ、及びＳＥＩなど）を、まとめてデータユニットとしてＭＭＴペイロードに格納する場合、及び、非ＶＣＬＮＡＬユニットをそれぞれデータユニットとし、データユニットをアグリゲーションして１つのＭＭＴペイロードに格納する場合の２つの例を示す。 Here, when non-VCL NAL units (access unit delimiter, VPS, SPS, PPS, SEI, etc.) are collectively stored in the MMT payload as a data unit, and when each non-VCL NAL unit is stored as a data unit, Two examples of aggregating and storing in one MMT payload are shown below.

図３８は、非ＶＣＬＮＡＬユニットを、個別にデータユニットとし、アグリゲーションする場合を示す図である。 FIG. 38 is a diagram showing a case where non-VCL NAL units are individually treated as data units and aggregated.

図３８の場合、アクセスユニットの先頭は、ｆｒａｇｍｅｎｔ＿ｔｙｐｅ値がＭＦＵであるＭＭＴパケットであり、かつ、ａｇｇｒｅｇａｔｉｏｎ＿ｆｌａｇ値が１であり、かつｏｆｆｓｅｔ値が０であるデータユニットを含むＭＭＴペイロードの先頭データである。このとき、Ｆｒａｇｍｅｎｔａｔｉｏｎ＿ｉｎｄｉｃａｔｏｒ値は０である。 In the case of FIG. 38, the head of the access unit is an MMT packet whose fragment_type value is MFU, and is the head data of an MMT payload that includes a data unit whose aggregation_flag value is 1 and whose offset value is 0. At this time, the Fragmentation_indicator value is 0.

また、図３８の場合、スライスセグメントの先頭は、ｆｒａｇｍｅｎｔ＿ｔｙｐｅ値がＭＦＵであるＭＭＴパケットであり、かつａｇｇｒｅｇａｔｉｏｎ＿ｆｌａｇ値が０、ｆｒａｇｍｅｎｔａｔｉｏｎ＿ｉｎｄｉｃａｔｏｒ値が００或いは０１であるＭＭＴペイロードの先頭データである。 In the case of FIG. 38, the beginning of the slice segment is an MMT packet whose fragment_type value is MFU, and is the beginning data of an MMT payload whose aggregation_flag value is 0 and fragmentation_indicator value is 00 or 01.

図３９は、非ＶＣＬＮＡＬユニットを、まとめてデータユニットとする場合を示す図である。なお、パケットヘッダのフィールド値は、図１７（または図１８）で示した通りである。 FIG. 39 is a diagram showing a case in which non-VCL NAL units are combined into a data unit. Note that the field values of the packet header are as shown in FIG. 17 (or FIG. 18).

図３９の場合、アクセスユニットの先頭は、Ｏｆｆｓｅｔ値が０であるパケットにおけるペイロードの先頭データが、アクセスユニットの先頭となる。 In the case of FIG. 39, the head of the access unit is the head data of the payload in the packet whose Offset value is 0.

また、図３９の場合、スライスセグメントの先頭は、Ｏｆｆｓｅｔ値が０とは異なる値であり、ｆｒａｇｍｅｎｔａｔｉｏｎｉｎｄｉｃａｔｏｒ値が００或いは０１であるパケットのペイロードの先頭データが、スライスセグメントの先頭となる。 Further, in the case of FIG. 39, the beginning of the slice segment has an Offset value different from 0, and the beginning data of the payload of the packet whose fragmentation indicator value is 00 or 01 becomes the beginning of the slice segment.

［パケットロスが発生した場合の受信処理］
通常、パケットロスが発生する環境において、ＭＰ４形式のデータを伝送する場合、受信装置２０は、ＡＬＦＥＣ（ＡｐｐｌｉｃａｔｉｏｎＬａｙｅｒＦＥＣ）や、パケット再送制御等によりパケットを復元する。 [Reception processing when packet loss occurs]
Normally, when transmitting MP4 format data in an environment where packet loss occurs, the receiving device 20 restores the packet using ALFEC (Application Layer FEC), packet retransmission control, or the like.

しかし、放送のようなストリーミングにおいてＡＬ－ＦＥＣを用いられない場合にパケットロスが発生した場合には、パケットを復元できない。 However, if AL-FEC is not used in streaming such as broadcasting and packet loss occurs, the packets cannot be restored.

受信装置２０は、パケットロスによりデータが失われた後、再び映像や音声の復号を再開させる必要がある。そのためには、受信装置２０は、アクセスユニットやＮＡＬユニットの先頭を検出し、アクセスユニットやＮＡＬユニットの先頭から復号を開始する必要がある。 After data is lost due to packet loss, the receiving device 20 needs to restart decoding of video and audio. To do this, the receiving device 20 needs to detect the beginning of the access unit or NAL unit and start decoding from the beginning of the access unit or NAL unit.

しかし、ＭＰ４形式のＮＡＬユニットの先頭には、スタートコードがついていないため、受信装置２０は、ストリームを解析しても、アクセスユニットやＮＡＬユニットの先頭を検出できない。 However, since a start code is not attached to the beginning of a NAL unit in MP4 format, the receiving device 20 cannot detect the beginning of an access unit or a NAL unit even if it analyzes the stream.

図４０は、パケットロスが発生した場合の受信装置２０の動作のフローチャートである。 FIG. 40 is a flowchart of the operation of the receiving device 20 when a packet loss occurs.

受信装置２０は、ＭＭＴパケットやＭＭＴペイロードのヘッダにおけるＰａｃｋｅｔｓｅｑｕｅｎｃｅｎｕｍｂｅｒや、ｐａｃｋｅｔｃｏｕｎｔｅｒ、ｆｒａｇｍｅｎｔｃｏｕｎｔｅｒなどによりパケットロスを検出し（Ｓ７０１）、前後の関係から、どのパケットが消失したかを判定する（Ｓ７０２）。 The receiving device 20 detects packet loss based on the packet sequence number, packet counter, fragment counter, etc. in the header of the MMT packet or MMT payload (S701), and determines which packet has been lost from the context relationship (S702). .

受信装置２０は、パケットロスが発生していないと判定された場合（Ｓ７０２でＮｏ）には、ＭＰ４ファイルを構成し、アクセスユニット或いはＮＡＬユニットを復号する（Ｓ７０３）。 If it is determined that no packet loss has occurred (No in S702), the receiving device 20 constructs an MP4 file and decodes the access unit or NAL unit (S703).

受信装置２０は、パケットロスが発生したと判定された場合（Ｓ７０２でＹｅｓ）には、パケットロスしたＮＡＬユニットに相当するＮＡＬユニットをダミーデータにより生成し、ＭＰ４ファイルを構成する（Ｓ７０４）。受信装置２０は、ＮＡＬユニットにダミーデータを入れる場合には、ＮＡＬユニットのタイプにダミーデータであることを示す。 If it is determined that a packet loss has occurred (Yes in S702), the receiving device 20 generates a NAL unit corresponding to the NAL unit in which the packet loss occurred using dummy data, and configures an MP4 file (S704). When putting dummy data into a NAL unit, the receiving device 20 indicates that the data is dummy data as the type of the NAL unit.

また、受信装置２０は、図１７、図１８、図３８、及び図３９で説明した方法に基づいて、次のアクセスユニットやＮＡＬユニットの先頭を検出し、先頭データからデコーダに入力することで、復号を再開することができる（Ｓ７０５）。 Further, the receiving device 20 detects the beginning of the next access unit or NAL unit based on the method described in FIGS. 17, 18, 38, and 39, and inputs the data from the beginning to the decoder. Decoding can be restarted (S705).

なお、パケットロスが発生した場合には、受信装置２０は、パケットヘッダに基づいて検出された情報に基づいてアクセスユニット及びＮＡＬユニットの先頭から復号を再開してもよいし、ダミーデータのＮＡＬユニットを含む、再構成されたＭＰ４ファイルのヘッダ情報に基づいてアクセスユニット及びＮＡＬユニットの先頭から復号を再開してもよい。 Note that when a packet loss occurs, the receiving device 20 may restart decoding from the beginning of the access unit and the NAL unit based on information detected based on the packet header, or may resume decoding from the beginning of the access unit and the NAL unit or decode the dummy data NAL unit. Decoding may be restarted from the beginning of the access unit and NAL unit based on the header information of the reconstructed MP4 file, including the header information of the reconstructed MP4 file.

受信装置２０は、ＭＰ４ファイル（ＭＰＵ）を蓄積する際には、パケットロスにより消失したパケットデータ（ＮＡＬユニットなど）は、放送や通信から別途取得して蓄積（置き換え）してもよい。 When storing MP4 files (MPU), the receiving device 20 may separately acquire packet data (NAL units, etc.) lost due to packet loss from broadcasting or communication and store (replace) them.

このとき、受信装置２０は、消失したパケットを通信から取得する場合には、消失したパケットの情報（パケットＩＤや、ＭＰＵシーケンス番号、パケットシーケンス番号、ＩＰデータフロー番号、及びＩＰアドレスなど）をサーバーに通知し、当該パケットを取得する。受信装置２０は、消失したパケットのみに限らず、消失したパケット前後のパケット群を同時に取得してもよい。 At this time, when acquiring the lost packet from the communication, the receiving device 20 sends the information of the lost packet (packet ID, MPU sequence number, packet sequence number, IP data flow number, IP address, etc.) to the server. and obtain the packet. The receiving device 20 may obtain not only the lost packet but also a group of packets before and after the lost packet at the same time.

［ムービーフラグメントの構成方法］
ここでは、ムービーフラグメントの構成方法について詳細に説明する。 [How to configure movie fragments]
Here, a method for configuring movie fragments will be explained in detail.

図３３で説明されたように、ムービーフラグメントを構成するサンプル数、及び、１つのＭＰＵを構成するムービーフラグメント数は、任意である。例えば、ムービーフラグメントを構成するサンプル数、及び、１つのＭＰＵを構成するムービーフラグメント数は、固定的に定められた所定の数であってもよいし、動的に決定されてもよい。 As explained with reference to FIG. 33, the number of samples making up a movie fragment and the number of movie fragments making up one MPU are arbitrary. For example, the number of samples that make up a movie fragment and the number of movie fragments that make up one MPU may be a fixed predetermined number, or may be determined dynamically.

ここで、送信側（送信装置１５）において下記の条件を満たすようにムービーフラグメントが構成されることで、受信装置２０における低遅延の復号を保証することができる。 Here, by configuring a movie fragment on the transmitting side (transmitting device 15) so as to satisfy the following conditions, decoding with low delay in the receiving device 20 can be guaranteed.

その条件とは、以下の通りである。 The conditions are as follows.

送信装置１５は、受信装置２０が、任意のサンプル（Ｓａｍｐｌｅ（ｉ））の復号時刻（ＤＴＳ（ｉ））より前には必ず当該サンプルの情報を含むＭＦメタを受信できるように、サンプルデータを分割した単位をムービーフラグメントとしてＭＦメタを生成・送信する。 The transmitting device 15 transmits the sample data so that the receiving device 20 can always receive the MF meta including information about the sample (Sample(i)) before the decoding time (DTS(i)) of the sample. MF meta is generated and transmitted using the divided units as movie fragments.

具体的には、送信装置１５は、ＤＴＳ（ｉ）より前に符号化済のサンプル（ｉ番目のサンプルを含む）を用いてムービーフラグメントを構成する。 Specifically, the transmitting device 15 constructs a movie fragment using samples encoded before DTS(i) (including the i-th sample).

低遅延の復号を保証するように、ムービーフラグメントを構成するサンプル数や１つのＭＰＵを構成するムービーフラグメント数を動的に決定する方法としては、例えば、下記の方法が用いられる。 As a method for dynamically determining the number of samples making up a movie fragment and the number of movie fragments making up one MPU so as to guarantee low-delay decoding, for example, the following method is used.

（１）復号開始時、ＧＯＰ先頭のサンプルＳａｍｐｌｅ（０）の復号時刻ＤＴＳ（０）は、ｉｎｉｔｉａｌ＿ｃｐｂ＿ｒｅｍｏｖａｌ＿ｄｅｌａｙに基づいた時刻である。送信装置は、ＤＴＳ（０）より前の時刻に、符号化完了済のサンプルを用いて第１のムービーフラグメントを構成する。また、送信装置１５は、第１のムービーフラグメントに対応するＭＦメタデータを生成し、ＤＴＳ（０）より前の時刻に送信する。 (1) At the start of decoding, the decoding time DTS(0) of the first sample of the GOP, Sample(0), is a time based on initial_cpb_removal_delay. The transmitting device constructs a first movie fragment using encoded samples at a time before DTS(0). Furthermore, the transmitting device 15 generates MF metadata corresponding to the first movie fragment and transmits it at a time before DTS(0).

（２）送信装置１５は、以降のサンプルにおいても、上記の条件を満たすようにムービーフラグメントを構成する。 (2) The transmitting device 15 configures movie fragments in subsequent samples as well so as to satisfy the above conditions.

例えば、ムービーフラグメントの先頭のサンプルがｋ番目のサンプルであるとしたとき、ｋ番目のサンプルを含むムービーフラグメントのＭＦメタは、ｋ番目のサンプルの復号時刻ＤＴＳ（ｋ）までに送信される。送信装置１５は、ｌ番目のサンプルの符号化完了時刻がＤＴＳ（ｋ）より前であり、（ｌ＋１）番目のサンプルの符号化完了時刻がＤＴＳ（ｋ）より後である場合には、ｋ番目のサンプルからｌ番目のサンプルを用いてムービーフラグメントを構成する。 For example, when the first sample of a movie fragment is the k-th sample, the MF meta of the movie fragment including the k-th sample is transmitted by the decoding time DTS(k) of the k-th sample. If the encoding completion time of the l-th sample is before DTS(k) and the encoding completion time of the (l+1)th sample is after DTS(k), the transmitting device 15 transmits the A movie fragment is constructed using the lth sample from the samples in .

なお、送信装置１５は、ｋ番目のサンプルから、ｌ番目に満たないサンプルまでを用いてムービーフラグメントを構成してもよい。 Note that the transmitting device 15 may configure a movie fragment using samples from the k-th sample to samples less than the l-th sample.

（３）送信装置１５は、ＭＰＵ最後のサンプルの符号化完了後、残りのサンプルを用いてムービーフラグメントを構成し、当該ムービーフラグメントに対応するＭＦメタデータを生成し、送信する。 (3) After the encoding of the last MPU sample is completed, the transmitting device 15 configures a movie fragment using the remaining samples, generates MF metadata corresponding to the movie fragment, and transmits it.

なお、送信装置１５は、符号化完了済のすべてのサンプルを用いてムービーフラグメントを構成せずに、符号化完了済の一部のサンプルを用いてムービーフラグメントを構成してもよい。 Note that the transmitting device 15 may not configure a movie fragment using all encoded samples, but may configure a movie fragment using some encoded samples.

なお、上記では、低遅延の復号を保証するように、上記条件に基づいて動的に、ムービーフラグメントを構成するサンプル数、及び、１つのＭＰＵを構成するムービーフラグメント数が決定される例を示した。しかしながら、サンプル数及びムービーフラグメント数の決定方法は、このような方法に限定されるものではない。例えば、１つのＭＰＵを構成するムービーフラグメント数が所定の値に固定され、上記条件を満たすようにサンプル数が決定されてもよい。また、１つのＭＰＵを構成するムービーフラグメント数、及びムービーフラグメントを分割する時刻（或いはムービーフラグメントの符号量）が所定の値に固定され、上記条件を満たすようにサンプル数が決定されてもよい。 Note that the above example shows an example in which the number of samples forming a movie fragment and the number of movie fragments forming one MPU are dynamically determined based on the above conditions to ensure low-delay decoding. Ta. However, the method for determining the number of samples and the number of movie fragments is not limited to this method. For example, the number of movie fragments constituting one MPU may be fixed to a predetermined value, and the number of samples may be determined to satisfy the above conditions. Furthermore, the number of movie fragments constituting one MPU and the time at which the movie fragments are divided (or the code amount of the movie fragments) may be fixed to predetermined values, and the number of samples may be determined so as to satisfy the above conditions.

また、ＭＰＵが複数のムービーフラグメントに分割されている場合、ＭＰＵが複数のムービーフラグメントに分割されているかどうかを示す情報、分割されたムービーフラグメントの属性、または分割されたムービーフラグメントに対するＭＦメタの属性が送信されてもよい。 In addition, if the MPU is divided into multiple movie fragments, information indicating whether the MPU is divided into multiple movie fragments, attributes of the divided movie fragments, or attributes of the MF meta for the divided movie fragments. may be sent.

ここで、ムービーフラグメントの属性とは、ムービーフラグメントが、ＭＰＵの先頭のムービーフラグメントであるか、ＭＰＵの最後のムービーフラグメントであるか、それ以外のムービーフラグメントであるか等を示す情報である。 Here, the attribute of a movie fragment is information indicating whether the movie fragment is the first movie fragment of the MPU, the last movie fragment of the MPU, or any other movie fragment.

また、ＭＦメタの属性とは、ＭＦメタが、ＭＰＵの先頭のムービーフラグメントに対応するＭＦメタであるか、ＭＰＵの最後のムービーフラグメントに対応するＭＦメタであるか、それ以外のムービーフラグメントに対応するＭＦメタであるか等を示す情報である。 In addition, the attributes of the MF meta include whether the MF meta corresponds to the first movie fragment of the MPU, the MF meta that corresponds to the last movie fragment of the MPU, or the MF meta corresponds to other movie fragments. This is information indicating whether the MF meta is used or not.

なお、送信装置１５は、ムービーフラグメントを構成するサンプル数、及び、１つのＭＰＵを構成するムービーフラグメント数を制御情報として格納し、送信してもよい。 Note that the transmitting device 15 may store and transmit the number of samples making up a movie fragment and the number of movie fragments making up one MPU as control information.

［受信装置の動作］
上記のように構成されたムービーフラグメントに基づく受信装置２０の動作について説明する。 [Operation of receiving device]
The operation of the receiving device 20 based on the movie fragment configured as described above will be explained.

受信装置２０は、ＰＴＳ及びＤＴＳのそれぞれの絶対時刻を、ＭＰＵタイムスタンプ記述子等、送信側からシグナリングされる絶対時刻、及びＭＦメタに含まれるＰＴＳ及びＤＴＳの相対時刻に基づいて決定する。 The receiving device 20 determines the absolute time of each of the PTS and DTS based on the absolute time signaled from the transmitting side, such as the MPU time stamp descriptor, and the relative time of the PTS and DTS included in the MF meta.

受信装置２０は、ＭＰＵが複数のムービーフラグメントに分割されているかどうかの情報に基づいて、ＭＰＵが分割されている場合は、分割されたムービーフラグメントの属性に基づいて、下記のように処理をする。 The receiving device 20 performs the following processing based on the information as to whether the MPU is divided into a plurality of movie fragments, and, if the MPU is divided, based on the attributes of the divided movie fragments. .

（１）受信装置２０は、ムービーフラグメントがＭＰＵの先頭のムービーフラグメントである場合、ＭＰＵタイムスタンプ記述子に含まれる先頭サンプルのＰＴＳの絶対時刻、及びＭＦメタに含まれるＰＴＳ及びＤＴＳの相対時刻を用いて、ＰＴＳ及びＤＴＳの絶対時刻を生成する。 (1) If the movie fragment is the first movie fragment of the MPU, the receiving device 20 determines the absolute time of the PTS of the first sample included in the MPU timestamp descriptor and the relative time of the PTS and DTS included in the MF meta. to generate the absolute time of PTS and DTS.

（２）受信装置２０は、ムービーフラグメントがＭＰＵの先頭のムービーフラグメントでない場合、ＭＰＵタイムスタンプ記述子の情報を用いずに、ＭＦメタに含まれるＰＴＳ及びＤＴＳの相対時刻を用いて、ＰＴＳ及びＤＴＳの絶対時刻を生成する。 (2) If the movie fragment is not the first movie fragment of the MPU, the receiving device 20 uses the relative time of the PTS and DTS included in the MF meta, without using the information of the MPU timestamp descriptor, to generates an absolute time.

（３）受信装置２０は、ムービーフラグメントがＭＰＵの最後のムービーフラグメントである場合、すべてのサンプルのＰＴＳ及びＤＴＳの絶対時刻を算出後、ＰＴＳ及びＤＴＳの計算処理（相対時刻の加算処理）をリセットする。なお、リセット処理は、ＭＰＵ先頭のムービーフラグメントにおいて実施してもよい。 (3) If the movie fragment is the last movie fragment of the MPU, the receiving device 20 resets the PTS and DTS calculation process (relative time addition process) after calculating the absolute time of the PTS and DTS of all samples. do. Note that the reset process may be performed in the movie fragment at the beginning of the MPU.

受信装置２０は、下記のようにムービーフラグメントが分割されているかどうかの判定を行ってもよい。また、受信装置２０は、下記のようにムービーフラグメントの属性情報を取得してもよい。 The receiving device 20 may determine whether the movie fragment is divided as described below. Furthermore, the receiving device 20 may acquire attribute information of movie fragments as described below.

例えば、受信装置２０は、ＭＭＴＰ（ＭＭＴＰｒｏｔｏｃｏｌ）ペイロードヘッダに示されるムービーフラグメントの順番を示す識別子ｍｏｖｉｅ＿ｆｒａｇｍｅｎｔ＿ｓｅｑｕｅｎｃｅ＿ｎｕｍｂｅｒフィールド値に基づいて分割されているかどうかを判定してもよい。 For example, the receiving device 20 may determine whether the movie fragments are divided based on the identifier movie_fragment_sequence_number field value indicating the order of movie fragments indicated in the MMTP (MMT Protocol) payload header.

具体的には、受信装置２０は、１つのＭＰＵに含まれるムービーフラグメントの数が１であり、かつ、ｍｏｖｉｅ＿ｆｒａｇｍｅｎｔ＿ｓｅｑｕｅｎｃｅ＿ｎｕｍｂｅｒフィールド値が１であり、かつ、当該フィールド値が２以上の値が存在する場合に、当該ＭＰＵは複数のムービーフラグメントに分割されていると判定してもよい。 Specifically, when the number of movie fragments included in one MPU is 1, the movie_fragment_sequence_number field value is 1, and the field value has a value of 2 or more, , it may be determined that the MPU is divided into multiple movie fragments.

また、受信装置２０は、１つのＭＰＵに含まれるムービーフラグメントの数が１であり、かつ、ｍｏｖｉｅ＿ｆｒａｇｍｅｎｔ＿ｓｅｑｕｅｎｃｅ＿ｎｕｍｂｅｒフィールド値が０であり、かつ、当該フィールド値が０以外の値が存在する場合に、当該ＭＰＵは複数のムービーフラグメントに分割されていると判定してもよい。 Furthermore, when the number of movie fragments included in one MPU is 1, the movie_fragment_sequence_number field value is 0, and the field value has a value other than 0, the receiving device 20 may be determined to be divided into multiple movie fragments.

ムービーフラグメントの属性情報も同様に、ｍｏｖｉｅ＿ｆｒａｇｍｅｎｔ＿ｓｅｑｕｅｎｃｅ＿ｎｕｍｂｅｒに基づいて判定されてもよい。 The attribute information of the movie fragment may also be determined based on the movie_fragment_sequence_number.

なお、ｍｏｖｉｅ＿ｆｒｅａｇｍｅｎｔ＿ｓｅｑｕｅｎｃｅ＿ｎｕｍｂｅｒを用いずとも、一つＭＰＵに含まれるムービーフラグメントやＭＦメタの送信をカウントすることにより、ムービーフラグメントが分割されているかどうかや、ムービーフラグメントの属性情報を判定されてもよい。 Note that without using the movie_freagment_sequence_number, whether or not a movie fragment is divided and the attribute information of the movie fragment may be determined by counting the transmission of movie fragments and MF metas included in one MPU.

以上説明したような送信装置１５および受信装置２０の構成により、受信装置２０は、ＭＰＵよりも短い間隔でムービーフラグメントメタデータを受信でき、低遅延での復号開始が可能となる。また、ＭＰ４パースの方法に基づいた復号処理を用いて、低遅延での復号を行うことが可能となる。 With the configuration of the transmitting device 15 and receiving device 20 as described above, the receiving device 20 can receive movie fragment metadata at shorter intervals than the MPU, and can start decoding with low delay. Further, by using decoding processing based on the MP4 parsing method, it is possible to perform decoding with low delay.

以上説明したようにＭＰＵが複数のムービーフラグメントに分割されている場合の受信動作について、フローチャートを用いて説明する。図４１は、ＭＰＵが複数のムービーフラグメントに分割されている場合の受信動作のフローチャートである。なお、このフローチャートは、図３７のステップＳ６０４の動作をより詳細に図示するものである。 The reception operation when the MPU is divided into a plurality of movie fragments as described above will be explained using a flowchart. FIG. 41 is a flowchart of the reception operation when the MPU is divided into a plurality of movie fragments. Note that this flowchart illustrates the operation of step S604 in FIG. 37 in more detail.

まず、受信装置２０は、ＭＭＴＰペイロードヘッダに示されるデータ種別に基づいて、データ種別がＭＦメタである場合に、ＭＦメタデータを取得する（Ｓ８０１）。 First, the receiving device 20 acquires MF metadata based on the data type indicated in the MMTP payload header when the data type is MF meta (S801).

次に、受信装置２０は、ＭＰＵが複数のムービーフラグメントに分割されているかどうかを判定し（Ｓ８０２）、ＭＰＵが複数のムービーフラグメントに分割されている場合（Ｓ８０２でＹｅｓ）には、受信したＭＦメタデータがＭＰＵ先頭のメタデータであるかどうかを判定する（Ｓ８０３）。受信装置２０は、受信したＭＦメタデータがＭＰＵ先頭のＭＦメタデータである場合（Ｓ８０３でＹｅｓ）には、ＭＰＵタイムスタンプ記述子に示されるＰＴＳの絶対時刻、並びにＭＦメタデータに示されるＰＴＳ及びＤＴＳの相対時刻よりＰＴＳ及びＤＴＳの絶対時刻を算出し（Ｓ８０４）、ＭＰＵの最後のメタデータであるかどうかの判定を行う（Ｓ８０５）。 Next, the receiving device 20 determines whether the MPU is divided into a plurality of movie fragments (S802), and if the MPU is divided into a plurality of movie fragments (Yes in S802), the received MF It is determined whether the metadata is the first metadata of the MPU (S803). If the received MF metadata is the MF metadata at the beginning of the MPU (Yes in S803), the receiving device 20 sets the absolute time of the PTS indicated in the MPU timestamp descriptor, and the PTS and MF metadata indicated in the MF metadata. The absolute time of PTS and DTS is calculated from the relative time of DTS (S804), and it is determined whether it is the last metadata of the MPU (S805).

一方、受信装置２０は、受信したＭＦメタデータがＭＰＵ先頭のＭＦメタデータでない場合（Ｓ８０３でＮｏ）には、ＭＰＵタイムスタンプ記述子の情報は用いずＭＦメタデータに示されるＰＴＳ及びＤＴＳの相対時刻を用いてＰＴＳ及びＤＴＳの絶対時刻を算出し（Ｓ８０８）、ステップＳ８０５の処理に移行する。 On the other hand, if the received MF metadata is not the MF metadata at the beginning of the MPU (No in S803), the receiving device 20 does not use the information of the MPU timestamp descriptor and uses the relative information of the PTS and DTS indicated in the MF metadata. The absolute time of PTS and DTS is calculated using the time (S808), and the process moves to step S805.

ステップＳ８０５において、ＭＰＵ最後のＭＦメタデータであると判定された場合（Ｓ８０５でＹｅｓ）、受信装置２０は、すべてのサンプルのＰＴＳ及びＤＴＳの絶対時刻を算出後、ＰＴＳ及びＤＴＳの計算処理をリセットする。ステップＳ８０５においてＭＰＵ最後のＭＦメタデータでないと判定された場合（Ｓ８０５でＮｏ）、受信装置２０は処理を終了する。 If it is determined in step S805 that it is the last MF metadata of the MPU (Yes in S805), the receiving device 20 resets the PTS and DTS calculation process after calculating the absolute time of the PTS and DTS of all samples. do. If it is determined in step S805 that the MF metadata is not the last MF metadata of the MPU (No in S805), the receiving device 20 ends the process.

また、ステップＳ８０２においてＭＰＵが複数のムービーフラグメントに分割されていないと判定された場合（Ｓ８０２でＮｏ）には、受信装置２０は、ＭＰＵの後に送信されるＭＦメタデータに基づき、サンプルデータを取得し、ＰＴＳ及びＤＴＳを決定する（Ｓ８０７）。 Further, if it is determined in step S802 that the MPU is not divided into multiple movie fragments (No in S802), the receiving device 20 acquires sample data based on the MF metadata transmitted after the MPU. Then, PTS and DTS are determined (S807).

そして、図示されないが、受信装置２０は、最後に、決定したＰＴＳ及びＤＴＳに基づいて復号処理、提示処理を実施する。 Although not shown, the receiving device 20 finally performs decoding processing and presentation processing based on the determined PTS and DTS.

［ムービーフラグメントを分割したときに発生する課題、及び、その解決策］
これまで、ムービーフラグメントを分割することによりＥｎｄ－ｔｏ－Ｅｎｄ遅延を短縮する方法について説明してきた。ここからは、ムービーフラグメントを分割したときに新たに発生する課題、及び、その解決策について説明する。 [Issues that occur when dividing movie fragments and their solutions]
So far, we have described how to reduce end-to-end delay by dividing movie fragments. From now on, we will explain new problems that arise when movie fragments are divided, and solutions to them.

まず、背景として、符号化データにおけるピクチャ構造について説明する。図４２は、時間スケーラビリティを実現する際の各ＴｅｍｐｏｒａｌＩｄにおけるピクチャの予測構造の例を示す図である。 First, as background, the picture structure in encoded data will be explained. FIG. 42 is a diagram illustrating an example of a predicted structure of a picture in each TemporalId when achieving temporal scalability.

ＭＰＥＧ－４ＡＶＣやＨＥＶＣ（ＨｉｇｈＥｆｆｉｃｉｅｎｃｙＶｉｄｅｏＣｏｄｉｎｇ）などの符号化方式においては、他のピクチャから参照可能なＢピクチャ（双方向参照予測ピクチャ）を用いることにより時間方向のスケーラビリティ（時間スケーラビリティ）が実現できる。 In coding methods such as MPEG-4 AVC and HEVC (High Efficiency Video Coding), temporal scalability is achieved by using B pictures (bidirectional reference predicted pictures) that can be referenced from other pictures. can.

図４２の（ａ）に示されるＴｅｍｐｏｒａｌＩｄとは、符号化構造の階層の識別子であり、ＴｅｍｐｏｒａｌＩｄは、値が大きくなるほど深い階層であることを示す。四角のブロックはピクチャを示し、ブロック内のＩｘは、Ｉピクチャ（画面内予測ピクチャ）、Ｐｘは、Ｐピクチャ（前方参照予測ピクチャ）、Ｂｘ及びｂｘは、Ｂピクチャ（双方向参照予測ピクチャ）を示す。Ｉｘ／Ｐｘ／Ｂｘのｘは表示オーダーを示し、ピクチャを表示する順番を表わす。ピクチャ間の矢印は参照関係を示し、例えば、Ｂ４のピクチャはＩ０、Ｂ８を参照画像として予測画像を生成することを示す。ここで、一のピクチャが、自らのＴｅｍｐｏｒａｌＩｄより大きいＴｅｍｐｏｒａｌＩｄを持つ他のピクチャを参照画像として使うことは禁止されている。階層が規定されているのは時間スケーラビリティを持たせるためであり、例えば、図４２において全てのピクチャを復号すると１２０ｆｐｓ（ｆｒａｍｅｐｅｒｓｅｃｏｎｄ）の映像が得られるが、ＴｅｍｐｏｒａｌＩｄが０から３までの階層のみを復号すると６０ｆｐｓの映像が得られる。 TemporalId shown in (a) of FIG. 42 is an identifier of the hierarchy of the encoding structure, and the larger the value of TemporalId, the deeper the hierarchy. A square block indicates a picture, Ix in the block is an I picture (intra-screen predicted picture), Px is a P picture (forward reference predicted picture), and Bx and bx are B pictures (bidirectional reference predicted picture). show. The x in Ix/Px/Bx indicates a display order and indicates the order in which pictures are displayed. Arrows between pictures indicate reference relationships; for example, picture B4 indicates that a predicted image is generated using I0 and B8 as reference images. Here, one picture is prohibited from using another picture whose TemporalId is larger than its own TemporalId as a reference image. The reason why the layers are specified is to provide temporal scalability. For example, in FIG. 42, when all pictures are decoded, a video of 120 fps (frame per second) is obtained, but only the layers with TemporalId from 0 to 3 When decoded, a 60fps video is obtained.

図４３は、図４２の各ピクチャにおける復号時刻（ＤＴＳ）と表示時刻（ＰＴＳ）との関係を示す図である。例えば、図４３に示されるピクチャＩ０は、復号及び表示においてギャップが発生しないように、Ｂ４の復号完了後に表示される。 FIG. 43 is a diagram showing the relationship between decoding time (DTS) and display time (PTS) in each picture in FIG. 42. For example, picture I0 shown in FIG. 43 is displayed after the decoding of B4 is completed so that no gap occurs in decoding and display.

図４３に示されるように、予測構造にＢピクチャが含まれる場合などには、復号順と表示順とが異なるため、受信装置２０においてピクチャを復号後にピクチャの遅延処理、及び、ピクチャの並び替え（リオーダ）処理が必要となる。 As shown in FIG. 43, when the prediction structure includes a B picture, the decoding order and the display order are different, so the receiving device 20 performs picture delay processing and rearranges the pictures after decoding them. (Reorder) processing is required.

以上、時間方向のスケーラビリティにおけるピクチャの予測構造の例について説明したが、時間方向のスケーラビリティが用いられない場合においても、予測構造によっては、ピクチャの遅延処理、及び、リオーダ処理が必要となる場合がある。図４４は、ピクチャの遅延処理、及び、リオーダ処理が必要となるピクチャの予測構造の一例を示す図である。なお、図４４における数字は、復号順を示す。 Examples of picture prediction structures with temporal scalability have been described above, but even when temporal scalability is not used, picture delay processing and reorder processing may be necessary depending on the prediction structure. be. FIG. 44 is a diagram illustrating an example of a predicted structure of a picture that requires picture delay processing and reorder processing. Note that the numbers in FIG. 44 indicate the decoding order.

図４４に示されるように、予測構造によっては、復号順において先頭となるサンプルと、提示順において先頭となるサンプルが異なる場合があり、図４４では、提示順で先頭となるサンプルは、復号順で４番目のサンプルとなる。なお、図４４は、予測構造の一例を示すものであり、予測構造はこのような構造に限定されるものではない。他の予測構造においても、復号順において先頭となるサンプルと、提示順において先頭となるサンプルとが異なる場合がある。 As shown in FIG. 44, depending on the prediction structure, the first sample in decoding order may be different from the first sample in presentation order. In FIG. 44, the first sample in presentation order is different from the first sample in decoding order. This is the fourth sample. Note that FIG. 44 shows an example of a prediction structure, and the prediction structure is not limited to such a structure. In other prediction structures, the first sample in decoding order and the first sample in presentation order may be different.

図４５は、図３３と同様に、ＭＰ４形式で構成されるＭＰＵが複数のムービーフラグメントに分割されて、ＭＭＴＰペイロード、ＭＭＴＰパケットに格納される例を示す図である。なお、ＭＰＵを構成するサンプル数や、ムービーフラグメントを構成するサンプル数は任意である。例えば、ＭＰＵを構成するサンプル数をＧＯＰ単位のサンプル数とし、ＧＯＰ単位の２分の１のサンプル数をムービーフラグメントとして、２つのムービーフラグメントが構成されてもよい。１サンプルが１つのムービーフラグメントとされてもよいし、ＭＰＵを構成するサンプルが分割されなくてもよい。 Similar to FIG. 33, FIG. 45 is a diagram showing an example in which an MPU configured in MP4 format is divided into a plurality of movie fragments and stored in MMTP payloads and MMTP packets. Note that the number of samples configuring an MPU and the number of samples configuring a movie fragment are arbitrary. For example, two movie fragments may be configured, with the number of samples configuring the MPU being the number of samples per GOP, and the number of samples being half the number of samples per GOP being the movie fragment. One sample may be one movie fragment, and the samples making up the MPU may not be divided.

図４５では、１つのＭＰＵに２つのムービーフラグメント（ｍｏｏｆボックス及びｍｄａｔボックス）が含まれる例が示されているが、１つのＭＰＵに含まれるムービーフラグメントは２つでなくてもよい。１つのＭＰＵに含まれるムービーフラグメントは、３つ以上であってもよいし、ＭＰＵに含まれるサンプル数であってもよい。また、ムービーフラグメントに格納されるサンプルは等分したサンプル数でなく、任意のサンプル数に分割されてもよい。 Although FIG. 45 shows an example in which one MPU includes two movie fragments (moof box and mdat box), one MPU does not need to include two movie fragments. The number of movie fragments included in one MPU may be three or more, or may be the number of samples included in the MPU. Furthermore, the samples stored in a movie fragment may not be divided into equal numbers of samples, but may be divided into an arbitrary number of samples.

ムービーフラグメントメタデータ（ＭＦメタデータ）には、ムービーフラグメントに含まれるサンプルのＰＴＳ、ＤＴＳ、オフセット、及びサイズの情報が含まれており、受信装置２０は、サンプルを復号する際には、当該サンプルの情報を含むＭＦメタからＰＴＳ及びＤＴＳを抽出し、復号タイミングや提示タイミングを決定する。 The movie fragment metadata (MF metadata) includes information on the PTS, DTS, offset, and size of the sample included in the movie fragment, and when the receiving device 20 decodes the sample, the receiving device 20 The PTS and DTS are extracted from the MF meta that includes information, and the decoding timing and presentation timing are determined.

ここからは、詳細説明のために、ｉサンプルの復号時刻の絶対値をＤＴＳ（ｉ）と記載し、提示時刻の絶対値をＰＴＳ（ｉ）と記載する。 From here on, for detailed explanation, the absolute value of the decoding time of the i sample will be written as DTS(i), and the absolute value of the presentation time will be written as PTS(i).

ＭＦメタにおけるｍｏｏｆ内に格納されているタイムスタンプ情報のうちｉ番目のサンプルの情報は、具体的には、ｉ番目のサンプルと（ｉ＋１）番目のサンプルの復号時刻の相対値、及び、ｉ番目のサンプルの復号時刻と提示時刻の相対値であり、これらを以降ＤＴ（ｉ）及びＣＴ（ｉ）と記載する。 Specifically, the information of the i-th sample among the timestamp information stored in the moof in MF meta is the relative value of the decoding time of the i-th sample and the (i+1)-th sample, and the i-th These are the relative values of the decoding time and presentation time of the sample, and these are hereinafter referred to as DT(i) and CT(i).

ムービーフラグメントメタデータ＃１には、サンプル＃１－＃３のＤＴ（ｉ）及びＣＴ（ｉ）が含まれており、ムービーフラグメントメタデータ＃２には、サンプル＃４－＃６のＤＴ（ｉ）及びＣＴ（ｉ）が含まれている。 Movie fragment metadata #1 includes DT(i) and CT(i) of samples #1-#3, and movie fragment metadata #2 includes DT(i) of samples #4-#6. ) and CT(i).

また、ＭＰＵ先頭のアクセスユニットのＰＴＳ絶対値は、ＭＰＵタイムスタンプ記述子などに格納されており、受信装置２０は、ＭＰＵ先頭のアクセスユニットのＰＴＳ＿ＭＰＵと、ＣＴ及びＤＴとに基づいてＰＴＳ及びＤＴＳを算出する。 Further, the absolute value of PTS of the access unit at the beginning of the MPU is stored in the MPU time stamp descriptor, etc., and the receiving device 20 calculates the PTS and DTS based on the PTS_MPU of the access unit at the beginning of the MPU, and the CT and DT. calculate.

図４６は、＃１－＃１０のサンプルによりＭＰＵが構成される場合のＰＴＳ及びＤＴＳの算出方法と課題とを説明するための図である。 FIG. 46 is a diagram for explaining a method of calculating PTS and DTS and problems when an MPU is configured by samples #1 to #10.

図４６の（ａ）は、ＭＰＵがムービーフラグメントに分割されない例を示し、図４６の（ｂ）は、ＭＰＵが５サンプル単位の２つのムービーフラグメントに分割される例を示し、図４６の（ｃ）は、ＭＰＵがサンプル単位に１０のムービーフラグメントに分割される例を示す。 (a) of FIG. 46 shows an example in which the MPU is not divided into movie fragments, (b) in FIG. 46 shows an example in which the MPU is divided into two movie fragments of 5 samples each, and (c) in FIG. ) shows an example in which the MPU is divided into 10 movie fragments in units of samples.

図４５で説明したように、ＭＰＵタイムスタンプ記述子と、ＭＰ４内のタイムスタンプ情報（ＣＴ及びＤＴ）とを用いてＰＴＳ及びＤＴＳが算出される場合において、図４４における提示順で先頭となるサンプルは、復号順で４番目である。このため、ＭＰＵタイムスタンプ記述子に格納されているＰＴＳは、復号順で４番目のサンプルのＰＴＳ（絶対値）となる。なお、以降では、このサンプルをＡサンプルと呼ぶ。また、復号順で先頭のサンプルをＢサンプルと呼ぶ。 As explained in FIG. 45, when PTS and DTS are calculated using the MPU timestamp descriptor and the timestamp information (CT and DT) in MP4, the first sample in the presentation order in FIG. is the fourth in the decoding order. Therefore, the PTS stored in the MPU timestamp descriptor is the PTS (absolute value) of the fourth sample in the decoding order. Note that, hereinafter, this sample will be referred to as the A sample. Further, the first sample in decoding order is called a B sample.

タイムスタンプに係る絶対時刻情報は、ＭＰＵタイムスタンプ記述子の情報のみであるため、受信装置２０は、Ａサンプルが到着するまで、その他のサンプルのＰＴＳ（絶対時刻）及びＤＴＳ（絶対時刻）を算出できない。受信装置２０は、ＢサンプルのＰＴＳ及びＤＴＳも算出できない。 Since the absolute time information related to the time stamp is only the information of the MPU time stamp descriptor, the receiving device 20 calculates the PTS (absolute time) and DTS (absolute time) of the other samples until the A sample arrives. Can not. The receiving device 20 also cannot calculate the PTS and DTS of the B sample.

図４６の（ａ）の例では、Ａサンプルは、Ｂサンプルと同じムービーフラグメントに含まれ、一つのＭＦメタに格納される。このため、受信装置２０は、当該ＭＦメタを受信後、すぐにＢサンプルのＤＴＳを決定できる。 In the example of FIG. 46(a), the A sample is included in the same movie fragment as the B sample, and is stored in one MF meta. Therefore, the receiving device 20 can determine the DTS of the B sample immediately after receiving the MF meta.

図４６の（ｂ）の例では、Ａサンプルは、Ｂサンプルと同じムービーフラグメントに含まれ、一つのＭＦメタに格納される。このため、受信装置２０は、当該ＭＦメタを受信後、すぐにＢサンプルのＤＴＳを決定できる。 In the example of FIG. 46(b), the A sample is included in the same movie fragment as the B sample, and is stored in one MF meta. Therefore, the receiving device 20 can determine the DTS of the B sample immediately after receiving the MF meta.

図４６の（ｃ）の例では、Ａサンプルは、Ｂサンプルと異なるムービーフラグメントに含まれる。このため、受信装置２０は、Ａサンプルを含むムービーフラグメントのＣＴ及びＤＴを含むＭＦメタを受信後でなければ、ＢサンプルのＤＴＳを決定できない。 In the example of FIG. 46(c), the A sample is included in a different movie fragment from the B sample. Therefore, the receiving device 20 cannot determine the DTS of the B sample until after receiving the MF meta including the CT and DT of the movie fragment including the A sample.

したがって、図４６の（ｃ）の例の場合には、受信装置２０は、Ｂサンプルの到着後、すぐに復号を開始できない。 Therefore, in the example of FIG. 46(c), the receiving device 20 cannot start decoding immediately after the B sample arrives.

このように、Ｂサンプルを含むムービーフラグメントに、Ａサンプルが含まれない場合には、受信装置２０は、Ａサンプルを含むムービーフラグメントに係るＭＦメタを受信した後でなければ、Ｂサンプルの復号を開始できない。 In this way, if a movie fragment including a B sample does not include an A sample, the receiving device 20 decodes the B sample only after receiving the MF meta related to the movie fragment including the A sample. Unable to start.

提示順で先頭のサンプルと、デコード順で先頭のサンプルとが一致しない場合において、ＡサンプルとＢサンプルとが同一ムービーフラグメントに格納されなくなるまでにムービーフラグメントが分割されることにより、この課題は発生する。また、ＭＦメタが後送りであるか先送りであるかにかかわらず、この課題は発生する。 This problem occurs when the first sample in the presentation order and the first sample in the decoding order do not match, and the movie fragment is divided until the A sample and B sample are no longer stored in the same movie fragment. do. Moreover, this problem occurs regardless of whether the MF meta is postponed or postponed.

このように、提示順で先頭のサンプルと、デコード順で先頭のサンプルとが一致しない場合において、Ａサンプルと、Ｂサンプルとが同一ムービーフラグメントに格納されない場合には、Ｂサンプルの受信後、すぐにＤＴＳを決定できない。そこで、送信装置１５は、別途、ＢサンプルのＤＴＳ（絶対値）、或いはＢサンプルのＤＴＳ（絶対値）を受信側において算出可能な情報を送信する。このような情報は、制御情報やパケットヘッダ等を用いて送信されてもよい。 In this way, if the first sample in the presentation order and the first sample in the decoding order do not match, and if the A sample and the B sample are not stored in the same movie fragment, the DTS cannot be determined. Therefore, the transmitting device 15 separately transmits the DTS (absolute value) of the B samples or information that allows the receiving side to calculate the DTS (absolute value) of the B samples. Such information may be transmitted using control information, a packet header, or the like.

受信装置２０は、このような情報を用いてＢサンプルのＤＴＳ（絶対値）を算出する。図４７は、このような情報を用いてＤＴＳが算出される場合の受信動作のフローチャートである。 The receiving device 20 uses such information to calculate the DTS (absolute value) of the B samples. FIG. 47 is a flowchart of the reception operation when the DTS is calculated using such information.

受信装置２０は、ＭＰＵ先頭のムービーフラグメントを受信し（Ｓ９０１）、ＡサンプルとＢサンプルとが同一ムービーフラグメントに格納されているかどうかを判定する（Ｓ９０２）。同一ムービーフラグメントに格納されている場合（Ｓ９０２でＹｅｓ）は、受信装置２０は、ＢサンプルのＤＴＳ（絶対時刻）を用いず、ＭＦメタの情報のみを用いてＤＴＳを算出し、復号を開始する（Ｓ９０４）。なお、ステップＳ９０４において、受信装置２０は、ＢサンプルのＤＴＳを用いてＤＴＳを決定してもよい。 The receiving device 20 receives the movie fragment at the beginning of the MPU (S901), and determines whether the A sample and the B sample are stored in the same movie fragment (S902). If they are stored in the same movie fragment (Yes in S902), the receiving device 20 calculates the DTS using only the MF meta information without using the DTS (absolute time) of the B sample, and starts decoding. (S904). Note that in step S904, the receiving device 20 may determine the DTS using the DTS of B samples.

一方、ステップＳ９０２においてＡサンプルとＢサンプルとが同一ムービーフラグメントに格納されていない場合（Ｓ９０２でＮｏ）、受信装置２０は、ＢサンプルのＤＴＳ（絶対時刻）を取得し、ＤＴＳを決定し、復号を開始する（Ｓ９０３）。 On the other hand, if the A sample and the B sample are not stored in the same movie fragment in step S902 (No in S902), the receiving device 20 obtains the DTS (absolute time) of the B sample, determines the DTS, and decodes the sample. (S903).

なお、以上の説明では、ＭＭＴ規格におけるＭＦメタ（ＭＰ４形式のｍｏｏｆ内に格納されているタイムスタンプ情報）を用いて、各サンプルの復号時刻の絶対値と、提示時刻の絶対値とを算出する例について説明したが、ＭＦメタを、各サンプルの復号時刻の絶対値と、提示時刻の絶対値を算出に用いることができる任意の制御情報に置き換えて実施しても良いことは言うまでもない。このような制御情報の例としては、上述したｉ番目のサンプルと（ｉ＋１）番目のサンプルの復号時刻の相対値ＣＴ（ｉ）を、ｉ番目のサンプルと（ｉ＋１）番目のサンプルの提示時刻の相対値に置き換えた制御情報や、ｉ番目のサンプルと（ｉ＋１）番目のサンプルの復号時刻の相対値ＣＴ（ｉ）とｉ番目のサンプルと（ｉ＋１）番目のサンプルの提示時刻の相対値との両方を含む制御情報などがある。 Note that in the above explanation, the absolute value of the decoding time and the absolute value of the presentation time of each sample is calculated using MF meta (time stamp information stored in MP4 format moof) in the MMT standard. Although an example has been described, it goes without saying that the MF meta may be replaced with any control information that can be used to calculate the absolute value of the decoding time and the absolute value of the presentation time of each sample. An example of such control information is the relative value CT(i) of the decoding times of the i-th sample and the (i+1)-th sample described above, and the relative value CT(i) of the decoding times of the i-th sample and the (i+1)-th sample. The control information replaced with relative values, the relative value CT(i) of the decoding time of the i-th sample and the (i+1)-th sample, and the relative value of the presentation time of the i-th sample and the (i+1)-th sample. There is control information that includes both.

（実施の形態３）
［概要］
実施の形態３では、映像、音声、字幕、及びデータ放送などのコンテンツを放送で伝送する場合のコンテンツの送信方法及びデータ構造について説明する。つまり、放送ストリームの再生に特化したコンテンツの送信方法及びデータ構造について説明する。 (Embodiment 3)
[overview]
In Embodiment 3, a content transmission method and data structure when transmitting content such as video, audio, subtitles, and data broadcasting by broadcasting will be described. That is, a content transmission method and data structure specialized for playing a broadcast stream will be explained.

なお、実施の形態３では、多重化方式としてＭＭＴ方式（以下、単にＭＭＴとも記載する）が用いられる例について説明するが、ＭＰＥＧ－ＤＡＳＨまたはＲＴＰなど、その他の多重化方式が用いられてもよい。 Note that in Embodiment 3, an example will be described in which the MMT method (hereinafter also simply referred to as MMT) is used as the multiplexing method, but other multiplexing methods such as MPEG-DASH or RTP may also be used. .

まず、ＭＭＴにおけるデータユニット（ＤＵ：ＤａｔａＵｎｉｔ）のペイロードへの格納方法の詳細について説明する。図４８は、ＭＭＴにおけるデータユニットのペイロードへの格納方法を説明するための図である。 First, details of a method for storing a data unit (DU: Data Unit) in a payload in MMT will be explained. FIG. 48 is a diagram for explaining a method of storing a data unit in a payload in MMT.

ＭＭＴでは、送信装置は、ＭＰＵを構成するデータの一部を、データユニットとしてＭＭＴＰペイロードに格納し、ヘッダをつけて伝送する。ヘッダにはＭＭＴＰペイロードヘッダ、及び、ＭＭＴＰパケットヘッダが含まれる。なお、データユニットの単位は、ＮＡＬユニット単位でもよいし、サンプル単位でもよい。 In MMT, a transmitting device stores a part of data that constitutes an MPU in an MMTP payload as a data unit, attaches a header, and transmits the data. The header includes an MMTP payload header and an MMTP packet header. Note that the data unit may be a NAL unit or a sample.

図４８の（ａ）は、送信装置が複数のデータユニットをアグリゲーションして一つのペイロードに格納する例を示す。図４８の（ａ）の例では、複数のデータユニットそれぞれの先頭に、データユニットヘッダ（ＤＵＨ：ＤａｔａＵｎｉｔＨｅａｄｅｒ）、及び、データユニット長（ＤＵＬ：ＤａｔａＵｎｉｔＬｅｎｇｔｈ）が付与され、データユニットヘッダ及びデータユニット長が付与されたデータユニットが複数まとめてペイロードに格納される。 FIG. 48(a) shows an example in which a transmitting device aggregates a plurality of data units and stores them in one payload. In the example of FIG. 48(a), a data unit header (DUH) and a data unit length (DUL) are added to the beginning of each of a plurality of data units, and the data unit header and A plurality of data units to which a data unit length is assigned are collectively stored in the payload.

図４８の（ｂ）は、一つのデータユニットを一つのペイロードに格納する例を示す。図４８の（ｂ）の例では、データユニットの先頭に、データユニットヘッダが付与されてペイロードに格納される。図４８の（ｃ）は、一つのデータユニットを分割し、分割されたデータユニットに、データユニットヘッダが付与されてペイロードに格納される例を示す。 FIG. 48(b) shows an example in which one data unit is stored in one payload. In the example of FIG. 48(b), a data unit header is added to the beginning of the data unit and stored in the payload. FIG. 48C shows an example in which one data unit is divided, a data unit header is added to the divided data unit, and the data unit header is stored in the payload.

データユニットには、映像、音声、または字幕などの同期に関する情報を含むメディアであるｔｉｍｅｄ－ＭＦＵ、ファイルなど同期に関する情報を含まないメディアであるｎｏｎ－ｔｉｍｅｄ－ＭＦＵ、ＭＰＵメタデータ、ＭＦメタデータなどの種類があり、データユニットの種類に応じてデータユニットヘッダが定められる。なお、ＭＰＵメタデータ、及び、ＭＦメタデータにはデータユニットヘッダは存在しない。 Data units include timed-MFU, which is media that includes information related to synchronization such as video, audio, or subtitles, non-timed-MFU, which is media that does not include information related to synchronization, such as files, MPU metadata, MF metadata, etc. There are several types, and the data unit header is determined according to the type of data unit. Note that there is no data unit header in the MPU metadata and MF metadata.

また、送信装置は、異なる種類のデータユニットをアグリゲーションすることは原則としてできないが、異なる種類のデータユニットをアグリゲーションできるように規定されてもよい。例えば、サンプル毎にムービーフラグメントに分割されている場合などのＭＦメタデータのサイズが小さい場合、ＭＦメタデータとメディアデータとをアグリゲーションすることにより、パケット数を削減でき、さらに、伝送容量を削減することもできる。 Further, although the transmitting device cannot in principle aggregate different types of data units, it may be specified to be able to aggregate different types of data units. For example, if the size of MF metadata is small, such as when each sample is divided into movie fragments, the number of packets can be reduced by aggregating MF metadata and media data, and furthermore, the transmission capacity can be reduced. You can also do that.

データユニットがＭＦＵの場合は、ＭＰＵ（ＭＰ４）を構成するための情報など、ＭＰＵの一部の情報がヘッダとして格納される。 When the data unit is an MFU, some information of the MPU, such as information for configuring the MPU (MP4), is stored as a header.

例えば、ｔｉｍｅｄ－ＭＦＵのヘッダには、ｍｏｖｉｅ＿ｆｒａｇｍｅｎｔ＿ｓｅｑｕｅｎｃｅ＿ｎｕｍｂｅｒ、ｓａｍｐｌｅ＿ｎｕｍｂｅｒ、ｏｆｆｓｅｔ、ｐｒｉｏｒｉｔｙ、及び、ｄｅｐｅｎｄｅｎｃｙ＿ｃｏｕｎｔｅｒなどが含まれ、ｎｏｎ－ｔｉｍｅｄ－ＭＦＵのヘッダにはｉｔｅｍ＿ｉＤが含まれる。各フィールドの意味はＩＳＯ／ＩＥＣ２３００８－１あるいはＡＲＩＢＳＴＤ－Ｂ６０などの規格に示される。以下、このような規格において規定される各フィールドの意味について説明する。 For example, the header of a timed-MFU includes movie_fragment_sequence_number, sample_number, offset, priority, dependency_counter, etc., and the header of a non-timed-MFU includes item_iD. The meaning of each field is shown in standards such as ISO/IEC23008-1 or ARIB STD-B60. The meaning of each field defined in such standards will be explained below.

ｍｏｖｉｅ＿ｆｒａｇｍｅｎｔ＿ｓｅｑｕｅｎｃｅ＿ｎｕｍｂｅｒは、ＭＦＵが属するムービーフラグメントのシーケンス番号を示し、ＩＳＯ／ＩＥＣ１４４９６－１２にも示される。 movie_fragment_sequence_number indicates the sequence number of the movie fragment to which the MFU belongs, and is also indicated in ISO/IEC14496-12.

ｓａｍｐｌｅ＿ｎｕｍｂｅｒは、当該ＭＦＵが属するサンプル番号を示し、ＩＳＯ／ＩＥＣ１４４９６－１２にも示される。 sample_number indicates the sample number to which the MFU belongs, and is also indicated in ISO/IEC14496-12.

ｏｆｆｓｅｔは、当該ＭＦＵが属するサンプルにおける、ＭＦＵのオフセット量をバイト単位で示す。 offset indicates the offset amount of the MFU in the sample to which the MFU belongs, in bytes.

ｐｒｉｏｒｉｔｙは、当該ＭＦＵが属するＭＰＵにおける、ＭＦＵの相対的な重要度を示し、ｐｒｉｏｒｉｔｙの数字が大きいＭＦＵは、ｐｒｉｏｒｉｔｙの数字が小さいＭＦＵよりも重要であることを示す。 The priority indicates the relative importance of the MFU in the MPU to which the MFU belongs, and an MFU with a large priority number indicates that it is more important than an MFU with a small priority number.

ｄｅｐｅｎｄｅｎｃｙ＿ｃｏｕｎｔｅｒは、復号処理が当該ＭＦＵに依存しているＭＦＵ数（すなわち、このＭＦＵを復号処理しなければ、その復号処理を行うことができないＭＦＵの数）を示す。例えば、ＭＦＵがＨＥＶＣである場合においてＢピクチャまたはＰピクチャがＩピクチャを参照する場合、当該ＢピクチャまたはＰピクチャは、Ｉピクチャを復号処理しなければ復号処理を行うことができない。 dependency_counter indicates the number of MFUs on which the decoding process depends on the MFU (that is, the number of MFUs for which the decoding process cannot be performed unless this MFU is decoded). For example, when the MFU is HEVC and a B picture or a P picture references an I picture, the B picture or P picture cannot be decoded unless the I picture is decoded.

したがって、ＭＦＵがサンプル単位である場合は、ＩピクチャのＭＦＵにおけるｄｅｐｅｎｄｅｎｃｙ＿ｃｏｕｎｔｅｒには、当該Ｉピクチャを参照するピクチャ数が示される。ＭＦＵがＮＡＬユニット単位の場合は、Ｉピクチャに属するＭＦＵにおけるｄｅｐｅｎｄｅｎｃｙ＿ｃｏｕｎｔｅｒには、当該Ｉピクチャを参照するピクチャに属するＮＡＬユニット数が示される。さらに、時間方向階層符号化された映像信号の場合、拡張レイヤのＭＦＵは、ベースレイヤのＭＦＵに依存するため、ベースレイヤのＭＦＵにおけるｄｅｐｅｎｄｅｎｃｙ＿ｃｏｕｎｔｅｒには、拡張レイヤのＭＦＵの数が示される。本フィールドは、依存するＭＦＵ数が決定した後でなければ生成できない。 Therefore, when the MFU is in units of samples, the dependency_counter in the MFU of an I picture indicates the number of pictures that refer to the I picture. If the MFU is in units of NAL units, dependency_counter in the MFU belonging to an I picture indicates the number of NAL units belonging to a picture that refers to the I picture. Furthermore, in the case of a temporally hierarchically encoded video signal, the MFU of the enhancement layer depends on the MFU of the base layer, so the dependency_counter in the MFU of the base layer indicates the number of MFUs of the enhancement layer. This field can only be generated after the number of dependent MFUs has been determined.

ｉｔｅｍ＿ｉＤは、アイテムを一意に特定する識別子を示す。 item_id indicates an identifier that uniquely identifies an item.

［ＭＰ４非サポートモード］
図１９、及び、図２１で説明したように、送信装置がＭＭＴにおけるＭＰＵを伝送する方法としては、ＭＰＵメタデータまたはＭＦメタデータをメディアデータの前または後に送信する方法、及び、メディアデータのみを送信する方法がある。また、受信装置では、ＭＰ４に準拠した受信装置や受信方法を用いて復号を行う方法や、ヘッダを用いずに復号する方法がある。 [MP4 non-support mode]
As explained in FIGS. 19 and 21, methods for the transmitting device to transmit MPU in MMT include transmitting MPU metadata or MF metadata before or after media data, and transmitting only media data. There is a way to send it. Further, in the receiving device, there are a method of decoding using a receiving device and a receiving method compliant with MP4, and a method of decoding without using a header.

放送ストリーム再生に特化したデータの送信方法として、例えば、受信装置におけるＭＰ４再構成をサポートしない送信方法がある。 As a data transmission method specialized for broadcast stream playback, there is, for example, a transmission method that does not support MP4 reconfiguration in a receiving device.

受信装置におけるＭＰ４再構成をサポートしない送信方法とは、例えば、図２１の（ｂ）に示されるようにメタデータ（ＭＰＵメタデータ及びＭＦメタデータ）を送信しない方法がある。この場合、ＭＭＴＰパケットに含まれるフラグメントタイプ（データユニットの種類を示す情報）のフィールド値は、２（＝ＭＦＵ）固定である。 A transmission method that does not support MP4 reconfiguration in the receiving device is, for example, a method in which metadata (MPU metadata and MF metadata) is not transmitted, as shown in FIG. 21(b). In this case, the field value of the fragment type (information indicating the type of data unit) included in the MMTP packet is fixed to 2 (=MFU).

メタデータが送信されない場合は、これまで説明したように、ＭＰ４準拠の受信装置などでは、受信したデータをＭＰ４として復号することはできないが、メタデータ（ヘッダ）を用いずに復号することが可能である。 If metadata is not sent, as explained above, an MP4-compliant receiving device cannot decode the received data as MP4, but it is possible to decode it without using metadata (header). It is.

そのため、メタデータは放送ストリーム復号及び再生に必ずしも必須の情報ではない。同様に、図４８で説明した、ｔｉｍｅｄ－ＭＦＵにおけるデータユニットヘッダの情報は、受信装置においてＭＰ４を再構成するための情報である。放送ストリーム再生においてＭＰ４を再構成する必要はないため、ｔｉｍｅｄ－ＭＦＵにおけるデータユニットヘッダ（以下、ｔｉｍｅｄ－ＭＦＵヘッダとも記載する）の情報は、放送ストリーム再生に必ずしも必要な情報ではない。 Therefore, metadata is not necessarily essential information for broadcast stream decoding and playback. Similarly, the information in the data unit header in the timed-MFU described with reference to FIG. 48 is information for reconfiguring the MP4 in the receiving device. Since there is no need to reconstruct MP4 in broadcast stream playback, the information in the data unit header (hereinafter also referred to as timed-MFU header) in the timed-MFU is not necessarily information necessary for broadcast stream playback.

受信装置は、メタデータ、および、データユニットヘッダにおけるＭＰ４を再構成するための情報（以下、ＭＰ４構成情報とも記載する）を用いることにより、容易にＭＰ４を再構成することができる。しかし、受信装置は、メタデータ、および、データユニットヘッダにおけるＭＰ４構成情報のどちらか一方のみが伝送されていたとしても、ＭＰ４を再構成することはできない。メタデータ及びＭＰ４を再構成するための情報のどちらか一方のみが伝送されることによるメリットは少なく、必要でない情報を生成及び伝送することは、処理の増大や伝送効率の低下を招く。 The receiving device can easily reconstruct MP4 by using metadata and information for reconstructing MP4 in the data unit header (hereinafter also referred to as MP4 configuration information). However, the receiving device cannot reconstruct the MP4 even if only one of the metadata and the MP4 configuration information in the data unit header is transmitted. There is little merit in transmitting only either the metadata or the information for reconstructing the MP4, and generating and transmitting unnecessary information results in an increase in processing and a decrease in transmission efficiency.

そこで、送信装置は、下記の方法を用いてＭＰ４構成情報のデータ構造や伝送を制御する。送信装置は、メタデータが伝送されるかどうかに基づいて、データユニットヘッダにおいてＭＰ４構成情報を示すか否かを決定する。具体的には、送信装置は、メタデータが伝送される場合には、データユニットヘッダにおいてＭＰ４構成情報を示し、メタデータが伝送されない場合には、データユニットヘッダにおいてＭＰ４構成情報を示さない。 Therefore, the transmitter uses the method described below to control the data structure and transmission of the MP4 configuration information. The transmitting device determines whether to indicate MP4 configuration information in the data unit header based on whether metadata is transmitted. Specifically, the transmitting device indicates MP4 configuration information in the data unit header when metadata is transmitted, and does not indicate MP4 configuration information in the data unit header when metadata is not transmitted.

データユニットヘッダにおいてＭＰ４構成情報を示さない方法としては、例えば下記の方法を用いることができる。 As a method of not indicating MP4 configuration information in the data unit header, for example, the following method can be used.

１．送信装置は、ＭＰ４構成情報をｒｅｓｅｒｖｅｄとし、運用しない。これにより、ＭＰ４構成情報を生成する送出側の処理量（送信装置の処理量）を削減することができる。 1. The transmitting device sets the MP4 configuration information to reserved and does not operate it. This makes it possible to reduce the amount of processing on the sending side that generates MP4 configuration information (the amount of processing on the transmitting device).

２．送信装置は、ＭＰ４構成情報を削除し、ヘッダ圧縮する。これにより、ＭＰ４構成情報を生成する送出側の処理量を削減することができるとともに、伝送容量を削減することができる。 2. The transmitting device deletes the MP4 configuration information and compresses the header. Thereby, it is possible to reduce the amount of processing on the sending side that generates MP4 configuration information, and it is also possible to reduce the transmission capacity.

なお、送信装置は、ＭＰ４構成情報を削除し、ヘッダ圧縮する場合には、ＭＰ４構成情報を削除（圧縮）したことを示すフラグを示してもよい。フラグは、ヘッダ（ＭＭＴＰパケットヘッダ、ＭＭＴＰペイロードヘッダ、データユニットヘッダ）または制御情報などに示される。 Note that, when deleting MP4 configuration information and compressing the header, the transmitting device may display a flag indicating that the MP4 configuration information has been deleted (compressed). The flag is indicated in a header (MMTP packet header, MMTP payload header, data unit header), control information, or the like.

また、メタデータが伝送されるかどうかの情報は、あらかじめ定めていてもよいし、別途ヘッダや制御情報にシグナリングし、受信装置に伝送されてもよい。 Further, information as to whether or not metadata is to be transmitted may be determined in advance, or may be separately signaled in a header or control information and transmitted to the receiving device.

例えば、ＭＦＵヘッダに当該ＭＦＵに対応するメタデータが伝送されているかの情報が格納されてもよい。 For example, information as to whether metadata corresponding to the MFU is being transmitted may be stored in the MFU header.

一方、受信装置は、メタデータが伝送されているかどうかに基づいて、ＭＰ４構成情報が示されているかどうかを判定することができる。 On the other hand, the receiving device can determine whether MP4 configuration information is indicated based on whether metadata is being transmitted.

ここで、データの送信順序（例えば、ＭＰＵメタデータ、ＭＦメタデータ、メディアデータのような順序）が決まっている場合は、受信装置は、メディアデータの前にメタデータが受信されたかどうかに基づいて判定してもよい。 Here, if the data transmission order is fixed (e.g., MPU metadata, MF metadata, media data, etc.), the receiving device can transmit the data based on whether the metadata is received before the media data. It may be determined by

ＭＰ４構成情報が示されている場合には、受信装置は、ＭＰ４構成情報をＭＰ４の再構成に用いることができる。或いは、受信装置は、その他のアクセスユニットやＮＡＬユニットの先頭の検出などにＭＰ４構成情報を用いることができる。 If MP4 configuration information is indicated, the receiving device can use the MP4 configuration information for MP4 reconfiguration. Alternatively, the receiving device can use the MP4 configuration information to detect the beginning of other access units or NAL units.

なお、ＭＰ４構成情報は、ｔｉｍｅｄ－ＭＦＵヘッダの全部であってもよいし一部であってもよい。 Note that the MP4 configuration information may be all or a part of the timed-MFU header.

また、送信装置は、ｎｏｎ－ｔｉｍｅｄ－ＭＦＵヘッダにおいても同様に、メタデータが伝送されるかどうかに基づいて、ｎｏｎ－ｔｉｍｅｄ－ＭＦＵヘッダにおいてｉｔｅｍ
ｉｄを示すかどうかを決定してもよい。 Similarly, the transmitting device selects an item in the non-timed-MFU header based on whether or not metadata is transmitted in the non-timed-MFU header.
It may also be determined whether to indicate the id.

送信装置は、ｔｉｍｅｄ－ＭＦＵと、ｎｏｎ－ｔｉｍｅｄ－ＭＦＵとのどちらか一方においてのみＭＰ４構成情報を示すとしてもよい。どちらか一方にのみＭＰ４構成情報を示す場合、送信装置は、メタデータが伝送されるかどうかに加え、ｔｉｍｅｄ－ＭＦＵかｎｏｎ－ｔｉｍｅｄ－ＭＦＵかどうかに基づいてＭＰ４構成情報を示すかどうかを決定する。受信装置では、メタデータが伝送されるかどうか、および、ｔｉｍｅｄ／ｎｏｎ－ｔｉｍｅｄフラグに基づいてＭＰ４構成情報が示されるかどうかを判定することができる。 The transmitting device may indicate MP4 configuration information only in either the timed-MFU or the non-timed-MFU. If MP4 configuration information is to be shown to only one of the MFUs, the transmitting device determines whether to show the MP4 configuration information based on whether the MFU is a timed-MFU or a non-timed-MFU, in addition to whether metadata is transmitted. do. The receiving device can determine whether metadata is transmitted and whether MP4 configuration information is indicated based on the timed/non-timed flag.

なお、以上の説明においては、送信装置は、メタデータ（ＭＰＵメタデータ及びＭＦメタデータの両方）が伝送されるかどうかに基づいてＭＰ４構成情報を示すかどうかを決定した。しかしながら、送信装置は、メタデータの一部（ＭＰＵメタデータ、ＭＦメタデータのどちらか一方）が伝送されない場合に、ＭＰ４構成情報を示さないとしてもよい。 Note that in the above description, the transmitting device determines whether to indicate MP4 configuration information based on whether metadata (both MPU metadata and MF metadata) is transmitted. However, the transmitting device may not indicate the MP4 configuration information when part of the metadata (either MPU metadata or MF metadata) is not transmitted.

また、送信装置は、メタデータ以外の他の情報に基づいてＭＰ４構成情報を示すかどうかを決定してもよい。 Furthermore, the transmitting device may determine whether to indicate MP4 configuration information based on information other than metadata.

例えば、ＭＰ４サポートモード／ＭＰ４非サポートモードのようなモードが定義され、送信装置は、ＭＰ４サポートモードの場合には、データユニットヘッダにおいてＭＰ４構成情報を示し、ＭＰ４非サポートモードの場合には、データユニットヘッダにおいてＭＰ４構成情報を示さないとしてもよい。また、送信装置は、ＭＰ４サポートモードの場合には、メタデータを伝送し、かつデータユニットヘッダにおいてＭＰ４構成情報を示し、ＭＰ４非サポートモードの場合には、メタデータを伝送せずにデータユニットヘッダにおいてもＭＰ４構成情報を示さないとしてもよい。 For example, modes such as MP4 support mode/MP4 non-support mode are defined, and the transmitting device indicates the MP4 configuration information in the data unit header in case of MP4 support mode and the data unit header in case of MP4 non-support mode. MP4 configuration information may not be indicated in the unit header. Further, in the case of MP4 support mode, the transmitting device transmits metadata and indicates MP4 configuration information in the data unit header, and in the case of MP4 non-support mode, the transmitting device transmits metadata and indicates MP4 configuration information in the data unit header without transmitting metadata. Also, the MP4 configuration information may not be shown.

［送信装置の動作フロー］
次に、送信装置の動作フローについて説明する。図４９は、送信装置の動作フローである。 [Operation flow of transmitter]
Next, the operational flow of the transmitter will be explained. FIG. 49 is an operation flow of the transmitting device.

送信装置は、まず、メタデータを伝送するかどうかを判定する（Ｓ１００１）。送信装置は、メタデータを伝送すると判定した場合（Ｓ１００２でＹｅｓ）、ステップＳ１００３へ移行し、ＭＰ４構成情報を生成し、かつ、ヘッダに格納して伝送する（Ｓ１００３）。この場合、送信装置は、メタデータも生成し、かつ、伝送する。 The transmitting device first determines whether to transmit metadata (S1001). When the transmitting device determines to transmit metadata (Yes in S1002), the transmitting device moves to step S1003, generates MP4 configuration information, stores it in a header, and transmits it (S1003). In this case, the transmitting device also generates and transmits metadata.

一方、送信装置は、メタデータを伝送しないと判定した場合（Ｓ１００２でＮｏ）、ＭＰ４構成情報を生成せず、かつ、ヘッダにも格納せずに伝送する（Ｓ１００４）。この場合、送信装置は、メタデータを生成せず、伝送しない。 On the other hand, if the transmitting device determines not to transmit metadata (No in S1002), it transmits the MP4 configuration information without generating it or storing it in the header (S1004). In this case, the transmitting device does not generate or transmit metadata.

なお、ステップＳ１００１においてメタデータを伝送するかどうかは、あらかじめ定められていてもよいし、送信装置の内部でメタデータが生成されたかどうか、送信装置の内部でメタデータが伝送されているかどうかに基づいて判定されてもよい。 Note that whether or not to transmit metadata in step S1001 may be determined in advance, or may depend on whether metadata is generated within the transmitting device or whether metadata is transmitted within the transmitting device. The determination may be made based on

［受信装置の動作フロー］
次に、受信装置の動作フローについて説明する。図５０は、受信装置の動作フローである。 [Operation flow of receiving device]
Next, the operation flow of the receiving device will be explained. FIG. 50 is an operation flow of the receiving device.

受信装置は、まず、メタデータが伝送されているかどうかを判定する（Ｓ１１０１）。メタデータが伝送されているかどうかは、ＭＭＴＰパケットペイロードにおけるフラグメントタイプを監視することにより判定できる。また、伝送されているかどうかがあらかじめ定められていてもよい。 The receiving device first determines whether metadata is being transmitted (S1101). Whether metadata is being transmitted can be determined by monitoring the fragment type in the MMTP packet payload. Further, whether or not the information is being transmitted may be determined in advance.

受信装置は、メタデータが伝送されていると判定した場合（Ｓ１１０２でＹｅｓ）、ＭＰ４を再構成し、かつ、ＭＰ４構成情報を用いた復号処理を実行する（Ｓ１１０３）。一方、メタデータが伝送されていないと判定した場合（Ｓ１１０２でＮｏ）、ＭＰ４の再構成処理をせず、かつ、ＭＰ４構成情報を用いずに復号処理を実行する（Ｓ１１０４）。 If the receiving device determines that metadata is being transmitted (Yes in S1102), it reconfigures the MP4 and executes decoding processing using the MP4 configuration information (S1103). On the other hand, if it is determined that metadata has not been transmitted (No in S1102), the MP4 reconstruction process is not performed and the decoding process is executed without using the MP4 configuration information (S1104).

なお、受信装置は、これまで説明した方法を用いて、ＭＰ４構成情報を用いずにランダムアクセスポイントの検出、アクセスユニット先頭の検出、ＮＡＬユニット先頭の検出などをすることが可能であり、復号処理、パケットロスの検出、及びパケットロスからの復帰の処理をすることができる。 Note that the receiving device can detect a random access point, detect the beginning of an access unit, detect the beginning of a NAL unit, etc. without using the MP4 configuration information using the method described above, and can perform decoding processing. , detecting packet loss, and processing recovery from packet loss.

例えば、アクセスユニット先頭は、ａｇｇｒｅｇａｔｉｏｎ＿ｆｌａｇ値が１であるＭＭＴペイロードの先頭データである。このとき、Ｆｒａｇｍｅｎｔａｔｉｏｎ＿ｉｎｄｉｃａｔｏｒ値は０である。 For example, the beginning of the access unit is the beginning data of the MMT payload whose aggregation_flag value is 1. At this time, the Fragmentation_indicator value is 0.

また、スライスセグメントの先頭は、ａｇｇｒｅｇａｔｉｏｎ＿ｆｌａｇ値が０、ｆｒａｇｍｅｎｔａｔｉｏｎ＿ｉｎｄｉｃａｔｏｒ値が００或いは０１であるＭＭＴペイロードの先頭データである。 Further, the beginning of the slice segment is the beginning data of the MMT payload whose aggregation_flag value is 0 and the fragmentation_indicator value is 00 or 01.

受信装置は、以上のような情報に基づき、アクセスユニット先頭の検出、及び、スライスセグメントの検出を行うことができる。 The receiving device can detect the beginning of the access unit and the slice segment based on the above information.

なお、受信装置は、ｆｒａｇｍｅｎｔａｔｉｏｎ＿ｉｎｄｉｃａｔｏｒ値が００或いは０１であるデータユニットの先頭を含むパケットにおいて、ＮＡＬユニットヘッダを解析し、ＮＡＬユニットの種類がＡＵデリミタであること、及び、ＮＡＬユニットの種類がスライスセグメントであることを検出してもよい。 Note that the receiving device analyzes the NAL unit header in a packet containing the beginning of a data unit whose fragmentation_indicator value is 00 or 01, and determines that the type of the NAL unit is an AU delimiter and that the type of the NAL unit is a slice segment. It may be detected that

［放送シンプルモード］
これまでは、放送ストリーム再生に特化したデータの送信方法として、受信装置におけるＭＰ４構成情報をサポートしない方法を説明したが、放送ストリーム再生に特化したデータの送信方法は、これに限るものではない。 [Broadcast simple mode]
So far, we have explained a method that does not support MP4 configuration information in the receiving device as a data transmission method specialized for broadcast stream playback, but this is not the only method for transmitting data specialized for broadcast stream playback. do not have.

放送ストリーム再生に特化したデータの送信方法として、例えば下記の方法が用いられてもよい。 For example, the following method may be used as a data transmission method specialized for broadcast stream playback.

・送信装置は、放送の固定受信環境では、ＡＬ－ＦＥＣを用いなくてもよい。ＡＬ－ＦＥＣが用いられない場合は、ＭＭＴＰパケットヘッダにおけるＦＥＣ＿ｔｙｐｅは常に０固定とされる。 - The transmitter does not need to use AL-FEC in a fixed broadcast reception environment. When AL-FEC is not used, FEC_type in the MMTP packet header is always fixed to 0.

・送信装置は、放送の移動受信環境、及び、通信ＵＤＰ伝送モードにおいては、常にＡＬ－ＦＥＣを用いてもよい。ＡＬ－ＦＥＣが用いられる場合は、ＭＭＴＰパケットヘッダにおけるＦＥＣ＿ｔｙｐｅは、常に０或いは１である。 - The transmitter may always use AL-FEC in a broadcast mobile reception environment and in communication UDP transmission mode. When AL-FEC is used, FEC_type in the MMTP packet header is always 0 or 1.

・送信装置は、アセットのバルク伝送をしなくてもよい。アセットのバルク伝送がされない場合には、ＭＰＴ内部のアセットの伝送ローケーション数を示すｌｏｃａｔｉｏｎ＿ｉｎｆｏｌｏｃａｔｉｏｎは、１に固定されてよい。 - The transmitting device does not need to perform bulk transmission of assets. If bulk transmission of assets is not performed, location_infolocation indicating the number of transmission locations of assets within the MPT may be fixed to 1.

・送信装置は、アセット、プログラム、及びメッセージのハイブリッド伝送をしなくてもよい。 - The sending device does not have to perform hybrid transmission of assets, programs, and messages.

また、例えば、放送シンプルモードが規定され、送信装置は、放送シンプルモードである場合には、ＭＰ４非サポートモードとする、或いは、上記に示した放送ストリーム再生に特化したデータの送信方法を用いるとしてもよい。放送シンプルモードかどうかはあらかじめ定められていてもよいし、送信装置は、放送シンプルモードであることを示すフラグを制御情報として格納し、受信装置に伝送してもよい。 Further, for example, if the broadcast simple mode is defined and the transmitter is in the broadcast simple mode, the transmitting device may set the MP4 non-support mode, or use the data transmission method specialized for broadcast stream playback as described above. You can also use it as Whether or not it is broadcast simple mode may be determined in advance, or the transmitting device may store a flag indicating that it is broadcast simple mode as control information and transmit it to the receiving device.

また、送信装置は、図４９で説明した、ＭＰ４非サポートモードであるかどうか（メタデータが伝送されているかどうか）に基づいて、ＭＰ４非サポートモードである場合には、放送シンプルモードとして、上記に示した放送ストリーム再生に特化したデータの送信方法を用いてもよい。 Furthermore, based on whether or not the transmitting device is in the MP4 non-support mode (whether metadata is being transmitted or not), as explained in FIG. 49, if the transmitting device is in the MP4 non-support mode, The data transmission method specialized for broadcast stream playback shown in 2 may also be used.

受信装置は、放送シンプルモードである場合には、ＭＰ４非サポートモードであるとして、ＭＰ４を再構成せず復号処理をすることができる。 When the receiving device is in the broadcast simple mode, it is assumed that the receiving device is in an MP4 non-support mode and can perform decoding processing without reconstructing the MP4.

また、受信装置は、放送シンプルモードである場合には、放送に特化した機能であることを判定し、放送に特化して受信処理をすることができる。 Furthermore, when the receiving device is in the broadcast simple mode, it can determine that the function is specialized for broadcasting, and perform reception processing specialized for broadcasting.

これにより、放送シンプルモードである場合には、放送に特化した機能のみを用いることにより、送信装置及び受信装置にとって不要な処理を削減できるばかりでなく、不要な情報を圧縮して伝送しないことにより、伝送オーバーヘッドを削減することができる。 As a result, in the case of broadcast simple mode, by using only broadcast-specific functions, it is possible to not only reduce unnecessary processing for the transmitting device and receiving device, but also to avoid compressing and transmitting unnecessary information. Therefore, transmission overhead can be reduced.

なお、ＭＰ４非サポートモードが用いられる場合には、ＭＰ４構成以外の蓄積方法をサポートするヒント情報が示されてもよい。 Note that when the MP4 non-support mode is used, hint information that supports a storage method other than the MP4 configuration may be shown.

ＭＰ４構成以外の蓄積方法としては、例えば、ＭＭＴパケットやＩＰパケットをダイレクトに蓄積する方法や、ＭＭＴパケットをＭＰＥＧ－２ＴＳパケットに変換する方法などがある。 Examples of storage methods other than the MP4 configuration include a method of directly storing MMT packets and IP packets, and a method of converting MMT packets into MPEG-2 TS packets.

なお、ＭＰ４非サポートモードの場合には、ＭＰ４構成にしたがわないフォーマットが用いられてもよい。 Note that in the case of the MP4 non-support mode, a format that does not follow the MP4 configuration may be used.

例えば、ＭＦＵに格納されるデータは、ＭＰ４非サポートモードの場合には、ＭＰ４形式であるＮＡＬユニットの先頭にＮＡＬユニットのサイズがついた形式でなく、バイトスタートコードがついた形式にされてもよい。 For example, in the case of an MP4 non-support mode, the data stored in the MFU is not in the MP4 format with the NAL unit size appended to the beginning of the NAL unit, but in the format with a byte start code attached. good.

ＭＭＴでは、アセットのタイプを示すアセットタイプは、ＭＰ４ＲＥＧ（ｈｔｔｐ：／／ｗｗｗ．ｍｐ４ｒａ．ｏｒｇ）に登録される４ＣＣで記載され、映像信号としてＨＥＶＣを用いる場合、’ＨＥＶ１’または’ＨＶＣ１’が用いられる。’ＨＶＣ１’は、サンプルの中にパラメータセットを含んでもよい形式であり、’ＨＥＶ１’はサンプルの中にパラメータセットを含まず、ＭＰＵメタデータにおけるサンプルエントリにパラメータセットを含む形式である。 In MMT, the asset type indicating the type of asset is described in 4CC registered in MP4REG (http://www.mp4ra.org), and when HEVC is used as the video signal, 'HEV1' or 'HVC1' is used. It will be done. 'HVC1' is a format that may include a parameter set in the sample, and 'HEV1' is a format that does not include a parameter set in the sample but includes a parameter set in the sample entry in the MPU metadata.

放送シンプルモードまたはＭＰ４非サポートモードの場合において、ＭＰＵメタデータ及びＭＦメタデータが伝送されない場合には、必ずサンプルの中にパラメータセットを含めると規定されてもよい。また、アセットタイプに’ＨＥＶ１’と’ＨＶＣ１’のどちらが示されている場合にも、必ず’ＨＶＣ１’の形式をとると規定されてもよい。 In the case of broadcast simple mode or MP4 non-support mode, if MPU metadata and MF metadata are not transmitted, it may be specified that the parameter set is always included in the sample. Furthermore, it may be specified that the format is always 'HVC1' regardless of whether 'HEV1' or 'HVC1' is indicated as the asset type.

［補足１：送信装置］
以上のように、メタデータが送信されていない場合に、ＭＰ４構成情報をｒｅｓｅｒｖｅｄとし、運用しない送信装置は、図５１のように構成することも可能である。図５１は、送信装置の具体的構成の例を示す図である。 [Supplement 1: Transmitting device]
As described above, when metadata is not transmitted, the MP4 configuration information is set to reserved, and the transmitting device that is not operated can be configured as shown in FIG. 51. FIG. 51 is a diagram showing an example of a specific configuration of a transmitting device.

送信装置３００は、符号化部３０１と、付与部３０２と、送信部３０３とを備える。符号化部３０１、付与部３０２、及び送信部３０３のそれぞれは、例えば、マイクロコンピュータ、プロセッサ、または、専用回路などによって実現される。 The transmitting device 300 includes an encoding section 301, an adding section 302, and a transmitting section 303. Each of the encoding section 301, the adding section 302, and the transmitting section 303 is realized by, for example, a microcomputer, a processor, a dedicated circuit, or the like.

符号化部３０１は、映像信号または音声信号を符号化してサンプルデータを生成する。サンプルデータは、具体的には、データユニットである。 The encoding unit 301 encodes a video signal or an audio signal to generate sample data. Specifically, the sample data is a data unit.

付与部３０２は、映像信号または音声信号が符号化されたデータであるサンプルデータに、ＭＰ４構成情報を含むヘッダ情報を付与する。ＭＰ４構成情報は、受信側において当該サンプルデータをＭＰ４フォーマットのファイルとして再構成するための情報であって、サンプルデータの提示時刻が定められているか否かに応じて内容が異なる情報である。 The adding unit 302 adds header information including MP4 configuration information to sample data, which is data obtained by encoding a video signal or an audio signal. The MP4 configuration information is information for reconstructing the sample data as an MP4 format file on the receiving side, and the content differs depending on whether or not the presentation time of the sample data is determined.

上述のように、付与部３０２は、提示時刻が定められているサンプルデータ（同期に関する情報を含むサンプルデータ）の一例であるｔｉｍｅｄ－ＭＦＵのヘッダ（ヘッダ情報）に、ｍｏｖｉｅ＿ｆｒａｇｍｅｎｔ＿ｓｅｑｕｅｎｃｅ＿ｎｕｍｂｅｒ、ｓａｍｐｌｅ＿ｎｕｍｂｅｒ、ｏｆｆｓｅｔ、ｐｒｉｏｒｉｔｙ、及び、ｄｅｐｅｎｄｅｎｃｙ＿ｃｏｕｎｔｅｒなどのＭＰ４構成情報を含める。 As described above, the adding unit 302 adds movie_fragment_sequence_number, sample_number, offset, priority to the header (header information) of timed-MFU, which is an example of sample data (sample data including information regarding synchronization) for which the presentation time is determined. , and MP4 configuration information such as dependency_counter.

一方で、付与部３０２は、提示時刻が定められていないサンプルデータ（同期に関する情報を含まないサンプルデータ）の一例である、ｔｉｍｅｄ－ＭＦＵのヘッダ（ヘッダ情報）には、ｉｔｅｍ＿ｉｄなどのＭＰ４構成情報を含める。 On the other hand, the adding unit 302 adds MP4 configuration information such as item_id to the header (header information) of timed-MFU, which is an example of sample data for which the presentation time is not determined (sample data that does not include information regarding synchronization). Include.

そして、付与部３０２は、送信部３０３によってサンプルデータに対応するメタデータが送信されない場合（例えば、図２１の（ｂ）のような場合）には、サンプルデータの提示時刻が定められているか否かに応じて、ＭＰ４構成情報を含まないヘッダ情報をサンプルデータに付与する。 Then, if the transmitting unit 303 does not transmit metadata corresponding to the sample data (for example, in a case like (b) in FIG. 21), the assigning unit 302 determines whether the presentation time of the sample data has been determined Depending on the situation, header information that does not include MP4 configuration information is added to the sample data.

付与部３０２は、具体的には、サンプルデータの提示時刻が定められている場合には、第一のＭＰ４構成情報を含まないヘッダ情報をサンプルデータに付与し、サンプルデータの提示時刻が定められていない場合には、第二のＭＰ４構成情報を含むヘッダ情報を前記サンプルデータに付与する。 Specifically, when the presentation time of the sample data is determined, the adding unit 302 adds header information that does not include the first MP4 configuration information to the sample data, and when the presentation time of the sample data is determined. If not, header information including second MP4 configuration information is added to the sample data.

例えば、付与部３０２は、図４９のステップＳ１００４に示されるように、送信部３０３によってサンプルデータに対応するメタデータが送信されない場合には、ＭＰ４構成情報をｒｅｓｅｒｖｅｄ（固定値）とすることにより、ＭＰ４構成情報を実質的に生成せず、かつ、実質的にヘッダ（ヘッダ情報）に格納しない。なお、メタデータには、ＭＰＵメタデータ、及び、ムービーフラグメントメタデータが含まれる。 For example, as shown in step S1004 in FIG. 49, when the transmitting unit 303 does not transmit the metadata corresponding to the sample data, the providing unit 302 sets the MP4 configuration information to reserved (fixed value). MP4 configuration information is not substantially generated and is not substantially stored in the header (header information). Note that the metadata includes MPU metadata and movie fragment metadata.

送信部３０３は、ヘッダ情報が付与されたサンプルデータを送信する。送信部３０３は、より具体的には、ヘッダ情報が付与されたサンプルデータをＭＭＴ方式でパケット化して送信する。 The transmitter 303 transmits sample data to which header information is added. More specifically, the transmitter 303 packetizes the sample data to which header information has been added using the MMT method and transmits the packet.

上述のように、放送ストリームの再生に特化した送信方法及び受信方法では、受信装置でデータユニットをＭＰ４に再構成する必要はない。受信装置がＭＰ４に再構成する必要がない場合、ＭＰ４構成情報などの不要な情報を生成しないことで送信装置の処理は軽減される。 As described above, in the transmission method and reception method specialized for playing back broadcast streams, there is no need for the receiving device to reconfigure the data units into MP4. If the receiving device does not need to reconfigure to MP4, the processing of the transmitting device is reduced by not generating unnecessary information such as MP4 configuration information.

一方で、送信装置は、必要な情報は送らなくてはならないが、余計な追加情報などを別途送信しなくて済むように、規格との整合性は保つ必要がある。 On the other hand, although the transmitting device must transmit the necessary information, it must maintain consistency with the standard so that it does not have to separately transmit unnecessary additional information.

送信装置３００のような構成によれば、ＭＰ４構成情報が格納される領域を固定値にすることなどにより、ＭＰ４構成情報を送信せず、必要な情報のみを規格に基づいて送信し、余計な追加情報を送信しなくて済む効果が得られる。つまり、送信装置の構成及び送信装置の処理量を削減することができる。また、不要なデータが送信されないことにより、伝送効率を向上させることができる。 According to the configuration of transmitting device 300, by setting the area where MP4 configuration information is stored to a fixed value, MP4 configuration information is not transmitted, only necessary information is transmitted based on the standard, and unnecessary information is transmitted. This has the effect of not having to send additional information. In other words, the configuration of the transmitting device and the amount of processing of the transmitting device can be reduced. Furthermore, since unnecessary data is not transmitted, transmission efficiency can be improved.

［補足２：受信装置］
また、送信装置３００に対応する受信装置は、例えば、図５２のように構成されてもよい。図５２は、受信装置の構成の別の例を示す図である。 [Supplement 2: Receiving device]
Further, the receiving device corresponding to the transmitting device 300 may be configured as shown in FIG. 52, for example. FIG. 52 is a diagram showing another example of the configuration of the receiving device.

受信装置４００は、受信部４０１と、復号部４０２とを備える。受信部４０１、及び、復号部４０２は、例えば、マイクロコンピュータ、プロセッサ、または、専用回路などによって実現される。 Receiving device 400 includes a receiving section 401 and a decoding section 402. The receiving section 401 and the decoding section 402 are realized by, for example, a microcomputer, a processor, a dedicated circuit, or the like.

受信部４０１は、映像信号または音声信号が符号化されたデータであるサンプルデータであって、当該サンプルデータをＭＰ４フォーマットのファイルとして再構成するためのＭＰ４構成情報を含むヘッダ情報が付与されたサンプルデータを受信する。 The receiving unit 401 receives sample data, which is data obtained by encoding a video signal or an audio signal, to which header information including MP4 configuration information for reconstructing the sample data as an MP4 format file is added. Receive data.

復号部４０２は、受信部によってサンプルデータに対応するメタデータが受信されなかった場合であって、サンプルデータの提示時刻が定められている場合に、ＭＰ４構成情報を使用せずにサンプルデータを復号する。 The decoding unit 402 decodes the sample data without using MP4 configuration information when the metadata corresponding to the sample data is not received by the receiving unit and the presentation time of the sample data is determined. do.

例えば、復号部４０２は、図５０のステップＳ１１０４に示されるように、受信部４０１によってサンプルデータに対応するメタデータが受信されない場合には、ＭＰ４構成情報を用いずに復号処理を実行する。 For example, as shown in step S1104 in FIG. 50, if metadata corresponding to the sample data is not received by the receiving unit 401, the decoding unit 402 executes the decoding process without using the MP4 configuration information.

これにより、受信装置４００の構成及び受信装置４００における処理量を削減することができる。 Thereby, the configuration of the receiving device 400 and the amount of processing in the receiving device 400 can be reduced.

（実施の形態４）
［概要］
実施の形態４では、ファイルなど同期に関する情報を含まない非同期（ｎｏｎ－ｔｉｍｅｄ）メディアのＭＰＵへの格納方法と、ＭＭＴＰパケットでの伝送方法について説明する。なお、実施の形態４では、ＭＭＴにおけるＭＰＵを例に説明するが、同じＭＰ４ベースであるＤＡＳＨでも適用可能である。 (Embodiment 4)
[overview]
In the fourth embodiment, a method for storing non-timed media such as files that do not include information regarding synchronization in the MPU and a method for transmitting them using MMTP packets will be described. In the fourth embodiment, an MPU in MMT will be described as an example, but it is also applicable to DASH, which is also based on MP4.

まず、ｎｏｎ－ｔｉｍｅｄメディア（以下、「非同期メディアデータ」とも言う）のＭＰＵへの格納方法の詳細について図５３を用いて説明する。図５３は、ｎｏｎ－ｔｉｍｅｄメディアのＭＰＵへの格納方法、及び、ＭＭＴＰパケットでの伝送方法を示す図である。 First, the details of the method for storing non-timed media (hereinafter also referred to as "asynchronous media data") in the MPU will be explained using FIG. 53. FIG. 53 is a diagram showing a method of storing non-timed media in the MPU and a method of transmitting it using MMTP packets.

ｎｏｎ－ｔｉｍｅｄメディアを格納するＭＰＵは、ｆｔｙｐ、ｍｍｐｕ、ｍｏｏｖ、ｍｅｔａなどのボックスで構成され、ＭＰＵに格納するファイルに関する情報が格納される。ｍｅｔａボックス内には複数のｉｄａｔボックスを格納することができ、ｉｄａｔボックスには１つのファイルがｉｔｅｍとして格納される。 The MPU that stores non-timed media is composed of boxes such as ftyp, mmpu, moov, and meta, and stores information regarding files to be stored in the MPU. A plurality of idat boxes can be stored in the meta box, and one file is stored as an item in the idat box.

ｆｔｙｐ，ｍｍｐｕ，ｍｏｏｖ，ｍｅｔａボックスの一部はＭＰＵメタデータとして一つのデータユニットを構成し、ｉｔｅｍ或いはｉｄａｔボックスはＭＦＵとしてデータユニットを構成する。 Part of the ftyp, mmpu, moov, and meta boxes constitute one data unit as MPU metadata, and the item or idat box constitutes a data unit as MFU.

データユニットはアグリゲーションやフラグメントされた後、データユニットヘッダ、ＭＭＴＰペイロードヘッダ、及びＭＭＴＰパケットヘッダが付与されてＭＭＴＰパケットとして伝送される。 After the data unit is aggregated or fragmented, a data unit header, an MMTP payload header, and an MMTP packet header are added to the data unit, and the data unit is transmitted as an MMTP packet.

なお、図５３では、Ｆｉｌｅ＃１とＦｉｌｅ＃２とが１つのＭＰＵに格納される例を示している。ＭＰＵメタデータは分割されておらず、また、ＭＦＵは分割されてＭＭＴＰパケットに格納されているが、これに限るものではなく、データユニットのサイズに応じてアグリゲーションやフラグメントされてもよい。また、ＭＰＵメタデータは伝送されなくてもよく、その場合は、ＭＦＵのみが伝送される。 Note that FIG. 53 shows an example in which File #1 and File #2 are stored in one MPU. Although the MPU metadata is not divided and the MFU is divided and stored in the MMTP packet, the present invention is not limited to this, and the data unit may be aggregated or fragmented depending on the size of the data unit. Also, the MPU metadata may not be transmitted, in which case only the MFU is transmitted.

データユニットヘッダなどのヘッダ情報には、ｉｔｅｍＩＤ（アイテムを一意に特定する識別子）が示され、ＭＭＴＰペイロードヘッダやＭＭＴＰパケットヘッダには、パケットシーケンス番号（パケット毎のシーケンス番号）、及び、ＭＰＵシーケンス番号（ＭＰＵのシーケンス番号、アセット内で一意の番号。）が含まれる。 Header information such as a data unit header indicates an itemID (an identifier that uniquely identifies an item), and the MMTP payload header and MMTP packet header include a packet sequence number (sequence number for each packet) and an MPU sequence number. (MPU sequence number, unique number within the asset).

なお、データユニットヘッダ以外のＭＭＴＰペイロードヘッダやＭＭＴＰパケットヘッダのデータ構造はこれまで説明したｔｉｍｅｄメディア（以下、「同期メディアデータ」とも言う。）と同様であり、ａｇｇｒｅｇａｔｉｏｎ＿ｆｌａｇ，ｆｒａｇｍｅｎｔａｔｉｏｎ＿ｉｎｄｉｃａｔｏｒ，ｆｒａｇｍｅｎｔ＿ｃｏｕｎｔｅｒなどが含まれる。 Note that the data structure of the MMTP payload header and MMTP packet header other than the data unit header is the same as that of the timed media (hereinafter also referred to as "synchronous media data") described so far, and includes aggregation_flag, fragmentation_indicator, fragment_counter, etc. .

次に、ファイル（＝Ｉｔｅｍ＝ＭＦＵ）を分割してパケット化する場合のヘッダ情報の具体例をについて、図５４及び図５５を用いて説明する。 Next, a specific example of header information when dividing a file (=Item=MFU) into packets will be described using FIGS. 54 and 55.

図５４及び図５５は、ファイルが分割されることにより得られた複数の分割データ毎にパケット化して伝送する例を示す図である。図５４及び図５５は、具体的には、分割されたＭＭＴＰパケット毎のヘッダ情報であるデータユニットヘッダ、ＭＭＴＰペイロードヘッダ、ＭＭＴＰパケットヘッダのいずれかに含まれる情報（パケットシーケンス番号、フラグメントカウンタ、フラグメンテーションインジケータ、ＭＰＵシーケンス番号、アイテムＩＤ）を示す。なお、図５４は、Ｆｉｌｅ＃１がＭ個（Ｍ＜＝２５６）に分割される例を示す図であり、図５５は、Ｆｉｌｅ＃２がＮ個（２５６＜Ｎ）に分割される例を示す図である。 FIGS. 54 and 55 are diagrams showing an example in which a plurality of pieces of divided data obtained by dividing a file are packetized and transmitted. Specifically, FIGS. 54 and 55 show information (packet sequence number, fragment counter, fragmentation indicator, MPU sequence number, item ID). Note that FIG. 54 is a diagram showing an example in which File #1 is divided into M pieces (M<=256), and FIG. 55 is a diagram showing an example in which File #2 is divided into N pieces (256<N). FIG.

分割データ番号は、ファイル先頭からの分割データのインデックスを示しており、この情報は伝送されない。つまり、分割データ番号は、ヘッダ情報には含まれない。また、分割データ番号は、ファイルを分割することにより得られた複数の分割データのそれぞれに対応するパケットに付与される番号であり、先頭のパケットから昇順に１ずつ加算されて付与される番号である。 The divided data number indicates the index of divided data from the beginning of the file, and this information is not transmitted. In other words, the divided data number is not included in the header information. In addition, the divided data number is a number assigned to a packet corresponding to each of a plurality of divided data obtained by dividing a file, and is a number that is added by 1 in ascending order starting from the first packet. be.

パケットシーケンス番号は、同じパケットＩＤを持つパケットのシーケンス番号であり、図５４及び図５５では、ファイル先頭の分割データをＡとして、ファイル最後の分割データまで連続した番号が付与される。パケットシーケンス番号は、ファイル先頭の分割データから昇順に１ずつ加算されて付与される番号であり、分割データ番号に対応する番号である。 The packet sequence number is a sequence number of packets having the same packet ID, and in FIGS. 54 and 55, consecutive numbers are assigned from the first divided data of the file to A to the last divided data of the file. The packet sequence number is a number that is added by 1 in ascending order starting from the divided data at the beginning of the file, and is a number that corresponds to the divided data number.

フラグメントカウンタは、１つのファイルが分割されることにより得られた複数の分割データのうち当該分割データよりも後にある複数の分割データの数を示す。また、フラグメントカウンタは、１つのファイルを分割することにより得られた複数の分割データの数である分割データ数が２５６を超える場合、分割データ数を２５６で除した余りを示す。図５４の例では、分割データ数が２５６以下であるため、フラグメントカウンタのフィールド値は（Ｍ－分割データ番号）となる。一方、図５５の例では、分割データ数が２５６を超えるため、（Ｎ－分割データ番号）を２５６で除した値（（Ｎ－分割データ番号）％２５６）となる。 The fragment counter indicates the number of pieces of divided data that come after the piece of divided data among the pieces of divided data obtained by dividing one file. Furthermore, when the number of divided data, which is the number of multiple pieces of divided data obtained by dividing one file, exceeds 256, the fragment counter indicates the remainder when the number of divided data is divided by 256. In the example of FIG. 54, since the number of divided data is 256 or less, the field value of the fragment counter is (M-divided data number). On the other hand, in the example of FIG. 55, the number of divided data exceeds 256, so the value is (N-divided data number) divided by 256 ((N-divided data number)%256).

なお、フラグメンテーションインジケータは、ＭＭＴＰパケットに格納するデータの分割の状態を示し、分割されたデータユニットにおける先頭の分割データであるか、最後の分割データであるか、それ以外の分割データであるか、或いは、分割されていない１つ以上のデータユニットであるかを示す値である。具体的には、フラグメンテーションインジケータは、先頭の分割データでは「０１」であり、最後の分割データでは「１１」であり、残りの分割データでは「１０」であり、分割されていないデータユニットでは「００」である。 Note that the fragmentation indicator indicates the state of division of data stored in the MMTP packet, and indicates whether it is the first divided data, the last divided data, or other divided data in the divided data unit. Alternatively, it is a value indicating whether the data unit is one or more undivided data units. Specifically, the fragmentation indicator is "01" for the first segmented data, "11" for the last segmented data, "10" for the remaining segmented data, and "10" for the unfragmented data unit. 00".

本実施の形態では、分割データ数が２５６を超える場合、分割データ数を２５６で除した余りを示すとして説明するが、分割データ数は２５６に限らず、他の数（所定の数）であってもよい。 In this embodiment, when the number of divided data exceeds 256, the remainder after dividing the number of divided data by 256 will be described. However, the number of divided data is not limited to 256, and may be any other number (predetermined number) It's okay.

図５４及び図５５に示すようにファイルを分割し、ファイルを分割することにより得られた複数の分割データのそれぞれに従来のヘッダ情報を付与して伝送する場合、受信装置において、受信したＭＭＴＰパケットに格納されるデータが、元のファイルにおいて何番目の分割データであるか（分割データ番号）、及び、ファイルの分割データ数、または、分割データ番号及び分割データ数を導出できる情報がない。このため、従来の伝送方法では、ＭＭＴＰパケットを受信しても、受信したＭＭＴＰパケットに格納されるデータの分割データ番号や分割データ数を一意に検出することができない。 When a file is divided as shown in FIGS. 54 and 55, and conventional header information is added to each of a plurality of pieces of divided data obtained by dividing the file and transmitted, the receiving device receives a received MMTP packet. There is no information from which to derive the number of divided data in the original file (divided data number), the number of divided data of the file, or the divided data number and the number of divided data. Therefore, in the conventional transmission method, even if an MMTP packet is received, it is not possible to uniquely detect the divided data number or the number of divided data of the data stored in the received MMTP packet.

例えば、図５４のように分割データ数が２５６以下であり、分割データ数があらかじめ２５６以下であることが既知である場合には、フラグメントカウンタを参照することにより、分割データ番号や分割データ数を特定することが可能である。しかし、分割データ数が２５６以上である場合には、分割データ番号や分割データ数を特定できない。 For example, if the number of divided data is 256 or less as shown in Figure 54, and it is known in advance that the number of divided data is 256 or less, the divided data number and the number of divided data can be determined by referring to the fragment counter. It is possible to specify. However, if the number of divided data is 256 or more, the divided data number or the number of divided data cannot be specified.

なお、ファイルの分割データ数を２５６以下に制限する場合、１つのパケットで伝送可能なデータサイズをｘ［ｂｙｔｅｓ］とした場合、伝送可能なファイルの最大サイズはｘ＊２５６［ｂｙｔｅｓ］に制限される。例えば、放送ではｘ＝４ｋ［ｂｙｔｅｓ］が想定されており、この場合、伝送可能なファイルの最大サイズは４ｋ＊２５６＝１Ｍ［ｂｙｔｅｓ］に制限される。従って、１［Ｍｂｙｔｅｓ］を越えるファイルを伝送したい場合にはファイルの分割データ数を２５６以下に制限することはできない。 In addition, when limiting the number of divided data of a file to 256 or less, if the data size that can be transmitted in one packet is x [bytes], the maximum size of the file that can be transmitted is limited to x * 256 [bytes]. Ru. For example, in broadcasting, x=4k [bytes] is assumed, and in this case, the maximum size of a file that can be transmitted is limited to 4k*256=1M [bytes]. Therefore, when it is desired to transmit a file exceeding 1 [Mbytes], it is not possible to limit the number of divided data of the file to 256 or less.

また、例えば、フラグメンテーションインジケータを参照することでファイルの先頭の分割データや最後の分割データを検出することができるため、ファイルの最後の分割データを含むＭＭＴＰパケットが受信されるまでＭＭＴＰパケット数をカウントしたり、ファイルの最後の分割データを含むＭＭＴＰパケットを受信したりした後に、パケットシーケンス番号と組み合わせることにより、分割データ番号や分割データ数を算出することは可能であるため、フラグメンテーションインジケータとパケットシーケンス番号とを組み合わせることにより分割データ番号及び分割データ数をシグナリングするとしてもよい。しかし、ファイルの途中の分割データ（つまり、ファイルの先頭の分割データでもファイルの最後の分割データでもない分割データ）を含むＭＭＴＰパケットから受信を開始した場合、当該分割データの分割データ番号や分割データ数を特定できない。当該分割データの分割データ番号や分割データ数は、ファイル最後の分割データを含むＭＭＴＰパケットを受信後に初めて特定できる。 Also, for example, by referring to the fragmentation indicator, it is possible to detect the beginning fragmented data and the last fragmented data of a file, so the number of MMTP packets is counted until the MMTP packet containing the last fragmented data of the file is received. It is possible to calculate the fragment data number and the number of fragment data by combining it with the packet sequence number after receiving the MMTP packet containing the last fragment data of the file. The divided data number and the number of divided data may be signaled by combining the numbers. However, if reception starts from an MMTP packet that includes divided data in the middle of a file (that is, divided data that is not the first divided data of the file or the last divided data of the file), the divided data number of the divided data and the divided data I can't determine the number. The divided data number and number of divided data of the divided data can be specified only after receiving the MMTP packet containing the last divided data of the file.

図５４及び図５５で説明した課題、つまり、ファイルの分割データを含むパケットを途中から受信した時点で、ファイルの分割データの分割データ番号および分割データ数を一意に決定するために、次の方法を用いる。 In order to solve the problem explained in FIGS. 54 and 55, that is, to uniquely determine the divided data number and the number of divided data of the divided data of a file when a packet containing divided data of a file is received midway, the following method is used. Use.

まず、分割データ番号について説明する。 First, the divided data numbers will be explained.

分割データ番号については、ファイル（ｉｔｅｍ）の先頭の分割データにおけるパケットシーケンス番号をシグナリングする。 As for the divided data number, the packet sequence number in the first divided data of the file (item) is signaled.

シグナリングする方法として、ファイルを管理する制御情報に格納する。具体的には、図５４及び図５５においてファイルの先頭の分割データのパケットシーケンス番号Ａを、制御情報に格納する。受信装置では、制御情報よりＡの値を取得し、パケットヘッダに示されるパケットシーケンス番号より分割データ番号を算出する。 As a signaling method, it is stored in the control information that manages the file. Specifically, in FIGS. 54 and 55, the packet sequence number A of the divided data at the beginning of the file is stored in the control information. The receiving device obtains the value of A from the control information and calculates the divided data number from the packet sequence number indicated in the packet header.

分割データの分割データ番号は、当該分割データのパケットシーケンス番号から先頭の分割データのパケットシーケンス番号Ａを減ずることにより求められる。 The divided data number of the divided data is obtained by subtracting the packet sequence number A of the first divided data from the packet sequence number of the divided data.

ファイルを管理する制御情報としては、例えば、ＡＲＩＢＳＴＤ－Ｂ６０に規定されるアセット管理テーブルがある。アセット管理テーブルには、ファイル毎に、ファイルサイズやバージョン情報などが示され、データ伝送メッセージに格納されて伝送される。図５６は、アセット管理テーブルにおけるファイル毎のループのシンタックスを示す図である。 As control information for managing files, for example, there is an asset management table defined in ARIB STD-B60. The asset management table shows file size, version information, etc. for each file, and is stored and transmitted in a data transmission message. FIG. 56 is a diagram showing the syntax of a loop for each file in the asset management table.

既存のアセット管理テーブルの領域が拡張できない場合には、アイテムの情報を示すｉｔｅｍ＿ｉｎｆｏ＿ｂｙｔｅフィールドの一部の３２ｂｉｔの領域を用いてシグナリングしてもよい。ｉｔｅｍ＿ｉｎｆｏ＿ｂｙｔｅの一部領域に、ファイル（ｉｔｅｍ）の先頭の分割データにおけるパケットシーケンス番号が示されているかどうかを示すフラグを制御情報の例えばｒｅｓｅｒｖｅｄ＿ｆｕｔｕｒｅ＿ｕｓｅフィールドに示してもよい。 If the area of the existing asset management table cannot be expanded, signaling may be performed using a 32-bit area that is part of the item_info_byte field that indicates item information. A flag indicating whether a packet sequence number in the first divided data of the file (item) is indicated in a partial area of item_info_byte may be indicated in, for example, the reserved_future_use field of the control information.

データカルーセルなどファイルを繰り返し伝送する場合には、複数のパケットシーケンス番号を示してもよいし、直後に伝送するファイルの先頭のパケットシーケンス番号を示してもよい。 When repeatedly transmitting a file such as a data carousel, a plurality of packet sequence numbers may be indicated, or the first packet sequence number of the file to be transmitted immediately after may be indicated.

ファイル先頭の分割データのパケットシーケンス番号に限らず、ファイルの分割データ番号とパケットシーケンス番号を紐づける情報であればよい。 The information is not limited to the packet sequence number of the divided data at the beginning of the file, but may be any information that links the divided data number of the file and the packet sequence number.

次に、分割データ数について説明する。 Next, the number of divided data will be explained.

アセット管理テーブルに含まれるファイル毎のループの順番を、ファイルの伝送順と規定してもよい。これにより、伝送順で連続する２つのファイルの先頭のパケットシーケンス番号がわかるため、後に伝送されるファイルの先頭パケットシーケンス番号から前に伝送されるファイルの先頭パケットシーケンス番号を減ずることにより、前に伝送されるファイルの分割データ数を特定することができる。つまり、例えば、図５４に示すＦｉｌｅ＃１と図５５に示すＦｉｌｅ＃２とがこの順に連続したファイルである場合には、Ｆｉｌｅ＃１の最後のパケットシーケンス番号とＦｉｌｅ＃２の先頭のパケットシーケンス番号とは連続した番号が付与されている。 The loop order for each file included in the asset management table may be defined as the file transmission order. This allows you to know the first packet sequence number of two consecutive files in the transmission order, so by subtracting the first packet sequence number of the file transmitted earlier from the first packet sequence number of the file transmitted later, It is possible to specify the number of divided data of the file to be transmitted. That is, for example, if File #1 shown in FIG. 54 and File #2 shown in FIG. 55 are consecutive files in this order, the last packet sequence number of File #1 and the first packet sequence of File #2 Consecutive numbers are assigned.

また、ファイルの分割方法を規定することにより、ファイルの分割データ数を特定できるように規定してもよい。例えば、分割データ数がＮの場合、１～（Ｎ－１）番目の分割データのそれぞれのサイズはＬとし、Ｎ番目の分割データのサイズは端数（ｉｔｅｍ＿ｓｉｚｅ－Ｌ＊（Ｎ－１））と規定することにより、アセット管理テーブルに示されるｉｔｅｍ＿ｓｉｚｅから分割データ数を逆算できる。この場合、（ｉｔｅｍ＿ｓｉｚｅ／Ｌ）を切り上げた整数値が分割データ数となる。なお、ファイルの分割方法は、これに限るものではない。 Further, by specifying the file division method, it may be specified that the number of data to be divided into a file can be specified. For example, if the number of divided data is N, the size of each of the 1st to (N-1)th divided data is L, and the size of the Nth divided data is a fraction (item_size-L*(N-1)). By specifying this, the number of divided data can be calculated backward from item_size shown in the asset management table. In this case, the integer value obtained by rounding up (item_size/L) becomes the number of divided data. Note that the file division method is not limited to this.

また、分割データ数を直接アセット管理テーブルに格納してもよい。 Alternatively, the number of divided data may be directly stored in the asset management table.

受信装置では、上記の方法を用いることにより、制御情報を受信し、制御情報に基づいて分割データ数を算出する。また、制御情報に基づいてファイルの分割データ番号に対応するパケットシーケンス番号を算出できる。なお、制御情報の受信のタイミングより、分割データのパケットの受信のタイミングが早い場合は、制御情報を受信したタイミングで分割データ番号や分割データ数を算出してもよい。 The receiving device receives the control information and calculates the number of divided data based on the control information by using the above method. Furthermore, the packet sequence number corresponding to the divided data number of the file can be calculated based on the control information. Note that if the timing of receiving the divided data packet is earlier than the timing of receiving the control information, the divided data number and the number of divided data may be calculated at the timing of receiving the control information.

なお、上記方法を用いて分割データ番号、或いは分割データ数をシグナリングする場合、フラグメントカウンタに基づいて分割データ番号や分割データ数を特定することはなく、フラグメントカウンタは不要なデータとなる。そこで、非同期メディアの伝送において、上記方法等を用いて分割データ番号、および、分割データ数を特定できる情報がシグナリングされている場合には、フラグメントカウンタは運用しない、或いは、ヘッダ圧縮してもよい。これにより、送信装置や受信装置の処理量を削減することができ、伝送効率を向上させることもできる。つまり、非同期メディアを送信する場合には、フラグメントカウンタをｒｅｓｅｒｖｅｄ（無効化）としてもよい。具体的には、フラグメントカウンタの値を例えば「０」の固定値としてもよい。また、非同期メディアを受信する場合には、フラグメントカウンタを無視してもよい。 Note that when signaling the divided data number or the number of divided data using the above method, the divided data number or the number of divided data is not specified based on the fragment counter, and the fragment counter becomes unnecessary data. Therefore, in the transmission of asynchronous media, if the fragment data number and information that can identify the number of fragment data are signaled using the above method, etc., the fragment counter may not be operated, or the header may be compressed. . Thereby, the processing amount of the transmitting device and the receiving device can be reduced, and transmission efficiency can also be improved. That is, when transmitting asynchronous media, the fragment counter may be reserved (invalidated). Specifically, the value of the fragment counter may be set to a fixed value of "0", for example. Additionally, the fragment counter may be ignored when receiving asynchronous media.

映像や音声などの同期メディアを格納する場合においては、送出装置におけるＭＭＴＰパケットの送信順と受信装置におけるＭＭＴＰパケットの到着順が一致しており、かつ、パケットが再送されない。このような場合において、パケットロスを検出して再構成する必要がない場合には、フラグメントカウンタを運用しないとしてもよい。言い換えると、この場合では、フラグメントカウンタをｒｅｓｅｒｖｅｄ（無効化）としてもよい。 When storing synchronous media such as video and audio, the order of transmission of MMTP packets at the sending device and the order of arrival of MMTP packets at the receiving device match, and the packets are not retransmitted. In such a case, if there is no need to detect packet loss and reconfigure it, the fragment counter may not be operated. In other words, in this case, the fragment counter may be reserved (invalidated).

また、フラグメントカウンタを用いなくても、ランダムアクセスポイントの検出、アクセスユニット先頭の検出、ＮＡＬユニット先頭の検出などをすることが可能であり、復号処理や、パケットロスの検出、パケットロスからの復帰の処理をすることができる。 Furthermore, even without using a fragment counter, it is possible to detect random access points, detect the beginning of an access unit, detect the beginning of a NAL unit, etc., and perform decoding processing, detect packet loss, and recover from packet loss. can be processed.

また、ライブ放送などのリアルタイムなコンテンツの伝送では、より低遅延な伝送が求められ、符号化が完了したデータから順次パケット化して送出することが求められる。しかし、リアルタイムなコンテンツの伝送において、従来のフラグメントカウンタでは、先頭の分割データの送出時に分割データ数は決定できないため、先頭の分割データの送出は、データユニットの符号化がすべて完了し、分割データ数が決定した後となり、遅延が発生する。このような場合であっても、上記の方法を用いて、フラグメントカウンタを運用しないことにより、この遅延を削減できる。 In addition, in the transmission of real-time content such as live broadcasting, transmission with lower delay is required, and data that has been encoded must be sequentially packetized and transmitted. However, in real-time content transmission, conventional fragment counters cannot determine the number of fragmented data when sending the first fragmented data. After the number has been determined, there will be a delay. Even in such a case, this delay can be reduced by using the above method and not operating the fragment counter.

図５７は、受信装置における分割データ番号を特定する動作フローである。 FIG. 57 is an operation flow for specifying a divided data number in a receiving device.

受信装置は、ファイルの情報が記載された制御情報を取得する（Ｓ１２０１）。受信装置は、制御情報にファイル先頭のパケットシーケンス番号が示されているかどうかを判定し（Ｓ１２０２）、制御情報にファイル先頭のパケットシーケンス番号が示されている場合には（Ｓ１２０２でＹｅｓ）、ファイルの分割データの分割データ番号に対応するパケットシーケンス番号を算出する（Ｓ１２０３）。そして、受信装置は、分割データが格納されたＭＭＴＰパケットを取得後、取得したＭＭＴＰパケットのパケットヘッダに格納されるパケットシーケンス番号からファイルの分割データ番号を特定する（Ｓ１２０４）。一方、受信装置は、制御情報にファイル先頭のパケットシーケンス番号が示されていない場合には（Ｓ１２０２でＮｏ）、ファイル最後の分割データが含まれるＭＭＴＰパケットを取得後、取得したＭＭＴＰパケットのパケットヘッダに格納されるフラグメントインジケータ、および、パケットシーケンス番号を用いて分割データ番号を特定する（Ｓ１２０５）。 The receiving device acquires control information in which file information is described (S1201). The receiving device determines whether the packet sequence number at the beginning of the file is indicated in the control information (S1202), and if the packet sequence number at the beginning of the file is indicated in the control information (Yes at S1202), the receiving device The packet sequence number corresponding to the divided data number of the divided data is calculated (S1203). After acquiring the MMTP packet in which the divided data is stored, the receiving device identifies the divided data number of the file from the packet sequence number stored in the packet header of the acquired MMTP packet (S1204). On the other hand, if the packet sequence number at the beginning of the file is not indicated in the control information (No in S1202), the receiving device acquires the MMTP packet that includes the last divided data of the file, and then reads the packet header of the acquired MMTP packet. The divided data number is specified using the fragment indicator stored in the packet sequence number and the fragment indicator stored in the packet sequence number (S1205).

図５８は、受信装置における分割データ数を特定する動作フローである。 FIG. 58 is an operation flow for specifying the number of divided data in the receiving device.

受信装置は、ファイルの情報が記載された制御情報を取得する（Ｓ１３０１）。受信装置は、制御情報にファイルの分割データ数を算出可能な情報が含まれているかどうかを判定し（Ｓ１３０２）、分割データ数を算出可能な情報が含まれていると判定した場合には、（Ｓ１３０２でＹｅｓ）、制御情報に含まれる情報に基づいて分割データ数を算出する（Ｓ１３０３）。一方、受信装置は、分割データ数を算出不可能であると判定した場合には（Ｓ１３０２でＮｏ）、ファイル最後の分割データが含まれるＭＭＴＰパケットを取得後、取得したＭＭＴＰパケットのパケットヘッダに格納されるフラグメントインジケータ、および、パケットシーケンス番号を用いて分割データ数を特定する（Ｓ１３０４）。 The receiving device acquires control information in which file information is described (S1301). The receiving device determines whether the control information includes information that allows calculation of the number of data divisions of a file (S1302), and if it is determined that information that allows calculation of the number of division data of a file is included, (Yes in S1302), the number of divided data is calculated based on the information included in the control information (S1303). On the other hand, if the receiving device determines that the number of divided data cannot be calculated (No in S1302), after acquiring the MMTP packet that includes the last divided data of the file, the receiving device stores it in the packet header of the acquired MMTP packet. The number of divided data is specified using the fragment indicator and the packet sequence number (S1304).

図５９は、送信装置においてフラグメントカウンタを運用するかどうかを決定するための動作フローである。 FIG. 59 is an operational flow for determining whether to operate a fragment counter in a transmitting device.

まず、送信装置は、伝送するメディア（以下、「メディアデータ」とも言う。）が同期メディアか、非同期メディアかを判定する（Ｓ１４０１）。 First, the transmitting device determines whether the medium to be transmitted (hereinafter also referred to as "media data") is synchronous media or asynchronous media (S1401).

ステップＳ１４０１での判定の結果が同期メディアである場合（Ｓ１４０２で同期メディア）、送信装置は、同期メディアを伝送する環境において送受信のＭＭＴＰパケット順が一致し、かつパケットロス時にパケット再構成が不要かどうかを判定する（Ｓ１４０３）。送信装置は、不要であると判定した場合（Ｓ１４０３でＹｅｓ）、フラグメントカウンタを運用しない（Ｓ１４０４）。一方、送信装置は、不要でないと判定した場合（Ｓ１４０３でＮｏ）、フラグメントカウンタを運用する（Ｓ１４０５）。 If the result of the determination in step S1401 is synchronous media (synchronous media in S1402), the transmitting device determines whether the MMTP packet order for transmission and reception matches in an environment for transmitting synchronous media, and whether packet reassembly is not required in the event of packet loss. It is determined whether or not (S1403). If the transmitting device determines that it is unnecessary (Yes in S1403), it does not operate the fragment counter (S1404). On the other hand, if the transmitting device determines that it is not necessary (No in S1403), it operates a fragment counter (S1405).

ステップＳ１４０１での判定の結果が非同期メディアである場合（Ｓ１４０２で非同期メディア）、送信装置は、上述で説明した方法を用いて分割データ番号や分割データ数がシグナリングされるかどうかに基づいて、フラグメントカウンタを運用するか否かを決定する。具体的には、送信装置は、分割データ番号や分割データ数がシグナリングされる場合（Ｓ１４０６でＹｅｓ）、フラグメントカウンタを運用しない（Ｓ１４０４）。一方、送信装置は、分割データ番号や分割データ数がシグナリングされない場合（Ｓ１４０６でＮｏ）、フラグメントカウンタを運用する（Ｓ１４０５）。 If the result of the determination in step S1401 is asynchronous media (asynchronous media in S1402), the transmitting device uses the fragment Decide whether to use the counter. Specifically, when the divided data number and the number of divided data are signaled (Yes in S1406), the transmitting device does not operate the fragment counter (S1404). On the other hand, if the divided data number or the number of divided data is not signaled (No in S1406), the transmitting device operates a fragment counter (S1405).

なお、送信装置は、フラグメントカウンタを運用しない場合、フラグメントカウンタの値をｒｅｓｅｒｖｅｄとしてもよいし、ヘッダ圧縮をしてもよい。 Note that when the transmitting device does not operate the fragment counter, the transmitting device may reserve the value of the fragment counter, or may perform header compression.

なお、送信装置は、フラグメントカウンタを運用するかどうかに基づいて、上述した分割データ番号や分割データ数をシグナリングするかどうかを決定してもよい。 Note that the transmitting device may decide whether to signal the above-described divided data number or the number of divided data based on whether or not to operate a fragment counter.

なお、送信装置は、同期メディアがフラグメントカウンタを運用しない場合には、非同期メディアにおいて上述した方法を用いて分割データ番号や分割データ数をシグナリングしてもよい。逆に、非同期メディアがフラグメントカウンタを運用するかどうかに基づいて、同期メディアの運用を決定してもよい。この場合、同期メディアと非同期メディアとにおいて、フラグメントを運用するかどうかを同じ運用とすることができる。 Note that if the synchronous media does not operate a fragment counter, the transmitting device may signal the divided data number or the number of divided data using the method described above for the asynchronous media. Conversely, the operation of synchronous media may be determined based on whether or not asynchronous media operates a fragment counter. In this case, whether or not to use fragments can be the same for synchronous media and asynchronous media.

次に、分割データ数、分割データ番号を特定する方法（フラグメントカウンタを活用する場合）について説明する。図６０は、分割データ数、分割データ番号を特定する方法（フラグメントカウンタを活用する場合）について説明するための図である。 Next, a method for specifying the number of divided data and the divided data number (when using a fragment counter) will be explained. FIG. 60 is a diagram for explaining a method for specifying the number of divided data and the divided data number (when using a fragment counter).

図５４を用いて説明したように、分割データ数が２５６以下であり、分割データ数があらかじめ２５６以下であることが既知である場合には、フラグメントカウンタを参照することにより、分割データ番号や分割データ数を特定することが可能である。 As explained using FIG. 54, if the number of divided data is 256 or less, and it is known in advance that the number of divided data is 256 or less, by referring to the fragment counter, the divided data number and It is possible to specify the number of data.

ファイルの分割データ数を２５６以下に制限する場合、１つのパケットで伝送可能なデータサイズをｘ［ｂｙｔｅｓ］とした場合、伝送可能なファイルの最大サイズはｘ＊２５６［ｂｙｔｅｓ］に制限される。例えば、放送ではｘ＝４ｋ［ｂｙｔｅｓ］が想定されており、この場合、伝送可能なファイルの最大サイズは４ｋ＊２５６＝１Ｍ［ｂｙｔｅｓ］に制限される。 When limiting the number of divided data of a file to 256 or less, if the data size that can be transmitted in one packet is x [bytes], the maximum size of a file that can be transmitted is limited to x*256 [bytes]. For example, in broadcasting, x=4k [bytes] is assumed, and in this case, the maximum size of a file that can be transmitted is limited to 4k*256=1M [bytes].

ファイルサイズが、伝送可能なファイルの最大サイズを超える場合、分割ファイルのサイズがｘ＊２５６［ｂｙｔｅｓ］以下となるように予めファイルを分割する。ファイルが分割することにより得られた複数の分割ファイルのそれぞれは、一つのファイル（ｉｔｅｍ）として扱われ、さらに２５６以内に分割され、さらに分割されることにより得られた分割データがそれぞれＭＭＴＰパケットに格納されて伝送される。 If the file size exceeds the maximum size of a file that can be transmitted, the file is divided in advance so that the size of the divided file becomes x*256 [bytes] or less. Each of the multiple divided files obtained by dividing the file is treated as one file (item), and is further divided into 256 or less pieces, and the divided data obtained by further division is converted into MMTP packets. stored and transmitted.

なお、アイテムが分割ファイルであることを示す情報や、分割ファイル数、分割ファイルのシーケンス番号を制御情報に格納し、受信装置に送信してもよい。また、これらの情報をアセット管理テーブルに格納してもよいし、既存のフィールドｉｔｅｍ＿ｉｎｆｏ＿ｂｙｔｅの一部を用いて示してもよい。 Note that information indicating that the item is a divided file, the number of divided files, and the sequence number of the divided files may be stored in the control information and transmitted to the receiving device. Further, this information may be stored in the asset management table, or may be indicated using a part of the existing field item_info_byte.

受信装置は、アイテムが１つのファイルを分割することにより得られた複数の分割ファイルのうちの１つの分割ファイルである場合には、他の分割ファイルを特定し、元のファイルを再構成することができる。また、受信装置では、制御情報における分割ファイルの分割ファイル数、分割ファイルのインデックス、および、フラグメントカウンタを用いることにより、分割データ数や、分割データ番号を一意に特定することができる。また、パケットシーケンス番号などを用いることなく分割データ数や分割データ番号を一意に特定できる。 If the item is one of a plurality of split files obtained by splitting one file, the receiving device identifies the other split files and reconstructs the original file. I can do it. Furthermore, the receiving device can uniquely identify the number of divided data and the divided data number by using the number of divided files, the index of the divided file, and the fragment counter in the control information. Furthermore, the number of divided data and the divided data number can be uniquely specified without using a packet sequence number or the like.

ここで、１つのファイルを分割することにより得られた複数の分割ファイルそれぞれのｉｔｅｍ＿ｉｄは、互いに同じであることが望ましい。なお、別のｉｔｅｍ＿ｉｄを付与する場合は、他の制御情報などからファイルを一意に参照するために、先頭の分割ファイルのｉｔｅｍ＿ｉｄを示すとしてもよい。 Here, it is desirable that the item_id of each of the plurality of divided files obtained by dividing one file is the same. Note that when assigning another item_id, the item_id of the first divided file may be indicated in order to uniquely refer to the file from other control information or the like.

また、複数の分割ファイルは、必ず同じＭＰＵに属するとしてもよい。ＭＰＵに複数のファイルを格納する場合には、異なる種類のファイルは格納せず、必ず１つのファイルを分割したファイルが格納されているとしてもよい。受信装置はｉｔｅｍ毎のバージョン情報を確認せずとも、ＭＰＵ毎のバージョン情報を確認することで、ファイルの更新を検知できる。 Further, a plurality of divided files may always belong to the same MPU. When storing a plurality of files in the MPU, files of different types may not be stored, but files obtained by dividing one file may always be stored. The receiving device can detect file updates by checking the version information of each MPU without checking the version information of each item.

図６１は、フラグメントカウンタを活用する場合の送信装置の動作フローである。 FIG. 61 is an operation flow of the transmitting device when utilizing a fragment counter.

まず、送信装置は、伝送するファイルのサイズを確認する（Ｓ１５０１）。次に送信装置は、ファイルサイズがｘ＊２５６［ｂｙｔｅｓ］（ｘは、１つのパケットで伝送できるデータサイズ。例えば、ＭＴＵサイズ。）を超えるか否かを判定し（Ｓ１５０２）、ファイルサイズがｘ＊２５６［ｂｙｔｅｓ］を超える場合には（Ｓ１５０２でＹｅｓ）、分割ファイルのサイズがｘ＊２５６［ｂｙｔｅｓ］未満となるようにファイルを分割する（Ｓ１５０３）。そして、分割ファイルをアイテムとして伝送し、分割ファイルに関する情報（例えば、分割ファイルであること、分割ファイルにおけるシーケンス番号など）を制御情報に格納して伝送する（Ｓ１５０４）。一方、ファイルサイズがｘ＊２５６［ｂｙｔｅｓ］未満である場合には（Ｓ１５０２でＮｏ）、通常通りファイルをアイテムとして伝送する（Ｓ１５０５）。 First, the transmitting device checks the size of the file to be transmitted (S1501). Next, the transmitting device determines whether the file size exceeds x*256 [bytes] (x is the data size that can be transmitted in one packet. For example, MTU size) (S1502), and determines whether the file size exceeds x If the size exceeds *256 [bytes] (Yes in S1502), the file is divided so that the size of the divided file becomes less than x*256 [bytes] (S1503). Then, the divided file is transmitted as an item, and information regarding the divided file (for example, the fact that it is a divided file, the sequence number of the divided file, etc.) is stored in the control information and transmitted (S1504). On the other hand, if the file size is less than x*256 [bytes] (No in S1502), the file is transmitted as an item as usual (S1505).

図６２は、フラグメントカウンタを活用する場合の受信装置の動作フローである。 FIG. 62 is an operation flow of the receiving device when utilizing a fragment counter.

まず、受信装置は、アセット管理テーブルなどのファイルの伝送に関する制御情報を取得、解析する（Ｓ１６０１）。次に、受信装置は、所望のアイテムが分割ファイルであるかどうかを判定する（Ｓ１６０２）。受信装置は、所望のファイルが分割ファイルであると判定した場合には（Ｓ１６０２でＹｅｓ）、分割ファイルや分割ファイルのインデックスなど、ファイルを再構成するための情報を制御情報から取得する（Ｓ１６０３）。そして、受信装置は、分割ファイルを構成するアイテムを取得し、元のファイルを再構成する（Ｓ１６０４）。一方、受信装置は、所望のファイルが分割ファイルでないと判定した場合（Ｓ１６０２でＮｏ）、通常通りファイルを取得する（Ｓ１６０５）。 First, the receiving device acquires and analyzes control information regarding file transmission, such as an asset management table (S1601). Next, the receiving device determines whether the desired item is a split file (S1602). If the receiving device determines that the desired file is a divided file (Yes in S1602), the receiving device acquires information for reconfiguring the file, such as the divided file and the index of the divided file, from the control information (S1603). . The receiving device then obtains the items that make up the divided file and reconstructs the original file (S1604). On the other hand, if the receiving device determines that the desired file is not a divided file (No in S1602), it acquires the file as usual (S1605).

要するに、送信装置は、ファイル先頭の分割データのパケットシーケンス番号をシグナリングする。また、送信装置は、分割データ数を特定できる情報をシグナリングする。或いは、送信装置は、分割データ数を特定できる分割ルールを規定する。また、送信装置は、フラグメントカウンタを運用せずに、ｒｅｓｅｒｖｅｄ或いはヘッダ圧縮する。 In short, the transmitting device signals the packet sequence number of the divided data at the beginning of the file. Further, the transmitting device signals information that can specify the number of divided data. Alternatively, the transmitting device defines a division rule that can specify the number of divided data. Further, the transmitting device performs reserved or header compression without operating a fragment counter.

受信装置は、ファイル先頭のデータのパケットシーケンス番号がシグナリングされている場合には、ファイル先頭の分割データのパケットシーケンス番号と、ＭＭＴＰパケットのパケットシーケンス番号から、分割データ番号や分割データ数を特定する。 If the packet sequence number of the data at the beginning of the file is signaled, the receiving device identifies the divided data number and the number of divided data from the packet sequence number of the divided data at the beginning of the file and the packet sequence number of the MMTP packet. .

別の観点では、送信装置は、ファイルを分割し、分割ファイル毎にデータを分割して送信する。分割ファイルを紐づける情報（シーケンス番号、分割数など）をシグナリングする。 From another perspective, the transmitting device divides a file, divides data for each divided file, and transmits the data. Signal information that links divided files (sequence number, number of divisions, etc.).

受信装置は、フラグメントカウンタ、及び分割ファイルのシーケンス番号により、分割データ番号や分割データ数を特定する。 The receiving device identifies the divided data number and the number of divided data using the fragment counter and the sequence number of the divided file.

これにより、分割データ番号や分割データを一意に特定できる。また、途中の分割データを受信した時点で当該分割データの分割データ番号を特定できるため、待機時間を削減し、メモリも削減できる。 This makes it possible to uniquely identify the divided data number and divided data. Further, since the divided data number of the divided data in the middle can be specified at the time of receiving the divided data, the waiting time and memory can be reduced.

また、フラグメントカウンタを運用しないことにより、送受信装置の構成は処理量を削減することができる、また、伝送効率を向上させることができる。 Furthermore, by not operating a fragment counter, the configuration of the transmitter/receiver can reduce the processing amount and improve transmission efficiency.

図６３は、同一番組を複数のＩＰデータフローで送信する場合のサービス構成を示す図である。ここでは、サービスＩＤ＝２の番組の一部（映像・音声）のデータがＭＭＴ方式を用いたＩＰデータフローで送信され、同じサービスＩＤであり、当該一部のデータとは異なるデータが高度ＢＳデータ伝送方式を用いたＩＰデータフロー（この例ではファイル伝送プロトコルも異なるが同じプロトコルであってもよい。）で送信される例を示している。 FIG. 63 is a diagram showing a service configuration when the same program is transmitted using multiple IP data flows. Here, part of the data (video/audio) of the program with service ID = 2 is transmitted by IP data flow using the MMT method, and data that has the same service ID and is different from the part of the data is sent to the advanced BS. An example is shown in which data is transmitted using an IP data flow using a data transmission method (in this example, the file transmission protocol is different, but may be the same protocol).

送信装置は、受信装置において、複数のＩＰデータフローから構成されるデータが、復号時刻までに揃うことを保証できるように、ＩＰデータの多重化を行う。 The transmitting device multiplexes the IP data so that it can be guaranteed that the data composed of multiple IP data flows will be completed by the decoding time in the receiving device.

受信装置は、複数のＩＰデータフローから構成されるデータを用いて、復号時刻に基づいて処理することにより、保証された受信機動作を実現することができる。 The receiving device can realize guaranteed receiver operation by processing data made up of multiple IP data flows based on decoding time.

［補足：送信装置及び受信装置］
以上のように、フラグメントカウンタを運用せずにデータの送信を行う送信装置は、図６４のように構成することも可能である。また、フラグメントカウンタを運用せずにデータの受信を行う受信装置は、図６５のように構成することも可能である。図６４は、送信装置の具体的構成の例を示す図である。図６５は、受信装置の具体的構成の例を示す図である。 [Supplement: Transmitting device and receiving device]
As described above, a transmitting device that transmits data without operating a fragment counter can also be configured as shown in FIG. 64. Further, a receiving device that receives data without operating a fragment counter can also be configured as shown in FIG. 65. FIG. 64 is a diagram showing an example of a specific configuration of a transmitting device. FIG. 65 is a diagram showing an example of a specific configuration of a receiving device.

送信装置５００は、分割部５０１と、構成部５０２と、送信部５０３とを備える。分割部５０１、構成部５０２、及び送信部５０３のそれぞれは、例えば、マイクロコンピュータ、プロセッサ、または、専用回路などによって実現される。 Transmitting device 500 includes a dividing section 501, a configuration section 502, and a transmitting section 503. Each of the dividing unit 501, the configuration unit 502, and the transmitting unit 503 is realized by, for example, a microcomputer, a processor, a dedicated circuit, or the like.

受信装置６００は、受信部６０１と、判定部６０２と、構成部６０３とを備える。受信部６０１、判定部６０２、及び構成部６０３のそれぞれは、例えば、マイクロコンピュータ、プロセッサ、または、専用回路などによって実現される。 The receiving device 600 includes a receiving section 601, a determining section 602, and a configuration section 603. Each of the receiving section 601, the determining section 602, and the configuring section 603 is realized by, for example, a microcomputer, a processor, a dedicated circuit, or the like.

送信装置５００及び受信装置６００の各構成要素についての詳細な説明は、それぞれ、送信方法及び受信方法の説明において行う。 A detailed explanation of each component of the transmitting device 500 and the receiving device 600 will be given in the description of the transmitting method and the receiving method, respectively.

まず、送信方法について、図６６を用いて説明する。図６６は、送信装置による動作フロー（送信方法）である。 First, the transmission method will be explained using FIG. 66. FIG. 66 is an operation flow (transmission method) by the transmitting device.

まず、送信装置５００の分割部５０１は、データを複数の分割データに分割する（Ｓ１７０１）。 First, the dividing unit 501 of the transmitting device 500 divides data into a plurality of divided data (S1701).

次に、送信装置５００の構成部５０２は、複数の分割データのそれぞれにヘッダ情報を付与してパケット化することで、複数のパケットを構成する（Ｓ１７０２）。 Next, the configuration unit 502 of the transmitting device 500 configures a plurality of packets by adding header information to each of the plurality of divided data and packetizing the divided data (S1702).

そして、送信装置５００の送信部５０３は、構成された前記複数のパケットを送信する（Ｓ１７０３）。送信部５０３は、分割データ情報、および、無効化されたフラグメントカウンタの値を、送信する。なお、分割データ情報は、分割データ番号と、分割データ数とを特定するための情報である。また、分割データ番号は、当該分割データが複数の分割データのうちの何番目の分割データであるかを示す番号である。分割データ数は、複数の分割データの数である。 Then, the transmitting unit 503 of the transmitting device 500 transmits the plurality of configured packets (S1703). The transmitter 503 transmits the fragment data information and the invalidated fragment counter value. Note that the divided data information is information for specifying the divided data number and the number of divided data. Further, the divided data number is a number indicating which divided data the corresponding divided data is among the plurality of divided data. The number of divided data is the number of multiple pieces of divided data.

これにより、送信装置５００の処理量を削減することができる。 Thereby, the processing amount of the transmitting device 500 can be reduced.

次に、受信方法について、図６７を用いて説明する。図６７は、受信装置による動作フロー（受信方法）である。 Next, the reception method will be explained using FIG. 67. FIG. 67 is an operation flow (receiving method) by the receiving device.

まず、受信装置６００の受信部６０１は、複数のパケットを受信する（Ｓ１８０１）。 First, the receiving unit 601 of the receiving device 600 receives a plurality of packets (S1801).

次に、受信装置６００の判定部６０２は、受信した複数のパケットから、分割データ情報が、当取得されたか否かを判定する（Ｓ１８０２）。 Next, the determining unit 602 of the receiving device 600 determines whether the divided data information has been acquired from the plurality of received packets (S1802).

そして、受信装置６００の構成部６０３は、判定部６０２により、分割データ情報を取得したと判定された場合（Ｓ１８０２でＹｅｓ）、当該ヘッダ情報に含まれるフラグメントカウンタの値を使用せずに、受信した複数のパケットからデータを構成する（Ｓ１８０３）。 Then, when the determining unit 602 determines that the divided data information has been acquired (Yes in S1802), the configuration unit 603 of the receiving device 600 performs reception without using the value of the fragment counter included in the header information. Data is constructed from a plurality of packets (S1803).

一方で、構成部６０３は、判定部６０２により、分割データ情報を取得していないと判定された場合（Ｓ１８０２でＮｏ）、当該ヘッダ情報に含まれるフラグメントカウンタの値を用いて、受信した複数のパケットからデータを構成してもよい（Ｓ１８０４）。 On the other hand, if the determining unit 602 determines that the divided data information has not been acquired (No in S1802), the configuration unit 603 uses the value of the fragment counter included in the header information to Data may be configured from packets (S1804).

これにより、受信装置６００の処理量を削減することができる。 Thereby, the processing amount of the receiving device 600 can be reduced.

（実施の形態５）
［概要］
実施の形態５では、ＮＡＬサイズフォーマットでＮＡＬユニットを多重化レイヤに格納した場合の伝送パケット（ＴＬＶパケット）の送信方法について説明する。 (Embodiment 5)
[overview]
In Embodiment 5, a method for transmitting a transmission packet (TLV packet) when NAL units are stored in a multiplexing layer in a NAL size format will be described.

実施の形態１で説明したように、Ｈ．２６４やＨ．２６５のＮＡＬユニットを多重化レイヤに格納する際には、次の２種類の形式がある。１つは、ＮＡＬユニットヘッダの直前に特定のビット列からなるスタートコードを付加するバイトストリームフォーマットと呼ばれる形式である。もう１つは、ＮＡＬユニットのサイズを示すフィールドを付加するＮＡＬサイズフォーマットと呼ばれる形式である。バイトストリームフォーマットは、ＭＰＥＧ－２システムやＲＴＰなどにおいて用いられ、ＮＡＬサイズフォーマットはＭＰ４、あるいは、ＭＰ４を使用するＤＡＳＨやＭＭＴなどにおいて用いられる。 As explained in Embodiment 1, H. 264 and H. There are two types of formats for storing H.265 NAL units in the multiplexing layer. One is a format called a byte stream format in which a start code consisting of a specific bit string is added immediately before the NAL unit header. The other format is called the NAL size format, which adds a field indicating the size of the NAL unit. The byte stream format is used in the MPEG-2 system, RTP, etc., and the NAL size format is used in MP4, or DASH, MMT, etc. that use MP4.

バイトストリームフォーマットにおいて、スタートコードは３バイトで構成され、さらに任意のバイト（値は０であるバイト）を付加することもできる。 In the byte stream format, the start code consists of 3 bytes, and further arbitrary bytes (bytes whose value is 0) can be added.

一方で、一般的なＭＰ４におけるＮＡＬサイズフォーマットでは、サイズ情報は、１バイト、２バイト、および４バイトのいずれかで示される。このサイズ情報は、ＨＥＶＣサンプルエントリにおけるｌｅｎｇｔｈＳｉｚｅＭｉｎｕｓＯｎｅフィールドで示される。当該フィールドの値が「０」の場合は１バイト、「１」の場合は２バイト、「３」の場合は４バイトであることを示す。 On the other hand, in the general NAL size format in MP4, size information is indicated as 1 byte, 2 bytes, or 4 bytes. This size information is indicated in the lengthSizeMinusOne field in the HEVC sample entry. A value of "0" in the field indicates 1 byte, a value of "1" indicates 2 bytes, and a value of "3" indicates 4 bytes.

ここで、２０１４年７月に規格化された、ＡＲＩＢＳＴＤ－Ｂ６０「デジタル放送におけるＭＭＴによるメディアトランスポート方式」では、ＮＡＬユニットを多重化レイヤに格納する際、ＨＥＶＣエンコーダの出力がバイトストリームである場合、バイトスタートコードを除去し、３２ビット（符号なし整数）で示したバイト単位のＮＡＬユニットのサイズを長さ情報としてＮＡＬユニットの直前に付加する。なお、ＨＥＶＣサンプルエントリを含むＭＰＵメタデータを伝送せず、サイズ情報は３２ビット（４バイト）固定である。 Here, in ARIB STD-B60 "Media Transport Method Using MMT in Digital Broadcasting", which was standardized in July 2014, when storing NAL units in the multiplex layer, the output of the HEVC encoder is a byte stream. In this case, the byte start code is removed, and the size of the NAL unit in bytes indicated by 32 bits (unsigned integer) is added as length information immediately before the NAL unit. Note that the MPU metadata including the HEVC sample entry is not transmitted, and the size information is fixed at 32 bits (4 bytes).

また、ＡＲＩＢＳＴＤ－Ｂ６０「デジタル放送におけるＭＭＴによるメディアトランスポート方式」では、送信装置が受信装置におけるバッファ動作を保証するために送信の際に考慮する受信バッファモデルにおいては、映像信号の復号前バッファは、ＣＰＢであると規定されている。 In addition, in ARIB STD-B60 "Media Transport Method Using MMT in Digital Broadcasting", in the receive buffer model that the transmitter considers during transmission to ensure buffer operation in the receiver, the buffer before decoding of the video signal is is defined as CPB.

しかし、次のような課題がある。ＭＰＥＧ－２システムにおけるＣＰＢや、ＨＥＶＣにおけるＨＲＤでは、映像信号がバイトストリームフォーマットであることを前提に規定されている。このため、例えば、３バイトのスタートコードが付いたバイトストリームフォーマットであることを前提に伝送パケットのレート制御を行った場合、４バイトのサイズ領域が付加されたＮＡＬサイズフォーマットの伝送パケットを受信した受信装置は、ＡＲＩＢＳＴＤ－Ｂ６０における受信バッファモデルを満たすことができない可能性がある。また、ＡＲＩＢＳＴＤ－Ｂ６０における受信バッファモデルには、具体的なバッファサイズ、および、引き抜きレートが示されていないため、受信装置におけるバッファ動作を保証することは難しい。 However, there are the following issues. CPB in the MPEG-2 system and HRD in HEVC are defined on the premise that the video signal is in a byte stream format. For this reason, for example, if the rate of a transmission packet is controlled on the assumption that it is in a byte stream format with a 3-byte start code, a transmission packet in NAL size format with a 4-byte size field added is received. The receiving device may not be able to satisfy the receiving buffer model in ARIB STD-B60. Furthermore, since the reception buffer model in ARIB STD-B60 does not indicate a specific buffer size and extraction rate, it is difficult to guarantee buffer operation in the reception device.

したがって、上記の課題を解決するために、受信機におけるバッファ動作を保証するための受信バッファモデルを下記のように規定する。 Therefore, in order to solve the above problem, a reception buffer model for guaranteeing buffer operation in the receiver is defined as follows.

図６８は、ＡＲＩＢＳＴＤＢ－６０に規定されている受信バッファモデルに基づいて、特に放送伝送路のみを用いた場合の受信バッファモデルを示す。 FIG. 68 shows a receive buffer model based on the receive buffer model specified in ARIB STD B-60, particularly when only a broadcast transmission path is used.

受信バッファモデルは、ＴＬＶパケットバッファ（第１のバッファ）と、ＩＰパケットバッファ（第２のバッファ）と、ＭＭＴＰバッファ（第３のバッファ）と、復号前バッファ（第４のバッファ）とを備える。なお、放送伝送路では、デジッタバッファやＦＥＣのためのバッファは必要ないため、省略している。 The reception buffer model includes a TLV packet buffer (first buffer), an IP packet buffer (second buffer), an MMTP buffer (third buffer), and a pre-decoding buffer (fourth buffer). Note that the broadcast transmission path does not require a de-jitter buffer or a buffer for FEC, so they are omitted.

ＴＬＶパケットバッファは、ＴＬＶパケット（伝送パケット）を放送伝送路から受信し、受信したＴＬＶパケットに格納されている可変長のパケットヘッダ（ＩＰパケットヘッダ、ＩＰパケット圧縮時のフルヘッダ、ＩＰパケット圧縮時の圧縮ヘッダ）、および可変長のペイロードで構成されるＩＰパケットを、ヘッダ伸張された固定長のＩＰパケットヘッダを有するＩＰパケット（第１のパケット）に変換し、変換することにより得られたＩＰパケットを一定のビットレートで出力する。 The TLV packet buffer receives TLV packets (transmission packets) from the broadcast transmission path, and stores variable-length packet headers (IP packet headers, full headers when compressing IP packets, and full headers when compressing IP packets) stored in the received TLV packets. An IP packet obtained by converting an IP packet consisting of a compressed header (compressed header) and a variable length payload into an IP packet (first packet) having a fixed length IP packet header with the header expanded. Output at a constant bit rate.

ＩＰパケットバッファは、ＩＰパケットをパケットヘッダおよび可変長のペイロードを有するＭＭＴＰパケット（第２のパケット）に変換し、変換することにより得られたＭＭＴＰパケットを一定のビットレートで出力する。なお、ＩＰパケットバッファは、ＭＭＴＰバッファにマージされていても良い。 The IP packet buffer converts an IP packet into an MMTP packet (second packet) having a packet header and a variable length payload, and outputs the MMTP packet obtained by the conversion at a constant bit rate. Note that the IP packet buffer may be merged with the MMTP buffer.

ＭＭＴＰバッファは、出力されたＭＭＴＰパケットをＮＡＬユニットに変換し、変換することにより得られたＮＡＬユニットを一定のビットレートで出力する。 The MMTP buffer converts the output MMTP packet into a NAL unit, and outputs the NAL unit obtained by the conversion at a constant bit rate.

復号前バッファは、出力されたＮＡＬユニットを順次蓄積し、蓄積した複数のＮＡＬユニットからアクセスユニットを生成し、生成したアクセスユニットを当該アクセスユニットに対応した復号時刻のタイミングでデコーダに出力する。 The pre-decoding buffer sequentially accumulates the output NAL units, generates an access unit from the accumulated NAL units, and outputs the generated access unit to the decoder at the timing of the decoding time corresponding to the access unit.

図６８に示す受信バッファモデルでは、前段のＴＬＶパケットバッファおよびＩＰパケットバッファ以外のバッファであるＭＭＴＰバッファおよび復号前バッファは、ＭＰＥＧ－２ＴＳにおける受信バッファモデルを踏襲することが特徴的である。 The reception buffer model shown in FIG. 68 is characterized in that the MMTP buffer and pre-decoding buffer, which are buffers other than the TLV packet buffer and IP packet buffer at the previous stage, follow the reception buffer model in MPEG-2 TS.

例えば、映像におけるＭＭＴＰバッファ（ＭＭＴＰＢ１）は、ＭＰＥＧ－２ＴＳにおけるトランスポートバッファ（ＴＢ）、及び、多重化バッファ（ＭＢ）に相当するバッファで構成される。また、音声におけるＭＭＴＰバッファ（ＭＭＴＰＢｎ）は、ＭＰＥＧ－２ＴＳにおけるトランスポートバッファ（ＴＢ）に相当するバッファで構成される。 For example, the MMTP buffer (MMTP B1) in video is composed of a buffer corresponding to a transport buffer (TB) and a multiplexing buffer (MB) in MPEG-2 TS. Furthermore, the MMTP buffer (MMTP Bn) in audio is configured with a buffer equivalent to the transport buffer (TB) in MPEG-2 TS.

トランスポートバッファのバッファサイズは、ＭＰＥＧ－２ＴＳ同様であり固定値とする。例えば、ＭＴＵサイズのｎ倍（ｎは、小数であっても整数であってもよく、１以上とする。）とする。 The buffer size of the transport buffer is the same as MPEG-2 TS and is a fixed value. For example, it is assumed to be n times the MTU size (n may be a decimal number or an integer, and is 1 or more).

また、ＭＭＴＰパケットヘッダのオーバーヘッド率がＰＥＳパケットヘッダのオーバーヘッド率よりも小さくなるように、ＭＭＴＰパケットサイズを規定する。これにより、トランスポートバッファからの引き抜きレートは、ＭＰＥＧ－２ＴＳにおけるトランスポートバッファの引き抜きレートＲＸ１、ＲＸｎ、ＲＸｓをそのまま適用することができる。 Furthermore, the MMTP packet size is defined so that the overhead rate of the MMTP packet header is smaller than the overhead rate of the PES packet header. As a result, the transport buffer extraction rates RX1, RXn, and RXs in the MPEG-2 TS can be directly applied to the extraction rate from the transport buffer.

また、多重化バッファのサイズ、および、引き抜きレートは、それぞれ、ＭＰＥＧ－２
ＴＳにおけるＭＢサイズ、および、ＲＢＸ１とする。 Also, the size of the multiplexing buffer and the extraction rate are respectively MPEG-2
Let MB size in TS and RBX1 be.

以上の受信バッファモデルに加え、課題を解決するために、下記の制約を設ける。 In addition to the above reception buffer model, the following constraints are established to solve the problem.

ＨＥＶＣのＨＲＤ規定は、バイトストリーム形式が前提であり、ＭＭＴはＮＡＬユニットの先頭に４バイトのサイズ領域を付加するＮＡＬサイズ形式である。したがって、符号化時にはＮＡＬサイズ形式において、ＨＲＤを満たすようにレート制御を行う。 The HEVC HRD specification is based on a byte stream format, and MMT is a NAL size format that adds a 4-byte size area to the beginning of a NAL unit. Therefore, during encoding, rate control is performed in the NAL size format so as to satisfy the HRD.

つまり、送信装置では、上記の受信バッファモデルおよび制約に基づいて、伝送パケットのレート制御を行う。 That is, the transmitter performs rate control of transmission packets based on the above-described receive buffer model and constraints.

受信装置では、上記の信号を用いて受信処理をすることにより、アンダーフローやオーバーフローすることなる復号動作をすることができる。 By performing reception processing using the above-mentioned signals, the receiving device can perform a decoding operation that does not cause underflow or overflow.

なお、ＴＬＶパケットバッファの引き抜きレート（ＴＬＶパケットバッファがＩＰパケットを出力する際のビットレート）は、ＩＰヘッダ伸張後の伝送レートを考慮して設定する。 Note that the extraction rate of the TLV packet buffer (the bit rate at which the TLV packet buffer outputs IP packets) is set in consideration of the transmission rate after IP header expansion.

つまり、データサイズが可変長であるＴＬＶパケットを入力し、ＴＬＶヘッダの除去およびＩＰヘッダの伸張（復元）を実施した後、出力されるＩＰパケットの伝送レートを考慮する。言い換えれば、入力される伝送レートに対してヘッダの増減量を考慮する。 That is, after inputting a TLV packet whose data size is variable and removing the TLV header and decompressing (restoring) the IP header, the transmission rate of the output IP packet is considered. In other words, the increase or decrease of the header is considered with respect to the input transmission rate.

具体的には、データサイズが可変長であること、ＩＰヘッダ圧縮されるパケットとＩＰヘッダ圧縮されていないパケットとが混在すること、ＩＰｖ４，ＩＰｖ６などのパケット種別によりＩＰヘッダのサイズが異なることから、出力されるＩＰパケットの伝送レートは一意ではない。このため、可変長のデータサイズの平均パケット長を定め、ＴＬＶパケットから出力されるＩＰパケットの伝送レートを定める。 Specifically, the data size is variable length, packets with IP header compression and packets with uncompressed IP header coexist, and the size of the IP header differs depending on the packet type such as IPv4 or IPv6. , the transmission rate of the output IP packets is not unique. For this reason, the average packet length of the variable-length data size is determined, and the transmission rate of the IP packet output from the TLV packet is determined.

ここでは、ＩＰヘッダ伸張後の最大伝送速度を規定するために、ＩＰヘッダは常に圧縮されている場合を想定して伝送レートを定める。 Here, in order to define the maximum transmission rate after IP header expansion, the transmission rate is determined assuming that the IP header is always compressed.

また、ＩＰｖ４、ＩＰｖ６のパケット種別が混在する場合、或いは、パケット種別を区別することなく規定する場合は、ヘッダサイズが大きく、ヘッダ伸張後の増加率の大きいＩＰｖ６パケットを想定して伝送レートを定める。 In addition, when IPv4 and IPv6 packet types coexist, or when specifying packet types without distinguishing them, the transmission rate is determined assuming IPv6 packets with a large header size and a large increase rate after header expansion. .

例えば、ＴＬＶパケットバッファに入力されるＴＬＶパケットの平均パケット長がＳであり、ＴＬＶパケットに格納されるＩＰパケットはすべてＩＰｖ６パケットであり、ヘッダ圧縮されているとした場合の、ＴＬＶヘッダの除去及びＩＰヘッダの伸張後の最大出力伝送レートは、
入力レート×｛Ｓ／（Ｓ＋ＩＰｖ６ヘッダ圧縮量）｝
となる。 For example, if the average packet length of the TLV packets input to the TLV packet buffer is S, and all IP packets stored in the TLV packets are IPv6 packets, and the headers are compressed, the TLV header removal and The maximum output transmission rate after decompressing the IP header is:
Input rate x {S/(S+IPv6 header compression amount)}
becomes.

より具体的には、ＴＬＶパケットの平均パケット長Ｓを、
Ｓ＝０．７５×１５００（１５００は最大ＭＴＵサイズを想定）
を基準として設定し、
ＩＰｖ６ヘッダ圧縮量＝ＴＬＶヘッダ長－ＩＰｖ６ヘッダ長－ＵＤＰヘッダ長
＝３－４０－８
とした場合、ＴＬＶヘッダの除去及びＩＰヘッダの伸張後の最大出力伝送レートは、
入力レート×１．０４１７≒入力レート×１．０５
となる。 More specifically, the average packet length S of TLV packets is
S = 0.75 x 1500 (1500 assumes maximum MTU size)
set as the standard,
IPv6 header compression amount = TLV header length - IPv6 header length - UDP header length
=3-40-8
In this case, the maximum output transmission rate after removing the TLV header and expanding the IP header is:
Input rate x 1.0417 ≒ Input rate x 1.05
becomes.

図６９は、複数のデータユニットをアグリゲーションして一つのペイロードに格納する例を示す図である。 FIG. 69 is a diagram illustrating an example of aggregating a plurality of data units and storing them in one payload.

ＭＭＴ方式では、データユニットをアグリゲーションする際、図６９に示すように、データユニットの前に、データユニット長、及び、データユニットヘッダが付加される。 In the MMT method, when aggregating data units, a data unit length and a data unit header are added before the data unit, as shown in FIG. 69.

しかし、例えば、ＮＡＬサイズフォーマットの映像信号を一つのデータユニットとして格納する場合、図７０に示すように、一つのデータユニットに対して、サイズを示すフィールドが２つあり、情報として重複している。図７０は、複数のデータユニットをアグリゲーションして一つのペイロードに格納する例であって、ＮＡＬサイズフォーマットの映像信号を一つのデータユニットとした場合の例を示す図である。具体的には、ＮＡＬサイズフォーマットにおける先頭のサイズ領域（以降の説明では、「ｓｉｚｅ領域」と呼ぶ。）と、ＭＭＴＰペイロードヘッダにおけるデータユニットヘッダの前に位置するデータユニット長フィールドとのいずれもサイズを示すフィールドであり、情報として重複している。例えば、ＮＡＬユニットの長さがＬバイトである場合、ｓｉｚｅ領域にはＬバイトが示されており、データユニット長フィールドには、Ｌバイト＋「ｓｉｚｅ領域の長さ」（ｂｙｔｅ）が示される。ｓｉｚｅ領域と、データユニット長フィールドとで示される値は完全に一致はしていないが、一方の値から他方の値を容易に算出できるため、重複していると言える。 However, for example, when storing a video signal in the NAL size format as one data unit, as shown in FIG. 70, there are two fields indicating the size for one data unit, and the information is duplicated. . FIG. 70 is a diagram showing an example in which a plurality of data units are aggregated and stored in one payload, and a video signal in the NAL size format is used as one data unit. Specifically, both the first size area in the NAL size format (referred to as the "size area" in the following explanation) and the data unit length field located before the data unit header in the MMTP payload header have the same size. This is a field that indicates information, and is duplicated as information. For example, if the length of the NAL unit is L bytes, the size area indicates L bytes, and the data unit length field indicates L bytes + "length of size area" (byte). Although the values shown in the size area and the data unit length field do not completely match, it can be said that they overlap because the value of one can be easily calculated from the other.

このように、データのサイズ情報を内部に含むデータをデータユニットとして格納し、かつ、複数の当該データユニットをアグリゲーションして一つのペイロードに格納する場合、サイズ情報が重複するため、オーバーヘッドが大きく、伝送効率が悪いと言う課題がある。 In this way, when data that includes data size information is stored as a data unit, and multiple data units are aggregated and stored in one payload, the size information overlaps, resulting in large overhead. There is a problem with poor transmission efficiency.

そこで、送出装置では、データのサイズ情報を内部に含むデータをデータユニットとして格納し、かつ、複数の当該データユニットをアグリゲーションして一つのペイロードに格納する場合、図７１や図７２に示すように格納することが考えられる。 Therefore, when the sending device stores data that includes data size information as a data unit and aggregates a plurality of data units and stores them in one payload, as shown in FIGS. 71 and 72, It is possible to store it.

図７１に示すように、ｓｉｚｅ領域を含むＮＡＬユニットをデータユニットとして格納し、ＭＭＴＰペイロードヘッダに従来含まれるデータユニット長は示さないことが考えられる。図７１は、データユニット長が示されないＭＭＴＰパケットのペイロードの構成を示す図である。 As shown in FIG. 71, it is conceivable that the NAL unit including the size area is stored as a data unit, and the data unit length conventionally included in the MMTP payload header is not indicated. FIG. 71 is a diagram showing the structure of the payload of an MMTP packet in which the data unit length is not indicated.

また、図７２に示すように、データユニット長が示されているかどうかを示すフラグや、ｓｉｚｅ領域の長さを示す情報を新たにヘッダに格納してもよい。フラグやｓｉｚｅ領域の長さを示す情報を格納する場所は、データユニットヘッダなど、データユニット単位で示してもよいし、複数のデータユニットをアグリゲーションした単位で（パケット単位）で示してもよい。図７２は、パケット単位に付与されるｅｘｔｅｎｄ領域に示す例である。なお、上記の新規に示す情報の格納場所はこれに限るものではなく、ＭＭＴＰペイロードヘッダやＭＭＴＰパケットヘッダ、制御情報であってもよい。 Further, as shown in FIG. 72, a flag indicating whether the data unit length is indicated or information indicating the length of the size area may be newly stored in the header. The location where information indicating the length of the flag and size area is stored may be indicated in units of data units, such as in data unit headers, or may be indicated in units of aggregation of a plurality of data units (units of packets). FIG. 72 is an example of an extend area added to each packet. Note that the storage location of the newly indicated information is not limited to this, and may be the MMTP payload header, MMTP packet header, or control information.

受信側では、データユニット長が圧縮されているかどうかを示すフラグが、データユニット長が圧縮されていることを示している場合は、データユニット内部のｓｉｚｅ領域の長さ情報を取得し、ｓｉｚｅ領域の長さ情報に基づいて、ｓｉｚｅ領域を取得することにより、取得したｓｉｚｅ領域の長さ情報およびｓｉｚｅ領域を用いてデータユニット長を算出できる。 On the receiving side, if the flag indicating whether the data unit length is compressed indicates that the data unit length is compressed, it obtains the length information of the size area inside the data unit, and By acquiring the size area based on the length information of the size area, the data unit length can be calculated using the acquired length information of the size area and the size area.

以上の方法により、送出側でデータ量を削減することができ、伝送効率を向上できる。 By the above method, the amount of data can be reduced on the sending side and transmission efficiency can be improved.

なお、データユニット長を削減するのではなく、ｓｉｚｅ領域を削減することによりオーバーヘッド削減してもよい。ｓｉｚｅ領域を削減する場合には、ｓｉｚｅ領域が削減されているかどうかを示す情報や、データユニット長フィールドの長さを示す情報を格納してもよい。 Note that the overhead may be reduced by reducing the size area instead of reducing the data unit length. When reducing the size area, information indicating whether the size area has been reduced or information indicating the length of the data unit length field may be stored.

なお、ＭＭＴＰペイロードヘッダにも、長さ情報が含まれる。 Note that the MMTP payload header also includes length information.

ｓｉｚｅ領域を含むＮＡＬユニットをデータユニットとして格納する場合は、アグリゲーションする／しないにかかわらず、ＭＭＴＰペイロードヘッダにおけるペイロードサイズ領域を削減してもよい。 When storing a NAL unit including a size area as a data unit, the payload size area in the MMTP payload header may be reduced regardless of whether aggregation is performed or not.

また、ｓｉｚｅ領域を含まないデータをデータユニットとして格納する場合でも、アグリゲーションされており、データユニット長が示されている場合には、ＭＭＴＰペイロードヘッダにおけるペイロードサイズ領域を削減してもよい。 Furthermore, even when data that does not include the size area is stored as a data unit, if it is aggregated and the data unit length is indicated, the payload size area in the MMTP payload header may be reduced.

ペイロードサイズ領域を削減する場合には、上記と同様に、削減したかどうかを示すフラグや、削減したサイズフィールドの長さ情報を示してもよい。 When the payload size area is to be reduced, a flag indicating whether the payload size area has been reduced or the length information of the reduced size field may be shown in the same way as described above.

図７３は、受信装置の動作フローを示す。 FIG. 73 shows the operation flow of the receiving device.

送出装置では、上述したように、ｓｉｚｅ領域を含むＮＡＬユニットをデータユニットとして格納し、ＭＭＴＰペイロードヘッダに含まれるデータユニット長はＭＭＴＰパケットにおいて示さないとする。 In the sending device, as described above, the NAL unit including the size area is stored as a data unit, and the data unit length included in the MMTP payload header is not indicated in the MMTP packet.

以下では、データユニット長が示されているかどうかをフラグや、ｓｉｚｅ領域の長さ情報がＭＭＴＰパケットにおいて示されている場合を例に説明する。 In the following, a case will be explained using a flag to determine whether the data unit length is indicated or not, and a case where length information of the size area is indicated in the MMTP packet.

受信装置は、データユニットがサイズ領域を含み、データユニット長が削減されているかどうかを、送出側から送信される情報に基づいて判定する（Ｓ１９０１）。 The receiving device determines whether the data unit includes a size area and the data unit length has been reduced based on information transmitted from the sending side (S1901).

データユニット長が削減されていると判定した場合（Ｓ１９０２でＹｅｓ）、データユニット内部のサイズ領域の長さ情報を取得し、その後、データユニット内部のサイズ領域を解析し、データユニット長を算出することにより取得する（Ｓ１９０３）。 If it is determined that the data unit length has been reduced (Yes in S1902), obtain length information of the size area inside the data unit, and then analyze the size area inside the data unit to calculate the data unit length. (S1903).

一方、データユニット長が削減されていないと判定した場合（Ｓ１９０２でＮｏ）、通常どおり、データユニット長、および、データユニット内部のサイズ領域のいずれか一方からデータユニット長を算出する（Ｓ１９０４）。 On the other hand, if it is determined that the data unit length has not been reduced (No in S1902), the data unit length is calculated from either the data unit length or the size area inside the data unit as usual (S1904).

［補足：送信装置及び受信装置］
以上のように、符号化時に受信バッファモデルの規定を満たすようにレート制御を行う送信装置は、図７４のように構成することも可能である。また、送信装置から送信された伝送パケットを受信し、復号する受信装置は、図７５のように構成することも可能である。図７４は、送信装置の具体的構成の例を示す図である。図７５は、受信装置の具体的構成の例を示す図である。 [Supplement: Transmitting device and receiving device]
As described above, a transmitting device that performs rate control so as to satisfy the specifications of the reception buffer model during encoding can also be configured as shown in FIG. 74. Further, a receiving device that receives and decodes a transmission packet transmitted from a transmitting device can also be configured as shown in FIG. 75. FIG. 74 is a diagram illustrating an example of a specific configuration of a transmitting device. FIG. 75 is a diagram showing an example of a specific configuration of a receiving device.

送信装置７００は、生成部７０１と、送信部７０２とを備える。生成部７０１及び送信部７０２のそれぞれは、例えば、マイクロコンピュータ、プロセッサ、または、専用回路などによって実現される。 Transmitting device 700 includes a generating section 701 and a transmitting section 702. Each of the generation section 701 and the transmission section 702 is realized by, for example, a microcomputer, a processor, a dedicated circuit, or the like.

受信装置８００は、受信部８０１と、第１のバッファ８０２と、第２のバッファ８０３と、第３のバッファ８０４と、第４のバッファ８０５と、復号部８０６とを備える。受信部８０１、第１のバッファ８０２、第２のバッファ８０３、第３のバッファ８０４、第４のバッファ８０５及び復号部（デコーダ）８０６のそれぞれは、例えば、マイクロコンピュータ、プロセッサ、または、専用回路などによって実現される。 The receiving device 800 includes a receiving section 801, a first buffer 802, a second buffer 803, a third buffer 804, a fourth buffer 805, and a decoding section 806. Each of the receiving section 801, first buffer 802, second buffer 803, third buffer 804, fourth buffer 805, and decoding section (decoder) 806 is a microcomputer, a processor, a dedicated circuit, etc. realized by

送信装置７００及び受信装置８００の各構成要素についての詳細な説明は、それぞれ、送信方法及び受信方法の説明において行う。 A detailed explanation of each component of the transmitting device 700 and the receiving device 800 will be given in the description of the transmitting method and the receiving method, respectively.

まず、送信方法について、図７６を用いて説明する。図７６は、送信装置による動作フロー（送信方法）である。 First, the transmission method will be explained using FIG. 76. FIG. 76 is an operation flow (transmission method) by the transmitting device.

まず、送信装置７００の生成部７０１は、受信装置のバッファ動作を保証するために予め定められた受信バッファモデルによる規定を満たすようにレート制御を行うことで符号化ストリームを生成する（Ｓ２００１）。 First, the generating unit 701 of the transmitting device 700 generates an encoded stream by performing rate control so as to satisfy a predetermined specification based on a receiving buffer model in order to guarantee the buffer operation of the receiving device (S2001).

次に、送信装置７００の送信部７０２は、生成された符号化ストリームをパケット化し、パケット化することで得られた伝送パケットを送信する（Ｓ２００２）。 Next, the transmitter 702 of the transmitter 700 packetizes the generated encoded stream and transmits the transmission packet obtained by packetizing (S2002).

なお、送信装置７００において用いられる受信バッファモデルは、受信装置８００の構成の第１～第４のバッファ８０２～８０５を備える構成であるため、説明を省略する。 Note that the receiving buffer model used in the transmitting device 700 has a configuration including the first to fourth buffers 802 to 805 of the configuration of the receiving device 800, and therefore the description thereof will be omitted.

これにより、送信装置７００は、ＭＭＴのような方式を用いてデータ伝送する場合に、受信装置８００のバッファ動作を保証できる。 Thereby, the transmitting device 700 can guarantee the buffer operation of the receiving device 800 when transmitting data using a method such as MMT.

次に、受信方法について、図７７を用いて説明する。図７７は、受信装置による動作フロー（受信方法）である。 Next, the reception method will be explained using FIG. 77. FIG. 77 is an operation flow (receiving method) by the receiving device.

まず、受信装置８００の受信部８０１は、可変長のパケットヘッダおよび可変長のペイロードで構成された伝送パケットを受信する（Ｓ２１０１）。 First, the receiving unit 801 of the receiving device 800 receives a transmission packet composed of a variable-length packet header and a variable-length payload (S2101).

次に、受信装置８００の第１のバッファ８０２は、受信した伝送パケットに格納されている可変長のパケットヘッダおよび可変長のペイロードで構成されるパケットを、ヘッダ伸張された固定長のパケットヘッダを有する第１のパケットに変換し、変換することにより得られた前記第１のパケットを一定のビットレートで出力する（Ｓ２１０２）。 Next, the first buffer 802 of the receiving device 800 converts the packet consisting of the variable-length packet header and variable-length payload stored in the received transmission packet into a fixed-length packet header that has been header-expanded. and outputs the first packet obtained by the conversion at a constant bit rate (S2102).

次に、受信装置８００の第２のバッファ８０３は、変換することにより得られた第１のパケットをパケットヘッダおよび可変長のペイロードで構成される第２のパケットに変換し、変換することにより得られた第２のパケットを一定のビットレートで出力する（Ｓ２１０３）。 Next, the second buffer 803 of the receiving device 800 converts the first packet obtained by the conversion into a second packet consisting of a packet header and a variable length payload, and The second packet thus obtained is output at a constant bit rate (S2103).

次に、受信装置８００の第３のバッファ８０４は、出力された第２のパケットをＮＡＬユニットに変換し、変換することにより得られたＮＡＬユニットを一定のビットレートで出力する（Ｓ２１０４）。 Next, the third buffer 804 of the receiving device 800 converts the output second packet into a NAL unit, and outputs the NAL unit obtained by the conversion at a constant bit rate (S2104).

次に、受信装置８００の第４のバッファ８０５は、出力されたＮＡＬユニットを順次蓄積し、蓄積した複数のＮＡＬユニットからアクセスユニットを生成し、生成したアクセスユニットを当該アクセスユニットに対応した復号時刻のタイミングでデコーダに出力する（Ｓ２１０５）。 Next, the fourth buffer 805 of the receiving device 800 sequentially accumulates the output NAL units, generates an access unit from the accumulated plurality of NAL units, and transfers the generated access unit to the decoding time corresponding to the access unit. It is output to the decoder at the timing of (S2105).

そして、受信装置８００の復号部８０６は、第４のバッファにより出力されたアクセスユニットを復号する（Ｓ２１０６）。 Then, the decoding unit 806 of the receiving device 800 decodes the access unit output by the fourth buffer (S2106).

これにより、受信装置８００は、アンダーフローやオーバーフローすることなる復号動作を行うことができる。 Thereby, the receiving device 800 can perform a decoding operation that causes underflow or overflow.

（実施の形態６）
［概要］
実施の形態６では、実施の形態５で説明した受信バッファモデルを用いた具体的な送信方法及び受信方法について説明する。 (Embodiment 6)
[overview]
In Embodiment 6, a specific transmission method and reception method using the reception buffer model described in Embodiment 5 will be described.

図７８は、ＡＲＩＢＳＴＤ－Ｂ６０に規定されるＭＭＴ／ＴＬＶ方式のプロトコルスタックを示す図である。 FIG. 78 is a diagram showing a protocol stack of the MMT/TLV method defined in ARIB STD-B60.

ＭＭＴ方式では、パケットには、映像や音声などのデータを複数のＭＰＵ（Ｍｅｄｉａ
ＰｒｅｓｅｎｔａｔｉｏｎＵｎｉｔ）やＭＦＵ（ＭｅｄｉａＦｒａｇｍｅｎｔＵｎｉｔ）などの所定のデータユニットごとに格納し、ＭＭＴＰパケットヘッダを付与することで所定のパケットとしてのＭＭＴＰパケットを生成する（ＭＭＴＰパケット化する）。また、ＭＭＴＰにおける制御メッセージなどの制御情報に対しても、ＭＭＴＰパケットヘッダを付与することで、所定のパケットとしてのＭＭＴＰパケットを生成する。ＭＭＴＰパケットヘッダには、３２ビットのショートフォーマットのＮＴＰ（ＮｅｔｗｏｒｋＴｉｍｅＰｒｏｔｏｃｏｌ：ＩＥＴＦＲＦＣ５９０５に規定）を格納するフィールドが設けられており、通信回線のＱｏＳ制御等に用いることができる。 In the MMT method, data such as video and audio is sent to multiple MPUs (Media
The MMTP packet is stored in each predetermined data unit such as Presentation Unit (Presentation Unit) or MFU (Media Fragment Unit), and an MMTP packet is generated as a predetermined packet by adding an MMTP packet header (making it into an MMTP packet). Further, by adding an MMTP packet header to control information such as a control message in MMTP, an MMTP packet as a predetermined packet is generated. The MMTP packet header is provided with a field for storing a 32-bit short format NTP (Network Time Protocol: defined in IETF RFC 5905), which can be used for QoS control of a communication line, etc.

また、送信側（送信装置）の基準クロックをＲＦＣ５９０５に規定される６４ビットのロングフォーマットＮＴＰに同期させ、同期させた当該基準クロックを基に、ＰＴＳ（ＰｒｅｓｅｎｔａｔｉｏｎＴｉｍｅＳｔａｍｐ）や、ＤＴＳ（ＤｅｃｏｄｅＴｉｍｅＳｔａｍｐ）などのタイムスタンプを同期メディアに付与する。さらに、送信側の基準クロック情報を受信側に送信し、受信装置では送信側から受信した基準クロック情報を基に受信装置におけるシステムクロックを生成する。 In addition, the reference clock on the transmitting side (transmitting device) is synchronized with the 64-bit long format NTP specified in RFC 5905, and based on the synchronized reference clock, PTS (Presentation Time Stamp) and DTS (Decode Time Stamp) A timestamp such as Stamp) is attached to the synchronized media. Further, reference clock information on the transmitting side is transmitted to the receiving side, and the receiving device generates a system clock in the receiving device based on the reference clock information received from the transmitting side.

ＰＴＳやＤＴＳは、具体的には、ＭＭＴＰの制御情報であるＭＰＵタイムスタンプ記述子や、ＭＰＵ拡張タイムスタンプ記述子に格納されて、アセット毎にＭＰテーブルに格納され、制御メッセージとしてＭＭＴＰパケット化された後、伝送される。 Specifically, PTS and DTS are stored in the MPU timestamp descriptor and MPU extended timestamp descriptor, which are MMTP control information, stored in the MP table for each asset, and converted into MMTP packets as control messages. After that, it will be transmitted.

ＭＭＴＰパケット化されたデータは、ＵＤＰヘッダやＩＰヘッダが付与されＩＰパケットにカプセル化される。このとき、ＩＰヘッダやＵＤＰヘッダにおいて、送信元ＩＰアドレス、宛先ＩＰアドレス、送信元ポート番号、宛先ポート番号、プロトコル種別が同じもののパケットの集合をＩＰデータフローとする。なお、同じＩＰデータフローのＩＰパケットは、ヘッダが冗長であるため、一部のＩＰパケットではヘッダ圧縮される。 The MMTP packetized data is encapsulated into an IP packet with a UDP header or an IP header added thereto. At this time, a set of packets having the same source IP address, destination IP address, source port number, destination port number, and protocol type in the IP header or UDP header is defined as an IP data flow. Note that since IP packets of the same IP data flow have redundant headers, some IP packets are header compressed.

また、基準クロック情報として、６４ビットのＮＴＰタイムスタンプは、ＮＴＰパケットに格納され、ＩＰパケットに格納される。このとき、ＮＴＰパケットを格納するＩＰパケットでは、送信元ＩＰアドレス、宛先ＩＰアドレス、送信元ポート番号、宛先ポート番号、及びプロトコル種別は固定値であり、ＩＰパケットのヘッダは圧縮されない。 Further, as reference clock information, a 64-bit NTP time stamp is stored in the NTP packet and in the IP packet. At this time, in the IP packet storing the NTP packet, the source IP address, destination IP address, source port number, destination port number, and protocol type are fixed values, and the header of the IP packet is not compressed.

図７９は、ＴＬＶパケットの構成を示す図である。 FIG. 79 is a diagram showing the structure of a TLV packet.

ＴＬＶパケットには、図７９に示すように、ＩＰパケット、圧縮ＩＰパケット、ＡＭＴ（ＡｄｄｒｅｓｓＭａｐＴａｂｌｅ）やＮＩＴ（ＮｅｔｗｏｒｋＩｎｆｏｒｍａｔｉｏｎＴａｂｌｅ）などの伝送制御情報をデータとして含むことができ、これらのデータは、８ビットのデータタイプを用いて識別される。また、ＴＬＶパケットでは、１６ビットのフィールドを用いてデータ長（バイト単位）が示され、そのあとにデータの値を格納する。また、ＴＬＶパケットは、データタイプの前に１バイトのヘッダ情報を有し、当該ヘッダ情報は、合計４バイトのヘッダ領域に格納される。また、ＴＬＶパケットは、高度ＢＳ伝送方式における伝送スロットにマッピングされ、ＴＭＣＣ（ＴｒａｎｓｍｉｓｓｉｏｎａｎｄＭｕｌｔｉｐｌｅｘｉｎｇＣｏｎｆｉｇｕｒａｔｉｏｎＣｏｎｔｒｏｌ）制御情報にマッピング情報が格納される。 As shown in FIG. 79, the TLV packet can include, as data, IP packets, compressed IP packets, and transmission control information such as AMT (Address Map Table) and NIT (Network Information Table). Identified using an 8-bit data type. Furthermore, in the TLV packet, the data length (in bytes) is indicated using a 16-bit field, followed by storing the data value. Further, the TLV packet has 1 byte of header information before the data type, and the header information is stored in a header area of 4 bytes in total. Further, the TLV packet is mapped to a transmission slot in the advanced BS transmission system, and mapping information is stored in TMCC (Transmission and Multiplexing Configuration Control) control information.

図８０は、受信装置のブロック図の一例を示す図である。 FIG. 80 is a diagram illustrating an example of a block diagram of a receiving device.

受信装置９００では、まずチューナー９０１で受信した放送信号に対し、復調手段９０２において伝送路符号化データを復号、誤り訂正などを施し、ＴＬＶパケットを抽出する。そして、ＴＬＶ／ＩＰＤＥＭＵＸ手段９０３では、ＴＬＶのＤＥＭＵＸ処理やＩＰのＤＥＭＵＸ処理を行う。ＴＬＶのＤＥＭＵＸ処理は、ＴＬＶパケットのデータタイプに応じた処理を行う。例えば、ＴＬＶパケットが圧縮ＩＰパケットを有する場合は、当該圧縮ＩＰパケットの圧縮されたヘッダを復元する。ＩＰＤＥＭＵＸでは、ＩＰパケットやＵＤＰパケットのヘッダ解析などの処理を行い、ＭＭＴＰパケット及びＮＴＰパケットを抽出する。 In the receiving device 900, first, the demodulating means 902 decodes the transmission line encoded data, performs error correction, etc. on the broadcast signal received by the tuner 901, and extracts the TLV packet. The TLV/IP DEMUX unit 903 performs TLV DEMUX processing and IP DEMUX processing. TLV DEMUX processing is performed according to the data type of the TLV packet. For example, if the TLV packet includes a compressed IP packet, the compressed header of the compressed IP packet is restored. The IP DEMUX performs processing such as header analysis of IP packets and UDP packets, and extracts MMTP packets and NTP packets.

ＮＴＰクロック生成手段９０４では、抽出したＮＴＰパケットからＮＴＰクロックを再生する。ＭＭＴＰＤＥＭＵＸ手段９０５では、抽出したＭＭＴＰパケットヘッダに格納されているパケットＩＤを基に映像や音声などのコンポーネントや制御情報のフィルタリング処理を行う。制御情報取得手段９０６は、ＭＰテーブルの中に格納されるタイムスタンプ記述子を取得し、ＰＴＳ／ＤＴＳ算出手段９０７は、アクセスユニット毎のＰＴＳ及びＤＴＳを算出する。なお、タイムスタンプ記述子は、ＭＰＵタイムスタンプ記述子及びＭＰＵ拡張タイムスタンプ記述子の両記述子を含む。 The NTP clock generating means 904 reproduces the NTP clock from the extracted NTP packet. The MMTP DEMUX unit 905 performs filtering processing on components such as video and audio and control information based on the packet ID stored in the extracted MMTP packet header. Control information acquisition means 906 acquires the time stamp descriptor stored in the MP table, and PTS/DTS calculation means 907 calculates PTS and DTS for each access unit. Note that the timestamp descriptor includes both an MPU timestamp descriptor and an MPU extended timestamp descriptor.

アクセスユニット再生手段９０８は、ＭＭＴＰパケットからフィルタリングされた映像や音声などを、提示する単位のデータに変換する。提示する単位のデータとは、具体的には、映像信号のＮＡＬユニットやアクセスユニット、音声フレーム、字幕の提示単位などである。復号提示手段９０９は、ＮＴＰクロックの基準時刻情報をベースに、アクセスユニットのＰＴＳ／ＤＴＳが一致した時刻に、アクセスユニットを復号、提示する。 The access unit reproduction means 908 converts the video, audio, etc. filtered from the MMTP packet into data in units to be presented. Specifically, the unit of data to be presented is a NAL unit or access unit of a video signal, an audio frame, a subtitle presentation unit, or the like. The decoding and presenting means 909 decodes and presents the access unit at the time when the PTS/DTS of the access unit match based on the reference time information of the NTP clock.

なお、受信装置の構成は、これに限るものではない。 Note that the configuration of the receiving device is not limited to this.

図８１は、伝送スロットの構成を示す図である。伝送スロットは、１フレームあたり１２０スロットで構成される。伝送スロットへの変調方式の割り当ては５スロット単位である。また、１フレーム内では、最大１６のストリームを伝送可能である。ＴＬＶパケット列により構成されるＴＬＶストリームを識別するためのｔｌｖ＿ｓｔｒｅａｍ＿ｉｄがＴＭＣＣ内に格納され、スロットに格納されるｔｌｖ＿ｓｔｒｅａｍ＿ｉｄを特定できる。スロット毎に変調方式および符号化率を変えることができ、使用するスロット数が決定されれば、ＴＬＶストリームの伝送速度は一意に決定される。 FIG. 81 is a diagram showing the configuration of a transmission slot. Each frame consists of 120 transmission slots. Modulation schemes are assigned to transmission slots in units of 5 slots. Furthermore, a maximum of 16 streams can be transmitted within one frame. A tlv_stream_id for identifying a TLV stream constituted by a TLV packet string is stored in the TMCC, and a tlv_stream_id stored in a slot can be specified. The modulation method and coding rate can be changed for each slot, and once the number of slots to be used is determined, the transmission rate of the TLV stream is uniquely determined.

次に、実施の形態５の図６８で説明した受信バッファモデルにおけるＴＬＶパケットバッファのバッファサイズの規定方法を説明する。なお、ＴＬＶパケットバッファの引き抜きレートＲｘｉは、実施の形態５で説明した引き抜きレートを用いる。受信バッファモデルについては、実施の形態５と同様であるため、説明を省略する。 Next, a method for defining the buffer size of the TLV packet buffer in the reception buffer model described in FIG. 68 of the fifth embodiment will be described. Note that the extraction rate explained in the fifth embodiment is used as the extraction rate Rxi of the TLV packet buffer. The reception buffer model is the same as in Embodiment 5, so the explanation will be omitted.

ＴＬＶパケットバッファのバッファサイズは、単位時間Ｔにおける平均パケット長ＳのＴＬＶパケットで構成され、伝送レートＲｔで入力されたＴＬＶストリームを引き抜きレートＲｘｉで引き抜いた場合に想定される最大バッファ占有量を基準として算出されることにより規定される値である。ただし、平均パケット長Ｓは、ＴＬＶストリームの帯域をすべて用いてパケットを伝送した場合の平均とし、ＴＬＶヌルパケットおよびＴＬＶ－ＳＩで伝送される帯域を除いたＴＬＶストリームの帯域を平均したサイズではない。なお、ヘッダ圧縮量およびヘッダ伸張量が最も大きいパケットは、可変長パケットにおいて最もサイズの小さなパケットであり、上記ＴＬＶパケットバッファへの占有量が最大となるケースは、最小パケットサイズのパケットが最も連続した場合である。最小パケットサイズとなるパケットサイズは、ＴＬＶバッファのヘッダ量として規定もよいし、ＭＭＴパケットを格納する際の、ＴＬＶ／ＩＰ／ＵＤＰ／ＭＭＴヘッダの最小値としてもよい。 The buffer size of the TLV packet buffer is composed of TLV packets with an average packet length S in unit time T, and is based on the maximum buffer occupancy expected when a TLV stream input at a transmission rate Rt is extracted at a extraction rate Rxi. This is a value defined by calculating as follows. However, the average packet length S is the average size when packets are transmitted using all the TLV stream bandwidth, and is not the average size of the TLV stream bandwidth excluding the TLV null packet and the bandwidth transmitted by TLV-SI. . Note that the packet with the largest amount of header compression and header expansion is the smallest size packet among variable-length packets, and in the case where the amount of occupancy in the TLV packet buffer is the largest, the packet with the smallest packet size is the most consecutive packet. This is the case. The minimum packet size may be defined as the header amount of the TLV buffer, or may be the minimum value of the TLV/IP/UDP/MMT header when storing the MMT packet.

なお、上記の受信バッファモデルにおけるＴＬＶパケットバッファのバッファサイズの規定は言い換えれば、単位時間あたりに伝送されるＴＬＶパケットの最大パケット数を規定することと等価である。また、上記バッファサイズの規定は、単位時間あたりの最小パケットサイズのパケットの連続数を制約する規定とも言える。 In other words, the regulation of the buffer size of the TLV packet buffer in the above reception buffer model is equivalent to the regulation of the maximum number of TLV packets transmitted per unit time. Furthermore, the above buffer size regulation can also be said to be a regulation that limits the number of consecutive packets of the minimum packet size per unit time.

なお、上記バッファサイズの規定に用いられる、入力されるＴＬＶストリームの伝送レートＲｔは、システムにおいて想定される最大伝送レートを基準とすることが望ましい。なお、必ずしも最大伝送レートである必要はない。また、単位時間Ｔの設定方法は任意であるが、送出側の制御の容易さを考慮して設定することが望ましい。 Note that it is desirable that the transmission rate Rt of the input TLV stream, which is used to define the buffer size, be based on the maximum transmission rate expected in the system. Note that the transmission rate does not necessarily have to be the maximum transmission rate. Furthermore, although the method for setting the unit time T is arbitrary, it is desirable to set it in consideration of ease of control on the sending side.

送出設備（送信装置）は、例えば、入力の伝送レートＲｔ、および引き抜きレートＲｘｉを用いて決定されたバッファサイズをオーバーフローさせないように、パケット化しなければならない。 The transmission equipment (transmission device) must perform packetization so as not to overflow the buffer size determined using, for example, the input transmission rate Rt and the extraction rate Rxi.

なお、ＴＬＶパケットバッファのバッファサイズの算出で用いた伝送レートＲｔ、単位時間Ｔ、平均パケット長Ｓは、算出基準とするための値であり、必ずしも前記値を用いて送出することを規定するものではない。実際にはＴＬＶストリームの伝送速度は、伝送するレートによって異なり、例えば高度ＢＳ伝送方式（ＡＲＩＢＳＴＤ－Ｂ４４）を用いて伝送する場合は、１２０スロットのうち、いくつのスロット数を用いて伝送するか、あるいは、伝送するスロットをどの変調方式と符号化率の組み合わせで伝送するか等により伝送レートは異なる。したがって、送出設備は、どの伝送レートで伝送する場合でも、上記バッファをオーバーフローさせないように伝送すればよい。バッファをオーバーフローさせないように伝送できれば、どのような制御、アルゴリズムを用いて伝送してもよい。 Note that the transmission rate Rt, unit time T, and average packet length S used in calculating the buffer size of the TLV packet buffer are values used as calculation standards, and do not necessarily stipulate that the above values are used for transmission. isn't it. In reality, the transmission speed of a TLV stream varies depending on the transmission rate. For example, when transmitting using the advanced BS transmission method (ARIB STD-B44), how many slots out of 120 slots are used for transmission? Alternatively, the transmission rate differs depending on which combination of modulation method and coding rate is used to transmit the slot. Therefore, no matter what transmission rate the transmission equipment transmits, it is sufficient to transmit without overflowing the buffer. Any control or algorithm may be used for transmission as long as it can be transmitted without overflowing the buffer.

つまり、単位時間当たりに伝送されるＴＬＶパケットの最大パケット数は、上記の伝送レートに応じて設定された、互いに異なる値である複数のパケット数を含み、所定のデータ単位の伝送レートに対応するパケット数以下の複数のＴＬＶパケットを単位時間当たりに伝送することで、ＴＬＶパケットバッファがオーバーフローしないようにしてもよい。 In other words, the maximum number of TLV packets transmitted per unit time includes multiple numbers of packets with different values set according to the above transmission rate, and corresponds to the transmission rate of a predetermined data unit. The TLV packet buffer may be prevented from overflowing by transmitting a plurality of TLV packets equal to or less than the number of packets per unit time.

受信装置は、どの伝送レートで伝送された場合でも、上記で説明したバッファサイズ、引き抜きレートに基づくバッファを実装すれば、バッファをオーバーフローさせることなく、安定して受信動作が可能となる。つまり、受信装置は、伝送レートＲｔ、平均パケット長Ｓ、及び引き抜きレートＲｘｉを用いて求められる最大バッファ占有量よりも大きいバッファサイズのバッファを実装すればよい。 No matter what transmission rate transmission is performed, the receiving device can perform stable receiving operations without overflowing the buffer by implementing a buffer based on the buffer size and extraction rate described above. In other words, the receiving device only needs to implement a buffer with a buffer size larger than the maximum buffer occupancy determined using the transmission rate Rt, average packet length S, and extraction rate Rxi.

次に、伝送フレーム長単位（単位伝送期間）の所定の最大ＴＬＶパケット数を規定することのできる、ＴＬＶバッファモデルサイズの規定方法について説明する。 Next, a method for defining the TLV buffer model size that can define a predetermined maximum number of TLV packets per transmission frame length unit (unit transmission period) will be described.

図８２は、伝送フレームと、伝送フレームに格納される特定のｔｌｖ＿ｓｔｒｅａｍ＿ｉｄを有する１つのＴＬＶストリーム（ＴＬＶパケット列）とを示す図である。 FIG. 82 is a diagram showing a transmission frame and one TLV stream (TLV packet sequence) having a specific tlv_stream_id stored in the transmission frame.

また、図８２における（１）は、上述したバッファサイズの規定方法において、単位時間Ｔを、伝送フレームを所定の伝送レートで伝送した場合の伝送フレーム長とした場合を示す図である。伝送フレーム長は、平均パケット長Ｓの平均区間、或いは、最大ＴＬＶパケット数、最小パケットサイズの最大連続数が制約される区間である。例えば、高度ＢＳ伝送方式（ＡＲＩＢＳＴＤ－Ｂ４４）では、伝送フレーム長は３３．０４６４７ｍｓである。ここで、伝送フレーム長あたりに伝送できる最小パケットサイズの最大連続数はＭであるとする。 Further, (1) in FIG. 82 is a diagram showing a case where the unit time T is the transmission frame length when the transmission frame is transmitted at a predetermined transmission rate in the buffer size regulation method described above. The transmission frame length is the average section of the average packet length S, or the section where the maximum number of TLV packets and the maximum number of consecutive minimum packet sizes are restricted. For example, in the advanced BS transmission method (ARIB STD-B44), the transmission frame length is 33.04647 ms. Here, it is assumed that the maximum consecutive number of minimum packet sizes that can be transmitted per transmission frame length is M.

この場合、送出側（送信装置）は、ＴＬＶパケットバッファをオーバーフローさせないようにＴＬＶストリームを送出するために、伝送フレームの境界に関わらず、図８２の（１）に示すＡ区間、Ｂ区間、及びＣ区間のどの区間においても、ＴＬＶストリームに格納されるＴＬＶパケットの最大パケット数以下となるように、ＴＬＶストリームを生成するバッファモデルを満たすように送出してもよい。ここで、Ａ区間、Ｂ区間及びＣ区間のそれぞれの区間の区間長は、伝送フレーム長である。つまり、この場合、送出設備（送信装置）は、単位時間あたりに所定のパケット数（最大ＴＬＶパケット数）以下の複数のＴＬＶパケットを送出（送信）してもよい。 In this case, the sending side (transmitting device) sends out the TLV stream without overflowing the TLV packet buffer, regardless of the boundaries of the transmission frame. In any section of section C, the TLV stream may be transmitted so as to satisfy the buffer model for generating the TLV stream so that the number of TLV packets is equal to or less than the maximum number of TLV packets stored in the TLV stream. Here, the length of each of the A section, B section, and C section is the transmission frame length. That is, in this case, the transmission equipment (transmission device) may transmit (transmit) a plurality of TLV packets that are equal to or less than a predetermined number of packets (maximum number of TLV packets) per unit time.

また、所定のパケット数は、最小パケットサイズのＴＬＶパケットが連続しないように設定された値であってもよい。例えば、上述したように、所定のパケット数は、最小パケットサイズの最大連続数Ｍに設定された場合に算出される値であってもよい。 Further, the predetermined number of packets may be a value set so that TLV packets of the minimum packet size are not consecutive. For example, as described above, the predetermined number of packets may be a value calculated when the maximum consecutive number M of the minimum packet size is set.

また、例えば伝送フレーム単位などの所定のデータ単位で、伝送フレームあたりの最大パケット数、あるいは平均パケット長、あるいは最小パケットサイズのパケット連続数を規定してもよい。 Further, the maximum number of packets per transmission frame, the average packet length, or the number of consecutive packets of the minimum packet size may be defined in a predetermined data unit such as a transmission frame unit.

例えば、図８２における（２）に示すＤ区間、Ｅ区間及びＦ区間のように、伝送フレーム単位で、最大パケット数、あるいは平均パケット長、あるいは最小パケットサイズのパケット連続数を規定してもよい。つまり、送出設備（送信装置）は、互いに重ならない単位時間の期間であって、連続した複数の単位伝送期間（複数の伝送フレームに対応する複数の伝送フレーム長における期間）のそれぞれについて、当該単位伝送期間において伝送フレームを送信することで、単位時間あたりに所定のパケット数（最大ＴＬＶパケット数）以下の複数のＴＬＶパケットを送出（送信）してもよい。この場合、所定のデータ単位としての伝送フレーム単位（つまり１つの伝送フレーム）には、所定のパケット数以下の複数のＴＬＶパケットが格納される。 For example, the maximum number of packets, the average packet length, or the number of consecutive packets of the minimum packet size may be defined for each transmission frame, as shown in section (2) in FIG. 82, section E, and section F. . In other words, the transmission equipment (transmission device) transmits data in units of time that do not overlap each other, for each of a plurality of consecutive unit transmission periods (periods in a plurality of transmission frame lengths corresponding to a plurality of transmission frames). By transmitting a transmission frame during the transmission period, a plurality of TLV packets equal to or less than a predetermined number of packets (maximum number of TLV packets) may be sent out (sent) per unit time. In this case, a plurality of TLV packets equal to or less than a predetermined number of packets are stored in a transmission frame unit (that is, one transmission frame) as a predetermined data unit.

次に、受信バッファモデルにおけるＴＬＶパケットバッファのバッファサイズの算出方法の具体例について説明する。 Next, a specific example of a method for calculating the buffer size of the TLV packet buffer in the reception buffer model will be described.

ここで、伝送フレーム単位あたりに伝送できる最小パケットサイズの最大連続数をＭに設定する場合、実際に伝送されるＴＬＶストリームにおいて、最小パケットサイズの最大連続パケット数は２×Ｍとなる。なお、最大連続数Ｍは、上記伝送フレーム長あたりに伝送できる最小パケットサイズの最大連続数と同じ値である。図８２の例では、ＴＬＶストリームがＤ区間の後半にＭパケット連続し、Ｅ区間の前半にＭパケット連続するケースが最大連続パケット数となり、合計で（２×Ｍ）パケットが連続する。受信装置が（２×Ｍ）パケットの最小パケットサイズの連続に耐えうるバッファサイズを持つためには、伝送フレーム長あたりに伝送できる最小パケットサイズの最大連続数がＭとした場合に上述の方法を用いて算出されるバッファサイズの２倍のバッファサイズが必要となる。 Here, when the maximum number of consecutive packets of the minimum packet size that can be transmitted per transmission frame unit is set to M, the maximum number of consecutive packets of the minimum packet size in the TLV stream that is actually transmitted is 2×M. Note that the maximum consecutive number M is the same value as the maximum consecutive number of minimum packet sizes that can be transmitted per the transmission frame length. In the example of FIG. 82, the maximum number of consecutive packets is the case where the TLV stream has M packets consecutively in the second half of the D section and M packets consecutively in the first half of the E section, and (2×M) packets are consecutive in total. In order for the receiving device to have a buffer size that can withstand successive minimum packet sizes of (2 x M) packets, the above method should be used, assuming that the maximum number of consecutive minimum packet sizes that can be transmitted per transmission frame length is M. A buffer size that is twice the buffer size calculated using this method is required.

言い換えれば、単位時間Ｔを伝送フレーム長として上述の方法を用いて算出されるバッファサイズの２倍以上のバッファサイズを持つようにＴＬＶバッファサイズを規定することにより、送出設備は、単位時間Ｔを伝送フレーム長とした場合に算出される、最大パケット数、平均パケット長、あるいは最小パケットサイズのパケット連続数を、伝送フレーム単位での最大パケット数、平均パケット長、あるいは最小パケットサイズのパケット連続数とした場合でも、バッファモデルを満たすように送出することができる。 In other words, by specifying the TLV buffer size to have a buffer size that is more than twice the buffer size calculated using the method described above, where the unit time T is the transmission frame length, the transmission equipment can The maximum number of packets, average packet length, or number of consecutive packets with the minimum packet size calculated when the transmission frame length is calculated as the maximum number of packets, average packet length, or number of consecutive packets with the minimum packet size per transmission frame. Even in this case, it can be sent to satisfy the buffer model.

以上のように、ＴＬＶパケットバッファのバッファサイズ、引き抜きレートを設定することにより、伝送フレーム単位の最大パケット数、平均パケット長、あるいは最小パケットサイズのパケット連続数を規定することができる。なお、最大パケット数、平均パケット長、及び最小パケットサイズのパケット連続数は、ＴＬＶパケットバッファのバッファサイズ、入力レート（伝送レート）、引き抜きレート、及び最小パケットサイズにより算出できる。 As described above, by setting the buffer size and extraction rate of the TLV packet buffer, it is possible to define the maximum number of packets per transmission frame, the average packet length, or the number of consecutive packets of the minimum packet size. Note that the maximum number of packets, the average packet length, and the number of consecutive packets of the minimum packet size can be calculated from the buffer size of the TLV packet buffer, the input rate (transmission rate), the extraction rate, and the minimum packet size.

したがって、ＴＬＶパケットバッファのバッファサイズ、引き抜きレート、及び最小パケットサイズが定まっている場合は、ＴＬＶストリームの伝送速度（伝送レート）ごとに、伝送フレームに格納される、最大パケット数、平均パケット長、あるいは最小パケットサイズのパケット連続数を規定することができ、送出制御（パケット化、あるいは伝送フレームへのマッピング）が容易となる。 Therefore, if the buffer size, extraction rate, and minimum packet size of the TLV packet buffer are fixed, the maximum number of packets, average packet length, and Alternatively, the number of consecutive packets of the minimum packet size can be defined, which facilitates transmission control (packetization or mapping to transmission frames).

次に、図８２で説明した方法を用いて、ＴＬＶパケットバッファのバッファサイズ、および、伝送フレーム単位での最大パケット数、（あるいは最小パケットサイズのパケット最大連続数）を規定した場合の、送出方法について図８３を用いて説明する。図８３は、ＴＬＶパケットバッファのバッファサイズ、および、伝送フレーム単位での最大パケット数を規定した場合の送出方法のフローチャートである。 Next, using the method explained in FIG. 82, a transmission method when the buffer size of the TLV packet buffer and the maximum number of packets per transmission frame (or the maximum number of consecutive packets of the minimum packet size) are specified. This will be explained using FIG. 83. FIG. 83 is a flowchart of a sending method when the buffer size of the TLV packet buffer and the maximum number of packets per transmission frame are defined.

なお、フローチャートおよび説明ではＴＬＶパケットを伝送フレームに格納する例を示しているが、ＴＬＶパケットを伝送フレームに格納する場合に限らず、格納を検討する場合にも同フローを用いることができる。 Note that although the flowchart and description show an example of storing TLV packets in transmission frames, the same flow can be used not only when storing TLV packets in transmission frames but also when considering storage.

まず、伝送フレーム単位に、ＭＭＴパケットが格納されるＴＬＶパケットの格納の検討を開始する。 First, we will start considering storage of TLV packets in which MMT packets are stored in transmission frame units.

伝送フレームに格納するＴＬＶパケットがＭＭＴＰパケットを格納するＴＬＶパケット（実パケット）であるか否かを判定する（Ｓ２２０１）。 It is determined whether the TLV packet stored in the transmission frame is a TLV packet (actual packet) storing an MMTP packet (S2201).

当該ＴＬＶパケットがＭＭＴＰパケットを格納しているＴＬＶパケットであると判定した場合（Ｓ２２０１でＹｅｓ）、当該ＴＬＶパケットを伝送フレームに格納し、伝送フレーム内に格納されるパケット数を＋１カウントする（Ｓ２２０２）。 If it is determined that the TLV packet is a TLV packet storing an MMTP packet (Yes in S2201), the TLV packet is stored in the transmission frame, and the number of packets stored in the transmission frame is counted by +1 (S2202). ).

一方、ＴＬＶパケットがＭＭＴパケット以外を格納するＴＬＶパケット（ＮＵＬＬパケット或いはＴＬＶ－ＳＩ）であると判定した場合（Ｓ２２０１でＮｏ）、ＭＭＴパケット以外を格納するＴＬＶパケットのサイズを累積し、１．５ｋＢ（ＭＴＵサイズ）ごとにパケット数を１カウントする（Ｓ２２０３）。 On the other hand, if it is determined that the TLV packet is a TLV packet (NULL packet or TLV-SI) that stores other than MMT packets (No in S2201), the size of TLV packets that store other than MMT packets is accumulated and the size is 1.5 kB. The number of packets is counted by 1 for each (MTU size) (S2203).

つまり、ＴＬＶパケットには、無効なパケット（ＮＵＬＬパケット）とは異なる実パケット（ＭＭＴＰパケット）を格納している第１伝送パケットと、ＮＵＬＬパケットを格納している第２伝送パケットとの２種類がある。また、第１の伝送パケットは所定のパケットサイズの最大値が規定されており、第２の伝送パケットは第１の伝送パケットのパケットサイズの最大値を超えるパケットサイズとなる場合があるパケットと言う事もできる。ＴＬＶパケットを伝送フレームに格納するときにおいて、第１伝送パケットを伝送フレームに格納する場合、第１伝送パケットの数を第１パケット数としてカウントし、第２伝送パケットを伝送フレームに格納する場合、第２伝送パケットのパケットサイズを第１伝送パケットのパケットサイズの最大値（１．５ｋＢ）で除すことにより得られた整数値を第２パケット数としてカウントし、カウントすることにより得られた第１パケット数及び第２パケット数の総和が所定のパケット数以下となるように、複数のＴＬＶパケットを伝送フレームに格納する。なお、第２パケット数は、より具体的には、第２伝送パケットのパケットサイズを第１伝送パケットのパケットサイズの最大値で除して得られた商に１を加算した整数値である。例えば、第２のパケットのパケットサイズが常に第１のパケットのパケットサイズの最大値より小さい場合は、第２伝送パケットの数が第２パケット数と等価となることは言うまでもない。 In other words, there are two types of TLV packets: a first transmission packet that stores a real packet (MMTP packet) that is different from an invalid packet (NULL packet), and a second transmission packet that stores a NULL packet. be. Furthermore, the first transmission packet is defined to have a predetermined maximum packet size, and the second transmission packet is a packet whose packet size may exceed the maximum packet size of the first transmission packet. I can do things. When storing TLV packets in a transmission frame, when storing a first transmission packet in a transmission frame, the number of first transmission packets is counted as the number of first packets, and when storing a second transmission packet in a transmission frame, The integer value obtained by dividing the packet size of the second transmission packet by the maximum packet size (1.5 kB) of the first transmission packet is counted as the number of second packets, and the A plurality of TLV packets are stored in a transmission frame such that the sum of the number of first packets and the number of second packets is equal to or less than a predetermined number of packets. Note that the second packet number is, more specifically, an integer value obtained by adding 1 to the quotient obtained by dividing the packet size of the second transmission packet by the maximum value of the packet size of the first transmission packet. For example, if the packet size of the second packet is always smaller than the maximum packet size of the first packet, it goes without saying that the number of second transmission packets will be equivalent to the number of second packets.

ステップＳ２２０２またはステップＳ２２０３の後において、伝送フレーム内の累積ＴＬＶパケットサイズ（合計データサイズ）を算出する（Ｓ２２０４）。つまり、１つの伝送フレーム内に格納するＴＬＶパケットのパケットサイズの総和を算出する。 After step S2202 or step S2203, the cumulative TLV packet size (total data size) within the transmission frame is calculated (S2204). That is, the total sum of packet sizes of TLV packets stored in one transmission frame is calculated.

次に、ステップＳ２２０４で算出した合計データサイズが伝送フレーム単位の最大データサイズに達しているか否かを判定する（Ｓ２２０５）。 Next, it is determined whether the total data size calculated in step S2204 has reached the maximum data size in transmission frame units (S2205).

合計データサイズが伝送フレーム単位の最大データサイズに達している場合（ステップＳ２２０５でＹｅｓ）、ＴＬＶパケットを挿入せずにヌルパケットを配置して処理を終了する。 If the total data size has reached the maximum data size in transmission frame units (Yes in step S2205), a null packet is placed without inserting a TLV packet, and the process ends.

一方で、合計データサイズが伝送フレーム単位の最大データサイズに達していない場合（ステップＳ２２０５でＮｏ）、ステップＳ２２０２で算出した第１パケット数及びステップＳ２２０３において算出した第２パケット数の総和が伝送フレーム単位の最大パケット数（つまり、所定のパケット数）に達しているか否かを判定する（Ｓ２２０６）。 On the other hand, if the total data size does not reach the maximum data size per transmission frame (No in step S2205), the sum of the first packet number calculated in step S2202 and the second packet number calculated in step S2203 is the transmission frame. It is determined whether the maximum number of packets per unit (that is, a predetermined number of packets) has been reached (S2206).

そして、第１パケット数及び第２パケット数の総和が最大パケット数に達していると判定した場合（ステップＳ２２０６でＹｅｓ）、ＭＭＴＰパケットが格納されたＴＬＶパケット（第１伝送パケット）の伝送フレームへの格納を完了し、次のＭＭＴＰパケットが格納されたＴＬＶパケットは次の伝送フレームに格納する。 If it is determined that the sum of the first packet number and the second packet number has reached the maximum number of packets (Yes in step S2206), the transmission frame of the TLV packet (first transmission packet) in which the MMTP packet is stored is transmitted. The TLV packet in which the next MMTP packet has been stored is stored in the next transmission frame.

また、当該伝送フレームの残りの伝送帯域には、ＭＭＴＰパケットが格納されたＴＬＶパケット（第１伝送パケット）以外の例えばＮＵＬＬパケット或いはＴＬＶ－ＳＩが格納されたＴＬＶパケット（第２伝送パケット）を格納する（Ｓ２２０７）。 In addition, in the remaining transmission band of the transmission frame, other than the TLV packet (first transmission packet) in which the MMTP packet is stored, for example, a NULL packet or a TLV packet (second transmission packet) in which TLV-SI is stored is stored. (S2207).

以上のように、受信バッファモデルにおけるＴＬＶパケットバッファの規定を満たすための制御を、伝送フレームに格納するパケット数や、パケットサイズ、伝送フレームに格納できるバイト数、伝送フレームに格納できる最大パケット数のみを用いて容易に制御できる。 As described above, control to satisfy the TLV packet buffer specifications in the receive buffer model is limited to the number of packets to be stored in a transmission frame, the packet size, the number of bytes that can be stored in a transmission frame, and the maximum number of packets that can be stored in a transmission frame. can be easily controlled using

また、伝送フレームに格納されるＴＬＶパケットの最大パケット数を規定できることにより、受信装置の設計も容易となる。つまり、受信装置は、ＴＬＶパケットバッファとして機能するバッファのバッファサイズを、伝送レート、伝送パケットの平均パケット長、及び引き抜きレートを用いて求められる最大バッファ占有量よりも大きく設計すればよい。 Furthermore, by being able to define the maximum number of TLV packets stored in a transmission frame, the design of the receiving device becomes easier. In other words, the receiving device only needs to design the buffer size of the buffer that functions as a TLV packet buffer to be larger than the maximum buffer occupancy determined using the transmission rate, the average packet length of transmission packets, and the extraction rate.

なお、最大ＴＬＶストリームの伝送速度毎（変調方式、符号化率、使用伝送スロット数毎）に、１フレーム毎の１ＴＬＶストリームあたりの最大パケット数、平均パケット長、あるいは最小パケットサイズのパケット連続数を規定する場合、他の制約と合わせて規定してもよい。例えば、平均パケット長がＴＳパケット（平均１８８バイト或いは平均１８７バイト）以下とならにように制約してもよい。 In addition, for each maximum TLV stream transmission rate (modulation method, coding rate, number of used transmission slots), the maximum number of packets per TLV stream per frame, the average packet length, or the number of consecutive packets with the minimum packet size. If specified, it may be specified together with other constraints. For example, the average packet length may be restricted to be less than or equal to a TS packet (188 bytes or 187 bytes on average).

可変長パケットのパケット数が、固定長のＴＳパケットを格納する場合より大きくならない制約となり、実装、および受信装置の設計が容易となる。 This restricts the number of variable-length packets from becoming larger than when storing fixed-length TS packets, which facilitates implementation and design of the receiving device.

なお、ＴＬＶパケットバッファに入力されるデータを、特定のｔｌｖ＿ｓｔｒｅａｍ＿ｉｄを有する１つのＴＬＶストリームとしたが、特定のｔｌｖ＿ｓｔｒｅａｍ＿ｉｄ、特定のＩＰアドレス、特定のＵＤＰポート番号を有する１つのＩＰデータフローとすることにより、ＩＰデータフローごとの規定として置き換えてもよい。 Note that the data input to the TLV packet buffer is one TLV stream with a specific tlv_stream_id, but by setting it as one IP data flow with a specific tlv_stream_id, a specific IP address, and a specific UDP port number. , may be replaced as a regulation for each IP data flow.

また、例えば、高度ＢＳ／ＣＳ放送において、ＣＳの場合は、１つのＴＬＶストリーム内に複数のＩＰデータフロー（＝サービス）が格納される。この場合、サービス毎に受信バッファモデルや、伝送フレーム単位での最大パケット数等を規定することができる。 Further, for example, in advanced BS/CS broadcasting, in the case of CS, a plurality of IP data flows (=services) are stored in one TLV stream. In this case, a reception buffer model, a maximum number of packets per transmission frame, etc. can be defined for each service.

［補足：送信装置及び受信装置］
以上のように、受信装置のバッファ動作を保証するために予め定められた受信バッファモデルによる規定を満たした状態で複数の伝送パケットが格納された所定のデータ単位を送信する送信装置は、図８４のように構成することも可能である。また、所定のデータ単位を受信する受信装置は、図８５のように構成することも可能である。図８４は、送信装置の具体的構成の例を示す図である。図８５は、受信装置の具体的構成の例を示す図である。 [Supplement: Transmitting device and receiving device]
As described above, a transmitting device that transmits a predetermined data unit in which a plurality of transmission packets are stored while satisfying the regulations according to a predetermined receive buffer model to guarantee the buffer operation of the receiving device is shown in FIG. It is also possible to configure as follows. Further, a receiving device that receives a predetermined data unit can also be configured as shown in FIG. 85. FIG. 84 is a diagram showing an example of a specific configuration of a transmitting device. FIG. 85 is a diagram showing an example of a specific configuration of a receiving device.

送信装置１０００は、送信部１００１を備える。送信部１００１は、例えば、マイクロコンピュータ、プロセッサ、または、専用回路などによって実現される。つまり、送信装置１０００を構成する各処理部は、ソフトウェアによって実現されてもよいし、ハードウェアによって実現されてもよい。 Transmitting device 1000 includes a transmitting section 1001. The transmitter 1001 is realized by, for example, a microcomputer, a processor, a dedicated circuit, or the like. That is, each processing unit configuring transmitting device 1000 may be realized by software or hardware.

受信装置１１００は、受信部１１０１と、バッファ１１０２とを備える。受信部１１０１及びバッファ１１０２のそれぞれは、例えば、マイクロコンピュータ、プロセッサ、または、専用回路などによって実現される。つまり、受信装置１１００を構成する各処理部は、ソフトウェアによって実現されてもよいし、ハードウェアによって実現されてもよい。 Receiving device 1100 includes a receiving section 1101 and a buffer 1102. Each of the receiving unit 1101 and the buffer 1102 is realized by, for example, a microcomputer, a processor, or a dedicated circuit. That is, each processing unit configuring receiving device 1100 may be realized by software or hardware.

送信装置１０００及び受信装置１１００の各構成要素についての詳細な説明は、それぞれ、送信方法及び受信方法の説明において行う。 A detailed explanation of each component of the transmitting device 1000 and the receiving device 1100 will be given in the description of the transmitting method and the receiving method, respectively.

まず、送信方法については、図８６を用いて説明する。図８６は、送信装置による動作フロー（送信方法）である。 First, the transmission method will be explained using FIG. 86. FIG. 86 is an operation flow (transmission method) by the transmitting device.

送信方法は、受信装置１１００のバッファ動作を保証するために予め定められた受信バッファモデルによる規定を満たした状態で複数の伝送パケットを送信する送信方法である。 The transmission method is a transmission method in which a plurality of transmission packets are transmitted while satisfying regulations based on a predetermined reception buffer model in order to guarantee buffer operation of the reception device 1100.

まず、送信装置１０００の送信部１００１は、単位時間あたりに所定のパケット数以下の複数の伝送パケットを送信する（Ｓ２３０１）。 First, the transmitter 1001 of the transmitter 1000 transmits a plurality of transmission packets equal to or less than a predetermined number of packets per unit time (S2301).

なお、伝送パケットは、可変長のパケットヘッダおよび可変長のペイロードで構成される。また、受信バッファモデルは、伝送パケットを受信し、受信した伝送パケットに格納されている可変長のパケットヘッダおよび可変長のペイロードで構成される第１パケットを、ヘッダ伸張された固定長のパケットヘッダを有する第２パケットに変換し、変換することにより得られた第２パケットを所定の引き抜きレートで出力するバッファを含む。 Note that the transmission packet is composed of a variable length packet header and a variable length payload. In addition, the reception buffer model receives a transmission packet, converts the first packet consisting of a variable-length packet header and a variable-length payload stored in the received transmission packet into a fixed-length packet header that has been header expanded. , and outputs the second packet obtained by the conversion at a predetermined extraction rate.

これにより、送信装置１０００は、ＭＭＴのような方式を用いてデータ伝送する場合に、受信装置１１００のバッファ動作を保証できる。 Thereby, the transmitting device 1000 can guarantee the buffer operation of the receiving device 1100 when transmitting data using a method such as MMT.

次に、受信方法について、図８７を用いて説明する。図８７は、受信装置による動作フロー（受信方法）である。 Next, the reception method will be explained using FIG. 87. FIG. 87 is an operation flow (receiving method) by the receiving device.

まず、受信装置１１００の受信部１１０１は、それぞれが可変長のパケットヘッダおよび可変長のペイロードで構成される複数の伝送パケットで構成される複数の伝送パケットを受信する（Ｓ２４０１）。 First, the receiving unit 1101 of the receiving device 1100 receives a plurality of transmission packets each composed of a variable-length packet header and a variable-length payload (S2401).

次に、受信装置１１００のバッファ１１０２は、受信された複数の伝送パケットに格納されている可変長のパケットヘッダおよび可変長のペイロードで構成される第１パケットを、ヘッダ伸張された固定長のパケットヘッダを有する第２パケットに変換し、変換することにより得られた第２パケットを所定の引き抜きレートで出力する（Ｓ２４０２）。 Next, the buffer 1102 of the receiving device 1100 converts the first packet composed of the variable-length packet header and variable-length payload stored in the plurality of received transmission packets into a header-expanded fixed-length packet. The second packet is converted into a second packet having a header, and the second packet obtained by the conversion is output at a predetermined extraction rate (S2402).

なお、バッファのバッファサイズは、複数の伝送パケットの伝送レート、伝送パケットの平均パケット長、及び引き抜きレートを用いて求められる最大バッファ占有量よりも大きい。 Note that the buffer size of the buffer is larger than the maximum buffer occupancy calculated using the transmission rate of a plurality of transmission packets, the average packet length of transmission packets, and the extraction rate.

これにより、受信装置１１００は、バッファにおいてアンダーフローやオーバーフローが発生することなく処理を行うことができる。 Thereby, the receiving device 1100 can perform processing without underflow or overflow occurring in the buffer.

（その他の実施の形態）
以上、実施の形態に係る送信装置、受信装置、送信方法及び受信方法ついて説明したが、本発明は、この実施の形態に限定されるものではない。 (Other embodiments)
Although the transmitting device, the receiving device, the transmitting method, and the receiving method according to the embodiment have been described above, the present invention is not limited to this embodiment.

また、上記実施の形態に係る送信装置及び受信装置に含まれる各処理部は典型的には集積回路であるＬＳＩとして実現される。これらは個別に１チップ化されてもよいし、一部又は全てを含むように１チップ化されてもよい。 Furthermore, each processing section included in the transmitting device and receiving device according to the above embodiments is typically realized as an LSI, which is an integrated circuit. These may be integrated into one chip individually, or may be integrated into one chip including some or all of them.

また、集積回路化はＬＳＩに限るものではなく、専用回路又は汎用プロセッサで実現してもよい。ＬＳＩ製造後にプログラムすることが可能なＦＰＧＡ（ＦｉｅｌｄＰｒｏｇｒａｍｍａｂｌｅＧａｔｅＡｒｒａｙ）、又はＬＳＩ内部の回路セルの接続や設定を再構成可能なリコンフィギュラブル・プロセッサを利用してもよい。 Further, circuit integration is not limited to LSI, and may be realized using a dedicated circuit or a general-purpose processor. An FPGA (Field Programmable Gate Array) that can be programmed after the LSI is manufactured, or a reconfigurable processor that can reconfigure the connections and settings of circuit cells inside the LSI may be used.

上記各実施の形態において、各構成要素は、専用のハードウェアで構成されるか、各構成要素に適したソフトウェアプログラムを実行することによって実現されてもよい。各構成要素は、ＣＰＵ又はプロセッサなどのプログラム実行部が、ハードディスク又は半導体メモリなどの記録媒体に記録されたソフトウェアプログラムを読み出して実行することによって実現されてもよい。 In each of the embodiments described above, each component may be configured with dedicated hardware, or may be realized by executing a software program suitable for each component. Each component may be realized by a program execution unit such as a CPU or a processor reading and executing a software program recorded on a recording medium such as a hard disk or a semiconductor memory.

言い換えると、送信装置及び受信装置は、処理回路（ｐｒｏｃｅｓｓｉｎｇｃｉｒｃｕｉｔｒｙ）と、当該処理回路に電気的に接続された（当該制御回路からアクセス可能な）記憶装置（ｓｔｏｒａｇｅ）とを備える。処理回路は、専用のハードウェア及びプログラム実行部の少なくとも一方を含む。また、記憶装置は、処理回路がプログラム実行部を含む場合には、当該プログラム実行部により実行されるソフトウェアプログラムを記憶する。処理回路は、記憶装置を用いて、上記実施の形態に係る送信方法又は受信方法を実行する。 In other words, the transmitting device and the receiving device include a processing circuit and a storage electrically connected to the processing circuit (accessible from the control circuit). The processing circuit includes at least one of dedicated hardware and a program execution unit. Further, when the processing circuit includes a program execution section, the storage device stores a software program executed by the program execution section. The processing circuit uses the storage device to execute the transmission method or reception method according to the above embodiment.

さらに、本発明は上記ソフトウェアプログラムであってもよいし、上記プログラムが記録された非一時的なコンピュータ読み取り可能な記録媒体であってもよい。また、上記プログラムは、インターネット等の伝送媒体を介して流通させることができるのは言うまでもない。 Furthermore, the present invention may be the above-mentioned software program, or may be a non-transitory computer-readable recording medium on which the above-mentioned program is recorded. Furthermore, it goes without saying that the above program can be distributed via a transmission medium such as the Internet.

また、上記で用いた数字は、全て本発明を具体的に説明するために例示するものであり、本発明は例示された数字に制限されない。 Moreover, all the numbers used above are exemplified to specifically explain the present invention, and the present invention is not limited to the illustrated numbers.

また、ブロック図における機能ブロックの分割は一例であり、複数の機能ブロックを一つの機能ブロックとして実現したり、一つの機能ブロックを複数に分割したり、一部の機能を他の機能ブロックに移してもよい。また、類似する機能を有する複数の機能ブロックの機能を単一のハードウェア又はソフトウェアが並列又は時分割に処理してもよい。 Furthermore, the division of functional blocks in the block diagram is just an example; multiple functional blocks can be realized as one functional block, one functional block can be divided into multiple functional blocks, or some functions can be moved to other functional blocks. It's okay. Further, functions of a plurality of functional blocks having similar functions may be processed in parallel or in a time-sharing manner by a single piece of hardware or software.

また、上記の送信方法又は受信方法に含まれるステップが実行される順序は、本発明を具体的に説明するために例示するためのものであり、上記以外の順序であってもよい。また、上記ステップの一部が、他のステップと同時（並列）に実行されてもよい。 Furthermore, the order in which the steps included in the above transmission method or reception method are executed is for illustrative purposes in order to concretely explain the present invention, and an order other than the above may be used. Further, some of the above steps may be executed simultaneously (in parallel) with other steps.

以上、本発明の一つ又は複数の態様に係る送信装置、受信装置、送信方法及び受信方法について、実施の形態に基づいて説明したが、本発明は、この実施の形態に限定されるものではない。本発明の趣旨を逸脱しない限り、当業者が思いつく各種変形を本実施の形態に施したものや、異なる実施の形態における構成要素を組み合わせて構築される形態も、本発明の一つ又は複数の態様の範囲内に含まれてもよい。 The transmitting device, receiving device, transmitting method, and receiving method according to one or more aspects of the present invention have been described above based on the embodiments, but the present invention is not limited to these embodiments. do not have. Unless departing from the spirit of the present invention, various modifications that can be thought of by those skilled in the art to this embodiment, and embodiments constructed by combining components of different embodiments may also be applied to one or more of the present invention. may be included within the scope of the embodiment.

本発明は、ビデオデータ及びオーディオデータなどのメディアトランスポートを行う装置又は機器に適用できる。 INDUSTRIAL APPLICATION This invention is applicable to the apparatus or apparatus which performs media transport, such as video data and audio data.

１５、１００、３００、５００、７００、１０００送信装置
１６、１０１、３０１符号化部
１７、１０２多重化部
１８、１０４送信部
２０、２００、４００、６００、８００、１１００受信装置
２１パケットフィルタリング部
２２送信順序タイプ判別部
２３ランダムアクセス部
２４、２１２制御情報取得部
２５データ取得部
２６算出部
２７初期化情報取得部
２８、２０６復号命令部
２９、２０４Ａ、２０４Ｂ、２０４Ｃ、２０４Ｄ、４０２復号部
３０提示部
２０１チューナー
２０２復調部
２０３逆多重化部
２０５表示部
２１１タイプ判別部
２１３スライス情報取得部
２１４復号データ生成部
３０２付与部
３０３、５０３、７０２、１００１送信部
４０１、６０１、８０１、１１０１受信部
５０１分割部
５０２、６０３構成部
６０２判定部
７０１生成部
８０２第１のバッファ
８０３第２のバッファ
８０４第３のバッファ
８０５第４のバッファ
８０６復号部
１１０２バッファ 15, 100, 300, 500, 700, 1000 Transmitting device 16, 101, 301 Encoding section 17, 102 Multiplexing section 18, 104 Transmitting section 20, 200, 400, 600, 800, 1100 Receiving device 21 Packet filtering section 22 Transmission order type determination section 23 Random access section 24, 212 Control information acquisition section 25 Data acquisition section 26 Calculation section 27 Initialization information acquisition section 28, 206 Decoding instruction section 29, 204A, 204B, 204C, 204D, 402 Decoding section 30 Presentation Section 201 Tuner 202 Demodulation section 203 Demultiplexing section 205 Display section 211 Type discrimination section 213 Slice information acquisition section 214 Decoded data generation section 302 Addition section 303, 503, 702, 1001 Transmission section 401, 601, 801, 1101 Receiving section 501 Division unit 502, 603 configuration unit 602 determination unit 701 generation unit 802 first buffer 803 second buffer 804 third buffer 805 fourth buffer 806 decoding unit 1102 buffer

Claims

A transmission method for transmitting multiple transmission packets, the transmission method comprising:
The transmission packet is composed of a variable length packet header and a variable length payload,
A reception buffer model for receiving the plurality of transmission packets to be transmitted is:
The transmission packet is received, and a first packet consisting of a variable-length packet header and a variable-length payload stored in the received transmission packet is converted into a second packet having a header-expanded fixed-length packet header. and a buffer for outputting the second packet obtained by the conversion at a predetermined extraction rate,
storing the plurality of transmission packets of a predetermined number of packets or less in a predetermined data unit;
transmitting the plurality of transmission packets;
The plurality of transmission packets include a first transmission packet storing a real packet different from an invalid packet, and a second transmission packet storing the invalid packet,
In the storage,
When storing the first transmission packets in the predetermined data unit, the number of the first transmission packets is counted as the number of first packets,
When storing the second transmission packet in the predetermined data unit, 1 is added to the quotient obtained by dividing the packet size of the second transmission packet by the maximum packet size of the first transmission packet. Count the integer value as the second packet number,
storing the plurality of transmission packets in the predetermined data unit such that the sum of the first number of packets and the second number of packets obtained by counting is equal to or less than the predetermined number of packets;
The maximum value of the packet size of the first transmission packet is 1500B. The transmission method.

A transmitting device that transmits a predetermined data unit in which a plurality of transmission packets are stored,
The transmission packet is composed of a variable length packet header and a variable length payload,
A reception buffer model for receiving the plurality of transmission packets to be transmitted is:
The transmission packet is received, and a first packet consisting of a variable-length packet header and a variable-length payload stored in the received transmission packet is converted into a second packet having a header-expanded fixed-length packet header. and a buffer for outputting the second packet obtained by the conversion at a predetermined extraction rate,
comprising a transmitting unit that stores the plurality of transmission packets of a predetermined number or less in a predetermined data unit and transmits the plurality of transmission packets,
The plurality of transmission packets include a first transmission packet storing a real packet different from an invalid packet, and a second transmission packet storing the invalid packet,
The transmitter includes:
When storing the first transmission packets in the predetermined data unit, the number of the first transmission packets is counted as the number of first packets,
When storing the second transmission packet in the predetermined data unit, 1 is added to the quotient obtained by dividing the packet size of the second transmission packet by the maximum packet size of the first transmission packet. Count the integer value as the second packet number,
storing the plurality of transmission packets in the predetermined data unit such that the sum of the first number of packets and the second number of packets obtained by counting is equal to or less than the predetermined number of packets;
The maximum value of the packet size of the first transmission packet is 1500B. The transmitting device.