WO2017208818A1 - 送信装置、送信方法、受信装置および受信方法 - Google Patents
送信装置、送信方法、受信装置および受信方法 Download PDFInfo
- Publication number
- WO2017208818A1 WO2017208818A1 PCT/JP2017/018483 JP2017018483W WO2017208818A1 WO 2017208818 A1 WO2017208818 A1 WO 2017208818A1 JP 2017018483 W JP2017018483 W JP 2017018483W WO 2017208818 A1 WO2017208818 A1 WO 2017208818A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- subtitle
- information
- stream
- streams
- predetermined number
- Prior art date
Links
- 230000005540 biological transmission Effects 0.000 title claims description 63
- 238000000034 method Methods 0.000 title claims description 11
- 238000000605 extraction Methods 0.000 claims description 34
- 238000012545 processing Methods 0.000 claims description 20
- 238000003780 insertion Methods 0.000 claims description 4
- 230000037431 insertion Effects 0.000 claims description 4
- 238000004891 communication Methods 0.000 abstract description 5
- 238000004458 analytical method Methods 0.000 description 24
- 208000032041 Hearing impaired Diseases 0.000 description 14
- 239000000284 extract Substances 0.000 description 14
- 238000005516 engineering process Methods 0.000 description 13
- 238000006243 chemical reaction Methods 0.000 description 10
- 206010011878 Deafness Diseases 0.000 description 6
- 206010048865 Hypoacusis Diseases 0.000 description 6
- 238000010586 diagram Methods 0.000 description 3
- 101000609957 Homo sapiens PTB-containing, cubilin and LRP1-interacting protein Proteins 0.000 description 2
- 101150109471 PID2 gene Proteins 0.000 description 2
- 102100039157 PTB-containing, cubilin and LRP1-interacting protein Human genes 0.000 description 2
- 230000003321 amplification Effects 0.000 description 2
- 239000003550 marker Substances 0.000 description 2
- 238000003199 nucleic acid amplification method Methods 0.000 description 2
- 101100190466 Caenorhabditis elegans pid-3 gene Proteins 0.000 description 1
- 230000002730 additional effect Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 238000005401 electroluminescence Methods 0.000 description 1
- 239000004973 liquid crystal related substance Substances 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04H—BROADCAST COMMUNICATION
- H04H20/00—Arrangements for broadcast or for distribution combined with broadcast
- H04H20/28—Arrangements for simultaneous broadcast of plural pieces of information
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04H—BROADCAST COMMUNICATION
- H04H60/00—Arrangements for broadcast applications with a direct linking to broadcast information or broadcast space-time; Broadcast-related systems
- H04H60/68—Systems specially adapted for using specific information, e.g. geographical or meteorological information
- H04H60/73—Systems specially adapted for using specific information, e.g. geographical or meteorological information using meta-information
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04H—BROADCAST COMMUNICATION
- H04H60/00—Arrangements for broadcast applications with a direct linking to broadcast information or broadcast space-time; Broadcast-related systems
- H04H60/35—Arrangements for identifying or recognising characteristics with a direct linkage to broadcast information or to broadcast space-time, e.g. for identifying broadcast stations or for identifying users
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04H—BROADCAST COMMUNICATION
- H04H60/00—Arrangements for broadcast applications with a direct linking to broadcast information or broadcast space-time; Broadcast-related systems
- H04H60/61—Arrangements for services using the result of monitoring, identification or recognition covered by groups H04H60/29-H04H60/54
- H04H60/65—Arrangements for services using the result of monitoring, identification or recognition covered by groups H04H60/29-H04H60/54 for using the result on users' side
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04H—BROADCAST COMMUNICATION
- H04H60/00—Arrangements for broadcast applications with a direct linking to broadcast information or broadcast space-time; Broadcast-related systems
- H04H60/68—Systems specially adapted for using specific information, e.g. geographical or meteorological information
- H04H60/73—Systems specially adapted for using specific information, e.g. geographical or meteorological information using meta-information
- H04H60/74—Systems specially adapted for using specific information, e.g. geographical or meteorological information using meta-information using programme related information, e.g. title, composer or interpreter
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/20—Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
- H04N21/23—Processing of content or additional data; Elementary server operations; Server middleware
- H04N21/236—Assembling of a multiplex stream, e.g. transport stream, by combining a video stream with other content or additional data, e.g. inserting a URL [Uniform Resource Locator] into a video stream, multiplexing software data into a video stream; Remultiplexing of multiplex streams; Insertion of stuffing bits into the multiplex stream, e.g. to obtain a constant bit-rate; Assembling of a packetised elementary stream
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/435—Processing of additional data, e.g. decrypting of additional data, reconstructing software from modules extracted from the transport stream
Definitions
- the present technology relates to a transmission device, a transmission method, a reception device, and a reception method, and particularly to a transmission device that transmits a plurality of types of sub-tuttle information in parallel.
- TTML Timed Text Text Markup Language
- W3C World Wide Web Consortium
- the purpose of this technology is to simplify the transmission of multiple types of subtitle information.
- a subtitle encoding unit for generating a predetermined number of subtitle streams each having one or more subtitle information
- a transmission apparatus includes a transmission unit that transmits a container having a predetermined format including the predetermined number of subtitle streams.
- a predetermined number of subtitle streams each having one or two or more subtitle information are generated by the subtitle encoding unit.
- each of the predetermined number of subtitle streams may have segmented subtitle information.
- the transmission unit transmits a container having a predetermined format including a predetermined number of subtitle streams.
- the subtitle encoding unit may generate a plurality of subtitle streams each having subtitle information in different languages, and each of the plurality of subtitle streams may have a plurality of subtitle information having different contents. Further, for example, the subtitle encoding unit may generate a plurality of subtitle streams each having subtitle information having different contents, and each of the plurality of subtitle streams may have a plurality of subtitle information having different languages.
- a subtitle stream including a plurality of pieces of subtitle information can be generated and transmitted. Therefore, even if the type of subtitle information increases, an increase in the number of subtitle streams can be suppressed, and therefore transmission of a plurality of types of subtitle information can be simplified.
- an information insertion unit that inserts information on each of a predetermined number of subtitle streams may be further provided in the container.
- the information regarding each of the subtitle streams includes flag information indicating whether or not the corresponding subtitle stream has a plurality of subtitle information, identification information for identifying the corresponding subtitle stream, and each subtitle information included in the corresponding subtitle stream. Identification information to be identified may be included.
- On the receiving side it is possible to control display processing of user interface information for the user to perform a selection operation for displaying a desired subtitle, based on information regarding each of the predetermined number of subtitle streams.
- a receiving unit for receiving a container of a predetermined format including a predetermined number of subtitle streams each having one or more subtitle information;
- a control unit that controls a first extraction process for extracting one subtitle stream from the predetermined number of subtitle streams and a second extraction process for extracting one subtitle information from the extracted subtitle stream is provided.
- the device In the device.
- the receiving unit receives a container of a predetermined format including a predetermined number of subtitle streams each having one or more subtitle information.
- the control unit controls a first extraction process for extracting one subtitle stream from a predetermined number of subtitle streams and a second extraction process for extracting one subtitle information from the extracted one subtitle stream.
- information about each of a predetermined number of subtitle streams is inserted into the container, and the control unit performs the first extraction process and the second extraction process based on information about each of the predetermined number of subtitle streams.
- the display processing of the user interface information may be further controlled. In this case, the user can appropriately and efficiently perform the subtitle information selection operation based on the user interface information.
- one subtitle stream is extracted from a predetermined number of subtitle streams, and one subtitle information is further extracted from the extracted one subtitle stream. Therefore, even when a predetermined number of subtitle streams include a subtitle stream including a plurality of pieces of subtitle information, a desired subtitle display can be performed.
- FIG. 1 shows a configuration example of a transmission / reception system 10 as an embodiment.
- the transmission / reception system 10 includes a broadcast transmission system 100 and a television receiver 200.
- the broadcast transmission system 100 transmits a transport stream of MPEG-2 TS (hereinafter simply referred to as “transport stream TS”) as a container (multiplexed stream) on a broadcast wave.
- transport stream TS MPEG-2 TS
- the transport stream TS includes a predetermined number of subtitle streams together with a video stream having video data and an audio stream having audio data.
- Each of the predetermined number of subtitle streams has one or more subtitle information.
- subtitle information text information of a subtitle (caption), for example, TTML or a TTML derived format can be considered.
- the subtitle information is TTML
- the subtitle stream has segmented subtitle information.
- the broadcast transmission system 100 inserts information on each of a predetermined number of subtitle streams into a transport stream TS as a container.
- This information includes, for example, flag information indicating whether or not the corresponding subtitle stream has a plurality of subtitle information, identification information for identifying the corresponding subtitle stream, and identification information for identifying each subtitle information possessed by the corresponding subtitle stream Etc. are included.
- the receiving side can appropriately perform display processing of user interface information for the user to perform a selection operation for displaying a desired subtitle.
- the television receiver 200 receives the transport stream TS sent from the broadcast transmission system 100.
- the television receiver 200 obtains video data by performing decoding processing on a video stream having video data, and obtains audio data by performing decoding processing on an audio stream having audio data.
- the television receiver 200 extracts one subtitle stream from a predetermined number of subtitle streams in accordance with a user's selection operation, and extracts one subtitle information from the extracted one subtitle stream. Then, the television receiver 200 performs decoding processing on the extracted one subtitle information, obtains subtitle bitmap data, and superimposes it on the video data to obtain video data for display.
- the television receiver 200 uses the user interface information (FIG. 3B) for the convenience of the user's selection operation based on the information about each of the predetermined number of subtitle streams inserted in the transport stream TS. Display).
- the user can easily perform desired subtitle display by performing a selection operation based on the user interface information.
- the transport stream TS includes a subtitle stream 1 (Packet id1) and a subtitle stream 2 (Packet ⁇ ⁇ ⁇ ⁇ id2), each having three subtitle information.
- FIG. 2 shows an example of subtitle information that the subtitle streams 1 and 2 have.
- the subtitle stream 1 has three subtitle information whose language is “English” and whose contents are “general”, “for hearing impaired”, and “non-native”, respectively.
- the subtitle stream 2 has three pieces of subtitle information whose language is “French” and whose contents are “general”, “for the hearing impaired”, and “for non-native”, respectively.
- FIG. 3A shows a flow of subtitle information extraction processing for displaying a desired subtitle from the subtitle streams 1 and 2 in the television receiver 200.
- first extraction process a subtitle stream including subtitle information for performing desired subtitle display is extracted from the subtitle streams 1 and 2.
- second extraction process subtitle information for performing desired subtitle display is extracted from the extracted subtitle stream.
- FIG. 3B shows a display example of user interface information for a user's selection operation.
- “English” or “French” can be selected.
- “Subtitle Type Selection” section the “General subtitle”, “Hard ofearing subtitle” or “Non-native Subtitle” subtitle Selection is possible.
- the “general subtitle” is selected in “English”.
- FIG. 4A shows an example of a time-series change of the subtitle stream extracted by the stream extraction process.
- the subtitle stream whose display timing is T1 has subtitle information of “Normal1”, “Hard hearing1”, and “Non-native1”.
- “Normal1” is general subtitle information, and therefore the segment type (segment type) is set to 1, for example, “xxx yy” is subtitle information. “Hard of hearing1” is subtitle information for the hearing impaired, so the segment type (segment type) is 2, and is, for example, subtitle information displaying “ggggjjjj”. Since “Non-native1” is non-native subtitle information, the segment type (segment type) is 3, and is, for example, subtitle information displaying “Fff hi”.
- FIG. 4B shows an example of subtitle display when subtitle information “Normal1” is extracted from the subtitle stream whose display timing is T1 by the subtitle information extraction process.
- the subtitle stream whose display timing is T2 has subtitle information of “Normal2”, “Hard hearing2”, and “Non-native2”.
- “Normal2” is general subtitle information, and therefore the segment type (segment type) is set to 1, for example, “xxx yyxxxzzzz” is displayed. Since “Hard of hearing2” is subtitle information for hearing impaired persons, the segment type (segment type) is set to 2, for example, “G hg jkj jk”. Since “Non-native2” is non-native subtitle information, the segment type (segment type) is set to 3, for example, “Fff hiFjjj”.
- FIG. 4C shows a subtitle display example when subtitle information “Hard ofhearing2” is extracted from the subtitle stream whose display timing is T2 by the subtitle information extraction process.
- FIG. 5 shows a configuration example of the stream generation unit 110 of the broadcast transmission system 100.
- the stream generation unit 110 includes a control unit 111, a video encoder 112, an audio encoder 113, a text format conversion unit 114, a subtitle encoder 115, and a TS formatter (multiplexer) 116.
- the control unit 111 is configured to include, for example, a CPU (Central Processing Unit), and controls the operation of each unit of the stream generation unit 110.
- the video encoder 112 receives the video data DV, encodes the video data DV, and generates a video stream composed of video PES packets having encoded video data in the payload.
- the audio encoder 113 receives the audio data DA, encodes the audio data DA, and generates an audio stream composed of audio PES packets having encoded audio data.
- the text format conversion unit 114 receives text data (character code) DT and obtains TTML (Timed Text Markup Language) as subtitle information.
- FIG. 6 shows an example of a TTML structure.
- TTML is described on an XML basis.
- the TTML is composed of a head and a body. In the head, there are various elements such as metadata, styling, styling extension, layout, and the like.
- Metadata includes metadata title information, copyright information, and the like.
- the styling includes information such as region position, size, color, font (fontFamily), font size (fontSize), and text alignment (textAlign).
- the layout includes information such as an offset (padding), a background color (backgroundColor), and an alignment (displayAlign) in addition to the identifier (id) of the region in which the subtitle is arranged.
- the body includes information on the subtitle. For each subtitle, a display start timing and a display end timing are described, and text data is described.
- the text format conversion unit 114 obtains a plurality of types of TTML corresponding to the same display timing.
- TTML whose language is “English” and whose content is “general”
- Six types of TTML whose language is “French” and whose content is “non-native” are obtained.
- the subtitle encoder 115 converts the six types of TTML obtained by the text format conversion unit 114 into segments (TTML segments). Then, the subtitle encoder 115 generates the subtitle stream 1 including the subtitle PES packet in which the TTML segments (1) to (3) having the language “English” are arranged in the payload, and the language is “French”. A subtitle stream 2 including a subtitle PES packet in which a certain TTML segment (4) to (6) described above is arranged in the payload is generated.
- the subtitle streams 1 and 2 also include at least a font download segment (Font_download_segment) having download information for downloading a font file specified by the TTML font designation information. That is, the subtitle encoder 115 inserts a font download segment into the payload of the subtitle PES packet that constitutes the subtitle streams 1 and 2, respectively.
- Font_download_segment a font download segment having download information for downloading a font file specified by the TTML font designation information. That is, the subtitle encoder 115 inserts a font download segment into the payload of the subtitle PES packet that constitutes the subtitle streams 1 and 2, respectively.
- FIG. 7A shows a structure example (Syntax) of the subtitle PES packet (PES_packet).
- PES_startcode_prefix a fixed pattern “0x000001” is arranged.
- An 8-bit field of “stream_id” indicates a stream identifier.
- the 16-bit field of “PES_packet_length” indicates the number of subsequent bytes as the length (size) of the PES packet.
- PES_packet_length there is a field of “Optional_PES_header ()”. In this field, time stamps of PTS, DTS, and the like are arranged. After this field, there is a field “PES_packet_data_byte”. This field corresponds to the PES payload. In this field, “PES_data_byte_field ()” for containerizing data is arranged.
- FIG. 7B shows a structural example (Syntax) of “PES_data_byte_field ()”.
- the 8-bit field of “data_identifier” indicates an identifier for identifying the type of data in the container portion. Since the conventional subtitle (in the case of a bitmap) is supposed to be indicated by “0x20”, the text can be identified by a new value, for example, “0x21”.
- the 8-bit field of “subtitle_stream_id” indicates an identifier for identifying the type of the subtitle stream.
- a new value for example, “0x01”, can be distinguished from the conventional subtitle stream “0x00” that transmits a bitmap.
- FIG. 8A shows a structural example (Syntax) of a subtitle segment.
- FIG. 8B shows the content (Semantics) of main information in the structural example.
- the 8-bit field of “sync_byte” is a unique word indicating the start of a segment.
- An 8-bit field of “segment_type” indicates a segment type (segment type).
- FIG. 9 shows an example of the definition of the segment type (segment_type).
- segment_type For example, “0x01” indicates a general subtitle (Normal subtitle), “0x02” indicates a subtitle for the hearing impaired (Hard_of_hearing subtitle), and “0x03” indicates a non-native subtitle (Non-native subtitle).
- “0x11” indicates a subtitle of language 1 (English)
- “0x12” indicates a subtitle of language 2 (French).
- “0x84” indicates font download (Font Download).
- the 8-bit field of “segment_id” indicates segment identification.
- Segment_length is a 16-bit field indicating the number of subsequent bytes as the length (size) of the subtitle segment.
- a 4-bit field of “version_number” indicates information update. If an update is made, the value is incremented by one.
- segment_payload () When the segment type is “0x01”, “0x02”, “0x03”, “0x11”, “0x12”, a TTML document (see FIG. 6) is arranged in the field “segment_payload ()”.
- FIG. 10 shows a structure example (Syntax) of the segment payload (segment_payload ()) when the segment type is “0x84”, and FIG. 11 shows contents (Semantics) of main information in the structure example. .
- a 16-bit field of “original_network_id” indicates identification information of a network through which download data is transmitted.
- a 16-bit field of “transport_stream_id” indicates identification information of an individual transport stream.
- a 16-bit field of “service_id” indicates identification information of a service to be downloaded. In the case of a download target common to distribution media, the font file may be sent by another transport stream instead of its own transport stream. "Original_network_id”, “transport_stream_id”, and “service_id” information can be specified.
- the 8-bit field of “font_file_id” indicates the identification number assigned to the font file.
- a 24-bit field of “ISO — 639_language_code” indicates a code consisting of three characters for identifying a language. For example, “jpn” indicates Japanese and “eng” indicates English.
- the 8-bit field of “font_group_id” indicates the identification information of the font group and corresponds to the generic family of TTML.
- An 8-bit field of “font_name_id” indicates an individual font name.
- the 8-bit field “url_type” indicates the server type. For example, “0x01” indicates a font server (uncompressed URL), “0x02” indicates a general server (uncompressed URL), “0x11” indicates a font server (compressed URL), and “0x12” indicates a general server Indicates a server (compressed URL).
- the 8-bit field of “url_string_length” indicates the length (size) of the character code portion indicating the character string of the subsequent URL in bytes. The character code is placed in the “char” field.
- the TS formatter 116 transport-packets the video stream generated by the video encoder 112, the audio stream generated by the audio encoder 113, and the subtitle streams 1 and 2 generated by the subtitle encoder 115. Multiplexing is performed to obtain a transport stream TS as a container (multiplexed stream).
- the TS formatter 116 inserts information on each of the two subtitle streams 1 and 2 included in the transport stream TS into a PMT (Program Map). Specifically, a newly defined text subtitle descriptor (Text_subtitle_descriptor) having such information is generated and inserted into the subtitle elementary stream loop (Subtitle ES loop) corresponding to each of the subtitle streams 1 and 2.
- PMT Program Map
- FIG. 12 shows a structural example (Syntax) of a text subtitle descriptor.
- FIG. 13 shows the contents (Semantics) of main information in the structural example.
- the 8-bit field of “descriptor_tag” indicates a descriptor type, and here indicates a text subtitle descriptor.
- the 8-bit field of “descriptor_length” indicates the length (size) of the descriptor, and indicates the number of subsequent bytes as the descriptor length.
- the 8-bit field of “packet_type” indicates the packet type (packet type) as shown in FIG.
- FIG. 14 shows an example of the definition of the packet type (packet_type).
- packet_type For example, “0x01” indicates a general subtitle (Normal subtitle), “0x02” indicates a subtitle for the deaf (Hard_of_hearing subtitle), and “0x03” indicates a non-native subtitle (Non-native subtitle).
- “0x11” indicates a subtitle of language 1 (English)
- “0x12” indicates a subtitle of language 2 (French).
- “0x84” indicates font download (Font Download).
- the 1-bit field of “multiplexed_segment_packet_flag” indicates whether or not the subtitle stream includes a plurality of pieces of subtitle information.
- a 7-bit field of “number_of_segments” indicates the number of subtitle information included in the subtitle stream. Then, as many as the number of subtitle information, an 8-bit field of “segment_id”, an 8-bit field of “segment_type”, and a 24-bit field of “ISO_639_language_code” exist repeatedly.
- the field “segment_id” indicates segment identification.
- the “segment_type” field indicates the segment type.
- “ISO_639_language_code” indicates a three-character code for identifying a language.
- At least a font file designated by the TTML font designation information is downloaded to the subtitle elementary stream loop (Subtitle ES loop) corresponding to each of the subtitle streams 1 and 2.
- a font file descriptor (Font_file_descriptor) having the download information is inserted.
- FIG. 15 shows a structural example (Syntax) of a font file descriptor.
- the 8-bit field of “descriptor_tag” indicates a descriptor type, and here indicates a font file descriptor.
- the 8-bit field of “descriptor_length” indicates the length (size) of the descriptor, and indicates the number of subsequent bytes as the descriptor length. Since the other fields are the same as those in the segment payload structure example in the case where the segment type shown in FIG. 10 is “0x84”, detailed description thereof will be omitted.
- the video data DV is supplied to the video encoder 112.
- the video data DV is encoded, and a video stream composed of video PES packets having encoded image data in the payload is generated. This video stream is supplied to the TS formatter 116.
- the audio data DA is supplied to the audio encoder 113.
- the audio encoder 113 encodes the audio data DA and generates an audio stream composed of audio PES packets having encoded audio data. This audio stream is supplied to the TS formatter 116.
- the text data (character code) DT is supplied to the text format conversion unit 114.
- This text format conversion unit 114 obtains TTML as caption information (see FIG. 6).
- six types of TTML are obtained corresponding to the same display timing. That is, (1) TTML whose language is “English” and “general”, (2) TTML whose language is “English” and whose content is “for hearing impaired”, and (3) whose language is “English” and whose content is “English” TTML for “non-native”, (4) TTML with language “French” and content “General”, (5) TTML with language “French” and content “For hearing impaired”, (6) Language Six types of TTML with "French” and "Non-native" content are available.
- TTML 6 types obtained by the text format conversion unit 114 are supplied to the subtitle encoder 115.
- the subtitle encoder 115 six types of TTML are converted into segments (TTML segments) (see FIG. 8A and FIG. 6).
- the subtitle encoder 115 generates the subtitle stream 1 including the subtitle PES packet in which the TTML segments (1) to (3) having the language “English” are arranged in the payload, and the language is “French”.
- the subtitle stream 2 including the subtitle PES packet in which the TTML segments (4) to (6) described above are arranged in the payload is generated.
- the subtitle streams 1 and 2 are supplied to the TS formatter 116.
- the subtitle streams 1 and 2 also include a font download segment (Font_download_segment) having download information for downloading at least a font file designated by the TTML font designation information (see FIG. 8 (a), see FIG.
- the video stream generated by the video encoder 112 the audio stream generated by the audio encoder 113, and the subtitle streams 1 and 2 generated by the subtitle encoder 115 are transport packetized and multiplexed, and the container A transport stream TS as (multiplexed stream) is generated.
- a subtitle elementary stream loop (Subtitle ES loop) corresponding to each of the subtitle streams 1 and 2 under the PMT has a text subtitle descriptor (Subtitle ES loop) having information on the corresponding subtitle stream.
- Text_subtitle_descriptor) is inserted (see FIG. 12), and at least a font file descriptor (Font_file_descriptor) having download information for downloading a font file designated by the font designation information of TTML is inserted (FIG. 12). 15).
- FIG. 16 illustrates a configuration example of the transport stream TS.
- the configuration for the video and audio portions is omitted.
- a subtitle 2 / PES packet that is a PES packet of the subtitle stream 2 identified by PID2 “Subtitle2 PES” exists.
- a font download segment whose segment type is “0x84” is also inserted in this PES payload.
- a font download segment whose segment type is “0x84” is also inserted in this PES payload.
- the transport stream TS includes a PMT (Program Map Table) as PSI (Program Specific Information).
- PSI Program Specific Information
- This PSI is information describing to which program each elementary stream included in the transport stream TS belongs.
- the PMT has a program descriptor (Program Descriptor) that describes information related to the entire program.
- this PMT there is a subtitle 1 / elementary stream loop (Subtitle1 ES loop) having information related to the subtitle stream 1.
- information such as a PID (packet identifier) is arranged corresponding to the subtitle stream 1, and a descriptor that describes information related to the subtitle stream is also arranged.
- a text subtitle descriptor (Text_subtitle_descriptor) and a font file descriptor (Font_file_descriptor) are inserted (see FIGS. 12 and 15).
- the text subtitle descriptor has information regarding the corresponding subtitle stream. In this case, the packet type is “0x11”.
- the font file descriptor has download information for downloading at least a font file designated by the TTML font designation information.
- this PMT has a subtitle 2 elementary stream loop (Subtitle2 ES loop) having information related to the subtitle stream 2.
- information such as a PID (packet identifier) is arranged corresponding to the subtitle stream 2, and a descriptor describing information related to the subtitle stream is also arranged.
- a text subtitle descriptor (Text_subtitle_descriptor) and a font file descriptor (Font_file_descriptor) are inserted (see FIGS. 12 and 15).
- the text subtitle descriptor has information regarding the corresponding subtitle stream. In this case, the packet type is “0x12”.
- the font file descriptor has download information for downloading at least a font file designated by the TTML font designation information.
- FIG. 17 shows a configuration example of the television receiver 200.
- the television receiver 200 includes a receiving unit 201, a TS analyzing unit (demultiplexer) 202, a video decoder 203, a video superimposing unit 204, a panel driving circuit 205, and a display panel 206 as a monitor (display). is doing.
- the television receiver 200 includes an audio decoder 207, an audio output circuit 208, a speaker 209, and a subtitle decoder 210.
- the television receiver 200 includes a CPU 221, a flash ROM 222, a DRAM 223, an internal bus 224, a remote control receiver 225, a remote control transmitter 226, and a communication interface 227.
- the CPU 221 controls the operation of each part of the television receiver 200.
- the flash ROM 222 stores control software and data.
- the DRAM 223 constitutes a work area for the CPU 221.
- the CPU 221 develops software and data read from the flash ROM 222 on the DRAM 223 to activate the software, and controls each unit of the television receiver 200.
- the remote control receiving unit 225 receives the remote control signal (remote control code) transmitted from the remote control transmitter 226 and supplies it to the CPU 221.
- the CPU 221 controls each part of the television receiver 200 based on this remote control code.
- the CPU 221, flash ROM 222, and DRAM 223 are connected to the internal bus 224.
- the communication interface 227 communicates with a server existing on a network such as the Internet under the control of the CPU 221. This communication interface 227 is connected to the internal bus 224.
- the receiving unit 201 receives the transport stream TS transmitted from the broadcast transmission system 100 on a broadcast wave.
- the transport stream TS includes the video stream, the audio stream, and the subtitle streams 1 and 2.
- the TS analysis unit 202 extracts video, audio, and subtitle streams from the transport stream TS.
- the TS analysis unit 202 analyzes various information inserted in the header of each TS packet, and selectively selects a TS packet including data of video, audio, and subtitle PES packets based on “PID”. To obtain video, audio, and subtitle streams.
- the TS analysis unit 202 analyzes various information inserted in the header of each TS packet, extracts various information inserted in the transport stream TS based on “PID”, and sends it to the CPU 221. .
- This information includes a text subtitle descriptor and a font file descriptor (see FIGS. 12 and 15).
- the CPU 221 acquires information related to the corresponding subtitle stream from the text subtitle descriptor. This information includes, for example, flag information indicating whether or not the corresponding subtitle stream has a plurality of subtitle information, identification information for identifying the corresponding subtitle stream, and identification information for identifying each subtitle information possessed by the corresponding subtitle stream Etc. are included. Further, the CPU 221 acquires information for downloading a file of a font specified by at least the font specification information of TTML from the font file descriptor.
- the audio decoder 207 performs a decoding process on the audio stream extracted by the TS analysis unit 202 to obtain audio data.
- the audio output circuit 208 performs necessary processing such as D / A conversion and amplification on the audio data and supplies the audio data to the speaker 209.
- the video decoder 203 performs a decoding process on the video stream extracted by the TS analysis unit 202 to obtain video data.
- the subtitle decoder 210 performs decoding processing on the subtitle stream extracted by the TS analysis unit 202, and obtains TTML from timed text subtitle segments (TimedText subtitle segments).
- only one of the two subtitle streams 1 and 2 included in the transport stream TS is selectively extracted and supplied from the TS analysis unit 202 to the subtitle decoder 210. Further, in the subtitle decoder 210, only one of the three TTML segments included in the subtitle stream supplied from the TS analysis unit 202 is selectively extracted and subjected to decoding processing to obtain TTML.
- the selection of the stream is based on the selection information of the language of the user or the system, and the packet type (Packet_type) information (see FIG. 14) is supplied from the CPU 221 to the TS analysis unit 202. This is done by specifying.
- the user interface information for the user's selection operation shown in FIG. 3B the user selects “English” or “French” at the language selection “Language Selection”. Can be selected.
- This user interface information is displayed on the display panel 206 based on information related to each of a predetermined number of subtitle streams under the control of the CPU 221.
- the packet type when “English” is selected, the packet type is “0x11”, and the TS analysis unit 202 extracts the subtitle stream 1. For example, when “French” is selected, the packet type is “0x12”, and the TS analysis unit 202 extracts the subtitle stream 2.
- the selection of the TTML segment is based on the selection information of the contents of the user or the system, as shown in FIG. This is done by specifying the segment type.
- the user selects “General Subtitle”, “Hearing” at the content selection “Subtitle Type Selection”. It is possible to select “Subtitle for disabled people (Hard of Hearing Subtitle)” or “Non-native Subtitle”.
- the segment type is set to “0x01”, and the subtitle decoder 210 extracts TTML segments including “General” TTML.
- the segment type is “0x02”, and the subtitle decoder 210 extracts a TTML segment including TTML for “deaf person” Is done.
- the segment type is “0x03”, and the subtitle decoder 210 extracts TTML segments including TTML of “non-native”.
- the subtitle decoder 210 sends the TTML obtained by decoding the extracted one TTML segment to the CPU 221.
- the CPU 221 acquires caption display position information and the like from this TTML.
- the subtitle decoder 210 extracts the font download segment (see FIG. 8A and FIG. 10) included in the subtitle stream (PES packet) extracted by the TS analysis unit 202 and sends it to the CPU 221.
- the CPU 221 obtains at least information for downloading a font file designated by the font designation information of TTML from the font download segment.
- the subtitle decoder 210 converts text data (font data) of subtitles (subtitles) at each subtitle display position (region) included in the TTML into bitmap data (binary image information) under the control of the CPU 221. .
- the subtitle decoder 210 uses a font file designated by the font designation information of the TTML when obtaining the caption bitmap data.
- the CPU 221 appropriately selects the font file based on the download information inserted in the PES packet, the PMT, etc. as described above.
- a broadcast signal transport stream TS
- downloaded from a server on the network is used. If the file cannot be downloaded, a substitute font file (for example, a default font file) is used.
- the video superimposing unit 204 superimposes the subtitle bitmap data of each subtitle display position obtained by the subtitle decoder 210 on the video data obtained by the video decoder 203, and displays the display video data. obtain.
- the CPU 221 controls so that the superimposed position of the caption bitmap data becomes the caption display position determined by the caption display position information.
- the panel drive circuit 205 drives the display panel 206 based on the display video data obtained by the video superimposing unit 204.
- the display panel 206 includes, for example, an LCD (Liquid Crystal Display), an organic EL display (organic electroluminescence display), and the like.
- the receiving unit 201 receives the transport stream TS transmitted from the broadcast transmission system 100 on the broadcast wave.
- This transport stream TS includes a video stream, an audio stream, and subtitle streams 1 and 2.
- the transport stream TS is supplied to the TS analysis unit 202.
- the TS analysis unit 202 extracts video, audio, and subtitle streams from the transport stream TS.
- various information inserted in the transport stream TS is extracted and sent to the CPU 221.
- This information includes a text subtitle descriptor and a font file descriptor (see FIGS. 12 and 15).
- the CPU 221 acquires information on the corresponding subtitle stream from the text subtitle descriptor.
- the CPU 221 acquires information for downloading a file of a font specified by at least TTML font specification information from the font file descriptor.
- the video stream extracted by the TS analysis unit 202 is supplied to the video decoder 203.
- the video PES stream is decoded to obtain video data.
- the subtitle stream extracted by the TS analysis unit 202 is supplied to the subtitle decoder 210.
- the subtitle stream is decoded, and TTML is obtained from the timed text subtitled segments (TimedTextTimesubtitle segments).
- only one of the two subtitle streams 1 and 2 included in the transport stream TS is selectively extracted and supplied from the TS analysis unit 202 to the subtitle decoder 210. Further, in the subtitle decoder 210, only one of the three TTML segments included in the subtitle stream supplied from the TS analysis unit 202 is selectively extracted and subjected to decoding processing to obtain TTML.
- the selection of the stream in the TS analysis unit 202 is performed under the control of the CPU 221 based on the selection information of the language of the user or the system.
- the selection of the TTML segment in the subtitle decoder 210 is performed under the control of the CPU 221 based on the selection information of the user or system language.
- the user can display a desired subtitle by selecting a language and content.
- the subtitle decoder 210 extracts a font download segment from the subtitle stream obtained by the TS analysis unit 202 and sends it to the CPU 221.
- the CPU 221 acquires information for downloading a file of the font specified by at least the font designation information of TTML from the font download segment.
- TTML obtained by the subtitle decoder 210 is sent to the CPU 221.
- subtitle display position information and the like are acquired from the TTML.
- the subtitle decoder 210 extracts the font download segment (see FIG. 8A and FIG. 10) included in the subtitle stream (PES packet) extracted by the TS analysis unit 202 and sends it to the CPU 221.
- the CPU 221 obtains at least information for downloading a font file designated by the font designation information of TTML from the font download segment.
- subtitle decoder 210 Under the control of the CPU 221, text data (font data) of subtitles (subtitles) in each subtitle display position (region) included in the TTML is converted into bitmap data (binary image information).
- the subtitle decoder 210 uses a font file designated by font designation information of the TTML when subtitle bitmap data is obtained under the control of the CPU 221.
- the CPU 221 appropriately selects the font file based on the download information inserted in the PES packet, the PMT, etc. as described above.
- a broadcast signal transport stream TS
- downloaded from a server on the network is used. If the file cannot be downloaded, a substitute font file (for example, a default font file) is used.
- Bitmap data of subtitles at each subtitle display position output from the subtitle decoder 210 is supplied to the video superimposing unit 204.
- the video superimposing unit 204 superimposes subtitle bitmap data at each subtitle display position obtained by the subtitle decoder 210 on the video data obtained by the video decoder 203 to obtain video data for display.
- the CPU 221 controls the superimposed position of the caption bitmap data to be the caption display position based on the caption display position determined by the caption display position information.
- the display video data obtained by the video superimposing unit 204 is supplied to the panel drive circuit 205.
- the panel drive circuit 205 drives the display panel 206 based on the display video data. Thereby, the display panel 206 displays an image in which a caption (subtitle) is superimposed on each caption display position (region).
- the audio stream extracted by the TS analysis unit 202 is supplied to the audio decoder 207.
- the audio stream is decoded to obtain audio data.
- This audio data is supplied to the audio output circuit 208.
- the audio output circuit 208 performs necessary processing such as D / A conversion and amplification on the audio data.
- the processed audio data is supplied to the speaker 209. Thereby, an audio output corresponding to the display image on the display panel 206 is obtained from the speaker 209.
- the broadcast transmission system 100 generates and transmits a subtitle stream including a plurality of subtitle information (TTML segments). Therefore, even if the type of subtitle information increases, an increase in the number of subtitle streams can be suppressed, and therefore transmission of a plurality of types of subtitle information can be simplified.
- TTML segments subtitle information
- the broadcast transmission system 100 inserts information related to each of a predetermined number of subtitle streams into a subtitle stream TS as a container and transmits the subtitle stream TS. Therefore, on the receiving side, it is possible to control display processing of user interface information for the user to perform a selection operation for displaying a desired subtitle, based on information regarding each of the predetermined number of subtitle streams.
- the television receiver 200 extracts one subtitle stream from a predetermined number of subtitle streams, and further, one subtitle information (TTML segment) from the extracted one subtitle stream. Is extracted. Therefore, even when a predetermined number of subtitle streams include a subtitle stream including a plurality of pieces of subtitle information, a desired subtitle display can be performed.
- TTML segment subtitle information
- the transport stream TS generated by the broadcast transmission system 100 has the language “English” and the contents “general”, “for hearing impaired”, and “non-native”, respectively.
- An example is shown in which subtitle stream 2 (Packet id2) having subtitle information (TTML segment) is included.
- the transport stream TS generated by the broadcast transmission system 100 includes the subtitle stream 1 (Packet id1) having the subtitle information (TTML segment) whose content is “general” and the content is “for the hearing impaired”.
- subtitle stream 2 Packet (id2) having subtitle information (TTML segment)
- subtitle stream 3 Packet id3 having subtitle information (TTML segment) whose content is “non-native” is also conceivable.
- FIG. 19 shows an example of subtitle information that the subtitle streams 1, 2, and 3 have.
- the subtitle stream 1 has two pieces of subtitle information whose contents are “general” and whose languages are “English” and “French”, respectively.
- the subtitle stream 2 has two pieces of subtitle information whose contents are “for the hearing impaired” and whose languages are “English” and “French”, respectively.
- the subtitle stream 3 has two pieces of subtitle information whose contents are “non-native” and whose languages are “English” and “French”, respectively.
- FIG. 20A shows a case where the subtitle streams 1, 2, and 3 are included in the transport stream TS as described above, and a desired subtitle display is performed from the subtitle streams 1, 2, and 3 in the television receiver 200. The flow of the extraction process of subtitle information for this is shown.
- a subtitle stream including subtitle information for performing desired subtitle display is extracted from the subtitle streams 1, 2, and 3.
- subtitle information extraction process second extraction process
- subtitle information for performing desired subtitle display is extracted from the extracted subtitle stream.
- FIG. 20B shows a display example of user interface information for the user's selection operation.
- “English” or “French” can be selected.
- “Subtitle Type Selection” section the “General subtitle”, “Hard ofearing subtitle” or “Non-native Subtitle” subtitle Selection is possible.
- “French” indicates that “Subtitle for the hearing impaired” is selected.
- FIG. 21 shows a configuration example of the transport stream TS including the subtitle streams 1, 2, and 3 as described above.
- the configuration for the video and audio portions is omitted.
- a subtitle 1 and PES packet “Subtitle1 PES” that is a PES packet of the subtitle stream 1 identified by PID1
- a subtitle 2 and PES packet “Subtitle2 PES” that is a PES packet of the subtitle stream 2 identified by PID2 are used.
- there is a subtitle 3 / PES packet “Subtitle3 PES” which is a PES packet of the subtitle stream 3 identified by PID3.
- TTML segments having general subtitle information are inserted in the PES payload. That is, in this PES payload, an English (English) subtitle TTML segment with a segment type of “0x11” and a French (French) subtitle TTML segment with a segment type of “0x12” are inserted. In addition, a font download segment whose segment type is “0x84” is also inserted in this PES payload.
- TTML segments having subtitle information whose contents are intended for the hearing impaired are inserted in the PES payload. That is, in this PES payload, an English (English) subtitle TTML segment with a segment type of “0x11” and a French (French) subtitle TTML segment with a segment type of “0x12” are inserted. In addition, a font download segment whose segment type is “0x84” is also inserted in this PES payload.
- TTML segments having subtitle information whose contents are non-native are inserted in the PES payload. That is, in this PES payload, an English (English) subtitle TTML segment with a segment type of “0x11” and a French (French) subtitle TTML segment with a segment type of “0x12” are inserted. In addition, a font download segment whose segment type is “0x84” is also inserted in this PES payload.
- the transport stream TS includes a PMT (Program Map Table) as PSI (Program Specific Information).
- PSI Program Specific Information
- This PSI is information describing to which program each elementary stream included in the transport stream TS belongs.
- the PMT has a program descriptor (Program Descriptor) that describes information related to the entire program.
- this PMT there is a subtitle 1 / elementary stream loop (Subtitle1 ES loop) having information related to the subtitle stream 1.
- information such as a PID (packet identifier) is arranged corresponding to the subtitle stream 1, and a descriptor that describes information related to the subtitle stream is also arranged.
- a text subtitle descriptor (Text_subtitle_descriptor) and a font file descriptor (Font_file_descriptor) are inserted (see FIGS. 12 and 15).
- the text subtitle descriptor has information regarding the corresponding subtitle stream. In this case, the packet type is “0x01”.
- the font file descriptor has download information for downloading at least a font file designated by the TTML font designation information.
- this PMT has a subtitle 2 elementary stream loop (Subtitle2 ES loop) having information related to the subtitle stream 2.
- information such as a PID (packet identifier) is arranged corresponding to the subtitle stream 2, and a descriptor describing information related to the subtitle stream is also arranged.
- a text subtitle descriptor (Text_subtitle_descriptor) and a font file descriptor (Font_file_descriptor) are inserted (see FIGS. 12 and 15).
- the text subtitle descriptor has information regarding the corresponding subtitle stream. In this case, the packet type is “0x02”.
- the font file descriptor has download information for downloading at least a font file designated by the TTML font designation information.
- this PMT there is a subtitle 3 elementary stream loop (Subtitle2 ES loop) having information related to the subtitle stream 3.
- information such as a PID (packet identifier) is arranged corresponding to the subtitle stream 3, and a descriptor describing information related to the subtitle stream is also arranged.
- a text subtitle descriptor (Text_subtitle_descriptor) and a font file descriptor (Font_file_descriptor) are inserted (see FIGS. 12 and 15).
- the text subtitle descriptor has information regarding the corresponding subtitle stream. In this case, the packet type is “0x03”.
- the font file descriptor has download information for downloading at least a font file designated by the TTML font designation information.
- the container is a transport stream (MPEG-2 TS)
- MPEG-2 TS transport stream
- the present technology is not limited to the MPEG-2 TS container, and can be similarly realized even with other format containers such as MMT or ISOBMFF.
- the transmission / reception system 10 including the broadcast transmission system 100 and the television receiver 200 is shown, but the configuration of the transmission / reception system to which the present technology can be applied is not limited thereto.
- a configuration of a set top box and a monitor in which the television receiver 200 is connected by a digital interface such as HDMI (High-Definition Multimedia Interface) may be used.
- HDMI High-Definition Multimedia Interface
- HDMI High-Definition Multimedia Interface
- this technique can also take the following structures.
- a subtitle encoding unit that generates a predetermined number of subtitle streams each having one or more subtitle information
- a transmission apparatus comprising: a transmission unit that transmits a container of a predetermined format including the predetermined number of subtitle streams.
- the transmission device according to (1) wherein each of the predetermined number of subtitle streams has segmented subtitle information.
- the subtitle encoding unit generates a plurality of subtitle streams each having subtitle information in a different language, The transmission device according to (1) or (2), wherein each of the plurality of subtitle streams has a plurality of pieces of subtitle information having different contents.
- the subtitle encoding unit generates a plurality of subtitle streams each having subtitle information having different contents,
- the transmission device according to (1) or (2) wherein each of the plurality of subtitle streams has a plurality of pieces of subtitle information having different languages.
- the transmission device according to any one of (1) to (4) further including an information insertion unit that inserts information regarding each of the predetermined number of subtitle streams into the container.
- the transmission device according to any one of (5) to (7), wherein the information regarding each of the subtitle streams includes identification information for identifying each subtitle information included in the corresponding subtitle stream.
- a subtitle encoding step for generating a predetermined number of subtitle streams each having one or more subtitle information;
- a transmission method comprising a transmission step of transmitting a container of a predetermined format including the predetermined number of subtitle streams by a transmission unit.
- a receiving unit that receives a container of a predetermined format including a predetermined number of subtitle streams each having one or more subtitle information;
- a control unit that controls a first extraction process for extracting one subtitle stream from the predetermined number of subtitle streams and a second extraction process for extracting one subtitle information from the extracted subtitle stream is provided. apparatus.
- Information related to each of the predetermined number of subtitle streams is inserted into the container, The control unit The receiving device according to (10), further controlling display processing of user interface information for the first extraction processing and the second extraction processing based on information regarding each of the predetermined number of subtitle streams.
- the main feature of the present technology is that a subtitle stream including a plurality of subtitle information is generated and transmitted, so that an increase in the number of subtitle streams can be suppressed even when the type of subtitle information increases. This is to simplify the transmission of information (see FIGS. 2 and 16).
Landscapes
- Engineering & Computer Science (AREA)
- Signal Processing (AREA)
- Multimedia (AREA)
- Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
- Television Systems (AREA)
Abstract
Description
それぞれ1つまたは2つ以上のサブタイトル情報を持つ所定数のサブタイトルストリームを生成するサブタイトルエンコード部と、
上記所定数のサブタイトルストリームを含む所定フォーマットのコンテナを送信する送信部を備える
送信装置にある。
それぞれ一つまたは2つ以上のサブタイトル情報を持つ所定数のサブタイトルストリームを含む所定フォーマットのコンテナを受信する受信部と、
上記所定数のサブタイトルストリームから1つのサブタイトルストリームを抽出する第1の抽出処理と、該抽出された1つのサブタイトルストリームから1つのサブタイトル情報を抽出する第2の抽出処理を制御する制御部を備える
受信装置にある。
1.実施の形態
2.変形例
[送受信システムの構成例]
図1は、実施の形態としての送受信システム10の構成例を示している。この送受信システム10は、放送送出システム100とテレビ受信機200により構成されている。放送送出システム100は、コンテナ(多重化ストリーム)としてのMPEG-2 TSのトランスポートストリーム(以下、単に、「トランスポートストリームTS」という)を、放送波に載せて送信する。
図5は、放送送出システム100のストリーム生成部110の構成例を示している。このストリーム生成部110は、制御部111と、ビデオエンコーダ112と、オーディオエンコーダ113と、テキストフォーマット変換部114と、サブタイトルエンコーダ115と、TSフォーマッタ(マルチプレクサ)116を有している。
図16は、トランスポートストリームTSの構成例を示している。この構成例では、ビデオ、オーディオの部分についての構成は省略している。この構成例では、PID1で識別されるサブタイトルストリーム1のPESパケットであるサブタイトル1・PESパケット「Subtitle1 PES」が存在すると共に、PID2で識別されるサブタイトルストリーム2のPESパケットであるサブタイトル2・PESパケット「Subtitle2 PES」が存在する。
図17は、テレビ受信機200の構成例を示している。このテレビ受信機200は、受信部201と、TS解析部(デマルチプレクサ)202と、ビデオデコーダ203と、ビデオ重畳部204と、パネル駆動回路205と、モニタ(ディスプレイ)としての表示パネル206を有している。また、このテレビ受信機200は、オーディオデコーダ207と、オーディオ出力回路208と、スピーカ209と、サブタイトルデコーダ210を有している。また、このテレビ受信機200は、CPU221と、フラッシュROM222と、DRAM223と、内部バス224と、リモコン受信部225と、リモコン送信機226と、通信インタフェース227を有している。
なお、上述実施の形態においては、放送送出システム100で生成されるトランスポートストリームTSに、言語が「英語」で、内容がそれぞれ「一般」、「聴覚障害者向け」、「非ネイティブ向け」である3つのサブタイトル情報(TTMLセグメント)を持つサブタイトルストリーム1(Packet id1)と、言語が「フランス語」で、内容がそれぞれ「一般」、「聴覚障害者向け」、「非ネイティブ向け」である3つのサブタイトル情報(TTMLセグメント)を持つサブタイトルストリーム2(Packet id2)が含まれる例を示した。
(1)それぞれ1つまたは2つ以上のサブタイトル情報を持つ所定数のサブタイトルストリームを生成するサブタイトルエンコード部と、
上記所定数のサブタイトルストリームを含む所定フォーマットのコンテナを送信する送信部を備える
送信装置。
(2)上記所定数のサブタイトルストリームは、それぞれ、セグメント化されたサブタイトル情報を持つ
前記(1)に記載の送信装置。
(3)上記サブタイトルエンコード部は、それぞれ言語の異なるサブタイトル情報を持つ複数のサブタイトルストリームを生成し、
上記複数のサブタイトルストリームは、それぞれ、内容の異なる複数のサブタイトル情報を持つ
前記(1)または(2)に記載の送信装置。
(4)上記サブタイトルエンコード部は、それぞれ内容の異なるサブタイトル情報を持つ複数のサブタイトルストリームを生成し、
上記複数のサブタイトルストリームは、それぞれ、言語の異なる複数のサブタイトル情報を持つ
前記(1)または(2)に記載の送信装置。
(5)上記コンテナに、上記所定数のサブタイトルストリームのそれぞれに関する情報を挿入する情報挿入部をさらに備える
前記(1)から(4)のいずれかに記載の送信装置。
(6)上記サブタイトルストリームのそれぞれに関する情報には、対応するサブタイトルストリームが複数のサブタイトル情報を持つか否かを示すフラグ情報が含まれる
前記(5)に記載の送信装置。
(7)上記サブタイトルストリームのそれぞれに関する情報には、対応するサブタイトルストリームを識別する識別情報が含まれる
前記(5)または(6)に記載の送信装置。
(8)上記サブタイトルストリームのそれぞれに関する情報には、対応するサブタイトルストリームが持つ各サブタイトル情報を識別する識別情報が含まれる
前記(5)から(7)のいずれかに記載の送信装置。
(9)それぞれ1つまたは2つ以上のサブタイトル情報を持つ所定数のサブタイトルストリームを生成するサブタイトルエンコードステップと、
送信部により、上記所定数のサブタイトルストリームを含む所定フォーマットのコンテナを送信する送信ステップを有する
送信方法。
(10)それぞれ1つまたは2つ以上のサブタイトル情報を持つ所定数のサブタイトルストリームを含む所定フォーマットのコンテナを受信する受信部と、
上記所定数のサブタイトルストリームから1つのサブタイトルストリームを抽出する第1の抽出処理と、該抽出された1つのサブタイトルストリームから1つのサブタイトル情報を抽出する第2の抽出処理を制御する制御部を備える
受信装置。
(11)上記コンテナに、上記所定数のサブタイトルストリームのそれぞれに関する情報が挿入されており、
上記制御部は、
上記所定数のサブタイトルストリームのそれぞれに関する情報に基づいて、上記第1の抽出処理および上記第2の抽出処理のためのユーザインタフェース情報の表示処理をさらに制御する
前記(10)に記載の受信装置。
(12)受信部により、それぞれ1つまたは2つ以上のサブタイトル情報を持つ所定数のサブタイトルストリームを含む所定フォーマットのコンテナを受信する受信ステップと、
上記所定数のサブタイトルストリームから1つのサブタイトルストリームを抽出する第1の抽出処理と、該抽出された1つのサブタイトルストリームから1つのサブタイトル情報を抽出する第2の抽出処理を制御する制御ステップを有する
受信方法。
100・・・放送送出システム
110・・・ストリーム生成部
111・・・制御部
112・・・ビデオエンコーダ
113・・・オーディオエンコーダ
114・・・テキストフォーマット変換部
115・・・サブタイトルエンコーダ
116・・・TSフォーマッタ
200・・・テレビ受信機
201・・・受信部
202・・・TS解析部
203・・・ビデオデコーダ
204・・・ビデオ重畳部
205・・・パネル駆動回路
206・・・表示パネル
207・・・オーディオデコーダ
208・・・オーディオ出力回路
209・・・スピーカ
210・・・サブタイトルデコーダ
221・・・CPU
227・・・通信インタフェース
Claims (12)
- それぞれ1つまたは2つ以上のサブタイトル情報を持つ所定数のサブタイトルストリームを生成するサブタイトルエンコード部と、
上記所定数のサブタイトルストリームを含む所定フォーマットのコンテナを送信する送信部を備える
送信装置。 - 上記所定数のサブタイトルストリームは、それぞれ、セグメント化されたサブタイトル情報を持つ
請求項1に記載の送信装置。 - 上記サブタイトルエンコード部は、それぞれ言語の異なるサブタイトル情報を持つ複数のサブタイトルストリームを生成し、
上記複数のサブタイトルストリームは、それぞれ、内容の異なる複数のサブタイトル情報を持つ
請求項1に記載の送信装置。 - 上記サブタイトルエンコード部は、それぞれ内容の異なるサブタイトル情報を持つ複数のサブタイトルストリームを生成し、
上記複数のサブタイトルストリームは、それぞれ、言語の異なる複数のサブタイトル情報を持つ
請求項1に記載の送信装置。 - 上記コンテナに、上記所定数のサブタイトルストリームのそれぞれに関する情報を挿入する情報挿入部をさらに備える
請求項1に記載の送信装置。 - 上記サブタイトルストリームのそれぞれに関する情報には、対応するサブタイトルストリームが複数のサブタイトル情報を持つか否かを示すフラグ情報が含まれる
請求項5に記載の送信装置。 - 上記サブタイトルストリームのそれぞれに関する情報には、対応するサブタイトルストリームを識別する識別情報が含まれる
請求項5に記載の送信装置。 - 上記サブタイトルストリームのそれぞれに関する情報には、対応するサブタイトルストリームが持つ各サブタイトル情報を識別する識別情報が含まれる
請求項5に記載の送信装置。 - それぞれ1つまたは2つ以上のサブタイトル情報を持つ所定数のサブタイトルストリームを生成するサブタイトルエンコードステップと、
送信部により、上記所定数のサブタイトルストリームを含む所定フォーマットのコンテナを送信する送信ステップを有する
送信方法。 - それぞれ1つまたは2つ以上のサブタイトル情報を持つ所定数のサブタイトルストリームを含む所定フォーマットのコンテナを受信する受信部と、
上記所定数のサブタイトルストリームから1つのサブタイトルストリームを抽出する第1の抽出処理と、該抽出された1つのサブタイトルストリームから1つのサブタイトル情報を抽出する第2の抽出処理を制御する制御部を備える
受信装置。 - 上記コンテナに、上記所定数のサブタイトルストリームのそれぞれに関する情報が挿入されており、
上記制御部は、
上記所定数のサブタイトルストリームのそれぞれに関する情報に基づいて、上記第1の抽出処理および上記第2の抽出処理のためのユーザインタフェース情報の表示処理をさらに制御する
請求項10に記載の受信装置。 - 受信部により、それぞれ1つまたは2つ以上のサブタイトル情報を持つ所定数のサブタイトルストリームを含む所定フォーマットのコンテナを受信する受信ステップと、
上記所定数のサブタイトルストリームから1つのサブタイトルストリームを抽出する第1の抽出処理と、該抽出された1つのサブタイトルストリームから1つのサブタイトル情報を抽出する第2の抽出処理を制御する制御ステップを有する
受信方法。
Priority Applications (5)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201780031780.8A CN109155866A (zh) | 2016-05-31 | 2017-05-17 | 发送装置、发送方法、接收装置和接收方法 |
AU2017274829A AU2017274829A1 (en) | 2016-05-31 | 2017-05-17 | Transmission device, transmission method, reception device, and reception method |
EP17806376.4A EP3468204A4 (en) | 2016-05-31 | 2017-05-17 | TRANSMITTING DEVICE, TRANSMITTING METHOD, RECEIVING DEVICE, AND RECEIVING METHOD |
JP2018520780A JP7020406B2 (ja) | 2016-05-31 | 2017-05-17 | 送信装置、送信方法、受信装置および受信方法 |
US16/094,539 US20190123842A1 (en) | 2016-05-31 | 2017-05-17 | Transmission device, transmission method, reception device, and reception method |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2016-109483 | 2016-05-31 | ||
JP2016109483 | 2016-05-31 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2017208818A1 true WO2017208818A1 (ja) | 2017-12-07 |
Family
ID=60478510
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/JP2017/018483 WO2017208818A1 (ja) | 2016-05-31 | 2017-05-17 | 送信装置、送信方法、受信装置および受信方法 |
Country Status (6)
Country | Link |
---|---|
US (1) | US20190123842A1 (ja) |
EP (1) | EP3468204A4 (ja) |
JP (1) | JP7020406B2 (ja) |
CN (1) | CN109155866A (ja) |
AU (1) | AU2017274829A1 (ja) |
WO (1) | WO2017208818A1 (ja) |
Cited By (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2019134296A (ja) * | 2018-01-31 | 2019-08-08 | 東芝映像ソリューション株式会社 | 受信機 |
JP2019134294A (ja) * | 2018-01-31 | 2019-08-08 | 東芝映像ソリューション株式会社 | 受信機 |
JP2019134292A (ja) * | 2018-01-31 | 2019-08-08 | 東芝映像ソリューション株式会社 | 受信機 |
JP2019134293A (ja) * | 2018-01-31 | 2019-08-08 | 東芝映像ソリューション株式会社 | 受信機 |
JP2019134290A (ja) * | 2018-01-31 | 2019-08-08 | 東芝映像ソリューション株式会社 | 受信機 |
JP2019134297A (ja) * | 2018-01-31 | 2019-08-08 | 東芝映像ソリューション株式会社 | 受信機 |
JP2019134295A (ja) * | 2018-01-31 | 2019-08-08 | 東芝映像ソリューション株式会社 | 受信機 |
JP2019134291A (ja) * | 2018-01-31 | 2019-08-08 | 東芝映像ソリューション株式会社 | 受信機 |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US12003825B1 (en) * | 2022-09-21 | 2024-06-04 | Amazon Technologies, Inc. | Enhanced control of video subtitles |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2012169885A (ja) | 2011-02-15 | 2012-09-06 | Sony Corp | 表示制御方法、記録媒体、表示制御装置 |
JP2013534097A (ja) * | 2010-06-18 | 2013-08-29 | サムスン エレクトロニクス カンパニー リミテッド | 字幕サービスを含むデジタル放送サービスを提供する方法及びその装置 |
WO2015093856A1 (en) * | 2013-12-19 | 2015-06-25 | Lg Electronics Inc. | Broadcast transmitting device and operating method thereof, and broadcast receiving device and operating method thereof |
Family Cites Families (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5294982A (en) * | 1991-12-24 | 1994-03-15 | National Captioning Institute, Inc. | Method and apparatus for providing dual language captioning of a television program |
KR101061115B1 (ko) * | 2004-08-13 | 2011-08-31 | 엘지전자 주식회사 | 디지털 방송 수신기 및 그의 서브타이틀 데이터 처리 방법 |
WO2012169813A2 (ko) * | 2011-06-09 | 2012-12-13 | 엘지전자 주식회사 | 방송 서비스 전송 방법, 그 수신 방법 및 그 수신 장치 |
JP2013066075A (ja) * | 2011-09-01 | 2013-04-11 | Sony Corp | 送信装置、送信方法および受信装置 |
FR3025925B1 (fr) * | 2014-09-17 | 2016-12-23 | France Brevets | Procede de controle de modes de presentation de sous-titres |
-
2017
- 2017-05-17 EP EP17806376.4A patent/EP3468204A4/en not_active Withdrawn
- 2017-05-17 AU AU2017274829A patent/AU2017274829A1/en not_active Abandoned
- 2017-05-17 JP JP2018520780A patent/JP7020406B2/ja active Active
- 2017-05-17 WO PCT/JP2017/018483 patent/WO2017208818A1/ja unknown
- 2017-05-17 US US16/094,539 patent/US20190123842A1/en not_active Abandoned
- 2017-05-17 CN CN201780031780.8A patent/CN109155866A/zh active Pending
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2013534097A (ja) * | 2010-06-18 | 2013-08-29 | サムスン エレクトロニクス カンパニー リミテッド | 字幕サービスを含むデジタル放送サービスを提供する方法及びその装置 |
JP2012169885A (ja) | 2011-02-15 | 2012-09-06 | Sony Corp | 表示制御方法、記録媒体、表示制御装置 |
WO2015093856A1 (en) * | 2013-12-19 | 2015-06-25 | Lg Electronics Inc. | Broadcast transmitting device and operating method thereof, and broadcast receiving device and operating method thereof |
Non-Patent Citations (1)
Title |
---|
See also references of EP3468204A4 |
Cited By (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2019134296A (ja) * | 2018-01-31 | 2019-08-08 | 東芝映像ソリューション株式会社 | 受信機 |
JP2019134294A (ja) * | 2018-01-31 | 2019-08-08 | 東芝映像ソリューション株式会社 | 受信機 |
JP2019134292A (ja) * | 2018-01-31 | 2019-08-08 | 東芝映像ソリューション株式会社 | 受信機 |
JP2019134293A (ja) * | 2018-01-31 | 2019-08-08 | 東芝映像ソリューション株式会社 | 受信機 |
JP2019134290A (ja) * | 2018-01-31 | 2019-08-08 | 東芝映像ソリューション株式会社 | 受信機 |
JP2019134297A (ja) * | 2018-01-31 | 2019-08-08 | 東芝映像ソリューション株式会社 | 受信機 |
JP2019134295A (ja) * | 2018-01-31 | 2019-08-08 | 東芝映像ソリューション株式会社 | 受信機 |
JP2019134291A (ja) * | 2018-01-31 | 2019-08-08 | 東芝映像ソリューション株式会社 | 受信機 |
Also Published As
Publication number | Publication date |
---|---|
JPWO2017208818A1 (ja) | 2019-03-28 |
US20190123842A1 (en) | 2019-04-25 |
CN109155866A (zh) | 2019-01-04 |
EP3468204A1 (en) | 2019-04-10 |
EP3468204A4 (en) | 2019-05-08 |
JP7020406B2 (ja) | 2022-02-16 |
AU2017274829A1 (en) | 2018-12-20 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP7020406B2 (ja) | 送信装置、送信方法、受信装置および受信方法 | |
US10979664B2 (en) | Transmission device, transmission method, reception device and reception method | |
JP7176598B2 (ja) | 送信方法 | |
EP3236659B1 (en) | Transmission device, transmission method, reception device, and reception method | |
US11765330B2 (en) | Transmitter, transmission method, receiver, and reception method | |
US11290785B2 (en) | Transmission apparatus, transmission method, reception apparatus, and reception method for transmitting subtitle text information | |
CN109479154B (zh) | 发送装置、发送方法、接收装置和接收方法 | |
JP6868776B2 (ja) | 送信装置、送信方法、受信装置および受信方法 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
ENP | Entry into the national phase |
Ref document number: 2018520780 Country of ref document: JP Kind code of ref document: A |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 17806376 Country of ref document: EP Kind code of ref document: A1 |
|
ENP | Entry into the national phase |
Ref document number: 2017274829 Country of ref document: AU Date of ref document: 20170517 Kind code of ref document: A |
|
ENP | Entry into the national phase |
Ref document number: 2017806376 Country of ref document: EP Effective date: 20190102 |