US20190286684A1 - Reception device, information processing method in reception device, transmission device, information processing device, and information processing method - Google Patents

Reception device, information processing method in reception device, transmission device, information processing device, and information processing method Download PDF

Info

Publication number
US20190286684A1
US20190286684A1 US16/281,957 US201916281957A US2019286684A1 US 20190286684 A1 US20190286684 A1 US 20190286684A1 US 201916281957 A US201916281957 A US 201916281957A US 2019286684 A1 US2019286684 A1 US 2019286684A1
Authority
US
United States
Prior art keywords
html document
transport media
dom tree
mmt
html
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
US16/281,957
Inventor
Yoshiharu Dewa
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Saturn Licensing LLC
Original Assignee
Sony Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Sony Corp filed Critical Sony Corp
Priority to US16/281,957 priority Critical patent/US20190286684A1/en
Publication of US20190286684A1 publication Critical patent/US20190286684A1/en
Assigned to SATURN LICENSING LLC reassignment SATURN LICENSING LLC ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: SONY CORPORATION
Abandoned legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/958Organisation or management of web site content, e.g. publishing, maintaining pages or automatic linking
    • G06F16/986Document structures and storage, e.g. HTML extensions
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L65/00Network arrangements, protocols or services for supporting real-time applications in data packet communication
    • H04L65/60Network streaming of media packets
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/80Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N21/85Assembly of content; Generation of multimedia applications
    • H04N21/858Linking data to content, e.g. by linking an URL to a video object, by creating a hotspot
    • H04N21/8586Linking data to content, e.g. by linking an URL to a video object, by creating a hotspot by using a URL
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47End-user applications
    • H04N21/478Supplemental services, e.g. displaying phone caller identification, shopping application
    • H04N21/4782Web browsing, e.g. WebTV

Definitions

  • the present technology relates to a reception device, an information processing method in the reception device, a transmission device, an information processing device, and an information processing method. Specifically, the present technology relates to a reception device or the like which receives and processes display control data such as an HTML document for displaying a web page.
  • Display of the web page is performed by a web browser.
  • the web browser acquires an HTML document (an HTML file) from a web server, parses the HTML document to generate a Document Object Model (DOM) tree, generates various rendered elements based on the DOM tree and displays the web page (for example, refer to PTL 1).
  • HTML document an HTML file
  • DOM Document Object Model
  • the Document Object Model is present in the World Wide Web Consortium (W3C) standards.
  • the DOM defines a tree structure in which the outermost tag, ⁇ html>, is the top node in relation to one HTML document, and designates an interface for applying dynamic processing by JavaScript to each parameter of tags.
  • JavaScript is a registered trademark.
  • MMT MPEG Media Transport
  • ISO/IEC 23008-1 MPEG Media Transport
  • MMT not only defines the transport layer, but also a data structure referred to as MMT Composition Information (MMT-CI) which describes the configuration of the screen or a change with time.
  • MMT-CI configures presentation control information of transport media such as video, audio, and images.
  • the MMT-CI is defined by HTML 5.
  • Video Video
  • HTML 5 HyperText Markup Language
  • MMT Mobility Management Entity
  • the object of the present technology is to enable the access to a presentation control information (HTML document) element contained in the transport media stream from the HTML application side.
  • HTML document presentation control information
  • the concept of the present technology is a reception device which includes
  • a first reception unit which receives a first HTML document for displaying a web page
  • a second reception unit which receives a transport media stream containing a predetermined number of transport media and a second HTML document as presentation control information of the transport media
  • a DOM tree generation unit which generates a DOM tree of the first HTML document that is received by the first reception unit
  • the DOM tree generation unit links the DOM tree that is generated according to the second HTML document contained in the transport media stream beneath a node of the video element.
  • the first HTML document for displaying the web page is received by the first reception unit.
  • the transport media stream is received by the second reception unit.
  • the transport media stream contains the predetermined number of transport media and the second HTML document as the presentation control information of the transport media.
  • the transport media stream may be a transport stream in which first transport packets containing a payload of the transport media, and second transport packets containing information relating to the transport media are time division multiplexed.
  • the transport packets may be MMT packets
  • the second HTML document may be an MMT-CI.
  • the second HTML document may be data having an HTML structure to which the data structure of MPEG2-TS is mapped.
  • the DOM tree of the first HTML document which is received by the first reception unit is generated by the DOM tree generation unit.
  • the DOM tree generation unit When a video element that references the transport media stream is present in the first HTML document, the DOM tree that is generated according to the second HTML document contained in the transport media stream is linked beneath the node of the video element.
  • the DOM tree that is generated according to the second HTML document is linked beneath the node of the video element. Therefore, it is possible to access a presentation control information (HTML document) element contained in the transport media stream from the HTML application side.
  • HTML document presentation control information
  • an element acquisition unit which acquires a predetermined element of the second HTML document based on the DOM tree that is generated by the DOM tree generation unit may be further provided.
  • a display control unit which controls display of the web page based on the DOM tree that is generated by the DOM tree generation unit may be further provided, and the display control unit may display information relating to presentation control of the predetermined number of transport media on a display screen of the web page based on the predetermined element of the second HTML document that is acquired by the element acquisition unit.
  • the user can ascertain the information relating to the presentation control of the predetermined number of transport media presented on the display screen of the web page.
  • a program for accessing a specific element of the second HTML document may be contained in the first HTML document, and the element acquisition unit may acquire the specific element of the second HTML document based on the program. In this case, for example, it is possible to easily acquire a predetermined element of the second HTML document in the HTML application side.
  • reference information for acquiring a program for accessing a specific element of the second HTML document may be contained in the first HTML document, and the element acquisition unit may acquire the specific element of the second HTML document based on the program that is acquired using the reference information. In this case, for example, it is possible to easily acquire a predetermined element of the second HTML document in the HTML application side.
  • a holding portion which holds a first HTML document for displaying a web page which contains a video element that references a transport media stream containing a predetermined number of transport media and a second HTML document as presentation control information of the transport media, and which contains a program for accessing a specific element of the second HTML document or reference information for acquiring the program;
  • a transmission unit which transmits the first HTML document that is held.
  • the first HTML document for displaying a web page is held in the holding portion.
  • the video element that references the transport media stream which contains the predetermined number of transport media and the second HTML document as the presentation control information of the transport media is contained in the first HTML document.
  • the first HTML document contains a program for accessing a specific element of the second HTML document or reference information for acquiring the program.
  • the first HTML document that is held is transmitted by the transmission unit.
  • the transport media stream may be a transport stream in which first transport packets containing a payload of the transport media, and second transport packets containing information relating to the transport media are time division multiplexed.
  • the transport packets may be MMT packets
  • the second HTML document may be an MMT-CI.
  • the first HTML document for displaying a web page that is transmitted from the transmission unit contains a program for accessing a specific element of the second HTML document contained in the transport media stream or reference information for acquiring the program. Therefore, it is possible to easily acquire a predetermined element of the second HTML document in the HTML application side in the reception unit.
  • Another concept of the present technology is an information processing device which includes
  • a data acquisition unit which acquires a first HTML document for displaying a web page
  • a DOM tree generation unit which parses the first HTML document that is acquired by the data acquisition unit and generates a DOM tree in which a plurality of elements are associated with each other
  • the DOM tree generation unit links the DOM tree that is generated according to the second HTML document contained in the transport media stream beneath the video element.
  • the first HTML document for displaying a web page is acquired by the data acquisition unit.
  • the first HTML document that is acquired by the data acquisition unit is parsed, and a DOM tree in which a plurality of elements are associated with each other is generated by the DOM tree generation unit.
  • the DOM tree that is generated according to the second HTML document contained in the transport media stream is linked beneath the video element.
  • the DOM tree that is generated according to the second HTML document is linked beneath the node of the video element of the DOM tree of the first HTML document. Therefore, it is possible to access a presentation control information (HTML document) element contained in a transport media stream from the HTML application side.
  • HTML document presentation control information
  • Another concept of the present technology is an information processing device which includes
  • a data acquisition unit which acquires first display control data containing a plurality of structural units that define information relating to display control
  • a structured data generation unit which parses the first display control data that is acquired by the data acquisition unit and generates structured data in which the plurality of structural units are associated with each other,
  • the structured data generation unit links the structured data that is generated according to the second display control data contained in the transport media stream beneath the predetermined structural unit.
  • the present technology for example, it is possible to access the content of the presentation control information (HTML document) contained in the transport media stream from the HTML application side.
  • HTML document presentation control information
  • FIG. 1 is a block diagram illustrating a configuration example of a display system as an embodiment.
  • FIG. 2 is a diagram illustrating the basic structure of an MMT-CI.
  • FIG. 3 is a diagram illustrating the relationship between a browser screen and an MMT screen when a video (Video) element that references an MMT stream is present in an HTML 5 document.
  • FIG. 4 is a flowchart illustrating an example of a DOM tree generation process in an HTML DOM processing unit of a web browser.
  • FIG. 5 is a diagram illustrating an example of the mutual relationship between the DOM tree of an HTML document and the DOM tree of an MMT-CI linked thereto.
  • FIG. 6 is a diagram illustrating an example of the mutual relationship between the DOM tree of an HTML document and the DOM trees of two MMT-CIs linked thereto.
  • FIG. 7 is a diagram illustrating an example of an MMT-CI.
  • FIG. 8 is a diagram illustrating an example of the DOM tree of an MMT-CI.
  • FIG. 9 is a diagram illustrating an example of an HTML document.
  • FIG. 10 is a diagram illustrating an example the DOM tree of an HTML document.
  • FIG. 11 is a diagram illustrating an example in which the DOM tree of an MMT-CI is linked beneath a video element of the DOM tree of an HTML document.
  • FIG. 12 is a diagram illustrating an example of a browser screen in which an MMT screen is being displayed in a partial region.
  • FIGS. 13A, 13B are diagrams illustrating an example of an HTML document containing reference information of a program (a script) for accessing a specific element of an MMT-CI, and an example of a script file containing the program (the script) for accessing a specific element of the MMT-CI.
  • FIG. 1 illustrates a configuration example of a display system 10 .
  • a broadcasting station 110 a streaming server 120 , and a web (Web) server 130 are disposed on a transmission side, and a receiver 200 is disposed on a reception side.
  • Web web
  • the broadcasting station 110 generates transport packets of the MMT structure (refer to ISO/IEC CD 23008-1), that is, a transport media stream in which MMT packets are contained, and transmits the transport media stream to the reception side through an RF transmission channel.
  • the transport media stream will be referred to as the “MMT stream”, as appropriate.
  • the broadcasting station 110 RF modulates the MMT stream via an appropriate application layer or the like and subsequently transmits the MMT stream to the reception side through the RF transmission channel.
  • the streaming server 120 transmits an MMT stream which is the same as that handled by the broadcasting station 110 described above to the reception side through, for example, a communication network transmission channel such as an Internet 300 .
  • the streaming server 120 converts the MMT stream into IP packets and transmits the IP packets to the reception side through the communication network transmission channel.
  • First MMT packets containing a payload of transport media such as video and audio, and second MMT packets containing a payload of information relating to the transport media are time division multiplexed in the MMT stream, at least by the size of fragmented packets.
  • a data structure referred to as MMT Composition Information (MMT-CI) which describes the configuration of the screen or a change with time is defined as one type of information relating to the transport media.
  • the MMT-CI configures presentation control information of transport media such as video, audio, and images.
  • the MMT-CI is defined by HTML 5.
  • FIG. 2 illustrates the basic structure of an MMT-CI.
  • a root element is “html”, and has a head element and a body element.
  • the head element has a plurality of view elements in addition to a title element, and each of the view elements has a plurality of divLocation elements.
  • the body element has a plurality of div elements. It is possible to place video (video), audio (audio), image (img), and the like inside the div elements.
  • a source element or a src attribute is used for specifying each medium.
  • the view element determines the display position of a div of the body; however, the initial position is determined by the divLocation element therebelow.
  • the divLocation elements sequentially indicate the display position that changes in time-series manner. Therefore, the display position changes according to the divLocation elements without the page reloading.
  • the web server 130 holds HTML 5 documents (HTML files) for displaying web pages (an HTML 5 application) in a holding portion such as storage.
  • the web server 130 transmits an HTML 5 document that is being held in the holding portion to the reception side through a communication network transmission channel such as the Internet 300 using an IP transmission unit.
  • a video (video) element is permitted in HTML 5.
  • a video element is present in an HTML 5 document that is transmitted from the web server 130 to the reception side, and the source of the video element is in an MMT stream.
  • reference information for referencing an MMT stream is present as the video element.
  • the receiver 200 includes an IP reception unit 201 , a web (Web) browser 202 , an output unit 203 , an IP/RF reception unit 204 , an MMT decoding unit 205 , and an MMT player 206 .
  • the IP/RF reception unit 204 receives an MMT stream that is transmitted thereto through an RF transmission channel from the broadcasting station 110 after the MMT stream is parsed in the application layer.
  • the IP/RF reception unit 204 receives an MMT stream that is transmitted thereto from the web server 130 through a communication network transmission channel.
  • the MMT decoding unit 205 subjects the MMT packets contained in the MMT stream that is received by the IP/RF reception unit 204 to unpacketting and a decoding process, obtains data such as video, audio, and images as media data, and also obtains meta-data and messages. In this case, the MMT-CI which describes the configuration of the screen or a change with time is also obtained.
  • the MMT decoding unit 205 also performs the generation of the Document Object Model (DOM) tree of the MMT-CI.
  • DOM Document Object Model
  • the MMT player 206 generates output data of images and audio according to the configuration (the layout) of the screen, the change with time or the like specified by the MMT-CI based on data such as video, audio, and images obtained by the MMT decoding unit 205 .
  • the output unit 203 performs image display and audio output based on the output data of images and audio generated by the MMT player 206 , or the output data of images and audio generated by the web browser 202 .
  • the output unit is configured by a display which performs the image display, a speaker which performs audio output, or the like.
  • the IP reception unit 201 receives an HTML 5 document (an HTML file) that is transmitted thereto from the web server 130 through a communication network transmission channel such as the Internet 300 .
  • the web browser 202 parses the HTML 5 document that is received by the IP reception unit 201 , generates a DOM tree, and generates the output data of images and audio based on the DOM tree and the like.
  • the web browser 202 includes an HTML parsing unit 202 a which parses the HTML 5 document, and an HTML DOM processing unit 202 b which generates a DOM tree based on the parsed results.
  • the web browser 202 includes an HTML layout processing unit 202 c and an HTML rendering process unit 202 d which perform a layout process and a rendering process based on the DOM tree or the like and generate the output data of images and audio.
  • the HTML DOM processing unit 202 b links the DOM tree of the MMT-CI that is generated by the MMT decoding unit 205 as described above beneath the node of the video element.
  • FIG. 3 illustrates the relationship between a browser screen (HTML 5) and an MMT screen when a video (video) element that references an MMT stream is present in the HTML 5 document.
  • the MMT screen enters a state of being inserted into a portion of the browser screen.
  • the size adjustment and the like when inserting the MMT screen into a portion of the browser screen is performed by the HTML rendering process unit 202 d of the web browser 202 .
  • FIG. 4 illustrates an example of a DOM tree generation process in the HTML DOM processing unit 202 b of the web browser 202 .
  • the HTML DOM processing unit 202 b determines whether or not an element to parse is present in the HTML document in step ST 3 .
  • the HTML DOM processing unit 202 b parses an element of the HTML document for each tag in step ST 4 .
  • the HTML DOM processing unit 202 b determines whether or not a video element is present in the elements of the parsed HTML document in step ST 5 .
  • the HTML DOM processing unit 202 b determines whether or not the source is an MMT stream in step ST 6 .
  • the HTML DOM processing unit 202 b links the DOM tree of the MMT-CI beneath the node of the video element in step ST 7 .
  • the HTML DOM processing unit 202 b joins the document of the DOM tree of the MMT plane, which serves as otherPlane[i], to the child node of the video element.
  • the HTML DOM processing unit 202 b increments the value of i in step ST 8 , subsequently returns to the process of step ST 3 , and repeats the same processes as described above. Note that, when no video element is present in step ST 5 , and when the source of the video element is not an MMT stream in step ST 6 , the process immediately returns to the process of step ST 3 . In addition, when all the elements of the HTML document are parsed and there are no elements to parse in step ST 3 , the HTML DOM processing unit 202 b proceeds to step ST 9 and completes the DOM tree generation process.
  • FIG. 5 illustrates an example of the mutual relationship between the DOM tree of an HTML document and the DOM tree of an MMT-CI linked thereto. Using these DOM trees, it is possible to access each element of the HTML document and each element of the MMT-CI.
  • each element of the MMT-CI for example, a view element using a script such as that shown below.
  • This script is an example of a case in which the view element of the MMT-CI is accessed from the video element of the HTML document.
  • FIG. 6 is a diagram illustrating an example of the mutual relationship between the DOM tree of an HTML document and the DOM trees of two MMT-CIs linked thereto. Using these DOM trees, it is possible to access each element of the HTML document and each element of the two MMT-CIs.
  • FIG. 7 is a diagram illustrating an example of an MMT-CI.
  • the html element has a head element and a body element.
  • the head element has a title element and a view element. Textual data indicating the title is present in the title element.
  • the view element has two divLocation elements.
  • the body element has two div elements. One div element has a video element and an audio element, and the other div element has two img elements.
  • FIG. 8 illustrates an example of the DOM tree of an MMT-CI.
  • FIG. 9 illustrates an example of an HTML document.
  • the html element has a head element and a body element.
  • the head element has a title element. Textual data indicating the title is present in the title element.
  • the body element has a div element and a script element.
  • a script for accessing various elements and acquiring the elements is present as the script element.
  • FIG. 10 illustrates an example the DOM tree of the HTML document.
  • FIG. 11 illustrates an example in which the DOM tree of an MMT-CI is linked beneath the video element of the DOM tree of the HTML document.
  • the MMT stream is transmitted from the broadcasting station 110 to the reception side through the RF transmission channel.
  • the MMT stream is transmitted from the streaming server 120 to the reception side through a communication network transmission channel such as the Internet 300 .
  • the MMT stream that is transmitted from the broadcasting station 110 or the streaming server 120 is received by the IP/RF reception unit 204 of the receiver 200 .
  • the MMT stream is supplied to the MMT decoding unit 205 .
  • the MMT decoding unit 205 subjects the MMT packets contained in the MMT stream to unpacketting and a decoding process, obtains data such as video, audio, and images as media data, and further obtains to meta-data and messages.
  • the MMT-CI which describes the configuration of the screen or a change with time is also obtained by the MMT decoding unit 205 .
  • the parsing of the MMT-CI is performed and the DOM tree of the MMT-CI is generated.
  • the various data obtained by the MMT decoding unit 205 is supplied to the MMT player 206 .
  • the MMT player 206 generates output data of images and audio according to the configuration (the layout) of the screen, the change with time or the like specified by the MMT-CI based on data such as video, audio, and images obtained by the MMT decoding unit 205 .
  • the output data of images and audio is supplied to the output unit 203 .
  • image display and audio output are performed based on the output data of images and audio generated by the MMT player 206 .
  • an HTML 5 document for displaying a web page (an HTML 5 application) is transmitted to the reception side from the web server 130 through a communication network transmission channel such as the Internet 300 .
  • An HTML 5 document (an HTML file) that is transmitted from the web server 130 is received by the IP reception unit 201 of the receiver 200 .
  • the HTML 5 document is supplied to the web browser 202 .
  • the HTML 5 document that is received by the IP reception unit 201 is parsed, a DOM tree is generated, and output data of images and audio is generated based on the DOM tree or the like.
  • the web browser 202 when generating the DOM tree, when a video (video) element which references an MMT stream is present in the HTML 5 document, the DOM tree of the MMT-CI that is generated by the MMT decoding unit 205 as described above is linked beneath the node of the video element (refer to FIGS. 8, 10, and 11 ).
  • the output data of images and audio generated by the web browser 202 is supplied to the output unit 203 .
  • the output unit 203 when the browser screen is displayed, image display and audio output are performed based on the output data of images and audio generated by the web browser 202 .
  • a video (video) element that references the MMT stream is present in the HTML document, the MMT screen is inserted and displayed in a predetermined region of the browser screen (refer to FIG. 3 ).
  • accessing a predetermined element of the MMT-CI accessing a predetermined element of the MMT-CI, acquiring and using the element are performed either according to the operation of a viewer or automatically based on the linked DOM tree. For example, displaying information relating to the presentation control of the MMT screen on the browser screen is performed based on the acquired element.
  • a script for accessing a predetermined element of the MMT-CI is contained in the HTML document
  • acquiring the predetermined element of the MMT-CI is performed using the script. Note that, depending on the situation, the acquisition of a predetermined element of the HTML document is also performed.
  • FIG. 12 illustrates an example of the browser screen.
  • the MMT screen is inserted into a partial region of the browser screen.
  • the display region of the MMT screen is divided into two, video (video) and audio (audio) are displayed in one region, and image 1 (Image 1) is displayed in the other region.
  • the browser screen displays “Display time of Image 2: 18:00” in the proximity of the display region of the MMT screen.
  • Specific elements of the MMT-CI and the HTML document (the value “18:00” which is the value of the display end time, and the textual data indicating “Display start time of image2”) that are acquired using a script contained in the HTML document of FIG. 9 described above are used, for example, for the display information. Based on this display, the viewer of the browser screen can ascertain, in advance, that image 1 (Image 1) that is displayed in the display region of the MMT screen will switch to image 2 (Image 2) when the time reaches 18:00.
  • the DOM tree that is generated according to the MMT-CI is linked beneath the node of the video element.
  • the elements of the MMT-CI from the HTML application side.
  • an HTML document that is transmitted from the web server 130 contains reference information for acquiring a program (a script) for accessing a specific element of the MMT-CI.
  • the web browser 202 acquires a script file containing a program (a script) for accessing a specific element of the MMT-CI from the web server 130 based on the reference information.
  • FIG. 13A illustrates an example of an HTML document in this case.
  • FIG. 13B illustrates an example of a script file containing the program (the script) for accessing a specific element of the MMT-CI in this case.
  • the transport media stream is an MMT stream; however, the present technology can, naturally, be applied equally to a display system that handles a transport media stream that is similar to an MMT stream.
  • the second HTML document is an MMT-CI; however, for example, it is conceivable for this to be data having an HTML structure to which the data structure of MPEG2-TS is mapped.
  • streaming server 120 and the web server 130 are distinct from each other; however, a configuration in which these servers are formed of a single server is also conceivable.
  • the MMT decoding unit 205 and the MMT player 206 are present distinctly from the web browser 202 .
  • the MMT decoding unit 205 and the MMT player 206 are not necessary.
  • the present technology may adopt configurations such as the following.
  • a reception device including a first reception unit which receives a first HTML document for displaying a web page; a second reception unit which receives a transport media stream containing a predetermined number of transport media and a second HTML document as presentation control information of the transport media; and a DOM tree generation unit which generates a DOM tree of the first HTML document that is received by the first reception unit, in which, when a video element that references the transport media stream is present in the first HTML document, the DOM tree generation unit links the DOM tree that is generated according to the second HTML document contained in the transport media stream beneath a node of the video element.
  • the reception device further including an element acquisition unit which acquires a predetermined element of the second HTML document based on the DOM tree that is generated by the DOM tree generation unit.
  • the reception device further including a display control unit which controls display of the web page based on the DOM tree that is generated by the DOM tree generation unit, in which the display control unit displays information relating to presentation control of the predetermined number of transport media on a display screen of the web page based on the predetermined element of the second HTML document that is acquired by the element acquisition unit.
  • the transport media stream is a transport stream in which first transport packets containing a payload of the transport media, and second transport packets containing information relating to the transport media are time division multiplexed.
  • An information processing method in a reception device including a first reception unit which receives a first HTML document for displaying a web page; and a second reception unit which receives a transport media stream containing a predetermined number of transport media and a second HTML document as presentation control information of the transport media, the method including a step of generating a DOM tree of the first HTML document that is received by the first reception unit; and a step of linking the DOM tree that is generated according to the second HTML document contained in the transport media stream beneath a node of a video element when the video element that references the transport media stream is present in the first HTML document.
  • a transmission device including a holding portion which holds a first HTML document for displaying a web page which contains a video element that references a transport media stream containing a predetermined number of transport media and a second HTML document as presentation control information of the transport media, and which contains a program for accessing a specific element of the second HTML document or reference information for acquiring the program; and a transmission unit which transmits the first HTML document that is held.
  • the transport media stream is a transport stream in which first transport packets containing a payload of the transport media, and second transport packets containing information relating to the transport media are time division multiplexed.
  • An information processing device including a data acquisition unit which acquires a first HTML document for displaying a web page; and a DOM tree generation unit which parses the first HTML document that is acquired by the data acquisition unit and generates a DOM tree in which a plurality of elements are associated with each other, in which, when a video element that references a transport media stream containing a predetermined number of transport media and a second HTML document as presentation control information of the transport media is present in the first HTML document, the DOM tree generation unit links the DOM tree that is generated according to the second HTML document contained in the transport media stream beneath the video element.
  • An information processing method including a data acquisition step of causing a web browser to acquire a first HTML document for displaying a web page; and a DOM tree generation step of causing the web browser to parse the first HTML document that is acquired and to generate a DOM tree in which a plurality of elements are associated with each other, in which in the DOM tree generation step, when a video element that references a transport media stream containing a predetermined number of transport media and a second HTML document as presentation control information of the transport media is present in the first HTML document, the DOM tree that is generated according to the second HTML document contained in the transport media stream is linked beneath the video element.
  • An information processing device including a data acquisition unit which acquires first display control data containing a plurality of structural units that define information relating to display control; and a structured data generation unit which parses the first display control data that is acquired by the data acquisition unit and generates structured data in which the plurality of structural units are associated with each other, in which, when a predetermined structural unit that references a transport media stream containing a predetermined number of transport media and second display control data containing a plurality of the structural units that define presentation control information of the transport media is present in the first display control data, the structured data generation unit links the structured data that is generated according to the second display control data contained in the transport media stream beneath the predetermined structural unit.
  • An information processing method including a data acquisition step of acquiring first display control data containing a plurality of structural units that define information relating to display control; and a structured data generation step of parsing the first display control data that is acquired and generating structured data in which the plurality of structural units are associated with each other, in which, in the structured data generation step, when a predetermined structural unit that references a transport media stream containing a predetermined number of transport media and second display control data containing a plurality of the structural units that define presentation control information of the transport media is present in the first display control data, the structured data that is generated according to the second display control data contained in the transport media stream is linked beneath the predetermined structural unit.
  • a display system including a first transmission device which transmits a first HTML document for displaying a web page; a second transmission device which transmits a transport media stream containing a predetermined number of transport media and a second HTML document as presentation control information of the transport media; and a reception device which includes a first reception unit which receives the first HTML document that is transmitted from the first transmission device, and a second reception unit which receives the transport media stream which is transmitted from the second transmission device, in which the reception device includes a DOM tree generation unit which generates a DOM tree of the first HTML document that is received by the first reception unit, and a display control unit which controls display of the web page based on the DOM tree that is generated by the DOM tree generation unit, and in which, when a video element that references the transport media stream is present in the first HTML document, the DOM tree generation unit links the DOM tree that is generated according to the second HTML document contained in the transport media stream beneath a node of the video element.
  • the main characteristic of the present technology is to enable the access to elements of an MMT-CI from an HTML application side by, during the generation of a DOM tree of an HTML document (HTML application) for displaying a web page, when a video element that references the MMT stream is present, linking the DOM tree of the MMT-CI beneath a node of the video element (refer to FIG. 5 ).

Abstract

It is possible to access a presentation control information (HTML document) element contained in a transport media stream from an HTML application side.
A first HTML document for displaying a web page is received. In addition, the transport media stream which contains a predetermined number of transport media and a second HTML document as the presentation control information of the transport media is received. A DOM tree of the first HTML document is generated. When a video element that references the transport media stream is present in the first HTML document, the DOM tree that is generated according to the second HTML document contained in the transport media stream is linked beneath a node of the video element.

Description

    CROSS-REFERENCE TO RELATED APPLICATIONS
  • This application is a continuation of U.S. application Ser. No. 14/763,689, filed Jul. 27, 2015, which is a national stage of International Application No. PCT/JP2014/060874, filed on Apr. 16, 2014, the entire contents of which are incorporated herein by reference, and claims priority under 35 U.S.C. 119 to Japanese Application No. 2013-093436, filed Apr. 26, 2013.
  • TECHNICAL FIELD
  • The present technology relates to a reception device, an information processing method in the reception device, a transmission device, an information processing device, and an information processing method. Specifically, the present technology relates to a reception device or the like which receives and processes display control data such as an HTML document for displaying a web page.
  • BACKGROUND ART
  • Display of the web page is performed by a web browser. At this time, the web browser acquires an HTML document (an HTML file) from a web server, parses the HTML document to generate a Document Object Model (DOM) tree, generates various rendered elements based on the DOM tree and displays the web page (for example, refer to PTL 1).
  • The Document Object Model (DOM) is present in the World Wide Web Consortium (W3C) standards. The DOM defines a tree structure in which the outermost tag, <html>, is the top node in relation to one HTML document, and designates an interface for applying dynamic processing by JavaScript to each parameter of tags. Note that “JavaScript” is a registered trademark.
  • In recent years, a method defined in MPEG Media Transport (MMT) ISO/IEC 23008-1 is attracting attention as a transport method suitable for next generation broadcasting. MMT not only defines the transport layer, but also a data structure referred to as MMT Composition Information (MMT-CI) which describes the configuration of the screen or a change with time. The MMT-CI configures presentation control information of transport media such as video, audio, and images. The MMT-CI is defined by HTML 5.
  • CITATION LIST Patent Literature
  • PTL 1: Japanese Unexamined Patent Application Publication No. 2011-065489
  • SUMMARY OF INVENTION Technical Problem
  • There is a case in which a video (Video) element is present in an HTML 5 document from a web server. In this case, video display is performed according to the video element in a predetermined region of a web page screen that is displayed by a web browser. In this case, it is conceivable to reference an MMT transport media stream as a source of the video element.
  • The object of the present technology is to enable the access to a presentation control information (HTML document) element contained in the transport media stream from the HTML application side.
  • Solution to Problem
  • The concept of the present technology is a reception device which includes
  • a first reception unit which receives a first HTML document for displaying a web page;
  • a second reception unit which receives a transport media stream containing a predetermined number of transport media and a second HTML document as presentation control information of the transport media; and
  • a DOM tree generation unit which generates a DOM tree of the first HTML document that is received by the first reception unit,
  • in which, when a video element that references the transport media stream is present in the first HTML document, the DOM tree generation unit links the DOM tree that is generated according to the second HTML document contained in the transport media stream beneath a node of the video element.
  • In the present technology, the first HTML document for displaying the web page is received by the first reception unit. In addition, the transport media stream is received by the second reception unit. The transport media stream contains the predetermined number of transport media and the second HTML document as the presentation control information of the transport media.
  • For example, the transport media stream may be a transport stream in which first transport packets containing a payload of the transport media, and second transport packets containing information relating to the transport media are time division multiplexed. In this case, the transport packets may be MMT packets, and the second HTML document may be an MMT-CI. For example, the second HTML document may be data having an HTML structure to which the data structure of MPEG2-TS is mapped.
  • The DOM tree of the first HTML document which is received by the first reception unit is generated by the DOM tree generation unit. When a video element that references the transport media stream is present in the first HTML document, the DOM tree that is generated according to the second HTML document contained in the transport media stream is linked beneath the node of the video element.
  • In the present technology, when generating the DOM tree of the first HTML document, the DOM tree that is generated according to the second HTML document is linked beneath the node of the video element. Therefore, it is possible to access a presentation control information (HTML document) element contained in the transport media stream from the HTML application side.
  • Note that, in the present technology, for example, an element acquisition unit which acquires a predetermined element of the second HTML document based on the DOM tree that is generated by the DOM tree generation unit may be further provided. In this case, for example, it is possible to acquire and use a predetermined element of the second HTML document in the HTML application side.
  • For example, a display control unit which controls display of the web page based on the DOM tree that is generated by the DOM tree generation unit may be further provided, and the display control unit may display information relating to presentation control of the predetermined number of transport media on a display screen of the web page based on the predetermined element of the second HTML document that is acquired by the element acquisition unit. In this case, for example, the user can ascertain the information relating to the presentation control of the predetermined number of transport media presented on the display screen of the web page.
  • In addition, for example, a program for accessing a specific element of the second HTML document may be contained in the first HTML document, and the element acquisition unit may acquire the specific element of the second HTML document based on the program. In this case, for example, it is possible to easily acquire a predetermined element of the second HTML document in the HTML application side.
  • In addition, for example, reference information for acquiring a program for accessing a specific element of the second HTML document may be contained in the first HTML document, and the element acquisition unit may acquire the specific element of the second HTML document based on the program that is acquired using the reference information. In this case, for example, it is possible to easily acquire a predetermined element of the second HTML document in the HTML application side.
  • In addition, another concept of the present technology is a transmission device which includes
  • a holding portion which holds a first HTML document for displaying a web page which contains a video element that references a transport media stream containing a predetermined number of transport media and a second HTML document as presentation control information of the transport media, and which contains a program for accessing a specific element of the second HTML document or reference information for acquiring the program; and
  • a transmission unit which transmits the first HTML document that is held.
  • In the present technology, the first HTML document for displaying a web page is held in the holding portion. The video element that references the transport media stream which contains the predetermined number of transport media and the second HTML document as the presentation control information of the transport media is contained in the first HTML document. In addition, the first HTML document contains a program for accessing a specific element of the second HTML document or reference information for acquiring the program. The first HTML document that is held is transmitted by the transmission unit.
  • For example, the transport media stream may be a transport stream in which first transport packets containing a payload of the transport media, and second transport packets containing information relating to the transport media are time division multiplexed. In this case, the transport packets may be MMT packets, and the second HTML document may be an MMT-CI.
  • In the present technology, the first HTML document for displaying a web page that is transmitted from the transmission unit contains a program for accessing a specific element of the second HTML document contained in the transport media stream or reference information for acquiring the program. Therefore, it is possible to easily acquire a predetermined element of the second HTML document in the HTML application side in the reception unit.
  • In addition, another concept of the present technology is an information processing device which includes
  • a data acquisition unit which acquires a first HTML document for displaying a web page; and
  • a DOM tree generation unit which parses the first HTML document that is acquired by the data acquisition unit and generates a DOM tree in which a plurality of elements are associated with each other,
  • in which, when a video element that references a transport media stream containing a predetermined number of transport media and a second HTML document as presentation control information of the transport media is present in the first HTML document, the DOM tree generation unit links the DOM tree that is generated according to the second HTML document contained in the transport media stream beneath the video element.
  • In the present technology, the first HTML document for displaying a web page is acquired by the data acquisition unit. The first HTML document that is acquired by the data acquisition unit is parsed, and a DOM tree in which a plurality of elements are associated with each other is generated by the DOM tree generation unit.
  • In this case, when a video element that references the transport media stream that contains a predetermined number of transport media and the second HTML document as the presentation control information of the transport media is present in the first HTML document, the DOM tree that is generated according to the second HTML document contained in the transport media stream is linked beneath the video element.
  • In the present technology, the DOM tree that is generated according to the second HTML document is linked beneath the node of the video element of the DOM tree of the first HTML document. Therefore, it is possible to access a presentation control information (HTML document) element contained in a transport media stream from the HTML application side.
  • In addition, another concept of the present technology is an information processing device which includes
  • a data acquisition unit which acquires first display control data containing a plurality of structural units that define information relating to display control; and
  • a structured data generation unit which parses the first display control data that is acquired by the data acquisition unit and generates structured data in which the plurality of structural units are associated with each other,
  • in which, when a predetermined structural unit that references a transport media stream containing a predetermined number of transport media and second display control data containing a plurality of the structural units that define presentation control information of the transport media is present in the first display control data, the structured data generation unit links the structured data that is generated according to the second display control data contained in the transport media stream beneath the predetermined structural unit.
  • Advantageous Effects of Invention
  • According to the present technology, for example, it is possible to access the content of the presentation control information (HTML document) contained in the transport media stream from the HTML application side. Note that, the effects disclosed in the present specification are merely examples, embodiments are not to be limited thereto and additional effects may be present.
  • BRIEF DESCRIPTION OF DRAWINGS
  • FIG. 1 is a block diagram illustrating a configuration example of a display system as an embodiment.
  • FIG. 2 is a diagram illustrating the basic structure of an MMT-CI.
  • FIG. 3 is a diagram illustrating the relationship between a browser screen and an MMT screen when a video (Video) element that references an MMT stream is present in an HTML 5 document.
  • FIG. 4 is a flowchart illustrating an example of a DOM tree generation process in an HTML DOM processing unit of a web browser.
  • FIG. 5 is a diagram illustrating an example of the mutual relationship between the DOM tree of an HTML document and the DOM tree of an MMT-CI linked thereto.
  • FIG. 6 is a diagram illustrating an example of the mutual relationship between the DOM tree of an HTML document and the DOM trees of two MMT-CIs linked thereto.
  • FIG. 7 is a diagram illustrating an example of an MMT-CI.
  • FIG. 8 is a diagram illustrating an example of the DOM tree of an MMT-CI.
  • FIG. 9 is a diagram illustrating an example of an HTML document.
  • FIG. 10 is a diagram illustrating an example the DOM tree of an HTML document.
  • FIG. 11 is a diagram illustrating an example in which the DOM tree of an MMT-CI is linked beneath a video element of the DOM tree of an HTML document.
  • FIG. 12 is a diagram illustrating an example of a browser screen in which an MMT screen is being displayed in a partial region.
  • FIGS. 13A, 13B are diagrams illustrating an example of an HTML document containing reference information of a program (a script) for accessing a specific element of an MMT-CI, and an example of a script file containing the program (the script) for accessing a specific element of the MMT-CI.
  • DESCRIPTION OF EMBODIMENTS
  • Hereafter, description will be given of embodiments for realizing the invention (below, “embodiments”). Note that, the description will be given in the following order.
  • 1. Embodiment 2. Modification Example 1. Embodiment [Configuration Example of Display System]
  • FIG. 1 illustrates a configuration example of a display system 10. In the display system 10, a broadcasting station 110, a streaming server 120, and a web (Web) server 130 are disposed on a transmission side, and a receiver 200 is disposed on a reception side.
  • The broadcasting station 110 generates transport packets of the MMT structure (refer to ISO/IEC CD 23008-1), that is, a transport media stream in which MMT packets are contained, and transmits the transport media stream to the reception side through an RF transmission channel. Hereinafter, the transport media stream will be referred to as the “MMT stream”, as appropriate. In this case, the broadcasting station 110 RF modulates the MMT stream via an appropriate application layer or the like and subsequently transmits the MMT stream to the reception side through the RF transmission channel.
  • The streaming server 120 transmits an MMT stream which is the same as that handled by the broadcasting station 110 described above to the reception side through, for example, a communication network transmission channel such as an Internet 300. In this case, the streaming server 120 converts the MMT stream into IP packets and transmits the IP packets to the reception side through the communication network transmission channel.
  • First MMT packets containing a payload of transport media such as video and audio, and second MMT packets containing a payload of information relating to the transport media are time division multiplexed in the MMT stream, at least by the size of fragmented packets. A data structure referred to as MMT Composition Information (MMT-CI) which describes the configuration of the screen or a change with time is defined as one type of information relating to the transport media. The MMT-CI configures presentation control information of transport media such as video, audio, and images. The MMT-CI is defined by HTML 5.
  • FIG. 2 illustrates the basic structure of an MMT-CI. A root element is “html”, and has a head element and a body element. The head element has a plurality of view elements in addition to a title element, and each of the view elements has a plurality of divLocation elements. In addition, the body element has a plurality of div elements. It is possible to place video (video), audio (audio), image (img), and the like inside the div elements. A source element or a src attribute is used for specifying each medium. The view element determines the display position of a div of the body; however, the initial position is determined by the divLocation element therebelow. The divLocation elements sequentially indicate the display position that changes in time-series manner. Therefore, the display position changes according to the divLocation elements without the page reloading.
  • Returning to FIG. 1, the web server 130 holds HTML 5 documents (HTML files) for displaying web pages (an HTML 5 application) in a holding portion such as storage. In response to a request from the reception side, the web server 130 transmits an HTML 5 document that is being held in the holding portion to the reception side through a communication network transmission channel such as the Internet 300 using an IP transmission unit. It is well known that the presence of a video (video) element is permitted in HTML 5. For example, there is a case in which a video element is present in an HTML 5 document that is transmitted from the web server 130 to the reception side, and the source of the video element is in an MMT stream. In this case, reference information for referencing an MMT stream is present as the video element.
  • The receiver 200 includes an IP reception unit 201, a web (Web) browser 202, an output unit 203, an IP/RF reception unit 204, an MMT decoding unit 205, and an MMT player 206. The IP/RF reception unit 204 receives an MMT stream that is transmitted thereto through an RF transmission channel from the broadcasting station 110 after the MMT stream is parsed in the application layer. Alternatively, the IP/RF reception unit 204 receives an MMT stream that is transmitted thereto from the web server 130 through a communication network transmission channel.
  • The MMT decoding unit 205 subjects the MMT packets contained in the MMT stream that is received by the IP/RF reception unit 204 to unpacketting and a decoding process, obtains data such as video, audio, and images as media data, and also obtains meta-data and messages. In this case, the MMT-CI which describes the configuration of the screen or a change with time is also obtained. The MMT decoding unit 205 also performs the generation of the Document Object Model (DOM) tree of the MMT-CI.
  • The MMT player 206 generates output data of images and audio according to the configuration (the layout) of the screen, the change with time or the like specified by the MMT-CI based on data such as video, audio, and images obtained by the MMT decoding unit 205.
  • The output unit 203 performs image display and audio output based on the output data of images and audio generated by the MMT player 206, or the output data of images and audio generated by the web browser 202. The output unit is configured by a display which performs the image display, a speaker which performs audio output, or the like.
  • The IP reception unit 201 receives an HTML 5 document (an HTML file) that is transmitted thereto from the web server 130 through a communication network transmission channel such as the Internet 300. The web browser 202 parses the HTML 5 document that is received by the IP reception unit 201, generates a DOM tree, and generates the output data of images and audio based on the DOM tree and the like.
  • The web browser 202 includes an HTML parsing unit 202 a which parses the HTML 5 document, and an HTML DOM processing unit 202 b which generates a DOM tree based on the parsed results. In addition, the web browser 202 includes an HTML layout processing unit 202 c and an HTML rendering process unit 202 d which perform a layout process and a rendering process based on the DOM tree or the like and generate the output data of images and audio.
  • In the embodiment, when a video (video) element which references an MMT stream is present in the HTML 5 document, the HTML DOM processing unit 202 b links the DOM tree of the MMT-CI that is generated by the MMT decoding unit 205 as described above beneath the node of the video element. By performing the linking of the DOM trees in this manner, it becomes possible to access the elements of the MMT-CI from the HTML application side that is handled by the web browser 202.
  • FIG. 3 illustrates the relationship between a browser screen (HTML 5) and an MMT screen when a video (video) element that references an MMT stream is present in the HTML 5 document. In this case, the MMT screen enters a state of being inserted into a portion of the browser screen. Here, the size adjustment and the like when inserting the MMT screen into a portion of the browser screen is performed by the HTML rendering process unit 202 d of the web browser 202.
  • FIG. 4 illustrates an example of a DOM tree generation process in the HTML DOM processing unit 202 b of the web browser 202. The HTML DOM processing unit 202 b starts the DOM tree generation process in step ST1, and subsequently sets i=0 in step ST2.
  • Next, the HTML DOM processing unit 202 b determines whether or not an element to parse is present in the HTML document in step ST3. When an element to parse is present, the HTML DOM processing unit 202 b parses an element of the HTML document for each tag in step ST4.
  • Next, the HTML DOM processing unit 202 b determines whether or not a video element is present in the elements of the parsed HTML document in step ST5. When a video element is present in the elements, the HTML DOM processing unit 202 b determines whether or not the source is an MMT stream in step ST6.
  • When the source is an MMT stream, the HTML DOM processing unit 202 b links the DOM tree of the MMT-CI beneath the node of the video element in step ST7. In other words, the HTML DOM processing unit 202 b joins the document of the DOM tree of the MMT plane, which serves as otherPlane[i], to the child node of the video element.
  • Next, the HTML DOM processing unit 202 b increments the value of i in step ST8, subsequently returns to the process of step ST3, and repeats the same processes as described above. Note that, when no video element is present in step ST5, and when the source of the video element is not an MMT stream in step ST6, the process immediately returns to the process of step ST3. In addition, when all the elements of the HTML document are parsed and there are no elements to parse in step ST3, the HTML DOM processing unit 202 b proceeds to step ST9 and completes the DOM tree generation process.
  • FIG. 5 illustrates an example of the mutual relationship between the DOM tree of an HTML document and the DOM tree of an MMT-CI linked thereto. Using these DOM trees, it is possible to access each element of the HTML document and each element of the MMT-CI.
  • For example, it is possible to perform the access to the video element of the HTML document using a script (script) such as that shown below.
  • var videoElm=document.getElementsByTagName(‘video’)[0];
  • Alternatively, if an id is attached to the video element, it is possible to perform the access to the video element of the HTML document using a script such as that shown below.
  • var videoElm=document.getElementByID(“v1”);
  • In addition, it is possible to perform the access to each element of the MMT-CI, for example, a view element using a script such as that shown below. This script is an example of a case in which the view element of the MMT-CI is accessed from the video element of the HTML document.
  • var mmtElm=videoElm.otherPlane[0].document.getElementsByTagName(‘view’) [0];
  • In addition, relative access to the view element of the MMT-CI from the top of the HTML document is also conceivable. An example of a script in this case is shown below.
  • var mmtElm=document.firstChild.firstChild.childNode [1].otherPlane [0].document.getElementsByTagName(‘view’) [0];
  • Note that, since the DOM of the MMT-CI is appropriately updated in time series, when the DOM of the MMT-CI is re-written dynamically from the HTML application, integrity cannot be maintained. Therefore, it is necessary to use read only (read only). In other words, all of the DOM access from otherPlane onward is set to read only (read only).
  • FIG. 6 is a diagram illustrating an example of the mutual relationship between the DOM tree of an HTML document and the DOM trees of two MMT-CIs linked thereto. Using these DOM trees, it is possible to access each element of the HTML document and each element of the two MMT-CIs.
  • For example, it is possible to perform the access to the view element of the MMT-CI of the DOM tree (otherPlane[i]) using a script (script) such as that shown below.
  • var mmtElm=document.firstChild.firstChild.childNode [1].otherPlane[1].document.getElementsByTagName(‘view’) [0];
  • FIG. 7 is a diagram illustrating an example of an MMT-CI. The html element has a head element and a body element. The head element has a title element and a view element. Textual data indicating the title is present in the title element. The view element has two divLocation elements. In addition, the body element has two div elements. One div element has a video element and an audio element, and the other div element has two img elements. FIG. 8 illustrates an example of the DOM tree of an MMT-CI.
  • FIG. 9 illustrates an example of an HTML document. The html element has a head element and a body element. The head element has a title element. Textual data indicating the title is present in the title element. In addition, the body element has a div element and a script element. The div element has a p element and a video element. Textual data indicating “display start time of image2” is present in the p element. Reference information of the MMT stream “src=“http://sample.mmt”” is present as the video element.
  • A script for accessing various elements and acquiring the elements is present as the script element. Here, “var videoElm=document.getElementsByTagName(‘video’)[0];” is a script for accessing the video element of the HTML document. In addition, “var mmtElm=videoElm.otherPlain[0].document.getElementsByTagName(‘view’) [0];” is a script for accessing the view element of the MMT-CI from the video element of the HTML document.
  • Furthermore, “var endtime=mmtElm.getElememtById(‘Image1’).getAttributeNode(“MMT-CI:end”).nodeValue;” is a script for accessing the image1 element of the MMT-CI and acquiring the value of “end”. It is possible to acquire “18:00”, which is the value of the display end time, using the script.
  • In addition, “var pelm=document.getElementsByTagName(‘p’)[0];” and “pelm.innerText=pelm.innerText+endtime;” are scripts for accessing the p element of the MMT-CI and acquiring the textual data indicating “display start time of image2”.
  • FIG. 10 illustrates an example the DOM tree of the HTML document. FIG. 11 illustrates an example in which the DOM tree of an MMT-CI is linked beneath the video element of the DOM tree of the HTML document.
  • A simple description will be given of the operations of the display system 10 illustrated in FIG. 1. The MMT stream is transmitted from the broadcasting station 110 to the reception side through the RF transmission channel. Alternatively, the MMT stream is transmitted from the streaming server 120 to the reception side through a communication network transmission channel such as the Internet 300.
  • The MMT stream that is transmitted from the broadcasting station 110 or the streaming server 120 is received by the IP/RF reception unit 204 of the receiver 200. The MMT stream is supplied to the MMT decoding unit 205. The MMT decoding unit 205 subjects the MMT packets contained in the MMT stream to unpacketting and a decoding process, obtains data such as video, audio, and images as media data, and further obtains to meta-data and messages.
  • In addition, the MMT-CI which describes the configuration of the screen or a change with time is also obtained by the MMT decoding unit 205. In the MMT decoding unit 205, the parsing of the MMT-CI is performed and the DOM tree of the MMT-CI is generated. The various data obtained by the MMT decoding unit 205 is supplied to the MMT player 206.
  • The MMT player 206 generates output data of images and audio according to the configuration (the layout) of the screen, the change with time or the like specified by the MMT-CI based on data such as video, audio, and images obtained by the MMT decoding unit 205. The output data of images and audio is supplied to the output unit 203. In the output unit 203, when the MMT screen is displayed, image display and audio output are performed based on the output data of images and audio generated by the MMT player 206.
  • In addition, in response to a request from the reception side, an HTML 5 document (an HTML file) for displaying a web page (an HTML 5 application) is transmitted to the reception side from the web server 130 through a communication network transmission channel such as the Internet 300. An HTML 5 document (an HTML file) that is transmitted from the web server 130 is received by the IP reception unit 201 of the receiver 200. The HTML 5 document is supplied to the web browser 202.
  • In the web browser 202, the HTML 5 document that is received by the IP reception unit 201 is parsed, a DOM tree is generated, and output data of images and audio is generated based on the DOM tree or the like. In the web browser 202, when generating the DOM tree, when a video (video) element which references an MMT stream is present in the HTML 5 document, the DOM tree of the MMT-CI that is generated by the MMT decoding unit 205 as described above is linked beneath the node of the video element (refer to FIGS. 8, 10, and 11).
  • The output data of images and audio generated by the web browser 202 is supplied to the output unit 203. In the output unit 203, when the browser screen is displayed, image display and audio output are performed based on the output data of images and audio generated by the web browser 202. Here, when a video (video) element that references the MMT stream is present in the HTML document, the MMT screen is inserted and displayed in a predetermined region of the browser screen (refer to FIG. 3).
  • In the web browser 202, accessing a predetermined element of the MMT-CI, acquiring and using the element are performed either according to the operation of a viewer or automatically based on the linked DOM tree. For example, displaying information relating to the presentation control of the MMT screen on the browser screen is performed based on the acquired element. In the web browser 202, for example, when a script for accessing a predetermined element of the MMT-CI is contained in the HTML document, acquiring the predetermined element of the MMT-CI is performed using the script. Note that, depending on the situation, the acquisition of a predetermined element of the HTML document is also performed.
  • FIG. 12 illustrates an example of the browser screen. The MMT screen is inserted into a partial region of the browser screen. The display region of the MMT screen is divided into two, video (video) and audio (audio) are displayed in one region, and image 1 (Image 1) is displayed in the other region. The browser screen displays “Display time of Image 2: 18:00” in the proximity of the display region of the MMT screen.
  • Specific elements of the MMT-CI and the HTML document (the value “18:00” which is the value of the display end time, and the textual data indicating “Display start time of image2”) that are acquired using a script contained in the HTML document of FIG. 9 described above are used, for example, for the display information. Based on this display, the viewer of the browser screen can ascertain, in advance, that image 1 (Image 1) that is displayed in the display region of the MMT screen will switch to image 2 (Image 2) when the time reaches 18:00.
  • As described above, in the display system 10 illustrated in FIG. 1, in the web browser 202 of the receiver 200, during the generation of the DOM tree of the HTML document, when a video element that references an MMT stream is present in the HTML document, the DOM tree that is generated according to the MMT-CI is linked beneath the node of the video element.
  • Therefore, for example, it is possible to access the elements of the MMT-CI from the HTML application side. For example, it is possible to display information relating to the presentation control of the MMT screen on the browser screen (the display screen of web page) based on predetermined elements of the MMT-CI that are acquired. Accordingly, the user (the viewer) can ascertain the information relating to the presentation control of the transport media presented on the browser screen.
  • 2. Modification Example
  • Note that, in the embodiment described above, an example is given in which a program (a script) for accessing a specific element of the MMT-CI is contained in an HTML document that is transmitted from the web server 130 (refer to FIG. 9).
  • However, a configuration can be conceived in which an HTML document that is transmitted from the web server 130 contains reference information for acquiring a program (a script) for accessing a specific element of the MMT-CI. In this case, the web browser 202 acquires a script file containing a program (a script) for accessing a specific element of the MMT-CI from the web server 130 based on the reference information.
  • FIG. 13A illustrates an example of an HTML document in this case. Here, ““src=”http://sample.mmt”” is reference information for acquiring a program (a script) for accessing a specific element of the MMT-CI. In addition, FIG. 13B illustrates an example of a script file containing the program (the script) for accessing a specific element of the MMT-CI in this case.
  • In addition, in the embodiment described above, an example is given in which the transport media stream is an MMT stream; however, the present technology can, naturally, be applied equally to a display system that handles a transport media stream that is similar to an MMT stream. In other words, in the embodiment described above, an example is given in which the second HTML document is an MMT-CI; however, for example, it is conceivable for this to be data having an HTML structure to which the data structure of MPEG2-TS is mapped.
  • In addition, in the embodiment described above, an example is given in which the streaming server 120 and the web server 130 are distinct from each other; however, a configuration in which these servers are formed of a single server is also conceivable.
  • In addition, in the embodiment described above, an example is given in which, as the receiver 200, the MMT decoding unit 205 and the MMT player 206 are present distinctly from the web browser 202. However, it is conceivable to provide a web browser 202 which is provided with the functions of the MMT decoding unit 205 and the MMT player 206. In this case, the MMT decoding unit 205 and the MMT player 206 are not necessary.
  • In addition, the present technology may adopt configurations such as the following.
  • (1) A reception device including a first reception unit which receives a first HTML document for displaying a web page; a second reception unit which receives a transport media stream containing a predetermined number of transport media and a second HTML document as presentation control information of the transport media; and a DOM tree generation unit which generates a DOM tree of the first HTML document that is received by the first reception unit, in which, when a video element that references the transport media stream is present in the first HTML document, the DOM tree generation unit links the DOM tree that is generated according to the second HTML document contained in the transport media stream beneath a node of the video element.
  • (2) The reception device according to (1) further including an element acquisition unit which acquires a predetermined element of the second HTML document based on the DOM tree that is generated by the DOM tree generation unit.
  • (3) The reception device according to (2) further including a display control unit which controls display of the web page based on the DOM tree that is generated by the DOM tree generation unit, in which the display control unit displays information relating to presentation control of the predetermined number of transport media on a display screen of the web page based on the predetermined element of the second HTML document that is acquired by the element acquisition unit.
  • (4) The reception device according to (2) or (3), in which a program for accessing a specific element of the second HTML document is contained in the first HTML document, and in which the element acquisition unit acquires the specific element of the second HTML document based on the program.
  • (5) The reception device according to (2) or (3), in which reference information for acquiring a program for accessing a specific element of the second HTML document is contained in the first HTML document, and in which the element acquisition unit acquires the specific element of the second HTML document based on the program that is acquired using the reference information.
  • (6) The reception device according to any one of (1) to (5), in which the transport media stream is a transport stream in which first transport packets containing a payload of the transport media, and second transport packets containing information relating to the transport media are time division multiplexed.
  • (7) The reception device according to (6), in which the transport packets are MMT packets, and in which the second HTML document is an MMT-CI.
  • (8) An information processing method in a reception device, including a first reception unit which receives a first HTML document for displaying a web page; and a second reception unit which receives a transport media stream containing a predetermined number of transport media and a second HTML document as presentation control information of the transport media, the method including a step of generating a DOM tree of the first HTML document that is received by the first reception unit; and a step of linking the DOM tree that is generated according to the second HTML document contained in the transport media stream beneath a node of a video element when the video element that references the transport media stream is present in the first HTML document.
  • (9) A transmission device including a holding portion which holds a first HTML document for displaying a web page which contains a video element that references a transport media stream containing a predetermined number of transport media and a second HTML document as presentation control information of the transport media, and which contains a program for accessing a specific element of the second HTML document or reference information for acquiring the program; and a transmission unit which transmits the first HTML document that is held.
  • (10) The transmission device according to (9), in which the transport media stream is a transport stream in which first transport packets containing a payload of the transport media, and second transport packets containing information relating to the transport media are time division multiplexed.
  • (11) The transmission device according to (10), in which the transport packets are MMT packets, and in which the second HTML document is an MMT-CI.
  • (12) An information processing device including a data acquisition unit which acquires a first HTML document for displaying a web page; and a DOM tree generation unit which parses the first HTML document that is acquired by the data acquisition unit and generates a DOM tree in which a plurality of elements are associated with each other, in which, when a video element that references a transport media stream containing a predetermined number of transport media and a second HTML document as presentation control information of the transport media is present in the first HTML document, the DOM tree generation unit links the DOM tree that is generated according to the second HTML document contained in the transport media stream beneath the video element.
  • (13) An information processing method including a data acquisition step of causing a web browser to acquire a first HTML document for displaying a web page; and a DOM tree generation step of causing the web browser to parse the first HTML document that is acquired and to generate a DOM tree in which a plurality of elements are associated with each other, in which in the DOM tree generation step, when a video element that references a transport media stream containing a predetermined number of transport media and a second HTML document as presentation control information of the transport media is present in the first HTML document, the DOM tree that is generated according to the second HTML document contained in the transport media stream is linked beneath the video element.
  • (14) An information processing device including a data acquisition unit which acquires first display control data containing a plurality of structural units that define information relating to display control; and a structured data generation unit which parses the first display control data that is acquired by the data acquisition unit and generates structured data in which the plurality of structural units are associated with each other, in which, when a predetermined structural unit that references a transport media stream containing a predetermined number of transport media and second display control data containing a plurality of the structural units that define presentation control information of the transport media is present in the first display control data, the structured data generation unit links the structured data that is generated according to the second display control data contained in the transport media stream beneath the predetermined structural unit.
  • (15) An information processing method including a data acquisition step of acquiring first display control data containing a plurality of structural units that define information relating to display control; and a structured data generation step of parsing the first display control data that is acquired and generating structured data in which the plurality of structural units are associated with each other, in which, in the structured data generation step, when a predetermined structural unit that references a transport media stream containing a predetermined number of transport media and second display control data containing a plurality of the structural units that define presentation control information of the transport media is present in the first display control data, the structured data that is generated according to the second display control data contained in the transport media stream is linked beneath the predetermined structural unit.
  • (16) A display system including a first transmission device which transmits a first HTML document for displaying a web page; a second transmission device which transmits a transport media stream containing a predetermined number of transport media and a second HTML document as presentation control information of the transport media; and a reception device which includes a first reception unit which receives the first HTML document that is transmitted from the first transmission device, and a second reception unit which receives the transport media stream which is transmitted from the second transmission device, in which the reception device includes a DOM tree generation unit which generates a DOM tree of the first HTML document that is received by the first reception unit, and a display control unit which controls display of the web page based on the DOM tree that is generated by the DOM tree generation unit, and in which, when a video element that references the transport media stream is present in the first HTML document, the DOM tree generation unit links the DOM tree that is generated according to the second HTML document contained in the transport media stream beneath a node of the video element.
  • The main characteristic of the present technology is to enable the access to elements of an MMT-CI from an HTML application side by, during the generation of a DOM tree of an HTML document (HTML application) for displaying a web page, when a video element that references the MMT stream is present, linking the DOM tree of the MMT-CI beneath a node of the video element (refer to FIG. 5).
  • REFERENCE SIGNS LIST
      • 10 DISPLAY SYSTEM
      • 110 BROADCASTING STATION
      • 120 STREAMING SERVER
      • 130 WEB SERVER
      • 200 RECEIVER
      • 201 IP RECEPTION UNIT
      • 202 WEB BROWSER
      • 202 a HTML PARSING UNIT
      • 202 b HTML DOM PROCESSING UNIT
      • 202 c HTML LAYOUT PROCESSING UNIT
      • 202 d HTML RENDERING PROCESS UNIT
      • 203 OUTPUT UNIT
      • 204 IP/RF RECEPTION UNIT
      • 205 MMT DECODING UNIT
      • 206 MMT PLAYER
      • 300 INTERNET

Claims (9)

1. A reception device, comprising:
a first reception unit which receives a first HTML document for displaying a web page;
a second reception unit which receives a transport media stream containing a predetermined number of transport media and a second HTML document as presentation control information of the transport media; and
a DOM tree generation unit which generates a DOM tree of the first HTML document that is received by the first reception unit,
wherein, when a video element that references the transport media stream is present in the first HTML document, the DOM tree generation unit links the DOM tree that is generated according to the second HTML document contained in the transport media stream beneath a node of the video element.
2. The reception device according to claim 1, further comprising:
an element acquisition unit which acquires a predetermined element of the second HTML document based on the DOM tree that is generated by the DOM tree generation unit.
3. The reception device according to claim 2, further comprising:
a display control unit which controls display of the web page based on the DOM tree that is generated by the DOM tree generation unit,
wherein the display control unit displays information relating to presentation control of the predetermined number of transport media on a display screen of the web page based on the predetermined element of the second HTML document that is acquired by the element acquisition unit.
4. The reception device according to claim 2,
wherein a program for accessing a specific element of the second HTML document is contained in the first HTML document, and
wherein the element acquisition unit acquires the specific element of the second HTML document based on the program.
5. The reception device according to claim 2,
wherein reference information for acquiring a program for accessing a specific element of the second HTML document is contained in the first HTML document, and
wherein the element acquisition unit acquires the specific element of the second HTML document based on the program that is acquired using the reference information.
6. The reception device according to claim 1,
wherein the transport media stream is a transport stream in which first transport packets containing a payload of the transport media, and second transport packets containing information relating to the transport media are time division multiplexed.
7. The reception device according to claim 6,
wherein the transport packets are MMT packets, and
wherein the second HTML document is an MMT-CI.
8. An information processing method in a reception device including a first reception unit which receives a first HTML document for displaying a web page; and a second reception unit which receives a transport media stream containing a predetermined number of transport media and a second HTML document as presentation control information of the transport media, the method comprising:
a step of generating a DOM tree of the first HTML document that is received by the first reception unit; and
a step of linking the DOM tree that is generated according to the second HTML document contained in the transport media stream beneath a node of a video element when the video element that references the transport media stream is present in the first HTML document.
9. An information processing device, comprising:
a data acquisition unit which acquires a first HTML document for displaying a web page; and
a DOM tree generation unit which parses the first HTML document that is acquired by the data acquisition unit and generates a DOM tree in which a plurality of elements are associated with each other,
wherein, when a video element that references a transport media stream containing a predetermined number of transport media and a second HTML document as presentation control information of the transport media is present in the first HTML document, the DOM tree generation unit links the DOM tree that is generated according to the second HTML document contained in the transport media stream beneath the video element.
US16/281,957 2013-04-26 2019-02-21 Reception device, information processing method in reception device, transmission device, information processing device, and information processing method Abandoned US20190286684A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US16/281,957 US20190286684A1 (en) 2013-04-26 2019-02-21 Reception device, information processing method in reception device, transmission device, information processing device, and information processing method

Applications Claiming Priority (5)

Application Number Priority Date Filing Date Title
JP2013093436A JP2014215859A (en) 2013-04-26 2013-04-26 Reception device, information processing method in reception device, transmission device, information processing device, and information processing method
JP2013-093436 2013-04-26
PCT/JP2014/060874 WO2014175148A1 (en) 2013-04-26 2014-04-16 Reception device, information processing method in reception device, transmission device, information processing device, and information processing method
US201514763689A 2015-07-27 2015-07-27
US16/281,957 US20190286684A1 (en) 2013-04-26 2019-02-21 Reception device, information processing method in reception device, transmission device, information processing device, and information processing method

Related Parent Applications (2)

Application Number Title Priority Date Filing Date
US14/763,689 Continuation US10262072B2 (en) 2013-04-26 2014-04-16 Reception device, information processing method in reception device, transmission device, information processing device, and information processing method
PCT/JP2014/060874 Continuation WO2014175148A1 (en) 2013-04-26 2014-04-16 Reception device, information processing method in reception device, transmission device, information processing device, and information processing method

Publications (1)

Publication Number Publication Date
US20190286684A1 true US20190286684A1 (en) 2019-09-19

Family

ID=51791727

Family Applications (2)

Application Number Title Priority Date Filing Date
US14/763,689 Active 2036-04-05 US10262072B2 (en) 2013-04-26 2014-04-16 Reception device, information processing method in reception device, transmission device, information processing device, and information processing method
US16/281,957 Abandoned US20190286684A1 (en) 2013-04-26 2019-02-21 Reception device, information processing method in reception device, transmission device, information processing device, and information processing method

Family Applications Before (1)

Application Number Title Priority Date Filing Date
US14/763,689 Active 2036-04-05 US10262072B2 (en) 2013-04-26 2014-04-16 Reception device, information processing method in reception device, transmission device, information processing device, and information processing method

Country Status (6)

Country Link
US (2) US10262072B2 (en)
EP (1) EP2990958A4 (en)
JP (1) JP2014215859A (en)
KR (1) KR20160002711A (en)
CN (1) CN105144146B (en)
WO (1) WO2014175148A1 (en)

Families Citing this family (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2014215859A (en) * 2013-04-26 2014-11-17 ソニー株式会社 Reception device, information processing method in reception device, transmission device, information processing device, and information processing method
US20170344523A1 (en) * 2016-05-25 2017-11-30 Samsung Electronics Co., Ltd Method and apparatus for presentation customization and interactivity
CN110545471B (en) * 2018-05-29 2021-12-14 北京字节跳动网络技术有限公司 Playing control method and device based on offline conversion and storage medium
CH715553A1 (en) * 2018-11-15 2020-05-15 Thidima Sa Method for generating content from a website in an extensible manner.
CN111327566B (en) * 2018-12-13 2023-05-02 阿里巴巴集团控股有限公司 Method, device and system for determining streaming media data
CN110837614A (en) * 2019-11-05 2020-02-25 上海嘉道信息技术有限公司 Method and system for efficiently generating webpage information extraction rule
US10922476B1 (en) * 2019-12-13 2021-02-16 Microsoft Technology Licensing, Llc Resource-efficient generation of visual layout information associated with network-accessible documents
US11461535B2 (en) * 2020-05-27 2022-10-04 Bank Of America Corporation Video buffering for interactive videos using a markup language
JP7372020B2 (en) * 2021-03-10 2023-10-31 株式会社Bloom Act Video generation and distribution processing device, video generation and distribution method, and video generation and distribution program

Family Cites Families (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6781609B1 (en) * 2000-05-09 2004-08-24 International Business Machines Corporation Technique for flexible inclusion of information items and various media types in a user interface
US7028331B2 (en) * 2001-02-28 2006-04-11 Sharp Laboratories, Inc. Content proxy method and apparatus for digital television environment
US7117216B2 (en) * 2001-06-07 2006-10-03 Sun Microsystems, Inc. Method and apparatus for runtime merging of hierarchical trees
KR20030095048A (en) * 2002-06-11 2003-12-18 엘지전자 주식회사 Multimedia refreshing method and apparatus
US20100306643A1 (en) * 2009-03-30 2010-12-02 Nokia Corporation Methods and Systems for Processing Document Object Models (DOM) to Process Video Content
JP2011065489A (en) 2009-09-17 2011-03-31 Sony Corp Information processing apparatus, data display method, and program
US8713420B2 (en) * 2011-06-30 2014-04-29 Cable Television Laboratories, Inc. Synchronization of web applications and media
US8913068B1 (en) * 2011-07-12 2014-12-16 Google Inc. Displaying video on a browser
WO2013055164A1 (en) * 2011-10-13 2013-04-18 삼성전자 주식회사 Method for displaying contents, method for synchronizing contents, and method and device for displaying broadcast contents
US9635394B2 (en) * 2013-01-24 2017-04-25 Electronics And Telecommunications Research Institute Method and device for flexible MMT asset transmission and reception
JP2014215859A (en) * 2013-04-26 2014-11-17 ソニー株式会社 Reception device, information processing method in reception device, transmission device, information processing device, and information processing method
EP2866436A1 (en) * 2013-10-23 2015-04-29 Thomson Licensing Method and apparatus for transmission and reception of media data

Also Published As

Publication number Publication date
CN105144146A (en) 2015-12-09
CN105144146B (en) 2018-06-05
WO2014175148A1 (en) 2014-10-30
US20150363505A1 (en) 2015-12-17
EP2990958A1 (en) 2016-03-02
JP2014215859A (en) 2014-11-17
EP2990958A4 (en) 2017-03-01
KR20160002711A (en) 2016-01-08
US10262072B2 (en) 2019-04-16

Similar Documents

Publication Publication Date Title
US20190286684A1 (en) Reception device, information processing method in reception device, transmission device, information processing device, and information processing method
US7076478B2 (en) Wrapper playlists on streaming media services
US9584837B2 (en) Receiving device and method of controlling the same, distribution device and distribution method, program, and distribution system
AU2010294783B2 (en) Method and device for providing complementary information
US11496585B2 (en) Browser navigation for facilitating data access
JP7181308B2 (en) MEDIA PLAYBACK LOADING CONTROL METHOD, APPARATUS AND STORAGE MEDIUM
US9898443B2 (en) Method and system for webpage processing
JP5414792B2 (en) Method and apparatus for providing rich media service
JP2020184784A (en) Receiver and program
EP3007450B1 (en) Transmission device, transmission method, receiving device, and receiving method
KR101621092B1 (en) Device and method for scene presentation of structured information
KR101958662B1 (en) Method and Apparatus for sharing java script object in webpage
CN110545460B (en) Media file preloading method and device and storage medium
US20140181882A1 (en) Method for transmitting metadata documents associated with a video
CN110545480A (en) Preloading control method and device of media file and storage medium
JP2010148141A (en) Broadcast reception terminal
KR101008181B1 (en) System and method for contents rendering for iptv
Concolato et al. In-Browser XML Document Streaming
KR20210025508A (en) Apparatus and method for transmitting broadcasting content based on atsc 3.0, and apparatus and method for receiving broadcasting content based on atsc 3.0
JP2006197353A (en) Broadcast reception terminal
Thomas-Kerr et al. Format-independent multimedia streaming
JP2006197354A (en) Broadcast reception system

Legal Events

Date Code Title Description
STPP Information on status: patent application and granting procedure in general

Free format text: NON FINAL ACTION MAILED

STCB Information on status: application discontinuation

Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION

AS Assignment

Owner name: SATURN LICENSING LLC, NEW YORK

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:SONY CORPORATION;REEL/FRAME:052879/0780

Effective date: 20190911