WO2005006752A1 - Appareil et procede d'emission/reception d'informations vocales de guide de programme electronique - Google Patents

Appareil et procede d'emission/reception d'informations vocales de guide de programme electronique Download PDF

Info

Publication number
WO2005006752A1
WO2005006752A1 PCT/KR2003/002923 KR0302923W WO2005006752A1 WO 2005006752 A1 WO2005006752 A1 WO 2005006752A1 KR 0302923 W KR0302923 W KR 0302923W WO 2005006752 A1 WO2005006752 A1 WO 2005006752A1
Authority
WO
WIPO (PCT)
Prior art keywords
epg
voice
data
document
information
Prior art date
Application number
PCT/KR2003/002923
Other languages
English (en)
Inventor
Bong-Ho Lee
So-Ra Park
Woo-Suk Kim
Young-Ho Jeong
Jin-Hwan Lee
Young-Kwon Hahm
Soo-In Lee
Original Assignee
Electronics And Telecommunications Research Institute
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Electronics And Telecommunications Research Institute filed Critical Electronics And Telecommunications Research Institute
Priority to AU2003289589A priority Critical patent/AU2003289589A1/en
Priority to EP03781082A priority patent/EP1649691A4/fr
Publication of WO2005006752A1 publication Critical patent/WO2005006752A1/fr

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/80Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N21/85Assembly of content; Generation of multimedia applications
    • H04N21/854Content authoring
    • H04N21/8543Content authoring using a description language, e.g. Multimedia and Hypermedia information coding Expert Group [MHEG], eXtensible Markup Language [XML]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00Television systems
    • H04N7/08Systems for the simultaneous or sequential transmission of more than one television signal, e.g. additional information signals, the signals occupying wholly or partially the same frequency band, e.g. by time division
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/233Processing of audio elementary streams
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/236Assembling of a multiplex stream, e.g. transport stream, by combining a video stream with other content or additional data, e.g. inserting a URL [Uniform Resource Locator] into a video stream, multiplexing software data into a video stream; Remultiplexing of multiplex streams; Insertion of stuffing bits into the multiplex stream, e.g. to obtain a constant bit-rate; Assembling of a packetised elementary stream
    • H04N21/2362Generation or processing of Service Information [SI]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/25Management operations performed by the server for facilitating the content distribution or administrating data related to end-users or client devices, e.g. end-user or client device authentication, learning user preferences for recommending movies
    • H04N21/266Channel or content management, e.g. generation and management of keys and entitlement messages in a conditional access system, merging a VOD unicast channel into a multicast channel
    • H04N21/2665Gathering content from different sources, e.g. Internet and satellite
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/434Disassembling of a multiplex stream, e.g. demultiplexing audio and video streams, extraction of additional data from a video stream; Remultiplexing of multiplex streams; Extraction or processing of SI; Disassembling of packetised elementary stream
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/434Disassembling of a multiplex stream, e.g. demultiplexing audio and video streams, extraction of additional data from a video stream; Remultiplexing of multiplex streams; Extraction or processing of SI; Disassembling of packetised elementary stream
    • H04N21/4345Extraction or processing of SI, e.g. extracting service information from an MPEG stream
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/439Processing of audio elementary streams
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47End-user applications
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47End-user applications
    • H04N21/482End-user interface for program selection

Definitions

  • the present invention relates to an apparatus for transmitting/receiving voice electronics program guide information and a method thereof; and, more particularly, to a voice electronic program guide information transmitting/receiving apparatus that provides Electronics Program Guide (EPG) information in voice or in voice and graphics .
  • EPG Electronics Program Guide
  • DMB Digital Audio Broadcasting
  • EPG Electronics Program Guide
  • EPG is provided in the form of graphics and/or speech by receiving and processing service information (SI) of each broadcasting system which is transmitted through a broadcasting network in a receiver.
  • SI service information
  • the conventional EPG providing method has a limitation in providing useful program information to users because each broadcasting terminal has its own EPG processing method and EPG expression engine and the information provided by the SI is limited.
  • the received information is made up into sentences simply and then provided in a Text-To- Speech (TTS) method.
  • TTS Text-To- Speech
  • the conventional EPG providing method requires a help of an agent or an expert system to provide intelligent program guide. This make the receiver create voice sentences dynamically, thus bringing burden to the operation of the receiver and causing a problem in stability.
  • XML EPG Language
  • a terrestrial DAB EPG XML specification has advantages that it can provide the user with information more useful than a method that processes SI in a terminal directly and that it can provide an ' environment standardized according to a service provider or a terminal manufacturing company by using the XML.
  • EPG information is to be provided in voice by using the method, an additional device is required necessarily to provide the guide information in voice to the users because there is no scenario for providing the guide information in voice. Therefore, a method that can provide the EPG in voice without any additional voice guide scenario providing device is required in the current area of EPG technology for digital broadcasting program viewers in motion or driving and the visually handicapped persons.
  • an object of the present invention to provide a voice electronics program guide (EPG) information transmitting/receiving apparatus that can reduce the processing burden loaded on a receiver and provide EPG information in voice (or both voice and graphics) that sounds familiar to users by making up a voice EPG document, i.e., a content, in a transmitting system of a broadcasting station in advance, and to provide a method thereof.
  • EPG voice electronics program guide
  • an apparatus for transmitting EPG information in voice including: a voice EPG document authoring module for authoring a voice EPG document by receiving program information from other networks or other service providers; a data encoding module for converting data according to a broadcasting protocol to transmit file and directory objects corresponding to the voice EPG document, which is passed from the voice EPG document authoring module, through a broadcasting network periodically; a data multiplexing module for multiplexing voice EPG data obtained through the data conversion in the data encoding unit with other program data inputted from other networks or other service providers; and a transmitting module for transmitting data obtained from the multiplexing in the data multiplexing unit.
  • an apparatus for receiving EPG information in voice including: a receiving module for receiving broadcasting signals which include voice EPG data and other program data; a data demultiplexing module for • extracting desired data related to voice EPG by demultiplexing multiplexed program data delivered from the receiving module; a data decoding module for restoring an original voice EPG document from the data related to voice EPG, which are passed from the data demultiplexing module; and a voice EPG information providing module for providing the user with EPG information in voice by executing files related to voice EPG, which are passed from the data decoding module.
  • a method for transmitting voice EPG information to provide the EPG information in voice including the steps of: a) authoring a voice EPG document by receiving program information from other networks or other service providers in a voice EPG document authoring module; b) converting data according to a broadcasting protocol in a data encoding module to transmit file and directory objects corresponding to the voice EPG- document, which is passed from the voice EPG document authoring module, through a broadcasting network periodically; c) multiplexing the voice EPG data obtained through the data conversion in the data encoding module with other program data inputted from other networks or other service providers in a data multiplexing module; and d) transmitting data obtained from the data multiplexing module in a transmitting module.
  • a method for receiving voice EPG to provide the EPG in voice comprising the steps of: a) receiving broadcasting signals which include voice EPG data and other program data; b) extracting data related to desired voice EPG by demultiplexing multiplexed program data provided from a receiving unit in a data demultiplexing module; c) restoring an original voice EPG document based on the data related to voice EPG, which are provided from the data demultiplexing module in a data decoding module; and. d) providing the user with EPG information in voice by executing files related to voice EPG, which are transmitted from the data decoding module in a voice EPG platform.
  • Fig. 1 is a block diagram illustrating voice electronics program guide (EPG) information transmitting/ receiving apparatus in accordance with an embodiment of the present invention
  • Fig. 2 is a block diagram describing a voice EPG document authoring module 11 of a voice EPG information transmitting apparatus in accordance with an embodiment of the present invention
  • Fig. 3 is a diagram showing a hierarchical structure of a terrestrial Digital Multimedia Broadcasting (DMB) voice EPG application document that can be authored in a scenario template forming block 22 in accordance with an embodiment of the present invention
  • DMB Digital Multimedia Broadcasting
  • FIG. 4 is a diagram depicting a VoiceEPG XML application document forming block 23 of the voice EPG document authoring block 11 in accordance with an embodiment of the present invention
  • Fig. 5 is a flowchart describing a voice EPG document authoring method in accordance with an embodiment of the present invention
  • Fig. ⁇ is a diagram showing a dialog structure of a voice EPG application document produced in the voice EPG data transmitter side in accordance with an embodiment of the present invention
  • Fig. 7 is a block diagram illustrating a voice EPG platform of the voice EPG information receiving apparatus in accordance with an embodiment of the present invention
  • Fig. 8 presents an example of service guide scenario of the voice EPG information receiving apparatus in accordance with an embodiment of the present invention.
  • Fig. 1 is a block diagram illustrating voice electronics program guide (EPG) information transmitting/ receiving apparatus in accordance with an embodiment of the present invention.
  • the voice EPG information transmitting apparatus of the present invention includes a voice EPG document authoring module 11, a data encoding module 12, a data multiplexing module 13, and a transmitting module 14.
  • the voice EPG document authoring portion 11 makes up a voice EPG document based on program information inputted from a network or by an operator to make up a voice EPG document to be transmitted through a digital broadcasting system.
  • the data encoding module 12 converts data based on a corresponding broadcasting network protocol to transmit file and directory objects corresponding to the voice EPG document, which is transmitted from the voice EPG document authoring module 11, periodically through the broadcasting network.
  • the data multiplexing module 13 receives voice EPG data which are obtained by converting the data in the data encoding module '12 and multiplexes them with other program data inputted from the other networks or other service providers.
  • the transmitting module 14 transmits data obtained from the multiplexing in the data multiplexing module 13.
  • the voice EPG information receiving apparatus of the present invention includes a receiving module 15, a data demultiplexing module 16, a data decoding module 17, and a voice EPG platform 18.
  • the receiving module 15 receives radio frequency signals transmitted through the broadcasting network-, that is, broadcasting signals including voice EPG data and other program data.
  • the data demultiplexing module 16 demultiplexes multiplexed program data transmitted from the receiving module 15 and separates desired data related to the voice EPG.
  • the data decoding module 17 restores the original voice EPG document based on the data related to the voice EPG which are passed from the data demultiplexing module 16.
  • the voice EPG platform 18 provides user with EPG information in voice or both in voice and graphics by executing files related to voice EPG passed from the data decoding module 17.
  • the voice EPG document authoring module 11 of the present invention will be described later on with reference to Fig. 2.
  • the data .encoding module 12 encodes voice EPG documents.
  • the encoding method used here should be suitable to the type of the digital broadcasting system. If data are to be transmitted through a digital video broadcasting (DVB) network, which is one of digital video broadcasting system, the file and directory objects corresponding to the voice EPG documents are encoded in accordance with Object Carousel protocol. If data are to be transmitted through an Advanced Television System Committee (ATSC) broadcasting network, they are encoded based on Data Carousel protocol. If data are to be transmitted through a terrestrial Digital Audio Broadcasting (DAB) - network or a Digital Multimedia Broadcasting (DMB) network, they are encoded based on Multimedia Object Transfer (MOT) protocol respectively.
  • DAB Digital Audio Broadcasting
  • DMB Digital Multimedia Broadcasting
  • the data encoded in the data encoding module 12 can be transmitted out repeatedly based on Carousel protocol.
  • the voice EPG documents can be converted into an Internet Protocol (IP) and transmitted based on the protocol of each broadcasting system.
  • IP Internet Protocol
  • the DVB and ASTC they can be encapsulated and transmitted as IP data based on Multiprotocol Encapsulation (MPE) .
  • MPE Multiprotocol Encapsulation
  • the data multiplexing module 13 receives the voice EPG data obtained from, the conversion in the data encoding module 12 and multiplexes them with other program data inputted from the other networks or other service providers .
  • the data multiplexing module 13 produces multiplex configuration information (MCI) and service information (SI) by receiving program information for the programs from- the network or a user and multiplexes them with main service program data.
  • MCI multiplex configuration information
  • SI service information
  • an MPEG-2 multiplexer and/or remultiplexer can be used as the data multiplexing module 13.
  • a digital audio broadcasting packet multiplexer or an ensemble multiplexer can be used as the data multiplexing module 13.
  • the transmitting module 14 transmits the data from the data multiplexing module 13 in the form of broadcasting signals.
  • the receiving module 15 receives the broadcasting signals that include the voice EPG data and other program data. That is, the receiving module 15 comprises a tuner, a demodulator and a channel decoder to process baseband signals and receives radio frequency signals (broadcasting signals) transmitted through the broadcasting network.
  • the data demultiplexing module 16 separates data related to desired voice EPG by demultiplexing the multiplexed program data passed from the receiving module 15.
  • the data decoding module 17 receives streams or packet data that form the voice EPG from the data demultiplexing module 16 and restores the voice EPG document through decoding process.
  • the restored document is a document that actually provides voice EPG service, that is, it is a file.
  • a data carousel decoder or an object carousel decoder corresponds to the data decoding module 17.
  • DSM-CC digital storage media command and control
  • This function is embodied by a Multimedia Object Transfer (MOT) decoder in the digital audio broadcasting multimedia broadcasting system and it restores the directories and files that correspond to the voice EPG object after decoding multimedia object transmission (MOT) section.
  • MOT Multimedia Object Transfer
  • Fig. 2 is a block diagram describing a voice EPG document authoring module 11 of a voice EPG information transmitting apparatus in accordance with an embodiment of the present invention.
  • the voice EPG document authoring module 11 of the present invention includes an EPG data receiving block 21, a scenario template forming block 22, a voice EPG XML application document forming block 23, and a document transmitting block 24.
  • the EPG data receiving block 21 receives EPG data related to a digital or Internet broadcast program scheduling list information.
  • the scenario template forming block 22 defines a scenario template by defining a method of creating a document, a method of forming information of a document, a method of linking documents or dialogs, and the type of user interface to create a ' voice EPG application document.
  • the VoiceEPG XML application document forming block 23 receives EPG data, analyzes the EPG data, and creates a voice EPG application document in an extensive markup language (XML) , which defines a voice EPG scenario, based on a scenario template.
  • the document transmitting block 24 transmits the voice EPG application document through a broadcasting network or a communication network.
  • VoiceEPG XML a markup language for defining a voice EPG scenario
  • the VoiceEPG XML is a markup language contrived to generate a dialog-based scenario which is characterized by an interaction, i.e., dialog, between a system and a user to provided EPG information in voice.
  • a voice EPG scenario markup language created by adding elements for controlling a receiver, such as ⁇ tuning>, ⁇ watch>, and ⁇ reservation> to VoiceXML 2.0 contrived to describe a scenario and operated that are required to provide program guide information (functions and elements related to call are not used) .
  • a Voice EPG application can be composed of one or more documents and each of the documents also can contain one or more dialogs according to a pre-defined scenario. Dialogs have two types: ⁇ form> and ⁇ menu> .
  • the dialog ⁇ form> is a core structural element of the VoiceEPG XML. It prompts program guide information to the user and conversely responds to the requests from the users.
  • the dialog ⁇ menu> provides various options to the users and provides a transition to another dialog or document based on the option.
  • the user inputs of the VoiceEPG XML include voice input and Dual Tone Multifrequency (DTMF) input, and the VoiceEPG XML has a voice grammar and DTMF grammar.
  • the system outputs of the VoiceEPG XML include synthesized speech and pre-recorded audio and a dialog ⁇ prompt> element is used herein.
  • the VoiceEPG XML includes following elements to control a receiver. The receiver is controlled through a receiver control module situated in the lower part of an interpreter as a component when a user want tuning into another band or ensemble (in a case of DAB or DMB) or when the user want to watch or reserve a specific service or program.
  • the receiver control includes an element ⁇ tuning> for tuning into another band or ensemble, an element ⁇ watch> for selecting a service or a program, and an element ⁇ reservation> for making a reservation.
  • the receiver controlling element ⁇ tuning> for moving into another band or ensemble has following characteristics .
  • the ⁇ tuning> element is positioned only in ⁇ filled> or ⁇ block> within a dialog ⁇ form> element which defines program guide information on the tuned ensemble.
  • An identifier and frequency that are passed to the receiver control module through the ⁇ tuning> element can be specified in the attributes of the ⁇ tuning> element.
  • That information can be passed indirectly to the receiver control module just by giving a relevant information of external EPG data, such as DMB EPG XML, which -describes the identifier and frequency information.
  • external EPG data such as DMB EPG XML
  • the identifier and frequency attributes are to be defined.
  • the attributes are to be defined respectively according to each digital broadcasting system protocol.
  • the required identifier and frequency information can be provided indirectly by providing only the relevant information in connection with external data.
  • a terrestrial DMB EPG XML is used, what needs to be provided is information on the type of the elements of EPG XML and the identifiers of the elements.
  • the receiver controlling element ⁇ watch> gives the identifier of an item (herein, an item is a program or a service component) to the receiver control module when the user wants to see a service (including service components) or a program.
  • the element ⁇ watch> has following characteristics.
  • the ⁇ watch> element is positioned only in ⁇ filled> or ⁇ block> within the dialog ⁇ form> element which presents service and program guide information.
  • An identifier of an item that is transmitted to the receiver control module through the ⁇ watch> element can be defined in a form of attributes of ⁇ watch> element. That information can be passed indirectly to the receiver control module by giving a relevant information of external EPG data, such as DMB EPG XML, which describes the identifier and frequency information.
  • the ⁇ watch> element has the identifier of that item as its attributes.
  • the attributes of ⁇ watch> element are defined respectively according to each digital broadcasting system protocol. For example, in case where a program is received optionally in a terrestrial DMB system, " ⁇ Ecc>, ⁇ Eid>, and ⁇ Sid>" are provided to the receiver control module. Also, if the information is to be provided indirectly, only the relevant linking information with the external EPG data is specified just as clearly as the ⁇ tuning> element.
  • the receiver controlling element ⁇ reservation> passes an identifier of an item and time information to the receiver control module when the user wants to reserve a service (including service component) or a program.
  • the ⁇ reservation> element has following characteristics.
  • the ⁇ reservation> element is positioned only in ⁇ filled> or ⁇ block> within the dialog ⁇ form> which presents service and program guide information.
  • the identifier of an item and the time information, which are delivered to the receiver control module through the ⁇ reservation> element can be specified clearly in the form of attributes of ⁇ reservation> element.
  • ⁇ reservation> element can have the identifier of the item and the time information as its attributes.
  • the attributes of the ⁇ reservation> are defined respectively according to each digital broadcasting system protocol.
  • the EPG data receiving block 21 receives EPG data provided from the other networks or other service providers.
  • the EPG data are a list of program schedule that is currently on air and to be broadcasted. They are specified from the System Information (SI) of each broadcasting system or they can be expressed in another data form obtained by the filtering and analyzing process of the SI.
  • SI System Information
  • the scenario template forming block 22 defines a template to create a voice EPG application document. That is, it defines a styling of a document, a styling information of each document, a linking method of documents or dialogs, and the type of user interface.
  • the scenario template forming block 22 can provide a varity of forms of templates depending on the special usage and purpose for the same data.
  • a hierarchical structure of a terrestrial DMB voice EPG application document that can be formed through the scenario template forming block 22 is described hereafter. As shown in Fig.
  • the terrestrial DMB voice EPG application having an ensemble document as a top-level document, a schedule document for providing time base program information, a service document for providing each service base program information, a search document for searching a service or a program demanded by a user, and a preference document for providing user preference program guide information.
  • the ensemble document provides guide information for a low-level item (herein, the item is a document or a dialog) using one or more dialogs, responds to the user request for transiting to other items, and then eventually provides a link to the item.
  • each document is formed by classifying programs based on time slot (generally, one slot is four times) and by specifying the links to transit to other documents or dialogs defined therein.
  • the service document provides users with service base program information scheduled in a current time slot. It specifies the program information relevant to a specific service.
  • the search document provides a search mean to the user for a particular service or a program by using the method of speech input. It includes programs classified into genre or particular program groups.
  • the preference document accumulates and classifies the user preference programs or services and then let the user know previously or periodically.
  • the VoiceEPG XML application document forming block 23 receives and analyzes EPG data from the EPG data receiving block 21 and forms a voice EPG application document by using dialogs (including forms and menus) defined in the VoiceEPG XML and other elements based on a scenario template provided in the scenario template forming block 22.
  • the VoiceEPG XML application document forming block 23 includes a template analyzing unit 41, an EPG data analyzing unit 42, a controlling and managing unit 43, an EPG data converting unit 44, and a validity examining unit 45.
  • the EPG data analyzing unit 42 analyzes the type of EPG data and protocol turned in from the EPG data receiving block 21.
  • the template analyzing unit 41 analyzes a template inputted from the scenario template forming block 22.
  • the controlling and managing unit 43 receives requests from the user and performs controlling and managing during the formation of the voice EPG application document.
  • the EPG data converting unit 44 receives the analyzed EPG information and the analyzed template information from the EPG data analyzing unit 42 and the template analyzing unit 41 under the control/management of the controlling and managing unit 43 and forms the voice EPG application document based on the VoiceEPG XML.
  • the validity examining unit 45 examines a document whether the document is well formed in conformity with the data type defined in the VoiceEPG XML.
  • the EPG data analyzing unit 42 analyzes the EPG data handed over from the EPG information receiving apparatus 10, such as data type and protocol, to make a database and sends the analysis result to the EPG data converting unit 44.
  • the template analyzing unit 41 analyzes a template inputted from the scenario template forming block 22 and then sends the analyzed template information to the EPG data converting unit 44 or displays the analyzed template information on the. monitor for the sake of convenience in the creation of the document.
  • the controlling and managing unit 43 receives the requests from the user and performs controlling and managing during the creation of a voice EPG application document .
  • the EPG data converting unit 44 receives the analyzed EPG information and the analyzed template information and creates a document in the VoiceEPG XML based on a scenario of the analyzed template.
  • the validity examining unit 45 checks out the document validity in conformity with the data type defined in the VocieEPG XML and, if necessary, generates events.
  • the document transmitting block 24 is a server that stores and transmits documents that are drawn up. For example, if the documents are to be transmitted through a terrestrial DMB network, the stored documents, i.e., files, are delivered to an MOT carousel server and, at the same time, if they are to be transmitted through a DMB interactive network, the documents are transmitted to a DMB receiver or a communication terminal through a mobile communication network or a terrestrial return network.
  • the documents that need to be transmitted through an interactive network are those demanded by particular users among the voice EPG documents.
  • transmitting document through the interactive network makes it possible to provide customized or personalized EPG • service. Fig.
  • EPG data i.e., information on the programs currently on air or to be broadcasted later
  • EPG data receiving block 21 i.e., information on the programs currently on air or to be broadcasted later
  • a template for creating a voice EPG application document is created in the scenario template forming block 22. While the template is formed, a method of forming a document, a method of forming information for each document, a method of connecting documents or dialogs, and a user interface type are defined.
  • the VoiceEPG XML application document forming block 23 analyzes the type of EPG data delivered from the EPG data receiving block 21 in the EPG data analyzing unit 42, and analyzes the template inputted from the scenario template forming unit 22 in the template analyzing unit 41 to draw up a voice EPG scenario, i.e., a voice EPG application document, based on the VoiceEPG XML. Then, at step S504, the analyzed EPG information and the interpreted template information are inputted from the EPG data analyzing unit 42 and the template interpreting unit 41 and the voice EPG application document is drawn up by using the VoiceEPG XML based on the scenario of the template in the EPG data converting unit 44.
  • Fig. 6 is a diagram showing a dialog structure of a document made in the voice EPG information transmitting portion in accordance with an embodiment of the present , invention.
  • the voice EPG document of the present invention includes a top-level document 61 necessarily.
  • the top-level document 61 a document which should be executed first and * foremost when a voice EPG browser of the voice EPG platform 18 is operated, includes global variables and codes shared by other documents.
  • the top-level document 61 includes a plurality of sub- documents 62.
  • the VoiceEPG XML application document forming block 23 classifies the EPG information as upper- level information and lower-level information and creates the top-level document and the sub-documents based on the classified information.
  • the sub-documents 62 have many forms according to how they are created. For example, the sub-documents may be classified by genre.
  • Each of the sub-documents 62 is composed of one or more dialogs 63.
  • the dialog 63 means a finite state generated constantly at an arbitrary time for the communication between 'the user and the voice EPG platform. Each dialog is executed by another dialog and each dialog 63 is formed of a combination of sub-dialogs 63 and executed.
  • a sub-dialog 64 is a definite dialog.
  • Fig. 7 is a block diagram illustrating a voice EPG platform 18 of the voice EPG information receiving apparatus in accordance with an embodiment of the present invention.
  • the voice EPG platform 18 provides the user with EPG guide information as speech and receives user speech requests and it has a structure similar to a conventional web browser. However, unlike the conventional web browser, it can receive the user requests by using voice or other input devices and output the EPG information as speech.
  • the voice EPG platform 18 can provide EPG information which is written in texts in a document in the form of speech by using Text-To-Speech (TTS) technology capable of converting text into speech.
  • TTS Text-To-Speech
  • the voice EPG platform 18 can play a pre recorded audio file rather than TTS technology.
  • the voice EPG platform 18 has following general functions.
  • the voice EPG platform 18 is executed and terminated by the user.
  • program information is provided by speech in the form of interactive dialogs.
  • the voice EPG platform has an initiative of that session.
  • the term "initiative" means that the start and end of a session is flowed according to a predetermined order and a predetermined dialog sequence.
  • the voice EPG platform 18 outputs ' the program information as speech and simultaneously has graphical presentation on a display screen.
  • the voice EPG platform 18 can tune the program automatically.
  • the voice EPG platform 18 includes a document storing unit 71, an operation and management unit 72, a user request processing unit 74, a user preference processing unit 75, a receiver controlling unit 76, a graphic output unit 77, a speech output unit 78, and a voice EPG interpreting unit 73.
  • the document storing unit 71 stores files related to voice EPG delivered from the data decoding module 17 or an external device.
  • the operation and management unit 72 controls and manages the voice EPG interpreting unit 73.
  • the user request processing unit 74 receives user requests through a variety of input devices, processes them based on a function of voice recognition and a function of request extraction, and transmits them to the voice EPG interpreting unit 73 to provide the user with the EPG information as speech or both speech and graphics.
  • the user preference processing unit 75 searches a program preferred by the user automatically and guide the user to the program by receiving the user requests from the user request processing unit 74, collecting and analyzing information on the programs the user watches for a predetermined period, and transmitting the result to the voice EPG interpreting unit 73.
  • the receiver controlling unit 76 controls an actual digital broadcasting receiver to tune in to the program selected by the user under the control of the voice EPG interpreting unit 73 based on the information of the user request processing unit 74 or the user preference processing unit 75.
  • the graphic output unit 77 provides the user with the EPG information as speech through the speech output unit 78 and, at the same time, it receives the EPG information from the voice EPG interpreting unit 73 and outputs it in graphics.
  • the speech output unit 78 receives the EPG information from the voice EPG interpreting unit 73 and outputs it as speech.
  • the voice EPG interpreting unit 73 interprets files (markup language) related to EPG under the control of the operation and management unit 72, controls the receiver controlling unit 76 by receiving user requests from the user request processing unit 74 and user preference information from the user preference processing unit 75, and transmits EPG information to the speech output unit 78 and the graphic output unit 77.
  • the document storing unit 71 is a hard disk or detachable storing medium, such as flash memory and micro drive and the like. It stores the files related to voice EPG documents from the data decoding module or an external device. That is, the document storing unit 71 stores files corresponding to the voice EPG documents transmitted directly from a broadcasting network and stores documents and files from an external device.
  • the operation and management unit 72 controls and manages the voice EPG interpreting unit 73. To be specific, it initializes and monitors the voice EPG interpreting unit 73 and defines an initial document or manages environmental variables and system option setup.
  • the voice EPG interpreting unit 73 has a function similar to a conventional web browser interpreter. It interprets a voice EPG markup language and operates as a main control loop. Particularly, it analyzes the structure of the voice EPG document by using a built-in XML parser, interprets the content of the document, and executes a control structure according to the content.
  • the voice EPG interpreting unit 73 controls the general execution of the voice EPG document such as loading resources from the data decoding portion 17 or transiting to another document, controls voice input/output of the voice platform, processes various types of events generated from the voice platform, and performs general control related to transition to a new working document.
  • the user request processing unit 74 receives requests related to voice EPG from the user through diverse input devices, such as a key board, mouse, speech input -device, touch screen and touch pad, and then analyzes the user requests using the voice recognition engine or command analyzer, and finally sends the result to the voice EPG interpreting unit 73.
  • the voice recognizer includes an additional voice recognition engine and it receives the requests from the user as speech, analyzes the requests by extracting key words, and sends the result to the voice EPG interpreting unit 73.
  • the speech output unit 78 includes a speech synthesis engine and an audio file player and it provides EPG information as speech by using the speech synthesis engine and the audio file player.
  • the voice synthesis engine outputs EPG sentences and dialog sentences in the EPG document to the user as speech, and the audio file player plays a pre-recorded audio file.
  • the graphic output unit 77 outputs EPG information to the user graphically along with the speech content, which is outputted from the speech output unit) , and it has the same function as a conventional graphic device.
  • the user preference processing unit 75 receives user requests from the user request processing unit 74, collects and analyzes programs are preferred by the user for a predetermined period to obtain user preference, and transmits the user preference information to the voice EPG interpreting unit 73.
  • the voice EPG interpreting unit 73 searches the programs preferred by the user automatically and, if exists, narrates the programs information to the user. In order for the voice EPG interpreting unit 73 to search the programs preferred by the user and guide the user to the preferred programs, programs should be classified and drawn up in advance based on genre or characteristics.
  • the receiver controlling unit 76 receives information (or preferred channel information) on a channel (service or a program) selected by the user through the voice EPG interpreting unit 73 and practically controls the digital broadcasting receiver to tune the channel.
  • the provided channel information a physical or logical channel that should be selected by the digital broadcasting receiver, is an identifier of a channel or frequency information to be tuned or be selected.
  • Fig. 8 presents a service guide scenario of the voice EPG information receiving apparatus in accordance with an embodiment of the present invention. More particularly, it provides an example of an interface between the voice EPG platform and the user. As shown in Fig. 8, a message "the current ensemble provides four services. Do you want sequential program information, or genre? If you have any favorite program, please say .your program” is outputted in the voice EPG platform. Accordingly, if the user selects a "sports channel," a message "a sports channel is requested.
  • the present program has begun at 10 and will end at 1.
  • the next match of the soccer channel is scheduled at 2 p.m. tomorrow and the match is between Spain teams: R. Madrid vs. Barcelona. Do you want to watch the channel, or move to an upper-level menu? Please, input "watch” to watch the channel or "move” to move to upper-level menu.” If the user inputs "watch,” the voice EPG platform outputs a message "going to the requested channel. For EPG, input EPG or press EPG button. Hope you will enjoy the program. This is the end of EPG" and ends the EPG.
  • the technology of the present invention can provide the user in walking or driving with stability and convenience as well as constant mobility by making the transmitting part create the voice EPG document in advance and transmit it, and making the receiving part receive the voice EPG document and provide EPG information in voice.
  • the present invention can overcome the shortcoming that the graphic EPG is meaningless to the visually handicapped or illiterate people and it has an excellent effect that meaningful EPG information can be transmitted in voice.
  • the present invention can provide the user with a specific EPG document through an interactive network so it can provide a customized EPG service. While the present invention has been described with respect to certain preferred embodiments, it will be apparent to those skilled in the art that various changes and modifications may be made without departing from the scope of the invention as defined in the following claims.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • Acoustics & Sound (AREA)
  • Astronomy & Astrophysics (AREA)
  • General Physics & Mathematics (AREA)
  • Databases & Information Systems (AREA)
  • Computer Security & Cryptography (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)

Abstract

L'invention concerne un appareil et un procédé d'émission/réception d'informations vocales de guide de programme électronique (EPG). L'appareil d'émission/réception d'informations vocales EPG réduit la charge d'un récepteur par création d'un document EPG vocal formant un contenu en avance dans un système de station de radiodiffusion, et transmet les informations EPG aux utilisateurs par une voix familière ou sous forme de données vocales et graphiques.
PCT/KR2003/002923 2003-07-11 2003-12-31 Appareil et procede d'emission/reception d'informations vocales de guide de programme electronique WO2005006752A1 (fr)

Priority Applications (2)

Application Number Priority Date Filing Date Title
AU2003289589A AU2003289589A1 (en) 2003-07-11 2003-12-31 Apparatus and method for transmitting/receiving voice electrics program guide information
EP03781082A EP1649691A4 (fr) 2003-07-11 2003-12-31 Appareil et procede d'emission/reception d'informations vocales de guide de programme electronique

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
KR10-2003-0047236 2003-07-11
KR20030047236 2003-07-11
KR10-2003-0097796 2003-12-26
KR20030097796 2003-12-26

Publications (1)

Publication Number Publication Date
WO2005006752A1 true WO2005006752A1 (fr) 2005-01-20

Family

ID=36117862

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/KR2003/002923 WO2005006752A1 (fr) 2003-07-11 2003-12-31 Appareil et procede d'emission/reception d'informations vocales de guide de programme electronique

Country Status (4)

Country Link
EP (1) EP1649691A4 (fr)
KR (1) KR100740884B1 (fr)
AU (1) AU2003289589A1 (fr)
WO (1) WO2005006752A1 (fr)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2007086683A1 (fr) 2006-01-24 2007-08-02 Electronics And Telecommunications Research Institute Système et procédé de service epg multimodal sur un système de radiodiffusion dmb/dab au moyen d'un langage xml epg étendu a étiquette vocale
DE102006006551A1 (de) * 2006-02-13 2007-08-16 Siemens Ag Verfahren und System zum Bereitstellen von Sprachdialoganwendungen
EP1770997A3 (fr) * 2005-09-26 2010-07-21 LG Electronics Inc. Système de diffusion destiné à fournir de l'information de programme et procédé correspondant

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR100784110B1 (ko) * 2006-09-06 2007-12-10 에스케이 텔레콤주식회사 모바일 대규모 회의 통화 시스템, 회의통화 방법 및 모바일대규모 회의 통화 시스템 운용 방법
KR100912047B1 (ko) * 2007-07-05 2009-08-12 삼성전자주식회사 디지털방송수신기에서의 방송안내데이터 디코딩 방법 및장치

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR19980073254A (ko) * 1997-03-13 1998-11-05 김광호 디지털 위성 방송 수신 시스템의 전자 프로그램 안내 장치 및 방법
KR20000034600A (ko) * 1998-11-30 2000-06-26 전주범 텔레비전에서의 메뉴 제어 방법
EP1079617A2 (fr) 1999-08-26 2001-02-28 Matsushita Electric Industrial Co., Ltd. Filtrage automatique de l'information de télévision à l'aide de la reconnaissance de la parole et du langage naturel
WO2001015444A1 (fr) * 1999-08-19 2001-03-01 Sony Corporation Procede de transmission et recepteur

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5737030A (en) * 1995-10-16 1998-04-07 Lg Electronics Inc. Electronic program guide device
JP3604030B2 (ja) 1999-02-25 2004-12-22 日本ビクター株式会社 電子番組ガイドの送信装置および受信装置
US6314398B1 (en) * 1999-03-01 2001-11-06 Matsushita Electric Industrial Co., Ltd. Apparatus and method using speech understanding for automatic channel selection in interactive television
JP2000324417A (ja) 1999-05-12 2000-11-24 Toshiba Corp 補助情報再生装置
JP2001008173A (ja) 1999-06-21 2001-01-12 Mitsubishi Electric Corp データ伝送装置
US7483834B2 (en) * 2001-07-18 2009-01-27 Panasonic Corporation Method and apparatus for audio navigation of an information appliance

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR19980073254A (ko) * 1997-03-13 1998-11-05 김광호 디지털 위성 방송 수신 시스템의 전자 프로그램 안내 장치 및 방법
KR20000034600A (ko) * 1998-11-30 2000-06-26 전주범 텔레비전에서의 메뉴 제어 방법
WO2001015444A1 (fr) * 1999-08-19 2001-03-01 Sony Corporation Procede de transmission et recepteur
EP1079617A2 (fr) 1999-08-26 2001-02-28 Matsushita Electric Industrial Co., Ltd. Filtrage automatique de l'information de télévision à l'aide de la reconnaissance de la parole et du langage naturel

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
MCGLASHAN ET AL.: "Voice Extensible Markup Language (Voice XML) Version 2.0", W3C, February 2003 (2003-02-01)
See also references of EP1649691A4 *

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1770997A3 (fr) * 2005-09-26 2010-07-21 LG Electronics Inc. Système de diffusion destiné à fournir de l'information de programme et procédé correspondant
WO2007086683A1 (fr) 2006-01-24 2007-08-02 Electronics And Telecommunications Research Institute Système et procédé de service epg multimodal sur un système de radiodiffusion dmb/dab au moyen d'un langage xml epg étendu a étiquette vocale
DE102006006551A1 (de) * 2006-02-13 2007-08-16 Siemens Ag Verfahren und System zum Bereitstellen von Sprachdialoganwendungen
DE102006006551B4 (de) * 2006-02-13 2008-09-11 Siemens Ag Verfahren und System zum Bereitstellen von Sprachdialoganwendungen sowie mobiles Endgerät
US8583441B2 (en) 2006-02-13 2013-11-12 Nuance Communications, Inc. Method and system for providing speech dialogue applications

Also Published As

Publication number Publication date
AU2003289589A1 (en) 2005-01-28
EP1649691A1 (fr) 2006-04-26
EP1649691A4 (fr) 2009-05-27
KR100740884B1 (ko) 2007-07-19
KR20050009168A (ko) 2005-01-24

Similar Documents

Publication Publication Date Title
US7769589B2 (en) System and method for providing electronic program guide
JP3800267B2 (ja) 送信装置および送信方法、受信装置および受信方法、並びに伝送媒体
US7133903B2 (en) Event control device and digital broadcasting system
CN103069810B (zh) 虚拟频道声明对象脚本绑定
KR101409023B1 (ko) 어플리케이션 서비스 제공 방법 및 시스템
CN103119954A (zh) 接收器,接收方法和程序
JPH11103452A (ja) インタラクティブ番組における対話及び画面制御方法
EP1503589A1 (fr) Dispositif et méthode pour la mise à disposition de messages publicitaires sur télévision numérique
WO2006031084A1 (fr) Systeme et procede pour fournir un service de diffusion de donnees personnalisee, terminal d'utilisateur et procede pour utiliser un service de diffusion de donnees personnalisee et structure d'application de diffusion de donnees utilisee a cette fin
EP1977596A1 (fr) Système et procédé de service epg multimodal sur un système de radiodiffusion dmb/dab au moyen d'un langage xml epg étendu a étiquette vocale
JP2001292425A (ja) 摺動型グラフックウィンドウを用いたメディアコンテンツとのインターラクティブシステム
EP1649691A1 (fr) Appareil et procede d'emission/reception d'informations vocales de guide de programme electronique
KR102307330B1 (ko) 수신 장치 및 수신 방법
US20030148734A1 (en) Methods for broadcasting music and reproducing the broadcasting in digital broadcasting system
KR100669906B1 (ko) 음성 전자 프로그램 안내 저작 시스템 및 그 방법
KR101227492B1 (ko) 프로그램 안내를 위한 데이터 구조, 방법, 그리고 방송수신기
JP2004048430A (ja) 連動データ放送番組制作送出システム
KR101241878B1 (ko) 프로그램 안내를 위한 데이터 구조, 방법, 그리고 방송수신기
JP2000201317A (ja) 受信方法、受信装置、記憶装置及び記憶媒体
GB2395388A (en) Auditory EPG that provides navigational messages for the user

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BW BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE EG ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NI NO NZ OM PG PH PL PT RO RU SC SD SE SG SK SL SY TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): BW GH GM KE LS MW MZ SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IT LU MC NL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
121 Ep: the epo has been informed by wipo that ep was designated in this application
WWE Wipo information: entry into national phase

Ref document number: 2003781082

Country of ref document: EP

WWP Wipo information: published in national office

Ref document number: 2003781082

Country of ref document: EP

NENP Non-entry into the national phase

Ref country code: JP