WO2010078823A1 - Method, apparatus, and system for controlling media based on texts

Publication number
WO2010078823A1
Authority
WO
WIPO (PCT)
Prior art keywords
text
media
offset
parameter
value
Prior art date
Application number
PCT/CN2009/076365
Other languages
French (fr)
Chinese (zh)
Inventor
杨玮玮
Original Assignee
华为技术有限公司
Priority date
Filing date
Publication date
Application filed by 华为技术有限公司

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L65/00 Network arrangements, protocols or services for supporting real-time applications in data packet communication
    • H04L65/60 Network streaming of media packets
    • H04L65/61 Network streaming of media packets for supporting one-way streaming services, e.g. Internet radio
    • H04L65/613 Network streaming of media packets for supporting one-way streaming services, e.g. Internet radio, for the control of the source by the destination
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L65/00 Network arrangements, protocols or services for supporting real-time applications in data packet communication
    • H04L65/10 Architectures or entities
    • H04L65/102 Gateways
    • H04L65/1023 Media gateways

Definitions

  • The present invention relates to network communication technologies, and in particular to a text-based media control method, apparatus, and system.
  • Background
  • The MGC (Media Gateway Controller) and the MG (Media Gateway) are two key components of a packet network.
  • The MGC is responsible for the call control function and the MG is responsible for the service bearer function. This separates the call control plane from the service bearer plane, so that network resources are fully shared, equipment upgrades and service expansion are simplified, and development and maintenance costs are greatly reduced.
  • The H.248 protocol provides a range of means for interactive control of multimedia files, including play, pause, resume, fast forward, and rewind of voice files.
  • The H.248.9 (Advanced Audio Server Package) protocol also provides several mechanisms for voice applications, such as TTS (Text To Speech), in which the MGC controls the MG to synthesize speech from text.
  • In these mechanisms, however, media playback can be controlled only by time information and encoding information, in formats including NPT (Normal Play Time), SMPTE (Society of Motion Picture and Television Engineers), UTC (Universal Time Code), Frame (frame number), and Byte (byte count).
  • The above protocols lack a means of controlling text files and therefore cannot support interactive operations on text information. For example, speech synthesis cannot be controlled at paragraph level according to the script text. As a result, the speech synthesis method in H.248.9 is limited to the most basic play and stop operations, so H.248 cannot effectively control text media or meet the various needs of multimedia applications.
  • Summary of the invention
  • Embodiments of the present invention provide a text-based media control method, apparatus, and system that solve the problem that existing mechanisms cannot support operation control of text media files.
  • An embodiment of the present invention provides a text-based media control method, including:
  • a media gateway receives a command request for a media operation, where the command request includes a text-based media operation indication; and
  • the media gateway operates on media data according to the text-based media operation indication.
  • An embodiment of the present invention provides a media gateway, including:
  • a receiving unit, configured to receive a command request for a media operation, where the command request includes a text-based media operation indication; and
  • an operation unit, configured to operate on media data according to the text-based media operation indication.
  • An embodiment of the present invention provides a network system, including:
  • a media gateway controller, configured to send a command request for a media operation to a media gateway, where the command request includes a text-based media operation indication; and
  • the media gateway, configured to receive the command request for the media operation and to operate on media data according to the text-based media operation indication.
  • With the text-based media control method, apparatus, and system provided by the embodiments of the present invention, the command request for a media operation sent to the media gateway includes a text-based media operation indication, and the media gateway can control playback of a text media file according to that indication. This achieves effective control of text media and meets the various requirements of multimedia applications.
  • FIG. 1 is a flowchart of a text-based media control method according to an embodiment of the present invention.
  • FIG. 2 is a flowchart of a text-based media control method according to Embodiment 2 of the present invention.
  • FIG. 3 is a flowchart of a text-based media control method according to Embodiment 3 of the present invention.
  • FIG. 4 is a flowchart of a text-based media control method according to Embodiment 4 of the present invention.
  • FIG. 5 is a schematic diagram of an MG according to an embodiment of the present invention.
  • FIG. 6 is a schematic diagram of a network system according to an embodiment of the present invention.
  • Detailed description
  • An embodiment of the present invention provides a text-based media control method, including: an MG receives a command request for a media operation, where the command request includes a text-based media operation indication; and
  • the MG operates on media data according to the text-based media operation indication.
  • Through the text-based media operation indication carried in the command request sent to the MG, media files on the MG can be controlled by text information.
  • Embodiment 1 is a specific implementation in which the MGC uses the text-based media operation indication to interactively control text media on the MG. In this embodiment, the media operation indication is a jump offset indication signal.
  • The embodiment may include the following steps:
  • The MGC sends a Modify request message that carries a Play Segment Identifier (playid) signal, instructing the MG to perform a TTS operation on specific text content.
  • The MG sends a Modify response message to the MGC and performs the TTS operation.
  • This embodiment instructs the MG to perform a jump offset during a TTS operation, so the two steps above (S101 and S102) are performed first. In other embodiments, these two steps may be unnecessary or may need to be adjusted.
  • The MGC sends a text-based jump offset indication signal to the MG, requiring the MG to perform a jump offset in the media playback, for example to jump forward 20 sentences from the current playback position.
  • The jump offset indication signal is carried in a request message, which may be an H.248 request message such as Modify, Move, or Add.
  • The jump offset indication signal can be implemented as an H.248 signal. It is text-based, that is, it carries operation information in a text-based format. Specifically, it can be defined as follows.
  • A jump offset indication signal is defined, by which the MGC instructs the MG to perform a jump offset operation on a media resource or signal. For example, the signal is named "Jump".
  • The signal can be defined in an existing package or in a new package. For example, a new package is defined and named "Play Offset Control Package (poc)".
  • The type of the signal can be set to Brief (BR), which means that the signal stops on its own or is replaced by a new signal descriptor. A BR signal has no timeout.
  • The signal can be applied to a termination (Termination) or to a stream (Stream) on a termination. If the signal is included in a signal descriptor sent to a termination or stream, it indicates that a jump is to be performed on that termination or stream.
  • The jump offset indication signal may include an operation object parameter, and may further include a jump offset value and a jump offset value unit.
  • The operation object may be a media resource or a signal. It can be defined by signal parameters, including at least one of the following:
  • (1) Signal Identifier (abbreviated as si), indicating the signal on which the jump offset is to be performed.
  • The type of this parameter is a string in the format "package ID/signal ID".
  • (2) Signal List Identifier (abbreviated as sli), indicating the signal list on which the jump offset is to be performed.
  • The type of this parameter is an integer.
  • (3) A parameter identifying the media resource. The type of this parameter is a string, for example in URI (Uniform Resource Identifier) or IRI (Internationalized Resource Identifier) format.
  • The three parameters (1), (2), and (3) identify the object of the jump offset operation. The corresponding media resource or signal can be specified by any one of them or by any combination of them.
  • The jump offset value and the corresponding jump offset value unit can also be implemented as signal parameters.
  • Adding text formats (such as word, sentence, or paragraph information) to the value unit supports text-based jump offset operations and thus control of text media files.
  • The content of the jump offset operation is specified by one or both of the following parameters.
  • (4) Jump Value (abbreviated as jv), indicating the value of the jump offset to be performed.
  • The type of this parameter is an integer (Integer); its values include all positive integers, 0, and negative integers. A positive integer indicates a jump in the positive direction; a negative integer indicates a jump in the negative direction.
  • (5) Jump Unit (Value Unit, abbreviated as vu), indicating the unit of the jump offset to be performed, that is, the unit corresponding to the jump value.
  • The type of this parameter is an enumeration (Enumeration).
  • Its values include Millisecond (milliseconds), Second (seconds), Frame (frames), and Byte (bytes).
  • In addition, support for text formats is added, including Word (a single word), Sentence (a single sentence), Paragraph (a single paragraph), and so on.
  • This parameter can be optional. A default value can be set by configuration, so that the MGC needs to send only the jump value parameter.
  • The two parameters (4) and (5) indicate the content of the jump offset operation, that is, the offset size and the control granularity. These parameters can also be defined in other combinations.
  • For example, the jump value may be limited to positive integers and 0, with a separate Jump Direction (abbreviated as jd) parameter defined to indicate the direction of the jump.
  • The type of this parameter is an enumeration with the values Forward and Backward. Forward indicates a jump in the positive direction; Backward indicates a jump in the negative direction.
  • The embodiment of the present invention is not limited thereto. A single parameter may be defined for the media operation indication, whose value contains all of the above information, that is, the jump offset value and the jump offset value unit.
  • For example, the value of that parameter may indicate a jump offset of 10 seconds backward.
  • To maintain the state of the signal currently playing on the MG, the MGC also carries the currently playing signal and/or signal list in the signal descriptor, indicating that the MG is to continue the corresponding signal playback.
  • The embodiment of the present invention is not limited thereto. The MGC may also include only the jump offset value in the Jump signal sent to the MG, with the jump offset value unit taking its default value.
  • The MG performs the jump offset operation on the specified media resource or signal according to the received request message.
  • In this example, the MG performs the jump offset on the preceding TTS signal, that is, it skips forward 20 sentences and continues the TTS operation.
  • The MG also sends a response message to the MGC.
  • In some cases the MG may be unable to execute the request correctly.
  • The MG may then return an error message to the MGC, such as error code 449 ("Unsupported or Unknown Parameter or Property Value").
  • For example, the MGC may issue a paragraph-based jump for audio media data on the MG, or a time-based jump for a text file, or the MG may support only some of the value units.
  • In such cases the MG can return an error message to the MGC.
  • A new error code can be defined to describe this error, for example named "Unsupported Offset Unit". In the above scenarios, the MG returns this error to the MGC.
  • In this embodiment, the text media file on the MG is operated on by carrying a text-based jump offset indication signal in the message sent by the MGC to the MG. The embodiment thus implements interactive control of text media files through H.248 and provides a simple and effective solution for text-based media control.
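
The jump behaviour of Embodiment 1 can be sketched roughly as follows. This is an illustration only, not H.248 message encoding: the class names `JumpSignal`, `TtsPlayback`, and `UnsupportedOffsetUnit`, the default unit, and the clamping at the script boundaries are all assumptions introduced here. Only the parameter names `jv` and `vu` and the idea of rejecting an unsupported unit come from the text.

```python
from dataclasses import dataclass
from typing import List


class UnsupportedOffsetUnit(Exception):
    """Hypothetical stand-in for the proposed "Unsupported Offset Unit" error."""


@dataclass
class JumpSignal:
    jv: int               # Jump Value: positive = forward, negative = backward
    vu: str = "Sentence"  # Value Unit; this default is an assumption


class TtsPlayback:
    """Toy MG-side playback state over pre-split sentences (hypothetical)."""

    # This toy MG supports sentence-based jumps only.
    SUPPORTED_UNITS = frozenset({"Sentence"})

    def __init__(self, sentences: List[str]) -> None:
        self.sentences = list(sentences)
        self.index = 0  # zero-based index of the sentence being played

    def jump(self, signal: JumpSignal) -> int:
        """Apply a jump offset and return the new sentence index."""
        if signal.vu not in self.SUPPORTED_UNITS:
            raise UnsupportedOffsetUnit(signal.vu)
        target = self.index + signal.jv
        # Clamping to the script bounds is an assumption of this sketch.
        self.index = max(0, min(target, len(self.sentences) - 1))
        return self.index
```

Starting from the first sentence, `JumpSignal(jv=20)` moves playback to index 20, which is the 21st sentence, while a signal with `vu="Second"` is rejected by this toy MG because it only understands sentence units.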
  • In this embodiment, the MGC controls the offset of media playback on the MG by carrying the media operation indication in an existing media operation signal.
  • The media operation indication includes an offset value, or an offset value and an offset unit. As shown in FIG. 2, this embodiment may include the following steps:
  • The MGC carries a Play Segment Identifier (playid) signal in a Modify request message, instructing the MG to perform a text-to-speech operation on specific text content, and carries the offset value and the offset unit with it.
  • The media operation indication, including the offset value and the corresponding offset unit, can be defined as signal parameters.
  • For text media files, time-based and similar methods may not be effective. Therefore, in the definition of the value and the value unit, text formats are provided in addition to time formats, so that text-based offset operations are supported and text media files can be controlled.
  • Parameter 1: Play Offset (abbreviated as po), indicating the value of the offset to be performed.
  • This parameter represents the offset value described above.
  • The type of this parameter is an integer (Integer); its values include all positive integers, 0, and negative integers. A positive integer indicates an offset in the positive direction; a negative integer indicates an offset in the negative direction.
  • Parameter 2: Offset Unit (abbreviated as ou), indicating the unit of the offset to be performed, that is, the unit corresponding to the play offset parameter.
  • The type of this parameter is an enumeration (Enumeration).
  • Its values include Millisecond (milliseconds), Second (seconds), Frame (frames), and Byte (bytes).
  • In addition, support for text formats is added, including Word (a single word), Sentence (a single sentence), Paragraph (a single paragraph), and so on.
  • The Offset Unit parameter can be optional. With a default value set, the MGC needs to send only one parameter, the offset value.
  • These signal parameters are carried together with the signal to indicate the playback offset: playback does not start from the initial position of the file, but from the position reached by adding the offset to the initial position.
  • For example, the MG may be required to start 20 sentences after the beginning of the file, that is, to perform text-to-speech conversion starting from the 21st sentence of the text.
  • The MG starts the TTS operation from the specified offset position according to the received request message, and sends a response message to the MGC.
  • In this embodiment, the offset of media playback on the MG is controlled by carrying signal parameters with offset information in an existing media operation signal.
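
As a rough illustration of the start offset in this embodiment, the sketch below splits a script into text units and drops the first `po` units before synthesis would begin. The function names, the regular expressions used to find sentence and paragraph boundaries, and the rejection of negative offsets are assumptions made for the example; the text does not say how the MG segments text or how a negative start offset is interpreted.

```python
import re
from typing import List


def split_units(text: str, ou: str) -> List[str]:
    """Split text into Word, Sentence, or Paragraph units (naive rules, assumed)."""
    if ou == "Word":
        return text.split()
    if ou == "Sentence":
        # Assumed rule: a sentence ends at '.', '!' or '?' followed by whitespace.
        return [s for s in re.split(r"(?<=[.!?])\s+", text.strip()) if s]
    if ou == "Paragraph":
        # Assumed rule: paragraphs are separated by blank lines.
        return [p.strip() for p in re.split(r"\n\s*\n", text) if p.strip()]
    raise ValueError(f"unsupported offset unit: {ou}")


def units_from_offset(text: str, po: int, ou: str = "Sentence") -> List[str]:
    """Return the units that remain after skipping the first `po` units."""
    if po < 0:
        # The meaning of a negative start offset is not described; reject it here.
        raise ValueError("negative start offset is not handled in this sketch")
    return split_units(text, ou)[po:]
```

With `po=20` and `ou="Sentence"`, the first unit returned is the 21st sentence of the script, which matches the example in the text.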
  • In this embodiment, the MGC uses a text-based media operation indication to control the playback range of a media file on the MG. The media operation indication is a text range value parameter, or a combination of a text range value parameter and a text range format parameter.
  • The text range format parameter includes a text-based value unit, which is at least one of the following: word, sentence, or paragraph.
  • Alternatively, the text-based media operation indication includes a text range parameter, which includes one or more text range values, or a combination of one or more text range values and a text range format parameter.
  • The implementation of this embodiment may include the following steps:
  • The MGC sends a request message for the media operation to the MG; the request message includes a text range value parameter and a text range format parameter. At the same time, the MGC can request detection of an event for the media operation result, to learn how the operation was executed.
  • For example, the MGC sends a Play Script signal to the MG in a Modify request message, instructing the MG to perform a TTS operation on the text script.
  • The text range format parameter (Text Range Format, trf) and the text range value parameter (Text Range Value, trv) are carried in the message.
  • The trf parameter can be of enumeration or string type. If it is an enumeration (Enumeration), its values can include Word (a single word), Sentence (a single sentence), Paragraph (a single paragraph), and so on. If it is a string, each of the enumeration values above can be defined as a string, and the possible strings must be agreed in advance between the MGC and the MG.
  • The trv parameter is of string type and can contain multiple text ranges.
  • The format can be "first text range, second text range, ...". Each text range is given by a first value and a second value.
  • The first value may be earlier or smaller than the second value, or it may be later or greater than the second value.
  • The MGC can also set the signal completion event (g/sc) and the TTS operation failure event (aastts/ttsfail), instructing the MG to detect how the signal is executed.
  • Alternatively, only the text range value parameter may be included in the request message, with the text range format parameter taking its default value.
  • Alternatively, a single text range parameter may be defined, which includes one or more text range values, or a combination of one or more text range values and a text range format parameter.
  • The text range format parameter includes a text-based value unit, which is at least one of the following: word, sentence, or paragraph. If only the text range value is included, the text range format takes its default value.
  • This parameter can be carried in the media operation request message sent by the MGC to the MG, to control the text media file on the MG.
  • The MG performs the TTS operation on the fifth to tenth sentences of the text script as instructed by the MGC, and sends a response message to the MGC. That is, after receiving the Modify request message, the MG learns from it that the MGC wants a text-to-speech operation on the fifth to tenth sentences of the text script, and performs the operation accordingly.
  • The MG reports the TTS operation result to the MGC in a Notify request.
  • If the operation succeeds, the MG reports a signal completion event (g/sc); if the operation fails, the MG reports a TTS operation failure event (aastts/ttsfail) and may also carry the failure cause in an event parameter.
  • The MGC sends a Notify response message to the MG.
  • Steps S303 and S304 are optional and are not essential to the embodiment of the present invention.
  • The embodiment of the present invention is not limited to the solution described in Embodiment 3.
  • To control playback of text media on the MG, the MGC can also carry text range control parameters in other command requests that it sends.
  • Carrying the text range control parameter in a command request sent to the MG lets it be applied to various media control operations, such as play, pause, resume, fast forward, rewind, and playback speed or bandwidth adjustment, achieving interactive playback control of text media on the MG.
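
The handling of a text range value (trv) can be sketched as below. The exact syntax of a single text range is not preserved in this text, so the hyphen-separated form `"5-10"` used here is purely an assumption, as are the function names. A range whose first value is greater than its second is treated as the same span in document order, which is also an assumption.

```python
from typing import List, Tuple


def parse_trv(trv: str) -> List[Tuple[int, int]]:
    """Parse a comma-separated list of ranges such as "5-10, 12-15" (assumed syntax)."""
    ranges = []
    for part in trv.split(","):
        part = part.strip()
        if not part:
            continue
        first, second = part.split("-", 1)
        ranges.append((int(first), int(second)))
    return ranges


def select_units(units: List[str], ranges: List[Tuple[int, int]]) -> List[str]:
    """Pick the units covered by each range; positions are 1-based and inclusive."""
    selected: List[str] = []
    for first, second in ranges:
        lo, hi = min(first, second), max(first, second)
        selected.extend(units[lo - 1:hi])
    return selected
```

For a script already split into sentences, `select_units(sentences, parse_trv("5-10"))` yields the fifth to tenth sentences, the range used in the example of this embodiment.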
  • In this embodiment, the MGC controls the playback of text media on the MG through a pause command carrying a text-based media operation indication.
  • This embodiment may include the following steps:
  • The MGC sends a pause command to the MG, requesting the MG to pause the media operation.
  • The MGC may issue an H.248 pause command to the MG. Together with the pause command, the MGC can carry the text range format parameter and the text range value parameter and assign the value "$" to them. "$" is the "Choose" wildcard, indicating that the MGC wants to obtain the text range at which the MG pauses the media operation.
  • The MGC can assign a specific value such as "Word" to the text range format parameter and "$" to the text range value parameter, indicating that the MGC wants to know which word the MG had reached when the pause took effect. Similarly, the MGC can assign "$" to both parameters, indicating that the MG chooses the granularity (word, sentence, or paragraph) of the reported text range; that is, the MG itself decides whether the position reached when the pause took effect is reported as a word, sentence, or paragraph position.
  • The MG pauses the media operation as instructed by the MGC.
  • That is, the operation is paused when the pause request is received.
  • The MG reports to the MGC, in a response message, the specific text range reached when the pause was performed. If the pause indication issued by the MGC carries a specific text range format parameter (for example "Word") and the text range value parameter "$", the MG reports the text range in that format, for example which word it had reached when the pause took effect.
  • If both the text range format parameter and the text range value parameter carried in the pause indication are "$", the MG can report the position actually reached, expressed in words, sentences, or paragraphs.
  • In this embodiment, text media control parameters are carried in a media control command to control text media on the MG, and the MGC can obtain from the MG's response message the text range that had been reached on the MG when the text media was controlled.
  • The embodiment of the present invention is not limited thereto. The wildcard "$" used for the text control information in Embodiment 4 can also be applied to the indications for fast forward and rewind of a text file played on the MG, and to Embodiment 1 and Embodiment 2.
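
How an MG might resolve the "$" wildcard when reporting a pause position can be sketched as follows. The names `PausePosition` and `report_pause`, the way the MG tracks its position, and the MG's own choice of "Sentence" when the format is also "$" are assumptions for illustration; only the "$" wildcard and the Word, Sentence, and Paragraph units come from the text.

```python
from dataclasses import dataclass
from typing import Tuple

CHOOSE = "$"  # the "Choose" wildcard described in the text


@dataclass
class PausePosition:
    """Position reached when the pause took effect (hypothetical MG state)."""
    word: int
    sentence: int
    paragraph: int


def report_pause(pos: PausePosition, trf: str = CHOOSE,
                 mg_default: str = "Sentence") -> Tuple[str, str]:
    """Return the (trf, trv) pair the MG would report for a wildcard trv."""
    # If the MGC left the format as "$", the MG picks the granularity itself.
    unit = mg_default if trf == CHOOSE else trf
    values = {"Word": pos.word, "Sentence": pos.sentence, "Paragraph": pos.paragraph}
    if unit not in values:
        raise ValueError(f"unsupported text range format: {unit}")
    return unit, str(values[unit])
```

If the MGC sends the format "Word" with the value "$", this sketch reports the word position; if both are "$", it falls back to the MG's own preferred unit.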
  • The text-based media control method of an embodiment of the present invention may be accomplished through command requests between the MGC and the MG that include a text-based media control indication.
  • The command types between the MGC and the MG include Add, Modify, Subtract, Move, AuditValue, AuditCapabilities, Notify, ServiceChange, and so on.
  • Command parameters include Property, Signal, Event, and Statistic.
  • The text-based media control indication can also be carried in a package, which aggregates parameters that are related by service.
  • Embodiments of the present invention implement operations such as play, pause, fast forward, and rewind of text media files on the MG through text-based media control indications, and these indications can be carried in various ways in the media control command requests sent by the MGC to the MG. The embodiments thus implement interactive control of text media files through H.248, providing a simple and effective solution for text-based media control.
  • An MG provided by an embodiment of the present invention, as shown in FIG. 5, includes:
  • a receiving unit 501, configured to receive a command request for a media operation, where the command request includes a text-based media operation indication; and
  • an operating unit 502, configured to operate on media data according to the text-based media operation indication.
  • The text-based media operation indication includes any one of the following: a jump offset indication signal; an offset value, or an offset value and an offset unit; a text range value parameter, or a combination of a text range format parameter and a text range value parameter; or a text range parameter, which includes one or more text range values, or a combination of one or more text range values and a text range format parameter.
  • The offset unit and the text range format parameter include a text-based value unit, which is at least one of the following: word, sentence, or paragraph.
  • An embodiment of the present invention also provides a network system, as shown in FIG. 6, including:
  • a media gateway controller 601, configured to send a command request for a media operation to the media gateway, where the command request includes a text-based media operation indication; and
  • a media gateway 602, configured to receive the command request for the media operation and to operate on media data according to the text-based media operation indication.
  • The MGC, the MG, and the network system provided by the embodiments of the present invention can implement MGC control of text media files on the MG with reference to the foregoing embodiments of the text-based media control method. The command request for a media operation sent by the MGC to the MG includes a text-based media operation indication, and the MG can control playback of a text media file according to that indication, which achieves effective control of text media and meets the various requirements of multimedia applications.
  • the storage medium may be a magnetic disk, an optical disk, a read-only memory (ROM), or a random access memory (RAM).

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Information Transfer Between Computers (AREA)

Abstract

The technical field of the invention relates to network communications, and in particular to a method, apparatus, and system for controlling media based on texts. The method for controlling media based on texts includes: a media gateway receives a media operation command request which includes a media operation indication based on texts, and operates media data according to the media operation indication based on texts. The media gateway in the embodiments includes: a receiving unit for receiving the media operation command request which includes the media operation indication based on texts; an operation unit for operating the media data according to the media operation indication based on texts. The present invention is suitable for the media operation control in network communications.

Description

一种基于文本的媒体控制方法、 装置和系统 本申请要求了 2009年 1月 12 日提交的、 申请号为 200910001788.7、 发 明名称为"一种基于文本的媒体控制方法、 装置和系统"的中国申请的优先权, 其全部内容通过引用结合在本申请中。  A text-based media control method, device and system. The present application claims a Chinese application filed on January 12, 2009, with the application number 200910001788.7, entitled "A Text-Based Media Control Method, Apparatus and System" Priority is hereby incorporated by reference in its entirety.
技术领域 Technical field
本发明涉及网络通信技术, 尤其涉及一种基于文本的媒体控制方法、 装 置和系统。 背景技术  The present invention relates to network communication technologies, and in particular, to a text-based media control method, apparatus and system. Background technique
The MGC (Media Gateway Controller) and the MG (Media Gateway) are two key components of a packet network. The MGC is responsible for call control and the MG for service bearing, which separates the call control plane from the service bearer plane. This separation allows network resources to be fully shared, simplifies equipment upgrades and service expansion, and greatly reduces development and maintenance costs.
The H.248 protocol (Gateway Control Protocol) provides a series of means for interactive control of multimedia files, including play, pause, resume, fast-forward, and rewind of voice files. The H.248.9 protocol (Advanced Audio Server Package) also provides several means of implementing voice applications; for example, with the TTS (Text To Speech) package the MGC controls the MG to synthesize speech from text. In these mechanisms, however, the content of media playback can be controlled only on the basis of time information and encoding information, including the NPT (Normal Play Time) format, the SMPTE (Society of Motion Picture and Television Engineers) format, the UTC (Coordinated Universal Time) format, the Frame (frame count) format, and the Byte (byte count) format.
In the course of implementing the present invention, the inventor found that the above protocols still lack means for controlling text files and therefore cannot support interactive operations on text information; for example, paragraph-level control according to the script text is not possible in speech synthesis. As a result, the speech synthesis method in H.248.9 is limited to the most basic play and stop operations, and H.248 can neither control text media effectively nor meet the varied needs of multimedia applications.
Summary of the Invention
Embodiments of the present invention provide a text-based media control method, apparatus, and system that solve the problem that existing mechanisms cannot support operation control of text media files.
Embodiments of the present invention adopt the following technical solutions:
An embodiment of the present invention provides a text-based media control method, including:
receiving, by a media gateway, a command request for a media operation, where the command request contains a text-based media operation indication; and
operating on media data according to the text-based media operation indication.
An embodiment of the present invention provides a media gateway, including:
a receiving unit, configured to receive a command request for a media operation, where the command request contains a text-based media operation indication; and
an operation unit, configured to operate on media data according to the text-based media operation indication.
An embodiment of the present invention provides a network system, including:
a media gateway controller, configured to send a command request for a media operation to a media gateway, where the command request contains a text-based media operation indication; and
the media gateway, configured to receive the command request for the media operation, where the command request contains the text-based media operation indication, and to operate on media data according to the text-based media operation indication.
With the text-based media control method, apparatus, and system provided by the embodiments of the present invention, the command request for a media operation sent to the media gateway contains a text-based media operation indication, and the media gateway acts on that indication. Playback of text media files can thus be controlled, so that text media is controlled effectively and the varied needs of multimedia applications are met.
Brief Description of the Drawings
To describe the technical solutions in the embodiments of the present invention or in the prior art more clearly, the drawings needed for describing the embodiments or the prior art are briefly introduced below. Evidently, the drawings described below show only some embodiments of the present invention, and a person of ordinary skill in the art can derive other drawings from them without creative effort.
FIG. 1 is a flowchart of a text-based media control method according to Embodiment 1 of the present invention;
FIG. 2 is a flowchart of a text-based media control method according to Embodiment 2 of the present invention;
FIG. 3 is a flowchart of a text-based media control method according to Embodiment 3 of the present invention;
FIG. 4 is a flowchart of a text-based media control method according to Embodiment 4 of the present invention;
FIG. 5 is a schematic diagram of an MG according to an embodiment of the present invention;
FIG. 6 is a schematic diagram of a network system according to an embodiment of the present invention.
Detailed Description
The text media control method, apparatus, and network system provided by the embodiments of the present invention are described in detail below with reference to the accompanying drawings.
It should be clear that the described embodiments are only some, not all, of the embodiments of the present invention. All other embodiments obtained by a person of ordinary skill in the art based on the embodiments of the present invention without creative effort fall within the protection scope of the present invention.
An embodiment of the present invention provides a text-based media control method, including: receiving, by an MG, a command request for a media operation, where the command request contains a text-based media operation indication; and
operating on media data according to the text-based media operation indication.
With this text-based media control method, the text-based media operation indication carried in the command request sent to the MG makes it possible to control media files on the MG by means of text information.
The text media control method of the embodiments is described in detail below through more specific implementations.
Embodiment 1
In this embodiment, the MGC uses a text-based media operation indication to interactively control text media on the MG, and the media operation indication is a jump offset indication signal. A TTS (text-to-speech) application on the MG is taken as an example.
As shown in FIG. 1, this embodiment may include the following steps:
S101. The MGC sends a Modify request message instructing the MG to perform a TTS operation on specific text content; the message carries a Play Segment Identifier (playsid) signal.
S102. The MG sends a Modify response message to the MGC and performs the TTS operation.
This embodiment instructs the MG to perform a jump offset during a TTS operation, so steps S101 and S102 are needed. In other embodiments these two steps may be unnecessary or may need to be adjusted accordingly.
S103. The MGC sends a text-based jump offset indication signal to the MG, requesting the MG to perform a jump offset operation on the media playback, for example to jump 20 sentences forward from the current playback position.
Specifically, the jump offset indication signal is carried in a request message, which may be an H.248 request message such as Modify, Move, or Add.
The jump offset indication signal may be implemented as an H.248 signal. The signal is text-based, that is, it carries operation information in a text-based format. It is defined as follows.
A jump offset indication signal (Signal) is defined, by which the MGC instructs the MG to perform a jump offset operation on a media resource or signal; for example, the signal is named "Jump". The signal may be defined in an existing package or in a new package, for example a new package named "Play Offset Control Package" (poc). The signal type may be set to Brief (BR), meaning that the signal stops by itself or stops when it is replaced by a new signals descriptor; a BR signal has no expiry-time limit.
The signal can be applied to a termination (Termination) or to a stream (Stream) on a termination. If the signal is included in the signals descriptor sent to a termination or stream, the jump is to be performed on that termination or stream.
The jump offset indication signal may include an operation object parameter and may further include a jump offset value and a jump offset value unit.
The operation object may be a media resource or a signal. The operation object parameter can be defined as a signal parameter and may include at least one of the following parameters:
(1) Signal Identifier (abbreviated si): the signal on which the jump offset is to be performed. The parameter type is string, in the format "package identifier/signal identifier".
(2) Signal List Identifier (abbreviated sli): the signal list on which the jump offset is to be performed. The parameter type is integer.
(3) Media Resource Identifier (abbreviated mri): the media resource on which the jump offset is to be performed. The parameter type is string, for example in URI (Uniform Resource Identifier) or IRI (Internationalized Resource Identifier) format. When multiple instances of the same signal exist on a termination or stream, the media resource identifier can be used to distinguish them.
Parameters (1), (2), and (3) identify the object of the jump offset operation; any one of them, or any combination of them, can designate the corresponding media resource or signal.
The jump offset value and its unit can likewise be defined as signal parameters. In defining them, text formats (such as word, sentence, or paragraph information) are added alongside the time-based formats to support text-based jump offset operations, which makes it possible to control text media files. The content of the jump offset operation is specified by one or both of the following parameters.
(4) Jump Value (abbreviated jv): the size of the jump offset. The parameter type is Integer, and its values include all positive integers, 0, and negative integers. A positive integer means a jump in the forward direction; a negative integer means a jump in the backward direction.
(5) Value Unit (abbreviated vu): the unit of the jump offset value. The parameter type is enumeration. In addition to Millisecond (milliseconds), Second (seconds), Frame (frames), and Byte (bytes), the values include text formats: Word (single words), Sentence (single sentences), Paragraph (single paragraphs), and so on. This parameter may be made optional, with a default value set by configuration, so that the MGC needs to send only the jump value parameter.
Parameters (4) and (5) express the content of the jump offset operation, that is, the offset size and the control granularity. The two parameters can also be defined in other ways.
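As an illustration, the two parameters could be modelled as follows. This is a minimal sketch rather than anything defined by H.248: the class names, the `DEFAULT_UNIT` setting, and the `encode` output format are assumptions made for the example, while the parameter names `jv` and `vu` and the unit values come from the definitions above.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Optional


class ValueUnit(Enum):
    # Time- and encoding-based units
    MILLISECOND = "Millisecond"
    SECOND = "Second"
    FRAME = "Frame"
    BYTE = "Byte"
    # Text-based units added by this embodiment
    WORD = "Word"
    SENTENCE = "Sentence"
    PARAGRAPH = "Paragraph"


# Hypothetical provisioned default, used when the MGC omits vu.
DEFAULT_UNIT = ValueUnit.SECOND


@dataclass(frozen=True)
class JumpParams:
    jv: int                         # signed: positive = forward, negative = backward
    vu: Optional[ValueUnit] = None  # optional; falls back to DEFAULT_UNIT

    def unit(self) -> ValueUnit:
        return self.vu if self.vu is not None else DEFAULT_UNIT

    def is_text_based(self) -> bool:
        return self.unit() in (ValueUnit.WORD, ValueUnit.SENTENCE, ValueUnit.PARAGRAPH)

    def encode(self) -> str:
        # Only parameters that were actually supplied are emitted.
        parts = [f"jv={self.jv}"]
        if self.vu is not None:
            parts.append(f"vu={self.vu.value}")
        return ", ".join(parts)
```

Leaving `vu` unset mirrors the optional-unit case: the sender emits only `jv`, and the receiver applies its configured default.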
For example, the jump offset value can be restricted to positive integers and 0, with a separate Jump Direction (abbreviated jd) parameter giving the direction of the jump. The type of jd is Enumeration, with the value Forward or Backward: Forward means a jump in the forward direction, and Backward means a jump in the backward direction.
The embodiments of the present invention are not limited to this. It is also possible to define a single media operation indication parameter whose value contains both the jump offset value and the jump offset value unit. For example, the parameter can be named Jump Content (abbreviated jc), with type string and the value format "value unit=jump value", which can be written as jc="vu=jv", where the value unit and the jump value are defined as above. For example, jc="Second=-10" means a jump of 10 seconds backward, and jc="Sentence=20" means a jump of 20 sentences forward.
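A receiver of the combined jc parameter has to split it back into unit and value. The following is a minimal sketch of that step; the function name `parse_jump_content` and the choice to raise `ValueError` on malformed input are assumptions for illustration, not part of the text above.

```python
from typing import Tuple

UNITS = {"Millisecond", "Second", "Frame", "Byte", "Word", "Sentence", "Paragraph"}


def parse_jump_content(jc: str) -> Tuple[str, int]:
    """Split a jc value such as 'Sentence=20' into (unit, signed jump value)."""
    unit, sep, value = jc.strip().strip('"').partition("=")
    unit = unit.strip()
    if not sep or unit not in UNITS:
        raise ValueError(f"unsupported jump content: {jc!r}")
    # int() accepts a leading minus sign, so backward jumps need no special case.
    return unit, int(value)
```

With the two examples from the text, `parse_jump_content("Second=-10")` yields `("Second", -10)` and `parse_jump_content("Sentence=20")` yields `("Sentence", 20)`.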
To maintain the state of the signal currently playing on the MG, when the MGC sends the jump offset indication it must also carry the currently playing signal and/or signal list in the signals descriptor, indicating that the MG is to continue the corresponding signal playback.
In this embodiment, the MGC sends the Jump signal to the MG carrying both the jump offset value and the jump offset value unit, as "jv=20, vu=Sentence", which requests the MG to jump 20 sentences forward. In the signals descriptor it sends, the MGC must carry, besides the Jump signal, the TTS operation signal that is currently playing (for example, the Play Segment Identifier signal).
The embodiments of the present invention are not limited to this: the Jump signal sent by the MGC to the MG may include only the jump offset value, with the jump offset value unit taken from the default.
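The requirement to resend the playing signal together with the Jump signal can be pictured with a small builder for the signals descriptor. This is only a sketch that approximates the H.248 text encoding: the signal identifiers `poc/jump` and `aastts/playsid` are assumed for illustration (the text names the package poc and the signal Jump but assigns no signal identifier), and the parameters of the playsid signal are omitted.

```python
def signal(name: str, **params: object) -> str:
    """Render one signal, with optional parameters in braces."""
    if not params:
        return name
    body = ",".join(f"{key}={value}" for key, value in params.items())
    return f"{name}{{{body}}}"


def signals_descriptor(*signals: str) -> str:
    """Render a signals descriptor listing every signal that should be active."""
    return "Signals{" + ",".join(signals) + "}"


# The currently playing TTS signal is listed again next to the Jump signal,
# so that the new descriptor does not stop it.
descriptor = signals_descriptor(
    signal("aastts/playsid"),
    signal("poc/jump", jv=20, vu="Sentence"),
)
```

Omitting `vu` from the `signal(...)` call corresponds to the case where the MG falls back to its default unit.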
S104. According to the received request message, the MG performs the jump offset operation on the specified media resource or signal. In this example, the MG applies the jump offset to the preceding TTS signal, that is, it jumps 20 sentences forward and then continues the TTS operation. The MG also sends a response message to the MGC.
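The position arithmetic behind a sentence-based jump can be sketched as follows. The sentence splitter is deliberately naive and both function names are hypothetical; a real MG would rely on whatever segmentation its TTS engine provides.

```python
import re
from typing import List


def split_sentences(text: str) -> List[str]:
    # Naive splitter: a sentence ends at '.', '!' or '?' followed by whitespace.
    return [s for s in re.split(r"(?<=[.!?])\s+", text.strip()) if s]


def jump(position: int, jv: int, total: int) -> int:
    """Return the new 0-based sentence index after jumping jv sentences."""
    target = position + jv
    if not 0 <= target < total:
        raise IndexError("jump target lies outside the text")
    return target
```

For a script of four sentences, `jump(0, 2, 4)` moves playback to the third sentence, and a negative `jv` moves it backward in the same way.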
The MG may be unable to respond correctly to a jump offset indication signal sent by the MGC.
If the MG does not support the jump offset value or the jump offset value unit, or if the jump offset value sent by the MGC exceeds the range of the media being operated on at the MG (for example, a jump of 30 sentences is requested on a text file of only 20 sentences), the MG may return an error to the MGC, for example error code 449 ("Unsupported or Unknown Parameter or Property Value"). If the MG recognizes these parameters but cannot perform the operation they describe, for example when the MGC requests a sentence-based jump on audio media data or a time-based jump on a text file, or if the MG supports only some of the value units, the MG may also return an error to the MGC. A new error code can be defined to describe this error, for example one named "Unsupported Offset Unit". In the above scenarios the MG returns this error to the MGC.
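One possible reading of these error cases is sketched below. The split between the two errors is an interpretation, not something the text fixes precisely: an unrecognized unit or an out-of-range jump maps to error 449, and a recognized unit that cannot be applied to the media in question maps to the proposed new error. That error is given by name only, so no code number is assumed for it; the function and constant names are hypothetical.

```python
from typing import Optional, Set, Tuple

KNOWN_UNITS = {"Millisecond", "Second", "Frame", "Byte", "Word", "Sentence", "Paragraph"}

Error = Tuple[Optional[int], str]

# Error 449 is quoted in the text above.
ERR_449: Error = (449, "Unsupported or Unknown Parameter or Property Value")
# The new error is proposed by name only, so None stands in for its code number.
ERR_OFFSET_UNIT: Error = (None, "Unsupported Offset Unit")


def validate_jump(vu: str, jv: int, position: int, length: int,
                  usable_units: Set[str]) -> Optional[Error]:
    """Return None if the jump can be performed, otherwise the error to report."""
    if vu not in KNOWN_UNITS:
        return ERR_449              # unit not recognized at all
    if vu not in usable_units:
        return ERR_OFFSET_UNIT      # recognized, but not applicable to this media
    if not 0 <= position + jv < length:
        return ERR_449              # jump would leave the media
    return None
```

Here `usable_units` stands for the units the MG can apply to the media at hand, for example only the text units for a text file.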
By carrying a text-based jump offset indication signal in the message sent by the MGC to the MG, this embodiment operates on text media files on the MG. H.248 thereby gains interactive control of text media files, and text-based media control is given a simple and effective solution.
Embodiment 2
Unlike Embodiment 1, in this embodiment the MGC controls the offset of media playback on the MG by carrying a media operation indication in an existing media operation signal. The media operation indication includes an offset value, or an offset value and an offset unit. As shown in FIG. 2, this embodiment may include the following steps:
S201. The MGC carries a Play Segment Identifier (playsid) signal in a Modify request message, instructing the MG to perform a text-to-speech operation on specific text content, and carries an offset value and an offset unit.
The signal parameters carrying the media operation indication, namely the offset value and its unit, are defined as follows. Because time-based control may not be effective for text files or applications, text formats are defined for the value and its unit alongside the time-based formats, to support text-based offset operations and thus control of text media files.
Parameter 1: Play Offset (abbreviated po): the size of the offset, that is, the offset value mentioned above. The parameter type is Integer, and its values include all positive integers, 0, and negative integers. A positive integer means an offset in the forward direction; a negative integer means an offset in the backward direction.
Parameter 2: Offset Unit (abbreviated ou): the unit of the play offset parameter. The parameter type is enumeration. In addition to Millisecond (milliseconds), Second (seconds), Frame (frames), and Byte (bytes), the values include text formats: Word (single words), Sentence (single sentences), Paragraph (single paragraphs), and so on. The offset unit may be made optional, with a default value set by configuration, so that the MGC needs to send only one parameter, the offset value.
When the MGC sends a play signal to the MG, it carries the above signal parameters to indicate the playback offset: playback starts not from the initial position of the file but from the initial position plus the offset given by the parameters.
In this embodiment, assume the signal parameters are "po=20, ou=Sentence". This requests the MG to start 20 sentences after the beginning of the file, that is, to begin text-to-speech conversion at the 21st sentence of the text.
S202. According to the received request message, the MG starts the TTS operation from the specified offset position, and sends a response message to the MGC.
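The relationship between the po value and the first unit played is easy to get wrong by one, so a short sketch may help. It assumes the text has already been split into units (sentences here); the function name is hypothetical, and rejecting negative or too-large offsets is a choice made for the example, since the text does not say how such values are handled at the start of a file.

```python
from typing import List


def units_from_offset(units: List[str], po: int) -> List[str]:
    """Return the units to be played when playback starts po units after the beginning."""
    if not 0 <= po < len(units):
        raise ValueError("play offset lies outside the text")
    # An offset of po skips the first po units, so playback starts at unit po + 1.
    return units[po:]


sentences = [f"sentence {n}" for n in range(1, 31)]
remaining = units_from_offset(sentences, 20)
```

With 30 sentences and po=20, `remaining` starts at "sentence 21" and contains the last 10 sentences.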
By carrying offset signal parameters in an existing media operation signal, this embodiment controls the offset of media playback on the MG.
Embodiment 3
In this embodiment, the MGC uses a text-based media operation indication to control the playback range of a media file on the MG. The media operation indication is a text range value parameter, or a combination of a text range value parameter and a text range format parameter. The text range format parameter includes a text-based unit, which is at least one of the following: word, sentence, or paragraph.
The text-based media operation indication includes a text range parameter, which comprises one or more text range values, or a combination of one or more text range values and a text range format parameter.
As shown in FIG. 3, this embodiment may include the following steps:
S301. The MGC sends a media operation request message to the MG; the request message includes a text range value parameter and a text range format parameter. The MGC may also request detection of events that report the result of the media operation, in order to learn how the operation went.
Specifically, the MGC sends a Play Script signal to the MG in a Modify request message, instructing the MG to perform a TTS operation on the script. The message carries a text range format parameter (Text Range Format, trf) and a text range value parameter (Text Range Value, trv).
The trf parameter may be of enumeration or string type. As an enumeration, its values may include Word (single words), Sentence (single sentences), Paragraph (single paragraphs), and so on. As a string, the same values are defined or agreed as strings, and the possible strings must be agreed in advance between the MGC and the MG.
The trv parameter is of string type and may contain multiple text ranges, for example in the format "first text range, second text range, ...", where each text range has the format "first text value-second text value". The first value may be earlier (smaller) than the second, or later (larger) than the second.
In this embodiment, the Modify request message includes the parameters "trf=Sentence, trv=5-10", indicating that the MGC wants the MG to perform text-to-speech conversion on sentences 5 to 10 of the text. In the Modify request message, the MGC may also set the signal completion event (g/sc) and the TTS operation failure event (aastts/ttsfail), instructing the MG to detect how the signal executes.
The request message may also include only the text range value parameter, with the text range format parameter taking its default.
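The trf and trv parameters could be decoded along the following lines. This is a minimal sketch: the function names, the dictionary used to hold the received parameters, and the `DEFAULT_FORMAT` value are assumptions for illustration. Ranges are returned exactly as written, including ranges whose first value is larger than the second.

```python
from typing import Dict, List, Tuple

TEXT_FORMATS = ("Word", "Sentence", "Paragraph")
DEFAULT_FORMAT = "Sentence"  # hypothetical provisioned default for a missing trf


def parse_trv(trv: str) -> List[Tuple[int, int]]:
    """Parse 'a-b, c-d, ...' into a list of (first, second) pairs."""
    ranges = []
    for item in trv.split(","):
        first, dash, second = item.strip().partition("-")
        if not dash:
            raise ValueError(f"malformed text range: {item!r}")
        ranges.append((int(first), int(second)))
    return ranges


def parse_range_params(params: Dict[str, str]) -> Tuple[str, List[Tuple[int, int]]]:
    """Return (format, ranges), applying the default format when trf is absent."""
    trf = params.get("trf", DEFAULT_FORMAT)
    if trf not in TEXT_FORMATS:
        raise ValueError(f"unknown text range format: {trf!r}")
    return trf, parse_trv(params["trv"])
```

For the example in the text, `parse_range_params({"trf": "Sentence", "trv": "5-10"})` gives `("Sentence", [(5, 10)])`.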
The embodiments of the present invention are not limited to separate trf and trv parameters. A single text range parameter can also be defined, comprising one or more text range values, or a combination of one or more text range values and a text range format parameter. The text range format parameter includes a text-based unit, which is at least one of the following: word, sentence, or paragraph. If only text range values are included, the text range format takes its default.
Carrying this parameter in the media operation request message sent by the MGC to the MG enables control of text media files on the MG.
For example, the text range parameter is named "Text Range" (tr). The parameter is of string type and may contain multiple text ranges, in the format "first text range, second text range, ...", where each text range takes the form "text range format=text range value". For example, in place of the parameters "trf=Sentence, trv=5-10" in the Modify request message described above, the message could carry tr="Sentence=5-15, Sentence=20-25", which indicates that the media operation is to be performed on the text from sentence 5 to sentence 15 and from sentence 20 to sentence 25.
S302. The MG performs the TTS operation on sentences 5 to 10 of the text script as instructed by the MGC, and sends a response message to the MGC. That is, on receiving the Modify request message, the MG learns from it that the MGC wants text-to-speech conversion performed on sentences 5 to 10 of the text script, and acts accordingly.
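How an MG might resolve a range request against the script can be sketched with the combined tr form, in which each range carries its own format. The function names are hypothetical, the script is assumed to be already split into sentences, sentence numbers are taken to be 1-based and inclusive, and a range written with its larger value first is simply normalized, which is a choice made for this example.

```python
from typing import List, Tuple


def parse_tr(tr: str) -> List[Tuple[str, int, int]]:
    """Parse 'Format=a-b, Format=c-d' into (format, first, second) triples."""
    ranges = []
    for item in tr.split(","):
        fmt, eq, span = item.strip().partition("=")
        first, dash, second = span.partition("-")
        if not eq or not dash:
            raise ValueError(f"malformed text range: {item!r}")
        ranges.append((fmt.strip(), int(first), int(second)))
    return ranges


def select_sentences(sentences: List[str], tr: str) -> List[str]:
    """Return the sentences covered by every range in tr, in the order given."""
    selected: List[str] = []
    for fmt, first, second in parse_tr(tr):
        if fmt != "Sentence":
            raise ValueError("this sketch resolves sentence ranges only")
        low, high = min(first, second), max(first, second)
        selected.extend(sentences[low - 1:high])  # 1-based, inclusive bounds
    return selected
```

For a 30-sentence script, `select_sentences(script, "Sentence=5-10")` returns sentences 5 through 10, which is the portion the MG would pass to the TTS engine in this step.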
S303. The MG reports the result of the TTS operation to the MGC in a Notify request.
If the operation succeeds, the MG reports the signal completion event (g/sc); if the operation fails, the MG reports the TTS operation failure event (aastts/ttsfail) and may also carry the failure cause in an event parameter.
S304. The MGC sends a Notify response message to the MG.
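The choice of event reported in step S303 can be sketched as below. The event names g/sc and aastts/ttsfail are taken from the text; the function name, the dictionary layout, and the `cause` key are assumptions for illustration, since the text does not name the event parameter that carries the failure cause.

```python
from typing import Dict, Optional


def tts_result_event(succeeded: bool, cause: Optional[str] = None) -> Dict[str, str]:
    """Build the event the MG reports to the MGC after a TTS operation."""
    if succeeded:
        return {"event": "g/sc"}            # signal completion
    report = {"event": "aastts/ttsfail"}    # TTS operation failure
    if cause is not None:
        report["cause"] = cause             # hypothetical parameter name
    return report
```

The MG would place the returned event in its Notify request, and the MGC acknowledges it with the Notify response of step S304.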
Steps S303 and S304 are optional and are not required in the embodiments of the present invention.
The embodiments of the present invention are not limited to the solution of Embodiment 3: the MGC can also control playback of text media on the MG by carrying text range control parameters in other command requests that it sends.
By carrying text range control parameters in the command request sent to the MG, this embodiment allows text range control parameters to be applied to all kinds of media control operations, such as play, pause, resume, fast-forward, rewind, and playback speed or bandwidth adjustment, enabling interactive playback control of text media on the MG.
Embodiment 4
In this embodiment, the MGC controls playback of text media on the MG through a pause command that carries a text-based media operation indication.
As shown in FIG. 4, this embodiment may include the following steps:
S401. The MGC sends a pause command to the MG, requesting the MG to pause the media operation.
For example, while the MG is performing text-to-speech conversion on one or more text media segments, the MGC may, for some specific reason, want the MG to pause the operation. In that case the MGC can issue the H.248 pause command to the MG. Together with the pause command, the MGC can carry the text range format parameter and the text range value parameter, with the value "$" assigned. Here "$" is the Choose wildcard; it indicates that the MGC wants to obtain the specific text range at which the MG paused the media operation.
The MGC can assign a specific value, for example "Word", to the text range format parameter and "$" to the text range value parameter, which indicates that the MGC wants to know which word the text playback on the MG has reached when the pause takes effect. Alternatively, the MGC can assign "$" to both parameters, which leaves the granularity of the reported text range (word, sentence, or paragraph) to the MG. That is, the MG itself decides whether the position reached by the text media operation at the time of the pause is reported as a word, sentence, or paragraph position.
S402. The MG pauses the media operation as instructed by the MGC.
If the MG receives the pause instruction while it is performing text-to-speech conversion on a text media segment, it pauses that operation.
S403. The MG reports to the MGC, in a response message, the specific text range at which the pause was performed.
If the pause instruction issued by the MGC carries the text range format parameter "Word" and the text range value parameter "$", the text range that the MG reports to the MGC should be the word that text playback on the MG had reached when the pause took effect. If both the text range format parameter and the text range value parameter carried in the pause instruction are "$", the MG may report the position actually reached as a word, sentence, or paragraph position.
In this embodiment, text range control parameters are carried in the media control command, which implements control of text media on the MG. In addition, from the MG's feedback message the MGC can obtain the text range that the text media operation had reached on the MG when the MG acted on the command.
The embodiments of the present invention are not limited to this. The wildcard parameter "$" used in the text control information of Embodiment 4 can also be applied to operation instructions such as fast forward and rewind that the MGC issues for a text file played on the MG; that is, it is also applicable to Embodiment 1 and Embodiment 2.
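The handling of the "$" wildcard in steps S401 to S403 can be sketched as follows. This is an illustrative Python sketch under stated assumptions, not the encoding defined by H.248 or by the patent: `PlaybackPosition`, `resolve_pause_report`, the format names "Sentence" and "Paragraph", and the MG's default preference are all hypothetical. Only "Word" and "$" appear in the text.

```python
from dataclasses import dataclass
from typing import Tuple

CHOOSE = "$"  # the Choose wildcard described in the text


@dataclass
class PlaybackPosition:
    """Hypothetical record of where text playback paused (1-based indices)."""
    word: int
    sentence: int
    paragraph: int


def resolve_pause_report(fmt: str, value: str, pos: PlaybackPosition,
                         mg_preferred_format: str = "Sentence") -> Tuple[str, int]:
    """Return the (format, value) pair the MG would report for a pause.

    fmt   -- text range format parameter sent by the MGC ("Word", ... or "$")
    value -- text range value parameter sent by the MGC (expected to be "$")
    """
    if value != CHOOSE:
        raise ValueError("the value parameter is expected to be '$'")
    # If the format is also "$", the MG picks the granularity itself.
    chosen = mg_preferred_format if fmt == CHOOSE else fmt
    lookup = {
        "Word": pos.word,
        "Sentence": pos.sentence,    # assumed format name
        "Paragraph": pos.paragraph,  # assumed format name
    }
    if chosen not in lookup:
        raise ValueError(f"unsupported text range format: {chosen}")
    return chosen, lookup[chosen]
```

With the format fixed to "Word" the MG reports the word index; with both parameters set to "$" the MG falls back to its own preferred granularity.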
The text-based media control method of the embodiments of the present invention can be carried out through command requests between the MGC and the MG that include a text-based media control indication. The command types that the MGC uses toward the MG include Add, Modify, Subtract, Move, AuditValue, AuditCapabilities, Notify, and ServiceChange. The command parameters include Property, Signal, Event, and Statistic. The text-based media control indication can also be carried in a package, which aggregates parameters that are related to a given service.
The embodiments of the present invention make it possible to control operations such as play, pause, fast forward, and rewind of a text media file on the MG through a text-based media control indication, and the indication can be carried in several ways in the media control command request that the MGC sends to the MG. The embodiments thus implement interactive control of text media files over H.248 and provide a simple and effective solution for text-based media control.
An embodiment of the present invention provides an MG which, as shown in FIG. 5, includes:
a receiving unit 501, configured to receive a command request for a media operation, where the command request contains a text-based media operation indication; and
an operating unit 502, configured to operate on media data according to the text-based media operation indication.
On the basis of the above embodiment, the text-based media operation indication includes any one of the following: a skip offset indication signal; an offset value, or an offset value and an offset unit; a text range value parameter, or a combination of a text range format parameter and a text range value parameter; a text range parameter, which includes one or more text range values, or a combination of one or more text range values and a text range format parameter. The offset unit and the text range format parameter include a text-based unit, which is at least one of the following: word, sentence, or paragraph.
An embodiment of the present invention further provides a network system which, as shown in FIG. 6, includes:
a media gateway controller 601, configured to send a command request for a media operation to a media gateway, where the command request includes a text-based media operation indication; and
the media gateway 602, configured to receive the command request for the media operation, where the command request contains the text-based media operation indication, and to operate on media data according to the text-based media operation indication.
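The division of roles between the media gateway controller 601 and the media gateway 602, including the receiving unit and operating unit of FIG. 5, can be sketched as below. This is a hedged Python illustration, not the patent's design: every class, method, and field name is hypothetical, the indication is reduced to an offset value with an optional unit, the default unit is an assumption, and only movement by sentence over a non-empty script is implemented.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class TextMediaIndication:
    """Hypothetical text-based media operation indication."""
    offset_value: int                  # positive = forward, negative = backward
    offset_unit: Optional[str] = None  # "word", "sentence" or "paragraph"


@dataclass
class CommandRequest:
    command: str                       # e.g. "Modify"
    indication: TextMediaIndication


class MediaGateway:
    """Stands in for media gateway 602; assumes a non-empty sentence list."""

    def __init__(self, sentences: List[str]) -> None:
        self.sentences = sentences
        self.position = 0              # 0-based index of the current sentence

    def receive(self, request: CommandRequest) -> str:
        # Role of receiving unit 501: accept the request, pass on the indication.
        return self.operate(request.indication)

    def operate(self, indication: TextMediaIndication) -> str:
        # Role of operating unit 502: act on the text media.
        unit = indication.offset_unit or "sentence"  # assumed default unit
        if unit != "sentence":
            raise NotImplementedError("this sketch only moves by sentence")
        target = self.position + indication.offset_value
        # Clamp to the script boundaries instead of failing.
        self.position = max(0, min(len(self.sentences) - 1, target))
        return self.sentences[self.position]


class MediaGatewayController:
    """Stands in for media gateway controller 601."""

    def __init__(self, gateway: MediaGateway) -> None:
        self.gateway = gateway

    def send(self, command: str, indication: TextMediaIndication) -> str:
        return self.gateway.receive(CommandRequest(command, indication))
```

Clamping at the script boundaries is a design choice of the sketch; a gateway could equally report an error event when the requested offset runs past the end of the text.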
The MGC, the MG, and the network system provided by the embodiments of the present invention can implement control by the MGC of text media files on the MG with reference to the embodiments of the text-based media control method described above. A text-based media operation indication is included in the text media operation command request that the MGC sends to the MG, and the MG controls the playback of the text media file according to that indication. Text media can thus be controlled effectively and the various needs of multimedia applications can be met.
A person of ordinary skill in the art will understand that all or part of the processes of the method embodiments above can be carried out by a computer program instructing the relevant hardware. The program can be stored in a computer-readable storage medium and, when executed, can include the processes of the method embodiments above. The storage medium can be a magnetic disk, an optical disc, a read-only memory (ROM), a random access memory (RAM), or the like.
The above are only specific embodiments of the present invention, and the protection scope of the present invention is not limited to them. Any change or replacement that a person skilled in the art could readily conceive within the technical scope disclosed by the present invention shall fall within the protection scope of the present invention. The protection scope of the present invention shall therefore be determined by the protection scope of the claims.

Claims

1. A text-based media control method, comprising:
receiving, by a media gateway, a command request for a media operation, wherein the command request contains a text-based media operation indication; and
operating on media data according to the text-based media operation indication.
2. The method according to claim 1, wherein the media operation indication comprises a skip offset indication signal.
3. The method according to claim 2, wherein the skip offset indication signal comprises an operation object parameter, and the operation object parameter comprises at least one of the following parameters:
a signal identifier, indicating the signal on which the skip offset is to be performed;
a signal list identifier, indicating the signal list on which the skip offset is to be performed;
a media resource identifier, indicating the media resource on which the skip offset is to be performed.
4. The method according to claim 2, wherein the skip offset indication signal further comprises a skip offset value.
5. The method according to claim 4, wherein the skip offset indication signal further comprises a skip offset value unit; and the skip offset value unit comprises a text-based unit, including: word, sentence, or paragraph.
6. The method according to claim 1, wherein the media operation indication comprises an offset value, or comprises an offset value and an offset unit.
7. The method according to claim 6, wherein the offset unit comprises a text-based unit, which is at least one of the following: word, sentence, or paragraph.
8. The method according to claim 1, wherein
the text-based media operation indication comprises a text range value parameter, or a combination of a text range value parameter and a text range format parameter; or
the text-based media operation indication comprises a text range parameter, and the text range parameter comprises one or more text range values, or a combination of one or more text range values and a text range format parameter;
wherein the text range format parameter comprises a text-based unit, which is at least one of the following: word, sentence, or paragraph.
9. The method according to any one of claims 1 to 8, wherein the media data is media data in a text format.
10. A media gateway, comprising:
a receiving unit, configured to receive a command request for a media operation, wherein the command request contains a text-based media operation indication; and
an operating unit, configured to operate on media data according to the text-based media operation indication.
11. The media gateway according to claim 10, wherein the text-based media operation indication comprises at least one of the following information items:
a skip offset indication signal;
an offset value, or an offset value and an offset unit;
a text range value parameter, or a combination of a text range format parameter and a text range value parameter;
a text range parameter, comprising one or more text range values, or a combination of one or more text range values and a text range format parameter;
wherein the offset unit and the text range format parameter comprise a text-based unit, which is at least one of the following: word, sentence, or paragraph.
12. A network system, comprising:
a media gateway controller, configured to send a command request for a media operation to a media gateway, wherein the command request includes a text-based media operation indication; and
the media gateway, configured to receive the command request for the media operation, wherein the command request contains the text-based media operation indication, and to operate on media data according to the text-based media operation indication.
PCT/CN2009/076365 2009-01-12 2009-12-31 Method, apparatus, and system for controlling media based on texts WO2010078823A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN200910001788.7 2009-01-12
CN200910001788.7A CN101778090A (en) 2009-01-12 2009-01-12 Method, device and system based on text for controlling media

Publications (1)

Publication Number Publication Date
WO2010078823A1 true WO2010078823A1 (en) 2010-07-15

Family

ID=42316251

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2009/076365 WO2010078823A1 (en) 2009-01-12 2009-12-31 Method, apparatus, and system for controlling media based on texts

Country Status (2)

Country Link
CN (1) CN101778090A (en)
WO (1) WO2010078823A1 (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20030009337A1 (en) * 2000-12-28 2003-01-09 Rupsis Paul A. Enhanced media gateway control protocol
CN1874532A (en) * 2005-10-21 2006-12-06 华为技术有限公司 Method of media gateway controller for controlling media gateway to play back audio
CN1889515A (en) * 2006-04-03 2007-01-03 华为技术有限公司 Method for realizing recording pause function via H.248 protocol
CN1953053A (en) * 2005-10-21 2007-04-25 华为技术有限公司 A method to realize the function of text-to-speech convert

Also Published As

Publication number Publication date
CN101778090A (en) 2010-07-14

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 09837380

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 09837380

Country of ref document: EP

Kind code of ref document: A1