WO2012155659A1 - 一种基于远程呈现的媒体传输方法及系统 - Google Patents

一种基于远程呈现的媒体传输方法及系统 Download PDF

Info

Publication number
WO2012155659A1
WO2012155659A1 PCT/CN2012/072739 CN2012072739W WO2012155659A1 WO 2012155659 A1 WO2012155659 A1 WO 2012155659A1 CN 2012072739 W CN2012072739 W CN 2012072739W WO 2012155659 A1 WO2012155659 A1 WO 2012155659A1
Authority
WO
WIPO (PCT)
Prior art keywords
media
module
remote
terminal
logical channel
Prior art date
Application number
PCT/CN2012/072739
Other languages
English (en)
French (fr)
Inventor
叶小阳
孙博
Original Assignee
中兴通讯股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 中兴通讯股份有限公司 filed Critical 中兴通讯股份有限公司
Priority to US14/130,264 priority Critical patent/US9344475B2/en
Priority to EP12785453.7A priority patent/EP2731331A4/en
Publication of WO2012155659A1 publication Critical patent/WO2012155659A1/zh

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L65/00Network arrangements, protocols or services for supporting real-time applications in data packet communication
    • H04L65/60Network streaming of media packets
    • H04L65/75Media network packet handling
    • H04L65/765Media network packet handling intermediate
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00Television systems
    • H04N7/14Systems for two-way working
    • H04N7/141Systems for two-way working between two video terminals, e.g. videophone
    • H04N7/147Communication arrangements, e.g. identifying the communication as a video-communication, intermediate storage of the signals
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L65/00Network arrangements, protocols or services for supporting real-time applications in data packet communication
    • H04L65/1066Session management
    • H04L65/1069Session establishment or de-establishment
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L65/00Network arrangements, protocols or services for supporting real-time applications in data packet communication
    • H04L65/40Support for services or applications
    • H04L65/403Arrangements for multi-party communication, e.g. for conferences
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00Television systems
    • H04N7/14Systems for two-way working
    • H04N7/15Conference systems
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00Television systems
    • H04N7/14Systems for two-way working
    • H04N7/15Conference systems
    • H04N7/152Multipoint control units therefor

Definitions

  • the present invention relates to telepresence technology, and more particularly to a remote presentation based media transmission method and system. Background technique
  • Telepresence is an advanced remote video conferencing system. Telepresence is deeply loved by high-end users with its true presence. In telepresence systems, listening to the voice, real size, and witnessing the communication are directly related to whether the user can have an immersive experience. A very important technical indicator for measuring telepresence systems.
  • each conference site has only one video conference terminal.
  • the video conference terminal encodes and transmits one channel of audio and/or one channel of video, and receives and decodes one channel of audio and/or video. Since there is only one input source and output of the sound, the user cannot feel which direction the sound is emitted from the venue, and since there is only one video input source and output source, the local acquisition and encoding picture needs to capture the entire screen of the venue.
  • the conference can only select the splicing screen of a certain venue or multiple remote venues, so that the video sent or received cannot meet the requirements of the real people.
  • a single site can have multiple audio and/or video input and output devices.
  • each screen displays an image of an agent participant, and each corresponding participant participant corresponds to the same way.
  • Audio input through the orientation information of the audio and the professional camera orientation area acquisition, can realize the sound recognition and the size of the real person, further realizing the realistic effect of eye contact.
  • the current telepresence system generally develops from a traditional video conferencing system.
  • the multi-screen conference site is composed of multiple video conferencing terminals and multi-audio video peripherals, and multiple videos of one venue.
  • the conference terminal establishes a signaling connection and a media logical channel with the remote endpoint (which may be a video conference terminal or a multipoint control unit (MCU)), and finally transmits the audio and video code stream between the plurality of endpoint pairs, and
  • the speaker and display device output multiple streams.
  • This method is cumbersome, and requires multiple video conference terminals to process signaling in one site.
  • Each terminal occupies an IP address, or an endpoint ID number (such as an H.323 ID), or a conference number, which lacks between terminals.
  • Mutual information processing mechanisms such as agent information
  • the synchronization between multiple streams is very difficult, affecting the user's body. Summary of the invention
  • the main object of the present invention is to provide a media transmission method and system based on remote presentation, which is easy to operate and can improve the user experience.
  • a telepresence-based media transmission method comprising:
  • the primary remote presentation terminal of the local media transmission system When the connection is established, the primary remote presentation terminal of the local media transmission system performs signaling interaction with the remote endpoint to establish a media logical channel between the local media transmission system and the remote endpoint; the media transmission system and the remote end
  • the endpoints transmit the same type of media stream through a media logical channel or through multiple media logical channels, and receive the same type of media stream through a media logical channel or through multiple media logical channels.
  • the media logical channel between the local media transmission system and the remote end point is set up: respectively, a media logical channel for transmitting a media stream between the media transmission module and the remote end point of each remote presentation terminal of the local side is established, and each of the recording Corresponding information of the media logical channel and the location of the audio input device and/or the video input device;
  • the media media transmission system and the remote endpoint respectively send the same type of media stream through multiple media logical channels as:
  • the audio input device and/or the video input device sends the collected audio and/or video data to
  • the media codec module of the remote presentation terminal corresponding to the location is performed.
  • Encoding; the media codec module of each telepresence terminal respectively encodes the input audio and/or video data, and transfers the encoded media code stream to the corresponding media transmission module; respectively, the media transmission modules of each telepresence terminal respectively encode
  • the subsequent media stream is sent to the remote endpoint through a media logical channel corresponding to the media source type and location.
  • the media logical channel between the local media transmission system and the remote end point is set up: respectively, media logical channels for receiving media streams between the media transmission module and the remote end point of each remote presentation terminal on the local side are set, and each is recorded. Corresponding information of the media logical channel and the location of the audio output device and/or the video output device;
  • the medium-side media transmission system and the remote end point respectively receive the same type of media stream through multiple media logical channels: the media transmission module of each remote presentation terminal receives the remote end through the established media logical channel respectively.
  • the media code stream is forwarded to the corresponding media codec module according to the corresponding relationship between the media logical channel and the position of the audio output device or the video output device; the media codec modules of each remote presentation terminal respectively receive
  • the obtained media stream is decoded and then output to a corresponding audio output device and/or video output device for playback.
  • the media logical channel between the local media transmission system and the remote endpoint is: establishing a plurality of media logical channels for transmitting a media stream between the media transmission module and the remote endpoint of the primary telepresence terminal, and recording each media Corresponding information of the logical channel and the location of the audio input device and/or the video input device;
  • the media media transmission system and the remote endpoint respectively send the same type of media stream through multiple media logical channels as:
  • the audio input device and/or the video input device sends the collected audio and/or video data to a media codec module of the remote presentation terminal corresponding to the location; each media codec module respectively encodes the input audio and/or video data, and transfers the encoded media code stream to the media transmission module of the main remote presentation terminal;
  • the media transmission module of the presentation terminal respectively passes the media code stream encoded by the media coding and decoding module of the local side to the media source type and
  • the media logical channel corresponding to the location is sent to the remote endpoint.
  • the media logical channel between the local media transmission system and the remote endpoint is: establishing a plurality of media logical channels for receiving media streams between the media transmission module and the remote endpoint of the primary telepresence terminal, and recording each media Corresponding information of the logical channel and the location of the audio output device and/or the video output device;
  • the medium-side media transmission system and the remote end point respectively receive the same type of media stream through multiple media logical channels:
  • the media transmission module of the main telepresence terminal respectively receives the remote end through each established media logical channel.
  • Multi-channel media stream and according to the corresponding relationship between the media logical channel and the audio output device and/or the video output device, respectively, the received media code stream is respectively transferred to the media codec module corresponding to the remote presentation terminal; each remote presentation terminal
  • the media codec module decodes the received audio and/or video code streams, respectively, and then outputs them to the corresponding audio output device and/or video output device for playback.
  • the media logical channel between the local media transmission system and the remote endpoint is: establishing, for each media type, a media for transmitting the media stream between the media transmission module of the primary telepresence terminal and the remote endpoint a logical channel that records the media type and location of the audio input device and/or the video input device of the present side;
  • the media transmission system of the present side and the remote end point transmit the same type of media stream through a media logical channel as:
  • the audio input device and/or the video input device sends the collected audio and/or video data to the corresponding location.
  • the media codec module of the telepresence terminal each media codec module respectively encodes the input audio and/or video data, and transfers the media code stream encoded by 4 bar to the media transmission module of the main telepresence terminal;
  • the media transmission module of the terminal sends the encoded media code stream through a media logical channel between the main remote presentation terminal and the remote endpoint, and the sent media packet header carries the corresponding media type and location information.
  • the media logical channel between the local media transmission system and the remote endpoint is: establishing a media transmission module between the primary remote presentation terminal and the remote endpoint for each media type Receiving a media logical channel of the media stream, recording the media type and location of the audio output device and/or the video output device of the local side;
  • the medium-side media transmission system and the remote end point receive the same type of media stream through a media logical channel: the media transmission module of the main telepresence terminal receives the remote media code stream from the media logical channel, Transmitting the media code stream to a media codec module corresponding to the remote presentation terminal by parsing the media type and location information identified by the packet header; and respectively receiving, by the media codec module of each remote presentation terminal, the received audio and/or video code The stream is decoded and then output to the corresponding audio output device and/or video output device for playback.
  • the media logical channel is distinguished by an IP address and a port number, and different media logical channels have different IP addresses and/or port numbers.
  • a telepresence-based media transmission system comprising: a main telepresence terminal and at least one auxiliary telepresence terminal; wherein
  • the primary remote presentation terminal is configured to perform signaling interaction with the remote endpoint when establishing a connection between the media transmission system and the remote endpoint, and establish media logic between the media transmission system and the remote endpoint Channels; and transmitting the same type of media stream through a media logical channel or through multiple media logical channels, and receiving the same type of media stream through a set of media logical channels or through multiple media logical channels respectively
  • the secondary telepresence terminal is configured to perform media stream transmission and reception through a media logical channel established by the main remote presentation terminal.
  • the system further includes a multi-channel audio input device and/or a video input device, the main remote presentation terminal at least: a signaling processing module, a media codec module, and a media transmission module, where the secondary telepresence terminal includes at least: a decoding module, a media transmission module; wherein
  • the signaling processing module is configured to be responsible for signaling interaction, and perform media capability negotiation, respectively, to establish media logical channels for transmitting media streams between the media transmission module and the remote endpoint of each remote presentation terminal in the system, and record each Media logic channel with audio input device and / or video input Corresponding information of the device location;
  • the audio input device and/or the video input device are configured to transmit the collected audio and/or video data to a media codec module of the remote presentation terminal at the corresponding location for encoding;
  • the media codec module is configured to encode the input audio and/or video data, and then forward the encoded media code stream to the corresponding media transmission module;
  • the media transmission module is configured to send the media code stream encoded by the media codec module to the remote endpoint through a media logical channel corresponding to the media source type and location.
  • the system further includes a multi-channel audio output device and/or a video output device, and the main telepresence terminal includes at least: a signaling processing module, a media codec module, and a media transmission module, where the secondary telepresence terminal includes at least: a decoding module, a media transmission module; wherein
  • the signaling processing module is configured to be responsible for signaling interaction, and perform media capability negotiation, respectively, to establish a media logical channel for receiving media streams between the media transmission module and the remote endpoint of each remote presentation terminal in the system, and record Corresponding information of each media logical channel and the location of the audio output device and/or the video output device;
  • the media transmission module is configured to receive the media stream of the remote media through the established media logical channel, and forward the media code stream to the corresponding media according to the corresponding information of the media logical channel and the location of the audio output device or the video output device.
  • Codec module processing
  • the media codec module is configured to decode the received media code stream and then output to a corresponding audio output device and/or video output device for playback.
  • the system further includes a multi-channel audio input device and/or a video input device, the main remote presentation terminal at least: a signaling processing module, a media codec module, and a media transmission module, where the secondary telepresence terminal includes at least: a decoding module, a media transmission module; wherein
  • the signaling processing module is configured to be responsible for signaling interaction, and perform media capability negotiation, and establish a plurality of media logical channels for transmitting media streams between the media transmission module of the main telepresence terminal and the remote endpoint, and record each media.
  • the audio input device and/or the video input device are configured to send the collected audio and/or video data to a media codec module of the remote presentation terminal at the corresponding location;
  • the media codec module is configured to encode the input audio and/or video data, and forward the encoded media code stream to the media transmission module of the primary telepresence terminal, where the media codec module of the secondary telepresence terminal Transmitting the encoded media code stream to the media transmission module of the main telepresence terminal by the corresponding media transmission module, and the media codec module of the main telepresence terminal directly transfers the encoded media code stream to the media transmission module of the main telepresence terminal;
  • the media transmission module of the primary telepresence terminal is configured to separately send the media code stream encoded by each media encoding and decoding module in the system to the remote endpoint through a media logical channel corresponding to the media source type and location.
  • the system further includes a multi-channel audio output device and/or a video output device, and the main telepresence terminal includes at least: a signaling processing module, a media codec module, and a media transmission module, where the secondary telepresence terminal includes at least: a decoding module, a media transmission module; wherein
  • the signaling processing module is configured to be responsible for signaling interaction, and perform media capability negotiation, and establish a plurality of media logical channels for receiving media streams between the media transmission module of the main telepresence terminal and the remote endpoint, and record each media. Corresponding information of the logical channel and the location of the audio output device and/or the video output device;
  • the media transmission module of the main remote presentation terminal is configured to receive the remote multi-media media code stream through the established media logical channels, and according to the correspondence between the media logic channel and the audio output device and/or the video output device, Transmitting the received media stream to the media codec module of the corresponding remote presentation terminal, where the media code stream is directly transferred to the media codec module of the main remote presentation terminal, and the media code stream is transferred to the corresponding media transmission module.
  • a media codec module of the secondary telepresence terminal
  • the media codec module is configured to decode the received audio and/or video code stream, It is then output to the corresponding audio output device and/or video output device for playback.
  • the system further includes a multi-channel audio input device and/or a video input device, the main remote presentation terminal at least: a signaling processing module, a media codec module, and a media transmission module, where the secondary telepresence terminal includes at least: a decoding module, a media transmission module; wherein
  • the signaling processing module is configured to be responsible for signaling interaction, and perform media capability negotiation, and establish a media for sending media streams between the media transmission module of the primary telepresence terminal and the remote endpoint for each media type.
  • a logical channel that records the media type and location of the audio input device and/or the video input device of the present side;
  • the audio input device and/or the video input device are configured to send the collected audio and/or video data to a media codec module of the remote presentation terminal at the corresponding location;
  • the media codec module is configured to encode the input audio and/or video data, and the media codec module of the main telepresence terminal transfers the encoded media code stream to the corresponding media transmission module, and the media of the remote telepresence terminal The codec module transfers the encoded media code stream to the media transmission module of the main remote presentation terminal through the corresponding media transmission module;
  • the media transmission module of the main telepresence terminal is configured to send the received encoded media code stream through a media logical channel between the main telepresence terminal and the remote endpoint, and the sent media packet carries the corresponding media type and location. information.
  • the system further includes a multi-channel audio output device and/or a video output device, and the main telepresence terminal includes at least: a signaling processing module, a media codec module, and a media transmission module, where the secondary telepresence terminal includes at least: a decoding module, a media transmission module; wherein
  • the signaling processing module is configured to be responsible for signaling interaction, and perform media capability negotiation, and establish a receiving media logical channel between the media transmission module of the primary telepresence terminal and the remote endpoint for each media type, and record the recording Media type and location of the side audio output device and/or video output device;
  • a media transmission module of the primary telepresence terminal configured to be connected from the media logical channel Receiving the media code stream of the remote end, by parsing the media type and location information identified by the packet header, transferring the media code stream to the media codec module corresponding to the remote presentation terminal, where the media code stream is directly transferred to the main remote presentation terminal
  • the media codec module transfers the media code stream to the media codec module of the auxiliary telepresence terminal through the corresponding media transmission module;
  • the media codec module is configured to decode the received audio and/or video code stream and then output to a corresponding audio output device and/or video output device for playback.
  • the media logical channel is distinguished by an IP address and a port number, and different media logical channels have different IP addresses and/or port numbers.
  • the embodiment of the present invention is based on a telepresence media transmission method and system.
  • the main remote presentation terminal of the local media transmission system performs signaling interaction with the remote endpoint, and the local media transmission system and the remote endpoint are established.
  • Media logical channel between the local media transmission system and the remote endpoint through multiple media logical channels, or through a media logical channel for media transmission.
  • IP address, or endpoint ID number, or conference number only one number (IP address, or endpoint ID number, or conference number) needs to be called, so that the operation is simple, and the agent information can be exchanged in the remote presentation system. Realize the effect of listening to the sound, and solve the problem of synchronization between the streams, so as to improve the user experience.
  • FIG. 1 is a schematic flowchart of a media transmission method based on remote presentation according to an embodiment of the present invention
  • FIG. 2 is a schematic flowchart of media transmission by a media presentation system based on remote presentation according to a plurality of media logical channels according to Embodiment 1 of the present invention
  • FIG. 3 is a schematic flowchart of media receiving by a remote presentation-based media transmission system through multiple media logical channels according to Embodiment 2 of the present invention
  • FIG. 4 is a schematic flowchart of media transmission by a remote presentation-based media transmission system through multiple media logical channels according to Embodiment 3 of the present invention
  • FIG. 5 is a multi-media logic of a media transmission system based on remote presentation according to Embodiment 4 of the present invention.
  • FIG. 6 is a schematic flowchart of media transmission by a media presentation system based on remote presentation according to a media logical channel
  • FIG. 7 is a schematic flow chart of media receiving by a remote presentation-based media transmission system through a media logical channel according to Embodiment 6 of the present invention.
  • FIG. 8 is a schematic flow chart of a remote presentation based media transmission method according to Embodiment 7 of the present invention.
  • Embodiment 8 of the present invention is a schematic flow chart of a remote presentation based media transmission method according to Embodiment 8 of the present invention.
  • FIG. 10 is a schematic structural diagram of a media transmission system based on remote presentation according to an embodiment of the present invention
  • FIG. 11 is a schematic structural diagram of another media transmission system based on remote presentation according to an embodiment of the present invention.
  • FIG. 12 is a schematic structural diagram of another media transmission system based on remote presentation according to an embodiment of the present invention.
  • FIG. 13 is a schematic structural diagram of another media transmission system based on remote presentation according to an embodiment of the present invention.
  • FIG. 14 is a schematic structural diagram of another media transmission system based on remote presentation according to an embodiment of the present invention.
  • FIG. 15 is a schematic structural diagram of another media transmission system based on remote presentation according to an embodiment of the present invention. detailed description
  • FIG. 1 is a schematic flowchart of a media transmission method based on remote presentation according to an embodiment of the present invention. As shown in FIG. 1 , the method includes:
  • Step 101 When the connection is established, the primary remote presentation terminal of the local media transmission system performs signaling interaction with the remote endpoint, and establishes a media logical channel between the media transmission system and the remote endpoint of the local side.
  • the number of the audio and video input and output devices of the remote presentation terminal on both sides, the remote presentation terminal location information, and the media type are generally required.
  • each of the foregoing information is not required to be carried in the interactive message, and some of the information may be inferred by other information, for example, the message sent by the primary remote presentation terminal to the remote endpoint is carried in: For the left, center, and right video, the remote endpoint can further acquire the telepresence system with three video input devices.
  • the desired channel and media type (such as audio) may be established according to the location information of the audio input device, the audio output device, the video input device, and/or the video output device. , video) and location (such as left, middle, right) mapping, and the mapping relationship.
  • the main remote presentation terminal needs to record the correspondence between the media type and location information and the media logical channel identifier.
  • Step 102 The media transmission system of the local side and the remote end point transmit the same type of media stream through a media logical channel or through multiple media logical channels, and through a media logical channel or through multiple media logical channels respectively. Receive the same type of media stream.
  • Example 1 The technical solution of the present invention will be further described in detail below through specific embodiments.
  • Example 1 The technical solution of the present invention will be further described in detail below through specific embodiments.
  • Step 201 The primary telepresence terminal establishes a call between the local side and the remote end point, and the signaling processing module of the main telepresence terminal is responsible for signaling. Interacting, and separately performing media capability negotiation and establishing a media logical channel for transmitting a media stream between the media transmission module and the remote end point of each remote presentation terminal on the side, recording each media logical channel and audio input device and/or video input Correspondence information of the device location.
  • Step 202 The audio input device and/or the video input device send the collected audio and/or video data to the media codec module of the remote presentation terminal at the corresponding location for encoding, and the media codec module of each remote presentation terminal respectively inputs the input
  • the audio and/or video data is encoded and the encoded media stream is forwarded to a corresponding media transmission module.
  • Step 203 The media transmission module of each telepresence terminal respectively transmits the encoded media code stream through a media logical channel corresponding to the media source type and location, that is, by being located with the audio input device and/or the video input device.
  • the corresponding media logical channel is sent to the remote endpoint.
  • FIG. 3 is a schematic flowchart of media receiving by a remote presentation-based media transmission system through multiple media logical channels according to Embodiment 2 of the present invention. As shown in FIG. 3, the method includes:
  • Step 301 The primary telepresence terminal establishes a call between the local end and the remote end point, and the signaling processing module of the main telepresence terminal is responsible for signaling interaction, and separately performs media capability negotiation, and establishes each media transmission module and the remote end point.
  • Step 302 The media transmission module of each telepresence terminal respectively receives the remote multi-media media stream through the established media logical channel, and according to the correspondence between the media logical channel and the audio output device or the video output device, the media stream is Transfer to the corresponding media codec module for processing.
  • Step 303 The media codec module of each remote presentation terminal separately receives the received media stream. The decoding is performed, and then output to the corresponding audio output device and/or video output device for playback.
  • Example 3 The media codec module of each remote presentation terminal separately receives the received media stream. The decoding is performed, and then output to the corresponding audio output device and/or video output device for playback.
  • FIG. 4 is a schematic flowchart of media transmission by a remote presentation-based media transmission system through multiple media logical channels according to Embodiment 3 of the present invention. As shown in FIG. 4, the method includes:
  • Step 401 The primary telepresence terminal establishes a call between the local end and the remote end point, and the signaling processing module of the main telepresence terminal is responsible for signaling interaction, and performs media capability negotiation, and establishes a media transmission module of the main telepresence terminal and the far A plurality of media logical channels for transmitting media streams between the end endpoints, and recording corresponding information of the media logical channels and the audio input devices and/or the video input device locations.
  • Step 402 The audio input device and/or the video input device send the collected audio and/or video data to a media codec module of the remote presentation terminal at the corresponding location, and each media codec module respectively inputs the input audio and/or video data. Encoding is performed, and the encoded media code stream is transferred to the media transmission module of the main telepresence terminal.
  • the media codec module of the primary telepresence terminal directly transfers the encoded media code stream to the media transmission module of the primary telepresence terminal, and the media codec module of the secondary telepresence terminal transmits the code through the corresponding media transmission module.
  • the media code stream is transferred to the media transmission module of the main telepresence terminal.
  • Step 403 The media transmission module of the primary telepresence terminal respectively sends the media code stream encoded by the media encoding and decoding module of the local side to the remote endpoint through a media logical channel corresponding to the type and location of the media source (audio and video input device location). .
  • Example 4 The media transmission module of the primary telepresence terminal respectively sends the media code stream encoded by the media encoding and decoding module of the local side to the remote endpoint through a media logical channel corresponding to the type and location of the media source (audio and video input device location).
  • FIG. 5 is a schematic flowchart of media receiving by a remote presentation-based media transmission system through multiple media logical channels according to Embodiment 4 of the present invention. As shown in FIG. 5, the method includes:
  • Step 501 The main telepresence terminal establishes a call between the local side and the remote end point, and the main remote presentation
  • the signaling processing module of the terminal is responsible for signaling interaction, and performs media capability negotiation, and establishes multiple media logical channels for receiving media streams between the media transmission module of the main telepresence terminal and the remote endpoint, and records each media logical channel. Correspondence information with the location of the audio output device and/or video output device.
  • Step 502 The media transmission module of the main telepresence terminal receives the remote multi-media media stream through the established media logical channels, and receives the corresponding relationship between the media logical channel and the audio output device and/or the video output device.
  • the obtained media code stream is respectively transferred to the media codec module corresponding to the remote presentation terminal for processing.
  • the media transmission module of the primary telepresence terminal directly transfers the media code stream to the media codec module of the primary telepresence terminal, and the media code stream of the secondary telepresence terminal transfers the media code stream to the secondary telepresence terminal.
  • Media codec module directly transfers the media code stream to the media codec module of the primary telepresence terminal, and the media code stream of the secondary telepresence terminal transfers the media code stream to the secondary telepresence terminal.
  • Step 503 The media codec module of each telepresence terminal separately decodes the received audio and/or video code stream, and outputs the same to the corresponding audio output device and/or video output device for playing.
  • Example 5 The media codec module of each telepresence terminal separately decodes the received audio and/or video code stream, and outputs the same to the corresponding audio output device and/or video output device for playing.
  • FIG. 6 is a schematic diagram of a process for media transmission by a media presentation system based on remote presentation according to a remote presentation through a media logical channel. As shown in FIG. 6, the method includes:
  • Step 601 The main telepresence terminal establishes a call between the local side and the remote end point, and the signaling processing module of the main telepresence terminal is responsible for signaling interaction, and performs media capability negotiation, and establishes a media transmission module of the main telepresence terminal and the far A media logical channel between the end endpoints for transmitting the media stream, recording the media type and location of the audio input device and/or the video input device of the present side.
  • Step 602 The audio input device and/or the video input device send the collected audio and/or video data to a media codec module of the remote presentation terminal at the corresponding location, and each media codec module respectively inputs the input audio and/or video data. Encoding is performed, and the encoded media code stream is transferred to the media transmission module of the main telepresence terminal. It should be noted that the media codec module of the primary telepresence terminal directly transfers the encoded media code stream to the media transmission module of the primary telepresence terminal, and the media codec module of the secondary telepresence terminal transmits the code through the corresponding media transmission module. The media code stream is transferred to the media transmission module of the main telepresence terminal.
  • Step 603 The media transmission module of the primary telepresence terminal transmits the encoded media code through a media logical channel between the primary telepresence terminal and the remote endpoint.
  • FIG. 7 is a schematic flowchart of media receiving by a telepresence-based media transmission system through a media logical channel according to Embodiment 6 of the present invention. As shown in FIG. 7, the method includes:
  • Step 701 The main telepresence terminal establishes a call between the local side and the remote end point, and the signaling processing module of the main telepresence terminal is responsible for signaling interaction, and performs media capability negotiation, and establishes a media transmission module of the main telepresence terminal and the far A media logical channel between the end endpoints that receives the media stream, recording the media type and location of the audio output device and/or the video output device of the present side.
  • Step 702 The media codec module of the primary telepresence terminal receives the remote media code stream from the media logical channel, and forwards the media code stream to the media codec module corresponding to the remote presentation terminal.
  • the primary telepresence terminal forwards the media code stream to the corresponding media codec module for decoding by parsing the media type and location information identified by the media packet header.
  • the media transmission module of the primary telepresence terminal directly transfers the media code stream to the media codec module of the primary telepresence terminal, and the media code module of the secondary telepresence terminal transfers the media code stream to the media codec module of the secondary telepresence terminal.
  • Step 703 The media codec module of each telepresence terminal separately decodes the received audio and/or video code stream, and outputs the data to the corresponding audio output device and/or video output device. Line play.
  • Example 7 The media codec module of each telepresence terminal separately decodes the received audio and/or video code stream, and outputs the data to the corresponding audio output device and/or video output device. Line play.
  • the telepresence-based media transmission system includes at least two remote presentation terminals and a plurality of audio and video input/output devices, wherein one remote presentation terminal (hereinafter referred to as “main remote presentation terminal") is responsible for signaling and
  • the agent of the media includes at least a protocol signaling processing module, a media codec module, and a media transmission module; and another one or more remote presentation terminals (hereinafter referred to as “secondary remote presentation terminals”) include at least a media codec module and a media transmission module.
  • One or more secondary telepresence terminals are respectively connected to the main telepresence terminal, and the main telepresence terminal and the auxiliary telepresence terminal are respectively connected with at least one audio input, one audio output, one video input and one video output device.
  • the primary telepresence terminal registers on the gatekeeper (GK) and provides the registered endpoint ID number.
  • the media logical channel established between the remote remote terminal and the remote end point is established between the remote presentation terminal and the remote end end of the local remote presentation terminal, that is, different addresses of the local terminals are used respectively, and the local end is used.
  • the code stream between the remote presentation terminal and the remote end is directly processed by each remote presentation terminal at the local end, and the primary and secondary remote presentation terminals of the local end respectively process the corresponding media stream to receive and transmit work.
  • FIG. 8 is a schematic flowchart of a remote presentation-based media transmission method according to Embodiment 7 of the present invention. As shown in FIG. 8, the method includes:
  • Step 801 The user inputs a number (such as an IP address, or an H.323 ID, or a conference number, etc.) of the called remote endpoint by calling a central control interface or a remote controller of the main remote presentation terminal, and the remote endpoint processes the call. Call, establish a connection. If the call connection is completed through the H.225 protocol, only the IP address or H.323 ID of the local remote presence terminal needs to be used for the call.
  • a number such as an IP address, or an H.323 ID, or a conference number, etc.
  • Step 802 The primary remote presentation terminal acquires the information of the secondary remote presentation terminal, including the media processing capability set and the media transceiver address of each secondary remote presentation terminal used by the local end in the current call.
  • Step 803 Perform media capability negotiation between the primary telepresence terminal and the remote endpoint.
  • the main remote presentation terminal includes the information of the secondary remote presentation terminal and the primary remote presentation terminal, and the identifier according to the location of the terminal, and constructs a capability set including the channel media type, quantity, and location identification information to be established to the remote endpoint.
  • the H.245 protocol can be used to notify the peer of the capability set and the capability description type supported by the local end when transmitting the capability set, and different code streams of different locations are distinguished by the capability description type.
  • 1, 2, 3 respectively represent left, center, and right audio
  • 4, 5, and 6 respectively represent left, center, and right video.
  • Step 804 Open the bidirectional media logical channel.
  • the local remote telepresence terminal sends an H.245 open media logical channel (openLogicalChannd) message, and the structure describes the corresponding relationship between the channel identifier and the media type and location, and the feature description of the channel itself, including at least the media sending address, respectively.
  • the remote endpoint replies to open the logical channel acknowledgement message openLogicalChannd Ack, the message includes at least the receiving address (IP address and port number) of the channel, and the local remote telepresence terminal records the sending channel information, including the channel identifier and the media type and location. Correspondence, receiving and sending addresses, etc.
  • Multiple transmit logical channels are established in the above manner.
  • the remote endpoint opens multiple media logical channels to the local end in the above manner.
  • Step 805 The local remote presentation terminal notifies the media transmission module of each remote presentation terminal to send and receive media stream data through the corresponding channel.
  • Step 806 The remote presentation terminals of the local end respectively transmit the multiple code streams between the remote end terminals and the remote end points.
  • the code stream collected by the audio or video I/O device is sent to the codec module of the terminal at the corresponding location for encoding, and then the corresponding media transmission module enters The line is sent, and the corresponding channel is selected according to the media logical channel information recorded above according to the location of the media source, for example, the left channel audio is transmitted through channel 1.
  • the local media transmission module receives the media code stream, and forwards to the corresponding media codec module according to the recorded media logical channel information, such as the received left channel video. , output to the corresponding location of the audio or video device to play.
  • Step 807 At the end, the local main remote presentation terminal notifies the media transmission module of each remote presentation terminal to stop media stream monitoring.
  • Step 808 The main remote presentation terminal is responsible for completing the termination of the conference, first closing each media logical channel, and finally completing the session removal.
  • each media logical channel is established between the primary telepresence terminal and the remote endpoint, and all media streams are sent and received by the main remote presentation terminal, and the main remote presentation terminal completes the code stream forwarding between the secondary remote presentation terminal and the secondary remote presentation terminal.
  • FIG. 9 is a schematic flowchart of a remote presentation-based media transmission method according to Embodiment 8 of the present invention. As shown in FIG. 9, the method includes:
  • Step 901 The user inputs a number (such as an IP address, or an H.323 ID, or a conference number, etc.) of the called remote endpoint by calling a central control interface or a remote controller of the main remote presentation terminal, and the remote endpoint processes the call. Call, establish a connection. If the call connection is completed through the H.225 protocol, only the IP address or H.323 ID of the local remote presence terminal needs to be used for the call.
  • a number such as an IP address, or an H.323 ID, or a conference number, etc.
  • Step 902 The primary telepresence terminal acquires the information of the secondary telepresence terminal, including the media processing capability set of each secondary telepresence terminal used by the local end, and the media transceiving address (including the IP address and the port number).
  • Step 903 Perform media capability negotiation between the primary telepresence terminal and the remote endpoint.
  • the capability set and the capability description type distinguish different streams of different locations by the capability description type.
  • a plurality of audio and/or video descriptors are added, and different values are assigned to different types and positions, for example, 1, 2, and 3 respectively represent left, middle, and right audio, 4, 5, and 6 respectively indicate left, center, and right video.
  • Step 904 Open the bidirectional media logical channel.
  • the local remote telepresence terminal sends an openLogicalChannel message, and the message uses the address of the local remote retrieving terminal to distinguish different channels, the same IP address and different port numbers.
  • the remote endpoint replies to open the logical channel acknowledgement message openLogicalChannel Ack, the message includes at least the receiving address (IP address and port number) of the channel, and the local remote telepresence terminal records the sending channel information, including the channel identifier and the media type and location. Correspondence, receiving and sending addresses, etc.
  • a plurality of transmission logical channels are established in the above manner.
  • the remote endpoint opens multiple media logical channels to the local primary remote presentation terminal in the above manner.
  • Step 905 The local remote presentation terminal establishes a media forwarding channel with each secondary remote presentation terminal, and maintains a forwarding channel between the primary remote presentation terminal and the secondary remote presentation terminal, and a remote remote presentation terminal and a remote endpoint.
  • Step 906 When the local end sends the code stream through different media logical channels, the code stream collected by the audio and/or video I/O device is sent to the codec module of the terminal at the corresponding location for encoding, and then the corresponding secondary remote presentation terminal media
  • the transmission module is transferred to the main remote presentation terminal through the corresponding forwarding channel, and the main remote presentation terminal passes between the recorded and the remote endpoint according to the location of the media source.
  • the corresponding media logical channel is sent, for example, the left channel audio is sent through channel 1.
  • the local remote presentation terminal media transmission module receives the media code stream, according to the recorded media logical channel information, such as the received left channel video, through the corresponding established above.
  • the forwarding channel between the primary telepresence terminal and the secondary telepresence terminal is forwarded to the media codec module of the corresponding secondary remote presentation terminal for decoding, and output to the audio or video device of the corresponding location for playing.
  • Step 907 At the end, the local main remote presentation terminal notifies the media transmission module of each terminal to stop media stream monitoring, and close the forwarding channel between the secondary telepresence terminal and the main telepresence terminal.
  • Step 908 The main remote presentation terminal is responsible for completing the termination of the conference, first closing each media logical channel, and finally completing the session removal.
  • the primary remote presentation terminal establishes a connection with a remote endpoint (which may be an MCU or a remote presentation terminal), which may be a point-to-point conference or a multipoint conference.
  • a remote endpoint which may be an MCU or a remote presentation terminal
  • the primary remote presentation may be The terminal initiates the call, or the primary remote presentation terminal accepts the call of the remote endpoint.
  • the embodiment of the present invention further provides a telepresence-based media transmission system, where the system includes: a main telepresence terminal and at least one auxiliary telepresence terminal;
  • the primary remote presentation terminal is configured to perform signaling interaction with the remote endpoint when establishing a connection between the media transmission system and the remote endpoint, and establish media logic between the media transmission system and the remote endpoint Channels; and transmitting the same type of media stream through a media logical channel or through multiple media logical channels, and receiving the same type of media stream through a set of media logical channels or through multiple media logical channels respectively ;
  • the secondary telepresence terminal is configured to perform media stream transmission and reception by using a media logical channel established by the main remote presentation terminal.
  • the main remote presentation terminal includes: a signaling processing module, a media codec module, and a media transmission module, where the secondary remote presentation terminal includes: a media codec module and a media transmission module;
  • the signaling processing module is configured to be responsible for signaling interaction, and perform media capability negotiation, respectively, to establish a media logical channel for transmitting a media stream between the media transmission module and the remote endpoint of each remote presentation terminal in the system, and record each Corresponding information of the media logical channel and the location of the audio input device and/or the video input device;
  • the audio input device and/or the video input device are configured to send the collected audio and/or video data to a media codec module of the remote presentation terminal at the corresponding location for encoding;
  • the media codec module is configured to encode the input audio and/or video data, and forward the encoded media code stream to the corresponding media transmission module;
  • the media transmission module is configured to send the media code stream encoded by the media codec module to the remote endpoint through a media logical channel corresponding to the media source type and location.
  • FIG. 11 is a schematic structural diagram of another remote transmission-based media transmission system according to an embodiment of the present invention. As shown in FIG. 11, the system may further include multiple audio output devices and/or video output devices, where the primary remote presentation terminal is at least The device includes: a signaling processing module, a media codec module, and a media transmission module, where the secondary remote presentation terminal includes: a media codec module and a media transmission module;
  • the signaling processing module is configured to be responsible for signaling interaction, and perform media capability negotiation, respectively, to establish a media logical channel for receiving media streams between the media transmission module and the remote endpoint of each remote presentation terminal in the system, and record Corresponding information of each media logical channel and the location of the audio output device and/or the video output device;
  • the media transmission module is configured to receive the media stream of the remote media through the established media logical channel, and forward the media code stream to the corresponding media according to the corresponding information of the media logical channel and the location of the audio output device or the video output device.
  • Codec module processing The media codec module is configured to decode the received media code stream, and then output to a corresponding audio output device and/or a video output device for playing.
  • FIG. 12 is a schematic structural diagram of another medium transmission system based on remote presentation according to an embodiment of the present invention.
  • the system further includes a multi-channel audio input device and/or a video input device, and the main remote presentation terminal includes at least a signaling processing module, a media codec module, and a media transmission module, where the secondary remote presentation terminal includes: a media codec module and a media transmission module;
  • the signaling processing module is configured to be responsible for signaling interaction, and perform media capability negotiation, and establish a plurality of media logical channels for transmitting media streams between the media transmission module of the main telepresence terminal and the remote endpoint, and record each media. Corresponding information of the logical channel and the location of the audio input device and/or the video input device;
  • the audio input device and/or the video input device configured to send the collected audio and/or video data to a media codec module of the remote presentation terminal at the corresponding location;
  • the media codec module is configured to encode the input audio and/or video data, and forward the encoded media code stream to the media transmission module of the main remote presentation terminal, where the media codec module of the secondary telepresence terminal Transmitting the encoded media code stream to the media transmission module of the main telepresence terminal by the corresponding media transmission module, and the media codec module of the main telepresence terminal directly transfers the encoded media code stream to the media transmission module of the main telepresence terminal;
  • the media transmission module of the primary telepresence terminal is configured to separately send the media code stream encoded by each media codec module in the system to the remote endpoint through a media logical channel corresponding to the media source type and location.
  • FIG. 13 is a schematic structural diagram of another medium transmission system based on remote presentation according to an embodiment of the present invention.
  • the system further includes a multi-channel audio output device and/or a video output device, where the main remote presentation terminal includes at least a signaling processing module, a media codec module, and a media transmission module, where the secondary remote presentation terminal includes: a media codec module and a media transmission module; among them,
  • the signaling processing module is configured to be responsible for signaling interaction, and perform media capability negotiation, and establish a plurality of media logical channels for receiving media streams between the media transmission module of the main telepresence terminal and the remote endpoint, and record each media. Corresponding information of the logical channel and the location of the audio output device and/or the video output device;
  • the media transmission module of the primary telepresence terminal is configured to receive the remote multi-media media stream respectively through the established media logical channels, and according to the correspondence between the media logical channel and the audio output device and/or the video output device, Transmitting the received media stream to the media codec module of the corresponding remote presentation terminal, where the media code stream is directly transferred to the media codec module of the main remote presentation terminal, and the media code stream is transferred to the corresponding media transmission module.
  • a media codec module of the secondary telepresence terminal
  • the media codec module is configured to decode the received audio and/or video code stream, and then output to a corresponding audio output device and/or video output device for playing.
  • FIG. 14 is a schematic structural diagram of another media transmission system based on remote presentation according to an embodiment of the present invention.
  • the system further includes a multi-channel audio input device and/or a video input device, where the main remote presentation terminal includes at least a signaling processing module, a media codec module, and a media transmission module, where the secondary remote presentation terminal includes: a media codec module and a media transmission module;
  • the signaling processing module is configured to be responsible for signaling interaction, and perform media capability negotiation, and establish a media for sending media streams between the media transmission module of the primary telepresence terminal and the remote endpoint for each media type.
  • a logical channel that records the media type and location of the audio input device and/or the video input device of the present side;
  • the audio input device and/or the video input device configured to send the collected audio and/or video data to a media codec module of the remote presentation terminal at the corresponding location;
  • the media codec module is configured to encode input audio and/or video data
  • the media codec module of the telepresence terminal transfers the encoded media code stream to the corresponding media transmission module
  • the media codec module of the auxiliary telepresence terminal transfers the encoded media code stream to the main remote presentation terminal through the corresponding media transmission module.
  • the media transmission module of the main telepresence terminal is configured to send the received encoded media code stream through a media logical channel between the main telepresence terminal and the remote endpoint, where the sent media packet carries the corresponding media type and location. information.
  • FIG. 15 is a schematic structural diagram of another medium transmission system based on remote presentation according to an embodiment of the present invention.
  • the system further includes a multi-channel audio output device and/or a video output device, where the main remote presentation terminal includes at least a signaling processing module, a media codec module, and a media transmission module, where the secondary telepresence terminal includes: a media codec module and a media transmission module; wherein the signaling processing module is configured to be responsible for signaling interaction, and Perform media capability negotiation, establish a receiving media logical channel between the media transmission module of the main telepresence terminal and the remote endpoint for each media type, and record the media type and location of the audio output device and/or the video output device of the local side. ;
  • the media transmission module of the primary remote presentation terminal is configured to receive a media stream of the remote media from the media logical channel, and forward the media code stream to the corresponding remote presentation terminal by parsing the media type and location information identified by the packet header
  • the media codec module processes, wherein the media code stream is directly transferred to the media codec module of the main remote presentation terminal, and the media code stream is transferred to the media codec module of the auxiliary remote presentation terminal by the corresponding media transmission module;
  • the media codec module is configured to decode the received audio and/or video code stream, and then output to a corresponding audio output device and/or video output device for playing.
  • different media logical channels can be distinguished by an IP address and a port number, and different media logical channels correspond to different IP addresses and/or port numbers.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Business, Economics & Management (AREA)
  • General Business, Economics & Management (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
  • Telephonic Communication Services (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)

Description

一种基于远程呈现的媒体传输方法及系统 技术领域
本发明涉及远程呈现技术, 尤其涉及一种基于远程呈现的媒体传输方 法及系统。 背景技术
远程呈现(telepresence )是一种高级的远程视频会议系统。 远程呈现 以其真实的临场感深受高端用户的喜爱, 在远程呈现系统中, 听声辨位、 真身大小、 目艮神交流直接关系到用户是否能够有身临其境的感受, 因此是 衡量远程呈现系统非常重要的技术指标。
在传统视频会议系统中, 每个会场仅有一个视频会议终端, 除了辅流 视频外, 该视频会议终端编码和发送一路音频和 /或一路视频, 接收并解码 输出一路音频和 /或视频。 由于声音的输入源和输出只有一个, 用户无法感 受到声音从会场的哪个方位发出, 并且, 由于视频输入源和输出源只有一 个, 因此本端的采集编码画面需要捕捉会场整体画面, 如果是多点会议只 能选看某一会场或者多个远端会场的拼接画面, 从而无论是发送还是接收 的视频都无法达到真人大 '』、的要求。
在远程呈现会议系统中, 单个会场可以有多个音频和 /或视频的输入输 出设备, 多屏会场中, 每个屏幕显示一处坐席与会者的图像, 相应的每处 坐席与会者对应了一路音频输入, 通过音频的方位信息和专业摄像头定向 区域采集, 可以实现听声辨位和真人大小, 进一步实现眼神交流的逼真效 果。
但是, 当前远程呈现系统一般都是从传统的视频会议系统发展而来, 多屏会场由多个视频会议终端和多音视频外设组成, 一个会场的多个视频 会议终端分别与远端端点(可以是视频会议终端或者多点控制单元( MCU ) ) 建立信令连接和媒体逻辑通道, 最终在上述多个端点对之间传送音视频码 流, 通过分列的音箱、 显示设备输出多路码流。 这种方式操作比较繁瑣, 并且在一个会场需要多个视频会议终端处理信令, 各终端分别占用一个 IP 地址、 或者端点 ID号 (如 H.323 ID )、 或者会议号, 缺乏各终端之间相互 信息处理的机制 (比如坐席信息), 而且多路码流之间的同步非常困难, 影 响用户体一险。 发明内容
有鉴于此, 本发明的主要目的在于提供一种基于远程呈现的媒体传输 方法及系统, 操作简便, 且能提高用户体验。
为达到上述目的, 本发明实施例的技术方案是这样实现的:
一种基于远程呈现的媒体传输方法, 媒体传输系统包括一个主远程呈 现终端和至少一个辅远程呈现终端, 该方法包括:
建立连接时, 由本侧媒体传输系统的主远程呈现终端与远端端点进行 信令交互, 建立所述本侧媒体传输系统与远端端点之间的媒体逻辑通道; 本侧媒体传输系统与远端端点之间通过一个媒体逻辑通道或者分别通 过多个媒体逻辑通道对同一类型的媒体流进行发送, 以及通过一个媒体逻 辑通道或者分别通过多个媒体逻辑通道对同一类型的媒体流进行接收。
所述建立本侧媒体传输系统与远端端点之间的媒体逻辑通道为: 分别 建立本侧各远程呈现终端的媒体传输模块与远端端点之间用于发送媒体流 的媒体逻辑通道, 记录各媒体逻辑通道与音频输入设备和 /或视频输入设备 位置的对应信息;
所述本侧媒体传输系统与远端端点之间分别通过多个媒体逻辑通道对 同一类型的媒体流进行发送为: 音频输入设备和 /或视频输入设备将采集的 音频和 /或视频数据发送给对应位置的远程呈现终端的媒体编解码模块进行 编码; 各远程呈现终端的媒体编解码模块分别对输入的音频和 /或视频数据 进行编码, 并把编码后的媒体码流转给对应的媒体传输模块; 各远程呈现 终端的媒体传输模块分别把编码后的媒体码流通过与媒体源类型和位置相 对应的媒体逻辑通道发送给远端端点。
所述建立本侧媒体传输系统与远端端点之间的媒体逻辑通道为: 分别 建立本侧各远程呈现终端的媒体传输模块与远端端点之间用于接收媒体流 的媒体逻辑通道, 记录各媒体逻辑通道与音频输出设备和 /或视频输出设备 位置的对应信息;
所述本侧媒体传输系统与远端端点之间分别通过多个媒体逻辑通道对 同一类型的媒体流进行接收为: 各远程呈现终端的媒体传输模块通过建立 的媒体逻辑通道分别接收远端的多路媒体码流, 并根据媒体逻辑通道与音 频输出设备或视频输出设备位置的对应关系, 将媒体码流分别转给对应的 媒体编解码模块处理; 各远程呈现终端的媒体编解码模块分别对接收到的 媒体码流进行解码, 之后输出到对应的音频输出设备和 /或视频输出设备进 行播放。
所述建立本侧媒体传输系统与远端端点之间的媒体逻辑通道为: 建立 主远程呈现终端的媒体传输模块与远端端点之间用于发送媒体流的多个媒 体逻辑通道, 记录各媒体逻辑通道与音频输入设备和 /或视频输入设备位置 的对应信息;
所述本侧媒体传输系统与远端端点之间分别通过多个媒体逻辑通道对 同一类型的媒体流进行发送为: 音频输入设备和 /或视频输入设备将采集的 音频和 /或视频数据发送给对应位置的远程呈现终端的媒体编解码模块; 各 媒体编解码模块分别对输入的音频和 /或视频数据进行编码, 并把编码后的 媒体码流转给主远程呈现终端的媒体传输模块; 主远程呈现终端的媒体传 输模块分别把本侧媒体编解码模块编码后的媒体码流通过与媒体源类型和 位置相对应的媒体逻辑通道发送给远端端点。
所述建立本侧媒体传输系统与远端端点之间的媒体逻辑通道为: 建立 主远程呈现终端的媒体传输模块与远端端点之间用于接收媒体流的多个媒 体逻辑通道, 记录各媒体逻辑通道与音频输出设备和 /或视频输出设备位置 的对应信息;
所述本侧媒体传输系统与远端端点之间分别通过多个媒体逻辑通道对 同一类型的媒体流进行接收为: 主远程呈现终端的媒体传输模块通过建立 的各媒体逻辑通道分别接收远端的多路媒体码流, 并根据媒体逻辑通道与 音频输出设备和 /或视频输出设备的对应关系, 将收到的媒体码流分别转给 对应远程呈现终端的媒体编解码模块处理; 各远程呈现终端的媒体编解码 模块分别对接收到的音频和 /或视频码流进行解码, 之后输出到对应的音频 输出设备和 /或视频输出设备进行播放。
所述建立本侧媒体传输系统与远端端点之间的媒体逻辑通道为: 对每 一种媒体类型建立主远程呈现终端的媒体传输模块与远端端点之间的一个 用于发送媒体流的媒体逻辑通道, 记录本侧音频输入设备和 /或视频输入设 备的媒体类型和位置;
所述本侧媒体传输系统与远端端点之间通过一个媒体逻辑通道对同一 类型的媒体流进行发送为: 音频输入设备和 /或视频输入设备将采集的音频 和 /或视频数据发送给对应位置的远程呈现终端的媒体编解码模块; 各媒体 编解码模块分别对输入的音频和 /或视频数据进行编码, 并 4巴编码后的媒体 码流转给主远程呈现终端的媒体传输模块; 主远程呈现终端的媒体传输模 块将所述编码后的媒体码流通过主远程呈现终端与远端端点之间的媒体逻 辑通道发送 , 发送的媒体包头中携带相应的媒体类型和位置信息。
所述建立本侧媒体传输系统与远端端点之间的媒体逻辑通道为: 对每 一种媒体类型建立主远程呈现终端的媒体传输模块与远端端点之间的一个 接收媒体码流的媒体逻辑通道, 记录本侧音频输出设备和 /或视频输出设备 的媒体类型和位置;
所述本侧媒体传输系统与远端端点之间通过一个媒体逻辑通道对同一 类型的媒体流进行接收为: 主远程呈现终端的媒体传输模块从所述媒体逻 辑通道接收远端的媒体码流, 通过解析包头所标识的媒体类型和位置信息, 将所述媒体码流转给对应远程呈现终端的媒体编解码模块处理; 各远程呈 现终端的媒体编解码模块分别对接收到的音频和 /或视频码流进行解码, 之 后输出到相应的音频输出设备和 /或视频输出设备进行播放。
通过 IP地址和端口号区分媒体逻辑通道, 不同的媒体逻辑通道对应的 IP地址和 /或端口号不同。
一种基于远程呈现的媒体传输系统, 包括: 主远程呈现终端和至少一 个辅远程呈现终端; 其中,
所述主远程呈现终端, 设置为在建立所述媒体传输系统与远端端点之 间的连接时, 与远端端点进行信令交互, 建立所述媒体传输系统与远端端 点之间的媒体逻辑通道; 以及通过建立的一个媒体逻辑通道或者分别通过 多个媒体逻辑通道对同一类型的媒体流进行发送, 通过建立的一个媒体逻 辑通道或者分别通过多个媒体逻辑通道对同一类型的媒体流进行接收; 所述辅远程呈现终端, 设置为通过主远程呈现终端建立的媒体逻辑通 道进行媒体流发送及接收。
该系统还包括多路音频输入设备和 /或视频输入设备, 所述主远程呈现 终端至少包括: 信令处理模块、 媒体编解码模块、 媒体传输模块, 所述辅 远程呈现终端至少包括: 媒体编解码模块、 媒体传输模块; 其中,
所述信令处理模块, 设置为负责信令交互, 并进行媒体能力协商, 分 别建立系统中各远程呈现终端的媒体传输模块与远端端点之间用于发送媒 体流的媒体逻辑通道, 记录各媒体逻辑通道与音频输入设备和 /或视频输入 设备位置的对应信息;
所述音频输入设备和 /或视频输入设备, 设置为将采集的音频和 /或视频 数据发送给对应位置的远程呈现终端的媒体编解码模块进行编码;
所述媒体编解码模块, 设置为对输入的音频和 /或视频数据进行编码, 并才巴编码后的媒体码流转给对应的媒体传输模块;
所述媒体传输模块, 设置为把媒体编解码模块编码后的媒体码流通过 与媒体源类型和位置相对应的媒体逻辑通道发送给远端端点。
该系统还包括多路音频输出设备和 /或视频输出设备, 所述主远程呈现 终端至少包括: 信令处理模块、 媒体编解码模块、 媒体传输模块, 所述辅 远程呈现终端至少包括: 媒体编解码模块、 媒体传输模块; 其中,
所述信令处理模块, 设置为负责信令交互, 并进行媒体能力协商, 分 别建立系统中各远程呈现终端的媒体传输模块与远端端点之间用于接收媒 体流的的媒体逻辑通道, 记录各媒体逻辑通道与音频输出设备和 /或视频输 出设备位置的对应信息;
所述媒体传输模块, 设置为通过建立的媒体逻辑通道接收远端的媒体 码流, 并根据媒体逻辑通道与音频输出设备或视频输出设备位置的对应信 息, 将媒体码流分别转给对应的媒体编解码模块处理;
所述媒体编解码模块, 设置为对接收到的媒体码流进行解码, 之后输 出到对应的音频输出设备和 /或视频输出设备进行播放。
该系统还包括多路音频输入设备和 /或视频输入设备, 所述主远程呈现 终端至少包括: 信令处理模块、 媒体编解码模块、 媒体传输模块, 所述辅 远程呈现终端至少包括: 媒体编解码模块、 媒体传输模块; 其中,
所述信令处理模块, 设置为负责信令交互, 并进行媒体能力协商, 建 立主远程呈现终端的媒体传输模块与远端端点之间用于发送媒体流的多个 媒体逻辑通道, 记录各媒体逻辑通道与音频输入设备和 /或视频输入设备位 置的对应信息;
所述音频输入设备和 /或视频输入设备, 设置为将采集的音频和 /或视频 数据发送给对应位置的远程呈现终端的媒体编解码模块;
所述媒体编解码模块, 设置为对输入的音频和 /或视频数据进行编码, 并把编码后的媒体码流转给主远程呈现终端的媒体传输模块, 其中, 辅远 程呈现终端的媒体编解码模块通过相应的媒体传输模块把编码后的媒体码 流转给主远程呈现终端的媒体传输模块, 主远程呈现终端的媒体编解码模 块直接把编码后的媒体码流转给主远程呈现终端的媒体传输模块;
所述主远程呈现终端的媒体传输模块, 设置为分别把系统中各媒体编 解码模块编码后的媒体码流通过与媒体源类型和位置相对应的媒体逻辑通 道发送给远端端点。
该系统还包括多路音频输出设备和 /或视频输出设备, 所述主远程呈现 终端至少包括: 信令处理模块、 媒体编解码模块、 媒体传输模块, 所述辅 远程呈现终端至少包括: 媒体编解码模块、 媒体传输模块; 其中,
所述信令处理模块, 设置为负责信令交互, 并进行媒体能力协商, 建 立主远程呈现终端的媒体传输模块与远端端点之间用于接收媒体流的多个 媒体逻辑通道, 记录各媒体逻辑通道与音频输出设备和 /或视频输出设备位 置的对应信息;
所述主远程呈现终端的媒体传输模块, 设置为通过建立的各媒体逻辑 通道分别接收远端的多路媒体码流, 并根据媒体逻辑通道与音频输出设备 和 /或视频输出设备的对应关系, 将收到的媒体码流分别转给对应远程呈现 终端的媒体编解码模块处理, 其中, 直接将媒体码流转给主远程呈现终端 的媒体编解码模块, 通过相应的媒体传输模块将媒体码流转给辅远程呈现 终端的媒体编解码模块;
所述媒体编解码模块,设置为对接收到的音频和 /或视频码流进行解码, 之后输出到对应的音频输出设备和 /或视频输出设备进行播放。
该系统还包括多路音频输入设备和 /或视频输入设备, 所述主远程呈现 终端至少包括: 信令处理模块、 媒体编解码模块、 媒体传输模块, 所述辅 远程呈现终端至少包括: 媒体编解码模块、 媒体传输模块; 其中,
所述信令处理模块, 设置为负责信令交互, 并进行媒体能力协商, 对 每一种媒体类型建立主远程呈现终端的媒体传输模块与远端端点之间的一 个用于发送媒体流的媒体逻辑通道, 记录本侧音频输入设备和 /或视频输入 设备的媒体类型和位置;
所述音频输入设备和 /或视频输入设备, 设置为将采集的音频和 /或视频 数据发送给对应位置的远程呈现终端的媒体编解码模块;
所述媒体编解码模块, 设置为对输入的音频和 /或视频数据进行编码, 主远程呈现终端的媒体编解码模块把编码后的媒体码流转给相应的媒体传 输模块, 辅远程呈现终端的媒体编解码模块把编码后的媒体码流通过相应 的媒体传输模块转给主远程呈现终端的媒体传输模块;
主远程呈现终端的媒体传输模块, 设置为将收到的编码后的媒体码流 通过主远程呈现终端与远端端点之间的媒体逻辑通道发送, 发送的媒体包 头中携带相应的媒体类型和位置信息。
该系统还包括多路音频输出设备和 /或视频输出设备, 所述主远程呈现 终端至少包括: 信令处理模块、 媒体编解码模块、 媒体传输模块, 所述辅 远程呈现终端至少包括: 媒体编解码模块、 媒体传输模块; 其中,
所述信令处理模块, 设置为负责信令交互, 并进行媒体能力协商, 对 每一种媒体类型建立主远程呈现终端的媒体传输模块与远端端点之间的一 个接收媒体逻辑通道, 记录本侧音频输出设备和 /或视频输出设备的媒体类 型和位置;
所述主远程呈现终端的媒体传输模块, 设置为从所述媒体逻辑通道接 收远端的媒体码流, 通过解析包头所标识的媒体类型和位置信息, 将所述 媒体码流转给对应远程呈现终端的媒体编解码模块处理, 其中, 直接将媒 体码流转给主远程呈现终端的媒体编解码模块, 通过相应的媒体传输模块 将媒体码流转给辅远程呈现终端的媒体编解码模块;
所述媒体编解码模块,设置为对接收到的音频和 /或视频码流进行解码, 之后输出到相应的音频输出设备和 /或视频输出设备进行播放。
媒体逻辑通道通过 IP地址和端口号区分, 不同的媒体逻辑通道对应的 IP地址和 /或端口号不同。
本发明实施例基于远程呈现的媒体传输方法及系统, 建立连接时, 由 本侧媒体传输系统的主远程呈现终端与远端端点进行信令交互, 建立所述 本侧媒体传输系统与远端端点之间的媒体逻辑通道; 所述本侧媒体传输系 统与远端端点之间通过多个媒体逻辑通道, 或者通过一个媒体逻辑通道进 行媒体传输。 通过本发明, 对某一会场的远程呈现系统进行呼叫时, 只需 要呼叫一个号码(IP地址、 或者端点 ID号、 或者会议号), 从而操作简便, 并且, 远程呈现系统内可以交互坐席信息, 实现听声辨位的效果, 并解决 码流之间的同步等问题, 从而能够提高用户体验。 附图说明
图 1为本发明实施例基于远程呈现的媒体传输方法流程示意图; 图 2为本发明实施例 1基于远程呈现的媒体传输系统通过多个媒体逻 辑通道进行媒体发送的流程示意图;
图 3为本发明实施例 2基于远程呈现的媒体传输系统通过多个媒体逻 辑通道进行媒体接收的流程示意图;
图 4为本发明实施例 3基于远程呈现的媒体传输系统通过多个媒体逻 辑通道进行媒体发送的流程示意图;
图 5为本发明实施例 4基于远程呈现的媒体传输系统通过多个媒体逻 辑通道进行媒体接收的流程示意图;
图 6为本发明实施例 5基于远程呈现的媒体传输系统通过一个媒体逻 辑通道进行媒体发送的流程示意图;
图 7为本发明实施例 6基于远程呈现的媒体传输系统通过一个媒体逻 辑通道进行媒体接收的流程示意图;
图 8为本发明实施例 7所述的基于远程呈现的媒体传输方法流程示意 图;
图 9为本发明实施例 8所述的基于远程呈现的媒体传输方法流程示意 图;
图 10为本发明实施例一种基于远程呈现的媒体传输系统结构示意图; 图 11 为本发明实施例另一种基于远程呈现的媒体传输系统结构示意 图;
图 12 为本发明实施例再一种基于远程呈现的媒体传输系统结构示意 图;
图 13 为本发明实施例再一种基于远程呈现的媒体传输系统结构示意 图;
图 14 为本发明实施例再一种基于远程呈现的媒体传输系统结构示意 图;
图 15 为本发明实施例再一种基于远程呈现的媒体传输系统结构示意 图。 具体实施方式
本发明的基本思想是: 建立连接时, 由本侧媒体传输系统的主远程呈 现终端与远端端点进行信令交互, 建立所述本侧媒体传输系统与远端端点 之间的媒体逻辑通道; 所述本侧媒体传输系统与远端端点之间通过多个媒 体逻辑通道, 或者通过一个媒体逻辑通道进行媒体传输。 图 1为本发明实施例基于远程呈现的媒体传输方法流程示意图,如图 1 所示, 该方法包括:
步驟 101 : 建立连接时, 由本侧媒体传输系统的主远程呈现终端与远端 端点进行信令交互, 建立本侧媒体传输系统与远端端点之间的媒体逻辑通 道。
需要说明的是, 主远程呈现终端与远端端点进行信令交互过程中, 一 般需要交互两侧的远程呈现终端音视频输入输出设备的数量、 远程呈现终 端位置信息、 媒体类型。 实际应用中, 并非需要在交互消息中分别携带上 述每一种信息, 其中某些信息可以通过其他信息进行推断, 例如, 主远程 呈现终端向远端端点发送的消息中携带: 远程呈现系统中具有左、 中、 右 路视频, 则远端端点可以进一步获取该远程呈现系统具有三路视频输入设 备。
主远程呈现终端与远端端点进行信令交互过程中, 可以根据音频输入 设备、 音频输出设备、 视频输入设备和 /或视频输出设备所处的位置信息, 建立期望的通道与媒体类型 (如音频、 视频)和位置 (如左路、 中路、 右 路) 的映射关系, 并交互所述映射关系。
需要说明的是, 建立和打开媒体逻辑通道时, 主远程呈现终端需要记 录媒体类型和位置信息与媒体逻辑通道标识的对应关系。
步驟 102:本侧媒体传输系统与远端端点之间通过一个媒体逻辑通道或 者分别通过多个媒体逻辑通道对同一类型的媒体流进行发送, 以及通过一 个媒体逻辑通道或者分别通过多个媒体逻辑通道对同一类型的媒体流进行 接收。
下面通过具体实施例对本发明的技术方案作进一步详细说明。 实施例 1
图 2为本发明实施例 1基于远程呈现的媒体传输系统通过多个媒体逻 辑通道进行媒体发送的流程示意图, 如图 2所示, 该方法包括: 步驟 201 : 主远程呈现终端建立本侧与远端端点之间的呼叫, 主远程呈 现终端的信令处理模块负责信令交互, 并分别进行媒体能力协商和建立本 侧各远程呈现终端的媒体传输模块与远端端点之间用于发送媒体流的媒体 逻辑通道, 记录各媒体逻辑通道与音频输入设备和 /或视频输入设备位置的 对应信息。
步驟 202: 音频输入设备和 /或视频输入设备将采集的音频和 /或视频数 据发送给对应位置的远程呈现终端的媒体编解码模块进行编码, 各远程呈 现终端的媒体编解码模块分别对输入的音频和 /或视频数据进行编码, 并把 编码后的媒体码流转给对应的媒体传输模块。
步驟 203:各远程呈现终端的媒体传输模块分别把编码后的媒体码流通 过与媒体源类型和位置对应的媒体逻辑通道进行发送, 即通过与所述音频 输入设备和 /或视频输入设备位置相对应的媒体逻辑通道发送给远端端点。 实施例 2
图 3为本发明实施例 2基于远程呈现的媒体传输系统通过多个媒体逻 辑通道进行媒体接收的流程示意图, 如图 3所示, 该方法包括:
步驟 301 : 主远程呈现终端建立本侧与远端端点之间的呼叫, 主远程呈 现终端的信令处理模块负责信令交互, 并分别进行媒体能力协商, 建立各 媒体传输模块与远端端点之间用于接收媒体流的媒体逻辑通道, 记录各媒 体逻辑通道与音频输出设备和 /或视频输出设备位置的对应信息。
步驟 302:各远程呈现终端的媒体传输模块通过建立的媒体逻辑通道分 别接收远端的多路媒体码流, 并根据媒体逻辑通道与音频输出设备或视频 输出设备位置的对应关系, 将媒体码流分别转给对应的媒体编解码模块处 理。
步驟 303:各远程呈现终端的媒体编解码模块分别对接收到的媒体码流 进行解码, 之后输出到对应的音频输出设备和 /或视频输出设备进行播放。 实施例 3
图 4为本发明实施例 3基于远程呈现的媒体传输系统通过多个媒体逻 辑通道进行媒体发送的流程示意图, 如图 4所示, 该方法包括:
步驟 401 : 主远程呈现终端建立本侧与远端端点之间的呼叫, 主远程呈 现终端的信令处理模块负责信令交互, 并进行媒体能力协商, 建立主远程 呈现终端的媒体传输模块与远端端点之间用于发送媒体流的多个媒体逻辑 通道, 记录各媒体逻辑通道与音频输入设备和 /或视频输入设备位置的对应 信息。
步驟 402: 音频输入设备和 /或视频输入设备将采集的音频和 /或视频数 据发送给对应位置的远程呈现终端的媒体编解码模块, 各媒体编解码模块 分别对输入的音频和 /或视频数据进行编码, 并把编码后的媒体码流转给主 远程呈现终端的媒体传输模块。
需要说明的是, 主远程呈现终端的媒体编解码模块直接将编码后的媒 体码流转给主远程呈现终端的媒体传输模块, 辅远程呈现终端的媒体编解 码模块通过相应的媒体传输模块将编码后的媒体码流转给主远程呈现终端 的媒体传输模块。
步驟 403:主远程呈现终端的媒体传输模块分别把本侧媒体编解码模块 编码后的媒体码流通过与媒体源 (音视频输入设备位置)类型和位置相对 应的媒体逻辑通道发送给远端端点。 实施例 4
图 5为本发明实施例 4基于远程呈现的媒体传输系统通过多个媒体逻 辑通道进行媒体接收的流程示意图, 如图 5所示, 该方法包括:
步驟 501 : 主远程呈现终端建立本侧与远端端点之间的呼叫, 主远程呈 现终端的信令处理模块负责信令交互, 并进行媒体能力协商, 建立主远程 呈现终端的媒体传输模块与远端端点之间用于接收媒体流的多个媒体逻辑 通道, 记录各媒体逻辑通道与音频输出设备和 /或视频输出设备位置的对应 信息。
步驟 502:主远程呈现终端的媒体传输模块通过建立的各媒体逻辑通道 分别接收远端的多路媒体码流, 并根据媒体逻辑通道与音频输出设备和 /或 视频输出设备的对应关系, 将收到的媒体码流分别转给对应远程呈现终端 的媒体编解码模块处理。
需要说明的是, 主远程呈现终端的媒体传输模块直接将媒体码流转给 主远程呈现终端的媒体编解码模块, 而通过辅远程呈现终端的媒体传输模 块将媒体码流转给所述辅远程呈现终端的媒体编解码模块。
步驟 503: 各远程呈现终端的媒体编解码模块分别对接收到的音频和 / 或视频码流进行解码, 并输出到对应的音频输出设备和 /或视频输出设备进 行播放。 实施例 5
图 6为本发明实施例 5基于远程呈现的媒体传输系统通过一个媒体逻 辑通道进行媒体发送的流程示意图, 如图 6所示, 该方法包括:
步驟 601 : 主远程呈现终端建立本侧与远端端点之间的呼叫, 主远程呈 现终端的信令处理模块负责信令交互, 并进行媒体能力协商, 建立主远程 呈现终端的媒体传输模块与远端端点之间的一个用于发送媒体流的媒体逻 辑通道, 记录本侧音频输入设备和 /或视频输入设备的媒体类型和位置。
步驟 602: 音频输入设备和 /或视频输入设备将采集的音频和 /或视频数 据发送给对应位置的远程呈现终端的媒体编解码模块, 各媒体编解码模块 分别对输入的音频和 /或视频数据进行编码, 并把编码后的媒体码流转给主 远程呈现终端的媒体传输模块。 需要说明的是, 主远程呈现终端的媒体编解码模块直接将编码后的媒 体码流转给主远程呈现终端的媒体传输模块, 辅远程呈现终端的媒体编解 码模块通过相应的媒体传输模块将编码后的媒体码流转给主远程呈现终端 的媒体传输模块。
步驟 603:主远程呈现终端的媒体传输模块将所述编码后的媒体码流通 过主远程呈现终端与远端端点之间的媒体逻辑通道发送。
需要说明的是, 发送前打包时, 需要在媒体包头标识相应的媒体类型 和位置信息。 实施例 6
图 7为本发明实施例 6基于远程呈现的媒体传输系统通过一个媒体逻 辑通道进行媒体接收的流程示意图, 如图 7所示, 该方法包括:
步驟 701 : 主远程呈现终端建立本侧与远端端点之间的呼叫, 主远程呈 现终端的信令处理模块负责信令交互, 并进行媒体能力协商, 建立主远程 呈现终端的媒体传输模块与远端端点之间的一个接收媒体码流的媒体逻辑 通道, 记录本侧音频输出设备和 /或视频输出设备的媒体类型和位置。
步驟 702:主远程呈现终端的媒体编解码模块从该媒体逻辑通道接收远 端的媒体码流, 并将该媒体码流转给对应远程呈现终端的媒体编解码模块 处理。
具体的, 主远程呈现终端通过解析媒体包头所标识的媒体类型和位置 信息, 将媒体码流转发到相应的媒体编解码模块进行解码。 主远程呈现终 端的媒体传输模块直接将媒体码流转给主远程呈现终端的媒体编解码模 块, 而通过辅远程呈现终端的媒体传输模块将媒体码流转给所述辅远程呈 现终端的媒体编解码模块。
步驟 703: 各远程呈现终端的媒体编解码模块分别对接收到的音频和 / 或视频码流进行解码, 并输出到相应的音频输出设备和 /或视频输出设备进 行播放。 实施例 7
本实施例中, 基于远程呈现的媒体传输系统至少包括两个以上远程呈 现终端和多个音视频输入 /输出设备, 其中一个远程呈现终端 (后文称 "主 远程呈现终端") 负责信令和媒体的代理, 至少包括协议信令处理模块、 媒 体编解码模块、媒体传输模块; 另外一个或以上远程呈现终端(后文称 "辅 远程呈现终端")至少包括媒体编解码模块和媒体传输模块。 一个或多个辅 远程呈现终端分别与主远程呈现终端相连, 主远程呈现终端和辅远程呈现 终端分别连接有至少一路音频输入、 一路音频输出、 一路视频输入和一路 视频输出设备。 主远程呈现终端在网守 (GK )上进行注册并对外提供注册 的端点 ID号。
本实施例中, 主远程呈现终端建立的与远端端点之间的媒体逻辑通道 分别在本端各远程呈现终端与远端端点之间建立, 即分别使用本端各终端 不同的地址, 本端远程呈现终端与远端之间的码流直接由本端各远程呈现 终端分别处理, 本端的主、 辅远程呈现终端分别处理相应的媒体流收发功
•6匕
匕。
图 8为本发明实施例 7所述的基于远程呈现的媒体传输方法流程示意 图, 如图 8所示, 该方法包括:
步驟 801: 用户通过连接主远程呈现终端的中控界面或者遥控器, 输入 被叫远端端点的号码(如 IP地址、 或者 H.323 ID, 或者会议号等)发起呼 叫, 远端端点处理该呼叫, 建立连接。 如通过 H.225协议完成呼叫连接, 呼叫时只需要使用本端主远程呈现终端的 IP地址或者 H.323 ID。
步驟 802: 所述主远程呈现终端获取辅远程呈现终端的信息, 包括本次 呼叫中本端使用的各辅远程呈现终端媒体处理能力集、 媒体收发地址。
步驟 803: 所述主远程呈现终端与远端端点之间进行媒体能力协商。 包 括所述主远程呈现终端根据上述辅远程呈现终端和主远程呈现终端的信 息, 以及根据终端的位置进行标识, 构造包含需要建立的通道媒体类型、 数量、 位置标识信息的能力集给远端端点, 比如可以采用 H.245协议, 在 发送能力集时告知对方本端支持的能力集以及能力描述类型, 通过能力描 述类型区分不同位置的不同码流。
例如, 在 H.245的终端能力集(terminalCapabilitySet )消息结构中, 增 加多路音频和 /或视频的描述符, 并约定不同值对应不同的类型和位置, 如
1、 2、 3分别表示左、 中、 右路音频, 4、 5、 6分别表示左、 中、 右路视频。 通过 terminalCapabilitySet发送本端能力集并接收远端端点发送的能力集, 进行能力协商确定各媒体逻辑通道对应的媒体类型和位置, 如媒体逻辑通 道 1对应接收远端端点左路音频 , 通道 6对应接收远端端点右路视频。
步驟 804: 打开双向媒体逻辑通道。 本端主远程呈现终端发送 H.245打 开媒体逻辑通道( openLogicalChannd ) 消息, 结构中描述上述通道标识与 媒体类型和位置的对应关系, 以及通道本身的特征描述, 至少包括媒体发 送地址, 分别使用本端各终端的地址(IP地址和端口号)。 远端端点回复打 开逻辑通道确认消息 openLogicalChannd Ack, 该消息至少包括该通道的接 收地址(IP地址和端口号), 本端主远程呈现终端记录该发送通道信息, 包 括通道标识与媒体类型和位置的对应关系, 接收和发送地址等。 分别通过 上述方式建立多个发送逻辑通道。 远端端点通过上述方式打开到本端的多 个媒体逻辑通道。
步驟 805:本端主远程呈现终端通知各远程呈现终端的媒体传输模块通 过相应通道收发媒体流数据。
步驟 806: 本端各远程呈现终端分别与远端端点之间传输多路码流。 本 端通过不同媒体逻辑通道发送码流时,音频或视频 I/O设备采集的码流发送 给对应位置的终端的编解码模块进行编码, 然后由对应的媒体传输模块进 行发送, 发送时根据媒体源所在位置通过上述与上述记录的媒体逻辑通道 信息选择相应的通道进行发送, 如左路音频通过通道 1进行发送。
本端通过不同媒体逻辑通道接收码流时, 本端媒体传输模块接收到媒 体码流, 根据上述记录的媒体逻辑通道信息, 如接收到的左路视频, 转发 到相应的媒体编解码模块进行解码, 输出到对应位置的音频或视频设备播 放。
步驟 807: 结束时, 本端主远程呈现终端通知各远程呈现终端的媒体传 输模块停止媒体流监听。
步驟 808: 由主远程呈现终端负责完成终止会议, 先关闭各媒体逻辑通 道, 最后完成会话拆除。 实施例 8
本实施例中, 各媒体逻辑通道建立在主远程呈现终端与远端端点之间 , 所有媒体流通过主远程呈现终端收发, 由主远程呈现终端完成与辅远程呈 现终端之间的码流转发。
图 9为本发明实施例 8所述的基于远程呈现的媒体传输方法流程示意 图, 如图 9所示, 该方法包括:
步驟 901 : 用户通过连接主远程呈现终端的中控界面或者遥控器, 输入 被叫远端端点的号码(如 IP地址、 或者 H.323 ID、 或者会议号等)发起呼 叫, 远端端点处理该呼叫, 建立连接。 如通过 H.225协议完成呼叫连接, 呼叫时只需要使用本端主远程呈现终端的 IP地址或者 H.323 ID。
步驟 902: 主远程呈现终端获取辅远程呈现终端的信息, 包括根据本次 呼叫本端使用的各辅远程呈现终端媒体处理能力集, 媒体收发地址(含 IP 地址和端口号)。
步驟 903: 所述主远程呈现终端与远端端点之间进行媒体能力协商。 包 括所述主远程呈现终端根据上述辅远程呈现终端和主远程呈现终端的信 息, 以及根据终端的位置进行标识, 构造包含需要建立的通道媒体类型、 数量、 位置标识信息的能力集给远端端点, 比如可以采用 H.245协议, 在 发送能力集时告知对方本端支持的能力集以及能力描述类型, 通过能力描 述类型区分不同位置的不同码流。
例如在 H.245的 terminalCapabilitySet消息结构中, 增加多路音频和 /或 视频的描述符, 并约定不同值对应不同的类型和位置, 如 1、 2、 3分别表 示左、 中、 右路音频, 4、 5、 6 分别表示左、 中、 右路视频。 通过 terminalCapabilitySet发送本端能力集并接收远端端点发送的能力集, 进行 能力协商确定各媒体逻辑通道对应的媒体类型和位置, 如媒体逻辑通道 1 对应接收远端端点左路音频, 通道 6对应接收远端端点右路视频。
步驟 904 : 打开双向媒体逻辑通道。 本端主远程呈现终端发送 openLogicalChannel消息, 消息中使用本端主远程呈现终端的地址区分不同 通道,相同 IP地址和不同的端口号,。远端端点回复打开逻辑通道确认消息 openLogicalChannel Ack , 该消息至少包括该通道的接收地址(IP地址和端 口号), 本端主远程呈现终端记录该发送通道信息, 包括通道标识与媒体类 型和位置的对应关系, 接收和发送地址等。 分别通过上述方式建立多个发 送逻辑通道。 远端端点通过上述方式打开多个到本端主远程呈现终端的媒 体逻辑通道。
步驟 905:本端主远程呈现终端建立与各辅远程呈现终端之间的媒体转 发通道, 并维护上述主远程呈现终端与辅远程呈现终端之间的转发通道和 主远程呈现终端与远端端点之间的收发通道的映射关系。
步驟 906: 本端通过不同媒体逻辑通道发送码流时, 音频和 /或视频 I/O 设备采集的码流发送给对应位置的终端的编解码模块进行编码, 然后由对 应的辅远程呈现终端媒体传输模块通过对应的转发通道转给主远程呈现终 端, 主远程呈现终端根据媒体源所在位置通过上述记录的与远端端点之间 的相应的媒体逻辑通道进行发送, 如左路音频则通过通道 1进行发送。 本端通过不同媒体逻辑通道接收码流时, 本端主远程呈现终端媒体传 输模块接收到媒体码流, 根据上述记录的媒体逻辑通道信息, 如接收到的 左路视频, 通过上述建立的相应的主远程呈现终端与辅远程呈现终端之间 的转发通道转发到相应的辅远程呈现终端的媒体编解码模块进行解码, 输 出到对应位置的音频或视频设备播放。
步驟 907: 结束时, 本端主远程呈现终端通知各终端的媒体传输模块停 止媒体流监听, 并关闭辅远程呈现终端与主远程呈现终端之间的转发通道。
步驟 908: 由主远程呈现终端负责完成终止会议, 先关闭各媒体逻辑通 道, 最后完成会话拆除。
需要说明的是, 本发明中所述主远程呈现终端建立与远端端点 (可以 是 MCU或者远程呈现终端)的连接, 可以是点对点会议或者多点会议, 具 体的, 可以是所述主远程呈现终端主动发起呼叫, 也可以是所述主远程呈 现终端接受远端端点的呼叫。
本发明实施例还相应地提出一种基于远程呈现的媒体传输系统, 该系 统包括: 主远程呈现终端和至少一个辅远程呈现终端; 其中,
所述主远程呈现终端, 用于在建立所述媒体传输系统与远端端点之间 的连接时, 与远端端点进行信令交互, 建立所述媒体传输系统与远端端点 之间的媒体逻辑通道; 以及通过建立的一个媒体逻辑通道或者分别通过多 个媒体逻辑通道对同一类型的媒体流进行发送, 通过建立的一个媒体逻辑 通道或者分别通过多个媒体逻辑通道对同一类型的媒体流进行接收;
所述辅远程呈现终端, 用于通过主远程呈现终端建立的媒体逻辑通道 进行媒体流发送及接收。
图 10为本发明实施例一种基于远程呈现的媒体传输系统结构示意图, 如图 10所示, 该系统还可以包括多路音频输入设备和 /或视频输入设备, 所 述主远程呈现终端至少包括: 信令处理模块、 媒体编解码模块、 媒体传输 模块, 所述辅远程呈现终端至少包括: 媒体编解码模块、 媒体传输模块; 其中,
所述信令处理模块, 用于负责信令交互, 并进行媒体能力协商, 分别 建立系统中各远程呈现终端的媒体传输模块与远端端点之间用于发送媒体 流的媒体逻辑通道, 记录各媒体逻辑通道与音频输入设备和 /或视频输入设 备位置的对应信息;
所述音频输入设备和 /或视频输入设备, 用于将采集的音频和 /或视频数 据发送给对应位置的远程呈现终端的媒体编解码模块进行编码;
所述媒体编解码模块, 用于对输入的音频和 /或视频数据进行编码, 并 把编码后的媒体码流转给对应的媒体传输模块;
所述媒体传输模块, 用于把媒体编解码模块编码后的媒体码流通过与 媒体源类型和位置相对应的媒体逻辑通道发送给远端端点。
图 11 为本发明实施例另一种基于远程呈现的媒体传输系统结构示意 图,如图 11所示,该系统还可以包括多路音频输出设备和 /或视频输出设备, 所述主远程呈现终端至少包括: 信令处理模块、 媒体编解码模块、 媒体传 输模块, 所述辅远程呈现终端至少包括: 媒体编解码模块、 媒体传输模块; 其中,
所述信令处理模块, 用于负责信令交互, 并进行媒体能力协商, 分别 建立系统中各远程呈现终端的媒体传输模块与远端端点之间用于接收媒体 流的的媒体逻辑通道, 记录各媒体逻辑通道与音频输出设备和 /或视频输出 设备位置的对应信息;
所述媒体传输模块, 用于通过建立的媒体逻辑通道接收远端的媒体码 流, 并根据媒体逻辑通道与音频输出设备或视频输出设备位置的对应信息, 将媒体码流分别转给对应的媒体编解码模块处理; 所述媒体编解码模块, 用于对接收到的媒体码流进行解码, 之后输出 到对应的音频输出设备和 /或视频输出设备进行播放。
图 12 为本发明实施例再一种基于远程呈现的媒体传输系统结构示意 图, 如图 12所示, 该系统还包括多路音频输入设备和 /或视频输入设备, 所 述主远程呈现终端至少包括: 信令处理模块、 媒体编解码模块、 媒体传输 模块, 所述辅远程呈现终端至少包括: 媒体编解码模块、 媒体传输模块; 其中,
所述信令处理模块, 用于负责信令交互, 并进行媒体能力协商, 建立 主远程呈现终端的媒体传输模块与远端端点之间用于发送媒体流的多个媒 体逻辑通道, 记录各媒体逻辑通道与音频输入设备和 /或视频输入设备位置 的对应信息;
所述音频输入设备和 /或视频输入设备, 用于将采集的音频和 /或视频数 据发送给对应位置的远程呈现终端的媒体编解码模块;
所述媒体编解码模块, 用于对输入的音频和 /或视频数据进行编码, 并 把编码后的媒体码流转给主远程呈现终端的媒体传输模块, 其中, 辅远程 呈现终端的媒体编解码模块通过相应的媒体传输模块把编码后的媒体码流 转给主远程呈现终端的媒体传输模块, 主远程呈现终端的媒体编解码模块 直接把编码后的媒体码流转给主远程呈现终端的媒体传输模块;
所述主远程呈现终端的媒体传输模块, 用于分别把系统中各媒体编解 码模块编码后的媒体码流通过与媒体源类型和位置相对应的媒体逻辑通道 发送给远端端点。
图 13 为本发明实施例再一种基于远程呈现的媒体传输系统结构示意 图, 如图 13所示, 该系统还包括多路音频输出设备和 /或视频输出设备, 所 述主远程呈现终端至少包括: 信令处理模块、 媒体编解码模块、 媒体传输 模块, 所述辅远程呈现终端至少包括: 媒体编解码模块、 媒体传输模块; 其中,
所述信令处理模块, 用于负责信令交互, 并进行媒体能力协商, 建立 主远程呈现终端的媒体传输模块与远端端点之间用于接收媒体流的多个媒 体逻辑通道, 记录各媒体逻辑通道与音频输出设备和 /或视频输出设备位置 的对应信息;
所述主远程呈现终端的媒体传输模块, 用于通过建立的各媒体逻辑通 道分别接收远端的多路媒体码流, 并根据媒体逻辑通道与音频输出设备和 / 或视频输出设备的对应关系, 将收到的媒体码流分别转给对应远程呈现终 端的媒体编解码模块处理, 其中, 直接将媒体码流转给主远程呈现终端的 媒体编解码模块, 通过相应的媒体传输模块将媒体码流转给辅远程呈现终 端的媒体编解码模块;
所述媒体编解码模块, 用于对接收到的音频和 /或视频码流进行解码, 之后输出到对应的音频输出设备和 /或视频输出设备进行播放。
图 14 为本发明实施例再一种基于远程呈现的媒体传输系统结构示意 图, 如图 14所示, 该系统还包括多路音频输入设备和 /或视频输入设备, 所 述主远程呈现终端至少包括: 信令处理模块、 媒体编解码模块、 媒体传输 模块, 所述辅远程呈现终端至少包括: 媒体编解码模块、 媒体传输模块; 其中,
所述信令处理模块, 用于负责信令交互, 并进行媒体能力协商, 对每 一种媒体类型建立主远程呈现终端的媒体传输模块与远端端点之间的一个 用于发送媒体流的媒体逻辑通道, 记录本侧音频输入设备和 /或视频输入设 备的媒体类型和位置;
所述音频输入设备和 /或视频输入设备, 用于将采集的音频和 /或视频数 据发送给对应位置的远程呈现终端的媒体编解码模块;
所述媒体编解码模块, 用于对输入的音频和 /或视频数据进行编码, 主 远程呈现终端的媒体编解码模块把编码后的媒体码流转给相应的媒体传输 模块, 辅远程呈现终端的媒体编解码模块把编码后的媒体码流通过相应的 媒体传输模块转给主远程呈现终端的媒体传输模块;
主远程呈现终端的媒体传输模块, 用于将收到的编码后的媒体码流通 过主远程呈现终端与远端端点之间的媒体逻辑通道发送, 发送的媒体包头 中携带相应的媒体类型和位置信息。
图 15为本发明实施例再一种基于远程呈现的媒体传输系统结构示意图, 如图 15所示, 该系统还包括多路音频输出设备和 /或视频输出设备, 所述主 远程呈现终端至少包括: 信令处理模块、 媒体编解码模块、 媒体传输模块, 所述辅远程呈现终端至少包括: 媒体编解码模块、 媒体传输模块; 其中, 所述信令处理模块, 用于负责信令交互, 并进行媒体能力协商, 对每 一种媒体类型建立主远程呈现终端的媒体传输模块与远端端点之间的一个 接收媒体逻辑通道, 记录本侧音频输出设备和 /或视频输出设备的媒体类型 和位置;
所述主远程呈现终端的媒体传输模块, 用于从所述媒体逻辑通道接收 远端的媒体码流, 通过解析包头所标识的媒体类型和位置信息, 将所述媒 体码流转给对应远程呈现终端的媒体编解码模块处理, 其中, 直接将媒体 码流转给主远程呈现终端的媒体编解码模块, 通过相应的媒体传输模块将 媒体码流转给辅远程呈现终端的媒体编解码模块;
所述媒体编解码模块, 用于对接收到的音频和 /或视频码流进行解码, 之后输出到相应的音频输出设备和 /或视频输出设备进行播放。
本发明中, 不同的媒体逻辑通道可以通过 IP地址和端口号进行区分, 不同的媒体逻辑通道对应的 IP地址和 /或端口号不同。
以上所述, 仅为本发明的较佳实施例而已, 并非用于限定本发明的保 护范围。

Claims

权利要求书
1、 一种基于远程呈现的媒体传输方法, 其中, 媒体传输系统包括一个 主远程呈现终端和至少一个辅远程呈现终端, 该方法包括:
建立连接时, 由本侧媒体传输系统的主远程呈现终端与远端端点进行 信令交互, 建立所述本侧媒体传输系统与远端端点之间的媒体逻辑通道; 本侧媒体传输系统与远端端点之间通过一个媒体逻辑通道或者分别通 过多个媒体逻辑通道对同一类型的媒体流进行发送, 以及通过一个媒体逻 辑通道或者分别通过多个媒体逻辑通道对同一类型的媒体流进行接收。
2、 根据权利要求 1所述的方法, 其中,
所述建立本侧媒体传输系统与远端端点之间的媒体逻辑通道为: 分别 建立本侧各远程呈现终端的媒体传输模块与远端端点之间用于发送媒体流 的媒体逻辑通道, 记录各媒体逻辑通道与音频输入设备和 /或视频输入设备 位置的对应信息;
所述本侧媒体传输系统与远端端点之间分别通过多个媒体逻辑通道对 同一类型的媒体流进行发送为: 音频输入设备和 /或视频输入设备将采集的 音频和 /或视频数据发送给对应位置的远程呈现终端的媒体编解码模块进行 编码; 各远程呈现终端的媒体编解码模块分别对输入的音频和 /或视频数据 进行编码, 并把编码后的媒体码流转给对应的媒体传输模块; 各远程呈现 终端的媒体传输模块分别把编码后的媒体码流通过与媒体源类型和位置相 对应的媒体逻辑通道发送给远端端点。
3、 根据权利要求 1所述的方法, 其中,
所述建立本侧媒体传输系统与远端端点之间的媒体逻辑通道为: 分别 建立本侧各远程呈现终端的媒体传输模块与远端端点之间用于接收媒体流 的媒体逻辑通道, 记录各媒体逻辑通道与音频输出设备和 /或视频输出设备 位置的对应信息;
所述本侧媒体传输系统与远端端点之间分别通过多个媒体逻辑通道对 同一类型的媒体流进行接收为: 各远程呈现终端的媒体传输模块通过建立 的媒体逻辑通道分别接收远端的多路媒体码流, 并根据媒体逻辑通道与音 频输出设备或视频输出设备位置的对应关系, 将媒体码流分别转给对应的 媒体编解码模块处理; 各远程呈现终端的媒体编解码模块分别对接收到的 媒体码流进行解码, 之后输出到对应的音频输出设备和 /或视频输出设备进 行播放。
4、 根据权利要求 1所述的方法, 其中,
所述建立本侧媒体传输系统与远端端点之间的媒体逻辑通道为: 建立 主远程呈现终端的媒体传输模块与远端端点之间用于发送媒体流的多个媒 体逻辑通道, 记录各媒体逻辑通道与音频输入设备和 /或视频输入设备位置 的对应信息;
所述本侧媒体传输系统与远端端点之间分别通过多个媒体逻辑通道对 同一类型的媒体流进行发送为: 音频输入设备和 /或视频输入设备将采集的 音频和 /或视频数据发送给对应位置的远程呈现终端的媒体编解码模块; 各 媒体编解码模块分别对输入的音频和 /或视频数据进行编码, 并把编码后的 媒体码流转给主远程呈现终端的媒体传输模块; 主远程呈现终端的媒体传 输模块分别把本侧媒体编解码模块编码后的媒体码流通过与媒体源类型和 位置相对应的媒体逻辑通道发送给远端端点。
5、 根据权利要求 1所述的方法, 其中,
所述建立本侧媒体传输系统与远端端点之间的媒体逻辑通道为: 建立 主远程呈现终端的媒体传输模块与远端端点之间用于接收媒体流的多个媒 体逻辑通道, 记录各媒体逻辑通道与音频输出设备和 /或视频输出设备位置 的对应信息; 所述本侧媒体传输系统与远端端点之间分别通过多个媒体逻辑通道对 同一类型的媒体流进行接收为: 主远程呈现终端的媒体传输模块通过建立 的各媒体逻辑通道分别接收远端的多路媒体码流, 并根据媒体逻辑通道与 音频输出设备和 /或视频输出设备的对应关系, 将收到的媒体码流分别转给 对应远程呈现终端的媒体编解码模块处理; 各远程呈现终端的媒体编解码 模块分别对接收到的音频和 /或视频码流进行解码, 之后输出到对应的音频 输出设备和 /或视频输出设备进行播放。
6、 根据权利要求 1所述的方法, 其中,
所述建立本侧媒体传输系统与远端端点之间的媒体逻辑通道为: 对每 一种媒体类型建立主远程呈现终端的媒体传输模块与远端端点之间的一个 用于发送媒体流的媒体逻辑通道, 记录本侧音频输入设备和 /或视频输入设 备的媒体类型和位置;
所述本侧媒体传输系统与远端端点之间通过一个媒体逻辑通道对同一 类型的媒体流进行发送为: 音频输入设备和 /或视频输入设备将采集的音频 和 /或视频数据发送给对应位置的远程呈现终端的媒体编解码模块; 各媒体 编解码模块分别对输入的音频和 /或视频数据进行编码, 并 4巴编码后的媒体 码流转给主远程呈现终端的媒体传输模块; 主远程呈现终端的媒体传输模 块将所述编码后的媒体码流通过主远程呈现终端与远端端点之间的媒体逻 辑通道发送 , 发送的媒体包头中携带相应的媒体类型和位置信息。
7、 根据权利要求 1所述的方法, 其中,
所述建立本侧媒体传输系统与远端端点之间的媒体逻辑通道为: 对每 一种媒体类型建立主远程呈现终端的媒体传输模块与远端端点之间的一个 接收媒体码流的媒体逻辑通道, 记录本侧音频输出设备和 /或视频输出设备 的媒体类型和位置;
所述本侧媒体传输系统与远端端点之间通过一个媒体逻辑通道对同一 类型的媒体流进行接收为: 主远程呈现终端的媒体传输模块从所述媒体逻 辑通道接收远端的媒体码流, 通过解析包头所标识的媒体类型和位置信息, 将所述媒体码流转给对应远程呈现终端的媒体编解码模块处理; 各远程呈 现终端的媒体编解码模块分别对接收到的音频和 /或视频码流进行解码, 之 后输出到相应的音频输出设备和 /或视频输出设备进行播放。
8、根据权利要求 1至 7任一项所述的方法, 其中, 通过 IP地址和端口 号区分媒体逻辑通道, 不同的媒体逻辑通道对应的 IP地址和 /或端口号不 同。
9、 一种基于远程呈现的媒体传输系统, 其中, 该系统包括: 主远程呈 现终端和至少一个辅远程呈现终端; 其中,
所述主远程呈现终端, 设置为在建立所述媒体传输系统与远端端点之 间的连接时, 与远端端点进行信令交互, 建立所述媒体传输系统与远端端 点之间的媒体逻辑通道; 以及通过建立的一个媒体逻辑通道或者分别通过 多个媒体逻辑通道对同一类型的媒体流进行发送, 通过建立的一个媒体逻 辑通道或者分别通过多个媒体逻辑通道对同一类型的媒体流进行接收; 所述辅远程呈现终端, 设置为通过主远程呈现终端建立的媒体逻辑通 道进行媒体流发送及接收。
10、 根据权利要求 9所述的系统, 其中, 该系统还包括多路音频输入 设备和 /或视频输入设备, 所述主远程呈现终端至少包括: 信令处理模块、 媒体编解码模块、 媒体传输模块, 所述辅远程呈现终端至少包括: 媒体编 解码模块、 媒体传输模块; 其中,
所述信令处理模块, 设置为负责信令交互, 并进行媒体能力协商, 分 别建立系统中各远程呈现终端的媒体传输模块与远端端点之间用于发送媒 体流的媒体逻辑通道, 记录各媒体逻辑通道与音频输入设备和 /或视频输入 设备位置的对应信息; 所述音频输入设备和 /或视频输入设备, 设置为将采集的音频和 /或视频 数据发送给对应位置的远程呈现终端的媒体编解码模块进行编码;
所述媒体编解码模块, 设置为对输入的音频和 /或视频数据进行编码, 并才巴编码后的媒体码流转给对应的媒体传输模块;
所述媒体传输模块, 设置为把媒体编解码模块编码后的媒体码流通过 与媒体源类型和位置相对应的媒体逻辑通道发送给远端端点。
11、 根据权利要求 9所述的系统, 其中, 该系统还包括多路音频输出 设备和 /或视频输出设备, 所述主远程呈现终端至少包括: 信令处理模块、 媒体编解码模块、 媒体传输模块, 所述辅远程呈现终端至少包括: 媒体编 解码模块、 媒体传输模块; 其中,
所述信令处理模块, 设置为负责信令交互, 并进行媒体能力协商, 分 别建立系统中各远程呈现终端的媒体传输模块与远端端点之间用于接收媒 体流的的媒体逻辑通道, 记录各媒体逻辑通道与音频输出设备和 /或视频输 出设备位置的对应信息;
所述媒体传输模块, 设置为通过建立的媒体逻辑通道接收远端的媒体 码流, 并根据媒体逻辑通道与音频输出设备或视频输出设备位置的对应信 息, 将媒体码流分别转给对应的媒体编解码模块处理;
所述媒体编解码模块, 设置为对接收到的媒体码流进行解码, 之后输 出到对应的音频输出设备和 /或视频输出设备进行播放。
12、 根据权利要求 9所述的系统, 其中, 该系统还包括多路音频输入 设备和 /或视频输入设备, 所述主远程呈现终端至少包括: 信令处理模块、 媒体编解码模块、 媒体传输模块, 所述辅远程呈现终端至少包括: 媒体编 解码模块、 媒体传输模块; 其中,
所述信令处理模块, 设置为负责信令交互, 并进行媒体能力协商, 建 立主远程呈现终端的媒体传输模块与远端端点之间用于发送媒体流的多个 媒体逻辑通道, 记录各媒体逻辑通道与音频输入设备和 /或视频输入设备位 置的对应信息;
所述音频输入设备和 /或视频输入设备, 设置为将采集的音频和 /或视频 数据发送给对应位置的远程呈现终端的媒体编解码模块;
所述媒体编解码模块, 设置为对输入的音频和 /或视频数据进行编码, 并把编码后的媒体码流转给主远程呈现终端的媒体传输模块, 其中, 辅远 程呈现终端的媒体编解码模块通过相应的媒体传输模块把编码后的媒体码 流转给主远程呈现终端的媒体传输模块, 主远程呈现终端的媒体编解码模 块直接把编码后的媒体码流转给主远程呈现终端的媒体传输模块;
所述主远程呈现终端的媒体传输模块, 设置为分别把系统中各媒体编 解码模块编码后的媒体码流通过与媒体源类型和位置相对应的媒体逻辑通 道发送给远端端点。
13、 根据权利要求 9所述的系统, 其中, 该系统还包括多路音频输出 设备和 /或视频输出设备, 所述主远程呈现终端至少包括: 信令处理模块、 媒体编解码模块、 媒体传输模块, 所述辅远程呈现终端至少包括: 媒体编 解码模块、 媒体传输模块; 其中,
所述信令处理模块, 设置为负责信令交互, 并进行媒体能力协商, 建 立主远程呈现终端的媒体传输模块与远端端点之间用于接收媒体流的多个 媒体逻辑通道, 记录各媒体逻辑通道与音频输出设备和 /或视频输出设备位 置的对应信息;
所述主远程呈现终端的媒体传输模块, 设置为通过建立的各媒体逻辑 通道分别接收远端的多路媒体码流, 并根据媒体逻辑通道与音频输出设备 和 /或视频输出设备的对应关系, 将收到的媒体码流分别转给对应远程呈现 终端的媒体编解码模块处理, 其中, 直接将媒体码流转给主远程呈现终端 的媒体编解码模块, 通过相应的媒体传输模块将媒体码流转给辅远程呈现 终端的媒体编解码模块;
所述媒体编解码模块,设置为对接收到的音频和 /或视频码流进行解码, 之后输出到对应的音频输出设备和 /或视频输出设备进行播放。
14、 根据权利要求 9所述的系统, 其中, 该系统还包括多路音频输入 设备和 /或视频输入设备, 所述主远程呈现终端至少包括: 信令处理模块、 媒体编解码模块、 媒体传输模块, 所述辅远程呈现终端至少包括: 媒体编 解码模块、 媒体传输模块; 其中,
所述信令处理模块, 设置为负责信令交互, 并进行媒体能力协商, 对 每一种媒体类型建立主远程呈现终端的媒体传输模块与远端端点之间的一 个用于发送媒体流的媒体逻辑通道, 记录本侧音频输入设备和 /或视频输入 设备的媒体类型和位置;
所述音频输入设备和 /或视频输入设备, 设置为将采集的音频和 /或视频 数据发送给对应位置的远程呈现终端的媒体编解码模块;
所述媒体编解码模块, 设置为对输入的音频和 /或视频数据进行编码, 主远程呈现终端的媒体编解码模块把编码后的媒体码流转给相应的媒体传 输模块, 辅远程呈现终端的媒体编解码模块把编码后的媒体码流通过相应 的媒体传输模块转给主远程呈现终端的媒体传输模块;
主远程呈现终端的媒体传输模块, 设置为将收到的编码后的媒体码流 通过主远程呈现终端与远端端点之间的媒体逻辑通道发送, 发送的媒体包 头中携带相应的媒体类型和位置信息。
15、 根据权利要求 9所述的系统, 其中, 该系统还包括多路音频输出 设备和 /或视频输出设备, 所述主远程呈现终端至少包括: 信令处理模块、 媒体编解码模块、 媒体传输模块, 所述辅远程呈现终端至少包括: 媒体编 解码模块、 媒体传输模块; 其中,
所述信令处理模块, 设置为负责信令交互, 并进行媒体能力协商, 对 每一种媒体类型建立主远程呈现终端的媒体传输模块与远端端点之间的一 个接收媒体逻辑通道, 记录本侧音频输出设备和 /或视频输出设备的媒体类 型和位置;
所述主远程呈现终端的媒体传输模块, 设置为从所述媒体逻辑通道接 收远端的媒体码流, 通过解析包头所标识的媒体类型和位置信息, 将所述 媒体码流转给对应远程呈现终端的媒体编解码模块处理, 其中, 直接将媒 体码流转给主远程呈现终端的媒体编解码模块, 通过相应的媒体传输模块 将媒体码流转给辅远程呈现终端的媒体编解码模块;
所述媒体编解码模块,设置为对接收到的音频和 /或视频码流进行解码, 之后输出到相应的音频输出设备和 /或视频输出设备进行播放。
16、 根据权利要求 9至 15任一项所述的系统, 其中,
媒体逻辑通道通过 IP地址和端口号区分, 不同的媒体逻辑通道对应的 IP地址和 /或端口号不同。
PCT/CN2012/072739 2011-07-08 2012-03-21 一种基于远程呈现的媒体传输方法及系统 WO2012155659A1 (zh)

Priority Applications (2)

Application Number Priority Date Filing Date Title
US14/130,264 US9344475B2 (en) 2011-07-08 2012-03-21 Media transmission method and system based on telepresence
EP12785453.7A EP2731331A4 (en) 2011-07-08 2012-03-21 METHOD AND SYSTEM FOR MULTIMEDIA TRANSMISSION BASED ON TELEPRESENCE

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201110191518.4 2011-07-08
CN201110191518.4A CN102868880B (zh) 2011-07-08 2011-07-08 一种基于远程呈现的媒体传输方法及系统

Publications (1)

Publication Number Publication Date
WO2012155659A1 true WO2012155659A1 (zh) 2012-11-22

Family

ID=47176252

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2012/072739 WO2012155659A1 (zh) 2011-07-08 2012-03-21 一种基于远程呈现的媒体传输方法及系统

Country Status (4)

Country Link
US (1) US9344475B2 (zh)
EP (1) EP2731331A4 (zh)
CN (1) CN102868880B (zh)
WO (1) WO2012155659A1 (zh)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104219486A (zh) * 2013-06-01 2014-12-17 中兴通讯股份有限公司 远程呈现端点的能力交互方法及装置、数据流

Families Citing this family (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102868873B (zh) * 2011-07-08 2017-10-17 中兴通讯股份有限公司 一种远程呈现方法、终端和系统
CN102843542B (zh) * 2012-09-07 2015-12-02 华为技术有限公司 多流会议的媒体协商方法、设备和系统
CN104219484B (zh) * 2013-06-01 2019-05-31 中兴通讯股份有限公司 远程呈现端点的能力交互方法及装置
CN104219487B (zh) * 2013-06-01 2019-05-07 中兴通讯股份有限公司 远程呈现端点的能力交互方法及装置、数据流
CN104519304B (zh) * 2013-09-29 2018-07-20 中兴通讯股份有限公司 端点信息交互处理方法、装置及远程呈现端点
CN104519023B (zh) * 2013-09-29 2019-08-27 中兴通讯股份有限公司 能力协商处理方法、装置及远程呈现端点
CN104519305A (zh) * 2013-09-29 2015-04-15 中兴通讯股份有限公司 端点信息交互处理方法、装置及远程呈现端点
EP3051773B1 (en) * 2013-10-25 2020-09-16 Huawei Technologies Co., Ltd. Multi-path auxiliary stream control method, control device, node and system
CN104601339B (zh) * 2013-10-30 2018-05-25 华为技术有限公司 网真会议的控制方法、装置、服务器和终端设备
CN105516065B (zh) * 2014-09-26 2018-08-14 华为技术有限公司 一种媒体控制方法和设备
CN105657327A (zh) * 2014-11-28 2016-06-08 中兴通讯股份有限公司 一种音视频处理方法、装置及系统
CN109151231B (zh) * 2017-06-27 2022-07-19 中兴通讯股份有限公司 客服系统、呼入业务的处理方法以及业务的处理方法
KR101861561B1 (ko) * 2017-07-24 2018-05-29 (주)유프리즘 복수 개의 영상회의용 단말을 이용하여 멀티 스크린 영상회의를 제공할 수 있는 영상회의 서버 및 그 방법
CN113890659A (zh) * 2021-03-17 2022-01-04 广州市保伦电子有限公司 一种基于管道的音频广播方法

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101198008A (zh) * 2008-01-03 2008-06-11 中兴通讯股份有限公司 一种实现多屏多画面的方法和系统
CN101534413A (zh) * 2009-04-14 2009-09-16 深圳华为通信技术有限公司 一种远程呈现的系统、装置和方法
CN101668160A (zh) * 2009-09-10 2010-03-10 深圳华为通信技术有限公司 视频图像数据处理方法、装置及视频会议系统及终端

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101471804B (zh) 2007-12-28 2011-08-10 华为技术有限公司 一种音频处理方法、系统和控制服务器
US8355041B2 (en) * 2008-02-14 2013-01-15 Cisco Technology, Inc. Telepresence system for 360 degree video conferencing
CN101370114B (zh) * 2008-09-28 2011-02-02 华为终端有限公司 视频及音频处理方法、多点控制单元和视频会议系统
NO332009B1 (no) * 2008-12-12 2012-05-21 Cisco Systems Int Sarl Fremgangsmate for a igangsette kommunikasjonsforbindelser
US8471888B2 (en) 2009-08-07 2013-06-25 Research In Motion Limited Methods and systems for mobile telepresence
US8537195B2 (en) * 2011-02-09 2013-09-17 Polycom, Inc. Automatic video layouts for multi-stream multi-site telepresence conferencing system

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101198008A (zh) * 2008-01-03 2008-06-11 中兴通讯股份有限公司 一种实现多屏多画面的方法和系统
CN101534413A (zh) * 2009-04-14 2009-09-16 深圳华为通信技术有限公司 一种远程呈现的系统、装置和方法
CN101668160A (zh) * 2009-09-10 2010-03-10 深圳华为通信技术有限公司 视频图像数据处理方法、装置及视频会议系统及终端

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See also references of EP2731331A4 *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104219486A (zh) * 2013-06-01 2014-12-17 中兴通讯股份有限公司 远程呈现端点的能力交互方法及装置、数据流
CN104219486B (zh) * 2013-06-01 2019-05-17 中兴通讯股份有限公司 远程呈现端点的能力交互方法及装置

Also Published As

Publication number Publication date
CN102868880A (zh) 2013-01-09
CN102868880B (zh) 2017-09-05
EP2731331A1 (en) 2014-05-14
US9344475B2 (en) 2016-05-17
EP2731331A4 (en) 2015-04-01
US20140139618A1 (en) 2014-05-22

Similar Documents

Publication Publication Date Title
WO2012155659A1 (zh) 一种基于远程呈现的媒体传输方法及系统
WO2012155660A1 (zh) 一种远程呈现方法、终端和系统
US10045052B2 (en) System and method for transferring data
US8600530B2 (en) Method for determining an audio data spatial encoding mode
US8767591B2 (en) Multi-point video conference system and media processing method thereof
JP5320406B2 (ja) オーディオ処理の方法、システム、及び制御サーバ
WO2010034254A1 (zh) 视频及音频处理方法、多点控制单元和视频会议系统
WO2012119465A1 (zh) 一种远程呈现技术中媒体数据发送和播放的方法及系统
WO2012041117A1 (zh) 一种对视频会议终端集中监控的方法和系统及相关装置
JPWO2005094077A1 (ja) 多地点会議システムおよび多地点会議装置
WO2007140668A1 (fr) procédé et appareil pour réaliser une surveillance à distance dans un système de téléconférence
WO2015127799A1 (zh) 协商媒体能力的方法和设备
WO2011057511A1 (zh) 实现混音的方法、装置和系统
WO2011076041A1 (zh) 呼叫建立的方法、装置和系统
US9088690B2 (en) Video conference system
WO2012175025A1 (zh) 远程呈现会议系统、远程呈现会议的录制与回放方法
WO2014161403A1 (zh) 视频会议电视终端视频源的接入方法和系统
WO2011134224A1 (zh) 一种视频处理方法及其系统、mcu视频处理单元
WO2012075930A1 (zh) 多路辅流控制方法、装置及网络系统
WO2011023024A1 (zh) 一种监控信息传输的方法及系统
WO2013178185A1 (zh) 电视会议系统中丢包补偿的处理方法及装置
KR20100111844A (ko) 이동통신 시스템에서 화이트 보드 서비스 제공을 위한 장치 및 방법
WO2010094213A1 (zh) 多路媒体流传输和接收的方法、装置及系统
WO2007068139A1 (fr) Systeme et procede pour la commande de flux multimedias sur la communication video a plusieurs abonnes

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 12785453

Country of ref document: EP

Kind code of ref document: A1

WWE Wipo information: entry into national phase

Ref document number: 14130264

Country of ref document: US

NENP Non-entry into the national phase

Ref country code: DE

WWE Wipo information: entry into national phase

Ref document number: 2012785453

Country of ref document: EP