WO2021196860A1 - 视频传输方法、系统、相关设备及存储介质 - Google Patents

视频传输方法、系统、相关设备及存储介质 Download PDF

Info

Publication number
WO2021196860A1
WO2021196860A1 PCT/CN2021/074539 CN2021074539W WO2021196860A1 WO 2021196860 A1 WO2021196860 A1 WO 2021196860A1 CN 2021074539 W CN2021074539 W CN 2021074539W WO 2021196860 A1 WO2021196860 A1 WO 2021196860A1
Authority
WO
WIPO (PCT)
Prior art keywords
video picture
resolution
session
video
message
Prior art date
Application number
PCT/CN2021/074539
Other languages
English (en)
French (fr)
Inventor
孟斌
张云华
祝晔
Original Assignee
华为技术有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 华为技术有限公司 filed Critical 华为技术有限公司
Priority to EP21779370.2A priority Critical patent/EP4114026A4/en
Publication of WO2021196860A1 publication Critical patent/WO2021196860A1/zh

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47End-user applications
    • H04N21/478Supplemental services, e.g. displaying phone caller identification, shopping application
    • H04N21/4788Supplemental services, e.g. displaying phone caller identification, shopping application communicating with other users, e.g. chatting
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00Television systems
    • H04N7/14Systems for two-way working
    • H04N7/141Systems for two-way working between two video terminals, e.g. videophone
    • H04N7/147Communication arrangements, e.g. identifying the communication as a video-communication, intermediate storage of the signals
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L65/00Network arrangements, protocols or services for supporting real-time applications in data packet communication
    • H04L65/10Architectures or entities
    • H04L65/102Gateways
    • H04L65/1043Gateway controllers, e.g. media gateway control protocol [MGCP] controllers
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L65/00Network arrangements, protocols or services for supporting real-time applications in data packet communication
    • H04L65/1066Session management
    • H04L65/1069Session establishment or de-establishment
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L65/00Network arrangements, protocols or services for supporting real-time applications in data packet communication
    • H04L65/1066Session management
    • H04L65/1101Session protocols
    • H04L65/1104Session initiation protocol [SIP]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L65/00Network arrangements, protocols or services for supporting real-time applications in data packet communication
    • H04L65/40Support for services or applications
    • H04L65/403Arrangements for multi-party communication, e.g. for conferences
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L65/00Network arrangements, protocols or services for supporting real-time applications in data packet communication
    • H04L65/60Network streaming of media packets
    • H04L65/75Media network packet handling
    • H04L65/765Media network packet handling intermediate
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L65/00Network arrangements, protocols or services for supporting real-time applications in data packet communication
    • H04L65/80Responding to QoS
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/233Processing of audio elementary streams
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/234Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs
    • H04N21/2343Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements
    • H04N21/234363Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements by altering the spatial resolution, e.g. for clients with a lower screen resolution
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/234Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs
    • H04N21/2343Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements
    • H04N21/234381Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements by altering the temporal resolution, e.g. decreasing the frame rate by frame skipping
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/25Management operations performed by the server for facilitating the content distribution or administrating data related to end-users or client devices, e.g. end-user or client device authentication, learning user preferences for recommending movies
    • H04N21/258Client or end-user data management, e.g. managing client capabilities, user preferences or demographics, processing of multiple end-users preferences to derive collaborative data
    • H04N21/25808Management of client data
    • H04N21/25825Management of client data involving client display capabilities, e.g. screen resolution of a mobile phone
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/25Management operations performed by the server for facilitating the content distribution or administrating data related to end-users or client devices, e.g. end-user or client device authentication, learning user preferences for recommending movies
    • H04N21/266Channel or content management, e.g. generation and management of keys and entitlement messages in a conditional access system, merging a VOD unicast channel into a multicast channel
    • H04N21/2662Controlling the complexity of the video stream, e.g. by scaling the resolution or bitrate of the video stream based on the client capabilities
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/44Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs
    • H04N21/4402Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs involving reformatting operations of video signals for household redistribution, storage or real-time display
    • H04N21/440263Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs involving reformatting operations of video signals for household redistribution, storage or real-time display by altering the spatial resolution, e.g. for displaying on a connected PDA
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/60Network structure or processes for video distribution between server and client or between remote clients; Control signalling between clients, server and network components; Transmission of management data between server and client, e.g. sending from server to client commands for recording incoming content stream; Communication details between server and client 
    • H04N21/63Control signaling related to video distribution between client, server and network components; Network processes for video distribution between server and clients or between remote clients, e.g. transmitting basic layer and enhancement layers over different transmission paths, setting up a peer-to-peer communication via Internet between remote STB's; Communication protocols; Addressing
    • H04N21/643Communication protocols
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/60Network structure or processes for video distribution between server and client or between remote clients; Control signalling between clients, server and network components; Transmission of management data between server and client, e.g. sending from server to client commands for recording incoming content stream; Communication details between server and client 
    • H04N21/65Transmission of management data between client and server
    • H04N21/658Transmission by the client directed to the server
    • H04N21/6582Data stored in the client, e.g. viewing habits, hardware capabilities, credit card number
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00Television systems
    • H04N7/14Systems for two-way working
    • H04N7/141Systems for two-way working between two video terminals, e.g. videophone
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00Television systems
    • H04N7/14Systems for two-way working
    • H04N7/15Conference systems

Definitions

  • This application relates to the field of communication technology, and in particular to a video transmission method, a video transmission system, a video transmission device, and a storage medium.
  • the mobile communication network has been upgraded from 3G to 4G.
  • the 4G network also has the capability of high-definition voice and video calls.
  • various APPs have also developed vigorously.
  • Various instant messaging apps are constantly eroding the voice market of operators.
  • instant messaging APP will launch a more competitive business experience.
  • the two basic products that operators provide to the general public are traffic and phone calls.
  • the demand for traffic consumption has been increasing rapidly and has become the main source of income for operators.
  • the demand for phone calls has become saturated. Therefore, when planning a wireless network, operators give priority to ensuring the flow of planning, followed by making calls.
  • Traffic and phone calls behave differently in wireless networks. Among them, the traffic is characterized by small upstream traffic and large downstream traffic. Generally, the upstream traffic is 10% of the downstream traffic.
  • the construction standard recommended by domestic operators is 0.5Mbps for uplink and 5Mbps for downlink.
  • the feature of making a call is that audio calls require 24.85Kbps upstream and 24.85kbps downstream; video calls require 1Mbps upstream and 1Mbps downstream.
  • the requirement of the audio phone is also met at the same time.
  • the requirements of video telephony cannot be met.
  • most of the current wireless network consumption is in the downlink, and operators will devote more resources to the downlink.
  • the phone model consumes the same resources both upstream and downstream, which can easily cause a lot of waste of resources.
  • This application discloses a video transmission method, a video transmission system, a video transmission device and a storage medium, which can realize the use of small bandwidth in the uplink and large bandwidth in the downlink in the telephone service, so that the telephone service model can match the resource division of wireless traffic, and at the same time, it can reduce The cost of the operator.
  • an embodiment of the present application provides a video transmission method, including:
  • the first session boundary controller receives the first video picture sent by the second session boundary controller
  • the first session boundary controller performs super-division processing on the first video picture to obtain a second video picture, wherein the resolution of the second video picture is higher than the resolution of the first video picture;
  • the first session boundary controller sends the second video picture to the first terminal.
  • the first session boundary controller processes the received low-resolution first video picture sent by the second session boundary controller, and converts it into a high-resolution second video picture. Then the high-resolution second video picture is sent to the first terminal.
  • the first session boundary controller receives the first video picture with low resolution, and then sends the second video picture with high resolution to the terminal.
  • This method changes the bandwidth requirements of the existing video phone, changing it from uplink and downlink to using small bandwidth for uplink and large bandwidth for downlink, so that the video phone service model matches the resource division of wireless traffic, saving resources and reducing at the same time. The cost of the operator.
  • the method further includes:
  • the second session boundary controller performs super-division processing on the third video picture to obtain a fourth video picture, wherein the resolution of the fourth video picture is higher than the resolution of the third video picture;
  • the second session boundary controller sends the fourth video picture to the second terminal.
  • both the first session boundary controller and the second session boundary controller can support over-division.
  • the method further includes:
  • the first session boundary controller sends the third video picture to the second session boundary controller.
  • the method before the first session boundary controller receives the first video picture sent by the second session boundary controller, the method further includes:
  • the first session boundary controller sends a first message to the first terminal, where the first message carries the resolution of the video picture used in the sending direction of the first session boundary controller to instruct the first terminal
  • a terminal receives the video picture sent by the first session boundary controller, where the resolution of the video picture used in the sending direction of the first session boundary controller is the resolution of the second video picture.
  • the method further includes:
  • the second session boundary controller sends a second message carrying the indication information of the super-division capability to the second terminal to instruct the second terminal to send a video image that satisfies the indication information of the super-division capability.
  • the resolution of the video picture of the super-division capability indication information is the resolution of the first video picture.
  • video transmission negotiation is performed, and the video image reception, super-division processing, and transmission are performed based on the negotiated resolution, which improves the reliability of the implementation of the video transmission technology.
  • the first session boundary controller processes the received low-resolution first video picture sent by the second session boundary controller, converts it into a high-resolution second video picture, and then converts it into a high-resolution second video picture.
  • the high-resolution second video picture is sent to the first terminal.
  • the second message is a SIP message
  • the over-division capability indication information is located in the image attribute parameter of the SIP message.
  • the second message is an RTCP message
  • the RTCP message includes a name field, wherein, when the name field corresponds to a first preset value, it indicates that the RTCP message carries the super-division capability indication information.
  • the second message is an SPS message
  • the super-division capability indication information is located in a parameter of the SPS message.
  • the second message further carries asymmetric capability indication information, and the asymmetric capability indication information indicates that the first terminal supports different resolutions in the receiving direction and the sending direction.
  • This negotiation method eliminates the need for UE1 to negotiate again to determine the resolution of its sending direction, improves the resolution negotiation mechanism for video calls, simplifies the process, and can save air interface bandwidth resources and bearer network bandwidth resources.
  • this application also provides an audio transmission method, including:
  • the first session boundary controller receives the first audio sent by the second session boundary controller
  • the first session boundary controller performs super-division processing on the first audio to obtain a second audio, wherein the bit rate of the second audio is higher than the bit rate of the first audio;
  • the first session boundary controller sends the second audio to the first terminal.
  • the first session boundary controller processes the received low-bit-rate audio, converts it into a high-bit-rate audio, and then sends the high-bit-rate audio to the terminal.
  • This method changes the bandwidth requirements of existing voice calls, changing it from uplink and downlink to using small bandwidth for uplink and large bandwidth for downlink, so that the voice call service model matches the resource division of wireless traffic, which saves resources and at the same time reduces The cost of the operator.
  • the above method also includes:
  • the second session boundary controller performs super-division processing on the third audio to obtain a fourth audio, wherein the bit rate of the fourth audio is higher than the bit rate of the third audio;
  • the second session boundary controller sends the fourth audio to the second terminal.
  • both the first session boundary controller and the second session boundary controller can achieve over-division.
  • the above method also includes:
  • the first session boundary controller sends the third audio to the second session boundary controller.
  • the method further includes:
  • the first session boundary controller sends a first message to the first terminal, where the first message carries the bit rate of the audio used in the sending direction of the first session boundary controller to indicate the first
  • the terminal receives the audio sent by the first session boundary controller, where the bit rate of the audio used in the sending direction of the first session boundary controller is the bit rate of the second audio.
  • the method also includes:
  • the second session boundary controller sends a second message carrying the indication information of the super-division capability to the second terminal to instruct the second terminal to send audio that satisfies the indication information of the super-division capability.
  • the bit rate of the audio of the super-division capability indication information is the bit rate of the first audio.
  • audio transmission negotiation is performed, and audio reception, over-division processing, and transmission are performed based on the negotiated bit rate, which improves the reliability of the audio transmission technology.
  • the first session boundary controller processes the received low bit rate audio, converts it into a high bit rate audio, and then sends the high bit rate audio to the terminal.
  • This method changes the bandwidth requirements of existing voice calls, changing it from uplink and downlink to using small bandwidth for uplink and large bandwidth for downlink, so that the voice call service model matches the resource division of wireless traffic, which saves resources and at the same time reduces The cost of the operator.
  • the second message is a SIP message
  • the super-division capability indication information is located in the SIP message.
  • the second message is an RTCP message
  • the RTCP message includes a name field, wherein, when the name field corresponds to a first preset value, it indicates that the RTCP message carries the super-division capability indication information.
  • the second message is an SPS message
  • the super-division capability indication information is located in a parameter of the SPS message.
  • the second message further carries asymmetric capability indication information, and the asymmetric capability indication information indicates that the first terminal supports different code rates in the receiving direction and the sending direction.
  • audio transmission negotiation is performed, and audio reception, over-division processing, and transmission are performed based on the negotiated bit rate, which improves the reliability of the audio transmission technology.
  • the negotiation method eliminates the need for UE1 to negotiate again to determine the resolution of its sending direction, improves the audio call codec speed negotiation mechanism, simplifies the process, and saves air interface bandwidth resources and bearer network bandwidth resources.
  • the first session boundary controller processes the received low bit rate audio, converts it into a high bit rate audio, and then sends the high bit rate audio to the terminal.
  • This method changes the bandwidth requirements of existing voice calls, changing it from uplink and downlink to using small bandwidth for uplink and large bandwidth for downlink, so that the voice call service model matches the resource division of wireless traffic, which saves resources and at the same time reduces The cost of the operator.
  • an embodiment of the present application provides a multimedia data transmission method, which mainly includes:
  • the terminal sends a multimedia request message to the network side device, the multimedia request message carries asymmetric capability indication information, and the asymmetric capability indication information indicates that the first terminal supports different code rates or resolutions in the receiving direction and the sending direction ;
  • the terminal Receiving, by the terminal, a multimedia response message sent by the network side device, where the multimedia response message carries an uplink resolution and a downlink resolution or an uplink code rate and a downlink code rate;
  • the terminal uses the uplink resolution or the uplink code rate to send uplink multimedia data, and receives the downlink multimedia data transmitted using the downlink resolution or the downlink code rate.
  • the multimedia request message is an Invite message
  • the asymmetric capability indication information is located in the SDP field of the invite message.
  • the multimedia request message is an RTCP message
  • the RTCP message carries the asymmetric capability indication information
  • the multimedia request message is an SPS message
  • the SPS message carries the asymmetric capability indication information.
  • the network side device receives the multimedia request message of the terminal, and determines the uplink resolution and downlink resolution or uplink bit rate of the multimedia data transmitted by the terminal according to the asymmetric capability indication information carried in the multimedia request message And the downlink code rate, and then the network side device sends a multimedia response message to the terminal, and the multimedia response message carries the uplink resolution and the downlink resolution or the uplink code rate and the downlink code rate of the transmitted multimedia data.
  • the network side device is the first SBC.
  • the multimedia data is audio data or video data.
  • this application also provides a video transmission method, including:
  • the first terminal receives the second video picture sent by the first session boundary controller, where the second video picture is obtained by the first session boundary controller performing super-division processing on the first video picture, and the first The video picture is sent by the second session boundary controller to the first session boundary controller, wherein the resolution of the second video picture is higher than the resolution of the first video picture.
  • the second terminal receives a fourth video picture sent by the second session boundary controller, where the fourth video picture is obtained by the second session boundary controller performing super-division processing on the third video picture.
  • the third video picture is sent by the first session boundary controller to the second session boundary controller, wherein the resolution of the fourth video picture is higher than the resolution of the third video picture.
  • the method before the first terminal receives the second video picture sent by the first session boundary controller, the method further includes:
  • the first terminal receives a first message sent by the first session boundary controller, where the first message carries the resolution of the video picture used in the sending direction of the first session boundary controller;
  • a terminal receives the video picture sent by the first session boundary controller, where the resolution of the video picture used in the sending direction of the first session boundary controller is the resolution of the second video picture.
  • the method also includes:
  • the second terminal sends to the second session boundary controller a video picture that satisfies the super-division capability indication information, and the resolution of the video picture that satisfies the super-division capability indication information is the first video picture Resolution.
  • the present application also provides a video transmission system, including a first session boundary controller and a second session boundary controller, wherein the first session boundary controller is used for:
  • the second session boundary controller is used to:
  • the first session boundary controller is also used for:
  • the first session boundary controller is also used for:
  • the second session boundary controller is also used for:
  • the resolution of the video picture is the resolution of the first video picture.
  • the present application also provides an audio transmission system, including a first session boundary controller and a second session boundary controller, wherein the first session boundary controller is used for:
  • the second audio is sent to the first terminal.
  • the second session boundary controller is used for:
  • the fourth audio is sent to the second terminal.
  • the first session boundary controller is also used for:
  • the third audio is sent to the second session boundary controller.
  • the first session boundary controller is also used for:
  • the bit rate of the audio used in the sending direction of the first session boundary controller is the bit rate of the second audio.
  • the second session boundary controller is also used for:
  • the code rate of is the code rate of the first audio.
  • this application provides a transmission device that is used for the method and/or provided in any possible implementation manner of the first aspect and/or any possible implementation manner of the second aspect The method provided in any possible implementation manner of the third aspect and/or the method provided in any possible implementation manner of the fourth aspect.
  • the present application provides a computer storage medium, including computer instructions, which when the computer instructions run on an electronic device, cause the electronic device to execute any of the possible implementation manners and/or the first aspect
  • the embodiments of the present application provide a computer program product.
  • the computer program product runs on a computer, the computer executes any possible implementation manner of the first aspect and/or any possible implementation of the second aspect.
  • an embodiment of the present application provides a video transmission device, which includes a memory and a processor coupled to each other, the memory stores computer program code, and the processor calls and executes the computer program code in the memory , Enabling the video transmission device to execute the video/audio transmission method as described in the first aspect and/or the second aspect.
  • an embodiment of the present application provides a terminal, which includes a display, an input/output interface, a memory and a processor that are coupled to each other.
  • the memory stores computer program code, and the processor calls and executes the The computer program code in the memory enables the terminal to execute the multimedia data transmission method as described in the third aspect and/or the fourth aspect.
  • the device described in the fifth aspect, the device described in the sixth aspect, the device described in the seventh aspect, the computer storage medium described in the eighth aspect, and the computer program product described in the ninth aspect are provided above
  • the device described in the tenth aspect and the terminal described in the eleventh aspect are both used to execute the method provided in any one of the first aspect, the method provided in any one of the second aspect, and the method provided in any one of the third aspect.
  • Fig. 1 is a schematic diagram of a video transmission system provided by an embodiment of the present application.
  • FIG. 2 is a schematic flowchart of a video transmission method provided by an embodiment of the present application.
  • FIG. 3 is a schematic diagram of the structure of a message provided by an embodiment of the present application.
  • FIG. 4 is a schematic diagram of the structure of a message provided by an embodiment of the present application.
  • FIG. 5 is a schematic flowchart of a video transmission negotiation process provided by an embodiment of the present application.
  • FIG. 6 is a schematic flowchart of another video transmission negotiation process provided by an embodiment of the present application.
  • FIG. 7 is a schematic flowchart of still another video transmission negotiation process provided by an embodiment of the present application.
  • FIG. 8 is a schematic flowchart of another video transmission negotiation process provided by an embodiment of the present application.
  • FIG. 9 is a schematic flowchart of an audio transmission negotiation process provided by an embodiment of the present application.
  • FIG. 10 is a schematic structural diagram of a video transmission device provided by an embodiment of the present application.
  • FIG. 11 is a schematic structural diagram of a terminal provided by an embodiment of the present application.
  • the video transmission system may include a first terminal UE1, a second terminal UE2, a first session boundary controller SBC1, and a second session boundary controller SBC2.
  • the first session border controller SBC1 can be used to receive the video picture sent by the second session border controller and send it to the first terminal UE1.
  • the second session border controller SBC2 may be used to receive the video picture sent by the first session border controller and send it to the second terminal UE2.
  • first session border controller SBC1 can also be used to receive the video session request sent by the first terminal UE1, and send the video session request to the second session border controller SBC2.
  • the second session border controller SBC2 can be used to receive the video session request sent by the first session border controller SBC1 and send it to the second terminal UE2.
  • the aforementioned second session border controller SBC2 may also receive the message sent by the second terminal UE2 and forward it to the first session border controller SBC1.
  • the first session border controller SBC1 may be used to receive the message sent by the second session border controller SBC2 and forward it to the first terminal UE1.
  • the above-mentioned SBC1 and SBC2 may be the same SBC.
  • UE1 and UE2 are in the same cell or the same city.
  • FIG. 2 it is a schematic flowchart of a video transmission method provided by an embodiment of this application.
  • the video transmission method includes 201-203, as follows:
  • the first session boundary controller receives a first video image sent by the second session boundary controller
  • the video transmission method can be applied to a video transmission system, where the video transmission system can include a first terminal UE1, a first session boundary controller SBC1, a second session boundary controller SBC2, and a second terminal UE2.
  • the second terminal UE2 sends the first video picture to the second session boundary controller SBC2, and the second session boundary controller SBC2 forwards the first video picture to the first session boundary controller SBC1.
  • the first session boundary controller performs super-division processing on the first video picture to obtain a second video picture, wherein the resolution of the second video picture is higher than the resolution of the first video picture Rate;
  • the first session boundary controller supports over-division, that is, the first session boundary controller can improve the resolution of the received video picture.
  • Super resolution means that a low-resolution video is restored to a high-resolution video through mathematical operations, or a low-quality audio is restored to a high-quality audio through mathematical operations.
  • the above-mentioned first session boundary controller obtains a high-resolution second video picture by processing a low-resolution first video picture.
  • the first session boundary controller sends the second video picture to the first terminal.
  • the video transmission method can be applied to video call scenarios.
  • the first session boundary controller processes the received low-resolution first video picture sent by the second session boundary controller, and converts it into a high-resolution second video picture. Then the high-resolution second video picture is sent to the first terminal. That is, the first session boundary controller receives the first low-resolution video picture, and then sends the second high-resolution video picture to the terminal.
  • This method changes the bandwidth requirements of the existing video phone, changing it from uplink and downlink to using small bandwidth for uplink and large bandwidth for downlink, so that the video phone service model matches the resource division of wireless traffic, saving resources and reducing at the same time. The cost of the operator.
  • the first session boundary controller and the second session boundary controller in the embodiment of the present application may be the same device.
  • the first session boundary controller may receive the first video image sent by the second terminal.
  • the second session boundary controller may also support super-division processing, and the method may further include:
  • the second session boundary controller performs super-division processing on the third video picture to obtain a fourth video picture, wherein the resolution of the fourth video picture is higher than the resolution of the third video picture;
  • the second session boundary controller sends the fourth video picture to the second terminal.
  • UE1 sends the third video picture to the first session boundary controller, and the first session boundary controller sends the third video picture to the second session boundary controller.
  • the second session boundary controller supports super-division, it performs super-division processing on the third video picture to obtain a fourth video picture, and then the second session boundary controller sends the fourth video picture to UE2, in this way, completes the super-division processing.
  • the method may further include:
  • the first session boundary controller sends the third video picture to the second session boundary controller.
  • the video image sent by the UE1 can be directly forwarded to the second session border controller via the first session border controller, and the second session border controller can perform super-division processing.
  • the above-mentioned video transmission method before the method shown in FIG. 2 is executed, it further includes performing video transmission negotiation processing, and then determining the resolution of the above-mentioned first video image and the resolution of the above-mentioned second video image. Resolution.
  • the video transmission methods include A1-A5, as follows:
  • the first session boundary controller sends a first message to the first terminal, where the first message carries the resolution of the video picture used in the sending direction of the first session boundary controller to instruct the first terminal Receiving a video picture sent by the first session boundary controller, wherein the resolution of the video picture used in the sending direction of the first session boundary controller is the resolution of the second video picture;
  • the second session boundary controller sends a second message carrying over-division capability indication information to the second terminal to instruct the second terminal to send a video picture that satisfies the over-division capability indication information.
  • the resolution of the video picture of the over-division capability indication information is the resolution of the first video picture;
  • the above-mentioned super-division capability indication information may be information that the first session boundary controller supports super-division processing, for example, it may include that the first session boundary controller supports the received resolution of 180*320 in the receiving direction.
  • the video screen is converted into a 360*640 video screen. It may also include that the first session boundary controller supports converting a video screen with a resolution of 180*320 into a video screen with a resolution of 360*640 in the sending direction.
  • the second message carrying the over-division capability indication information may be the second session boundary controller directly forwarding the message sent by the first session boundary controller to the second session boundary controller, and the over-division capability indication information is the above-mentioned first session boundary controller.
  • the information that the session border controller supports for super-division processing may be the second session boundary controller directly forwarding the message sent by the first session boundary controller to the second session boundary controller, and the over-division capability indication information is the above-mentioned first session boundary controller.
  • the video transmission negotiation process is completed through steps A1-A2, in which the resolution of the video picture sent by the second session boundary controller to the first session boundary controller is determined, and the resolution of the video image sent by the first session boundary controller to the first terminal is determined. The resolution of the video screen.
  • the first session boundary controller receives the first video picture sent by the second session boundary controller
  • the second terminal UE2 sends the first video picture to the second session boundary controller SBC2, and the second session boundary controller SBC2 forwards the first video picture to the first session boundary controller SBC1.
  • the first session boundary controller performs super-division processing on the first video picture to obtain the second video picture, wherein the resolution of the second video picture is higher than that of the first video picture Resolution
  • the first session boundary controller supports over-division, that is, the first session boundary controller can improve the resolution of the received video picture.
  • the above-mentioned first session boundary controller obtains a high-resolution second video picture by processing a low-resolution first video picture.
  • the first session boundary controller sends the second video picture to the first terminal.
  • video transmission negotiation is performed, and the video image reception, super-division processing, and transmission are performed based on the negotiated resolution, which improves the reliability of the implementation of the video transmission technology.
  • the first session boundary controller processes the received low-resolution first video picture sent by the second session boundary controller, converts it into a high-resolution second video picture, and then converts it into a high-resolution second video picture.
  • the high-resolution second video picture is sent to the first terminal.
  • the above-mentioned message for video transmission negotiation processing may be a session initiation protocol (session initiation protocol, SIP) message.
  • SIP session initiation protocol
  • the over-division capability indication information may be located in the image attribute parameter of the SIP message.
  • the imageattr parameter format is as follows:
  • the magnifying grouping identifier Ampid is used to indicate the super-resolution grouping of the current resolution.
  • the enlarged grouping identifier Ampid is the super-division capability indication information.
  • the description is as follows:
  • the enlargement grouping identifier Ampid is added to the image attribute imageattr parameter of the SIP message to indicate the over-division capability indication information, and the negotiation can be completed on the signaling side.
  • the foregoing message for video transmission negotiation processing may be an RTCP message.
  • the second message is a Realtime Transport Control Protocol (RTCP) message
  • the RTCP message includes a name field, where the name field indicates the RTCP message when the name field corresponds to the first preset value Carry the super-division capability indication information.
  • the RTCP message includes a subtype field, a name field, and a PT field.
  • the PT field has a value of 204
  • the subtype field has a value of 1, and the name field is change.
  • the RTCP message When -image-size is used, it is used to indicate that the RTCP message also carries over-division capability indication information.
  • the above values are only an example, which does not limit the specific values.
  • the above-mentioned super-division capability indication information is obtained.
  • the negotiation can be completed on the media plane.
  • the above-mentioned message for video transmission negotiation processing may be a sequence parameter set (seq_parameter_set, SPS) message.
  • the second message is an SPS message
  • the super-division capability indication information is located in a parameter of the SPS message.
  • the SPS parameter in the H264 encoding and decoding will carry the resolution of the picture.
  • the embodiments of the present application extend the SPS parameters to carry over-division capability indication information.
  • the extended parameters are S_pic_width_in_mbs_minus1, S_pic_height_in_map_units_minus1.
  • the above two parameters respectively represent the width of the picture in units of macro blocks and the height of the picture in units of macro blocks. It is used to inform the opposite end that the picture currently received by the local end can be super-divided and synthesized according to the resolution.
  • the SPS parameter is extended to carry the super-division capability indication information.
  • the video transmission negotiation process includes steps 501-506, which are specifically as follows:
  • the user terminal UE1 sends a message to the first session border controller SBC1 to initiate a video session request.
  • the session request may be an Invite message, which carries the resolution of the video screen supported by the UE1 in the receiving direction and the resolution of the video screen supported by the UE1 in the sending direction.
  • All the resolutions supported by the UE1 in the receiving direction include 180*320, 360*640, and 480*640
  • all the resolutions supported by the UE1 in the sending direction include 180*320 and 360*640.
  • the first session boundary controller SBC1 After receiving the Invite message, the first session boundary controller SBC1 processes the Invite message according to the resolution supported by UE1 and the resolution supported by SBC1 carried in the Invite message to obtain an updated Invite message, SBC1 sends the updated Invite message to the second terminal UE2 through the second session border controller SBC2.
  • the first session boundary controller SBC1 supports over-division, that is, supports increasing the resolution of the received video picture.
  • the foregoing processing of the Invite message includes adding the super-division capability indication information supported by the first session boundary controller to the Invite message.
  • SBC1 sends the updated message to the second terminal UE2 through the second session border controller SBC2.
  • the updated message is used to inform the called side SBC1 to support 180* 320 super-divided to 360*640.
  • the second terminal UE2 sends a first session progress response message to the second session border controller SBC2.
  • UE2 responds by supporting the superdivision of 180*320 to 360*640 in the receiving direction according to the SBC1 indicated in the message.
  • the first session progress response message carries that UE2 supports low resolution 180*320 in the sending direction, and supports 180*320 and 360*640 in the receiving direction.
  • the second session border controller SBC2 forwards the first session progress response message to the first session border controller SBC1.
  • the second session boundary controller SBC2 receives the first session progress response message, it can be known according to the second message that SBC1 supports over-division. Since SBC2 does not support over-division, SBC2 responds to the first session. The progress response message is not processed, and is directly forwarded to SBC1.
  • the first session boundary controller SBC1 processes the first session progress response message to obtain a second session progress response message, and sends the second session progress response message to the first terminal UE1.
  • the first session boundary controller SBC1 learns that the UE2 supports low resolution 180*320 in the sending direction and 180*320 and 360*640 in the receiving direction. SBC1 determines that the resolution in the sending direction is the highest 360*640. Since SBC1 cannot determine the resolution of the video screen sent by UE1, SBC1 adjusts the resolution in the receiving direction to 180*320, 360*640 . SBC1 obtains the second session progress response message by adjusting the resolution in the downlink direction in the first session progress response message.
  • the first terminal UE1 sends a third message to UE2 according to the second session progress response message to complete the video transmission negotiation process.
  • UE1 makes a selection according to the resolution of the receiving direction in the second session progress response message, where UE1 selects a low resolution of 180*320 Send it.
  • UE1 completes the aforementioned video transmission negotiation process by sending an update message.
  • the result of the video transmission negotiation process is: UE1 sends a low-resolution 180*320 video image to SBC1, so that SBC1 performs super-division processing on the 180*320 video image to obtain a 360*640 video image, and then Sent to UE2 via SBC2.
  • the result of the video transmission negotiation process also includes: UE2 sends a low-resolution 180*320 video picture to SBC1 through SBC2, so that SBC1 performs super-division processing on the 180*320 video picture to obtain a 360*640 video picture, and then Send to UE1.
  • the resolution of the video picture to be subjected to the super-division processing is determined.
  • super-segment processing can be performed.
  • the video transmission negotiation process includes steps 601-606, which are specifically as follows:
  • the user terminal UE1 sends a video session request to the second session border controller SBC2 through the first session border controller SBC1.
  • the session request can be an Invite message, which carries the resolution of the video screen supported by UE1 in the receiving direction and the resolution of the video screen supported by UE1 in the sending direction.
  • the video screen resolution supported by UE1 in the receiving direction Resolutions include 180*320, 360*640, and 480*640
  • the resolutions supported by UE1 in the sending direction include 180*320 and 360*640.
  • SBC1 does not support over-division, and it can directly forward the message to the second session border controller SBC2 without adjusting the message.
  • the second session border controller SBC2 After receiving the Invite message, the second session border controller SBC2 processes it to obtain an updated Invite message, and sends the updated Invite message to UE2.
  • the second session border controller SBC2 supports over-division. Therefore, SBC2 adds the indication information of the super-division capability supported by the SBC2 to the above Invite message. Specifically, as shown in FIG. 6, SBC2 updates the message to instruct UE2 to give priority to the resolution of its receiving direction by 180*320.
  • the second terminal UE2 sends a first session progress response message to the second session border controller SBC2.
  • the UE2 responds according to the SBC2 indicated in the Invite message to support priority 180*320 in the receiving direction.
  • the first session progress response message carries that UE2 supports low resolution 180*320 in the sending direction, and supports 180*320 and 360*640 in the receiving direction.
  • the second session boundary controller SBC2 After receiving the first session progress response message, the second session boundary controller SBC2 processes the first session progress response message to obtain a second session progress response message, and sends the second session progress response message to The first session border controller SBC1.
  • the second session boundary controller SBC2 receives the first session progress response message, according to the message sent by UE1, it can be known that SBC1 does not support over-score. Since SBC2 supports over-score, SBC2 adjusts the progress of the first session
  • the sending direction in the response message is high resolution 360*640, and the resolution of the receiving direction is 180*320 and 360*640.
  • the first session boundary controller SBC1 processes the second session progress response message to obtain a third session progress response message, and sends it to the first terminal UE1.
  • the first session boundary controller SBC1 After the first session boundary controller SBC1 receives the second session progress response message, it learns that the SBC2 supports a high resolution 360*640 in the sending direction. SBC1 determines that the resolution of its sending direction to UE1 is a high resolution of 360*640. Since SBC1 cannot determine the resolution of the video image sent by UE1, SBC1 selects the resolution in the receiving direction as 180*320, 360*640.
  • the first terminal UE1 sends a negotiation completion message to UE2 to complete the video transmission negotiation process.
  • UE1 selects a low resolution 180*320 for transmission.
  • UE1 completes the aforementioned video transmission negotiation process by sending an update message.
  • the result of the above negotiation is that UE1 sends low-resolution video images to SBC2, and SBC2 can send high-resolution video images to UE2. Moreover, UE2 can also send low-resolution video images to SBC2, and SBC2 can send high-resolution video images to UE1.
  • the video transmission negotiation process includes steps 701-706, which are specifically as follows:
  • the user terminal UE1 sends a video session request to the first session border controller SBC1.
  • the session request can be an Invite message, which carries the resolution of the video screen supported by UE1 in the receiving direction and the resolution of the video screen supported by UE1 in the sending direction.
  • the resolution of the video screen supported by UE1 in the receiving direction Resolutions include 180*320, 360*640, and 480*640
  • the resolutions supported by UE1 in the sending direction include 180*320 and 360*640.
  • the first session boundary controller SBC1 After receiving the Invite message, the first session boundary controller SBC1 processes the Invite message according to the resolution supported by UE1 and the resolution supported by SBC1 carried in the Invite message to obtain the updated Invite message, SBC1 sends the updated Invite message to the second terminal UE2 through the second session border controller SBC2;
  • the first session border controller SBC1 supports over-division.
  • the foregoing processing of the Invite message includes adding the super-division capability indication information supported by the first session boundary controller to the Invite message.
  • the SBC1 sends the updated Invite message to the second terminal UE2 through the second session border controller SBC2.
  • the second terminal UE2 sends a first session progress response message to the second session border controller SBC2.
  • UE2 responds by supporting the superdivision of 180*320 to 360*640 in the receiving direction according to the SBC1 indicated in the Invite message.
  • the first session progress response message carries that UE2 supports low resolution 180*320 in the sending direction, and supports 180*320 and 360*640 in the receiving direction.
  • the second session boundary controller SBC2 After receiving the first session progress response message, the second session boundary controller SBC2 processes the first session progress response message to obtain a second session progress response message, and forwards it to the first session boundary control ⁇ SBC1;
  • SBC2 After the second session boundary controller SBC2 receives the first session progress response message, it can be known according to the updated message that SBC1 supports over-division. Since SBC2 also supports over-division, SBC2 sends its Choose the low resolution 180*320, and the receiving direction is 180*320 and 360*640. SBC2 sends the second session progress response message to SBC1.
  • the first session boundary controller SBC1 processes the second session progress response message to obtain a third session progress response message, and sends the third session progress response message to the first terminal UE1;
  • SBC1 supports super-division, and the resolution to determine the sending direction selects the highest 360*640. Since SBC1 cannot determine the resolution of the video picture sent by UE1, SBC1 selects the resolution of its receiving direction as 180 *320, 360*640.
  • the first terminal UE1 sends an update message to UE2 to complete the video transmission negotiation process.
  • UE1 selects a low resolution 180*320 for transmission.
  • UE1 completes the aforementioned video transmission negotiation process by sending an update message.
  • the result of the above negotiation is that UE1 sends low-resolution video images to SBC2, and SBC2 can send high-resolution video images to UE2. Moreover, UE2 can send low-resolution video images to SBC1, and SBC1 can send high-resolution video images to UE2.
  • the foregoing embodiment is only described with an example in which the session boundary controller supports super-division. If it can be combined, the terminal side may also support super-division, for example, UE1 supports super-division or UE2 supports super-division, or both UE1 and UE2 support super-division. This plan does not make specific restrictions on this.
  • the prior art does not define how the calling side as both parties to the call informs the opposite end, whether the calling side uses different resolutions in the sending and receiving directions.
  • This application defines an optional renegotiation process. In other words, when UE1 and UE2 are in negotiation, UE1 sends two negotiations to determine the resolution of the sending direction. Step 506 in FIG. 5, step 606 in FIG. 6, and step 706 in FIG. 7 are all re-negotiation processes. If this negotiation method is adopted, the second negotiation will increase the consumption of the signaling plane.
  • an embodiment of the present application also provides a negotiation method, which performs negotiation based on asymmetric capability indication information, which indicates that, for example, the first terminal supports different resolutions in the receiving direction and the sending direction, or in The receiving direction and the sending direction support different rates, etc.
  • asymmetric capability indication information which indicates that, for example, the first terminal supports different resolutions in the receiving direction and the sending direction, or in The receiving direction and the sending direction support different rates, etc.
  • Table 2 when the UE supports asymmetric video-codec or video-profile, it means that the UE supports different resolutions for uplink and downlink.
  • the UE supports asymmetric audio-codec it means that the UE supports different rates for uplink and downlink.
  • the above-mentioned asymmetric capability indication information can be added to the SDP of the SIP message.
  • the steps of the video transmission negotiation process can be simplified. Taking the example that the session boundary controllers on the calling side and the called side both support super-division, as shown in Figure 8, the video transmission negotiation process includes steps 801-805, which are specifically as follows:
  • the user terminal UE1 sends a message to the first session border controller SBC1 to initiate a video session request.
  • the message carries the resolutions of all video images supported by UE1 in the receiving direction and all video images supported by UE1 in the sending direction.
  • the resolution of the UE1 also carries the asymmetric capability indication information, indicating that the UE1 has different resolutions for the receiving direction and the sending direction.
  • the first session border controller SBC1 After receiving the above message, the first session border controller SBC1 processes the message according to the resolution supported by UE1 and the resolution supported by SBC1 carried in the message to obtain an updated Message, SBC1 sends the updated message to the second terminal UE2 through the second session border controller SBC2;
  • the first session border controller SBC1 supports over-division.
  • the foregoing processing of the message includes adding the super-division capability indication information supported by the first session boundary controller to the message.
  • the SBC1 sends the updated message to the second terminal UE2 through the second session border controller SBC2.
  • the second terminal UE2 sends a first session progress response message to the second session border controller SBC2.
  • UE2 judges its sending ability according to the recv parameter in the received message. Since the recommendation is to use 180*320, the send direction in the 183 message selects a resolution of 180*320. In addition, since UE2 also supports asymmetric capabilities, it also knows that UE1 supports asymmetric capabilities, so UE2 selects 360*640 in the receiving direction. Since UE2 does not support the super-division capability, it does not carry the ampid parameter in the 183 message.
  • the second session boundary controller SBC2 After receiving the first session progress response message, the second session boundary controller SBC2 processes the first session progress response message to obtain a second session progress response message, and forwards it to the first session boundary control ⁇ SBC1;
  • SBC2 After the second session boundary controller SBC2 receives the first session progress response message, it can be known from the updated message that SBC1 supports over-division. Since SBC2 also supports over-division, SBC2 will Choose a low resolution of 180*320, and choose a low resolution of 180*320 for the sending direction. SBC2 sends the second session progress response message to SBC1.
  • the first session boundary controller SBC1 processes the second session progress response message to obtain a third session progress response message, and sends the third session progress response message to the first terminal UE1;
  • SBC1 supports super-division, and the resolution for determining the sending direction selects the highest 360*640. Since SBC1 knows that UE1 supports asymmetric capability, SBC1 selects the resolution of its receiving direction as 180*320.
  • This negotiation method eliminates the need for UE1 to negotiate again to determine the resolution of its sending direction, improves the resolution negotiation mechanism of the video call, simplifies the process, and saves the bandwidth resources of the air interface and the bandwidth resources of the bearer network.
  • the embodiment of the present application also provides a method for audio transmission. That is, a low-quality audio is restored to a high-quality audio through data calculation for transmission.
  • An audio transmission method which may include:
  • the first session boundary controller receives the first audio sent by the second session boundary controller
  • the first session boundary controller performs super-division processing on the first audio to obtain a second audio, wherein the bit rate of the second audio is higher than the bit rate of the first audio;
  • the first session boundary controller sends the second audio to the first terminal.
  • the first session boundary controller processes the received low-bit-rate audio, converts it into a high-bit-rate audio, and then sends the high-bit-rate audio to the terminal.
  • This method changes the bandwidth requirements of existing voice calls, changing it from uplink and downlink to using small bandwidth for uplink and large bandwidth for downlink, so that the voice call service model matches the resource division of wireless traffic, which saves resources and at the same time reduces The cost of the operator.
  • the above audio transmission method may also include:
  • the second session boundary controller performs super-division processing on the third audio to obtain a fourth audio, wherein the bit rate of the fourth audio is higher than the bit rate of the third audio;
  • the second session boundary controller sends the fourth audio to the second terminal.
  • AMR codec is a multi-rate codec. Take AMR-WB as an example, which supports 9 speeds. The specific values are as follows: 0: 6.60 kbps, 1: 8.85 kbps, 2: 12.65 kbps, 3: 14.25 kbps, 4: 15.85 kbps, 5: 18.25 kbps, 6: 19.85 kbps, 7: 23.05 kbps, 8: 23.85 kbps.
  • audio-mode-set-R when audio transmission is supported, two fields need to be added: audio-mode-set-R and audio-mode-set-S, which are used to describe the capabilities of the receiving direction and the sending direction.
  • the audio transmission negotiation process may include the following processes:
  • UE1 sends an Invite message to the network, instructing the network to initiate an audio call;
  • SBC1 supports the super-division technology of AMR-WB, and informs the remote end through description in SDP that it supports the increase of 12.65kbps to 23.85kbps in both directions of sending and receiving.
  • SBC1 adjusts the mode-set set in SDP from [0,2,5,8] to the super-divided capability set [2,8];
  • SBC2 also supports AMR-WB super-division technology, which can delete the asymmetry-capability parameter.
  • UE2 selects to inform the network through 183 that it agrees to use the two rates of 12.65 and 23.85.
  • SBC2 knows that SBC1 supports 12.65 and 23,85 super-division capabilities, and SBC2 also supports these two rate super-division capabilities, so SBC2 tells the opposite end to use 12.65 in the receive direction and the transmit direction through the asymmetry-capability parameter.
  • SBC1 deletes the asymmetry-capability parameter from the SDP.
  • SBC1, and SBC2 inform UE1 and UE2 to adjust the sending rate to 12.65 through the CMR indication of RTP, respectively.
  • UE1 can send audio with low codec rate to SBC2 for super-division processing, and then SBC2 can send audio with high codec rate to UE2; and UE2 can send audio with low codec rate to SBC1 performs super-division processing, and then SBC1 can send audio with a high codec rate to UE1.
  • audio transmission negotiation is performed, and audio reception, over-division processing, and transmission are performed based on the negotiated bit rate, which improves the reliability of the audio transmission technology.
  • the first session boundary controller processes the received low bit rate audio, converts it into a high bit rate audio, and then sends the high bit rate audio to the terminal.
  • This method changes the bandwidth requirements of existing voice calls, changing it from uplink and downlink to using small bandwidth for uplink and large bandwidth for downlink, so that the voice call service model matches the resource division of wireless traffic, which saves resources and at the same time reduces The cost of the operator.
  • An embodiment of the present application also provides a video transmission system, including:
  • a receiving module configured to receive the first video picture sent by the second session boundary controller
  • a processing module configured to perform super-division processing on the first video picture to obtain a second video picture, wherein the resolution of the second video picture is higher than the resolution of the first video picture;
  • the sending module is used to send the second video picture to the first terminal.
  • the embodiment of the present application also provides an audio transmission system, including:
  • a receiving module configured to receive the first audio sent by the second session boundary controller
  • a processing module configured to perform super-division processing on the first audio to obtain a second audio, wherein the bit rate of the second audio is higher than the bit rate of the first audio;
  • the sending module is used to send the second audio to the first terminal.
  • the video transmission device 1000 includes a processor 1001 and a memory 1002 coupled to each other.
  • the memory 1002 stores computer program codes.
  • the processor 1001 calls and executes the computer program codes in the memory 1002. , So that the video transmission device 1000 executes the video transmission method provided in the above embodiment.
  • the video transmission device 1000 may be used to perform:
  • the video transmission device 1000 may also be used for:
  • the terminal 1100 includes a display 1101, an input/output interface 1102, a mutually coupled memory 1103, and a processor 1104.
  • the memory 1103 stores computer program codes, and the processor 1104 calls and executes the The computer program code in the memory 1103 enables the terminal 1100 to execute the multimedia data transmission method.
  • the terminal 1100 may be used to execute:
  • the multimedia request message carries asymmetric capability indication information, and the asymmetric capability indication information indicates that the first terminal supports different code rates or resolutions in the receiving direction and the sending direction;
  • the multimedia response message carries an uplink resolution and a downlink resolution or an uplink code rate and a downlink code rate;
  • the multimedia request message is an Invite message, and the asymmetric capability indication information is located in the SDP field of the invite message.
  • the multimedia request message is an RTCP message
  • the RTCP message carries the asymmetric capability indication information
  • the multimedia request message is an SPS message
  • the SPS message carries the asymmetric capability indication information.
  • the network side device receives the multimedia request message of the terminal, and determines the uplink resolution and downlink resolution or uplink bit rate of the multimedia data transmitted by the terminal according to the asymmetric capability indication information carried in the multimedia request message And the downlink code rate, and then the network side device sends a multimedia response message to the terminal, and the multimedia response message carries the uplink resolution and the downlink resolution or the uplink code rate and the downlink code rate of the transmitted multimedia data.
  • the network side device is the first SBC.
  • the multimedia data is audio data or video data.
  • terminal 1100 may also be used to execute:
  • the terminal 1100 may also be used to execute:
  • terminal 1100 may also be used to execute:
  • the terminal 1100 may also be used to execute:
  • the resolution of the video picture of the super-division capability indication information is the resolution of the first video picture.
  • the embodiments of the present application also provide a computer-readable storage medium that stores instructions in the computer-readable storage medium, and when it runs on a computer or a processor, the computer or the processor executes any one of the above methods. Or multiple steps.
  • the embodiments of the present application also provide a computer program product containing instructions.
  • the computer program product runs on a computer or a processor, the computer or the processor is caused to execute one or more steps in any of the foregoing methods.
  • the computer program product includes one or more computer instructions.
  • the computer may be a general-purpose computer, a special-purpose computer, a computer network, or other programmable devices.
  • the computer instructions may be stored in a computer-readable storage medium or transmitted through the computer-readable storage medium.
  • the computer instructions can be sent from a website site, computer, server, or data center to another website site, computer, Server or data center for transmission.
  • the computer-readable storage medium may be any available medium that can be accessed by a computer or a data storage device such as a server or data center integrated with one or more available media.
  • the usable medium may be a magnetic medium (for example, a floppy disk, a hard disk, and a magnetic tape), an optical medium (for example, a DVD), or a semiconductor medium (for example, a solid state disk (SSD)).
  • the process can be completed by a computer program instructing relevant hardware.
  • the program can be stored in a computer readable storage medium. , May include the processes of the above-mentioned method embodiments.
  • the aforementioned storage media include: ROM or random storage RAM, magnetic disks or optical disks and other media that can store program codes.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Databases & Information Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • General Business, Economics & Management (AREA)
  • Computer Security & Cryptography (AREA)
  • Computer Graphics (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)

Abstract

本申请实施例提供一种视频传输方法、系统、相关设备及存储介质,其中,该视频传输方法包括:第一会话边界控制器接收第二会话边界控制器发送的第一视频画面;所述第一会话边界控制器对所述第一视频画面进行超分处理,以得到第二视频画面,其中,所述第二视频画面的分辨率高于所述第一视频画面的分辨率;所述第一会话边界控制器将所述第二视频画面发送至第一终端。通过本申请实施例,第一会话边界控制器通过接收低分辨率的第一视频画面,然后发送高分辨率的第二视频画面给终端。该手段改变了现有视频电话的带宽需求,实现上行使用小带宽、下行使用大带宽,使视频电话业务模型匹配无线流量的资源划分,节省了资源,同时也降低了运营商的成本。

Description

视频传输方法、系统、相关设备及存储介质
本申请要求于2020年3月28日提交中国国家知识产权局、申请号为202010233007.3、发明名称为“视频传输方法、系统、相关设备及存储介质”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。
技术领域
本申请涉及通信技术领域,尤其涉及一种视频传输方法、视频传输系统、视频传输设备及存储介质。
背景技术
移动通信网络已经从3G升级到了4G,4G网络除了上网更快以外,也具备高清语音和视频通话的能力。伴随4G网络的普及,各种APP也得到了蓬勃的发展。各种即时通信APP在不断地蚕食运营商的话音市场。同时面向未来的5G网络,即时通讯APP会推出更具备竞争力的业务体验。
面对当前的竞争和未来更大的风险,运营商需要不断的降低成本和提供更好的业务体验。运营商面向大众两个基本的产品是流量和打电话。流量消费需求一直在快速的增加,已经成为运营商的主要收入来源。而打电话的需求已经趋于饱和。所以运营商在规划无线网络时优先保证流量的规划,其次是打电话。流量和打电话在无线网络中表现又各不相同。其中,流量的特点是上行流量小,下行流量大。一般上行流量为下行流量的10%。以国内运营商推荐的建设标准是上行0.5Mbps,下行5Mbps。打电话的特点是音频电话要求上行24.85Kbps,下行24.85kbps;视频电话要求上行1Mbps,下行1Mbps。
也就是说,当数据域的流量要求达到覆盖标准时,音频电话的要求也同时满足。但是视频电话的要求却无法满足。且目前无线网络的消耗大部分是在下行,运营商会把更多的资源投放在下行上。然而电话的模型却是上下行消耗相同的资源,容易造成大量资源浪费。
发明内容
本申请公开了一种视频传输方法、视频传输系统、视频传输设备及存储介质,可以实现电话业务中上行使用小带宽、下行使用大带宽,使电话业务模型匹配无线流量的资源划分,同时可降低运营商的成本。
第一方面,本申请实施例提供一种视频传输方法,包括:
第一会话边界控制器接收第二会话边界控制器发送的第一视频画面;
所述第一会话边界控制器对所述第一视频画面进行超分处理,以得到第二视频画面,其中,所述第二视频画面的分辨率高于所述第一视频画面的分辨率;
所述第一会话边界控制器将所述第二视频画面发送至第一终端。
通过本申请实施例,第一会话边界控制器通过对接收到的第二会话边界控制器发送的低分辨率的第一视频画面进行处理,将其转化为一个高分辨率的第二视频画面,然后将该高分辨率的第二视频画面发送给第一终端。也就是说,第一会话边界控制器通过接收低分 辨率的第一视频画面,然后发送高分辨率的第二视频画面给终端。该手段改变了现有视频电话的带宽需求,将其由上行与下行一样改为上行使用小带宽、下行使用大带宽,使视频电话业务模型匹配无线流量的资源划分,节省了资源,同时也降低了运营商的成本。
可选的,所述方法还包括:
所述第二会话边界控制器接收所述第一会话边界控制器发送的第三视频画面;
所述第二会话边界控制器对所述第三视频画面进行超分处理,以得到第四视频画面,其中,所述第四视频画面的分辨率高于所述第三视频画面的分辨率;
所述第二会话边界控制器将所述第四视频画面发送至第二终端。
也就是说,第一会话边界控制器和第二会话边界控制器均可支持超分。
可选的,所述方法还包括:
所述第一会话边界控制器接收所述第一终端发送的第三视频画面;
所述第一会话边界控制器将所述第三视频画面发送至所述第二会话边界控制器。
其中,在所述第一会话边界控制器接收第二会话边界控制器发送的第一视频画面之前,所述方法还包括:
所述第一会话边界控制器向所述第一终端发送第一报文,所述第一报文携带所述第一会话边界控制器发送方向使用的视频画面的分辨率,以指示所述第一终端接收所述第一会话边界控制器发送的视频画面,其中,所述第一会话边界控制器发送方向使用的视频画面的分辨率为所述第二视频画面的分辨率。
进一步地,所述方法还包括:
所述第二会话边界控制器向所述第二终端发送携带超分能力指示信息的第二报文,以指示所述第二终端发送满足所述超分能力指示信息的视频画面,所述满足所述超分能力指示信息的视频画面的分辨率为所述第一视频画面的分辨率。
通过本申请实施例,在进行视频传输处理之前,通过进行视频传输协商,进而基于协商得到的分辨率进行视频画面的接收、超分处理以及发送,提高了实现视频传输技术的可靠性。
另一方面,第一会话边界控制器通过对接收到的第二会话边界控制器发送的低分辨率的第一视频画面进行处理,将其转化为一个高分辨率的第二视频画面,然后将该高分辨率的第二视频画面发送给第一终端。该手段改变了现有视频电话的带宽需求,将其由上行与下行一样改为上行使用小带宽、下行使用大带宽,使视频电话业务模型匹配无线流量的资源划分,节省了资源,同时也降低了运营商的成本。
其中,所述第二报文为SIP报文,所述超分能力指示信息位于所述SIP报文的图像属性参数中。
可替代的,所述第二报文为RTCP报文,所述RTCP报文包括名称字段,其中,所述名称字段对应第一预设值时指示所述RTCP报文携带所述超分能力指示信息。
可替代的,所述第二报文为SPS报文,所述超分能力指示信息位于所述SPS报文的参数中。
进一步地,所述第二报文还携带非对称能力指示信息,所述非对称能力指示信息指示所述第一终端在接收方向和发送方向支持不同的分辨率。
该协商方法省去了UE1再次协商以确定其发送方向的分辨率的步骤,完善了视频通话的分辨率协商机制,简化流程,可实现节省空口的带宽资源和承载网的带宽资源。
第二方面,本申请还提供一种音频传输方法,包括:
第一会话边界控制器接收第二会话边界控制器发送的第一音频;
所述第一会话边界控制器对所述第一音频进行超分处理,以得到第二音频,其中,所述第二音频的码率高于所述第一音频的码率;
所述第一会话边界控制器将所述第二音频发送至第一终端。
通过本申请实施例,第一会话边界控制器通过对接收到的低码率的音频进行处理,将其转化为一个高码率的音频,然后将该高码率的音频发送给终端。该手段改变了现有语音通话的带宽需求,将其由上行与下行一样改为上行使用小带宽、下行使用大带宽,使语音通话业务模型匹配无线流量的资源划分,节省了资源,同时也降低了运营商的成本。
上述方法还包括:
所述第二会话边界控制器接收所述第一会话边界控制器发送的第三音频;
所述第二会话边界控制器对所述第三音频进行超分处理,以得到第四音频,其中,所述第四音频的码率高于所述第三音频的码率;
所述第二会话边界控制器将所述第四音频发送至第二终端。
也就是说,第一会话边界控制器和第二会话边界控制器均可以实现超分。
上述方法还包括:
所述第一会话边界控制器接收所述第一终端发送的第三音频;
所述第一会话边界控制器将所述第三音频发送至所述第二会话边界控制器。
进一步地,在所述第一会话边界控制器接收第二会话边界控制器发送的第一音频之前,所述方法还包括:
所述第一会话边界控制器向所述第一终端发送第一报文,所述第一报文携带所述第一会话边界控制器发送方向使用的音频的码率,以指示所述第一终端接收所述第一会话边界控制器发送的音频,其中,所述第一会话边界控制器发送方向使用的音频的码率为所述第二音频的码率。
所述方法还包括:
所述第二会话边界控制器向所述第二终端发送携带超分能力指示信息的第二报文,以指示所述第二终端发送满足所述超分能力指示信息的音频,所述满足所述超分能力指示信息的音频的码率为所述第一音频的码率。
通过本申请实施例,在进行音频传输处理之前,通过进行音频传输协商,进而基于协商得到的码率进行音频的接收、超分处理以及发送,提高了实现音频传输技术的可靠性。
另一方面,第一会话边界控制器通过对接收到的低码率的音频进行处理,将其转化为一个高码率的音频,然后将该高码率的音频发送给终端。该手段改变了现有语音通话的带宽需求,将其由上行与下行一样改为上行使用小带宽、下行使用大带宽,使语音通话业务模型匹配无线流量的资源划分,节省了资源,同时也降低了运营商的成本。
可选的,所述第二报文为SIP报文,所述超分能力指示信息位于所述SIP报文中。
可替代的,所述第二报文为RTCP报文,所述RTCP报文包括名称字段,其中,所述名 称字段对应第一预设值时指示所述RTCP报文携带所述超分能力指示信息。
可替代的,所述第二报文为SPS报文,所述超分能力指示信息位于所述SPS报文的参数中。
进一步地,所述第二报文还携带非对称能力指示信息,所述非对称能力指示信息指示所述第一终端在接收方向和发送方向支持不同的码率。
通过本申请实施例,在进行音频传输处理之前,通过进行音频传输协商,进而基于协商得到的码率进行音频的接收、超分处理以及发送,提高了实现音频传输技术的可靠性。同时,该协商方法省去了UE1再次协商以确定其发送方向的分辨率的步骤,完善了音频通话的编解码速度协商机制,简化流程,可实现节省空口的带宽资源和承载网的带宽资源。
另一方面,第一会话边界控制器通过对接收到的低码率的音频进行处理,将其转化为一个高码率的音频,然后将该高码率的音频发送给终端。该手段改变了现有语音通话的带宽需求,将其由上行与下行一样改为上行使用小带宽、下行使用大带宽,使语音通话业务模型匹配无线流量的资源划分,节省了资源,同时也降低了运营商的成本。
第三方面,本申请实施例提供一种多媒体数据传输方法,主要包括:
终端向网络侧设备发送多媒体请求消息,所述多媒体请求消息中携带非对称能力指示信息,所述非对称能力指示信息指示所述第一终端在接收方向和发送方向支持不同的码率或分辨率;
所述终端接收所述网络侧设备发送的多媒体响应消息,所述多媒体响应消息中携带上行分辨率和下行分辨率或上行码率和下行码率;
所述终端使用所述上行分辨率或上行码率发送上行多媒体数据,接收使用所述下行分辨率或下行码率传输的下行多媒体数据。
在一个可能的方案中,所述多媒体请求消息为Invite消息,所述非对称能力指示信息位于所述invite消息的SDP字段中。
在一个可能的方案中,所述多媒体请求消息为RTCP报文,所述RTCP报文中携带所述非对称能力指示信息。
在一个可能的方案中,所述多媒体请求消息为SPS报文,所述SPS报文中携带所述非对称能力指示信息。
在一个可能的方案中,所述网络侧设备接收终端的多媒体请求消息,根据多媒体请求消息中携带的非对称能力指示信息确定所述终端传输多媒体数据的上行分辨率和下行分辨率或上行码率和下行码率,然后网络侧设备向终端发送多媒体响应消息,所述多媒体响应消息中携带传输多媒体数据的上行分辨率和下行分辨率或上行码率和下行码率。
在一个可能的方案中,所述网络侧设备为第一SBC。
在一个可能的方案中,所述多媒体数据为音频数据或视频数据。
第四方面,本申请还提供一种视频传输方法,包括:
第一终端接收第一会话边界控制器发送的第二视频画面,其中,所述第二视频画面是所述第一会话边界控制器对第一视频画面进行超分处理得到的,所述第一视频画面为第二会话边界控制器发送至所述第一会话边界控制器的,其中,所述第二视频画面的分辨率高于所述第一视频画面的分辨率。
还包括:
第二终端接收所述第二会话边界控制器发送的第四视频画面,其中,所述第四视频画面为所述第二会话边界控制器对第三视频画面进行超分处理得到的,所述第三视频画面为所述第一会话边界控制器发送至所述第二会话边界控制器的,其中,所述第四视频画面的分辨率高于所述第三视频画面的分辨率。
其中,在所述第一终端接收第一会话边界控制器发送的第二视频画面之前,所述方法还包括:
所述第一终端接收所述第一会话边界控制器发送的第一报文,所述第一报文携带所述第一会话边界控制器发送方向使用的视频画面的分辨率;以便所述第一终端接收所述第一会话边界控制器发送的视频画面,其中,所述第一会话边界控制器发送方向使用的视频画面的分辨率为所述第二视频画面的分辨率。
所述方法还包括:
所述第二终端接收所述第二会话边界控制器发送的携带超分能力指示信息的第二报文;
所述第二终端向所述第二会话边界控制器发送满足所述超分能力指示信息的视频画面,所述满足所述超分能力指示信息的视频画面的分辨率为所述第一视频画面的分辨率。
第五方面,本申请还提供一种视频传输系统,包括第一会话边界控制器和第二会话边界控制器,其中,所述第一会话边界控制器用于:
接收第二会话边界控制器发送的第一视频画面;
对所述第一视频画面进行超分处理,以得到第二视频画面,其中,所述第二视频画面的分辨率高于所述第一视频画面的分辨率;
将所述第二视频画面发送至第一终端。
所述第二会话边界控制器用于:
接收所述第一会话边界控制器发送的第三视频画面;
对所述第三视频画面进行超分处理,以得到第四视频画面,其中,所述第四视频画面的分辨率高于所述第三视频画面的分辨率;
将所述第四视频画面发送至第二终端。
所述第一会话边界控制器还用于:
接收所述第一终端发送的第三视频画面;
将所述第三视频画面发送至所述第二会话边界控制器。
所述第一会话边界控制器还用于:
向所述第一终端发送第一报文,所述第一报文携带所述第一会话边界控制器发送方向使用的视频画面的分辨率,以指示所述第一终端接收所述第一会话边界控制器发送的视频画面,其中,所述第一会话边界控制器发送方向使用的视频画面的分辨率为所述第二视频画面的分辨率。
其中,所述第二会话边界控制器还用于:
向所述第二终端发送携带超分能力指示信息的第二报文,以指示所述第二终端发送满足所述超分能力指示信息的视频画面,所述满足所述超分能力指示信息的视频画面的分辨 率为所述第一视频画面的分辨率。
第六方面,本申请还提供一种音频传输系统,包括第一会话边界控制器和第二会话边界控制器,其中,所述第一会话边界控制器用于:
接收第二会话边界控制器发送的第一音频;
对所述第一音频进行超分处理,以得到第二音频,其中,所述第二音频的码率高于所述第一音频的码率;
将所述第二音频发送至第一终端。
其中,所述第二会话边界控制器用于:
接收所述第一会话边界控制器发送的第三音频;
对所述第三音频进行超分处理,以得到第四音频,其中,所述第四音频的码率高于所述第三音频的码率;
将所述第四音频发送至第二终端。
其中,所述第一会话边界控制器还用于:
接收所述第一终端发送的第三音频;
将所述第三音频发送至所述第二会话边界控制器。
其中,所述第一会话边界控制器还用于:
向所述第一终端发送第一报文,所述第一报文携带所述第一会话边界控制器发送方向使用的音频的码率,以指示所述第一终端接收所述第一会话边界控制器发送的音频,其中,所述第一会话边界控制器发送方向使用的音频的码率为所述第二音频的码率。
其中,所述第二会话边界控制器还用于:
向所述第二终端发送携带超分能力指示信息的第二报文,以指示所述第二终端发送满足所述超分能力指示信息的音频,所述满足所述超分能力指示信息的音频的码率为所述第一音频的码率。
第七方面,本申请提供了一种传输设备,所述视频传输设备用于如第一方面任一种可能的实施方式和/或第二方面任一种可能的实施方式提供的方法和/或第三方面任一种可能的实施方式提供的方法和/或第四方面任一种可能的实施方式提供的方法。
第八方面,本申请提供了一种计算机存储介质,包括计算机指令,当所述计算机指令在电子设备上运行时,使得所述电子设备执行如第一方面任一种可能的实施方式和/或第二方面任一种可能的实施方式提供的方法和/或第三方面任一种可能的实施方式提供的方法和/或第四方面任一种可能的实施方式提供的方法。
第九方面,本申请实施例提供一种计算机程序产品,当计算机程序产品在计算机上运行时,使得计算机执行如第一方面任一种可能的实施方式和/或第二方面任一种可能的实施方式提供的方法和/或第三方面任一种可能的实施方式提供的方法和/或第四方面任一种可能的实施方式提供的方法。
第十方面,本申请实施例提供一种视频传输设备,其包括相互耦合的存储器和处理器,所述存储器中存储了计算机程序代码,所述处理器调用并执行所述存储器中的计算机程序代码,使得所述视频传输设备执行如第一方面和/或第二方面所述的视频/音频传输方法。
第十一方面,本申请实施例提供一种终端,其包括显示器、输入/输出接口、相互耦合 的存储器和处理器,所述存储器中存储了计算机程序代码,所述处理器调用并执行所述存储器中的计算机程序代码,使得所述终端执行如第三方面和/或第四方面所述的多媒体数据传输方法。
可以理解地,上述提供的第五方面所述的装置、第六方面所述的装置、第七方面所述的装置、第八方面所述的计算机存储介质、第九方面所述的计算机程序产品、第十方面所述的设备以及第十一方面所述的终端均用于执行第一方面中任一所提供的方法以及第二方面中任一所提供的方法以及第三方面中任一所提供的方法以及第四方面中任一所提供的方法。因此,其所能达到的有益效果可参考对应方法中的有益效果,此处不再赘述。
附图说明
下面对本申请实施例用到的附图进行介绍。
图1是本申请实施例提供的一种视频传输系统的示意图;
图2是本申请实施例提供的一种视频传输方法的流程示意图;
图3是本申请实施例提供的一种报文的结构示意图;
图4是本申请实施例提供的一种报文的结构示意图;
图5是本申请实施例提供的一种视频传输协商处理的流程示意图;
图6是本申请实施例提供的另一种视频传输协商处理的流程示意图;
图7是本申请实施例提供的再一种视频传输协商处理的流程示意图;
图8是本申请实施例提供的又一种视频传输协商处理的流程示意图;
图9是本申请实施例提供的一种音频传输协商处理的流程示意图;
图10是本申请实施例提供的一种视频传输设备的结构示意图;
图11是本申请实施例提供的一种终端的结构示意图。
具体实施方式
下面结合本申请实施例中的附图对本申请实施例进行描述。本申请实施例的实施方式部分使用的术语仅用于对本申请的具体实施例进行解释,而非旨在限定本申请。
参照图1,是本申请实施例提供的一种视频传输系统的示意图。该视频传输系统可包括第一终端UE1、第二终端UE2、第一会话边界控制器SBC1和第二会话边界控制器SBC2。其中,第一会话边界控制器SBC1可用于接收第二会话边界控制器发送的视频画面,并将其发送给第一终端UE1。第二会话边界控制器SBC2可用于接收第一会话边界控制器发送的视频画面,并将其发送给第二终端UE2。
进一步地,第一会话边界控制器SBC1还可用于接收第一终端UE1发送的视频会话请求,并将该视频会话请求发送给第二会话边界控制器SBC2。其中,第二会话边界控制器SBC2可用于接收第一会话边界控制器SBC1发送的视频会话请求,并将其发送给第二终端UE2。上述第二会话边界控制器SBC2还可以接收第二终端UE2发送的消息,并将其转发给第一会话边界控制器SBC1。第一会话边界控制器SBC1可用于接收第二会话边界控制器SBC2发送的消息并将其转发给第一终端UE1。
其中,上述SBC1和SBC2可以是同一个SBC。例如当UE1和UE2在同一个cell小区或同一个城市。
参照图2所示,为本申请实施例提供的一种视频传输方法的流程示意图。该视频传输方法包括201-203,具体如下:
201、第一会话边界控制器接收第二会话边界控制器发送的第一视频画面;
该视频传输方法可应用于视频传输系统,其中,该视频传输系统可包括第一终端UE1、第一会话边界控制器SBC1、第二会话边界控制器SBC2和第二终端UE2。
具体地,第二终端UE2向所述第二会话边界控制器SBC2发送第一视频画面,由所述第二会话边界控制器SBC2将所述第一视频画面转发至所述第一会话边界控制器SBC1。
202、所述第一会话边界控制器对所述第一视频画面进行超分处理,以得到第二视频画面,其中,所述第二视频画面的分辨率高于所述第一视频画面的分辨率;
其中,所述第一会话边界控制器支持超分,也就是说,第一会话边界控制器可以实现将接收到的视频画面的分辨率提高。
超分,即一个低分辨率的视频通过数学运算被修复成一个高分辨率的视频,或者一个低音质的音频通过数学运算被修复成一个高音质的音频。
上述第一会话边界控制器通过对低分辨率的第一视频画面进行处理,得到高分辨率的第二视频画面。
203、所述第一会话边界控制器将所述第二视频画面发送至第一终端。
其中,该视频传输方法可应用于视频通话场景中。
通过本申请实施例,第一会话边界控制器通过对接收到的第二会话边界控制器发送的低分辨率的第一视频画面进行处理,将其转化为一个高分辨率的第二视频画面,然后将该高分辨率的第二视频画面发送给第一终端。也就是说,第一会话边界控制器通过接收低分辨率的第一视频画面,然后发送高分辨率的第二视频画面给终端。该手段改变了现有视频电话的带宽需求,将其由上行与下行一样改为上行使用小带宽、下行使用大带宽,使视频电话业务模型匹配无线流量的资源划分,节省了资源,同时也降低了运营商的成本。
其中,本申请实施例的第一会话边界控制器和第二会话边界控制器可以是相同的设备。此时,第一会话边界控制器可以接收第二终端发送的第一视频画面。
上述以第一会话边界控制器进行超分处理进行说明。进一步地,第二会话边界控制器也可以支持超分处理,所述方法还可包括:
所述第二会话边界控制器接收所述第一会话边界控制器发送的第三视频画面;
所述第二会话边界控制器对所述第三视频画面进行超分处理,以得到第四视频画面,其中,所述第四视频画面的分辨率高于所述第三视频画面的分辨率;
所述第二会话边界控制器将所述第四视频画面发送至第二终端。
其中,UE1将第三视频画面发送给第一会话边界控制器,由所述第一会话边界控制器将所述第三视频画面发送给所述第二会话边界控制器。其中,所述第二会话边界控制器支持超分,其对所述第三视频画面进行超分处理得到第四视频画面,然后所述第二会话边界控制器将所述第四视频画面发送至UE2,以此完成超分处理。
进一步地,所述方法还可包括:
所述第一会话边界控制器接收所述第一终端发送的第三视频画面;
所述第一会话边界控制器将所述第三视频画面发送至所述第二会话边界控制器。
该实施例中UE1发送的视频画面可经由第一会话边界控制器直接转发到第二会话边界控制器,进而第二会话边界控制器可进行超分处理。
作为上述视频传输方法的一种具体的实现方式,其中,在执行图2所示的方法之前,还包括进行视频传输协商处理,进而确定上述第一视频画面的分辨率、上述第二视频画面的分辨率。本申请实施例提供的一种视频传输方法的流程示意图。该视频传输方法包括A1-A5,具体如下:
A1、第一会话边界控制器向第一终端发送第一报文,所述第一报文携带所述第一会话边界控制器发送方向使用的视频画面的分辨率,以指示所述第一终端接收所述第一会话边界控制器发送的视频画面,其中,所述第一会话边界控制器发送方向使用的视频画面的分辨率为所述第二视频画面的分辨率;
A2、第二会话边界控制器向第二终端发送携带超分能力指示信息的第二报文,以指示所述第二终端发送满足所述超分能力指示信息的视频画面,所述满足所述超分能力指示信息的视频画面的分辨率为第一视频画面的分辨率;
上述超分能力指示信息可以为所述第一会话边界控制器支持进行超分处理的信息,如可包括所述第一会话边界控制器在接收方向支持将接收到的分辨率为180*320的视频画面转化为360*640的视频画面。还可以包括所述第一会话边界控制器在发送方向支持将分辨率为180*320的视频画面转化为360*640的视频画面等。
该携带超分能力指示信息的第二报文可以是第二会话边界控制器直接转发第一会话边界控制器发送给第二会话边界控制器的报文,该超分能力指示信息为上述第一会话边界控制器所支持进行超分处理的信息。
通过步骤A1-A2即完成了视频传输协商处理,其中确定了第二会话边界控制器发送给第一会话边界控制器的视频画面的分辨率,以及第一会话边界控制器发送给第一终端的视频画面的分辨率。
基于此,在步骤A3-A5中进而可实现视频传输。
A3、所述第一会话边界控制器接收所述第二会话边界控制器发送的所述第一视频画面;
具体地,第二终端UE2向所述第二会话边界控制器SBC2发送第一视频画面,由所述第二会话边界控制器SBC2将所述第一视频画面转发至所述第一会话边界控制器SBC1。
A4、所述第一会话边界控制器对所述第一视频画面进行超分处理,以得到所述第二视频画面,其中,所述第二视频画面的分辨率高于所述第一视频画面的分辨率;
其中,所述第一会话边界控制器支持超分,也就是说,第一会话边界控制器可以实现将接收到的视频画面的分辨率提高。
上述第一会话边界控制器通过对低分辨率的第一视频画面进行处理,得到高分辨率的第二视频画面。
A5、所述第一会话边界控制器将所述第二视频画面发送至所述第一终端。
通过本申请实施例,在进行视频传输处理之前,通过进行视频传输协商,进而基于协商得到的分辨率进行视频画面的接收、超分处理以及发送,提高了实现视频传输技术的可靠性。
另一方面,第一会话边界控制器通过对接收到的第二会话边界控制器发送的低分辨率的第一视频画面进行处理,将其转化为一个高分辨率的第二视频画面,然后将该高分辨率的第二视频画面发送给第一终端。该手段改变了现有视频电话的带宽需求,将其由上行与下行一样改为上行使用小带宽、下行使用大带宽,使视频电话业务模型匹配无线流量的资源划分,节省了资源,同时也降低了运营商的成本。
其中,上述进行视频传输协商处理的报文可以为会话初始协议(session initiation protocol,SIP)报文。如所述第二报文为SIP报文,所述超分能力指示信息可位于SIP报文的图像属性参数中。其中,imageattr参数格式如下:
a=imageattr:PT send[x,y,sar,par,q]…/recv[x,y,sar,par,q]…
如表1所示,通过在imageattr参数中加入放大编组标识Ampid,该放大编组标识Ampid用于指示当前分辨率的超分能力编组。也就是说,该放大编组标识Ampid即为所述超分能力指示信息。
表1
Figure PCTCN2021074539-appb-000001
其中,当多个分辨率在同一个编组时,表明支持将低分辨率超分到同组的高分辨率。
如当UE或者SBC支持在接收方向将视频画面的分辨率从320*180升级到640*360,也支持从320*240升级到640*480时,则描述如下:
a=imageattr:PT send***/recv[180,320,ampid=1][360,640,ampid=1][240,320,ampid=2][480,640,ampid=2]
当UE或者SBC支持在接收方向将视频画面的分辨率从320*180升级到640*360,并支持640*360升级到1280*720,则描述如下:
a=imageattr:PT send***/recv[180,320,ampid=1][360,640,ampid=1,2][720,1280,ampid=2]
本申请实施例通过在SIP报文的图像属性imageattr参数中新增放大编组标识Ampid,进而指示超分能力指示信息,进而可实现在信令面完成协商。
可替代的,上述协商也可以采用在媒体面协商的方式进行。上述进行视频传输协商处理的报文可以为RTCP报文。如所述第二报文为实时传输协议(Realtime Transport Control Protocol,RTCP)报文,所述RTCP报文包括名称字段,其中,所述名称字段对应第一预设值时指示所述RTCP报文携带所述超分能力指示信息。具体地,如图3所示,所述RTCP报文包括子类型字段、名称字段、PT字段,其中,所述PT字段取值204、所述子类型字段取值1、所述名称字段为change-image-size时,用于指示所述RTCP报文还携带超分能力指示信息。上述各取值仅为一种示例,其并不限定具体的取值。如图4所示,通过获取报文中的Application-dependentdata信息,以得到上述超分能力指示信息。该申请实施例通过扩展信令面的RTCP报文,可实现在媒体面完成协商。
可替代的,上述进行视频传输协商处理的报文可以为序列参数集(seq_parameter_set,SPS)报文。如所述第二报文为SPS报文,所述超分能力指示信息位于所述SPS报文的参数中。其中,H264编解码中SPS参数会携带画面的分辨率。本申请实施例通过扩展SPS参数,进而可携带超分能力指示信息。
其中,该扩展的参数为S_pic_width_in_mbs_minus1、S_pic_height_in_map_units_minus1。上述两个参数分别表示以宏块为单位的画面宽度、以宏块为单位的画面高度。其用于告知对端,本端当前接收到的画面可以根据该分辨率进行超分合成。该实施例通过对SPS参数进行扩展,进而实现携带超分能力指示信息。
下面具体以超分能力指示信息位于SIP报文的图像属性参数中为例进行说明视频传输协商处理的具体过程。
作为一种实现形式,当仅主叫侧的会话边界控制器SBC1支持超分时,如图5所示,该视频传输协商处理包括步骤501-506,具体如下:
501,用户终端UE1向第一会话边界控制器SBC1发送报文以发起视频会话请求。
其中,该会话请求可以为Invite消息,其携带UE1在接收方向所支持的视频画面的分辨率以及UE1在发送方向所支持的视频画面的分辨率。如图3所示,UE1在接收方向所支持的所有分辨率包括180*320、360*640和480*640,UE1在发送方向所支持的所有分辨率包括180*320和360*640。
502,第一会话边界控制器SBC1接收到上述Invite消息后,根据所述Invite消息中携带的UE1所支持的分辨率以及SBC1自身所支持的分辨率,对所述Invite消息进行处理得到更新后的Invite消息,SBC1将所述更新后的Invite消息通过第二会话边界控制器SBC2发送至第二终端UE2。
其中,第一会话边界控制器SBC1支持超分,即支持将接收到的视频画面的分辨率提高。上述对所述Invite消息进行处理包括在所述Invite消息中加入所述第一会话边界控制器支持的超分能力指示信息。如图3所示,SBC1将所述更新后的报文通过第二会话边界控制器SBC2发送至第二终端UE2,该更新后的报文用于告知被叫侧SBC1在接收方向支持将180*320超分到360*640。
503,第二终端UE2向所述第二会话边界控制器SBC2发送第一会话进度应答消息。
其中,UE2根据报文中指示的SBC1在接收方向支持将180*320超分到360*640做出应答。其中,该第一会话进度应答消息携带UE2在发送方向支持低分辨率180*320,接收方向支持180*320和360*640。
504,第二会话边界控制器SBC2接收到所述第一会话进度应答消息后,将所述第一会话进度应答消息转发至所述第一会话边界控制器SBC1。
其中,第二会话边界控制器SBC2接收到所述第一会话进度应答消息后,根据所述第二报文可得知SBC1支持超分,由于SBC2不支持超分,因此,SBC2对第一会话进度应答消息不做处理,直接转发至SBC1。
505,所述第一会话边界控制器SBC1对所述第一会话进度应答消息进行处理得到第二会话进度应答消息,并将所述第二会话进度应答消息发送至所述第一终端UE1。
其中,第一会话边界控制器SBC1接收到所述第一会话进度应答消息后,得知UE2在发送方向支持低分辨率180*320,接收方向支持180*320和360*640。SBC1确定其发送方向的分辨率选择最高的360*640,由于SBC1不能确定UE1发送过来的视频画面的分辨率是哪一个,因此SBC1将其接收方向的分辨率调整为180*320、360*640。SBC1通过调整第一会话进度应答消息中下行方向的分辨率进而得到第二会话进度应答消息。
506,所述第一终端UE1根据所述第二会话进度应答消息向UE2发送第三报文以完成所述视频传输协商处理。
其中,由于SBC1不能确定UE1发送过来的视频画面的分辨率是哪一个,因此,UE1根据第二会话进度应答消息中的接收方向的分辨率做出选择,其中,UE1选择低分辨率180*320进行发送。UE1通过发送update消息以此完成上述视频传输协商处理。
其中,该视频传输协商处理得到的结果即为:UE1发送低分辨率180*320的视频画面至SBC1,以便SBC1对该180*320的视频画面进行超分处理得到360*640的视频画面,进而通过SBC2发送给UE2。该视频传输协商处理得到的结果还包括:UE2通过SBC2发送低分辨率180*320的视频画面至SBC1,以便SBC1对该180*320的视频画面进行超分处理得到360*640的视频画面,进而发送给UE1。
也就是说,通过上述视频传输协商处理,即确定了进行超分处理的视频画面的分辨率。进而可进行超分处理。
作为另一种实现形式,当仅被叫侧的会话边界控制器SBC2支持超分时,如图6所示,该视频传输协商处理包括步骤601-606,具体如下:
601,用户终端UE1通过第一会话边界控制器SBC1向第二会话边界控制器SBC2发送视频会话请求。该会话请求可以为Invite消息,携带UE1在接收方向所支持的视频画面的分辨率以及UE1在发送方向所支持的视频画面的分辨率,其中,如图6所示,UE1在接收方向所支持的分辨率包括180*320、360*640和480*640,UE1在发送方向所支持的分辨率包括180*320和360*640。
其中,SBC1不支持超分,其可对报文不作调整,直接转发至第二会话边界控制器SBC2。
602,所述第二会话边界控制器SBC2接收到上述Invite消息后,对其进行处理得到更 新后的Invite消息,并将所述更新后的Invite消息发送至UE2。
其中,第二会话边界控制器SBC2支持超分。因此,SBC2在上述Invite消息中加入所述SBC2支持的超分能力指示信息。具体地,如图6所示,SBC2通过更新报文,以指示UE2其接收方向的分辨率优先180*320。
603,第二终端UE2向所述第二会话边界控制器SBC2发送第一会话进度应答消息。
其中,UE2根据Invite消息中指示的SBC2在接收方向支持优先180*320做出应答。该第一会话进度应答消息携带UE2在发送方向支持低分辨率180*320,接收方向支持180*320和360*640。
604,第二会话边界控制器SBC2接收到所述第一会话进度应答消息后,对所述第一会话进度应答消息进行处理得到第二会话进度应答消息,并将第二会话进度应答消息发送至所述第一会话边界控制器SBC1。
其中,第二会话边界控制器SBC2接收到所述第一会话进度应答消息后,根据UE1发送的报文可得知SBC1不支持超分,由于SBC2支持超分,因此,SBC2调整第一会话进度应答消息中发送方向为高分辨率360*640,其中,接收方向的分辨率为180*320和360*640。
605,所述第一会话边界控制器SBC1将所述第二会话进度应答消息进行处理得到第三会话进度应答消息,并发送至所述第一终端UE1。
其中,第一会话边界控制器SBC1接收到所述第二会话进度应答消息后,得知SBC2在发送方向支持高分辨率360*640。SBC1确定其向UE1的发送方向的分辨率为高分辨率360*640。由于SBC1不能确定UE1发送过来的视频画面的分辨率是哪一个,因此SBC1将其接收方向的分辨率选择为180*320、360*640。
606,所述第一终端UE1向UE2发送完成协商消息以完成所述视频传输协商处理。
其中,UE1选择低分辨率180*320进行发送。UE1通过发送update消息以此完成上述视频传输协商处理。
上述协商的结果即为UE1发送低分辨率的视频画面至SBC2,SBC2可发送高分辨率的视频画面至UE2。且,UE2也可以发送低分辨率的视频画面至SBC2,SBC2可发送高分辨率的视频画面至UE1。
作为再一种实现形式,当主叫侧和被叫侧的会话边界控制器均支持超分时,如图7所示,该视频传输协商处理包括步骤701-706,具体如下:
701,用户终端UE1向第一会话边界控制器SBC1发送视频会话请求。
该会话请求可以为Invite消息,携带UE1在接收方向所支持的视频画面的分辨率以及UE1在发送方向所支持的视频画面的分辨率,其中,如图7所示,UE1在接收方向所支持的分辨率包括180*320、360*640和480*640,UE1在发送方向所支持的分辨率包括180*320和360*640。
702,第一会话边界控制器SBC1接收到上述Invite消息后,根据所述Invite消息中携带的UE1所支持的分辨率以及SBC1自身所支持的分辨率,对所述Invite消息进行处理得到更新后的Invite消息,SBC1将所述更新后的Invite消息通过第二会话边界控制器SBC2发送至第二终端UE2;
其中,第一会话边界控制器SBC1支持超分。上述对所述Invite消息进行处理包括在所述Invite消息中加入所述第一会话边界控制器支持的超分能力指示信息。SBC1将所述更新后的Invite消息通过第二会话边界控制器SBC2发送至第二终端UE2。
703,第二终端UE2向所述第二会话边界控制器SBC2发送第一会话进度应答消息;
其中,UE2根据Invite消息中指示的SBC1在接收方向支持将180*320超分到360*640做出应答。其中,该第一会话进度应答消息携带UE2在发送方向支持低分辨率180*320,接收方向支持180*320和360*640。
704,第二会话边界控制器SBC2接收到所述第一会话进度应答消息后,对所述第一会话进度应答消息进行处理得到第二会话进度应答消息,并转发至所述第一会话边界控制器SBC1;
其中,第二会话边界控制器SBC2接收到所述第一会话进度应答消息后,根据更新后的报文可得知SBC1支持超分,由于SBC2也支持超分,因此,SBC2将其发送方向的分辨率选择低的180*320,接收方向为180*320和360*640SBC2将第二会话进度应答消息发送至SBC1。
705,所述第一会话边界控制器SBC1对所述第二会话进度应答消息进行处理得到第三会话进度应答消息,并将所述第三会话进度应答消息发送至所述第一终端UE1;
其中,SBC1支持超分,其确定发送方向的分辨率选择最高的360*640,由于SBC1不能确定UE1发送过来的视频画面的分辨率是哪一个,因此SBC1将其接收方向的分辨率选择为180*320、360*640。
706,所述第一终端UE1向UE2发送update消息以完成所述视频传输协商处理。
其中,UE1选择低分辨率180*320进行发送。UE1通过发送update消息以此完成上述视频传输协商处理。
上述协商的结果即为UE1发送低分辨率的视频画面至SBC2,SBC2可发送高分辨率的视频画面至UE2。且,UE2可以发送低分辨率的视频画面至SBC1,SBC1可发送高分辨率的视频画面至UE2。
其中,上述实施例仅以会话边界控制器支持超分为例进行说明。可组合的,终端侧也可以支持超分,如UE1支持超分或者UE2支持超分,或者UE1、UE2均支持超分等。本方案对此不做具体限定。
现有技术中没有定义说明作为通话双方的主叫侧如何告知对端,主叫侧是否在收发两个方向上使用不同的分辨率。本申请定义了可选的再次协商过程。也就是说,UE1和UE2之间在进行协商时,UE1发送两次协商,才能确定其发送方向的分辨率。如图5中的步骤506、图6中的步骤606以及图7中的步骤706,其均为再次协商过程。如果采用该协商方法,该第二次协商会增加信令面的消耗。
基于此,本申请实施例还提供一种协商方法,其基于非对称能力指示信息进行协商,所述非对称能力指示信息指示如第一终端在接收方向和发送方向支持不同的分辨率,或者在接收方向和发送方向支持不同的速率等。如表2所示,当UE支持不对称的video-codec 或者video-profile时,就意味着该UE支持上下行使用不同的分辨率。当UE支持不对称的audio-codec时,就意味着该UE支持上下行使用不同的速率。
表2
Figure PCTCN2021074539-appb-000002
具体地,当UE具备接收方向和发送方向使用不同分辨率的能力时,则该非对称能力指示信息可表示为:a=asymmetry-capability:video-size;
当UE支持接收方向和发送支持使用不同的分辨率和不同的音频编解码速率时,则该非对称能力指示信息可表示为:a=asymmetry-capability:video-size&audio-mode-set-R/S。
其中,可通过在SIP报文的SDP中添加上述非对称能力指示信息。
具体地,基于上述非对称能力指示信息,则视频传输协商处理的步骤可进行简化。以主叫侧和被叫侧的会话边界控制器均支持超分为例,如图8所示,该视频传输协商处理包括步骤801-805,具体如下:
801,用户终端UE1向第一会话边界控制器SBC1发送报文以发起视频会话请求,该报文携带UE1在接收方向所支持的所有视频画面的分辨率以及UE1在发送方向所支持的所有视频画面的分辨率,还携带非对称能力指示信息,指示UE1具备接收方向和发送方向使用不同分辨率。
UE1初始Invite消息中通过SDP中的a=asymmetry-capability:video-size,告知SBC1和对端其支持在收发两个方向上使用不同的分辨率。
802,第一会话边界控制器SBC1接收到上述报文后,根据所述报文中携带的UE1所支持的分辨率以及SBC1自身所支持的分辨率,对所述报文进行处理得到更新后的报文,SBC1将所述更新后的报文通过第二会话边界控制器SBC2发送至第二终端UE2;
其中,第一会话边界控制器SBC1支持超分。上述对所述报文进行处理包括在所述报文中加入所述第一会话边界控制器支持的超分能力指示信息。SBC1将所述更新后的报文通过第二会话边界控制器SBC2发送至第二终端UE2。
803,第二终端UE2向所述第二会话边界控制器SBC2发送第一会话进度应答消息;
其中,UE2根据接收报文中recv参数判断其发送的能力,由于推荐是使用180*320,所以183消息中的send方向选择180*320这个分辨率。此外由于UE2也支持非对称的能力,其也知道UE1支持非对称能力,所以UE2在接收方向选择了360*640。由于UE2不支持超分能力,所以其没有在183消息中携带ampid参数。
804,第二会话边界控制器SBC2接收到所述第一会话进度应答消息后,对所述第一会话进度应答消息进行处理得到第二会话进度应答消息,并转发至所述第一会话边界控制器SBC1;
其中,第二会话边界控制器SBC2接收到所述第一会话进度应答消息后,根据更新后的 报文可得知SBC1支持超分,由于SBC2也支持超分,因此,SBC2将其接收方向的分辨率选择低的180*320,发送方向的分辨率也选择低的180*320。SBC2将第二会话进度应答消息发送至SBC1。
805,所述第一会话边界控制器SBC1对所述第二会话进度应答消息进行处理得到第三会话进度应答消息,并将所述第三会话进度应答消息发送至所述第一终端UE1;
其中,SBC1支持超分,其确定发送方向的分辨率选择最高的360*640,由于SBC1得知UE1支持非对称能力,因此SBC1将其接收方向的分辨率选择为180*320。
该协商方法省去了UE1再次协商以确定其发送方向的分辨率的步骤,完善了视频通话的分辨率协商机制,简化流程,节省了空口的带宽资源和承载网的带宽资源。
上述各实施例以视频传输为例进行说明。其中,本申请实施例还提供一种音频传输的方法。即将一个低音质的音频通过数据运算修复成一个高音质的音频进行传输。
一种音频传输方法,可包括:
第一会话边界控制器接收第二会话边界控制器发送的第一音频;
所述第一会话边界控制器对所述第一音频进行超分处理,以得到第二音频,其中,所述第二音频的码率高于所述第一音频的码率;
所述第一会话边界控制器将所述第二音频发送至第一终端。
通过本申请实施例,第一会话边界控制器通过对接收到的低码率的音频进行处理,将其转化为一个高码率的音频,然后将该高码率的音频发送给终端。该手段改变了现有语音通话的带宽需求,将其由上行与下行一样改为上行使用小带宽、下行使用大带宽,使语音通话业务模型匹配无线流量的资源划分,节省了资源,同时也降低了运营商的成本。
上述音频传输方法,还可以包括:
所述第二会话边界控制器接收所述第一会话边界控制器发送的第三音频;
所述第二会话边界控制器对所述第三音频进行超分处理,以得到第四音频,其中,所述第四音频的码率高于所述第三音频的码率;
所述第二会话边界控制器将所述第四音频发送至第二终端。
其中,该音频传输方法可参阅图1所述的视频传输方法的相关介绍,在此不再赘述。
在进行上述音频传输处理之前,还包括音频传输协商处理。
其中,AMR编解码是一个多速率编解码。以AMR-WB为例,其支持9种速率。具体值如下:0:6.60kbps、1:8.85kbps、2:12.65kbps、3:14.25kbps、4:15.85kbps、5:18.25kbps、6:19.85kbps、7:23.05kbps、8:23.85kbps。
在使用AMR编解码时,可以通过mode-set参数指定使用上述九个速率中的哪些。如表述使用6.60kbps、12.65kbps、18.25kbps和23.85kbps,则表示如下:
a=rtpmap:97AMR-WB
a=fmtp:97mode-set=0,2,5,8;
其中,当支持音频传输时,则需要增加两个字段:audio-mode-set-R和audio-mode-set-S,用于描述接收方向和发送方向的能力。
如:a=fmtp:97mode-set=0,2,5,8;asymmetry-capability:audio-mode-set-R[2,8], audio-mode-set-S[2,8]
则表示在两个方向上均支持从12.65kbps、23.85kbps之间的速率/码率超分。
下面以主叫侧和被叫侧的会话边界控制器均支持超分为例,如图9所示,该音频传输协商处理可包括以下流程:
901、UE1发送Invite报文给网络,指示网络其要发起一个音频呼叫;
902、SBC1支持AMR-WB的超分技术,在SDP中通过描述告知远端,其在收发两个方向都支持将12.65kbps提升到23.85kbps。SBC1将SDP中mode-set集合从[0,2,5,8]调整到超分的能力集合[2,8];
903、SBC2也支持AMR-WB的超分技术,其可以将asymmetry-capability参数删除。
904、UE2通过183选择告知网络,其同意使用12.65和23.85这两个速率。
905、SBC2知道SBC1支持12.65和23,85的超分能力,SBC2也支持这两个速率的超分,所以SBC2通过asymmetry-capability参数告知对端在接收方向和发送方向使用12.65。UE2应答的mode-set=2,8也跟随183消息告知SBC1。
906、由于UE1没有携带asymmetry-capability参数,所以SBC1将asymmetry-capability参数从SDP中删除。
907、SBC1和SBC2分别通过RTP的CMR指示告知UE1和UE2将发送速率调整到12.65。
通过上述音频传输协商,得出:UE1可发送低编解码速率的音频至SBC2进行超分处理,进而SBC2可发送高编解码速率的音频至UE2;且,UE2可发送低编解码速率的音频至SBC1进行超分处理,进而SBC1可发送高编解码速率的音频至UE1。
通过本申请实施例,在进行音频传输处理之前,通过进行音频传输协商,进而基于协商得到的码率进行音频的接收、超分处理以及发送,提高了实现音频传输技术的可靠性。
另一方面,第一会话边界控制器通过对接收到的低码率的音频进行处理,将其转化为一个高码率的音频,然后将该高码率的音频发送给终端。该手段改变了现有语音通话的带宽需求,将其由上行与下行一样改为上行使用小带宽、下行使用大带宽,使语音通话业务模型匹配无线流量的资源划分,节省了资源,同时也降低了运营商的成本。
本申请实施例还提供一种视频传输系统,包括:
接收模块,用于接收第二会话边界控制器发送的第一视频画面;
处理模块,用于对所述第一视频画面进行超分处理,以得到第二视频画面,其中,所述第二视频画面的分辨率高于所述第一视频画面的分辨率;
发送模块,用于将所述第二视频画面发送至第一终端。
本申请实施例还提供一种音频传输系统,包括:
接收模块,用于接收第二会话边界控制器发送的第一音频;
处理模块,用于对所述第一音频进行超分处理,以得到第二音频,其中,所述第二音频的码率高于所述第一音频的码率;
发送模块,用于将所述第二音频发送至第一终端。
参照图10所示,为本申请实施例提供的一种视频传输设备的结构示意图。如图10所 示,该视频传输设备1000包括相互耦合的处理器1001和存储器1002,所述存储器1002中存储了计算机程序代码,所述处理器1001调用并执行所述存储器1002中的计算机程序代码,使得所述视频传输设备1000执行以上实施例所提供的视频传输方法。
具体地,所述视频传输设备1000可用于执行:
接收第二会话边界控制器发送的第一视频画面;对所述第一视频画面进行超分处理,以得到第二视频画面,其中,所述第二视频画面的分辨率高于所述第一视频画面的分辨率;将所述第二视频画面发送至第一终端。
所述视频传输设备1000还可用于:
接收第二会话边界控制器发送的第一音频;对所述第一音频进行超分处理,以得到第二音频,其中,所述第二音频的码率高于所述第一音频的码率;将所述第二音频发送至第一终端。
参照图11所示,为本申请实施例提供的一种终端的结构示意图。如图11所示,该终端1100包括显示器1101、输入/输出接口1102、相互耦合的存储器1103和处理器1104,所述存储器1103中存储了计算机程序代码,所述处理器1104调用并执行所述存储器1103中的计算机程序代码,使得所述终端1100执行所述的多媒体数据传输方法。
具体地,所述终端1100可用于执行:
向网络侧设备发送多媒体请求消息,所述多媒体请求消息中携带非对称能力指示信息,所述非对称能力指示信息指示所述第一终端在接收方向和发送方向支持不同的码率或分辨率;
接收所述网络侧设备发送的多媒体响应消息,所述多媒体响应消息中携带上行分辨率和下行分辨率或上行码率和下行码率;
使用所述上行分辨率或上行码率发送上行多媒体数据,接收使用所述下行分辨率或下行码率传输的下行多媒体数据。
其中,所述多媒体请求消息为Invite消息,所述非对称能力指示信息位于所述invite消息的SDP字段中。
其中,所述多媒体请求消息为RTCP报文,所述RTCP报文中携带所述非对称能力指示信息。
在一个可能的方案中,所述多媒体请求消息为SPS报文,所述SPS报文中携带所述非对称能力指示信息。
在一个可能的方案中,所述网络侧设备接收终端的多媒体请求消息,根据多媒体请求消息中携带的非对称能力指示信息确定所述终端传输多媒体数据的上行分辨率和下行分辨率或上行码率和下行码率,然后网络侧设备向终端发送多媒体响应消息,所述多媒体响应消息中携带传输多媒体数据的上行分辨率和下行分辨率或上行码率和下行码率。
在一个可能的方案中,所述网络侧设备为第一SBC。
在一个可能的方案中,所述多媒体数据为音频数据或视频数据。
进一步地,所述终端1100还可用于执行:
接收第一会话边界控制器发送的第二视频画面,其中,所述第二视频画面是所述第一会话边界控制器对第一视频画面进行超分处理得到的,所述第一视频画面为第二会话边界 控制器发送至所述第一会话边界控制器的,其中,所述第二视频画面的分辨率高于所述第一视频画面的分辨率。
所述终端1100还可用于执行:
接收所述第二会话边界控制器发送的第四视频画面,其中,所述第四视频画面为所述第二会话边界控制器对第三视频画面进行超分处理得到的,所述第三视频画面为所述第一会话边界控制器发送至所述第二会话边界控制器的,其中,所述第四视频画面的分辨率高于所述第三视频画面的分辨率。
进一步地,所述终端1100还可用于执行:
接收所述第一会话边界控制器发送的第一报文,所述第一报文携带所述第一会话边界控制器发送方向使用的视频画面的分辨率;以便所述第一终端接收所述第一会话边界控制器发送的视频画面,其中,所述第一会话边界控制器发送方向使用的视频画面的分辨率为所述第二视频画面的分辨率。
所述终端1100还可用于执行:
接收所述第二会话边界控制器发送的携带超分能力指示信息的第二报文;向所述第二会话边界控制器发送满足所述超分能力指示信息的视频画面,所述满足所述超分能力指示信息的视频画面的分辨率为所述第一视频画面的分辨率。
本申请实施例还提供了一种计算机可读存储介质,该计算机可读存储介质中存储有指令,当其在计算机或处理器上运行时,使得计算机或处理器执行上述任一个方法中的一个或多个步骤。
本申请实施例还提供了一种包含指令的计算机程序产品。当该计算机程序产品在计算机或处理器上运行时,使得计算机或处理器执行上述任一个方法中的一个或多个步骤。
在上述实施例中,可以全部或部分地通过软件、硬件、固件或者其任意组合来实现。当使用软件实现时,可以全部或部分地以计算机程序产品的形式实现。所述计算机程序产品包括一个或多个计算机指令。在计算机上加载和执行所述计算机程序指令时,全部或部分地产生按照本申请实施例所述的流程或功能。所述计算机可以是通用计算机、专用计算机、计算机网络、或者其他可编程装置。所述计算机指令可以存储在计算机可读存储介质中,或者通过所述计算机可读存储介质进行传输。所述计算机指令可以从一个网站站点、计算机、服务器或数据中心通过有线(例如同轴电缆、光纤、数字用户线)或无线(例如红外、无线、微波等)方式向另一个网站站点、计算机、服务器或数据中心进行传输。所述计算机可读存储介质可以是计算机能够存取的任何可用介质或者是包含一个或多个可用介质集成的服务器、数据中心等数据存储设备。所述可用介质可以是磁性介质,(例如,软盘、硬盘、磁带)、光介质(例如,DVD)、或者半导体介质(例如,固态硬盘(solid state disk,SSD))等。
本领域普通技术人员可以理解实现上述实施例方法中的全部或部分流程,该流程可以由计算机程序来指令相关的硬件完成,该程序可存储于计算机可读取存储介质中,该程序在执行时,可包括如上述各方法实施例的流程。而前述的存储介质包括:ROM或随机存储记忆体RAM、磁碟或者光盘等各种可存储程序代码的介质。
以上所述,仅为本申请实施例的具体实施方式,但本申请实施例的保护范围并不局限 于此,任何在本申请实施例揭露的技术范围内的变化或替换,都应涵盖在本申请实施例的保护范围之内。因此,本申请实施例的保护范围应以所述权利要求的保护范围为准。

Claims (12)

  1. 一种视频传输方法,其特征在于,包括:
    第一会话边界控制器接收第二会话边界控制器发送的第一视频画面;
    所述第一会话边界控制器对所述第一视频画面进行超分处理,以得到第二视频画面,其中,所述第二视频画面的分辨率高于所述第一视频画面的分辨率;
    所述第一会话边界控制器将所述第二视频画面发送至第一终端。
  2. 根据权利要求1所述的方法,其特征在于,还包括:
    所述第二会话边界控制器接收所述第一会话边界控制器发送的第三视频画面;
    所述第二会话边界控制器对所述第三视频画面进行超分处理,以得到第四视频画面,其中,所述第四视频画面的分辨率高于所述第三视频画面的分辨率;
    所述第二会话边界控制器将所述第四视频画面发送至第二终端。
  3. 根据权利要求1或2所述的方法,其特征在于,还包括:
    所述第一会话边界控制器接收所述第一终端发送的第三视频画面;
    所述第一会话边界控制器将所述第三视频画面发送至所述第二会话边界控制器。
  4. 根据权利要求1至3任一项所述的方法,其特征在于,在所述第一会话边界控制器接收第二会话边界控制器发送的第一视频画面之前,所述方法还包括:
    所述第一会话边界控制器向所述第一终端发送第一报文,所述第一报文携带所述第一会话边界控制器发送方向使用的视频画面的分辨率,以指示所述第一终端接收所述第一会话边界控制器发送的视频画面,其中,所述第一会话边界控制器发送方向使用的视频画面的分辨率为所述第二视频画面的分辨率。
  5. 根据权利要求4所述的方法,其特征在于,所述方法还包括:
    所述第二会话边界控制器向所述第二终端发送携带超分能力指示信息的第二报文,以指示所述第二终端发送满足所述超分能力指示信息的视频画面,所述满足所述超分能力指示信息的视频画面的分辨率为所述第一视频画面的分辨率。
  6. 根据权利要求5所述的方法,其特征在于,所述第二报文为SIP报文,所述超分能力指示信息位于所述SIP报文的图像属性参数中。
  7. 根据权利要求5所述的方法,其特征在于,所述第二报文为RTCP报文,所述RTCP报文包括名称字段,其中,所述名称字段对应第一预设值时指示所述RTCP报文携带所述超分能力指示信息。
  8. 根据权利要求5所述的方法,其特征在于,所述第二报文为SPS报文,所述超分能力指示信息位于所述SPS报文的参数中。
  9. 根据权利要求5所述的方法,其特征在于,所述第二报文还携带非对称能力指示信息,所述非对称能力指示信息指示所述第一终端在接收方向和发送方向支持不同的分辨率。
  10. 一种视频传输设备,其特征在于,包括相互耦合的存储器和处理器,所述存储器中存储了计算机程序代码,所述处理器调用并执行所述存储器中的计算机程序代码,使得所述视频传输设备执行如权利要求1、3和4任一项所述的视频传输方法。
  11. 一种视频传输系统,其特征在于,包括第一会话边界控制器和第二会话边界控制 器,其中,所述第一会话边界控制器用于:
    接收第二会话边界控制器发送的第一视频画面;
    对所述第一视频画面进行超分处理,以得到第二视频画面,其中,所述第二视频画面的分辨率高于所述第一视频画面的分辨率;
    将所述第二视频画面发送至第一终端。
  12. 一种计算机可读存储介质,其特征在于,所述计算机可读存储介质存储有计算机程序,所述计算机程序被处理器执行以实现权利要求1至9任意一项所述的方法。
PCT/CN2021/074539 2020-03-28 2021-01-30 视频传输方法、系统、相关设备及存储介质 WO2021196860A1 (zh)

Priority Applications (1)

Application Number Priority Date Filing Date Title
EP21779370.2A EP4114026A4 (en) 2020-03-28 2021-01-30 VIDEO TRANSMISSION METHOD AND SYSTEM AND RELATED DEVICE AND STORAGE MEDIA

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202010233007.3 2020-03-28
CN202010233007.3A CN113453081A (zh) 2020-03-28 2020-03-28 视频传输方法、系统、相关设备及存储介质

Publications (1)

Publication Number Publication Date
WO2021196860A1 true WO2021196860A1 (zh) 2021-10-07

Family

ID=77808135

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2021/074539 WO2021196860A1 (zh) 2020-03-28 2021-01-30 视频传输方法、系统、相关设备及存储介质

Country Status (3)

Country Link
EP (1) EP4114026A4 (zh)
CN (1) CN113453081A (zh)
WO (1) WO2021196860A1 (zh)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114900717A (zh) * 2022-05-13 2022-08-12 杭州网易智企科技有限公司 视频数据传输方法、装置、介质和计算设备

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104144313A (zh) * 2013-05-10 2014-11-12 中国电信股份有限公司 视频通信方法、系统与云端图像服务器
US20160212383A1 (en) * 2013-03-04 2016-07-21 Cisco Technology, Inc. Video Conference Virtual Endpoints
CN106254354A (zh) * 2016-08-10 2016-12-21 东北大学 一种非对称媒体参数的sdp协商方法
CN107888866A (zh) * 2016-09-30 2018-04-06 中兴通讯股份有限公司 多方视频会议控制方法、装置及会议视频服务器

Family Cites Families (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7692683B2 (en) * 2004-10-15 2010-04-06 Lifesize Communications, Inc. Video conferencing system transcoder
CN101119270A (zh) * 2007-09-14 2008-02-06 华中科技大学 网络边界处理方法
US8914462B2 (en) * 2009-04-14 2014-12-16 Lg Electronics Inc. Terminal and controlling method thereof
CN103096011B (zh) * 2011-11-08 2016-09-28 中国电信股份有限公司 视频通信协商方法、系统及视频终端
KR20140001477A (ko) * 2012-06-27 2014-01-07 삼성전자주식회사 영상통화의 효율적 세션 교섭을 위한 장치 및 방법
EP2884742B1 (en) * 2013-12-12 2016-09-07 Alcatel Lucent Process for increasing the resolution and the visual quality of video streams exchanged between users of a video conference service
CN103825912A (zh) * 2014-03-24 2014-05-28 联想(北京)有限公司 一种数据传输方法、电子设备及服务器
US10771736B2 (en) * 2014-06-30 2020-09-08 Microsoft Technology Licensing, Llc Compositing and transmitting contextual information during an audio or video call
WO2018012615A1 (ja) * 2016-07-14 2018-01-18 日本電信電話株式会社 通信方法、通信装置、通信システム及び通信プログラム
US20200036765A1 (en) * 2016-10-14 2020-01-30 Samsung Electronics Co., Ltd. Method and device for transmitting content
KR102656425B1 (ko) * 2016-12-07 2024-04-12 삼성전자주식회사 영상을 표시하는 전자 장치 및 방법
CN108990169B (zh) * 2017-09-08 2022-01-21 西安中兴新软件有限责任公司 一种数据传输、资源共享方法及用户终端、接入点设备
WO2019059721A1 (ko) * 2017-09-21 2019-03-28 에스케이텔레콤 주식회사 해상도 향상 기법을 이용한 영상의 부호화 및 복호화
CN110086746A (zh) * 2018-01-25 2019-08-02 中兴通讯股份有限公司 融合设备媒体控制方法、装置、会话控制器及通信终端
US11463757B2 (en) * 2018-09-27 2022-10-04 Intel Corporation Media streaming for receiver-enabled resolution

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20160212383A1 (en) * 2013-03-04 2016-07-21 Cisco Technology, Inc. Video Conference Virtual Endpoints
CN104144313A (zh) * 2013-05-10 2014-11-12 中国电信股份有限公司 视频通信方法、系统与云端图像服务器
CN106254354A (zh) * 2016-08-10 2016-12-21 东北大学 一种非对称媒体参数的sdp协商方法
CN107888866A (zh) * 2016-09-30 2018-04-06 中兴通讯股份有限公司 多方视频会议控制方法、装置及会议视频服务器

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114900717A (zh) * 2022-05-13 2022-08-12 杭州网易智企科技有限公司 视频数据传输方法、装置、介质和计算设备
CN114900717B (zh) * 2022-05-13 2023-09-26 杭州网易智企科技有限公司 视频数据传输方法、装置、介质和计算设备

Also Published As

Publication number Publication date
EP4114026A4 (en) 2023-08-30
CN113453081A (zh) 2021-09-28
EP4114026A1 (en) 2023-01-04

Similar Documents

Publication Publication Date Title
WO2022095795A1 (zh) 通信方法、装置、计算机可读介质及电子设备
US11799922B2 (en) Network core facilitating terminal interoperation
CN111147893B (zh) 一种视频自适应方法、相关设备以及存储介质
US20220131912A1 (en) Call processing method and device
CN101755454B (zh) 确定移动视频电话之间的首选图像格式的方法和设备
WO2012075937A1 (zh) 视频通话方法及可视电话
US9826072B1 (en) Network-terminal interoperation using compatible payloads
WO2020048386A1 (zh) 交互信息传输方法及装置
CN111095879A (zh) 在实时文本消息中交换非文本内容
WO2017054286A1 (zh) 一种语音通话承载方法及装置
WO2023035894A1 (zh) 一种数据处理方法、设备、可读存储介质和程序产品
CN109831490A (zh) 业务访问方法及系统
US9800625B2 (en) Video calling method and device based on voice over long-term evolution and video over long-term evolution, and mobile terminal
WO2021196860A1 (zh) 视频传输方法、系统、相关设备及存储介质
CN110012363B (zh) 一种基于sip协议的视频聊天系统
WO2021218653A1 (zh) 媒体资源传输方法、相关装置及系统
WO2014166366A1 (zh) 一种在长期演进集群网络中进行能力协商的方法及装置
WO2020192435A1 (zh) 一种播放多媒体彩振、彩铃的方法、应用服务器
CN112333414A (zh) 视频通话方法、装置、电子设备及可读存储介质
WO2022007619A1 (zh) 视频传输方法、设备、系统及存储介质
CN113973230A (zh) 一种呼叫处理方法、装置及系统
CN112398854B (zh) 用于确定业务通信的编解码模式集的方法和装置
EP4358591A1 (en) Data transmission method and related device
CN108809911A (zh) VoLTE网络中实现二次拨号的方法、装置及存储介质
WO2022262294A1 (zh) 视频流传输方法以及相关通信装置

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 21779370

Country of ref document: EP

Kind code of ref document: A1

ENP Entry into the national phase

Ref document number: 2021779370

Country of ref document: EP

Effective date: 20220929

NENP Non-entry into the national phase

Ref country code: DE