WO2017121299A1 - 一种调整媒体流传输的方法及装置 - Google Patents

一种调整媒体流传输的方法及装置 Download PDF

Info

Publication number
WO2017121299A1
WO2017121299A1 PCT/CN2017/070652 CN2017070652W WO2017121299A1 WO 2017121299 A1 WO2017121299 A1 WO 2017121299A1 CN 2017070652 W CN2017070652 W CN 2017070652W WO 2017121299 A1 WO2017121299 A1 WO 2017121299A1
Authority
WO
WIPO (PCT)
Prior art keywords
volume
audio stream
control device
central control
sending
Prior art date
Application number
PCT/CN2017/070652
Other languages
English (en)
French (fr)
Inventor
刘艳
Original Assignee
华为技术有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 华为技术有限公司 filed Critical 华为技术有限公司
Publication of WO2017121299A1 publication Critical patent/WO2017121299A1/zh

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00Television systems
    • H04N7/14Systems for two-way working
    • H04N7/15Conference systems
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L65/00Network arrangements, protocols or services for supporting real-time applications in data packet communication
    • H04L65/60Network streaming of media packets
    • H04L65/65Network streaming protocols, e.g. real-time transport protocol [RTP] or real-time control protocol [RTCP]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L65/00Network arrangements, protocols or services for supporting real-time applications in data packet communication
    • H04L65/60Network streaming of media packets
    • H04L65/75Media network packet handling
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L65/00Network arrangements, protocols or services for supporting real-time applications in data packet communication
    • H04L65/60Network streaming of media packets
    • H04L65/75Media network packet handling
    • H04L65/762Media network packet handling at the source 

Definitions

  • the present invention relates to the field of communications, and in particular, to a method and apparatus for adjusting media stream transmission.
  • a multi-point video conferencing system allows a conference system in which three or more senders of different locations participate simultaneously.
  • the system also includes a central control device.
  • the sending end sends the media stream to the central control device through the network, where the media stream includes a video stream and an audio stream, and the central control device is responsible for receiving the media stream sent by each sending end through the network.
  • the determining unit determines One video stream or multiple video streams are combined and broadcasted into one video stream for reception by other stations.
  • the central control device sends the audio stream of the transmitter with a large volume to other senders, and at the same time, makes a policy selection for the video related to its audio.
  • the center control device will still receive a lower volume audio stream, but the lower volume audio stream will not be mixed to other senders, so that the lower volume audio stream still occupies the sender and the center. Controlling processing resources between devices causes waste of processing resources between the transmitting end and the central control device.
  • the Internet Engineering Task Force (English full name: internet engineering task force, English abbreviation: IETF) specifies the request for comment (English full name: request for comments, English abbreviation: RFC).
  • RFC3264 is based on the session description protocol (English name: session description protocol, English abbreviation: SDP) can be used to control media stream pause or recovery.
  • the session re-negotiation is initiated by either side of the session on the signaling plane, and the m-stream corresponding to the corresponding media stream to be suspended or restored is deactivated to implement the media stream in one direction or two directions. Send control.
  • the media stream corresponding to the channel of the media path will perform corresponding control, that is, pause.
  • the above method consumes too much signaling, and is not suitable for the situation where the multipoint site and the volume change dynamically.
  • An object of the present invention is to provide a method and apparatus for adjusting media stream transmission, which can adaptively control media stream pause according to volume, thereby improving utilization of processing resources between a transmitting end and a central control device.
  • a method for adjusting media stream transmission including:
  • the central control device acquires real-time transmission protocol RTP packets sent by the N transmitting ends, each of the RTP packets includes the volume of the audio stream, and N is a natural number greater than or equal to 2.
  • the current mainstream central control device has multiple points of control. Unit (English name: Multi-point Control Unit, English abbreviation: MCU), the sender is also the end of multimedia communication
  • the end device for example, may be a video conferencing terminal or a desktop video terminal;
  • the central control device determines a first volume threshold according to the volume of the N audio streams; and determines an audio stream whose volume is less than or equal to the first volume threshold;
  • the central control device sends a real-time transmission control protocol RTCP packet including a pause indication to each of the transmitting ends corresponding to the X audio streams whose volume is less than or equal to the first volume threshold, where X is greater than or equal to 1 And a natural number less than N, the pause indication is used to indicate that the sender corresponding to the X audio streams pauses to send an audio stream to the central control device, where the sender is a multimedia communication terminal device, for example, may be a video conference terminal or a desktop. Video terminal, etc.
  • the central control device determines the first volume threshold according to the volume of the N audio streams, and determines that the volume is less than or equal to An audio stream of the first volume threshold is sent to each of the transmitting ends corresponding to the X audio streams whose volume is less than or equal to the first volume threshold, and each of the transmitting ends sends an RTCP packet including a pause indication, indicating that each transmitting end pauses to the center control The device sends an audio stream.
  • the central control device determines the audio stream of the mix according to the volume of the audio stream reported by the sender, and further sends a pause indication to the sender that does not need to mix, so that the sender that does not need the mix pauses to send the audio stream to the central control device. It can effectively improve the utilization of processing resources between the sender and the central control device.
  • the central control device determines the first volume threshold according to the volume of the N audio streams.
  • the following two methods may be used:
  • the central control device sorts the volume of the N audio streams from largest to smallest; and then, from the audio streams sorted according to the volume of the N audio streams from large to small.
  • the first M road from the first road to the M road is determined as the M channel audio stream, M is a natural number greater than or equal to 1 and less than N, M represents the number of mixed channels, and M is less than or equal to the preset mixing path
  • the audio stream is a mixed audio stream; finally, the volume between the volume of the audio stream of the Mth channel and the volume of the audio stream of the M+1th channel is determined as the first volume threshold.
  • the specific method for determining the volume threshold dynamically by the central control device provided by the first implementation manner of the first aspect of the foregoing aspect can accurately determine the volume of the audio stream reported by the transmitting end in a more real-time manner.
  • the central control device determines the first volume threshold based on an average of the volume of the N audio streams.
  • the second implementation manner of the foregoing first aspect provides a specific method for the central control device to statically determine the volume threshold, so that the volume of the audio stream reported by the sender is faster.
  • the RTCP including the pause indication further includes a threshold, the threshold is used to indicate the first volume threshold, so that any one of the sending ends corresponding to the audio stream that receives the RTCP packet including the pause indication is monitored in real time according to the first volume threshold.
  • the volume of the audio stream when the volume of the audio stream of any of the transmitting ends is greater than the first volume threshold, sending an RTCP message including the recovery request to the central control device.
  • the RTCP message including the pause indication further includes a threshold type.
  • the third implementable manner of the foregoing first aspect provides the specific content included in the RTCP packet, so that the sender receives the content, stores the content, and determines whether the center is required according to the volume of the updated audio stream.
  • the control device sends an audio stream.
  • the RTCP message including the pause indication further includes a remaining maximum number, where the remaining maximum number is used to indicate the number of channels that can also be mixed, and the remaining may also be mixed.
  • the number of the roads is LM
  • L is the number of preset mixing channels
  • M is the number of mixed channels.
  • the RTCP including the pause indication
  • the message further includes whether the video stream is associated with the video stream, and the associated video stream is used to indicate that any one of the sending ends corresponding to the X audio streams pauses to send the audio stream to the central control device while suspending the control to the center.
  • the device transmits a video stream associated with the audio stream.
  • the fifth implementation manner of the foregoing first aspect provides specific content included in the RTCP message, so that the sender receives the content, stores the content, and pauses to send the video stream associated with the audio stream to the central control device. Improve the utilization of processing resources between the sender and the central control device.
  • the central control device is The method further includes: after each of the sending ends corresponding to the audio stream whose volume is less than or equal to the first volume threshold is sent by the sending end, the method further includes:
  • the central control device receives an RTCP packet including a pause response sent by each of the sending ends corresponding to the X audio streams, where the pause response is used to indicate each of the sending ends corresponding to the X audio streams.
  • the end has paused sending audio streams to the central control device.
  • the volume is less than the volume of the central control device
  • the central control device receives an RTCP packet that is sent by the first sending end and includes a recovery request, where the recovery request includes a volume of the audio stream that is updated by the first sending end, and the recovery request is used to request the central control device to indicate a transmitting end sends an audio stream to the central control device, where the first sending end is any one of the sending ends corresponding to the X audio streams;
  • the central control device determines whether the volume of the audio stream updated by the first transmitting end is greater than the volume of the audio stream of any of the mixed channels
  • the central control device sends according to the volume of the audio stream updated by the first sending end and N-1
  • the volume of the audio stream of the terminal determines a second volume threshold
  • the central control device determines an audio stream whose volume is less than or equal to the second volume threshold
  • the central control device sends an RTCP packet including a pause indication to each of the sending ends corresponding to the Y audio streams whose volume is less than or equal to the second volume threshold, where Y is greater than or equal to 1 and less than N. Natural number.
  • the central control device may determine, according to the volume of the audio stream carried by the recovery request sent by the sending end, whether to resume the sending of the audio stream, thereby improving the relationship between the transmitting end and the central control device. Handle resource utilization.
  • the central control device After receiving the RTCP packet that is sent by the first sending end and including the recovery request, the method further includes:
  • the central control device sends an RTCP packet including a recovery response to the first sending end, where the RTCP packet including the recovery response further includes
  • the number of the mixed channels may also be the difference between the preset number of mixing channels and the number of mixed channels, and the recovery response is used by the central control device to instruct the first transmitting end to be centrally controlled.
  • the device sends an audio stream;
  • the determining, by the central control device, whether the volume of the audio stream updated by the first sending end is greater than the volume of the audio stream of any one of the mixed channels includes:
  • the center control device determines whether the volume of the audio stream updated by the first transmitting end is greater than the volume of the audio stream of any of the mixed channels. .
  • the central control device is The method further includes: after each of the sending ends corresponding to the audio stream whose volume is less than or equal to the first volume threshold is sent by the sending end, the method further includes:
  • the central control device sends an RTCP packet including an update message to each of the sending ends corresponding to the X audio streams, where the update message includes a third volume threshold and a number of channels that can also be mixed.
  • the number of mixable paths is the difference between the number of preset mixes and the number of mixed channels.
  • a method for adjusting media stream transmission including:
  • the sending end sends a real-time transmission protocol RTP message to the central control device, where the RTP message includes the volume of the audio stream, and then receives the real-time transmission control protocol RTCP message sent by the central control device, including the suspension indication,
  • the pause indication is used to instruct the sender to suspend sending the audio stream to the central control device;
  • the sending end sends an RTCP message including a pause response to the central control device, where the pause response is used to indicate that the sending end has suspended sending the audio stream to the central control device.
  • the sending end sends the volume of the audio stream to the central control device, so that the central control device receives the volume of the audio stream reported by the N transmitting ends, according to the volume of the N audio streams. Determining a first volume threshold, and determining an audio stream whose volume is less than or equal to the first volume threshold, and transmitting, by the transmitting end corresponding to the X audio streams whose volume is less than or equal to the first volume threshold, a pause indication An RTCP message indicating that each sender pauses to send an audio stream to the central control device.
  • the central control device determines the audio stream of the mix according to the volume of the audio stream reported by the sender, and further sends a pause indication to the sender that does not need to mix, so that the sender that does not need the mix pauses to send the audio stream to the central control device. It can effectively improve the utilization of processing resources between the sender and the central control device.
  • the RTCP packet that includes the pause indication further includes a first volume threshold
  • the method further includes:
  • the sending end saves the first volume threshold
  • the sending end monitors the volume of the audio stream of the sending end
  • the sending end determines that the monitored volume of the audio stream updated by the sending end is greater than the first volume threshold
  • an RTCP packet including a recovery request to the central control device Sending, by the sending end, an RTCP packet including a recovery request to the central control device, where the recovery request includes a volume of the audio stream updated by the sending end, and the recovery request is used to request the central control device to indicate the sending end to the center
  • the control device sends an audio stream
  • the sending end receives the RTCP packet that is sent by the central control device and includes a recovery response, where the recovery response is used to instruct the sending end to send the audio stream to the central control device.
  • the method further includes:
  • the sending end receives an RTCP packet that is sent by the central control device and includes an update message, where the update message includes a third volume threshold and a number of channels that can also be mixed;
  • the sending end saves the third volume threshold
  • the sending end monitors the volume of the audio stream of the sending end
  • the sending end determines that the monitored volume of the audio stream updated by the sending end is greater than the third volume threshold
  • an RTCP packet including a recovery request to the central control device Sending, by the sending end, an RTCP packet including a recovery request to the central control device, where the recovery request includes a volume of the audio stream updated by the sending end, and the recovery request is used to request the central control device to indicate the sending end to the center
  • the control device sends an audio stream.
  • the including The RTCP message of the pause indication further includes a number of channels that can be mixed
  • the RTCP message including the recovery response further includes a number of channels that can also be mixed
  • the update message further includes a number of channels that can also be mixed
  • the The number of mixing channels is the difference between the preset number of mixing channels and the number of mixed channels
  • the transmitting end determines that the number of the mixable channels is greater than 0;
  • the sending end sends an RTCP packet including a recovery request to the central control device.
  • the including The RTCP message of the pause indication further includes whether the video stream is associated, and the associated video stream is used to instruct the sender to pause sending the audio stream to the central control device while suspending sending the audio stream to the central control device. Video stream.
  • the third aspect provides a central control device, including: a receiving unit, configured to receive an RTP message or an RTCP message sent by the sending end, and a processing unit, configured to process the received RTP message or the RTCP message, and send the unit Used to send RTCP packets to the sender.
  • a receiving unit configured to receive an RTP message or an RTCP message sent by the sending end
  • a processing unit configured to process the received RTP message or the RTCP message, and send the unit Used to send RTCP packets to the sender.
  • the fourth aspect provides a sending end, comprising: a receiving unit, configured to receive an RTCP message sent by the central control device, a processing unit, configured to process the received RTCP message, and send, send, to the central control device RTCP packet or RTP packet.
  • a sending end comprising: a receiving unit, configured to receive an RTCP message sent by the central control device, a processing unit, configured to process the received RTCP message, and send, send, to the central control device RTCP packet or RTP packet.
  • the functional modules described in the foregoing third and fourth aspects may be implemented by hardware, or may be implemented by hardware.
  • the hardware or software includes one or more modules corresponding to the functions described above.
  • a communication interface for performing functions of a receiving unit and a transmitting unit, a processor for completing a function of the processing unit, and a memory for storing a volume threshold.
  • the processor, communication interface, and memory are connected by a bus and communicate with each other.
  • the function of adjusting the behavior of the central control device in the method of media stream transmission provided by the first aspect, and the function of adjusting the behavior of the sender in the method of media stream transmission provided by the second aspect may be referred to.
  • the names of the central control device and the transmitting end are not limited to the devices themselves, and in actual implementation, these devices may appear under other names. As long as the functions of the respective devices are similar to the present invention, they are within the scope of the claims and the equivalents thereof.
  • FIG. 1 is a schematic diagram of a multipoint video conference system according to an embodiment of the present invention.
  • FIG. 2 is a schematic flow chart of an audio stream according to an embodiment of the present invention.
  • FIG. 3 is a schematic diagram of a flow of a video stream according to an embodiment of the present disclosure
  • FIG. 4 is a schematic structural diagram of a computer hardware according to an embodiment of the present invention.
  • FIG. 5 is a flowchart of a method for adjusting media stream transmission according to an embodiment of the present invention.
  • FIG. 6 is a flowchart of another method for adjusting media stream transmission according to an embodiment of the present invention.
  • FIG. 7 is a flowchart of still another method for adjusting media stream transmission according to an embodiment of the present invention.
  • FIG. 8 is a flowchart of still another method for adjusting media stream transmission according to an embodiment of the present invention.
  • FIG. 9 is a flowchart of still another method for adjusting media stream transmission according to an embodiment of the present invention.
  • FIG. 10 is a schematic structural diagram of an RTCP packet according to an embodiment of the present disclosure.
  • FIG. 11 is a schematic structural diagram of a center control device according to an embodiment of the present invention.
  • FIG. 12 is a schematic structural diagram of a transmitting end according to an embodiment of the present invention.
  • the basic principle of the present invention is that the central control device still receives the audio stream with a lower volume after the mixing, resulting in waste of processing resources between the transmitting end and the central control device, and the central control device reports the signal according to the transmitting end.
  • the volume of the audio stream determines a volume threshold, and determines an audio stream whose volume is less than or equal to a volume threshold, and transmits a real-time transmission control including a pause indication to each of the transmitting ends corresponding to the X audio streams whose volume is less than or equal to the volume threshold.
  • Protocol English name: Real-time Transport Control Protocol, English abbreviation: RTCP
  • the pause indication is used to instruct the sender to pause sending audio streams to the central control device.
  • the central control device determines the audio stream of the sound mixing according to the volume of the audio stream reported by the transmitting end, and further sends a pause indication to the transmitting end that does not need to mix, which can effectively improve the utilization of processing resources between the transmitting end and the central control device.
  • the embodiment of the present invention provides a schematic diagram of a multipoint video conference system.
  • the system includes: a central control device, a network, a transmitting end 1, a transmitting end 2, a transmitting end 3, a transmitting end 4, and a transmitting end 5.
  • the central control device and the sender are respectively connected to the network.
  • the five senders can be located at different venues, such as Shenzhen Conference Hall, Beijing Conference Hall, Shanghai Conference Hall, Chengdu Metropolitan Stadium and Xi'an Conference Hall.
  • the present invention does not limit the location of the conference here, and may also be in other venues, and is merely illustrative.
  • the transmitting end is also a multimedia communication terminal device, and for example, may be a video conference terminal or a desktop video terminal. It can also be other multimedia communication terminal devices.
  • the transmitting end is configured to collect signals such as video and audio of the site where the transmitting end is located, and transmit the signals to other transmitting ends or the central control device through the network.
  • the transmitting end can also be connected to a display device, such as a television set, which displays the image as an echo device.
  • the sender typically includes a core codec, camera, omnidirectional microphone, and remote control.
  • the core codec is used to transmit the image and sound code input by the camera and the microphone through the network, and at the same time, after decoding the video transmitted by the network, the image is restored to the display device, and the audio transmitted by the network is decoded, and the sound is restored. On the sound, real-time interaction with other senders is achieved.
  • the central control device is configured to switch the input multi-party conference signal, and the conference signal includes at least one of audio, video and data.
  • the central control device transmits the audio signal in a multi-channel hybrid mode or a switching manner, and directly transmits the video signal in a direct distribution manner, and adopts a broadcast mode or lossless audio coding for the data signal (English name: Meridian Lossless Packing, English abbreviation: MLP) Way to transfer.
  • MLP Meridian Lossless Packing
  • the central control device also completes the processing of communication control signals and network interface signals.
  • the central control device may follow all the audio streams. Sort to a small volume, get the top M audio stream with the highest volume, mix the first M audio stream, and send it to all the senders. Assume that the identifier * indicates the 2 channels of audio with the highest volume in the current site. As shown in FIG.
  • the audio stream 3 and the audio stream 4 are mixed, and the audio stream 3 and the audio stream 4 are mixed to the transmitter 1 and At the transmitting end 2, only the audio stream 3 is sent to the transmitting end 4, and only the audio stream 4 is sent to the transmitting end 3, without having to mix the audio stream of the receiving side to avoid hearing the echo, that is, it is not necessary to send the audio stream 3 To the sender 3, the audio stream 4 is sent to the sender 4.
  • the center control instructs the other sender to switch the screen to display the video related to the audio of the maximum volume, assuming that the maximum volume is the volume of the audio stream 4, as shown in FIG. 3, the sender 1 displays the video stream of the sender 4, and the sender 2 displays the transmission.
  • the transmitting end 4 still displays the video stream of the transmitting end 3, and the transmitting end 3 displays the video stream of the transmitting end 4.
  • the mainstream central control equipment has a multi-point control unit (English name: Multi-point Control Unit, English abbreviation: MCU), and the multi-point control unit may be replaced by other devices that implement the same function in the future, which are within the scope of the present invention. .
  • MCU Multi-point Control Unit
  • the network may be an IP network for transmitting signals between the central control device and the transmitting end of different sites.
  • the network may also be other types of transmission networks, and the invention is not limited herein.
  • the central control device and the transmitting end in FIG. 1 can be implemented in the manner of the computer device (or system) in FIG.
  • FIG. 4 is a schematic diagram of a computer device according to an embodiment of the present invention.
  • the computer device 100 includes at least one processor 101, a communication bus 102, a memory 103, and at least one communication interface 104.
  • the processor 101 can be a processor or a collective name for a plurality of processing elements.
  • the processor 101 may be a general-purpose central processing unit (English name: Central Processing Unit, English abbreviation: CPU), or may be an application-specific integrated circuit (English name: ASIC), or One or more integrated circuits for controlling the execution of the program of the present invention, such as: one or more microprocessors (English full name: digital signal processor, English abbreviation: DSP), or one or more field programmable gate arrays (English full name: Field Programmable Gate Array, English abbreviation: FPGA).
  • the processor 101 may include one or more CPUs, such as the one in FIG. CPU0 and CPU1.
  • computer device 100 can include multiple processors, such as processor 101 and processor 108 in FIG. Each of these processors can be a single-CPU processor or a multi-core processor.
  • a processor herein may refer to one or more devices, circuits, and/or processing cores for processing data, such as computer program instructions.
  • the communication bus 102 can be an industry standard architecture (English name: Industry Standard Architecture, English abbreviation: ISA) bus, external device interconnection (English full name: Peripheral Component, English abbreviation: PCI) bus or extended industry standard architecture (English full name) :Extended Industry Standard Architecture, English abbreviation: EISA) bus.
  • the bus can be divided into an address bus, a data bus, a control bus, and the like. For ease of representation, only one thick line is shown in Figure 4, but it does not mean that there is only one bus or one type of bus.
  • the memory 103 can be a read-only memory (English full name: read-only memory, English abbreviation: ROM) or other types of static storage devices that can store static information and instructions.
  • Random access memory English full name: random access memory, English abbreviation : RAM
  • dynamic storage devices that can store information and instructions
  • electrically erasable programmable read-only memory English full name: Electrically Erasable Programmable Read-Only Memory, English abbreviation: EEPROM
  • read-only optical disk English full name: Compact Disc Read-Only Memory, English abbreviation: CD-ROM) or other disc storage
  • CD storage including compressed discs, laser discs, CDs, digital versatile discs, Blu-ray discs, etc.
  • a device or any other medium that can be used to carry or store desired program code in the form of an instruction or data structure and that can be accessed by a computer, but is not limited thereto.
  • the memory can exist independently and be connected to the processor via a
  • the memory 103 is used to store application code for executing the solution of the present invention, and is controlled by the processor 101 for execution.
  • the processor 101 is configured to execute application code stored in the memory 103.
  • the communication interface 104 uses a device such as any transceiver for communicating with other devices or communication networks, such as Ethernet, Radio Access Network (RAN), and Wireless LAN (English name: Wireless Local Area Networks, English abbreviation) : WLAN) and so on.
  • the communication interface 104 may include a receiving unit that implements a receiving function, and a transmitting unit that implements a transmitting function.
  • the computer device 100 shown in FIG. 4 may be the transmitting end in FIG.
  • the communication interface 104 is configured to receive an RTCP packet that includes a pause indication sent by the central control device, an RTCP packet that includes a recovery response, or an RTCP packet that includes an update message.
  • the communication interface 104 is further configured to send a media stream to the central control device, where the media stream includes an audio stream and a video stream.
  • the communication interface 104 is further configured to send, to the central control device, an RTCP message including a suspension response or an RTCP message including a recovery request.
  • the processor 101 is configured to determine that the volume of the monitored audio stream updated by the sender is greater than a volume threshold.
  • the memory 103 is configured to store an RTCP message including a pause indication, an RTCP message including a recovery response, or content included in an RTCP message including the update message, for example, a volume threshold or a number of channels that can also be mixed.
  • Computer device 100 may also include an output device 105 and an input device 106.
  • the output device 105 can be a display device or an audio device, the display device is for displaying the received video stream, and the audio is used to output the received audio stream.
  • the input device 106 can be a camera or a microphone. The camera is used to acquire the scene of the venue, that is, the video stream, and the microphone is used to acquire the sound of the venue, that is, the audio stream.
  • the computer device 100 shown in FIG. 4 may be the central control device in FIG.
  • the communication interface 104 is configured to receive a real-time transport protocol (English full name: RTP) message sent by the sending end, and each of the RTP messages includes a volume of the audio stream, where N is greater than or equal to 2 Natural number.
  • RTP real-time transport protocol
  • the communication interface 104 is further configured to receive an RTCP packet that includes a pause response sent by the sending end, and an RTCP packet that includes the recovery request.
  • the communication interface 104 is further configured to send, to the sending end, an RTCP packet including a pause indication, an RTCP packet including a recovery response, and an RTCP packet including the update message.
  • the processor 101 is configured to determine a volume threshold according to a volume of the audio stream, and determine an audio stream whose volume is less than or equal to the first volume threshold.
  • the memory 103 is configured to store a volume threshold according to a volume of the audio stream, and an audio stream to be processed and a video stream related to the audio stream.
  • An embodiment of the present invention provides a method for adjusting media stream transmission, as shown in FIG. 5, including:
  • Step 201 The sending end 1 to the sending end N respectively send an RTP message to the central control device.
  • the RTP message includes an audio stream and a volume of the audio stream.
  • Step 202 The central control device receives the RTP message.
  • the central control device receives the RTP packets sent by the N senders.
  • Step 203 The central control device acquires N RTP packets.
  • the central control device acquires RTP packets sent by the N senders.
  • Each of the RTP messages includes a volume of an audio stream and an audio stream, and N is a natural number greater than or equal to 2.
  • Step 204 The central control device determines a first volume threshold.
  • the central control device determines the first volume threshold according to the volume of the N audio streams.
  • the first volume threshold may be a dynamic threshold, and is applicable to a conference with high activity, such as a seminar, and the dynamic threshold may be any value within a range of values.
  • the central control device sorts the volume of the N audio streams from large to small; and then from the audio stream sorted according to the volume of the N audio streams, the audio stream from the first channel to the Mth.
  • the first M channel audio stream of the road audio stream is determined as the M channel audio stream that the center control device needs to mix, M is a natural number greater than or equal to 1 and less than N, M represents the number of mixed channels, and M is less than or equal to the preset.
  • the number of mixing channels, the M channel audio stream is a mixed audio stream.
  • the central control device determines the volume between the volume of the audio stream of the Mth channel and the volume of the audio stream of the M+1th channel as the first volume threshold.
  • the first volume threshold may be a static threshold, and is applicable to a less active conference such as a training class or a presentation class, and the static threshold may be set to a basic background noise.
  • the central control device first obtains an average of the volume of the N audio streams, and then determines the first volume threshold based on an average of the volume of the N audio streams.
  • the central control device determines an audio stream whose volume is greater than the first volume threshold, and if the number of audio streams whose volume is greater than the first volume threshold is greater than a preset number of mixing channels, the volume is greater than the first volume
  • the audio stream of the threshold is sorted according to the volume from the largest to the smallest, and the audio stream of the preset mixing path is obtained, and the audio stream of the preset mixing path is mixed; if the volume is greater than the audio stream of the first volume threshold The number is less than or equal to the preset number of mixing channels, and the audio stream whose volume is greater than the first volume threshold is directly mixed.
  • Step 205 The central control device determines an audio stream whose volume is less than or equal to the first volume threshold.
  • Step 206 The central control device sends an RTCP packet including a pause indication to each of the sending ends corresponding to the X audio streams whose volume is less than or equal to the first volume threshold.
  • the pause indication is used to indicate that the sender corresponding to the X audio streams pauses to send the audio stream to the central control device.
  • the RTCP packet including the pause indication further includes a threshold, where the threshold is used to represent the first volume threshold, so that the X audio streams corresponding to the RTCP packet including the pause indication are received.
  • a transmitting end monitors the volume of the audio stream in real time according to the first volume threshold, and sends an RTCP packet including the recovery request to the central control device when the volume of the audio stream of any of the transmitting ends is greater than the first volume threshold.
  • the RTCP message including the pause indication further includes a remaining maximum number, where the remaining maximum number is used to indicate the number of channels that can be mixed, and the number of the mixed channels is LM, where L represents a preset mix.
  • the number of sound paths, M represents the number of mixed channels, and when the number of the mixed channels is greater than 0, any one of the transmitting ends corresponding to the X audio streams includes a recovery request to the central control device.
  • the RTCP message including the pause indication further includes whether the video stream is associated, and the associated video stream is used to indicate that any one of the sending ends corresponding to the X audio streams is suspended to the central control device.
  • the transmission of the audio stream is suspended while the video stream associated with the audio stream is sent to the central control device.
  • Step 207 The sending end N receives the RTCP packet that is sent by the central control device and includes a pause indication.
  • the pause indication is used to instruct the sender to pause sending an audio stream to the central control device.
  • Step 208 The sending end N sends an RTCP packet including a pause response to the central control device.
  • the pause response is used to indicate that the sender has suspended sending the audio stream to the central control device.
  • Step 209 The central control device sends an audio stream.
  • the central control device then sends the audio stream to all senders, but the central control device does not receive the audio stream of the sender N.
  • the central control device determines a first volume threshold according to the volume of the N audio streams, and determines an audio stream whose volume is less than or equal to the first volume threshold.
  • An RTCP message including a pause indication is sent to each of the transmitting ends corresponding to the X audio streams whose volume is less than or equal to the first volume threshold, indicating that each transmitting end pauses to send the audio stream to the central control device.
  • the central control device determines the audio stream of the mix according to the volume of the audio stream reported by the sender, and further sends a pause indication to the sender that does not need to mix, so that the sender that does not need the mix pauses to send the audio stream to the central control device. It can effectively improve the utilization of processing resources between the sender and the central control device.
  • An embodiment of the present invention provides a method for adjusting media stream transmission, and assumes that five senders participate in a conference, as shown in FIG. 6, including:
  • Step 301 The central control device receives the RTP packet.
  • the central control device receives RTP messages sent by all senders. For example, it is assumed that the central control device receives the RTP message sent by the transmitting end 1 including the volume of the first audio stream and the first audio stream, and also receives the second audio stream and the second audio stream sent by the transmitting end 2.
  • the RTP message of the volume also receives the RTP message including the volume of the third audio stream and the third audio stream sent by the transmitting end 3, and also receives the fourth audio stream and the fourth audio stream sent by the transmitting end 4.
  • the volume RTP message also receives an RTP message sent by the transmitting end 5 including the volume of the fifth audio stream and the fifth audio stream.
  • Step 302 The central control device determines a first volume threshold.
  • the central control device determines the first volume threshold according to the volume of the five audio streams. First, the center control device sorts the volume of the five audio streams from large to small.
  • the volume of the first audio stream is A
  • the volume of the second audio stream is B
  • the volume of the third audio stream is C
  • the volume of the fourth audio stream is D
  • the volume of the fifth audio stream is E
  • E>C >D>B>A the first three audio streams of the fifth audio stream, the third audio stream, and the fourth audio stream are determined to be centered from the audio stream sorted according to the volume of the five audio streams.
  • the control device requires a three-way audio stream for mixing. It should be noted that, in the prior art, the maximum mixing is three ways, and if more than three audio streams are mixed, the human ear may not recognize the multi-channel mixing. Of course, it is not limited to three ways.
  • the central control device determines any one of the volume between the volume of the audio stream of the fourth channel and the volume of the audio stream of the second channel as the first volume threshold.
  • Step 303 The central control device determines that the volume of the second audio stream is equal to the first volume threshold, and the volume of the first audio stream is less than the first volume threshold.
  • the audio stream whose audio stream has a volume equal to the first volume threshold may be mixed.
  • Step 304 The central control device sends an RTCP packet including a pause indication to the sender 1 and the sender 2.
  • the pause indication is used to instruct the sender 1 and the sender 2 to suspend transmission of the audio stream to the central control device.
  • the RTCP packet including the pause indication further includes a threshold, where the threshold is used to indicate the first volume threshold, so that any one of the sending ends corresponding to the X audio streams that receive the RTCP packet including the pause indication is sent.
  • the terminal monitors the volume of the audio stream in real time according to the first volume threshold, and sends an RTCP packet including the recovery request to the central control device when the volume of the audio stream of any of the transmitting ends is greater than the first volume threshold, X It is a natural number greater than or equal to 1 and less than N.
  • the RTCP message including the pause indication further includes a remaining maximum number and an associated video stream, the remaining maximum number is used to indicate the number of channels that can also be mixed, and the number of the remaining channels is LM, and L represents a preset mix.
  • the number of paths, M represents the number of mixed channels, and when the number of the mixable channels is greater than 0, any one of the transmitting ends corresponding to the X audio streams includes a recovery request RTCP to the central control device.
  • a message, wherein the associated video stream is used to indicate that any one of the sending ends corresponding to the X audio streams pauses to send an audio stream to the central control device while suspending transmission to the central control device Stream associated video stream.
  • Step 305 The sending end 1 receives the RTCP packet that is sent by the central control device and includes a pause indication.
  • Step 306 The sending end 2 receives the RTCP packet that is sent by the central control device and includes a pause indication.
  • Step 307 The sender 1 sends an RTCP packet including a suspension response to the central control device.
  • the pause response is used to indicate that the sender 1 has suspended sending an audio stream to the central control device.
  • Step 308 The sending end 2 sends an RTCP packet including a pause response to the central control device.
  • the pause response is used to indicate that the sender 2 has suspended sending an audio stream to the central control device.
  • Step 309 The central control device sends a mix.
  • the central control device sends a mix of the third audio stream of the transmitting end 3, the fourth audio stream of the transmitting end 4, and the fifth audio stream of the transmitting end 5 to the transmitting end 1.
  • the central control device transmits a mixture of the third audio stream of the transmitting end 3, the fourth audio stream of the transmitting end 4 and the fifth audio stream of the transmitting end 5 to the transmitting end 2.
  • the central control device transmits a mix of the fourth audio stream of the transmitting end 4 and the fifth audio stream of the transmitting end 5 to the transmitting end 3.
  • the center control device transmits a mixture of the third audio stream of the transmitting end 3 and the fifth audio stream of the transmitting end 5 to the transmitting end 4.
  • the center control device transmits a mixture of the third audio stream of the transmitting end 3 and the fourth audio stream of the transmitting end 4 to the transmitting end 5.
  • the central control device determines the first volume threshold according to the volume of the audio stream, and obtains that the volume of the transmitting end 1 is less than the first volume threshold, and the transmitting end 2 The volume is equal to the first volume threshold, and the RTCP message including the pause indication is sent to the sender 1 and the sender 2. Therefore, the central control device determines the audio stream of the mix according to the volume of the audio stream reported by the sender, and further sends a pause indication to the sender that does not need to mix, so that the sender that does not need the mix pauses to send the audio stream to the central control device. It can effectively improve the utilization of processing resources between the sender and the central control device.
  • the method steps shown in FIG. 6 above may be specifically implemented by the computer device shown in FIG.
  • the steps can all be implemented by the communication interface 104.
  • the determining, by the processor 101, the method step of determining the first volume threshold, and the processing of the message such as the audio stream whose volume is less than or equal to the first volume threshold is performed in step 303.
  • the transmission of the audio stream can also be resumed in the following manner.
  • the sending end that pauses the sending of the audio stream may continue to monitor the volume of the audio stream, determine whether the volume is greater than the volume threshold, and send the audio stream to the central control device, and specifically includes the following steps.
  • Step 310 The transmitting end 1 saves the first volume threshold.
  • Step 311 The transmitting end 2 saves the first volume threshold.
  • Step 312 The transmitting end 1 monitors the volume of the audio stream of the transmitting end.
  • Step 313 The sender 1 determines that the monitored volume of the audio stream updated by the sender is greater than the first volume threshold.
  • Step 314 The sender 1 sends an RTCP packet including a recovery request to the central control device.
  • the recovery request includes a volume of an audio stream updated by the transmitting end 1, and the recovery request is used by the transmitting end 1 to request the central control device to instruct the transmitting end 1 to transmit an audio stream to the central control device.
  • the RTCP packet including the recovery request further includes an association of the video stream, and the sender requests the center control device to instruct the sender to send the audio stream to the central control device, and also requests the center control device to instruct the sender to control the center.
  • the device sends a video stream.
  • the video stream associated with the audio stream needs to maintain operations related to the audio stream.
  • Step 315 The central control device receives the RTCP packet that is sent by the sender 1 and includes the recovery request.
  • Step 316 The central control device determines whether the volume of the audio stream updated by the sender 1 is greater than the volume of the audio stream of any of the mixed channels.
  • step 317 If the volume of the audio stream of the transmitting end 1 is greater than the volume of the audio stream of any of the mixed channels, perform step 317 to step 322; if the volume of the updated audio stream of the transmitting end 1 is less than or equal to all the mixed audio streams The volume control unit continues to send the mixed audio stream to the sender, and step 309 is performed.
  • Step 317 The central control device determines the second volume threshold according to the volume of the audio stream updated by the sender 1 and the volume of the audio stream of the other sender.
  • step 204 For a specific determination method, reference may be made to the specific description in step 204.
  • Step 318 The central control device determines that the volume of the second audio stream is equal to the second volume threshold, and the third audio stream The volume is less than the second volume threshold.
  • Step 319 The central control device sends an RTCP packet including a pause indication to the sender 3.
  • the central control device sends an RTCP packet including a pause indication to each of the sending ends corresponding to the Y audio streams whose volume is less than or equal to the second volume threshold.
  • Step 320 The sending end 3 sends an RTCP packet including a pause response to the central control device.
  • the RTCP packet including the pause response further includes a second volume threshold, so that the sender 3 saves the second volume threshold, monitors the volume of the audio stream of the sender 3, and determines that the monitored sender 3 is updated.
  • the volume of the audio stream is greater than the second volume threshold, and an RTCP message including a recovery request is sent to the central control device.
  • Step 321 The central control device sends an RTCP packet including a recovery response to the transmitting end 1.
  • the recovery response is used by the central control device to instruct the sender 1 to send an audio stream to the central control device.
  • Step 322 The central control device sends a mix.
  • the central control device After receiving the updated audio stream sent by the transmitting end 1, the central control device sends the audio stream reported by the transmitting end 4 and the audio stream reported by the transmitting end 5 to the transmitting end 1 again.
  • the central control device sends the audio stream updated by the sender 1 , the audio stream reported by the sender 4 and the audio stream reported by the sender 5 to the sender 2 .
  • the central control device sends the audio stream updated by the sender 1 , the audio stream reported by the sender 4 and the audio stream reported by the sender 5 to the sender 3 .
  • the central control device mixes the audio stream updated by the sender 1 with the audio stream reported by the sender 5 and sends it to the sender 4.
  • the audio stream updated by the transmitting end 1 of the central control device and the audio stream reported by the transmitting end 4 are mixed and sent to the transmitting end 5.
  • the sending end that receives the pause indication pauses sending the audio stream to the central control device, and saves the volume threshold, and continues to monitor the volume of the audio stream of the sending end in real time, and when the volume of the audio stream is greater than the volume threshold, the center is
  • the control device sends a resume request to facilitate sending an updated audio stream to the central control device so that the sender can receive a clear mix.
  • the method steps shown in FIG. 7 above may be specifically implemented by the computer device shown in FIG.
  • the saving of the first volume threshold in step 310 and step 311 is implemented by the memory 103; the sending of the RTCP message in step 314 and the sending of the audio stream in step 322, and other method steps of transmitting and receiving may be performed. It is implemented by the communication interface 104.
  • step 312 the volume of the audio stream of the sending end is monitored, and the volume of the audio stream updated by the sending end that is detected in step 313 is greater than the first volume threshold, and the second volume threshold is determined in step 317.
  • the method steps of the message can be implemented by the processor 101.
  • the transmission of the audio stream can also be resumed according to the following method, and the center control device can remix by determining whether the number of mixing channels can be remixed.
  • the sending response is sent to the sending end that pauses the sending of the audio stream, and specifically includes the following steps.
  • Step 323 The central control device determines whether the number of the mixed channels is smaller than the preset number of mixing channels of the central control device.
  • the central control device may receive the leaving message sent by the sending end of the central control device to allow the sending of the audio stream, and the central control device determines that the number of mixed channels is smaller than the preset mixing path of the central control device.
  • the volume of the audio stream may be different every moment, and the central control device may receive the audio stream sent by the sending end to be less than the volume threshold, and the transmitting end is not allowed to send the audio.
  • the central control device determines that the number of mixed channels is less than the preset number of mixing channels of the central control device.
  • step 324 or step 326 is performed.
  • step 324 is performed.
  • step 326 is performed.
  • step 309 is performed to continue to send the mix.
  • Step 324 The central control device sends an RTCP packet including a recovery response to the transmitting end 1.
  • the recovery response is used by the central control device to instruct the sender to send an audio stream to the central control device.
  • Step 325 The central control device sends a mix.
  • the central control device After receiving the updated audio stream sent by the transmitting end 1, the central control device sends the audio stream reported by the transmitting end 4 and the audio stream reported by the transmitting end 5 to the transmitting end 1 again.
  • the central control device sends the audio stream updated by the sender 1 , the audio stream reported by the sender 4 and the audio stream reported by the sender 5 to the sender 2 .
  • the central control device sends the audio stream updated by the sender 1 , the audio stream reported by the sender 4 and the audio stream reported by the sender 5 to the sender 3 .
  • the central control device mixes the audio stream updated by the sender 1 with the audio stream reported by the sender 5 and sends it to the sender 4.
  • the audio stream updated by the transmitting end 1 of the central control device and the audio stream reported by the transmitting end 4 are mixed and sent to the transmitting end 5.
  • Step 326 The central control device sends an RTCP packet including a recovery response to the transmitting end 2.
  • Step 327 The central control device sends a mix to the transmitting end 2, the transmitting end 4, and the transmitting end 5.
  • the central control device After the central control device receives the updated audio stream sent by the sending end 2, the central control device sends the audio stream updated by the sending end 2, the audio stream reported by the transmitting end 4, and the audio stream reported by the transmitting end 5 to the transmitting end. 1.
  • the central control device sends the audio stream reported by the transmitting end 4 and the audio stream reported by the transmitting end 5 to the transmitting end 2 again.
  • the central control device sends the audio stream updated by the sender 2, the audio stream reported by the sender 4, and the audio stream reported by the sender 5 to the sender 3.
  • the central control device mixes the audio stream updated by the sender 2 with the audio stream reported by the sender 5 and sends it to the sender 4.
  • the audio stream updated by the transmitting end 2 of the central control device and the audio stream reported by the transmitting end 4 are mixed and sent to the transmitting end 5.
  • step 323 may be performed first, and the central control device determines whether the number of mixed channels is smaller than the preset number of mixing channels of the central control device, and the number of mixed channels is equal to the preset number of mixed channels. The central control device further determines whether the volume of the audio stream of the transmitting end is greater than the volume of the audio stream of any of the mixed channels.
  • the transmitting end pauses to send the audio stream to the central control device, and if the central control device determines that the number of mixed channels is less than the preset number of mixing channels of the central control device, actively moves to the central control device.
  • the sending end of the sending audio stream sends an RTCP packet including a recovery response, to indicate that the sending end sends an audio stream to the central control device, and improves the utilization of processing resources between the transmitting end and the central control device.
  • step 324 The transmitting RTCP message and the sending audio stream described in step 325, as well as other method steps of transmitting and receiving, may be implemented by the communication interface 104.
  • the method step of determining whether the number of mixed channels is smaller than the number of preset mixing channels of the central control device, such as the step 323, may be implemented by the processor 101.
  • the update message may also be sent in the following manner.
  • the method further includes the following steps.
  • Step 328 The central control device sends an RTCP packet including the update message to the sender 1.
  • the update message includes a third volume threshold and a number of channels that can also be mixed, and the number of the mixer channels is a difference between the preset number of mixing channels and the number of mixed channels.
  • Step 329 The central control device sends an RTCP packet including the update message to the sending end 2.
  • Step 330 The sending end 1 receives the RTCP packet that is sent by the central control device and includes an update message.
  • Step 331 The sending end 2 receives the RTCP packet that is sent by the central control device and includes an update message.
  • Step 332 The transmitting end 1 saves the third volume threshold.
  • Step 333 The sender 1 monitors the volume of the audio stream of the sender.
  • step 334 is performed.
  • Step 334 The sender 1 sends an RTCP packet including a recovery request to the central control device.
  • the recovery request includes a volume of the audio stream updated by the sender, and the recovery request is used by the sender to request the center control device to instruct the sender to send an audio stream to the central control device.
  • Step 335 The central control device receives the RTCP packet that is sent by the sending end and includes the recovery request.
  • the transmitting end 2 can also perform steps 332 to 334.
  • the central control device may also actively send an RTCP packet including an update message to the sender that sends the audio stream to the central control device after the audio stream is suspended from the central control device.
  • the RTCP message of the update message includes a third volume threshold and a number of channels that can be mixed, so that the sender can determine that the volume of the audio stream updated by the sender is greater than the third volume threshold, or when the number of channels that can be mixed is greater than At 0, the audio stream is sent to the central control device, thereby improving the utilization of processing resources between the transmitting end and the central control device.
  • the method steps shown in FIG. 9 above may be specifically implemented by the computer device shown in FIG.
  • the saving the third volume threshold in step 332 is implemented by the memory 103; the sending of the RTCP message in step 328 and the receiving of the RTCP message in step 330, and other method steps of transmitting and receiving may be performed by communication.
  • Interface 104 is implemented.
  • the method step of processing the message, such as the volume of the audio stream of the transmitting end, as described in step 333, may be implemented by the processor 101.
  • the central control device sends an update message to the sending end in time, so that the sending end can resume sending the audio stream.
  • the update message can be sent periodically, and the period can be set according to the actual situation, which is not limited herein.
  • the present invention provides a schematic diagram of a structure of an RTCP packet, including:
  • Target Synchronization Source Identifier (Target SSRC), which is 32 bits. This identifier is randomly selected. The two synchronization sources participating in the same video conference cannot have the same SSRC.
  • Type is used to indicate that the RTCP message is a type of message in the pause, resume, update, or response.
  • the type of the RTCP message including the pause indication indicates a pause.
  • the version (Res) is used to indicate the version of the protocol.
  • the parameter length (Parameter Len) is used to indicate the length of the RTCP packet.
  • the RTCP message When the RTCP message is an RTCP message including a pause indication, the RTCP message further includes a pause identifier (Pause ID) for indicating the identifier of the sender that pauses the transmission of the audio stream.
  • a pause identifier (Pause ID) for indicating the identifier of the sender that pauses the transmission of the audio stream.
  • the RTCP message including the pause indication further includes a Type Threshold for indicating a dynamic threshold or a static threshold. Threshold Value is used to represent the volume threshold.
  • the RTCP message including the pause indication further includes a Remaining Mix Num for indicating the number of channels that can be mixed, indicating the maximum number of mixes that the center control device can support minus the actual mix. Number of roads.
  • the RTCP message including the pause indication further includes whether the associated video stream (Is Related Video) is used to indicate whether the central control device needs to perform the same pause or resume operation on the audio stream associated with the audio stream.
  • the associated video stream Is Related Video
  • pause identifier the threshold type, the threshold, the remaining maximum number, and whether the associated video stream can be set in the reserved bit of the RTCP message.
  • the RTCP message when the RTCP message is an RTCP message including a recovery request, the RTCP message including the recovery request further includes an audio level (Audio Level) for indicating a volume value of the audio stream.
  • an audio level Audio Level
  • the embodiment of the present invention provides a central control device 30, as shown in FIG.
  • the receiving unit 301 is configured to acquire a real-time transport protocol RTP message sent by the N sending ends, where the RTP message includes a volume of the audio stream, and N is a natural number greater than or equal to 2;
  • the processing unit 302 is configured to determine a first volume threshold according to a volume of the N audio streams
  • the processing unit 302 is further configured to determine an audio stream whose volume is less than or equal to the first volume threshold;
  • the sending unit 303 is configured to send, according to each of the sending ends corresponding to the X audio streams whose volume is less than or equal to the first volume threshold, a real-time transmission control protocol RTCP packet including a pause indication, where X is greater than or equal to 1 and less than a natural number of N, the pause indication is used to indicate that the sender corresponding to the X audio streams pauses to send an audio stream to the central control device.
  • the central control device determines a first volume threshold according to the volume of the N audio streams, and determines an audio stream whose volume is less than or equal to the first volume threshold.
  • An RTCP message including a pause indication is sent to each of the transmitting ends corresponding to the X audio streams whose volume is less than or equal to the first volume threshold, indicating that each transmitting end pauses to send the audio stream to the central control device.
  • the central control device determines the audio stream of the mix according to the volume of the audio stream reported by the sender, and further sends a pause indication to the sender that does not need to mix, so that the sender that does not need the mix pauses to send the audio stream to the central control device. It can effectively improve the utilization of processing resources between the sender and the central control device.
  • the center control device 30 is presented in the form of a functional unit.
  • the "unit” herein may refer to an application-specific integrated circuit (English name: ASIC), a circuit, a processor and a memory that execute one or more software or firmware programs, an integrated logic circuit, and/or Or other devices that provide the above functions.
  • ASIC application-specific integrated circuit
  • the central control device 30 can take the form shown in FIG.
  • the receiving unit 301, the processing unit 302 and the sending unit 303 can be implemented by the computer device of FIG. 4, and specifically, the receiving unit 301, and the transmitting unit 303 can be implemented by the communication interface 104.
  • Processing unit 302 can be implemented by processor 101.
  • the embodiment of the present invention provides a sending end 40, as shown in FIG. 12, including:
  • the sending unit 401 is configured to send a real-time transport protocol RTP message to the central control device, where the RTP message includes a volume of the audio stream;
  • the receiving unit 402 is configured to receive a real-time transmission control protocol (RTCP) packet that is sent by the central control device and includes a pause indication, where the suspension indication is used to instruct the sending end to suspend sending the audio stream to the central control device;
  • RTCP real-time transmission control protocol
  • the sending unit 401 is further configured to send, to the central control device, an RTCP packet that includes a pause response, where the pause response is used to indicate that the sending end has suspended sending the audio stream to the central control device.
  • the transmitting end sends the volume of the audio stream and the audio stream to the central control device, and after receiving the volume of the audio stream reported by the plurality of transmitting ends, the central control device determines the number of mixing channels and the volume threshold according to the volume of the audio stream.
  • the RTCP packet is sent to the corresponding sender of the audio stream, and the RTCP packet includes a pause indication, indicating that the sender pauses to send the audio stream to the central control device, and the sender receives the center control.
  • the RTCP packet including the pause indication further includes a first volume threshold.
  • the sender 40 further includes:
  • a monitoring unit 404 configured to monitor a volume of the audio stream of the sending end
  • the processing unit 405 is configured to determine that the monitored volume of the audio stream updated by the sending end is greater than the first volume threshold
  • the sending unit 401 is further configured to send, to the central control device, an RTCP packet that includes a recovery request, where the recovery request includes a volume of the audio stream that is updated by the sending end, and the recovery request is used to request the central control device Instructs the sender to send an audio stream to the central control device.
  • the receiving unit 402 is further configured to receive, by the central control device, an RTCP packet that includes an update message, where the update message includes a second volume threshold and a number of channels that can also be mixed;
  • the storage unit 403 is further configured to save the second volume threshold
  • the monitoring unit 404 is further configured to monitor a volume of the audio stream of the sending end;
  • the processing unit 405 is further configured to determine that the monitored volume of the audio stream updated by the sending end is greater than the second volume threshold.
  • the transmitting end 40 is presented in the form of a functional unit.
  • the "unit” herein may refer to an application-specific integrated circuit (English name: ASIC), a circuit, a processor and a memory that execute one or more software or firmware programs, an integrated logic circuit, and/or Or other devices that provide the above functions.
  • ASIC application-specific integrated circuit
  • the transmitting end 40 can take the form shown in FIG.
  • the sending unit 401, the receiving unit 402, the storage unit 403, the monitoring unit 404, and the processing unit 405 can be implemented by the computer device of FIG. 4.
  • the receiving unit 402, and the sending unit 401 can be implemented by the communication interface 104, and the processing unit 302 And monitoring unit 404 can be implemented by processor 101.
  • the embodiment of the present invention further provides a computer storage medium for storing computer software instructions used by the central control device shown in FIG. 11 above, which includes a program designed to execute the foregoing method embodiments. By executing the stored program, it is possible to control the pause of the audio stream.
  • the embodiment of the present invention further provides a computer storage medium for storing the computer software instructions used by the transmitting end shown in FIG. 12, which includes a program designed to execute the foregoing method embodiment. By executing the stored program, it is possible to control the pause of the audio stream.
  • each functional unit in each embodiment of the present invention may be integrated into one processing unit, or each unit may be physically included separately, or two or more units may be integrated into one unit.
  • the above integrated unit can be implemented in the form of hardware or in the form of hardware plus software functional units.
  • the foregoing program may be stored in a computer readable storage medium, and the program is executed when executed.
  • the foregoing storage medium includes: a read-only memory (ROM), a random access memory (RAM), a magnetic disk, or an optical disk, and the like. The medium of the code.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Telephonic Communication Services (AREA)
  • Data Exchanges In Wide-Area Networks (AREA)

Abstract

本发明的实施例提供一种调整媒体流传输的方法及装置,涉及通信领域,能够根据音量自适应控制媒体流暂停,提高发送端与中心控制设备间的处理资源的利用率。包括:中心控制设备获取N个发送端发送的RTP报文,每个RTP报文包括音频流的音量,N为大于等于2的自然数;根据N个音频流的音量确定第一音量阈值;确定音量小于或等于第一音量阈值的音频流;向音量小于或等于第一音量阈值的X个音频流对应的发送端中每个发送端发送包括暂停指示的RTCP报文,X为大于等于1且小于N的自然数,暂停指示用于指示X个音频流对应的发送端暂停向中心控制设备发送音频流。

Description

一种调整媒体流传输的方法及装置
本申请要求于2016年1月13日提交中国专利局、申请号为201610022357.9、发明名称为“一种调整媒体流传输的方法及装置”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。
技术领域
本发明涉及通信领域,尤其涉及一种调整媒体流传输的方法及装置。
背景技术
多点视频会议系统(英文全称:multi point video conferencing system,英文简称:MPVCS)允许3个或3个以上不同地点的发送端同时参与的会议系统,该系统还包括中心控制设备。发送端通过网络将媒体流发送给中心控制设备,媒体流包括视频流和音频流,中心控制设备负责接收各发送端通过网络发送的媒体流,中心控制设备获取到多路视频流后,确定将一路视频流或多路视频流合并成一路视频流广播出去,以供其他会场的发送端接收。通常,中心控制设备将音量大的发送端的音频流发送至其他发送端,同时,对与其音频相关的视频做策略选择。
中心控制设备仍然会接收到音量较低的音频流,但是,该音量较低的音频流不会被混音发送至其他发送端,这样,对于这些音量较低的音频流照样占用发送端与中心控制设备间的处理资源,导致发送端与中心控制设备间的处理资源的浪费。
在现有技术中,互联网工程任务组(英文全称:internet engineering task force,英文简称:IETF)规定了请求评议(英文全称:request for comments,英文简称:RFC)。RFC3264是基于会话描述协议(英文全称:session description protocol,英文简称:SDP)可用于控制媒体流暂停或恢复。具体的,通过在信令面由会话两端的任一侧发起会话重协商,在相应的需暂停或恢复的媒体流对应的m行置为去激活(inactive)来实现该媒体流单向或者双向的发送控制。待双方协商完成后,媒体路径对应通道的媒体流将会执行相应的控制,即暂停。但是,上述方法耗费信令太多,不适用于在多点会场,及音量动态变化的情况。
发明内容
本发明的目的在于提供一种调整媒体流传输的方法及装置,能够根据音量自适应控制媒体流暂停,从而提高发送端与中心控制设备间的处理资源的利用率。
上述目标和其他目标将通过独立权利要求中的特征来达成。进一步的实现方式在从属权利要求、说明书和附图中体现。
第一方面,提供一种调整媒体流传输的方法,包括:
首先,中心控制设备获取N个发送端发送的实时传输协议RTP报文,每个所述RTP报文包括音频流的音量,N为大于等于2的自然数,目前主流的中心控制设备有多点控制单元(英文全称:Multi-point Control Unit,英文简称:MCU),发送端也就是多媒体通信终 端设备,例如,可以是视频会议终端或桌面式视频终端等;
然后,中心控制设备根据N个音频流的音量确定第一音量阈值;并确定音量小于或等于所述第一音量阈值的音频流;
最后,中心控制设备向所述音量小于或等于所述第一音量阈值的X个音频流对应的发送端中每个发送端发送包括暂停指示的实时传输控制协议RTCP报文,X为大于等于1且小于N的自然数,所述暂停指示用于指示所述X个音频流对应的发送端暂停向中心控制设备发送音频流,发送端也就是多媒体通信终端设备,例如,可以是视频会议终端或桌面式视频终端等。
上述第一方面提供的调整媒体流传输的方法,中心控制设备接收到N个发送端上报的音频流的音量后,根据N个音频流的音量确定第一音量阈值,并确定音量小于或等于所述第一音量阈值的音频流,向音量小于或等于第一音量阈值的X个音频流对应的发送端中每个发送端发送包括暂停指示的RTCP报文,指示每个发送端暂停向中心控制设备发送音频流。从而中心控制设备根据发送端上报的音频流的音量来确定混音的音频流,进一步向不需要混音的发送端发送暂停指示,使得不需要混音的发送端暂停向中心控制设备发送音频流,能够有效提高发送端与中心控制设备间的处理资源的利用率。
其中,中心控制设备根据N个音频流的音量确定第一音量阈值具体的可以采用以下两种方法:
在第一方面的第一种可实现方式中,首先,中心控制设备按照N路音频流的音量从大到小排序;然后,从按照N路音频流的音量从大到小排序的音频流中,将从第一路至第M路的前M路确定为M路音频流,M为大于等于1且小于N的自然数,M表示已混音路数,且M小于或等于预设混音路数L,即已混音的音频流的路数可以与预设混音路数相等也可以小于预设混音路数,X表示未混音路数,N=M+X,所述M路音频流为已混音的音频流;最后,将第M路的音频流的音量与第M+1路的音频流的音量间的音量确定为所述第一音量阈值。
上述第一方面的第一种可实现方式提供的中心控制设备动态的确定音量阈值的具体方法,能够更加实时准确地判断发送端上报的音频流的音量。
在第一方面的第二种可实现方式中,所述中心控制设备根据N个音频流的音量的平均值确定所述第一音量阈值。
上述第一方面的第二种可实现方式提供的中心控制设备静态的确定音量阈值的具体方法,这样判断发送端上报的音频流的音量的速度较快。
结合第一方面、第一方面的第一种可实现方式和第一方面的第二种可实现方式中任一种可实现方式,在第三种可实现方式中,所述包括暂停指示的RTCP报文还包括阈值,所述阈值用于表示所述第一音量阈值,使得接收到包括暂停指示的RTCP报文的音频流对应的发送端中任一发送端根据所述第一音量阈值实时监测音频流的音量,当所述任一发送端的音频流的音量大于所述第一音量阈值时向所述中心控制设备发送包括恢复请求的RTCP报文。所述包括暂停指示的RTCP报文还包括阈值类型。
上述第一方面的第三种可实现方式提供了RTCP报文包括的具体内容,这样以便于发送端接收到这些内容,存储这些内容,根据更新后的音频流的音量大小来判断是否需要向中心控制设备发送音频流。
结合第一方面、第一方面的第一种可实现方式和第一方面的第三种可实现方式中任一 种可实现方式,在第四种可实现方式中,所述包括暂停指示的RTCP报文还包括剩余最大数,所述剩余最大数用于表示还可混音路数,所述还可混音路数为L-M,L表示预设混音路数,M表示已混音路数,当所述还可混音路数大于0时,使所述X个音频流对应的发送端中任一发送端向所述中心控制设备包括恢复请求RTCP报文。
结合第一方面、第一方面的第一种可实现方式和第一方面的第四种可实现方式中任一种可实现方式,在第五种可实现方式中,所述包括暂停指示的RTCP报文还包括是否关联视频流,所述是否关联视频流用于指示所述X个音频流对应的发送端中任一发送端暂停向所述中心控制设备发送音频流的同时暂停向所述中心控制设备发送与所述音频流关联的视频流。
上述第一方面的第五种可实现方式提供了RTCP报文包括的具体内容,这样以便于发送端接收到这些内容,存储这些内容,暂停向中心控制设备发送与所述音频流关联的视频流,提高发送端与中心控制设备间的处理资源的利用率。
结合第一方面、第一方面的第一种可实现方式至第一方面的第五种可实现方式中任一种可实现方式,在第六种可实现方式中,在所述中心控制设备向所述音量小于等于所述第一音量阈值的音频流对应的发送端中每个发送端发送包括暂停指示的RTCP报文之后,所述方法还包括:
所述中心控制设备接收所述X个音频流对应的发送端中每个发送端发送的包括暂停响应的RTCP报文,所述暂停响应用于表示X个音频流对应的发送端中每个发送端已暂停向中心控制设备发送音频流。
结合第一方面的第一种可实现方式至第一方面的第六种可实现方式中任一种可实现方式,在第七种可实现方式中,在所述中心控制设备向所述音量小于或等于所述第一音量阈值的X个音频流对应的发送端中每个发送端发送包括暂停指示的RTCP报文之后,所述方法还包括:
所述中心控制设备接收第一发送端发送的包括恢复请求的RTCP报文,所述恢复请求包括所述第一发送端更新的音频流的音量,所述恢复请求用于请求中心控制设备指示第一发送端向中心控制设备发送音频流,所述第一发送端为X个音频流对应的发送端中的任一发送端;
所述中心控制设备判断所述第一发送端更新的音频流的音量是否大于已混音路数中任一路音频流的音量;
若所述第一发送端更新的音频流的音量大于已混音路数中任一路音频流的音量,所述中心控制设备根据所述第一发送端更新的音频流的音量和N-1发送端的音频流的音量确定第二音量阈值;
所述中心控制设备确定音量小于或等于所述第二音量阈值的音频流;
所述中心控制设备向所述音量小于或等于所述第二音量阈值的Y个音频流对应的发送端中每个发送端发送包括暂停指示的RTCP报文,Y为大于等于1且小于N的自然数。
上述第一方面的第七种可实现方式中,中心控制设备可以根据发送端发送的恢复请求携带的音频流的音量来判断是否恢复发送端发送音频流,从而提高发送端与中心控制设备间的处理资源的利用率。
结合第一方面的第七种可实现方式,在第八种可实现方式中,在所述中心控制设备接 收第一发送端发送的包括恢复请求的RTCP报文之后,所述方法还包括:
所述中心控制设备判断已混音路数是否小于所述中心控制设备预设混音路数;
当所述已混音路数小于所述预设混音路数,所述中心控制设备向所述第一发送端发送包括恢复响应的RTCP报文,所述包括恢复响应的RTCP报文还包括还可混音路数,所述还可混音路数为预设混音路数与已混音路数之差,所述恢复响应用于中心控制设备指示所述第一发送端向中心控制设备发送音频流;
所述中心控制设备判断所述第一发送端更新的音频流的音量是否大于已混音路数中任一路音频流的音量包括:
当所述已混音路数等于所述预设混音路数,所述中心控制设备判断所述第一发送端更新的音频流的音量是否大于已混音路数中任一路音频流的音量。
结合第一方面、第一方面的第一种可实现方式至第一方面的第八种可实现方式中任一种可实现方式,在第九种可实现方式中,在所述中心控制设备向所述音量小于等于所述第一音量阈值的音频流对应的发送端中每个发送端发送包括暂停指示的RTCP报文之后,所述方法还包括:
所述中心控制设备向所述X个音频流对应的发送端中每个发送端发送包括更新消息的RTCP报文,所述更新消息包括第三音量阈值和还可混音路数,所述还可混音路数为预设混音路数与已混音路数之差。
第二方面,提供一种调整媒体流传输的方法,包括:
首先,发送端向中心控制设备发送实时传输协议RTP报文,所述RTP报文包括音频流的音量;再接收所述中心控制设备发送的包括暂停指示的实时传输控制协议RTCP报文,所述暂停指示用于指示发送端暂停向中心控制设备发送音频流;
然后,所述发送端向所述中心控制设备发送包括暂停响应的RTCP报文,所述暂停响应用于表示发送端已暂停向中心控制设备发送音频流。
上述第二方面提供的调整媒体流传输的方法,发送端向中心控制设备发送音频流的音量,使得中心控制设备接收到N个发送端上报的音频流的音量后,根据N个音频流的音量确定第一音量阈值,并确定音量小于或等于所述第一音量阈值的音频流,向音量小于或等于第一音量阈值的X个音频流对应的发送端中每个发送端发送包括暂停指示的RTCP报文,指示每个发送端暂停向中心控制设备发送音频流。从而中心控制设备根据发送端上报的音频流的音量来确定混音的音频流,进一步向不需要混音的发送端发送暂停指示,使得不需要混音的发送端暂停向中心控制设备发送音频流,能够有效提高发送端与中心控制设备间的处理资源的利用率。
在第二方面的第一种可实现方式中,所述包括暂停指示的RTCP报文还包括第一音量阈值,所述方法还包括:
所述发送端保存所述第一音量阈值;
所述发送端监测该发送端的音频流的音量;
所述发送端判断监测到的该发送端更新的音频流的音量大于所述第一音量阈值;
所述发送端向所述中心控制设备发送包括恢复请求的RTCP报文,所述恢复请求包括所述发送端更新的音频流的音量,所述恢复请求用于请求中心控制设备指示发送端向中心控制设备发送音频流;
所述发送端接收所述中心控制设备发送的包括恢复响应的RTCP报文,所述恢复响应用于指示发送端向中心控制设备发送音频流。
在第二方面的第二种可实现方式中,在所述发送端向所述中心控制设备发送包括暂停响应的RTCP报文之后,所述方法还包括:
所述发送端接收所述中心控制设备发送的包括更新消息的RTCP报文,所述更新消息包括第三音量阈值和还可混音路数;
所述发送端保存所述第三音量阈值;
所述发送端监测该发送端的音频流的音量;
所述发送端判断监测到的该发送端更新的音频流的音量大于所述第三音量阈值;
所述发送端向所述中心控制设备发送包括恢复请求的RTCP报文,所述恢复请求包括所述发送端更新的音频流的音量,所述恢复请求用于请求中心控制设备指示发送端向中心控制设备发送音频流。
结合第二方面、第二方面的第一种可实现方式至第二方面的第二种可实现方式中任一种可实现方式,在第二方面的第三种可实现方式中,所述包括暂停指示的RTCP报文还包括还可混音路数,所述包括恢复响应的RTCP报文还包括还可混音路数,所述更新消息还包括还可混音路数,所述还可混音路数为预设混音路数与已混音路数之差,所述方法还包括:
所述发送端判断所述还可混音路数大于0;
所述发送端向所述中心控制设备发送包括恢复请求的RTCP报文。
结合第二方面、第二方面的第一种可实现方式至第二方面的第三种可实现方式中任一种可实现方式,在第二方面的第四种可实现方式中,所述包括暂停指示的RTCP报文还包括是否关联视频流,所述是否关联视频流用于指示发送端暂停向所述中心控制设备发送音频流的同时暂停向所述中心控制设备发送与所述音频流关联的视频流。
第三方面,提供一种中心控制设备,包括:接收单元,用于接收发送端发送的RTP报文或RTCP报文,处理单元,用于处理接收到的RTP报文或RTCP报文,发送单元,用于向发送端发送RTCP报文。具体的实现方式可以参考第一方面提供的调整媒体流传输的方法中中心控制设备的行为的功能。
第四方面,提供一种发送端,包括:接收单元,用于接收中心控制设备发送的RTCP报文,处理单元,用于处理接收到的RTCP报文,发送单元,用于向中心控制设备发送RTCP报文或RTP报文。具体的实现方式可以参考第二方面提供的调整媒体流传输的方法中发送端的行为的功能。
需要说明的是,上述第三方面和第四方面所述功能模块可以通过硬件实现,也可以通过硬件执行相应的软件实现。所述硬件或软件包括一个或多个与上述功能相对应的模块。例如,通信接口,用于完成接收单元和发送单元的功能,处理器,用于完成处理单元的功能,存储器,用于存储音量阈值。处理器、通信接口和存储器通过总线连接并完成相互间的通信。具体的,可以参考第一方面提供的调整媒体流传输的方法中中心控制设备的行为的功能,以及第二方面提供的调整媒体流传输的方法中发送端的行为的功能。
本发明中,中心控制设备和发送端的名字对设备本身不构成限定,在实际实现中,这些设备可以以其他名称出现。只要各个设备的功能和本发明类似,属于本发明权利要求及其等同技术的范围之内。
本发明的这些方面或其他方面在以下实施例的描述中会更加简明易懂。
附图说明
为了更清楚地说明本发明实施例或现有技术中的技术方案,下面将对实施例或现有技术描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本发明的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他的附图。
图1为本发明实施例提供的一种多点视频会议系统示意图;
图2为本发明实施例提供的一种音频流的流向示意图;
图3为本发明实施例提供的一种视频流的流向示意图;
图4为本发明实施例提供的一种计算机硬件结构示意图;
图5为本发明实施例提供的一种调整媒体流传输的方法流程图;
图6为本发明实施例提供的另一种调整媒体流传输的方法流程图;
图7为本发明实施例提供的又一种调整媒体流传输的方法流程图;
图8为本发明实施例提供的再一种调整媒体流传输的方法流程图;
图9为本发明实施例提供的又一种调整媒体流传输的方法流程图;
图10为本发明实施例提供的一种RTCP报文结构示意图;
图11为本发明实施例提供的一种中心控制设备结构示意图;
图12为本发明实施例提供的一种发送端结构示意图。
具体实施方式
下面将结合本发明实施例中的附图,对本发明实施例中的技术方案进行清楚地描述。
本发明的基本原理在于:中心控制设备在混音之后,仍然会接收到音量较低的音频流,导致发送端与中心控制设备间的处理资源的浪费的情况下,中心控制设备根据发送端上报的音频流的音量确定音量阈值,并确定音量小于或等于音量阈值的音频流,向音量小于或等于音量阈值的X个音频流对应的发送端中每个发送端发送包括暂停指示的实时传输控制协议(英文全称:Real-time Transport Control Protocol,英文简称:RTCP)报文,暂停指示用于指示发送端暂停向中心控制设备发送音频流。从而中心控制设备根据发送端上报的音频流的音量来确定混音的音频流,进一步向不需要混音的发送端发送暂停指示,能够有效提高发送端与中心控制设备间的处理资源的利用率。
下面将参考附图详细描述本发明的实施方式。
实施例1
本发明实施例提供一种多点视频会议系统示意图,如图1所示,包括:中心控制设备、网络、发送端1、发送端2、发送端3、发送端4和发送端5。中心控制设备和发送端分别与网络连接。
5个发送端可以位于不同地点会场,例如,深圳会场,北京会场,上海会场,成都会场和西安会场。本发明在此对会议的地点不做限定,还可以在其他会场,这里只是示意性说明。
发送端也就是多媒体通信终端设备,例如,可以是视频会议终端或桌面式视频终端, 也可以是其他多媒体通信终端设备。发送端用于采集该发送端所处会场的视频和音频等信号,通过网络传输到其他发送端或中心控制设备。发送端也可以接上显示设备,例如电视机,电视机作为回显设备显示图像。发送端通常包括核心编解码器、摄像头、全向麦克风和遥控器。核心编解码器用于将摄像头和麦克风输入的图像及声音编码通过网络发送,同时将网络传输来的视频解码后,将图像还原到显示设备上,并将网络传输来的音频解码后,将声音还原到音响上,即实现了与其他发送端的实时交互。
中心控制设备用于对输入的多路会议信号进行切换,会议信号包含音频、视频及数据中至少一种信号。中心控制设备对音频信号采取多路混合的方式或切换方式传送,对视频信号采取直接分配的方式传送,对于数据信号采取广播方式或无损音频编码(英文全称:Meridian Lossless Packing,英文简称:MLP)方式传送。此外,中心控制设备还要完成对通信控制信号和网络接口信号的处理。
示例的,中心控制设备接收到发送端1的音频流1、发送端2的音频流2、发送端3的音频流3和发送端4的音频流4后,可以将对所有音频流按照从大到小的音量进行排序,获取最大音量的前M路音频流,将前M路音频流进行混音,再发送至所有发送端。假设标识*号的表示当前会场中音量最大的2路音频,如图2所示,将音频流3和音频流4混音,将音频流3和音频流4的混音发送至发送端1和发送端2,只将音频流3发送至发送端4,只将音频流4发送至发送端3,而不必将接收方的音频流混音,避免其听到回音,即不必将音频流3发送至发送端3,将音频流4发送至发送端4。
若发送端1显示发送端3的视频流,发送端2显示发送端3的视频流,发送端4显示发送端3的视频流,发送端3显示发送端1的视频流,此时,中心控制设备指示其他发送端切换画面,显示与最大音量的音频相关的视频,假设最大音量为音频流4的音量,如图3所示,发送端1显示发送端4的视频流,发送端2显示发送端4的视频流,发送端4仍然显示发送端3的视频流,发送端3显示发送端4的视频流。
目前主流的中心控制设备有多点控制单元(英文全称:Multi-point Control Unit,英文简称:MCU),多点控制单元未来有可能被其他实现相同功能的设备替代,都在本发明的范围内。
网络可以是IP网络,用于传输中心控制设备与不同会场的发送端间的信号。网络也可以是其他形式的传输网络,本发明在此不做限定。
如图4所示,图1中的中心控制设备和发送端可以以图4中的计算机设备(或系统)的方式来实现。
图4所示为本发明实施例提供的计算机设备示意图。计算机设备100包括至少一个处理器101,通信总线102,存储器103以及至少一个通信接口104。
处理器101可以是一个处理器,也可以是多个处理元件的统称。例如,处理器101可以是一个通用中央处理器(英文全称:Central Processing Unit,英文简称:CPU),也可以是特定应用集成电路(英文全称:application-specific integrated circuit,英文简称:ASIC),或一个或多个用于控制本发明方案程序执行的集成电路,例如:一个或多个微处理器(英文全称:digital signal processor,英文简称:DSP),或,一个或者多个现场可编程门阵列(英文全称:Field Programmable Gate Array,英文简称:FPGA)。
在具体实现中,作为一种实施例,处理器101可以包括一个或多个CPU,例如图4中的 CPU0和CPU1。
在具体实现中,作为一种实施例,计算机设备100可以包括多个处理器,例如图4中的处理器101和处理器108。这些处理器中的每一个可以是一个单核(single-CPU)处理器,也可以是一个多核(multi-CPU)处理器。这里的处理器可以指一个或多个设备、电路、和/或用于处理数据(例如计算机程序指令)的处理核。
通信总线102可以是工业标准体系结构(英文全称:Industry Standard Architecture,英文简称:ISA)总线、外部设备互连(英文全称:Peripheral Component,英文简称:PCI)总线或扩展工业标准体系结构(英文全称:Extended Industry Standard Architecture,英文简称:EISA)总线等。该总线可以分为地址总线、数据总线、控制总线等。为便于表示,图4中仅用一条粗线表示,但并不表示仅有一根总线或一种类型的总线。
存储器103可以是只读存储器(英文全称:read-only memory,英文简称:ROM)或可存储静态信息和指令的其他类型的静态存储设备,随机存取存储器(英文全称:random access memory,英文简称:RAM)或者可存储信息和指令的其他类型的动态存储设备,也可以是电可擦可编程只读存储器(英文全称:Electrically Erasable Programmable Read-Only Memory,英文简称:EEPROM)、只读光盘(英文全称:Compact Disc Read-Only Memory,英文简称:CD-ROM)或其他光盘存储、光碟存储(包括压缩光碟、激光碟、光碟、数字通用光碟、蓝光光碟等)、磁盘存储介质或者其他磁存储设备、或者能够用于携带或存储具有指令或数据结构形式的期望的程序代码并能够由计算机存取的任何其他介质,但不限于此。存储器可以是独立存在,通过总线与处理器相连接。存储器也可以和处理器集成在一起。
其中,所述存储器103用于存储执行本发明方案的应用程序代码,并由处理器101来控制执行。所述处理器101用于执行所述存储器103中存储的应用程序代码。
所述通信接口104,使用任何收发器一类的装置,用于与其他设备或通信网络通信,如以太网,无线接入网(RAN),无线局域网(英文全称:Wireless Local Area Networks,英文简称:WLAN)等。通信接口104可以包括接收单元实现接收功能,以及发送单元实现发送功能。
在具体实现中,作为一种实施例,图4所示的计算机设备100可以是图1中的发送端。
通信接口104,用于接收中心控制设备发送的包括暂停指示的RTCP报文、包括恢复响应的RTCP报文或包括更新消息的RTCP报文。
所述通信接口104,还用于向中心控制设备发送媒体流,媒体流包括音频流和视频流。
所述通信接口104,还用于向中心控制设备发送包括暂停响应的RTCP报文或包括恢复请求的RTCP报文。
处理器101,用于判断监测到的发送端更新的音频流的音量大于音量阈值。
存储器103,用于存储包括暂停指示的RTCP报文、包括恢复响应的RTCP报文或包括更新消息的RTCP报文中包括的内容,例如,音量阈值或还可混音路数。
计算机设备100还可以包括输出设备105和输入设备106。输出设备105可以是显示设备或音响,显示设备用于显示接收到的视频流,音响用于输出接收到的音频流。输入设备106可以是摄像头或者麦克风,摄像头用于获取会场的景象,即视频流,麦克风用于获取会场的声音,即音频流。
在具体实现中,作为一种实施例,图4所示的计算机设备100可以是图1中的中心控制设备。
通信接口104,用于接收发送端发送的实时传输协议(英文全称:real-time transport protocol,英文简称:RTP)报文,每个所述RTP报文包括音频流的音量,N为大于等于2的自然数。
所述通信接口104,还用于接收发送端发送的包括暂停响应的RTCP报文,以及包括恢复请求的RTCP报文。
所述通信接口104,还用于向发送端发送包括暂停指示的RTCP报文、发送包括恢复响应的RTCP报文,以及包括更新消息的RTCP报文。
处理器101,用于根据音频流的音量确定音量阈值,确定音量小于或等于所述第一音量阈值的音频流。
存储器103,用于存储根据音频流的音量确定音量阈值,以及待处理的音频流和与该音频流相关的视频流。
实施例2
本发明实施例提供一种调整媒体流传输的方法,如图5所示,包括:
步骤201、发送端1至发送端N分别向中心控制设备发送RTP报文。
RTP报文包括音频流和该音频流的音量。
步骤202、中心控制设备接收RTP报文。
中心控制设备接收N个发送端分别发送的RTP报文。
步骤203、中心控制设备获取N个RTP报文。
中心控制设备获取N个发送端发送的RTP报文。每个所述RTP报文包括音频流和音频流的音量,N为大于等于2的自然数。
步骤204、中心控制设备确定第一音量阈值。
具体的,中心控制设备根据N个音频流的音量确定第一音量阈值。
可选的,该第一音量阈值可以为动态阈值,适用于活跃度较高的会议如研讨类,动态阈值可以为一个取值范围内的任意一个值。具体的,首先,中心控制设备按照N路音频流的音量从大到小排序;再从按照N路音频流的音量从大到小排序的音频流中,将从第一路音频流至第M路音频流的前M路音频流确定为中心控制设备需要进行混音的M路音频流,M为大于等于1且小于N的自然数,M表示已混音路数,且M小于或等于预设混音路数,所述M路音频流为已混音的音频流。进一步的,中心控制设备将第M路的音频流的音量与第M+1路的音频流的音量间的音量确定为所述第一音量阈值。
可选的,该第一音量阈值可以为静态阈值,适用于活跃度较低的会议如培训类或宣讲类,静态阈值可设置为基础性地背景噪音。中心控制设备先获取N个音频流的音量的平均值,然后,根据N个音频流的音量的平均值确定所述第一音量阈值。进一步的,中心控制设备确定音量大于所述第一音量阈值的音频流,若音量大于所述第一音量阈值的音频流的个数大于预设混音路数,将音量大于所述第一音量阈值的音频流按照音量从大到小排序,获取预设混音路数的音频流,将预设混音路数的音频流进行混音;若音量大于所述第一音量阈值的音频流的个数小于等于预设混音路数,直接将音量大于所述第一音量阈值的音频流进行混音。
步骤205、中心控制设备确定音量小于或等于第一音量阈值的音频流。
步骤206、中心控制设备向音量小于或等于第一音量阈值的X个音频流对应的发送端中每个发送端发送包括暂停指示的RTCP报文。
所述暂停指示用于指示X个音频流对应的发送端暂停向中心控制设备发送音频流。
进一步的,所述包括暂停指示的RTCP报文还包括阈值,所述阈值用于表示所述第一音量阈值,使得接收到包括暂停指示的RTCP报文的X个音频流对应的发送端中任一发送端根据所述第一音量阈值实时监测音频流的音量,当所述任一发送端的音频流的音量大于所述第一音量阈值时向所述中心控制设备发送包括恢复请求的RTCP报文。
可选的,所述包括暂停指示的RTCP报文还包括剩余最大数,所述剩余最大数用于表示还可混音路数,所述还可混音路数为L-M,L表示预设混音路数,M表示已混音路数,当所述还可混音路数大于0时,使所述X个音频流对应的发送端中任一发送端向所述中心控制设备包括恢复请求RTCP报文;
可选的,所述包括暂停指示的RTCP报文还包括是否关联视频流,所述是否关联视频流用于指示所述X个音频流对应的发送端中任一发送端暂停向所述中心控制设备发送音频流的同时暂停向所述中心控制设备发送与所述音频流关联的视频流。
步骤207、发送端N接收中心控制设备发送的包括暂停指示的RTCP报文。
所述暂停指示用于指示发送端暂停向中心控制设备发送音频流。
步骤208、发送端N向中心控制设备发送包括暂停响应的RTCP报文。
所述暂停响应用于表示发送端已暂停向中心控制设备发送音频流。
步骤209、中心控制设备发送音频流。
中心控制设备再向所有发送端发送音频流,但是中心控制设备不接收发送端N的音频流。
这样一来,中心控制设备接收到N个发送端上报的音频流的音量后,根据N个音频流的音量确定第一音量阈值,并确定音量小于或等于所述第一音量阈值的音频流,向音量小于或等于第一音量阈值的X个音频流对应的发送端中每个发送端发送包括暂停指示的RTCP报文,指示每个发送端暂停向中心控制设备发送音频流。从而中心控制设备根据发送端上报的音频流的音量来确定混音的音频流,进一步向不需要混音的发送端发送暂停指示,使得不需要混音的发送端暂停向中心控制设备发送音频流,能够有效提高发送端与中心控制设备间的处理资源的利用率。
实施例3
本发明实施例提供一种调整媒体流传输的方法,假设有5个发送端参加会议,如图6所示,包括:
步骤301、中心控制设备接收RTP报文。
中心控制设备接收所有发送端发送的RTP报文。示例的,假设中心控制设备接收到发送端1发送的包括第一音频流和第一音频流的音量的RTP报文,还接收到发送端2发送的包括第二音频流和第二音频流的音量的RTP报文,还接收到发送端3发送的包括第三音频流和第三音频流的音量的RTP报文,还接收到发送端4发送的包括第四音频流和第四音频流的音量的RTP报文,还接收到发送端5发送的包括第五音频流和第五音频流的音量的RTP报文。
步骤302、中心控制设备确定第一音量阈值。
中心控制设备按照五路音频流的音量确定第一音量阈值,首先,中心控制设备按照五路音频流的音量从大到小排序。
假设第一音频流的音量为A,第二音频流的音量为B,第三音频流的音量为C,第四音频流的音量为D,第五音频流的音量为E,若E>C>D>B>A,再从按照五路音频流的音量从大到小排序的音频流中,将第五音频流、第三音频流和第四音频流这前三路音频流确定为中心控制设备需要进行混音的三路音频流。需要说明的是,通常,现有技术中最多混音三路,如果混合大于三路的音频流,人耳可能无法识别该多路混音。当然,也可以不限于三路。
进一步的,中心控制设备将第四路的音频流的音量与第二路的音频流的音量间的任一个音量确定为所述第一音量阈值。
步骤303、中心控制设备确定第二音频流的音量等于第一音量阈值,以及第一音频流的音量小于第一音量阈值。
需要说明的是,在一种实现方式中,若已混音路数小于预设混音路数,也可以将音频流的音量等于第一音量阈值的音频流进行混音。
步骤304、中心控制设备向发送端1和发送端2发送包括暂停指示的RTCP报文。
所述暂停指示用于指示发送端1和发送端2暂停向中心控制设备发送音频流。进一步的,包括暂停指示的RTCP报文还包括阈值,所述阈值用于表示所述第一音量阈值,使得接收到包括暂停指示的RTCP报文的X个音频流对应的发送端中任一发送端根据所述第一音量阈值实时监测音频流的音量,当所述任一发送端的音频流的音量大于所述第一音量阈值时向所述中心控制设备发送包括恢复请求的RTCP报文,X为大于等于1且小于N的自然数。
包括暂停指示的RTCP报文还包括剩余最大数和是否关联视频流,所述剩余最大数用于表示还可混音路数,所述还可混音路数为L-M,L表示预设混音路数,M表示已混音路数,当所述还可混音路数大于0时,使所述X个音频流对应的发送端中任一发送端向所述中心控制设备包括恢复请求RTCP报文,所述是否关联视频流用于指示所述X个音频流对应的发送端中任一发送端暂停向所述中心控制设备发送音频流的同时暂停向所述中心控制设备发送与所述音频流关联的视频流。
步骤305、发送端1接收中心控制设备发送的包括暂停指示的RTCP报文。
步骤306、发送端2接收中心控制设备发送的包括暂停指示的RTCP报文。
步骤307、发送端1向中心控制设备发送包括暂停响应的RTCP报文。
所述暂停响应用于表示发送端1已暂停向中心控制设备发送音频流。
步骤308、发送端2向中心控制设备发送包括暂停响应的RTCP报文。
所述暂停响应用于表示发送端2已暂停向中心控制设备发送音频流。
步骤309、中心控制设备发送混音。
其中,中心控制设备将发送端3的第三音频流、发送端4的第四音频流和发送端5的第五音频流的混音发送至发送端1。
中心控制设备将发送端3的第三音频流、发送端4的第四音频流和发送端5的第五音频流的混音发送至发送端2。
中心控制设备将发送端4的第四音频流和发送端5的第五音频流的混音发送至发送端3。
中心控制设备将发送端3的第三音频流和发送端5的第五音频流的混音发送至发送端4。
中心控制设备将发送端3的第三音频流和发送端4的第四音频流的混音发送至发送端5。
这样一来,中心控制设备接收到5个发送端上报的音频流的音量后,根据音频流的音量确定第一音量阈值,并取得发送端1的音量小于第一音量阈值,以及发送端2的音量等于第一音量阈值,向发送端1和发送端2发送包括暂停指示的RTCP报文。从而中心控制设备根据发送端上报的音频流的音量来确定混音的音频流,进一步向不需要混音的发送端发送暂停指示,使得不需要混音的发送端暂停向中心控制设备发送音频流,能够有效提高发送端与中心控制设备间的处理资源的利用率。
上述图6所示的方法步骤具体的可以由图4所示的计算机设备实现。示例的,步骤301所述的接收RTP报文,和步骤304所述的发送RTCP报文,以及步骤305所述的接收RTCP报文,步骤309所述的发送音频流,以及其他发送接收的方法步骤都可以由通信接口104来实现。步骤302所述的确定第一音量阈值,步骤303所述的确定音量小于或等于第一音量阈值的音频流等处理报文的方法步骤可以由处理器101来实现。
如图7所示,在发送端被暂停向中心控制设备发送音频流之后,还可以按照下面的方法来恢复音频流的发送。例如,可以在步骤309之后,暂停发送音频流的发送端可以继续监测音频流的音量,判断音量是否大于音量阈值,从而向中心控制设备发送音频流,具体的还包括以下步骤。
步骤310、发送端1保存第一音量阈值。
步骤311、发送端2保存第一音量阈值。
步骤312、发送端1监测该发送端的音频流的音量。
步骤313、发送端1判断监测到的该发送端更新的音频流的音量大于第一音量阈值。
步骤314、发送端1向中心控制设备发送包括恢复请求的RTCP报文。
所述恢复请求包括发送端1更新的音频流的音量,所述恢复请求用于发送端1请求中心控制设备指示发送端1向中心控制设备发送音频流。
进一步的,包括恢复请求的RTCP报文还包括视频流的关联情况,发送端请求中心控制设备指示该发送端向中心控制设备发送音频流的同时,还请求中心控制设备指示该发送端向中心控制设备发送视频流。与音频流关联的视频流都需要保持同音频流相关的操作。
步骤315、中心控制设备接收发送端1发送的包括恢复请求的RTCP报文。
步骤316、中心控制设备判断发送端1更新的音频流的音量是否大于已混音路数中任一路音频流的音量。
判断发送端1再次上报的更新的音频流的音量是否大于第五音频流的音量、第四音频流的音量和第三音频流的音量。
若发送端1的音频流的音量大于已混音路数中任一路音频流的音量,执行步骤317至步骤322;若发送端1更新的音频流的音量小于或等于所有已混音的音频流的音量,中心控制设备继续按照已混音的音频流发送给发送端,执行步骤309。
假设发送端1再次上报更新的音频流的音量大于第三音频流的音量。
步骤317、中心控制设备根据发送端1更新的音频流的音量和其他发送端的音频流的音量确定第二音量阈值。
具体的确定方法可以参考步骤204中的具体描述。
步骤318、中心控制设备确定第二音频流的音量等于第二音量阈值,以及第三音频流的 音量小于第二音量阈值。
步骤319、中心控制设备向发送端3发送包括暂停指示的RTCP报文。
所述中心控制设备向所述音量小于或等于所述第二音量阈值的Y个音频流对应的发送端中每个发送端发送包括暂停指示的RTCP报文。
步骤320、发送端3向中心控制设备发送包括暂停响应的RTCP报文。
需要说明的是,包括暂停响应的RTCP报文还包括第二音量阈值,以便于发送端3保存该第二音量阈值,监测发送端3的音频流的音量,判断监测到的该发送端3更新的音频流的音量大于所述第二音量阈值,向所述中心控制设备发送包括恢复请求的RTCP报文。
步骤321、中心控制设备向发送端1发送包括恢复响应的RTCP报文。
所述恢复响应用于中心控制设备指示该发送端1向中心控制设备发送音频流。
步骤322、中心控制设备发送混音。
其中,中心控制设备接收发送端1发送的更新的音频流后,中心控制设备将发送端4再次上报的音频流和发送端5再次上报的音频流发送至发送端1。
中心控制设备将发送端1更新的音频流、发送端4再次上报的音频流和发送端5再次上报的音频流发送至发送端2。
中心控制设备将发送端1更新的音频流、发送端4再次上报的音频流和发送端5再次上报的音频流发送至发送端3。
中心控制设备将发送端1更新的音频流和发送端5再次上报的音频流混音后发送至发送端4。
中心控制设备发送端1更新的音频流和发送端4再次上报的音频流混音后发送至发送端5。
这样一来,接收到暂停指示的发送端,暂停向中心控制设备发送音频流,并保存音量阈值,通过继续实时监测该发送端的音频流的音量,当音频流的音量大于音量阈值时,向中心控制设备发送恢复请求,以便于向中心控制设备发送更新的音频流,使得发送端能够接收到清楚的混音。
上述图7所示的方法步骤具体的可以由图4所示的计算机设备实现。示例的,步骤310和步骤311所述的保存第一音量阈值由存储器103来实现;步骤314所述的发送RTCP报文和步骤322所述的发送音频流,以及其他发送接收的方法步骤都可以由通信接口104来实现。步骤312所述的监测该发送端的音频流的音量,步骤313所述的判断监测到的该发送端更新的音频流的音量大于第一音量阈值,步骤317所述的确定第二音量阈值等处理报文的方法步骤,可以由处理器101来实现。
如图8所示,在发送端被暂停向中心控制设备发送音频流之后,还可以按照下面的方法来恢复音频流的发送,中心控制设备通过判断混音路数是否还可以再混音,从而向暂停发送音频流的发送端发送恢复响应,具体的还包括以下步骤。
步骤323、中心控制设备判断已混音路数是否小于中心控制设备预设混音路数。
可选的,中心控制设备可能接收到中心控制设备允许发送音频流的发送端发送的离会消息,中心控制设备判断已混音路数小于中心控制设备预设混音路数。
可选的,由于音频流是实时传输,每时每刻的音频流的音量大小可能不同,中心控制设备可能接收到发送端发送的音频流的音量小于音量阈值,则不允许该发送端发送音频流, 中心控制设备判断已混音路数小于中心控制设备预设混音路数。
当所述已混音路数小于所述预设混音路数,执行步骤324或步骤326。
需要说明的是,当发送端1的音频流的音量大于发送端2的音频流的音量,且发送端1的音频流的音量大于第一音量阈值,执行步骤324。同理,当发送端2的音频流的音量大于发送端1的音频流的音量,且发送端2的音频流的音量大于第一音量阈值,执行步骤326。
当所述已混音路数等于所述预设混音路数,执行步骤309,继续发送混音。
步骤324、中心控制设备向发送端1发送包括恢复响应的RTCP报文。
所述恢复响应用于中心控制设备指示该发送端向中心控制设备发送音频流。
步骤325、中心控制设备发送混音。
其中,中心控制设备接收发送端1发送的更新的音频流后,中心控制设备将发送端4再次上报的音频流和发送端5再次上报的音频流发送至发送端1。
中心控制设备将发送端1更新的音频流、发送端4再次上报的音频流和发送端5再次上报的音频流发送至发送端2。
中心控制设备将发送端1更新的音频流、发送端4再次上报的音频流和发送端5再次上报的音频流发送至发送端3。
中心控制设备将发送端1更新的音频流和发送端5再次上报的音频流混音后发送至发送端4。
中心控制设备发送端1更新的音频流和发送端4再次上报的音频流混音后发送至发送端5。
步骤326、中心控制设备向发送端2发送包括恢复响应的RTCP报文。
步骤327、中心控制设备向发送端2、发送端4和发送端5发送混音。
其中,中心控制设备接收发送端2发送的更新的音频流后,中心控制设备将发送端2更新的音频流、发送端4再次上报的音频流和发送端5再次上报的音频流发送至发送端1。
中心控制设备将发送端4再次上报的音频流和发送端5再次上报的音频流发送至发送端2。
中心控制设备将发送端2更新的音频流、发送端4再次上报的音频流和发送端5再次上报的音频流发送至发送端3。
中心控制设备将发送端2更新的音频流和发送端5再次上报的音频流混音后发送至发送端4。
中心控制设备发送端2更新的音频流和发送端4再次上报的音频流混音后发送至发送端5。
需要说明的是,在步骤316之前,可以先执行步骤323,中心控制设备判断已混音路数是否小于中心控制设备预设混音路数,当已混音路数等于预设混音路数,中心控制设备再判断所述发送端的音频流的音量是否大于已混音路数中任一路音频流的音量。
这样一来,接收到暂停指示的发送端,暂停向中心控制设备发送音频流后,若中心控制设备判断已混音路数小于中心控制设备预设混音路数,主动向暂停向中心控制设备发送音频流的发送端发送包括恢复响应的RTCP报文,来指示该发送端向中心控制设备发送音频流,提高发送端与中心控制设备间的处理资源的利用率。
上述图8所示的方法步骤具体的可以由图4所示的计算机设备实现。示例的,步骤324 所述的发送RTCP报文和步骤325所述的发送音频流,以及其他发送接收的方法步骤都可以由通信接口104来实现。步骤323所述的判断已混音路数是否小于中心控制设备预设混音路数等处理报文的方法步骤,可以由处理器101来实现。
如图9所示,在发送端被暂停向中心控制设备发送音频流之后,还可以按照下面的方法来发送更新消息。例如,可以在步骤309之后,所述方法还包括以下步骤。
步骤328、中心控制设备向发送端1发送包括更新消息的RTCP报文。
所述更新消息包括第三音量阈值和还可混音路数,所述还可混音路数为预设混音路数与已混音路数之差。
步骤329、中心控制设备向发送端2发送包括更新消息的RTCP报文。
步骤330、发送端1接收中心控制设备发送的包括更新消息的RTCP报文。
步骤331、发送端2接收中心控制设备发送的包括更新消息的RTCP报文。
步骤332、发送端1保存第三音量阈值。
步骤333、发送端1监测该发送端的音频流的音量。
当监测到的该发送端1更新的音频流的音量大于第三音量阈值时,或,当还可混音路数大于0时,执行步骤334。
步骤334、发送端1向中心控制设备发送包括恢复请求的RTCP报文。
所述恢复请求包括所述发送端更新的音频流的音量,所述恢复请求用于发送端请求中心控制设备指示该发送端向中心控制设备发送音频流。
步骤335、中心控制设备接收发送端1发送的包括恢复请求的RTCP报文。
中心控制设备接收发送端1发送的包括恢复请求的RTCP报文之后的详细步骤可以参考步骤316及以后的步骤所述,本发明在此不再赘述。
同理,发送端2也可以执行步骤332至步骤334。
这样一来,接收到暂停指示的发送端,暂停向中心控制设备发送音频流后,中心控制设备还可以主动向暂停向中心控制设备发送音频流的发送端发送包括更新消息的RTCP报文,该更新消息的RTCP报文包括第三音量阈值和还可混音路数,使得发送端通过判断该发送端更新的音频流的音量大于第三音量阈值时,或,当还可混音路数大于0时,再向中心控制设备发送音频流,从而提高发送端与中心控制设备间的处理资源的利用率。
上述图9所示的方法步骤具体的可以由图4所示的计算机设备实现。示例的,步骤332所述的保存第三音量阈值由存储器103来实现;步骤328所述的发送RTCP报文和步骤330所述的接收RTCP报文,以及其他发送接收的方法步骤都可以由通信接口104来实现。步骤333所述的监测该发送端的音频流的音量等处理报文的方法步骤,可以由处理器101来实现。
上述方法,中心控制设备向发送端及时发送更新消息,使得发送端能够恢复发送音频流。实际应用中,可以周期性的发送更新消息,周期可以根据实际情况自行设定,这里不做限定。
如图10所示,本发明提供一种RTCP报文结构示意图,包括:
目标同步信源标识符(Target SSRC),占32位,该标识符是随机选择的,参加同一视频会议的两个同步信源不能有相同的SSRC。
类型(Type)用于表示RTCP报文为暂停、恢复、更新或响应中那种类型的报文。该包括暂停指示的RTCP报文的类型表示暂停。
版本(Res)用于表示协议的版本。
参数长度(Parameter Len)用于表示RTCP报文的长度。
当RTCP报文为包括暂停指示的RTCP报文,该RTCP报文还包括暂停标识(Pause ID)用于表示暂停发送音频流的发送端的标识。
该包括暂停指示的RTCP报文还包括阈值类型(Type Threshold)用于表示动态阈值或静态阈值。阈值(Threshold Value)用于表示音量阈值。
可选的,该包括暂停指示的RTCP报文还包括剩余最大数(Remaining Mix Num)用于表示还可混音路数,表示中心控制设备能支持的最大混音路数减去实际已经混音路数。
可选的,该包括暂停指示的RTCP报文还包括是否关联视频流(Is Related Video)用于表示中心控制设备是否需要对音频流关联的视频流做同音频流相同的暂停或恢复操作,可以用是或否表示。是就表示需要对音频流关联的视频流做同音频流相同的暂停或恢复操作;否就表示不需要对音频流关联的视频流做同音频流相同的暂停或恢复操作。
需要说明的是,暂停标识、阈值类型、阈值、剩余最大数和是否关联视频流可以在RTCP报文的保留位设置。
进一步,当RTCP报文为包括恢复请求的RTCP报文,该包括恢复请求的RTCP报文还包括音频流的音量(Audio Level)用于表示音频流的音量值。
实施例4
本发明实施例提供一种中心控制设备30,如图11所示,包括:
接收单元301,用于获取N个发送端发送的实时传输协议RTP报文,每个所述RTP报文包括音频流的音量,N为大于等于2的自然数;
处理单元302,用于根据N个音频流的音量确定第一音量阈值;
所述处理单元302,还用于确定音量小于或等于所述第一音量阈值的音频流;
发送单元303,用于向所述音量小于或等于所述第一音量阈值的X个音频流对应的发送端中每个发送端发送包括暂停指示的实时传输控制协议RTCP报文,X为大于等于1且小于N的自然数,所述暂停指示用于指示所述X个音频流对应的发送端暂停向中心控制设备发送音频流。
这样一来,中心控制设备接收到N个发送端上报的音频流的音量后,根据N个音频流的音量确定第一音量阈值,并确定音量小于或等于所述第一音量阈值的音频流,向音量小于或等于第一音量阈值的X个音频流对应的发送端中每个发送端发送包括暂停指示的RTCP报文,指示每个发送端暂停向中心控制设备发送音频流。从而中心控制设备根据发送端上报的音频流的音量来确定混音的音频流,进一步向不需要混音的发送端发送暂停指示,使得不需要混音的发送端暂停向中心控制设备发送音频流,能够有效提高发送端与中心控制设备间的处理资源的利用率。
在本实施例中,中心控制设备30是以功能单元的形式来呈现。这里的“单元”可以指特定应用集成电路(英文全称:application-specific integrated circuit,英文简称:ASIC),电路,执行一个或多个软件或固件程序的处理器和存储器,集成逻辑电路,和/或其他可以提供上述功能的器件。在一个简单的实施例中,本领域的技术人员可以想到中心控制设备30可以采用图11所示的形式。接收单元301,处理单元302和发送单元303可以通过图4的计算机设备来实现,具体的,接收单元301,和发送单元303可以由通信接口104实现, 处理单元302可以由处理器101实现。
实施例5
本发明实施例提供一种发送端40,如图12所示,包括:
发送单元401,用于向中心控制设备发送实时传输协议RTP报文,所述RTP报文包括音频流的音量;
接收单元402,用于接收所述中心控制设备发送的包括暂停指示的实时传输控制协议RTCP报文,所述暂停指示用于指示发送端暂停向中心控制设备发送音频流;
所述发送单元401,还用于向所述中心控制设备发送包括暂停响应的RTCP报文,所述暂停响应用于表示发送端已暂停向中心控制设备发送音频流。
这样一来,发送端向中心控制设备发送音频流和音频流的音量,中心控制设备接收到多个发送端上报的音频流的音量后,根据音频流的音量确定混音路数以及音量阈值,当音频流的音量小于等于音量阈值,向该音频流所对应的发送端发送RTCP报文,该RTCP报文包括暂停指示,指示发送端暂停向中心控制设备发送音频流,发送端接收到中心控制设备发送的暂停指示,后暂停向中心控制设备发送音频流。从而使得不需要混音的发送端暂停向中心控制设备发送音频流,能够有效提高发送端与中心控制设备间的处理资源的利用率。
所述包括暂停指示的RTCP报文还包括第一音量阈值,如图12所示,所述发送端40还包括:
存储单元403,用于保存所述第一音量阈值;
监测单元404,用于监测该发送端的音频流的音量;
处理单元405,用于判断监测到的该发送端更新的音频流的音量大于所述第一音量阈值;
所述发送单元401,还用于向所述中心控制设备发送包括恢复请求的RTCP报文,所述恢复请求包括所述发送端更新的音频流的音量,所述恢复请求用于请求中心控制设备指示发送端向中心控制设备发送音频流。
所述接收单元402,还用于接收所述中心控制设备发送的包括更新消息的RTCP报文,所述更新消息包括第二音量阈值和还可混音路数;
所述存储单元403,还用于保存所述第二音量阈值;
所述监测单元404,还用于监测该发送端的音频流的音量;
所述处理单元405,还用于判断监测到的该发送端更新的音频流的音量大于所述第二音量阈值。
在本实施例中,发送端40是以功能单元的形式来呈现。这里的“单元”可以指特定应用集成电路(英文全称:application-specific integrated circuit,英文简称:ASIC),电路,执行一个或多个软件或固件程序的处理器和存储器,集成逻辑电路,和/或其他可以提供上述功能的器件。在一个简单的实施例中,本领域的技术人员可以想到发送端40可以采用图12所示的形式。发送单元401、接收单元402、存储单元403、监测单元404和处理单元405可以通过图4的计算机设备来实现,具体的,接收单元402,和发送单元401可以由通信接口104实现,处理单元302和监测单元404可以由处理器101实现。
本发明实施例还提供了一种计算机存储介质,用于储存为上述图11所示的中心控制设备所用的计算机软件指令,其包含用于执行上述方法实施例所设计的程序。通过执行存储的程序,可以实现控制音频流的暂停。
本发明实施例还提供了一种计算机存储介质,用于储存为上述图12所示的发送端所用的计算机软件指令,其包含用于执行上述方法实施例所设计的程序。通过执行存储的程序,可以实现控制音频流的暂停。
所属领域的技术人员可以清楚地了解到,为描述的方便和简洁,上述描述的装置和单元的具体工作过程,可以参考前述方法实施例中的对应过程,在此不再赘述。
另外,在本发明各个实施例中的各功能单元可以集成在一个处理单元中,也可以是各个单元单独物理包括,也可以两个或两个以上单元集成在一个单元中。上述集成的单元既可以采用硬件的形式实现,也可以采用硬件加软件功能单元的形式实现。
本领域普通技术人员可以理解:实现上述方法实施例的全部或部分步骤可以通过程序指令相关的硬件来完成,前述的程序可以存储于一计算机可读取存储介质中,该程序在执行时,执行包括上述方法实施例的步骤;而前述的存储介质包括:只读存储器(Read-Only Memory,ROM)、随机存取存储器(Random-Access Memory,RAM)、磁碟或者光盘等各种可以存储程序代码的介质。
以上所述,仅为本发明的具体实施方式,但本发明的保护范围并不局限于此,任何熟悉本技术领域的技术人员在本发明揭露的技术范围内,可轻易想到变化或替换,都应涵盖在本发明的保护范围之内。因此,本发明的保护范围应以所述权利要求的保护范围为准。

Claims (22)

  1. 一种调整媒体流传输的方法,其特征在于,包括:
    中心控制设备获取N个发送端发送的实时传输协议RTP报文,每个所述RTP报文包括音频流的音量,N为大于等于2的自然数;
    所述中心控制设备根据N个音频流的音量确定第一音量阈值;
    所述中心控制设备确定音量小于或等于所述第一音量阈值的音频流;
    所述中心控制设备向所述音量小于或等于所述第一音量阈值的X个音频流对应的发送端中每个发送端发送包括暂停指示的实时传输控制协议RTCP报文,X为大于等于1且小于N的自然数,所述暂停指示用于指示所述X个音频流对应的发送端暂停向中心控制设备发送音频流。
  2. 根据权利要求1所述的方法,其特征在于,所述中心控制设备根据N个音频流的音量确定第一音量阈值包括:
    所述中心控制设备按照N路音频流的音量从大到小排序;
    所述中心控制设备从按照N路音频流的音量从大到小排序的音频流中,将从第一路至第M路的前M路确定为M路音频流,M为大于等于1且小于N的自然数,M表示已混音路数,且M小于或等于预设混音路数L,所述M路音频流为已混音的音频流;
    所述中心控制设备将第M路的音频流的音量与第M+1路的音频流的音量间的音量确定为所述第一音量阈值。
  3. 根据权利要求1所述的方法,其特征在于,所述中心控制设备根据N个音频流的音量确定第一音量阈值包括:
    所述中心控制设备根据N个音频流的音量的平均值确定所述第一音量阈值。
  4. 根据权利要求1-3任一项权利要求所述的方法,其特征在于,所述包括暂停指示的RTCP报文还包括阈值,所述阈值用于表示所述第一音量阈值,使得接收到包括暂停指示的RTCP报文的X个音频流对应的发送端中任一发送端根据所述第一音量阈值实时监测音频流的音量,当所述任一发送端的音频流的音量大于所述第一音量阈值时向所述中心控制设备发送包括恢复请求的RTCP报文。
  5. 根据权利要求1-4任一项权利要求所述的方法,其特征在于,所述包括暂停指示的RTCP报文还包括剩余最大数,所述剩余最大数用于表示还可混音路数,所述还可混音路数为L-M,L表示预设混音路数,M表示已混音路数,当所述还可混音路数大于0时,使所述X个音频流对应的发送端中任一发送端向所述中心控制设备包括恢复请求RTCP报文。
  6. 根据权利要求1-5任一项权利要求所述的方法,其特征在于,所述包括暂停指示的RTCP报文还包括是否关联视频流,所述是否关联视频流用于指示所述X个音频流对应的发送端中任一发送端暂停向所述中心控制设备发送音频流的同时暂停向所述中心控制设备发送与所述音频流关联的视频流。
  7. 根据权利要求2-6任一项权利要求所述的方法,其特征在于,在所述中心控制设备向所述音量小于或等于所述第一音量阈值的X个音频流对应的发送端中每个发送端发送包括暂停指示的RTCP报文之后,所述方法还包括:
    所述中心控制设备接收第一发送端发送的包括恢复请求的RTCP报文,所述恢复请求包括所述第一发送端更新的音频流的音量,所述恢复请求用于请求中心控制设备指示第一发送端 向中心控制设备发送音频流,所述第一发送端为X个音频流对应的发送端中的任一发送端;
    所述中心控制设备判断所述第一发送端更新的音频流的音量是否大于已混音路数中任一路音频流的音量;
    若所述第一发送端更新的音频流的音量大于已混音路数中任一路音频流的音量,所述中心控制设备根据所述第一发送端更新的音频流的音量和N-1发送端的音频流的音量确定第二音量阈值;
    所述中心控制设备确定音量小于或等于所述第二音量阈值的音频流;
    所述中心控制设备向所述音量小于或等于所述第二音量阈值的Y个音频流对应的发送端中每个发送端发送包括暂停指示的RTCP报文,Y为大于等于1且小于N的自然数。
  8. 一种调整媒体流传输的方法,其特征在于,包括:
    发送端向中心控制设备发送实时传输协议RTP报文,所述RTP报文包括音频流的音量;
    所述发送端接收所述中心控制设备发送的包括暂停指示的实时传输控制协议RTCP报文,所述暂停指示用于指示发送端暂停向中心控制设备发送音频流;
    所述发送端向所述中心控制设备发送包括暂停响应的RTCP报文,所述暂停响应用于表示发送端已暂停向中心控制设备发送音频流。
  9. 根据权利要求8所述的方法,其特征在于,所述包括暂停指示的RTCP报文还包括第一音量阈值,所述方法还包括:
    所述发送端保存所述第一音量阈值;
    所述发送端监测该发送端的音频流的音量;
    所述发送端判断监测到的该发送端更新的音频流的音量大于所述第一音量阈值;
    所述发送端向所述中心控制设备发送包括恢复请求的RTCP报文,所述恢复请求包括所述发送端更新的音频流的音量,所述恢复请求用于请求中心控制设备指示发送端向中心控制设备发送音频流。
  10. 根据权利要求8或9所述的方法,其特征在于,所述包括暂停指示的RTCP报文还包括还可混音路数,包括恢复响应的RTCP报文还包括还可混音路数,所述还可混音路数为预设混音路数与已混音路数之差,所述方法还包括:
    所述发送端判断所述还可混音路数大于0;
    所述发送端向所述中心控制设备发送包括恢复请求的RTCP报文。
  11. 根据权利要求8-10任一项权利要求所述的方法,其特征在于,所述包括暂停指示的RTCP报文还包括是否关联视频流,所述是否关联视频流用于指示发送端暂停向所述中心控制设备发送音频流的同时暂停向所述中心控制设备发送与所述音频流关联的视频流。
  12. 一种中心控制设备,其特征在于,包括:
    接收单元,用于获取N个发送端发送的实时传输协议RTP报文,每个所述RTP报文包括音频流的音量,N为大于等于2的自然数;
    处理单元,用于根据N个音频流的音量确定第一音量阈值;
    所述处理单元,还用于确定音量小于或等于所述第一音量阈值的音频流;
    发送单元,用于向所述音量小于或等于所述第一音量阈值的X个音频流对应的发送端中每个发送端发送包括暂停指示的实时传输控制协议RTCP报文,X为大于等于1且小于N的自然数,所述暂停指示用于指示所述X个音频流对应的发送端暂停向中心控制设备发送音频流。
  13. 根据权利要求12所述的中心控制设备,其特征在于,所述处理单元,具体用于:
    按照N路音频流的音量从大到小排序;
    从按照N路音频流的音量从大到小排序的音频流中,将从第一路至第M路的前M路确定为M路音频流,M为大于等于1且小于N的自然数,M表示已混音路数,且M小于或等于预设混音路数L,所述M路音频流为已混音的音频流;
    将第M路的音频流的音量与第M+1路的音频流的音量间的音量确定为所述第一音量阈值。
  14. 根据权利要求12所述的中心控制设备,其特征在于,所述处理单元,具体用于:
    根据N个音频流的音量的平均值确定所述第一音量阈值。
  15. 根据权利要求12-14任一项权利要求所述的中心控制设备,其特征在于,所述包括暂停指示的RTCP报文还包括阈值,所述阈值用于表示所述第一音量阈值,使得接收到包括暂停指示的RTCP报文的X个音频流对应的发送端中任一发送端根据所述第一音量阈值实时监测音频流的音量,当所述任一发送端的音频流的音量大于所述第一音量阈值时向所述中心控制设备发送包括恢复请求的RTCP报文。
  16. 根据权利要求12-15任一项权利要求所述的中心控制设备,其特征在于,所述包括暂停指示的RTCP报文还包括剩余最大数,所述剩余最大数用于表示还可混音路数,所述还可混音路数为L-M,L表示预设混音路数,M表示已混音路数,当所述还可混音路数大于0时,使所述X个音频流对应的发送端中任一发送端向所述中心控制设备包括恢复请求RTCP报文。
  17. 根据权利要求12-16任一项权利要求所述的中心控制设备,其特征在于,所述包括暂停指示的RTCP报文还包括是否关联视频流,所述是否关联视频流用于指示所述X个音频流对应的发送端中任一发送端暂停向所述中心控制设备发送音频流的同时暂停向所述中心控制设备发送与所述音频流关联的视频流。
  18. 根据权利要求13-17任一项权利要求所述的中心控制设备,其特征在于,
    所述接收单元,还用于接收第一发送端发送的包括恢复请求的RTCP报文,所述恢复请求包括所述第一发送端更新的音频流的音量,所述恢复请求用于请求中心控制设备指示第一发送端向中心控制设备发送音频流,所述第一发送端为音频流对应的发送端中的任一发送端;
    所述处理单元,还用于判断所述第一发送端更新的音频流的音量是否大于已混音路数中任一路音频流的音量;
    若所述第一发送端更新的音频流的音量大于已混音路数中任一路音频流的音量,所述处理单元,还用于根据所述第一发送端更新的音频流的音量和N-1发送端的音频流的音量确定第二音量阈值;
    所述处理单元,还用于确定音量小于或等于所述第二音量阈值的音频流;
    所述发送单元,还用于向所述音量小于或等于所述第二音量阈值的Y个音频流对应的发送端中每个发送端发送包括暂停指示的RTCP报文,Y为大于等于1且小于N的自然数。
  19. 一种发送端,其特征在于,包括:
    发送单元,用于向中心控制设备发送实时传输协议RTP报文,所述RTP报文包括音频流的音量;
    接收单元,用于接收所述中心控制设备发送的包括暂停指示的实时传输控制协议RTCP报文,所述暂停指示用于指示发送端暂停向中心控制设备发送音频流;
    所述发送单元,还用于向所述中心控制设备发送包括暂停响应的RTCP报文,所述暂停响应用于表示发送端已暂停向中心控制设备发送音频流。
  20. 根据权利要求19所述的发送端,其特征在于,所述包括暂停指示的RTCP报文还包 括第一音量阈值,所述发送端还包括:
    存储单元,用于保存所述第一音量阈值;
    监测单元,用于监测该发送端的音频流的音量;
    处理单元,用于判断监测到的该发送端更新的音频流的音量大于所述第一音量阈值;
    所述发送单元,还用于向所述中心控制设备发送包括恢复请求的RTCP报文,所述恢复请求包括所述发送端更新的音频流的音量,所述恢复请求用于请求中心控制设备指示发送端向中心控制设备发送音频流。
  21. 根据权利要求19或20所述的发送端,其特征在于,所述包括暂停指示的RTCP报文还包括还可混音路数,包括恢复响应的RTCP报文还包括还可混音路数,所述还可混音路数为预设混音路数与已混音路数之差,
    所述处理单元,还用于判断所述还可混音路数大于0;
    所述发送单元,还用于向所述中心控制设备发送包括恢复请求的RTCP报文。
  22. 根据权利要求19-21任一项权利要求所述的发送端,其特征在于,所述包括暂停指示的RTCP报文还包括是否关联视频流,所述是否关联视频流用于指示发送端暂停向所述中心控制设备发送音频流的同时暂停向所述中心控制设备发送与所述音频流关联的视频流。
PCT/CN2017/070652 2016-01-13 2017-01-09 一种调整媒体流传输的方法及装置 WO2017121299A1 (zh)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201610022357.9 2016-01-13
CN201610022357.9A CN106973253B (zh) 2016-01-13 2016-01-13 一种调整媒体流传输的方法及装置

Publications (1)

Publication Number Publication Date
WO2017121299A1 true WO2017121299A1 (zh) 2017-07-20

Family

ID=59310789

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2017/070652 WO2017121299A1 (zh) 2016-01-13 2017-01-09 一种调整媒体流传输的方法及装置

Country Status (2)

Country Link
CN (1) CN106973253B (zh)
WO (1) WO2017121299A1 (zh)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112188144A (zh) * 2020-09-14 2021-01-05 浙江华创视讯科技有限公司 音频的发送方法及装置、存储介质和电子装置

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107967921B (zh) * 2017-12-04 2021-09-07 苏州科达科技股份有限公司 会议系统的音量调节方法及装置
CN110086978A (zh) * 2018-01-25 2019-08-02 浙江宇视科技有限公司 多路音频传输方法、装置及终端设备
CN109901811B (zh) * 2019-02-26 2022-09-06 北京华夏电通科技股份有限公司 应用于数字化庭审中的混音方法及装置
CN110995946B (zh) * 2019-12-25 2021-08-20 苏州科达科技股份有限公司 混音方法、装置、设备、系统及可读存储介质
CN111541860B (zh) * 2019-12-30 2021-07-27 宁波菊风系统软件有限公司 一种实时音频传输系统及其使用方法
CN114448588B (zh) * 2022-01-14 2024-01-23 杭州网易智企科技有限公司 音频传输方法、装置、电子设备及计算机可读存储介质

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20020071027A1 (en) * 2000-12-08 2002-06-13 Nec Corporation Multipoint video-meeting control system
CN101179693A (zh) * 2007-09-26 2008-05-14 深圳市丽视视讯科技有限公司 一种会议电视系统的混音处理方法
CN101309390A (zh) * 2007-05-17 2008-11-19 华为技术有限公司 视讯通信系统、装置及其字幕显示方法
CN101489091A (zh) * 2009-01-23 2009-07-22 深圳华为通信技术有限公司 一种语音信号传输处理方法及装置
CN102118523A (zh) * 2009-12-30 2011-07-06 北京大唐高鸿数据网络技术有限公司 一种用于集中式电话会议的混音控制方法
CN103220258A (zh) * 2012-01-20 2013-07-24 华为技术有限公司 会议混音方法、终端和媒体资源服务器

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103354557A (zh) * 2009-08-31 2013-10-16 华为终端有限公司 媒体控制服务器级联系统及控制多媒体码流的方法和装置
US20140098715A1 (en) * 2012-10-09 2014-04-10 Tv Ears, Inc. System for streaming audio to a mobile device using voice over internet protocol

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20020071027A1 (en) * 2000-12-08 2002-06-13 Nec Corporation Multipoint video-meeting control system
CN101309390A (zh) * 2007-05-17 2008-11-19 华为技术有限公司 视讯通信系统、装置及其字幕显示方法
CN101179693A (zh) * 2007-09-26 2008-05-14 深圳市丽视视讯科技有限公司 一种会议电视系统的混音处理方法
CN101489091A (zh) * 2009-01-23 2009-07-22 深圳华为通信技术有限公司 一种语音信号传输处理方法及装置
CN102118523A (zh) * 2009-12-30 2011-07-06 北京大唐高鸿数据网络技术有限公司 一种用于集中式电话会议的混音控制方法
CN103220258A (zh) * 2012-01-20 2013-07-24 华为技术有限公司 会议混音方法、终端和媒体资源服务器

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112188144A (zh) * 2020-09-14 2021-01-05 浙江华创视讯科技有限公司 音频的发送方法及装置、存储介质和电子装置

Also Published As

Publication number Publication date
CN106973253A (zh) 2017-07-21
CN106973253B (zh) 2020-04-14

Similar Documents

Publication Publication Date Title
WO2017121299A1 (zh) 一种调整媒体流传输的方法及装置
US10356667B2 (en) User equipment handover method, and base station
US11153533B2 (en) System and method for scalable media switching conferencing
RU2597490C2 (ru) Способы связи в реальном времени, обеспечивающие паузу и возобновление, и связанные с ними устройства
WO2022095795A1 (zh) 通信方法、装置、计算机可读介质及电子设备
WO2020001572A1 (zh) 通信方法及装置
JP5143837B2 (ja) サービスセンタ、ユーザ装置、方法及びコンピュータ読取可能な媒体
US20070168523A1 (en) Multicast-unicast adapter
KR20170134464A (ko) 컨퍼런스 오디오 관리
JP4738058B2 (ja) リアルタイムマルチメディア情報の効率的なルーティング
WO2022068529A1 (zh) Mbs业务的切换方法及相关装置
WO2021148012A1 (zh) 报告信息的发送方法和通信装置以及通信系统
US9270937B2 (en) Real time stream provisioning infrastructure
US9560096B2 (en) Local media rendering
US11606537B2 (en) System and method for scalable media switching conferencing
WO2016082577A1 (zh) 视频会议的处理方法及装置
US9912623B2 (en) Systems and methods for adaptive context-aware control of multimedia communication sessions
KR20120067457A (ko) 유니캐스트 기반의 네트워크에서 그룹 메시지 전송 방법 및 장치
WO2017000636A1 (zh) 媒体会话处理方法方法和相关设备及通信系统
WO2016045496A1 (zh) 一种媒体控制方法和设备
CN113490155B (zh) 多播广播业务的通信方法、装置、介质及电子设备
US20220029843A1 (en) System for reducing bandwidth usage on a multicast computer network
US11026056B2 (en) MBMS bearer handling
Chou et al. Seamless streaming media for heterogeneous mobile networks
US20220006563A1 (en) Method and apparatus for efficient delivery of source and forward error correction streams in systems supporting mixed unicast multicast transmission

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 17738113

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 17738113

Country of ref document: EP

Kind code of ref document: A1