CN111083428A - Audio and video data processing method and device, computer equipment and storage medium - Google Patents

Info

Publication number
CN111083428A
Authority
CN
China
Prior art keywords
audio
video data
video
controller
input channel
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201911382904.4A
Other languages
Chinese (zh)
Inventor
冯卫林
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Feixun Digital Technology Co ltd
Kyland Technology Co Ltd
Original Assignee
Beijing Feixun Digital Technology Co ltd
Kyland Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Feixun Digital Technology Co ltd, Kyland Technology Co Ltd
Priority to CN201911382904.4A
Publication of CN111083428A

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00 Television systems
    • H04N7/14 Systems for two-way working
    • H04N7/15 Conference systems
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20 Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23 Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/233 Processing of audio elementary streams
    • H04N21/2335 Processing of audio elementary streams involving reformatting operations of audio signals, e.g. by converting from one coding standard to another
    • H04N21/234 Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs
    • H04N21/23418 Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving operations for analysing video streams, e.g. detecting features or characteristics
    • H04N21/2343 Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43 Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/439 Processing of audio elementary streams
    • H04N21/4394 Processing of audio elementary streams involving operations for analysing the audio stream, e.g. detecting features or characteristics in audio streams
    • H04N21/4398 Processing of audio elementary streams involving reformatting operations of audio signals
    • H04N21/44 Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs
    • H04N21/44008 Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs involving operations for analysing video streams, e.g. detecting features or characteristics in the video stream
    • H04N21/4402 Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs involving reformatting operations of video signals for household redistribution, storage or real-time display

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Telephonic Communication Services (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)

Abstract

The embodiment of the invention discloses an audio and video data processing method and device, computer equipment and a storage medium. The method comprises the following steps: establishing and starting at least one audio and video data input channel according to a conference establishment request of a controller; receiving, through the audio and video data input channel, audio and video data sent by at least one video conference terminal via the controller; analysing the audio and video data to obtain at least one service logic corresponding to the audio and video data; calling, according to the service logic, the media processing service unit corresponding to that service logic to process it, thereby obtaining an audio and video data processing result; and sending the audio and video data processing result to each video conference terminal. Audio and video data can thus be exchanged between the multimedia commanding and scheduling system and a third-party video conference system without disconnecting the video conference terminals from the controller and without occupying a large amount of bandwidth resources.

Description

Audio and video data processing method and device, computer equipment and storage medium
Technical Field
The embodiment of the invention relates to the technical field of communication, in particular to an audio and video data processing method and device, computer equipment and a storage medium.
Background
With the continuous development of communication technology and internet technology, teleconferences, in which people or groups in two or more different places exchange voice, image and file data and communicate interactively in real time, have become widely used.
At present, audio and video data are exchanged between a multimedia commanding and scheduling system and a third-party video conference system mainly by connecting the multimedia commanding and scheduling system directly to the video conference terminals of the third-party video conference system. The third-party video conference system may comprise a plurality of video conference terminals and comes from a different manufacturer than the multimedia commanding and scheduling system.
In this prior-art approach, when a plurality of video conference terminals of a third-party video conference system need to be accessed, the video conference terminals must first disconnect from the controller (Media Control Unit, MCU) of the third-party video conference system in order to connect to the multimedia commanding and scheduling system. In addition, before the communication connection between the third-party video conference system and the multimedia commanding and scheduling system can be established, the video conference terminals must be registered with a registration server of the multimedia commanding and scheduling system. As a result, a large amount of bandwidth resources is occupied when audio and video data are exchanged between the multimedia commanding and scheduling system and the plurality of video conference terminals.
Disclosure of Invention
The embodiment of the invention provides an audio and video data processing method and device, computer equipment and a storage medium, so that audio and video data can be exchanged between a multimedia commanding and scheduling system and a third-party video conference system without disconnecting the video conference terminals of the third-party video conference system from the controller and without occupying a large amount of bandwidth resources for transmitting the audio and video data.
In a first aspect, an embodiment of the present invention provides an audio and video data processing method, which is applied to a multimedia commanding and scheduling system, where the multimedia commanding and scheduling system is connected to a controller in a third-party video conference system, and the controller is connected to at least one video conference terminal, and the method includes:
establishing and starting at least one audio and video data input channel according to the conference establishment request of the controller;
receiving, through the audio and video data input channel, audio and video data sent by the at least one video conference terminal via the controller;
performing data analysis on the audio and video data to obtain at least one service logic corresponding to the audio and video data;
calling, according to the service logic, a media processing service unit corresponding to the service logic to process the matched service logic, so as to obtain an audio and video data processing result;
and sending the audio and video data processing result to each video conference terminal via the controller through an audio and video data output channel.
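The five steps above can be sketched end to end as follows. This is only an illustrative outline, not the patented implementation: the class and field names (`Packet`, `SchedulingSystem`, `channel_count`, the `service_logic` tag) are hypothetical, and the media processing service units are stood in for by plain Python callables.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass
class Packet:
    terminal_id: str      # which video conference terminal produced the data
    service_logic: str    # hypothetical tag; the source does not say how logic is encoded
    payload: bytes


class SchedulingSystem:
    """Toy stand-in for the multimedia commanding and scheduling system."""

    def __init__(self, units: Dict[str, Callable[[Packet], bytes]]) -> None:
        self.units = units                    # service logic -> processing unit
        self.input_channels: List[str] = []

    def open_input_channels(self, request: dict) -> List[str]:
        # Step 1: establish and start input channels from the controller's request.
        count = request.get("channel_count", 1)
        self.input_channels = [f"in-{i}" for i in range(count)]
        return self.input_channels

    def analyse(self, packet: Packet) -> str:
        # Step 3: derive the service logic from the received data.
        return packet.service_logic

    def process(self, packet: Packet) -> bytes:
        # Step 4: call the unit that matches the service logic.
        return self.units[self.analyse(packet)](packet)

    def run(self, received: List[Packet], terminals: List[str]) -> Dict[str, List[bytes]]:
        # Step 2 is modelled by `received`; step 5 fans the results out to every terminal.
        results = [self.process(p) for p in received]
        return {t: list(results) for t in terminals}
```

A caller would construct `SchedulingSystem` with one callable per service logic, open the channels, and pass the packets received via the controller to `run`.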
In a second aspect, an embodiment of the present invention further provides an audio/video data processing apparatus, which is applied to a multimedia command and scheduling system, and the apparatus includes:
the audio and video data input channel establishing module is used for establishing and starting at least one audio and video data input channel according to the conference establishing request of the controller;
the audio and video data receiving module is used for receiving, through the audio and video data input channel, the audio and video data sent by the at least one video conference terminal via the controller;
the data analysis module is used for carrying out data analysis on the audio and video data to obtain at least one service logic corresponding to the audio and video data;
the processing result acquisition module is used for calling, according to the service logic, a media processing service unit corresponding to the service logic to process the matched service logic, so as to obtain an audio and video data processing result;
and the processing result sending module is used for sending the audio and video data processing result to each video conference terminal via the controller through an audio and video data output channel.
In a third aspect, an embodiment of the present invention further provides a computer device, which includes a memory, a processor, and a computer program stored in the memory and executable on the processor, where the processor implements the audio and video data processing method according to any embodiment of the present invention when executing the program.
In a fourth aspect, the present invention further provides a storage medium containing computer-executable instructions, where the computer-executable instructions are used to execute the audio-video data processing method according to any one of the embodiments of the present invention when executed by a computer processor.
According to the embodiment of the invention, at least one audio and video data input channel is established and started according to a conference establishment request of a controller; audio and video data sent by at least one video conference terminal via the controller are received through the audio and video data input channel; the audio and video data are analysed to obtain at least one service logic corresponding to the audio and video data; a media processing service unit corresponding to the service logic is called according to the service logic to process the matched service logic, so as to obtain an audio and video data processing result; and the audio and video data processing result is sent to each video conference terminal via the controller through the audio and video data output channel. Audio and video data can thus be exchanged between the multimedia commanding and scheduling system and a third-party video conference system without disconnecting the video conference terminals of the third-party video conference system from the controller and without occupying a large amount of bandwidth resources for transmitting the audio and video data.
Drawings
Fig. 1 is a flowchart of an audio/video data processing method in an embodiment of the present invention;
Fig. 2 is a flowchart of an audio/video data processing method according to a second embodiment of the present invention;
Fig. 3 is a schematic structural diagram of a multimedia scheduling system according to a second embodiment of the present invention;
Fig. 4 is an external structural diagram of a transcoding media server in a multimedia scheduling system according to a second embodiment of the present invention;
Fig. 5 is a schematic internal structural diagram of a transcoding media server in a multimedia scheduling system according to a second embodiment of the present invention;
Fig. 6 is a schematic diagram of a session creation process applicable to the second embodiment of the present invention;
Fig. 7 is a schematic diagram of a process of creating a mixer, which is applicable to the second embodiment of the present invention;
Fig. 8 is a schematic diagram of a process of creating a mixer, which is applicable to the second embodiment of the present invention;
Fig. 9 is a schematic diagram of an input flow of audio/video data applied in the second embodiment of the present invention;
Fig. 10 is a schematic diagram of an audio/video data input disconnection process applicable to the second embodiment of the present invention;
Fig. 11 is a schematic diagram of an output flow of audio/video data applied in the second embodiment of the present invention;
Fig. 12 is a schematic diagram of an audio/video data output disconnection process applicable to the second embodiment of the present invention;
Fig. 13 is a schematic structural diagram of an audio/video data processing apparatus according to a third embodiment of the present invention;
Fig. 14 is a schematic structural diagram of a computer device in the fourth embodiment of the present invention.
Detailed Description
The embodiments of the present invention will be described in further detail with reference to the drawings and examples. It is to be understood that the specific embodiments described herein are merely illustrative of and not restrictive on the broad invention. It should be further noted that, for convenience of description, only some structures, not all structures, relating to the embodiments of the present invention are shown in the drawings.
Embodiment one
Fig. 1 is a flowchart of an audio/video data processing method according to an embodiment of the present invention. This embodiment is applicable to exchanging audio and video data between a multimedia commanding and scheduling system and a third-party video conference system. The method may be executed by an audio/video data processing apparatus, which may be implemented in software and/or hardware. The apparatus may be an independent hardware device, for example an independent server added to the multimedia commanding and scheduling system, or it may be integrated into an existing central management server of the multimedia commanding and scheduling system. The method is applied to a multimedia commanding and scheduling system, the multimedia commanding and scheduling system is connected with a controller in a third-party video conference system, and the controller is connected with at least one video conference terminal. Referring to fig. 1, the audio/video data processing method specifically includes the following steps:
S110: establishing and starting at least one audio and video data input channel according to the conference establishment request of the controller.
Specifically, before the video conference is created, a communication connection must be established between the multimedia commanding and scheduling system and the controller of the third-party video conference system; once that connection is established, at least one audio and video data input channel can be established and opened. It should be noted that the number of video conference terminals connected to the controller may be one or more, which is not limited in the embodiment of the present invention. For example, a video conference terminal may collect audio and video data from the conference room or room in which it is located and transmit the collected audio and video data, via the controller, to the multimedia commanding and scheduling system through an audio and video data input channel.
It should be noted that, in the embodiment of the present invention, the number of the audio/video data input channels may be one or multiple, and the embodiment of the present invention is not limited thereto.
Optionally, establishing and starting at least one audio and video data input channel according to the conference establishment request of the controller may include: establishing a session according to the conference establishment request of the controller; establishing and configuring at least one screen mixer and/or at least one audio mixer in the session according to terminal information of at least one video conference terminal included in the conference establishment request; and establishing and opening at least one audio and video data input channel according to the session.
Specifically, after receiving a conference establishment request sent by the controller, the multimedia commanding and scheduling system analyzes the received conference establishment request, creates a session corresponding to the conference establishment request, and returns an Identification (ID) of the created session to the controller of the third-party video conference system.
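The session-creation exchange described above can be sketched as follows. The request layout (a `terminals` list of dictionaries) and the use of a random UUID as the session ID are assumptions made for illustration; the source only states that a session is created and its ID is returned to the controller.

```python
import uuid
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class Session:
    session_id: str
    terminals: List[dict] = field(default_factory=list)


class SessionManager:
    def __init__(self) -> None:
        self.sessions: Dict[str, Session] = {}

    def create_session(self, request: dict) -> str:
        # Analyse the conference establishment request (field names are hypothetical).
        terminals = list(request.get("terminals", []))
        # Create the session and remember it under a fresh ID.
        session_id = uuid.uuid4().hex
        self.sessions[session_id] = Session(session_id, terminals)
        # The ID is the value handed back to the third-party controller.
        return session_id
```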
Further, at least one screen mixer and/or at least one audio mixer may be established and configured in the session according to the terminal information of the at least one video conference terminal included in the conference establishment request. For example, if the conference establishment request includes terminal information of video conference terminal A and video conference terminal B, that terminal information may include the Internet Protocol (IP) addresses of video conference terminal A and video conference terminal B, the port information of video conference terminal A and video conference terminal B, and the like. At least one screen mixer and/or at least one audio mixer may then be established and configured from the terminal information of video conference terminal A and video conference terminal B included in the received conference establishment request.
It should be noted that video data collected by different video conference terminals can be processed by the configured screen mixer, and audio data collected by different video conference terminals can be processed by the configured audio mixer.
Optionally, a screen mixer may be created according to the video resolution, the code rate, the frame rate and the number of split screens; and/or an audio mixer may be created according to terminal information of at least one video conference terminal participating in the session. Specifically, a plurality of screen mixers can be created according to the resolution, code rate, frame rate and number of split screens of the video collected by the video conference terminals; an audio mixer may also be created according to terminal information of at least one video conference terminal participating in the session.
For example, if the video conference terminals participating in the session include video conference terminal A and video conference terminal B, a plurality of screen mixers may be created according to the resolution, code rate, frame rate and number of split screens of the videos collected by video conference terminal A and video conference terminal B; a plurality of audio mixers may be created according to the IP information or port information of video conference terminal A and video conference terminal B.
It should be noted that different mixers may take their input from, and send their output to, different video conference terminals. For example, within one session, the signals input to mixer A may be the video signals collected by video conference terminal A and video conference terminal B, and the video signal processed by mixer A may be output to video conference terminal C; the signals input to mixer B may likewise be the video signals collected by video conference terminal A and video conference terminal B, and the video signal processed by mixer B may be output to video conference terminal A.
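As a rough illustration of the two kinds of mixer discussed above (one for video, called a screen mixer here, and one for audio), the configuration could be modelled like this. All class names, field names and numeric values are hypothetical; the source only lists which parameters each mixer is created from and gives the A/B/C routing example.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class ScreenMixer:
    # Created from video parameters, as described in the text.
    name: str
    width: int
    height: int
    bitrate_kbps: int
    frame_rate: int
    split_screens: int
    inputs: List[str] = field(default_factory=list)    # source terminals
    outputs: List[str] = field(default_factory=list)   # destination terminals


@dataclass
class AudioMixer:
    # Created from terminal information (IP and port).
    name: str
    endpoints: List[str] = field(default_factory=list)


def make_audio_mixer(name: str, terminals: List[dict]) -> AudioMixer:
    return AudioMixer(name, [f"{t['ip']}:{t['port']}" for t in terminals])


def example_session_mixers() -> List[ScreenMixer]:
    # Mirrors the routing example: both mixers read terminals A and B,
    # mixer A outputs to terminal C and mixer B outputs to terminal A.
    mixer_a = ScreenMixer("A", 1280, 720, 2048, 25, 2, ["A", "B"], ["C"])
    mixer_b = ScreenMixer("B", 1280, 720, 2048, 25, 2, ["A", "B"], ["A"])
    return [mixer_a, mixer_b]
```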
Furthermore, at least one audio/video data input channel can be established and opened according to the established session. It should be noted that the number of the audio/video data input channels may be one or multiple; the audio and video data input channel can transmit audio and video data acquired by one video conference terminal and can also simultaneously transmit audio and video data acquired by a plurality of video conference terminals.
S120: receiving, through the audio and video data input channel, audio and video data sent by at least one video conference terminal via the controller.
Specifically, after at least one audio and video data input channel is established and opened, audio and video data sent by at least one video conference terminal via the controller can be received through each audio and video data input channel. The controller sends the audio and video data collected by different video conference terminals to the multimedia commanding and scheduling system through different audio and video data input channels.
Illustratively, the multimedia commanding and scheduling system establishes and opens two audio and video data input channels, namely an audio and video data input channel A and an audio and video data input channel B; and the controller transmits the audio and video data collected by the video conference terminal A, the video conference terminal B and the video conference terminal C to the multimedia command and scheduling system through different audio and video data input channels. For example, the controller can send the audio and video data collected by the video conference terminal a and the video conference terminal B to the multimedia commanding and scheduling system through the audio and video data input channel a; the controller can also send the audio and video data collected by the video conference terminal C to the multimedia commanding and dispatching system through the audio and video data input channel B.
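The channel assignment in this example can be written down as a small lookup table. The sketch below is illustrative only: the function name, the channel labels and the tuple layout of a received item are assumptions, not part of the source.

```python
from collections import defaultdict
from typing import Dict, List, Tuple

# Terminal -> input channel, following the example in the text.
ASSIGNMENT: Dict[str, str] = {"A": "channel-A", "B": "channel-A", "C": "channel-B"}


def group_by_channel(
    assignment: Dict[str, str],
    items: List[Tuple[str, bytes]],
) -> Dict[str, List[Tuple[str, bytes]]]:
    """Group (terminal, payload) items by the input channel they travel on."""
    per_channel: Dict[str, List[Tuple[str, bytes]]] = defaultdict(list)
    for terminal, payload in items:
        per_channel[assignment[terminal]].append((terminal, payload))
    return dict(per_channel)
```

One channel carries data from a single terminal (C) while the other carries data from two terminals (A and B), which matches the statement that a channel may serve one or several terminals.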
Optionally, receiving, through the audio and video data input channel, the audio and video data sent by the at least one video conference terminal via the controller may further include: when it is detected that the audio and video data input channel is disconnected, stopping using the audio and video data input channel and deleting the audio and video data input channel.
Specifically, when it is detected that the client software performs a disconnection operation on the created audio/video input channel, the controller cannot continuously send audio/video data to the multimedia commanding and scheduling system through the channel; the multimedia commanding and dispatching system stops using the audio and video data input channel and deletes the audio and video data input channel.
It should be noted that each session may include a plurality of screen mixers and audio mixers, so the same audio and video data may be used by several of them, in which case the connection ID is multiplexed. If the connection ID is multiplexed, it is only necessary to stop using and delete the audio and video data input channel; if the connection ID is not multiplexed, the connection ID is deleted as well. Here the connection ID is the ID of the created session.
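One way to read the multiplexing rule is as reference counting on the connection ID: the ID is dropped only when the last input channel that uses it has gone. The sketch below follows that reading, which is an interpretation rather than something the source states; the class and method names are hypothetical.

```python
from typing import Dict, Set


class InputConnections:
    def __init__(self) -> None:
        # connection ID -> names of the input channels currently using it
        self.channels: Dict[str, Set[str]] = {}

    def open(self, connection_id: str, channel: str) -> None:
        self.channels.setdefault(connection_id, set()).add(channel)

    def disconnect(self, connection_id: str, channel: str) -> bool:
        """Delete the channel; return True if the connection ID was deleted too."""
        users = self.channels.get(connection_id)
        if users is None:
            return False                    # unknown connection, nothing to delete
        users.discard(channel)              # stop using and delete the channel
        if users:
            return False                    # ID still multiplexed: keep it
        del self.channels[connection_id]    # not multiplexed: delete the ID as well
        return True
```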
S130: carrying out data analysis on the audio and video data to obtain at least one service logic corresponding to the audio and video data.
Specifically, after the multimedia commanding and scheduling system receives, through the audio and video data input channel, the audio and video data sent by at least one video conference terminal via the controller, it can analyse the received audio and video data to obtain at least one service logic corresponding to the audio and video data. The service logic corresponding to the audio and video data may be a conference control service logic, an audio mixing control service logic, or a screen mixing control service logic.
S140: calling, according to the service logic, a media processing service unit corresponding to the service logic to process the matched service logic, so as to obtain an audio and video data processing result.
Specifically, after the received audio and video data is analysed to obtain at least one service logic corresponding to the audio and video data, the media processing service unit corresponding to the service logic can be called according to the service logic to process the matched service logic, so as to obtain an audio and video data processing result. Each media processing service unit is a media processing service unit in the controller. For example, the controller interface may be called by Remote Procedure Call (RPC), and different media processing service units may be called according to different service logics to process those service logics.
The advantage of this arrangement is that different media processing service units in the controller process different service logics, so that the load of the controller can be reasonably distributed without the problem of excessive load of a certain media processing service unit.
Illustratively, if analysing the received audio and video data yields a conference control service logic, the media processing service unit corresponding to the conference control service logic is called to process it; if the analysis yields an audio mixing control service logic, the media processing service unit corresponding to the audio mixing control service logic is called to process it; and if the analysis yields a screen mixing control service logic, the media processing service unit corresponding to the screen mixing control service logic is called to process it.
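The three-way selection described here amounts to a dispatch table keyed by service logic. In the sketch below, local functions stand in for the media processing service units that the source says are reached over RPC; the function names, the string keys and the returned values are all hypothetical.

```python
from typing import Callable, Dict


def conference_control_unit(data: bytes) -> str:
    return f"conference-control:{len(data)}"


def audio_mixing_unit(data: bytes) -> str:
    return f"audio-mixing:{len(data)}"


def screen_mixing_unit(data: bytes) -> str:
    return f"screen-mixing:{len(data)}"


# Service logic -> media processing service unit (stand-ins for RPC stubs).
UNITS: Dict[str, Callable[[bytes], str]] = {
    "conference_control": conference_control_unit,
    "audio_mixing": audio_mixing_unit,
    "screen_mixing": screen_mixing_unit,
}


def dispatch(service_logic: str, data: bytes) -> str:
    """Call the unit matching the service logic and return its result."""
    try:
        unit = UNITS[service_logic]
    except KeyError:
        raise ValueError(f"no media processing service unit for {service_logic!r}") from None
    return unit(data)
```

Keeping one unit per service logic reflects the load-distribution argument in the text: each kind of work goes to its own unit rather than to a single shared one.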
S150: sending the audio and video data processing result to each video conference terminal through the audio and video data output channel.
Specifically, after the audio and video data processing result is obtained by calling different media processing service units of the controller to process different service logics, the audio and video data processing result can be sent to each video conference terminal of the third-party video conference system through the audio and video data output channel.
Optionally, sending the audio and video data processing result to each video conference terminal through the audio and video data output channel may further include: when it is detected that the audio and video data output channel is disconnected, stopping using the audio and video data output channel and deleting the audio and video data output channel.
Specifically, when it is detected that the client software performs a disconnection operation on the created audio and video data output channel, the controller can no longer send audio and video data processing results to the video conference terminals through that channel; the controller stops using the audio and video data output channel and deletes it. It should be noted that, because the controller sends the processed audio and video data to different video conference terminals, the connection is never multiplexed on the output side, and therefore the connection ID, that is, the session, must also be deleted in the end.
The scheme of the embodiment can be applied to a multimedia commanding and scheduling system, wherein the multimedia commanding and scheduling system is connected with a controller in a third-party video conference system, and the controller is connected with at least one video conference terminal. Specifically, at least one audio/video data input channel is established and opened according to a conference establishment request of the controller; receiving audio and video data sent by at least one video conference terminal through a controller through an audio and video data input channel; carrying out data analysis on the audio and video acquisition data to obtain at least one service logic corresponding to the audio and video data; calling a media processing service unit corresponding to the service logic according to the service logic to process the matched service logic to obtain an audio and video data processing result; and transmitting the audio and video data processing result to each video conference terminal through the audio and video data output channel. The intercommunication of audio and video data between the multimedia command scheduling system and a third-party video conference system is realized, the connection of a plurality of video conference terminals and a controller does not need to be disconnected, and a large amount of bandwidth resources are not occupied.
Example two
Fig. 2 is a flowchart of an audio and video data processing method according to a second embodiment of the present invention. This embodiment refines the foregoing embodiment. Specifically, before receiving, through an audio and video data input channel, audio and video data sent by at least one video conference terminal via a controller, the method may further include: detecting whether the audio and video data input channel is successfully established, and if so, starting the audio and video data input channel; and before the audio/video data processing result is sent to each video conference terminal through the audio/video data output channel, the method may further include: acquiring the controller IP and a port for sending the audio and video data processing result. Referring to fig. 2, the method includes the following steps:
S210, establishing and starting at least one audio and video data input channel according to the conference establishment request of the controller.
S220, detecting whether the audio and video data input channel is successfully established, and if so, starting the audio and video data input channel.
Specifically, before receiving, through the audio/video data input channel, the audio/video data sent by at least one video conference terminal via the controller, the multimedia commanding and scheduling system may further detect whether the audio/video data input channel is successfully created, and if so, open the audio/video data input channel.
Optionally, before detecting whether the audio/video data input channel is successfully created and starting it, the method may further include: acquiring the IP of the controller and the port through which the controller sends audio and video data.
In a specific example of the embodiment of the present invention, the multimedia command scheduling system may send a create-connection instruction (create) to the controller and receive the result returned by the controller; if the result returned by the controller is the ID corresponding to the audio and video data input channel, the audio and video data input channel has been successfully established, that is, the controller can forward the audio and video data acquired by at least one video conference terminal through the audio and video data input channel; if the result returned by the controller is an error code, the audio/video data input channel has not been successfully established, that is, the controller cannot forward the audio/video data acquired by at least one video conference terminal through the audio/video data input channel.
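The success check described above (an ID means the channel was created, an error code means it was not) can be sketched as follows. The encoding is an assumption made for illustration: the source does not say how IDs and error codes are represented, so this sketch treats a non-empty string as an ID and anything else as an error code.

```python
from typing import Union


class ChannelCreateError(Exception):
    """Raised when the controller answers a create request with an error code."""


def parse_create_result(result: Union[str, int]) -> str:
    """Return the channel ID on success; raise on an error code.

    Assumption: IDs arrive as non-empty strings and error codes as integers.
    """
    if isinstance(result, str) and result:
        return result
    raise ChannelCreateError(f"channel creation failed with code {result!r}")


def open_if_created(result: Union[str, int]) -> dict:
    """Open the input channel only when creation succeeded."""
    channel_id = parse_create_result(result)
    return {"id": channel_id, "open": True}
```

Raising on failure keeps the caller from opening, or later trying to delete, a channel that never existed.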
S230, receiving audio and video data sent by at least one video conference terminal through the controller through the audio and video data input channel.
S240, carrying out data analysis on the audio and video data to obtain at least one service logic corresponding to the audio and video data.
S250, calling a media processing service unit corresponding to the service logic according to the service logic to process the matched service logic to obtain an audio and video data processing result.
S260, acquiring the controller IP and a port for sending an audio and video data processing result.
Specifically, before the audio/video data processing result is sent to each video conference terminal through the audio/video data output channel, the method may further include: acquiring the controller IP and a port for sending the audio and video data processing result.
Further, after acquiring the controller IP and the port for sending the audio/video data processing result, the method may further include: creating an audio and video data output channel according to the controller IP and the port for sending the audio and video data processing result, detecting whether the audio and video data output channel is successfully created, and if so, starting the audio and video data output channel.
In a specific example of the embodiment of the present invention, a create connection instruction create may be sent to the controller through the multimedia command scheduling system, and a result returned by the controller is received; if the result returned by the controller is the ID corresponding to the audio and video data output channel, the audio and video data output channel is successfully established, namely the controller can send the audio and video data processing result to at least one video conference terminal through the audio and video data output channel; and if the result returned by the controller is an error code, the audio and video data output channel is not successfully established, namely the controller cannot send the audio and video data processing result to at least one video conference terminal through the audio and video data output channel.
S270, transmitting the audio and video data processing result to each video conference terminal through the audio and video data output channel.
This embodiment refines the above embodiment: before receiving audio and video data sent by at least one video conference terminal via a controller through an audio and video data input channel, the Internet Protocol (IP) address of the controller and the port through which the controller sends the audio and video data may also be obtained; and before the audio and video data processing result is sent to each video conference terminal through the audio and video data output channel, the controller IP and the port for sending the audio and video data processing result may be obtained. In this way, audio and video data can be exchanged between the multimedia command scheduling system and a third-party video conference system without disconnecting the video conference terminals from the controller and without occupying a large amount of bandwidth resources.
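Steps S210 to S270 can be read as a small pipeline. The sketch below is only one way to arrange them, under stated assumptions: the function name `run_pipeline`, its parameters, and the representation of packets and results as dictionaries are all hypothetical, and channel creation is reduced to a boolean flag.

```python
from typing import Callable, Dict, Iterable, List


def run_pipeline(
    packets: Iterable[dict],
    analyse: Callable[[dict], List[str]],
    units: Dict[str, Callable[[dict], dict]],
    send: Callable[[dict], None],
    input_created: bool = True,
) -> int:
    """Process packets end to end and return the number of results sent."""
    # S210/S220: the input channel is opened only if creation succeeded.
    if not input_created:
        raise RuntimeError("audio/video data input channel was not created")
    sent = 0
    for packet in packets:                 # S230: data received via the controller
        for logic in analyse(packet):      # S240: derive the service logic
            result = units[logic](packet)  # S250: call the matching unit
            send(result)                   # S260/S270: send via the output channel
            sent += 1
    return sent
```

For example, passing `send=results.append` with a list `results` collects the processing results in the order in which the packets arrived.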
Application scenarios
For a better understanding of the embodiments of the invention, an application scenario is described below. Specifically, fig. 3 is a schematic structural diagram of a multimedia commanding and dispatching system in an embodiment of the present invention. As can be seen from fig. 3, the multimedia commanding and dispatching system 310 includes: a central management server 311, a transcoding media server 312 and a plurality of command scheduling terminals 313; the third-party video conference system 320 includes: a controller 321 and a plurality of video conference terminals 322.
The multimedia command scheduling system 310 establishes a communication connection with the controller 321 in the third-party video conference system 320 through the transcoding media server 312, and for example, the transcoding media server 312 may establish a communication connection with the controller 321 in the third-party video conference system 320 through a Session Initiation Protocol (SIP) or an Application Programming Interface (API).
It should be noted that the transcoding media server 312 in the multimedia commanding and dispatching system 310 can be an independent process, as shown in fig. 4, which can be started by system operation and maintenance through a REST interface and is connected to the central management server 311 and the controller 321 through a SIP message interface; the transcoding media server 312 may also compose and transcode the audio/video data that is collected by the video conference terminals 322 and forwarded to the multimedia commanding and scheduling system 310 through the controller 321, and output the converted audio/video data to the controller 321.
Specifically, the internal structure of the transcoding media server 312 is as shown in fig. 5, and it should be noted that the transcoding media server 312 mainly includes a signaling processing unit 510, a load balancing unit 520, and a media processing service unit 530; the number of the media processing service units 530 may be one or more, and fig. 5 illustrates three media processing service units as an example, which is not intended to limit the embodiments of the present invention; the media processing service unit 530 may include a streaming media management, a mixer, a media engine, and the like.
The signaling processing unit 510 is configured to analyze the audio and video data or the instruction that is received through the SIP message interface from the central management server 311 or the controller 321, obtain the service logic corresponding to the audio and video data or the instruction, and call the corresponding interface of the load balancing unit 520 according to the service logic obtained. The service logic obtained by the analysis may be a conference control service logic, an audio mixing control service logic, or a mixed screen control service logic. For example, if the service logic obtained is a conference control service logic, the conference control service interface of the load balancing unit 520 may be called; if the service logic obtained is an audio mixing control service logic, the audio mixing control service interface of the load balancing unit 520 may be called; if the service logic obtained is a mixed screen control service logic, the mixed screen control service interface of the load balancing unit 520 may be called.
The load balancing unit 520 may call an interface of the controller 321 through an RPC, and reasonably allocate the service logic to the host of the controller 321 with the lowest load for processing according to the host load of the controller 321, that is, select different media processing units to process the service logic obtained by analysis.
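The lowest-load selection performed by the load balancing unit can be sketched in a few lines. The load metric is an assumption here: the source says only that the host with the lowest load is chosen, so this sketch takes a mapping from a hypothetical host name to a numeric load figure.

```python
from typing import Dict


def pick_least_loaded(loads: Dict[str, float]) -> str:
    """Return the host with the smallest reported load.

    Ties are resolved in favour of the host that appears first in the mapping.
    """
    if not loads:
        raise ValueError("no hosts available")
    return min(loads, key=loads.get)
```

Any monotonic load measure (CPU share, active sessions, queued jobs) would work with the same selection rule.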
The streaming media management can realize unified management of the different service logics obtained by analyzing streaming media objects, as well as of the video mixers, audio mixers, and media engines. The video mixer may mix the input video streams according to a screen mixing mode; the audio mixer may mix the input audio streams according to an audio mixing strategy; the media engine can encode and decode the audio and video data.
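As a concrete illustration of mixing audio streams, the sketch below sums 16-bit PCM samples and clips the result. This particular strategy (additive mixing with hard clipping) is an assumption chosen for simplicity; the source does not state which mixing strategy is used, and the function name `mix_pcm16` is hypothetical.

```python
from typing import List, Sequence

INT16_MIN, INT16_MAX = -32768, 32767


def mix_pcm16(streams: Sequence[Sequence[int]]) -> List[int]:
    """Mix several 16-bit PCM sample sequences by summing and clipping.

    Shorter streams are treated as silent once they run out of samples.
    """
    if not streams:
        return []
    length = max(len(s) for s in streams)
    mixed = []
    for i in range(length):
        total = sum(s[i] for s in streams if i < len(s))
        mixed.append(max(INT16_MIN, min(INT16_MAX, total)))
    return mixed
```

Hard clipping is audible on loud input; a production mixer would more likely scale or limit the sum, which is outside the scope of this sketch.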
In a specific example of the embodiment of the present invention, the external interaction process of the transcoding media server 312 may include: creating a session, creating mixers, audio and video data input, audio and video data output, disconnecting input and output, and the like. These procedures are described in detail below:
Fig. 6 is a schematic diagram of a session creation flow applicable to the second embodiment of the present invention. Specifically, when a conference is created, a corresponding session needs to be created: the controller 321 of the third-party video conference system sends a "creatssession" instruction to the transcoding media server 312; the signaling processing unit analyzes the instruction received by the transcoding media server 312, performs internal interface calls through the media processing service unit, and finally returns a session ID to the controller 321; the controller 321 receives the session ID, which completes the creation of the session.
Further, after the session is created, a video mixer may be created according to the pictures seen by the members of the conference, and configured accordingly. Fig. 7 is a schematic diagram of a process of creating a video mixer applicable to the second embodiment of the present invention. Specifically, the signaling processing unit parses the received 'creatvideomix' instruction sent by the controller 321, performs internal interface calls through the media processing service unit, and finally returns the video mixer ID to the controller 321; the controller 321 receives the video mixer ID, which completes the creation of one video mixer.
Fig. 8 is a schematic diagram of a process of creating an audio mixer applicable to the second embodiment of the present invention. Specifically, the signaling processing unit parses the received "creataudiomix" instruction sent by the controller 321, performs internal interface calls through the media processing service unit, and finally returns the audio mixer ID to the controller 321; the controller 321 receives the audio mixer ID, which completes the creation of one audio mixer.
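The three creation flows above share one shape: an instruction arrives, an internal object is created, and its ID is returned to the controller. A minimal sketch of that shape follows. The instruction strings are taken from the text as written; the class name `SignalingHandler`, the ID format, and the in-memory session table are assumptions made for illustration.

```python
import itertools
from typing import Dict


class SignalingHandler:
    """Maps the instructions named in the text to ID-returning handlers."""

    def __init__(self) -> None:
        self._counter = itertools.count(1)
        self.sessions: Dict[str, dict] = {}

    def _next_id(self, prefix: str) -> str:
        return f"{prefix}-{next(self._counter)}"

    def handle(self, instruction: str, session_id: str = "") -> str:
        if instruction == "creatssession":
            new_id = self._next_id("session")
            self.sessions[new_id] = {"video_mixers": [], "audio_mixers": []}
            return new_id
        if instruction in ("creatvideomix", "creataudiomix"):
            if session_id not in self.sessions:
                raise KeyError(f"unknown session {session_id!r}")
            kind = "video_mixers" if instruction == "creatvideomix" else "audio_mixers"
            mixer_id = self._next_id(kind[:-1])  # "video_mixer" or "audio_mixer"
            self.sessions[session_id][kind].append(mixer_id)
            return mixer_id
        raise ValueError(f"unsupported instruction {instruction!r}")
```

Keeping the mixers under their session makes it straightforward to tear everything down when the session is deleted.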
It should be noted that different conference members can be shown different images and played different sounds by creating different video mixers and audio mixers. For example, a plurality of different video mixers and audio mixers may be created according to the pictures seen and the sounds heard by the conference members, which is not described in detail in the embodiments of the present invention.
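To make the idea of a per-member picture concrete, the sketch below computes tile rectangles for a split-screen layout. The near-square grid rule and the function name `grid_layout` are assumptions; the source does not describe how the picture is divided.

```python
import math
from typing import List, Tuple


def grid_layout(width: int, height: int, tiles: int) -> List[Tuple[int, int, int, int]]:
    """Return (x, y, w, h) rectangles for `tiles` split screens on a canvas.

    Tiles are placed row by row on a near-square grid; unused cells stay empty.
    """
    if tiles < 1:
        raise ValueError("at least one tile is required")
    cols = math.ceil(math.sqrt(tiles))
    rows = math.ceil(tiles / cols)
    tile_w, tile_h = width // cols, height // rows
    return [
        ((i % cols) * tile_w, (i // cols) * tile_h, tile_w, tile_h)
        for i in range(tiles)
    ]
```

Two members who should see different pictures would simply get two video mixers, each with its own tile list and input assignment.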
After a plurality of different video mixers and audio mixers are created as described above, an input stream can be pulled according to the input of audio/video data, and an audio/video data input channel can be created and opened. Fig. 9 is a schematic diagram of an input flow of audio and video data applicable to the second embodiment of the present invention. Specifically, the signaling processing unit parses the received "getaddress" instruction sent by the controller 321, performs internal interface calls through the media processing service unit, and finally returns the destination IP of the input stream and the audio/video ports to the controller 321; the signaling processing unit then parses the received 'creatconnect' instruction sent by the controller 321, performs internal interface calls through the media processing service unit, and finally returns the audio/video data input channel ID to the controller 321; after the signaling negotiation is completed, a video input and an audio input need to be added, and whether they are added successfully is judged from the returned code; if they are added successfully, a video channel and an audio channel are created, and the audio and video data input channel is created and opened.
When a certain audio/video data input channel is disconnected, the audio/video data input channel needs to be stopped and then deleted, in that order. Because a plurality of video mixers and audio mixers exist in one session, one input stream may be present in several of them, in which case the connection ID is multiplexed. If the connection ID is not multiplexed, the connection also needs to be deleted; if the connection ID is multiplexed, it is only necessary to stop the audio/video data input channel and delete the audio/video data input channel. Fig. 10 is a schematic diagram of an audio/video data input disconnection process applicable to the second embodiment of the present invention, where, when client software sends an input stream disconnection instruction to the controller of the third-party video conference system, the controller sends, in sequence, instructions to the transcoding media server to stop using the audio/video data input channel, delete the audio/video data input channel, and delete the connection, which finally implements the audio/video data input disconnection operation.
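The rule for multiplexed connection IDs amounts to reference counting: the connection is deleted only when the last input channel using it goes away. The sketch below models that rule under stated assumptions: the class `ConnectionRegistry`, the step strings, and the idea that each mixer attaches its own channel to a shared connection are illustrative and not taken from the source.

```python
from typing import Dict, List, Set


class ConnectionRegistry:
    """Tracks which input channels share each connection ID."""

    def __init__(self) -> None:
        self._channels: Dict[str, Set[str]] = {}

    def attach(self, connection_id: str, channel_id: str) -> None:
        self._channels.setdefault(connection_id, set()).add(channel_id)

    def disconnect(self, connection_id: str, channel_id: str) -> List[str]:
        """Return the ordered teardown steps for one input channel."""
        channels = self._channels.get(connection_id, set())
        if channel_id not in channels:
            raise KeyError(f"{channel_id!r} is not attached to {connection_id!r}")
        # Always stop first, then delete the channel.
        steps = [f"stop:{channel_id}", f"delete_channel:{channel_id}"]
        channels.remove(channel_id)
        # Delete the connection only when no other channel still uses it.
        if not channels:
            del self._channels[connection_id]
            steps.append(f"delete_connection:{connection_id}")
        return steps
```

With two channels attached to one connection, the first disconnect yields only the stop and delete-channel steps, and the second also yields the delete-connection step.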
When a composite stream is output on demand, the IP and audio/video ports for outputting the composite audio/video data need to be acquired, a connection, an output, and an audio/video data output channel need to be created, and the channel needs to be opened. Fig. 11 is a schematic diagram of an output flow of audio and video data applicable to the second embodiment of the present invention. Specifically, before the controller replies to the session requested by the client software, the IP and audio/video ports to which the composite audio/video data is sent need to be acquired; after the signaling negotiation is finished, a connection is created, and the returned connection ID is the unique identifier of the connection; after the connection is successfully created, an output stream is added, and whether the output is added successfully is judged from the returned code; if so, an audio and video data output channel is created, and the channel is opened after it is successfully created; after the setup is finished, the composite audio and video data is output through the acquired IP and audio/video ports.
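The output setup is an ordered sequence in which each step depends on the previous one succeeding. The sketch below expresses that ordering against an injected `send` function. The instruction names used here, the reply dictionaries, and the convention that code 0 means success are all hypothetical; the source gives no instruction names or code values for the output flow.

```python
from typing import Callable, Dict

Send = Callable[[str], Dict[str, object]]


def setup_output(send: Send) -> Dict[str, object]:
    """Run the output setup steps in order and return the negotiated endpoint.

    Assumption: a reply carrying code 0 means success.
    """
    address = send("get_address")            # IP and audio/video port
    connection = send("create_connection")   # reply carries the connection ID
    added = send("add_output")
    if added.get("code") != 0:
        raise RuntimeError("adding the output stream failed")
    channel = send("create_output_channel")
    if channel.get("code") != 0:
        raise RuntimeError("creating the output channel failed")
    send("open_output_channel")              # opened only after creation succeeds
    return {
        "ip": address["ip"],
        "port": address["port"],
        "connection_id": connection["id"],
    }
```

Injecting `send` keeps the sequence testable without a controller: a stub that records the instructions it receives is enough to check the order.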
Furthermore, when the output is disconnected, the audio and video data output channel needs to be stopped and then deleted.
Illustratively, if the whole conference needs to be deleted, the session corresponding to the conference needs to be deleted after the input and output disconnection operations are completed. When the code returned while pulling the input stream or the output stream indicates failure, the whole session fails, and the conference needs to be started again.
Fig. 12 is a schematic diagram of an audio/video data output disconnection process applicable to the second embodiment of the present invention, where, when the client software sends an output stream disconnection instruction to the controller of the third-party video conference system, the controller sends, in sequence, instructions to the transcoding media server to stop using the audio/video data output channel, delete the audio/video data output channel, and delete the connection, which finally implements the audio/video data output disconnection operation.
In the above example, the transcoding media server in the multimedia command and dispatch system establishes and opens at least one audio/video data input channel according to the conference establishment request of the controller; receives, through the audio and video data input channel, audio and video data sent by at least one video conference terminal via the controller; performs data analysis on the audio and video data to obtain at least one service logic corresponding to the audio and video data; calls a media processing service unit corresponding to the service logic to process the matched service logic, so as to obtain an audio and video data processing result; and sends the audio and video data processing result to each video conference terminal. In this way, audio and video data can be exchanged between the multimedia command scheduling system and a third-party video conference system without disconnecting the video conference terminals from the controller and without occupying a large amount of bandwidth resources.
Example three
Fig. 13 is a schematic structural diagram of an audio/video data processing apparatus in a third embodiment of the present invention. The apparatus is applied to a multimedia commanding and scheduling system, may execute the audio/video data processing method of any embodiment of the present invention, and may be implemented in software and/or hardware. Referring to fig. 13, the apparatus mainly includes: an audio and video data input channel establishing module 131, an audio and video data receiving module 132, a data analyzing module 133, a processing result acquiring module 134 and a processing result sending module 135.
The audio/video data input channel establishing module 131 is configured to establish and open at least one audio/video data input channel according to a conference establishing request of the controller;
the audio/video data receiving module 132 is configured to receive, through an audio/video data input channel, audio/video data sent by at least one video conference terminal via the controller;
the data analysis module 133 is configured to perform data analysis on the audio/video data to obtain at least one service logic corresponding to the audio/video data;
the processing result obtaining module 134 is configured to invoke, according to the service logic, a media processing service unit corresponding to the service logic to process the matched service logic, so as to obtain an audio/video data processing result;
and the processing result sending module 135 is configured to send the processing result of the audio and video data to each video conference terminal through the controller through the audio and video data output channel.
In the scheme of this embodiment, the audio and video data input channel establishing module establishes and starts at least one audio and video data input channel according to a conference establishing request of the controller; the audio and video data receiving module receives, through the audio and video data input channel, audio and video data sent by at least one video conference terminal via the controller; the data analysis module performs data analysis on the audio and video data to obtain at least one service logic corresponding to the audio and video data; the processing result acquisition module calls a media processing service unit corresponding to the service logic to process the matched service logic, so as to obtain an audio and video data processing result; and the processing result sending module sends the audio and video data processing result to each video conference terminal via the controller through the audio and video data output channel. In this way, audio and video data can be exchanged between the multimedia command scheduling system and a third-party video conference system without disconnecting the video conference terminals from the controller and without occupying a large amount of bandwidth resources.
Optionally, the audio/video data input channel establishing module 131 further includes: a session establishing unit for establishing a session according to a conference establishing request of the controller; establishing and configuring at least one video mixer and/or at least one audio mixer in the session according to terminal information of at least one video conference terminal included in the conference establishment request; and establishing and opening at least one audio/video data input channel according to the session.
Optionally, the session establishing unit is further specifically configured to create a video mixer according to the video resolution, the code rate, the frame rate, and the split screen number; and/or create an audio mixer based on terminal information of at least one video conference terminal participating in the session.
Optionally, the audio/video data processing apparatus further includes: an audio and video data input channel starting module, configured to detect whether the audio and video data input channel is successfully established, and if so, start the audio and video data input channel.
Optionally, the audio/video data processing apparatus further includes: a port acquisition module, configured to acquire the controller IP and a port for sending the audio and video data processing result.
Optionally, the port acquisition module further includes: an audio and video data output channel starting unit, configured to establish an audio and video data output channel according to the controller IP and the port for sending the audio and video data processing result, detect whether the audio and video data output channel is established successfully, and start the audio and video data output channel if it is established successfully.
Optionally, the audio/video data receiving module 132 is further specifically configured to stop using the audio/video data input channel and delete the audio/video data input channel when the audio/video data input channel is disconnected.
Optionally, the processing result sending module 135 is further specifically configured to, when detecting that the audio/video data output channel is disconnected, stop using the audio/video data output channel, and delete the audio/video data output channel.
The audio and video data processing device provided by the embodiment of the invention can execute the audio and video data processing method provided by any embodiment of the invention, and has the corresponding functional modules and beneficial effects of the execution method.
Example four
Fig. 14 is a schematic structural diagram of a computer apparatus according to a fourth embodiment of the present invention, as shown in fig. 14, the computer apparatus includes a processor 140, a memory 141, an input device 142, and an output device 143; the number of processors 140 in the computer device may be one or more, and one processor 140 is taken as an example in fig. 14; the processor 140, the memory 141, the input device 142 and the output device 143 in the computer apparatus may be connected by a bus or other means, and the bus connection is exemplified in fig. 14.
The memory 141 is used as a computer-readable storage medium, and can be used to store software programs, computer-executable programs, and modules, such as program instructions/modules corresponding to the audio/video data processing method in the embodiment of the present invention (for example, the audio/video data input channel establishing module 131, the audio/video data receiving module 132, the data analyzing module 133, the processing result obtaining module 134, and the processing result sending module 135 in the audio/video data processing apparatus). The processor 140 executes various functional applications and data processing of the computer device by running software programs, instructions and modules stored in the memory 141, that is, implements the above-described audio and video data processing method.
The memory 141 may mainly include a storage program area and a storage data area, wherein the storage program area may store an operating system, an application program required for at least one function; the storage data area may store data created according to the use of the terminal, and the like. Further, the memory 141 may include a high speed random access memory, and may also include a non-volatile memory, such as at least one magnetic disk storage device, flash memory device, or other non-volatile solid state storage device. In some examples, memory 141 may further include memory located remotely from processor 140, which may be connected to a computer device via a network. Examples of such networks include, but are not limited to, the internet, intranets, local area networks, mobile communication networks, and combinations thereof.
The input device 142 may be used to receive input numeric or character information and generate key signal inputs related to user settings and function controls of the computer apparatus. The output device 143 may include a display device such as a display screen.
Example five
An embodiment of the present invention further provides a storage medium containing computer-executable instructions, where the computer-executable instructions are executed by a computer processor to perform an audio and video data processing method, where the method includes:
establishing and starting at least one audio and video data input channel according to a conference establishment request of a controller;
receiving audio and video data sent by at least one video conference terminal through a controller through an audio and video data input channel;
performing data analysis on the audio and video data to obtain at least one service logic corresponding to the audio and video data;
calling a media processing service unit corresponding to the service logic according to the service logic to process the matched service logic to obtain an audio and video data processing result;
and transmitting the audio and video data processing result to each video conference terminal through the audio and video data output channel.
Of course, the storage medium provided by the embodiment of the present invention contains computer-executable instructions, and the computer-executable instructions are not limited to the operations of the method described above, and may also perform related operations in the audio and video data processing method provided by any embodiment of the present invention.
From the above description of the embodiments, it is obvious for those skilled in the art that the present invention can be implemented by software and necessary general hardware, and certainly, can also be implemented by hardware, but the former is a better embodiment in many cases. Based on such understanding, the technical solutions of the present invention may be embodied in the form of a software product, which may be stored in a computer-readable storage medium, such as a floppy disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), a FLASH Memory (FLASH), a hard disk or an optical disk of a computer, and includes several instructions for enabling a computer device (which may be a personal computer, a server, or a network device) to execute the methods according to the embodiments of the present invention.
It should be noted that the units and modules included in the embodiment of the audio and video data processing apparatus are divided only according to functional logic; the division is not limited to the one described above, as long as the corresponding functions can be implemented. In addition, the specific names of the functional units serve only to distinguish them from one another and do not limit the protection scope of the present invention.
It is to be noted that the foregoing is only illustrative of the preferred embodiments of the present invention and the technical principles employed. It will be understood by those skilled in the art that the present invention is not limited to the particular embodiments described herein, but is capable of various obvious changes, rearrangements and substitutions as will now become apparent to those skilled in the art without departing from the scope of the invention. Therefore, although the present invention has been described in greater detail by the above embodiments, the present invention is not limited to the above embodiments, and may include other equivalent embodiments without departing from the spirit of the present invention, and the scope of the present invention is determined by the scope of the appended claims.

Claims (10)

1. An audio and video data processing method, applied to a multimedia command and dispatch system, wherein the multimedia command and dispatch system is connected with a controller in a third-party video conference system and the controller is connected with at least one video conference terminal, characterized in that the method comprises:
establishing and starting at least one audio and video data input channel according to a conference establishment request of a controller;
receiving, through the audio and video data input channel, audio and video data sent by the at least one video conference terminal via the controller;
performing data analysis on the audio and video data to obtain at least one service logic corresponding to the audio and video data;
calling, according to the service logic, the media processing service unit corresponding to that service logic to process the matched service logic, so as to obtain an audio and video data processing result;
and sending the audio and video data processing result to each video conference terminal through an audio and video data output channel.
2. The method of claim 1, wherein establishing and starting at least one audio and video data input channel according to the conference establishment request of the controller further comprises:
establishing a session according to the conference establishment request of the controller;
establishing and configuring at least one video mixer and/or at least one audio mixer in the session according to the terminal information of the at least one video conference terminal included in the conference establishment request;
and establishing and opening at least one audio and video data input channel according to the session.
3. The method according to claim 2, wherein establishing and configuring at least one video mixer and/or at least one audio mixer in the session comprises:
creating the video mixer according to the video resolution, the bit rate, the frame rate and the number of split screens; and/or
creating the audio mixer according to the terminal information of the at least one video conference terminal participating in the session.
4. The method according to any one of claims 1 to 3, wherein, before receiving, through the audio and video data input channel, the audio and video data sent by the at least one video conference terminal via the controller, the method further comprises:
and detecting whether the audio and video data input channel is successfully established, and if so, starting the audio and video data input channel.
5. The method according to claim 1, wherein, before sending the audio and video data processing result to each of the video conference terminals through the audio and video data output channel, the method further comprises:
acquiring the controller IP and a port for sending the audio and video data processing result, creating the audio and video data output channel according to the controller IP and the port, detecting whether the audio and video data output channel is successfully created, and if so, starting the audio and video data output channel.
6. The method of claim 1, wherein receiving, via the audio-video data input channel, audio-video data sent by the at least one video conference terminal via the controller further comprises:
and when the audio and video data input channel is disconnected, stopping using the audio and video data input channel, and deleting the audio and video data input channel.
7. The method of claim 1, wherein sending the audio and video data processing result to each of the video conference terminals through the audio and video data output channel further comprises:
and when the audio and video data output channel is detected to be disconnected, stopping using the audio and video data output channel, and deleting the audio and video data output channel.
8. An audio and video data processing device, applied to a multimedia command and dispatch system, characterized by comprising:
the audio and video data input channel establishing module is used for establishing and starting at least one audio and video data input channel according to a conference establishing request of the controller;
the audio and video data receiving module is used for receiving, through the audio and video data input channel, the audio and video data sent by the at least one video conference terminal via the controller;
the data analysis module is used for performing data analysis on the audio and video data to obtain at least one service logic corresponding to the audio and video data;
the processing result acquisition module is used for calling, according to the service logic, the media processing service unit corresponding to that service logic to process the matched service logic, so as to obtain an audio and video data processing result;
and the processing result sending module is used for sending the audio and video data processing result to each video conference terminal, via the controller, through an audio and video data output channel.
9. A computer device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, characterized in that the processor implements the audio and video data processing method according to any one of claims 1 to 7 when executing the program.
10. A storage medium containing computer-executable instructions which, when executed by a computer processor, perform the audio and video data processing method of any one of claims 1 to 7.
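The session setup of claims 2 and 3 and the channel checks of claims 4, 6 and 7 can be illustrated with a short sketch. It is a minimal, non-authoritative illustration under stated assumptions: the two mixers named in the claims are assumed here to be a video (split-screen) mixer and an audio mixer, and every class, field and function name below (`Session`, `VideoMixer`, `AudioMixer`, `open_channel`, `reap_disconnected`, `create_session`, the `connected` flag, the request keys) is hypothetical rather than taken from the patent.

```python
from dataclasses import dataclass, field
from enum import Enum
from typing import Dict, List, Optional


class ChannelState(Enum):
    CREATED = "created"
    OPEN = "open"
    DELETED = "deleted"


@dataclass
class Channel:
    """Hypothetical channel record; 'connected' simulates the network link."""
    channel_id: str
    connected: bool = True
    state: ChannelState = ChannelState.CREATED


@dataclass
class VideoMixer:
    # Assumed parameters, following the list in claim 3.
    resolution: str
    bit_rate_kbps: int
    frame_rate: int
    split_screens: int


@dataclass
class AudioMixer:
    # Assumed to be built from the participating terminals (claim 3).
    terminals: List[str]


@dataclass
class Session:
    conference_id: str
    video_mixer: Optional[VideoMixer] = None
    audio_mixer: Optional[AudioMixer] = None
    channels: Dict[str, Channel] = field(default_factory=dict)

    def open_channel(self, channel_id: str, created_ok: bool = True) -> Optional[Channel]:
        """Start a channel only after confirming it was created (claim 4)."""
        if not created_ok:
            return None
        channel = Channel(channel_id)
        channel.state = ChannelState.OPEN
        self.channels[channel_id] = channel
        return channel

    def reap_disconnected(self) -> List[str]:
        """Stop using and delete channels whose link dropped (claims 6 and 7)."""
        dropped = [cid for cid, ch in self.channels.items() if not ch.connected]
        for cid in dropped:
            self.channels[cid].state = ChannelState.DELETED
            del self.channels[cid]
        return dropped


def create_session(request: dict) -> Session:
    """Build a session and its mixers from a conference establishment request."""
    session = Session(request["conference_id"])
    video = request.get("video")
    if video:
        session.video_mixer = VideoMixer(**video)
    terminals = request.get("terminals", [])
    if terminals:
        session.audio_mixer = AudioMixer(list(terminals))
    session.open_channel("input")
    return session
```

Keeping the mixers optional mirrors the "and/or" wording of claims 2 and 3: a session may carry a video mixer, an audio mixer, or both. The output channel of claim 5, which is created from the controller IP and port, is omitted to keep the sketch short.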
CN201911382904.4A 2019-12-27 2019-12-27 Audio and video data processing method and device, computer equipment and storage medium Pending CN111083428A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201911382904.4A CN111083428A (en) 2019-12-27 2019-12-27 Audio and video data processing method and device, computer equipment and storage medium

Publications (1)

Publication Number Publication Date
CN111083428A true CN111083428A (en) 2020-04-28

Family

ID=70318921

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201911382904.4A Pending CN111083428A (en) 2019-12-27 2019-12-27 Audio and video data processing method and device, computer equipment and storage medium

Country Status (1)

Country Link
CN (1) CN111083428A (en)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102246458A (en) * 2008-12-15 2011-11-16 微软公司 Video conferencing subscription using multiple bit rate streams
US20150381936A1 (en) * 2010-04-27 2015-12-31 Lifesize Communications, Inc. Recording a Videoconference Using Separate Video
CN105704421A (en) * 2016-03-16 2016-06-22 国网山东省电力公司信息通信公司 Video conference main and parallel session networking architecture and method
CN109922300A (en) * 2019-03-14 2019-06-21 视联动力信息技术股份有限公司 A kind of method and server of view networking MCU invitation internet MCU meeting
CN110149305A (en) * 2019-04-01 2019-08-20 视联动力信息技术股份有限公司 A kind of method and transfer server of the multi-party playing audio-video based on view networking

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112866619A (en) * 2021-01-05 2021-05-28 浙江大学 Teleconference control method and device, electronic equipment and storage medium
CN112866619B (en) * 2021-01-05 2021-12-28 浙江大学 Teleconference control method and device, electronic equipment and storage medium

Similar Documents

Publication Publication Date Title
KR101205386B1 (en) Audio processing method, system and control server
US8830294B2 (en) Method and system for video conference control, videoconferencing network equipment, and videoconferencing site
KR100880150B1 (en) Multi-point video conference system and media processing method thereof
US7257641B1 (en) Multipoint processing unit
CN110971863B (en) Multi-point control unit cross-area conference operation method, device, equipment and system
CN110062191B (en) Multi-party group meeting method and server based on video network
WO2016184001A1 (en) Video monitoring processing method and apparatus
CN110475094B (en) Video conference processing method and device and readable storage medium
US9825997B2 (en) Mediator for optimizing the transmission of media contents between a multimedia resource function and a plurality of terminals
KR20100121384A (en) System and method for providing service related to telephone to a plurality of devices using upnp in the home network
WO2016019775A1 (en) Conference migration method, device and system
US20150237086A1 (en) Local Media Rendering
CN111049850A (en) Physical isolation method of video monitoring system and video monitoring system
CN112511783A (en) Mixed display method and device of audio and video stream, server and storage medium
CN111131759B (en) Real-time multimedia transmission system and use method thereof
TWI495345B (en) Video conference system and method using the same
US9264662B2 (en) Chat preauthorization
CN111083428A (en) Audio and video data processing method and device, computer equipment and storage medium
US9936164B2 (en) Media control method and device
US10291882B2 (en) Call processing method and gateway
US20220391452A1 (en) Method for conducting an audio and/or video conference
CN115334059A (en) Audio and video intercommunication method, device, equipment and storage medium
US11102451B2 (en) Videoconferencing server for providing multi-screen videoconferencing by using a plurality of videoconferencing terminals and method therefor
CN109862305B (en) Method and device for adjusting stream during meeting of video network
WO2016101623A1 (en) Remote interaction method and device in multipoint audio and video communication

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20200428