CN110062191B - Multi-party group meeting method and server based on video network - Google Patents

Multi-party group meeting method and server based on video network Download PDF

Info

Publication number
CN110062191B
CN110062191B CN201910310303.6A CN201910310303A CN110062191B CN 110062191 B CN110062191 B CN 110062191B CN 201910310303 A CN201910310303 A CN 201910310303A CN 110062191 B CN110062191 B CN 110062191B
Authority
CN
China
Prior art keywords
server
conference
video
terminal
sent
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201910310303.6A
Other languages
Chinese (zh)
Other versions
CN110062191A (en
Inventor
乔金龙
幺学佳
萧辉
李学军
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Visionvera Information Technology Co Ltd
Original Assignee
Visionvera Information Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Visionvera Information Technology Co Ltd filed Critical Visionvera Information Technology Co Ltd
Priority to CN201910310303.6A priority Critical patent/CN110062191B/en
Publication of CN110062191A publication Critical patent/CN110062191A/en
Application granted granted Critical
Publication of CN110062191B publication Critical patent/CN110062191B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L65/00Network arrangements, protocols or services for supporting real-time applications in data packet communication
    • H04L65/1066Session management
    • H04L65/1069Session establishment or de-establishment
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L65/00Network arrangements, protocols or services for supporting real-time applications in data packet communication
    • H04L65/1066Session management
    • H04L65/1101Session protocols
    • H04L65/1104Session initiation protocol [SIP]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L65/00Network arrangements, protocols or services for supporting real-time applications in data packet communication
    • H04L65/40Support for services or applications
    • H04L65/403Arrangements for multi-party communication, e.g. for conferences
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00Television systems
    • H04N7/14Systems for two-way working
    • H04N7/15Conference systems

Abstract

The embodiment of the application discloses a multi-party conference method and a server based on video networking. The method comprises the following steps: a conference initiation protocol SIP server receives a first conference access request sent by a video networking terminal through a transit server and a second conference access request sent by a conference terminal; the first conference joining request and the second conference joining request carry a target conference identifier and the SIP server IP; sending a first conference joining response to the video network terminal through the transfer server according to the target conference identifier and the SIP server IP, and sending a second conference joining response to the conference terminal; the first conference response is used for establishing a first data transmission channel with the video network terminal, and the second conference response is used for establishing a second data transmission channel with the conference terminal. The data intercommunication between the video networking terminal and the Internet terminal is realized, and the data intercommunication is not limited by the difference of protocol layers and the number of the group objects.

Description

Multi-party group meeting method and server based on video network
Technical Field
The present application relates to the field of video networking technologies, and in particular, to a multi-party conferencing method and a server based on video networking.
Background
With the rapid development of the video networking, video conferences, video teaching and the like based on the video networking are widely popularized in the aspects of life, work, learning and the like of users.
The video network terminal device is a device operating at a link layer, and data of the video network terminal device is also transmitted in a network where the video network terminal device is located by using a two-layer network protocol, that is, high-quality video telephone audio data and video data transmission among a plurality of video network devices is realized through a V2V protocol (a video network To video network protocol). And common visual terminal equipment performs data interaction on three layers of the network based on protocols such as TCP/UDP/IP and the like.
How to implement the service intercommunication between the terminal devices based on different protocols is a problem to be solved urgently.
Disclosure of Invention
In view of the above, embodiments of the present application are proposed to provide a multi-party video networking based method and a corresponding SIP server that overcome or at least partially solve the above problems.
In a first aspect, an embodiment of the present application discloses a method for multiparty group meeting based on a video network, where the method includes:
a conference initiation protocol SIP server receives a first conference access request sent by a video networking terminal through a transit server and a second conference access request sent by a conference terminal; the first conference entering request and the second conference entering request carry a target conference identifier and the SIP server IP;
sending a first conference joining response to the video network terminal through the transfer server according to the target conference identifier and the SIP server IP, and sending a second conference joining response to the conference terminal; the first conference response is used for establishing a first data transmission channel with the video network terminal, and the second conference response is used for establishing a second data transmission channel with the conference terminal.
Optionally, the transfer server includes a video networking server and a collaboration server;
the first meeting-entering request is sent by the video network terminal, passes through the video network server, is subjected to protocol conversion by the protocol conversion server and is sent to the SIP server;
the first conference response is sent by the SIP server, protocol conversion is carried out by the protocol conversion server, and the first conference response is sent to the video network terminal through the video network server.
Optionally, the first data transmission channel is configured to transmit a mixed data stream sent by the SIP server and/or a first data stream sent by the video networking terminal via the transit server;
the second data transmission channel is used for transmitting the mixed data stream sent by the SIP server and/or a second data stream sent by the conference terminal;
and the mixed data stream is obtained by mixing and screen-mixing the received first data stream and the second data stream by the SIP server.
Optionally, the transfer server includes a video networking server and a collaboration server;
the first data transmission channel is used for transmitting the mixed data stream sent by the SIP server and/or the first data stream sent by the video network terminal via the transit server, and includes:
the first data transmission channel is used for transmitting the mixed data stream which is sent by the SIP server, subjected to protocol conversion by the protocol conversion server and sent to the video network terminal through the video network server; and/or
The first data transmission channel is used for transmitting the first data stream which is sent by the video network terminal, passes through the video network server and is subjected to protocol conversion by the protocol conversion server.
In a second aspect, an embodiment of the present application discloses a server, including:
the conference initiating protocol SIP server is used for receiving a first conference access request sent by the video networking terminal through the transit server and a second conference access request sent by the conference terminal; the first conference entering request and the second conference entering request carry a target conference identifier and the SIP server IP;
the message sending module is used for sending a first conference-joining response to the video network terminal through the transfer server according to the target conference identifier and the SIP server IP and sending a second conference-joining response to the conference terminal; the first conference response is used for establishing a first data transmission channel with the video network terminal, and the second conference response is used for establishing a second data transmission channel with the conference terminal.
Optionally, the transfer server includes a video networking server and a collaboration server;
the first meeting-entering request is sent by the video network terminal, passes through the video network server, is subjected to protocol conversion by the protocol conversion server and is sent to the SIP server;
the first conference response is sent by the SIP server, protocol conversion is carried out by the protocol conversion server, and the first conference response is sent to the video network terminal through the video network server.
Optionally, the first data transmission channel is configured to transmit a mixed data stream sent by the SIP server and/or a first data stream sent by the video networking terminal via the transit server;
the second data transmission channel is used for transmitting the mixed data stream sent by the SIP server and/or a second data stream sent by the conference terminal;
and the mixed data stream is obtained by mixing and screen-mixing the received first data stream and the second data stream by the SIP server.
Optionally, the transfer server includes a video networking server and a collaboration server;
the first data transmission channel is used for transmitting the mixed data stream sent by the SIP server and/or the first data stream sent by the video network terminal via the transit server, and includes:
the first data transmission channel is used for transmitting the mixed data stream which is sent by the SIP server, subjected to protocol conversion by the protocol conversion server and sent to the video network terminal through the video network server; and/or
The first data transmission channel is used for transmitting the first data stream which is sent by the video network terminal, passes through the video network server and is subjected to protocol conversion by the protocol conversion server.
In a third aspect, an embodiment of the present application further discloses a computer device, which includes a memory, a processor, and a computer program stored on the memory and executable on the processor, and when the processor executes the computer program, the method of any one of the first aspect is implemented.
In a fourth aspect, an embodiment of the present application further discloses a computer-readable storage medium, where a computer program for executing any one of the methods in the first aspect is stored in the computer-readable storage medium.
According to the multiparty conference method based on the video network, the SIP server receives a first conference access request sent by a video network terminal through a transfer server and a second conference access request sent by a conference terminal; the first conference entering request and the second conference entering request carry a target conference identifier and the SIP server IP; sending a first conference joining response to the video network terminal through the transfer server according to the target conference identifier and the SIP server IP, and sending a second conference joining response to the conference terminal; the first conference response is used for establishing a first data transmission channel with the video network terminal, and the second conference response is used for establishing a second data transmission channel with the conference terminal. Therefore, data intercommunication between the video networking terminal and the Internet terminal is realized, limitation caused by protocol layer difference is avoided, limitation of the number of group meeting objects is avoided, and application of the video networking is expanded.
Drawings
Fig. 1 is a schematic networking diagram of a video network provided in an embodiment of the present application;
fig. 2 is a schematic hardware structure diagram of a node server according to an embodiment of the present application;
fig. 3 is a schematic hardware structure diagram of an access switch according to an embodiment of the present application;
fig. 4 is a schematic hardware structure diagram of an ethernet protocol conversion gateway according to an embodiment of the present application;
FIG. 5 is a system architecture diagram provided by an embodiment of the present application;
FIG. 6 is a flowchart illustrating steps of a multi-party video networking-based conferencing method according to an embodiment of the present application;
FIG. 7 is a flow chart of an embodiment of a method provided by an embodiment of the present application;
FIG. 8 is a diagram illustrating an example of an embodiment of a method provided by an embodiment of the present application;
fig. 9 is a block diagram of a server according to an embodiment of the present application.
Detailed Description
In order to make the aforementioned objects, features and advantages of the present application more comprehensible, the present application is described in further detail with reference to the accompanying drawings and the detailed description.
The video networking is an important milestone for network development, is a real-time network, can realize high-definition video real-time transmission, and pushes a plurality of internet applications to high-definition video, and high-definition faces each other.
The video networking adopts a real-time high-definition video exchange technology, can integrate required services such as dozens of services of video, voice, pictures, characters, communication, data and the like on a system platform on a network platform, such as high-definition video conference, video monitoring, intelligent monitoring analysis, emergency command, digital broadcast television, delayed television, network teaching, live broadcast, VOD on demand, television mail, Personal Video Recorder (PVR), intranet (self-office) channels, intelligent video broadcast control, information distribution and the like, and realizes high-definition quality video broadcast through a television or a computer.
To better understand the embodiments of the present application, the following description refers to the internet of view:
some of the technologies applied in the video networking are as follows:
network Technology (Network Technology)
Network technology innovation in video networking has improved over traditional Ethernet (Ethernet) to face the potentially enormous video traffic on the network. Unlike pure network Packet Switching (Packet Switching) or network Circuit Switching (Circuit Switching), the internet of vision technology employs network Packet Switching to satisfy the demand of Streaming (translated into Streaming, and continuous broadcasting, which is a data transmission technology, converting received data into a stable and continuous stream, and continuously transmitting the stream, so that the sound heard by the user or the image seen by the user is very smooth, and the user can start browsing on the screen before the whole data is transmitted). The video networking technology has the advantages of flexibility, simplicity and low price of packet switching, and simultaneously has the quality and safety guarantee of circuit switching, thereby realizing the seamless connection of the whole network switching type virtual circuit and the data format.
Switching Technology (Switching Technology)
The video network adopts two advantages of asynchronism and packet switching of the Ethernet, eliminates the defects of the Ethernet on the premise of full compatibility, has end-to-end seamless connection of the whole network, is directly communicated with a user terminal, and directly bears an IP data packet. The user data does not require any format conversion across the entire network. The video networking is a higher-level form of the Ethernet, is a real-time exchange platform, can realize the real-time transmission of the whole-network large-scale high-definition video which cannot be realized by the existing Internet, and pushes a plurality of network video applications to high-definition and unification.
Server Technology (Server Technology)
The server technology on the video networking and unified video platform is different from the traditional server, the streaming media transmission of the video networking and unified video platform is established on the basis of connection orientation, the data processing capacity of the video networking and unified video platform is independent of flow and communication time, and a single network layer can contain signaling and data transmission. For voice and video services, the complexity of video networking and unified video platform streaming media processing is much simpler than that of data processing, and the efficiency is greatly improved by more than one hundred times compared with that of a traditional server.
Storage Technology (Storage Technology)
The super-high speed storage technology of the unified video platform adopts the most advanced real-time operating system in order to adapt to the media content with super-large capacity and super-large flow, the program information in the server instruction is mapped to the specific hard disk space, the media content is not passed through the server any more, and is directly sent to the user terminal instantly, and the general waiting time of the user is less than 0.2 second. The optimized sector distribution greatly reduces the mechanical motion of the magnetic head track seeking of the hard disk, the resource consumption only accounts for 20% of that of the IP internet of the same grade, but concurrent flow which is 3 times larger than that of the traditional hard disk array is generated, and the comprehensive efficiency is improved by more than 10 times.
Network Security Technology (Network Security Technology)
The structural design of the video network completely eliminates the network security problem troubling the internet structurally by the modes of independent service permission control each time, complete isolation of equipment and user data and the like, generally does not need antivirus programs and firewalls, avoids the attack of hackers and viruses, and provides a structural carefree security network for users.
Service Innovation Technology (Service Innovation Technology)
The unified video platform integrates services and transmission, and is not only automatically connected once whether a single user, a private network user or a network aggregate. The user terminal, the set-top box or the PC are directly connected to the unified video platform to obtain various multimedia video services in various forms. The unified video platform adopts a menu type configuration table mode to replace the traditional complex application programming, can realize complex application by using very few codes, and realizes infinite new service innovation.
Networking of the video network is as follows:
the video network is a centralized control network structure, and the network can be a tree network, a star network, a ring network and the like, but on the basis of the centralized control node, the whole network is controlled by the centralized control node in the network.
As shown in fig. 1, the video network is divided into an access network and a metropolitan network.
The devices of the access network part can be mainly classified into 3 types: node server, access switch, terminal (including various set-top boxes, coding boards, memories, etc.). The node server is connected to an access switch, which may be connected to a plurality of terminals and may be connected to an ethernet network.
The node server is a node which plays a centralized control function in the access network and can control the access switch and the terminal. The node server can be directly connected with the access switch or directly connected with the terminal.
Similarly, devices of the metropolitan network portion may also be classified into 3 types: a metropolitan area server, a node switch and a node server. The metro server is connected to a node switch, which may be connected to a plurality of node servers.
The node server is a node server of the access network part, namely the node server belongs to both the access network part and the metropolitan area network part.
The metropolitan area server is a node which plays a centralized control function in the metropolitan area network and can control a node switch and a node server. The metropolitan area server can be directly connected with the node switch or directly connected with the node server.
Therefore, the whole video network is a network structure with layered centralized control, and the network controlled by the node server and the metropolitan area server can be in various structures such as tree, star and ring.
The access network part can form a unified video platform (circled part), and a plurality of unified video platforms can form a video network; each unified video platform may be interconnected via metropolitan area and wide area video networking.
Video networking device classification
1.1 devices in the video network of the embodiment of the present application can be mainly classified into 3 types: servers, switches (including ethernet gateways), terminals (including various set-top boxes, code boards, memories, etc.). The video network as a whole can be divided into a metropolitan area network (or national network, global network, etc.) and an access network.
1.2 wherein the devices of the access network part can be mainly classified into 3 types: node servers, access switches (including ethernet gateways), terminals (including various set-top boxes, code boards, memories, etc.).
The specific hardware structure of each access network device is as follows:
a node server:
as shown in fig. 2, the system mainly includes a network interface module 201, a switching engine module 202, a CPU module 203, and a disk array module 204.
The network interface module 201, the CPU module 203, and the disk array module 204 all enter the switching engine module 202; the switching engine module 202 performs an operation of looking up the address table 205 on the incoming packet, thereby obtaining the direction information of the packet; and stores the packet in a queue of the corresponding packet buffer 206 based on the packet's steering information; if the queue of the packet buffer 206 is nearly full, it is discarded; the switching engine module 202 polls all packet buffer queues for forwarding if the following conditions are met: 1) the port send buffer is not full; 2) the queue packet counter is greater than zero. The disk array module 204 mainly implements control over the hard disk, including initialization, read-write, and other operations on the hard disk; the CPU module 203 is mainly responsible for protocol processing with an access switch and a terminal (not shown in the figure), configuring an address table 205 (including a downlink protocol packet address table, an uplink protocol packet address table, and a data packet address table), and configuring the disk array module 204.
The access switch:
as shown in fig. 3, the network interface module (downstream network interface module 301, upstream network interface module 302), the switching engine module 303, and the CPU module 304 are mainly included.
Wherein, the packet (uplink data) coming from the downlink network interface module 301 enters the packet detection module 305; the packet detection module 305 detects whether the Destination Address (DA), the Source Address (SA), the packet type, and the packet length of the packet meet the requirements, if so, allocates a corresponding stream identifier (stream-id) and enters the switching engine module 303, otherwise, discards the stream identifier; the packet (downstream data) coming from the upstream network interface module 302 enters the switching engine module 303; the incoming data packet of the CPU module 304 enters the switching engine module 303; the switching engine module 303 performs an operation of looking up the address table 306 on the incoming packet, thereby obtaining the direction information of the packet; if the packet entering the switching engine module 303 is from the downstream network interface to the upstream network interface, the packet is stored in the queue of the corresponding packet buffer 307 in association with the stream-id; if the queue of the packet buffer 307 is nearly full, it is discarded; if the packet entering the switching engine module 303 is not from the downlink network interface to the uplink network interface, the data packet is stored in the queue of the corresponding packet buffer 307 according to the guiding information of the packet; if the queue of the packet buffer 307 is nearly full, it is discarded.
The switching engine module 303 polls all packet buffer queues, which in this embodiment is divided into two cases:
if the queue is from the downlink network interface to the uplink network interface, the following conditions are met for forwarding: 1) the port send buffer is not full; 2) the queued packet counter is greater than zero; 3) and obtaining the token generated by the code rate control module.
If the queue is not from the downlink network interface to the uplink network interface, the following conditions are met for forwarding: 1) the port send buffer is not full; 2) the queue packet counter is greater than zero.
The rate control module 208 is configured by the CPU module 204, and generates tokens for packet buffer queues from all downstream network interfaces to upstream network interfaces at programmable intervals to control the rate of upstream forwarding.
The CPU module 304 is mainly responsible for protocol processing with the node server, configuration of the address table 306, and configuration of the code rate control module 308.
Ethernet protocol conversion gateway
As shown in fig. 4, the apparatus mainly includes a network interface module (a downlink network interface module 401 and an uplink network interface module 402), a switching engine module 403, a CPU module 404, a packet detection module 405, a rate control module 408, an address table 406, a packet buffer 407, a MAC adding module 409, and a MAC deleting module 410.
Wherein, the data packet coming from the downlink network interface module 401 enters the packet detection module 405; the packet detection module 405 detects whether the ethernet MAC DA, the ethernet MAC SA, the ethernet length or frame type, the video network destination address DA, the video network source address SA, the video network packet type, and the packet length of the packet meet the requirements, and if so, allocates a corresponding stream identifier (stream-id); then, the MAC deletion module 410 subtracts MAC DA, MAC SA, length or frame type (2byte) and enters the corresponding receiving buffer, otherwise, discards it;
the downlink network interface module 401 detects the sending buffer of the port, and if there is a packet, obtains the ethernet MAC DA of the corresponding terminal according to the video networking destination address DA of the packet, adds the ethernet MAC DA of the terminal, the MAC SA of the ethernet coordination gateway, and the ethernet length or frame type, and sends the packet.
The other modules in the ethernet protocol gateway function similarly to the access switch.
A terminal:
the system mainly comprises a network interface module, a service processing module and a CPU module; for example, the set-top box mainly comprises a network interface module, a video and audio coding and decoding engine module and a CPU module; the coding board mainly comprises a network interface module, a video and audio coding engine module and a CPU module; the memory mainly comprises a network interface module, a CPU module and a disk array module.
1.3 devices of the metropolitan area network part can be mainly classified into 3 types: node server, node exchanger, metropolitan area server. The node switch mainly comprises a network interface module, a switching engine module and a CPU module; the metropolitan area server mainly comprises a network interface module, a switching engine module and a CPU module.
2. Video networking packet definition
2.1 Access network packet definition
The data packet of the access network mainly comprises the following parts: destination Address (DA), Source Address (SA), reserved bytes, payload (pdu), CRC.
As shown in the following table, the data packet of the access network mainly includes the following parts:
DA SA Reserved Payload CRC
the Destination Address (DA) is composed of 8 bytes (byte), the first byte represents the type of the data packet (e.g. various protocol packets, multicast data packets, unicast data packets, etc.), there are at most 256 possibilities, the second byte to the sixth byte are metropolitan area network addresses, and the seventh byte and the eighth byte are access network addresses.
The Source Address (SA) is also composed of 8 bytes (byte), defined as the same as the Destination Address (DA).
The reserved byte consists of 2 bytes.
The payload part has different lengths according to types of different datagrams, and is 64 bytes if the type of the datagram is a variety of protocol packets, or is 1056 bytes if the type of the datagram is a unicast packet, but is not limited to the above 2 types.
The CRC consists of 4 bytes and is calculated in accordance with the standard ethernet CRC algorithm.
2.2 metropolitan area network packet definition
The topology of a metropolitan area network is a graph and there may be 2, or even more than 2, connections between two devices, i.e., there may be more than 2 connections between a node switch and a node server, a node switch and a node switch, and a node switch and a node server. However, the metro network address of the metro network device is unique, and in order to accurately describe the connection relationship between the metro network devices, parameters are introduced in the embodiment of the present application: a label to uniquely describe a metropolitan area network device.
In this specification, the definition of the Label is similar to that of a Label of Multi-Protocol Label switching (MPLS), and assuming that there are two connections between a device a and a device B, there are 2 labels for a packet from the device a to the device B, and 2 labels for a packet from the device B to the device a. The label is classified into an incoming label and an outgoing label, and assuming that the label (incoming label) of the packet entering the device a is 0x0000, the label (outgoing label) of the packet leaving the device a may become 0x 0001. The network access process of the metro network is a network access process under centralized control, that is, address allocation and label allocation of the metro network are both dominated by the metro server, and the node switch and the node server are both passively executed, which is different from label allocation of MPLS, and label allocation of MPLS is a result of mutual negotiation between the switch and the server.
As shown in the following table, the data packet of the metro network mainly includes the following parts:
DA SA Reserved label (R) Payload CRC
Namely Destination Address (DA), Source Address (SA), Reserved byte (Reserved), tag, payload (pdu), CRC. The format of the tag may be defined by reference to the following: the tag is 32 bits with the upper 16 bits reserved and only the lower 16 bits used, and its position is between the reserved bytes and payload of the packet.
The video network protocol is a communication protocol in the video network, and is mainly a protocol developed based on a data link information exchange function of a two-layer switch.
Session Initiation Protocol (SIP) is an application-layer signaling control Protocol for creating, modifying, and releasing sessions for one or more participants. These sessions may be Internet multimedia conferences, IP telephony or multimedia distribution. The participants of the session may communicate by multicast (multicast), mesh unicast (unicast), or a mixture of both.
In the public network video conference, a public IP address of the Internet is used as an IP of a video Micro Control Unit (MCU), and then the conference is established on the MCU.
Currently, with the popularization and development of video networking services across the country, the video networking high-definition video networking interaction technology plays a significant role in various industries. The video networking adopts the most advanced real-time high-definition video exchange technology in the world, realizes the real-time transmission of the high-definition video in the whole network which can not be realized by the Internet at present, integrates dozens of services such as high-definition video conferences, video monitoring, remote training, intelligent monitoring and analysis, emergency command, video telephone, live broadcast, television mail, information distribution and the like into a system platform, and realizes the real-time interconnection and intercommunication of high-definition quality video communication through various terminal devices.
The embodiment of the application is a technical scheme for adding the video networking terminal into the internet conference in an active calling mode, the SIP can support the video conference and can also support video monitoring, and compared with the traditional complex H323 protocol, the SIP is a convenient and simple protocol, so that in practical application, the SIP is often used for supporting the video conference.
In the prior art, a video conference terminal under a video networking protocol calls a video conference terminal of an SIP (session initiation protocol) by means of a gateway SIP protocol, the video conference function of the video conference terminal of the SIP is limited to a video conference function of a video networking point-to-point call, and at most, two devices can only be supported to carry out a conference (call connection is carried out aiming at the SIP ID of the other side). This advanced conferencing modality of multipoint group conferencing has not been implemented. Therefore, in order to perform a multi-party conference, a new video network terminal needs to actively call and join the conference created by the SIP MCU server, and talk with a conference terminal already in the conference.
Fig. 5 shows a system architecture to which the embodiment of the present application is applicable, and as shown in fig. 5, the system includes n video networking terminals, a video networking switch, a video networking server, a protocol conversion server (abbreviated as "protocol conversion server"), an internet switch, n conference terminals, and an MCU server.
The video network terminal directly establishes network connection with a video network server through the exchange routing network equipment of the communication line; and the conference terminal network access authorization needs to deploy a protocol conversion server, the protocol conversion server comprises double network cards, one network card of the protocol conversion server is connected with the video network server through the video network switch, and the other network card of the protocol conversion server is connected with the conference terminal through the internet switch. And the video networking terminal is connected with the video networking server through switch equipment and a network cable. The connection between the video network terminal and the conference terminal and the video network server is established through the network architecture, so that the internal and external communication and the data connection between the terminal and the video network server are completed.
For the access of the conference terminal to the video network server, the connection between the terminal and the server needs to be realized by using the protocol conversion server as a medium for intermediate protocol conversion, and the specific implementation process may be as follows: firstly, each protocol conversion server allowed to access the network is registered in the networking server, the registration information of the protocol conversion server is inherent information such as a serial number (including equipment type and equipment identification information), a downlink port number, a mask code interval and the like of the protocol conversion server, and the protocol conversion server which is not registered cannot access the network. The method comprises the following specific steps:
the method comprises the following steps: the server of the video network sends a query packet to each of its ports.
Step two: and after receiving the query packet, the protocol conversion server returns a response packet containing the sequence number of the protocol conversion server to the video network server.
Step three: the video network server searches for the corotation server information corresponding to the serial number in a registration information table, where the corotation server information may specifically include a physical address (Media Access Control, MAC) of a corotation server and an MAC address of a conference terminal bound under the corotation server.
Step four: the video network server sends a network access command to the protocol conversion server, wherein the network access command comprises the address of the protocol conversion server in the video network and the MAC address of the protocol conversion server.
Step five: the protocol conversion server returns a response after receiving the network access command, and the protocol conversion server is accessed to the video network.
Step six: the video network server sends a device state query instruction to the protocol conversion server which is accessed to the network at regular time, and checks whether the protocol conversion server works normally.
Step seven: the video network server informs the protocol conversion server of the MAC address of the conference terminal bound by the protocol conversion server, and the mapping between the MAC address of the conference terminal and the video network address to be allocated to the conference terminal, so that the conference terminal can access the network through the protocol conversion server.
After the conference terminal is accessed to the video networking server, a conference terminal user can send request data to the protocol conversion server, the protocol conversion server carries out analysis processing, an instruction is sent out through one end of a network card communicated with the video networking server, then the video networking server carries out scheduling control on video signals and/or audio signals, the video signals and/or the audio signals are sent back to the protocol conversion server, and the network card connected with the internet is sent out to the conference terminal user.
The video network server can be used for uniformly managing the terminals after network access authorization. For example, the video network server may maintain a terminal netlist of all terminals, where the terminal netlist records the number, name, address, and the like of the terminal, and the video network server performs control scheduling only on the terminals configured in the terminal netlist. When video conference service is initiated, the video network server selects which terminals need to participate from the terminal network access list, and the video conference only serves the terminals.
Fig. 6 illustrates a multi-party conference method based on the video network, which is applicable to a SIP server, according to an embodiment of the present application. The method comprises the following steps:
step 601: the SIP server receives a first conference access request sent by the video networking terminal through the transit server and a second conference access request sent by the conference terminal; and the first conference joining request and the second conference joining request carry a target conference identifier and the SIP server IP.
Step 602: sending a first conference joining response to the video network terminal through the transfer server according to the target conference identifier and the SIP server IP, and sending a second conference joining response to the conference terminal; the first conference response is used for establishing a first data transmission channel with the video network terminal, and the second conference response is used for establishing a second data transmission channel with the conference terminal.
In one possible embodiment, the transfer server comprises a video network server and a corotation server; the first meeting-entering request is sent by the video network terminal, passes through the video network server, is subjected to protocol conversion by the protocol conversion server and is sent to the SIP server; the first conference response is sent by the SIP server, protocol conversion is carried out by the protocol conversion server, and the first conference response is sent to the video network terminal through the video network server.
In a possible implementation manner, the first data transmission channel is used for transmitting a mixed data stream sent by the SIP server and/or a first data stream sent by the video network terminal via the transit server; the second data transmission channel is used for transmitting the mixed data stream sent by the SIP server and/or a second data stream sent by the conference terminal; and the mixed data stream is obtained by mixing and screen-mixing the received first data stream and the second data stream by the SIP server.
In one possible embodiment, the transfer server comprises a video network server and a corotation server; the first data transmission channel is used for transmitting the mixed data stream which is sent by the SIP server, subjected to protocol conversion by the protocol conversion server and sent to the video network terminal through the video network server; and/or the first data transmission channel is used for transmitting the first data stream which is sent by the video network terminal, passes through the video network server and is subjected to protocol conversion by the protocol conversion server.
In the multi-party conference method based on the video network provided by the embodiment of the application, one server can generate a plurality of virtual terminals, each virtual terminal corresponds to one calling channel, and the attribute of each virtual terminal comprises an SIP conference address to be entered. And converting the video networking number into the conference address of the SIP, so that the video networking number can join the call channel of the conference on behalf of the video networking terminal. Further, the video networking number and the video conference address of the SIP are mapped one by one, and the automatic conversion of the video networking number into the video calling address of the SIP is completed.
Taking the internet conference of the video network terminal actively calling and joining the SIP as an example, the video network terminal is used as a calling party. Firstly, a conference room is required to be created on an SIP server, the number of a conference is designated as 101, then a conference terminal under the SIP is started, and a mode of '101 @ SIP server IP' is input in a calling address field to join in the conference. And binding and configuring the virtual terminal video networking number and the conference address to be added in a virtual terminal configuration file of the protocol conversion server, so that a calling channel object can be automatically created to be converted into a protocol conversion conference after the virtual terminal is configured by the SIP server, address mapping is carried out when the calling number is input into the video networking terminal, and the video networking calling signaling is successfully accessed into a conference room after being converted into the SIP calling signaling. Therefore, a plurality of terminals can be supported to carry out the same operation and enter the same SIP conference, and therefore the multi-party conference is realized.
It should be noted that a channel is an abstract concept in a call flow, and is equivalent to transmitting data in a pipe. The channel object is call information and audio/video data which the virtual terminal of the protocol conversion server needs to send to the SIP server through a channel.
Because the code rates and formats of the video signals and/or audio signals output by different devices may not be uniform, the encoding module in the terminal needs to perform encoding and decoding processing: and processing the audio and video data into a unified audio and video code rate source signal. After receiving the network stream sent by the video network server, the video network terminal decodes the network stream to obtain corresponding video and/or audio, and then displays the video and audio on the video network terminal.
In a possible implementation manner, after receiving the source signals sent by the terminal, the video network server may send the source signals to a Multiplexer (MUX) device, mix the source signals through the MUX device to obtain a network stream, and send the network stream back to the video network server through the MUX device. For example, in a video conference system, a conference terminal may send 2 input sources to a video network server, the video network server sends the 2 input sources to a MUX device for mixing, the MUX device returns a mixed network stream to the video network server, and the video network server sends the mixed network stream according to a scheduling request.
The use of MUX devices for mixing multiple audio and video has many advantages, such as: according to the scheduling requirement, multi-path audio and video MUX mixing is carried out, and a multi-program Transport Stream (TS Transport Stream) is generated; the method supports the superposition of the field name into the network flow; according to the scheduling requirement, video screen combination and audio mixing are carried out; the MUX equipment is provided with a plurality of different interfaces for receiving audio and video data and can perform corresponding mixing according to the requirements of the terminal, so that dynamic switching of mixed streams and screen-combined streams is supported; the format of the stream file output by the device can be configured into a TS file, mp4(MPEG-4Part 14) and other formats; the code rate and the resolution of the video after screen closing can be set by self; supporting parallel processing of multiple groups of streams; the output network stream and the recording stream are transmitted to each terminal, so that the functions of conference watching and network recording are realized.
In one implementation, the video network terminal device may be a Set Top Box (STB), generally called a Set Top Box or Set Top Box, which is a device for connecting a tv Set and an external signal source, and converts a compressed digital signal into tv content and displays the tv content on the tv Set. Generally, the set-top box may be connected to a camera and a microphone for collecting multimedia data such as video data and audio data, and may also be connected to a television for playing multimedia data such as video data and audio data.
Therefore, in practical applications, a user can trigger the video network terminal to generate a control service application instruction through some operations in a menu (gtml) file, such as dialing a user number of a set top box of an opposite terminal, and send the control service application instruction to the video network server.
The video network terminal device may further include an external interface, such as a USB interface, an HDMI _ OUTx2 interface, an HDMI _ IN interface, a dongle interface, an RCA (Radio Corporation of American, RC interface), an AV interface, and the like.
Fig. 7 is a diagram showing specific steps of the multi-party conference method according to the embodiment of the present application, where before a conference joining request is made, an environment is first built, a collaboration server is started, and a virtual terminal number and an SIP protocol conference address are configured. The video networking module, the audio mixing and screen mixing module, the conference control module, the RTP encapsulation module and the SIP protocol conversion module belong to a protocol conversion server.
Each call is initiated by a terminal, and then a channel is established for audio and video transmission; the destination address called by using the video networking number is a conference room address of an SIP protocol, and the source address and the destination address need to be associated; the video networking number is the identification of the virtual terminal of the protocol conversion server; the virtual terminal of the protocol conversion server converts the video networking protocol and the SIP protocol, on one hand, the video networking identity needs to be indicated, and on the other hand, the virtual terminal needs to know which SIP conference needs to be entered after the protocol conversion.
Step 701: and the video networking terminal sends a first call request and the audio and video stream of the video networking protocol to the video networking server through the video networking number in the call configuration file.
Step 702: the video networking server forwards the first call request and the audio video stream to the video networking module.
Step 703: because the communication between the internal modules of the protocol conversion server adopts a message mechanism based on the Yate call engine, the video networking module encapsulates the first call request into a Yate type second call request, and the audio and video stream is sent to the SIP protocol conversion module.
Step 704: and the SIP protocol conversion module converts the second call request and the audio and video stream into a third call request under the SIP and sends the third call request to the SIP MCU server.
Step 705: and after receiving the third call request, the SIP MCU server returns the first call response to the SIP coordination module.
Step 706: the SIP protocol module identifies a first meeting invitation message according to the first call response and sends the first meeting invitation message to the conference control module through a Yate message.
Step 707: and the SIP coordination module simultaneously sends the first call response to the video networking module.
Step 708: and after the conference control module is started, an activation message (Yate message) is sent to activate the audio mixing and screen mixing module.
Step 709: after receiving the first call response, the video networking module converts the first call response into a second call response of the video networking protocol and sends the second call response to the video networking server.
Step 710: and the video network server forwards the second call response to the video network terminal.
Step 711: and calling the same SIP conference address on the butted third-party equipment, namely the conference terminal, and accessing the conference.
The above is the conference call flow of the video network terminal and the conference terminal.
Step 712: the SIP MCU server formulates a displayed split screen mode according to the requirements of a user, and performs screen mixing on the audio and video stream to obtain a first audio and video stream; and sending the first audio and video stream to the conference terminal.
Step 713: and the SIP MCU server also sends the first audio and video stream to an RTP encapsulation module of the SIP protocol conversion server.
Step 714: the RTP encapsulation module sends the received first audio and video stream to the mixed screen audio mixing module, the module receives a Yate mixed screen instruction from the conference control module, and the mixed screen audio mixing module carries out coding and decoding to generate a second audio and video stream which can be normally watched by the video network terminal.
Step 715: and the mixed screen audio mixing module sends the second audio and video stream to the video networking module.
Step 716: and similarly, the SIP MCU server performs audio mixing and screen mixing processing on the audio and video streams of the conference terminals entering the conference, and then respectively sends the audio and video streams to the conference terminals participating in the conference.
Step 717 and step 718: and after the video networking module receives the second audio and video stream, packaging the second audio and video stream into a third audio and video stream of the video networking protocol.
Step 719: and the video network module sends the third audio and video stream to the video network server.
Step 720: and the video network server forwards the third audio and video stream to the video network terminal.
The above is the flow of the video networking terminal actively calling and joining in the session of the SIP protocol and carrying out the multipoint conference with the SIP conference terminal.
To more clearly illustrate the multi-party video networking based method provided by the embodiment of the present application, an example of the step flow is now performed based on fig. 8:
step 801: and configuring the protocol conversion server to enable the virtual terminal of the protocol conversion server to access the network.
Step 802: the video network terminal initiatively initiates a call, inputs a five-digit video network terminal number and sends a first call instruction to the video network server.
Step 803: and the video network server converts the first call instruction into a second call signaling and sends the second call signaling to the coordination server.
Step 804 a: and after receiving the second call signaling, the cooperation server starts to initiate a first registration request to the SIP server so as to register on the SIP server.
Step 804 b: simultaneously with step 804a, the conference terminal also starts to call the SIP server, sending a second registration request to register on the SIP server.
Step 805: the SIP server replies a registration response to both parties, allowing registration.
Step 806 a: the collaboration server sends a first access request to the SIP server.
Step 806 b: and the conference terminal sends a second access request to the SIP server.
Step 807: and the SIP server replies to the access response message and starts processing.
Step 808 ab: the SIP server replies a first conference invitation message and a response trigger message to the collaboration server and the conference terminal, and the access request message sent by the collaboration server and the conference terminal is successfully processed.
Step 809 ab: and when the virtual terminal and the conference terminal of the collaboration server receive the first conference invitation message and the response trigger message, replying a confirmation signaling to the SIP server.
Step 810: and the virtual terminal of the collaboration server converts the first session invitation message into a second session invitation message and sends the second session invitation message to the video network server.
Step 811: and the video network server converts the second conference invitation message into a third conference invitation message, sends the third conference invitation message to the video network terminal and informs the video network terminal of entering the conference.
Step 812 ab: and after the video network terminal and the conference terminal both enter the conference, the video network server starts to send an encoder opening instruction to the protocol conversion server and the video network terminal.
Steps 813 and 814: and the video networking terminal sends the first audio and video stream and the second audio and video stream of the video networking protocol to the video networking server.
Step 815 ab: and the video network server forwards the first audio and video stream and the second audio and video stream to the corotation server.
Step 816: the protocol conversion server extracts the first audio and video stream and the second audio and video stream contained in the protocol and encapsulates the first audio and video stream and the second audio and video stream into an audio and video stream of an RTP protocol, and the audio and video stream of the conference terminal is sent to the SIP server.
Step 817: and the SIP server performs sound mixing and screen mixing on the received two paths of audio and video streams.
Step 818: and the SIP server respectively sends the audio and video streams after the audio mixing and the screen mixing to the cooperation server and the conference terminal.
Step 819: after receiving the audio and video stream encapsulated by the RTP, the coordination server decapsulates the audio and video stream into a third audio and video stream of mixed audio and a fourth audio and video stream of mixed screen, and then encapsulates the video networking protocol.
Step 820: and after the encapsulation of the coordination server is completed, the encapsulated third audio and video stream and the encapsulated fourth audio and video stream are sent to the video networking server.
Step 821: and the video network server forwards the video network to the video network terminal.
Step 822: the video network terminal can normally listen to and watch the conference content.
Step 823: and after the conference is finished, the video networking terminal sends a first hang-up instruction to the video networking server.
Step 824: and the video network server converts the first hang-up instruction into a second hang-up instruction and sends the second hang-up instruction to the coordination server.
Step 825: the cooperation server converts the second hang-up instruction into a third hang-up signaling and sends the third hang-up signaling to the SIP server, and the conference terminal also sends a fourth hang-up instruction to the SIP server.
Step 826: and the SIP server respectively replies a first hang-up response message to the protocol conversion server and the conference terminal.
Step 827: after receiving the first hang-up response message, the protocol conversion server converts the first hang-up response message into a second hang-up response message of the video networking protocol to the video networking server.
Step 828: and the video network server converts the second hang-up response message into a third hang-up response message and sends the third hang-up response message to the video network terminal.
Step 829: and the video network terminal exits the conference.
Step 830: and the video networking server sends an encoder closing instruction to close the video networking terminal and the audio/video codec of the protocol conversion server.
In summary, the video networking based multiparty conference joining method provided by the embodiment of the application converts the video networking number called by the video networking terminal and the video conference address through the protocol conversion server, so that the video networking terminal can actively call and join the conference created by the SIP server; and after the call is successfully established, the audio and video data of the user can be sent to the SIP server, and the mixed audio and video is displayed and played according to the mixing request, so that the user requirements are met.
The embodiment of the application solves the limitation of single function of point-to-point calling of the video networking terminal and the Internet video conference terminal under the SIP in the prior art, increases the function of adding the video networking terminal into the conference by calling the SIP conference number and actively calling, perfects the video networking and conference terminal under the SIP in a video conference butt joint mode, and meets the requirement of multi-party meeting under the user SIP.
It should be noted that the SIP protocol applied in the technical solution of the embodiment of the present application is only an example, and other protocols are within the protection scope of the technical solution disclosed in the embodiment of the present application.
It should be noted that, for simplicity of description, the method embodiments are described as a series of acts or combination of acts, but those skilled in the art will recognize that the embodiments are not limited by the order of acts described, as some steps may occur in other orders or concurrently depending on the embodiments. Further, those skilled in the art will also appreciate that the embodiments described in the specification are presently preferred and that no particular act is required of the embodiments of the application.
Based on the same technical concept, referring to fig. 9, a block diagram of a SIP server provided in the embodiment of the present application is shown, where the apparatus may be applied in a video network, and specifically may include the following modules:
a message receiving module 901, configured to receive, by a conference initiation protocol SIP server, a first conference access request sent by a video networking terminal via a transit server and a second conference access request sent by a conference terminal; and the first conference joining request and the second conference joining request carry a target conference identifier and the SIP server IP.
A message sending module 902, configured to send a first conference joining response to the video networking terminal via the transfer server according to the target conference identifier and the SIP server IP, and send a second conference joining response to the conference terminal; the first conference response is used for establishing a first data transmission channel with the video network terminal, and the second conference response is used for establishing a second data transmission channel with the conference terminal.
In one possible embodiment, the transfer server comprises a video network server and a corotation server; the first meeting-entering request is sent by the video network terminal, passes through the video network server, is subjected to protocol conversion by the protocol conversion server and is sent to the SIP server; the first conference response is sent by the SIP server, protocol conversion is carried out by the protocol conversion server, and the first conference response is sent to the video network terminal through the video network server.
In a possible implementation manner, the first data transmission channel is used for transmitting a mixed data stream sent by the SIP server and/or a first data stream sent by the video network terminal via the transit server; the second data transmission channel is used for transmitting the mixed data stream sent by the SIP server and/or a second data stream sent by the conference terminal; and the mixed data stream is obtained by mixing and screen-mixing the received first data stream and the second data stream by the SIP server.
In one possible embodiment, the transfer server comprises a video network server and a corotation server; the first data transmission channel is used for transmitting the mixed data stream which is sent by the SIP server, subjected to protocol conversion by the protocol conversion server and sent to the video network terminal through the video network server; and/or the first data transmission channel is used for transmitting the first data stream which is sent by the video network terminal, passes through the video network server and is subjected to protocol conversion by the protocol conversion server.
For the device embodiment, since it is basically similar to the method embodiment, the description is simple, and for the relevant points, refer to the partial description of the method embodiment.
The embodiments in the present specification are described in a progressive manner, each embodiment focuses on differences from other embodiments, and the same and similar parts among the embodiments are referred to each other.
As will be appreciated by one of skill in the art, embodiments of the present application may be provided as a method, apparatus, or computer program product. Accordingly, embodiments of the present application may take the form of an entirely hardware embodiment, an entirely software embodiment or an embodiment combining software and hardware aspects. Furthermore, embodiments of the present application may take the form of a computer program product embodied on one or more computer-usable storage media (including, but not limited to, disk storage, CD-ROM, optical storage, and the like) having computer-usable program code embodied therein.
Embodiments of the present application are described with reference to flowchart illustrations and/or block diagrams of methods, terminal devices (systems), and computer program products according to embodiments of the application. It will be understood that each flow and/or block of the flow diagrams and/or block diagrams, and combinations of flows and/or blocks in the flow diagrams and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, embedded processor, or other programmable data processing terminal to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing terminal, create means for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be stored in a computer-readable memory that can direct a computer or other programmable data processing terminal to function in a particular manner, such that the instructions stored in the computer-readable memory produce an article of manufacture including instruction means which implement the function specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be loaded onto a computer or other programmable data processing terminal to cause a series of operational steps to be performed on the computer or other programmable terminal to produce a computer implemented process such that the instructions which execute on the computer or other programmable terminal provide steps for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
While preferred embodiments of the present application have been described, additional variations and modifications of these embodiments may occur to those skilled in the art once they learn of the basic inventive concepts. Therefore, it is intended that the appended claims be interpreted as including the preferred embodiment and all such alterations and modifications as fall within the true scope of the embodiments of the application.
Finally, it should also be noted that, herein, relational terms such as first and second, and the like may be used solely to distinguish one entity or action from another entity or action without necessarily requiring or implying any actual such relationship or order between such entities or actions. Also, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or terminal that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or terminal. Without further limitation, an element defined by the phrase "comprising an … …" does not exclude the presence of other like elements in a process, method, article, or terminal that comprises the element.
The invention provides a multi-party conference method and a server based on video networking. The detailed description is given, and the principle and the implementation of the present application are explained by applying specific examples, and the above description of the embodiments is only used to help understanding the method and the core idea of the present application; meanwhile, for a person skilled in the art, according to the idea of the present application, there may be variations in the specific embodiments and the application scope, and in summary, the content of the present specification should not be construed as a limitation to the present application.

Claims (10)

1. A method for multi-party conferencing over a video network, the method comprising:
a conference initiation protocol SIP server receives a first conference access request sent by a video networking terminal through a transit server and a second conference access request sent by a conference terminal; the first conference entering request and the second conference entering request carry a target conference identifier and the SIP server IP, and the first conference entering request represents that the video network terminal actively calls and joins in a conference created by the SIP server and communicates with a conference terminal in the conference;
sending a first conference joining response to the video network terminal through the transfer server according to the target conference identifier and the SIP server IP, and sending a second conference joining response to the conference terminal; the first conference response is used for establishing a first data transmission channel with the video network terminal, and the second conference response is used for establishing a second data transmission channel with the conference terminal.
2. The method of claim 1, wherein the transit server comprises a video networking server and a corotation server;
the first meeting-entering request is sent by the video network terminal, passes through the video network server, is subjected to protocol conversion by the protocol conversion server and is sent to the SIP server;
the first conference response is sent by the SIP server, protocol conversion is carried out by the protocol conversion server, and the first conference response is sent to the video network terminal through the video network server.
3. The method of claim 1, wherein the first data transmission channel is used for transmitting a mixed data stream sent by the SIP server and/or a first data stream sent by the video networking terminal via the transit server;
the second data transmission channel is used for transmitting the mixed data stream sent by the SIP server and/or a second data stream sent by the conference terminal;
and the mixed data stream is obtained by mixing and screen-mixing the received first data stream and the second data stream by the SIP server.
4. The method of claim 3, wherein the transit server comprises a video networking server and a corotation server;
the first data transmission channel is used for transmitting the mixed data stream sent by the SIP server and/or the first data stream sent by the video network terminal via the transit server, and includes:
the first data transmission channel is used for transmitting the mixed data stream which is sent by the SIP server, subjected to protocol conversion by the protocol conversion server and sent to the video network terminal through the video network server; and/or
The first data transmission channel is used for transmitting the first data stream which is sent by the video network terminal, passes through the video network server and is subjected to protocol conversion by the protocol conversion server.
5. A server, characterized in that the server comprises:
the conference initiating protocol SIP server is used for receiving a first conference access request sent by the video networking terminal through the transit server and a second conference access request sent by the conference terminal; the first conference entering request and the second conference entering request carry a target conference identifier and the SIP server IP, and the first conference entering request represents that the video network terminal actively calls and joins in a conference created by the SIP server and communicates with a conference terminal in the conference;
the message sending module is used for sending a first conference-joining response to the video network terminal through the transfer server according to the target conference identifier and the SIP server IP and sending a second conference-joining response to the conference terminal; the first conference response is used for establishing a first data transmission channel with the video network terminal, and the second conference response is used for establishing a second data transmission channel with the conference terminal.
6. The server of claim 5, wherein the transit server comprises a video networking server and a corotation server;
the first meeting-entering request is sent by the video network terminal, passes through the video network server, is subjected to protocol conversion by the protocol conversion server and is sent to the SIP server;
the first conference response is sent by the SIP server, protocol conversion is carried out by the protocol conversion server, and the first conference response is sent to the video network terminal through the video network server.
7. The server according to claim 5, wherein the first data transmission channel is used for transmitting a mixed data stream sent by the SIP server and/or a first data stream sent by the video network terminal via the transit server;
the second data transmission channel is used for transmitting the mixed data stream sent by the SIP server and/or a second data stream sent by the conference terminal;
and the mixed data stream is obtained by mixing and screen-mixing the received first data stream and the second data stream by the SIP server.
8. The server of claim 7, wherein the transit server comprises a video networking server and a corotation server;
the first data transmission channel is used for transmitting the mixed data stream sent by the SIP server and/or the first data stream sent by the video network terminal via the transit server, and includes:
the first data transmission channel is used for transmitting the mixed data stream which is sent by the SIP server, subjected to protocol conversion by the protocol conversion server and sent to the video network terminal through the video network server; and/or
The first data transmission channel is used for transmitting the first data stream which is sent by the video network terminal, passes through the video network server and is subjected to protocol conversion by the protocol conversion server.
9. A computer device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, wherein the processor implements the method of any one of claims 1 to 4 when executing the computer program.
10. A computer-readable storage medium, characterized in that the computer-readable storage medium stores a computer program for executing the method of any one of claims 1 to 4.
CN201910310303.6A 2019-04-17 2019-04-17 Multi-party group meeting method and server based on video network Active CN110062191B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910310303.6A CN110062191B (en) 2019-04-17 2019-04-17 Multi-party group meeting method and server based on video network

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910310303.6A CN110062191B (en) 2019-04-17 2019-04-17 Multi-party group meeting method and server based on video network

Publications (2)

Publication Number Publication Date
CN110062191A CN110062191A (en) 2019-07-26
CN110062191B true CN110062191B (en) 2021-03-02

Family

ID=67319288

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910310303.6A Active CN110062191B (en) 2019-04-17 2019-04-17 Multi-party group meeting method and server based on video network

Country Status (1)

Country Link
CN (1) CN110062191B (en)

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110545395B (en) * 2019-08-13 2020-12-04 视联动力信息技术股份有限公司 Video networking conference switching method and device
CN110475094B (en) * 2019-08-20 2021-07-20 视联动力信息技术股份有限公司 Video conference processing method and device and readable storage medium
CN110995456B (en) * 2019-11-18 2022-12-23 视联动力信息技术股份有限公司 Multi-terminal conference implementation method and first terminal
CN111447391A (en) * 2020-02-20 2020-07-24 平安科技(深圳)有限公司 Conference data synchronization method and device, computer equipment and storage medium
CN111478884B (en) * 2020-03-06 2022-11-15 视联动力信息技术股份有限公司 Method, device and storage medium for data transmission among different video networks
CN111835717B (en) * 2020-06-04 2023-05-12 视联动力信息技术股份有限公司 Method and device for acquiring monitoring code stream and readable storage medium

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103546717A (en) * 2012-07-16 2014-01-29 中兴通讯股份有限公司 Audio/video conference connection establishment method and device
CN107968928A (en) * 2016-10-19 2018-04-27 北京视联动力国际信息技术有限公司 A kind of method and apparatus of terminal communication
CN108234421A (en) * 2016-12-21 2018-06-29 北京视联动力国际信息技术有限公司 It is a kind of to regard networked terminals and the method and system of internet terminal audio data intercommunication

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105706445A (en) * 2016-01-18 2016-06-22 王晓光 Video network meeting method and system
CN108632398B (en) * 2017-07-27 2019-11-12 视联动力信息技术股份有限公司 A kind of conference access method and system, association turn server and conference management terminal

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103546717A (en) * 2012-07-16 2014-01-29 中兴通讯股份有限公司 Audio/video conference connection establishment method and device
CN107968928A (en) * 2016-10-19 2018-04-27 北京视联动力国际信息技术有限公司 A kind of method and apparatus of terminal communication
CN108234421A (en) * 2016-12-21 2018-06-29 北京视联动力国际信息技术有限公司 It is a kind of to regard networked terminals and the method and system of internet terminal audio data intercommunication

Also Published As

Publication number Publication date
CN110062191A (en) 2019-07-26

Similar Documents

Publication Publication Date Title
CN108574688B (en) Method and device for displaying participant information
CN110062191B (en) Multi-party group meeting method and server based on video network
CN110166728B (en) Video networking conference opening method and device
CN109640028B (en) Method and device for carrying out conference combining on multiple video networking terminals and multiple Internet terminals
CN109803111B (en) Method and device for watching video conference after meeting
CN109618120B (en) Video conference processing method and device
CN109120879B (en) Video conference processing method and system
CN110049271B (en) Video networking conference information display method and device
CN110475094B (en) Video conference processing method and device and readable storage medium
CN108881796B (en) Video data processing method and video networking protocol conversion server
CN110460804B (en) Conference data transmitting method, system, device and computer readable storage medium
CN110049273B (en) Video networking-based conference recording method and transfer server
CN109743522B (en) Communication method and device based on video networking
CN109218306B (en) Audio and video data stream processing method and system
CN111131754A (en) Control split screen method and device of conference management system
CN110830750A (en) Data transmission method and device based on video network
CN110149305B (en) Video network-based multi-party audio and video playing method and transfer server
CN110855926A (en) Video conference processing method and device
CN110049268B (en) Video telephone connection method and device
CN111614927A (en) Video session establishment method, device, electronic equipment and storage medium
CN111327868A (en) Method, terminal, server, device and medium for setting conference speaking party role
CN109286775B (en) Multi-person conference control method and system
CN109451001B (en) Communication method and system
CN109587433B (en) Point adjusting method and point adjusting device
CN109922300B (en) Method and server for inviting video network MCU conference to Internet MCU conference

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant