CN112653868A

CN112653868A - Cloud-based multi-person remote scene secure video conference communication system

Info

Publication number: CN112653868A
Application number: CN202011085609.5A
Authority: CN
Inventors: 杜婷
Original assignee: Beijing Guoneng Internet Information Technology Co ltd
Current assignee: Beijing Guoneng Internet Information Technology Co ltd
Priority date: 2020-10-12
Filing date: 2020-10-12
Publication date: 2021-04-13

Abstract

The invention relates to the technical field of audio and video and discloses a cloud-based multi-person remote scene secure video conference communication system. The architecture of the system comprises a basic support layer, a conference control and transmission layer, a media layer, a coding and decoding layer and an access layer. The basic support layer provides XMPP, which transmits XML data streams over TCP; no central master server is required, and a customized XMPP server can be operated independently at any node. The system is a typical form of image communication: image and sound signals are converted into digital signals at the sending end and reproduced at the receiving end as information that can be seen and heard.

Description

Cloud-based multi-person remote scene secure video conference communication system

Technical Field

The invention relates to the technical field of audio and video, and in particular to a cloud-based multi-person remote scene secure video conference communication system.

Background

A video conference system, also called a conference television system, is a system with which individuals or groups in two or more different places hold a remote conference by transmitting sound, images and file data to one another over transmission lines and multimedia equipment.

However, existing video conference systems are limited by their system architecture, so they offer few functions and can be used only in certain settings. A cloud-based multi-person remote scene secure video conference communication system is therefore provided.

Disclosure of Invention

To address the shortcomings of the prior art, the invention provides a cloud-based multi-person remote scene secure video conference communication system.

The invention provides the following technical solution: a cloud-based multi-person remote scene secure video conference communication system whose architecture comprises a basic support layer, a conference control and transmission layer, a media layer, a coding and decoding layer and an access layer;

the basic support layer provides XMPP, which transmits XML data streams over TCP; no central master server is required, and a customized XMPP server can be operated independently at any node;

the conference control and transmission layer uses the HTTPS protocol to authenticate users and servers and to ensure that data is sent to the correct clients and servers;

the platform of the media layer is implemented on the basis of RTC; terminal PCs or large-screen devices taking part in a meeting need no additional client or plug-in, and a compatible browser is all that is required to join the meeting and exchange audio and video;

the platform has built-in connection and transmission solutions for NAT, local area networks, virtual machines, wide area networks and the like, uses the key NAT and firewall traversal technologies STUN, ICE, TURN and RTP-over-TCP, and supports proxies.

The coding and decoding layer uses the VP8/VP9 codecs built into RTC. VP8 and VP9 are open-source standards that can be used free of charge, and decoding is performed in software, so no additional decoding equipment needs to be purchased.

The access layer can be reached through any browser that currently supports RTC, without installing a client or other plug-ins, and the system can be used anytime and anywhere over communication networks such as the Internet, office networks, government networks, mobile 3G/4G, satellite and microwave links.
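As an illustration of the NAT and firewall traversal mentioned for the media layer, the following minimal sketch orders ICE candidates using the priority formula and recommended type preferences from the ICE specification (RFC 8445). It assumes that "RTC" refers to a WebRTC-style stack; the `Candidate` class, the example addresses and the helper names are hypothetical and are not taken from the invention.

```python
from dataclasses import dataclass

# Type preferences recommended by RFC 8445 (a higher value is preferred).
TYPE_PREFERENCE = {"host": 126, "prflx": 110, "srflx": 100, "relay": 0}


@dataclass(frozen=True)
class Candidate:
    kind: str                # "host", "srflx" (learned via STUN) or "relay" (via TURN)
    address: str             # illustrative documentation address only
    component: int = 1       # 1 = RTP
    local_pref: int = 65535  # single-interface default


def priority(c: Candidate) -> int:
    """RFC 8445: 2^24 * type pref + 2^8 * local pref + (256 - component ID)."""
    return (TYPE_PREFERENCE[c.kind] << 24) + (c.local_pref << 8) + (256 - c.component)


def order_candidates(cands):
    """Sort candidates from most preferred to least preferred."""
    return sorted(cands, key=priority, reverse=True)


candidates = [
    Candidate("relay", "203.0.113.10"),
    Candidate("host", "192.168.1.20"),
    Candidate("srflx", "198.51.100.7"),
]
ordered = [c.kind for c in order_candidates(candidates)]
print(ordered)
```

With these preferences a direct host path is preferred, then a STUN-derived address, and a TURN relay is the last resort for cases where a firewall blocks direct traffic.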

Preferably, the basic support layer uses SASL, a mechanism for extending the authentication capability of the client/server (C/S) model; SASL provides a general method for adding authentication support to connection-based protocols, and XMPP uses a generic XML namespace profile to meet the requirements of SASL.

Preferably, the basic support layer uses TLS to provide privacy and data integrity between two communicating applications; the protocol consists of two layers: the TLS record protocol and the TLS handshake protocol.

Preferably, the system supports app clients installed on devices such as smartphones, tablets and PCs, and covers operating systems such as Windows, Android and iOS.

Preferably, the audio of the system uses Opus coding, supports super-wideband (24 kHz sampling rate) and fullband (48 kHz sampling rate) speech, and applies technologies such as full echo cancellation, noise suppression, reverberation suppression, automatic gain control, sound source localization and beamforming, so as to achieve better sound quality and ensure the best listening experience.

Preferably, the system uses a dedicated forward error correction and packet-loss retransmission algorithm together with an intelligent adjustment mechanism, so that video does not stall or show corrupted frames at 30% network packet loss and can still be transmitted at 50% packet loss, while audio remains clear and smooth at 50% network packet loss and remains intelligible at 80% packet loss.

Preferably, the system uses an H.264/H.265 SVC (scalable video coding) architecture with an adaptive call rate that suits various kinds of network access; it automatically detects available bandwidth from 64 kbps to 8 Mbps and dynamically adjusts the SVC video layers in real time as the bandwidth changes, ensuring the best video experience.

Compared with the prior art, the invention has the following beneficial effects:

The cloud-based multi-person remote scene secure video conference communication system is a typical form of image communication: image and sound signals are converted into digital signals at the sending end and reproduced at the receiving end as information that can be seen and heard.

Drawings

FIG. 1 is a schematic diagram of the present invention;

FIG. 2 is a logic diagram of the present invention.

Detailed Description

To make the objects, technical solutions and advantages of the embodiments of the present disclosure clearer, the technical solutions of the embodiments are described clearly and completely below with reference to the accompanying drawings. To keep the description clear and concise, detailed descriptions of known functions and known parts are omitted so as not to obscure the concepts of the present disclosure.

Referring to FIG. 1 and FIG. 2, a cloud-based multi-person remote scene secure video conference communication system is provided, whose architecture comprises a basic support layer, a conference control and transmission layer, a media layer, a coding and decoding layer and an access layer.

The basic support layer uses SASL, a mechanism for extending the authentication capability of the client/server (C/S) model; SASL provides a general method for adding authentication support to connection-based protocols, and XMPP uses a generic XML namespace profile to meet the requirements of SASL.
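The SASL profile used by XMPP can be illustrated with a minimal sketch that builds the `<auth/>` element for the SASL PLAIN mechanism. The namespace `urn:ietf:params:xml:ns:xmpp-sasl` is the one defined by the XMPP core specification; the choice of PLAIN, the function name and the credentials are assumptions made for illustration only, since the text does not say which SASL mechanism the system uses.

```python
import base64
import xml.etree.ElementTree as ET

SASL_NS = "urn:ietf:params:xml:ns:xmpp-sasl"


def sasl_plain_auth(username: str, password: str) -> str:
    """Build an XMPP <auth/> element carrying a SASL PLAIN initial response."""
    # PLAIN message layout: authzid NUL authcid NUL password (authzid left empty).
    raw = b"\x00" + username.encode("utf-8") + b"\x00" + password.encode("utf-8")
    auth = ET.Element("auth", {"xmlns": SASL_NS, "mechanism": "PLAIN"})
    auth.text = base64.b64encode(raw).decode("ascii")
    return ET.tostring(auth, encoding="unicode")


# Hypothetical credentials, for illustration only.
stanza = sasl_plain_auth("alice", "secret")
print(stanza)
```

Because PLAIN sends the password in a reversible encoding, it is normally offered only after the stream has been protected with TLS.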

The basic support layer uses TLS to provide privacy and data integrity between two communicating applications; the protocol consists of two layers: the TLS record protocol and the TLS handshake protocol.
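The relationship between the two TLS layers can be sketched as follows: every handshake message travels inside a record, and the 5-byte record header states which higher-level protocol the payload belongs to. The parser below is an illustrative sketch only, not part of the invention, and the sample bytes are made up.

```python
import struct

# Content types carried by the TLS record layer.
CONTENT_TYPES = {
    20: "change_cipher_spec",
    21: "alert",
    22: "handshake",
    23: "application_data",
}


def parse_record_header(data: bytes):
    """Parse the 5-byte record header: content type, version, payload length."""
    if len(data) < 5:
        raise ValueError("a TLS record header is 5 bytes long")
    content_type, major, minor, length = struct.unpack("!BBBH", data[:5])
    return CONTENT_TYPES.get(content_type, "unknown"), (major, minor), length


# Illustrative record: a 4-byte handshake fragment framed by the record layer.
record = bytes([22, 3, 3, 0, 4]) + b"\x01\x00\x00\x00"
header = parse_record_header(record)
print(header)
```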

The system supports app clients installed on devices such as smartphones, tablets and PCs, and covers operating systems such as Windows, Android and iOS.

The audio of the system uses Opus coding, supports super-wideband (24 kHz sampling rate) and fullband (48 kHz sampling rate) speech, and applies technologies such as full echo cancellation, noise suppression, reverberation suppression, automatic gain control, sound source localization and beamforming, thereby achieving better sound quality and ensuring the best listening experience.
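To make the two sampling rates concrete, the short sketch below maps Opus sampling rates to their bandwidth names and computes how many samples per channel fall into one audio frame. The 20 ms frame duration is an assumption (a common Opus setting); the text does not state which frame size the system uses.

```python
# Opus bandwidth names keyed by effective sampling rate in Hz.
OPUS_BANDS = {
    8000: "narrowband",
    12000: "mediumband",
    16000: "wideband",
    24000: "super-wideband",
    48000: "fullband",
}


def samples_per_frame(sample_rate_hz: int, frame_ms: float = 20.0) -> int:
    """Number of samples per channel in one frame of the given duration."""
    if sample_rate_hz not in OPUS_BANDS:
        raise ValueError(f"unsupported Opus sampling rate: {sample_rate_hz}")
    return int(sample_rate_hz * frame_ms / 1000)


for rate in (24000, 48000):
    print(OPUS_BANDS[rate], samples_per_frame(rate))
```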

The system uses a dedicated forward error correction and packet-loss retransmission algorithm together with an intelligent adjustment mechanism, so that video does not stall or show corrupted frames at 30% network packet loss and can still be transmitted at 50% packet loss, while audio remains clear and smooth at 50% network packet loss and remains intelligible at 80% packet loss.
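The dedicated algorithm itself is not disclosed. As a generic illustration of the forward error correction principle only, the sketch below uses simple XOR parity, a swapped-in textbook technique: one parity packet is sent per group, and any single lost packet in the group can be rebuilt without retransmission. All names and packet contents are hypothetical.

```python
from functools import reduce


def xor_bytes(a: bytes, b: bytes) -> bytes:
    """Byte-wise XOR of two equal-length packets."""
    return bytes(x ^ y for x, y in zip(a, b))


def make_parity(packets):
    """XOR all equal-length packets of a group into one parity packet."""
    return reduce(xor_bytes, packets)


def recover(received, parity):
    """Rebuild the single missing packet (marked None) from the parity packet."""
    missing = [i for i, p in enumerate(received) if p is None]
    if len(missing) != 1:
        raise ValueError("XOR parity can repair exactly one loss per group")
    present = [p for p in received if p is not None]
    repaired = list(received)
    # parity XOR every surviving packet leaves exactly the lost packet.
    repaired[missing[0]] = reduce(xor_bytes, present, parity)
    return repaired


group = [b"pkt0", b"pkt1", b"pkt2", b"pkt3"]
parity = make_parity(group)
damaged = [group[0], None, group[2], group[3]]
print(recover(damaged, parity))
```

A single parity packet cannot repair two losses in the same group, which is one reason why systems that target high loss rates combine forward error correction with retransmission.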

The system uses an H.264/H.265 SVC (scalable video coding) architecture with an adaptive call rate that suits various kinds of network access; it automatically detects available bandwidth from 64 kbps to 8 Mbps and dynamically adjusts the SVC video layers in real time as the bandwidth changes, ensuring the best video experience.
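The bandwidth-driven adjustment of SVC layers can be sketched as follows. The layer ladder, its names and its bitrate thresholds are hypothetical values chosen only to span the 64 kbps to 8 Mbps range given in the text; the actual layering used by the system is not specified.

```python
# Hypothetical SVC ladder: (layer name, cumulative bitrate in kbps needed to
# receive this layer together with all layers below it).
LAYERS = [
    ("base 180p", 64),
    ("enh-1 360p", 512),
    ("enh-2 720p", 2000),
    ("enh-3 1080p", 8000),
]


def select_layers(bandwidth_kbps: float):
    """Return the layers that fit into the estimated bandwidth."""
    # Clamp to the 64 kbps .. 8 Mbps range stated in the text, so the base
    # layer is always forwarded even when the estimate is lower.
    clamped = max(64, min(bandwidth_kbps, 8000))
    return [name for name, need in LAYERS if need <= clamped]


for bw in (50, 600, 3000, 9000):
    print(bw, select_layers(bw))
```

Calling `select_layers` again whenever the bandwidth estimate changes adds or drops enhancement layers without re-encoding the base layer, which is the property that makes a scalable codec suitable for this kind of adaptation.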

The main functional points of the system are as follows:

Electronic sign-in: after participants enter the conference, the platform automatically pops up an electronic sign-in sheet for them to sign, and after the conference ends, administrators can export the sign-in results. Electronic sign-in is optional, and whether to enable it can be chosen when the conference is created.

Shared electronic whiteboard: by default the platform allocates an electronic whiteboard to each meeting. Once the meeting starts, participants can write and sketch on the whiteboard, and the platform shares the whiteboard content with the other participants in real time, enabling multi-user collaboration and sharing. The whiteboard can be saved; once saved, users can still view all earlier whiteboards in the meeting history after the meeting has ended. The whiteboard can also serve as a workspace for displaying and annotating pictures: users can upload local pictures to the whiteboard so that participants can annotate them, and the system automatically retains the annotations.

Shared conference summary: by default the platform provides one public summary for each meeting and one personal summary for each participant. The public summary can be edited and viewed by all participants, while a personal summary belongs to the individual participant. After the summary is saved, users can log in to the platform and view the public summary of the meeting through the meeting history.

Screen sharing: when sharing, the user can choose to share the whole screen, a window or a tab, and all participants can see the shared picture, for example a shared PPT in meeting and lecture scenarios.

Text interaction: all participants can communicate and interact with one another through text and emoticons using this function.

File sharing: participants select the files to be shared and send them to the platform, and other participants preview or download them through the platform. All shared files are archived and stored uniformly by the platform. After the meeting ends, users can view or download the files shared in past meetings at any time through the meeting history.

Turning off microphones and cameras: the moderator can turn off the microphone and camera of any person in the meeting, and that person can apply to turn them back on by raising a hand.

Conference live streaming: video can be recorded during a live stream so that the stream can be played back at any time; ordinary conferences can also be recorded. Recording is performed entirely by the cloud service, so no separate recording server needs to be configured; recordings are also stored in the cloud, and the recording space expands on demand. For a complete recording, the video owner can quickly make it available on demand, link to it and share it, so that other users can watch it online on any device, anytime and anywhere.

Dual stream: as the name implies, the camera stream and the screen stream are mixed into one stream and displayed on different screens, so that both the camera picture and the screen content can be conveyed clearly in a video conference.

Conference modes: during a meeting, in addition to split-screen display, the platform provides two video display modes: main-screen mode and nine-grid mode.
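The two display modes can be illustrated with a small layout sketch that computes tile rectangles `(x, y, width, height)` for a video canvas. The 3 x 3 grid follows from the nine-grid name; the arrangement of the main-screen mode (one large tile with a thumbnail strip below it) and the `strip_ratio` parameter are assumptions, since the text does not describe how the remaining participants are placed.

```python
def grid_layout(width: int, height: int, rows: int = 3, cols: int = 3):
    """Nine-grid mode: split the canvas into rows x cols equal tiles."""
    tile_w, tile_h = width // cols, height // rows
    return [(c * tile_w, r * tile_h, tile_w, tile_h)
            for r in range(rows) for c in range(cols)]


def main_screen_layout(width: int, height: int, others: int,
                       strip_ratio: float = 0.2):
    """Main-screen mode: one large tile plus a strip of thumbnails below it."""
    strip_h = int(height * strip_ratio) if others else 0
    tiles = [(0, 0, width, height - strip_h)]  # the main picture
    if others:
        thumb_w = width // others
        tiles += [(i * thumb_w, height - strip_h, thumb_w, strip_h)
                  for i in range(others)]
    return tiles


grid = grid_layout(1920, 1080)
main = main_screen_layout(1920, 1080, others=4)
print(grid[:2])
print(main)
```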

The above embodiments are only exemplary embodiments of the present invention, and are not intended to limit the present invention, and the scope of the present invention is defined by the claims. Various modifications and equivalents may be made by those skilled in the art within the spirit and scope of the present invention, and such modifications and equivalents should also be considered as falling within the scope of the present invention.

Claims

1. A cloud-based multi-person remote scene secure video conference communication system, characterized in that the architecture of the system is divided into a basic support layer, a conference control and transmission layer, a media layer, a coding and decoding layer and an access layer;

2. The cloud-based multi-person remote scene secure video conference communication system according to claim 1, wherein: the basic support layer uses SASL, a mechanism for extending the authentication capability of the client/server (C/S) model; SASL provides a general method for adding authentication support to connection-based protocols, and XMPP uses a generic XML namespace profile to meet the requirements of SASL.

3. The cloud-based multi-person remote scene secure video conference communication system according to claim 1, wherein: the basic support layer uses TLS to provide privacy and data integrity between two communicating applications, the protocol consisting of two layers: the TLS record protocol and the TLS handshake protocol.

4. The cloud-based multi-person remote scene secure video conference communication system according to claim 1, wherein: the system supports app clients installed on devices such as smartphones, tablets and PCs, and covers operating systems such as Windows, Android and iOS.

5. The cloud-based multi-person remote scene secure video conference communication system according to claim 1, wherein: the audio of the system uses Opus coding, supports super-wideband (24 kHz sampling rate) and fullband (48 kHz sampling rate) speech, and applies technologies such as full echo cancellation, noise suppression, reverberation suppression, automatic gain control, sound source localization and beamforming, so as to achieve better sound quality and ensure the best listening experience.

6. The cloud-based multi-person remote scene secure video conference communication system according to claim 1, wherein: the system uses a dedicated forward error correction and packet-loss retransmission algorithm together with an intelligent adjustment mechanism, so that video does not stall or show corrupted frames at 30% network packet loss and can still be transmitted at 50% packet loss, while audio remains clear and smooth at 50% network packet loss and remains intelligible at 80% packet loss.

7. The cloud-based multi-person remote scene secure video conference communication system according to claim 1, wherein: the system uses an H.264/H.265 SVC (scalable video coding) architecture with an adaptive call rate that suits various kinds of network access; it automatically detects available bandwidth from 64 kbps to 8 Mbps and dynamically adjusts the SVC video layers in real time as the bandwidth changes, ensuring the best video experience.