CN112653868A - Cloud-based multi-person remote scene secure video conference communication system - Google Patents

Cloud-based multi-person remote scene secure video conference communication system

Publication number
CN112653868A
Authority
CN
China
Prior art keywords
layer
communication system
video conference
conference communication
video
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202011085609.5A
Other languages
Chinese (zh)
Inventor
杜婷
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Guoneng Internet Information Technology Co ltd
Original Assignee
Beijing Guoneng Internet Information Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2020-10-12
Filing date
2020-10-12
Publication date
2021-04-13
Application filed by Beijing Guoneng Internet Information Technology Co ltd
Priority to CN202011085609.5A
Publication of CN112653868A

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00 Television systems
    • H04N7/14 Systems for two-way working
    • H04N7/15 Conference systems
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/60 Network structure or processes for video distribution between server and client or between remote clients; Control signalling between clients, server and network components; Transmission of management data between server and client, e.g. sending from server to client commands for recording incoming content stream; Communication details between server and client
    • H04N21/63 Control signaling related to video distribution between client, server and network components; Network processes for video distribution between server and clients or between remote clients, e.g. transmitting basic layer and enhancement layers over different transmission paths, setting up a peer-to-peer communication via Internet between remote STB's; Communication protocols; Addressing
    • H04N21/643 Communication protocols
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/60 Network structure or processes for video distribution between server and client or between remote clients; Control signalling between clients, server and network components; Transmission of management data between server and client, e.g. sending from server to client commands for recording incoming content stream; Communication details between server and client
    • H04N21/63 Control signaling related to video distribution between client, server and network components; Network processes for video distribution between server and clients or between remote clients, e.g. transmitting basic layer and enhancement layers over different transmission paths, setting up a peer-to-peer communication via Internet between remote STB's; Communication protocols; Addressing
    • H04N21/647 Control signaling between network components and server or clients; Network processes for video distribution between server and clients, e.g. controlling the quality of the video stream, by dropping packets, protecting content from unauthorised alteration within the network, monitoring of network load, bridging between two different networks, e.g. between IP and wireless
    • H04N21/64784 Data processing by the network
    • H04N21/64792 Controlling the complexity of the content stream, e.g. by dropping packets

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Computer Security & Cryptography (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
  • Telephonic Communication Services (AREA)

Abstract

The invention relates to the technical field of audio and video and discloses a cloud-based multi-person remote scene secure video conference communication system. The architecture of the system comprises a basic support layer, a conference control and transmission layer, a media layer, a coding and decoding layer and an access layer. The basic support layer provides XMPP, which carries XML data streams over TCP; there is no central master server, and a customized XMPP server can be run independently on any node. The system is a typical image communication system: at the sending end, image and sound signals are converted into digital signals, and at the receiving end the digital signals are reproduced as information that can be seen and heard.

Description

Cloud-based multi-person remote scene secure video conference communication system
Technical Field
The invention relates to the technical field of audio and video, and in particular to a cloud-based multi-person remote scene secure video conference communication system.
Background
A video conference system, also called a conference television system, is a system with which individuals or groups in two or more different places hold a remote conference by transmitting sound, images and file data to one another over transmission lines and multimedia equipment.
However, existing video conference systems are limited by their system architecture, so they offer few functions and can be used only in certain settings. A cloud-based multi-person remote scene secure video conference communication system is therefore proposed.
Disclosure of Invention
In view of the defects in the prior art, the invention provides a cloud-based multi-person remote scene secure video conference communication system.
The invention provides the following technical scheme: a cloud-based multi-person remote scene secure video conference communication system whose architecture comprises a basic support layer, a conference control and transmission layer, a media layer, a coding and decoding layer and an access layer;
the basic support layer provides XMPP; XMPP carries XML data streams over TCP, there is no central master server, and a customized XMPP server can be run independently on any node;
the control and transmission layer uses the HTTPS protocol to authenticate users and servers and to ensure that data is sent to the correct clients and servers;
the platform of the media layer is implemented on RTC; terminal PCs or large-screen devices taking part in a meeting need no additional client or plug-in, and a compatible browser is all that is required to join the meeting and exchange audio and video;
the platform has built-in connection and transmission solutions for NAT, local area networks, virtual machines, wide area networks and the like, uses the key NAT and firewall traversal technologies STUN, ICE, TURN and RTP-over-TCP, and supports proxies.
The coding and decoding layer uses the VP8/VP9 codecs built into RTC. VP8 and VP9 are open-source standards that can be used free of charge; encoding and decoding are performed in software, so no additional codec equipment needs to be purchased.
The access layer can be reached from any browser that currently supports RTC, with no client or other plug-in to install, over communication links such as the Internet, office networks, government affairs networks, mobile 3G/4G, satellite and microwave, so the system can be used anytime and anywhere.
Preferably, the basic support layer adopts SASL, a mechanism for extending the authentication capability of the client/server (C/S) model. SASL provides a general method for adding authentication support to connection-based protocols, and XMPP uses a generic XML namespace profile to meet the requirements of SASL.
Preferably, the basic support layer employs TLS to provide privacy and data integrity between two communicating applications. The protocol consists of two layers: the TLS Record Protocol and the TLS Handshake Protocol.
Preferably, the system supports app clients installed on devices such as smartphones, tablets and PCs, covering operating systems such as Windows, Android and iOS.
Preferably, the audio of the system uses Opus coding, supports super-wideband (24 kHz sampling rate) and fullband (48 kHz sampling rate) speech, and applies technologies such as full echo cancellation, noise suppression, reverberation suppression, automatic gain control, sound source localization and beamforming, so as to achieve better sound quality and the best possible listening experience.
Preferably, the system uses a dedicated forward error correction and packet loss retransmission algorithm together with an intelligent regulation mechanism, so that video neither stalls nor shows corrupted frames at 30% network packet loss and can still be transmitted at 50% packet loss, while audio stays clear and smooth at 50% packet loss and remains intelligible at 80% packet loss.
Preferably, the system uses an H.264/H.265 SVC (scalable video coding) architecture. An adaptive call rate accommodates various kinds of network access: bandwidth from 64 kbps to 8 Mbps is detected automatically, and the video SVC layering is adjusted dynamically in real time as the bandwidth changes, ensuring the best possible video experience.
Compared with the prior art, the invention has the following beneficial effects:
the cloud-based multi-person remote scene secure video conference communication system is a typical image communication system: at the sending end, image and sound signals are converted into digital signals, and at the receiving end the digital signals are reproduced as information that can be seen and heard.
Drawings
FIG. 1 is a schematic diagram of the present invention;
FIG. 2 is a logic diagram of the present invention.
Detailed Description
To make the objects, technical solutions and advantages of the embodiments of the present disclosure clearer, the technical solutions of the embodiments are described clearly and completely below with reference to the accompanying drawings. To keep the description clear and concise, detailed descriptions of known functions and known parts are omitted so as not to obscure the concepts of the present disclosure.
Referring to FIGS. 1-2, a cloud-based multi-person remote scene secure video conference communication system is provided, whose architecture comprises a basic support layer, a conference control and transmission layer, a media layer, a coding and decoding layer and an access layer;
the basic support layer provides XMPP; XMPP carries XML data streams over TCP, there is no central master server, and a customized XMPP server can be run independently on any node;
the basic support layer adopts SASL, a mechanism for extending the authentication capability of the client/server (C/S) model. SASL provides a general method for adding authentication support to connection-based protocols, and XMPP uses a generic XML namespace profile to meet the requirements of SASL.
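As an illustration of how an XMPP client opens an XML stream and authenticates through SASL, the following minimal Python sketch builds the two pieces of XML involved. It is not taken from the application: the choice of the PLAIN mechanism, the domain `conference.example` and the credentials are assumptions made only for the example, and the function names are hypothetical.

```python
import base64
from xml.sax.saxutils import escape

# Namespace that XMPP uses for its SASL profile.
SASL_NS = "urn:ietf:params:xml:ns:xmpp-sasl"


def stream_header(domain: str) -> str:
    """Return the opening tag of an XMPP XML stream (sent over TCP)."""
    return (
        "<?xml version='1.0'?>"
        f"<stream:stream to='{escape(domain)}' version='1.0' "
        "xmlns='jabber:client' "
        "xmlns:stream='http://etherx.jabber.org/streams'>"
    )


def sasl_plain_auth(username: str, password: str) -> str:
    """Build the <auth/> element for the SASL PLAIN mechanism.

    PLAIN sends: authzid NUL authcid NUL password, base64-encoded.
    The authorization identity (authzid) is left empty here.
    """
    raw = f"\0{username}\0{password}".encode("utf-8")
    payload = base64.b64encode(raw).decode("ascii")
    return f"<auth xmlns='{SASL_NS}' mechanism='PLAIN'>{payload}</auth>"


if __name__ == "__main__":
    # Hypothetical domain and credentials, for illustration only.
    print(stream_header("conference.example"))
    print(sasl_plain_auth("alice", "secret"))
```

Because PLAIN carries the password in a form that is only base64-encoded, it would normally be offered only after the stream has been secured with TLS.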
The basic support layer employs TLS to provide privacy and data integrity between two communicating applications. The protocol consists of two layers: the TLS Record Protocol and the TLS Handshake Protocol.
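The two TLS layers can be seen in a short client-side sketch using Python's standard `ssl` module: the handshake runs when the socket is wrapped, after which the record layer protects everything sent on the connection. The minimum protocol version, the timeout and the helper names are assumptions for illustration and are not specified by the application.

```python
import socket
import ssl


def make_client_context() -> ssl.SSLContext:
    """Create a TLS client context that verifies the server certificate."""
    # create_default_context() enables certificate and host name checks.
    ctx = ssl.create_default_context()
    # Assumed policy for this sketch: refuse anything older than TLS 1.2.
    ctx.minimum_version = ssl.TLSVersion.TLSv1_2
    return ctx


def open_secure_channel(host: str, port: int = 443) -> str:
    """Connect, run the TLS handshake and return the negotiated version."""
    ctx = make_client_context()
    with socket.create_connection((host, port), timeout=5) as raw:
        # wrap_socket performs the handshake (authentication, key exchange);
        # later reads and writes go through the encrypted record layer.
        with ctx.wrap_socket(raw, server_hostname=host) as tls:
            return tls.version()
```

The same context could serve the HTTPS connections of the control and transmission layer, since HTTPS is HTTP carried over TLS.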
The control and transmission layer uses the HTTPS protocol to authenticate users and servers and to ensure that data is sent to the correct clients and servers;
the platform of the media layer is implemented on RTC; terminal PCs or large-screen devices taking part in a meeting need no additional client or plug-in, and a compatible browser is all that is required to join the meeting and exchange audio and video;
the platform has built-in connection and transmission solutions for NAT, local area networks, virtual machines, wide area networks and the like, uses the key NAT and firewall traversal technologies STUN, ICE, TURN and RTP-over-TCP, and supports proxies.
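To show how ICE chooses among the paths that STUN and TURN make available, the sketch below computes candidate priorities with the formula and type preferences recommended in the ICE specification (RFC 8445). It is a simplified illustration, not the platform's implementation: real agents also form candidate pairs and run connectivity checks, and the helper names are hypothetical.

```python
# Type preferences recommended by RFC 8445 (higher is preferred).
TYPE_PREFERENCE = {
    "host": 126,   # address of a local interface (direct path)
    "prflx": 110,  # peer-reflexive, learned during connectivity checks
    "srflx": 100,  # server-reflexive, public address learned via STUN
    "relay": 0,    # relayed address allocated on a TURN server
}


def candidate_priority(cand_type: str,
                       local_pref: int = 65535,
                       component_id: int = 1) -> int:
    """priority = 2^24 * type_pref + 2^8 * local_pref + (256 - component_id)."""
    return ((TYPE_PREFERENCE[cand_type] << 24)
            + (local_pref << 8)
            + (256 - component_id))


def rank(cand_types):
    """Order candidate types from most to least preferred."""
    return sorted(cand_types, key=candidate_priority, reverse=True)


if __name__ == "__main__":
    for kind in rank(["relay", "host", "srflx"]):
        print(kind, candidate_priority(kind))
```

With these preferences a direct host path is tried first, a STUN-discovered address next, and the TURN relay serves as the fallback when NATs or firewalls block the other paths.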
The coding and decoding layer uses the VP8/VP9 codecs built into RTC. VP8 and VP9 are open-source standards that can be used free of charge; encoding and decoding are performed in software, so no additional codec equipment needs to be purchased.
The access layer can be reached from any browser that currently supports RTC, with no client or other plug-in to install, over communication links such as the Internet, office networks, government affairs networks, mobile 3G/4G, satellite and microwave, so the system can be used anytime and anywhere.
The system supports app clients installed on devices such as smartphones, tablets and PCs, covering operating systems such as Windows, Android and iOS.
The audio of the system uses Opus coding, supports super-wideband (24 kHz sampling rate) and fullband (48 kHz sampling rate) speech, and applies technologies such as full echo cancellation, noise suppression, reverberation suppression, automatic gain control, sound source localization and beamforming, thereby achieving better sound quality and the best possible listening experience.
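For reference, the Opus bandwidth modes and the relationship between sampling rate and frame size can be written down as a small table. The values follow the Opus specification (RFC 6716); the 20 ms default frame duration and the helper name are assumptions chosen for the example.

```python
# Opus audio bandwidth modes: (audio bandwidth in Hz, effective sample rate in Hz).
OPUS_BANDWIDTHS = {
    "narrowband": (4000, 8000),
    "mediumband": (6000, 12000),
    "wideband": (8000, 16000),
    "super-wideband": (12000, 24000),
    "fullband": (20000, 48000),
}


def samples_per_frame(sample_rate_hz: int, frame_ms: float = 20) -> int:
    """Number of PCM samples per channel in one frame of the given duration."""
    return int(sample_rate_hz * frame_ms / 1000)


if __name__ == "__main__":
    for name, (audio_bw, rate) in OPUS_BANDWIDTHS.items():
        print(f"{name}: {audio_bw} Hz audio, {rate} Hz sampling, "
              f"{samples_per_frame(rate)} samples per 20 ms frame")
```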
The system uses a dedicated forward error correction and packet loss retransmission algorithm together with an intelligent regulation mechanism, so that video neither stalls nor shows corrupted frames at 30% network packet loss and can still be transmitted at 50% packet loss, while audio stays clear and smooth at 50% packet loss and remains intelligible at 80% packet loss.
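The application does not disclose its forward error correction algorithm. As a generic illustration of the idea, the sketch below uses the simplest scheme, a single XOR parity packet per group, which can rebuild one lost packet without a retransmission; losing more than one packet in a group is where retransmission (or a stronger code) would have to take over. All names are hypothetical.

```python
from functools import reduce
from typing import List, Optional


def xor_bytes(a: bytes, b: bytes) -> bytes:
    """Byte-wise XOR of two equal-length byte strings."""
    return bytes(x ^ y for x, y in zip(a, b))


def make_parity(packets: List[bytes]) -> bytes:
    """Compute one parity packet for a group of equal-length packets."""
    return reduce(xor_bytes, packets)


def recover(received: List[Optional[bytes]], parity: bytes) -> List[bytes]:
    """Rebuild a group in which at most one packet (marked None) was lost."""
    missing = [i for i, p in enumerate(received) if p is None]
    if not missing:
        return list(received)
    if len(missing) > 1:
        # One parity packet can repair only a single loss.
        raise ValueError("more than one packet lost; retransmission needed")
    present = [p for p in received if p is not None]
    rebuilt = reduce(xor_bytes, present, parity)
    repaired = list(received)
    repaired[missing[0]] = rebuilt
    return repaired


if __name__ == "__main__":
    group = [b"aaaa", b"bbbb", b"cccc"]
    parity = make_parity(group)
    print(recover([b"aaaa", None, b"cccc"], parity))
```

Sending one parity packet per three media packets costs about 33% extra bandwidth, which is the kind of trade-off an adaptive regulation mechanism would tune against the measured loss rate.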
The system uses an H.264/H.265 SVC (scalable video coding) architecture. An adaptive call rate accommodates various kinds of network access: bandwidth from 64 kbps to 8 Mbps is detected automatically, and the video SVC layering is adjusted dynamically in real time as the bandwidth changes, ensuring the best possible video experience.
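The bandwidth-driven SVC adjustment can be sketched as choosing the largest set of layers whose cumulative bitrate fits the estimated bandwidth. The layer names, their bitrates and the 15% safety headroom below are hypothetical values chosen for illustration; the application states only the 64 kbps to 8 Mbps detection range.

```python
# Hypothetical SVC layers: (name, cumulative bitrate in kbps), lowest first.
LAYERS = [
    ("base 180p", 64),
    ("enhancement 360p", 384),
    ("enhancement 720p", 1500),
    ("enhancement 1080p", 6000),
]


def select_layers(estimated_kbps: float, layers=LAYERS, headroom: float = 0.85):
    """Return the layers to forward for the current bandwidth estimate.

    Only a fraction (headroom) of the estimate is budgeted so that short
    dips in bandwidth do not immediately cause congestion.
    """
    budget = estimated_kbps * headroom
    chosen = [name for name, cumulative in layers if cumulative <= budget]
    # The base layer is always kept so that some video remains visible.
    return chosen or [layers[0][0]]


if __name__ == "__main__":
    for estimate in (64, 1000, 8000):
        print(estimate, "kbps ->", select_layers(estimate))
```

Re-running the selection whenever the bandwidth estimate changes gives the real-time layer adjustment the paragraph describes, without re-encoding the stream.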
The main functional points of the system are as follows:
Electronic sign-in: after participants enter the conference, the platform automatically pops up an electronic sign-in sheet for them to sign, and after the conference ends, administrators can export the sign-in results. Electronic sign-in is optional; whether to enable it can be chosen when the conference is created.
Shared electronic whiteboard: by default the platform allocates an electronic whiteboard to each meeting. Once the meeting starts, participants can write and sketch on the whiteboard, and the platform shares the whiteboard content with the other participants in real time, enabling multi-user collaboration and sharing. The whiteboard provides a save function; once saved, all earlier whiteboards remain visible in the meeting history even after the meeting has ended. The whiteboard can also serve as a workbench for displaying and annotating pictures: a user can upload a local picture to the whiteboard so that participants can annotate it, and the system automatically retains the annotations.
Shared meeting summary: by default the platform provides a public meeting summary for each meeting and a personal meeting summary for each participant. The public summary can be edited and viewed by all participants, while the personal summary belongs to the individual participant. After a summary is saved, a user who logs in to the platform can view the public summary of the meeting through the meeting history.
Screen sharing: when sharing, a participant can choose to share the whole screen, a single window or a single tab, and all participants can see the shared picture, for example a shared PPT in meeting and lecture scenarios.
Text interaction: with this function, all participants can communicate and interact with one another using text and emoticons.
File sharing: participants select the files to be shared and send them to the platform, and other participants preview or download them through the platform. All shared files are archived and stored uniformly by the platform. After the meeting ends, users can view or download the files shared in past meetings at any time through the meeting history.
Turning off microphones and cameras: the moderator can turn off the microphone and camera of any person in the meeting, and that person can apply to turn them back on by raising a hand.
Live conference: video can be recorded during a live broadcast so that the broadcast can be played back at any time, and recording is also available in ordinary conferences. Recording is done entirely by the cloud service, with no separate recording server to configure, and recorded content is stored in the cloud, where the recording space expands on demand. The owner of a completed recording can quickly make it available on demand, link to it and share it, so that other users can watch it online anytime and anywhere on any device.
Dual stream: as the name implies, the camera stream and the screen stream are mixed into one stream and displayed on different screens, so that both the camera picture and the screen content can be followed conveniently during the video conference.
Conference mode: during a meeting, in addition to split screen, the platform provides two video display modes: main screen mode and nine-square grid mode.
The above embodiments are only exemplary embodiments of the present invention, and are not intended to limit the present invention, and the scope of the present invention is defined by the claims. Various modifications and equivalents may be made by those skilled in the art within the spirit and scope of the present invention, and such modifications and equivalents should also be considered as falling within the scope of the present invention.

Claims (7)

1. A cloud-based multi-person remote scene secure video conference communication system, characterized in that the architecture of the system is divided into a basic support layer, a conference control and transmission layer, a media layer, a coding and decoding layer and an access layer;
the basic support layer provides XMPP; XMPP carries XML data streams over TCP, there is no central master server, and a customized XMPP server can be run independently on any node;
the control and transmission layer uses the HTTPS protocol to authenticate users and servers and to ensure that data is sent to the correct clients and servers;
the platform of the media layer is implemented on RTC; terminal PCs or large-screen devices taking part in a meeting need no additional client or plug-in, and a compatible browser is all that is required to join the meeting and exchange audio and video;
the platform has built-in connection and transmission solutions for NAT, local area networks, virtual machines, wide area networks and the like, uses the key NAT and firewall traversal technologies STUN, ICE, TURN and RTP-over-TCP, and supports proxies.
The coding and decoding layer uses the VP8/VP9 codecs built into RTC. VP8 and VP9 are open-source standards that can be used free of charge; encoding and decoding are performed in software, so no additional codec equipment needs to be purchased.
The access layer can be reached from any browser that currently supports RTC, with no client or other plug-in to install, over communication links such as the Internet, office networks, government affairs networks, mobile 3G/4G, satellite and microwave, so the system can be used anytime and anywhere.
2. The cloud-based multi-person remote scene secure video conference communication system according to claim 1, wherein: the basic support layer adopts SASL, a mechanism for extending the authentication capability of the client/server (C/S) model; SASL provides a general method for adding authentication support to connection-based protocols, and XMPP uses a generic XML namespace profile to meet the requirements of SASL.
3. The cloud-based multi-person remote scene secure video conference communication system according to claim 1, wherein: the basic support layer employs TLS to provide privacy and data integrity between two communicating applications, the protocol consisting of two layers: the TLS Record Protocol and the TLS Handshake Protocol.
4. The cloud-based multi-person remote scene secure video conference communication system according to claim 1, wherein: the system supports app clients installed on devices such as smartphones, tablets and PCs, covering operating systems such as Windows, Android and iOS.
5. The cloud-based multi-person remote scene secure video conference communication system according to claim 1, wherein: the audio of the system uses Opus coding, supports super-wideband (24 kHz sampling rate) and fullband (48 kHz sampling rate) speech, and applies technologies such as full echo cancellation, noise suppression, reverberation suppression, automatic gain control, sound source localization and beamforming, so as to achieve better sound quality and the best possible listening experience.
6. The cloud-based multi-person remote scene secure video conference communication system according to claim 1, wherein: the system uses a dedicated forward error correction and packet loss retransmission algorithm together with an intelligent regulation mechanism, so that video neither stalls nor shows corrupted frames at 30% network packet loss and can still be transmitted at 50% packet loss, while audio stays clear and smooth at 50% packet loss and remains intelligible at 80% packet loss.
7. The cloud-based multi-person remote scene secure video conference communication system according to claim 1, wherein: the system uses an H.264/H.265 SVC (scalable video coding) architecture; an adaptive call rate accommodates various kinds of network access, bandwidth from 64 kbps to 8 Mbps is detected automatically, and the video SVC layering is adjusted dynamically in real time as the bandwidth changes, ensuring the best possible video experience.

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202011085609.5A CN112653868A (en) 2020-10-12 2020-10-12 Cloud-based multi-person remote scene secure video conference communication system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202011085609.5A CN112653868A (en) 2020-10-12 2020-10-12 Cloud-based multi-person remote scene secure video conference communication system

Publications (1)

Publication Number Publication Date
CN112653868A true CN112653868A (en) 2021-04-13

Family

ID=75347044

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202011085609.5A Pending CN112653868A (en) 2020-10-12 2020-10-12 Cloud-based multi-person remote scene secure video conference communication system

Country Status (1)

Country Link
CN (1) CN112653868A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115378915A (en) * 2022-08-11 2022-11-22 深圳乐播科技有限公司 Data sharing method and electronic equipment
CN115378915B (en) * 2022-08-11 2024-01-16 深圳乐播科技有限公司 Data sharing method and electronic equipment

Legal Events

Date Code Title Description
PB01 Publication
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20210413