CN108063911B - Video conference capacity expansion method - Google Patents

Video conference capacity expansion method Download PDF

Info

Publication number
CN108063911B
CN108063911B CN201711492238.0A CN201711492238A CN108063911B CN 108063911 B CN108063911 B CN 108063911B CN 201711492238 A CN201711492238 A CN 201711492238A CN 108063911 B CN108063911 B CN 108063911B
Authority
CN
China
Prior art keywords
video
conference
user
audio
conference room
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201711492238.0A
Other languages
Chinese (zh)
Other versions
CN108063911A (en
Inventor
彭霄
赵建华
徐晓峰
李夏宾
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
GRANDSTREAM NETWORKS Inc
Shenzhen Grandstream Networks Technologies Co ltd
Original Assignee
GRANDSTREAM NETWORKS Inc
Shenzhen Grandstream Networks Technologies Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by GRANDSTREAM NETWORKS Inc, Shenzhen Grandstream Networks Technologies Co ltd filed Critical GRANDSTREAM NETWORKS Inc
Priority to CN201711492238.0A priority Critical patent/CN108063911B/en
Publication of CN108063911A publication Critical patent/CN108063911A/en
Application granted granted Critical
Publication of CN108063911B publication Critical patent/CN108063911B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00Television systems
    • H04N7/14Systems for two-way working
    • H04N7/15Conference systems
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/21Server components or server architectures
    • H04N21/218Source of audio or video content, e.g. local disk arrays
    • H04N21/2187Live feed
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/80Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N21/83Generation or processing of protective or descriptive data associated with content; Content structuring
    • H04N21/845Structuring of content, e.g. decomposing content into time segments
    • H04N21/8456Structuring of content, e.g. decomposing content into time segments by decomposing the content in the time domain, e.g. in time segments
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/80Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N21/85Assembly of content; Generation of multimedia applications
    • H04N21/854Content authoring
    • H04N21/85406Content authoring involving a specific file format, e.g. MP4 format
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00Television systems
    • H04N7/14Systems for two-way working
    • H04N7/15Conference systems
    • H04N7/152Multipoint control units therefor
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00Television systems
    • H04N7/14Systems for two-way working
    • H04N7/15Conference systems
    • H04N7/155Conference systems involving storage of or access to video conference sessions

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Computer Security & Cryptography (AREA)
  • Databases & Information Systems (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
  • Telephonic Communication Services (AREA)

Abstract

The invention discloses a video conference capacity expansion method, which mainly comprises the following operations: step a, successively creating a small conference room and a large conference room, wherein the small conference room is used for users who exchange audio and video in real time, the large conference room is used for meeting in a live broadcasting mode, and the large conference room is mainly used for users who only receive audio and video data; b, encrypting and transmitting the user audio and video data of the small conference room to a media server, decrypting the user audio and video data by the media server, transmitting the decrypted user audio and video data to an MCU for transcoding, and transmitting the transcoded user audio and video data back to the media server, wherein the media server transmits the transcoded user audio and video data to each user of the small conference room; and step c, the user in the conference room only answers the audio and the video and does not transmit the audio and the video to the media server. The invention combines the traditional real-time conference architecture and the live broadcast technology, provides the architecture thought of large and small conference rooms, and can well meet the requirements of large conference rooms. Compared with the traditional architecture, the architecture of the scheme has the advantages of relatively low technical difficulty, low cost and strong practicability.

Description

Video conference capacity expansion method
Technical Field
The invention belongs to the technical field of network telephones, and particularly relates to a control management technology of a network telephone conference.
Background
With the rapid development of the internet and the continuous expansion of the enterprise scale, the low-cost multi-user real-time communication between different countries becomes an urgent demand of a plurality of enterprises, a large-scale international conference is continuously created, the demand of common users for multimedia experience is upgraded, the rapid development of a video conference technology is promoted, the cloud service is mature day by day, and the rapid popularization of a real-time cloud conference system is promoted.
Fig. 1 is a diagram of a conventional capacity expansion network of a conference room in the prior art. As shown in the figure, core components of the existing real-time cloud video conference technology are mainly a media server, a signaling server, an MCU and the like, the cloud server depends on a third-party cloud server provider, but if the current real-time high-definition video technology architecture supports more than ten thousand people, the adjustment difficulty is very large, the structural design of the server needs to be greatly changed, especially, series performance problems caused by bridging of the media server and the like, and problems such as maintenance cost and the like need to be effectively solved only by accumulating technical reserves of a technical team to a certain extent, and the cost pressure caused by capacity expansion of the MCU and the media server can be increased by times.
If the existing manpower and technology reserves can be reserved on the basis of the existing architecture, under the condition that the operation cost is controllable, the online real-time high-definition video conference of tens of thousands of levels is supported in a relatively short time, and the cost sharing and profit increase of enterprises have practical requirements.
Disclosure of Invention
The invention aims to provide a video conference capacity expansion method, which can realize online real-time high-definition video conferences of over ten thousand levels under the condition that the existing server is not greatly adjusted.
In order to achieve the above object, the present invention discloses a video conference capacity expansion method, which mainly comprises the following operations: a, successively creating a small conference room and a large conference room, wherein the small conference room is aimed at users who exchange audio and video in real time, the large conference room is conferred in a live broadcast mode, and is mainly aimed at users who only receive audio and video data; b, encrypting and transmitting the user audio and video data of the small conference room to a media server, decrypting the user audio and video data by the media server, transmitting the decrypted user audio and video data to an MCU for transcoding, and transmitting the transcoded user audio and video data back to the media server, wherein the media server transmits the transcoded user audio and video data to each user of the small conference room; and c, the user in the conference room only answers the audio and the video and does not transmit the audio and the video to the media server.
Preferably, the MCU transcoding specifically means: the audio is transcoded into AAC codes, the video is transcoded into H264 codes, and the video generates a plurality of groups of video streams with different resolutions and different code rates according to the situation.
Preferably, the media server encapsulates the audio and video data transcoded by the MCU into a plurality of mp4 format fragment files, and generates an mpd file according to the specification of mpeg. The mp4 format fragment file and the mpd file are transmitted to each node server through the CDN network.
Preferably, the large conference room requests the mpd file according to the address of the node server obtained in the conference entering stage, and the user downloads the corresponding mp4 file according to the information carried in the requested mpd file and performs decoding and playing.
Preferably, the information carried in the mpd file includes: video length, code rate, resolution, video clip URL address and other information.
Preferably, the client performs adaptive downloading of files with corresponding code rates according to respective network conditions and media files with different code rates configured in the mpd.
Preferably, the client can predict the bandwidth required by the next downloaded fragment according to the bandwidth condition of downloading the current mp4 fragment, and select the next downloaded file according to the predicted bandwidth, thereby achieving the purpose of self-adaptive downloading.
Preferably, the user can select to enter the conference with a common role or a role code on the login interface, when the role code is selected to enter the conference, the user firstly sends a REGISTER signaling to REGISTER to the signaling server, then initiates a conference entering signaling request to the signaling server, carries the corresponding role code, joins in a small conference room and creates a media channel.
Preferably, when the general role is selected to enter the conference, the user REGISTERs by sending REGISTER signaling to the signaling server, and then requests the signaling server for the address of the node server, and the signaling server sends NOTIFY to NOTIFY the allocated address of the node server after the network condition and the node load condition of the user are evaluated.
The invention provides a video conference capacity expansion method, which avoids large capacity expansion of a media server and an MCU (microprogrammed control unit), reduces the high cost expenditure of the media server and the MCU server, realizes the large capacity expansion of a video conference system by a live broadcast technology Mpeg dash, greatly saves the development and maintenance cost and greatly shortens the development period.
Drawings
FIG. 1 is a diagram of a conventional capacity expansion network of a conference room in the prior art;
fig. 2 is a diagram of a network architecture for capacity expansion of large and small conference rooms in an embodiment of the present invention.
Detailed Description
The basic principle of the invention is as follows: under the condition that the existing server architecture is not adjusted greatly, the method adopts a part of technology related to live broadcast: and the MpegDash converts the media stream of the existing system into a stream form supported by the MpegDash, and realizes large-capacity expansion of the video conference system by combining with less delay of the CDN and the parameter tuning.
In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention is further described in detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are merely illustrative of the invention and do not limit the invention.
Compared with the prior art, the network architecture of the scheme of the invention has the main differences that: the traditional real-time video conference system supports a large-capacity conference technical scheme. The conventional real-time video conference system needs to support large-capacity conferences, such as: when more than one thousand or ten thousand people meet, a plurality of media servers are needed to be deployed at the same time for media processing, and the aim of media distribution on participants of more than one thousand or ten thousand people in one meeting room is achieved through bridges among the media servers. The media server has high requirements on hardware and high hardware cost. The requirement of a large-capacity conference on media processing is high, and the existing mainstream open-source media server (such as an asterisk) can only support parallel processing of high-definition videos of hundreds of people generally through conference bridging. To support a larger capacity of media processing capability, the media server needs to be modified in architecture, which is difficult in technology.
The real-time video conference system has high requirements on real-time performance, and common live broadcast technologies are limited by various technical limitations: the real-time performance of the RTMP live broadcast technology is relatively good in the live broadcast technology, but the RTMP live broadcast technology must depend on the flash technology, needs additional plug-in support, increases the complexity of a scheme and increases the operation complexity of a user; the HLS live broadcast technology can be directly played by a mainstream browser without depending on a plug-in, but the main disadvantage of the technology is that the delay is too high to meet the requirement of a real-time video conference.
Another live broadcast technique: mpeg dash does not need plug-in support, a mainstream browser (chrome/firefox/safari/edge/ie) only needs to contain a dash file to well support playing, video delay is relatively low, delay is reduced to be within 1 second through parameter optimization, and the requirements of a real-time conference such as a mode of only listening to no words in a specific scene can be basically met.
Fig. 2 is a diagram of a network architecture for capacity expansion of large and small conference rooms in an embodiment of the present invention. As can be seen from the figure, in the embodiment of the present invention, the architecture of the video conference mainly includes the following core modules: media server, MCU, signaling server, CDN network, and node server.
Wherein the media server: the media server is responsible for forwarding and processing the media data; MCU: the function of coding and decoding related to transcoding of media and the like is responsible; a signaling server: is responsible for processing and distributing message signaling; CDN network: the content distribution network is mainly characterized in that the node servers are deployed at key positions on the network, and the requests of users are redirected to the node servers closest to the users according to key information such as network flow, load conditions and the like, so that the file downloading speed of the users is fastest, and the delay is lowest. A node server: and is responsible for storing and distributing files.
In the embodiment of the present invention, the conference communication method based on the network architecture mainly includes the following steps:
step 1: a conference room is created. The user appoints a large conference on the Web management platform, and when the appointment comes, a meeting place of a small conference room is established first, the media server, the signaling server and the MCU corresponding to the small conference room perform resource allocation, then a large conference room is established, and related resources of the large conference room are established: independent coding and decoding channels, audio and video packaging channels and the like required in a large conference room. The small conference room is a conference room with a traditional structure and mainly aims at users with high real-time requirements or users needing to carry out real-time audio and video communication, the size of the conference room is fixed below 200 people, the large conference room is a conference room which is accessed in a new structure in a live broadcast mode and mainly aims at users only receiving audio and video data, and the users have no strict requirements on real-time performance.
Step 2: the user enters the conference room. The user can choose to enter the meeting with a common role or a role code in the login interface. If selecting role code joining, the user firstly sends REGISTER signaling to REGISTER to the signaling server, and then initiates a joining signaling request to the signaling server, namely sends INVITE signaling, which carries corresponding role code. Then, SDP media negotiation such as media type, media port, key, etc. is performed. After the signaling negotiation is passed, the signaling server, according to the role code type of the user, is as follows: the moderator or workgroup to assign different meeting place rights including video, shared screen, etc., while joining the small conference room and creating a media channel. If the user selects the common role to enter the conference, the user REGISTERs by sending a REGISTER signaling to the signaling server, and then requests the signaling server for the address of the node server: and sending INFO to the signaling server, replying 200 OK by the server, and sending NOTIFY to inform the address of the node server allocated by the user after the signaling server evaluates the network condition and the node load condition of the user. The above evaluated network conditions include: packet loss rate/network delay/whether the number of node server connections reaches a threshold, etc.
And step 3: audio and video transmission in small conference rooms. And finally, the media server encrypts and transmits the audio and video of the user in each conference room again according to different user keys in the conference rooms.
And 4, step 4: audio and video transmission in large conference rooms. For a user in a large conference room, the user only answers the audio and the video and does not transmit the audio and the video to the media server, and the audio and video data in the small conference room are transcoded by the MCU and then transmitted back to the media server. The MCU transcoding specifically refers to: the audio is transcoded into AAC codes, the video is transcoded into H264 codes, and the video generates several groups of video streams with different resolutions and different code rates according to the package condition, wherein the video streams are 360P 400kbps/480P 800kbps/720P 1.5mbps. And the media server packages the returned audio and video data into a plurality of mp4 format fragment files and generates an mpd file according to the specification of mpeg. And transmitting the mp4 fragment file and the mpd file to each node server through the CDN.
The method comprises the following steps that a user in a large conference room requests an mpd file according to an address of a node server acquired in a conference entering stage, and the user requests information carried in the mpd file according to the requested information, such as: and downloading information such as video length, code rate, resolution, video clip URL address and the like, downloading a corresponding mp4 file, and decoding and playing. The client side carries out self-adaptive downloading of files with corresponding code rates according to respective network conditions and media files with different code rates configured in the mpd. And predicting the bandwidth required by the next downloaded fragment according to the bandwidth condition of downloading the current mp4 fragment, and selecting the next downloaded file according to the predicted bandwidth, thereby achieving the purpose of self-adaptive downloading.
The audio and video delay problem is the main problem of the live broadcast technology, and the delay can be reduced to an acceptable range by adjusting the GOP interval of the video for tuning. Such as: adjusting the GOP to 1-2s or lower can effectively reduce the delay and the code rate will increase accordingly. If the code rate is to be further reduced, GDR coding may be employed.
The above description is only for the purpose of illustrating the preferred embodiments of the present invention and is not to be construed as limiting the invention, and any modifications, equivalents and improvements made within the spirit and principle of the present invention are intended to be included within the scope of the present invention.

Claims (8)

1. A video conference capacity expansion method is characterized by comprising the following steps:
step a, successively creating a small conference room and a large conference room, wherein the small conference room is used for users who exchange audio and video in real time, the large conference room is accessed in a live broadcasting mode, and specifically comprises the following steps of: a user appoints a large conference on a Web management platform; meanwhile, the user can select to enter the conference by a common role or a role code on a login interface, when the role code is selected to enter the conference, the user firstly sends a REGISTER signaling to REGISTER to a signaling server, then initiates a conference entering signaling request to the signaling server, carries a corresponding role code, joins the conference room and creates a media channel; when the common role is selected to enter the conference, a user REGISTERs by sending a REGISTER signaling to the signaling server, then requests a node server address from the signaling server, and the signaling server sends a NOTIFY to NOTIFY the distributed node server address after the network condition and the node load condition of the user are evaluated;
b, encrypting and transmitting user audio and video data of the small conference room to a media server, decrypting the user audio and video data by the media server, transmitting the decrypted user audio and video data to an MCU for transcoding, and transmitting the transcoded user audio and video data back to the media server, wherein the media server transmits the transcoded user audio and video data to each user of the small conference room;
and c, the user in the large conference room only answers the audio and the video and does not transmit the audio and the video to the media server.
2. The video conference expansion method according to claim 1, wherein the MCU transcoding specifically refers to: the audio is transcoded into AAC codes, the video is transcoded into H264 codes, and the video generates a plurality of groups of video streams with different resolutions and different code rates according to the situation.
3. The video conference capacity expansion method according to claim 2, wherein the media server encapsulates audio and video data transcoded by the MCU into a plurality of mp4 format fragment files, and generates an mpd file according to the specification of mpeg.
4. The video conference capacity expansion method according to claim 3, wherein the mp4 format fragment file and the mpd file are delivered to each node server through a CDN network.
5. The video conference expansion method according to claim 4, wherein the mpd file is requested by the large conference room according to the address of the node server obtained at the conference entering stage, and the user downloads the corresponding mp4 file according to the information carried in the requested mpd file and performs decoding and playing.
6. The video conference capacity expansion method according to claim 5, wherein the information carried in the mpd file comprises: video length, bitrate, resolution and video clip URL address.
7. The video conference capacity expansion method according to claim 6, wherein the client adaptively downloads files with corresponding bit rates according to respective network conditions and media files with different bit rates configured in the mpd.
8. The video conference expansion method according to claim 7, wherein the client can predict the bandwidth required by the next downloaded segment according to the bandwidth condition of downloading the current mp4 segment, and select the next downloaded file according to the predicted bandwidth, thereby achieving the purpose of adaptive downloading.
CN201711492238.0A 2017-12-30 2017-12-30 Video conference capacity expansion method Active CN108063911B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201711492238.0A CN108063911B (en) 2017-12-30 2017-12-30 Video conference capacity expansion method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201711492238.0A CN108063911B (en) 2017-12-30 2017-12-30 Video conference capacity expansion method

Publications (2)

Publication Number Publication Date
CN108063911A CN108063911A (en) 2018-05-22
CN108063911B true CN108063911B (en) 2022-11-11

Family

ID=62141200

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201711492238.0A Active CN108063911B (en) 2017-12-30 2017-12-30 Video conference capacity expansion method

Country Status (1)

Country Link
CN (1) CN108063911B (en)

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108881801B (en) * 2018-06-29 2020-11-24 苏州科达科技股份有限公司 Code stream transmission method, system, electronic equipment and storage medium for video conference
CN109257344B (en) * 2018-09-06 2021-01-26 广州高清视信数码科技股份有限公司 WebRTC media gateway based on Docker container technology and interworking method thereof
CN111385515B (en) * 2018-12-27 2022-01-11 北京紫荆视通科技有限公司 Video conference data transmission method and video conference data transmission system
CN111669606A (en) * 2019-03-09 2020-09-15 苏州知了信息科技有限公司 Live broadcast system for offline and online synchronous teaching
CN114567747A (en) * 2020-11-27 2022-05-31 北京新媒传信科技有限公司 Conference data transmission method and conference system

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101064823A (en) * 2006-04-30 2007-10-31 互联天下科技发展(深圳)有限公司 Video interdynamic network living broadcast method
US20110137438A1 (en) * 2009-12-07 2011-06-09 Vimicro Electronics Corporation Video conference system and method based on video surveillance system
CN102571848B (en) * 2010-12-27 2014-11-12 佛山络威网络技术有限公司 P2P (Peer-to-Peer)-based cascade method for hyper-scale conference
CN102281460A (en) * 2011-08-18 2011-12-14 宋健 Network television live broadcasting method and system implemented on basis of video conference
CN103248862A (en) * 2013-05-11 2013-08-14 中煤科工集团武汉设计研究院 Video conference system for long distance pipeline coal transportation and preparation method of system equipment
CN104754284B (en) * 2013-12-26 2018-08-10 中国移动通信集团公司 A kind of live broadcast of video conference method, equipment and system
CN103974147A (en) * 2014-03-07 2014-08-06 北京邮电大学 MPEG (moving picture experts group)-DASH protocol based online video playing control system with code rate switch control and static abstract technology

Also Published As

Publication number Publication date
CN108063911A (en) 2018-05-22

Similar Documents

Publication Publication Date Title
CN108063911B (en) Video conference capacity expansion method
US11770564B2 (en) Systems, devices and methods for distributing data with multi-tiered encoding
US10827223B2 (en) Systems and methods for transmission of data streams
US9544628B2 (en) Method and apparatus for network based adaptive streaming
She et al. IPTV over WiMAX: Key success factors, challenges, and solutions [advances in mobile multimedia]
KR101983432B1 (en) Devices, systems, and methods for converting or translating dynamic adaptive streaming over http(dash) to http live streaming(hls)
US8854418B2 (en) Integrated media gateway processing and control to reduce latency for 2-way video conference applications
CN101682355B (en) Method and apparatus providing scalability for channel change requests in a switched digital video system
TW201603546A (en) Unicast ABR streaming
US10499094B2 (en) Transmission apparatus, transmitting method, reception apparatus, and receiving method
US8316108B2 (en) Method and apparatus for obtaining media over a communications network
CN104093088A (en) System and method for achieving self-adaptive stream media play control
US9270937B2 (en) Real time stream provisioning infrastructure
US20150200987A1 (en) System and methods for dynamic transcoder rate adaption for adaptive bit rate streaming
CN111131759B (en) Real-time multimedia transmission system and use method thereof
KR100502186B1 (en) HDTV internet broadcast service system
CN103701827A (en) Method for realizing stream media play based on self-adaption stream media service gateway system
CN105493513B (en) Content providing, method, terminal device, system and storage medium
Gilani et al. QoENGN: A QoE Framework for Video Streaming over Next Generation Mobile Networks
KR100805805B1 (en) Apparatus and method of dynamic processing of scalable information for scalable video coding
Djuric et al. GSTV: An Integrated, Adaptive and Scalable Digital Multimedia Content Distribution System
Hysenbelliu Towards an Enhanced Quality of IPTV Media Server Architecture over Software Defined Networking
WO2009103346A1 (en) Method and apparatus for obtaining media over a communications network

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant