CN116847128B - Video superposition processing method based on 5G VoLTE video teleconference - Google Patents

Video superposition processing method based on 5G VoLTE video teleconference Download PDF

Info

Publication number
CN116847128B
CN116847128B CN202311126945.3A CN202311126945A CN116847128B CN 116847128 B CN116847128 B CN 116847128B CN 202311126945 A CN202311126945 A CN 202311126945A CN 116847128 B CN116847128 B CN 116847128B
Authority
CN
China
Prior art keywords
video
real
time
stream
audio
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202311126945.3A
Other languages
Chinese (zh)
Other versions
CN116847128A (en
Inventor
张现
龚盖
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhongke Rongxin Technology Co ltd
Original Assignee
Zhongke Rongxin Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zhongke Rongxin Technology Co ltd filed Critical Zhongke Rongxin Technology Co ltd
Priority to CN202311126945.3A priority Critical patent/CN116847128B/en
Publication of CN116847128A publication Critical patent/CN116847128A/en
Application granted granted Critical
Publication of CN116847128B publication Critical patent/CN116847128B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/234Processing of video elementary streams, e.g. splicing of video streams, manipulating MPEG-4 scene graphs
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L65/00Network arrangements, protocols or services for supporting real-time applications in data packet communication
    • H04L65/40Support for services or applications
    • H04L65/403Arrangements for multi-party communication, e.g. for conferences
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L65/00Network arrangements, protocols or services for supporting real-time applications in data packet communication
    • H04L65/60Network streaming of media packets
    • H04L65/65Network streaming protocols, e.g. real-time transport protocol [RTP] or real-time control protocol [RTCP]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L65/00Network arrangements, protocols or services for supporting real-time applications in data packet communication
    • H04L65/60Network streaming of media packets
    • H04L65/75Media network packet handling
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/21Server components or server architectures
    • H04N21/218Source of audio or video content, e.g. local disk arrays
    • H04N21/2187Live feed
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/233Processing of audio elementary streams
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/60Network structure or processes for video distribution between server and client or between remote clients; Control signalling between clients, server and network components; Transmission of management data between server and client, e.g. sending from server to client commands for recording incoming content stream; Communication details between server and client 
    • H04N21/61Network physical structure; Signal processing
    • H04N21/6106Network physical structure; Signal processing specially adapted to the downstream path of the transmission network
    • H04N21/6131Network physical structure; Signal processing specially adapted to the downstream path of the transmission network involving transmission via a mobile phone network
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/60Network structure or processes for video distribution between server and client or between remote clients; Control signalling between clients, server and network components; Transmission of management data between server and client, e.g. sending from server to client commands for recording incoming content stream; Communication details between server and client 
    • H04N21/61Network physical structure; Signal processing
    • H04N21/6156Network physical structure; Signal processing specially adapted to the upstream path of the transmission network
    • H04N21/6181Network physical structure; Signal processing specially adapted to the upstream path of the transmission network involving transmission via a mobile phone network
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/60Network structure or processes for video distribution between server and client or between remote clients; Control signalling between clients, server and network components; Transmission of management data between server and client, e.g. sending from server to client commands for recording incoming content stream; Communication details between server and client 
    • H04N21/63Control signaling related to video distribution between client, server and network components; Network processes for video distribution between server and clients or between remote clients, e.g. transmitting basic layer and enhancement layers over different transmission paths, setting up a peer-to-peer communication via Internet between remote STB's; Communication protocols; Addressing
    • H04N21/643Communication protocols
    • H04N21/6437Real-time Transport Protocol [RTP]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00Television systems
    • H04N7/14Systems for two-way working
    • H04N7/15Conference systems

Abstract

The application discloses a video superposition processing method based on a 5G VoLTE video teleconference, which relates to the technical field of 5G communication, and comprises the following steps: acquiring input full media content; video rendering and superposition processing are carried out on the full-media content to obtain a real-time audio-video stream, and the real-time audio-video stream is input to an internal stream server; and transmitting the real-time audio and video stream to a VoLTE end of a 5G core network through a converged communication gateway. The method can perform video rendering and superposition processing on the input full-media content to obtain real-time audio and video streaming, and the processed real-time audio and video streaming reaches the VoLTE end of the 5G core network through the converged communication gateway, so that a video teleconference APP is not required to be installed, and the video teleconference under the 5G environment is realized.

Description

Video superposition processing method based on 5G VoLTE video teleconference
Technical Field
The application relates to the technical field of 5G communication, in particular to a video superposition processing method and system based on a 5G VoLTE video teleconference.
Background
With the continuous development of society, the requirements of various communities of society for communication are also higher and higher, and particularly, the emergence of smart phones makes video phones and video teleconferences an increasingly popular communication mode. The video teleconference systems at present are various, but most video data and voice signals in the video teleconference systems are realized in a network, and a corresponding video teleconference APP needs to be installed, and before entering the video teleconference, a corresponding conference number needs to be input into the APP to enter the corresponding video teleconference, so that the operation is complex, and the user experience is poor.
Disclosure of Invention
Aiming at the defects in the prior art, the video superposition processing method and the system based on the 5G VoLTE video teleconference can convert full-media content video rendering and superposition into real-time video stream to be transmitted to the VoLTE end of the 5G core network, and the video teleconference APP is not required to be installed, so that the video teleconference under the 5G environment is realized.
In a first aspect, a video overlay processing method based on a 5G VoLTE video teleconference provided by an embodiment of the present application includes:
acquiring input full media content;
video rendering and superposition processing are carried out on the full-media content to obtain a real-time audio-video stream, and the real-time audio-video stream is input to an internal stream server;
and transmitting the real-time audio and video stream to a VoLTE end of a 5G core network through a converged communication gateway.
In a second aspect, a video overlay processing system based on a 5G VoLTE video teleconference provided by an embodiment of the present application includes: an acquisition module, a video superposition module and a terminal module,
the acquisition module is used for acquiring input full media content;
the video superposition module is used for performing video rendering and superposition processing on the all-media content to obtain a real-time audio and video stream, and inputting the real-time audio and video stream into the internal stream server;
the terminal module is used for receiving the real-time audio and video stream sent by the video superposition module or sending the audio and video data to the video superposition module.
The application has the beneficial effects that:
according to the video superposition processing method and system based on the 5G VoLTE video teleconference, video rendering and superposition processing can be carried out on input full-media content to obtain real-time audio and video streaming, and the processed real-time audio and video streaming reaches the VoLTE end of the 5G core network through the converged communication gateway, so that the video teleconference APP is not required to be installed, and the video teleconference in the 5G environment is realized.
Drawings
In order to more clearly illustrate the embodiments of the present application or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below. Like elements or portions are generally identified by like reference numerals throughout the several figures. In the drawings, elements or portions thereof are not necessarily drawn to scale.
Fig. 1 shows a flowchart of a video overlay processing method based on a 5G VoLTE video teleconference according to a first embodiment of the present application;
fig. 2 is a block diagram of a video overlay processing system based on a 5G VoLTE video teleconference according to another embodiment of the present application.
Detailed Description
The following description of the embodiments of the present application will be made clearly and fully with reference to the accompanying drawings, in which it is evident that the embodiments described are some, but not all embodiments of the application. All other embodiments, which can be made by those skilled in the art based on the embodiments of the application without making any inventive effort, are intended to be within the scope of the application.
It should be understood that the terms "comprises" and "comprising," when used in this specification and the appended claims, specify the presence of stated features, integers, steps, operations, elements, and/or components, but do not preclude the presence or addition of one or more other features, integers, steps, operations, elements, components, and/or groups thereof.
It is also to be understood that the terminology used in the description of the application herein is for the purpose of describing particular embodiments only and is not intended to be limiting of the application. As used in this specification and the appended claims, the singular forms "a," "an," and "the" are intended to include the plural forms as well, unless the context clearly indicates otherwise.
It should be further understood that the term "and/or" as used in the present specification and the appended claims refers to any and all possible combinations of one or more of the associated listed items, and includes such combinations.
As used in this specification and the appended claims, the term "if" may be interpreted as "when..once" or "in response to a determination" or "in response to detection" depending on the context. Similarly, the phrase "if a determination" or "if a [ described condition or event ] is detected" may be interpreted in the context of meaning "upon determination" or "in response to determination" or "upon detection of a [ described condition or event ]" or "in response to detection of a [ described condition or event ]".
It is noted that unless otherwise indicated, technical or scientific terms used herein should be given the ordinary meaning as understood by one of ordinary skill in the art to which this application belongs.
As shown in fig. 1, a flowchart of a video overlay processing method based on a 5G VoLTE video teleconference according to a first embodiment of the present application is shown, including the following steps:
acquiring input full media content;
video rendering and superposition processing are carried out on the full-media content to obtain a real-time audio-video stream, and the real-time audio-video stream is input to an internal stream server;
and transmitting the real-time audio and video stream to a VoLTE end of a 5G core network through a converged communication gateway.
In this embodiment, the full media content includes a camera real-time stream, a short video, a media live stream, and/or an on-demand stream, and supported protocols RTMP (real-time messaging protocol), RTSP (real-time streaming protocol), FLV (FLASH VIDEO), m3u8, GB28181, and the like. And transcoding, mixing, overlapping and overlapping the full media content with DTMF (dual tone multiple frequency) keys to obtain a real-time audio and video stream, storing the real-time audio and video stream by a streaming server, transmitting the real-time video stream to a VoLTE (Voice over Long term evolution) end of a 5G core network by the streaming server through an RTP/RTCP (real-time transport protocol) protocol by a converged communication gateway. RTP/RTCP protocol stack: the real-time transport protocol RTP is a transport protocol for multimedia data streams over the Internet, which is defined to operate in one-to-one or one-to-many transmission situations, with the purpose of providing time information and achieving stream synchronization. RTP itself only ensures the transmission of real-time data and does not provide a reliable transport mechanism for the sequential delivery of packets nor flow control or congestion control, which requires the provision of these services by means of RTCP. The real-time transport control protocol RTCP is responsible for managing the quality of the transmission for exchanging control information between the current application processes. During an RTP session, each participant periodically transmits RTCP packets containing statistics of the number of packets sent, the number of packets lost, etc., so that the server can use this information to dynamically change the transmission rate and even the payload type. RTP and RTCP are used in combination to optimize transmission efficiency with more efficient feedback and minimal overhead.
Specifically, the specific method for performing video rendering and superposition processing on the full media content to obtain the real-time audio/video stream comprises the following steps:
performing resolution scaling, picture rotation and code rate adjustment on the video to obtain an adjusted video; and performing audio format conversion on the audio in the adjusted video to obtain converted audio, such as AAC-AMR-WB and the like. And carrying out screen mixing processing on the multiple paths of real-time video streams to obtain one path of real-time video stream.
The audio and video can be further processed in the later period conveniently by adjusting corresponding parameters and converting audio formats of the video.
Specifically, the method for carrying out screen mixing processing on multiple paths of real-time video streams to obtain one path of real-time video stream comprises the following steps:
decoding each path of real-time video stream in the multipath real-time video streams in a concurrent decoding mode, decoding the real-time video streams into a video frame sequence, synthesizing images of a plurality of video frames through a video synthesis technology to generate a new video frame sequence, encoding the new video frame sequence through an encoder, outputting one path of real-time video stream in a target format, and packaging the one path of real-time video stream into a real-time transmission protocol stream.
The data volume of the mixed screen processing of the multipath real-time video is large, the real-time requirement is high, and the decoding capability of the server is required to be very high. The ARM server with super-strong computing power has 1024 computing power, supports 500 paths of cloud mobile phone examples to run concurrently, and is also applicable to AI computing and edge computing scenes because a processor is internally provided with a 6T OPS special NPU. In order to improve the concurrency, the concurrency decoding at least comprises two stages of concurrency, wherein one stage is the concurrency decoding of a plurality of real-time stream decoders, and the second stage is the concurrency decoding of the Frame level of the decoder. And the decoders respectively process different video sources and respectively and independently output data frames, and the data frames are stored in a queue mode for the subsequent modules to call. The decoders are mutually independent, have no restriction relation, can be processed concurrently, and improve the CPU utilization rate. And the decoder adopts a thread pool mode to decode the concurrent of the Frame level. When the decoder is initialized, a corresponding number of decoding threads are initialized according to the number of CPU cores of the server. The decoding thread inputs data and decodes the data.
And a plurality of decoders are adopted to respectively process the multipath real-time video streams to obtain independently output video frames, and the independently output video frames are stored as video frame queues so as to facilitate the subsequent processing. According to the pixel proportion of the target output video, calculating the actual size of the video picture occupied by the target picture, performing scaling operation on the video frame, checking the color space format of the image, judging whether the image is in YUV format, if not, unifying the color space format into YUV format, and adding the processed video frame into the video frame queue to be processed from the tail of the queue. Thus, a plurality of YUV images can be synthesized into a YUV image, and three components of the image can be rearranged and combined into a new image according to the arrangement positions of the images.
The specific method for synthesizing the images of a plurality of video frames by the video synthesis technology comprises the following steps: accessing a video frame queue to be processed, taking out a video frame from the head of the queue as a reference sequence, keeping the number of the video frames consistent with the reference sequence in a frame inserting or discarding mode for the rest video frame sequence in unit time, and taking the time stamp of the reference sequence as the time stamp of the synthesized video frame sequence. In order to improve video quality, when inserting or discarding data frames, processing can be performed according to the estimation result of real motion of the video object. And taking the combined video frame sequence as original data, inputting the original data into an encoder, encoding the video frame sequence by the encoder according to parameters set by target output, converting the encoded video frame sequence into one real-time video stream after screen mixing, and packaging the one real-time video stream into a real-time transmission protocol stream. The converged communication gateway transmits the real-time transmission protocol stream to the VoLTE end of the 5G core network. The converged communication is a gateway service for the intercommunication of the video engine and the 5G core network, and the data is transmitted to the 5G core network in a low-delay communication mode, so that the audio and video data sent by the original terminal VoLTE is also supported to be received, and the bidirectional audio and video function is realized.
After loading the full media content, superposing a DTMF menu style through a dynamic menu technology, rendering a current page, rendering a DTMF menu in real time, and supporting the action behavior of the DTMF menu at the same time: such as: returning to the previous, next, collection, transfer, download, etc., some menu actions may invoke external interface modes. And converting the overlapped pictures into a real-time video stream. Superimposing the full media content includes: superposing short video links, setting a display area, and forming a picture-in-picture form; one or more cameras are used for superposing real-time streams and direct on-demand streams, setting a display area, forming a picture-in-picture form and the like. And the DTMF menu style is overlapped by a dynamic menu technology, so that the picture layout style can be flexibly set.
According to the video superposition processing method based on the 5G VoLTE video teleconference, video rendering and superposition processing can be carried out on input full-media content to obtain real-time audio and video streaming, and the processed real-time audio and video streaming can reach the VoLTE end of the 5G core network through the converged communication gateway, so that the video teleconference APP is not required to be installed, and the video teleconference in the 5G environment is realized.
In the first embodiment, a video overlay processing method based on a 5G VoLTE video teleconference is provided, and correspondingly, the application also provides a video overlay processing system based on the 5G VoLTE video teleconference. Fig. 2 is a block diagram of a video overlay processing system based on a 5G VoLTE video teleconference according to a second embodiment of the present application. Since the apparatus embodiments are substantially similar to the method embodiments, the description is relatively simple, and reference is made to the description of the method embodiments for relevant points. The device embodiments described below are merely illustrative.
Referring to fig. 2, a block diagram of a video overlay processing system based on a 5G VoLTE video teleconference according to another embodiment of the present application is shown, where the system includes: the system comprises an acquisition module, a video superposition module and a terminal module, wherein the acquisition module is used for acquiring input full-media content; the video superposition module is used for performing video rendering and superposition processing on the all-media content to obtain a real-time audio and video stream, and inputting the real-time audio and video stream into the internal stream server; the terminal module is used for receiving the real-time audio and video stream sent by the video superposition module or sending the audio and video data to the video superposition module.
According to the video superposition processing system based on the 5G VoLTE video teleconference, video rendering and superposition processing can be carried out on input full-media content to obtain real-time audio and video streaming, and the processed real-time audio and video streaming can reach the VoLTE end of the 5G core network through the converged communication gateway, so that the video teleconference under the 5G environment is realized without installing a video teleconference APP.
Finally, it should be noted that: the above embodiments are only for illustrating the technical solution of the present application, and not for limiting the same; although the application has been described in detail with reference to the foregoing embodiments, it will be understood by those of ordinary skill in the art that: the technical scheme described in the foregoing embodiments can be modified or some or all of the technical features thereof can be replaced by equivalents; such modifications and substitutions do not depart from the spirit of the application, and are intended to be included within the scope of the appended claims and description.

Claims (4)

1. A video superposition processing method based on a 5G VoLTE video teleconference is characterized by comprising the following steps:
acquiring input full media content;
video rendering and superposition processing are carried out on the full-media content to obtain a real-time audio-video stream, and the real-time audio-video stream is input to an internal stream server;
transmitting the real-time audio and video stream to a VoLTE end of a 5G core network through a converged communication gateway;
the full media content includes: camera real-time stream, short video, media live stream and/or on-demand stream;
the specific method for performing video rendering and superposition processing on the full media content to obtain the real-time audio/video stream comprises the following steps:
performing resolution scaling, picture rotation and code rate adjustment on the video to obtain an adjusted video;
performing audio format conversion on the audio in the adjusted video to obtain converted audio;
the specific method for performing video rendering and superposition processing on the full media content to obtain the real-time audio/video stream further comprises the following steps:
mixing the multiple paths of real-time video streams to obtain one path of real-time video stream;
the specific method for carrying out screen mixing processing on the multipath real-time video streams to obtain one path of real-time video stream comprises the following steps:
decoding each of the multiple paths of real-time video streams by adopting a concurrent decoding mode, wherein the concurrent decoding at least comprises two stages of concurrent decoding, one stage is the concurrent decoding of a plurality of real-time stream decoders, the other stage is the concurrent decoding of a Frame level of the decoder, and the real-time video streams are decoded into video Frame sequences;
synthesizing the images of a plurality of video frames by a video synthesis technology to generate a new video frame sequence;
encoding the new video frame sequence through an encoder, outputting a real-time video stream of a target format, and packaging the real-time video stream into a real-time transmission protocol stream;
the specific method for performing video rendering and superposition processing on the full media content to obtain the real-time audio/video stream further comprises the following steps:
after loading the full media content, superposing a DTMF menu style through a dynamic menu technology, rendering a current page, rendering the DTMF menu in real time, and converting the superposed picture into a real-time video stream.
2. The video superposition processing method according to claim 1, wherein the specific method for decoding each of the multiple real-time video streams by adopting a concurrent decoding manner comprises:
processing the multipath real-time video streams by adopting a plurality of decoders to obtain independently output video frames;
storing the independently output video frames as a video frame queue;
according to the pixel proportion of the target output video, calculating the actual size of the video picture occupied by the target picture, performing scaling operation on the video frame, checking the color space format of the image, unifying the color space formats, and adding the processed video frame from the tail of the queue to the video frame queue to be processed.
3. The video superimposition processing method according to claim 2, wherein the specific method for synthesizing the images of the plurality of video frames by the video synthesis technique includes:
accessing a video frame queue to be processed, taking out a video frame from the head of the queue as a reference sequence, keeping the number of the video frames consistent with the reference sequence in a frame inserting or discarding mode for the rest video frame sequence in unit time, and taking the time stamp of the reference sequence as the time stamp of the synthesized video frame sequence.
4. A video overlay processing system based on a 5G VoLTE video teleconference, comprising: an acquisition module, a video superposition module and a terminal module,
the acquisition module is used for acquiring input full media content, and the full media content comprises: camera real-time stream, short video, media live stream and/or on-demand stream;
the video superposition module is used for performing video rendering and superposition processing on the all-media content to obtain a real-time audio and video stream, and inputting the real-time audio and video stream into the internal stream server;
the specific method for performing video rendering and superposition processing on the full media content to obtain the real-time audio/video stream comprises the following steps:
performing resolution scaling, picture rotation and code rate adjustment on the video to obtain an adjusted video;
performing audio format conversion on the audio in the adjusted video to obtain converted audio;
the specific method for performing video rendering and superposition processing on the full media content to obtain the real-time audio/video stream further comprises the following steps:
mixing the multiple paths of real-time video streams to obtain one path of real-time video stream;
the specific method for carrying out screen mixing processing on the multipath real-time video streams to obtain one path of real-time video stream comprises the following steps:
decoding each of the multiple paths of real-time video streams by adopting a concurrent decoding mode, wherein the concurrent decoding at least comprises two stages of concurrent decoding, one stage is the concurrent decoding of a plurality of real-time stream decoders, the other stage is the concurrent decoding of a Frame level of the decoder, and the real-time video streams are decoded into video Frame sequences;
synthesizing the images of a plurality of video frames by a video synthesis technology to generate a new video frame sequence;
encoding the new video frame sequence through an encoder, outputting a real-time video stream of a target format, and packaging the real-time video stream into a real-time transmission protocol stream;
the specific method for performing video rendering and superposition processing on the full media content to obtain the real-time audio/video stream further comprises the following steps:
after loading all media content, superposing the all media content, superposing a DTMF menu style through a dynamic menu technology, rendering a current page, rendering a DTMF menu in real time, and converting the superposed picture into a real-time video stream;
the terminal module is used for receiving the real-time audio and video stream sent by the video superposition module or sending the audio and video data to the video superposition module.
CN202311126945.3A 2023-09-04 2023-09-04 Video superposition processing method based on 5G VoLTE video teleconference Active CN116847128B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202311126945.3A CN116847128B (en) 2023-09-04 2023-09-04 Video superposition processing method based on 5G VoLTE video teleconference

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202311126945.3A CN116847128B (en) 2023-09-04 2023-09-04 Video superposition processing method based on 5G VoLTE video teleconference

Publications (2)

Publication Number Publication Date
CN116847128A CN116847128A (en) 2023-10-03
CN116847128B true CN116847128B (en) 2023-11-28

Family

ID=88171105

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202311126945.3A Active CN116847128B (en) 2023-09-04 2023-09-04 Video superposition processing method based on 5G VoLTE video teleconference

Country Status (1)

Country Link
CN (1) CN116847128B (en)

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108076301A (en) * 2016-11-11 2018-05-25 联芯科技有限公司 The method for processing video frequency and system of VoLTE video multi-party telephones
WO2021143043A1 (en) * 2020-01-16 2021-07-22 腾讯科技(深圳)有限公司 Multi-person instant messaging method, system, apparatus and electronic device
CN113572749A (en) * 2021-07-20 2021-10-29 北京承启通科技有限公司 VoLTE-based video calling method and system
CN113630512A (en) * 2021-08-04 2021-11-09 宁波菊风系统软件有限公司 Rich media call mobile terminal system and use method thereof
CN114339111A (en) * 2020-09-25 2022-04-12 华为技术有限公司 Video call method and device
CN114710821A (en) * 2022-03-15 2022-07-05 上海井星信息科技有限公司 Method, system and storage medium for VoLTE relay to access SIP contact center
CN116017597A (en) * 2022-12-06 2023-04-25 中国工商银行股份有限公司 Interactive session processing method and device based on 5G VoLTE

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9838528B2 (en) * 2014-07-21 2017-12-05 Verizon Patent And Licensing Inc. Voice and video calling over long term evolution-based user interface

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108076301A (en) * 2016-11-11 2018-05-25 联芯科技有限公司 The method for processing video frequency and system of VoLTE video multi-party telephones
WO2021143043A1 (en) * 2020-01-16 2021-07-22 腾讯科技(深圳)有限公司 Multi-person instant messaging method, system, apparatus and electronic device
CN114339111A (en) * 2020-09-25 2022-04-12 华为技术有限公司 Video call method and device
CN113572749A (en) * 2021-07-20 2021-10-29 北京承启通科技有限公司 VoLTE-based video calling method and system
CN113630512A (en) * 2021-08-04 2021-11-09 宁波菊风系统软件有限公司 Rich media call mobile terminal system and use method thereof
CN114710821A (en) * 2022-03-15 2022-07-05 上海井星信息科技有限公司 Method, system and storage medium for VoLTE relay to access SIP contact center
CN116017597A (en) * 2022-12-06 2023-04-25 中国工商银行股份有限公司 Interactive session processing method and device based on 5G VoLTE

Also Published As

Publication number Publication date
CN116847128A (en) 2023-10-03

Similar Documents

Publication Publication Date Title
US10015440B2 (en) Multiple channel communication using multiple cameras
CN108055496B (en) Live broadcasting method and system for video conference
US9300705B2 (en) Methods and systems for interfacing heterogeneous endpoints and web-based media sources in a video conference
EP2863632B1 (en) System and method for real-time adaptation of a conferencing system to current conditions of a conference session
US9781386B2 (en) Virtual multipoint control unit for unified communications
EP3562163B1 (en) Audio-video synthesis method and system
KR101081803B1 (en) Method and system for conducting continuous presence conferences
KR100880150B1 (en) Multi-point video conference system and media processing method thereof
TWI423679B (en) Scalable video coding and decoding
US20110261151A1 (en) Video and audio processing method, multipoint control unit and videoconference system
AU2011305593B2 (en) System and method for the control and management of multipoint conferences
US20080100694A1 (en) Distributed caching for multimedia conference calls
JP6377784B2 (en) A method for one-to-many audio-video streaming with audio-video synchronization capture
EP2965508B1 (en) Video conference virtual endpoints
US20220329883A1 (en) Combining Video Streams in Composite Video Stream with Metadata
CN110943909A (en) Audio and video fusion communication server
CN110602522B (en) Multi-path real-time live webRTC stream synthesis method
WO2010083737A1 (en) Method and apparatus for processing voice signal, method and apparatus for transmitting voice signal
CN101242513A (en) Dual-stream transmission method in video conference and video conference system
CN113194278A (en) Conference control method and device and computer readable storage medium
CN116847128B (en) Video superposition processing method based on 5G VoLTE video teleconference
CN114598853A (en) Video data processing method and device and network side equipment
WO2004015994A1 (en) Method and apparatus of transferring the desktop of pc to video communication terminal
CN115209163A (en) Data processing method, data processing device, storage medium and electronic equipment
EP2290977A1 (en) Server device, communication method, and program

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant