CN102082945A - Method for realizing multi-party video calls, video terminal and system - Google Patents

Publication number
CN102082945A
Authority
CN
China
Prior art keywords
video
terminal
video terminal
image data
image
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN 201110032329
Other languages
Chinese (zh)
Inventor
张明远
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Qingdao Hisense Electronics Co Ltd
Original Assignee
Qingdao Hisense Electronics Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Qingdao Hisense Electronics Co Ltd filed Critical Qingdao Hisense Electronics Co Ltd
Priority to CN 201110032329 priority Critical patent/CN102082945A/en
Publication of CN102082945A publication Critical patent/CN102082945A/en
Pending legal-status Critical Current

Landscapes

  • Telephonic Communication Services (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)

Abstract

The invention relates to the technical field of video processing, and discloses a method for realizing multi-party video calls, a video terminal and a system. The method comprises the following steps that: a local video terminal creates an image display window corresponding to at least one other video terminal in an image layer; image data transmitted by the other video terminal is received through a network, wherein the image data is generated by extracting data frames from video streams of the other video terminal according to a preset frequency; the image data is decoded and the decoded image data is displayed in the image display window corresponding to the other video terminal in the image layer; local video streams are acquired in real time; and the local video streams are decoded and the decoded video is displayed in a video layer of the local video terminal. By the invention, multiple video calls can be realized, and the video call quality is improved under the condition of not increasing the bandwidth.

Description

Method, video terminal and system for realizing multi-party video calls
Technical field
The present invention relates to the technical field of video processing, and in particular to a method, a video terminal and a system for realizing multi-party video calls.
Background
At present, video conferencing is becoming an important means of communication for enterprises to improve operating efficiency and reduce business travel costs. Under the influence of the global financial crisis in particular, video conferencing is becoming the most important means of collaborative work for enterprises seeking to cut travel expenses, and more and more enterprises are beginning to build multi-party communication systems such as video conferencing systems.
In the course of realizing the present invention, the inventor found that the prior art has at least the following problem:
In existing video calls, a video decoder can decode only one or two video streams at the same time, so a call usually takes place between just two clients, as in the video call functions of 3C video-call products and the like.
Summary of the invention
In view of the above problem in the prior art, the embodiments of the present invention provide a method, a video terminal and a system for realizing multi-party video calls, so that a video call can involve multiple parties.
To this end, a first aspect of the embodiments of the present invention provides a method for realizing multi-party video calls, comprising:
creating, by a local video terminal, an image display window corresponding to at least one other video terminal in an image layer;
receiving, through a network, image data sent by the other video terminal, the image data being generated by the other video terminal by extracting data frames from its own video stream at a preset frequency;
decoding the image data, and displaying the decoded picture in the image display window corresponding to the other video terminal in the image layer;
acquiring a local video stream in real time;
decoding the local video stream, and displaying the decoded video in a video layer of the local video terminal.
Preferably, the method further comprises:
generating, by the local video terminal, image data by extracting data frames from the local video stream at a preset frequency;
sending the image data to the other video terminal through the network.
Preferably, generating image data by extracting data frames from the local video stream at a preset frequency comprises:
extracting, by the local video terminal, data frames from the local video stream at the preset frequency to obtain frame images;
compressing the frame images;
extracting one row every predetermined number of rows from the compressed frame images to generate the image data.
Preferably, the method further comprises:
when the local video terminal needs to talk with the other video terminal, initiating a call request according to the IP address of the other video terminal;
receiving voice data sent by the video terminal corresponding to the IP address, thereby realizing the call with the video terminal corresponding to the IP address.
Preferably, creating the image display window corresponding to at least one other video terminal in the image layer comprises:
if the local video terminal needs to talk with a plurality of other video terminals, creating in the image layer a separate image display window for each other video terminal according to the IP address of each other video terminal.
In a second aspect, the embodiments of the present invention provide a video terminal, comprising:
a display window creating unit, configured to create an image display window corresponding to at least one other video terminal in an image layer;
a receiving unit, configured to receive, through a network, image data sent by the other video terminal, the image data being generated by the other video terminal by extracting data frames from its own video stream at a preset frequency;
a picture decoding unit, configured to decode the image data;
an image display unit, configured to display the picture decoded by the picture decoding unit in the image display window corresponding to the other video terminal in the image layer;
a video stream acquiring unit, configured to acquire a local video stream in real time;
a video stream decoding unit, configured to decode the local video stream;
a video display unit, configured to display the video decoded by the video stream decoding unit in a video layer of the local video terminal.
Preferably, the video terminal further comprises:
an image data generation unit, configured to generate image data by extracting data frames from the local video stream at a preset frequency;
a sending unit, configured to send the image data to the other video terminal through the network.
Preferably, the image data generation unit comprises:
a first extraction subunit, configured to extract data frames from the local video stream at the preset frequency to obtain frame images;
a compression subunit, configured to compress the frame images;
a second extraction subunit, configured to extract one row every predetermined number of rows from the frame images compressed by the compression subunit, to generate the image data.
Preferably, the video terminal further comprises:
a call request unit, configured to initiate a call request according to the IP address of the other video terminal when a call with the other video terminal is needed;
a call unit, configured to receive voice data sent by the video terminal corresponding to the IP address, thereby realizing the call with the video terminal corresponding to the IP address.
Preferably, the display window creating unit is specifically configured to, when calls with a plurality of other video terminals are needed, create in the image layer a separate image display window for each other video terminal according to the IP address of each other video terminal.
In a third aspect, the embodiments of the present invention provide a video system, comprising a plurality of video terminals connected through a network, each video terminal comprising:
a display window creating unit, configured to create an image display window corresponding to at least one other video terminal in an image layer;
a receiving unit, configured to receive, through the network, image data sent by the other video terminal, the image data being generated by the other video terminal by extracting data frames from its own video stream at a preset frequency;
a picture decoding unit, configured to decode the image data;
an image display unit, configured to display the picture decoded by the picture decoding unit in the image display window corresponding to the other video terminal in the image layer;
a video stream acquiring unit, configured to acquire a local video stream in real time;
a video stream decoding unit, configured to decode the local video stream;
a video display unit, configured to display the video decoded by the video stream decoding unit in a video layer of the local video terminal.
Preferably, the video terminal further comprises:
an image data generation unit, configured to generate image data by extracting data frames from the local video stream at a preset frequency;
a sending unit, configured to send the image data to the other video terminal through the network.
Preferably, the image data generation unit comprises:
a first extraction subunit, configured to extract data frames from the local video stream at the preset frequency to obtain frame images;
a compression subunit, configured to compress the frame images;
a second extraction subunit, configured to extract one row every predetermined number of rows from the frame images compressed by the compression subunit, to generate the image data.
Preferably, the video terminal further comprises:
a call request unit, configured to initiate a call request according to the IP address of the other video terminal when a call with the other video terminal is needed;
a call unit, configured to receive voice data sent by the video terminal corresponding to the IP address, thereby realizing the call with the video terminal corresponding to the IP address.
Preferably, the display window creating unit is specifically configured to, when calls with a plurality of other video terminals are needed, create in the image layer a separate image display window for each other video terminal according to the IP address of each other video terminal.
With the method, video terminal and system for realizing multi-party video calls provided by the embodiments of the present invention, the local video stream is decoded in real time and displayed in the video layer of the local video terminal. The video streams of the other video terminals, by contrast, are not transmitted directly over the network as in the prior art. Instead, each other video terminal extracts data frames from its own video stream at a preset frequency and generates image data; that is, what the local video terminal receives from the other video terminals is not a video stream but image data generated from that video stream. This not only makes multi-party video calls possible but also greatly reduces the amount of data transmitted over the network during a multi-party call, avoids the effect of limited network bandwidth on video reception and display, and improves video call quality.
Brief description of the drawings
To illustrate the technical solutions of the embodiments of the present application or of the prior art more clearly, the drawings used in the embodiments are briefly introduced below. Obviously, the drawings described below show only some of the embodiments of the present invention, and those of ordinary skill in the art can derive other drawings from them.
Fig. 1 is a schematic diagram of the video windows of a video terminal in an embodiment of the invention;
Fig. 2 is a flow chart of the method for realizing multi-party video calls in an embodiment of the invention;
Fig. 3 is a schematic structural diagram of a video terminal in an embodiment of the invention;
Fig. 4 is another schematic structural diagram of a video terminal in an embodiment of the invention;
Fig. 5 is a flow chart of how the video terminal in an embodiment of the invention processes the local data stream;
Fig. 6 is a flow chart of how the video terminal in an embodiment of the invention processes the image data received from other video terminals.
Detailed description of the embodiments
To help those skilled in the art better understand the solutions of the embodiments of the invention, the embodiments are described in further detail below with reference to the drawings and specific implementations.
The method, video terminal and system for realizing multi-party video calls in the embodiments of the invention are based on a characteristic of human vision: when more than 12 images are presented per second, the human eye perceives the sequence as smooth. The local video stream is decoded in real time and displayed in the video layer of the local video terminal. For the video streams of the other video terminals, each other video terminal extracts data frames from its own video stream at a preset frequency and generates image data; that is, what the local video terminal receives from the other video terminals is not a video stream but image data generated from that video stream. This not only makes multi-party video calls possible but, because only image data generated from the video stream is transmitted, also greatly reduces the amount of data transmitted over the network during a multi-party call. The local video terminal decodes the image data and displays the decoded picture in the corresponding image display window in the image layer. Because the image data is extracted from the video stream of the other video terminal at a certain frequency, the locally decoded picture is also refreshed at a certain frequency, which yields smooth video of the other party.
In the method of the embodiments of the invention, the local video terminal supports a multi-layer display mode and is provided with an image layer and a video layer. The image layer is used to process pictures and text, and the video layer is used to process audio and video data; that is, separate memory is allocated for the different kinds of processing. When data from different layers is displayed, the layers must be superimposed and blended; the specific way of doing so is not limited by the embodiments of the invention.
In the embodiments of the invention, the video images of the other video terminals are displayed in the image layer, and the local video image is displayed in the video layer. One or more other video terminals may be in a call with the local video terminal. When there are several, the local video terminal can display the video image of each other video terminal in a different display window of the image layer, as shown in Fig. 1, while voice is exchanged with one party at a time.
Fig. 2 is a flow chart of the method for realizing multi-party video calls in an embodiment of the invention, which comprises the following steps:
Step 201: the local video terminal creates an image display window corresponding to at least one other video terminal in the image layer.
If the local video terminal needs to talk with a plurality of other video terminals, it can create in the image layer a separate image display window for each other video terminal according to the IP address of each other video terminal.
Step 202: receive, through a network, the image data sent by the other video terminal, the image data being generated by the other video terminal by extracting data frames from its own video stream at a preset frequency.
The network may be the Internet, a dedicated network, or the like.
Step 203: decode the image data, and display the decoded picture in the image display window corresponding to the other video terminal in the image layer.
In a specific application, the local video terminal can use its own image layer display mechanism (such as pixel-by-pixel drawing, or directly displaying image data held in memory in the image layer) and refresh the decoded picture continuously by means of a timer. When more than 15 pictures change per second, the human eye sees a continuous image.
Step 204: acquire the local video stream in real time.
Specifically, the local video stream can be obtained directly from the local camera.
Step 205: decode the local video stream, and display the decoded video in the video layer of the local video terminal.
Video streams and pictures are encoded and decoded in different ways, and video stream codecs are more complex and computationally heavier than picture codecs. The method of the embodiments of the invention therefore applies video decoding to the local video stream only, and obtains and displays the video images of the other video terminals in the call by picture decoding, which greatly reduces both the amount of transmitted data and the amount of decoding computation. Specifically, existing decoding methods can be used both for decoding the local video stream and for decoding the image data; the embodiments of the invention place no limitation on this.
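The receiving side of steps 201 to 203 can be sketched as a small registry that keeps one image display window per remote IP address and routes each decoded picture to the window of its sender. This is only an illustrative sketch: the class names `ImageWindow` and `ImageLayer`, the left-to-right layout and the example addresses are assumptions that do not appear in the source, and the 300*240 window size is borrowed from the numerical example given later in the description.

```python
from dataclasses import dataclass, field


@dataclass
class ImageWindow:
    """One display window in the image layer, bound to a remote terminal."""
    ip: str
    x: int
    y: int
    width: int = 300    # assumed default, taken from the 300*240 example
    height: int = 240
    picture: list = field(default_factory=list)  # last decoded picture (rows of pixels)


class ImageLayer:
    """Hypothetical image layer that keeps one window per remote IP address."""

    def __init__(self):
        self.windows = {}

    def create_window(self, ip):
        # Step 201: one window per remote terminal, laid out left to right.
        if ip not in self.windows:
            index = len(self.windows)
            self.windows[ip] = ImageWindow(ip=ip, x=index * 300, y=0)
        return self.windows[ip]

    def show(self, ip, decoded_picture):
        # Step 203: route a decoded picture to the window of its sender.
        self.windows[ip].picture = decoded_picture


layer = ImageLayer()
for remote_ip in ["192.0.2.1", "192.0.2.2"]:   # documentation-range example addresses
    layer.create_window(remote_ip)
layer.show("192.0.2.2", [[0, 0], [255, 255]])
```

Keying the windows by IP address mirrors the way the description identifies each remote terminal; a real terminal would also need window placement and blending with the video layer, which the source leaves open.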
In addition, for video calls on televisions there is no server support, so the processing of video pictures can only be done by the client itself. Moreover, because of bandwidth limits, two-party call video in the prior art is often not smooth enough and its quality does not meet requirements. The method of the embodiments of the invention replaces video stream data with image data, which greatly reduces both the amount of transmitted data and the amount of decoding computation. Under bandwidth conditions comparable to those of the prior art, smooth call video can still be obtained and picture quality is ensured.
It should be noted that in the method of the embodiments of the invention, the local video terminal processes the local video stream and the image data received from other video terminals concurrently; that is, there is no required order in time between steps 201 to 203 and steps 204 to 205.
In addition, in the embodiments of the invention, the local video terminal can also actively initiate voice interaction with any one of the other video terminals. The detailed process is as follows:
When the local video terminal needs to talk with the other video terminal, it initiates a call request according to the IP address of the other video terminal;
It receives the voice data sent by the video terminal corresponding to the IP address, thereby realizing the call with the video terminal corresponding to the IP address.
Likewise, after receiving a call request initiated by another video terminal, the local video terminal, if it accepts the request, sends voice data to the requesting video terminal to realize voice interaction.
In addition, so that the users of the other video terminals can see the video image of the local video terminal, the embodiments of the invention may further comprise the following step:
The local video terminal generates image data by extracting data frames from the local video stream at a preset frequency, and sends the image data to the other video terminals through the network.
Specifically, to further reduce the bandwidth needed to transmit the image data, the image data can be generated by the following process:
(1) The local video terminal extracts data frames from the local video stream at the preset frequency to obtain frame images;
(2) The frame images are compressed;
Specifically, the frame images can be compressed according to the size of an image display window agreed in advance, that is, each image is scaled down to a picture of the image display window size, to further reduce the amount of image data;
(3) The compressed picture is thinned according to a row-thinning algorithm, that is, one row is extracted every predetermined number of rows from the compressed frame images to generate the image data.
In this way, the data volume of the picture can be reduced effectively.
Correspondingly, at display time the picture can be restored by applying the inverse of the row-thinning algorithm.
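The row-thinning step (3) and its inverse at display time can be illustrated with a short sketch. The source does not spell out the algorithm, so the sketch makes two assumptions: that every `interval`-th row is removed from the transmitted picture (which matches the 99/100 factor used in the numerical example of the description), and that the receiver fills each missing row by repeating the row before it. The function names `drop_rows` and `restore_rows` are hypothetical, and `interval` is assumed to be greater than 1.

```python
def drop_rows(rows, interval):
    """Remove every `interval`-th row (1-based) and return the remaining rows."""
    return [row for i, row in enumerate(rows, start=1) if i % interval != 0]


def restore_rows(kept, interval, original_height):
    """Rebuild a picture of `original_height` rows by repeating the row that
    precedes each removed position (a simple stand-in for interpolation)."""
    restored = []
    source = iter(kept)
    for i in range(1, original_height + 1):
        if i % interval == 0 and restored:
            restored.append(list(restored[-1]))   # fill the gap with the previous row
        else:
            restored.append(list(next(source)))
    return restored


picture = [[r] * 4 for r in range(10)]       # 10 rows of 4 pixels; row r holds value r
sent = drop_rows(picture, interval=5)         # rows 5 and 10 are not transmitted
shown = restore_rows(sent, interval=5, original_height=10)
```

Repeating the previous row is the cheapest possible restoration; averaging the two neighbouring rows would be a natural alternative if the terminal has computation to spare.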
As an example of the method of the embodiments of the invention, suppose there are n video terminals, each recording video with a camera of 720p resolution, that the image display window for each video is 300*240, and that the row-thinning ratio is 1/100, that is, one row is extracted from every 100 rows of the frame image to obtain the image data.
Suppose the network transmission bandwidth is 10 Mb/s. The number of video frames that can be sent per second is then 10M / ((1280*720)/8), where 10M is the network bandwidth and (1280*720)/8 is the data size of one frame at 720p resolution. With the method of the embodiments of the invention, 10M / (((300*240)/8) * (99/100)) pictures can be transmitted per second, where 10M is the network bandwidth, (300*240)/8 is the data size of a picture 300 wide and 240 high, and ((300*240)/8) * (99/100) is the data size after row thinning. The image data is thus reduced by about 15 times in data volume.
Therefore, with the bandwidth that gives smooth video in a one-to-one call, the method of the embodiments of the invention can support a one-to-15 video call.
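The arithmetic above can be checked by evaluating the two expressions exactly as they are written. In this sketch the bandwidth is treated as the plain number 10,000,000 and the variable names are illustrative; no assumption is made beyond the expressions themselves.

```python
BANDWIDTH = 10_000_000                           # "10M" in the text

full_frame = (1280 * 720) / 8                    # one 720p frame, as written in the text
window_picture = (300 * 240) / 8                 # picture scaled to the 300*240 window
thinned_picture = window_picture * (99 / 100)    # after row thinning, per the text's factor

frames_per_second = BANDWIDTH / full_frame       # full video frames per second
pictures_per_second = BANDWIDTH / thinned_picture
ratio = full_frame / thinned_picture             # how much smaller one picture is

print(f"full frames/s: {frames_per_second:.1f}")
print(f"pictures/s:    {pictures_per_second:.1f}")
print(f"size ratio:    {ratio:.1f}")
```

Evaluated this way, the per-picture size ratio comes out at roughly 12.9, which is in the same range as the figure of about 15 cited in the text; almost all of the saving comes from scaling to the window size rather than from the row thinning.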
It should be noted that, in practice, a call between the local video terminal and another video terminal can be realized by reading that terminal's voice data according to the identifier of the video object in the corresponding image display window. Specifically, every video terminal sends its own video picture data and audio data. The audio data is transmitted over TCP (Transmission Control Protocol)/IP (Internet Protocol) and, like the image data, carries the unique identifier of its video object.
On each video terminal, the user can decide whether to accept and parse another party's audio data by clicking the window of any video object. For example, in the video windows shown in Fig. 1, clicking the IP1 video window starts the reception and parsing of the audio data of that video object; once the audio data is received, it is decoded locally and played. Clicking the IP2 video window stops the reception and parsing of the audio data of the IP1 video window and starts the reception and parsing of the audio data of the IP2 window. Because audio data and image data are always transmitted over the network at the same time and are received and decoded separately by the video terminal, selecting the audio of a particular video window does not affect the reception and parsing of video pictures, which ensures a smooth audio and video call.
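The window-click behaviour described above amounts to a small piece of state: at most one remote terminal is selected for audio at any time, while image data from every terminal continues to be accepted. A minimal sketch follows, in which the class name `AudioSelector`, its methods and the packet kinds `"audio"` and `"picture"` are hypothetical names chosen for illustration.

```python
class AudioSelector:
    """Tracks which remote terminal's audio is currently received and decoded."""

    def __init__(self):
        self.active_ip = None

    def click_window(self, ip):
        # Clicking a window switches audio to that terminal and returns the
        # previously selected terminal (None if there was none).
        previous, self.active_ip = self.active_ip, ip
        return previous

    def accept_packet(self, sender_ip, kind):
        # Image data is always accepted; audio only from the selected terminal.
        if kind == "picture":
            return True
        return kind == "audio" and sender_ip == self.active_ip


selector = AudioSelector()
selector.click_window("IP1")
selector.click_window("IP2")    # audio from IP1 is no longer parsed
```

Filtering on the sender identifier is what allows audio selection to change without touching the picture path, as the description requires.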
From the above description of the embodiments, those skilled in the art can clearly understand that the embodiments of the invention can be implemented by software together with the necessary general-purpose hardware platform. On this understanding, the essence of the technical solution of the present invention, or the part of it that contributes over the prior art, can be embodied in the form of a software product. The computer software product can be stored in a storage medium such as ROM/RAM, a magnetic disk or an optical disc, and includes a number of instructions that cause a computer device (which may be a personal computer, a server, a network device or the like) to execute the methods described in the embodiments of the present invention or in parts of the embodiments.
Correspondingly, the embodiments of the invention also provide a video terminal. Fig. 3 is a schematic structural diagram of this video terminal.
In this embodiment, the video terminal 300 comprises:
a display window creating unit 301, configured to create an image display window corresponding to at least one other video terminal in the image layer;
a receiving unit 302, configured to receive, through a network, image data sent by the other video terminal, the image data being generated by the other video terminal by extracting data frames from its own video stream at a preset frequency;
a picture decoding unit 303, configured to decode the image data;
an image display unit 304, configured to display the picture decoded by the picture decoding unit 303 in the image display window corresponding to the other video terminal in the image layer;
a video stream acquiring unit 305, configured to acquire the local video stream in real time;
a video stream decoding unit 306, configured to decode the local video stream;
a video display unit 307, configured to display the video decoded by the video stream decoding unit 306 in the video layer of the local video terminal.
It should be noted that when the video terminal 300 needs to talk with a plurality of other video terminals, the display window creating unit 301 can create in the image layer a separate image display window for each other video terminal according to the IP address of each other video terminal.
The video terminal of the embodiments of the invention applies video decoding to the local video stream, and obtains and displays the video images of the other video terminals in the call by picture decoding, which greatly reduces both the amount of transmitted data and the amount of decoding computation. Specifically, existing decoding methods can be used both for decoding the local video stream and for decoding the image data; the embodiments of the invention place no limitation on this.
Fig. 4 is another schematic structural diagram of the video terminal.
Unlike the embodiment shown in Fig. 3, in this embodiment the video terminal 400 further comprises:
an image data generation unit 401, configured to generate image data by extracting data frames from the local video stream at a preset frequency;
a sending unit 402, configured to send the image data to the other video terminals through the network.
In the embodiments of the invention, the image data generation unit 401 may comprise:
a first extraction subunit, configured to extract data frames from the local video stream at the preset frequency to obtain frame images;
a compression subunit, configured to compress the frame images;
a second extraction subunit, configured to extract one row every predetermined number of rows from the frame images compressed by the compression subunit, to generate the image data.
In practice, the image data generation unit 401 is not limited to the above implementation. Other implementations are possible; for example, the compressed frame images can be used directly as the image data sent to the other video terminals, although the required transmission bandwidth is then relatively large.
With the video terminal of the embodiments of the invention, video decoding is applied to the local video stream, and the video images of the other video terminals in the call are obtained and displayed by picture decoding. The video terminal also generates image data from the local video stream and sends it to the other video terminals. Multi-party video calls are thereby realized, the amount of transmitted data and of decoding computation is greatly reduced, and the quality of video transmission and display is ensured.
It should be noted that the video terminal of the embodiments of the invention can also actively initiate voice interaction with any one of the other video terminals. Correspondingly, in the embodiments shown in Fig. 3 and Fig. 4, the video terminal may further comprise a call request unit and a call unit (not shown), wherein:
the call request unit is configured to initiate a call request according to the IP address of the other video terminal when a call with the other video terminal is needed;
the call unit is configured to receive the voice data sent by the video terminal corresponding to the IP address, thereby realizing the call with the video terminal corresponding to the IP address.
Likewise, after receiving a call request initiated by another video terminal, the video terminal, if it accepts the request, sends voice data to the requesting video terminal through the call unit to realize voice interaction.
The apparatus embodiments described above are merely illustrative. The units described as separate components may or may not be physically separate, and the components shown as units may or may not be physical units; that is, they may be located in one place or distributed over a plurality of network elements. Some or all of the modules can be selected according to actual needs to achieve the purpose of the solution of the embodiment. Those of ordinary skill in the art can understand and implement this without creative effort.
The processing of the local data stream and of the image data of other video terminals by the video terminal of this embodiment of the invention is further described below with reference to Fig. 4.
As shown in Fig. 5, the video terminal of this embodiment processes the local data stream through the following steps:
Step 501: the video stream acquiring unit acquires the local video stream;
Step 502: the video stream decoding unit decodes the local video stream and displays it in the video layer;
Step 503: the first extraction subunit performs video stream extraction, that is, it extracts data frames from the local video stream at a preset frequency to obtain frame images;
Step 504: the compression subunit performs image compression, that is, it compresses the frame images according to the predefined image display window size;
Step 505: the second extraction subunit performs row extraction on the image, that is, it extracts one row out of every predetermined number of rows from the compressed frame images, and the extracted rows form the image data;
Step 506: the sending unit encapsulates the image data according to the network transmission protocol and sends it.
By repeating steps 503 to 506, the image data formed from the local video stream can be sent to the other video terminals continuously, so that the other video terminals can display the video image of the local video terminal in real time.
It should be noted that steps 503 to 506 are performed in parallel with step 502.
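Steps 503 to 506 can be sketched as a small pipeline. This is an illustrative reading, not the patented implementation: frames are modelled as lists of rows of 8-bit grayscale pixels, "compression according to the window size" is interpreted as nearest-neighbour downscaling, and the 8-byte packet header (sequence number, width, height) is a hypothetical stand-in for the unspecified network transmission protocol. All function names and default parameters are assumptions.

```python
import struct

def extract_frames(frames, keep_every):
    """Step 503: keep one frame out of every `keep_every` frames."""
    return frames[::keep_every]

def compress_to_window(frame, win_w, win_h):
    """Step 504 (assumed reading): nearest-neighbour downscale to window size."""
    src_h, src_w = len(frame), len(frame[0])
    return [
        [frame[y * src_h // win_h][x * src_w // win_w] for x in range(win_w)]
        for y in range(win_h)
    ]

def decimate_rows(image, row_step):
    """Step 505: keep one row out of every `row_step` rows."""
    return image[::row_step]

def packetize(image, seq):
    """Step 506: hypothetical header (seq, width, height) plus raw pixels."""
    height, width = len(image), len(image[0])
    payload = bytes(pixel for row in image for pixel in row)
    return struct.pack("!IHH", seq, width, height) + payload

def build_picture_packets(frames, keep_every=5, win_w=4, win_h=4, row_step=2):
    """Run steps 503 to 506 over a list of frames and return the packets."""
    packets = []
    for seq, frame in enumerate(extract_frames(frames, keep_every)):
        small = compress_to_window(frame, win_w, win_h)
        packets.append(packetize(decimate_rows(small, row_step), seq))
    return packets

# Ten synthetic 8x8 frames; frame i is filled with pixel value i.
frames = [[[i] * 8 for _ in range(8)] for i in range(10)]
packets = build_picture_packets(frames)
```

With these assumed settings, frames 0 and 5 are kept, each is reduced to a 4x4 image, and two of its four rows are sent, so every packet carries 8 header bytes and 8 pixel bytes.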
As shown in Fig. 6, the video terminal of this embodiment processes the image data received from other video terminals through the following steps:
Step 601: a timer is set, for example with a timing period of 50 ms;
Step 602: the receiving unit receives network data through the network and, if the received network data is video data, extracts the image data from it;
Step 603: the picture decoding unit decodes the image data;
Step 604: it is determined whether the timer has expired; if so, step 605 is executed; otherwise, step 601 is executed;
Step 605: the image display unit displays the decoded picture in the corresponding display window of the image layer.
By repeating steps 604 to 605, the decoded picture is refreshed at the interval set by the timer, that is, at a fixed frequency, so that a continuous sequence of images is obtained. In other words, the video image of the other video terminal is reconstructed at the local video terminal.
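The timer-driven refresh of steps 601 to 605 can be sketched as follows. This is a simplified model under stated assumptions: time is passed in explicitly as integer milliseconds so the logic can be exercised without a real clock, the 8-byte packet header (sequence number, width, height) is hypothetical, and the `PictureWindow` class and `make_packet` helper are illustrative names, not part of the patent.

```python
import struct

REFRESH_INTERVAL_MS = 50  # the example timer value given in step 601

def make_packet(seq, rows):
    """Helper for the demo: hypothetical header plus raw 8-bit pixels."""
    payload = bytes(pixel for row in rows for pixel in row)
    return struct.pack("!IHH", seq, len(rows[0]), len(rows)) + payload

def decode_picture(packet):
    """Step 603: unpack the header and rebuild the pixel rows."""
    seq, width, height = struct.unpack("!IHH", packet[:8])
    pixels = packet[8:]
    rows = [list(pixels[r * width:(r + 1) * width]) for r in range(height)]
    return seq, rows

class PictureWindow:
    """Display window for one remote terminal, refreshed on a timer."""

    def __init__(self, now_ms):
        self.deadline_ms = now_ms + REFRESH_INTERVAL_MS  # step 601
        self.latest = None
        self.shown = []  # sequence numbers of pictures actually displayed

    def on_packet(self, packet):
        """Steps 602 and 603: take in image data and decode it."""
        self.latest = decode_picture(packet)

    def tick(self, now_ms):
        """Steps 604 and 605: display the latest picture once the timer expires."""
        if now_ms < self.deadline_ms or self.latest is None:
            return False
        self.shown.append(self.latest[0])
        self.deadline_ms = now_ms + REFRESH_INTERVAL_MS  # re-arm the timer
        return True

window = PictureWindow(now_ms=0)
window.on_packet(make_packet(0, [[1, 2], [3, 4]]))
early = window.tick(20)      # timer not yet expired, nothing is shown
window.on_packet(make_packet(1, [[5, 6], [7, 8]]))
on_time = window.tick(50)    # timer expired, the newest picture is shown
```

In this sketch a picture that arrives and is superseded before the timer expires is never displayed, which is one possible way to keep the refresh rate fixed regardless of the arrival rate.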
Correspondingly, an embodiment of the invention also provides a video system comprising a plurality of video terminals connected through a network; for the specific structure of each video terminal, refer to the foregoing embodiments.
The embodiments in this specification are described in a progressive manner. For identical or similar parts, the embodiments may be referred to one another, and each embodiment focuses on its differences from the others. In particular, because the system embodiment is substantially similar to the method embodiment, it is described relatively briefly; for the relevant parts, refer to the corresponding description of the method and apparatus embodiments.
The embodiments of the invention have been described in detail above, and specific examples have been used herein to set forth the invention. The description of the above embodiments is only intended to help in understanding the method and apparatus of the invention. Meanwhile, those of ordinary skill in the art may, following the idea of the invention, make changes to the specific implementations and the scope of application. In summary, the content of this specification should not be construed as limiting the invention.

Claims (11)

1. A method for realizing a multi-party video call, characterized by comprising:
A local video terminal creates, in an image layer, an image display window corresponding to at least one other video terminal;
Receiving, through a network, image data sent by the other video terminal, the image data being generated by the other video terminal by extracting data frames from its own video stream at a preset frequency;
Decoding the image data, and displaying the decoded picture in the image display window corresponding to the other video terminal in the image layer;
Acquiring a local video stream in real time;
Decoding the local video stream, and displaying the decoded video in a video layer of the local video terminal.
2. The method according to claim 1, characterized in that the method further comprises:
The local video terminal generates image data by extracting data frames from the local video stream at a preset frequency;
Sending the image data to the other video terminal through the network.
3. The method according to claim 2, characterized in that the local video terminal generating image data by extracting data frames from the local video stream at a preset frequency comprises:
The local video terminal extracts data frames from the local video stream at a preset frequency to obtain frame images;
Compressing the frame images;
Extracting one row out of every predetermined number of rows from the compressed frame images to generate the image data.
4. The method according to claim 1, 2 or 3, characterized in that the method further comprises:
When the local video terminal needs to conduct a call with the other video terminal, initiating a call request according to the IP address of the other video terminal;
Receiving the voice data sent by the video terminal corresponding to the IP address, thereby realizing the call with the video terminal corresponding to the IP address.
5. The method according to claim 1, characterized in that the local video terminal creating, in the image layer, an image display window corresponding to at least one other video terminal comprises:
If the local video terminal needs to conduct a call with a plurality of other video terminals, creating in the image layer, according to the IP address of each other video terminal, an image display window corresponding to each other video terminal.
6. A video terminal, characterized by comprising:
A display window creating unit, configured to create, in an image layer, an image display window corresponding to at least one other video terminal;
A receiving unit, configured to receive, through a network, image data sent by the other video terminal, the image data being generated by the other video terminal by extracting data frames from its own video stream at a preset frequency;
A picture decoding unit, configured to decode the image data;
An image display unit, configured to display the picture decoded by the picture decoding unit in the image display window corresponding to the other video terminal in the image layer;
A video stream acquiring unit, configured to acquire a local video stream in real time;
A video stream decoding unit, configured to decode the local video stream;
A video display unit, configured to display the video decoded by the video stream decoding unit in a video layer of the local video terminal.
7. The video terminal according to claim 6, characterized in that the video terminal further comprises:
An image data generation unit, configured to generate image data by extracting data frames from the local video stream at a preset frequency;
A sending unit, configured to send the image data to the other video terminal through the network.
8. The video terminal according to claim 7, characterized in that the image data generation unit comprises:
A first extraction subunit, configured to extract data frames from the local video stream at a preset frequency to obtain frame images;
A compression subunit, configured to compress the frame images;
A second extraction subunit, configured to extract one row out of every predetermined number of rows from the frame images compressed by the compression subunit to generate the image data.
9. The video terminal according to claim 6, 7 or 8, characterized in that the video terminal further comprises:
A call request unit, configured to initiate a call request according to the IP address of the other video terminal when a call with the other video terminal is needed;
A telephony unit, configured to receive the voice data sent by the video terminal corresponding to the IP address, thereby realizing the call with the video terminal corresponding to the IP address.
10. The video terminal according to claim 6, characterized in that
The display window creating unit is specifically configured to, when a call with a plurality of other video terminals is needed, create in the image layer, according to the IP address of each other video terminal, an image display window corresponding to each other video terminal.
11. A video system, characterized by comprising a plurality of video terminals connected through a network, each video terminal being the video terminal according to any one of claims 6 to 10.
CN 201110032329 2011-01-26 2011-01-26 Method for realizing multi-party video calls, video terminal and system Pending CN102082945A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN 201110032329 CN102082945A (en) 2011-01-26 2011-01-26 Method for realizing multi-party video calls, video terminal and system

Publications (1)

Publication Number Publication Date
CN102082945A true CN102082945A (en) 2011-06-01

Family

ID=44088674

Family Applications (1)

Application Number Title Priority Date Filing Date
CN 201110032329 Pending CN102082945A (en) 2011-01-26 2011-01-26 Method for realizing multi-party video calls, video terminal and system

Country Status (1)

Country Link
CN (1) CN102082945A (en)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1972431A (en) * 2006-12-14 2007-05-30 北京中星微电子有限公司 A video conference image processing system
CN101772958A (en) * 2007-07-31 2010-07-07 惠普开发有限公司 Video conferencing system and method
CN101867825A (en) * 2010-06-25 2010-10-20 中国传媒大学 Device for circularly monitoring multi-channel video and method thereof

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105407408A (en) * 2014-09-11 2016-03-16 腾讯科技(深圳)有限公司 Method for realizing multiplayer voice and video communication on mobile terminal and mobile terminal
CN105407408B (en) * 2014-09-11 2019-08-16 腾讯科技(深圳)有限公司 A kind of method and mobile terminal for realizing more people's audio-videos in mobile terminal
CN104980817A (en) * 2015-07-13 2015-10-14 无锡天脉聚源传媒科技有限公司 Video stream frame extraction method and device
CN105681731A (en) * 2016-03-04 2016-06-15 福建星网智慧科技股份有限公司 Multi-media-terminal-based method and system for realizing four-party video conference
CN109257615A (en) * 2018-09-25 2019-01-22 视联动力信息技术股份有限公司 A kind of method and apparatus that net cast is shown
WO2020063087A1 (en) * 2018-09-28 2020-04-02 华为技术有限公司 Display method and device

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C02 Deemed withdrawal of patent application after publication (patent law 2001)
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20110601