Embodiment
The invention provides a kind of new technical scheme is applied to be used in the video conferencing system multimedia communication is controlled, exactly, be that the parameter specification index of vision signal that terminal sends is controlled, make terminal before sending vision signal, compress it earlier, preferably, at least can adopt according to resolution and/or image figure place index and compress, and preferably, control method provided by the invention is directly with every road video signal compression point control unit (MCU at the most, Multi-point Control Unit) needed specification, thus bandwidth saved effectively.Therefore, the present invention realizes the redundancy of vision signal is cut to minimum when guaranteeing the video conference quality, like this, can reduce the computation burden of multipoint control unit on picture decoding and image compression link effectively.
Particularly, Fig. 1 illustrates according to a specific embodiment of the present invention, is used to control the network topological diagram of the compound video conferencing system of many pictures.In this embodiment, video conferencing system comprises multipoint control unit 3, and wherein, described multipoint control unit 3 is used for controlling the video conference of multiple spot, it receives the vision signal of self terminal 9 and handles, and then the processed video signal is sent to described terminal 9.It will be appreciated by those skilled in the art that described terminal 9 comprises the video capturing device (not shown in figure 1) usually, wherein, described video capturing device can be camera or video frequency collection card.Described terminal 9 will send to described multipoint control unit 3 by the vision signal that above-mentioned video capturing device captures, and receive the vision signal from described multipoint control unit 3, and preferably such vision signal be shown by a display terminal.More specifically, in the prior art, terminal 91, terminal 92, the video signal coding that terminal 93 and terminal 94 (not shown in figure 1)s will capture separately becomes multimedia code stream, and this multimedia code stream is sent to described multipoint control unit 3 by network, then, described multipoint control unit 3 is decoded to the multichannel multimedia code stream that receives respectively, respectively the image that decoding obtains is compressed according to the image composite mode then, and carry out compound to the image after the compression, at last combination picture is sent to described terminal 91 after encoded, terminal 92, terminal 93 and terminal 94, like this in terminal 91, terminal 92, show identical or different image respectively on terminal 93 and the terminal 94, thereby realize video conference.
For example, the resolution of the display device of described terminal 91 is 1920 * 1200, described terminal 92, the resolution of the display device of terminal 93 and terminal 94 is 1024 * 768, and the resolution of the image that above-mentioned four terminals capture all is 1920 * 1200, the image composite mode as shown in Figure 3, be about to image that above-mentioned 4 terminals capture and be combined into as shown in Figure 3 combination picture, wherein, little image 43 to 46 in this image to be sent is corresponding with described terminal 91 to 94 respectively, in order to satisfy the definition of all terminal pictures, the resolution of above-mentioned little image 43 to 46 is 960 * 600, at this moment, in the prior art, the resolution that each terminal sends to the image of described multipoint control unit 3 all is 1920 * 1200, therefore, described multipoint control unit 3 must be compressed to 960 * 600 from 1920 * 1200 with the resolution of the multiway images that receives before combination picture, and, though only need reaching 960 * 600, the resolution of little image just can guarantee definition, but it still is 1920 * 1200 that described terminal 91 to 94 sends to the resolution of the image of described multipoint control unit 3, described like this multipoint control unit 3 must be that 1920 * 1200 multimedia code stream is decoded to resolution, rather than be that 960 * 600 multimedia code stream is decoded to resolution, this means that amount of calculation will increase greatly, thereby described multipoint control unit 3 (for example has to comprise a plurality of high-performance digital signal processors, DSP) finish huge computation burden, cause the cost of described multipoint control unit 3 very high.
And in this embodiment, described multipoint control unit 3 is according to the resolution of above-mentioned little image institute actual needs, the requesting terminal sends the image of corresponding resolution, for example, described multipoint control unit 3 requests (or indication) terminal transmission resolution is 960 * 600 image, rather than resolution is 1920 * 1200 image, the amount of calculation that can significantly reduce like this that 3 pairs of multimedia code streams of described multipoint control unit are decoded and the image that receives is compressed, thus reduce cost.
Further, those skilled in the art understand, communicate by network between described multipoint control unit 3 and the terminal 9, wherein, this network can be wired and/or wireless network, and for example, described multipoint control unit 3 can communicate by the packet switching network with terminal 9, also can communicate, can also communicate by mobile communications network by repeater satellite.Those skilled in the art can not repeat them here in conjunction with the above-mentioned network of existing techniques in realizing.
Those skilled in the art understand, the function of multipoint control unit in video conference comprises image complex function and image switching function at least, in this embodiment, described multipoint control unit 3 is used for realizing the image complex function, can reduce the amount of calculation on the link in decoding and compression by technical scheme of the present invention.And in a variation example of this embodiment, described multipoint control unit can be used for realizing the image switching function, for example, image correspondingly switches to a back spokesman from previous spokesman when the spokesman changes, again for example, the chairman can reach the purpose of making an inspection tour the meeting-place by the image that switches each sub-venue, at this moment, preferably, the image compression link can be omitted, but still can reduce in the amount of calculation of decoding on the link by technical scheme of the present invention, such variation example still can be achieved by technical scheme provided by the invention, does not influence flesh and blood of the present invention.
Fig. 1 is described the network topology structure that is used to control the compound video conferencing system of many pictures, and Fig. 2 is described the control method that specifically is applied to each equipment room in the video conferencing system and carries out multimedia communication.Particularly, Fig. 2 illustrates according to first embodiment of the invention, the flow chart of the control method of in the control appliance of video conferencing system multimedia communication being handled.
In the present embodiment, execution in step S210 at first obtains the graphic attribute of each terminal respectively; Enter step S211 then, obtain the image composite mode of each terminal respectively; Next execution in step S212 determines the only parameter specification index corresponding with this terminal according to the graphic attribute of described each terminal and corresponding with it image composite mode.Preferably, described graphic attribute is meant that terminal presentation facility supports the resolution and the image figure place of (or be provided with), for example, in Windows XP operating system, choosing " graphic attribute " can open " graphic attribute " then in the desktop right click is provided with window, resolution and image figure place can be set in this window.
Particularly, this step can realize based on the H323 agreement, or rather, can realize by the H.245 agreement of expanding under the agreement H.323, wherein, H.245 agreement is control protocol for multimedia communication (Control Protocol for Multimedia Communication), and H.323 this agreement is providing signaling mechanism in the agreement.
It will be appreciated by those skilled in the art that for the ease of agreement is expanded H.245 agreement has kept the leeway of expansion when definition.For example, H.245 agreement has adopted the mode of extending marking (Extension Marker), has promptly all increased in many message structures " ... ", and Kuo Zhan message can be placed on after the extending marking like this.Therefore, in order to obtain the graphic attribute of terminal, can be with described graphic attribute as the parameter in terminal capabilities set (Terminal Capability Set) request (Request) message, and send this terminal capabilities set request message by terminal.
More specifically, at first setting up the required network of video conference makes and sets up physical connection between the equipment, open the special logic passage (Logical Channel) of the control of signaling H.245 then, this logical channel is the logical channel of independently, separating with all medium types, i.e. a LC0.After logical channel LC0 sets up, multipoint control unit sends the principal and subordinate to each terminal and determines (Master Slave Determination) request message, and each terminal sends the principal and subordinate to multipoint control unit and determines to confirm that (Master Slave Determination Acknowledge) response (Response) message is to finish principal and subordinate's deterministic process.Then, each terminal sends terminal capabilities set request message to multipoint control unit, wherein, this terminal capabilities set comprises resolution and image figure place, be about to above-mentioned resolution and image figure place and send to described multipoint control unit as the concentrated parameter of this terminal capability, described then multipoint control unit is concentrated from this terminal capability according to the agreement of making an appointment and is obtained graphic attribute, and sends terminal capabilities set confirmation (Terminal Capability Set Acknowledge) response message to finish capabilities exchange (Capability Exchange) process to each terminal.
In the present embodiment, except the graphic attribute that obtains each terminal, also to obtain the image composite mode of each terminal respectively, wherein, described image composite mode has been indicated by compound image shared size in combination picture, for example, Fig. 3 and Fig. 4 are respectively two kinds of different composite modes.Particularly, can set described image composite mode by computer or remote controller indication multipoint control unit, for example, video conference has 5 meeting-place, then can control multipoint control unit by remote controller the terminal that the image composite mode is set at each meeting-place is shown the image that the image by all the other 4 meeting-place as shown in Figure 3 is composited; Can also set described image composite mode by the chairman, for example, one token is set in video conferencing system, wherein, the user of the terminal at token place is the chairman, and the chairman can carry out compound to main meeting-place image and spokesman's image according to image composite mode shown in Figure 3.Those skilled in the art understand, described image composite mode can be dynamic, therefore can obtain described image composite mode in advance according to actual conditions before video conference begins, can obtain described image composite mode at any time in meeting yet, this does not influence flesh and blood of the present invention.
After graphic attribute that obtains each terminal respectively and image composite mode, further, obtain according to the graphic attribute of described each terminal and corresponding with it image composite mode and determine the only parameter specification index corresponding with this terminal.In the present embodiment, be used to refer to the required minimum parameter specification index of combination picture by obtaining described image attributes, and just can be used to refer to the required minimum parameter specification index of the medium and small image of this combination picture in conjunction with the described image composite mode that obtains again, wherein, described minimum parameter specification index is meant that this parameter specification index will satisfy the video conference user to the minimum quality requirement of image, this minimum parameter specification index can be defined as described only parameter specification index then.
For example, if each terminal is supported the image figure place difference of (or setting), have 8,16 and 24, and the image that captures is 32, at this moment, for each terminal, 24 bit images and 32 bit images can both satisfy the quality requirement of each terminal, and wherein, the quality of 24 bit images will be lower than the quality of 32 bit images, therefore, 24 bit images are defined as described only parameter specification index.It will be appreciated by those skilled in the art that when described parameter specification index only comprises the figure place of image can omit " the image composite mode that obtains each terminal respectively ", this does not influence flesh and blood of the present invention.
Can determine the only parameter specification index of each terminal by step S210~S212, and in step S213, determine described form control signaling according to the only parameter specification index of described each terminal.In the present embodiment, described form control signaling is determined by the only parameter specification index of described each terminal, particularly, H.245 protocol definition non-standard message (Non standard Message) structure realize non-standard control signaling, wherein, this non-standard message structure is defined as:
NonStandardMessage ::=SEQUENCE
{
nonStandardData NonStandardParameter,
}
NonStandardParameter ::=SEQUENCE
{
nonStandardIdentifier NonStandardIdentifier,
data OCTET?STRING
}
According to above-mentioned definition, can define order (Command) message as described form control signaling, with the parameter of described only parameter specification index as this command messages, wherein, this command messages requires terminal to finish the action of appointment, promptly sends vision signal with described only parameter specification index.
At last, execution in step S214, send form control signaling to a plurality of terminals respectively, wherein, described form control signaling is used to indicate this terminal to send the parameter specification index of vision signal to described control appliance, it is characterized in that it is different with the form control parameter specification index that signaling comprised of issuing other-end that the described form of issuing first terminal is controlled the parameter specification index that signaling comprised.In the present embodiment, after principal and subordinate's deterministic process and capabilities exchange procedure are finished, multipoint control unit sends form control signaling with the form of command messages to a plurality of terminals, particularly, described form control signaling is encoded according to the coding rule of agreement (Information Technology-ASN.1Encoding Rules-Specification of Packed Encoding Rules (PER)) regulation X.691, then it is encapsulated, for example, can adopt the method that is packaged into SRP (SimpleRetransmission Protocol) frame, wherein, the form of described SRP frame is as shown in the table:
Head (0xf9) |
Sequence number (1 byte) |
Message field can comprise one or more message |
Error detection code (2 byte) |
H.223, packet after will encapsulating then sends by multiplex layer.It will be appreciated by those skilled in the art that described multipoint control unit sends described form control signaling according to the concrete condition of video conferencing system network, for example, can send by packet switching network; Also communication network sends via satellite; Can also send by mobile radio communication.Those skilled in the art understand, the terminal that described first terminal can be the main meeting-place, spokesman's terminal and chairman's terminal etc., area ratio general shared in the image of its image of gathering after compound is bigger, and described first terminal is dynamic, and promptly the arbitrary terminal in the video conference can be as described first terminal.
Change in the example at one, be not by described graphic attribute is transmitted as the parameter of standard message in step S210, but obtain described graphic attribute by the definition non-standard message.Particularly, can define the graphic attribute request message according to the non-standard message structure of protocol definition H.245, wherein, described graphic attribute request message is used to refer to terminal and replys corresponding graphic attribute response message, and terminal then sends to multipoint control unit with described graphic attribute as the parameter of described graphic attribute response message.In other words, described multipoint control unit can send above-mentioned graphic attribute request message to one or more terminals by its control and management according to above-mentioned non-standard message structure, and the terminal that receives this request message is correspondingly returned requested graphic attribute.
And in another variation example of present embodiment, step S210~S212 is changed to and receives the only parameter specification index corresponding with this terminal that each terminal sends respectively.Those skilled in the art understand, in video conferencing system, the amount of calculation that image is decoded is greater than the amount of calculation that image is compressed/enlarges usually, therefore, if the resolution of the image that terminal captures is lower than described only parameter specification index, preferably, terminal is determined only parameter specification index with oneself, and above-mentioned definite only parameter specification index is sent to multipoint control unit.For example, before meeting, multipoint control unit can the handling capacity exchange process or other signalling interactive process obtain the resolution that terminal can be supported, the vision signal of avoiding being sent is not supported by terminal.The described only parameter specification index that multipoint control unit is determined is a resolution 1024 * 768, and the resolution of the image that captures of terminal is 800 * 600, then this moment, this terminal oneself determined that only parameter specification index is a resolution 800 * 600, and will be somebody's turn to do the only parameter specification index of oneself determining and send to multipoint control unit, then terminal is that 800 * 600 image sends to multipoint control unit with resolution, multipoint control unit will receive the resolution of image expand 1024 * 768 to.At this moment, though need expand the resolution of image to 1024 * 768, but it is that 800 * 600 image is decoded that multipoint control unit only needs resolution, decodes with respect to the image that for resolution is 1024 * 768, can reduce the computation burden of multipoint control unit.
Further, it will be appreciated by those skilled in the art that under many circumstances that described each terminal all can be determined parameter specification index separately respectively and send to described multipoint control unit.For example, determine the image resolution ratio that this terminal is fit to according to actual needs by concrete operating personnel; Again for example, the ultimate resolution of each terminal is sent to described multipoint control unit as described parameter specification index, at this moment, the parameter specification index of each terminal also may be different.Those skilled in the art can need realize the different methods of obtaining described parameter specification index in conjunction with concrete enforcement, do not repeat them here.
And in another variation example of present embodiment, step S210~S212 is changed to the only parameter specification index of determining described each terminal according to the ability of the picture style of video conference and each terminal.The ability that it will be appreciated by those skilled in the art that each terminal is not necessarily identical, and for example, each terminal respectively has the own resolution of being supported, for different terminals, its resolution of supporting is not quite similar.In the present embodiment, preferably, the handling capacity exchange process obtains the ability of each terminal.For example, can after finishing, principal and subordinate's deterministic process carry out capabilities exchange procedure to obtain the ability of terminal in the incipient stage of video conference, and also can be in video conference, as terminal add/when withdrawing from meeting, carry out capabilities exchange procedure to obtain the ability of terminal.Determine the only parameter specification index of described each terminal then according to the ability of the picture style of video conference and each terminal, for example descend Fig. 3, Fig. 4 A, the described embodiment of Fig. 4 B.Those skilled in the art can determine the only parameter specification index of described each terminal according to actual conditions, do not repeat them here.
And in another variation example of present embodiment, step S210~S212 is changed to the only parameter specification index of determining described each terminal at random.For example, can in 0~1 number range, generate a random number, when the value of this random number greater than 0.5 the time, determine that then described only parameter specification index is 1024 * 768,16 bit images.Those skilled in the art understand, in such variation example, determined parameter specification index and the actual incompatible situation of terminal capability may occur, thereby cause problem such as distortion, but replenish as one of the present invention, still can consider such variation example in appropriate circumstances.
In above-mentioned steps, emphasis is described the control procedure of image resolution ratio as described parameter specification index, it will be appreciated by those skilled in the art that except that image resolution ratio, the size of image is also relevant with the figure place of image, and it also can be used as the part of described parameter specification index.For example, the number of greyscale levels scope is that the figure place of 0~255 gray level image is 8, and each pixel needs 8bit in this moment image, promptly account for 1 byte, the figure place of corresponding coloured image is 24 with it, in addition, also has the image of 16 bit images, 32 bit images and various non-standard figure places.The figure place of image is big more, then it is decoded and the amount of calculation compressed just big more.
For example, if the figure place of the image that camera is caught of a plurality of terminals is 32, and the display of described a plurality of terminals is only supported 16 bit images, the described a plurality of terminals of then described multipoint control unit 3 requests send the image of 16 bit images or (described as follows) corresponding yuv format to it, rather than 32 bit images.Therefore, by the figure place of image that control terminal sends, also can alleviate the computation burden of described multipoint control unit 3.
It will be appreciated by those skilled in the art that described parameter specification index can comprise the figure place of described image resolution ratio or image individually respectively, also can comprise this two indexs, perhaps comprise other more indexs according to actual needs.Particularly, can determine described parameter specification index, not repeat them here with reference to the determined index of agreement H.223.
Those skilled in the art understand, in another preferred embodiment, earlier image is transformed into YUV (referring to YCbCr in the present embodiment) form from rgb format at video conference, and then transmit, this common way can reduce volume of transmitted data, wherein, YUV is a kind of basic color space, and Y refers to the legibility (Luminamce) of color, and U and V then are meant tone (Chrominance).Corresponding relation between RGB and the YUV is shown below:
Drawn by following formula, when the RGB image was 24, the color space size of YUV was as shown in the table:
When RGB image figure place changed, the big young pathbreaker of YUV color space did corresponding variation.Therefore,, can reduce the size of YUV color space by the figure place of compression RGB image, thus the computation burden of conserve bandwidth and coding/decoding.Further, human eye to compare the change of photopic vision diopter color the change sensitivity many, promptly for human eye, the Y component is more important than U component, therefore, can indicating terminal before sending image, suitably abandon U component and V component, to reach the purpose that reduces amount of calculation.For example, can send the partly image of sampling (subsampling) by indicating terminal, wherein, the common mode of above-mentioned part sampling has YUV444 (not having compression), YUV422 (33.3% compression), YUV411 (50.0% compression) and YUV420 (50.0% compression).Those skilled in the art can realize the conversion between above-mentioned YUV and the RGB by list of references " conversion between YUV and the RGB " (carrying " Changchun University's journal ", 2004 the 4th phases) at least, do not repeat them here.
Fig. 3 illustrates according to a specific embodiment of the present invention, the schematic diagram of image composite mode in video conferencing system.In this embodiment, video conferencing system comprises 5 terminals (terminal first, second, third, fourth, penta), the resolution of the image that each terminal captures all is 1920 * 1200, the shown image of each terminal is to be composited by all the other 4 images that terminal captured, be that combination picture is to be composited by image 43, image 44, image 45 and image 46, wherein, the resolution of above-mentioned 4 images in described combination picture is identical, and the wide high size of each image in promptly above-mentioned 4 images all is half of wide high size of described combination picture.
Particularly, in the present embodiment, multipoint control unit at first obtains the resolution of each terminal, for example, the resolution of terminal first is 1920 * 1200, the resolution of all the other 4 terminals is 800 * 600, can know according to above-mentioned image composite mode, in the shown combination picture of terminal first, the resolution of image 43 to 46 all is 480 * 300, in the shown combination picture of all the other 4 terminals, the resolution of image 43 to 46 all is 200 * 150, at this moment, can determine that the described only parameter specification index corresponding with the terminal first is resolution 200 * 150, the described only parameter specification index corresponding with all the other 4 terminals is that resolution is 480 * 300.Therefore, after the terminal first captures image, the resolution of image is compressed to 200 * 150 from 1920 * 1200, sends to described multipoint control unit after encoded then; After all the other 4 terminals capture image, the resolution of image is compressed to 480 * 300 from 1920 * 1200, sends to described multipoint control unit after encoded then.Described multipoint control unit receives the vision signal from above-mentioned 5 terminals, and to described decoding video signal, then do not need described vision signal is carried out resolution compression, but directly described vision signal is carried out Combined Processing, and, at last the vision signal behind the described coding is sent to above-mentioned 5 terminals respectively compound back to described encoding video signal.
And change in the example at one, the resolution of described 5 terminals can be other specifications, for example the resolution of terminal first is 1920 * 1200, terminal second, third resolution are 1024 * 800, terminal fourth, penta resolution are 800 * 600, at this moment, then above-mentioned multipoint control unit correspondingly instructs each terminal to determine that according to described multipoint control unit resolution sends corresponding video signals to it according to control method provided by the invention after obtaining these resolution.Those skilled in the art can realize such process in conjunction with prior art, do not repeat them here.
Fig. 4 A illustrates according to another embodiment of the present invention, the schematic diagram of image composite mode in video conferencing system.In this embodiment, the shown combination picture of all terminals all is composited by two parts image, a part is an image 41, another part is an image 42, wherein, image 41 is caught from the main meeting-place, and image 42 is caught from the spokesman, and the image that each terminal camera captures all is 1920 * 1200.If the resolution of each terminal that multipoint control unit gets access to is 800 * 600, the image composite mode that gets access to is that the size dimension ratio of image 41 and image 42 is 4: 1, then for the terminal of catching the main meeting-place image, described only parameter specification index is a resolution 800 * 600, and for all the other terminals, described only parameter specification index is a resolution 200 * 150, each terminal to described only parameter specification index, and sends to described multipoint control unit with the image compression that captures after encoded then.Those skilled in the art understand, above-mentioned main meeting-place and spokesman are dynamic, and promptly arbitrary meeting-place of conference participation can be as the main meeting-place, and can switch in real time in the process of meeting, and along with spokesman's variation, image 42 can show the image of collection from different terminals.
Fig. 4 B illustrates according to another embodiment of the present invention, the schematic diagram of image composite mode in video conferencing system.In this embodiment, the shown combination picture of all terminals all is composited by a master image 411 and nine little images, and wherein, little image 421 is in above-mentioned nine little images.
Those skilled in the art understand, above-mentioned Fig. 3, Fig. 4 A and the shown embodiment of Fig. 4 B are to non restrictive description of the present invention, in the practical video meeting, the image composite mode also comprises any compound corresponding composition that generates of a plurality of images, for example, whole image 9 five equilibriums, 16 five equilibriums and whole image are composited by 1 master image and 15 little images, and this does not influence flesh and blood of the present invention.
Fig. 5 illustrates according to the first embodiment of the present invention, a kind of flow chart of the auxiliary control method of in the terminal of video conferencing system, multimedia communication being handled, such assist control flow process and the control procedure in the control appliance (for example multipoint control unit) of video conferencing system adapt, and be for example embodiment illustrated in fig. 2.In the present embodiment, at first execution in step S230 receives the form control signaling from the control appliance of video conferencing system.Preferably, can receive the packet be packaged with described form control signaling and this packet is decoded by logical channel LC0, then in message field, obtain described form control signaling, wherein, described form control signaling is used to indicate this terminal to send the parameter specification index of vision signal to described control appliance.
Execution in step S231 compresses processing according to described form control signaling to vision signal to be sent then, makes compression rear video signal meet the indicated parameter specification index of described form control signaling.For example, described form control signaling indicating terminal sends 1024 * 768 vision signal to multipoint control unit, if the resolution of the vision signal that this terminal captures is higher than 1024 * 768, then with above-mentioned video signal compression to 1024 * 768.Again for example, described form control signaling indicating terminal sends 300 * 200 vision signal, if the resolution of the vision signal that this terminal captures is D1 form (720 * 576), then not according to prior art (H.232 agreement) with resolution compression to CIF form (352 * 576), but with resolution compression to described form control signaling indicated 300 * 200.Again for example, described form control signaling indicating terminal sends 24 vision signal, if the image figure place of the vision signal that this terminal captures is higher than 24, then the image figure place with above-mentioned vision signal is compressed to 24.Those skilled in the art can be in conjunction with the compression of existing techniques in realizing to vision signal, for example at least can be with reference to " conversion of compression domain image/video spatial resolution and color Processing Technology " (Chinese doctorate paper full-text database, 2007), do not repeat them here.
Follow execution in step S232, described compression rear video signal is carried out encoding process, particularly, can adopt and H.261, H.263 and H.264 wait agreement encoding video signal, those skilled in the art can realize the coding to vision signal with reference to " based on research and the realization of MCU in the protocol of I P video conference H.323 " that be stated from Chinese outstanding master thesis full-text database (2006) at least, do not repeat them here.
At last, execution in step S234 sends described encoded video signal.Particularly, after finishing principal and subordinate's deterministic process and capabilities exchange procedure, terminal sends to multipoint control unit and opens logical channel (OpenLogicalChannel) request message, multipoint control unit is replied and is opened logical channel confirmation (OpenLogicalChannelAck) response message, at this moment, set up the logical channel that is used for the transmission of video code stream, like this, can send described encoded video signal by this logical channel.
Those skilled in the art understand, the main distinction of above-mentioned steps and prior art is that each terminal receives the control command from multipoint control unit, comprise in this instruction that each terminal of requirement sends the parameter specification index of vision signal to multipoint control unit, and the parameter specification index difference that is required of each terminal, for example as the image resolution ratio of the terminal of key frame usually than the image resolution ratio height of other-end.Further, in the present invention, described multipoint control unit may be adjusted the parameter specification index of each terminal at any time according to concrete needs, for example will be replaced by terminal B by terminal A as the terminal of key frame, at this moment, the parameter specification index that terminal A is required obviously reduces, and the parameter specification index that terminal B is required then is enhanced usually; Again for example, multipoint control unit also may require this terminal to improve the resolution that it sends vision signal owing to the not fogging Chu that some terminals send.Particularly, those skilled in the art understand, in embodiment illustrated in fig. 2, described multipoint control unit can obtain the attribute of each terminal at any time again and determine the parameter specification index of each terminal, perhaps directly determine the parameter specification index of each terminal, and correspondingly, then each terminal sends corresponding video signals according to the parameter specification index that is redefined to described multipoint control unit, does not repeat them here.
Fig. 6 illustrates according to another embodiment of the present invention, is used to control the network topological diagram of the compound video conferencing system of many pictures.In this embodiment, multipoint control unit 31, multipoint control unit 32 and multipoint control unit 33 cascades, collaboratively multimedia communication is controlled, wherein, multipoint control unit 32 and multipoint control unit 33 are controlled conference group respectively, be that 32 pairs of described multipoint control units comprise that the conference group of terminal 91, terminal 92 and terminal 93 controls, 33 pairs of conference group that comprise terminal 94 of described multipoint control unit are controlled, and the multimedia communication between 31 pairs of different conference group of described multipoint control unit is controlled.Particularly, during remaining terminal is caught in described terminal 91 only shows from same conference group image, then the method for carrying out first embodiment shown in Figure 2 by described multipoint control unit 32 is controlled multimedia communication.And needing to show the terminal of other conference group when described terminal 91, for example described terminal 94 during the image of seizure, is then controlled multimedia communication by described multipoint control unit 31.Preferably, described multipoint control unit 31 obtains the graphic attribute and the image composite mode of described terminal 91 to 93 by described multipoint control unit 32, obtain the graphic attribute and the image composite mode of described terminal 94 by described multipoint control unit 33, determine described only parameter specification index according to above-mentioned graphic attribute and image composite mode then, and described only parameter specification index being sent to each terminal by described multipoint control unit 32 and described multipoint control unit 33, each terminal is according to described only parameter specification index compressed video signal and send.
In this embodiment, according to described only parameter specification index vision signal is compressed processing by terminal, and in a variation example of this embodiment, terminal is not compressed processing to vision signal, but according to described only parameter specification index vision signal is compressed processing by described multipoint control unit 32 and described multipoint control unit 33, be about to " terminal " that described multipoint control unit 32 and described multipoint control unit 33 are considered as described multipoint control unit 31.Those skilled in the art can realize such video conferencing system and each multipoint control unit wherein, be applied to control device provided by the invention in the multipoint control unit etc. in conjunction with prior art, do not repeat them here.
The cascade that it will be appreciated by those skilled in the art that described multipoint control unit is not limited to 3 multipoint control units, can be by the control of a plurality of multipoint control unit cooperation realizations to multimedia communication; Also being not limited to two-stage, can be multistage cascade, and this does not influence flesh and blood of the present invention.
Fig. 7 illustrates according to the first embodiment of the present invention, is used for the structural representation of control device that multimedia communication is handled in the control appliance (for example multipoint control unit) of video conferencing system.Particularly, this control device 5 comprises that first determines device 51, the second definite device 52 and first dispensing device 53.Wherein, described first determines that device 51 is used for determining the only parameter specification index of described each terminal; Described second determines that device 52 is used for determining described form control signaling according to the only parameter specification index of described each terminal, and wherein said form control signaling is used to indicate this terminal to send the parameter specification index of vision signal to described control appliance; Described first dispensing device 53 is used for sending form control signaling to a plurality of terminals respectively, and this device 53 can be realized with reference to prior art.Those skilled in the art understand, in above-mentioned a plurality of parameter specification index, it is different with the form control parameter specification index that signaling comprised of issuing other-end that the described form of issuing first terminal is controlled the parameter specification index that signaling comprised, preferably, this first terminal is to handle the terminal of key frame, promptly this terminal send to the vision signal of described multipoint control unit will be as the key frame in the whole video meeting.It is obviously different that the control that such control device carries out each terminal and prior art have, and it is identical for example requiring the parameter specification index of each terminal in the prior art usually.
Particularly, described first definite device 51 can be determined described parameter specification index in several ways.For example in the present embodiment, described first determines that device 51 comprises first deriving means 511, it is used for obtaining respectively the graphic attribute of each terminal, obtain the image composite mode of each terminal respectively, and determine the only parameter specification index corresponding with this terminal according to the graphic attribute of described each terminal and corresponding with it image composite mode.Particularly, those skilled in the art understand, preferably, described graphic attribute is meant that terminal presentation facility supports the resolution and/or the image figure place of (or be provided with), for example can be on the basis of agreement H.245 with described graphic attribute as the parameter in terminal capabilities set (Terminal Capability Set) request (Request) message, and send this terminal capabilities set request message, thereby obtain such graphic attribute to terminal.And described image composite mode can be dynamic, and therefore described first determines that device 51 can obtain described image composite mode in advance according to actual conditions before video conference begins, and also can obtain described image composite mode at any time in meeting.Further, this device 51 is used to refer to the required minimum parameter specification index of combination picture by obtaining described image attributes, just can be used to refer to the required minimum parameter specification index of the medium and small image of this combination picture in conjunction with the described image composite mode that obtains again, and should be defined as described only parameter specification index by minimum parameter specification index.It will be appreciated by those skilled in the art that at one to change in the example that described device 51 can realize that each sub-device is realized part of functions wherein respectively, for example obtains the function of graphic attribute, does not repeat them here by a plurality of sub-devices.
And change in the example at one, described first determines that device 51 comprises first receiving device (not shown among Fig. 7), it is used for receiving respectively the only parameter specification index corresponding with this terminal that each terminal sends.For example, described each terminal is determined parameter specification index separately respectively and is sent to described multipoint control unit, preferably, determines the image resolution ratio that this terminal is fit to according to actual needs by concrete operating personnel.
Change in the example at another, described first determines that device 51 comprises that the 3rd determines device (not shown among Fig. 7), and it is used for determining at random the only parameter specification index of described each terminal.For example, can in 0~1 number range, generate a random number, when the value of this random number greater than 0.5 the time, determine that then described only parameter specification index is 1024 * 768,16 bit images.
Preferably, described second determines that device 52 can directly will place the ad-hoc location of a packet (control signaling) at the only parameter specification index of a terminal, can correspondingly obtain described parameter specification index from this ad-hoc location so that this control signaling is received the back terminal by terminal; Change in the example at one, described device 52 is encrypted the ad-hoc location that is placed in this control signaling with described parameter specification index, and this moment, then receiving terminal encryption, deciphering rule according to a preconcerted arrangement obtained described parameter specification index after described content is decrypted; Change in the example at another, after upsetting the parameter specification index, described device 52 places the diverse location of this control signaling respectively, for example determine the position of every byte according to hash algorithm, these change example does not influence flesh and blood of the present invention, does not repeat them here.
Fig. 8 illustrates according to the first embodiment of the present invention, the sub controlling unit of in the terminal equipment of video conferencing system multimedia communication being handled.Particularly, described sub controlling unit 6 comprises the 3rd receiving system 61 and the 3rd dispensing device 62.Wherein, described the 3rd receiving system 61 is used to receive the form control signaling from the control appliance of video conferencing system, and similarly, described form control signaling is used to indicate this terminal to send the parameter specification index of vision signal to described control appliance; Described the 3rd dispensing device 62 is used for sending vision signal according to described form control signaling to described control appliance.Preferably, described the 3rd receiving system 61 is packaged with the packet of described form control signaling by logical channel LC0 reception and this packet is decoded, then in message field, obtain described form control signaling, for example the ad-hoc location from this control signaling takes out described form control signaling, thereby obtains the parameter specification index from control device 5 shown in Figure 7.
Preferably, described the 3rd dispensing device 62 comprises: first compression set 621, it is used for according to described form control signaling vision signal to be sent being compressed processing, makes compression rear video signal meet the indicated parameter specification index of described form control signaling; Second code device 622, it is used for described compression rear video signal is carried out encoding process; And the 4th dispensing device 623, it is used to send described encoded video signal.For example, if described form control signaling indicating terminal sends 1024 * 768 vision signal to multipoint control unit, and the resolution of the vision signal that this terminal captures is higher than 1024 * 768, and then described first compression set 621 is with above-mentioned video signal compression to 1024 * 768.Correspondingly, described encoding process and process of transmitting can be achieved with reference to prior art, do not repeat them here.
In conjunction with Fig. 1 and Fig. 6, it will be appreciated by those skilled in the art that preferably the control device 5 that Fig. 7 provided places the multipoint control unit 3 of video conferencing system, be used to control the multimedia communication of this unit 3 and each terminal 9, especially control the compound vision signal of many pictures; Correspondingly, the sub controlling unit 6 that Fig. 8 provided is placed in each terminal 9, is used for cooperating with described control device 5 control procedure of the vision signal that realizes that many pictures are compound.Those skilled in the art understand, these devices can be realized with reference to prior art, for example described control device 5 can be on the basis of the control module in the existing multipoint control unit with reference to shown in Figure 7 being achieved, and mainly increase by first and determine that device 51 and second determines device 52; Similarly, described sub controlling unit 6 can be on the basis of the video conference module in the existing terminal with reference to shown in Figure 8 being achieved, and mainly increase the 3rd receiving system 61 and first compression set 621.For example, Fig. 1 or embodiment illustrated in fig. 6 in, place the control device of multipoint control unit 3 can also comprise the second receiving system (not shown), it is used to receive the vision signal from a plurality of terminals; The first code device (not shown), it is used for described encoding video signal; And the second dispensing device (not shown), it is used for the vision signal behind the described coding is sent to described each terminal respectively.And the second such receiving system, first code device, second dispensing device all can be achieved with reference to prior art, do not repeat them here.
Further, those skilled in the art understand, above-mentioned shown in Figure 7 first determines that device 51 can redefine the parameter specification index of each terminal whenever and wherever possible, and these parameter specification index are sent to relevant terminal by the control signaling, each terminal is then by placing the 3rd dispensing device 62 in it according to the parameter specification index compressed video signal that is required accordingly and send to corresponding multipoint control unit.In such embodiments, adjust vision signal and have certain flexibility, thereby can meet the different needs.
The present invention realizes control to video conferencing system by the mode that is different from prior art, especially the vision signal that sends between multipoint control unit and each terminal is controlled, and preferably controls these vision signals by modes such as graphics resolutions.By technical scheme provided by the invention, can instruct different terminals to send the vision signal of different resolution, thereby make different terminals can send the vision signal of only resolution according to actual needs, the resolution that has changed the vision signal of terminal transmission of the prior art is higher than the present situation of the needed resolution of video conferencing system, thereby saved the multipoint control unit spent resource of decoding, improved the efficient of video conferencing system.It will be appreciated by those skilled in the art that according to actual needs, can dynamically control these vision signals, for example, meeting begin preceding, have terminal to add/when withdrawing from meeting and picture style when changing, can control to these vision signals, this also influences flesh and blood of the present invention.
From another angle, this programme also can be understood that to have expanded 323 signalings, and is synthetic at MCU end direct decoding after terminal realization image compression by dynamic adjustment terminal code distinguishability, reduces the decoding cost, the reduction code stream.Behind joining meeting to terminal, its code stream form of uploading is controlled by MCU.The requirement that MCU is compound according to current picture sends the picture of different resolution to different demanding terminals, and can dynamically adjust in meeting, changes to adapt to the compound style of picture.For not participating in the compound terminal of picture, require it not upload code stream, to save system bandwidth.
More than specific embodiments of the invention are described.It will be appreciated that the present invention is not limited to above-mentioned specific implementations, those skilled in the art can make various distortion or modification within the scope of the claims, and this does not influence flesh and blood of the present invention.