WO2014172907A1 - 视频会议处理方法及设备 - Google Patents

视频会议处理方法及设备 Download PDF

Info

Publication number
WO2014172907A1
WO2014172907A1 PCT/CN2013/074860 CN2013074860W WO2014172907A1 WO 2014172907 A1 WO2014172907 A1 WO 2014172907A1 CN 2013074860 W CN2013074860 W CN 2013074860W WO 2014172907 A1 WO2014172907 A1 WO 2014172907A1
Authority
WO
WIPO (PCT)
Prior art keywords
video
video information
personal
participant
information
Prior art date
Application number
PCT/CN2013/074860
Other languages
English (en)
French (fr)
Inventor
董建明
葛鹏
汤畅
孙喆
Original Assignee
华为技术有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 华为技术有限公司 filed Critical 华为技术有限公司
Priority to CN201380001428.1A priority Critical patent/CN104380720B/zh
Priority to EP13866482.6A priority patent/EP2816801B1/en
Priority to PCT/CN2013/074860 priority patent/WO2014172907A1/zh
Priority to US14/335,238 priority patent/US9392191B2/en
Publication of WO2014172907A1 publication Critical patent/WO2014172907A1/zh

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N5/00Details of television systems
    • H04N5/222Studio circuitry; Studio devices; Studio equipment
    • H04N5/262Studio circuits, e.g. for mixing, switching-over, change of character of image, other special effects ; Cameras specially adapted for the electronic generation of special effects
    • H04N5/265Mixing
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00Television systems
    • H04N7/14Systems for two-way working
    • H04N7/15Conference systems
    • H04N7/152Multipoint control units therefor

Definitions

  • the present invention relates to communication technologies, and in particular, to a video conference processing method and device. Background technique
  • video conferencing has become the fastest growing multimedia communication method.
  • Traditional business and executive meetings have been converted to video conferencing.
  • the video conferencing system distributes multiple pieces of data, such as static and dynamic images, voice, text, and pictures, to individuals or groups of individuals or groups in two or more different places through existing electrical communication transmission media.
  • geographically dispersed users can get together, exchange information through images, sounds, etc., and can replace the on-site meetings in effect.
  • a camera and a display device are usually disposed at each site, and the camera can collect images of the local site, and the display device simultaneously displays images of the local site and images of other sites. Since the displayed images are all in the venue, the display space of each site image is limited, but the number of participants in each site is uncertain. It is difficult to ensure that the viewer clearly sees the speaker or wants Participants to see. Moreover, when the number of sites is relatively large, the images of all the sites are displayed on the same display device, and the effect is not good. Summary of the invention
  • the embodiment of the present invention provides a video conference processing method and device, which avoids the defect that the display effect is poor due to the inequality of the site when the overall video of the site is used, and provides flexibility for the most efficient information display.
  • an embodiment of the present invention provides a video conference processing method, including:
  • the video conference server receives the first video information sent by the first video terminal, and the second video information sent by the second video terminal, where the first video information includes each participant in the conference site where the first video terminal is located.
  • personal video information, the second video information includes the second video information received by the second video conference server, and the first video information And obtaining a preset number of the personal video information, and combining the preset number of the personal video information to generate a composite video, so that in the composite video, the preset number of the individuals
  • the participants corresponding to the video information are in the same background of the conference site;
  • the video conference server sends the composite video to the third video terminal for display.
  • the method further includes:
  • the first participant replacement instruction carries the first a participant identifier and a second participant identifier
  • the first participant replacement instruction is used to indicate that the first participant identifier included in the composite video is replaced by personal video information indicated by the second participant identifier Instructed personal video information
  • the personal video information indicated by the first participant identifier is personal video information included in the composite video
  • the personal video information indicated by the second participant identifier is included in the synthesis Personal video information of the first video information other than in the video, or personal video information included in the second video information other than the synthesized video
  • the video conference server replaces the personal video information indicated by the first participant identifier in the composite video with the personal video information indicated by the second participant identifier according to the first participant replacement instruction;
  • the video conference server sends the composite video after replacing the personal video information to the third video terminal for display.
  • the method further includes:
  • the second participant replacement instruction is used to indicate that the personal video information included in the composite video is replaced by the personal video information indicated by the third participant identifier, and the third participant identifier indicates the personal video information indicated by the third participant identifier.
  • the video conference server selects target personal video information from the personal video information included in the composite video according to the second participant replacement instruction, and replaces the selected target personal video information with the third participant identifier. Instructed personal video information;
  • the video conference server sends the composite video after replacing the personal video information to the third video terminal for display.
  • the second participant replacement instruction further carries location information
  • the video conference server replaces the second participant according to the instruction Selecting the target personal video information from the personal video information included in the composite video includes:
  • the video conference server uses the personal video information corresponding to the location information included in the second participant replacement instruction included in the composite video as the target personal video information according to the second participant replacement instruction. .
  • the method further includes:
  • the video conference server receives an add instruction sent by the first video terminal, the second video terminal, or the third video terminal, where the add command carries a fourth participant identifier, and the adding The instruction is used to indicate adding personal video information indicated by the fourth participant identifier in the composite video, where the personal video information indicated by the fourth participant identifier is included in the video other than the synthesized video.
  • the add command carries a fourth participant identifier
  • the instruction is used to indicate adding personal video information indicated by the fourth participant identifier in the composite video, where the personal video information indicated by the fourth participant identifier is included in the video other than the synthesized video.
  • the video server adds the personal video information indicated by the fourth participant identifier to the composite video according to the adding instruction;
  • the video conference server sends the composite video after adding the personal video information to the third video terminal for display.
  • the method further includes:
  • the video conference server receives the deletion instruction sent by the first video terminal, the second video terminal, or the third video terminal, where the deletion instruction carries a fifth participant identifier, and the deleting The instruction is used to indicate that the personal video information indicated by the fifth participant identifier is deleted from the personal video information included in the composite video, where the fifth participant identifies the indicated personal
  • the video information is personal video information included in the composite video; the personal video information indicated by the fifth participant identifier is deleted from the information;
  • the video conference server sends the composite video after deleting the personal video information to the third video terminal for display.
  • the video conference server combines the preset quantity of the personal video information To generate a composite video including:
  • the video conference server splices the corresponding image in the preset number of the personal video information to generate a composite image, where the corresponding image of the preset number of the personal video information is synchronized in time series; A plurality of the composite images are combined to generate a composite video.
  • the video conference server combines the preset quantity of the personal video information To generate a composite video including:
  • the video conference server arranges the image information extracted from the preset number of the personal video information in a preset background image to generate a composite image, where the first video information and the second video information are used.
  • the preset number of the image information included in the personal video information obtained is synchronized in time series; a plurality of the composite images are combined to generate a composite video.
  • an embodiment of the present invention provides a video conference processing method, including:
  • the composite video sent by the video conference server, where the composite video receives the first video information received by the video conference server from the first video terminal, and the second video received from the second video terminal Obtaining a preset number of personal video information in the information, and synthesizing the preset number of the personal video information, wherein in the composite video, the preset number of the personal video information respectively correspond to the meeting
  • the first video information includes personal video information of each participant in the site where the first video terminal is located, and the second video information includes where the second video terminal is located. Personal video information of each participant in the venue;
  • the third video terminal displays the composite video.
  • the method further includes:
  • the third video information includes personal video information of each participant in the conference site where the third video terminal is located;
  • the method further includes:
  • the third video terminal generates a first participant replacement instruction according to the received handover indication information input by the user, where the first participant replacement instruction carries the first participant identifier and the second participant identifier, where The first participant replacement instruction is used to indicate that the personal video information indicated by the first participant identifier included in the composite video is replaced by the personal video information indicated by the second participant identifier, the first participant
  • the personal video information indicated by the identifier is personal video information included in the composite video
  • the personal video information indicated by the second participant identifier is included in the first video information except the composite video.
  • the third video terminal sends the first participant replacement instruction to the video conference server to enable the video conference server to use the second participant identifier to indicate a personal video according to the first participant replacement instruction.
  • the information replaces the personal video information indicated by the first participant identifier in the composite video;
  • the third video terminal receives the composite video that is sent by the video conference server according to the first participant replacement instruction and replaces the personal video information, and displays the composite video.
  • the method further includes:
  • the third video terminal generates a second participant replacement finger carrying the third participant identifier
  • the second participant replacement instruction is used to indicate that the personal video information indicated by the third participant identifier is used to replace the individual included in the synthesized video sent to the first video terminal or the second video terminal.
  • the target personal video information is selected from the personal video information included in the synthesized video of the second video terminal, and the selected target personal video information is replaced with the personal video information indicated by the third participant identifier.
  • the method further includes:
  • the third video terminal generates an add instruction according to the received indication information of the received user input, where the add instruction carries a fourth participant identifier, where the add instruction is used to indicate adding the
  • the fourth participant identifier indicates the personal video information indicated, and the personal video information indicated by the fourth participant identifier is personal video information included in the first video information except the synthesized video, or is included in Personal video information of the second video information other than the synthesized video;
  • the third video terminal receives the composite video that is sent by the video conference server according to the adding instruction and adds the personal video information, and displays the video.
  • the method further includes:
  • the third video terminal generates a deletion instruction according to the received deletion instruction information input by the user, where the deletion instruction carries a fifth participant identifier, where the deletion instruction is used to indicate that the content included in the composite video is included Deleting the personal video information indicated by the fifth participant identifier in the personal video information, where the personal video information indicated by the fifth participant identifier is personal video information included in the composite video;
  • an embodiment of the present invention provides a video conference server, including:
  • a receiving unit configured to receive first video information that is sent by the first video terminal, and second video information that is sent by the second video terminal, where the first video information includes each of the sites where the first video terminal is located.
  • a personal video information of the participant the second video information including the second processing unit, connected to the receiving unit, for obtaining from the received second video information and the first video information Predetermining the number of the personal video information, and synthesizing the preset number of the personal video information to generate a composite video, so that in the composite video, the preset number of the personal video information respectively The corresponding participants are in the same venue background;
  • a sending unit connected to the processing unit, configured to send the composite video to the third video terminal for display.
  • the receiving unit is further configured to receive a first participant replacement command sent by the first video terminal, the second video terminal, and the third video terminal, where
  • the first participant replacement instruction carries a first participant identifier and a second participant identifier, where the first participant replacement instruction is used to indicate that the composite is replaced by the personal video information indicated by the second participant identifier.
  • the personal video information indicated by the first participant identifier included in the video, the personal video information indicated by the first participant identifier is personal video information included in the composite video, and the second participant identifier indication Personal video information is personal video information included in the first video information other than the composite video, or personal video information included in the second video information other than the composite video;
  • the processing unit is further configured to replace the personal video information indicated by the first participant identifier in the composite video with the personal video information indicated by the second participant identifier according to the first participant replacement instruction;
  • the sending unit is further configured to send the composite video after replacing the personal video information to the third video terminal for display.
  • the receiving unit is further configured to receive the first video end a first participant replacement instruction sent by the second video terminal or the third video terminal, where the second participant replacement instruction carries a third participant identifier, the second participant The replacement instruction is used to indicate that the personal video information included in the composite video is replaced by the personal video information indicated by the third participant identifier, and the personal video information indicated by the third participant identifier is included in the synthesized video.
  • the receiving unit is further configured to receive the first video end a first participant replacement instruction sent by the second video terminal or the third video terminal, where the second participant replacement instruction carries a third participant identifier, the second participant The replacement instruction is used to indicate that the personal video information included in the composite video is replaced by the personal video information indicated by the third participant identifier, and the personal video information indicated by the third participant identifier is included in the synthesized video.
  • the processing unit is further configured to select target personal video information from the personal video information included in the composite video according to the second participant replacement instruction, and replace the selected target personal video information with the third participant Identifying the indicated personal video information;
  • the sending unit is further configured to send the composite video after replacing the personal video information to the third video terminal for display.
  • the second participant replacement instruction further carries location information
  • the processing unit is further configured to use the personal video information corresponding to the location information included in the second participant replacement instruction included in the composite video as the target individual according to the second participant replacement instruction. Video information.
  • the receiving unit is further configured to receive an adding instruction sent by the first video terminal, the second video terminal, or the third video terminal, where the adding instruction And carrying the fourth participant identifier, where the adding instruction is used to indicate that the personal video information indicated by the fourth participant identifier is added to the composite video, where the personal video information indicated by the fourth participant identifier is included Personal video information of the first video information other than the synthesized video, or personal video information included in the second video information other than the synthesized video;
  • the processing unit is further configured to add personal video information indicated by the fourth participant identifier to the composite video according to the adding instruction;
  • the sending unit is further configured to send the synthesized video after adding the personal video information to the third video terminal for display.
  • the receiving unit is further configured to receive a deletion instruction sent by the first video terminal, the second video terminal, or the third video terminal, where the deletion instruction Carrying a fifth participant identifier, the deletion instruction is used to indicate from the composite video Deleting the personal video information indicated by the fifth participant identifier in the included personal video information, where the personal video information indicated by the fifth participant identifier is personal video information included in the composite video;
  • the processing unit is further configured to delete the personal video information indicated by the fifth participant identifier in the personal video information included in the composite video according to the deleting instruction;
  • the sending unit is further configured to send the composite video after deleting the personal video information to the third video terminal for display.
  • the processing unit is further configured to use the preset quantity of the personal video Corresponding images in the information are spliced to generate a composite image, wherein corresponding images of the preset number of the personal video information are synchronized in time series; and the plurality of composite images are combined to generate a composite video.
  • the processing unit is further configured to use the preset quantity of the personal video
  • the image information included in the information is arranged in a preset background image to generate a composite image, wherein the preset number of the image information included in the personal video information obtained from the first video information and the second video information Synchronizing in time series; combining the plurality of composite images to generate a composite video.
  • the embodiment of the present invention provides a third video terminal, including:
  • a receiving unit configured to receive a composite video sent by the video conference server, where the composite video receives the first video information received by the video conference server from the first video terminal, and the second video information received from the second video terminal Obtaining a preset number of personal video information in the video information, and synthesizing the preset number of the personal video information, where the preset number of the personal video information respectively correspond to The participant is in the same background of the conference site; the first video information includes personal video information of each participant in the conference site where the first video terminal is located, and the second video information includes where the second video terminal is located. Personal video information of each participant in the venue;
  • a display unit connected to the receiving unit, for displaying the composite video.
  • the receiving unit is further configured to receive video information sent by at least one video collection device;
  • the third video terminal further includes: The first processing unit is connected to the receiving unit, and packages the received video information to form a third video information, where each of the at least one video collecting device is configured to collect the third video.
  • the video information of the at least one participant in the site where the terminal is located the third video first sending unit is connected to the processing unit, and configured to send the third video information to the video conference server, so that Generating a composite video according to the third video information and the first video information, and sending the composite video to the second video terminal, or generating a composite video according to the third video information and the second video information The first video terminal.
  • the third video terminal further includes:
  • a second processing unit configured to generate, according to the received handover indication information of the user input, a first participant replacement instruction, where the first participant replacement instruction carries a first participant identifier and a second participant identifier, The first participant replacement instruction is used to indicate that the personal video information indicated by the first participant identifier included in the composite video is replaced by the personal video information indicated by the second participant identifier, where the first participant attends
  • the personal video information indicated by the identifier is the personal video information included in the composite video, and the personal video information indicated by the second participant identifier is included in the first video information except the synthesized video.
  • a second sending unit configured to send, by the second processing unit, the first participant replacement instruction to the video conference server, to enable the video conference server to replace the instruction according to the first participant
  • the personal video information indicated by the second participant identifier replaces the personal video information indicated by the first participant identifier in the composite video
  • the receiving unit is further configured to receive the composite video that is sent by the video conference server according to the first participant replacement instruction and replace the personal video information, and display the video through the display unit.
  • the third video terminal further includes:
  • a third processing unit configured to: when it is detected that the voice collection device in the site where the third video terminal is located has a voice input within a preset time range, determine a third participant to indicate the participant who is speaking And generating a second participant replacement instruction carrying the third participant identifier, where the second participant replacement instruction is used to indicate that the personal video information indicated by the third participant identifier is replaced and sent to the a composite video of the first video terminal or the second video terminal Personal video information included;
  • a third sending unit configured to be connected to the third processing unit, configured to send the second participant replacement instruction to the video conference server, to enable the video conference server to follow the second participant replacement instruction Selecting target personal video information from the personal video information included in the synthesized video sent to the first video terminal or the second video terminal, and replacing the selected target personal video information with the third participant identifier indication Personal video information.
  • the third video terminal further includes:
  • a fourth processing unit configured to generate an add instruction according to the received indication information of the received user input, where the add instruction carries a fourth participant identifier, where the add instruction is used to indicate adding in the composite video
  • the fourth participant identifies the indicated personal video information, and the personal video information indicated by the fourth participant identifier is personal video information included in the first video information except the synthesized video, or includes Personal video information of the second video information other than the synthesized video;
  • a fourth sending unit connected to the fourth processing unit, configured to send the adding instruction to the video conference server, to enable the video server to add the fourth in the composite video according to the adding instruction
  • the participant identifies the personal video information indicated
  • the receiving unit is further configured to receive the synthesized video that is sent by the video conference server according to the adding instruction and add the personal video information, and display the video through the display unit.
  • the third video terminal further includes:
  • a fifth processing unit configured to generate a delete instruction according to the received deletion indication information of the user input, where the deletion instruction carries a fifth participant identifier, where the deletion instruction is used to indicate that the composite video is included
  • the personal video information indicated by the fifth participant identifier is deleted from the personal video information, and the personal video information indicated by the fifth participant identifier is personal video information included in the composite video;
  • the fifth sending unit is connected to the fifth processing unit, and configured to send the deletion instruction to the video conference server to enable the video server to include personal video information included in the composite video according to the deletion instruction. Deleting the personal video information indicated by the fifth participant identifier; the receiving unit is further configured to receive the synthesized video that is sent by the video conference server according to the deletion instruction and delete the personal video information, and pass the The display unit is displayed.
  • an embodiment of the present invention provides a video conference server, including: a processor, Letter interface, memory and bus;
  • the processor, the communication interface, and the memory are interconnected by the bus; the communication interface is configured to receive first video information sent by the first video terminal, and second video information sent by the second video terminal.
  • the first video information includes personal video information of each participant in the site where the first video terminal is located, and the second video information includes the memory, where the instruction or data is stored;
  • the processor calls an instruction stored in the memory to obtain a preset number of the personal video information from the received second video information and the first video information, and the pre- And the number of the personal video information is combined to generate a composite video, so that the participants corresponding to the preset number of the personal video information are in a consistent conference background in the composite video;
  • the communication interface is further configured to send the composite video to the third video terminal for display.
  • the communications interface is further configured to receive, by the first video terminal, the second video terminal, and the third participant, a first participant replacement command, where
  • the first participant replacement instruction carries a first participant identifier and a second participant identifier, where the first participant replacement instruction is used to indicate that the composite is replaced by the personal video information indicated by the second participant identifier.
  • the personal video information indicated by the first participant identifier included in the video, the personal video information indicated by the first participant identifier is personal video information included in the composite video, and the second participant identifier indication Personal video information is personal video information included in the first video information other than the composite video, or personal video information included in the second video information other than the composite video;
  • the processor is further configured to invoke the instruction and data of the memory to implement, replacing, by the first participant replacement instruction, the first video in the composite video with personal video information indicated by the second participant identifier The participant identifies the personal video information indicated;
  • the communication interface is further configured to send the composite video after replacing the personal video information to the third video terminal for display.
  • the communication interface is further configured to receive a first participant replacement instruction sent by the first video terminal, the second video terminal, or the third video terminal,
  • the second participant replacement instruction carries a third participant identifier, where the second participant replacement instruction is used to replace the personal video information indicated by the third participant identifier to replace the composite video included.
  • Personal video information, the personal video information indicated by the third participant identifier is personal video information included in the first video information other than the synthesized video, or included in the synthesized video Personal video information of the second video information;
  • the processor is further configured to invoke the instruction and data of the memory to implement, selecting, according to the second participant replacement instruction, target personal video information from the personal video information included in the composite video, The target personal video information is replaced with the personal video information indicated by the third participant identifier;
  • the communication interface is further configured to send the composite video after replacing the personal video information to the third video terminal for display.
  • the second participant replacement instruction further carries location information
  • the processor is further configured to invoke the instruction and data of the memory to implement, according to the personal video information corresponding to the second location information, as the target personal video information.
  • the communication interface is further configured to receive an add instruction sent by the first video terminal, the second video terminal, or the third video terminal, where the adding instruction And carrying the fourth participant identifier, where the adding instruction is used to indicate that the personal video information indicated by the fourth participant identifier is added to the composite video, where the personal video information indicated by the fourth participant identifier is included Personal video information of the first video information other than the synthesized video, or personal video information included in the second video information other than the synthesized video;
  • the processor is further configured to invoke the instruction and data of the memory to implement, adding, according to the adding instruction, personal video information indicated by the fourth participant identifier in the composite video;
  • the communication interface is further configured to send the synthesized video after adding the personal video information to the third video terminal for display.
  • the communication interface is further configured to receive a deletion instruction sent by the first video terminal, the second video terminal, or the third video terminal, where the deletion instruction Carrying a fifth participant identifier, the deletion instruction is used to indicate from the composite video Deleting the personal video information indicated by the fifth participant identifier in the included personal video information, where the personal video information indicated by the fifth participant identifier is personal video information included in the composite video;
  • the processor is further configured to invoke the instruction and the data of the memory to implement, deleting, according to the deleting instruction, personal video information indicated by the fifth participant identifier in the personal video information included in the composite video;
  • the communication interface is further configured to send the composite video after deleting the personal video information to the third video terminal for display.
  • the processor is further configured to use the preset quantity of the personal video Corresponding images in the information are spliced to generate a composite image, wherein corresponding images of the preset number of the personal video information are synchronized in time series; and the plurality of composite images are combined to generate a composite video.
  • the processor is further configured to use the preset quantity of the personal video
  • the image information included in the information is arranged in a preset background image to generate a composite image, wherein the preset number of the image information included in the personal video information obtained from the first video information and the second video information Synchronizing in time series; combining the plurality of composite images to generate a composite video.
  • an embodiment of the present invention provides a third video terminal, including: a processor, a communication interface, a memory, a bus, and a display;
  • processor, the communication interface, the memory, and the display are interconnected by the bus;
  • the communication interface is configured to receive a composite video sent by a video conference server, where the composite video is received by the video conference server from the first video terminal, and received from the second video terminal. Obtaining a preset amount of personal video information in the second video information, and synthesizing the preset number of the personal video information, where the preset number of the personal video information respectively correspond to
  • the first video information includes the personal video information of each participant in the conference site where the first video terminal is located, and the second video information includes the second video terminal.
  • the memory is configured to store instructions or data;
  • the processor invokes instructions stored in the memory to effect display of the composite video through the display.
  • the communication interface is further configured to receive video information sent by at least one video collection device;
  • the processor is further configured to package the received video information to form a third video information, where each of the at least one video collection device is configured to collect the third video terminal in the conference site
  • the video information of the at least one participant, the third video information including the communication interface is further configured to send the third video information to the video conference server, so that the video conference server is configured according to the And generating, by the third video information and the first video information, a composite video, to the second video terminal, or generating a composite video according to the third video information and the second video information, and sending the composite video to the first video terminal.
  • the processor is further configured to invoke the instruction and the data of the memory, to generate, by using the received handover indication information of the user input, to generate a first participant replacement instruction, where The first participant replacement instruction carries a first participant identifier and a second participant identifier, where the first participant replacement instruction is used to indicate that the composite video is replaced by the personal video information indicated by the second participant identifier.
  • the first participant identifier included in the personal video information indicated by the first participant identifier, and the personal video information indicated by the first participant identifier is personal video information included in the composite video, where the second participant identifier indicates
  • the personal video information is personal video information included in the first video information other than the synthesized video, or personal video information included in the second video information other than the synthesized video;
  • the communication interface is further configured to send, by the video conference server, the first participant replacement instruction, to enable the video conference server to indicate an individual indicated by the second participant identifier according to the first participant replacement instruction.
  • the video information replaces the personal video information indicated by the first participant identifier in the composite video; and receives the composite video that is sent by the video conference server according to the first participant replacement instruction to replace the personal video information. And displayed by the display.
  • the processor is further configured to: when it is detected that the voice collection device in the site where the third video terminal is located has a voice input within a preset time range, determine to indicate The third participant ID of the participant who is speaking; the generation carries the third participant a second participant replacement instruction, where the second participant replacement instruction is used to indicate that the personal video information indicated by the third participant identifier is replaced by the first video terminal or the second Personal video information contained in the composite video of the video terminal;
  • the communication interface is further configured to send the second participant replacement instruction to the video conference server, to enable the video conference server to send from the first video terminal to the first video terminal according to the second participant replacement instruction. Or selecting the target personal video information from the personal video information included in the synthesized video of the second video terminal, and replacing the selected target personal video information with the personal video information indicated by the third participant identifier.
  • the processor is further configured to invoke the instruction and the data of the memory to implement, to generate an add instruction according to the received indication information of the received user input, where the add instruction is carried in There is a fourth participant identifier, where the adding instruction is used to indicate that the personal video information indicated by the fourth participant identifier is added to the composite video, and the personal video information indicated by the fourth participant identifier is included in the Personal video information of the first video information other than the synthesized video, or personal video information included in the second video information other than the synthesized video;
  • the communication interface is further configured to send the adding instruction to the video conference server, so that the video server adds the personal video information indicated by the fourth participant identifier to the composite video according to the adding instruction; Receiving, by the video conference server, the synthesized video added by adding the personal video information according to the adding instruction, and displaying the video through the display.
  • the processor is further configured to invoke the instruction and the data of the memory, to generate a delete instruction according to the received deletion indication information of the user input, where the deletion instruction is carried in There is a fifth participant identifier, where the deletion instruction is used to indicate that the personal video information indicated by the fifth participant identifier is deleted from the personal video information included in the composite video, and the fifth participant identifies the indicated personal Video information is personal video information included in the composite video;
  • the communication interface is further configured to send the deletion instruction to the video conference server to enable the personal video information indicated by the fifth participant identifier to be received; and receive the deletion sent by the video conference server according to the deletion instruction.
  • the composite video after the personal video information is displayed by the display.
  • the video conference server receives the first video information sent by the first video terminal, and the second video information sent by the second video terminal, where the first The video information includes personal video information of each participant in the conference site where the first video terminal is located, the second video information includes personal video information of each participant in the conference site where the second video terminal is located; and the received second video information And a total amount of personal video information is obtained in the first video information, and a preset number of personal video information is combined to generate a composite video, so that a preset number of personal video information corresponding to the participant in the composite video All are in a consistent venue background; the composite video is sent to the third video terminal for display.
  • FIG. 1 is a flowchart of a first video conference processing method according to an embodiment of the present invention
  • FIG. 2 is a schematic diagram of a site layout according to an embodiment of the present invention.
  • FIG. 3 is a flowchart of a second video conference processing method according to an embodiment of the present invention.
  • FIG. 4 is a flowchart of a third video conference processing method according to an embodiment of the present invention.
  • FIG. 5 is a flowchart of a fourth video conference processing method according to an embodiment of the present invention.
  • FIG. 6 is a flowchart of a fifth video conference processing method according to an embodiment of the present invention.
  • FIG. 7 is a schematic diagram of an image unit according to an embodiment of the present invention.
  • FIG. 8 is a flowchart of a sixth video conference processing method according to an embodiment of the present invention.
  • FIG. 9 is a flowchart of a seventh video conference processing method according to an embodiment of the present invention.
  • FIG. 10 is a flowchart of a method for processing a video conference according to an embodiment of the present invention
  • FIG. 11 is a flowchart of a method for processing a video conference according to an embodiment of the present invention
  • FIG. 13 is a schematic structural diagram of a first video conference server according to an embodiment of the present disclosure
  • FIG. 14 is a schematic structural diagram of a first video terminal according to an embodiment of the present invention
  • FIG. 15 is a schematic structural diagram of a second video terminal according to an embodiment of the present disclosure
  • FIG. 17 is a schematic structural diagram of a third first video terminal according to an embodiment of the present invention.
  • the technical solutions in the embodiments of the present invention are clearly and completely described in the following with reference to the accompanying drawings in the embodiments of the present invention.
  • the embodiments are a part of the embodiments of the invention, and not all of the embodiments. All other embodiments obtained by a person of ordinary skill in the art based on the embodiments of the present invention without creative work are within the scope of the present invention.
  • FIG. 1 is a flowchart of a first video conference processing method according to an embodiment of the present invention.
  • the video conference processing method provided in this embodiment specifically includes:
  • Step S10 The video conference server receives the first video information sent by the first video terminal, and the second video information sent by the second video terminal, where the first video information includes the first
  • Step S20 The video conference server obtains a preset number of the personal video information from the received second video information and the first video information, and the preset number of the personal videos.
  • the information is synthesized to generate a composite video, so that the participants corresponding to the preset number of the personal video information are in a consistent conference background in the composite video;
  • Step S30 The video conference server sends the composite video to the third video terminal for display.
  • a video terminal is disposed in each venue, and a video collection device for collecting video of the conference site, a voice collection device for collecting the voice of the conference site, and a display device for displaying other conference video are also provided.
  • the video collection device may be a camera
  • the voice collection device may be a microphone
  • the display device may be a display or a television.
  • Video collection device, voice collection device and display device It can be integrated in the video terminal or it can be set separately.
  • a video collection device and a sound collection device can be set up for each participant in the venue, specifically for collecting real-time personal video and voice of the participant.
  • the video terminal packages the personal video information collected by each video collection device to form a video information and sends the video information to the video conference server, and simultaneously transmits the voice collected by the sound collection device to the video conference server. It is also possible to set up a video collection device and a sound collection device in the venue to uniformly collect video and voice of all participants.
  • the video terminal sends the video collected by the video collection device as video information to the video conference server, and simultaneously transmits the voice collected by the voice collection device to the video conference server, where the video information also includes each participant's Personal video information, the video conferencing server can separate each participant's personal video information.
  • the site background of each site can be set consistently. Therefore, the site background of the participants in each personal video information is the same. It is also possible to make the venue background of the participants displayed in the composite video consistent during the process of generating the composite video.
  • the video conference server generates a composite video for each video terminal according to the video information sent by each video terminal, so that the video terminal of each site displays the video images of other sites.
  • the video conference server generates a composite video for a video terminal as an example.
  • the video terminal is a third video terminal, and the other video terminals are the first video terminal and the second video terminal.
  • the video information received by the conference server from the first video terminal is the first video information
  • the video information received by the video conference server from the second video terminal is the second video information
  • the video information received by the video conference server from the third video terminal is the third video terminal.
  • the first, second, and third in this embodiment are only used for distinguishing, and are not used for order definition.
  • the video conference server obtains a preset amount of personal video information from the received second video information and the first video information in the process of generating a composite video for the third video terminal.
  • a preset number of personal video information may be obtained from the first video information and the second video information according to a preset rule.
  • the preset number may be specifically set according to actual display effect requirements and specification parameters of the display device, and the obtained personal video information may be all personal video information in the first video information, or may be all in the second video information.
  • the personal video information may also be part of the personal video information in the first video information, and part of the personal video information in the second video information.
  • the preset rule may be in multiple manners.
  • a preset number of personal videos may be determined from the video information sent by the video terminal that first accesses the video conference server.
  • Information in the second implementation, each site has a priority Determining a preset amount of personal video information from the video information sent by the video terminal corresponding to the site with the highest priority in the identification of the importance of the site; in the third implementation manner, each participant has a priority to identify the The importance level of the participant may be determined according to the priority of the preset number of video information.
  • the video terminal may also send the participant identifier to indicate the participant who is speaking to the video conference.
  • the server such that the video conferencing server can view the video information of the participant who is speaking as part of the composite video.
  • the preset rule can be set according to the actual meeting needs, and is not limited to this embodiment.
  • the video conference server synthesizes the preset number of personal video information into a composite video, and transmits the synthesized video to the third video terminal.
  • the third video terminal displays the composite video to the participant of the site where the third video terminal is located, or the third video terminal displays the composite video through a separate display device to implement the video conference process.
  • the video conference server receives the first video information sent by the first video terminal, and the second video information sent by the second video terminal, where the first video information includes the second video terminal Personal video information of each participant in the conference; a preset amount of personal video information is obtained from the received second video information and the first video information, and a preset number of personal video information is synthesized to generate a composite video, In the composite video, the preset corresponding personal video information is respectively in the same conference site background; the composite video is sent to the third video terminal for display.
  • the generation of the synthesized video is based on the personal video information of the participant, it avoids the defect of the display space due to the inequality of the site when the overall video of the site is used, and breaks through the limitation of the physical space.
  • the most efficient information display provides flexibility.
  • FIG. 2 is a schematic diagram of a site layout according to an embodiment of the present invention.
  • a video collection device 001 a background wall device 002, a large-screen display device 003, a sound location input device 004, and a participant seat may be disposed in the conference site.
  • the participant seat device 005 is used to provide seats for the participants.
  • the seat can be used with a fixed seat, such as a sofa, or a non-fixed seat, such as a swivel chair with wheels.
  • the number of seats is, for example, six as shown in FIG. One.
  • the participant seat device 005 can be arranged with a semi-circular table, and the seat of the participant seat device 005 is also arranged in the shape of a table arc.
  • the large screen display device 003 can be composed of multiple or one large size display groups
  • the built device should not be smaller than a fixed size to ensure that the image captured by the video capture device 001 is visually close to the true size of the person when the device is developed, and the large-screen display device 003 is set to an arc. shape.
  • the number of participants in the composite video displayed by the large-screen display device 003 is also six, it is possible to realize that the participant of the site and the participant displayed by the large-screen display device 003 are meeting at a round table at the same venue. a feeling of.
  • the video collection device 001 can cooperate with the sound localization input device 004, or an instruction input by other means, to capture an image of a designated area.
  • the sound localization input device 004 is composed of a plurality of or one sound pickup device and a sound localization device.
  • the sound localization device captures the sounding direction of the participant, generates a command to send to the video collecting device 001, and inputs the voice information of the sounding bit.
  • the wall device 002 of different venues has the same structural form, and is arranged behind the large-screen display device 003 and the attendant seating device 005 and must not be lower than a fixed size.
  • a plurality of texture forms may be arranged on the wall device 002, so that the video conference server generates a composite video for the personal video information synthesis process, and the composite video and the layout presented by the large-screen display device 003 are behind the large-screen display device 003 and the participant seats.
  • the wall device 002 behind the device 005 is joined/spliced to each other, so that the participants can perceive a sense of experience in communicating in the same space.
  • Video terminal 006 may be implemented by, but not limited to, using a visually audible multipoint controller, a physical button controller, or other form.
  • the video terminal 006 controls the communication requirements in each site, including but not limited to: display mode switching, personal video information switching, voice-activated switching, document presentation, and the like.
  • the process of initializing the video conference server may be: generating a composite video of the personal video information in the video information sent by the first accessed video terminal according to the setting rule, thereby simplifying the initial processing flow. Moreover, the speed of providing synthesized video to the video terminal can be increased, and the waiting time of the user can be shortened.
  • the setting rule is specifically used to indicate how to determine personal video information from the video information sent by the first accessed video terminal.
  • the number of personal video information in the video information sent by the first accessed video terminal is exactly equal to or smaller than the preset number, all the personal video information in the video information may be directly generated into a composite video;
  • the number of personal video information in the video information sent by the video terminal that is accessed first is greater than the preset number, and may be sequentially selected according to the order of the personal video information in the video information, or may be selected according to the preset of the user. Personal video information.
  • FIG. 3 is a flowchart of a second video conference processing method according to an embodiment of the present invention.
  • the video conference server sends the composite video to After the third video terminal performs the display, the method may further include:
  • Step S40 The video conference server receives a first participant replacement instruction sent by the first video terminal, the second video terminal, or the third video terminal, where the first participant replaces the instruction Carrying a first participant identifier and a second participant identifier, where the first participant replacement instruction is used to replace the first video content included in the composite video with the personal video information indicated by the second participant identifier
  • the participant identifies the personal video information indicated, the personal video information indicated by the first participant identifier is personal video information included in the composite video, and the personal video information indicated by the second participant identifier is included in the Personal video information of the first video information other than the synthesized video, or personal video information included in the second video information other than the synthesized video;
  • Step S50 The video conference server replaces the personal video information indicated by the first participant identifier in the composite video with the personal video information indicated by the second participant identifier according to the first participant replacement instruction.
  • Step S60 The video conference server sends the composite video that replaces the personal video information to the third video terminal for display.
  • each participant participating in the conference may be assigned an identifier in advance, and the processing related to the participant in the video conference may be implemented by the identifier.
  • the number of seats and the position of each site are fixed.
  • each seat can be assigned an identifier to distinguish different participants.
  • the video terminal of each site accesses the video conference server, and the video conference server may determine a preset amount of personal video information from the video information sent by the first accessed video terminal, generate a composite video, and send the message to the first Three video terminals.
  • the administrator or participant of the site where the third video terminal is located can switch the personal video information in the composite video as needed, for example, switching the personal video information of a certain participant in the composite video to the participant who wants to see. Personal video information.
  • the third video terminal can provide a visual human-computer interaction interface, and the human-computer interaction interface can implement the input of the switching indication information by using, but not limited to, a touch screen, a keyboard, or a gravity sensing, and the touch screen or the operation interface display can display the image interaction interface. .
  • the third video terminal displays the picture and number of each participant for the user, and the user can directly click the picture of the two participants to be switched or input the number of the two participants to implement the switching.
  • the third video terminal generates the first participant replacement instruction according to the input of the user, where the first participant replacement instruction carries the first participant And the second participant identifier, the first participant identifier is used to indicate the participant before the handover, and the second participant identifier is used to indicate the participant after the handover.
  • the video conference server may also synchronize information sent to the participant corresponding to the personal video information included in the third video terminal to other video terminals, such as the first video terminal and the second video terminal, and thus, the first video terminal or
  • the management personnel or the participants of the site where the second video terminal is located may perform the above-mentioned switching operations according to the needs of the conference, and details are not described herein again.
  • the seats in each venue are fixed and the number is the same.
  • Each venue has six seats, seat 1, seat 2, seat 3, seat 4, seat 5, and seat 6.
  • the third video terminal is set in the site 3. Initially, the third video terminal displays the video information of the six participants of the site 1. The user only clicks on the picture of the participant who wants to see, or enters the number of the participant. If the user inputs the participant of the seat 2, the first participant identifier carried in the first participant replacement command generated by the third video terminal is used to indicate the participant of the seat 1 of the venue 1 The participant identifies the participant who is used to indicate the seat 2 of the venue 2.
  • the above switching process is manually triggered by the user, and the video terminal provides a manual switching mode for the user.
  • the user selects the mode, the user needs to manually input the trigger switching process.
  • the number of the second participant identifiers may be one or more, that is, the personal video information of one participant in the composite video is replaced with the personal video information of one or more other participants, in order to ensure display. Effect, the number of second participant IDs should not be too much.
  • FIG. 4 is a flowchart of a third video conference processing method according to an embodiment of the present invention. As shown in FIG. 4, in this embodiment, after the video conference server sends the composite video to the third video terminal for display, the method may further include:
  • Step S41 The video conference server receives a second participant replacement instruction sent by the first video terminal, the second video terminal, or the third video terminal, where the second participant replacement instruction carries a third participant identifier, where the second participant replacement instruction is used to indicate that the personal video information included in the composite video is replaced by the personal video information indicated by the third participant identifier, where the third participant identifier indicates The personal video information is personal video information included in the first video information other than the synthesized video, or personal video information included in the second video information other than the synthesized video;
  • Step S51 The video conference server receives the combination according to the second participant replacement instruction. Selecting the target personal video information into the personal video information included in the video, and replacing the selected target personal video information with the personal video information indicated by the third participant identifier;
  • Step S61 The video conference server sends the composite video that replaces the personal video information to the third video terminal for display.
  • the handover process can also be triggered by voice control.
  • the video terminal can recognize the state in which the participant is speaking by using the voice uploaded by the voice collection device, or can also set a speaking button, and the participant touches the button when speaking, so that The video terminal can know which participant is speaking.
  • the video terminal generates a second participant replacement instruction that carries an identification that is useful to indicate the participant who is speaking.
  • the video conference server may determine the replaced participant according to the second preset rule.
  • the second preset rule may also be preset. For example, the corresponding participant in the synthesized video may be replaced according to the location of the participant who is speaking.
  • the third video terminal is set in the conference site 3. Initially, the third video terminal displays the personal video information of the six participants of the site 1. The participant of the seat 2 of the site 2 starts to speak, and the video terminal of the field 2 detects the video terminal. When the participant speaks, a participant replacement instruction is generated, and the participant replacement instruction carries an identifier of the participant who is used to indicate the seat 2 of the venue 2 .
  • the video conference server receives the participant replacement instruction, in the process of generating a composite video for other video terminals, the personal video information of the speaking participant is synthesized into the synthesized video to implement switching, for example, to be sent to A personal video information in the composite video of the third video terminal is replaced with the personal video information of the participant of the venue 2 seat 2.
  • the video terminal can also provide a voice switching mode for the user, and when the user selects the mode, the switching process is triggered by the sound.
  • the manual switching mode and the voice-activated switching module can coexist, and the voice switching can be switched to the main switching mode to ensure that the participant can see the speaker, and then manually switch to the auxiliary switching mode to ensure that the participant can see and want to see. Important attendees to.
  • the second participant replacement instruction further carries location information
  • the video conference server is configured from the personal video information included in the composite video according to the second participant replacement instruction. Selecting the target personal video information may specifically include: And the video conference server uses the personal video information corresponding to the location information included in the second participant replacement instruction included in the composite video as the target personal video information according to the second participant replacement instruction. .
  • the corresponding relationship between the identifier of the participant and the location information of the participant is stored in the video conference server, and the location information of the participant may be obtained from the foregoing relationship according to the third participant identifier, or may be sent in the video terminal.
  • the second participant replaces the instruction carrying the location information.
  • FIG. 5 is a flowchart of a fourth video conference processing method according to an embodiment of the present invention. As shown in FIG. 5, in this embodiment, after the video conference server sends the composite video to the third video terminal for display, the method may further include:
  • Step S42 The video conference server receives an add instruction sent by the first video terminal, the second video terminal, or the third video terminal, where the add instruction carries a fourth participant identifier,
  • the adding instruction is used to indicate that the personal video information indicated by the fourth participant identifier is added to the composite video, and the personal video information indicated by the fourth participant identifier is included in the synthesized video.
  • Step S52 The video server adds the personal video information indicated by the fourth participant identifier to the composite video according to the adding instruction.
  • Step S62 The video conference server sends the synthesized video after adding the personal video information to the third video terminal for display.
  • the administrator or the participant of the site where the video terminal is located can add the video of the participant to be seen in the synthesized video as needed, and the administrator or the participant can input the addition instruction information, so that the video terminal generates the addition. instruction.
  • the number of the fourth participant identifiers may be one or more. To ensure the display effect, the number of the fourth participant identifiers is not excessive.
  • FIG. 6 is a flowchart of a fifth video conference processing method according to an embodiment of the present invention.
  • the method further includes: Step S43: The video conference server receives a deletion instruction sent by the first video terminal, the second video terminal, or the third video terminal, where the deletion instruction carries a fifth participant identifier.
  • the deletion instruction is used to indicate that the personal video information indicated by the fifth participant identifier is deleted from the personal video information included in the composite video, and the personal video information indicated by the fifth participant identifier is included in the Synthesizing personal video information in the video;
  • Step S53 The video server deletes the personal video information indicated by the fifth participant identifier in the personal video information included in the composite video according to the deletion instruction.
  • Step S63 The video conference server sends the composite video after deleting the personal video information to the third video terminal for display.
  • the administrator or the participant of the site where the video terminal is located can delete the video of a certain participant in the composite video as needed, and the administrator or the participant can input the deletion instruction information, so that the video terminal generates the deletion instruction.
  • the number of the fifth participant identifiers may be one or more.
  • the position of the deleted video information in the composite video may be displayed by static face conversion.
  • the video conferencing server can generate composite video in a variety of ways to ensure that the size of each participant in the composite video is always available.
  • the participant's seat and video collection device are arranged in a corresponding physical location, and the video collection device may be based on, but not limited to, face capture, human body infrared feature capture and the like.
  • the captured participants are arranged in a preset size image unit, as shown in Fig. 7, the image unit has the size el.l X el.2, and the participant screen is arranged in el.lx el The el.7 axis position in a .2 size image unit.
  • the user is allowed to move in the el. l X el. 2 size image unit with el. 7 as the left and right small range of the axis position within the threshold.
  • the unit size of el.l X el.2 satisfies the dimensional specifications of el.3, el.5, el.6.
  • the image unit of el. lx el .3 is the smallest display unit, and the size of el.5 is defined to meet the natural hand movements of the user in the multi-point video conference, ensuring the behavior of the participants within a certain range. Can be successfully captured by the video capture device.
  • the size of el.6 is defined based on the fact that the participant's picture captured by the video collection device is in the sitting position. When the participant has the requirement of standing, the picture taken by the video collection device can ensure the standing status of the participant. The picture is fully captured, avoiding the situation where the head is out of range.
  • step S30 the video conference server will use the preset number of the The composite video information is synthesized to generate a composite video, which may specifically include:
  • the video conference server splices the corresponding image in the preset number of the personal video information to generate a composite image, where the corresponding image of the preset number of the personal video information is synchronized in time series; A plurality of the composite images are combined to generate a composite video.
  • a merged area of size el. lx el .4 may be set in the image unit shown in FIG. 7 for multiple image units of el. lx el .2 size
  • the overlap in the synthesis process is performed to merge the merged regions.
  • the merged area is merged, so that each personal video information is naturally connected, and the display effect of the video conference is improved.
  • the display mechanism based on the invention is compatible.
  • the size of the personal video in the video information sent by a video terminal is fl x f2, and the ratio of the vertical direction of the picture is adjusted to match the el. l, and the size of the left and right sides of the picture is e 1.1 X e 1.4 Merged with other personal video information.
  • the icon can be the above el. lx el .2 size specification, and the size is set to el. l el The combined area of .4.
  • the personal video information in the composite video is a horizontal row to achieve a display effect of the simulated conference site.
  • the personal video information in the composite video can also be displayed in multiple rows, which can realize the display effect of the simulated ladder venue.
  • step S30 the video conference server combines the preset number of the personal video information to generate a composite video, which may specifically include:
  • the video conference server arranges the image information extracted from the preset number of the personal video information in a preset background image to generate a composite image, where the first video information and the second video information are used.
  • the preset number of the image information included in the personal video information obtained is synchronized in time series; a plurality of the composite images are combined to generate a composite video.
  • the video conference server may extract the portrait of the participant in the personal video information from the current background image based on, but not limited to, prior art such as image capture, and merge the preset into the preset.
  • the background image to get the composite video.
  • the preset background image may be consistent with the image of the background wall to form a screen effect with a unified venue feeling.
  • the work of portrait capture can also be performed by the video terminal. Now, the video terminal can directly send the captured portrait to the video conference server.
  • FIG. 8 is a flowchart of a sixth video conference processing method according to an embodiment of the present invention.
  • the video conference processing method provided in this embodiment may be implemented in conjunction with the method applied to the video conference server according to any embodiment of the present invention.
  • the specific implementation process is not described herein.
  • the video conference processing method provided in this embodiment specifically includes:
  • Step C10 The third video terminal receives the synthesized video sent by the video conference server, where the synthesized video is received by the video conference server from the first video terminal, and received from the second video terminal. Obtaining a preset number of personal video information in the second video information, and synthesizing the preset number of the personal video information, wherein in the composite video, the preset number of the personal video information respectively The corresponding participant is in the same background of the conference site, the first video information includes personal video information of each participant in the conference site where the first video terminal is located, and the second video information includes the second video. Personal video information of each participant in the venue where the terminal is located;
  • Step C20 The third video terminal displays the synthesized video.
  • the third video terminal receives the synthesized video sent by the video conference server, where the synthesized video receives the first video information from the first video terminal through the video conference server, and the second video terminal A total amount of personal video information is obtained in the received second video information, and a preset number of personal video information is synthesized, and in the synthesized video, the preset number of personal video information respectively correspond to the participants.
  • the synthesized video receives the first video information from the first video terminal through the video conference server
  • the second video terminal A total amount of personal video information is obtained in the received second video information, and a preset number of personal video information is synthesized, and in the synthesized video, the preset number of personal video information respectively correspond to the participants.
  • the composite video is displayed. Since the generation of the synthesized video is based on the video information of the participant, it avoids the defect of the display effect due to the inequality of the site when the overall video of the site is used, and breaks through the limitation of the physical space.
  • the information display of efficiency provides flexibility.
  • FIG. 9 is a flowchart of a seventh video conference processing method according to an embodiment of the present invention. As shown in FIG. 9, in this embodiment, in step C10, the method may further include:
  • Step C30 The third video terminal receives video information sent by at least one video collection device, and packages the received video information to form third video information, where each video in the at least one video collection device
  • the collection device is configured to collect video information of at least one participant in the conference site where the third video terminal is located, where the third video information includes the conference site where the third video terminal is located. Personal video information for each participant.
  • Step C31 The third video terminal sends the third video information to the video conference server, so that the video conference server generates a composite video according to the third video information and the first video information. Generating, by the second video terminal, a composite video according to the third video information and the second video information, to the first video terminal.
  • the method may further include:
  • Step C40 The third video terminal generates a first participant replacement instruction according to the received handover indication information input by the user, where the first participant replacement instruction carries the first participant identifier and the second participant.
  • the first participant replacement instruction is used to indicate that the personal video information indicated by the second participant identifier is used to replace the personal video information indicated by the first participant identifier included in the composite video, where
  • the personal video information indicated by the participant identifier is personal video information included in the composite video, and the personal video information indicated by the second participant identifier is included in the first video except the composite video.
  • Step C50 The third video terminal sends the first participant replacement instruction to the video conference server, so that the video conference server uses the second participant identifier indication according to the first participant replacement instruction.
  • Personal video information replaces personal video information indicated by the first participant identifier in the composite video;
  • Step C60 The third video terminal receives the composite video that is sent by the video conference server according to the first participant replacement command and replaces the personal video information, and displays the composite video.
  • FIG. 10 is a flowchart of an eighth video conference processing method according to an embodiment of the present invention. As shown in FIG. 10, in this embodiment, after the third video terminal displays the composite video, the method further includes:
  • Step C41 When the third video terminal detects that the voice collection device in the site where the third video terminal is located has a voice input within a preset time range, determine a third party to indicate the participant who is speaking. Participant identification;
  • Step C51 The third video terminal generates a second participant replacement instruction that carries the third participant identifier, where the second participant replacement instruction is used to indicate that the third participant identifier is used to indicate Personal video information replacement is sent to the first video terminal or the second video terminal
  • the composite video contains personal video information
  • Step C61 The third video terminal sends the second participant replacement instruction to the video conference server, so that the video conference server sends the first participant replacement command from the first participant to the first
  • the target personal video information is selected from the personal video information included in the synthesized video of the video terminal or the second video terminal, and the selected target personal video information is replaced with the personal video information indicated by the third participant identifier.
  • the third video terminal captures a sound whose sound intensity is greater than a preset threshold for a period of time, it may be considered that the presence of the participant is speaking, to avoid frequent triggering of the switching process by the sudden sound.
  • FIG. 11 is a flowchart of a ninth video conference processing method according to an embodiment of the present invention. As shown in FIG. 11, in this embodiment, after the third video terminal displays the composite video, the method further includes:
  • Step C42 The third video terminal generates an add instruction according to the received indication information of the received user input, where the add instruction carries a fourth participant identifier, where the add instruction is used to indicate the synthesized video. Adding personal video information indicated by the fourth participant identifier, where the personal video information indicated by the fourth participant identifier is personal video information included in the first video information except the synthesized video, Or personal video information included in the second video information other than the synthesized video;
  • Step C52 The third video terminal sends the add instruction to the video conference server, so that the video server adds the personal video indicated by the fourth participant identifier to the composite video according to the adding instruction.
  • Step C62 The third video terminal receives the composite video that is sent by the video conference server according to the adding instruction and adds the personal video information, and displays the video.
  • FIG. 12 is a flowchart of a tenth video conference processing method according to an embodiment of the present invention. As shown in FIG. 12, in this embodiment, after the third video terminal displays the composite video, the method further includes:
  • Step C43 The third video terminal generates a delete instruction according to the received deletion indication information input by the user, where the deletion instruction carries a fifth participant identifier, where the deletion instruction is used to indicate the synthesized video.
  • the personal video information indicated by the fifth participant identifier is deleted from the included personal video information, and the personal video information indicated by the fifth participant identifier is included in the Describe the personal video information in the composite video;
  • Step C53 The third video terminal sends the deletion instruction information to the video conference server to delete the personal video information indicated by the fifth participant identifier.
  • Step C63 The third video terminal receives the composite video that is sent by the video conference server according to the deletion instruction and deletes the personal video information, and displays the composite video.
  • FIG. 13 is a schematic structural diagram of a first video conference server according to an embodiment of the present invention.
  • the video conference server provided in this embodiment may implement various steps of the video conference processing method applied to the video conference server according to any embodiment of the present invention. The specific implementation process is not described herein.
  • the receiving unit 1 1 is configured to receive the first video information that is sent by the first video terminal, and the second video information that is sent by the second video terminal, where the first video information includes the site where the first video terminal is located.
  • Personal video information of each participant, the second video information including the first processing unit 12, connected to the receiving unit 11 for receiving the second video information, and the first A preset number of the personal video information is obtained in the video information, and the preset number of the personal video information is combined to generate a composite video, so that in the composite video, the preset number of the The participants corresponding to the personal video information are in the same background of the conference site;
  • the sending unit 13 is connected to the processing unit 12 and configured to send the synthesized video to the third video terminal for display.
  • the video conference server provided in this embodiment is based on the video information of the participant, and avoids the defect that the display effect is poor due to the inequality of the site when the overall video of the site is used. Breaking through the limitations of physical space provides flexibility for the most efficient display of information.
  • the receiving unit 11 is further configured to receive a first participant replacement command sent by the first video terminal, the second video terminal, and the third video terminal, where
  • the first participant replacement instruction carries a first participant identifier and a second participant identifier, where the first participant replacement instruction is used to indicate that the personal video information indicated by the second participant identifier is used to replace the
  • the personal video information indicated by the first participant identifier included in the composite video, the personal video information indicated by the first participant identifier is personal video information included in the composite video, and the second participant identifier
  • the indicated personal video information is personal video information included in the first video information other than the synthesized video, or personal video information included in the second video information other than the synthesized video .
  • the processing unit 12 is further configured to replace the personal video information indicated by the first participant identifier in the composite video with the personal video information indicated by the second participant identifier according to the first participant replacement instruction.
  • the sending unit 13 is further configured to send the composite video after replacing the personal video information to the third video terminal for display.
  • the receiving unit 11 is further configured to receive a first participant replacement instruction sent by the first video terminal, the second video terminal, or the third video terminal, where
  • the second participant replacement instruction carries a third participant identifier, where the second participant replacement instruction is used to indicate that the personal video information included in the composite video is replaced by the personal video information indicated by the third participant identifier.
  • the personal video information indicated by the third participant identifier is personal video information included in the first video information other than the synthesized video, or included in the second except the composite video Personal video information for video information.
  • the processing unit 12 is further configured to select target personal video information from the personal video information included in the composite video according to the second participant replacement instruction, and replace the selected target personal video information with the third The participant identifies the personal video information indicated.
  • the sending unit 13 is further configured to send the composite video after replacing the personal video information to the third video terminal for display.
  • the second participant replacement instruction further carries location information.
  • the processing unit 12 is further configured to use the personal video information corresponding to the location information included in the second participant replacement instruction included in the composite video as the target according to the second participant replacement instruction. Personal video information.
  • the receiving unit 11 is further configured to receive an adding instruction sent by the first video terminal, the second video terminal, or the third video terminal, where the adding instruction carries a fourth participant identifier, where the adding instruction is used to indicate that the personal video information indicated by the fourth participant identifier is added to the composite video, and the personal video information indicated by the fourth participant identifier is included in the Personal video information of the first video information other than the synthesized video, or personal video information included in the second video information other than the synthesized video.
  • the processing unit 12 is further configured to add the fourth participant to the synthesized video according to the adding instruction. The person identifies the indicated personal video information.
  • the sending unit 13 is further configured to send the synthesized video after adding the personal video information to the third video terminal for display.
  • the receiving unit 11 is further configured to receive a deletion instruction sent by the first video terminal, the second video terminal, or the third video terminal, where the deletion instruction carries a fifth participant identifier, where the deletion instruction is used to indicate that the personal video information indicated by the fifth participant identifier is deleted from the personal video information included in the composite video, and the fifth participant identifier indicates the personal video
  • the information is personal video information included in the composite video.
  • the personal video information indicated by the fifth participant identifier is deleted from the information.
  • the sending unit 13 is further configured to send the composite video after deleting the personal video information to the third video terminal for display.
  • the processing unit 12 is further configured to splicing a corresponding image of the preset number of the personal video information to generate a composite image, where the preset number of the personal video information is The corresponding images in are synchronized in time series; a plurality of the composite images are combined to generate a composite video.
  • the processing unit 12 is further configured to: arrange the image information included in the preset number of the personal video information in a preset background image to generate a composite image, where the first video information is used. And the preset quantity of the image information included in the personal video information obtained in the second video information is synchronized in time series; combining the plurality of the composite images to generate a composite video.
  • FIG. 14 is a schematic structural diagram of a first type of third video terminal according to an embodiment of the present invention.
  • the third video terminal 600 provided in this embodiment may implement various steps of the video conference processing method for the third video terminal provided by any embodiment of the present invention, and the specific implementation process is not described herein.
  • the third video terminal 600 provided in this embodiment includes:
  • the receiving unit 21 is configured to receive the synthesized video sent by the video conference server, where the composite video receives the first video information received by the video conference server from the first video terminal, and receives the first video information from the second video terminal.
  • the participants are all in the same conference site background;
  • the first video information includes each participant in the conference site where the first video terminal is located.
  • the second video information includes personal video information of each participant in the conference site where the second video terminal is located;
  • the display unit 22 is connected to the receiving unit 21 for displaying the composite video.
  • the third video terminal 600 provided in this embodiment is based on the video information of the participant, and avoids the display effect due to the inequality of the site when the overall video of the site is used. Defects, breaking through the limitations of physical space, provide flexibility for the most efficient information display.
  • FIG. 15 is a schematic structural diagram of a second third video terminal 600 according to an embodiment of the present invention.
  • the receiving unit 21 is further configured to receive video information sent by at least one video buffer device.
  • the third video terminal 600 may further include:
  • the first processing unit 211 is connected to the receiving unit 21, and packages the received video information to form third video information, where each video collecting device in the at least one video collecting device is used to collect the first The video information of the at least one participant in the conference site where the third video terminal 600 is located, and the third video information includes personal video information of each participant in the conference site where the third video terminal 600 is located.
  • the first sending unit 212 is connected to the first processing unit 211, and configured to send the third video information to the video conference server, so that the video conference server according to the third video information and the The first video information generation composite video is sent to the second video terminal, or the composite video is generated according to the third video information and the second video information, and sent to the first video terminal.
  • the third video terminal 600 may further include:
  • the second processing unit 221 is configured to generate, according to the received handover indication information of the user input, a first participant replacement instruction, where the first participant replacement instruction carries the first participant identifier and the second participant identifier.
  • the first participant replacement instruction is used to indicate that the personal video information indicated by the first participant identifier included in the composite video is replaced by the personal video information indicated by the second participant identifier, where the first The personal video information indicated by the participant identifier is personal video information included in the composite video, and the personal video information indicated by the second participant identifier is included in the first video other than the synthesized video.
  • a second sending unit 222 connected to the second processing unit 221, for sending to the video conference Sending, by the server, the first participant replacement instruction, so that the video conference server replaces the first part of the composite video with personal video information indicated by the second participant identifier according to the first participant replacement instruction a participant identifies the personal video information indicated;
  • the receiving unit 21 is further configured to receive the composite video that is sent by the video conference server according to the first participant replacement instruction and replace the personal video information, and display the video through the display unit 22.
  • the third video terminal 600 may further include:
  • the third processing unit 231 is configured to: when it is detected that the voice collection device in the site where the third video terminal 600 is located has a voice input within a preset time range, determine a third to indicate the participant who is speaking And generating, by the second participant, a second participant replacement command, where the second participant replacement instruction is used to indicate that the personal video information indicated by the third participant identifier is used instead of sending Personal video information included in the composite video of the first video terminal or the second video terminal;
  • the third sending unit 232 is connected to the third processing unit 231, and configured to send the second participant replacement instruction to the video conference server to enable the video conference server to replace the second participant according to the instruction Selecting target personal video information from the personal video information included in the composite video sent to the first video terminal or the second video terminal, and replacing the selected target personal video information with the third participant identifier Indicated personal video information.
  • the third video terminal 600 may further include:
  • the fourth processing unit 241 is configured to generate an add instruction according to the received indication information of the received user input, where the add instruction carries a fourth participant identifier, where the add instruction is used to indicate in the synthesized video. Adding personal video information indicated by the fourth participant identifier, where the personal video information indicated by the fourth participant identifier is personal video information included in the first video information except the synthesized video, or Personal video information included in the second video information other than the synthesized video;
  • the fourth sending unit 242 is connected to the fourth processing unit 241, and configured to send the adding instruction to the video conference server, to enable the video server to add the foregoing in the composite video according to the adding instruction.
  • the fourth participant identifies the personal video information indicated;
  • the receiving unit 21 is further configured to receive the synthesized video that is sent by the video conference server according to the adding instruction and add the personal video information, and display the video through the display unit 22.
  • the third video terminal 600 may further include:
  • the fifth processing unit 251 is configured to generate a delete instruction according to the received deletion indication information input by the user, where the deletion instruction carries a fifth participant identifier, where the deletion instruction is used to indicate from the composite video Deleting the personal video information indicated by the fifth participant identifier in the included personal video information, where the personal video information indicated by the fifth participant identifier is personal video information included in the composite video;
  • the fifth sending unit 252 is connected to the fifth processing unit 251, and configured to send the deletion instruction to the video conference server, so that the video server is included in the composite video according to the deletion instruction. Deleting the personal video information indicated by the fifth participant identifier in the video information;
  • the receiving unit 21 is further configured to receive the composite video that is sent by the video conference server according to the deletion instruction and delete the personal video information, and display the video through the display unit 22.
  • FIG. 16 is a schematic structural diagram of a second video conference server according to an embodiment of the present invention.
  • the video conference server 700 provided in this embodiment may implement various steps of the video conference processing method applied to the video conference server according to any embodiment of the present invention, and the specific implementation process is not described herein.
  • the video conference server 700 provided in this embodiment includes: a processor 710, a communication interface 720, a memory 730, and a bus 740, wherein the processor 710, the communication interface 720, and the memory 730 are interconnected by the bus 740.
  • the communication interface 720 is configured to receive first video information that is sent by the first video terminal, and second video information that is sent by the second video terminal, where the second video information includes the second video Personal video information of each participant in the venue where the terminal is located.
  • the memory 730 is used to store instructions or data.
  • the processor 710 calls an instruction stored in the memory 730 to obtain a preset number of the personal video information from the received second video information and the first video information, and The preset number of the personal video information is combined to generate a composite video, so that the participants corresponding to the preset number of the personal video information are all in a consistent venue background in the composite video.
  • the communication interface 720 is further configured to send the composite video to the third video terminal for display.
  • the communication interface 720 is further configured to receive the first video terminal, a first participant replacement instruction sent by the second video terminal and the third video terminal, where the first participant replacement instruction carries a first participant identifier and a second participant identifier, where the first participant attends
  • the replacement instruction is used to indicate that the personal video information indicated by the first participant identifier included in the composite video is replaced by the personal video information indicated by the second participant identifier, where the first participant identifies the indicated personal
  • the video information is personal video information included in the composite video
  • the personal video information indicated by the second participant identifier is personal video information included in the first video information except the synthesized video. Or personal video information included in the second video information other than the synthesized video.
  • the processor 710 is further configured to invoke the instruction and data of the memory 730 to implement, replacing the composite video with the personal video information indicated by the second participant identifier according to the first participant replacement instruction.
  • the first participant identifies the indicated personal video information.
  • the communication interface 720 is further configured to send the composite video after replacing the personal video information to the third video terminal for display.
  • the communication interface 720 is further configured to receive a first participant replacement instruction sent by the first video terminal, the second video terminal, or the third video terminal, where
  • the second participant replacement instruction carries a third participant identifier, where the second participant replacement instruction is used to indicate that the personal video information included in the composite video is replaced by the personal video information indicated by the third participant identifier.
  • the personal video information indicated by the third participant identifier is personal video information included in the first video information other than the synthesized video, or included in the second except the composite video Personal video information for video information.
  • the processor 710 is further configured to invoke the instruction and data of the memory 730 to implement, according to the second participant replacement instruction, select target personal video information from the personal video information included in the composite video, and select the selected The target personal video information is replaced with personal video information indicated by the third participant identification.
  • the communication interface 720 is further configured to send the composite video after replacing the personal video information to the third video terminal for display.
  • the second participant replacement instruction further carries location information.
  • the processor 710 is further configured to invoke the instruction and data of the memory 730 to implement, according to the second personal video information corresponding to the location information, as the target personal video information.
  • the communication interface 720 is further configured to receive an add instruction sent by the first video terminal, the second video terminal, or the third video terminal, where the adding instruction And carrying the fourth participant identifier, where the adding instruction is used to indicate that the personal video information indicated by the fourth participant identifier is added to the composite video, where the personal video information indicated by the fourth participant identifier is included Personal video information of the first video information other than the synthesized video, or personal video information included in the second video information other than the synthesized video.
  • the processor 710 is further configured to invoke the instruction and data of the memory 730 to implement, adding, according to the adding instruction, personal video information indicated by the fourth participant identifier in the composite video.
  • the communication interface 720 is further configured to send the synthesized video after adding the personal video information to the third video terminal for display.
  • the communication interface 720 is further configured to receive a deletion instruction sent by the first video terminal, the second video terminal, or the third video terminal, where the deletion instruction carries a fifth participant identifier, where the deletion instruction is used to indicate that the personal video information indicated by the fifth participant identifier is deleted from the personal video information included in the composite video, and the fifth participant identifier indicates the personal video
  • the information is personal video information included in the composite video.
  • the processor 710 is further configured to invoke the instruction and data of the memory 730 to delete, according to the deletion instruction, the personal video information indicated by the fifth participant identifier in the personal video information included in the composite video. .
  • the communication interface 720 is further configured to send the composite video after deleting the personal video information to the third video terminal for display.
  • the processor 710 is further configured to splicing a corresponding image of the preset number of the personal video information to generate a composite image, where the preset number of the personal video information
  • the corresponding images in are synchronized in time series; a plurality of the composite images are combined to generate a composite video.
  • the processor 710 is further configured to: arrange the image information included in the preset number of the personal video information in a preset background image to generate a composite image, where the first video information is obtained from the first video information. And the preset quantity of the image information included in the personal video information obtained in the second video information is synchronized in time series; combining the plurality of the composite images to generate a composite video.
  • FIG. 17 is a schematic structural diagram of a third first video terminal according to an embodiment of the present invention. As shown in FIG. 17, the first video terminal 800 provided in this embodiment may specifically implement any embodiment of the present invention.
  • the first video terminal 800 provided in this embodiment includes: a processor 810, a communication interface 820, and a The memory 830 and the bus 840, wherein the processor 810, the communication interface 820, and the memory 830 are interconnected by the bus 840.
  • the communication interface 820 is configured to receive a composite video sent by a video conference server, where the composite video is received by the video conference server from the first video terminal, and received from the second video terminal.
  • the memory 830 is used to store instructions or data.
  • the processor 810 invokes instructions stored in the memory 830 to effect display of the composite video through the display.
  • the communication interface 820 is further configured to receive video information sent by at least one video collection device.
  • the processor 810 is further configured to package the received video information to form a third video information, where each of the at least one video collection device is configured to collect the conference site where the third video terminal is located.
  • Video information of at least one participant, the third video information interface 820 is further configured to send the third video information to the video conference server, so that the video conference server is configured according to the third video information and Sending, by the first video information, a composite video to the second video terminal, or generating a composite video according to the third video information and the second video information, and sending the composite video to the first video terminal.
  • the processor 810 is further configured to invoke the instruction and the data of the memory 830 to generate a first participant replacement instruction according to the received switching indication information input by the user, where the first The participant replacement instruction carries a first participant identifier and a second participant identifier, where the first participant replacement instruction is used to indicate that the personal video information indicated by the second participant identifier is used to replace the content included in the composite video.
  • the first participant identifies the indicated personal video information
  • the personal video information indicated by the first participant identifier is personal video information included in the composite video
  • the second participant identifies the indicated personal video.
  • the information is personal video information included in the first video information other than the synthesized video, or personal video information included in the second video information other than the synthesized video.
  • the communication interface 820 is further configured to send the first participant replacement instruction to the video conference server to enable the video conference service Replacing the personal video information indicated by the first participant identifier in the composite video with the personal video information indicated by the second participant identifier according to the first participant replacement instruction; receiving the video conference server according to the The first participant replaces the synthesized video sent by the instruction replacing the personal video information and is displayed by the display.
  • the processor 810 is further configured to: when it is detected that the voice collection device in the site where the third video terminal is located has a voice input within a preset time range, determine to indicate that the voice is being spoken. a third participant identifier of the participant, generating a second participant replacement instruction carrying the third participant identifier, where the second participant replacement instruction is used to indicate the indication indicated by the third participant identifier
  • the personal video information replaces personal video information contained in the composite video transmitted to the first video terminal or the second video terminal.
  • the communication interface 820 is further configured to send the second participant replacement instruction to the video conference server, so that the video conference server sends the first video from the first video according to the second participant replacement instruction.
  • the target personal video information is selected from the personal video information included in the synthesized video of the terminal or the second video terminal, and the selected target personal video information is replaced with the personal video information indicated by the third participant identifier.
  • the processor 810 is further configured to invoke the instruction and the data of the memory 830 to generate an add instruction according to the received indication information of the received user input, where the add command carries the first a fourth participant identifier, where the adding instruction is used to indicate that the personal video information indicated by the fourth participant identifier is added to the composite video, and the personal video information indicated by the fourth participant identifier is included in the Personal video information of the first video information other than in the synthesized video, or personal video information included in the second video information other than the synthesized video.
  • the communication interface 820 is further configured to send the adding instruction to the video conference server to enable the video server to add the personal video information indicated by the fourth participant identifier to the composite video according to the adding instruction. Receiving, by the video conference server, the synthesized video added by adding the personal video information according to the adding instruction, and displaying the same through the display.
  • the processor 810 is further configured to invoke the instruction and the data of the memory 830 to implement, according to the received deletion instruction information input by the user, to generate a deletion instruction, where the deletion instruction carries a a fifth participant identifier, where the deletion instruction is used to indicate that the personal video information indicated by the fifth participant identifier is deleted from the personal video information included in the composite video, and the fifth participant identifier indicates the personal video information Is personal video information included in the composite video.
  • the communication interface 820 is further configured to send the deletion to the video conference server. The command is used to enable the video server to delete the personal video information indicated by the fifth participant identifier in the personal video information included in the composite video according to the deletion instruction; and receive the video conference server according to the deletion instruction.
  • the synthesized video sent after deleting the personal video information is displayed by the display.

Abstract

本发明实施例提供一种视频会议处理方法及设备,该视频会议处理方法,包括:视频会议服务器接收第一视频终端发送的第一视频信息,以及第二视频终端发送的第二视频信息;视频会议服务器从接收到的第二视频信息、及第一视频信息中共获得预设数量的个人视频信息,并将预设数量的个人视频信息合成以生成合成视频;视频会议服务器将合成视频发送给第三视频终端进行显示。本发明实施例提供的视频会议处理方法及设备,避免了以会场的整体视频为基础时,由于会场的不均等性造成的显示效果不佳的缺陷,为最大效率的信息显示提供了灵活性。

Description

视频会议处理方法及设备 技术领域 本发明实施例涉及通信技术, 尤其涉及一种视频会议处理方法及设备。 背景技术
随着网络技术的飞速发展, 视频会议已经成为当前发展最快的多媒体通 信方式。 传统的商务、 行政会议已经向视频会议转换。 视频会议系统将两个 或两个以上不同地方的个人或群体, 通过现有的各种电气通讯传输媒体, 将 人物的静、 动态图像、 语音、 文字、 图片等多种资料分送到各个用户的计算 机上, 使得在地理上分散的用户可以共聚一处, 通过图像、 声音等多种方式 交流信息, 在效果上可以代替现场举行的会议。
现有技术中, 通常在每个会场设置摄像头和显示装置, 摄像头可以釆集 本地会场的图像, 显示装置同时显示本地会场的图像和其他会场的图像。 由 于显示的图像均以会场为单位, 每个会场图像的显示空间有限, 但是每个会 场的参会人员的数量又不确定, 艮难保证对于观看者而言, 清楚地看到发言 者或者想要看到的参会人员。 而且, 当会场的数量比较多时, 在同一显示装 置上显示所有会场的图像, 效果也不佳。 发明内容
本发明实施例提供一种视频会议处理方法及设备, 以避免以会场的整体 视频为基础时, 由于会场的不均等性造成的显示效果不佳的缺陷, 为最大效 率的信息显示提供灵活性。
第一方面, 本发明实施例提供一种视频会议处理方法, 包括:
视频会议服务器接收第一视频终端发送的第一视频信息, 以及第二视频 终端发送的第二视频信息, 其中, 所述第一视频信息包括所述第一视频终端 所在的会场中每个与会者的个人视频信息, 所述第二视频信息包括所述第二 所述视频会议服务器从接收到的所述第二视频信息、 及所述第一视频信 息中共获得预设数量的所述个人视频信息, 并将所述预设数量的所述个人视 频信息合成以生成合成视频, 以使在所述合成视频中, 所述预设数量的所述 个人视频信息分别对应的与会者均处于一致的会场背景中;
所述视频会议服务器将所述合成视频发送给所述第三视频终端进行显 示。
在第一种可能的实现方式中, 所述视频会议服务器将所述合成视频发送 给所述第三视频终端进行显示之后, 所述方法还包括:
所述视频会议服务器接收所述第一视频终端、 所述第二视频终端、 或所 述第三视频终端发送的第一与会者替换指令, 其中, 所述第一与会者替换指 令中携带有第一与会者标识和第二与会者标识, 所述第一与会者替换指令用 于指示用所述第二与会者标识指示的个人视频信息替换所述合成视频中包含 的所述第一与会者标识指示的个人视频信息, 所述第一与会者标识指示的个 人视频信息是包含于所述合成视频中的个人视频信息, 所述第二与会者标识 指示的个人视频信息是包含于除所述合成视频中之外的所述第一视频信息的 个人视频信息、 或包含于除所述合成视频中之外的所述第二视频信息的个人 视频信息;
所述视频会议服务器根据所述第一与会者替换指令用所述第二与会者标 识指示的个人视频信息替换所述合成视频中所述第一与会者标识指示的个人 视频信息;
所述视频会议服务器将替换所述个人视频信息后的所述合成视频发送给 所述第三视频终端进行显示。
在第二种可能的实现方式中, 所述视频会议服务器将所述合成视频发送 给所述第三视频终端进行显示之后, 所述方法还包括:
所述视频会议服务器接收所述第一视频终端、 所述第二视频终端、 或第 三视频终端发送的第二与会者替换指令, 其中, 所述第二与会者替换指令中 携带有第三与会者标识, 所述第二与会者替换指令用于指示用所述第三与会 者标识指示的个人视频信息替换所述合成视频包含的个人视频信息, 所述第 三与会者标识指示的个人视频信息是包含于除所述合成视频中之外的所述第 一视频信息的个人视频信息、 或包含于除所述合成视频中之外的所述第二视 频信息的个人视频信息; 所述视频会议服务器根据所述第二与会者替换指令从所述合成视频所包 含的个人视频信息中选择目标个人视频信息, 将选择的所述目标个人视频信 息替换为所述第三与会者标识指示的个人视频信息;
所述视频会议服务器将替换所述个人视频信息后的所述合成视频发送给 所述第三视频终端进行显示。
结合第一方面的第二种实现方式, 在第三种可能的实现方式中, 所述第 二与会者替换指令中还携带有位置信息, 所述视频会议服务器根据所述第二 与会者替换指令从所述合成视频所包含的个人视频信息中选择目标个人视频 信息包括:
所述视频会议服务器根据所述第二与会者替换指令将所述合成视频中所 包含的与所述第二与会者替换指令中所包含位置信息对应的个人视频信息作 为所述的目标个人视频信息。
在第四种可能的实现方式中, 所述视频会议服务器将所述合成视频发送 给所述第三视频终端进行显示之后, 所述方法还包括:
所述视频会议服务器接收所述第一视频终端、 所述第二视频终端、 或所 述第三视频终端发送的添加指令, 其中, 所述添加指令中携带有第四与会者 标识, 所述添加指令用于指示在所述合成视频中添加所述第四与会者标识指 示的个人视频信息, 所述第四与会者标识指示的个人视频信息是包含于除所 述合成视频中之外的所述第一视频信息的个人视频信息、 或包含于除所述合 成视频中之外的所述第二视频信息的个人视频信息;
所述视频服务器根据所述添加指令在所述合成视频中添加所述第四与会 者标识指示的个人视频信息;
所述视频会议服务器将添加所述个人视频信息后的所述合成视频发送给 所述第三视频终端进行显示。
在第五种可能的实现方式中, 所述视频会议服务器将所述合成视频发送 给所述第三视频终端进行显示之后, 所述方法还包括:
所述视频会议服务器接收所述第一视频终端、 所述第二视频终端、 或所 述第三视频终端发送的删除指令, 其中, 所述删除指令中携带有第五与会者 标识, 所述删除指令用于指示从所述合成视频所包含的个人视频信息中删除 所述第五与会者标识指示的个人视频信息, 所述第五与会者标识指示的个人 视频信息是包含于所述合成视频中的个人视频信息; 息中删除所述第五与会者标识指示的个人视频信息;
所述视频会议服务器将删除所述个人视频信息后的所述合成视频发送给 所述第三视频终端进行显示。
结合第一方面或第一方面的第一至第五任一种可能的实现方式, 在第六 种可能的实现方式中, 所述视频会议服务器将所述预设数量的所述个人视频 信息合成以生成合成视频包括:
所述视频会议服务器对所述预设数量的所述个人视频信息中的对应图像 进行拼接, 生成合成图像, 其中, 所述预设数量的所述个人视频信息中的对 应图像在时序上同步; 组合多幅所述合成图像以生成合成视频。
结合第一方面或第一方面的第一至第五任一种可能的实现方式, 在第七 种可能的实现方式中, 所述视频会议服务器将所述预设数量的所述个人视频 信息合成以生成合成视频包括:
所述视频会议服务器将从所述预设数量的所述个人视频信息提取出的图 像信息排列在预设背景图像中生成合成图像, 其中, 从所述第一视频信息、 所述第二视频信息中获得的所述预设数量的所述个人视频信息包含的图像信 息在时序上同步; 组合多幅所述合成图像以生成合成视频。
第二方面, 本发明实施例提供一种视频会议处理方法, 包括:
第三视频终端接收视频会议服务器发送的合成视频, 其中, 所述合成视 频通过所述视频会议服务器从第一视频终端接收到的第一视频信息、 及从第 二视频终端接收到的第二视频信息中共获得预设数量的个人视频信息, 并将 所述预设数量的所述个人视频信息合成而得到, 在所述合成视频中, 所述预 设数量的所述个人视频信息分别对应的与会者均处于一致的会场背景中, 所 述第一视频信息包括所述第一视频终端所在的会场中每个与会者的个人视频 信息, 所述第二视频信息包括所述第二视频终端所在的会场中每个与会者的 个人视频信息;
所述第三视频终端将所述合成视频进行显示。
在第一种可能的实现方式中, 所述方法还包括:
所述第三视频终端接收至少一个视频釆集装置发送的视频信息, 将接收 到的视频信息打包形成第三视频信息, 其中, 所述至少一个视频釆集装置中 的每个视频釆集装置用以釆集第三视频终端所在的会场中至少一个与会者的 视频信息, 所述第三视频信息包括所述第三视频终端所在的会场中每个与会 者的个人视频信息;
所述第三视频终端将所述第三视频信息发送给所述视频会议服务器, 以 使所述视频会议服务器根据所述第三视频信息和所述第一视频信息生成合成 视频发送给所述第二视频终端, 或根据所述第三视频信息和所述第二视频信 息生成合成视频发送给所述第一视频终端。
在第二种可能的实现方式中, 所述第三视频终端将所述合成视频进行显 示之后, 所述方法还包括:
所述第三视频终端根据接收到的用户输入的切换指示信息生成第一与会 者替换指令, 其中, 所述第一与会者替换指令中携带有第一与会者标识和第 二与会者标识, 所述第一与会者替换指令用于指示用所述第二与会者标识指 示的个人视频信息替换所述合成视频中包含的所述第一与会者标识指示的个 人视频信息, 所述第一与会者标识指示的个人视频信息是包含于所述合成视 频中的个人视频信息, 所述第二与会者标识指示的个人视频信息是包含于除 所述合成视频中之外的所述第一视频信息的个人视频信息、 或包含于除所述 合成视频中之外的所述第二视频信息的个人视频信息;
所述第三视频终端向所述视频会议服务器发送所述第一与会者替换指令 用以使所述视频会议服务器根据所述第一与会者替换指令用所述第二与会者 标识指示的个人视频信息替换所述合成视频中所述第一与会者标识指示的个 人视频信息;
所述第三视频终端接收所述视频会议服务器根据所述第一与会者替换指 令发送的替换所述个人视频信息后的所述合成视频并显示。
在第三种可能的实现方式中, 所述第三视频终端将所述合成视频进行显 示之后, 所述方法还包括:
所述第三视频终端当在预设时间范围内检测到所述第三视频终端所在的 会场中的语音釆集装置具有语音输入时, 确定用以指示正在发言的与会者的 第三与会者标识;
所述第三视频终端生成携带有所述第三与会者标识的第二与会者替换指 令, 其中, 所述第二与会者替换指令用于指示用所述第三与会者标识指示的 个人视频信息替换发送给所述第一视频终端或所述第二视频终端的合成视频 包含的个人视频信息;
所述第三视频终端向所述视频会议服务器发送所述第二与会者替换指令 用以使所述视频会议服务器根据所述第二与会者替换指令从所述发送给所述 第一视频终端或第二视频终端的合成视频所包含的个人视频信息中选择目标 个人视频信息, 将选择的所述目标个人视频信息替换为所述第三与会者标识 指示的个人视频信息。
在第四种可能的实现方式中, 所述第三视频终端将所述合成视频进行显 示之后, 所述方法还包括:
所述第三视频终端根据接收到的用户输入的添加指示信息生成添加指 令, 其中, 所述添加指令中携带有第四与会者标识, 所述添加指令用于指示 在所述合成视频中添加所述第四与会者标识指示的个人视频信息, 所述第四 与会者标识指示的个人视频信息是包含于除所述合成视频中之外的所述第一 视频信息的个人视频信息、 或包含于除所述合成视频中之外的所述第二视频 信息的个人视频信息;
所述第三视频终端向所述视频会议服务器发送所述添加指令用以使所述 视频服务器根据所述添加指令在所述合成视频中添加所述第四与会者标识指 示的个人视频信息;
所述第三视频终端接收所述视频会议服务器根据所述添加指令发送的添 加所述个人视频信息后的所述合成视频并显示。
在第五种可能的实现方式中, 所述第三视频终端将所述合成视频进行显 示之后, 所述方法还包括:
所述第三视频终端根据接收到的用户输入的删除指示信息生成删除指 令, 其中, 所述删除指令中携带有第五与会者标识, 所述删除指令用于指示 从所述合成视频所包含的个人视频信息中删除所述第五与会者标识指示的个 人视频信息, 所述第五与会者标识指示的个人视频信息是包含于所述合成视 频中的个人视频信息;
所述第三视频终端向所述视频会议服务器发送所述删除指令用以使所述 所述第五与会者标识指示的个人视频信息;
所述第三视频终端接收所述视频会议服务器根据所述删除指令发送的删 除所述个人视频信息后的所述合成视频并显示。
第三方面, 本发明实施例提供一种视频会议服务器, 包括:
接收单元, 用于接收第一视频终端发送的第一视频信息, 以及第二视频 终端发送的第二视频信息, 其中, 所述第一视频信息包括所述第一视频终端 所在的会场中每个与会者的个人视频信息, 所述第二视频信息包括所述第二 处理单元, 与所述接收单元相连, 用于从接收到的所述第二视频信息、 及所述第一视频信息中共获得预设数量的所述个人视频信息, 并将所述预设 数量的所述个人视频信息合成以生成合成视频, 以使在所述合成视频中, 所 述预设数量的所述个人视频信息分别对应的与会者均处于一致的会场背景 中;
发送单元, 与所述处理单元相连, 用于将所述合成视频发送给所述第三 视频终端进行显示。
在第一种可能的实现方式中, 所述接收单元还用于接收所述第一视频终 端、 所述第二视频终端、 所述第三视频终端发送的第一与会者替换指令, 其 中, 所述第一与会者替换指令中携带有第一与会者标识和第二与会者标识, 所述第一与会者替换指令用于指示用所述第二与会者标识指示的个人视频信 息替换所述合成视频中包含的所述第一与会者标识指示的个人视频信息, 所 述第一与会者标识指示的个人视频信息是包含于所述合成视频中的个人视频 信息, 所述第二与会者标识指示的个人视频信息是包含于除所述合成视频中 之外的所述第一视频信息的个人视频信息、 或包含于除所述合成视频中之外 的所述第二视频信息的个人视频信息;
所述处理单元还用于根据所述第一与会者替换指令用所述第二与会者标 识指示的个人视频信息替换所述合成视频中所述第一与会者标识指示的个人 视频信息;
所述发送单元还用于将替换所述个人视频信息后的所述合成视频发送给 所述第三视频终端进行显示。
在第二种可能的实现方式中, 所述接收单元还用于接收所述第一视频终 端、 所述第二视频终端、 或所述第三视频终端发送的第一与会者替换指令, 其中, 所述第二与会者替换指令中携带有第三与会者标识, 所述第二与会者 替换指令用于指示用所述第三与会者标识指示的个人视频信息替换所述合成 视频包含的个人视频信息, 所述第三与会者标识指示的个人视频信息是包含 于除所述合成视频中之外的所述第一视频信息的个人视频信息、 或包含于除 所述合成视频中之外的所述第二视频信息的个人视频信息;
所述处理单元还用于根据所述第二与会者替换指令从所述合成视频所包 含的个人视频信息中选择目标个人视频信息, 将选择的所述目标个人视频信 息替换为所述第三与会者标识指示的个人视频信息;
所述发送单元还用于将替换所述个人视频信息后的所述合成视频发送给 所述第三视频终端进行显示。
结合第三方面的第二种实现方式, 在第三种可能的实现方式中, 所述第 二与会者替换指令中还携带有位置信息;
所述处理单元还用于根据所述第二与会者替换指令将所述合成视频中所 包含的与所述第二与会者替换指令中所包含位置信息对应的个人视频信息作 为所述的目标个人视频信息。
在第四种可能的实现方式中, 所述接收单元还用于接收所述第一视频终 端、 所述第二视频终端、 或所述第三视频终端发送的添加指令, 其中, 所述 添加指令中携带有第四与会者标识, 所述添加指令用于指示在所述合成视频 中添加所述第四与会者标识指示的个人视频信息, 所述第四与会者标识指示 的个人视频信息是包含于除所述合成视频中之外的所述第一视频信息的个人 视频信息、 或包含于除所述合成视频中之外的所述第二视频信息的个人视频 信息;
所述处理单元还用于根据所述添加指令在所述合成视频中添加所述第四 与会者标识指示的个人视频信息;
所述发送单元还用于将添加所述个人视频信息后的所述合成视频发送给 所述第三视频终端进行显示。
在第五种可能的实现方式中, 所述接收单元还用于接收所述第一视频终 端、 所述第二视频终端、 或所述第三视频终端发送的删除指令, 其中, 所述 删除指令中携带有第五与会者标识, 所述删除指令用于指示从所述合成视频 所包含的个人视频信息中删除所述第五与会者标识指示的个人视频信息, 所 述第五与会者标识指示的个人视频信息是包含于所述合成视频中的个人视频 信息;
所述处理单元还用于根据所述删除指令在所述合成视频所包含的个人视 频信息中删除所述第五与会者标识指示的个人视频信息;
所述发送单元还用于将删除所述个人视频信息后的所述合成视频发送给 所述第三视频终端进行显示。
结合第三方面或第三方面的第一至第五任一种可能的实现方式, 在第六 种可能的实现方式中, 所述处理单元还用于对所述预设数量的所述个人视频 信息中的对应图像进行拼接, 生成合成图像, 其中, 所述预设数量的所述个 人视频信息中的对应图像在时序上同步; 组合多幅所述合成图像以生成合成 视频。
结合第三方面或第三方面的第一至第五任一种可能的实现方式, 在第七 种可能的实现方式中, 所述处理单元还用于将所述预设数量的所述个人视频 信息包含的图像信息排列在预设背景图像中生成合成图像, 其中, 从所述第 一视频信息、 所述第二视频信息中获得的所述预设数量的所述个人视频信息 包含的图像信息在时序上同步; 组合多幅所述合成图像以生成合成视频。
第四方面, 本发明实施例提供一种第三视频终端, 包括:
接收单元, 用于接收视频会议服务器发送的合成视频, 其中, 所述合成 视频通过所述视频会议服务器从第一视频终端接收到的第一视频信息、 及从 第二视频终端接收到的第二视频信息中共获得预设数量的个人视频信息, 并 将所述预设数量的所述个人视频信息合成而得到, 在所述合成视频中, 所述 预设数量的所述个人视频信息分别对应的与会者均处于一致的会场背景中; 所述第一视频信息包括所述第一视频终端所在的会场中每个与会者的个人视 频信息, 所述第二视频信息包括所述第二视频终端所在的会场中每个与会者 的个人视频信息;
显示单元, 与所述接收单元相连, 用于将所述合成视频进行显示。
在第一种可能的实现方式中, 所述接收单元还用于接收至少一个视频釆 集装置发送的视频信息;
所述第三视频终端还包括: 第一处理单元, 与所述接收单元相连, 将接收到的视频信息打包形成第 三视频信息, 其中, 所述至少一个视频釆集装置中的每个视频釆集装置用以 釆集第三视频终端所在的会场中至少一个与会者的视频信息, 所述第三视频 第一发送单元, 与所述处理单元相连, 用于将所述第三视频信息发送给 所述视频会议服务器, 以使所述视频会议服务器根据所述第三视频信息和所 述第一视频信息生成合成视频发送给所述第二视频终端, 或根据所述第三视 频信息和所述第二视频信息生成合成视频发送给所述第一视频终端。
在第二种可能的实现方式中, 所述的第三视频终端, 还包括:
第二处理单元, 用于根据接收到的用户输入的切换指示信息生成第一与 会者替换指令, 其中, 所述第一与会者替换指令中携带有第一与会者标识和 第二与会者标识, 所述第一与会者替换指令用于指示用所述第二与会者标识 指示的个人视频信息替换所述合成视频中包含的所述第一与会者标识指示的 个人视频信息, 所述第一与会者标识指示的个人视频信息是包含于所述合成 视频中的个人视频信息, 所述第二与会者标识指示的个人视频信息是包含于 除所述合成视频中之外的所述第一视频信息的个人视频信息、 或包含于除所 述合成视频中之外的所述第二视频信息的个人视频信息;
第二发送单元, 与所述第二处理单元相连, 用于向所述视频会议服务器 发送所述第一与会者替换指令用以使所述视频会议服务器根据所述第一与会 者替换指令用所述第二与会者标识指示的个人视频信息替换所述合成视频中 所述第一与会者标识指示的个人视频信息;
所述接收单元还用于接收所述视频会议服务器根据所述第一与会者替换 指令发送的替换所述个人视频信息后的所述合成视频并通过所述显示单元显 示。
在第三种可能的实现方式中, 所述第三视频终端, 还包括:
第三处理单元, 用于当在预设时间范围内检测到所述第三视频终端所在 的会场中的语音釆集装置具有语音输入时, 确定用以指示正在发言的与会者 的第三与会者标识; 生成携带有所述第三与会者标识的第二与会者替换指 令, 其中, 所述第二与会者替换指令用于指示用所述第三与会者标识指示的 个人视频信息替换发送给所述第一视频终端或所述第二视频终端的合成视频 包含的个人视频信息;
第三发送单元, 与所述第三处理单元相连, 用于向所述视频会议服务器 发送所述第二与会者替换指令用以使所述视频会议服务器根据所述第二与会 者替换指令从所述发送给所述第一视频终端或第二视频终端的合成视频所包 含的个人视频信息中选择目标个人视频信息, 将选择的所述目标个人视频信 息替换为所述第三与会者标识指示的个人视频信息。
在第四种可能的实现方式中, 所述第三视频终端, 还包括:
第四处理单元, 用于根据接收到的用户输入的添加指示信息生成添加指 令, 其中, 所述添加指令中携带有第四与会者标识, 所述添加指令用于指示 在所述合成视频中添加所述第四与会者标识指示的个人视频信息, 所述第四 与会者标识指示的个人视频信息是包含于除所述合成视频中之外的所述第一 视频信息的个人视频信息、 或包含于除所述合成视频中之外的所述第二视频 信息的个人视频信息;
第四发送单元, 与所述第四处理单元相连, 用于向所述视频会议服务器 发送所述添加指令用以使所述视频服务器根据所述添加指令在所述合成视频 中添加所述第四与会者标识指示的个人视频信息;
所述接收单元还用于接收所述视频会议服务器根据所述添加指令发送的 添加所述个人视频信息后的所述合成视频并通过所述显示单元显示。
在第五种可能的实现方式中, 所述第三视频终端, 还包括:
第五处理单元, 用于根据接收到的用户输入的删除指示信息生成删除指 令, 其中, 所述删除指令中携带有第五与会者标识, 所述删除指令用于指示 从所述合成视频所包含的个人视频信息中删除所述第五与会者标识指示的个 人视频信息, 所述第五与会者标识指示的个人视频信息是包含于所述合成视 频中的个人视频信息;
第五发送单元, 与所述第五处理单元相连, 用于向所述视频会议服务器 发送所述删除指令用以使所述视频服务器根据所述删除指令在所述合成视频 所包含的个人视频信息中删除所述第五与会者标识指示的个人视频信息; 所述接收单元还用于接收所述视频会议服务器根据所述删除指令发送的 删除所述个人视频信息后的所述合成视频并通过所述显示单元显示。
第五方面, 本发明实施例提供一种视频会议服务器, 包括: 处理器, 通 信接口, 存储器和总线;
其中所述处理器、 所述通信接口和所述存储器通过所述总线互联; 所述通信接口, 用于接收第一视频终端发送的第一视频信息, 以及第二 视频终端发送的第二视频信息, 其中, 所述第一视频信息包括所述第一视频 终端所在的会场中每个与会者的个人视频信息, 所述第二视频信息包括所述 所述存储器, 用于存储指令或数据;
所述处理器调用存储在所述存储器中的指令以实现从接收到的所述第 二视频信息、 及所述第一视频信息中共获得预设数量的所述个人视频信息, 并将所述预设数量的所述个人视频信息合成以生成合成视频, 以使在所述合 成视频中, 所述预设数量的所述个人视频信息分别对应的与会者均处于一致 的会场背景中;
所述通信接口还用于将所述合成视频发送给所述第三视频终端进行显 示。
在第一种可能的实现方式中, 所述通信接口还用于接收所述第一视频终 端、 所述第二视频终端、 所述第三视频终端发送的第一与会者替换指令, 其 中, 所述第一与会者替换指令中携带有第一与会者标识和第二与会者标识, 所述第一与会者替换指令用于指示用所述第二与会者标识指示的个人视频信 息替换所述合成视频中包含的所述第一与会者标识指示的个人视频信息, 所 述第一与会者标识指示的个人视频信息是包含于所述合成视频中的个人视频 信息, 所述第二与会者标识指示的个人视频信息是包含于除所述合成视频中 之外的所述第一视频信息的个人视频信息、 或包含于除所述合成视频中之外 的所述第二视频信息的个人视频信息;
所述处理器还用于调用所述存储器的指令和数据以实现, 根据所述 第一与会者替换指令用所述第二与会者标识指示的个人视频信息替换所述合 成视频中所述第一与会者标识指示的个人视频信息;
所述通信接口还用于将替换所述个人视频信息后的所述合成视频发送给 所述第三视频终端进行显示。
在第二种可能的实现方式中, 所述通信接口还用于接收所述第一视频终 端、 所述第二视频终端、 或所述第三视频终端发送的第一与会者替换指令, 其中, 所述第二与会者替换指令中携带有第三与会者标识, 所述第二与会者 替换指令用于指示用所述第三与会者标识指示的个人视频信息替换所述合成 视频包含的个人视频信息, 所述第三与会者标识指示的个人视频信息是包含 于除所述合成视频中之外的所述第一视频信息的个人视频信息、 或包含于除 所述合成视频中之外的所述第二视频信息的个人视频信息;
所述处理器还用于调用所述存储器的指令和数据以实现, 根据所述第二 与会者替换指令从所述合成视频所包含的个人视频信息中选择目标个人视频 信息, 将选择的所述目标个人视频信息替换为所述第三与会者标识指示的个 人视频信息;
所述通信接口还用于将替换所述个人视频信息后的所述合成视频发送给 所述第三视频终端进行显示。
结合第五方面的第二种实现方式, 在第三种可能的实现方式中, 所述第 二与会者替换指令中还携带有位置信息;
所述处理器还用于调用所述存储器的指令和数据以实现, 根据所述第二 包含位置信息对应的个人视频信息作为所述的目标个人视频信息。
在第四种可能的实现方式中, 所述通信接口还用于接收所述第一视频终 端、 所述第二视频终端、 或所述第三视频终端发送的添加指令, 其中, 所述 添加指令中携带有第四与会者标识, 所述添加指令用于指示在所述合成视频 中添加所述第四与会者标识指示的个人视频信息, 所述第四与会者标识指示 的个人视频信息是包含于除所述合成视频中之外的所述第一视频信息的个人 视频信息、 或包含于除所述合成视频中之外的所述第二视频信息的个人视频 信息;
所述处理器还用于调用所述存储器的指令和数据以实现, 根据所述添加 指令在所述合成视频中添加所述第四与会者标识指示的个人视频信息;
所述通信接口还用于将添加所述个人视频信息后的所述合成视频发送给 所述第三视频终端进行显示。
在第五种可能的实现方式中, 所述通信接口还用于接收所述第一视频终 端、 所述第二视频终端、 或所述第三视频终端发送的删除指令, 其中, 所述 删除指令中携带有第五与会者标识, 所述删除指令用于指示从所述合成视频 所包含的个人视频信息中删除所述第五与会者标识指示的个人视频信息, 所 述第五与会者标识指示的个人视频信息是包含于所述合成视频中的个人视频 信息;
所述处理器还用于调用所述存储器的指令和数据以实现, 根据所述删除 指令在所述合成视频所包含的个人视频信息中删除所述第五与会者标识指示 的个人视频信息;
所述通信接口还用于将删除所述个人视频信息后的所述合成视频发送给 所述第三视频终端进行显示。
结合第五方面或第五方面的第一至第五任一种可能的实现方式, 在第六 种可能的实现方式中, 所述处理器还用于对所述预设数量的所述个人视频信 息中的对应图像进行拼接, 生成合成图像, 其中, 所述预设数量的所述个人 视频信息中的对应图像在时序上同步; 组合多幅所述合成图像以生成合成视 频。
结合第五方面或第五方面的第一至第五任一种可能的实现方式, 在第七 种可能的实现方式中, 所述处理器还用于将所述预设数量的所述个人视频信 息包含的图像信息排列在预设背景图像中生成合成图像, 其中, 从所述第一 视频信息、 所述第二视频信息中获得的所述预设数量的所述个人视频信息包 含的图像信息在时序上同步; 组合多幅所述合成图像以生成合成视频。
第六方面, 本发明实施例提供一种第三视频终端, 包括: 处理器, 通信 接口, 存储器、 总线和显示器;
其中所述处理器、 所述通信接口、 所述存储器和所述显示器通过所述总 线互联;
所述通信接口用于接收视频会议服务器发送的合成视频, 其中, 所述合 成视频通过所述视频会议服务器从第一视频终端接收到的第一视频信息、 及 从第二视频终端接收到的第二视频信息中共获得预设数量的个人视频信息, 并将所述预设数量的所述个人视频信息合成而得到, 在所述合成视频中, 所 述预设数量的所述个人视频信息分别对应的与会者均处于一致的会场背景 中; 所述第一视频信息包括所述第一视频终端所在的会场中每个与会者的个 人视频信息, 所述第二视频信息包括所述第二视频终端所在的会场中每个与 会者的个人视频信息; 所述存储器, 用于存储指令或数据;
所述处理器调用存储在所述存储器中的指令以实现将所述合成视频通 过所述显示器进行显示。
在第一种可能的实现方式中, 所述通信接口还用于接收至少一个视频釆 集装置发送的视频信息;
所述处理器还用于将接收到的视频信息打包形成第三视频信息, 其 中, 所述至少一个视频釆集装置中的每个视频釆集装置用以釆集第三视频终 端所在的会场中至少一个与会者的视频信息, 所述第三视频信息包括所述第 所述通信接口还用于将所述第三视频信息发送给所述视频会议服务器, 以使所述视频会议服务器根据所述第三视频信息和所述第一视频信息生成合 成视频发送给所述第二视频终端, 或根据所述第三视频信息和所述第二视频 信息生成合成视频发送给所述第一视频终端。
在第二种可能的实现方式中, 所述处理器还用于调用所述存储器的指 令和数据以实现, 根据接收到的用户输入的切换指示信息生成第一与会者 替换指令, 其中, 所述第一与会者替换指令中携带有第一与会者标识和第二 与会者标识, 所述第一与会者替换指令用于指示用所述第二与会者标识指示 的个人视频信息替换所述合成视频中包含的所述第一与会者标识指示的个人 视频信息, 所述第一与会者标识指示的个人视频信息是包含于所述合成视频 中的个人视频信息, 所述第二与会者标识指示的个人视频信息是包含于除所 述合成视频中之外的所述第一视频信息的个人视频信息、 或包含于除所述合 成视频中之外的所述第二视频信息的个人视频信息;
所述通信接口还用于向所述视频会议服务器发送所述第一与会者替换指 令用以使所述视频会议服务器根据所述第一与会者替换指令用所述第二与会 者标识指示的个人视频信息替换所述合成视频中所述第一与会者标识指示的 个人视频信息; 接收所述视频会议服务器根据所述第一与会者替换指令发送 的替换所述个人视频信息后的所述合成视频并通过所述显示器显示。
在第三种可能的实现方式中, 所述处理器还用于当在预设时间范围内 检测到所述第三视频终端所在的会场中的语音釆集装置具有语音输入时, 确 定用以指示正在发言的与会者的第三与会者标识; 生成携带有所述第三与会 者标识的第二与会者替换指令, 其中, 所述第二与会者替换指令用于指示用 所述第三与会者标识指示的个人视频信息替换发送给所述第一视频终端或所 述第二视频终端的合成视频包含的个人视频信息;
所述通信接口还用于向所述视频会议服务器发送所述第二与会者替换指 令用以使所述视频会议服务器根据所述第二与会者替换指令从所述发送给所 述第一视频终端或第二视频终端的合成视频所包含的个人视频信息中选择目 标个人视频信息, 将选择的所述目标个人视频信息替换为所述第三与会者标 识指示的个人视频信息。
在第四种可能的实现方式中, 所述处理器还用于调用所述存储器的指 令和数据以实现, 根据接收到的用户输入的添加指示信息生成添加指令,其 中, 所述添加指令中携带有第四与会者标识, 所述添加指令用于指示在所述 合成视频中添加所述第四与会者标识指示的个人视频信息, 所述第四与会者 标识指示的个人视频信息是包含于除所述合成视频中之外的所述第一视频信 息的个人视频信息、 或包含于除所述合成视频中之外的所述第二视频信息的 个人视频信息;
所述通信接口还用于向所述视频会议服务器发送所述添加指令用以使所 述视频服务器根据所述添加指令在所述合成视频中添加所述第四与会者标识 指示的个人视频信息; 接收所述视频会议服务器根据所述添加指令发送的添 加所述个人视频信息后的所述合成视频并通过所述显示器显示。
在第五种可能的实现方式中, 所述处理器还用于调用所述存储器的指 令和数据以实现, 根据接收到的用户输入的删除指示信息生成删除指令,其 中, 所述删除指令中携带有第五与会者标识, 所述删除指令用于指示从所述 合成视频所包含的个人视频信息中删除所述第五与会者标识指示的个人视频 信息, 所述第五与会者标识指示的个人视频信息是包含于所述合成视频中的 个人视频信息;
所述通信接口还用于向所述视频会议服务器发送所述删除指令用以使所 除所述第五与会者标识指示的个人视频信息; 接收所述视频会议服务器根据 所述删除指令发送的删除所述个人视频信息后的所述合成视频并通过所述显 示器显示。 由上述技术方案可知, 本发明实施例提供的视频会议处理方法及设备, 视频会议服务器接收第一视频终端发送的第一视频信息, 以及第二视频终端 发送的第二视频信息, 其中, 第一视频信息包括第一视频终端所在的会场中 每个与会者的个人视频信息, 第二视频信息包括第二视频终端所在的会场中 每个与会者的个人视频信息; 从接收到的第二视频信息、 及第一视频信息中 共获得预设数量的个人视频信息, 并将预设数量的个人视频信息合成以生成 合成视频, 以使在合成视频中, 预设数量的个人视频信息分别对应的与会者 均处于一致的会场背景中; 将合成视频发送给第三视频终端进行显示。 由于 合成视频的生成是以与会者的个人视频信息为基础, 避免了以会场的整体视 频为基础时, 由于会场的不均等性造成的显示效果不佳的缺陷, 突破了物理 空间的限制, 为最大效率的信息显示提供了灵活性。 附图说明 为了更清楚地说明本发明实施例或现有技术中的技术方案, 下面将 对实施例或现有技术描述中所需要使用的附图作一简单地介绍, 显而易 见地, 下面描述中的附图是本发明的一些实施例, 对于本领域普通技术 人员来讲, 在不付出创造性劳动性的前提下, 还可以根据这些附图获得 其他的附图。
图 1为本发明实施例提供的第一种视频会议处理方法流程图;
图 2为本发明实施例提供的会场布局示意图;
图 3为本发明实施例提供的第二种视频会议处理方法流程图;
图 4为本发明实施例提供的第三种视频会议处理方法流程图;
图 5为本发明实施例提供的第四种视频会议处理方法流程图;
图 6为本发明实施例提供的第五种视频会议处理方法流程图;
图 7为本发明实施例提供的图像单元示意图;
图 8为本发明实施例提供的第六种视频会议处理方法流程图;
图 9为本发明实施例提供的第七种视频会议处理方法流程图;
图 10为本发明实施例提供的第八种视频会议处理方法流程图; 图 1 1为本发明实施例提供的第九种视频会议处理方法流程图; 图 12为本发明实施例提供的第十种视频会议处理方法流程图; 图 13为本发明实施例提供的第一种视频会议服务器结构示意图;
图 14为本发明实施例提供的第一种第一视频终端结构示意图; 图 15为本发明实施例提供的第二种第一视频终端结构示意图; 图 16为本发明实施例提供的第二种视频会议服务器结构示意图;
图 17为本发明实施例提供的第三种第一视频终端结构示意图。 具体实施方式 为使本发明实施例的目的、 技术方案和优点更加清楚, 下面将结合 本发明实施例中的附图, 对本发明实施例中的技术方案进行清楚、 完整 地描述, 显然, 所描述的实施例是本发明一部分实施例, 而不是全部的 实施例。 基于本发明中的实施例, 本领域普通技术人员在没有作出创造 性劳动前提下所获得的所有其他实施例, 都属于本发明保护的范围。
图 1为本发明实施例提供的第一种视频会议处理方法流程图。 如图 1所 频的处理过程, 本实施例提供的视频会议处理方法具体包括:
步骤 S 10、 视频会议服务器接收第一视频终端发送的第一视频信息, 以 及第二视频终端发送的第二视频信息, 其中, 所述第一视频信息包括所述第
步骤 S20、 所述视频会议服务器从接收到的所述第二视频信息、 及所述 第一视频信息中共获得预设数量的所述个人视频信息, 并将所述预设数量的 所述个人视频信息合成以生成合成视频, 以使在所述合成视频中, 所述预设 数量的所述个人视频信息分别对应的与会者均处于一致的会场背景中;
步骤 S30、 所述视频会议服务器将所述合成视频发送给所述第三视频终 端进行显示。
具体地, 参与视频会议的会场为至少两个, 每个会场中至少有一个参与 者。 每个会场中设置有视频终端, 会场中还设置有用以釆集会场视频的视频 釆集装置, 用以釆集会场语音的语音釆集装置, 以及用以显示其他会场视频 的显示装置。 视频釆集装置可以为摄像头, 语音釆集装置可以为麦克风, 显 示装置可以为显示器或电视机等。 视频釆集装置、 语音釆集装置和显示装置 可以集成在视频终端中, 也可以单独设置。 可以在会场中为每个参与者都设 置一个视频釆集装置和一个声音釆集装置, 专门用于釆集该参与者的实时个 人视频和语音。 视频终端将每个视频釆集装置釆集到的个人视频信息打包形 成视频信息发送给视频会议服务器, 同时将声音釆集装置釆集到的语音发送 给视频会议服务器。 也可以在会场中设置一个视频釆集装置和声音釆集装 置, 统一釆集所有参与者的视频和语音。 视频终端将视频釆集装置釆集到的 视频作为视频信息发送给视频会议服务器, 同时将声音釆集装置釆集到的语 音发送给视频会议服务器, 则该视频信息中也包括每个参与者的个人视频信 息, 视频会议服务器可以将每个参与者的个人视频信息分割开来。 每个会场 的会场背景可以一致设置, 因此, 每个个人视频信息中的与会者所在的会场 背景一致。 也可以在生成合成视频的过程中, 使得合成视频中所显示的与会 者的会场背景一致。
视频会议服务器根据每个视频终端发送的视频信息, 为每个视频终端分 别生成合成视频, 以使的每个会场的视频终端所显示的为其他会场的视频图 像。 为了描述方面, 视频会议服务器为一个视频终端生成合成视频的过程为 例对本实施例进行说明, 该视频终端为第三视频终端, 则其他的视频终端为 第一视频终端和第二视频终端, 视频会议服务器从第一视频终端接收到的视 频信息为第一视频信息, 视频会议服务器从第二视频终端接收到的视频信息 为第二视频信息, 视频会议服务器从第三视频终端接收到的视频信息为第三 视频信息。 本实施例中的第一、 第二和第三仅用于区分, 不用于顺序限定。
视频会议服务器在为第三视频终端生成合成视频的过程中, 从接收到的 第二视频信息、 及第一视频信息中共获得预设数量的个人视频信息。 在实际 实现过程中, 可以根据预设规则从第一视频信息和第二视频信息中获取预设 数量的个人视频信信息。 该预设数量可以根据实际的显示效果需要和显示装 置的规格参数来具体设置, 获取到的个人视频信息可以全是第一视频信息中 的个人视频信息, 也可以全是第二视频信息中的个人视频信息, 还可以部分 时第一视频信息中的个人视频信息,部分为第二视频信息中的个人视频信息。 预设规则可以有多种方式, 例如, 在第一种实现方式中, 对于视频会议系统 的初始处理, 可以从首先接入视频会议服务器的视频终端发送的视频信息中 确定预设数量的个人视频信息; 在第二种实现方式中, 每个会场具有优先级 以标识该会场的重要程度, 从优先级最高的会场对应的视频终端发送的视频 信息中确定预设数量的个人视频信息; 在第三种实现方式中, 每个与会者具 有优先级以标识该与会者的重要程度, 可以按照优先级由高到低确定预设数 量的视频信息; 在第四种实现方式中, 视频终端还可以发送用以指示正在发 言的与会者的与会者标识给视频会议服务器, 以使得视频会议服务器可以将 正在发言的与会者的视频信息作为合成视频的一部分。 预设规则可以根据实 际的会议需要来设置, 不以本实施例为限。
视频会议服务器将该预设数量的个人视频信息合成为合成视频, 并将该 合成视频发送给第三视频终端。 第三视频终端将该合成视频显示给该第三视 频终端所在会场的与会者, 或者, 该第三视频终端通过独立的显示装置显示 该合成视频, 以实现视频会议过程。
本实施例提供的视频会议处理方法, 视频会议服务器接收第一视频终端 发送的第一视频信息, 以及第二视频终端发送的第二视频信息, 其中, 第一 视频信息包括第二视频终端所在的会场中每个与会者的个人视频信息; 从接 收到的第二视频信息、 及第一视频信息中共获得预设数量的个人视频信息, 并将预设数量的个人视频信息合成以生成合成视频, 以使在合成视频中, 预 设数量的个人视频信息分别对应的与会者均处于一致的会场背景中; 将合成 视频发送给第三视频终端进行显示。 由于合成视频的生成是以与会者的个人 视频信息为基础, 避免了以会场的整体视频为基础时, 由于会场的不均等性 造成的显示效果不佳的缺陷, 突破了物理空间的限制, 为最大效率的信息显 示提供了灵活性。
在实际应用过程中, 为了达到更好的视频会议效果, 每个会场可以统一 布局。 图 2为本发明实施例提供的会场布局示意图, 如图 2所示, 会场中可 以设置有视频釆集装置 001、 背景墙装置 002、 大屏显示装置 003、 声音定位 录入装置 004、 与会者坐席装置 005和视频终端 006。 与会者坐席装置 005用 于给与会者提供座位, 座位可以使用固定式坐席, 如沙发等, 也可以使用非 固定式坐席, 如带轮子的转椅等, 座位的数量例如为图 2所示的六个。 与会 者坐席装置 005可以摆设半圓形的桌子, 与会者坐席装置 005的座位也按照 桌子圓弧的形状摆设。 大屏显示装置 003可以由多个或一个大尺寸显示器组 建的装置, 其大小不得低于一个固定的尺寸, 以保证视频釆集装置 001拍摄 的图像在本装置显像时得到接近与真实比例人物大小的视觉感受, 大屏显示 装置 003设置成圓弧状。 当大屏显示装置 003所显示的合成视频中的与会者 的数量也为六个, 则可以实现会场的与会者与大屏显示装置 003所显示的与 会者在同一会场围着一张圓桌进行会议的感觉。 对于每个与会者设置一个视 频釆集装置 001 , 视频釆集装置 001可以与声音定位录入装置 004配合, 或 者其他手段输入的指令, 拍摄指定区域的图像。 声音定位录入装置 004由多 个或一个收音装置与声音定位装置组成。 通过声音定位装置捕捉与会者发声 方向, 生成指令发送给视频釆集装置 001 , 并录入该发声位的语音信息。 不 同会场的背景墙装置 002结构形态一致, 布局在大屏显示装置 003和与会者 坐席装置 005后方并不得低于一个固定的尺寸。 背景墙装置 002上可以布置 多种纹理形态, 以便于视频会议服务器对个人视频信息合成处理生成合成视 频, 通过大屏显示装置 003呈现的合成视频与布局在大屏显示装置 003后方 及与会者坐席装置 005后方的背景墙装置 002, 互相接合 /拼合, 实现与会者 从感知上产生在同一空间中进行沟通的体验感。 视频终端 006可以通过但不 限于使用可视化的可触摸的多点控制器, 实体按键控制器或其他形式实现。 通过该视频终端 006对各会场中的沟通需求进行控制, 包括但不限于: 显示 模式的切换, 个人视频信息的切换, 声控切换的开关, 文档的演示等。
在本实施例中, 视频会议服务器初始化的处理过程, 可以为: 根据设置 规则将最先接入的视频终端发送的视频信息中的个人视频信息生成合成视 频, 由此可以简化初始的处理流程, 而且, 可以提高为视频终端提供合成视 频的速度, 缩短用户的等待时间。 该设置规则具体用以指示如何从最先接入 的视频终端发送的视频信息中确定个人视频信息。 例如, 若最先接入的视频 终端发送的视频信息中的个人视频信息的数量恰好等于或小于所述预设数 量, 则可以直接将视频信息中的所有的个人视频信息生成合成视频; 若最先 接入的视频终端发送的视频信息中的个人视频信息的数量多于所述预设数 量, 可以按照视频信息中个人视频信息的顺序, 依次选取, 也可以根据用户 预设选取特定位置与会者的个人视频信息。
图 3为本发明实施例提供的第二种视频会议处理方法流程图。 如图 3所 示, 在本实施例中, 步骤 S30, 所述视频会议服务器将所述合成视频发送给 所述第三视频终端进行显示之后, 所述方法还可以包括:
步骤 S40、 所述视频会议服务器接收所述第一视频终端、 所述第二视频 终端、 或所述第三视频终端发送的第一与会者替换指令, 其中, 所述第一与 会者替换指令中携带有第一与会者标识和第二与会者标识, 所述第一与会者 替换指令用于指示用所述第二与会者标识指示的个人视频信息替换所述合成 视频中包含的所述第一与会者标识指示的个人视频信息, 所述第一与会者标 识指示的个人视频信息是包含于所述合成视频中的个人视频信息, 所述第二 与会者标识指示的个人视频信息是包含于除所述合成视频中之外的所述第一 视频信息的个人视频信息、 或包含于除所述合成视频中之外的所述第二视频 信息的个人视频信息;
步骤 S50、 所述视频会议服务器根据所述第一与会者替换指令用所述第 二与会者标识指示的个人视频信息替换所述合成视频中所述第一与会者标识 指示的个人视频信息;
步骤 S60、 所述视频会议服务器将替换所述个人视频信息后的所述合成 视频发送给所述第三视频终端进行显示。
具体地, 可以预先为参加会议的每个与会者分配标识, 则在视频会议过 程中与该与会者相关的处理都可以通过该标识实现。 在会场为统一布局的应 用场景下, 每个会场中的座位数量及摆放位置固定, 为了简化处理流程, 可 以为每个座位分配标识, 以达到对不同的与会者进行区分的目的。
在初始过程中, 每个会场的视频终端接入视频会议服务器, 视频会议服 务器可以从最先接入的视频终端发送的视频信息中确定预设数量的个人视频 信息, 生成合成视频并发送给第三视频终端。 第三视频终端所在的会场的管 理人员或者参与人员可以根据需要切换合成视频中的个人视频信息, 例如, 将合成视频中的某个与会者的个人视频信息, 切换为想要看到的与会者的个 人视频信息。 第三视频终端可以提供可视的人机交互界面, 该人机交互界面 可以通过但不限于触摸屏、 键盘或重力感应等方式实现切换指示信息的输 入, 触摸屏或者操作界面显示屏可以显示图像交互界面。 第三视频终端为用 户显示每个与会者的图片以及编号, 用户可以直接点击需要切换的两位与会 者的图片或者输入两位与会者的编号以实现切换。 第三视频终端根据用户的 输入生成该第一与会者替换指令, 该第一与会者替换指令中携带有第一与会 者标识和第二与会者标识, 第一与会者标识用以指示切换前的与会者, 第二 与会者标识用以指示切换后的与会者。
视频会议服务器还可以将发送给第三视频终端中所包含的个人视频信息 所对应的与会者的信息同步给其他视频终端, 例如第一视频终端和第二视频 终端, 因此, 第一视频终端或第二视频终端所在会场的管理人员或者参与人 员也可以根据会议需要进行上述切换操作, 在此不再赘述。
在另一种应用场景下, 若每个会场的座位固定且数量一致。 例如, 设置 有三个会场, 分别为会场 1、 会场 2和会场 3 , 每个会场都设置有六个座位, 分别为座位 1 , 座位 2, 座位 3 , 座位 4, 座位 5和座位 6。 第三视频终端设 置在会场 3 , 初始时, 第三视频终端显示的为会场 1 的六位与会者的视频信 息, 用户只点击想要看到的与会者的图片, 或者输入该与会者的编号, 若用 户输入的为会场 2座位 3的与会者, 则第三视频终端生成的第一与会者替换 指令中所携带的第一与会者标识用以指示会场 1座位 3的与会者, 第二与会 者标识用以指示会场 2座位 3的与会者。
上述切换过程是通过用户手动触发, 视频终端为用户提供手动切换模 式, 当用户选择该模式时, 需要用户手动输入触发切换流程。
在实际应用中, 第二与会者标识的数量可以为一个或多个, 即, 将合成 视频中的一个与会者的个人视频信息替换为一个或多个其他与会者的个人视 频信息, 为了保证显示效果, 第二与会者标识的数量不宜过多。
图 4为本发明实施例提供的第三种视频会议处理方法流程图。 如图 4所 示, 在本实施例中, 步骤 S30, 所述视频会议服务器将所述合成视频发送给 所述第三视频终端进行显示之后, 所述方法还可以包括:
步骤 S41、 所述视频会议服务器接收所述第一视频终端、 所述第二视频 终端、 或第三视频终端发送的第二与会者替换指令, 其中, 所述第二与会者 替换指令中携带有第三与会者标识, 所述第二与会者替换指令用于指示用所 述第三与会者标识指示的个人视频信息替换所述合成视频包含的个人视频信 息, 所述第三与会者标识指示的个人视频信息是包含于除所述合成视频中之 外的所述第一视频信息的个人视频信息、 或包含于除所述合成视频中之外的 所述第二视频信息的个人视频信息;
步骤 S51、 所述视频会议服务器根据所述第二与会者替换指令从所述合 成视频所包含的个人视频信息中选择目标个人视频信息, 将选择的所述目标 个人视频信息替换为所述第三与会者标识指示的个人视频信息;
步骤 S61、 所述视频会议服务器将替换所述个人视频信息后的所述合成 视频发送给所述第三视频终端进行显示。
具体地, 切换过程还可以通过声控触发。 当某个会场中有与会者发言 时, 视频终端可以通过语音釆集装置上传的语音识别到与会者处于发言的状 态, 或者, 还可以设置发言按钮, 与会者发言时触按该按钮, 以使得视频终 端可以获知哪位与会者正在发言。 视频终端生成第二与会者替换指令, 该第 二与会者替换指令中携带有用以指示正在发言的与会者的标识。 视频会议服 务器可以根据第二预设规则确定被替换的与会者, 该第二预设规则也可以预 先设置, 例如可以为根据发言的与会者的位置, 将合成视频中相应为的与会 者替换。
仍以设置有三个会场, 分别为会场 1、 会场 2和会场 3 , 每个会场都设置 有六个座位, 分别为座位 1 , 座位 2, 座位 3 , 座位 4, 座位 5和座位 6为例 进行说明。 第三视频终端设置在会场 3 , 初始时, 第三视频终端显示的为会 场 1的六位与会者的个人视频信息, 会场 2的座位 2的与会者开始发言, 则 会场 2的视频终端检测到该与会者发言时, 生成与会者替换指令, 与会者替 换指令中携带有用以指示会场 2座位 2的与会者的标识。 视频会议服务器当 接收到该与会者替换指令时, 在为其他视频终端生成合成视频的过程中, 将 该发言的与会者的个人视频信息合成到合成视频中, 以实现切换, 例如, 将 发送给第三视频终端的合成视频中的某个个人视频信息替换为会场 2座位 2 的与会者的个人视频信息。
视频终端还可以为用户提供声控切换模式, 当用户选择该模式时, 通过 声音触发切换流程。 当然, 在实际应用过程中。 手动切换模式和声控切换模 块可以并存, 可以以声控切换为主要的切换模式, 以保证与会者可以看到发 言者, 再以手动切换为辅助的切换模式, 以保证与会者可以看到想要看到的 重要与会者。
在本实施例中, 所述第二与会者替换指令中还携带有位置信息, 步骤 S51 , 所述视频会议服务器根据所述第二与会者替换指令从所述合成视频所 包含的个人视频信息中选择目标个人视频信息, 具体可以包括: 所述视频会议服务器根据所述第二与会者替换指令将所述合成视频中所 包含的与所述第二与会者替换指令中所包含位置信息对应的个人视频信息作 为所述的目标个人视频信息。
具体地, 视频会议服务器中存储有与会者的标识和该与会者的位置信息 的对应关系, 可以根据第三与会者标识从上述对应关系中获得该与会者的位 置信息, 也可以在视频终端发送的第二与会者替换指令中携带位置信息。 当 每个会场中的与会者数量一致, 合成视频中的视频信息的数量也与会场中的 与会者的数量相同时, 会场中所显示的合成视频中每个与会者的位置与实际 会场中的与会者的位置相同, 切换时, 将位置相同的与会者之间进行切换, 可以降低切换时所带来的突兀感。
图 5为本发明实施例提供的第四种视频会议处理方法流程图。 如图 5所 示, 在本实施例中, 步骤 S30, 所述视频会议服务器将所述合成视频发送给 所述第三视频终端进行显示之后, 所述方法还可以包括:
步骤 S42、 所述视频会议服务器接收所述第一视频终端、 所述第二视频 终端、 或所述第三视频终端发送的添加指令, 其中, 所述添加指令中携带有 第四与会者标识, 所述添加指令用于指示在所述合成视频中添加所述第四与 会者标识指示的个人视频信息, 所述第四与会者标识指示的个人视频信息是 包含于除所述合成视频中之外的所述第一视频信息的个人视频信息、 或包含 于除所述合成视频中之外的所述第二视频信息的个人视频信息;
步骤 S52、 所述视频服务器根据所述添加指令在所述合成视频中添加所 述第四与会者标识指示的个人视频信息;
步骤 S62、 所述视频会议服务器将添加所述个人视频信息后的所述合成 视频发送给所述第三视频终端进行显示。
具体地, 视频终端所在的会场的管理人员或者参与人员可以根据需要在 合成视频中添加想要看到的与会者的视频, 则管理人员或者参与人员可以输 入添加指示信息, 以使得视频终端生成添加指令。 该第四与会者标识的数量 可以为一个或多个, 为了保证显示效果, 第四与会者标识的数量不宜过多。
图 6为本发明实施例提供的第五种视频会议处理方法流程图。 如图 6所 示, 在本实施例中, 步骤 S30, 所述视频会议服务器将所述合成视频发送给 所述第三视频终端进行显示之后, 所述方法还包括: 步骤 S43、 所述视频会议服务器接收所述第一视频终端、 所述第二视频 终端、 或所述第三视频终端发送的删除指令, 其中, 所述删除指令中携带有 第五与会者标识, 所述删除指令用于指示从所述合成视频所包含的个人视频 信息中删除所述第五与会者标识指示的个人视频信息, 所述第五与会者标识 指示的个人视频信息是包含于所述合成视频中的个人视频信息;
步骤 S53、 所述视频服务器根据所述删除指令在所述合成视频所包含的 个人视频信息中删除所述第五与会者标识指示的个人视频信息;
步骤 S63、 所述视频会议服务器将删除所述个人视频信息后的所述合成 视频发送给所述第三视频终端进行显示。
具体地, 视频终端所在的会场的管理人员或者参与人员可以根据需要在 合成视频中删除某个与会者的视频, 则管理人员或者参与人员可以输入删除 指示信息, 以使得视频终端生成删除指令。 该第五与会者标识的数量可以为 一个或多个, 为了保证显示效果, 合成视频中被删除的视频信息的位置可以 通过静态换面来显示。
视频会议服务器生成合成视频的方式可以有多种, 以保证合成视频中每 个与会者的尺寸一直即可。 为了降低视频会议服务器的处理难度, 与会者的 座位和视频釆集装置布置在一个对应的物理位置上, 视频釆集装置可以基于 但不仅限于人脸捕捉, 人体红外特征捕捉等现有技术, 将所拍摄到的参与总 者布置在一个预设尺寸的图像单元中, 如图 7 所示, 该图像单元的尺寸为 el.l X el.2, 并将与会者画面布置在以 el.l x el.2尺寸的图像单元中的 el.7轴 心位置。 因为视频是一种连续动态的画面, 故允许用户在 el. l X el.2尺寸的 图像单元中以 el.7 为轴心位置的左右小范围的在阈值内的移动。 优选地, el.l X el.2 的单元尺寸满足 el.3 , el.5 , el.6 的尺寸规格要求。 其中, 一个 el. l x el .3 的图像单元为最小显示单元, el.5 的尺寸定义是为了符合用户在 多点视频会议中自然的手部动作, 保证在一定范围内的与会者的行为都可以 被视频釆集装置拍摄成功。 el.6 的尺寸定义是基于视频釆集装置拍摄到的与 会者画面是以坐姿为主的情况下, 与会者有站立的需求时, 保证视频釆集装 置拍摄的画面可以将参与人的站立状况的画面全面拍摄成功, 避免出现头部 超出摄像范围情况。
在本实施例中, 步骤 S30, 所述视频会议服务器将所述预设数量的所述 个人视频信息合成以生成合成视频, 具体可以包括:
所述视频会议服务器对所述预设数量的所述个人视频信息中的对应图像 进行拼接, 生成合成图像, 其中, 所述预设数量的所述个人视频信息中的对 应图像在时序上同步; 组合多幅所述合成图像以生成合成视频。
具体地, 在合成视频的一种实现方式中, 图 7所示的图像单元中还可以 设置尺寸为 el . l x el .4的合并区域, 用于多个 el . l x el .2尺寸的图像单元进 行合成过程中的重叠, 融合合并区域。 通过合并区域的设置, 并在个人视频 信息的合并过程中, 将合并区域进行融合处理, 使得每个个人视频信息之间 衔接自然, 提高了视频会议的显示效果。
当系统中的视频终端发送过来的视频信息的规格不是上述预设规格时, 基于发明的以与会者为单位的显示机制进行兼容。 某个视频终端发送的视频 信息中的个人视频的尺寸为 fl x f2, 调整其竖向方向画面比例与 el . l吻合, 在画面的左右两侧保留尺寸为 e 1.1 X e 1.4合并区域用于与其他个人视频信息 合并。 当某个会场没有视频只有语音接入时, 也可以为该会场设置一个明显 的图标表示接入源, 该图标可以是上述 el . l x el .2尺寸规格, 并设置有尺寸 为 el . l el .4的合并区域。
在实际应用中, 优选地, 合成视频中的个人视频信息为一横排, 以实现 模拟会议现场的显示效果。 合成视频中的个人视频信息也可以为多排显示, 可以实现模拟阶梯会场的显示效果。
在本实施例中, 步骤 S30, 所述视频会议服务器将所述预设数量的所述 个人视频信息合成以生成合成视频, 具体可以包括:
所述视频会议服务器将从所述预设数量的所述个人视频信息提取出的图 像信息排列在预设背景图像中生成合成图像, 其中, 从所述第一视频信息、 所述第二视频信息中获得的所述预设数量的所述个人视频信息包含的图像信 息在时序上同步; 组合多幅所述合成图像以生成合成视频。
具体地, 在合成视频的一种实现方式中, 视频会议服务器可以基于但不 仅限于图像抠取等现有技术将个人视频信息中的与会者的人像从当前背景图 像抠取出, 并合并到预设背景图像中, 以得到该合成视频。 当会场中设置背 景墙时, 预设背景图像具体可以与背景墙的图像一致, 以形成具有统一会场 感的画面效果。 在实际应用过程中, 人像抠取的工作也可以由视频终端实 现, 视频终端可以直接将抠取好的人像发送给视频会议服务器。
图 8为本发明实施例提供的第六种视频会议处理方法流程图。 如图 8所 示, 本实施例提供的视频会议处理方法具体可以与本发明任意实施例提供的 应用于视频会议服务器的方法配合实现, 具体实现过程在此不再赘述。 本实 施例提供的视频会议处理方法具体包括:
步骤 C10、 第三视频终端接收视频会议服务器发送的合成视频, 其中, 所述合成视频通过所述视频会议服务器从第一视频终端接收到的第一视频信 息、 及从第二视频终端接收到的第二视频信息中共获得预设数量的个人视频 信息, 并将所述预设数量的所述个人视频信息合成而得到, 在所述合成视频 中, 所述预设数量的所述个人视频信息分别对应的与会者均处于一致的会场 背景中, 所述第一视频信息包括所述第一视频终端所在的会场中每个与会者 的个人视频信息, 所述第二视频信息包括所述第二视频终端所在的会场中每 个与会者的个人视频信息;
步骤 C20、 所述第三视频终端将所述合成视频进行显示。
本实施例提供的视频会议处理方法, 第三视频终端接收视频会议服务器 发送的合成视频, 其中, 合成视频通过视频会议服务器从第一视频终端接收 到的第一视频信息、 及从第二视频终端接收到的第二视频信息中共获得预设 数量的个人视频信息, 并将预设数量的个人视频信息合成而得到, 在合成视 频中,预设数量的个人视频信息分别对应的与会者均处于一致的会场背景中,
将合成视频进行显示。 由于合成视频的生成是以与会者的视频信息为基础, 避免了以会场的整体视频为基础时, 由于会场的不均等性造成的显示效果不 佳的缺陷, 突破了物理空间的限制, 为最大效率的信息显示提供了灵活性。
图 9为本发明实施例提供的第七种视频会议处理方法流程图。 如图 9所 示, 在本实施例中, 步骤 C10 , 所述方法还可以包括:
步骤 C30、 所述第三视频终端接收至少一个视频釆集装置发送的视频信 息, 将接收到的视频信息打包形成第三视频信息, 其中, 所述至少一个视频 釆集装置中的每个视频釆集装置用以釆集第三视频终端所在的会场中至少一 个与会者的视频信息, 所述第三视频信息包括所述第三视频终端所在的会场 中每个与会者的个人视频信息。
步骤 C31、 所述第三视频终端将所述第三视频信息发送给所述视频会议 服务器, 以使所述视频会议服务器根据所述第三视频信息和所述第一视频信 息生成合成视频发送给所述第二视频终端, 或根据所述第三视频信息和所述 第二视频信息生成合成视频发送给所述第一视频终端。
在本实施例中, 步骤 C20, 所述第三视频终端将所述合成视频进行显示 之后, 所述方法还可以包括:
步骤 C40, 所述第三视频终端根据接收到的用户输入的切换指示信息生 成第一与会者替换指令, 其中, 所述第一与会者替换指令中携带有第一与会 者标识和第二与会者标识, 所述第一与会者替换指令用于指示用所述第二与 会者标识指示的个人视频信息替换所述合成视频中包含的所述第一与会者标 识指示的个人视频信息, 所述第一与会者标识指示的个人视频信息是包含于 所述合成视频中的个人视频信息, 所述第二与会者标识指示的个人视频信息 是包含于除所述合成视频中之外的所述第一视频信息的个人视频信息、 或包 含于除所述合成视频中之外的所述第二视频信息的个人视频信息;
步骤 C50、 所述第三视频终端向所述视频会议服务器发送所述第一与会 者替换指令用以使所述视频会议服务器根据所述第一与会者替换指令用所述 第二与会者标识指示的个人视频信息替换所述合成视频中所述第一与会者标 识指示的个人视频信息;
步骤 C60、 所述第三视频终端接收所述视频会议服务器根据所述第一与 会者替换指令发送的替换所述个人视频信息后的所述合成视频并显示。
图 10为本发明实施例提供的第八种视频会议处理方法流程图。 如图 10 所示, 在本实施例中, 步骤 C20, 所述第三视频终端将所述合成视频进行显 示之后, 所述方法还包括:
步骤 C41、 所述第三视频终端当在预设时间范围内检测到所述第三视频 终端所在的会场中的语音釆集装置具有语音输入时, 确定用以指示正在发言 的与会者的第三与会者标识;
步骤 C51、 所述第三视频终端生成携带有所述第三与会者标识的第二与 会者替换指令, 其中, 所述第二与会者替换指令用于指示用所述第三与会者 标识指示的个人视频信息替换发送给所述第一视频终端或所述第二视频终端 的合成视频包含的个人视频信息;
步骤 C61、 所述第三视频终端向所述视频会议服务器发送所述第二与会 者替换指令用以使所述视频会议服务器根据所述第二与会者替换指令从所述 发送给所述第一视频终端或第二视频终端的合成视频所包含的个人视频信息 中选择目标个人视频信息, 将选择的所述目标个人视频信息替换为所述第三 与会者标识指示的个人视频信息。
具体地, 第三视频终端在持续一段时间内捕捉到声音强度大于预设阈值 的声音时, 可以认为检测到存在与会者正在发言, 以避免突发的声响对切换 流程的频繁触发。
图 11 为本发明实施例提供的第九种视频会议处理方法流程图。 如图 11 所示, 在本实施例中, 步骤 C20, 所述第三视频终端将所述合成视频进行显 示之后, 所述方法还包括:
步骤 C42、 所述第三视频终端根据接收到的用户输入的添加指示信息生 成添加指令, 其中, 所述添加指令中携带有第四与会者标识, 所述添加指令 用于指示在所述合成视频中添加所述第四与会者标识指示的个人视频信息, 所述第四与会者标识指示的个人视频信息是包含于除所述合成视频中之外的 所述第一视频信息的个人视频信息、 或包含于除所述合成视频中之外的所述 第二视频信息的个人视频信息;
步骤 C52、 所述第三视频终端向所述视频会议服务器发送所述添加指令 用以使所述视频服务器根据所述添加指令在所述合成视频中添加所述第四与 会者标识指示的个人视频信息;
步骤 C62、 所述第三视频终端接收所述视频会议服务器根据所述添加指 令发送的添加所述个人视频信息后的所述合成视频并显示。
图 12为本发明实施例提供的第十种视频会议处理方法流程图。 如图 12 所示, 在本实施例中, 步骤 C20, 所述第三视频终端将所述合成视频进行显 示之后, 所述方法还包括:
步骤 C43、 所述第三视频终端根据接收到的用户输入的删除指示信息生 成删除指令, 其中, 所述删除指令中携带有第五与会者标识, 所述删除指令 用于指示从所述合成视频所包含的个人视频信息中删除所述第五与会者标识 指示的个人视频信息, 所述第五与会者标识指示的个人视频信息是包含于所 述合成视频中的个人视频信息;
步骤 C53、 所述第三视频终端向所述视频会议服务器发送所述删除指令 信息中删除所述第五与会者标识指示的个人视频信息;
步骤 C63、 所述第三视频终端接收所述视频会议服务器根据所述删除指 令发送的删除所述个人视频信息后的所述合成视频并显示。
图 13 为本发明实施例提供的第一种视频会议服务器结构示意图。 如图 13所示, 本实施例提供的视频会议服务器具体可以实现本发明任意实施例提 供的应用于视频会议服务器的视频会议处理方法的各个步骤, 具体实现过程 在此不再赞述。
本实施例提供的视频会议服务器, 具体包括:
接收单元 1 1 , 用于接收第一视频终端发送的第一视频信息, 以及第二视 频终端发送的第二视频信息, 其中, 所述第一视频信息包括所述第一视频终 端所在的会场中每个与会者的个人视频信息, 所述第二视频信息包括所述第 处理单元 12, 与所述接收单元 1 1相连, 用于从接收到的所述第二视频 信息、 及所述第一视频信息中共获得预设数量的所述个人视频信息, 并将所 述预设数量的所述个人视频信息合成以生成合成视频, 以使在所述合成视频 中, 所述预设数量的所述个人视频信息分别对应的与会者均处于一致的会场 背景中;
发送单元 13 , 与所述处理单元 12相连, 用于将所述合成视频发送给所 述第三视频终端进行显示。
本实施例提供的视频会议服务器, 由于合成视频的生成是以与会者的视 频信息为基础, 避免了以会场的整体视频为基础时, 由于会场的不均等性造 成的显示效果不佳的缺陷, 突破了物理空间的限制, 为最大效率的信息显示 提供了灵活性。
在本实施例中, 所述接收单元 1 1还用于接收所述第一视频终端、所述第 二视频终端、 所述第三视频终端发送的第一与会者替换指令, 其中, 所述第 一与会者替换指令中携带有第一与会者标识和第二与会者标识, 所述第一与 会者替换指令用于指示用所述第二与会者标识指示的个人视频信息替换所述 合成视频中包含的所述第一与会者标识指示的个人视频信息, 所述第一与会 者标识指示的个人视频信息是包含于所述合成视频中的个人视频信息, 所述 第二与会者标识指示的个人视频信息是包含于除所述合成视频中之外的所述 第一视频信息的个人视频信息、 或包含于除所述合成视频中之外的所述第二 视频信息的个人视频信息。 所述处理单元 12还用于根据所述第一与会者替 换指令用所述第二与会者标识指示的个人视频信息替换所述合成视频中所述 第一与会者标识指示的个人视频信息。 所述发送单元 13 还用于将替换所述 个人视频信息后的所述合成视频发送给所述第三视频终端进行显示。
在本实施例中, 所述接收单元 11还用于接收所述第一视频终端、所述第 二视频终端、 或所述第三视频终端发送的第一与会者替换指令, 其中, 所述 第二与会者替换指令中携带有第三与会者标识, 所述第二与会者替换指令用 于指示用所述第三与会者标识指示的个人视频信息替换所述合成视频包含的 个人视频信息, 所述第三与会者标识指示的个人视频信息是包含于除所述合 成视频中之外的所述第一视频信息的个人视频信息、 或包含于除所述合成视 频中之外的所述第二视频信息的个人视频信息。 所述处理单元 12还用于根 据所述第二与会者替换指令从所述合成视频所包含的个人视频信息中选择目 标个人视频信息, 将选择的所述目标个人视频信息替换为所述第三与会者标 识指示的个人视频信息。 所述发送单元 13 还用于将替换所述个人视频信息 后的所述合成视频发送给所述第三视频终端进行显示。
在本实施例中, 所述第二与会者替换指令中还携带有位置信息。 所述处 理单元 12还用于根据所述第二与会者替换指令将所述合成视频中所包含的 与所述第二与会者替换指令中所包含位置信息对应的个人视频信息作为所述 的目标个人视频信息。
在本实施例中, 所述接收单元 11还用于接收所述第一视频终端、所述第 二视频终端、 或所述第三视频终端发送的添加指令, 其中, 所述添加指令中 携带有第四与会者标识, 所述添加指令用于指示在所述合成视频中添加所述 第四与会者标识指示的个人视频信息, 所述第四与会者标识指示的个人视频 信息是包含于除所述合成视频中之外的所述第一视频信息的个人视频信息、 或包含于除所述合成视频中之外的所述第二视频信息的个人视频信息。 所述 处理单元 12还用于根据所述添加指令在所述合成视频中添加所述第四与会 者标识指示的个人视频信息。 所述发送单元 13 还用于将添加所述个人视频 信息后的所述合成视频发送给所述第三视频终端进行显示。
在本实施例中, 所述接收单元 11还用于接收所述第一视频终端、所述第 二视频终端、 或所述第三视频终端发送的删除指令, 其中, 所述删除指令中 携带有第五与会者标识, 所述删除指令用于指示从所述合成视频所包含的个 人视频信息中删除所述第五与会者标识指示的个人视频信息, 所述第五与会 者标识指示的个人视频信息是包含于所述合成视频中的个人视频信息。 所述 息中删除所述第五与会者标识指示的个人视频信息。 所述发送单元 13 还用 于将删除所述个人视频信息后的所述合成视频发送给所述第三视频终端进行 显示。
在本实施例中, 所述处理单元 12还用于对所述预设数量的所述个人视 频信息中的对应图像进行拼接, 生成合成图像, 其中, 所述预设数量的所述 个人视频信息中的对应图像在时序上同步; 组合多幅所述合成图像以生成合 成视频。
在本实施例中, 所述处理单元 12还用于将所述预设数量的所述个人视 频信息包含的图像信息排列在预设背景图像中生成合成图像, 其中, 从所述 第一视频信息、 所述第二视频信息中获得的所述预设数量的所述个人视频信 息包含的图像信息在时序上同步; 组合多幅所述合成图像以生成合成视频。
图 14为本发明实施例提供的第一种第三视频终端结构示意图。 如图 14 所示, 本实施例提供的第三视频终端 600具体可以实现本发明任意实施例提 供的用于第三视频终端的视频会议处理方法的各个步骤, 具体实现过程在此 不再赘述。
本实施例提供的第三视频终端 600, 包括:
接收单元 21 , 用于接收视频会议服务器发送的合成视频, 其中, 所述合 成视频通过所述视频会议服务器从第一视频终端接收到的第一视频信息、 及 从第二视频终端接收到的第二视频信息中共获得预设数量的个人视频信息, 并将所述预设数量的所述个人视频信息合成而得到, 在所述合成视频中, 所 述预设数量的所述个人视频信息分别对应的与会者均处于一致的会场背景 中; 所述第一视频信息包括所述第一视频终端所在的会场中每个与会者的个 人视频信息, 所述第二视频信息包括所述第二视频终端所在的会场中每个与 会者的个人视频信息;
显示单元 22, 与所述接收单元 21相连, 用于将所述合成视频进行显示。 本实施例提供的第三视频终端 600, 由于合成视频的生成是以与会者的 视频信息为基础, 避免了以会场的整体视频为基础时, 由于会场的不均等性 造成的显示效果不佳的缺陷, 突破了物理空间的限制, 为最大效率的信息显 示提供了灵活性。
图 15为本发明实施例提供的第二种第三视频终端 600结构示意图。 如图 15所示, 在本实施例中, 所述接收单元 21还用于接收至少一个视频釆集装 置发送的视频信息。 所述第三视频终端 600还可以包括:
第一处理单元 211 , 与所述接收单元 21相连, 将接收到的视频信息打包 形成第三视频信息, 其中, 所述至少一个视频釆集装置中的每个视频釆集装 置用以釆集第三视频终端 600所在的会场中至少一个与会者的视频信息, 所 述第三视频信息包括所述第三视频终端 600所在的会场中每个与会者的个人 视频信息。
第一发送单元 212, 与所述第一处理单元 211相连, 用于将所述第三视 频信息发送给所述视频会议服务器, 以使所述视频会议服务器根据所述第三 视频信息和所述第一视频信息生成合成视频发送给所述第二视频终端, 或根 据所述第三视频信息和所述第二视频信息生成合成视频发送给所述第一视频 终端。
在本实施例中, 所述第三视频终端 600还可以包括:
第二处理单元 221 , 用于根据接收到的用户输入的切换指示信息生成第 一与会者替换指令, 其中, 所述第一与会者替换指令中携带有第一与会者标 识和第二与会者标识, 所述第一与会者替换指令用于指示用所述第二与会者 标识指示的个人视频信息替换所述合成视频中包含的所述第一与会者标识指 示的个人视频信息, 所述第一与会者标识指示的个人视频信息是包含于所述 合成视频中的个人视频信息, 所述第二与会者标识指示的个人视频信息是包 含于除所述合成视频中之外的所述第一视频信息的个人视频信息、 或包含于 除所述合成视频中之外的所述第二视频信息的个人视频信息;
第二发送单元 222, 与所述第二处理单元 221相连, 用于向所述视频会 议服务器发送所述第一与会者替换指令用以使所述视频会议服务器根据所述 第一与会者替换指令用所述第二与会者标识指示的个人视频信息替换所述合 成视频中所述第一与会者标识指示的个人视频信息;
所述接收单元 21 还用于接收所述视频会议服务器根据所述第一与会者 替换指令发送的替换所述个人视频信息后的所述合成视频并通过所述显示单 元 22显示。
在本实施例中, 所述第三视频终端 600还可以包括:
第三处理单元 231 , 用于当在预设时间范围内检测到所述第三视频终端 600 所在的会场中的语音釆集装置具有语音输入时, 确定用以指示正在发言 的与会者的第三与会者标识; 生成携带有所述第三与会者标识的第二与会者 替换指令, 其中, 所述第二与会者替换指令用于指示用所述第三与会者标识 指示的个人视频信息替换发送给所述第一视频终端或所述第二视频终端的合 成视频包含的个人视频信息;
第三发送单元 232, 与所述第三处理单元 231相连, 用于向所述视频会 议服务器发送所述第二与会者替换指令用以使所述视频会议服务器根据所述 第二与会者替换指令从所述发送给所述第一视频终端或第二视频终端的合成 视频所包含的个人视频信息中选择目标个人视频信息, 将选择的所述目标个 人视频信息替换为所述第三与会者标识指示的个人视频信息。
在本实施例中, 所述第三视频终端 600还可以包括:
第四处理单元 241 , 用于根据接收到的用户输入的添加指示信息生成添 加指令, 其中, 所述添加指令中携带有第四与会者标识, 所述添加指令用于 指示在所述合成视频中添加所述第四与会者标识指示的个人视频信息, 所述 第四与会者标识指示的个人视频信息是包含于除所述合成视频中之外的所述 第一视频信息的个人视频信息、 或包含于除所述合成视频中之外的所述第二 视频信息的个人视频信息;
第四发送单元 242, 与所述第四处理单元 241相连, 用于向所述视频会 议服务器发送所述添加指令用以使所述视频服务器根据所述添加指令在所述 合成视频中添加所述第四与会者标识指示的个人视频信息;
所述接收单元 21 还用于接收所述视频会议服务器根据所述添加指令发 送的添加所述个人视频信息后的所述合成视频并通过所述显示单元 22显示。 在本实施例中 , 所述第三视频终端 600还可以包括:
第五处理单元 251 , 用于根据接收到的用户输入的删除指示信息生成删 除指令, 其中, 所述删除指令中携带有第五与会者标识, 所述删除指令用于 指示从所述合成视频所包含的个人视频信息中删除所述第五与会者标识指示 的个人视频信息, 所述第五与会者标识指示的个人视频信息是包含于所述合 成视频中的个人视频信息;
第五发送单元 252, 与所述第五处理单元 251相连, 用于向所述视频会 议服务器发送所述删除指令用以使所述视频服务器根据所述删除指令在所述 合成视频所包含的个人视频信息中删除所述第五与会者标识指示的个人视频 信息;
所述接收单元 21 还用于接收所述视频会议服务器根据所述删除指令发 送的删除所述个人视频信息后的所述合成视频并通过所述显示单元 22显示。
图 16 为本发明实施例提供的第二种视频会议服务器结构示意图。 如图 16所示, 本实施例提供的视频会议服务器 700具体可以实现本发明任意实施 例提供的应用于视频会议服务器的视频会议处理方法的各个步骤, 具体实现 过程在此不再赘述。
本实施例提供的视频会议服务器 700包括: 处理器 710, 通信接口 720, 存储器 730和总线 740, 其中所述处理器 710、 所述通信接口 720和所述存储 器 730通过所述总线 740互联。 所述通信接口 720用于接收第一视频终端发 送的第一视频信息, 以及第二视频终端发送的第二视频信息, 其中, 所述第 息, 所述第二视频信息包括所述第二视频终端所在的会场中每个与会者的个 人视频信息。 所述存储器 730用于存储指令或数据。 所述处理器 710调用 存储在所述存储器 730 中的指令以实现从接收到的所述第二视频信息、 及 所述第一视频信息中共获得预设数量的所述个人视频信息, 并将所述预设数 量的所述个人视频信息合成以生成合成视频, 以使在所述合成视频中, 所述 预设数量的所述个人视频信息分别对应的与会者均处于一致的会场背景中。 所述通信接口 720 还用于将所述合成视频发送给所述第三视频终端进行显 示。
在本实施例中, 所述通信接口 720还用于接收所述第一视频终端、 所述 第二视频终端、 所述第三视频终端发送的第一与会者替换指令, 其中, 所述 第一与会者替换指令中携带有第一与会者标识和第二与会者标识, 所述第 ― 与会者替换指令用于指示用所述第二与会者标识指示的个人视频信息替换所 述合成视频中包含的所述第一与会者标识指示的个人视频信息, 所述第一与 会者标识指示的个人视频信息是包含于所述合成视频中的个人视频信息, 所 述第二与会者标识指示的个人视频信息是包含于除所述合成视频中之外的所 述第一视频信息的个人视频信息、 或包含于除所述合成视频中之外的所述第 二视频信息的个人视频信息。 所述处理器 710还用于调用所述存储器 730的 指令和数据以实现, 根据所述第一与会者替换指令用所述第二与会者标识 指示的个人视频信息替换所述合成视频中所述第一与会者标识指示的个人视 频信息。 所述通信接口 720还用于将替换所述个人视频信息后的所述合成视 频发送给所述第三视频终端进行显示。
在本实施例中, 所述通信接口 720还用于接收所述第一视频终端、 所述 第二视频终端、 或所述第三视频终端发送的第一与会者替换指令, 其中, 所 述第二与会者替换指令中携带有第三与会者标识, 所述第二与会者替换指令 用于指示用所述第三与会者标识指示的个人视频信息替换所述合成视频包含 的个人视频信息, 所述第三与会者标识指示的个人视频信息是包含于除所述 合成视频中之外的所述第一视频信息的个人视频信息、 或包含于除所述合成 视频中之外的所述第二视频信息的个人视频信息。 所述处理器 710还用于调 用所述存储器 730的指令和数据以实现, 根据所述第二与会者替换指令从所 述合成视频所包含的个人视频信息中选择目标个人视频信息, 将选择的所述 目标个人视频信息替换为所述第三与会者标识指示的个人视频信息。 所述通 信接口 720还用于将替换所述个人视频信息后的所述合成视频发送给所述第 三视频终端进行显示。
在本实施例中, 所述第二与会者替换指令中还携带有位置信息。 所述处 理器 710还用于调用所述存储器 730的指令和数据以实现, 根据所述第二与 含位置信息对应的个人视频信息作为所述的目标个人视频信息。
在本实施例中, 所述通信接口 720还用于接收所述第一视频终端、 所述 第二视频终端、 或所述第三视频终端发送的添加指令, 其中, 所述添加指令 中携带有第四与会者标识, 所述添加指令用于指示在所述合成视频中添加所 述第四与会者标识指示的个人视频信息, 所述第四与会者标识指示的个人视 频信息是包含于除所述合成视频中之外的所述第一视频信息的个人视频信 息、 或包含于除所述合成视频中之外的所述第二视频信息的个人视频信息。 所述处理器 710还用于调用所述存储器 730的指令和数据以实现, 根据所述 添加指令在所述合成视频中添加所述第四与会者标识指示的个人视频信息。 所述通信接口 720还用于将添加所述个人视频信息后的所述合成视频发送给 所述第三视频终端进行显示。
在本实施例中, 所述通信接口 720还用于接收所述第一视频终端、 所述 第二视频终端、 或所述第三视频终端发送的删除指令, 其中, 所述删除指令 中携带有第五与会者标识, 所述删除指令用于指示从所述合成视频所包含的 个人视频信息中删除所述第五与会者标识指示的个人视频信息, 所述第五与 会者标识指示的个人视频信息是包含于所述合成视频中的个人视频信息。 所 述处理器 710还用于调用所述存储器 730的指令和数据以实现, 根据所述删 除指令在所述合成视频所包含的个人视频信息中删除所述第五与会者标识指 示的个人视频信息。 所述通信接口 720还用于将删除所述个人视频信息后的 所述合成视频发送给所述第三视频终端进行显示。
在本实施例中, 所述处理器 710还用于对所述预设数量的所述个人视频 信息中的对应图像进行拼接, 生成合成图像, 其中, 所述预设数量的所述个 人视频信息中的对应图像在时序上同步; 组合多幅所述合成图像以生成合成 视频。
在本实施例中, 所述处理器 710还用于将所述预设数量的所述个人视频 信息包含的图像信息排列在预设背景图像中生成合成图像, 其中, 从所述第 一视频信息、 所述第二视频信息中获得的所述预设数量的所述个人视频信息 包含的图像信息在时序上同步; 组合多幅所述合成图像以生成合成视频。
图 17为本发明实施例提供的第三种第一视频终端结构示意图。 如图 17 所示, 本实施例提供的第一视频终端 800具体可以实现本发明任意实施例提 赘述。
本实施例提供的第一视频终端 800包括: 处理器 810, 通信接口 820, 存 储器 830和总线 840 , 其中所述处理器 810、 所述通信接口 820和所述存储器 830通过所述总线 840互联。 所述通信接口 820用于接收视频会议服务器发 送的合成视频, 其中, 所述合成视频通过所述视频会议服务器从第一视频终 端接收到的第一视频信息、 及从第二视频终端接收到的第二视频信息中共获 得预设数量的个人视频信息, 并将所述预设数量的所述个人视频信息合成而 得到, 在所述合成视频中, 所述预设数量的所述个人视频信息分别对应的与 会者均处于一致的会场背景中; 所述第一视频信息包括所述第一视频终端所 在的会场中每个与会者的个人视频信息, 所述第二视频信息包括所述第二视 频终端所在的会场中每个与会者的个人视频信息。 所述存储器 830用于存储 指令或数据。 所述处理器 810调用存储在所述存储器 830中的指令以实现 将所述合成视频通过所述显示器进行显示。
在本实施例中, 所述通信接口 820还用于接收至少一个视频釆集装置发 送的视频信息。 所述处理器 810还用于将接收到的视频信息打包形成第三视 频信息, 其中, 所述至少一个视频釆集装置中的每个视频釆集装置用以釆集 第三视频终端所在的会场中至少一个与会者的视频信息, 所述第三视频信息 接口 820还用于将所述第三视频信息发送给所述视频会议服务器, 以使所述 视频会议服务器根据所述第三视频信息和所述第一视频信息生成合成视频发 送给所述第二视频终端, 或根据所述第三视频信息和所述第二视频信息生成 合成视频发送给所述第一视频终端。
在本实施例中, 所述处理器 810还用于调用所述存储器 830的指令和 数据以实现, 根据接收到的用户输入的切换指示信息生成第一与会者替换 指令, 其中, 所述第一与会者替换指令中携带有第一与会者标识和第二与会 者标识, 所述第一与会者替换指令用于指示用所述第二与会者标识指示的个 人视频信息替换所述合成视频中包含的所述第一与会者标识指示的个人视频 信息, 所述第一与会者标识指示的个人视频信息是包含于所述合成视频中的 个人视频信息, 所述第二与会者标识指示的个人视频信息是包含于除所述合 成视频中之外的所述第一视频信息的个人视频信息、 或包含于除所述合成视 频中之外的所述第二视频信息的个人视频信息。 所述通信接口 820还用于向 所述视频会议服务器发送所述第一与会者替换指令用以使所述视频会议服务 器根据所述第一与会者替换指令用所述第二与会者标识指示的个人视频信息 替换所述合成视频中所述第一与会者标识指示的个人视频信息; 接收所述视 频会议服务器根据所述第一与会者替换指令发送的替换所述个人视频信息后 的所述合成视频并通过所述显示器显示。
在本实施例中, 所述处理器 810还用于当在预设时间范围内检测到所述 第三视频终端所在的会场中的语音釆集装置具有语音输入时, 确定用以指示 正在发言的与会者的第三与会者标识; 生成携带有所述第三与会者标识的第 二与会者替换指令, 其中, 所述第二与会者替换指令用于指示用所述第三与 会者标识指示的个人视频信息替换发送给所述第一视频终端或所述第二视频 终端的合成视频包含的个人视频信息。 所述通信接口 820还用于向所述视频 会议服务器发送所述第二与会者替换指令用以使所述视频会议服务器根据所 述第二与会者替换指令从所述发送给所述第一视频终端或第二视频终端的合 成视频所包含的个人视频信息中选择目标个人视频信息, 将选择的所述目标 个人视频信息替换为所述第三与会者标识指示的个人视频信息。
在本实施例中, 所述处理器 810还用于调用所述存储器 830的指令和 数据以实现, 根据接收到的用户输入的添加指示信息生成添加指令, 其中, 所述添加指令中携带有第四与会者标识, 所述添加指令用于指示在所述合成 视频中添加所述第四与会者标识指示的个人视频信息, 所述第四与会者标识 指示的个人视频信息是包含于除所述合成视频中之外的所述第一视频信息的 个人视频信息、 或包含于除所述合成视频中之外的所述第二视频信息的个人 视频信息。 所述通信接口 820还用于向所述视频会议服务器发送所述添加指 令用以使所述视频服务器根据所述添加指令在所述合成视频中添加所述第四 与会者标识指示的个人视频信息; 接收所述视频会议服务器根据所述添加指 令发送的添加所述个人视频信息后的所述合成视频并通过所述显示器显示。
在本实施例中, 所述处理器 810还用于调用所述存储器 830的指令和 数据以实现, 根据接收到的用户输入的删除指示信息生成删除指令, 其中, 所述删除指令中携带有第五与会者标识, 所述删除指令用于指示从所述合成 视频所包含的个人视频信息中删除所述第五与会者标识指示的个人视频信 息, 所述第五与会者标识指示的个人视频信息是包含于所述合成视频中的个 人视频信息。 所述通信接口 820还用于向所述视频会议服务器发送所述删除 指令用以使所述视频服务器根据所述删除指令在所述合成视频所包含的个人 视频信息中删除所述第五与会者标识指示的个人视频信息; 接收所述视频会 议服务器根据所述删除指令发送的删除所述个人视频信息后的所述合成视频 并通过所述显示器显示。
本领域普通技术人员可以理解: 实现上述方法实施例的全部或部分步骤 可以通过程序指令相关的硬件来完成, 前述的程序可以存储于一计算机可读 取存储介质中, 该程序在执行时, 执行包括上述方法实施例的步骤; 而前述 的存储介质包括: ROM、 RAM, 磁碟或者光盘等各种可以存储程序代码的 介质。
最后应说明的是: 以上各实施例仅用以说明本发明的技术方案, 而非对 其限制; 尽管参照前述各实施例对本发明进行了详细的说明, 本领域的普通 技术人员应当理解: 其依然可以对前述各实施例所记载的技术方案进行修 改, 或者对其中部分或者全部技术特征进行等同替换; 而这些修改或者替 换, 并不使相应技术方案的本质脱离本发明各实施例技术方案的范围。

Claims

权 利 要 求 书
1、 一种视频会议处理方法, 其特征在于, 包括:
视频会议服务器接收第一视频终端发送的第一视频信息, 以及第二视频 终端发送的第二视频信息, 其中, 所述第一视频信息包括所述第一视频终端 所在的会场中每个与会者的个人视频信息, 所述第二视频信息包括所述第二 所述视频会议服务器从接收到的所述第二视频信息、 及所述第一视频信 息中共获得预设数量的所述个人视频信息, 并将所述预设数量的所述个人视 频信息合成以生成合成视频, 以使在所述合成视频中, 所述预设数量的所述 个人视频信息分别对应的与会者均处于一致的会场背景中;
所述视频会议服务器将所述合成视频发送给所述第三视频终端进行显 示。
2、 根据权利要求 1所述的视频会议处理方法, 其特征在于, 所述视频会 议服务器将所述合成视频发送给所述第三视频终端进行显示之后, 所述方法 还包括:
所述视频会议服务器接收所述第一视频终端、 所述第二视频终端、 或所 述第三视频终端发送的第一与会者替换指令, 其中, 所述第一与会者替换指 令中携带有第一与会者标识和第二与会者标识, 所述第一与会者替换指令用 于指示用所述第二与会者标识指示的个人视频信息替换所述合成视频中包含 的所述第一与会者标识指示的个人视频信息, 所述第一与会者标识指示的个 人视频信息是包含于所述合成视频中的个人视频信息, 所述第二与会者标识 指示的个人视频信息是包含于除所述合成视频中之外的所述第一视频信息的 个人视频信息、 或包含于除所述合成视频中之外的所述第二视频信息的个人 视频信息;
所述视频会议服务器根据所述第一与会者替换指令用所述第二与会者标 识指示的个人视频信息替换所述合成视频中所述第一与会者标识指示的个人 视频信息;
所述视频会议服务器将替换所述个人视频信息后的所述合成视频发送给 所述第三视频终端进行显示。
3、 根据权利要求 1所述的视频会议处理方法, 其特征在于, 所述视频会 议服务器将所述合成视频发送给所述第三视频终端进行显示之后, 所述方法 还包括:
所述视频会议服务器接收所述第一视频终端、 所述第二视频终端、 或第 三视频终端发送的第二与会者替换指令, 其中, 所述第二与会者替换指令中 携带有第三与会者标识, 所述第二与会者替换指令用于指示用所述第三与会 者标识指示的个人视频信息替换所述合成视频包含的个人视频信息, 所述第 三与会者标识指示的个人视频信息是包含于除所述合成视频中之外的所述第 一视频信息的个人视频信息、 或包含于除所述合成视频中之外的所述第二视 频信息的个人视频信息;
所述视频会议服务器根据所述第二与会者替换指令从所述合成视频所包 含的个人视频信息中选择目标个人视频信息, 将选择的所述目标个人视频信 息替换为所述第三与会者标识指示的个人视频信息;
所述视频会议服务器将替换所述个人视频信息后的所述合成视频发送给 所述第三视频终端进行显示。
4、 根据权利要求 3 所述的视频会议处理方法, 其特征在于, 所述第二 与会者替换指令中还携带有位置信息 , 所述视频会议服务器根据所述第二与 会者替换指令从所述合成视频所包含的个人视频信息中选择目标个人视频信 息包括:
所述视频会议服务器根据所述第二与会者替换指令将所述合成视频中所 包含的与所述第二与会者替换指令中所包含位置信息对应的个人视频信息作 为所述的目标个人视频信息。
5、 根据权利要求 1 所述的视频会议处理方法, 其特征在于, 所述视频 会议服务器将所述合成视频发送给所述第三视频终端进行显示之后, 所述方 法还包括:
所述视频会议服务器接收所述第一视频终端、 所述第二视频终端、 或所 述第三视频终端发送的添加指令, 其中, 所述添加指令中携带有第四与会者 标识, 所述添加指令用于指示在所述合成视频中添加所述第四与会者标识指 示的个人视频信息, 所述第四与会者标识指示的个人视频信息是包含于除所 述合成视频中之外的所述第一视频信息的个人视频信息、 或包含于除所述合 成视频中之外的所述第二视频信息的个人视频信息; 所述视频服务器根据所述添加指令在所述合成视频中添加所述第四与会 者标识指示的个人视频信息;
所述视频会议服务器将添加所述个人视频信息后的所述合成视频发送给 所述第三视频终端进行显示。
6、 根据权利要求 1 所述的视频会议处理方法, 其特征在于, 所述视频 会议服务器将所述合成视频发送给所述第三视频终端进行显示之后, 所述方 法还包括:
所述视频会议服务器接收所述第一视频终端、 所述第二视频终端、 或所 述第三视频终端发送的删除指令, 其中, 所述删除指令中携带有第五与会者 标识, 所述删除指令用于指示从所述合成视频所包含的个人视频信息中删除 所述第五与会者标识指示的个人视频信息, 所述第五与会者标识指示的个人 视频信息是包含于所述合成视频中的个人视频信息; 息中删除所述第五与会者标识指示的个人视频信息;
所述视频会议服务器将删除所述个人视频信息后的所述合成视频发送给 所述第三视频终端进行显示。
7、 根据权利要求 1-6任一所述的视频会议处理方法, 其特征在于, 所述 视频会议服务器将所述预设数量的所述个人视频信息合成以生成合成视频包 括:
所述视频会议服务器对所述预设数量的所述个人视频信息中的对应图像 进行拼接, 生成合成图像, 其中, 所述预设数量的所述个人视频信息中的对 应图像在时序上同步; 组合多幅所述合成图像以生成合成视频。
8、 根据权利要求 1-6任一所述的视频会议处理方法, 其特征在于, 所述 视频会议服务器将所述预设数量的所述个人视频信息合成以生成合成视频包 括:
所述视频会议服务器将从所述预设数量的所述个人视频信息提取出的图 像信息排列在预设背景图像中生成合成图像, 其中, 从所述第一视频信息、 所述第二视频信息中获得的所述预设数量的所述个人视频信息包含的图像信 息在时序上同步; 组合多幅所述合成图像以生成合成视频。
9、 一种视频会议处理方法, 其特征在于, 包括: 第三视频终端接收视频会议服务器发送的合成视频, 其中, 所述合成视 频通过所述视频会议服务器从第一视频终端接收到的第一视频信息、 及从第 二视频终端接收到的第二视频信息中共获得预设数量的个人视频信息, 并将 所述预设数量的所述个人视频信息合成而得到, 在所述合成视频中, 所述预 设数量的所述个人视频信息分别对应的与会者均处于一致的会场背景中, 所 信息, 所述第二视频信息包括所述第二视频终端所在的会场中每个与会者的 个人视频信息;
所述第三视频终端将所述合成视频进行显示。
10、 根据权利要求 9所述的视频会议处理方法, 其特征在于, 所述方法 还包括:
所述第三视频终端接收至少一个视频釆集装置发送的视频信息, 将接收 到的视频信息打包形成第三视频信息, 其中, 所述至少一个视频釆集装置中 的每个视频釆集装置用以釆集第三视频终端所在的会场中至少一个与会者的 视频信息, 所述第三视频信息包括所述第三视频终端所在的会场中每个与会 者的个人视频信息;
所述第三视频终端将所述第三视频信息发送给所述视频会议服务器, 以 使所述视频会议服务器根据所述第三视频信息和所述第一视频信息生成合成 视频发送给所述第二视频终端, 或根据所述第三视频信息和所述第二视频信 息生成合成视频发送给所述第一视频终端。
1 1、 根据权利要求 9所述的视频会议处理方法, 其特征在于, 所述第三 视频终端将所述合成视频进行显示之后, 所述方法还包括:
所述第三视频终端根据接收到的用户输入的切换指示信息生成第一与会 者替换指令, 其中, 所述第一与会者替换指令中携带有第一与会者标识和第 二与会者标识, 所述第一与会者替换指令用于指示用所述第二与会者标识指 示的个人视频信息替换所述合成视频中包含的所述第一与会者标识指示的个 人视频信息, 所述第一与会者标识指示的个人视频信息是包含于所述合成视 频中的个人视频信息, 所述第二与会者标识指示的个人视频信息是包含于除 所述合成视频中之外的所述第一视频信息的个人视频信息、 或包含于除所述 合成视频中之外的所述第二视频信息的个人视频信息; 所述第三视频终端向所述视频会议服务器发送所述第一与会者替换指令 用以使所述视频会议服务器根据所述第一与会者替换指令用所述第二与会者 标识指示的个人视频信息替换所述合成视频中所述第一与会者标识指示的个 人视频信息;
所述第三视频终端接收所述视频会议服务器根据所述第一与会者替换指 令发送的替换所述个人视频信息后的所述合成视频并显示。
12、 根据权利要求 9所述的视频会议处理方法, 其特征在于, 所述第三 视频终端将所述合成视频进行显示之后, 所述方法还包括:
所述第三视频终端当在预设时间范围内检测到所述第三视频终端所在的 会场中的语音釆集装置具有语音输入时, 确定用以指示正在发言的与会者的 第三与会者标识;
所述第三视频终端生成携带有所述第三与会者标识的第二与会者替换指 令, 其中, 所述第二与会者替换指令用于指示用所述第三与会者标识指示的 个人视频信息替换发送给所述第一视频终端或所述第二视频终端的合成视频 包含的个人视频信息;
所述第三视频终端向所述视频会议服务器发送所述第二与会者替换指令 用以使所述视频会议服务器根据所述第二与会者替换指令从所述发送给所述 第一视频终端或第二视频终端的合成视频所包含的个人视频信息中选择目标 个人视频信息, 将选择的所述目标个人视频信息替换为所述第三与会者标识 指示的个人视频信息。
13、 根据权利要求 9所述的视频会议处理方法, 其特征在于, 所述第三 视频终端将所述合成视频进行显示之后, 所述方法还包括:
所述第三视频终端根据接收到的用户输入的添加指示信息生成添加指 令, 其中, 所述添加指令中携带有第四与会者标识, 所述添加指令用于指示 在所述合成视频中添加所述第四与会者标识指示的个人视频信息, 所述第四 与会者标识指示的个人视频信息是包含于除所述合成视频中之外的所述第一 视频信息的个人视频信息、 或包含于除所述合成视频中之外的所述第二视频 信息的个人视频信息;
所述第三视频终端向所述视频会议服务器发送所述添加指令用以使所述 视频服务器根据所述添加指令在所述合成视频中添加所述第四与会者标识指 示的个人视频信息;
所述第三视频终端接收所述视频会议服务器根据所述添加指令发送的添 加所述个人视频信息后的所述合成视频并显示。
14、 根据权利要求 9所述的视频会议处理方法, 其特征在于, 所述第三 视频终端将所述合成视频进行显示之后, 所述方法还包括:
所述第三视频终端根据接收到的用户输入的删除指示信息生成删除指 令, 其中, 所述删除指令中携带有第五与会者标识, 所述删除指令用于指示 从所述合成视频所包含的个人视频信息中删除所述第五与会者标识指示的个 人视频信息, 所述第五与会者标识指示的个人视频信息是包含于所述合成视 频中的个人视频信息;
所述第三视频终端向所述视频会议服务器发送所述删除指令用以使所述 所述第五与会者标识指示的个人视频信息;
所述第三视频终端接收所述视频会议服务器根据所述删除指令发送的删 除所述个人视频信息后的所述合成视频并显示。
15、 一种视频会议服务器, 其特征在于, 包括:
接收单元, 用于接收第一视频终端发送的第一视频信息, 以及第二视频 终端发送的第二视频信息, 其中, 所述第一视频信息包括所述第一视频终端 所在的会场中每个与会者的个人视频信息, 所述第二视频信息包括所述第二 处理单元, 与所述接收单元相连, 用于从接收到的所述第二视频信息、 及所述第一视频信息中共获得预设数量的所述个人视频信息, 并将所述预设 数量的所述个人视频信息合成以生成合成视频, 以使在所述合成视频中, 所 述预设数量的所述个人视频信息分别对应的与会者均处于一致的会场背景 中;
发送单元, 与所述处理单元相连, 用于将所述合成视频发送给所述第三 视频终端进行显示。
16、 根据权利要求 15所述的视频会议服务器, 其特征在于:
所述接收单元还用于接收所述第一视频终端、 所述第二视频终端、 或所述 第三视频终端发送的第一与会者替换指令, 其中, 所述第一与会者替换指令 中携带有第一与会者标识和第二与会者标识, 所述第一与会者替换指令用于 指示用所述第二与会者标识指示的个人视频信息替换所述合成视频中包含的 所述第一与会者标识指示的个人视频信息, 所述第一与会者标识指示的个人 视频信息是包含于所述合成视频中的个人视频信息, 所述第二与会者标识指 示的个人视频信息是包含于除所述合成视频中之外的所述第一视频信息的个 人视频信息、 或包含于除所述合成视频中之外的所述第二视频信息的个人视 频信息;
所述处理单元还用于根据所述第一与会者替换指令用所述第二与会者标 识指示的个人视频信息替换所述合成视频中所述第一与会者标识指示的个人 视频信息;
所述发送单元还用于将替换所述个人视频信息后的所述合成视频发送给 所述第三视频终端进行显示。
17、 根据权利要求 15所述的视频会议服务器, 其特征在于:
所述接收单元还用于接收所述第一视频终端、 或所述第二视频终端、 或 第三视频终端发送的第二与会者替换指令, 其中, 所述第二与会者替换指令 中携带有第三与会者标识, 所述第二与会者替换指令用于指示用所述第三与 会者标识指示的个人视频信息替换所述合成视频包含的个人视频信息, 所述 第三与会者标识指示的个人视频信息是包含于除所述合成视频中之外的所述 第一视频信息的个人视频信息、 或包含于除所述合成视频中之外的所述第二 视频信息的个人视频信息;
所述处理单元还用于根据所述第二与会者替换指令从所述合成视频所包 含的个人视频信息中选择目标个人视频信息, 将选择的所述目标个人视频信 息替换为所述第三与会者标识指示的个人视频信息;
所述发送单元还用于将替换所述个人视频信息后的所述合成视频发送给 所述第三视频终端进行显示。
18、根据权利要求 17所述的视频会议服务器, 其特征在于: 所述第二与 会者替换指令中还携带有位置信息;
所述处理单元还用于根据所述第二与会者替换指令将所述合成视频中所 包含的与所述第二与会者替换指令中所包含位置信息对应的个人视频信息作 为所述的目标个人视频信息。
19、 根据权利要求 15所述的视频会议服务器, 其特征在于: 所述接收单元还用于接收所述第一视频终端、 所述第二视频终端、 或所 述第三视频终端发送的添加指令, 其中, 所述添加指令中携带有第四与会者 标识, 所述添加指令用于指示在所述合成视频中添加所述第四与会者标识指 示的个人视频信息, 所述第四与会者标识指示的个人视频信息是包含于除所 述合成视频中之外的所述第一视频信息的个人视频信息、 或包含于除所述合 成视频中之外的所述第二视频信息的个人视频信息;
所述处理单元还用于根据所述添加指令在所述合成视频中添加所述第四 与会者标识指示的个人视频信息;
所述发送单元还用于将添加所述个人视频信息后的所述合成视频发送给 所述第三视频终端进行显示。
20、 根据权利要求 15所述的视频会议服务器, 其特征在于:
所述接收单元还用于接收所述所述第一视频终端、 所述第二视频终端、 第三视频终端发送的删除指令, 其中, 所述删除指令中携带有第五与会者标 识, 所述删除指令用于指示从所述合成视频所包含的个人视频信息中删除所 述第五与会者标识指示的个人视频信息, 所述第五与会者标识指示的个人视 频信息是包含于所述合成视频中的个人视频信息;
所述处理单元还用于根据所述删除指令在所述合成视频所包含的个人视 频信息中删除所述第五与会者标识指示的个人视频信息;
所述发送单元还用于将删除所述个人视频信息后的所述合成视频发送给 所述第三视频终端进行显示。
21、 根据权利要求 15-20任一所述的视频会议服务器, 其特征在于: 所述处理单元还用于对所述预设数量的所述个人视频信息中的对应图像 进行拼接, 生成合成图像, 其中, 所述预设数量的所述个人视频信息中的对 应图像在时序上同步; 组合多幅所述合成图像以生成合成视频。
22、 根据权利要求 15-20任一所述的视频会议服务器, 其特征在于: 所述处理单元还用于将所述预设数量的所述个人视频信息包含的图像信 息排列在预设背景图像中生成合成图像, 其中, 从所述第一视频信息、 所述 第二视频信息中获得的所述预设数量的所述个人视频信息包含的图像信息在 时序上同步; 组合多幅所述合成图像以生成合成视频。
23、 一种第三视频终端, 其特征在于, 包括:
接收单元, 用于接收视频会议服务器发送的合成视频, 其中, 所述合成 视频通过所述视频会议服务器从第一视频终端接收到的第一视频信息、 及从 第二视频终端接收到的第二视频信息中共获得预设数量的个人视频信息, 并 将所述预设数量的所述个人视频信息合成而得到, 在所述合成视频中, 所述 预设数量的所述个人视频信息分别对应的与会者均处于一致的会场背景中; 所述第一视频信息包括所述第一视频终端所在的会场中每个与会者的个人视 频信息, 所述第二视频信息包括所述第二视频终端所在的会场中每个与会者 的个人视频信息;
显示单元, 与所述接收单元相连, 用于将所述合成视频进行显示。
24、 根据权利要求 23所述的第三视频终端, 其特征在于:
所述接收单元还用于接收至少一个视频釆集装置发送的视频信息; 所述第三视频终端还包括:
第一处理单元, 与所述接收单元相连, 将接收到的视频信息打包形成第 三视频信息, 其中, 所述至少一个视频釆集装置中的每个视频釆集装置用以 釆集第三视频终端所在的会场中至少一个与会者的视频信息, 所述第三视频 第一发送单元, 与所述第一处理单元相连, 用于将所述第三视频信息发 送给所述视频会议服务器, 以使所述视频会议服务器根据所述第三视频信息 和所述第一视频信息生成合成视频发送给所述第二视频终端, 或根据所述第 三视频信息和所述第二视频信息生成合成视频发送给所述第一视频终端。
25、 根据权利要求 23所述的第三视频终端, 其特征在于, 还包括: 第二处理单元, 用于根据接收到的用户输入的切换指示信息生成第一与 会者替换指令, 其中, 所述第一与会者替换指令中携带有第一与会者标识和 第二与会者标识, 所述第一与会者替换指令用于指示用所述第二与会者标识 指示的个人视频信息替换所述合成视频中包含的所述第一与会者标识指示的 个人视频信息, 所述第一与会者标识指示的个人视频信息是包含于所述合成 视频中的个人视频信息, 所述第二与会者标识指示的个人视频信息是包含于 除所述合成视频中之外的所述第一视频信息的个人视频信息、 或包含于除所 述合成视频中之外的所述第二视频信息的个人视频信息; 第二发送单元, 与所述第二处理单元相连, 用于向所述视频会议服务器 发送所述第一与会者替换指令用以使所述视频会议服务器根据所述第一与会 者替换指令用所述第二与会者标识指示的个人视频信息替换所述合成视频中 所述第一与会者标识指示的个人视频信息;
所述接收单元还用于接收所述视频会议服务器根据所述第一与会者替换 指令发送的替换所述个人视频信息后的所述合成视频并通过所述显示单元显 示。
26、 根据权利要求 23所述的第三视频终端, 其特征在于, 还包括: 第三处理单元, 用于当在预设时间范围内检测到所述第三视频终端所在 的会场中的语音釆集装置具有语音输入时, 确定用以指示正在发言的与会者 的第三与会者标识; 生成携带有所述第三与会者标识的第二与会者替换指 令, 其中, 所述第二与会者替换指令用于指示用所述第三与会者标识指示的 个人视频信息替换发送给所述第一视频终端或所述第二视频终端的合成视频 包含的个人视频信息;
第三发送单元, 与所述第三处理单元相连, 用于向所述视频会议服务器 发送所述第二与会者替换指令用以使所述视频会议服务器根据所述第二与会 者替换指令从所述发送给所述第一视频终端或第二视频终端的合成视频所包 含的个人视频信息中选择目标个人视频信息, 将选择的所述目标个人视频信 息替换为所述第三与会者标识指示的个人视频信息。
27、 根据权利要求 23所述的第三视频终端, 其特征在于, 还包括: 第四处理单元, 用于根据接收到的用户输入的添加指示信息生成添加指 令, 其中, 所述添加指令中携带有第四与会者标识, 所述添加指令用于指示 在所述合成视频中添加所述第四与会者标识指示的个人视频信息, 所述第四 与会者标识指示的个人视频信息是包含于除所述合成视频中之外的所述第一 视频信息的个人视频信息、 或包含于除所述合成视频中之外的所述第二视频 信息的个人视频信息;
第四发送单元, 与所述第四处理单元相连, 用于向所述视频会议服务器 发送所述添加指令用以使所述视频服务器根据所述添加指令在所述合成视频 中添加所述第四与会者标识指示的个人视频信息;
所述接收单元还用于接收所述视频会议服务器根据所述添加指令发送的 添加所述个人视频信息后的所述合成视频并通过所述显示单元显示。
28、 根据权利要求 23所述的第三视频终端, 其特征在于, 还包括: 第五处理单元, 用于根据接收到的用户输入的删除指示信息生成删除指 令, 其中, 所述删除指令中携带有第五与会者标识, 所述删除指令用于指示 从所述合成视频所包含的个人视频信息中删除所述第五与会者标识指示的个 人视频信息, 所述第五与会者标识指示的个人视频信息是包含于所述合成视 频中的个人视频信息;
第五发送单元, 与所述第五处理单元相连, 用于向所述视频会议服务器 发送所述删除指令用以使所述视频服务器根据所述删除指令在所述合成视频 所包含的个人视频信息中删除所述第五与会者标识指示的个人视频信息; 所述接收单元还用于接收所述视频会议服务器根据所述删除指令发送的 删除所述个人视频信息后的所述合成视频并通过所述显示单元显示。
29、 一种视频会议服务器, 其特征在于, 包括: 处理器, 通信接口, 存 储器和总线;
其中所述处理器、 所述通信接口和所述存储器通过所述总线互联; 所述通信接口, 用于接收第一视频终端发送的第一视频信息, 以及第二 视频终端发送的第二视频信息, 其中, 所述第一视频信息包括所述第一视频 终端所在的会场中每个与会者的个人视频信息, 所述第二视频信息包括所述 所述存储器, 用于存储指令或数据;
所述处理器调用存储在所述存储器中的指令以实现从接收到的所述第 二视频信息、 及所述第一视频信息中共获得预设数量的所述个人视频信息, 并将所述预设数量的所述个人视频信息合成以生成合成视频, 以使在所述合 成视频中, 所述预设数量的所述个人视频信息分别对应的与会者均处于一致 的会场背景中;
所述通信接口还用于将所述合成视频发送给所述第三视频终端进行显 示。
30、 根据权利要求 29所述的视频会议服务器, 其特征在于:
所述通信接口还用于接收所述第一视频终端、 所述第二视频终端、 所述 第三视频终端发送的第一与会者替换指令, 其中, 所述第一与会者替换指令 中携带有第一与会者标识和第二与会者标识, 所述第一与会者替换指令用于 指示用所述第二与会者标识指示的个人视频信息替换所述合成视频中包含的 所述第一与会者标识指示的个人视频信息, 所述第一与会者标识指示的个人 视频信息是包含于所述合成视频中的个人视频信息, 所述第二与会者标识指 示的个人视频信息是包含于除所述合成视频中之外的所述第一视频信息的个 人视频信息、 或包含于除所述合成视频中之外的所述第二视频信息的个人视 频信息;
所述处理器还用于调用所述存储器的指令和数据以实现, 根据所述 第一与会者替换指令用所述第二与会者标识指示的个人视频信息替换所述合 成视频中所述第一与会者标识指示的个人视频信息;
所述通信接口还用于将替换所述个人视频信息后的所述合成视频发送给 所述第三视频终端进行显示。
31、 根据权利要求 29所述的视频会议服务器, 其特征在于:
所述通信接口还用于接收所述第一视频终端、 所述第二视频终端、 或所 述第三视频终端发送的第一与会者替换指令, 其中, 所述第二与会者替换指 令中携带有第三与会者标识, 所述第二与会者替换指令用于指示用所述第三 与会者标识指示的个人视频信息替换所述合成视频包含的个人视频信息, 所 述第三与会者标识指示的个人视频信息是包含于除所述合成视频中之外的所 述第一视频信息的个人视频信息、 或包含于除所述合成视频中之外的所述第 二视频信息的个人视频信息;
所述处理器还用于调用所述存储器的指令和数据以实现, 根据所述第二 与会者替换指令从所述合成视频所包含的个人视频信息中选择目标个人视频 信息, 将选择的所述目标个人视频信息替换为所述第三与会者标识指示的个 人视频信息;
所述通信接口还用于将替换所述个人视频信息后的所述合成视频发送给 所述第三视频终端进行显示。
32、根据权利要求 31所述的视频会议服务器, 其特征在于: 所述第二与 会者替换指令中还携带有位置信息;
所述处理器还用于调用所述存储器的指令和数据以实现, 根据所述第二 包含位置信息对应的个人视频信息作为所述的目标个人视频信息。
33、 根据权利要求 29所述的视频会议服务器, 其特征在于:
所述通信接口还用于接收所述第一视频终端、 所述第二视频终端、 或所 述第三视频终端发送的添加指令, 其中, 所述添加指令中携带有第四与会者 标识, 所述添加指令用于指示在所述合成视频中添加所述第四与会者标识指 示的个人视频信息, 所述第四与会者标识指示的个人视频信息是包含于除所 述合成视频中之外的所述第一视频信息的个人视频信息、 或包含于除所述合 成视频中之外的所述第二视频信息的个人视频信息;
所述处理器还用于调用所述存储器的指令和数据以实现, 根据所述添加 指令在所述合成视频中添加所述第四与会者标识指示的个人视频信息;
所述通信接口还用于将添加所述个人视频信息后的所述合成视频发送给 所述第三视频终端进行显示。
34、 根据权利要求 29所述的视频会议服务器, 其特征在于:
所述通信接口还用于接收所述第一视频终端、 所述第二视频终端、 或所 述第三视频终端发送的删除指令, 其中, 所述删除指令中携带有第五与会者 标识, 所述删除指令用于指示从所述合成视频所包含的个人视频信息中删除 所述第五与会者标识指示的个人视频信息, 所述第五与会者标识指示的个人 视频信息是包含于所述合成视频中的个人视频信息;
所述处理器还用于调用所述存储器的指令和数据以实现, 根据所述删除 指令在所述合成视频所包含的个人视频信息中删除所述第五与会者标识指示 的个人视频信息;
所述通信接口还用于将删除所述个人视频信息后的所述合成视频发送给 所述第三视频终端进行显示。
35、 根据权利要求 29-34任一所述的视频会议服务器, 其特征在于: 所述处理器还用于对所述预设数量的所述个人视频信息中的对应图像进 行拼接, 生成合成图像, 其中, 所述预设数量的所述个人视频信息中的对应 图像在时序上同步; 组合多幅所述合成图像以生成合成视频。
36、 根据权利要求 29-34任一所述的视频会议服务器, 其特征在于: 所述处理器还用于将所述预设数量的所述个人视频信息包含的图像信息 排列在预设背景图像中生成合成图像, 其中, 从所述第一视频信息、 所述第 二视频信息中获得的所述预设数量的所述个人视频信息包含的图像信息在时 序上同步; 组合多幅所述合成图像以生成合成视频。
37、 一种第三视频终端, 其特征在于, 包括: 处理器, 通信接口, 存储 器、 总线和显示器;
其中所述处理器、 所述通信接口、 所述存储器和所述显示器通过所述总 线互联;
所述通信接口用于接收视频会议服务器发送的合成视频, 其中, 所述合 成视频通过所述视频会议服务器从第一视频终端接收到的第一视频信息、 及 从第二视频终端接收到的第二视频信息中共获得预设数量的个人视频信息, 并将所述预设数量的所述个人视频信息合成而得到, 在所述合成视频中, 所 述预设数量的所述个人视频信息分别对应的与会者均处于一致的会场背景 中; 所述第一视频信息包括所述第一视频终端所在的会场中每个与会者的个 人视频信息, 所述第二视频信息包括所述第二视频终端所在的会场中每个与 会者的个人视频信息;
所述存储器, 用于存储指令或数据;
所述处理器调用存储在所述存储器中的指令以实现将所述合成视频通 过所述显示器进行显示。
38、 根据权利要求 37所述的第三视频终端, 其特征在于:
所述通信接口还用于接收至少一个视频釆集装置发送的视频信息; 所述处理器还用于将接收到的视频信息打包形成第三视频信息, 其 中, 所述至少一个视频釆集装置中的每个视频釆集装置用以釆集第三视频终 端所在的会场中至少一个与会者的视频信息, 所述第三视频信息包括所述第 所述通信接口还用于将所述第三视频信息发送给所述视频会议服务器, 以使所述视频会议服务器根据所述第三视频信息和所述第一视频信息生成合 成视频发送给所述第二视频终端, 或根据所述第三视频信息和所述第二视频 信息生成合成视频发送给所述第一视频终端。
39、 根据权利要求 37所述的第三视频终端, 其特征在于:
所述处理器还用于调用所述存储器的指令和数据以实现, 根据接收 到的用户输入的切换指示信息生成第一与会者替换指令, 其中, 所述第一与 会者替换指令中携带有第一与会者标识和第二与会者标识, 所述第一与会者 替换指令用于指示用所述第二与会者标识指示的个人视频信息替换所述合成 视频中包含的所述第一与会者标识指示的个人视频信息, 所述第一与会者标 识指示的个人视频信息是包含于所述合成视频中的个人视频信息, 所述第二 与会者标识指示的个人视频信息是包含于除所述合成视频中之外的所述第一 视频信息的个人视频信息、 或包含于除所述合成视频中之外的所述第二视频 信息的个人视频信息;
所述通信接口还用于向所述视频会议服务器发送所述第一与会者替换指 令用以使所述视频会议服务器根据所述第一与会者替换指令用所述第二与会 者标识指示的个人视频信息替换所述合成视频中所述第一与会者标识指示的 个人视频信息; 接收所述视频会议服务器根据所述第一与会者替换指令发送 的替换所述个人视频信息后的所述合成视频并通过所述显示器显示。
40、 根据权利要求 37所述的第三视频终端, 其特征在于:
所述处理器还用于当在预设时间范围内检测到所述第三视频终端所在 的会场中的语音釆集装置具有语音输入时, 确定用以指示正在发言的与会者 的第三与会者标识; 生成携带有所述第三与会者标识的第二与会者替换指 令, 其中, 所述第二与会者替换指令用于指示用所述第三与会者标识指示的 个人视频信息替换发送给所述第一视频终端或所述第二视频终端的合成视频 包含的个人视频信息;
所述通信接口还用于向所述视频会议服务器发送所述第二与会者替换指 令用以使所述视频会议服务器根据所述第二与会者替换指令从所述发送给所 述第一视频终端或第二视频终端的合成视频所包含的个人视频信息中选择目 标个人视频信息, 将选择的所述目标个人视频信息替换为所述第三与会者标 识指示的个人视频信息。
41、 根据权利要求 37所述的第三视频终端, 其特征在于:
所述处理器还用于调用所述存储器的指令和数据以实现, 根据接收 到的用户输入的添加指示信息生成添加指令, 其中, 所述添加指令中携带有 第四与会者标识, 所述添加指令用于指示在所述合成视频中添加所述第四与 会者标识指示的个人视频信息, 所述第四与会者标识指示的个人视频信息是 包含于除所述合成视频中之外的所述第一视频信息的个人视频信息、 或包含 于除所述合成视频中之外的所述第二视频信息的个人视频信息; 所述通信接口还用于向所述视频会议服务器发送所述添加指令用以使所 述视频服务器根据所述添加指令在所述合成视频中添加所述第四与会者标识 指示的个人视频信息; 接收所述视频会议服务器根据所述添加指令发送的添 加所述个人视频信息后的所述合成视频并通过所述显示器显示。
42、 根据权利要求 37所述的第三视频终端, 其特征在于:
所述处理器还用于调用所述存储器的指令和数据以实现, 根据接收 到的用户输入的删除指示信息生成删除指令, 其中, 所述删除指令中携带有 第五与会者标识, 所述删除指令用于指示从所述合成视频所包含的个人视频 信息中删除所述第五与会者标识指示的个人视频信息, 所述第五与会者标识 指示的个人视频信息是包含于所述合成视频中的个人视频信息;
所述通信接口还用于向所述视频会议服务器发送所述删除指令用以使所 除所述第五与会者标识指示的个人视频信息; 接收所述视频会议服务器根据 所述删除指令发送的删除所述个人视频信息后的所述合成视频并通过所述显 示器显示。
PCT/CN2013/074860 2013-04-27 2013-04-27 视频会议处理方法及设备 WO2014172907A1 (zh)

Priority Applications (4)

Application Number Priority Date Filing Date Title
CN201380001428.1A CN104380720B (zh) 2013-04-27 2013-04-27 视频会议处理方法及设备
EP13866482.6A EP2816801B1 (en) 2013-04-27 2013-04-27 Video conference processing method and device
PCT/CN2013/074860 WO2014172907A1 (zh) 2013-04-27 2013-04-27 视频会议处理方法及设备
US14/335,238 US9392191B2 (en) 2013-04-27 2014-07-18 Method and device for processing video conference

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2013/074860 WO2014172907A1 (zh) 2013-04-27 2013-04-27 视频会议处理方法及设备

Related Child Applications (1)

Application Number Title Priority Date Filing Date
US14/335,238 Continuation US9392191B2 (en) 2013-04-27 2014-07-18 Method and device for processing video conference

Publications (1)

Publication Number Publication Date
WO2014172907A1 true WO2014172907A1 (zh) 2014-10-30

Family

ID=51791024

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2013/074860 WO2014172907A1 (zh) 2013-04-27 2013-04-27 视频会议处理方法及设备

Country Status (4)

Country Link
US (1) US9392191B2 (zh)
EP (1) EP2816801B1 (zh)
CN (1) CN104380720B (zh)
WO (1) WO2014172907A1 (zh)

Families Citing this family (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9807341B2 (en) * 2016-02-19 2017-10-31 Microsoft Technology Licensing, Llc Communication event
US10929561B2 (en) * 2017-11-06 2021-02-23 Microsoft Technology Licensing, Llc Removing personally identifiable data before transmission from a device
CN108055495A (zh) * 2017-12-14 2018-05-18 南京美桥信息科技有限公司 一种可视虚拟聚会方法和系统
CN108259781B (zh) * 2017-12-27 2021-01-26 努比亚技术有限公司 视频合成方法、终端及计算机可读存储介质
WO2020010620A1 (zh) * 2018-07-13 2020-01-16 深圳市大疆创新科技有限公司 波浪识别方法、装置、计算机可读存储介质和无人飞行器
CN112887653B (zh) * 2021-01-25 2022-10-21 联想(北京)有限公司 一种信息处理方法和信息处理装置
CN117640877B (zh) * 2024-01-24 2024-03-29 浙江华创视讯科技有限公司 线上会议的画面重构方法及电子设备

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101141613A (zh) * 2007-10-10 2008-03-12 中国联合通信有限公司 一种视频会议切换控制系统及方法
CN102265613A (zh) * 2008-12-23 2011-11-30 坦德伯格电信公司 用于处理在多个视频会议终端之间的会议中的图像的方法、设备和计算机程序
CN102498717A (zh) * 2009-06-24 2012-06-13 思科系统国际公司 用于修改合成视频信号布局的方法和设备
CN202551219U (zh) * 2012-03-31 2012-11-21 福州一点通广告装饰有限公司 远程三维虚拟仿真合成系统

Family Cites Families (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR100316639B1 (ko) * 1998-05-22 2002-01-16 윤종용 다지점 영상회의 시스템 및 그에 따른 구현방법
EP1228629A1 (en) * 1999-10-08 2002-08-07 Nortel Networks Limited Method, apparatus, and article of manufacture for web-based control of a call server
US7139015B2 (en) * 2004-01-20 2006-11-21 Polycom, Inc. Method and apparatus for mixing compressed video
US7477281B2 (en) * 2004-11-09 2009-01-13 Nokia Corporation Transmission control in multiparty conference
JP4408845B2 (ja) 2005-07-27 2010-02-03 シャープ株式会社 映像合成装置及びプログラム
US8166205B2 (en) 2007-07-31 2012-04-24 Cisco Technology, Inc. Overlay transport virtualization
US20090083639A1 (en) * 2007-09-26 2009-03-26 Mckee Cooper Joel Distributed conference and information system
US8345082B2 (en) * 2008-10-08 2013-01-01 Cisco Technology, Inc. System and associated methodology for multi-layered site video conferencing
CN101610401A (zh) 2009-03-17 2009-12-23 郑仰湖 全景可视泊车系统
US8830293B2 (en) * 2009-05-26 2014-09-09 Cisco Technology, Inc. Video superposition for continuous presence
CN102082944B (zh) * 2009-11-30 2016-03-09 华为终端有限公司 一种包含远程呈现会场的会议控制方法、装置及系统
JP5190084B2 (ja) 2010-03-30 2013-04-24 株式会社日立製作所 仮想マシンのマイグレーション方法およびシステム
US9183560B2 (en) * 2010-05-28 2015-11-10 Daniel H. Abelow Reality alternate
US8423646B2 (en) 2010-07-09 2013-04-16 International Business Machines Corporation Network-aware virtual machine migration in datacenters
US8760488B2 (en) * 2010-10-22 2014-06-24 Litl Llc Video integration
CN102164091B (zh) 2011-05-13 2015-01-21 北京星网锐捷网络技术有限公司 一种mac地址表建立方法及运营商边缘设备
US8754926B1 (en) * 2011-11-29 2014-06-17 Google Inc. Managing nodes of a synchronous communication conference
US9148625B2 (en) * 2012-09-21 2015-09-29 Cisco Technology, Inc. Transition control in a videoconference

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101141613A (zh) * 2007-10-10 2008-03-12 中国联合通信有限公司 一种视频会议切换控制系统及方法
CN102265613A (zh) * 2008-12-23 2011-11-30 坦德伯格电信公司 用于处理在多个视频会议终端之间的会议中的图像的方法、设备和计算机程序
CN102498717A (zh) * 2009-06-24 2012-06-13 思科系统国际公司 用于修改合成视频信号布局的方法和设备
CN202551219U (zh) * 2012-03-31 2012-11-21 福州一点通广告装饰有限公司 远程三维虚拟仿真合成系统

Also Published As

Publication number Publication date
EP2816801B1 (en) 2018-05-30
EP2816801A4 (en) 2015-07-22
US20140327727A1 (en) 2014-11-06
CN104380720B (zh) 2017-11-28
CN104380720A (zh) 2015-02-25
EP2816801A1 (en) 2014-12-24
US9392191B2 (en) 2016-07-12

Similar Documents

Publication Publication Date Title
US10057542B2 (en) System for immersive telepresence
WO2014172907A1 (zh) 视频会议处理方法及设备
JP5199249B2 (ja) ビデオストリームをアラインするための融合空間
JP5508450B2 (ja) マルチストリームかつマルチサイトのテレプレゼンス会議システムのための自動的なビデオレイアウト
US9148625B2 (en) Transition control in a videoconference
US20090327418A1 (en) Participant positioning in multimedia conferencing
WO2015085949A1 (zh) 视频会议方法、装置及系统
TW200939775A (en) Techniques to generate a visual composition for a multimedia conference event
JP2007282072A (ja) 電子会議システム、電子会議支援プログラム、電子会議支援方法、電子会議システムにおける情報端末装置
CN103597468A (zh) 用于视频通信系统中改进的交互式内容共享的系统和方法
WO2012149796A1 (zh) 视频会议中视频资源管理的方法及装置
US8848021B2 (en) Remote participant placement on a unit in a conference room
JP2007104354A (ja) テレビ会議システム、テレビ会議方法及びテレビ会議端末装置
WO2014187282A1 (zh) 一种建立视频会议界面的方法、装置及视频终端
JP2006303997A (ja) テレビ会議システム
EP3024223B1 (en) Videoconference terminal, secondary-stream data accessing method, and computer storage medium
CN101939989A (zh) 虚拟桌子
CN105306872B (zh) 控制多点视频会议的方法、装置和系统
CN102016818A (zh) 预定和进行中事件参加者之间的通信
KR101577144B1 (ko) 디스플레이부의 화면공유를 통한 원격회의시스템 및 방법
JP2003339037A (ja) ネットワーク会議システム、ネットワーク会議方法およびネットワーク会議プログラム
TW201141226A (en) Virtual conversing method
JP2023026478A (ja) 情報処理装置、プログラム、方法、システム
CN102016816A (zh) 事件之间的消息传送
JP2003339034A (ja) ネットワーク会議システム、ネットワーク会議方法およびネットワーク会議プログラム

Legal Events

Date Code Title Description
WWE Wipo information: entry into national phase

Ref document number: 2013866482

Country of ref document: EP

121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 13866482

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE