WO2022088050A1 - 视频会议实现方法、装置、系统以及存储介质 - Google Patents
视频会议实现方法、装置、系统以及存储介质 Download PDFInfo
- Publication number
- WO2022088050A1 WO2022088050A1 PCT/CN2020/125289 CN2020125289W WO2022088050A1 WO 2022088050 A1 WO2022088050 A1 WO 2022088050A1 CN 2020125289 W CN2020125289 W CN 2020125289W WO 2022088050 A1 WO2022088050 A1 WO 2022088050A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- conference
- terminal
- information
- video
- video data
- Prior art date
Links
- 238000000034 method Methods 0.000 title claims abstract description 75
- 230000001960 triggered effect Effects 0.000 claims description 18
- 230000000694 effects Effects 0.000 abstract description 30
- 238000012545 processing Methods 0.000 description 19
- 238000010586 diagram Methods 0.000 description 18
- 238000004891 communication Methods 0.000 description 11
- 230000002093 peripheral effect Effects 0.000 description 10
- 230000001133 acceleration Effects 0.000 description 9
- 230000008569 process Effects 0.000 description 7
- 230000006870 function Effects 0.000 description 6
- 230000003287 optical effect Effects 0.000 description 6
- 230000006835 compression Effects 0.000 description 5
- 238000007906 compression Methods 0.000 description 5
- 238000005516 engineering process Methods 0.000 description 5
- 238000004590 computer program Methods 0.000 description 4
- 230000008878 coupling Effects 0.000 description 3
- 238000010168 coupling process Methods 0.000 description 3
- 238000005859 coupling reaction Methods 0.000 description 3
- 241000282412 Homo Species 0.000 description 2
- 238000013473 artificial intelligence Methods 0.000 description 2
- 230000005540 biological transmission Effects 0.000 description 2
- 239000000919 ceramic Substances 0.000 description 2
- 230000003247 decreasing effect Effects 0.000 description 2
- 230000009977 dual effect Effects 0.000 description 2
- 239000004973 liquid crystal related substance Substances 0.000 description 2
- 230000033001 locomotion Effects 0.000 description 2
- PICXIOQBANWBIZ-UHFFFAOYSA-N zinc;1-oxidopyridine-2-thione Chemical class [Zn+2].[O-]N1C=CC=CC1=S.[O-]N1C=CC=CC1=S PICXIOQBANWBIZ-UHFFFAOYSA-N 0.000 description 2
- 238000013459 approach Methods 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 238000012790 confirmation Methods 0.000 description 1
- 238000013480 data collection Methods 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 230000004927 fusion Effects 0.000 description 1
- 230000002452 interceptive effect Effects 0.000 description 1
- 230000001788 irregular Effects 0.000 description 1
- 238000010801 machine learning Methods 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 238000010295 mobile communication Methods 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 238000009877 rendering Methods 0.000 description 1
- 230000006641 stabilisation Effects 0.000 description 1
- 238000011105 stabilization Methods 0.000 description 1
- 239000010409 thin film Substances 0.000 description 1
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N5/00—Details of television systems
- H04N5/222—Studio circuitry; Studio devices; Studio equipment
- H04N5/262—Studio circuits, e.g. for mixing, switching-over, change of character of image, other special effects ; Cameras specially adapted for the electronic generation of special effects
- H04N5/2624—Studio circuits, e.g. for mixing, switching-over, change of character of image, other special effects ; Cameras specially adapted for the electronic generation of special effects for obtaining an image which is composed of whole input images, e.g. splitscreen
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N7/00—Television systems
- H04N7/14—Systems for two-way working
- H04N7/15—Conference systems
Definitions
- the present application relates to the field of network technologies, and in particular, to a method, device, system and storage medium for implementing a video conference.
- Video conferencing is an emerging method of video conferencing. Through video conferencing, multiple people can have online meetings, which is convenient and fast.
- the terminals of a plurality of participants are connected to the server.
- the plurality of terminals transmit the collected video data to the server, and the server according to the predetermined layout method,
- the video data transmitted by the multiple terminals are sequentially arranged in different picture areas to form conference video data including the video data of each terminal.
- the areas allocated by different terminals may not match the size of the video data collected by the terminals themselves, which leads to the problem that the effect of the conference video data is poor.
- Embodiments of the present application provide a video conference implementation method, device, system, and storage medium.
- the technical solution is as follows:
- An aspect of the present application provides a method for implementing a video conference, the method comprising:
- the layout information includes the information of at least two participant areas in the conference screen, and the information of the at least two participant areas and the at least two terminals participating in the video conference.
- Correspondence information includes the information of at least two participant areas in the conference screen, and the information of the at least two participant areas and the at least two terminals participating in the video conference.
- the obtaining the layout information of the conference screen of the video conference includes:
- the layout information sent by the master terminal in the at least two terminals is received.
- the method further includes:
- the master control instruction indicates that the first terminal of the at least two terminals is the master control terminal
- the first terminal is determined to be the master control terminal.
- the master control instruction includes at least one of a speaking instruction triggered by the first terminal, a master control switching instruction triggered by the first terminal, and a master switching instruction triggered by the master control terminal before the current moment. .
- the sending the information of the corresponding participant area to the at least two terminals includes:
- the information of the participant area corresponding to the second terminal is sent to the second terminal.
- the obtaining conference video data based on the conference multimedia data includes:
- the shared video data is processed according to the layout information to obtain the conference video data; wherein, in the conference video data, the shared video data occupies The area is larger than the area occupied by other conference multimedia data except the shared video data.
- the method further includes:
- the conference video data is displayed on the display screen.
- the sending the conference video data to the at least two terminals includes:
- an embodiment of the present application provides a method for implementing a video conference, and the method includes:
- the multimedia data obtained based on the information of the participant area and the multimedia data including:
- Character recognition is performed on the multimedia data to determine the area where the character is located in the multimedia data
- the multimedia data is processed to obtain the participant multimedia data; wherein, in the participant video data, the proportion of the area where the person is located occupies greater than the specified value.
- the method further includes:
- the layout information includes information on at least two participant areas in the conference screen, and correspondence between the at least two participant areas and at least two terminals participating in the video conference relationship information.
- the obtaining information of the participant area corresponding to the terminal provided by the conference device includes:
- the receiving conference video data provided by the conference device includes:
- the displaying the conference video data includes:
- an embodiment of the present application provides a video conference implementation device, and the video conference implementation device includes:
- a layout obtaining module is used to obtain the layout information of the conference screen of the video conference, the layout information includes the information of at least two participant areas in the conference screen, and the information of the at least two participant areas and participating in the video conference The corresponding relationship of at least two terminals;
- a sending module configured to send the information of the participant area corresponding to the at least two terminals to the at least two terminals
- a receiving module configured to receive the multimedia data provided by the at least two terminals based on the information of the corresponding participant area
- a conference video acquisition module for acquiring conference video data based on the multimedia data of the conference participants
- a conference video sending module configured to send the conference video data to the at least two terminals.
- an embodiment of the present application provides a video conference implementation device, and the video conference implementation device includes:
- an area acquisition module used for acquiring the information of the participant area corresponding to the terminal provided by the conference device
- a video acquisition module for acquiring video data
- Participating video acquisition module for obtaining participant multimedia data based on the information of the participant area and the multimedia data
- a conference-participating video providing module is used for providing the conference-participating multimedia data to the conference device.
- an embodiment of the present application provides a system for implementing a video conference, where the system includes a conference device and at least two terminals;
- the conference device is configured to acquire the layout information of the conference screen of the video conference, the layout information includes the information of at least two participant areas in the conference screen, and the information of the at least two participant areas and participating in the video conference. Correspondence information of at least two terminals of ;
- the conference device is configured to send the information of the corresponding participant area to the at least two terminals;
- the terminal is used for acquiring multimedia data
- the terminal is configured to obtain the multimedia data of the meeting based on the information of the participant area and the multimedia data;
- the terminal is configured to provide the conference device with the multimedia data for the conference
- the conference device is configured to acquire conference video data based on the multimedia data of the conference participants;
- the conference device is configured to send the conference video data to the at least two terminals;
- the terminal is used for displaying the conference video data.
- the conference device is used to display the conference video data on a display screen
- a second terminal among the at least two terminals is configured to send layout adjustment information to the conference device;
- the conference apparatus is configured to send the conference video data to a first terminal of the at least two terminals, and send the conference video data adjusted based on the layout adjustment information to the second terminal;
- the first terminal is configured to display the conference video data
- the second terminal is configured to display the adjusted conference video data
- an embodiment of the present application provides a conference device, the conference device includes a processor and a memory, the memory stores at least one instruction, at least a piece of program, code set or instruction set, the at least one instruction, The at least one piece of program, the code set or the instruction set is loaded and executed by the processor to implement the above-mentioned video conference implementation method.
- a terminal includes a processor and a memory, the memory stores at least one instruction, at least a section of program, code set or instruction set, the at least one instruction, all the The at least one piece of program, the code set or the instruction set is loaded and executed by the processor to implement the above-mentioned video conference implementation method.
- a computer non-transitory storage medium stores at least one instruction, at least one program, a code set or an instruction set, the at least one instruction, all the The at least one piece of the program, the code set or the instruction set is loaded and executed by the processor to implement the above-mentioned video conference implementation method.
- a computer program product or computer program comprising computer instructions stored in a computer readable storage medium.
- the processor of the computer device reads the computer instructions from the computer-readable storage medium, and the processor executes the computer instructions to cause the computer device to perform the methods provided in the various optional implementations described above.
- the terminals participating in the video conference can collect the multimedia data of the conference participants according to the layout information according to the layout information, and then participate in the video conference.
- the conference device can generate conference video data of matching size, and provide the conference video data to the terminals participating in the video conference.
- the display effect of the conference video data is relatively high. it is good. The problem of poor effect of conference video data in the related art is solved, and the display effect of conference video data in the video conference is improved.
- Fig. 1 is a kind of display schematic diagram of the conference screen of video conference
- FIG. 2 is a schematic structural diagram of an implementation environment involved in an embodiment of the present application.
- FIG. 3 is a flowchart of a method for implementing a video conference provided by an embodiment of the present application
- FIG. 4 is a flowchart of another video conference implementation method provided by an embodiment of the present application.
- FIG. 5 is a flowchart of another video conference implementation method provided by an embodiment of the present application.
- Fig. 6 is a kind of flow chart of sending the information of participant area in the embodiment shown in Fig. 5;
- Fig. 7 is a kind of flow chart of acquiring the multimedia data of meeting in the embodiment shown in Fig. 5;
- FIG. 8 is a schematic diagram of a conference screen of a video conference in the embodiment shown in FIG. 5;
- FIG. 9 is a schematic display diagram of a terminal in the embodiment shown in FIG. 5;
- Fig. 10 is a flow chart of displaying conference video data in the embodiment shown in Fig. 5;
- FIG. 11 is a block diagram of an apparatus for implementing a video conference provided by an embodiment of the present application.
- FIG. 12 is a block diagram of another apparatus for implementing a video conference provided by an embodiment of the present application.
- FIG. 13 is a structural block diagram of a video conference implementation system provided by an embodiment of the present application.
- FIG. 14 is a schematic structural diagram of a conference device provided by an embodiment of the present application.
- FIG. 15 is a structural block diagram of a terminal provided by an embodiment of the present application.
- Video conference system also known as conference TV system, refers to two or more individuals or groups in different places, through transmission lines and multimedia equipment, to transmit audio, video and document data to each other to achieve instant and interactive communication, System equipment to realize remote conference.
- each terminal collects video data of the participants through its own camera. After each terminal collects the video data, it will send the collected video data to the conference device, and the conference device will receive the video data.
- the video data sent by multiple terminals in the video conference are combined into one conference video data.
- the display screen displays the conference video data
- the video data collected by each terminal participating in the video conference will be displayed in different areas of the display screen.
- FIG. 1 which is a schematic diagram of the display of a conference screen of a video conference, wherein the conference screen includes four areas, and the four areas respectively display terminal A, terminal B, terminal C and terminal D provided video data 11, 12, 13, 14.
- the conference screen can be displayed on the display screen of each terminal participating in the video conference, and the technical effect of the video conference can be realized through the conference screen.
- the size of the video data collected by different terminals participating in the video conference may be different. to collect video data, resulting in inconsistent parameters of the collected video data (such as screen ratio, resolution, bit rate, etc.), so that the size of the video data of each terminal in the conference screen generated by the conference device is the same as the conference screen
- the problem of size mismatch such as the slanted shaded area in the conference screen shown in Figure 1. This will lead to a problem that the display effect of the video conference is poor.
- FIG. 2 is a schematic structural diagram of an implementation environment involved in an embodiment of the present application.
- the implementation environment includes a conference device 21 and a plurality of terminals 22 .
- the conference device 21 may include one server or a server cluster composed of multiple servers, and the conference device 21 may establish a connection with the terminal 22 through a wired connection or a wireless connection.
- the conference device 21 and the terminal 22 may also be the same hardware device, which is not limited in this embodiment of the present application.
- the conference device 21 may include a multipoint control unit (Multipoint Control Unit, MCU), which is used to implement multi-terminal calls and connections.
- MCU Multipoint Control Unit
- the terminal 22 may include a mobile phone, a tablet computer, a desktop computer, a notebook computer, and various smart wearable devices.
- the terminal 22 may have one or more components such as a display screen, a microphone, a speaker, and a speaker connection component (such as a speaker interface or a wireless connection module, etc.), so as to realize various functions of the video conference.
- FIG. 3 is a flowchart of a method for implementing a video conference provided by an embodiment of the present application. This embodiment is illustrated by applying the method for implementing a video conference to a conference device in the implementation environment shown in FIG. 2 .
- the video conference implementation method may include the following steps:
- Step 301 Acquire layout information of a conference screen of a video conference, where the layout information includes information of at least two participant areas in the conference screen, and correspondence information between the at least two participant areas and at least two terminals participating in the video conference.
- Step 302 Send the information of the corresponding participant area to the at least two terminals.
- Step 303 Receive the multimedia data of the meeting provided by the at least two terminals based on the information of the corresponding participant area.
- Step 304 Acquire conference video data based on the multimedia data of the conference participants.
- Step 305 Send conference video data to the at least two terminals.
- the multimedia data for the conference may include one or more of audio data, video data, image data and text data.
- the video conference implementation method acquires the layout information of the conference screen of the video conference, and sends the layout information to the terminals participating in the video conference, so that the terminals participating in the video conference can
- the layout information is used to collect conference multimedia data that conforms to the layout information, and then the terminals participating in the video conference send the obtained conference multimedia data to the conference device.
- the conference terminal provides the conference video data, and the display effect of the conference video data is good. The problem of poor effect of conference video data in the related art is solved, and the display effect of conference video data in the video conference is improved.
- the conference device will perform processing such as cropping and compression on the video data uploaded by the terminals participating in the video conference, so that the video data can meet various requirements of the conference screen of the video conference, but, This obviously will seriously increase the computational burden of the conference device.
- the conference device may not perform processing such as compressing and cropping the video image, and the video data collected by the multiple terminals participating in the video conference is collected according to the participant area sent by the conference device.
- processing the instructions of the information the computing load of the conference device can be greatly reduced, and the smoothness of the video conference can be improved.
- FIG. 4 is a flowchart of another video conference implementation method provided by an embodiment of the present application. This embodiment is illustrated by applying the video conference implementation method to a terminal in the implementation environment shown in FIG. 2 as an example.
- the video conference implementation method may include the following steps:
- Step 401 Obtain the information of the participant area corresponding to the terminal provided by the conference device.
- Step 402 Acquire multimedia data.
- Step 403 based on the information of the participant area and the multimedia data, obtain the participant multimedia data.
- Step 404 Provide the conference device with multimedia data for the conference.
- the terminals participating in the video conference can collect the multimedia data of the conference participants according to the layout information provided by the conference device, and then the terminals participating in the video conference will get After the multimedia data of the conference participants are sent to the conference device, the conference device can generate conference video data of matching size, and provide the conference video data to the terminals participating in the video conference, and the display effect of the conference video data is better.
- the problem of poor effect of conference video data in the related art is solved, and the display effect of conference video data in the video conference is improved.
- Fig. 5 is a flowchart of another method for implementing a video conference provided by an embodiment of the present application. This embodiment is illustrated by applying the method for implementing a video conference to the implementation environment shown in Fig. 2 as an example.
- the video conference implementation method may include the following steps:
- Step 501 The conference apparatus receives the layout information sent by the master terminal among the at least two terminals participating in the video conference.
- the layout information includes information of at least two participant areas in the conference screen, and correspondence information between the at least two participant areas and at least two terminals participating in the video conference.
- the at least two participant areas are used to display video data provided by terminals participating in the video conference.
- the information of the participant area may include various parameters of the participant area, such as area shape, resolution, bit rate, color, and area edge length.
- the terminal can process the collected video data into participant multimedia data matching the participant area, that is, video data that can be displayed in the participant area.
- the terminals participating in the video conference can be determined before this step.
- the conference device can establish a video conference, and the terminal can participate in the video conference through authentication information (such as user name and password, etc.). Before the video conference starts, the conference device can determine the terminals participating in the video conference.
- step 501 can be re-executed to re-determine the layout information.
- the master control terminal may be one terminal among multiple terminals participating in the video conference, and the terminal may be used to control the video conference.
- Step 501 provides a way to obtain layout information, but the conference device may also obtain layout information in other ways, for example, the conference device locally pre-stores layout information, etc., which is not limited in this embodiment of the present application.
- Step 502 The conference device sends the information of the participant area corresponding to the at least two terminals participating in the video conference.
- the conference device may send information about the participant area corresponding to each terminal to each terminal participating in the video conference. For example, terminal A participating in the video conference corresponds to the participant area 51 in the video screen, and terminal B participating in the video conference corresponds to the participant area 52 in the video screen, then the conference device can send the information of the participant area 51 to the terminal A, and sends the information of the participant area 52 to the terminal B.
- the conference device may also send the layout information to each terminal participating in the video conference, that is, each terminal participating in the video conference can receive the information of all participant areas.
- the conference device can directly send the information of the corresponding participant area to the terminal participating in the video conference, or, as shown in FIG. 6, it is another way of sending the information of the participant area in step 502,
- This approach can include:
- Sub-step 5021 The conference apparatus receives the layout application information sent by the second terminal of the at least two terminals participating in the video conference.
- the second terminal may be one of the terminals participating in the video conference, and the terminal may actively send layout application information to the conference device.
- Sub-step 5022 Based on the layout application information, the conference device sends the information of the participant area corresponding to the second terminal to the second terminal.
- the conference apparatus may send the information of the participant area corresponding to the second terminal to the second terminal.
- Each terminal participating in the video conference may, after preparing for the video conference, send layout application information to the conference device to obtain the information of the corresponding participant area.
- Step 503 The terminal acquires multimedia data.
- the multimedia data may include video data collected by the terminal participating in the video conference through the camera assembly, or shared video data provided by the terminal, or image and audio data provided by the terminal.
- the video data collected by the camera assembly may be the video data of the participant itself, or the video data of the surrounding environment of the participant.
- the terminal is a mobile phone
- the front camera (or rear camera) of the mobile phone may be used to obtain the video data. Capture video data from attendees.
- the shared video data can include some pictures that the participants want to show, such as slide presentations (such as PPT), documents, tables, pictures, video, audio, web pages or other multimedia information stored locally on the terminal, which can be recorded by recording. Display window or application window to provide.
- slide presentations such as PPT
- documents such as PPT
- tables pictures
- video audio, web pages or other multimedia information stored locally on the terminal, which can be recorded by recording.
- Step 504 The terminal obtains the multimedia data of the participant based on the information of the participant area and the multimedia data.
- the terminal After the terminal collects the multimedia data, it can process the multimedia data based on the information of the participant area to obtain the participant multimedia data.
- the multimedia data may be video data with a resolution of 800 ⁇ 600 collected by the terminal, and the resolution recorded in the information of the participant area is 200 ⁇ 300, then the terminal may use the video with a resolution of 800 ⁇ 600.
- the data is cropped into video data with a resolution of 200 ⁇ 300, and the video data with a resolution of 200 ⁇ 300 can be used as multimedia data for the conference.
- the terminal can also adjust other parameters of the video data, such as bit rate and color, according to the information of the participant area.
- step 504 may include the following two sub-steps:
- Sub-step 5041 the terminal performs person identification on the multimedia data to determine the area where the person is located in the multimedia data.
- the terminal can identify the region where the human face (or the human face and the shoulders) in the multimedia data (eg, video data) is located by using the person identification technology.
- the terminal may divide each frame of video data into a plurality of small-sized images, and identify whether each area contains the head and shoulders features of a person through the adaboost algorithm (an iterative algorithm). If yes, record the coordinates and size of the area; if it is judged to be no, record no coordinate information, and so on, traverse the entire image to determine the area occupied by the head and shoulders in the image.
- adaboost algorithm an iterative algorithm
- Sub-step 5042 the terminal processes the multimedia data based on the information of the participant area and the area where the person is located to obtain the participant multimedia data.
- the terminal can perform operations such as cropping and compression on the collected multimedia data (such as video data) based on the information of the participant area and the area where the person is located, so that in the video data of the participant, the area where the person is located accounts for more than A specified value, exemplarily, the specified value is 50%-75%.
- the collected multimedia data such as video data
- the resolution of the information recorded in the participant area is 200 ⁇ 300
- the video data collected by the terminal is 800 ⁇ 600 resolution
- the terminal can process the video data into a resolution of 800 ⁇ 600. 200 ⁇ 300, and the proportion of the area where the person is located is 75% of the multimedia data of the participants.
- the area where the person is located in the multimedia data of the conference is larger, the person in the video screen can be highlighted, and the effect of the video conference can be improved.
- Step 505 The terminal provides the conference device with multimedia data for the conference.
- the terminal can process and obtain the multimedia data of the conference in real time, and transmit the multimedia data of the conference to the conference device in real time.
- Step 506 The conference device acquires conference video data based on the multimedia data of the conference participants.
- the conference device may acquire conference participant multimedia data provided by multiple terminals participating in the video conference, and combine the plurality of conference participant multimedia data into conference video data.
- the conference device may layout a plurality of multimedia data of participants in the conference screen according to the layout information determined in step 501 .
- the conference device processes the shared video data according to the layout information to obtain conference video data.
- the area occupied by the shared video data is larger than the shared video data.
- the display area of the shared video data is made larger than the display area of other multimedia data of the conference, so as to highlight the shared video data and improve the display effect of the video conference.
- the layout information may record the information of the conference area corresponding to the shared video data, and send it to the terminal that provides the shared video data in step 501.
- the terminal participating in the video conference may report to the main controller in advance
- the terminal or the conference device sends out sharing information to indicate that the shared video data is to be provided, so that the master terminal or the conference device can determine the corresponding layout information.
- a certain terminal may temporarily provide shared video data.
- the terminal may also send sharing information to the master terminal (or conference device), and perform step 501 again to determine new layout information.
- the shared video data may have an identifier, and the conference device may use the identifier to determine whether the shared video data is included in the multimedia data for the conference.
- FIG. 8 is a schematic diagram of a conference screen of a video conference in an embodiment of the present application
- the shared video data 81 is displayed in the largest area
- other displays include:
- the video data 82 of the participants is displayed in other smaller areas. Under such a layout, all the participants can clearly see the picture presented by the shared video data, which improves the display effect of the video conference.
- Step 507 The conference device sends conference video data to at least two terminals participating in the video conference.
- the conference device can send conference video data to each terminal participating in the video conference.
- the conference device can also individually adjust the conference video data displayed to one or more terminals.
- step 507 may include:
- the conference apparatus receives the layout adjustment information sent by the second terminal of the at least two terminals.
- the second terminal may send the layout adjustment information to the conference device.
- the layout adjustment information may include a layout mode different from the above layout information. For example, compared with the above layout information, in the layout adjustment information, a certain terminal corresponds to The size of the conference area of the terminal becomes larger, and the size of the conference area corresponding to the other terminal becomes smaller.
- the conference device adjusts the conference video data based on the layout adjustment information.
- the conference device can adjust the conference video data based on the layout adjustment information through a local image processing component (eg, MCU). It should be noted that the conference device may only adjust the conference video data for sending to the second terminal, and does not adjust the conference video data for sending to other terminals (such as the first terminal) that have not sent layout adjustment information. data.
- a local image processing component eg, MCU.
- the conference device sends the adjusted conference video data to the second terminal.
- the conference device sends the conference video data before adjustment to other terminals participating in the video conference except the second terminal.
- the conference video data displayed in the second terminal is different from the conference video data displayed by other terminals participating in the video conference, which enriches the diversity and user experience of the video conference.
- Step 508 The terminal displays the video data of the conference.
- the terminal can display the conference video data obtained from the conference device on the display screen.
- FIG. 9 which is a schematic diagram of the display of a terminal
- the terminal 90 displays a conference screen 91 composed of conference video data on a display screen. Participants can watch the conference screen 91.
- the conference screen 91 may also have a speaking button 912 and a moderator switching button 913.
- the speaking button 912 is triggered when a participant speaks, and the moderator switching button 913 is triggered when a participant wants to apply to be the moderator.
- step 508 may include the following two sub-steps:
- Step 5081 the terminal receives the adjustment instruction.
- the adjustment instruction may be triggered by a participant operating the terminal, and the adjustment instruction is used to adjust the layout of the conference screen presented by the display screen of the terminal.
- an adjustment instruction can be triggered on terminal A to adjust the layout of the conference screen so that the video data provided by participant B can be displayed.
- the display area becomes larger (correspondingly, the area occupied by the video data provided by the terminals of other participants may be reduced).
- Step 5082 The terminal displays the conference video data adjusted according to the adjustment instruction.
- the terminal can adjust the conference video data provided by the conference device according to the adjustment instruction, obtain the adjusted conference video data, and display the adjusted conference video data on the display screen.
- the terminal can dynamically adjust the conference video data through a local image processing component (such as a graphics processor (Graphics Processing Unit, GPU)).
- a local image processing component such as a graphics processor (Graphics Processing Unit, GPU)
- the adjustment instruction is used to adjust the layout of the conference screen presented by the local display screen of the terminal, but will not affect the conference screen presented in other terminals participating in the video conference.
- a participant operating terminal A wants to change the layout of the conference screen, he can trigger an adjustment instruction in terminal A to adjust the layout of the conference screen presented on the display screen of terminal A.
- the layout of the conference screen presented by the display screen of the terminal B may still be the layout in the conference video data provided by the conference device.
- Step 509 The conference device displays conference video data on the display screen.
- the conference device also has a display screen, and then the conference device can display conference video data on the display screen.
- the conference device can display conference video data on the display screen.
- One or more participants can participate in the video conference through the conference device, and control the video conference through the conference device, such as controlling the layout information of the conference video data.
- Step 510 when the main control instruction indicates that the first terminal among the at least two terminals participating in the video conference is the main control terminal, the conference apparatus determines the first terminal as the main control terminal.
- the main control terminal can also be modified. That is, the first terminal can be determined as the master terminal.
- the master control instruction may include a speaking instruction triggered by the first terminal, a master control switching instruction triggered by the first terminal, and a master control terminal triggered by the master control terminal before the current moment (that is, confirmed by the previous master control terminal). At least one of a switching instruction and a control instruction triggered by the conference device. For example, when the conference device has a display screen, a control instruction can be triggered by the conference device to adjust and control the main control terminal.
- an identifier can also be added to the conference area corresponding to the master control terminal, so that each terminal can identify the master control terminal, which facilitates the conduct of the video conference.
- the realization method of the video conference can be made more flexible.
- the video conference implementation method acquires the layout information of the conference screen of the video conference, and sends the layout information to the terminals participating in the video conference, so that the terminals participating in the video conference can
- the layout information is used to collect conference multimedia data that conforms to the layout information.
- the conference device can generate conference video data of matching size, and send the video
- the conference terminal provides the conference video data, and the display effect of the conference video data is good. The problem of poor effect of conference video data in the related art is solved, and the display effect of conference video data in the video conference is improved.
- the conference device may not perform processing such as compression and cropping on the video images, and the video data collected by the multiple terminals participating in the video conference is collected according to the participants sent by the conference device.
- the indication of the information in the area is processed, which can greatly reduce the computing load of the conference device, thereby improving the smoothness of the video conference.
- the apparatus 1100 for implementing a video conference includes:
- the layout acquisition module 1110 is used to acquire the layout information of the conference screen of the video conference, the layout information includes the information of at least two participant areas in the conference screen, and the correspondence between the at least two participant areas and at least two terminals participating in the video conference relationship information.
- the sending module 1120 is configured to send the information of the participant area corresponding to the at least two terminals to the at least two terminals.
- the receiving module 1130 is configured to receive conference multimedia data provided by at least two terminals based on the information of the corresponding participant area.
- the conference video acquisition module 1140 is configured to acquire conference video data based on the multimedia data of the conference participants.
- the conference video sending module 1150 is configured to send conference video data to at least two terminals.
- the video conference implementation device acquires the layout information of the conference screen of the video conference, and sends the layout information to the terminals participating in the video conference, so that the terminals participating in the video conference can
- the layout information is used to collect conference multimedia data that conforms to the layout information.
- the conference device can generate conference video data of matching size, and send the video
- the conference terminal provides the conference video data, and the display effect of the conference video data is good. The problem of poor effect of conference video data in the related art is solved, and the display effect of conference video data in the video conference is improved.
- a layout acquisition module for:
- the conference device further includes:
- the host confirmation module is configured to determine the first terminal as the host terminal when the host control instruction indicates that the first terminal of the at least two terminals is the host terminal.
- the master control instruction includes at least one of a speaking instruction triggered by the first terminal, a master control switching instruction triggered by the first terminal, and a master switching instruction triggered by the master control terminal before the current moment.
- a sending module for:
- the information of the participant area corresponding to the second terminal is sent to the second terminal.
- a conference video acquisition module including:
- the shared video data is processed according to the shared layout information to obtain the video data of the conference.
- the area occupied by the shared video data is larger than that of other multimedia data of the conference except the shared video data. area occupied.
- the conference device further includes:
- the conference display module is used to display conference video data on the display screen.
- FIG. 12 is a block diagram of another apparatus for implementing a video conference provided by an embodiment of the present application.
- the apparatus 1200 for implementing a video conference includes:
- an area acquisition module 1210 configured to acquire the information of the participant area corresponding to the terminal provided by the conference device
- a video acquisition module 1220 configured to acquire multimedia data
- Participating video acquisition module 1230 is used to obtain participant multimedia data based on the information and multimedia data of the participant area;
- a meeting video providing module 1240 is used to provide meeting multimedia data to the conference device
- the conference data receiving module 1250 is used for receiving conference video data provided by the conference device
- the conference data display module 1260 is used for displaying conference video data.
- the terminals participating in the video conference can collect the multimedia data of the conference participants according to the layout information provided by the conference device, and then the terminals participating in the video conference will receive After the multimedia data of the conference participants are sent to the conference device, the conference device can generate conference video data of matching size, and provide the conference video data to the terminals participating in the video conference, and the display effect of the conference video data is better.
- the problem of poor effect of conference video data in the related art is solved, and the display effect of conference video data in the video conference is improved.
- a video acquisition module for participating in the conference used for:
- the video data is processed to obtain the multimedia data of the meeting.
- the proportion of the area where the character is located is greater than the specified value.
- the terminal further includes:
- the layout information sending module is used for sending layout information to the conference device, where the layout information includes information of at least two participant areas in the conference screen, and correspondence information between the at least two participant areas and at least two terminals participating in the video conference.
- a region acquisition module for:
- the conference data display module is used for:
- FIG. 13 is a structural block diagram of a system for implementing a video conference provided by an embodiment of the present application.
- the system 1300 includes a conference device 131 and at least two terminals 132 .
- the conference device 131 is used to obtain the layout information of the conference screen of the video conference.
- the layout information includes the information of at least two participant areas in the conference screen, and the corresponding relationship information between the at least two participant areas and at least two terminals participating in the video conference. .
- the conference device 131 is configured to send the information of the participant area corresponding to the at least two terminals to the at least two terminals.
- the terminal 132 is used to obtain multimedia data.
- the terminal 132 is used to obtain the multimedia data for the conference based on the information of the conference area and the multimedia data.
- the terminal 132 is used to provide the conference device with multimedia data for the conference.
- the conference device 131 is configured to acquire conference video data based on the multimedia data of the conference participants.
- the conference device 131 is configured to send conference video data to at least two terminals;
- the terminal 132 is used to present conference video data.
- the method performed by the conference device 131 and the terminal 132 may also refer to the embodiment shown in FIG. 5 , and details are not described herein again.
- the video conference implementation system acquires the layout information of the conference screen of the video conference, and sends the layout information to the terminals participating in the video conference, so that the terminals participating in the video conference can
- the layout information is used to collect conference multimedia data that conforms to the layout information.
- the conference device can generate conference video data of matching size, and send the video
- the conference terminal provides the conference video data, and the display effect of the conference video data is good. The problem of poor effect of conference video data in the related art is solved, and the display effect of conference video data in the video conference is improved.
- the conference device 131 is used to display conference video data on the display screen.
- the second terminal 1321 of the at least two terminals is used to send the layout adjustment information to the conference device 131 .
- the conference device 131 is configured to send conference video data to the first terminal 1322 (the first terminal may be a terminal that does not send layout adjustment information to the conference device) among the at least two terminals, and send the conference video data to the second terminal 1321 based on the layout adjustment information. post-conference video data.
- the first terminal 1322 is used to display the conference video data
- the second terminal 1321 is used to display the adjusted conference video data. That is, the conference video data displayed by the terminal that has not sent the layout adjustment information to the conference device is different from the conference video data displayed by the terminal that has sent the layout adjustment information to the conference device.
- a computer program product or computer program comprising computer instructions stored in a computer readable storage medium.
- the processor of the computer device reads the computer instructions from the computer-readable storage medium, and the processor executes the computer instructions to cause the computer device to perform the methods provided in the various optional implementations described above.
- FIG. 14 is a schematic structural diagram of a conference device 1400 provided by an embodiment of the present application, where the conference device 1400 may be a server.
- the conference device 1400 includes a central processing unit (Central Processing Unit, CPU) 1401, a memory 1402, and a system bus 1403 connecting the memory 1402 and the central processing unit 1401.
- the memory 1402 may include, for example, a hard disk or A computer readable medium (not shown) such as a Compact Disc Read-Only Memory (CD-ROM).
- CD-ROM Compact Disc Read-Only Memory
- Computer-readable storage media may include computer storage media (which may be non-transitory storage media) and communication media.
- Computer storage media includes volatile and nonvolatile, removable and non-removable media implemented in any method or technology for storage of information such as computer readable instructions, data structures, program modules or other data.
- Computer storage media include random access memory (Random Access Memory, RAM), read-only memory (Read-Only Memory, ROM), erasable programmable read-only memory (Erasable Programmable Read Only Memory, EPROM), electrified erasable memory Programmable Read-Only Memory (Electrically Erasable Programmable Read-Only Memory, EEPROM), flash memory or other solid-state storage technology, CD-ROM, Digital Versatile Disc (DVD) or other optical storage, cassettes, magnetic tapes, Disk storage or other magnetic storage devices.
- RAM Random Access Memory
- ROM read-only memory
- EPROM erasable programmable Read Only Memory
- EPROM Erasable Programmable Read Only Memory
- EEPROM Electrical Erasable Programmable Read-Only Memory
- flash memory or other solid-state storage technology
- CD-ROM Compact Disc
- DVD Digital Versatile Disc
- the above-mentioned memory 1402 further includes one or more programs, and the one or more programs are stored in the memory and configured to be executed by the CPU, so as to realize the video conference implementation method provided by the embodiment of the present application.
- FIG. 15 shows a structural block diagram of a terminal 1500 provided by an embodiment of the present application.
- the terminal 1500 may be: a smart phone, a tablet computer, a moving picture expert compression standard audio layer 3 (Moving Picture Experts Group Audio Layer III, MP3) player, a moving picture expert compression standard audio layer 4 (Moving Picture Experts Group Audio Layer IV) , MP4) player, laptop or desktop computer.
- Terminal 1500 may also be called user equipment, portable terminal, laptop terminal, desktop terminal, and the like by other names.
- the terminal 1500 includes: a processor 1501 and a memory 1502 .
- the processor 1501 may include one or more processing cores, such as a 4-core processor, an 8-core processor, and the like.
- the processor 1501 can use at least one hardware form among digital signal processing (Digital Signal Processing, DSP), field-programmable gate array (Field-Programmable Gate Array, FPGA), and programmable logic array (Programmable Logic Array, PLA).
- DSP Digital Signal Processing
- FPGA field-programmable gate array
- PLA programmable logic array
- the processor 1501 may also include a main processor and a coprocessor.
- the main processor is a processor used to process data in the wake-up state, also called a central processing unit (CPU);
- the coprocessor is a A low-power processor for processing data in a standby state.
- the processor 1501 may be integrated with a graphics processor (Graphics Processing Unit, GPU), and the GPU is used for rendering and drawing the content that needs to be displayed on the display screen.
- the processor 1501 may further include an artificial intelligence (Artificial Intelligence, AI) processor for processing computing operations related to machine learning.
- AI Artificial Intelligence
- Memory 1502 may include one or more computer-readable storage media (which may be non-transitory storage media), which may be non-transitory. Memory 1502 may also include high-speed random access memory, as well as non-volatile memory, such as one or more disk storage devices, flash storage devices. In some embodiments, the non-transitory computer-readable storage medium in the memory 1502 is used to store at least one instruction, and the at least one instruction is used to be executed by the processor 1501 to implement the video conference provided by the method embodiments in this application. Implementation.
- the terminal 1500 may optionally further include: a peripheral device interface 1503 and at least one peripheral device.
- the processor 1501, the memory 1502 and the peripheral device interface 1503 can be connected through a bus or a signal line.
- Each peripheral device can be connected to the peripheral device interface 1503 through a bus, a signal line or a circuit board.
- the peripheral device includes: at least one of a radio frequency circuit 1504 , a touch display screen 1505 , a camera 1506 , an audio circuit 1507 , a positioning component 1508 and a power supply 1509 .
- the peripheral device interface 1503 may be used to connect at least one peripheral device related to input/output (I/O) to the processor 1501 and the memory 1502 .
- processor 1501, memory 1502, and peripherals interface 1503 are integrated on the same chip or circuit board; in some other embodiments, any one of processor 1501, memory 1502, and peripherals interface 1503 or The two can be implemented on a separate chip or circuit board, which is not limited in this embodiment.
- the radio frequency circuit 1504 is used for receiving and transmitting radio frequency (Radio Frequency, RF) signals, also called electromagnetic signals.
- the radio frequency circuit 1504 communicates with communication networks and other communication devices via electromagnetic signals.
- the radio frequency circuit 1504 converts electrical signals into electromagnetic signals for transmission, or converts received electromagnetic signals into electrical signals.
- radio frequency circuitry 1504 includes an antenna system, an RF transceiver, one or more amplifiers, tuners, oscillators, digital signal processors, codec chipsets, subscriber identity module cards, and the like.
- the radio frequency circuit 1504 may communicate with other terminals through at least one wireless communication protocol.
- the wireless communication protocol includes but is not limited to: World Wide Web, Metropolitan Area Network, Intranet, various generations of mobile communication networks (2G, 3G, 4G and 5G), wireless local area network and/or Wireless Fidelity (Wireless Fidelity, WiFi) network.
- the radio frequency circuit 1504 may further include a circuit related to near field communication (Near Field Communication, NFC), which is not limited in this application.
- NFC Near Field Communication
- the display screen 1505 is used to display a user interface (User Interface, UI).
- the UI can include graphics, text, icons, video, and any combination thereof.
- the display screen 1505 also has the ability to acquire touch signals on or above the surface of the display screen 1505 .
- the touch signal can be input to the processor 1501 as a control signal for processing.
- the display screen 1505 may also be used to provide virtual buttons and/or virtual keyboards, also referred to as soft buttons and/or soft keyboards.
- the display screen 1505 there may be one display screen 1505, which is provided on the front panel of the terminal 1500; in other embodiments, there may be at least two display screens 1505, which are respectively arranged on different surfaces of the terminal 1500 or in a folded design; In still other embodiments, the display screen 1505 may be a flexible display screen disposed on a curved surface or a folding surface of the terminal 1500 . Even, the display screen 1505 can also be set as a non-rectangular irregular figure, that is, a special-shaped screen.
- the display screen 1505 can be made of materials such as a liquid crystal display (Liquid Crystal Display, LCD), an organic light-emitting diode (Organic Light-Emitting Diode, OLED).
- the camera assembly 1506 is used to capture images or video.
- the camera assembly 1506 includes a front camera and a rear camera.
- the front camera is arranged on the front panel of the terminal, and the rear camera is arranged on the back of the terminal.
- there are at least two rear cameras which are any one of a main camera, a depth-of-field camera, a wide-angle camera, and a telephoto camera, so as to realize the fusion of the main camera and the depth-of-field camera to realize the background blur function, the main camera Integrate with wide-angle camera to achieve panoramic shooting and virtual reality (VR) shooting functions or other integrated shooting functions.
- the camera assembly 1506 may also include a flash.
- the flash can be a single color temperature flash or a dual color temperature flash. Dual color temperature flash refers to the combination of warm light flash and cold light flash, which can be used for light compensation under different color temperatures.
- Audio circuitry 1507 may include a microphone and speakers.
- the microphone is used to collect the sound waves of the user and the environment, convert the sound waves into electrical signals and input them to the processor 1501 for processing, or to the radio frequency circuit 1504 to realize voice communication.
- the microphone may also be an array microphone or an omnidirectional collection microphone.
- the speaker is used to convert the electrical signal from the processor 1501 or the radio frequency circuit 1504 into sound waves.
- the loudspeaker can be a traditional thin-film loudspeaker or a piezoelectric ceramic loudspeaker.
- the speaker When the speaker is a piezoelectric ceramic speaker, it can not only convert electrical signals into sound waves audible to humans, but also convert electrical signals into sound waves inaudible to humans for distance measurement and other purposes.
- the audio circuit 1507 may also include a headphone jack.
- the positioning component 1508 is used to locate the current geographic location of the terminal 1500 to implement navigation or a Location Based Service (LBS).
- LBS Location Based Service
- the positioning component 1508 may be a positioning component based on the US Global Positioning System (Global Positioning System, GPS), China's Beidou system or Russia's Galileo system.
- the power supply 1509 is used to power various components in the terminal 1500 .
- the power source 1509 may be alternating current, direct current, primary batteries, or rechargeable batteries.
- the rechargeable battery may be a wired rechargeable battery or a wireless rechargeable battery. Wired rechargeable batteries are batteries that are charged through wired lines, and wireless rechargeable batteries are batteries that are charged through wireless coils.
- the rechargeable battery can also be used to support fast charging technology.
- the terminal 1500 also includes one or more sensors 1510 .
- the one or more sensors 1510 include, but are not limited to, an acceleration sensor 1511 , a gyro sensor 1512 , a pressure sensor 1513 , a fingerprint sensor 1514 , an optical sensor 1515 , and a proximity sensor 1516 .
- the acceleration sensor 1511 can detect the magnitude of acceleration on the three coordinate axes of the coordinate system established by the terminal 1500 .
- the acceleration sensor 1511 can be used to detect the components of the gravitational acceleration on the three coordinate axes.
- the processor 1501 can control the touch display screen 1505 to display the user interface in a landscape view or a portrait view according to the gravitational acceleration signal collected by the acceleration sensor 1511 .
- the acceleration sensor 1511 can also be used for game or user movement data collection.
- the gyroscope sensor 1512 can detect the body direction and rotation angle of the terminal 1500 , and the gyroscope sensor 1512 can cooperate with the acceleration sensor 1511 to collect 3D actions of the user on the terminal 1500 .
- the processor 1501 can implement the following functions according to the data collected by the gyro sensor 1512 : motion sensing (such as changing the UI according to the user's tilt operation), image stabilization during shooting, game control, and inertial navigation.
- the pressure sensor 1513 may be disposed on the side frame of the terminal 1500 and/or the lower layer of the touch display screen 1505.
- the processor 1501 performs left and right hand identification or shortcut operations according to the holding signal collected by the pressure sensor 1513.
- the processor 1501 controls the operability controls on the UI interface according to the user's pressure operation on the touch display screen 1505.
- the operability controls include at least one of button controls, scroll bar controls, icon controls, and menu controls.
- the fingerprint sensor 1514 is used to collect the user's fingerprint, and the processor 1501 identifies the user's identity according to the fingerprint collected by the fingerprint sensor 1514, or the fingerprint sensor 1514 identifies the user's identity according to the collected fingerprint. When the user's identity is identified as a trusted identity, the processor 1501 authorizes the user to perform relevant sensitive operations, including unlocking the display screen, viewing encrypted information, downloading software, making payments, and changing settings.
- the fingerprint sensor 1514 may be provided on the front, back or side of the terminal 1500 . When the terminal 1500 is provided with physical buttons or a manufacturer's logo, the fingerprint sensor 1514 may be integrated with the physical buttons or the manufacturer's logo.
- Optical sensor 1515 is used to collect ambient light intensity.
- the processor 1501 can control the display brightness of the touch display screen 1505 according to the ambient light intensity collected by the optical sensor 1515 . Specifically, when the ambient light intensity is high, the display brightness of the touch display screen 1505 is increased; when the ambient light intensity is low, the display brightness of the touch display screen 1505 is decreased.
- the processor 1501 can also dynamically adjust the shooting parameters of the camera assembly 1506 according to the ambient light intensity collected by the optical sensor 1515 .
- a proximity sensor 1516 also called a distance sensor, is usually provided on the front panel of the terminal 1500.
- the proximity sensor 1516 is used to collect the distance between the user and the front of the terminal 1500 .
- the processor 1501 controls the touch display screen 1505 to switch from the bright screen state to the off screen state; when the proximity sensor 1516 detects When the distance between the user and the front of the terminal 1500 gradually increases, the processor 1501 controls the touch display screen 1505 to switch from the off-screen state to the bright-screen state.
- FIG. 15 does not constitute a limitation on the terminal 1500, and may include more or less components than the one shown, or combine some components, or adopt different component arrangements.
- At least one of A and B is only an association relationship to describe related objects, indicating that there can be three kinds of relationships, for example, at least one of A and B can mean that A exists alone, while There are three cases of A and B, and B alone.
- at least one of A, B, and C means that there can be seven kinds of relationships, which can be expressed as: A alone exists, B alone exists, C alone exists, A and B exist simultaneously, A and C exist simultaneously, and both exist C and B, there are seven cases of A, B and C at the same time.
- At least one of A, B, C, and D means that fifteen kinds of relationships can exist, which can mean: A alone exists, B alone exists, C alone exists, D alone exists, both A and B exist, and at the same time A and C exist, A and D exist simultaneously, C and B exist simultaneously, D and B exist simultaneously, C and D exist simultaneously, A, B, and C exist simultaneously, A, B, and D exist simultaneously, and A and C exist simultaneously and D, B, C, and D exist at the same time, and A, B, C, and D exist at the same time, these fifteen cases.
- first and second are used for descriptive purposes only, and should not be construed as indicating or implying relative importance.
- the term “plurality” refers to two or more, unless expressly limited otherwise.
- the disclosed apparatus and method may be implemented in other manners.
- the apparatus embodiments described above are only illustrative.
- the division of the units is only a logical function division. In actual implementation, there may be other division methods.
- multiple units or components may be combined or Can be integrated into another system, or some features can be ignored, or not implemented.
- the shown or discussed mutual coupling or direct coupling or communication connection may be through some interfaces, indirect coupling or communication connection of devices or units, and may be in electrical, mechanical or other forms.
- the units described as separate components may or may not be physically separated, and components displayed as units may or may not be physical units, that is, may be located in one place, or may be distributed to multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution in this embodiment.
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
- Telephonic Communication Services (AREA)
Abstract
本申请公开了一种视频会议实现方法、装置、系统以及计算机存储介质,属于网络技术领域。该方法包括:获取视频会议的会议画面的布局信息;向至少两个终端发送对应的与会者区域的信息;接收至少两个终端基于对应的与会者区域的信息提供的与会多媒体数据;基于与会多媒体数据获取会议视频数据;向至少两个终端发送会议视频数据。本申请通过将该布局信息发送给参与视频会议的终端,以使参与该视频会议的终端能够根据该布局信息来采集符合该布局信息的与会多媒体数据,会议装置即可以生成尺寸匹配的会议视频数据,会议视频数据的显示效果较好。解决了相关技术中会议视频数据的效果较差的问题,提升了视频会议中会议视频数据的显示效果。
Description
本申请涉及网络技术领域,特别涉及一种视频会议实现方法、装置、系统以及存储介质。
视频会议是目前的一种新兴的视频会议方式,通过视频会议,可以实现多人在线上进行会议,方便快捷。
目前,一种视频会议的实现方式中,由多个与会人员的终端与服务器建立连接,在开始视频会议后,这多个终端将采集到的视频数据传输至服务器,服务器根据预定的布局方式,将多个终端传输的视频数据依次布局在不同的画面区域中,以形成包括各个终端的视频数据的会议视频数据。
但是,上述服务器生成的会议视频数据中,不同终端所分配到的区域与终端自身所采集的视频数据的尺寸可能存在不匹配的情况,进而导致会议视频数据的效果较差的问题。
发明内容
本申请实施例提供了一种视频会议实现方法、装置、系统以及存储介质。所述技术方案如下:
本申请的一方面,提供了一种视频会议实现方法,所述方法包括:
获取视频会议的会议画面的布局信息,所述布局信息包括所述会议画面中至少两个与会者区域的信息,以及所述至少两个与会者区域和参与所述视频会议的至少两个终端的对应关系信息;
向所述至少两个终端发送对应的与会者区域的信息;
接收所述至少两个终端基于对应的与会者区域的信息提供的与会多媒体数据;
基于所述与会多媒体数据获取会议视频数据;
向所述至少两个终端发送所述会议视频数据。
可选地,所述获取视频会议的会议画面的布局信息,包括:
接收所述至少两个终端中的主控终端发送的所述布局信息。
可选地,所述接收所述主控终端发送的所述布局信息之后,所述方法还包括:
在主控指令指示所述至少两个终端中的第一终端为主控终端时,将所述第一终端确定为主控终端。
可选地,所述主控指令包括所述第一终端触发的发言指令、所述第一终端触发的主控切换指令、当前时刻之前的主控终端触发的主控切换指令中的至少一种。
可选地,所述向所述至少两个终端发送对应的与会者区域的信息,包括:
接收所述至少两个终端中的第二终端发送的布局申请信息;
基于所述布局申请信息,向所述第二终端发送所述第二终端对应的与会者区域的信息。
可选地,所述基于所述与会多媒体数据获取会议视频数据,包括:
当所述与会多媒体数据中包括共享视频数据时,根据所述布局信息对所述共享视频数据进行处理,得到所述会议视频数据;其中,所述会议视频数据中,所述共享视频数据所占的区域大于除所述共享视频数据外的其它与会多媒体数据所占的区域。
可选地,所述基于所述与会多媒体数据获取会议视频数据之后,所述方法还包括:
在显示屏展示所述会议视频数据。
可选地,所述向所述至少两个终端发送所述会议视频数据,包括:
接收所述至少两个终端中的第二终端发送的布局调整信息;
基于所述布局调整信息对所述会议视频数据进行调整;
向所述第二终端发送调整后的会议视频数据。
另一方面,本申请实施例提供一种视频会议实现方法,所述方法包括:
获取会议装置提供的终端对应的与会者区域的信息;
获取视频数据;
基于所述与会者区域的信息以及所述多媒体数据,得到与会多媒体数据;
向所述会议装置提供所述与会多媒体数据;
接收所述会议装置提供的会议视频数据;
展示所述会议视频数据。
可选地,所述基于所述与会者区域的信息以及所述多媒体数据,得到与会多媒体数据,包括:
对所述多媒体数据进行人物识别,以确定所述多媒体数据中人物所在的区域;
基于所述与会者区域的信息以及所述人物所在的区域,对所述多媒体数据进行处理,得到所述与会多媒体数据;其中,所述与会视屏数据中,所述人物所在的区域所占的比例大于指定值。
可选地,所述方法还包括:
向所述会议装置发送布局信息,所述布局信息包括所述会议画面中至少两个与会者区域的信息,以及所述至少两个与会者区域和参与所述视频会议的至少两个终端的对应关系信息。
可选地,所述获取会议装置提供的终端对应的与会者区域的信息,包括:
向所述会议装置发送布局申请信息;
接收所述会议装置反馈的所述终端对应的与会者区域的信息。
可选地,所述接收所述会议装置提供的会议视频数据,包括:
向所述会议装置发送布局调整信息;
接收所述会议装置基于所述布局调整信息调整后的会议视频数据。
可选地,所述展示所述会议视频数据,包括:
接收调整指令;
展示根据所述调整指令调整后的会议视频数据。
另一方面,本申请实施例提供一种视频会议实现装置,所述视频会议实现装置包括:
布局获取模块,用于获取视频会议的会议画面的布局信息,所述布局信息包括所述会议画面中至少两个与会者区域的信息,以及所述至少两个与会者区域和参与所述视频会议的至少两个终端的对应关系;
发送模块,用于向所述至少两个终端发送所述至少两个终端对应的与会者区域的信息;
接收模块,用于接收所述至少两个终端基于对应的与会者区域的信息提供的与会多媒体数据;
会议视频获取模块,用于基于所述与会多媒体数据获取会议视频数据;
会议视频发送模块,用于向所述至少两个终端发送所述会议视频数据。
另一方面,本申请实施例提供一种视频会议实现装置,所述视频会议实现装置包括:
区域获取模块,用于获取会议装置提供的终端对应的与会者区域的信息;
视频获取模块,用于获取视频数据;
与会视频获取模块,用于基于所述与会者区域的信息以及所述多媒体数据,得到与会多媒体数据;
与会视频提供模块,用于向所述会议装置提供所述与会多媒体数据。
另一方面,本申请实施例提供一种视频会议实现系统,所述系统包括会议装置以及至少两个终端;
所述会议装置用于获取视频会议的会议画面的布局信息,所述布局信息包括所述会议画面中至少两个与会者区域的信息,以及所述至少两个与会者区域和参与所述视频会议的至少两个终端的对应关系信息;
所述会议装置用于向所述至少两个终端发送对应的与会者区域的信息;
所述终端用于获取多媒体数据;
所述终端用于基于所述与会者区域的信息以及所述多媒体数据,得到与会多媒体数据;
所述终端用于向所述会议装置提供所述与会多媒体数据;
所述会议装置用于基于所述与会多媒体数据获取会议视频数据;
所述会议装置用于向所述至少两个终端发送所述会议视频数据;
所述终端用于展示所述会议视频数据。
可选地,所述会议装置用于在显示屏展示所述会议视频数据;
所述至少两个终端中的第二终端用于向所述会议装置发送布局调整信息;
所述会议装置用于向所述至少两个终端中的第一终端发送所述会议视频数据,并向所述第二终端发送基于所述布局调整信息调整后的会议视频数据;
所述第一终端用于展示所述会议视频数据,所述第二终端用于展示所述调整后的会议视频数据。
另一方面,本申请实施例提供一种会议装置,所述会议装置包括处理器和存储器,所述存储器中存储有至少一条指令、至少一段程序、代码集或指令集,所述至少一条指令、所述至少一段程序、所述代码集或指令集由所述处理器加载并执行以实现如上述的视频会议实现方法。
根据本申请的另一方面,提供一种终端,所述会议装置包括处理器和存储器,所述存储器中存储有至少一条指令、至少一段程序、代码集或指令集,所述至少一条指令、所述至少一段程序、所述代码集或指令集由所述处理器加载并执行以实现如上述的视频会议实现方法。
根据本申请的另一方面,提供一种计算机非瞬态存储介质,所述计算机非瞬态存储介质中存储有至少一条指令、至少一段程序、代码集或指令集,所述至少一条指令、所述至少一段程序、所述代码集或指令集由处理器加载并执行以实现如上述的视频会议实现方法。
根据本申请的另一方面,提供了一种计算机程序产品或计算机程序,该计算机程序产品或计算机程序包括计算机指令,该计算机指令存储在计算机可读存储介质中。计算机设备的处理器从计算机可读存储介质读取该计算机指令,处理器执行该计算机指令,使得该计算机设备执行上述各种可选实现方式中提供的方法。
本申请实施例提供的技术方案带来的有益效果至少包括:
通过获取视频会议的会议画面的布局信息,并将该布局信息发送给参与视频会议的终端,以使参与该视频会议的终端能够根据该布局信息来采集符合该布局信息的与会多媒体数据,之后参与该视频会议的终端将各自得到的与会多媒体数据给会议装置后,会议装置即可以生成尺寸匹配的会议视频数据,并向参与该视频会议的终端提供该会议视频数据,会议视频数据的显示效果较好。解决了相关技术中会议视频数据的效果较差的问题,提升了视频会议中会议视频数据的显示效果。
为了更清楚地说明本申请实施例中的技术方案,下面将对实施例描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本申请的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他的附图。
图1是一种视频会议的会议画面的显示示意图;
图2是本申请实施例所涉及的一种实施环境的结构示意图;
图3是本申请实施例提供的一种视频会议实现方法的流程图;
图4是本申请实施例提供的另一种视频会议实现方法的流程图;
图5是本申请实施例提供的另一种视频会议实现方法的流程图;
图6是图5所示的实施例中一种发送与会者区域的信息的流程图;
图7是图5所示的实施例中一种获取与会多媒体数据的流程图;
图8是图5所示的实施例中一种视频会议的会议画面的示意图;
图9是图5所示的实施例中一种终端的显示示意图;
图10是图5所示的实施例中一种展示会议视频数据的流程图;
图11是本申请实施例提供的一种视频会议实现装置的框图;
图12是本申请实施例提供的另一种视频会议实现装置的框图;
图13是本申请实施例提供的一种视频会议实现系统的结构框图;
图14是本申请实施例提供的一种会议装置的结构示意图;
图15是本申请实施例提供的一种终端的结构框图。
通过上述附图,已示出本申请明确的实施例,后文中将有更详细的描述。这些附图和文字描述并不是为了通过任何方式限制本申请构思的范围,而是通过参考特定实施例为本领域技术人员说明本申请的概念。
为使本申请的目的、技术方案和优点更加清楚,下面将结合附图对本申请实施方式作进一步地详细描述。
视频会议系统,又称会议电视系统,这是指两个或两个以上不同地方的个人或群体,通过传输线路及多媒体设备,将声音、影像及文件资料互传,实现即时且互动的沟通,以实现远程会议的系统设备。
目前的一种视频会议实现方式中,由各个终端通过各自的摄像头来采集与会人员的视频数据,各个终端采集到视频数据后,会将采集到的视频数据发送给会议装置,会议装置将接收到的多个终端发送的视频数据合并为一个会议视频数据,显示屏显示该会议视频数据时,各个参与视频会议的终端所采集的视频数据会在显示屏的不同区域进行显示。示例性的,如图1所示,其为一种视频会议的会议画面的显示示意图,其中,该会议画面包括4个区域,这四个区域分别显示有终端A、终端B、终端C和终端D所提供的视频数据11、12、13、14。参与视频会议的各个终端的显示屏上均可以显示有该会议画面,通过该会议画面即可以实现视频会议的技术效果。
可以看出,上述视频会议的实现方式中,参与视频会议的不同终端所采集 的视频数据的尺寸可能不同,例如一个与会人员通过笔记本电脑的摄像头来采集视频数据,另一个与会人员通过手机的摄像头来采集视频数据,进而导致采集的视频数据的各种参数(如画面比例、分辨率和码率等)不一致,如此使得会议装置所生成的会议画面中各个终端的视频数据的尺寸与会议画面的尺寸不匹配的问题,例如图1所示会议画面中的斜线阴影区域。如此会导致视频会议的显示效果较差的问题。
图2是本申请实施例所涉及的一种实施环境的结构示意图。该实施环境包括会议装置21和多个终端22。
会议装置21可以包括一个服务器或者多个服务器构成的服务器集群,会议装置21可以通过,有线连接或无线连接的方式与终端22建立连接。此外,在一些实施方式中,会议装置21也可以与终端22为同一种硬件设备,本申请实施例对此不进行限制。会议装置21中可以包括多点控制单元(Multi point Control Unit,MCU),用于实现多端的呼叫和连接。
终端22可以包括手机、平板电脑、台式计算机、笔记本型计算机以及各种智能可穿戴设备。终端22可以具有显示屏、麦克风、扬声器、扬声器连接组件(如扬声器接口或无线连接模块等)等组件的一种或多种,以实现视频会议的各种功能。
图3是本申请实施例提供的一种视频会议实现方法的流程图,本实施例以该视频会议实现方法应用于图2所示实施环境中的会议装置中来举例说明。该视频会议实现方法可以包括如下几个步骤:
步骤301、获取视频会议的会议画面的布局信息,布局信息包括会议画面中至少两个与会者区域的信息,以及至少两个与会者区域和参与视频会议的至少两个终端的对应关系信息。
步骤302、向该至少两个终端发送对应的与会者区域的信息。
步骤303、接收该至少两个终端基于对应的与会者区域的信息提供的与会多媒体数据。
步骤304、基于与会多媒体数据获取会议视频数据。
步骤305、向该至少两个终端发送会议视频数据。
其中,与会多媒体数据可以包括音频数据、视频数据、图像数据以及文本 数据中的一种或多种。
综上所述,本申请实施例提供的视频会议实现方法,通过获取视频会议的会议画面的布局信息,并将该布局信息发送给参与视频会议的终端,以使参与该视频会议的终端能够根据该布局信息来采集符合该布局信息的与会多媒体数据,之后参与该视频会议的终端将各自得到的与会多媒体数据给会议装置后,会议装置即可以生成尺寸匹配的会议视频数据,并向参与该视频会议的终端提供该会议视频数据,会议视频数据的显示效果较好。解决了相关技术中会议视频数据的效果较差的问题,提升了视频会议中会议视频数据的显示效果。
此外,一种视频会议的实现方式中,会议装置会对参与视频会议的终端上传的视频数据进行裁剪和压缩等处理,以使该视频数据能够符合视频会议的会议画面的各种要求,但是,这显然会严重增加会议装置的运算负担。
而本申请实施例提供的视频会议实现方法中,会议装置可以不对视频图像进行压缩和裁剪等处理,而由参与视频会议的多个终端各自对采集的视频数据,按照会议装置发送的与会者区域的信息的指示进行处理,可大大减少会议装置的运算负载,进而能够提高视频会议的流畅程度。
图4是本申请实施例提供的另一种视频会议实现方法的流程图,本实施例以该视频会议实现方法应用于图2所示实施环境中的终端中来举例说明。该视频会议实现方法可以包括如下几个步骤:
步骤401、获取会议装置提供的终端对应的与会者区域的信息。
步骤402、获取多媒体数据。
步骤403、基于与会者区域的信息以及多媒体数据,得到与会多媒体数据。
步骤404、向会议装置提供与会多媒体数据。
综上所述,本申请实施例提供的视频会议实现方法,参与视频会议的终端能够根据会议装置提供的布局信息来采集符合该布局信息的与会多媒体数据,之后参与该视频会议的终端将各自得到的与会多媒体数据给会议装置后,会议装置即可以生成尺寸匹配的会议视频数据,并向参与该视频会议的终端提供该会议视频数据,会议视频数据的显示效果较好。解决了相关技术中会议视频数据的效果较差的问题,提升了视频会议中会议视频数据的显示效果。
图5是本申请实施例提供的另一种视频会议实现方法的流程图,本实施例 以该视频会议实现方法应用于图2所示实施环境中来举例说明。该视频会议实现方法可以包括如下几个步骤:
步骤501、会议装置接收参与视频会议的至少两个终端中的主控终端发送的布局信息。
其中,布局信息包括会议画面中至少两个与会者区域的信息,以及至少两个与会者区域和参与视频会议的至少两个终端的对应关系信息。该视频会议的会议画面中,这至少两个与会者区域用于显示参与视频会议的终端所提供的视频数据。而与会者区域的信息可以包括该与会者区域的各种参数,例如区域形状、分辨率、码率、颜色以及区域边缘长度等。
通过该与会者区域的信息,终端即可以将采集到的视频数据处理为与该与会者区域所匹配的与会多媒体数据,也即是正好可以在与会者区域进行展示的视频数据。
参与视频会议的终端可以在本步骤之前进行确定。示例性的,会议装置可以建立视频会议,终端可以通过验证信息(如用户名和密码等)以参与该视频会议。在视频会议开始之前,会议装置可以确定参与视频会议的终端。
当然,视频会议开始之后,也可以有新的终端参与视频会议,对于此种情况,可以通过重新执行步骤501,来重新确定布局信息。
主控终端可以是参与视频会议的多个终端中的一个终端,该终端可以用于对视频会议进行控制。步骤501提供的是获取布局信息的一种方式,但是,会议装置还可以通过其他方式来获取布局信息,例如,会议装置本地预存有布局信息等,本申请实施例对此不进行限制。
步骤502、会议装置向参与视频会议的至少两个终端对应的与会者区域的信息。
会议装置可以向每个参与视频会议的终端发送每个终端对应的与会者区域的信息。例如,参与视频会议的终端A与视频画面中的与会者区域51对应,参与视频会议的终端B与视频画面中的与会者区域52对应,则会议装置可以将与会者区域51的信息发送给终端A,并将与会者区域52的信息发送给终端B。
在一种示例性实施例中,会议装置也可以将布局信息发送给每一个参与视频会议的终端,也即是每个参与视频会议的终端都可以收到所有与会者区域的信息。
在步骤502中,会议装置可以直接向参与视频会议的终端发送对应的与会 者区域的信息,或者,可以如图6所示,其为步骤502中另一种发送与会者区域的信息的方式,该方式可以包括:
子步骤5021、会议装置接收参与视频会议的至少两个终端中的第二终端发送的布局申请信息。
第二终端可以是参与视频会议的终端中的一个终端,该终端可以主动向会议装置发送布局申请信息。
子步骤5022、会议装置基于布局申请信息,向第二终端发送第二终端对应的与会者区域的信息。
会议装置在接收到第二终端的布局申请信息后,可以向第二终端发送第二终端对应的与会者区域的信息。
参与视频会议的每个终端,均可以在准备好进行视频会议后,向会议装置发送布局申请信息,以获取对应的与会者区域的信息。
步骤503、终端获取多媒体数据。
该多媒体数据可以包括参与视频会议的终端通过摄像组件采集视频数据,或者是终端提供的共享视频数据,或者是终端提供的图像、音频数据等。
其中,摄像组件采集视频数据可以是与会者自身的视频数据,或者是与会者周围环境的视频数据,示例性的,当终端为手机时,可以通过手机的前置摄像头(或者后置摄像头)来采集与会者的视频数据。
共享视频数据可以包括与会者要进行展示的一些画面,如幻灯片演示文稿(如PPT)、文档、表格、图片、存储在终端本地的视频、音频、网页或其它多媒体信息,这些信息可以通过录制显示屏窗口或应用程序窗口来提供。
步骤504、终端基于与会者区域的信息以及多媒体数据,得到与会多媒体数据。
终端采集到多媒体数据后,可以基于与会者区域的信息对多媒体数据进行处理,得到与会多媒体数据。
示例性的,多媒体数据可以为终端采集到分辨率为800×600的视频数据,而与会者区域的信息中记载的分辨率为200×300,则终端可以将该分辨率为800×600的视频数据裁剪为分辨率为200×300的视频数据,该分辨率为200×300的视频数据即可以作为与会多媒体数据。此外,终端还可以根据与会者区域的信息对视频数据的其它参数进行调整,如码率和颜色等。
在一种示例性的实施例中,如图7所示,步骤504可以包括下面两个子步 骤:
子步骤5041、终端对多媒体数据进行人物识别,以确定多媒体数据中人物所在的区域。
终端可以通过人物识别技术,来识别出多媒体数据(例如为视频数据)中人脸(或者人脸以及肩部)所在的区域。
示例性的,终端可以对视频数据的每一帧图像进行区域划分,分成多个的小尺寸图像,通过adaboost算法(一种迭代算法)依次识别各区域是否包含人的头肩部特征,如果判断为是,则记录区域坐标以及尺寸,如果判断为否,则不记录坐标信息,以此类推,对整个像进行遍历搜索,从而确定图像中的头肩部所占的区域。
子步骤5042、终端基于与会者区域的信息以及人物所在的区域,对多媒体数据进行处理,得到与会多媒体数据。
终端可以基于与会者区域的信息以及人物所在的区域,对采集到的多媒体数据(例如为视频数据)进行裁剪的压缩等操作,以使该与会视屏数据中,人物所在的区域所占的比例大于指定值,示例性的,该指定值为50%-75%。
在一个示例性的实施例中,与会者区域的信息记录的分辨率为200×300,终端采集到的为分辨率为800×600的视频数据,则终端可以将该视频数据处理为分辨率为200×300,且人物所在区域所占的区域比例为75%的与会多媒体数据。
通过人物识别,以使与会多媒体数据中,人物所在的区域较大,可以凸显视频画面中的人物,提升视频会议的效果。
步骤505、终端向会议装置提供与会多媒体数据。
终端可以实时处理得到与会多媒体数据,并将该与会多媒体数据实时传输至会议装置。
步骤506、会议装置基于与会多媒体数据获取会议视频数据。
会议装置可以获取参与视频会议的多个终端提供的与会多媒体数据,并将这多个与会多媒体数据合并为会议视频数据。会议装置可以根据步骤501中确定的布局信息在会议画面中布局多个与会多媒体数据。
可选地,当与会多媒体数据中包括共享视频数据时,会议装置根据布局信息对共享视频数据进行处理,得到会议视频数据,该会议视频数据中,共享视频数据所占的区域大于除共享视频数据外的其它与会多媒体数据所占的区域, 也即是使共享视频数据的显示区域大于其他的与会多媒体数据的显示区域,以凸显该共享视频数据,进而提高视频会议的显示效果。
其中,布局信息中可以记录有共享视频数据所对应的与会区域的信息,并在步骤501中发送给了提供该共享视频数据的终端,此种情况下,参与视频会议的终端可以预先向主控终端或会议装置发出共享信息,以表明要提供共享视频数据,以使主控终端或会议装置可以确定出对应的布局信息。
此外,也可能存在某个终端临时提供共享视频数据,此种情况下,该终端也可以向主控终端(或会议装置)发出共享信息,并重新执行步骤501,以确定新的布局信息。
共享视频数据中可以具有标识,会议装置可以通过该标识来确定与会多媒体数据中是否包括共享视频数据。
示例性的,如图8所示,其为本申请实施例中一种视频会议的会议画面的示意图,在该会议画面中,共享视频数据81处于最大的区域中进行显示,而其他的显示有与会者的视频数据82显示在其他较小的区域中,如此布局下,所有与会者均可以较为清楚的看到共享视频数据所呈现的画面,提升了视频会议的显示效果。
步骤507、会议装置向参与视频会议的至少两个终端发送会议视频数据。
会议装置可以向参与视频会议的每个终端发送会议视频数据。
在一种示例性的实施例中,会议装置还可以单独调整对一个或多个终端所展示的会议视频数据。示例性的,步骤507可以包括:
1)会议装置接收至少两个终端中的第二终端发送的布局调整信息。
第二终端可以向会议装置发送该布局调整信息,该布局调整信息中,可以包括有不同于上述布局信息的布局方式,例如,相较于上述布局信息,该布局调整信息中,某个终端对应的与会区域的尺寸变大,另一个终端对应的与会区域的尺寸变小。
2)会议装置基于布局调整信息对会议视频数据进行调整。
会议装置可以通过本地的图像处理组件(如MCU)来基于布局调整信息对会议视频数据进行调整。需要说明的是,会议装置可以仅对用于向第二终端发送的会议视频数据进行了调整,对于用于发送给其他未发送布局调整信息的终端(如第一终端),并不调整会议视频数据。
3)会议装置向第二终端发送调整后的会议视频数据。
对应的,会议装置向参与视频会议的除第二终端外的其它终端发送调整前的会议视频数据。如此情况下,第二终端中所显示的会议视频数据就与其它参与视频会议的终端所显示的会议视频数据不同,丰富了视频会议的多样性和用户体验。
步骤508、终端展示会议视频数据。
终端可以在显示屏中展示从会议装置处获取的会议视频数据。示例性的,如图9所示,其为一种终端的显示示意图,该终端90在显示屏中展示了会议视频数据所构成的会议画面91,与会者可以观看该会议画面91,该会议画面91中还可以具有发言按键912、主持切换按键913,发言按键912用于与会者发言时触发,主持切换按键913用于与会者想要申请成为主持人时触发。
在一种示例性实施例中,如图10所示,步骤508可以包括下面两个子步骤:
步骤5081、终端接收调整指令。
该调整指令可以由操作终端的与会者来触发,该调整指令用于调整该终端的显示屏所呈现的会议画面的布局。
示例性的,当操作终端A的与会者想要突出显示与会者B的终端提供的视频数据时,可以在终端A触发调整指令,以调整会议画面的布局方式,使与会者B提供的视频数据的显示区域变大(相应的,其他与会者的终端提供的视频数据所占的区域可能会缩小)。
步骤5082、终端展示根据调整指令调整后的会议视频数据。
终端可以根据调整指令来对会议装置提供的会议视频数据进行调整,得到调整后的会议视频数据,并在显示屏展示该调整后的会议视频数据。终端可以通过本地的图像处理组件(如图形处理器(Graphics Processing Unit,GPU))来对会议视频数据实现动态调整。
需要说明的是,该调整指令用于调整终端本地的显示屏所呈现的会议画面的布局,但不会对其他参与视频会议的终端中所呈现的会议画面产生影响。
示例性的,操作终端A的与会者想要对会议画面的布局进行更改,则可以在终端A中触发调整指令,以调整终端A的显示屏中呈现的会议画面的布局。但是,终端B的显示屏所呈现的会议画面的布局可以仍为会议装置所提供的会议视频数据中的布局。
步骤509、会议装置在显示屏展示会议视频数据。
在一种示例性的实施例中,会议装置也具有显示屏,继而会议装置可以在 该显示屏上展示会议视频数据,在此基础上,会议装置处也可以存在一个或多个与会者,这一个或多个与会者可以通过该会议装置来参与视频会议,并通过该会议装置来对视频会议进行控制,如控制会议视频数据的布局信息等。
步骤510、会议装置在主控指令指示参与视频会议的至少两个终端中第一终端为主控终端时,将第一终端确定为主控终端。
在本申请实施例提供的视频会议实现方法中,主控终端还可以进行更改,例如当会议装置在主控指令指示参与视频会议的至少两个终端中第一终端为主控终端时,会议装置即可以将第一终端确定为主控终端。
其中,该主控指令可以包括第一终端触发的发言指令、第一终端触发的主控切换指令、当前时刻之前的主控终端(也即是由上一任主控终端进行确认)触发的主控切换指令、会议装置触发的控制指令中的至少一种。例如,当会议装置具有显示屏时,则可以通过会议装置来触发控制指令,以对主控终端进行调整和控制。此外,会议视频数据中,还可以对主控终端对应的与会区域添加标识,以使各个终端能够辨识出主控终端,便于视频会议的进行。
通过主控终端的切换,可以使视频会议实现方法更加灵活。
综上所述,本申请实施例提供的视频会议实现方法,通过获取视频会议的会议画面的布局信息,并将该布局信息发送给参与视频会议的终端,以使参与该视频会议的终端能够根据该布局信息来采集符合该布局信息的与会多媒体数据,之后参与该视频会议的终端将各自得到的与会多媒体数据给会议装置后,会议装置即可以生成尺寸匹配的会议视频数据,并向参与该视频会议的终端提供该会议视频数据,会议视频数据的显示效果较好。解决了相关技术中会议视频数据的效果较差的问题,提升了视频会议中会议视频数据的显示效果。
此外,本申请实施例提供的视频会议实现方法中,会议装置可以不对视频图像进行压缩和裁剪等处理,而由参与视频会议的多个终端各自对采集的视频数据,按照会议装置发送的与会者区域的信息的指示进行处理,可大大减少会议装置的运算负载,进而能够提高视频会议的流畅程度。
图11是本申请实施例提供的一种视频会议实现装置的框图,该视频会议实现装置1100包括:
布局获取模块1110,用于获取视频会议的会议画面的布局信息,布局信息包括会议画面中至少两个与会者区域的信息,以及至少两个与会者区域和参与 视频会议的至少两个终端的对应关系信息。
发送模块1120,用于向至少两个终端发送至少两个终端对应的与会者区域的信息。
接收模块1130,用于接收至少两个终端基于对应的与会者区域的信息提供的与会多媒体数据。
会议视频获取模块1140,用于基于与会多媒体数据获取会议视频数据。
会议视频发送模块1150,用于向至少两个终端发送会议视频数据。
综上所述,本申请实施例提供的视频会议实现装置,通过获取视频会议的会议画面的布局信息,并将该布局信息发送给参与视频会议的终端,以使参与该视频会议的终端能够根据该布局信息来采集符合该布局信息的与会多媒体数据,之后参与该视频会议的终端将各自得到的与会多媒体数据给会议装置后,会议装置即可以生成尺寸匹配的会议视频数据,并向参与该视频会议的终端提供该会议视频数据,会议视频数据的显示效果较好。解决了相关技术中会议视频数据的效果较差的问题,提升了视频会议中会议视频数据的显示效果。
可选地,布局获取模块,用于:
接收至少两个终端中主控终端发送的布局信息。
可选地,会议装置还包括:
主持确认模块,用于在主控指令指示至少两个终端中第一终端为主控终端时,将第一终端确定为主控终端。
可选地,主控指令包括第一终端触发的发言指令、第一终端触发的主控切换指令、当前时刻之前的主控终端触发的主控切换指令中的至少一种。
可选地,发送模块,用于:
接收至少两个终端中的第二终端发送的布局申请信息;
基于布局申请信息,向第二终端发送第二终端对应的与会者区域的信息。
可选地,会议视频获取模块,包括:
当与会多媒体数据中包括共享视频数据时,根据共享布局信息对共享视频数据进行处理,得到会议视频数据,会议视频数据中,共享视频数据所占的区域大于除共享视频数据外的其它与会多媒体数据所占的区域。
可选地,该会议装置还包括:
会议展示模块,用于在显示屏展示会议视频数据。
图12是本申请实施例提供的另一种视频会议实现装置的框图,该视频会议实现装置1200包括:
区域获取模块1210,用于获取会议装置提供的终端对应的与会者区域的信息;
视频获取模块1220,用于获取多媒体数据;
与会视频获取模块1230,用于基于与会者区域的信息以及多媒体数据,得到与会多媒体数据;
与会视频提供模块1240,用于向会议装置提供与会多媒体数据;
会议数据接收模块1250,用于接收会议装置提供的会议视频数据;
会议数据展示模块1260,用于展示会议视频数据。
综上所述,本申请实施例提供的视频会议实现装置,参与视频会议的终端能够根据会议装置提供的布局信息来采集符合该布局信息的与会多媒体数据,之后参与该视频会议的终端将各自得到的与会多媒体数据给会议装置后,会议装置即可以生成尺寸匹配的会议视频数据,并向参与该视频会议的终端提供该会议视频数据,会议视频数据的显示效果较好。解决了相关技术中会议视频数据的效果较差的问题,提升了视频会议中会议视频数据的显示效果。
可选地,与会视频获取模块,用于:
对视频数据进行人物识别,以确定视频数据中人物所在的区域;
基于与会者区域的信息以及人物所在的区域,对视频数据进行处理,得到与会多媒体数据,与会视屏数据中,人物所在的区域所占的比例大于指定值。
可选地,终端还包括:
布局信息发送模块,用于向会议装置发送布局信息,布局信息包括会议画面中至少两个与会者区域的信息,以及至少两个与会者区域和参与视频会议的至少两个终端的对应关系信息。
可选地,区域获取模块,用于:
向会议装置发送布局申请信息;
接收会议装置反馈的终端对应的与会者区域的信息。
可选地,会议数据展示模块,用于:
接收调整指令;
展示根据调整指令调整后的会议视频数据。
图13是本申请实施例提供的一种视频会议实现系统的结构框图,该系统1300包括会议装置131以及至少两个终端132。
会议装置131用于获取视频会议的会议画面的布局信息,布局信息包括会议画面中至少两个与会者区域的信息,以及至少两个与会者区域和参与视频会议的至少两个终端的对应关系信息。
会议装置131用于向至少两个终端发送至少两个终端对应的与会者区域的信息。
终端132用于获取多媒体数据。
终端132用于基于与会者区域的信息以及多媒体数据,得到与会多媒体数据。
终端132用于向会议装置提供与会多媒体数据。
会议装置131用于基于与会多媒体数据获取会议视频数据。
会议装置131用于向至少两个终端发送会议视频数据;
终端132用于展示会议视频数据。
此外,本申请实施例中,会议装置131和终端132所执行的方法还可以参考图5所示的实施例,在此不再赘述。
综上所述,本申请实施例提供的视频会议实现系统,通过获取视频会议的会议画面的布局信息,并将该布局信息发送给参与视频会议的终端,以使参与该视频会议的终端能够根据该布局信息来采集符合该布局信息的与会多媒体数据,之后参与该视频会议的终端将各自得到的与会多媒体数据给会议装置后,会议装置即可以生成尺寸匹配的会议视频数据,并向参与该视频会议的终端提供该会议视频数据,会议视频数据的显示效果较好。解决了相关技术中会议视频数据的效果较差的问题,提升了视频会议中会议视频数据的显示效果。
可选地,会议装置131用于在显示屏展示会议视频数据。
至少两个终端中的第二终端1321用于向会议装置131发送布局调整信息。
会议装置131用于向至少两个终端中的第一终端1322(第一终端可以是未向会议装置发送布局调整信息的终端)发送会议视频数据,并向第二终端1321发送基于布局调整信息调整后的会议视频数据。
第一终端1322用于展示会议视频数据,第二终端1321用于展示调整后的会议视频数据。也即是未向会议装置发送布局调整信息的终端所展示的会议视频数据与向会议装置发送了布局调整信息的终端所展示的会议视频数据不同。
根据本申请的另一方面,提供了一种计算机程序产品或计算机程序,该计算机程序产品或计算机程序包括计算机指令,该计算机指令存储在计算机可读存储介质中。计算机设备的处理器从计算机可读存储介质读取该计算机指令,处理器执行该计算机指令,使得该计算机设备执行上述各种可选实现方式中提供的方法。
图14是本申请实施例提供的一种会议装置1400的结构示意图,该会议装置1400可以为服务器。示例地,如图14所示,该会议装置1400包括中央处理单元(Central Processing Unit,CPU)1401、存储器1402,以及连接存储器1402和中央处理单元1401的系统总线1403,存储器1402可以包括诸如硬盘或者紧凑型光盘只读储存器(Compact Disc Read-Only Memory,CD-ROM)之类的计算机可读介质(未示出)。
不失一般性,计算机可读存储介质可以包括计算机存储介质(可以为非瞬态的存储介质)和通信介质。计算机存储介质包括以用于存储诸如计算机可读指令、数据结构、程序模块或其他数据等信息的任何方法或技术实现的易失性和非易失性、可移动和不可移动介质。计算机存储介质包括随机存取存储器(Random Access Memory,RAM)、只读存储器(Read-Only Memory,ROM)、可擦除可编程只读存储器(Erasable Programmable Read Only Memory,EPROM)、带电可擦写可编程只读存储器(Electrically Erasable Programmable Read-Only Memory,EEPROM)、闪存或其他固态存储其技术,CD-ROM、数字多功能光盘(Digital Versatile Disc,DVD)或其他光学存储、磁带盒、磁带、磁盘存储或其他磁性存储设备。当然,本领域技术人员可知计算机存储介质不局限于上述几种。
上述存储器1402还包括一个或者一个以上的程序,一个或者一个以上程序存储于存储器中,被配置由CPU执行,以实现本申请实施例提供的视频会议实现方法。
图15示出了本申请实施例提供的一种终端1500的结构框图。该终端1500可以是:智能手机、平板电脑、动态影像专家压缩标准音频层面3(Moving Picture Experts Group Audio Layer III,MP3)播放器、动态影像专家压缩标准音频层面 4(Moving Picture Experts Group Audio Layer IV,MP4)播放器、笔记本电脑或台式电脑。终端1500还可能被称为用户设备、便携式终端、膝上型终端、台式终端等其他名称。
通常,终端1500包括有:处理器1501和存储器1502。
处理器1501可以包括一个或多个处理核心,比如4核心处理器、8核心处理器等。处理器1501可以采用数字信号处理(Digital Signal Processing,DSP)、现场可编程门阵列(Field-Programmable Gate Array,FPGA)、可编程逻辑阵列(Programmable Logic Array,PLA)中的至少一种硬件形式来实现。处理器1501也可以包括主处理器和协处理器,主处理器是用于对在唤醒状态下的数据进行处理的处理器,也称中央处理器(Central Processing Unit,CPU);协处理器是用于对在待机状态下的数据进行处理的低功耗处理器。在一些实施例中,处理器1501可以在集成有图像处理器(Graphics Processing Unit,GPU),GPU用于负责显示屏所需要显示的内容的渲染和绘制。一些实施例中,处理器1501还可以包括人工智能(Artificial Intelligence,AI)处理器,该AI处理器用于处理有关机器学习的计算操作。
存储器1502可以包括一个或多个计算机可读存储介质(可以为非瞬态的存储介质),该计算机可读存储介质可以是非暂态的。存储器1502还可包括高速随机存取存储器,以及非易失性存储器,比如一个或多个磁盘存储设备、闪存存储设备。在一些实施例中,存储器1502中的非暂态的计算机可读存储介质用于存储至少一个指令,该至少一个指令用于被处理器1501所执行以实现本申请中方法实施例提供的视频会议实现方法。
在一些实施例中,终端1500还可选包括有:外围设备接口1503和至少一个外围设备。处理器1501、存储器1502和外围设备接口1503之间可以通过总线或信号线相连。各个外围设备可以通过总线、信号线或电路板与外围设备接口1503相连。具体地,外围设备包括:射频电路1504、触摸显示屏1505、摄像头1506、音频电路1507、定位组件1508和电源1509中的至少一种。
外围设备接口1503可被用于将输入/输出(Input/Output,I/O)相关的至少一个外围设备连接到处理器1501和存储器1502。在一些实施例中,处理器1501、存储器1502和外围设备接口1503被集成在同一芯片或电路板上;在一些其他实施例中,处理器1501、存储器1502和外围设备接口1503中的任意一个或两个可以在单独的芯片或电路板上实现,本实施例对此不加以限定。
射频电路1504用于接收和发射射频(Radio Frequency,RF)信号,也称电磁信号。射频电路1504通过电磁信号与通信网络以及其他通信设备进行通信。射频电路1504将电信号转换为电磁信号进行发送,或者,将接收到的电磁信号转换为电信号。可选地,射频电路1504包括:天线系统、RF收发器、一个或多个放大器、调谐器、振荡器、数字信号处理器、编解码芯片组、用户身份模块卡等等。射频电路1504可以通过至少一种无线通信协议来与其它终端进行通信。该无线通信协议包括但不限于:万维网、城域网、内联网、各代移动通信网络(2G、3G、4G及5G)、无线局域网和/或无线保真(Wireless Fidelity,WiFi)网络。在一些实施例中,射频电路1504还可以包括近距离无线通信(Near Field Communication,NFC)有关的电路,本申请对此不加以限定。
显示屏1505用于显示用户界面(User Interface,UI)。该UI可以包括图形、文本、图标、视频及其它们的任意组合。当显示屏1505是触摸显示屏时,显示屏1505还具有采集在显示屏1505的表面或表面上方的触摸信号的能力。该触摸信号可以作为控制信号输入至处理器1501进行处理。此时,显示屏1505还可以用于提供虚拟按钮和/或虚拟键盘,也称软按钮和/或软键盘。在一些实施例中,显示屏1505可以为一个,设置终端1500的前面板;在另一些实施例中,显示屏1505可以为至少两个,分别设置在终端1500的不同表面或呈折叠设计;在再一些实施例中,显示屏1505可以是柔性显示屏,设置在终端1500的弯曲表面上或折叠面上。甚至,显示屏1505还可以设置成非矩形的不规则图形,也即异形屏。显示屏1505可以采用液晶显示屏(Liquid Crystal Display,LCD)、有机发光二极管(Organic Light-Emitting Diode,OLED)等材质制备。
摄像头组件1506用于采集图像或视频。可选地,摄像头组件1506包括前置摄像头和后置摄像头。通常,前置摄像头设置在终端的前面板,后置摄像头设置在终端的背面。在一些实施例中,后置摄像头为至少两个,分别为主摄像头、景深摄像头、广角摄像头、长焦摄像头中的任意一种,以实现主摄像头和景深摄像头融合实现背景虚化功能、主摄像头和广角摄像头融合实现全景拍摄以及虚拟现实(Virtual Reality,VR)拍摄功能或者其它融合拍摄功能。在一些实施例中,摄像头组件1506还可以包括闪光灯。闪光灯可以是单色温闪光灯,也可以是双色温闪光灯。双色温闪光灯是指暖光闪光灯和冷光闪光灯的组合,可以用于不同色温下的光线补偿。
音频电路1507可以包括麦克风和扬声器。麦克风用于采集用户及环境的声 波,并将声波转换为电信号输入至处理器1501进行处理,或者输入至射频电路1504以实现语音通信。出于立体声采集或降噪的目的,麦克风可以为多个,分别设置在终端1500的不同部位。麦克风还可以是阵列麦克风或全向采集型麦克风。扬声器则用于将来自处理器1501或射频电路1504的电信号转换为声波。扬声器可以是传统的薄膜扬声器,也可以是压电陶瓷扬声器。当扬声器是压电陶瓷扬声器时,不仅可以将电信号转换为人类可听见的声波,也可以将电信号转换为人类听不见的声波以进行测距等用途。在一些实施例中,音频电路1507还可以包括耳机插孔。
定位组件1508用于定位终端1500的当前地理位置,以实现导航或基于位置的服务(Location Based Service,LBS)。定位组件1508可以是基于美国的全球定位系统(Global Positioning System,GPS)、中国的北斗系统或俄罗斯的伽利略系统的定位组件。
电源1509用于为终端1500中的各个组件进行供电。电源1509可以是交流电、直流电、一次性电池或可充电电池。当电源1509包括可充电电池时,该可充电电池可以是有线充电电池或无线充电电池。有线充电电池是通过有线线路充电的电池,无线充电电池是通过无线线圈充电的电池。该可充电电池还可以用于支持快充技术。
在一些实施例中,终端1500还包括有一个或多个传感器1510。该一个或多个传感器1510包括但不限于:加速度传感器1511、陀螺仪传感器1512、压力传感器1513、指纹传感器1514、光学传感器1515以及接近传感器1516。
加速度传感器1511可以检测以终端1500建立的坐标系的三个坐标轴上的加速度大小。比如,加速度传感器1511可以用于检测重力加速度在三个坐标轴上的分量。处理器1501可以根据加速度传感器1511采集的重力加速度信号,控制触摸显示屏1505以横向视图或纵向视图进行用户界面的显示。加速度传感器1511还可以用于游戏或者用户的运动数据的采集。
陀螺仪传感器1512可以检测终端1500的机体方向及转动角度,陀螺仪传感器1512可以与加速度传感器1511协同采集用户对终端1500的3D动作。处理器1501根据陀螺仪传感器1512采集的数据,可以实现如下功能:动作感应(比如根据用户的倾斜操作来改变UI)、拍摄时的图像稳定、游戏控制以及惯性导航。
压力传感器1513可以设置在终端1500的侧边框和/或触摸显示屏1505的下 层。当压力传感器1513设置在终端1500的侧边框时,可以检测用户对终端1500的握持信号,由处理器1501根据压力传感器1513采集的握持信号进行左右手识别或快捷操作。当压力传感器1513设置在触摸显示屏1505的下层时,由处理器1501根据用户对触摸显示屏1505的压力操作,实现对UI界面上的可操作性控件进行控制。可操作性控件包括按钮控件、滚动条控件、图标控件、菜单控件中的至少一种。
指纹传感器1514用于采集用户的指纹,由处理器1501根据指纹传感器1514采集到的指纹识别用户的身份,或者,由指纹传感器1514根据采集到的指纹识别用户的身份。在识别出用户的身份为可信身份时,由处理器1501授权该用户执行相关的敏感操作,该敏感操作包括解锁显示屏、查看加密信息、下载软件、支付及更改设置等。指纹传感器1514可以被设置终端1500的正面、背面或侧面。当终端1500上设置有物理按键或厂商Logo时,指纹传感器1514可以与物理按键或厂商Logo集成在一起。
光学传感器1515用于采集环境光强度。在一个实施例中,处理器1501可以根据光学传感器1515采集的环境光强度,控制触摸显示屏1505的显示亮度。具体地,当环境光强度较高时,调高触摸显示屏1505的显示亮度;当环境光强度较低时,调低触摸显示屏1505的显示亮度。在另一个实施例中,处理器1501还可以根据光学传感器1515采集的环境光强度,动态调整摄像头组件1506的拍摄参数。
接近传感器1516,也称距离传感器,通常设置在终端1500的前面板。接近传感器1516用于采集用户与终端1500的正面之间的距离。在一个实施例中,当接近传感器1516检测到用户与终端1500的正面之间的距离逐渐变小时,由处理器1501控制触摸显示屏1505从亮屏状态切换为息屏状态;当接近传感器1516检测到用户与终端1500的正面之间的距离逐渐变大时,由处理器1501控制触摸显示屏1505从息屏状态切换为亮屏状态。
本领域技术人员可以理解,图15中示出的结构并不构成对终端1500的限定,可以包括比图示更多或更少的组件,或者组合某些组件,或者采用不同的组件布置。
本申请中术语“A和B的至少一种”,仅仅是一种描述关联对象的关联关系,表示可以存在三种关系,例如,A和B的至少一种,可以表示:单独存在A, 同时存在A和B,单独存在B这三种情况。同理,“A、B和C的至少一种”表示可以存在七种关系,可以表示:单独存在A,单独存在B,单独存在C,同时存在A和B,同时存在A和C,同时存在C和B,同时存在A、B和C这七种情况。同理,“A、B、C和D的至少一种”表示可以存在十五种关系,可以表示:单独存在A,单独存在B,单独存在C,单独存在D,同时存在A和B,同时存在A和C,同时存在A和D,同时存在C和B,同时存在D和B,同时存在C和D,同时存在A、B和C,同时存在A、B和D,同时存在A、C和D,同时存在B、C和D,同时存在A、B、C和D,这十五种情况。
在本申请中,术语“第一”、“第二”仅用于描述目的,而不能理解为指示或暗示相对重要性。术语“多个”指两个或两个以上,除非另有明确的限定。
在本申请所提供的几个实施例中,应该理解到,所揭露的装置和方法,可以通过其它的方式实现。例如,以上所描述的装置实施例仅仅是示意性的,例如,所述单元的划分,仅仅为一种逻辑功能划分,实际实现时可以有另外的划分方式,例如多个单元或组件可以结合或者可以集成到另一个系统,或一些特征可以忽略,或不执行。另一点,所显示或讨论的相互之间的耦合或直接耦合或通信连接可以是通过一些接口,装置或单元的间接耦合或通信连接,可以是电性,机械或其它的形式。
所述作为分离部件说明的单元可以是或者也可以不是物理上分开的,作为单元显示的部件可以是或者也可以不是物理单元,即可以位于一个地方,或者也可以分布到多个网络单元上。可以根据实际的需要选择其中的部分或者全部单元来实现本实施例方案的目的。
本领域普通技术人员可以理解实现上述实施例的全部或部分步骤可以通过硬件来完成,也可以通过程序来指令相关的硬件完成,所述的程序可以存储于一种计算机可读存储介质中,上述提到的存储介质可以是只读存储器,磁盘或光盘等。
以上所述仅为本申请的可选实施例,并不用以限制本申请,凡在本申请的精神和原则之内,所作的任何修改、等同替换、改进等,均应包含在本申请的保护范围之内。
Claims (20)
- 一种视频会议实现方法,其特征在于,所述方法包括:获取视频会议的会议画面的布局信息,所述布局信息包括所述会议画面中至少两个与会者区域的信息,以及所述至少两个与会者区域和参与所述视频会议的至少两个终端的对应关系信息;向所述至少两个终端发送对应的与会者区域的信息;接收所述至少两个终端基于对应的与会者区域的信息提供的与会多媒体数据;基于所述与会多媒体数据获取会议视频数据;向所述至少两个终端发送所述会议视频数据。
- 根据权利要求1所述的方法,其特征在于,所述获取视频会议的会议画面的布局信息,包括:接收所述至少两个终端中的主控终端发送的所述布局信息。
- 根据权利要求2所述的方法,其特征在于,所述接收所述主控终端发送的所述布局信息之后,所述方法还包括:在主控指令指示所述至少两个终端中的第一终端为主控终端时,将所述第一终端确定为主控终端。
- 根据权利要求3所述的方法,其特征在于,所述主控指令包括所述第一终端触发的发言指令、所述第一终端触发的主控切换指令、当前时刻之前的主控终端触发的主控切换指令中的至少一种。
- 根据权利要求1-4任一所述的方法,其特征在于,所述向所述至少两个终端发送对应的与会者区域的信息,包括:接收所述至少两个终端中的第二终端发送的布局申请信息;基于所述布局申请信息,向所述第二终端发送所述第二终端对应的与会者区域的信息。
- 根据权利要求1-4任一所述的方法,其特征在于,所述基于所述与会多媒体数据获取会议视频数据,包括:当所述与会多媒体数据中包括共享视频数据时,根据所述布局信息对所述共享视频数据进行处理,得到所述会议视频数据;其中,所述会议视频数据中,所述共享视频数据所占的区域大于除所述共享视频数据外的其它与会多媒体数据所占的区域。
- 根据权利要求1-4任一所述的方法,其特征在于,所述基于所述与会多媒体数据获取会议视频数据之后,所述方法还包括:在显示屏展示所述会议视频数据。
- 根据权利要求1-4任一所述的方法,其特征在于,所述向所述至少两个终端发送所述会议视频数据,包括:接收所述至少两个终端中的第二终端发送的布局调整信息;基于所述布局调整信息对所述会议视频数据进行调整;向所述第二终端发送调整后的会议视频数据。
- 一种视频会议实现方法,其特征在于,所述方法包括:获取会议装置提供的终端对应的与会者区域的信息;获取多媒体数据;基于所述与会者区域的信息以及所述多媒体数据,得到与会多媒体数据;向所述会议装置提供所述与会多媒体数据;接收所述会议装置提供的会议视频数据;展示所述会议视频数据。
- 根据权利要求9所述的方法,其特征在于,所述基于所述与会者区域的信息以及所述多媒体数据,得到与会多媒体数据,包括:对所述多媒体数据进行人物识别,以确定所述多媒体数据中人物所在的区域;基于所述与会者区域的信息以及所述人物所在的区域,对所述多媒体数据 进行处理,得到所述与会多媒体数据;其中,所述与会视屏数据中,所述人物所在的区域所占的比例大于指定值。
- 根据权利要求9所述的方法,其特征在于,所述方法还包括:向所述会议装置发送布局信息,所述布局信息包括所述会议画面中至少两个与会者区域的信息,以及所述至少两个与会者区域和参与所述视频会议的至少两个终端的对应关系信息。
- 根据权利要求9-11任一所述的方法,其特征在于,所述获取会议装置提供的终端对应的与会者区域的信息,包括:向所述会议装置发送布局申请信息;接收所述会议装置反馈的所述终端对应的与会者区域的信息。
- 根据权利要求9-11任一所述的方法,其特征在于,所述接收所述会议装置提供的会议视频数据,包括:向所述会议装置发送布局调整信息;接收所述会议装置基于所述布局调整信息调整后的会议视频数据。
- 根据权利要求9-10任一所述的方法,其特征在于,所述展示所述会议视频数据,包括:接收调整指令;展示根据所述调整指令调整后的会议视频数据。
- 一种视频会议实现系统,其特征在于,所述系统包括会议装置以及至少两个终端;所述会议装置用于获取视频会议的会议画面的布局信息,所述布局信息包括所述会议画面中至少两个与会者区域的信息,以及所述至少两个与会者区域和参与所述视频会议的至少两个终端的对应关系信息;所述会议装置用于向所述至少两个终端发送对应的与会者区域的信息;所述终端用于获取多媒体数据;所述终端用于基于所述与会者区域的信息以及所述多媒体数据,得到与会多媒体数据;所述终端用于向所述会议装置提供所述与会多媒体数据;所述会议装置用于基于所述与会多媒体数据获取会议视频数据;所述会议装置用于向所述至少两个终端发送所述会议视频数据;所述终端用于展示所述会议视频数据。
- 根据权利要求15所述的系统,其特征在于,所述会议装置用于在显示屏展示所述会议视频数据;所述至少两个终端中的第二终端用于向所述会议装置发送布局调整信息;所述会议装置用于向所述至少两个终端中的第一终端发送所述会议视频数据,并向所述第二终端发送基于所述布局调整信息调整后的会议视频数据;所述第一终端用于展示所述会议视频数据,所述第二终端用于展示所述调整后的会议视频数据。
- 一种会议装置,其特征在于,所述会议装置包括处理器和存储器,所述存储器中存储有至少一条指令、至少一段程序、代码集或指令集,所述至少一条指令、所述至少一段程序、所述代码集或指令集由所述处理器加载并执行以实现如权利要求1至8任一所述的视频会议实现方法。
- 一种终端,其特征在于,所述会议装置包括处理器和存储器,所述存储器中存储有至少一条指令、至少一段程序、代码集或指令集,所述至少一条指令、所述至少一段程序、所述代码集或指令集由所述处理器加载并执行以实现如权利要求9至14任一所述的视频会议实现方法。
- 一种计算机非瞬态存储介质,其特征在于,所述计算机非瞬态存储介质中存储有至少一条指令、至少一段程序、代码集或指令集,所述至少一条指令、所述至少一段程序、所述代码集或指令集由处理器加载并执行以实现如权利要求1至8任一所述的视频会议实现方法。
- 一种计算机非瞬态存储介质,其特征在于,所述计算机非瞬态存储介质中存储有至少一条指令、至少一段程序、代码集或指令集,所述至少一条指令、所述至少一段程序、所述代码集或指令集由处理器加载并执行以实现如权利要求9至14任一所述的视频会议实现方法。
Priority Applications (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202080002558.7A CN114698409A (zh) | 2020-10-30 | 2020-10-30 | 视频会议实现方法、装置、系统以及存储介质 |
US17/615,881 US20220353434A1 (en) | 2020-10-30 | 2020-10-30 | Method, apparatus and system for implementing video conference, and storage medium |
PCT/CN2020/125289 WO2022088050A1 (zh) | 2020-10-30 | 2020-10-30 | 视频会议实现方法、装置、系统以及存储介质 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/CN2020/125289 WO2022088050A1 (zh) | 2020-10-30 | 2020-10-30 | 视频会议实现方法、装置、系统以及存储介质 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2022088050A1 true WO2022088050A1 (zh) | 2022-05-05 |
Family
ID=81381578
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2020/125289 WO2022088050A1 (zh) | 2020-10-30 | 2020-10-30 | 视频会议实现方法、装置、系统以及存储介质 |
Country Status (3)
Country | Link |
---|---|
US (1) | US20220353434A1 (zh) |
CN (1) | CN114698409A (zh) |
WO (1) | WO2022088050A1 (zh) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2023226974A1 (zh) * | 2022-05-24 | 2023-11-30 | 北京字节跳动网络技术有限公司 | 图像显示方法、装置和电子设备 |
Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1571508A (zh) * | 2003-07-19 | 2005-01-26 | 华为技术有限公司 | 一种实现多画面的方法 |
CN102572368A (zh) * | 2010-12-16 | 2012-07-11 | 中兴通讯股份有限公司 | 分布式视频的处理方法、系统及多点控制单元 |
US20160173824A1 (en) * | 2007-05-21 | 2016-06-16 | Polycom, Inc. | Dynamic adaption of a continuous presence videoconferencing layout based on video content |
CN107948578A (zh) * | 2017-12-28 | 2018-04-20 | 深圳华望技术有限公司 | 视频会议系统传送带宽及分辨率的调整方法及调整装置 |
CN108270989A (zh) * | 2016-12-30 | 2018-07-10 | 中移(杭州)信息技术有限公司 | 一种视频图像处理方法和装置 |
CN108462854A (zh) * | 2017-02-17 | 2018-08-28 | 陈延祚 | 视频图像处理方法及其相关系统 |
CN108718399A (zh) * | 2018-07-09 | 2018-10-30 | 深圳市潮流网络技术有限公司 | 一种基于浏览器页面的视频会议布局方法 |
Family Cites Families (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US10110831B2 (en) * | 2016-09-27 | 2018-10-23 | Panasonic Intellectual Property Management Co., Ltd. | Videoconference device |
US10540971B2 (en) * | 2017-12-15 | 2020-01-21 | Blue Jeans Network, Inc. | System and methods for in-meeting group assistance using a virtual assistant |
CN112788276A (zh) * | 2019-11-11 | 2021-05-11 | 中兴通讯股份有限公司 | 视频流显示方法、发送方法、装置、终端、服务器及介质 |
-
2020
- 2020-10-30 WO PCT/CN2020/125289 patent/WO2022088050A1/zh active Application Filing
- 2020-10-30 CN CN202080002558.7A patent/CN114698409A/zh active Pending
- 2020-10-30 US US17/615,881 patent/US20220353434A1/en active Pending
Patent Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1571508A (zh) * | 2003-07-19 | 2005-01-26 | 华为技术有限公司 | 一种实现多画面的方法 |
US20160173824A1 (en) * | 2007-05-21 | 2016-06-16 | Polycom, Inc. | Dynamic adaption of a continuous presence videoconferencing layout based on video content |
CN102572368A (zh) * | 2010-12-16 | 2012-07-11 | 中兴通讯股份有限公司 | 分布式视频的处理方法、系统及多点控制单元 |
CN108270989A (zh) * | 2016-12-30 | 2018-07-10 | 中移(杭州)信息技术有限公司 | 一种视频图像处理方法和装置 |
CN108462854A (zh) * | 2017-02-17 | 2018-08-28 | 陈延祚 | 视频图像处理方法及其相关系统 |
CN107948578A (zh) * | 2017-12-28 | 2018-04-20 | 深圳华望技术有限公司 | 视频会议系统传送带宽及分辨率的调整方法及调整装置 |
CN108718399A (zh) * | 2018-07-09 | 2018-10-30 | 深圳市潮流网络技术有限公司 | 一种基于浏览器页面的视频会议布局方法 |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2023226974A1 (zh) * | 2022-05-24 | 2023-11-30 | 北京字节跳动网络技术有限公司 | 图像显示方法、装置和电子设备 |
Also Published As
Publication number | Publication date |
---|---|
CN114698409A (zh) | 2022-07-01 |
US20220353434A1 (en) | 2022-11-03 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN110830811B (zh) | 直播互动方法及装置、系统、终端、存储介质 | |
US20220164159A1 (en) | Method for playing audio, terminal and computer-readable storage medium | |
CN111464830B (zh) | 图像显示的方法、装置、系统、设备及存储介质 | |
WO2021043121A1 (zh) | 一种图像换脸的方法、装置、系统、设备和存储介质 | |
US11962897B2 (en) | Camera movement control method and apparatus, device, and storage medium | |
CN111462742B (zh) | 基于语音的文本显示方法、装置、电子设备及存储介质 | |
WO2022134632A1 (zh) | 作品处理方法及装置 | |
CN112788359B (zh) | 直播处理方法、装置、电子设备及存储介质 | |
CN111142838A (zh) | 音频播放方法、装置、计算机设备及存储介质 | |
CN108848405B (zh) | 图像处理方法和装置 | |
CN111818358A (zh) | 音频文件的播放方法、装置、终端及存储介质 | |
CN111083513B (zh) | 直播画面处理方法、装置、终端及计算机可读存储介质 | |
CN110837300B (zh) | 虚拟交互的方法、装置、电子设备及存储介质 | |
CN112738606B (zh) | 音频文件的处理方法、装置、终端及存储介质 | |
CN111953852B (zh) | 通话记录生成方法、装置、终端及存储介质 | |
WO2022088050A1 (zh) | 视频会议实现方法、装置、系统以及存储介质 | |
CN113573120A (zh) | 音频的处理方法及电子设备 | |
CN110942426B (zh) | 图像处理的方法、装置、计算机设备和存储介质 | |
CN114143280A (zh) | 会话显示方法、装置、电子设备及存储介质 | |
CN111064994B (zh) | 视频图像处理方法及装置、存储介质 | |
WO2023125358A1 (zh) | 视频处理方法、装置、计算机设备及存储介质 | |
US20220405879A1 (en) | Method for processing images and electronic device | |
CN115334413B (zh) | 语音信号处理方法、系统、装置及电子设备 | |
CN111091512B (zh) | 图像处理方法及装置、计算机可读存储介质 | |
CN111526145B (zh) | 音频传输反馈的方法、装置、系统、设备及存储介质 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 20959212 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
32PN | Ep: public notification in the ep bulletin as address of the adressee cannot be established |
Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 11-08-2023) |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 20959212 Country of ref document: EP Kind code of ref document: A1 |