WO2022262134A1 - Image display method, apparatus and device, and storage medium - Google Patents

Image display method, apparatus and device, and storage medium Download PDF

Info

Publication number
WO2022262134A1
WO2022262134A1 PCT/CN2021/118489 CN2021118489W WO2022262134A1 WO 2022262134 A1 WO2022262134 A1 WO 2022262134A1 CN 2021118489 W CN2021118489 W CN 2021118489W WO 2022262134 A1 WO2022262134 A1 WO 2022262134A1
Authority
WO
WIPO (PCT)
Prior art keywords
image display
image
target image
target
orientation
Prior art date
Application number
PCT/CN2021/118489
Other languages
French (fr)
Chinese (zh)
Inventor
陈文明
倪世坤
张世明
吕周谨
Original Assignee
深圳壹秘科技有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 深圳壹秘科技有限公司 filed Critical 深圳壹秘科技有限公司
Publication of WO2022262134A1 publication Critical patent/WO2022262134A1/en

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00Television systems
    • H04N7/14Systems for two-way working
    • H04N7/15Conference systems
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N23/00Cameras or camera modules comprising electronic image sensors; Control thereof
    • H04N23/60Control of cameras or camera modules
    • H04N23/695Control of camera direction for changing a field of view, e.g. pan, tilt or based on tracking of objects

Definitions

  • the present application relates to the technical field of video display, and in particular to an image display method, device, equipment and storage medium.
  • the main purpose of this application is to provide an image display method, device, device, and storage medium, aiming to solve the problem that the existing technology can only be used in a single application scenario when conducting a video conversation, and cannot be used according to the number of participants and the meeting Mode selection uses different display methods, technical issues with poor user experience.
  • the application provides an image display method, the method comprising the following steps:
  • the preset private image or the current target image captured by the camera is displayed according to the target image display strategy.
  • the target image display strategy includes a target image panorama display strategy
  • the current target image acquired by the camera is displayed according to the target image panorama display strategy.
  • the target image display strategy includes a target image wide-angle display strategy
  • the current target image acquired by the camera is displayed according to the target image wide-angle display strategy.
  • the target image display policy includes a target image privacy display policy
  • the method before displaying the preset privacy image or the current target image acquired by the camera according to the target image display strategy in response to the lens orientation turning to the lens orientation, the method includes:
  • performing image segmentation on the current conference target image through a preset image segmentation model to obtain a segmented image includes:
  • Segmenting the current meeting target image by using a preset image segmentation model based on the initial segmentation point and the preset direction to obtain a segmented image.
  • the method further includes:
  • the present application also proposes an image display device, the image display device includes:
  • the instruction receiving module is configured to receive the meeting request instruction, determine the target image display mode according to the meeting request instruction, and acquire the current meeting target information;
  • the lens adjustment module is configured to determine the orientation of the lens according to the target image display mode, and adjust the orientation of the lens according to the orientation of the lens;
  • a strategy confirmation module configured to determine a target image display strategy based on the camera orientation and the current meeting target information
  • the image display module is configured to display a preset private image or a current target image captured by a camera according to the target image display policy in response to the lens turning to the lens orientation.
  • an image display device which includes: a memory, a processor, and an image display program stored in the memory and operable on the processor.
  • the above-mentioned image display program is configured to realize the steps of the above-mentioned image display method.
  • the present application also proposes a storage medium, on which an image display program is stored, and when the image display program is executed by a processor, the steps of the above-mentioned image display method are realized.
  • the present application determines the target image display mode according to the conference request command when receiving the conference request command, and obtains the current meeting target information, determines the lens orientation according to the target image display mode, and adjusts the lens according to the lens orientation Orientation: Determine the target image display strategy based on the camera orientation and the current meeting target information. When the lens orientation is rotated to the lens orientation, the preset privacy image or the current target captured by the camera will be displayed according to the target image display strategy. image for display.
  • this application determines the target image display mode through the meeting request command input by the user, and adjusts different lens orientations according to different target image display modes, and is suitable for image display in multiple scenarios.
  • the lens orientation and the current Combining the target information of the participants to select the appropriate target image display strategy, it is possible to select different image display methods according to the target information of the participants, avoiding that the video session can only be used in a single application scenario, and cannot be used according to the participants.
  • the number of people, the conference mode selection uses different display methods, and technical issues such as poor user experience.
  • FIG. 1 is a schematic structural diagram of an image display device in a hardware operating environment involved in an embodiment of the present application
  • FIG. 2 is a schematic flow chart of the first embodiment of the image display method of the present application.
  • FIG. 3 is a schematic diagram of a video conferencing device in an embodiment of an image display method of the present application
  • FIG. 4 is a schematic flow chart of the second embodiment of the image display method of the present application.
  • FIG. 5 is a schematic diagram of a VIP image display strategy of an embodiment of the image display method of the present application.
  • FIG. 6 is a schematic diagram of a moderator image display strategy in an embodiment of the image display method of the present application.
  • FIG. 7 is a schematic diagram of a multi-person conversation image display strategy in an embodiment of the image display method of the present application.
  • FIG. 8 is a schematic diagram of an alternate speech image display strategy in an embodiment of the image display method of the present application.
  • FIG. 9 is a schematic flowchart of a third embodiment of the image display method of the present application.
  • FIG. 10 is a schematic diagram of a wide-angle mode scene of an embodiment of the image display method of the present application.
  • FIG. 11 is a schematic diagram of a target image wide-angle display strategy in an embodiment of the image display method of the present application.
  • FIG. 12 is a schematic flowchart of a fourth embodiment of the image display method of the present application.
  • FIG. 13 is a structural block diagram of the first embodiment of the image display device of the present application.
  • FIG. 1 is a schematic structural diagram of an image display device in a hardware operating environment involved in the solution of the embodiment of the present application.
  • the image display device may include: a processor 1001 , such as a central processing unit (Central Processing Unit, CPU), a communication bus 1002 , a user interface 1003 , a network interface 1004 , and a memory 1005 .
  • the communication bus 1002 is configured to realize connection and communication between these components.
  • the user interface 1003 may include a display screen (Display) and an input unit such as a keyboard (Keyboard).
  • the user interface 1003 may also include a standard wired interface and a wireless interface.
  • the network interface 1004 may optionally include a standard wired interface and a wireless interface (such as a Wireless-Fidelity (Wi-Fi) interface).
  • Memory 1005 can be a high-speed random access memory (Random Access Memory, RAM), can also be a stable non-volatile memory (Non-Volatile Memory, NVM), such as disk storage.
  • RAM Random Access Memory
  • NVM Non-Volatile Memory
  • the memory 1005 may also be a storage device independent of the aforementioned processor 1001 .
  • FIG. 1 does not constitute a limitation on the image display device, and may include more or less components than those shown in the illustration, or combine some components, or arrange different components.
  • the memory 1005 as a storage medium may include an operating system, a network communication module, a user interface module, and an image display program.
  • the network interface 1004 is mainly configured to communicate data with the network server;
  • the user interface 1003 is mainly configured to perform data interaction with the user;
  • the processor 1001, memory 1005 may be set in the image display device, and the image display device calls the image display program stored in the memory 1005 through the processor 1001, and executes the image display method provided in the embodiment of the present application.
  • FIG. 2 is a schematic flowchart of a first embodiment of an image display method of the present application.
  • the image display method includes the following steps:
  • Step S10 receiving a meeting request instruction, determining a target image display mode according to the meeting request instruction, and acquiring current meeting target information.
  • the execution subject of this embodiment may be an image display device, wherein the image display device may be a controller of a video conferencing device, such as a personal computer, a control chip, etc., or other devices capable of video conferencing. device, which is not specifically limited in this embodiment.
  • the meeting request command may be a request command input by the user through the button of the video conference device, the remote control or the mobile APP to control the operation of the video conference device, and the meeting request command may include the start of the conference initial mode signal, in this embodiment, for the selection of the video conferencing device, refer to FIG. 3 , and describe the video conferencing device shown in FIG. 3 as an example.
  • the video conferencing device has five modules: 1. Microphone array module; 2. Motor and drive module; 3. Lens module; 4. Sensor module; 5. Controller;
  • the microphone array module is configured to collect the sound signal of the user during the video conference, and send the sound signal to the controller to detect the sound direction.
  • the microphone array can It is a multi-microphone array, for example: 4-microphone array, 6-microphone array, and 8-microphone array, etc., which is not specifically limited in this embodiment.
  • the motor and the driving module are configured to rotate according to the signal sent by the controller, so that the lens can be rotated to realize the adjustment of the lens orientation.
  • the direction of rotation is not limited, that is, in When adjusting the orientation of the lens, the motor and the driving module may rotate clockwise or counterclockwise, which is not specifically limited in this embodiment.
  • the selection of the lens in the lens module can be an image acquisition lens that can realize image acquisition at an angle of 220°, or other lenses that can achieve the same or similar functions;
  • the sensor module is configured to detect motors and drive module control The lens is rotated, and the orientation of the lens is detected to complete the rotation operation.
  • the target image display mode may be panorama mode, wide-angle mode, privacy mode, etc.
  • the video conferencing device controller may activate the corresponding target image display mode.
  • the current conference participation target information may be information such as the number of people currently participating in the conference, face information, and conference participation.
  • Step S20 Determine the orientation of the lens according to the target image display mode, and adjust the orientation of the lens according to the orientation of the lens.
  • different target image display modes correspond to different lens orientations.
  • 1 indicates a panoramic mode, and the lens orientation corresponding to the panoramic mode is upward;
  • 2 indicates a wide-angle mode , the lens orientation corresponding to the wide-angle mode is facing forward;
  • 3 indicates the privacy mode, and the lens orientation corresponding to the privacy mode is facing downward.
  • the lens is controlled to rotate according to the motor and the driving module, so that the lens orientation is rotated to the lens orientation corresponding to the target image display mode determined according to the conference request input by the user.
  • Step S30 Determine a target image display strategy based on the camera orientation and the current meeting target information.
  • the image display strategy may be the multi-person conversation image display strategy in the panorama mode, the host image display strategy, the VIP image display strategy, and the alternate speech image display strategy, etc., and may also be the wide-angle mode
  • the specific target image display strategy is determined according to the camera orientation and the current meeting target information.
  • the lens orientation is determined by the image display mode. Therefore, in this embodiment, according to three different target image display modes, the lens of the video conferencing device has three orientations, which are: lens Up, Camera Forward, and Camera Down.
  • Step S40 In response to the camera turning to the camera orientation, display the preset private image or the current target image captured by the camera according to the target image display policy.
  • the current target image may be a conference image captured by a camera when the video conference device is in a panorama mode or a wide-angle mode.
  • step S40 it also includes:
  • the preset image segmentation model can be configured to perform image segmentation on the target image captured by the camera, extract the person image in the target image, and mark it as a segmented image.
  • the current target image is an expanded view obtained after expanding the target image collected by the camera.
  • the expansion point needs to be determined first.
  • the determination of the expansion point position can be Determining the segmentation initial point of the current conference target image based on the camera orientation; segmenting the current conference target image through a preset image segmentation model based on the segmentation initial point and preset direction to obtain a segmented image.
  • the participants who speak during the video conference can also be marked, so that the user can more intuitively understand who is Speaking, can have a better experience.
  • step S40 it also includes:
  • the preset mouth shape detection model is configured to detect the mouth movement of the participant in the image of the current participant.
  • the mouth will move, thereby judging the participant who is speaking
  • the target since the target may not emit a sound signal, but detects the presence of mouth movement, the initial speaker information can be obtained through the preset mouth detection model.
  • the accurate target speaker can be determined by using the sound signal obtained by the microphone to predict the direction through the preset direction prediction model and the initial speaker information, so as to mark the image of the target speaker.
  • the marking of the target speaker can be by increasing the brightness of the image of the target speaker in the display image, marking different colors, etc., or by other marking methods with the same or similar marking functions, which are not covered in this embodiment. Specific restrictions.
  • the target image display mode is determined according to the meeting request instruction, and the current meeting target information is obtained
  • the lens orientation is determined according to the target image display mode
  • the camera orientation is adjusted according to the lens orientation.
  • Camera orientation based on the lens orientation and the current meeting target information to determine the target image display strategy, when the lens orientation is rotated to the lens orientation, according to the target image display strategy, the preset privacy image or the current image captured by the camera The target image is displayed.
  • the target image display mode is determined through the meeting request command input by the user, and different camera orientations are adjusted according to different target image display modes.
  • Select the appropriate target image display strategy and may choose different image display methods according to the different target information of the participants, avoiding that it can only be used in a single application scenario during a video session, and cannot be used according to the number of participants and the conference mode choose to use different display methods, technical issues with poor user experience.
  • FIG. 4 is a schematic flowchart of a second embodiment of an image display method of the present application.
  • the step S30 includes:
  • Step S301 Responding to the fact that the camera orientation is upward, acquire the number of conference participants and the degree of conference participation in the current conference participant information.
  • the degree of meeting participation can be calculated by judging the total number of sound signals collected by the microphone array of the video conferencing equipment, the source of the sound signals, etc., and then processing the sound signals to obtain the degree of meeting participation can be the sound signal The greater the total number of , the higher the degree of conference participation, and the greater the number of sources of sound signals, the higher the degree of conference participation.
  • Step S302 Determine a target image panorama display strategy according to the camera orientation, the number of conference participants and the conference participation degree.
  • the target image panorama display strategy includes four target image display strategies: 1. VIP image display strategy; 2. Moderator image display strategy; 3. Multi-person conversation image display strategy; 4. , Alternate speech image display strategy; it is necessary to select an appropriate target image display strategy according to the specific number of participants and the degree of participation in the meeting.
  • step S301 including:
  • the degree of meeting participation does not exceed a first preset value, and when the number of meeting participants is less than a second preset value, according to the meeting request instruction, the number of meeting participants and the meeting participation degree Determine moderator image display strategy;
  • the moderator information may be a long-term display target image, and the moderator information may be the conference moderator, the organizer, or a person who needs to be permanently displayed, or the person directly in front of the video conferencing device
  • the moderator information can be manually switched by adjusting the placement of the video conferencing equipment, or can be switched through the buttons on the video conference equipment, remote control, mobile APP, etc. This implementation Examples are not specifically limited.
  • the target image display strategy is the VIP image display strategy; when the conference participation does not exceed the first preset value, and when the number of participants is less than the second preset value, the target image
  • the display strategy is the moderator's image display strategy, and the first preset value and the second preset value may be a value preset by the user, which is not specifically limited in this embodiment.
  • a VIP position will be set.
  • the VIP position will be fixed in the upper left corner, and other numbers and positions may not be fixed.
  • the number of non-VIP positions is 5 , when it is detected that someone is speaking, the target image corresponding to the speaking participant target is extracted to several other positions through detection, and will be clockwise in the five frames of the non-VIP position starting from the VIP position.
  • the degree of participation in the meeting and the number of participants change during the video conference, it can also be detected through the preset humanoid detection model and the preset face detection model, and then re-according to different meeting participation and the number of participants Change target image display strategy.
  • the video conferencing device detects the information of the host, it confirms that A is the host.
  • the degree of participation in the meeting exceeds the first preset value, And when the number of participants is greater than the second preset value, the target image display strategy is the VIP image display strategy. Therefore, the video image display positions of six people are as shown in FIG. 5 .
  • the target image display strategy is the moderator image display strategy, so , the display positions of the video images of the six people are shown in Figure 6.
  • step S302 also includes:
  • an alternate speaking image display strategy is determined according to the conference request instruction, the number of conference participants, and the degree of conference participation.
  • the obtained target images of the participants will be integrated to obtain a panorama containing all the target images of the participants, and the panorama will be fixed on the display image.
  • the panorama will be fixed on the display image.
  • At the lower end there will be three image frames fixed on the upper end of the display image. These three image frames will be adjusted to display the target image when it is detected that there are participants speaking. If the number of participants in the panorama is not enough for three people, On the upper part of the display image, only the image frame corresponding to the number of people is displayed, and the part with insufficient number of people displays a black frame.
  • the order of these three image frames is fixed, for example: currently there are A, B, C, D, E, F six people, then display which image frame is on the top of the image in the following 10 order: ABC, ABD, ABE, ABF, ACD, ACE, ACF, ADE, ADF, AEF.
  • the screen displaying the image has only one portrait frame, showing one person;
  • the screen displaying the image is divided into left and right, and 2 people are displayed;
  • the screen displaying the image is divided into 4 equal parts according to top, bottom, left and right, and 3 people are displayed, but the screen in the lower right corner is displayed on a black screen;
  • the screen displaying the image is divided into 4 equal parts according to top, bottom, left and right, and 4 people are displayed;
  • the screen displaying the image is divided into 9 equal parts according to the top, bottom, left and right, and 7 people are displayed, but the 2 screens in the lower right corner are displayed on a black screen;
  • the screen displaying the image is divided into 9 equal parts according to the top, bottom, left and right, and 8 people are displayed, but the screen in the lower right corner is displayed on a black screen;
  • the screen displaying the image is divided into 9 equal parts according to top, bottom, left and right, and 9 people are displayed;
  • the screen shows only 9 people, and the initially displayed people are displayed clockwise according to the first person who spoke, and when the 10th person speaks, replace the last person who spoke;
  • the video conferencing device when it does not detect the moderator information, it judges the quantitative relationship between the number of conference participants and the second preset value, and when the number of conference participants does not exceed the second preset value, the target The image display strategy is a multi-person conversation image display strategy; when the number of participants in the meeting exceeds a second preset value, the target image display strategy is an alternate speaking image display strategy.
  • the target image display strategy is a multi-person conversation image display strategy, showing The image is shown in Figure 7.
  • the upper end of the previously displayed image is A, D, and F, and it is detected that C is speaking at this time, then according to the switching rule, the upper end of the displayed image will be switched to A, D, and F.
  • C, D three target images of participants.
  • the target image display strategy is an alternate speech image display strategy, and the images are displayed according to the cutting rule as shown in FIG. 8 .
  • the target image display mode is determined according to the meeting request instruction, and the current meeting target information is obtained
  • the lens orientation is determined according to the target image display mode, and the camera orientation is adjusted according to the lens orientation.
  • Camera orientation if the camera orientation is upward, then obtain the number of participants and the degree of participation in the meeting in the current meeting target information, according to the camera orientation, the number of participants and the conference participation
  • the target image panorama display strategy is determined according to the degree, and when the lens orientation is rotated to the lens orientation, the preset privacy image or the current target image captured by the camera is displayed according to the target image display strategy.
  • the target image display mode is determined through the meeting request command input by the user, and different camera orientations are adjusted according to different target image display modes, which is applicable to image display in multiple scenarios.
  • the combination of the number and the degree of meeting participation selects an appropriate target image panorama display strategy, and may select different image display methods according to the different target information of the participants, so as to avoid that it can only be used in a single application scenario when conducting a video session. It is impossible to choose different display methods according to the number of participants and the conference mode, and the technical problem is that the user experience is poor.
  • FIG. 9 is a schematic flowchart of a third embodiment of an image display method of the present application.
  • the step S30 includes:
  • Step S301A In response to the camera orientation being forward facing, determine a target image wide-angle display strategy according to the camera orientation and the current conference participant information.
  • the target display mode is the wide-angle mode, that is, when the camera orientation is forward, then the target image wide-angle display strategy is determined according to the lens orientation and the current conference participant information.
  • the lens of the video conferencing device adopts an image that can capture a wide-angle range of 220°, and the larger the angle on both sides of the lens, the greater the distortion of the image , in order to have a good image display effect, the angles on both sides of the obtained 220° image can be cut off by 20° to achieve the effect of a 180° image.
  • the existing four people A, B, C, and D are divided into four people located in front of the video conferencing equipment. D and the two people will be farther away from the lens, so the portraits of B and C displayed on the monitor are larger than those of A and D, so as to distinguish the distance relationship between the participants and the video conferencing equipment, and the final display image Refer to Figure 11.
  • the portrait of A will be enlarged accordingly and placed in the center; when the three people in ACD leave, only B is left sitting in the original position, because B itself is at the side , so when zooming in, it will give priority to keeping B in the display frame, and finally the position of B will be zoomed in to a certain extent, and will be a little to the left;
  • the image of AD is enlarged in the same proportion and placed in the center; when there are only BC on the entire conference table, the portraits of B and C are detected and enlarged in the same proportion, but because the sitting positions of BC and BC are different Move to the side, so the magnification ratio of BC is smaller than that of AD; when there are only two BDs on the entire conference table, after detecting B and D, the portrait and other scenes in the screen will be enlarged in the same ratio, because B is closer and closer to the lens Move to the side, so when zooming in on the same scale,
  • the target image display mode is determined according to the meeting request instruction, and the current meeting target information is obtained
  • the lens orientation is determined according to the target image display mode, and the camera orientation is adjusted according to the lens orientation.
  • the lens orientation if the lens orientation is the lens facing forward, then determine the wide-angle display strategy of the target image according to the lens orientation and the current participant information, and when the lens orientation rotates to the lens orientation, set the target image wide-angle display strategy according to the target image. Set a private image or the current target image acquired by the camera for display.
  • the wide-angle display mode of the target image is determined through the meeting request command input by the user, and the camera orientation is adjusted to the front of the camera according to the wide-angle display mode of the target image, which is suitable for image display in multiple scenarios. It is possible to choose a different image display method according to the different target information of the participants, so as to avoid that the video session can only be used in a single application scenario, and the conference cannot be used according to the number of participants. Mode selection uses different display methods, technical issues with poor user experience.
  • FIG. 12 is a schematic flowchart of a fourth embodiment of an image display method of the present application.
  • the step S30 includes:
  • Step S301B In response to the camera orientation being downward, determine a target image privacy display policy according to the lens orientation.
  • the target display mode is the privacy mode, that is, when the camera orientation is the lens downward, then the target image privacy display policy is determined according to the lens orientation.
  • this mode is mainly to protect the privacy of the conference. When there is something to discuss in the local area and you don’t want the other party to see the local screen and hear the local discussion sound, without closing the ongoing video call conference, It can be implemented in this manner.
  • the privacy mode can also be entered into the privacy mode from the panoramic mode or the wide-angle mode.
  • the device controller when the current target image display mode is panoramic mode, one party may need to conduct internal discussions and temporarily turn off the camera.
  • the device controller receives the display prohibition instruction, it controls the camera to rotate according to the display prohibition instruction, and controls the sensor to detect the current lens orientation; microphone.
  • the user when the current target image display mode is the wide-angle mode, the user sends an instruction by pressing a button of the video conferencing device or a remote control, etc., and when the video conferencing device controller receives the display prohibition instruction, it controls the display according to the display prohibition instruction.
  • the camera is rotated and the sensor is controlled to detect the current lens orientation. If the current lens orientation is downward, a preset privacy image is displayed and the microphone is turned off.
  • the video conference needs to be continued.
  • the user sends an instruction by pressing a button of the video conference device or a remote control, and the controller of the video conference device receives the display start instruction.
  • the display opening command controls the rotation of the camera, and when the lens restoration position is rotated to the lens position corresponding to the target image display strategy, the microphone is turned on, and the target image of the current participant is obtained back, and the target image of the current participant is passed through the preset image
  • the segmentation model performs image segmentation to obtain multiple segmented images.
  • the preset privacy image may be one or more images preset by the user, and the preset privacy image is configured to block the current video image and remind other video users that the video is not interrupted.
  • the target image display mode is determined according to the meeting request instruction, and the current meeting target information is obtained
  • the lens orientation is determined according to the target image display mode, and the camera orientation is adjusted according to the lens orientation.
  • Camera orientation if the lens orientation is facing downward, then determine the target image privacy display policy according to the lens orientation, and when the lens orientation rotates to the lens orientation, set the preset privacy image according to the target image privacy display policy to show.
  • the target image privacy display mode is determined through the meeting request command input by the user, and the camera orientation is adjusted to the lens downward according to the target image privacy display mode, which is suitable for image display in multiple scenarios, according to the camera orientation and current meeting target information It is possible to choose a different image display method according to the different target information of the participants, so as to avoid that the video session can only be used in a single application scenario, and the conference cannot be used according to the number of participants. Mode selection uses different display methods, technical issues with poor user experience.
  • the embodiment of the present application also proposes a storage medium, on which an image display program is stored, and when the image display program is executed by a processor, the steps of the above-mentioned image display method are realized.
  • the storage medium adopts all the technical solutions of all the above-mentioned embodiments, it at least has all the beneficial effects brought by the technical solutions of the above-mentioned embodiments, which will not be repeated here.
  • FIG. 13 is a structural block diagram of the first embodiment of the image display device of the present application.
  • the image display device proposed in the embodiment of the present application includes:
  • the instruction receiving module 10 is configured to receive a meeting request instruction, determine a target image display mode according to the meeting request instruction, and acquire current meeting target information.
  • the lens adjustment module 20 is configured to determine the orientation of the lens according to the target image display mode, and adjust the orientation of the lens according to the orientation of the lens.
  • the policy confirmation module 30 is configured to determine a target image display policy based on the camera orientation and the current meeting target information.
  • the image display module 40 is configured to display a preset private image or a current target image captured by a camera according to the target image display policy in response to the camera turning to the lens orientation.
  • the policy confirmation module 30 is further configured to obtain the number of conference participants and the degree of conference participation in the current conference participant information in response to the camera orientation being upward; according to the The target image panorama display strategy is determined by the lens orientation, the number of participants in the meeting and the degree of participation in the meeting; when the lens orientation is rotated to the lens orientation, the preset privacy image or the current image acquired by the camera is determined according to the target image display strategy.
  • the displaying of the target image further includes: displaying the current target image acquired by the camera according to the target image panorama display strategy when the lens orientation is rotated to the lens orientation.
  • the policy confirmation module 30 is further configured to determine the wide-angle display strategy of the target image according to the lens orientation and the current participant information in response to the lens orientation being the lens facing forward;
  • displaying the preset privacy image or the current target image acquired by the camera according to the target image display strategy also includes: when the lens orientation is rotated to the lens orientation, displaying the camera according to the target image wide-angle display strategy The obtained current target image is displayed.
  • the image display module 30 is further configured to determine a target image privacy display strategy according to the lens orientation in response to the lens orientation being downward; when the lens orientation is rotated to the lens orientation , displaying a preset privacy image or a current target image acquired by a camera according to the target image display strategy, and further comprising: displaying a preset privacy image according to the target image privacy display strategy when the lens orientation is rotated to the lens orientation , and turn off the microphone.
  • the image display module 40 is further configured to obtain the image of the current conference target through the camera, and turn on the microphone; perform image segmentation on the current conference target image through a preset image segmentation model to obtain the Segmenting the image; performing image processing on the segmented image to obtain the current target image.
  • the policy confirmation module 40 is further configured to determine the initial segmentation point of the current conference target image based on the camera orientation; The image is segmented through a preset image segmentation model to obtain a segmented image.
  • the image display module 40 is further configured to perform mouth shape detection on the current target image through a preset mouth shape detection model to obtain an initial speaker image; obtain the current sound signal through a microphone, and Determine speaker information based on the current sound signal; determine a target speaker image according to the speaker information and the initial speaker image, and mark the target speaker image.
  • the target image display mode is determined according to the meeting request instruction, and the current meeting target information is obtained
  • the lens orientation is determined according to the target image display mode
  • the camera orientation is adjusted according to the lens orientation.
  • Camera orientation based on the lens orientation and the current meeting target information to determine the target image display strategy, when the lens orientation is rotated to the lens orientation, according to the target image display strategy, the preset privacy image or the current image captured by the camera The target image is displayed.
  • the target image display mode is determined through the meeting request command input by the user, and different camera orientations are adjusted according to different target image display modes.
  • Select the appropriate target image display strategy and may choose different image display methods according to the different target information of the participants, avoiding that it can only be used in a single application scenario during a video session, and cannot be used according to the number of participants and the conference mode choose to use different display methods, technical issues with poor user experience.

Abstract

Disclosed in the present application are an image display method, apparatus and device, and a storage medium. In the present application, a target image display mode is determined by means of a conference request instruction that is input by a user, and different lens directions are adjusted according to different target image display modes, so as to be suitable for image presentation in a plurality of scenarios. An appropriate target image display strategy is selected according to a combination of the lens directions and the current conference participation target information; and different image display manners can be selected according to different pieces of conference participation target information, thereby avoiding the problem of a product only being able to be used in a single application scenario during video conferencing, and different display manners being unable to be selected and used according to the number of conference participants and conference modes.

Description

图像显示方法、装置、设备及存储介质Image display method, device, equipment and storage medium
本申请要求于2021年9月7号申请的、申请号为202111047995.3的中国专利申请的优先权,其全部内容通过引用结合于此。This application claims priority to a Chinese patent application with application number 202111047995.3 filed on September 7, 2021, the entire contents of which are hereby incorporated by reference.
技术领域technical field
本申请涉及视频显示技术领域,尤其涉及一种图像显示方法、装置、设备及存储介质。The present application relates to the technical field of video display, and in particular to an image display method, device, equipment and storage medium.
背景技术Background technique
随着科学技术的发展,远程办公、远程会议越来越受人们的欢迎,沟通也已经超越了时间、空间的限制。人们对会议沟通产品的的功能需求也越来越多,对产品的性能要求也越来越高。因此,诞生了很多的音视频会议办公产品,有固定对焦的视频产品,有自动对焦的视频产品,有转动的云台视频产品,往往这些产品都是比较简单的,而且只是一个拍摄装置,在进行视频会话时只能够在单一应用场景下进行使用,无法根据参会的人数,会议模式选择使用不同的显示方式,用户体验较差。With the development of science and technology, telecommuting and teleconferencing are becoming more and more popular, and communication has surpassed the limitations of time and space. People have more and more functional requirements for conference communication products, and the performance requirements of products are also getting higher and higher. Therefore, many audio and video conference office products have been born, including fixed-focus video products, auto-focus video products, and rotating pan-tilt video products. Often these products are relatively simple, and they are just a shooting device. When conducting a video session, it can only be used in a single application scenario, and different display methods cannot be selected according to the number of participants and the conference mode, and the user experience is poor.
上述内容仅用于辅助理解本申请的技术方案,并不代表承认上述内容是现有技术。The above content is only used to assist in understanding the technical solution of the present application, and does not mean that the above content is admitted as prior art.
技术问题technical problem
本申请的主要目的在于提供一种图像显示方法、装置、设备及存储介质,旨在解决现有技术在进行视频会话时只能够在单一应用场景下进行使用,且无法根据参会的人数和会议模式选择使用不同的显示方式,用户体验较差的技术问题。The main purpose of this application is to provide an image display method, device, device, and storage medium, aiming to solve the problem that the existing technology can only be used in a single application scenario when conducting a video conversation, and cannot be used according to the number of participants and the meeting Mode selection uses different display methods, technical issues with poor user experience.
技术解决方案technical solution
为实现上述目的,本申请提供了一种图像显示方法,所述方法包括以下步骤:In order to achieve the above object, the application provides an image display method, the method comprising the following steps:
接收到会议请求指令,根据所述会议请求指令确定目标图像显示模式,并获取当前参会目标信息;Receiving the meeting request instruction, determining the target image display mode according to the meeting request instruction, and obtaining the current meeting target information;
根据所述目标图像显示模式确定镜头方位,并根据所述镜头方位调整镜头朝向;determining the lens orientation according to the target image display mode, and adjusting the lens orientation according to the lens orientation;
基于所述镜头方位与所述当前参会目标信息确定目标图像显示策略;determining a target image display strategy based on the lens orientation and the current meeting target information;
响应于镜头朝向转动至所述镜头方位,根据所述目标图像显示策略对预设隐私图像或摄像头获取的当前目标图像进行展示。In response to the camera turning to the camera orientation, the preset private image or the current target image captured by the camera is displayed according to the target image display strategy.
可选地,所述目标图像显示策略包括目标图像全景显示策略;Optionally, the target image display strategy includes a target image panorama display strategy;
响应于所述镜头方位为镜头朝上,获取所述当前参会目标信息中的参会目标数量与会议参与度;Responding to the fact that the camera orientation is facing upwards, obtain the number of participants and the degree of participation in the meeting in the information about the current meeting participants;
根据所述镜头方位、所述参会目标数量以及所述会议参与度确定目标图像全景显示策略;Determine a target image panorama display strategy according to the lens orientation, the number of participants in the meeting, and the degree of participation in the meeting;
响应于镜头朝向转动至所述镜头方位,根据所述目标图像显示策略对预设隐私图像或摄像头获取的当前目标图像进行展示,还包括:In response to the lens turning to the lens orientation, displaying the preset privacy image or the current target image acquired by the camera according to the target image display strategy, further comprising:
响应于镜头朝向转动至所述镜头方位,根据所述目标图像全景显示策略对摄像头获取的当前目标图像进行展示。In response to the camera turning to the camera orientation, the current target image acquired by the camera is displayed according to the target image panorama display strategy.
可选地,所述目标图像显示策略包括目标图像广角显示策略;Optionally, the target image display strategy includes a target image wide-angle display strategy;
响应于所述镜头方位为镜头朝前,根据所述镜头方位与所述当前参会信息确定目标图像广角显示策略;In response to the camera orientation being forward facing, determine a target image wide-angle display strategy according to the lens orientation and the current conference participant information;
响应于镜头朝向转动至所述镜头方位,根据所述目标图像显示策略对预设隐私图像或摄像头获取的当前目标图像进行展示,还包括:In response to the lens turning to the lens orientation, displaying the preset privacy image or the current target image acquired by the camera according to the target image display strategy, further comprising:
响应于镜头朝向转动至所述镜头方位,根据所述目标图像广角显示策略对摄像头获取的当前目标图像进行展示。In response to the lens turning to the lens orientation, the current target image acquired by the camera is displayed according to the target image wide-angle display strategy.
可选地,所述目标图像显示策略包括目标图像隐私显示策略;Optionally, the target image display policy includes a target image privacy display policy;
响应于所述镜头方位为镜头朝下,根据所述镜头方位确定目标图像隐私显示策略;Responsive to the camera orientation being downward, determining a target image privacy display policy according to the lens orientation;
响应于镜头朝向转动至所述镜头方位,根据所述目标图像显示策略对预设隐私图像或摄像头获取的当前目标图像进行展示,还包括:In response to the lens turning to the lens orientation, displaying the preset privacy image or the current target image acquired by the camera according to the target image display strategy, further comprising:
响应于镜头朝向转动至所述镜头方位,根据所述目标图像隐私显示策略显示预设隐私图像,并关闭麦克风。In response to the camera turning to the camera orientation, displaying a preset privacy image according to the target image privacy display policy, and turning off the microphone.
可选地,所述响应于镜头朝向转动至所述镜头方位,根据所述目标图像显示策略对预设隐私图像或摄像头获取的当前目标图像进行展示之前,包括:Optionally, before displaying the preset privacy image or the current target image acquired by the camera according to the target image display strategy in response to the lens orientation turning to the lens orientation, the method includes:
通过摄像头获取当前参会目标图像,并开启麦克风;Obtain the image of the current participant target through the camera, and turn on the microphone;
将所述当前参会目标图像通过预设图像分割模型进行图像分割,获得已分割图像;Segmenting the image of the current participant target through a preset image segmentation model to obtain a segmented image;
将所述已分割图像进行图像处理,获得当前目标图像。Perform image processing on the segmented image to obtain the current target image.
可选地,所述将所述当前参会目标图像通过预设图像分割模型进行图像分割,获得已分割图像,包括:Optionally, performing image segmentation on the current conference target image through a preset image segmentation model to obtain a segmented image includes:
基于镜头方位确定所述当前参会目标图像的分割初始点;Determining the initial segmentation point of the current participant target image based on the camera orientation;
基于所述分割初始点与预设方向将所述当前参会目标图像通过预设图像分割模型进行图像分割,获得已分割图像。Segmenting the current meeting target image by using a preset image segmentation model based on the initial segmentation point and the preset direction to obtain a segmented image.
可选地,所述响应于镜头朝向转动至所述镜头方位,根据所述目标图像显示策略对预设隐私图像或摄像头获取的当前目标图像进行展示之后,还包括:Optionally, after displaying the preset privacy image or the current target image captured by the camera according to the target image display strategy in response to the lens orientation turning to the lens orientation, the method further includes:
对所述当前目标图像通过预设嘴型检测模型进行嘴型检测,获得初始发言人图像;performing mouth shape detection on the current target image through a preset mouth shape detection model to obtain an initial speaker image;
通过麦克风获取当前声音信号,根据所述当前声音信号确定发言人信息;Obtaining a current sound signal through a microphone, and determining speaker information according to the current sound signal;
根据所述发言人信息以及所述初始发言人图像确定目标发言人图像,并标记所述目标发言人图像。Determine a target speaker image according to the speaker information and the initial speaker image, and mark the target speaker image.
此外,为实现上述目的,本申请还提出一种图像显示装置,所述图像显示装置包括:In addition, in order to achieve the above purpose, the present application also proposes an image display device, the image display device includes:
指令接收模块,被配置为接收会议请求指令,根据所述会议请求指令确定目标图像显示模式,并获取当前参会目标信息;The instruction receiving module is configured to receive the meeting request instruction, determine the target image display mode according to the meeting request instruction, and acquire the current meeting target information;
镜头调整模块,被配置为根据所述目标图像显示模式确定镜头方位,并根据所述镜头方位调整镜头朝向;The lens adjustment module is configured to determine the orientation of the lens according to the target image display mode, and adjust the orientation of the lens according to the orientation of the lens;
策略确认模块,被配置为基于所述镜头方位与所述当前参会目标信息确定目标图像显示策略;A strategy confirmation module, configured to determine a target image display strategy based on the camera orientation and the current meeting target information;
图像显示模块,被配置为响应于镜头朝向转动至所述镜头方位,根据所述目标图像显示策略对预设隐私图像或摄像头获取的当前目标图像进行展示。The image display module is configured to display a preset private image or a current target image captured by a camera according to the target image display policy in response to the lens turning to the lens orientation.
此外,为实现上述目的,本申请还提出一种图像显示设备,所述图像显示设备包括:存储器、处理器及存储在所述存储器上并可在所述处理器上运行的图像显示程序,所述图像显示程序配置为实现如上文所述的图像显示方法的步骤。In addition, in order to achieve the above purpose, the present application also proposes an image display device, which includes: a memory, a processor, and an image display program stored in the memory and operable on the processor. The above-mentioned image display program is configured to realize the steps of the above-mentioned image display method.
此外,为实现上述目的,本申请还提出一种存储介质,所述存储介质上存储有图像显示程序,所述图像显示程序被处理器执行时实现如上文所述的图像显示方法的步骤。In addition, in order to achieve the above purpose, the present application also proposes a storage medium, on which an image display program is stored, and when the image display program is executed by a processor, the steps of the above-mentioned image display method are realized.
有益效果Beneficial effect
本申请通过在接收到会议请求指令时,根据所述会议请求指令确定目标图像显示模式,并获取当前参会目标信息,根据所述目标图像显示模式确定镜头方位,并根据所述镜头方位调整镜头朝向,基于所述镜头方位与所述当前参会目标信息确定目标图像显示策略,在镜头朝向转动至所述镜头方位时,根据所述目标图像显示策略对预设隐私图像或摄像头获取的当前目标图像进行展示。与现有技术相比,本申请通过用户输入的会议请求指令确定目标图像显示模式,并根据不同的目标图像显示模式调整不同的镜头方位,适用多个场景下的图像展示,根据镜头方位以及当前参会目标信息的结合选择合适的目标图像显示策略,可能根据参会目标信息的不同选择不同的图像显示方式,避免了在进行视频会话时只能够在单一应用场景下进行使用,无法根据参会的人数,会议模式选择使用不同的显示方式,用户体验较差的技术问题。The present application determines the target image display mode according to the conference request command when receiving the conference request command, and obtains the current meeting target information, determines the lens orientation according to the target image display mode, and adjusts the lens according to the lens orientation Orientation: Determine the target image display strategy based on the camera orientation and the current meeting target information. When the lens orientation is rotated to the lens orientation, the preset privacy image or the current target captured by the camera will be displayed according to the target image display strategy. image for display. Compared with the prior art, this application determines the target image display mode through the meeting request command input by the user, and adjusts different lens orientations according to different target image display modes, and is suitable for image display in multiple scenarios. According to the lens orientation and the current Combining the target information of the participants to select the appropriate target image display strategy, it is possible to select different image display methods according to the target information of the participants, avoiding that the video session can only be used in a single application scenario, and cannot be used according to the participants. The number of people, the conference mode selection uses different display methods, and technical issues such as poor user experience.
附图说明Description of drawings
图1是本申请实施例方案涉及的硬件运行环境的图像显示设备的结构示意图;FIG. 1 is a schematic structural diagram of an image display device in a hardware operating environment involved in an embodiment of the present application;
图2为本申请图像显示方法第一实施例的流程示意图;FIG. 2 is a schematic flow chart of the first embodiment of the image display method of the present application;
图3为本申请图像显示方法一实施例的视频会议设备示意图;FIG. 3 is a schematic diagram of a video conferencing device in an embodiment of an image display method of the present application;
图4为本申请图像显示方法第二实施例的流程示意图;FIG. 4 is a schematic flow chart of the second embodiment of the image display method of the present application;
图5为本申请图像显示方法一实施例的VIP图像显示策略示意图;FIG. 5 is a schematic diagram of a VIP image display strategy of an embodiment of the image display method of the present application;
图6为本申请图像显示方法一实施例的主持人图像显示策略示意图;FIG. 6 is a schematic diagram of a moderator image display strategy in an embodiment of the image display method of the present application;
图7为本申请图像显示方法一实施例的多人会话图像显示策略示意图;FIG. 7 is a schematic diagram of a multi-person conversation image display strategy in an embodiment of the image display method of the present application;
图8为本申请图像显示方法一实施例的交替发言图像显示策略示意图;FIG. 8 is a schematic diagram of an alternate speech image display strategy in an embodiment of the image display method of the present application;
图9为本申请图像显示方法第三实施例的流程示意图;FIG. 9 is a schematic flowchart of a third embodiment of the image display method of the present application;
图10为本申请图像显示方法一实施例的广角模式场景示意图;FIG. 10 is a schematic diagram of a wide-angle mode scene of an embodiment of the image display method of the present application;
图11为本申请图像显示方法一实施例的目标图像广角显示策略示意图;FIG. 11 is a schematic diagram of a target image wide-angle display strategy in an embodiment of the image display method of the present application;
图12为本申请图像显示方法第四实施例的流程示意图;FIG. 12 is a schematic flowchart of a fourth embodiment of the image display method of the present application;
图13为本申请图像显示装置第一实施例的结构框图。FIG. 13 is a structural block diagram of the first embodiment of the image display device of the present application.
本申请目的的实现、功能特点及优点将结合实施例,参照附图做进一步说明。The realization, functional features and advantages of the present application will be further described in conjunction with the embodiments and with reference to the accompanying drawings.
本发明的实施方式Embodiments of the present invention
应当理解,此处所描述的具体实施例仅用以解释本申请,并不用于限定本申请。It should be understood that the specific embodiments described here are only used to explain the present application, not to limit the present application.
参照图1,图1为本申请实施例方案涉及的硬件运行环境的图像显示设备结构示意图。Referring to FIG. 1 , FIG. 1 is a schematic structural diagram of an image display device in a hardware operating environment involved in the solution of the embodiment of the present application.
如图1所示,该图像显示设备可以包括:处理器1001,例如中央处理器(Central Processing Unit,CPU),通信总线1002、用户接口1003,网络接口1004,存储器1005。其中,通信总线1002被配置为实现这些组件之间的连接通信。用户接口1003可以包括显示屏(Display)、输入单元比如键盘(Keyboard),用户接口1003可选的还可以包括标准的有线接口、无线接口。网络接口1004可选的可以包括标准的有线接口、无线接口(如无线保真(Wireless-Fidelity,Wi-Fi)接口)。存储器1005可以是高速的随机存取存储器(Random Access Memory,RAM),也可以是稳定的非易失性存储器(Non-Volatile Memory,NVM),例如磁盘存储器。存储器1005可选的还可以是独立于前述处理器1001的存储装置。As shown in FIG. 1 , the image display device may include: a processor 1001 , such as a central processing unit (Central Processing Unit, CPU), a communication bus 1002 , a user interface 1003 , a network interface 1004 , and a memory 1005 . Wherein, the communication bus 1002 is configured to realize connection and communication between these components. The user interface 1003 may include a display screen (Display) and an input unit such as a keyboard (Keyboard). Optionally, the user interface 1003 may also include a standard wired interface and a wireless interface. The network interface 1004 may optionally include a standard wired interface and a wireless interface (such as a Wireless-Fidelity (Wi-Fi) interface). Memory 1005 can be a high-speed random access memory (Random Access Memory, RAM), can also be a stable non-volatile memory (Non-Volatile Memory, NVM), such as disk storage. Optionally, the memory 1005 may also be a storage device independent of the aforementioned processor 1001 .
本领域技术人员可以理解,图1中示出的结构并不构成对图像显示设备的限定,可以包括比图示更多或更少的部件,或者组合某些部件,或者不同的部件布置。Those skilled in the art can understand that the structure shown in FIG. 1 does not constitute a limitation on the image display device, and may include more or less components than those shown in the illustration, or combine some components, or arrange different components.
如图1所示,作为一种存储介质的存储器1005中可以包括操作系统、网络通信模块、用户接口模块以及图像显示程序。As shown in FIG. 1 , the memory 1005 as a storage medium may include an operating system, a network communication module, a user interface module, and an image display program.
在图1所示的图像显示设备中,网络接口1004主要被配置为与网络服务器进行数据通信;用户接口1003主要被配置为与用户进行数据交互;本申请图像显示设备中的处理器1001、存储器1005可以设置在图像显示设备中,所述图像显示设备通过处理器1001调用存储器1005中存储的图像显示程序,并执行本申请实施例提供的图像显示方法。In the image display device shown in Figure 1, the network interface 1004 is mainly configured to communicate data with the network server; the user interface 1003 is mainly configured to perform data interaction with the user; the processor 1001, memory 1005 may be set in the image display device, and the image display device calls the image display program stored in the memory 1005 through the processor 1001, and executes the image display method provided in the embodiment of the present application.
本申请实施例提供了一种图像显示方法,参照图2,图2为本申请一种图像显示方法第一实施例的流程示意图。An embodiment of the present application provides an image display method. Referring to FIG. 2 , FIG. 2 is a schematic flowchart of a first embodiment of an image display method of the present application.
本实施例中,所述图像显示方法包括以下步骤:In this embodiment, the image display method includes the following steps:
步骤S10:接收到会议请求指令,根据所述会议请求指令确定目标图像显示模式,并获取当前参会目标信息。Step S10: receiving a meeting request instruction, determining a target image display mode according to the meeting request instruction, and acquiring current meeting target information.
需要说明的是,本实施例的执行主体可以是图像显示设备,其中图像显示设备可以是视频会议设备的控制器,例如:个人电脑、控制芯片等,还可以是其他可以进行视频会议的设备控制器,本实施例不做具体限制。It should be noted that the execution subject of this embodiment may be an image display device, wherein the image display device may be a controller of a video conferencing device, such as a personal computer, a control chip, etc., or other devices capable of video conferencing. device, which is not specifically limited in this embodiment.
可理解的是,会议请求指令可以是用户通过视频会议设备的按钮、遥控器或者手机APP等输入的控制视频会议设备开始运行的请求指令,所述会议请求指令中可以包含有会议初始模式的开启信号,在本实施例中,对于视频会议设备的选择,参考图3,将图3所示的视频会议设备为例进行说明。It can be understood that the meeting request command may be a request command input by the user through the button of the video conference device, the remote control or the mobile APP to control the operation of the video conference device, and the meeting request command may include the start of the conference initial mode signal, in this embodiment, for the selection of the video conferencing device, refer to FIG. 3 , and describe the video conferencing device shown in FIG. 3 as an example.
需要说明的是,本实施例中,所述视频会议设备有五个模块构成:1、麦克风阵列模块;2、马达及驱动模块;3、镜头模块;4、传感器模块;5、控制器;所述麦克风阵列模块被配置为采集用户在进行视频会议时的声音信号,并将所述声音信号发送至控制器进行声音方位的检测,为了使得采集到的声音信号更完整清晰,所述麦克风阵列可以是多麦阵列,例如:4麦阵列、6麦阵列以及8麦阵列等,本实施例不做具体限制。It should be noted that, in this embodiment, the video conferencing device has five modules: 1. Microphone array module; 2. Motor and drive module; 3. Lens module; 4. Sensor module; 5. Controller; The microphone array module is configured to collect the sound signal of the user during the video conference, and send the sound signal to the controller to detect the sound direction. In order to make the collected sound signal more complete and clear, the microphone array can It is a multi-microphone array, for example: 4-microphone array, 6-microphone array, and 8-microphone array, etc., which is not specifically limited in this embodiment.
其中,马达及驱动模块被配置为根据控制器发出的信号进行转动,使得镜头能够进行旋转,实现镜头方位的调整,其马达及驱动模块进行转动的过程中,不限制其转动的方向,即在进行镜头方位调整时,马达及驱动模块可以是顺时针旋转也可以是逆时针旋转,本实施例不做具体限制。Wherein, the motor and the driving module are configured to rotate according to the signal sent by the controller, so that the lens can be rotated to realize the adjustment of the lens orientation. During the rotation of the motor and the driving module, the direction of rotation is not limited, that is, in When adjusting the orientation of the lens, the motor and the driving module may rotate clockwise or counterclockwise, which is not specifically limited in this embodiment.
在本实施例中,镜头模块中镜头的选择可以是能够实现220°角度图像采集的图像采集镜头,也可以是其他可实现相同或者相似功能的镜头;传感器模块被配置为检测马达及驱动模块控制镜头进行旋转,检测镜头方位,以完成旋转操作。In this embodiment, the selection of the lens in the lens module can be an image acquisition lens that can realize image acquisition at an angle of 220°, or other lenses that can achieve the same or similar functions; the sensor module is configured to detect motors and drive module control The lens is rotated, and the orientation of the lens is detected to complete the rotation operation.
需要说明的是,目标图像显示模式可以是全景模式、广角模式以及隐私模式等,根据用户输入的会议请求指令,视频会议设备控制器可以启动对应的目标图像显示模式。It should be noted that the target image display mode may be panorama mode, wide-angle mode, privacy mode, etc., and according to the conference request instruction input by the user, the video conferencing device controller may activate the corresponding target image display mode.
可理解的是,当前参会目标信息可以是当前参与会议的人员数量信息、人脸信息以及会议参与度等信息。It is understandable that the current conference participation target information may be information such as the number of people currently participating in the conference, face information, and conference participation.
步骤S20:根据所述目标图像显示模式确定镜头方位,并根据所述镜头方位调整镜头朝向。Step S20: Determine the orientation of the lens according to the target image display mode, and adjust the orientation of the lens according to the orientation of the lens.
需要说明的是,不同的目标图像显示模式所对应的镜头方位不同,在本实施例中,参考图3:1表示全景模式,所述全景模式对应的镜头方位为镜头朝上;2表示广角模式,所述广角模式对应的镜头方位为镜头朝前;3表示隐私模式,所述隐私模式对应的镜头方位为镜头朝下。It should be noted that different target image display modes correspond to different lens orientations. In this embodiment, refer to FIG. 3: 1 indicates a panoramic mode, and the lens orientation corresponding to the panoramic mode is upward; 2 indicates a wide-angle mode , the lens orientation corresponding to the wide-angle mode is facing forward; 3 indicates the privacy mode, and the lens orientation corresponding to the privacy mode is facing downward.
在具体实现中,根据目标图像显示模式确定镜头方位后,根据马达及驱动模块控制镜头进行旋转,以使镜头朝向旋转到根据用户输入的会议请求指令确定的目标图像显示模式对应的镜头方位。In a specific implementation, after the lens orientation is determined according to the target image display mode, the lens is controlled to rotate according to the motor and the driving module, so that the lens orientation is rotated to the lens orientation corresponding to the target image display mode determined according to the conference request input by the user.
步骤S30:基于所述镜头方位与所述当前参会目标信息确定目标图像显示策略。Step S30: Determine a target image display strategy based on the camera orientation and the current meeting target information.
需要说明的是,在本实施例中,图像显示策略可以是全景模式下的多人会话图像显示策略、主持人图像显示策略、VIP图像显示策略以及交替发言图像显示策略等,还可以是广角模式下的正常图像显示策略等,本实施例中根据镜头方位以及当前参会目标信息确定具体的目标图像显示策略。It should be noted that, in this embodiment, the image display strategy may be the multi-person conversation image display strategy in the panorama mode, the host image display strategy, the VIP image display strategy, and the alternate speech image display strategy, etc., and may also be the wide-angle mode In this embodiment, the specific target image display strategy is determined according to the camera orientation and the current meeting target information.
可理解的是,在确定目标图像显示策略之前,通过图像显示模式确定镜头方位,因此在本实施例中根据三种不同的目标图像显示模式,视频会议设备的镜头有三种朝向,分别为:镜头向上、镜头向前以及镜头向下。It can be understood that before determining the target image display strategy, the lens orientation is determined by the image display mode. Therefore, in this embodiment, according to three different target image display modes, the lens of the video conferencing device has three orientations, which are: lens Up, Camera Forward, and Camera Down.
步骤S40:响应于镜头朝向转动至所述镜头方位,根据所述目标图像显示策略对预设隐私图像或摄像头获取的当前目标图像进行展示。Step S40: In response to the camera turning to the camera orientation, display the preset private image or the current target image captured by the camera according to the target image display policy.
需要说明的是,当前目标图像可以是视频会议设备处于全景模式或者广角模式时,通过摄像头采集到的会议图像。It should be noted that the current target image may be a conference image captured by a camera when the video conference device is in a panorama mode or a wide-angle mode.
进一步地,为了摄像头采集到的图像更适于进行显示,步骤S40之前,还包括:Further, in order to make the image collected by the camera more suitable for display, before step S40, it also includes:
通过摄像头获取当前参会目标图像,并开启麦克风;Obtain the image of the current participant target through the camera, and turn on the microphone;
将所述当前参会目标图像通过预设图像分割模型进行图像分割,获得已分割图像;Segmenting the image of the current participant target through a preset image segmentation model to obtain a segmented image;
将所述已分割图像进行图像处理,获得当前目标图像。Perform image processing on the segmented image to obtain the current target image.
需要说明的是,预设图像分割模型可以被配置为对摄像头采集到的目标图像进行图像分割,提取出目标图像中的人物图像,标记为已分割图像。It should be noted that the preset image segmentation model can be configured to perform image segmentation on the target image captured by the camera, extract the person image in the target image, and mark it as a segmented image.
可理解的是,当前目标图像是根据摄像头采集到的目标图像进行展开后获得的展开图,在进行展开的过程中,需要先确定展开点,在本实施例中,展开点位置的确定可以是通过基于镜头方位确定所述当前参会目标图像的分割初始点;基于所述分割初始点与预设方向将所述当前参会目标图像通过预设图像分割模型进行图像分割,获得已分割图像。It can be understood that the current target image is an expanded view obtained after expanding the target image collected by the camera. During the expansion process, the expansion point needs to be determined first. In this embodiment, the determination of the expansion point position can be Determining the segmentation initial point of the current conference target image based on the camera orientation; segmenting the current conference target image through a preset image segmentation model based on the segmentation initial point and preset direction to obtain a segmented image.
在具体实现中,在对摄像头采集到的当前参会目标图像进行分割的过程中,还需要根据预设人脸检测模型以及人形检测模型对所述当前参会目标图像进行检测,避免将完整的人裁切成一半,造成目标图像的不完整。In the specific implementation, in the process of segmenting the current meeting target image collected by the camera, it is also necessary to detect the current meeting target image according to the preset face detection model and human figure detection model, so as to avoid the complete The person is cut in half, resulting in incompleteness of the target image.
此外,在根据所述目标图像显示策略对预设隐私图像或摄像头获取的当前目标图像进行展示之后,还可以标记出在进行视频会议时进行发言的参会人员,让用户更直观的了解谁在发言,能够有更好的体验。In addition, after displaying the preset privacy image or the current target image captured by the camera according to the target image display policy, the participants who speak during the video conference can also be marked, so that the user can more intuitively understand who is Speaking, can have a better experience.
进一步地,为了标记发言的参会目标,步骤S40之后,还包括:Further, in order to mark the participant target of speaking, after step S40, it also includes:
对所述当前目标图像通过预设嘴型检测模型进行嘴型检测,获得初始发言人图像;performing mouth shape detection on the current target image through a preset mouth shape detection model to obtain an initial speaker image;
通过麦克风获取当前声音信号,根据所述当前声音信号确定发言人信息;Obtaining a current sound signal through a microphone, and determining speaker information according to the current sound signal;
根据所述发言人信息以及所述初始发言人图像确定目标发言人图像,并标记所述目标发言人图像。Determine a target speaker image according to the speaker information and the initial speaker image, and mark the target speaker image.
需要说明的是,预设嘴型检测模型被配置为检测当前参会目标图像中的参会目标的嘴部运动,当参会目标发言时,嘴部会发生运动,从而判断正在进行发言的参会目标,同时由于可能参会目标没有发出声音信号,但是检测到存在嘴部运动,所以可以通过预设嘴部检测模型获得初始发言人信息。It should be noted that the preset mouth shape detection model is configured to detect the mouth movement of the participant in the image of the current participant. When the participant speaks, the mouth will move, thereby judging the participant who is speaking At the same time, since the target may not emit a sound signal, but detects the presence of mouth movement, the initial speaker information can be obtained through the preset mouth detection model.
可理解的是,将麦克风获得的声音信号通过预设方位预测模型进行方位预测以及初始发言人信息就可以确定准确的目标发言人,从而对目标发言人图像进行标记。It can be understood that the accurate target speaker can be determined by using the sound signal obtained by the microphone to predict the direction through the preset direction prediction model and the initial speaker information, so as to mark the image of the target speaker.
值得说明的是,对于目标发言人的标记可以是将展示图像中的目标发言人图像提高亮度、标记不同的颜色等,还可以是其他具有相同或者相似标记功能的标记方法,本实施例不做具体限制。It is worth noting that the marking of the target speaker can be by increasing the brightness of the image of the target speaker in the display image, marking different colors, etc., or by other marking methods with the same or similar marking functions, which are not covered in this embodiment. Specific restrictions.
本实施例通过在接收到会议请求指令时,根据所述会议请求指令确定目标图像显示模式,并获取当前参会目标信息,根据所述目标图像显示模式确定镜头方位,并根据所述镜头方位调整镜头朝向,基于所述镜头方位与所述当前参会目标信息确定目标图像显示策略,在镜头朝向转动至所述镜头方位时,根据所述目标图像显示策略对预设隐私图像或摄像头获取的当前目标图像进行展示。本实施例通过用户输入的会议请求指令确定目标图像显示模式,并根据不同的目标图像显示模式调整不同的镜头方位,适用多个场景下的图像展示,根据镜头方位以及当前参会目标信息的结合选择合适的目标图像显示策略,可能根据参会目标信息的不同选择不同的图像显示方式,避免了在进行视频会话时只能够在单一应用场景下进行使用,且无法根据参会的人数和会议模式选择使用不同的显示方式,用户体验较差的技术问题。In this embodiment, when a meeting request instruction is received, the target image display mode is determined according to the meeting request instruction, and the current meeting target information is obtained, the lens orientation is determined according to the target image display mode, and the camera orientation is adjusted according to the lens orientation. Camera orientation, based on the lens orientation and the current meeting target information to determine the target image display strategy, when the lens orientation is rotated to the lens orientation, according to the target image display strategy, the preset privacy image or the current image captured by the camera The target image is displayed. In this embodiment, the target image display mode is determined through the meeting request command input by the user, and different camera orientations are adjusted according to different target image display modes. Select the appropriate target image display strategy, and may choose different image display methods according to the different target information of the participants, avoiding that it can only be used in a single application scenario during a video session, and cannot be used according to the number of participants and the conference mode Choose to use different display methods, technical issues with poor user experience.
参考图4,图4为本申请一种图像显示方法第二实施例的流程示意图。Referring to FIG. 4 , FIG. 4 is a schematic flowchart of a second embodiment of an image display method of the present application.
基于上述第一实施例,在本实施例中,所述步骤S30,包括:Based on the first embodiment above, in this embodiment, the step S30 includes:
步骤S301:响应于所述镜头方位为镜头朝上,获取所述当前参会目标信息中的参会目标数量与会议参与度。Step S301: Responding to the fact that the camera orientation is upward, acquire the number of conference participants and the degree of conference participation in the current conference participant information.
值得说明的是,会议参与度可以是通过判断视频会议设备的麦克风阵列采集的声音信号总数量、声音信号的来源等进行计算获得,再对声音信号进行处理获得会议参与度时,可以是声音信号的总数量越多,会议参与度越高,声音信号的来源数量越多,会议参与度越高。It is worth noting that the degree of meeting participation can be calculated by judging the total number of sound signals collected by the microphone array of the video conferencing equipment, the source of the sound signals, etc., and then processing the sound signals to obtain the degree of meeting participation can be the sound signal The greater the total number of , the higher the degree of conference participation, and the greater the number of sources of sound signals, the higher the degree of conference participation.
步骤S302:根据所述镜头方位、所述参会目标数量以及所述会议参与度确定目标图像全景显示策略。Step S302: Determine a target image panorama display strategy according to the camera orientation, the number of conference participants and the conference participation degree.
需要说明的是,在镜头朝上时,目标图像全景显示策略包含有四种目标图像显示策略:1、VIP图像显示策略;2、主持人图像显示策略;3、多人会话图像显示策略;4、交替发言图像显示策略;需要根据具体的参会目标数量以及会议参与度选择合适的目标图像显示策略。It should be noted that when the camera is facing upwards, the target image panorama display strategy includes four target image display strategies: 1. VIP image display strategy; 2. Moderator image display strategy; 3. Multi-person conversation image display strategy; 4. , Alternate speech image display strategy; it is necessary to select an appropriate target image display strategy according to the specific number of participants and the degree of participation in the meeting.
进一步地,由于参会目标数量不同、会议参与度不同,选择图像显示策略时,需要选择不同的图像显示策略,即步骤S301,包括:Further, due to the different number of meeting participants and different meeting participation degrees, when selecting an image display strategy, it is necessary to select a different image display strategy, that is, step S301, including:
在所述镜头方位为镜头朝上时,获取所述当前参会目标信息中的参会目标数量以及会议参与度;When the camera orientation is facing upward, obtain the number of participants and the degree of participation in the meeting in the current meeting target information;
判断所述当前参会目标信息中是否存在主持人信息;Judging whether there is host information in the current meeting target information;
响应于所述当前参会目标信息中存在主持人信息,将所述会议参与度与第一预设值进行对比,并将所述参会目标数量与第二预设值进行对比;In response to host information being present in the current conference participation target information, comparing the conference participation degree with a first preset value, and comparing the number of conference participation targets with a second preset value;
在所述会议参与度超过第一预设值时,且当所述参会目标数量大于第二预设值,则根据所述会议请求指令、所述参会目标数量以及所述会议参与度确定VIP图像显示策略;When the degree of meeting participation exceeds a first preset value, and when the number of meeting participants is greater than a second preset value, determine according to the meeting request instruction, the number of meeting participants and the meeting participation degree VIP image display strategy;
在所述会议参与度不超过第一预设值时,且当所述参会目标数量小于第二预设值,则根据所述会议请求指令、所述参会目标数量以及所述会议参与度确定主持人图像显示策略;When the degree of meeting participation does not exceed a first preset value, and when the number of meeting participants is less than a second preset value, according to the meeting request instruction, the number of meeting participants and the meeting participation degree Determine moderator image display strategy;
可理解的是,主持人信息可以是需要长期展示的参会目标图像,所述主持人信息可以是会议主持人、组织者或者需要固定显示的人等,还可以是视频会议设备正前方的人作为默认的主持人,在实际操作中,主持人信息可以通过调整视频会议设备的摆放手动切换主持人,也可以通过视频会议设备上的按键、遥控器、手机APP等切换主持人,本实施例不做具体限制。It is understandable that the moderator information may be a long-term display target image, and the moderator information may be the conference moderator, the organizer, or a person who needs to be permanently displayed, or the person directly in front of the video conferencing device As the default moderator, in actual operation, the moderator information can be manually switched by adjusting the placement of the video conferencing equipment, or can be switched through the buttons on the video conference equipment, remote control, mobile APP, etc. This implementation Examples are not specifically limited.
其次,在存在主持人信息时,根据会议参与度以及参会人数的不同选择不同的图像显示策略,在实际操作中,在会议参与度超过第一预设值时,且当所述参会目标数量大于第二预设值,则目标图像显示策略为VIP图像显示策略;在会议参与度不超过第一预设值时,且当所述参会目标数量小于第二预设值,则目标图像显示策略为主持人图像显示策略,所述第一预设值以及第二预设值可以是用户预先设置的一个数值,本实施例不做具体限制。Secondly, when there is moderator information, different image display strategies are selected according to the degree of meeting participation and the number of participants. In actual operation, when the degree of meeting participation exceeds the first preset value, and when the participation target If the number is greater than the second preset value, the target image display strategy is the VIP image display strategy; when the conference participation does not exceed the first preset value, and when the number of participants is less than the second preset value, the target image The display strategy is the moderator's image display strategy, and the first preset value and the second preset value may be a value preset by the user, which is not specifically limited in this embodiment.
值得说明的是,VIP图像显示策略中,会设置一个VIP位置,一般会将VIP位置固定在左上角,其他的数量与位置可以不固定,在本实施例中,非VIP位置的数量为5个,当检测到有人说话时,通过检测将说话的参会目标对应的目标图像提取到其他几个位置上,且会以VIP位置为起点顺时针在非VIP位置的五个框内。It is worth noting that in the VIP image display strategy, a VIP position will be set. Generally, the VIP position will be fixed in the upper left corner, and other numbers and positions may not be fixed. In this embodiment, the number of non-VIP positions is 5 , when it is detected that someone is speaking, the target image corresponding to the speaking participant target is extracted to several other positions through detection, and will be clockwise in the five frames of the non-VIP position starting from the VIP position.
应当理解的是,主持人图像显示策略中,会将获取到的所有参会目标图像进行整合,获得一个包含有所有参会目标图像的全景图,将所述全景图固定在展示图像的下端,并将主持人图像固定在上端,可能会出现多个主持人信息的状况,因此在展示图像的上端可以放置多个主持人图像。It should be understood that in the moderator's image display strategy, all acquired target images of participants will be integrated to obtain a panorama containing all images of target participants, and the panorama will be fixed at the lower end of the display image. And fix the moderator image on the top, there may be a situation of multiple moderator information, so multiple moderator images can be placed on the top of the display image.
此外,若是在进行视频会议过程中,会议参与度以及参会人数发生变化,也可以通过预设人形检测模型、预设人脸检测模型检测到,并重新根据不同的会议参与度以及参会人数改变目标图像展示策略。In addition, if the degree of participation in the meeting and the number of participants change during the video conference, it can also be detected through the preset humanoid detection model and the preset face detection model, and then re-according to different meeting participation and the number of participants Change target image display strategy.
在具体实现中,现有A、B、C、D、E、F六人,当视频会议设备检测到主持人信息后,确认A是主持人,在会议参与度超过第一预设值时,且当所述参会目标数量大于第二预设值,则目标图像显示策略为VIP图像显示策略,因此,六人在视频图像展示位置如图5所示。In a specific implementation, there are currently six people A, B, C, D, E, and F. When the video conferencing device detects the information of the host, it confirms that A is the host. When the degree of participation in the meeting exceeds the first preset value, And when the number of participants is greater than the second preset value, the target image display strategy is the VIP image display strategy. Therefore, the video image display positions of six people are as shown in FIG. 5 .
其中,若A是主持人,且在会议参与度不超过第一预设值时,且当所述参会目标数量小于第二预设值,则目标图像显示策略为主持人图像显示策略,因此,六人的视频图像展示位置如图6所示。Wherein, if A is the moderator, and when the degree of participation in the meeting does not exceed the first preset value, and when the number of participants in the meeting is less than the second preset value, the target image display strategy is the moderator image display strategy, so , the display positions of the video images of the six people are shown in Figure 6.
进一步地,在视频会议过程中,有可能出现没有主持人的情况,因此步骤S302,还包括:Furthermore, during the video conference, there may be no moderator, so step S302 also includes:
响应于当前参会目标信息中不存在主持人信息,将所述参会目标数量与第二预设值进行对比;In response to the fact that there is no moderator information in the current conference target information, comparing the number of conference participation targets with a second preset value;
确定所述参会目标数量不超过第二预设值时,根据所述会议请求指令、所述参会目标数量以及所述会议参与度确定多人会话图像显示策略;When it is determined that the number of participants in the meeting does not exceed a second preset value, determine a multi-person session image display strategy according to the meeting request instruction, the number of participants in the meeting, and the degree of participation in the meeting;
确定所述参会目标数量超过第二预设值时,根据所述会议请求指令、所述参会目标数量以及所述会议参与度确定交替发言图像显示策略。When it is determined that the number of conference participants exceeds a second preset value, an alternate speaking image display strategy is determined according to the conference request instruction, the number of conference participants, and the degree of conference participation.
值得说明的是,在多人会话图像显示策略中,会将所获取到的参会目标图像进行整合,获得一个包含有所有参会目标图像的全景图,将所述全景图固定在展示图像的下端,在展示图像的上端会固定有三个图像框,这三个图像框,在检测到有参会目标发言时,会进行调整展示目标图像,若全景图中的参会目标数量不够三人,在展示图像的上端,只显示对应人数的图像框,人数不够的部分则显示黑框,此外,这三个图像框的顺序是固定的,例如:当前有A、B、C、D、E、F六人,则展示图像上端的是哪个图像框展示顺序为:ABC、ABD、ABE、ABF、ACD、ACE、ACF、ADE、ADF、AEF这10种顺序。It is worth noting that in the multi-person session image display strategy, the obtained target images of the participants will be integrated to obtain a panorama containing all the target images of the participants, and the panorama will be fixed on the display image. At the lower end, there will be three image frames fixed on the upper end of the display image. These three image frames will be adjusted to display the target image when it is detected that there are participants speaking. If the number of participants in the panorama is not enough for three people, On the upper part of the display image, only the image frame corresponding to the number of people is displayed, and the part with insufficient number of people displays a black frame. In addition, the order of these three image frames is fixed, for example: currently there are A, B, C, D, E, F six people, then display which image frame is on the top of the image in the following 10 order: ABC, ABD, ABE, ABF, ACD, ACE, ACF, ADE, ADF, AEF.
应当理解的是,在交替发言图像显示策略中,不会存在固定数量的图像框,且会根据参会目标数量对展示图像进行切割,切割规律为:It should be understood that in the alternate speech image display strategy, there will not be a fixed number of image frames, and the display images will be cut according to the number of participants. The cutting rule is:
当只有1个人时,展示图像的画面只有一个人像框,显示1个人;When there is only one person, the screen displaying the image has only one portrait frame, showing one person;
当只有2个人时,展示图像的画面左右等分,显示2个人;When there are only 2 people, the screen displaying the image is divided into left and right, and 2 people are displayed;
当只有3个人时,展示图像的画面按上下、左右进行4等分,显示3个人,但右下角的画面采用黑屏显示;When there are only 3 people, the screen displaying the image is divided into 4 equal parts according to top, bottom, left and right, and 3 people are displayed, but the screen in the lower right corner is displayed on a black screen;
当只有4个人时,展示图像的画面按上下、左右进行4等分,显示4个人;When there are only 4 people, the screen displaying the image is divided into 4 equal parts according to top, bottom, left and right, and 4 people are displayed;
当只有5个人时,如图12进行6等分,显示5个人,但右下角的画面采用黑屏显示;When there are only 5 people, divide it into 6 equal parts as shown in Figure 12, and display 5 people, but the screen in the lower right corner is displayed on a black screen;
当只有6个人时,如图12进行6等分,显示6个人;When there are only 6 people, perform 6 equal divisions as shown in Figure 12, showing 6 people;
当只有7个人时,展示图像的画面按上下、左右进行9等分,显示7个人,但右下角2个画面采用黑屏显示;When there are only 7 people, the screen displaying the image is divided into 9 equal parts according to the top, bottom, left and right, and 7 people are displayed, but the 2 screens in the lower right corner are displayed on a black screen;
当只有8个人时,展示图像的画面按上下、左右进行9等分,显示8个人,但右下角的画面采用黑屏显示;When there are only 8 people, the screen displaying the image is divided into 9 equal parts according to the top, bottom, left and right, and 8 people are displayed, but the screen in the lower right corner is displayed on a black screen;
当只有9个人时,展示图像的画面按上下、左右进行9等分,显示9个人;When there are only 9 people, the screen displaying the image is divided into 9 equal parts according to top, bottom, left and right, and 9 people are displayed;
当人数多于9个人时,画面显示只有9个人,初始显示的人员根据第一个说话的人员顺时针分别显示,当有第10个人说话,替换最后说话的人;When the number of people is more than 9, the screen shows only 9 people, and the initially displayed people are displayed clockwise according to the first person who spoke, and when the 10th person speaks, replace the last person who spoke;
可理解的是,当视频会议设备未检测到主持人信息,转而判断参会目标数量与第二预设值的数量关系,在所述参会目标数量不超过第二预设值时,目标图像显示策略为多人会话图像显示策略;在所述参会目标数量超过第二预设值时,目标图像显示策略为交替发言图像显示策略。It is understandable that when the video conferencing device does not detect the moderator information, it judges the quantitative relationship between the number of conference participants and the second preset value, and when the number of conference participants does not exceed the second preset value, the target The image display strategy is a multi-person conversation image display strategy; when the number of participants in the meeting exceeds a second preset value, the target image display strategy is an alternate speaking image display strategy.
在具体实现中,现有A、B、C、D、E、F六人,在所述参会目标数量不超过第二预设值时,目标图像显示策略为多人会话图像显示策略,展示图像如图7所示,在本实施例中,若之前展示图像的上端是A、D、F三人,此时检测到C进行发言,则按照切换规律,展示图像的上端会切换为A、C、D三个参会目标图像。In a specific implementation, there are currently six people A, B, C, D, E, and F. When the number of participants in the meeting does not exceed the second preset value, the target image display strategy is a multi-person conversation image display strategy, showing The image is shown in Figure 7. In this embodiment, if the upper end of the previously displayed image is A, D, and F, and it is detected that C is speaking at this time, then according to the switching rule, the upper end of the displayed image will be switched to A, D, and F. C, D three target images of participants.
其中在所述参会目标数量超过第二预设值时,目标图像显示策略为交替发言图像显示策略,根据切割规律展示图像如图8所示。Wherein, when the number of participating targets exceeds the second preset value, the target image display strategy is an alternate speech image display strategy, and the images are displayed according to the cutting rule as shown in FIG. 8 .
本实施例通过在接收到会议请求指令时,根据所述会议请求指令确定目标图像显示模式,并获取当前参会目标信息,根据所述目标图像显示模式确定镜头方位,并根据所述镜头方位调整镜头朝向,若所述镜头方位为镜头朝上,则获取所述当前参会目标信息中的参会目标数量与会议参与度,根据所述镜头方位、所述参会目标数量以及所述会议参与度确定目标图像全景显示策略,在镜头朝向转动至所述镜头方位时,根据所述目标图像显示策略对预设隐私图像或摄像头获取的当前目标图像进行展示。本实施例通过用户输入的会议请求指令确定目标图像显示模式,并根据不同的目标图像显示模式调整不同的镜头方位,适用多个场景下的图像展示,根据所述镜头方位、所述参会目标数量以及所述会议参与度的结合选择合适的目标图像全景显示策略,可能根据参会目标信息的不同选择不同的图像显示方式,避免了在进行视频会话时只能够在单一应用场景下进行使用,无法根据参会的人数,会议模式选择使用不同的显示方式,用户体验较差的技术问题。In this embodiment, when a meeting request instruction is received, the target image display mode is determined according to the meeting request instruction, and the current meeting target information is obtained, the lens orientation is determined according to the target image display mode, and the camera orientation is adjusted according to the lens orientation. Camera orientation, if the camera orientation is upward, then obtain the number of participants and the degree of participation in the meeting in the current meeting target information, according to the camera orientation, the number of participants and the conference participation The target image panorama display strategy is determined according to the degree, and when the lens orientation is rotated to the lens orientation, the preset privacy image or the current target image captured by the camera is displayed according to the target image display strategy. In this embodiment, the target image display mode is determined through the meeting request command input by the user, and different camera orientations are adjusted according to different target image display modes, which is applicable to image display in multiple scenarios. The combination of the number and the degree of meeting participation selects an appropriate target image panorama display strategy, and may select different image display methods according to the different target information of the participants, so as to avoid that it can only be used in a single application scenario when conducting a video session. It is impossible to choose different display methods according to the number of participants and the conference mode, and the technical problem is that the user experience is poor.
参考图9,图9为本申请一种图像显示方法第三实施例的流程示意图。Referring to FIG. 9 , FIG. 9 is a schematic flowchart of a third embodiment of an image display method of the present application.
基于上述第一实施例,在本实施例中,所述步骤S30,包括:Based on the first embodiment above, in this embodiment, the step S30 includes:
步骤S301A:响应于所述镜头方位为镜头朝前,根据所述镜头方位与所述当前参会信息确定目标图像广角显示策略。Step S301A: In response to the camera orientation being forward facing, determine a target image wide-angle display strategy according to the camera orientation and the current conference participant information.
可理解的是,若目标显示模式为广角模式,即镜头方位为镜头向前时,若镜头方位为镜头朝前,则根据镜头方位与所述当前参会信息确定目标图像广角显示策略。It can be understood that if the target display mode is the wide-angle mode, that is, when the camera orientation is forward, then the target image wide-angle display strategy is determined according to the lens orientation and the current conference participant information.
值的说明的是,在本实施例中,由于镜头朝前时,视频会议设备的镜头采用的是能够采集到220°的广角范围的图像,由于镜头两边的角度越大,图像的畸变越大,为了有一个好的图像展示效果,可以将获得的220°的图片两边的角度各裁掉20°,实现180°图像的效果,。The description of the value is that in this embodiment, when the lens is facing forward, the lens of the video conferencing device adopts an image that can capture a wide-angle range of 220°, and the larger the angle on both sides of the lens, the greater the distortion of the image , in order to have a good image display effect, the angles on both sides of the obtained 220° image can be cut off by 20° to achieve the effect of a 180° image.
在具体实现中,参考图10,现有A、B、C、D四人分为位于视频会议设备的前方,此时,摄像头采集到四个人的图像,但是由于B、C两人相对A、D两人距离镜头的位置会远点,所以再显示器上显示的画面B、C的人像比A、D的人像要大一些,以区别参会目标与视频会议设备的距离关系,最终的展示图像参考图11。In the specific implementation, with reference to FIG. 10 , the existing four people A, B, C, and D are divided into four people located in front of the video conferencing equipment. D and the two people will be farther away from the lens, so the portraits of B and C displayed on the monitor are larger than those of A and D, so as to distinguish the distance relationship between the participants and the video conferencing equipment, and the final display image Refer to Figure 11.
其中,若BCD三人离开只剩余A坐在原位置,此时会将A的人像进行相应的放大,并进行居中放置;当ACD三人离开只剩余B坐在原位置,因为B本身位置是靠边的,因此放大时会优先保持B整个人在显示框内,最终B的位置会有一定的放大,且会偏左一点;当整个会议桌上只有AD两人时,由于AD两人距离视频会议设备相同,因此将AD的图像进行同比例放大并进行居中放置;当整个会议桌上只有BC两人时,检测到B和C后将人像进行同比例的放大,但是因为BC两人坐的位置会靠边,因此BC的放大比例要小于AD;当整个会议桌上只有BD两人时,检测到B和D后将人像及画面中的其他场景进行同比例的放大,因为B距离镜头更近且更靠边,因此同比例放大时,同时检测B的位置,避免B的身体超出显示画面。Among them, if the three people in BCD leave and only A is left sitting in the original position, the portrait of A will be enlarged accordingly and placed in the center; when the three people in ACD leave, only B is left sitting in the original position, because B itself is at the side , so when zooming in, it will give priority to keeping B in the display frame, and finally the position of B will be zoomed in to a certain extent, and will be a little to the left; The same, so the image of AD is enlarged in the same proportion and placed in the center; when there are only BC on the entire conference table, the portraits of B and C are detected and enlarged in the same proportion, but because the sitting positions of BC and BC are different Move to the side, so the magnification ratio of BC is smaller than that of AD; when there are only two BDs on the entire conference table, after detecting B and D, the portrait and other scenes in the screen will be enlarged in the same ratio, because B is closer and closer to the lens Move to the side, so when zooming in on the same scale, the position of B is detected at the same time to prevent B's body from exceeding the display screen.
本实施例通过在接收到会议请求指令时,根据所述会议请求指令确定目标图像显示模式,并获取当前参会目标信息,根据所述目标图像显示模式确定镜头方位,并根据所述镜头方位调整镜头朝向,若镜头方位为镜头朝前,则根据镜头方位与所述当前参会信息确定目标图像广角显示策略,在镜头朝向转动至所述镜头方位时,根据所述目标图像广角显示策略对预设隐私图像或摄像头获取的当前目标图像进行展示。本实施例通过用户输入的会议请求指令确定目标图像广角显示模式,并根据目标图像广角显示模式调整镜头方位为镜头向前,适用多个场景下的图像展示,根据镜头方位以及当前参会目标信息的结合选择合适的目标图像显示策略,可能根据参会目标信息的不同选择不同的图像显示方式,避免了在进行视频会话时只能够在单一应用场景下进行使用,无法根据参会的人数,会议模式选择使用不同的显示方式,用户体验较差的技术问题。In this embodiment, when a meeting request instruction is received, the target image display mode is determined according to the meeting request instruction, and the current meeting target information is obtained, the lens orientation is determined according to the target image display mode, and the camera orientation is adjusted according to the lens orientation. The lens orientation, if the lens orientation is the lens facing forward, then determine the wide-angle display strategy of the target image according to the lens orientation and the current participant information, and when the lens orientation rotates to the lens orientation, set the target image wide-angle display strategy according to the target image. Set a private image or the current target image acquired by the camera for display. In this embodiment, the wide-angle display mode of the target image is determined through the meeting request command input by the user, and the camera orientation is adjusted to the front of the camera according to the wide-angle display mode of the target image, which is suitable for image display in multiple scenarios. It is possible to choose a different image display method according to the different target information of the participants, so as to avoid that the video session can only be used in a single application scenario, and the conference cannot be used according to the number of participants. Mode selection uses different display methods, technical issues with poor user experience.
参考图12,图12为本申请一种图像显示方法第四实施例的流程示意图。Referring to FIG. 12 , FIG. 12 is a schematic flowchart of a fourth embodiment of an image display method of the present application.
基于上述第一实施例,在本实施例中,所述步骤S30,包括:Based on the first embodiment above, in this embodiment, the step S30 includes:
步骤S301B:响应于所述镜头方位为镜头朝下,根据所述镜头方位确定目标图像隐私显示策略。Step S301B: In response to the camera orientation being downward, determine a target image privacy display policy according to the lens orientation.
可理解的是,若目标显示模式为隐私模式,即镜头方位为镜头向下时,若镜头方位为镜头朝下,则根据镜头方位确定目标图像隐私显示策略。It can be understood that if the target display mode is the privacy mode, that is, when the camera orientation is the lens downward, then the target image privacy display policy is determined according to the lens orientation.
需要说明的是,此模式下主要是实现会议隐私的保护,当本地方有需要讨论的且不想对方看到本地的画面和听到本地的讨论声音,而又不关闭正在进行的视频通话会议,可采用此方式进行实现,此外,隐私模式还可以由全景模式或者广角模式进入到隐私模式。It should be noted that this mode is mainly to protect the privacy of the conference. When there is something to discuss in the local area and you don’t want the other party to see the local screen and hear the local discussion sound, without closing the ongoing video call conference, It can be implemented in this manner. In addition, the privacy mode can also be entered into the privacy mode from the panoramic mode or the wide-angle mode.
在具体实现中,在当前目标图像展示模式为全景模式时,可能一方需要进行内部讨论,需要暂时关闭摄像头,此时,用户通过按下视频会议设备的按钮或者遥控器等方式发出指令,视频会议设备控制器在接收到显示禁止指令时,根据所述显示禁止指令控制摄像头进行转动,并控制传感器检测当前镜头方位,若所述当前镜头方位为镜头向下,则显示预设隐私图像,并关闭麦克风。In a specific implementation, when the current target image display mode is panoramic mode, one party may need to conduct internal discussions and temporarily turn off the camera. When the device controller receives the display prohibition instruction, it controls the camera to rotate according to the display prohibition instruction, and controls the sensor to detect the current lens orientation; microphone.
其中,在当前目标图像展示模式为广角模式时,用户通过按下视频会议设备的按钮或者遥控器等方式发出指令,视频会议设备控制器在接收到显示禁止指令时,根据所述显示禁止指令控制摄像头进行转动,并控制传感器检测当前镜头方位,若所述当前镜头方位为镜头向下,则显示预设隐私图像,并关闭麦克风。Wherein, when the current target image display mode is the wide-angle mode, the user sends an instruction by pressing a button of the video conferencing device or a remote control, etc., and when the video conferencing device controller receives the display prohibition instruction, it controls the display according to the display prohibition instruction. The camera is rotated and the sensor is controlled to detect the current lens orientation. If the current lens orientation is downward, a preset privacy image is displayed and the microphone is turned off.
需要说明的是,在讨论完毕后,需要继续开始视频会议,此时用户通过按下视频会议设备的按钮或者遥控器等方式发出指令,视频会议设备控制器在接收到显示开启指令时,根据所述显示开启指令控制摄像头转动,在镜头还原方位转动至所述目标图像显示策略对应的镜头方位时,开启麦克风,返回获取当前参会目标图像,并将所述当前参会目标图像通过预设图像分割模型进行图像分割,获得多个已分割图像的步骤。It should be noted that after the discussion is completed, the video conference needs to be continued. At this time, the user sends an instruction by pressing a button of the video conference device or a remote control, and the controller of the video conference device receives the display start instruction. The display opening command controls the rotation of the camera, and when the lens restoration position is rotated to the lens position corresponding to the target image display strategy, the microphone is turned on, and the target image of the current participant is obtained back, and the target image of the current participant is passed through the preset image The segmentation model performs image segmentation to obtain multiple segmented images.
可理解的是,预设隐私图像可以是用户预先设置的一张或者多张图像,预设隐私图像被配置为遮挡当前视频图像,并提醒其他视频用户,视频并未中断。It can be understood that the preset privacy image may be one or more images preset by the user, and the preset privacy image is configured to block the current video image and remind other video users that the video is not interrupted.
本实施例通过在接收到会议请求指令时,根据所述会议请求指令确定目标图像显示模式,并获取当前参会目标信息,根据所述目标图像显示模式确定镜头方位,并根据所述镜头方位调整镜头朝向,若所述镜头方位为镜头朝下,则根据所述镜头方位确定目标图像隐私显示策略,在镜头朝向转动至所述镜头方位时,根据所述目标图像隐私显示策略对预设隐私图像进行展示。本实施例通过用户输入的会议请求指令确定目标图像隐私显示模式,并根据目标图像隐私显示模式调整镜头方位为镜头向下,适用多个场景下的图像展示,根据镜头方位以及当前参会目标信息的结合选择合适的目标图像显示策略,可能根据参会目标信息的不同选择不同的图像显示方式,避免了在进行视频会话时只能够在单一应用场景下进行使用,无法根据参会的人数,会议模式选择使用不同的显示方式,用户体验较差的技术问题。In this embodiment, when a meeting request instruction is received, the target image display mode is determined according to the meeting request instruction, and the current meeting target information is obtained, the lens orientation is determined according to the target image display mode, and the camera orientation is adjusted according to the lens orientation. Camera orientation, if the lens orientation is facing downward, then determine the target image privacy display policy according to the lens orientation, and when the lens orientation rotates to the lens orientation, set the preset privacy image according to the target image privacy display policy to show. In this embodiment, the target image privacy display mode is determined through the meeting request command input by the user, and the camera orientation is adjusted to the lens downward according to the target image privacy display mode, which is suitable for image display in multiple scenarios, according to the camera orientation and current meeting target information It is possible to choose a different image display method according to the different target information of the participants, so as to avoid that the video session can only be used in a single application scenario, and the conference cannot be used according to the number of participants. Mode selection uses different display methods, technical issues with poor user experience.
此外,本申请实施例还提出一种存储介质,所述存储介质上存储有图像显示程序,所述图像显示程序被处理器执行时实现如上文所述的图像显示方法的步骤。In addition, the embodiment of the present application also proposes a storage medium, on which an image display program is stored, and when the image display program is executed by a processor, the steps of the above-mentioned image display method are realized.
由于本存储介质采用了上述所有实施例的全部技术方案,因此至少有上述实施例的技术方案所带来的所有有益效果,在此不再一一赘述。Since the storage medium adopts all the technical solutions of all the above-mentioned embodiments, it at least has all the beneficial effects brought by the technical solutions of the above-mentioned embodiments, which will not be repeated here.
参照图13,图13为本申请图像显示装置第一实施例的结构框图。Referring to FIG. 13 , FIG. 13 is a structural block diagram of the first embodiment of the image display device of the present application.
如图13所示,本申请实施例提出的图像显示装置包括:As shown in Figure 13, the image display device proposed in the embodiment of the present application includes:
指令接收模块10,被配置为接收会议请求指令,根据所述会议请求指令确定目标图像显示模式,并获取当前参会目标信息。The instruction receiving module 10 is configured to receive a meeting request instruction, determine a target image display mode according to the meeting request instruction, and acquire current meeting target information.
镜头调整模块20,被配置为根据所述目标图像显示模式确定镜头方位,并根据所述镜头方位调整镜头朝向。The lens adjustment module 20 is configured to determine the orientation of the lens according to the target image display mode, and adjust the orientation of the lens according to the orientation of the lens.
策略确认模块30,被配置为基于所述镜头方位与所述当前参会目标信息确定目标图像显示策略。The policy confirmation module 30 is configured to determine a target image display policy based on the camera orientation and the current meeting target information.
图像显示模块40,被配置为响应于镜头朝向转动至所述镜头方位,根据所述目标图像显示策略对预设隐私图像或摄像头获取的当前目标图像进行展示。The image display module 40 is configured to display a preset private image or a current target image captured by a camera according to the target image display policy in response to the camera turning to the lens orientation.
在一实施例中,所述策略确认模块30,还被配置为响应于所述镜头方位为镜头朝上,获取所述当前参会目标信息中的参会目标数量与会议参与度;根据所述镜头方位、所述参会目标数量以及所述会议参与度确定目标图像全景显示策略;在镜头朝向转动至所述镜头方位时,根据所述目标图像显示策略对预设隐私图像或摄像头获取的当前目标图像进行展示,还包括:在镜头朝向转动至所述镜头方位时,根据所述目标图像全景显示策略对摄像头获取的当前目标图像进行展示。In an embodiment, the policy confirmation module 30 is further configured to obtain the number of conference participants and the degree of conference participation in the current conference participant information in response to the camera orientation being upward; according to the The target image panorama display strategy is determined by the lens orientation, the number of participants in the meeting and the degree of participation in the meeting; when the lens orientation is rotated to the lens orientation, the preset privacy image or the current image acquired by the camera is determined according to the target image display strategy. The displaying of the target image further includes: displaying the current target image acquired by the camera according to the target image panorama display strategy when the lens orientation is rotated to the lens orientation.
在一实施例中,所述策略确认模块30,还被配置为响应于镜头方位为镜头朝前,根据镜头方位与所述当前参会信息确定目标图像广角显示策略;在镜头朝向转动至所述镜头方位时,根据所述目标图像显示策略对预设隐私图像或摄像头获取的当前目标图像进行展示,还包括:在镜头朝向转动至所述镜头方位时,根据所述目标图像广角显示策略对摄像头获取的当前目标图像进行展示。In one embodiment, the policy confirmation module 30 is further configured to determine the wide-angle display strategy of the target image according to the lens orientation and the current participant information in response to the lens orientation being the lens facing forward; When the lens is in the orientation, displaying the preset privacy image or the current target image acquired by the camera according to the target image display strategy also includes: when the lens orientation is rotated to the lens orientation, displaying the camera according to the target image wide-angle display strategy The obtained current target image is displayed.
在一实施例中,所述图像显示模块30,还被配置为响应于所述镜头方位为镜头朝下,根据所述镜头方位确定目标图像隐私显示策略;在镜头朝向转动至所述镜头方位时,根据所述目标图像显示策略对预设隐私图像或摄像头获取的当前目标图像进行展示,还包括:在镜头朝向转动至所述镜头方位时,根据所述目标图像隐私显示策略显示预设隐私图像,并关闭麦克风。In an embodiment, the image display module 30 is further configured to determine a target image privacy display strategy according to the lens orientation in response to the lens orientation being downward; when the lens orientation is rotated to the lens orientation , displaying a preset privacy image or a current target image acquired by a camera according to the target image display strategy, and further comprising: displaying a preset privacy image according to the target image privacy display strategy when the lens orientation is rotated to the lens orientation , and turn off the microphone.
在一实施例中,所述图像显示模块40,还被配置为通过摄像头获取当前参会目标图像,并开启麦克风;将所述当前参会目标图像通过预设图像分割模型进行图像分割,获得已分割图像;将所述已分割图像进行图像处理,获得当前目标图像。In one embodiment, the image display module 40 is further configured to obtain the image of the current conference target through the camera, and turn on the microphone; perform image segmentation on the current conference target image through a preset image segmentation model to obtain the Segmenting the image; performing image processing on the segmented image to obtain the current target image.
在一实施例中,所述策略确认模块40,还被配置为基于镜头方位确定所述当前参会目标图像的分割初始点;基于所述分割初始点与预设方向将所述当前参会目标图像通过预设图像分割模型进行图像分割,获得已分割图像。In one embodiment, the policy confirmation module 40 is further configured to determine the initial segmentation point of the current conference target image based on the camera orientation; The image is segmented through a preset image segmentation model to obtain a segmented image.
在一实施例中,所述图像显示模块40,还被配置为对所述当前目标图像通过预设嘴型检测模型进行嘴型检测,获得初始发言人图像;通过麦克风获取当前声音信号,根据所述当前声音信号确定发言人信息;根据所述发言人信息以及所述初始发言人图像确定目标发言人图像,并标记所述目标发言人图像。In an embodiment, the image display module 40 is further configured to perform mouth shape detection on the current target image through a preset mouth shape detection model to obtain an initial speaker image; obtain the current sound signal through a microphone, and Determine speaker information based on the current sound signal; determine a target speaker image according to the speaker information and the initial speaker image, and mark the target speaker image.
本实施例通过在接收到会议请求指令时,根据所述会议请求指令确定目标图像显示模式,并获取当前参会目标信息,根据所述目标图像显示模式确定镜头方位,并根据所述镜头方位调整镜头朝向,基于所述镜头方位与所述当前参会目标信息确定目标图像显示策略,在镜头朝向转动至所述镜头方位时,根据所述目标图像显示策略对预设隐私图像或摄像头获取的当前目标图像进行展示。本实施例通过用户输入的会议请求指令确定目标图像显示模式,并根据不同的目标图像显示模式调整不同的镜头方位,适用多个场景下的图像展示,根据镜头方位以及当前参会目标信息的结合选择合适的目标图像显示策略,可能根据参会目标信息的不同选择不同的图像显示方式,避免了在进行视频会话时只能够在单一应用场景下进行使用,且无法根据参会的人数和会议模式选择使用不同的显示方式,用户体验较差的技术问题。In this embodiment, when a meeting request instruction is received, the target image display mode is determined according to the meeting request instruction, and the current meeting target information is obtained, the lens orientation is determined according to the target image display mode, and the camera orientation is adjusted according to the lens orientation. Camera orientation, based on the lens orientation and the current meeting target information to determine the target image display strategy, when the lens orientation is rotated to the lens orientation, according to the target image display strategy, the preset privacy image or the current image captured by the camera The target image is displayed. In this embodiment, the target image display mode is determined through the meeting request command input by the user, and different camera orientations are adjusted according to different target image display modes. Select the appropriate target image display strategy, and may choose different image display methods according to the different target information of the participants, avoiding that it can only be used in a single application scenario during a video session, and cannot be used according to the number of participants and the conference mode Choose to use different display methods, technical issues with poor user experience.
应当理解的是,以上仅为举例说明,对本申请的技术方案并不构成任何限定,在具体应用中,本领域的技术人员可以根据需要进行设置,本申请对此不做限制。It should be understood that the above is only an example, and does not constitute any limitation to the technical solution of the present application. In a specific application, those skilled in the art can make settings according to needs, and the present application does not limit this.
需要说明的是,以上所描述的工作流程仅仅是示意性的,并不对本申请的保护范围构成限定,在实际应用中,本领域的技术人员可以根据实际的需要选择其中的部分或者全部来实现本实施例方案的目的,此处不做限制。It should be noted that the workflow described above is only illustrative and does not limit the scope of protection of this application. In practical applications, those skilled in the art can select part or all of them to implement according to actual needs. The purpose of the scheme of this embodiment is not limited here.
另外,未在本实施例中详尽描述的技术细节,可参见本申请任意实施例所提供的图像显示方法,此处不再赘述。In addition, for technical details not exhaustively described in this embodiment, reference may be made to the image display method provided in any embodiment of the present application, which will not be repeated here.
此外,需要说明的是,在本文中,术语“包括”、“包含”或者其任何其他变体意在涵盖非排他性的包含,从而使得包括一系列要素的过程、方法、物品或者系统不仅包括那些要素,而且还包括没有明确列出的其他要素,或者是还包括为这种过程、方法、物品或者系统所固有的要素。在没有更多限制的情况下,由语句“包括一个……”限定的要素,并不排除在包括该要素的过程、方法、物品或者系统中还存在另外的相同要素。Furthermore, it should be noted that in this document, the term "comprises", "comprises" or any other variation thereof is intended to cover a non-exclusive inclusion such that a process, method, article or system comprising a set of elements includes not only those elements, but also other elements not expressly listed, or elements inherent in such a process, method, article, or system. Without further limitations, an element defined by the phrase "comprising a..." does not preclude the presence of additional identical elements in the process, method, article or system comprising that element.
上述本申请实施例序号仅仅为了描述,不代表实施例的优劣。The serial numbers of the above embodiments of the present application are for description only, and do not represent the advantages and disadvantages of the embodiments.
通过以上的实施方式的描述,本领域的技术人员可以清楚地了解到上述实施例方法可借助软件加必需的通用硬件平台的方式来实现,当然也可以通过硬件,但很多情况下前者是更佳的实施方式。基于这样的理解,本申请的技术方案本质上或者说对现有技术做出贡献的部分可以以软件产品的形式体现出来,该计算机软件产品存储在一个存储介质(如只读存储器(Read Only Memory,ROM)/RAM、磁碟、光盘)中,包括若干指令用以使得一台终端设备(可以是手机,计算机,服务器,或者网络设备等)执行本申请各个实施例所述的方法。Through the description of the above embodiments, those skilled in the art can clearly understand that the methods of the above embodiments can be implemented by means of software plus a necessary general-purpose hardware platform, and of course also by hardware, but in many cases the former is better implementation. Based on this understanding, the essence of the technical solution of this application or the part that contributes to the prior art can be embodied in the form of a software product, and the computer software product is stored in a storage medium (such as a read-only memory (Read Only Memory) , ROM)/RAM, magnetic disk, optical disk), including several instructions to enable a terminal device (which can be a mobile phone, computer, server, or network device, etc.) to execute the methods described in various embodiments of the present application.
以上仅为本申请的可选实施例,并非因此限制本申请的专利范围,凡是利用本申请说明书及附图内容所作的等效结构或等效流程变换,或直接或间接运用在其他相关的技术领域,均同理包括在本申请的专利保护范围内。The above are only optional embodiments of the application, and are not intended to limit the patent scope of the application. Any equivalent structure or equivalent process transformation made by using the specification and drawings of the application, or directly or indirectly used in other related technologies fields, are all included in the scope of patent protection of this application in the same way.

Claims (10)

  1. 一种图像显示方法,包括:An image display method, comprising:
    接收到会议请求指令,根据所述会议请求指令确定目标图像显示模式,并获取当前参会目标信息;Receiving the meeting request instruction, determining the target image display mode according to the meeting request instruction, and obtaining the current meeting target information;
    根据所述目标图像显示模式确定镜头方位,并根据所述镜头方位调整镜头朝向;determining the lens orientation according to the target image display mode, and adjusting the lens orientation according to the lens orientation;
    基于所述镜头方位与所述当前参会目标信息确定目标图像显示策略;determining a target image display strategy based on the lens orientation and the current meeting target information;
    响应于镜头朝向转动至所述镜头方位,根据所述目标图像显示策略对预设隐私图像或摄像头获取的当前目标图像进行展示。In response to the camera turning to the camera orientation, the preset private image or the current target image captured by the camera is displayed according to the target image display strategy.
  2. 如权利要求1所述的图像显示方法,其中,所述目标图像显示策略包括目标图像全景显示策略;The image display method according to claim 1, wherein the target image display strategy comprises a target image panorama display strategy;
    所述基于所述镜头方位与所述当前参会目标信息确定目标图像显示策略,包括:The determining target image display strategy based on the lens orientation and the current meeting target information includes:
    响应于所述镜头方位为镜头朝上,获取所述当前参会目标信息中的参会目标数量与会议参与度;Responding to the fact that the camera orientation is facing upwards, obtain the number of participants and the degree of participation in the meeting in the information about the current meeting participants;
    根据所述镜头方位、所述参会目标数量以及所述会议参与度确定目标图像全景显示策略;Determine a target image panorama display strategy according to the lens orientation, the number of participants in the meeting, and the degree of participation in the meeting;
    响应于镜头朝向转动至所述镜头方位,根据所述目标图像显示策略对预设隐私图像或摄像头获取的当前目标图像进行展示,还包括:In response to the lens turning to the lens orientation, displaying the preset privacy image or the current target image acquired by the camera according to the target image display strategy, further comprising:
    响应于镜头朝向转动至所述镜头方位,根据所述目标图像全景显示策略对摄像头获取的当前目标图像进行展示。In response to the camera turning to the camera orientation, the current target image acquired by the camera is displayed according to the target image panorama display strategy.
  3. 如权利要求1所述的图像显示方法,其中,所述目标图像显示策略包括目标图像广角显示策略;The image display method according to claim 1, wherein the target image display strategy comprises a target image wide-angle display strategy;
    所述基于所述镜头方位与所述当前参会目标信息确定目标图像显示策略,包括:The determining target image display strategy based on the lens orientation and the current meeting target information includes:
    响应于所述镜头方位为镜头朝前,根据所述镜头方位与所述当前参会信息确定目标图像广角显示策略;In response to the camera orientation being forward facing, determine a target image wide-angle display strategy according to the lens orientation and the current conference participant information;
    响应于镜头朝向转动至所述镜头方位,根据所述目标图像显示策略对预设隐私图像或摄像头获取的当前目标图像进行展示,还包括:In response to the lens turning to the lens orientation, displaying the preset privacy image or the current target image acquired by the camera according to the target image display strategy, further comprising:
    响应于镜头朝向转动至所述镜头方位,根据所述目标图像广角显示策略对摄像头获取的当前目标图像进行展示。In response to the lens turning to the lens orientation, the current target image acquired by the camera is displayed according to the target image wide-angle display strategy.
  4. 如权利要求1-3任一项所述的图像显示方法,其中,所述目标图像显示策略包括目标图像隐私显示策略;The image display method according to any one of claims 1-3, wherein the target image display strategy includes a target image privacy display strategy;
    所述基于所述镜头方位与所述当前参会目标信息确定目标图像显示策略,包括:The determining target image display strategy based on the lens orientation and the current meeting target information includes:
    响应于所述镜头方位为镜头朝下,根据所述镜头方位确定目标图像隐私显示策略;Responsive to the camera orientation being downward, determining a target image privacy display policy according to the lens orientation;
    响应于镜头朝向转动至所述镜头方位,根据所述目标图像显示策略对预设隐私图像或摄像头获取的当前目标图像进行展示,还包括:In response to the lens turning to the lens orientation, displaying the preset privacy image or the current target image acquired by the camera according to the target image display strategy, further comprising:
    响应于镜头朝向转动至所述镜头方位,根据所述目标图像隐私显示策略显示预设隐私图像,并关闭麦克风。In response to the camera turning to the camera orientation, displaying a preset privacy image according to the target image privacy display policy, and turning off the microphone.
  5. 如权利要求1-3任一项所述的图像显示方法,其中,所述在镜头朝向转动至所述镜头方位时,根据所述目标图像显示策略对预设隐私图像或摄像头获取的当前目标图像进行展示之前,包括:The image display method according to any one of claims 1-3, wherein when the camera orientation is rotated to the camera orientation, the preset privacy image or the current target image acquired by the camera is displayed according to the target image display strategy Before making a presentation, include:
    通过摄像头获取当前参会目标图像,并开启麦克风;Obtain the image of the current participant target through the camera, and turn on the microphone;
    将所述当前参会目标图像通过预设图像分割模型进行图像分割,获得已分割图像;Segmenting the image of the current participant target through a preset image segmentation model to obtain a segmented image;
    将所述已分割图像进行图像处理,获得所述当前目标图像。performing image processing on the segmented image to obtain the current target image.
  6. 如权利要求5所述的图像显示方法,其中,所述将所述当前参会目标图像通过预设图像分割模型进行图像分割,获得已分割图像,包括:The image display method according to claim 5, wherein said performing image segmentation on said current meeting target image through a preset image segmentation model to obtain a segmented image comprises:
    基于镜头方位确定所述当前参会目标图像的分割初始点;Determining the initial segmentation point of the current participant target image based on the camera orientation;
    基于所述分割初始点与预设方向将所述当前参会目标图像通过预设图像分割模型进行图像分割,获得已分割图像。Segmenting the current meeting target image by using a preset image segmentation model based on the initial segmentation point and the preset direction to obtain a segmented image.
  7. 如权利要求1-3任一项所述的图像显示方法,其中,所述在镜头朝向转动至所述镜头方位时,根据所述目标图像显示策略对预设隐私图像或摄像头获取的当前目标图像进行展示之后,还包括:The image display method according to any one of claims 1-3, wherein when the camera orientation is rotated to the camera orientation, the preset privacy image or the current target image acquired by the camera is displayed according to the target image display strategy After the presentation, also include:
    对所述当前目标图像通过预设嘴型检测模型进行嘴型检测,获得初始发言人图像;performing mouth shape detection on the current target image through a preset mouth shape detection model to obtain an initial speaker image;
    通过麦克风获取当前声音信号,根据所述当前声音信号确定发言人信息;Obtaining a current sound signal through a microphone, and determining speaker information according to the current sound signal;
    根据所述发言人信息以及所述初始发言人图像确定目标发言人图像,并标记所述目标发言人图像。Determine a target speaker image according to the speaker information and the initial speaker image, and mark the target speaker image.
  8. 一种图像显示装置,包括:An image display device, comprising:
    指令接收模块,被配置为接收会议请求指令,根据所述会议请求指令确定目标图像显示模式,并获取当前参会目标信息;The instruction receiving module is configured to receive the meeting request instruction, determine the target image display mode according to the meeting request instruction, and acquire the current meeting target information;
    镜头调整模块,被配置为根据所述目标图像显示模式确定镜头方位,并根据所述镜头方位调整镜头朝向;The lens adjustment module is configured to determine the orientation of the lens according to the target image display mode, and adjust the orientation of the lens according to the orientation of the lens;
    策略确认模块,被配置为基于所述镜头方位与所述当前参会目标信息确定目标图像显示策略;A strategy confirmation module, configured to determine a target image display strategy based on the camera orientation and the current meeting target information;
    图像显示模块,被配置为响应于镜头朝向转动至所述镜头方位,根据所述目标图像显示策略对预设隐私图像或摄像头获取的当前目标图像进行展示。The image display module is configured to display a preset private image or a current target image captured by a camera according to the target image display policy in response to the lens turning to the lens orientation.
  9. 一种图像显示设备,所述图像显示设备包括:存储器、处理器及存储在所述存储器上并可在所述处理器上运行的图像显示程序,所述图像显示程序被所述处理器执行时实现如权利要求1至7中任一项所述的图像显示方法。An image display device, the image display device comprising: a memory, a processor, and an image display program stored in the memory and operable on the processor, when the image display program is executed by the processor Realizing the image display method as claimed in any one of claims 1 to 7.
  10. 一种存储介质,所述存储介质上存储有图像显示程序,所述图像显示程序被处理器执行时实现如权利要求1至7任一项所述的图像显示方法。A storage medium, on which an image display program is stored, and when the image display program is executed by a processor, the image display method according to any one of claims 1 to 7 is realized.
PCT/CN2021/118489 2021-09-07 2021-09-15 Image display method, apparatus and device, and storage medium WO2022262134A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202111047995.3A CN113905204B (en) 2021-09-07 2021-09-07 Image display method, device, equipment and storage medium
CN202111047995.3 2021-09-07

Publications (1)

Publication Number Publication Date
WO2022262134A1 true WO2022262134A1 (en) 2022-12-22

Family

ID=79188827

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2021/118489 WO2022262134A1 (en) 2021-09-07 2021-09-15 Image display method, apparatus and device, and storage medium

Country Status (2)

Country Link
CN (1) CN113905204B (en)
WO (1) WO2022262134A1 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116489502A (en) * 2023-05-12 2023-07-25 深圳星河创意科技开发有限公司 Remote conference method based on AI camera docking station and AI camera docking station
CN117640877A (en) * 2024-01-24 2024-03-01 浙江华创视讯科技有限公司 Picture reconstruction method for online conference and electronic equipment

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102833518A (en) * 2011-06-13 2012-12-19 华为终端有限公司 Method and device for optimally configuring MCU (multipoint control unit) multipicture
US9237140B1 (en) * 2013-03-07 2016-01-12 Cisco Technologies, Inc. Acceptance of policies for cross-company online sessions
CN112351237A (en) * 2020-11-05 2021-02-09 安徽马钢和菱实业有限公司 Automatic switching decision algorithm for main video of video conference
CN112601044A (en) * 2020-12-08 2021-04-02 深圳市焦点数字科技有限公司 Conference scene picture self-adaption method
CN113139491A (en) * 2021-04-30 2021-07-20 厦门盈趣科技股份有限公司 Video conference control method, system, mobile terminal and storage medium

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
NO332170B1 (en) * 2009-10-14 2012-07-16 Cisco Systems Int Sarl Camera control device and method
US9113032B1 (en) * 2011-05-31 2015-08-18 Google Inc. Selecting participants in a video conference
US9124762B2 (en) * 2012-12-20 2015-09-01 Microsoft Technology Licensing, Llc Privacy camera
CN105306868B (en) * 2014-06-17 2019-07-26 三亚中兴软件有限责任公司 Video conferencing system and method
CN109257559A (en) * 2018-09-28 2019-01-22 苏州科达科技股份有限公司 A kind of image display method, device and the video conferencing system of panoramic video meeting
JP7225735B2 (en) * 2018-11-27 2023-02-21 株式会社リコー VIDEO CONFERENCE SYSTEM, COMMUNICATION TERMINAL AND MICROPHONE CONTROL METHOD OF COMMUNICATION TERMINAL

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102833518A (en) * 2011-06-13 2012-12-19 华为终端有限公司 Method and device for optimally configuring MCU (multipoint control unit) multipicture
US9237140B1 (en) * 2013-03-07 2016-01-12 Cisco Technologies, Inc. Acceptance of policies for cross-company online sessions
CN112351237A (en) * 2020-11-05 2021-02-09 安徽马钢和菱实业有限公司 Automatic switching decision algorithm for main video of video conference
CN112601044A (en) * 2020-12-08 2021-04-02 深圳市焦点数字科技有限公司 Conference scene picture self-adaption method
CN113139491A (en) * 2021-04-30 2021-07-20 厦门盈趣科技股份有限公司 Video conference control method, system, mobile terminal and storage medium

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116489502A (en) * 2023-05-12 2023-07-25 深圳星河创意科技开发有限公司 Remote conference method based on AI camera docking station and AI camera docking station
CN116489502B (en) * 2023-05-12 2023-10-31 深圳星河创意科技开发有限公司 Remote conference method based on AI camera docking station and AI camera docking station
CN117640877A (en) * 2024-01-24 2024-03-01 浙江华创视讯科技有限公司 Picture reconstruction method for online conference and electronic equipment

Also Published As

Publication number Publication date
CN113905204B (en) 2023-02-14
CN113905204A (en) 2022-01-07

Similar Documents

Publication Publication Date Title
US9860486B2 (en) Communication apparatus, communication method, and communication system
CA2874715C (en) Dynamic video and sound adjustment in a video conference
US8289363B2 (en) Video conferencing
US20100118112A1 (en) Group table top videoconferencing device
RU2549169C2 (en) Image processing device, image processing method and computer-readable data medium
WO2022262134A1 (en) Image display method, apparatus and device, and storage medium
US11477393B2 (en) Detecting and tracking a subject of interest in a teleconference
US10979666B2 (en) Asymmetric video conferencing system and method
JPH1042264A (en) Video conference system
CN112333391A (en) Method and device for automatically tracking portrait based on sound, intelligent terminal and medium
EP4075794A1 (en) Region of interest based adjustment of camera parameters in a teleconferencing environment
EP4106326A1 (en) Multi-camera automatic framing
TWI248021B (en) Method and system for correcting out-of-focus eyesight of attendant images in video conferencing
JP6565777B2 (en) COMMUNICATION DEVICE, CONFERENCE SYSTEM, PROGRAM, AND DISPLAY CONTROL METHOD
TWI785511B (en) Target tracking method applied to video transmission
JP2005110160A (en) Imaging apparatus
WO2022007681A1 (en) Photographing control method, mobile terminal, and computer readable storage medium
JP2000244885A (en) Image photographing device, method therefor, storage medium and video conference system
WO2023235329A1 (en) Framework for simultaneous subject and desk capture during videoconferencing
JP2002262138A (en) Image pickup system, video conference system, monitoring system, and information terminal with image pickup function
CN117319594A (en) Conference personnel tracking display method, device, equipment and readable storage medium
TW202345589A (en) Audiovisual system and control method thereof

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 21945714

Country of ref document: EP

Kind code of ref document: A1