CN117055732A - Picture switching method, switching device and intelligent conference equipment - Google Patents

Picture switching method, switching device and intelligent conference equipment Download PDF

Info

Publication number
CN117055732A
CN117055732A CN202311051650.4A CN202311051650A CN117055732A CN 117055732 A CN117055732 A CN 117055732A CN 202311051650 A CN202311051650 A CN 202311051650A CN 117055732 A CN117055732 A CN 117055732A
Authority
CN
China
Prior art keywords
gesture
picture
mode
display
conference
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202311051650.4A
Other languages
Chinese (zh)
Inventor
邹建财
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shenzhen Jingwah Information Technology Co ltd
Original Assignee
Shenzhen Jingwah Information Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shenzhen Jingwah Information Technology Co ltd filed Critical Shenzhen Jingwah Information Technology Co ltd
Priority to CN202311051650.4A priority Critical patent/CN117055732A/en
Publication of CN117055732A publication Critical patent/CN117055732A/en
Pending legal-status Critical Current

Links

Landscapes

  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)

Abstract

The invention discloses a picture switching method, a switching device and intelligent conference equipment. The picture switching method comprises the following steps: acquiring gesture actions of participants; comparing the gesture action with the gesture actions in the gesture library, and determining the gesture type of the gesture action; if the gesture acts as a mode switching gesture, determining a conference display mode according to the repetition times of the mode switching gesture, and adjusting a display picture of the projection device according to the conference display mode; if the gesture motion is an amplifying gesture motion, amplifying the current output picture, and transmitting the amplified current output picture to a projection device for display; if the gesture is a shrinking gesture, the current output picture is subjected to shrinking processing, and the current output picture after the shrinking processing is transmitted to the projection device for display, so that the meeting participants can realize the switching of different pictures according to different gesture, the diversity of picture switching is realized, and the operation is simple.

Description

Picture switching method, switching device and intelligent conference equipment
Technical Field
The present invention relates to the field of intelligent conference technologies, and in particular, to a method and an apparatus for switching pictures, and an intelligent conference device.
Background
Video conferencing has become an effective tool for remote communication nowadays, but in video conferencing, switching operation of display screens is complicated.
Disclosure of Invention
The invention provides a picture switching method, a switching device and intelligent conference equipment, so that different pictures can be switched according to different gesture actions of participants, the diversity of picture switching is realized, and the operation is simple.
According to an aspect of the present invention, there is provided a picture switching method including:
acquiring gesture actions of participants;
comparing the gesture action with the gesture actions in the gesture library, and determining the gesture type of the gesture action;
if the gesture acts as a mode switching gesture, determining a conference display mode according to the repetition times of the mode switching gesture, and adjusting a display picture of the projection device according to the conference display mode; the conference display mode comprises a normal mode, a panoramic mode, a picture rotation mode and a sound source positioning mode;
if the gesture motion is an amplifying gesture motion, amplifying the current output picture, and transmitting the amplified current output picture to a projection device for display;
if the gesture is a zoom-out gesture, the current output picture is zoomed out, and the zoomed out current output picture is transmitted to the showing device for showing.
Further, if the current gesture is a mode switching gesture, determining a conference display mode according to the acquired times of the mode switching gesture includes:
if the determined conference display mode is a normal mode, controlling the projection device to display a display picture combining the conference whole picture and the conference participant single picture;
if the determined conference display mode is a panoramic mode, controlling the projection device to display the whole conference picture;
if the determined conference display mode is a picture rotation mode, controlling a projection device to display each single picture of the participants according to a first preset frequency;
and if the determined conference display mode is a sound source positioning mode, controlling the projection device to display the single picture of the speaking participants according to the position information of the sound source.
Further, if the determined conference display mode is a sound source positioning mode, controlling the projection device to display a single picture of a speaking participant according to the position information of the sound source, and then comprising:
continuously acquiring gesture actions of participants;
if the acquired gesture is an intelligent following gesture; the output picture is subjected to intelligent following processing, and the output picture after the intelligent following processing is transmitted to a projection device for display; the intelligent following processing is that the participants who will speak always serve as the center of the picture.
Further, the intelligent following processing is performed on the output picture, and then the method further comprises the following steps:
continuously acquiring gesture actions of participants;
and if the acquired gesture is the exiting gesture, controlling to cancel intelligent following processing.
Further, if the determined conference display mode is a sound source positioning mode, controlling the projection device to display a single picture of a speaking participant according to the position information of the sound source, and then comprising:
and in the process of executing the sound source positioning mode, if the sound signal cannot be detected within the first preset time after the sound signal is stopped, displaying the panoramic mode picture.
Further, acquiring gesture actions of the participants, the method further includes:
recording various gesture actions required to be used, modeling the recorded gesture actions, and storing the gesture actions after the modeling into the gesture library.
Further, recording the gesture required to be used, and then includes:
recording various gesture actions under different environments, and optimizing the recorded various gesture actions.
According to another aspect of the present invention, there is provided a picture switching apparatus including:
the gesture action acquisition module is used for acquiring gesture actions of participants;
the gesture type acquisition module is used for comparing the gesture action with the gesture actions in the gesture library and determining the gesture type of the gesture action;
the conference display mode determining module is used for determining a conference display mode according to the repetition times of the mode switching gesture when the gesture action is the mode switching gesture, and adjusting a display picture of the projection device according to the conference display mode; the conference display mode comprises a normal mode, a panoramic mode, a picture rotation mode and a sound source positioning mode;
the gesture amplifying action determining module is used for amplifying the current output picture when the gesture action is the gesture amplifying action, and transmitting the amplified current output picture to the showing device for showing;
and the gesture reduction action determining module is used for carrying out reduction processing on the current output picture when the current gesture action is the gesture reduction action, and transmitting the current output picture after the reduction processing to the showing device for showing.
Further, the conference presentation mode determining module includes:
the normal mode display unit is used for controlling the projection device to display a display picture combining the whole conference picture and the single picture of the participants when the determined conference display mode is the normal mode;
the panoramic mode display picture is used for controlling the projection device to display the whole conference picture when the determined conference display mode is the panoramic mode;
the display picture of the rotation mode is used for controlling the projection device to display each single picture of the participants according to the first preset frequency when the determined conference display mode is the picture rotation mode;
and the sound source positioning display picture is used for controlling the projection device to display the single picture of the speaking participants according to the position information of the sound source when the determined conference display mode is the sound source positioning mode.
According to another aspect of the present invention, there is provided an intelligent conference apparatus including the picture switching device described in any of the above embodiments; the intelligent conference device further comprises at least one camera.
According to the picture switching method provided by the embodiment of the invention, the gesture actions of the participants are obtained; comparing the gesture action with the gesture actions in the gesture library, and determining the gesture type of the gesture action; when the gesture is a mode switching gesture, determining a conference display mode according to the repetition times of the mode switching gesture, and adjusting a display picture of the projection device according to the conference display mode; when the gesture is an amplifying gesture, amplifying the current output picture, and transmitting the amplified current output picture to a projection device for display; when the gesture is a shrinking gesture, the current output picture is subjected to shrinking processing, the current output picture after the shrinking processing is transmitted to the projection device for display, and the meeting participants can realize the switching of different pictures according to different gesture, so that the diversity of picture switching is realized, and the operation is simple.
It should be understood that the description in this section is not intended to identify key or critical features of the embodiments of the invention or to delineate the scope of the invention. Other features of the present invention will become apparent from the description that follows.
Drawings
In order to more clearly illustrate the technical solutions of the embodiments of the present invention, the drawings required for the description of the embodiments will be briefly described below, and it is apparent that the drawings in the following description are only some embodiments of the present invention, and other drawings may be obtained according to these drawings without inventive effort for a person skilled in the art.
Fig. 1 is a flowchart of a picture switching method according to an embodiment of the present invention;
FIG. 2 is a schematic diagram of a mode-switched gesture provided in accordance with an embodiment of the present invention;
FIG. 3 is a schematic diagram of a gesture with enlarged screen according to an embodiment of the present invention;
FIG. 4 is a schematic diagram of a gesture with reduced frame according to an embodiment of the present invention;
FIG. 5 is a schematic diagram of an intelligently following gesture provided in accordance with an embodiment of the present invention;
FIG. 6 is a schematic diagram of a gesture action to exit intelligent follow provided in accordance with an embodiment of the present invention;
fig. 7 is a schematic structural diagram of a frame switching device according to an embodiment of the present invention.
Detailed Description
In order that those skilled in the art will better understand the present invention, a technical solution in the embodiments of the present invention will be clearly and completely described below with reference to the accompanying drawings in which it is apparent that the described embodiments are only some embodiments of the present invention, not all embodiments. All other embodiments, which can be made by those skilled in the art based on the embodiments of the present invention without making any inventive effort, shall fall within the scope of the present invention.
It should be noted that the terms "first," "second," and the like in the description and the claims of the present invention and the above figures are used for distinguishing between similar objects and not necessarily for describing a particular sequential or chronological order. It is to be understood that the data so used may be interchanged where appropriate such that the embodiments of the invention described herein may be implemented in sequences other than those illustrated or otherwise described herein. Furthermore, the terms "comprises," "comprising," and "having," and any variations thereof, are intended to cover a non-exclusive inclusion, such that a process, method, system, article, or apparatus that comprises a list of steps or elements is not necessarily limited to those steps or elements expressly listed but may include other steps or elements not expressly listed or inherent to such process, method, article, or apparatus.
An embodiment of the present invention provides a picture switching method, fig. 1 is a flowchart of a picture switching method provided according to an embodiment of the present invention, and referring to fig. 1, the picture switching method includes:
s110, acquiring gesture actions of participants.
The gesture motion includes a mode switching gesture motion, a screen zooming gesture motion, and the like, which is not limited in the embodiment of the present invention.
S120, comparing the gesture actions with the gesture actions in the gesture library, and determining the gesture type of the gesture actions.
Specifically, the gesture actions in the gesture library may be set according to actual needs, which is not limited in the embodiment of the present invention. Fig. 2 is a schematic diagram of a mode switching gesture provided according to an embodiment of the present invention, where, as shown in fig. 2, when the acquired gesture of the participant is the gesture shown in fig. 2, that is, only the gesture with the thumb and the index finger open, the acquired gesture of the participant is determined to be the mode switching gesture; fig. 3 is a schematic diagram of a gesture motion with enlarged screen, as shown in fig. 3, when the acquired gesture motion of a participant is the gesture shown in fig. 3, that is, the gesture with ten fingers of two hands opening and moving outwards, the acquired gesture motion of the participant is determined to be an enlarged gesture motion; fig. 4 is a schematic diagram of a gesture motion with reduced screen, as shown in fig. 4, where when the acquired gesture motion of the participants is the gesture shown in fig. 4, that is, the gesture with ten fingers of two hands opening and moving inward, the acquired gesture motion of the participants is determined to be the gesture motion with reduced screen.
S130, if the gesture acts as a mode switching gesture, determining a conference display mode according to the repetition times of the mode switching gesture, and adjusting a display picture of the projection device according to the conference display mode; the conference display mode comprises a normal mode, a panoramic mode, a picture rotation mode and a sound source positioning mode.
Specifically, when the acquired gesture of the participant is a mode switching gesture, determining a conference display mode according to the repetition number of the mode switching gesture, and when the mode switching gesture shown in fig. 2 is acquired only once, for example, determining that the conference display mode is a normal mode, and displaying a display picture of the normal mode through a showing device; when the mode switching gesture shown in fig. 2 is only acquired twice, the conference display mode can be determined to be the panoramic mode, and a display picture of the panoramic mode is displayed through a showing device, wherein after the mode switching gesture shown in fig. 2 is detected, the five-finger fist making gesture is detected, and then the mode switching gesture shown in fig. 2 is detected again, namely, the mode switching gesture shown in fig. 2 is acquired twice; when only three mode switching gestures shown in fig. 2 are acquired, determining that the conference display mode is a picture rotation mode, and displaying a display picture of the picture rotation mode through a projection device, wherein after the mode switching gestures shown in fig. 2 are detected, detecting a five-finger fist-making gesture, then detecting the mode switching gestures shown in fig. 2 again, detecting the five-finger fist-making gesture again, and then detecting the mode switching gestures shown in fig. 2 again, namely acquiring the three mode switching gestures shown in fig. 2; when the mode switching gesture shown in fig. 2 is only acquired four times, the conference display mode can be determined to be the sound source positioning mode, and the display picture of the sound source positioning mode is displayed through the projection device, wherein after the mode switching gesture shown in fig. 2 is detected, the five-finger fist-making gesture is detected, then the mode switching gesture shown in fig. 2 is detected again, the five-finger fist-making gesture is detected again, then the mode switching gesture shown in fig. 2 is detected again, and then the mode switching gesture shown in fig. 2 is detected again, namely, the mode switching gesture shown in fig. 2 is acquired four times. The display picture in the normal mode can be a display picture combining the whole conference picture and the single picture of the participants; the display picture of the panoramic mode can be a whole picture of the conference; the display picture of the picture rotation mode can be each single picture for displaying the participants according to the preset frequency, the preset frequency can be set according to the actual situation, and the embodiment of the invention is not limited to the preset frequency; the display screen of the sound source positioning mode can be a single screen of the participants speaking.
And S140, if the gesture is an amplifying gesture, amplifying the current output picture, and transmitting the amplified current output picture to the showing device for showing.
Specifically, when the amplifying gesture shown in fig. 3 is obtained once, the amplifying process can be performed on the current output picture once, and the amplified current output picture is transmitted to the projection device to be displayed, and when the conference display mode is the normal mode, for example, when the amplifying gesture shown in fig. 3 is obtained once, the amplifying process can be performed on the single picture of the participants who display the amplifying gesture once; when the conference display mode is a panoramic mode, the whole conference picture can be amplified once every time the amplifying gesture shown in fig. 3 is acquired; when the conference display mode is a picture rotation mode, amplifying a single picture of a currently displayed participant once every time when the amplifying gesture shown in fig. 3 is obtained; when the conference display mode is the sound source positioning mode, the single picture of the speaking participants can be amplified once when the amplifying gesture shown in fig. 3 is acquired once.
And S150, if the gesture is a shrinking gesture, performing shrinking processing on the current output picture, and transmitting the current output picture after the shrinking processing to the showing device for showing.
Specifically, when the zoom-out gesture shown in fig. 4 is obtained once, a zoom-out process can be performed on the current output picture, and the current output picture after the zoom-out process is transmitted to the projection device for display, and when the conference display mode is the normal mode, for example, when the zoom-out gesture shown in fig. 4 is obtained once, a zoom-out process can be performed on a single picture of a participant who displays the zoom-out gesture; when the conference display mode is the panoramic mode, the whole conference picture can be subjected to one-time shrinking processing every time the shrinking gesture action shown in fig. 4 is obtained once; when the conference display mode is a picture rotation mode, once the shrinking gesture action shown in fig. 4 is obtained, the single picture of the currently displayed participants can be subjected to one-time shrinking treatment; when the conference display mode is the sound source positioning mode, the single picture of the speaking participants can be subjected to one time of reduction processing every time the reduction gesture action shown in fig. 4 is acquired.
According to the picture switching method provided by the embodiment of the invention, the gesture actions of the participants are obtained; comparing the gesture action with the gesture actions in the gesture library, and determining the gesture type of the gesture action; when the gesture is a mode switching gesture, determining a conference display mode according to the repetition times of the mode switching gesture, and adjusting a display picture of the projection device according to the conference display mode; when the gesture is an amplifying gesture, amplifying the current output picture, and transmitting the amplified current output picture to a projection device for display; when the gesture is a shrinking gesture, the current output picture is subjected to shrinking processing, the current output picture after the shrinking processing is transmitted to the projection device for display, and the meeting participants can realize the switching of different pictures through different gesture, so that the diversity of picture switching is realized, and the operation is simple.
Further, if the current gesture is a mode switching gesture, determining a conference display mode according to the acquired times of the mode switching gesture includes:
if the determined conference display mode is a normal mode, controlling the projection device to display a display picture combining the conference whole picture and the conference participant single picture;
if the determined conference display mode is a panoramic mode, controlling the projection device to display the whole conference picture;
if the determined conference display mode is a picture rotation mode, controlling a projection device to display each single picture of the participants according to a first preset frequency;
and if the determined conference display mode is a sound source positioning mode, controlling the projection device to display the single picture of the speaking participants according to the position information of the sound source.
Specifically, one or more ultra-high definition and optical zoom cameras can be configured in the conference room, when the conference display picture is in a normal mode, the display picture combined with the conference integral picture and the conference participant single picture can be set as the conference integral picture above the display picture, and the lower part of the display picture is set as the conference participant single picture; the upper part of the display picture can be set as a single picture of the participants, the lower part of the display picture is set as a conference whole picture, and the information such as the name, position and the like of each participant is displayed in the image.
When the conference display mode is a normal mode or a panoramic mode, the conference whole picture can be spliced by an advanced 360-degree image splicing technology to the ultra-high definition images acquired by different cameras, so that a panoramic undistorted output picture is realized, and the single picture of each participant in the conference place can be extracted by the face recognition technology and displayed in the panoramic output picture.
When the conference display mode is a normal mode or a picture rotation mode, an advanced character recognition technology can be adopted to control the projection device to display each single picture of the participants according to a first preset frequency, wherein the first preset frequency can be set according to actual conditions, and the embodiment of the invention is not limited to the first preset frequency.
When the conference display mode is a sound source positioning mode, the sound source position information of the speaking participants can be determined according to the sound source positioning technology in the microphone array, then the single picture of the speaking participants is automatically captured through the camera, the single picture of the speaking participants is displayed through the projection device, and the picture is always displayed at the center position of the output image in a proper size.
Further, fig. 5 is a schematic diagram of an intelligent following gesture according to an embodiment of the present invention, referring to fig. 5, if a determined conference display mode is a sound source positioning mode, a single picture of a participant speaking is displayed by a display device according to position information of a sound source, and then the method includes:
continuously acquiring gesture actions of participants;
if the acquired gesture is an intelligent following gesture; the output picture is subjected to intelligent following processing, and the output picture after the intelligent following processing is transmitted to a projection device for display; the intelligent following processing is that the participants who will speak always serve as the center of the picture.
Specifically, the gesture actions of the participants are continuously acquired, when the acquired gesture actions of the participants are the gestures shown in fig. 5, namely, the thumb and the index finger are meshed, and the rest three fingers are open, the gesture actions of the acquired participants are determined to be intelligent following gestures, at this time, the camera locks the participants who make the intelligent following gestures through the face recognition technology and the figure following technology, the participants move at will in a meeting place, the pictures of the participants can be always in the center of an output image, and the participants move leftwards, and as an example, the camera automatically follows the images of the participants leftwards, and if the participants move in other directions, the camera also automatically follows the images; if the participants walk far away from the camera, the camera can automatically enlarge the picture of the participants, so that the images of the participants are always displayed in the center position of the output picture in the most suitable size; if the participant walks close to the camera, the picture thereof will be appropriately reduced, in any case ensuring that the picture thereof is always displayed in the proper size at the center of the output image.
General intelligence follow gesture can use simultaneously with sound source localization mode, ensures that the contestant who will speak always is as the picture center, and the example, when the contestant talks, accessible sound source localization will speak the contestant always be in the center of output image, when the contestant does not speak, accessible intelligence follows the contestant who will speak always be in the center of output image.
Further, fig. 6 is a schematic diagram of a gesture action for exiting intelligent follow-up, and referring to fig. 6, the intelligent follow-up processing is performed on an output picture, and then the method further includes:
continuously acquiring gesture actions of participants;
and if the acquired gesture is the exiting gesture, controlling to cancel intelligent following processing.
Specifically, the gesture action of the participating person is continuously acquired, when the acquired gesture action of the participating person is the gesture shown in fig. 6, namely, the gesture of all the palms is opened, the acquired gesture action of the participating person is determined to be the exiting gesture action, and at this time, the camera will not continue to follow the participating person who makes the intelligent following gesture.
Further, if the determined conference display mode is a sound source positioning mode, controlling the projection device to display a single picture of a speaking participant according to the position information of the sound source, and then comprising:
and in the process of executing the sound source positioning mode, if the sound signal cannot be detected within the first preset time after the sound signal is stopped, displaying a panoramic mode picture.
The first preset duration is set according to specific situations, which is not limited in the embodiment of the present invention.
Specifically, in the process of executing the sound source positioning mode, if the sound signal cannot be detected within the first preset time after the sound signal is stopped, it is indicated that the talking of the participants is stopped, at this time, the single picture of the participants who are speaking can be canceled without any gesture action, and the picture is converted into the panoramic mode picture.
Further, acquiring gesture actions of the participants, the method further includes:
recording various gesture actions required to be used, modeling the recorded gesture actions, and storing the gesture actions after the modeling into the gesture library.
The gesture actions to be recorded include mode switching gesture, gesture amplifying gesture action, gesture shrinking action, intelligent following gesture, gesture exiting action and the like, which are not limited in the embodiment of the invention, and different gesture actions can be added according to actual requirements.
Specifically, a camera records gesture motion videos to be used, the recorded gesture motion videos are split into images of one frame and one frame, modeling processing is carried out on the split images of one frame and one frame, and gesture motions after modeling processing are stored in the gesture library.
Further, recording the gesture required to be used, and then includes:
recording various gesture actions under different environments, and optimizing the recorded various gesture actions.
Specifically, recording gesture actions can be performed in as many use scenes as possible, each gesture action of the training personnel is recorded under different environments by using a camera, and after each gesture action of the training personnel is recorded under strong light conditions, each gesture action of the training personnel is recorded under weak light conditions, recorded gesture action videos are compared frame by frame, recorded gesture actions are optimized, and recognition accuracy of gesture actions is further improved.
An embodiment of the present invention provides a picture switching device, fig. 7 is a schematic structural diagram of a picture switching device provided according to an embodiment of the present invention, and referring to fig. 7, a picture switching device 200 includes:
the gesture motion obtaining module 210 is configured to obtain a gesture motion of a participant;
the gesture type obtaining module 220 is configured to compare the gesture action with gesture actions in the gesture library, and determine a gesture type of the gesture action;
the conference display mode determining module 230 is configured to determine a conference display mode according to the repetition number of the mode switching gesture when the gesture is the mode switching gesture, and adjust a display screen of the projection device according to the conference display mode; the conference display mode comprises a normal mode, a panoramic mode, a picture rotation mode and a sound source positioning mode;
the gesture-enlarging action determining module 240 is configured to enlarge the current output picture when the gesture action is a gesture-enlarging action, and transmit the enlarged current output picture to the projection device for display;
the zoom-out gesture determining module 250 is configured to, when the current gesture is a zoom-out gesture, zoom out the current output frame, and transmit the zoomed-out current output frame to the projection device for display.
Further, the conference presentation mode determining module 230 includes:
the normal mode display unit is used for controlling the projection device to display a display picture combining the whole conference picture and the single picture of the participants when the determined conference display mode is the normal mode;
the panoramic mode display picture is used for controlling the projection device to display the whole conference picture when the determined conference display mode is the panoramic mode;
the display picture of the rotation mode is used for controlling the projection device to display each single picture of the participants according to the first preset frequency when the determined conference display mode is the picture rotation mode;
and the sound source positioning display picture is used for controlling the projection device to display the single picture of the speaking participants according to the position information of the sound source when the determined conference display mode is the sound source positioning mode.
Further, the follow gesture determination module includes:
the following gesture acquisition unit is used for continuously acquiring gesture actions of the participants after the determined conference display mode is a sound source positioning mode and the projection device is controlled to display single pictures of the participants speaking according to the position information of the sound source;
the intelligent following processing unit is used for carrying out intelligent following processing on the output picture when the acquired gesture action is the intelligent following gesture, and transmitting the output picture after the intelligent following processing to the projection device for display; the intelligent following processing is that the participants who will speak always serve as the center of the picture.
Further, the exit follow determination module includes:
the exit gesture acquisition unit is used for continuously acquiring gesture actions of the participants after the output picture is subjected to intelligent following processing;
and the intelligent follow-up exit unit is used for controlling to cancel intelligent follow-up processing when the acquired gesture acts as the exit gesture acts.
Further, the panorama picture reproduction module includes:
and after the determined conference display mode is the sound source positioning mode and the single picture of the participants who display the speech is controlled by the projection device according to the position information of the sound source, if the sound signal cannot be detected within the first preset time after the sound signal is stopped in the process of executing the sound source positioning mode, displaying a panoramic mode picture.
Further, the modeling processing module includes:
before the gesture actions of the participants are acquired, recording various gesture actions required to be used, modeling the recorded gesture actions, and storing the gesture actions after modeling into a gesture library.
Further, the optimization processing module includes:
after recording the gesture actions required to be used, recording various gesture actions in different environments, and optimizing the recorded various gesture actions.
The picture switching device provided by the embodiment of the invention can execute the picture switching method provided by any embodiment of the invention, and has the corresponding functional modules and beneficial effects of the execution method.
The embodiment of the invention provides intelligent conference equipment, which comprises the picture switching device in any of the embodiments; the intelligent conference device further comprises at least one camera.
At least one can be understood as one or more than one ultra-high definition and optical zoom cameras can be configured, so that the whole conference room is ensured to be completely covered without dead angles, and the output picture is clear.
Specifically, the audio and video generated by the conference can be uploaded to the cloud through a network to process video pictures and audio contents, and the background of the conference scene can be dynamically replaced in real time or virtual objects are added, so that the conference scene in the video is richer, and the cost of conference place arrangement is greatly reduced; the simultaneous frequency content can be synchronously displayed on the screen in real time, and for example, any language can be displayed on the video picture according to the requirement, so that participants of different native languages can communicate easily, and a large amount of translation cost can be saved. After the conference is finished, conference contents can be archived and sorted according to requirements, and files such as characters, audio, pictures and videos are output, so that conference records become easier, and a large amount of sorting work is saved.
According to the embodiment of the invention, the picture switching method designed by the embodiment of the invention is arranged in the intelligent conference equipment, and the gesture actions of the participants are obtained; comparing the gesture action with the gesture actions in the gesture library, and determining the gesture type of the gesture action; when the gesture is a mode switching gesture, determining a conference display mode according to the repetition times of the mode switching gesture, and adjusting a display picture of the projection device according to the conference display mode; when the gesture is an amplifying gesture, amplifying the current output picture, and transmitting the amplified current output picture to a projection device for display; when the gesture is a shrinking gesture, the current output picture is subjected to shrinking processing, the current output picture after the shrinking processing is transmitted to the projection device for display, and the meeting participants can realize the switching of different pictures according to different gesture, so that the diversity of picture switching is realized, and the operation is simple.
It should be appreciated that various forms of the flows shown above may be used to reorder, add, or delete steps. For example, the steps described in the present invention may be performed in parallel, sequentially, or in a different order, so long as the desired results of the technical solution of the present invention are achieved, and the present invention is not limited herein.
The above embodiments do not limit the scope of the present invention. It will be apparent to those skilled in the art that various modifications, combinations, sub-combinations and alternatives are possible, depending on design requirements and other factors. Any modifications, equivalent substitutions and improvements made within the spirit and principles of the present invention should be included in the scope of the present invention.

Claims (10)

1. A picture switching method, comprising:
acquiring gesture actions of participants;
comparing the gesture action with gesture actions in a gesture library, and determining the gesture type of the gesture action;
if the gesture is a mode switching gesture, determining a conference display mode according to the repetition times of the mode switching gesture, and adjusting a display picture of the projection device according to the conference display mode; the conference display mode comprises a normal mode, a panoramic mode, a picture rotation mode and a sound source positioning mode;
if the gesture is an amplifying gesture, amplifying the current output picture, and transmitting the amplified current output picture to a showing device for showing;
and if the gesture is a shrinking gesture, performing shrinking processing on the current output picture, and transmitting the current output picture after the shrinking processing to a showing device for showing.
2. The method according to claim 1, wherein if the current gesture is a mode switch gesture, determining a conference presentation mode according to the number of collected mode switch gestures includes:
if the determined conference display mode is the normal mode, controlling the projection device to display a display picture combining a conference whole picture and a conference participant single picture;
if the determined conference display mode is the panoramic mode, controlling the projection device to display a conference whole picture;
if the determined conference display mode is the picture rotation mode, controlling the projection device to display each single picture of the participants according to a first preset frequency;
and if the determined conference display mode is the sound source positioning mode, controlling the projection device to display the single picture of the speaking participants according to the position information of the sound source.
3. The screen switching method according to claim 2, wherein if the determined conference presentation mode is the sound source localization mode, controlling the projection device to present a single screen of a speaking participant according to the position information of the sound source, and then comprising:
continuously acquiring gesture actions of participants;
if the acquired gesture is an intelligent following gesture; the output picture is subjected to intelligent following processing, and the output picture after the intelligent following processing is transmitted to a projection device for display; the intelligent following processing is that the participants who will speak always serve as the center of the picture.
4. A picture switching method as claimed in claim 3, characterized in that the output picture is subjected to an intelligent follow-up process, and further comprising:
continuously acquiring gesture actions of participants;
and if the acquired gesture is the exiting gesture, controlling to cancel intelligent following processing.
5. The screen switching method according to claim 2, wherein if the determined conference presentation mode is the sound source localization mode, controlling the projection device to present a single screen of a speaking participant according to the position information of the sound source, and then comprising:
and in the process of executing the sound source positioning mode, if the sound signal cannot be detected within the first preset time after the sound signal is stopped, displaying the panoramic mode picture.
6. The screen switching method according to claim 1, wherein acquiring the gesture of the participant further comprises:
recording various gesture actions required to be used, modeling the recorded gesture actions, and storing the gesture actions after the modeling into the gesture library.
7. The method for switching pictures according to claim 6, wherein recording the gesture required to be used comprises:
recording various gesture actions under different environments, and optimizing the recorded various gesture actions.
8. A picture switching apparatus, comprising:
the gesture action acquisition module is used for acquiring gesture actions of participants;
the gesture type acquisition module is used for comparing the gesture action with gesture actions in the gesture library and determining the gesture type of the gesture action;
the conference display mode determining module is used for determining a conference display mode according to the repetition times of the mode switching gesture when the gesture acts as the mode switching gesture, and adjusting a display picture of the projection device according to the conference display mode; the conference display mode comprises a normal mode, a panoramic mode, a picture rotation mode and a sound source positioning mode;
the gesture amplifying action determining module is used for amplifying the current output picture when the gesture action is the gesture amplifying action, and transmitting the amplified current output picture to the showing device for showing;
and the shrinking gesture motion determining module is used for carrying out shrinking processing on the current output picture when the current gesture motion is the shrinking gesture motion, and transmitting the current output picture after the shrinking processing to the showing device for showing.
9. The screen switching apparatus according to claim 8, wherein the conference presentation mode determination module includes:
the normal mode display unit is used for controlling the projection device to display a display picture combining a conference integral picture and a conference participant single picture when the determined conference display mode is the normal mode;
the panoramic mode display picture is used for controlling the projection device to display a conference integral picture when the determined conference display mode is the panoramic mode;
a rotation mode display picture for controlling the projection device to display each single picture of the participants according to a first preset frequency when the determined conference display mode is the picture rotation mode;
and the sound source positioning display picture is used for controlling the projection device to display the single picture of the participants speaking according to the position information of the sound source when the determined conference display mode is the sound source positioning mode.
10. An intelligent conference device, characterized by comprising the picture switching device according to any one of claims 8-9; the intelligent conference device further comprises at least one camera.
CN202311051650.4A 2023-08-18 2023-08-18 Picture switching method, switching device and intelligent conference equipment Pending CN117055732A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202311051650.4A CN117055732A (en) 2023-08-18 2023-08-18 Picture switching method, switching device and intelligent conference equipment

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202311051650.4A CN117055732A (en) 2023-08-18 2023-08-18 Picture switching method, switching device and intelligent conference equipment

Publications (1)

Publication Number Publication Date
CN117055732A true CN117055732A (en) 2023-11-14

Family

ID=88654985

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202311051650.4A Pending CN117055732A (en) 2023-08-18 2023-08-18 Picture switching method, switching device and intelligent conference equipment

Country Status (1)

Country Link
CN (1) CN117055732A (en)

Similar Documents

Publication Publication Date Title
US11403509B2 (en) Systems and methods for providing feedback for artificial intelligence-based image capture devices
CN112287844B (en) Student situation analysis method and device, electronic device and storage medium
CN106791485B (en) Video switching method and device
EP3905203B1 (en) Method and apparatus for processing video, and storage medium
CN112199016B (en) Image processing method, image processing device, electronic equipment and computer readable storage medium
JP2006197505A (en) Camera controller, camera system, electronic conference system and camera control method
CN112887609B (en) Shooting method and device, electronic equipment and storage medium
US11076127B1 (en) System and method for automatically framing conversations in a meeting or a video conference
US11310443B2 (en) Video processing method, apparatus and storage medium
US11848031B2 (en) System and method for performing a rewind operation with a mobile image capture device
CN110502117A (en) Screenshot method and electric terminal in electric terminal
CN115733943A (en) Recording and broadcasting interaction system and method based on multi-camera automatic tracking linkage
US7986336B2 (en) Image capture apparatus with indicator
CN117055732A (en) Picture switching method, switching device and intelligent conference equipment
CN113411532B (en) Method, device, terminal and storage medium for recording content
CN111614928B (en) Positioning method, terminal device and conference system
CN114245018A (en) Image shooting method and device
CN114078280A (en) Motion capture method, motion capture device, electronic device and storage medium
WO2024062971A1 (en) Information processing device, information processing method, and information processing program
US20230394614A1 (en) Image collection method and apparatus, terminal, and storage medium
US10939070B1 (en) Systems and methods for generating video images in a centered view mode
CN115086611A (en) Lightweight video supervision method, system and equipment
CN116847187A (en) Shooting method, shooting device, electronic equipment and storage medium
CN117336601A (en) Display method, display device and electronic equipment
CN117202081A (en) Audio processing method and electronic equipment

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination