CN112188145A - Video conference method and system, and computer readable storage medium - Google Patents
Video conference method and system, and computer readable storage medium Download PDFInfo
- Publication number
- CN112188145A CN112188145A CN202010988005.5A CN202010988005A CN112188145A CN 112188145 A CN112188145 A CN 112188145A CN 202010988005 A CN202010988005 A CN 202010988005A CN 112188145 A CN112188145 A CN 112188145A
- Authority
- CN
- China
- Prior art keywords
- speaker
- face
- video
- voice recognition
- module
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000000034 method Methods 0.000 title claims abstract description 41
- 238000012544 monitoring process Methods 0.000 claims abstract description 25
- 238000004088 simulation Methods 0.000 claims abstract description 24
- 230000000694 effects Effects 0.000 abstract description 4
- 238000010586 diagram Methods 0.000 description 10
- 238000004590 computer program Methods 0.000 description 7
- 230000006870 function Effects 0.000 description 7
- 238000012545 processing Methods 0.000 description 4
- 238000005516 engineering process Methods 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000004891 communication Methods 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000007717 exclusion Effects 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 230000008447 perception Effects 0.000 description 1
- 230000035945 sensitivity Effects 0.000 description 1
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N7/00—Television systems
- H04N7/14—Systems for two-way working
- H04N7/15—Conference systems
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V40/00—Recognition of biometric, human-related or animal-related patterns in image or video data
- G06V40/10—Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
- G06V40/16—Human faces, e.g. facial parts, sketches or expressions
- G06V40/168—Feature extraction; Face representation
- G06V40/171—Local features and components; Facial parts ; Occluding parts, e.g. glasses; Geometrical relationships
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
Landscapes
- Engineering & Computer Science (AREA)
- Health & Medical Sciences (AREA)
- Multimedia (AREA)
- Oral & Maxillofacial Surgery (AREA)
- General Health & Medical Sciences (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Computer Vision & Pattern Recognition (AREA)
- General Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Signal Processing (AREA)
- Computational Linguistics (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Acoustics & Sound (AREA)
- Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
- Telephonic Communication Services (AREA)
Abstract
The invention discloses a video conference method and a system thereof, and a computer readable storage medium, wherein the video conference method comprises the following steps: monitoring the face state of a speaker in a video conference; if the face state is monitored to meet a first preset condition, replacing the real face of the speaker in the video picture with a prestored face model of the speaker; performing voice recognition on the speaker; and adding lip simulation actions in the face model of the speaker in the video picture according to the voice recognition condition. The video conference method and the video conference system can improve the video conference effect and improve the video conference efficiency.
Description
Technical Field
The present invention relates to the field of video communication technologies, and in particular, to a video conference method and system, and a computer-readable storage medium.
Background
With the development of internet technology, video conferences are more and more widely applied.
The inventor finds that in the process of implementing the invention, a large number of participants may occur in the video conference process, the time of all the participants needs to be gathered in the conference process, and when a speaker has a poor state in a video picture or frequently enters and exits the picture, the conference feeling of other people is affected, and the conference efficiency is reduced.
The information disclosed in this background section is only for enhancement of understanding of the general background of the invention and should not be taken as an acknowledgement or any form of suggestion that this information forms the prior art already known to a person skilled in the art.
Disclosure of Invention
The invention aims to provide a video conference method and a video conference system, which can improve the video conference effect and improve the video conference efficiency.
In order to achieve the above object, the present invention provides a video conference method, including: monitoring the face state of a speaker in a video conference; if the face state is monitored to meet a first preset condition, replacing the real face of the speaker in the video picture with a prestored face model of the speaker; performing voice recognition on the speaker; and adding lip simulation actions in the face model of the speaker in the video picture according to the voice recognition condition.
In an embodiment of the present invention, the monitoring the face state of the speaker in the video conference includes: and acquiring the face information of the speaker in the video stream in real time.
In an embodiment of the present invention, the first preset condition includes that the eye-closing time of the speaker exceeds a first preset threshold, the number of times that the face of the speaker enters or exits the video frame within a preset time exceeds a second preset threshold, or the duration of the state that the face of the speaker is not fully displayed in the video frame exceeds a third preset threshold.
In an embodiment of the present invention, the video conference method further includes: replacing the real face of the speaker in the video picture with a pre-stored face model of the speaker, and then continuing to monitor the face state of the speaker; and if the face state of the speaker is monitored to meet a second preset condition, switching the face model of the speaker in the video picture back to the real face of the speaker.
In an embodiment of the present invention, the second preset condition is that a state duration time of a complete display of the face of the speaker in the video image exceeds a fourth preset threshold and an eye closing time of the speaker does not exceed the first preset threshold.
Based on the same inventive concept, the invention also provides a video conference method, which comprises the following steps: when a first face switching request is received, replacing the real face of the speaker in a video picture with a prestored face model of the speaker; performing voice recognition on the speaker; and adding lip simulation actions in the face model of the speaker in the video picture according to the voice recognition condition.
In an embodiment of the present invention, the video conference method further includes: and when a second face switching request is received, switching the face model of the speaker in the video picture back to the real face of the speaker.
Based on the same inventive concept, the invention also provides a video conference system, which comprises: the face switching device comprises a face state monitoring module, a first face switching module, a voice recognition module and a lip simulation module. The face state monitoring module is used for monitoring the face state of a speaker in the video conference. The first face switching module is coupled with the face state monitoring module and used for replacing a real face of a speaker in a video picture with a prestored face model of the speaker if the face state monitoring module monitors that the face state meets a first preset condition. The voice recognition module is coupled with the first face switching module and is used for performing voice recognition on a speaker after the first face switching module replaces the real face of the speaker in the video picture with a pre-stored face model of the speaker. And the lip simulation module is coupled with the voice recognition module and is used for adding lip simulation actions in a face model of a speaker in a video picture according to voice recognition conditions.
In an embodiment of the present invention, the face state monitoring module monitors the face state of a speaker by acquiring face information of the speaker in a video stream in real time.
In an embodiment of the present invention, the first preset condition includes that the eye-closing time of the speaker exceeds a first preset threshold, the number of times that the face of the speaker enters or exits the video frame within a preset time exceeds a second preset threshold, or the duration of the state that the face of the speaker is not fully displayed in the video frame exceeds a third preset threshold.
In an embodiment of the present invention, the video conference system further includes: and the second face switching module is coupled with the face state monitoring module and used for switching the face model of the speaker in the video picture back to the real face of the speaker if the face state monitoring module monitors that the face state of the speaker meets a second preset condition. The second preset condition is that the state duration time of the complete display of the face of the speaker in the video picture exceeds a fourth preset threshold value and the eye closing time of the speaker does not exceed the first preset threshold value.
Based on the same inventive concept, the invention also provides a video conference system, which comprises: the third face switches module, speech recognition module, lip simulation module. And the third face switching module is used for replacing the real face of the speaker in the video picture with a prestored face model of the speaker when receiving the first face switching request. The voice recognition module is coupled with the third face switching module and is used for performing voice recognition on the speaker after the third face switching module replaces the real face of the speaker in the video picture with the pre-stored face model of the speaker. And the lip simulation module is coupled with the voice recognition module and is used for adding lip simulation actions in the face model of the speaker in the video picture according to the voice recognition condition.
In an embodiment of the present invention, the video conference system further includes: and the fourth face switching module. And the fourth face switching module is used for switching the face model of the speaker in the video picture back to the real face of the speaker when receiving the second face switching request.
Based on the same inventive concept, the present invention also provides a computer-readable storage medium for executing the video conference method according to any one of the above embodiments.
Compared with the prior art, according to the video conference method and system and the computer readable storage medium, the function of switching the real human face and the human face model is designed, when the speaker is in a poor state or needs to leave the camera temporarily due to a busy state, the real human face can be replaced by the human face model in an automatic or manual mode, and the lip action is restored through voice recognition, so that the sense of reality of the human face model in a video picture can be kept, the video conference effect and the video conference efficiency are improved, and the feeling of participants is improved.
Drawings
Fig. 1 is a block diagram of the steps of a video conferencing method according to an embodiment of the present invention.
Fig. 2 is a block diagram of a video conferencing system according to an embodiment of the present invention.
Detailed Description
The following detailed description of the present invention is provided in conjunction with the accompanying drawings, but it should be understood that the scope of the present invention is not limited to the specific embodiments.
Throughout the specification and claims, unless explicitly stated otherwise, the word "comprise", or variations such as "comprises" or "comprising", will be understood to imply the inclusion of a stated element or component but not the exclusion of any other element or component.
In order to overcome the problems in the prior art, the following embodiments of the video conference method and system and the computer-readable storage medium design a function of switching between a real face and a face model, and when a speaker is in a poor state or needs to leave a camera temporarily due to a busy state, the real face can be replaced by the face model in an automatic or manual manner.
Fig. 1 is a video conferencing method according to an embodiment of the present invention. Through this embodiment, can the automatic identification speaker's state, when the speaker state is not good or when busy, can switch into the face model with the real face in the video picture automatically, improve participant's perception, improve meeting efficiency.
The video conference method includes the following steps.
The face state of the speaker in the video conference is monitored in step S1. Specifically, the face state can be acquired by acquiring the face information of the speaker in the video stream in real time.
The real face is replaced with a face model in step S2: if the face state is monitored to meet a first preset condition, replacing the real face of the speaker in the video picture with a pre-stored face model of the speaker. The first preset condition comprises that the eye closing time of a speaker exceeds a first preset threshold, the frequency of the face of the speaker entering and exiting the video picture within the preset time exceeds a second preset threshold or the state duration time of the face of the speaker displaying incompletely in the video picture exceeds a third preset threshold. If the eye closing time of the speaker exceeds 5s, the frequency of frequently entering and exiting the video picture by the speaker within 5s exceeds 3 times, or the face of the speaker only displays 80% or the head roll angle is 45%, and the duration is 10s, the speaker can be determined to be in a poor or busy state, and the face switching can be performed at the moment.
Speech recognition is performed on the speaker in step S3.
A lip simulation action is added in step S4: and adding lip simulation actions in the face model of the speaker in the video picture according to the voice recognition condition.
In order to switch back to the real face when the speaker is in a good state, in a preferred embodiment, the video conference method further includes: replacing the real face of the speaker in the video picture with a pre-stored face model of the speaker, and then continuing to monitor the face state of the speaker; and if the face state of the speaker is monitored to meet a second preset condition, switching the face model of the speaker in the video picture back to the real face of the speaker. The second preset condition is that the state duration time of the face of the speaker completely displayed in the video picture exceeds a fourth preset threshold value and the eye closing time of the speaker does not exceed the first preset threshold value. For example, the whole face of the speaker is displayed for 10 seconds or more, and the eye-closed state exceeding 5 seconds does not occur.
Based on the same inventive concept, the invention also provides another video conference method, and the video conference method of one embodiment comprises the following steps: when a first face switching request is received, replacing the real face of the speaker in a video picture with a prestored face model of the speaker; performing voice recognition on the speaker; and adding lip simulation actions in the face model of the speaker in the video picture according to the voice recognition condition. Through the implementation mode, when a speaker is busy, the real face in the video picture can be manually controlled to be switched into the face model, so that the sensitivity of participants is improved, and the conference efficiency is improved.
In order to be able to manually switch the face model back to a real face, in a preferred embodiment, the video conference method further comprises: and when a second face switching request is received, switching the face model of the speaker in the video picture back to the real face of the speaker.
Based on the same inventive concept, the invention also provides a video conference system. As shown in fig. 2, a video conference system of an embodiment includes: the system comprises a face state monitoring module 10, a first face switching module 11, a voice recognition module 12 and a lip simulation module 13.
The face state monitoring module 10 is used for monitoring the face state of a speaker in a video conference. Specifically, the face state monitoring module 10 may monitor the face state of the speaker by acquiring the face information of the speaker in the video stream in real time.
The first face switching module 11 is coupled to the face state monitoring module 10, and configured to replace, by the first face switching module 11, a real face of a speaker in the video picture with a pre-stored face model of the speaker if the face state monitoring module 10 monitors that the face state meets a first preset condition. The first preset condition comprises one or more of the condition that the eye closing time of the speaker exceeds a first preset threshold, the frequency of the face of the speaker entering and exiting the video picture within the preset time exceeds a second preset threshold, and the duration of the state that the face of the speaker is not completely displayed in the video picture exceeds a third preset threshold.
The voice recognition module 12 is coupled to the first face switching module 11, and configured to perform voice recognition on a speaker after the first face switching module 11 replaces a real face of the speaker in the video picture with a pre-stored face model of the speaker.
The lip simulation module 13 is coupled to the voice recognition module 12, and is configured to add a lip simulation action to the face model of the speaker in the video frame according to the voice recognition condition.
Preferably, the video conference system of this embodiment further includes: and the second face switching module 14 is coupled to the face state monitoring module 10, and configured to switch the face model of the speaker in the video picture back to the real face of the speaker if the face state monitoring module 10 monitors that the face state of the speaker meets a second preset condition. The second preset condition is that the state duration time of the face of the speaker completely displayed in the video picture exceeds a fourth preset threshold value and the eye closing time of the speaker does not exceed the first preset threshold value.
Based on the same inventive concept, the invention also provides another video conference system. The video conference system of an embodiment includes: the third face switches module, speech recognition module, lip simulation module.
And the third face switching module is used for replacing the real face of the speaker in the video picture with a prestored face model of the speaker when receiving the first face switching request. The voice recognition module is coupled with the third face switching module and is used for performing voice recognition on the speaker after the third face switching module replaces the real face of the speaker in the video picture with the pre-stored face model of the speaker. And the lip simulation module is coupled with the voice recognition module and is used for adding a lip simulation action in the face model of the speaker in the video picture according to the voice recognition condition.
Preferably, the video conference system of this embodiment further includes: and the fourth face switching module. And the fourth face switching module is used for switching the face model of the speaker in the video picture back to the real face of the speaker when receiving the second face switching request.
Based on the same inventive concept, the present invention also provides a computer-readable storage medium for executing the video conference method of any one of the above embodiments.
In summary, according to the video conference method and system and the computer-readable storage medium of the embodiments, a function of switching between the real face and the face model is designed, when the speaker is in a poor state or needs to leave the camera temporarily due to a busy state, the real face can be replaced by the face model in an automatic or manual manner, and the lip motion is restored through voice recognition, so that the sense of reality of the face model in the video picture can be maintained, the video conference effect and the video conference efficiency are improved, and the experience of participants is improved. And only manage for the speaker, the resource that needs is few, and easy to deploy.
As will be appreciated by one skilled in the art, embodiments of the present application may be provided as a method, system, or computer program product. Accordingly, the present application may take the form of an entirely hardware embodiment, an entirely software embodiment or an embodiment combining software and hardware aspects. Furthermore, the present application may take the form of a computer program product embodied on one or more computer-usable storage media (including, but not limited to, disk storage, CD-ROM, optical storage, and the like) having computer-usable program code embodied therein.
The present application is described with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems), and computer program products according to embodiments of the application. It will be understood that each flow and/or block of the flow diagrams and/or block diagrams, and combinations of flows and/or blocks in the flow diagrams and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, embedded processor, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be stored in a computer-readable memory that can direct a computer or other programmable data processing apparatus to function in a particular manner, such that the instructions stored in the computer-readable memory produce an article of manufacture including instruction means which implement the function specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be loaded onto a computer or other programmable data processing apparatus to cause a series of operational steps to be performed on the computer or other programmable apparatus to produce a computer implemented process such that the instructions which execute on the computer or other programmable apparatus provide steps for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
The foregoing descriptions of specific exemplary embodiments of the present invention have been presented for purposes of illustration and description. It is not intended to limit the invention to the precise form disclosed, and obviously many modifications and variations are possible in light of the above teaching. The exemplary embodiments were chosen and described in order to explain certain principles of the invention and its practical application to enable one skilled in the art to make and use various exemplary embodiments of the invention and various alternatives and modifications as are suited to the particular use contemplated. It is intended that the scope of the invention be defined by the claims and their equivalents.
Claims (10)
1. A video conferencing method, the video conferencing method comprising:
monitoring the face state of a speaker in a video conference;
if the face state is monitored to meet a first preset condition, replacing the real face of the speaker in the video picture with a prestored face model of the speaker;
performing voice recognition on the speaker; and
and adding lip simulation actions in the face model of the speaker in the video picture according to the voice recognition condition.
2. The video conferencing method of claim 1, wherein the monitoring of the face state of the speaker in the video conference comprises:
and acquiring the face information of the speaker in the video stream in real time.
3. The video conference method according to claim 1, wherein the first preset condition includes that the eye-closing time of the speaker exceeds a first preset threshold, the number of times the face of the speaker enters or exits the video screen within a preset time exceeds a second preset threshold, or the duration of the state in which the face of the speaker is not fully displayed in the video screen exceeds a third preset threshold.
4. The video conferencing method of claim 1, wherein the video conferencing method further comprises:
replacing the real face of the speaker in the video picture with a pre-stored face model of the speaker, and then continuing to monitor the face state of the speaker;
and if the face state of the speaker is monitored to meet a second preset condition, switching the face model of the speaker in the video picture back to the real face of the speaker.
5. The video conference method according to claim 4, wherein the second preset condition is that a duration of a state in which the face of the speaker is fully displayed in the video screen exceeds a fourth preset threshold and an eye-closing time of the speaker does not exceed the first preset threshold.
6. A video conferencing method, the video conferencing method comprising:
when a first face switching request is received, replacing a real face of a speaker in a video picture with a prestored face model of the speaker;
performing voice recognition on the speaker; and
and adding lip simulation actions in the face model of the speaker in the video picture according to the voice recognition condition.
7. The video conferencing method of claim 6, wherein the video conferencing method further comprises:
and when a second face switching request is received, switching the face model of the speaker in the video picture back to the real face of the speaker.
8. A video conferencing system, the video conferencing system comprising:
the face state monitoring module is used for monitoring the face state of a speaker in the video conference;
the first face switching module is coupled with the face state monitoring module and used for replacing the real face of a speaker in a video picture with a prestored face model of the speaker if the face state monitoring module monitors that the face state meets a first preset condition;
the voice recognition module is coupled with the first face switching module and used for performing voice recognition on a speaker after the first face switching module replaces the real face of the speaker in a video picture with a pre-stored face model of the speaker; and
and the lip simulation module is coupled with the voice recognition module and is used for adding a lip simulation action in a face model of a speaker in a video picture according to the voice recognition condition.
9. A video conferencing system, the video conferencing system comprising:
the third face switching module is used for replacing the real face of the speaker in the video picture with a prestored face model of the speaker when receiving the first face switching request;
the voice recognition module is coupled with the third face switching module and used for performing voice recognition on a speaker after the third face switching module replaces the real face of the speaker in the video picture with a prestored face model of the speaker; and
and the lip simulation module is coupled with the voice recognition module and is used for adding a lip simulation action in the face model of the speaker in the video picture according to the voice recognition condition.
10. A computer-readable storage medium for performing the video conferencing method of any of claims 1-7.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010988005.5A CN112188145A (en) | 2020-09-18 | 2020-09-18 | Video conference method and system, and computer readable storage medium |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010988005.5A CN112188145A (en) | 2020-09-18 | 2020-09-18 | Video conference method and system, and computer readable storage medium |
Publications (1)
Publication Number | Publication Date |
---|---|
CN112188145A true CN112188145A (en) | 2021-01-05 |
Family
ID=73956446
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202010988005.5A Pending CN112188145A (en) | 2020-09-18 | 2020-09-18 | Video conference method and system, and computer readable storage medium |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN112188145A (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114727043A (en) * | 2022-03-07 | 2022-07-08 | 国网山东省电力公司信息通信公司 | Control method and system for automatic meeting place lens switching |
Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103218842A (en) * | 2013-03-12 | 2013-07-24 | 西南交通大学 | Voice synchronous-drive three-dimensional face mouth shape and face posture animation method |
CN110136229A (en) * | 2019-05-27 | 2019-08-16 | 广州亮风台信息科技有限公司 | A kind of method and apparatus changed face for real-time virtual |
CN110267079A (en) * | 2018-03-30 | 2019-09-20 | 腾讯科技(深圳)有限公司 | The replacement method and device of face in video to be played |
CN110599359A (en) * | 2019-09-05 | 2019-12-20 | 深圳追一科技有限公司 | Social contact method, device, system, terminal equipment and storage medium |
CN110674706A (en) * | 2019-09-05 | 2020-01-10 | 深圳追一科技有限公司 | Social contact method and device, electronic equipment and storage medium |
CN110719415A (en) * | 2019-09-30 | 2020-01-21 | 深圳市商汤科技有限公司 | Video image processing method and device, electronic equipment and computer readable medium |
CN110808048A (en) * | 2019-11-13 | 2020-02-18 | 联想(北京)有限公司 | Voice processing method, device, system and storage medium |
CN111353336A (en) * | 2018-12-21 | 2020-06-30 | 华为技术有限公司 | Image processing method, device and equipment |
-
2020
- 2020-09-18 CN CN202010988005.5A patent/CN112188145A/en active Pending
Patent Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103218842A (en) * | 2013-03-12 | 2013-07-24 | 西南交通大学 | Voice synchronous-drive three-dimensional face mouth shape and face posture animation method |
CN110267079A (en) * | 2018-03-30 | 2019-09-20 | 腾讯科技(深圳)有限公司 | The replacement method and device of face in video to be played |
CN111353336A (en) * | 2018-12-21 | 2020-06-30 | 华为技术有限公司 | Image processing method, device and equipment |
CN110136229A (en) * | 2019-05-27 | 2019-08-16 | 广州亮风台信息科技有限公司 | A kind of method and apparatus changed face for real-time virtual |
CN110599359A (en) * | 2019-09-05 | 2019-12-20 | 深圳追一科技有限公司 | Social contact method, device, system, terminal equipment and storage medium |
CN110674706A (en) * | 2019-09-05 | 2020-01-10 | 深圳追一科技有限公司 | Social contact method and device, electronic equipment and storage medium |
CN110719415A (en) * | 2019-09-30 | 2020-01-21 | 深圳市商汤科技有限公司 | Video image processing method and device, electronic equipment and computer readable medium |
CN110808048A (en) * | 2019-11-13 | 2020-02-18 | 联想(北京)有限公司 | Voice processing method, device, system and storage medium |
Non-Patent Citations (3)
Title |
---|
张怡暄等: "基于帧间差异的人脸篡改视频检测方法", 《信息安全学报》 * |
林爱华等: "语音驱动人脸唇形动画的实现", 《计算机工程》 * |
范懿文等: "支持表情细节的语音驱动人脸动画", 《计算机辅助设计与图形学学报》 * |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114727043A (en) * | 2022-03-07 | 2022-07-08 | 国网山东省电力公司信息通信公司 | Control method and system for automatic meeting place lens switching |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN105005430B (en) | A kind of window display method and terminal | |
EP3275181B1 (en) | Eye gaze correction | |
CN111654715B (en) | Live video processing method and device, electronic equipment and storage medium | |
US9888211B1 (en) | Replacing live video of a meeting participant with recorded video of the meeting participant during an online meeting | |
CN111107415B (en) | Picture-in-picture playing method, storage medium, electronic equipment and system for live broadcasting room | |
EP3275180B1 (en) | Eye gaze correction | |
CN108877848B (en) | Method and device for responding to user operation in virtual three-dimensional room speaking mode | |
CN111405234A (en) | Video conference information system and method with integration of cloud computing and edge computing | |
WO2016169496A1 (en) | Video conference image presentation method and device therefor | |
CN111083397A (en) | Recorded broadcast picture switching method, system, readable storage medium and equipment | |
CN112307800A (en) | Method and device for displaying electronic nameplate in video conference | |
CN109257188A (en) | Web conference prompts treating method and apparatus | |
CN109859753A (en) | Voice-activated method and device applied to digital court | |
CN112188145A (en) | Video conference method and system, and computer readable storage medium | |
CN111131757B (en) | Video conference display method, device and storage medium | |
CN113206974B (en) | Video picture switching method and system | |
WO2016176226A1 (en) | Eye gaze correction | |
CN112118414B (en) | Video session method, electronic device, and computer storage medium | |
US20230230416A1 (en) | Establishing private communication channels | |
WO2021245759A1 (en) | Voice conference device, voice conference system, and voice conference method | |
WO2016176225A1 (en) | Eye gaze correction | |
CN111414838A (en) | Attention detection method, device, system, terminal and storage medium | |
CN113596349A (en) | Conference method, system, device and storage medium for automatic linkage of speech position and video | |
Lee et al. | Influence of audio-visual attention on perceived quality of standard definition multimedia content | |
WO2023249005A1 (en) | Screen synthesis method using web conference system |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20210105 |
|
RJ01 | Rejection of invention patent application after publication |